BLASTP 2.2.23 [Feb-03-2010]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment:
Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala,
Aleksandr Morgulis, Alejandro A. Schäffer, and Yi-Kuo Yu (2005) "Protein database
searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Os03g0219400 Os03g0219400|AK100702
(605 letters)
Database: rap3
52,214 sequences; 17,035,801 total letters
Searching..................................................done
                                                       Score     E
Sequences producing significant alignments:            (bits)  Value
Os03g0219400 Glycoside hydrolase, family 20 protein      1095    0.0
Os07g0575500 Glycoside hydrolase, family 20 protein       595  e-170
Os05g0115900 Glycoside hydrolase, family 20 protein       229  3e-60
Os01g0891000 Glycoside hydrolase, family 20 protein       221  2e-57
Os05g0415700 Glycoside hydrolase, family 20 protein       214  2e-55
Os05g0390200                                              121  2e-27
>Os03g0219400 Glycoside hydrolase, family 20 protein
Length = 605
Score = 1095 bits (2831), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 535/563 (95%), Positives = 535/563 (95%)
Query: 43 AQKVQVWPKPTSISWPSAVYAPLSPSFSVRAVLSHPSLRQAVAFYTRLIRAERHAPLVPP 102
AQKVQVWPKPTSISWPSAVYAPLSPSFSVRAVLSHPSLRQAVAFYTRLIRAERHAPLVPP
Sbjct: 43 AQKVQVWPKPTSISWPSAVYAPLSPSFSVRAVLSHPSLRQAVAFYTRLIRAERHAPLVPP 102
Query: 103 ANYTLSRVPVRTLTLSVSDPDVPLGPAVDESYTLSVLPDSGSADISAATPWGAIRGLETF 162
ANYTLSRVPVRTLTLSVSDPDVPLGPAVDESYTLSVLPDSGSADISAATPWGAIRGLETF
Sbjct: 103 ANYTLSRVPVRTLTLSVSDPDVPLGPAVDESYTLSVLPDSGSADISAATPWGAIRGLETF 162
Query: 163 SQLXXXXXXXXXXXQPIVPSGIEISDRPHFTHRGILLDTARNFYPVRDILHTLRAMAFNK 222
SQL QPIVPSGIEISDRPHFTHRGILLDTARNFYPVRDILHTLRAMAFNK
Sbjct: 163 SQLAWAGGGAASGGQPIVPSGIEISDRPHFTHRGILLDTARNFYPVRDILHTLRAMAFNK 222
Query: 223 LNVFHWHITDAQSFPIVLPTVPNLANSGSYSPTMRYTENDVRHIVSFAASFGIRVIPEID 282
LNVFHWHITDAQSFPIVLPTVPNLANSGSYSPTMRYTENDVRHIVSFAASFGIRVIPEID
Sbjct: 223 LNVFHWHITDAQSFPIVLPTVPNLANSGSYSPTMRYTENDVRHIVSFAASFGIRVIPEID 282
Query: 283 MPGHTGSWAGAYPEIVTCANRFWXXXXXXXXXXXXGTGQLNPLNPKTYRVAQDVLRDMVA 342
MPGHTGSWAGAYPEIVTCANRFW GTGQLNPLNPKTYRVAQDVLRDMVA
Sbjct: 283 MPGHTGSWAGAYPEIVTCANRFWAPHAEPALAAEPGTGQLNPLNPKTYRVAQDVLRDMVA 342
Query: 343 LFPDPYLHGGADEVNTACWEDDPVVRRFLAEGGTHDHLLELFINATRPFVAQELNRTVVY 402
LFPDPYLHGGADEVNTACWEDDPVVRRFLAEGGTHDHLLELFINATRPFVAQELNRTVVY
Sbjct: 343 LFPDPYLHGGADEVNTACWEDDPVVRRFLAEGGTHDHLLELFINATRPFVAQELNRTVVY 402
Query: 403 WEDVLLGPKVTVGPTILPRETTILQTWNDGPENTKRVVAAGYRAIVSSASYYYLDCGHGG 462
WEDVLLGPKVTVGPTILPRETTILQTWNDGPENTKRVVAAGYRAIVSSASYYYLDCGHGG
Sbjct: 403 WEDVLLGPKVTVGPTILPRETTILQTWNDGPENTKRVVAAGYRAIVSSASYYYLDCGHGG 462
Query: 463 WVGNDSRYDKQEKEREGTPLFNDPGGTGGSWCAPFKTWQRVYDYDILHGLTDDEAQLVLG 522
WVGNDSRYDKQEKEREGTPLFNDPGGTGGSWCAPFKTWQRVYDYDILHGLTDDEAQLVLG
Sbjct: 463 WVGNDSRYDKQEKEREGTPLFNDPGGTGGSWCAPFKTWQRVYDYDILHGLTDDEAQLVLG 522
Query: 523 GEVALWSEQSDETVLDARLWPRXXXXXETLWSGNKGSNGKKRYANATDRLNDWRHRMVER 582
GEVALWSEQSDETVLDARLWPR ETLWSGNKGSNGKKRYANATDRLNDWRHRMVER
Sbjct: 523 GEVALWSEQSDETVLDARLWPRAAAAAETLWSGNKGSNGKKRYANATDRLNDWRHRMVER 582
Query: 583 GIRAEPIQPLWCSLHPGMCNLSQ 605
GIRAEPIQPLWCSLHPGMCNLSQ
Sbjct: 583 GIRAEPIQPLWCSLHPGMCNLSQ 605
>Os07g0575500 Glycoside hydrolase, family 20 protein
Length = 706
Score = 595 bits (1535), Expect = e-170, Method: Compositional matrix adjust.
Identities = 314/572 (54%), Positives = 379/572 (66%), Gaps = 27/572 (4%)
Query: 46 VQVWPKPTSISWPSAVYA-PLSPSFSVRAVLSHPSLRQAVAFYTRLIRAERHAPLVPPA- 103
V VWPKPTS+SW A +S SF V A + L A Y L+ AER+ PLV PA
Sbjct: 33 VNVWPKPTSMSWAEPHMAVRVSSSFHVVAPSGNAHLLSAARRYAALLLAERYRPLVTPAV 92
Query: 104 NYTLSRV---------PVRTLTLSVSDPDVPLGPAVDESYTLSVLPDSGSADISAATPWG 154
N T + LTL+VSD PL VDESY L +LP +A ++AAT WG
Sbjct: 93 NVTAGGAGAGAAGRGAELGYLTLAVSDLHAPLQHGVDESYALEILPAGAAATVTAATAWG 152
Query: 155 AIRGLETFSQLXXXXXXXXXXXQPIVPSGIEISDRPHFTHRGILLDTARNFYPVRDILHT 214
A+RGLETFSQL +V +G+ + DRP + HRG++LDT R ++PV DIL T
Sbjct: 153 AMRGLETFSQLAWWCGRERAV---LVAAGVRVEDRPLYPHRGLMLDTGRTYFPVADILRT 209
Query: 215 LRAMAFNKLNVFHWHITDAQSFPIVLPTVPNLANSGSYSPTMRYTENDVRHIVSFAASFG 274
+ AMA NK+NVFHWHITD+QSFP+ LP+ P LA GSY MRYT +DV+ IV FA + G
Sbjct: 210 IDAMAANKMNVFHWHITDSQSFPLELPSEPALAEKGSYGDGMRYTVDDVKLIVDFAMNRG 269
Query: 275 IRVIPEIDMPGHTGSWAGAYPEIVTCANRFWXXXXX---XXXXXXXGTGQLNPLNPKTYR 331
+RV+PEID PGHT SWAGAYPE+V+CA FW G GQLNPL PKTY+
Sbjct: 270 VRVVPEIDTPGHTASWAGAYPELVSCAGEFWLPDASDWPSRLAAEPGAGQLNPLEPKTYQ 329
Query: 332 VAQDVLRDMVALFPDPYLHGGADEVNTACWEDDPVVRRFLAEGGTHDHLLELFINATRPF 391
V +V+ D+ +LFPD + H GADEV CW DP ++R+LA GGT LLE F+ A P
Sbjct: 330 VMSNVINDVTSLFPDGFYHAGADEVTPGCWNADPSIQRYLARGGTLSRLLEKFVGAAHPL 389
Query: 392 VAQELNRTVVYWEDVLLGPKVTVGPTILPRETTILQTWNDGPENTKRVVAAGYRAIVSSA 451
+ NRT VYWEDVLL V V + +P ETTILQTWN+G NT+ +V AGYRAIVSSA
Sbjct: 390 IVSR-NRTAVYWEDVLLDQAVNVTASAIPPETTILQTWNNGGNNTRLIVRAGYRAIVSSA 448
Query: 452 SYYYLDCGHGGWVGNDSRYDKQEKEREGTPLFNDPGGTGGSWCAPFKTWQRVYDYDILHG 511
S+YYLDCGHG + GNDS YD +D G +GGSWC P+KTWQRVYDYD+ G
Sbjct: 449 SFYYLDCGHGDFAGNDSAYDDPR---------SDYGTSGGSWCGPYKTWQRVYDYDVAGG 499
Query: 512 LTDDEAQLVLGGEVALWSEQSDETVLDARLWPRXXXXXETLWSGNKGSNGKKRYANATDR 571
LT +EA+LV+GGEVA+W+EQ D VLD R+WPR E LWSGN+ + G+KRYA ATDR
Sbjct: 500 LTAEEARLVVGGEVAMWTEQVDAAVLDGRVWPRASAMAEALWSGNRDATGRKRYAEATDR 559
Query: 572 LNDWRHRMVERGIRAEPIQPLWCSLHPGMCNL 603
L DWRHRMV RG+RAEPIQPLWC PGMCNL
Sbjct: 560 LTDWRHRMVGRGVRAEPIQPLWCRNRPGMCNL 591
>Os05g0115900 Glycoside hydrolase, family 20 protein
Length = 541
Score = 229 bits (585), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 170/502 (33%), Positives = 226/502 (45%), Gaps = 76/502 (15%)
Query: 112 VRTLTLSVSDPDVPLGPAVDESYTLSVLPDSG------SADISAATPWGAIRGLETFSQL 165
V LT+ V+ D L VDESYT+ V G A I A T +GAIRGLETFSQL
Sbjct: 101 VGKLTVVVASADEKLELGVDESYTIYVAAAGGVNSIVGGATIEANTIYGAIRGLETFSQL 160
Query: 166 XXXXXXXXXXXQPIVPSGIEISDRPHFTHRGILLDTARNFYPVRDILHTLRAMAFNKLNV 225
P IE D P F RG+LLDT+R+F PV I + +M+F+KLNV
Sbjct: 161 CVFNYDTKNVEVRHAPWYIE--DEPRFAFRGLLLDTSRHFLPVDVIKQVIDSMSFSKLNV 218
Query: 226 FHWHITDAQSFPIVLPTVPNLANSGSYSPTMRYTENDVRHIVSFAASFGIRVIPEIDMPG 285
HWHI D QSFP+ +P+ P L GSYS RYT D R IVS+A GI V+ EID+PG
Sbjct: 219 LHWHIIDEQSFPLEVPSYPKLWK-GSYSKLERYTVEDARDIVSYARKRGIHVMAEIDVPG 277
Query: 286 HTGSWAGAYPEIVTCANRFWXXXXXXXXXXXXGTGQLNPLNPKTYRVAQDVLRDMVALFP 345
H SW YP + W L+ + T+ V +L DM +FP
Sbjct: 278 HAESWGKGYP-------KLWPSPKCRE--------PLDVTSNFTFEVISGILSDMRKIFP 322
Query: 346 DPYLHGGADEVNTACWEDDPVVRRFLAEGG--THDHLLELFINATRPFVAQELNRTVVYW 403
H G DEV T CW P V+++L E T D + A +A LN V W
Sbjct: 323 FGLFHLGGDEVYTGCWNATPHVKQWLHERNMTTKDAYKYFVLKAQE--IAINLNWIPVNW 380
Query: 404 EDVLLGPKVTVGPTILPRETTILQTWNDGPENTKRVVAAGYRAIVSSASYYYLDCGHGGW 463
E+ K + P T++ W GP +VV G+R I+S+ +YLD
Sbjct: 381 EETFNSFKENLNP------LTVVHNWL-GPGVCPKVVEKGFRCIMSNQGVWYLDH----- 428
Query: 464 VGNDSRYDKQEKEREGTPLFNDPGGTGGSWCAPFKTWQRVYDYDILHGLTDDEAQ-LVLG 522
+ P WQ Y + L G+ + Q LVLG
Sbjct: 429 -------------------LDVP-------------WQDFYTSEPLAGINNTAQQKLVLG 456
Query: 523 GEVALWSEQSDETVLDARLWPRXXXXXETLWSGNKGSNGKKRYANATDRLNDWRHRMVER 582
GEV +W E +D + + +WPR E +WS + + + RL+ +R + R
Sbjct: 457 GEVCMWGETADTSDVQQTIWPRAAAAAERMWSQLEAISAQDLETTVLARLHYFRCLLNHR 516
Query: 583 GIRAEPIQPLWCS---LHPGMC 601
GI A P+ + + PG C
Sbjct: 517 GIAAAPVTNSYARRPPIGPGSC 538
>Os01g0891000 Glycoside hydrolase, family 20 protein
Length = 526
Score = 221 bits (562), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 176/561 (31%), Positives = 261/561 (46%), Gaps = 96/561 (17%)
Query: 46 VQVWPKPTSISWPSA-VYAPLSPSFSVRAVLSHPS----LRQAVAFYTRLIRAERHAPLV 100
+ +WP PTS+S + +Y + S+ ++P L+ A L++
Sbjct: 28 IDLWPMPTSVSHGTQRLYVSKDITMSMEGS-TYPDGKGILKDAFQRVVDLMKLNHVVDGA 86
Query: 101 PPANYTLSRVPVRTLTLSVSDPDVPLGPAVDESYTLSVLPDSG---SADISAATPWGAIR 157
P+++ L+ V V V P+ L VDESY LSV P +G I A T +GA+
Sbjct: 87 NPSSFVLTGVNV-----VVHSPEDELKFGVDESYNLSV-PTAGYPLRVQIEAQTVFGALH 140
Query: 158 GLETFSQLXXXXXXXXXXXQPIVPSGIEISDRPHFTHRGILLDTARNFYPVRDILHTLRA 217
L+TFSQL ++ + ISD P F +RG+L+DT+R++ PV I +
Sbjct: 141 ALQTFSQLCYFDFTSKLIE--LISAPWRISDTPRFPYRGLLIDTSRHYLPVTVIKKVIDT 198
Query: 218 MAFNKLNVFHWHITDAQSFPIVLPTVPNLANSGSYSPTMRYTENDVRHIVSFAASFGIRV 277
MA++KLNV HWHI DAQSFPI +P+ P L N GSYS + RYT +D IV +A + G+ V
Sbjct: 199 MAYSKLNVLHWHIVDAQSFPIEIPSYPKLWN-GSYSFSERYTTSDAVDIVRYAENRGVNV 257
Query: 278 IPEIDMPGHTGSWAGAYPEIVTCANRFWXXXXXXXXXXXXGTGQLNPLNPKTYRVAQDVL 337
+ EID+PGH SW YP + W L+ N T+ V +L
Sbjct: 258 MAEIDVPGHALSWGVGYPSL-------WPSDSCKEP--------LDVSNNFTFGVIDGIL 302
Query: 338 RDMVALFPDPYLHGGADEVNTACWEDDPVVRRFLAEGGTHDHLLELFINATRPFV--AQE 395
D +F ++H G DEVNT+CW P ++++L + + +A R FV +Q+
Sbjct: 303 SDFSKVFKFKFVHLGGDEVNTSCWTATPHIKKWLDDNQMN------VSDAYRYFVLRSQK 356
Query: 396 LNRT----VVYWEDVL--LGPKVTVGPTILPRETTILQTWNDGPENTKRVVAAGYRAIVS 449
L + V+ WE+ G K+ T++ W G + +VVAAG R IVS
Sbjct: 357 LAISHGYDVINWEETFNNFGDKLD--------RRTVVHNWL-GEDVAPKVVAAGLRCIVS 407
Query: 450 SASYYYLDCGHGGWVGNDSRYDKQEKEREGTPLFNDPGGTGGSWCAPFKTWQRVYDYDIL 509
+ +YLD TW+ Y + L
Sbjct: 408 NQDKWYLDHLDA-------------------------------------TWEGFYTNEPL 430
Query: 510 HGLTDDEAQ-LVLGGEVALWSEQSDETVLDARLWPRXXXXXETLWSGNKGSNGKKRYANA 568
G+ D E Q LV+GGEV +W EQ D + ++ +WPR E LW+ + R
Sbjct: 431 KGIDDPEQQSLVIGGEVCMWGEQIDASDIEQTIWPRAAAAAERLWTPIEKIAEDPRL--V 488
Query: 569 TDRLNDWRHRMVERGIRAEPI 589
T RL +R + +RG+ A P+
Sbjct: 489 TSRLARFRCLLNQRGVAAAPV 509
>Os05g0415700 Glycoside hydrolase, family 20 protein
Length = 531
Score = 214 bits (544), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 160/490 (32%), Positives = 229/490 (46%), Gaps = 89/490 (18%)
Query: 115 LTLSVSDPDVPLGPAVDESYTLSVLPDSGS---ADISAATPWGAIRGLETFSQLXXXXXX 171
+ + V P L VDESY LSV P +GS A I A T +GA+ LETFSQL
Sbjct: 99 VNVVVHLPGDELNFGVDESYNLSV-PATGSPIYAQIEAQTVFGALHALETFSQLCNFDFT 157
Query: 172 XXXXXQPIVPSGIEISDRPHFTHRGILLDTARNFYPVRDILHTLRAMAFNKLNVFHWHIT 231
P I+D P F +RG+L+DT+R++ PV I + +M ++KLNV HWHI
Sbjct: 158 SRLIELQSAP--WSITDMPRFPYRGLLIDTSRHYLPVPVIKSVIDSMTYSKLNVLHWHIV 215
Query: 232 DAQSFPIVLPTVPNLANSGSYSPTMRYTENDVRHIVSFAASFGIRVIPEIDMPGHTGSWA 291
D QSFPI +P+ P L N G+YS + RYT +D IV +A G+ V+ EID+PGH SW
Sbjct: 216 DEQSFPIEIPSYPKLWN-GAYSYSERYTMDDAIDIVQYAERRGVNVLAEIDVPGHALSWG 274
Query: 292 GAYPEI---VTCANRFWXXXXXXXXXXXXGTGQLNPLNPKTYRVAQDVLRDMVALFPDPY 348
YP + TC L+ + T++V +L D +F +
Sbjct: 275 VGYPSLWPSATCKE------------------PLDVSSESTFQVINGILSDFSKVFKFKF 316
Query: 349 LHGGADEVNTACWEDDPVVRRFLAEGGTHDHLLELFINATRPFV--AQELNRT----VVY 402
+H G DEVNT+CW P V+ +LA+ G + +A R FV AQ++ ++ V+
Sbjct: 317 VHLGGDEVNTSCWTSTPRVKAWLAQHGMKES------DAYRYFVLRAQKIAKSHGYEVIN 370
Query: 403 WEDVL--LGPKVTVGPTILPRETTILQTWNDGPENTKRVVAAGYRAIVSSASYYYLDCGH 460
WE+ G K+ T++ W G ++VVAAG R IVS+ +YLD
Sbjct: 371 WEETFNNFGDKLD--------RRTVVHNWLGGGV-AEKVVAAGLRCIVSNQDKWYLDHLE 421
Query: 461 GGWVGNDSRYDKQEKEREGTPLFNDPGGTGGSWCAPFKTWQRVYDYDILHGLTDDEAQ-L 519
TW Y + L + + Q L
Sbjct: 422 -------------------------------------VTWDGFYMNEPLRNIKNPAQQKL 444
Query: 520 VLGGEVALWSEQSDETVLDARLWPRXXXXXETLWSGNKGSNGKKRYANATDRLNDWRHRM 579
VLGGEV +W+E D + + +WPR E LW+ + + + A + RL +R +
Sbjct: 445 VLGGEVCMWAEHIDASDIQQTIWPRAAAAAERLWTPFEKLSKEWEIAALSARLARFRCLL 504
Query: 580 VERGIRAEPI 589
RGI A P+
Sbjct: 505 NHRGIAAGPV 514
>Os05g0390200
Length = 162
Score = 121 bits (303), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 64/111 (57%), Positives = 75/111 (67%), Gaps = 13/111 (11%)
Query: 340 MVALFPDPYLHGGADEVNTACWEDDPVVRRFLAEGGTHDHLLELFINATRPFVAQELNRT 399
MVALFPDPYLHGG DEVNTACWE+DPVVRRFLAEGGTH+HLLE+FIN TRPFVAQELN+
Sbjct: 1 MVALFPDPYLHGGTDEVNTACWENDPVVRRFLAEGGTHNHLLEVFINTTRPFVAQELNQ- 59
Query: 400 VVYWEDVLLGPKVTVGPTILPRETTILQTWNDGPENTKRVVAAGYRAIVSS 450
+ W V P+ P P I E+ VA+ +R I ++
Sbjct: 60 -LPWHSV---PQPLGQPVCRPPSHRI--------ESAAATVASSFRRIAAA 98
Database: rap3
Posted date: Nov 19, 2010 6:03 PM
Number of letters in database: 17,035,801
Number of sequences in database: 52,214
Lambda K H
0.319 0.136 0.436
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 52214
Number of Hits to DB: 21,127,479
Number of extensions: 953326
Number of successful extensions: 1998
Number of sequences better than 1.0e-10: 6
Number of HSP's gapped: 1977
Number of HSP's successfully gapped: 6
Length of query: 605
Length of database: 17,035,801
Length adjustment: 107
Effective length of query: 498
Effective length of database: 11,448,903
Effective search space: 5701553694
Effective search space used: 5701553694
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 159 (65.9 bits)
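
The footer statistics above can be cross-checked against the hit list with a little arithmetic. The sketch below is not part of the BLAST output. It assumes the standard Karlin-Altschul relations (bit score = (lambda * S - ln K) / ln 2, E = K * m * n * exp(-lambda * S)) and that the effective lengths are obtained by subtracting the length adjustment from the query and from every database sequence. The function names are hypothetical. The constants are copied from this report, and the printed Lambda and K are rounded, so results are approximate.

```python
import math

# Values copied from the report footer above (gapped Lambda and K).
QUERY_LEN = 605
DB_LETTERS = 17_035_801
DB_SEQS = 52_214
LENGTH_ADJUSTMENT = 107
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def effective_search_space(query_len, db_letters, db_seqs, adjustment):
    """Effective query length times effective database length."""
    eff_query = query_len - adjustment            # 605 - 107 = 498
    eff_db = db_letters - db_seqs * adjustment    # adjustment applied per sequence
    return eff_query * eff_db


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits (Karlin-Altschul normalization)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Expected number of chance hits scoring at least raw_score."""
    return k * search_space * math.exp(-lam * raw_score)


if __name__ == "__main__":
    space = effective_search_space(
        QUERY_LEN, DB_LETTERS, DB_SEQS, LENGTH_ADJUSTMENT
    )
    print("effective search space:", space)
    # Raw scores are the parenthesized values on each "Score =" line.
    for raw in (2831, 1535, 585, 562, 544, 303):
        print(raw, int(bit_score(raw)), f"{expect_value(raw, space):.0e}")
```

With these inputs the computed search space equals the reported 5701553694. The bit scores in this report match the computed values when the fractional part is dropped rather than rounded, which is an observation about this output and not a documented rule.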