BLASTP 2.2.23 [Feb-03-2010]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment:
Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala,
Aleksandr Morgulis, Alejandro A. Schäffer, and Yi-Kuo Yu (2005) "Protein database
searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Os05g0474900 Os05g0474900|AK102680
(554 letters)
Database: rap3
52,214 sequences; 17,035,801 total letters
Searching..................................................done
                                                                 Score  E
Sequences producing significant alignments:                     (bits)  Value
Os05g0474900 Protein of unknown function Cys-rich family pr...    1094  0.0
Os01g0825900 Protein of unknown function Cys-rich family pr...     352  4e-97
Os03g0299800 Protein of unknown function Cys-rich family pr...     337  2e-92
Os11g0109600 Protein of unknown function Cys-rich family pr...     334  1e-91
Os11g0109700 Protein of unknown function Cys-rich family pr...     275  7e-74
Os05g0341900                                                       234  9e-62
Os12g0109700 Protein of unknown function Cys-rich family pr...      86  8e-17
>Os05g0474900 Protein of unknown function Cys-rich family protein
Length = 554
Score = 1094 bits (2830), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 527/554 (95%), Positives = 527/554 (95%)
Query: 1 MVSNGNEDLKADVELVESTTVDNDTGAPGASTLPTQGVPRQGKQRNGFLNFCNRFSSGDR 60
MVSNGNEDLKADVELVESTTVDNDTGAPGASTLPTQGVPRQGKQRNGFLNFCNRFSSGDR
Sbjct: 1 MVSNGNEDLKADVELVESTTVDNDTGAPGASTLPTQGVPRQGKQRNGFLNFCNRFSSGDR 60
Query: 61 FKKLGPSPSFKFRQLALERDEFSRSIHSDSHDNHEHFQFIRKINWGHLWVMCKDWIKEPL 120
FKKLGPSPSFKFRQLALERDEFSRSIHSDSHDNHEHFQFIRKINWGHLWVMCKDWIKEPL
Sbjct: 61 FKKLGPSPSFKFRQLALERDEFSRSIHSDSHDNHEHFQFIRKINWGHLWVMCKDWIKEPL 120
Query: 121 NMALFAWIACVTVSGAILFLVMTGMLNRALPSKSQRDAWFEVNNQILNALFTLMCLYQHP 180
NMALFAWIACVTVSGAILFLVMTGMLNRALPSKSQRDAWFEVNNQILNALFTLMCLYQHP
Sbjct: 121 NMALFAWIACVTVSGAILFLVMTGMLNRALPSKSQRDAWFEVNNQILNALFTLMCLYQHP 180
Query: 181 KRIYYFVLLCRWEQKDVLVLRKTYCKNGTYKPNEWMHMMXXXXXXXXXCFAQYALCGLNL 240
KRIYYFVLLCRWEQKDVLVLRKTYCKNGTYKPNEWMHMM CFAQYALCGLNL
Sbjct: 181 KRIYYFVLLCRWEQKDVLVLRKTYCKNGTYKPNEWMHMMVVVVLLNLNCFAQYALCGLNL 240
Query: 241 GYRRSERPPIGVGLTISVAIGAAAFAGLYNIISPLGKDYDTELTEVDQEAQTELTRPATS 300
GYRRSERPPIGVGLTISVAIGAAAFAGLYNIISPLGKDYDTELTEVDQEAQTELTRPATS
Sbjct: 241 GYRRSERPPIGVGLTISVAIGAAAFAGLYNIISPLGKDYDTELTEVDQEAQTELTRPATS 300
Query: 301 RTSLEKRYSFIQSEERRFVESRPEWVGGLMDFWDNISLAYLSIFCSCCVFGWNMQRLGFG 360
RTSLEKRYSFIQSEERRFVESRPEWVGGLMDFWDNISLAYLSIFCSCCVFGWNMQRLGFG
Sbjct: 301 RTSLEKRYSFIQSEERRFVESRPEWVGGLMDFWDNISLAYLSIFCSCCVFGWNMQRLGFG 360
Query: 361 NMYVHIATFMLFCLAPFFIFNLAAVNINNENLREALGLTGLALCFFGLLYGGFWRIQMRK 420
NMYVHIATFMLFCLAPFFIFNLAAVNINNENLREALGLTGLALCFFGLLYGGFWRIQMRK
Sbjct: 361 NMYVHIATFMLFCLAPFFIFNLAAVNINNENLREALGLTGLALCFFGLLYGGFWRIQMRK 420
Query: 421 RFNLPANNFCCRSAEATDCFQWLCCSSCSLAQEVRTADYYDIAEDRSYTEQITARSQHVM 480
RFNLPANNFCCRSAEATDCFQWLCCSSCSLAQEVRTADYYDIAEDRSYTEQITARSQHVM
Sbjct: 421 RFNLPANNFCCRSAEATDCFQWLCCSSCSLAQEVRTADYYDIAEDRSYTEQITARSQHVM 480
Query: 481 TPLSREDGLPLFRSNPGSPYRSSTASPSIFIMEXXXXXXXXXXXXXXXXXXTMGDRTMKA 540
TPLSREDGLPLFRSNPGSPYRSSTASPSIFIME TMGDRTMKA
Sbjct: 481 TPLSREDGLPLFRSNPGSPYRSSTASPSIFIMESPSAPRRSPGPSPLGGSPTMGDRTMKA 540
Query: 541 PTPSVLHRDGEPEL 554
PTPSVLHRDGEPEL
Sbjct: 541 PTPSVLHRDGEPEL 554
>Os01g0825900 Protein of unknown function Cys-rich family protein
Length = 525
Score = 352 bits (903), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 172/373 (46%), Positives = 233/373 (62%), Gaps = 17/373 (4%)
Query: 94 HEHFQFIRKINWGHLWVMCKDWIKEPLNMALFAWIACVTVSGAILFLVMTGMLNRALPSK 153
H F+R+INW ++ CK+W+K PLN+AL AW+ CV +G +L L++ G+LNRA PSK
Sbjct: 85 HVEVHFVRRINWSSVFSFCKNWLKHPLNIALLAWLLCVAAAGGMLILLLLGLLNRAFPSK 144
Query: 154 SQRDAWFEVNNQILNALFTLMCLYQHPKRIYYFVLLCRWEQKDVLVLRKTYCKNGTYKPN 213
R W E++NQILNALFTLM +YQHP I++ VLLCRW +D LRK YCKNG +P
Sbjct: 145 PLRHHWIEIDNQILNALFTLMSIYQHPSLIHHLVLLCRWRPEDAAELRKVYCKNGDRRPG 204
Query: 214 EWMHMMXXXXXXXXXCFAQYALCGLNLGYRRSERPPIGVGLTISVAIGAAAFAGLYNIIS 273
E HM C +QY +C L YR R + + A AG Y + S
Sbjct: 205 ERAHMSVVVALLHVTCISQYVVCNLYWAYRSRSRSEFADNFFFVLGVVAPVVAGAYTVYS 264
Query: 274 PLGKDYDTELTEVDQEAQTELTRPATSRTSLEKRYSFIQSE--ERRFVESRPEWVGGLMD 331
PLG+D D + A+ + +++ I++E R V P W GGL+D
Sbjct: 265 PLGRDTDDD---------------ASGEEAKQQQQHMIEAELPGTRTVVVDPVWAGGLLD 309
Query: 332 FWDNISLAYLSIFCSCCVFGWNMQRLGFGNMYVHIATFMLFCLAPFFIFNLAAVNINNEN 391
++ + LS C+ CVFGWNM+RLGFGNMYVH A F+L C+APF++FN+ A++I++ +
Sbjct: 310 CGEDPAACCLSSLCTFCVFGWNMERLGFGNMYVHTAMFLLLCVAPFWVFNITALHIHDYD 369
Query: 392 LREALGLTGLALCFFGLLYGGFWRIQMRKRFNLPANNFCCRSAEATDCFQWLCCSSCSLA 451
L +A+G G+ALCF GLLYGGFWR+QMRKRF LP + +CC SA TD +WL C C+LA
Sbjct: 370 LSDAVGAAGIALCFLGLLYGGFWRVQMRKRFALPGSRWCCGSASLTDYARWLFCWPCALA 429
Query: 452 QEVRTADYYDIAE 464
QEVRT + YD+ +
Sbjct: 430 QEVRTGNLYDVED 442
>Os03g0299800 Protein of unknown function Cys-rich family protein
Length = 610
Score = 337 bits (863), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 175/403 (43%), Positives = 233/403 (57%), Gaps = 24/403 (5%)
Query: 102 KINWGHLWVMCKDWIKEPLNMALFAWIACVTVSGAILFLVMTGMLNRALPSKSQRDAWFE 161
KI WG LW W ++P N A+ W+A V +LF++MTGML+ A+P QR W E
Sbjct: 148 KIKWGKLWSYAVSWCRKPENFAMIIWLAFVAAGLLMLFMLMTGMLDSAIPDDEQRKKWTE 207
Query: 162 VNNQILNALFTLMCLYQHPKRIYYFVLLCRWEQ---KDVLVLRKTYCKNGTYKPNEWMHM 218
V NQILNALFT+MCLYQHPK ++ VLL RW D +RK YCK+G +P++ HM
Sbjct: 208 VINQILNALFTIMCLYQHPKIFHHLVLLLRWRPGAGADREEIRKVYCKDGAPRPHDRAHM 267
Query: 219 MXXXXXXXXXCFAQYALCGLNLGYRRSERPPIGVGLTISVAIGAAAFAGLYNIISPLGKD 278
+ C AQY C L Y R ERP + + + G AGLY PLG+
Sbjct: 268 LVVVVLLHATCLAQYFCCALFWSYARKERPDWALNIGYGLGTGCPVIAGLYAAYGPLGRK 327
Query: 279 YDTELTEVDQEAQTEL-TRPATSRTSLEKRYSFIQSEERRFVESRPEWVGGLMDFWDNIS 337
+ E AQ RPA + +E I+ RR V S PEW GGL D D+ +
Sbjct: 328 QHEDSDEESAAAQAGGGNRPAENDREVE-----IKIYNRRVVVSSPEWSGGLFDCCDDGT 382
Query: 338 LAYLSIFCSCCVFGWNMQRLGFGNMYVHIATFMLFCLAPFFIFNLAAVNINNENLREALG 397
+ LS C+ CVFGWNM+RLGFGNMYVH TF+L C+APF IF++ A+N++++++R+ +
Sbjct: 383 VCALSATCTFCVFGWNMERLGFGNMYVHAFTFILLCVAPFLIFSVTALNVHDDDIRDTVV 442
Query: 398 LTGLALCFFGLLYGGFWRIQMRKRFNLPAN-------------NFCCRSAEATDCFQWLC 444
G+ L G LYGGFWR QMRKR+ LPA+ CR+A +DC +WL
Sbjct: 443 SVGVLLGLCGFLYGGFWRTQMRKRYKLPASGCGCGCECGAGGQGHACRAA-VSDCAKWLF 501
Query: 445 CSSCSLAQEVRTADYYDIAEDR-SYTEQITARSQHVMTPLSRE 486
C SC+LAQEVRTA++YD+ +DR + + V+ PL RE
Sbjct: 502 CWSCALAQEVRTANFYDVEDDRFVFHGARNEDGRAVLVPLPRE 544
>Os11g0109600 Protein of unknown function Cys-rich family protein
Length = 1124
Score = 334 bits (856), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 173/408 (42%), Positives = 231/408 (56%), Gaps = 9/408 (2%)
Query: 99 FIRKINWGHLWVMCKDWIKEPLNMALFAWIACVTVSGAILFLVMTGMLNRALPSKSQRDA 158
F+R ++W L C W K P+N AL W+A V A +FL+MTG LN A+P+ S+R
Sbjct: 75 FVRSVDWRALRAKCLAWAKHPMNAALLIWLAFVAGGVAFVFLLMTGALNSAVPAASRRRR 134
Query: 159 WFEVNNQILNALFTLMCLYQHPKRIYYFVLLCRWEQKDVLVLRKTYCKNGTYK-PNEWMH 217
W EV NQ+LNALFT+MC+YQHPK ++ LL RW DV LR YCKNG E +H
Sbjct: 135 WTEVANQMLNALFTIMCVYQHPKLCHHLALLLRWRAADVAELRALYCKNGAAGLRRERLH 194
Query: 218 MMXXXXXXXXXCFAQYALCGLNLGYRRSERPPIGVGLTISVAIGAAAFAGLYNIISPLGK 277
+ CFAQY C L + R RP + V L +++ +G A LY + PLG+
Sbjct: 195 VAAVVLLFHATCFAQYGYCALFWFFGRDNRPDLAVNLCMALGLGFPIVAALYMVYGPLGR 254
Query: 278 DYDTELTEVDQEAQTELTRPATSRTSLEKRYSFIQSEERRFVESRPEWVGGLMDFWDNIS 337
D E + ++ + S R V ++PEW GGL D D+ +
Sbjct: 255 KIVLIPASTDDEENLNSQVDEANAIAVTAQ---CDSNRNRAVVAKPEWAGGLFDVGDDPT 311
Query: 338 LAYLSIFCSCCVFGWNMQRLGFGNMYVHIATFMLFCLAPFFIFNLAAVNINNENLREALG 397
+A LS+ C+ CVFGWNM+RLG GNMYVH+ TF L C AP +F +AA+N++++ LR +G
Sbjct: 312 VAALSLSCTFCVFGWNMERLGLGNMYVHVFTFALLCAAPVLVFAVAALNVHDDTLRFVVG 371
Query: 398 LTGLALCFFGLLYGGFWRIQMRKRFNLPAN--NFCCRSAEATDCFQWLCCSSCSLAQEVR 455
G L GL YGGFWR QMR+RF LPA+ + C A A D +WLCC+ C+LAQEVR
Sbjct: 372 AAGALLSVLGLTYGGFWRAQMRRRFGLPAHRWSMCGGRATAADYGKWLCCAPCALAQEVR 431
Query: 456 TADYYDIAEDRSYTE--QITARSQHVMTPLSREDGLPLFRSNPGSPYR 501
TA+ YD+ ED Y + + + M PL RE G + P P R
Sbjct: 432 TANLYDVEEDVLYAKGGEEEEEEEAAMAPLERE-GCIVAVDAPPLPMR 478
>Os11g0109700 Protein of unknown function Cys-rich family protein
Length = 553
Score = 275 bits (703), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 160/405 (39%), Positives = 219/405 (54%), Gaps = 31/405 (7%)
Query: 95 EHFQFIRKINWGHLWVMCKDWIKEPLNMALFAWIACVTVSGAILFLVMTGMLNRALPSKS 154
+H INW + KDWI P+N+A+ W+ CV VSGA+L L++ G+L+ A P+ +
Sbjct: 61 DHLSAGIAINWSSVRSATKDWITNPMNIAMLLWLLCVAVSGAMLVLLLLGLLDGAFPTPA 120
Query: 155 QRDAWFEVNNQILNALFTLMCLYQHPKRIYYFVLLCRWEQKDVLVLRKTYCKNGT-YKPN 213
R+ W E+NNQ+LNALFTLM LYQHP ++ LLCRW D LR Y K+G +
Sbjct: 121 ARNHWIEINNQVLNALFTLMSLYQHPVLCHHLFLLCRWRPADAADLRAAYFKDGAGPRHG 180
Query: 214 EWMHMMXXXXXXXXXCFAQYALCGLNLGYRRSERPPIGVGLTISVAIGAAAFAGLYNIIS 273
E HM QY LCGL GY + RP + + + A A +Y + S
Sbjct: 181 ERAHMAVVVALLHLTVACQYVLCGLYWGYTKKTRPELVENGFFVLGVVAPVVAVVYTVCS 240
Query: 274 PLGKDYDTELT---EVDQEAQTELTRPATSRTSLEKRYSFIQSEERRFVESRPEWVGGLM 330
PLGKD EL D +Q + T A PEW GG+
Sbjct: 241 PLGKDNYGELACPNAFDSVSQHKCTGHAVVE---------------------PEWAGGMF 279
Query: 331 DFWDNISLAYLSIFCSCCVFGWNMQRLGFGNMYVHIATFMLFCLAPFFIFNLAAVNINNE 390
D + + +LS+ C+ C FGWNM+RLGFG+M+VH ATF+L C AP ++ ++A++I++
Sbjct: 280 DCGGDATAWWLSLSCTFCAFGWNMERLGFGSMFVHTATFVLLCFAPLWVMGVSALHIHDV 339
Query: 391 NLREALGLTGLALCFFGLLYGGFWRIQMRKRFNLPANNFCCRSAEATDCFQWLCCSSCSL 450
+ + +G G LC GLLYGG+WRIQMR+RF LPA+ CC S TD +WL C C+L
Sbjct: 340 VIGDMVGGAGALLCVCGLLYGGYWRIQMRERFGLPASTACCGSPSVTDYARWLFCWPCAL 399
Query: 451 AQEVRTADYYDIAEDRSYTEQITARSQHVMTPLSREDGLPLFRSN 495
AQEVRT Y I + Y + V+ + E LPL S+
Sbjct: 400 AQEVRTESLYHIDCETFY------KKLPVVDDVEDEKRLPLLASH 438
>Os05g0341900
Length = 521
Score = 234 bits (598), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 160/417 (38%), Positives = 218/417 (52%), Gaps = 40/417 (9%)
Query: 98 QFIRKINWGHLWVMCKDWIKEPLNMALFAWIACVTVSGAILFLVMTGMLNRALPSKSQRD 157
+ +R+++ + C W++ P ++AL AW CV SGA+L L++ G L+ A P KS R+
Sbjct: 54 RLLRRLSPASVARACGRWLRHPAHLALLAWALCVAASGAMLALLLLGALDGAFPRKSARN 113
Query: 158 AWFEVNNQILNALFTLMCLYQHPKRIYYFVLLCRWEQKDVLVLRKTYCKNGTYKPN---- 213
W EVNNQ+LNALFTLM +YQHP ++ +L RW DV LRK Y +
Sbjct: 114 RWIEVNNQVLNALFTLMSIYQHPALFHHAAMLLRWRPDDVKALRKAYRRRRKAAAAGDGA 173
Query: 214 ---EWMHMMXXXXXXXXXCFAQYALCGLNLGYRRSERPPIGVGLTISVAIGAA--AFAGL 268
E +HM CFAQYA+CGL GY R RP T IGAA A AGL
Sbjct: 174 GGWERLHMSVVVALLHVACFAQYAMCGLYWGYSRKARP--DAAETSLAVIGAATPALAGL 231
Query: 269 YNIISPLGKDYDTELTEVDQEAQTELTRPATSRTSLEKRYSFIQSEERRFVESRPEWVGG 328
Y PLG+ ATS E+ + EW GG
Sbjct: 232 YAYFGPLGRRKPGT---------------ATSARHQEEPDDLELAAAAAADVVVAEWAGG 276
Query: 329 LMDFWDNISLAYLSIFCSCCVFGWNMQRLGFGNMYVHIATFMLFCLAPFFIFNLAAVNIN 388
L+D D+ + +LS C+ CVFGWNM+R+G GN +VH TF L C AP ++ N+AA+NI
Sbjct: 277 LLDVGDDPTAWWLSCLCTFCVFGWNMERMGLGNKHVHAVTFALLCFAPLWVLNVAAMNIR 336
Query: 389 NENLREALGLTGLALCFFGLLYGGFWRIQMRKRFNL-----PANNFCCRSAEA-TDCFQW 442
+E + +A+G +ALC GLLYGG+WR +MR+RF L CC S + D +W
Sbjct: 337 DEAVGDAVGAVAVALCALGLLYGGYWRARMRRRFGLLPGRHGGGGACCGSPSSLADYLRW 396
Query: 443 LCCSSCSLAQEVRTADYYDIAEDRSYTEQITARSQH--------VMTPLSREDGLPL 491
+ C SC+LAQEVRTA+ + D + + S ++ PL RE+G+ L
Sbjct: 397 MFCWSCALAQEVRTANVLLLDADEAGGAGGGSSSSGGGGRGDATLLQPLPRENGVKL 453
>Os12g0109700 Protein of unknown function Cys-rich family protein
Length = 219
Score = 85.9 bits (211), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 40/71 (56%), Positives = 47/71 (66%)
Query: 400 GLALCFFGLLYGGFWRIQMRKRFNLPANNFCCRSAEATDCFQWLCCSSCSLAQEVRTADY 459
G LC GLLYGG+WRIQMR+RF LPA+ CC S TD +WL C C+LAQEVRTA
Sbjct: 18 GALLCVCGLLYGGYWRIQMRERFGLPASAACCGSPSVTDYARWLFCWPCALAQEVRTASL 77
Query: 460 YDIAEDRSYTE 470
Y I + Y +
Sbjct: 78 YHIDGETFYKK 88
Database: rap3
Posted date: Nov 19, 2010 6:03 PM
Number of letters in database: 17,035,801
Number of sequences in database: 52,214
Lambda K H
0.324 0.137 0.441
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 52214
Number of Hits to DB: 18,697,471
Number of extensions: 748045
Number of successful extensions: 2111
Number of sequences better than 1.0e-10: 7
Number of HSP's gapped: 2098
Number of HSP's successfully gapped: 7
Length of query: 554
Length of database: 17,035,801
Length adjustment: 106
Effective length of query: 448
Effective length of database: 11,501,117
Effective search space: 5152500416
Effective search space used: 5152500416
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 159 (65.9 bits)
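
Note on the statistics above: the effective lengths and the bit scores can be re-derived from the other figures in this report. The sketch below is only an illustration, not part of the BLAST output. It assumes the usual Karlin-Altschul relations: the length adjustment is subtracted once from the query and once per database sequence, and bit score = (lambda * raw - ln K) / ln 2 using the gapped lambda and K. The function names are hypothetical, and because lambda and K are printed to only three significant digits, the bit scores agree with the report only approximately.

```python
import math

# Values copied from the statistics section of this report.
QUERY_LEN = 554
DB_LETTERS = 17_035_801
DB_SEQS = 52_214
LENGTH_ADJUSTMENT = 106
GAPPED_LAMBDA = 0.267   # rounded as printed in the report
GAPPED_K = 0.0410       # rounded as printed in the report


def effective_search_space(query_len, db_letters, db_seqs, adjustment):
    """Return (effective query length, effective database length, search space)."""
    eff_query = query_len - adjustment
    # The adjustment is removed once for every sequence in the database.
    eff_db = db_letters - db_seqs * adjustment
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits (Karlin-Altschul normalisation)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


eff_query, eff_db, space = effective_search_space(
    QUERY_LEN, DB_LETTERS, DB_SEQS, LENGTH_ADJUSTMENT
)
print("Effective length of query:   ", eff_query)
print("Effective length of database:", eff_db)
print("Effective search space:      ", space)

# Raw scores taken from the report: the Os12g0109700 hit (211) and S2 (159).
for raw in (211, 159):
    print(f"raw {raw} -> {bit_score(raw):.1f} bits")
```

The integer results can be compared directly with the "Effective length" and "Effective search space" lines; the bit scores can be compared with the values in parentheses next to each raw score.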