BLASTP 2.2.23 [Feb-03-2010]


Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment:
Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala,
Aleksandr Morgulis, Alejandro A. Schäffer, and Yi-Kuo Yu (2005) "Protein database
searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Os05g0474900 Os05g0474900|AK102680
         (554 letters)

Database: rap3 
           52,214 sequences; 17,035,801 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

Os05g0474900  Protein of unknown function Cys-rich family pr...  1094   0.0  
Os01g0825900  Protein of unknown function Cys-rich family pr...   352   4e-97
Os03g0299800  Protein of unknown function Cys-rich family pr...   337   2e-92
Os11g0109600  Protein of unknown function Cys-rich family pr...   334   1e-91
Os11g0109700  Protein of unknown function Cys-rich family pr...   275   7e-74
Os05g0341900                                                      234   9e-62
Os12g0109700  Protein of unknown function Cys-rich family pr...    86   8e-17
>Os05g0474900 Protein of unknown function Cys-rich family protein
          Length = 554

 Score = 1094 bits (2830), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 527/554 (95%), Positives = 527/554 (95%)

Query: 1   MVSNGNEDLKADVELVESTTVDNDTGAPGASTLPTQGVPRQGKQRNGFLNFCNRFSSGDR 60
           MVSNGNEDLKADVELVESTTVDNDTGAPGASTLPTQGVPRQGKQRNGFLNFCNRFSSGDR
Sbjct: 1   MVSNGNEDLKADVELVESTTVDNDTGAPGASTLPTQGVPRQGKQRNGFLNFCNRFSSGDR 60

Query: 61  FKKLGPSPSFKFRQLALERDEFSRSIHSDSHDNHEHFQFIRKINWGHLWVMCKDWIKEPL 120
           FKKLGPSPSFKFRQLALERDEFSRSIHSDSHDNHEHFQFIRKINWGHLWVMCKDWIKEPL
Sbjct: 61  FKKLGPSPSFKFRQLALERDEFSRSIHSDSHDNHEHFQFIRKINWGHLWVMCKDWIKEPL 120

Query: 121 NMALFAWIACVTVSGAILFLVMTGMLNRALPSKSQRDAWFEVNNQILNALFTLMCLYQHP 180
           NMALFAWIACVTVSGAILFLVMTGMLNRALPSKSQRDAWFEVNNQILNALFTLMCLYQHP
Sbjct: 121 NMALFAWIACVTVSGAILFLVMTGMLNRALPSKSQRDAWFEVNNQILNALFTLMCLYQHP 180

Query: 181 KRIYYFVLLCRWEQKDVLVLRKTYCKNGTYKPNEWMHMMXXXXXXXXXCFAQYALCGLNL 240
           KRIYYFVLLCRWEQKDVLVLRKTYCKNGTYKPNEWMHMM         CFAQYALCGLNL
Sbjct: 181 KRIYYFVLLCRWEQKDVLVLRKTYCKNGTYKPNEWMHMMVVVVLLNLNCFAQYALCGLNL 240

Query: 241 GYRRSERPPIGVGLTISVAIGAAAFAGLYNIISPLGKDYDTELTEVDQEAQTELTRPATS 300
           GYRRSERPPIGVGLTISVAIGAAAFAGLYNIISPLGKDYDTELTEVDQEAQTELTRPATS
Sbjct: 241 GYRRSERPPIGVGLTISVAIGAAAFAGLYNIISPLGKDYDTELTEVDQEAQTELTRPATS 300

Query: 301 RTSLEKRYSFIQSEERRFVESRPEWVGGLMDFWDNISLAYLSIFCSCCVFGWNMQRLGFG 360
           RTSLEKRYSFIQSEERRFVESRPEWVGGLMDFWDNISLAYLSIFCSCCVFGWNMQRLGFG
Sbjct: 301 RTSLEKRYSFIQSEERRFVESRPEWVGGLMDFWDNISLAYLSIFCSCCVFGWNMQRLGFG 360

Query: 361 NMYVHIATFMLFCLAPFFIFNLAAVNINNENLREALGLTGLALCFFGLLYGGFWRIQMRK 420
           NMYVHIATFMLFCLAPFFIFNLAAVNINNENLREALGLTGLALCFFGLLYGGFWRIQMRK
Sbjct: 361 NMYVHIATFMLFCLAPFFIFNLAAVNINNENLREALGLTGLALCFFGLLYGGFWRIQMRK 420

Query: 421 RFNLPANNFCCRSAEATDCFQWLCCSSCSLAQEVRTADYYDIAEDRSYTEQITARSQHVM 480
           RFNLPANNFCCRSAEATDCFQWLCCSSCSLAQEVRTADYYDIAEDRSYTEQITARSQHVM
Sbjct: 421 RFNLPANNFCCRSAEATDCFQWLCCSSCSLAQEVRTADYYDIAEDRSYTEQITARSQHVM 480

Query: 481 TPLSREDGLPLFRSNPGSPYRSSTASPSIFIMEXXXXXXXXXXXXXXXXXXTMGDRTMKA 540
           TPLSREDGLPLFRSNPGSPYRSSTASPSIFIME                  TMGDRTMKA
Sbjct: 481 TPLSREDGLPLFRSNPGSPYRSSTASPSIFIMESPSAPRRSPGPSPLGGSPTMGDRTMKA 540

Query: 541 PTPSVLHRDGEPEL 554
           PTPSVLHRDGEPEL
Sbjct: 541 PTPSVLHRDGEPEL 554
>Os01g0825900 Protein of unknown function Cys-rich family protein
          Length = 525

 Score =  352 bits (903), Expect = 4e-97,   Method: Compositional matrix adjust.
 Identities = 172/373 (46%), Positives = 233/373 (62%), Gaps = 17/373 (4%)

Query: 94  HEHFQFIRKINWGHLWVMCKDWIKEPLNMALFAWIACVTVSGAILFLVMTGMLNRALPSK 153
           H    F+R+INW  ++  CK+W+K PLN+AL AW+ CV  +G +L L++ G+LNRA PSK
Sbjct: 85  HVEVHFVRRINWSSVFSFCKNWLKHPLNIALLAWLLCVAAAGGMLILLLLGLLNRAFPSK 144

Query: 154 SQRDAWFEVNNQILNALFTLMCLYQHPKRIYYFVLLCRWEQKDVLVLRKTYCKNGTYKPN 213
             R  W E++NQILNALFTLM +YQHP  I++ VLLCRW  +D   LRK YCKNG  +P 
Sbjct: 145 PLRHHWIEIDNQILNALFTLMSIYQHPSLIHHLVLLCRWRPEDAAELRKVYCKNGDRRPG 204

Query: 214 EWMHMMXXXXXXXXXCFAQYALCGLNLGYRRSERPPIGVGLTISVAIGAAAFAGLYNIIS 273
           E  HM          C +QY +C L   YR   R          + + A   AG Y + S
Sbjct: 205 ERAHMSVVVALLHVTCISQYVVCNLYWAYRSRSRSEFADNFFFVLGVVAPVVAGAYTVYS 264

Query: 274 PLGKDYDTELTEVDQEAQTELTRPATSRTSLEKRYSFIQSE--ERRFVESRPEWVGGLMD 331
           PLG+D D +               A+   + +++   I++E    R V   P W GGL+D
Sbjct: 265 PLGRDTDDD---------------ASGEEAKQQQQHMIEAELPGTRTVVVDPVWAGGLLD 309

Query: 332 FWDNISLAYLSIFCSCCVFGWNMQRLGFGNMYVHIATFMLFCLAPFFIFNLAAVNINNEN 391
             ++ +   LS  C+ CVFGWNM+RLGFGNMYVH A F+L C+APF++FN+ A++I++ +
Sbjct: 310 CGEDPAACCLSSLCTFCVFGWNMERLGFGNMYVHTAMFLLLCVAPFWVFNITALHIHDYD 369

Query: 392 LREALGLTGLALCFFGLLYGGFWRIQMRKRFNLPANNFCCRSAEATDCFQWLCCSSCSLA 451
           L +A+G  G+ALCF GLLYGGFWR+QMRKRF LP + +CC SA  TD  +WL C  C+LA
Sbjct: 370 LSDAVGAAGIALCFLGLLYGGFWRVQMRKRFALPGSRWCCGSASLTDYARWLFCWPCALA 429

Query: 452 QEVRTADYYDIAE 464
           QEVRT + YD+ +
Sbjct: 430 QEVRTGNLYDVED 442
>Os03g0299800 Protein of unknown function Cys-rich family protein
          Length = 610

 Score =  337 bits (863), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 175/403 (43%), Positives = 233/403 (57%), Gaps = 24/403 (5%)

Query: 102 KINWGHLWVMCKDWIKEPLNMALFAWIACVTVSGAILFLVMTGMLNRALPSKSQRDAWFE 161
           KI WG LW     W ++P N A+  W+A V     +LF++MTGML+ A+P   QR  W E
Sbjct: 148 KIKWGKLWSYAVSWCRKPENFAMIIWLAFVAAGLLMLFMLMTGMLDSAIPDDEQRKKWTE 207

Query: 162 VNNQILNALFTLMCLYQHPKRIYYFVLLCRWEQ---KDVLVLRKTYCKNGTYKPNEWMHM 218
           V NQILNALFT+MCLYQHPK  ++ VLL RW      D   +RK YCK+G  +P++  HM
Sbjct: 208 VINQILNALFTIMCLYQHPKIFHHLVLLLRWRPGAGADREEIRKVYCKDGAPRPHDRAHM 267

Query: 219 MXXXXXXXXXCFAQYALCGLNLGYRRSERPPIGVGLTISVAIGAAAFAGLYNIISPLGKD 278
           +         C AQY  C L   Y R ERP   + +   +  G    AGLY    PLG+ 
Sbjct: 268 LVVVVLLHATCLAQYFCCALFWSYARKERPDWALNIGYGLGTGCPVIAGLYAAYGPLGRK 327

Query: 279 YDTELTEVDQEAQTEL-TRPATSRTSLEKRYSFIQSEERRFVESRPEWVGGLMDFWDNIS 337
              +  E    AQ     RPA +   +E     I+   RR V S PEW GGL D  D+ +
Sbjct: 328 QHEDSDEESAAAQAGGGNRPAENDREVE-----IKIYNRRVVVSSPEWSGGLFDCCDDGT 382

Query: 338 LAYLSIFCSCCVFGWNMQRLGFGNMYVHIATFMLFCLAPFFIFNLAAVNINNENLREALG 397
           +  LS  C+ CVFGWNM+RLGFGNMYVH  TF+L C+APF IF++ A+N++++++R+ + 
Sbjct: 383 VCALSATCTFCVFGWNMERLGFGNMYVHAFTFILLCVAPFLIFSVTALNVHDDDIRDTVV 442

Query: 398 LTGLALCFFGLLYGGFWRIQMRKRFNLPAN-------------NFCCRSAEATDCFQWLC 444
             G+ L   G LYGGFWR QMRKR+ LPA+                CR+A  +DC +WL 
Sbjct: 443 SVGVLLGLCGFLYGGFWRTQMRKRYKLPASGCGCGCECGAGGQGHACRAA-VSDCAKWLF 501

Query: 445 CSSCSLAQEVRTADYYDIAEDR-SYTEQITARSQHVMTPLSRE 486
           C SC+LAQEVRTA++YD+ +DR  +        + V+ PL RE
Sbjct: 502 CWSCALAQEVRTANFYDVEDDRFVFHGARNEDGRAVLVPLPRE 544
>Os11g0109600 Protein of unknown function Cys-rich family protein
          Length = 1124

 Score =  334 bits (856), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 173/408 (42%), Positives = 231/408 (56%), Gaps = 9/408 (2%)

Query: 99  FIRKINWGHLWVMCKDWIKEPLNMALFAWIACVTVSGAILFLVMTGMLNRALPSKSQRDA 158
           F+R ++W  L   C  W K P+N AL  W+A V    A +FL+MTG LN A+P+ S+R  
Sbjct: 75  FVRSVDWRALRAKCLAWAKHPMNAALLIWLAFVAGGVAFVFLLMTGALNSAVPAASRRRR 134

Query: 159 WFEVNNQILNALFTLMCLYQHPKRIYYFVLLCRWEQKDVLVLRKTYCKNGTYK-PNEWMH 217
           W EV NQ+LNALFT+MC+YQHPK  ++  LL RW   DV  LR  YCKNG      E +H
Sbjct: 135 WTEVANQMLNALFTIMCVYQHPKLCHHLALLLRWRAADVAELRALYCKNGAAGLRRERLH 194

Query: 218 MMXXXXXXXXXCFAQYALCGLNLGYRRSERPPIGVGLTISVAIGAAAFAGLYNIISPLGK 277
           +          CFAQY  C L   + R  RP + V L +++ +G    A LY +  PLG+
Sbjct: 195 VAAVVLLFHATCFAQYGYCALFWFFGRDNRPDLAVNLCMALGLGFPIVAALYMVYGPLGR 254

Query: 278 DYDTELTEVDQEAQTELTRPATSRTSLEKRYSFIQSEERRFVESRPEWVGGLMDFWDNIS 337
                    D E          +  ++  +     S   R V ++PEW GGL D  D+ +
Sbjct: 255 KIVLIPASTDDEENLNSQVDEANAIAVTAQ---CDSNRNRAVVAKPEWAGGLFDVGDDPT 311

Query: 338 LAYLSIFCSCCVFGWNMQRLGFGNMYVHIATFMLFCLAPFFIFNLAAVNINNENLREALG 397
           +A LS+ C+ CVFGWNM+RLG GNMYVH+ TF L C AP  +F +AA+N++++ LR  +G
Sbjct: 312 VAALSLSCTFCVFGWNMERLGLGNMYVHVFTFALLCAAPVLVFAVAALNVHDDTLRFVVG 371

Query: 398 LTGLALCFFGLLYGGFWRIQMRKRFNLPAN--NFCCRSAEATDCFQWLCCSSCSLAQEVR 455
             G  L   GL YGGFWR QMR+RF LPA+  + C   A A D  +WLCC+ C+LAQEVR
Sbjct: 372 AAGALLSVLGLTYGGFWRAQMRRRFGLPAHRWSMCGGRATAADYGKWLCCAPCALAQEVR 431

Query: 456 TADYYDIAEDRSYTE--QITARSQHVMTPLSREDGLPLFRSNPGSPYR 501
           TA+ YD+ ED  Y +  +     +  M PL RE G  +    P  P R
Sbjct: 432 TANLYDVEEDVLYAKGGEEEEEEEAAMAPLERE-GCIVAVDAPPLPMR 478
>Os11g0109700 Protein of unknown function Cys-rich family protein
          Length = 553

 Score =  275 bits (703), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 160/405 (39%), Positives = 219/405 (54%), Gaps = 31/405 (7%)

Query: 95  EHFQFIRKINWGHLWVMCKDWIKEPLNMALFAWIACVTVSGAILFLVMTGMLNRALPSKS 154
           +H      INW  +    KDWI  P+N+A+  W+ CV VSGA+L L++ G+L+ A P+ +
Sbjct: 61  DHLSAGIAINWSSVRSATKDWITNPMNIAMLLWLLCVAVSGAMLVLLLLGLLDGAFPTPA 120

Query: 155 QRDAWFEVNNQILNALFTLMCLYQHPKRIYYFVLLCRWEQKDVLVLRKTYCKNGT-YKPN 213
            R+ W E+NNQ+LNALFTLM LYQHP   ++  LLCRW   D   LR  Y K+G   +  
Sbjct: 121 ARNHWIEINNQVLNALFTLMSLYQHPVLCHHLFLLCRWRPADAADLRAAYFKDGAGPRHG 180

Query: 214 EWMHMMXXXXXXXXXCFAQYALCGLNLGYRRSERPPIGVGLTISVAIGAAAFAGLYNIIS 273
           E  HM             QY LCGL  GY +  RP +       + + A   A +Y + S
Sbjct: 181 ERAHMAVVVALLHLTVACQYVLCGLYWGYTKKTRPELVENGFFVLGVVAPVVAVVYTVCS 240

Query: 274 PLGKDYDTELT---EVDQEAQTELTRPATSRTSLEKRYSFIQSEERRFVESRPEWVGGLM 330
           PLGKD   EL      D  +Q + T  A                        PEW GG+ 
Sbjct: 241 PLGKDNYGELACPNAFDSVSQHKCTGHAVVE---------------------PEWAGGMF 279

Query: 331 DFWDNISLAYLSIFCSCCVFGWNMQRLGFGNMYVHIATFMLFCLAPFFIFNLAAVNINNE 390
           D   + +  +LS+ C+ C FGWNM+RLGFG+M+VH ATF+L C AP ++  ++A++I++ 
Sbjct: 280 DCGGDATAWWLSLSCTFCAFGWNMERLGFGSMFVHTATFVLLCFAPLWVMGVSALHIHDV 339

Query: 391 NLREALGLTGLALCFFGLLYGGFWRIQMRKRFNLPANNFCCRSAEATDCFQWLCCSSCSL 450
            + + +G  G  LC  GLLYGG+WRIQMR+RF LPA+  CC S   TD  +WL C  C+L
Sbjct: 340 VIGDMVGGAGALLCVCGLLYGGYWRIQMRERFGLPASTACCGSPSVTDYARWLFCWPCAL 399

Query: 451 AQEVRTADYYDIAEDRSYTEQITARSQHVMTPLSREDGLPLFRSN 495
           AQEVRT   Y I  +  Y      +   V+  +  E  LPL  S+
Sbjct: 400 AQEVRTESLYHIDCETFY------KKLPVVDDVEDEKRLPLLASH 438
>Os05g0341900 
          Length = 521

 Score =  234 bits (598), Expect = 9e-62,   Method: Compositional matrix adjust.
 Identities = 160/417 (38%), Positives = 218/417 (52%), Gaps = 40/417 (9%)

Query: 98  QFIRKINWGHLWVMCKDWIKEPLNMALFAWIACVTVSGAILFLVMTGMLNRALPSKSQRD 157
           + +R+++   +   C  W++ P ++AL AW  CV  SGA+L L++ G L+ A P KS R+
Sbjct: 54  RLLRRLSPASVARACGRWLRHPAHLALLAWALCVAASGAMLALLLLGALDGAFPRKSARN 113

Query: 158 AWFEVNNQILNALFTLMCLYQHPKRIYYFVLLCRWEQKDVLVLRKTYCKNGTYKPN---- 213
            W EVNNQ+LNALFTLM +YQHP   ++  +L RW   DV  LRK Y +           
Sbjct: 114 RWIEVNNQVLNALFTLMSIYQHPALFHHAAMLLRWRPDDVKALRKAYRRRRKAAAAGDGA 173

Query: 214 ---EWMHMMXXXXXXXXXCFAQYALCGLNLGYRRSERPPIGVGLTISVAIGAA--AFAGL 268
              E +HM          CFAQYA+CGL  GY R  RP      T    IGAA  A AGL
Sbjct: 174 GGWERLHMSVVVALLHVACFAQYAMCGLYWGYSRKARP--DAAETSLAVIGAATPALAGL 231

Query: 269 YNIISPLGKDYDTELTEVDQEAQTELTRPATSRTSLEKRYSFIQSEERRFVESRPEWVGG 328
           Y    PLG+                    ATS    E+      +          EW GG
Sbjct: 232 YAYFGPLGRRKPGT---------------ATSARHQEEPDDLELAAAAAADVVVAEWAGG 276

Query: 329 LMDFWDNISLAYLSIFCSCCVFGWNMQRLGFGNMYVHIATFMLFCLAPFFIFNLAAVNIN 388
           L+D  D+ +  +LS  C+ CVFGWNM+R+G GN +VH  TF L C AP ++ N+AA+NI 
Sbjct: 277 LLDVGDDPTAWWLSCLCTFCVFGWNMERMGLGNKHVHAVTFALLCFAPLWVLNVAAMNIR 336

Query: 389 NENLREALGLTGLALCFFGLLYGGFWRIQMRKRFNL-----PANNFCCRSAEA-TDCFQW 442
           +E + +A+G   +ALC  GLLYGG+WR +MR+RF L          CC S  +  D  +W
Sbjct: 337 DEAVGDAVGAVAVALCALGLLYGGYWRARMRRRFGLLPGRHGGGGACCGSPSSLADYLRW 396

Query: 443 LCCSSCSLAQEVRTADYYDIAEDRSYTEQITARSQH--------VMTPLSREDGLPL 491
           + C SC+LAQEVRTA+   +  D +      + S          ++ PL RE+G+ L
Sbjct: 397 MFCWSCALAQEVRTANVLLLDADEAGGAGGGSSSSGGGGRGDATLLQPLPRENGVKL 453
>Os12g0109700 Protein of unknown function Cys-rich family protein
          Length = 219

 Score = 85.9 bits (211), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 40/71 (56%), Positives = 47/71 (66%)

Query: 400 GLALCFFGLLYGGFWRIQMRKRFNLPANNFCCRSAEATDCFQWLCCSSCSLAQEVRTADY 459
           G  LC  GLLYGG+WRIQMR+RF LPA+  CC S   TD  +WL C  C+LAQEVRTA  
Sbjct: 18  GALLCVCGLLYGGYWRIQMRERFGLPASAACCGSPSVTDYARWLFCWPCALAQEVRTASL 77

Query: 460 YDIAEDRSYTE 470
           Y I  +  Y +
Sbjct: 78  YHIDGETFYKK 88
  Database: rap3
    Posted date:  Nov 19, 2010  6:03 PM
  Number of letters in database: 17,035,801
  Number of sequences in database:  52,214
  
Lambda     K      H
   0.324    0.137    0.441 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 52214
Number of Hits to DB: 18,697,471
Number of extensions: 748045
Number of successful extensions: 2111
Number of sequences better than 1.0e-10: 7
Number of HSP's gapped: 2098
Number of HSP's successfully gapped: 7
Length of query: 554
Length of database: 17,035,801
Length adjustment: 106
Effective length of query: 448
Effective length of database: 11,501,117
Effective search space: 5152500416
Effective search space used: 5152500416
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 159 (65.9 bits)