BLASTP 2.2.23 [Feb-03-2010]


Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment:
Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala,
Aleksandr Morgulis, Alejandro A. Schäffer, and Yi-Kuo Yu (2005) "Protein database
searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Os01g0265800 Os01g0265800|AK099896
         (524 letters)

Database: rap3 
           52,214 sequences; 17,035,801 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

Os01g0265800  Similar to Sulfated surface glycoprotein 185 p...   640   0.0  
Os01g0614500  RNA-binding region RNP-1 (RNA recognition moti...   173   3e-43
Os02g0602600  RNA-binding region RNP-1 (RNA recognition moti...   134   2e-31
Os11g0603300                                                      124   1e-28
Os02g0755400  Similar to RNA recognition motif-containing pr...    89   1e-17
Os06g0622900  Similar to RNA-binding region containing prote...    80   4e-15
Os02g0714000  Similar to Yarrowia lipolytica chromosome C of...    76   7e-14
Os03g0286500  Similar to RNA-binding region containing prote...    75   1e-13
Os01g0958500  Similar to RNA binding protein-like                  69   1e-11
Os03g0245900                                                       68   2e-11
>Os01g0265800 Similar to Sulfated surface glycoprotein 185 precursor (SSG 185)
          Length = 524

 Score =  640 bits (1651), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 343/478 (71%), Positives = 343/478 (71%)

Query: 1   EPQTPHHTAPLSPPLHSRRNPSPKVPPRLALAVAMGKKRKLDSKSPXXXXXXXXXXXXXX 60
           EPQTPHHTAPLSPPLHSRRNPSPKVPPRLALAVAMGKKRKLDSKSP              
Sbjct: 1   EPQTPHHTAPLSPPLHSRRNPSPKVPPRLALAVAMGKKRKLDSKSPAAARSAAARAAAAA 60

Query: 61  XXXXXXXXXXEPSSQPEALAEDPAPSSQPLGLSSEGAGERMMSRXXXXXXXXXXXXXXXX 120
                     EPSSQPEALAEDPAPSSQPLGLSSEGAGERMMSR                
Sbjct: 61  AAAAAAAAVAEPSSQPEALAEDPAPSSQPLGLSSEGAGERMMSREAGGGEEEEVEEVEVE 120

Query: 121 XXXXXXXXXXXXXXXXXXXXXXXRDADSIQALLNSFPKDQLVELLSAAALSHEDVLTAVH 180
                                  RDADSIQALLNSFPKDQLVELLSAAALSHEDVLTAVH
Sbjct: 121 EEVEVDEDEDGEGEGEEEEEAAERDADSIQALLNSFPKDQLVELLSAAALSHEDVLTAVH 180

Query: 181 RAADADPALRKIFVHGLGWDATAETLTEAFSAYGEIEDLRVVTDRATGKCKGYGFILFSR 240
           RAADADPALRKIFVHGLGWDATAETLTEAFSAYGEIEDLRVVTDRATGKCKGYGFILFSR
Sbjct: 181 RAADADPALRKIFVHGLGWDATAETLTEAFSAYGEIEDLRVVTDRATGKCKGYGFILFSR 240

Query: 241 RSGARAALREPQKKIGNRTTACQLASVGPVPPGGMATNXXXXXXXXXXXXXXXXXSEYTQ 300
           RSGARAALREPQKKIGNRTTACQLASVGPVPPGGMATN                 SEYTQ
Sbjct: 241 RSGARAALREPQKKIGNRTTACQLASVGPVPPGGMATNPAPAVAPAPAQLALPPVSEYTQ 300

Query: 301 RKIFVSNVGADIDPQKLLQFFSKYGEIEEGPLGLDKVTGKPKGFALFVYKTLDSAKKALQ 360
           RKIFVSNVGADIDPQKLLQFFSKYGEIEEGPLGLDKVTGKPKGFALFVYKTLDSAKKALQ
Sbjct: 301 RKIFVSNVGADIDPQKLLQFFSKYGEIEEGPLGLDKVTGKPKGFALFVYKTLDSAKKALQ 360

Query: 361 EPHKQFEGVVLHCQKAIDGPKPNKXXXXXXXXXXXXXXXXXXXXXXXXHSHSLPGAAVGG 420
           EPHKQFEGVVLHCQKAIDGPKPNK                        HSHSLPGAAVGG
Sbjct: 361 EPHKQFEGVVLHCQKAIDGPKPNKGGGLGGLYGAGTSGGRKGAGGYGAHSHSLPGAAVGG 420

Query: 421 HVMPSPVSSLTSXXXXXXXXXXXXXXXQALTAILASQXXXXXXXXXXXXXXXXSGLPN 478
           HVMPSPVSSLTS               QALTAILASQ                SGLPN
Sbjct: 421 HVMPSPVSSLTSLPGVAGGPGVNPALGQALTAILASQGGGLGLNNILGVGANGSGLPN 478
>Os01g0614500 RNA-binding region RNP-1 (RNA recognition motif) domain containing
           protein
          Length = 447

 Score =  173 bits (439), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 101/232 (43%), Positives = 138/232 (59%), Gaps = 10/232 (4%)

Query: 147 DSIQALLNSFPKDQLVELLSAAALSHEDVLTAVHRAADADPALRKIFVHGLGWDATAETL 206
           D +  L+    +DQL ++ + AAL+    L AV  AAD DPALRK+FV GLGW+  +++L
Sbjct: 33  DDLLRLVEPLSRDQLADIAATAALASGVALDAVRAAADRDPALRKLFVRGLGWETNSDSL 92

Query: 207 TEAFSAYGEIEDLRVVTDRATGKCKGYGFILFSRRSGARAALREPQKKIGNRTTACQLAS 266
              FSA+G++E+  V+TD++TG+ KGYGF+ F     A  AL+EP KKI  R T  QLA 
Sbjct: 93  RAIFSAFGDLEEAVVITDKSTGRSKGYGFVTFRHADSAVLALKEPSKKIDGRMTVTQLA- 151

Query: 267 VGPVPPGGMATNXXXXXXXXXXXXXXXXXSEYTQRKIFVSNVGADIDPQKLLQFFSKYGE 326
                    A                   ++ + RKIFV NV AD+  ++LL  F+ YGE
Sbjct: 152 ---------AAGAAGGASGGAAGAGGAPAADVSLRKIFVGNVPADMPSERLLAHFAAYGE 202

Query: 327 IEEGPLGLDKVTGKPKGFALFVYKTLDSAKKALQEPHKQFEGVVLHCQKAID 378
           IEEGPLG DK TGK +GFALFVYKT + A+ +L +  K  +G  L C+ AI+
Sbjct: 203 IEEGPLGFDKQTGKFRGFALFVYKTPEGAQASLVDSVKVIDGHQLVCKLAIE 254
>Os02g0602600 RNA-binding region RNP-1 (RNA recognition motif) domain containing
           protein
          Length = 415

 Score =  134 bits (336), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 86/224 (38%), Positives = 125/224 (55%), Gaps = 30/224 (13%)

Query: 156 FPKDQLVELLSAAALSHEDVLTAVHRAADADPALRKIFVHGLGWDATAETLTEAFSAYGE 215
           F +++L++LL  A L +  + + +   A++D A R++FVHGL    TA  +  AF+ +G 
Sbjct: 70  FTRNELLDLLVEACLRNPALRSRLAATAESDAAHRRLFVHGLSPGVTAAAMAAAFAPFGA 129

Query: 216 IEDLRVVTDRATGKCKGYGFILFSRRSGARAALREPQKK---IGNRTTACQLASVGPVPP 272
           +++   V DRATG+C+GYGF+ F RRS AR AL         +G R  ACQLAS+GP  P
Sbjct: 130 LDECHAVADRATGRCRGYGFVTFRRRSAARRALAADASSRLAVGGRPVACQLASLGPTSP 189

Query: 273 GGMATNXXXXXXXXXXXXXXXXXSEYTQRKIFVSNVGADIDPQKLLQFFSKYGEIEEGPL 332
                                       RK+FV NV A     +L + FS++GEIE GPL
Sbjct: 190 ---------------------------DRKLFVDNVPARAAHDELRRLFSRFGEIEAGPL 222

Query: 333 GLDKVTGKPKGFALFVYKTLDSAKKALQEPHKQFEGVVLHCQKA 376
           G D+ TG+ +G+A+F YK  +   KAL+E    F+G  LHC++A
Sbjct: 223 GADRATGQFRGYAIFFYKYPEGLTKALEERKVVFDGCELHCRRA 266
>Os11g0603300 
          Length = 415

 Score =  124 bits (312), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 78/204 (38%), Positives = 109/204 (53%), Gaps = 20/204 (9%)

Query: 175 VLTAVHRAADADPALRKIFVHGLGWDATAETLTEAFSAYGEIEDLRVVTDRATGKCKGYG 234
           +L  +  AADA P+ R++FVHGL   A A  L  AFS +G + +  VV  RATG CKG+G
Sbjct: 73  LLARIRAAADASPSHRRLFVHGLPPHADAPALAAAFSRFGPLAECDVVARRATGACKGFG 132

Query: 235 FILFSRRSGARAALREPQKKIGNRTTACQLASVGPVPPGGMATNXXXXXXXXXXXXXXXX 294
           F+ F  R+ AR ALRE    +G                GG+A                  
Sbjct: 133 FVTFQSRAAARRALRE----VGR---------------GGVAV-AGRAVSAQYATAGAAA 172

Query: 295 XSEYTQRKIFVSNVGADIDPQKLLQFFSKYGEIEEGPLGLDKVTGKPKGFALFVYKTLDS 354
            +    R+++V+NV      ++L  FF+ +GE+E GP G D  TG  +G ALFVY+  + 
Sbjct: 173 AASAAGRRVYVTNVAPGASAERLRAFFAGFGELEGGPFGFDADTGSSRGCALFVYRAAED 232

Query: 355 AKKALQEPHKQFEGVVLHCQKAID 378
           A++AL+EP++ FEG  LHCQ A D
Sbjct: 233 ARRALEEPYRVFEGRTLHCQLAAD 256
>Os02g0755400 Similar to RNA recognition motif-containing protein SEB-4
          Length = 176

 Score = 88.6 bits (218), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 40/90 (44%), Positives = 52/90 (57%)

Query: 180 HRAADADPALRKIFVHGLGWDATAETLTEAFSAYGEIEDLRVVTDRATGKCKGYGFILFS 239
           HR    D  L K+FV GL W+  ++ L + F  YGEI +  V+TDR T + KGYGF+ F 
Sbjct: 36  HRTRFGDTTLTKVFVGGLAWETPSKGLQDHFQQYGEILEAVVITDRETSRSKGYGFVTFR 95

Query: 240 RRSGARAALREPQKKIGNRTTACQLASVGP 269
               AR A+R P   IG R   C +AS+GP
Sbjct: 96  EPESAREAVRNPNPTIGGRRANCNIASMGP 125
>Os06g0622900 Similar to RNA-binding region containing protein 1 (HSRNASEB)
           (ssDNA binding protein SEB4) (CLL-associated antigen
           KW-5). Splice isoform 2
          Length = 275

 Score = 80.1 bits (196), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 38/84 (45%), Positives = 47/84 (55%)

Query: 185 ADPALRKIFVHGLGWDATAETLTEAFSAYGEIEDLRVVTDRATGKCKGYGFILFSRRSGA 244
            D    K+FV GL W+  +E L   F AYGEI +  V+TDRATG+ KGYGF+ F     A
Sbjct: 28  GDTTYTKVFVGGLAWETRSEGLRAHFEAYGEILEAVVITDRATGRSKGYGFVTFRDPDSA 87

Query: 245 RAALREPQKKIGNRTTACQLASVG 268
           R A  +P   I  R   C LA +G
Sbjct: 88  RMACMDPYPVIDGRRANCNLAILG 111
>Os02g0714000 Similar to Yarrowia lipolytica chromosome C of strain CLIB99 of
           Yarrowia lipolytica
          Length = 287

 Score = 75.9 bits (185), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 35/84 (41%), Positives = 47/84 (55%)

Query: 185 ADPALRKIFVHGLGWDATAETLTEAFSAYGEIEDLRVVTDRATGKCKGYGFILFSRRSGA 244
            D    K+FV GL W+ T+E L   +  +GEI +  V+TDR +G+ KGYGF+ F     A
Sbjct: 32  GDTTYTKVFVGGLAWETTSERLRRFYDRFGEILEAVVITDRHSGRSKGYGFVTFRDPESA 91

Query: 245 RAALREPQKKIGNRTTACQLASVG 268
           R A  +P   I  R   C LAS+G
Sbjct: 92  RKACEDPTPVIDGRRANCNLASLG 115
>Os03g0286500 Similar to RNA-binding region containing protein 1 (HSRNASEB)
           (ssDNA binding protein SEB4) (CLL-associated antigen
           KW-5). Splice isoform 2
          Length = 310

 Score = 75.1 bits (183), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 35/90 (38%), Positives = 50/90 (55%)

Query: 182 AADADPALRKIFVHGLGWDATAETLTEAFSAYGEIEDLRVVTDRATGKCKGYGFILFSRR 241
           AA  D  L K+FV GL W+   +TL E F  +G+I +  +++D+ TG+ KGYGF+ F   
Sbjct: 25  AAFGDTTLTKVFVGGLAWETHKDTLREHFERFGDILEAVIISDKLTGRSKGYGFVTFKEA 84

Query: 242 SGARAALREPQKKIGNRTTACQLASVGPVP 271
             A+ A  +    I  R   C LAS+G  P
Sbjct: 85  DAAKKACEDATPVINGRRANCNLASLGAKP 114
>Os01g0958500 Similar to RNA binding protein-like
          Length = 310

 Score = 68.6 bits (166), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 32/83 (38%), Positives = 44/83 (53%)

Query: 186 DPALRKIFVHGLGWDATAETLTEAFSAYGEIEDLRVVTDRATGKCKGYGFILFSRRSGAR 245
           D  L K+FV GL W+   E +   F  +G+I +  V+TD+ TG+ KGYGF+ F     A 
Sbjct: 17  DTTLTKVFVGGLAWETQKEGMRGYFEQFGDILEAVVITDKNTGRSKGYGFVTFREPEAAM 76

Query: 246 AALREPQKKIGNRTTACQLASVG 268
            A  +P   I  R   C LA +G
Sbjct: 77  KACFDPYPVIDGRRANCNLAYLG 99
>Os03g0245900 
          Length = 101

 Score = 67.8 bits (164), Expect = 2e-11,   Method: Composition-based stats.
 Identities = 33/85 (38%), Positives = 46/85 (54%)

Query: 184 DADPALRKIFVHGLGWDATAETLTEAFSAYGEIEDLRVVTDRATGKCKGYGFILFSRRSG 243
           D D    K+FV GL W+   + +   F  +GEI +  V+ D+ TG+ KGYGF+ F    G
Sbjct: 2   DGDTTFTKLFVGGLPWETRGDAVRRHFEQFGEIVEAVVIADKHTGRSKGYGFVTFRDPDG 61

Query: 244 ARAALREPQKKIGNRTTACQLASVG 268
           A  AL++P   I  R   C LA+ G
Sbjct: 62  AARALQDPTPVIDGRRANCNLAAFG 86
  Database: rap3
    Posted date:  Nov 19, 2010  6:03 PM
  Number of letters in database: 17,035,801
  Number of sequences in database:  52,214
  
Lambda     K      H
   0.314    0.131    0.379 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 52214
Number of Hits to DB: 12,603,087
Number of extensions: 429682
Number of successful extensions: 1905
Number of sequences better than 1.0e-10: 10
Number of HSP's gapped: 1905
Number of HSP's successfully gapped: 11
Length of query: 524
Length of database: 17,035,801
Length adjustment: 105
Effective length of query: 419
Effective length of database: 11,553,331
Effective search space: 4840845689
Effective search space used: 4840845689
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 158 (65.5 bits)
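
Note on the statistics above: the effective lengths, the effective search space, and the bit scores in this report are tied together by simple arithmetic. The Python sketch below reproduces that arithmetic from the printed footer values. It assumes the standard Karlin-Altschul relations (bit score = (Lambda * raw - ln K) / ln 2, and E = search space * 2^(-bit score)) and uses the gapped Lambda and K as printed, which are rounded. The function names are illustrative and are not part of BLAST. Because this search used compositional matrix adjustment, the E-value estimate should be read as approximate.

```python
import math

# Values copied from the report footer above (printed, hence rounded).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
QUERY_LENGTH = 524
DB_LETTERS = 17_035_801
DB_SEQUENCES = 52_214
LENGTH_ADJUSTMENT = 105


def effective_search_space():
    """Effective query length times effective database length."""
    eff_query = QUERY_LENGTH - LENGTH_ADJUSTMENT
    # The length adjustment is subtracted once per database sequence.
    eff_db = DB_LETTERS - DB_SEQUENCES * LENGTH_ADJUSTMENT
    return eff_query * eff_db


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits (Karlin-Altschul relation)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score):
    """Approximate E-value for a raw score in this search space."""
    return effective_search_space() * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    print("effective search space:", effective_search_space())
    for raw in (1651, 439, 218):
        print(raw, round(bit_score(raw), 1), f"{expect_value(raw):.1e}")
```

With these inputs the effective search space comes out to 4840845689, as printed in the report, and a raw score of 218 converts to about 88.6 bits, which agrees with the Os02g0755400 hit. Scores of 100 bits or more are printed in the report without decimals, so a computed value such as 640.6 appears as 640.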