BLASTP 2.2.23 [Feb-03-2010]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment:
Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala,
Aleksandr Morgulis, Alejandro A. Schäffer, and Yi-Kuo Yu (2005) "Protein database
searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Os12g0109700 Os12g0109700|AK058609
(219 letters)
Database: rap3
52,214 sequences; 17,035,801 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Os12g0109700 Protein of unknown function Cys-rich family pr... 316 1e-86
Os11g0109700 Protein of unknown function Cys-rich family pr... 279 1e-75
Os01g0825900 Protein of unknown function Cys-rich family pr... 96 2e-20
Os11g0109600 Protein of unknown function Cys-rich family pr... 74 6e-14
Os05g0341900 71 8e-13
Os05g0474900 Protein of unknown function Cys-rich family pr... 67 1e-11
>Os12g0109700 Protein of unknown function Cys-rich family protein
Length = 219
Score = 316 bits (809), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 159/188 (84%), Positives = 159/188 (84%)
Query: 32 WRIQMRERFGLPASAACCGSPSVTDYARWLFCWPCALAQEVRTASLYHIDGETFYKKLPV 91
WRIQMRERFGLPASAACCGSPSVTDYARWLFCWPCALAQEVRTASLYHIDGETFYKKLPV
Sbjct: 32 WRIQMRERFGLPASAACCGSPSVTDYARWLFCWPCALAQEVRTASLYHIDGETFYKKLPV 91
Query: 92 VDAEKRQPLLLASHHVQFHEPPDTMIMATXXXXXXXXXXXXXXMXXXXXXXXXXXXXXXG 151
VDAEKRQPLLLASHHVQFHEPPDTMIMAT M G
Sbjct: 92 VDAEKRQPLLLASHHVQFHEPPDTMIMATSEESSDHVVVVHEEMVPPAVQVVFEQVVVEG 151
Query: 152 DKSEEECSAVHDEKIMGLPLPESVIVVDAEIPASLSDGSWTVEKVKRLMNVVTLVSLLIL 211
DKSEEECSAVHDEKIMGLPLPESVIVVDAEIPASLSDGSWTVEKVKRLMNVVTLVSLLIL
Sbjct: 152 DKSEEECSAVHDEKIMGLPLPESVIVVDAEIPASLSDGSWTVEKVKRLMNVVTLVSLLIL 211
Query: 212 LYTRGFIR 219
LYTRGFIR
Sbjct: 212 LYTRGFIR 219
>Os11g0109700 Protein of unknown function Cys-rich family protein
Length = 553
Score = 279 bits (714), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 147/192 (76%), Positives = 150/192 (78%), Gaps = 5/192 (2%)
Query: 32 WRIQMRERFGLPASAACCGSPSVTDYARWLFCWPCALAQEVRTASLYHIDGETFYKKLPV 91
WRIQMRERFGLPAS ACCGSPSVTDYARWLFCWPCALAQEVRT SLYHID ETFYKKLPV
Sbjct: 363 WRIQMRERFGLPASTACCGSPSVTDYARWLFCWPCALAQEVRTESLYHIDCETFYKKLPV 422
Query: 92 VD---AEKRQPLLLASHHVQFHEPPDTMIMATXXXXXXXXXXXXXXMXXXXXXXXXXXXX 148
VD EKR P LLASHHVQFHEPPDTMIMA M
Sbjct: 423 VDDVEDEKRLP-LLASHHVQFHEPPDTMIMAASEGSNDHVVIVHEEMVPPAVQVVVEQVV 481
Query: 149 XXGDKSEEECSAVHDEKIMGLPLPESVIVV-DAEIPASLSDGSWTVEKVKRLMNVVTLVS 207
GDKSEEECSAVHDEKIMG PLPESV++V D EIPASLSDGSWTVEKVKRL+NVVTLVS
Sbjct: 482 VEGDKSEEECSAVHDEKIMGSPLPESVVIVDDDEIPASLSDGSWTVEKVKRLINVVTLVS 541
Query: 208 LLILLYTRGFIR 219
LLILLYTRGFIR
Sbjct: 542 LLILLYTRGFIR 553
>Os01g0825900 Protein of unknown function Cys-rich family protein
Length = 525
Score = 95.5 bits (236), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 41/59 (69%), Positives = 48/59 (81%), Gaps = 2/59 (3%)
Query: 32 WRIQMRERFGLPASAACCGSPSVTDYARWLFCWPCALAQEVRTASLYHID--GETFYKK 88
WR+QMR+RF LP S CCGS S+TDYARWLFCWPCALAQEVRT +LY ++ G FY+K
Sbjct: 392 WRVQMRKRFALPGSRWCCGSASLTDYARWLFCWPCALAQEVRTGNLYDVEDGGGVFYEK 450
>Os11g0109600 Protein of unknown function Cys-rich family protein
Length = 1124
Score = 74.3 bits (181), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 33/59 (55%), Positives = 40/59 (67%), Gaps = 2/59 (3%)
Query: 32 WRIQMRERFGLPAS--AACCGSPSVTDYARWLFCWPCALAQEVRTASLYHIDGETFYKK 88
WR QMR RFGLPA + C G + DY +WL C PCALAQEVRTA+LY ++ + Y K
Sbjct: 388 WRAQMRRRFGLPAHRWSMCGGRATAADYGKWLCCAPCALAQEVRTANLYDVEEDVLYAK 446
>Os05g0341900
Length = 521
Score = 70.9 bits (172), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 33/58 (56%), Positives = 40/58 (68%), Gaps = 6/58 (10%)
Query: 32 WRIQMRERFGL-----PASAACCGSPS-VTDYARWLFCWPCALAQEVRTASLYHIDGE 83
WR +MR RFGL ACCGSPS + DY RW+FCW CALAQEVRTA++ +D +
Sbjct: 362 WRARMRRRFGLLPGRHGGGGACCGSPSSLADYLRWMFCWSCALAQEVRTANVLLLDAD 419
>Os05g0474900 Protein of unknown function Cys-rich family protein
Length = 554
Score = 66.6 bits (161), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 31/57 (54%), Positives = 37/57 (64%)
Query: 32 WRIQMRERFGLPASAACCGSPSVTDYARWLFCWPCALAQEVRTASLYHIDGETFYKK 88
WRIQMR+RF LPA+ CC S TD +WL C C+LAQEVRTA Y I + Y +
Sbjct: 414 WRIQMRKRFNLPANNFCCRSAEATDCFQWLCCSSCSLAQEVRTADYYDIAEDRSYTE 470
Database: rap3
Posted date: Nov 19, 2010 6:03 PM
Number of letters in database: 17,035,801
Number of sequences in database: 52,214
Lambda K H
0.322 0.135 0.426
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 52214
Number of Hits to DB: 5,676,389
Number of extensions: 189032
Number of successful extensions: 560
Number of sequences better than 1.0e-10: 7
Number of HSP's gapped: 552
Number of HSP's successfully gapped: 7
Length of query: 219
Length of database: 17,035,801
Length adjustment: 97
Effective length of query: 122
Effective length of database: 11,971,043
Effective search space: 1460467246
Effective search space used: 1460467246
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 154 (63.9 bits)