BLASTP 2.2.23 [Feb-03-2010]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment:
Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala,
Aleksandr Morgulis, Alejandro A. Schäffer, and Yi-Kuo Yu (2005) "Protein database
searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Os03g0627500 Os03g0627500|AK067707
(512 letters)
Database: rap3
52,214 sequences; 17,035,801 total letters
Searching..................................................done
                                                   Score  E
Sequences producing significant alignments:       (bits)  Value
Os03g0627500 KH domain containing protein            818  0.0
Os12g0597600 KH domain containing protein            368  e-102
Os10g0414700 KH domain containing protein            182  4e-46
Os10g0415108                                         168  7e-42
Os02g0125500                                         162  4e-40
Os03g0376800 KH domain containing protein            109  5e-24
Os10g0495000 KH domain containing protein            100  2e-21
Os09g0498600 KH domain containing protein             87  3e-17
Os10g0564000 KH domain containing protein             85  1e-16
Os08g0200400 KH domain containing protein             83  4e-16
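The E value column uses the legacy shorthand e-102 for 1e-102, which most numeric parsers reject. A minimal Python sketch for reading the column, assuming the values are copied as strings from the hit list in this report; parse_evalue is a hypothetical helper name, not part of BLAST:

```python
def parse_evalue(text: str) -> float:
    """Convert a legacy BLAST E-value string to a float."""
    text = text.strip()
    # Legacy output drops the leading mantissa when it is 1 (e-102 means 1e-102).
    if text.startswith("e"):
        text = "1" + text
    return float(text)


# E values copied from the hit list in this report, in listed order.
reported = ["0.0", "e-102", "4e-46", "7e-42", "4e-40",
            "5e-24", "2e-21", "3e-17", "1e-16", "4e-16"]
evalues = [parse_evalue(v) for v in reported]

# Hits are listed best first, so the parsed values should be non-decreasing.
is_sorted = all(a <= b for a, b in zip(evalues, evalues[1:]))
print(evalues[1], is_sorted)
```

Prepending the mantissa before calling float() keeps the helper tolerant of both spellings, so it should also accept output from tools that already print 1e-102.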
>Os03g0627500 KH domain containing protein
Length = 512
Score = 818 bits (2112), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 410/451 (90%), Positives = 410/451 (90%)
Query: 1 ANMDGLVENFDADDLGEMPQNHYNEEQLIPYSDVSHPYNEEPDNMDNVEEGNPYIQQVSL 60
ANMDGLVENFDADDLGEMPQNHYNEEQLIPYSDVSHPYNEEPDNMDNVEEGNPYIQQVSL
Sbjct: 1 ANMDGLVENFDADDLGEMPQNHYNEEQLIPYSDVSHPYNEEPDNMDNVEEGNPYIQQVSL 60
Query: 61 YSEEPENQYNEEPSNPYQEESDNAYNGEVKQQDSLPVEADKKWPGWPGESVFRILIPAQK 120
YSEEPENQYNEEPSNPYQEESDNAYNGEVKQQDSLPVEADKKWPGWPGESVFRILIPAQK
Sbjct: 61 YSEEPENQYNEEPSNPYQEESDNAYNGEVKQQDSLPVEADKKWPGWPGESVFRILIPAQK 120
Query: 121 VGAIIGRKGEFIKKMCEESKARIKILDGPPGVPERTVMISAKDEPDAPISPAMDGLFRVY 180
VGAIIGRKGEFIKKMCEESKARIKILDGPPGVPERTVMISAKDEPDAPISPAMDGLFRVY
Sbjct: 121 VGAIIGRKGEFIKKMCEESKARIKILDGPPGVPERTVMISAKDEPDAPISPAMDGLFRVY 180
Query: 181 KRITDGSDGDSGQPERNISNVGPTRLLVPASQAGSLIGKQGATIKSIQDSSKSIVRIVET 240
KRITDGSDGDSGQPERNISNVGPTRLLVPASQAGSLIGKQGATIKSIQDSSKSIVRIVET
Sbjct: 181 KRITDGSDGDSGQPERNISNVGPTRLLVPASQAGSLIGKQGATIKSIQDSSKSIVRIVET 240
Query: 241 LPLVALNDDRVVEIQGEPVGVQKALESIASHLRKFLVDRSVLPLFEGQMKMHNAQREQAM 300
LPLVALNDDRVVEIQGEPVGVQKALESIASHLRKFLVDRSVLPLFEGQMKMHNAQREQAM
Sbjct: 241 LPLVALNDDRVVEIQGEPVGVQKALESIASHLRKFLVDRSVLPLFEGQMKMHNAQREQAM 300
Query: 301 AAXXXXXXXXXXXXXXXXXXXXXXXXXXXXQFMXXXXXXXXXXXXXVPSMEKQPHYGISA 360
AA                            QFM             VPSMEKQPHYGISA
Sbjct: 301 AAPQPWGPPQPWGPPPSHLPPGGPGYGGHPQFMPPRPQDNYYPPPDVPSMEKQPHYGISA 360
Query: 361 YGREAPTGVSASGNQPPSHVASQVTHNMQIPLSYADAVIGAAGASISYIRRHSGATVTIQ 420
YGREAPTGVSASGNQPPSHVASQVTHNMQIPLSYADAVIGAAGASISYIRRHSGATVTIQ
Sbjct: 361 YGREAPTGVSASGNQPPSHVASQVTHNMQIPLSYADAVIGAAGASISYIRRHSGATVTIQ 420
Query: 421 ESRGAPGEMTVEIIGSASQVQTAQQLVQNFM 451
ESRGAPGEMTVEIIGSASQVQTAQQLVQNFM
Sbjct: 421 ESRGAPGEMTVEIIGSASQVQTAQQLVQNFM 451
>Os12g0597600 KH domain containing protein
Length = 367
Score = 368 bits (945), Expect = e-102, Method: Compositional matrix adjust.
Identities = 200/297 (67%), Positives = 226/297 (76%), Gaps = 5/297 (1%)
Query: 157 VMISAKDEPDAPISPAMDGLFRVYKRITDGSDGDSGQPERNISNVGPTRLLVPASQAGSL 216
VMISAKDEPDAP+ PA+DGL RV+KRITDG DG+S QP+R VGPTRLLVPASQAGSL
Sbjct: 12 VMISAKDEPDAPLPPAVDGLLRVHKRITDGLDGESDQPQRAAGTVGPTRLLVPASQAGSL 71
Query: 217 IGKQGATIKSIQDSSKSIVRIVETLPLVALNDDRVVEIQGEPVGVQKALESIASHLRKFL 276
IGKQGATIKSIQD+SK ++RI+E++P VAL+DDRVVEIQGEP+ V KA+E IASHLRKFL
Sbjct: 72 IGKQGATIKSIQDASKCVLRILESVPPVALSDDRVVEIQGEPLDVHKAVELIASHLRKFL 131
Query: 277 VDRSVLPLFEGQMKMHNAQREQAMAAXXXXXXXXXXXXXXXXXXXXXXXXXXXXQFMXXX 336
VDRSVLPLFE QMK+HNA REQ M QFM
Sbjct: 132 VDRSVLPLFEMQMKVHNAHREQPMPP-PQTWGPPPPWGHPSNVPPGGPGYGGNPQFMPPR 190
Query: 337 XXXXXXXXXXVPSMEKQPHYGISAYGREA-PTGV-SASGNQPPSHVASQVTHNMQIPLSY 394
VP +EKQPHYGIS+YGR+A PTG ASGNQ P H +SQ+TH+MQ+PLSY
Sbjct: 191 PQDHYYPPPDVPPVEKQPHYGISSYGRDAPPTGAPPASGNQHPPHGSSQITHSMQVPLSY 250
Query: 395 ADAVIGAAGASISYIRRHSGATVTIQESRGAPGEMTVEIIGSASQVQTAQQLVQNFM 451
ADAVIGAAGASISYIRRHSGAT++IQE G PGEMTVEI GSASQVQTAQQL++NFM
Sbjct: 251 ADAVIGAAGASISYIRRHSGATISIQE--GVPGEMTVEISGSASQVQTAQQLIKNFM 305
>Os10g0414700 KH domain containing protein
Length = 586
Score = 182 bits (463), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 91/189 (48%), Positives = 129/189 (68%), Gaps = 6/189 (3%)
Query: 104 PGWPGESVFRILIPAQKVGAIIGRKGEFIKKMCEESKARIKILDGPPGVPERTVMISAKD 163
PGWPG SVFR+LIPA KVGA+IG GE ++++CEE+KA ++++ G ER V+I AK+
Sbjct: 53 PGWPGTSVFRMLIPATKVGAVIGHSGERLRRLCEETKACVRVIGGHFAAAERAVIIFAKE 112
Query: 164 EPDAPISPAMDGLFRVYKRITDGSDGDSGQPERNISNVGPTRLLVPASQAGSLIGKQGAT 223
+PD P PA+D L RVY+ I + D G R +N+ R+L P+ QA SLIG QG+
Sbjct: 113 QPDEPKPPAIDALLRVYECIIN----DDGLDVR-YNNIVVARILTPSEQAASLIGDQGSV 167
Query: 224 IKSIQDSSKSIVRIVE-TLPLVALNDDRVVEIQGEPVGVQKALESIASHLRKFLVDRSVL 282
I I+ +SK+ + +++ LP VAL DD ++EI G P V +ALE +A HLRK+LV RSV+
Sbjct: 168 INYIKKASKTNIHVIDGDLPPVALEDDMIIEIWGLPARVHQALELVACHLRKYLVHRSVI 227
Query: 283 PLFEGQMKM 291
PLF+ + +
Sbjct: 228 PLFDPHVSI 236
Score = 70.1 bits (170), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 45/105 (42%), Positives = 63/105 (60%), Gaps = 4/105 (3%)
Query: 349 SMEKQPHYGISAYG-REAPTGVSASGNQPPSHVASQVTHNMQIPLSYADAVIGAAGASIS 407
++E H +SA E P V S V SQV MQ+P+ YA+AVIG GA I
Sbjct: 434 NVENLQHCRVSACAPEELPNVVVPSLTSQSPAVTSQVIMKMQVPIFYAEAVIGPTGARID 493
Query: 408 YIRRHSGATVTIQESRGAPGEMTVEIIGS-ASQVQTAQQLVQNFM 451
YIR+ SG++V I++ + M++EI GS A+ VQ A+QL++NFM
Sbjct: 494 YIRQASGSSVVIKDLDDS--AMSIEITGSAATDVQIAEQLIKNFM 536
>Os10g0415108
Length = 649
Score = 168 bits (426), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 91/222 (40%), Positives = 128/222 (57%), Gaps = 39/222 (17%)
Query: 104 PGWPGESVFRILIPAQKVGAIIGRKGEFIKKMCEESKARIKILDGPPGVPERTVMISAKD 163
PGWPG SVFR+LIPA KVGA+IG GE ++++CEE+KA ++++ G ER V+I AK+
Sbjct: 133 PGWPGTSVFRMLIPATKVGAVIGHSGERLRRLCEETKACVRVIGGHFAAAERAVIIFAKE 192
Query: 164 EPDAPISPAMDGLFRVYKRITDGSDGDSGQPERNISNVGPTRLLVPASQAGSLIGKQGAT 223
+PD P PA+D L RVY+ I + D G R +N+ R+L P+ QA SLIG QG+
Sbjct: 193 QPDEPKPPAIDALLRVYECIIN----DDGLDVR-YNNIVVARILTPSEQAASLIGDQGSV 247
Query: 224 IKSIQDSSKSIVRIVET----------------------------------LPLVALNDD 249
I I+ +SK+ + ++ LP VAL DD
Sbjct: 248 INYIKKASKTNIHVIGNFLTLMHLLEPLVPSIDKFDISGLQLSIYTDADGDLPPVALEDD 307
Query: 250 RVVEIQGEPVGVQKALESIASHLRKFLVDRSVLPLFEGQMKM 291
++EI G P V +ALE +A HLRK+LV RSV+PLF+ + +
Sbjct: 308 MIIEIWGLPARVHQALELVACHLRKYLVHRSVIPLFDPHVSI 349
>Os02g0125500
Length = 458
Score = 162 bits (411), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 83/193 (43%), Positives = 125/193 (64%), Gaps = 12/193 (6%)
Query: 102 KWPGWPGESVFRILIPAQKVGAIIGRKGEFIKKMCEESKARIKILD-GPPGVPERTVMIS 160
+WPGWPG SVFR+++ KVG +IGR+G+ IK++CE+++AR+++L+ R V+IS
Sbjct: 89 RWPGWPGASVFRLVVATDKVGGLIGRRGDTIKRLCEDTRARVRVLEAAAAAAANRIVLIS 148
Query: 161 AKDEPDAPISPAMDGLFRVYKRITD-----GSDGDSGQPERNISNVGPTRLLVPASQAGS 215
A +E A + PAMD +++ I D D SG S +LLVP++QA
Sbjct: 149 ATEESQAELPPAMDAAIKIFMHINDIEKINCDDTLSGSAPEKCS----AKLLVPSAQATH 204
Query: 216 LIGKQGATIKSIQDSSKSIVRIVETLPLVALN--DDRVVEIQGEPVGVQKALESIASHLR 273
LIGKQG IKSIQ+++ + V+I++ + L++ + D+R+V+I G P+ V AL+S+ LR
Sbjct: 205 LIGKQGVRIKSIQETTGATVKIIDKVELLSYDVVDERIVDIHGAPLKVLHALKSVLGVLR 264
Query: 274 KFLVDRSVLPLFE 286
KFLVD VL LFE
Sbjct: 265 KFLVDHGVLHLFE 277
Score = 77.4 bits (189), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 39/66 (59%), Positives = 49/66 (74%)
Query: 383 QVTHNMQIPLSYADAVIGAAGASISYIRRHSGATVTIQESRGAPGEMTVEIIGSASQVQT 442
++T MQIPL +A+ +IGA G +ISYIR SGA V ++ESR P E+ V I GS+SQVQT
Sbjct: 322 KITQTMQIPLPFAEEIIGARGQNISYIRSVSGAVVDLEESRDYPNEVLVMIKGSSSQVQT 381
Query: 443 AQQLVQ 448
A QLVQ
Sbjct: 382 AHQLVQ 387
>Os03g0376800 KH domain containing protein
Length = 542
Score = 109 bits (272), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 65/204 (31%), Positives = 117/204 (57%), Gaps = 14/204 (6%)
Query: 78 QEESDNAYNGEVKQQDSLPVEADKKWPGWPGESVFRILIPAQKVGAIIGRKGEFIKKMCE 137
+ SD A NG K+++ D + P ++V+R L P++K+G+IIGR GE K+M
Sbjct: 12 KRHSDYAENGGGKRRN----PGDDTYAPGPDDTVYRYLCPSRKIGSIIGRGGEIAKQMRA 67
Query: 138 ESKARIKILDGPPGVPERTVMISAKD-------EPDAPISPAMDGLFRVYKRITDGSDGD 190
+++A+I+I + G ER + I + + + + PA D LFRV+++++ D
Sbjct: 68 DTQAKIRIGESVSGCDERVITIFSSSRETNTLVDAEDKVCPAQDALFRVHEKLSIDDDIG 127
Query: 191 SGQPERNISNVGPTRLLVPASQAGSLIGKQGATIKSIQDSSKSIVRIV--ETLPLVALND 248
+ + + ++ V RLLVP+ Q G +IGK G I+ I+ + + +R++ E LP A++
Sbjct: 128 NEESDEGLAQV-TVRLLVPSDQIGCIIGKGGHIIQGIRSDTGAHIRVLSNENLPACAISG 186
Query: 249 DRVVEIQGEPVGVQKALESIASHL 272
D +++I G+ V+KAL ++S L
Sbjct: 187 DELLQISGDSTVVRKALLQVSSRL 210
Score = 66.6 bits (161), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 58/173 (33%), Positives = 95/173 (54%), Gaps = 9/173 (5%)
Query: 109 ESVFRILIPAQKVGAIIGRKGEFIKKMCEESKARIKILDGPPGVPERTVMISAKDEPDAP 168
E R+L A VG +IG+ G IK++ +ES A IK+ D + + +SAK+ + P
Sbjct: 279 EFSLRLLCAASNVGGVIGKGGGIIKQIRQESGAFIKV-DSSNTEDDCIITVSAKEFFEDP 337
Query: 169 ISPAMDGLFRVYKRITDGSDGDSGQPERNISNVGPTRLLVPASQAGSLIGKQGATIKSIQ 228
+SP ++ + R ++ +D +S P TRLLV S+ G LIGK G+ I I+
Sbjct: 338 VSPTINAAVHLQPRCSEKTDPESAIPSYT------TRLLVSTSRIGCLIGKGGSIITEIR 391
Query: 229 DSSKSIVRIV--ETLPLVALNDDRVVEIQGEPVGVQKALESIASHLRKFLVDR 279
+S++ +RI+ E +P VA D+ +V+I G+ V+ AL I + L+ +R
Sbjct: 392 RTSRANIRILSKENVPKVAAEDEEMVQISGDLDVVRHALLQITTRLKANFFER 444
>Os10g0495000 KH domain containing protein
Length = 762
Score = 100 bits (250), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 65/211 (30%), Positives = 114/211 (54%), Gaps = 21/211 (9%)
Query: 83 NAYNGEVKQQDSLPVEADKKWPGWPGESVFRILIPAQKVGAIIGRKGEFIKKMCEESKAR 142
N+ +G+ K+ +S D P E+++RIL P +K+G+++GR G+ +K + + +KA+
Sbjct: 21 NSDDGKRKRLNSR--HDDGTISSEPIETIYRILCPVKKIGSVLGRGGDIVKALRDTTKAK 78
Query: 143 IKILDGPPGVPERTVMI---SAKDEPDA------------PISPAMDGLFRVYKRITDGS 187
I++ D PG ER ++I S++ E A P A D L +++ +I
Sbjct: 79 IRVADSIPGADERVIIIFNYSSQTEEAAQNISTDGFEDMKPHCFAQDALLKIHDKIAADE 138
Query: 188 DGDSGQPERNISNVGPT--RLLVPASQAGSLIGKQGATIKSIQDSSKSIVRIV--ETLPL 243
D +G NV R+LVP +Q G L+GK G+ I+ +++ + + +R++ E LP
Sbjct: 139 DLHAGIVHEKSENVDDVIARILVPGNQVGCLLGKGGSIIQQLRNDTGAGIRVLPSENLPQ 198
Query: 244 VALNDDRVVEIQGEPVGVQKALESIASHLRK 274
AL D +V+I G V+KAL I++ L +
Sbjct: 199 CALKSDELVQISGSSSLVRKALYEISTRLHQ 229
Score = 67.8 bits (164), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 41/168 (24%), Positives = 91/168 (54%), Gaps = 13/168 (7%)
Query: 109 ESVFRILIPAQKVGAIIGRKGEFIKKMCEESKARIKILD-GPPGVPERTVMISAKDEPDA 167
E +IL ++ +G +IG+ G ++++ +++ A +++ + G ER +++S+++ PD
Sbjct: 294 EFSIKILCASEHIGQVIGKSGGNVRQVEQQTGACVQVKEVGKNASEERLIVVSSQEIPDD 353
Query: 168 PISPAMDGLFRVYKRITDGSDGDSGQPERNISNVGPTRLLVPASQAGSLIGKQGATIKSI 227
P+SP ++ L ++ +++ ++ TRL+VP+++ G +IG+ G I +
Sbjct: 354 PVSPTIEALILLHSKVSTLAENHHLT----------TRLVVPSNKVGCIIGEGGKVITEM 403
Query: 228 QDSSKSIVRIVETL--PLVALNDDRVVEIQGEPVGVQKALESIASHLR 273
+ + + +R+ P D+ +V++ G P + AL IAS LR
Sbjct: 404 RRRTGAEIRVYSKADKPKYLSFDEELVQVAGLPAIARGALTEIASRLR 451
>Os09g0498600 KH domain containing protein
Length = 398
Score = 87.0 bits (214), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 55/171 (32%), Positives = 92/171 (53%), Gaps = 10/171 (5%)
Query: 109 ESVFRILIPAQKVGAIIGRKGEFIKKMCEESKARIKILDGPPGVPERTVMISAKDEPDAP 168
E VFR++ + VG+IIG+ G I+ + E+ A IKI++ ER ++ISA + +
Sbjct: 25 EIVFRMICLNEMVGSIIGKGGSTIRALQSETGASIKIIEPNSDSEERVIVISAHENSEMM 84
Query: 169 ISPAMDGLFRVYKRITDGSDGDSGQPERNISNVGPTRLLVPASQAGSLIGKQGATIKSIQ 228
SPA D + RV+ RI++ S + S+ RLLVP+ G L+GK G+ I ++
Sbjct: 85 HSPAQDAVLRVHSRISESS--------MDKSSAVTARLLVPSQHIGCLLGKGGSIIAEMR 136
Query: 229 DSSKSIVRIV--ETLPLVALNDDRVVEIQGEPVGVQKALESIASHLRKFLV 277
+ + +RI E +P A +D +V++ G +Q AL I +R ++
Sbjct: 137 KITGAGIRIFGNEQIPRCAQRNDELVQVTGSFQSIQDALLHITGRIRDVII 187
>Os10g0564000 KH domain containing protein
Length = 100
Score = 84.7 bits (208), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 40/67 (59%), Positives = 55/67 (82%)
Query: 382 SQVTHNMQIPLSYADAVIGAAGASISYIRRHSGATVTIQESRGAPGEMTVEIIGSASQVQ 441
+Q+T MQIPL+YA+ +IG GA+I+YIR +SGA VTIQES G+P ++TVE+ G++SQVQ
Sbjct: 25 TQITQTMQIPLTYAEDIIGVKGANIAYIRANSGAVVTIQESLGSPDDITVEMKGTSSQVQ 84
Query: 442 TAQQLVQ 448
A QL+Q
Sbjct: 85 AAYQLIQ 91
>Os08g0200400 KH domain containing protein
Length = 441
Score = 83.2 bits (204), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 61/191 (31%), Positives = 96/191 (50%), Gaps = 12/191 (6%)
Query: 92 QDSLPVEADKKWPGWPGESVFRILIPAQKVGAIIGRKGEFIKKMCEESKARIKILDGPPG 151
Q LPV P GE V R+L PA K+G +IG+ G IK + +ES ARI + D
Sbjct: 104 QSVLPVIPAYNTPKCSGELVLRVLCPAGKIGLVIGKGGVTIKSIRKESGARIDVDDSKND 163
Query: 152 VPERTVMISAKDEPDAPISPAMDGLFRVYKRITDGSDGDSGQPERNISNVGPTRLLVPAS 211
E + I++ + D S A++ + + +I D ++G + N+ RLLVP
Sbjct: 164 REESIITITSNEATDDAKSAAVEAVLLLQSKINDDNEG-----KMNL------RLLVPGK 212
Query: 212 QAGSLIGKQGATIKSIQDSSKSIVRIVE-TLPLVALNDDRVVEIQGEPVGVQKALESIAS 270
G LIGK G+ + ++ +K+ + I + P A + D +VE+ GE ++ AL I
Sbjct: 213 VIGCLIGKGGSIVNDMRSKTKAAIYISKGEKPRKASSSDELVEVFGEVENLRDALVQIVL 272
Query: 271 HLRKFLVDRSV 281
LR ++ SV
Sbjct: 273 RLRDDVLRDSV 283
Database: rap3
Posted date: Nov 19, 2010 6:03 PM
Number of letters in database: 17,035,801
Number of sequences in database: 52,214
Lambda      K        H
 0.311    0.130    0.368
Gapped
Lambda      K        H
 0.267    0.0410   0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 52214
Number of Hits to DB: 14,195,784
Number of extensions: 608051
Number of successful extensions: 1499
Number of sequences better than 1.0e-10: 11
Number of HSP's gapped: 1446
Number of HSP's successfully gapped: 16
Length of query: 512
Length of database: 17,035,801
Length adjustment: 105
Effective length of query: 407
Effective length of database: 11,553,331
Effective search space: 4702205717
Effective search space used: 4702205717
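The effective lengths and the search space reported here follow from the query length, the database size, and the length adjustment. A short Python check of that arithmetic, using only numbers printed in this report (the variable names are illustrative):

```python
# Values copied from the statistics block of this report.
query_length = 512
db_letters = 17_035_801
db_sequences = 52_214
length_adjustment = 105

# The adjustment is subtracted once from the query ...
effective_query = query_length - length_adjustment
# ... and once per database sequence from the database length.
effective_db = db_letters - db_sequences * length_adjustment
# The search space is the product of the two effective lengths.
effective_search_space = effective_query * effective_db

print(effective_query, effective_db, effective_search_space)
```

The three results can be compared directly with the "Effective length of query", "Effective length of database", and "Effective search space" lines.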
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 158 (65.5 bits)
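The bit scores and E values in the alignments can be approximately reproduced from the raw scores with the standard Karlin-Altschul relations S' = (Lambda*S - ln K) / ln 2 and E = (search space) * 2^(-S'). The Python sketch below assumes the gapped Lambda and K printed in this report apply to every hit; those values are rounded, so agreement within about one bit is all that should be expected. The function names are illustrative.

```python
import math

# Gapped Karlin-Altschul parameters and search space as printed in this report.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
EFFECTIVE_SEARCH_SPACE = 4_702_205_717


def bit_score(raw_score: float) -> float:
    """Convert a raw alignment score to a bit score."""
    return (GAPPED_LAMBDA * raw_score - math.log(GAPPED_K)) / math.log(2)


def expect_value(raw_score: float) -> float:
    """Expected number of chance hits scoring at least raw_score."""
    return EFFECTIVE_SEARCH_SPACE * 2.0 ** (-bit_score(raw_score))


# Raw scores taken from the "Score = ... bits (raw)" lines of this report.
for raw in (2112, 945, 208):
    print(raw, round(bit_score(raw), 1), f"{expect_value(raw):.1e}")
```

For example, the raw score 208 of the Os10g0564000 hit can be checked against its reported 84.7 bits and Expect of 1e-16.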