BLASTP 2.2.23 [Feb-03-2010]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment:
Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala,
Aleksandr Morgulis, Alejandro A. Schäffer, and Yi-Kuo Yu (2005) "Protein database
searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Os02g0670000 Os02g0670000|AK073148
(475 letters)
Database: rap3
52,214 sequences; 17,035,801 total letters
Searching..................................................done
                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value
Os02g0670000 Protein of unknown function DUF300 family protein       982   0.0
Os06g0726600 Protein of unknown function DUF300 family protein       538   e-153
Os05g0516900 Protein of unknown function DUF300 family protein       390   e-108
Os04g0563100 Protein of unknown function DUF300 family protein       181   1e-45
Os07g0244300 Protein of unknown function DUF300 family protein       139   6e-33
Os07g0506000 Protein of unknown function DUF300 family protein       125   9e-29
Os03g0406900 Protein of unknown function DUF300 family protein       112   4e-25
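
The summary lines above can be read programmatically. The following is a minimal sketch, assuming each line ends with the bit score followed by the E-value, as it does in this report. The function name `parse_hit_line` is hypothetical and not part of BLAST. Note that this version of BLAST prints some E-values without a mantissa (for example `e-153`), which Python's `float()` does not accept until a leading `1` is added.

```python
def parse_hit_line(line):
    """Split one summary line into (id, description, bit score, E-value)."""
    fields = line.split()
    seq_id = fields[0]
    # Everything between the identifier and the last two columns is description.
    description = " ".join(fields[1:-2])
    bits = float(fields[-2])
    evalue_text = fields[-1]
    # Values such as "e-153" lack a mantissa; treat them as "1e-153".
    if evalue_text.startswith("e"):
        evalue_text = "1" + evalue_text
    return seq_id, description, bits, float(evalue_text)


hit = parse_hit_line(
    "Os06g0726600 Protein of unknown function DUF300 family protein 538 e-153"
)
print(hit)
```

Because the line is split on whitespace, the sketch works whether or not the columns are padded for alignment.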
>Os02g0670000 Protein of unknown function DUF300 family protein
Length = 475
Score = 982 bits (2539), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 475/475 (100%), Positives = 475/475 (100%)
Query: 1 MRVNPALFLPLMAEYAAPTWAILISGFFMLLSVSLSMYLIFQHLSAYNNPEEQKFVLGVI 60
MRVNPALFLPLMAEYAAPTWAILISGFFMLLSVSLSMYLIFQHLSAYNNPEEQKFVLGVI
Sbjct: 1 MRVNPALFLPLMAEYAAPTWAILISGFFMLLSVSLSMYLIFQHLSAYNNPEEQKFVLGVI 60
Query: 61 LMVPCYAVESYVSLVNPDTSVYCGILRDAYEAFAMYCFGRYITACLGGEERTIAFLKREG 120
LMVPCYAVESYVSLVNPDTSVYCGILRDAYEAFAMYCFGRYITACLGGEERTIAFLKREG
Sbjct: 61 LMVPCYAVESYVSLVNPDTSVYCGILRDAYEAFAMYCFGRYITACLGGEERTIAFLKREG 120
Query: 121 GGDSGEPLLHGASEKGIIHHHFPVNYILKPWRMGVRFYQIIKFGIFQYVIIKTLTASLSL 180
GGDSGEPLLHGASEKGIIHHHFPVNYILKPWRMGVRFYQIIKFGIFQYVIIKTLTASLSL
Sbjct: 121 GGDSGEPLLHGASEKGIIHHHFPVNYILKPWRMGVRFYQIIKFGIFQYVIIKTLTASLSL 180
Query: 181 ILQPFGAYCDGEFNLRCGYPYFAAVLNFSQYWALYCLVEWYTATKDELAHIKPLAKFLSF 240
ILQPFGAYCDGEFNLRCGYPYFAAVLNFSQYWALYCLVEWYTATKDELAHIKPLAKFLSF
Sbjct: 181 ILQPFGAYCDGEFNLRCGYPYFAAVLNFSQYWALYCLVEWYTATKDELAHIKPLAKFLSF 240
Query: 241 KSIVFLTWWQGIMIAIMYSLGLVRSPLAQSLELKSSIQDFIICIEMGIASVVHLYVFPAK 300
KSIVFLTWWQGIMIAIMYSLGLVRSPLAQSLELKSSIQDFIICIEMGIASVVHLYVFPAK
Sbjct: 241 KSIVFLTWWQGIMIAIMYSLGLVRSPLAQSLELKSSIQDFIICIEMGIASVVHLYVFPAK 300
Query: 301 PYSLLGNHRSPENISVLGDYAATDPVDPDEIKDISRPTKLRLPQLEPDEIIVTNVKESVR 360
PYSLLGNHRSPENISVLGDYAATDPVDPDEIKDISRPTKLRLPQLEPDEIIVTNVKESVR
Sbjct: 301 PYSLLGNHRSPENISVLGDYAATDPVDPDEIKDISRPTKLRLPQLEPDEIIVTNVKESVR 360
Query: 361 DFVIGSGEYVIKDLKFTMKQAVRPVGKRFEKLMKKKGKFGQSRDDNWVSTSTPQRAIHGI 420
DFVIGSGEYVIKDLKFTMKQAVRPVGKRFEKLMKKKGKFGQSRDDNWVSTSTPQRAIHGI
Sbjct: 361 DFVIGSGEYVIKDLKFTMKQAVRPVGKRFEKLMKKKGKFGQSRDDNWVSTSTPQRAIHGI 420
Query: 421 DDPLICGSSSDSGIGRGKRHRRDVSSAGVVDSWEGSDQTSDGYVIRGRRWEIKKS 475
DDPLICGSSSDSGIGRGKRHRRDVSSAGVVDSWEGSDQTSDGYVIRGRRWEIKKS
Sbjct: 421 DDPLICGSSSDSGIGRGKRHRRDVSSAGVVDSWEGSDQTSDGYVIRGRRWEIKKS 475
>Os06g0726600 Protein of unknown function DUF300 family protein
Length = 479
Score = 538 bits (1387), Expect = e-153, Method: Compositional matrix adjust.
Identities = 273/463 (58%), Positives = 342/463 (73%), Gaps = 8/463 (1%)
Query: 15 YAAPTWAILISGFFMLLSVSLSMYLIFQHLSAYNNPEEQKFVLGVILMVPCYAVESYVSL 74
YA P WA + +G F++ S+SLS++L+F HLSAY NPEEQKF++GVILMVPCYAVESY+SL
Sbjct: 14 YAPPIWASITAGIFVITSLSLSLFLLFNHLSAYKNPEEQKFLVGVILMVPCYAVESYISL 73
Query: 75 VNPDTSVYCGILRDAYEAFAMYCFGRYITACLGGEERTIAFLKREGGGDSGEPLLHGASE 134
VNP SV ILRD YEAFAMYCFGRY+ ACLGGE+RTI FLKREG S PLL +
Sbjct: 74 VNPSISVDIEILRDGYEAFAMYCFGRYLVACLGGEDRTIEFLKREGSSGSDVPLLDHETG 133
Query: 135 KGIIHHHFPVNYILKPWRMGVRFYQIIKFGIFQYVIIKTLTASLSLILQPFGAYCDGEFN 194
+ ++H FP+NY+LKPW +G FY +IKFG+ QYVIIKT+ A L++IL+ FG YC+GEF
Sbjct: 134 QRYVNHPFPMNYMLKPWPLGEWFYLVIKFGLVQYVIIKTICAILAVILESFGVYCEGEFK 193
Query: 195 LRCGYPYFAAVLNFSQYWALYCLVEWYTATKDELAHIKPLAKFLSFKSIVFLTWWQGIMI 254
CGY Y A VLNFSQ WALYCLV++Y A KDELAHIKPLAKFL+FKSIVFLTWWQG++I
Sbjct: 194 WNCGYSYTAVVLNFSQSWALYCLVQFYAAIKDELAHIKPLAKFLTFKSIVFLTWWQGVVI 253
Query: 255 AIMYSLGLVRSPLAQSLELKSSIQDFIICIEMGIASVVHLYVFPAKPYSLLGNHRSPENI 314
A++Y+ GL+R P+AQ L+ KSSIQDFIICIEMG+AS+ HLYVFPAKPY ++G+ R +
Sbjct: 254 ALLYNWGLLRGPIAQELQFKSSIQDFIICIEMGVASIAHLYVFPAKPYEMMGD-RFIGGV 312
Query: 315 SVLGDYAATD-PVDPDEIKDISRPTKLRLPQLEPDEIIVTNVKESVRDFVIGSGEYVIKD 373
SVLGDYA+ D P+DPDE+KD RPTK RLPQ T +KESVRD V+G GEY++ D
Sbjct: 313 SVLGDYASVDCPLDPDEVKDSERPTKTRLPQPGDRVRCSTGIKESVRDVVLGGGEYIVND 372
Query: 374 LKFTMKQAVRPVGKRFEKLMKK----KGKFGQSRDDNWV-STSTPQRAIHGIDDPLICGS 428
LKFT+ AV P+ ++ ++ + + + ++ DD+ + S + R I GIDDPL+ GS
Sbjct: 373 LKFTVNHAVEPINEKLHRISQNIKKHEKEKKKTNDDSCINSQQSLSRVISGIDDPLLNGS 432
Query: 429 SSD-SGIGRGKRHRRDVSSAGVVDSWEGSDQTSDGYVIRGRRW 470
SD SG + ++HRR E SDQ GY IRG RW
Sbjct: 433 LSDNSGQKKSRKHRRKSGYGSAESGGESSDQGLGGYEIRGHRW 475
>Os05g0516900 Protein of unknown function DUF300 family protein
Length = 488
Score = 390 bits (1002), Expect = e-108, Method: Compositional matrix adjust.
Identities = 201/444 (45%), Positives = 280/444 (63%), Gaps = 40/444 (9%)
Query: 18 PTWAILISGFFMLLSVSLSMYLIFQHLSAYNNPEEQKFVLGVILMVPCYAVESYVSLVNP 77
P+W I+ +G + S+ LS++LIF+HL AY+ PEEQKF++G+ILMVP YAV+S+ SL+N
Sbjct: 39 PSWPIVSAGISVTASLVLSLFLIFEHLCAYHQPEEQKFLIGLILMVPVYAVQSFFSLLNS 98
Query: 78 DTSVYCGILRDAYEAFAMYCFGRYITACLGGEERTIAFLKREGGGDSGEPLLHGASEKGI 137
+ + C ++RD YEAFAMYCF RY+ ACLGGEE TI F++ PLL + GI
Sbjct: 99 NVAFICELMRDCYEAFAMYCFERYLIACLGGEESTIRFMEGRFQFSESSPLLDVDYDYGI 158
Query: 138 IHHHFPVNYILKPWRMGVRFYQIIKFGIFQYVIIKTLTASLSLILQPFGAYCDGEFNLRC 197
+ H FP+N+ ++ W +G FY +K GI QY+I+K + A L++ +Q G Y +G+F R
Sbjct: 159 VKHPFPLNWFMRNWYLGPDFYHAVKVGIVQYMILKPICAILAIFMQLIGIYGEGKFAWRY 218
Query: 198 GYPYFAAVLNFSQYWALYCLVEWYTATKDELAHIKPLAKFLSFKSIVFLTWWQGIMIAIM 257
GYPY A VLNFSQ WALYCL+++YTATK++L IKPL+KFL+FKSIVFLTWWQGI +A +
Sbjct: 219 GYPYLAIVLNFSQTWALYCLIQFYTATKEKLEPIKPLSKFLTFKSIVFLTWWQGIAVAFL 278
Query: 258 YSLGLVRSPLAQSLELKSSIQDFIICIEMGIASVVHLYVFPAKPYSLLGNHRSPENISVL 317
+S GL + LAQ + + IQD+IIC+EMG+A+VVHL VFPAKPY RS N++V+
Sbjct: 279 FSTGLFKGHLAQRFQTR--IQDYIICLEMGVAAVVHLKVFPAKPYR--RGERSVSNVAVM 334
Query: 318 GDYAATDPVDPDEIKDI------------SRPTKLRLPQLEPDEIIVTNVKESVRDFVIG 365
DYA+ DP+E ++I SR +L PQ SVRD V+G
Sbjct: 335 SDYASLGASDPEEEREIDNVAIMQAARPDSRDRRLSFPQ-------------SVRDVVLG 381
Query: 366 SGEYVIKDLKFTMKQAVRPVGKRFEKLMKKKGKFGQ-----------SRDDNWVSTSTPQ 414
SGE ++ D+K+T+ V PV + F K+ + + + ++DD+ V
Sbjct: 382 SGEIMVDDVKYTVSHVVEPVERSFSKINRTLHQISENVKQLEKQKRKAKDDSDVPLEPFS 441
Query: 415 RAIHGIDDPLICGSSSDSGIGRGK 438
D + GS SDSG+ R K
Sbjct: 442 EEFAEAHDNVFGGSVSDSGLARKK 465
>Os04g0563100 Protein of unknown function DUF300 family protein
Length = 104
Score = 181 bits (459), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 85/92 (92%), Positives = 90/92 (97%)
Query: 197 CGYPYFAAVLNFSQYWALYCLVEWYTATKDELAHIKPLAKFLSFKSIVFLTWWQGIMIAI 256
C YPYFAAVLNFSQYWALYCLVEWYTATKDELAHIKPLAKFLSFKSIVFLTWWQG++IAI
Sbjct: 1 CRYPYFAAVLNFSQYWALYCLVEWYTATKDELAHIKPLAKFLSFKSIVFLTWWQGVVIAI 60
Query: 257 MYSLGLVRSPLAQSLELKSSIQDFIICIEMGI 288
MYSLGL+RSPLAQSLELKSSIQDFIICIE+ +
Sbjct: 61 MYSLGLLRSPLAQSLELKSSIQDFIICIEVLV 92
>Os07g0244300 Protein of unknown function DUF300 family protein
Length = 403
Score = 139 bits (349), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 86/273 (31%), Positives = 143/273 (52%), Gaps = 23/273 (8%)
Query: 39 LIFQHLSAYNNPEEQKFVLGVILMVPCYAVESYVSLVNPDTSVYCGILRDAYEAFAMYCF 98
L+++HL Y P Q+F++ +ILMVP YAV S++SLV P +++Y +R+ Y+A+ +Y F
Sbjct: 28 LVYRHLLHYAEPTHQRFIVRIILMVPVYAVMSFLSLVLPGSAIYFNSIREIYDAWVIYNF 87
Query: 99 GRYITACLGGEERTIAFLKREGGGDSGEPLLHGASEKGIIHHHFPVNYILKPWRMGVRFY 158
A +GG + L G S +P F + + RF
Sbjct: 88 FSLCLAWVGGPGAVVVSLT----GRSLKP------------SWFMMTCCFSAVPLDGRFI 131
Query: 159 QIIKFGIFQYVIIKTLTASLSLILQPFGAYCDGEFNLRCGYPYFAAVLNFSQYWALYCLV 218
+ K G Q+VI+K + ++ IL G Y DG F++ Y Y + S AL+ L
Sbjct: 132 RRCKQGCLQFVILKPILVVITFILYAKGKYEDGNFSVNQSYLYITIIYTISYSMALFALA 191
Query: 219 EWYTATKDELAHIKPLAKFLSFKSIVFLTWWQGIMIAIMYSLGLVRSPLAQSLELKSSIQ 278
+Y A +D L P+ KF+ KS+VFLT+WQG+++ + +S ++ E + +Q
Sbjct: 192 LFYVACRDLLQPYNPVPKFIIIKSVVFLTYWQGVLVFLA-----AKSRFIKNAEEAAYLQ 246
Query: 279 DFIICIEMGIASVVHLYVFPAKPYSLLGNHRSP 311
+F++C+EM IA++ H + F K Y+ G++ P
Sbjct: 247 NFVLCVEMLIAAIGHQFAFSYKEYA--GSNARP 277
>Os07g0506000 Protein of unknown function DUF300 family protein
Length = 301
Score = 125 bits (313), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 87/294 (29%), Positives = 154/294 (52%), Gaps = 27/294 (9%)
Query: 17 APTWAILISGFFMLLSVSLSMYLIFQHLSAYNNPEEQKFVLGVILMVPCYAVESYVSLVN 76
APT +L + ++LS+ ++ L+ QHL + NP+EQK +L ++LM P YA+ S+V L++
Sbjct: 16 APTLTLLGAACCVMLSMHFTVQLVSQHLFYWKNPKEQKAILIIVLMAPLYAINSFVGLLD 75
Query: 77 PDTS----VYCGILRDAYEAFAMYCFGRYITACLGGEERTIAFLKREGGGDSGEPLLHGA 132
S + +++ YEA A+ + +A + + ++
Sbjct: 76 IKGSKTFFTFLDAVKECYEALAI--------------AKFMALMYSYLNISISKNIVPDE 121
Query: 133 SEKGIIHHHFPVNYIL-KPWRMGVRFYQIIKFGIFQYVIIKTLTASLSLILQPFGAYCDG 191
+ ++HH FPV+ L + R+ + +++K+ +Q+V+++ + A L + LQ G Y
Sbjct: 122 IKGRVLHHSFPVSLFLPRNVRLEHKTLKLLKYWTWQFVVVRPICAILMITLQLLGLYPSW 181
Query: 192 EFNLRCGYPYFAAVLNFSQYWALYCLVEWYTATKDELAHIKPLAKFLSFKSIVFLTWWQG 251
+ F +LNFS ALY LV +Y ELA KPLAKFL K IVF ++WQG
Sbjct: 182 -----VSWT-FTIILNFSVSMALYALVIFYHLFAKELAPHKPLAKFLCIKGIVFFSFWQG 235
Query: 252 IMIAIMYSLGLVRSP--LAQSLELKSSIQDFIICIEMGIASVVHLYVFPAKPYS 303
+ ++ ++G+++S ++ +IQ+ ++ IEM SV+ Y + PYS
Sbjct: 236 FALEVLAAVGIIQSHHFWLDVEHIQEAIQNVLVIIEMVFFSVLQQYAYHVAPYS 289
>Os03g0406900 Protein of unknown function DUF300 family protein
Length = 120
Score = 112 bits (281), Expect = 4e-25, Method: Composition-based stats.
Identities = 50/88 (56%), Positives = 70/88 (79%), Gaps = 4/88 (4%)
Query: 199 YPYFAAVLNFSQYWALYCLVEWYTATKDELAHIKPLAKFLSFKSIVFLTWWQGIMIAIMY 258
YPY A V+NFSQ WALYCLV++Y AT ++L I+PLAKF+SFK+IVF TWWQG+ IAI+
Sbjct: 33 YPYIAVVINFSQTWALYCLVKFYNATHEKLQEIRPLAKFISFKAIVFATWWQGLGIAIIC 92
Query: 259 SLGLVRSPLAQSLELKSSIQDFIICIEM 286
+G+ L + +++++IQDF+ICIE+
Sbjct: 93 HIGI----LPKEGKVQNAIQDFLICIEV 116
Database: rap3
Posted date: Nov 19, 2010 6:03 PM
Number of letters in database: 17,035,801
Number of sequences in database: 52,214
Lambda K H
0.324 0.140 0.434
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 52214
Number of Hits to DB: 16,623,109
Number of extensions: 729241
Number of successful extensions: 1666
Number of sequences better than 1.0e-10: 7
Number of HSP's gapped: 1649
Number of HSP's successfully gapped: 7
Length of query: 475
Length of database: 17,035,801
Length adjustment: 105
Effective length of query: 370
Effective length of database: 11,553,331
Effective search space: 4274732470
Effective search space used: 4274732470
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 158 (65.5 bits)
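
The figures in the statistics block above are related by the standard Karlin-Altschul formulas, and the arithmetic can be checked with a short sketch. The function names are hypothetical, and the inputs are copied from this report. Because Lambda and K are printed to three significant digits and bit scores are rounded, the recomputed values should agree only approximately with the reported ones. Compositional matrix adjustment may also alter the parameters used for individual hits.

```python
import math


def effective_search_space(query_len, db_letters, db_seqs, length_adjustment):
    """Effective query length times effective database length."""
    eff_query = query_len - length_adjustment
    # The length adjustment is subtracted once per database sequence.
    eff_db = db_letters - db_seqs * length_adjustment
    return eff_query * eff_db


def bits_from_raw(raw_score, lam, k):
    """Normalised score: S' = (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def evalue_from_bits(bit_score, search_space):
    """Expected number of chance hits: E = search_space * 2 ** (-S')."""
    return search_space * 2.0 ** (-bit_score)


space = effective_search_space(475, 17_035_801, 52_214, 105)
print(space)  # 4274732470, matching "Effective search space"

# S2 uses the gapped parameters (Lambda 0.267, K 0.0410);
# S1 uses the ungapped ones (Lambda 0.324, K 0.140).
print(round(bits_from_raw(158, 0.267, 0.0410), 1))
print(round(bits_from_raw(41, 0.324, 0.140), 1))

# Compare with the E-value of 1e-45 reported for the 181-bit hit Os04g0563100.
print(evalue_from_bits(181, space))
```

The first function reproduces the reported effective lengths exactly (370 for the query, 11,553,331 for the database), since only integer arithmetic is involved.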