BLASTP 2.2.23 [Feb-03-2010]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment:
Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala,
Aleksandr Morgulis, Alejandro A. Schäffer, and Yi-Kuo Yu (2005) "Protein database
searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Os01g0823500 Os01g0823500|AK064237
(379 letters)
Database: rap3
52,214 sequences; 17,035,801 total letters
Searching..................................................done
                                                                Score    E
Sequences producing significant alignments:                     (bits) Value
Os01g0823500 Protein of unknown function DUF623, plant doma...   580   e-166
Os05g0477200 Protein of unknown function DUF623, plant doma...   202   4e-52
Os07g0679200 Protein of unknown function DUF623, plant doma...    81   1e-15
Os05g0324600 Protein of unknown function DUF623, plant doma...    80   2e-15
Os05g0517100                                                      80   3e-15
Os01g0749800                                                      79   4e-15
Os03g0336900 Protein of unknown function DUF623, plant doma...    69   6e-12
Os02g0679700 Protein of unknown function DUF623, plant doma...    69   7e-12
Os01g0226700 Protein of unknown function DUF623, plant doma...    67   3e-11
>Os01g0823500 Protein of unknown function DUF623, plant domain containing protein
Length = 379
Score = 580 bits (1495), Expect = e-166, Method: Compositional matrix adjust.
Identities = 300/379 (79%), Positives = 300/379 (79%)
Query: 1 MGRHKFRLSDMIPNAWFFKLRDMRXXXXXXXXXXXXXXXXXVVTQSSVAVSRAGRACRPL 60
MGRHKFRLSDMIPNAWFFKLRDMR VVTQSSVAVSRAGRACRPL
Sbjct: 1 MGRHKFRLSDMIPNAWFFKLRDMRAARGGAGAGGGGASHGGVVTQSSVAVSRAGRACRPL 60
Query: 61 PNTPRHGALSLPHRASYYYTPRAGDLLVGSPLHPKCSDTQFPPLQLXXXXXXXXXXXXXX 120
PNTPRHGALSLPHRASYYYTPRAGDLLVGSPLHPKCSDTQFPPLQL
Sbjct: 61 PNTPRHGALSLPHRASYYYTPRAGDLLVGSPLHPKCSDTQFPPLQLSPPRKSRRRHRRRS 120
Query: 121 VKLAPXXXXXXXXXXXXXTGCRCGRKPELVVVEAPDTPPCRRDKFVGYNXXXXXXXXXXV 180
VKLAP TGCRCGRKPELVVVEAPDTPPCRRDKFVGYN V
Sbjct: 121 VKLAPSVSGSSVLSSPVSTGCRCGRKPELVVVEAPDTPPCRRDKFVGYNDDDDDEEEEEV 180
Query: 181 EFKKPTVAVAACDELDGKVITSATDIIIDLRTEKRPDKVLPPIVTKPARRELDGCDLEEK 240
EFKKPTVAVAACDELDGKVITSATDIIIDLRTEKRPDKVLPPIVTKPARRELDGCDLEEK
Sbjct: 181 EFKKPTVAVAACDELDGKVITSATDIIIDLRTEKRPDKVLPPIVTKPARRELDGCDLEEK 240
Query: 241 HIDVVRRASAKKPTTLLEQSKPRRSVSSARRLKTRANTPRIVXXXXXXXXXXXXXXXXXX 300
HIDVVRRASAKKPTTLLEQSKPRRSVSSARRLKTRANTPRIV
Sbjct: 241 HIDVVRRASAKKPTTLLEQSKPRRSVSSARRLKTRANTPRIVAKKSKPPPPPPPAAARSP 300
Query: 301 XXXXXXXLAESFAVVKSSRDPRRDFRESMEEMIAENGIRTAADLEDLLACYLSLNAAEYH 360
LAESFAVVKSSRDPRRDFRESMEEMIAENGIRTAADLEDLLACYLSLNAAEYH
Sbjct: 301 APTTKPPLAESFAVVKSSRDPRRDFRESMEEMIAENGIRTAADLEDLLACYLSLNAAEYH 360
Query: 361 DLIVDVFEHIWANLADIKM 379
DLIVDVFEHIWANLADIKM
Sbjct: 361 DLIVDVFEHIWANLADIKM 379
>Os05g0477200 Protein of unknown function DUF623, plant domain containing protein
Length = 472
Score = 202 bits (513), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 160/399 (40%), Positives = 197/399 (49%), Gaps = 46/399 (11%)
Query: 1 MGRHKFRLSDMIPNAWFFKLRDMRXXXXXXXXXXXXXXXXXVVTQSSVAVSRAGRACRPL 60
MGR KFRLSDM+PNAWF+KLRDMR + SS ++ R RA +
Sbjct: 89 MGRRKFRLSDMMPNAWFYKLRDMRARGGRGATA--------MQPPSSSSLMRGSRAAQQQ 140
Query: 61 PNTPRHGALS-----LPHRASYYYTPRAGDLLVGSPLHP-KCSDTQFPPLQLXXXXXXXX 114
T R G S LPHRASYYYT R ++ P P + D QFP L L
Sbjct: 141 AGTWRLGTSSSSSSLLPHRASYYYTTRDREVPPLPPPPPPRGVDDQFPSLTLSPPLPTRN 200
Query: 115 XXXXXXVKLAPXXXXXXXXXXXXXT----GCRCGRKPELVVVEAPDTPPCRRDKFVGYNX 170
V + GC P V +A + CRRD F+G +
Sbjct: 201 SRRRHRVGRFGSTEMDGGELVLAPSDDHDGCSHQEPP---VADASGSSRCRRDMFIGRDG 257
Query: 171 XXXXXXXXXVEFKKPTVAVAACDE---LDGKVITS---ATDIIIDLRTEKRPDKVLPPIV 224
VEF++ V +E +D KVITS + + P++VL P+V
Sbjct: 258 GRG------VEFRRRATTVDGPEEDAAVDVKVITSDADIIIDLGADDDDDTPERVLRPVV 311
Query: 225 TKPARRELDGCD-LEEKHIDVVRRASAKKPTTLLEQ------SKPRRSVSSARR-LKTRA 276
T+PARRELD C+ E KH+D+ + + + KPRRS S+RR LKTR
Sbjct: 312 TRPARRELDWCEPAEVKHVDLAELMTPRASSASASSEKSISTGKPRRSSVSSRRRLKTRT 371
Query: 277 NTPRIVXXXXXXXXXXXXXXXXXXXXXXXXXLAESFAVVKSSRDPRRDFRESMEEMIAEN 336
N+PR+ LA SFAVVK+S DPRRDF ESMEEMIAEN
Sbjct: 372 NSPRLAACRKGKPTARATTTTPTQPP-----LAHSFAVVKTSSDPRRDFLESMEEMIAEN 426
Query: 337 GIRTAADLEDLLACYLSLNAAEYHDLIVDVFEHIWANLA 375
GIR A DLEDLLACYLSLN+ EYHDLIV+VFE +W LA
Sbjct: 427 GIRDAGDLEDLLACYLSLNSGEYHDLIVEVFEQVWTGLA 465
>Os07g0679200 Protein of unknown function DUF623, plant domain containing protein
Length = 330
Score = 81.3 bits (199), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 39/64 (60%), Positives = 44/64 (68%)
Query: 308 LAESFAVVKSSRDPRRDFRESMEEMIAENGIRTAADLEDLLACYLSLNAAEYHDLIVDVF 367
L ES VVK S DP DF ESM EMIA N +R+ DLE+LLACYL+LNAAE+H IV F
Sbjct: 250 LYESLVVVKESADPEEDFLESMAEMIAANDVRSPRDLEELLACYLALNAAEHHRAIVGAF 309
Query: 368 EHIW 371
W
Sbjct: 310 RRAW 313
>Os05g0324600 Protein of unknown function DUF623, plant domain containing protein
Length = 260
Score = 80.5 bits (197), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 35/63 (55%), Positives = 47/63 (74%)
Query: 312 FAVVKSSRDPRRDFRESMEEMIAENGIRTAADLEDLLACYLSLNAAEYHDLIVDVFEHIW 371
FAVVK+S +P RDFRESM EM+ NG+R+ DL +LL CYLSLNA E+H +I++ F +W
Sbjct: 188 FAVVKASAEPARDFRESMVEMVVGNGMRSPEDLLELLECYLSLNAREHHGVIMEAFRGVW 247
Query: 372 ANL 374
+
Sbjct: 248 VEI 250
>Os05g0517100
Length = 316
Score = 79.7 bits (195), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 39/73 (53%), Positives = 50/73 (68%), Gaps = 3/73 (4%)
Query: 310 ESFAVVKSSRDPRRDFRESMEEMIAENGIRTAA---DLEDLLACYLSLNAAEYHDLIVDV 366
E AVV+ +RDP+R FRESM EMIA +G AA +LE LLACYL+LNA E+HD IV V
Sbjct: 233 ERLAVVRRTRDPQRAFRESMVEMIASSGGSIAARPEELERLLACYLALNADEHHDCIVKV 292
Query: 367 FEHIWANLADIKM 379
F +W ++ +
Sbjct: 293 FRQVWFEYINLHL 305
>Os01g0749800
Length = 324
Score = 79.3 bits (194), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 38/65 (58%), Positives = 45/65 (69%)
Query: 310 ESFAVVKSSRDPRRDFRESMEEMIAENGIRTAADLEDLLACYLSLNAAEYHDLIVDVFEH 369
E FAVV+ + DP+R+FR SM EMIA I +LE LLACYLSLNA E+HD IV VF
Sbjct: 247 ERFAVVRRTSDPQREFRASMVEMIASKRIGRPEELETLLACYLSLNADEHHDCIVKVFRQ 306
Query: 370 IWANL 374
+W L
Sbjct: 307 VWFEL 311
>Os03g0336900 Protein of unknown function DUF623, plant domain containing protein
Length = 301
Score = 68.9 bits (167), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 36/75 (48%), Positives = 48/75 (64%), Gaps = 3/75 (4%)
Query: 308 LAESFAVVKSSRDPRRDFRESMEEMIAENGIRTAADLEDLLACYLSLNAAEYHDLIVDVF 367
L ES AVV S +P + +SM EM+ NG+R DL+DLLACYLSLNAAE+H IV +F
Sbjct: 223 LRESEAVVLESTEPELELVDSMIEMLCTNGVRRLEDLQDLLACYLSLNAAEHHRTIVALF 282
Query: 368 EH---IWANLADIKM 379
+W +L ++
Sbjct: 283 RRVVLVWIHLGSQRL 297
>Os02g0679700 Protein of unknown function DUF623, plant domain containing protein
Length = 281
Score = 68.6 bits (166), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 33/64 (51%), Positives = 42/64 (65%)
Query: 308 LAESFAVVKSSRDPRRDFRESMEEMIAENGIRTAADLEDLLACYLSLNAAEYHDLIVDVF 367
L S AVVK S DP DFR+SM +MI ENGI DL ++L +L+LNA +HD+I+ F
Sbjct: 168 LDGSVAVVKQSDDPLGDFRQSMLQMIVENGIVAGEDLREMLRRFLTLNAPHHHDVILRAF 227
Query: 368 EHIW 371
IW
Sbjct: 228 AEIW 231
>Os01g0226700 Protein of unknown function DUF623, plant domain containing protein
Length = 250
Score = 66.6 bits (161), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 34/64 (53%), Positives = 40/64 (62%)
Query: 311 SFAVVKSSRDPRRDFRESMEEMIAENGIRTAADLEDLLACYLSLNAAEYHDLIVDVFEHI 370
FAVVK SRDP DFR SM EM+ + AA+LE LL YLSLNA +H +I+ F I
Sbjct: 184 GFAVVKRSRDPYADFRSSMVEMVVGRQLFGAAELERLLRSYLSLNAPRHHPVILQAFSDI 243
Query: 371 WANL 374
W L
Sbjct: 244 WVVL 247
Database: rap3
Posted date: Nov 19, 2010 6:03 PM
Number of letters in database: 17,035,801
Number of sequences in database: 52,214
Lambda     K      H
   0.321    0.135    0.407
Gapped
Lambda     K      H
   0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 52214
Number of Hits to DB: 10,680,250
Number of extensions: 361886
Number of successful extensions: 921
Number of sequences better than 1.0e-10: 9
Number of HSP's gapped: 913
Number of HSP's successfully gapped: 10
Length of query: 379
Length of database: 17,035,801
Length adjustment: 103
Effective length of query: 276
Effective length of database: 11,657,759
Effective search space: 3217541484
Effective search space used: 3217541484
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 157 (65.1 bits)
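Note on the statistics above: the effective lengths, search space, and bit scores in this report can be re-derived from the printed parameters. The sketch below is not part of the BLAST output and is not NCBI's code. It assumes the standard Karlin-Altschul relations: bit score S' = (Lambda*S - ln K) / ln 2, and E = (effective search space) * 2^(-S'). It also assumes that the first Lambda/K pair is the ungapped set and the pair under "Gapped" is the gapped set; that reading is inferred from the fact that the numbers then agree with the report. The function and constant names are illustrative only. Because the report prints Lambda and K rounded, results match only to the displayed precision.

```python
import math

# Values copied from the report footer.
QUERY_LEN = 379
DB_LETTERS = 17_035_801
DB_SEQS = 52_214
LENGTH_ADJUSTMENT = 103
UNGAPPED = (0.321, 0.135)   # (Lambda, K), first Lambda/K/H table
GAPPED = (0.267, 0.0410)    # (Lambda, K), table under "Gapped"


def effective_search_space(query_len, db_letters, db_seqs, adjustment):
    """Return (effective query length, effective db length, search space).

    The length adjustment is subtracted once from the query and once
    per database sequence.
    """
    eff_query = query_len - adjustment
    eff_db = db_letters - db_seqs * adjustment
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits: (Lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def dropoff_bits(raw_dropoff, lam):
    """Convert an X-dropoff value to bits (a score difference, so no K term)."""
    return lam * raw_dropoff / math.log(2)


def expect(bits, search_space):
    """Expected number of chance hits with at least this bit score."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    eff_q, eff_db, space = effective_search_space(
        QUERY_LEN, DB_LETTERS, DB_SEQS, LENGTH_ADJUSTMENT
    )
    print("effective query length:", eff_q)
    print("effective database length:", eff_db)
    print("effective search space:", space)

    # Raw score 513 is the second hit (Os05g0477200) in this report.
    bits = bit_score(513, *GAPPED)
    print("bits for raw score 513: %.1f" % bits)
    print("E-value estimate: %.0e" % expect(bits, space))

    # Cutoffs from the footer: S1 with the ungapped set, S2 with the gapped set.
    print("S1 bits: %.1f" % bit_score(41, *UNGAPPED))
    print("S2 bits: %.1f" % bit_score(157, *GAPPED))
    print("X1 bits: %.1f" % dropoff_bits(16, UNGAPPED[0]))
    print("X2 bits: %.1f" % dropoff_bits(38, GAPPED[0]))
```

The search-space arithmetic is exact (379 - 103 = 276; 17,035,801 - 52,214 * 103 = 11,657,759; 276 * 11,657,759 = 3,217,541,484). The bit-score and E-value lines are approximate for the rounding reason given above.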