BLASTP 2.2.23 [Feb-03-2010]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment:
Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala,
Aleksandr Morgulis, Alejandro A. Schäffer, and Yi-Kuo Yu (2005) "Protein database
searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Os01g0612600 Os01g0612600|AK066561
(326 letters)
Database: rap3
52,214 sequences; 17,035,801 total letters
Searching..................................................done
                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

Os01g0612600 Protein of unknown function DUF1644 family pro...   563   e-161
Os02g0770600 Protein of unknown function DUF1644 family pro...   158   6e-39
Os02g0566500 Protein of unknown function DUF1644 family pro...   145   3e-35
Os04g0448100 Protein of unknown function DUF1644 family pro...   144   6e-35
Os06g0693700 Protein of unknown function DUF1644 family pro...   138   5e-33
Os09g0451800 Protein of unknown function DUF1644 family pro...   135   5e-32
Os02g0150900 Protein of unknown function DUF1644 family pro...   131   7e-31
Os07g0419800 Protein of unknown function DUF1644 family pro...    86   3e-17
AK065480                                                          76   4e-14
>Os01g0612600 Protein of unknown function DUF1644 family protein
Length = 326
Score = 563 bits (1452), Expect = e-161, Method: Compositional matrix adjust.
Identities = 278/326 (85%), Positives = 278/326 (85%)
Query: 1 MPKDRSTRAVSYERRRSRVSPYPSNGKGCARRSEESXXXXXXXXXXXXXEWEDVRCPVCM 60
MPKDRSTRAVSYERRRSRVSPYPSNGKGCARRSEES EWEDVRCPVCM
Sbjct: 1 MPKDRSTRAVSYERRRSRVSPYPSNGKGCARRSEESAAAAAAAAAKQAAEWEDVRCPVCM 60
Query: 61 DHPHNAVLLVCSSHEKGCRPFMCDTSYRHSNCFDQYRXXXXXXXXXXXXXXXXXXXXXXC 120
DHPHNAVLLVCSSHEKGCRPFMCDTSYRHSNCFDQYR C
Sbjct: 61 DHPHNAVLLVCSSHEKGCRPFMCDTSYRHSNCFDQYRKASKESSKDSGASAAAAPECSEC 120
Query: 121 QQPIKLSCPLCRGPVSHWTKDYDARKYLNVKVRACTKESCEFRGAYGQLRRHARENHPTV 180
QQPIKLSCPLCRGPVSHWTKDYDARKYLNVKVRACTKESCEFRGAYGQLRRHARENHPTV
Sbjct: 121 QQPIKLSCPLCRGPVSHWTKDYDARKYLNVKVRACTKESCEFRGAYGQLRRHARENHPTV 180
Query: 181 RPTQVDPERQRDWHRMEQQRDLGDLFSMLRSGLSAREDGIGVSEGEEDISERALHSPSIT 240
RPTQVDPERQRDWHRMEQQRDLGDLFSMLRSGLSAREDGIGVSEGEEDISERALHSPSIT
Sbjct: 181 RPTQVDPERQRDWHRMEQQRDLGDLFSMLRSGLSAREDGIGVSEGEEDISERALHSPSIT 240
Query: 241 MVFIVRTGRSILHYREAFPGHHRRRTILLLGEAFGRESSPLXXXXXXXXXXXXXRENDEG 300
MVFIVRTGRSILHYREAFPGHHRRRTILLLGEAFGRESSPL RENDEG
Sbjct: 241 MVFIVRTGRSILHYREAFPGHHRRRTILLLGEAFGRESSPLGGASGSGDGDTTARENDEG 300
Query: 301 DDDVTLSTEASAGSQHDGEVDGDPAH 326
DDDVTLSTEASAGSQHDGEVDGDPAH
Sbjct: 301 DDDVTLSTEASAGSQHDGEVDGDPAH 326
>Os02g0770600 Protein of unknown function DUF1644 family protein
Length = 345
Score = 158 bits (399), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 79/217 (36%), Positives = 112/217 (51%), Gaps = 10/217 (4%)
Query: 5 RSTRAVSYERRRSRVSPYPSNGKGCARRSEESXXXXXXXXXXXXXEWEDVRCPVCMDHPH 64
RS RA + R+ R +PYP ++ + +WED C VCM++PH
Sbjct: 3 RSARARRHVARQLRSAPYPI--PSYRWKAMKESNRKKTLPAAQKMDWEDANCSVCMEYPH 60
Query: 65 NAVLLVCSSHEKGCRPFMCDTSYRHSNCFDQYRXXXXXXXXXXXXXXXXXXXXXXC---- 120
NAVLL+CSSH+KGCRP+MC TS+RHSNC DQ++
Sbjct: 61 NAVLLLCSSHDKGCRPYMCGTSHRHSNCLDQFKKAYTKGALLEELPANTVGTNLDSTPLI 120
Query: 121 ----QQPIKLSCPLCRGPVSHWTKDYDARKYLNVKVRACTKESCEFRGAYGQLRRHAREN 176
+ + L+CPLCRG V WT AR YLN K R C ++ C F G Y +LR+H +
Sbjct: 121 AGEKNESVDLACPLCRGKVKGWTIVEPARSYLNGKRRTCMQDGCSFVGTYKELRKHVKSE 180
Query: 177 HPTVRPTQVDPERQRDWHRMEQQRDLGDLFSMLRSGL 213
HP +P +VDP ++ W +E +R+ D S + + +
Sbjct: 181 HPLAKPREVDPILEQKWRLLEIERERQDALSTITATM 217
>Os02g0566500 Protein of unknown function DUF1644 family protein
Length = 362
Score = 145 bits (367), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 67/171 (39%), Positives = 96/171 (56%), Gaps = 9/171 (5%)
Query: 50 EWEDVRCPVCMDHPHNAVLLVCSSHEKGCRPFMCDTSYRHSNCFDQYRXXXXXXXXXXX- 108
EW+ CPVC++HPH+AVLL+C+SH KGCRP+MC T+++HSNC + ++
Sbjct: 48 EWKGASCPVCLEHPHDAVLLLCTSHHKGCRPYMCGTNHQHSNCLEHFKEAYAKEKLAHSV 107
Query: 109 --------XXXXXXXXXXXCQQPIKLSCPLCRGPVSHWTKDYDARKYLNVKVRACTKESC 160
Q ++L+CPLCRG V WT AR+YLN K RAC + C
Sbjct: 108 LIESSPGLSLSLNSQPASKQQCAMELACPLCRGDVKGWTVVEPARQYLNRKKRACMHDGC 167
Query: 161 EFRGAYGQLRRHARENHPTVRPTQVDPERQRDWHRMEQQRDLGDLFSMLRS 211
F G+Y +L +H HP+ +P +VDP +W + E +R+ D S +RS
Sbjct: 168 SFIGSYKELCKHVNSKHPSAKPREVDPAHADEWKKFECERERQDAISTIRS 218
>Os04g0448100 Protein of unknown function DUF1644 family protein
Length = 267
Score = 144 bits (364), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 64/160 (40%), Positives = 93/160 (58%), Gaps = 1/160 (0%)
Query: 50 EWEDVRCPVCMDHPHNAVLLVCSSHEKGCRPFMCDTSYRHSNCFDQYRXXXXXXXXXXXX 109
+W+ C +C++HPH AVLL+CSSH KGCRP+MCDT+ +HSNC +Q++
Sbjct: 46 DWKRATCSICLEHPHKAVLLLCSSHSKGCRPYMCDTNRQHSNCLEQFKNAYSRGKPACEL 105
Query: 110 XXXXXXXXXXCQQPIKLSCPLCRGPVSHWTKDYDARKYLNVKVRACTKESCEFRGAYGQL 169
Q+ ++L CP+CRG V WT AR++LN K R C E C F G+Y +L
Sbjct: 106 SGAVAQASKKPQE-MELVCPICRGDVKGWTVVEPARRFLNRKRRTCMHEGCSFGGSYRKL 164
Query: 170 RRHARENHPTVRPTQVDPERQRDWHRMEQQRDLGDLFSML 209
R H R NHP+ P ++D +W +E ++D D S++
Sbjct: 165 RNHVRSNHPSSNPREIDSASLAEWKELEYEKDRQDAISII 204
>Os06g0693700 Protein of unknown function DUF1644 family protein
Length = 316
Score = 138 bits (348), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 64/166 (38%), Positives = 94/166 (56%), Gaps = 4/166 (2%)
Query: 52 EDVRCPVCMDHPHNAVLLVCSSHEKGCRPFMCDTSYRHSNCFDQYRXXXXXXXXXXXXXX 111
EDV CP+C+D+PHNAVLL C+S+EKGCRPF+CDT SNC ++++
Sbjct: 29 EDVTCPICLDYPHNAVLLRCTSYEKGCRPFVCDTDQTRSNCLERFKGAYELPANMKVSTI 88
Query: 112 XXXXXXXXCQQPIKL----SCPLCRGPVSHWTKDYDARKYLNVKVRACTKESCEFRGAYG 167
+ SCPLCRG V W +AR +LN K R C ++ C F G +
Sbjct: 89 AVAPLDSIHIVAPNVNNRPSCPLCRGDVIGWIVIGEARLHLNQKKRCCEEDCCSFVGNFN 148
Query: 168 QLRRHARENHPTVRPTQVDPERQRDWHRMEQQRDLGDLFSMLRSGL 213
+L++H ++ HP RP+++DP RQ DW +Q D+ D+ S + + +
Sbjct: 149 ELQKHTQQKHPDSRPSEIDPARQVDWENFQQSSDIVDVLSTIHAQV 194
>Os09g0451800 Protein of unknown function DUF1644 family protein
Length = 231
Score = 135 bits (339), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 68/168 (40%), Positives = 95/168 (56%), Gaps = 16/168 (9%)
Query: 51 WEDVRCPVCMDHPHNAVLLVCSSHEKGCRPFMCDTSYRHSNCFDQYRXXXXXXXXXXXXX 110
W+D CPVC++ PHNAVLL+CSSH+KGCRP++C T+Y HSNC DQ
Sbjct: 44 WKDSICPVCLECPHNAVLLLCSSHDKGCRPYICATNYHHSNCLDQL-------------- 89
Query: 111 XXXXXXXXXCQ--QPIKLSCPLCRGPVSHWTKDYDARKYLNVKVRACTKESCEFRGAYGQ 168
C+ I+L+CPLCRG V +T AR+ LN R+C ++ C + G+YG+
Sbjct: 90 IDSRRSSKDCEDLDSIELTCPLCRGEVKGYTLVEPAREQLNQNKRSCMQDGCSYMGSYGE 149
Query: 169 LRRHARENHPTVRPTQVDPERQRDWHRMEQQRDLGDLFSMLRSGLSAR 216
L +H R+ HP+V+P VDP W R+ + L D+ S + R
Sbjct: 150 LCKHVRKKHPSVKPHSVDPVHTYRWRRLLFRSSLQDMICATSSPMVRR 197
>Os02g0150900 Protein of unknown function DUF1644 family protein
Length = 320
Score = 131 bits (330), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 91/164 (55%), Gaps = 4/164 (2%)
Query: 52 EDVRCPVCMDHPHNAVLLVCSSHEKGCRPFMCDTSYRHSNCFDQYRXX----XXXXXXXX 107
ED+ CP+C+D PHNAVLL C+S+EKGCRPF+CDT SNC ++++
Sbjct: 28 EDITCPICLDFPHNAVLLRCTSYEKGCRPFICDTDQSRSNCLERFKGAHGLPTNMKVPSF 87
Query: 108 XXXXXXXXXXXXCQQPIKLSCPLCRGPVSHWTKDYDARKYLNVKVRACTKESCEFRGAYG 167
+ +CPLCRG V W +AR +LN K R C + C + G +
Sbjct: 88 NGAPLDSIHIISSNTTDRPACPLCRGDVIGWVVIDEARLHLNQKKRCCEESCCSYVGNFH 147
Query: 168 QLRRHARENHPTVRPTQVDPERQRDWHRMEQQRDLGDLFSMLRS 211
+L++H ++ HP RP+++DP R+ DW +Q D+ D+ S + +
Sbjct: 148 ELQKHTQQKHPNSRPSEIDPARRVDWENFQQSSDIIDVLSTIHA 191
>Os07g0419800 Protein of unknown function DUF1644 family protein
Length = 265
Score = 86.3 bits (212), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 39/81 (48%), Positives = 54/81 (66%)
Query: 131 CRGPVSHWTKDYDARKYLNVKVRACTKESCEFRGAYGQLRRHARENHPTVRPTQVDPERQ 190
CRG VS W + RKYLN K+R C+ +SC+F G Y QLR HAR H +P VD R+
Sbjct: 16 CRGSVSGWIPAGEVRKYLNEKLRTCSHDSCKFVGTYEQLREHARTAHLLAKPAHVDLSRK 75
Query: 191 RDWHRMEQQRDLGDLFSMLRS 211
R W R+E+++++GD+ S +RS
Sbjct: 76 RTWDRLEREQEVGDVISAIRS 96
>AK065480
Length = 112
Score = 75.9 bits (185), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 29/46 (63%), Positives = 39/46 (84%)
Query: 52 EDVRCPVCMDHPHNAVLLVCSSHEKGCRPFMCDTSYRHSNCFDQYR 97
EDV CP+C+D+PHNAVLL C+S+EKGCRPF+CDT SNC ++++
Sbjct: 29 EDVTCPICLDYPHNAVLLRCTSYEKGCRPFVCDTDQTRSNCLERFK 74
Database: rap3
Posted date: Nov 19, 2010 6:03 PM
Number of letters in database: 17,035,801
Number of sequences in database: 52,214
Lambda     K      H
   0.319    0.134    0.424

Gapped
Lambda     K      H
   0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 52214
Number of Hits to DB: 10,869,772
Number of extensions: 415197
Number of successful extensions: 1066
Number of sequences better than 1.0e-10: 9
Number of HSP's gapped: 1063
Number of HSP's successfully gapped: 13
Length of query: 326
Length of database: 17,035,801
Length adjustment: 101
Effective length of query: 225
Effective length of database: 11,762,187
Effective search space: 2646492075
Effective search space used: 2646492075
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 156 (64.7 bits)
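
The footer statistics above are linked by simple arithmetic. The sketch below is a minimal check, not BLAST's own code. It assumes the standard Karlin-Altschul relations: bit score S' = (lambda*S - ln K) / ln 2 and E = (effective search space) * 2^(-S'). It also assumes the length adjustment is subtracted once from the query and once per database sequence, which reproduces the effective lengths printed in this report. The function and constant names are illustrative only. The Lambda and K values are the rounded ones printed above, so recomputed bit scores and E-values agree with the report only approximately, and compositional matrix adjustment can shift individual hits further.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LEN = 326
DB_LETTERS = 17_035_801
DB_SEQS = 52_214
LENGTH_ADJ = 101

# (Lambda, K) pairs as printed (rounded) in the report.
UNGAPPED = (0.319, 0.134)
GAPPED = (0.267, 0.0410)


def effective_search_space(query_len, db_letters, db_seqs, adj):
    """Return (effective query length, effective db length, search space).

    Assumption: the adjustment is removed once from the query and once
    from every database sequence.
    """
    eff_query = query_len - adj
    eff_db = db_letters - db_seqs * adj
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw score to bits: (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect(bits, search_space):
    """Approximate E-value for a bit score over a given search space."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    eff_q, eff_db, space = effective_search_space(
        QUERY_LEN, DB_LETTERS, DB_SEQS, LENGTH_ADJ
    )
    print("effective query length:", eff_q)
    print("effective database length:", eff_db)
    print("effective search space:", space)
    # S1 is an ungapped threshold, S2 a gapped one.
    print("S1 in bits:", round(bit_score(41, *UNGAPPED), 1))
    print("S2 in bits:", round(bit_score(156, *GAPPED), 1))
    # Order-of-magnitude estimate for the 158-bit hit.
    print("E for 158 bits: %.1e" % expect(158, space))
```

The three effective-length values are integer arithmetic and match the footer exactly. The E-value line is an order-of-magnitude check against the reported 6e-39, not an exact reproduction.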