BLASTP 2.2.23 [Feb-03-2010]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment:
Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala,
Aleksandr Morgulis, Alejandro A. Schäffer, and Yi-Kuo Yu (2005) "Protein database
searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Os02g0770600 Os02g0770600|AK066977
(345 letters)
Database: rap3
52,214 sequences; 17,035,801 total letters
Searching..................................................done
                                                                 Score     E
Sequences producing significant alignments:                     (bits)   Value
Os02g0770600 Protein of unknown function DUF1644 family pro...     612   e-175
Os02g0566500 Protein of unknown function DUF1644 family pro...     238   7e-63
Os04g0448100 Protein of unknown function DUF1644 family pro...     222   3e-58
Os09g0451800 Protein of unknown function DUF1644 family pro...     187   8e-48
Os06g0693700 Protein of unknown function DUF1644 family pro...     167   1e-41
Os01g0612600 Protein of unknown function DUF1644 family pro...     166   2e-41
Os02g0150900 Protein of unknown function DUF1644 family pro...     156   2e-38
Os07g0419800 Protein of unknown function DUF1644 family pro...      86   5e-17
AK065480                                                            81   1e-15
>Os02g0770600 Protein of unknown function DUF1644 family protein
Length = 345
Score = 612 bits (1577), Expect = e-175, Method: Compositional matrix adjust.
Identities = 296/332 (89%), Positives = 296/332 (89%)
Query: 14  QLRSAPYPIPSYRWKAMKESNRKKTLPAAQKMDWEDANCSVCMEYPHNAVLLLCSSHDKG 73
           QLRSAPYPIPSYRWKAMKESNRKKTLPAAQKMDWEDANCSVCMEYPHNAVLLLCSSHDKG
Sbjct: 14  QLRSAPYPIPSYRWKAMKESNRKKTLPAAQKMDWEDANCSVCMEYPHNAVLLLCSSHDKG 73
Query: 74  CRPYMCGTSHRHSNCLDQFKKAYTKGALLEELPANTVGTNLDSTPLIAGEKNESVDLACP 133
           CRPYMCGTSHRHSNCLDQFKKAYTKGALLEELPANTVGTNLDSTPLIAGEKNESVDLACP
Sbjct: 74  CRPYMCGTSHRHSNCLDQFKKAYTKGALLEELPANTVGTNLDSTPLIAGEKNESVDLACP 133
Query: 134 LCRGKVKGWTIVEPARSYLNGKRRTCMQDGCSFVGTYKELRKHVKSEHPLAKPREVDPIL 193
           LCRGKVKGWTIVEPARSYLNGKRRTCMQDGCSFVGTYKELRKHVKSEHPLAKPREVDPIL
Sbjct: 134 LCRGKVKGWTIVEPARSYLNGKRRTCMQDGCSFVGTYKELRKHVKSEHPLAKPREVDPIL 193
Query: 194 EQKWRLLEIERERQDALSTITATMGRAIVFGXXXXXXXXXXXXXXXXXXXXXNANGHGTD 253
           EQKWRLLEIERERQDALSTITATMGRAIVFG                     NANGHGTD
Sbjct: 194 EQKWRLLEIERERQDALSTITATMGRAIVFGDYVLDLEDEDDLDDVESDEDDNANGHGTD 253
Query: 254 NTRRMLMFLMRQVARHHQNQRLQNAIGTTGGAEDNYAVSSGANATTPYHYPLEGDDEDDL 313
           NTRRMLMFLMRQVARHHQNQRLQNAIGTTGGAEDNYAVSSGANATTPYHYPLEGDDEDDL
Sbjct: 254 NTRRMLMFLMRQVARHHQNQRLQNAIGTTGGAEDNYAVSSGANATTPYHYPLEGDDEDDL 313
Query: 314 VMAGGGSTGMVXXXXXXXXXXXXXXXLFLGAN 345
           VMAGGGSTGMV               LFLGAN
Sbjct: 314 VMAGGGSTGMVRPERRRRRRRRNRERLFLGAN 345
>Os02g0566500 Protein of unknown function DUF1644 family protein
Length = 362
Score = 238 bits (606), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 112/208 (53%), Positives = 147/208 (70%), Gaps = 9/208 (4%)
Query: 22 IPSYRWKAMKESNRKKTLPAAQKMDWEDANCSVCMEYPHNAVLLLCSSHDKGCRPYMCGT 81
+PS WK + ++K +W+ A+C VC+E+PH+AVLLLC+SH KGCRPYMCGT
Sbjct: 24 LPSCCWKMKGTCEQNDIALVSEKKEWKGASCPVCLEHPHDAVLLLCTSHHKGCRPYMCGT 83
Query: 82 SHRHSNCLDQFKKAYTK-----GALLEELPANTVGTNLDSTPLIAGEKNESVDLACPLCR 136
+H+HSNCL+ FK+AY K L+E P + +L+S P A ++ +++LACPLCR
Sbjct: 84 NHQHSNCLEHFKEAYAKEKLAHSVLIESSPG--LSLSLNSQP--ASKQQCAMELACPLCR 139
Query: 137 GKVKGWTIVEPARSYLNGKRRTCMQDGCSFVGTYKELRKHVKSEHPLAKPREVDPILEQK 196
G VKGWT+VEPAR YLN K+R CM DGCSF+G+YKEL KHV S+HP AKPREVDP +
Sbjct: 140 GDVKGWTVVEPARQYLNRKKRACMHDGCSFIGSYKELCKHVNSKHPSAKPREVDPAHADE 199
Query: 197 WRLLEIERERQDALSTITATMGRAIVFG 224
W+ E ERERQDA+STI + A++ G
Sbjct: 200 WKKFECERERQDAISTIRSMTPGAVIMG 227
>Os04g0448100 Protein of unknown function DUF1644 family protein
Length = 267
Score = 222 bits (566), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 103/203 (50%), Positives = 136/203 (66%), Gaps = 9/203 (4%)
Query: 22 IPSYRWKAMKESNRKKTLPAAQKMDWEDANCSVCMEYPHNAVLLLCSSHDKGCRPYMCGT 81
+PSY K K S ++K DW+ A CS+C+E+PH AVLLLCSSH KGCRPYMC T
Sbjct: 22 VPSYYQKTKKASKENGLQLTSEKKDWKRATCSICLEHPHKAVLLLCSSHSKGCRPYMCDT 81
Query: 82 SHRHSNCLDQFKKAYTKGALLEELPANTVGTNLDSTPLIAGEKNESVDLACPLCRGKVKG 141
+ +HSNCL+QFK AY++G EL A +K + ++L CP+CRG VKG
Sbjct: 82 NRQHSNCLEQFKNAYSRGKPACELSGAVAQ---------ASKKPQEMELVCPICRGDVKG 132
Query: 142 WTIVEPARSYLNGKRRTCMQDGCSFVGTYKELRKHVKSEHPLAKPREVDPILEQKWRLLE 201
WT+VEPAR +LN KRRTCM +GCSF G+Y++LR HV+S HP + PRE+D +W+ LE
Sbjct: 133 WTVVEPARRFLNRKRRTCMHEGCSFGGSYRKLRNHVRSNHPSSNPREIDSASLAEWKELE 192
Query: 202 IERERQDALSTITATMGRAIVFG 224
E++RQDA+S ITA + + G
Sbjct: 193 YEKDRQDAISIITALNPGSTIMG 215
>Os09g0451800 Protein of unknown function DUF1644 family protein
Length = 231
Score = 187 bits (476), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 88/187 (47%), Positives = 118/187 (63%), Gaps = 20/187 (10%)
Query: 37 KTLPAAQKMDWEDANCSVCMEYPHNAVLLLCSSHDKGCRPYMCGTSHRHSNCLDQFKKAY 96
K L A +K W+D+ C VC+E PHNAVLLLCSSHDKGCRPY+C T++ HSNCLDQ +
Sbjct: 34 KELAATEKCAWKDSICPVCLECPHNAVLLLCSSHDKGCRPYICATNYHHSNCLDQLIDSR 93
Query: 97 TKGALLEELPANTVGTNLDSTPLIAGEKNESVDLACPLCRGKVKGWTIVEPARSYLNGKR 156
E+L +S++L CPLCRG+VKG+T+VEPAR LN +
Sbjct: 94 RSSKDCEDL--------------------DSIELTCPLCRGEVKGYTLVEPAREQLNQNK 133
Query: 157 RTCMQDGCSFVGTYKELRKHVKSEHPLAKPREVDPILEQKWRLLEIERERQDALSTITAT 216
R+CMQDGCS++G+Y EL KHV+ +HP KP VDP+ +WR L QD + ++
Sbjct: 134 RSCMQDGCSYMGSYGELCKHVRKKHPSVKPHSVDPVHTYRWRRLLFRSSLQDMICATSSP 193
Query: 217 MGRAIVF 223
M R +++
Sbjct: 194 MVRRVLY 200
>Os06g0693700 Protein of unknown function DUF1644 family protein
Length = 316
Score = 167 bits (422), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 87/203 (42%), Positives = 121/203 (59%), Gaps = 17/203 (8%)
Query: 30 MKESNRKKTLPAAQKMD--W-EDANCSVCMEYPHNAVLLLCSSHDKGCRPYMCGTSHRHS 86
MK+ R + ++D W ED C +C++YPHNAVLL C+S++KGCRP++C T S
Sbjct: 8 MKKVVRPSSFDFDIQLDKSWTEDVTCPICLDYPHNAVLLRCTSYEKGCRPFVCDTDQTRS 67
Query: 87 NCLDQFKKAYTKGALLEELPAN----TVGTN-LDSTPLIAGEKNESVDLACPLCRGKVKG 141
NCL++FK AY ELPAN T+ LDS ++A N +CPLCRG V G
Sbjct: 68 NCLERFKGAY-------ELPANMKVSTIAVAPLDSIHIVAPNVNNRP--SCPLCRGDVIG 118
Query: 142 WTIVEPARSYLNGKRRTCMQDGCSFVGTYKELRKHVKSEHPLAKPREVDPILEQKWRLLE 201
W ++ AR +LN K+R C +D CSFVG + EL+KH + +HP ++P E+DP + W +
Sbjct: 119 WIVIGEARLHLNQKKRCCEEDCCSFVGNFNELQKHTQQKHPDSRPSEIDPARQVDWENFQ 178
Query: 202 IERERQDALSTITATMGRAIVFG 224
+ D LSTI A + IV G
Sbjct: 179 QSSDIVDVLSTIHAQVPNGIVLG 201
>Os01g0612600 Protein of unknown function DUF1644 family protein
Length = 326
Score = 166 bits (420), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 75/172 (43%), Positives = 107/172 (62%), Gaps = 8/172 (4%)
Query: 46 DWEDANCSVCMEYPHNAVLLLCSSHDKGCRPYMCGTSHRHSNCLDQFKKAYTKGALLEEL 105
+WED C VCM++PHNAVLL+CSSH+KGCRP+MC TS+RHSNC DQ++KA +
Sbjct: 50 EWEDVRCPVCMDHPHNAVLLVCSSHEKGCRPFMCDTSYRHSNCFDQYRKASKES------ 103
Query: 106 PANTVGTNLDSTPLIAGEKNESVDLACPLCRGKVKGWTIVEPARSYLNGKRRTCMQDGCS 165
+ G + + P + E + + L+CPLCRG V WT AR YLN K R C ++ C
Sbjct: 104 -SKDSGASAAAAPECS-ECQQPIKLSCPLCRGPVSHWTKDYDARKYLNVKVRACTKESCE 161
Query: 166 FVGTYKELRKHVKSEHPLAKPREVDPILEQKWRLLEIERERQDALSTITATM 217
F G Y +LR+H + HP +P +VDP ++ W +E +R+ D S + + +
Sbjct: 162 FRGAYGQLRRHARENHPTVRPTQVDPERQRDWHRMEQQRDLGDLFSMLRSGL 213
>Os02g0150900 Protein of unknown function DUF1644 family protein
Length = 320
Score = 156 bits (395), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 78/187 (41%), Positives = 112/187 (59%), Gaps = 5/187 (2%)
Query: 39 LPAAQKMDW-EDANCSVCMEYPHNAVLLLCSSHDKGCRPYMCGTSHRHSNCLDQFKKAYT 97
L A W ED C +C+++PHNAVLL C+S++KGCRP++C T SNCL++FK A+
Sbjct: 18 LDAKLDKSWMEDITCPICLDFPHNAVLLRCTSYEKGCRPFICDTDQSRSNCLERFKGAHG 77
Query: 98 KGALLEELPANTVGTNLDSTPLIAGEKNESVDLACPLCRGKVKGWTIVEPARSYLNGKRR 157
++ N G LDS +I+ N + ACPLCRG V GW +++ AR +LN K+R
Sbjct: 78 LPTNMKVPSFN--GAPLDSIHIISS--NTTDRPACPLCRGDVIGWVVIDEARLHLNQKKR 133
Query: 158 TCMQDGCSFVGTYKELRKHVKSEHPLAKPREVDPILEQKWRLLEIERERQDALSTITATM 217
C + CS+VG + EL+KH + +HP ++P E+DP W + + D LSTI A +
Sbjct: 134 CCEESCCSYVGNFHELQKHTQQKHPNSRPSEIDPARRVDWENFQQSSDIIDVLSTIHAQV 193
Query: 218 GRAIVFG 224
IV G
Sbjct: 194 PNGIVLG 200
>Os07g0419800 Protein of unknown function DUF1644 family protein
Length = 265
Score = 85.9 bits (211), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 45/98 (45%), Positives = 56/98 (57%), Gaps = 1/98 (1%)
Query: 127 SVDLACPLCRGKVKGWTIVEPARSYLNGKRRTCMQDGCSFVGTYKELRKHVKSEHPLAKP 186
SV P CRG V GW R YLN K RTC D C FVGTY++LR+H ++ H LAKP
Sbjct: 9 SVHTTAP-CRGSVSGWIPAGEVRKYLNEKLRTCSHDSCKFVGTYEQLREHARTAHLLAKP 67
Query: 187 REVDPILEQKWRLLEIERERQDALSTITATMGRAIVFG 224
VD ++ W LE E+E D +S I + AI+ G
Sbjct: 68 AHVDLSRKRTWDRLEREQEVGDVISAIRSQNPGAIIVG 105
>AK065480
Length = 112
Score = 81.3 bits (199), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 39/82 (47%), Positives = 53/82 (64%), Gaps = 10/82 (12%)
Query: 30 MKESNRKKTLPAAQKMD--W-EDANCSVCMEYPHNAVLLLCSSHDKGCRPYMCGTSHRHS 86
MK+ R + ++D W ED C +C++YPHNAVLL C+S++KGCRP++C T S
Sbjct: 8 MKKVVRPSSFDFDIQLDKSWTEDVTCPICLDYPHNAVLLRCTSYEKGCRPFVCDTDQTRS 67
Query: 87 NCLDQFKKAYTKGALLEELPAN 108
NCL++FK AY ELPAN
Sbjct: 68 NCLERFKGAY-------ELPAN 82
Database: rap3
Posted date: Nov 19, 2010 6:03 PM
Number of letters in database: 17,035,801
Number of sequences in database: 52,214
Lambda     K        H
 0.317    0.132    0.409
Gapped
Lambda     K        H
 0.267    0.0410   0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 52214
Number of Hits to DB: 11,452,802
Number of extensions: 479281
Number of successful extensions: 1157
Number of sequences better than 1.0e-10: 9
Number of HSP's gapped: 1150
Number of HSP's successfully gapped: 9
Length of query: 345
Length of database: 17,035,801
Length adjustment: 102
Effective length of query: 243
Effective length of database: 11,709,973
Effective search space: 2845523439
Effective search space used: 2845523439
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 156 (64.7 bits)
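The footer statistics above can be cross-checked with a few lines of arithmetic. The sketch below is not part of the BLAST output; the function names are hypothetical. It assumes the effective lengths are obtained by subtracting the length adjustment once from the query and once per database sequence (which reproduces the figures printed above), and it uses the standard Karlin-Altschul conversion S' = (lambda*S - ln K) / ln 2 for bit scores. The E-value estimate m*n*2^(-S') uses the rounded bit score, so it should only be expected to match the reported value approximately.

```python
import math

# Values copied from the report footer above.
QUERY_LEN = 345
DB_LETTERS = 17_035_801
DB_SEQS = 52_214
LENGTH_ADJUSTMENT = 102
LAMBDA_UNGAPPED, K_UNGAPPED = 0.317, 0.132
LAMBDA_GAPPED, K_GAPPED = 0.267, 0.0410


def effective_search_space(query_len, db_letters, db_seqs, adjustment):
    """Return (effective query length, effective database length, their product).

    Assumption: the adjustment is removed once from the query and once
    from every database sequence.
    """
    eff_query = query_len - adjustment
    eff_db = db_letters - db_seqs * adjustment
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits: (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect(bits, search_space):
    """Approximate E-value for a bit score: search_space * 2**(-bits)."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    eff_q, eff_db, space = effective_search_space(
        QUERY_LEN, DB_LETTERS, DB_SEQS, LENGTH_ADJUSTMENT
    )
    print("effective query length:", eff_q)
    print("effective database length:", eff_db)
    print("effective search space:", space)

    # Raw score 1577 is the self-hit; the report lists it as 612 bits.
    print("self-hit bits: %.1f" % bit_score(1577, LAMBDA_GAPPED, K_GAPPED))
    # S1 is an ungapped threshold, S2 a gapped one (21.7 and 64.7 bits above).
    print("S1 bits: %.1f" % bit_score(41, LAMBDA_UNGAPPED, K_UNGAPPED))
    print("S2 bits: %.1f" % bit_score(156, LAMBDA_GAPPED, K_GAPPED))
    # Second hit: 238 bits, reported Expect = 7e-63.
    print("approx. E for 238 bits: %.1e" % expect(238, space))
```

Comparing the printed values with the "Effective search space", "S1", "S2" and per-hit "Score" lines is a quick way to confirm which Lambda/K pair (ungapped or gapped) a given figure in the report was derived from.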