BLASTP 2.2.23 [Feb-03-2010]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment:
Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala,
Aleksandr Morgulis, Alejandro A. Schäffer, and Yi-Kuo Yu (2005) "Protein database
searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Os02g0150900 Os02g0150900|AK121424
(320 letters)
Database: rap3
52,214 sequences; 17,035,801 total letters
Searching..................................................done
                                                                 Score  E
Sequences producing significant alignments:                     (bits)  Value
Os02g0150900 Protein of unknown function DUF1644 family pro...     516  e-146
Os06g0693700 Protein of unknown function DUF1644 family pro...     388  e-108
Os02g0566500 Protein of unknown function DUF1644 family pro...     170  1e-42
Os02g0770600 Protein of unknown function DUF1644 family pro...     161  5e-40
Os04g0448100 Protein of unknown function DUF1644 family pro...     153  1e-37
Os01g0612600 Protein of unknown function DUF1644 family pro...     139  2e-33
AK065480                                                           139  3e-33
Os09g0451800 Protein of unknown function DUF1644 family pro...     129  2e-30
Os07g0419800 Protein of unknown function DUF1644 family pro...      87  1e-17
>Os02g0150900 Protein of unknown function DUF1644 family protein
Length = 320
Score = 516 bits (1328), Expect = e-146, Method: Compositional matrix adjust.
Identities = 257/320 (80%), Positives = 257/320 (80%)
Query: 1 MGSGMXXXXXXXAGSFDLDAKLDKSWMEDITCPICLDFPHNAVLLRCTSYEKGCRPFICD 60
MGSGM AGSFDLDAKLDKSWMEDITCPICLDFPHNAVLLRCTSYEKGCRPFICD
Sbjct: 1 MGSGMVKKKVVKAGSFDLDAKLDKSWMEDITCPICLDFPHNAVLLRCTSYEKGCRPFICD 60
Query: 61 TDQSRSNCLERFKGAHGLPTNMKVPSFNGAPLDSIHIISSNTTDRPACPLCRGDVIGWVV 120
TDQSRSNCLERFKGAHGLPTNMKVPSFNGAPLDSIHIISSNTTDRPACPLCRGDVIGWVV
Sbjct: 61 TDQSRSNCLERFKGAHGLPTNMKVPSFNGAPLDSIHIISSNTTDRPACPLCRGDVIGWVV 120
Query: 121 IDEARLHLNQKKRCCEESCCSYVGNFHELQKHTQQKHPNSRPSEIDPARRVDWENFQQSS 180
IDEARLHLNQKKRCCEESCCSYVGNFHELQKHTQQKHPNSRPSEIDPARRVDWENFQQSS
Sbjct: 121 IDEARLHLNQKKRCCEESCCSYVGNFHELQKHTQQKHPNSRPSEIDPARRVDWENFQQSS 180
Query: 181 DIIDVLSTIHAQVPNGIVLGDYVIXXXXXXXXXXXXVYHRVRGNWWTSCIFCKSFCXXXX 240
DIIDVLSTIHAQVPNGIVLGDYVI VYHRVRGNWWTSCIFCKSFC
Sbjct: 181 DIIDVLSTIHAQVPNGIVLGDYVIEYGDDDAGDDYEVYHRVRGNWWTSCIFCKSFCRSSG 240
Query: 241 XXXXXXXXXXXXXXXXXXXXXXXXXFTIEVPSGSVXXXXXXXXXXXXXXXVTGAMPGIAA 300
FTIEVPSGSV VTGAMPGIAA
Sbjct: 241 GRSRARARERRSSGRRSSNRSSQESFTIEVPSGSVDIREIRFDEIDDEYIVTGAMPGIAA 300
Query: 301 SRRIASHYRDPRYGRRRSYY 320
SRRIASHYRDPRYGRRRSYY
Sbjct: 301 SRRIASHYRDPRYGRRRSYY 320
>Os06g0693700 Protein of unknown function DUF1644 family protein
Length = 316
Score = 388 bits (997), Expect = e-108, Method: Compositional matrix adjust.
Identities = 196/317 (61%), Positives = 222/317 (70%), Gaps = 3/317 (0%)
Query: 1 MGSG-MXXXXXXXAGSFDLDAKLDKSWMEDITCPICLDFPHNAVLLRCTSYEKGCRPFIC 59
MGSG + SFD D +LDKSW ED+TCPICLD+PHNAVLLRCTSYEKGCRPF+C
Sbjct: 1 MGSGNLMMKKVVRPSSFDFDIQLDKSWTEDVTCPICLDYPHNAVLLRCTSYEKGCRPFVC 60
Query: 60 DTDQSRSNCLERFKGAHGLPTNMKVPSFNGAPLDSIHIISSNTTDRPACPLCRGDVIGWV 119
DTDQ+RSNCLERFKGA+ LP NMKV + APLDSIHI++ N +RP+CPLCRGDVIGW+
Sbjct: 61 DTDQTRSNCLERFKGAYELPANMKVSTIAVAPLDSIHIVAPNVNNRPSCPLCRGDVIGWI 120
Query: 120 VIDEARLHLNQKKRCCEESCCSYVGNFHELQKHTQQKHPNSRPSEIDPARRVDWENFQQS 179
VI EARLHLNQKKRCCEE CCS+VGNF+ELQKHTQQKHP+SRPSEIDPAR+VDWENFQQS
Sbjct: 121 VIGEARLHLNQKKRCCEEDCCSFVGNFNELQKHTQQKHPDSRPSEIDPARQVDWENFQQS 180
Query: 180 SDIIDVLSTIHAQVPNGIVLGDYVIXXXXXXXXXXXXVYHRVRGNWWTSCIFCKSFCXXX 239
SDI+DVLSTIHAQVPNGIVLGDYVI V+ RVR +WW S +F + F
Sbjct: 181 SDIVDVLSTIHAQVPNGIVLGDYVIEYGDDETGEEYEVFRRVRRHWW-SFMFFRGFSRSS 239
Query: 240 XXXXXXXXXXXXXXXXXXXXXXXXXXFTIEVPSGSVXXXXXXXXXXXXXXXVTGAMPGIA 299
F +EVP+ SV VTGA+P IA
Sbjct: 240 RRRRRARARERRGSGRRNSNQAHLESFNLEVPTQSVDLREIRFDEIDDEYIVTGAIPSIA 299
Query: 300 ASRRIAS-HYRDPRYGR 315
R+AS HYRD RYGR
Sbjct: 300 TPGRMASFHYRDTRYGR 316
>Os02g0566500 Protein of unknown function DUF1644 family protein
Length = 362
Score = 170 bits (431), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 79/186 (42%), Positives = 116/186 (62%), Gaps = 6/186 (3%)
Query: 24 KSWMEDITCPICLDFPHNAVLLRCTSYEKGCRPFICDTDQSRSNCLERFKGAHGLPTNMK 83
K W + +CP+CL+ PH+AVLL CTS+ KGCRP++C T+ SNCLE FK A+
Sbjct: 47 KEW-KGASCPVCLEHPHDAVLLLCTSHHKGCRPYMCGTNHQHSNCLEHFKEAYAKEKLAH 105
Query: 84 VPSFNGAPLDSIHIISSNTTDRP-----ACPLCRGDVIGWVVIDEARLHLNQKKRCCEES 138
+P S+ + S + + ACPLCRGDV GW V++ AR +LN+KKR C
Sbjct: 106 SVLIESSPGLSLSLNSQPASKQQCAMELACPLCRGDVKGWTVVEPARQYLNRKKRACMHD 165
Query: 139 CCSYVGNFHELQKHTQQKHPNSRPSEIDPARRVDWENFQQSSDIIDVLSTIHAQVPNGIV 198
CS++G++ EL KH KHP+++P E+DPA +W+ F+ + D +STI + P ++
Sbjct: 166 GCSFIGSYKELCKHVNSKHPSAKPREVDPAHADEWKKFECERERQDAISTIRSMTPGAVI 225
Query: 199 LGDYVI 204
+GDYV+
Sbjct: 226 MGDYVV 231
>Os02g0770600 Protein of unknown function DUF1644 family protein
Length = 345
Score = 161 bits (408), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 80/189 (42%), Positives = 114/189 (60%), Gaps = 5/189 (2%)
Query: 18 LDAKLDKSWMEDITCPICLDFPHNAVLLRCTSYEKGCRPFICDTDQSRSNCLERFKGAHG 77
L A W ED C +C+++PHNAVLL C+S++KGCRP++C T SNCL++FK A+
Sbjct: 39 LPAAQKMDW-EDANCSVCMEYPHNAVLLLCSSHDKGCRPYMCGTSHRHSNCLDQFKKAYT 97
Query: 78 LPTNMKVPSFN--GAPLDSIHIIS--SNTTDRPACPLCRGDVIGWVVIDEARLHLNQKKR 133
++ N G LDS +I+ N + ACPLCRG V GW +++ AR +LN K+R
Sbjct: 98 KGALLEELPANTVGTNLDSTPLIAGEKNESVDLACPLCRGKVKGWTIVEPARSYLNGKRR 157
Query: 134 CCEESCCSYVGNFHELQKHTQQKHPNSRPSEIDPARRVDWENFQQSSDIIDVLSTIHAQV 193
C + CS+VG + EL+KH + +HP ++P E+DP W + + D LSTI A +
Sbjct: 158 TCMQDGCSFVGTYKELRKHVKSEHPLAKPREVDPILEQKWRLLEIERERQDALSTITATM 217
Query: 194 PNGIVLGDY 202
IV GDY
Sbjct: 218 GRAIVFGDY 226
>Os04g0448100 Protein of unknown function DUF1644 family protein
Length = 267
Score = 153 bits (387), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 75/186 (40%), Positives = 107/186 (57%), Gaps = 16/186 (8%)
Query: 24 KSWMEDITCPICLDFPHNAVLLRCTSYEKGCRPFICDTDQSRSNCLERFKGAHGLPTNMK 83
K W + TC ICL+ PH AVLL C+S+ KGCRP++CDT++ SNCLE+FK A+
Sbjct: 45 KDW-KRATCSICLEHPHKAVLLLCSSHSKGCRPYMCDTNRQHSNCLEQFKNAYS------ 97
Query: 84 VPSFNGAPLDSIHIISSNTTDRP-----ACPLCRGDVIGWVVIDEARLHLNQKKRCCEES 138
G P + + + +P CP+CRGDV GW V++ AR LN+K+R C
Sbjct: 98 ----RGKPACELSGAVAQASKKPQEMELVCPICRGDVKGWTVVEPARRFLNRKRRTCMHE 153
Query: 139 CCSYVGNFHELQKHTQQKHPNSRPSEIDPARRVDWENFQQSSDIIDVLSTIHAQVPNGIV 198
CS+ G++ +L+ H + HP+S P EID A +W+ + D D +S I A P +
Sbjct: 154 GCSFGGSYRKLRNHVRSNHPSSNPREIDSASLAEWKELEYEKDRQDAISIITALNPGSTI 213
Query: 199 LGDYVI 204
+GDY I
Sbjct: 214 MGDYFI 219
>Os01g0612600 Protein of unknown function DUF1644 family protein
Length = 326
Score = 139 bits (351), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 64/164 (39%), Positives = 96/164 (58%), Gaps = 4/164 (2%)
Query: 28 EDITCPICLDFPHNAVLLRCTSYEKGCRPFICDTDQSRSNCLERFKGAHGLPTNMKVPSF 87
ED+ CP+C+D PHNAVLL C+S+EKGCRPF+CDT SNC ++++ A + S
Sbjct: 52 EDVRCPVCMDHPHNAVLLVCSSHEKGCRPFMCDTSYRHSNCFDQYRKASKESSKDSGASA 111
Query: 88 NGAPLDSIHIISSNTTDRPACPLCRGDVIGWVVIDEARLHLNQKKRCCEESCCSYVGNFH 147
AP + +CPLCRG V W +AR +LN K R C + C + G +
Sbjct: 112 AAAP----ECSECQQPIKLSCPLCRGPVSHWTKDYDARKYLNVKVRACTKESCEFRGAYG 167
Query: 148 ELQKHTQQKHPNSRPSEIDPARRVDWENFQQSSDIIDVLSTIHA 191
+L++H ++ HP RP+++DP R+ DW +Q D+ D+ S + +
Sbjct: 168 QLRRHARENHPTVRPTQVDPERQRDWHRMEQQRDLGDLFSMLRS 211
>AK065480
Length = 112
Score = 139 bits (349), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 64/91 (70%), Positives = 72/91 (79%), Gaps = 1/91 (1%)
Query: 1 MGSG-MXXXXXXXAGSFDLDAKLDKSWMEDITCPICLDFPHNAVLLRCTSYEKGCRPFIC 59
MGSG + SFD D +LDKSW ED+TCPICLD+PHNAVLLRCTSYEKGCRPF+C
Sbjct: 1 MGSGNLMMKKVVRPSSFDFDIQLDKSWTEDVTCPICLDYPHNAVLLRCTSYEKGCRPFVC 60
Query: 60 DTDQSRSNCLERFKGAHGLPTNMKVPSFNGA 90
DTDQ+RSNCLERFKGA+ LP NMKV + A
Sbjct: 61 DTDQTRSNCLERFKGAYELPANMKVSTIAVA 91
>Os09g0451800 Protein of unknown function DUF1644 family protein
Length = 231
Score = 129 bits (325), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 66/167 (39%), Positives = 98/167 (58%), Gaps = 16/167 (9%)
Query: 20 AKLDKSWMEDITCPICLDFPHNAVLLRCTSYEKGCRPFICDTDQSRSNCLERFKGAHGLP 79
A +K +D CP+CL+ PHNAVLL C+S++KGCRP+IC T+ SNCL++
Sbjct: 37 AATEKCAWKDSICPVCLECPHNAVLLLCSSHDKGCRPYICATNYHHSNCLDQL------- 89
Query: 80 TNMKVPSFNGAPLDSIHIISSNTTDRPACPLCRGDVIGWVVIDEARLHLNQKKRCCEESC 139
+ + S + LDSI + CPLCRG+V G+ +++ AR LNQ KR C +
Sbjct: 90 IDSRRSSKDCEDLDSIEL---------TCPLCRGEVKGYTLVEPAREQLNQNKRSCMQDG 140
Query: 140 CSYVGNFHELQKHTQQKHPNSRPSEIDPARRVDWENFQQSSDIIDVL 186
CSY+G++ EL KH ++KHP+ +P +DP W S + D++
Sbjct: 141 CSYMGSYGELCKHVRKKHPSVKPHSVDPVHTYRWRRLLFRSSLQDMI 187
>Os07g0419800 Protein of unknown function DUF1644 family protein
Length = 265
Score = 87.4 bits (215), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 35/94 (37%), Positives = 59/94 (62%)
Query: 111 CRGDVIGWVVIDEARLHLNQKKRCCEESCCSYVGNFHELQKHTQQKHPNSRPSEIDPARR 170
CRG V GW+ E R +LN+K R C C +VG + +L++H + H ++P+ +D +R+
Sbjct: 16 CRGSVSGWIPAGEVRKYLNEKLRTCSHDSCKFVGTYEQLREHARTAHLLAKPAHVDLSRK 75
Query: 171 VDWENFQQSSDIIDVLSTIHAQVPNGIVLGDYVI 204
W+ ++ ++ DV+S I +Q P I++GDYVI
Sbjct: 76 RTWDRLEREQEVGDVISAIRSQNPGAIIVGDYVI 109
Database: rap3
Posted date: Nov 19, 2010 6:03 PM
Number of letters in database: 17,035,801
Number of sequences in database: 52,214
Lambda K H
0.323 0.138 0.457
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 52214
Number of Hits to DB: 9,853,371
Number of extensions: 388909
Number of successful extensions: 769
Number of sequences better than 1.0e-10: 9
Number of HSP's gapped: 762
Number of HSP's successfully gapped: 9
Length of query: 320
Length of database: 17,035,801
Length adjustment: 101
Effective length of query: 219
Effective length of database: 11,762,187
Effective search space: 2575918953
Effective search space used: 2575918953
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 156 (64.7 bits)
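The summary statistics above can be cross-checked with a little arithmetic. The sketch below is not part of the BLAST output. It assumes the usual Karlin-Altschul conventions: the length adjustment is subtracted once from the query and once per database sequence, and a raw score S converts to bits as (lambda * S - ln K) / ln 2. The variable and function names are illustrative only, and because the report prints lambda and K to a few significant digits, the recomputed bit scores agree with the report only approximately.

```python
import math

# Values copied from the report footer above.
query_len = 320
db_letters = 17_035_801
db_seqs = 52_214
length_adjustment = 101

# Assumed convention: the adjustment is removed from the query once
# and from the database once per sequence.
eff_query = query_len - length_adjustment
eff_db = db_letters - db_seqs * length_adjustment
search_space = eff_query * eff_db

# (lambda, K) pairs as printed in the report.
UNGAPPED = (0.323, 0.138)
GAPPED = (0.267, 0.0410)


def bit_score(raw, lam, k):
    """Convert a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw - math.log(k)) / math.log(2)


def dropoff_bits(raw, lam):
    """Convert an X-dropoff value to bits: lambda * X / ln 2 (no K term)."""
    return lam * raw / math.log(2)


print("effective query length:   ", eff_query)
print("effective database length:", eff_db)
print("effective search space:   ", search_space)
print("top hit, raw 1328 (gapped):", round(bit_score(1328, *GAPPED), 1), "bits")
print("S1, raw 41 (ungapped):     ", round(bit_score(41, *UNGAPPED), 1), "bits")
print("S2, raw 156 (gapped):      ", round(bit_score(156, *GAPPED), 1), "bits")
print("X1, raw 16 (ungapped):     ", round(dropoff_bits(16, UNGAPPED[0]), 1), "bits")
```

The three integer results reproduce the "Effective length of query", "Effective length of database", and "Effective search space" lines exactly. The bit scores land within rounding of the reported values.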