BLASTP 2.2.23 [Feb-03-2010]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment:
Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala,
Aleksandr Morgulis, Alejandro A. Schäffer, and Yi-Kuo Yu (2005) "Protein database
searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Os05g0402300 Os05g0402300|AK070043
(264 letters)
Database: rap3
52,214 sequences; 17,035,801 total letters
Searching..................................................done
                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value
Os05g0402300 Protein of unknown function UPF0005 family pro...   438   e-123
Os07g0177300 Protein of unknown function UPF0005 family pro...   200   1e-51
Os03g0795800 Protein of unknown function UPF0005 family pro...   198   4e-51
Os03g0745600 Protein of unknown function UPF0005 family pro...   194   7e-50
Os03g0795600 Protein of unknown function UPF0005 family pro...   164   5e-41
Os07g0177200 Protein of unknown function UPF0005 family pro...   147   6e-36
Os11g0581900 Protein of unknown function UPF0005 family pro...   125   3e-29
>Os05g0402300 Protein of unknown function UPF0005 family protein
Length = 264
Score = 438 bits (1126), Expect = e-123, Method: Compositional matrix adjust.
Identities = 219/264 (82%), Positives = 219/264 (82%)
Query: 1   MASAAEMQPLAPAGYRRAPEMKEKVDASAVDLEAGTGETLYPGISRGESALRWGFVRKVY 60
           MASAAEMQPLAPAGYRRAPEMKEKVDASAVDLEAGTGETLYPGISRGESALRWGFVRKVY
Sbjct: 1   MASAAEMQPLAPAGYRRAPEMKEKVDASAVDLEAGTGETLYPGISRGESALRWGFVRKVY 60
Query: 61  GIXXXXXXXXXXXXXXXXXHXXXXXXXXXXXXXXXXXXXXXXXXXXXXYHYQHKHPHNFV 120
           GI                 H                            YHYQHKHPHNFV
Sbjct: 61  GILAAQLLLTTAVSALTVLHPTLNATLSSSPTLALVLAVLPFVLMVPLYHYQHKHPHNFV 120
Query: 121 YLGLFTLCLSFSIGVACANTQGKIVLEALILTSAVVASLTAYTFWASKKGKEFGYLGPIL 180
           YLGLFTLCLSFSIGVACANTQGKIVLEALILTSAVVASLTAYTFWASKKGKEFGYLGPIL
Sbjct: 121 YLGLFTLCLSFSIGVACANTQGKIVLEALILTSAVVASLTAYTFWASKKGKEFGYLGPIL 180
Query: 181 FSALVLLVVISFIQVFFPLGSGPVALFGGLGALVFSGFIIYDTENLIKRHTYDDYIWASV 240
           FSALVLLVVISFIQVFFPLGSGPVALFGGLGALVFSGFIIYDTENLIKRHTYDDYIWASV
Sbjct: 181 FSALVLLVVISFIQVFFPLGSGPVALFGGLGALVFSGFIIYDTENLIKRHTYDDYIWASV 240
Query: 241 ELYLDILNLFLYILNMIRSMQSDN 264
           ELYLDILNLFLYILNMIRSMQSDN
Sbjct: 241 ELYLDILNLFLYILNMIRSMQSDN 264
>Os07g0177300 Protein of unknown function UPF0005 family protein
Length = 244
Score = 200 bits (508), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 98/235 (41%), Positives = 145/235 (61%), Gaps = 7/235 (2%)
Query: 31 DLEAGT---GETLYPGISRGESALRWGFVRKVYGIXXXXXXXXXXXXXXXXXHXXXXXXX 87
D+EAGT LYPG++ +RW +RK+Y I
Sbjct: 9 DVEAGTSGGARELYPGMTE-PPEMRWALIRKIYVILSMQLLLTAAVAAVVVKVRAISHFF 67
Query: 88 XXXXX---XXXXXXXXXXXXXXXXYHYQHKHPHNFVYLGLFTLCLSFSIGVACANTQGKI 144
Y+Y KHP N + LGLFT+ +SF++G+ CA T GK+
Sbjct: 68 VSSHAGLGLYIFLIILPFIVLCPLYYYHQKHPVNLILLGLFTVAISFAVGMTCAFTSGKV 127
Query: 145 VLEALILTSAVVASLTAYTFWASKKGKEFGYLGPILFSALVLLVVISFIQVFFPLGSGPV 204
+LE+ ILT+ VV SLTAYTFWA+K+G++F +LGP LF++L++L+V +FIQ+ FPLG
Sbjct: 128 ILESAILTTVVVFSLTAYTFWAAKRGRDFSFLGPFLFASLIVLLVFAFIQILFPLGRISQ 187
Query: 205 ALFGGLGALVFSGFIIYDTENLIKRHTYDDYIWASVELYLDILNLFLYILNMIRS 259
++GG+ +L+FSG+I+YDT+N+IKR+TYD Y+WA+V LYLD++NLFL ++ + R+
Sbjct: 188 MIYGGIASLIFSGYIVYDTDNIIKRYTYDQYVWAAVSLYLDVINLFLSLMTLFRA 242
>Os03g0795800 Protein of unknown function UPF0005 family protein
Length = 241
Score = 198 bits (503), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 104/238 (43%), Positives = 145/238 (60%), Gaps = 7/238 (2%)
Query: 31 DLEAG-TGETLYPGISRGESALRWGFVRKVYGIXXXXXXXXXXXXXXXXXHXXXXXXXXX 89
DLEAG + E LYPG+ LRW + K+Y I
Sbjct: 7 DLEAGGSSEPLYPGMVESPD-LRWALIHKIYVILSVQLAMTAAVAAFVVKVRGVSEFFVS 65
Query: 90 X---XXXXXXXXXXXXXXXXXXYHYQHKHPHNFVYLGLFTLCLSFSIGVACANTQGKIVL 146
+Y KHP N + LGLFT+ +SF++G+ CA T GK++
Sbjct: 66 SNAGFALYIFLLFLPLIVLCPLRYYHQKHPVNLLLLGLFTVAISFAVGMTCAYTSGKVIF 125
Query: 147 EALILTSAVVASLTAYTFWASKKGKEFGYLGPILFSALVLLVVISFIQVFFPLGSGPVAL 206
EA LT+ VV SLTAYTFWA+K+G +F +LGP LFSA+++L++ S IQ+FFPLG +
Sbjct: 126 EAAALTAVVVISLTAYTFWAAKRGHDFNFLGPFLFSAVMVLILFSLIQIFFPLGKISEMI 185
Query: 207 FGGLGALVFSGFIIYDTENLIKRHTYDDYIWASVELYLDILNLFLYILNMIRSMQSDN 264
+GGL +LVFSG+IIYDT+N+IKR+TYD+Y+WA+V LYLD++NLFL +L ++R+ +DN
Sbjct: 186 YGGLASLVFSGYIIYDTDNIIKRYTYDEYVWAAVSLYLDVINLFLALLRVLRA--ADN 241
>Os03g0745600 Protein of unknown function UPF0005 family protein
Length = 249
Score = 194 bits (492), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 102/235 (43%), Positives = 143/235 (60%), Gaps = 4/235 (1%)
Query: 31 DLEAGTGETLYPGISRGESALRWGFVRKVYGIXXXXXXXXXXXXXXXXXHXXXXXXXXXX 90
D EAG LYP + LRW FVRKVY I
Sbjct: 16 DAEAGMARPLYPMMLESPQ-LRWAFVRKVYAILSIQMLLTIAVASVVVFVRPVALFFVST 74
Query: 91 ---XXXXXXXXXXXXXXXXXXYHYQHKHPHNFVYLGLFTLCLSFSIGVACANTQGKIVLE 147
Y+Y +HP N + L LFT +SF++G+ CA T+G+++LE
Sbjct: 75 PAGFALYIFLIILPFIVLCPLYYYYQRHPVNLLLLALFTAAISFAVGLTCAFTKGEVILE 134
Query: 148 ALILTSAVVASLTAYTFWASKKGKEFGYLGPILFSALVLLVVISFIQVFFPLGSGPVALF 207
+ ILT+AVV SLTAYTFWA+++G +F +LGP LF+A+++L+V + IQVFFPLG + ++
Sbjct: 135 SAILTAAVVVSLTAYTFWAARRGHDFSFLGPFLFAAVMILMVFALIQVFFPLGRVSLMIY 194
Query: 208 GGLGALVFSGFIIYDTENLIKRHTYDDYIWASVELYLDILNLFLYILNMIRSMQS 262
GGL ALVF G+I+YDT+NLIKR++YD+Y+WA+V LYLD++NLFL +L + R+ S
Sbjct: 195 GGLAALVFCGYIVYDTDNLIKRYSYDEYVWAAVALYLDVINLFLSLLTLFRASDS 249
>Os03g0795600 Protein of unknown function UPF0005 family protein
Length = 247
Score = 164 bits (416), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 85/240 (35%), Positives = 133/240 (55%), Gaps = 5/240 (2%)
Query: 24 KVDASAVDLEAGTGETLYPGISRGESALRWGFVRKVYGIXXXXXXXXXXXXXXXX----X 79
K D A G +YP + + LRW F+RKVY I
Sbjct: 6 KCDVEACYPGGAPGGGMYPYMIE-NAQLRWAFIRKVYVIVSVQLLVTVAVAGAVNLVEPI 64
Query: 80 HXXXXXXXXXXXXXXXXXXXXXXXXXXXXYHYQHKHPHNFVYLGLFTLCLSFSIGVACAN 139
++++KHP N +L LFT+C+SFS+G+ C +
Sbjct: 65 KTFFQARTPEVLVAYVIIIISPLIMMLPMIYFRNKHPINLFFLLLFTVCISFSVGLGCLS 124
Query: 140 TQGKIVLEALILTSAVVASLTAYTFWASKKGKEFGYLGPILFSALVLLVVISFIQVFFPL 199
G ++ +A +T+A+V LT YTFWA+K+G +F +LGP LF+A ++L + + I +F P+
Sbjct: 125 KNGTVIFQAAGMTAAIVIGLTCYTFWAAKRGYDFEFLGPFLFAATLVLFLYAIITIFLPM 184
Query: 200 GSGPVALFGGLGALVFSGFIIYDTENLIKRHTYDDYIWASVELYLDILNLFLYILNMIRS 259
G ++G + AL+FSGFIIYDT+NLIKR+TYD+Y+ A++ LYLDI+NLF+ ++ +++
Sbjct: 185 GRTGKLVYGCVAALIFSGFIIYDTDNLIKRYTYDEYVAAAITLYLDIINLFMALVTALQA 244
>Os07g0177200 Protein of unknown function UPF0005 family protein
Length = 247
Score = 147 bits (372), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 82/224 (36%), Positives = 117/224 (52%), Gaps = 7/224 (3%)
Query: 44 ISRGESALRWGFVRKVYGIXXXXXXXXXXXXXXX----XXHXXXXXXXXXXXXXXXXXXX 99
I R E LRW F+RKVY I
Sbjct: 26 IERPE--LRWAFIRKVYAIVATQLVVTVAIAAAVYSVPAIRRFFLARTPASLAAFVLVIV 83
Query: 100 XXXXXXXXXYHYQHKHPHNFVYLGLFTLCLSFSIGVACANTQGKI-VLEALILTSAVVAS 158
+ KHP N + L LFT+C+S +IG+ C +++ I ++EA LT +V
Sbjct: 84 APLIVMLPTMFLRKKHPINLILLALFTICMSCAIGLGCLSSKAGIAIIEAASLTFGLVFG 143
Query: 159 LTAYTFWASKKGKEFGYLGPILFSALVLLVVISFIQVFFPLGSGPVALFGGLGALVFSGF 218
LT YTFWA+K+G +F +L P L +A ++LV+ IQ+ P G ++G + ALVFSGF
Sbjct: 144 LTLYTFWAAKRGHDFSFLRPFLVAAFLVLVLYGLIQMLVPTGKVATTVYGCVAALVFSGF 203
Query: 219 IIYDTENLIKRHTYDDYIWASVELYLDILNLFLYILNMIRSMQS 262
IIYDT+NLIKRH YD+Y+ A++ LYLD +N+F+ I + + S
Sbjct: 204 IIYDTDNLIKRHAYDEYVTAAISLYLDTVNIFIAIFTALDASDS 247
>Os11g0581900 Protein of unknown function UPF0005 family protein
Length = 258
Score = 125 bits (315), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 66/141 (46%), Positives = 100/141 (70%)
Query: 111 YQHKHPHNFVYLGLFTLCLSFSIGVACANTQGKIVLEALILTSAVVASLTAYTFWASKKG 170
Y+ KHP N + LGLFTLC S +I V + GK+VL+A ILT+ V LT +TFWA+ +G
Sbjct: 111 YREKHPVNLLLLGLFTLCESLTIAVCSSTFLGKVVLQAAILTAVAVIGLTIFTFWAAHRG 170
Query: 171 KEFGYLGPILFSALVLLVVISFIQVFFPLGSGPVALFGGLGALVFSGFIIYDTENLIKRH 230
+F ++ P L ++L++L+ IQ+ FPLG + ++G L ++FS FI++DT LIKRH
Sbjct: 171 HDFTFMYPFLAASLLVLLAYLIIQICFPLGRAGMTIYGCLATVLFSAFIVFDTNQLIKRH 230
Query: 231 TYDDYIWASVELYLDILNLFL 251
TY++Y+ A++ LYLD++NLF+
Sbjct: 231 TYNEYVIAAISLYLDVINLFM 251
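Note on the alignment statistics: each "Identities / Positives / Gaps" line above gives a count, an alignment length, and a percentage. The percentages in this report look truncated rather than rounded (for example 98/235 is about 41.7% but is printed as 41%). The Python sketch below is not part of the BLAST output. It parses one such line and recomputes the percentages under that truncation assumption. The names STAT_RE, parse_stats and truncated_percent are hypothetical helpers introduced only for this illustration.

```python
import re

# Matches "Identities = 98/235 (41%)" style fields in a legacy BLAST report.
STAT_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line):
    """Return {field: (count, alignment_length, printed_percent)} for one line."""
    return {
        name: (int(count), int(length), int(percent))
        for name, count, length, percent in STAT_RE.findall(line)
    }


def truncated_percent(count, length):
    """Integer percentage, truncated toward zero (assumed report behaviour)."""
    return count * 100 // length


# Statistics line copied from the Os07g0177300 alignment above.
line = "Identities = 98/235 (41%), Positives = 145/235 (61%), Gaps = 7/235 (2%)"
stats = parse_stats(line)
for name, (count, length, printed) in stats.items():
    print(name, count, length, printed, truncated_percent(count, length))
```

The same check can be run over every statistics line in the report; a mismatch would suggest either a parsing problem or that the truncation assumption does not hold for that build of BLAST.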
Database: rap3
Posted date: Nov 19, 2010 6:03 PM
Number of letters in database: 17,035,801
Number of sequences in database: 52,214
Lambda     K      H
   0.325    0.141    0.427
Gapped
Lambda     K      H
   0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 52214
Number of Hits to DB: 6,867,512
Number of extensions: 241948
Number of successful extensions: 707
Number of sequences better than 1.0e-10: 7
Number of HSP's gapped: 706
Number of HSP's successfully gapped: 7
Length of query: 264
Length of database: 17,035,801
Length adjustment: 99
Effective length of query: 165
Effective length of database: 11,866,615
Effective search space: 1957991475
Effective search space used: 1957991475
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 155 (64.3 bits)
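Note on the search statistics: the footer values above are mutually consistent under the usual Karlin-Altschul relations. The Python sketch below is not part of the BLAST output. It assumes that the effective lengths are obtained by subtracting the length adjustment once from the query and once per database sequence, that the bit score is S' = (lambda * S - ln K) / ln 2 using the gapped parameters, and that E is approximately (effective search space) * 2^(-S'). The function names are hypothetical. The E-value is only approximate, because lambda and K are printed to three significant digits and the report uses compositional matrix adjustment.

```python
import math

# Values copied from the report footer.
QUERY_LEN = 264
DB_LETTERS = 17_035_801
DB_SEQS = 52_214
LENGTH_ADJUSTMENT = 99
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def effective_search_space(query_len, db_letters, db_seqs, adjustment):
    """Return (effective query length, effective database length, search space)."""
    eff_query = query_len - adjustment
    eff_db = db_letters - db_seqs * adjustment
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect(raw_score, search_space):
    """Approximate E-value for a raw score in the given search space."""
    return search_space * 2.0 ** (-bit_score(raw_score))


eff_query, eff_db, space = effective_search_space(
    QUERY_LEN, DB_LETTERS, DB_SEQS, LENGTH_ADJUSTMENT
)
print(eff_query, eff_db, space)

# Raw scores taken from the alignment headers of this report.
for raw in (1126, 508, 503, 492, 416, 372, 315):
    print(raw, int(bit_score(raw)), f"{expect(raw, space):.0e}")
```

With these inputs the computed lengths and search space match the footer, and truncating the computed bit scores reproduces the integer bit scores shown in the hit list. The truncation is an observation about this report, not a documented guarantee.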