BLASTP 2.2.23 [Feb-03-2010]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment:
Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala,
Aleksandr Morgulis, Alejandro A. Schäffer, and Yi-Kuo Yu (2005) "Protein database
searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Os04g0553800 Os04g0553800|AK067441
(428 letters)
Database: rap3
52,214 sequences; 17,035,801 total letters
Searching..................................................done
                                                                 Score  E
Sequences producing significant alignments:                     (bits)  Value
Os04g0553800 Glycosyl transferase, family 8 protein                879  0.0
Os10g0555100 Similar to DNA chromosome 4, ESSA I CONTIG fra...     644  0.0
Os02g0624400 Glycosyl transferase, family 8 protein                113  3e-25
Os03g0184300 Glycosyl transferase, family 8 protein                 76  4e-14
Os01g0880200 Glycosyl transferase, family 8 protein                 69  5e-12
>Os04g0553800 Glycosyl transferase, family 8 protein
Length = 428
Score = 879 bits (2271), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 428/428 (100%), Positives = 428/428 (100%)
Query: 1 MMYMGTPRDYEFYVATRVMMRSLGRLGSDADRVVIASVDVPPRWVQALKDDGVKVVSVEN 60
MMYMGTPRDYEFYVATRVMMRSLGRLGSDADRVVIASVDVPPRWVQALKDDGVKVVSVEN
Sbjct: 1 MMYMGTPRDYEFYVATRVMMRSLGRLGSDADRVVIASVDVPPRWVQALKDDGVKVVSVEN 60
Query: 61 LKNPYEKQGNFNMRFKLTLNKLYAWSLVSYDRVVMLDSDNIFLQNTDELFQCGQFCAVFI 120
LKNPYEKQGNFNMRFKLTLNKLYAWSLVSYDRVVMLDSDNIFLQNTDELFQCGQFCAVFI
Sbjct: 61 LKNPYEKQGNFNMRFKLTLNKLYAWSLVSYDRVVMLDSDNIFLQNTDELFQCGQFCAVFI 120
Query: 121 NPCIFHTGLFVLQPSMDVFKNMLHELAVGRDNPDGADQGFLASYFPDLLDRPMFHPPVNG 180
NPCIFHTGLFVLQPSMDVFKNMLHELAVGRDNPDGADQGFLASYFPDLLDRPMFHPPVNG
Sbjct: 121 NPCIFHTGLFVLQPSMDVFKNMLHELAVGRDNPDGADQGFLASYFPDLLDRPMFHPPVNG 180
Query: 181 TKLEGTYRLPLGYQMDASYYYLKLRWSIPCGPNSVITFPSAPWFKPWYWWSWPVLPLGLS 240
TKLEGTYRLPLGYQMDASYYYLKLRWSIPCGPNSVITFPSAPWFKPWYWWSWPVLPLGLS
Sbjct: 181 TKLEGTYRLPLGYQMDASYYYLKLRWSIPCGPNSVITFPSAPWFKPWYWWSWPVLPLGLS 240
Query: 241 WHEQRRENLGYSSELPVVLIQALFYIGVIAVTRLARPSLSKMCYNRRMEKSTIVLLTTLR 300
WHEQRRENLGYSSELPVVLIQALFYIGVIAVTRLARPSLSKMCYNRRMEKSTIVLLTTLR
Sbjct: 241 WHEQRRENLGYSSELPVVLIQALFYIGVIAVTRLARPSLSKMCYNRRMEKSTIVLLTTLR 300
Query: 301 VVAAWSILAAYTIPFFLIPRTVHPLLGWPLYLLGAFSFSSIVINVFLLHPLAVLTTWLGI 360
VVAAWSILAAYTIPFFLIPRTVHPLLGWPLYLLGAFSFSSIVINVFLLHPLAVLTTWLGI
Sbjct: 301 VVAAWSILAAYTIPFFLIPRTVHPLLGWPLYLLGAFSFSSIVINVFLLHPLAVLTTWLGI 360
Query: 361 IGALFVMAFPWYLNGVVRALAVFAYAFCCAPLIWGSLVKTMSSLQILIERDAFRLGEPNQ 420
IGALFVMAFPWYLNGVVRALAVFAYAFCCAPLIWGSLVKTMSSLQILIERDAFRLGEPNQ
Sbjct: 361 IGALFVMAFPWYLNGVVRALAVFAYAFCCAPLIWGSLVKTMSSLQILIERDAFRLGEPNQ 420
Query: 421 TAEFTKLY 428
TAEFTKLY
Sbjct: 421 TAEFTKLY 428
>Os10g0555100 Similar to DNA chromosome 4, ESSA I CONTIG fragment NO. 6
(Glucosyltransferase like protein)
Length = 492
Score = 644 bits (1662), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 300/432 (69%), Positives = 355/432 (82%), Gaps = 4/432 (0%)
Query: 1 MMYMGTPRDYEFYVATRVMMRSLGRLGSDADRVVIASVDVPPRWVQALKD-DGVKVVSVE 59
MMYMGTPRDYEFYVA RVMMRSL R+G+DADRV+IAS DVP WV+A+++ DG++VV VE
Sbjct: 61 MMYMGTPRDYEFYVAVRVMMRSLARIGADADRVLIASADVPADWVRAMREEDGMRVVLVE 120
Query: 60 NLKNPYEKQ-GNFNMRFKLTLNKLYAWSLVSYDRVVMLDSDNIFLQNTDELFQCGQFCAV 118
N+KNPYE G N RFKLTLNKLYAW+LV Y+RVVM+DSDNIFLQ TDELFQCGQFCAV
Sbjct: 121 NMKNPYESNLGGINRRFKLTLNKLYAWTLVDYERVVMIDSDNIFLQKTDELFQCGQFCAV 180
Query: 119 FINPCIFHTGLFVLQPSMDVFKNMLHELAVGRDNPDGADQGFLASYFPDLLDRPMFHPPV 178
FINPC FHTGLFVLQPSMDVFK MLH+L +GR N DGADQGFL +PDLLDRPMFHPP
Sbjct: 181 FINPCYFHTGLFVLQPSMDVFKGMLHDLEIGRANSDGADQGFLVGCYPDLLDRPMFHPPE 240
Query: 179 NGTKLEGTYRLPLGYQMDASYYYLKLRWSIPCGPNSVITFPSAPWFKPWYWWSWPVLPLG 238
NG+KL GTYRLPLGYQMDASYYYLKL W +PCGPNSVITFPSAPWFKPWYWWSWP+LPLG
Sbjct: 241 NGSKLNGTYRLPLGYQMDASYYYLKLHWHVPCGPNSVITFPSAPWFKPWYWWSWPILPLG 300
Query: 239 LSWHEQRRENLGYSSELPVVLIQALFYIGVIAVTRLARPSLSKMCYNRRMEKSTIVLLTT 298
LSWH+QR ++LGY++E+PV+L++ L Y +I +TRLA+P ++K+CYNRR EK ++
Sbjct: 301 LSWHKQRWDDLGYAAEMPVILMEILMYAVIITITRLAKPGMTKLCYNRRPEKQNAMVQGL 360
Query: 299 LRVVAAWSILAAYTIPFFLIPRTVHPLLGWPLYLLGAFSFSSIVINVFLLHPLAVLTTWL 358
+++ A ++L AY IPFF+IPRTVHP +GW +YL GA + +V N FLL LAVLT WL
Sbjct: 361 IKMSAIVAMLIAYAIPFFIIPRTVHPFMGWSMYLFGALALGVLVSNAFLLPLLAVLTPWL 420
Query: 359 GIIGALFVMAFPWYLNGVVRALAVFAYAFCCAPLIWGSLVKTMSSLQILIERDAF--RLG 416
IIG FVMAFPWY G+VR LA+F YAFC AP +W SLV+ M SLQ ++ER+ F RLG
Sbjct: 421 AIIGMFFVMAFPWYHGGIVRVLAIFGYAFCSAPFLWASLVRVMDSLQTMLEREPFFPRLG 480
Query: 417 EPNQTAEFTKLY 428
EP Q EF+KL+
Sbjct: 481 EPAQETEFSKLF 492
>Os02g0624400 Glycosyl transferase, family 8 protein
Length = 547
Score = 113 bits (282), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 72/238 (30%), Positives = 119/238 (50%), Gaps = 7/238 (2%)
Query: 11 EFYVATRVMMRSLGRLGSDADRVVIASVDVPPRWVQALKDDGVKVVSVENLKNPYEKQGN 70
EF + RV+ +S+ + D VV+ S V + L+ DG V + L NP + +
Sbjct: 48 EFVLGVRVLGKSIRDTDTSRDLVVLVSDGVSEYSRKLLEADGFIVKHITLLANPNQVRPT 107
Query: 71 FNMRFKLTLNKLYAWSLVSYDRVVMLDSDNIFLQNTDELFQCGQFCAVFINPCIFHTGLF 130
RF KL +++ SY +V LD+D I +++ +++F CG+FCA + ++G+
Sbjct: 108 ---RFWGVYTKLKIFNMTSYKKVAYLDADTIVVKSIEDIFNCGKFCANLKHSERMNSGVM 164
Query: 131 VLQPSMDVFKNMLHELAVGRDNPDGADQGFLASYFPDLLDRPMFHPPVNGTKLEGTYRLP 190
V++PS +F +M+ ++ + G DQGFL SY+ D + ++ P T T RL
Sbjct: 165 VVEPSETLFNDMMDKVN-SLPSYTGGDQGFLNSYYADFANSRVYEPNKPTTPEPETQRLS 223
Query: 191 LGYQMDASYYYLKLRWSIPCGPNSVITFPSAPWFKPWYWWS-WPVLPLGLSWHEQRRE 247
Y D Y L +W + VI + P KPW WW+ W V P+ + W + R+
Sbjct: 224 TLYNADVGLYMLANKWMVDEKELRVIHYTLGP-LKPWDWWTAWLVKPVAV-WQDIRKN 279
>Os03g0184300 Glycosyl transferase, family 8 protein
Length = 500
Score = 76.3 bits (186), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 46/148 (31%), Positives = 80/148 (54%), Gaps = 4/148 (2%)
Query: 18 VMMRSLGRLGSDADRVVIASVDVPPRWVQALKDDGVKVVSVENLKNPYEKQGNFNMRFKL 77
V+ +S+ R GS D V++ V + AL G ++ ++NP ++G +N +
Sbjct: 228 VLAQSIRRAGSTRDLVLLHDHTVSKPALAALVAAGWTPRKIKRIRNPRAERGTYN---EY 284
Query: 78 TLNKLYAWSLVSYDRVVMLDSDNIFLQNTDELFQCGQFCAVFINPCIFHTGLFVLQPSMD 137
+K W L YDRVV +D+D + L++ D LF Q AV + +F++G+ V++PS
Sbjct: 285 NYSKFRLWQLTDYDRVVFVDADILVLRDLDALFGFPQLTAVGNDGSLFNSGVMVIEPSQC 344
Query: 138 VFKNMLHELAVGRDNPDGADQGFLASYF 165
F++++ + R + +G DQGFL F
Sbjct: 345 TFQSLIRQRRTIR-SYNGGDQGFLNEVF 371
>Os01g0880200 Glycosyl transferase, family 8 protein
Length = 635
Score = 69.3 bits (168), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 40/158 (25%), Positives = 80/158 (50%), Gaps = 10/158 (6%)
Query: 11 EFYVATRVMMRSLGRLGSDADRVVIASVDVPPRWVQALKDDGVKVVSVENLKNPYEKQGN 70
E+ +S+ + GS D V++ + + L+ G KV ++ ++NP ++
Sbjct: 317 EYVCGAITAAQSIRQAGSTRDFVILVDETISNHHRKGLEAAGWKVRIIQRIRNPKAERDA 376
Query: 71 FNMRFKLTLNKLYAWSLVSYDRVVMLDSDNIFLQNTDELFQCGQFCAVFINPCIFHTGLF 130
+N + +K W L YD+++ +D+D + L+N D LF + A N +F++G+
Sbjct: 377 YN---EWNYSKFRLWQLTDYDKIIFIDADLLILRNVDFLFAMPEITATGNNATLFNSGVM 433
Query: 131 VLQPSMDVFK---NMLHELAVGRDNPDGADQGFLASYF 165
V++PS F+ + ++E+ + +G DQG+L F
Sbjct: 434 VIEPSNCTFQLLMDHINEIT----SYNGGDQGYLNEIF 467
Database: rap3
Posted date: Nov 19, 2010 6:03 PM
Number of letters in database: 17,035,801
Number of sequences in database: 52,214
Lambda K H
0.327 0.141 0.462
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 52214
Number of Hits to DB: 14,668,446
Number of extensions: 628901
Number of successful extensions: 1383
Number of sequences better than 1.0e-10: 5
Number of HSP's gapped: 1378
Number of HSP's successfully gapped: 5
Length of query: 428
Length of database: 17,035,801
Length adjustment: 104
Effective length of query: 324
Effective length of database: 11,605,545
Effective search space: 3760196580
Effective search space used: 3760196580
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 157 (65.1 bits)
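Note on the statistics above: the effective lengths, the effective search space, and the bit scores shown in parentheses can be recomputed from the other values in this report. The sketch below is not part of the BLAST output. It assumes the standard Karlin-Altschul relations (bit score = (Lambda * raw - ln K) / ln 2, E = search space * 2^-bits) and that the length adjustment is subtracted once from the query and once per database sequence. The function names are hypothetical, and because Lambda and K are printed rounded and compositional matrix adjustment is in effect, the results are only approximate.

```python
import math

# Values copied from this report's header and statistics footer.
QUERY_LEN = 428
DB_LETTERS = 17_035_801
DB_SEQS = 52_214
LENGTH_ADJUSTMENT = 104
LAMBDA_UNGAPPED, K_UNGAPPED = 0.327, 0.141   # first "Lambda K H" block
LAMBDA_GAPPED, K_GAPPED = 0.267, 0.0410      # "Gapped" block


def effective_search_space(query_len, db_letters, db_seqs, adjustment):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - adjustment
    # Assumption: the adjustment is removed once for every database sequence.
    eff_db = db_letters - db_seqs * adjustment
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw score to bits using the Karlin-Altschul parameters."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


eff_q, eff_db, space = effective_search_space(
    QUERY_LEN, DB_LETTERS, DB_SEQS, LENGTH_ADJUSTMENT
)
print("effective query length:", eff_q)
print("effective database length:", eff_db)
print("effective search space:", space)

# Raw scores taken from the report: S1 and S2 thresholds and the Os02g0624400 hit.
print("S1 bits:", round(bit_score(40, LAMBDA_UNGAPPED, K_UNGAPPED), 1))
print("S2 bits:", round(bit_score(157, LAMBDA_GAPPED, K_GAPPED), 1))
hit_bits = bit_score(282, LAMBDA_GAPPED, K_GAPPED)
print("Os02g0624400 bits:", round(hit_bits), "E:", expect_value(hit_bits, space))
```

The ungapped parameters are used for S1 and the gapped parameters for S2 and for the alignment scores, which is the pairing that reproduces the bit values printed in this report.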