BLASTP 2.2.23 [Feb-03-2010]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment:
Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala,
Aleksandr Morgulis, Alejandro A. Schäffer, and Yi-Kuo Yu (2005) "Protein database
searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Os06g0352200 Os06g0352200|AK071746
(253 letters)
Database: rap3
52,214 sequences; 17,035,801 total letters
Searching..................................................done
                                                               Score    E
Sequences producing significant alignments:                    (bits) Value
Os06g0352200 Protein of unknown function DUF679 family protein    411   e-115
Os02g0479000                                                      169   1e-42
Os08g0106501 Protein of unknown function DUF679 family protein    137   8e-33
Os12g0411500                                                      135   4e-32
Os07g0407900 Protein of unknown function DUF679 family protein    130   6e-31
Os03g0370400 Protein of unknown function DUF679 family protein    125   2e-29
Os01g0882400 Protein of unknown function DUF679 family protein    122   2e-28
Os01g0368400                                                      119   2e-27
Os01g0388700 Protein of unknown function DUF679 family protein    119   2e-27
Os01g0389700 Protein of unknown function DUF679 family protein    117   1e-26
Os07g0645300 Protein of unknown function DUF679 family protein    111   4e-25
Os01g0389200 Protein of unknown function DUF679 family protein    108   5e-24
Os05g0562800 Protein of unknown function DUF679 family protein     85   4e-17
Os01g0368700 Protein of unknown function DUF679 family protein     83   2e-16
>Os06g0352200 Protein of unknown function DUF679 family protein
Length = 253
Score = 411 bits (1057), Expect = e-115, Method: Compositional matrix adjust.
Identities = 208/253 (82%), Positives = 208/253 (82%)
Query: 1 MASNKQIDLESQKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGITNRHNVQPETEPXXX 60
MASNKQIDLESQK GITNRHNVQPETEP
Sbjct: 1 MASNKQIDLESQKPATSSPAASVSSAAAATAPPASSSAVPVSVGITNRHNVQPETEPLLL 60
Query: 61 XXXXXXXXXXXXETTRLERTITRAFRSTAELAKHLPTGAVLVFEVLSPVFTNGGKCQDVN 120
ETTRLERTITRAFRSTAELAKHLPTGAVLVFEVLSPVFTNGGKCQDVN
Sbjct: 61 LAGGDGDGGSDDETTRLERTITRAFRSTAELAKHLPTGAVLVFEVLSPVFTNGGKCQDVN 120
Query: 121 RVMTAWLVGLCAAACFFLCFTDSFHDGKGTVRYVVATRAGLWVIDGTAPPPPDVAATYRL 180
RVMTAWLVGLCAAACFFLCFTDSFHDGKGTVRYVVATRAGLWVIDGTAPPPPDVAATYRL
Sbjct: 121 RVMTAWLVGLCAAACFFLCFTDSFHDGKGTVRYVVATRAGLWVIDGTAPPPPDVAATYRL 180
Query: 181 RFIDFFHAVLSLIVFLSVAMFDHNVGACFYPVMSYDTRQVLTDVPLAGGLVGTMLFATFP 240
RFIDFFHAVLSLIVFLSVAMFDHNVGACFYPVMSYDTRQVLTDVPLAGGLVGTMLFATFP
Sbjct: 181 RFIDFFHAVLSLIVFLSVAMFDHNVGACFYPVMSYDTRQVLTDVPLAGGLVGTMLFATFP 240
Query: 241 STRHGIGFPVHVA 253
STRHGIGFPVHVA
Sbjct: 241 STRHGIGFPVHVA 253
>Os02g0479000
Length = 216
Score = 169 bits (429), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 98/176 (55%), Positives = 111/176 (63%), Gaps = 6/176 (3%)
Query: 81 ITRAFRSTAELAKHLPTGAVLVFEVLSPVFTNGGKCQDVNRVMTAWLVGLCAAACFFLCF 140
+ +A STA+LAKHLPTGAVL FEVLSP FT G C NR +TA LVG CA CF LCF
Sbjct: 37 VCKALNSTADLAKHLPTGAVLAFEVLSPSFTADGSCTAANRALTACLVGACALCCFLLCF 96
Query: 141 TDSFHDGKGTVRYVVATRAG-LWVIDGTAPPPPDVAAT-----YRLRFIDFFHAVLSLIV 194
TDS+ D G+VRY T +G L +ID + YRL D H LS V
Sbjct: 97 TDSYRDATGSVRYGFVTPSGSLRLIDSGSGSGSPPPPPPRDDRYRLGARDVLHGALSFAV 156
Query: 195 FLSVAMFDHNVGACFYPVMSYDTRQVLTDVPLAGGLVGTMLFATFPSTRHGIGFPV 250
FL+VAM D NV ACFYPV S TRQ+L VP+A G G+ LFA FPSTR GIGFPV
Sbjct: 157 FLAVAMVDRNVVACFYPVESPATRQLLAAVPMAAGAAGSFLFAMFPSTRRGIGFPV 212
>Os08g0106501 Protein of unknown function DUF679 family protein
Length = 224
Score = 137 bits (345), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 82/180 (45%), Positives = 109/180 (60%), Gaps = 6/180 (3%)
Query: 76 RLERTITRAFRSTAELAKHLPTGAVLVFEVLSPVFTNGGKCQDVNRVMTAWLVGLCAAAC 135
R +++A STA LA LPTG V+ F++L+P FTN G C ++TA L+ L A +C
Sbjct: 38 RRPSLLSQALASTASLANLLPTGTVMAFQLLAPTFTNNGACDATTSLLTAALLALLALSC 97
Query: 136 FFLCFTDSFHDGKGTVRYVVATRAGLWVID-----GTAPPPPDVAATYRLRFIDFFHAVL 190
FTDS G V Y +AT GLW++D APP PD + YR+R ID HA+L
Sbjct: 98 VLASFTDSVRGPDGRVYYGLATPRGLWLLDYPPAGAGAPPQPDTS-RYRMRAIDGVHALL 156
Query: 191 SLIVFLSVAMFDHNVGACFYPVMSYDTRQVLTDVPLAGGLVGTMLFATFPSTRHGIGFPV 250
S+ VF VA D NV CF+P + T +VL VPL G++ ++LF FP+TRHGIG+PV
Sbjct: 157 SVGVFGVVAARDKNVVGCFWPSPAKGTEEVLGIVPLGVGVMCSLLFVVFPTTRHGIGYPV 216
>Os12g0411500
Length = 149
Score = 135 bits (339), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 75/139 (53%), Positives = 86/139 (61%), Gaps = 2/139 (1%)
Query: 81 ITRAFRSTAELAKHLPTGAVLVFEVLSPVFTNGGKCQDVNRVMTAWLVGLCAAACFFLCF 140
+ +A STA+LAKHLPT VL F VLSP T G C NR +TA LVG CA CF LCF
Sbjct: 12 VCKALNSTADLAKHLPTSVVLAFGVLSPSSTADGSCTAANRALTACLVGACALCCFLLCF 71
Query: 141 TDSFHDGKGTVRYVVATRAG-LWVIDGTAPPPPDVAATYRLRFIDFFHAVLSLIVFLSVA 199
++S+ DG G VRY T +G L +IDG+ PP YRL D H LS VFL+VA
Sbjct: 72 SNSYRDGTGAVRYDFVTPSGRLRLIDGSGSLPPR-DNRYRLGARDVLHGALSFAVFLAVA 130
Query: 200 MFDHNVGACFYPVMSYDTR 218
M DHNV A FYPV S TR
Sbjct: 131 MVDHNVVAHFYPVESPATR 149
>Os07g0407900 Protein of unknown function DUF679 family protein
Length = 262
Score = 130 bits (328), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 75/189 (39%), Positives = 106/189 (56%), Gaps = 18/189 (9%)
Query: 74 TTRLERTITRAFRSTAELAKHLPTGAVLVFEVLSPVFTNGGKCQDVNRVMTAWLVGLCAA 133
TT +++T++ S A LAK LPTG VL F+ LSP FTN G C NR +TA L+ LC
Sbjct: 72 TTAMDKTLS----SVANLAKLLPTGTVLAFQSLSPSFTNRGACLTSNRYLTAALLYLCVL 127
Query: 134 ACFFLCFTDSFHDGKGTVRYVVATRAGLWVIDGTAPPPPDVAAT--------------YR 179
+C F FTDSF G G + Y VAT G V + A D R
Sbjct: 128 SCIFFSFTDSFVGGDGKLYYGVATAKGFLVFNYDAGSSSDGDDDDQRRRREVFKDLRRLR 187
Query: 180 LRFIDFFHAVLSLIVFLSVAMFDHNVGACFYPVMSYDTRQVLTDVPLAGGLVGTMLFATF 239
+R++D+ HAV + +VF++VA V +C++P + +Q+LT++PL G + T +F F
Sbjct: 188 IRWVDYVHAVFTALVFMTVAFSSTAVQSCYFPEAGDNVKQLLTNLPLGAGFLSTTVFLVF 247
Query: 240 PSTRHGIGF 248
P+TR GIG+
Sbjct: 248 PTTRKGIGY 256
>Os03g0370400 Protein of unknown function DUF679 family protein
Length = 193
Score = 125 bits (315), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 69/167 (41%), Positives = 92/167 (55%), Gaps = 1/167 (0%)
Query: 83 RAFRSTAELAKHLPTGAVLVFEVLSPVFTNGGKCQDV-NRVMTAWLVGLCAAACFFLCFT 141
RA R A+L K LP+G V +F+ LSP+ TN G C +RV++A L+ LC A C F FT
Sbjct: 14 RALRGVADLIKLLPSGTVFLFQFLSPLVTNNGHCAAAYSRVLSAALLALCGAFCAFSSFT 73
Query: 142 DSFHDGKGTVRYVVATRAGLWVIDGTAPPPPDVAATYRLRFIDFFHAVLSLIVFLSVAMF 201
DS+ G V Y V T GL + YRLR DF HA LSL+VF ++A+
Sbjct: 74 DSYVGSDGRVYYGVVTARGLRTFAADPDAAARDLSGYRLRAGDFVHAALSLLVFATIALL 133
Query: 202 DHNVGACFYPVMSYDTRQVLTDVPLAGGLVGTMLFATFPSTRHGIGF 248
D + AC YP + R ++ +P G V + F FP+ RHGIG+
Sbjct: 134 DADTVACLYPALEVSERTMMAVLPPVVGGVASYAFMVFPNNRHGIGY 180
>Os01g0882400 Protein of unknown function DUF679 family protein
Length = 227
Score = 122 bits (307), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 67/173 (38%), Positives = 95/173 (54%), Gaps = 17/173 (9%)
Query: 88 TAELAKHLPTGAVLVFEVLSPVFTNGGKCQDVNRVMTAWLVGLCAAACFFLCFTDSFHDG 147
TA L LPT +L F + +P+ T+ GKC +NR +T L+ LCAA+C F TDSF
Sbjct: 49 TARLNVLLPTATILAFAIFAPLLTDDGKCTRLNRALTGALMLLCAASCVFFTLTDSFRSP 108
Query: 148 KGTVRYVVATRAGLWVI-----------DGTAPPPPDVAATYRLRFIDFFHAVLSLIVFL 196
G +RY +AT +G+ P P+ YRLR+ D FH L+L+ F+
Sbjct: 109 TGRLRYGIATTSGIRTFCVGGRRRRRGGGKAGPREPE---RYRLRWSDLFHTALALVAFV 165
Query: 197 SVAMFDHNVGACFYPVMSYDTRQVLTDVPLAGGLVGTMLFATFPSTRHGIGFP 249
+ A H++ C+YP + R+V+ VPL G V ++LF FPS R GIG+P
Sbjct: 166 TFAASHHDIVLCYYPGVP---RKVVNTVPLVIGFVVSLLFVLFPSKRRGIGYP 215
>Os01g0368400
Length = 335
Score = 119 bits (298), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 67/190 (35%), Positives = 94/190 (49%), Gaps = 21/190 (11%)
Query: 81 ITRAFRSTAELAKHLPTGAVLVFEVLSPVFTNGGKCQDVNRVMTAWLVGLCAAACFFLCF 140
+ + +TA L K LPTG L F+ L+P FTN G+C +NR ++ L+ C A C L F
Sbjct: 50 VDKTLSTTANLVKLLPTGTTLAFQALAPSFTNHGRCLAINRYISGGLIAFCCAICALLSF 109
Query: 141 TDSFHDGKGTVRYVVA-------TRAGLWVIDGTAPPPPDVAA--------------TYR 179
TDS D KG Y +A + G + P P A R
Sbjct: 110 TDSIIDRKGRPYYGLAFPADEDTGKGGFVPFNYEKPRRPSNGAATAAADDDDSWELYKRR 169
Query: 180 LRFIDFFHAVLSLIVFLSVAMFDHNVGACFYPVMSYDTRQVLTDVPLAGGLVGTMLFATF 239
+R +DF HA L + VFL++A D + C +P S R+ L ++PL G V + +F F
Sbjct: 170 VRPLDFLHATLRVFVFLALAFSDAGIQTCLFPQESATWREALVNMPLGVGFVASFVFMIF 229
Query: 240 PSTRHGIGFP 249
PSTR G+G+P
Sbjct: 230 PSTRKGVGYP 239
>Os01g0388700 Protein of unknown function DUF679 family protein
Length = 225
Score = 119 bits (297), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 70/176 (39%), Positives = 98/176 (55%), Gaps = 10/176 (5%)
Query: 81 ITRAFRSTAELAKHLPTGAVLVFEVLSPVFTNGGKCQDV-NRVMTAWLVGLCAAACFFLC 139
+ + ++L K LPTG VL F+ L+P F+N G C V NR + L+G CAA+C L
Sbjct: 44 VDKTLSGASDLLKLLPTGTVLAFQALAPSFSNHGVCHAVANRYLVLALIGACAASCMLLS 103
Query: 140 FTDSF--HDGKGTVRYVVATRAGLWVID--GTAPPPPDV---AATYRLRFIDFFHAVLSL 192
FTDS HDGK + Y VAT G + GT V + +R+ +DF HA S
Sbjct: 104 FTDSLIGHDGK--LYYGVATLRGFRPFNFAGTREEHGTVFKDLSRFRITALDFVHAFFSA 161
Query: 193 IVFLSVAMFDHNVGACFYPVMSYDTRQVLTDVPLAGGLVGTMLFATFPSTRHGIGF 248
+VFL+VA D V C +P D R++L ++PL G + +M+F FP+TR IG+
Sbjct: 162 VVFLAVAFADAAVQTCLFPEAEADMRELLVNLPLGAGFLSSMVFMIFPTTRKSIGY 217
>Os01g0389700 Protein of unknown function DUF679 family protein
Length = 240
Score = 117 bits (292), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 67/175 (38%), Positives = 100/175 (57%), Gaps = 5/175 (2%)
Query: 79 RTIT-RAFRSTAELAKHLPTGAVLVFEVLSPVFTNGGKCQDVNRVMTAWLVGLCAAACFF 137
+T+T + STA LA+ LPTG VL ++ LSP FTN G+C N+ +TA LVG+ A F
Sbjct: 51 KTVTDKVMASTANLAQLLPTGTVLAYQALSPSFTNHGECNAANKWLTAVLVGVLAGLSLF 110
Query: 138 LCFTDSFHDGKGTVRYVVATRAGLWVIDGTAPPPPDVAATY---RLRFIDFFHAVLSLIV 194
FTDS G + Y VATR GL V + + ++ RLR +DF H+ + +V
Sbjct: 111 FSFTDSVVGQDGKLYYGVATRRGLNVFNMSREEEEAKKLSHSELRLRPLDFVHSFFTAMV 170
Query: 195 FLSVAMFDHNVGACFYPVM-SYDTRQVLTDVPLAGGLVGTMLFATFPSTRHGIGF 248
FL+VA D + CF+ +T+++L ++PL + + +F FP+ R GIG+
Sbjct: 171 FLTVAFSDVGLQNCFFGQNPGGNTKELLKNLPLGMAFLSSFVFLIFPTKRKGIGY 225
>Os07g0645300 Protein of unknown function DUF679 family protein
Length = 188
Score = 111 bits (278), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 65/171 (38%), Positives = 93/171 (54%), Gaps = 6/171 (3%)
Query: 85 FRSTAELAKHLPTGAVLVFEVLSPVFTNGGKCQDVNRVMTAWLVGLCAAACFFLCFTDSF 144
F+S ++ K LPT V+V+EVL+P+ TN G C N+V+T ++ LCA C F FTDS+
Sbjct: 6 FKSIGDVLKLLPTATVIVYEVLTPIVTNTGDCHVANKVVTPVILVLCAFFCAFSQFTDSY 65
Query: 145 HDGKGTVRYVVATRAGLWVIDGTAPPPPDVA-----ATYRLRFIDFFH-AVLSLIVFLSV 198
G VRY + T GL G A + YRLRF DF H +
Sbjct: 66 VGADGKVRYGLVTARGLLPFSGGGGADGGDAAGRDFSKYRLRFGDFVHAFFSVAVFAAVA 125
Query: 199 AMFDHNVGACFYPVMSYDTRQVLTDVPLAGGLVGTMLFATFPSTRHGIGFP 249
+ D N +CFYP + ++V+ +P+ G + +++F FPSTRHGIG+P
Sbjct: 126 LLADANTVSCFYPSLKDQQKKVVMALPVVVGALASVVFVVFPSTRHGIGYP 176
>Os01g0389200 Protein of unknown function DUF679 family protein
Length = 264
Score = 108 bits (269), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 62/172 (36%), Positives = 92/172 (53%), Gaps = 6/172 (3%)
Query: 83 RAFRSTAELAKHLPTGAVLVFEVLSPVFTNGGKCQDVNRVMTAWLVGLCAAACFFLCFTD 142
+ STA LA+ LPTG L ++ LS FTN G+C NR +TA LV + A+ F TD
Sbjct: 71 KVMASTANLAQLLPTGTALAYQALSTSFTNHGQCYRSNRWLTAGLVAVLTASSIFFSLTD 130
Query: 143 SFHDGKGTVRYVVATRAGLWV--IDGTAPPPPDVAAT----YRLRFIDFFHAVLSLIVFL 196
S G + Y +AT G V + +++ T R+R +D HA + +VFL
Sbjct: 131 SVVGRGGKLYYGMATPRGFNVFNLSREEEEAQELSRTKLRELRVRPLDIVHAFFTAVVFL 190
Query: 197 SVAMFDHNVGACFYPVMSYDTRQVLTDVPLAGGLVGTMLFATFPSTRHGIGF 248
+VA D + CF+P DT+++L ++PL + T +F FP+ R GIG+
Sbjct: 191 TVAFSDVGLTKCFFPDAGNDTKELLKNLPLGMAFMSTFVFLLFPTKRKGIGY 242
>Os05g0562800 Protein of unknown function DUF679 family protein
Length = 79
Score = 85.1 bits (209), Expect = 4e-17, Method: Composition-based stats.
Identities = 37/59 (62%), Positives = 50/59 (84%)
Query: 192 LIVFLSVAMFDHNVGACFYPVMSYDTRQVLTDVPLAGGLVGTMLFATFPSTRHGIGFPV 250
+I+ ++VA+FD NV +CFYPV S TRQVLT +P+A G+VG+MLF +FP+TRHGIGFP+
Sbjct: 18 IIIIIAVALFDQNVVSCFYPVPSEGTRQVLTALPIAIGVVGSMLFVSFPTTRHGIGFPL 76
>Os01g0368700 Protein of unknown function DUF679 family protein
Length = 215
Score = 82.8 bits (203), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 60/176 (34%), Positives = 93/176 (52%), Gaps = 8/176 (4%)
Query: 81 ITRAFRSTAELAKHLPTGAVLVFEVLSPVFTN-GGKCQDVNRVMTAWLVGLCAAACFFLC 139
+ + + ++ K LPTG VL F L+P FTN GG C +R TA L+ C A+C L
Sbjct: 15 VDKTMCAACDILKLLPTGTVLAFHELAPSFTNHGGACGAASRYTTAALIAACTASCVLLS 74
Query: 140 FTDSF--HDGKGTVRYVVATRAGL--WVIDGTAPPPPDVAAT---YRLRFIDFFHAVLSL 192
FTDS H + Y VAT G + +GT + ++R +DF HA +S
Sbjct: 75 FTDSLVSHVDGRRLYYGVATLRGFRPFNFEGTREEMEERFGDLPGMKVRALDFVHAHVSA 134
Query: 193 IVFLSVAMFDHNVGACFYPVMSYDTRQVLTDVPLAGGLVGTMLFATFPSTRHGIGF 248
+VF+ VA+ + +V C +P ++ ++P+ GL+ +M+F FP+TR IG+
Sbjct: 135 VVFVVVALGNADVQGCLFPDAGTGFTEMFRNLPMGLGLLASMVFMIFPTTRKSIGY 190
Database: rap3
Posted date: Nov 19, 2010 6:03 PM
Number of letters in database: 17,035,801
Number of sequences in database: 52,214
Lambda K H
0.326 0.139 0.433
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 52214
Number of Hits to DB: 7,571,767
Number of extensions: 298864
Number of successful extensions: 968
Number of sequences better than 1.0e-10: 14
Number of HSP's gapped: 948
Number of HSP's successfully gapped: 14
Length of query: 253
Length of database: 17,035,801
Length adjustment: 99
Effective length of query: 154
Effective length of database: 11,866,615
Effective search space: 1827458710
Effective search space used: 1827458710
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 155 (64.3 bits)
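
Note on the statistics above: the effective lengths and the bit values in parentheses can be rechecked from the other numbers in this report. The sketch below assumes the standard Karlin-Altschul conversion, bits = (lambda * S - ln K) / ln 2, and the usual length correction in which the length adjustment is subtracted once from the query and once per database sequence. The function names are illustrative only and are not part of BLAST. Lambda and K are printed to three significant figures, so results agree only to the printed precision, and per-alignment scores computed with compositional matrix adjustment may not reproduce exactly.

```python
import math


def effective_search_space(query_len, db_len, num_seqs, length_adjustment):
    """Return (effective query length, effective db length, search space).

    Assumes the length adjustment is removed once from the query and
    once from every database sequence.
    """
    eff_query = query_len - length_adjustment
    eff_db = db_len - num_seqs * length_adjustment
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw score to bits using the Karlin-Altschul parameters."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Values copied from the report above.
eff_q, eff_db, space = effective_search_space(
    query_len=253, db_len=17_035_801, num_seqs=52_214, length_adjustment=99
)
print(eff_q, eff_db, space)

# S2 is a gapped cutoff (gapped Lambda/K); S1 is ungapped (ungapped Lambda/K).
print(round(bit_score(155, 0.267, 0.0410), 1))
print(round(bit_score(40, 0.326, 0.139), 1))
```

The first line printed should match the "Effective length of query", "Effective length of database" and "Effective search space" entries, and the last two should match the bit values shown next to S2 and S1.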