BLASTP 2.2.23 [Feb-03-2010]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment:
Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala,
Aleksandr Morgulis, Alejandro A. Schäffer, and Yi-Kuo Yu (2005) "Protein database
searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Os08g0333100 Os08g0333100|AK103806
(481 letters)
Database: rap3
52,214 sequences; 17,035,801 total letters
Searching..................................................done
                                                         Score   E
Sequences producing significant alignments:             (bits)   Value
Os08g0333100 Conserved hypothetical protein                728   0.0
Os09g0281800 Conserved hypothetical protein                421   e-118
Os09g0488700 Conserved hypothetical protein                297   1e-80
Os10g0370100 Disease resistance protein family protein     230   2e-60
Os02g0717600 Conserved hypothetical protein                 98   1e-20
>Os08g0333100 Conserved hypothetical protein
Length = 481
Score = 728 bits (1880), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 376/481 (78%), Positives = 376/481 (78%)
Query: 1 MHSQTPAKPKASPVRSRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAGGEVT 60
MHSQTPAKPKASPVRSR AGGEVT
Sbjct: 1 MHSQTPAKPKASPVRSRPQLPASAAAAAAAAVEPPLQLQQLHTTPPPPPPPLMPAGGEVT 60
Query: 61 GGSKAAKKRGMQKLLKSAFKRGDHHAPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 120
GGSKAAKKRGMQKLLKSAFKRGDHHAP
Sbjct: 61 GGSKAAKKRGMQKLLKSAFKRGDHHAPAGASSGGGEQSGDDEAAAAAAQDLSRSSSSSTG 120
Query: 121 XXXXXXXXXXXXXVEGDLSSRDSLELQESKNVKGAAAALRNAKLSHSYEAFPWERKMRDL 180
VEGDLSSRDSLELQESKNVKGAAAALRNAKLSHSYEAFPWERKMRDL
Sbjct: 121 GSSGRKGRKGDSSVEGDLSSRDSLELQESKNVKGAAAALRNAKLSHSYEAFPWERKMRDL 180
Query: 181 LQVAGASGFXXXXXXPRATDETQTKFHSLEDTLARAESWLMSSQMSGVPIVPMNVQTEAL 240
LQVAGASGF PRATDETQTKFHSLEDTLARAESWLMSSQMSGVPIVPMNVQTEAL
Sbjct: 181 LQVAGASGFLSLLLLPRATDETQTKFHSLEDTLARAESWLMSSQMSGVPIVPMNVQTEAL 240
Query: 241 LTKICGDVASSTVNMNSLGDLANMATVSLYGFEDYHGVDIGVVRAIRLWYAPFAGEMALE 300
LTKICGDVASSTVNMNSLGDLANMATVSLYGFEDYHGVDIGVVRAIRLWYAPFAGEMALE
Sbjct: 241 LTKICGDVASSTVNMNSLGDLANMATVSLYGFEDYHGVDIGVVRAIRLWYAPFAGEMALE 300
Query: 301 IKLQPGDTRLGFAISRTEEGFIYVSSVAEESTPGVASTRSGLLELYRRARRASKLLVVSR 360
IKLQPGDTRLGFAISRTEEGFIYVSSVAEESTPGVASTRSGLLELYRRARRASKLLVVSR
Sbjct: 301 IKLQPGDTRLGFAISRTEEGFIYVSSVAEESTPGVASTRSGLLELYRRARRASKLLVVSR 360
Query: 361 VGDDKVLPWATSTAGDIRCFDTVSLSQRLSLHRHALRPVTLHFLMWERLPPAAVIRGGAA 420
VGDDKVLPWATSTAGDIRCFDTVSLSQRLSLHRHALRPVTLHFLMWERLPPAAVIRGGAA
Sbjct: 361 VGDDKVLPWATSTAGDIRCFDTVSLSQRLSLHRHALRPVTLHFLMWERLPPAAVIRGGAA 420
Query: 421 ARPTVQMIVQXXXXXXXXXXXXXXXXVAFDGDGPEIVLSGKDDSDDRSFRFQNIGLPDSW 480
ARPTVQMIVQ VAFDGDGPEIVLSGKDDSDDRSFRFQNIGLPDSW
Sbjct: 421 ARPTVQMIVQGDEEGGGDAADESTDEVAFDGDGPEIVLSGKDDSDDRSFRFQNIGLPDSW 480
Query: 481 L 481
L
Sbjct: 481 L 481
>Os09g0281800 Conserved hypothetical protein
Length = 440
Score = 421 bits (1081), Expect = e-118, Method: Compositional matrix adjust.
Identities = 223/343 (65%), Positives = 254/343 (74%), Gaps = 7/343 (2%)
Query: 142 DSLELQESKNVKGAAAALRNAKLSHSYEAFPWERKMRDLLQVAGASGFXXXXXXPRATDE 201
+S EL SKN K AALR+AK+S++YE+FPWE+KM++LL V AS F P++ D
Sbjct: 102 ESGELDGSKNAK-VLAALRDAKISYAYESFPWEKKMKELLPVPAASCFLSMLLLPKSADG 160
Query: 202 TQTKFHSLEDTLARAESWLMSSQMSGVPIVPMNVQTEALLTKICGDVASSTVNMNSLGDL 261
+ T++ SLEDTLARA++WL+SSQ +GVP+ MNVQTEALLTKI G++A STVNM SL DL
Sbjct: 161 SHTRYKSLEDTLARADAWLVSSQAAGVPVAFMNVQTEALLTKISGEMALSTVNMGSLSDL 220
Query: 262 ANMATVSLYGFEDYHGVDIGVVRAIRLWYAPFAGEMALEIKLQPGDTRLGFAISRTEEGF 321
ANMA SLYGFEDYHGVDIGVVRA+RLWY P AGE ALEIKL PGDTRLGFAISRTEEGF
Sbjct: 221 ANMANASLYGFEDYHGVDIGVVRAVRLWYTPVAGEAALEIKLLPGDTRLGFAISRTEEGF 280
Query: 322 IYVSSVAEESTPGVASTRSGLLELYRRARRASKLLVVSRVGDDKVLPWATSTAGDIRCFD 381
IYVSSVAEESTPGVASTRSGLLEL+R ARRAS+LLVVSRVG +KVLPW STAGD++CFD
Sbjct: 281 IYVSSVAEESTPGVASTRSGLLELHRAARRASRLLVVSRVGGEKVLPWMVSTAGDVKCFD 340
Query: 382 TVSLSQRLSLHRHALRPVTLHFLMWERLPPAAVIRGGAAARPTVQMIVQXXXXXXXXXXX 441
TVSLSQ+LSLHRHALRP+TLHFLMW+ A I+ A P
Sbjct: 341 TVSLSQKLSLHRHALRPITLHFLMWDT---ALAIKDVVAKPPLPPPPTMLMLPSPPSPPP 397
Query: 442 XXXXXVA--FDGDGPEIVLSGKDDSDDRSFRFQNIG-LPDSWL 481
A GDG E SG D SFRFQNI LPDSWL
Sbjct: 398 SDAEGDAPPPSGDGDEAPGSGAKGGKDSSFRFQNIDLLPDSWL 440
>Os09g0488700 Conserved hypothetical protein
Length = 292
Score = 297 bits (761), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 143/233 (61%), Positives = 179/233 (76%), Gaps = 1/233 (0%)
Query: 177 MRDLLQVAGASGFXXXXXXPRATDETQTKFHSLEDTLARAESWLMSSQMSGVPIVPMNVQ 236
MRD L++ +S + P+A D ++ S EDTLARA +W+ SSQ+SG+PI MNVQ
Sbjct: 1 MRDSLRMPNSSSYLSMLVLPKALDLNSCRYESFEDTLARANAWIYSSQVSGIPIEFMNVQ 60
Query: 237 TEALLTKICGDVASSTVNMNSLGDLANMATVSLYGFEDYHGVDIGVVRAIRLWYAPFAGE 296
+EALLTKI G+ AS+TVN SL DL+N+ +LYGFEDYHGVDIGVV+A RLWY+ A E
Sbjct: 61 SEALLTKISGETASATVNSGSLSDLSNVTNATLYGFEDYHGVDIGVVKAARLWYSSIAEE 120
Query: 297 MALEIKLQPGDTRLGFAISRTEEGFIYVSSVAEESTPGVA-STRSGLLELYRRARRASKL 355
M LEI L+ GDTRLGFAISRTEEGFI++SSV + A STRSGL +L+ +AR ASKL
Sbjct: 121 MPLEIPLEEGDTRLGFAISRTEEGFIFISSVVDNDKDNEAPSTRSGLRDLFNQAREASKL 180
Query: 356 LVVSRVGDDKVLPWATSTAGDIRCFDTVSLSQRLSLHRHALRPVTLHFLMWER 408
LV+SRV ++KVLPW S++G IRCFDT+SLSQ+LSLHR A+RP+ LH LMWE+
Sbjct: 181 LVISRVSNEKVLPWMISSSGAIRCFDTISLSQKLSLHRLAVRPIQLHLLMWEK 233
>Os10g0370100 Disease resistance protein family protein
Length = 1297
Score = 230 bits (587), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 126/249 (50%), Positives = 164/249 (65%), Gaps = 9/249 (3%)
Query: 163 KLSHSYEAFPWERKMRDLLQVAGASGFXXXXXXPRATDETQTKFHSLEDTLARAESWLMS 222
K +Y ++ + M+D + + +S + P A D + S E TLARA +WL +
Sbjct: 1037 KAEDAYISYAMKGNMQDSVHMPDSSCYLSMLVLPMALDMKSHLYESFEVTLARANTWLYA 1096
Query: 223 SQMSGVPIVPMNVQTEALLTKI--CGDVASSTVNMNSLGDLANMATVSLYGFEDYHGVDI 280
SQ SGVPI M+VQ++ LLTKI GD S+TVN L DL+N AT+ EDY G +
Sbjct: 1097 SQASGVPIKLMSVQSDDLLTKISRVGDATSATVNSGLLPDLSN-ATL-----EDYQGYNT 1150
Query: 281 GVVRAIRLWYAPFAGEMALEIKLQPGDTRLGFAISRTEEGFIYVSSVAEESTPG-VASTR 339
VV+A RLWY+ GEM LEI + GDT+LGFAISRTEEGFIY+SSV ++ STR
Sbjct: 1151 EVVKAARLWYSSIGGEMPLEITPKVGDTKLGFAISRTEEGFIYISSVLQDDNDSETPSTR 1210
Query: 340 SGLLELYRRARRASKLLVVSRVGDDKVLPWATSTAGDIRCFDTVSLSQRLSLHRHALRPV 399
SG+ L+ RAR ASKLLV+SRV ++KVLPW S++G IRCFDT+S+SQ+LSL R+ L
Sbjct: 1211 SGMHNLFNRAREASKLLVISRVSNEKVLPWMISSSGAIRCFDTLSISQKLSLSRYPLCSF 1270
Query: 400 TLHFLMWER 408
LH LMWE+
Sbjct: 1271 QLHLLMWEK 1279
>Os02g0717600 Conserved hypothetical protein
Length = 322
Score = 97.8 bits (242), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 45/88 (51%), Positives = 61/88 (69%)
Query: 157 AALRNAKLSHSYEAFPWERKMRDLLQVAGASGFXXXXXXPRATDETQTKFHSLEDTLARA 216
AL A+L Y A+PWE+KMR+ L + +S F P A D ++++S+EDTLARA
Sbjct: 215 CALSKAQLQDGYVAYPWEKKMREALPIPNSSSFLSMLVLPTALDRAASRYNSVEDTLARA 274
Query: 217 ESWLMSSQMSGVPIVPMNVQTEALLTKI 244
+W++SSQ SGVPI +NVQTEALLTK+
Sbjct: 275 NAWILSSQSSGVPISFLNVQTEALLTKV 302
Database: rap3
Posted date: Nov 19, 2010 6:03 PM
Number of letters in database: 17,035,801
Number of sequences in database: 52,214
Lambda     K        H
0.317      0.131    0.383
Gapped
Lambda     K        H
0.267      0.0410   0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 52214
Number of Hits to DB: 12,744,179
Number of extensions: 423782
Number of successful extensions: 955
Number of sequences better than 1.0e-10: 5
Number of HSP's gapped: 947
Number of HSP's successfully gapped: 6
Length of query: 481
Length of database: 17,035,801
Length adjustment: 105
Effective length of query: 376
Effective length of database: 11,553,331
Effective search space: 4344052456
Effective search space used: 4344052456
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 158 (65.5 bits)