BLASTP 2.2.23 [Feb-03-2010]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment:
Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala,
Aleksandr Morgulis, Alejandro A. Schäffer, and Yi-Kuo Yu (2005) "Protein database
searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Os03g0861100 Os03g0861100|AK100706
(1256 letters)
Database: rap3
52,214 sequences; 17,035,801 total letters
Searching..................................................done
                                                        Score     E
Sequences producing significant alignments:             (bits)  Value
Os03g0861100 Conserved hypothetical protein              2145   0.0
Os07g0445600 Conserved hypothetical protein               887   0.0
Os07g0616000 Conserved hypothetical protein               141   2e-33
>Os03g0861100 Conserved hypothetical protein
Length = 1256
Score = 2145 bits (5558), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1082/1256 (86%), Positives = 1082/1256 (86%)
Query: 1 MRPETRLDSAVFQLTPTRTRCDLVVIANGRKEKIASGLLNPFVAHLKVAQQQIAKGGYSI 60
MRPETRLDSAVFQLTPTRTRCDLVVIANGRKEKIASGLLNPFVAHLKVAQQQIAKGGYSI
Sbjct: 1 MRPETRLDSAVFQLTPTRTRCDLVVIANGRKEKIASGLLNPFVAHLKVAQQQIAKGGYSI 60
Query: 61 TLEVDPEIDAPWFTRGTVERFVRFVSTPEVLERVTTIESEILQIEDAITGQGGDNLGLRS 120
TLEVDPEIDAPWFTRGTVERFVRFVSTPEVLERVTTIESEILQIEDAITGQGGDNLGLRS
Sbjct: 61 TLEVDPEIDAPWFTRGTVERFVRFVSTPEVLERVTTIESEILQIEDAITGQGGDNLGLRS 120
Query: 121 VEDYNEKLAECIGGSKTNYDLDGDKSLILYKPGIQPPPPVQNDNATQEENSKVQLLRVLE 180
VEDYNEKLAECIGGSKTNYDLDGDKSLILYKPGIQPPPPVQNDNATQEENSKVQLLRVLE
Sbjct: 121 VEDYNEKLAECIGGSKTNYDLDGDKSLILYKPGIQPPPPVQNDNATQEENSKVQLLRVLE 180
Query: 181 TRKIVLRKEQXXXXXXXXXXGFNIDNLGFLITFADRFGASRLMKACTQFTELWRRKHETG 240
TRKIVLRKEQ GFNIDNLGFLITFADRFGASRLMKACTQFTELWRRKHETG
Sbjct: 181 TRKIVLRKEQAMAFARAVAAGFNIDNLGFLITFADRFGASRLMKACTQFTELWRRKHETG 240
Query: 241 QWIEVEPEAMSARSEFPPFNASGIMFMGDNMKQNLETLSISNGDANGEDAAKADQRTAQH 300
QWIEVEPEAMSARSEFPPFNASGIMFMGDNMKQNLETLSISNGDANGEDAAKADQRTAQH
Sbjct: 241 QWIEVEPEAMSARSEFPPFNASGIMFMGDNMKQNLETLSISNGDANGEDAAKADQRTAQH 300
Query: 301 SGAPSEYLHGPYQSAYPPWAIHXXXXXXXXXXXXXXXXXXXXXXXXMDDPRYHHSERRVS 360
SGAPSEYLHGPYQSAYPPWAIH MDDPRYHHSERRVS
Sbjct: 301 SGAPSEYLHGPYQSAYPPWAIHPPYPMQGMPYYPGVNPYYPPPYPPMDDPRYHHSERRVS 360
Query: 361 RKHSSDSKDSETLDDXXXXXXXXXXXXXXXXXXXXXXXXXXXXPSVVVIRNINVTSKKHG 420
RKHSSDSKDSETLDD PSVVVIRNINVTSKKHG
Sbjct: 361 RKHSSDSKDSETLDDESGQSGSEIESSHGHKLHKKGKRSGKKKPSVVVIRNINVTSKKHG 420
Query: 421 XXXXXXXXXXXXXXXXXXXXHTEYXXXXXXXXXXXXXXXXXIILEPGDEYSRDEVAHRQD 480
HTEY IILEPGDEYSRDEVAHRQD
Sbjct: 421 SSESESQTSSDVASEDSDDSHTEYSKRKNKRSSSKKKESRKIILEPGDEYSRDEVAHRQD 480
Query: 481 GDQGNWNVFQSFLLRTEEKTKDNDADLFATERGPPPARRKESRTTDDPLLLVERDSTDFN 540
GDQGNWNVFQSFLLRTEEKTKDNDADLFATERGPPPARRKESRTTDDPLLLVERDSTDFN
Sbjct: 481 GDQGNWNVFQSFLLRTEEKTKDNDADLFATERGPPPARRKESRTTDDPLLLVERDSTDFN 540
Query: 541 EGKTIGFNSAHGRIRSRKMLSGDELVISAEGRSFVDGDIKEIEAGGGGYRRGASEDFIVY 600
EGKTIGFNSAHGRIRSRKMLSGDELVISAEGRSFVDGDIKEIEAGGGGYRRGASEDFIVY
Sbjct: 541 EGKTIGFNSAHGRIRSRKMLSGDELVISAEGRSFVDGDIKEIEAGGGGYRRGASEDFIVY 600
Query: 601 GQEKPMDSGSYLDPLAEGQYKSPTLMEKNMHSVADESFMIPVRSNSQDNLGPESCTAIDI 660
GQEKPMDSGSYLDPLAEGQYKSPTLMEKNMHSVADESFMIPVRSNSQDNLGPESCTAIDI
Sbjct: 601 GQEKPMDSGSYLDPLAEGQYKSPTLMEKNMHSVADESFMIPVRSNSQDNLGPESCTAIDI 660
Query: 661 DVELPGTVKKTTDAKAGDQLFYEPDELMPEREYEDVTYGYDPAMDYDSQMQIQPAIMVED 720
DVELPGTVKKTTDAKAGDQLFYEPDELMPEREYEDVTYGYDPAMDYDSQMQIQPAIMVED
Sbjct: 661 DVELPGTVKKTTDAKAGDQLFYEPDELMPEREYEDVTYGYDPAMDYDSQMQIQPAIMVED 720
Query: 721 ANADDVSLGVEGEVKKLEKDKKLRLQECLDKKKDASARRLPSSKTRLTDAQKRAQNLRAY 780
ANADDVSLGVEGEVKKLEKDKKLRLQECLDKKKDASARRLPSSKTRLTDAQKRAQNLRAY
Sbjct: 721 ANADDVSLGVEGEVKKLEKDKKLRLQECLDKKKDASARRLPSSKTRLTDAQKRAQNLRAY 780
Query: 781 KADXXXXXXXXXXXXXXXXXXXXQERQKRIAARSSTSNSISTPQQXXXXXXXXXXXXXXX 840
KAD QERQKRIAARSSTSNSISTPQQ
Sbjct: 781 KADLQKAKKEQEEEQIKRLERLKQERQKRIAARSSTSNSISTPQQVKVKPSPKTSPSTYK 840
Query: 841 XXXFSDAEPGSFSPLRKLPARTTAESDHQKTGKASKLXXXXXXXXXXXXXXLAAMKKEKN 900
FSDAEPGSFSPLRKLPARTTAESDHQKTGKASKL LAAMKKEKN
Sbjct: 841 SSKFSDAEPGSFSPLRKLPARTTAESDHQKTGKASKLSDSSTNAVSKSTSSLAAMKKEKN 900
Query: 901 GRNELSSERLKKLAEPKSNALTDRPSNSKFASMDHSRRKSMPEDTQTKKISAIMQLDQRK 960
GRNELSSERLKKLAEPKSNALTDRPSNSKFASMDHSRRKSMPEDTQTKKISAIMQLDQRK
Sbjct: 901 GRNELSSERLKKLAEPKSNALTDRPSNSKFASMDHSRRKSMPEDTQTKKISAIMQLDQRK 960
Query: 961 SATLPELKVKSPRAPSISVKNKTIAREIRDGDPGGKSPPTLEVTDGKKADVEVSRISNSD 1020
SATLPELKVKSPRAPSISVKNKTIAREIRDGDPGGKSPPTLEVTDGKKADVEVSRISNSD
Sbjct: 961 SATLPELKVKSPRAPSISVKNKTIAREIRDGDPGGKSPPTLEVTDGKKADVEVSRISNSD 1020
Query: 1021 DNVVVEKTVVILENEVVSTPPLILPPGRTSENETSSNDRTQKPSMELEYTAIRAPPSPAV 1080
DNVVVEKTVVILENEVVSTPPLILPPGRTSENETSSNDRTQKPSMELEYTAIRAPPSPAV
Sbjct: 1021 DNVVVEKTVVILENEVVSTPPLILPPGRTSENETSSNDRTQKPSMELEYTAIRAPPSPAV 1080
Query: 1081 LPEAENPTIHRHNDQGNYEVMTEHLKDETEELTLSAVEKPYQAPFARVTSLENDSATIHA 1140
LPEAENPTIHRHNDQGNYEVMTEHLKDETEELTLSAVEKPYQAPFARVTSLENDSATIHA
Sbjct: 1081 LPEAENPTIHRHNDQGNYEVMTEHLKDETEELTLSAVEKPYQAPFARVTSLENDSATIHA 1140
Query: 1141 YPHALPVESETPVHAESIRARVLDPVSTVSVEETPEANEKPRNKESKGFRKLLKFGRKSH 1200
YPHALPVESETPVHAESIRARVLDPVSTVSVEETPEANEKPRNKESKGFRKLLKFGRKSH
Sbjct: 1141 YPHALPVESETPVHAESIRARVLDPVSTVSVEETPEANEKPRNKESKGFRKLLKFGRKSH 1200
Query: 1201 TSGTMDSDASSVDGALAGDGSMLKTLIXXXXXXXXXXXXXXXXXXXXXXXQKVIVL 1256
TSGTMDSDASSVDGALAGDGSMLKTLI QKVIVL
Sbjct: 1201 TSGTMDSDASSVDGALAGDGSMLKTLISRDDSGSSSKASRSFSLLSPFRRQKVIVL 1256
>Os07g0445600 Conserved hypothetical protein
Length = 814
Score = 887 bits (2293), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 453/785 (57%), Positives = 550/785 (70%), Gaps = 15/785 (1%)
Query: 1 MRPETRLDSAVFQLTPTRTRCDLVVIANGRKEKIASGLLNPFVAHLKVAQQQIAKGGYSI 60
MRP+ RLD+AVFQLTPTRTR DLVVI NGRKEKIASGLLNPF+AHLKVAQ QIAKGGYSI
Sbjct: 1 MRPDARLDAAVFQLTPTRTRFDLVVIVNGRKEKIASGLLNPFLAHLKVAQDQIAKGGYSI 60
Query: 61 TLEVDPEIDAPWFTRGTVERFVRFVSTPEVLERVTTIESEILQIEDAITGQGGDNLGLRS 120
TLE + APWFTRGTVERFVRFVSTPEVLERVTTIESEILQ+EDAI+ Q DNLGLRS
Sbjct: 61 TLEPSSGVGAPWFTRGTVERFVRFVSTPEVLERVTTIESEILQLEDAISIQSNDNLGLRS 120
Query: 121 VEDYNEKLAECIGGSKTNYDLDGDKSLILYKPGIQPPPPVQNDNATQEENSKVQLLRVLE 180
VED+ KL E G++ N+ D DK++++Y+PG QP P V ++ T EENSKVQLLRVLE
Sbjct: 121 VEDHGGKLTESNEGTRANHSPDADKAIVIYQPGSQPTPAVHDETTTHEENSKVQLLRVLE 180
Query: 181 TRKIVLRKEQXXXXXXXXXXGFNIDNLGFLITFADRFGASRLMKACTQFTELWRRKHETG 240
TRK VLRKEQ GF+IDNLG+LI FA+RFGASRLM+AC+QF ELW+RKHETG
Sbjct: 181 TRKNVLRKEQAMAFARAVAAGFDIDNLGYLIAFAERFGASRLMRACSQFIELWKRKHETG 240
Query: 241 QWIEVEPEAMSARSEFPPFNASGIMFMGDNMKQNLETLSISNGDANGEDAAKADQRTAQH 300
QWIEVEPEAMS SEFPPFN SGI+F+GDNMKQN ET+S+SNG+ANGEDA+KA+ ++ Q
Sbjct: 241 QWIEVEPEAMSTHSEFPPFNPSGIVFVGDNMKQNTETMSVSNGEANGEDASKAEHKSGQQ 300
Query: 301 SGAPSEYLHGPYQSAYPPWAIHXXXXXXXXXXXXXXXXXXXXXXXXMDDPRYHHSERRVS 360
G YQ+AYPPWA+H +DDPRYH+S R+ S
Sbjct: 301 MG---------YQAAYPPWAMH--PPPYHMQGMPYYPGPYYPPYPPVDDPRYHYSGRKSS 349
Query: 361 RKHSSDSKDSETLDDXXXXXXXXXXXXXXXXXXXXXXXXXXXXPSVVVIRNINVTSKKHG 420
RKHSSDSK+SE LD+ PSVVVI+N+NVTSKKHG
Sbjct: 350 RKHSSDSKESEVLDEGSDGSSSERGSSHGHKSHKKGKRSGKKKPSVVVIKNVNVTSKKHG 409
Query: 421 XXXXXXXXXXXXXXXXXXXXHTEYXXXXXXXXXXXXXXXXXIILEPGDEY-SRDEVAHRQ 479
H + + GD+Y ++DE ++ Q
Sbjct: 410 SSESESQSSSEDGSQDSDDTHYKKRHGKHKSSGSKKKEGAKTNFDSGDDYNNKDESSYGQ 469
Query: 480 DGDQGNWNVFQSFLLRTEEKTKDNDADLFATERGPPPARRKESRTTDDPLLLVERDSTDF 539
D DQGNWN FQSFL+R EEKT+ NDAD+F+ E+ PP+R+K + T DP+LL DS D
Sbjct: 470 DADQGNWNAFQSFLMRAEEKTRSNDADMFSGEKA-PPSRKKNNVNTADPILLAGGDSGDV 528
Query: 540 NEGKTIGFNSAHGRIRSRKMLSGDELVISAEGRSFVDGDIKEIEAGGGGYRRGASEDFIV 599
E + GF+ +GR R+ ++ S DEL++S EG ++DG+IKEIEAGGG YRRG SEDF++
Sbjct: 529 YEQRGAGFDPVNGRSRAIRLQSNDELMMSGEGGRYMDGEIKEIEAGGGRYRRGTSEDFML 588
Query: 600 YGQEKPMDSGSYLDPLAEGQYKSPTLMEKNMHSVADESFMIPVRSNSQDNLGPESCTAID 659
YGQE+ MD S LDPLAE +Y++P ++KN ++ ADESF+IP+RS SQDN+GPE ID
Sbjct: 589 YGQERSMDRRSALDPLAEARYRNPNQVDKNGYAAADESFIIPLRSGSQDNVGPEYRATID 648
Query: 660 IDVELPGTVKKTTDAKAGDQLFYEPDELMPEREYEDVTYGYDPAMDYDSQMQIQPAIMVE 719
IDVELP KKT+D KAG QLFYEPDELMPER ED ++GYDPAMDY+S M ++ A+ VE
Sbjct: 649 IDVELPTNTKKTSDGKAGTQLFYEPDELMPERGSEDASFGYDPAMDYESNMMVR-AVKVE 707
Query: 720 DANADDVSLGVEGEVKKLEKDKKLRLQECLDK-KKDASARRLPSSKTRLTDAQKRAQNLR 778
D+N +DVS +G+VKK EK+K ++ DK KKDA RRL + KT L DAQKRAQN+R
Sbjct: 708 DSNDEDVSHSNDGDVKKPEKEKIRGTKDGSDKRKKDAILRRLSAPKTPLNDAQKRAQNMR 767
Query: 779 AYKAD 783
AYKAD
Sbjct: 768 AYKAD 772
>Os07g0616000 Conserved hypothetical protein
Length = 1365
Score = 141 bits (356), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 84/236 (35%), Positives = 121/236 (51%), Gaps = 40/236 (16%)
Query: 1 MRPETRLDSAVFQLTPTRTRCDLVVIANGRKEKIASGLLNPFVAHLKVAQQQIAKGGYSI 60
M P+ LD A+FQL+P R+RC+LVV NGR E+IASG + PFVAHL+ A++Q A
Sbjct: 1 MEPDAPLDFALFQLSPRRSRCELVVSGNGRTERIASGSVKPFVAHLRAAEEQAAAQPPPP 60
Query: 61 TLEVDPEIDAPWFTRGTVERFVRFVSTPEVLERVTTIESEILQIEDAITGQGGDNLGLRS 120
+ + + A WF++GT+ERFVRFVSTPEVLE T ++E+ Q+E G R
Sbjct: 61 AIRLQLDRRAAWFSKGTLERFVRFVSTPEVLEMANTFDAEMSQLE-----------GARK 109
Query: 121 VEDYNEKLAECIGGSKTNYDLDGDKSLILYKPGIQPPPPVQNDNATQEENSKVQLLRVLE 180
+ Y G+ A + K +LLR ++
Sbjct: 110 I----------------------------YAQGVAGGADGAESAAAADITKK-ELLRAID 140
Query: 181 TRKIVLRKEQXXXXXXXXXXGFNIDNLGFLITFADRFGASRLMKACTQFTELWRRK 236
R L+++ GFN D++ L+ FAD FGA+RL +AC +F L +R+
Sbjct: 141 VRLSALKQDLVTACARASSAGFNPDSVSELVLFADHFGANRLSEACNKFMSLCQRR 196
Database: rap3
Posted date: Nov 19, 2010 6:03 PM
Number of letters in database: 17,035,801
Number of sequences in database: 52,214
Lambda     K        H
 0.311     0.129    0.360
Gapped
Lambda     K        H
 0.267     0.0410   0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 52214
Number of Hits to DB: 38,668,046
Number of extensions: 1615518
Number of successful extensions: 3943
Number of sequences better than 1.0e-10: 3
Number of HSP's gapped: 3931
Number of HSP's successfully gapped: 3
Length of query: 1256
Length of database: 17,035,801
Length adjustment: 112
Effective length of query: 1144
Effective length of database: 11,187,833
Effective search space: 12798880952
Effective search space used: 12798880952
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 162 (67.0 bits)