BLASTP 2.2.23 [Feb-03-2010]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment:
Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala,
Aleksandr Morgulis, Alejandro A. Schäffer, and Yi-Kuo Yu (2005) "Protein database
searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Os08g0121900 Os08g0121900|AK101512
(584 letters)
Database: rap3
52,214 sequences; 17,035,801 total letters
Searching..................................................done
                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value
Os08g0121900 Protein of unknown function DUF23 family protein      1105 0.0
Os06g0727900 Protein of unknown function DUF23 family protein       290 2e-78
>Os08g0121900 Protein of unknown function DUF23 family protein
Length = 584
Score = 1105 bits (2857), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 538/584 (92%), Positives = 538/584 (92%)
Query: 1 MALAAKEXXXXXXXXXXXXXXXXXXXXXXXXXXXPAGTQRRXXXXXXXXXXXXXXXXXXX 60
MALAAKE PAGTQRR
Sbjct: 1 MALAAKERKLSRLGSKGSGGGGGGGSFGARGQRAPAGTQRRLFAAFFAFLFAGAVLFGAA 60
Query: 61 HVIGASFRPVLKTAWPSATLNAVSSERGAQQAGMVSVDAVLPSVHIQHAVALPDHVLLML 120
HVIGASFRPVLKTAWPSATLNAVSSERGAQQAGMVSVDAVLPSVHIQHAVALPDHVLLML
Sbjct: 61 HVIGASFRPVLKTAWPSATLNAVSSERGAQQAGMVSVDAVLPSVHIQHAVALPDHVLLML 120
Query: 121 RDGSLLPASGQFECLYSPVNSSQLRRQPLSVATLPDGPSLVHCPAGPSRVAVSLSLAQSV 180
RDGSLLPASGQFECLYSPVNSSQLRRQPLSVATLPDGPSLVHCPAGPSRVAVSLSLAQSV
Sbjct: 121 RDGSLLPASGQFECLYSPVNSSQLRRQPLSVATLPDGPSLVHCPAGPSRVAVSLSLAQSV 180
Query: 181 PVAPLQWDRLVYTALIDSKDNSTVVFAKGMNLRPGRLGVPSRYECVFGRDFSKPKLVVTS 240
PVAPLQWDRLVYTALIDSKDNSTVVFAKGMNLRPGRLGVPSRYECVFGRDFSKPKLVVTS
Sbjct: 181 PVAPLQWDRLVYTALIDSKDNSTVVFAKGMNLRPGRLGVPSRYECVFGRDFSKPKLVVTS 240
Query: 241 PVVSAAQEIFRCVTPVRIRRYLRMTTGGKNSVNNDDKPMLVSIRTKGRGSSTLPSIAQPE 300
PVVSAAQEIFRCVTPVRIRRYLRMTTGGKNSVNNDDKPMLVSIRTKGRGSSTLPSIAQPE
Sbjct: 241 PVVSAAQEIFRCVTPVRIRRYLRMTTGGKNSVNNDDKPMLVSIRTKGRGSSTLPSIAQPE 300
Query: 301 PLPRYNKHWRRKAHSMCVCTMLRNQARFLREWIIYHSRIGVQRWFIYDNNSDDGIEEVLN 360
PLPRYNKHWRRKAHSMCVCTMLRNQARFLREWIIYHSRIGVQRWFIYDNNSDDGIEEVLN
Sbjct: 301 PLPRYNKHWRRKAHSMCVCTMLRNQARFLREWIIYHSRIGVQRWFIYDNNSDDGIEEVLN 360
Query: 361 TMDSSRYNVTRYLWPWMKSQEAGFAHCALRARESCEWVGFIDIDEFLHFPGNQTLQDVLR 420
TMDSSRYNVTRYLWPWMKSQEAGFAHCALRARESCEWVGFIDIDEFLHFPGNQTLQDVLR
Sbjct: 361 TMDSSRYNVTRYLWPWMKSQEAGFAHCALRARESCEWVGFIDIDEFLHFPGNQTLQDVLR 420
Query: 421 NYSVKPRIGELRTACHSFGPSGRTKIPKKGVTTGYTCRLAAPERHKSIVRPDALNPSLIN 480
NYSVKPRIGELRTACHSFGPSGRTKIPKKGVTTGYTCRLAAPERHKSIVRPDALNPSLIN
Sbjct: 421 NYSVKPRIGELRTACHSFGPSGRTKIPKKGVTTGYTCRLAAPERHKSIVRPDALNPSLIN 480
Query: 481 VVHHFHLKEGMKYVNIGQGMMLINHYKYQVWEVFKDKFSGRVATYVADWQDEENVGSRDR 540
VVHHFHLKEGMKYVNIGQGMMLINHYKYQVWEVFKDKFSGRVATYVADWQDEENVGSRDR
Sbjct: 481 VVHHFHLKEGMKYVNIGQGMMLINHYKYQVWEVFKDKFSGRVATYVADWQDEENVGSRDR 540
Query: 541 APGLGTKPVEPEDWPRRFCEVYDNGLKDFVQKVFTDPHTGNLPW 584
APGLGTKPVEPEDWPRRFCEVYDNGLKDFVQKVFTDPHTGNLPW
Sbjct: 541 APGLGTKPVEPEDWPRRFCEVYDNGLKDFVQKVFTDPHTGNLPW 584
>Os06g0727900 Protein of unknown function DUF23 family protein
Length = 540
Score = 290 bits (742), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 179/489 (36%), Positives = 264/489 (53%), Gaps = 46/489 (9%)
Query: 109 AVALPDHVLLMLRDGSLLPASGQFECLYSPVNSSQLR---RQPLS-----VATLPDGPSL 160
AV LPD +L+L + + C + SS R R P S +P+ P+
Sbjct: 65 AVLLPDWEVLLLLHPNATAIAHNATCAFQGAASSPARALGRLPSSGRHAYTCAMPE-PAR 123
Query: 161 VHCPAGPSRVAVSLSLAQSVP------VAPLQWD-RLVYTALIDSKDNSTVVFAKGMNLR 213
H P R+ V++ + P V ++W RLVY +++ +VFAKG+N R
Sbjct: 124 RHQPFHAPRI-VAMDAVHASPHDDDELVMMVKWSGRLVYDSVVVDG-GDVLVFAKGVNPR 181
Query: 214 PGRLGVPSRYECVF--GRDFSKPKLVVTSPVVSAAQEIFRCVTPVRIRRYLRMTTGGKNS 271
G S CV+ GR S +V + P ++AQ++FRC LR+T +
Sbjct: 182 QGVNRPASDVRCVYYRGRGGSADDVVASLPAATSAQQVFRCPP-PPPAALLRVTL----A 236
Query: 272 VNNDDKPMLVSIRTKGRGSSTLPSIAQPEPLPRYNKHWRRKAHSMCVCTMLRNQARFLRE 331
+ +++P +PS+A P ++ H +C CTM+R+ +F+RE
Sbjct: 237 LAGEEEP--------------IPSVATYSLPPASAAATHKRRHKICACTMVRDVGKFVRE 282
Query: 332 WIIYHSRIGVQRWFIYDNNSDDGIEEVLNTMDSSRYNVTRYLWPWMKSQEAGFAHCALRA 391
W+ YH+ +GV R+ +YDN S+D ++E + + + +VT WPW K+QEAGF+H A
Sbjct: 283 WVAYHAAVGVGRFILYDNGSEDDLDEQVRRLTAEGMDVTTLAWPWPKTQEAGFSHSAAVH 342
Query: 392 RESCEWVGFIDIDEFLHFPGNQTL----QDVLRNY-SVKPRIGELRTACHSFGPSGRTKI 446
R++CEW+ FID+DEF+ P T +LR+ +VKP +G++ C FGPSGRT
Sbjct: 343 RDACEWMAFIDVDEFIFSPNWATAASPSSSMLRSIVAVKPDVGQVSLGCVDFGPSGRTTH 402
Query: 447 PKKGVTTGYTCRLAAPERHKSIVRPDALNPSLINVVHHFHLKEGMKYVNIGQGMMLINHY 506
P +GVT GYTCR A ERHKS++R +A SL+N VHHF L+EG + +NHY
Sbjct: 403 PPEGVTQGYTCRRRAVERHKSLLRLEAAERSLVNSVHHFELREGKR--GEWNRRARVNHY 460
Query: 507 KYQVWEVFKDKFSGRVATYVADWQDEENVGSRDRAPGLGTKPVEPEDWPRRFCEVYDNGL 566
K+Q W+ F+ KF RV+ YVADW N+ S+DR PGLG PV+P W +FCEV D L
Sbjct: 461 KFQAWDEFRLKFRRRVSAYVADWTHRVNLQSKDRTPGLGFDPVQPAGWAAKFCEVNDTLL 520
Query: 567 KDFVQKVFT 575
+D ++ F
Sbjct: 521 RDVTRRWFA 529
Database: rap3
Posted date: Nov 19, 2010 6:03 PM
Number of letters in database: 17,035,801
Number of sequences in database: 52,214
Lambda     K        H
 0.321     0.135    0.427
Gapped
Lambda     K        H
 0.267     0.0410   0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 52214
Number of Hits to DB: 19,883,415
Number of extensions: 896113
Number of successful extensions: 1783
Number of sequences better than 1.0e-10: 2
Number of HSP's gapped: 1779
Number of HSP's successfully gapped: 2
Length of query: 584
Length of database: 17,035,801
Length adjustment: 106
Effective length of query: 478
Effective length of database: 11,501,117
Effective search space: 5497533926
Effective search space used: 5497533926
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 159 (65.9 bits)
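The search-space and score figures reported above can be cross-checked with a few lines of arithmetic. The sketch below is only an illustration, not part of the BLAST output: the function names are hypothetical, and it assumes the standard Karlin-Altschul relations (effective length = length minus the length adjustment, applied once per database sequence; bit score = (lambda*S - ln K) / ln 2; E = K*m*n*exp(-lambda*S)) together with the rounded gapped Lambda and K printed in this report. Compositional matrix adjustment may rescale scores internally, so the results should be read as approximate.

```python
import math

# Values copied from the report above (gapped Lambda and K are rounded there).
QUERY_LEN = 584
DB_LETTERS = 17_035_801
DB_SEQS = 52_214
LENGTH_ADJUSTMENT = 106
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def effective_search_space(query_len, db_letters, db_seqs, adjustment):
    """Return (effective query length, effective database length, search space)."""
    eff_query = query_len - adjustment
    # The adjustment is subtracted once for every sequence in the database.
    eff_db = db_letters - db_seqs * adjustment
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, lam, k, search_space):
    """Approximate E-value for a raw score in the given search space."""
    return k * search_space * math.exp(-lam * raw_score)


if __name__ == "__main__":
    eff_q, eff_db, space = effective_search_space(
        QUERY_LEN, DB_LETTERS, DB_SEQS, LENGTH_ADJUSTMENT
    )
    print("Effective length of query:", eff_q)
    print("Effective length of database:", eff_db)
    print("Effective search space:", space)
    # Raw scores of the two hits in this report.
    for raw in (2857, 742):
        bits = bit_score(raw, GAPPED_LAMBDA, GAPPED_K)
        e_val = expect_value(raw, GAPPED_LAMBDA, GAPPED_K, space)
        print(f"raw={raw} bits={bits:.1f} E={e_val:.1e}")
```

With the numbers from this report, the three effective-length figures reproduce the reported values exactly, and the raw scores 2857 and 742 land close to the reported 1105 and 290 bits.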