BLASTP 2.2.23 [Feb-03-2010]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment:
Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala,
Aleksandr Morgulis, Alejandro A. Schäffer, and Yi-Kuo Yu (2005) "Protein database
searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Os01g0845000 Os01g0845000|AK106438
(300 letters)
Database: rap3
52,214 sequences; 17,035,801 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
Os01g0845000 Protein of unknown function DUF668 family protein 572 e-163
Os05g0457600 280 7e-76
Os03g0270500 Protein of unknown function DUF668 family protein 178 4e-45
Os04g0169500 Protein of unknown function DUF668 family protein 100 2e-21
Os03g0858600 Protein of unknown function DUF668 family protein 97 2e-20
Os12g0146500 Protein of unknown function DUF668 family protein 77 1e-14
>Os01g0845000 Protein of unknown function DUF668 family protein
Length = 300
Score = 572 bits (1474), Expect = e-163, Method: Compositional matrix adjust.
Identities = 285/300 (95%), Positives = 285/300 (95%)
Query: 1 MVVQKMWMPVDLRSLQQDLKAQRIKVRRLKEESLWNQSYEKAVGLMARAACAVFSRICTI 60
MVVQKMWMPVDLRSLQQDLKAQRIKVRRLKEESLWNQSYEKAVGLMARAACAVFSRICTI
Sbjct: 1 MVVQKMWMPVDLRSLQQDLKAQRIKVRRLKEESLWNQSYEKAVGLMARAACAVFSRICTI 60
Query: 61 FGAFVPGLPPPLPSAATDSVQTRLSKLLNPRAVRAKASSGPITRRDGPSRVHPPVISSSC 120
FGAFVPGLPPPLPSAATDSVQTRLSKLLNPRAVRAKASSGPITRRDGPSRVHPPVISSSC
Sbjct: 61 FGAFVPGLPPPLPSAATDSVQTRLSKLLNPRAVRAKASSGPITRRDGPSRVHPPVISSSC 120
Query: 121 PIIGLRPSGQKAAIDWRKLLDAPASTVGGAGLDQQYANVIVSAEQLLQMEAEGRQEEANA 180
PIIGLRPSGQKAAIDWRKLLDAPASTVGGAGLDQQYANVIVSAEQLLQMEAEGRQEEANA
Sbjct: 121 PIIGLRPSGQKAAIDWRKLLDAPASTVGGAGLDQQYANVIVSAEQLLQMEAEGRQEEANA 180
Query: 181 ERAEMYEMLPAKLRAAVRSKLRDWWRDPGPLDAGLAQGWKDAVDRIMAWLGPMARDTVQW 240
ERAEMYEMLPAKLRAAVRSKLRDWWRDPGPLDAGLAQGWKDAVDRIMAWLGPMARDTVQW
Sbjct: 181 ERAEMYEMLPAKLRAAVRSKLRDWWRDPGPLDAGLAQGWKDAVDRIMAWLGPMARDTVQW 240
Query: 241 QAERNMDRTRRFDGGARVYALQTLRWADXXXXXXXXXXXXXXXSCVCWYEERRRGSVRLL 300
QAERNMDRTRRFDGGARVYALQTLRWAD SCVCWYEERRRGSVRLL
Sbjct: 241 QAERNMDRTRRFDGGARVYALQTLRWADKEKAEAALVEVLVALSCVCWYEERRRGSVRLL 300
>Os05g0457600
Length = 501
Score = 280 bits (717), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 175/305 (57%), Positives = 210/305 (68%), Gaps = 31/305 (10%)
Query: 14 SLQQDLKAQRIKVRRLKEESLWNQSYEKAVGLMARAACAVFSRICTIFGAFVPGLPPPLP 73
SL+ ++AQR +VRRLKEESLW+QSYEKAV LMARAACAVF R+C +FGA VPGLPPPLP
Sbjct: 198 SLRLGIRAQRARVRRLKEESLWSQSYEKAVILMARAACAVFVRVCVVFGAHVPGLPPPLP 257
Query: 74 SAATDSVQTRLSK-LLNPRAVRA--KASSGPITRRDGPSRVHPPVISSSCPIIGLRPSGQ 130
A ++V +RLSK LL+P + A ++ SGPI RRD P R+ + S+SCPII R Q
Sbjct: 258 PA--EAVHSRLSKLLLHPMSAAAQPRSLSGPIQRRDVPLRIE--MSSNSCPII--RSHCQ 311
Query: 131 K------AAIDWRKLLDAPASTVGGAGLDQQYANVIVSAEQLL--QMEAEGRQEEANAER 182
+ +DWRKLL+ P TVGGAGLD QYANVI +AE+LL AEGRQ A R
Sbjct: 312 QPWQTSPPGVDWRKLLEPPPGTVGGAGLDLQYANVITTAERLLLETDHAEGRQRHEEA-R 370
Query: 183 AEMYEMLPAKLRAAVRSKLRDWWRDPGP-------LDAGLAQGWKDAVDRIMAWLGPMAR 235
AE+Y MLP+KLRAAVR+KLR WWR+ G LDAGLA+GW+ A RI+AWL PMAR
Sbjct: 371 AELYAMLPSKLRAAVRAKLRGWWRERGAAAAVAVELDAGLAEGWRSAAGRILAWLAPMAR 430
Query: 236 DTVQWQAERNMDRTRRFD------GGARVYALQTLRWADXXXXXXXXXXXXXXXSCVCWY 289
DT +W AER++DR RRF+ G AR +ALQTLRWAD SCV WY
Sbjct: 431 DTARWHAERSLDRQRRFEVGGGGGGSARAWALQTLRWADAEKAEAAVVEVLVALSCVGWY 490
Query: 290 EERRR 294
+ERRR
Sbjct: 491 DERRR 495
>Os03g0270500 Protein of unknown function DUF668 family protein
Length = 286
Score = 178 bits (452), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 113/286 (39%), Positives = 157/286 (54%), Gaps = 31/286 (10%)
Query: 15 LQQDLKAQRIKVRRLKEESLWNQSYEKAVGLMARAACAVFSRICTIFGAFVPGLPPPLPS 74
L ++L +QR KVRRL E SLW+ + K LMA++ AV +RI FGA+VPGLP
Sbjct: 17 LIRELNSQRQKVRRLMESSLWSVAAHKVSKLMAKSVLAVLARISITFGAYVPGLP----- 71
Query: 75 AATDSVQTRLSKLLNPRAVRAKASSGPITRRDGPSRVHPPVISSSCPIIGLRPSGQKAAI 134
L RA + +SGP+ + P+ I S PI + + A+
Sbjct: 72 -----------LLTVGRAWALRRTSGPLQQAASPA----AAIRHSAPIFRQKDTAFSAS- 115
Query: 135 DWRKLLDAPASTVGGAGLDQQYANVIVSAEQLL-QMEAEGRQEEANA-----ERAEMYEM 188
+ + PASTVGG+G++ +YAN+IV AE LL Q+ E +A +R E+Y+M
Sbjct: 116 ---ESIKPPASTVGGSGMELRYANMIVCAEMLLRQLWPTIHSNEVDAGMDLSKRDELYKM 172
Query: 189 LPAKLRAAVRSKLRDWWRDPGPLDAGLAQGWKDAVDRIMAWLGPMARDTVQWQAERNMDR 248
LP +R AV++KLR+ WR P+D A DAVDR++ WLGPMA DTV+W E +M+R
Sbjct: 173 LPVTIRTAVKAKLRESWRG-QPVDEAAAAASMDAVDRMLRWLGPMAHDTVRWHDEHSMER 231
Query: 249 TRRFDGGARVYALQTLRWADXXXXXXXXXXXXXXXSCVCWYEERRR 294
+RF RV +QTL +AD SCVCWY++ RR
Sbjct: 232 AQRFSMRPRVLMVQTLHFADRHKAENVIVEVLIGLSCVCWYDDERR 277
>Os04g0169500 Protein of unknown function DUF668 family protein
Length = 514
Score = 100 bits (248), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 52/152 (34%), Positives = 86/152 (56%), Gaps = 5/152 (3%)
Query: 142 APASTVGGAGLDQQYANVIVSAEQLLQMEAEGRQEEANAERAEMYEMLPAKLRAAVRSKL 201
AP ST+GG+ L YAN+++ E+LL+ +E R ++Y+MLP+ LRAA+R L
Sbjct: 347 APPSTIGGSALALHYANIVIIIEKLLRYPHLVGEEA----RDDLYQMLPSSLRAALRKSL 402
Query: 202 RDWWRDPGPLDAGLAQGWKDAVDRIMAWLGPMARDTVQWQAERNMDRTRRFDGGARVYAL 261
+ + + DA LA W++ +++ + WL PMA + ++WQAERN ++ + G V L
Sbjct: 403 KTYVKSMAIYDAFLAHDWRETLEKTLTWLAPMAHNMIRWQAERNFEQQQIVLKG-NVLLL 461
Query: 262 QTLRWADXXXXXXXXXXXXXXXSCVCWYEERR 293
QTL +AD + +C YE+++
Sbjct: 462 QTLYFADREKTEAVICELLVGLNYICRYEQQQ 493
>Os03g0858600 Protein of unknown function DUF668 family protein
Length = 475
Score = 96.7 bits (239), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 84/267 (31%), Positives = 134/267 (50%), Gaps = 43/267 (16%)
Query: 16 QQDLKAQRIKVRRLKEESLWNQSYEKAVGLMARAACAVFSRICTIFGAFVPGLPPPLPSA 75
QQ + A++ +V+ LK+ SLW+ +++ V +ARAA + +RI +FGA P P
Sbjct: 199 QQLIFAKKQEVKHLKQTSLWSSTFDAVVSSLARAAFTILARIKLVFGAAHDHRPTTTPLH 258
Query: 76 ATDSVQTRLSKLLNPRAVRAKASSGPITRRDGPSRVHPPVISSSCPI-IGL--------- 125
+ + LS ++P +V + V PPV S + +G+
Sbjct: 259 RSLT----LSSAVHPSSVDVQ--------------VQPPVSRKSMSMDMGMGEALYLERQ 300
Query: 126 RPSG--QKAAIDWRKLLDAPASTVGGAGLDQQYANVIVSAEQLLQMEAEGRQEEANAERA 183
R SG +++A L P T+G A L +YA VI+S E++ A + ER
Sbjct: 301 RQSGLLERSA----AALVPPPGTLGAAALAPRYAWVIISIERM----ARSPRLVGAEERD 352
Query: 184 EMYEMLPAKLRAAVRSKLRDWWRDPGPLDAGLAQGWKDAVDRIMAWLGPMARDTVQWQAE 243
E+Y ML A +RA +R++L P GLA W+ AV I+ WL PMA TV+WQAE
Sbjct: 353 ELYGMLTASVRAQLRARLSGTVAAAEP---GLAGQWRAAVGGILEWLAPMAHATVRWQAE 409
Query: 244 RNMDRTRRF--DGGARVYALQTLRWAD 268
R++++ R+ + + +QTL+ A+
Sbjct: 410 RSLEQQRKTTREMETQTLVVQTLQMAE 436
>Os12g0146500 Protein of unknown function DUF668 family protein
Length = 261
Score = 77.4 bits (189), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 48/148 (32%), Positives = 80/148 (54%), Gaps = 15/148 (10%)
Query: 130 QKAAIDWRKLLDAPA-STVGGAGLDQQYANVIVSAEQLLQMEAEGRQEEANAERAEMYEM 188
+ ++ +W +++APA +T+G A L YAN+I+ E+L A + ER +Y M
Sbjct: 66 ESSSHNW--VMNAPAVTTLGAAALALHYANLIIFIEKL----AVAPRHICPDERDALYNM 119
Query: 189 LPAKLRAAVRSKLRDWWRD--------PGPLDAGLAQGWKDAVDRIMAWLGPMARDTVQW 240
L ++RA++R++LR ++ D +A W V RI+ WL P+A + ++W
Sbjct: 120 LTDRIRASLRARLRPIAKNMAASSSSSSSACDPAMAAEWSGTVQRILGWLAPLAHNMLRW 179
Query: 241 QAERNMDRTRRFDGGARVYALQTLRWAD 268
Q+ERN ++ G V LQTL +AD
Sbjct: 180 QSERNFEQRNVASSGTGVLLLQTLHFAD 207
Database: rap3
Posted date: Nov 19, 2010 6:03 PM
Number of letters in database: 17,035,801
Number of sequences in database: 52,214
Lambda K H
0.320 0.133 0.414
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 52214
Number of Hits to DB: 10,517,351
Number of extensions: 443205
Number of successful extensions: 1847
Number of sequences better than 1.0e-10: 6
Number of HSP's gapped: 1833
Number of HSP's successfully gapped: 6
Length of query: 300
Length of database: 17,035,801
Length adjustment: 100
Effective length of query: 200
Effective length of database: 11,814,401
Effective search space: 2362880200
Effective search space used: 2362880200
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 156 (64.7 bits)