BLASTP 2.2.23 [Feb-03-2010]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment:
Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala,
Aleksandr Morgulis, Alejandro A. Schäffer, and Yi-Kuo Yu (2005) "Protein database
searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Os09g0552400 Os09g0552400|Os09g0552400
(325 letters)
Database: rap3
52,214 sequences; 17,035,801 total letters
Searching..................................................done
                                                            Score    E
Sequences producing significant alignments:                 (bits) Value

Os09g0552400 Cupin, RmlC-type domain containing protein       536   e-153
Os09g0552600 Cupin 1 domain containing protein                476   e-134
Os09g0552500 Cupin 1 domain containing protein                450   e-127
Os05g0116000 11-S plant seed storage protein family protein   128   7e-30
Os01g0976200 11-S plant seed storage protein family protein   115   4e-26
>Os09g0552400 Cupin, RmlC-type domain containing protein
Length = 325
Score = 536 bits (1381), Expect = e-153, Method: Compositional matrix adjust.
Identities = 277/325 (85%), Positives = 277/325 (85%)
Query: 1   MAAPDMSPKAGKPLVQNDAGSYLAWSGKDQPTLAGEKLGCGLLVLKPLGFALPHYADSGK 60
           MAAPDMSPKAGKPLVQNDAGSYLAWSGKDQPTLAGEKLGCGLLVLKPLGFALPHYADSGK
Sbjct: 1   MAAPDMSPKAGKPLVQNDAGSYLAWSGKDQPTLAGEKLGCGLLVLKPLGFALPHYADSGK 60

Query: 61  FGYVLGGSAVVGVLPVGVDARERVVRLEAADVIAMRAGEVTWCPGDFSYFILAGPMSVXX 120
           FGYVLGGSAVVGVLPVGVDARERVVRLEAADVIAMRAGEVTWCPGDFSYFILAGPMSV
Sbjct: 61  FGYVLGGSAVVGVLPVGVDARERVVRLEAADVIAMRAGEVTWCPGDFSYFILAGPMSVLG 120

Query: 121 XXXXXXXXXXXXXXXPEQAATAFRSQPAALLTRLSRKLHGVRPREHDRHGIVVNAARVPP 180
                          PEQAATAFRSQPAALLTRLSRKLHGVRPREHDRHGIVVNAARVPP
Sbjct: 121 GLDAGLLATASGLTSPEQAATAFRSQPAALLTRLSRKLHGVRPREHDRHGIVVNAARVPP 180

Query: 181 DSTGGKTVTAAHLPXXXXXXXXXXXXXXXXXXXVRGPWVLRDAAAQAVYVARGSGRVQVA 240
           DSTGGKTVTAAHLP                   VRGPWVLRDAAAQAVYVARGSGRVQVA
Sbjct: 181 DSTGGKTVTAAHLPALAQLGLSVGLALLDAGAAVRGPWVLRDAAAQAVYVARGSGRVQVA 240

Query: 241 SAGGASTLLDAEVAAGSLLVVPRYXXXXXXXXXXXXMELVSLIKSSRPAMEHFTGKGSVI 300
           SAGGASTLLDAEVAAGSLLVVPRY            MELVSLIKSSRPAMEHFTGKGSVI
Sbjct: 241 SAGGASTLLDAEVAAGSLLVVPRYAVALVAADDAGGMELVSLIKSSRPAMEHFTGKGSVI 300

Query: 301 GGLTPEIVQAALNVSPELVEQLRTK 325
           GGLTPEIVQAALNVSPELVEQLRTK
Sbjct: 301 GGLTPEIVQAALNVSPELVEQLRTK 325
>Os09g0552600 Cupin 1 domain containing protein
Length = 354
Score = 476 bits (1224), Expect = e-134, Method: Compositional matrix adjust.
Identities = 258/353 (73%), Positives = 263/353 (74%), Gaps = 28/353 (7%)
Query: 1   MAAPDMSPKAGKPLVQNDAGSYLAWSGKDQPTLAGEKLGCGLLVLKPLGFALPHYADSGK 60
           MAA DMSPKAGKPLV+NDAGSYLAWSGKDQP +AGEKLGCGLLVLKPLGFALPHYADSGK
Sbjct: 1   MAATDMSPKAGKPLVENDAGSYLAWSGKDQPAVAGEKLGCGLLVLKPLGFALPHYADSGK 60

Query: 61  FGYVLGGSAVVGVLPVGVDARERVVRLEAADVIAMRAGEVTW------------------ 102
           FGYVLGGSAVVGVLP GVDARERVVRLEA DVIAMRAGEVTW
Sbjct: 61  FGYVLGGSAVVGVLPAGVDARERVVRLEAGDVIAMRAGEVTWWYNDTDGEDVTIVFMGDT 120

Query: 103 ----CPGDFSYFILAGPMSVXXXXXXXXXXXXXXXXXPEQAATAFRSQPAALLTRLSRKL 158
                PGD SYF+LAGPM V                 PEQAATAFRSQPAALLTRL+ KL
Sbjct: 121 AGAVSPGDISYFVLAGPMGVLGGLDAGLLAKASGLTSPEQAATAFRSQPAALLTRLNGKL 180

Query: 159 HGVRPREHDRHGIVVNAARVPPDS-TGG-----KTVTAAHLPXXXXXXXXXXXXXXXXXX 212
           HGVRPREHDRHG+VVNAARVP DS TGG     KTVTAAHLP
Sbjct: 181 HGVRPREHDRHGLVVNAARVPADSNTGGAAAGTKTVTAAHLPVLAQLGFSVGLTRLDAGA 240

Query: 213 XVRGPWVLRDAAAQAVYVARGSGRVQVASAGGASTLLDAEVAAGSLLVVPRYXXXXXXXX 272
            VRGPWVLRDAAAQAVYVARGSGRVQVA AGGASTLLDAEVAAGSLLVVPRY
Sbjct: 241 AVRGPWVLRDAAAQAVYVARGSGRVQVAGAGGASTLLDAEVAAGSLLVVPRYGVSLAAAD 300

Query: 273 XXXXMELVSLIKSSRPAMEHFTGKGSVIGGLTPEIVQAALNVSPELVEQLRTK 325
               MELVSLIKS RPA EHFTGKGSVIGGLT EIVQAALNVSPE VEQLRTK
Sbjct: 301 DAGGMELVSLIKSPRPATEHFTGKGSVIGGLTAEIVQAALNVSPEFVEQLRTK 353
>Os09g0552500 Cupin 1 domain containing protein
Length = 350
Score = 450 bits (1158), Expect = e-127, Method: Compositional matrix adjust.
Identities = 246/344 (71%), Positives = 253/344 (73%), Gaps = 29/344 (8%)
Query: 8   PKAGKPLVQNDAGSYLAWSGKDQPTLAGEKLGCGLLVLKPLGFALPHYADSGKFGYVLGG 67
           PKAGKPLV+NDAGSYLAWSGK+QP LAGEKLGCGLLVLKPLGFALPHYADSGKFGYVLGG
Sbjct: 5   PKAGKPLVENDAGSYLAWSGKNQPALAGEKLGCGLLVLKPLGFALPHYADSGKFGYVLGG 64

Query: 68  SAVVGVLPVGVDARERVVRLEAADVIAMRAGEVTW----------------------CPG 105
           SAVVGVLPVG+DARERVVRLEA DVIAMRAGEVTW                       PG
Sbjct: 65  SAVVGVLPVGLDARERVVRLEAGDVIAMRAGEVTWWYNDADGEDVTIVFMGDTARAASPG 124

Query: 106 DFSYFILAGPMSVXXXXXXXXXXXXXXXXXPEQAATAFRSQPAALLTRLSRKLHGVRPRE 165
           D SYF+LAGPM V                 PEQAATAFRSQPA LLTRLSRKL  VRPRE
Sbjct: 125 DISYFVLAGPMGVLGGLDAGLLATASGLTSPEQAATAFRSQPAVLLTRLSRKLQDVRPRE 184

Query: 166 HDRHGIVVNAARVPPDST------GGKTVTAAHLPXXXXXXXXXXXXXXXXXXXVRGPWV 219
           HDRHGIVVNAAR+P DS+      G K VTAAHLP                   VRGPWV
Sbjct: 185 HDRHGIVVNAARMPADSSTGGAAAGTKIVTAAHLPVLGQLGFSVGLTPLDAGAAVRGPWV 244

Query: 220 LRDAAAQAVYVARGSGRVQVASAGGASTLLDAEVAAGSLLVVPRYXXXXXXXXXXXXMEL 279
           LRDAAAQAVYVARGSGRVQVA AGGASTLLDAE AAGSLLVVPRY            MEL
Sbjct: 245 LRDAAAQAVYVARGSGRVQVAGAGGASTLLDAEAAAGSLLVVPRY-AVALVGVDAGGMEL 303

Query: 280 VSLIKSSRPAMEHFTGKGSVIGGLTPEIVQAALNVSPELVEQLR 323
           VSLIKS RPAM+ FTGKGSVIGGLTPEIVQAALNVSPELVEQLR
Sbjct: 304 VSLIKSPRPAMKQFTGKGSVIGGLTPEIVQAALNVSPELVEQLR 347
>Os05g0116000 11-S plant seed storage protein family protein
Length = 359
Score = 128 bits (321), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 100/354 (28%), Positives = 147/354 (41%), Gaps = 35/354 (9%)
Query: 1   MAAPDMSPKAGKPLVQNDAGSYLAWSGKDQPTLAGEKLGCGLLVLKPLGFALPHYADSGK 60
           MA+ D++P+  +     D G+Y  WS  D P L    +G   L L   G ALP ++DSGK
Sbjct: 1   MASVDLTPRQARKAYGGDGGTYYEWSPADLPMLELANIGGAKLSLNAGGLALPSFSDSGK 60

Query: 61  FGYVLGGSAVVG-VLPVGVDARERVVRLEAADVIAMRAGEVTWC---------------- 103
             YVL G    G VLP    ++E+V+ ++  D +A+  G VTW
Sbjct: 61  VAYVLQGKGTCGIVLPEA--SKEKVIAVKEGDSLALPFGVVTWWHNLPESPIELVILFLG 118

Query: 104 -------PGDFSYFILAGPMSVXXXXXXXXXXXXXXXXXPEQAATAFRSQPAALLTRLSR 156
                   G F+   L G   +                  + A     SQPA+ + ++
Sbjct: 119 DTSKAHKAGQFTNMQLTGATGIFTGFSTEFVGRAWDLAESD-AVKLVSSQPASGIVKIKS 177

Query: 157 KLHGVRPREHDRHGIVVNAARVPPD---STGGKTV--TAAHLPXXXXXXXXXXXXXXXXX 211
                 P   DR G+ +N    P D     GG+ V    A+LP
Sbjct: 178 GQKLPEPSAADREGMALNCLEAPLDVDIKNGGRVVVLNTANLPMVKEVGLGADLVRIDGH 237

Query: 212 XXVRGPWVLRDAAAQAVYVARGSGRVQVASAGGASTLLDAEVAAGSLLVVPRYXXXXXXX 271
                P    D+A Q  Y  RGSGRVQV  A G   +LD  V  G+L +VPR+
Sbjct: 238 SMCS-PGFSCDSAYQVTYFIRGSGRVQVVGADG-KRVLDTHVEGGNLFIVPRFCVVSKIA 295

Query: 272 XXXXXMELVSLIKSSRPAMEHFTGKGSVIGGLTPEIVQAALNVSPELVEQLRTK 325
                ++  S+I +  P   H  GK SV   ++PE+++A+ N +PE+ +  R+K
Sbjct: 296 DASG-LQWFSIITTPNPIFSHLAGKTSVWKAISPEVLEASFNATPEMEKLFRSK 348
>Os01g0976200 11-S plant seed storage protein family protein
Length = 377
Score = 115 bits (288), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 97/350 (27%), Positives = 138/350 (39%), Gaps = 35/350 (10%)
Query: 5   DMSPKAGKPLVQNDAGSYLAWSGKDQPTLAGEKLGCGLLVLKPLGFALPHYADSGKFGYV 64
           D+SPK        + GSY  WS  + P L    +G   L L   G ALP Y+DS K  YV
Sbjct: 19  DLSPKRPAKSYGGEGGSYFDWSPSELPMLRAASIGAAKLSLAAGGLALPFYSDSAKVAYV 78

Query: 65  LGGSAVVGVLPVGVDARERVVRLEAADVIAMRAGEVTW---------------------- 102
           L G     VL +     E+++ ++  D +A+  G VTW
Sbjct: 79  LQGKGTCAVL-LPETPSEKILPIKEGDALALPFGVVTWWHNLHAATTELVVLFLGDTSKG 137

Query: 103 -CPGDFSYFILAGPMSVXXXXXXXXXXXXXXXXXPEQAATAFRS-QPAALLTRLSRKLHG 160
              G F+   L G  S                  P+ AA +  S QP A + +L
Sbjct: 138 HTAGRFTNMQLTG--STGIFTGFSTEFVARAWDLPQDAAASLVSTQPGAGIVKLKDGFRM 195

Query: 161 VRPREHDRHGIVVNAARVPPD---STGGKTV--TAAHLPXXXXXXXXXXXXXXXXXXXVR 215
               + DR G+V+N    P D     GG+ V     +LP
Sbjct: 196 PEGCDKDREGMVLNCLEAPLDVDIKNGGRVVVLNTQNLPLVKEVGLGADLVRIDGHSMC- 254

Query: 216 GPWVLRDAAAQAVYVARGSGRVQVASAGGASTLLDAEVAAGSLLVVPRYXXXXXXXXXXX 275
            P    D+A Q  Y+ RGSGRVQV    G + +L+     G L +VPR+
Sbjct: 255 SPGFSCDSAYQVTYIVRGSGRVQVVGIDG-TRVLETRAEGGCLFIVPRFFVVSKIADDTG 313

Query: 276 XMELVSLIKSSRPAMEHFTGKGSVIGGLTPEIVQAALNVSPELVEQLRTK 325
            ME  S+I +  P   H  G+ SV   ++P ++QA+ N +PE+    R+K
Sbjct: 314 -MEWFSIITTPNPIFSHLAGRTSVWKAISPAVLQASFNTTPEMENLFRSK 362
Database: rap3
Posted date: Nov 19, 2010 6:03 PM
Number of letters in database: 17,035,801
Number of sequences in database: 52,214
Lambda     K      H
   0.318    0.134    0.396

Gapped
Lambda     K      H
   0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 52214
Number of Hits to DB: 10,112,758
Number of extensions: 385013
Number of successful extensions: 959
Number of sequences better than 1.0e-10: 5
Number of HSP's gapped: 942
Number of HSP's successfully gapped: 6
Length of query: 325
Length of database: 17,035,801
Length adjustment: 101
Effective length of query: 224
Effective length of database: 11,762,187
Effective search space: 2634729888
Effective search space used: 2634729888
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 156 (64.7 bits)
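
Note on the statistics above: the footer values appear to be related by the standard Karlin-Altschul conversions, and the short Python sketch below re-derives a few of them as a sanity check. It is not part of the BLAST output. The function names are hypothetical, the inputs are the rounded Lambda and K values printed in this report, and the effective-length arithmetic is inferred from the printed figures (325 - 101 = 224), so results should be read as approximate.

```python
import math

# Values copied from the report footer (Lambda and K are printed rounded).
QUERY_LEN = 325
DB_LEN = 17_035_801
NUM_SEQS = 52_214
LENGTH_ADJUSTMENT = 101
LAMBDA_GAPPED, K_GAPPED = 0.267, 0.0410
LAMBDA_UNGAPPED, K_UNGAPPED = 0.318, 0.134


def effective_search_space(query_len, db_len, num_seqs, adjustment):
    """Effective query length times effective database length.

    Assumption: the length adjustment is subtracted once from the query
    and once per database sequence, which reproduces the printed values.
    """
    eff_query = query_len - adjustment
    eff_db = db_len - num_seqs * adjustment
    return eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def dropoff_bits(raw_dropoff, lam):
    """Convert an X drop-off from raw units to bits (no K term)."""
    return lam * raw_dropoff / math.log(2)


if __name__ == "__main__":
    space = effective_search_space(QUERY_LEN, DB_LEN, NUM_SEQS, LENGTH_ADJUSTMENT)
    print("Effective search space:", space)  # 2634729888, as reported
    print("S2 in bits:", round(bit_score(156, LAMBDA_GAPPED, K_GAPPED), 1))
    print("S1 in bits:", round(bit_score(41, LAMBDA_UNGAPPED, K_UNGAPPED), 1))
    print("X1 in bits:", round(dropoff_bits(16, LAMBDA_UNGAPPED), 1))
    print("Top hit in bits:", round(bit_score(1381, LAMBDA_GAPPED, K_GAPPED), 1))
```

Because Lambda and K are rounded in the report, the recomputed bit score for the top hit (raw score 1381) can differ from the printed 536 by a fraction of a bit.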