BLASTP 2.2.23 [Feb-03-2010]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment:
Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala,
Aleksandr Morgulis, Alejandro A. Schäffer, and Yi-Kuo Yu (2005) "Protein database
searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Os02g0612800 Os02g0612800|AK072455
(755 letters)
Database: rap3
52,214 sequences; 17,035,801 total letters
Searching..................................................done
                                                                 Score    E
Sequences producing significant alignments:                     (bits)  Value
Os02g0612800 HMG-I and HMG-Y, DNA-binding domain containing...    1217  0.0
Os04g0501600 HMG-I and HMG-Y, DNA-binding domain containing...     301  1e-81
Os04g0326000 TNP1/EN/SPM-like transposon protein domain con...     206  4e-53
Os06g0286351 Armadillo-like helical domain containing protein      165  1e-40
Os04g0319900 Hypothetical protein                                  102  2e-21
>Os02g0612800 HMG-I and HMG-Y, DNA-binding domain containing protein
Length = 755
Score = 1217 bits (3148), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 619/716 (86%), Positives = 619/716 (86%)
Query: 1 MAELGELEGKLRDVGEKLQSPPDDVDALLKLIHEAEIYILKVEQAPSESMISAITPAMKA 60
MAELGELEGKLRDVGEKLQSPPDDVDALLKLIHEAEIYILKVEQAPSESMISAITPAMKA
Sbjct: 1 MAELGELEGKLRDVGEKLQSPPDDVDALLKLIHEAEIYILKVEQAPSESMISAITPAMKA 60
Query: 61 LIKKELLDNSSYEVKLSVVSCISEITRITAPDTPYDDDVMKDVFSIMVGSFEKLDDMENP 120
LIKKELLDNSSYEVKLSVVSCISEITRITAPDTPYDDDVMKDVFSIMVGSFEKLDDMENP
Sbjct: 61 LIKKELLDNSSYEVKLSVVSCISEITRITAPDTPYDDDVMKDVFSIMVGSFEKLDDMENP 120
Query: 121 LFRRIVAILETVAKVRLCVVMLDLECEDLILQMFHNFFTTVKPNHPENVTNCMTTXXXXX 180
LFRRIVAILETVAKVRLCVVMLDLECEDLILQMFHNFFTTVKPNHPENVTNCMTT
Sbjct: 121 LFRRIVAILETVAKVRLCVVMLDLECEDLILQMFHNFFTTVKPNHPENVTNCMTTIMILV 180
Query: 181 XXXXXXXXXXXXXCLLKHAKSELKETSAASFELAEKVIGACSEKLKPVFLQLLKGTSLNE 240
CLLKHAKSELKETSAASFELAEKVIGACSEKLKPVFLQLLKGTSLNE
Sbjct: 181 IEEDDEVEIPIAECLLKHAKSELKETSAASFELAEKVIGACSEKLKPVFLQLLKGTSLNE 240
Query: 241 YDNIIATICEDSSDVKEDMDADPSGKDVVDDGKLSERTISDELPQEPAKLEQDVTQTTAI 300
YDNIIATICEDSSDVKEDMDADPSGKDVVDDGKLSERTISDELPQEPAKLEQDVTQTTAI
Sbjct: 241 YDNIIATICEDSSDVKEDMDADPSGKDVVDDGKLSERTISDELPQEPAKLEQDVTQTTAI 300
Query: 301 GSGATPVDNGTESAAANPKELSNPDSEKKDGVKQSAKVANGASAETSERVDGSPAMVKSK 360
GSGATPVDNGTESAAANPKELSNPDSEKKDGVKQSAKVANGASAETSERVDGSPAMVKSK
Sbjct: 301 GSGATPVDNGTESAAANPKELSNPDSEKKDGVKQSAKVANGASAETSERVDGSPAMVKSK 360
Query: 361 RGRPPGLKSXXXXXXXXXXXXXXXXXXTTDSTGXXXXXXXXXXXXXXXXXXXGAGXXXXX 420
RGRPPGLKS TTDSTG GAG
Sbjct: 361 RGRPPGLKSLEKKAAGKKVLGLKKVEETTDSTGKLSKQSSKDDSKSSTRKASGAGSSKKQ 420
Query: 421 XXXXXXXXDETDSKEDTAKDLSLKEMVSPKSVSKGSAKTKGSQGQDNNGSKRKRSQEDEQ 480
DETDSKEDTAKDLSLKEMVSPKSVSKGSAKTKGSQGQDNNGSKRKRSQEDEQ
Sbjct: 421 QKISLKQKDETDSKEDTAKDLSLKEMVSPKSVSKGSAKTKGSQGQDNNGSKRKRSQEDEQ 480
Query: 481 ETPRSRKNKGLDASLVGARIQVWWPDDKKFYKGIVDSFDTASKRHKIAYDDGDVEVLLLR 540
ETPRSRKNKGLDASLVGARIQVWWPDDKKFYKGIVDSFDTASKRHKIAYDDGDVEVLLLR
Sbjct: 481 ETPRSRKNKGLDASLVGARIQVWWPDDKKFYKGIVDSFDTASKRHKIAYDDGDVEVLLLR 540
Query: 541 DEKWEFVSEEQDKTPDVASEIXXXXXXXXXXXXXXXVQLKEGNAETPKSGGGDLPKKRGR 600
DEKWEFVSEEQDKTPDVASEI VQLKEGNAETPKSGGGDLPKKRGR
Sbjct: 541 DEKWEFVSEEQDKTPDVASEISPKPRGRGRKGRGSSVQLKEGNAETPKSGGGDLPKKRGR 600
Query: 601 PKGSSNGTPKSNIXXXXXXXXXXXXXXDENETPKVGSDLKKEAEEGSEDKATKSTEKTKD 660
PKGSSNGTPKSNI DENETPKVGSDLKKEAEEGSEDKATKSTEKTKD
Sbjct: 601 PKGSSNGTPKSNISATSSKSKGKAARKDENETPKVGSDLKKEAEEGSEDKATKSTEKTKD 660
Query: 661 DLPEDGSNKSASKPKEASSGGKDLKGESKPSEGRAKPGRKPKVAGAAVAGEESKAN 716
DLPEDGSNKSASKPKEASSGGKDLKGESKPSEGRAKPGRKPKVAGAAVAGEESKAN
Sbjct: 661 DLPEDGSNKSASKPKEASSGGKDLKGESKPSEGRAKPGRKPKVAGAAVAGEESKAN 716
>Os04g0501600 HMG-I and HMG-Y, DNA-binding domain containing protein
Length = 846
Score = 301 bits (770), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 166/320 (51%), Positives = 213/320 (66%), Gaps = 11/320 (3%)
Query: 6 ELEGKLRDVGEKLQSPPDDVDALLKLIHEAEIYILKVEQAPSESMISAITPAMKALIKKE 65
E+E +LRD+G + S PD D LL+LI EAE ++ +V+Q+P ESM A+ P M ALIKKE
Sbjct: 9 EVERRLRDIGARFTSLPDADDELLRLIEEAETWLARVDQSPPESMHKALRPTMSALIKKE 68
Query: 66 LLDNSSYEVKLSVVSCISEITRITAPDTPYDDDVMKDVFSIMVGSFEKLDDMENPLFRRI 125
LLD+S ++KL+V SC++E+TRITAP+ PYDDDVMKDVF+ +V +FEKLDDME+P + R
Sbjct: 69 LLDHSVPDIKLAVASCLTEVTRITAPEAPYDDDVMKDVFTRVVEAFEKLDDMESPSYARR 128
Query: 126 VAILETVAKVRLCVVMLDLECEDLILQMFHNFFTTVKPNHPENVTNCMTTXXXXXXXXXX 185
VA+LETVAKVR CV+MLDL+C+DLI MFH+FF T+ H ENV M T
Sbjct: 129 VAMLETVAKVRSCVLMLDLDCDDLIRDMFHHFFRTISNTHQENVITSMETVMKFVIDESE 188
Query: 186 XXXXXXXXC--------LLKHAKSELKETSAASFELAEKVIGACSEKLKPVFLQLLKGTS 237
C LLK+ K E KET ASFELAEKVI C EKLKPVF LL+GT
Sbjct: 189 DVQQDMPSCLLQDLASYLLKNLKKEEKETLPASFELAEKVINKCYEKLKPVFTPLLRGTP 248
Query: 238 LNEYDNIIATICEDSSDVKEDMDADPSGKDVVDDGKLSERTISDELPQEPAKLEQDVTQT 297
L+EY ++ ++ ED+ D ++D GKD+V DGKLS + +SDE QE +KLEQD
Sbjct: 249 LDEYSEVVTSLFEDALDAGVADNSDAPGKDMVADGKLSHKIVSDESAQESSKLEQDA--- 305
Query: 298 TAIGSGATPVDNGTESAAAN 317
G TP +N + SA +N
Sbjct: 306 NCPGKDGTPPNNTSTSAVSN 325
Score = 206 bits (525), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 161/396 (40%), Positives = 201/396 (50%), Gaps = 36/396 (9%)
Query: 338 VANGASAETSERVDGSPAMVKSKRGRPPGLKSXXXXXXXXXXXX---XXXXXXTTDSTGX 394
N A+ + S+ VD +PA K KRGRPP KS +DS G
Sbjct: 432 TGNRAADDKSKHVDNTPAG-KGKRGRPPASKSHEKKNVGKGKVSGLESKKADAVSDSGGR 490
Query: 395 XXXXXXXXXXXXXXXXXXGAGXXXXXXXXXXXXXDE-TDSKEDTAKDLSLKEMVSPKSVS 453
G G E T EDT +DLSLK++VSPKS +
Sbjct: 491 ATRRLAKDDDIKSSFKKTGEGESSKKKQKENLKQQEDTPPDEDTDEDLSLKDIVSPKSSA 550
Query: 454 KGSAKTKGSQGQDNNGSKRKRSQEDEQETPRSRKNKGLDASLVGARIQVWWPDDKKFYKG 513
K + K KG G D+ GSKRKR+QE E ETP+ +KNK L +LVG+RI+VWWPDD+KFYKG
Sbjct: 551 K-TGKNKGQAG-DSGGSKRKRAQEAE-ETPQPKKNKILKGNLVGSRIKVWWPDDRKFYKG 607
Query: 514 IVDSFDTASKRHKIAYDDGDVEVLLLRDEKWEFVSEEQDKTPDVASEIXXXXXXXXXXXX 573
+V+SFD ASK+HK+ YDDGDVE L L++EKWEF+ E +D PD +S++
Sbjct: 608 VVESFDVASKKHKVVYDDGDVERLHLKNEKWEFIDEGRDNNPDASSDMPHGRRGRVSLGE 667
Query: 574 XXXVQLKEGNAETPKSGG------GDLPKKRGRPKGSSNGTPKSNIXXXXXXXXXXXXXX 627
Q KEG ETP SG D PKKRGRPKG + N
Sbjct: 668 ----QTKEGKIETPSSGKHRGTDVADPPKKRGRPKGVRSSNSSQNDDSPLKGKSAENDDE 723
Query: 628 DENETPKVGSDLKKEAEEGSEDKATKSTEKTKDDL------PEDGSNKSASKPK-EASSG 680
D ++TPK GS LK E ++++ST KTKD L E G+ KSASK K + S
Sbjct: 724 DISKTPKSGSALKNEG-----GRSSRSTGKTKDGLLKGSNKDETGNTKSASKSKNDGGSK 778
Query: 681 GKDLKGESKPSEGRAKPGRKPKVAGAAVAGEESKAN 716
KD K E+K S G PK A A + SK N
Sbjct: 779 HKDSKDEAKSS------GSNPKGASTPKAADGSKTN 808
>Os04g0326000 TNP1/EN/SPM-like transposon protein domain containing protein
Length = 649
Score = 206 bits (525), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 116/251 (46%), Positives = 163/251 (64%), Gaps = 5/251 (1%)
Query: 6 ELEGKLRDVGEKLQSPPDDVDALLKLIHEAEIYILKVEQAPSESMISAITPAMKALIKKE 65
E+ +LRDVG +L S PDD + LL+L+ EA + +V Q + + SA+ P M+ALIKKE
Sbjct: 9 EVRRRLRDVGARLSSLPDDGE-LLRLLQEAAKLLYRVNQCEVDRIHSALIPVMRALIKKE 67
Query: 66 LLDNSSYEVKLSVVSCISEITRITAPDTPYDDDVMKDVFSIMVGSFEKLDDMENPLFRRI 125
LLD++ VKL+VVSC++ + +I APD PYDDDVMKDV ++VG F +LDD++ P +
Sbjct: 68 LLDHTDPGVKLAVVSCLTTLIKIRAPDPPYDDDVMKDVLKLVVGVFCELDDVDCPSYGTR 127
Query: 126 VAILETVAKVRLCVVMLDLECEDLILQMFHNFFTTVKPNHPENVTNCMTTXXXXXXX--- 182
V++L T A++R C ++LDL+C DLI MFH+FF TV H E+V + M T
Sbjct: 128 VSMLGTFARIRGCALLLDLDCNDLIRDMFHHFFRTVSNTHQEHVISYMETIMKFVIEDIT 187
Query: 183 -XXXXXXXXXXXCLLKHAKSELKETSAASFELAEKVIGACSEKLKPVFLQLLKGTSLNEY 241
CLL++ K E KET ASF LAE+VIG C EKLKPVF++LL+G + EY
Sbjct: 188 DMEQDLIKDLASCLLQNVKKEEKETPPASFVLAERVIGLCHEKLKPVFIKLLQGAPITEY 247
Query: 242 DNIIATICEDS 252
N++ + +D+
Sbjct: 248 SNLVTSFLQDA 258
Score = 103 bits (258), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 57/119 (47%), Positives = 77/119 (64%), Gaps = 3/119 (2%)
Query: 443 LKEMVSPKSVSKGSAKTKGSQGQDNNGSKRKRSQEDEQETPRSRKNKGLDASLVGARIQV 502
+KE+VSPKS S KT G Q D+ + + +E P S K K LD S+VG+RI+V
Sbjct: 273 MKEVVSPKS-STMMGKTIG-QPADSGDELKPEIVQGTKEAPNSNK-KALDGSIVGSRIKV 329
Query: 503 WWPDDKKFYKGIVDSFDTASKRHKIAYDDGDVEVLLLRDEKWEFVSEEQDKTPDVASEI 561
WP D+ FY G+V SFD +S+ H+I YD GDV L+DEKWEF++EEQD PD + ++
Sbjct: 330 RWPADEMFYNGLVKSFDASSETHEIVYDHGDVVRQSLKDEKWEFIAEEQDYNPDASPDM 388
>Os06g0286351 Armadillo-like helical domain containing protein
Length = 1561
Score = 165 bits (417), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 94/243 (38%), Positives = 136/243 (55%), Gaps = 7/243 (2%)
Query: 4 LGELEGKLRDVGEKLQS-PPDDVDALLKLIHEAEIYILKVEQAPSESMISAITPAMKALI 62
+G E +L+++GEKL++ PPD D L KL+ +A + VEQ+P S++ I P +KA+
Sbjct: 1 MGAAEEQLKELGEKLEAAPPDPADDLAKLLEQAAECLHGVEQSPGPSVMETIQPCLKAVA 60
Query: 63 KKELLDNSSYEVKLSVVSCISEITRITAPDTPYDDDVMKDVFSIMVGSFEKLDDMENPLF 122
+ E L + +VK+ + +C EITRITAP+ PY DDV++D+F ++V +F L+D+ F
Sbjct: 61 RDEFLKHHDEDVKVLLATCFCEITRITAPEAPYSDDVLRDMFHLIVDTFSGLNDVNGKSF 120
Query: 123 RRIVAILETVAKVRLCVVMLDLECEDLILQMFHNFFTTVKPNHPENVTNCMTTXXXXXXX 182
R VAILETVA+ R CVVMLDLEC DLI MF +F + NH N+ N M +
Sbjct: 121 GRRVAILETVARYRACVVMLDLECNDLIADMFRSFLEIISDNHEPNIVNSMQSVMALIID 180
Query: 183 XXXXXXXXXXXCLLKHAKSELKETSAASFELAEKVIGACSEKLKPVFLQLLK------GT 236
LL + S + +LA VI + KL+P ++L GT
Sbjct: 181 ESEDIEESLLNVLLSTLGRKKTGVSLPARKLARHVIEHSAGKLEPYIRKILTSSLDGDGT 240
Query: 237 SLN 239
S N
Sbjct: 241 STN 243
Score = 89.7 bits (221), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 35/54 (64%), Positives = 45/54 (83%)
Query: 495 LVGARIQVWWPDDKKFYKGIVDSFDTASKRHKIAYDDGDVEVLLLRDEKWEFVS 548
L+G RI+VWWP DKKFY+G+V+SFD++ +RH + YDDGDVEVL L EKWE V+
Sbjct: 1353 LIGKRIKVWWPLDKKFYEGVVESFDSSKRRHTVLYDDGDVEVLNLAKEKWEIVA 1406
>Os04g0319900 Hypothetical protein
Length = 172
Score = 102 bits (253), Expect = 2e-21, Method: Composition-based stats.
Identities = 59/131 (45%), Positives = 82/131 (62%), Gaps = 8/131 (6%)
Query: 6 ELEGKLRDVGEKLQSPPDDVDALLKLIHEAEIYILKVEQAPSESMISAITPAMKALIKKE 65
E+ +LRDVG +L S PDD + LL+L+ EA + +V Q + + SA+ P M+ALIKKE
Sbjct: 9 EVRRRLRDVGARLSSLPDDGE-LLRLLQEAAKLLYRVNQCEVDRIHSALIPVMRALIKKE 67
Query: 66 LLDNSSYEVKLSVVSCISEITRITAPDTPYDDDVMKDVFSIMVGS------FEKLDDME- 118
LLD++ VKL+V SC++ + +I APD PYDDDVMK FS S FE + D+
Sbjct: 68 LLDHTDPGVKLAVASCLTTLIKIRAPDPPYDDDVMKVTFSSSCSSGDEMVVFECITDLYL 127
Query: 119 NPLFRRIVAIL 129
P FR ++L
Sbjct: 128 LPCFRMFSSLL 138
Database: rap3
Posted date: Nov 19, 2010 6:03 PM
Number of letters in database: 17,035,801
Number of sequences in database: 52,214
Lambda      K        H
   0.305    0.126    0.343
Gapped
Lambda      K        H
   0.267   0.0410    0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 52214
Number of Hits to DB: 21,998,752
Number of extensions: 901348
Number of successful extensions: 3427
Number of sequences better than 1.0e-10: 5
Number of HSP's gapped: 3404
Number of HSP's successfully gapped: 8
Length of query: 755
Length of database: 17,035,801
Length adjustment: 108
Effective length of query: 647
Effective length of database: 11,396,689
Effective search space: 7373657783
Effective search space used: 7373657783
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (21.9 bits)
S2: 160 (66.2 bits)
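The statistics footer above can be cross-checked against the reported scores. The sketch below is not part of the BLAST output. It assumes the standard Karlin-Altschul relations: bit score = (Lambda * raw - ln K) / ln 2, and E = effective search space * 2^(-bit score). The function names are hypothetical, and the constants are copied from this report. Because Lambda and K are printed rounded, the results only approximate the values BLAST computed internally. For the hits listed here, the integer part of the computed bit score agrees with the reported one.

```python
import math

# Values copied from the statistics footer of this report (gapped parameters).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410
QUERY_LENGTH = 755
DB_LETTERS = 17_035_801
DB_SEQUENCES = 52_214
LENGTH_ADJUSTMENT = 108


def effective_search_space(query_len, db_letters, db_seqs, adjustment):
    """Product of the effective query length and effective database length."""
    effective_query = query_len - adjustment
    effective_db = db_letters - db_seqs * adjustment
    return effective_query * effective_db


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits (Karlin-Altschul normalization)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance alignments scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    space = effective_search_space(
        QUERY_LENGTH, DB_LETTERS, DB_SEQUENCES, LENGTH_ADJUSTMENT
    )
    print("Effective search space:", space)
    # Raw scores are the numbers in parentheses on the "Score =" lines.
    for raw in (3148, 770, 525, 417):
        bits = bit_score(raw)
        print(f"raw={raw}  bits={bits:.1f}  E={expect_value(bits, space):.1e}")
```

The E-values obtained this way should land within a small factor of the reported ones (for example, raw score 770 against the reported 1e-81); exact agreement is not expected given the rounded parameters.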