BLASTP 2.2.23 [Feb-03-2010]


Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment:
Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala,
Aleksandr Morgulis, Alejandro A. Schäffer, and Yi-Kuo Yu (2005) "Protein database
searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= Os02g0612800 Os02g0612800|AK072455
         (755 letters)

Database: rap3 
           52,214 sequences; 17,035,801 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

Os02g0612800  HMG-I and HMG-Y, DNA-binding domain containing...  1217   0.0  
Os04g0501600  HMG-I and HMG-Y, DNA-binding domain containing...   301   1e-81
Os04g0326000  TNP1/EN/SPM-like transposon protein domain con...   206   4e-53
Os06g0286351  Armadillo-like helical domain containing protein    165   1e-40
Os04g0319900  Hypothetical protein                                102   2e-21
>Os02g0612800 HMG-I and HMG-Y, DNA-binding domain containing protein
          Length = 755

 Score = 1217 bits (3148), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 619/716 (86%), Positives = 619/716 (86%)

Query: 1   MAELGELEGKLRDVGEKLQSPPDDVDALLKLIHEAEIYILKVEQAPSESMISAITPAMKA 60
           MAELGELEGKLRDVGEKLQSPPDDVDALLKLIHEAEIYILKVEQAPSESMISAITPAMKA
Sbjct: 1   MAELGELEGKLRDVGEKLQSPPDDVDALLKLIHEAEIYILKVEQAPSESMISAITPAMKA 60

Query: 61  LIKKELLDNSSYEVKLSVVSCISEITRITAPDTPYDDDVMKDVFSIMVGSFEKLDDMENP 120
           LIKKELLDNSSYEVKLSVVSCISEITRITAPDTPYDDDVMKDVFSIMVGSFEKLDDMENP
Sbjct: 61  LIKKELLDNSSYEVKLSVVSCISEITRITAPDTPYDDDVMKDVFSIMVGSFEKLDDMENP 120

Query: 121 LFRRIVAILETVAKVRLCVVMLDLECEDLILQMFHNFFTTVKPNHPENVTNCMTTXXXXX 180
           LFRRIVAILETVAKVRLCVVMLDLECEDLILQMFHNFFTTVKPNHPENVTNCMTT     
Sbjct: 121 LFRRIVAILETVAKVRLCVVMLDLECEDLILQMFHNFFTTVKPNHPENVTNCMTTIMILV 180

Query: 181 XXXXXXXXXXXXXCLLKHAKSELKETSAASFELAEKVIGACSEKLKPVFLQLLKGTSLNE 240
                        CLLKHAKSELKETSAASFELAEKVIGACSEKLKPVFLQLLKGTSLNE
Sbjct: 181 IEEDDEVEIPIAECLLKHAKSELKETSAASFELAEKVIGACSEKLKPVFLQLLKGTSLNE 240

Query: 241 YDNIIATICEDSSDVKEDMDADPSGKDVVDDGKLSERTISDELPQEPAKLEQDVTQTTAI 300
           YDNIIATICEDSSDVKEDMDADPSGKDVVDDGKLSERTISDELPQEPAKLEQDVTQTTAI
Sbjct: 241 YDNIIATICEDSSDVKEDMDADPSGKDVVDDGKLSERTISDELPQEPAKLEQDVTQTTAI 300

Query: 301 GSGATPVDNGTESAAANPKELSNPDSEKKDGVKQSAKVANGASAETSERVDGSPAMVKSK 360
           GSGATPVDNGTESAAANPKELSNPDSEKKDGVKQSAKVANGASAETSERVDGSPAMVKSK
Sbjct: 301 GSGATPVDNGTESAAANPKELSNPDSEKKDGVKQSAKVANGASAETSERVDGSPAMVKSK 360

Query: 361 RGRPPGLKSXXXXXXXXXXXXXXXXXXTTDSTGXXXXXXXXXXXXXXXXXXXGAGXXXXX 420
           RGRPPGLKS                  TTDSTG                   GAG     
Sbjct: 361 RGRPPGLKSLEKKAAGKKVLGLKKVEETTDSTGKLSKQSSKDDSKSSTRKASGAGSSKKQ 420

Query: 421 XXXXXXXXDETDSKEDTAKDLSLKEMVSPKSVSKGSAKTKGSQGQDNNGSKRKRSQEDEQ 480
                   DETDSKEDTAKDLSLKEMVSPKSVSKGSAKTKGSQGQDNNGSKRKRSQEDEQ
Sbjct: 421 QKISLKQKDETDSKEDTAKDLSLKEMVSPKSVSKGSAKTKGSQGQDNNGSKRKRSQEDEQ 480

Query: 481 ETPRSRKNKGLDASLVGARIQVWWPDDKKFYKGIVDSFDTASKRHKIAYDDGDVEVLLLR 540
           ETPRSRKNKGLDASLVGARIQVWWPDDKKFYKGIVDSFDTASKRHKIAYDDGDVEVLLLR
Sbjct: 481 ETPRSRKNKGLDASLVGARIQVWWPDDKKFYKGIVDSFDTASKRHKIAYDDGDVEVLLLR 540

Query: 541 DEKWEFVSEEQDKTPDVASEIXXXXXXXXXXXXXXXVQLKEGNAETPKSGGGDLPKKRGR 600
           DEKWEFVSEEQDKTPDVASEI               VQLKEGNAETPKSGGGDLPKKRGR
Sbjct: 541 DEKWEFVSEEQDKTPDVASEISPKPRGRGRKGRGSSVQLKEGNAETPKSGGGDLPKKRGR 600

Query: 601 PKGSSNGTPKSNIXXXXXXXXXXXXXXDENETPKVGSDLKKEAEEGSEDKATKSTEKTKD 660
           PKGSSNGTPKSNI              DENETPKVGSDLKKEAEEGSEDKATKSTEKTKD
Sbjct: 601 PKGSSNGTPKSNISATSSKSKGKAARKDENETPKVGSDLKKEAEEGSEDKATKSTEKTKD 660

Query: 661 DLPEDGSNKSASKPKEASSGGKDLKGESKPSEGRAKPGRKPKVAGAAVAGEESKAN 716
           DLPEDGSNKSASKPKEASSGGKDLKGESKPSEGRAKPGRKPKVAGAAVAGEESKAN
Sbjct: 661 DLPEDGSNKSASKPKEASSGGKDLKGESKPSEGRAKPGRKPKVAGAAVAGEESKAN 716
>Os04g0501600 HMG-I and HMG-Y, DNA-binding domain containing protein
          Length = 846

 Score =  301 bits (770), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 166/320 (51%), Positives = 213/320 (66%), Gaps = 11/320 (3%)

Query: 6   ELEGKLRDVGEKLQSPPDDVDALLKLIHEAEIYILKVEQAPSESMISAITPAMKALIKKE 65
           E+E +LRD+G +  S PD  D LL+LI EAE ++ +V+Q+P ESM  A+ P M ALIKKE
Sbjct: 9   EVERRLRDIGARFTSLPDADDELLRLIEEAETWLARVDQSPPESMHKALRPTMSALIKKE 68

Query: 66  LLDNSSYEVKLSVVSCISEITRITAPDTPYDDDVMKDVFSIMVGSFEKLDDMENPLFRRI 125
           LLD+S  ++KL+V SC++E+TRITAP+ PYDDDVMKDVF+ +V +FEKLDDME+P + R 
Sbjct: 69  LLDHSVPDIKLAVASCLTEVTRITAPEAPYDDDVMKDVFTRVVEAFEKLDDMESPSYARR 128

Query: 126 VAILETVAKVRLCVVMLDLECEDLILQMFHNFFTTVKPNHPENVTNCMTTXXXXXXXXXX 185
           VA+LETVAKVR CV+MLDL+C+DLI  MFH+FF T+   H ENV   M T          
Sbjct: 129 VAMLETVAKVRSCVLMLDLDCDDLIRDMFHHFFRTISNTHQENVITSMETVMKFVIDESE 188

Query: 186 XXXXXXXXC--------LLKHAKSELKETSAASFELAEKVIGACSEKLKPVFLQLLKGTS 237
                   C        LLK+ K E KET  ASFELAEKVI  C EKLKPVF  LL+GT 
Sbjct: 189 DVQQDMPSCLLQDLASYLLKNLKKEEKETLPASFELAEKVINKCYEKLKPVFTPLLRGTP 248

Query: 238 LNEYDNIIATICEDSSDVKEDMDADPSGKDVVDDGKLSERTISDELPQEPAKLEQDVTQT 297
           L+EY  ++ ++ ED+ D     ++D  GKD+V DGKLS + +SDE  QE +KLEQD    
Sbjct: 249 LDEYSEVVTSLFEDALDAGVADNSDAPGKDMVADGKLSHKIVSDESAQESSKLEQDA--- 305

Query: 298 TAIGSGATPVDNGTESAAAN 317
              G   TP +N + SA +N
Sbjct: 306 NCPGKDGTPPNNTSTSAVSN 325

 Score =  206 bits (525), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 161/396 (40%), Positives = 201/396 (50%), Gaps = 36/396 (9%)

Query: 338 VANGASAETSERVDGSPAMVKSKRGRPPGLKSXXXXXXXXXXXX---XXXXXXTTDSTGX 394
             N A+ + S+ VD +PA  K KRGRPP  KS                      +DS G 
Sbjct: 432 TGNRAADDKSKHVDNTPAG-KGKRGRPPASKSHEKKNVGKGKVSGLESKKADAVSDSGGR 490

Query: 395 XXXXXXXXXXXXXXXXXXGAGXXXXXXXXXXXXXDE-TDSKEDTAKDLSLKEMVSPKSVS 453
                             G G              E T   EDT +DLSLK++VSPKS +
Sbjct: 491 ATRRLAKDDDIKSSFKKTGEGESSKKKQKENLKQQEDTPPDEDTDEDLSLKDIVSPKSSA 550

Query: 454 KGSAKTKGSQGQDNNGSKRKRSQEDEQETPRSRKNKGLDASLVGARIQVWWPDDKKFYKG 513
           K + K KG  G D+ GSKRKR+QE E ETP+ +KNK L  +LVG+RI+VWWPDD+KFYKG
Sbjct: 551 K-TGKNKGQAG-DSGGSKRKRAQEAE-ETPQPKKNKILKGNLVGSRIKVWWPDDRKFYKG 607

Query: 514 IVDSFDTASKRHKIAYDDGDVEVLLLRDEKWEFVSEEQDKTPDVASEIXXXXXXXXXXXX 573
           +V+SFD ASK+HK+ YDDGDVE L L++EKWEF+ E +D  PD +S++            
Sbjct: 608 VVESFDVASKKHKVVYDDGDVERLHLKNEKWEFIDEGRDNNPDASSDMPHGRRGRVSLGE 667

Query: 574 XXXVQLKEGNAETPKSGG------GDLPKKRGRPKGSSNGTPKSNIXXXXXXXXXXXXXX 627
               Q KEG  ETP SG        D PKKRGRPKG  +     N               
Sbjct: 668 ----QTKEGKIETPSSGKHRGTDVADPPKKRGRPKGVRSSNSSQNDDSPLKGKSAENDDE 723

Query: 628 DENETPKVGSDLKKEAEEGSEDKATKSTEKTKDDL------PEDGSNKSASKPK-EASSG 680
           D ++TPK GS LK E       ++++ST KTKD L       E G+ KSASK K +  S 
Sbjct: 724 DISKTPKSGSALKNEG-----GRSSRSTGKTKDGLLKGSNKDETGNTKSASKSKNDGGSK 778

Query: 681 GKDLKGESKPSEGRAKPGRKPKVAGAAVAGEESKAN 716
            KD K E+K S      G  PK A    A + SK N
Sbjct: 779 HKDSKDEAKSS------GSNPKGASTPKAADGSKTN 808
>Os04g0326000 TNP1/EN/SPM-like transposon protein domain containing protein
          Length = 649

 Score =  206 bits (525), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 116/251 (46%), Positives = 163/251 (64%), Gaps = 5/251 (1%)

Query: 6   ELEGKLRDVGEKLQSPPDDVDALLKLIHEAEIYILKVEQAPSESMISAITPAMKALIKKE 65
           E+  +LRDVG +L S PDD + LL+L+ EA   + +V Q   + + SA+ P M+ALIKKE
Sbjct: 9   EVRRRLRDVGARLSSLPDDGE-LLRLLQEAAKLLYRVNQCEVDRIHSALIPVMRALIKKE 67

Query: 66  LLDNSSYEVKLSVVSCISEITRITAPDTPYDDDVMKDVFSIMVGSFEKLDDMENPLFRRI 125
           LLD++   VKL+VVSC++ + +I APD PYDDDVMKDV  ++VG F +LDD++ P +   
Sbjct: 68  LLDHTDPGVKLAVVSCLTTLIKIRAPDPPYDDDVMKDVLKLVVGVFCELDDVDCPSYGTR 127

Query: 126 VAILETVAKVRLCVVMLDLECEDLILQMFHNFFTTVKPNHPENVTNCMTTXXXXXXX--- 182
           V++L T A++R C ++LDL+C DLI  MFH+FF TV   H E+V + M T          
Sbjct: 128 VSMLGTFARIRGCALLLDLDCNDLIRDMFHHFFRTVSNTHQEHVISYMETIMKFVIEDIT 187

Query: 183 -XXXXXXXXXXXCLLKHAKSELKETSAASFELAEKVIGACSEKLKPVFLQLLKGTSLNEY 241
                       CLL++ K E KET  ASF LAE+VIG C EKLKPVF++LL+G  + EY
Sbjct: 188 DMEQDLIKDLASCLLQNVKKEEKETPPASFVLAERVIGLCHEKLKPVFIKLLQGAPITEY 247

Query: 242 DNIIATICEDS 252
            N++ +  +D+
Sbjct: 248 SNLVTSFLQDA 258

 Score =  103 bits (258), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 57/119 (47%), Positives = 77/119 (64%), Gaps = 3/119 (2%)

Query: 443 LKEMVSPKSVSKGSAKTKGSQGQDNNGSKRKRSQEDEQETPRSRKNKGLDASLVGARIQV 502
           +KE+VSPKS S    KT G Q  D+    +    +  +E P S K K LD S+VG+RI+V
Sbjct: 273 MKEVVSPKS-STMMGKTIG-QPADSGDELKPEIVQGTKEAPNSNK-KALDGSIVGSRIKV 329

Query: 503 WWPDDKKFYKGIVDSFDTASKRHKIAYDDGDVEVLLLRDEKWEFVSEEQDKTPDVASEI 561
            WP D+ FY G+V SFD +S+ H+I YD GDV    L+DEKWEF++EEQD  PD + ++
Sbjct: 330 RWPADEMFYNGLVKSFDASSETHEIVYDHGDVVRQSLKDEKWEFIAEEQDYNPDASPDM 388
>Os06g0286351 Armadillo-like helical domain containing protein
          Length = 1561

 Score =  165 bits (417), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 94/243 (38%), Positives = 136/243 (55%), Gaps = 7/243 (2%)

Query: 4   LGELEGKLRDVGEKLQS-PPDDVDALLKLIHEAEIYILKVEQAPSESMISAITPAMKALI 62
           +G  E +L+++GEKL++ PPD  D L KL+ +A   +  VEQ+P  S++  I P +KA+ 
Sbjct: 1   MGAAEEQLKELGEKLEAAPPDPADDLAKLLEQAAECLHGVEQSPGPSVMETIQPCLKAVA 60

Query: 63  KKELLDNSSYEVKLSVVSCISEITRITAPDTPYDDDVMKDVFSIMVGSFEKLDDMENPLF 122
           + E L +   +VK+ + +C  EITRITAP+ PY DDV++D+F ++V +F  L+D+    F
Sbjct: 61  RDEFLKHHDEDVKVLLATCFCEITRITAPEAPYSDDVLRDMFHLIVDTFSGLNDVNGKSF 120

Query: 123 RRIVAILETVAKVRLCVVMLDLECEDLILQMFHNFFTTVKPNHPENVTNCMTTXXXXXXX 182
            R VAILETVA+ R CVVMLDLEC DLI  MF +F   +  NH  N+ N M +       
Sbjct: 121 GRRVAILETVARYRACVVMLDLECNDLIADMFRSFLEIISDNHEPNIVNSMQSVMALIID 180

Query: 183 XXXXXXXXXXXCLLKHAKSELKETSAASFELAEKVIGACSEKLKPVFLQLLK------GT 236
                       LL     +    S  + +LA  VI   + KL+P   ++L       GT
Sbjct: 181 ESEDIEESLLNVLLSTLGRKKTGVSLPARKLARHVIEHSAGKLEPYIRKILTSSLDGDGT 240

Query: 237 SLN 239
           S N
Sbjct: 241 STN 243

 Score = 89.7 bits (221), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 35/54 (64%), Positives = 45/54 (83%)

Query: 495  LVGARIQVWWPDDKKFYKGIVDSFDTASKRHKIAYDDGDVEVLLLRDEKWEFVS 548
            L+G RI+VWWP DKKFY+G+V+SFD++ +RH + YDDGDVEVL L  EKWE V+
Sbjct: 1353 LIGKRIKVWWPLDKKFYEGVVESFDSSKRRHTVLYDDGDVEVLNLAKEKWEIVA 1406
>Os04g0319900 Hypothetical protein
          Length = 172

 Score =  102 bits (253), Expect = 2e-21,   Method: Composition-based stats.
 Identities = 59/131 (45%), Positives = 82/131 (62%), Gaps = 8/131 (6%)

Query: 6   ELEGKLRDVGEKLQSPPDDVDALLKLIHEAEIYILKVEQAPSESMISAITPAMKALIKKE 65
           E+  +LRDVG +L S PDD + LL+L+ EA   + +V Q   + + SA+ P M+ALIKKE
Sbjct: 9   EVRRRLRDVGARLSSLPDDGE-LLRLLQEAAKLLYRVNQCEVDRIHSALIPVMRALIKKE 67

Query: 66  LLDNSSYEVKLSVVSCISEITRITAPDTPYDDDVMKDVFSIMVGS------FEKLDDME- 118
           LLD++   VKL+V SC++ + +I APD PYDDDVMK  FS    S      FE + D+  
Sbjct: 68  LLDHTDPGVKLAVASCLTTLIKIRAPDPPYDDDVMKVTFSSSCSSGDEMVVFECITDLYL 127

Query: 119 NPLFRRIVAIL 129
            P FR   ++L
Sbjct: 128 LPCFRMFSSLL 138
  Database: rap3
    Posted date:  Nov 19, 2010  6:03 PM
  Number of letters in database: 17,035,801
  Number of sequences in database:  52,214
  
Lambda     K      H
   0.305    0.126    0.343 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 52214
Number of Hits to DB: 21,998,752
Number of extensions: 901348
Number of successful extensions: 3427
Number of sequences better than 1.0e-10: 5
Number of HSP's gapped: 3404
Number of HSP's successfully gapped: 8
Length of query: 755
Length of database: 17,035,801
Length adjustment: 108
Effective length of query: 647
Effective length of database: 11,396,689
Effective search space: 7373657783
Effective search space used: 7373657783
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (21.9 bits)
S2: 160 (66.2 bits)
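
Note on the statistics above: the effective lengths and the bit scores in this report can be re-derived from the other footer values. The sketch below is a minimal check, not BLAST's own code. It assumes the usual Karlin-Altschul conversion, bits = (lambda * raw - ln K) / ln 2, and that the length adjustment is subtracted once from the query and once per database sequence. The function names are hypothetical; the constants are copied from this report.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LEN = 755
DB_LETTERS = 17_035_801
DB_SEQS = 52_214
LENGTH_ADJUSTMENT = 108
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def effective_search_space(query_len, db_letters, db_seqs, adjustment):
    """Return (effective query length, effective db length, search space).

    Assumption: the adjustment is removed once from the query and once
    from every database sequence.
    """
    eff_query = query_len - adjustment
    eff_db = db_letters - db_seqs * adjustment
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits using lambda and K."""
    return (lam * raw_score - math.log(k)) / math.log(2)


if __name__ == "__main__":
    print(effective_search_space(QUERY_LEN, DB_LETTERS, DB_SEQS, LENGTH_ADJUSTMENT))
    # Raw scores taken from the "Score = ... bits (raw)" lines and from S2.
    for raw in (3148, 770, 160):
        print(raw, round(bit_score(raw, GAPPED_LAMBDA, GAPPED_K), 1))
```

With the footer values this reproduces the reported effective query length (647), effective database length (11,396,689) and effective search space (7373657783), and the gapped lambda and K give bit scores that round to the reported 1217, 301 and 66.2. E-values are left out on purpose: compositional matrix adjustment rescales scores per alignment, so a plain recomputation may not match the printed values exactly.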