BLASTP 2.2.23 [Feb-03-2010]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment:
Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala,
Aleksandr Morgulis, Alejandro A. Schäffer, and Yi-Kuo Yu (2005) "Protein database
searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Os01g0634300 Os01g0634300|Os01g0634300
(1474 letters)
Database: rap3
52,214 sequences; 17,035,801 total letters
Searching..................................................done
                                                                 Score   E
Sequences producing significant alignments:                     (bits)   Value
Os01g0634300 Heat shock protein DnaJ, N-terminal domain con...    2480   0.0
Os05g0579900 Heat shock protein DnaJ, N-terminal domain con...     297   4e-80
Os01g0355500 TolA family protein                                   218   3e-56
Os12g0548200 Heat shock protein DnaJ, N-terminal domain con...     213   6e-55
Os03g0198300 Heat shock protein DnaJ, N-terminal domain con...     142   3e-33
>Os01g0634300 Heat shock protein DnaJ, N-terminal domain containing protein
Length = 1474
Score = 2480 bits (6428), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 1229/1474 (83%), Positives = 1229/1474 (83%)
Query: 1 MEELXXXXXXXXXXXXXXXXXKASSTDAAAAYGDVFGGPPQFAVAFDGVPADYGEVFGGV 60
MEEL KASSTDAAAAYGDVFGGPPQFAVAFDGVPADYGEVFGGV
Sbjct: 1 MEELAAAAAPPRHRERRRHRRKASSTDAAAAYGDVFGGPPQFAVAFDGVPADYGEVFGGV 60
Query: 61 AASCSIXXXXXXXXXXXXXXXXXXXXXEIFGRFDFGDFAEPYEDLXXXXXXXXXXXXXXX 120
AASCSI EIFGRFDFGDFAEPYEDL
Sbjct: 61 AASCSIPYLDLPPAAARDDGAGAGAYGEIFGRFDFGDFAEPYEDLLAEAVALAAEIASSS 120
Query: 121 XXXXXXXXXXXGQLDADPSILHQHYSTVGYDQHFDEDEFSPISSPPDSGKQFSMSYNKAT 180
GQLDADPSILHQHYSTVGYDQHFDEDEFSPISSPPDSGKQFSMSYNKAT
Sbjct: 121 ESSRSSVRKESGQLDADPSILHQHYSTVGYDQHFDEDEFSPISSPPDSGKQFSMSYNKAT 180
Query: 181 RGRPDDIVKMTTCMVEPPISYVVDSRNISNKSAMDQVVVVDCDTFANGEKGSMGLTFPXX 240
RGRPDDIVKMTTCMVEPPISYVVDSRNISNKSAMDQVVVVDCDTFANGEKGSMGLTFP
Sbjct: 181 RGRPDDIVKMTTCMVEPPISYVVDSRNISNKSAMDQVVVVDCDTFANGEKGSMGLTFPSS 240
Query: 241 XXXXXXXXXXVADQNLHTPICHPISKNDCEDEDYHKRLSTHSASSEEVPSPDYPFLRVSN 300
VADQNLHTPICHPISKNDCEDEDYHKRLSTHSASSEEVPSPDYPFLRVSN
Sbjct: 241 SSLKSASSDSVADQNLHTPICHPISKNDCEDEDYHKRLSTHSASSEEVPSPDYPFLRVSN 300
Query: 301 NSLHTQPIKVQPPLLAPSKLLNKKESKANGEKGSTGLTFPXXXXXXXXXXDPMADQNLHT 360
NSLHTQPIKVQPPLLAPSKLLNKKESKANGEKGSTGLTFP DPMADQNLHT
Sbjct: 301 NSLHTQPIKVQPPLLAPSKLLNKKESKANGEKGSTGLTFPSSSSVKSASSDPMADQNLHT 360
Query: 361 PTCHPISKTDCEDEDYHKRLSTHSASSEDVPSPDYPFLRVPNNSLHTQPIKVQPPSKLLN 420
PTCHPISKTDCEDEDYHKRLSTHSASSEDVPSPDYPFLRVPNNSLHTQPIKVQPPSKLLN
Sbjct: 361 PTCHPISKTDCEDEDYHKRLSTHSASSEDVPSPDYPFLRVPNNSLHTQPIKVQPPSKLLN 420
Query: 421 KKESKANGDSEVSTNSXXXXXXXXXXXXXXXXRLKAAKELMERKGDSFKLRKKPGHHRGT 480
KKESKANGDSEVSTNS RLKAAKELMERKGDSFKLRKKPGHHRGT
Sbjct: 421 KKESKANGDSEVSTNSAAAAAAIKEAMEFAEARLKAAKELMERKGDSFKLRKKPGHHRGT 480
Query: 481 KSTELKESMAPEEVRVYDEKLTMRRIVKEEKTYEETALVNKNGDSSAVNLTHCDHNEKGV 540
KSTELKESMAPEEVRVYDEKLTMRRIVKEEKTYEETALVNKNGDSSAVNLTHCDHNEKGV
Sbjct: 481 KSTELKESMAPEEVRVYDEKLTMRRIVKEEKTYEETALVNKNGDSSAVNLTHCDHNEKGV 540
Query: 541 LQPRKPQHTAQSGSKLEQLGKWTSGAEFYVLISPDQKCKTNSVTCEGDNVQTTNPSSKLG 600
LQPRKPQHTAQSGSKLEQLGKWTSGAEFYVLISPDQKCKTNSVTCEGDNVQTTNPSSKLG
Sbjct: 541 LQPRKPQHTAQSGSKLEQLGKWTSGAEFYVLISPDQKCKTNSVTCEGDNVQTTNPSSKLG 600
Query: 601 QFEKGKGETTSGDFVGCGKSWDGGDIAELRMEHVNLREYAIGSTEDGCKAPTAPEISFSN 660
QFEKGKGETTSGDFVGCGKSWDGGDIAELRMEHVNLREYAIGSTEDGCKAPTAPEISFSN
Sbjct: 601 QFEKGKGETTSGDFVGCGKSWDGGDIAELRMEHVNLREYAIGSTEDGCKAPTAPEISFSN 660
Query: 661 EKPTYQESTETHFKECVGAQNYQERYGDDGAFEISCVDSSKLHAPEIPGASLESCISGGH 720
EKPTYQESTETHFKECVGAQNYQERYGDDGAFEISCVDSSKLHAPEIPGASLESCISGGH
Sbjct: 661 EKPTYQESTETHFKECVGAQNYQERYGDDGAFEISCVDSSKLHAPEIPGASLESCISGGH 720
Query: 721 CNGNKSPSDASTKETTSLGESNKENNNIEALEVPCADEMQSQILQEYHEFRNENIDEKKA 780
CNGNKSPSDASTKETTSLGESNKENNNIEALEVPCADEMQSQILQEYHEFRNENIDEKKA
Sbjct: 721 CNGNKSPSDASTKETTSLGESNKENNNIEALEVPCADEMQSQILQEYHEFRNENIDEKKA 780
Query: 781 SQVKVSKLEESVEYYETPNFQKSSSTAHGETETVEKEKMFSFSDELRPQNKNIGITEAPP 840
SQVKVSKLEESVEYYETPNFQKSSSTAHGETETVEKEKMFSFSDELRPQNKNIGITEAPP
Sbjct: 781 SQVKVSKLEESVEYYETPNFQKSSSTAHGETETVEKEKMFSFSDELRPQNKNIGITEAPP 840
Query: 841 ESLIHKEIKKFGTEEKAYITLEGDVVQKSGSLEREANITLXXXXXXXXXXXXXXXXFVEG 900
ESLIHKEIKKFGTEEKAYITLEGDVVQKSGSLEREANITL FVEG
Sbjct: 841 ESLIHKEIKKFGTEEKAYITLEGDVVQKSGSLEREANITLESASANENEEAEEANAFVEG 900
Query: 901 INVMETHVSTYGTSVEDSDQIQDSENRMDGMGDLVSHGNEEAAKDPWLDNSEKSQVEEIF 960
INVMETHVSTYGTSVEDSDQIQDSENRMDGMGDLVSHGNEEAAKDPWLDNSEKSQVEEIF
Sbjct: 901 INVMETHVSTYGTSVEDSDQIQDSENRMDGMGDLVSHGNEEAAKDPWLDNSEKSQVEEIF 960
Query: 961 SHEEGQLSVEGGIDGGPNDAYAGVNAINDGNGNDSETKVIIDDGTDFNTKMSTCXXXXXX 1020
SHEEGQLSVEGGIDGGPNDAYAGVNAINDGNGNDSETKVIIDDGTDFNTKMSTC
Sbjct: 961 SHEEGQLSVEGGIDGGPNDAYAGVNAINDGNGNDSETKVIIDDGTDFNTKMSTCSKELSA 1020
Query: 1021 XXXXXXXXMQHLSQIDKSIAAQTSDKSTPLENLGEDCREREFPEENSTALEQGQAIGSKM 1080
MQHLSQIDKSIAAQTSDKSTPLENLGEDCREREFPEENSTALEQGQAIGSKM
Sbjct: 1021 SFLESSASMQHLSQIDKSIAAQTSDKSTPLENLGEDCREREFPEENSTALEQGQAIGSKM 1080
Query: 1081 EGDDKDKQSKLNVKDQKYFHLDSYIVPKFTENTTLNFVQKLIDETPDGQRIEGRENVKKT 1140
EGDDKDKQSKLNVKDQKYFHLDSYIVPKFTENTTLNFVQKLIDETPDGQRIEGRENVKKT
Sbjct: 1081 EGDDKDKQSKLNVKDQKYFHLDSYIVPKFTENTTLNFVQKLIDETPDGQRIEGRENVKKT 1140
Query: 1141 LRETEKEVLHRLDEDKEIYKMEREKEQAKXXXXXXXXXXXXXXXXXAKDRLAVQRATKXX 1200
LRETEKEVLHRLDEDKEIYKMEREKEQAK AKDRLAVQRATK
Sbjct: 1141 LRETEKEVLHRLDEDKEIYKMEREKEQAKERSRRELEEEKERERERAKDRLAVQRATKEA 1200
Query: 1201 XXXXXXXXXXXXXXXXXXXITLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLXXXXXXX 1260
ITL L
Sbjct: 1201 HERAFAEARAKAERIALERITLARQRASAEAREKEEKATAEAATEKASREARLKAERAAV 1260
Query: 1261 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTNQDNQLDKQFQKTASNNYERS 1320
TNQDNQLDKQFQKTASNNYERS
Sbjct: 1261 ERATAEARERAIEKAKAAADAKERMERFRSSFKDSFKSTNQDNQLDKQFQKTASNNYERS 1320
Query: 1321 TDSSNQGIVVEFESALRHKARSEREHRTAERAAKALAEKNMRDMLAQREQAERHRLAEYL 1380
TDSSNQGIVVEFESALRHKARSEREHRTAERAAKALAEKNMRDMLAQREQAERHRLAEYL
Sbjct: 1321 TDSSNQGIVVEFESALRHKARSEREHRTAERAAKALAEKNMRDMLAQREQAERHRLAEYL 1380
Query: 1381 DPEVKRWSNGKEGNLRALLSTLQYILGSDNGWQSVPLTDLITATAVKKAYRRATLCVHPD 1440
DPEVKRWSNGKEGNLRALLSTLQYILGSDNGWQSVPLTDLITATAVKKAYRRATLCVHPD
Sbjct: 1381 DPEVKRWSNGKEGNLRALLSTLQYILGSDNGWQSVPLTDLITATAVKKAYRRATLCVHPD 1440
Query: 1441 KLQQRGATIRQKYICEKVFDLLKEAWNKFNSEER 1474
KLQQRGATIRQKYICEKVFDLLKEAWNKFNSEER
Sbjct: 1441 KLQQRGATIRQKYICEKVFDLLKEAWNKFNSEER 1474
>Os05g0579900 Heat shock protein DnaJ, N-terminal domain containing protein
Length = 708
Score = 297 bits (761), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 139/175 (79%), Positives = 155/175 (88%), Gaps = 2/175 (1%)
Query: 1300 NQDNQLDKQFQKTASNNYERSTDSSNQGIVVEFESALRHKARSEREHRTAERAAKALAEK 1359
N DN+ D QFQ+ S+N R+ DS ++G+ E ESALRHKAR ER RTAER KALAEK
Sbjct: 536 NLDNRQDTQFQRAVSSNLMRNPDSYSKGL--EVESALRHKARLERHQRTAERVTKALAEK 593
Query: 1360 NMRDMLAQREQAERHRLAEYLDPEVKRWSNGKEGNLRALLSTLQYILGSDNGWQSVPLTD 1419
NMRD+LAQREQAE+HRL+EYLDPE+KRWSNGKEGNLRALLSTLQYILG+D+GWQ VPLT+
Sbjct: 594 NMRDLLAQREQAEKHRLSEYLDPEIKRWSNGKEGNLRALLSTLQYILGADSGWQPVPLTE 653
Query: 1420 LITATAVKKAYRRATLCVHPDKLQQRGATIRQKYICEKVFDLLKEAWNKFNSEER 1474
LITA AVKKAYR+ATLCVHPDKLQQRGATIRQKYICEKVFDLLK+AWNKF SEER
Sbjct: 654 LITAAAVKKAYRKATLCVHPDKLQQRGATIRQKYICEKVFDLLKDAWNKFTSEER 708
>Os01g0355500 TolA family protein
Length = 948
Score = 218 bits (555), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 103/144 (71%), Positives = 118/144 (81%)
Query: 1330 VEFESALRHKARSEREHRTAERAAKALAEKNMRDMLAQREQAERHRLAEYLDPEVKRWSN 1389
V+ ES R KAR ER RT ERAAKALAEKN RD+ Q EQ ERHR+ E LD E+KRW+
Sbjct: 803 VDGESEERRKARLERHQRTMERAAKALAEKNERDLQVQWEQEERHRIGETLDFEIKRWAA 862
Query: 1390 GKEGNLRALLSTLQYILGSDNGWQSVPLTDLITATAVKKAYRRATLCVHPDKLQQRGATI 1449
GKEGNLRALLSTLQY+L + GW+ V LTDLITA +VKK YR+ATLC+HPDK+QQ+GA +
Sbjct: 863 GKEGNLRALLSTLQYVLWPECGWRPVSLTDLITAASVKKEYRKATLCIHPDKVQQKGANL 922
Query: 1450 RQKYICEKVFDLLKEAWNKFNSEE 1473
+QKYI EKVFDLLKEAWNKFNSEE
Sbjct: 923 QQKYIAEKVFDLLKEAWNKFNSEE 946
>Os12g0548200 Heat shock protein DnaJ, N-terminal domain containing protein
Length = 925
Score = 213 bits (543), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 97/128 (75%), Positives = 111/128 (86%)
Query: 1346 HRTAERAAKALAEKNMRDMLAQREQAERHRLAEYLDPEVKRWSNGKEGNLRALLSTLQYI 1405
RT ERAAKALAEKN RDM QREQAERHR++E +D E+KRW+ GKEGNLRALLSTLQY+
Sbjct: 796 QRTRERAAKALAEKNERDMQVQREQAERHRISETMDFEIKRWAAGKEGNLRALLSTLQYV 855
Query: 1406 LGSDNGWQSVPLTDLITATAVKKAYRRATLCVHPDKLQQRGATIRQKYICEKVFDLLKEA 1465
L + GWQ V LTDLITA AVKK YR+ATLC+HPDK+QQ+GA ++QKY+ EKVFDLLKEA
Sbjct: 856 LWPECGWQPVSLTDLITAAAVKKVYRKATLCIHPDKVQQKGANLQQKYVAEKVFDLLKEA 915
Query: 1466 WNKFNSEE 1473
WNKFNSEE
Sbjct: 916 WNKFNSEE 923
>Os03g0198300 Heat shock protein DnaJ, N-terminal domain containing protein
Length = 607
Score = 142 bits (357), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 61/108 (56%), Positives = 84/108 (77%), Gaps = 3/108 (2%)
Query: 1364 MLAQREQAERHRLAEYLDPEVKRWSNGKEGNLRALLSTLQYILGSDNGWQSVPLTDLITA 1423
+L E+ E+ +++E ++ WS GKEGN+R+LLSTLQY+L ++GW+ VPL D+I
Sbjct: 499 ILRNNEEKEQIKISE---SKIWEWSKGKEGNIRSLLSTLQYVLWPESGWKPVPLVDIIEG 555
Query: 1424 TAVKKAYRRATLCVHPDKLQQRGATIRQKYICEKVFDLLKEAWNKFNS 1471
AVKKAY++A LC+HPDKLQQRGA + QKYI EKVFD+L+EAW +FN+
Sbjct: 556 AAVKKAYQKALLCLHPDKLQQRGAAMHQKYIAEKVFDILQEAWKEFNT 603
Database: rap3
Posted date: Nov 19, 2010 6:03 PM
Number of letters in database: 17,035,801
Number of sequences in database: 52,214
Lambda     K        H
0.309      0.127    0.362
Gapped
Lambda     K        H
0.267      0.0410   0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 52214
Number of Hits to DB: 43,569,855
Number of extensions: 1813864
Number of successful extensions: 4272
Number of sequences better than 1.0e-10: 5
Number of HSP's gapped: 4258
Number of HSP's successfully gapped: 7
Length of query: 1474
Length of database: 17,035,801
Length adjustment: 114
Effective length of query: 1360
Effective length of database: 11,083,405
Effective search space: 15073430800
Effective search space used: 15073430800
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 163 (67.4 bits)
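The statistics above can be cross-checked by hand. The sketch below assumes the standard Karlin-Altschul relations (effective length = length minus the length adjustment, bit score = (lambda * S - ln K) / ln 2, E = search space * 2^-bits); the function names are illustrative and not part of BLAST. Because the report prints lambda, K and bit scores rounded, and because compositional matrix adjustment is applied per alignment, recomputed values should only be expected to agree approximately with the printed ones.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LEN = 1474
DB_LEN = 17_035_801
NUM_SEQS = 52_214
LENGTH_ADJUSTMENT = 114
LAMBDA_GAPPED = 0.267   # printed (rounded) gapped lambda
K_GAPPED = 0.0410       # printed (rounded) gapped K


def effective_lengths(query_len, db_len, num_seqs, adjustment):
    """Return (effective query length, effective db length, search space).

    Assumes the adjustment is subtracted once from the query and once
    from every database sequence.
    """
    eff_query = query_len - adjustment
    eff_db = db_len - num_seqs * adjustment
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Approximate E-value for a bit score: search_space * 2^-bits."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    eff_q, eff_db, space = effective_lengths(
        QUERY_LEN, DB_LEN, NUM_SEQS, LENGTH_ADJUSTMENT
    )
    print("Effective length of query:   ", eff_q)
    print("Effective length of database:", eff_db)
    print("Effective search space:      ", space)
    # S2 is reported as raw score 163 in the footer.
    print("S2 in bits: %.1f" % bit_score(163, LAMBDA_GAPPED, K_GAPPED))
    # Os01g0355500 is reported at 218 bits.
    print("E for 218 bits: %.0e" % expect_value(218, space))
```

The recomputed effective lengths and search space match the footer exactly, which suggests the length adjustment of 114 was applied to the query and to each of the 52,214 database sequences.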