BLASTP 2.2.23 [Feb-03-2010]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment:
Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala,
Aleksandr Morgulis, Alejandro A. Schäffer, and Yi-Kuo Yu (2005) "Protein database
searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= Os04g0675000 Os04g0675000|AK103357
(958 letters)
Database: rap3
52,214 sequences; 17,035,801 total letters
Searching..................................................done
                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value
Os04g0675000 Protein of unknown function DUF789 family protein    1896  0.0
Os08g0100600 Protein of unknown function DUF789 family protein     194  2e-49
Os01g0138500 Protein of unknown function DUF789 family protein      88  3e-17
Os01g0306900 Protein of unknown function DUF789 family protein      81  5e-15
Os04g0528000 Protein of unknown function DUF789 family protein      72  1e-12
Os10g0494000 Protein of unknown function DUF789 family protein      71  3e-12
Os01g0513400 Protein of unknown function DUF789 family protein      71  5e-12
Os02g0827400 Protein of unknown function DUF789 family protein      68  3e-11
>Os04g0675000 Protein of unknown function DUF789 family protein
Length = 958
Score = 1896 bits (4912), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 915/958 (95%), Positives = 915/958 (95%)
Query: 1 MLRQPASTSGDVDRIANRKHRHIVPRRSTEKKNPHNIQFERQVAALEYRQEEQRKRANGG 60
MLRQPASTSGDVDRIANRKHRHIVPRRSTEKKNPHNIQFERQVAALEYRQEEQRKRANGG
Sbjct: 1 MLRQPASTSGDVDRIANRKHRHIVPRRSTEKKNPHNIQFERQVAALEYRQEEQRKRANGG 60
Query: 61 RLFFTLSLSSHLVENGDELETSASPSLLLFHFNDPEDLARCLTSSRALLEDSQQSDKAPD 120
RLFFTLSLSSHLVENGDELETSASPSLLLFHFNDPEDLARCLTSSRALLEDSQQSDKAPD
Sbjct: 61 RLFFTLSLSSHLVENGDELETSASPSLLLFHFNDPEDLARCLTSSRALLEDSQQSDKAPD 120
Query: 121 NDFSVNTFSNAXXXXXXXXXXXXXXXXXXXXXXXXXXXXEASDTQSMQSKGASHCIDVAG 180
NDFSVNTFSNA EASDTQSMQSKGASHCIDVAG
Sbjct: 121 NDFSVNTFSNASVDVKRTSRKKSKKKNKRHKRVHGKKVSEASDTQSMQSKGASHCIDVAG 180
Query: 181 GESLTLSSNHVAHAGSEMRCRKETFPSMADGGETLTLPPNHVADKLFGDLSSDSSVREVS 240
GESLTLSSNHVAHAGSEMRCRKETFPSMADGGETLTLPPNHVADKLFGDLSSDSSVREVS
Sbjct: 181 GESLTLSSNHVAHAGSEMRCRKETFPSMADGGETLTLPPNHVADKLFGDLSSDSSVREVS 240
Query: 241 AERPDSETGNDGSFITLISSTSCSDEIELSRHASYFECCEQSNSNNSRCXXXXXXXXXXX 300
AERPDSETGNDGSFITLISSTSCSDEIELSRHASYFECCEQSNSNNSRC
Sbjct: 241 AERPDSETGNDGSFITLISSTSCSDEIELSRHASYFECCEQSNSNNSRCLDSASTSTLTD 300
Query: 301 XXXXGHYTDSSWNFSDDTENLLIDKNECPPCVQSKVTDLRGSKCGGSEEKEPGKIERSNL 360
GHYTDSSWNFSDDTENLLIDKNECPPCVQSKVTDLRGSKCGGSEEKEPGKIERSNL
Sbjct: 301 SSLDGHYTDSSWNFSDDTENLLIDKNECPPCVQSKVTDLRGSKCGGSEEKEPGKIERSNL 360
Query: 361 WGVVMWNIFVVLSMVKVEKRSKISSRPSNSCTQVASKDSTKDFIHPIKVRTWTPHEVTLN 420
WGVVMWNIFVVLSMVKVEKRSKISSRPSNSCTQVASKDSTKDFIHPIKVRTWTPHEVTLN
Sbjct: 361 WGVVMWNIFVVLSMVKVEKRSKISSRPSNSCTQVASKDSTKDFIHPIKVRTWTPHEVTLN 420
Query: 421 DYMIGANMNHLQDPKQNRRGKPHKYSCLSEVANCGFIEEKSACTAKMLPGITHSTETGVG 480
DYMIGANMNHLQDPKQNRRGKPHKYSCLSEVANCGFIEEKSACTAKMLPGITHSTETGVG
Sbjct: 421 DYMIGANMNHLQDPKQNRRGKPHKYSCLSEVANCGFIEEKSACTAKMLPGITHSTETGVG 480
Query: 481 QIASSSASDVTVREISEEICTPIGPVQKGGLQILLREENVVGTGSLDVLNHVSSVDSEEQ 540
QIASSSASDVTVREISEEICTPIGPVQKGGLQILLREENVVGTGSLDVLNHVSSVDSEEQ
Sbjct: 481 QIASSSASDVTVREISEEICTPIGPVQKGGLQILLREENVVGTGSLDVLNHVSSVDSEEQ 540
Query: 541 KKVDNAVMSRSHGMEGHHLQSQDSGSQFPGCTTDYWKTSRPTESGLEVGYHGVSAFEGRC 600
KKVDNAVMSRSHGMEGHHLQSQDSGSQFPGCTTDYWKTSRPTESGLEVGYHGVSAFEGRC
Sbjct: 541 KKVDNAVMSRSHGMEGHHLQSQDSGSQFPGCTTDYWKTSRPTESGLEVGYHGVSAFEGRC 600
Query: 601 NTNQQRSVSSKLQLGEMIKAANDACKVQGASDVHLISGHPLADFETFIYSASPVIAKTSC 660
NTNQQRSVSSKLQLGEMIKAANDACKVQGASDVHLISGHPLADFETFIYSASPVIAKTSC
Sbjct: 601 NTNQQRSVSSKLQLGEMIKAANDACKVQGASDVHLISGHPLADFETFIYSASPVIAKTSC 660
Query: 661 MRNGNCLQDPQAGSSPYQYQISDVSLRNVWEWYEEPGSYGLEVEIHRSLNSTRSACGVSE 720
MRNGNCLQDPQAGSSPYQYQISDVSLRNVWEWYEEPGSYGLEVEIHRSLNSTRSACGVSE
Sbjct: 661 MRNGNCLQDPQAGSSPYQYQISDVSLRNVWEWYEEPGSYGLEVEIHRSLNSTRSACGVSE 720
Query: 721 FCAYFLPSLSAIQLFEQCKNNLDHKFDSDDDFLLSQPNGVYLPKPSLSVQDHGEPLFEYF 780
FCAYFLPSLSAIQLFEQCKNNLDHKFDSDDDFLLSQPNGVYLPKPSLSVQDHGEPLFEYF
Sbjct: 721 FCAYFLPSLSAIQLFEQCKNNLDHKFDSDDDFLLSQPNGVYLPKPSLSVQDHGEPLFEYF 780
Query: 781 ESEHPSSRPPLFEKIKQLTSGENLSTCQIFGDPKMLENLKLRDLHPASWFCVAWYPICRI 840
ESEHPSSRPPLFEKIKQLTSGENLSTCQIFGDPKMLENLKLRDLHPASWFCVAWYPICRI
Sbjct: 781 ESEHPSSRPPLFEKIKQLTSGENLSTCQIFGDPKMLENLKLRDLHPASWFCVAWYPICRI 840
Query: 841 PQGNCRAAFLTYHSLGKVVPQIHSPDKADEPTHLVCPVVGFWSYNDKGEQWFQLRNPEIK 900
PQGNCRAAFLTYHSLGKVVPQIHSPDKADEPTHLVCPVVGFWSYNDKGEQWFQLRNPEIK
Sbjct: 841 PQGNCRAAFLTYHSLGKVVPQIHSPDKADEPTHLVCPVVGFWSYNDKGEQWFQLRNPEIK 900
Query: 901 PMSLDVGPKTDRAEVLKQRLKTLRHGASVMSSMVIPKANGEKSINRHPDYEFFLSRSN 958
PMSLDVGPKTDRAEVLKQRLKTLRHGASVMSSMVIPKANGEKSINRHPDYEFFLSRSN
Sbjct: 901 PMSLDVGPKTDRAEVLKQRLKTLRHGASVMSSMVIPKANGEKSINRHPDYEFFLSRSN 958
>Os08g0100600 Protein of unknown function DUF789 family protein
Length = 156
Score = 194 bits (494), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 95/155 (61%), Positives = 114/155 (73%), Gaps = 2/155 (1%)
Query: 803 NLSTCQIFGDPKMLENLKLRDLHPASWFCVAWYPICRIPQGNCRAAFLTYHSLGKVVPQI 862
N+S QIFGDP+ L+N+KL DLHPASWF VAWYP+ R+P G RAAFLTYHSLGK+VPQ
Sbjct: 1 NVSGHQIFGDPEKLQNVKLCDLHPASWFSVAWYPVYRVPHGKLRAAFLTYHSLGKLVPQK 60
Query: 863 HSPDKADEPTHLVCPVVGFWSYNDKGEQWFQLRNPEIKPMSLDV-GPKTDRAEVLKQRLK 921
SPD + +V PV G SY+DKGEQWFQLR P+ K + +D K RAEVLK+RL+
Sbjct: 61 GSPDLTGLGSRIVSPVFGLQSYSDKGEQWFQLRRPDSKQLQIDGESSKGSRAEVLKERLR 120
Query: 922 TLRHGASVMSSMVIPKANGEKSINRHPDYEFFLSR 956
TL+ GA + V+PK GE S+N HPDYEFFLSR
Sbjct: 121 TLQRGALAAARAVVPKGGGE-SVNCHPDYEFFLSR 154
>Os01g0138500 Protein of unknown function DUF789 family protein
Length = 335
Score = 87.8 bits (216), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 99/361 (27%), Positives = 142/361 (39%), Gaps = 99/361 (27%)
Query: 643 DFETFIYSASPVIAKTSCMRN---GNCLQDPQAGSSPYQYQISDVSLRNVWEWYEEPGSY 699
+ E F+ + +PV+ T+C Q + + P+ SL ++W+ + E +Y
Sbjct: 21 NLELFLEATTPVVPTTACSSKKSMNGWKQSDEENALPF------FSLGDLWDGFRESSAY 74
Query: 700 GLEVEIHRSLNSTRSACGVSEFCAYFLPSLSAIQLFEQCKNNLDHKF----DSDDDFL-- 753
G+ V I LN GV + Y++P LSAIQL+ + + + H DSD D+
Sbjct: 75 GIAVPI--VLNGCSD--GVVQ---YYVPYLSAIQLYGRLRRHFYHSRPSGEDSDGDYCQD 127
Query: 754 -----LSQPNGVYLPKPS--LSVQD---------------------HGEPLFEYFESEHP 785
+S P + SVQD H + +FE+ ESE P
Sbjct: 128 TGSEEMSDLEHDSCPSSTDAFSVQDTTCETSTSEASSDESESTRISHEQLIFEFLESEPP 187
Query: 786 SSRPPLFEKIKQLTSGENLSTCQIFGDPKMLENLKLRDLHPASWFCVAWYPICRIPQG-- 843
R PL +KI L G F + L L+ DL P SW VAWYPI RIP G
Sbjct: 188 YQREPLADKICSLARG--------FPE---LNTLRSCDLSPTSWMSVAWYPIYRIPTGPT 236
Query: 844 --NCRAAFLTYHSL-----GKVVPQIHSPDKADEPTHLVC-PVVGFWSYNDKGEQWFQLR 895
+ A FLTYH L G + P+ + T +C P SY K W
Sbjct: 237 LCDLDACFLTYHPLSTQLTGGICPEPKGNNSGVPVTTAMCLPTFAMASYRLKVAAW---- 292
Query: 896 NPEIKPMSLDVGPKTDRAEVLKQRLKTLRHGASVMSSMVIPKANGEKSINRHPDYEFFLS 955
P D +Q + +L H A ++ HPD+ FF +
Sbjct: 293 ----APGGRD-----------RQLVASLSHAADAWLGLL---------GVHHPDHRFFAA 328
Query: 956 R 956
R
Sbjct: 329 R 329
>Os01g0306900 Protein of unknown function DUF789 family protein
Length = 357
Score = 80.9 bits (198), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 96/350 (27%), Positives = 138/350 (39%), Gaps = 88/350 (25%)
Query: 643 DFETFIYSASPVIAKTSCMRNGNCLQDPQAGSSPYQYQISDVSLRNVWEWYEEPGSYGLE 702
+ E FI S + + R + ++ A +P Y+++D+ WE + E +YG
Sbjct: 58 NLECFIASTAVRVPAHRLPRTSSSSRERGAAGAPPYYELADL-----WEAFAEWSAYGAG 112
Query: 703 VEIHRSLNSTRSACGVSEFCAYFLPSLSAIQLF---------------EQCKNNLDHKFD 747
V + LN T GV + Y++P LSAIQLF ++ D +
Sbjct: 113 VPL--LLNGTD---GVVQ---YYVPFLSAIQLFAARPPSSTSGRLGEDSDGESAQDMSSE 164
Query: 748 SDDDFLLSQ--PNGVYLPKPSLSVQDHGE------PLFEYFESEHPSSRPPLFEKIKQLT 799
SD + L + N + + S D P+F+Y E + P R PL + I L
Sbjct: 165 SDHEHLRCRCLVNSISADQDGFSSDDSESGNQELYPVFQYMEHDAPYGRQPLADMISLLA 224
Query: 800 SGENLSTCQIFGDPKMLENLKLRDLHPASWFCVAWYPICRIPQG----NCRAAFLTYHSL 855
+ F D L K DL P+SW VAWYPI RIP G + A FLT+HSL
Sbjct: 225 NR--------FPD---LRTYKSCDLLPSSWISVAWYPIYRIPTGPTLKDLDACFLTFHSL 273
Query: 856 GKVV-------PQ---IHSPDKADEPTHLVCPVVGFWSYNDKGEQWFQLRNPEIKPMSLD 905
P+ H D P + P++G S+ G W
Sbjct: 274 STPAEGTLSGHPETNVFHDSKIYDVPGKVTLPLIGLASHKFNGSMW-------------- 319
Query: 906 VGPKTDRAEVLKQRLKTLRHGASVMSSMVIPKANGEKSINRHPDYEFFLS 955
T E +Q K+L A ++ +N HPDY FFLS
Sbjct: 320 ----TSNQEHEQQLTKSLLKAADDWLC--------QRRVN-HPDYRFFLS 356
>Os04g0528000 Protein of unknown function DUF789 family protein
Length = 313
Score = 72.4 bits (176), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 76/272 (27%), Positives = 120/272 (44%), Gaps = 57/272 (20%)
Query: 642 ADFETFIYSASPVIAKTSCMR-NGNCLQDPQAGSSPYQYQISDVSLRNVWEWYEEPGSYG 700
++ +F++S +P + + + G + P+ G + L ++W + +YG
Sbjct: 21 SNLRSFLHSVTPTLEPYTVAKPGGYSGRVPELGRCFF--------LVDLWNHFYPLSAYG 72
Query: 701 LEVEIHRSLNSTRSACGVSEFCAYFLPSLSAIQL-----FEQCK-----NNLDHKFDSDD 750
+ + R G E YF+P LSAIQL F C NNL FD+++
Sbjct: 73 VGTPV-------RLPSG-QEIEQYFVPYLSAIQLHTISDFTSCNEIMVGNNL---FDANN 121
Query: 751 DFLLSQP---NGVYLPKPSLSVQD-----HGEPLFEYFESEHPSSRPPLFEKIKQLTSGE 802
S NG Y SL+ D +G P F+YFE + P R PL +K+ +L
Sbjct: 122 YGWCSAADNWNGQY-ATTSLARYDSPRSMNGGPCFQYFECDSPYERMPLADKVYEL---- 176
Query: 803 NLSTCQIFGDPKMLENLKLRDLHPASWFCVAWYPICRIPQGNCR---AAFLTYHSLGKVV 859
C F L +++L P+SW V WYPI +P N + FLTYHSL +
Sbjct: 177 ----CYNFPPLSYLSSIELS---PSSWMSVFWYPIGHVPAMNKKDLTTCFLTYHSLSTL- 228
Query: 860 PQIHSPDKADEPTHLVCPVVGFWSYNDKGEQW 891
+ +P + +P L P +G ++ G+ W
Sbjct: 229 -EDRTPFDSKDP--LTLPPIGLATHKTDGDVW 257
>Os10g0494000 Protein of unknown function DUF789 family protein
Length = 318
Score = 71.2 bits (173), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 56/196 (28%), Positives = 81/196 (41%), Gaps = 40/196 (20%)
Query: 681 ISDVSLRNVWEWYEEPGSYGLEVEIHRSLNSTRSACGVSEFCAYFLPSLSAIQLFEQCKN 740
+ +L ++WE Y E +YG +T G Y++P LS IQL+
Sbjct: 57 VEYFNLADLWEQYYEWSAYGA--------GTTVQLYGGERVVQYYVPYLSGIQLYTNKAQ 108
Query: 741 NLDHKFDSDD--DFLLSQPNGVYLPKPSLSVQD---------------HGEPLFEYFESE 783
F D+ D+ + + + S + HG FE+FE
Sbjct: 109 TASRSFGEDNGMDYWSDDEDNEKMSRSWSSTSEDSLFNCDAISGNRKRHGHMYFEFFEVC 168
Query: 784 HPSSRPPLFEKIKQLTSGENLSTCQIFGDPKMLENLKLRDLHPASWFCVAWYPICRIP-Q 842
P R PL +K+ +L+ L +L+ DL PASW VAWYPI IP Q
Sbjct: 169 SPYGRIPLIDKVYELSQSY-----------PGLTSLRSVDLSPASWMSVAWYPIYHIPYQ 217
Query: 843 GNCR---AAFLTYHSL 855
N + A FLTYH++
Sbjct: 218 RNVKDLSACFLTYHTI 233
>Os01g0513400 Protein of unknown function DUF789 family protein
Length = 369
Score = 70.9 bits (172), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 63/197 (31%), Positives = 84/197 (42%), Gaps = 48/197 (24%)
Query: 686 LRNVWEWYEEPGSYGLEVEIHRSLNSTRSACGVSEFCAYFLPSLSAIQLFE-----QCKN 740
LR+VWE Y E +YG V + G Y++P LSAIQL+ + +
Sbjct: 112 LRDVWEAYREWSAYGAGVPL--------VLDGCDGVVQYYVPYLSAIQLYGDPAVLRLSS 163
Query: 741 NLDHKFDSDDD------------FLLSQPNGVYLPKPSLSVQD------HGEPLFEYFES 782
H D D + L + +L + S D HG LF+Y E
Sbjct: 164 GPRHIMDDSDGEYHDSSSDASSDYELGRVK--HLTQEGFSSDDGESGDLHGRLLFQYLEF 221
Query: 783 EHPSSRPPLFEKIKQLTSGENLSTCQIFGDPKMLENLKLRDLHPASWFCVAWYPICRIPQ 842
+ P R PL +KI L++ + G L L+ DL P SW VAWYPI RIP
Sbjct: 222 DSPFCREPLTDKISSLSA-------RFPG----LRTLRSCDLSPRSWISVAWYPIYRIPT 270
Query: 843 G----NCRAAFLTYHSL 855
G + A FLT+H L
Sbjct: 271 GPTLKDLDACFLTFHRL 287
>Os02g0827400 Protein of unknown function DUF789 family protein
Length = 299
Score = 68.2 bits (165), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 75/288 (26%), Positives = 114/288 (39%), Gaps = 60/288 (20%)
Query: 643 DFETFIYSASPVIAKTSCMRNGNCLQDPQAGSSPYQY-----QISDVSLRNVWEWYEEPG 697
+ E F+ +A+P + S + C QDP S +Q ++ +L ++WE Y E
Sbjct: 11 NLEVFLQAATPCLRWRSA--SMECFQDP---SKVWQLDKKKDEVDYFALEDLWEHYAESS 65
Query: 698 SYGLEVEIHRSLNSTRSACGVSEFCAYFLPSLSAIQLFEQCKNNLDHKFDS--------- 748
+YGL V + +T +F+P LSAIQ++ K+ L S
Sbjct: 66 AYGLAVPVRLESGNT--------ITQHFVPYLSAIQIYTSTKSLLAFSRGSAGSESDSWS 117
Query: 749 ------------DDDFLLSQPNGVYLPKPSLSVQDHGEPLFEYFESEHPSSRPPLFEKIK 796
D + + + Q G F+Y E P R PL +K+
Sbjct: 118 DDSTGDKLSRSWDAAMSDDDDSSHDSSESVSAKQGAGCLNFQYNEWSSPYERVPLADKVA 177
Query: 797 QLTSGENLSTCQIFGDPKMLENLKLRDLHPASWFCVAWYPICRIP-QGNCRA---AFLTY 852
+L L +L L P+SW VAWYPI IP +GN + FLTY
Sbjct: 178 ELAQHY-----------PCLTSLSSAQLSPSSWMSVAWYPIYHIPARGNLKGLSTCFLTY 226
Query: 853 HSLGKVVPQIHSPDKADEPTHLV-CPVVGFWSYNDKGEQWFQLRNPEI 899
HSL V D +E +V G +Y +G+ W R+ ++
Sbjct: 227 HSLSSVF-----QDNVEEGRSVVGVSPFGLATYRAEGKLWTSSRSSDL 269
Database: rap3
Posted date: Nov 19, 2010 6:03 PM
Number of letters in database: 17,035,801
Number of sequences in database: 52,214
Lambda     K        H
 0.314    0.130    0.389
Gapped
Lambda     K        H
 0.267    0.0410   0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 52214
Number of Hits to DB: 33,234,313
Number of extensions: 1399769
Number of successful extensions: 2292
Number of sequences better than 1.0e-10: 8
Number of HSP's gapped: 2278
Number of HSP's successfully gapped: 8
Length of query: 958
Length of database: 17,035,801
Length adjustment: 110
Effective length of query: 848
Effective length of database: 11,292,261
Effective search space: 9575837328
Effective search space used: 9575837328
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 161 (66.6 bits)
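
Note on the statistics above: the figures in this footer appear to be related by the standard Karlin-Altschul conversions. The sketch below is a minimal illustration, not part of the BLAST output. The function names are hypothetical, and the assignment of the ungapped parameters to X1 and S1 and of the gapped parameters to X2, X3 and S2 is an assumption that happens to be consistent with the printed values. Because Lambda and K are printed rounded, the results match the report only to within about 0.1 bit.

```python
import math

# Parameters copied from the report footer (printed values are rounded).
LAMBDA_UNGAPPED, K_UNGAPPED = 0.314, 0.130
LAMBDA_GAPPED, K_GAPPED = 0.267, 0.0410


def bit_score(raw_score, lam, k):
    """Convert a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def dropoff_bits(raw_dropoff, lam):
    """Convert an X-dropoff value to bits: lambda * X / ln 2 (no K term)."""
    return lam * raw_dropoff / math.log(2)


def effective_lengths(query_len, db_len, n_seqs, adjustment):
    """Apply the length adjustment once to the query and once per database sequence."""
    eff_query = query_len - adjustment
    eff_db = db_len - n_seqs * adjustment
    return eff_query, eff_db, eff_query * eff_db


if __name__ == "__main__":
    # Lengths, sequence count and adjustment as listed in the footer.
    print(effective_lengths(958, 17_035_801, 52_214, 110))
    # S1 and X1 with the ungapped parameters; S2, X2, X3 with the gapped ones.
    print(round(bit_score(42, LAMBDA_UNGAPPED, K_UNGAPPED), 1))
    print(round(bit_score(161, LAMBDA_GAPPED, K_GAPPED), 1))
    print(round(dropoff_bits(38, LAMBDA_GAPPED), 1))
    print(round(dropoff_bits(64, LAMBDA_GAPPED), 1))
```

The effective search space is the product of the two effective lengths, which is why the footer lists it directly after them.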