; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc02G11050 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc02G11050
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionReverse transcriptase
Genome locationClcChr02:19792489..19803946
RNA-Seq ExpressionClc02G11050
SyntenyClc02G11050
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015979 - photosynthesis (biological process)
GO:0090304 - nucleic acid metabolic process (biological process)
GO:0009579 - thylakoid (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001280 - Photosystem I PsaA/PsaB
IPR001969 - Aspartic peptidase, active site
IPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
IPR036408 - Photosystem I PsaA/PsaB superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032277.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]2.1e-30052.29Show/hide
Query:  PAEAEAWLNQVEKCFRVMHCPKDRKLDLVTFMLQKGAEDWWRLIEHRNFDVGSLIWADFKKSFQEKYYPKSYCYKKRKEFLNLVQGNISVVEYEKKFTEL
        PA+AE WLN +EKCF VM+CP++RK+ L TF+LQK AE WW+ I  R  D  +L W  F+  F++KYYP +YC  KR EFL L QG++SV EYE+K+TEL
Subjt:  PAEAEAWLNQVEKCFRVMHCPKDRKLDLVTFMLQKGAEDWWRLIEHRNFDVGSLIWADFKKSFQEKYYPKSYCYKKRKEFLNLVQGNISVVEYEKKFTEL

Query:  ARYALALVVEEIDKCKRFEEGLRSEIRTPVTISTEWTDFAKLVEAAMRVEKSLAGEGADRGKNRTGRLTVHLGSRLTEVTDDVLHQESLVREGLRPGLLV
        +RYA  ++  E D+C+RFE GLR EIRTPVT   +WT+F++LVE A+RVE+S+  E          +  V L SR T        +E   +    PG+ +
Subjt:  ARYALALVVEEIDKCKRFEEGLRSEIRTPVTISTEWTDFAKLVEAAMRVEKSLAGEGADRGKNRTGRLTVHLGSRLTEVTDDVLHQESLVREGLRPGLLV

Query:  RISLEDHKEEIRRENGGGFLRHLYLRSQF-----------------SQP-SESVASSARKSYCTS-----------------YVNQP-------------
               +++ +  +GG   R++   S F                 SQP  ES+AS+ R+  CTS                    QP             
Subjt:  RISLEDHKEEIRRENGGGFLRHLYLRSQF-----------------SQP-SESVASSARKSYCTS-----------------YVNQP-------------

Query:  -RQEMGAG--------------EGNSGGRQRGHAGRPRQQGQVFAVTQQEAEATSDVVTGMLTICSKNAHVLIDSGATPSFVSREFVVHINRKLEPLPET
         +++ G G              EG SG RQ+G  GRPRQQG+V+A+TQQE E   DV+TG + IC+  A VL D GAT SFVS  F+  +NR LEPL E 
Subjt:  -RQEMGAG--------------EGNSGGRQRGHAGRPRQQGQVFAVTQQEAEATSDVVTGMLTICSKNAHVLIDSGATPSFVSREFVVHINRKLEPLPET

Query:  LLVHTPIGETTIIEHAYLECKIVIDDVVWLDDLLPLVLLEFDVILGLDFLSKYHAKVYCFKKEVRLAKSDGVEVVFKGRRQILPTCVISAVKATKLLSKG
        L ++TP+G+  ++      C+++++ +  L DLLPL L   DVILG+DFL  ++A + C +KEV   K    EVVF+G R+ +   +IS +KA KLL KG
Subjt:  LLVHTPIGETTIIEHAYLECKIVIDDVVWLDDLLPLVLLEFDVILGLDFLSKYHAKVYCFKKEVRLAKSDGVEVVFKGRRQILPTCVISAVKATKLLSKG

Query:  CEAYLAHVTEAKLEKLKPEDVPVVREFLDVFPEELSGLPPDRE---------------------------ELKVQLQKLVDKGYIRPSVSPWGAPVLFMK
        C A+LAH+   + EKLKPEDVPVV+EFLDVFP++LSGLPPDRE                           ELK+QLQ+LVDKGYIRPSVSPWGAPVLF+K
Subjt:  CEAYLAHVTEAKLEKLKPEDVPVVREFLDVFPEELSGLPPDRE---------------------------ELKVQLQKLVDKGYIRPSVSPWGAPVLFMK

Query:  KKDGTMRLCIDYRQLNKVTVRNKYPLPRIDDLFDQLRGASVFSKIDMRYGYHQLKVRETDIPKTTFRTRYGHHEFLVMPFGLTNAPAVFMDLMNRIFHPY
        KKDGT+RLCIDYRQLNKVT+RNKYPLPRIDDLFDQLRGA++FSKID+R GYHQLKVRE+DI KT FRTRYGH+EF VMPFGLTNAPAVFMDLMNRIFH Y
Subjt:  KKDGTMRLCIDYRQLNKVTVRNKYPLPRIDDLFDQLRGASVFSKIDMRYGYHQLKVRETDIPKTTFRTRYGHHEFLVMPFGLTNAPAVFMDLMNRIFHPY

Query:  VDQF---------------------------TLRGKRLYAKFSKCEFWLEQVMFLGHIVSVAGVCVDPQKTEAITRWEQPKT------------------
        +DQF                           TLR K+LYAKFSKCEFWLEQV+FLGH+VS  GV VDPQK EA+  WE+P +                  
Subjt:  VDQF---------------------------TLRGKRLYAKFSKCEFWLEQVMFLGHIVSVAGVCVDPQKTEAITRWEQPKT------------------

Query:  -------------------------------ELKKRLVTAPILALQETGKEFEVYCDAFHHGLGCVLMQEGRVIAYASRQLRPHEVNYPTHDLELAVVVL
                                       ELKKRLVTAPILAL  TGK++ +YCDA   GLGCVLMQ+G VIAYASRQL+ HE NYPTHDLELA VVL
Subjt:  -------------------------------ELKKRLVTAPILALQETGKEFEVYCDAFHHGLGCVLMQEGRVIAYASRQLRPHEVNYPTHDLELAVVVL

Query:  ALKIWRHYLFGERCHIYTDHKSLKYIFDQKELNMRQRRWLELIKDYDCTIEYHPRKANVVADALSRKSRSGVSSLNTIRVTLLKEFKNSAASLKVTGTGT
        ALKIWRHYLFGE+CHI+TDHKSLKYIFDQKELN+RQRRWLELIKDYDCTIEYHP KANVVADALSRKSR   S+L  IRV LL E + S A +    +G+
Subjt:  ALKIWRHYLFGERCHIYTDHKSLKYIFDQKELNMRQRRWLELIKDYDCTIEYHPRKANVVADALSRKSRSGVSSLNTIRVTLLKEFKNSAASLKVTGTGT

Query:  LLAHFQLRPKLVDEVIRKQTKDPVVKKLIEEVKAQQKVDFELRPDGVLLKQDRIYVPRD--------EEAHSSAYAMHPGSTKMYRTLRKFYWW
        LLA FQ+R  LV E++R+Q++D  ++K  E+ K   +V+FELR DG ++KQ R+ VP          EEAHSSAYAMHPGSTKMYRTL+K YWW
Subjt:  LLAHFQLRPKLVDEVIRKQTKDPVVKKLIEEVKAQQKVDFELRPDGVLLKQDRIYVPRD--------EEAHSSAYAMHPGSTKMYRTLRKFYWW

KAA0035455.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]2.1e-30052.29Show/hide
Query:  PAEAEAWLNQVEKCFRVMHCPKDRKLDLVTFMLQKGAEDWWRLIEHRNFDVGSLIWADFKKSFQEKYYPKSYCYKKRKEFLNLVQGNISVVEYEKKFTEL
        PA+AE WLN +EKCF VM+CP++RK+ L TF+LQK AE WW+ I  R  D  +L W  F+  F++KYYP +YC  KR EFL L QG++SV EYE+K+TEL
Subjt:  PAEAEAWLNQVEKCFRVMHCPKDRKLDLVTFMLQKGAEDWWRLIEHRNFDVGSLIWADFKKSFQEKYYPKSYCYKKRKEFLNLVQGNISVVEYEKKFTEL

Query:  ARYALALVVEEIDKCKRFEEGLRSEIRTPVTISTEWTDFAKLVEAAMRVEKSLAGEGADRGKNRTGRLTVHLGSRLTEVTDDVLHQESLVREGLRPGLLV
        +RYA  ++  E D+C+RFE GLR EIRTPVT   +WT+F++LVE A+RVE+S+  E          +  V L SR T        +E   +    PG+ +
Subjt:  ARYALALVVEEIDKCKRFEEGLRSEIRTPVTISTEWTDFAKLVEAAMRVEKSLAGEGADRGKNRTGRLTVHLGSRLTEVTDDVLHQESLVREGLRPGLLV

Query:  RISLEDHKEEIRRENGGGFLRHLYLRSQF-----------------SQP-SESVASSARKSYCTS-----------------YVNQP-------------
               +++ +  +GG   R++   S F                 SQP  ES+AS+ R+  CTS                    QP             
Subjt:  RISLEDHKEEIRRENGGGFLRHLYLRSQF-----------------SQP-SESVASSARKSYCTS-----------------YVNQP-------------

Query:  -RQEMGAG--------------EGNSGGRQRGHAGRPRQQGQVFAVTQQEAEATSDVVTGMLTICSKNAHVLIDSGATPSFVSREFVVHINRKLEPLPET
         +++ G G              EG SG RQ+G  GRPRQQG+V+A+TQQE E   DV+TG + IC+  A VL D GAT SFVS  F+  +NR LEPL E 
Subjt:  -RQEMGAG--------------EGNSGGRQRGHAGRPRQQGQVFAVTQQEAEATSDVVTGMLTICSKNAHVLIDSGATPSFVSREFVVHINRKLEPLPET

Query:  LLVHTPIGETTIIEHAYLECKIVIDDVVWLDDLLPLVLLEFDVILGLDFLSKYHAKVYCFKKEVRLAKSDGVEVVFKGRRQILPTCVISAVKATKLLSKG
        L ++TP+G+  ++      C+++++ +  L DLLPL L   DVILG+DFL  ++A + C +KEV   K    EVVF+G R+ +   +IS +KA KLL KG
Subjt:  LLVHTPIGETTIIEHAYLECKIVIDDVVWLDDLLPLVLLEFDVILGLDFLSKYHAKVYCFKKEVRLAKSDGVEVVFKGRRQILPTCVISAVKATKLLSKG

Query:  CEAYLAHVTEAKLEKLKPEDVPVVREFLDVFPEELSGLPPDRE---------------------------ELKVQLQKLVDKGYIRPSVSPWGAPVLFMK
        C A+LAH+   + EKLKPEDVPVV+EFLDVFP++LSGLPPDRE                           ELK+QLQ+LVDKGYIRPSVSPWGAPVLF+K
Subjt:  CEAYLAHVTEAKLEKLKPEDVPVVREFLDVFPEELSGLPPDRE---------------------------ELKVQLQKLVDKGYIRPSVSPWGAPVLFMK

Query:  KKDGTMRLCIDYRQLNKVTVRNKYPLPRIDDLFDQLRGASVFSKIDMRYGYHQLKVRETDIPKTTFRTRYGHHEFLVMPFGLTNAPAVFMDLMNRIFHPY
        KKDGT+RLCIDYRQLNKVT+RNKYPLPRIDDLFDQLRGA++FSKID+R GYHQLKVRE+DI KT FRTRYGH+EF VMPFGLTNAPAVFMDLMNRIFH Y
Subjt:  KKDGTMRLCIDYRQLNKVTVRNKYPLPRIDDLFDQLRGASVFSKIDMRYGYHQLKVRETDIPKTTFRTRYGHHEFLVMPFGLTNAPAVFMDLMNRIFHPY

Query:  VDQF---------------------------TLRGKRLYAKFSKCEFWLEQVMFLGHIVSVAGVCVDPQKTEAITRWEQPKT------------------
        +DQF                           TLR K+LYAKFSKCEFWLEQV+FLGH+VS  GV VDPQK EA+  WE+P +                  
Subjt:  VDQF---------------------------TLRGKRLYAKFSKCEFWLEQVMFLGHIVSVAGVCVDPQKTEAITRWEQPKT------------------

Query:  -------------------------------ELKKRLVTAPILALQETGKEFEVYCDAFHHGLGCVLMQEGRVIAYASRQLRPHEVNYPTHDLELAVVVL
                                       ELKKRLVTAPILAL  TGK++ +YCDA   GLGCVLMQ+G VIAYASRQL+ HE NYPTHDLELA VVL
Subjt:  -------------------------------ELKKRLVTAPILALQETGKEFEVYCDAFHHGLGCVLMQEGRVIAYASRQLRPHEVNYPTHDLELAVVVL

Query:  ALKIWRHYLFGERCHIYTDHKSLKYIFDQKELNMRQRRWLELIKDYDCTIEYHPRKANVVADALSRKSRSGVSSLNTIRVTLLKEFKNSAASLKVTGTGT
        ALKIWRHYLFGE+CHI+TDHKSLKYIFDQKELN+RQRRWLELIKDYDCTIEYHP KANVVADALSRKSR   S+L  IRV LL E + S A +    +G+
Subjt:  ALKIWRHYLFGERCHIYTDHKSLKYIFDQKELNMRQRRWLELIKDYDCTIEYHPRKANVVADALSRKSRSGVSSLNTIRVTLLKEFKNSAASLKVTGTGT

Query:  LLAHFQLRPKLVDEVIRKQTKDPVVKKLIEEVKAQQKVDFELRPDGVLLKQDRIYVPRD--------EEAHSSAYAMHPGSTKMYRTLRKFYWW
        LLA FQ+R  LV E++R+Q++D  ++K  E+ K   +V+FELR DG ++KQ R+ VP          EEAHSSAYAMHPGSTKMYRTL+K YWW
Subjt:  LLAHFQLRPKLVDEVIRKQTKDPVVKKLIEEVKAQQKVDFELRPDGVLLKQDRIYVPRD--------EEAHSSAYAMHPGSTKMYRTLRKFYWW

KAA0056684.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]2.1e-30052.29Show/hide
Query:  PAEAEAWLNQVEKCFRVMHCPKDRKLDLVTFMLQKGAEDWWRLIEHRNFDVGSLIWADFKKSFQEKYYPKSYCYKKRKEFLNLVQGNISVVEYEKKFTEL
        PA+AE WLN +EKCF VM+CP++RK+ L TF+LQK AE WW+ I  R  D  +L W  F+  F++KYYP +YC  KR EFL L QG++SV EYE+K+TEL
Subjt:  PAEAEAWLNQVEKCFRVMHCPKDRKLDLVTFMLQKGAEDWWRLIEHRNFDVGSLIWADFKKSFQEKYYPKSYCYKKRKEFLNLVQGNISVVEYEKKFTEL

Query:  ARYALALVVEEIDKCKRFEEGLRSEIRTPVTISTEWTDFAKLVEAAMRVEKSLAGEGADRGKNRTGRLTVHLGSRLTEVTDDVLHQESLVREGLRPGLLV
        +RYA  ++  E D+C+RFE GLR EIRTPVT   +WT+F++LVE A+RVE+S+  E          +  V L SR T        +E   +    PG+ +
Subjt:  ARYALALVVEEIDKCKRFEEGLRSEIRTPVTISTEWTDFAKLVEAAMRVEKSLAGEGADRGKNRTGRLTVHLGSRLTEVTDDVLHQESLVREGLRPGLLV

Query:  RISLEDHKEEIRRENGGGFLRHLYLRSQF-----------------SQP-SESVASSARKSYCTS-----------------YVNQP-------------
               +++ +  +GG   R++   S F                 SQP  ES+AS+ R+  CTS                    QP             
Subjt:  RISLEDHKEEIRRENGGGFLRHLYLRSQF-----------------SQP-SESVASSARKSYCTS-----------------YVNQP-------------

Query:  -RQEMGAG--------------EGNSGGRQRGHAGRPRQQGQVFAVTQQEAEATSDVVTGMLTICSKNAHVLIDSGATPSFVSREFVVHINRKLEPLPET
         +++ G G              EG SG RQ+G  GRPRQQG+V+A+TQQE E   DV+TG + IC+  A VL D GAT SFVS  F+  +NR LEPL E 
Subjt:  -RQEMGAG--------------EGNSGGRQRGHAGRPRQQGQVFAVTQQEAEATSDVVTGMLTICSKNAHVLIDSGATPSFVSREFVVHINRKLEPLPET

Query:  LLVHTPIGETTIIEHAYLECKIVIDDVVWLDDLLPLVLLEFDVILGLDFLSKYHAKVYCFKKEVRLAKSDGVEVVFKGRRQILPTCVISAVKATKLLSKG
        L ++TP+G+  ++      C+++++ +  L DLLPL L   DVILG+DFL  ++A + C +KEV   K    EVVF+G R+ +   +IS +KA KLL KG
Subjt:  LLVHTPIGETTIIEHAYLECKIVIDDVVWLDDLLPLVLLEFDVILGLDFLSKYHAKVYCFKKEVRLAKSDGVEVVFKGRRQILPTCVISAVKATKLLSKG

Query:  CEAYLAHVTEAKLEKLKPEDVPVVREFLDVFPEELSGLPPDRE---------------------------ELKVQLQKLVDKGYIRPSVSPWGAPVLFMK
        C A+LAH+   + EKLKPEDVPVV+EFLDVFP++LSGLPPDRE                           ELK+QLQ+LVDKGYIRPSVSPWGAPVLF+K
Subjt:  CEAYLAHVTEAKLEKLKPEDVPVVREFLDVFPEELSGLPPDRE---------------------------ELKVQLQKLVDKGYIRPSVSPWGAPVLFMK

Query:  KKDGTMRLCIDYRQLNKVTVRNKYPLPRIDDLFDQLRGASVFSKIDMRYGYHQLKVRETDIPKTTFRTRYGHHEFLVMPFGLTNAPAVFMDLMNRIFHPY
        KKDGT+RLCIDYRQLNKVT+RNKYPLPRIDDLFDQLRGA++FSKID+R GYHQLKVRE+DI KT FRTRYGH+EF VMPFGLTNAPAVFMDLMNRIFH Y
Subjt:  KKDGTMRLCIDYRQLNKVTVRNKYPLPRIDDLFDQLRGASVFSKIDMRYGYHQLKVRETDIPKTTFRTRYGHHEFLVMPFGLTNAPAVFMDLMNRIFHPY

Query:  VDQF---------------------------TLRGKRLYAKFSKCEFWLEQVMFLGHIVSVAGVCVDPQKTEAITRWEQPKT------------------
        +DQF                           TLR K+LYAKFSKCEFWLEQV+FLGH+VS  GV VDPQK EA+  WE+P +                  
Subjt:  VDQF---------------------------TLRGKRLYAKFSKCEFWLEQVMFLGHIVSVAGVCVDPQKTEAITRWEQPKT------------------

Query:  -------------------------------ELKKRLVTAPILALQETGKEFEVYCDAFHHGLGCVLMQEGRVIAYASRQLRPHEVNYPTHDLELAVVVL
                                       ELKKRLVTAPILAL  TGK++ +YCDA   GLGCVLMQ+G VIAYASRQL+ HE NYPTHDLELA VVL
Subjt:  -------------------------------ELKKRLVTAPILALQETGKEFEVYCDAFHHGLGCVLMQEGRVIAYASRQLRPHEVNYPTHDLELAVVVL

Query:  ALKIWRHYLFGERCHIYTDHKSLKYIFDQKELNMRQRRWLELIKDYDCTIEYHPRKANVVADALSRKSRSGVSSLNTIRVTLLKEFKNSAASLKVTGTGT
        ALKIWRHYLFGE+CHI+TDHKSLKYIFDQKELN+RQRRWLELIKDYDCTIEYHP KANVVADALSRKSR   S+L  IRV LL E + S A +    +G+
Subjt:  ALKIWRHYLFGERCHIYTDHKSLKYIFDQKELNMRQRRWLELIKDYDCTIEYHPRKANVVADALSRKSRSGVSSLNTIRVTLLKEFKNSAASLKVTGTGT

Query:  LLAHFQLRPKLVDEVIRKQTKDPVVKKLIEEVKAQQKVDFELRPDGVLLKQDRIYVPRD--------EEAHSSAYAMHPGSTKMYRTLRKFYWW
        LLA FQ+R  LV E++R+Q++D  ++K  E+ K   +V+FELR DG ++KQ R+ VP          EEAHSSAYAMHPGSTKMYRTL+K YWW
Subjt:  LLAHFQLRPKLVDEVIRKQTKDPVVKKLIEEVKAQQKVDFELRPDGVLLKQDRIYVPRD--------EEAHSSAYAMHPGSTKMYRTLRKFYWW

KAA0066849.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]2.1e-30052.29Show/hide
Query:  PAEAEAWLNQVEKCFRVMHCPKDRKLDLVTFMLQKGAEDWWRLIEHRNFDVGSLIWADFKKSFQEKYYPKSYCYKKRKEFLNLVQGNISVVEYEKKFTEL
        PA+AE WLN +EKCF VM+CP++RK+ L TF+LQK AE WW+ I  R  D  +L W  F+  F++KYYP +YC  KR EFL L QG++SV EYE+K+TEL
Subjt:  PAEAEAWLNQVEKCFRVMHCPKDRKLDLVTFMLQKGAEDWWRLIEHRNFDVGSLIWADFKKSFQEKYYPKSYCYKKRKEFLNLVQGNISVVEYEKKFTEL

Query:  ARYALALVVEEIDKCKRFEEGLRSEIRTPVTISTEWTDFAKLVEAAMRVEKSLAGEGADRGKNRTGRLTVHLGSRLTEVTDDVLHQESLVREGLRPGLLV
        +RYA  ++  E D+C+RFE GLR EIRTPVT   +WT+F++LVE A+RVE+S+  E          +  V L SR T        +E   +    PG+ +
Subjt:  ARYALALVVEEIDKCKRFEEGLRSEIRTPVTISTEWTDFAKLVEAAMRVEKSLAGEGADRGKNRTGRLTVHLGSRLTEVTDDVLHQESLVREGLRPGLLV

Query:  RISLEDHKEEIRRENGGGFLRHLYLRSQF-----------------SQP-SESVASSARKSYCTS-----------------YVNQP-------------
               +++ +  +GG   R++   S F                 SQP  ES+AS+ R+  CTS                    QP             
Subjt:  RISLEDHKEEIRRENGGGFLRHLYLRSQF-----------------SQP-SESVASSARKSYCTS-----------------YVNQP-------------

Query:  -RQEMGAG--------------EGNSGGRQRGHAGRPRQQGQVFAVTQQEAEATSDVVTGMLTICSKNAHVLIDSGATPSFVSREFVVHINRKLEPLPET
         +++ G G              EG SG RQ+G  GRPRQQG+V+A+TQQE E   DV+TG + IC+  A VL D GAT SFVS  F+  +NR LEPL E 
Subjt:  -RQEMGAG--------------EGNSGGRQRGHAGRPRQQGQVFAVTQQEAEATSDVVTGMLTICSKNAHVLIDSGATPSFVSREFVVHINRKLEPLPET

Query:  LLVHTPIGETTIIEHAYLECKIVIDDVVWLDDLLPLVLLEFDVILGLDFLSKYHAKVYCFKKEVRLAKSDGVEVVFKGRRQILPTCVISAVKATKLLSKG
        L ++TP+G+  ++      C+++++ +  L DLLPL L   DVILG+DFL  ++A + C +KEV   K    EVVF+G R+ +   +IS +KA KLL KG
Subjt:  LLVHTPIGETTIIEHAYLECKIVIDDVVWLDDLLPLVLLEFDVILGLDFLSKYHAKVYCFKKEVRLAKSDGVEVVFKGRRQILPTCVISAVKATKLLSKG

Query:  CEAYLAHVTEAKLEKLKPEDVPVVREFLDVFPEELSGLPPDRE---------------------------ELKVQLQKLVDKGYIRPSVSPWGAPVLFMK
        C A+LAH+   + EKLKPEDVPVV+EFLDVFP++LSGLPPDRE                           ELK+QLQ+LVDKGYIRPSVSPWGAPVLF+K
Subjt:  CEAYLAHVTEAKLEKLKPEDVPVVREFLDVFPEELSGLPPDRE---------------------------ELKVQLQKLVDKGYIRPSVSPWGAPVLFMK

Query:  KKDGTMRLCIDYRQLNKVTVRNKYPLPRIDDLFDQLRGASVFSKIDMRYGYHQLKVRETDIPKTTFRTRYGHHEFLVMPFGLTNAPAVFMDLMNRIFHPY
        KKDGT+RLCIDYRQLNKVT+RNKYPLPRIDDLFDQLRGA++FSKID+R GYHQLKVRE+DI KT FRTRYGH+EF VMPFGLTNAPAVFMDLMNRIFH Y
Subjt:  KKDGTMRLCIDYRQLNKVTVRNKYPLPRIDDLFDQLRGASVFSKIDMRYGYHQLKVRETDIPKTTFRTRYGHHEFLVMPFGLTNAPAVFMDLMNRIFHPY

Query:  VDQF---------------------------TLRGKRLYAKFSKCEFWLEQVMFLGHIVSVAGVCVDPQKTEAITRWEQPKT------------------
        +DQF                           TLR K+LYAKFSKCEFWLEQV+FLGH+VS  GV VDPQK EA+  WE+P +                  
Subjt:  VDQF---------------------------TLRGKRLYAKFSKCEFWLEQVMFLGHIVSVAGVCVDPQKTEAITRWEQPKT------------------

Query:  -------------------------------ELKKRLVTAPILALQETGKEFEVYCDAFHHGLGCVLMQEGRVIAYASRQLRPHEVNYPTHDLELAVVVL
                                       ELKKRLVTAPILAL  TGK++ +YCDA   GLGCVLMQ+G VIAYASRQL+ HE NYPTHDLELA VVL
Subjt:  -------------------------------ELKKRLVTAPILALQETGKEFEVYCDAFHHGLGCVLMQEGRVIAYASRQLRPHEVNYPTHDLELAVVVL

Query:  ALKIWRHYLFGERCHIYTDHKSLKYIFDQKELNMRQRRWLELIKDYDCTIEYHPRKANVVADALSRKSRSGVSSLNTIRVTLLKEFKNSAASLKVTGTGT
        ALKIWRHYLFGE+CHI+TDHKSLKYIFDQKELN+RQRRWLELIKDYDCTIEYHP KANVVADALSRKSR   S+L  IRV LL E + S A +    +G+
Subjt:  ALKIWRHYLFGERCHIYTDHKSLKYIFDQKELNMRQRRWLELIKDYDCTIEYHPRKANVVADALSRKSRSGVSSLNTIRVTLLKEFKNSAASLKVTGTGT

Query:  LLAHFQLRPKLVDEVIRKQTKDPVVKKLIEEVKAQQKVDFELRPDGVLLKQDRIYVPRD--------EEAHSSAYAMHPGSTKMYRTLRKFYWW
        LLA FQ+R  LV E++R+Q++D  ++K  E+ K   +V+FELR DG ++KQ R+ VP          EEAHSSAYAMHPGSTKMYRTL+K YWW
Subjt:  LLAHFQLRPKLVDEVIRKQTKDPVVKKLIEEVKAQQKVDFELRPDGVLLKQDRIYVPRD--------EEAHSSAYAMHPGSTKMYRTLRKFYWW

TYK00844.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]3.6e-30052.29Show/hide
Query:  PAEAEAWLNQVEKCFRVMHCPKDRKLDLVTFMLQKGAEDWWRLIEHRNFDVGSLIWADFKKSFQEKYYPKSYCYKKRKEFLNLVQGNISVVEYEKKFTEL
        PA+AE WLN +EKCF VM+CP++RK+ L TF+LQK AE WW+ I  R  D  +L W  F+  F++KYYP +YC  KR EFL L QG++SV EYE+K+TEL
Subjt:  PAEAEAWLNQVEKCFRVMHCPKDRKLDLVTFMLQKGAEDWWRLIEHRNFDVGSLIWADFKKSFQEKYYPKSYCYKKRKEFLNLVQGNISVVEYEKKFTEL

Query:  ARYALALVVEEIDKCKRFEEGLRSEIRTPVTISTEWTDFAKLVEAAMRVEKSLAGEGADRGKNRTGRLTVHLGSRLTEVTDDVLHQESLVREGLRPGLLV
        +RYA  ++  E D+C+RFE GLR EIRTPVT   +WT+F++LVE A+RVE+S+  E          +  V L SR T        +E   +    PG+ +
Subjt:  ARYALALVVEEIDKCKRFEEGLRSEIRTPVTISTEWTDFAKLVEAAMRVEKSLAGEGADRGKNRTGRLTVHLGSRLTEVTDDVLHQESLVREGLRPGLLV

Query:  RISLEDHKEEIRRENGGGFLRHLYLRSQF-----------------SQP-SESVASSARKSYCTS-----------------YVNQP-------------
               +++ +  +GG   R++   S F                 SQP  ES+AS+ R+  CTS                    QP             
Subjt:  RISLEDHKEEIRRENGGGFLRHLYLRSQF-----------------SQP-SESVASSARKSYCTS-----------------YVNQP-------------

Query:  -RQEMGAG--------------EGNSGGRQRGHAGRPRQQGQVFAVTQQEAEATSDVVTGMLTICSKNAHVLIDSGATPSFVSREFVVHINRKLEPLPET
         +++ G G              EG SG RQ+G  GRPRQQG+V+A+TQQE E   DV+TG + IC+  A VL D GAT SFVS  F+  +NR LEPL E 
Subjt:  -RQEMGAG--------------EGNSGGRQRGHAGRPRQQGQVFAVTQQEAEATSDVVTGMLTICSKNAHVLIDSGATPSFVSREFVVHINRKLEPLPET

Query:  LLVHTPIGETTIIEHAYLECKIVIDDVVWLDDLLPLVLLEFDVILGLDFLSKYHAKVYCFKKEVRLAKSDGVEVVFKGRRQILPTCVISAVKATKLLSKG
        L ++TP+G+  ++      C+++++ +  L DLLPL L   DVILG+DFL  ++A + C +KEV   K    EVVF+G R+ +   +IS +KA KLL KG
Subjt:  LLVHTPIGETTIIEHAYLECKIVIDDVVWLDDLLPLVLLEFDVILGLDFLSKYHAKVYCFKKEVRLAKSDGVEVVFKGRRQILPTCVISAVKATKLLSKG

Query:  CEAYLAHVTEAKLEKLKPEDVPVVREFLDVFPEELSGLPPDRE---------------------------ELKVQLQKLVDKGYIRPSVSPWGAPVLFMK
        C A+LAH+   + EKLKPEDVPVV+EFLDVFP++LSGLPPDRE                           ELK+QLQ+LVDKGYIRPSVSPWGAPVLF+K
Subjt:  CEAYLAHVTEAKLEKLKPEDVPVVREFLDVFPEELSGLPPDRE---------------------------ELKVQLQKLVDKGYIRPSVSPWGAPVLFMK

Query:  KKDGTMRLCIDYRQLNKVTVRNKYPLPRIDDLFDQLRGASVFSKIDMRYGYHQLKVRETDIPKTTFRTRYGHHEFLVMPFGLTNAPAVFMDLMNRIFHPY
        KKDGT+RLCIDYRQLNKVT+RNKYPLPRIDDLFDQLRGA++FSKID+R GYHQLKVRE+DI KT FRTRYGH+EF VMPFGLTNAPAVFMDLMNRIFH Y
Subjt:  KKDGTMRLCIDYRQLNKVTVRNKYPLPRIDDLFDQLRGASVFSKIDMRYGYHQLKVRETDIPKTTFRTRYGHHEFLVMPFGLTNAPAVFMDLMNRIFHPY

Query:  VDQF---------------------------TLRGKRLYAKFSKCEFWLEQVMFLGHIVSVAGVCVDPQKTEAITRWEQPKT------------------
        +DQF                           TLR K+LYAKFSKCEFWLEQV+FLGH+VS  GV VDPQK EA+  WE+P +                  
Subjt:  VDQF---------------------------TLRGKRLYAKFSKCEFWLEQVMFLGHIVSVAGVCVDPQKTEAITRWEQPKT------------------

Query:  -------------------------------ELKKRLVTAPILALQETGKEFEVYCDAFHHGLGCVLMQEGRVIAYASRQLRPHEVNYPTHDLELAVVVL
                                       ELKKRLVTAPILAL  TGK++ +YCDA   GLGCVLMQ G VIAYASRQL+ HE NYPTHDLELA VVL
Subjt:  -------------------------------ELKKRLVTAPILALQETGKEFEVYCDAFHHGLGCVLMQEGRVIAYASRQLRPHEVNYPTHDLELAVVVL

Query:  ALKIWRHYLFGERCHIYTDHKSLKYIFDQKELNMRQRRWLELIKDYDCTIEYHPRKANVVADALSRKSRSGVSSLNTIRVTLLKEFKNSAASLKVTGTGT
        ALKIWRHYLFGE+CHI+TDHKSLKYIFDQKELN+RQRRWLELIKDYDCTIEYHP KANVVADALSRKSR   S+L  IRV LL E + S A +    +G+
Subjt:  ALKIWRHYLFGERCHIYTDHKSLKYIFDQKELNMRQRRWLELIKDYDCTIEYHPRKANVVADALSRKSRSGVSSLNTIRVTLLKEFKNSAASLKVTGTGT

Query:  LLAHFQLRPKLVDEVIRKQTKDPVVKKLIEEVKAQQKVDFELRPDGVLLKQDRIYVPRD--------EEAHSSAYAMHPGSTKMYRTLRKFYWW
        LLA FQ+R  LV E++R+Q++D  ++K  E+ K   +V+FELR DG ++KQ R+ VP          EEAHSSAYAMHPGSTKMYRTL+K YWW
Subjt:  LLAHFQLRPKLVDEVIRKQTKDPVVKKLIEEVKAQQKVDFELRPDGVLLKQDRIYVPRD--------EEAHSSAYAMHPGSTKMYRTLRKFYWW

TrEMBL top hitse value%identityAlignment
A0A5A7T1Y5 Reverse transcriptase1.0e-30052.29Show/hide
Query:  PAEAEAWLNQVEKCFRVMHCPKDRKLDLVTFMLQKGAEDWWRLIEHRNFDVGSLIWADFKKSFQEKYYPKSYCYKKRKEFLNLVQGNISVVEYEKKFTEL
        PA+AE WLN +EKCF VM+CP++RK+ L TF+LQK AE WW+ I  R  D  +L W  F+  F++KYYP +YC  KR EFL L QG++SV EYE+K+TEL
Subjt:  PAEAEAWLNQVEKCFRVMHCPKDRKLDLVTFMLQKGAEDWWRLIEHRNFDVGSLIWADFKKSFQEKYYPKSYCYKKRKEFLNLVQGNISVVEYEKKFTEL

Query:  ARYALALVVEEIDKCKRFEEGLRSEIRTPVTISTEWTDFAKLVEAAMRVEKSLAGEGADRGKNRTGRLTVHLGSRLTEVTDDVLHQESLVREGLRPGLLV
        +RYA  ++  E D+C+RFE GLR EIRTPVT   +WT+F++LVE A+RVE+S+  E          +  V L SR T        +E   +    PG+ +
Subjt:  ARYALALVVEEIDKCKRFEEGLRSEIRTPVTISTEWTDFAKLVEAAMRVEKSLAGEGADRGKNRTGRLTVHLGSRLTEVTDDVLHQESLVREGLRPGLLV

Query:  RISLEDHKEEIRRENGGGFLRHLYLRSQF-----------------SQP-SESVASSARKSYCTS-----------------YVNQP-------------
               +++ +  +GG   R++   S F                 SQP  ES+AS+ R+  CTS                    QP             
Subjt:  RISLEDHKEEIRRENGGGFLRHLYLRSQF-----------------SQP-SESVASSARKSYCTS-----------------YVNQP-------------

Query:  -RQEMGAG--------------EGNSGGRQRGHAGRPRQQGQVFAVTQQEAEATSDVVTGMLTICSKNAHVLIDSGATPSFVSREFVVHINRKLEPLPET
         +++ G G              EG SG RQ+G  GRPRQQG+V+A+TQQE E   DV+TG + IC+  A VL D GAT SFVS  F+  +NR LEPL E 
Subjt:  -RQEMGAG--------------EGNSGGRQRGHAGRPRQQGQVFAVTQQEAEATSDVVTGMLTICSKNAHVLIDSGATPSFVSREFVVHINRKLEPLPET

Query:  LLVHTPIGETTIIEHAYLECKIVIDDVVWLDDLLPLVLLEFDVILGLDFLSKYHAKVYCFKKEVRLAKSDGVEVVFKGRRQILPTCVISAVKATKLLSKG
        L ++TP+G+  ++      C+++++ +  L DLLPL L   DVILG+DFL  ++A + C +KEV   K    EVVF+G R+ +   +IS +KA KLL KG
Subjt:  LLVHTPIGETTIIEHAYLECKIVIDDVVWLDDLLPLVLLEFDVILGLDFLSKYHAKVYCFKKEVRLAKSDGVEVVFKGRRQILPTCVISAVKATKLLSKG

Query:  CEAYLAHVTEAKLEKLKPEDVPVVREFLDVFPEELSGLPPDRE---------------------------ELKVQLQKLVDKGYIRPSVSPWGAPVLFMK
        C A+LAH+   + EKLKPEDVPVV+EFLDVFP++LSGLPPDRE                           ELK+QLQ+LVDKGYIRPSVSPWGAPVLF+K
Subjt:  CEAYLAHVTEAKLEKLKPEDVPVVREFLDVFPEELSGLPPDRE---------------------------ELKVQLQKLVDKGYIRPSVSPWGAPVLFMK

Query:  KKDGTMRLCIDYRQLNKVTVRNKYPLPRIDDLFDQLRGASVFSKIDMRYGYHQLKVRETDIPKTTFRTRYGHHEFLVMPFGLTNAPAVFMDLMNRIFHPY
        KKDGT+RLCIDYRQLNKVT+RNKYPLPRIDDLFDQLRGA++FSKID+R GYHQLKVRE+DI KT FRTRYGH+EF VMPFGLTNAPAVFMDLMNRIFH Y
Subjt:  KKDGTMRLCIDYRQLNKVTVRNKYPLPRIDDLFDQLRGASVFSKIDMRYGYHQLKVRETDIPKTTFRTRYGHHEFLVMPFGLTNAPAVFMDLMNRIFHPY

Query:  VDQF---------------------------TLRGKRLYAKFSKCEFWLEQVMFLGHIVSVAGVCVDPQKTEAITRWEQPKT------------------
        +DQF                           TLR K+LYAKFSKCEFWLEQV+FLGH+VS  GV VDPQK EA+  WE+P +                  
Subjt:  VDQF---------------------------TLRGKRLYAKFSKCEFWLEQVMFLGHIVSVAGVCVDPQKTEAITRWEQPKT------------------

Query:  -------------------------------ELKKRLVTAPILALQETGKEFEVYCDAFHHGLGCVLMQEGRVIAYASRQLRPHEVNYPTHDLELAVVVL
                                       ELKKRLVTAPILAL  TGK++ +YCDA   GLGCVLMQ+G VIAYASRQL+ HE NYPTHDLELA VVL
Subjt:  -------------------------------ELKKRLVTAPILALQETGKEFEVYCDAFHHGLGCVLMQEGRVIAYASRQLRPHEVNYPTHDLELAVVVL

Query:  ALKIWRHYLFGERCHIYTDHKSLKYIFDQKELNMRQRRWLELIKDYDCTIEYHPRKANVVADALSRKSRSGVSSLNTIRVTLLKEFKNSAASLKVTGTGT
        ALKIWRHYLFGE+CHI+TDHKSLKYIFDQKELN+RQRRWLELIKDYDCTIEYHP KANVVADALSRKSR   S+L  IRV LL E + S A +    +G+
Subjt:  ALKIWRHYLFGERCHIYTDHKSLKYIFDQKELNMRQRRWLELIKDYDCTIEYHPRKANVVADALSRKSRSGVSSLNTIRVTLLKEFKNSAASLKVTGTGT

Query:  LLAHFQLRPKLVDEVIRKQTKDPVVKKLIEEVKAQQKVDFELRPDGVLLKQDRIYVPRD--------EEAHSSAYAMHPGSTKMYRTLRKFYWW
        LLA FQ+R  LV E++R+Q++D  ++K  E+ K   +V+FELR DG ++KQ R+ VP          EEAHSSAYAMHPGSTKMYRTL+K YWW
Subjt:  LLAHFQLRPKLVDEVIRKQTKDPVVKKLIEEVKAQQKVDFELRPDGVLLKQDRIYVPRD--------EEAHSSAYAMHPGSTKMYRTLRKFYWW

A0A5A7UNA3 Reverse transcriptase1.0e-30052.29Show/hide
Query:  PAEAEAWLNQVEKCFRVMHCPKDRKLDLVTFMLQKGAEDWWRLIEHRNFDVGSLIWADFKKSFQEKYYPKSYCYKKRKEFLNLVQGNISVVEYEKKFTEL
        PA+AE WLN +EKCF VM+CP++RK+ L TF+LQK AE WW+ I  R  D  +L W  F+  F++KYYP +YC  KR EFL L QG++SV EYE+K+TEL
Subjt:  PAEAEAWLNQVEKCFRVMHCPKDRKLDLVTFMLQKGAEDWWRLIEHRNFDVGSLIWADFKKSFQEKYYPKSYCYKKRKEFLNLVQGNISVVEYEKKFTEL

Query:  ARYALALVVEEIDKCKRFEEGLRSEIRTPVTISTEWTDFAKLVEAAMRVEKSLAGEGADRGKNRTGRLTVHLGSRLTEVTDDVLHQESLVREGLRPGLLV
        +RYA  ++  E D+C+RFE GLR EIRTPVT   +WT+F++LVE A+RVE+S+  E          +  V L SR T        +E   +    PG+ +
Subjt:  ARYALALVVEEIDKCKRFEEGLRSEIRTPVTISTEWTDFAKLVEAAMRVEKSLAGEGADRGKNRTGRLTVHLGSRLTEVTDDVLHQESLVREGLRPGLLV

Query:  RISLEDHKEEIRRENGGGFLRHLYLRSQF-----------------SQP-SESVASSARKSYCTS-----------------YVNQP-------------
               +++ +  +GG   R++   S F                 SQP  ES+AS+ R+  CTS                    QP             
Subjt:  RISLEDHKEEIRRENGGGFLRHLYLRSQF-----------------SQP-SESVASSARKSYCTS-----------------YVNQP-------------

Query:  -RQEMGAG--------------EGNSGGRQRGHAGRPRQQGQVFAVTQQEAEATSDVVTGMLTICSKNAHVLIDSGATPSFVSREFVVHINRKLEPLPET
         +++ G G              EG SG RQ+G  GRPRQQG+V+A+TQQE E   DV+TG + IC+  A VL D GAT SFVS  F+  +NR LEPL E 
Subjt:  -RQEMGAG--------------EGNSGGRQRGHAGRPRQQGQVFAVTQQEAEATSDVVTGMLTICSKNAHVLIDSGATPSFVSREFVVHINRKLEPLPET

Query:  LLVHTPIGETTIIEHAYLECKIVIDDVVWLDDLLPLVLLEFDVILGLDFLSKYHAKVYCFKKEVRLAKSDGVEVVFKGRRQILPTCVISAVKATKLLSKG
        L ++TP+G+  ++      C+++++ +  L DLLPL L   DVILG+DFL  ++A + C +KEV   K    EVVF+G R+ +   +IS +KA KLL KG
Subjt:  LLVHTPIGETTIIEHAYLECKIVIDDVVWLDDLLPLVLLEFDVILGLDFLSKYHAKVYCFKKEVRLAKSDGVEVVFKGRRQILPTCVISAVKATKLLSKG

Query:  CEAYLAHVTEAKLEKLKPEDVPVVREFLDVFPEELSGLPPDRE---------------------------ELKVQLQKLVDKGYIRPSVSPWGAPVLFMK
        C A+LAH+   + EKLKPEDVPVV+EFLDVFP++LSGLPPDRE                           ELK+QLQ+LVDKGYIRPSVSPWGAPVLF+K
Subjt:  CEAYLAHVTEAKLEKLKPEDVPVVREFLDVFPEELSGLPPDRE---------------------------ELKVQLQKLVDKGYIRPSVSPWGAPVLFMK

Query:  KKDGTMRLCIDYRQLNKVTVRNKYPLPRIDDLFDQLRGASVFSKIDMRYGYHQLKVRETDIPKTTFRTRYGHHEFLVMPFGLTNAPAVFMDLMNRIFHPY
        KKDGT+RLCIDYRQLNKVT+RNKYPLPRIDDLFDQLRGA++FSKID+R GYHQLKVRE+DI KT FRTRYGH+EF VMPFGLTNAPAVFMDLMNRIFH Y
Subjt:  KKDGTMRLCIDYRQLNKVTVRNKYPLPRIDDLFDQLRGASVFSKIDMRYGYHQLKVRETDIPKTTFRTRYGHHEFLVMPFGLTNAPAVFMDLMNRIFHPY

Query:  VDQF---------------------------TLRGKRLYAKFSKCEFWLEQVMFLGHIVSVAGVCVDPQKTEAITRWEQPKT------------------
        +DQF                           TLR K+LYAKFSKCEFWLEQV+FLGH+VS  GV VDPQK EA+  WE+P +                  
Subjt:  VDQF---------------------------TLRGKRLYAKFSKCEFWLEQVMFLGHIVSVAGVCVDPQKTEAITRWEQPKT------------------

Query:  -------------------------------ELKKRLVTAPILALQETGKEFEVYCDAFHHGLGCVLMQEGRVIAYASRQLRPHEVNYPTHDLELAVVVL
                                       ELKKRLVTAPILAL  TGK++ +YCDA   GLGCVLMQ+G VIAYASRQL+ HE NYPTHDLELA VVL
Subjt:  -------------------------------ELKKRLVTAPILALQETGKEFEVYCDAFHHGLGCVLMQEGRVIAYASRQLRPHEVNYPTHDLELAVVVL

Query:  ALKIWRHYLFGERCHIYTDHKSLKYIFDQKELNMRQRRWLELIKDYDCTIEYHPRKANVVADALSRKSRSGVSSLNTIRVTLLKEFKNSAASLKVTGTGT
        ALKIWRHYLFGE+CHI+TDHKSLKYIFDQKELN+RQRRWLELIKDYDCTIEYHP KANVVADALSRKSR   S+L  IRV LL E + S A +    +G+
Subjt:  ALKIWRHYLFGERCHIYTDHKSLKYIFDQKELNMRQRRWLELIKDYDCTIEYHPRKANVVADALSRKSRSGVSSLNTIRVTLLKEFKNSAASLKVTGTGT

Query:  LLAHFQLRPKLVDEVIRKQTKDPVVKKLIEEVKAQQKVDFELRPDGVLLKQDRIYVPRD--------EEAHSSAYAMHPGSTKMYRTLRKFYWW
        LLA FQ+R  LV E++R+Q++D  ++K  E+ K   +V+FELR DG ++KQ R+ VP          EEAHSSAYAMHPGSTKMYRTL+K YWW
Subjt:  LLAHFQLRPKLVDEVIRKQTKDPVVKKLIEEVKAQQKVDFELRPDGVLLKQDRIYVPRD--------EEAHSSAYAMHPGSTKMYRTLRKFYWW

A0A5A7VNK4 Reverse transcriptase1.0e-30052.29Show/hide
Query:  PAEAEAWLNQVEKCFRVMHCPKDRKLDLVTFMLQKGAEDWWRLIEHRNFDVGSLIWADFKKSFQEKYYPKSYCYKKRKEFLNLVQGNISVVEYEKKFTEL
        PA+AE WLN +EKCF VM+CP++RK+ L TF+LQK AE WW+ I  R  D  +L W  F+  F++KYYP +YC  KR EFL L QG++SV EYE+K+TEL
Subjt:  PAEAEAWLNQVEKCFRVMHCPKDRKLDLVTFMLQKGAEDWWRLIEHRNFDVGSLIWADFKKSFQEKYYPKSYCYKKRKEFLNLVQGNISVVEYEKKFTEL

Query:  ARYALALVVEEIDKCKRFEEGLRSEIRTPVTISTEWTDFAKLVEAAMRVEKSLAGEGADRGKNRTGRLTVHLGSRLTEVTDDVLHQESLVREGLRPGLLV
        +RYA  ++  E D+C+RFE GLR EIRTPVT   +WT+F++LVE A+RVE+S+  E          +  V L SR T        +E   +    PG+ +
Subjt:  ARYALALVVEEIDKCKRFEEGLRSEIRTPVTISTEWTDFAKLVEAAMRVEKSLAGEGADRGKNRTGRLTVHLGSRLTEVTDDVLHQESLVREGLRPGLLV

Query:  RISLEDHKEEIRRENGGGFLRHLYLRSQF-----------------SQP-SESVASSARKSYCTS-----------------YVNQP-------------
               +++ +  +GG   R++   S F                 SQP  ES+AS+ R+  CTS                    QP             
Subjt:  RISLEDHKEEIRRENGGGFLRHLYLRSQF-----------------SQP-SESVASSARKSYCTS-----------------YVNQP-------------

Query:  -RQEMGAG--------------EGNSGGRQRGHAGRPRQQGQVFAVTQQEAEATSDVVTGMLTICSKNAHVLIDSGATPSFVSREFVVHINRKLEPLPET
         +++ G G              EG SG RQ+G  GRPRQQG+V+A+TQQE E   DV+TG + IC+  A VL D GAT SFVS  F+  +NR LEPL E 
Subjt:  -RQEMGAG--------------EGNSGGRQRGHAGRPRQQGQVFAVTQQEAEATSDVVTGMLTICSKNAHVLIDSGATPSFVSREFVVHINRKLEPLPET

Query:  LLVHTPIGETTIIEHAYLECKIVIDDVVWLDDLLPLVLLEFDVILGLDFLSKYHAKVYCFKKEVRLAKSDGVEVVFKGRRQILPTCVISAVKATKLLSKG
        L ++TP+G+  ++      C+++++ +  L DLLPL L   DVILG+DFL  ++A + C +KEV   K    EVVF+G R+ +   +IS +KA KLL KG
Subjt:  LLVHTPIGETTIIEHAYLECKIVIDDVVWLDDLLPLVLLEFDVILGLDFLSKYHAKVYCFKKEVRLAKSDGVEVVFKGRRQILPTCVISAVKATKLLSKG

Query:  CEAYLAHVTEAKLEKLKPEDVPVVREFLDVFPEELSGLPPDRE---------------------------ELKVQLQKLVDKGYIRPSVSPWGAPVLFMK
        C A+LAH+   + EKLKPEDVPVV+EFLDVFP++LSGLPPDRE                           ELK+QLQ+LVDKGYIRPSVSPWGAPVLF+K
Subjt:  CEAYLAHVTEAKLEKLKPEDVPVVREFLDVFPEELSGLPPDRE---------------------------ELKVQLQKLVDKGYIRPSVSPWGAPVLFMK

Query:  KKDGTMRLCIDYRQLNKVTVRNKYPLPRIDDLFDQLRGASVFSKIDMRYGYHQLKVRETDIPKTTFRTRYGHHEFLVMPFGLTNAPAVFMDLMNRIFHPY
        KKDGT+RLCIDYRQLNKVT+RNKYPLPRIDDLFDQLRGA++FSKID+R GYHQLKVRE+DI KT FRTRYGH+EF VMPFGLTNAPAVFMDLMNRIFH Y
Subjt:  KKDGTMRLCIDYRQLNKVTVRNKYPLPRIDDLFDQLRGASVFSKIDMRYGYHQLKVRETDIPKTTFRTRYGHHEFLVMPFGLTNAPAVFMDLMNRIFHPY

Query:  VDQF---------------------------TLRGKRLYAKFSKCEFWLEQVMFLGHIVSVAGVCVDPQKTEAITRWEQPKT------------------
        +DQF                           TLR K+LYAKFSKCEFWLEQV+FLGH+VS  GV VDPQK EA+  WE+P +                  
Subjt:  VDQF---------------------------TLRGKRLYAKFSKCEFWLEQVMFLGHIVSVAGVCVDPQKTEAITRWEQPKT------------------

Query:  -------------------------------ELKKRLVTAPILALQETGKEFEVYCDAFHHGLGCVLMQEGRVIAYASRQLRPHEVNYPTHDLELAVVVL
                                       ELKKRLVTAPILAL  TGK++ +YCDA   GLGCVLMQ+G VIAYASRQL+ HE NYPTHDLELA VVL
Subjt:  -------------------------------ELKKRLVTAPILALQETGKEFEVYCDAFHHGLGCVLMQEGRVIAYASRQLRPHEVNYPTHDLELAVVVL

Query:  ALKIWRHYLFGERCHIYTDHKSLKYIFDQKELNMRQRRWLELIKDYDCTIEYHPRKANVVADALSRKSRSGVSSLNTIRVTLLKEFKNSAASLKVTGTGT
        ALKIWRHYLFGE+CHI+TDHKSLKYIFDQKELN+RQRRWLELIKDYDCTIEYHP KANVVADALSRKSR   S+L  IRV LL E + S A +    +G+
Subjt:  ALKIWRHYLFGERCHIYTDHKSLKYIFDQKELNMRQRRWLELIKDYDCTIEYHPRKANVVADALSRKSRSGVSSLNTIRVTLLKEFKNSAASLKVTGTGT

Query:  LLAHFQLRPKLVDEVIRKQTKDPVVKKLIEEVKAQQKVDFELRPDGVLLKQDRIYVPRD--------EEAHSSAYAMHPGSTKMYRTLRKFYWW
        LLA FQ+R  LV E++R+Q++D  ++K  E+ K   +V+FELR DG ++KQ R+ VP          EEAHSSAYAMHPGSTKMYRTL+K YWW
Subjt:  LLAHFQLRPKLVDEVIRKQTKDPVVKKLIEEVKAQQKVDFELRPDGVLLKQDRIYVPRD--------EEAHSSAYAMHPGSTKMYRTLRKFYWW

A0A5D3BHI1 Reverse transcriptase1.0e-30052.29Show/hide
Query:  PAEAEAWLNQVEKCFRVMHCPKDRKLDLVTFMLQKGAEDWWRLIEHRNFDVGSLIWADFKKSFQEKYYPKSYCYKKRKEFLNLVQGNISVVEYEKKFTEL
        PA+AE WLN +EKCF VM+CP++RK+ L TF+LQK AE WW+ I  R  D  +L W  F+  F++KYYP +YC  KR EFL L QG++SV EYE+K+TEL
Subjt:  PAEAEAWLNQVEKCFRVMHCPKDRKLDLVTFMLQKGAEDWWRLIEHRNFDVGSLIWADFKKSFQEKYYPKSYCYKKRKEFLNLVQGNISVVEYEKKFTEL

Query:  ARYALALVVEEIDKCKRFEEGLRSEIRTPVTISTEWTDFAKLVEAAMRVEKSLAGEGADRGKNRTGRLTVHLGSRLTEVTDDVLHQESLVREGLRPGLLV
        +RYA  ++  E D+C+RFE GLR EIRTPVT   +WT+F++LVE A+RVE+S+  E          +  V L SR T        +E   +    PG+ +
Subjt:  ARYALALVVEEIDKCKRFEEGLRSEIRTPVTISTEWTDFAKLVEAAMRVEKSLAGEGADRGKNRTGRLTVHLGSRLTEVTDDVLHQESLVREGLRPGLLV

Query:  RISLEDHKEEIRRENGGGFLRHLYLRSQF-----------------SQP-SESVASSARKSYCTS-----------------YVNQP-------------
               +++ +  +GG   R++   S F                 SQP  ES+AS+ R+  CTS                    QP             
Subjt:  RISLEDHKEEIRRENGGGFLRHLYLRSQF-----------------SQP-SESVASSARKSYCTS-----------------YVNQP-------------

Query:  -RQEMGAG--------------EGNSGGRQRGHAGRPRQQGQVFAVTQQEAEATSDVVTGMLTICSKNAHVLIDSGATPSFVSREFVVHINRKLEPLPET
         +++ G G              EG SG RQ+G  GRPRQQG+V+A+TQQE E   DV+TG + IC+  A VL D GAT SFVS  F+  +NR LEPL E 
Subjt:  -RQEMGAG--------------EGNSGGRQRGHAGRPRQQGQVFAVTQQEAEATSDVVTGMLTICSKNAHVLIDSGATPSFVSREFVVHINRKLEPLPET

Query:  LLVHTPIGETTIIEHAYLECKIVIDDVVWLDDLLPLVLLEFDVILGLDFLSKYHAKVYCFKKEVRLAKSDGVEVVFKGRRQILPTCVISAVKATKLLSKG
        L ++TP+G+  ++      C+++++ +  L DLLPL L   DVILG+DFL  ++A + C +KEV   K    EVVF+G R+ +   +IS +KA KLL KG
Subjt:  LLVHTPIGETTIIEHAYLECKIVIDDVVWLDDLLPLVLLEFDVILGLDFLSKYHAKVYCFKKEVRLAKSDGVEVVFKGRRQILPTCVISAVKATKLLSKG

Query:  CEAYLAHVTEAKLEKLKPEDVPVVREFLDVFPEELSGLPPDRE---------------------------ELKVQLQKLVDKGYIRPSVSPWGAPVLFMK
        C A+LAH+   + EKLKPEDVPVV+EFLDVFP++LSGLPPDRE                           ELK+QLQ+LVDKGYIRPSVSPWGAPVLF+K
Subjt:  CEAYLAHVTEAKLEKLKPEDVPVVREFLDVFPEELSGLPPDRE---------------------------ELKVQLQKLVDKGYIRPSVSPWGAPVLFMK

Query:  KKDGTMRLCIDYRQLNKVTVRNKYPLPRIDDLFDQLRGASVFSKIDMRYGYHQLKVRETDIPKTTFRTRYGHHEFLVMPFGLTNAPAVFMDLMNRIFHPY
        KKDGT+RLCIDYRQLNKVT+RNKYPLPRIDDLFDQLRGA++FSKID+R GYHQLKVRE+DI KT FRTRYGH+EF VMPFGLTNAPAVFMDLMNRIFH Y
Subjt:  KKDGTMRLCIDYRQLNKVTVRNKYPLPRIDDLFDQLRGASVFSKIDMRYGYHQLKVRETDIPKTTFRTRYGHHEFLVMPFGLTNAPAVFMDLMNRIFHPY

Query:  VDQF---------------------------TLRGKRLYAKFSKCEFWLEQVMFLGHIVSVAGVCVDPQKTEAITRWEQPKT------------------
        +DQF                           TLR K+LYAKFSKCEFWLEQV+FLGH+VS  GV VDPQK EA+  WE+P +                  
Subjt:  VDQF---------------------------TLRGKRLYAKFSKCEFWLEQVMFLGHIVSVAGVCVDPQKTEAITRWEQPKT------------------

Query:  -------------------------------ELKKRLVTAPILALQETGKEFEVYCDAFHHGLGCVLMQEGRVIAYASRQLRPHEVNYPTHDLELAVVVL
                                       ELKKRLVTAPILAL  TGK++ +YCDA   GLGCVLMQ+G VIAYASRQL+ HE NYPTHDLELA VVL
Subjt:  -------------------------------ELKKRLVTAPILALQETGKEFEVYCDAFHHGLGCVLMQEGRVIAYASRQLRPHEVNYPTHDLELAVVVL

Query:  ALKIWRHYLFGERCHIYTDHKSLKYIFDQKELNMRQRRWLELIKDYDCTIEYHPRKANVVADALSRKSRSGVSSLNTIRVTLLKEFKNSAASLKVTGTGT
        ALKIWRHYLFGE+CHI+TDHKSLKYIFDQKELN+RQRRWLELIKDYDCTIEYHP KANVVADALSRKSR   S+L  IRV LL E + S A +    +G+
Subjt:  ALKIWRHYLFGERCHIYTDHKSLKYIFDQKELNMRQRRWLELIKDYDCTIEYHPRKANVVADALSRKSRSGVSSLNTIRVTLLKEFKNSAASLKVTGTGT

Query:  LLAHFQLRPKLVDEVIRKQTKDPVVKKLIEEVKAQQKVDFELRPDGVLLKQDRIYVPRD--------EEAHSSAYAMHPGSTKMYRTLRKFYWW
        LLA FQ+R  LV E++R+Q++D  ++K  E+ K   +V+FELR DG ++KQ R+ VP          EEAHSSAYAMHPGSTKMYRTL+K YWW
Subjt:  LLAHFQLRPKLVDEVIRKQTKDPVVKKLIEEVKAQQKVDFELRPDGVLLKQDRIYVPRD--------EEAHSSAYAMHPGSTKMYRTLRKFYWW

A0A5D3BS67 Reverse transcriptase1.8e-30052.29Show/hide
Query:  PAEAEAWLNQVEKCFRVMHCPKDRKLDLVTFMLQKGAEDWWRLIEHRNFDVGSLIWADFKKSFQEKYYPKSYCYKKRKEFLNLVQGNISVVEYEKKFTEL
        PA+AE WLN +EKCF VM+CP++RK+ L TF+LQK AE WW+ I  R  D  +L W  F+  F++KYYP +YC  KR EFL L QG++SV EYE+K+TEL
Subjt:  PAEAEAWLNQVEKCFRVMHCPKDRKLDLVTFMLQKGAEDWWRLIEHRNFDVGSLIWADFKKSFQEKYYPKSYCYKKRKEFLNLVQGNISVVEYEKKFTEL

Query:  ARYALALVVEEIDKCKRFEEGLRSEIRTPVTISTEWTDFAKLVEAAMRVEKSLAGEGADRGKNRTGRLTVHLGSRLTEVTDDVLHQESLVREGLRPGLLV
        +RYA  ++  E D+C+RFE GLR EIRTPVT   +WT+F++LVE A+RVE+S+  E          +  V L SR T        +E   +    PG+ +
Subjt:  ARYALALVVEEIDKCKRFEEGLRSEIRTPVTISTEWTDFAKLVEAAMRVEKSLAGEGADRGKNRTGRLTVHLGSRLTEVTDDVLHQESLVREGLRPGLLV

Query:  RISLEDHKEEIRRENGGGFLRHLYLRSQF-----------------SQP-SESVASSARKSYCTS-----------------YVNQP-------------
               +++ +  +GG   R++   S F                 SQP  ES+AS+ R+  CTS                    QP             
Subjt:  RISLEDHKEEIRRENGGGFLRHLYLRSQF-----------------SQP-SESVASSARKSYCTS-----------------YVNQP-------------

Query:  -RQEMGAG--------------EGNSGGRQRGHAGRPRQQGQVFAVTQQEAEATSDVVTGMLTICSKNAHVLIDSGATPSFVSREFVVHINRKLEPLPET
         +++ G G              EG SG RQ+G  GRPRQQG+V+A+TQQE E   DV+TG + IC+  A VL D GAT SFVS  F+  +NR LEPL E 
Subjt:  -RQEMGAG--------------EGNSGGRQRGHAGRPRQQGQVFAVTQQEAEATSDVVTGMLTICSKNAHVLIDSGATPSFVSREFVVHINRKLEPLPET

Query:  LLVHTPIGETTIIEHAYLECKIVIDDVVWLDDLLPLVLLEFDVILGLDFLSKYHAKVYCFKKEVRLAKSDGVEVVFKGRRQILPTCVISAVKATKLLSKG
        L ++TP+G+  ++      C+++++ +  L DLLPL L   DVILG+DFL  ++A + C +KEV   K    EVVF+G R+ +   +IS +KA KLL KG
Subjt:  LLVHTPIGETTIIEHAYLECKIVIDDVVWLDDLLPLVLLEFDVILGLDFLSKYHAKVYCFKKEVRLAKSDGVEVVFKGRRQILPTCVISAVKATKLLSKG

Query:  CEAYLAHVTEAKLEKLKPEDVPVVREFLDVFPEELSGLPPDRE---------------------------ELKVQLQKLVDKGYIRPSVSPWGAPVLFMK
        C A+LAH+   + EKLKPEDVPVV+EFLDVFP++LSGLPPDRE                           ELK+QLQ+LVDKGYIRPSVSPWGAPVLF+K
Subjt:  CEAYLAHVTEAKLEKLKPEDVPVVREFLDVFPEELSGLPPDRE---------------------------ELKVQLQKLVDKGYIRPSVSPWGAPVLFMK

Query:  KKDGTMRLCIDYRQLNKVTVRNKYPLPRIDDLFDQLRGASVFSKIDMRYGYHQLKVRETDIPKTTFRTRYGHHEFLVMPFGLTNAPAVFMDLMNRIFHPY
        KKDGT+RLCIDYRQLNKVT+RNKYPLPRIDDLFDQLRGA++FSKID+R GYHQLKVRE+DI KT FRTRYGH+EF VMPFGLTNAPAVFMDLMNRIFH Y
Subjt:  KKDGTMRLCIDYRQLNKVTVRNKYPLPRIDDLFDQLRGASVFSKIDMRYGYHQLKVRETDIPKTTFRTRYGHHEFLVMPFGLTNAPAVFMDLMNRIFHPY

Query:  VDQF---------------------------TLRGKRLYAKFSKCEFWLEQVMFLGHIVSVAGVCVDPQKTEAITRWEQPKT------------------
        +DQF                           TLR K+LYAKFSKCEFWLEQV+FLGH+VS  GV VDPQK EA+  WE+P +                  
Subjt:  VDQF---------------------------TLRGKRLYAKFSKCEFWLEQVMFLGHIVSVAGVCVDPQKTEAITRWEQPKT------------------

Query:  -------------------------------ELKKRLVTAPILALQETGKEFEVYCDAFHHGLGCVLMQEGRVIAYASRQLRPHEVNYPTHDLELAVVVL
                                       ELKKRLVTAPILAL  TGK++ +YCDA   GLGCVLMQ G VIAYASRQL+ HE NYPTHDLELA VVL
Subjt:  -------------------------------ELKKRLVTAPILALQETGKEFEVYCDAFHHGLGCVLMQEGRVIAYASRQLRPHEVNYPTHDLELAVVVL

Query:  ALKIWRHYLFGERCHIYTDHKSLKYIFDQKELNMRQRRWLELIKDYDCTIEYHPRKANVVADALSRKSRSGVSSLNTIRVTLLKEFKNSAASLKVTGTGT
        ALKIWRHYLFGE+CHI+TDHKSLKYIFDQKELN+RQRRWLELIKDYDCTIEYHP KANVVADALSRKSR   S+L  IRV LL E + S A +    +G+
Subjt:  ALKIWRHYLFGERCHIYTDHKSLKYIFDQKELNMRQRRWLELIKDYDCTIEYHPRKANVVADALSRKSRSGVSSLNTIRVTLLKEFKNSAASLKVTGTGT

Query:  LLAHFQLRPKLVDEVIRKQTKDPVVKKLIEEVKAQQKVDFELRPDGVLLKQDRIYVPRD--------EEAHSSAYAMHPGSTKMYRTLRKFYWW
        LLA FQ+R  LV E++R+Q++D  ++K  E+ K   +V+FELR DG ++KQ R+ VP          EEAHSSAYAMHPGSTKMYRTL+K YWW
Subjt:  LLAHFQLRPKLVDEVIRKQTKDPVVKKLIEEVKAQQKVDFELRPDGVLLKQDRIYVPRD--------EEAHSSAYAMHPGSTKMYRTLRKFYWW

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.61.8e-5531.27Show/hide
Query:  EELKVQLQKLVDKGYIRPSVSPWGAPVLFMKKKDGT-----MRLCIDYRQLNKVTVRNKYPLPRIDDLFDQLRGASVFSKIDMRYGYHQLKVRETDIPKT
        +E++ Q+Q ++++G IR S SP+ +P+  + KK         R+ IDYR+LN++TV +++P+P +D++  +L   + F+ ID+  G+HQ+++    + KT
Subjt:  EELKVQLQKLVDKGYIRPSVSPWGAPVLFMKKKDGT-----MRLCIDYRQLNKVTVRNKYPLPRIDDLFDQLRGASVFSKIDMRYGYHQLKVRETDIPKT

Query:  TFRTRYGHHEFLVMPFGLTNAPAVFMDLMNRIFHP--------YVDQF-------------------TLRGKRLYAKFSKCEFWLEQVMFLGHIVSVAGV
         F T++GH+E+L MPFGL NAPA F   MN I  P        Y+D                      L    L  +  KCEF  ++  FLGH+++  G+
Subjt:  TFRTRYGHHEFLVMPFGLTNAPAVFMDLMNRIFHP--------YVDQF-------------------TLRGKRLYAKFSKCEFWLEQVMFLGHIVSVAGV

Query:  CVDPQKTEAITRWEQPK--------------------------------------------------TELKKRLVTAPILALQETGKEFEVYCDAFHHGL
          +P+K EAI ++  P                                                    +LK  +   PIL + +  K+F +  DA    L
Subjt:  CVDPQKTEAITRWEQPK--------------------------------------------------TELKKRLVTAPILALQETGKEFEVYCDAFHHGL

Query:  GCVLMQEGRVIAYASRQLRPHEVNYPTHDLELAVVVLALKIWRHYLFGERCHIYTDHKSLKYIFDQKELNMRQRRWLELIKDYDCTIEYHPRKANVVADA
        G VL Q+G  ++Y SR L  HE+NY T + EL  +V A K +RHYL G    I +DH+ L +++  K+ N +  RW   + ++D  I+Y   K N VADA
Subjt:  GCVLMQEGRVIAYASRQLRPHEVNYPTHDLELAVVVLALKIWRHYLFGERCHIYTDHKSLKYIFDQKELNMRQRRWLELIKDYDCTIEYHPRKANVVADA

Query:  LSR
        LSR
Subjt:  LSR

P20825 Retrovirus-related Pol polyprotein from transposon 2973.3e-5433.08Show/hide
Query:  ELKVQLQKLVDKGYIRPSVSPWGAPVLFMKKKD-----GTMRLCIDYRQLNKVTVRNKYPLPRIDDLFDQLRGASVFSKIDMRYGYHQLKVRETDIPKTT
        E++ Q+Q+++++G IR S SP+ +P   + KK         R+ IDYR+LN++T+ ++YP+P +D++  +L     F+ ID+  G+HQ+++ E  I KT 
Subjt:  ELKVQLQKLVDKGYIRPSVSPWGAPVLFMKKKD-----GTMRLCIDYRQLNKVTVRNKYPLPRIDDLFDQLRGASVFSKIDMRYGYHQLKVRETDIPKTT

Query:  FRTRYGHHEFLVMPFGLTNAPAVFMDLMNRIFHP--------YVDQ------------------FT-LRGKRLYAKFSKCEFWLEQVMFLGHIVSVAGVC
        F T+ GH+E+L MPFGL NAPA F   MN I  P        Y+D                   FT L    L  +  KCEF  ++  FLGHIV+  G+ 
Subjt:  FRTRYGHHEFLVMPFGLTNAPAVFMDLMNRIFHP--------YVDQ------------------FT-LRGKRLYAKFSKCEFWLEQVMFLGHIVSVAGVC

Query:  VDPQKTEAITRWEQPKTE------------------------------LKKR--------------------LVTAPILALQETGKEFEVYCDAFHHGLG
         +P K +AI  +  P  +                              LKKR                    ++  PIL L +  K+F +  DA +  LG
Subjt:  VDPQKTEAITRWEQPKTE------------------------------LKKR--------------------LVTAPILALQETGKEFEVYCDAFHHGLG

Query:  CVLMQEGRVIAYASRQLRPHEVNYPTHDLELAVVVLALKIWRHYLFGERCHIYTDHKSLKYIFDQKELNMRQRRWLELIKDYDCTIEYHPRKANVVADAL
         VL Q G  I++ SR L  HE+NY   + EL  +V A K +RHYL G +  I +DH+ L+++ + KE   +  RW   + +Y   I+Y   K N VADAL
Subjt:  CVLMQEGRVIAYASRQLRPHEVNYPTHDLELAVVVLALKIWRHYLFGERCHIYTDHKSLKYIFDQKELNMRQRRWLELIKDYDCTIEYHPRKANVVADAL

Query:  SR
        SR
Subjt:  SR

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein2.6e-5131.67Show/hide
Query:  DREELKVQLQKLVDKGYIRPSVSPWGAPVLFMKKKDGTMRLCIDYRQLNKVTVRNKYPLPRIDDLFDQLRGASVFSKIDMRYGYHQLKVRETDIPKTTFR
        + +E+   +QKL+D  +I PS SP  +PV+ + KKDGT RLC+DYR LNK T+ + +PLPRID+L  ++  A +F+ +D+  GYHQ+ +   D  KT F 
Subjt:  DREELKVQLQKLVDKGYIRPSVSPWGAPVLFMKKKDGTMRLCIDYRQLNKVTVRNKYPLPRIDDLFDQLRGASVFSKIDMRYGYHQLKVRETDIPKTTFR

Query:  TRYGHHEFLVMPFGLTNAPAVFMDLMN------RIFHPYVDQF-------------------TLRGKRLYAKFSKCEFWLEQVMFLGHIVSVAGVCVDPQ
        T  G +E+ VMPFGL NAP+ F   M       R  + Y+D                      L+ + L  K  KC+F  E+  FLG+ + +  +     
Subjt:  TRYGHHEFLVMPFGLTNAPAVFMDLMN------RIFHPYVDQF-------------------TLRGKRLYAKFSKCEFWLEQVMFLGHIVSVAGVCVDPQ

Query:  KTEAITRWEQPKT-----------------------------------------------ELKKRLVTAPILALQETGKEFEVYCDAFHHGLGCVLMQEG
        K  AI  +  PKT                                               +LK  L  +P+L        + +  DA   G+G VL +  
Subjt:  KTEAITRWEQPKT-----------------------------------------------ELKKRLVTAPILALQETGKEFEVYCDAFHHGLGCVLMQEG

Query:  R------VIAYASRQLRPHEVNYPTHDLELAVVVLALKIWRHYLFGERCHIYTDHKSLKYIFDQKELNMRQRRWLELIKDYDCTIEYHPRKANVVADALS
               V+ Y S+ L   + NYP  +LEL  ++ AL  +R+ L G+   + TDH SL  + ++ E   R +RWL+ +  YD T+EY     NVVADA+S
Subjt:  R------VIAYASRQLRPHEVNYPTHDLELAVVVLALKIWRHYLFGERCHIYTDHKSLKYIFDQKELNMRQRRWLELIKDYDCTIEYHPRKANVVADALS

Query:  R
        R
Subjt:  R

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus7.1e-4930.07Show/hide
Query:  REELKVQLQKLVDKGYIRPSVSPWGAPVLFMKKK-----DGTMRLCIDYRQLNKVTVRNKYPLPRIDDLFDQLRGASVFSKIDMRYGYHQLKVRETDIPK
        R E++ Q+ +L+  G IRPS SP+ +P+  + KK     +   R+ +D+++LN VT+ + YP+P I+     L  A  F+ +D+  G+HQ+ ++E+DIPK
Subjt:  REELKVQLQKLVDKGYIRPSVSPWGAPVLFMKKK-----DGTMRLCIDYRQLNKVTVRNKYPLPRIDDLFDQLRGASVFSKIDMRYGYHQLKVRETDIPK

Query:  TTFRTRYGHHEFLVMPFGLTNAPAVFMDLMN--------RIFHPYVDQF-------------------TLRGKRLYAKFSKCEFWLEQVMFLGHIVSVAG
        T F T  G +EFL +PFGL NAPA+F  +++        ++ + Y+D                     +L    L     K  F   QV FLG+IV+  G
Subjt:  TTFRTRYGHHEFLVMPFGLTNAPAVFMDLMN--------RIFHPYVDQF-------------------TLRGKRLYAKFSKCEFWLEQVMFLGHIVSVAG

Query:  VCVDPQKTEAITRWEQPKT------------------------------------------------------------ELKKRLVTAPILALQETGKEF
        +  DP+K  AI+    P +                                                            +LK  L ++ ILA     K F
Subjt:  VCVDPQKTEAITRWEQPKT------------------------------------------------------------ELKKRLVTAPILALQETGKEF

Query:  EVYCDAFHHGLGCVLMQE----GRVIAYASRQLRPHEVNYPTHDLELAVVVLALKIWRHYLFGE-RCHIYTDHKSLKYIFDQKELNMRQRRWLELIKDYD
         +  DA +  +G VL Q+     R IAY SR L   E NY T + E+  ++ +L   R YL+G     +YTDH+ L +    +  N + +RW   I++Y+
Subjt:  EVYCDAFHHGLGCVLMQE----GRVIAYASRQLRPHEVNYPTHDLELAVVVLALKIWRHYLFGE-RCHIYTDHKSLKYIFDQKELNMRQRRWLELIKDYD

Query:  CTIEYHPRKANVVADALSR
        C + Y P K+NVVADALSR
Subjt:  CTIEYHPRKANVVADALSR

Q99315 Transposon Ty3-G Gag-Pol polyprotein2.6e-5131.67Show/hide
Query:  DREELKVQLQKLVDKGYIRPSVSPWGAPVLFMKKKDGTMRLCIDYRQLNKVTVRNKYPLPRIDDLFDQLRGASVFSKIDMRYGYHQLKVRETDIPKTTFR
        + +E+   +QKL+D  +I PS SP  +PV+ + KKDGT RLC+DYR LNK T+ + +PLPRID+L  ++  A +F+ +D+  GYHQ+ +   D  KT F 
Subjt:  DREELKVQLQKLVDKGYIRPSVSPWGAPVLFMKKKDGTMRLCIDYRQLNKVTVRNKYPLPRIDDLFDQLRGASVFSKIDMRYGYHQLKVRETDIPKTTFR

Query:  TRYGHHEFLVMPFGLTNAPAVFMDLMN------RIFHPYVDQF-------------------TLRGKRLYAKFSKCEFWLEQVMFLGHIVSVAGVCVDPQ
        T  G +E+ VMPFGL NAP+ F   M       R  + Y+D                      L+ + L  K  KC+F  E+  FLG+ + +  +     
Subjt:  TRYGHHEFLVMPFGLTNAPAVFMDLMN------RIFHPYVDQF-------------------TLRGKRLYAKFSKCEFWLEQVMFLGHIVSVAGVCVDPQ

Query:  KTEAITRWEQPKT-----------------------------------------------ELKKRLVTAPILALQETGKEFEVYCDAFHHGLGCVLMQEG
        K  AI  +  PKT                                               +LK  L  +P+L        + +  DA   G+G VL +  
Subjt:  KTEAITRWEQPKT-----------------------------------------------ELKKRLVTAPILALQETGKEFEVYCDAFHHGLGCVLMQEG

Query:  R------VIAYASRQLRPHEVNYPTHDLELAVVVLALKIWRHYLFGERCHIYTDHKSLKYIFDQKELNMRQRRWLELIKDYDCTIEYHPRKANVVADALS
               V+ Y S+ L   + NYP  +LEL  ++ AL  +R+ L G+   + TDH SL  + ++ E   R +RWL+ +  YD T+EY     NVVADA+S
Subjt:  R------VIAYASRQLRPHEVNYPTHDLELAVVVLALKIWRHYLFGERCHIYTDHKSLKYIFDQKELNMRQRRWLELIKDYDCTIEYHPRKANVVADALS

Query:  R
        R
Subjt:  R

Arabidopsis top hitse value%identityAlignment
ATCG00340.1 Photosystem I, PsaA/PsaB protein6.9e-0740.32Show/hide
Query:  FKLWRASGITSELQLYCTAIGALVFAALMLFAGWFHYH-KAAPKLAWFQDVESMLNHHLAGL
        ++ W   G+ +   LY  A+  L  +AL L  GW H   K  P+++WF++ ES LNHHL+GL
Subjt:  FKLWRASGITSELQLYCTAIGALVFAALMLFAGWFHYH-KAAPKLAWFQDVESMLNHHLAGL

ATCG00350.1 Photosystem I, PsaA/PsaB protein3.8e-2996.72Show/hide
Query:  FKLWRASGITSELQLYCTAIGALVFAALMLFAGWFHYHKAAPKLAWFQDVESMLNHHLAGL
        F++WRASGITSELQLYCTAIGALVFAALMLFAGWFHYHKAAPKLAWFQDVESMLNHHLAGL
Subjt:  FKLWRASGITSELQLYCTAIGALVFAALMLFAGWFHYHKAAPKLAWFQDVESMLNHHLAGL

ATMG00860.1 DNA/RNA polymerases superfamily protein4.2e-0445.28Show/hide
Query:  RLYAKFSKCEFWLEQVMFLG--HIVSVAGVCVDPQKTEAITRWEQPK--TELK
        + YA   KC F   Q+ +LG  HI+S  GV  DP K EA+  W +PK  TEL+
Subjt:  RLYAKFSKCEFWLEQVMFLG--HIVSVAGVCVDPQKTEAITRWEQPK--TELK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTCCTAGCAAGCAATCACACAGATTTGGCAAAGCGTACTCCCTCAATTGTCTGTTTCGACCATGTGCTAAGAGAATAGAATTCTGCTGGGAATCTGCGTTCGTATT
GTTCTGGGCATTTGGTGGTTTTGGCCGGTTTCATGGTAAAATTGAAGTTGAAACAGAAGTTTCGTTTGCTGTCGCGCAGTCTAAACAAGGTTCGGTGTTTGGTGAATATC
GAGCGGAAGAGTCAGATTTTGAAGCTGCAATTTTGAGGAGCGAGAGCAAGACTAGTCTCGCTAGTCTCGCTGGTCAAAATTGTCTCACAAATCTCGCTGGGTTGTGGCTC
TCGCTGGTCTCGCTCCTCCACAGTCTCGCTGGACTTGGAGAAGGCATATGGGTAAATTATGTATGGAAATGGCCCCATGTTGAAGGACTTGGAGAAGACATGAGAGTGGC
CAGAAATTTGTTAAGGGGCCCTGTACTGAAAAACTCAGAGAAGGTACTTGAGTGGTCCTGGTTAGGAGGATACCTGGGATGGGGTGTAACAAGGAATGCGTCCGAGAGGT
TGAGCACGGGTGATGGTTGGTGCTATGCGTTTGTTGAGGTCGCTGGGCAAGTGACCATGCATTGGTTGATGATAGTAAGGGGTTGCGTTGGAGAAGCATGCGTTCAAGAG
AGCCTATGGGACAGACGAATGAGTAAAGTTGTGGAGAGTTCGGGTCGTTTCCATGGTGAAACCAAAGCTGGAATAGAGAGCTTGACTGCTGTCGCGCAATCCGAGCATGG
TCTGGTGTTTGGAGAAGAGCGAGCTAAGGAGATTGAGTTTCAAGCTGAACTTTTGAGAATTGTTGTGCAATCTAGAGTAGCGCAAAGAAACTTTGAAATATCTGATGAAC
GAGGAAGAGTTAAGCCACGAAAATTTAAGCTGTGGCGAGCATCTGGAATAACTAGTGAATTACAACTCTATTGTACCGCAATTGGTGCATTAGTCTTTGCAGCCTTAATG
CTTTTTGCCGGTTGGTTTCATTATCACAAGGCTGCTCCTAAATTGGCTTGGTTCCAAGATGTCGAATCCATGTTGAATCACCATTTAGCGGGTTTGCCCTTTTATAGATG
CTGCGTTCGCAATCAATGGCAAATCTGCTTGATTGGCGGCGAGATTGATATCATGCGTCCATCTCGCCTGATTGGTGCTGAGGAACCTCGAGTAAAGGCAGAGCAGTGGA
TTGATATGGAACCCAGTGCCGAGGGACCACGTGAGAAGGCATATGAGAACTATAACATAGTGAAACTTAGTGCCGAGGGACAATATTTGAAAGCACTTGAGGGAACATGC
CCAGAGGGACCAGACGAGGTAGGCGGAAAGGCGCCGAGACACCGCGTGCTACTGGAGAGGAGAGGCAAACATCCTGCAGAAGCTGAAGCATGGTTAAATCAGGTAGAGAA
ATGCTTCAGAGTTATGCACTGTCCGAAAGACAGGAAGCTTGACTTGGTGACTTTCATGCTCCAGAAAGGGGCTGAAGATTGGTGGCGTTTGATTGAGCACAGGAACTTTG
ATGTAGGCTCTTTAATATGGGCAGACTTTAAGAAATCTTTTCAGGAGAAGTACTACCCTAAGTCTTACTGCTACAAGAAAAGGAAGGAATTCCTAAATCTAGTACAGGGA
AACATATCTGTTGTTGAATATGAGAAGAAATTCACAGAGCTTGCCAGGTATGCTTTGGCATTAGTAGTAGAGGAAATAGATAAGTGCAAGCGATTCGAGGAAGGCTTGCG
CAGTGAAATCCGGACCCCAGTGACGATAAGCACAGAATGGACTGATTTTGCAAAACTAGTAGAAGCAGCCATGAGAGTAGAGAAGAGTTTAGCGGGAGAAGGAGCTGACA
GAGGTAAAAATAGGACTGGTAGACTTACAGTTCATCTCGGGAGCAGACTTACCGAGGTGACGGACGACGTTTTACACCAAGAGTCTTTGGTAAGGGAAGGTTTAAGGCCT
GGTCTATTGGTTCGTATTTCTCTAGAGGACCACAAAGAGGAAATACGCAGAGAGAACGGGGGAGGTTTCCTCAGACATCTGTACCTACGGTCGCAATTCAGTCAACCTAG
TGAATCTGTGGCTAGCTCAGCACGGAAATCGTATTGTACTTCTTATGTAAATCAGCCCAGGCAGGAGATGGGAGCTGGAGAAGGGAACAGTGGAGGCCGACAGAGAGGTC
ATGCAGGTAGACCAAGACAGCAGGGACAGGTGTTTGCTGTGACACAGCAGGAGGCCGAGGCGACATCGGATGTAGTCACGGGTATGCTCACTATTTGTTCTAAAAATGCA
CATGTGTTGATAGATTCGGGGGCTACCCCTTCATTTGTGTCTCGTGAATTTGTTGTGCATATAAATAGAAAACTAGAACCCTTGCCTGAAACATTGTTAGTGCATACCCC
GATTGGAGAAACTACTATTATAGAACATGCTTATCTTGAGTGTAAAATTGTGATTGATGATGTGGTGTGGTTGGATGATCTATTACCTCTAGTATTATTGGAGTTTGATG
TTATTCTTGGCTTGGACTTTTTATCAAAATACCATGCGAAAGTTTATTGCTTCAAGAAAGAAGTGCGGTTGGCAAAATCCGATGGGGTTGAGGTGGTTTTTAAAGGAAGG
AGGCAGATTCTTCCTACCTGTGTAATTTCCGCAGTAAAGGCCACGAAGCTATTGAGTAAGGGCTGTGAAGCATATTTAGCTCATGTAACAGAAGCTAAGTTAGAGAAGCT
GAAACCAGAGGATGTACCGGTAGTACGGGAATTCCTTGATGTGTTCCCTGAGGAACTGTCAGGGTTACCACCTGATAGAGAGGAGTTGAAGGTGCAGTTACAGAAGCTAG
TAGATAAGGGGTACATTAGACCAAGTGTATCTCCATGGGGTGCACCGGTGTTGTTCATGAAGAAAAAGGATGGTACTATGAGACTGTGCATTGATTATCGCCAATTAAAT
AAGGTAACTGTACGTAACAAATATCCTTTGCCTCGTATTGATGATTTGTTTGATCAACTTAGAGGGGCATCAGTGTTCTCAAAGATAGACATGAGATATGGATATCATCA
GTTGAAGGTCAGGGAGACGGATATACCAAAAACGACTTTTAGAACGAGGTATGGACATCATGAATTCCTTGTGATGCCTTTTGGTCTGACGAATGCCCCAGCGGTGTTTA
TGGACTTGATGAATAGGATTTTCCATCCTTACGTGGATCAGTTTACTTTGAGAGGAAAGAGATTATATGCAAAGTTCAGCAAATGCGAGTTTTGGTTGGAACAGGTGATG
TTTTTGGGTCATATTGTTTCTGTTGCAGGAGTTTGTGTGGATCCTCAAAAGACTGAGGCGATTACCAGATGGGAACAGCCAAAAACAGAGTTAAAGAAAAGATTAGTGAC
AGCGCCTATTTTAGCACTTCAGGAAACAGGAAAAGAATTTGAGGTGTATTGTGATGCCTTCCATCATGGGTTAGGGTGTGTGCTAATGCAAGAGGGTAGAGTTATAGCTT
ATGCTTCTAGACAATTGAGGCCACATGAAGTTAACTACCCTACTCATGACTTGGAATTGGCAGTAGTAGTGTTGGCACTAAAAATATGGCGGCACTACTTGTTTGGAGAA
AGGTGCCATATCTACACAGATCACAAAAGTTTGAAATACATCTTTGATCAGAAAGAGCTTAACATGAGACAAAGGAGGTGGTTGGAACTGATCAAGGATTATGATTGTAC
TATTGAATATCATCCGAGGAAAGCAAATGTTGTAGCTGATGCCCTAAGTAGGAAGTCTCGAAGTGGAGTAAGTTCTTTGAATACAATTAGGGTCACTTTGCTCAAGGAAT
TTAAGAATAGTGCAGCTAGCCTAAAGGTAACTGGTACGGGAACATTATTAGCACACTTTCAGTTAAGACCTAAGCTGGTAGATGAGGTTATAAGGAAGCAAACAAAAGAT
CCTGTAGTTAAGAAGTTGATAGAGGAGGTAAAAGCTCAGCAGAAAGTAGACTTTGAGCTCAGACCTGATGGTGTACTGTTGAAACAGGATAGGATTTATGTGCCAAGAGA
TGAGGAGGCTCATAGCTCGGCGTATGCTATGCATCCTGGTAGTACCAAGATGTATAGAACGTTAAGGAAATTCTACTGGTGGCAGGTATGA
mRNA sequenceShow/hide mRNA sequence
ATGATTCCTAGCAAGCAATCACACAGATTTGGCAAAGCGTACTCCCTCAATTGTCTGTTTCGACCATGTGCTAAGAGAATAGAATTCTGCTGGGAATCTGCGTTCGTATT
GTTCTGGGCATTTGGTGGTTTTGGCCGGTTTCATGGTAAAATTGAAGTTGAAACAGAAGTTTCGTTTGCTGTCGCGCAGTCTAAACAAGGTTCGGTGTTTGGTGAATATC
GAGCGGAAGAGTCAGATTTTGAAGCTGCAATTTTGAGGAGCGAGAGCAAGACTAGTCTCGCTAGTCTCGCTGGTCAAAATTGTCTCACAAATCTCGCTGGGTTGTGGCTC
TCGCTGGTCTCGCTCCTCCACAGTCTCGCTGGACTTGGAGAAGGCATATGGGTAAATTATGTATGGAAATGGCCCCATGTTGAAGGACTTGGAGAAGACATGAGAGTGGC
CAGAAATTTGTTAAGGGGCCCTGTACTGAAAAACTCAGAGAAGGTACTTGAGTGGTCCTGGTTAGGAGGATACCTGGGATGGGGTGTAACAAGGAATGCGTCCGAGAGGT
TGAGCACGGGTGATGGTTGGTGCTATGCGTTTGTTGAGGTCGCTGGGCAAGTGACCATGCATTGGTTGATGATAGTAAGGGGTTGCGTTGGAGAAGCATGCGTTCAAGAG
AGCCTATGGGACAGACGAATGAGTAAAGTTGTGGAGAGTTCGGGTCGTTTCCATGGTGAAACCAAAGCTGGAATAGAGAGCTTGACTGCTGTCGCGCAATCCGAGCATGG
TCTGGTGTTTGGAGAAGAGCGAGCTAAGGAGATTGAGTTTCAAGCTGAACTTTTGAGAATTGTTGTGCAATCTAGAGTAGCGCAAAGAAACTTTGAAATATCTGATGAAC
GAGGAAGAGTTAAGCCACGAAAATTTAAGCTGTGGCGAGCATCTGGAATAACTAGTGAATTACAACTCTATTGTACCGCAATTGGTGCATTAGTCTTTGCAGCCTTAATG
CTTTTTGCCGGTTGGTTTCATTATCACAAGGCTGCTCCTAAATTGGCTTGGTTCCAAGATGTCGAATCCATGTTGAATCACCATTTAGCGGGTTTGCCCTTTTATAGATG
CTGCGTTCGCAATCAATGGCAAATCTGCTTGATTGGCGGCGAGATTGATATCATGCGTCCATCTCGCCTGATTGGTGCTGAGGAACCTCGAGTAAAGGCAGAGCAGTGGA
TTGATATGGAACCCAGTGCCGAGGGACCACGTGAGAAGGCATATGAGAACTATAACATAGTGAAACTTAGTGCCGAGGGACAATATTTGAAAGCACTTGAGGGAACATGC
CCAGAGGGACCAGACGAGGTAGGCGGAAAGGCGCCGAGACACCGCGTGCTACTGGAGAGGAGAGGCAAACATCCTGCAGAAGCTGAAGCATGGTTAAATCAGGTAGAGAA
ATGCTTCAGAGTTATGCACTGTCCGAAAGACAGGAAGCTTGACTTGGTGACTTTCATGCTCCAGAAAGGGGCTGAAGATTGGTGGCGTTTGATTGAGCACAGGAACTTTG
ATGTAGGCTCTTTAATATGGGCAGACTTTAAGAAATCTTTTCAGGAGAAGTACTACCCTAAGTCTTACTGCTACAAGAAAAGGAAGGAATTCCTAAATCTAGTACAGGGA
AACATATCTGTTGTTGAATATGAGAAGAAATTCACAGAGCTTGCCAGGTATGCTTTGGCATTAGTAGTAGAGGAAATAGATAAGTGCAAGCGATTCGAGGAAGGCTTGCG
CAGTGAAATCCGGACCCCAGTGACGATAAGCACAGAATGGACTGATTTTGCAAAACTAGTAGAAGCAGCCATGAGAGTAGAGAAGAGTTTAGCGGGAGAAGGAGCTGACA
GAGGTAAAAATAGGACTGGTAGACTTACAGTTCATCTCGGGAGCAGACTTACCGAGGTGACGGACGACGTTTTACACCAAGAGTCTTTGGTAAGGGAAGGTTTAAGGCCT
GGTCTATTGGTTCGTATTTCTCTAGAGGACCACAAAGAGGAAATACGCAGAGAGAACGGGGGAGGTTTCCTCAGACATCTGTACCTACGGTCGCAATTCAGTCAACCTAG
TGAATCTGTGGCTAGCTCAGCACGGAAATCGTATTGTACTTCTTATGTAAATCAGCCCAGGCAGGAGATGGGAGCTGGAGAAGGGAACAGTGGAGGCCGACAGAGAGGTC
ATGCAGGTAGACCAAGACAGCAGGGACAGGTGTTTGCTGTGACACAGCAGGAGGCCGAGGCGACATCGGATGTAGTCACGGGTATGCTCACTATTTGTTCTAAAAATGCA
CATGTGTTGATAGATTCGGGGGCTACCCCTTCATTTGTGTCTCGTGAATTTGTTGTGCATATAAATAGAAAACTAGAACCCTTGCCTGAAACATTGTTAGTGCATACCCC
GATTGGAGAAACTACTATTATAGAACATGCTTATCTTGAGTGTAAAATTGTGATTGATGATGTGGTGTGGTTGGATGATCTATTACCTCTAGTATTATTGGAGTTTGATG
TTATTCTTGGCTTGGACTTTTTATCAAAATACCATGCGAAAGTTTATTGCTTCAAGAAAGAAGTGCGGTTGGCAAAATCCGATGGGGTTGAGGTGGTTTTTAAAGGAAGG
AGGCAGATTCTTCCTACCTGTGTAATTTCCGCAGTAAAGGCCACGAAGCTATTGAGTAAGGGCTGTGAAGCATATTTAGCTCATGTAACAGAAGCTAAGTTAGAGAAGCT
GAAACCAGAGGATGTACCGGTAGTACGGGAATTCCTTGATGTGTTCCCTGAGGAACTGTCAGGGTTACCACCTGATAGAGAGGAGTTGAAGGTGCAGTTACAGAAGCTAG
TAGATAAGGGGTACATTAGACCAAGTGTATCTCCATGGGGTGCACCGGTGTTGTTCATGAAGAAAAAGGATGGTACTATGAGACTGTGCATTGATTATCGCCAATTAAAT
AAGGTAACTGTACGTAACAAATATCCTTTGCCTCGTATTGATGATTTGTTTGATCAACTTAGAGGGGCATCAGTGTTCTCAAAGATAGACATGAGATATGGATATCATCA
GTTGAAGGTCAGGGAGACGGATATACCAAAAACGACTTTTAGAACGAGGTATGGACATCATGAATTCCTTGTGATGCCTTTTGGTCTGACGAATGCCCCAGCGGTGTTTA
TGGACTTGATGAATAGGATTTTCCATCCTTACGTGGATCAGTTTACTTTGAGAGGAAAGAGATTATATGCAAAGTTCAGCAAATGCGAGTTTTGGTTGGAACAGGTGATG
TTTTTGGGTCATATTGTTTCTGTTGCAGGAGTTTGTGTGGATCCTCAAAAGACTGAGGCGATTACCAGATGGGAACAGCCAAAAACAGAGTTAAAGAAAAGATTAGTGAC
AGCGCCTATTTTAGCACTTCAGGAAACAGGAAAAGAATTTGAGGTGTATTGTGATGCCTTCCATCATGGGTTAGGGTGTGTGCTAATGCAAGAGGGTAGAGTTATAGCTT
ATGCTTCTAGACAATTGAGGCCACATGAAGTTAACTACCCTACTCATGACTTGGAATTGGCAGTAGTAGTGTTGGCACTAAAAATATGGCGGCACTACTTGTTTGGAGAA
AGGTGCCATATCTACACAGATCACAAAAGTTTGAAATACATCTTTGATCAGAAAGAGCTTAACATGAGACAAAGGAGGTGGTTGGAACTGATCAAGGATTATGATTGTAC
TATTGAATATCATCCGAGGAAAGCAAATGTTGTAGCTGATGCCCTAAGTAGGAAGTCTCGAAGTGGAGTAAGTTCTTTGAATACAATTAGGGTCACTTTGCTCAAGGAAT
TTAAGAATAGTGCAGCTAGCCTAAAGGTAACTGGTACGGGAACATTATTAGCACACTTTCAGTTAAGACCTAAGCTGGTAGATGAGGTTATAAGGAAGCAAACAAAAGAT
CCTGTAGTTAAGAAGTTGATAGAGGAGGTAAAAGCTCAGCAGAAAGTAGACTTTGAGCTCAGACCTGATGGTGTACTGTTGAAACAGGATAGGATTTATGTGCCAAGAGA
TGAGGAGGCTCATAGCTCGGCGTATGCTATGCATCCTGGTAGTACCAAGATGTATAGAACGTTAAGGAAATTCTACTGGTGGCAGGTATGA
Protein sequenceShow/hide protein sequence
MIPSKQSHRFGKAYSLNCLFRPCAKRIEFCWESAFVLFWAFGGFGRFHGKIEVETEVSFAVAQSKQGSVFGEYRAEESDFEAAILRSESKTSLASLAGQNCLTNLAGLWL
SLVSLLHSLAGLGEGIWVNYVWKWPHVEGLGEDMRVARNLLRGPVLKNSEKVLEWSWLGGYLGWGVTRNASERLSTGDGWCYAFVEVAGQVTMHWLMIVRGCVGEACVQE
SLWDRRMSKVVESSGRFHGETKAGIESLTAVAQSEHGLVFGEERAKEIEFQAELLRIVVQSRVAQRNFEISDERGRVKPRKFKLWRASGITSELQLYCTAIGALVFAALM
LFAGWFHYHKAAPKLAWFQDVESMLNHHLAGLPFYRCCVRNQWQICLIGGEIDIMRPSRLIGAEEPRVKAEQWIDMEPSAEGPREKAYENYNIVKLSAEGQYLKALEGTC
PEGPDEVGGKAPRHRVLLERRGKHPAEAEAWLNQVEKCFRVMHCPKDRKLDLVTFMLQKGAEDWWRLIEHRNFDVGSLIWADFKKSFQEKYYPKSYCYKKRKEFLNLVQG
NISVVEYEKKFTELARYALALVVEEIDKCKRFEEGLRSEIRTPVTISTEWTDFAKLVEAAMRVEKSLAGEGADRGKNRTGRLTVHLGSRLTEVTDDVLHQESLVREGLRP
GLLVRISLEDHKEEIRRENGGGFLRHLYLRSQFSQPSESVASSARKSYCTSYVNQPRQEMGAGEGNSGGRQRGHAGRPRQQGQVFAVTQQEAEATSDVVTGMLTICSKNA
HVLIDSGATPSFVSREFVVHINRKLEPLPETLLVHTPIGETTIIEHAYLECKIVIDDVVWLDDLLPLVLLEFDVILGLDFLSKYHAKVYCFKKEVRLAKSDGVEVVFKGR
RQILPTCVISAVKATKLLSKGCEAYLAHVTEAKLEKLKPEDVPVVREFLDVFPEELSGLPPDREELKVQLQKLVDKGYIRPSVSPWGAPVLFMKKKDGTMRLCIDYRQLN
KVTVRNKYPLPRIDDLFDQLRGASVFSKIDMRYGYHQLKVRETDIPKTTFRTRYGHHEFLVMPFGLTNAPAVFMDLMNRIFHPYVDQFTLRGKRLYAKFSKCEFWLEQVM
FLGHIVSVAGVCVDPQKTEAITRWEQPKTELKKRLVTAPILALQETGKEFEVYCDAFHHGLGCVLMQEGRVIAYASRQLRPHEVNYPTHDLELAVVVLALKIWRHYLFGE
RCHIYTDHKSLKYIFDQKELNMRQRRWLELIKDYDCTIEYHPRKANVVADALSRKSRSGVSSLNTIRVTLLKEFKNSAASLKVTGTGTLLAHFQLRPKLVDEVIRKQTKD
PVVKKLIEEVKAQQKVDFELRPDGVLLKQDRIYVPRDEEAHSSAYAMHPGSTKMYRTLRKFYWWQV