; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0004933 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0004933
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionReverse transcriptase
Genome locationchr07:16354125..16357925
RNA-Seq ExpressionPay0004933
SyntenyPay0004933
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0008270 - zinc ion binding (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR043502 - DNA/RNA polymerase superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR041588 - Integrase zinc-binding domain
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR036397 - Ribonuclease H superfamily
IPR016197 - Chromo-like domain superfamily
IPR012337 - Ribonuclease H-like superfamily
IPR001584 - Integrase, catalytic core


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048687.1 pol protein [Cucumis melo var. makuwa]0.0e+0089.61Show/hide
Query:  ISVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKL--------------------------
        +SVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKL                          
Subjt:  ISVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKL--------------------------

Query:  -----------------LKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
                         LKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
Subjt:  -----------------LKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT
        LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDS VKT
Subjt:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT

Query:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLT
        ELLSEAHSSPFSMHPGSTKMY+D+KRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLT
Subjt:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLT

Query:  KSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLH
        KSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWK LQTAMGTRLDFSTAFHPQTDGQTERLNQVLE MLRACALEFPGSWDSHLH
Subjt:  KSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLH

Query:  LMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPEL-------------------SRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRF
        LMEF YNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPEL                   SRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRF
Subjt:  LMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPEL-------------------SRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRF

Query:  ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKVLWRNHR
        ERRGKLSPRF+GPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDY+PL+IDENLSYTEQPVEVLAREVKTLRNKEIPLVKVLWRNHR
Subjt:  ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKVLWRNHR

Query:  VEEATWEREDDMRSRYPELFEE
        VEEATWEREDDM+SRYPEL  E
Subjt:  VEEATWEREDDMRSRYPELFEE

KAA0051357.1 pol protein [Cucumis melo var. makuwa]0.0e+0089.89Show/hide
Query:  ISVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKL--------------------------
        +SVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKL                          
Subjt:  ISVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKL--------------------------

Query:  -----------------LKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
                         LKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
Subjt:  -----------------LKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT
        LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGL FE RLCVPSDSAVKT
Subjt:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT

Query:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLT
        ELL EAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS+CLVCQQVK PRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLT
Subjt:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLT

Query:  KSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLH
        KSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQ DGQTERLNQVLEDMLRACALEFPGSWDSHLH
Subjt:  KSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLH

Query:  LMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPEL-------------------SRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRF
        LMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPEL                   SRQKSYADVRRKDLEFE+ DKVFLKVAPM+GVLRF
Subjt:  LMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPEL-------------------SRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRF

Query:  ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKVLWRNHR
        ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSY EQPVEVLAREVKTLRNKEIPLVKVLWRNHR
Subjt:  ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKVLWRNHR

Query:  VEEATWEREDDMRSRYPELFEE
        VEEATWEREDDMRSRYPELFEE
Subjt:  VEEATWEREDDMRSRYPELFEE

KAA0051368.1 pol protein [Cucumis melo var. makuwa]0.0e+0090.17Show/hide
Query:  ISVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKL--------------------------
        +SVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKL                          
Subjt:  ISVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKL--------------------------

Query:  -----------------LKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
                         LKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
Subjt:  -----------------LKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT
        LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQ  EFSLSSDGGLLFERRLCVPSDSAVKT
Subjt:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT

Query:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLT
        ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLT
Subjt:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLT

Query:  KSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLH
        KSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLH
Subjt:  KSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLH

Query:  LMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPEL-------------------SRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRF
        LMEFAYNNSYQATIGMAPFEALYG+CCRSPVCWGEVGEQRLMGPEL                   SRQKSYADVRRKDLEFE+GDKVFLKVAPM+GVLRF
Subjt:  LMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPEL-------------------SRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRF

Query:  ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKVLWRNHR
        ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSY EQPVEVLAREVKTLRNK+IPLVKVLWRNHR
Subjt:  ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKVLWRNHR

Query:  VEEATWEREDDMRSRYPELFEE
        VEEATWEREDDMRSRYPELFEE
Subjt:  VEEATWEREDDMRSRYPELFEE

KAA0056300.1 pol protein [Cucumis melo var. makuwa]0.0e+0089.47Show/hide
Query:  ISVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKL--------------------------
        +SVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKL                          
Subjt:  ISVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKL--------------------------

Query:  -----------------LKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
                         LKSHEQNYPTHDLELAAVVFALKIWRHYLY EKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
Subjt:  -----------------LKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT
        LSRKVSHSAALITRQAPLHRDLE AEIAVSVGAVTMQLAQLTVQPTLRQ IIDAQ NDPYLVEKRGLAEAGQAVEFS+SSDGGLLFERRLCVPSD AVKT
Subjt:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT

Query:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLT
        ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLT
Subjt:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLT

Query:  KSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLH
        KSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIV DRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDML ACALEFPGSWDSHLH
Subjt:  KSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLH

Query:  LMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPEL-------------------SRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRF
        LMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPEL                   SRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRF
Subjt:  LMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPEL-------------------SRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRF

Query:  ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKVLWRNHR
        ERRGKLSPRFVGPFEILE+IG V YRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPV+VLAREVKTLRNKEIPLVKVLWRNH+
Subjt:  ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKVLWRNHR

Query:  VEEATWEREDDMRSRYPELFEE
        VEEATWEREDDMRSRYPELFEE
Subjt:  VEEATWEREDDMRSRYPELFEE

KAA0057672.1 pol protein [Cucumis melo var. makuwa]0.0e+0089.47Show/hide
Query:  ISVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKL--------------------------
        +SVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSR ATPLTQLTRKGAPFVWSKACEDSFQNLKQKL                          
Subjt:  ISVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKL--------------------------

Query:  -----------------LKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
                         LKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
Subjt:  -----------------LKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT
        LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT
Subjt:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT

Query:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLT
        ELL+EAHSSPFSMHPGSTKMYQDLKR+YWWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFI GLPRTLRGFTVIWVVVDRLT
Subjt:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLT

Query:  KSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLH
        KSAHFVPGKSTYT SKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLN+VLEDMLRACALEFPGSWDSHLH
Subjt:  KSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLH

Query:  LMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPEL-------------------SRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRF
        LMEFAYNNSYQATIGMAPFEALY KCCRSP+CWGEVGEQRLMGPEL                   SRQKSYADVRRKDLEFEVGDKVFLKVAPMRGV+RF
Subjt:  LMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPEL-------------------SRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRF

Query:  ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKVLWRNHR
        ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDY+PLEIDENLSY EQPVEVLAREVKTLRNKEIPLVKVLWRNHR
Subjt:  ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKVLWRNHR

Query:  VEEATWEREDDMRSRYPELFEE
        +EEATWEREDDMRSRYPELFEE
Subjt:  VEEATWEREDDMRSRYPELFEE

TrEMBL top hitse value%identityAlignment
A0A5A7U330 Reverse transcriptase0.0e+0089.61Show/hide
Query:  ISVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKL--------------------------
        +SVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKL                          
Subjt:  ISVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKL--------------------------

Query:  -----------------LKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
                         LKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
Subjt:  -----------------LKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT
        LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDS VKT
Subjt:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT

Query:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLT
        ELLSEAHSSPFSMHPGSTKMY+D+KRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLT
Subjt:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLT

Query:  KSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLH
        KSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWK LQTAMGTRLDFSTAFHPQTDGQTERLNQVLE MLRACALEFPGSWDSHLH
Subjt:  KSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLH

Query:  LMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPEL-------------------SRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRF
        LMEF YNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPEL                   SRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRF
Subjt:  LMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPEL-------------------SRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRF

Query:  ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKVLWRNHR
        ERRGKLSPRF+GPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDY+PL+IDENLSYTEQPVEVLAREVKTLRNKEIPLVKVLWRNHR
Subjt:  ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKVLWRNHR

Query:  VEEATWEREDDMRSRYPELFEE
        VEEATWEREDDM+SRYPEL  E
Subjt:  VEEATWEREDDMRSRYPELFEE

A0A5A7U7V9 Reverse transcriptase0.0e+0090.17Show/hide
Query:  ISVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKL--------------------------
        +SVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKL                          
Subjt:  ISVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKL--------------------------

Query:  -----------------LKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
                         LKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
Subjt:  -----------------LKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT
        LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQ  EFSLSSDGGLLFERRLCVPSDSAVKT
Subjt:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT

Query:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLT
        ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLT
Subjt:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLT

Query:  KSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLH
        KSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLH
Subjt:  KSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLH

Query:  LMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPEL-------------------SRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRF
        LMEFAYNNSYQATIGMAPFEALYG+CCRSPVCWGEVGEQRLMGPEL                   SRQKSYADVRRKDLEFE+GDKVFLKVAPM+GVLRF
Subjt:  LMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPEL-------------------SRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRF

Query:  ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKVLWRNHR
        ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSY EQPVEVLAREVKTLRNK+IPLVKVLWRNHR
Subjt:  ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKVLWRNHR

Query:  VEEATWEREDDMRSRYPELFEE
        VEEATWEREDDMRSRYPELFEE
Subjt:  VEEATWEREDDMRSRYPELFEE

A0A5A7UAA8 Reverse transcriptase0.0e+0089.89Show/hide
Query:  ISVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKL--------------------------
        +SVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKL                          
Subjt:  ISVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKL--------------------------

Query:  -----------------LKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
                         LKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
Subjt:  -----------------LKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT
        LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGL FE RLCVPSDSAVKT
Subjt:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT

Query:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLT
        ELL EAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS+CLVCQQVK PRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLT
Subjt:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLT

Query:  KSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLH
        KSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQ DGQTERLNQVLEDMLRACALEFPGSWDSHLH
Subjt:  KSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLH

Query:  LMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPEL-------------------SRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRF
        LMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPEL                   SRQKSYADVRRKDLEFE+ DKVFLKVAPM+GVLRF
Subjt:  LMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPEL-------------------SRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRF

Query:  ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKVLWRNHR
        ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSY EQPVEVLAREVKTLRNKEIPLVKVLWRNHR
Subjt:  ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKVLWRNHR

Query:  VEEATWEREDDMRSRYPELFEE
        VEEATWEREDDMRSRYPELFEE
Subjt:  VEEATWEREDDMRSRYPELFEE

A0A5A7UMD7 Pol protein0.0e+0089.47Show/hide
Query:  ISVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKL--------------------------
        +SVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKL                          
Subjt:  ISVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKL--------------------------

Query:  -----------------LKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
                         LKSHEQNYPTHDLELAAVVFALKIWRHYLY EKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
Subjt:  -----------------LKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT
        LSRKVSHSAALITRQAPLHRDLE AEIAVSVGAVTMQLAQLTVQPTLRQ IIDAQ NDPYLVEKRGLAEAGQAVEFS+SSDGGLLFERRLCVPSD AVKT
Subjt:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT

Query:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLT
        ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLT
Subjt:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLT

Query:  KSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLH
        KSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIV DRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDML ACALEFPGSWDSHLH
Subjt:  KSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLH

Query:  LMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPEL-------------------SRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRF
        LMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPEL                   SRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRF
Subjt:  LMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPEL-------------------SRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRF

Query:  ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKVLWRNHR
        ERRGKLSPRFVGPFEILE+IG V YRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPV+VLAREVKTLRNKEIPLVKVLWRNH+
Subjt:  ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKVLWRNHR

Query:  VEEATWEREDDMRSRYPELFEE
        VEEATWEREDDMRSRYPELFEE
Subjt:  VEEATWEREDDMRSRYPELFEE

A0A5A7UP94 Pol protein0.0e+0089.47Show/hide
Query:  ISVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKL--------------------------
        +SVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSR ATPLTQLTRKGAPFVWSKACEDSFQNLKQKL                          
Subjt:  ISVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKL--------------------------

Query:  -----------------LKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
                         LKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
Subjt:  -----------------LKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT
        LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT
Subjt:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT

Query:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLT
        ELL+EAHSSPFSMHPGSTKMYQDLKR+YWWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFI GLPRTLRGFTVIWVVVDRLT
Subjt:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLT

Query:  KSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLH
        KSAHFVPGKSTYT SKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLN+VLEDMLRACALEFPGSWDSHLH
Subjt:  KSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLH

Query:  LMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPEL-------------------SRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRF
        LMEFAYNNSYQATIGMAPFEALY KCCRSP+CWGEVGEQRLMGPEL                   SRQKSYADVRRKDLEFEVGDKVFLKVAPMRGV+RF
Subjt:  LMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPEL-------------------SRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRF

Query:  ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKVLWRNHR
        ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDY+PLEIDENLSY EQPVEVLAREVKTLRNKEIPLVKVLWRNHR
Subjt:  ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKVLWRNHR

Query:  VEEATWEREDDMRSRYPELFEE
        +EEATWEREDDMRSRYPELFEE
Subjt:  VEEATWEREDDMRSRYPELFEE

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein9.2e-7729.14Show/hide
Query:  IEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLL------------------------------KS
        I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + +N+KQ L+                              + 
Subjt:  IEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLL------------------------------KS

Query:  HEQ------------------NYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVA
        H+                   NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN +A
Subjt:  HEQ------------------NYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVA

Query:  DALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDS
        DALSR       ++    P+ +D E   I          + Q+++    + +++   +ND  L+    L    + VE ++    GLL   +  + +P+D+
Subjt:  DALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDS

Query:  AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVV
         +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  WE++SMDFIT LP +  G+  ++VVV
Subjt:  AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVV

Query:  DRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWD
        DR +K A  VP   + TA + A+++   ++   G P  I++D D  FTS+ WK         + FS  + PQTDGQTER NQ +E +LR      P +W 
Subjt:  DRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWD

Query:  SHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEVGE----------------QRLMGPELSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGV
         H+ L++ +YNN+  +   M PFE ++      SP+      +                +  +     + K Y D++ +++ EF+ GD V +K     G 
Subjt:  SHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEVGE----------------QRLMGPELSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGV

Query:  LRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTV-HDVFHVSMLRKY
        L   +  KL+P F GPF +L++ GP  Y L LP S+  +    FHVS L KY
Subjt:  LRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTV-HDVFHVSMLRKY

P0CT35 Transposon Tf2-2 polyprotein9.2e-7729.14Show/hide
Query:  IEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLL------------------------------KS
        I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + +N+KQ L+                              + 
Subjt:  IEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLL------------------------------KS

Query:  HEQ------------------NYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVA
        H+                   NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN +A
Subjt:  HEQ------------------NYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVA

Query:  DALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDS
        DALSR       ++    P+ +D E   I          + Q+++    + +++   +ND  L+    L    + VE ++    GLL   +  + +P+D+
Subjt:  DALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDS

Query:  AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVV
         +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  WE++SMDFIT LP +  G+  ++VVV
Subjt:  AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVV

Query:  DRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWD
        DR +K A  VP   + TA + A+++   ++   G P  I++D D  FTS+ WK         + FS  + PQTDGQTER NQ +E +LR      P +W 
Subjt:  DRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWD

Query:  SHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEVGE----------------QRLMGPELSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGV
         H+ L++ +YNN+  +   M PFE ++      SP+      +                +  +     + K Y D++ +++ EF+ GD V +K     G 
Subjt:  SHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEVGE----------------QRLMGPELSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGV

Query:  LRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTV-HDVFHVSMLRKY
        L   +  KL+P F GPF +L++ GP  Y L LP S+  +    FHVS L KY
Subjt:  LRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTV-HDVFHVSMLRKY

P0CT36 Transposon Tf2-3 polyprotein9.2e-7729.14Show/hide
Query:  IEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLL------------------------------KS
        I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + +N+KQ L+                              + 
Subjt:  IEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLL------------------------------KS

Query:  HEQ------------------NYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVA
        H+                   NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN +A
Subjt:  HEQ------------------NYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVA

Query:  DALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDS
        DALSR       ++    P+ +D E   I          + Q+++    + +++   +ND  L+    L    + VE ++    GLL   +  + +P+D+
Subjt:  DALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDS

Query:  AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVV
         +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  WE++SMDFIT LP +  G+  ++VVV
Subjt:  AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVV

Query:  DRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWD
        DR +K A  VP   + TA + A+++   ++   G P  I++D D  FTS+ WK         + FS  + PQTDGQTER NQ +E +LR      P +W 
Subjt:  DRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWD

Query:  SHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEVGE----------------QRLMGPELSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGV
         H+ L++ +YNN+  +   M PFE ++      SP+      +                +  +     + K Y D++ +++ EF+ GD V +K     G 
Subjt:  SHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEVGE----------------QRLMGPELSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGV

Query:  LRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTV-HDVFHVSMLRKY
        L   +  KL+P F GPF +L++ GP  Y L LP S+  +    FHVS L KY
Subjt:  LRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTV-HDVFHVSMLRKY

P0CT41 Transposon Tf2-12 polyprotein9.2e-7729.14Show/hide
Query:  IEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLL------------------------------KS
        I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + +N+KQ L+                              + 
Subjt:  IEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLL------------------------------KS

Query:  HEQ------------------NYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVA
        H+                   NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN +A
Subjt:  HEQ------------------NYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVA

Query:  DALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDS
        DALSR       ++    P+ +D E   I          + Q+++    + +++   +ND  L+    L    + VE ++    GLL   +  + +P+D+
Subjt:  DALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDS

Query:  AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVV
         +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  WE++SMDFIT LP +  G+  ++VVV
Subjt:  AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVV

Query:  DRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWD
        DR +K A  VP   + TA + A+++   ++   G P  I++D D  FTS+ WK         + FS  + PQTDGQTER NQ +E +LR      P +W 
Subjt:  DRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWD

Query:  SHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEVGE----------------QRLMGPELSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGV
         H+ L++ +YNN+  +   M PFE ++      SP+      +                +  +     + K Y D++ +++ EF+ GD V +K     G 
Subjt:  SHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEVGE----------------QRLMGPELSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGV

Query:  LRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTV-HDVFHVSMLRKY
        L   +  KL+P F GPF +L++ GP  Y L LP S+  +    FHVS L KY
Subjt:  LRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTV-HDVFHVSMLRKY

Q9UR07 Transposon Tf2-11 polyprotein9.2e-7729.14Show/hide
Query:  IEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLL------------------------------KS
        I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + +N+KQ L+                              + 
Subjt:  IEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLL------------------------------KS

Query:  HEQ------------------NYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVA
        H+                   NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN +A
Subjt:  HEQ------------------NYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVA

Query:  DALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDS
        DALSR       ++    P+ +D E   I          + Q+++    + +++   +ND  L+    L    + VE ++    GLL   +  + +P+D+
Subjt:  DALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDS

Query:  AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVV
         +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  WE++SMDFIT LP +  G+  ++VVV
Subjt:  AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVV

Query:  DRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWD
        DR +K A  VP   + TA + A+++   ++   G P  I++D D  FTS+ WK         + FS  + PQTDGQTER NQ +E +LR      P +W 
Subjt:  DRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWD

Query:  SHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEVGE----------------QRLMGPELSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGV
         H+ L++ +YNN+  +   M PFE ++      SP+      +                +  +     + K Y D++ +++ EF+ GD V +K     G 
Subjt:  SHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEVGE----------------QRLMGPELSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGV

Query:  LRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTV-HDVFHVSMLRKY
        L   +  KL+P F GPF +L++ GP  Y L LP S+  +    FHVS L KY
Subjt:  LRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTV-HDVFHVSMLRKY

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein3.9e-1447.89Show/hide
Query:  ISVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLK
        +S DPAK+EA+ GW  P   +E+R FLGL GYYRRFV+N+ +I  PLT+L +K +   W++    +F+ LK
Subjt:  ISVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGATGCACTGCGCCTGGCAGTGGATCTCAGTTTACAGGAGAGGGCTAACTCGTCTAAGACCGCTGGTAGAGGTTCGACATCGGGACAGAAGAGGAAGGCTGAGCAG
CAGCCTGTTCCAGTGCCACAGCGGAATTTCAGATCAGGTGGTGATCTCTGTGGATCCAGCTAAGATAGAGGCAGTCACCGGTTGGACCCGACCTTCCACAGTCAGTGAGG
TTCGTAGCTTTCTGGGTTTAGCAGGTTATTATCGACGGTTTGTGGAGAACTTTTCTCGTATAGCTACTCCTCTTACTCAGTTGACCAGGAAGGGAGCTCCTTTTGTTTGG
AGCAAGGCATGTGAGGACAGTTTCCAGAACCTTAAACAGAAGCTATTGAAGAGTCATGAGCAGAACTACCCTACACATGATCTAGAGTTGGCAGCAGTGGTTTTTGCTTT
GAAAATATGGAGGCATTATTTATATGGTGAAAAGATACAGATATTCACGGATCATAAGAGCTTGAAATACTTCTTTACTCAGAAAGAATTGAATATGAGACAGCGAAGGT
GGCTTGAGTTAGTGAAGGATTACGATTGTGAGATACTGTATCATCCAGGCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGAAAGGTATCACATTCGGCAGCACTTATT
ACCCGACAGGCCCCATTGCATCGGGATCTCGAGCGGGCTGAGATTGCAGTGTCAGTGGGGGCAGTCACTATGCAGTTAGCCCAGTTGACGGTACAGCCGACTTTGAGGCA
AAGGATCATTGATGCTCAGAGTAACGATCCTTATCTGGTTGAGAAACGTGGCCTAGCAGAGGCAGGGCAAGCGGTTGAGTTCTCATTATCCTCTGATGGTGGACTTTTGT
TTGAGAGACGCCTCTGTGTGCCGTCAGATAGTGCGGTTAAGACAGAATTATTATCTGAGGCTCACAGTTCCCCATTTTCCATGCACCCAGGTAGTACGAAGATGTATCAG
GACCTGAAGCGGGTTTATTGGTGGCGTAACATGAAGAGGGAAGTAGCAGAATTTGTTAGTAGATGCTTGGTGTGTCAGCAGGTTAAGGCACCAAGGCAGAAACCAGCGGG
TTTATTACAACCCTTGAGCATACCGGAATGGAAGTGGGAAAACGTGTCCATGGATTTCATTACAGGACTGCCGAGAACTCTGAGGGGTTTTACAGTGATTTGGGTTGTGG
TGGACAGACTTACCAAATCAGCGCACTTCGTTCCGGGTAAATCCACCTATACTGCTAGTAAGTGGGCACAGCTGTACATGTCTGAGATAGTGAGATTGCATGGAGTGCCA
GTGTCGATTGTTTCTGATAGAGATGCCCGTTTCACTTCCAAATTCTGGAAGGGTTTGCAGACTGCTATGGGCACGAGGTTAGACTTTAGTACAGCTTTCCATCCACAGAC
TGACGGTCAGACTGAGCGTCTGAACCAAGTTTTAGAGGATATGTTGCGAGCGTGTGCATTGGAATTTCCAGGTAGCTGGGACTCCCACTTACATTTGATGGAATTTGCTT
ATAATAACAGTTATCAGGCTACTATTGGCATGGCACCATTTGAGGCCTTGTACGGCAAATGTTGTAGATCCCCGGTTTGCTGGGGTGAGGTGGGTGAGCAGAGATTGATG
GGTCCTGAGTTAAGTAGACAGAAGAGTTATGCAGATGTGAGGCGGAAGGATCTTGAGTTTGAGGTAGGGGACAAGGTGTTCTTAAAGGTAGCACCTATGAGAGGTGTCTT
ACGATTTGAAAGGAGGGGAAAGCTGAGTCCCCGTTTTGTTGGGCCGTTTGAGATTCTGGAGCGGATTGGCCCTGTAGCTTATCGCTTGGCGTTGCCACCATCACTCTCGA
CAGTTCATGATGTGTTTCATGTTTCTATGTTGAGGAAGTACGTGCCAGATCCATCCCATGTAGTGGATTACGAGCCACTGGAGATTGATGAAAACTTGAGCTATACTGAA
CAACCCGTTGAGGTGCTTGCTAGAGAGGTGAAAACGTTGAGGAATAAAGAAATCCCTCTGGTTAAAGTCTTATGGCGGAATCACCGGGTAGAAGAGGCTACATGGGAGCG
AGAAGATGACATGAGGTCCCGTTATCCCGAACTGTTCGAGGAATAA
mRNA sequenceShow/hide mRNA sequence
ATGCCGATGCACTGCGCCTGGCAGTGGATCTCAGTTTACAGGAGAGGGCTAACTCGTCTAAGACCGCTGGTAGAGGTTCGACATCGGGACAGAAGAGGAAGGCTGAGCAG
CAGCCTGTTCCAGTGCCACAGCGGAATTTCAGATCAGGTGGTGATCTCTGTGGATCCAGCTAAGATAGAGGCAGTCACCGGTTGGACCCGACCTTCCACAGTCAGTGAGG
TTCGTAGCTTTCTGGGTTTAGCAGGTTATTATCGACGGTTTGTGGAGAACTTTTCTCGTATAGCTACTCCTCTTACTCAGTTGACCAGGAAGGGAGCTCCTTTTGTTTGG
AGCAAGGCATGTGAGGACAGTTTCCAGAACCTTAAACAGAAGCTATTGAAGAGTCATGAGCAGAACTACCCTACACATGATCTAGAGTTGGCAGCAGTGGTTTTTGCTTT
GAAAATATGGAGGCATTATTTATATGGTGAAAAGATACAGATATTCACGGATCATAAGAGCTTGAAATACTTCTTTACTCAGAAAGAATTGAATATGAGACAGCGAAGGT
GGCTTGAGTTAGTGAAGGATTACGATTGTGAGATACTGTATCATCCAGGCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGAAAGGTATCACATTCGGCAGCACTTATT
ACCCGACAGGCCCCATTGCATCGGGATCTCGAGCGGGCTGAGATTGCAGTGTCAGTGGGGGCAGTCACTATGCAGTTAGCCCAGTTGACGGTACAGCCGACTTTGAGGCA
AAGGATCATTGATGCTCAGAGTAACGATCCTTATCTGGTTGAGAAACGTGGCCTAGCAGAGGCAGGGCAAGCGGTTGAGTTCTCATTATCCTCTGATGGTGGACTTTTGT
TTGAGAGACGCCTCTGTGTGCCGTCAGATAGTGCGGTTAAGACAGAATTATTATCTGAGGCTCACAGTTCCCCATTTTCCATGCACCCAGGTAGTACGAAGATGTATCAG
GACCTGAAGCGGGTTTATTGGTGGCGTAACATGAAGAGGGAAGTAGCAGAATTTGTTAGTAGATGCTTGGTGTGTCAGCAGGTTAAGGCACCAAGGCAGAAACCAGCGGG
TTTATTACAACCCTTGAGCATACCGGAATGGAAGTGGGAAAACGTGTCCATGGATTTCATTACAGGACTGCCGAGAACTCTGAGGGGTTTTACAGTGATTTGGGTTGTGG
TGGACAGACTTACCAAATCAGCGCACTTCGTTCCGGGTAAATCCACCTATACTGCTAGTAAGTGGGCACAGCTGTACATGTCTGAGATAGTGAGATTGCATGGAGTGCCA
GTGTCGATTGTTTCTGATAGAGATGCCCGTTTCACTTCCAAATTCTGGAAGGGTTTGCAGACTGCTATGGGCACGAGGTTAGACTTTAGTACAGCTTTCCATCCACAGAC
TGACGGTCAGACTGAGCGTCTGAACCAAGTTTTAGAGGATATGTTGCGAGCGTGTGCATTGGAATTTCCAGGTAGCTGGGACTCCCACTTACATTTGATGGAATTTGCTT
ATAATAACAGTTATCAGGCTACTATTGGCATGGCACCATTTGAGGCCTTGTACGGCAAATGTTGTAGATCCCCGGTTTGCTGGGGTGAGGTGGGTGAGCAGAGATTGATG
GGTCCTGAGTTAAGTAGACAGAAGAGTTATGCAGATGTGAGGCGGAAGGATCTTGAGTTTGAGGTAGGGGACAAGGTGTTCTTAAAGGTAGCACCTATGAGAGGTGTCTT
ACGATTTGAAAGGAGGGGAAAGCTGAGTCCCCGTTTTGTTGGGCCGTTTGAGATTCTGGAGCGGATTGGCCCTGTAGCTTATCGCTTGGCGTTGCCACCATCACTCTCGA
CAGTTCATGATGTGTTTCATGTTTCTATGTTGAGGAAGTACGTGCCAGATCCATCCCATGTAGTGGATTACGAGCCACTGGAGATTGATGAAAACTTGAGCTATACTGAA
CAACCCGTTGAGGTGCTTGCTAGAGAGGTGAAAACGTTGAGGAATAAAGAAATCCCTCTGGTTAAAGTCTTATGGCGGAATCACCGGGTAGAAGAGGCTACATGGGAGCG
AGAAGATGACATGAGGTCCCGTTATCCCGAACTGTTCGAGGAATAA
Protein sequenceShow/hide protein sequence
MPMHCAWQWISVYRRGLTRLRPLVEVRHRDRRGRLSSSLFQCHSGISDQVVISVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVW
SKACEDSFQNLKQKLLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALI
TRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQ
DLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVP
VSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLM
GPELSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTE
QPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEREDDMRSRYPELFEE