CuGenDBv2

Pay0005846 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0005846
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Reverse transcriptase
Genome location: chr05:20750801..20752435
RNA-Seq Expression: Pay0005846
Synteny: Pay0005846
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004518 - nuclease activity (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
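The genome location string can be split into its parts programmatically. The following is a minimal sketch, assuming the `chr:start..end` form shown in this record uses 1-based inclusive coordinates (an assumption; the record does not state the convention). The function names `parse_location` and `span_length` are hypothetical helpers for illustration.

```python
import re


def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(start, end):
    """Length of the genomic span, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr05:20750801..20752435")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the span covers 1635 bp on chr05.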


Homology
GenBank top hits    e value    % identity    Alignment
KAA0051368.1 pol protein [Cucumis melo var. makuwa]    8.7e-196    68.2
Query:  MVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRWFVENFSLIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF-----
        +VSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYR FVENFS IATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF     
Subjt:  MVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRWFVENFSLIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF-----

Query:  -------------------------------------------------------------------------------RRWLELVKDYDCEILYHPGKA
                                                                                       RRWLELVKDYDCEILYHPGKA
Subjt:  -------------------------------------------------------------------------------RRWLELVKDYDCEILYHPGKA

Query:  NVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQ---------------------------------------------------RLCVPS
        NVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVT+Q                                                   RLCVPS
Subjt:  NVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQ---------------------------------------------------RLCVPS

Query:  DSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPTQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFT----
        DSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS+CLVCQQVKAP QKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFT    
Subjt:  DSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPTQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFT----

Query:  -------------------------LYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQIERLNQVLEDMLRVCALEFPGS
                                 LYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQ ERLNQVLEDMLR CALEFPGS
Subjt:  -------------------------LYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQIERLNQVLEDMLRVCALEFPGS

Query:  WDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGE
        WDSHLHLMEFAYNNSYQATIGMAPFEALYG+CCRSPVCWGEVGE
Subjt:  WDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGE

KAA0053368.1 pol protein [Cucumis melo var. makuwa]    8.7e-196    68.01
Query:  MVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRWFVENFSLIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF-----
        +VSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYR FVENFS IATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAP+LTVPDGSGSF     
Subjt:  MVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRWFVENFSLIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF-----

Query:  -------------------------------------------------------------------------------RRWLELVKDYDCEILYHPGKA
                                                                                       RRWLELVKDYDCEILYHPGKA
Subjt:  -------------------------------------------------------------------------------RRWLELVKDYDCEILYHPGKA

Query:  NVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQ---------------------------------------------------RLCVPS
        NVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVT+Q                                                   RLCVPS
Subjt:  NVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQ---------------------------------------------------RLCVPS

Query:  DSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPTQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFT----
        DS +KTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAP QKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFT    
Subjt:  DSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPTQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFT----

Query:  -------------------------LYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQIERLNQVLEDMLRVCALEFPGS
                                 LYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQ ERLNQVLEDMLR CALEFPGS
Subjt:  -------------------------LYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQIERLNQVLEDMLRVCALEFPGS

Query:  WDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGE
        WDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGE
Subjt:  WDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGE

KAA0056300.1 pol protein [Cucumis melo var. makuwa]    1.4e-193    67.83
Query:  MVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRWFVENFSLIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF-----
        +VSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYR FVENFS IATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF     
Subjt:  MVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRWFVENFSLIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF-----

Query:  -------------------------------------------------------------------------------RRWLELVKDYDCEILYHPGKA
                                                                                       RRWLELVKDYDCEILYHPGKA
Subjt:  -------------------------------------------------------------------------------RRWLELVKDYDCEILYHPGKA

Query:  NVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQ---------------------------------------------------RLCVPS
        NVVADALSRKVSHSAALITRQAPLHRDLE AEIAVSVGAVT+Q                                                   RLCVPS
Subjt:  NVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQ---------------------------------------------------RLCVPS

Query:  DSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPTQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFT----
        D AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAP QKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFT    
Subjt:  DSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPTQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFT----

Query:  -------------------------LYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQIERLNQVLEDMLRVCALEFPGS
                                 LYMSEIVRLHGVPVSIV DRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQ ERLNQVLEDML  CALEFPGS
Subjt:  -------------------------LYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQIERLNQVLEDMLRVCALEFPGS

Query:  WDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGE
        WDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGE
Subjt:  WDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGE

KAA0062245.1 pol protein [Cucumis melo var. makuwa]    2.6e-200    73.12
Query:  MVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRWFVENFSLIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF-----
        +VSKAGVSVDPAKIEAVTGWTRPST+SEVRSFLGLAGYYR FVENFS IATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF     
Subjt:  MVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRWFVENFSLIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF-----

Query:  -----------------------------------------RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVG
                                                 RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVG
Subjt:  -----------------------------------------RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVG

Query:  AVTIQ---------------------------------------------------RLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRN
        AVT+Q                                                   RLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRN
Subjt:  AVTIQ---------------------------------------------------RLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRN

Query:  MKREVAEFVSRCLVCQQVKAPTQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFT-----------------------------LYMSEIVRLHGVP
        MKREVAEFVS+CLVCQQVKAPTQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGF+                             LYMSEIVRLHGVP
Subjt:  MKREVAEFVSRCLVCQQVKAPTQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFT-----------------------------LYMSEIVRLHGVP

Query:  VSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQIERLNQVLEDMLRVCALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVC
        VSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQ ERLNQVLEDMLR CALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCC+SPVC
Subjt:  VSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQIERLNQVLEDMLRVCALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVC

Query:  WGEVGE
        WGEVGE
Subjt:  WGEVGE

KAA0062835.1 pol protein [Cucumis melo var. makuwa]    3.6e-194    67.83
Query:  MVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRWFVENFSLIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF-----
        +VSKAGVSVDPAKIEAVTGWTRPSTVSEVRSF+GLAGYYR FVENFS IATPLTQLTRKGAPF+WSKACEDSFQNLKQKLVTAPVLTVPDGSGSF     
Subjt:  MVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRWFVENFSLIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF-----

Query:  -------------------------------------------------------------------------------RRWLELVKDYDCEILYHPGKA
                                                                                       RRWLELVKDYD EILYHPGKA
Subjt:  -------------------------------------------------------------------------------RRWLELVKDYDCEILYHPGKA

Query:  NVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQ---------------------------------------------------RLCVPS
        NVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVT+Q                                                   RLCVPS
Subjt:  NVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQ---------------------------------------------------RLCVPS

Query:  DSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPTQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFT----
        DSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAP QKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFT    
Subjt:  DSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPTQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFT----

Query:  -------------------------LYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQIERLNQVLEDMLRVCALEFPGS
                                 LYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQ ERLNQVLE+MLR CALEFPGS
Subjt:  -------------------------LYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQIERLNQVLEDMLRVCALEFPGS

Query:  WDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGE
        WDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGE
Subjt:  WDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGE

TrEMBL top hits    e value    % identity    Alignment
A0A5A7U7V9 Reverse transcriptase    4.2e-196    68.2
Query:  MVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRWFVENFSLIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF-----
        +VSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYR FVENFS IATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF     
Subjt:  MVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRWFVENFSLIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF-----

Query:  -------------------------------------------------------------------------------RRWLELVKDYDCEILYHPGKA
                                                                                       RRWLELVKDYDCEILYHPGKA
Subjt:  -------------------------------------------------------------------------------RRWLELVKDYDCEILYHPGKA

Query:  NVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQ---------------------------------------------------RLCVPS
        NVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVT+Q                                                   RLCVPS
Subjt:  NVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQ---------------------------------------------------RLCVPS

Query:  DSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPTQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFT----
        DSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS+CLVCQQVKAP QKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFT    
Subjt:  DSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPTQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFT----

Query:  -------------------------LYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQIERLNQVLEDMLRVCALEFPGS
                                 LYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQ ERLNQVLEDMLR CALEFPGS
Subjt:  -------------------------LYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQIERLNQVLEDMLRVCALEFPGS

Query:  WDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGE
        WDSHLHLMEFAYNNSYQATIGMAPFEALYG+CCRSPVCWGEVGE
Subjt:  WDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGE

A0A5A7UE75 Reverse transcriptase    4.2e-196    68.01
Query:  MVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRWFVENFSLIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF-----
        +VSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYR FVENFS IATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAP+LTVPDGSGSF     
Subjt:  MVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRWFVENFSLIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF-----

Query:  -------------------------------------------------------------------------------RRWLELVKDYDCEILYHPGKA
                                                                                       RRWLELVKDYDCEILYHPGKA
Subjt:  -------------------------------------------------------------------------------RRWLELVKDYDCEILYHPGKA

Query:  NVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQ---------------------------------------------------RLCVPS
        NVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVT+Q                                                   RLCVPS
Subjt:  NVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQ---------------------------------------------------RLCVPS

Query:  DSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPTQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFT----
        DS +KTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAP QKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFT    
Subjt:  DSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPTQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFT----

Query:  -------------------------LYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQIERLNQVLEDMLRVCALEFPGS
                                 LYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQ ERLNQVLEDMLR CALEFPGS
Subjt:  -------------------------LYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQIERLNQVLEDMLRVCALEFPGS

Query:  WDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGE
        WDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGE
Subjt:  WDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGE

A0A5A7UMD7 Pol protein    6.7e-194    67.83
Query:  MVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRWFVENFSLIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF-----
        +VSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYR FVENFS IATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF     
Subjt:  MVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRWFVENFSLIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF-----

Query:  -------------------------------------------------------------------------------RRWLELVKDYDCEILYHPGKA
                                                                                       RRWLELVKDYDCEILYHPGKA
Subjt:  -------------------------------------------------------------------------------RRWLELVKDYDCEILYHPGKA

Query:  NVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQ---------------------------------------------------RLCVPS
        NVVADALSRKVSHSAALITRQAPLHRDLE AEIAVSVGAVT+Q                                                   RLCVPS
Subjt:  NVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQ---------------------------------------------------RLCVPS

Query:  DSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPTQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFT----
        D AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAP QKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFT    
Subjt:  DSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPTQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFT----

Query:  -------------------------LYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQIERLNQVLEDMLRVCALEFPGS
                                 LYMSEIVRLHGVPVSIV DRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQ ERLNQVLEDML  CALEFPGS
Subjt:  -------------------------LYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQIERLNQVLEDMLRVCALEFPGS

Query:  WDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGE
        WDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGE
Subjt:  WDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGE

A0A5A7V8L8 Pol protein    1.3e-200    73.12
Query:  MVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRWFVENFSLIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF-----
        +VSKAGVSVDPAKIEAVTGWTRPST+SEVRSFLGLAGYYR FVENFS IATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF     
Subjt:  MVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRWFVENFSLIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF-----

Query:  -----------------------------------------RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVG
                                                 RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVG
Subjt:  -----------------------------------------RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVG

Query:  AVTIQ---------------------------------------------------RLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRN
        AVT+Q                                                   RLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRN
Subjt:  AVTIQ---------------------------------------------------RLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRN

Query:  MKREVAEFVSRCLVCQQVKAPTQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFT-----------------------------LYMSEIVRLHGVP
        MKREVAEFVS+CLVCQQVKAPTQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGF+                             LYMSEIVRLHGVP
Subjt:  MKREVAEFVSRCLVCQQVKAPTQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFT-----------------------------LYMSEIVRLHGVP

Query:  VSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQIERLNQVLEDMLRVCALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVC
        VSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQ ERLNQVLEDMLR CALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCC+SPVC
Subjt:  VSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQIERLNQVLEDMLRVCALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVC

Query:  WGEVGE
        WGEVGE
Subjt:  WGEVGE

A0A5A7VAA9 Pol protein    1.8e-194    67.83
Query:  MVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRWFVENFSLIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF-----
        +VSKAGVSVDPAKIEAVTGWTRPSTVSEVRSF+GLAGYYR FVENFS IATPLTQLTRKGAPF+WSKACEDSFQNLKQKLVTAPVLTVPDGSGSF     
Subjt:  MVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRWFVENFSLIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF-----

Query:  -------------------------------------------------------------------------------RRWLELVKDYDCEILYHPGKA
                                                                                       RRWLELVKDYD EILYHPGKA
Subjt:  -------------------------------------------------------------------------------RRWLELVKDYDCEILYHPGKA

Query:  NVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQ---------------------------------------------------RLCVPS
        NVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVT+Q                                                   RLCVPS
Subjt:  NVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQ---------------------------------------------------RLCVPS

Query:  DSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPTQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFT----
        DSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAP QKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFT    
Subjt:  DSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPTQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFT----

Query:  -------------------------LYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQIERLNQVLEDMLRVCALEFPGS
                                 LYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQ ERLNQVLE+MLR CALEFPGS
Subjt:  -------------------------LYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQIERLNQVLEDMLRVCALEFPGS

Query:  WDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGE
        WDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGE
Subjt:  WDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGE

SwissProt top hits    e value    % identity    Alignment
P0CT34 Transposon Tf2-1 polyprotein    2.7e-38    22.46
Query:  VSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRWFVENFSLIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGS-------
        +S+ G +     I+ V  W +P    E+R FLG   Y R F+   S +  PL  L +K   + W+     + +N+KQ LV+ PVL   D S         
Subjt:  VSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRWFVENFSLIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGS-------

Query:  --------------------------------------------------------------------------------------FRRWLELVKDYDCE
                                                                                                RW   ++D++ E
Subjt:  --------------------------------------------------------------------------------------FRRWLELVKDYDCE

Query:  ILYHPGKANVVADALSRKVSHSAAL----------ITRQAPLHRDLERAEIAVSVGAVTI--------------------------QRLCVPSDSAVKTE
        I Y PG AN +ADALSR V  +  +             Q  +  D +   +        +                           ++ +P+D+ +   
Subjt:  ILYHPGKANVVADALSRKVSHSAAL----------ITRQAPLHRDLERAEIAVSVGAVTI--------------------------QRLCVPSDSAVKTE

Query:  LLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPTQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTLYM--------
        ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  WE++SMDFIT LP +     L++        
Subjt:  LLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPTQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTLYM--------

Query:  --------------------SEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQIERLNQVLEDMLRVCALEFPGSWDSHLHLM
                              ++   G P  I++D D  FTS+ WK         + FS  + PQTDGQ ER NQ +E +LR      P +W  H+ L+
Subjt:  --------------------SEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQIERLNQVLEDMLRVCALEFPGSWDSHLHLM

Query:  EFAYNNSYQATIGMAPFEALY
        + +YNN+  +   M PFE ++
Subjt:  EFAYNNSYQATIGMAPFEALY

P0CT35 Transposon Tf2-2 polyprotein    2.7e-38    22.46
Query:  VSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRWFVENFSLIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGS-------
        +S+ G +     I+ V  W +P    E+R FLG   Y R F+   S +  PL  L +K   + W+     + +N+KQ LV+ PVL   D S         
Subjt:  VSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRWFVENFSLIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGS-------

Query:  --------------------------------------------------------------------------------------FRRWLELVKDYDCE
                                                                                                RW   ++D++ E
Subjt:  --------------------------------------------------------------------------------------FRRWLELVKDYDCE

Query:  ILYHPGKANVVADALSRKVSHSAAL----------ITRQAPLHRDLERAEIAVSVGAVTI--------------------------QRLCVPSDSAVKTE
        I Y PG AN +ADALSR V  +  +             Q  +  D +   +        +                           ++ +P+D+ +   
Subjt:  ILYHPGKANVVADALSRKVSHSAAL----------ITRQAPLHRDLERAEIAVSVGAVTI--------------------------QRLCVPSDSAVKTE

Query:  LLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPTQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTLYM--------
        ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  WE++SMDFIT LP +     L++        
Subjt:  LLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPTQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTLYM--------

Query:  --------------------SEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQIERLNQVLEDMLRVCALEFPGSWDSHLHLM
                              ++   G P  I++D D  FTS+ WK         + FS  + PQTDGQ ER NQ +E +LR      P +W  H+ L+
Subjt:  --------------------SEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQIERLNQVLEDMLRVCALEFPGSWDSHLHLM

Query:  EFAYNNSYQATIGMAPFEALY
        + +YNN+  +   M PFE ++
Subjt:  EFAYNNSYQATIGMAPFEALY

P0CT36 Transposon Tf2-3 polyprotein    2.7e-38    22.46
Query:  VSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRWFVENFSLIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGS-------
        +S+ G +     I+ V  W +P    E+R FLG   Y R F+   S +  PL  L +K   + W+     + +N+KQ LV+ PVL   D S         
Subjt:  VSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRWFVENFSLIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGS-------

Query:  --------------------------------------------------------------------------------------FRRWLELVKDYDCE
                                                                                                RW   ++D++ E
Subjt:  --------------------------------------------------------------------------------------FRRWLELVKDYDCE

Query:  ILYHPGKANVVADALSRKVSHSAAL----------ITRQAPLHRDLERAEIAVSVGAVTI--------------------------QRLCVPSDSAVKTE
        I Y PG AN +ADALSR V  +  +             Q  +  D +   +        +                           ++ +P+D+ +   
Subjt:  ILYHPGKANVVADALSRKVSHSAAL----------ITRQAPLHRDLERAEIAVSVGAVTI--------------------------QRLCVPSDSAVKTE

Query:  LLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPTQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTLYM--------
        ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  WE++SMDFIT LP +     L++        
Subjt:  LLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPTQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTLYM--------

Query:  --------------------SEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQIERLNQVLEDMLRVCALEFPGSWDSHLHLM
                              ++   G P  I++D D  FTS+ WK         + FS  + PQTDGQ ER NQ +E +LR      P +W  H+ L+
Subjt:  --------------------SEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQIERLNQVLEDMLRVCALEFPGSWDSHLHLM

Query:  EFAYNNSYQATIGMAPFEALY
        + +YNN+  +   M PFE ++
Subjt:  EFAYNNSYQATIGMAPFEALY

P0CT41 Transposon Tf2-12 polyprotein    2.7e-38    22.46
Query:  VSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRWFVENFSLIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGS-------
        +S+ G +     I+ V  W +P    E+R FLG   Y R F+   S +  PL  L +K   + W+     + +N+KQ LV+ PVL   D S         
Subjt:  VSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRWFVENFSLIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGS-------

Query:  --------------------------------------------------------------------------------------FRRWLELVKDYDCE
                                                                                                RW   ++D++ E
Subjt:  --------------------------------------------------------------------------------------FRRWLELVKDYDCE

Query:  ILYHPGKANVVADALSRKVSHSAAL----------ITRQAPLHRDLERAEIAVSVGAVTI--------------------------QRLCVPSDSAVKTE
        I Y PG AN +ADALSR V  +  +             Q  +  D +   +        +                           ++ +P+D+ +   
Subjt:  ILYHPGKANVVADALSRKVSHSAAL----------ITRQAPLHRDLERAEIAVSVGAVTI--------------------------QRLCVPSDSAVKTE

Query:  LLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPTQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTLYM--------
        ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  WE++SMDFIT LP +     L++        
Subjt:  LLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPTQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTLYM--------

Query:  --------------------SEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQIERLNQVLEDMLRVCALEFPGSWDSHLHLM
                              ++   G P  I++D D  FTS+ WK         + FS  + PQTDGQ ER NQ +E +LR      P +W  H+ L+
Subjt:  --------------------SEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQIERLNQVLEDMLRVCALEFPGSWDSHLHLM

Query:  EFAYNNSYQATIGMAPFEALY
        + +YNN+  +   M PFE ++
Subjt:  EFAYNNSYQATIGMAPFEALY

Q9UR07 Transposon Tf2-11 polyprotein    2.7e-38    22.46
Query:  VSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRWFVENFSLIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGS-------
        +S+ G +     I+ V  W +P    E+R FLG   Y R F+   S +  PL  L +K   + W+     + +N+KQ LV+ PVL   D S         
Subjt:  VSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRWFVENFSLIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGS-------

Query:  --------------------------------------------------------------------------------------FRRWLELVKDYDCE
                                                                                                RW   ++D++ E
Subjt:  --------------------------------------------------------------------------------------FRRWLELVKDYDCE

Query:  ILYHPGKANVVADALSRKVSHSAAL----------ITRQAPLHRDLERAEIAVSVGAVTI--------------------------QRLCVPSDSAVKTE
        I Y PG AN +ADALSR V  +  +             Q  +  D +   +        +                           ++ +P+D+ +   
Subjt:  ILYHPGKANVVADALSRKVSHSAAL----------ITRQAPLHRDLERAEIAVSVGAVTI--------------------------QRLCVPSDSAVKTE

Query:  LLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPTQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTLYM--------
        ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  WE++SMDFIT LP +     L++        
Subjt:  LLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPTQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTLYM--------

Query:  --------------------SEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQIERLNQVLEDMLRVCALEFPGSWDSHLHLM
                              ++   G P  I++D D  FTS+ WK         + FS  + PQTDGQ ER NQ +E +LR      P +W  H+ L+
Subjt:  --------------------SEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQIERLNQVLEDMLRVCALEFPGSWDSHLHLM

Query:  EFAYNNSYQATIGMAPFEALY
        + +YNN+  +   M PFE ++
Subjt:  EFAYNNSYQATIGMAPFEALY

Arabidopsis top hits    e value    % identity    Alignment
ATMG00860.1 DNA/RNA polymerases superfamily protein    1.8e-18    46.67
Query:  MVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRWFVENFSLIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPD
        ++S  GVS DPAK+EA+ GW  P   +E+R FLGL GYYR FV+N+  I  PLT+L +K +   W++    +F+ LK  + T PVL +PD
Subjt:  MVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRWFVENFSLIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPD
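The % identity value for this hit can be checked against the alignment itself. The sketch below assumes that % identity is the number of identical residues (letters on the midline) divided by the number of alignment columns; this is the usual BLAST-style convention, but the page does not state how its column was computed. The function name `percent_identity` is a hypothetical helper, and the strings are copied from the ATMG00860.1 alignment above.

```python
def percent_identity(query, midline):
    """Percent of alignment columns whose midline character is a letter.

    On a BLAST-style midline a letter marks an identical residue, '+' marks
    a conservative substitution and a space marks a mismatch. The column
    count is taken from the query line, gap characters included.
    """
    identical = sum(ch.isalpha() for ch in midline)
    return round(100.0 * identical / len(query), 2)


query = ("MVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRWFVENFSLIATPLTQL"
         "TRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPD")
midline = ("++S  GVS DPAK+EA+ GW  P   +E+R FLGL GYYR FV+N+  I  PLT+L"
           " +K +   W++    +F+ LK  + T PVL +PD")

print(percent_identity(query, midline))
```

For this hit the midline carries 42 letters over 90 columns, which gives 46.67 and matches the reported value. The same check is not attempted for the longer hits, whose displayed alignments contain long runs of `-` in both rows.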


Sequences
CDS sequence
ATGGTTTCTAAGGCTGGAGTCTCTGTGGATCCAGCTAAGATAGAGGCAGTCACTGGTTGGACCCGACCTTCCACAGTCAGTGAGGTTCGTAGCTTTCTGGGTTTA
GCAGGTTATTATCGATGGTTTGTGGAGAACTTTTCTCTTATAGCTACTCCTCTTACTCAGTTGACCAGGAAGGGAGCTCCTTTTGTTTGGAGCAAGGCATGTGAG
GACAGTTTCCAGAACCTTAAACAGAAGCTAGTTACCGCACCGGTTCTTACTGTACCTGATGGTTCTGGCAGTTTTCGAAGATGGCTTGAGTTAGTGAAGGATTAC
GATTGTGAGATACTGTATCATCCAGGCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGAAAGGTATCACATTCAGCAGCACTTATTACCCGACAGGCCCCATTG
CATCGAGATCTTGAGCGGGCTGAGATTGCAGTGTCAGTGGGGGCAGTCACTATACAACGCCTCTGTGTGCCGTCAGATAGTGCGGTTAAGACAGAATTATTATCT
GAGGCCCACAGTTCCCCATTTTCCATGCACCCAGGTAGTACGAAGATGTATCAGGACCTGAAGCGGGTTTATTGGTGGCGTAACATGAAGAGGGAGGTAGCAGAA
TTTGTTAGTAGATGCTTGGTGTGTCAGCAGGTTAAGGCACCAACGCAGAAACCAGCGGGTTTATTACAACCCTTGAGCATACCGGAATGGAAGTGGGAAAATGTG
TCCATGGATTTCATTACAGGTCTGCCGAGAACTCTGAGGGGTTTTACACTGTACATGTCGGAGATAGTGAGATTACATGGAGTGCCAGTGTCGATTGTTTCTGAT
AGAGATGCCCGTTTCACTTCCAAATTCTGGAAGGGTTTGCAGACTGCTATGGGCACGAGGTTAGACTTTAGTACAGCTTTCCATCCACAGACTGACGGTCAGATT
GAGCGTCTGAACCAAGTGTTAGAGGATATGTTGCGGGTGTGTGCATTGGAATTTCCAGGTAGCTGGGACTCCCACTTACATTTGATGGAATTTGCTTATAATAAC
AGTTATCAGGCTACTATTGGCATGGCACCATTTGAGGCCTTGTACGGCAAATGTTGTAGATCCCCGGTTTGCTGGGGTGAGGTGGGTGAGTAG
mRNA sequence
ATGGTTTCTAAGGCTGGAGTCTCTGTGGATCCAGCTAAGATAGAGGCAGTCACTGGTTGGACCCGACCTTCCACAGTCAGTGAGGTTCGTAGCTTTCTGGGTTTA
GCAGGTTATTATCGATGGTTTGTGGAGAACTTTTCTCTTATAGCTACTCCTCTTACTCAGTTGACCAGGAAGGGAGCTCCTTTTGTTTGGAGCAAGGCATGTGAG
GACAGTTTCCAGAACCTTAAACAGAAGCTAGTTACCGCACCGGTTCTTACTGTACCTGATGGTTCTGGCAGTTTTCGAAGATGGCTTGAGTTAGTGAAGGATTAC
GATTGTGAGATACTGTATCATCCAGGCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGAAAGGTATCACATTCAGCAGCACTTATTACCCGACAGGCCCCATTG
CATCGAGATCTTGAGCGGGCTGAGATTGCAGTGTCAGTGGGGGCAGTCACTATACAACGCCTCTGTGTGCCGTCAGATAGTGCGGTTAAGACAGAATTATTATCT
GAGGCCCACAGTTCCCCATTTTCCATGCACCCAGGTAGTACGAAGATGTATCAGGACCTGAAGCGGGTTTATTGGTGGCGTAACATGAAGAGGGAGGTAGCAGAA
TTTGTTAGTAGATGCTTGGTGTGTCAGCAGGTTAAGGCACCAACGCAGAAACCAGCGGGTTTATTACAACCCTTGAGCATACCGGAATGGAAGTGGGAAAATGTG
TCCATGGATTTCATTACAGGTCTGCCGAGAACTCTGAGGGGTTTTACACTGTACATGTCGGAGATAGTGAGATTACATGGAGTGCCAGTGTCGATTGTTTCTGAT
AGAGATGCCCGTTTCACTTCCAAATTCTGGAAGGGTTTGCAGACTGCTATGGGCACGAGGTTAGACTTTAGTACAGCTTTCCATCCACAGACTGACGGTCAGATT
GAGCGTCTGAACCAAGTGTTAGAGGATATGTTGCGGGTGTGTGCATTGGAATTTCCAGGTAGCTGGGACTCCCACTTACATTTGATGGAATTTGCTTATAATAAC
AGTTATCAGGCTACTATTGGCATGGCACCATTTGAGGCCTTGTACGGCAAATGTTGTAGATCCCCGGTTTGCTGGGGTGAGGTGGGTGAGTAG
Protein sequence
MVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRWFVENFSLIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFRRWLELVKDY
DCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTIQRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAE
FVSRCLVCQQVKAPTQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQI
ERLNQVLEDMLRVCALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGE
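
The protein sequence corresponds to a codon-by-codon translation of the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the record does not name the table. The `translate` function is a hypothetical helper, and only short fragments from the start and end of the CDS above are used as input.

```python
from itertools import product

# Standard genetic code in NCBI table 1 order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First six codons and last three codons of the CDS shown above.
print(translate("ATGGTTTCTAAGGCTGGA"))
print(translate("GGTGAGTAG"))
```

The first fragment yields the opening residues of the protein (MVSKAG) and the second yields its final two residues (GE) before the TAG stop codon. Passing the full CDS, with line breaks removed, should reproduce the complete protein sequence under the same assumption.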