; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc07g0197541 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc07g0197541
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr07:20846760..20848840
RNA-Seq ExpressionCmc07g0197541
SyntenyCmc07g0197541
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0043227 - membrane-bounded organelle (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040188.1 pol protein [Cucumis melo var. makuwa]0.0e+0098.93Show/hide
Query:  HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYS
        HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYS
Subjt:  HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYS

Query:  DASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVIFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGK
        DASKKGLG VLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVIFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGK
Subjt:  DASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVIFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGK

Query:  ANVVADALSRKVSHSAALITRHAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIGAQSNDPYLVEKRGLAEAGQAEGFSISSDGGLVFERRLCVP
        ANVVADALSRKVSHSAALITRHAPLHRDLERAEIAVSVGA+TMQLAQLTVQPTLRQRIIGAQSNDPYLVEKRGLAEAGQAEGFSISSDGGLVFERRLCVP
Subjt:  ANVVADALSRKVSHSAALITRHAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIGAQSNDPYLVEKRGLAEAGQAEGFSISSDGGLVFERRLCVP

Query:  SDSAIKIELLSEAHSSPFFMHPGSTKMYQDMKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIW
        SDSAIKIELLSEAHSSPFFMHPGSTKMYQD+KRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIW
Subjt:  SDSAIKIELLSEAHSSPFFMHPGSTKMYQDMKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIW

Query:  VVVDRLTKSAHFVPGKSTYIASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSQFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLKDMLRACALEFPG
        VVVDRLTKSAHFVPGKSTYIASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSQFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVL+DMLRACALEFPG
Subjt:  VVVDRLTKSAHFVPGKSTYIASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSQFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLKDMLRACALEFPG

Query:  SWNSHLHLMEFAYNNSYQATIGMAPFEALYEKCCRSLVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAP
        SW+SHLHLMEFAYNNSYQATIGMAPFEALY+KCCRSLVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAP
Subjt:  SWNSHLHLMEFAYNNSYQATIGMAPFEALYEKCCRSLVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAP

Query:  MRGVLRFERRGKLSPRFVGPFEILERIGPVAYRVALPPSLSTVHDVFHVSMLRK
        MRGVLRFERRGKLSPRFVG FEILERIGPVAYRVALPPSLSTVHDVFHVSMLRK
Subjt:  MRGVLRFERRGKLSPRFVGPFEILERIGPVAYRVALPPSLSTVHDVFHVSMLRK

KAA0048687.1 pol protein [Cucumis melo var. makuwa]0.0e+0096.33Show/hide
Query:  HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYS
        HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYS
Subjt:  HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYS

Query:  DASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVIFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGK
        DASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV+FALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGK
Subjt:  DASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVIFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGK

Query:  ANVVADALSRKVSHSAALITRHAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIGAQSNDPYLVEKRGLAEAGQAEGFSISSDGGLVFERRLCVP
        ANVVADALSRKVSHSAALITR APLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRII AQSNDPYLVEKRGLAEAGQA  FS+SSDGGL+FERRLCVP
Subjt:  ANVVADALSRKVSHSAALITRHAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIGAQSNDPYLVEKRGLAEAGQAEGFSISSDGGLVFERRLCVP

Query:  SDSAIKIELLSEAHSSPFFMHPGSTKMYQDMKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIW
        SDS +K ELLSEAHSSPF MHPGSTKMY+D+KRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIW
Subjt:  SDSAIKIELLSEAHSSPFFMHPGSTKMYQDMKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIW

Query:  VVVDRLTKSAHFVPGKSTYIASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSQFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLKDMLRACALEFPG
        VVVDRLTKSAHFVPGKSTY ASKWAQLYMSEIVRLHGVPVSIVSDRDARFTS+FWK LQTAMGTRLDFSTAFHPQTDGQTERLNQVL+ MLRACALEFPG
Subjt:  VVVDRLTKSAHFVPGKSTYIASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSQFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLKDMLRACALEFPG

Query:  SWNSHLHLMEFAYNNSYQATIGMAPFEALYEKCCRSLVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAP
        SW+SHLHLMEF YNNSYQATIGMAPFEALY KCCRS VCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAP
Subjt:  SWNSHLHLMEFAYNNSYQATIGMAPFEALYEKCCRSLVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAP

Query:  MRGVLRFERRGKLSPRFVGPFEILERIGPVAYRVALPPSLSTVHDVFHVSMLRK
        MRGVLRFERRGKLSPRF+GPFEILERIGPVAYR+ALPPSLSTVHDVFHVSMLRK
Subjt:  MRGVLRFERRGKLSPRFVGPFEILERIGPVAYRVALPPSLSTVHDVFHVSMLRK

KAA0051368.1 pol protein [Cucumis melo var. makuwa]0.0e+0096.48Show/hide
Query:  HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYS
        HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYS
Subjt:  HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYS

Query:  DASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVIFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGK
        DASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV+FALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGK
Subjt:  DASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVIFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGK

Query:  ANVVADALSRKVSHSAALITRHAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIGAQSNDPYLVEKRGLAEAGQAEGFSISSDGGLVFERRLCVP
        ANVVADALSRKVSHSAALITR APLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRII AQSNDPYLVEKRGLAEAGQ   FS+SSDGGL+FERRLCVP
Subjt:  ANVVADALSRKVSHSAALITRHAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIGAQSNDPYLVEKRGLAEAGQAEGFSISSDGGLVFERRLCVP

Query:  SDSAIKIELLSEAHSSPFFMHPGSTKMYQDMKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIW
        SDSA+K ELLSEAHSSPF MHPGSTKMYQD+KRVYWWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIW
Subjt:  SDSAIKIELLSEAHSSPFFMHPGSTKMYQDMKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIW

Query:  VVVDRLTKSAHFVPGKSTYIASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSQFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLKDMLRACALEFPG
        VVVDRLTKSAHFVPGKSTY ASKWAQLYMSEIVRLHGVPVSIVSDRDARFTS+FWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVL+DMLRACALEFPG
Subjt:  VVVDRLTKSAHFVPGKSTYIASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSQFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLKDMLRACALEFPG

Query:  SWNSHLHLMEFAYNNSYQATIGMAPFEALYEKCCRSLVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAP
        SW+SHLHLMEFAYNNSYQATIGMAPFEALY +CCRS VCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFE+GDKVFLKVAP
Subjt:  SWNSHLHLMEFAYNNSYQATIGMAPFEALYEKCCRSLVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAP

Query:  MRGVLRFERRGKLSPRFVGPFEILERIGPVAYRVALPPSLSTVHDVFHVSMLRK
        M+GVLRFERRGKLSPRFVGPFEILERIGPVAYR+ALPPSLSTVHDVFHVSMLRK
Subjt:  MRGVLRFERRGKLSPRFVGPFEILERIGPVAYRVALPPSLSTVHDVFHVSMLRK

KAA0057672.1 pol protein [Cucumis melo var. makuwa]0.0e+0095.87Show/hide
Query:  HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYS
        HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSR ATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYS
Subjt:  HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYS

Query:  DASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVIFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGK
        DASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV+FALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGK
Subjt:  DASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVIFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGK

Query:  ANVVADALSRKVSHSAALITRHAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIGAQSNDPYLVEKRGLAEAGQAEGFSISSDGGLVFERRLCVP
        ANVVADALSRKVSHSAALITR APLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRII AQSNDPYLVEKRGLAEAGQA  FS+SSDGGL+FERRLCVP
Subjt:  ANVVADALSRKVSHSAALITRHAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIGAQSNDPYLVEKRGLAEAGQAEGFSISSDGGLVFERRLCVP

Query:  SDSAIKIELLSEAHSSPFFMHPGSTKMYQDMKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIW
        SDSA+K ELL+EAHSSPF MHPGSTKMYQD+KR+YWWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFI GLPRTLRGFTVIW
Subjt:  SDSAIKIELLSEAHSSPFFMHPGSTKMYQDMKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIW

Query:  VVVDRLTKSAHFVPGKSTYIASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSQFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLKDMLRACALEFPG
        VVVDRLTKSAHFVPGKSTY  SKWAQLYMSEIVRLHGVPVSIVSDRDARFTS+FWKGLQTAMGTRLDFSTAFHPQTDGQTERLN+VL+DMLRACALEFPG
Subjt:  VVVDRLTKSAHFVPGKSTYIASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSQFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLKDMLRACALEFPG

Query:  SWNSHLHLMEFAYNNSYQATIGMAPFEALYEKCCRSLVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAP
        SW+SHLHLMEFAYNNSYQATIGMAPFEALY+KCCRS +CWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAP
Subjt:  SWNSHLHLMEFAYNNSYQATIGMAPFEALYEKCCRSLVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAP

Query:  MRGVLRFERRGKLSPRFVGPFEILERIGPVAYRVALPPSLSTVHDVFHVSMLRK
        MRGV+RFERRGKLSPRFVGPFEILERIGPVAYR+ALPPSLSTVHDVFHVSMLRK
Subjt:  MRGVLRFERRGKLSPRFVGPFEILERIGPVAYRVALPPSLSTVHDVFHVSMLRK

KAA0066456.1 pol protein [Cucumis melo var. makuwa]0.0e+0095.41Show/hide
Query:  HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYS
        HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGL GYYRRFVENFSRIATPLTQLTRKG PFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYS
Subjt:  HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYS

Query:  DASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVIFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGK
        DASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV+FALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGK
Subjt:  DASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVIFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGK

Query:  ANVVADALSRKVSHSAALITRHAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIGAQSNDPYLVEKRGLAEAGQAEGFSISSDGGLVFERRLCVP
        ANVVADALSRKVSHS ALITR APLHRDLERAEIAVS+GAVTMQLA+L VQPTLRQRII AQ NDPYLVEKRGL EAGQ   FS+SSDGGL+FERRLCVP
Subjt:  ANVVADALSRKVSHSAALITRHAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIGAQSNDPYLVEKRGLAEAGQAEGFSISSDGGLVFERRLCVP

Query:  SDSAIKIELLSEAHSSPFFMHPGSTKMYQDMKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIW
        SDSA+KIELLSEAHSSPF MHPGSTK+YQD+KRVYWWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIW
Subjt:  SDSAIKIELLSEAHSSPFFMHPGSTKMYQDMKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIW

Query:  VVVDRLTKSAHFVPGKSTYIASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSQFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLKDMLRACALEFPG
        VVVDRLTKSAHFVPGKSTY ASKWAQLYMSEIVRLHGVPVSIVSDRDARFTS+FWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVL+DMLRACALEFPG
Subjt:  VVVDRLTKSAHFVPGKSTYIASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSQFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLKDMLRACALEFPG

Query:  SWNSHLHLMEFAYNNSYQATIGMAPFEALYEKCCRSLVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAP
        SW+SHLHLMEFAYNNSYQATIGMAPFEALY +CCRS VCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAP
Subjt:  SWNSHLHLMEFAYNNSYQATIGMAPFEALYEKCCRSLVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAP

Query:  MRGVLRFERRGKLSPRFVGPFEILERIGPVAYRVALPPSLSTVHDVFHVSMLRK
        MRGV+RFERRGKLSPRFVGPFEILERIGPVAYR+ALPPSLSTVHDVFHVSMLRK
Subjt:  MRGVLRFERRGKLSPRFVGPFEILERIGPVAYRVALPPSLSTVHDVFHVSMLRK

TrEMBL top hitse value%identityAlignment
A0A5A7TB42 Reverse transcriptase0.0e+0098.93Show/hide
Query:  HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYS
        HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYS
Subjt:  HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYS

Query:  DASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVIFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGK
        DASKKGLG VLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVIFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGK
Subjt:  DASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVIFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGK

Query:  ANVVADALSRKVSHSAALITRHAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIGAQSNDPYLVEKRGLAEAGQAEGFSISSDGGLVFERRLCVP
        ANVVADALSRKVSHSAALITRHAPLHRDLERAEIAVSVGA+TMQLAQLTVQPTLRQRIIGAQSNDPYLVEKRGLAEAGQAEGFSISSDGGLVFERRLCVP
Subjt:  ANVVADALSRKVSHSAALITRHAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIGAQSNDPYLVEKRGLAEAGQAEGFSISSDGGLVFERRLCVP

Query:  SDSAIKIELLSEAHSSPFFMHPGSTKMYQDMKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIW
        SDSAIKIELLSEAHSSPFFMHPGSTKMYQD+KRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIW
Subjt:  SDSAIKIELLSEAHSSPFFMHPGSTKMYQDMKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIW

Query:  VVVDRLTKSAHFVPGKSTYIASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSQFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLKDMLRACALEFPG
        VVVDRLTKSAHFVPGKSTYIASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSQFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVL+DMLRACALEFPG
Subjt:  VVVDRLTKSAHFVPGKSTYIASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSQFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLKDMLRACALEFPG

Query:  SWNSHLHLMEFAYNNSYQATIGMAPFEALYEKCCRSLVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAP
        SW+SHLHLMEFAYNNSYQATIGMAPFEALY+KCCRSLVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAP
Subjt:  SWNSHLHLMEFAYNNSYQATIGMAPFEALYEKCCRSLVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAP

Query:  MRGVLRFERRGKLSPRFVGPFEILERIGPVAYRVALPPSLSTVHDVFHVSMLRK
        MRGVLRFERRGKLSPRFVG FEILERIGPVAYRVALPPSLSTVHDVFHVSMLRK
Subjt:  MRGVLRFERRGKLSPRFVGPFEILERIGPVAYRVALPPSLSTVHDVFHVSMLRK

A0A5A7U330 Reverse transcriptase0.0e+0096.33Show/hide
Query:  HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYS
        HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYS
Subjt:  HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYS

Query:  DASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVIFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGK
        DASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV+FALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGK
Subjt:  DASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVIFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGK

Query:  ANVVADALSRKVSHSAALITRHAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIGAQSNDPYLVEKRGLAEAGQAEGFSISSDGGLVFERRLCVP
        ANVVADALSRKVSHSAALITR APLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRII AQSNDPYLVEKRGLAEAGQA  FS+SSDGGL+FERRLCVP
Subjt:  ANVVADALSRKVSHSAALITRHAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIGAQSNDPYLVEKRGLAEAGQAEGFSISSDGGLVFERRLCVP

Query:  SDSAIKIELLSEAHSSPFFMHPGSTKMYQDMKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIW
        SDS +K ELLSEAHSSPF MHPGSTKMY+D+KRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIW
Subjt:  SDSAIKIELLSEAHSSPFFMHPGSTKMYQDMKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIW

Query:  VVVDRLTKSAHFVPGKSTYIASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSQFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLKDMLRACALEFPG
        VVVDRLTKSAHFVPGKSTY ASKWAQLYMSEIVRLHGVPVSIVSDRDARFTS+FWK LQTAMGTRLDFSTAFHPQTDGQTERLNQVL+ MLRACALEFPG
Subjt:  VVVDRLTKSAHFVPGKSTYIASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSQFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLKDMLRACALEFPG

Query:  SWNSHLHLMEFAYNNSYQATIGMAPFEALYEKCCRSLVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAP
        SW+SHLHLMEF YNNSYQATIGMAPFEALY KCCRS VCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAP
Subjt:  SWNSHLHLMEFAYNNSYQATIGMAPFEALYEKCCRSLVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAP

Query:  MRGVLRFERRGKLSPRFVGPFEILERIGPVAYRVALPPSLSTVHDVFHVSMLRK
        MRGVLRFERRGKLSPRF+GPFEILERIGPVAYR+ALPPSLSTVHDVFHVSMLRK
Subjt:  MRGVLRFERRGKLSPRFVGPFEILERIGPVAYRVALPPSLSTVHDVFHVSMLRK

A0A5A7U7V9 Reverse transcriptase0.0e+0096.48Show/hide
Query:  HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYS
        HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYS
Subjt:  HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYS

Query:  DASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVIFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGK
        DASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV+FALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGK
Subjt:  DASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVIFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGK

Query:  ANVVADALSRKVSHSAALITRHAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIGAQSNDPYLVEKRGLAEAGQAEGFSISSDGGLVFERRLCVP
        ANVVADALSRKVSHSAALITR APLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRII AQSNDPYLVEKRGLAEAGQ   FS+SSDGGL+FERRLCVP
Subjt:  ANVVADALSRKVSHSAALITRHAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIGAQSNDPYLVEKRGLAEAGQAEGFSISSDGGLVFERRLCVP

Query:  SDSAIKIELLSEAHSSPFFMHPGSTKMYQDMKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIW
        SDSA+K ELLSEAHSSPF MHPGSTKMYQD+KRVYWWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIW
Subjt:  SDSAIKIELLSEAHSSPFFMHPGSTKMYQDMKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIW

Query:  VVVDRLTKSAHFVPGKSTYIASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSQFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLKDMLRACALEFPG
        VVVDRLTKSAHFVPGKSTY ASKWAQLYMSEIVRLHGVPVSIVSDRDARFTS+FWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVL+DMLRACALEFPG
Subjt:  VVVDRLTKSAHFVPGKSTYIASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSQFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLKDMLRACALEFPG

Query:  SWNSHLHLMEFAYNNSYQATIGMAPFEALYEKCCRSLVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAP
        SW+SHLHLMEFAYNNSYQATIGMAPFEALY +CCRS VCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFE+GDKVFLKVAP
Subjt:  SWNSHLHLMEFAYNNSYQATIGMAPFEALYEKCCRSLVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAP

Query:  MRGVLRFERRGKLSPRFVGPFEILERIGPVAYRVALPPSLSTVHDVFHVSMLRK
        M+GVLRFERRGKLSPRFVGPFEILERIGPVAYR+ALPPSLSTVHDVFHVSMLRK
Subjt:  MRGVLRFERRGKLSPRFVGPFEILERIGPVAYRVALPPSLSTVHDVFHVSMLRK

A0A5A7UP94 Pol protein0.0e+0095.87Show/hide
Query:  HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYS
        HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSR ATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYS
Subjt:  HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYS

Query:  DASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVIFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGK
        DASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV+FALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGK
Subjt:  DASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVIFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGK

Query:  ANVVADALSRKVSHSAALITRHAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIGAQSNDPYLVEKRGLAEAGQAEGFSISSDGGLVFERRLCVP
        ANVVADALSRKVSHSAALITR APLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRII AQSNDPYLVEKRGLAEAGQA  FS+SSDGGL+FERRLCVP
Subjt:  ANVVADALSRKVSHSAALITRHAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIGAQSNDPYLVEKRGLAEAGQAEGFSISSDGGLVFERRLCVP

Query:  SDSAIKIELLSEAHSSPFFMHPGSTKMYQDMKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIW
        SDSA+K ELL+EAHSSPF MHPGSTKMYQD+KR+YWWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFI GLPRTLRGFTVIW
Subjt:  SDSAIKIELLSEAHSSPFFMHPGSTKMYQDMKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIW

Query:  VVVDRLTKSAHFVPGKSTYIASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSQFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLKDMLRACALEFPG
        VVVDRLTKSAHFVPGKSTY  SKWAQLYMSEIVRLHGVPVSIVSDRDARFTS+FWKGLQTAMGTRLDFSTAFHPQTDGQTERLN+VL+DMLRACALEFPG
Subjt:  VVVDRLTKSAHFVPGKSTYIASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSQFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLKDMLRACALEFPG

Query:  SWNSHLHLMEFAYNNSYQATIGMAPFEALYEKCCRSLVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAP
        SW+SHLHLMEFAYNNSYQATIGMAPFEALY+KCCRS +CWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAP
Subjt:  SWNSHLHLMEFAYNNSYQATIGMAPFEALYEKCCRSLVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAP

Query:  MRGVLRFERRGKLSPRFVGPFEILERIGPVAYRVALPPSLSTVHDVFHVSMLRK
        MRGV+RFERRGKLSPRFVGPFEILERIGPVAYR+ALPPSLSTVHDVFHVSMLRK
Subjt:  MRGVLRFERRGKLSPRFVGPFEILERIGPVAYRVALPPSLSTVHDVFHVSMLRK

A0A5A7VJE2 Reverse transcriptase0.0e+0095.41Show/hide
Query:  HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYS
        HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGL GYYRRFVENFSRIATPLTQLTRKG PFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYS
Subjt:  HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYS

Query:  DASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVIFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGK
        DASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV+FALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGK
Subjt:  DASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVIFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGK

Query:  ANVVADALSRKVSHSAALITRHAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIGAQSNDPYLVEKRGLAEAGQAEGFSISSDGGLVFERRLCVP
        ANVVADALSRKVSHS ALITR APLHRDLERAEIAVS+GAVTMQLA+L VQPTLRQRII AQ NDPYLVEKRGL EAGQ   FS+SSDGGL+FERRLCVP
Subjt:  ANVVADALSRKVSHSAALITRHAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIGAQSNDPYLVEKRGLAEAGQAEGFSISSDGGLVFERRLCVP

Query:  SDSAIKIELLSEAHSSPFFMHPGSTKMYQDMKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIW
        SDSA+KIELLSEAHSSPF MHPGSTK+YQD+KRVYWWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIW
Subjt:  SDSAIKIELLSEAHSSPFFMHPGSTKMYQDMKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIW

Query:  VVVDRLTKSAHFVPGKSTYIASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSQFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLKDMLRACALEFPG
        VVVDRLTKSAHFVPGKSTY ASKWAQLYMSEIVRLHGVPVSIVSDRDARFTS+FWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVL+DMLRACALEFPG
Subjt:  VVVDRLTKSAHFVPGKSTYIASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSQFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLKDMLRACALEFPG

Query:  SWNSHLHLMEFAYNNSYQATIGMAPFEALYEKCCRSLVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAP
        SW+SHLHLMEFAYNNSYQATIGMAPFEALY +CCRS VCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAP
Subjt:  SWNSHLHLMEFAYNNSYQATIGMAPFEALYEKCCRSLVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAP

Query:  MRGVLRFERRGKLSPRFVGPFEILERIGPVAYRVALPPSLSTVHDVFHVSMLRK
        MRGV+RFERRGKLSPRFVGPFEILERIGPVAYR+ALPPSLSTVHDVFHVSMLRK
Subjt:  MRGVLRFERRGKLSPRFVGPFEILERIGPVAYRVALPPSLSTVHDVFHVSMLRK

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein6.2e-9131.01Show/hide
Query:  ELRGVTHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSG
        +++ + + +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + +N+KQ LV+ PVL   D S 
Subjt:  ELRGVTHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSG

Query:  SFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVIFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLE
          ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A+I +LK WRHYL    E  +I TDH++L    T +    N R  RW  
Subjt:  SFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVIFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLE

Query:  LVKDYDCEILYHPGKANVVADALSRKVSHSAALITRHAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIGAQSNDPYLVEKRGLAEAGQAEGFSI
         ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++   +ND  L+      +    E   +
Subjt:  LVKDYDCEILYHPGKANVVADALSRKVSHSAALITRHAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIGAQSNDPYLVEKRGLAEAGQAEGFSI

Query:  SSDGGLVFER-RLCVPSDSAIKIELLSEAHSSPFFMHPGSTKMYQDMKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMD
          DG L+  + ++ +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  WE++SMD
Subjt:  SSDGGLVFER-RLCVPSDSAIKIELLSEAHSSPFFMHPGSTKMYQDMKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMD

Query:  FITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYIASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSQFWKGLQTAMGTRLDFSTAFHPQTDGQTERLN
        FIT LP +  G+  ++VVVDR +K A  VP   +  A + A+++   ++   G P  I++D D  FTSQ WK         + FS  + PQTDGQTER N
Subjt:  FITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYIASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSQFWKGLQTAMGTRLDFSTAFHPQTDGQTERLN

Query:  QVLKDMLRACALEFPGSWNSHLHLMEFAYNNSYQATIGMAPFEAL--YEKCCRSLVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVR
        Q ++ +LR      P +W  H+ L++ +YNN+  +   M PFE +  Y      L    E+        E  Q T +  Q ++  ++T   + K Y D++
Subjt:  QVLKDMLRACALEFPGSWNSHLHLMEFAYNNSYQATIGMAPFEAL--YEKCCRSLVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVR

Query:  RKDL-EFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRVALPPSLSTV-HDVFHVSMLRK
         +++ EF+ GD V +K     G L   +  KL+P F GPF +L++ GP  Y + LP S+  +    FHVS L K
Subjt:  RKDL-EFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRVALPPSLSTV-HDVFHVSMLRK

P0CT35 Transposon Tf2-2 polyprotein6.2e-9131.01Show/hide
Query:  ELRGVTHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSG
        +++ + + +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + +N+KQ LV+ PVL   D S 
Subjt:  ELRGVTHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSG

Query:  SFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVIFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLE
          ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A+I +LK WRHYL    E  +I TDH++L    T +    N R  RW  
Subjt:  SFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVIFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLE

Query:  LVKDYDCEILYHPGKANVVADALSRKVSHSAALITRHAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIGAQSNDPYLVEKRGLAEAGQAEGFSI
         ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++   +ND  L+      +    E   +
Subjt:  LVKDYDCEILYHPGKANVVADALSRKVSHSAALITRHAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIGAQSNDPYLVEKRGLAEAGQAEGFSI

Query:  SSDGGLVFER-RLCVPSDSAIKIELLSEAHSSPFFMHPGSTKMYQDMKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMD
          DG L+  + ++ +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  WE++SMD
Subjt:  SSDGGLVFER-RLCVPSDSAIKIELLSEAHSSPFFMHPGSTKMYQDMKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMD

Query:  FITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYIASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSQFWKGLQTAMGTRLDFSTAFHPQTDGQTERLN
        FIT LP +  G+  ++VVVDR +K A  VP   +  A + A+++   ++   G P  I++D D  FTSQ WK         + FS  + PQTDGQTER N
Subjt:  FITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYIASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSQFWKGLQTAMGTRLDFSTAFHPQTDGQTERLN

Query:  QVLKDMLRACALEFPGSWNSHLHLMEFAYNNSYQATIGMAPFEAL--YEKCCRSLVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVR
        Q ++ +LR      P +W  H+ L++ +YNN+  +   M PFE +  Y      L    E+        E  Q T +  Q ++  ++T   + K Y D++
Subjt:  QVLKDMLRACALEFPGSWNSHLHLMEFAYNNSYQATIGMAPFEAL--YEKCCRSLVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVR

Query:  RKDL-EFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRVALPPSLSTV-HDVFHVSMLRK
         +++ EF+ GD V +K     G L   +  KL+P F GPF +L++ GP  Y + LP S+  +    FHVS L K
Subjt:  RKDL-EFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRVALPPSLSTV-HDVFHVSMLRK

P0CT36 Transposon Tf2-3 polyprotein6.2e-9131.01Show/hide
Query:  ELRGVTHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSG
        +++ + + +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + +N+KQ LV+ PVL   D S 
Subjt:  ELRGVTHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSG

Query:  SFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVIFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLE
          ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A+I +LK WRHYL    E  +I TDH++L    T +    N R  RW  
Subjt:  SFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVIFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLE

Query:  LVKDYDCEILYHPGKANVVADALSRKVSHSAALITRHAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIGAQSNDPYLVEKRGLAEAGQAEGFSI
         ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++   +ND  L+      +    E   +
Subjt:  LVKDYDCEILYHPGKANVVADALSRKVSHSAALITRHAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIGAQSNDPYLVEKRGLAEAGQAEGFSI

Query:  SSDGGLVFER-RLCVPSDSAIKIELLSEAHSSPFFMHPGSTKMYQDMKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMD
          DG L+  + ++ +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  WE++SMD
Subjt:  SSDGGLVFER-RLCVPSDSAIKIELLSEAHSSPFFMHPGSTKMYQDMKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMD

Query:  FITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYIASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSQFWKGLQTAMGTRLDFSTAFHPQTDGQTERLN
        FIT LP +  G+  ++VVVDR +K A  VP   +  A + A+++   ++   G P  I++D D  FTSQ WK         + FS  + PQTDGQTER N
Subjt:  FITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYIASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSQFWKGLQTAMGTRLDFSTAFHPQTDGQTERLN

Query:  QVLKDMLRACALEFPGSWNSHLHLMEFAYNNSYQATIGMAPFEAL--YEKCCRSLVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVR
        Q ++ +LR      P +W  H+ L++ +YNN+  +   M PFE +  Y      L    E+        E  Q T +  Q ++  ++T   + K Y D++
Subjt:  QVLKDMLRACALEFPGSWNSHLHLMEFAYNNSYQATIGMAPFEAL--YEKCCRSLVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVR

Query:  RKDL-EFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRVALPPSLSTV-HDVFHVSMLRK
         +++ EF+ GD V +K     G L   +  KL+P F GPF +L++ GP  Y + LP S+  +    FHVS L K
Subjt:  RKDL-EFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRVALPPSLSTV-HDVFHVSMLRK

P0CT41 Transposon Tf2-12 polyprotein6.2e-9131.01Show/hide
Query:  ELRGVTHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSG
        +++ + + +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + +N+KQ LV+ PVL   D S 
Subjt:  ELRGVTHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSG

Query:  SFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVIFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLE
          ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A+I +LK WRHYL    E  +I TDH++L    T +    N R  RW  
Subjt:  SFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVIFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLE

Query:  LVKDYDCEILYHPGKANVVADALSRKVSHSAALITRHAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIGAQSNDPYLVEKRGLAEAGQAEGFSI
         ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++   +ND  L+      +    E   +
Subjt:  LVKDYDCEILYHPGKANVVADALSRKVSHSAALITRHAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIGAQSNDPYLVEKRGLAEAGQAEGFSI

Query:  SSDGGLVFER-RLCVPSDSAIKIELLSEAHSSPFFMHPGSTKMYQDMKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMD
          DG L+  + ++ +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  WE++SMD
Subjt:  SSDGGLVFER-RLCVPSDSAIKIELLSEAHSSPFFMHPGSTKMYQDMKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMD

Query:  FITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYIASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSQFWKGLQTAMGTRLDFSTAFHPQTDGQTERLN
        FIT LP +  G+  ++VVVDR +K A  VP   +  A + A+++   ++   G P  I++D D  FTSQ WK         + FS  + PQTDGQTER N
Subjt:  FITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYIASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSQFWKGLQTAMGTRLDFSTAFHPQTDGQTERLN

Query:  QVLKDMLRACALEFPGSWNSHLHLMEFAYNNSYQATIGMAPFEAL--YEKCCRSLVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVR
        Q ++ +LR      P +W  H+ L++ +YNN+  +   M PFE +  Y      L    E+        E  Q T +  Q ++  ++T   + K Y D++
Subjt:  QVLKDMLRACALEFPGSWNSHLHLMEFAYNNSYQATIGMAPFEAL--YEKCCRSLVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVR

Query:  RKDL-EFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRVALPPSLSTV-HDVFHVSMLRK
         +++ EF+ GD V +K     G L   +  KL+P F GPF +L++ GP  Y + LP S+  +    FHVS L K
Subjt:  RKDL-EFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRVALPPSLSTV-HDVFHVSMLRK

Q9UR07 Transposon Tf2-11 polyprotein6.2e-9131.01Show/hide
Query:  ELRGVTHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSG
        +++ + + +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + +N+KQ LV+ PVL   D S 
Subjt:  ELRGVTHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSG

Query:  SFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVIFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLE
          ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A+I +LK WRHYL    E  +I TDH++L    T +    N R  RW  
Subjt:  SFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVIFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLE

Query:  LVKDYDCEILYHPGKANVVADALSRKVSHSAALITRHAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIGAQSNDPYLVEKRGLAEAGQAEGFSI
         ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++   +ND  L+      +    E   +
Subjt:  LVKDYDCEILYHPGKANVVADALSRKVSHSAALITRHAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIGAQSNDPYLVEKRGLAEAGQAEGFSI

Query:  SSDGGLVFER-RLCVPSDSAIKIELLSEAHSSPFFMHPGSTKMYQDMKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMD
          DG L+  + ++ +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  WE++SMD
Subjt:  SSDGGLVFER-RLCVPSDSAIKIELLSEAHSSPFFMHPGSTKMYQDMKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMD

Query:  FITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYIASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSQFWKGLQTAMGTRLDFSTAFHPQTDGQTERLN
        FIT LP +  G+  ++VVVDR +K A  VP   +  A + A+++   ++   G P  I++D D  FTSQ WK         + FS  + PQTDGQTER N
Subjt:  FITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYIASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSQFWKGLQTAMGTRLDFSTAFHPQTDGQTERLN

Query:  QVLKDMLRACALEFPGSWNSHLHLMEFAYNNSYQATIGMAPFEAL--YEKCCRSLVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVR
        Q ++ +LR      P +W  H+ L++ +YNN+  +   M PFE +  Y      L    E+        E  Q T +  Q ++  ++T   + K Y D++
Subjt:  QVLKDMLRACALEFPGSWNSHLHLMEFAYNNSYQATIGMAPFEAL--YEKCCRSLVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVR

Query:  RKDL-EFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRVALPPSLSTV-HDVFHVSMLRK
         +++ EF+ GD V +K     G L   +  KL+P F GPF +L++ GP  Y + LP S+  +    FHVS L K
Subjt:  RKDL-EFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRVALPPSLSTV-HDVFHVSMLRK

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein4.1e-2147Show/hide
Query:  GVTHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFV
        G  H++S  GVS DPAK+EA+ GW  P   +E+R FLGL GYYRRFV+N+ +I  PLT+L +K +   W++    +F+ LK  + T PVL +PD    FV
Subjt:  GVTHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTAAAATTTTACCTCGAACTTCGGGGCGTTACACACGTGGTTTCTAAGGCTGGAGTCTCTGTGGATCCAGCTAAGATAGAGGCAGTCACCGGTTGGACCCGACCTTC
CACAGTCAGTGAGGTTCGTAGCTTTCTGGGTTTAGCAGGGTATTATCGACGGTTTGTGGAGAACTTTTCTCGTATAGCTACTCCTCTCACTCAGTTGACCAGGAAGGGAG
CTCCTTTTGTTTGGAGCAAGGCGTGTGAGGATAGTTTCCAGAACCTTAAACAGAAGCTAGTTACCGCACCGGTTCTTACTGTACCTGATGGTTCTGGCAGTTTCGTGATT
TATAGTGATGCTTCCAAGAAGGGTTTGGGTTGTGTTTTGATGCAACAGGGTAAGGTGGTCGCTTATGCGTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACACA
CGATTTAGAGTTGGCAGCGGTGATTTTTGCTTTAAAAATATGGAGGCATTATTTATATGGTGAAAAGATACAGATATTCACGGATCATAAGAGCTTGAAATACTTCTTTA
CTCAGAAAGAACTGAATATGAGACAGCGAAGGTGGCTTGAGTTAGTGAAGGATTACGATTGTGAGATACTGTATCATCCAGGCAAGGCAAATGTGGTAGCTGATGCTCTT
AGTAGAAAGGTATCACATTCAGCAGCACTTATTACCCGACATGCCCCATTGCATCGGGATCTCGAGCGGGCTGAGATTGCAGTGTCAGTGGGGGCAGTCACTATGCAGTT
AGCCCAGTTGACGGTACAGCCGACTTTGAGGCAAAGGATCATTGGTGCTCAGAGTAACGATCCTTATTTGGTTGAGAAACGTGGCCTAGCAGAGGCAGGGCAAGCGGAGG
GGTTCTCCATATCCTCTGATGGTGGACTTGTGTTTGAGAGACGCCTCTGTGTGCCGTCAGATAGTGCGATTAAGATAGAATTATTATCTGAGGCTCACAGTTCCCCATTT
TTCATGCACCCGGGTAGTACGAAGATGTATCAGGACATGAAGCGGGTTTATTGGTGGCGTAACATGAAGAGGGAGGTAGCAGAATTTGTTAGTAGATGCTTGGTGTGTCA
GCAGGTTAAGGCACCAAGGCAGAAACCAGCGGGTTTATTACAACCCTTGAGCATACCGGAATGGAAGTGGGAAAACGTATCCATGGATTTCATTACAGGACTGCCGAGAA
CTCTGAGGGGTTTTACAGTGATTTGGGTTGTGGTGGACAGGCTTACCAAATCAGCGCACTTCGTTCCGGGTAAATCCACCTATATTGCTAGTAAGTGGGCACAGCTGTAC
ATGTCGGAGATAGTGAGATTACATGGAGTGCCAGTGTCGATTGTTTCTGATAGAGATGCTCGTTTCACTTCCCAATTCTGGAAGGGTTTGCAGACTGCTATGGGCACGAG
GTTAGACTTTAGTACAGCTTTCCATCCACAGACTGACGGTCAGACTGAGCGTCTGAACCAAGTTTTAAAGGATATGTTGCGGGCGTGTGCATTGGAATTTCCAGGTAGCT
GGAACTCCCATTTGCATTTGATGGAATTTGCTTATAATAACAGTTATCAGGCTACTATTGGCATGGCACCATTTGAGGCCTTGTACGAAAAATGTTGTAGATCCCTGGTT
TGCTGGGGTGAGGTGGGTGAGCAGAGATTGATGGGCCCTGAGTTAGTTCAGTCTACTAACGAAGCGATACAGAAGATTAGATCACGCATGCATACCGCTCAGAGTAGGCA
GAAGAGTTATGCAGATGTGAGGCGGAAGGATCTCGAATTTGAGGTAGGGGACAAGGTGTTCCTAAAGGTAGCACCTATGAGAGGTGTCTTACGATTTGAAAGGAGGGGAA
AGTTGAGTCCCCGTTTTGTTGGGCCGTTCGAGATTCTAGAGCGGATTGGCCCTGTAGCTTATCGCGTGGCGTTGCCACCATCACTCTCGACAGTTCATGATGTGTTTCAT
GTTTCTATGTTGAGGAAAACAACCTGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTAAAATTTTACCTCGAACTTCGGGGCGTTACACACGTGGTTTCTAAGGCTGGAGTCTCTGTGGATCCAGCTAAGATAGAGGCAGTCACCGGTTGGACCCGACCTTC
CACAGTCAGTGAGGTTCGTAGCTTTCTGGGTTTAGCAGGGTATTATCGACGGTTTGTGGAGAACTTTTCTCGTATAGCTACTCCTCTCACTCAGTTGACCAGGAAGGGAG
CTCCTTTTGTTTGGAGCAAGGCGTGTGAGGATAGTTTCCAGAACCTTAAACAGAAGCTAGTTACCGCACCGGTTCTTACTGTACCTGATGGTTCTGGCAGTTTCGTGATT
TATAGTGATGCTTCCAAGAAGGGTTTGGGTTGTGTTTTGATGCAACAGGGTAAGGTGGTCGCTTATGCGTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACACA
CGATTTAGAGTTGGCAGCGGTGATTTTTGCTTTAAAAATATGGAGGCATTATTTATATGGTGAAAAGATACAGATATTCACGGATCATAAGAGCTTGAAATACTTCTTTA
CTCAGAAAGAACTGAATATGAGACAGCGAAGGTGGCTTGAGTTAGTGAAGGATTACGATTGTGAGATACTGTATCATCCAGGCAAGGCAAATGTGGTAGCTGATGCTCTT
AGTAGAAAGGTATCACATTCAGCAGCACTTATTACCCGACATGCCCCATTGCATCGGGATCTCGAGCGGGCTGAGATTGCAGTGTCAGTGGGGGCAGTCACTATGCAGTT
AGCCCAGTTGACGGTACAGCCGACTTTGAGGCAAAGGATCATTGGTGCTCAGAGTAACGATCCTTATTTGGTTGAGAAACGTGGCCTAGCAGAGGCAGGGCAAGCGGAGG
GGTTCTCCATATCCTCTGATGGTGGACTTGTGTTTGAGAGACGCCTCTGTGTGCCGTCAGATAGTGCGATTAAGATAGAATTATTATCTGAGGCTCACAGTTCCCCATTT
TTCATGCACCCGGGTAGTACGAAGATGTATCAGGACATGAAGCGGGTTTATTGGTGGCGTAACATGAAGAGGGAGGTAGCAGAATTTGTTAGTAGATGCTTGGTGTGTCA
GCAGGTTAAGGCACCAAGGCAGAAACCAGCGGGTTTATTACAACCCTTGAGCATACCGGAATGGAAGTGGGAAAACGTATCCATGGATTTCATTACAGGACTGCCGAGAA
CTCTGAGGGGTTTTACAGTGATTTGGGTTGTGGTGGACAGGCTTACCAAATCAGCGCACTTCGTTCCGGGTAAATCCACCTATATTGCTAGTAAGTGGGCACAGCTGTAC
ATGTCGGAGATAGTGAGATTACATGGAGTGCCAGTGTCGATTGTTTCTGATAGAGATGCTCGTTTCACTTCCCAATTCTGGAAGGGTTTGCAGACTGCTATGGGCACGAG
GTTAGACTTTAGTACAGCTTTCCATCCACAGACTGACGGTCAGACTGAGCGTCTGAACCAAGTTTTAAAGGATATGTTGCGGGCGTGTGCATTGGAATTTCCAGGTAGCT
GGAACTCCCATTTGCATTTGATGGAATTTGCTTATAATAACAGTTATCAGGCTACTATTGGCATGGCACCATTTGAGGCCTTGTACGAAAAATGTTGTAGATCCCTGGTT
TGCTGGGGTGAGGTGGGTGAGCAGAGATTGATGGGCCCTGAGTTAGTTCAGTCTACTAACGAAGCGATACAGAAGATTAGATCACGCATGCATACCGCTCAGAGTAGGCA
GAAGAGTTATGCAGATGTGAGGCGGAAGGATCTCGAATTTGAGGTAGGGGACAAGGTGTTCCTAAAGGTAGCACCTATGAGAGGTGTCTTACGATTTGAAAGGAGGGGAA
AGTTGAGTCCCCGTTTTGTTGGGCCGTTCGAGATTCTAGAGCGGATTGGCCCTGTAGCTTATCGCGTGGCGTTGCCACCATCACTCTCGACAGTTCATGATGTGTTTCAT
GTTTCTATGTTGAGGAAAACAACCTGTTGA
Protein sequenceShow/hide protein sequence
MLKFYLELRGVTHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVI
YSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVIFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADAL
SRKVSHSAALITRHAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIGAQSNDPYLVEKRGLAEAGQAEGFSISSDGGLVFERRLCVPSDSAIKIELLSEAHSSPF
FMHPGSTKMYQDMKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYIASKWAQLY
MSEIVRLHGVPVSIVSDRDARFTSQFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLKDMLRACALEFPGSWNSHLHLMEFAYNNSYQATIGMAPFEALYEKCCRSLV
CWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRVALPPSLSTVHDVFH
VSMLRKTTC