; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc03g0067351 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc03g0067351
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr03:10081531..10082841
RNA-Seq ExpressionCmc03g0067351
SyntenyCmc03g0067351
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026271.1 pol protein [Cucumis melo var. makuwa]4.0e-23795.87Show/hide
Query:  MVPAELKELKVQLQELLDNGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRHPLPRIDHLFDQLQGATVFSKIDLRSGYHQLRIKYGDVPK
        M PAELKELKVQLQ+LLD GFIR SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNR+PLPRID LFDQLQGATVFSKIDLRSGYHQLRIK GDVPK
Subjt:  MVPAELKELKVQLQELLDNGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRHPLPRIDHLFDQLQGATVFSKIDLRSGYHQLRIKYGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRHNKLYAKFSKYEFWLKQVSFLGHMVSKAG
        TAFRSRYGHYEFIVMSFGLTNAP VFMDLMN VFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLR NKLYAKFSK EFWLKQVSFLGH+VSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRHNKLYAKFSKYEFWLKQVSFLGHMVSKAG

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTGWTRPSTVSEVRSFLGL GYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYSTHDLKLAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQQRWLELVKDYDCEILYHPGKANVVADA
        GCVLMQQGKVVAYASRQLKSHEQNY THDL+LAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQ+RWLELVKDYDCEILYHPGKANVVADA
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYSTHDLKLAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQQRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLLRDLERAEIAVSVGAVTI
        LSRKVSHSAALITRQAPL RDLERAEIAVSVGAVT+
Subjt:  LSRKVSHSAALITRQAPLLRDLERAEIAVSVGAVTI

KAA0046185.1 pol protein [Cucumis melo var. makuwa]4.4e-23695.64Show/hide
Query:  MVPAELKELKVQLQELLDNGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRHPLPRIDHLFDQLQGATVFSKIDLRSGYHQLRIKYGDVPK
        M PAELKELKVQLQELLD GFIRPSVSPWGAPVLFVKKKD SMRLCIDYRELNKVTVKNR+PLPRID LFDQLQGATVFSKIDLRSGYHQLRIK GDVPK
Subjt:  MVPAELKELKVQLQELLDNGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRHPLPRIDHLFDQLQGATVFSKIDLRSGYHQLRIKYGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRHNKLYAKFSKYEFWLKQVSFLGHMVSKAG
        TAFRSRYGHYEFIVMSFGLTNAP VFMDLMN VFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTLR NKLYAKFSK EFWLKQVSFLGHMVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRHNKLYAKFSKYEFWLKQVSFLGHMVSKAG

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTGWTRPSTVSEVRSFLGL GYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTV DGSGSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYSTHDLKLAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQQRWLELVKDYDCEILYHPGKANVVADA
        GCVLMQQGKVVAYASRQLKSHEQNY THDL+LAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQ+RWLELVKDYDCEILYHPGKANVV DA
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYSTHDLKLAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQQRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLLRDLERAEIAVSVGAVTI
        LSRKVSHSAALITRQAPL RDLERAEIAVSVGAVT+
Subjt:  LSRKVSHSAALITRQAPLLRDLERAEIAVSVGAVTI

KAA0048687.1 pol protein [Cucumis melo var. makuwa]3.1e-23795.87Show/hide
Query:  MVPAELKELKVQLQELLDNGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRHPLPRIDHLFDQLQGATVFSKIDLRSGYHQLRIKYGDVPK
        M PAELKELKVQLQELLD GFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNR+PLPRID LFDQLQGATVFSKIDLRSGYHQLRIK  DVPK
Subjt:  MVPAELKELKVQLQELLDNGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRHPLPRIDHLFDQLQGATVFSKIDLRSGYHQLRIKYGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRHNKLYAKFSKYEFWLKQVSFLGHMVSKAG
        TAFRSRYGHYEFIVMSFGLTNAP VFMDLMN VFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTLR NKLYAKFSK EFWLKQVSFLGH+VSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRHNKLYAKFSKYEFWLKQVSFLGHMVSKAG

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTGWTRPSTVSEVRSFLGL GYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYSTHDLKLAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQQRWLELVKDYDCEILYHPGKANVVADA
        GCVLMQQGKVVAYASRQLKSHEQNY THDL+LAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQ+RWLELVKDYDCEILYHPGKANVVADA
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYSTHDLKLAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQQRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLLRDLERAEIAVSVGAVTI
        LSRKVSHSAALITRQAPL RDLERAEIAVSVGAVT+
Subjt:  LSRKVSHSAALITRQAPLLRDLERAEIAVSVGAVTI

KAA0056702.1 pol protein [Cucumis melo var. makuwa]6.2e-23895.64Show/hide
Query:  MVPAELKELKVQLQELLDNGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRHPLPRIDHLFDQLQGATVFSKIDLRSGYHQLRIKYGDVPK
        M PAELKELKVQLQ+LLD GFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNR+PLPRID LFDQLQGATVFSKIDLRSGYHQLRIK GDVPK
Subjt:  MVPAELKELKVQLQELLDNGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRHPLPRIDHLFDQLQGATVFSKIDLRSGYHQLRIKYGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRHNKLYAKFSKYEFWLKQVSFLGHMVSKAG
        TAFRSRYGHYEFIVMSFGLTNAP VFMDLMN VFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTLR NKLYA+FSKYEFWLKQVSFLGH+VSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRHNKLYAKFSKYEFWLKQVSFLGHMVSKAG

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL
        VSVDPAKIEAV GWTRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPV+TVPDGSGSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYSTHDLKLAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQQRWLELVKDYDCEILYHPGKANVVADA
        GCVLMQQGKVVAYASRQLKSHEQNY THDL+LAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQ+RWLELVKDYDCEILYHPGKANVVADA
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYSTHDLKLAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQQRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLLRDLERAEIAVSVGAVTI
        LSRKVSHSAALITRQAPL RDLERAEIAVSVGAVT+
Subjt:  LSRKVSHSAALITRQAPLLRDLERAEIAVSVGAVTI

TYK01613.1 pol protein [Cucumis melo var. makuwa]1.5e-23695.64Show/hide
Query:  MVPAELKELKVQLQELLDNGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRHPLPRIDHLFDQLQGATVFSKIDLRSGYHQLRIKYGDVPK
        M PAELKELKVQLQELLD GFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNR+PLPRID LFDQLQGATVFSKIDLRSGYHQLRIK  DVPK
Subjt:  MVPAELKELKVQLQELLDNGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRHPLPRIDHLFDQLQGATVFSKIDLRSGYHQLRIKYGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRHNKLYAKFSKYEFWLKQVSFLGHMVSKAG
        TAFRSRYGHYEFIVMSFGLTNAP VFMDLMN VFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTLR NKLYAKFSK EFWLKQVSFLGH+VSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRHNKLYAKFSKYEFWLKQVSFLGHMVSKAG

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTGWTRPSTVSEVRSFLGL GYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQ LKQKLVTAPVLTVPDGSGSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYSTHDLKLAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQQRWLELVKDYDCEILYHPGKANVVADA
        GCVLMQQGKVVAYASRQLKSHEQNY THDL+LAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQ+RWLELVKDYDCEILYHPGKANVVADA
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYSTHDLKLAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQQRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLLRDLERAEIAVSVGAVTI
        LSRKVSHSAALITRQAPL RDLERAEIAVSVGAVT+
Subjt:  LSRKVSHSAALITRQAPLLRDLERAEIAVSVGAVTI

TrEMBL top hitse value%identityAlignment
A0A5A7SPZ2 Pol protein1.9e-23795.87Show/hide
Query:  MVPAELKELKVQLQELLDNGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRHPLPRIDHLFDQLQGATVFSKIDLRSGYHQLRIKYGDVPK
        M PAELKELKVQLQ+LLD GFIR SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNR+PLPRID LFDQLQGATVFSKIDLRSGYHQLRIK GDVPK
Subjt:  MVPAELKELKVQLQELLDNGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRHPLPRIDHLFDQLQGATVFSKIDLRSGYHQLRIKYGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRHNKLYAKFSKYEFWLKQVSFLGHMVSKAG
        TAFRSRYGHYEFIVMSFGLTNAP VFMDLMN VFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLR NKLYAKFSK EFWLKQVSFLGH+VSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRHNKLYAKFSKYEFWLKQVSFLGHMVSKAG

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTGWTRPSTVSEVRSFLGL GYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYSTHDLKLAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQQRWLELVKDYDCEILYHPGKANVVADA
        GCVLMQQGKVVAYASRQLKSHEQNY THDL+LAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQ+RWLELVKDYDCEILYHPGKANVVADA
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYSTHDLKLAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQQRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLLRDLERAEIAVSVGAVTI
        LSRKVSHSAALITRQAPL RDLERAEIAVSVGAVT+
Subjt:  LSRKVSHSAALITRQAPLLRDLERAEIAVSVGAVTI

A0A5A7TXM6 Reverse transcriptase2.1e-23695.64Show/hide
Query:  MVPAELKELKVQLQELLDNGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRHPLPRIDHLFDQLQGATVFSKIDLRSGYHQLRIKYGDVPK
        M PAELKELKVQLQELLD GFIRPSVSPWGAPVLFVKKKD SMRLCIDYRELNKVTVKNR+PLPRID LFDQLQGATVFSKIDLRSGYHQLRIK GDVPK
Subjt:  MVPAELKELKVQLQELLDNGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRHPLPRIDHLFDQLQGATVFSKIDLRSGYHQLRIKYGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRHNKLYAKFSKYEFWLKQVSFLGHMVSKAG
        TAFRSRYGHYEFIVMSFGLTNAP VFMDLMN VFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTLR NKLYAKFSK EFWLKQVSFLGHMVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRHNKLYAKFSKYEFWLKQVSFLGHMVSKAG

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTGWTRPSTVSEVRSFLGL GYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTV DGSGSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYSTHDLKLAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQQRWLELVKDYDCEILYHPGKANVVADA
        GCVLMQQGKVVAYASRQLKSHEQNY THDL+LAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQ+RWLELVKDYDCEILYHPGKANVV DA
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYSTHDLKLAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQQRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLLRDLERAEIAVSVGAVTI
        LSRKVSHSAALITRQAPL RDLERAEIAVSVGAVT+
Subjt:  LSRKVSHSAALITRQAPLLRDLERAEIAVSVGAVTI

A0A5A7U330 Reverse transcriptase1.5e-23795.87Show/hide
Query:  MVPAELKELKVQLQELLDNGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRHPLPRIDHLFDQLQGATVFSKIDLRSGYHQLRIKYGDVPK
        M PAELKELKVQLQELLD GFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNR+PLPRID LFDQLQGATVFSKIDLRSGYHQLRIK  DVPK
Subjt:  MVPAELKELKVQLQELLDNGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRHPLPRIDHLFDQLQGATVFSKIDLRSGYHQLRIKYGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRHNKLYAKFSKYEFWLKQVSFLGHMVSKAG
        TAFRSRYGHYEFIVMSFGLTNAP VFMDLMN VFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTLR NKLYAKFSK EFWLKQVSFLGH+VSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRHNKLYAKFSKYEFWLKQVSFLGHMVSKAG

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTGWTRPSTVSEVRSFLGL GYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYSTHDLKLAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQQRWLELVKDYDCEILYHPGKANVVADA
        GCVLMQQGKVVAYASRQLKSHEQNY THDL+LAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQ+RWLELVKDYDCEILYHPGKANVVADA
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYSTHDLKLAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQQRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLLRDLERAEIAVSVGAVTI
        LSRKVSHSAALITRQAPL RDLERAEIAVSVGAVT+
Subjt:  LSRKVSHSAALITRQAPLLRDLERAEIAVSVGAVTI

A0A5A7ULI8 Pol protein3.0e-23895.64Show/hide
Query:  MVPAELKELKVQLQELLDNGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRHPLPRIDHLFDQLQGATVFSKIDLRSGYHQLRIKYGDVPK
        M PAELKELKVQLQ+LLD GFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNR+PLPRID LFDQLQGATVFSKIDLRSGYHQLRIK GDVPK
Subjt:  MVPAELKELKVQLQELLDNGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRHPLPRIDHLFDQLQGATVFSKIDLRSGYHQLRIKYGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRHNKLYAKFSKYEFWLKQVSFLGHMVSKAG
        TAFRSRYGHYEFIVMSFGLTNAP VFMDLMN VFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTLR NKLYA+FSKYEFWLKQVSFLGH+VSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRHNKLYAKFSKYEFWLKQVSFLGHMVSKAG

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL
        VSVDPAKIEAV GWTRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPV+TVPDGSGSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYSTHDLKLAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQQRWLELVKDYDCEILYHPGKANVVADA
        GCVLMQQGKVVAYASRQLKSHEQNY THDL+LAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQ+RWLELVKDYDCEILYHPGKANVVADA
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYSTHDLKLAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQQRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLLRDLERAEIAVSVGAVTI
        LSRKVSHSAALITRQAPL RDLERAEIAVSVGAVT+
Subjt:  LSRKVSHSAALITRQAPLLRDLERAEIAVSVGAVTI

A0A5D3BPI1 Reverse transcriptase7.3e-23795.64Show/hide
Query:  MVPAELKELKVQLQELLDNGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRHPLPRIDHLFDQLQGATVFSKIDLRSGYHQLRIKYGDVPK
        M PAELKELKVQLQELLD GFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNR+PLPRID LFDQLQGATVFSKIDLRSGYHQLRIK  DVPK
Subjt:  MVPAELKELKVQLQELLDNGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRHPLPRIDHLFDQLQGATVFSKIDLRSGYHQLRIKYGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRHNKLYAKFSKYEFWLKQVSFLGHMVSKAG
        TAFRSRYGHYEFIVMSFGLTNAP VFMDLMN VFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTLR NKLYAKFSK EFWLKQVSFLGH+VSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPTVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRHNKLYAKFSKYEFWLKQVSFLGHMVSKAG

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTGWTRPSTVSEVRSFLGL GYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQ LKQKLVTAPVLTVPDGSGSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYSTHDLKLAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQQRWLELVKDYDCEILYHPGKANVVADA
        GCVLMQQGKVVAYASRQLKSHEQNY THDL+LAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQ+RWLELVKDYDCEILYHPGKANVVADA
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYSTHDLKLAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQQRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLLRDLERAEIAVSVGAVTI
        LSRKVSHSAALITRQAPL RDLERAEIAVSVGAVT+
Subjt:  LSRKVSHSAALITRQAPLLRDLERAEIAVSVGAVTI

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.63.3e-8540.2Show/hide
Query:  KELKVQLQELLDNGFIRPSVSPWGAPVLFV-KKKDGS----MRLCIDYRELNKVTVKNRHPLPRIDHLFDQLQGATVFSKIDLRSGYHQLRIKYGDVPKT
        +E++ Q+Q++L+ G IR S SP+ +P+  V KK+D S     R+ IDYR+LN++TV +RHP+P +D +  +L     F+ IDL  G+HQ+ +    V KT
Subjt:  KELKVQLQELLDNGFIRPSVSPWGAPVLFV-KKKDGS----MRLCIDYRELNKVTVKNRHPLPRIDHLFDQLQGATVFSKIDLRSGYHQLRIKYGDVPKT

Query:  AFRSRYGHYEFIVMSFGLTNAPTVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRHNKLYAKFSKYEFWLKQVSFLGHMVSKAGV
        AF +++GHYE++ M FGL NAP  F   MN + R  L+   +V++DDI+++S +  EH + L +V + L    L  +  K EF  ++ +FLGH+++  G+
Subjt:  AFRSRYGHYEFIVMSFGLTNAPTVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRHNKLYAKFSKYEFWLKQVSFLGHMVSKAGV

Query:  SVDPAKIEAVTGWTRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDS-FQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL
          +P KIEA+  +  P+   E+++FLGL GYYR+F+ NF+ IA P+T+  +K      +    DS F+ LK  +   P+L VPD +  F + +DAS   L
Subjt:  SVDPAKIEAVTGWTRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDS-FQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYSTHDLKLAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQQRWLELVKDYDCEILYHPGKANVVADA
        G VL Q G  ++Y SR L  HE NYST + +L A+V+A K +RHYL G   +I +DH+ L + +  K+ N +  RW   + ++D +I Y  GK N VADA
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYSTHDLKLAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQQRWLELVKDYDCEILYHPGKANVVADA

Query:  LSR
        LSR
Subjt:  LSR

P0CT34 Transposon Tf2-1 polyprotein6.7e-7837.35Show/hide
Query:  PAELKELKVQLQELLDNGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRHPLPRIDHLFDQLQGATVFSKIDLRSGYHQLRIKYGDVPKTA
        P +++ +  ++ + L +G IR S +    PV+FV KK+G++R+ +DY+ LNK    N +PLP I+ L  ++QG+T+F+K+DL+S YH +R++ GD  K A
Subjt:  PAELKELKVQLQELLDNGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRHPLPRIDHLFDQLQGATVFSKIDLRSGYHQLRIKYGDVPKTA

Query:  FRSRYGHYEFIVMSFGLTNAPTVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRHNKLYAKFSKYEFWLKQVSFLGHMVSKAGVS
        FR   G +E++VM +G++ AP  F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ L++  L    +K EF   QV F+G+ +S+ G +
Subjt:  FRSRYGHYEFIVMSFGLTNAPTVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRHNKLYAKFSKYEFWLKQVSFLGHMVSKAGVS

Query:  VDPAKIEAVTGWTRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGC
             I+ V  W +P    E+R FLG V Y R+F+   S++  PL  L +K   + W+     + +N+KQ LV+ PVL   D S   ++ +DAS   +G 
Subjt:  VDPAKIEAVTGWTRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGC

Query:  VLMQQGK-----VVAYASRQLKSHEQNYSTHDLKLAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQQRWLELVKDYDCEILYHPGK
        VL Q+        V Y S ++   + NYS  D ++ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y PG 
Subjt:  VLMQQGK-----VVAYASRQLKSHEQNYSTHDLKLAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQQRWLELVKDYDCEILYHPGK

Query:  ANVVADALSRKVSHS
        AN +ADALSR V  +
Subjt:  ANVVADALSRKVSHS

P0CT41 Transposon Tf2-12 polyprotein6.7e-7837.35Show/hide
Query:  PAELKELKVQLQELLDNGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRHPLPRIDHLFDQLQGATVFSKIDLRSGYHQLRIKYGDVPKTA
        P +++ +  ++ + L +G IR S +    PV+FV KK+G++R+ +DY+ LNK    N +PLP I+ L  ++QG+T+F+K+DL+S YH +R++ GD  K A
Subjt:  PAELKELKVQLQELLDNGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRHPLPRIDHLFDQLQGATVFSKIDLRSGYHQLRIKYGDVPKTA

Query:  FRSRYGHYEFIVMSFGLTNAPTVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRHNKLYAKFSKYEFWLKQVSFLGHMVSKAGVS
        FR   G +E++VM +G++ AP  F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ L++  L    +K EF   QV F+G+ +S+ G +
Subjt:  FRSRYGHYEFIVMSFGLTNAPTVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRHNKLYAKFSKYEFWLKQVSFLGHMVSKAGVS

Query:  VDPAKIEAVTGWTRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGC
             I+ V  W +P    E+R FLG V Y R+F+   S++  PL  L +K   + W+     + +N+KQ LV+ PVL   D S   ++ +DAS   +G 
Subjt:  VDPAKIEAVTGWTRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGC

Query:  VLMQQGK-----VVAYASRQLKSHEQNYSTHDLKLAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQQRWLELVKDYDCEILYHPGK
        VL Q+        V Y S ++   + NYS  D ++ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y PG 
Subjt:  VLMQQGK-----VVAYASRQLKSHEQNYSTHDLKLAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQQRWLELVKDYDCEILYHPGK

Query:  ANVVADALSRKVSHS
        AN +ADALSR V  +
Subjt:  ANVVADALSRKVSHS

P20825 Retrovirus-related Pol polyprotein from transposon 2977.6e-8238.56Show/hide
Query:  ELKVQLQELLDNGFIRPSVSPWGAPVLFVKKKD-----GSMRLCIDYRELNKVTVKNRHPLPRIDHLFDQLQGATVFSKIDLRSGYHQLRIKYGDVPKTA
        E++ Q+QE+L+ G IR S SP+ +P   V KK         R+ IDYR+LN++T+ +R+P+P +D +  +L     F+ IDL  G+HQ+ +    + KTA
Subjt:  ELKVQLQELLDNGFIRPSVSPWGAPVLFVKKKD-----GSMRLCIDYRELNKVTVKNRHPLPRIDHLFDQLQGATVFSKIDLRSGYHQLRIKYGDVPKTA

Query:  FRSRYGHYEFIVMSFGLTNAPTVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRHNKLYAKFSKYEFWLKQVSFLGHMVSKAGVS
        F ++ GHYE++ M FGL NAP  F   MN + R  L+   +V++DDI+I+S +  EH   +++V   L    L  +  K EF  K+ +FLGH+V+  G+ 
Subjt:  FRSRYGHYEFIVMSFGLTNAPTVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRHNKLYAKFSKYEFWLKQVSFLGHMVSKAGVS

Query:  VDPAKIEAVTGWTRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAPFVWSK-ACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLG
         +P K++A+  +  P+   E+R+FLGL GYYR+F+ N++ IA P+T   +K       K    ++F+ LK  ++  P+L +PD    FV+ +DAS   LG
Subjt:  VDPAKIEAVTGWTRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAPFVWSK-ACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLG

Query:  CVLMQQGKVVAYASRQLKSHEQNYSTHDLKLAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQQRWLELVKDYDCEILYHPGKANVVADAL
         VL Q G  +++ SR L  HE NYS  + +L A+V+A K +RHYL G +  I +DH+ L++    KE   + +RW   + +Y  +I Y  GK N VADAL
Subjt:  CVLMQQGKVVAYASRQLKSHEQNYSTHDLKLAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQQRWLELVKDYDCEILYHPGKANVVADAL

Query:  SR
        SR
Subjt:  SR

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus1.4e-8036.69Show/hide
Query:  ELKVQLQELLDNGFIRPSVSPWGAPVLFVKKK-----DGSMRLCIDYRELNKVTVKNRHPLPRIDHLFDQLQGATVFSKIDLRSGYHQLRIKYGDVPKTA
        E++ Q+ ELL +G IRPS SP+ +P+  V KK     +   R+ +D++ LN VT+ + +P+P I+     L  A  F+ +DL SG+HQ+ +K  D+PKTA
Subjt:  ELKVQLQELLDNGFIRPSVSPWGAPVLFVKKK-----DGSMRLCIDYRELNKVTVKNRHPLPRIDHLFDQLQGATVFSKIDLRSGYHQLRIKYGDVPKTA

Query:  FRSRYGHYEFIVMSFGLTNAPTVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRHNKLYAKFSKYEFWLKQVSFLGHMVSKAGVS
        F +  G YEF+ + FGL NAP +F  +++ + RE +     V+IDDI+++S+    H ++LR+VL +L    L     K  F   QV FLG++V+  G+ 
Subjt:  FRSRYGHYEFIVMSFGLTNAPTVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRHNKLYAKFSKYEFWLKQVSFLGHMVSKAGVS

Query:  VDPAKIEAVTGWTRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTR-----------KGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVI
         DP K+ A++    P++V E++ FLG+  YYR+F+++++++A PLT LTR              P    +    SF +LK  L ++ +L  P  +  F +
Subjt:  VDPAKIEAVTGWTRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTR-----------KGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVI

Query:  YSDASKKGLGCVLMQ----QGKVVAYASRQLKSHEQNYSTHDLKLAAVVFALKIWRHYLYGE-KIQIFTDHKSLKYFFTQKELNMRQQRWLELVKDYDCE
         +DAS   +G VL Q    + + +AY SR L   E+NY+T + ++ A++++L   R YLYG   I+++TDH+ L +    +  N + +RW   +++Y+CE
Subjt:  YSDASKKGLGCVLMQ----QGKVVAYASRQLKSHEQNYSTHDLKLAAVVFALKIWRHYLYGE-KIQIFTDHKSLKYFFTQKELNMRQQRWLELVKDYDCE

Query:  ILYHPGKANVVADALSR
        ++Y PGK+NVVADALSR
Subjt:  ILYHPGKANVVADALSR

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein1.8e-2544.27Show/hide
Query:  HLRIVLQTLRHNKLYAKFSKYEFWLKQVSFLG--HMVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAPFVW
        HL +VLQ    ++ YA   K  F   Q+++LG  H++S  GVS DPAK+EA+ GW  P   +E+R FLGL GYYRRFV+N+ +I  PLT+L +K +   W
Subjt:  HLRIVLQTLRHNKLYAKFSKYEFWLKQVSFLG--HMVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAPFVW

Query:  SKACEDSFQNLKQKLVTAPVLTVPDGSGSFV
        ++    +F+ LK  + T PVL +PD    FV
Subjt:  SKACEDSFQNLKQKLVTAPVLTVPDGSGSFV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCCCCGCAGAGCTGAAAGAACTGAAGGTGCAGTTACAAGAATTGCTTGATAACGGATTCATTCGACCGAGCGTGTCACCTTGGGGTGCGCCAGTTTTATTTGTTAA
GAAGAAGGATGGATCGATGCGTCTATGCATTGACTATAGGGAGTTGAACAAGGTAACCGTTAAGAACAGACATCCCTTGCCCAGGATCGACCATCTATTTGACCAGTTAC
AGGGAGCCACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATATCATCAGCTGAGGATTAAGTATGGTGATGTACCGAAGACAGCATTTCGTTCCAGATATGGACACTAC
GAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGACAGTGTTTATGGACTTGATGAACATAGTGTTTAGGGAGTTCCTAGATACTTTTGTGATCGTGTTTATTGA
TGATATCTTGATATACTCCAAGACGGAGGCCGAACATGAGGAGCATTTACGTATAGTTTTGCAAACACTTCGGCATAATAAGTTGTATGCAAAGTTCTCGAAATACGAGT
TTTGGCTGAAGCAGGTGTCCTTTCTAGGCCACATGGTTTCTAAGGCTGGAGTCTCTGTGGATCCAGCTAAGATAGAGGCAGTCACCGGTTGGACCCGACCTTCCACAGTC
AGTGAGGTTCGTAGCTTTCTGGGTTTAGTAGGTTATTATCGACGGTTTGTGGAGAACTTTTCTCGTATAGCTACTCCTCTTACTCAGTTGACCAGGAAGGGAGCTCCTTT
TGTTTGGAGCAAGGCATGTGAGGACAGTTTTCAGAACCTTAAACAGAAGCTAGTTACCGCACCGGTGCTTACTGTACCTGACGGTTCTGGCAGTTTTGTGATTTATAGTG
ATGCTTCCAAGAAGGGTTTGGGTTGTGTTTTGATGCAACAAGGTAAGGTGGTCGCTTATGCTTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACTCTACACATGATTTA
AAGTTGGCAGCAGTGGTTTTTGCTTTGAAAATATGGAGGCATTACTTATATGGTGAAAAGATACAGATCTTCACGGATCATAAGAGCTTGAAATACTTCTTTACTCAGAA
GGAATTGAATATGAGACAGCAAAGATGGCTTGAGTTAGTGAAGGATTACGATTGTGAGATACTGTATCATCCAGGCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGAA
AGGTATCACATTCAGCAGCACTTATTACCCGACAGGCCCCATTGCTTCGAGATCTTGAGCGGGCTGAGATTGCAGTGTCAGTGGGGGCAGTCACTATATAG
mRNA sequenceShow/hide mRNA sequence
ATGGTCCCCGCAGAGCTGAAAGAACTGAAGGTGCAGTTACAAGAATTGCTTGATAACGGATTCATTCGACCGAGCGTGTCACCTTGGGGTGCGCCAGTTTTATTTGTTAA
GAAGAAGGATGGATCGATGCGTCTATGCATTGACTATAGGGAGTTGAACAAGGTAACCGTTAAGAACAGACATCCCTTGCCCAGGATCGACCATCTATTTGACCAGTTAC
AGGGAGCCACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATATCATCAGCTGAGGATTAAGTATGGTGATGTACCGAAGACAGCATTTCGTTCCAGATATGGACACTAC
GAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGACAGTGTTTATGGACTTGATGAACATAGTGTTTAGGGAGTTCCTAGATACTTTTGTGATCGTGTTTATTGA
TGATATCTTGATATACTCCAAGACGGAGGCCGAACATGAGGAGCATTTACGTATAGTTTTGCAAACACTTCGGCATAATAAGTTGTATGCAAAGTTCTCGAAATACGAGT
TTTGGCTGAAGCAGGTGTCCTTTCTAGGCCACATGGTTTCTAAGGCTGGAGTCTCTGTGGATCCAGCTAAGATAGAGGCAGTCACCGGTTGGACCCGACCTTCCACAGTC
AGTGAGGTTCGTAGCTTTCTGGGTTTAGTAGGTTATTATCGACGGTTTGTGGAGAACTTTTCTCGTATAGCTACTCCTCTTACTCAGTTGACCAGGAAGGGAGCTCCTTT
TGTTTGGAGCAAGGCATGTGAGGACAGTTTTCAGAACCTTAAACAGAAGCTAGTTACCGCACCGGTGCTTACTGTACCTGACGGTTCTGGCAGTTTTGTGATTTATAGTG
ATGCTTCCAAGAAGGGTTTGGGTTGTGTTTTGATGCAACAAGGTAAGGTGGTCGCTTATGCTTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACTCTACACATGATTTA
AAGTTGGCAGCAGTGGTTTTTGCTTTGAAAATATGGAGGCATTACTTATATGGTGAAAAGATACAGATCTTCACGGATCATAAGAGCTTGAAATACTTCTTTACTCAGAA
GGAATTGAATATGAGACAGCAAAGATGGCTTGAGTTAGTGAAGGATTACGATTGTGAGATACTGTATCATCCAGGCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGAA
AGGTATCACATTCAGCAGCACTTATTACCCGACAGGCCCCATTGCTTCGAGATCTTGAGCGGGCTGAGATTGCAGTGTCAGTGGGGGCAGTCACTATATAG
Protein sequenceShow/hide protein sequence
MVPAELKELKVQLQELLDNGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRHPLPRIDHLFDQLQGATVFSKIDLRSGYHQLRIKYGDVPKTAFRSRYGHY
EFIVMSFGLTNAPTVFMDLMNIVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRHNKLYAKFSKYEFWLKQVSFLGHMVSKAGVSVDPAKIEAVTGWTRPSTV
SEVRSFLGLVGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYSTHDL
KLAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQQRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLLRDLERAEIAVSVGAVTI