; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc01g0028941 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc01g0028941
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr01:30143380..30144699
RNA-Seq ExpressionCmc01g0028941
SyntenyCmc01g0028941
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008194 - UDP-glycosyltransferase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025998.1 pol protein [Cucumis melo var. makuwa]5.8e-25299.32Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPK

Query:  TAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEG
        TAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEG
Subjt:  TAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEG

Query:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGL
        VSVDPAKIEAVTNW RPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKIHEQNYPTHDLELAAVVFAVKIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVTDA
        GCVLMQQGKVVAYASRQLKIHEQNYPTHDLELAAVVFA+KIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVV DA
Subjt:  GCVLMQQGKVVAYASRQLKIHEQNYPTHDLELAAVVFAVKIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVTDA

Query:  LSRKVAHSAALITKQTPLLRDFERAEIAVSVGEVTAQLA
        LSRKVAHSAALITKQTPLLRDFERAEIAVSVGEVTAQLA
Subjt:  LSRKVAHSAALITKQTPLLRDFERAEIAVSVGEVTAQLA

KAA0063793.1 pol protein [Cucumis melo var. makuwa]6.8e-25399.54Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPK

Query:  TAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEG
        TAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEG
Subjt:  TAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEG

Query:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGL
        VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKIHEQNYPTHDLELAAVVFAVKIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVTDA
        GCVLMQQGKVVAYASRQLKIHEQNYPTHDLELAAVVFA+KIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVV DA
Subjt:  GCVLMQQGKVVAYASRQLKIHEQNYPTHDLELAAVVFAVKIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVTDA

Query:  LSRKVAHSAALITKQTPLLRDFERAEIAVSVGEVTAQLA
        LSRKVAHSAALITKQTPLLRDFERAEIAVSVGEVTAQLA
Subjt:  LSRKVAHSAALITKQTPLLRDFERAEIAVSVGEVTAQLA

TYK01576.1 pol protein [Cucumis melo var. makuwa]5.8e-25299.32Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPK

Query:  TAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEG
        TAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEG
Subjt:  TAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEG

Query:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGL
        VSVDPAKIEAVTNW RPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKIHEQNYPTHDLELAAVVFAVKIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVTDA
        GCVLMQQGKVVAYASRQLKIHEQNYPTHDLELAAVVFA+KIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVV DA
Subjt:  GCVLMQQGKVVAYASRQLKIHEQNYPTHDLELAAVVFAVKIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVTDA

Query:  LSRKVAHSAALITKQTPLLRDFERAEIAVSVGEVTAQLA
        LSRKVAHSAALITKQTPLLRDFERAEIAVSVGEVTAQLA
Subjt:  LSRKVAHSAALITKQTPLLRDFERAEIAVSVGEVTAQLA

TYK06888.1 pol protein [Cucumis melo var. makuwa]5.8e-25299.32Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPK

Query:  TAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEG
        TAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEG
Subjt:  TAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEG

Query:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGL
        VSVDPAKIEAVTNW RPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKIHEQNYPTHDLELAAVVFAVKIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVTDA
        GCVLMQQGKVVAYASRQLKIHEQNYPTHDLELAAVVFA+KIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVV DA
Subjt:  GCVLMQQGKVVAYASRQLKIHEQNYPTHDLELAAVVFAVKIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVTDA

Query:  LSRKVAHSAALITKQTPLLRDFERAEIAVSVGEVTAQLA
        LSRKVAHSAALITKQTPLLRDFERAEIAVSVGEVTAQLA
Subjt:  LSRKVAHSAALITKQTPLLRDFERAEIAVSVGEVTAQLA

TYK20443.1 pol protein [Cucumis melo var. makuwa]5.8e-25299.32Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPK

Query:  TAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEG
        TAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEG
Subjt:  TAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEG

Query:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGL
        VSVDPAKIEAVTNW RPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKIHEQNYPTHDLELAAVVFAVKIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVTDA
        GCVLMQQGKVVAYASRQLKIHEQNYPTHDLELAAVVFA+KIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVV DA
Subjt:  GCVLMQQGKVVAYASRQLKIHEQNYPTHDLELAAVVFAVKIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVTDA

Query:  LSRKVAHSAALITKQTPLLRDFERAEIAVSVGEVTAQLA
        LSRKVAHSAALITKQTPLLRDFERAEIAVSVGEVTAQLA
Subjt:  LSRKVAHSAALITKQTPLLRDFERAEIAVSVGEVTAQLA

TrEMBL top hitse value%identityAlignment
A0A5A7SIJ5 Reverse transcriptase2.8e-25299.32Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPK

Query:  TAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEG
        TAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEG
Subjt:  TAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEG

Query:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGL
        VSVDPAKIEAVTNW RPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKIHEQNYPTHDLELAAVVFAVKIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVTDA
        GCVLMQQGKVVAYASRQLKIHEQNYPTHDLELAAVVFA+KIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVV DA
Subjt:  GCVLMQQGKVVAYASRQLKIHEQNYPTHDLELAAVVFAVKIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVTDA

Query:  LSRKVAHSAALITKQTPLLRDFERAEIAVSVGEVTAQLA
        LSRKVAHSAALITKQTPLLRDFERAEIAVSVGEVTAQLA
Subjt:  LSRKVAHSAALITKQTPLLRDFERAEIAVSVGEVTAQLA

A0A5A7V2A0 Reverse transcriptase2.8e-25299.32Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPK

Query:  TAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEG
        TAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEG
Subjt:  TAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEG

Query:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGL
        VSVDPAKIEAVTNW RPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKIHEQNYPTHDLELAAVVFAVKIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVTDA
        GCVLMQQGKVVAYASRQLKIHEQNYPTHDLELAAVVFA+KIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVV DA
Subjt:  GCVLMQQGKVVAYASRQLKIHEQNYPTHDLELAAVVFAVKIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVTDA

Query:  LSRKVAHSAALITKQTPLLRDFERAEIAVSVGEVTAQLA
        LSRKVAHSAALITKQTPLLRDFERAEIAVSVGEVTAQLA
Subjt:  LSRKVAHSAALITKQTPLLRDFERAEIAVSVGEVTAQLA

A0A5A7V6R2 Reverse transcriptase3.3e-25399.54Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPK

Query:  TAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEG
        TAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEG
Subjt:  TAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEG

Query:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGL
        VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKIHEQNYPTHDLELAAVVFAVKIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVTDA
        GCVLMQQGKVVAYASRQLKIHEQNYPTHDLELAAVVFA+KIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVV DA
Subjt:  GCVLMQQGKVVAYASRQLKIHEQNYPTHDLELAAVVFAVKIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVTDA

Query:  LSRKVAHSAALITKQTPLLRDFERAEIAVSVGEVTAQLA
        LSRKVAHSAALITKQTPLLRDFERAEIAVSVGEVTAQLA
Subjt:  LSRKVAHSAALITKQTPLLRDFERAEIAVSVGEVTAQLA

A0A5D3BTN0 Reverse transcriptase2.8e-25299.32Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPK

Query:  TAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEG
        TAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEG
Subjt:  TAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEG

Query:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGL
        VSVDPAKIEAVTNW RPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKIHEQNYPTHDLELAAVVFAVKIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVTDA
        GCVLMQQGKVVAYASRQLKIHEQNYPTHDLELAAVVFA+KIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVV DA
Subjt:  GCVLMQQGKVVAYASRQLKIHEQNYPTHDLELAAVVFAVKIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVTDA

Query:  LSRKVAHSAALITKQTPLLRDFERAEIAVSVGEVTAQLA
        LSRKVAHSAALITKQTPLLRDFERAEIAVSVGEVTAQLA
Subjt:  LSRKVAHSAALITKQTPLLRDFERAEIAVSVGEVTAQLA

A0A5D3C6W3 Reverse transcriptase2.8e-25299.32Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPK

Query:  TAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEG
        TAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEG
Subjt:  TAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEG

Query:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGL
        VSVDPAKIEAVTNW RPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKIHEQNYPTHDLELAAVVFAVKIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVTDA
        GCVLMQQGKVVAYASRQLKIHEQNYPTHDLELAAVVFA+KIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVV DA
Subjt:  GCVLMQQGKVVAYASRQLKIHEQNYPTHDLELAAVVFAVKIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVTDA

Query:  LSRKVAHSAALITKQTPLLRDFERAEIAVSVGEVTAQLA
        LSRKVAHSAALITKQTPLLRDFERAEIAVSVGEVTAQLA
Subjt:  LSRKVAHSAALITKQTPLLRDFERAEIAVSVGEVTAQLA

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.61.7e-8940.45Show/hide
Query:  KELKVQLQELLDKGFIRPSVSPWGAPVLFV-KKKDGS----MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKT
        +E++ Q+Q++L++G IR S SP+ +P+  V KK+D S     R+ IDYR+LN++TV +R+P+P +D++  +L     F+ IDL  G+HQ+ +    + KT
Subjt:  KELKVQLQELLDKGFIRPSVSPWGAPVLFV-KKKDGS----MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKT

Query:  AFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGV
        AF +++GHYE++ M FGL NAPA F   MN + +  L+   +V++DDI+++S +  EH + L  V E L    L  +  KCEF  ++ TFLGHV++ +G+
Subjt:  AFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGV

Query:  SVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPF-VWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGL
          +P KIEA+  +P P+   EI++FLGL GYYR+F+ +F+ IA P+T+  +K       +P  + +F++LK  +   P+L VPD +  F + +DAS   L
Subjt:  SVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPF-VWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKIHEQNYPTHDLELAAVVFAVKIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVTDA
        G VL Q G  ++Y SR L  HE NY T + EL A+V+A K +RHYL G   +I +DH+ L + +  K+ N +  RW   + ++D +I Y  GK N V DA
Subjt:  GCVLMQQGKVVAYASRQLKIHEQNYPTHDLELAAVVFAVKIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVTDA

Query:  LSR
        LSR
Subjt:  LSR

P20825 Retrovirus-related Pol polyprotein from transposon 2973.3e-8539.05Show/hide
Query:  ELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKD-----GSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTA
        E++ Q+QE+L++G IR S SP+ +P   V KK         R+ IDYR+LN++T+ +RYP+P +D++  +L     F+ IDL  G+HQ+ + +  I KTA
Subjt:  ELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKD-----GSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTA

Query:  FRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVS
        F ++ GHYE++ M FGL NAPA F   MN + +  L+   +V++DDI+I+S +  EH   +  V   L    L  +  KCEF  ++  FLGH+V+ +G+ 
Subjt:  FRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVS

Query:  VDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPF-VWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGLG
         +P K++A+ ++P P+   EIR+FLGL GYYR+F+ +++ IA P+T   +K T           +F++LK  ++  P+L +PD    FV+ +DAS   LG
Subjt:  VDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPF-VWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGLG

Query:  CVLMQQGKVVAYASRQLKIHEQNYPTHDLELAAVVFAVKIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVTDAL
         VL Q G  +++ SR L  HE NY   + EL A+V+A K +RHYL G +  I +DH+ L++    KE   +  RW   + +Y  +I Y  GK N V DAL
Subjt:  CVLMQQGKVVAYASRQLKIHEQNYPTHDLELAAVVFAVKIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVTDAL

Query:  SR
        SR
Subjt:  SR

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein4.2e-8039.26Show/hide
Query:  KELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSR
        +E+   +Q+LLD  FI PS SP  +PV+ V KKDG+ RLC+DYR LNK T+ + +PLPRID+L  ++  A +F+ +DL SGYHQ+ +   D  KTAF + 
Subjt:  KELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSR

Query:  YGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPA
         G YE+ VM FGL NAP+ F   M   F+D    FV V++DDILI+S++  EH +HL  VLE L+   L  K  KC+F   +  FLG+ +  + ++    
Subjt:  YGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPA

Query:  KIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQ
        K  A+ ++P P TV + + FLG+  YYRRF+ + S+IA P+       +   W+   +++ ++LK  L  +PVL   +   N+ + +DASK G+G VL +
Subjt:  KIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQ

Query:  QGK------VVAYASRQLKIHEQNYPTHDLELAAVVFAVKIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVTDA
                 VV Y S+ L+  ++NYP  +LEL  ++ A+  +R+ L+G+   + TDH SL     + E   R +RWL+ +  YD  + Y  G  NVV DA
Subjt:  QGK------VVAYASRQLKIHEQNYPTHDLELAAVVFAVKIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVTDA

Query:  LSRKV
        +SR +
Subjt:  LSRKV

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus3.1e-8336.69Show/hide
Query:  ELKVQLQELLDKGFIRPSVSPWGAPVLFVKKK-----DGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTA
        E++ Q+ ELL  G IRPS SP+ +P+  V KK     +   R+ +D++ LN VT+ + YP+P I+     L  A  F+ +DL SG+HQ+ +++ DIPKTA
Subjt:  ELKVQLQELLDKGFIRPSVSPWGAPVLFVKKK-----DGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTA

Query:  FRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVS
        F +  G YEF+ + FGL NAPA+F  +++ + ++ +     V+IDDI+++S+    H ++L  VL +L    L     K  F   +V FLG++V+++G+ 
Subjt:  FRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVS

Query:  VDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTR-----------KGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVI
         DP K+ A++  P P++V E++ FLG+  YYR+F++D++++A PLT LTR              P        +SF +LK  L ++ +L  P  +  F +
Subjt:  VDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTR-----------KGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVI

Query:  YSDASKKGLGCVLMQ----QGKVVAYASRQLKIHEQNYPTHDLELAAVVFAVKIWRHYLYGE-KIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCE
         +DAS   +G VL Q    + + +AY SR L   E+NY T + E+ A+++++   R YLYG   I++YTDH+ L +    +  N + +RW   +++Y+CE
Subjt:  YSDASKKGLGCVLMQ----QGKVVAYASRQLKIHEQNYPTHDLELAAVVFAVKIWRHYLYGE-KIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCE

Query:  ILYHPGKANVVTDALSR
        ++Y PGK+NVV DALSR
Subjt:  ILYHPGKANVVTDALSR

Q99315 Transposon Ty3-G Gag-Pol polyprotein4.2e-8039.51Show/hide
Query:  KELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSR
        +E+   +Q+LLD  FI PS SP  +PV+ V KKDG+ RLC+DYR LNK T+ + +PLPRID+L  ++  A +F+ +DL SGYHQ+ +   D  KTAF + 
Subjt:  KELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSR

Query:  YGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPA
         G YE+ VM FGL NAP+ F   M   F+D    FV V++DDILI+S++  EH +HL  VLE L+   L  K  KC+F   +  FLG+ +  + ++    
Subjt:  YGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPA

Query:  KIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQ
        K  A+ ++P P TV + + FLG+  YYRRF+ + S+IA P+       +   W+   +++  +LK  L  +PVL   +   N+ + +DASK G+G VL +
Subjt:  KIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQ

Query:  QGK------VVAYASRQLKIHEQNYPTHDLELAAVVFAVKIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVTDA
                 VV Y S+ L+  ++NYP  +LEL  ++ A+  +R+ L+G+   + TDH SL     + E   R +RWL+ +  YD  + Y  G  NVV DA
Subjt:  QGK------VVAYASRQLKIHEQNYPTHDLELAAVVFAVKIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVTDA

Query:  LSRKV
        +SR V
Subjt:  LSRKV

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein5.6e-2744Show/hide
Query:  HLHQVLETLRANKLYAKFSKCEFWLRKVTFLG--HVVSSEGVSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVW
        HL  VL+    ++ YA   KC F   ++ +LG  H++S EGVS DPAK+EA+  WP P   +E+R FLGL GYYRRFV+++ +I  PLT+L +K +   W
Subjt:  HLHQVLETLRANKLYAKFSKCEFWLRKVTFLG--HVVSSEGVSVDPAKIEAVTNWPRPSTVSEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVW

Query:  SPACERSFQELKQKLVTAPVLTVPD
        +     +F+ LK  + T PVL +PD
Subjt:  SPACERSFQELKQKLVTAPVLTVPD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCCAGCCGAGTTAAAGGAGTTGAAGGTCCAGTTGCAGGAGTTGCTGGACAAAGGCTTCATCCGGCCCAGTGTGTCGCCTTGGGGAGCCCCAGTATTGTTCGTGAA
GAAGAAGGATGGGTCGATGCGCCTTTGTATTGACTACCGAGAGCTGAACAAGGTGACGGTCAAAAACCGCTACCCCTTGCCCAGGATTGATGACCTGTTCGATCAGTTGC
AGGGAGCCACTGTCTTTTCCAAGATCGACCTACGATCAGGCTATCACCAGTTGAGGATTAGGGACGGTGACATTCCCAAGACGGCCTTTCGATCGAGGTACGGACATTAC
GAATTCGTTGTGATGTCTTTCGGCTTGACTAACGCTCCTGCAGTATTCATGGATTTGATGAACAGGGTGTTTAAGGACTTTCTAGACTCGTTCGTCATAGTCTTCATTGA
CGACATCCTCATCTACTCAAAAACTGAGGCTGAGCACGAGGAGCACTTACACCAGGTTTTGGAGACCCTTCGAGCCAACAAGTTGTATGCCAAGTTCTCCAAGTGTGAAT
TCTGGTTAAGGAAGGTGACGTTTCTTGGCCACGTGGTTTCCAGTGAGGGAGTTTCAGTAGATCCCGCAAAGATTGAAGCGGTGACCAACTGGCCTCGACCGTCCACGGTT
AGTGAAATTCGAAGTTTTCTGGGCTTGGCAGGTTACTACAGGAGGTTCGTGGAAGACTTCTCACGTATAGCCAGCCCGTTGACCCAGTTGACCAGAAAGGGAACCCCTTT
TGTCTGGAGCCCAGCATGCGAGCGTAGCTTTCAGGAGCTCAAACAGAAGCTAGTGACTGCACCAGTCCTGACAGTGCCCGATGGTTCGGGAAACTTTGTGATCTATAGTG
ATGCCTCCAAGAAGGGACTGGGCTGTGTCCTGATGCAGCAAGGTAAGGTAGTTGCTTATGCCTCCCGCCAGTTGAAGATTCATGAGCAGAACTACCCTACCCACGATTTG
GAGTTGGCAGCTGTAGTCTTTGCAGTGAAGATATGGAGGCACTATCTGTACGGTGAGAAGATTCAGATTTACACCGACCATAAGAGCCTGAAGTACTTCTTCACTCAGAA
GGAGTTGAACATGAGGCAGAGGAGGTGGCTCGAGTTGGTGAAAGACTACGACTGCGAGATCCTATACCACCCAGGTAAAGCAAATGTAGTAACTGACGCGCTGAGTAGGA
AGGTTGCACATTCAGCAGCGCTAATCACCAAGCAGACCCCCTTACTCAGGGATTTTGAGAGAGCCGAGATTGCAGTCTCAGTAGGTGAGGTTACCGCACAGTTGGCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTCCAGCCGAGTTAAAGGAGTTGAAGGTCCAGTTGCAGGAGTTGCTGGACAAAGGCTTCATCCGGCCCAGTGTGTCGCCTTGGGGAGCCCCAGTATTGTTCGTGAA
GAAGAAGGATGGGTCGATGCGCCTTTGTATTGACTACCGAGAGCTGAACAAGGTGACGGTCAAAAACCGCTACCCCTTGCCCAGGATTGATGACCTGTTCGATCAGTTGC
AGGGAGCCACTGTCTTTTCCAAGATCGACCTACGATCAGGCTATCACCAGTTGAGGATTAGGGACGGTGACATTCCCAAGACGGCCTTTCGATCGAGGTACGGACATTAC
GAATTCGTTGTGATGTCTTTCGGCTTGACTAACGCTCCTGCAGTATTCATGGATTTGATGAACAGGGTGTTTAAGGACTTTCTAGACTCGTTCGTCATAGTCTTCATTGA
CGACATCCTCATCTACTCAAAAACTGAGGCTGAGCACGAGGAGCACTTACACCAGGTTTTGGAGACCCTTCGAGCCAACAAGTTGTATGCCAAGTTCTCCAAGTGTGAAT
TCTGGTTAAGGAAGGTGACGTTTCTTGGCCACGTGGTTTCCAGTGAGGGAGTTTCAGTAGATCCCGCAAAGATTGAAGCGGTGACCAACTGGCCTCGACCGTCCACGGTT
AGTGAAATTCGAAGTTTTCTGGGCTTGGCAGGTTACTACAGGAGGTTCGTGGAAGACTTCTCACGTATAGCCAGCCCGTTGACCCAGTTGACCAGAAAGGGAACCCCTTT
TGTCTGGAGCCCAGCATGCGAGCGTAGCTTTCAGGAGCTCAAACAGAAGCTAGTGACTGCACCAGTCCTGACAGTGCCCGATGGTTCGGGAAACTTTGTGATCTATAGTG
ATGCCTCCAAGAAGGGACTGGGCTGTGTCCTGATGCAGCAAGGTAAGGTAGTTGCTTATGCCTCCCGCCAGTTGAAGATTCATGAGCAGAACTACCCTACCCACGATTTG
GAGTTGGCAGCTGTAGTCTTTGCAGTGAAGATATGGAGGCACTATCTGTACGGTGAGAAGATTCAGATTTACACCGACCATAAGAGCCTGAAGTACTTCTTCACTCAGAA
GGAGTTGAACATGAGGCAGAGGAGGTGGCTCGAGTTGGTGAAAGACTACGACTGCGAGATCCTATACCACCCAGGTAAAGCAAATGTAGTAACTGACGCGCTGAGTAGGA
AGGTTGCACATTCAGCAGCGCTAATCACCAAGCAGACCCCCTTACTCAGGGATTTTGAGAGAGCCGAGATTGCAGTCTCAGTAGGTGAGGTTACCGCACAGTTGGCTTAG
Protein sequenceShow/hide protein sequence
MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHY
EFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVLETLRANKLYAKFSKCEFWLRKVTFLGHVVSSEGVSVDPAKIEAVTNWPRPSTV
SEIRSFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACERSFQELKQKLVTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYASRQLKIHEQNYPTHDL
ELAAVVFAVKIWRHYLYGEKIQIYTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVTDALSRKVAHSAALITKQTPLLRDFERAEIAVSVGEVTAQLA