CuGenDBv2

Cmc03g0071081 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc03g0071081
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Reverse transcriptase
Genome location: CMiso1.1chr03:17200165..17201490
RNA-Seq Expression: Cmc03g0071081
Synteny: Cmc03g0071081
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR021109 - Aspartic peptidase domain superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
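
The genome location field follows a `sequence:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state this), the field can be parsed and the genomic span computed as below. `parse_location` is a hypothetical helper written for illustration, not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(.+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)

seqid, start, end = parse_location("CMiso1.1chr03:17200165..17201490")
# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
span_length = end - start + 1
print(seqid, start, end, span_length)
```

Under that assumption the gene spans 1326 bp on CMiso1.1chr03.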


Homology
GenBank top hits
KAA0025998.1 pol protein [Cucumis melo var. makuwa] (e-value: 4.1e-231; % identity: 92.52)
Query:  MLDVTLLVLDMQDFDVILGMDWLSANHANIDCFGKEVVFNPPSGASFKFRGAGMVCIPKVISAMKASKLLSPGTWGILASVVDIREPEVTLSSEPVVREY
        MLDVTLLVLDMQDFDVILGMDWLSANHANIDCFGKEVVFNPPSGASFKFRGAGMVCIPKVISAMKASKLLS GTWGILASVVD+REPEV+LSSEPVVREY
Subjt:  MLDVTLLVLDMQDFDVILGMDWLSANHANIDCFGKEVVFNPPSGASFKFRGAGMVCIPKVISAMKASKLLSPGTWGILASVVDIREPEVTLSSEPVVREY

Query:  SDVFPDELPGLPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPFP
         DVFPDELPGLPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYP P
Subjt:  SDVFPDELPGLPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPFP

Query:  RIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLH
        RIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLH
Subjt:  RIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLH

Query:  Q-----------------------VTFLGHVVSSEGVSVDPAKIEAVTNWPRPSTVSEIRCFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACE
        Q                       VTFLGHVVSSEGVSVDPAKIEAVTNW RPSTVSEIR FLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACE
Subjt:  Q-----------------------VTFLGHVVSSEGVSVDPAKIEAVTNWPRPSTVSEIRCFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACE

Query:  SSFQELKQKLVTAPVPTVPDGSGSFVIYSDASKKGLGCVLM
         SFQELKQKLVTAPV TVPDGSG+FVIYSDASKKGLGCVLM
Subjt:  SSFQELKQKLVTAPVPTVPDGSGSFVIYSDASKKGLGCVLM

KAA0032541.1 pol protein [Cucumis melo var. makuwa] (e-value: 5.4e-231; % identity: 92.29)
Query:  MLDVTLLVLDMQDFDVILGMDWLSANHANIDCFGKEVVFNPPSGASFKFRGAGMVCIPKVISAMKASKLLSPGTWGILASVVDIREPEVTLSSEPVVREY
        MLDVTLLVLDMQDFDVILGMDWLSANHANIDCFGKEV+FNPPSGASFKFRGAGMVCIPKVISAMKASKLLS GTWGILASVVDIREPEV+LSSEPVVREY
Subjt:  MLDVTLLVLDMQDFDVILGMDWLSANHANIDCFGKEVVFNPPSGASFKFRGAGMVCIPKVISAMKASKLLSPGTWGILASVVDIREPEVTLSSEPVVREY

Query:  SDVFPDELPGLPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPFP
         DVFPDELPGLPPPREVDF IELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYP P
Subjt:  SDVFPDELPGLPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPFP

Query:  RIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLH
        RIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHY+FVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLH
Subjt:  RIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLH

Query:  Q-----------------------VTFLGHVVSSEGVSVDPAKIEAVTNWPRPSTVSEIRCFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACE
        Q                       VTFLGHVVSSEGVSVDPAKIEAVTNWPRPSTVSEIR FLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACE
Subjt:  Q-----------------------VTFLGHVVSSEGVSVDPAKIEAVTNWPRPSTVSEIRCFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACE

Query:  SSFQELKQKLVTAPVPTVPDGSGSFVIYSDASKKGLGCVLM
         SFQELKQKLVTAPV TVPDGSG+FVIYSDASKKGLGCVLM
Subjt:  SSFQELKQKLVTAPVPTVPDGSGSFVIYSDASKKGLGCVLM

KAA0058812.1 pol protein [Cucumis melo var. makuwa] (e-value: 2.4e-231; % identity: 92.29)
Query:  MLDVTLLVLDMQDFDVILGMDWLSANHANIDCFGKEVVFNPPSGASFKFRGAGMVCIPKVISAMKASKLLSPGTWGILASVVDIREPEVTLSSEPVVREY
        MLDVTLLVLDMQDFDVILGMDWLSANHANIDCFGKEVVFNPPSGASFKFRGAG+VCIPKVISAMKASKLLS GTWGILASVVDIREPEV+LSSEPVVREY
Subjt:  MLDVTLLVLDMQDFDVILGMDWLSANHANIDCFGKEVVFNPPSGASFKFRGAGMVCIPKVISAMKASKLLSPGTWGILASVVDIREPEVTLSSEPVVREY

Query:  SDVFPDELPGLPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPFP
         DVFPDELPGLPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT+KNRYP P
Subjt:  SDVFPDELPGLPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPFP

Query:  RIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLH
        RIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAF SRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDF+DSFVIVFIDDILIYSKTEAEHEEHLH
Subjt:  RIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLH

Query:  Q-----------------------VTFLGHVVSSEGVSVDPAKIEAVTNWPRPSTVSEIRCFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACE
        Q                       VTFLGHVVSSEGVSVDPAKIEAVTNWPRPSTVSEIR FLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACE
Subjt:  Q-----------------------VTFLGHVVSSEGVSVDPAKIEAVTNWPRPSTVSEIRCFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACE

Query:  SSFQELKQKLVTAPVPTVPDGSGSFVIYSDASKKGLGCVLM
        SSFQELKQKLVTAPV TVPDGSG+FVIYSDASKKGLGCVLM
Subjt:  SSFQELKQKLVTAPVPTVPDGSGSFVIYSDASKKGLGCVLM

KAA0059723.1 pol protein [Cucumis melo var. makuwa] (e-value: 1.2e-230; % identity: 92.52)
Query:  MLDVTLLVLDMQDFDVILGMDWLSANHANIDCFGKEVVFNPPSGASFKFRGAGMVCIPKVISAMKASKLLSPGTWGILASVVDIREPEVTLSSEPVVREY
        MLDVTLLVLDMQDFDVILGMDWLSANHANIDCFGKEVVFNPPSGASFKFRGAGMVCIPKVISAMKASKLLS GTWGILASVVDIREPEV+LSSEPVVREY
Subjt:  MLDVTLLVLDMQDFDVILGMDWLSANHANIDCFGKEVVFNPPSGASFKFRGAGMVCIPKVISAMKASKLLSPGTWGILASVVDIREPEVTLSSEPVVREY

Query:  SDVFPDELPGLPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPFP
         DVFPDELPGLPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYP P
Subjt:  SDVFPDELPGLPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPFP

Query:  RIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLH
        RIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFR RYGHYEFVVMSFGLTN PAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLH
Subjt:  RIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLH

Query:  Q-----------------------VTFLGHVVSSEGVSVDPAKIEAVTNWPRPSTVSEIRCFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACE
        Q                       VTFLGHVVSSEGVSVDPAKIEAVTNWPRPSTVSEIR FLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACE
Subjt:  Q-----------------------VTFLGHVVSSEGVSVDPAKIEAVTNWPRPSTVSEIRCFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACE

Query:  SSFQELKQKLVTAPVPTVPDGSGSFVIYSDASKKGLGCVLM
        SSFQELKQKLVTAPV TVPDGS +FVIYSDASKKGLGCVLM
Subjt:  SSFQELKQKLVTAPVPTVPDGSGSFVIYSDASKKGLGCVLM

KAA0060745.1 pol protein [Cucumis melo var. makuwa] (e-value: 1.1e-231; % identity: 92.74)
Query:  MLDVTLLVLDMQDFDVILGMDWLSANHANIDCFGKEVVFNPPSGASFKFRGAGMVCIPKVISAMKASKLLSPGTWGILASVVDIREPEVTLSSEPVVREY
        MLDVTLLVLDMQDFDVILGMDWLSANHANIDCFGKEVVFNPPSGASFKFRGAGMVCIPKVISAMKASKLLS GTWGILASVVDIREPEV+LSSEPVVREY
Subjt:  MLDVTLLVLDMQDFDVILGMDWLSANHANIDCFGKEVVFNPPSGASFKFRGAGMVCIPKVISAMKASKLLSPGTWGILASVVDIREPEVTLSSEPVVREY

Query:  SDVFPDELPGLPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPFP
         DVFPDELPGLPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYP P
Subjt:  SDVFPDELPGLPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPFP

Query:  RIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLH
        +IDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLH
Subjt:  RIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLH

Query:  Q-----------------------VTFLGHVVSSEGVSVDPAKIEAVTNWPRPSTVSEIRCFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACE
        Q                       VTFLGHVVSSEGVSVDPAKIEAVTNWPRPSTVSEIR FLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACE
Subjt:  Q-----------------------VTFLGHVVSSEGVSVDPAKIEAVTNWPRPSTVSEIRCFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACE

Query:  SSFQELKQKLVTAPVPTVPDGSGSFVIYSDASKKGLGCVLM
        SSFQELKQKLV APV TVPDGSG+FVIYSDASKKGLGCVLM
Subjt:  SSFQELKQKLVTAPVPTVPDGSGSFVIYSDASKKGLGCVLM

TrEMBL top hits
A0A5A7SIJ5 Reverse transcriptase (e-value: 2.0e-231; % identity: 92.52)
Query:  MLDVTLLVLDMQDFDVILGMDWLSANHANIDCFGKEVVFNPPSGASFKFRGAGMVCIPKVISAMKASKLLSPGTWGILASVVDIREPEVTLSSEPVVREY
        MLDVTLLVLDMQDFDVILGMDWLSANHANIDCFGKEVVFNPPSGASFKFRGAGMVCIPKVISAMKASKLLS GTWGILASVVD+REPEV+LSSEPVVREY
Subjt:  MLDVTLLVLDMQDFDVILGMDWLSANHANIDCFGKEVVFNPPSGASFKFRGAGMVCIPKVISAMKASKLLSPGTWGILASVVDIREPEVTLSSEPVVREY

Query:  SDVFPDELPGLPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPFP
         DVFPDELPGLPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYP P
Subjt:  SDVFPDELPGLPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPFP

Query:  RIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLH
        RIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLH
Subjt:  RIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLH

Query:  Q-----------------------VTFLGHVVSSEGVSVDPAKIEAVTNWPRPSTVSEIRCFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACE
        Q                       VTFLGHVVSSEGVSVDPAKIEAVTNW RPSTVSEIR FLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACE
Subjt:  Q-----------------------VTFLGHVVSSEGVSVDPAKIEAVTNWPRPSTVSEIRCFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACE

Query:  SSFQELKQKLVTAPVPTVPDGSGSFVIYSDASKKGLGCVLM
         SFQELKQKLVTAPV TVPDGSG+FVIYSDASKKGLGCVLM
Subjt:  SSFQELKQKLVTAPVPTVPDGSGSFVIYSDASKKGLGCVLM

A0A5A7SSL3 Reverse transcriptase (e-value: 2.6e-231; % identity: 92.29)
Query:  MLDVTLLVLDMQDFDVILGMDWLSANHANIDCFGKEVVFNPPSGASFKFRGAGMVCIPKVISAMKASKLLSPGTWGILASVVDIREPEVTLSSEPVVREY
        MLDVTLLVLDMQDFDVILGMDWLSANHANIDCFGKEV+FNPPSGASFKFRGAGMVCIPKVISAMKASKLLS GTWGILASVVDIREPEV+LSSEPVVREY
Subjt:  MLDVTLLVLDMQDFDVILGMDWLSANHANIDCFGKEVVFNPPSGASFKFRGAGMVCIPKVISAMKASKLLSPGTWGILASVVDIREPEVTLSSEPVVREY

Query:  SDVFPDELPGLPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPFP
         DVFPDELPGLPPPREVDF IELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYP P
Subjt:  SDVFPDELPGLPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPFP

Query:  RIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLH
        RIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHY+FVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLH
Subjt:  RIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLH

Query:  Q-----------------------VTFLGHVVSSEGVSVDPAKIEAVTNWPRPSTVSEIRCFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACE
        Q                       VTFLGHVVSSEGVSVDPAKIEAVTNWPRPSTVSEIR FLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACE
Subjt:  Q-----------------------VTFLGHVVSSEGVSVDPAKIEAVTNWPRPSTVSEIRCFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACE

Query:  SSFQELKQKLVTAPVPTVPDGSGSFVIYSDASKKGLGCVLM
         SFQELKQKLVTAPV TVPDGSG+FVIYSDASKKGLGCVLM
Subjt:  SSFQELKQKLVTAPVPTVPDGSGSFVIYSDASKKGLGCVLM

A0A5A7USG7 Reverse transcriptase (e-value: 1.2e-231; % identity: 92.29)
Query:  MLDVTLLVLDMQDFDVILGMDWLSANHANIDCFGKEVVFNPPSGASFKFRGAGMVCIPKVISAMKASKLLSPGTWGILASVVDIREPEVTLSSEPVVREY
        MLDVTLLVLDMQDFDVILGMDWLSANHANIDCFGKEVVFNPPSGASFKFRGAG+VCIPKVISAMKASKLLS GTWGILASVVDIREPEV+LSSEPVVREY
Subjt:  MLDVTLLVLDMQDFDVILGMDWLSANHANIDCFGKEVVFNPPSGASFKFRGAGMVCIPKVISAMKASKLLSPGTWGILASVVDIREPEVTLSSEPVVREY

Query:  SDVFPDELPGLPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPFP
         DVFPDELPGLPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVT+KNRYP P
Subjt:  SDVFPDELPGLPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPFP

Query:  RIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLH
        RIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAF SRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDF+DSFVIVFIDDILIYSKTEAEHEEHLH
Subjt:  RIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLH

Query:  Q-----------------------VTFLGHVVSSEGVSVDPAKIEAVTNWPRPSTVSEIRCFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACE
        Q                       VTFLGHVVSSEGVSVDPAKIEAVTNWPRPSTVSEIR FLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACE
Subjt:  Q-----------------------VTFLGHVVSSEGVSVDPAKIEAVTNWPRPSTVSEIRCFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACE

Query:  SSFQELKQKLVTAPVPTVPDGSGSFVIYSDASKKGLGCVLM
        SSFQELKQKLVTAPV TVPDGSG+FVIYSDASKKGLGCVLM
Subjt:  SSFQELKQKLVTAPVPTVPDGSGSFVIYSDASKKGLGCVLM

A0A5A7UUX8 Reverse transcriptase (e-value: 5.8e-231; % identity: 92.52)
Query:  MLDVTLLVLDMQDFDVILGMDWLSANHANIDCFGKEVVFNPPSGASFKFRGAGMVCIPKVISAMKASKLLSPGTWGILASVVDIREPEVTLSSEPVVREY
        MLDVTLLVLDMQDFDVILGMDWLSANHANIDCFGKEVVFNPPSGASFKFRGAGMVCIPKVISAMKASKLLS GTWGILASVVDIREPEV+LSSEPVVREY
Subjt:  MLDVTLLVLDMQDFDVILGMDWLSANHANIDCFGKEVVFNPPSGASFKFRGAGMVCIPKVISAMKASKLLSPGTWGILASVVDIREPEVTLSSEPVVREY

Query:  SDVFPDELPGLPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPFP
         DVFPDELPGLPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYP P
Subjt:  SDVFPDELPGLPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPFP

Query:  RIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLH
        RIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFR RYGHYEFVVMSFGLTN PAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLH
Subjt:  RIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLH

Query:  Q-----------------------VTFLGHVVSSEGVSVDPAKIEAVTNWPRPSTVSEIRCFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACE
        Q                       VTFLGHVVSSEGVSVDPAKIEAVTNWPRPSTVSEIR FLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACE
Subjt:  Q-----------------------VTFLGHVVSSEGVSVDPAKIEAVTNWPRPSTVSEIRCFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACE

Query:  SSFQELKQKLVTAPVPTVPDGSGSFVIYSDASKKGLGCVLM
        SSFQELKQKLVTAPV TVPDGS +FVIYSDASKKGLGCVLM
Subjt:  SSFQELKQKLVTAPVPTVPDGSGSFVIYSDASKKGLGCVLM

A0A5A7V4E4 Reverse transcriptase (e-value: 5.2e-232; % identity: 92.74)
Query:  MLDVTLLVLDMQDFDVILGMDWLSANHANIDCFGKEVVFNPPSGASFKFRGAGMVCIPKVISAMKASKLLSPGTWGILASVVDIREPEVTLSSEPVVREY
        MLDVTLLVLDMQDFDVILGMDWLSANHANIDCFGKEVVFNPPSGASFKFRGAGMVCIPKVISAMKASKLLS GTWGILASVVDIREPEV+LSSEPVVREY
Subjt:  MLDVTLLVLDMQDFDVILGMDWLSANHANIDCFGKEVVFNPPSGASFKFRGAGMVCIPKVISAMKASKLLSPGTWGILASVVDIREPEVTLSSEPVVREY

Query:  SDVFPDELPGLPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPFP
         DVFPDELPGLPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYP P
Subjt:  SDVFPDELPGLPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPFP

Query:  RIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLH
        +IDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLH
Subjt:  RIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLH

Query:  Q-----------------------VTFLGHVVSSEGVSVDPAKIEAVTNWPRPSTVSEIRCFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACE
        Q                       VTFLGHVVSSEGVSVDPAKIEAVTNWPRPSTVSEIR FLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACE
Subjt:  Q-----------------------VTFLGHVVSSEGVSVDPAKIEAVTNWPRPSTVSEIRCFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACE

Query:  SSFQELKQKLVTAPVPTVPDGSGSFVIYSDASKKGLGCVLM
        SSFQELKQKLV APV TVPDGSG+FVIYSDASKKGLGCVLM
Subjt:  SSFQELKQKLVTAPVPTVPDGSGSFVIYSDASKKGLGCVLM

SwissProt top hits
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 (e-value: 5.7e-58; % identity: 36.86)
Query:  YRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFV-KKKDGS----MRLCIDYRELNKVTVKNRYPFPRIDDLFDQLQGATVFSKIDLRSGYHQLRI
        Y    A  +E++ Q+Q++L++G IR S SP+ +P+  V KK+D S     R+ IDYR+LN++TV +R+P P +D++  +L     F+ IDL  G+HQ+ +
Subjt:  YRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFV-KKKDGS----MRLCIDYRELNKVTVKNRYPFPRIDDLFDQLQGATVFSKIDLRSGYHQLRI

Query:  RDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHL-----------------------HQVTFLG
            + KTAF +++GHYE++ M FGL NAPA F   MN + +  L+   +V++DDI+++S +  EH + L                        + TFLG
Subjt:  RDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHL-----------------------HQVTFLG

Query:  HVVSSEGVSVDPAKIEAVTNWPRPSTVSEIRCFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPF-VWSPACESSFQELKQKLVTAPVPTVPDGSGSFVIY
        HV++ +G+  +P KIEA+  +P P+   EI+ FLGL GYYR+F+ +F+ IA P+T+  +K       +P  +S+F++LK  +   P+  VPD +  F + 
Subjt:  HVVSSEGVSVDPAKIEAVTNWPRPSTVSEIRCFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPF-VWSPACESSFQELKQKLVTAPVPTVPDGSGSFVIY

Query:  SDASKKGLGCVL
        +DAS   LG VL
Subjt:  SDASKKGLGCVL

P0CT41 Transposon Tf2-12 polyprotein (e-value: 2.0e-55; % identity: 32.87)
Query:  IREPEVTLSSEPVVREYSDVFPD-ELPGLPPP-REVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRL
        ++EPE+      + +E+ D+  +     LP P + ++F +EL      +    Y + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+
Subjt:  IREPEVTLSSEPVVREYSDVFPD-ELPGLPPP-REVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRL

Query:  CIDYRELNKVTVKNRYPFPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVF
         +DY+ LNK    N YP P I+ L  ++QG+T+F+K+DL+S YH +R+R GD  K AFR   G +E++VM +G++ APA F   +N +  +  +S V+ +
Subjt:  CIDYRELNKVTVKNRYPFPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVF

Query:  IDDILIYSKTEAEH-----------------------EEHLHQVTFLGHVVSSEGVSVDPAKIEAVTNWPRPSTVSEIRCFLGLAGYYRRFVEDFSRIAS
        +DDILI+SK+E+EH                       E H  QV F+G+ +S +G +     I+ V  W +P    E+R FLG   Y R+F+   S++  
Subjt:  IDDILIYSKTEAEH-----------------------EEHLHQVTFLGHVVSSEGVSVDPAKIEAVTNWPRPSTVSEIRCFLGLAGYYRRFVEDFSRIAS

Query:  PLTQLTRKGTPFVWSPACESSFQELKQKLVTAPVPTVPDGSGSFVIYSDASKKGLGCVL
        PL  L +K   + W+P    + + +KQ LV+ PV    D S   ++ +DAS   +G VL
Subjt:  PLTQLTRKGTPFVWSPACESSFQELKQKLVTAPVPTVPDGSGSFVIYSDASKKGLGCVL

P20825 Retrovirus-related Pol polyprotein from transposon 297 (e-value: 1.1e-56; % identity: 35.42)
Query:  APISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKD-----GSMRLCIDYRELNKVTVKNRYPFPRIDDLFDQLQGATVFSKIDLRS
        +PI    Y +A     E++ Q+QE+L++G IR S SP+ +P   V KK         R+ IDYR+LN++T+ +RYP P +D++  +L     F+ IDL  
Subjt:  APISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKD-----GSMRLCIDYRELNKVTVKNRYPFPRIDDLFDQLQGATVFSKIDLRS

Query:  GYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLH----------------------
        G+HQ+ + +  I KTAF ++ GHYE++ M FGL NAPA F   MN + +  L+   +V++DDI+I+S +  EH   +                       
Subjt:  GYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLH----------------------

Query:  -QVTFLGHVVSSEGVSVDPAKIEAVTNWPRPSTVSEIRCFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPF-VWSPACESSFQELKQKLVTAPVPTVPDG
         +  FLGH+V+ +G+  +P K++A+ ++P P+   EIR FLGL GYYR+F+ +++ IA P+T   +K T           +F++LK  ++  P+  +PD 
Subjt:  -QVTFLGHVVSSEGVSVDPAKIEAVTNWPRPSTVSEIRCFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPF-VWSPACESSFQELKQKLVTAPVPTVPDG

Query:  SGSFVIYSDASKKGLGCVL
           FV+ +DAS   LG VL
Subjt:  SGSFVIYSDASKKGLGCVL

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein (e-value: 6.2e-57; % identity: 33.59)
Query:  KASKLLSPGTWGILASVVDIREPEVTLSSEP---------VVREYSDVFPDELPGLPPPREVD-----FAIELEPGTAPISRAPYRMAPAELKELKVQLQ
        +AS L   G +  + S +   EP  T  S           + ++Y ++  ++LP  P P +++       IE++PG       PY +     +E+   +Q
Subjt:  KASKLLSPGTWGILASVVDIREPEVTLSSEP---------VVREYSDVFPDELPGLPPPREVD-----FAIELEPGTAPISRAPYRMAPAELKELKVQLQ

Query:  ELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPFPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVV
        +LLD  FI PS SP  +PV+ V KKDG+ RLC+DYR LNK T+ + +P PRID+L  ++  A +F+ +DL SGYHQ+ +   D  KTAF +  G YE+ V
Subjt:  ELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPFPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVV

Query:  MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHL-----------------------HQVTFLGHVVSSEGVSVDPAKIEAVTNW
        M FGL NAP+ F   M   F+D    FV V++DDILI+S++  EH +HL                        +  FLG+ +  + ++    K  A+ ++
Subjt:  MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHL-----------------------HQVTFLGHVVSSEGVSVDPAKIEAVTNW

Query:  PRPSTVSEIRCFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACESSFQELKQKLVTAPVPTVPDGSGSFVIYSDASKKGLGCVL
        P P TV + + FLG+  YYRRF+ + S+IA P+       +   W+   + + ++LK  L  +PV    +   ++ + +DASK G+G VL
Subjt:  PRPSTVSEIRCFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACESSFQELKQKLVTAPVPTVPDGSGSFVIYSDASKKGLGCVL

Q99315 Transposon Ty3-G Gag-Pol polyprotein (e-value: 8.2e-57; % identity: 33.59)
Query:  KASKLLSPGTWGILASVVDIREPEVTLSSEP---------VVREYSDVFPDELPGLPPPREVD-----FAIELEPGTAPISRAPYRMAPAELKELKVQLQ
        +AS L   G +  + S +   EP  T  S           + ++Y ++  ++LP  P P +++       IE++PG       PY +     +E+   +Q
Subjt:  KASKLLSPGTWGILASVVDIREPEVTLSSEP---------VVREYSDVFPDELPGLPPPREVD-----FAIELEPGTAPISRAPYRMAPAELKELKVQLQ

Query:  ELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPFPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVV
        +LLD  FI PS SP  +PV+ V KKDG+ RLC+DYR LNK T+ + +P PRID+L  ++  A +F+ +DL SGYHQ+ +   D  KTAF +  G YE+ V
Subjt:  ELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPFPRIDDLFDQLQGATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVV

Query:  MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHL-----------------------HQVTFLGHVVSSEGVSVDPAKIEAVTNW
        M FGL NAP+ F   M   F+D    FV V++DDILI+S++  EH +HL                        +  FLG+ +  + ++    K  A+ ++
Subjt:  MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHL-----------------------HQVTFLGHVVSSEGVSVDPAKIEAVTNW

Query:  PRPSTVSEIRCFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACESSFQELKQKLVTAPVPTVPDGSGSFVIYSDASKKGLGCVL
        P P TV + + FLG+  YYRRF+ + S+IA P+       +   W+   + +  +LK  L  +PV    +   ++ + +DASK G+G VL
Subjt:  PRPSTVSEIRCFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACESSFQELKQKLVTAPVPTVPDGSGSFVIYSDASKKGLGCVL

Arabidopsis top hits
ATMG00860.1 DNA/RNA polymerases superfamily protein (e-value: 6.7e-22; % identity: 45.71)
Query:  QVTFLG--HVVSSEGVSVDPAKIEAVTNWPRPSTVSEIRCFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACESSFQELKQKLVTAPVPTVPDG
        Q+ +LG  H++S EGVS DPAK+EA+  WP P   +E+R FLGL GYYRRFV+++ +I  PLT+L +K +   W+     +F+ LK  + T PV  +PD 
Subjt:  QVTFLG--HVVSSEGVSVDPAKIEAVTNWPRPSTVSEIRCFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACESSFQELKQKLVTAPVPTVPDG

Query:  SGSFV
           FV
Subjt:  SGSFV
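
In the alignments above, the unlabelled middle line of each row marks every aligned column: a letter where the residues are identical, `+` where the substitution is similar, and a blank for a mismatch or gap. A minimal sketch of tallying those columns for one row follows; `summarise_midline` is a hypothetical helper introduced only for illustration. The reported % identity is computed over a whole alignment, so a single row will not reproduce it.

```python
def summarise_midline(midline, width):
    """Count identical, similar ('+') and remaining columns in one alignment row.

    `width` is the number of aligned columns in the row; the midline is
    padded with spaces because trailing blanks are often stripped.
    """
    padded = midline.ljust(width)[:width]
    identical = sum(ch.isalpha() for ch in padded)
    similar = padded.count("+")
    other = width - identical - similar
    return identical, similar, other

# Last row of the Arabidopsis hit: query 'SGSFV', with 'FV' marked
# as identical under the final two columns.
query = "SGSFV"
identical, similar, other = summarise_midline("   FV", len(query))
print(identical, similar, other)
```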


Sequences
CDS sequence
ATGTTAGACGTGACCTTACTAGTGTTAGACATGCAGGATTTTGACGTAATTTTAGGCATGGATTGGCTGTCAGCTAACCATGCAAATATAGACTGTTTTGGTAAG
GAAGTTGTCTTTAACCCTCCCTCCGGGGCTAGTTTCAAATTTAGGGGGGCAGGCATGGTATGTATACCCAAGGTCATCTCAGCCATGAAGGCTAGTAAACTACTC
AGCCCGGGTACTTGGGGCATCTTGGCAAGCGTAGTAGATATTAGAGAGCCAGAAGTTACCCTATCTTCCGAACCAGTGGTAAGGGAGTACTCTGATGTTTTCCCC
GACGAACTCCCAGGACTTCCGCCTCCCAGGGAGGTAGACTTCGCCATCGAGTTAGAGCCGGGCACTGCCCCTATCTCGAGGGCCCCTTACAGAATGGCTCCAGCC
GAGCTAAAAGAGTTGAAGGTCCAGTTACAGGAGTTACTGGACAAGGGTTTTATCCGGCCCAGTGTGTCACCTTGGGGAGCCCCAGTGTTGTTCGTGAAGAAGAAG
GATGGGTCGATGCGCCTTTGTATTGACTACCGAGAGCTGAACAAGGTGACAGTTAAAAACCGCTACCCCTTTCCCAGGATTGATGACTTGTTCGATCAGTTGCAG
GGAGCCACTGTATTTTCCAAGATCGACCTGCGGTCAGGCTATCACCAGCTGAGGATTAGGGACGGTGACATTCCCAAGACGGCCTTTCGTTCGAGGTACGGACAT
TACGAGTTTGTTGTGATGTCTTTCGGCTTGACTAACGCTCCTGCAGTGTTCATGGATTTGATGAACAGGGTGTTTAAGGACTTTCTAGACTCGTTCGTCATAGTC
TTCATTGATGACATCTTGATTTACTCAAAAACTGAGGCCGAGCACGAGGAGCACTTGCACCAGGTGACGTTCCTTGGCCACGTGGTTTCCAGTGAGGGAGTTTCT
GTGGATCCAGCAAAGATTGAAGCGGTGACCAACTGGCCTCGACCGTCCACAGTTAGTGAAATTCGATGTTTTCTGGGCTTGGCAGGTTACTACAGGAGGTTCGTG
GAAGACTTCTCACGTATAGCCAGCCCGTTGACCCAGTTGACCAGGAAGGGAACCCCTTTTGTCTGGAGCCCAGCATGCGAGAGTAGCTTTCAGGAGCTCAAGCAG
AAACTGGTGACTGCACCAGTCCCGACAGTGCCCGATGGGTCGGGAAGCTTTGTGATCTATAGTGATGCCTCCAAGAAGGGACTGGGCTGTGTCCTGATGTAG
mRNA sequence
ATGTTAGACGTGACCTTACTAGTGTTAGACATGCAGGATTTTGACGTAATTTTAGGCATGGATTGGCTGTCAGCTAACCATGCAAATATAGACTGTTTTGGTAAG
GAAGTTGTCTTTAACCCTCCCTCCGGGGCTAGTTTCAAATTTAGGGGGGCAGGCATGGTATGTATACCCAAGGTCATCTCAGCCATGAAGGCTAGTAAACTACTC
AGCCCGGGTACTTGGGGCATCTTGGCAAGCGTAGTAGATATTAGAGAGCCAGAAGTTACCCTATCTTCCGAACCAGTGGTAAGGGAGTACTCTGATGTTTTCCCC
GACGAACTCCCAGGACTTCCGCCTCCCAGGGAGGTAGACTTCGCCATCGAGTTAGAGCCGGGCACTGCCCCTATCTCGAGGGCCCCTTACAGAATGGCTCCAGCC
GAGCTAAAAGAGTTGAAGGTCCAGTTACAGGAGTTACTGGACAAGGGTTTTATCCGGCCCAGTGTGTCACCTTGGGGAGCCCCAGTGTTGTTCGTGAAGAAGAAG
GATGGGTCGATGCGCCTTTGTATTGACTACCGAGAGCTGAACAAGGTGACAGTTAAAAACCGCTACCCCTTTCCCAGGATTGATGACTTGTTCGATCAGTTGCAG
GGAGCCACTGTATTTTCCAAGATCGACCTGCGGTCAGGCTATCACCAGCTGAGGATTAGGGACGGTGACATTCCCAAGACGGCCTTTCGTTCGAGGTACGGACAT
TACGAGTTTGTTGTGATGTCTTTCGGCTTGACTAACGCTCCTGCAGTGTTCATGGATTTGATGAACAGGGTGTTTAAGGACTTTCTAGACTCGTTCGTCATAGTC
TTCATTGATGACATCTTGATTTACTCAAAAACTGAGGCCGAGCACGAGGAGCACTTGCACCAGGTGACGTTCCTTGGCCACGTGGTTTCCAGTGAGGGAGTTTCT
GTGGATCCAGCAAAGATTGAAGCGGTGACCAACTGGCCTCGACCGTCCACAGTTAGTGAAATTCGATGTTTTCTGGGCTTGGCAGGTTACTACAGGAGGTTCGTG
GAAGACTTCTCACGTATAGCCAGCCCGTTGACCCAGTTGACCAGGAAGGGAACCCCTTTTGTCTGGAGCCCAGCATGCGAGAGTAGCTTTCAGGAGCTCAAGCAG
AAACTGGTGACTGCACCAGTCCCGACAGTGCCCGATGGGTCGGGAAGCTTTGTGATCTATAGTGATGCCTCCAAGAAGGGACTGGGCTGTGTCCTGATGTAG
Protein sequence
MLDVTLLVLDMQDFDVILGMDWLSANHANIDCFGKEVVFNPPSGASFKFRGAGMVCIPKVISAMKASKLLSPGTWGILASVVDIREPEVTLSSEPVVREYSDVFP
DELPGLPPPREVDFAIELEPGTAPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPFPRIDDLFDQLQ
GATVFSKIDLRSGYHQLRIRDGDIPKTAFRSRYGHYEFVVMSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIDDILIYSKTEAEHEEHLHQVTFLGHVVSSEGVS
VDPAKIEAVTNWPRPSTVSEIRCFLGLAGYYRRFVEDFSRIASPLTQLTRKGTPFVWSPACESSFQELKQKLVTAPVPTVPDGSGSFVIYSDASKKGLGCVLM
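
The protein sequence corresponds to a codon-by-codon translation of the CDS. As a minimal sketch, assuming the standard genetic code (the record does not name a translation table), the relationship can be checked on the first ten codons and on the last three codons of the CDS. `translate` is a hypothetical helper, not a CuGenDB function.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGTTAGACGTGACCTTACTAGTGTTAGAC"))  # MLDVTLLVLD
print(translate("CTGATGTAG"))                       # LM
```

The two outputs match the start (`MLDVTLLVLD`) and the final residues (`...LM`) of the protein sequence listed above.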