; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc01g0024191 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc01g0024191
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr01:24563284..24565226
RNA-Seq ExpressionCmc01g0024191
SyntenyCmc01g0024191
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0016020 - membrane (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047209.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0094.84Show/hide
Query:  MASFKFKGEESRSLPQVISAIKASKLLSQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELESATVPISKAPYRMAPAELK
        MASFKFKGE SRSLPQVISAI+ASKLLSQGTW ILASVVDTREVD+SLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELE  TVPIS+APYRMAPAELK
Subjt:  MASFKFKGEESRSLPQVISAIKASKLLSQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELESATVPISKAPYRMAPAELK

Query:  ELKVQLQKLLDKGFIRSSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRY
        ELKVQLQ+LLDKGFIRSSVSPWGA VLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAF SRY
Subjt:  ELKVQLQKLLDKGFIRSSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRY

Query:  GHYEFIVMSFGLTNAPAVFMDLMNRVYREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVYPAK
        GHYEFIVMSFGLTNAPAVFMDLMNRV+REFL TFVIVFIDDILIYSKT+AEHE+HLR+VLQTL+DNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSV PAK
Subjt:  GHYEFIVMSFGLTNAPAVFMDLMNRVYREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVYPAK

Query:  IEVVTNWTRPSIVSELTRKGAHFVGSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGRVVAYASRQLKRHEQNYPTHDLELAA
        IEVVT WTRPS VSELTRKGA FV SKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQG+VVAYASRQLK HEQNYPTHDLELAA
Subjt:  IEVVTNWTRPSIVSELTRKGAHFVGSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGRVVAYASRQLKRHEQNYPTHDLELAA

Query:  VVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAV
        VVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDL+RAEIAVSVGAV
Subjt:  VVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAV

Query:  TMQLAQLTVQPTLRQRIIDAQSKNPYLVEKRGLAEAGQAVEFSISSDGGLLFERCLCAPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMK
        TMQLAQLTVQPTLRQRIIDAQ  +PY+VEKRGLAEAGQAVEFS+SSDGGLLFERCLC PSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLK+VYWWRNMK
Subjt:  TMQLAQLTVQPTLRQRIIDAQSKNPYLVEKRGLAEAGQAVEFSISSDGGLLFERCLCAPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMK

Query:  REVAEFVSRCLVCQQVKAPR
        REVAEFVSRCLVCQ VKAPR
Subjt:  REVAEFVSRCLVCQQVKAPR

KAA0048687.1 pol protein [Cucumis melo var. makuwa]0.0e+0090.87Show/hide
Query:  ASFKFKGEESRSLPQVISAIKASKLLSQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELESATVPISKAPYRMAPAELKE
        ASFKFKG  SRSLPQVISAI+ASKLLSQGTW ILASVVDTRE DVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELE  TVPIS+APYRMAPAELKE
Subjt:  ASFKFKGEESRSLPQVISAIKASKLLSQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELESATVPISKAPYRMAPAELKE

Query:  LKVQLQKLLDKGFIRSSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYG
        LKVQLQ+LLDKGFIR SVSPWGA VLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPKTAFRSRYG
Subjt:  LKVQLQKLLDKGFIRSSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYG

Query:  HYEFIVMSFGLTNAPAVFMDLMNRVYREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVYPAKI
        HYEFIVMSFGLTNAPAVFMDLMNRV+REFL TFVIVFIDDILIYSKTEAEHEEHLRMVLQTL+DNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSV PAKI
Subjt:  HYEFIVMSFGLTNAPAVFMDLMNRVYREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVYPAKI

Query:  EVVTNWTRPSIVSE---------------------------LTRKGAHFVGSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQG
        E VT WTRPS VSE                           LTRKGA FV SKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQG
Subjt:  EVVTNWTRPSIVSE---------------------------LTRKGAHFVGSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQG

Query:  RVVAYASRQLKRHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHS
        +VVAYASRQLK HEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHS
Subjt:  RVVAYASRQLKRHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHS

Query:  AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSKNPYLVEKRGLAEAGQAVEFSISSDGGLLFERCLCAPSDSAVKTELLSEAHS
        AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQS +PYLVEKRGLAEAGQAVEFS+SSDGGLLFER LC PSDS VKTELLSEAHS
Subjt:  AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSKNPYLVEKRGLAEAGQAVEFSISSDGGLLFERCLCAPSDSAVKTELLSEAHS

Query:  SPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSRCLVCQQVKAPR
        SPFSMHPGSTKMY+D+K+VYWWRNMKREVAEFVSRCLVCQQVKAPR
Subjt:  SPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSRCLVCQQVKAPR

KAA0056702.1 pol protein [Cucumis melo var. makuwa]0.0e+0090.88Show/hide
Query:  MASFKFKGEESRSLPQVISAIKASKLLSQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELESATVPISKAPYRMAPAELK
        MASFKFKG  S+SLPQVISAI+ASKLLSQGTW ILASVVDTREVDVSLSSEPVVRDYPDVFPEELP LPPHREVEFAIE E  TVPIS+APYRMAPAELK
Subjt:  MASFKFKGEESRSLPQVISAIKASKLLSQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELESATVPISKAPYRMAPAELK

Query:  ELKVQLQKLLDKGFIRSSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRY
        ELKVQLQKLLDKGFIR SVSPWGA VLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRY
Subjt:  ELKVQLQKLLDKGFIRSSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRY

Query:  GHYEFIVMSFGLTNAPAVFMDLMNRVYREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVYPAK
        GHYEFIVMSFGLTNAPAVFMDLMNRV+REFL TFVIVFIDDILIYSKTEAEHEEHLRMVLQTL+DNKLYA+FSK EFWLKQVSFLGHVVSKAGVSV PAK
Subjt:  GHYEFIVMSFGLTNAPAVFMDLMNRVYREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVYPAK

Query:  IEVVTNWTRPSIVSE---------------------------LTRKGAHFVGSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQ
        IE V  WTRPS VSE                           LTRKGA FV SKACEDSFQNLKQKLVTAPV+TVPDGSGSFVIYSDASKKGLGCVLMQQ
Subjt:  IEVVTNWTRPSIVSE---------------------------LTRKGAHFVGSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQ

Query:  GRVVAYASRQLKRHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSH
        G+VVAYASRQLK HEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSH
Subjt:  GRVVAYASRQLKRHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSH

Query:  SAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSKNPYLVEKRGLAEAGQAVEFSISSDGGLLFERCLCAPSDSAVKTELLSEAH
        SAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQS +PYLVEKRGLAEAGQAVEFS+SSDGGLLFERCLC PSDSAVKTELLSEAH
Subjt:  SAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSKNPYLVEKRGLAEAGQAVEFSISSDGGLLFERCLCAPSDSAVKTELLSEAH

Query:  SSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSRCLVCQQVKAPR
        SSPFSMHPGSTKMYQDLK+VYWWRNMKREVAEFVSRCLVCQQVKAPR
Subjt:  SSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSRCLVCQQVKAPR

KAA0059069.1 pol protein [Cucumis melo var. makuwa]0.0e+0093.87Show/hide
Query:  MASFKFKGEESRSLPQVISAIKASKLLSQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELESATVPISKAPYRMAPAELK
        +ASFKFKG  SRSLPQVISAI+ASKLLSQGTW ILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEF IELE  TVPIS+APY+MAPAELK
Subjt:  MASFKFKGEESRSLPQVISAIKASKLLSQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELESATVPISKAPYRMAPAELK

Query:  ELKVQLQKLLDKGFIRSSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRY
        ELKVQLQ+LLDKGFIR SVSPWGA VLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKT FRSRY
Subjt:  ELKVQLQKLLDKGFIRSSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRY

Query:  GHYEFIVMSFGLTNAPAVFMDLMNRVYREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVYPAK
        GHYEFIVMSFGLTNAPAVFMDLMNRV+REFL TFVIVFIDDILIYSKTEAEHEEHLR+VLQTL+DNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSV PA 
Subjt:  GHYEFIVMSFGLTNAPAVFMDLMNRVYREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVYPAK

Query:  IEVVTNWTRPSIVSELTRKGAHFVGSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGRVVAYASRQLKRHEQNYPTHDLELAA
        IE VT WTRPS VSELTRKGA FV SKACEDSFQNLK KLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQ +VVAYASRQLK HEQNYPTHDLELAA
Subjt:  IEVVTNWTRPSIVSELTRKGAHFVGSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGRVVAYASRQLKRHEQNYPTHDLELAA

Query:  VVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAV
        VVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLE AEIAVSVGAV
Subjt:  VVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAV

Query:  TMQLAQLTVQPTLRQRIIDAQSKNPYLVEKRGLAEAGQAVEFSISSDGGLLFERCLCAPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMK
        TMQLAQLTVQPTLRQRIIDAQS +PYLVEK GLAE GQAVEFSISSDGGLLFER LC PSDSAVKTELLSEAHSSPFSMHPGSTKMY DLK+VYWWRNMK
Subjt:  TMQLAQLTVQPTLRQRIIDAQSKNPYLVEKRGLAEAGQAVEFSISSDGGLLFERCLCAPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMK

Query:  REVAEFVSRCLVCQQVKAPR
        REVAEFVSRCLVCQQVKAPR
Subjt:  REVAEFVSRCLVCQQVKAPR

TYK01613.1 pol protein [Cucumis melo var. makuwa]0.0e+0090.57Show/hide
Query:  MASFKFKGEESRSLPQVISAIKASKLLSQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELESATVPISKAPYRMAPAELK
        MASFKFKG  S+SLPQVISAI+ASKLLSQGTW ILASVVDTRE DVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELE  TVPIS+APYRMAPAELK
Subjt:  MASFKFKGEESRSLPQVISAIKASKLLSQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELESATVPISKAPYRMAPAELK

Query:  ELKVQLQKLLDKGFIRSSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRY
        ELKVQLQ+LLDKGFIR SVSPWGA VLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPKTAFRSRY
Subjt:  ELKVQLQKLLDKGFIRSSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRY

Query:  GHYEFIVMSFGLTNAPAVFMDLMNRVYREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVYPAK
        GHYEFIVMSFGLTNAPAVFMDLMNRV+REFL TFVIVFIDDILIYSKTEAEHEEHLRMVLQTL+DNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSV PAK
Subjt:  GHYEFIVMSFGLTNAPAVFMDLMNRVYREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVYPAK

Query:  IEVVTNWTRPSIVSE---------------------------LTRKGAHFVGSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQ
        IE VT WTRPS VSE                           LTRKGA FV SKACEDSFQ LKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQ
Subjt:  IEVVTNWTRPSIVSE---------------------------LTRKGAHFVGSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQ

Query:  GRVVAYASRQLKRHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSH
        G+VVAYASRQLK HEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSH
Subjt:  GRVVAYASRQLKRHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSH

Query:  SAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSKNPYLVEKRGLAEAGQAVEFSISSDGGLLFERCLCAPSDSAVKTELLSEAH
        SAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQS +PYLVEKRGLAEAGQ  EFS+SSDGGLLFER LC PSDSAVKTELLSEAH
Subjt:  SAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSKNPYLVEKRGLAEAGQAVEFSISSDGGLLFERCLCAPSDSAVKTELLSEAH

Query:  SSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSRCLVCQQVKAPR
        SSPFSMHPGSTKMYQDLK+VYWWRNMKREVAEFVS+CLVCQQVKAPR
Subjt:  SSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSRCLVCQQVKAPR

TrEMBL top hitse value%identityAlignment
A0A5A7U149 Reverse transcriptase0.0e+0094.84Show/hide
Query:  MASFKFKGEESRSLPQVISAIKASKLLSQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELESATVPISKAPYRMAPAELK
        MASFKFKGE SRSLPQVISAI+ASKLLSQGTW ILASVVDTREVD+SLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELE  TVPIS+APYRMAPAELK
Subjt:  MASFKFKGEESRSLPQVISAIKASKLLSQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELESATVPISKAPYRMAPAELK

Query:  ELKVQLQKLLDKGFIRSSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRY
        ELKVQLQ+LLDKGFIRSSVSPWGA VLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAF SRY
Subjt:  ELKVQLQKLLDKGFIRSSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRY

Query:  GHYEFIVMSFGLTNAPAVFMDLMNRVYREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVYPAK
        GHYEFIVMSFGLTNAPAVFMDLMNRV+REFL TFVIVFIDDILIYSKT+AEHE+HLR+VLQTL+DNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSV PAK
Subjt:  GHYEFIVMSFGLTNAPAVFMDLMNRVYREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVYPAK

Query:  IEVVTNWTRPSIVSELTRKGAHFVGSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGRVVAYASRQLKRHEQNYPTHDLELAA
        IEVVT WTRPS VSELTRKGA FV SKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQG+VVAYASRQLK HEQNYPTHDLELAA
Subjt:  IEVVTNWTRPSIVSELTRKGAHFVGSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGRVVAYASRQLKRHEQNYPTHDLELAA

Query:  VVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAV
        VVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDL+RAEIAVSVGAV
Subjt:  VVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAV

Query:  TMQLAQLTVQPTLRQRIIDAQSKNPYLVEKRGLAEAGQAVEFSISSDGGLLFERCLCAPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMK
        TMQLAQLTVQPTLRQRIIDAQ  +PY+VEKRGLAEAGQAVEFS+SSDGGLLFERCLC PSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLK+VYWWRNMK
Subjt:  TMQLAQLTVQPTLRQRIIDAQSKNPYLVEKRGLAEAGQAVEFSISSDGGLLFERCLCAPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMK

Query:  REVAEFVSRCLVCQQVKAPR
        REVAEFVSRCLVCQ VKAPR
Subjt:  REVAEFVSRCLVCQQVKAPR

A0A5A7U330 Reverse transcriptase0.0e+0090.87Show/hide
Query:  ASFKFKGEESRSLPQVISAIKASKLLSQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELESATVPISKAPYRMAPAELKE
        ASFKFKG  SRSLPQVISAI+ASKLLSQGTW ILASVVDTRE DVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELE  TVPIS+APYRMAPAELKE
Subjt:  ASFKFKGEESRSLPQVISAIKASKLLSQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELESATVPISKAPYRMAPAELKE

Query:  LKVQLQKLLDKGFIRSSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYG
        LKVQLQ+LLDKGFIR SVSPWGA VLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPKTAFRSRYG
Subjt:  LKVQLQKLLDKGFIRSSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYG

Query:  HYEFIVMSFGLTNAPAVFMDLMNRVYREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVYPAKI
        HYEFIVMSFGLTNAPAVFMDLMNRV+REFL TFVIVFIDDILIYSKTEAEHEEHLRMVLQTL+DNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSV PAKI
Subjt:  HYEFIVMSFGLTNAPAVFMDLMNRVYREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVYPAKI

Query:  EVVTNWTRPSIVSE---------------------------LTRKGAHFVGSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQG
        E VT WTRPS VSE                           LTRKGA FV SKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQG
Subjt:  EVVTNWTRPSIVSE---------------------------LTRKGAHFVGSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQG

Query:  RVVAYASRQLKRHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHS
        +VVAYASRQLK HEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHS
Subjt:  RVVAYASRQLKRHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHS

Query:  AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSKNPYLVEKRGLAEAGQAVEFSISSDGGLLFERCLCAPSDSAVKTELLSEAHS
        AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQS +PYLVEKRGLAEAGQAVEFS+SSDGGLLFER LC PSDS VKTELLSEAHS
Subjt:  AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSKNPYLVEKRGLAEAGQAVEFSISSDGGLLFERCLCAPSDSAVKTELLSEAHS

Query:  SPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSRCLVCQQVKAPR
        SPFSMHPGSTKMY+D+K+VYWWRNMKREVAEFVSRCLVCQQVKAPR
Subjt:  SPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSRCLVCQQVKAPR

A0A5A7ULI8 Pol protein0.0e+0090.88Show/hide
Query:  MASFKFKGEESRSLPQVISAIKASKLLSQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELESATVPISKAPYRMAPAELK
        MASFKFKG  S+SLPQVISAI+ASKLLSQGTW ILASVVDTREVDVSLSSEPVVRDYPDVFPEELP LPPHREVEFAIE E  TVPIS+APYRMAPAELK
Subjt:  MASFKFKGEESRSLPQVISAIKASKLLSQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELESATVPISKAPYRMAPAELK

Query:  ELKVQLQKLLDKGFIRSSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRY
        ELKVQLQKLLDKGFIR SVSPWGA VLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRY
Subjt:  ELKVQLQKLLDKGFIRSSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRY

Query:  GHYEFIVMSFGLTNAPAVFMDLMNRVYREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVYPAK
        GHYEFIVMSFGLTNAPAVFMDLMNRV+REFL TFVIVFIDDILIYSKTEAEHEEHLRMVLQTL+DNKLYA+FSK EFWLKQVSFLGHVVSKAGVSV PAK
Subjt:  GHYEFIVMSFGLTNAPAVFMDLMNRVYREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVYPAK

Query:  IEVVTNWTRPSIVSE---------------------------LTRKGAHFVGSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQ
        IE V  WTRPS VSE                           LTRKGA FV SKACEDSFQNLKQKLVTAPV+TVPDGSGSFVIYSDASKKGLGCVLMQQ
Subjt:  IEVVTNWTRPSIVSE---------------------------LTRKGAHFVGSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQ

Query:  GRVVAYASRQLKRHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSH
        G+VVAYASRQLK HEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSH
Subjt:  GRVVAYASRQLKRHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSH

Query:  SAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSKNPYLVEKRGLAEAGQAVEFSISSDGGLLFERCLCAPSDSAVKTELLSEAH
        SAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQS +PYLVEKRGLAEAGQAVEFS+SSDGGLLFERCLC PSDSAVKTELLSEAH
Subjt:  SAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSKNPYLVEKRGLAEAGQAVEFSISSDGGLLFERCLCAPSDSAVKTELLSEAH

Query:  SSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSRCLVCQQVKAPR
        SSPFSMHPGSTKMYQDLK+VYWWRNMKREVAEFVSRCLVCQQVKAPR
Subjt:  SSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSRCLVCQQVKAPR

A0A5A7UZV8 Reverse transcriptase0.0e+0093.87Show/hide
Query:  MASFKFKGEESRSLPQVISAIKASKLLSQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELESATVPISKAPYRMAPAELK
        +ASFKFKG  SRSLPQVISAI+ASKLLSQGTW ILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEF IELE  TVPIS+APY+MAPAELK
Subjt:  MASFKFKGEESRSLPQVISAIKASKLLSQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELESATVPISKAPYRMAPAELK

Query:  ELKVQLQKLLDKGFIRSSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRY
        ELKVQLQ+LLDKGFIR SVSPWGA VLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKT FRSRY
Subjt:  ELKVQLQKLLDKGFIRSSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRY

Query:  GHYEFIVMSFGLTNAPAVFMDLMNRVYREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVYPAK
        GHYEFIVMSFGLTNAPAVFMDLMNRV+REFL TFVIVFIDDILIYSKTEAEHEEHLR+VLQTL+DNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSV PA 
Subjt:  GHYEFIVMSFGLTNAPAVFMDLMNRVYREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVYPAK

Query:  IEVVTNWTRPSIVSELTRKGAHFVGSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGRVVAYASRQLKRHEQNYPTHDLELAA
        IE VT WTRPS VSELTRKGA FV SKACEDSFQNLK KLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQ +VVAYASRQLK HEQNYPTHDLELAA
Subjt:  IEVVTNWTRPSIVSELTRKGAHFVGSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGRVVAYASRQLKRHEQNYPTHDLELAA

Query:  VVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAV
        VVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLE AEIAVSVGAV
Subjt:  VVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAV

Query:  TMQLAQLTVQPTLRQRIIDAQSKNPYLVEKRGLAEAGQAVEFSISSDGGLLFERCLCAPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMK
        TMQLAQLTVQPTLRQRIIDAQS +PYLVEK GLAE GQAVEFSISSDGGLLFER LC PSDSAVKTELLSEAHSSPFSMHPGSTKMY DLK+VYWWRNMK
Subjt:  TMQLAQLTVQPTLRQRIIDAQSKNPYLVEKRGLAEAGQAVEFSISSDGGLLFERCLCAPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMK

Query:  REVAEFVSRCLVCQQVKAPR
        REVAEFVSRCLVCQQVKAPR
Subjt:  REVAEFVSRCLVCQQVKAPR

A0A5D3BPI1 Reverse transcriptase0.0e+0090.57Show/hide
Query:  MASFKFKGEESRSLPQVISAIKASKLLSQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELESATVPISKAPYRMAPAELK
        MASFKFKG  S+SLPQVISAI+ASKLLSQGTW ILASVVDTRE DVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELE  TVPIS+APYRMAPAELK
Subjt:  MASFKFKGEESRSLPQVISAIKASKLLSQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELESATVPISKAPYRMAPAELK

Query:  ELKVQLQKLLDKGFIRSSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRY
        ELKVQLQ+LLDKGFIR SVSPWGA VLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPKTAFRSRY
Subjt:  ELKVQLQKLLDKGFIRSSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRY

Query:  GHYEFIVMSFGLTNAPAVFMDLMNRVYREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVYPAK
        GHYEFIVMSFGLTNAPAVFMDLMNRV+REFL TFVIVFIDDILIYSKTEAEHEEHLRMVLQTL+DNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSV PAK
Subjt:  GHYEFIVMSFGLTNAPAVFMDLMNRVYREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVYPAK

Query:  IEVVTNWTRPSIVSE---------------------------LTRKGAHFVGSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQ
        IE VT WTRPS VSE                           LTRKGA FV SKACEDSFQ LKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQ
Subjt:  IEVVTNWTRPSIVSE---------------------------LTRKGAHFVGSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQ

Query:  GRVVAYASRQLKRHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSH
        G+VVAYASRQLK HEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSH
Subjt:  GRVVAYASRQLKRHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSH

Query:  SAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSKNPYLVEKRGLAEAGQAVEFSISSDGGLLFERCLCAPSDSAVKTELLSEAH
        SAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQS +PYLVEKRGLAEAGQ  EFS+SSDGGLLFER LC PSDSAVKTELLSEAH
Subjt:  SAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSKNPYLVEKRGLAEAGQAVEFSISSDGGLLFERCLCAPSDSAVKTELLSEAH

Query:  SSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSRCLVCQQVKAPR
        SSPFSMHPGSTKMYQDLK+VYWWRNMKREVAEFVS+CLVCQQVKAPR
Subjt:  SSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSRCLVCQQVKAPR

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein6.8e-7630Show/hide
Query:  EELPGLPPHREVEFAIELESATVPISKAPYRMAPAELKELKVQLQKLLDKGFIRSSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL
        E+LP   P + +EF +EL      +    Y + P +++ +  ++ + L  G IR S +     V+FV KK+G++R+ +DY+ LNK    N YPLP I+ L
Subjt:  EELPGLPPHREVEFAIELESATVPISKAPYRMAPAELKELKVQLQKLLDKGFIRSSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL

Query:  FDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVYREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQT
          ++QG+T+F+K+DL+S YH +R++ GD  K AFR   G +E++VM +G++ APA F   +N +  E   + V+ ++DDILI+SK+E+EH +H++ VLQ 
Subjt:  FDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVYREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQT

Query:  LQDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVYPAKIEVVTNWTRPS---------------------------IVSELTRKGAHFVGSKACEDSFQN
        L++  L    +KCEF   QV F+G+ +S+ G +     I+ V  W +P                             ++ L +K   +  +     + +N
Subjt:  LQDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVYPAKIEVVTNWTRPS---------------------------IVSELTRKGAHFVGSKACEDSFQN

Query:  LKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQG-----RVVAYASRQLKRHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKY
        +KQ LV+ PVL   D S   ++ +DAS   +G VL Q+        V Y S ++ + + NY   D E+ A++ +LK WRHYL    E  +I TDH++L  
Subjt:  LKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQG-----RVVAYASRQLKRHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKY

Query:  FFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSKNPY
          T +    N R  RW   ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++   + +  
Subjt:  FFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSKNPY

Query:  LVEKRGLAEAGQAVEFSISSDGGLLF--ERCLCAPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSRCLVCQ
        L+    L    + VE +I    GLL   +  +  P+D+ +   ++ + H     +HPG   +   + + + W+ +++++ E+V  C  CQ
Subjt:  LVEKRGLAEAGQAVEFSISSDGGLLF--ERCLCAPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSRCLVCQ

P0CT35 Transposon Tf2-2 polyprotein6.8e-7630Show/hide
Query:  EELPGLPPHREVEFAIELESATVPISKAPYRMAPAELKELKVQLQKLLDKGFIRSSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL
        E+LP   P + +EF +EL      +    Y + P +++ +  ++ + L  G IR S +     V+FV KK+G++R+ +DY+ LNK    N YPLP I+ L
Subjt:  EELPGLPPHREVEFAIELESATVPISKAPYRMAPAELKELKVQLQKLLDKGFIRSSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL

Query:  FDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVYREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQT
          ++QG+T+F+K+DL+S YH +R++ GD  K AFR   G +E++VM +G++ APA F   +N +  E   + V+ ++DDILI+SK+E+EH +H++ VLQ 
Subjt:  FDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVYREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQT

Query:  LQDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVYPAKIEVVTNWTRPS---------------------------IVSELTRKGAHFVGSKACEDSFQN
        L++  L    +KCEF   QV F+G+ +S+ G +     I+ V  W +P                             ++ L +K   +  +     + +N
Subjt:  LQDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVYPAKIEVVTNWTRPS---------------------------IVSELTRKGAHFVGSKACEDSFQN

Query:  LKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQG-----RVVAYASRQLKRHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKY
        +KQ LV+ PVL   D S   ++ +DAS   +G VL Q+        V Y S ++ + + NY   D E+ A++ +LK WRHYL    E  +I TDH++L  
Subjt:  LKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQG-----RVVAYASRQLKRHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKY

Query:  FFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSKNPY
          T +    N R  RW   ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++   + +  
Subjt:  FFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSKNPY

Query:  LVEKRGLAEAGQAVEFSISSDGGLLF--ERCLCAPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSRCLVCQ
        L+    L    + VE +I    GLL   +  +  P+D+ +   ++ + H     +HPG   +   + + + W+ +++++ E+V  C  CQ
Subjt:  LVEKRGLAEAGQAVEFSISSDGGLLF--ERCLCAPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSRCLVCQ

P0CT36 Transposon Tf2-3 polyprotein6.8e-7630Show/hide
Query:  EELPGLPPHREVEFAIELESATVPISKAPYRMAPAELKELKVQLQKLLDKGFIRSSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL
        E+LP   P + +EF +EL      +    Y + P +++ +  ++ + L  G IR S +     V+FV KK+G++R+ +DY+ LNK    N YPLP I+ L
Subjt:  EELPGLPPHREVEFAIELESATVPISKAPYRMAPAELKELKVQLQKLLDKGFIRSSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL

Query:  FDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVYREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQT
          ++QG+T+F+K+DL+S YH +R++ GD  K AFR   G +E++VM +G++ APA F   +N +  E   + V+ ++DDILI+SK+E+EH +H++ VLQ 
Subjt:  FDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVYREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQT

Query:  LQDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVYPAKIEVVTNWTRPS---------------------------IVSELTRKGAHFVGSKACEDSFQN
        L++  L    +KCEF   QV F+G+ +S+ G +     I+ V  W +P                             ++ L +K   +  +     + +N
Subjt:  LQDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVYPAKIEVVTNWTRPS---------------------------IVSELTRKGAHFVGSKACEDSFQN

Query:  LKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQG-----RVVAYASRQLKRHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKY
        +KQ LV+ PVL   D S   ++ +DAS   +G VL Q+        V Y S ++ + + NY   D E+ A++ +LK WRHYL    E  +I TDH++L  
Subjt:  LKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQG-----RVVAYASRQLKRHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKY

Query:  FFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSKNPY
          T +    N R  RW   ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++   + +  
Subjt:  FFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSKNPY

Query:  LVEKRGLAEAGQAVEFSISSDGGLLF--ERCLCAPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSRCLVCQ
        L+    L    + VE +I    GLL   +  +  P+D+ +   ++ + H     +HPG   +   + + + W+ +++++ E+V  C  CQ
Subjt:  LVEKRGLAEAGQAVEFSISSDGGLLF--ERCLCAPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSRCLVCQ

P0CT37 Transposon Tf2-4 polyprotein6.8e-7630Show/hide
Query:  EELPGLPPHREVEFAIELESATVPISKAPYRMAPAELKELKVQLQKLLDKGFIRSSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL
        E+LP   P + +EF +EL      +    Y + P +++ +  ++ + L  G IR S +     V+FV KK+G++R+ +DY+ LNK    N YPLP I+ L
Subjt:  EELPGLPPHREVEFAIELESATVPISKAPYRMAPAELKELKVQLQKLLDKGFIRSSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL

Query:  FDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVYREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQT
          ++QG+T+F+K+DL+S YH +R++ GD  K AFR   G +E++VM +G++ APA F   +N +  E   + V+ ++DDILI+SK+E+EH +H++ VLQ 
Subjt:  FDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVYREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQT

Query:  LQDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVYPAKIEVVTNWTRPS---------------------------IVSELTRKGAHFVGSKACEDSFQN
        L++  L    +KCEF   QV F+G+ +S+ G +     I+ V  W +P                             ++ L +K   +  +     + +N
Subjt:  LQDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVYPAKIEVVTNWTRPS---------------------------IVSELTRKGAHFVGSKACEDSFQN

Query:  LKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQG-----RVVAYASRQLKRHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKY
        +KQ LV+ PVL   D S   ++ +DAS   +G VL Q+        V Y S ++ + + NY   D E+ A++ +LK WRHYL    E  +I TDH++L  
Subjt:  LKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQG-----RVVAYASRQLKRHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKY

Query:  FFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSKNPY
          T +    N R  RW   ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++   + +  
Subjt:  FFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSKNPY

Query:  LVEKRGLAEAGQAVEFSISSDGGLLF--ERCLCAPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSRCLVCQ
        L+    L    + VE +I    GLL   +  +  P+D+ +   ++ + H     +HPG   +   + + + W+ +++++ E+V  C  CQ
Subjt:  LVEKRGLAEAGQAVEFSISSDGGLLF--ERCLCAPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSRCLVCQ

P0CT41 Transposon Tf2-12 polyprotein6.8e-7630Show/hide
Query:  EELPGLPPHREVEFAIELESATVPISKAPYRMAPAELKELKVQLQKLLDKGFIRSSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL
        E+LP   P + +EF +EL      +    Y + P +++ +  ++ + L  G IR S +     V+FV KK+G++R+ +DY+ LNK    N YPLP I+ L
Subjt:  EELPGLPPHREVEFAIELESATVPISKAPYRMAPAELKELKVQLQKLLDKGFIRSSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL

Query:  FDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVYREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQT
          ++QG+T+F+K+DL+S YH +R++ GD  K AFR   G +E++VM +G++ APA F   +N +  E   + V+ ++DDILI+SK+E+EH +H++ VLQ 
Subjt:  FDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVYREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQT

Query:  LQDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVYPAKIEVVTNWTRPS---------------------------IVSELTRKGAHFVGSKACEDSFQN
        L++  L    +KCEF   QV F+G+ +S+ G +     I+ V  W +P                             ++ L +K   +  +     + +N
Subjt:  LQDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVYPAKIEVVTNWTRPS---------------------------IVSELTRKGAHFVGSKACEDSFQN

Query:  LKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQG-----RVVAYASRQLKRHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKY
        +KQ LV+ PVL   D S   ++ +DAS   +G VL Q+        V Y S ++ + + NY   D E+ A++ +LK WRHYL    E  +I TDH++L  
Subjt:  LKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQG-----RVVAYASRQLKRHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKY

Query:  FFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSKNPY
          T +    N R  RW   ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++   + +  
Subjt:  FFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSKNPY

Query:  LVEKRGLAEAGQAVEFSISSDGGLLF--ERCLCAPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSRCLVCQ
        L+    L    + VE +I    GLL   +  +  P+D+ +   ++ + H     +HPG   +   + + + W+ +++++ E+V  C  CQ
Subjt:  LVEKRGLAEAGQAVEFSISSDGGLLF--ERCLCAPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSRCLVCQ

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein1.1e-0742.19Show/hide
Query:  HLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLG--HVVSKAGVSVYPAKIEVVTNWTRPSIVSEL
        HL MVLQ  + ++ YA   KC F   Q+++LG  H++S  GVS  PAK+E +  W  P   +EL
Subjt:  HLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLG--HVVSKAGVSVYPAKIEVVTNWTRPSIVSEL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCAGTTTTAAATTTAAGGGAGAAGAGTCAAGGTCGTTGCCTCAGGTAATCTCAGCCATCAAGGCCAGTAAACTGCTCAGTCAGGGTACTTGGAGTATCTTAGCGAG
CGTGGTGGATACTAGAGAGGTCGATGTATCCCTGTCATCAGAACCAGTGGTGAGGGACTATCCGGATGTCTTTCCTGAAGAACTTCCAGGGTTACCTCCTCACAGAGAGG
TTGAGTTTGCCATAGAGTTGGAGTCGGCCACGGTTCCTATATCTAAAGCCCCATACAGAATGGCCCCAGCAGAGTTGAAAGAACTAAAAGTGCAGTTACAGAAATTGCTC
GATAAGGGATTCATTCGATCGAGTGTGTCACCTTGGGGTGCGTCAGTTTTATTTGTTAAGAAGAAGGATGGATCGATGCGACTATGCATTGACTACAGGGAGTTGAATAA
GGTAACCGTTAAGAACAGATATCCCTTGCCCAGGATCGACGATTTGTTTGACCAGTTACAGGGAGCCACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATATCATCAGC
TGAGGATTAAGGATGGGGATGTACCGAAGACAGCATTTCGTTCCAGATATGGACACTATGAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGGCAGTGTTTATG
GACTTGATGAACAGAGTGTATAGGGAGTTCCTAGGCACTTTTGTGATCGTGTTTATTGATGATATCTTGATATATTCCAAGACAGAGGCCGAGCATGAGGAGCATTTACG
TATGGTTTTGCAAACACTTCAGGATAATAAACTGTATGCAAAGTTCTCGAAATGCGAGTTTTGGCTGAAGCAGGTGTCCTTTCTAGGCCATGTGGTTTCTAAGGCCGGAG
TTTCTGTGTATCCAGCTAAGATAGAGGTAGTCACCAATTGGACCCGACCTTCCATAGTCAGTGAGTTGACCAGGAAGGGAGCTCATTTTGTTGGGAGCAAGGCATGTGAA
GACAGTTTCCAGAACCTTAAACAGAAGCTAGTTACCGCACCGGTTCTTACTGTACCTGATGGTTCTGGAAGTTTTGTGATTTATAGTGATGCTTCCAAGAAGGGTTTGGG
TTGTGTTTTGATGCAGCAAGGTAGGGTAGTCGCTTATGCTTCTCGTCAGTTGAAGAGACATGAGCAGAACTACCCTACACATGATCTAGAGTTGGCAGCAGTGGTTTTTG
CTTTGAAAATATGGAGGCATTACTTATATGGTGAAAAGATACAGATCTTCACGGATCATAAGAGCTTGAAATACTTCTTTACTCAGAAGGAATTGAATATGAGACAGCGA
AGATGGCTTGAGTTAGTGAAGGATTACGATTGTGAGATACTGTATCATCCAGGCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGAAAGGTATCACATTCAGCAGCACT
TATTACCCGACAGGCCCCATTGCATCGAGATCTTGAGCGGGCTGAGATTGCAGTGTCAGTGGGGGCAGTCACTATGCAGTTAGCCCAGTTGACGGTACAGCCGACTTTGA
GGCAAAGGATCATTGATGCTCAGAGTAAAAATCCTTATTTGGTTGAGAAGCGGGGCCTAGCAGAGGCAGGGCAGGCGGTTGAGTTCTCCATATCCTCTGATGGTGGACTT
TTGTTTGAGAGATGCCTCTGTGCGCCATCAGATAGCGCGGTTAAGACAGAATTATTATCTGAGGCTCACAGTTCCCCATTTTCCATGCACCCAGGTAGTACGAAGATGTA
TCAGGACCTGAAGCAGGTTTATTGGTGGCGTAACATGAAGAGGGAGGTAGCAGAATTTGTTAGTAGATGCTTGGTGTGTCAGCAGGTTAAGGCACCAAGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCAGTTTTAAATTTAAGGGAGAAGAGTCAAGGTCGTTGCCTCAGGTAATCTCAGCCATCAAGGCCAGTAAACTGCTCAGTCAGGGTACTTGGAGTATCTTAGCGAG
CGTGGTGGATACTAGAGAGGTCGATGTATCCCTGTCATCAGAACCAGTGGTGAGGGACTATCCGGATGTCTTTCCTGAAGAACTTCCAGGGTTACCTCCTCACAGAGAGG
TTGAGTTTGCCATAGAGTTGGAGTCGGCCACGGTTCCTATATCTAAAGCCCCATACAGAATGGCCCCAGCAGAGTTGAAAGAACTAAAAGTGCAGTTACAGAAATTGCTC
GATAAGGGATTCATTCGATCGAGTGTGTCACCTTGGGGTGCGTCAGTTTTATTTGTTAAGAAGAAGGATGGATCGATGCGACTATGCATTGACTACAGGGAGTTGAATAA
GGTAACCGTTAAGAACAGATATCCCTTGCCCAGGATCGACGATTTGTTTGACCAGTTACAGGGAGCCACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATATCATCAGC
TGAGGATTAAGGATGGGGATGTACCGAAGACAGCATTTCGTTCCAGATATGGACACTATGAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGGCAGTGTTTATG
GACTTGATGAACAGAGTGTATAGGGAGTTCCTAGGCACTTTTGTGATCGTGTTTATTGATGATATCTTGATATATTCCAAGACAGAGGCCGAGCATGAGGAGCATTTACG
TATGGTTTTGCAAACACTTCAGGATAATAAACTGTATGCAAAGTTCTCGAAATGCGAGTTTTGGCTGAAGCAGGTGTCCTTTCTAGGCCATGTGGTTTCTAAGGCCGGAG
TTTCTGTGTATCCAGCTAAGATAGAGGTAGTCACCAATTGGACCCGACCTTCCATAGTCAGTGAGTTGACCAGGAAGGGAGCTCATTTTGTTGGGAGCAAGGCATGTGAA
GACAGTTTCCAGAACCTTAAACAGAAGCTAGTTACCGCACCGGTTCTTACTGTACCTGATGGTTCTGGAAGTTTTGTGATTTATAGTGATGCTTCCAAGAAGGGTTTGGG
TTGTGTTTTGATGCAGCAAGGTAGGGTAGTCGCTTATGCTTCTCGTCAGTTGAAGAGACATGAGCAGAACTACCCTACACATGATCTAGAGTTGGCAGCAGTGGTTTTTG
CTTTGAAAATATGGAGGCATTACTTATATGGTGAAAAGATACAGATCTTCACGGATCATAAGAGCTTGAAATACTTCTTTACTCAGAAGGAATTGAATATGAGACAGCGA
AGATGGCTTGAGTTAGTGAAGGATTACGATTGTGAGATACTGTATCATCCAGGCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGAAAGGTATCACATTCAGCAGCACT
TATTACCCGACAGGCCCCATTGCATCGAGATCTTGAGCGGGCTGAGATTGCAGTGTCAGTGGGGGCAGTCACTATGCAGTTAGCCCAGTTGACGGTACAGCCGACTTTGA
GGCAAAGGATCATTGATGCTCAGAGTAAAAATCCTTATTTGGTTGAGAAGCGGGGCCTAGCAGAGGCAGGGCAGGCGGTTGAGTTCTCCATATCCTCTGATGGTGGACTT
TTGTTTGAGAGATGCCTCTGTGCGCCATCAGATAGCGCGGTTAAGACAGAATTATTATCTGAGGCTCACAGTTCCCCATTTTCCATGCACCCAGGTAGTACGAAGATGTA
TCAGGACCTGAAGCAGGTTTATTGGTGGCGTAACATGAAGAGGGAGGTAGCAGAATTTGTTAGTAGATGCTTGGTGTGTCAGCAGGTTAAGGCACCAAGGTAG
Protein sequenceShow/hide protein sequence
MASFKFKGEESRSLPQVISAIKASKLLSQGTWSILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELESATVPISKAPYRMAPAELKELKVQLQKLL
DKGFIRSSVSPWGASVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFM
DLMNRVYREFLGTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLQDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVYPAKIEVVTNWTRPSIVSELTRKGAHFVGSKACE
DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGRVVAYASRQLKRHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
RWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSKNPYLVEKRGLAEAGQAVEFSISSDGGL
LFERCLCAPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSRCLVCQQVKAPR