; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0101471 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0101471
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr04:17828707..17830500
RNA-Seq ExpressionCmc04g0101471
SyntenyCmc04g0101471
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026271.1 pol protein [Cucumis melo var. makuwa]0.0e+0096.82Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
        MAPAELKELKVQLQ+LLDKGFIR SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLP+IDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVR
        TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVI+FIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSK  
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVR

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYQRFGENFSRIATPLTQLTRKGAPFVWSKACEDSFQDLKQKLVTAPVLTVPDGYGSFVIYSDASKKGL
        VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYY+RF ENFSRIATPLTQLTRKGAPFVWSKACEDSFQ+LKQKLVTAPVLTVPDG GSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYQRFGENFSRIATPLTQLTRKGAPFVWSKACEDSFQDLKQKLVTAPVLTVPDGYGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
        GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQGNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT
        LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQGNDPYLVEKR LAEAGQAVEFSLSSDGGLLFER LCVPSDSA KT
Subjt:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQGNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT

Query:  ELLSEPHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAKFVSRCLVCQQVKAPRQKPADLLQPLSIPEWKLENVSMDFITGLPRTLRGFTVIWVVVD
        ELLSE HSSPFSMHPGSTKMYQDLKRVYWWRNMKREVA+FVS+CLVCQQVKAPRQKP  LLQPLSIPEWK ENVSMDFITGLPRTLRGFTVIWVVVD
Subjt:  ELLSEPHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAKFVSRCLVCQQVKAPRQKPADLLQPLSIPEWKLENVSMDFITGLPRTLRGFTVIWVVVD

KAA0046185.1 pol protein [Cucumis melo var. makuwa]0.0e+0096.98Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKD SMRLCIDYRELNKVTVKNRYPLP+IDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVR
        TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVI+FIDDILIYSKTEAEHEEHLR+VLQTLRDNKLYAKFSKCEFWLKQVSFLGH+VSK  
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVR

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYQRFGENFSRIATPLTQLTRKGAPFVWSKACEDSFQDLKQKLVTAPVLTVPDGYGSFVIYSDASKKGL
        VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYY+RF ENFSRIATPLTQLTRKGAPFVWSKACEDSFQ+LKQKLVTAPVLTV DG GSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYQRFGENFSRIATPLTQLTRKGAPFVWSKACEDSFQDLKQKLVTAPVLTVPDGYGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
        GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVV DA
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQGNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT
        LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQ NDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT
Subjt:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQGNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT

Query:  ELLSEPHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAKFVSRCLVCQQVKAPRQKPADLLQPLSIPEWKLENVSMDFITGLPRTLRGFTVIWVVVD
        ELLSE HSSPFSMHPGSTKMYQDLKRVYWWRNMKREVA+FVSRCLVCQQVKAPRQKPA LLQPLSIPEWK ENVSMDFITGLPRTLRGFTVIWVVVD
Subjt:  ELLSEPHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAKFVSRCLVCQQVKAPRQKPADLLQPLSIPEWKLENVSMDFITGLPRTLRGFTVIWVVVD

KAA0048687.1 pol protein [Cucumis melo var. makuwa]0.0e+0096.98Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLP+IDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVR
        TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVI+FIDDILIYSKTEAEHEEHLR+VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSK  
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVR

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYQRFGENFSRIATPLTQLTRKGAPFVWSKACEDSFQDLKQKLVTAPVLTVPDGYGSFVIYSDASKKGL
        VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYY+RF ENFSRIATPLTQLTRKGAPFVWSKACEDSFQ+LKQKLVTAPVLTVPDG GSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYQRFGENFSRIATPLTQLTRKGAPFVWSKACEDSFQDLKQKLVTAPVLTVPDGYGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
        GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQGNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT
        LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQ NDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDS VKT
Subjt:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQGNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT

Query:  ELLSEPHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAKFVSRCLVCQQVKAPRQKPADLLQPLSIPEWKLENVSMDFITGLPRTLRGFTVIWVVVD
        ELLSE HSSPFSMHPGSTKMY+D+KRVYWWRNMKREVA+FVSRCLVCQQVKAPRQKPA LLQPLSIPEWK ENVSMDFITGLPRTLRGFTVIWVVVD
Subjt:  ELLSEPHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAKFVSRCLVCQQVKAPRQKPADLLQPLSIPEWKLENVSMDFITGLPRTLRGFTVIWVVVD

KAA0058464.1 pol protein [Cucumis melo var. makuwa]0.0e+00100Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVR
        TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVR
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVR

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYQRFGENFSRIATPLTQLTRKGAPFVWSKACEDSFQDLKQKLVTAPVLTVPDGYGSFVIYSDASKKGL
        VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYQRFGENFSRIATPLTQLTRKGAPFVWSKACEDSFQDLKQKLVTAPVLTVPDGYGSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYQRFGENFSRIATPLTQLTRKGAPFVWSKACEDSFQDLKQKLVTAPVLTVPDGYGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
        GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQGNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT
        LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQGNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT
Subjt:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQGNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT

Query:  ELLSEPHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAKFVSRCLVCQQVKAPRQKPADLLQPLSIPEWKLENVSMDFITGLPRTLRGFTVIWVVVD
        ELLSEPHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAKFVSRCLVCQQVKAPRQKPADLLQPLSIPEWKLENVSMDFITGLPRTLRGFTVIWVVVD
Subjt:  ELLSEPHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAKFVSRCLVCQQVKAPRQKPADLLQPLSIPEWKLENVSMDFITGLPRTLRGFTVIWVVVD

TYK01613.1 pol protein [Cucumis melo var. makuwa]0.0e+0096.98Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLP+IDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVR
        TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVI+FIDDILIYSKTEAEHEEHLR+VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSK  
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVR

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYQRFGENFSRIATPLTQLTRKGAPFVWSKACEDSFQDLKQKLVTAPVLTVPDGYGSFVIYSDASKKGL
        VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYY+RF ENFSRIATPLTQLTRKGAPFVWSKACEDSFQ LKQKLVTAPVLTVPDG GSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYQRFGENFSRIATPLTQLTRKGAPFVWSKACEDSFQDLKQKLVTAPVLTVPDGYGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
        GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQGNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT
        LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQ NDPYLVEKRGLAEAGQ  EFSLSSDGGLLFERRLCVPSDSAVKT
Subjt:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQGNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT

Query:  ELLSEPHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAKFVSRCLVCQQVKAPRQKPADLLQPLSIPEWKLENVSMDFITGLPRTLRGFTVIWVVVD
        ELLSE HSSPFSMHPGSTKMYQDLKRVYWWRNMKREVA+FVS+CLVCQQVKAPRQKPA LLQPLSIPEWK ENVSMDFITGLPRTLRGFTVIWVVVD
Subjt:  ELLSEPHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAKFVSRCLVCQQVKAPRQKPADLLQPLSIPEWKLENVSMDFITGLPRTLRGFTVIWVVVD

TrEMBL top hitse value%identityAlignment
A0A5A7SPZ2 Pol protein0.0e+0096.82Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
        MAPAELKELKVQLQ+LLDKGFIR SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLP+IDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVR
        TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVI+FIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSK  
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVR

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYQRFGENFSRIATPLTQLTRKGAPFVWSKACEDSFQDLKQKLVTAPVLTVPDGYGSFVIYSDASKKGL
        VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYY+RF ENFSRIATPLTQLTRKGAPFVWSKACEDSFQ+LKQKLVTAPVLTVPDG GSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYQRFGENFSRIATPLTQLTRKGAPFVWSKACEDSFQDLKQKLVTAPVLTVPDGYGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
        GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQGNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT
        LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQGNDPYLVEKR LAEAGQAVEFSLSSDGGLLFER LCVPSDSA KT
Subjt:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQGNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT

Query:  ELLSEPHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAKFVSRCLVCQQVKAPRQKPADLLQPLSIPEWKLENVSMDFITGLPRTLRGFTVIWVVVD
        ELLSE HSSPFSMHPGSTKMYQDLKRVYWWRNMKREVA+FVS+CLVCQQVKAPRQKP  LLQPLSIPEWK ENVSMDFITGLPRTLRGFTVIWVVVD
Subjt:  ELLSEPHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAKFVSRCLVCQQVKAPRQKPADLLQPLSIPEWKLENVSMDFITGLPRTLRGFTVIWVVVD

A0A5A7TXM6 Reverse transcriptase0.0e+0096.98Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKD SMRLCIDYRELNKVTVKNRYPLP+IDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVR
        TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVI+FIDDILIYSKTEAEHEEHLR+VLQTLRDNKLYAKFSKCEFWLKQVSFLGH+VSK  
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVR

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYQRFGENFSRIATPLTQLTRKGAPFVWSKACEDSFQDLKQKLVTAPVLTVPDGYGSFVIYSDASKKGL
        VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYY+RF ENFSRIATPLTQLTRKGAPFVWSKACEDSFQ+LKQKLVTAPVLTV DG GSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYQRFGENFSRIATPLTQLTRKGAPFVWSKACEDSFQDLKQKLVTAPVLTVPDGYGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
        GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVV DA
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQGNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT
        LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQ NDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT
Subjt:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQGNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT

Query:  ELLSEPHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAKFVSRCLVCQQVKAPRQKPADLLQPLSIPEWKLENVSMDFITGLPRTLRGFTVIWVVVD
        ELLSE HSSPFSMHPGSTKMYQDLKRVYWWRNMKREVA+FVSRCLVCQQVKAPRQKPA LLQPLSIPEWK ENVSMDFITGLPRTLRGFTVIWVVVD
Subjt:  ELLSEPHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAKFVSRCLVCQQVKAPRQKPADLLQPLSIPEWKLENVSMDFITGLPRTLRGFTVIWVVVD

A0A5A7U330 Reverse transcriptase0.0e+0096.98Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLP+IDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVR
        TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVI+FIDDILIYSKTEAEHEEHLR+VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSK  
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVR

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYQRFGENFSRIATPLTQLTRKGAPFVWSKACEDSFQDLKQKLVTAPVLTVPDGYGSFVIYSDASKKGL
        VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYY+RF ENFSRIATPLTQLTRKGAPFVWSKACEDSFQ+LKQKLVTAPVLTVPDG GSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYQRFGENFSRIATPLTQLTRKGAPFVWSKACEDSFQDLKQKLVTAPVLTVPDGYGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
        GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQGNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT
        LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQ NDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDS VKT
Subjt:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQGNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT

Query:  ELLSEPHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAKFVSRCLVCQQVKAPRQKPADLLQPLSIPEWKLENVSMDFITGLPRTLRGFTVIWVVVD
        ELLSE HSSPFSMHPGSTKMY+D+KRVYWWRNMKREVA+FVSRCLVCQQVKAPRQKPA LLQPLSIPEWK ENVSMDFITGLPRTLRGFTVIWVVVD
Subjt:  ELLSEPHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAKFVSRCLVCQQVKAPRQKPADLLQPLSIPEWKLENVSMDFITGLPRTLRGFTVIWVVVD

A0A5A7UTH9 Reverse transcriptase0.0e+00100Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVR
        TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVR
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVR

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYQRFGENFSRIATPLTQLTRKGAPFVWSKACEDSFQDLKQKLVTAPVLTVPDGYGSFVIYSDASKKGL
        VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYQRFGENFSRIATPLTQLTRKGAPFVWSKACEDSFQDLKQKLVTAPVLTVPDGYGSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYQRFGENFSRIATPLTQLTRKGAPFVWSKACEDSFQDLKQKLVTAPVLTVPDGYGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
        GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQGNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT
        LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQGNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT
Subjt:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQGNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT

Query:  ELLSEPHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAKFVSRCLVCQQVKAPRQKPADLLQPLSIPEWKLENVSMDFITGLPRTLRGFTVIWVVVD
        ELLSEPHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAKFVSRCLVCQQVKAPRQKPADLLQPLSIPEWKLENVSMDFITGLPRTLRGFTVIWVVVD
Subjt:  ELLSEPHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAKFVSRCLVCQQVKAPRQKPADLLQPLSIPEWKLENVSMDFITGLPRTLRGFTVIWVVVD

A0A5D3BPI1 Reverse transcriptase0.0e+0096.98Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLP+IDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVR
        TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVI+FIDDILIYSKTEAEHEEHLR+VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSK  
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVR

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYQRFGENFSRIATPLTQLTRKGAPFVWSKACEDSFQDLKQKLVTAPVLTVPDGYGSFVIYSDASKKGL
        VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYY+RF ENFSRIATPLTQLTRKGAPFVWSKACEDSFQ LKQKLVTAPVLTVPDG GSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYQRFGENFSRIATPLTQLTRKGAPFVWSKACEDSFQDLKQKLVTAPVLTVPDGYGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
        GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQGNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT
        LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQ NDPYLVEKRGLAEAGQ  EFSLSSDGGLLFERRLCVPSDSAVKT
Subjt:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQGNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKT

Query:  ELLSEPHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAKFVSRCLVCQQVKAPRQKPADLLQPLSIPEWKLENVSMDFITGLPRTLRGFTVIWVVVD
        ELLSE HSSPFSMHPGSTKMYQDLKRVYWWRNMKREVA+FVS+CLVCQQVKAPRQKPA LLQPLSIPEWK ENVSMDFITGLPRTLRGFTVIWVVVD
Subjt:  ELLSEPHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAKFVSRCLVCQQVKAPRQKPADLLQPLSIPEWKLENVSMDFITGLPRTLRGFTVIWVVVD

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein2.2e-9532.73Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
        + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+T+F+K+DL+S YH +R++ GD  K
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVR
         AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ L++  L    +KCEF   QV F+G+ +S+  
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVR

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYQRFGENFSRIATPLTQLTRKGAPFVWSKACEDSFQDLKQKLVTAPVLTVPDGYGSFVIYSDASKKGL
         +     I+ V  W +P    E+R FLG   Y ++F    S++  PL  L +K   + W+     + +++KQ LV+ PVL   D     ++ +DAS   +
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYQRFGENFSRIATPLTQLTRKGAPFVWSKACEDSFQDLKQKLVTAPVLTVPDGYGSFVIYSDASKKGL

Query:  GCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHP
        G VL Q+        V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y P
Subjt:  GCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHP

Query:  GKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQGNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--
        G AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++    ND  L+    L    + VE ++    GLL   +  
Subjt:  GKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQGNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--

Query:  LCVPSDSAVKTELLSEPHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAKFVSRCLVCQQVKAPRQKPADLLQPLSIPEWKLENVSMDFITGLPRTLRGF
        + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ ++V  C  CQ  K+   KP   LQP+   E   E++SMDFIT LP +  G+
Subjt:  LCVPSDSAVKTELLSEPHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAKFVSRCLVCQQVKAPRQKPADLLQPLSIPEWKLENVSMDFITGLPRTLRGF

Query:  TVIWVVVD
          ++VVVD
Subjt:  TVIWVVVD

P0CT35 Transposon Tf2-2 polyprotein2.2e-9532.73Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
        + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+T+F+K+DL+S YH +R++ GD  K
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVR
         AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ L++  L    +KCEF   QV F+G+ +S+  
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVR

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYQRFGENFSRIATPLTQLTRKGAPFVWSKACEDSFQDLKQKLVTAPVLTVPDGYGSFVIYSDASKKGL
         +     I+ V  W +P    E+R FLG   Y ++F    S++  PL  L +K   + W+     + +++KQ LV+ PVL   D     ++ +DAS   +
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYQRFGENFSRIATPLTQLTRKGAPFVWSKACEDSFQDLKQKLVTAPVLTVPDGYGSFVIYSDASKKGL

Query:  GCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHP
        G VL Q+        V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y P
Subjt:  GCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHP

Query:  GKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQGNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--
        G AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++    ND  L+    L    + VE ++    GLL   +  
Subjt:  GKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQGNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--

Query:  LCVPSDSAVKTELLSEPHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAKFVSRCLVCQQVKAPRQKPADLLQPLSIPEWKLENVSMDFITGLPRTLRGF
        + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ ++V  C  CQ  K+   KP   LQP+   E   E++SMDFIT LP +  G+
Subjt:  LCVPSDSAVKTELLSEPHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAKFVSRCLVCQQVKAPRQKPADLLQPLSIPEWKLENVSMDFITGLPRTLRGF

Query:  TVIWVVVD
          ++VVVD
Subjt:  TVIWVVVD

P0CT36 Transposon Tf2-3 polyprotein2.2e-9532.73Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
        + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+T+F+K+DL+S YH +R++ GD  K
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVR
         AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ L++  L    +KCEF   QV F+G+ +S+  
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVR

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYQRFGENFSRIATPLTQLTRKGAPFVWSKACEDSFQDLKQKLVTAPVLTVPDGYGSFVIYSDASKKGL
         +     I+ V  W +P    E+R FLG   Y ++F    S++  PL  L +K   + W+     + +++KQ LV+ PVL   D     ++ +DAS   +
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYQRFGENFSRIATPLTQLTRKGAPFVWSKACEDSFQDLKQKLVTAPVLTVPDGYGSFVIYSDASKKGL

Query:  GCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHP
        G VL Q+        V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y P
Subjt:  GCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHP

Query:  GKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQGNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--
        G AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++    ND  L+    L    + VE ++    GLL   +  
Subjt:  GKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQGNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--

Query:  LCVPSDSAVKTELLSEPHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAKFVSRCLVCQQVKAPRQKPADLLQPLSIPEWKLENVSMDFITGLPRTLRGF
        + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ ++V  C  CQ  K+   KP   LQP+   E   E++SMDFIT LP +  G+
Subjt:  LCVPSDSAVKTELLSEPHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAKFVSRCLVCQQVKAPRQKPADLLQPLSIPEWKLENVSMDFITGLPRTLRGF

Query:  TVIWVVVD
          ++VVVD
Subjt:  TVIWVVVD

P0CT37 Transposon Tf2-4 polyprotein2.2e-9532.73Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
        + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+T+F+K+DL+S YH +R++ GD  K
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVR
         AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ L++  L    +KCEF   QV F+G+ +S+  
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVR

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYQRFGENFSRIATPLTQLTRKGAPFVWSKACEDSFQDLKQKLVTAPVLTVPDGYGSFVIYSDASKKGL
         +     I+ V  W +P    E+R FLG   Y ++F    S++  PL  L +K   + W+     + +++KQ LV+ PVL   D     ++ +DAS   +
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYQRFGENFSRIATPLTQLTRKGAPFVWSKACEDSFQDLKQKLVTAPVLTVPDGYGSFVIYSDASKKGL

Query:  GCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHP
        G VL Q+        V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y P
Subjt:  GCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHP

Query:  GKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQGNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--
        G AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++    ND  L+    L    + VE ++    GLL   +  
Subjt:  GKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQGNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--

Query:  LCVPSDSAVKTELLSEPHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAKFVSRCLVCQQVKAPRQKPADLLQPLSIPEWKLENVSMDFITGLPRTLRGF
        + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ ++V  C  CQ  K+   KP   LQP+   E   E++SMDFIT LP +  G+
Subjt:  LCVPSDSAVKTELLSEPHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAKFVSRCLVCQQVKAPRQKPADLLQPLSIPEWKLENVSMDFITGLPRTLRGF

Query:  TVIWVVVD
          ++VVVD
Subjt:  TVIWVVVD

P0CT41 Transposon Tf2-12 polyprotein2.2e-9532.73Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK
        + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+T+F+K+DL+S YH +R++ GD  K
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVR
         AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ L++  L    +KCEF   QV F+G+ +S+  
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVR

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYQRFGENFSRIATPLTQLTRKGAPFVWSKACEDSFQDLKQKLVTAPVLTVPDGYGSFVIYSDASKKGL
         +     I+ V  W +P    E+R FLG   Y ++F    S++  PL  L +K   + W+     + +++KQ LV+ PVL   D     ++ +DAS   +
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYQRFGENFSRIATPLTQLTRKGAPFVWSKACEDSFQDLKQKLVTAPVLTVPDGYGSFVIYSDASKKGL

Query:  GCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHP
        G VL Q+        V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y P
Subjt:  GCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHP

Query:  GKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQGNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--
        G AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++    ND  L+    L    + VE ++    GLL   +  
Subjt:  GKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQGNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--

Query:  LCVPSDSAVKTELLSEPHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAKFVSRCLVCQQVKAPRQKPADLLQPLSIPEWKLENVSMDFITGLPRTLRGF
        + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ ++V  C  CQ  K+   KP   LQP+   E   E++SMDFIT LP +  G+
Subjt:  LCVPSDSAVKTELLSEPHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAKFVSRCLVCQQVKAPRQKPADLLQPLSIPEWKLENVSMDFITGLPRTLRGF

Query:  TVIWVVVD
          ++VVVD
Subjt:  TVIWVVVD

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein7.1e-2543.2Show/hide
Query:  HLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLG--HVVSKVRVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYQRFGENFSRIATPLTQLTRKGAPFVW
        HL +VLQ    ++ YA   KC F   Q+++LG  H++S   VS DPAK+EA+ GW  P   +E+R FLGL GYY+RF +N+ +I  PLT+L +K +   W
Subjt:  HLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLG--HVVSKVRVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYQRFGENFSRIATPLTQLTRKGAPFVW

Query:  SKACEDSFQDLKQKLVTAPVLTVPD
        ++    +F+ LK  + T PVL +PD
Subjt:  SKACEDSFQDLKQKLVTAPVLTVPD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCCCCGCAGAGCTGAAAGAATTAAAGGTGCAGTTACAAGAATTGCTTGATAAGGGATTCATTCGACCGAGCGTGTCCCCTTGGGGTGCGCCAGTTTTATTTGTTAA
GAAGAAGGATGGATCGATGCGTCTATGCATTGACTATAGGGAGTTGAATAAGGTAACCGTTAAGAACAGATATCCCTTGCCCAAGATCGACGATCTATTTGACCAGTTGC
AGGGAGCCACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATACCATCAGCTGAGGATTAAGGATGGTGATGTACCAAAGACAGCATTTCGTTCCAGGTATGGACACTAC
GAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGGCAGTGTTTATGGACTTGATGAACAGAGTGTTTAGGGAGTTCCTAGATACTTTTGTGATCATGTTTATCGA
CGATATCTTGATATACTCCAAGACGGAGGCCGAACATGAGGAGCATTTACGTATAGTTTTGCAAACACTTCGGGATAATAAGTTGTATGCAAAGTTCTCGAAATGCGAGT
TTTGGCTGAAGCAGGTGTCCTTTCTGGGCCACGTGGTTTCTAAGGTTAGAGTCTCTGTGGATCCAGCTAAGATAGAGGCAGTCACCGGTTGGACCCGACCTTCCACAGTC
AGTGAGGTTCGTAGCTTTCTGGGTTTAGCAGGTTATTATCAACGGTTTGGGGAGAACTTTTCTCGTATAGCTACTCCTCTTACTCAGTTGACCAGGAAGGGAGCTCCTTT
TGTTTGGAGCAAGGCATGTGAGGACAGTTTCCAGGACCTTAAACAGAAGCTAGTTACCGCACCAGTTCTTACTGTACCTGATGGTTATGGCAGTTTTGTGATTTATAGTG
ATGCTTCCAAGAAGGGTTTGGGTTGTGTTTTGATGCAACAGGGTAAGGTGGTCGCTTATGCGTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACACATGATTTA
GAGTTGGCAGCGGTGGTTTTTGCTTTGAAAATATGGAGGCATTACTTATATGGTGAAAAGATACAGATATTCACGGATCATAAGAGCTTGAAATACTTCTTTACTCAGAA
AGAATTGAATATGAGACAGCGAAGATGGCTTGAGTTAGTGAAGGATTACGATTGTGAGATACTGTATCATCCAGGCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGAA
AGGTATCACATTCAGCAGCACTTATTACCCGACAGGCCCCATTGCATCGGGATCTCGAGCGGGCTGAGATTGCAGTGTCAGTGGGGGCAGTCACTATGCAGTTAGCCCAG
CTGACGGTACAGCCGACTTTGAGGCAAAGGATCATTGATGCTCAGGGTAACGATCCTTACTTGGTTGAGAAACGTGGCCTAGCAGAGGCAGGGCAAGCGGTTGAGTTCTC
ATTATCCTCTGATGGTGGACTTTTGTTTGAGAGACGCCTCTGTGTGCCGTCAGATAGCGCGGTTAAGACAGAATTATTATCTGAGCCTCACAGTTCCCCATTTTCCATGC
ACCCAGGTAGTACGAAGATGTATCAGGACCTGAAGCGGGTTTATTGGTGGCGTAACATGAAGAGGGAAGTAGCAAAATTTGTTAGTAGATGCTTGGTGTGTCAGCAGGTT
AAGGCACCAAGGCAGAAACCAGCGGATTTATTACAACCCTTGAGCATACCGGAATGGAAGTTGGAAAACGTGTCCATGGATTTCATTACAGGACTGCCGAGAACTCTGAG
GGGTTTTACAGTGATTTGGGTTGTGGTGGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCCCCGCAGAGCTGAAAGAATTAAAGGTGCAGTTACAAGAATTGCTTGATAAGGGATTCATTCGACCGAGCGTGTCCCCTTGGGGTGCGCCAGTTTTATTTGTTAA
GAAGAAGGATGGATCGATGCGTCTATGCATTGACTATAGGGAGTTGAATAAGGTAACCGTTAAGAACAGATATCCCTTGCCCAAGATCGACGATCTATTTGACCAGTTGC
AGGGAGCCACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATACCATCAGCTGAGGATTAAGGATGGTGATGTACCAAAGACAGCATTTCGTTCCAGGTATGGACACTAC
GAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGGCAGTGTTTATGGACTTGATGAACAGAGTGTTTAGGGAGTTCCTAGATACTTTTGTGATCATGTTTATCGA
CGATATCTTGATATACTCCAAGACGGAGGCCGAACATGAGGAGCATTTACGTATAGTTTTGCAAACACTTCGGGATAATAAGTTGTATGCAAAGTTCTCGAAATGCGAGT
TTTGGCTGAAGCAGGTGTCCTTTCTGGGCCACGTGGTTTCTAAGGTTAGAGTCTCTGTGGATCCAGCTAAGATAGAGGCAGTCACCGGTTGGACCCGACCTTCCACAGTC
AGTGAGGTTCGTAGCTTTCTGGGTTTAGCAGGTTATTATCAACGGTTTGGGGAGAACTTTTCTCGTATAGCTACTCCTCTTACTCAGTTGACCAGGAAGGGAGCTCCTTT
TGTTTGGAGCAAGGCATGTGAGGACAGTTTCCAGGACCTTAAACAGAAGCTAGTTACCGCACCAGTTCTTACTGTACCTGATGGTTATGGCAGTTTTGTGATTTATAGTG
ATGCTTCCAAGAAGGGTTTGGGTTGTGTTTTGATGCAACAGGGTAAGGTGGTCGCTTATGCGTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACACATGATTTA
GAGTTGGCAGCGGTGGTTTTTGCTTTGAAAATATGGAGGCATTACTTATATGGTGAAAAGATACAGATATTCACGGATCATAAGAGCTTGAAATACTTCTTTACTCAGAA
AGAATTGAATATGAGACAGCGAAGATGGCTTGAGTTAGTGAAGGATTACGATTGTGAGATACTGTATCATCCAGGCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGAA
AGGTATCACATTCAGCAGCACTTATTACCCGACAGGCCCCATTGCATCGGGATCTCGAGCGGGCTGAGATTGCAGTGTCAGTGGGGGCAGTCACTATGCAGTTAGCCCAG
CTGACGGTACAGCCGACTTTGAGGCAAAGGATCATTGATGCTCAGGGTAACGATCCTTACTTGGTTGAGAAACGTGGCCTAGCAGAGGCAGGGCAAGCGGTTGAGTTCTC
ATTATCCTCTGATGGTGGACTTTTGTTTGAGAGACGCCTCTGTGTGCCGTCAGATAGCGCGGTTAAGACAGAATTATTATCTGAGCCTCACAGTTCCCCATTTTCCATGC
ACCCAGGTAGTACGAAGATGTATCAGGACCTGAAGCGGGTTTATTGGTGGCGTAACATGAAGAGGGAAGTAGCAAAATTTGTTAGTAGATGCTTGGTGTGTCAGCAGGTT
AAGGCACCAAGGCAGAAACCAGCGGATTTATTACAACCCTTGAGCATACCGGAATGGAAGTTGGAAAACGTGTCCATGGATTTCATTACAGGACTGCCGAGAACTCTGAG
GGGTTTTACAGTGATTTGGGTTGTGGTGGACTGA
Protein sequenceShow/hide protein sequence
MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHY
EFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIMFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVRVSVDPAKIEAVTGWTRPSTV
SEVRSFLGLAGYYQRFGENFSRIATPLTQLTRKGAPFVWSKACEDSFQDLKQKLVTAPVLTVPDGYGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDL
ELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQ
LTVQPTLRQRIIDAQGNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEPHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAKFVSRCLVCQQV
KAPRQKPADLLQPLSIPEWKLENVSMDFITGLPRTLRGFTVIWVVVD