; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc08g0222201 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc08g0222201
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr08:10661861..10663927
RNA-Seq ExpressionCmc08g0222201
SyntenyCmc08g0222201
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0045479.1 pol protein [Cucumis melo var. makuwa]0.0e+0095.8Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDHLQGATVFSKIDLRSGYHQLRIKDEDIPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCI+YRELNKVTVKNRYPLPRIDDLFD LQGATVFS+IDLRSGYHQLRIKDED+PK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDHLQGATVFSKIDLRSGYHQLRIKDEDIPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNAPA FMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHE+HLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGVPFVWSKACEDSFLPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSG
        VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKG              PFVWSKACEDSFQ LKQKLVTAPVLTVPDGSG
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGVPFVWSKACEDSFLPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSG

Query:  SFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI
        SFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI
Subjt:  SFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI

Query:  LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEWAEIAVSVGAVTMQLAQLTVQPTLRQRIINAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFE
        LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLE  EIAVSVGAVTMQLAQLTVQPTLRQRII+AQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFE
Subjt:  LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEWAEIAVSVGAVTMQLAQLTVQPTLRQRIINAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFE

Query:  RRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLR
        RRLCVPSDSA+KTELLSEAH SPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLR
Subjt:  RRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLR

Query:  GFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG
        GFTVIWVVVDRLTKSAHFVPGKSTYTAS+WAQLYMSEIVRLHG
Subjt:  GFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG

KAA0048687.1 pol protein [Cucumis melo var. makuwa]0.0e+0096.42Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDHLQGATVFSKIDLRSGYHQLRIKDEDIPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFD LQGATVFSKIDLRSGYHQLRIKDED+PK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDHLQGATVFSKIDLRSGYHQLRIKDEDIPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGVPFVWSKACEDSFLPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSG
        VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKG              PFVWSKACEDSFQ LKQKLVTAPVLTVPDGSG
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGVPFVWSKACEDSFLPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSG

Query:  SFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI
        SFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI
Subjt:  SFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI

Query:  LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEWAEIAVSVGAVTMQLAQLTVQPTLRQRIINAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFE
        LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLE AEIAVSVGAVTMQLAQLTVQPTLRQRII+AQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFE
Subjt:  LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEWAEIAVSVGAVTMQLAQLTVQPTLRQRIINAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFE

Query:  RRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLR
        RRLCVPSDS +KTELLSEAHSSPFSMHPGSTKMY+D+KRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLR
Subjt:  RRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLR

Query:  GFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG
        GFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG
Subjt:  GFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG

KAA0058399.1 pol protein [Cucumis melo var. makuwa]0.0e+0095.65Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDHLQGATVFSKIDLRSGYHQLRIKDEDIPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFD LQGATVFSKIDLRSGYHQLRIKDED+PK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDHLQGATVFSKIDLRSGYHQLRIKDEDIPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLR+NKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGVPFVWSKACEDSFLPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSG
        VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKG              PFVWSKACEDSFQTLKQKLVTAPVLTVPDGSG
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGVPFVWSKACEDSFLPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSG

Query:  SFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI
        SFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHE+NYPTHDLELAAVVFALKIWRHYLYGE IQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI
Subjt:  SFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI

Query:  LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEWAEIAVSVGAVTMQLAQLTVQPTLRQRIINAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFE
        LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLE AEIAVSVGAVTMQLAQLTVQPTLRQRII+AQSNDPYLVEKRGL EAGQA EFSLSSDGGLLFE
Subjt:  LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEWAEIAVSVGAVTMQLAQLTVQPTLRQRIINAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFE

Query:  RRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLR
        RRLCVPSDSA+KT+LL+EAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLR
Subjt:  RRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLR

Query:  GFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG
        GFTV+WVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG
Subjt:  GFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG

KAA0062112.1 pol protein [Cucumis melo var. makuwa]0.0e+0096.11Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDHLQGATVFSKIDLRSGYHQLRIKDEDIPK
        MAPAELKELKVQLQELLDKGFIRPSVSP GAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFD LQGATVFSKIDLRSGYHQLRIKDED+PK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDHLQGATVFSKIDLRSGYHQLRIKDEDIPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHE HLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSK G
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGVPFVWSKACEDSFLPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSG
        VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKG              PFVWSKACEDSFQ LKQKLVTAPVLTVPDGSG
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGVPFVWSKACEDSFLPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSG

Query:  SFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI
        SFVIYSD SKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI
Subjt:  SFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI

Query:  LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEWAEIAVSVGAVTMQLAQLTVQPTLRQRIINAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFE
        LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLE AEIAVSVGAVTMQLAQLTVQPTLRQRII+AQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFE
Subjt:  LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEWAEIAVSVGAVTMQLAQLTVQPTLRQRIINAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFE

Query:  RRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLR
        RRLCVPSDSA+KTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLR
Subjt:  RRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLR

Query:  GFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG
        GFTVIWVVVDRLTKSAHFVPGKSTYT+SKWAQLYMSEIVRLHG
Subjt:  GFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG

TYK01613.1 pol protein [Cucumis melo var. makuwa]0.0e+0096.58Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDHLQGATVFSKIDLRSGYHQLRIKDEDIPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFD LQGATVFSKIDLRSGYHQLRIKDED+PK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDHLQGATVFSKIDLRSGYHQLRIKDEDIPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGVPFVWSKACEDSFLPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSG
        VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKG              PFVWSKACEDSFQTLKQKLVTAPVLTVPDGSG
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGVPFVWSKACEDSFLPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSG

Query:  SFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI
        SFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI
Subjt:  SFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI

Query:  LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEWAEIAVSVGAVTMQLAQLTVQPTLRQRIINAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFE
        LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLE AEIAVSVGAVTMQLAQLTVQPTLRQRII+AQSNDPYLVEKRGLAEAGQ  EFSLSSDGGLLFE
Subjt:  LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEWAEIAVSVGAVTMQLAQLTVQPTLRQRIINAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFE

Query:  RRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLR
        RRLCVPSDSA+KTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLR
Subjt:  RRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLR

Query:  GFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG
        GFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG
Subjt:  GFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG

TrEMBL top hitse value%identityAlignment
A0A5A7TW75 Pol protein0.0e+0095.8Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDHLQGATVFSKIDLRSGYHQLRIKDEDIPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCI+YRELNKVTVKNRYPLPRIDDLFD LQGATVFS+IDLRSGYHQLRIKDED+PK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDHLQGATVFSKIDLRSGYHQLRIKDEDIPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNAPA FMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHE+HLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGVPFVWSKACEDSFLPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSG
        VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKG              PFVWSKACEDSFQ LKQKLVTAPVLTVPDGSG
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGVPFVWSKACEDSFLPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSG

Query:  SFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI
        SFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI
Subjt:  SFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI

Query:  LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEWAEIAVSVGAVTMQLAQLTVQPTLRQRIINAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFE
        LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLE  EIAVSVGAVTMQLAQLTVQPTLRQRII+AQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFE
Subjt:  LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEWAEIAVSVGAVTMQLAQLTVQPTLRQRIINAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFE

Query:  RRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLR
        RRLCVPSDSA+KTELLSEAH SPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLR
Subjt:  RRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLR

Query:  GFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG
        GFTVIWVVVDRLTKSAHFVPGKSTYTAS+WAQLYMSEIVRLHG
Subjt:  GFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG

A0A5A7U330 Reverse transcriptase0.0e+0096.42Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDHLQGATVFSKIDLRSGYHQLRIKDEDIPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFD LQGATVFSKIDLRSGYHQLRIKDED+PK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDHLQGATVFSKIDLRSGYHQLRIKDEDIPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGVPFVWSKACEDSFLPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSG
        VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKG              PFVWSKACEDSFQ LKQKLVTAPVLTVPDGSG
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGVPFVWSKACEDSFLPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSG

Query:  SFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI
        SFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI
Subjt:  SFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI

Query:  LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEWAEIAVSVGAVTMQLAQLTVQPTLRQRIINAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFE
        LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLE AEIAVSVGAVTMQLAQLTVQPTLRQRII+AQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFE
Subjt:  LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEWAEIAVSVGAVTMQLAQLTVQPTLRQRIINAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFE

Query:  RRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLR
        RRLCVPSDS +KTELLSEAHSSPFSMHPGSTKMY+D+KRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLR
Subjt:  RRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLR

Query:  GFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG
        GFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG
Subjt:  GFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG

A0A5A7UY04 Reverse transcriptase0.0e+0095.65Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDHLQGATVFSKIDLRSGYHQLRIKDEDIPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFD LQGATVFSKIDLRSGYHQLRIKDED+PK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDHLQGATVFSKIDLRSGYHQLRIKDEDIPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLR+NKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGVPFVWSKACEDSFLPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSG
        VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKG              PFVWSKACEDSFQTLKQKLVTAPVLTVPDGSG
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGVPFVWSKACEDSFLPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSG

Query:  SFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI
        SFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHE+NYPTHDLELAAVVFALKIWRHYLYGE IQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI
Subjt:  SFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI

Query:  LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEWAEIAVSVGAVTMQLAQLTVQPTLRQRIINAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFE
        LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLE AEIAVSVGAVTMQLAQLTVQPTLRQRII+AQSNDPYLVEKRGL EAGQA EFSLSSDGGLLFE
Subjt:  LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEWAEIAVSVGAVTMQLAQLTVQPTLRQRIINAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFE

Query:  RRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLR
        RRLCVPSDSA+KT+LL+EAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLR
Subjt:  RRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLR

Query:  GFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG
        GFTV+WVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG
Subjt:  GFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG

A0A5A7V1N3 Reverse transcriptase0.0e+0096.11Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDHLQGATVFSKIDLRSGYHQLRIKDEDIPK
        MAPAELKELKVQLQELLDKGFIRPSVSP GAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFD LQGATVFSKIDLRSGYHQLRIKDED+PK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDHLQGATVFSKIDLRSGYHQLRIKDEDIPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHE HLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSK G
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGVPFVWSKACEDSFLPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSG
        VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKG              PFVWSKACEDSFQ LKQKLVTAPVLTVPDGSG
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGVPFVWSKACEDSFLPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSG

Query:  SFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI
        SFVIYSD SKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI
Subjt:  SFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI

Query:  LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEWAEIAVSVGAVTMQLAQLTVQPTLRQRIINAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFE
        LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLE AEIAVSVGAVTMQLAQLTVQPTLRQRII+AQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFE
Subjt:  LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEWAEIAVSVGAVTMQLAQLTVQPTLRQRIINAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFE

Query:  RRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLR
        RRLCVPSDSA+KTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLR
Subjt:  RRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLR

Query:  GFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG
        GFTVIWVVVDRLTKSAHFVPGKSTYT+SKWAQLYMSEIVRLHG
Subjt:  GFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG

A0A5D3BPI1 Reverse transcriptase0.0e+0096.58Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDHLQGATVFSKIDLRSGYHQLRIKDEDIPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFD LQGATVFSKIDLRSGYHQLRIKDED+PK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDHLQGATVFSKIDLRSGYHQLRIKDEDIPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGVPFVWSKACEDSFLPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSG
        VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKG              PFVWSKACEDSFQTLKQKLVTAPVLTVPDGSG
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGVPFVWSKACEDSFLPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSG

Query:  SFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI
        SFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI
Subjt:  SFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI

Query:  LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEWAEIAVSVGAVTMQLAQLTVQPTLRQRIINAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFE
        LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLE AEIAVSVGAVTMQLAQLTVQPTLRQRII+AQSNDPYLVEKRGLAEAGQ  EFSLSSDGGLLFE
Subjt:  LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEWAEIAVSVGAVTMQLAQLTVQPTLRQRIINAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFE

Query:  RRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLR
        RRLCVPSDSA+KTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLR
Subjt:  RRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLR

Query:  GFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG
        GFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG
Subjt:  GFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein2.5e-10332.87Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDHLQGATVFSKIDLRSGYHQLRIKDEDIPK
        + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L   +QG+T+F+K+DL+S YH +R++  D  K
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDHLQGATVFSKIDLRSGYHQLRIKDEDIPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
         AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ L++  L    +KCEF   QV F+G+ +S+ G
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGVPFVWSKACEDSFLPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSG
         +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K V   W            W+     + + +KQ LV+ PVL   D S 
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGVPFVWSKACEDSFLPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSG

Query:  SFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLE
          ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW  
Subjt:  SFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLE

Query:  LVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEWAEIAVSVGAVTMQLAQLTVQPTLRQRIINAQSNDPYLVEKRGLAEAGQAVEFSL
         ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++   +ND  L+    L    + VE ++
Subjt:  LVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEWAEIAVSVGAVTMQLAQLTVQPTLRQRIINAQSNDPYLVEKRGLAEAGQAVEFSL

Query:  SSDGGLLFERR--LCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSM
            GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  WE++SM
Subjt:  SSDGGLLFERR--LCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSM

Query:  DFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG
        DFIT LP +  G+  ++VVVDR +K A  VP   + TA + A+++   ++   G
Subjt:  DFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG

P0CT35 Transposon Tf2-2 polyprotein2.5e-10332.87Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDHLQGATVFSKIDLRSGYHQLRIKDEDIPK
        + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L   +QG+T+F+K+DL+S YH +R++  D  K
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDHLQGATVFSKIDLRSGYHQLRIKDEDIPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
         AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ L++  L    +KCEF   QV F+G+ +S+ G
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGVPFVWSKACEDSFLPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSG
         +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K V   W            W+     + + +KQ LV+ PVL   D S 
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGVPFVWSKACEDSFLPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSG

Query:  SFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLE
          ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW  
Subjt:  SFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLE

Query:  LVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEWAEIAVSVGAVTMQLAQLTVQPTLRQRIINAQSNDPYLVEKRGLAEAGQAVEFSL
         ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++   +ND  L+    L    + VE ++
Subjt:  LVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEWAEIAVSVGAVTMQLAQLTVQPTLRQRIINAQSNDPYLVEKRGLAEAGQAVEFSL

Query:  SSDGGLLFERR--LCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSM
            GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  WE++SM
Subjt:  SSDGGLLFERR--LCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSM

Query:  DFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG
        DFIT LP +  G+  ++VVVDR +K A  VP   + TA + A+++   ++   G
Subjt:  DFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG

P0CT41 Transposon Tf2-12 polyprotein2.5e-10332.87Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDHLQGATVFSKIDLRSGYHQLRIKDEDIPK
        + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L   +QG+T+F+K+DL+S YH +R++  D  K
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDHLQGATVFSKIDLRSGYHQLRIKDEDIPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
         AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ L++  L    +KCEF   QV F+G+ +S+ G
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGVPFVWSKACEDSFLPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSG
         +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K V   W            W+     + + +KQ LV+ PVL   D S 
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGVPFVWSKACEDSFLPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSG

Query:  SFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLE
          ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW  
Subjt:  SFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLE

Query:  LVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEWAEIAVSVGAVTMQLAQLTVQPTLRQRIINAQSNDPYLVEKRGLAEAGQAVEFSL
         ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++   +ND  L+    L    + VE ++
Subjt:  LVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEWAEIAVSVGAVTMQLAQLTVQPTLRQRIINAQSNDPYLVEKRGLAEAGQAVEFSL

Query:  SSDGGLLFERR--LCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSM
            GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  WE++SM
Subjt:  SSDGGLLFERR--LCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSM

Query:  DFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG
        DFIT LP +  G+  ++VVVDR +K A  VP   + TA + A+++   ++   G
Subjt:  DFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein1.9e-10335.3Show/hide
Query:  KELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDHLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSR
        +E+   +Q+LLD  FI PS SP  +PV+ V KKDG+ RLC+DYR LNK T+ + +PLPRID+L   +  A +F+ +DL SGYHQ+ ++ +D  KTAF + 
Subjt:  KELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDHLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSR

Query:  YGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPA
         G YE+ VM FGL NAP+ F   M   FR+    FV V++DDILI+S++  EH +HL  VL+ L++  L  K  KC+F  ++  FLG+ +    ++    
Subjt:  YGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPA

Query:  KIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGVPFVWSKACEDSFLPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSGSFVIYS
        K  A+  +  P TV + + FLG+  YYRRF+ N S+IA P+ QL      F+  K+         W++  + + + LK  L  +PVL   +   ++ + +
Subjt:  KIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGVPFVWSKACEDSFLPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSGSFVIYS

Query:  DASKKGLGCVLMQQGK------VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI
        DASK G+G VL +         VV Y S+ L+S ++NYP  +LEL  ++ AL  +R+ L+G+   + TDH SL     + E   R +RWL+ +  YD  +
Subjt:  DASKKGLGCVLMQQGK------VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI

Query:  LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEWAEIAVSVGAVTMQLAQLTVQPTLRQRIINAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFE
         Y  G  NVVADA+SR +       +R         + +      AV + + +LT      + +   +S      +K  L+E  +   +SL  D  + ++
Subjt:  LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEWAEIAVSVGAVTMQLAQLTVQPTLRQRIINAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFE

Query:  RRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLR
         RL VP         L   H+  F  H G T     +  +Y+W  ++  + +++  C+ CQ +K+ R +  GLLQPL I E +W ++SMDF+TGLP T  
Subjt:  RRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLR

Query:  GFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG
           +I VVVDR +K AHF+  + T  A++   L    I   HG
Subjt:  GFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.9e-10335.46Show/hide
Query:  KELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDHLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSR
        +E+   +Q+LLD  FI PS SP  +PV+ V KKDG+ RLC+DYR LNK T+ + +PLPRID+L   +  A +F+ +DL SGYHQ+ ++ +D  KTAF + 
Subjt:  KELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDHLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSR

Query:  YGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPA
         G YE+ VM FGL NAP+ F   M   FR+    FV V++DDILI+S++  EH +HL  VL+ L++  L  K  KC+F  ++  FLG+ +    ++    
Subjt:  YGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPA

Query:  KIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGVPFVWSKACEDSFLPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSGSFVIYS
        K  A+  +  P TV + + FLG+  YYRRF+ N S+IA P+ QL      F+  K+         W++  + +   LK  L  +PVL   +   ++ + +
Subjt:  KIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGVPFVWSKACEDSFLPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSGSFVIYS

Query:  DASKKGLGCVLMQQGK------VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI
        DASK G+G VL +         VV Y S+ L+S ++NYP  +LEL  ++ AL  +R+ L+G+   + TDH SL     + E   R +RWL+ +  YD  +
Subjt:  DASKKGLGCVLMQQGK------VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEI

Query:  LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEWAEIAVSVGAVTMQLAQLTVQPTLRQRIINAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFE
         Y  G  NVVADA+SR V       +R         + +      AV + + +LT      + +   +S      +K  L+E  +   +SL  D  + ++
Subjt:  LYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEWAEIAVSVGAVTMQLAQLTVQPTLRQRIINAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFE

Query:  RRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLR
         RL VP         L   H+  F  H G T     +  +Y+W  ++  + +++  C+ CQ +K+ R +  GLLQPL I E +W ++SMDF+TGLP T  
Subjt:  RRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLR

Query:  GFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG
           +I VVVDR +K AHF+  + T  A++   L    I   HG
Subjt:  GFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein4.3e-2641.67Show/hide
Query:  HLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLG--HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGVPFVW
        HL MVLQ    ++ YA   KC F   Q+++LG  H++S  GVS DPAK+EA+ GW  P   +E+R FLGL GYYRRFV+N+ +I  PLT+L +K      
Subjt:  HLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLG--HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGVPFVW

Query:  SKACEDSFLPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSGSFV
                    W++    +F+ LK  + T PVL +PD    FV
Subjt:  SKACEDSFLPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSGSFV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCCCCGCAGAATTGAAAGAACTGAAGGTACAGTTACAGGAATTGCTTGATAAGGGATTCATTCGACCGAGCGTGTCACCATGGGGTGCGCCAGTCTTATTCGTTAA
GAAGAAGGACGGATCGATGCGTCTATGCATTGACTATAGGGAGTTGAATAAGGTAACCGTGAAGAACAGATATCCCTTGCCCAGGATTGACGATCTATTTGACCACTTAC
AGGGAGCCACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATACCATCAGCTGAGGATTAAGGATGAGGATATACCGAAGACAGCATTTCGTTCCAGATATGGACACTAC
GAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGGCAGTATTTATGGACTTGATGAACAGAGTGTTTAGGGAGTTCCTAGATACTTTTGTGATCGTGTTTATCGA
CGATATCTTGATATACTCCAAGACGGAGGCCGAACACGAGGAGCATTTACGTATGGTTTTGCAAACACTTCGGGATAATAAATTGTATGCAAAGTTCTCGAAATGCGAGT
TTTGGCTGAAGCAGGTGTCCTTTCTGGGCCACGTGGTTTCTAAGGCTGGAGTCTCTGTGGATCCAGCTAAGATAGAGGCAGTCACTGGTTGGACCCGACCTTCCACAGTC
AGTGAGGTTCGTAGCTTTCTGGGTTTAGCAGGCTATTATCGACGGTTTGTGGAGAACTTTTCTCGTATAGCTACTCCTCTTACTCAGTTGACCAGAAAGGGAGTTCCTTT
TGTTTGGAGCAAGGCATGTGAGGACAGTTTCCTTCCTTTTGTTTGGAGCAAGGCATGTGAGGACAGTTTCCAGACCCTTAAACAGAAGCTAGTTACCGCACCGGTTCTTA
CTGTACCTGATGGTTCTGGCAGTTTTGTGATTTATAGTGATGCTTCCAAGAAAGGTTTAGGTTGCGTTTTGATGCAGCAGGGTAAGGTGGTCGCTTATGCGTCTCGTCAG
TTGAAGAGTCATGAACAGAACTACCCTACACATGATCTAGAGTTGGCAGCAGTGGTTTTTGCTTTGAAAATATGGAGGCACTATTTATATGGTGAAAAGATACAGATATT
CACAGATCATAAGAGCTTGAAATACTTCTTTACTCAGAAAGAATTGAATATGAGACAGCGAAGGTGGCTTGAGTTAGTGAAGGATTACGATTGTGAAATACTGTATCATC
CAGGCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGGAAGGTGTCACATTCAGCAGCACTTATTACCCGGCAGGCCCCATTGCATCGGGATCTCGAGTGGGCTGAGATT
GCAGTGTCAGTAGGGGCAGTTACTATGCAGTTAGCCCAGTTGACGGTACAGCCGACTTTGAGGCAAAGGATCATCAATGCTCAGAGTAACGATCCTTATCTGGTTGAGAA
ACGTGGCCTAGCAGAGGCAGGGCAAGCGGTTGAGTTCTCATTATCCTCGGATGGTGGACTTTTGTTTGAGAGACGCCTCTGTGTGCCGTCTGATAGTGCGATTAAGACAG
AATTATTATCTGAGGCTCACAGTTCCCCATTTTCCATGCACCCAGGTAGTACGAAGATGTATCAGGACCTGAAGCGGGTTTATTGGTGGCGTAACATGAAGAGGGAGGTA
GCAGAATTTGTTAGTAGATGCTTGGTGTGTCAGCAGGTTAAGGCACCAAGGCAGAAACCAGCGGGTTTGTTACAACCCTTGAGTATACCGGAATGGAAATGGGAAAACGT
GTCCATGGATTTCATTACGGGACTGCCGAGGACTCTGAGGGGTTTTACAGTGATTTGGGTTGTGGTGGATAGGCTTACCAAGTCAGCACACTTTGTTCCGGGTAAATCCA
CCTATACTGCCAGTAAGTGGGCACAGTTGTACATGTCCGAGATAGTGAGGTTGCATGGGTGCCAGTGTCGATTGTTTCTGATAGAGATGCCCGTTTCACTTCCAAATTTT
GGAAGGGTTTGCAGACTGCTATGGGCACGAGGTTGGACTTTAGTACGGCTTTCCATCCACAGACTGACGGTCAGACTGAGCGTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCCCCGCAGAATTGAAAGAACTGAAGGTACAGTTACAGGAATTGCTTGATAAGGGATTCATTCGACCGAGCGTGTCACCATGGGGTGCGCCAGTCTTATTCGTTAA
GAAGAAGGACGGATCGATGCGTCTATGCATTGACTATAGGGAGTTGAATAAGGTAACCGTGAAGAACAGATATCCCTTGCCCAGGATTGACGATCTATTTGACCACTTAC
AGGGAGCCACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATACCATCAGCTGAGGATTAAGGATGAGGATATACCGAAGACAGCATTTCGTTCCAGATATGGACACTAC
GAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGGCAGTATTTATGGACTTGATGAACAGAGTGTTTAGGGAGTTCCTAGATACTTTTGTGATCGTGTTTATCGA
CGATATCTTGATATACTCCAAGACGGAGGCCGAACACGAGGAGCATTTACGTATGGTTTTGCAAACACTTCGGGATAATAAATTGTATGCAAAGTTCTCGAAATGCGAGT
TTTGGCTGAAGCAGGTGTCCTTTCTGGGCCACGTGGTTTCTAAGGCTGGAGTCTCTGTGGATCCAGCTAAGATAGAGGCAGTCACTGGTTGGACCCGACCTTCCACAGTC
AGTGAGGTTCGTAGCTTTCTGGGTTTAGCAGGCTATTATCGACGGTTTGTGGAGAACTTTTCTCGTATAGCTACTCCTCTTACTCAGTTGACCAGAAAGGGAGTTCCTTT
TGTTTGGAGCAAGGCATGTGAGGACAGTTTCCTTCCTTTTGTTTGGAGCAAGGCATGTGAGGACAGTTTCCAGACCCTTAAACAGAAGCTAGTTACCGCACCGGTTCTTA
CTGTACCTGATGGTTCTGGCAGTTTTGTGATTTATAGTGATGCTTCCAAGAAAGGTTTAGGTTGCGTTTTGATGCAGCAGGGTAAGGTGGTCGCTTATGCGTCTCGTCAG
TTGAAGAGTCATGAACAGAACTACCCTACACATGATCTAGAGTTGGCAGCAGTGGTTTTTGCTTTGAAAATATGGAGGCACTATTTATATGGTGAAAAGATACAGATATT
CACAGATCATAAGAGCTTGAAATACTTCTTTACTCAGAAAGAATTGAATATGAGACAGCGAAGGTGGCTTGAGTTAGTGAAGGATTACGATTGTGAAATACTGTATCATC
CAGGCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGGAAGGTGTCACATTCAGCAGCACTTATTACCCGGCAGGCCCCATTGCATCGGGATCTCGAGTGGGCTGAGATT
GCAGTGTCAGTAGGGGCAGTTACTATGCAGTTAGCCCAGTTGACGGTACAGCCGACTTTGAGGCAAAGGATCATCAATGCTCAGAGTAACGATCCTTATCTGGTTGAGAA
ACGTGGCCTAGCAGAGGCAGGGCAAGCGGTTGAGTTCTCATTATCCTCGGATGGTGGACTTTTGTTTGAGAGACGCCTCTGTGTGCCGTCTGATAGTGCGATTAAGACAG
AATTATTATCTGAGGCTCACAGTTCCCCATTTTCCATGCACCCAGGTAGTACGAAGATGTATCAGGACCTGAAGCGGGTTTATTGGTGGCGTAACATGAAGAGGGAGGTA
GCAGAATTTGTTAGTAGATGCTTGGTGTGTCAGCAGGTTAAGGCACCAAGGCAGAAACCAGCGGGTTTGTTACAACCCTTGAGTATACCGGAATGGAAATGGGAAAACGT
GTCCATGGATTTCATTACGGGACTGCCGAGGACTCTGAGGGGTTTTACAGTGATTTGGGTTGTGGTGGATAGGCTTACCAAGTCAGCACACTTTGTTCCGGGTAAATCCA
CCTATACTGCCAGTAAGTGGGCACAGTTGTACATGTCCGAGATAGTGAGGTTGCATGGGTGCCAGTGTCGATTGTTTCTGATAGAGATGCCCGTTTCACTTCCAAATTTT
GGAAGGGTTTGCAGACTGCTATGGGCACGAGGTTGGACTTTAGTACGGCTTTCCATCCACAGACTGACGGTCAGACTGAGCGTCTGA
Protein sequenceShow/hide protein sequence
MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDHLQGATVFSKIDLRSGYHQLRIKDEDIPKTAFRSRYGHY
EFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTV
SEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGVPFVWSKACEDSFLPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQ
LKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLEWAEI
AVSVGAVTMQLAQLTVQPTLRQRIINAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREV
AEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGCQCRLFLIEMPVSLPNF
GRVCRLLWARGWTLVRLSIHRLTVRLSV