; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc08g0225601 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc08g0225601
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr08:16834475..16836121
RNA-Seq ExpressionCmc08g0225601
SyntenyCmc08g0225601
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0016020 - membrane (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037244.1 reverse transcriptase [Cucumis melo var. makuwa]8.4e-31097.81Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKYGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKK GSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKYGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK

Query:  TAFCSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAF SRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTE EHE+HLRMVLQTLRDNKLYAKF KCEFWLKQVSFLGHVVSKAG
Subjt:  TAFCSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLASYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTALVLTVPDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTGWTRPSTVSEVRSFLGLA YYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTA VLTVPDGSGSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLASYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTALVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQISTDHKSLKYFFIQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
        GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQI TDHKSLKYFF QKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQISTDHKSLKYFFIQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKT
        LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAE GQAVEFSLSSDGGLLFERRLCVPSDSAIKT
Subjt:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKT

Query:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLLCQ
        ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS+CL+CQ
Subjt:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLLCQ

KAA0040689.1 pol protein [Cucumis melo var. makuwa]4.2e-31097.81Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKYGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
        MAPAELKELKVQLQELLDKGFIRPS+SPWGAPVLFVKKK GSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKYGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK

Query:  TAFCSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAF SRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHE+HLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFCSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLASYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTALVLTVPDGSGSFVIYSDASKKGL
        VSVDP KIEAVTGWTRPSTVSEVRSFLGLA YYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTA VL VPDGSGSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLASYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTALVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQISTDHKSLKYFFIQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
        GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQI TDHKSLKYFF QKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQISTDHKSLKYFFIQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKT
        LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFS+SSDGGLLFERRLCVPSDSAIKT
Subjt:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKT

Query:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLLCQ
        ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCL+CQ
Subjt:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLLCQ

KAA0045479.1 pol protein [Cucumis melo var. makuwa]1.3e-30997.63Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKYGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKK GSMRLCI+YRELNKVTVKNRYPLPRIDDLFDQLQGATVFS+IDLRSGYHQLRIKDEDVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKYGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK

Query:  TAFCSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAF SRYGHYEFIVMSFGLTNAPA FMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFCSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLASYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTALVLTVPDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTGWTRPSTVSEVRSFLGLA YYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTA VLTVPDGSGSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLASYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTALVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQISTDHKSLKYFFIQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
        GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQI TDHKSLKYFF QKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQISTDHKSLKYFFIQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKT
        LSRKVSHSAALITRQAPLHRDLER EIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSA+KT
Subjt:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKT

Query:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLLCQ
        ELLSEAH SPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCL+CQ
Subjt:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLLCQ

KAA0048687.1 pol protein [Cucumis melo var. makuwa]4.2e-31097.81Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKYGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKK GSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKYGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK

Query:  TAFCSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAF SRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHE+HLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFCSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLASYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTALVLTVPDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTGWTRPSTVSEVRSFLGLA YYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTA VLTVPDGSGSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLASYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTALVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQISTDHKSLKYFFIQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
        GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQI TDHKSLKYFF QKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQISTDHKSLKYFFIQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKT
        LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDS +KT
Subjt:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKT

Query:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLLCQ
        ELLSEAHSSPFSMHPGSTKMY+D+KRVYWWRNMKREVAEFVSRCL+CQ
Subjt:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLLCQ

TYK01613.1 pol protein [Cucumis melo var. makuwa]1.3e-30997.63Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKYGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKK GSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKYGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK

Query:  TAFCSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAF SRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHE+HLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFCSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLASYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTALVLTVPDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTGWTRPSTVSEVRSFLGLA YYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQ LKQKLVTA VLTVPDGSGSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLASYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTALVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQISTDHKSLKYFFIQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
        GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQI TDHKSLKYFF QKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQISTDHKSLKYFFIQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKT
        LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQ  EFSLSSDGGLLFERRLCVPSDSA+KT
Subjt:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKT

Query:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLLCQ
        ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS+CL+CQ
Subjt:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLLCQ

TrEMBL top hitse value%identityAlignment
A0A5A7T190 Reverse transcriptase4.1e-31097.81Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKYGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKK GSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKYGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK

Query:  TAFCSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAF SRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTE EHE+HLRMVLQTLRDNKLYAKF KCEFWLKQVSFLGHVVSKAG
Subjt:  TAFCSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLASYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTALVLTVPDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTGWTRPSTVSEVRSFLGLA YYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTA VLTVPDGSGSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLASYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTALVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQISTDHKSLKYFFIQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
        GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQI TDHKSLKYFF QKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQISTDHKSLKYFFIQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKT
        LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAE GQAVEFSLSSDGGLLFERRLCVPSDSAIKT
Subjt:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKT

Query:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLLCQ
        ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS+CL+CQ
Subjt:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLLCQ

A0A5A7THE6 Reverse transcriptase2.0e-31097.81Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKYGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
        MAPAELKELKVQLQELLDKGFIRPS+SPWGAPVLFVKKK GSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKYGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK

Query:  TAFCSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAF SRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHE+HLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFCSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLASYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTALVLTVPDGSGSFVIYSDASKKGL
        VSVDP KIEAVTGWTRPSTVSEVRSFLGLA YYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTA VL VPDGSGSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLASYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTALVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQISTDHKSLKYFFIQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
        GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQI TDHKSLKYFF QKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQISTDHKSLKYFFIQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKT
        LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFS+SSDGGLLFERRLCVPSDSAIKT
Subjt:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKT

Query:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLLCQ
        ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCL+CQ
Subjt:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLLCQ

A0A5A7TW75 Pol protein6.1e-31097.63Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKYGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKK GSMRLCI+YRELNKVTVKNRYPLPRIDDLFDQLQGATVFS+IDLRSGYHQLRIKDEDVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKYGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK

Query:  TAFCSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAF SRYGHYEFIVMSFGLTNAPA FMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFCSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLASYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTALVLTVPDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTGWTRPSTVSEVRSFLGLA YYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTA VLTVPDGSGSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLASYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTALVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQISTDHKSLKYFFIQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
        GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQI TDHKSLKYFF QKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQISTDHKSLKYFFIQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKT
        LSRKVSHSAALITRQAPLHRDLER EIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSA+KT
Subjt:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKT

Query:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLLCQ
        ELLSEAH SPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCL+CQ
Subjt:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLLCQ

A0A5A7U330 Reverse transcriptase2.0e-31097.81Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKYGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKK GSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKYGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK

Query:  TAFCSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAF SRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHE+HLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFCSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLASYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTALVLTVPDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTGWTRPSTVSEVRSFLGLA YYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTA VLTVPDGSGSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLASYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTALVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQISTDHKSLKYFFIQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
        GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQI TDHKSLKYFF QKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQISTDHKSLKYFFIQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKT
        LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDS +KT
Subjt:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKT

Query:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLLCQ
        ELLSEAHSSPFSMHPGSTKMY+D+KRVYWWRNMKREVAEFVSRCL+CQ
Subjt:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLLCQ

A0A5D3BPI1 Reverse transcriptase6.1e-31097.63Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKYGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
        MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKK GSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKYGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK

Query:  TAFCSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAF SRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHE+HLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFCSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLASYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTALVLTVPDGSGSFVIYSDASKKGL
        VSVDPAKIEAVTGWTRPSTVSEVRSFLGLA YYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQ LKQKLVTA VLTVPDGSGSFVIYSDASKKGL
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLASYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTALVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQISTDHKSLKYFFIQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
        GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQI TDHKSLKYFF QKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQISTDHKSLKYFFIQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA

Query:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKT
        LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQ  EFSLSSDGGLLFERRLCVPSDSA+KT
Subjt:  LSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKT

Query:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLLCQ
        ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS+CL+CQ
Subjt:  ELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLLCQ

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.64.5e-8740.2Show/hide
Query:  KELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKYGS-----MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKT
        +E++ Q+Q++L++G IR S SP+ +P+  V KK  +      R+ IDYR+LN++TV +R+P+P +D++  +L     F+ IDL  G+HQ+ +  E V KT
Subjt:  KELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKYGS-----MRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKT

Query:  AFCSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGV
        AF +++GHYE++ M FGL NAPA F   MN + R  L+   +V++DDI+++S +  EH + L +V + L    L  +  KCEF  ++ +FLGHV++  G+
Subjt:  AFCSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGV

Query:  SVDPAKIEAVTGWTRPSTVSEVRSFLGLASYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDS-FQNLKQKLVTALVLTVPDGSGSFVIYSDASKKGL
          +P KIEA+  +  P+   E+++FLGL  YYR+F+ NF+ IA P+T+  +K      +    DS F+ LK  +    +L VPD +  F + +DAS   L
Subjt:  SVDPAKIEAVTGWTRPSTVSEVRSFLGLASYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDS-FQNLKQKLVTALVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQISTDHKSLKYFFIQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA
        G VL Q G  ++Y SR L  HE NY T + EL A+V+A K +RHYL G   +IS+DH+ L + +  K+ N +  RW   + ++D +I Y  GK N VADA
Subjt:  GCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQISTDHKSLKYFFIQKELNMRQRRWLELVKDYDCEILYHPGKANVVADA

Query:  LSR
        LSR
Subjt:  LSR

P0CT34 Transposon Tf2-1 polyprotein1.3e-8632.2Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKYGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
        + P +++ +  ++ + L  G IR S +    PV+FV KK G++R+ +DY+ LNK    N YPLP I+ L  ++QG+T+F+K+DL+S YH +R++  D  K
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKYGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK

Query:  TAFCSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
         AF    G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH KH++ VLQ L++  L    +KCEF   QV F+G+ +S+ G
Subjt:  TAFCSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLASYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTALVLTVPDGSGSFVIYSDASKKGL
         +     I+ V  W +P    E+R FLG  +Y R+F+   S++  PL  L +K   + W+     + +N+KQ LV+  VL   D S   ++ +DAS   +
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLASYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTALVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQISTDHKSL--KYFFIQKELNMRQRRWLELVKDYDCEILYHP
        G VL Q+        V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L  +     +  N R  RW   ++D++ EI Y P
Subjt:  GCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQISTDHKSL--KYFFIQKELNMRQRRWLELVKDYDCEILYHP

Query:  GKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--
        G AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++   +ND  L+    L    + VE ++    GLL   +  
Subjt:  GKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--

Query:  LCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLLCQ
        + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ
Subjt:  LCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLLCQ

P0CT35 Transposon Tf2-2 polyprotein1.3e-8632.2Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKYGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
        + P +++ +  ++ + L  G IR S +    PV+FV KK G++R+ +DY+ LNK    N YPLP I+ L  ++QG+T+F+K+DL+S YH +R++  D  K
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKYGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK

Query:  TAFCSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
         AF    G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH KH++ VLQ L++  L    +KCEF   QV F+G+ +S+ G
Subjt:  TAFCSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLASYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTALVLTVPDGSGSFVIYSDASKKGL
         +     I+ V  W +P    E+R FLG  +Y R+F+   S++  PL  L +K   + W+     + +N+KQ LV+  VL   D S   ++ +DAS   +
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLASYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTALVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQISTDHKSL--KYFFIQKELNMRQRRWLELVKDYDCEILYHP
        G VL Q+        V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L  +     +  N R  RW   ++D++ EI Y P
Subjt:  GCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQISTDHKSL--KYFFIQKELNMRQRRWLELVKDYDCEILYHP

Query:  GKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--
        G AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++   +ND  L+    L    + VE ++    GLL   +  
Subjt:  GKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--

Query:  LCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLLCQ
        + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ
Subjt:  LCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLLCQ

P0CT36 Transposon Tf2-3 polyprotein1.3e-8632.2Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKYGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
        + P +++ +  ++ + L  G IR S +    PV+FV KK G++R+ +DY+ LNK    N YPLP I+ L  ++QG+T+F+K+DL+S YH +R++  D  K
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKYGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK

Query:  TAFCSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
         AF    G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH KH++ VLQ L++  L    +KCEF   QV F+G+ +S+ G
Subjt:  TAFCSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLASYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTALVLTVPDGSGSFVIYSDASKKGL
         +     I+ V  W +P    E+R FLG  +Y R+F+   S++  PL  L +K   + W+     + +N+KQ LV+  VL   D S   ++ +DAS   +
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLASYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTALVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQISTDHKSL--KYFFIQKELNMRQRRWLELVKDYDCEILYHP
        G VL Q+        V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L  +     +  N R  RW   ++D++ EI Y P
Subjt:  GCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQISTDHKSL--KYFFIQKELNMRQRRWLELVKDYDCEILYHP

Query:  GKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--
        G AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++   +ND  L+    L    + VE ++    GLL   +  
Subjt:  GKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--

Query:  LCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLLCQ
        + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ
Subjt:  LCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLLCQ

P0CT41 Transposon Tf2-12 polyprotein1.3e-8632.2Show/hide
Query:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKYGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK
        + P +++ +  ++ + L  G IR S +    PV+FV KK G++R+ +DY+ LNK    N YPLP I+ L  ++QG+T+F+K+DL+S YH +R++  D  K
Subjt:  MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKYGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPK

Query:  TAFCSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
         AF    G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH KH++ VLQ L++  L    +KCEF   QV F+G+ +S+ G
Subjt:  TAFCSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLASYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTALVLTVPDGSGSFVIYSDASKKGL
         +     I+ V  W +P    E+R FLG  +Y R+F+   S++  PL  L +K   + W+     + +N+KQ LV+  VL   D S   ++ +DAS   +
Subjt:  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLASYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTALVLTVPDGSGSFVIYSDASKKGL

Query:  GCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQISTDHKSL--KYFFIQKELNMRQRRWLELVKDYDCEILYHP
        G VL Q+        V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L  +     +  N R  RW   ++D++ EI Y P
Subjt:  GCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQISTDHKSL--KYFFIQKELNMRQRRWLELVKDYDCEILYHP

Query:  GKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--
        G AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++   +ND  L+    L    + VE ++    GLL   +  
Subjt:  GKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--

Query:  LCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLLCQ
        + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ
Subjt:  LCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLLCQ

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein1.7e-2544.27Show/hide
Query:  HLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLG--HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLASYYRRFVENFSRIATPLTQLTRKGAPFVW
        HL MVLQ    ++ YA   KC F   Q+++LG  H++S  GVS DPAK+EA+ GW  P   +E+R FLGL  YYRRFV+N+ +I  PLT+L +K +   W
Subjt:  HLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLG--HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLASYYRRFVENFSRIATPLTQLTRKGAPFVW

Query:  SKACEDSFQNLKQKLVTALVLTVPDGSGSFV
        ++    +F+ LK  + T  VL +PD    FV
Subjt:  SKACEDSFQNLKQKLVTALVLTVPDGSGSFV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCCCCGCAGAGCTGAAAGAACTGAAGGTGCAGTTGCAAGAATTGCTTGATAAGGGATTCATTCGACCGAGCGTGTCACCTTGGGGTGCGCCAGTTTTATTTGTTAA
GAAGAAGTATGGATCGATGCGTCTATGCATTGACTATAGGGAGTTGAACAAGGTAACCGTAAAGAACAGATATCCCTTGCCCAGGATTGACGATCTATTCGACCAGTTAC
AGGGAGCCACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATACCATCAGCTGAGGATTAAGGATGAGGATGTACCGAAGACAGCATTTTGTTCCAGATACGGACACTAC
GAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCAGCAGTGTTTATGGACTTGATGAACAGAGTGTTTAGGGAGTTCCTAGATACTTTTGTGATCGTGTTTATCGA
CGACATCTTGATATACTCCAAGACGGAGGCCGAACACGAGAAGCATTTACGTATGGTTTTGCAAACACTTCGGGATAATAAGTTGTATGCAAAGTTCTCGAAATGTGAGT
TTTGGCTGAAGCAGGTGTCCTTTCTGGGTCACGTGGTTTCTAAGGCTGGGGTCTCTGTGGATCCAGCTAAGATAGAGGCAGTCACCGGTTGGACCCGACCTTCCACTGTC
AGTGAGGTTCGTAGCTTTCTGGGTTTAGCAAGTTATTATCGACGGTTTGTGGAGAACTTTTCTCGTATAGCTACTCCTCTCACTCAGTTGACCAGGAAGGGAGCTCCTTT
TGTTTGGAGCAAGGCATGTGAGGACAGTTTCCAGAACCTTAAACAGAAGCTAGTTACCGCACTGGTTCTTACTGTACCTGATGGTTCTGGCAGTTTTGTGATTTATAGTG
ATGCTTCCAAGAAGGGTTTGGGTTGTGTTTTGATGCAGCAGGGTAAGGTGGTTGCTTATGCGTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACACATGATCTA
GAGTTGGCAGCAGTGGTTTTTGCTTTGAAAATATGGAGGCATTACTTATATGGTGAAAAGATACAGATCTCCACGGACCATAAGAGCCTGAAATACTTCTTTATTCAGAA
AGAATTGAATATGAGACAGCGGAGATGGCTTGAGTTAGTGAAGGATTACGATTGTGAGATACTGTATCATCCAGGCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGGA
AGGTATCACATTCAGCAGCACTTATTACCCGACAGGCCCCTTTGCATCGGGATCTCGAGCGGGCTGAGATTGCAGTGTCAGTGGGGGCAGTTACTATGCAGTTAGCCCAG
TTGACAGTACAGCCGACTTTGAGGCAAAGGATCATTGATGCTCAGAGTAACGATCCTTATCTGGTTGAGAAACGTGGCCTAGCAGAAGCAGGGCAAGCGGTTGAGTTCTC
ATTATCCTCGGATGGTGGACTTTTGTTTGAGAGACGCCTCTGTGTGCCGTCAGATAGTGCGATTAAGACAGAATTATTATCTGAGGCTCACAGTTCCCCATTTTCCATGC
ACCCAGGTAGTACGAAGATGTATCAGGACCTGAAGCGGGTTTATTGGTGGCGTAACATGAAGAGGGAAGTAGCAGAATTTGTTAGTAGATGCTTGTTGTGTCAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCCCCGCAGAGCTGAAAGAACTGAAGGTGCAGTTGCAAGAATTGCTTGATAAGGGATTCATTCGACCGAGCGTGTCACCTTGGGGTGCGCCAGTTTTATTTGTTAA
GAAGAAGTATGGATCGATGCGTCTATGCATTGACTATAGGGAGTTGAACAAGGTAACCGTAAAGAACAGATATCCCTTGCCCAGGATTGACGATCTATTCGACCAGTTAC
AGGGAGCCACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATACCATCAGCTGAGGATTAAGGATGAGGATGTACCGAAGACAGCATTTTGTTCCAGATACGGACACTAC
GAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCAGCAGTGTTTATGGACTTGATGAACAGAGTGTTTAGGGAGTTCCTAGATACTTTTGTGATCGTGTTTATCGA
CGACATCTTGATATACTCCAAGACGGAGGCCGAACACGAGAAGCATTTACGTATGGTTTTGCAAACACTTCGGGATAATAAGTTGTATGCAAAGTTCTCGAAATGTGAGT
TTTGGCTGAAGCAGGTGTCCTTTCTGGGTCACGTGGTTTCTAAGGCTGGGGTCTCTGTGGATCCAGCTAAGATAGAGGCAGTCACCGGTTGGACCCGACCTTCCACTGTC
AGTGAGGTTCGTAGCTTTCTGGGTTTAGCAAGTTATTATCGACGGTTTGTGGAGAACTTTTCTCGTATAGCTACTCCTCTCACTCAGTTGACCAGGAAGGGAGCTCCTTT
TGTTTGGAGCAAGGCATGTGAGGACAGTTTCCAGAACCTTAAACAGAAGCTAGTTACCGCACTGGTTCTTACTGTACCTGATGGTTCTGGCAGTTTTGTGATTTATAGTG
ATGCTTCCAAGAAGGGTTTGGGTTGTGTTTTGATGCAGCAGGGTAAGGTGGTTGCTTATGCGTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACACATGATCTA
GAGTTGGCAGCAGTGGTTTTTGCTTTGAAAATATGGAGGCATTACTTATATGGTGAAAAGATACAGATCTCCACGGACCATAAGAGCCTGAAATACTTCTTTATTCAGAA
AGAATTGAATATGAGACAGCGGAGATGGCTTGAGTTAGTGAAGGATTACGATTGTGAGATACTGTATCATCCAGGCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGGA
AGGTATCACATTCAGCAGCACTTATTACCCGACAGGCCCCTTTGCATCGGGATCTCGAGCGGGCTGAGATTGCAGTGTCAGTGGGGGCAGTTACTATGCAGTTAGCCCAG
TTGACAGTACAGCCGACTTTGAGGCAAAGGATCATTGATGCTCAGAGTAACGATCCTTATCTGGTTGAGAAACGTGGCCTAGCAGAAGCAGGGCAAGCGGTTGAGTTCTC
ATTATCCTCGGATGGTGGACTTTTGTTTGAGAGACGCCTCTGTGTGCCGTCAGATAGTGCGATTAAGACAGAATTATTATCTGAGGCTCACAGTTCCCCATTTTCCATGC
ACCCAGGTAGTACGAAGATGTATCAGGACCTGAAGCGGGTTTATTGGTGGCGTAACATGAAGAGGGAAGTAGCAGAATTTGTTAGTAGATGCTTGTTGTGTCAGTAG
Protein sequenceShow/hide protein sequence
MAPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKYGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFCSRYGHY
EFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTV
SEVRSFLGLASYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTALVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDL
ELAAVVFALKIWRHYLYGEKIQISTDHKSLKYFFIQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQ
LTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLLCQ