; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc08g0224471 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc08g0224471
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr08:14563708..14566337
RNA-Seq ExpressionCmc08g0224471
SyntenyCmc08g0224471
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026271.1 pol protein [Cucumis melo var. makuwa]0.0e+0091.62Show/hide
Query:  MTPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKEGDVPK
        M PAELKELKVQLQ+LLDKGFIR SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK+GDVPK
Subjt:  MTPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKEGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMGLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNAPAVFM LMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMGLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAITGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF-----------
        VSVDPAKIEA+TGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF           
Subjt:  VSVDPAKIEAITGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF-----------

Query:  -------GKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANV----
               GKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANV    
Subjt:  -------GKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANV----

Query:  ---------------APLHRDLERAEIAVSVGAVTM--AQLTVQPTLRQRIIDAQSNYPYLVEQRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKT
                       APLHRDLERAEIAVSVGAVTM  AQLTVQPTLRQRIIDAQ N PYLVE+R LAEAGQAVEFS+SSDGGLLFER LCVPSDSA KT
Subjt:  ---------------APLHRDLERAEIAVSVGAVTM--AQLTVQPTLRQRIIDAQSNYPYLVEQRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKT

Query:  ELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSKCLLCQQVKAPRQKSAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFIVIWVVVDRLT
        ELLSEAHSSPFSMHPGSTKMYQDLK+VYWWRNMKREVAEFVSKCL+CQQVKAPRQK  GLLQPLSIPEWKWENVSMDFITGLPRTLRGF VIWVVVDRLT
Subjt:  ELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSKCLLCQQVKAPRQKSAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFIVIWVVVDRLT

Query:  KSAHFVPGKSTYTASKWAQLYMSEIVTLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACAFEFPGSWDSHLH
        KSAHFVPGKSTYTASKWAQLYMSEIV LH VPVSIVSD+DARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACA EFPGSWDSHLH
Subjt:  KSAHFVPGKSTYTASKWAQLYMSEIVTLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACAFEFPGSWDSHLH

Query:  LMEFAYNNNYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRTHTAQSRQKSYADVRRKDLEFEVEDKVFLKVAPMRGVLRF
        LMEFAYNN+YQATIGM PFEALYGKCCRSPVCWGEVGEQ LMGPELVQSTNEAIQKIRSR HTAQSRQKSYADVRRKDLEF + DKVFLKVAPMRGVLRF
Subjt:  LMEFAYNNNYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRTHTAQSRQKSYADVRRKDLEFEVEDKVFLKVAPMRGVLRF

Query:  ERRGKLSPRFVGPFEILEPIGPVAYRLALPTSLSTVHDVFHVSMLRK
        ERRGKLSPRFV PFEILE IGPVAYRLALP SLSTVHDVFHVSMLRK
Subjt:  ERRGKLSPRFVGPFEILEPIGPVAYRLALPTSLSTVHDVFHVSMLRK

KAA0033181.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]0.0e+0098.61Show/hide
Query:  GFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKEGDVPKTAFRSRYGHYEFIVMSFGL
        GFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKEGDVPKTAFRSRYGHYEFIVMSFGL
Subjt:  GFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKEGDVPKTAFRSRYGHYEFIVMSFGL

Query:  TNAPAVFMGLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAITGWTRPST
        TNAPAVFMGLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAITGWTRPST
Subjt:  TNAPAVFMGLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAITGWTRPST

Query:  VSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFGKVVAYASRQLKSHEQNYPTHDLELAAVVF
        VSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAP           GKVVAYASRQLKSHEQNYPTHDLELAAVVF
Subjt:  VSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFGKVVAYASRQLKSHEQNYPTHDLELAAVVF

Query:  ALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVAPLHRDLERAEIAVSVGAVTMAQLTVQPTLRQRIIDAQSNY
        ALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVAPLHRDLERAEIAVSVGAVTMAQLTVQPTLRQRIIDAQSNY
Subjt:  ALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVAPLHRDLERAEIAVSVGAVTMAQLTVQPTLRQRIIDAQSNY

Query:  PYLVEQRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSKCLLCQQVKAPRQKSA
        PYLVEQRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSKCLLCQQVKAPRQKSA
Subjt:  PYLVEQRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSKCLLCQQVKAPRQKSA

Query:  GLLQPLSIPEWKWENVSMDFITGLPRTLRGFIVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVTLHGVPVSIVSDRDARFTSKFWKGLQTAMGTR
        GLLQPLSIPEWKWENVSMDFITGLPRTLRGFIVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVTLHGVPVSIVSDRDARFTSKFWKGLQTAMGTR
Subjt:  GLLQPLSIPEWKWENVSMDFITGLPRTLRGFIVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVTLHGVPVSIVSDRDARFTSKFWKGLQTAMGTR

Query:  LDFSTAFHPQTDGQTERLNQVLEDMLRACAFEFPGSWDSHLHLMEFAYNNNYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIR
        LDFSTAFHPQTDGQTERLNQVLEDMLRACAFEFPGSWDSHLHLMEFAYNNNYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIR
Subjt:  LDFSTAFHPQTDGQTERLNQVLEDMLRACAFEFPGSWDSHLHLMEFAYNNNYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIR

Query:  SRTHTAQSRQKSYADVRRKDLEFEVEDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILEPIGPVAYRLALPTSLSTVHDVFHVSMLRK
        SRTHTAQSRQKSYADVRRKDLEFEVEDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILEPIGPVAYRLALPTSLSTVHDVFHVSMLRK
Subjt:  SRTHTAQSRQKSYADVRRKDLEFEVEDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILEPIGPVAYRLALPTSLSTVHDVFHVSMLRK

KAA0048687.1 pol protein [Cucumis melo var. makuwa]0.0e+0092.09Show/hide
Query:  MTPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKEGDVPK
        M PAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK+ DVPK
Subjt:  MTPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKEGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMGLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNAPAVFM LMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMGLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAITGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF-----------
        VSVDPAKIEA+TGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF           
Subjt:  VSVDPAKIEAITGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF-----------

Query:  -------GKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANV----
               GKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANV    
Subjt:  -------GKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANV----

Query:  ---------------APLHRDLERAEIAVSVGAVTM--AQLTVQPTLRQRIIDAQSNYPYLVEQRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKT
                       APLHRDLERAEIAVSVGAVTM  AQLTVQPTLRQRIIDAQSN PYLVE+RGLAEAGQAVEFS+SSDGGLLFERRLCVPSDS VKT
Subjt:  ---------------APLHRDLERAEIAVSVGAVTM--AQLTVQPTLRQRIIDAQSNYPYLVEQRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKT

Query:  ELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSKCLLCQQVKAPRQKSAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFIVIWVVVDRLT
        ELLSEAHSSPFSMHPGSTKMY+D+K+VYWWRNMKREVAEFVS+CL+CQQVKAPRQK AGLLQPLSIPEWKWENVSMDFITGLPRTLRGF VIWVVVDRLT
Subjt:  ELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSKCLLCQQVKAPRQKSAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFIVIWVVVDRLT

Query:  KSAHFVPGKSTYTASKWAQLYMSEIVTLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACAFEFPGSWDSHLH
        KSAHFVPGKSTYTASKWAQLYMSEIV LHGVPVSIVSDRDARFTSKFWK LQTAMGTRLDFSTAFHPQTDGQTERLNQVLE MLRACA EFPGSWDSHLH
Subjt:  KSAHFVPGKSTYTASKWAQLYMSEIVTLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACAFEFPGSWDSHLH

Query:  LMEFAYNNNYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRTHTAQSRQKSYADVRRKDLEFEVEDKVFLKVAPMRGVLRF
        LMEF YNN+YQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSR HTAQSRQKSYADVRRKDLEFEV DKVFLKVAPMRGVLRF
Subjt:  LMEFAYNNNYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRTHTAQSRQKSYADVRRKDLEFEVEDKVFLKVAPMRGVLRF

Query:  ERRGKLSPRFVGPFEILEPIGPVAYRLALPTSLSTVHDVFHVSMLRK
        ERRGKLSPRF+GPFEILE IGPVAYRLALP SLSTVHDVFHVSMLRK
Subjt:  ERRGKLSPRFVGPFEILEPIGPVAYRLALPTSLSTVHDVFHVSMLRK

KAA0051357.1 pol protein [Cucumis melo var. makuwa]0.0e+0091.85Show/hide
Query:  MTPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKEGDVPK
        M PAELKELKVQLQELLDKGFIRP+VSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGY+QLRIK+ DVPK
Subjt:  MTPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKEGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMGLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNAPAVFM LMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKA 
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMGLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAITGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF-----------
        VSVDPAKIEA+TGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF           
Subjt:  VSVDPAKIEAITGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF-----------

Query:  -------GKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANV----
               GKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANV    
Subjt:  -------GKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANV----

Query:  ---------------APLHRDLERAEIAVSVGAVTM--AQLTVQPTLRQRIIDAQSNYPYLVEQRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKT
                       APLHRDLERAEIAVSVGAVTM  AQLTVQPTLRQRIIDAQSN PYLVE+RGLAEAGQAVEFS+SSDGGL FE RLCVPSDSAVKT
Subjt:  ---------------APLHRDLERAEIAVSVGAVTM--AQLTVQPTLRQRIIDAQSNYPYLVEQRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKT

Query:  ELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSKCLLCQQVKAPRQKSAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFIVIWVVVDRLT
        ELL EAHSSPFSMHPGSTKMYQDLK+VYWWRNMKREVAEFVSKCL+CQQVK PRQK AGLLQPLSIPEWKWENVSMDFITGLPRTLRGF VIWVVVDRLT
Subjt:  ELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSKCLLCQQVKAPRQKSAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFIVIWVVVDRLT

Query:  KSAHFVPGKSTYTASKWAQLYMSEIVTLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACAFEFPGSWDSHLH
        KSAHFVPGKSTYTASKWAQLYMSEIV LHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQ DGQTERLNQVLEDMLRACA EFPGSWDSHLH
Subjt:  KSAHFVPGKSTYTASKWAQLYMSEIVTLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACAFEFPGSWDSHLH

Query:  LMEFAYNNNYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRTHTAQSRQKSYADVRRKDLEFEVEDKVFLKVAPMRGVLRF
        LMEFAYNN+YQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSR HTAQSRQKSYADVRRKDLEFE+ DKVFLKVAPM+GVLRF
Subjt:  LMEFAYNNNYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRTHTAQSRQKSYADVRRKDLEFEVEDKVFLKVAPMRGVLRF

Query:  ERRGKLSPRFVGPFEILEPIGPVAYRLALPTSLSTVHDVFHVSMLRK
        ERRGKLSPRFVGPFEILE IGPVAYRLALP SLSTVHDVFHVSMLRK
Subjt:  ERRGKLSPRFVGPFEILEPIGPVAYRLALPTSLSTVHDVFHVSMLRK

KAA0057672.1 pol protein [Cucumis melo var. makuwa]0.0e+0091.85Show/hide
Query:  MTPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKEGDVPK
        M PAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSG HQLRIK+ DVPK
Subjt:  MTPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKEGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMGLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNAPAVFM LMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMGLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAITGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF-----------
        VSVDPAKIEA+TGWTRPSTVSEVRSFLGLAGYYRRFVENFSR ATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF           
Subjt:  VSVDPAKIEAITGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF-----------

Query:  -------GKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANV----
               GKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANV    
Subjt:  -------GKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANV----

Query:  ---------------APLHRDLERAEIAVSVGAVTM--AQLTVQPTLRQRIIDAQSNYPYLVEQRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKT
                       APLHRDLERAEIAVSVGAVTM  AQLTVQPTLRQRIIDAQSN PYLVE+RGLAEAGQAVEFS+SSDGGLLFERRLCVPSDSAVKT
Subjt:  ---------------APLHRDLERAEIAVSVGAVTM--AQLTVQPTLRQRIIDAQSNYPYLVEQRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKT

Query:  ELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSKCLLCQQVKAPRQKSAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFIVIWVVVDRLT
        ELL+EAHSSPFSMHPGSTKMYQDLK++YWWRNMKREVAEFVSKCL+CQQVKAPRQK AGLLQPLSIPEWKWENVSMDFI GLPRTLRGF VIWVVVDRLT
Subjt:  ELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSKCLLCQQVKAPRQKSAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFIVIWVVVDRLT

Query:  KSAHFVPGKSTYTASKWAQLYMSEIVTLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACAFEFPGSWDSHLH
        KSAHFVPGKSTYT SKWAQLYMSEIV LHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLN+VLEDMLRACA EFPGSWDSHLH
Subjt:  KSAHFVPGKSTYTASKWAQLYMSEIVTLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACAFEFPGSWDSHLH

Query:  LMEFAYNNNYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRTHTAQSRQKSYADVRRKDLEFEVEDKVFLKVAPMRGVLRF
        LMEFAYNN+YQATIGMAPFEALY KCCRSP+CWGEVGEQRLMGPELVQSTNEAIQKIRSR HTAQSRQKSYADVRRKDLEFEV DKVFLKVAPMRGV+RF
Subjt:  LMEFAYNNNYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRTHTAQSRQKSYADVRRKDLEFEVEDKVFLKVAPMRGVLRF

Query:  ERRGKLSPRFVGPFEILEPIGPVAYRLALPTSLSTVHDVFHVSMLRK
        ERRGKLSPRFVGPFEILE IGPVAYRLALP SLSTVHDVFHVSMLRK
Subjt:  ERRGKLSPRFVGPFEILEPIGPVAYRLALPTSLSTVHDVFHVSMLRK

TrEMBL top hitse value%identityAlignment
A0A5A7SPZ2 Pol protein0.0e+0091.62Show/hide
Query:  MTPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKEGDVPK
        M PAELKELKVQLQ+LLDKGFIR SVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK+GDVPK
Subjt:  MTPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKEGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMGLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNAPAVFM LMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMGLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAITGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF-----------
        VSVDPAKIEA+TGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF           
Subjt:  VSVDPAKIEAITGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF-----------

Query:  -------GKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANV----
               GKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANV    
Subjt:  -------GKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANV----

Query:  ---------------APLHRDLERAEIAVSVGAVTM--AQLTVQPTLRQRIIDAQSNYPYLVEQRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKT
                       APLHRDLERAEIAVSVGAVTM  AQLTVQPTLRQRIIDAQ N PYLVE+R LAEAGQAVEFS+SSDGGLLFER LCVPSDSA KT
Subjt:  ---------------APLHRDLERAEIAVSVGAVTM--AQLTVQPTLRQRIIDAQSNYPYLVEQRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKT

Query:  ELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSKCLLCQQVKAPRQKSAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFIVIWVVVDRLT
        ELLSEAHSSPFSMHPGSTKMYQDLK+VYWWRNMKREVAEFVSKCL+CQQVKAPRQK  GLLQPLSIPEWKWENVSMDFITGLPRTLRGF VIWVVVDRLT
Subjt:  ELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSKCLLCQQVKAPRQKSAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFIVIWVVVDRLT

Query:  KSAHFVPGKSTYTASKWAQLYMSEIVTLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACAFEFPGSWDSHLH
        KSAHFVPGKSTYTASKWAQLYMSEIV LH VPVSIVSD+DARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACA EFPGSWDSHLH
Subjt:  KSAHFVPGKSTYTASKWAQLYMSEIVTLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACAFEFPGSWDSHLH

Query:  LMEFAYNNNYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRTHTAQSRQKSYADVRRKDLEFEVEDKVFLKVAPMRGVLRF
        LMEFAYNN+YQATIGM PFEALYGKCCRSPVCWGEVGEQ LMGPELVQSTNEAIQKIRSR HTAQSRQKSYADVRRKDLEF + DKVFLKVAPMRGVLRF
Subjt:  LMEFAYNNNYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRTHTAQSRQKSYADVRRKDLEFEVEDKVFLKVAPMRGVLRF

Query:  ERRGKLSPRFVGPFEILEPIGPVAYRLALPTSLSTVHDVFHVSMLRK
        ERRGKLSPRFV PFEILE IGPVAYRLALP SLSTVHDVFHVSMLRK
Subjt:  ERRGKLSPRFVGPFEILEPIGPVAYRLALPTSLSTVHDVFHVSMLRK

A0A5A7SUN9 Ty3-gypsy retrotransposon protein0.0e+0098.61Show/hide
Query:  GFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKEGDVPKTAFRSRYGHYEFIVMSFGL
        GFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKEGDVPKTAFRSRYGHYEFIVMSFGL
Subjt:  GFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKEGDVPKTAFRSRYGHYEFIVMSFGL

Query:  TNAPAVFMGLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAITGWTRPST
        TNAPAVFMGLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAITGWTRPST
Subjt:  TNAPAVFMGLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAITGWTRPST

Query:  VSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFGKVVAYASRQLKSHEQNYPTHDLELAAVVF
        VSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAP           GKVVAYASRQLKSHEQNYPTHDLELAAVVF
Subjt:  VSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFGKVVAYASRQLKSHEQNYPTHDLELAAVVF

Query:  ALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVAPLHRDLERAEIAVSVGAVTMAQLTVQPTLRQRIIDAQSNY
        ALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVAPLHRDLERAEIAVSVGAVTMAQLTVQPTLRQRIIDAQSNY
Subjt:  ALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVAPLHRDLERAEIAVSVGAVTMAQLTVQPTLRQRIIDAQSNY

Query:  PYLVEQRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSKCLLCQQVKAPRQKSA
        PYLVEQRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSKCLLCQQVKAPRQKSA
Subjt:  PYLVEQRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSKCLLCQQVKAPRQKSA

Query:  GLLQPLSIPEWKWENVSMDFITGLPRTLRGFIVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVTLHGVPVSIVSDRDARFTSKFWKGLQTAMGTR
        GLLQPLSIPEWKWENVSMDFITGLPRTLRGFIVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVTLHGVPVSIVSDRDARFTSKFWKGLQTAMGTR
Subjt:  GLLQPLSIPEWKWENVSMDFITGLPRTLRGFIVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVTLHGVPVSIVSDRDARFTSKFWKGLQTAMGTR

Query:  LDFSTAFHPQTDGQTERLNQVLEDMLRACAFEFPGSWDSHLHLMEFAYNNNYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIR
        LDFSTAFHPQTDGQTERLNQVLEDMLRACAFEFPGSWDSHLHLMEFAYNNNYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIR
Subjt:  LDFSTAFHPQTDGQTERLNQVLEDMLRACAFEFPGSWDSHLHLMEFAYNNNYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIR

Query:  SRTHTAQSRQKSYADVRRKDLEFEVEDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILEPIGPVAYRLALPTSLSTVHDVFHVSMLRK
        SRTHTAQSRQKSYADVRRKDLEFEVEDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILEPIGPVAYRLALPTSLSTVHDVFHVSMLRK
Subjt:  SRTHTAQSRQKSYADVRRKDLEFEVEDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILEPIGPVAYRLALPTSLSTVHDVFHVSMLRK

A0A5A7U330 Reverse transcriptase0.0e+0092.09Show/hide
Query:  MTPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKEGDVPK
        M PAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK+ DVPK
Subjt:  MTPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKEGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMGLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNAPAVFM LMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMGLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAITGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF-----------
        VSVDPAKIEA+TGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF           
Subjt:  VSVDPAKIEAITGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF-----------

Query:  -------GKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANV----
               GKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANV    
Subjt:  -------GKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANV----

Query:  ---------------APLHRDLERAEIAVSVGAVTM--AQLTVQPTLRQRIIDAQSNYPYLVEQRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKT
                       APLHRDLERAEIAVSVGAVTM  AQLTVQPTLRQRIIDAQSN PYLVE+RGLAEAGQAVEFS+SSDGGLLFERRLCVPSDS VKT
Subjt:  ---------------APLHRDLERAEIAVSVGAVTM--AQLTVQPTLRQRIIDAQSNYPYLVEQRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKT

Query:  ELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSKCLLCQQVKAPRQKSAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFIVIWVVVDRLT
        ELLSEAHSSPFSMHPGSTKMY+D+K+VYWWRNMKREVAEFVS+CL+CQQVKAPRQK AGLLQPLSIPEWKWENVSMDFITGLPRTLRGF VIWVVVDRLT
Subjt:  ELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSKCLLCQQVKAPRQKSAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFIVIWVVVDRLT

Query:  KSAHFVPGKSTYTASKWAQLYMSEIVTLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACAFEFPGSWDSHLH
        KSAHFVPGKSTYTASKWAQLYMSEIV LHGVPVSIVSDRDARFTSKFWK LQTAMGTRLDFSTAFHPQTDGQTERLNQVLE MLRACA EFPGSWDSHLH
Subjt:  KSAHFVPGKSTYTASKWAQLYMSEIVTLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACAFEFPGSWDSHLH

Query:  LMEFAYNNNYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRTHTAQSRQKSYADVRRKDLEFEVEDKVFLKVAPMRGVLRF
        LMEF YNN+YQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSR HTAQSRQKSYADVRRKDLEFEV DKVFLKVAPMRGVLRF
Subjt:  LMEFAYNNNYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRTHTAQSRQKSYADVRRKDLEFEVEDKVFLKVAPMRGVLRF

Query:  ERRGKLSPRFVGPFEILEPIGPVAYRLALPTSLSTVHDVFHVSMLRK
        ERRGKLSPRF+GPFEILE IGPVAYRLALP SLSTVHDVFHVSMLRK
Subjt:  ERRGKLSPRFVGPFEILEPIGPVAYRLALPTSLSTVHDVFHVSMLRK

A0A5A7UAA8 Reverse transcriptase0.0e+0091.85Show/hide
Query:  MTPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKEGDVPK
        M PAELKELKVQLQELLDKGFIRP+VSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGY+QLRIK+ DVPK
Subjt:  MTPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKEGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMGLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNAPAVFM LMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKA 
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMGLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAITGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF-----------
        VSVDPAKIEA+TGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF           
Subjt:  VSVDPAKIEAITGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF-----------

Query:  -------GKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANV----
               GKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANV    
Subjt:  -------GKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANV----

Query:  ---------------APLHRDLERAEIAVSVGAVTM--AQLTVQPTLRQRIIDAQSNYPYLVEQRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKT
                       APLHRDLERAEIAVSVGAVTM  AQLTVQPTLRQRIIDAQSN PYLVE+RGLAEAGQAVEFS+SSDGGL FE RLCVPSDSAVKT
Subjt:  ---------------APLHRDLERAEIAVSVGAVTM--AQLTVQPTLRQRIIDAQSNYPYLVEQRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKT

Query:  ELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSKCLLCQQVKAPRQKSAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFIVIWVVVDRLT
        ELL EAHSSPFSMHPGSTKMYQDLK+VYWWRNMKREVAEFVSKCL+CQQVK PRQK AGLLQPLSIPEWKWENVSMDFITGLPRTLRGF VIWVVVDRLT
Subjt:  ELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSKCLLCQQVKAPRQKSAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFIVIWVVVDRLT

Query:  KSAHFVPGKSTYTASKWAQLYMSEIVTLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACAFEFPGSWDSHLH
        KSAHFVPGKSTYTASKWAQLYMSEIV LHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQ DGQTERLNQVLEDMLRACA EFPGSWDSHLH
Subjt:  KSAHFVPGKSTYTASKWAQLYMSEIVTLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACAFEFPGSWDSHLH

Query:  LMEFAYNNNYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRTHTAQSRQKSYADVRRKDLEFEVEDKVFLKVAPMRGVLRF
        LMEFAYNN+YQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSR HTAQSRQKSYADVRRKDLEFE+ DKVFLKVAPM+GVLRF
Subjt:  LMEFAYNNNYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRTHTAQSRQKSYADVRRKDLEFEVEDKVFLKVAPMRGVLRF

Query:  ERRGKLSPRFVGPFEILEPIGPVAYRLALPTSLSTVHDVFHVSMLRK
        ERRGKLSPRFVGPFEILE IGPVAYRLALP SLSTVHDVFHVSMLRK
Subjt:  ERRGKLSPRFVGPFEILEPIGPVAYRLALPTSLSTVHDVFHVSMLRK

A0A5A7UP94 Pol protein0.0e+0091.85Show/hide
Query:  MTPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKEGDVPK
        M PAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSG HQLRIK+ DVPK
Subjt:  MTPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKEGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMGLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNAPAVFM LMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMGLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAITGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF-----------
        VSVDPAKIEA+TGWTRPSTVSEVRSFLGLAGYYRRFVENFSR ATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF           
Subjt:  VSVDPAKIEAITGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSF-----------

Query:  -------GKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANV----
               GKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANV    
Subjt:  -------GKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANV----

Query:  ---------------APLHRDLERAEIAVSVGAVTM--AQLTVQPTLRQRIIDAQSNYPYLVEQRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKT
                       APLHRDLERAEIAVSVGAVTM  AQLTVQPTLRQRIIDAQSN PYLVE+RGLAEAGQAVEFS+SSDGGLLFERRLCVPSDSAVKT
Subjt:  ---------------APLHRDLERAEIAVSVGAVTM--AQLTVQPTLRQRIIDAQSNYPYLVEQRGLAEAGQAVEFSISSDGGLLFERRLCVPSDSAVKT

Query:  ELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSKCLLCQQVKAPRQKSAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFIVIWVVVDRLT
        ELL+EAHSSPFSMHPGSTKMYQDLK++YWWRNMKREVAEFVSKCL+CQQVKAPRQK AGLLQPLSIPEWKWENVSMDFI GLPRTLRGF VIWVVVDRLT
Subjt:  ELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSKCLLCQQVKAPRQKSAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFIVIWVVVDRLT

Query:  KSAHFVPGKSTYTASKWAQLYMSEIVTLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACAFEFPGSWDSHLH
        KSAHFVPGKSTYT SKWAQLYMSEIV LHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLN+VLEDMLRACA EFPGSWDSHLH
Subjt:  KSAHFVPGKSTYTASKWAQLYMSEIVTLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACAFEFPGSWDSHLH

Query:  LMEFAYNNNYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRTHTAQSRQKSYADVRRKDLEFEVEDKVFLKVAPMRGVLRF
        LMEFAYNN+YQATIGMAPFEALY KCCRSP+CWGEVGEQRLMGPELVQSTNEAIQKIRSR HTAQSRQKSYADVRRKDLEFEV DKVFLKVAPMRGV+RF
Subjt:  LMEFAYNNNYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRTHTAQSRQKSYADVRRKDLEFEVEDKVFLKVAPMRGVLRF

Query:  ERRGKLSPRFVGPFEILEPIGPVAYRLALPTSLSTVHDVFHVSMLRK
        ERRGKLSPRFVGPFEILE IGPVAYRLALP SLSTVHDVFHVSMLRK
Subjt:  ERRGKLSPRFVGPFEILEPIGPVAYRLALPTSLSTVHDVFHVSMLRK

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein2.4e-12932.39Show/hide
Query:  MTPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKEGDVPK
        + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+T+F+K+DL+S YH +R+++GD  K
Subjt:  MTPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKEGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMGLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
         AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ L++  L    +KCEF   QV F+G+ +S+ G
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMGLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAITGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSG-----------SF
         +     I+ +  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + +N+KQ LV+ PVL   D S            + 
Subjt:  VSVDPAKIEAITGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSG-----------SF

Query:  GKV------------VAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHP
        G V            V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y P
Subjt:  GKV------------VAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHP

Query:  GKAN------------VAPLHRDLERAEIAVSVGAVTMAQLTVQPTLRQRIIDAQSNYPYLVEQRGLAEAGQAVEFSISSDGGLLFERR--LCVPSDSAV
        G AN              P+ +D E   I        + Q+++    + +++   +N   L+    L    + VE +I    GLL   +  + +P+D+ +
Subjt:  GKAN------------VAPLHRDLERAEIAVSVGAVTMAQLTVQPTLRQRIIDAQSNYPYLVEQRGLAEAGQAVEFSISSDGGLLFERR--LCVPSDSAV

Query:  KTELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSKCLLCQQVKAPRQKSAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFIVIWVVVDR
           ++ + H     +HPG   +   + + + W+ +++++ E+V  C  CQ  K+   K  G LQP+   E  WE++SMDFIT LP +  G+  ++VVVDR
Subjt:  KTELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSKCLLCQQVKAPRQKSAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFIVIWVVVDR

Query:  LTKSAHFVPGKSTYTASKWAQLYMSEIVTLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACAFEFPGSWDSH
         +K A  VP   + TA + A+++   ++   G P  I++D D  FTS+ WK         + FS  + PQTDGQTER NQ +E +LR      P +W  H
Subjt:  LTKSAHFVPGKSTYTASKWAQLYMSEIVTLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACAFEFPGSWDSH

Query:  LHLMEFAYNNNYQATIGMAPFEALYG-KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRTHTAQSRQKSYADVRRKDL-EFEVEDKVFLKVAPMRG
        + L++ +YNN   +   M PFE ++      SP+   E+        E  Q T +  Q ++   +T   + K Y D++ +++ EF+  D V +K     G
Subjt:  LHLMEFAYNNNYQATIGMAPFEALYG-KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRTHTAQSRQKSYADVRRKDL-EFEVEDKVFLKVAPMRG

Query:  VLRFERRGKLSPRFVGPFEILEPIGPVAYRLALPTSLSTV-HDVFHVSMLRK
         L   +  KL+P F GPF +L+  GP  Y L LP S+  +    FHVS L K
Subjt:  VLRFERRGKLSPRFVGPFEILEPIGPVAYRLALPTSLSTV-HDVFHVSMLRK

P0CT35 Transposon Tf2-2 polyprotein2.4e-12932.39Show/hide
Query:  MTPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKEGDVPK
        + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+T+F+K+DL+S YH +R+++GD  K
Subjt:  MTPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKEGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMGLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
         AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ L++  L    +KCEF   QV F+G+ +S+ G
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMGLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAITGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSG-----------SF
         +     I+ +  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + +N+KQ LV+ PVL   D S            + 
Subjt:  VSVDPAKIEAITGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSG-----------SF

Query:  GKV------------VAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHP
        G V            V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y P
Subjt:  GKV------------VAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHP

Query:  GKAN------------VAPLHRDLERAEIAVSVGAVTMAQLTVQPTLRQRIIDAQSNYPYLVEQRGLAEAGQAVEFSISSDGGLLFERR--LCVPSDSAV
        G AN              P+ +D E   I        + Q+++    + +++   +N   L+    L    + VE +I    GLL   +  + +P+D+ +
Subjt:  GKAN------------VAPLHRDLERAEIAVSVGAVTMAQLTVQPTLRQRIIDAQSNYPYLVEQRGLAEAGQAVEFSISSDGGLLFERR--LCVPSDSAV

Query:  KTELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSKCLLCQQVKAPRQKSAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFIVIWVVVDR
           ++ + H     +HPG   +   + + + W+ +++++ E+V  C  CQ  K+   K  G LQP+   E  WE++SMDFIT LP +  G+  ++VVVDR
Subjt:  KTELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSKCLLCQQVKAPRQKSAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFIVIWVVVDR

Query:  LTKSAHFVPGKSTYTASKWAQLYMSEIVTLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACAFEFPGSWDSH
         +K A  VP   + TA + A+++   ++   G P  I++D D  FTS+ WK         + FS  + PQTDGQTER NQ +E +LR      P +W  H
Subjt:  LTKSAHFVPGKSTYTASKWAQLYMSEIVTLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACAFEFPGSWDSH

Query:  LHLMEFAYNNNYQATIGMAPFEALYG-KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRTHTAQSRQKSYADVRRKDL-EFEVEDKVFLKVAPMRG
        + L++ +YNN   +   M PFE ++      SP+   E+        E  Q T +  Q ++   +T   + K Y D++ +++ EF+  D V +K     G
Subjt:  LHLMEFAYNNNYQATIGMAPFEALYG-KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRTHTAQSRQKSYADVRRKDL-EFEVEDKVFLKVAPMRG

Query:  VLRFERRGKLSPRFVGPFEILEPIGPVAYRLALPTSLSTV-HDVFHVSMLRK
         L   +  KL+P F GPF +L+  GP  Y L LP S+  +    FHVS L K
Subjt:  VLRFERRGKLSPRFVGPFEILEPIGPVAYRLALPTSLSTV-HDVFHVSMLRK

P0CT36 Transposon Tf2-3 polyprotein2.4e-12932.39Show/hide
Query:  MTPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKEGDVPK
        + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+T+F+K+DL+S YH +R+++GD  K
Subjt:  MTPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKEGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMGLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
         AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ L++  L    +KCEF   QV F+G+ +S+ G
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMGLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAITGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSG-----------SF
         +     I+ +  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + +N+KQ LV+ PVL   D S            + 
Subjt:  VSVDPAKIEAITGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSG-----------SF

Query:  GKV------------VAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHP
        G V            V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y P
Subjt:  GKV------------VAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHP

Query:  GKAN------------VAPLHRDLERAEIAVSVGAVTMAQLTVQPTLRQRIIDAQSNYPYLVEQRGLAEAGQAVEFSISSDGGLLFERR--LCVPSDSAV
        G AN              P+ +D E   I        + Q+++    + +++   +N   L+    L    + VE +I    GLL   +  + +P+D+ +
Subjt:  GKAN------------VAPLHRDLERAEIAVSVGAVTMAQLTVQPTLRQRIIDAQSNYPYLVEQRGLAEAGQAVEFSISSDGGLLFERR--LCVPSDSAV

Query:  KTELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSKCLLCQQVKAPRQKSAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFIVIWVVVDR
           ++ + H     +HPG   +   + + + W+ +++++ E+V  C  CQ  K+   K  G LQP+   E  WE++SMDFIT LP +  G+  ++VVVDR
Subjt:  KTELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSKCLLCQQVKAPRQKSAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFIVIWVVVDR

Query:  LTKSAHFVPGKSTYTASKWAQLYMSEIVTLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACAFEFPGSWDSH
         +K A  VP   + TA + A+++   ++   G P  I++D D  FTS+ WK         + FS  + PQTDGQTER NQ +E +LR      P +W  H
Subjt:  LTKSAHFVPGKSTYTASKWAQLYMSEIVTLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACAFEFPGSWDSH

Query:  LHLMEFAYNNNYQATIGMAPFEALYG-KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRTHTAQSRQKSYADVRRKDL-EFEVEDKVFLKVAPMRG
        + L++ +YNN   +   M PFE ++      SP+   E+        E  Q T +  Q ++   +T   + K Y D++ +++ EF+  D V +K     G
Subjt:  LHLMEFAYNNNYQATIGMAPFEALYG-KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRTHTAQSRQKSYADVRRKDL-EFEVEDKVFLKVAPMRG

Query:  VLRFERRGKLSPRFVGPFEILEPIGPVAYRLALPTSLSTV-HDVFHVSMLRK
         L   +  KL+P F GPF +L+  GP  Y L LP S+  +    FHVS L K
Subjt:  VLRFERRGKLSPRFVGPFEILEPIGPVAYRLALPTSLSTV-HDVFHVSMLRK

P0CT37 Transposon Tf2-4 polyprotein2.4e-12932.39Show/hide
Query:  MTPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKEGDVPK
        + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+T+F+K+DL+S YH +R+++GD  K
Subjt:  MTPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKEGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMGLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
         AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ L++  L    +KCEF   QV F+G+ +S+ G
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMGLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAITGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSG-----------SF
         +     I+ +  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + +N+KQ LV+ PVL   D S            + 
Subjt:  VSVDPAKIEAITGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSG-----------SF

Query:  GKV------------VAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHP
        G V            V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y P
Subjt:  GKV------------VAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHP

Query:  GKAN------------VAPLHRDLERAEIAVSVGAVTMAQLTVQPTLRQRIIDAQSNYPYLVEQRGLAEAGQAVEFSISSDGGLLFERR--LCVPSDSAV
        G AN              P+ +D E   I        + Q+++    + +++   +N   L+    L    + VE +I    GLL   +  + +P+D+ +
Subjt:  GKAN------------VAPLHRDLERAEIAVSVGAVTMAQLTVQPTLRQRIIDAQSNYPYLVEQRGLAEAGQAVEFSISSDGGLLFERR--LCVPSDSAV

Query:  KTELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSKCLLCQQVKAPRQKSAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFIVIWVVVDR
           ++ + H     +HPG   +   + + + W+ +++++ E+V  C  CQ  K+   K  G LQP+   E  WE++SMDFIT LP +  G+  ++VVVDR
Subjt:  KTELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSKCLLCQQVKAPRQKSAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFIVIWVVVDR

Query:  LTKSAHFVPGKSTYTASKWAQLYMSEIVTLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACAFEFPGSWDSH
         +K A  VP   + TA + A+++   ++   G P  I++D D  FTS+ WK         + FS  + PQTDGQTER NQ +E +LR      P +W  H
Subjt:  LTKSAHFVPGKSTYTASKWAQLYMSEIVTLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACAFEFPGSWDSH

Query:  LHLMEFAYNNNYQATIGMAPFEALYG-KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRTHTAQSRQKSYADVRRKDL-EFEVEDKVFLKVAPMRG
        + L++ +YNN   +   M PFE ++      SP+   E+        E  Q T +  Q ++   +T   + K Y D++ +++ EF+  D V +K     G
Subjt:  LHLMEFAYNNNYQATIGMAPFEALYG-KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRTHTAQSRQKSYADVRRKDL-EFEVEDKVFLKVAPMRG

Query:  VLRFERRGKLSPRFVGPFEILEPIGPVAYRLALPTSLSTV-HDVFHVSMLRK
         L   +  KL+P F GPF +L+  GP  Y L LP S+  +    FHVS L K
Subjt:  VLRFERRGKLSPRFVGPFEILEPIGPVAYRLALPTSLSTV-HDVFHVSMLRK

P0CT41 Transposon Tf2-12 polyprotein2.4e-12932.39Show/hide
Query:  MTPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKEGDVPK
        + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPLP I+ L  ++QG+T+F+K+DL+S YH +R+++GD  K
Subjt:  MTPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKEGDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMGLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG
         AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ L++  L    +KCEF   QV F+G+ +S+ G
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMGLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPAKIEAITGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSG-----------SF
         +     I+ +  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + +N+KQ LV+ PVL   D S            + 
Subjt:  VSVDPAKIEAITGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSG-----------SF

Query:  GKV------------VAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHP
        G V            V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y P
Subjt:  GKV------------VAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHP

Query:  GKAN------------VAPLHRDLERAEIAVSVGAVTMAQLTVQPTLRQRIIDAQSNYPYLVEQRGLAEAGQAVEFSISSDGGLLFERR--LCVPSDSAV
        G AN              P+ +D E   I        + Q+++    + +++   +N   L+    L    + VE +I    GLL   +  + +P+D+ +
Subjt:  GKAN------------VAPLHRDLERAEIAVSVGAVTMAQLTVQPTLRQRIIDAQSNYPYLVEQRGLAEAGQAVEFSISSDGGLLFERR--LCVPSDSAV

Query:  KTELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSKCLLCQQVKAPRQKSAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFIVIWVVVDR
           ++ + H     +HPG   +   + + + W+ +++++ E+V  C  CQ  K+   K  G LQP+   E  WE++SMDFIT LP +  G+  ++VVVDR
Subjt:  KTELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSKCLLCQQVKAPRQKSAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFIVIWVVVDR

Query:  LTKSAHFVPGKSTYTASKWAQLYMSEIVTLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACAFEFPGSWDSH
         +K A  VP   + TA + A+++   ++   G P  I++D D  FTS+ WK         + FS  + PQTDGQTER NQ +E +LR      P +W  H
Subjt:  LTKSAHFVPGKSTYTASKWAQLYMSEIVTLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACAFEFPGSWDSH

Query:  LHLMEFAYNNNYQATIGMAPFEALYG-KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRTHTAQSRQKSYADVRRKDL-EFEVEDKVFLKVAPMRG
        + L++ +YNN   +   M PFE ++      SP+   E+        E  Q T +  Q ++   +T   + K Y D++ +++ EF+  D V +K     G
Subjt:  LHLMEFAYNNNYQATIGMAPFEALYG-KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRTHTAQSRQKSYADVRRKDL-EFEVEDKVFLKVAPMRG

Query:  VLRFERRGKLSPRFVGPFEILEPIGPVAYRLALPTSLSTV-HDVFHVSMLRK
         L   +  KL+P F GPF +L+  GP  Y L LP S+  +    FHVS L K
Subjt:  VLRFERRGKLSPRFVGPFEILEPIGPVAYRLALPTSLSTV-HDVFHVSMLRK

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein6.0e-2745.6Show/hide
Query:  HLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLG--HVVSKAGVSVDPAKIEAITGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVW
        HL +VLQ    ++ YA   KC F   Q+++LG  H++S  GVS DPAK+EA+ GW  P   +E+R FLGL GYYRRFV+N+ +I  PLT+L +K +   W
Subjt:  HLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLG--HVVSKAGVSVDPAKIEAITGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVW

Query:  SKACEDSFQNLKQKLVTAPVLTVPD
        ++    +F+ LK  + T PVL +PD
Subjt:  SKACEDSFQNLKQKLVTAPVLTVPD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCCCCGCAGAGCTGAAAGAACTGAAGGTGCAGTTACAAGAATTGCTTGATAAGGGATTCATTCGACCGAGCGTGTCACCTTGGGGTGCGCCAGTTTTGTTCGTTAA
GAAGAAGGATGGATCGATGCGTCTATGCATTGACTATAGGGAGTTGAACAAGGTAACCGTAAAGAACAGATATCCCTTGCCCAGGATCGACGATCTATTTGACCAGTTAC
AGGGAGCCACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATACCATCAGCTAAGGATTAAGGAAGGTGATGTACCGAAGACAGCATTTCGTTCCAGGTATGGACACTAC
GAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGGCAGTGTTTATGGGCTTGATGAACAGAGTGTTTAGGGAGTTCCTAGATACTTTTGTGATCGTGTTTATTGA
CGATATCTTGATATACTCCAAGACGGAGGCCGAACATGAGGAGCATTTACGTATAGTTTTGCAAACACTTCGGGATAATAAGTTGTATGCAAAGTTCTCGAAATGCGAGT
TTTGGCTGAAGCAGGTGTCCTTTCTGGGCCACGTGGTTTCTAAGGCTGGAGTCTCTGTGGATCCAGCTAAGATAGAGGCAATCACCGGTTGGACCCGACCTTCCACAGTC
AGTGAGGTTCGTAGCTTTCTGGGTTTAGCAGGTTATTATCGACGGTTTGTGGAGAACTTTTCTCGTATAGCTACTCCTCTTACTCAGTTGACCAGGAAGGGAGCTCCTTT
TGTTTGGAGCAAGGCATGTGAGGACAGTTTTCAGAACCTTAAACAGAAGCTAGTTACCGCACCGGTTCTTACTGTACCTGATGGTTCTGGCAGTTTTGGTAAGGTGGTCG
CTTATGCTTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACACATGATTTAGAGTTGGCAGCAGTGGTTTTTGCTTTGAAAATATGGAGGCATTACTTATATGGT
GAAAAGATACAGATCTTCACGGATCATAAGAGCTTGAAATACTTCTTTACTCAGAAGGAATTGAATATGAGACAGCGAAGATGGCTTGAGTTAGTCAAGGATTACGATTG
TGAGATACTGTATCATCCAGGCAAGGCAAATGTGGCCCCATTGCATCGGGATCTCGAGCGGGCTGAGATTGCAGTGTCAGTGGGGGCAGTCACTATGGCCCAGTTGACGG
TACAGCCGACTTTGAGACAAAGGATCATTGATGCTCAGAGTAACTATCCTTATTTGGTTGAGCAACGTGGCCTAGCAGAGGCAGGGCAAGCGGTTGAGTTCTCCATATCC
TCTGATGGTGGACTTTTGTTTGAGAGGCGCCTCTGTGTGCCGTCAGATAGTGCGGTTAAGACAGAATTATTATCTGAGGCTCACAGTTCCCCATTTTCCATGCACCCAGG
TAGTACGAAGATGTATCAGGACCTGAAGCAGGTTTATTGGTGGCGTAACATGAAGAGGGAGGTAGCAGAATTTGTTAGTAAATGCTTGTTGTGTCAGCAGGTCAAGGCAC
CAAGGCAGAAATCAGCGGGTTTATTACAACCCTTGAGCATACCGGAATGGAAGTGGGAAAATGTGTCCATGGATTTCATTACAGGACTGCCGAGAACTCTGAGGGGTTTT
ATAGTGATTTGGGTTGTGGTGGACAGGCTTACCAAATCAGCGCACTTCGTTCCGGGTAAATCCACCTATACTGCTAGTAAGTGGGCACAGCTGTACATGTCGGAGATAGT
GACATTACATGGAGTGCCAGTGTCGATTGTTTCTGATAGAGATGCCCGTTTCACTTCCAAATTCTGGAAGGGTTTGCAGACTGCTATGGGCACGAGGTTAGACTTTAGTA
CAGCTTTCCATCCACAGACTGACGGTCAGACTGAGCGTCTGAACCAAGTTTTAGAGGATATGTTGCGAGCGTGTGCATTTGAATTTCCAGGTAGCTGGGACTCCCACTTG
CATTTGATGGAATTTGCTTATAACAACAATTATCAGGCTACTATTGGCATGGCGCCATTTGAGGCCTTGTACGGCAAATGTTGTAGATCCCCGGTTTGTTGGGGTGAGGT
GGGTGAGCAGAGATTGATGGGTCCTGAGTTAGTTCAGTCTACTAACGAAGCGATACAGAAGATTAGATCACGCACGCATACCGCTCAGAGTAGGCAGAAGAGTTATGCAG
ATGTGAGGCGGAAGGATCTTGAGTTTGAGGTAGAGGACAAGGTGTTCTTAAAGGTAGCACCTATGAGAGGTGTCTTACGATTTGAAAGGAGGGGAAAGCTGAGTCCCCGT
TTTGTTGGGCCGTTTGAGATTCTGGAGCCGATTGGCCCTGTAGCTTATCGCTTGGCGTTGCCAACATCACTCTCGACAGTTCATGATGTGTTTCATGTCTCTATGTTGAG
GAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGACCCCCGCAGAGCTGAAAGAACTGAAGGTGCAGTTACAAGAATTGCTTGATAAGGGATTCATTCGACCGAGCGTGTCACCTTGGGGTGCGCCAGTTTTGTTCGTTAA
GAAGAAGGATGGATCGATGCGTCTATGCATTGACTATAGGGAGTTGAACAAGGTAACCGTAAAGAACAGATATCCCTTGCCCAGGATCGACGATCTATTTGACCAGTTAC
AGGGAGCCACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATACCATCAGCTAAGGATTAAGGAAGGTGATGTACCGAAGACAGCATTTCGTTCCAGGTATGGACACTAC
GAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGGCAGTGTTTATGGGCTTGATGAACAGAGTGTTTAGGGAGTTCCTAGATACTTTTGTGATCGTGTTTATTGA
CGATATCTTGATATACTCCAAGACGGAGGCCGAACATGAGGAGCATTTACGTATAGTTTTGCAAACACTTCGGGATAATAAGTTGTATGCAAAGTTCTCGAAATGCGAGT
TTTGGCTGAAGCAGGTGTCCTTTCTGGGCCACGTGGTTTCTAAGGCTGGAGTCTCTGTGGATCCAGCTAAGATAGAGGCAATCACCGGTTGGACCCGACCTTCCACAGTC
AGTGAGGTTCGTAGCTTTCTGGGTTTAGCAGGTTATTATCGACGGTTTGTGGAGAACTTTTCTCGTATAGCTACTCCTCTTACTCAGTTGACCAGGAAGGGAGCTCCTTT
TGTTTGGAGCAAGGCATGTGAGGACAGTTTTCAGAACCTTAAACAGAAGCTAGTTACCGCACCGGTTCTTACTGTACCTGATGGTTCTGGCAGTTTTGGTAAGGTGGTCG
CTTATGCTTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACACATGATTTAGAGTTGGCAGCAGTGGTTTTTGCTTTGAAAATATGGAGGCATTACTTATATGGT
GAAAAGATACAGATCTTCACGGATCATAAGAGCTTGAAATACTTCTTTACTCAGAAGGAATTGAATATGAGACAGCGAAGATGGCTTGAGTTAGTCAAGGATTACGATTG
TGAGATACTGTATCATCCAGGCAAGGCAAATGTGGCCCCATTGCATCGGGATCTCGAGCGGGCTGAGATTGCAGTGTCAGTGGGGGCAGTCACTATGGCCCAGTTGACGG
TACAGCCGACTTTGAGACAAAGGATCATTGATGCTCAGAGTAACTATCCTTATTTGGTTGAGCAACGTGGCCTAGCAGAGGCAGGGCAAGCGGTTGAGTTCTCCATATCC
TCTGATGGTGGACTTTTGTTTGAGAGGCGCCTCTGTGTGCCGTCAGATAGTGCGGTTAAGACAGAATTATTATCTGAGGCTCACAGTTCCCCATTTTCCATGCACCCAGG
TAGTACGAAGATGTATCAGGACCTGAAGCAGGTTTATTGGTGGCGTAACATGAAGAGGGAGGTAGCAGAATTTGTTAGTAAATGCTTGTTGTGTCAGCAGGTCAAGGCAC
CAAGGCAGAAATCAGCGGGTTTATTACAACCCTTGAGCATACCGGAATGGAAGTGGGAAAATGTGTCCATGGATTTCATTACAGGACTGCCGAGAACTCTGAGGGGTTTT
ATAGTGATTTGGGTTGTGGTGGACAGGCTTACCAAATCAGCGCACTTCGTTCCGGGTAAATCCACCTATACTGCTAGTAAGTGGGCACAGCTGTACATGTCGGAGATAGT
GACATTACATGGAGTGCCAGTGTCGATTGTTTCTGATAGAGATGCCCGTTTCACTTCCAAATTCTGGAAGGGTTTGCAGACTGCTATGGGCACGAGGTTAGACTTTAGTA
CAGCTTTCCATCCACAGACTGACGGTCAGACTGAGCGTCTGAACCAAGTTTTAGAGGATATGTTGCGAGCGTGTGCATTTGAATTTCCAGGTAGCTGGGACTCCCACTTG
CATTTGATGGAATTTGCTTATAACAACAATTATCAGGCTACTATTGGCATGGCGCCATTTGAGGCCTTGTACGGCAAATGTTGTAGATCCCCGGTTTGTTGGGGTGAGGT
GGGTGAGCAGAGATTGATGGGTCCTGAGTTAGTTCAGTCTACTAACGAAGCGATACAGAAGATTAGATCACGCACGCATACCGCTCAGAGTAGGCAGAAGAGTTATGCAG
ATGTGAGGCGGAAGGATCTTGAGTTTGAGGTAGAGGACAAGGTGTTCTTAAAGGTAGCACCTATGAGAGGTGTCTTACGATTTGAAAGGAGGGGAAAGCTGAGTCCCCGT
TTTGTTGGGCCGTTTGAGATTCTGGAGCCGATTGGCCCTGTAGCTTATCGCTTGGCGTTGCCAACATCACTCTCGACAGTTCATGATGTGTTTCATGTCTCTATGTTGAG
GAAGTGA
Protein sequenceShow/hide protein sequence
MTPAELKELKVQLQELLDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKEGDVPKTAFRSRYGHY
EFIVMSFGLTNAPAVFMGLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRIVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAITGWTRPSTV
SEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG
EKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVAPLHRDLERAEIAVSVGAVTMAQLTVQPTLRQRIIDAQSNYPYLVEQRGLAEAGQAVEFSIS
SDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKQVYWWRNMKREVAEFVSKCLLCQQVKAPRQKSAGLLQPLSIPEWKWENVSMDFITGLPRTLRGF
IVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVTLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACAFEFPGSWDSHL
HLMEFAYNNNYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRTHTAQSRQKSYADVRRKDLEFEVEDKVFLKVAPMRGVLRFERRGKLSPR
FVGPFEILEPIGPVAYRLALPTSLSTVHDVFHVSMLRK