; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0010808 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0010808
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionReverse transcriptase
Genome locationchr08:18987853..19011570
RNA-Seq ExpressionPay0010808
SyntenyPay0010808
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
InterPro domainsIPR000953 - Chromo/chromo shadow domain
IPR043502 - DNA/RNA polymerase superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR041588 - Integrase zinc-binding domain
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR036397 - Ribonuclease H superfamily
IPR023780 - Chromo domain
IPR016197 - Chromo-like domain superfamily
IPR012337 - Ribonuclease H-like superfamily
IPR001584 - Integrase, catalytic core
IPR000477 - Reverse transcriptase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026271.1 pol protein [Cucumis melo var. makuwa]0.0e+0072.79Show/hide
Query:  GATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNR
        GATVFSKIDLRSGYHQLRIKD DVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTLRDN+
Subjt:  GATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNR

Query:  LYAKFPKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL
        LYAKF KCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQ LKQKL
Subjt:  LYAKFPKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL

Query:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYVSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
        VTAPVLTVPDGSG+FVIYSDASKKGLGCVLMQQGKVVAY SRQLKSHEQNYPTHDLELAAV                                       
Subjt:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYVSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR

Query:  RWLELVKDYDLVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLER
                   VFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLER
Subjt:  RWLELVKDYDLVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLER

Query:  AKIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLTEAGQAAEFSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDL
        A+IAVSVGAVTMQLAQLTVQPTLRQRIIDAQ NDPYLVEKR L EAGQA EFSLSSDGGLLFER LCVPSDSA KTELL+EAHSSPFSMHPGSTKMYQDL
Subjt:  AKIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLTEAGQAAEFSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDL

Query:  KRVYWWRNMKREVAGFVT----------------GLLQPLSIPEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASN--------
        KRVYWWRNMKREVA FV+                GLLQPLSIPEWKWENVSM+FITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTAS         
Subjt:  KRVYWWRNMKREVAGFVT----------------GLLQPLSIPEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASN--------

Query:  --EINGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG
           ++ VPVSIVSD+DARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGM PFEALYG
Subjt:  --EINGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG

Query:  KCCRSRFA-GPNTPGALIAPYEGPNPSAFNFAKQWHEEQDISRACLEKAAQRMKKWADKKRRPKEYEIGEKVLVKLLPNQFKSLRKVHKG-LVRRYKGPF
        KCCRS    G     +L+ P            +  +E     R+ +  A  R K +AD +R+  E+ IG+KV +K+ P +   LR   +G L  R+  PF
Subjt:  KCCRSRFA-GPNTPGALIAPYEGPNPSAFNFAKQWHEEQDISRACLEKAAQRMKKWADKKRRPKEYEIGEKVLVKLLPNQFKSLRKVHKG-LVRRYKGPF

Query:  SIIERVGKAAYKVELPQKLK-IHNVFHVSMLKPFHEDQEDPNRSETSRAPTGVVTEFDRKIKEILAE--RKIRRRGVPSHSEYLILWEGLPESEASWERE
         I+ER+G  AY++ LP  L  +H+VFHVSML+ +  D       E       +   +  +  E+LA   + +R + +P      +LW      EA+WERE
Subjt:  SIIERVGKAAYKVELPQKLK-IHNVFHVSMLKPFHEDQEDPNRSETSRAPTGVVTEFDRKIKEILAE--RKIRRRGVPSHSEYLILWEGLPESEASWERE

Query:  DVLWQFQTEIEKFKE
        D +     E+  FKE
Subjt:  DVLWQFQTEIEKFKE

KAA0048687.1 pol protein [Cucumis melo var. makuwa]0.0e+0073.81Show/hide
Query:  GATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNR
        GATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDN+
Subjt:  GATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNR

Query:  LYAKFPKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL
        LYAKF KCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQ LKQKL
Subjt:  LYAKFPKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL

Query:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYVSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
        VTAPVLTVPDGSG+FVIYSDASKKGLGCVLMQQGKVVAY SRQLKSHEQNYPTHDLELAAV                                       
Subjt:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYVSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR

Query:  RWLELVKDYDLVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLER
                   VFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLER
Subjt:  RWLELVKDYDLVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLER

Query:  AKIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLTEAGQAAEFSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDL
        A+IAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGL EAGQA EFSLSSDGGLLFERRLCVPSDS VKTELL+EAHSSPFSMHPGSTKMY+D+
Subjt:  AKIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLTEAGQAAEFSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDL

Query:  KRVYWWRNMKREVAGFVT----------------GLLQPLSIPEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASN--------
        KRVYWWRNMKREVA FV+                GLLQPLSIPEWKWENVSM+FITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTAS         
Subjt:  KRVYWWRNMKREVAGFVT----------------GLLQPLSIPEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASN--------

Query:  --EINGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG
           ++GVPVSIVSDRDARFTSKFWK LQTAMGTRLDFSTAFHPQTDGQTERLNQVLE MLRACALEFPGSWDSHLHLMEF YNNSYQATIGMAPFEALYG
Subjt:  --EINGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG

Query:  KCCRSRFA-GPNTPGALIAPYEGPNPSAFNFAKQWHEEQDISRACLEKAAQRMKKWADKKRRPKEYEIGEKVLVKLLPNQFKSLRKVHKG-LVRRYKGPF
        KCCRS    G      L+ P            +  +E     R+ +  A  R K +AD +R+  E+E+G+KV +K+ P +   LR   +G L  R+ GPF
Subjt:  KCCRSRFA-GPNTPGALIAPYEGPNPSAFNFAKQWHEEQDISRACLEKAAQRMKKWADKKRRPKEYEIGEKVLVKLLPNQFKSLRKVHKG-LVRRYKGPF

Query:  SIIERVGKAAYKVELPQKLK-IHNVFHVSMLKPFHEDQEDPNRSETSRAPTGVVTEFDRKIKEILAE--RKIRRRGVPSHSEYLILWEGLPESEASWERE
         I+ER+G  AY++ LP  L  +H+VFHVSML+ +  D       +  +    +   +  +  E+LA   + +R + +P      +LW      EA+WERE
Subjt:  SIIERVGKAAYKVELPQKLK-IHNVFHVSMLKPFHEDQEDPNRSETSRAPTGVVTEFDRKIKEILAE--RKIRRRGVPSHSEYLILWEGLPESEASWERE

Query:  D
        D
Subjt:  D

KAA0051357.1 pol protein [Cucumis melo var. makuwa]0.0e+0074.03Show/hide
Query:  GATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNR
        GATVFSKIDLRSGY+QLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDN+
Subjt:  GATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNR

Query:  LYAKFPKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL
        LYAKF KCEFWLKQVSFLGHVVSKA VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQ LKQKL
Subjt:  LYAKFPKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL

Query:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYVSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
        VTAPVLTVPDGSG+FVIYSDASKKGLGCVLMQQGKVVAY SRQLKSHEQNYPTHDLELAAV                                       
Subjt:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYVSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR

Query:  RWLELVKDYDLVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLER
                   VFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLER
Subjt:  RWLELVKDYDLVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLER

Query:  AKIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLTEAGQAAEFSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDL
        A+IAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGL EAGQA EFSLSSDGGL FE RLCVPSDSAVKTELL EAHSSPFSMHPGSTKMYQDL
Subjt:  AKIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLTEAGQAAEFSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDL

Query:  KRVYWWRNMKREVAGFVT----------------GLLQPLSIPEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASN--------
        KRVYWWRNMKREVA FV+                GLLQPLSIPEWKWENVSM+FITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTAS         
Subjt:  KRVYWWRNMKREVAGFVT----------------GLLQPLSIPEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASN--------

Query:  --EINGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG
           ++GVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQ DGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG
Subjt:  --EINGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG

Query:  KCCRSRFA-GPNTPGALIAPYEGPNPSAFNFAKQWHEEQDISRACLEKAAQRMKKWADKKRRPKEYEIGEKVLVKLLPNQFKSLRKVHKG-LVRRYKGPF
        KCCRS    G      L+ P            +  +E     R+ +  A  R K +AD +R+  E+EI +KV +K+ P +   LR   +G L  R+ GPF
Subjt:  KCCRSRFA-GPNTPGALIAPYEGPNPSAFNFAKQWHEEQDISRACLEKAAQRMKKWADKKRRPKEYEIGEKVLVKLLPNQFKSLRKVHKG-LVRRYKGPF

Query:  SIIERVGKAAYKVELPQKLK-IHNVFHVSMLKPFHEDQEDPNRSETSRAPTGVVTEFDRKIKEILAE--RKIRRRGVPSHSEYLILWEGLPESEASWERE
         I+ER+G  AY++ LP  L  +H+VFHVSML+ +  D       E       +   +  +  E+LA   + +R + +P      +LW      EA+WERE
Subjt:  SIIERVGKAAYKVELPQKLK-IHNVFHVSMLKPFHEDQEDPNRSETSRAPTGVVTEFDRKIKEILAE--RKIRRRGVPSHSEYLILWEGLPESEASWERE

Query:  D
        D
Subjt:  D

KAA0057672.1 pol protein [Cucumis melo var. makuwa]0.0e+0073.3Show/hide
Query:  GATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNR
        GATVFSKIDLRSG HQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDN+
Subjt:  GATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNR

Query:  LYAKFPKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL
        LYAKF KCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSR ATPLTQLTRKGAPFVWSKACEDSFQ LKQKL
Subjt:  LYAKFPKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL

Query:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYVSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
        VTAPVLTVPDGSG+FVIYSDASKKGLGCVLMQQGKVVAY SRQLKSHEQNYPTHDLELAAV                                       
Subjt:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYVSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR

Query:  RWLELVKDYDLVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLER
                   VFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLER
Subjt:  RWLELVKDYDLVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLER

Query:  AKIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLTEAGQAAEFSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDL
        A+IAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGL EAGQA EFSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDL
Subjt:  AKIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLTEAGQAAEFSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDL

Query:  KRVYWWRNMKREVAGFVT----------------GLLQPLSIPEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASN--------
        KR+YWWRNMKREVA FV+                GLLQPLSIPEWKWENVSM+FI GLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYT S         
Subjt:  KRVYWWRNMKREVAGFVT----------------GLLQPLSIPEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASN--------

Query:  --EINGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG
           ++GVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLN+VLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY 
Subjt:  --EINGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG

Query:  KCCRSRFA-GPNTPGALIAPYEGPNPSAFNFAKQWHEEQDISRACLEKAAQRMKKWADKKRRPKEYEIGEKVLVKLLPNQFKSLRKVHKGLVRRYKGPFS
        KCCRS    G      L+ P            +  +E     R+ +  A  R K +AD +R+  E+E+G+KV +K+ P +     +    L  R+ GPF 
Subjt:  KCCRSRFA-GPNTPGALIAPYEGPNPSAFNFAKQWHEEQDISRACLEKAAQRMKKWADKKRRPKEYEIGEKVLVKLLPNQFKSLRKVHKGLVRRYKGPFS

Query:  IIERVGKAAYKVELPQKLK-IHNVFHVSMLKPFHEDQEDPNRSETSRAPTGVV----TEFDRKIK------EILAE--RKIRRRGVPSHSEYLILWEGLP
        I+ER+G  AY++ LP  L  +H+VFHVSML+ +  D            P+ VV     E D  +       E+LA   + +R + +P      +LW    
Subjt:  IIERVGKAAYKVELPQKLK-IHNVFHVSMLKPFHEDQEDPNRSETSRAPTGVV----TEFDRKIK------EILAE--RKIRRRGVPSHSEYLILWEGLP

Query:  ESEASWERED
          EA+WERED
Subjt:  ESEASWERED

KAA0065873.1 pol protein [Cucumis melo var. makuwa]0.0e+0073Show/hide
Query:  GATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNR
        GATVFSKIDLRSGYHQLRIKDED+PKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDN+
Subjt:  GATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNR

Query:  LYAKFPKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL
        LYAKF KCEFWLKQVSFLGHV+SKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKG PFVWSKACEDSFQTLKQKL
Subjt:  LYAKFPKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL

Query:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYVSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
        VTAPVLTVP+GSGNFVIYSDASKKGLGCVLMQQGKVVAY SRQLKSHEQNYPTHDLELAAV                                       
Subjt:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYVSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR

Query:  RWLELVKDYDLVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLER
                   VFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPL RDLER
Subjt:  RWLELVKDYDLVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLER

Query:  AKIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLTEAGQAAEFSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDL
        A+IAVSVG VTMQLAQLTVQPTLRQRIIDAQSND YLVEKRGL EAGQ AEFSLSSDGGLLFE RLCVPSD  +KTELL+EAHSSPFSMHPGSTKMYQDL
Subjt:  AKIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLTEAGQAAEFSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDL

Query:  KRVYWWRNMKREVAGFVT----------------GLLQPLSIPEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASN--------
        KRVYWWRNMKREVA FV+                GLLQPL IPEWKWENVSM+FITGL RTLRGFTVIWVVVDRLTKSAHFVPGKSTYTAS         
Subjt:  KRVYWWRNMKREVAGFVT----------------GLLQPLSIPEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASN--------

Query:  --EINGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG
           ++GVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG
Subjt:  --EINGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG

Query:  KCCRSRFA-GPNTPGALIAPYEGPNPSAFNFAKQWHEEQDISRACLEKAAQRMKKWADKKRRPKEYEIGEKVLVKLLPNQFKSLRKVHKGLVRRYKGPFS
        +CCRS    G      L+ P            +  +E     R+ +  A  R K +AD +R+  E+E+G+KV +K+ P +     ++   L  R+ GPF 
Subjt:  KCCRSRFA-GPNTPGALIAPYEGPNPSAFNFAKQWHEEQDISRACLEKAAQRMKKWADKKRRPKEYEIGEKVLVKLLPNQFKSLRKVHKGLVRRYKGPFS

Query:  IIERVGKAAYKVELPQKLK-IHNVFHVSMLKPFHEDQEDPNRSETSRAPTGVVTEFDRKIKEILAE--RKIRRRGVPSHSEYLILWEGLPESEASWERED
        I+ER+G  AY+++LP  L  +H+VFHVSML+ +  D       E       +   +  +  E+LA   + +R + +P      +LW      EA+WERED
Subjt:  IIERVGKAAYKVELPQKLK-IHNVFHVSMLKPFHEDQEDPNRSETSRAPTGVVTEFDRKIKEILAE--RKIRRRGVPSHSEYLILWEGLPESEASWERED

TrEMBL top hitse value%identityAlignment
A0A5A7SPZ2 Pol protein0.0e+0072.79Show/hide
Query:  GATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNR
        GATVFSKIDLRSGYHQLRIKD DVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTLRDN+
Subjt:  GATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNR

Query:  LYAKFPKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL
        LYAKF KCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQ LKQKL
Subjt:  LYAKFPKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL

Query:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYVSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
        VTAPVLTVPDGSG+FVIYSDASKKGLGCVLMQQGKVVAY SRQLKSHEQNYPTHDLELAAV                                       
Subjt:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYVSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR

Query:  RWLELVKDYDLVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLER
                   VFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLER
Subjt:  RWLELVKDYDLVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLER

Query:  AKIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLTEAGQAAEFSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDL
        A+IAVSVGAVTMQLAQLTVQPTLRQRIIDAQ NDPYLVEKR L EAGQA EFSLSSDGGLLFER LCVPSDSA KTELL+EAHSSPFSMHPGSTKMYQDL
Subjt:  AKIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLTEAGQAAEFSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDL

Query:  KRVYWWRNMKREVAGFVT----------------GLLQPLSIPEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASN--------
        KRVYWWRNMKREVA FV+                GLLQPLSIPEWKWENVSM+FITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTAS         
Subjt:  KRVYWWRNMKREVAGFVT----------------GLLQPLSIPEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASN--------

Query:  --EINGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG
           ++ VPVSIVSD+DARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGM PFEALYG
Subjt:  --EINGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG

Query:  KCCRSRFA-GPNTPGALIAPYEGPNPSAFNFAKQWHEEQDISRACLEKAAQRMKKWADKKRRPKEYEIGEKVLVKLLPNQFKSLRKVHKG-LVRRYKGPF
        KCCRS    G     +L+ P            +  +E     R+ +  A  R K +AD +R+  E+ IG+KV +K+ P +   LR   +G L  R+  PF
Subjt:  KCCRSRFA-GPNTPGALIAPYEGPNPSAFNFAKQWHEEQDISRACLEKAAQRMKKWADKKRRPKEYEIGEKVLVKLLPNQFKSLRKVHKG-LVRRYKGPF

Query:  SIIERVGKAAYKVELPQKLK-IHNVFHVSMLKPFHEDQEDPNRSETSRAPTGVVTEFDRKIKEILAE--RKIRRRGVPSHSEYLILWEGLPESEASWERE
         I+ER+G  AY++ LP  L  +H+VFHVSML+ +  D       E       +   +  +  E+LA   + +R + +P      +LW      EA+WERE
Subjt:  SIIERVGKAAYKVELPQKLK-IHNVFHVSMLKPFHEDQEDPNRSETSRAPTGVVTEFDRKIKEILAE--RKIRRRGVPSHSEYLILWEGLPESEASWERE

Query:  DVLWQFQTEIEKFKE
        D +     E+  FKE
Subjt:  DVLWQFQTEIEKFKE

A0A5A7U330 Reverse transcriptase0.0e+0073.81Show/hide
Query:  GATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNR
        GATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDN+
Subjt:  GATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNR

Query:  LYAKFPKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL
        LYAKF KCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQ LKQKL
Subjt:  LYAKFPKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL

Query:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYVSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
        VTAPVLTVPDGSG+FVIYSDASKKGLGCVLMQQGKVVAY SRQLKSHEQNYPTHDLELAAV                                       
Subjt:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYVSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR

Query:  RWLELVKDYDLVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLER
                   VFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLER
Subjt:  RWLELVKDYDLVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLER

Query:  AKIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLTEAGQAAEFSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDL
        A+IAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGL EAGQA EFSLSSDGGLLFERRLCVPSDS VKTELL+EAHSSPFSMHPGSTKMY+D+
Subjt:  AKIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLTEAGQAAEFSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDL

Query:  KRVYWWRNMKREVAGFVT----------------GLLQPLSIPEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASN--------
        KRVYWWRNMKREVA FV+                GLLQPLSIPEWKWENVSM+FITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTAS         
Subjt:  KRVYWWRNMKREVAGFVT----------------GLLQPLSIPEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASN--------

Query:  --EINGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG
           ++GVPVSIVSDRDARFTSKFWK LQTAMGTRLDFSTAFHPQTDGQTERLNQVLE MLRACALEFPGSWDSHLHLMEF YNNSYQATIGMAPFEALYG
Subjt:  --EINGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG

Query:  KCCRSRFA-GPNTPGALIAPYEGPNPSAFNFAKQWHEEQDISRACLEKAAQRMKKWADKKRRPKEYEIGEKVLVKLLPNQFKSLRKVHKG-LVRRYKGPF
        KCCRS    G      L+ P            +  +E     R+ +  A  R K +AD +R+  E+E+G+KV +K+ P +   LR   +G L  R+ GPF
Subjt:  KCCRSRFA-GPNTPGALIAPYEGPNPSAFNFAKQWHEEQDISRACLEKAAQRMKKWADKKRRPKEYEIGEKVLVKLLPNQFKSLRKVHKG-LVRRYKGPF

Query:  SIIERVGKAAYKVELPQKLK-IHNVFHVSMLKPFHEDQEDPNRSETSRAPTGVVTEFDRKIKEILAE--RKIRRRGVPSHSEYLILWEGLPESEASWERE
         I+ER+G  AY++ LP  L  +H+VFHVSML+ +  D       +  +    +   +  +  E+LA   + +R + +P      +LW      EA+WERE
Subjt:  SIIERVGKAAYKVELPQKLK-IHNVFHVSMLKPFHEDQEDPNRSETSRAPTGVVTEFDRKIKEILAE--RKIRRRGVPSHSEYLILWEGLPESEASWERE

Query:  D
        D
Subjt:  D

A0A5A7UAA8 Reverse transcriptase0.0e+0074.03Show/hide
Query:  GATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNR
        GATVFSKIDLRSGY+QLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDN+
Subjt:  GATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNR

Query:  LYAKFPKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL
        LYAKF KCEFWLKQVSFLGHVVSKA VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQ LKQKL
Subjt:  LYAKFPKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL

Query:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYVSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
        VTAPVLTVPDGSG+FVIYSDASKKGLGCVLMQQGKVVAY SRQLKSHEQNYPTHDLELAAV                                       
Subjt:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYVSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR

Query:  RWLELVKDYDLVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLER
                   VFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLER
Subjt:  RWLELVKDYDLVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLER

Query:  AKIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLTEAGQAAEFSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDL
        A+IAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGL EAGQA EFSLSSDGGL FE RLCVPSDSAVKTELL EAHSSPFSMHPGSTKMYQDL
Subjt:  AKIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLTEAGQAAEFSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDL

Query:  KRVYWWRNMKREVAGFVT----------------GLLQPLSIPEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASN--------
        KRVYWWRNMKREVA FV+                GLLQPLSIPEWKWENVSM+FITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTAS         
Subjt:  KRVYWWRNMKREVAGFVT----------------GLLQPLSIPEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASN--------

Query:  --EINGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG
           ++GVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQ DGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG
Subjt:  --EINGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG

Query:  KCCRSRFA-GPNTPGALIAPYEGPNPSAFNFAKQWHEEQDISRACLEKAAQRMKKWADKKRRPKEYEIGEKVLVKLLPNQFKSLRKVHKG-LVRRYKGPF
        KCCRS    G      L+ P            +  +E     R+ +  A  R K +AD +R+  E+EI +KV +K+ P +   LR   +G L  R+ GPF
Subjt:  KCCRSRFA-GPNTPGALIAPYEGPNPSAFNFAKQWHEEQDISRACLEKAAQRMKKWADKKRRPKEYEIGEKVLVKLLPNQFKSLRKVHKG-LVRRYKGPF

Query:  SIIERVGKAAYKVELPQKLK-IHNVFHVSMLKPFHEDQEDPNRSETSRAPTGVVTEFDRKIKEILAE--RKIRRRGVPSHSEYLILWEGLPESEASWERE
         I+ER+G  AY++ LP  L  +H+VFHVSML+ +  D       E       +   +  +  E+LA   + +R + +P      +LW      EA+WERE
Subjt:  SIIERVGKAAYKVELPQKLK-IHNVFHVSMLKPFHEDQEDPNRSETSRAPTGVVTEFDRKIKEILAE--RKIRRRGVPSHSEYLILWEGLPESEASWERE

Query:  D
        D
Subjt:  D

A0A5A7UP94 Pol protein0.0e+0073.3Show/hide
Query:  GATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNR
        GATVFSKIDLRSG HQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDN+
Subjt:  GATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNR

Query:  LYAKFPKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL
        LYAKF KCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSR ATPLTQLTRKGAPFVWSKACEDSFQ LKQKL
Subjt:  LYAKFPKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL

Query:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYVSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
        VTAPVLTVPDGSG+FVIYSDASKKGLGCVLMQQGKVVAY SRQLKSHEQNYPTHDLELAAV                                       
Subjt:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYVSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR

Query:  RWLELVKDYDLVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLER
                   VFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLER
Subjt:  RWLELVKDYDLVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLER

Query:  AKIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLTEAGQAAEFSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDL
        A+IAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGL EAGQA EFSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDL
Subjt:  AKIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLTEAGQAAEFSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDL

Query:  KRVYWWRNMKREVAGFVT----------------GLLQPLSIPEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASN--------
        KR+YWWRNMKREVA FV+                GLLQPLSIPEWKWENVSM+FI GLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYT S         
Subjt:  KRVYWWRNMKREVAGFVT----------------GLLQPLSIPEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASN--------

Query:  --EINGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG
           ++GVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLN+VLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY 
Subjt:  --EINGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG

Query:  KCCRSRFA-GPNTPGALIAPYEGPNPSAFNFAKQWHEEQDISRACLEKAAQRMKKWADKKRRPKEYEIGEKVLVKLLPNQFKSLRKVHKGLVRRYKGPFS
        KCCRS    G      L+ P            +  +E     R+ +  A  R K +AD +R+  E+E+G+KV +K+ P +     +    L  R+ GPF 
Subjt:  KCCRSRFA-GPNTPGALIAPYEGPNPSAFNFAKQWHEEQDISRACLEKAAQRMKKWADKKRRPKEYEIGEKVLVKLLPNQFKSLRKVHKGLVRRYKGPFS

Query:  IIERVGKAAYKVELPQKLK-IHNVFHVSMLKPFHEDQEDPNRSETSRAPTGVV----TEFDRKIK------EILAE--RKIRRRGVPSHSEYLILWEGLP
        I+ER+G  AY++ LP  L  +H+VFHVSML+ +  D            P+ VV     E D  +       E+LA   + +R + +P      +LW    
Subjt:  IIERVGKAAYKVELPQKLK-IHNVFHVSMLKPFHEDQEDPNRSETSRAPTGVV----TEFDRKIK------EILAE--RKIRRRGVPSHSEYLILWEGLP

Query:  ESEASWERED
          EA+WERED
Subjt:  ESEASWERED

A0A5A7VCI9 Pol protein0.0e+0073Show/hide
Query:  GATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNR
        GATVFSKIDLRSGYHQLRIKDED+PKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDN+
Subjt:  GATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNR

Query:  LYAKFPKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL
        LYAKF KCEFWLKQVSFLGHV+SKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKG PFVWSKACEDSFQTLKQKL
Subjt:  LYAKFPKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL

Query:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYVSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR
        VTAPVLTVP+GSGNFVIYSDASKKGLGCVLMQQGKVVAY SRQLKSHEQNYPTHDLELAAV                                       
Subjt:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYVSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQR

Query:  RWLELVKDYDLVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLER
                   VFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPL RDLER
Subjt:  RWLELVKDYDLVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLER

Query:  AKIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLTEAGQAAEFSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDL
        A+IAVSVG VTMQLAQLTVQPTLRQRIIDAQSND YLVEKRGL EAGQ AEFSLSSDGGLLFE RLCVPSD  +KTELL+EAHSSPFSMHPGSTKMYQDL
Subjt:  AKIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLTEAGQAAEFSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDL

Query:  KRVYWWRNMKREVAGFVT----------------GLLQPLSIPEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASN--------
        KRVYWWRNMKREVA FV+                GLLQPL IPEWKWENVSM+FITGL RTLRGFTVIWVVVDRLTKSAHFVPGKSTYTAS         
Subjt:  KRVYWWRNMKREVAGFVT----------------GLLQPLSIPEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASN--------

Query:  --EINGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG
           ++GVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG
Subjt:  --EINGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG

Query:  KCCRSRFA-GPNTPGALIAPYEGPNPSAFNFAKQWHEEQDISRACLEKAAQRMKKWADKKRRPKEYEIGEKVLVKLLPNQFKSLRKVHKGLVRRYKGPFS
        +CCRS    G      L+ P            +  +E     R+ +  A  R K +AD +R+  E+E+G+KV +K+ P +     ++   L  R+ GPF 
Subjt:  KCCRSRFA-GPNTPGALIAPYEGPNPSAFNFAKQWHEEQDISRACLEKAAQRMKKWADKKRRPKEYEIGEKVLVKLLPNQFKSLRKVHKGLVRRYKGPFS

Query:  IIERVGKAAYKVELPQKLK-IHNVFHVSMLKPFHEDQEDPNRSETSRAPTGVVTEFDRKIKEILAE--RKIRRRGVPSHSEYLILWEGLPESEASWERED
        I+ER+G  AY+++LP  L  +H+VFHVSML+ +  D       E       +   +  +  E+LA   + +R + +P      +LW      EA+WERED
Subjt:  IIERVGKAAYKVELPQKLK-IHNVFHVSMLKPFHEDQEDPNRSETSRAPTGVVTEFDRKIKEILAE--RKIRRRGVPSHSEYLILWEGLPESEASWERED

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein1.8e-10129.07Show/hide
Query:  GATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNR
        G+T+F+K+DL+S YH +R++  D  K AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ L++  
Subjt:  GATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNR

Query:  LYAKFPKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL
        L     KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + + +KQ L
Subjt:  LYAKFPKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL

Query:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGK-----VVAYVSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQK
        V+ PVL   D S   ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +
Subjt:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGK-----VVAYVSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQK

Query:  ELNMRQRRWLELVKDYDLVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAP
                                                         +  N R  RW   ++D++ EI Y PG AN +ADALSR       ++    P
Subjt:  ELNMRQRRWLELVKDYDLVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAP

Query:  LHRDLERAKIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLTEAGQAAEFSLSSDGGLLFERR--LCVPSDSAVKTELLAEAHSSPFSMHP
        + +D E   I          + Q+++    + +++   +ND  L+    L    +  E ++    GLL   +  + +P+D+ +   ++ + H     +HP
Subjt:  LHRDLERAKIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLTEAGQAAEFSLSSDGGLLFERR--LCVPSDSAVKTELLAEAHSSPFSMHP

Query:  GSTKMYQDLKRVYWWRNMKREVAGFVT----------------GLLQPLSIPEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTAS
        G   +   + R + W+ +++++  +V                 G LQP+   E  WE++SM+FIT LP +  G+  ++VVVDR +K A  VP   + TA 
Subjt:  GSTKMYQDLKRVYWWRNMKREVAGFVT----------------GLLQPLSIPEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTAS

Query:  NEIN----------GVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIG
                      G P  I++D D  FTS+ WK         + FS  + PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   
Subjt:  NEIN----------GVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIG

Query:  MAPFEALYGKCCRSRFAGPNTPGALIAPYEGPNPSAFNFAKQWHEEQDIS-----RACLEKAAQRMKKWADKK-RRPKEYEIGEKVLVKLLPNQFKSLRK
        M PFE ++      R++        ++P E P   +F+     + ++ I      +  L     +MKK+ D K +  +E++ G+ V+VK     F  L K
Subjt:  MAPFEALYGKCCRSRFAGPNTPGALIAPYEGPNPSAFNFAKQWHEEQDIS-----RACLEKAAQRMKKWADKK-RRPKEYEIGEKVLVKLLPNQFKSLRK

Query:  VHKGLVRRYKGPFSIIERVGKAAYKVELPQKLK--IHNVFHVSMLKPFHEDQE
         +K L   + GPF ++++ G   Y+++LP  +K    + FHVS L+ +  + E
Subjt:  VHKGLVRRYKGPFSIIERVGKAAYKVELPQKLK--IHNVFHVSMLKPFHEDQE

P0CT35 Transposon Tf2-2 polyprotein1.8e-10129.07Show/hide
Query:  GATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNR
        G+T+F+K+DL+S YH +R++  D  K AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ L++  
Subjt:  GATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNR

Query:  LYAKFPKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL
        L     KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + + +KQ L
Subjt:  LYAKFPKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL

Query:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGK-----VVAYVSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQK
        V+ PVL   D S   ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +
Subjt:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGK-----VVAYVSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQK

Query:  ELNMRQRRWLELVKDYDLVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAP
                                                         +  N R  RW   ++D++ EI Y PG AN +ADALSR       ++    P
Subjt:  ELNMRQRRWLELVKDYDLVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAP

Query:  LHRDLERAKIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLTEAGQAAEFSLSSDGGLLFERR--LCVPSDSAVKTELLAEAHSSPFSMHP
        + +D E   I          + Q+++    + +++   +ND  L+    L    +  E ++    GLL   +  + +P+D+ +   ++ + H     +HP
Subjt:  LHRDLERAKIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLTEAGQAAEFSLSSDGGLLFERR--LCVPSDSAVKTELLAEAHSSPFSMHP

Query:  GSTKMYQDLKRVYWWRNMKREVAGFVT----------------GLLQPLSIPEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTAS
        G   +   + R + W+ +++++  +V                 G LQP+   E  WE++SM+FIT LP +  G+  ++VVVDR +K A  VP   + TA 
Subjt:  GSTKMYQDLKRVYWWRNMKREVAGFVT----------------GLLQPLSIPEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTAS

Query:  NEIN----------GVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIG
                      G P  I++D D  FTS+ WK         + FS  + PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   
Subjt:  NEIN----------GVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIG

Query:  MAPFEALYGKCCRSRFAGPNTPGALIAPYEGPNPSAFNFAKQWHEEQDIS-----RACLEKAAQRMKKWADKK-RRPKEYEIGEKVLVKLLPNQFKSLRK
        M PFE ++      R++        ++P E P   +F+     + ++ I      +  L     +MKK+ D K +  +E++ G+ V+VK     F  L K
Subjt:  MAPFEALYGKCCRSRFAGPNTPGALIAPYEGPNPSAFNFAKQWHEEQDIS-----RACLEKAAQRMKKWADKK-RRPKEYEIGEKVLVKLLPNQFKSLRK

Query:  VHKGLVRRYKGPFSIIERVGKAAYKVELPQKLK--IHNVFHVSMLKPFHEDQE
         +K L   + GPF ++++ G   Y+++LP  +K    + FHVS L+ +  + E
Subjt:  VHKGLVRRYKGPFSIIERVGKAAYKVELPQKLK--IHNVFHVSMLKPFHEDQE

P0CT36 Transposon Tf2-3 polyprotein1.8e-10129.07Show/hide
Query:  GATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNR
        G+T+F+K+DL+S YH +R++  D  K AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ L++  
Subjt:  GATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNR

Query:  LYAKFPKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL
        L     KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + + +KQ L
Subjt:  LYAKFPKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL

Query:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGK-----VVAYVSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQK
        V+ PVL   D S   ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +
Subjt:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGK-----VVAYVSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQK

Query:  ELNMRQRRWLELVKDYDLVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAP
                                                         +  N R  RW   ++D++ EI Y PG AN +ADALSR       ++    P
Subjt:  ELNMRQRRWLELVKDYDLVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAP

Query:  LHRDLERAKIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLTEAGQAAEFSLSSDGGLLFERR--LCVPSDSAVKTELLAEAHSSPFSMHP
        + +D E   I          + Q+++    + +++   +ND  L+    L    +  E ++    GLL   +  + +P+D+ +   ++ + H     +HP
Subjt:  LHRDLERAKIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLTEAGQAAEFSLSSDGGLLFERR--LCVPSDSAVKTELLAEAHSSPFSMHP

Query:  GSTKMYQDLKRVYWWRNMKREVAGFVT----------------GLLQPLSIPEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTAS
        G   +   + R + W+ +++++  +V                 G LQP+   E  WE++SM+FIT LP +  G+  ++VVVDR +K A  VP   + TA 
Subjt:  GSTKMYQDLKRVYWWRNMKREVAGFVT----------------GLLQPLSIPEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTAS

Query:  NEIN----------GVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIG
                      G P  I++D D  FTS+ WK         + FS  + PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   
Subjt:  NEIN----------GVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIG

Query:  MAPFEALYGKCCRSRFAGPNTPGALIAPYEGPNPSAFNFAKQWHEEQDIS-----RACLEKAAQRMKKWADKK-RRPKEYEIGEKVLVKLLPNQFKSLRK
        M PFE ++      R++        ++P E P   +F+     + ++ I      +  L     +MKK+ D K +  +E++ G+ V+VK     F  L K
Subjt:  MAPFEALYGKCCRSRFAGPNTPGALIAPYEGPNPSAFNFAKQWHEEQDIS-----RACLEKAAQRMKKWADKK-RRPKEYEIGEKVLVKLLPNQFKSLRK

Query:  VHKGLVRRYKGPFSIIERVGKAAYKVELPQKLK--IHNVFHVSMLKPFHEDQE
         +K L   + GPF ++++ G   Y+++LP  +K    + FHVS L+ +  + E
Subjt:  VHKGLVRRYKGPFSIIERVGKAAYKVELPQKLK--IHNVFHVSMLKPFHEDQE

P0CT37 Transposon Tf2-4 polyprotein1.8e-10129.07Show/hide
Query:  GATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNR
        G+T+F+K+DL+S YH +R++  D  K AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ L++  
Subjt:  GATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNR

Query:  LYAKFPKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL
        L     KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + + +KQ L
Subjt:  LYAKFPKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL

Query:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGK-----VVAYVSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQK
        V+ PVL   D S   ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +
Subjt:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGK-----VVAYVSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQK

Query:  ELNMRQRRWLELVKDYDLVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAP
                                                         +  N R  RW   ++D++ EI Y PG AN +ADALSR       ++    P
Subjt:  ELNMRQRRWLELVKDYDLVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAP

Query:  LHRDLERAKIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLTEAGQAAEFSLSSDGGLLFERR--LCVPSDSAVKTELLAEAHSSPFSMHP
        + +D E   I          + Q+++    + +++   +ND  L+    L    +  E ++    GLL   +  + +P+D+ +   ++ + H     +HP
Subjt:  LHRDLERAKIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLTEAGQAAEFSLSSDGGLLFERR--LCVPSDSAVKTELLAEAHSSPFSMHP

Query:  GSTKMYQDLKRVYWWRNMKREVAGFVT----------------GLLQPLSIPEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTAS
        G   +   + R + W+ +++++  +V                 G LQP+   E  WE++SM+FIT LP +  G+  ++VVVDR +K A  VP   + TA 
Subjt:  GSTKMYQDLKRVYWWRNMKREVAGFVT----------------GLLQPLSIPEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTAS

Query:  NEIN----------GVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIG
                      G P  I++D D  FTS+ WK         + FS  + PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   
Subjt:  NEIN----------GVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIG

Query:  MAPFEALYGKCCRSRFAGPNTPGALIAPYEGPNPSAFNFAKQWHEEQDIS-----RACLEKAAQRMKKWADKK-RRPKEYEIGEKVLVKLLPNQFKSLRK
        M PFE ++      R++        ++P E P   +F+     + ++ I      +  L     +MKK+ D K +  +E++ G+ V+VK     F  L K
Subjt:  MAPFEALYGKCCRSRFAGPNTPGALIAPYEGPNPSAFNFAKQWHEEQDIS-----RACLEKAAQRMKKWADKK-RRPKEYEIGEKVLVKLLPNQFKSLRK

Query:  VHKGLVRRYKGPFSIIERVGKAAYKVELPQKLK--IHNVFHVSMLKPFHEDQE
         +K L   + GPF ++++ G   Y+++LP  +K    + FHVS L+ +  + E
Subjt:  VHKGLVRRYKGPFSIIERVGKAAYKVELPQKLK--IHNVFHVSMLKPFHEDQE

P0CT41 Transposon Tf2-12 polyprotein1.8e-10129.07Show/hide
Query:  GATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNR
        G+T+F+K+DL+S YH +R++  D  K AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ L++  
Subjt:  GATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNR

Query:  LYAKFPKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL
        L     KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + + +KQ L
Subjt:  LYAKFPKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL

Query:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGK-----VVAYVSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQK
        V+ PVL   D S   ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +
Subjt:  VTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGK-----VVAYVSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQK

Query:  ELNMRQRRWLELVKDYDLVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAP
                                                         +  N R  RW   ++D++ EI Y PG AN +ADALSR       ++    P
Subjt:  ELNMRQRRWLELVKDYDLVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAP

Query:  LHRDLERAKIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLTEAGQAAEFSLSSDGGLLFERR--LCVPSDSAVKTELLAEAHSSPFSMHP
        + +D E   I          + Q+++    + +++   +ND  L+    L    +  E ++    GLL   +  + +P+D+ +   ++ + H     +HP
Subjt:  LHRDLERAKIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLTEAGQAAEFSLSSDGGLLFERR--LCVPSDSAVKTELLAEAHSSPFSMHP

Query:  GSTKMYQDLKRVYWWRNMKREVAGFVT----------------GLLQPLSIPEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTAS
        G   +   + R + W+ +++++  +V                 G LQP+   E  WE++SM+FIT LP +  G+  ++VVVDR +K A  VP   + TA 
Subjt:  GSTKMYQDLKRVYWWRNMKREVAGFVT----------------GLLQPLSIPEWKWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTAS

Query:  NEIN----------GVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIG
                      G P  I++D D  FTS+ WK         + FS  + PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   
Subjt:  NEIN----------GVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIG

Query:  MAPFEALYGKCCRSRFAGPNTPGALIAPYEGPNPSAFNFAKQWHEEQDIS-----RACLEKAAQRMKKWADKK-RRPKEYEIGEKVLVKLLPNQFKSLRK
        M PFE ++      R++        ++P E P   +F+     + ++ I      +  L     +MKK+ D K +  +E++ G+ V+VK     F  L K
Subjt:  MAPFEALYGKCCRSRFAGPNTPGALIAPYEGPNPSAFNFAKQWHEEQDIS-----RACLEKAAQRMKKWADKK-RRPKEYEIGEKVLVKLLPNQFKSLRK

Query:  VHKGLVRRYKGPFSIIERVGKAAYKVELPQKLK--IHNVFHVSMLKPFHEDQE
         +K L   + GPF ++++ G   Y+++LP  +K    + FHVS L+ +  + E
Subjt:  VHKGLVRRYKGPFSIIERVGKAAYKVELPQKLK--IHNVFHVSMLKPFHEDQE

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein1.8e-2746.4Show/hide
Query:  HLRMVLQTLRDNRLYAKFPKCEFWLKQVSFLG--HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVW
        HL MVLQ    ++ YA   KC F   Q+++LG  H++S  GVS DPAK+EA+ GW  P   +E+R FLGL GYYRRFV+N+ +I  PLT+L +K +   W
Subjt:  HLRMVLQTLRDNRLYAKFPKCEFWLKQVSFLG--HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVW

Query:  SKACEDSFQTLKQKLVTAPVLTVPD
        ++    +F+ LK  + T PVL +PD
Subjt:  SKACEDSFQTLKQKLVTAPVLTVPD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTCTTTGGAGGACCCCACTAGGGCTCAGATGTGGTTATCGTCCTTGGAAACCATATTCCGGAGCCACAGTGTTCTCTAAGATTGATCTTCGGTCGGGGTACCATCA
GCTGAGAATTAAGGATGAGGATGTACCGAAGACAGCATTTCGTTCCAGATATGGACACTACGAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGGCAGTGTTTA
TGGACTTGATGAACAGAGTGTTTAGGGAGTTTCTAGACACTTTTGTGATTGTGTTTATCGACGATATCTTGATATACTCCAAGACGGAGGCCGAACACGAGGAGCATTTA
CGTATGGTTTTGCAAACACTTCGGGATAATAGGTTGTATGCAAAGTTCCCGAAATGCGAGTTTTGGCTGAAACAGGTGTCCTTTCTGGGCCACGTGGTTTCTAAGGCTGG
AGTCTCTGTAGATCCAGCTAAGATAGAGGCAGTCACCGGTTGGACCCGACCTTCCACAGTCAGTGAGGTTCGTAGTTTTCTGGGTTTAGCAGGCTATTATCGACGGTTTG
TGGAGAACTTTTCTCGTATAGCTACTCCTCTTACTCAGTTGACCAGAAAGGGAGCTCCTTTTGTTTGGAGCAAGGCATGTGAGGACAGTTTCCAGACCCTTAAACAGAAG
TTAGTTACCGCACCGGTTCTTACGGTACCTGATGGTTCTGGCAATTTCGTGATTTATAGTGATGCTTCCAAGAAGGGTCTGGGTTGTGTTTTGATGCAGCAGGGTAAGGT
GGTCGCTTATGTGTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACGCATGACCTAGAGTTGGCAGCAGTGGTTTTTGCTTTGAAAATATGGAGGCATTATTTAT
ATGGTGAAAAGATACAGATATTCACAGATCATAAGAGCTTGAAATACTTCTTTACTCAGAAGGAATTGAATATGAGACAGCGAAGGTGGCTTGAGTTAGTGAAGGATTAC
GATTTGGTATTCGCTTTGAAGATATGGAGGCATTATTTATATGGTGAAAAGATACAGATATTCACGGATCATAAGAGCTTGAAATACTTCTTTACTCAGAAAGAACTGAA
TATGAGACAGCGAAGGTGGCTTGAGTTAGTGAAGGATTACGATTGTGAGATACTGTATCATCCGGGCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGGAAGGTGTCAC
ATTCAGCAGCACTTATTACCCGGCAGGCCCCATTGCATCGGGATCTCGAGCGGGCTAAGATTGCAGTGTCAGTGGGGGCAGTTACTATGCAGTTAGCCCAGTTGACGGTA
CAGCCGACTTTGAGGCAAAGGATCATCGATGCTCAGAGTAACGATCCTTATCTGGTTGAGAAACGTGGCCTAACAGAGGCAGGGCAAGCGGCTGAGTTCTCGTTATCCTC
TGACGGTGGACTGTTGTTTGAAAGACGCCTCTGTGTTCCGTCAGATAGTGCGGTTAAGACAGAATTATTAGCTGAGGCTCACAGTTCCCCATTTTCTATGCACCCAGGTA
GTACGAAGATGTATCAGGACCTAAAGCGGGTTTATTGGTGGCGTAACATGAAGAGAGAAGTAGCAGGATTTGTTACGGGTTTATTACAACCCTTGAGCATACCGGAATGG
AAGTGGGAGAACGTGTCCATGAATTTCATTACAGGGCTACCGAGAACTCTGAGGGGATTTACAGTGATTTGGGTTGTGGTGGACAGACTTACTAAATCAGCGCACTTCGT
TCCGGGTAAATCCACCTATACTGCTAGTAATGAGATTAACGGAGTGCCAGTGTCGATTGTTTCTGATAGAGATGCCCGTTTCACTTCCAAATTTTGGAAGGGTTTGCAGA
CTGCTATGGGCACGAGGTTGGACTTTAGTACGGCTTTCCATCCACAGACTGACGGTCAGACTGAGCGTCTGAACCAGGTTTTAGAGGATATGTTGCGAGCGTGTGCATTG
GAATTTCCAGGTAGCTGGGACTCCCACTTACATTTGATGGAATTTGCTTATAATAACAGTTATCAGGCTACTATTGGCATGGCACCGTTTGAGGCTCTGTACGGCAAATG
TTGTAGATCCCGGTTTGCTGGGCCCAACACACCCGGTGCCTTGATTGCCCCATATGAAGGACCTAACCCTTCTGCGTTTAATTTCGCGAAGCAATGGCACGAGGAGCAAG
ATATATCAAGGGCATGTCTCGAGAAAGCAGCCCAAAGAATGAAGAAATGGGCCGATAAAAAGAGAAGACCAAAAGAATACGAAATTGGCGAGAAGGTTTTGGTAAAACTA
TTACCAAACCAGTTCAAGTCCCTTCGAAAAGTCCACAAGGGACTAGTCAGACGATATAAAGGCCCATTCTCAATCATCGAGAGAGTGGGTAAAGCAGCGTATAAAGTGGA
ACTACCTCAAAAGTTGAAGATCCATAACGTCTTCCACGTAAGTATGTTAAAACCCTTTCACGAAGACCAGGAGGATCCGAACAGAAGCGAAACCTCTCGAGCGCCAACCG
GAGTAGTAACCGAGTTTGACAGGAAGATTAAGGAGATCCTAGCCGAGAGGAAAATAAGAAGAAGAGGAGTTCCAAGCCACTCAGAATATCTGATTCTATGGGAAGGATTG
CCAGAGTCCGAAGCCAGCTGGGAACGCGAAGATGTACTTTGGCAATTCCAAACAGAAATAGAGAAGTTCAAAGAGAACGCAACGGGGACGTTGCGAAATCAAGTGGGGGA
GGGTGTCACGCCCCAAAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGGTCTTTGGAGGACCCCACTAGGGCTCAGATGTGGTTATCGTCCTTGGAAACCATATTCCGGAGCCACAGTGTTCTCTAAGATTGATCTTCGGTCGGGGTACCATCA
GCTGAGAATTAAGGATGAGGATGTACCGAAGACAGCATTTCGTTCCAGATATGGACACTACGAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGGCAGTGTTTA
TGGACTTGATGAACAGAGTGTTTAGGGAGTTTCTAGACACTTTTGTGATTGTGTTTATCGACGATATCTTGATATACTCCAAGACGGAGGCCGAACACGAGGAGCATTTA
CGTATGGTTTTGCAAACACTTCGGGATAATAGGTTGTATGCAAAGTTCCCGAAATGCGAGTTTTGGCTGAAACAGGTGTCCTTTCTGGGCCACGTGGTTTCTAAGGCTGG
AGTCTCTGTAGATCCAGCTAAGATAGAGGCAGTCACCGGTTGGACCCGACCTTCCACAGTCAGTGAGGTTCGTAGTTTTCTGGGTTTAGCAGGCTATTATCGACGGTTTG
TGGAGAACTTTTCTCGTATAGCTACTCCTCTTACTCAGTTGACCAGAAAGGGAGCTCCTTTTGTTTGGAGCAAGGCATGTGAGGACAGTTTCCAGACCCTTAAACAGAAG
TTAGTTACCGCACCGGTTCTTACGGTACCTGATGGTTCTGGCAATTTCGTGATTTATAGTGATGCTTCCAAGAAGGGTCTGGGTTGTGTTTTGATGCAGCAGGGTAAGGT
GGTCGCTTATGTGTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACGCATGACCTAGAGTTGGCAGCAGTGGTTTTTGCTTTGAAAATATGGAGGCATTATTTAT
ATGGTGAAAAGATACAGATATTCACAGATCATAAGAGCTTGAAATACTTCTTTACTCAGAAGGAATTGAATATGAGACAGCGAAGGTGGCTTGAGTTAGTGAAGGATTAC
GATTTGGTATTCGCTTTGAAGATATGGAGGCATTATTTATATGGTGAAAAGATACAGATATTCACGGATCATAAGAGCTTGAAATACTTCTTTACTCAGAAAGAACTGAA
TATGAGACAGCGAAGGTGGCTTGAGTTAGTGAAGGATTACGATTGTGAGATACTGTATCATCCGGGCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGGAAGGTGTCAC
ATTCAGCAGCACTTATTACCCGGCAGGCCCCATTGCATCGGGATCTCGAGCGGGCTAAGATTGCAGTGTCAGTGGGGGCAGTTACTATGCAGTTAGCCCAGTTGACGGTA
CAGCCGACTTTGAGGCAAAGGATCATCGATGCTCAGAGTAACGATCCTTATCTGGTTGAGAAACGTGGCCTAACAGAGGCAGGGCAAGCGGCTGAGTTCTCGTTATCCTC
TGACGGTGGACTGTTGTTTGAAAGACGCCTCTGTGTTCCGTCAGATAGTGCGGTTAAGACAGAATTATTAGCTGAGGCTCACAGTTCCCCATTTTCTATGCACCCAGGTA
GTACGAAGATGTATCAGGACCTAAAGCGGGTTTATTGGTGGCGTAACATGAAGAGAGAAGTAGCAGGATTTGTTACGGGTTTATTACAACCCTTGAGCATACCGGAATGG
AAGTGGGAGAACGTGTCCATGAATTTCATTACAGGGCTACCGAGAACTCTGAGGGGATTTACAGTGATTTGGGTTGTGGTGGACAGACTTACTAAATCAGCGCACTTCGT
TCCGGGTAAATCCACCTATACTGCTAGTAATGAGATTAACGGAGTGCCAGTGTCGATTGTTTCTGATAGAGATGCCCGTTTCACTTCCAAATTTTGGAAGGGTTTGCAGA
CTGCTATGGGCACGAGGTTGGACTTTAGTACGGCTTTCCATCCACAGACTGACGGTCAGACTGAGCGTCTGAACCAGGTTTTAGAGGATATGTTGCGAGCGTGTGCATTG
GAATTTCCAGGTAGCTGGGACTCCCACTTACATTTGATGGAATTTGCTTATAATAACAGTTATCAGGCTACTATTGGCATGGCACCGTTTGAGGCTCTGTACGGCAAATG
TTGTAGATCCCGGTTTGCTGGGCCCAACACACCCGGTGCCTTGATTGCCCCATATGAAGGACCTAACCCTTCTGCGTTTAATTTCGCGAAGCAATGGCACGAGGAGCAAG
ATATATCAAGGGCATGTCTCGAGAAAGCAGCCCAAAGAATGAAGAAATGGGCCGATAAAAAGAGAAGACCAAAAGAATACGAAATTGGCGAGAAGGTTTTGGTAAAACTA
TTACCAAACCAGTTCAAGTCCCTTCGAAAAGTCCACAAGGGACTAGTCAGACGATATAAAGGCCCATTCTCAATCATCGAGAGAGTGGGTAAAGCAGCGTATAAAGTGGA
ACTACCTCAAAAGTTGAAGATCCATAACGTCTTCCACGTAAGTATGTTAAAACCCTTTCACGAAGACCAGGAGGATCCGAACAGAAGCGAAACCTCTCGAGCGCCAACCG
GAGTAGTAACCGAGTTTGACAGGAAGATTAAGGAGATCCTAGCCGAGAGGAAAATAAGAAGAAGAGGAGTTCCAAGCCACTCAGAATATCTGATTCTATGGGAAGGATTG
CCAGAGTCCGAAGCCAGCTGGGAACGCGAAGATGTACTTTGGCAATTCCAAACAGAAATAGAGAAGTTCAAAGAGAACGCAACGGGGACGTTGCGAAATCAAGTGGGGGA
GGGTGTCACGCCCCAAAAATAG
Protein sequenceShow/hide protein sequence
MGLWRTPLGLRCGYRPWKPYSGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHL
RMVLQTLRDNRLYAKFPKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQK
LVTAPVLTVPDGSGNFVIYSDASKKGLGCVLMQQGKVVAYVSRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDY
DLVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAKIAVSVGAVTMQLAQLTV
QPTLRQRIIDAQSNDPYLVEKRGLTEAGQAAEFSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAGFVTGLLQPLSIPEW
KWENVSMNFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASNEINGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACAL
EFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSRFAGPNTPGALIAPYEGPNPSAFNFAKQWHEEQDISRACLEKAAQRMKKWADKKRRPKEYEIGEKVLVKL
LPNQFKSLRKVHKGLVRRYKGPFSIIERVGKAAYKVELPQKLKIHNVFHVSMLKPFHEDQEDPNRSETSRAPTGVVTEFDRKIKEILAERKIRRRGVPSHSEYLILWEGL
PESEASWEREDVLWQFQTEIEKFKENATGTLRNQVGEGVTPQK