CuGenDBv2

Pay0016321 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0016321
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Reverse transcriptase
Genome location: chr01:20779849..20782124
RNA-Seq Expression: Pay0016321
Synteny: Pay0016321
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0008270 - zinc ion binding (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
IPR043502 - DNA/RNA polymerase superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR041588 - Integrase zinc-binding domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR036397 - Ribonuclease H superfamily
IPR012337 - Ribonuclease H-like superfamily
IPR001584 - Integrase, catalytic core


Homology
GenBank top hits (e value, % identity, alignment)
KAA0040695.1 pol protein [Cucumis melo var. makuwa]    e value: 0.0e+00    % identity: 78.64
Query:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE
        MVLQTL DNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGW RPSTVSEVRSFLGLAGYYR+FVENFSRIATPLTQLTRKGAPFVWSKACE
Subjt:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE

Query:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------
        DSFQNLKQKLVTAPVLTV DGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY                 
Subjt:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------

Query:  --------------------------------------------AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK
                                                    AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK
Subjt:  --------------------------------------------AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK

Query:  RGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPL
        RGLAEAGQA EFSLSSDGGLLFER LCVPSDSA+KTELLSEAHSSPFSMHPGSTKMYQDLKRVYWW NMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPL
Subjt:  RGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPL

Query:  SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA
        SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTK AHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVS+RDARFTSKFWKGLQTAMGTRLDFSTA
Subjt:  SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA

Query:  FHPQTD-----------------------------------------------------------------------------GDKVFLKVAPMRGVLRF
        FHPQTD                                                                             GDKVFLKVAPMRGVLRF
Subjt:  FHPQTD-----------------------------------------------------------------------------GDKVFLKVAPMRGVLRF

Query:  ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNKQIPLVKVLWRKHR
        ERRGKLSPRFVGPFEILERIGPVAYRL LPPSLSTVHDV HVSMLRKYVPDPSHVVDYEPLEIDENLSY EQPVEVLAREVKTLRNK+IPLVKVLWR HR
Subjt:  ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNKQIPLVKVLWRKHR

Query:  VEEATWEREDDMRSRYPELFE
        VEEATWEREDDMRSRYPELFE
Subjt:  VEEATWEREDDMRSRYPELFE
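
The alignment blocks on this page follow the usual BLAST pairwise layout: a Query line, a midline, and a Subjt line. In the midline, a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. The sketch below counts these column types for a single block. It is only an illustration: the function names are hypothetical and not part of CuGenDB, and the per-hit % identity listed above is computed by the search tool over the whole alignment, so a value computed from one block will generally differ from it.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count column types in one BLAST-style alignment block.

    The midline is padded to the query length because trailing blanks
    (mismatches or gaps at the end of a block) are often stripped.
    """
    padded = midline.ljust(len(query))
    identical = sum(1 for ch in padded if ch.isalpha())
    positive = sum(1 for ch in padded if ch == "+")
    other = len(padded) - identical - positive
    return {"identical": identical, "positive": positive, "other": other}


def percent_identity(query: str, midline: str) -> float:
    """Percent of columns in this block whose midline shows an identical residue."""
    stats = midline_stats(query, midline)
    columns = len(query)
    return 100 * stats["identical"] / columns if columns else 0.0


# Two short fragments copied from the first block of the KAA0040695.1 alignment.
frag1 = percent_identity("MVLQTLRDNK", "MVLQTL DNK")
frag2 = midline_stats("GLAGYYRRFVENF", "GLAGYYR+FVENF")
print(frag1, frag2)
```

Counting over `len(query)` treats gap columns (the `-` runs in the Query line) as non-identical, which is one common convention; a different denominator would be needed to match a tool that excludes gaps.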

KAA0048687.1 pol protein [Cucumis melo var. makuwa]    e value: 0.0e+00    % identity: 75.07
Query:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE
        MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE
Subjt:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE

Query:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------
        DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY                 
Subjt:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------

Query:  --------------------------------------------AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK
                                                    AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK
Subjt:  --------------------------------------------AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK

Query:  RGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPL
        RGLAEAGQAVEFSLSSDGGLLFERRLCVPSDS +KTELLSEAHSSPFSMHPGSTKMY+D+KRVYWWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPL
Subjt:  RGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPL

Query:  SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA
        SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWK LQTAMGTRLDFSTA
Subjt:  SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA

Query:  FHPQTD----------------------------------------------------------------------------------------------
        FHPQTD                                                                                              
Subjt:  FHPQTD----------------------------------------------------------------------------------------------

Query:  -------------------GDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEID
                           GDKVFLKVAPMRGVLRFERRGKLSPRF+GPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDY+PL+ID
Subjt:  -------------------GDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEID

Query:  ENLSYVEQPVEVLAREVKTLRNKQIPLVKVLWRKHRVEEATWEREDDMRSRYPELFEE
        ENLSY EQPVEVLAREVKTLRNK+IPLVKVLWR HRVEEATWEREDDM+SRYPEL  E
Subjt:  ENLSYVEQPVEVLAREVKTLRNKQIPLVKVLWRKHRVEEATWEREDDMRSRYPELFEE

KAA0050673.1 pol protein [Cucumis melo var. makuwa]    e value: 0.0e+00    % identity: 79.27
Query:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE
        +VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVT WTRPSTVSEVRSFLGLAGYYRRFVENFS IATPLTQLTRKGAPFVWSKACE
Subjt:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE

Query:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------
        DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY                 
Subjt:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------

Query:  AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKTELLSEAHS
        AALITRQAPLHRDLERAEIAVSVGAVT+QLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAV FS+SSDGGL+FERRLCVPSDSA+KTELLSEAHS
Subjt:  AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKTELLSEAHS

Query:  SPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPG
         PFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPLS+PEWKWENVSMDFITGLPRTLRGFTVIWVVVDR TKSAHFVPG
Subjt:  SPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPG

Query:  KSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTD--------------------------------------
        KSTYT SKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTD                                      
Subjt:  KSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTD--------------------------------------

Query:  ---------------------------------------------------------------------------GDKVFLKVAPMRGVLRFERRGKLSP
                                                                                   GDKVFLKVAPMRGVLRFERRGKLSP
Subjt:  ---------------------------------------------------------------------------GDKVFLKVAPMRGVLRFERRGKLSP

Query:  RFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNKQIPLVKVLWRKHRVEEATWER
        RFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRK+VPDPSH+VDYEPLEIDENLSY EQPVEVLAREVKTLRNK+IPLVKVLWR HRVEEATWER
Subjt:  RFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNKQIPLVKVLWRKHRVEEATWER

Query:  EDDMRSRYPELFEE
        EDDMRSRYPELFEE
Subjt:  EDDMRSRYPELFEE

KAA0051357.1 pol protein [Cucumis melo var. makuwa]    e value: 0.0e+00    % identity: 75.73
Query:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE
        MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKA VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE
Subjt:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE

Query:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------
        DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY                 
Subjt:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------

Query:  --------------------------------------------AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK
                                                    AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK
Subjt:  --------------------------------------------AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK

Query:  RGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPL
        RGLAEAGQAVEFSLSSDGGL FE RLCVPSDSA+KTELL EAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVK PRQKPAGLLQPL
Subjt:  RGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPL

Query:  SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA
        SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA
Subjt:  SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA

Query:  FHPQTDG---------------------------------------------------------------------------------------------
        FHPQ DG                                                                                             
Subjt:  FHPQTDG---------------------------------------------------------------------------------------------

Query:  --------------------DKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEID
                            DKVFLKVAPM+GVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEID
Subjt:  --------------------DKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEID

Query:  ENLSYVEQPVEVLAREVKTLRNKQIPLVKVLWRKHRVEEATWEREDDMRSRYPELFEE
        ENLSYVEQPVEVLAREVKTLRNK+IPLVKVLWR HRVEEATWEREDDMRSRYPELFEE
Subjt:  ENLSYVEQPVEVLAREVKTLRNKQIPLVKVLWRKHRVEEATWEREDDMRSRYPELFEE

KAA0057672.1 pol protein [Cucumis melo var. makuwa]    e value: 0.0e+00    % identity: 75.46
Query:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE
        MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSR ATPLTQLTRKGAPFVWSKACE
Subjt:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE

Query:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------
        DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY                 
Subjt:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------

Query:  --------------------------------------------AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK
                                                    AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK
Subjt:  --------------------------------------------AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK

Query:  RGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPL
        RGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSA+KTELL+EAHSSPFSMHPGSTKMYQDLKR+YWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPL
Subjt:  RGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPL

Query:  SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA
        SIPEWKWENVSMDFI GLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYT SKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA
Subjt:  SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA

Query:  FHPQTD----------------------------------------------------------------------------------------------
        FHPQTD                                                                                              
Subjt:  FHPQTD----------------------------------------------------------------------------------------------

Query:  -------------------GDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEID
                           GDKVFLKVAPMRGV+RFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDY+PLEID
Subjt:  -------------------GDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEID

Query:  ENLSYVEQPVEVLAREVKTLRNKQIPLVKVLWRKHRVEEATWEREDDMRSRYPELFEE
        ENLSY+EQPVEVLAREVKTLRNK+IPLVKVLWR HR+EEATWEREDDMRSRYPELFEE
Subjt:  ENLSYVEQPVEVLAREVKTLRNKQIPLVKVLWRKHRVEEATWEREDDMRSRYPELFEE

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TGX4 Reverse transcriptase    e value: 0.0e+00    % identity: 78.64
Query:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE
        MVLQTL DNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGW RPSTVSEVRSFLGLAGYYR+FVENFSRIATPLTQLTRKGAPFVWSKACE
Subjt:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE

Query:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------
        DSFQNLKQKLVTAPVLTV DGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY                 
Subjt:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------

Query:  --------------------------------------------AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK
                                                    AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK
Subjt:  --------------------------------------------AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK

Query:  RGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPL
        RGLAEAGQA EFSLSSDGGLLFER LCVPSDSA+KTELLSEAHSSPFSMHPGSTKMYQDLKRVYWW NMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPL
Subjt:  RGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPL

Query:  SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA
        SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTK AHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVS+RDARFTSKFWKGLQTAMGTRLDFSTA
Subjt:  SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA

Query:  FHPQTD-----------------------------------------------------------------------------GDKVFLKVAPMRGVLRF
        FHPQTD                                                                             GDKVFLKVAPMRGVLRF
Subjt:  FHPQTD-----------------------------------------------------------------------------GDKVFLKVAPMRGVLRF

Query:  ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNKQIPLVKVLWRKHR
        ERRGKLSPRFVGPFEILERIGPVAYRL LPPSLSTVHDV HVSMLRKYVPDPSHVVDYEPLEIDENLSY EQPVEVLAREVKTLRNK+IPLVKVLWR HR
Subjt:  ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNKQIPLVKVLWRKHR

Query:  VEEATWEREDDMRSRYPELFE
        VEEATWEREDDMRSRYPELFE
Subjt:  VEEATWEREDDMRSRYPELFE

A0A5A7U470 Pol protein    e value: 0.0e+00    % identity: 79.27
Query:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE
        +VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVT WTRPSTVSEVRSFLGLAGYYRRFVENFS IATPLTQLTRKGAPFVWSKACE
Subjt:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE

Query:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------
        DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY                 
Subjt:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------

Query:  AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKTELLSEAHS
        AALITRQAPLHRDLERAEIAVSVGAVT+QLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAV FS+SSDGGL+FERRLCVPSDSA+KTELLSEAHS
Subjt:  AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKTELLSEAHS

Query:  SPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPG
         PFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPLS+PEWKWENVSMDFITGLPRTLRGFTVIWVVVDR TKSAHFVPG
Subjt:  SPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPG

Query:  KSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTD--------------------------------------
        KSTYT SKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTD                                      
Subjt:  KSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTD--------------------------------------

Query:  ---------------------------------------------------------------------------GDKVFLKVAPMRGVLRFERRGKLSP
                                                                                   GDKVFLKVAPMRGVLRFERRGKLSP
Subjt:  ---------------------------------------------------------------------------GDKVFLKVAPMRGVLRFERRGKLSP

Query:  RFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNKQIPLVKVLWRKHRVEEATWER
        RFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRK+VPDPSH+VDYEPLEIDENLSY EQPVEVLAREVKTLRNK+IPLVKVLWR HRVEEATWER
Subjt:  RFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNKQIPLVKVLWRKHRVEEATWER

Query:  EDDMRSRYPELFEE
        EDDMRSRYPELFEE
Subjt:  EDDMRSRYPELFEE

A0A5A7UAA8 Reverse transcriptase    e value: 0.0e+00    % identity: 75.73
Query:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE
        MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKA VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE
Subjt:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE

Query:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------
        DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY                 
Subjt:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------

Query:  --------------------------------------------AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK
                                                    AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK
Subjt:  --------------------------------------------AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK

Query:  RGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPL
        RGLAEAGQAVEFSLSSDGGL FE RLCVPSDSA+KTELL EAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVK PRQKPAGLLQPL
Subjt:  RGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPL

Query:  SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA
        SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA
Subjt:  SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA

Query:  FHPQTDG---------------------------------------------------------------------------------------------
        FHPQ DG                                                                                             
Subjt:  FHPQTDG---------------------------------------------------------------------------------------------

Query:  --------------------DKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEID
                            DKVFLKVAPM+GVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEID
Subjt:  --------------------DKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEID

Query:  ENLSYVEQPVEVLAREVKTLRNKQIPLVKVLWRKHRVEEATWEREDDMRSRYPELFEE
        ENLSYVEQPVEVLAREVKTLRNK+IPLVKVLWR HRVEEATWEREDDMRSRYPELFEE
Subjt:  ENLSYVEQPVEVLAREVKTLRNKQIPLVKVLWRKHRVEEATWEREDDMRSRYPELFEE

A0A5A7UP94 Pol protein    e value: 0.0e+00    % identity: 75.46
Query:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE
        MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSR ATPLTQLTRKGAPFVWSKACE
Subjt:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE

Query:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------
        DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY                 
Subjt:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------

Query:  --------------------------------------------AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK
                                                    AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK
Subjt:  --------------------------------------------AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK

Query:  RGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPL
        RGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSA+KTELL+EAHSSPFSMHPGSTKMYQDLKR+YWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPL
Subjt:  RGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPL

Query:  SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA
        SIPEWKWENVSMDFI GLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYT SKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA
Subjt:  SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA

Query:  FHPQTD----------------------------------------------------------------------------------------------
        FHPQTD                                                                                              
Subjt:  FHPQTD----------------------------------------------------------------------------------------------

Query:  -------------------GDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEID
                           GDKVFLKVAPMRGV+RFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDY+PLEID
Subjt:  -------------------GDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEID

Query:  ENLSYVEQPVEVLAREVKTLRNKQIPLVKVLWRKHRVEEATWEREDDMRSRYPELFEE
        ENLSY+EQPVEVLAREVKTLRNK+IPLVKVLWR HR+EEATWEREDDMRSRYPELFEE
Subjt:  ENLSYVEQPVEVLAREVKTLRNKQIPLVKVLWRKHRVEEATWEREDDMRSRYPELFEE

A0A5A7V8X5 Pol protein    e value: 0.0e+00    % identity: 78.9
Query:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE
        +VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE
Subjt:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE

Query:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------
        DSFQNLKQKLVTAPVLTVPDGSGSF+IYSDASKKGLGCVLMQQGKVV YASRQLKSHEQNYPTHDLELA +VFALKIWRHYLY                 
Subjt:  DSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------

Query:  AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKTELLSEAHS
        AALITRQAPLHRDLERAEIAVS+GAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAE GQAVEFS+SSDGGLLFERRLCVPSDSA+KTELLSEAHS
Subjt:  AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKTELLSEAHS

Query:  SPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPG
        SPFSMHPGS KMYQ+LKRVYWWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPLSI EWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPG
Subjt:  SPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPG

Query:  KSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTD--------------------------------------
        KSTYTASKWAQLYMSEIVRLHGV VSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTD                                      
Subjt:  KSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTD--------------------------------------

Query:  ---------------------------------------------------------------------------GDKVFLKVAPMRGVLRFERRGKLSP
                                                                                   GDKVFLKVAPMRGVLRFERRGKLSP
Subjt:  ---------------------------------------------------------------------------GDKVFLKVAPMRGVLRFERRGKLSP

Query:  RFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNKQIPLVKVLWRKHRVEEATWER
        RFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDEN SY EQPVEVLAREVKTLRNK+IPLVKVLWR HRVEEATWER
Subjt:  RFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNKQIPLVKVLWRKHRVEEATWER

Query:  EDDMRSRYPEL
        EDDMRSR  EL
Subjt:  EDDMRSRYPEL

SwissProt top hits (e value, % identity, alignment)
P0CT34 Transposon Tf2-1 polyprotein    e value: 6.0e-58    % identity: 28.49
Query:  VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACED
        VLQ L++  L    +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     
Subjt:  VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACED

Query:  SFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYAAL----------
        + +N+KQ LV+ PVL   D S   ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A++ +LK WRHYL + +          
Subjt:  SFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYAAL----------

Query:  -----ITRQA-PLHRDLERAE-------------------IAVSVGAVTMQ---------------LAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAG
             IT ++ P ++ L R +                   IA ++  +  +               + Q+++    + +++   +ND  L+    L    
Subjt:  -----ITRQA-PLHRDLERAE-------------------IAVSVGAVTMQ---------------LAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAG

Query:  QAVEFSLSSDGGLLFERR--LCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEW
        + VE ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E 
Subjt:  QAVEFSLSSDGGLLFERR--LCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEW

Query:  KWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQT
         WE++SMDFIT LP +  G+  ++VVVDR +K A  VP   + TA + A+++   ++   G P  I++D D  FTS+ WK         + FS  + PQT
Subjt:  KWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQT

Query:  DG
        DG
Subjt:  DG

P0CT35 Transposon Tf2-2 polyprotein    e value: 6.0e-58    % identity: 28.49
Query:  VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACED
        VLQ L++  L    +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     
Subjt:  VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACED

Query:  SFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYAAL----------
        + +N+KQ LV+ PVL   D S   ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A++ +LK WRHYL + +          
Subjt:  SFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYAAL----------

Query:  -----ITRQA-PLHRDLERAE-------------------IAVSVGAVTMQ---------------LAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAG
             IT ++ P ++ L R +                   IA ++  +  +               + Q+++    + +++   +ND  L+    L    
Subjt:  -----ITRQA-PLHRDLERAE-------------------IAVSVGAVTMQ---------------LAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAG

Query:  QAVEFSLSSDGGLLFERR--LCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEW
        + VE ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E 
Subjt:  QAVEFSLSSDGGLLFERR--LCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEW

Query:  KWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQT
         WE++SMDFIT LP +  G+  ++VVVDR +K A  VP   + TA + A+++   ++   G P  I++D D  FTS+ WK         + FS  + PQT
Subjt:  KWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQT

Query:  DG
        DG
Subjt:  DG

P0CT36 Transposon Tf2-3 polyprotein    e value: 6.0e-58    % identity: 28.49
Query:  VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACED
        VLQ L++  L    +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     
Subjt:  VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACED

Query:  SFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYAAL----------
        + +N+KQ LV+ PVL   D S   ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A++ +LK WRHYL + +          
Subjt:  SFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYAAL----------

Query:  -----ITRQA-PLHRDLERAE-------------------IAVSVGAVTMQ---------------LAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAG
             IT ++ P ++ L R +                   IA ++  +  +               + Q+++    + +++   +ND  L+    L    
Subjt:  -----ITRQA-PLHRDLERAE-------------------IAVSVGAVTMQ---------------LAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAG

Query:  QAVEFSLSSDGGLLFERR--LCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEW
        + VE ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E 
Subjt:  QAVEFSLSSDGGLLFERR--LCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEW

Query:  KWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQT
         WE++SMDFIT LP +  G+  ++VVVDR +K A  VP   + TA + A+++   ++   G P  I++D D  FTS+ WK         + FS  + PQT
Subjt:  KWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQT

Query:  DG
        DG
Subjt:  DG

P0CT41 Transposon Tf2-12 polyprotein    e value: 6.0e-58    % identity: 28.49
Query:  VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACED
        VLQ L++  L    +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     
Subjt:  VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACED

Query:  SFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYAAL----------
        + +N+KQ LV+ PVL   D S   ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A++ +LK WRHYL + +          
Subjt:  SFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYAAL----------

Query:  -----ITRQA-PLHRDLERAE-------------------IAVSVGAVTMQ---------------LAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAG
             IT ++ P ++ L R +                   IA ++  +  +               + Q+++    + +++   +ND  L+    L    
Subjt:  -----ITRQA-PLHRDLERAE-------------------IAVSVGAVTMQ---------------LAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAG

Query:  QAVEFSLSSDGGLLFERR--LCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEW
        + VE ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E 
Subjt:  QAVEFSLSSDGGLLFERR--LCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEW

Query:  KWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQT
         WE++SMDFIT LP +  G+  ++VVVDR +K A  VP   + TA + A+++   ++   G P  I++D D  FTS+ WK         + FS  + PQT
Subjt:  KWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQT

Query:  DG
        DG
Subjt:  DG

Q9UR07 Transposon Tf2-11 polyprotein    6.0e-58    28.49
Query:  VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACED
        VLQ L++  L    +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     
Subjt:  VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACED

Query:  SFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYAAL----------
        + +N+KQ LV+ PVL   D S   ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A++ +LK WRHYL + +          
Subjt:  SFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYAAL----------

Query:  -----ITRQA-PLHRDLERAE-------------------IAVSVGAVTMQ---------------LAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAG
             IT ++ P ++ L R +                   IA ++  +  +               + Q+++    + +++   +ND  L+    L    
Subjt:  -----ITRQA-PLHRDLERAE-------------------IAVSVGAVTMQ---------------LAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAG

Query:  QAVEFSLSSDGGLLFERR--LCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEW
        + VE ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E 
Subjt:  QAVEFSLSSDGGLLFERR--LCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEW

Query:  KWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQT
         WE++SMDFIT LP +  G+  ++VVVDR +K A  VP   + TA + A+++   ++   G P  I++D D  FTS+ WK         + FS  + PQT
Subjt:  KWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQT

Query:  DG
        DG
Subjt:  DG

Arabidopsis top hits    e value    % identity    Alignment
ATMG00860.1 DNA/RNA polymerases superfamily protein    1.3e-26    45.31
Query:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLG--HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKA
        MVLQ    ++ YA   KC F   Q+++LG  H++S  GVS DPAK+EA+ GW  P   +E+R FLGL GYYRRFV+N+ +I  PLT+L +K +   W++ 
Subjt:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLG--HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKA

Query:  CEDSFQNLKQKLVTAPVLTVPDGSGSFV
           +F+ LK  + T PVL +PD    FV
Subjt:  CEDSFQNLKQKLVTAPVLTVPDGSGSFV


Sequences
CDS sequence
ATGGTTTTGCAAACACTTCGGGATAATAAGTTGTATGCAAAGTTCTCGAAATGCGAGTTTTGGCTGAAGCAGGTGTCCTTTCTGGGCCACGTGGTTTCTAAGGCTGGAGT
CTCTGTGGATCCAGCTAAGATAGAGGCAGTCACCGGTTGGACCCGACCTTCCACAGTCAGTGAGGTTCGTAGCTTTCTGGGTTTAGCAGGCTATTATCGACGGTTTGTGG
AGAACTTTTCTCGTATAGCTACTCCTCTTACTCAGTTGACCAGAAAGGGAGCTCCTTTTGTTTGGAGCAAGGCATGTGAGGACAGTTTCCAGAACCTTAAACAGAAGCTA
GTTACCGCACCGGTTCTTACTGTACCTGATGGTTCTGGCAGTTTTGTGATTTATAGTGATGCTTCCAAGAAGGGTTTGGGTTGTGTTTTGATGCAGCAGGGTAAGGTGGT
CGCTTATGCGTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACACATGATCTAGAGTTGGCAGCAGTGGTTTTTGCTTTGAAAATATGGAGGCATTATTTATATG
CAGCACTTATTACCCGACAGGCCCCATTGCATCGGGATCTCGAGCGGGCTGAGATTGCAGTGTCAGTGGGGGCAGTTACTATGCAGTTAGCCCAGTTGACGGTACAGCCG
ACTTTGAGGCAAAGGATCATTGATGCTCAGAGTAACGATCCTTATCTGGTTGAGAAACGTGGCCTAGCAGAGGCAGGGCAAGCGGTTGAGTTCTCGTTATCCTCTGATGG
TGGACTTTTGTTTGAGAGACGCCTCTGTGTGCCGTCAGATAGTGCGATTAAGACAGAATTATTATCTGAGGCTCACAGTTCCCCATTTTCCATGCACCCAGGTAGTACGA
AGATGTATCAGGACCTGAAGCGGGTTTATTGGTGGCGTAACATGAAGAGGGAAGTAGCAGAATTTGTTAGTAAATGCTTGGTGTGTCAGCAGGTTAAGGCACCAAGGCAG
AAACCAGCGGGTTTATTACAACCCTTGAGCATACCGGAATGGAAGTGGGAGAACGTGTCCATGGATTTCATTACAGGGCTACCGAGAACTCTGAGGGGTTTTACAGTGAT
TTGGGTTGTGGTGGACAGACTTACTAAATCAGCGCACTTTGTTCCGGGTAAATCCACCTATACTGCTAGTAAGTGGGCACAGTTGTACATGTCTGAGATAGTGAGATTGC
ATGGAGTGCCAGTGTCGATTGTTTCTGATAGAGATGCCCGTTTCACTTCCAAATTCTGGAAGGGTTTGCAGACTGCTATGGGCACGAGGTTGGACTTTAGTACGGCTTTC
CATCCACAGACTGACGGGGATAAGGTGTTCTTAAAGGTAGCACCTATGAGAGGTGTCTTGCGTTTTGAAAGGAGGGGAAAGTTGAGTCCCCGTTTTGTTGGGCCGTTTGA
GATTCTGGAGCGGATTGGCCCTGTAGCTTATCGCTTGGCGTTGCCTCCATCACTCTCGACAGTCCATGATGTGTTTCACGTTTCTATGTTGAGGAAGTACGTGCCAGATC
CATCCCATGTAGTGGATTACGAGCCACTAGAGATTGATGAAAACTTGAGCTATGTTGAACAACCTGTTGAGGTGCTTGCTAGAGAGGTGAAGACGTTGAGAAATAAACAA
ATTCCCTTAGTTAAAGTCTTATGGCGGAAACACCGGGTAGAAGAGGCTACATGGGAGCGTGAAGACGACATGAGATCCCGTTATCCCGAACTGTTCGAGGAATAA
mRNA sequence
ATGGTTTTGCAAACACTTCGGGATAATAAGTTGTATGCAAAGTTCTCGAAATGCGAGTTTTGGCTGAAGCAGGTGTCCTTTCTGGGCCACGTGGTTTCTAAGGCTGGAGT
CTCTGTGGATCCAGCTAAGATAGAGGCAGTCACCGGTTGGACCCGACCTTCCACAGTCAGTGAGGTTCGTAGCTTTCTGGGTTTAGCAGGCTATTATCGACGGTTTGTGG
AGAACTTTTCTCGTATAGCTACTCCTCTTACTCAGTTGACCAGAAAGGGAGCTCCTTTTGTTTGGAGCAAGGCATGTGAGGACAGTTTCCAGAACCTTAAACAGAAGCTA
GTTACCGCACCGGTTCTTACTGTACCTGATGGTTCTGGCAGTTTTGTGATTTATAGTGATGCTTCCAAGAAGGGTTTGGGTTGTGTTTTGATGCAGCAGGGTAAGGTGGT
CGCTTATGCGTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACACATGATCTAGAGTTGGCAGCAGTGGTTTTTGCTTTGAAAATATGGAGGCATTATTTATATG
CAGCACTTATTACCCGACAGGCCCCATTGCATCGGGATCTCGAGCGGGCTGAGATTGCAGTGTCAGTGGGGGCAGTTACTATGCAGTTAGCCCAGTTGACGGTACAGCCG
ACTTTGAGGCAAAGGATCATTGATGCTCAGAGTAACGATCCTTATCTGGTTGAGAAACGTGGCCTAGCAGAGGCAGGGCAAGCGGTTGAGTTCTCGTTATCCTCTGATGG
TGGACTTTTGTTTGAGAGACGCCTCTGTGTGCCGTCAGATAGTGCGATTAAGACAGAATTATTATCTGAGGCTCACAGTTCCCCATTTTCCATGCACCCAGGTAGTACGA
AGATGTATCAGGACCTGAAGCGGGTTTATTGGTGGCGTAACATGAAGAGGGAAGTAGCAGAATTTGTTAGTAAATGCTTGGTGTGTCAGCAGGTTAAGGCACCAAGGCAG
AAACCAGCGGGTTTATTACAACCCTTGAGCATACCGGAATGGAAGTGGGAGAACGTGTCCATGGATTTCATTACAGGGCTACCGAGAACTCTGAGGGGTTTTACAGTGAT
TTGGGTTGTGGTGGACAGACTTACTAAATCAGCGCACTTTGTTCCGGGTAAATCCACCTATACTGCTAGTAAGTGGGCACAGTTGTACATGTCTGAGATAGTGAGATTGC
ATGGAGTGCCAGTGTCGATTGTTTCTGATAGAGATGCCCGTTTCACTTCCAAATTCTGGAAGGGTTTGCAGACTGCTATGGGCACGAGGTTGGACTTTAGTACGGCTTTC
CATCCACAGACTGACGGGGATAAGGTGTTCTTAAAGGTAGCACCTATGAGAGGTGTCTTGCGTTTTGAAAGGAGGGGAAAGTTGAGTCCCCGTTTTGTTGGGCCGTTTGA
GATTCTGGAGCGGATTGGCCCTGTAGCTTATCGCTTGGCGTTGCCTCCATCACTCTCGACAGTCCATGATGTGTTTCACGTTTCTATGTTGAGGAAGTACGTGCCAGATC
CATCCCATGTAGTGGATTACGAGCCACTAGAGATTGATGAAAACTTGAGCTATGTTGAACAACCTGTTGAGGTGCTTGCTAGAGAGGTGAAGACGTTGAGAAATAAACAA
ATTCCCTTAGTTAAAGTCTTATGGCGGAAACACCGGGTAGAAGAGGCTACATGGGAGCGTGAAGACGACATGAGATCCCGTTATCCCGAACTGTTCGAGGAATAA
Protein sequence
MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKL
VTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQP
TLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAIKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQ
KPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAF
HPQTDGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNKQ
IPLVKVLWRKHRVEEATWEREDDMRSRYPELFEE