CuGenDBv2

Pay0020280 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0020280
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Reverse transcriptase
Genome location: chr08:7735344..7737620
RNA-Seq Expression: Pay0020280
Synteny: Pay0020280
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0008270 - zinc ion binding (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
IPR043502 - DNA/RNA polymerase superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR041588 - Integrase zinc-binding domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR036397 - Ribonuclease H superfamily
IPR016197 - Chromo-like domain superfamily
IPR012337 - Ribonuclease H-like superfamily
IPR001584 - Integrase, catalytic core
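
The genome location in this record follows a `chromosome:start..end` pattern. As a minimal sketch, the string can be split and the gene span computed as below. The helper name `parse_location` is hypothetical and not part of CuGenDB, and the length assumes 1-based inclusive coordinates, which this page does not state.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("chr08:7735344..7737620")
# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
length = end - start + 1
print(chrom, start, end, length)
```

If the coordinates turn out to be 0-based or half-open, the `+ 1` would need to be dropped.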


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAA0040695.1 pol protein [Cucumis melo var. makuwa] (e value: 0.0e+00; % identity: 78.25)
Query:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE
        MVLQTL DNKLYAKFSKCEFWLKQVSFLGHVVSK GVSVDPAKIEAVTGW RPSTVSEVRSFLGLAGYYR+FVENFSRIATPLTQLTRKGAPFVWSKACE
Subjt:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE

Query:  DSFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------
        DSFQ LKQKLVTAPVLTV DGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY                 
Subjt:  DSFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------

Query:  ---------------------------------------------ALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK
                                                     ALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK
Subjt:  ---------------------------------------------ALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK

Query:  RGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPL
        RGLAEAGQ AEFSLSSDGGLLFER LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWW NMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPL
Subjt:  RGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPL

Query:  SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA
        SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTK AHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVS+RDARFTSKFWKGLQTAMGTRLDFSTA
Subjt:  SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA

Query:  FHPQTD-----------------------------------------------------------------------------GDKVFLKVAPMKGVLRF
        FHPQTD                                                                             GDKVFLKVAPM+GVLRF
Subjt:  FHPQTD-----------------------------------------------------------------------------GDKVFLKVAPMKGVLRF

Query:  ERRGKLSPRFVGPFEILERIGPVAYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNKEIPLVKVLWRNHR
        ERRGKLSPRFVGPFEILERIGPVAYRL LP SLSTVHDV HVSMLRKYVPDPSHVVDYEPLEIDENLSY EQPVEVLAREVKTLRNKEIPLVKVLWRNHR
Subjt:  ERRGKLSPRFVGPFEILERIGPVAYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNKEIPLVKVLWRNHR

Query:  IEEATWEREDDMRSRYPELFEG
        +EEATWEREDDMRSRYPELFEG
Subjt:  IEEATWEREDDMRSRYPELFEG
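
Each alignment block here is a Query line, a match line, and a Subjt line. In the match line a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below tallies one match line under that reading. The function name `count_matches` is hypothetical, and the tally covers a single block only, so it is not meant to reproduce the % identity reported for the whole hit.

```python
def count_matches(midline):
    """Return (identities, positives, length) for one alignment match line."""
    # Letters in the match line are identical residues.
    identities = sum(1 for ch in midline if ch.isalpha())
    # Assumption: positives include identities plus '+' (conservative) positions.
    positives = identities + midline.count("+")
    return identities, positives, len(midline)

# Match line of the final block of the first GenBank hit.
midline = "+EEATWEREDDMRSRYPELFEG"
identities, positives, length = count_matches(midline)
print(identities, positives, length)
```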

KAA0048687.1 pol protein [Cucumis melo var. makuwa] (e value: 7.5e-308; % identity: 74.57)
Query:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE
        MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSK GVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE
Subjt:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE

Query:  DSFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------
        DSFQ LKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY                 
Subjt:  DSFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------

Query:  ---------------------------------------------ALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK
                                                     ALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK
Subjt:  ---------------------------------------------ALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK

Query:  RGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPL
        RGLAEAGQ  EFSLSSDGGLLFERRLCVPSDS VKTELLSEAHSSPFSMHPGSTKMY+D+KRVYWWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPL
Subjt:  RGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPL

Query:  SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA
        SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWK LQTAMGTRLDFSTA
Subjt:  SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA

Query:  FHPQTD----------------------------------------------------------------------------------------------
        FHPQTD                                                                                              
Subjt:  FHPQTD----------------------------------------------------------------------------------------------

Query:  -------------------GDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEID
                           GDKVFLKVAPM+GVLRFERRGKLSPRF+GPFEILERIGPVAYRLALP SLSTVHDVFHVSMLRKYVPDPSHVVDY+PL+ID
Subjt:  -------------------GDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEID

Query:  ENLSYVEQPVEVLAREVKTLRNKEIPLVKVLWRNHRIEEATWEREDDMRSRYPEL
        ENLSY EQPVEVLAREVKTLRNKEIPLVKVLWRNHR+EEATWEREDDM+SRYPEL
Subjt:  ENLSYVEQPVEVLAREVKTLRNKEIPLVKVLWRNHRIEEATWEREDDMRSRYPEL

KAA0050673.1 pol protein [Cucumis melo var. makuwa] (e value: 0.0e+00; % identity: 78.54)
Query:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE
        +VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSK GVSVDPAKIEAVT WTRPSTVSEVRSFLGLAGYYRRFVENFS IATPLTQLTRKGAPFVWSKACE
Subjt:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE

Query:  DSFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------
        DSFQ LKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY                 
Subjt:  DSFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------

Query:  -ALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHS
         ALITRQAPLHRDLERAEIAVSVGAVT+QLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQ   FS+SSDGGL+FERRLCVPSDSAVKTELLSEAHS
Subjt:  -ALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHS

Query:  SPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPG
         PFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPLS+PEWKWENVSMDFITGLPRTLRGFTVIWVVVDR TKSAHFVPG
Subjt:  SPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPG

Query:  KSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTD--------------------------------------
        KSTYT SKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTD                                      
Subjt:  KSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTD--------------------------------------

Query:  ---------------------------------------------------------------------------GDKVFLKVAPMKGVLRFERRGKLSP
                                                                                   GDKVFLKVAPM+GVLRFERRGKLSP
Subjt:  ---------------------------------------------------------------------------GDKVFLKVAPMKGVLRFERRGKLSP

Query:  RFVGPFEILERIGPVAYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNKEIPLVKVLWRNHRIEEATWER
        RFVGPFEILERIGPVAYRLALP SLSTVHDVFHVSMLRK+VPDPSH+VDYEPLEIDENLSY EQPVEVLAREVKTLRNKEIPLVKVLWRNHR+EEATWER
Subjt:  RFVGPFEILERIGPVAYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNKEIPLVKVLWRNHRIEEATWER

Query:  EDDMRSRYPELFE
        EDDMRSRYPELFE
Subjt:  EDDMRSRYPELFE

KAA0051357.1 pol protein [Cucumis melo var. makuwa] (e value: 2.2e-309; % identity: 75.3)
Query:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE
        MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSK  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE
Subjt:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE

Query:  DSFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------
        DSFQ LKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY                 
Subjt:  DSFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------

Query:  ---------------------------------------------ALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK
                                                     ALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK
Subjt:  ---------------------------------------------ALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK

Query:  RGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPL
        RGLAEAGQ  EFSLSSDGGL FE RLCVPSDSAVKTELL EAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVK PRQKPAGLLQPL
Subjt:  RGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPL

Query:  SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA
        SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA
Subjt:  SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA

Query:  FHPQTDG---------------------------------------------------------------------------------------------
        FHPQ DG                                                                                             
Subjt:  FHPQTDG---------------------------------------------------------------------------------------------

Query:  --------------------DKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEID
                            DKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALP SLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEID
Subjt:  --------------------DKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEID

Query:  ENLSYVEQPVEVLAREVKTLRNKEIPLVKVLWRNHRIEEATWEREDDMRSRYPELFE
        ENLSYVEQPVEVLAREVKTLRNKEIPLVKVLWRNHR+EEATWEREDDMRSRYPELFE
Subjt:  ENLSYVEQPVEVLAREVKTLRNKEIPLVKVLWRNHRIEEATWEREDDMRSRYPELFE

KAA0057672.1 pol protein [Cucumis melo var. makuwa] (e value: 4.5e-310; % identity: 74.9)
Query:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE
        MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSK GVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSR ATPLTQLTRKGAPFVWSKACE
Subjt:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE

Query:  DSFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------
        DSFQ LKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY                 
Subjt:  DSFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------

Query:  ---------------------------------------------ALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK
                                                     ALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK
Subjt:  ---------------------------------------------ALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK

Query:  RGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPL
        RGLAEAGQ  EFSLSSDGGLLFERRLCVPSDSAVKTELL+EAHSSPFSMHPGSTKMYQDLKR+YWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPL
Subjt:  RGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPL

Query:  SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA
        SIPEWKWENVSMDFI GLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYT SKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA
Subjt:  SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA

Query:  FHPQTD----------------------------------------------------------------------------------------------
        FHPQTD                                                                                              
Subjt:  FHPQTD----------------------------------------------------------------------------------------------

Query:  -------------------GDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEID
                           GDKVFLKVAPM+GV+RFERRGKLSPRFVGPFEILERIGPVAYRLALP SLSTVHDVFHVSMLRKYVPDPSHVVDY+PLEID
Subjt:  -------------------GDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEID

Query:  ENLSYVEQPVEVLAREVKTLRNKEIPLVKVLWRNHRIEEATWEREDDMRSRYPELFE
        ENLSY+EQPVEVLAREVKTLRNKEIPLVKVLWRNHR+EEATWEREDDMRSRYPELFE
Subjt:  ENLSYVEQPVEVLAREVKTLRNKEIPLVKVLWRNHRIEEATWEREDDMRSRYPELFE

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A5A7TGX4 Reverse transcriptase (e value: 0.0e+00; % identity: 78.25)
Query:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE
        MVLQTL DNKLYAKFSKCEFWLKQVSFLGHVVSK GVSVDPAKIEAVTGW RPSTVSEVRSFLGLAGYYR+FVENFSRIATPLTQLTRKGAPFVWSKACE
Subjt:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE

Query:  DSFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------
        DSFQ LKQKLVTAPVLTV DGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY                 
Subjt:  DSFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------

Query:  ---------------------------------------------ALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK
                                                     ALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK
Subjt:  ---------------------------------------------ALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK

Query:  RGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPL
        RGLAEAGQ AEFSLSSDGGLLFER LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWW NMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPL
Subjt:  RGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPL

Query:  SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA
        SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTK AHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVS+RDARFTSKFWKGLQTAMGTRLDFSTA
Subjt:  SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA

Query:  FHPQTD-----------------------------------------------------------------------------GDKVFLKVAPMKGVLRF
        FHPQTD                                                                             GDKVFLKVAPM+GVLRF
Subjt:  FHPQTD-----------------------------------------------------------------------------GDKVFLKVAPMKGVLRF

Query:  ERRGKLSPRFVGPFEILERIGPVAYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNKEIPLVKVLWRNHR
        ERRGKLSPRFVGPFEILERIGPVAYRL LP SLSTVHDV HVSMLRKYVPDPSHVVDYEPLEIDENLSY EQPVEVLAREVKTLRNKEIPLVKVLWRNHR
Subjt:  ERRGKLSPRFVGPFEILERIGPVAYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNKEIPLVKVLWRNHR

Query:  IEEATWEREDDMRSRYPELFEG
        +EEATWEREDDMRSRYPELFEG
Subjt:  IEEATWEREDDMRSRYPELFEG

A0A5A7U330 Reverse transcriptase (e value: 3.6e-308; % identity: 74.57)
Query:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE
        MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSK GVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE
Subjt:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE

Query:  DSFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------
        DSFQ LKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY                 
Subjt:  DSFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------

Query:  ---------------------------------------------ALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK
                                                     ALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK
Subjt:  ---------------------------------------------ALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK

Query:  RGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPL
        RGLAEAGQ  EFSLSSDGGLLFERRLCVPSDS VKTELLSEAHSSPFSMHPGSTKMY+D+KRVYWWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPL
Subjt:  RGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPL

Query:  SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA
        SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWK LQTAMGTRLDFSTA
Subjt:  SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA

Query:  FHPQTD----------------------------------------------------------------------------------------------
        FHPQTD                                                                                              
Subjt:  FHPQTD----------------------------------------------------------------------------------------------

Query:  -------------------GDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEID
                           GDKVFLKVAPM+GVLRFERRGKLSPRF+GPFEILERIGPVAYRLALP SLSTVHDVFHVSMLRKYVPDPSHVVDY+PL+ID
Subjt:  -------------------GDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEID

Query:  ENLSYVEQPVEVLAREVKTLRNKEIPLVKVLWRNHRIEEATWEREDDMRSRYPEL
        ENLSY EQPVEVLAREVKTLRNKEIPLVKVLWRNHR+EEATWEREDDM+SRYPEL
Subjt:  ENLSYVEQPVEVLAREVKTLRNKEIPLVKVLWRNHRIEEATWEREDDMRSRYPEL

A0A5A7U470 Pol protein (e value: 0.0e+00; % identity: 78.54)
Query:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE
        +VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSK GVSVDPAKIEAVT WTRPSTVSEVRSFLGLAGYYRRFVENFS IATPLTQLTRKGAPFVWSKACE
Subjt:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE

Query:  DSFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------
        DSFQ LKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY                 
Subjt:  DSFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------

Query:  -ALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHS
         ALITRQAPLHRDLERAEIAVSVGAVT+QLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQ   FS+SSDGGL+FERRLCVPSDSAVKTELLSEAHS
Subjt:  -ALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHS

Query:  SPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPG
         PFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPLS+PEWKWENVSMDFITGLPRTLRGFTVIWVVVDR TKSAHFVPG
Subjt:  SPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPG

Query:  KSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTD--------------------------------------
        KSTYT SKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTD                                      
Subjt:  KSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTD--------------------------------------

Query:  ---------------------------------------------------------------------------GDKVFLKVAPMKGVLRFERRGKLSP
                                                                                   GDKVFLKVAPM+GVLRFERRGKLSP
Subjt:  ---------------------------------------------------------------------------GDKVFLKVAPMKGVLRFERRGKLSP

Query:  RFVGPFEILERIGPVAYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNKEIPLVKVLWRNHRIEEATWER
        RFVGPFEILERIGPVAYRLALP SLSTVHDVFHVSMLRK+VPDPSH+VDYEPLEIDENLSY EQPVEVLAREVKTLRNKEIPLVKVLWRNHR+EEATWER
Subjt:  RFVGPFEILERIGPVAYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNKEIPLVKVLWRNHRIEEATWER

Query:  EDDMRSRYPELFE
        EDDMRSRYPELFE
Subjt:  EDDMRSRYPELFE

A0A5A7UAA8 Reverse transcriptase (e value: 1.1e-309; % identity: 75.3)
Query:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE
        MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSK  VSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE
Subjt:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE

Query:  DSFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------
        DSFQ LKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY                 
Subjt:  DSFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------

Query:  ---------------------------------------------ALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK
                                                     ALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK
Subjt:  ---------------------------------------------ALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK

Query:  RGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPL
        RGLAEAGQ  EFSLSSDGGL FE RLCVPSDSAVKTELL EAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVK PRQKPAGLLQPL
Subjt:  RGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPL

Query:  SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA
        SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA
Subjt:  SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA

Query:  FHPQTDG---------------------------------------------------------------------------------------------
        FHPQ DG                                                                                             
Subjt:  FHPQTDG---------------------------------------------------------------------------------------------

Query:  --------------------DKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEID
                            DKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALP SLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEID
Subjt:  --------------------DKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEID

Query:  ENLSYVEQPVEVLAREVKTLRNKEIPLVKVLWRNHRIEEATWEREDDMRSRYPELFE
        ENLSYVEQPVEVLAREVKTLRNKEIPLVKVLWRNHR+EEATWEREDDMRSRYPELFE
Subjt:  ENLSYVEQPVEVLAREVKTLRNKEIPLVKVLWRNHRIEEATWEREDDMRSRYPELFE

A0A5A7UP94 Pol protein (e value: 2.2e-310; % identity: 74.9)
Query:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE
        MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSK GVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSR ATPLTQLTRKGAPFVWSKACE
Subjt:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE

Query:  DSFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------
        DSFQ LKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY                 
Subjt:  DSFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLY-----------------

Query:  ---------------------------------------------ALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK
                                                     ALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK
Subjt:  ---------------------------------------------ALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEK

Query:  RGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPL
        RGLAEAGQ  EFSLSSDGGLLFERRLCVPSDSAVKTELL+EAHSSPFSMHPGSTKMYQDLKR+YWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPL
Subjt:  RGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPL

Query:  SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA
        SIPEWKWENVSMDFI GLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYT SKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA
Subjt:  SIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTA

Query:  FHPQTD----------------------------------------------------------------------------------------------
        FHPQTD                                                                                              
Subjt:  FHPQTD----------------------------------------------------------------------------------------------

Query:  -------------------GDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEID
                           GDKVFLKVAPM+GV+RFERRGKLSPRFVGPFEILERIGPVAYRLALP SLSTVHDVFHVSMLRKYVPDPSHVVDY+PLEID
Subjt:  -------------------GDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEID

Query:  ENLSYVEQPVEVLAREVKTLRNKEIPLVKVLWRNHRIEEATWEREDDMRSRYPELFE
        ENLSY+EQPVEVLAREVKTLRNKEIPLVKVLWRNHR+EEATWEREDDMRSRYPELFE
Subjt:  ENLSYVEQPVEVLAREVKTLRNKEIPLVKVLWRNHRIEEATWEREDDMRSRYPELFE

SwissProt top hits (columns: hit, e value, % identity, alignment)
P0CT34 Transposon Tf2-1 polyprotein (e value: 5.1e-57; % identity: 27.16)
Query:  VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACED
        VLQ L++  L    +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     
Subjt:  VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACED

Query:  SFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYALI----------
        + + +KQ LV+ PVL   D S   ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A++ +LK WRHYL + I          
Subjt:  SFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYALI----------

Query:  -------TRQAPLHRDLERAE-------------------IAVSVGAVTMQ---------------LAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAG
                   P ++ L R +                   IA ++  +  +               + Q+++    + +++   +ND  L+    L    
Subjt:  -------TRQAPLHRDLERAE-------------------IAVSVGAVTMQ---------------LAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAG

Query:  QTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEW
        +  E ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E 
Subjt:  QTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEW

Query:  KWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQT
         WE++SMDFIT LP +  G+  ++VVVDR +K A  VP   + TA + A+++   ++   G P  I++D D  FTS+ WK         + FS  + PQT
Subjt:  KWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQT

Query:  DG---------DKVFLKVAP---------MKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPS
        DG         +K+   V           +  V +       S   + PFEI+ R  P    L LPS
Subjt:  DG---------DKVFLKVAP---------MKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPS

P0CT35 Transposon Tf2-2 polyprotein (e value: 5.1e-57; % identity: 27.16)
Query:  VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACED
        VLQ L++  L    +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     
Subjt:  VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACED

Query:  SFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYALI----------
        + + +KQ LV+ PVL   D S   ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A++ +LK WRHYL + I          
Subjt:  SFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYALI----------

Query:  -------TRQAPLHRDLERAE-------------------IAVSVGAVTMQ---------------LAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAG
                   P ++ L R +                   IA ++  +  +               + Q+++    + +++   +ND  L+    L    
Subjt:  -------TRQAPLHRDLERAE-------------------IAVSVGAVTMQ---------------LAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAG

Query:  QTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEW
        +  E ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E 
Subjt:  QTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEW

Query:  KWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQT
         WE++SMDFIT LP +  G+  ++VVVDR +K A  VP   + TA + A+++   ++   G P  I++D D  FTS+ WK         + FS  + PQT
Subjt:  KWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQT

Query:  DG---------DKVFLKVAP---------MKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPS
        DG         +K+   V           +  V +       S   + PFEI+ R  P    L LPS
Subjt:  DG---------DKVFLKVAP---------MKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPS

P0CT36 Transposon Tf2-3 polyprotein (e value: 5.1e-57; % identity: 27.16)
Query:  VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACED
        VLQ L++  L    +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     
Subjt:  VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACED

Query:  SFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYALI----------
        + + +KQ LV+ PVL   D S   ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A++ +LK WRHYL + I          
Subjt:  SFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYALI----------

Query:  -------TRQAPLHRDLERAE-------------------IAVSVGAVTMQ---------------LAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAG
                   P ++ L R +                   IA ++  +  +               + Q+++    + +++   +ND  L+    L    
Subjt:  -------TRQAPLHRDLERAE-------------------IAVSVGAVTMQ---------------LAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAG

Query:  QTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEW
        +  E ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E 
Subjt:  QTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEW

Query:  KWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQT
         WE++SMDFIT LP +  G+  ++VVVDR +K A  VP   + TA + A+++   ++   G P  I++D D  FTS+ WK         + FS  + PQT
Subjt:  KWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQT

Query:  DG---------DKVFLKVAP---------MKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPS
        DG         +K+   V           +  V +       S   + PFEI+ R  P    L LPS
Subjt:  DG---------DKVFLKVAP---------MKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPS

P0CT41 Transposon Tf2-12 polyprotein | e value: 5.1e-57 | % identity: 27.16
Query:  VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACED
        VLQ L++  L    +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     
Subjt:  VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACED

Query:  SFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYALI----------
        + + +KQ LV+ PVL   D S   ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A++ +LK WRHYL + I          
Subjt:  SFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYALI----------

Query:  -------TRQAPLHRDLERAE-------------------IAVSVGAVTMQ---------------LAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAG
                   P ++ L R +                   IA ++  +  +               + Q+++    + +++   +ND  L+    L    
Subjt:  -------TRQAPLHRDLERAE-------------------IAVSVGAVTMQ---------------LAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAG

Query:  QTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEW
        +  E ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E 
Subjt:  QTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEW

Query:  KWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQT
         WE++SMDFIT LP +  G+  ++VVVDR +K A  VP   + TA + A+++   ++   G P  I++D D  FTS+ WK         + FS  + PQT
Subjt:  KWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQT

Query:  DG---------DKVFLKVAP---------MKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPS
        DG         +K+   V           +  V +       S   + PFEI+ R  P    L LPS
Subjt:  DG---------DKVFLKVAP---------MKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPS

Q9UR07 Transposon Tf2-11 polyprotein | e value: 5.1e-57 | % identity: 27.16
Query:  VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACED
        VLQ L++  L    +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     
Subjt:  VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACED

Query:  SFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYALI----------
        + + +KQ LV+ PVL   D S   ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A++ +LK WRHYL + I          
Subjt:  SFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYALI----------

Query:  -------TRQAPLHRDLERAE-------------------IAVSVGAVTMQ---------------LAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAG
                   P ++ L R +                   IA ++  +  +               + Q+++    + +++   +ND  L+    L    
Subjt:  -------TRQAPLHRDLERAE-------------------IAVSVGAVTMQ---------------LAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAG

Query:  QTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEW
        +  E ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E 
Subjt:  QTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEW

Query:  KWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQT
         WE++SMDFIT LP +  G+  ++VVVDR +K A  VP   + TA + A+++   ++   G P  I++D D  FTS+ WK         + FS  + PQT
Subjt:  KWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQT

Query:  DG---------DKVFLKVAP---------MKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPS
        DG         +K+   V           +  V +       S   + PFEI+ R  P    L LPS
Subjt:  DG---------DKVFLKVAP---------MKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPS

Arabidopsis top hits | e value | % identity
ATMG00860.1 DNA/RNA polymerases superfamily protein | 3.3e-27 | 45.31
Query:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLG--HVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKA
        MVLQ    ++ YA   KC F   Q+++LG  H++S +GVS DPAK+EA+ GW  P   +E+R FLGL GYYRRFV+N+ +I  PLT+L +K +   W++ 
Subjt:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLG--HVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKA

Query:  CEDSFQTLKQKLVTAPVLTVPDGSGSFV
           +F+ LK  + T PVL +PD    FV
Subjt:  CEDSFQTLKQKLVTAPVLTVPDGSGSFV


Sequences
CDS sequence
ATGGTTTTGCAGACACTTCGGGATAATAAGTTATATGCAAAGTTCTCGAAATGCGAGTTTTGGCTGAAACAGGTGTCTTTTCTGGGCCACGTGGTTTCTAAGGATGGAGT
CTCTGTGGATCCAGCTAAGATAGAGGCAGTCACCGGTTGGACCCGACCTTCCACAGTCAGTGAGGTTCGTAGCTTTCTGGGTTTAGCAGGCTATTATCGACGGTTTGTGG
AGAACTTTTCCCGTATAGCTACTCCTCTTACTCAGTTGACCAGAAAGGGAGCTCCTTTTGTTTGGAGCAAGGCATGTGAGGACAGTTTCCAGACCCTTAAACAGAAGCTA
GTTACCGCGCCGGTTCTTACTGTACCTGATGGTTCTGGCAGTTTCGTGATTTATAGTGATGCTTCCAAGAAGGGTCTGGGTTGTGTTTTGATGCAGCAGGGTAAGGTGGT
CGCTTATGCGTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACACATGATCTAGAGTTGGCAGCAGTGGTTTTTGCTTTGAAAATATGGAGGCATTATTTATATG
CACTTATTACCCGACAGGCCCCATTGCATCGGGATCTTGAGCGGGCTGAGATTGCAGTGTCAGTGGGTGCAGTTACTATGCAGTTAGCCCAGTTGACGGTACAGCCGACC
TTGAGGCAGAGGATCATTGATGCTCAGAGTAACGATCCTTATCTGGTTGAGAAACGTGGCCTAGCAGAGGCAGGGCAAACGGCTGAGTTCTCGTTATCCTCTGATGGTGG
ACTGTTGTTTGAGAGACGCCTCTGTGTTCCGTCAGATAGTGCGGTTAAGACAGAATTATTATCTGAGGCGCACAGTTCCCCATTTTCCATGCACCCAGGTAGTACGAAGA
TGTATCAGGACTTAAAGCGGGTTTATTGGTGGCGTAACATGAAGAGAGAAGTAGCAGAATTTGTTAGTAAATGCTTGGTGTGTCAGCAGGTTAAGGCACCAAGGCAGAAA
CCAGCGGGTTTATTACAACCTTTGAGCATACCGGAATGGAAGTGGGAGAACGTGTCCATGGATTTCATTACCGGGCTACCGAGAACTCTGAGGGGGTTTACAGTGATCTG
GGTTGTGGTGGACAGACTTACTAAATCAGCGCACTTTGTTCCGGGTAAATCCACCTATACTGCTAGTAAGTGGGCACAGTTGTACATGTCTGAGATAGTGAGATTGCACG
GAGTGCCAGTGTCGATTGTTTCTGACAGAGATGCCCGTTTCACTTCCAAATTTTGGAAGGGTTTGCAGACTGCTATGGGCACGAGATTGGACTTTAGTACGGCTTTCCAT
CCACAGACTGACGGGGACAAGGTGTTCTTAAAGGTAGCACCTATGAAAGGTGTCTTACGTTTTGAAAGGAGGGGAAAGTTGAGTCCCCGTTTTGTTGGGCCGTTTGAGAT
TCTAGAGCGGATTGGCCCTGTAGCTTATCGCTTGGCGTTGCCTTCATCACTCTCGACAGTCCATGATGTGTTTCACGTCTCTATGTTGAGGAAGTACGTGCCAGATCCAT
CCCATGTAGTGGATTACGAGCCACTAGAGATTGATGAAAACTTGAGCTATGTTGAACAACCTGTTGAGGTACTGGCTAGAGAGGTGAAAACGTTGAGGAATAAGGAAATC
CCTCTGGTTAAAGTCTTATGGCGGAATCACCGGATAGAAGAGGCTACGTGGGAGCGAGAAGATGACATGAGGTCCCGTTATCCCGAACTGTTCGAAGGGTAA
mRNA sequence
ATGGTTTTGCAGACACTTCGGGATAATAAGTTATATGCAAAGTTCTCGAAATGCGAGTTTTGGCTGAAACAGGTGTCTTTTCTGGGCCACGTGGTTTCTAAGGATGGAGT
CTCTGTGGATCCAGCTAAGATAGAGGCAGTCACCGGTTGGACCCGACCTTCCACAGTCAGTGAGGTTCGTAGCTTTCTGGGTTTAGCAGGCTATTATCGACGGTTTGTGG
AGAACTTTTCCCGTATAGCTACTCCTCTTACTCAGTTGACCAGAAAGGGAGCTCCTTTTGTTTGGAGCAAGGCATGTGAGGACAGTTTCCAGACCCTTAAACAGAAGCTA
GTTACCGCGCCGGTTCTTACTGTACCTGATGGTTCTGGCAGTTTCGTGATTTATAGTGATGCTTCCAAGAAGGGTCTGGGTTGTGTTTTGATGCAGCAGGGTAAGGTGGT
CGCTTATGCGTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACACATGATCTAGAGTTGGCAGCAGTGGTTTTTGCTTTGAAAATATGGAGGCATTATTTATATG
CACTTATTACCCGACAGGCCCCATTGCATCGGGATCTTGAGCGGGCTGAGATTGCAGTGTCAGTGGGTGCAGTTACTATGCAGTTAGCCCAGTTGACGGTACAGCCGACC
TTGAGGCAGAGGATCATTGATGCTCAGAGTAACGATCCTTATCTGGTTGAGAAACGTGGCCTAGCAGAGGCAGGGCAAACGGCTGAGTTCTCGTTATCCTCTGATGGTGG
ACTGTTGTTTGAGAGACGCCTCTGTGTTCCGTCAGATAGTGCGGTTAAGACAGAATTATTATCTGAGGCGCACAGTTCCCCATTTTCCATGCACCCAGGTAGTACGAAGA
TGTATCAGGACTTAAAGCGGGTTTATTGGTGGCGTAACATGAAGAGAGAAGTAGCAGAATTTGTTAGTAAATGCTTGGTGTGTCAGCAGGTTAAGGCACCAAGGCAGAAA
CCAGCGGGTTTATTACAACCTTTGAGCATACCGGAATGGAAGTGGGAGAACGTGTCCATGGATTTCATTACCGGGCTACCGAGAACTCTGAGGGGGTTTACAGTGATCTG
GGTTGTGGTGGACAGACTTACTAAATCAGCGCACTTTGTTCCGGGTAAATCCACCTATACTGCTAGTAAGTGGGCACAGTTGTACATGTCTGAGATAGTGAGATTGCACG
GAGTGCCAGTGTCGATTGTTTCTGACAGAGATGCCCGTTTCACTTCCAAATTTTGGAAGGGTTTGCAGACTGCTATGGGCACGAGATTGGACTTTAGTACGGCTTTCCAT
CCACAGACTGACGGGGACAAGGTGTTCTTAAAGGTAGCACCTATGAAAGGTGTCTTACGTTTTGAAAGGAGGGGAAAGTTGAGTCCCCGTTTTGTTGGGCCGTTTGAGAT
TCTAGAGCGGATTGGCCCTGTAGCTTATCGCTTGGCGTTGCCTTCATCACTCTCGACAGTCCATGATGTGTTTCACGTCTCTATGTTGAGGAAGTACGTGCCAGATCCAT
CCCATGTAGTGGATTACGAGCCACTAGAGATTGATGAAAACTTGAGCTATGTTGAACAACCTGTTGAGGTACTGGCTAGAGAGGTGAAAACGTTGAGGAATAAGGAAATC
CCTCTGGTTAAAGTCTTATGGCGGAATCACCGGATAGAAGAGGCTACGTGGGAGCGAGAAGATGACATGAGGTCCCGTTATCCCGAACTGTTCGAAGGGTAA
Protein sequence
MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKL
VTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPT
LRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQK
PAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFH
PQTDGDKVFLKVAPMKGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNKEI
PLVKVLWRNHRIEEATWEREDDMRSRYPELFEG
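
The protein sequence above should correspond to a translation of the CDS sequence. A minimal Python sketch of that check follows, assuming the standard genetic code (NCBI translation table 1); the function name `translate` and the two short excerpts are illustrative choices, not part of the database record.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), written in the
# conventional TCAG ordering so each codon lines up with one amino-acid letter.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the sequence can be
    pasted as the wrapped lines shown in the record. Codons that contain
    ambiguous bases are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Excerpts copied from the start and the in-frame end of the CDS above.
cds_start = "ATGGTTTTGCAGACACTTCGGGAT"
cds_end = "AGGTCCCGTTATCCCGAACTGTTCGAAGGGTAA"

print(translate(cds_start))
print(translate(cds_end))
```

Passing the full CDS, with its line breaks, to `translate` and comparing the result with the protein sequence is a quick consistency check on the record.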