CuGenDBv2

Pay0012215 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0012215
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Reverse transcriptase
Genome location: chr02:15946239..15951617
RNA-Seq Expression: Pay0012215
Synteny: Pay0012215
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR041588 - Integrase zinc-binding domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR036397 - Ribonuclease H superfamily
IPR016197 - Chromo-like domain superfamily
IPR012337 - Ribonuclease H-like superfamily
IPR001584 - Integrase, catalytic core
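
The genome location field in this record uses a `chrom:start..end` notation. As a minimal sketch, the snippet below splits that string and computes the length of the span. The function name `parse_location` is hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


# Location string copied from the record for Pay0012215.
chrom, start, end = parse_location("chr02:15946239..15951617")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

If the coordinates turn out to be 0-based or half-open, the `+ 1` should be dropped.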


Homology
GenBank top hits    e value    % identity    Alignment
KAA0040689.1 pol protein [Cucumis melo var. makuwa]    0.0e+00    80.07
Query:  LQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRD
        LQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHE+HLRMVLQTLRD
Subjt:  LQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRD

Query:  NKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQ
        NKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDP KIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQ
Subjt:  NKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQ

Query:  KLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV-------------------------------------
        KLVTAPVL VPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV                                     
Subjt:  KLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV-------------------------------------

Query:  -RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQ
         RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQ
Subjt:  -RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQ

Query:  AVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLV--------------------------
        AVEFS+SSDGGLLFERRLCVPSDSA+KTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLV                          
Subjt:  AVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLV--------------------------

Query:  --------SHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGS
                +HFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQT+GQTERLNQVLEDMLRACALEFPGS
Subjt:  --------SHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGS

Query:  WDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWG------------------------------------------------------------
        WDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWG                                                            
Subjt:  WDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWG------------------------------------------------------------

Query:  -----------------------ERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKV
                               ERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIP+VKV
Subjt:  -----------------------ERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKV

Query:  LWRNHRVEEATWEREDDMKSRYPELFEE
        LWRNHRV EATWEREDDM+SRYPELFEE
Subjt:  LWRNHRVEEATWEREDDMKSRYPELFEE

KAA0040689.1 pol protein [Cucumis melo var. makuwa]    1.7e-09    74.58
Query:  MCASFGSTRPICASFGSTRLICASSGSTRLICASFGSTRLICASSGSTRLISCTSSCSG
        +CASFGSTR ICASF STRL+CAS GSTRLICASFGSTRLICA  G+TRL+ C  +  G
Subjt:  MCASFGSTRPICASFGSTRLICASSGSTRLICASFGSTRLICASSGSTRLISCTSSCSG

KAA0040689.1 pol protein [Cucumis melo var. makuwa]    0.0e+00    82.64
Query:  QGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDN
        +GATVFSKIDLRSGYHQLRIKD DVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHE+HLR+VLQTLRDN
Subjt:  QGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDN

Query:  KLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQK
        KLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKI+AVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQ+
Subjt:  KLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQK

Query:  LVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAAL
        LVTA VLT PDGSGSF+IYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV+RWLELVKDYDCEILYHPGKANVVADALSRKVSHSAAL
Subjt:  LVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAAL

Query:  ITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPF
        ITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQ NDPYLVEKRGLAEAGQAVEFSL SDGGLLFERRLCVPSDSA+KTELLSEAHSSPF
Subjt:  ITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPF

Query:  SMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLV----------------------------------SHFVPGKSTYTASKWAQLYMSEIVRLHGVPV
        SMH GSTKMYQDLKRVYWWRNMKREVAEFVSRCLV                                  +HFVPGKSTYTASKWAQLYMSEIV+LHGVPV
Subjt:  SMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLV----------------------------------SHFVPGKSTYTASKWAQLYMSEIVRLHGVPV

Query:  SIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCW
        SIVSDRDARFTSKFWKGLQTAM TRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY KCCRSPVCW
Subjt:  SIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCW

Query:  G-----------------------------------------------------------------------------------ERIGPVAYRLALPPSL
        G                                                                                   ERIGPVAYRLALPPSL
Subjt:  G-----------------------------------------------------------------------------------ERIGPVAYRLALPPSL

Query:  STVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEREDDMKSRYPELFEE
        STVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVL REVKTLRNKEIPLVKVLWRNHRVEEATWEREDDM+SRYPELFEE
Subjt:  STVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEREDDMKSRYPELFEE

KAA0040695.1 pol protein [Cucumis melo var. makuwa]    0.0e+00    80.81
Query:  LQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRD
        LQGATVFS+IDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHE+HLRMVLQTL D
Subjt:  LQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRD

Query:  NKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQ
        NKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGW RPSTVSEVRSFLGLAGYYR+FVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQ
Subjt:  NKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQ

Query:  KLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV-------------------------------------
        KLVTAPVLTV DGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV                                     
Subjt:  KLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV-------------------------------------

Query:  -RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQ
         RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQ
Subjt:  -RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQ

Query:  AVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLV--------------------------
        A EFSLSSDGGLLFER LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWW NMKREVAEFVSRCLV                          
Subjt:  AVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLV--------------------------

Query:  ------------------------------SHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQ
                                      +HFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVS+RDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQ
Subjt:  ------------------------------SHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQ

Query:  TERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWG--------------------------------------
        TERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWG                                      
Subjt:  TERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWG--------------------------------------

Query:  ---------ERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWER
                 ERIGPVAYRL LPPSLSTVHDV HVSMLRKYVPDPSHVVDYEPLEIDENLSY EQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWER
Subjt:  ---------ERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWER

Query:  EDDMKSRYPELFE
        EDDM+SRYPELFE
Subjt:  EDDMKSRYPELFE

KAA0062245.1 pol protein [Cucumis melo var. makuwa]    0.0e+00    81.28
Query:  LQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRD
        LQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHE+HLRMVLQTLRD
Subjt:  LQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRD

Query:  NKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQ
        NKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPST+SEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQ
Subjt:  NKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQ

Query:  KLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAA
        KLVTAPVLTVPDGSGSFVIYSDA KKGLGCVLMQQGKVV YASRQLKSHEQNYPTHDLELAAVRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAA
Subjt:  KLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAA

Query:  LITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSP
        LITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQA EFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSP
Subjt:  LITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSP

Query:  FSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLV--------------------------------------------------------SHFVPGKS
        FSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS+CLV                                                        +HFV GKS
Subjt:  FSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLV--------------------------------------------------------SHFVPGKS

Query:  TYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSY
        TYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSY
Subjt:  TYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSY

Query:  QATIGMAPFEALYGKCCRSPVCWG----------------------------------------------------------------------------
        QATIGMAPFEALYGKCC+SPVCWG                                                                            
Subjt:  QATIGMAPFEALYGKCCRSPVCWG----------------------------------------------------------------------------

Query:  -------ERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWERED
               ERIGP+AYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSY EQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWERED
Subjt:  -------ERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWERED

Query:  DMKSRYPELFEE
        DM+SRYP+LFEE
Subjt:  DMKSRYPELFEE

KAA0062245.1 pol protein [Cucumis melo var. makuwa]    2.6e-10    67.12
Query:  MCASFGSTRPICASFGSTRLICASSGSTRLICASFGSTRLICASSGSTRLISCTSSCSGSGPSTSCAPLQGAT
        MCASFGSTR ICASFGSTRL+ AS GSTRL+CASFGSTRL+CAS GSTRLI     C+  G S       G T
Subjt:  MCASFGSTRPICASFGSTRLICASSGSTRLICASFGSTRLICASSGSTRLISCTSSCSGSGPSTSCAPLQGAT

KAA0062245.1 pol protein [Cucumis melo var. makuwa]    0.0e+00    68.74
Query:  MCASFGSTRPICASFGSTRLICASSGSTRLICASFGSTRLICASSGSTRLISCTSSCSGSG--------------------------------PST---S
        MCASFGSTR +CASFGSTRL+CAS GSTR ICASFGSTRLICA  G+TRL+    +  G+                                 P T   S
Subjt:  MCASFGSTRPICASFGSTRLICASSGSTRLICASFGSTRLICASSGSTRLISCTSSCSGSG--------------------------------PST---S

Query:  CAP--------------------------------------------------------------------------LQGATVFSKIDLRSGYHQLRIKD
         AP                                                                          LQGATVFSKIDLRSGYHQLRIKD
Subjt:  CAP--------------------------------------------------------------------------LQGATVFSKIDLRSGYHQLRIKD

Query:  EDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHV
         DVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHE+HLR+VLQTLRDNKLYAKFSKCEFWLKQVSFLGHV
Subjt:  EDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHV

Query:  VSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDA
        VSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDA
Subjt:  VSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDA

Query:  SKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV--------------------------------------RRWLELVKDYDCEILYHPGKAN
        SKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV                                      RRWLELVKDYDCEILYHPGKAN
Subjt:  SKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV--------------------------------------RRWLELVKDYDCEILYHPGKAN

Query:  VVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSD
        VVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQ NDPYLVEKR LAEAGQAVEFSLSSDGGLLFER LCVPSD
Subjt:  VVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSD

Query:  SAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLV-------------------------------------------------
        SA KTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS+CLV                                                 
Subjt:  SAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLV-------------------------------------------------

Query:  -------SHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSW
               +HFVPGKSTYTASKWAQLYMSEIVRLH VPVSIVSD+DARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSW
Subjt:  -------SHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSW

Query:  DSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWG-------------------------------------------------------------
        DSHLHLMEFAYNNSYQATIGM PFEALYGKCCRSPVCWG                                                             
Subjt:  DSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWG-------------------------------------------------------------

Query:  ----------------------ERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKVL
                              ERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSY EQPVEVLAREVKTLRNKEIPLVKVL
Subjt:  ----------------------ERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKVL

Query:  WRNHRVEEATWEREDDMKSRYPELFEE
        WRNHRVEEATWEREDDM+SRYPELF+E
Subjt:  WRNHRVEEATWEREDDMKSRYPELFEE

TrEMBL top hits    e value    % identity    Alignment
A0A5A7TGX4 Reverse transcriptase    0.0e+00    80.81
Query:  LQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRD
        LQGATVFS+IDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHE+HLRMVLQTL D
Subjt:  LQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRD

Query:  NKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQ
        NKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGW RPSTVSEVRSFLGLAGYYR+FVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQ
Subjt:  NKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQ

Query:  KLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV-------------------------------------
        KLVTAPVLTV DGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV                                     
Subjt:  KLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV-------------------------------------

Query:  -RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQ
         RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQ
Subjt:  -RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQ

Query:  AVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLV--------------------------
        A EFSLSSDGGLLFER LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWW NMKREVAEFVSRCLV                          
Subjt:  AVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLV--------------------------

Query:  ------------------------------SHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQ
                                      +HFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVS+RDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQ
Subjt:  ------------------------------SHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQ

Query:  TERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWG--------------------------------------
        TERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWG                                      
Subjt:  TERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWG--------------------------------------

Query:  ---------ERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWER
                 ERIGPVAYRL LPPSLSTVHDV HVSMLRKYVPDPSHVVDYEPLEIDENLSY EQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWER
Subjt:  ---------ERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWER

Query:  EDDMKSRYPELFE
        EDDM+SRYPELFE
Subjt:  EDDMKSRYPELFE

A0A5A7THE6 Reverse transcriptase    0.0e+00    80.07
Query:  LQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRD
        LQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHE+HLRMVLQTLRD
Subjt:  LQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRD

Query:  NKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQ
        NKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDP KIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQ
Subjt:  NKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQ

Query:  KLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV-------------------------------------
        KLVTAPVL VPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV                                     
Subjt:  KLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV-------------------------------------

Query:  -RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQ
         RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQ
Subjt:  -RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQ

Query:  AVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLV--------------------------
        AVEFS+SSDGGLLFERRLCVPSDSA+KTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLV                          
Subjt:  AVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLV--------------------------

Query:  --------SHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGS
                +HFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQT+GQTERLNQVLEDMLRACALEFPGS
Subjt:  --------SHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGS

Query:  WDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWG------------------------------------------------------------
        WDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWG                                                            
Subjt:  WDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWG------------------------------------------------------------

Query:  -----------------------ERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKV
                               ERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIP+VKV
Subjt:  -----------------------ERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKV

Query:  LWRNHRVEEATWEREDDMKSRYPELFEE
        LWRNHRV EATWEREDDM+SRYPELFEE
Subjt:  LWRNHRVEEATWEREDDMKSRYPELFEE

A0A5A7THE6 Reverse transcriptase    8.1e-10    74.58
Query:  MCASFGSTRPICASFGSTRLICASSGSTRLICASFGSTRLICASSGSTRLISCTSSCSG
        +CASFGSTR ICASF STRL+CAS GSTRLICASFGSTRLICA  G+TRL+ C  +  G
Subjt:  MCASFGSTRPICASFGSTRLICASSGSTRLICASFGSTRLICASSGSTRLISCTSSCSG

A0A5A7THE6 Reverse transcriptase    0.0e+00    82.64
Query:  QGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDN
        +GATVFSKIDLRSGYHQLRIKD DVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHE+HLR+VLQTLRDN
Subjt:  QGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDN

Query:  KLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQK
        KLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKI+AVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQ+
Subjt:  KLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQK

Query:  LVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAAL
        LVTA VLT PDGSGSF+IYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV+RWLELVKDYDCEILYHPGKANVVADALSRKVSHSAAL
Subjt:  LVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAAL

Query:  ITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPF
        ITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQ NDPYLVEKRGLAEAGQAVEFSL SDGGLLFERRLCVPSDSA+KTELLSEAHSSPF
Subjt:  ITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPF

Query:  SMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLV----------------------------------SHFVPGKSTYTASKWAQLYMSEIVRLHGVPV
        SMH GSTKMYQDLKRVYWWRNMKREVAEFVSRCLV                                  +HFVPGKSTYTASKWAQLYMSEIV+LHGVPV
Subjt:  SMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLV----------------------------------SHFVPGKSTYTASKWAQLYMSEIVRLHGVPV

Query:  SIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCW
        SIVSDRDARFTSKFWKGLQTAM TRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY KCCRSPVCW
Subjt:  SIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCW

Query:  G-----------------------------------------------------------------------------------ERIGPVAYRLALPPSL
        G                                                                                   ERIGPVAYRLALPPSL
Subjt:  G-----------------------------------------------------------------------------------ERIGPVAYRLALPPSL

Query:  STVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEREDDMKSRYPELFEE
        STVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVL REVKTLRNKEIPLVKVLWRNHRVEEATWEREDDM+SRYPELFEE
Subjt:  STVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEREDDMKSRYPELFEE

A0A5A7V8L8 Pol protein    0.0e+00    81.28
Query:  LQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRD
        LQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHE+HLRMVLQTLRD
Subjt:  LQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRD

Query:  NKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQ
        NKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPST+SEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQ
Subjt:  NKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQ

Query:  KLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAA
        KLVTAPVLTVPDGSGSFVIYSDA KKGLGCVLMQQGKVV YASRQLKSHEQNYPTHDLELAAVRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAA
Subjt:  KLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAA

Query:  LITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSP
        LITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQA EFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSP
Subjt:  LITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSP

Query:  FSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLV--------------------------------------------------------SHFVPGKS
        FSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS+CLV                                                        +HFV GKS
Subjt:  FSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLV--------------------------------------------------------SHFVPGKS

Query:  TYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSY
        TYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSY
Subjt:  TYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSY

Query:  QATIGMAPFEALYGKCCRSPVCWG----------------------------------------------------------------------------
        QATIGMAPFEALYGKCC+SPVCWG                                                                            
Subjt:  QATIGMAPFEALYGKCCRSPVCWG----------------------------------------------------------------------------

Query:  -------ERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWERED
               ERIGP+AYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSY EQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWERED
Subjt:  -------ERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWERED

Query:  DMKSRYPELFEE
        DM+SRYP+LFEE
Subjt:  DMKSRYPELFEE

A0A5A7V8L8 Pol protein    1.2e-10    67.12
Query:  MCASFGSTRPICASFGSTRLICASSGSTRLICASFGSTRLICASSGSTRLISCTSSCSGSGPSTSCAPLQGAT
        MCASFGSTR ICASFGSTRL+ AS GSTRL+CASFGSTRL+CAS GSTRLI     C+  G S       G T
Subjt:  MCASFGSTRPICASFGSTRLICASSGSTRLICASFGSTRLICASSGSTRLISCTSSCSGSGPSTSCAPLQGAT

A0A5A7V8L8 Pol protein    0.0e+00    68.74
Query:  MCASFGSTRPICASFGSTRLICASSGSTRLICASFGSTRLICASSGSTRLISCTSSCSGSG--------------------------------PST---S
        MCASFGSTR +CASFGSTRL+CAS GSTR ICASFGSTRLICA  G+TRL+    +  G+                                 P T   S
Subjt:  MCASFGSTRPICASFGSTRLICASSGSTRLICASFGSTRLICASSGSTRLISCTSSCSGSG--------------------------------PST---S

Query:  CAP--------------------------------------------------------------------------LQGATVFSKIDLRSGYHQLRIKD
         AP                                                                          LQGATVFSKIDLRSGYHQLRIKD
Subjt:  CAP--------------------------------------------------------------------------LQGATVFSKIDLRSGYHQLRIKD

Query:  EDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHV
         DVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHE+HLR+VLQTLRDNKLYAKFSKCEFWLKQVSFLGHV
Subjt:  EDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHV

Query:  VSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDA
        VSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDA
Subjt:  VSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDA

Query:  SKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV--------------------------------------RRWLELVKDYDCEILYHPGKAN
        SKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV                                      RRWLELVKDYDCEILYHPGKAN
Subjt:  SKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV--------------------------------------RRWLELVKDYDCEILYHPGKAN

Query:  VVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSD
        VVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQ NDPYLVEKR LAEAGQAVEFSLSSDGGLLFER LCVPSD
Subjt:  VVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSD

Query:  SAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLV-------------------------------------------------
        SA KTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS+CLV                                                 
Subjt:  SAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLV-------------------------------------------------

Query:  -------SHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSW
               +HFVPGKSTYTASKWAQLYMSEIVRLH VPVSIVSD+DARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSW
Subjt:  -------SHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSW

Query:  DSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWG-------------------------------------------------------------
        DSHLHLMEFAYNNSYQATIGM PFEALYGKCCRSPVCWG                                                             
Subjt:  DSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWG-------------------------------------------------------------

Query:  ----------------------ERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKVL
                              ERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSY EQPVEVLAREVKTLRNKEIPLVKVL
Subjt:  ----------------------ERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKVL

Query:  WRNHRVEEATWEREDDMKSRYPELFEE
        WRNHRVEEATWEREDDM+SRYPELF+E
Subjt:  WRNHRVEEATWEREDDMKSRYPELFEE

SwissProt top hits    e value    % identity    Alignment
P0CT34 Transposon Tf2-1 polyprotein    1.6e-79    27.75
Query:  APLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTL
        A +QG+T+F+K+DL+S YH +R++  D  K AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH KH++ VLQ L
Subjt:  APLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTL

Query:  RDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNL
        ++  L    +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + +N+
Subjt:  RDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNL

Query:  KQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV------------------------------
        KQ LV+ PVL   D S   ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A+                              
Subjt:  KQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV------------------------------

Query:  ------------RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYL
                     RW   ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++   +ND  L
Subjt:  ------------RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYL

Query:  VEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRC---------------
        +    L    + VE ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C               
Subjt:  VEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRC---------------

Query:  ----------------------------------LVSHF------VPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLD
                                          +V  F      VP   + TA + A+++   ++   G P  I++D D  FTS+ WK         + 
Subjt:  ----------------------------------LVSHF------VPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLD

Query:  FSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY
        FS  + PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   M PFE ++
Subjt:  FSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY

P0CT35 Transposon Tf2-2 polyprotein    1.6e-79    27.75
Query:  APLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTL
        A +QG+T+F+K+DL+S YH +R++  D  K AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH KH++ VLQ L
Subjt:  APLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTL

Query:  RDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNL
        ++  L    +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + +N+
Subjt:  RDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNL

Query:  KQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV------------------------------
        KQ LV+ PVL   D S   ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A+                              
Subjt:  KQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV------------------------------

Query:  ------------RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYL
                     RW   ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++   +ND  L
Subjt:  ------------RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYL

Query:  VEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRC---------------
        +    L    + VE ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C               
Subjt:  VEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRC---------------

Query:  ----------------------------------LVSHF------VPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLD
                                          +V  F      VP   + TA + A+++   ++   G P  I++D D  FTS+ WK         + 
Subjt:  ----------------------------------LVSHF------VPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLD

Query:  FSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY
        FS  + PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   M PFE ++
Subjt:  FSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY

P0CT36 Transposon Tf2-3 polyprotein    1.6e-79    27.75
Query:  APLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTL
        A +QG+T+F+K+DL+S YH +R++  D  K AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH KH++ VLQ L
Subjt:  APLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTL

Query:  RDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNL
        ++  L    +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + +N+
Subjt:  RDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNL

Query:  KQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV------------------------------
        KQ LV+ PVL   D S   ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A+                              
Subjt:  KQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV------------------------------

Query:  ------------RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYL
                     RW   ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++   +ND  L
Subjt:  ------------RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYL

Query:  VEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRC---------------
        +    L    + VE ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C               
Subjt:  VEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRC---------------

Query:  ----------------------------------LVSHF------VPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLD
                                          +V  F      VP   + TA + A+++   ++   G P  I++D D  FTS+ WK         + 
Subjt:  ----------------------------------LVSHF------VPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLD

Query:  FSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY
        FS  + PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   M PFE ++
Subjt:  FSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY

P0CT37 Transposon Tf2-4 polyprotein    1.6e-79    27.75
Query:  APLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTL
        A +QG+T+F+K+DL+S YH +R++  D  K AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH KH++ VLQ L
Subjt:  APLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTL

Query:  RDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNL
        ++  L    +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + +N+
Subjt:  RDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNL

Query:  KQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV------------------------------
        KQ LV+ PVL   D S   ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A+                              
Subjt:  KQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV------------------------------

Query:  ------------RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYL
                     RW   ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++   +ND  L
Subjt:  ------------RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYL

Query:  VEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRC---------------
        +    L    + VE ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C               
Subjt:  VEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRC---------------

Query:  ----------------------------------LVSHF------VPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLD
                                          +V  F      VP   + TA + A+++   ++   G P  I++D D  FTS+ WK         + 
Subjt:  ----------------------------------LVSHF------VPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLD

Query:  FSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY
        FS  + PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   M PFE ++
Subjt:  FSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY

P0CT41 Transposon Tf2-12 polyprotein    1.6e-79    27.75
Query:  APLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTL
        A +QG+T+F+K+DL+S YH +R++  D  K AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH KH++ VLQ L
Subjt:  APLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTL

Query:  RDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNL
        ++  L    +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + +N+
Subjt:  RDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNL

Query:  KQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV------------------------------
        KQ LV+ PVL   D S   ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A+                              
Subjt:  KQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV------------------------------

Query:  ------------RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYL
                     RW   ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I          + Q+++    + +++   +ND  L
Subjt:  ------------RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYL

Query:  VEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRC---------------
        +    L    + VE ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C               
Subjt:  VEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRC---------------

Query:  ----------------------------------LVSHF------VPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLD
                                          +V  F      VP   + TA + A+++   ++   G P  I++D D  FTS+ WK         + 
Subjt:  ----------------------------------LVSHF------VPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLD

Query:  FSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY
        FS  + PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   M PFE ++
Subjt:  FSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY

Arabidopsis top hits    e value    %identity    Alignment
ATMG00860.1 DNA/RNA polymerases superfamily protein    1.9e-27    45.8
Query:  HLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLG--HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVW
        HL MVLQ    ++ YA   KC F   Q+++LG  H++S  GVS DPAK+EA+ GW  P   +E+R FLGL GYYRRFV+N+ +I  PLT+L +K +   W
Subjt:  HLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLG--HVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVW

Query:  SKACEDSFQNLKQKLVTAPVLTVPDGSGSFV
        ++    +F+ LK  + T PVL +PD    FV
Subjt:  SKACEDSFQNLKQKLVTAPVLTVPDGSGSFV


Sequences
CDS sequence
ATGTGTGCATCCTTCGGGAGCACTAGACCGATATGTGCATCCTTCGGGAGCACAAGACTGATATGTGCATCCTCCGGGAGCACTAGACTGATATGTGCATCCTTCGGGAG
CACGAGACTGATATGTGCATCCTCCGGGAGCACTAGACTGATCTCCTGCACCAGCTCCTGCTCCGGCTCCGGCCCCAGTACCAGTTGCGCCCCATTACAGGGAGCCACAG
TGTTCTCTAAGATTGATCTTCGGTCGGGATACCATCAGCTGAGGATTAAGGATGAGGATGTACCGAAGACAGCATTTCGTTCCAGATATGGACACTACGAGTTTATTGTG
ATGTCTTTTGGTTTGACGAATGCTCCAGCAGTGTTTATGGACTTGATGAACAGAGTGTTTAGGGAGTTCCTAGATACTTTTGTGATCGTGTTTATCGACGACATCTTGAT
ATACTCCAAGACGGAGGCCGAACACGAGAAGCACTTACGTATGGTTTTGCAAACACTTCGGGATAATAAGTTGTATGCAAAGTTCTCGAAATGTGAGTTTTGGCTGAAGC
AGGTGTCCTTTCTGGGTCACGTGGTTTCTAAGGCTGGGGTCTCTGTGGATCCAGCTAAGATAGAGGCAGTCACCGGTTGGACCCGACCTTCCACTGTCAGTGAGGTTCGT
AGCTTTCTGGGTTTAGCAGGTTATTATCGACGGTTTGTGGAGAACTTTTCTCGTATAGCTACTCCTCTCACTCAGTTGACCAGGAAGGGAGCTCCTTTTGTTTGGAGCAA
GGCATGTGAGGACAGTTTCCAGAACCTTAAACAGAAGCTAGTTACCGCACCGGTTCTTACTGTACCTGATGGTTCTGGCAGTTTTGTGATTTATAGTGATGCTTCCAAGA
AGGGTTTGGGTTGTGTTTTGATGCAGCAGGGTAAGGTGGTCGCTTATGCGTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACACATGATCTAGAGTTGGCAGCA
GTGCGAAGGTGGCTTGAGTTAGTGAAGGATTACGATTGTGAGATACTGTATCATCCAGGCAAGGCGAATGTGGTAGCTGATGCTCTTAGTAGAAAGGTATCACATTCGGC
AGCACTTATTACCCGACAGGCCCCATTGCATCGGGATCTCGAGCGGGCTGAGATTGCAGTGTCAGTGGGGGCAGTTACTATGCAGTTAGCCCAGTTGACGGTACAGCCGA
CTTTGAGGCAAAGGATCATTGATGCTCAGAGTAACGATCCTTATCTGGTTGAGAAACGTGGCCTAGCAGAAGCAGGGCAAGCGGTTGAGTTCTCATTATCCTCTGATGGT
GGACTTTTGTTCGAGAGACGCCTCTGTGTGCCGTCGGATAGTGCGGTTAAGACAGAATTATTATCTGAGGCTCACAGTTCCCCATTTTCCATGCACCCGGGTAGTACGAA
GATGTATCAGGATCTGAAGCGGGTTTATTGGTGGCGTAACATGAAGAGGGAAGTAGCAGAATTTGTTAGTAGATGCTTGGTGTCACACTTTGTTCCGGGTAAATCCACCT
ATACTGCTAGTAAGTGGGCACAGTTGTACATGTCCGAGATAGTGAGGTTGCATGGGGTGCCAGTGTCGATTGTTTCTGATAGAGATGCCCGTTTCACTTCCAAATTCTGG
AAGGGTTTGCAGACTGCTATGGGCACGAGGTTGGACTTTAGTACGGCTTTCCATCCACAGACTGACGGTCAGACTGAGCGTCTGAACCAGGTTTTAGAGGATATGTTGCG
AGCGTGTGCATTGGAATTTCCAGGTAGCTGGGACTCCCACTTACATTTGATGGAATTTGCTTATAATAACAGTTATCAGGCCACTATCGGCATGGCACCATTTGAGGCCT
TGTATGGCAAATGTTGTAGATCCCCGGTTTGTTGGGGTGAACGGATTGGCCCTGTAGCTTATCGCTTGGCGTTGCCACCATCACTCTCGACAGTTCATGATGTGTTCCAT
GTCTCCATGTTGAGGAAGTACGTGCCAGATCCATCCCATGTAGTGGATTACGAGCCACTGGAAATTGATGAAAACTTGAGCTATACCGAACAACCCGTTGAGGTGCTGGC
TAGAGAGGTGAAAACGTTGAGGAATAAAGAAATTCCTTTGGTTAAAGTCTTATGGCGGAATCACCGGGTGGAAGAGGCTACATGGGAGCGTGAAGATGACATGAAGTCCC
GTTACCCCGAGCTGTTCGAGGAATAA
mRNA sequence
ATGTGTGCATCCTTCGGGAGCACTAGACCGATATGTGCATCCTTCGGGAGCACAAGACTGATATGTGCATCCTCCGGGAGCACTAGACTGATATGTGCATCCTTCGGGAG
CACGAGACTGATATGTGCATCCTCCGGGAGCACTAGACTGATCTCCTGCACCAGCTCCTGCTCCGGCTCCGGCCCCAGTACCAGTTGCGCCCCATTACAGGGAGCCACAG
TGTTCTCTAAGATTGATCTTCGGTCGGGATACCATCAGCTGAGGATTAAGGATGAGGATGTACCGAAGACAGCATTTCGTTCCAGATATGGACACTACGAGTTTATTGTG
ATGTCTTTTGGTTTGACGAATGCTCCAGCAGTGTTTATGGACTTGATGAACAGAGTGTTTAGGGAGTTCCTAGATACTTTTGTGATCGTGTTTATCGACGACATCTTGAT
ATACTCCAAGACGGAGGCCGAACACGAGAAGCACTTACGTATGGTTTTGCAAACACTTCGGGATAATAAGTTGTATGCAAAGTTCTCGAAATGTGAGTTTTGGCTGAAGC
AGGTGTCCTTTCTGGGTCACGTGGTTTCTAAGGCTGGGGTCTCTGTGGATCCAGCTAAGATAGAGGCAGTCACCGGTTGGACCCGACCTTCCACTGTCAGTGAGGTTCGT
AGCTTTCTGGGTTTAGCAGGTTATTATCGACGGTTTGTGGAGAACTTTTCTCGTATAGCTACTCCTCTCACTCAGTTGACCAGGAAGGGAGCTCCTTTTGTTTGGAGCAA
GGCATGTGAGGACAGTTTCCAGAACCTTAAACAGAAGCTAGTTACCGCACCGGTTCTTACTGTACCTGATGGTTCTGGCAGTTTTGTGATTTATAGTGATGCTTCCAAGA
AGGGTTTGGGTTGTGTTTTGATGCAGCAGGGTAAGGTGGTCGCTTATGCGTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACACATGATCTAGAGTTGGCAGCA
GTGCGAAGGTGGCTTGAGTTAGTGAAGGATTACGATTGTGAGATACTGTATCATCCAGGCAAGGCGAATGTGGTAGCTGATGCTCTTAGTAGAAAGGTATCACATTCGGC
AGCACTTATTACCCGACAGGCCCCATTGCATCGGGATCTCGAGCGGGCTGAGATTGCAGTGTCAGTGGGGGCAGTTACTATGCAGTTAGCCCAGTTGACGGTACAGCCGA
CTTTGAGGCAAAGGATCATTGATGCTCAGAGTAACGATCCTTATCTGGTTGAGAAACGTGGCCTAGCAGAAGCAGGGCAAGCGGTTGAGTTCTCATTATCCTCTGATGGT
GGACTTTTGTTCGAGAGACGCCTCTGTGTGCCGTCGGATAGTGCGGTTAAGACAGAATTATTATCTGAGGCTCACAGTTCCCCATTTTCCATGCACCCGGGTAGTACGAA
GATGTATCAGGATCTGAAGCGGGTTTATTGGTGGCGTAACATGAAGAGGGAAGTAGCAGAATTTGTTAGTAGATGCTTGGTGTCACACTTTGTTCCGGGTAAATCCACCT
ATACTGCTAGTAAGTGGGCACAGTTGTACATGTCCGAGATAGTGAGGTTGCATGGGGTGCCAGTGTCGATTGTTTCTGATAGAGATGCCCGTTTCACTTCCAAATTCTGG
AAGGGTTTGCAGACTGCTATGGGCACGAGGTTGGACTTTAGTACGGCTTTCCATCCACAGACTGACGGTCAGACTGAGCGTCTGAACCAGGTTTTAGAGGATATGTTGCG
AGCGTGTGCATTGGAATTTCCAGGTAGCTGGGACTCCCACTTACATTTGATGGAATTTGCTTATAATAACAGTTATCAGGCCACTATCGGCATGGCACCATTTGAGGCCT
TGTATGGCAAATGTTGTAGATCCCCGGTTTGTTGGGGTGAACGGATTGGCCCTGTAGCTTATCGCTTGGCGTTGCCACCATCACTCTCGACAGTTCATGATGTGTTCCAT
GTCTCCATGTTGAGGAAGTACGTGCCAGATCCATCCCATGTAGTGGATTACGAGCCACTGGAAATTGATGAAAACTTGAGCTATACCGAACAACCCGTTGAGGTGCTGGC
TAGAGAGGTGAAAACGTTGAGGAATAAAGAAATTCCTTTGGTTAAAGTCTTATGGCGGAATCACCGGGTGGAAGAGGCTACATGGGAGCGTGAAGATGACATGAAGTCCC
GTTACCCCGAGCTGTTCGAGGAATAA
Protein sequence
MCASFGSTRPICASFGSTRLICASSGSTRLICASFGSTRLICASSGSTRLISCTSSCSGSGPSTSCAPLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIV
MSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEKHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVR
SFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAA
VRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDG
GLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCLVSHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFW
KGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGERIGPVAYRLALPPSLSTVHDVFH
VSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEREDDMKSRYPELFEE