CuGenDBv2

Pay0010474 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0010474
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Reverse transcriptase
Genome location: chr02:15848857..15853181
RNA-Seq Expression: Pay0010474
Synteny: Pay0010474
Gene Ontology terms:
  GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
  GO:0006508 - proteolysis (biological process)
  GO:0015074 - DNA integration (biological process)
  GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
  GO:0004519 - endonuclease activity (molecular function)
  GO:0004190 - aspartic-type endopeptidase activity (molecular function)
  GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
InterPro domains:
  IPR000477 - Reverse transcriptase domain
  IPR043502 - DNA/RNA polymerase superfamily
  IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
  IPR041588 - Integrase zinc-binding domain
  IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
  IPR036397 - Ribonuclease H superfamily
  IPR016197 - Chromo-like domain superfamily
  IPR012337 - Ribonuclease H-like superfamily
  IPR001584 - Integrase, catalytic core


Homology
GenBank top hits (e value, % identity, alignment)
KAA0040689.1 pol protein [Cucumis melo var. makuwa] | e value: 0.0e+00 | % identity: 77.15
Query:  RIERTEGTLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLR
        RI+     LQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLR
Subjt:  RIERTEGTLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLR

Query:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE
        MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSK GVSVDP KIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE
Subjt:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE

Query:  DSFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV-----------------------------
        DSFQ LKQKLVTAPVL VPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV                             
Subjt:  DSFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV-----------------------------

Query:  ---------RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQKIVDAQSNDPYLVEK
                 RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQ+I+DAQSNDPYLVEK
Subjt:  ---------RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQKIVDAQSNDPYLVEK

Query:  RGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVC-----------------
        RGLAEAGQ  EFS+SSDGGLLFERRLCVPSDSA+KTELL+EAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS+CLVC                 
Subjt:  RGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVC-----------------

Query:  ----------------------KSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRA
                              KSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQT+GQTERLNQVLEDMLRA
Subjt:  ----------------------KSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRA

Query:  CALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWDE---------------------------------------------------
        CALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCW E                                                   
Subjt:  CALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWDE---------------------------------------------------

Query:  ------------------------------------VAYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRN
                                            VAYRLALP SLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSY EQPVEVLAREVKTLRN
Subjt:  ------------------------------------VAYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRN

Query:  RQIPLVKVLWRNHRVEEATWEREDDMRSRYPELFEK
        ++IP+VKVLWRNHRV EATWEREDDMRSRYPELFE+
Subjt:  RQIPLVKVLWRNHRVEEATWEREDDMRSRYPELFEK

KAA0040695.1 pol protein [Cucumis melo var. makuwa] | e value: 0.0e+00 | % identity: 67.27
Query:  MDWLAANHASIDCSRKEVTFNPPSMAICHRVGAGHGSYIQ------------------------------------------------------------
        MDWLAANHASIDCSRKEVTFNPPS+A     G G  S  Q                                                            
Subjt:  MDWLAANHASIDCSRKEVTFNPPSMAICHRVGAGHGSYIQ------------------------------------------------------------

Query:  --------SPLQNGPRR-----------------------------------IERTEGT------------------------------LQGATVFSKID
                 P+   P R                                   +++ +G+                              LQGATVFS+ID
Subjt:  --------SPLQNGPRR-----------------------------------IERTEGT------------------------------LQGATVFSKID

Query:  LRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCE
        LRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTL DNKLYAKFSKCE
Subjt:  LRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCE

Query:  FWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKLVTAPVLTVP
        FWLKQVSFLGHVVSK GVSVDPAKIEAVTGW RPSTVSEVRSFLGLAGYYR+FVENFSRIATPLTQLTRKGAPFVWSKACEDSFQ LKQKLVTAPVLTV 
Subjt:  FWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKLVTAPVLTVP

Query:  DGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV--------------------------------------RRWLELVKDY
        DGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV                                      RRWLELVKDY
Subjt:  DGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV--------------------------------------RRWLELVKDY

Query:  DCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQKIVDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGG
        DCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQ+I+DAQSNDPYLVEKRGLAEAGQ AEFSLSSDGG
Subjt:  DCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQKIVDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGG

Query:  LLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVC------------------------------------
        LLFER LCVPSDSAVKTELL+EAHSSPFSMHPGSTKMYQDLKRVYWW NMKREVAEFVS+CLVC                                    
Subjt:  LLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVC------------------------------------

Query:  -------------------------KSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDM
                                 KSTYTASKWAQLYMSEIVRLHGVPVSIVS+RDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDM
Subjt:  -------------------------KSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDM

Query:  LRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWDE------------------------------------------------
        LRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCW E                                                
Subjt:  LRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWDE------------------------------------------------

Query:  ---VAYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNRQIPLVKVLWRNHRVEEATWEREDDMRSRYPEL
           VAYRL LP SLSTVHDV HVSMLRKYVPDPSHVVDYEPLEIDENLSY EQPVEVLAREVKTLRN++IPLVKVLWRNHRVEEATWEREDDMRSRYPEL
Subjt:  ---VAYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNRQIPLVKVLWRNHRVEEATWEREDDMRSRYPEL

Query:  FE
        FE
Subjt:  FE

KAA0059807.1 pol protein [Cucumis melo var. makuwa] | e value: 0.0e+00 | % identity: 72
Query:  MDWLAANHASIDCSRKEVTFNPPSMAICHRVGAGHGSYIQ----------------------------------SPLQNGPRR-----------------
        MDWLAANHASIDCSRKEVTFNPPSMA     G G  S  Q                                   P+   P R                 
Subjt:  MDWLAANHASIDCSRKEVTFNPPSMAICHRVGAGHGSYIQ----------------------------------SPLQNGPRR-----------------

Query:  ------------------IERTEGTL----------QGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREF
                          +++ +G++          +GATVFSKIDLRSGYHQLRIKD DVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREF
Subjt:  ------------------IERTEGTL----------QGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREF

Query:  LDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFV
        LDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSK GVSVDPAKI+AVTGWTRPSTVSEVRSFLGLAGYYRRFV
Subjt:  LDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFV

Query:  ENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVRR
        ENFSRIATPLTQLTRKGAPFVWSKACEDSFQ LKQ+LVTA VLT PDGSGSF+IYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV+R
Subjt:  ENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVRR

Query:  WLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQKIVDAQSNDPYLVEKRGLAEAGQTAE
        WLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQ+I+DAQ NDPYLVEKRGLAEAGQ  E
Subjt:  WLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQKIVDAQSNDPYLVEKRGLAEAGQTAE

Query:  FSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVC----------------------------
        FSL SDGGLLFERRLCVPSDSA+KTELL+EAHSSPFSMH GSTKMYQDLKRVYWWRNMKREVAEFVS+CLVC                            
Subjt:  FSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVC----------------------------

Query:  -----------KSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDS
                   KSTYTASKWAQLYMSEIV+LHGVPVSIVSDRDARFTSKFWKGLQTAM TRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDS
Subjt:  -----------KSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDS

Query:  HLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWDE--------------------------------------------------------------
        HLHLMEFAYNNSYQATIGMAPFEALY KCCRSPVCW E                                                              
Subjt:  HLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWDE--------------------------------------------------------------

Query:  -------------------------VAYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNRQIPLVKVLWR
                                 VAYRLALP SLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSY EQPVEVL REVKTLRN++IPLVKVLWR
Subjt:  -------------------------VAYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNRQIPLVKVLWR

Query:  NHRVEEATWEREDDMRSRYPELFEK
        NHRVEEATWEREDDMRSRYPELFE+
Subjt:  NHRVEEATWEREDDMRSRYPELFEK

KAA0062245.1 pol protein [Cucumis melo var. makuwa] | e value: 0.0e+00 | % identity: 67.93
Query:  MDWLAANHASIDCSRKEVTFNPPSMAICHRVGAGHGSYIQ------------------------------------------------------------
        MDWL ANHASIDCSRKEVTFNPPSMA     G G  S  Q                                                            
Subjt:  MDWLAANHASIDCSRKEVTFNPPSMAICHRVGAGHGSYIQ------------------------------------------------------------

Query:  --------SPLQNGPRR-----------------------------------IERTEGT------------------------------LQGATVFSKID
                 P+   P R                                   +++ +G+                              LQGATVFSKID
Subjt:  --------SPLQNGPRR-----------------------------------IERTEGT------------------------------LQGATVFSKID

Query:  LRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCE
        LRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCE
Subjt:  LRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCE

Query:  FWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKLVTAPVLTVP
        FWLKQVSFLGHVVSK GVSVDPAKIEAVTGWTRPST+SEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQ LKQKLVTAPVLTVP
Subjt:  FWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKLVTAPVLTVP

Query:  DGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRD
        DGSGSFVIYSDA KKGLGCVLMQQGKVV YASRQLKSHEQNYPTHDLELAAVRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRD
Subjt:  DGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRD

Query:  LERAEIAVSVGAVTMQLAQLTVQPTLRQKIVDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMY
        LERAEIAVSVGAVTMQLAQLTVQPTLRQ+I+DAQSNDPYLVEKRGLAEAGQ AEFSLSSDGGLLFERRLCVPSDSAVKTELL+EAHSSPFSMHPGSTKMY
Subjt:  LERAEIAVSVGAVTMQLAQLTVQPTLRQKIVDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMY

Query:  QDLKRVYWWRNMKREVAEFVSKCLVC-------------------------------------------------------------KSTYTASKWAQLY
        QDLKRVYWWRNMKREVAEFVSKCLVC                                                             KSTYTASKWAQLY
Subjt:  QDLKRVYWWRNMKREVAEFVSKCLVC-------------------------------------------------------------KSTYTASKWAQLY

Query:  MSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEA
        MSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEA
Subjt:  MSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEA

Query:  LYGKCCRSPVCWDEV-------------------------------------------------------------------------------------
        LYGKCC+SPVCW EV                                                                                     
Subjt:  LYGKCCRSPVCWDEV-------------------------------------------------------------------------------------

Query:  --AYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNRQIPLVKVLWRNHRVEEATWEREDDMRSRYPELFE
          AYRLALP SLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSY EQPVEVLAREVKTLRN++IPLVKVLWRNHRVEEATWEREDDMRSRYP+LFE
Subjt:  --AYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNRQIPLVKVLWRNHRVEEATWEREDDMRSRYPELFE

Query:  K
        +
Subjt:  K

KAA0066456.1 pol protein [Cucumis melo var. makuwa] | e value: 0.0e+00 | % identity: 66.6
Query:  MDWLAANHASIDCSRKEVTFNPPSMAICHRVGAGHGSYIQ------------------------------------------------------------
        MDWLAANHASIDCSRKEVTFNPPSMA     G G  S  Q                                                            
Subjt:  MDWLAANHASIDCSRKEVTFNPPSMAICHRVGAGHGSYIQ------------------------------------------------------------

Query:  --------SPLQNGPRRIERTE-----------------------------------GTLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFI
                 P+   P R+   E                                     LQGATVFSKIDLRSGYHQLRIKDEDVPKT FRSRYGHYEFI
Subjt:  --------SPLQNGPRRIERTE-----------------------------------GTLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFI

Query:  VMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTG
        VMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRD KLYAKFSKCEFWLKQVSFLGHVVSK GVSVDPAKIEAVTG
Subjt:  VMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTG

Query:  WTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAY
        WTRPSTVSEVRSFLGL GYYRRFVENFSRIATPLTQLTRKG PFVWSKACEDSFQ LKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAY
Subjt:  WTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAY

Query:  ASRQLKSHEQNYPTHDLELAAV--------------------------------------RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALIT
        ASRQLKSHEQNYPTHDLELAAV                                      RRWLELVKDYDCEILYHPGKANVVADALSRKVSHS ALIT
Subjt:  ASRQLKSHEQNYPTHDLELAAV--------------------------------------RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALIT

Query:  RQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQKIVDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSM
        RQAPLHRDLERAEIAVS+GAVTMQLA+L VQPTLRQ+I+DAQ NDPYLVEKRGL EAGQTAEFSLSSDGGLLFERRLCVPSDSAVK ELL+EAHSSPFSM
Subjt:  RQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQKIVDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSM

Query:  HPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVC-------------------------------------------------------------KSTYT
        HPGSTK+YQDLKRVYWWRNMKREVAEFVSKCLVC                                                             KSTYT
Subjt:  HPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVC-------------------------------------------------------------KSTYT

Query:  ASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQAT
        ASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQAT
Subjt:  ASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQAT

Query:  IGMAPFEALYGKCCRSPVCWDE------------------------------------------------------------------------------
        IGMAPFEALYG+CCRSPVCW E                                                                              
Subjt:  IGMAPFEALYGKCCRSPVCWDE------------------------------------------------------------------------------

Query:  ---------VAYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNRQIPLVKVLWRNHRVEEATWEREDDMR
                 VAYRLALP SLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSY E+PV+VLAREVKTLRN++IPLVKVLWRNHRVEEATWE EDDMR
Subjt:  ---------VAYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNRQIPLVKVLWRNHRVEEATWEREDDMR

Query:  SRYPELFEK
        SRYPELFEK
Subjt:  SRYPELFEK

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TGX4 Reverse transcriptase | e value: 0.0e+00 | % identity: 67.27
Query:  MDWLAANHASIDCSRKEVTFNPPSMAICHRVGAGHGSYIQ------------------------------------------------------------
        MDWLAANHASIDCSRKEVTFNPPS+A     G G  S  Q                                                            
Subjt:  MDWLAANHASIDCSRKEVTFNPPSMAICHRVGAGHGSYIQ------------------------------------------------------------

Query:  --------SPLQNGPRR-----------------------------------IERTEGT------------------------------LQGATVFSKID
                 P+   P R                                   +++ +G+                              LQGATVFS+ID
Subjt:  --------SPLQNGPRR-----------------------------------IERTEGT------------------------------LQGATVFSKID

Query:  LRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCE
        LRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTL DNKLYAKFSKCE
Subjt:  LRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCE

Query:  FWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKLVTAPVLTVP
        FWLKQVSFLGHVVSK GVSVDPAKIEAVTGW RPSTVSEVRSFLGLAGYYR+FVENFSRIATPLTQLTRKGAPFVWSKACEDSFQ LKQKLVTAPVLTV 
Subjt:  FWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKLVTAPVLTVP

Query:  DGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV--------------------------------------RRWLELVKDY
        DGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV                                      RRWLELVKDY
Subjt:  DGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV--------------------------------------RRWLELVKDY

Query:  DCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQKIVDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGG
        DCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQ+I+DAQSNDPYLVEKRGLAEAGQ AEFSLSSDGG
Subjt:  DCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQKIVDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGG

Query:  LLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVC------------------------------------
        LLFER LCVPSDSAVKTELL+EAHSSPFSMHPGSTKMYQDLKRVYWW NMKREVAEFVS+CLVC                                    
Subjt:  LLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVC------------------------------------

Query:  -------------------------KSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDM
                                 KSTYTASKWAQLYMSEIVRLHGVPVSIVS+RDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDM
Subjt:  -------------------------KSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDM

Query:  LRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWDE------------------------------------------------
        LRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCW E                                                
Subjt:  LRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWDE------------------------------------------------

Query:  ---VAYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNRQIPLVKVLWRNHRVEEATWEREDDMRSRYPEL
           VAYRL LP SLSTVHDV HVSMLRKYVPDPSHVVDYEPLEIDENLSY EQPVEVLAREVKTLRN++IPLVKVLWRNHRVEEATWEREDDMRSRYPEL
Subjt:  ---VAYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNRQIPLVKVLWRNHRVEEATWEREDDMRSRYPEL

Query:  FE
        FE
Subjt:  FE

A0A5A7THE6 Reverse transcriptase | e value: 0.0e+00 | % identity: 77.15
Query:  RIERTEGTLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLR
        RI+     LQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLR
Subjt:  RIERTEGTLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLR

Query:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE
        MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSK GVSVDP KIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE
Subjt:  MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE

Query:  DSFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV-----------------------------
        DSFQ LKQKLVTAPVL VPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV                             
Subjt:  DSFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV-----------------------------

Query:  ---------RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQKIVDAQSNDPYLVEK
                 RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQ+I+DAQSNDPYLVEK
Subjt:  ---------RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQKIVDAQSNDPYLVEK

Query:  RGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVC-----------------
        RGLAEAGQ  EFS+SSDGGLLFERRLCVPSDSA+KTELL+EAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS+CLVC                 
Subjt:  RGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVC-----------------

Query:  ----------------------KSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRA
                              KSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQT+GQTERLNQVLEDMLRA
Subjt:  ----------------------KSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRA

Query:  CALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWDE---------------------------------------------------
        CALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCW E                                                   
Subjt:  CALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWDE---------------------------------------------------

Query:  ------------------------------------VAYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRN
                                            VAYRLALP SLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSY EQPVEVLAREVKTLRN
Subjt:  ------------------------------------VAYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRN

Query:  RQIPLVKVLWRNHRVEEATWEREDDMRSRYPELFEK
        ++IP+VKVLWRNHRV EATWEREDDMRSRYPELFE+
Subjt:  RQIPLVKVLWRNHRVEEATWEREDDMRSRYPELFEK

A0A5A7UYS8 Pol protein | e value: 0.0e+00 | % identity: 72
Query:  MDWLAANHASIDCSRKEVTFNPPSMAICHRVGAGHGSYIQ----------------------------------SPLQNGPRR-----------------
        MDWLAANHASIDCSRKEVTFNPPSMA     G G  S  Q                                   P+   P R                 
Subjt:  MDWLAANHASIDCSRKEVTFNPPSMAICHRVGAGHGSYIQ----------------------------------SPLQNGPRR-----------------

Query:  ------------------IERTEGTL----------QGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREF
                          +++ +G++          +GATVFSKIDLRSGYHQLRIKD DVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREF
Subjt:  ------------------IERTEGTL----------QGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREF

Query:  LDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFV
        LDTFVIVFIDDILIYSKTEAEHEEHLR+VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSK GVSVDPAKI+AVTGWTRPSTVSEVRSFLGLAGYYRRFV
Subjt:  LDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFV

Query:  ENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVRR
        ENFSRIATPLTQLTRKGAPFVWSKACEDSFQ LKQ+LVTA VLT PDGSGSF+IYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAV+R
Subjt:  ENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVRR

Query:  WLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQKIVDAQSNDPYLVEKRGLAEAGQTAE
        WLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQ+I+DAQ NDPYLVEKRGLAEAGQ  E
Subjt:  WLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQKIVDAQSNDPYLVEKRGLAEAGQTAE

Query:  FSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVC----------------------------
        FSL SDGGLLFERRLCVPSDSA+KTELL+EAHSSPFSMH GSTKMYQDLKRVYWWRNMKREVAEFVS+CLVC                            
Subjt:  FSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVC----------------------------

Query:  -----------KSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDS
                   KSTYTASKWAQLYMSEIV+LHGVPVSIVSDRDARFTSKFWKGLQTAM TRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDS
Subjt:  -----------KSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDS

Query:  HLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWDE--------------------------------------------------------------
        HLHLMEFAYNNSYQATIGMAPFEALY KCCRSPVCW E                                                              
Subjt:  HLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWDE--------------------------------------------------------------

Query:  -------------------------VAYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNRQIPLVKVLWR
                                 VAYRLALP SLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSY EQPVEVL REVKTLRN++IPLVKVLWR
Subjt:  -------------------------VAYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNRQIPLVKVLWR

Query:  NHRVEEATWEREDDMRSRYPELFEK
        NHRVEEATWEREDDMRSRYPELFE+
Subjt:  NHRVEEATWEREDDMRSRYPELFEK

A0A5A7V8L8 Pol protein | e value: 0.0e+00 | % identity: 67.93
Query:  MDWLAANHASIDCSRKEVTFNPPSMAICHRVGAGHGSYIQ------------------------------------------------------------
        MDWL ANHASIDCSRKEVTFNPPSMA     G G  S  Q                                                            
Subjt:  MDWLAANHASIDCSRKEVTFNPPSMAICHRVGAGHGSYIQ------------------------------------------------------------

Query:  --------SPLQNGPRR-----------------------------------IERTEGT------------------------------LQGATVFSKID
                 P+   P R                                   +++ +G+                              LQGATVFSKID
Subjt:  --------SPLQNGPRR-----------------------------------IERTEGT------------------------------LQGATVFSKID

Query:  LRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCE
        LRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCE
Subjt:  LRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCE

Query:  FWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKLVTAPVLTVP
        FWLKQVSFLGHVVSK GVSVDPAKIEAVTGWTRPST+SEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQ LKQKLVTAPVLTVP
Subjt:  FWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKLVTAPVLTVP

Query:  DGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRD
        DGSGSFVIYSDA KKGLGCVLMQQGKVV YASRQLKSHEQNYPTHDLELAAVRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRD
Subjt:  DGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVRRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRD

Query:  LERAEIAVSVGAVTMQLAQLTVQPTLRQKIVDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMY
        LERAEIAVSVGAVTMQLAQLTVQPTLRQ+I+DAQSNDPYLVEKRGLAEAGQ AEFSLSSDGGLLFERRLCVPSDSAVKTELL+EAHSSPFSMHPGSTKMY
Subjt:  LERAEIAVSVGAVTMQLAQLTVQPTLRQKIVDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSMHPGSTKMY

Query:  QDLKRVYWWRNMKREVAEFVSKCLVC-------------------------------------------------------------KSTYTASKWAQLY
        QDLKRVYWWRNMKREVAEFVSKCLVC                                                             KSTYTASKWAQLY
Subjt:  QDLKRVYWWRNMKREVAEFVSKCLVC-------------------------------------------------------------KSTYTASKWAQLY

Query:  MSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEA
        MSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEA
Subjt:  MSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEA

Query:  LYGKCCRSPVCWDEV-------------------------------------------------------------------------------------
        LYGKCC+SPVCW EV                                                                                     
Subjt:  LYGKCCRSPVCWDEV-------------------------------------------------------------------------------------

Query:  --AYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNRQIPLVKVLWRNHRVEEATWEREDDMRSRYPELFE
          AYRLALP SLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSY EQPVEVLAREVKTLRN++IPLVKVLWRNHRVEEATWEREDDMRSRYP+LFE
Subjt:  --AYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNRQIPLVKVLWRNHRVEEATWEREDDMRSRYPELFE

Query:  K
        +
Subjt:  K

A0A5A7VJE2 Reverse transcriptase | e value: 0.0e+00 | % identity: 66.6
Query:  MDWLAANHASIDCSRKEVTFNPPSMAICHRVGAGHGSYIQ------------------------------------------------------------
        MDWLAANHASIDCSRKEVTFNPPSMA     G G  S  Q                                                            
Subjt:  MDWLAANHASIDCSRKEVTFNPPSMAICHRVGAGHGSYIQ------------------------------------------------------------

Query:  --------SPLQNGPRRIERTE-----------------------------------GTLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFI
                 P+   P R+   E                                     LQGATVFSKIDLRSGYHQLRIKDEDVPKT FRSRYGHYEFI
Subjt:  --------SPLQNGPRRIERTE-----------------------------------GTLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFI

Query:  VMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTG
        VMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRD KLYAKFSKCEFWLKQVSFLGHVVSK GVSVDPAKIEAVTG
Subjt:  VMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTG

Query:  WTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAY
        WTRPSTVSEVRSFLGL GYYRRFVENFSRIATPLTQLTRKG PFVWSKACEDSFQ LKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAY
Subjt:  WTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAY

Query:  ASRQLKSHEQNYPTHDLELAAV--------------------------------------RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALIT
        ASRQLKSHEQNYPTHDLELAAV                                      RRWLELVKDYDCEILYHPGKANVVADALSRKVSHS ALIT
Subjt:  ASRQLKSHEQNYPTHDLELAAV--------------------------------------RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALIT

Query:  RQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQKIVDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSM
        RQAPLHRDLERAEIAVS+GAVTMQLA+L VQPTLRQ+I+DAQ NDPYLVEKRGL EAGQTAEFSLSSDGGLLFERRLCVPSDSAVK ELL+EAHSSPFSM
Subjt:  RQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQKIVDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLAEAHSSPFSM

Query:  HPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVC-------------------------------------------------------------KSTYT
        HPGSTK+YQDLKRVYWWRNMKREVAEFVSKCLVC                                                             KSTYT
Subjt:  HPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVC-------------------------------------------------------------KSTYT

Query:  ASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQAT
        ASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQAT
Subjt:  ASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQAT

Query:  IGMAPFEALYGKCCRSPVCWDE------------------------------------------------------------------------------
        IGMAPFEALYG+CCRSPVCW E                                                                              
Subjt:  IGMAPFEALYGKCCRSPVCWDE------------------------------------------------------------------------------

Query:  ---------VAYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNRQIPLVKVLWRNHRVEEATWEREDDMR
                 VAYRLALP SLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSY E+PV+VLAREVKTLRN++IPLVKVLWRNHRVEEATWE EDDMR
Subjt:  ---------VAYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYVEQPVEVLAREVKTLRNRQIPLVKVLWRNHRVEEATWEREDDMR

Query:  SRYPELFEK
        SRYPELFEK
Subjt:  SRYPELFEK

SwissProt top hits (columns: hit, e value, % identity, alignment)
P0CT34 Transposon Tf2-1 polyprotein, e value 3.2e-77, 26.95% identity
Query:  IERTEGTLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRM
        IE+    +QG+T+F+K+DL+S YH +R++  D  K AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ 
Subjt:  IERTEGTLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRM

Query:  VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACED
        VLQ L++  L    +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     
Subjt:  VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACED

Query:  SFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV-------------------------
        + + +KQ LV+ PVL   D S   ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A+                         
Subjt:  SFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV-------------------------

Query:  -----------------RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQKIVDAQS
                          RW   ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I          + Q+++    + ++V   +
Subjt:  -----------------RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQKIVDAQS

Query:  NDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKC----------
        ND  L+    L    +  E ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C          
Subjt:  NDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKC----------

Query:  --------------------------------------------------LVCKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAM
                                                          + C  + TA + A+++   ++   G P  I++D D  FTS+ WK      
Subjt:  --------------------------------------------------LVCKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAM

Query:  GTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY
           + FS  + PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   M PFE ++
Subjt:  GTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY

P0CT35 Transposon Tf2-2 polyprotein, e value 3.2e-77, 26.95% identity
Query:  IERTEGTLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRM
        IE+    +QG+T+F+K+DL+S YH +R++  D  K AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ 
Subjt:  IERTEGTLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRM

Query:  VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACED
        VLQ L++  L    +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     
Subjt:  VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACED

Query:  SFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV-------------------------
        + + +KQ LV+ PVL   D S   ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A+                         
Subjt:  SFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV-------------------------

Query:  -----------------RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQKIVDAQS
                          RW   ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I          + Q+++    + ++V   +
Subjt:  -----------------RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQKIVDAQS

Query:  NDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKC----------
        ND  L+    L    +  E ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C          
Subjt:  NDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKC----------

Query:  --------------------------------------------------LVCKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAM
                                                          + C  + TA + A+++   ++   G P  I++D D  FTS+ WK      
Subjt:  --------------------------------------------------LVCKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAM

Query:  GTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY
           + FS  + PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   M PFE ++
Subjt:  GTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY

P0CT36 Transposon Tf2-3 polyprotein, e value 3.2e-77, 26.95% identity
Query:  IERTEGTLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRM
        IE+    +QG+T+F+K+DL+S YH +R++  D  K AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ 
Subjt:  IERTEGTLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRM

Query:  VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACED
        VLQ L++  L    +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     
Subjt:  VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACED

Query:  SFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV-------------------------
        + + +KQ LV+ PVL   D S   ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A+                         
Subjt:  SFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV-------------------------

Query:  -----------------RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQKIVDAQS
                          RW   ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I          + Q+++    + ++V   +
Subjt:  -----------------RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQKIVDAQS

Query:  NDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKC----------
        ND  L+    L    +  E ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C          
Subjt:  NDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKC----------

Query:  --------------------------------------------------LVCKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAM
                                                          + C  + TA + A+++   ++   G P  I++D D  FTS+ WK      
Subjt:  --------------------------------------------------LVCKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAM

Query:  GTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY
           + FS  + PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   M PFE ++
Subjt:  GTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY

P0CT37 Transposon Tf2-4 polyprotein, e value 3.2e-77, 26.95% identity
Query:  IERTEGTLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRM
        IE+    +QG+T+F+K+DL+S YH +R++  D  K AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ 
Subjt:  IERTEGTLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRM

Query:  VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACED
        VLQ L++  L    +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     
Subjt:  VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACED

Query:  SFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV-------------------------
        + + +KQ LV+ PVL   D S   ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A+                         
Subjt:  SFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV-------------------------

Query:  -----------------RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQKIVDAQS
                          RW   ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I          + Q+++    + ++V   +
Subjt:  -----------------RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQKIVDAQS

Query:  NDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKC----------
        ND  L+    L    +  E ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C          
Subjt:  NDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKC----------

Query:  --------------------------------------------------LVCKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAM
                                                          + C  + TA + A+++   ++   G P  I++D D  FTS+ WK      
Subjt:  --------------------------------------------------LVCKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAM

Query:  GTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY
           + FS  + PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   M PFE ++
Subjt:  GTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY

P0CT41 Transposon Tf2-12 polyprotein, e value 3.2e-77, 26.95% identity
Query:  IERTEGTLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRM
        IE+    +QG+T+F+K+DL+S YH +R++  D  K AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ 
Subjt:  IERTEGTLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRM

Query:  VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACED
        VLQ L++  L    +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     
Subjt:  VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACED

Query:  SFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV-------------------------
        + + +KQ LV+ PVL   D S   ++ +DAS   +G VL Q+        V Y S ++   + NY   D E+ A+                         
Subjt:  SFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----VVAYASRQLKSHEQNYPTHDLELAAV-------------------------

Query:  -----------------RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQKIVDAQS
                          RW   ++D++ EI Y PG AN +ADALSR       ++    P+ +D E   I          + Q+++    + ++V   +
Subjt:  -----------------RRWLELVKDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQKIVDAQS

Query:  NDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKC----------
        ND  L+    L    +  E ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C          
Subjt:  NDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLAEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKC----------

Query:  --------------------------------------------------LVCKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAM
                                                          + C  + TA + A+++   ++   G P  I++D D  FTS+ WK      
Subjt:  --------------------------------------------------LVCKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAM

Query:  GTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY
           + FS  + PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   M PFE ++
Subjt:  GTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
ATMG00860.1 DNA/RNA polymerases superfamily protein, e value 3.7e-28, 45.8% identity
Query:  HLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLG--HVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVW
        HL MVLQ    ++ YA   KC F   Q+++LG  H++S +GVS DPAK+EA+ GW  P   +E+R FLGL GYYRRFV+N+ +I  PLT+L +K +   W
Subjt:  HLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLG--HVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVW

Query:  SKACEDSFQTLKQKLVTAPVLTVPDGSGSFV
        ++    +F+ LK  + T PVL +PD    FV
Subjt:  SKACEDSFQTLKQKLVTAPVLTVPDGSGSFV


Sequences
CDS sequence
ATGGATTGGTTGGCCGCTAACCACGCCAGTATAGATTGTTCACGTAAGGAGGTAACGTTTAACCCTCCCTCGATGGCCATTTGCCATAGAGTTGGAGCCGGGCACGGTTC
CTATATCCAGAGCCCCTTACAGAATGGCCCCCGCAGAATTGAAAGAACTGAAGGTACATTACAGGGAGCCACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATACCATC
AGCTGAGAATTAAGGATGAGGATGTACCGAAGACAGCATTTCGTTCCAGATATGGACACTACGAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGGCAGTGTTT
ATGGACTTGATGAACAGAGTGTTTAGGGAGTTCCTAGATACTTTTGTGATCGTGTTTATCGACGATATTTTGATATACTCCAAGACGGAGGCCGAACATGAGGAGCATTT
ACGTATGGTTTTGCAAACACTTCGGGATAATAAGTTGTATGCAAAGTTCTCGAAATGCGAGTTTTGGCTGAAGCAGGTGTCCTTTCTGGGCCACGTGGTTTCTAAGGATG
GAGTCTCTGTGGATCCAGCTAAGATAGAGGCAGTCACCGGTTGGACCCGACCTTCCACAGTCAGTGAGGTTCGTAGCTTTCTGGGTTTAGCAGGCTATTATCGACGGTTT
GTGGAGAACTTTTCCCGTATAGCTACTCCTCTTACTCAGTTGACCAGAAAGGGAGCTCCTTTTGTTTGGAGCAAGGCATGTGAGGACAGTTTCCAGACCCTTAAACAGAA
GCTAGTTACCGCGCCGGTTCTTACGGTACCTGATGGTTCTGGCAGTTTCGTGATTTATAGTGATGCTTCCAAGAAGGGTCTGGGTTGTGTTTTGATGCAGCAGGGTAAGG
TGGTCGCTTATGCGTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACACATGATCTAGAGTTGGCAGCAGTGCGGAGATGGCTTGAGTTAGTGAAGGATTACGAT
TGTGAGATACTGTATCATCCAGGCAAGGCGAATGTGGTAGCTGATGCTCTTAGTAGAAAGGTATCACATTCGGCAGCACTTATTACCCGACAGGCCCCATTGCATCGGGA
TCTTGAGCGGGCTGAGATTGCAGTGTCAGTGGGTGCAGTTACTATGCAGTTAGCCCAGTTGACGGTACAGCCGACTTTGAGGCAGAAGATCGTTGATGCTCAAAGTAACG
ATCCTTATCTGGTTGAGAAACGTGGCCTAGCAGAGGCAGGGCAAACGGCTGAGTTCTCGTTATCCTCTGATGGTGGACTGTTGTTTGAGAGGCGCCTCTGTGTCCCGTCA
GATAGTGCGGTTAAGACAGAATTATTAGCTGAGGCTCACAGTTCCCCATTTTCTATGCACCCAGGTAGTACGAAGATGTATCAGGACCTAAAGCGGGTTTATTGGTGGCG
TAACATGAAGAGGGAAGTAGCAGAATTTGTCAGTAAATGCTTGGTGTGTAAATCCACCTATACTGCTAGTAAGTGGGCACAGTTGTACATGTCTGAGATAGTGAGATTGC
ACGGAGTGCCAGTGTCGATTGTTTCTGACAGAGATGCCCGTTTCACTTCCAAATTTTGGAAGGGTTTGCAGACTGCTATGGGCACGAGATTGGACTTTAGTACGGCTTTC
CATCCACAGACTGACGGTCAGACTGAGCGTCTGAACCAGGTTTTAGAGGATATGTTGCGAGCGTGTGCATTGGAATTTCCAGGTAGCTGGGACTCCCACTTACATTTGAT
GGAATTTGCTTATAATAACAGCTATCAGGCTACTATTGGCATGGCACCGTTTGAGGCCCTGTACGGCAAATGTTGTAGATCCCCGGTTTGCTGGGATGAGGTAGCTTATC
GCTTGGCGTTACCTTCATCACTCTCGACAGTCCACGATGTGTTTCACGTTTCTATGTTGAGGAAGTACGTGCCAGATCCATCCCACGTAGTGGATTACGAGCCACTAGAG
ATTGATGAAAATTTGAGCTATGTTGAACAACCTGTTGAGGTGCTCGCTAGAGAGGTGAAGACGTTGAGAAATAGACAAATTCCCCTAGTTAAAGTCTTATGGCGGAATCA
CCGGGTAGAAGAGGCTACATGGGAGCGTGAAGACGACATGAGATCCCGTTATCCCGAACTGTTCGAGAAATAA
mRNA sequence
ATGGATTGGTTGGCCGCTAACCACGCCAGTATAGATTGTTCACGTAAGGAGGTAACGTTTAACCCTCCCTCGATGGCCATTTGCCATAGAGTTGGAGCCGGGCACGGTTC
CTATATCCAGAGCCCCTTACAGAATGGCCCCCGCAGAATTGAAAGAACTGAAGGTACATTACAGGGAGCCACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATACCATC
AGCTGAGAATTAAGGATGAGGATGTACCGAAGACAGCATTTCGTTCCAGATATGGACACTACGAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGGCAGTGTTT
ATGGACTTGATGAACAGAGTGTTTAGGGAGTTCCTAGATACTTTTGTGATCGTGTTTATCGACGATATTTTGATATACTCCAAGACGGAGGCCGAACATGAGGAGCATTT
ACGTATGGTTTTGCAAACACTTCGGGATAATAAGTTGTATGCAAAGTTCTCGAAATGCGAGTTTTGGCTGAAGCAGGTGTCCTTTCTGGGCCACGTGGTTTCTAAGGATG
GAGTCTCTGTGGATCCAGCTAAGATAGAGGCAGTCACCGGTTGGACCCGACCTTCCACAGTCAGTGAGGTTCGTAGCTTTCTGGGTTTAGCAGGCTATTATCGACGGTTT
GTGGAGAACTTTTCCCGTATAGCTACTCCTCTTACTCAGTTGACCAGAAAGGGAGCTCCTTTTGTTTGGAGCAAGGCATGTGAGGACAGTTTCCAGACCCTTAAACAGAA
GCTAGTTACCGCGCCGGTTCTTACGGTACCTGATGGTTCTGGCAGTTTCGTGATTTATAGTGATGCTTCCAAGAAGGGTCTGGGTTGTGTTTTGATGCAGCAGGGTAAGG
TGGTCGCTTATGCGTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACACATGATCTAGAGTTGGCAGCAGTGCGGAGATGGCTTGAGTTAGTGAAGGATTACGAT
TGTGAGATACTGTATCATCCAGGCAAGGCGAATGTGGTAGCTGATGCTCTTAGTAGAAAGGTATCACATTCGGCAGCACTTATTACCCGACAGGCCCCATTGCATCGGGA
TCTTGAGCGGGCTGAGATTGCAGTGTCAGTGGGTGCAGTTACTATGCAGTTAGCCCAGTTGACGGTACAGCCGACTTTGAGGCAGAAGATCGTTGATGCTCAAAGTAACG
ATCCTTATCTGGTTGAGAAACGTGGCCTAGCAGAGGCAGGGCAAACGGCTGAGTTCTCGTTATCCTCTGATGGTGGACTGTTGTTTGAGAGGCGCCTCTGTGTCCCGTCA
GATAGTGCGGTTAAGACAGAATTATTAGCTGAGGCTCACAGTTCCCCATTTTCTATGCACCCAGGTAGTACGAAGATGTATCAGGACCTAAAGCGGGTTTATTGGTGGCG
TAACATGAAGAGGGAAGTAGCAGAATTTGTCAGTAAATGCTTGGTGTGTAAATCCACCTATACTGCTAGTAAGTGGGCACAGTTGTACATGTCTGAGATAGTGAGATTGC
ACGGAGTGCCAGTGTCGATTGTTTCTGACAGAGATGCCCGTTTCACTTCCAAATTTTGGAAGGGTTTGCAGACTGCTATGGGCACGAGATTGGACTTTAGTACGGCTTTC
CATCCACAGACTGACGGTCAGACTGAGCGTCTGAACCAGGTTTTAGAGGATATGTTGCGAGCGTGTGCATTGGAATTTCCAGGTAGCTGGGACTCCCACTTACATTTGAT
GGAATTTGCTTATAATAACAGCTATCAGGCTACTATTGGCATGGCACCGTTTGAGGCCCTGTACGGCAAATGTTGTAGATCCCCGGTTTGCTGGGATGAGGTAGCTTATC
GCTTGGCGTTACCTTCATCACTCTCGACAGTCCACGATGTGTTTCACGTTTCTATGTTGAGGAAGTACGTGCCAGATCCATCCCACGTAGTGGATTACGAGCCACTAGAG
ATTGATGAAAATTTGAGCTATGTTGAACAACCTGTTGAGGTGCTCGCTAGAGAGGTGAAGACGTTGAGAAATAGACAAATTCCCCTAGTTAAAGTCTTATGGCGGAATCA
CCGGGTAGAAGAGGCTACATGGGAGCGTGAAGACGACATGAGATCCCGTTATCCCGAACTGTTCGAGAAATAA
Protein sequence
MDWLAANHASIDCSRKEVTFNPPSMAICHRVGAGHGSYIQSPLQNGPRRIERTEGTLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYEFIVMSFGLTNAPAVF
MDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKDGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYRRF
VENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAAVRRWLELVKDYD
CEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQKIVDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPS
DSAVKTELLAEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAF
HPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWDEVAYRLALPSSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLE
IDENLSYVEQPVEVLAREVKTLRNRQIPLVKVLWRNHRVEEATWEREDDMRSRYPELFEK
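
The CDS and protein sequences listed above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, not part of the database record: the function name `translate` and the variables `cds_start` and `cds_end` are hypothetical, the standard (nuclear) codon table is assumed, and only the first and last few codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First nine codons and last six codons of the CDS shown above
# (excerpts chosen for illustration only).
cds_start = "ATGGATTGGTTGGCCGCTAACCACGCC"
cds_end = "GAACTGTTCGAGAAATAA"

print(translate(cds_start))
print(translate(cds_end))
```

Passing the full CDS (with line breaks removed) to the same function should reproduce the full protein sequence if the record is internally consistent.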