; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0024100 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0024100
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionReverse transcriptase
Genome locationchr06:15985466..15990894
RNA-Seq ExpressionIVF0024100
SyntenyIVF0024100
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR041588 - Integrase zinc-binding domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR036397 - Ribonuclease H superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR012337 - Ribonuclease H-like superfamily
IPR001584 - Integrase, catalytic core


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048687.1 pol protein [Cucumis melo var. makuwa]0.076.89Show/hide
Query:  MTVEQYDAKFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRSG
        MTVEQYDA+FDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRG TSGQKRKAEQQPVPVPQRNFRSG
Subjt:  MTVEQYDAKFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRSG

Query:  GEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCR---------------------ALTPCSISIYSFR---------------------
        GEFRRFQQKPFEAGEAAR KPLCT CGKHHLGRCLFGTRTCFKCR                     A  P    +++                       
Subjt:  GEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCR---------------------ALTPCSISIYSFR---------------------

Query:  -------------ECMLSKERVKAFQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNSPSMASFKFKGEGSRSLPQVISAIRASK
                     ECMLSKE+VK  QIEIAGHVIEVTLLVLDMLDFDVILGMDWLAA+HASIDCSRKEVTFN PS ASFKFKG GSRSLPQVISAIRASK
Subjt:  -------------ECMLSKERVKAFQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNSPSMASFKFKGEGSRSLPQVISAIRASK

Query:  LLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP
        LLSQGTWGILASVVDTRE DVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP
Subjt:  LLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP

Query:  ILFVNKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNR
        +LFV KKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNR
Subjt:  ILFVNKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNR

Query:  VFREFLDTFVIVFIEDILIYSKTEAEHEEHL------------------------------------------PKIEAVTGWTRPSTVSEVRSFLGLAGY
        VFREFLDTFVIVFI+DILIYSKTEAEHEEHL                                           KIEAVTGWTRPSTVSEVRSFLGLAGY
Subjt:  VFREFLDTFVIVFIEDILIYSKTEAEHEEHL------------------------------------------PKIEAVTGWTRPSTVSEVRSFLGLAGY

Query:  YRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTASVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----------------------
        YRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTA VLTVPDGSGSFVIYSDASKKGLGCVLMQQGK                       
Subjt:  YRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTASVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK-----------------------

Query:  -------------------------------------------------------------ANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVG
                                                                     ANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVG
Subjt:  -------------------------------------------------------------ANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVG

Query:  AVTMQLAQLTVQPTLRQRIIDAQ--------KRGLAEAGQAVGFSISSDGGLVFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQYLKRVYWWRN
        AVTMQLAQLTVQPTLRQRIIDAQ        KRGLAEAGQAV FS+SSDGGL+FERRLCVPSDS VKTELLSEAHSSPFSMHPGSTKMY+ +KRVYWWRN
Subjt:  AVTMQLAQLTVQPTLRQRIIDAQ--------KRGLAEAGQAVGFSISSDGGLVFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQYLKRVYWWRN

Query:  MKREVAEFVSRCLVCQQVKAPRQKPAGL----------------------PRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVP
        MKREVAEFVSRCLVCQQVKAPRQKPAGL                      PRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVP
Subjt:  MKREVAEFVSRCLVCQQVKAPRQKPAGL----------------------PRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVP

Query:  VSIVSDRDARFTSKFWKGLQTSMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVC
        VSIVSDRDARFTSKFWK LQT+MGTRLDFSTAFHPQTDGQTERLNQVLE MLRACALEFPGSWDSHLHLMEF YNNSYQATIGMAPFEALYGKCCRSPVC
Subjt:  VSIVSDRDARFTSKFWKGLQTSMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVC

Query:  WGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERKGKLSPRFVGSFEILKRIGTVAYRLALPPS
        WGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFER+GKLSPRF+G FEIL+RIG VAYRLALPPS
Subjt:  WGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERKGKLSPRFVGSFEILKRIGTVAYRLALPPS

Query:  LSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYAEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEHEDDMRNKTFEDESSLRREECNAQN
        LSTVHDVFHVSMLRKYVPDPSHVVDY+PL+IDENLSY EQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWE EDDM+++  E    LR E      
Subjt:  LSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYAEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEHEDDMRNKTFEDESSLRREECNAQN

Query:  FRAPPSTISFPCTPSIPSPSSPSRSVPVSISGVDNVFDFSILVRGGSLPF
            PS +S P +    +P   S  + +  SG+      ++ + GGSLPF
Subjt:  FRAPPSTISFPCTPSIPSPSSPSRSVPVSISGVDNVFDFSILVRGGSLPF

KAA0051357.1 pol protein [Cucumis melo var. makuwa]0.078.02Show/hide
Query:  MTVEQYDAKFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRSG
        MTVEQYDA+FDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQE ANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRSG
Subjt:  MTVEQYDAKFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRSG

Query:  GEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCR----ALTPCSI--------------------------------------------
        GEFR FQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCR        C +                                            
Subjt:  GEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCR----ALTPCSI--------------------------------------------

Query:  --------SIYSF--------------------------RECMLSKERVKAFQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNS
                S +SF                           ECMLS+E+VKA QIEIAGHVIEVTL+VLDMLDFDVILGMDWLAANHASIDCSRK+VTFN 
Subjt:  --------SIYSF--------------------------RECMLSKERVKAFQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNS

Query:  PSMASFKFKGEGSRSLPQVISAIRASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAE
        PSMASFKFKG GS+SLPQVISAIRASKLLSQGTWGILASVVDTRE DVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAE
Subjt:  PSMASFKFKGEGSRSLPQVISAIRASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAE

Query:  LKELKVQLQELLDKGFIRPSVSPWGAPILFVNKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRS
        LKELKVQLQELLDKGFIRP+VSPWGAP+LFV KKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGY+QLRIKD DVPKTAFRS
Subjt:  LKELKVQLQELLDKGFIRPSVSPWGAPILFVNKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRS

Query:  RYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIEDILIYSKTEAEHEEHL------------------------------------------
        RYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFI+DILIYSKTEAEHEEHL                                          
Subjt:  RYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIEDILIYSKTEAEHEEHL------------------------------------------

Query:  PKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTASVLTVPDGSGSFVIYSDASKKGLGCVLM
         KIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTA VLTVPDGSGSFVIYSDASKKGLGCVLM
Subjt:  PKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTASVLTVPDGSGSFVIYSDASKKGLGCVLM

Query:  QQGK------------------------------------------------------------------------------------ANVVADALSRKV
        QQGK                                                                                    ANVVADALSRKV
Subjt:  QQGK------------------------------------------------------------------------------------ANVVADALSRKV

Query:  SHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQ--------KRGLAEAGQAVGFSISSDGGLVFERRLCVPSDSAVKTELLSE
        SHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQ        KRGLAEAGQAV FS+SSDGGL FE RLCVPSDSAVKTELL E
Subjt:  SHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQ--------KRGLAEAGQAVGFSISSDGGLVFERRLCVPSDSAVKTELLSE

Query:  AHSSPFSMHPGSTKMYQYLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGL----------------------PRTLRGFTVIWVVVDRLTKSAHF
        AHSSPFSMHPGSTKMYQ LKRVYWWRNMKREVAEFVS+CLVCQQVK PRQKPAGL                      PRTLRGFTVIWVVVDRLTKSAHF
Subjt:  AHSSPFSMHPGSTKMYQYLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGL----------------------PRTLRGFTVIWVVVDRLTKSAHF

Query:  VPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTSMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFA
        VPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQT+MGTRLDFSTAFHPQ DGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFA
Subjt:  VPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTSMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFA

Query:  YNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERKGK
        YNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFE+ DKVFLKVAPM+GVLRFER+GK
Subjt:  YNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERKGK

Query:  LSPRFVGSFEILKRIGTVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYAEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEAT
        LSPRFVG FEIL+RIG VAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSY EQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEAT
Subjt:  LSPRFVGSFEILKRIGTVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYAEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEAT

Query:  WEHEDDMRNK
        WE EDDMR++
Subjt:  WEHEDDMRNK

KAA0060413.1 pol protein [Cucumis melo var. makuwa]0.075.78Show/hide
Query:  MTVEQYDAKFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRSG
        MTVEQYDAKFDMLS FAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRSG
Subjt:  MTVEQYDAKFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRSG

Query:  GEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKC----------------RALTPC----------------SISIYSF-----------
        GEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKC                RA T                  S S +SF           
Subjt:  GEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKC----------------RALTPC----------------SISIYSF-----------

Query:  ---------------RECMLSKERVKAFQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNSPSMASFKFKGEGSRSLPQVISAIR
                       RECMLSKERVKAFQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNSPSMASFKFKGEGSRSLPQVISAIR
Subjt:  ---------------RECMLSKERVKAFQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNSPSMASFKFKGEGSRSLPQVISAIR

Query:  ASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPW
        ASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPW
Subjt:  ASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPW

Query:  GAPILFVNKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDL
        GAPILFVNKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAF SRYGHYEFIV SFGLTNAPAVFMDL
Subjt:  GAPILFVNKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDL

Query:  MNRVFREFLDTFVIVFIEDILIYSKTEAEHEEHL-------PKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE
        MNRVFREFLDTFVIVFIEDILIYSKTEAEHEEHL        KIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE
Subjt:  MNRVFREFLDTFVIVFIEDILIYSKTEAEHEEHL-------PKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE

Query:  DSFQNLKQKLVTASVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK----------------------------------------ANVVADALSRKVSHS
        DSFQNLKQKLVTASVLTVPDGSGSFV YSDASKKGLGCVLMQQGK                                        ANVVADALSRKVSHS
Subjt:  DSFQNLKQKLVTASVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK----------------------------------------ANVVADALSRKVSHS

Query:  AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQ--------KRGLAEAGQAVGFSISSDGGLVFERRLCVPSDSAVKTELLSEAHS
        AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRI+DAQ        KRGLAEAGQAVGFSISSDGGLVFERRLCVPSDSAVKTELLSEAHS
Subjt:  AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQ--------KRGLAEAGQAVGFSISSDGGLVFERRLCVPSDSAVKTELLSEAHS

Query:  SPFSMHPGSTKMYQYLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGL----------------------PRTLRGFTVIWVVVDRLTKSAHFVPG
        SPFSMHPGSTKMYQYLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGL                      PRTLRGFTVIWVVVDRLTKSAHFVPG
Subjt:  SPFSMHPGSTKMYQYLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGL----------------------PRTLRGFTVIWVVVDRLTKSAHFVPG

Query:  KSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTSMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNN
        KSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTSMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNN
Subjt:  KSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTSMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNN

Query:  SYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERKGKLSP
        SYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERKGKLSP
Subjt:  SYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERKGKLSP

Query:  RFVGSFEILKRIGTVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYAEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEH
        RFVGSFEILKRIGTVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYAEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEH
Subjt:  RFVGSFEILKRIGTVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYAEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEH

Query:  EDDMRNKTFEDESSLRREECNAQNFRAPPSTISFPCTPSIPSPSSPSRSVPVSISGVDNVFDFSILVRGGSLPFRISSRFQTLGRRRASFVRPCPPSSRR
        EDDM +                               PS+PS          S SG             G     +SSR                     
Subjt:  EDDMRNKTFEDESSLRREECNAQNFRAPPSTISFPCTPSIPSPSSPSRSVPVSISGVDNVFDFSILVRGGSLPFRISSRFQTLGRRRASFVRPCPPSSRR

Query:  PPPRGSRASRRTPPSRDTSPSSNARRLSLRLPDFKPIRPENPNQIRPVFQAASREPSGAEPRAELSRAASRAAIRDQAEPSCEPRPSRAELRVSRAESEP
                                                                                                            
Subjt:  PPPRGSRASRRTPPSRDTSPSSNARRLSLRLPDFKPIRPENPNQIRPVFQAASREPSGAEPRAELSRAASRAAIRDQAEPSCEPRPSRAELRVSRAESEP

Query:  SRAEPSRTPPSQAELPVHRATQAFLPPLRVTVDPLSVDSIKGHNQVSGKGFSYYWTSERGWKRSSPPGIT
                 P QA L +     AF    + TVDPLSVDSIKGHNQVSGKGFSYYWTSERGWKRSSPPGIT
Subjt:  SRAEPSRTPPSQAELPVHRATQAFLPPLRVTVDPLSVDSIKGHNQVSGKGFSYYWTSERGWKRSSPPGIT

KAA0062141.1 pol protein [Cucumis melo var. makuwa]0.081.63Show/hide
Query:  MTVEQYDAKFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRSG
        MTVEQYDA+FDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSG+KRKAEQQPVP PQRNFRSG
Subjt:  MTVEQYDAKFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRSG

Query:  GEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCR----ALTPCSISIYS---------------FRECMLSKERV--------------
        GEFRRFQQKPFEAGEAAR KPLCTTCGKHHLGRCLFGTRTCFKCR        C + +                 F       ER               
Subjt:  GEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCR----ALTPCSISIYS---------------FRECMLSKERV--------------

Query:  ---------------KAF------QIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNSPSMASFKFKGEGSRSLPQVISAIRASKL
                        AF      +IEIAGHVI+VTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVTFN PSMASFKFK  GSRSLPQVISAIRASKL
Subjt:  ---------------KAF------QIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNSPSMASFKFKGEGSRSLPQVISAIRASKL

Query:  LSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPI
        LSQGTWGILASVVDTREVDVSLSSEPV+RDYPDVFPEELPGLPPHREVEFAIELE GTVPISRAPYRMAP ELKELKVQLQELLDKGFIRPSVSPWGAP+
Subjt:  LSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPI

Query:  LFVNKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRV
        LFV KKDGSMRLCIDYRELNKVT+KNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAF SRYGH EFIVMSFGLTNAPAVFMDLMNRV
Subjt:  LFVNKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRV

Query:  FREFLDTFVIVFIEDILIYSKTEAEHEEHL------------------------------------------PKIEAVTGWTRPSTVSEVRSFLGLAGYY
        FR+FLDTFVIVFI+DILIYSKTEAEHEEHL                                           KIEAVTGWTRPSTVSEVRSFLGLAGYY
Subjt:  FREFLDTFVIVFIEDILIYSKTEAEHEEHL------------------------------------------PKIEAVTGWTRPSTVSEVRSFLGLAGYY

Query:  RRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTASVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK------------------------
        RRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTA VLTVPDGSGSF+IYSDASKKGLGCVLMQQGK                        
Subjt:  RRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTASVLTVPDGSGSFVIYSDASKKGLGCVLMQQGK------------------------

Query:  ----------------ANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQ--------KRGLAEAGQAVGFS
                        ANVVAD LSRKVSHSAALITRQAPLHRDLERAEIAVS+GAVTMQLAQLTVQPTLRQRIIDAQ        KRGLAE GQAV FS
Subjt:  ----------------ANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQ--------KRGLAEAGQAVGFS

Query:  ISSDGGLVFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQYLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGL-----------------
        ISSDGGL+FERRLCVPSDSAVKTELLSEAHSSPFSMHPGS KMYQ LKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGL                 
Subjt:  ISSDGGLVFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQYLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGL-----------------

Query:  -----PRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTSMGTRLDFSTAFHPQTDGQTERLN
             PRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGV VSIVSDRDARFTSKFWKGLQT+MGTRLDFSTAFHPQTDGQTERLN
Subjt:  -----PRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTSMGTRLDFSTAFHPQTDGQTERLN

Query:  QVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRK
        QVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFE LYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRK
Subjt:  QVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRK

Query:  DLEFEVGDKVFLKVAPMRGVLRFERKGKLSPRFVGSFEILKRIGTVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYAEQPVEVL
        DLEFEVGDKVFLKVAPMRGVLRFER+GKLSPRFVG FEIL+RIG VAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDEN SY EQPVEVL
Subjt:  DLEFEVGDKVFLKVAPMRGVLRFERKGKLSPRFVGSFEILKRIGTVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYAEQPVEVL

Query:  AREVKTLRNKEIPLVKVLWRNHRVEEATWEHEDDMRNKTFE
        AREVKTLRNKEIPLVKVLWRNHRVEEATWE EDDMR++  E
Subjt:  AREVKTLRNKEIPLVKVLWRNHRVEEATWEHEDDMRNKTFE

KAA0062245.1 pol protein [Cucumis melo var. makuwa]0.080.19Show/hide
Query:  MTVEQYDAKFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRSG
        MTVEQYDA+FDMLSRFAPEMIATEAA ADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRSG
Subjt:  MTVEQYDAKFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRSG

Query:  GEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCR----ALTPCSI--------------------------------------------
        GEF RFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCR        C +                                            
Subjt:  GEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCR----ALTPCSI--------------------------------------------

Query:  --------SIYSF--------------------------RECMLSKERVKAFQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNS
                S +SF                           ECMLSKE+VKA QIEIA HVIEVTL+VLDMLDFDVILGMDWL ANHASIDCSRKEVTFN 
Subjt:  --------SIYSF--------------------------RECMLSKERVKAFQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNS

Query:  PSMASFKFKGEGSRSLPQVISAIRASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAE
        PSMASF+ KG GS+SLPQVISAIRASKLLSQGTWGIL SVVDTRE DVSLSSEPVVRDYPDVFPEELPGLP HREVEFAIELEPGTVPISRAPYRMAPAE
Subjt:  PSMASFKFKGEGSRSLPQVISAIRASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAE

Query:  LKELKVQLQELLDKGFIRPSVSPWGAPILFVNKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRS
        LKELKVQLQELLDKGFIRPSVSPWGAP+LFV KKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPKTAFRS
Subjt:  LKELKVQLQELLDKGFIRPSVSPWGAPILFVNKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRS

Query:  RYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIEDILIYSKTEAEHEEHL------------------------------------------
        RYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFI+DILIYSKTEAEHEEHL                                          
Subjt:  RYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIEDILIYSKTEAEHEEHL------------------------------------------

Query:  PKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTASVLTVPDGSGSFVIYSDASKKGLGCVLM
         KIEAVTGWTRPST+SEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTA VLTVPDGSGSFVIYSDA KKGLGCVLM
Subjt:  PKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTASVLTVPDGSGSFVIYSDASKKGLGCVLM

Query:  QQGK----------------------------------------------ANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTV
        QQGK                                              ANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTV
Subjt:  QQGK----------------------------------------------ANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTV

Query:  QPTLRQRIIDAQ--------KRGLAEAGQAVGFSISSDGGLVFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQYLKRVYWWRNMKREVAEFVSR
        QPTLRQRIIDAQ        KRGLAEAGQA  FS+SSDGGL+FERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQ LKRVYWWRNMKREVAEFVS+
Subjt:  QPTLRQRIIDAQ--------KRGLAEAGQAVGFSISSDGGLVFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQYLKRVYWWRNMKREVAEFVSR

Query:  CLVCQQVKAPRQKPAGL----------------------PRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARF
        CLVCQQVKAP QKPAGL                      PRTLRGF+VIWVVVDRLTKSAHFV GKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARF
Subjt:  CLVCQQVKAPRQKPAGL----------------------PRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARF

Query:  TSKFWKGLQTSMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMG
        TSKFWKGLQT+MGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCC+SPVCWGEVGEQRLMG
Subjt:  TSKFWKGLQTSMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMG

Query:  PELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERKGKLSPRFVGSFEILKRIGTVAYRLALPPSLSTVHDVFHVS
        PELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFER+GKLSPRFVG FEIL+RIG +AYRLALPPSLSTVHDVFHVS
Subjt:  PELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERKGKLSPRFVGSFEILKRIGTVAYRLALPPSLSTVHDVFHVS

Query:  MLRKYVPDPSHVVDYEPLEIDENLSYAEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEHEDDMRNK
        MLRKYVPDPSHVVDYEPLEIDENLSYAEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWE EDDMR++
Subjt:  MLRKYVPDPSHVVDYEPLEIDENLSYAEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEHEDDMRNK

TrEMBL top hitse value%identityAlignment
A0A5A7SJ52 Pol protein0.0e+0077.69Show/hide
Query:  MTVEQYDAKFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRSG
        MTVEQYD KFDMLSRFAPEMIATEAARADKFVRGL LDIQGLVRAF+P THADALRLAVDL+LQ+RANSSK AGRGSTSGQKRKAEQQP+PV QRNFR G
Subjt:  MTVEQYDAKFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRSG

Query:  GEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCR-----------------------ALTPCSISI--------------YSF------
        GEFRRFQQKPFEAGEAARGKPLC TCGKHHLGRCLFGTRTCFKCR                       AL   ++ +              +SF      
Subjt:  GEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCR-----------------------ALTPCSISI--------------YSF------

Query:  --------------------RECMLSKERVKAFQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNSPSMASFKFKGEGSRSLPQV
                             ECMLSKE+VKA QIE+AGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEV FN PSMASFKFKGEGSRSLPQV
Subjt:  --------------------RECMLSKERVKAFQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNSPSMASFKFKGEGSRSLPQV

Query:  ISAIRASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRP
        IS IRASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQE LDKGFIRP
Subjt:  ISAIRASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRP

Query:  SVSPWGAPILFVNKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPA
        SVSPWGAP+LFV KKDGSMRLCIDYRELNKV VKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPA
Subjt:  SVSPWGAPILFVNKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPA

Query:  VFMDLMNRVFREFLDTFVIVFIEDILIYSKTEAEHEEHL------------------------------------------PKIEAVTGWTRPSTVSEVR
        VFMD MNRVFREFLD+FVIVFI+DILIYSKTEAEHEEHL                                           KIEAVTGWTRPSTVSEVR
Subjt:  VFMDLMNRVFREFLDTFVIVFIEDILIYSKTEAEHEEHL------------------------------------------PKIEAVTGWTRPSTVSEVR

Query:  SFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTASVLTVPDGSGSFVIYSDASKKGLGCVLMQQ-----------------
        SFLGLAGYYR FVENFSRIAT LTQLTRKGAPFVWSKACEDSFQNLKQKLVTA VLTVPDGSGSFVIYSDASKKGLGCVLMQQ                 
Subjt:  SFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTASVLTVPDGSGSFVIYSDASKKGLGCVLMQQ-----------------

Query:  -----------------------GKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQ--------KRGLA
                               GKANVVADALS KVSHSAALITRQAPLHRDLERAEIAVSVGAVT+QLAQLTVQPTLRQRIIDAQ        KRGL 
Subjt:  -----------------------GKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQ--------KRGLA

Query:  EAGQAVGFSISSDGGLVFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQYLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPA----------
        EAGQAV FSISSDGGL+FERRLC                         STKMY+ LKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPA          
Subjt:  EAGQAVGFSISSDGGLVFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQYLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPA----------

Query:  ------------GLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTSMGTRLDFSTAFHPQ
                    GLPRTLRGFTVIWVVVDRL KSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQT+MGTRLDFSTAFHPQ
Subjt:  ------------GLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTSMGTRLDFSTAFHPQ

Query:  TDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQ
        TDG+TERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQ
Subjt:  TDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQ

Query:  KSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERKGKLSPRFVGSFEILKRIGTVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLS
        KSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFER+GKLSPRFVG FEIL+RIG +AYRLALPPSLSTVHDVFHVSML KYVPDPSHVVDYEPLEIDENLS
Subjt:  KSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERKGKLSPRFVGSFEILKRIGTVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLS

Query:  YAEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEHEDDMRNKTFEDESSLRREECNAQNFRAPPSTISFPCTPSIPSPSSPSRSVP
        Y EQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWE EDDM +         R  E  A     PPS +     PS PS SSP+ + P
Subjt:  YAEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEHEDDMRNKTFEDESSLRREECNAQNFRAPPSTISFPCTPSIPSPSSPSRSVP

A0A5A7UDB1 Reverse transcriptase0.0e+0078.97Show/hide
Query:  MTVEQYDAKFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRSG
        MTVEQYDA+FDMLSRFAPEMIATEAARADKFVRGL LDIQGLVRAFRP THADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRSG
Subjt:  MTVEQYDAKFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRSG

Query:  GEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCR---------------------ALTPCSISIYSFR---------------------
        GEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCR                     A  P    +++                       
Subjt:  GEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCR---------------------ALTPCSISIYSFR---------------------

Query:  -------ECMLSKERVKAFQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNSPSMASFKFKGEGSRSLPQVISAIRASKLLSQGT
               ECMLSKERVKA QIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVTFN PSMASFKFKG GS+SLPQVISAI+ASKLLSQGT
Subjt:  -------ECMLSKERVKAFQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNSPSMASFKFKGEGSRSLPQVISAIRASKLLSQGT

Query:  WGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPILFVNK
        WGILASVVDTRE DVSLSSEPVVRDY DVFPEELPGLPPHREVEFAIELE GTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAP+LFV K
Subjt:  WGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPILFVNK

Query:  KDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFL
        KDGSMRLCID RELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFL
Subjt:  KDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFL

Query:  DTFVIVFIEDILIYSKTEAEHEEHL------------------------------------------PKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVE
        DTFVIVFI+DILIYSKTEAEHEEHL                                           KIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVE
Subjt:  DTFVIVFIEDILIYSKTEAEHEEHL------------------------------------------PKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVE

Query:  NFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTASVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKANVVADALSRKVSHSAALITRQAPL----
        NFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTA VLTVP+GSGSFVIYSDA KKGLGCVLMQQGK  VVA A  +  SH     T    L    
Subjt:  NFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTASVLTVPDGSGSFVIYSDASKKGLGCVLMQQGKANVVADALSRKVSHSAALITRQAPL----

Query:  --------------------HRDLE-----------------------------RAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQ--------KRGLAE
                            H+ L+                               +  V   A++ +LAQLTVQPTLRQRIIDAQ        KRGLAE
Subjt:  --------------------HRDLE-----------------------------RAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQ--------KRGLAE

Query:  AGQAVGFSISSDGGLVFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQYLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPA-----------
          QAV FS+SSDGGL+FERRLCVPSD AVKTELLSEAHSSPFSMHPGSTKMYQ LKRVYWWRNMKREVAEFVS+CLVCQQVKAPRQKPA           
Subjt:  AGQAVGFSISSDGGLVFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQYLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPA-----------

Query:  -----------GLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTSMGTRLDFSTAFHPQT
                   GLPRTL+GFTVIWVVVDRLTKSAHF+PGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDAR TSKFWKGLQT+MGTRLDFSTAFHPQT
Subjt:  -----------GLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTSMGTRLDFSTAFHPQT

Query:  DGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQK
        DGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCW EVGEQRLMGPELVQSTNEAIQKI+SRM TAQSRQK
Subjt:  DGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQK

Query:  SYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERKGKLSPRFVGSFEILKRIGTVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSY
        SYADVRRKDLEFEVGDKVFLKVAPMRGVLRFER+GKLS  FVG FEIL+RIG VAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSY
Subjt:  SYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERKGKLSPRFVGSFEILKRIGTVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSY

Query:  AEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEHEDDMRNK
         EQPVEVLAREVKTLR+KEIPLVKVLWRNHRVEEATWE EDDMR++
Subjt:  AEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEHEDDMRNK

A0A5A7UWP7 Pol protein0.0e+0075.65Show/hide
Query:  MTVEQYDAKFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRSG
        MTVEQYDAKFDMLS FAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRSG
Subjt:  MTVEQYDAKFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRSG

Query:  GEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKC----------------RALTPC----------------SISIYSF-----------
        GEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKC                RA T                  S S +SF           
Subjt:  GEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKC----------------RALTPC----------------SISIYSF-----------

Query:  ---------------RECMLSKERVKAFQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNSPSMASFKFKGEGSRSLPQVISAIR
                       RECMLSKERVKAFQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNSPSMASFKFKGEGSRSLPQVISAIR
Subjt:  ---------------RECMLSKERVKAFQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNSPSMASFKFKGEGSRSLPQVISAIR

Query:  ASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPW
        ASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPW
Subjt:  ASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPW

Query:  GAPILFVNKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDL
        GAPILFVNKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAF SRYGHYEFIV SFGLTNAPAVFMDL
Subjt:  GAPILFVNKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDL

Query:  MNRVFREFLDTFVIVFIEDILIYSKTEAEHEEHL-------PKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE
        MNRVFREFLDTFVIVFIEDILIYSKTEAEHEEHL        KIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE
Subjt:  MNRVFREFLDTFVIVFIEDILIYSKTEAEHEEHL-------PKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACE

Query:  DSFQNLKQKLVTASVLTVPDGSGSFVIYSDASKKGLGCVLMQQ----------------------------------------GKANVVADALSRKVSHS
        DSFQNLKQKLVTASVLTVPDGSGSFV YSDASKKGLGCVLMQQ                                        GKANVVADALSRKVSHS
Subjt:  DSFQNLKQKLVTASVLTVPDGSGSFVIYSDASKKGLGCVLMQQ----------------------------------------GKANVVADALSRKVSHS

Query:  AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQ--------KRGLAEAGQAVGFSISSDGGLVFERRLCVPSDSAVKTELLSEAHS
        AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRI+DAQ        KRGLAEAGQAVGFSISSDGGLVFERRLCVPSDSAVKTELLSEAHS
Subjt:  AALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQ--------KRGLAEAGQAVGFSISSDGGLVFERRLCVPSDSAVKTELLSEAHS

Query:  SPFSMHPGSTKMYQYLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPA----------------------GLPRTLRGFTVIWVVVDRLTKSAHFVPG
        SPFSMHPGSTKMYQYLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPA                      GLPRTLRGFTVIWVVVDRLTKSAHFVPG
Subjt:  SPFSMHPGSTKMYQYLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPA----------------------GLPRTLRGFTVIWVVVDRLTKSAHFVPG

Query:  KSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTSMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNN
        KSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTSMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNN
Subjt:  KSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTSMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNN

Query:  SYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERKGKLSP
        SYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERKGKLSP
Subjt:  SYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERKGKLSP

Query:  RFVGSFEILKRIGTVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYAEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEH
        RFVGSFEILKRIGTVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYAEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEH
Subjt:  RFVGSFEILKRIGTVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYAEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEH

Query:  EDDMRNKTFEDESSLRREECNAQNFRAPPSTISFPCTPSIPSPSSPSRSVPVSISGVDNVFDFSILVRGGSLPFRISSRFQTLGRRRASFVRPCPPSSRR
        EDDM +                               PS+PS +S                        G     +SSR                     
Subjt:  EDDMRNKTFEDESSLRREECNAQNFRAPPSTISFPCTPSIPSPSSPSRSVPVSISGVDNVFDFSILVRGGSLPFRISSRFQTLGRRRASFVRPCPPSSRR

Query:  PPPRGSRASRRTPPSRDTSPSSNARRLSLRLPDFKPIRPENPNQIRPVFQAASREPSGAEPRAELSRAASRAAIRDQAEPSCEPRPSRAELRVSRAESEP
                                                                                                            
Subjt:  PPPRGSRASRRTPPSRDTSPSSNARRLSLRLPDFKPIRPENPNQIRPVFQAASREPSGAEPRAELSRAASRAAIRDQAEPSCEPRPSRAELRVSRAESEP

Query:  SRAEPSRTPPSQAELPVHRATQAFLPPLRVTVDPLSVDSIKGHNQVSGKGFSYYWTSERGWKRSSPPGIT
                 P QA L +     AF    + TVDPLSVDSIKGHNQVSGKGFSYYWTSERGWKRSSPPGIT
Subjt:  SRAEPSRTPPSQAELPVHRATQAFLPPLRVTVDPLSVDSIKGHNQVSGKGFSYYWTSERGWKRSSPPGIT

A0A5A7V8L8 Pol protein0.0e+0080.19Show/hide
Query:  MTVEQYDAKFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRSG
        MTVEQYDA+FDMLSRFAPEMIATEAA ADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRSG
Subjt:  MTVEQYDAKFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRSG

Query:  GEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCR----ALTPCSI--------------------------------------------
        GEF RFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCR        C +                                            
Subjt:  GEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCR----ALTPCSI--------------------------------------------

Query:  --------SIYSF--------------------------RECMLSKERVKAFQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNS
                S +SF                           ECMLSKE+VKA QIEIA HVIEVTL+VLDMLDFDVILGMDWL ANHASIDCSRKEVTFN 
Subjt:  --------SIYSF--------------------------RECMLSKERVKAFQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNS

Query:  PSMASFKFKGEGSRSLPQVISAIRASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAE
        PSMASF+ KG GS+SLPQVISAIRASKLLSQGTWGIL SVVDTRE DVSLSSEPVVRDYPDVFPEELPGLP HREVEFAIELEPGTVPISRAPYRMAPAE
Subjt:  PSMASFKFKGEGSRSLPQVISAIRASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAE

Query:  LKELKVQLQELLDKGFIRPSVSPWGAPILFVNKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRS
        LKELKVQLQELLDKGFIRPSVSPWGAP+LFV KKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPKTAFRS
Subjt:  LKELKVQLQELLDKGFIRPSVSPWGAPILFVNKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRS

Query:  RYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIEDILIYSKTEAEHEEHL------------------------------------------
        RYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFI+DILIYSKTEAEHEEHL                                          
Subjt:  RYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIEDILIYSKTEAEHEEHL------------------------------------------

Query:  PKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTASVLTVPDGSGSFVIYSDASKKGLGCVLM
         KIEAVTGWTRPST+SEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTA VLTVPDGSGSFVIYSDA KKGLGCVLM
Subjt:  PKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTASVLTVPDGSGSFVIYSDASKKGLGCVLM

Query:  QQ----------------------------------------------GKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTV
        QQ                                              GKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTV
Subjt:  QQ----------------------------------------------GKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTV

Query:  QPTLRQRIIDAQ--------KRGLAEAGQAVGFSISSDGGLVFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQYLKRVYWWRNMKREVAEFVSR
        QPTLRQRIIDAQ        KRGLAEAGQA  FS+SSDGGL+FERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQ LKRVYWWRNMKREVAEFVS+
Subjt:  QPTLRQRIIDAQ--------KRGLAEAGQAVGFSISSDGGLVFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQYLKRVYWWRNMKREVAEFVSR

Query:  CLVCQQVKAPRQKPA----------------------GLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARF
        CLVCQQVKAP QKPA                      GLPRTLRGF+VIWVVVDRLTKSAHFV GKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARF
Subjt:  CLVCQQVKAPRQKPA----------------------GLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARF

Query:  TSKFWKGLQTSMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMG
        TSKFWKGLQT+MGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCC+SPVCWGEVGEQRLMG
Subjt:  TSKFWKGLQTSMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMG

Query:  PELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERKGKLSPRFVGSFEILKRIGTVAYRLALPPSLSTVHDVFHVS
        PELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFER+GKLSPRFVG FEIL+RIG +AYRLALPPSLSTVHDVFHVS
Subjt:  PELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERKGKLSPRFVGSFEILKRIGTVAYRLALPPSLSTVHDVFHVS

Query:  MLRKYVPDPSHVVDYEPLEIDENLSYAEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEHEDDMRNK
        MLRKYVPDPSHVVDYEPLEIDENLSYAEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWE EDDMR++
Subjt:  MLRKYVPDPSHVVDYEPLEIDENLSYAEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEHEDDMRNK

A0A5A7V8X5 Pol protein0.0e+0081.63Show/hide
Query:  MTVEQYDAKFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRSG
        MTVEQYDA+FDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSG+KRKAEQQPVP PQRNFRSG
Subjt:  MTVEQYDAKFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRSG

Query:  GEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCR----ALTPCSISIYS---------------FRECMLSKERV--------------
        GEFRRFQQKPFEAGEAAR KPLCTTCGKHHLGRCLFGTRTCFKCR        C + +                 F       ER               
Subjt:  GEFRRFQQKPFEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCR----ALTPCSISIYS---------------FRECMLSKERV--------------

Query:  ---------------KAF------QIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNSPSMASFKFKGEGSRSLPQVISAIRASKL
                        AF      +IEIAGHVI+VTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVTFN PSMASFKFK  GSRSLPQVISAIRASKL
Subjt:  ---------------KAF------QIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNSPSMASFKFKGEGSRSLPQVISAIRASKL

Query:  LSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPI
        LSQGTWGILASVVDTREVDVSLSSEPV+RDYPDVFPEELPGLPPHREVEFAIELE GTVPISRAPYRMAP ELKELKVQLQELLDKGFIRPSVSPWGAP+
Subjt:  LSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPI

Query:  LFVNKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRV
        LFV KKDGSMRLCIDYRELNKVT+KNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAF SRYGH EFIVMSFGLTNAPAVFMDLMNRV
Subjt:  LFVNKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRV

Query:  FREFLDTFVIVFIEDILIYSKTEAEHEEHL------------------------------------------PKIEAVTGWTRPSTVSEVRSFLGLAGYY
        FR+FLDTFVIVFI+DILIYSKTEAEHEEHL                                           KIEAVTGWTRPSTVSEVRSFLGLAGYY
Subjt:  FREFLDTFVIVFIEDILIYSKTEAEHEEHL------------------------------------------PKIEAVTGWTRPSTVSEVRSFLGLAGYY

Query:  RRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTASVLTVPDGSGSFVIYSDASKKGLGCVLMQQ--------------------------
        RRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTA VLTVPDGSGSF+IYSDASKKGLGCVLMQQ                          
Subjt:  RRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTASVLTVPDGSGSFVIYSDASKKGLGCVLMQQ--------------------------

Query:  --------------GKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQ--------KRGLAEAGQAVGFS
                      GKANVVAD LSRKVSHSAALITRQAPLHRDLERAEIAVS+GAVTMQLAQLTVQPTLRQRIIDAQ        KRGLAE GQAV FS
Subjt:  --------------GKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQ--------KRGLAEAGQAVGFS

Query:  ISSDGGLVFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQYLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPA-------------------
        ISSDGGL+FERRLCVPSDSAVKTELLSEAHSSPFSMHPGS KMYQ LKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPA                   
Subjt:  ISSDGGLVFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQYLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPA-------------------

Query:  ---GLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTSMGTRLDFSTAFHPQTDGQTERLN
           GLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGV VSIVSDRDARFTSKFWKGLQT+MGTRLDFSTAFHPQTDGQTERLN
Subjt:  ---GLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTSMGTRLDFSTAFHPQTDGQTERLN

Query:  QVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRK
        QVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFE LYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRK
Subjt:  QVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRK

Query:  DLEFEVGDKVFLKVAPMRGVLRFERKGKLSPRFVGSFEILKRIGTVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYAEQPVEVL
        DLEFEVGDKVFLKVAPMRGVLRFER+GKLSPRFVG FEIL+RIG VAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDEN SY EQPVEVL
Subjt:  DLEFEVGDKVFLKVAPMRGVLRFERKGKLSPRFVGSFEILKRIGTVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYAEQPVEVL

Query:  AREVKTLRNKEIPLVKVLWRNHRVEEATWEHEDDMRNKTFE
        AREVKTLRNKEIPLVKVLWRNHRVEEATWE EDDMR++  E
Subjt:  AREVKTLRNKEIPLVKVLWRNHRVEEATWEHEDDMRNKTFE

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein3.6e-9026.75Show/hide
Query:  EELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPILFVNKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL
        E+LP   P + +EF +EL      +    Y + P +++ +  ++ + L  G IR S +    P++FV KK+G++R+ +DY+ LNK    N YPLP I+ L
Subjt:  EELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPILFVNKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL

Query:  FDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIEDILIYSKTEAEHEEHL------
          ++QG+T+F+K+DL+S YH +R++ GD  K AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ +++DILI+SK+E+EH +H+      
Subjt:  FDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIEDILIYSKTEAEHEEHL------

Query:  ------------------------------------PKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQN
                                              I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + +N
Subjt:  ------------------------------------PKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQN

Query:  LKQKLVTASVLTVPDGSGSFVIYSDASKKGLGCVLMQQ--------------------------------------------------------------
        +KQ LV+  VL   D S   ++ +DAS   +G VL Q+                                                              
Subjt:  LKQKLVTASVLTVPDGSGSFVIYSDASKKGLGCVLMQQ--------------------------------------------------------------

Query:  -------------------------------GKANVVADALSRKVSHSAALITRQAPLHRDLERAEI-AVSVGAVTMQLA-QLTVQPTLRQRIIDAQKRG
                                       G AN +ADALSR       ++    P+ +D E   I  V+  ++T     Q+  + T   ++++     
Subjt:  -------------------------------GKANVVADALSRKVSHSAALITRQAPLHRDLERAEI-AVSVGAVTMQLA-QLTVQPTLRQRIIDAQKRG

Query:  LAEAGQAVGFSISSDGGLVFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQYLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAG----L
        L    + V  +I    GL+   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G    +
Subjt:  LAEAGQAVGFSISSDGGLVFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQYLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAG----L

Query:  PRTLR-----------------GFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTSMGTRLDFSTAF
        P + R                 G+  ++VVVDR +K A  VP   + TA + A+++   ++   G P  I++D D  FTS+ WK         + FS  +
Subjt:  PRTLR-----------------GFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTSMGTRLDFSTAF

Query:  HPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTA
         PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   M PFE ++      SP+   E+        E  Q T +  Q ++  ++T 
Subjt:  HPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTA

Query:  QSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVLRFERKGKLSPRFVGSFEILKRIGTVAYRLALPPSLSTV-HDVFHVSMLRKY
          + K Y D++ +++ EF+ GD V +K     G L   +  KL+P F G F +L++ G   Y L LP S+  +    FHVS L KY
Subjt:  QSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVLRFERKGKLSPRFVGSFEILKRIGTVAYRLALPPSLSTV-HDVFHVSMLRKY

P0CT35 Transposon Tf2-2 polyprotein3.6e-9026.75Show/hide
Query:  EELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPILFVNKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL
        E+LP   P + +EF +EL      +    Y + P +++ +  ++ + L  G IR S +    P++FV KK+G++R+ +DY+ LNK    N YPLP I+ L
Subjt:  EELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPILFVNKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL

Query:  FDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIEDILIYSKTEAEHEEHL------
          ++QG+T+F+K+DL+S YH +R++ GD  K AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ +++DILI+SK+E+EH +H+      
Subjt:  FDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIEDILIYSKTEAEHEEHL------

Query:  ------------------------------------PKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQN
                                              I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + +N
Subjt:  ------------------------------------PKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQN

Query:  LKQKLVTASVLTVPDGSGSFVIYSDASKKGLGCVLMQQ--------------------------------------------------------------
        +KQ LV+  VL   D S   ++ +DAS   +G VL Q+                                                              
Subjt:  LKQKLVTASVLTVPDGSGSFVIYSDASKKGLGCVLMQQ--------------------------------------------------------------

Query:  -------------------------------GKANVVADALSRKVSHSAALITRQAPLHRDLERAEI-AVSVGAVTMQLA-QLTVQPTLRQRIIDAQKRG
                                       G AN +ADALSR       ++    P+ +D E   I  V+  ++T     Q+  + T   ++++     
Subjt:  -------------------------------GKANVVADALSRKVSHSAALITRQAPLHRDLERAEI-AVSVGAVTMQLA-QLTVQPTLRQRIIDAQKRG

Query:  LAEAGQAVGFSISSDGGLVFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQYLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAG----L
        L    + V  +I    GL+   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G    +
Subjt:  LAEAGQAVGFSISSDGGLVFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQYLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAG----L

Query:  PRTLR-----------------GFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTSMGTRLDFSTAF
        P + R                 G+  ++VVVDR +K A  VP   + TA + A+++   ++   G P  I++D D  FTS+ WK         + FS  +
Subjt:  PRTLR-----------------GFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTSMGTRLDFSTAF

Query:  HPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTA
         PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   M PFE ++      SP+   E+        E  Q T +  Q ++  ++T 
Subjt:  HPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTA

Query:  QSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVLRFERKGKLSPRFVGSFEILKRIGTVAYRLALPPSLSTV-HDVFHVSMLRKY
          + K Y D++ +++ EF+ GD V +K     G L   +  KL+P F G F +L++ G   Y L LP S+  +    FHVS L KY
Subjt:  QSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVLRFERKGKLSPRFVGSFEILKRIGTVAYRLALPPSLSTV-HDVFHVSMLRKY

P0CT36 Transposon Tf2-3 polyprotein3.6e-9026.75Show/hide
Query:  EELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPILFVNKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL
        E+LP   P + +EF +EL      +    Y + P +++ +  ++ + L  G IR S +    P++FV KK+G++R+ +DY+ LNK    N YPLP I+ L
Subjt:  EELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPILFVNKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL

Query:  FDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIEDILIYSKTEAEHEEHL------
          ++QG+T+F+K+DL+S YH +R++ GD  K AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ +++DILI+SK+E+EH +H+      
Subjt:  FDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIEDILIYSKTEAEHEEHL------

Query:  ------------------------------------PKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQN
                                              I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + +N
Subjt:  ------------------------------------PKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQN

Query:  LKQKLVTASVLTVPDGSGSFVIYSDASKKGLGCVLMQQ--------------------------------------------------------------
        +KQ LV+  VL   D S   ++ +DAS   +G VL Q+                                                              
Subjt:  LKQKLVTASVLTVPDGSGSFVIYSDASKKGLGCVLMQQ--------------------------------------------------------------

Query:  -------------------------------GKANVVADALSRKVSHSAALITRQAPLHRDLERAEI-AVSVGAVTMQLA-QLTVQPTLRQRIIDAQKRG
                                       G AN +ADALSR       ++    P+ +D E   I  V+  ++T     Q+  + T   ++++     
Subjt:  -------------------------------GKANVVADALSRKVSHSAALITRQAPLHRDLERAEI-AVSVGAVTMQLA-QLTVQPTLRQRIIDAQKRG

Query:  LAEAGQAVGFSISSDGGLVFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQYLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAG----L
        L    + V  +I    GL+   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G    +
Subjt:  LAEAGQAVGFSISSDGGLVFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQYLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAG----L

Query:  PRTLR-----------------GFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTSMGTRLDFSTAF
        P + R                 G+  ++VVVDR +K A  VP   + TA + A+++   ++   G P  I++D D  FTS+ WK         + FS  +
Subjt:  PRTLR-----------------GFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTSMGTRLDFSTAF

Query:  HPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTA
         PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   M PFE ++      SP+   E+        E  Q T +  Q ++  ++T 
Subjt:  HPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTA

Query:  QSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVLRFERKGKLSPRFVGSFEILKRIGTVAYRLALPPSLSTV-HDVFHVSMLRKY
          + K Y D++ +++ EF+ GD V +K     G L   +  KL+P F G F +L++ G   Y L LP S+  +    FHVS L KY
Subjt:  QSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVLRFERKGKLSPRFVGSFEILKRIGTVAYRLALPPSLSTV-HDVFHVSMLRKY

P0CT37 Transposon Tf2-4 polyprotein3.6e-9026.75Show/hide
Query:  EELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPILFVNKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL
        E+LP   P + +EF +EL      +    Y + P +++ +  ++ + L  G IR S +    P++FV KK+G++R+ +DY+ LNK    N YPLP I+ L
Subjt:  EELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPILFVNKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL

Query:  FDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIEDILIYSKTEAEHEEHL------
          ++QG+T+F+K+DL+S YH +R++ GD  K AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ +++DILI+SK+E+EH +H+      
Subjt:  FDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIEDILIYSKTEAEHEEHL------

Query:  ------------------------------------PKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQN
                                              I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + +N
Subjt:  ------------------------------------PKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQN

Query:  LKQKLVTASVLTVPDGSGSFVIYSDASKKGLGCVLMQQ--------------------------------------------------------------
        +KQ LV+  VL   D S   ++ +DAS   +G VL Q+                                                              
Subjt:  LKQKLVTASVLTVPDGSGSFVIYSDASKKGLGCVLMQQ--------------------------------------------------------------

Query:  -------------------------------GKANVVADALSRKVSHSAALITRQAPLHRDLERAEI-AVSVGAVTMQLA-QLTVQPTLRQRIIDAQKRG
                                       G AN +ADALSR       ++    P+ +D E   I  V+  ++T     Q+  + T   ++++     
Subjt:  -------------------------------GKANVVADALSRKVSHSAALITRQAPLHRDLERAEI-AVSVGAVTMQLA-QLTVQPTLRQRIIDAQKRG

Query:  LAEAGQAVGFSISSDGGLVFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQYLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAG----L
        L    + V  +I    GL+   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G    +
Subjt:  LAEAGQAVGFSISSDGGLVFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQYLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAG----L

Query:  PRTLR-----------------GFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTSMGTRLDFSTAF
        P + R                 G+  ++VVVDR +K A  VP   + TA + A+++   ++   G P  I++D D  FTS+ WK         + FS  +
Subjt:  PRTLR-----------------GFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTSMGTRLDFSTAF

Query:  HPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTA
         PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   M PFE ++      SP+   E+        E  Q T +  Q ++  ++T 
Subjt:  HPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTA

Query:  QSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVLRFERKGKLSPRFVGSFEILKRIGTVAYRLALPPSLSTV-HDVFHVSMLRKY
          + K Y D++ +++ EF+ GD V +K     G L   +  KL+P F G F +L++ G   Y L LP S+  +    FHVS L KY
Subjt:  QSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVLRFERKGKLSPRFVGSFEILKRIGTVAYRLALPPSLSTV-HDVFHVSMLRKY

P0CT41 Transposon Tf2-12 polyprotein3.6e-9026.75Show/hide
Query:  EELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPILFVNKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL
        E+LP   P + +EF +EL      +    Y + P +++ +  ++ + L  G IR S +    P++FV KK+G++R+ +DY+ LNK    N YPLP I+ L
Subjt:  EELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPWGAPILFVNKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDL

Query:  FDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIEDILIYSKTEAEHEEHL------
          ++QG+T+F+K+DL+S YH +R++ GD  K AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ +++DILI+SK+E+EH +H+      
Subjt:  FDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIEDILIYSKTEAEHEEHL------

Query:  ------------------------------------PKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQN
                                              I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + +N
Subjt:  ------------------------------------PKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQN

Query:  LKQKLVTASVLTVPDGSGSFVIYSDASKKGLGCVLMQQ--------------------------------------------------------------
        +KQ LV+  VL   D S   ++ +DAS   +G VL Q+                                                              
Subjt:  LKQKLVTASVLTVPDGSGSFVIYSDASKKGLGCVLMQQ--------------------------------------------------------------

Query:  -------------------------------GKANVVADALSRKVSHSAALITRQAPLHRDLERAEI-AVSVGAVTMQLA-QLTVQPTLRQRIIDAQKRG
                                       G AN +ADALSR       ++    P+ +D E   I  V+  ++T     Q+  + T   ++++     
Subjt:  -------------------------------GKANVVADALSRKVSHSAALITRQAPLHRDLERAEI-AVSVGAVTMQLA-QLTVQPTLRQRIIDAQKRG

Query:  LAEAGQAVGFSISSDGGLVFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQYLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAG----L
        L    + V  +I    GL+   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G    +
Subjt:  LAEAGQAVGFSISSDGGLVFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQYLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAG----L

Query:  PRTLR-----------------GFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTSMGTRLDFSTAF
        P + R                 G+  ++VVVDR +K A  VP   + TA + A+++   ++   G P  I++D D  FTS+ WK         + FS  +
Subjt:  PRTLR-----------------GFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTSMGTRLDFSTAF

Query:  HPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTA
         PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   M PFE ++      SP+   E+        E  Q T +  Q ++  ++T 
Subjt:  HPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTA

Query:  QSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVLRFERKGKLSPRFVGSFEILKRIGTVAYRLALPPSLSTV-HDVFHVSMLRKY
          + K Y D++ +++ EF+ GD V +K     G L   +  KL+P F G F +L++ G   Y L LP S+  +    FHVS L KY
Subjt:  QSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVLRFERKGKLSPRFVGSFEILKRIGTVAYRLALPPSLSTV-HDVFHVSMLRKY

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein3.3e-1444.05Show/hide
Query:  KIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTASVLTVPDGSGSFV
        K+EA+ GW  P   +E+R FLGL GYYRRFV+N+ +I  PLT+L +K +   W++    +F+ LK  + T  VL +PD    FV
Subjt:  KIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTASVLTVPDGSGSFV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAGTGGAGCAGTATGATGCGAAGTTTGACATGTTATCCCGCTTCGCTCCCGAGATGATAGCGACCGAGGCGGCCAGAGCTGATAAGTTTGTTAGAGGCCTCCGACT
GGACATTCAGGGTTTGGTCCGAGCTTTCCGACCCGCTACTCATGCCGATGCACTGCGCCTGGCAGTGGATCTCAGTTTACAGGAGAGGGCTAACTCGTCTAAGACCGCTG
GTAGAGGTTCAACATCGGGACAGAAGAGGAAGGCTGAGCAGCAGCCTGTTCCAGTGCCACAGCGGAACTTCAGATCAGGTGGCGAGTTTCGCCGCTTCCAGCAGAAACCT
TTTGAGGCAGGGGAAGCTGCCAGAGGAAAGCCGTTGTGTACCACTTGTGGGAAGCACCATCTGGGCCGTTGCTTATTCGGGACCAGGACTTGCTTTAAGTGTAGAGCCCT
TACACCATGTTCTATCAGTATCTACTCCTTCCGGGAGTGTATGTTGTCGAAGGAAAGGGTGAAAGCATTCCAGATTGAGATAGCAGGCCATGTGATTGAAGTAACGCTGT
TAGTCCTGGATATGCTCGACTTTGATGTAATCCTGGGTATGGATTGGTTGGCCGCTAACCACGCCAGCATAGATTGTTCCCGTAAGGAGGTAACGTTTAACTCTCCCTCG
ATGGCCAGTTTTAAATTTAAGGGAGAAGGGTCAAGGTCGTTGCCTCAGGTAATCTCAGCCATCAGGGCCAGTAAACTGCTCAGTCAGGGTACTTGGGGTATCTTAGCGAG
CGTGGTGGATACTAGAGAGGTCGATGTATCCCTGTCATCAGAACCGGTGGTGAGGGACTATCCGGATGTCTTTCCTGAAGAACTTCCAGGGTTACCTCCACACAGAGAGG
TTGAGTTTGCCATAGAGTTGGAGCCGGGCACGGTTCCTATATCCAGAGCCCCTTACAGGATGGCCCCCGCAGAGCTGAAAGAATTAAAGGTGCAGTTACAAGAATTGCTT
GATAAGGGATTCATTCGACCGAGCGTGTCACCTTGGGGTGCGCCAATTTTATTTGTTAACAAGAAGGATGGATCGATGCGTCTATGCATTGACTATAGGGAGTTGAATAA
GGTAACCGTTAAGAACAGATATCCCTTGCCCAGAATCGACGATCTATTTGACCAGTTGCAGGGAGCCACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATACCATCAGT
TGAGGATTAAGGATGGTGATGTACCGAAGACAGCATTTCGTTCCAGGTATGGACACTACGAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGGCAGTGTTTATG
GACTTGATGAACAGAGTGTTTAGGGAGTTCCTAGATACTTTTGTGATCGTGTTTATCGAAGATATCTTGATATACTCCAAGACGGAGGCCGAACATGAGGAGCATTTACC
TAAGATAGAGGCAGTCACCGGTTGGACCCGACCTTCCACAGTCAGTGAGGTTCGTAGCTTTCTGGGTTTAGCAGGTTATTATCGACGGTTTGTGGAGAACTTTTCTCGTA
TAGCTACTCCTCTTACTCAGTTGACCAGGAAGGGAGCTCCTTTTGTTTGGAGCAAGGCATGTGAGGACAGTTTCCAGAACCTTAAACAGAAGCTAGTTACCGCATCGGTT
CTTACTGTACCTGATGGTTCTGGCAGTTTTGTGATTTATAGTGATGCTTCCAAGAAGGGTTTGGGTTGTGTTTTGATGCAACAGGGCAAGGCAAATGTGGTAGCTGATGC
TCTTAGTAGAAAGGTATCACATTCAGCAGCACTTATTACCCGACAGGCCCCATTGCATCGGGATCTCGAGCGGGCTGAGATTGCAGTGTCAGTGGGGGCAGTCACTATGC
AGTTAGCCCAGTTGACGGTACAGCCGACTTTGAGGCAAAGGATCATTGATGCTCAGAAACGTGGCCTAGCAGAGGCAGGGCAAGCGGTTGGGTTCTCCATATCCTCTGAT
GGTGGACTTGTGTTTGAGAGACGCCTCTGTGTGCCGTCAGATAGTGCGGTTAAGACAGAATTATTATCTGAGGCTCACAGTTCCCCATTTTCCATGCACCCAGGTAGTAC
GAAGATGTATCAGTACCTGAAGCGGGTTTATTGGTGGCGTAACATGAAGAGGGAGGTAGCAGAATTTGTTAGTAGATGCTTGGTGTGTCAGCAGGTTAAGGCACCAAGGC
AGAAACCAGCGGGACTGCCAAGAACTCTGAGGGGTTTTACAGTGATTTGGGTTGTGGTGGACAGACTTACCAAATCAGCGCACTTCGTTCCGGGTAAATCCACCTATACT
GCTAGTAAGTGGGCACAGCTGTACATGTCGGAGATAGTGAGATTACATGGAGTGCCGGTGTCGATCGTTTCTGATAGAGATGCTCGTTTCACTTCCAAATTCTGGAAGGG
TTTGCAGACTTCTATGGGCACGAGGTTAGACTTTAGTACAGCTTTCCATCCACAGACTGACGGTCAGACTGAGCGTCTGAACCAAGTTTTAGAGGATATGTTGCGGGCGT
GTGCATTGGAATTTCCAGGTAGCTGGGACTCCCACTTGCATTTGATGGAATTTGCTTATAATAACAGTTATCAGGCTACTATTGGCATGGCACCATTTGAGGCCTTATAC
GGCAAATGTTGTAGATCCCCGGTTTGCTGGGGTGAGGTGGGTGAGCAGAGATTGATGGGTCCTGAGTTAGTTCAGTCTACTAACGAAGCGATACAGAAGATTAGATCACG
CATGCATACCGCTCAGAGTAGGCAGAAGAGTTATGCAGATGTGAGGCGGAAGGATCTCGAATTTGAGGTAGGGGACAAGGTGTTCTTAAAGGTAGCACCAATGAGAGGTG
TCTTACGATTTGAAAGGAAGGGAAAGTTGAGTCCCCGTTTTGTTGGGTCGTTTGAGATTTTGAAGCGGATTGGCACTGTAGCTTATCGCTTGGCGTTGCCACCATCACTC
TCGACAGTTCATGATGTGTTTCATGTTTCTATGTTGAGGAAGTACGTGCCAGATCCATCCCATGTAGTAGATTACGAGCCACTAGAGATTGATGAAAACTTGAGCTATGC
TGAACAACCCGTTGAGGTGCTTGCTAGAGAGGTGAAAACGTTGAGGAATAAAGAAATCCCTTTGGTTAAAGTCTTATGGCGGAATCACCGGGTAGAAGAGGCTACATGGG
AGCATGAAGATGACATGAGGAATAAAACTTTCGAGGACGAAAGTTCCCTAAGGAGGGAAGAATGTAACGCCCAGAATTTTCGAGCGCCGCCGTCCACCATCTCTTTTCCG
TGCACGCCGTCGATTCCATCCCCATCTTCTCCGTCGCGTTCCGTCCCCGTTTCCATCTCCGGCGTCGACAACGTCTTCGACTTCAGCATTTTGGTCCGCGGCGGCTCCCT
ACCATTTCGAATCTCCTCTCGGTTTCAAACACTCGGACGTCGCCGTGCGTCCTTCGTCCGACCGTGTCCGCCGTCAAGCCGTCGTCCTCCGCCGCGTGGAAGCAGAGCCA
GTCGTCGTACTCCACCCAGCCGCGACACATCTCCGTCGTCGAACGCCCGAAGATTGTCGTTGAGGTTGCCCGATTTCAAGCCGATCCGACCCGAAAACCCGAATCAAATC
CGACCCGTGTTTCAAGCTGCGAGCCGCGAACCGAGTGGAGCCGAGCCGCGAGCCGAGTTAAGCCGAGCCGCAAGCCGAGCTGCGATCCGAGACCAAGCCGAGCCGAGCTG
CGAGCCGAGGCCAAGCCGAGCCGAGCTGAGGGTGAGCCGAGCCGAGAGTGAGCCAAGCCGAGCCGAGCCGAGCCGAACACCTCCAAGCCAAGCCGAGCTCCCTGTTCACC
GAGCCACCCAAGCCTTCCTTCCTCCACTTCGGGTCACGGTGGATCCGTTGAGCGTGGACTCGATCAAGGGGCATAACCAAGTTTCAGGTAAGGGTTTTTCCTACTACTGG
ACCTCGGAGCGAGGCTGGAAACGTAGTAGTCCACCGGGGATTACACGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGACAGTGGAGCAGTATGATGCGAAGTTTGACATGTTATCCCGCTTCGCTCCCGAGATGATAGCGACCGAGGCGGCCAGAGCTGATAAGTTTGTTAGAGGCCTCCGACT
GGACATTCAGGGTTTGGTCCGAGCTTTCCGACCCGCTACTCATGCCGATGCACTGCGCCTGGCAGTGGATCTCAGTTTACAGGAGAGGGCTAACTCGTCTAAGACCGCTG
GTAGAGGTTCAACATCGGGACAGAAGAGGAAGGCTGAGCAGCAGCCTGTTCCAGTGCCACAGCGGAACTTCAGATCAGGTGGCGAGTTTCGCCGCTTCCAGCAGAAACCT
TTTGAGGCAGGGGAAGCTGCCAGAGGAAAGCCGTTGTGTACCACTTGTGGGAAGCACCATCTGGGCCGTTGCTTATTCGGGACCAGGACTTGCTTTAAGTGTAGAGCCCT
TACACCATGTTCTATCAGTATCTACTCCTTCCGGGAGTGTATGTTGTCGAAGGAAAGGGTGAAAGCATTCCAGATTGAGATAGCAGGCCATGTGATTGAAGTAACGCTGT
TAGTCCTGGATATGCTCGACTTTGATGTAATCCTGGGTATGGATTGGTTGGCCGCTAACCACGCCAGCATAGATTGTTCCCGTAAGGAGGTAACGTTTAACTCTCCCTCG
ATGGCCAGTTTTAAATTTAAGGGAGAAGGGTCAAGGTCGTTGCCTCAGGTAATCTCAGCCATCAGGGCCAGTAAACTGCTCAGTCAGGGTACTTGGGGTATCTTAGCGAG
CGTGGTGGATACTAGAGAGGTCGATGTATCCCTGTCATCAGAACCGGTGGTGAGGGACTATCCGGATGTCTTTCCTGAAGAACTTCCAGGGTTACCTCCACACAGAGAGG
TTGAGTTTGCCATAGAGTTGGAGCCGGGCACGGTTCCTATATCCAGAGCCCCTTACAGGATGGCCCCCGCAGAGCTGAAAGAATTAAAGGTGCAGTTACAAGAATTGCTT
GATAAGGGATTCATTCGACCGAGCGTGTCACCTTGGGGTGCGCCAATTTTATTTGTTAACAAGAAGGATGGATCGATGCGTCTATGCATTGACTATAGGGAGTTGAATAA
GGTAACCGTTAAGAACAGATATCCCTTGCCCAGAATCGACGATCTATTTGACCAGTTGCAGGGAGCCACAGTGTTCTCTAAGATTGATCTTCGGTCGGGATACCATCAGT
TGAGGATTAAGGATGGTGATGTACCGAAGACAGCATTTCGTTCCAGGTATGGACACTACGAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGGCAGTGTTTATG
GACTTGATGAACAGAGTGTTTAGGGAGTTCCTAGATACTTTTGTGATCGTGTTTATCGAAGATATCTTGATATACTCCAAGACGGAGGCCGAACATGAGGAGCATTTACC
TAAGATAGAGGCAGTCACCGGTTGGACCCGACCTTCCACAGTCAGTGAGGTTCGTAGCTTTCTGGGTTTAGCAGGTTATTATCGACGGTTTGTGGAGAACTTTTCTCGTA
TAGCTACTCCTCTTACTCAGTTGACCAGGAAGGGAGCTCCTTTTGTTTGGAGCAAGGCATGTGAGGACAGTTTCCAGAACCTTAAACAGAAGCTAGTTACCGCATCGGTT
CTTACTGTACCTGATGGTTCTGGCAGTTTTGTGATTTATAGTGATGCTTCCAAGAAGGGTTTGGGTTGTGTTTTGATGCAACAGGGCAAGGCAAATGTGGTAGCTGATGC
TCTTAGTAGAAAGGTATCACATTCAGCAGCACTTATTACCCGACAGGCCCCATTGCATCGGGATCTCGAGCGGGCTGAGATTGCAGTGTCAGTGGGGGCAGTCACTATGC
AGTTAGCCCAGTTGACGGTACAGCCGACTTTGAGGCAAAGGATCATTGATGCTCAGAAACGTGGCCTAGCAGAGGCAGGGCAAGCGGTTGGGTTCTCCATATCCTCTGAT
GGTGGACTTGTGTTTGAGAGACGCCTCTGTGTGCCGTCAGATAGTGCGGTTAAGACAGAATTATTATCTGAGGCTCACAGTTCCCCATTTTCCATGCACCCAGGTAGTAC
GAAGATGTATCAGTACCTGAAGCGGGTTTATTGGTGGCGTAACATGAAGAGGGAGGTAGCAGAATTTGTTAGTAGATGCTTGGTGTGTCAGCAGGTTAAGGCACCAAGGC
AGAAACCAGCGGGACTGCCAAGAACTCTGAGGGGTTTTACAGTGATTTGGGTTGTGGTGGACAGACTTACCAAATCAGCGCACTTCGTTCCGGGTAAATCCACCTATACT
GCTAGTAAGTGGGCACAGCTGTACATGTCGGAGATAGTGAGATTACATGGAGTGCCGGTGTCGATCGTTTCTGATAGAGATGCTCGTTTCACTTCCAAATTCTGGAAGGG
TTTGCAGACTTCTATGGGCACGAGGTTAGACTTTAGTACAGCTTTCCATCCACAGACTGACGGTCAGACTGAGCGTCTGAACCAAGTTTTAGAGGATATGTTGCGGGCGT
GTGCATTGGAATTTCCAGGTAGCTGGGACTCCCACTTGCATTTGATGGAATTTGCTTATAATAACAGTTATCAGGCTACTATTGGCATGGCACCATTTGAGGCCTTATAC
GGCAAATGTTGTAGATCCCCGGTTTGCTGGGGTGAGGTGGGTGAGCAGAGATTGATGGGTCCTGAGTTAGTTCAGTCTACTAACGAAGCGATACAGAAGATTAGATCACG
CATGCATACCGCTCAGAGTAGGCAGAAGAGTTATGCAGATGTGAGGCGGAAGGATCTCGAATTTGAGGTAGGGGACAAGGTGTTCTTAAAGGTAGCACCAATGAGAGGTG
TCTTACGATTTGAAAGGAAGGGAAAGTTGAGTCCCCGTTTTGTTGGGTCGTTTGAGATTTTGAAGCGGATTGGCACTGTAGCTTATCGCTTGGCGTTGCCACCATCACTC
TCGACAGTTCATGATGTGTTTCATGTTTCTATGTTGAGGAAGTACGTGCCAGATCCATCCCATGTAGTAGATTACGAGCCACTAGAGATTGATGAAAACTTGAGCTATGC
TGAACAACCCGTTGAGGTGCTTGCTAGAGAGGTGAAAACGTTGAGGAATAAAGAAATCCCTTTGGTTAAAGTCTTATGGCGGAATCACCGGGTAGAAGAGGCTACATGGG
AGCATGAAGATGACATGAGGAATAAAACTTTCGAGGACGAAAGTTCCCTAAGGAGGGAAGAATGTAACGCCCAGAATTTTCGAGCGCCGCCGTCCACCATCTCTTTTCCG
TGCACGCCGTCGATTCCATCCCCATCTTCTCCGTCGCGTTCCGTCCCCGTTTCCATCTCCGGCGTCGACAACGTCTTCGACTTCAGCATTTTGGTCCGCGGCGGCTCCCT
ACCATTTCGAATCTCCTCTCGGTTTCAAACACTCGGACGTCGCCGTGCGTCCTTCGTCCGACCGTGTCCGCCGTCAAGCCGTCGTCCTCCGCCGCGTGGAAGCAGAGCCA
GTCGTCGTACTCCACCCAGCCGCGACACATCTCCGTCGTCGAACGCCCGAAGATTGTCGTTGAGGTTGCCCGATTTCAAGCCGATCCGACCCGAAAACCCGAATCAAATC
CGACCCGTGTTTCAAGCTGCGAGCCGCGAACCGAGTGGAGCCGAGCCGCGAGCCGAGTTAAGCCGAGCCGCAAGCCGAGCTGCGATCCGAGACCAAGCCGAGCCGAGCTG
CGAGCCGAGGCCAAGCCGAGCCGAGCTGAGGGTGAGCCGAGCCGAGAGTGAGCCAAGCCGAGCCGAGCCGAGCCGAACACCTCCAAGCCAAGCCGAGCTCCCTGTTCACC
GAGCCACCCAAGCCTTCCTTCCTCCACTTCGGGTCACGGTGGATCCGTTGAGCGTGGACTCGATCAAGGGGCATAACCAAGTTTCAGGTAAGGGTTTTTCCTACTACTGG
ACCTCGGAGCGAGGCTGGAAACGTAGTAGTCCACCGGGGATTACACGTTAG
Protein sequenceShow/hide protein sequence
MTVEQYDAKFDMLSRFAPEMIATEAARADKFVRGLRLDIQGLVRAFRPATHADALRLAVDLSLQERANSSKTAGRGSTSGQKRKAEQQPVPVPQRNFRSGGEFRRFQQKP
FEAGEAARGKPLCTTCGKHHLGRCLFGTRTCFKCRALTPCSISIYSFRECMLSKERVKAFQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVTFNSPS
MASFKFKGEGSRSLPQVISAIRASKLLSQGTWGILASVVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQELL
DKGFIRPSVSPWGAPILFVNKKDGSMRLCIDYRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYGHYEFIVMSFGLTNAPAVFM
DLMNRVFREFLDTFVIVFIEDILIYSKTEAEHEEHLPKIEAVTGWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTASV
LTVPDGSGSFVIYSDASKKGLGCVLMQQGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQKRGLAEAGQAVGFSISSD
GGLVFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQYLKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYT
ASKWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTSMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALY
GKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERKGKLSPRFVGSFEILKRIGTVAYRLALPPSL
STVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYAEQPVEVLAREVKTLRNKEIPLVKVLWRNHRVEEATWEHEDDMRNKTFEDESSLRREECNAQNFRAPPSTISFP
CTPSIPSPSSPSRSVPVSISGVDNVFDFSILVRGGSLPFRISSRFQTLGRRRASFVRPCPPSSRRPPPRGSRASRRTPPSRDTSPSSNARRLSLRLPDFKPIRPENPNQI
RPVFQAASREPSGAEPRAELSRAASRAAIRDQAEPSCEPRPSRAELRVSRAESEPSRAEPSRTPPSQAELPVHRATQAFLPPLRVTVDPLSVDSIKGHNQVSGKGFSYYW
TSERGWKRSSPPGITR