; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0003789 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0003789
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionReverse transcriptase
Genome locationchr10:9462640..9470486
RNA-Seq ExpressionIVF0003789
SyntenyIVF0003789
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004518 - nuclease activity (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001969 - Aspartic peptidase, active site
IPR012337 - Ribonuclease H-like superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR036397 - Ribonuclease H superfamily
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040188.1 pol protein [Cucumis melo var. makuwa]0.072.89Show/hide
Query:  QVRVQRGAYRREAGRMREGHMDASGFLYASADGSLVVVREMPPRRGARRGGRGGRGRGAGRVQPEVQPVAHATDPNAPQQPAPPAPAPAPVPALVVPQIV
        +VR QRGA RREAGRMREGHM+ASGFL ASAD    V  EMPPRRGARRGGRG   RGAGRVQPEVQPVA ATDP AP                    +V
Subjt:  QVRVQRGAYRREAGRMREGHMDASGFLYASADGSLVVVREMPPRRGARRGGRGGRGRGAGRVQPEVQPVAHATDPNAPQQPAPPAPAPAPVPALVVPQIV

Query:  PDQLSAEARHLRDFRNYNPSTFDGSLEDATRAQLWLSSLKTIFRYMKCPKDQKVQCAVFMLTDRGTAWWETTERMLEGD---------------------
        PDQLSAEA+HLRDFR YNP+TFDGSLED TRAQLWLSSL+TIFRYMKCP+DQKVQCAVFMLTDRGTAWWETTERML GD                     
Subjt:  PDQLSAEARHLRDFRNYNPSTFDGSLEDATRAQLWLSSLKTIFRYMKCPKDQKVQCAVFMLTDRGTAWWETTERMLEGD---------------------

Query:  -------EFLNLEQGDMTVEQYDAEFDMLSRFTSEMIAIETARADKFVRGLRLDIQGLVRAFRPATHVDALRLAVDLSLQEKANSSKVAGRGSTSGQKRK
               EFLNLEQGDMTVEQYDAEFDMLS F  EMIA E ARADKFVRGLRLDIQGLVRAFRPATH DALRLAVDLSLQE+ANSSK AGRGSTSGQKRK
Subjt:  -------EFLNLEQGDMTVEQYDAEFDMLSRFTSEMIAIETARADKFVRGLRLDIQGLVRAFRPATHVDALRLAVDLSLQEKANSSKVAGRGSTSGQKRK

Query:  TEQQPIPVPQRNFRP--------------------------------------------------GADRCPMRLTGNAQNQGEGAPHQGKVFATNKTEAE
         EQQP+PVPQRNFR                                                    ADRCP+RLTGNAQNQG GAPHQG+VFATNKTEAE
Subjt:  TEQQPIPVPQRNFRP--------------------------------------------------GADRCPMRLTGNAQNQGEGAPHQGKVFATNKTEAE

Query:  RAATVVTGTLPVLGHYALVLFDSGSSHSFISSVFVLHVRLEVEPLHHVLSVSTPSEECMLAKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAA
        RA TVVTGTLPVLGHYALVLFDSGSSHSFISS FVLH RLEVEPLHHVLSVSTPS ECML+KEKVKACQIEIA HVIEVTLLVLDMLDFDVILGMDWLAA
Subjt:  RAATVVTGTLPVLGHYALVLFDSGSSHSFISSVFVLHVRLEVEPLHHVLSVSTPSEECMLAKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAA

Query:  NHASIDCSRKEVAFNPPSMASLKFKGEGSRSLPQVISAIRTSKLLSQGTWGILASVVDTREVDVSLSSEPVVGDYPDIFPEELPGLPSHREVEFAIELES
        NHASIDCSRKEV FNPPSMAS KFKG GSRSLPQVISAIR SKLLSQGTWGILASVVDTREVDVSLSSEPVV DYPD+FPEELPGLP HREVEFAIELE 
Subjt:  NHASIDCSRKEVAFNPPSMASLKFKGEGSRSLPQVISAIRTSKLLSQGTWGILASVVDTREVDVSLSSEPVVGDYPDIFPEELPGLPSHREVEFAIELES

Query:  GTVPIFRAPYRMAPVELKELKVQLQKVFDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRDLNKVTVKNRYPLSRIDDFFDQLQGATVFSKIDFRSGYH
        GTVPI RAPYRMAP ELKELKVQLQ++ DKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYR+LNKVTVKNRYPL RIDD FDQLQGATVFSKID RSGYH
Subjt:  GTVPIFRAPYRMAPVELKELKVQLQKVFDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRDLNKVTVKNRYPLSRIDDFFDQLQGATVFSKIDFRSGYH

Query:  QLRIKDGDVPKTAFRSRYEHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYFKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQV
        QLRIKDGDVPKTAFRSRY HYEFIVMSFGLTNAP VFMDLMNRVF+EFLDTFVIVFIDDILIY K EAEHEEHLR+VLQTLRDNKLYAKFSKCEFWLKQV
Subjt:  QLRIKDGDVPKTAFRSRYEHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYFKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQV

Query:  SFLGHVVSKAGVSVDPAKIEALTSWTRPSTVSKVRSFLGLAGYYRRFVENFSCIATPLPQLTRKGAPFVWSKAC--------------------------
        SFLGHVVSKAGVSVDPAKIEA+T WTRPSTVS+VRSFLGLAGYYRRFVENFS IATPL QLTRKGAPFVWSKAC                          
Subjt:  SFLGHVVSKAGVSVDPAKIEALTSWTRPSTVSKVRSFLGLAGYYRRFVENFSCIATPLPQLTRKGAPFVWSKAC--------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---KANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVLVGAVTMQLAQLTVQSTLRQRIIDAQSNDLSLVEKRGLAEAAQAVEFSISSDG-------
           KANVVADALSRKVSHSAALITR APLHRDLERAEIAV VGA+TMQLAQLTVQ TLRQRII AQSND  LVEKRGLAEA QA  FSISSDG       
Subjt:  ---KANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVLVGAVTMQLAQLTVQSTLRQRIIDAQSNDLSLVEKRGLAEAAQAVEFSISSDG-------

Query:  ---------------------------STKMYQDLKRVYWWRNMKREVAEFVSKCLVCQLVKAPRQKPAGLLQPLSIPEWKWENVSMDFITRQPRTLRGF
                                   STKMYQDLKRVYWWRNMKREVAEFVS+CLVCQ VKAPRQKPAGLLQPLSIPEWKWENVSMDFIT  PRTLRGF
Subjt:  ---------------------------STKMYQDLKRVYWWRNMKREVAEFVSKCLVCQLVKAPRQKPAGLLQPLSIPEWKWENVSMDFITRQPRTLRGF

Query:  TVIWVVVDRLTKSAHFVPGKSTYTTSKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSNFG------------------------------------RVCRL
        TVIWVVVDRLTKSAHFVPGKSTY  SKWAQLYMSEIVRLHGVPVSIVSDRDA FTS F                                     R C L
Subjt:  TVIWVVVDRLTKSAHFVPGKSTYTTSKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSNFG------------------------------------RVCRL

Query:  LWARGSWDFHLHLMEFAYNNSYQATIGMAPFEALYDKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFDVGDKVF
         +  GSWD HLHLMEFAYNNSYQATIGMAPFEALYDKCCRS VCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEF+VGDKVF
Subjt:  LWARGSWDFHLHLMEFAYNNSYQATIGMAPFEALYDKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFDVGDKVF

Query:  LKVAPIRGVLRFEKRGKLSPHFVGSFEILERIDPVAYRLALPPSLSTVHDVFHVSMLRKYMPDPSHVVDYEPLEIDENLSYIEQPVKVLTREVNMLRNRE
        LKVAP+RGVLRFE+RGKLSP FVGSFEILERI PVAYR+ALPPSLSTVHDVFHVSMLRKY+PDPSHVVDYEPLEIDENLSY EQPV+VL REV MLRNRE
Subjt:  LKVAPIRGVLRFEKRGKLSPHFVGSFEILERIDPVAYRLALPPSLSTVHDVFHVSMLRKYMPDPSHVVDYEPLEIDENLSYIEQPVKVLTREVNMLRNRE

Query:  IPLVK
        IPLVK
Subjt:  IPLVK

KAA0051357.1 pol protein [Cucumis melo var. makuwa]0.072.75Show/hide
Query:  MPPRRGARRGGRGGRGRGAGRVQPEVQPVAHATDPNAPQQPAP-PAPAPAPVPALV--VPQIVPDQLSAEARHLRDFRNYNPSTFDGSLEDATRAQLWLS
        MPPRRGARRGGRGGRGRGAGRVQPE         P     PAP PAP PAP PALV   PQ VPDQLSAEA+HLRDFR YNP+TFDGSLED TRAQ+WLS
Subjt:  MPPRRGARRGGRGGRGRGAGRVQPEVQPVAHATDPNAPQQPAP-PAPAPAPVPALV--VPQIVPDQLSAEARHLRDFRNYNPSTFDGSLEDATRAQLWLS

Query:  SLKTIFRYMKCPKDQKVQCAVFMLTDRGTAWWETTERMLEGD----------------------------EFLNLEQGDMTVEQYDAEFDMLSRFTSEMI
        SL+TIFRYMKCP+DQKVQCAVFMLTDRGTAWWETTERML GD                            EFLNLEQGDMTVEQYDAEFDMLSRF  EMI
Subjt:  SLKTIFRYMKCPKDQKVQCAVFMLTDRGTAWWETTERMLEGD----------------------------EFLNLEQGDMTVEQYDAEFDMLSRFTSEMI

Query:  AIETARADKFVRGLRLDIQGLVRAFRPATHVDALRLAVDLSLQEKANSSKVAGRGSTSGQKRKTEQQPIPVPQRNFRPG---------------------
        A E ARADKFVRGLRLDIQGLVRAFRPATH DALRLAVDLSLQE ANSSK AGRGSTSGQKRK EQQP+PVPQRNFR G                     
Subjt:  AIETARADKFVRGLRLDIQGLVRAFRPATHVDALRLAVDLSLQEKANSSKVAGRGSTSGQKRKTEQQPIPVPQRNFRPG---------------------

Query:  -----------------------------ADRCPMRLTGNAQNQGEGAPHQGKVFATNKTEAERAATVVTGTLPVLGHYALVLFDSGSSHSFISSVFVLH
                                     ADRCP+RLTG AQNQG GAPHQG+VFATN+TEAE+A TVVTGTLPVLGHYALVLFDSGSSHSFISS FV H
Subjt:  -----------------------------ADRCPMRLTGNAQNQGEGAPHQGKVFATNKTEAERAATVVTGTLPVLGHYALVLFDSGSSHSFISSVFVLH

Query:  VRLEVEPLHHVLSVSTPSEECMLAKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVAFNPPSMASLKFKGEGSRSLPQVIS
         RLEVEPLHHVLSVSTPS ECML++EKVKACQIEIAGHVIEVTL+VLDMLDFDVILGMDWLAANHASIDCSRK+V FNPPSMAS KFKG GS+SLPQVIS
Subjt:  VRLEVEPLHHVLSVSTPSEECMLAKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVAFNPPSMASLKFKGEGSRSLPQVIS

Query:  AIRTSKLLSQGTWGILASVVDTREVDVSLSSEPVVGDYPDIFPEELPGLPSHREVEFAIELESGTVPIFRAPYRMAPVELKELKVQLQKVFDKGFIRPSV
        AIR SKLLSQGTWGILASVVDTRE DVSLSSEPVV DYPD+FPEELPGLP HREVEFAIELE GTVPI RAPYRMAP ELKELKVQLQ++ DKGFIRP+V
Subjt:  AIRTSKLLSQGTWGILASVVDTREVDVSLSSEPVVGDYPDIFPEELPGLPSHREVEFAIELESGTVPIFRAPYRMAPVELKELKVQLQKVFDKGFIRPSV

Query:  SPWGAPVLFVKKKDGSMRLCIDYRDLNKVTVKNRYPLSRIDDFFDQLQGATVFSKIDFRSGYHQLRIKDGDVPKTAFRSRYEHYEFIVMSFGLTNAPTVF
        SPWGAPVLFVKKKDGSMRLCIDYR+LNKVTVKNRYPL RIDD FDQLQGATVFSKID RSGY+QLRIKD DVPKTAFRSRY HYEFIVMSFGLTNAP VF
Subjt:  SPWGAPVLFVKKKDGSMRLCIDYRDLNKVTVKNRYPLSRIDDFFDQLQGATVFSKIDFRSGYHQLRIKDGDVPKTAFRSRYEHYEFIVMSFGLTNAPTVF

Query:  MDLMNRVFREFLDTFVIVFIDDILIYFKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEALTSWTRPSTVSKVRSF
        MDLMNRVFREFLDTFVIVFIDDILIY KTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKA VSVDPAKIEA+T WTRPSTVS+VRSF
Subjt:  MDLMNRVFREFLDTFVIVFIDDILIYFKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEALTSWTRPSTVSKVRSF

Query:  LGLAGYYRRFVENFSCIATPLPQLTRKGAPFVWSKAC---------------------------------------------------------------
        LGLAGYYRRFVENFS IATPL QLTRKGAPFVWSKAC                                                               
Subjt:  LGLAGYYRRFVENFSCIATPLPQLTRKGAPFVWSKAC---------------------------------------------------------------

Query:  ------------------------------------------------------------------KANVVADALSRKVSHSAALITRQAPLHRDLERAE
                                                                          KANVVADALSRKVSHSAALITRQAPLHRDLERAE
Subjt:  ------------------------------------------------------------------KANVVADALSRKVSHSAALITRQAPLHRDLERAE

Query:  IAVLVGAVTMQLAQLTVQSTLRQRIIDAQSNDLSLVEKRGLAEAAQAVEFSISSDG----------------------------------STKMYQDLKR
        IAV VGAVTMQLAQLTVQ TLRQRIIDAQSND  LVEKRGLAEA QAVEFS+SSDG                                  STKMYQDLKR
Subjt:  IAVLVGAVTMQLAQLTVQSTLRQRIIDAQSNDLSLVEKRGLAEAAQAVEFSISSDG----------------------------------STKMYQDLKR

Query:  VYWWRNMKREVAEFVSKCLVCQLVKAPRQKPAGLLQPLSIPEWKWENVSMDFITRQPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTTSKWAQLYMSEIV
        VYWWRNMKREVAEFVSKCLVCQ VK PRQKPAGLLQPLSIPEWKWENVSMDFIT  PRTLRGFTVIWVVVDRLTKSAHFVPGKSTYT SKWAQLYMSEIV
Subjt:  VYWWRNMKREVAEFVSKCLVCQLVKAPRQKPAGLLQPLSIPEWKWENVSMDFITRQPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTTSKWAQLYMSEIV

Query:  RLHGVPVSIVSDRDAHFTSNFG------------------------------------RVCRLLWARGSWDFHLHLMEFAYNNSYQATIGMAPFEALYDK
        RLHGVPVSIVSDRDA FTS F                                     R C L +  GSWD HLHLMEFAYNNSYQATIGMAPFEALY K
Subjt:  RLHGVPVSIVSDRDAHFTSNFG------------------------------------RVCRLLWARGSWDFHLHLMEFAYNNSYQATIGMAPFEALYDK

Query:  CCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFDVGDKVFLKVAPIRGVLRFEKRGKLSPHFVGSFEILERIDPVAY
        CCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEF++ DKVFLKVAP++GVLRFE+RGKLSP FVG FEILERI PVAY
Subjt:  CCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFDVGDKVFLKVAPIRGVLRFEKRGKLSPHFVGSFEILERIDPVAY

Query:  RLALPPSLSTVHDVFHVSMLRKYMPDPSHVVDYEPLEIDENLSYIEQPVKVLTREVNMLRNREIPLVK
        RLALPPSLSTVHDVFHVSMLRKY+PDPSHVVDYEPLEIDENLSY+EQPV+VL REV  LRN+EIPLVK
Subjt:  RLALPPSLSTVHDVFHVSMLRKYMPDPSHVVDYEPLEIDENLSYIEQPVKVLTREVNMLRNREIPLVK

KAA0052126.1 pol protein [Cucumis melo var. makuwa]0.072.62Show/hide
Query:  VADLFGDSFPFLGWIRCEWTRTRGITKYQVRVQRGAYRREAGRMREGHMDASGFLYASADGSLVVVREMPPRRGARRGGRGGRGRGAGRVQPEVQPVAHA
        +  L G SF     I   +  TR + +  VRVQRGA RREAGRMREGHMDASGFLYASAD    V  EMPPRRGARRGGRGGRGRGA RVQPEVQPVA A
Subjt:  VADLFGDSFPFLGWIRCEWTRTRGITKYQVRVQRGAYRREAGRMREGHMDASGFLYASADGSLVVVREMPPRRGARRGGRGGRGRGAGRVQPEVQPVAHA

Query:  TDPNAPQQPAPPAPAPAPVPALVVPQIVPDQLSAEARHLRDFRNYNPSTFDGSLEDATRAQLWLSSLKTIFRYMKCPKDQKVQCAVFMLTDRGTAWWETT
        TDP AP                    +VPDQLSAEA+ LRDFR YNP+TFDGSLED TRAQLWLSSL+TIFRYMKCP+DQKVQCAVFMLTDRGTAWWETT
Subjt:  TDPNAPQQPAPPAPAPAPVPALVVPQIVPDQLSAEARHLRDFRNYNPSTFDGSLEDATRAQLWLSSLKTIFRYMKCPKDQKVQCAVFMLTDRGTAWWETT

Query:  ERMLEGD----------------------------EFLNLEQGDMTVEQYDAEFDMLSRFTSEMIAIETARADKFVRGLRLDIQGLVRAFRPATHVDALR
        ERML GD                            +FLNLEQGDMTVEQYDAEFDMLSRF  EMIA E ARADKFVRGLRLDIQGLVRAFRPATH DALR
Subjt:  ERMLEGD----------------------------EFLNLEQGDMTVEQYDAEFDMLSRFTSEMIAIETARADKFVRGLRLDIQGLVRAFRPATHVDALR

Query:  LAVDLSLQEKANSSKVAGRGSTSGQKRKTEQQPIPVPQRNFRPGADRCPMRLTGNAQNQGEGAPHQGKVFATNKTEAERAATVVTGTLPVLGHYALVLFD
        LAVDLSLQE+ANSSK AGRGSTSGQKRKTEQQP+ VPQRNFR                   GAPHQGKVFATNKTEAER  TVV GTLPVLGHYALVLFD
Subjt:  LAVDLSLQEKANSSKVAGRGSTSGQKRKTEQQPIPVPQRNFRPGADRCPMRLTGNAQNQGEGAPHQGKVFATNKTEAERAATVVTGTLPVLGHYALVLFD

Query:  SGSSHSFISSVFVLHVRLEVEPLHHVLSVSTPSEECMLAKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVAFNPPSMASL
        SGSSHSFIS  FVLH RLEVEPLH+VLSVSTPS ECML+KEKVK CQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAA HASIDCS KEVAFNPPSMAS 
Subjt:  SGSSHSFISSVFVLHVRLEVEPLHHVLSVSTPSEECMLAKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVAFNPPSMASL

Query:  KFKGEGSRSLPQVISAIRTSKLLSQGTWGILASVVDTREVDVSLSSEPVVGDYPDIFPEELPGLPSHREVEFAIELESGTVPIFRAPYRMAPVELKELKV
        KFKGEGSRSLPQVIS IR SKLLSQGTWGILASVVDTREVDVSLSSEPVV DYPD+FPEELPGLP HREVEFAIELE GTVPI RAPYRMAP ELKELKV
Subjt:  KFKGEGSRSLPQVISAIRTSKLLSQGTWGILASVVDTREVDVSLSSEPVVGDYPDIFPEELPGLPSHREVEFAIELESGTVPIFRAPYRMAPVELKELKV

Query:  QLQKVFDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRDLNKVTVKNRYPLSRIDDFFDQLQGATVFSKIDFRSGYHQLRIKDGDVPKTAFRSRYEHYE
        QLQ++ DKGFIRPSVSPWGAPVLFVKKKDGSMRL IDYR+LNKVTVKNRYPL RIDD FDQLQGATVFSKI+ RSGYHQLRIKDGDVPK AFRSRY +YE
Subjt:  QLQKVFDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRDLNKVTVKNRYPLSRIDDFFDQLQGATVFSKIDFRSGYHQLRIKDGDVPKTAFRSRYEHYE

Query:  FIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYFKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAL
        FIVMSFGLTNAP VFMDLMNRVFREFLDTFVIVFID+ILIY KTEAEHEEHLR+VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEA+
Subjt:  FIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYFKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAL

Query:  TSWTRPSTVSKVRSFLGLAGYYRRFVENFSCIATPLPQLTRKGAPFVWSKAC------------------------------------------------
        T WTRPSTV  VRSFLGLAGYYRRFVENFS IATPL QLTRKGAPFVWSK C                                                
Subjt:  TSWTRPSTVSKVRSFLGLAGYYRRFVENFSCIATPLPQLTRKGAPFVWSKAC------------------------------------------------

Query:  ---------------------------------------------------------------------------------KANVVADALSRKVSHSAAL
                                                                                         KANVVADALSRKVSHSAAL
Subjt:  ---------------------------------------------------------------------------------KANVVADALSRKVSHSAAL

Query:  ITRQAPLHRDLERAEIAVLVGAVTMQLAQLTVQSTLRQRIIDAQSNDLSLVEKRGLAEAAQAVEFSISSDG-----------------------------
        ITRQAPL RDLERAEIAV VGAVT+QLAQLTVQSTLR+RIIDAQSND  LVEKRGLAEA Q VEFSISSDG                             
Subjt:  ITRQAPLHRDLERAEIAVLVGAVTMQLAQLTVQSTLRQRIIDAQSNDLSLVEKRGLAEAAQAVEFSISSDG-----------------------------

Query:  -----STKMYQDLKRVYWWRNMKREVAEFVSKCLVCQLVKAPRQKPAGLLQPLSIPEWKWENVSMDFITRQPRTLRGFTVIWVVVDRLTKSAHFVPGKST
             STKMYQDLKRVYWWRNMKREVAEFVS+CLVCQ VKAPRQKPAGLLQPLSIPEWKWENVSM FIT  PRTLRGFTVIWVVVDRLTKSAHFVPGKST
Subjt:  -----STKMYQDLKRVYWWRNMKREVAEFVSKCLVCQLVKAPRQKPAGLLQPLSIPEWKWENVSMDFITRQPRTLRGFTVIWVVVDRLTKSAHFVPGKST

Query:  YTTSKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSNFG------------------------------------RVCRLLWARGSWDFHLHLMEFAYNNSY
        YT SKWAQLYMSEIVRLHGVPVSIVSDRDA F   F                                     R C L +  GSWD HLHLM+FAYNNSY
Subjt:  YTTSKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSNFG------------------------------------RVCRLLWARGSWDFHLHLMEFAYNNSY

Query:  QATIGMAPFEALYDKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFDVGDKVFLKVAPIRGVLRFEKRGKLSPHF
        QATI MAPFEALY KCCRSPVCWGEVGEQRLMGPELV+STNEAIQKIRSRMHTAQSRQKSYADVRRKDLEF+VGDKVFLKVAP+RGVLRFE+RGKLSP F
Subjt:  QATIGMAPFEALYDKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFDVGDKVFLKVAPIRGVLRFEKRGKLSPHF

Query:  VGSFEILERIDPVAYRLALPPSLSTVHDVFHVSMLRKYMPDPSHVVDYEPLEIDENLSYIEQPVKVLTREVNMLRNREIPLVK
        VG FEILERI PVAYRLALPPSLS VHDVFHVSMLRKY+PDPS VVDYEPLEIDENLSY E+PV+VL REV MLRNREIPL+K
Subjt:  VGSFEILERIDPVAYRLALPPSLSTVHDVFHVSMLRKYMPDPSHVVDYEPLEIDENLSYIEQPVKVLTREVNMLRNREIPLVK

KAA0062245.1 pol protein [Cucumis melo var. makuwa]0.075.03Show/hide
Query:  QVRVQRGAYRREAGRMREGHMDASGFLYASADGSLVVVREMPPRRGARRGGRGGRGRGAGRVQPEVQPVAHATDPNAP----------------------
        +VR QRGA RREAGR REGHMDASGFL ASA        EMPPRRGARRGGRGGRGRGAGRVQ EVQPVA A DP AP                      
Subjt:  QVRVQRGAYRREAGRMREGHMDASGFLYASADGSLVVVREMPPRRGARRGGRGGRGRGAGRVQPEVQPVAHATDPNAP----------------------

Query:  QQ--------PAP-PAPAPAPVPALVVPQIVPDQLSAEARHLRDFRNYNPSTFDGSLEDATRAQLWLSSLKTIFRYMKCPKDQKVQCAVFMLTDRGTAWW
        QQ        PAP PAPAPAP PA V PQ VPDQLSAEA+HLRDFR YNP+TFDGSLED TRAQ+WLSSL+TIFRYMKCP++QKVQCAVFMLTDRGTAWW
Subjt:  QQ--------PAP-PAPAPAPVPALVVPQIVPDQLSAEARHLRDFRNYNPSTFDGSLEDATRAQLWLSSLKTIFRYMKCPKDQKVQCAVFMLTDRGTAWW

Query:  ETTERMLEGDEFLNLEQGDMTVEQYDAEFDMLSRFTSEMIAIETARADKFVRGLRLDIQGLVRAFRPATHVDALRLAVDLSLQEKANSSKVAGRGSTSGQ
        ETTERML GDEFLNLEQGDMTVEQYDAEFDMLSRF  EMIA E A ADKFVRGLRLDIQGLVRAFRPATH DALRLAVDLSLQE+ANSSK AGRGSTSGQ
Subjt:  ETTERMLEGDEFLNLEQGDMTVEQYDAEFDMLSRFTSEMIAIETARADKFVRGLRLDIQGLVRAFRPATHVDALRLAVDLSLQEKANSSKVAGRGSTSGQ

Query:  KRKTEQQPIPVPQRNFRPG--------------------------------------------------ADRCPMRLTGNAQNQGEGAPHQGKVFATNKT
        KRK EQQP+PVPQRNFR G                                                  ADRCP+RLTGNAQNQG GAPHQG+VFATNKT
Subjt:  KRKTEQQPIPVPQRNFRPG--------------------------------------------------ADRCPMRLTGNAQNQGEGAPHQGKVFATNKT

Query:  EAERAATVVTGTLPVLGHYALVLFDSGSSHSFISSVFVLHVRLEVEPLHHVLSVSTPSEECMLAKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDW
        EAE+A TVVTGTLPVLGHYALVLFDSGSSHSFISS FVLH RLEVEPLHHVLSVSTPS ECML+KEKVKACQIEIA HVIEVTL+VLDMLDFDVILGMDW
Subjt:  EAERAATVVTGTLPVLGHYALVLFDSGSSHSFISSVFVLHVRLEVEPLHHVLSVSTPSEECMLAKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDW

Query:  LAANHASIDCSRKEVAFNPPSMASLKFKGEGSRSLPQVISAIRTSKLLSQGTWGILASVVDTREVDVSLSSEPVVGDYPDIFPEELPGLPSHREVEFAIE
        L ANHASIDCSRKEV FNPPSMAS + KG GS+SLPQVISAIR SKLLSQGTWGIL SVVDTRE DVSLSSEPVV DYPD+FPEELPGLP HREVEFAIE
Subjt:  LAANHASIDCSRKEVAFNPPSMASLKFKGEGSRSLPQVISAIRTSKLLSQGTWGILASVVDTREVDVSLSSEPVVGDYPDIFPEELPGLPSHREVEFAIE

Query:  LESGTVPIFRAPYRMAPVELKELKVQLQKVFDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRDLNKVTVKNRYPLSRIDDFFDQLQGATVFSKIDFRS
        LE GTVPI RAPYRMAP ELKELKVQLQ++ DKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYR+LNKVTVKNRYPL RIDD FDQLQGATVFSKID RS
Subjt:  LESGTVPIFRAPYRMAPVELKELKVQLQKVFDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRDLNKVTVKNRYPLSRIDDFFDQLQGATVFSKIDFRS

Query:  GYHQLRIKDGDVPKTAFRSRYEHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYFKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWL
        GYHQLRIKD DVPKTAFRSRY HYEFIVMSFGLTNAP VFMDLMNRVFREFLDTFVIVFIDDILIY KTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWL
Subjt:  GYHQLRIKDGDVPKTAFRSRYEHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYFKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWL

Query:  KQVSFLGHVVSKAGVSVDPAKIEALTSWTRPSTVSKVRSFLGLAGYYRRFVENFSCIATPLPQLTRKGAPFVWSKAC-----------------------
        KQVSFLGHVVSKAGVSVDPAKIEA+T WTRPST+S+VRSFLGLAGYYRRFVENFS IATPL QLTRKGAPFVWSKAC                       
Subjt:  KQVSFLGHVVSKAGVSVDPAKIEALTSWTRPSTVSKVRSFLGLAGYYRRFVENFSCIATPLPQLTRKGAPFVWSKAC-----------------------

Query:  --------------------------------------------------------------------KANVVADALSRKVSHSAALITRQAPLHRDLER
                                                                            KANVVADALSRKVSHSAALITRQAPLHRDLER
Subjt:  --------------------------------------------------------------------KANVVADALSRKVSHSAALITRQAPLHRDLER

Query:  AEIAVLVGAVTMQLAQLTVQSTLRQRIIDAQSNDLSLVEKRGLAEAAQAVEFSISSDG----------------------------------STKMYQDL
        AEIAV VGAVTMQLAQLTVQ TLRQRIIDAQSND  LVEKRGLAEA QA EFS+SSDG                                  STKMYQDL
Subjt:  AEIAVLVGAVTMQLAQLTVQSTLRQRIIDAQSNDLSLVEKRGLAEAAQAVEFSISSDG----------------------------------STKMYQDL

Query:  KRVYWWRNMKREVAEFVSKCLVCQLVKAPRQKPAGLLQPLSIPEWKWENVSMDFITRQPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTTSKWAQLYMSE
        KRVYWWRNMKREVAEFVSKCLVCQ VKAP QKPAGLLQPLSIPEWKWENVSMDFIT  PRTLRGF+VIWVVVDRLTKSAHFV GKSTYT SKWAQLYMSE
Subjt:  KRVYWWRNMKREVAEFVSKCLVCQLVKAPRQKPAGLLQPLSIPEWKWENVSMDFITRQPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTTSKWAQLYMSE

Query:  IVRLHGVPVSIVSDRDAHFTSNFG------------------------------------RVCRLLWARGSWDFHLHLMEFAYNNSYQATIGMAPFEALY
        IVRLHGVPVSIVSDRDA FTS F                                     R C L +  GSWD HLHLMEFAYNNSYQATIGMAPFEALY
Subjt:  IVRLHGVPVSIVSDRDAHFTSNFG------------------------------------RVCRLLWARGSWDFHLHLMEFAYNNSYQATIGMAPFEALY

Query:  DKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFDVGDKVFLKVAPIRGVLRFEKRGKLSPHFVGSFEILERIDPV
         KCC+SPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEF+VGDKVFLKVAP+RGVLRFE+RGKLSP FVG FEILERI P+
Subjt:  DKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFDVGDKVFLKVAPIRGVLRFEKRGKLSPHFVGSFEILERIDPV

Query:  AYRLALPPSLSTVHDVFHVSMLRKYMPDPSHVVDYEPLEIDENLSYIEQPVKVLTREVNMLRNREIPLVK
        AYRLALPPSLSTVHDVFHVSMLRKY+PDPSHVVDYEPLEIDENLSY EQPV+VL REV  LRN+EIPLVK
Subjt:  AYRLALPPSLSTVHDVFHVSMLRKYMPDPSHVVDYEPLEIDENLSYIEQPVKVLTREVNMLRNREIPLVK

KAA0063718.1 pol protein [Cucumis melo var. makuwa]0.072.79Show/hide
Query:  VADLFGDSFPFLGWIRCEWTRTRGITKYQVRVQRGAYRREAGRMREGHMDASGFLYASADGSLVVVREMPPRRGARRGGRGGRGRGAGRVQPEVQPVAHA
        +  L G SF     I   +  TR + +  VRVQRGA RREAGRMREGHMDASGFLY           EMPPRRGARRGGRGGRGRGAGRVQPEVQPVA A
Subjt:  VADLFGDSFPFLGWIRCEWTRTRGITKYQVRVQRGAYRREAGRMREGHMDASGFLYASADGSLVVVREMPPRRGARRGGRGGRGRGAGRVQPEVQPVAHA

Query:  TDPNAP-----------------------QQPAPPAPAPAPVPALVVPQIVPDQLSAEARHLRDFRNYNPSTFDGSLEDATRAQLWLSSLKTIFRYMKCP
        TDP AP                       QQPAPPAP PAPVP  VVPQ+  DQLSAEA+HLRDFR YNP+TFDGSLED TRAQLWL SL+TIFRYMKCP
Subjt:  TDPNAP-----------------------QQPAPPAPAPAPVPALVVPQIVPDQLSAEARHLRDFRNYNPSTFDGSLEDATRAQLWLSSLKTIFRYMKCP

Query:  KDQKVQCAVFMLTDRGTAWWETTERMLEGDEFLNLEQGDMTVEQYDAEFDMLSRFTSEMIAIETARADKFVRGLRLDIQGLVRAFRPATHVDALRLAVDL
        +DQKVQCAVFMLTDRGTA     +      EFLNLEQGDMTVEQYDAEFDMLSRF  EMIA E ARADKFVRGLRLDIQGLVRAFRPATH DALRLAVDL
Subjt:  KDQKVQCAVFMLTDRGTAWWETTERMLEGDEFLNLEQGDMTVEQYDAEFDMLSRFTSEMIAIETARADKFVRGLRLDIQGLVRAFRPATHVDALRLAVDL

Query:  SLQEKANSSKVAGRGSTSGQKRKTEQQPIPVPQRNFRPG--------------------------------------------------ADRCPMRLTGN
        SLQE+ANSSKVAGRGSTSGQKRK EQQP PVPQRNFRPG                                                  ADRCPMRLTGN
Subjt:  SLQEKANSSKVAGRGSTSGQKRKTEQQPIPVPQRNFRPG--------------------------------------------------ADRCPMRLTGN

Query:  AQNQGEGAPHQGKVFATNKTEAERAATVVTGTLPVLGHYALVLFDSGSSHSFISSVFVLHVRLEVEPLHHVLSVSTPSEECMLAKEKVKACQIEIAGHVI
        AQNQ  GAPHQGKVFATNKTEAERA TVVTGTLPVLGHYALVLFDSGSSHSFISS FVLH RLE                     EKVKACQIEIAGH I
Subjt:  AQNQGEGAPHQGKVFATNKTEAERAATVVTGTLPVLGHYALVLFDSGSSHSFISSVFVLHVRLEVEPLHHVLSVSTPSEECMLAKEKVKACQIEIAGHVI

Query:  EVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVAFNPPSMASLKFKGEGSRSLPQVISAIRTSKLLSQGTWGILASVVDTREVDVSLSSEPVVGDYPD
        EVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVAFNPPSMAS KFKGEGSRSLPQVISAIR SKLLSQG WGILASVVDTREVDVSLSSEPVV DYPD
Subjt:  EVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVAFNPPSMASLKFKGEGSRSLPQVISAIRTSKLLSQGTWGILASVVDTREVDVSLSSEPVVGDYPD

Query:  IFPEELPGLPSHREVEFAIELESGTVPIFRAPYRMAPVELKELKVQLQKVFDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRDLNKVTVKNRYPLSRI
        +FPE+LPGLP HREVEFAIELE GTVPI RAPYRMAP ELKELKVQLQ++ DKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYR+LNKVTVKNRYPL RI
Subjt:  IFPEELPGLPSHREVEFAIELESGTVPIFRAPYRMAPVELKELKVQLQKVFDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRDLNKVTVKNRYPLSRI

Query:  DDFFDQLQGATVFSKIDFRSGYHQLRIKDGDVPKTAFRSRYEHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYFKTEAEHEEHLRMV
        DD FDQLQGATVF KID RSGYHQLRIKDGDVPKTAFRS+Y HYEFIVMSFGLTNAP VFMDLMNRVFREFLDTFVIVFIDDILIY KTEAEHEEHLR+V
Subjt:  DDFFDQLQGATVFSKIDFRSGYHQLRIKDGDVPKTAFRSRYEHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYFKTEAEHEEHLRMV

Query:  LQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEALTSWTRPSTVSKVRSFLGLAGYYRRFVENFSCIATPLPQLTRKGAPFVWSKAC---
        LQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGV VDPAKIEA+T WTRPSTVS+VRSFLGLAGYYRRFVENFS IATPL QLTRK APFVWSK C   
Subjt:  LQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEALTSWTRPSTVSKVRSFLGLAGYYRRFVENFSCIATPLPQLTRKGAPFVWSKAC---

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------KANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVLVGAVTMQLAQLTVQSTLRQRIIDAQSNDLSLVEKRG
                                  KANVVADALSRKVSHSAALITRQAPLHRDLERAEIAV VGAVT+QLAQLTVQSTLRQRIIDAQSND  L EKRG
Subjt:  --------------------------KANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVLVGAVTMQLAQLTVQSTLRQRIIDAQSNDLSLVEKRG

Query:  LAEAAQAVEFSISSDG----------------------------------STKMYQDLKRVYWWRNMKREVAEFVSKCLVCQLVKAPRQKPAGLLQPLSI
        LAEA QA EFSISSDG                                  STKMYQDLKRVYWWRNMKREVAEFVS+CLVCQ VKAP+QKPAGLLQPLSI
Subjt:  LAEAAQAVEFSISSDG----------------------------------STKMYQDLKRVYWWRNMKREVAEFVSKCLVCQLVKAPRQKPAGLLQPLSI

Query:  PEWKWENVSMDFITRQPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTTSKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSNFGRVCRLLWARGSWDFHLHL
        PEWKWENVSMDFIT  PRTLRGFTVIWVVVDRLTKSAHFVPGKSTYT SKWAQLYMSEIVRLHGVPV IVSDRDA FTS F       W +GSWD HLHL
Subjt:  PEWKWENVSMDFITRQPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTTSKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSNFGRVCRLLWARGSWDFHLHL

Query:  MEFAYNNSYQATIGMAPFEALYDKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFDVGDKVFLKVAPIRGVLRFE
        MEFAYNNSYQATIGMAPFEALY KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEF+VGDKVFLKVAP+RGVLRFE
Subjt:  MEFAYNNSYQATIGMAPFEALYDKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFDVGDKVFLKVAPIRGVLRFE

Query:  KRGKLSPHFVGSFEILERIDPVAYRLALPPSLSTVHDVFHVSMLRKYMPDPSHVVDYEPLEIDENLSYIEQPVKVLTREVNMLRNREIPLVK
        ++GKLSP FVG FEILERI PVAYRLALPPSLSTVHDVFHVSMLRKY+PDPSHVVDY+PLEIDENLSY EQP +VL REV  LRN+EIPLVK
Subjt:  KRGKLSPHFVGSFEILERIDPVAYRLALPPSLSTVHDVFHVSMLRKYMPDPSHVVDYEPLEIDENLSYIEQPVKVLTREVNMLRNREIPLVK

TrEMBL top hitse value%identityAlignment
A0A5A7TB42 Reverse transcriptase0.0e+0072.89Show/hide
Query:  QVRVQRGAYRREAGRMREGHMDASGFLYASADGSLVVVREMPPRRGARRGGRGGRGRGAGRVQPEVQPVAHATDPNAPQQPAPPAPAPAPVPALVVPQIV
        +VR QRGA RREAGRMREGHM+ASGFL ASAD    V  EMPPRRGAR   RGGRGRGAGRVQPEVQPVA ATDP AP                    +V
Subjt:  QVRVQRGAYRREAGRMREGHMDASGFLYASADGSLVVVREMPPRRGARRGGRGGRGRGAGRVQPEVQPVAHATDPNAPQQPAPPAPAPAPVPALVVPQIV

Query:  PDQLSAEARHLRDFRNYNPSTFDGSLEDATRAQLWLSSLKTIFRYMKCPKDQKVQCAVFMLTDRGTAWWETTERMLEGD---------------------
        PDQLSAEA+HLRDFR YNP+TFDGSLED TRAQLWLSSL+TIFRYMKCP+DQKVQCAVFMLTDRGTAWWETTERML GD                     
Subjt:  PDQLSAEARHLRDFRNYNPSTFDGSLEDATRAQLWLSSLKTIFRYMKCPKDQKVQCAVFMLTDRGTAWWETTERMLEGD---------------------

Query:  -------EFLNLEQGDMTVEQYDAEFDMLSRFTSEMIAIETARADKFVRGLRLDIQGLVRAFRPATHVDALRLAVDLSLQEKANSSKVAGRGSTSGQKRK
               EFLNLEQGDMTVEQYDAEFDMLS F  EMIA E ARADKFVRGLRLDIQGLVRAFRPATH DALRLAVDLSLQE+ANSSK AGRGSTSGQKRK
Subjt:  -------EFLNLEQGDMTVEQYDAEFDMLSRFTSEMIAIETARADKFVRGLRLDIQGLVRAFRPATHVDALRLAVDLSLQEKANSSKVAGRGSTSGQKRK

Query:  TEQQPIPVPQRNFRP--------------------------------------------------GADRCPMRLTGNAQNQGEGAPHQGKVFATNKTEAE
         EQQP+PVPQRNFR                                                    ADRCP+RLTGNAQNQG GAPHQG+VFATNKTEAE
Subjt:  TEQQPIPVPQRNFRP--------------------------------------------------GADRCPMRLTGNAQNQGEGAPHQGKVFATNKTEAE

Query:  RAATVVTGTLPVLGHYALVLFDSGSSHSFISSVFVLHVRLEVEPLHHVLSVSTPSEECMLAKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAA
        RA TVVTGTLPVLGHYALVLFDSGSSHSFISS FVLH RLEVEPLHHVLSVSTPS ECML+KEKVKACQIEIA HVIEVTLLVLDMLDFDVILGMDWLAA
Subjt:  RAATVVTGTLPVLGHYALVLFDSGSSHSFISSVFVLHVRLEVEPLHHVLSVSTPSEECMLAKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAA

Query:  NHASIDCSRKEVAFNPPSMASLKFKGEGSRSLPQVISAIRTSKLLSQGTWGILASVVDTREVDVSLSSEPVVGDYPDIFPEELPGLPSHREVEFAIELES
        NHASIDCSRKEV FNPPSMAS KFKG GSRSLPQVISAIR SKLLSQGTWGILASVVDTREVDVSLSSEPVV DYPD+FPEELPGLP HREVEFAIELE 
Subjt:  NHASIDCSRKEVAFNPPSMASLKFKGEGSRSLPQVISAIRTSKLLSQGTWGILASVVDTREVDVSLSSEPVVGDYPDIFPEELPGLPSHREVEFAIELES

Query:  GTVPIFRAPYRMAPVELKELKVQLQKVFDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRDLNKVTVKNRYPLSRIDDFFDQLQGATVFSKIDFRSGYH
        GTVPI RAPYRMAP ELKELKVQLQ++ DKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYR+LNKVTVKNRYPL RIDD FDQLQGATVFSKID RSGYH
Subjt:  GTVPIFRAPYRMAPVELKELKVQLQKVFDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRDLNKVTVKNRYPLSRIDDFFDQLQGATVFSKIDFRSGYH

Query:  QLRIKDGDVPKTAFRSRYEHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYFKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQV
        QLRIKDGDVPKTAFRSRY HYEFIVMSFGLTNAP VFMDLMNRVF+EFLDTFVIVFIDDILIY K EAEHEEHLR+VLQTLRDNKLYAKFSKCEFWLKQV
Subjt:  QLRIKDGDVPKTAFRSRYEHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYFKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQV

Query:  SFLGHVVSKAGVSVDPAKIEALTSWTRPSTVSKVRSFLGLAGYYRRFVENFSCIATPLPQLTRKGAPFVWSKAC--------------------------
        SFLGHVVSKAGVSVDPAKIEA+T WTRPSTVS+VRSFLGLAGYYRRFVENFS IATPL QLTRKGAPFVWSKAC                          
Subjt:  SFLGHVVSKAGVSVDPAKIEALTSWTRPSTVSKVRSFLGLAGYYRRFVENFSCIATPLPQLTRKGAPFVWSKAC--------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---KANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVLVGAVTMQLAQLTVQSTLRQRIIDAQSNDLSLVEKRGLAEAAQAVEFSISSD--------
           KANVVADALSRKVSHSAALITR APLHRDLERAEIAV VGA+TMQLAQLTVQ TLRQRII AQSND  LVEKRGLAEA QA  FSISSD        
Subjt:  ---KANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVLVGAVTMQLAQLTVQSTLRQRIIDAQSNDLSLVEKRGLAEAAQAVEFSISSD--------

Query:  --------------------------GSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQLVKAPRQKPAGLLQPLSIPEWKWENVSMDFITRQPRTLRGF
                                  GSTKMYQDLKRVYWWRNMKREVAEFVS+CLVCQ VKAPRQKPAGLLQPLSIPEWKWENVSMDFIT  PRTLRGF
Subjt:  --------------------------GSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQLVKAPRQKPAGLLQPLSIPEWKWENVSMDFITRQPRTLRGF

Query:  TVIWVVVDRLTKSAHFVPGKSTYTTSKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSNFG------------------------------------RVCRL
        TVIWVVVDRLTKSAHFVPGKSTY  SKWAQLYMSEIVRLHGVPVSIVSDRDA FTS F                                     R C L
Subjt:  TVIWVVVDRLTKSAHFVPGKSTYTTSKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSNFG------------------------------------RVCRL

Query:  LWARGSWDFHLHLMEFAYNNSYQATIGMAPFEALYDKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFDVGDKVF
         +  GSWD HLHLMEFAYNNSYQATIGMAPFEALYDKCCRS VCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEF+VGDKVF
Subjt:  LWARGSWDFHLHLMEFAYNNSYQATIGMAPFEALYDKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFDVGDKVF

Query:  LKVAPIRGVLRFEKRGKLSPHFVGSFEILERIDPVAYRLALPPSLSTVHDVFHVSMLRKYMPDPSHVVDYEPLEIDENLSYIEQPVKVLTREVNMLRNRE
        LKVAP+RGVLRFE+RGKLSP FVGSFEILERI PVAYR+ALPPSLSTVHDVFHVSMLRKY+PDPSHVVDYEPLEIDENLSY EQPV+VL REV MLRNRE
Subjt:  LKVAPIRGVLRFEKRGKLSPHFVGSFEILERIDPVAYRLALPPSLSTVHDVFHVSMLRKYMPDPSHVVDYEPLEIDENLSYIEQPVKVLTREVNMLRNRE

Query:  IPLVK
        IPLVK
Subjt:  IPLVK

A0A5A7UAA8 Reverse transcriptase0.0e+0072.75Show/hide
Query:  MPPRRGARRGGRGGRGRGAGRVQPEVQPVAHATDPNAPQQPAP-PAPAPAPVPAL--VVPQIVPDQLSAEARHLRDFRNYNPSTFDGSLEDATRAQLWLS
        MPPRRGARRGGRGGRGRGAGRVQPE         P     PAP PAP PAP PAL  V PQ VPDQLSAEA+HLRDFR YNP+TFDGSLED TRAQ+WLS
Subjt:  MPPRRGARRGGRGGRGRGAGRVQPEVQPVAHATDPNAPQQPAP-PAPAPAPVPAL--VVPQIVPDQLSAEARHLRDFRNYNPSTFDGSLEDATRAQLWLS

Query:  SLKTIFRYMKCPKDQKVQCAVFMLTDRGTAWWETTERMLEGD----------------------------EFLNLEQGDMTVEQYDAEFDMLSRFTSEMI
        SL+TIFRYMKCP+DQKVQCAVFMLTDRGTAWWETTERML GD                            EFLNLEQGDMTVEQYDAEFDMLSRF  EMI
Subjt:  SLKTIFRYMKCPKDQKVQCAVFMLTDRGTAWWETTERMLEGD----------------------------EFLNLEQGDMTVEQYDAEFDMLSRFTSEMI

Query:  AIETARADKFVRGLRLDIQGLVRAFRPATHVDALRLAVDLSLQEKANSSKVAGRGSTSGQKRKTEQQPIPVPQRNFRPG---------------------
        A E ARADKFVRGLRLDIQGLVRAFRPATH DALRLAVDLSLQE ANSSK AGRGSTSGQKRK EQQP+PVPQRNFR G                     
Subjt:  AIETARADKFVRGLRLDIQGLVRAFRPATHVDALRLAVDLSLQEKANSSKVAGRGSTSGQKRKTEQQPIPVPQRNFRPG---------------------

Query:  -----------------------------ADRCPMRLTGNAQNQGEGAPHQGKVFATNKTEAERAATVVTGTLPVLGHYALVLFDSGSSHSFISSVFVLH
                                     ADRCP+RLTG AQNQG GAPHQG+VFATN+TEAE+A TVVTGTLPVLGHYALVLFDSGSSHSFISS FV H
Subjt:  -----------------------------ADRCPMRLTGNAQNQGEGAPHQGKVFATNKTEAERAATVVTGTLPVLGHYALVLFDSGSSHSFISSVFVLH

Query:  VRLEVEPLHHVLSVSTPSEECMLAKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVAFNPPSMASLKFKGEGSRSLPQVIS
         RLEVEPLHHVLSVSTPS ECML++EKVKACQIEIAGHVIEVTL+VLDMLDFDVILGMDWLAANHASIDCSRK+V FNPPSMAS KFKG GS+SLPQVIS
Subjt:  VRLEVEPLHHVLSVSTPSEECMLAKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVAFNPPSMASLKFKGEGSRSLPQVIS

Query:  AIRTSKLLSQGTWGILASVVDTREVDVSLSSEPVVGDYPDIFPEELPGLPSHREVEFAIELESGTVPIFRAPYRMAPVELKELKVQLQKVFDKGFIRPSV
        AIR SKLLSQGTWGILASVVDTRE DVSLSSEPVV DYPD+FPEELPGLP HREVEFAIELE GTVPI RAPYRMAP ELKELKVQLQ++ DKGFIRP+V
Subjt:  AIRTSKLLSQGTWGILASVVDTREVDVSLSSEPVVGDYPDIFPEELPGLPSHREVEFAIELESGTVPIFRAPYRMAPVELKELKVQLQKVFDKGFIRPSV

Query:  SPWGAPVLFVKKKDGSMRLCIDYRDLNKVTVKNRYPLSRIDDFFDQLQGATVFSKIDFRSGYHQLRIKDGDVPKTAFRSRYEHYEFIVMSFGLTNAPTVF
        SPWGAPVLFVKKKDGSMRLCIDYR+LNKVTVKNRYPL RIDD FDQLQGATVFSKID RSGY+QLRIKD DVPKTAFRSRY HYEFIVMSFGLTNAP VF
Subjt:  SPWGAPVLFVKKKDGSMRLCIDYRDLNKVTVKNRYPLSRIDDFFDQLQGATVFSKIDFRSGYHQLRIKDGDVPKTAFRSRYEHYEFIVMSFGLTNAPTVF

Query:  MDLMNRVFREFLDTFVIVFIDDILIYFKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEALTSWTRPSTVSKVRSF
        MDLMNRVFREFLDTFVIVFIDDILIY KTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKA VSVDPAKIEA+T WTRPSTVS+VRSF
Subjt:  MDLMNRVFREFLDTFVIVFIDDILIYFKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEALTSWTRPSTVSKVRSF

Query:  LGLAGYYRRFVENFSCIATPLPQLTRKGAPFVWSKAC---------------------------------------------------------------
        LGLAGYYRRFVENFS IATPL QLTRKGAPFVWSKAC                                                               
Subjt:  LGLAGYYRRFVENFSCIATPLPQLTRKGAPFVWSKAC---------------------------------------------------------------

Query:  ------------------------------------------------------------------KANVVADALSRKVSHSAALITRQAPLHRDLERAE
                                                                          KANVVADALSRKVSHSAALITRQAPLHRDLERAE
Subjt:  ------------------------------------------------------------------KANVVADALSRKVSHSAALITRQAPLHRDLERAE

Query:  IAVLVGAVTMQLAQLTVQSTLRQRIIDAQSNDLSLVEKRGLAEAAQAVEFSISSD----------------------------------GSTKMYQDLKR
        IAV VGAVTMQLAQLTVQ TLRQRIIDAQSND  LVEKRGLAEA QAVEFS+SSD                                  GSTKMYQDLKR
Subjt:  IAVLVGAVTMQLAQLTVQSTLRQRIIDAQSNDLSLVEKRGLAEAAQAVEFSISSD----------------------------------GSTKMYQDLKR

Query:  VYWWRNMKREVAEFVSKCLVCQLVKAPRQKPAGLLQPLSIPEWKWENVSMDFITRQPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTTSKWAQLYMSEIV
        VYWWRNMKREVAEFVSKCLVCQ VK PRQKPAGLLQPLSIPEWKWENVSMDFIT  PRTLRGFTVIWVVVDRLTKSAHFVPGKSTYT SKWAQLYMSEIV
Subjt:  VYWWRNMKREVAEFVSKCLVCQLVKAPRQKPAGLLQPLSIPEWKWENVSMDFITRQPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTTSKWAQLYMSEIV

Query:  RLHGVPVSIVSDRDAHFTSNFG------------------------------------RVCRLLWARGSWDFHLHLMEFAYNNSYQATIGMAPFEALYDK
        RLHGVPVSIVSDRDA FTS F                                     R C L +  GSWD HLHLMEFAYNNSYQATIGMAPFEALY K
Subjt:  RLHGVPVSIVSDRDAHFTSNFG------------------------------------RVCRLLWARGSWDFHLHLMEFAYNNSYQATIGMAPFEALYDK

Query:  CCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFDVGDKVFLKVAPIRGVLRFEKRGKLSPHFVGSFEILERIDPVAY
        CCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEF++ DKVFLKVAP++GVLRFE+RGKLSP FVG FEILERI PVAY
Subjt:  CCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFDVGDKVFLKVAPIRGVLRFEKRGKLSPHFVGSFEILERIDPVAY

Query:  RLALPPSLSTVHDVFHVSMLRKYMPDPSHVVDYEPLEIDENLSYIEQPVKVLTREVNMLRNREIPLVK
        RLALPPSLSTVHDVFHVSMLRKY+PDPSHVVDYEPLEIDENLSY+EQPV+VL REV  LRN+EIPLVK
Subjt:  RLALPPSLSTVHDVFHVSMLRKYMPDPSHVVDYEPLEIDENLSYIEQPVKVLTREVNMLRNREIPLVK

A0A5A7UD41 Reverse transcriptase0.0e+0072.77Show/hide
Query:  LFGDSFPFLGWIRCEWTRTRGITKYQVRVQRGAYRREAGRMREGHMDASGFLYASADGSLVVVREMPPRRGARRGGRGGRGRGAGRVQPEVQPVAHATDP
        L G SF     I   +  TR + +  VRVQRGA RREAGRMREGHMDASGFLYASAD    V  EMPPRRGARRGGRGGRGRGA RVQPEVQPVA ATDP
Subjt:  LFGDSFPFLGWIRCEWTRTRGITKYQVRVQRGAYRREAGRMREGHMDASGFLYASADGSLVVVREMPPRRGARRGGRGGRGRGAGRVQPEVQPVAHATDP

Query:  NAPQQPAPPAPAPAPVPALVVPQIVPDQLSAEARHLRDFRNYNPSTFDGSLEDATRAQLWLSSLKTIFRYMKCPKDQKVQCAVFMLTDRGTAWWETTERM
         AP                    +VPDQLSAEA+ LRDFR YNP+TFDGSLED TRAQLWLSSL+TIFRYMKCP+DQKVQCAVFMLTDRGTAWWETTERM
Subjt:  NAPQQPAPPAPAPAPVPALVVPQIVPDQLSAEARHLRDFRNYNPSTFDGSLEDATRAQLWLSSLKTIFRYMKCPKDQKVQCAVFMLTDRGTAWWETTERM

Query:  LEGD----------------------------EFLNLEQGDMTVEQYDAEFDMLSRFTSEMIAIETARADKFVRGLRLDIQGLVRAFRPATHVDALRLAV
        L GD                            +FLNLEQGDMTVEQYDAEFDMLSRF  EMIA E ARADKFVRGLRLDIQGLVRAFRPATH DALRLAV
Subjt:  LEGD----------------------------EFLNLEQGDMTVEQYDAEFDMLSRFTSEMIAIETARADKFVRGLRLDIQGLVRAFRPATHVDALRLAV

Query:  DLSLQEKANSSKVAGRGSTSGQKRKTEQQPIPVPQRNFRPGADRCPMRLTGNAQNQGEGAPHQGKVFATNKTEAERAATVVTGTLPVLGHYALVLFDSGS
        DLSLQE+ANSSK AGRGSTSGQKRKTEQQP+ VPQRNFR                   GAPHQGKVFATNKTEAER  TVV GTLPVLGHYALVLFDSGS
Subjt:  DLSLQEKANSSKVAGRGSTSGQKRKTEQQPIPVPQRNFRPGADRCPMRLTGNAQNQGEGAPHQGKVFATNKTEAERAATVVTGTLPVLGHYALVLFDSGS

Query:  SHSFISSVFVLHVRLEVEPLHHVLSVSTPSEECMLAKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVAFNPPSMASLKFK
        SHSFIS  FVLH RLEVEPLH+VLSVSTPS ECML+KEKVK CQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAA HASIDCS KEVAFNPPSMAS KFK
Subjt:  SHSFISSVFVLHVRLEVEPLHHVLSVSTPSEECMLAKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDCSRKEVAFNPPSMASLKFK

Query:  GEGSRSLPQVISAIRTSKLLSQGTWGILASVVDTREVDVSLSSEPVVGDYPDIFPEELPGLPSHREVEFAIELESGTVPIFRAPYRMAPVELKELKVQLQ
        GEGSRSLPQVIS IR SKLLSQGTWGILASVVDTREVDVSLSSEPVV DYPD+FPEELPGLP HREVEFAIELE GTVPI RAPYRMAP ELKELKVQLQ
Subjt:  GEGSRSLPQVISAIRTSKLLSQGTWGILASVVDTREVDVSLSSEPVVGDYPDIFPEELPGLPSHREVEFAIELESGTVPIFRAPYRMAPVELKELKVQLQ

Query:  KVFDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRDLNKVTVKNRYPLSRIDDFFDQLQGATVFSKIDFRSGYHQLRIKDGDVPKTAFRSRYEHYEFIV
        ++ DKGFIRPSVSPWGAPVLFVKKKDGSMRL IDYR+LNKVTVKNRYPL RIDD FDQLQGATVFSKI+ RSGYHQLRIKDGDVPK AFRSRY +YEFIV
Subjt:  KVFDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRDLNKVTVKNRYPLSRIDDFFDQLQGATVFSKIDFRSGYHQLRIKDGDVPKTAFRSRYEHYEFIV

Query:  MSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYFKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEALTSW
        MSFGLTNAP VFMDLMNRVFREFLDTFVIVFID+ILIY KTEAEHEEHLR+VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEA+T W
Subjt:  MSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYFKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEALTSW

Query:  TRPSTVSKVRSFLGLAGYYRRFVENFSCIATPLPQLTRKGAPFVWSKAC---------------------------------------------------
        TRPSTV  VRSFLGLAGYYRRFVENFS IATPL QLTRKGAPFVWSK C                                                   
Subjt:  TRPSTVSKVRSFLGLAGYYRRFVENFSCIATPLPQLTRKGAPFVWSKAC---------------------------------------------------

Query:  ------------------------------------------------------------------------------KANVVADALSRKVSHSAALITR
                                                                                      KANVVADALSRKVSHSAALITR
Subjt:  ------------------------------------------------------------------------------KANVVADALSRKVSHSAALITR

Query:  QAPLHRDLERAEIAVLVGAVTMQLAQLTVQSTLRQRIIDAQSNDLSLVEKRGLAEAAQAVEFSISSD---------------------------------
        QAPL RDLERAEIAV VGAVT+QLAQLTVQSTLR+RIIDAQSND  LVEKRGLAEA Q VEFSISSD                                 
Subjt:  QAPLHRDLERAEIAVLVGAVTMQLAQLTVQSTLRQRIIDAQSNDLSLVEKRGLAEAAQAVEFSISSD---------------------------------

Query:  -GSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQLVKAPRQKPAGLLQPLSIPEWKWENVSMDFITRQPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTT
         GSTKMYQDLKRVYWWRNMKREVAEFVS+CLVCQ VKAPRQKPAGLLQPLSIPEWKWENVSM FIT  PRTLRGFTVIWVVVDRLTKSAHFVPGKSTYT 
Subjt:  -GSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQLVKAPRQKPAGLLQPLSIPEWKWENVSMDFITRQPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTT

Query:  SKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSNFG------------------------------------RVCRLLWARGSWDFHLHLMEFAYNNSYQAT
        SKWAQLYMSEIVRLHGVPVSIVSDRDA F   F                                     R C L +  GSWD HLHLM+FAYNNSYQAT
Subjt:  SKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSNFG------------------------------------RVCRLLWARGSWDFHLHLMEFAYNNSYQAT

Query:  IGMAPFEALYDKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFDVGDKVFLKVAPIRGVLRFEKRGKLSPHFVGS
        I MAPFEALY KCCRSPVCWGEVGEQRLMGPELV+STNEAIQKIRSRMHTAQSRQKSYADVRRKDLEF+VGDKVFLKVAP+RGVLRFE+RGKLSP FVG 
Subjt:  IGMAPFEALYDKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFDVGDKVFLKVAPIRGVLRFEKRGKLSPHFVGS

Query:  FEILERIDPVAYRLALPPSLSTVHDVFHVSMLRKYMPDPSHVVDYEPLEIDENLSYIEQPVKVLTREVNMLRNREIPLVK
        FEILERI PVAYRLALPPSLS VHDVFHVSMLRKY+PDPS VVDYEPLEIDENLSY E+PV+VL REV MLRNREIPL+K
Subjt:  FEILERIDPVAYRLALPPSLSTVHDVFHVSMLRKYMPDPSHVVDYEPLEIDENLSYIEQPVKVLTREVNMLRNREIPLVK

A0A5A7V810 Reverse transcriptase0.0e+0073.62Show/hide
Query:  GITKY--QVRVQRGAYRREAGRMREGHMDASGFLYASADGSLVVVREMPPRRGARRGGRGGRGRGAGRVQPEVQPVAHATDPNAP---------------
        GIT+   +VRVQRGA RREAGRMREGHMDASGFLY           EMPPRRGARRGGRGGRGRGAGRVQPEVQPVA ATDP AP               
Subjt:  GITKY--QVRVQRGAYRREAGRMREGHMDASGFLYASADGSLVVVREMPPRRGARRGGRGGRGRGAGRVQPEVQPVAHATDPNAP---------------

Query:  --------QQPAPPAPAPAPVPALVVPQIVPDQLSAEARHLRDFRNYNPSTFDGSLEDATRAQLWLSSLKTIFRYMKCPKDQKVQCAVFMLTDRGTAWWE
                QQPAPPAP PAPVP  VVPQ+  DQLSAEA+HLRDFR YNP+TFDGSLED TRAQLWL SL+TIFRYMKCP+DQKVQCAVFMLTDRGTA   
Subjt:  --------QQPAPPAPAPAPVPALVVPQIVPDQLSAEARHLRDFRNYNPSTFDGSLEDATRAQLWLSSLKTIFRYMKCPKDQKVQCAVFMLTDRGTAWWE

Query:  TTERMLEGDEFLNLEQGDMTVEQYDAEFDMLSRFTSEMIAIETARADKFVRGLRLDIQGLVRAFRPATHVDALRLAVDLSLQEKANSSKVAGRGSTSGQK
           R  +  EFLNLEQGDMTVEQYDAEFDMLSRF  EMIA E ARADKFVRGLRLDIQGLVRAFRPATH DALRLAVDLSLQE+ANSSKVAGRGSTSGQK
Subjt:  TTERMLEGDEFLNLEQGDMTVEQYDAEFDMLSRFTSEMIAIETARADKFVRGLRLDIQGLVRAFRPATHVDALRLAVDLSLQEKANSSKVAGRGSTSGQK

Query:  RKTEQQPIPVPQRNFRPG--------------------------------------------------ADRCPMRLTGNAQNQGEGAPHQGKVFATNKTE
        RK EQQP PVPQRNFRPG                                                  ADRCPMRLTGNAQNQ  GAPHQGKVFATNKTE
Subjt:  RKTEQQPIPVPQRNFRPG--------------------------------------------------ADRCPMRLTGNAQNQGEGAPHQGKVFATNKTE

Query:  AERAATVVTGTLPVLGHYALVLFDSGSSHSFISSVFVLHVRLEVEPLHHVLSVSTPSEECMLAKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWL
        AERA TVVTGTLPVLGHYALVLFDSGSSHSFISS FVLH RLE                     EKVKACQIEIAGH IEVTLLVLDMLDFDVILGMDWL
Subjt:  AERAATVVTGTLPVLGHYALVLFDSGSSHSFISSVFVLHVRLEVEPLHHVLSVSTPSEECMLAKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWL

Query:  AANHASIDCSRKEVAFNPPSMASLKFKGEGSRSLPQVISAIRTSKLLSQGTWGILASVVDTREVDVSLSSEPVVGDYPDIFPEELPGLPSHREVEFAIEL
        AANHASIDCSRKEVAFNPPSMAS KFKGEGSRSLPQVISAIR SKLLSQG WGILASVVDTREVDVSLSSEPVV DYPD+FPE+LPGLP HREVEFAIEL
Subjt:  AANHASIDCSRKEVAFNPPSMASLKFKGEGSRSLPQVISAIRTSKLLSQGTWGILASVVDTREVDVSLSSEPVVGDYPDIFPEELPGLPSHREVEFAIEL

Query:  ESGTVPIFRAPYRMAPVELKELKVQLQKVFDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRDLNKVTVKNRYPLSRIDDFFDQLQGATVFSKIDFRSG
        E GTVPI RAPYRMAP ELKELKVQLQ++ DKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYR+LNKVTVKNRYPL RIDD FDQLQGATVF KID RSG
Subjt:  ESGTVPIFRAPYRMAPVELKELKVQLQKVFDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRDLNKVTVKNRYPLSRIDDFFDQLQGATVFSKIDFRSG

Query:  YHQLRIKDGDVPKTAFRSRYEHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYFKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLK
        YHQLRIKDGDVPKTAFRS+Y HYEFIVMSFGLTNAP VFMDLMNRVFREFLDTFVIVFIDDILIY KTEAEHEEHLR+VLQTLRDNKLYAKFSKCEFWLK
Subjt:  YHQLRIKDGDVPKTAFRSRYEHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYFKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLK

Query:  QVSFLGHVVSKAGVSVDPAKIEALTSWTRPSTVSKVRSFLGLAGYYRRFVENFSCIATPLPQLTRKGAPFVWSKAC------------------------
        QVSFLGHVVSKAGV VDPAKIEA+T WTRPSTVS+VRSFLGLAGYYRRFVENFS IATPL QLTRK APFVWSK C                        
Subjt:  QVSFLGHVVSKAGVSVDPAKIEALTSWTRPSTVSKVRSFLGLAGYYRRFVENFSCIATPLPQLTRKGAPFVWSKAC------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----KANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVLVGAVTMQLAQLTVQSTLRQRIIDAQSNDLSLVEKRGLAEAAQAVEFSISSD------
             KANVVADALSRKVSHSAALITRQAPLHRDLERAEIAV VGAVT+QLAQLTVQSTLRQRIIDAQSND  L EKRGLAEA QA EFSISSD      
Subjt:  -----KANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVLVGAVTMQLAQLTVQSTLRQRIIDAQSNDLSLVEKRGLAEAAQAVEFSISSD------

Query:  ----------------------------GSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQLVKAPRQKPAGLLQPLSIPEWKWENVSMDFITRQPRTLR
                                    GSTKMYQDLKRVYWWRNMKREVAEFVS+CLVCQ VKAP+QKPAGLLQPLSIPEWKWENVSMDFIT  PRTLR
Subjt:  ----------------------------GSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQLVKAPRQKPAGLLQPLSIPEWKWENVSMDFITRQPRTLR

Query:  GFTVIWVVVDRLTKSAHFVPGKSTYTTSKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSNFGRVCRLLWARGSWDFHLHLMEFAYNNSYQATIGMAPFEAL
        GFTVIWVVVDRLTKSAHFVPGKSTYT SKWAQLYMSEIVRLHGVPV IVSDRDA FTS F       W +GSWD HLHLMEFAYNNSYQATIGMAPFEAL
Subjt:  GFTVIWVVVDRLTKSAHFVPGKSTYTTSKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSNFGRVCRLLWARGSWDFHLHLMEFAYNNSYQATIGMAPFEAL

Query:  YDKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFDVGDKVFLKVAPIRGVLRFEKRGKLSPHFVGSFEILERIDP
        Y KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEF+VGDKVFLKVAP+RGVLRFE++GKLSP FVG FEILERI P
Subjt:  YDKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFDVGDKVFLKVAPIRGVLRFEKRGKLSPHFVGSFEILERIDP

Query:  VAYRLALPPSLSTVHDVFHVSMLRKYMPDPSHVVDYEPLEIDENLSYIEQPVKVLTREVNMLRNREIPLVK
        VAYRLALPPSLSTVHDVFHVSMLRKY+PDPSHVVDY+PLEIDENLSY EQP +VL REV  LRN+EIPLVK
Subjt:  VAYRLALPPSLSTVHDVFHVSMLRKYMPDPSHVVDYEPLEIDENLSYIEQPVKVLTREVNMLRNREIPLVK

A0A5A7V8L8 Pol protein0.0e+0075.03Show/hide
Query:  QVRVQRGAYRREAGRMREGHMDASGFLYASADGSLVVVREMPPRRGARRGGRGGRGRGAGRVQPEVQPVAHATDPNAP----------------------
        +VR QRGA RREAGR REGHMDASGFL ASA        EMPPRRGARRGGRGGRGRGAGRVQ EVQPVA A DP AP                      
Subjt:  QVRVQRGAYRREAGRMREGHMDASGFLYASADGSLVVVREMPPRRGARRGGRGGRGRGAGRVQPEVQPVAHATDPNAP----------------------

Query:  QQ--------PAP-PAPAPAPVPALVVPQIVPDQLSAEARHLRDFRNYNPSTFDGSLEDATRAQLWLSSLKTIFRYMKCPKDQKVQCAVFMLTDRGTAWW
        QQ        PAP PAPAPAP PA V PQ VPDQLSAEA+HLRDFR YNP+TFDGSLED TRAQ+WLSSL+TIFRYMKCP++QKVQCAVFMLTDRGTAWW
Subjt:  QQ--------PAP-PAPAPAPVPALVVPQIVPDQLSAEARHLRDFRNYNPSTFDGSLEDATRAQLWLSSLKTIFRYMKCPKDQKVQCAVFMLTDRGTAWW

Query:  ETTERMLEGDEFLNLEQGDMTVEQYDAEFDMLSRFTSEMIAIETARADKFVRGLRLDIQGLVRAFRPATHVDALRLAVDLSLQEKANSSKVAGRGSTSGQ
        ETTERML GDEFLNLEQGDMTVEQYDAEFDMLSRF  EMIA E A ADKFVRGLRLDIQGLVRAFRPATH DALRLAVDLSLQE+ANSSK AGRGSTSGQ
Subjt:  ETTERMLEGDEFLNLEQGDMTVEQYDAEFDMLSRFTSEMIAIETARADKFVRGLRLDIQGLVRAFRPATHVDALRLAVDLSLQEKANSSKVAGRGSTSGQ

Query:  KRKTEQQPIPVPQRNFRPG--------------------------------------------------ADRCPMRLTGNAQNQGEGAPHQGKVFATNKT
        KRK EQQP+PVPQRNFR G                                                  ADRCP+RLTGNAQNQG GAPHQG+VFATNKT
Subjt:  KRKTEQQPIPVPQRNFRPG--------------------------------------------------ADRCPMRLTGNAQNQGEGAPHQGKVFATNKT

Query:  EAERAATVVTGTLPVLGHYALVLFDSGSSHSFISSVFVLHVRLEVEPLHHVLSVSTPSEECMLAKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDW
        EAE+A TVVTGTLPVLGHYALVLFDSGSSHSFISS FVLH RLEVEPLHHVLSVSTPS ECML+KEKVKACQIEIA HVIEVTL+VLDMLDFDVILGMDW
Subjt:  EAERAATVVTGTLPVLGHYALVLFDSGSSHSFISSVFVLHVRLEVEPLHHVLSVSTPSEECMLAKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDW

Query:  LAANHASIDCSRKEVAFNPPSMASLKFKGEGSRSLPQVISAIRTSKLLSQGTWGILASVVDTREVDVSLSSEPVVGDYPDIFPEELPGLPSHREVEFAIE
        L ANHASIDCSRKEV FNPPSMAS + KG GS+SLPQVISAIR SKLLSQGTWGIL SVVDTRE DVSLSSEPVV DYPD+FPEELPGLP HREVEFAIE
Subjt:  LAANHASIDCSRKEVAFNPPSMASLKFKGEGSRSLPQVISAIRTSKLLSQGTWGILASVVDTREVDVSLSSEPVVGDYPDIFPEELPGLPSHREVEFAIE

Query:  LESGTVPIFRAPYRMAPVELKELKVQLQKVFDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRDLNKVTVKNRYPLSRIDDFFDQLQGATVFSKIDFRS
        LE GTVPI RAPYRMAP ELKELKVQLQ++ DKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYR+LNKVTVKNRYPL RIDD FDQLQGATVFSKID RS
Subjt:  LESGTVPIFRAPYRMAPVELKELKVQLQKVFDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRDLNKVTVKNRYPLSRIDDFFDQLQGATVFSKIDFRS

Query:  GYHQLRIKDGDVPKTAFRSRYEHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYFKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWL
        GYHQLRIKD DVPKTAFRSRY HYEFIVMSFGLTNAP VFMDLMNRVFREFLDTFVIVFIDDILIY KTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWL
Subjt:  GYHQLRIKDGDVPKTAFRSRYEHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYFKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWL

Query:  KQVSFLGHVVSKAGVSVDPAKIEALTSWTRPSTVSKVRSFLGLAGYYRRFVENFSCIATPLPQLTRKGAPFVWSKAC-----------------------
        KQVSFLGHVVSKAGVSVDPAKIEA+T WTRPST+S+VRSFLGLAGYYRRFVENFS IATPL QLTRKGAPFVWSKAC                       
Subjt:  KQVSFLGHVVSKAGVSVDPAKIEALTSWTRPSTVSKVRSFLGLAGYYRRFVENFSCIATPLPQLTRKGAPFVWSKAC-----------------------

Query:  --------------------------------------------------------------------KANVVADALSRKVSHSAALITRQAPLHRDLER
                                                                            KANVVADALSRKVSHSAALITRQAPLHRDLER
Subjt:  --------------------------------------------------------------------KANVVADALSRKVSHSAALITRQAPLHRDLER

Query:  AEIAVLVGAVTMQLAQLTVQSTLRQRIIDAQSNDLSLVEKRGLAEAAQAVEFSISSD----------------------------------GSTKMYQDL
        AEIAV VGAVTMQLAQLTVQ TLRQRIIDAQSND  LVEKRGLAEA QA EFS+SSD                                  GSTKMYQDL
Subjt:  AEIAVLVGAVTMQLAQLTVQSTLRQRIIDAQSNDLSLVEKRGLAEAAQAVEFSISSD----------------------------------GSTKMYQDL

Query:  KRVYWWRNMKREVAEFVSKCLVCQLVKAPRQKPAGLLQPLSIPEWKWENVSMDFITRQPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTTSKWAQLYMSE
        KRVYWWRNMKREVAEFVSKCLVCQ VKAP QKPAGLLQPLSIPEWKWENVSMDFIT  PRTLRGF+VIWVVVDRLTKSAHFV GKSTYT SKWAQLYMSE
Subjt:  KRVYWWRNMKREVAEFVSKCLVCQLVKAPRQKPAGLLQPLSIPEWKWENVSMDFITRQPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTTSKWAQLYMSE

Query:  IVRLHGVPVSIVSDRDAHFTSNFG------------------------------------RVCRLLWARGSWDFHLHLMEFAYNNSYQATIGMAPFEALY
        IVRLHGVPVSIVSDRDA FTS F                                     R C L +  GSWD HLHLMEFAYNNSYQATIGMAPFEALY
Subjt:  IVRLHGVPVSIVSDRDAHFTSNFG------------------------------------RVCRLLWARGSWDFHLHLMEFAYNNSYQATIGMAPFEALY

Query:  DKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFDVGDKVFLKVAPIRGVLRFEKRGKLSPHFVGSFEILERIDPV
         KCC+SPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEF+VGDKVFLKVAP+RGVLRFE+RGKLSP FVG FEILERI P+
Subjt:  DKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFDVGDKVFLKVAPIRGVLRFEKRGKLSPHFVGSFEILERIDPV

Query:  AYRLALPPSLSTVHDVFHVSMLRKYMPDPSHVVDYEPLEIDENLSYIEQPVKVLTREVNMLRNREIPLVK
        AYRLALPPSLSTVHDVFHVSMLRKY+PDPSHVVDYEPLEIDENLSY EQPV+VL REV  LRN+EIPLVK
Subjt:  AYRLALPPSLSTVHDVFHVSMLRKYMPDPSHVVDYEPLEIDENLSYIEQPVKVLTREVNMLRNREIPLVK

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein2.0e-7224.74Show/hide
Query:  VEFAIELESGTVPIFRAPYRMAPVELKELKVQLQKVFDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRDLNKVTVKNRYPLSRIDDFFDQLQGATVFS
        +EF +EL      +    Y + P +++ +  ++ +    G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPL  I+    ++QG+T+F+
Subjt:  VEFAIELESGTVPIFRAPYRMAPVELKELKVQLQKVFDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRDLNKVTVKNRYPLSRIDDFFDQLQGATVFS

Query:  KIDFRSGYHQLRIKDGDVPKTAFRSRYEHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYFKTEAEHEEHLRMVLQTLRDNKLYAKFS
        K+D +S YH +R++ GD  K AFR     +E++VM +G++ AP  F   +N +  E  ++ V+ ++DDILI+ K+E+EH +H++ VLQ L++  L    +
Subjt:  KIDFRSGYHQLRIKDGDVPKTAFRSRYEHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYFKTEAEHEEHLRMVLQTLRDNKLYAKFS

Query:  KCEFWLKQVSFLGHVVSKAGVSVDPAKIEALTSWTRPSTVSKVRSFLGLAGYYRRFVENFSCIATPLPQLTRKGAPFVWS----------KACKAN----
        KCEF   QV F+G+ +S+ G +     I+ +  W +P    ++R FLG   Y R+F+   S +  PL  L +K   + W+          K C  +    
Subjt:  KCEFWLKQVSFLGHVVSKAGVSVDPAKIEALTSWTRPSTVSKVRSFLGLAGYYRRFVENFSCIATPLPQLTRKGAPFVWS----------KACKAN----

Query:  ------------------VVADALSRK--------VSHSAALITRQAPLHRDLERAEIAV--------------------------LVGAVTMQ------
                           V   LS+K        V + +A +++    +   ++  +A+                          L+G +T +      
Subjt:  ------------------VVADALSRK--------VSHSAALITRQAPLHRDLERAEIAV--------------------------LVGAVTMQ------

Query:  ---------------------------------------------------LAQLTVQSTLRQRIIDAQSNDLSLV-----EKRGLAEAAQA--------
                                                           + Q+++    + +++   +ND  L+     E + + E  Q         
Subjt:  ---------------------------------------------------LAQLTVQSTLRQRIIDAQSNDLSLV-----EKRGLAEAAQA--------

Query:  ---------------------VEFSISSDGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQLVKAPRQKPAGLLQPLSIPEWKWENVSMDFITRQPRTL
                              E  +   G   +   + R + W+ +++++ E+V  C  CQ+ K+   KP G LQP+   E  WE++SMDFIT  P + 
Subjt:  ---------------------VEFSISSDGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQLVKAPRQKPAGLLQPLSIPEWKWENVSMDFITRQPRTL

Query:  RGFTVIWVVVDRLTKSAHFVPGKSTYTTSKWAQLYMSEIVRLHGVPVSIVSDRDAHFTS----------NF-------------GRV------------C
         G+  ++VVVDR +K A  VP   + T  + A+++   ++   G P  I++D D  FTS          NF             G+             C
Subjt:  RGFTVIWVVVDRLTKSAHFVPGKSTYTTSKWAQLYMSEIVRLHGVPVSIVSDRDAHFTS----------NF-------------GRV------------C

Query:  RLLWARGSWDFHLHLMEFAYNNSYQATIGMAPFEALYD-KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDL-EFDVG
               +W  H+ L++ +YNN+  +   M PFE ++      SP+   E+        E  Q T +  Q ++  ++T   + K Y D++ +++ EF  G
Subjt:  RLLWARGSWDFHLHLMEFAYNNSYQATIGMAPFEALYD-KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDL-EFDVG

Query:  DKVFLKVAPIRGVLRFEKRGKLSPHFVGSFEILERIDPVAYRLALPPSLSTV-HDVFHVSMLRKY
        D V +K     G L   K  KL+P F G F +L++  P  Y L LP S+  +    FHVS L KY
Subjt:  DKVFLKVAPIRGVLRFEKRGKLSPHFVGSFEILERIDPVAYRLALPPSLSTV-HDVFHVSMLRKY

P0CT35 Transposon Tf2-2 polyprotein2.0e-7224.74Show/hide
Query:  VEFAIELESGTVPIFRAPYRMAPVELKELKVQLQKVFDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRDLNKVTVKNRYPLSRIDDFFDQLQGATVFS
        +EF +EL      +    Y + P +++ +  ++ +    G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPL  I+    ++QG+T+F+
Subjt:  VEFAIELESGTVPIFRAPYRMAPVELKELKVQLQKVFDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRDLNKVTVKNRYPLSRIDDFFDQLQGATVFS

Query:  KIDFRSGYHQLRIKDGDVPKTAFRSRYEHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYFKTEAEHEEHLRMVLQTLRDNKLYAKFS
        K+D +S YH +R++ GD  K AFR     +E++VM +G++ AP  F   +N +  E  ++ V+ ++DDILI+ K+E+EH +H++ VLQ L++  L    +
Subjt:  KIDFRSGYHQLRIKDGDVPKTAFRSRYEHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYFKTEAEHEEHLRMVLQTLRDNKLYAKFS

Query:  KCEFWLKQVSFLGHVVSKAGVSVDPAKIEALTSWTRPSTVSKVRSFLGLAGYYRRFVENFSCIATPLPQLTRKGAPFVWS----------KACKAN----
        KCEF   QV F+G+ +S+ G +     I+ +  W +P    ++R FLG   Y R+F+   S +  PL  L +K   + W+          K C  +    
Subjt:  KCEFWLKQVSFLGHVVSKAGVSVDPAKIEALTSWTRPSTVSKVRSFLGLAGYYRRFVENFSCIATPLPQLTRKGAPFVWS----------KACKAN----

Query:  ------------------VVADALSRK--------VSHSAALITRQAPLHRDLERAEIAV--------------------------LVGAVTMQ------
                           V   LS+K        V + +A +++    +   ++  +A+                          L+G +T +      
Subjt:  ------------------VVADALSRK--------VSHSAALITRQAPLHRDLERAEIAV--------------------------LVGAVTMQ------

Query:  ---------------------------------------------------LAQLTVQSTLRQRIIDAQSNDLSLV-----EKRGLAEAAQA--------
                                                           + Q+++    + +++   +ND  L+     E + + E  Q         
Subjt:  ---------------------------------------------------LAQLTVQSTLRQRIIDAQSNDLSLV-----EKRGLAEAAQA--------

Query:  ---------------------VEFSISSDGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQLVKAPRQKPAGLLQPLSIPEWKWENVSMDFITRQPRTL
                              E  +   G   +   + R + W+ +++++ E+V  C  CQ+ K+   KP G LQP+   E  WE++SMDFIT  P + 
Subjt:  ---------------------VEFSISSDGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQLVKAPRQKPAGLLQPLSIPEWKWENVSMDFITRQPRTL

Query:  RGFTVIWVVVDRLTKSAHFVPGKSTYTTSKWAQLYMSEIVRLHGVPVSIVSDRDAHFTS----------NF-------------GRV------------C
         G+  ++VVVDR +K A  VP   + T  + A+++   ++   G P  I++D D  FTS          NF             G+             C
Subjt:  RGFTVIWVVVDRLTKSAHFVPGKSTYTTSKWAQLYMSEIVRLHGVPVSIVSDRDAHFTS----------NF-------------GRV------------C

Query:  RLLWARGSWDFHLHLMEFAYNNSYQATIGMAPFEALYD-KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDL-EFDVG
               +W  H+ L++ +YNN+  +   M PFE ++      SP+   E+        E  Q T +  Q ++  ++T   + K Y D++ +++ EF  G
Subjt:  RLLWARGSWDFHLHLMEFAYNNSYQATIGMAPFEALYD-KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDL-EFDVG

Query:  DKVFLKVAPIRGVLRFEKRGKLSPHFVGSFEILERIDPVAYRLALPPSLSTV-HDVFHVSMLRKY
        D V +K     G L   K  KL+P F G F +L++  P  Y L LP S+  +    FHVS L KY
Subjt:  DKVFLKVAPIRGVLRFEKRGKLSPHFVGSFEILERIDPVAYRLALPPSLSTV-HDVFHVSMLRKY

P0CT36 Transposon Tf2-3 polyprotein2.0e-7224.74Show/hide
Query:  VEFAIELESGTVPIFRAPYRMAPVELKELKVQLQKVFDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRDLNKVTVKNRYPLSRIDDFFDQLQGATVFS
        +EF +EL      +    Y + P +++ +  ++ +    G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPL  I+    ++QG+T+F+
Subjt:  VEFAIELESGTVPIFRAPYRMAPVELKELKVQLQKVFDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRDLNKVTVKNRYPLSRIDDFFDQLQGATVFS

Query:  KIDFRSGYHQLRIKDGDVPKTAFRSRYEHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYFKTEAEHEEHLRMVLQTLRDNKLYAKFS
        K+D +S YH +R++ GD  K AFR     +E++VM +G++ AP  F   +N +  E  ++ V+ ++DDILI+ K+E+EH +H++ VLQ L++  L    +
Subjt:  KIDFRSGYHQLRIKDGDVPKTAFRSRYEHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYFKTEAEHEEHLRMVLQTLRDNKLYAKFS

Query:  KCEFWLKQVSFLGHVVSKAGVSVDPAKIEALTSWTRPSTVSKVRSFLGLAGYYRRFVENFSCIATPLPQLTRKGAPFVWS----------KACKAN----
        KCEF   QV F+G+ +S+ G +     I+ +  W +P    ++R FLG   Y R+F+   S +  PL  L +K   + W+          K C  +    
Subjt:  KCEFWLKQVSFLGHVVSKAGVSVDPAKIEALTSWTRPSTVSKVRSFLGLAGYYRRFVENFSCIATPLPQLTRKGAPFVWS----------KACKAN----

Query:  ------------------VVADALSRK--------VSHSAALITRQAPLHRDLERAEIAV--------------------------LVGAVTMQ------
                           V   LS+K        V + +A +++    +   ++  +A+                          L+G +T +      
Subjt:  ------------------VVADALSRK--------VSHSAALITRQAPLHRDLERAEIAV--------------------------LVGAVTMQ------

Query:  ---------------------------------------------------LAQLTVQSTLRQRIIDAQSNDLSLV-----EKRGLAEAAQA--------
                                                           + Q+++    + +++   +ND  L+     E + + E  Q         
Subjt:  ---------------------------------------------------LAQLTVQSTLRQRIIDAQSNDLSLV-----EKRGLAEAAQA--------

Query:  ---------------------VEFSISSDGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQLVKAPRQKPAGLLQPLSIPEWKWENVSMDFITRQPRTL
                              E  +   G   +   + R + W+ +++++ E+V  C  CQ+ K+   KP G LQP+   E  WE++SMDFIT  P + 
Subjt:  ---------------------VEFSISSDGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQLVKAPRQKPAGLLQPLSIPEWKWENVSMDFITRQPRTL

Query:  RGFTVIWVVVDRLTKSAHFVPGKSTYTTSKWAQLYMSEIVRLHGVPVSIVSDRDAHFTS----------NF-------------GRV------------C
         G+  ++VVVDR +K A  VP   + T  + A+++   ++   G P  I++D D  FTS          NF             G+             C
Subjt:  RGFTVIWVVVDRLTKSAHFVPGKSTYTTSKWAQLYMSEIVRLHGVPVSIVSDRDAHFTS----------NF-------------GRV------------C

Query:  RLLWARGSWDFHLHLMEFAYNNSYQATIGMAPFEALYD-KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDL-EFDVG
               +W  H+ L++ +YNN+  +   M PFE ++      SP+   E+        E  Q T +  Q ++  ++T   + K Y D++ +++ EF  G
Subjt:  RLLWARGSWDFHLHLMEFAYNNSYQATIGMAPFEALYD-KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDL-EFDVG

Query:  DKVFLKVAPIRGVLRFEKRGKLSPHFVGSFEILERIDPVAYRLALPPSLSTV-HDVFHVSMLRKY
        D V +K     G L   K  KL+P F G F +L++  P  Y L LP S+  +    FHVS L KY
Subjt:  DKVFLKVAPIRGVLRFEKRGKLSPHFVGSFEILERIDPVAYRLALPPSLSTV-HDVFHVSMLRKY

P0CT41 Transposon Tf2-12 polyprotein2.0e-7224.74Show/hide
Query:  VEFAIELESGTVPIFRAPYRMAPVELKELKVQLQKVFDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRDLNKVTVKNRYPLSRIDDFFDQLQGATVFS
        +EF +EL      +    Y + P +++ +  ++ +    G IR S +    PV+FV KK+G++R+ +DY+ LNK    N YPL  I+    ++QG+T+F+
Subjt:  VEFAIELESGTVPIFRAPYRMAPVELKELKVQLQKVFDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRDLNKVTVKNRYPLSRIDDFFDQLQGATVFS

Query:  KIDFRSGYHQLRIKDGDVPKTAFRSRYEHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYFKTEAEHEEHLRMVLQTLRDNKLYAKFS
        K+D +S YH +R++ GD  K AFR     +E++VM +G++ AP  F   +N +  E  ++ V+ ++DDILI+ K+E+EH +H++ VLQ L++  L    +
Subjt:  KIDFRSGYHQLRIKDGDVPKTAFRSRYEHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYFKTEAEHEEHLRMVLQTLRDNKLYAKFS

Query:  KCEFWLKQVSFLGHVVSKAGVSVDPAKIEALTSWTRPSTVSKVRSFLGLAGYYRRFVENFSCIATPLPQLTRKGAPFVWS----------KACKAN----
        KCEF   QV F+G+ +S+ G +     I+ +  W +P    ++R FLG   Y R+F+   S +  PL  L +K   + W+          K C  +    
Subjt:  KCEFWLKQVSFLGHVVSKAGVSVDPAKIEALTSWTRPSTVSKVRSFLGLAGYYRRFVENFSCIATPLPQLTRKGAPFVWS----------KACKAN----

Query:  ------------------VVADALSRK--------VSHSAALITRQAPLHRDLERAEIAV--------------------------LVGAVTMQ------
                           V   LS+K        V + +A +++    +   ++  +A+                          L+G +T +      
Subjt:  ------------------VVADALSRK--------VSHSAALITRQAPLHRDLERAEIAV--------------------------LVGAVTMQ------

Query:  ---------------------------------------------------LAQLTVQSTLRQRIIDAQSNDLSLV-----EKRGLAEAAQA--------
                                                           + Q+++    + +++   +ND  L+     E + + E  Q         
Subjt:  ---------------------------------------------------LAQLTVQSTLRQRIIDAQSNDLSLV-----EKRGLAEAAQA--------

Query:  ---------------------VEFSISSDGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQLVKAPRQKPAGLLQPLSIPEWKWENVSMDFITRQPRTL
                              E  +   G   +   + R + W+ +++++ E+V  C  CQ+ K+   KP G LQP+   E  WE++SMDFIT  P + 
Subjt:  ---------------------VEFSISSDGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQLVKAPRQKPAGLLQPLSIPEWKWENVSMDFITRQPRTL

Query:  RGFTVIWVVVDRLTKSAHFVPGKSTYTTSKWAQLYMSEIVRLHGVPVSIVSDRDAHFTS----------NF-------------GRV------------C
         G+  ++VVVDR +K A  VP   + T  + A+++   ++   G P  I++D D  FTS          NF             G+             C
Subjt:  RGFTVIWVVVDRLTKSAHFVPGKSTYTTSKWAQLYMSEIVRLHGVPVSIVSDRDAHFTS----------NF-------------GRV------------C

Query:  RLLWARGSWDFHLHLMEFAYNNSYQATIGMAPFEALYD-KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDL-EFDVG
               +W  H+ L++ +YNN+  +   M PFE ++      SP+   E+        E  Q T +  Q ++  ++T   + K Y D++ +++ EF  G
Subjt:  RLLWARGSWDFHLHLMEFAYNNSYQATIGMAPFEALYD-KCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDL-EFDVG

Query:  DKVFLKVAPIRGVLRFEKRGKLSPHFVGSFEILERIDPVAYRLALPPSLSTV-HDVFHVSMLRKY
        D V +K     G L   K  KL+P F G F +L++  P  Y L LP S+  +    FHVS L KY
Subjt:  DKVFLKVAPIRGVLRFEKRGKLSPHFVGSFEILERIDPVAYRLALPPSLSTV-HDVFHVSMLRKY

Q99315 Transposon Ty3-G Gag-Pol polyprotein4.1e-7325.66Show/hide
Query:  YPDIFPEELPGLP---SHREVEFAIELESGTVPIFRAPYRMAPVELKELKVQLQKVFDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRDLNKVTVKNR
        Y +I   +LP  P   ++  V+  IE++ G       PY +     +E+   +QK+ D  FI PS SP  +PV+ V KKDG+ RLC+DYR LNK T+ + 
Subjt:  YPDIFPEELPGLP---SHREVEFAIELESGTVPIFRAPYRMAPVELKELKVQLQKVFDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRDLNKVTVKNR

Query:  YPLSRIDDFFDQLQGATVFSKIDFRSGYHQLRIKDGDVPKTAFRSRYEHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYFKTEAEHE
        +PL RID+   ++  A +F+ +D  SGYHQ+ ++  D  KTAF +    YE+ VM FGL NAP+ F   M   FR+    FV V++DDILI+ ++  EH 
Subjt:  YPLSRIDDFFDQLQGATVFSKIDFRSGYHQLRIKDGDVPKTAFRSRYEHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYFKTEAEHE

Query:  EHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEALTSWTRPSTVSKVRSFLGLAGYYRRFVENFSCIATPL-------PQLTRK
        +HL  VL+ L++  L  K  KC+F  ++  FLG+ +    ++    K  A+  +  P TV + + FLG+  YYRRF+ N S IA P+        Q T K
Subjt:  EHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEALTSWTRPSTVSKVRSFLGLAGYYRRFVENFSCIATPL-------PQLTRK

Query:  ----------------------------------------------------GAPFVWSKACKA------------------------------------
                                                            G    +SK+ ++                                    
Subjt:  ----------------------------------------------------GAPFVWSKACKA------------------------------------

Query:  --------------------------------------NVVADALSRKVSHSAALITRQAPLHRDLERAEIAVLVGAVTMQLAQLTVQSTLRQRIIDAQS
                                              NVVADA+SR V       +R           +   L  AV + + +LT  +   + +   +S
Subjt:  --------------------------------------NVVADALSRKVSHSAALITRQAPLHRDLERAEIAVLVGAVTMQLAQLTVQSTLRQRIIDAQS

Query:  N----DLSLVEKRGLAEAAQAVEFS-------ISSDGSTKMYQD----------------LKRVYWWRNMKREVAEFVSKCLVCQLVKAPRQKPAGLLQP
             +LS   ++  +   + + +           +   ++Y D                +  +Y+W  ++  + +++  C+ CQL+K+ R +  GLLQP
Subjt:  N----DLSLVEKRGLAEAAQAVEFS-------ISSDGSTKMYQD----------------LKRVYWWRNMKREVAEFVSKCLVCQLVKAPRQKPAGLLQP

Query:  LSIPEWKWENVSMDFITRQPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTTSKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSN-----------------
        L I E +W ++SMDF+T  P T     +I VVVDR +K AHF+  + T   ++   L    I   HG P +I SDRD   T++                 
Subjt:  LSIPEWKWENVSMDFITRQPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTTSKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSN-----------------

Query:  -------------FGRVCRLLWARGS-----WDFHLHLMEFAYNNSYQATIGMAPFEALYDKCCRSPVCWG--EVGEQRLMGPELVQSTNEAIQKIRSRM
                        + RLL A  S     W  +L  +EF YN++   T+G +PFE        +P      EV  +     EL +       + + ++
Subjt:  -------------FGRVCRLLWARGS-----WDFHLHLMEFAYNNSYQATIGMAPFEALYDKCCRSPVCWG--EVGEQRLMGPELVQSTNEAIQKIRSRM

Query:  HTAQSRQKSYADVRRKDLEFDVGDKVFLKVAPIRGVLRFEKRG--KLSPHFVGSFEILERIDPVAYRLALPPSLSTVHDVFHVSMLRKYMPDPSHVVDYE
          AQ   ++  + RRK L  ++GD V      +     F+K    K+   +VG F ++++I+  AY L L  S    H V +V  L+K++  P      +
Subjt:  HTAQSRQKSYADVRRKDLEFDVGDKVFLKVAPIRGVLRFEKRG--KLSPHFVGSFEILERIDPVAYRLALPPSLSTVHDVFHVSMLRKYMPDPSHVVDYE

Query:  PLEIDENL
        P+   E +
Subjt:  PLEIDENL

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein4.2e-2046.88Show/hide
Query:  HLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLG--HVVSKAGVSVDPAKIEALTSWTRPSTVSKVRSFLGLAGYYRRFVENFSCIATPLPQLTRKGA
        HL MVLQ    ++ YA   KC F   Q+++LG  H++S  GVS DPAK+EA+  W  P   +++R FLGL GYYRRFV+N+  I  PL +L +K +
Subjt:  HLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLG--HVVSKAGVSVDPAKIEALTSWTRPSTVSKVRSFLGLAGYYRRFVENFSCIATPLPQLTRKGA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTCCGTCGCCGTCTCCGTCGTCGGTGGTTTCACAGCTCCAACTTTTCGTCGTCGTGGCTGTACCTCCATTTCCAATCTCCCTCTCGGTCTCACGCAAGGACGCCGC
CGTGCGTCCTTCGTCCGACCGTGTCTGCCGACGTCCAGCTGCCTTCCGCCGCGCCGTTCGCCAAGCAGCCGTCGGCGAACCTCCAACGGTCGTGGACCTCCTATCTTGCG
AGCCGAGTCAAGCCGAGCCGAGCCGCGAGCCGAGTCAAGCCAAGTTGAGCCGTATTAGGGAGTTGACACTCAAATGCTCGGGTTTCGTGGCAGACTTATTTGGAGATAGT
TTTCCTTTTCTTGGGTGGATCCGTTGCGAGTGGACTCGAACGAGGGGCATAACCAAGTATCAGGTAAGGGTACAGCGAGGGGCATACCGACGAGAGGCAGGAAGGATGCG
TGAAGGCCATATGGACGCGTCTGGTTTTCTTTATGCTTCCGCTGATGGATCATTGGTGGTTGTTAGGGAAATGCCGCCAAGGAGAGGTGCACGTAGGGGTGGCCGGGGAG
GCCGAGGAAGGGGAGCAGGACGTGTTCAACCTGAGGTGCAGCCTGTAGCCCATGCCACTGACCCGAATGCGCCACAGCAGCCTGCCCCGCCAGCTCCAGCTCCAGCTCCA
GTTCCAGCTCTAGTTGTGCCCCAGATCGTGCCGGATCAGTTGTCGGCAGAGGCAAGGCACTTGAGGGATTTCAGGAACTATAACCCCTCGACATTCGATGGGTCCTTGGA
GGACGCCACCAGGGCTCAGCTGTGGTTATCGTCTTTAAAGACCATATTTCGATATATGAAGTGCCCTAAGGATCAGAAAGTTCAGTGTGCTGTTTTCATGTTGACAGACA
GAGGTACTGCATGGTGGGAGACTACAGAGAGAATGCTAGAGGGTGATGAGTTCCTGAACTTAGAGCAGGGTGACATGACAGTGGAGCAGTATGATGCGGAGTTTGACATG
TTATCCCGCTTCACTTCCGAGATGATAGCGATTGAGACGGCCAGAGCTGATAAGTTTGTTAGAGGCCTCAGACTGGACATCCAGGGTTTGGTTCGAGCTTTCCGACCCGC
CACTCATGTCGATGCACTGCGCCTGGCAGTGGATCTCAGTTTACAGGAGAAGGCTAACTCGTCTAAGGTCGCAGGTAGAGGTTCGACCTCGGGACAGAAAAGGAAGACTG
AGCAGCAGCCTATTCCAGTGCCACAACGGAACTTCAGACCAGGTGCTGACAGATGCCCGATGAGACTTACGGGGAATGCGCAGAATCAGGGAGAAGGTGCTCCACATCAG
GGTAAAGTCTTTGCTACCAACAAGACTGAGGCTGAGAGGGCAGCCACAGTGGTGACAGGTACGCTTCCAGTATTGGGGCATTACGCCTTAGTTTTGTTTGATTCGGGTTC
GTCGCATTCTTTTATCTCTTCTGTATTTGTGTTGCATGTCCGCTTAGAGGTAGAGCCCCTACACCATGTTTTATCAGTGTCTACTCCTTCTGAGGAGTGTATGTTGGCGA
AAGAAAAGGTGAAAGCATGCCAGATTGAGATAGCAGGCCATGTGATTGAAGTAACGTTGTTGGTCCTGGACATGCTCGACTTTGATGTAATTCTGGGTATGGATTGGTTG
GCCGCTAACCATGCCAGCATAGATTGTTCCCGTAAGGAGGTAGCGTTTAACCCTCCCTCGATGGCCAGTTTGAAATTTAAGGGAGAAGGGTCAAGGTCGTTGCCTCAGGT
AATCTCAGCCATCAGAACCAGTAAACTGCTCAGTCAGGGTACTTGGGGTATCTTAGCGAGCGTGGTGGATACTAGAGAGGTCGATGTATCCCTGTCATCAGAACCAGTGG
TGGGGGACTATCCGGATATCTTTCCTGAAGAACTTCCAGGGTTACCTTCTCACAGAGAGGTTGAGTTTGCCATAGAGTTGGAGTCGGGCACGGTTCCTATATTCAGAGCC
CCATACAGAATGGCCCCAGTAGAGTTGAAAGAACTGAAAGTGCAGCTACAGAAAGTGTTCGATAAGGGATTCATTCGACCGAGTGTGTCACCTTGGGGTGCGCCAGTTTT
ATTTGTTAAGAAGAAGGATGGATCGATGCGTCTATGCATTGACTATAGGGATTTGAATAAGGTAACCGTTAAGAACAGATATCCCTTGTCCAGGATCGATGATTTTTTTG
ACCAGTTACAGGGAGCTACAGTGTTCTCTAAGATTGATTTTCGGTCGGGATATCATCAGCTGAGGATTAAGGATGGTGATGTACCGAAGACAGCATTTCGTTCCAGATAC
GAACACTATGAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGACAGTGTTTATGGACTTGATGAACAGAGTGTTTAGGGAGTTCCTAGACACTTTTGTGATCGT
GTTTATTGATGACATTTTGATATATTTCAAGACAGAGGCCGAGCATGAGGAGCATTTACGTATGGTTTTGCAAACACTTCGGGATAATAAATTGTATGCAAAGTTCTCGA
AATGCGAGTTTTGGCTAAAGCAAGTGTCCTTTCTAGGCCATGTGGTTTCTAAGGCTGGAGTTTCTGTAGATCCAGCTAAGATAGAGGCACTCACCAGTTGGACCCGACCT
TCCACAGTCAGTAAGGTTCGTAGCTTCCTGGGCTTAGCAGGTTATTATCGACGGTTTGTGGAGAACTTTTCCTGTATAGCTACTCCTCTTCCTCAGTTGACCAGAAAGGG
AGCTCCTTTTGTTTGGAGCAAAGCATGCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGAAAGGTATCACATTCAGCAGCACTTATTACCCGACAGGCCCCATTGCATC
GAGATCTTGAGCGGGCTGAGATTGCAGTGTTAGTGGGGGCAGTCACTATGCAGTTAGCCCAGTTGACAGTACAGTCGACTTTGAGGCAAAGGATCATTGATGCTCAGAGT
AACGATCTTTCTTTGGTTGAGAAGCGTGGCCTAGCAGAGGCAGCGCAAGCGGTTGAGTTCTCCATATCCTCTGATGGTAGTACGAAGATGTATCAAGACCTAAAACGGGT
TTATTGGTGGCGTAATATGAAGAGAGAGGTGGCGGAATTTGTTAGTAAATGCTTGGTGTGTCAGCTGGTTAAAGCACCAAGGCAGAAACCAGCGGGTTTATTACAACCCT
TGAGCATACCGGAATGGAAATGGGAAAACGTGTCCATGGATTTCATTACAAGACAGCCAAGAACTCTGAGGGGTTTTACAGTGATTTGGGTTGTGGTGGACAGACTTACC
AAATCAGCGCACTTTGTTCCGGGTAAATCCACCTATACCACTAGTAAATGGGCACAGTTGTACATGTCTGAGATAGTAAGACTACATGGAGTGCCGGTGTCGATTGTTTC
TGATAGAGATGCCCATTTCACTTCCAATTTTGGAAGGGTTTGCAGACTACTATGGGCACGAGGTAGCTGGGACTTCCACTTGCATTTGATGGAATTTGCTTATAATAACA
GTTATCAGGCTACTATTGGCATGGCACCATTTGAGGCCTTGTACGACAAATGTTGTAGATCTCCTGTTTGCTGGGGTGAGGTGGGTGAGCAGAGATTGATGGGTCCTGAG
TTAGTTCAGTCTACTAACGAAGCGATACAGAAGATTAGATCACGTATGCATACCGCACAGAGTAGGCAGAAGAGTTATGCAGATGTGAGACGAAAGGATCTTGAGTTTGA
TGTAGGGGACAAGGTGTTCTTAAAGGTAGCACCTATAAGAGGTGTCTTACGATTTGAAAAGAGAGGAAAGCTGAGTCCCCATTTTGTTGGGTCGTTTGAGATTCTGGAGC
GGATTGACCCTGTAGCTTATCGCTTGGCGTTGCCACCATCACTCTCGACAGTTCATGATGTGTTTCATGTTTCTATGTTGAGGAAGTACATGCCAGATCCATCCCATGTA
GTGGATTACGAGCCACTAGAGATTGATGAAAACTTGAGCTATATTGAACAACCCGTTAAGGTGCTGACTAGAGAGGTGAATATGTTGAGGAATAGAGAAATTCCTTTGGT
CAAGTCTTATGGCGGAATCACCGGGTGGAAGAGGCTACATGGGAGCGAGAAGATGACATGA
mRNA sequenceShow/hide mRNA sequence
ATGTGTCCGTCGCCGTCTCCGTCGTCGGTGGTTTCACAGCTCCAACTTTTCGTCGTCGTGGCTGTACCTCCATTTCCAATCTCCCTCTCGGTCTCACGCAAGGACGCCGC
CGTGCGTCCTTCGTCCGACCGTGTCTGCCGACGTCCAGCTGCCTTCCGCCGCGCCGTTCGCCAAGCAGCCGTCGGCGAACCTCCAACGGTCGTGGACCTCCTATCTTGCG
AGCCGAGTCAAGCCGAGCCGAGCCGCGAGCCGAGTCAAGCCAAGTTGAGCCGTATTAGGGAGTTGACACTCAAATGCTCGGGTTTCGTGGCAGACTTATTTGGAGATAGT
TTTCCTTTTCTTGGGTGGATCCGTTGCGAGTGGACTCGAACGAGGGGCATAACCAAGTATCAGGTAAGGGTACAGCGAGGGGCATACCGACGAGAGGCAGGAAGGATGCG
TGAAGGCCATATGGACGCGTCTGGTTTTCTTTATGCTTCCGCTGATGGATCATTGGTGGTTGTTAGGGAAATGCCGCCAAGGAGAGGTGCACGTAGGGGTGGCCGGGGAG
GCCGAGGAAGGGGAGCAGGACGTGTTCAACCTGAGGTGCAGCCTGTAGCCCATGCCACTGACCCGAATGCGCCACAGCAGCCTGCCCCGCCAGCTCCAGCTCCAGCTCCA
GTTCCAGCTCTAGTTGTGCCCCAGATCGTGCCGGATCAGTTGTCGGCAGAGGCAAGGCACTTGAGGGATTTCAGGAACTATAACCCCTCGACATTCGATGGGTCCTTGGA
GGACGCCACCAGGGCTCAGCTGTGGTTATCGTCTTTAAAGACCATATTTCGATATATGAAGTGCCCTAAGGATCAGAAAGTTCAGTGTGCTGTTTTCATGTTGACAGACA
GAGGTACTGCATGGTGGGAGACTACAGAGAGAATGCTAGAGGGTGATGAGTTCCTGAACTTAGAGCAGGGTGACATGACAGTGGAGCAGTATGATGCGGAGTTTGACATG
TTATCCCGCTTCACTTCCGAGATGATAGCGATTGAGACGGCCAGAGCTGATAAGTTTGTTAGAGGCCTCAGACTGGACATCCAGGGTTTGGTTCGAGCTTTCCGACCCGC
CACTCATGTCGATGCACTGCGCCTGGCAGTGGATCTCAGTTTACAGGAGAAGGCTAACTCGTCTAAGGTCGCAGGTAGAGGTTCGACCTCGGGACAGAAAAGGAAGACTG
AGCAGCAGCCTATTCCAGTGCCACAACGGAACTTCAGACCAGGTGCTGACAGATGCCCGATGAGACTTACGGGGAATGCGCAGAATCAGGGAGAAGGTGCTCCACATCAG
GGTAAAGTCTTTGCTACCAACAAGACTGAGGCTGAGAGGGCAGCCACAGTGGTGACAGGTACGCTTCCAGTATTGGGGCATTACGCCTTAGTTTTGTTTGATTCGGGTTC
GTCGCATTCTTTTATCTCTTCTGTATTTGTGTTGCATGTCCGCTTAGAGGTAGAGCCCCTACACCATGTTTTATCAGTGTCTACTCCTTCTGAGGAGTGTATGTTGGCGA
AAGAAAAGGTGAAAGCATGCCAGATTGAGATAGCAGGCCATGTGATTGAAGTAACGTTGTTGGTCCTGGACATGCTCGACTTTGATGTAATTCTGGGTATGGATTGGTTG
GCCGCTAACCATGCCAGCATAGATTGTTCCCGTAAGGAGGTAGCGTTTAACCCTCCCTCGATGGCCAGTTTGAAATTTAAGGGAGAAGGGTCAAGGTCGTTGCCTCAGGT
AATCTCAGCCATCAGAACCAGTAAACTGCTCAGTCAGGGTACTTGGGGTATCTTAGCGAGCGTGGTGGATACTAGAGAGGTCGATGTATCCCTGTCATCAGAACCAGTGG
TGGGGGACTATCCGGATATCTTTCCTGAAGAACTTCCAGGGTTACCTTCTCACAGAGAGGTTGAGTTTGCCATAGAGTTGGAGTCGGGCACGGTTCCTATATTCAGAGCC
CCATACAGAATGGCCCCAGTAGAGTTGAAAGAACTGAAAGTGCAGCTACAGAAAGTGTTCGATAAGGGATTCATTCGACCGAGTGTGTCACCTTGGGGTGCGCCAGTTTT
ATTTGTTAAGAAGAAGGATGGATCGATGCGTCTATGCATTGACTATAGGGATTTGAATAAGGTAACCGTTAAGAACAGATATCCCTTGTCCAGGATCGATGATTTTTTTG
ACCAGTTACAGGGAGCTACAGTGTTCTCTAAGATTGATTTTCGGTCGGGATATCATCAGCTGAGGATTAAGGATGGTGATGTACCGAAGACAGCATTTCGTTCCAGATAC
GAACACTATGAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCGACAGTGTTTATGGACTTGATGAACAGAGTGTTTAGGGAGTTCCTAGACACTTTTGTGATCGT
GTTTATTGATGACATTTTGATATATTTCAAGACAGAGGCCGAGCATGAGGAGCATTTACGTATGGTTTTGCAAACACTTCGGGATAATAAATTGTATGCAAAGTTCTCGA
AATGCGAGTTTTGGCTAAAGCAAGTGTCCTTTCTAGGCCATGTGGTTTCTAAGGCTGGAGTTTCTGTAGATCCAGCTAAGATAGAGGCACTCACCAGTTGGACCCGACCT
TCCACAGTCAGTAAGGTTCGTAGCTTCCTGGGCTTAGCAGGTTATTATCGACGGTTTGTGGAGAACTTTTCCTGTATAGCTACTCCTCTTCCTCAGTTGACCAGAAAGGG
AGCTCCTTTTGTTTGGAGCAAAGCATGCAAGGCAAATGTGGTAGCTGATGCTCTTAGTAGAAAGGTATCACATTCAGCAGCACTTATTACCCGACAGGCCCCATTGCATC
GAGATCTTGAGCGGGCTGAGATTGCAGTGTTAGTGGGGGCAGTCACTATGCAGTTAGCCCAGTTGACAGTACAGTCGACTTTGAGGCAAAGGATCATTGATGCTCAGAGT
AACGATCTTTCTTTGGTTGAGAAGCGTGGCCTAGCAGAGGCAGCGCAAGCGGTTGAGTTCTCCATATCCTCTGATGGTAGTACGAAGATGTATCAAGACCTAAAACGGGT
TTATTGGTGGCGTAATATGAAGAGAGAGGTGGCGGAATTTGTTAGTAAATGCTTGGTGTGTCAGCTGGTTAAAGCACCAAGGCAGAAACCAGCGGGTTTATTACAACCCT
TGAGCATACCGGAATGGAAATGGGAAAACGTGTCCATGGATTTCATTACAAGACAGCCAAGAACTCTGAGGGGTTTTACAGTGATTTGGGTTGTGGTGGACAGACTTACC
AAATCAGCGCACTTTGTTCCGGGTAAATCCACCTATACCACTAGTAAATGGGCACAGTTGTACATGTCTGAGATAGTAAGACTACATGGAGTGCCGGTGTCGATTGTTTC
TGATAGAGATGCCCATTTCACTTCCAATTTTGGAAGGGTTTGCAGACTACTATGGGCACGAGGTAGCTGGGACTTCCACTTGCATTTGATGGAATTTGCTTATAATAACA
GTTATCAGGCTACTATTGGCATGGCACCATTTGAGGCCTTGTACGACAAATGTTGTAGATCTCCTGTTTGCTGGGGTGAGGTGGGTGAGCAGAGATTGATGGGTCCTGAG
TTAGTTCAGTCTACTAACGAAGCGATACAGAAGATTAGATCACGTATGCATACCGCACAGAGTAGGCAGAAGAGTTATGCAGATGTGAGACGAAAGGATCTTGAGTTTGA
TGTAGGGGACAAGGTGTTCTTAAAGGTAGCACCTATAAGAGGTGTCTTACGATTTGAAAAGAGAGGAAAGCTGAGTCCCCATTTTGTTGGGTCGTTTGAGATTCTGGAGC
GGATTGACCCTGTAGCTTATCGCTTGGCGTTGCCACCATCACTCTCGACAGTTCATGATGTGTTTCATGTTTCTATGTTGAGGAAGTACATGCCAGATCCATCCCATGTA
GTGGATTACGAGCCACTAGAGATTGATGAAAACTTGAGCTATATTGAACAACCCGTTAAGGTGCTGACTAGAGAGGTGAATATGTTGAGGAATAGAGAAATTCCTTTGGT
CAAGTCTTATGGCGGAATCACCGGGTGGAAGAGGCTACATGGGAGCGAGAAGATGACATGA
Protein sequenceShow/hide protein sequence
MCPSPSPSSVVSQLQLFVVVAVPPFPISLSVSRKDAAVRPSSDRVCRRPAAFRRAVRQAAVGEPPTVVDLLSCEPSQAEPSREPSQAKLSRIRELTLKCSGFVADLFGDS
FPFLGWIRCEWTRTRGITKYQVRVQRGAYRREAGRMREGHMDASGFLYASADGSLVVVREMPPRRGARRGGRGGRGRGAGRVQPEVQPVAHATDPNAPQQPAPPAPAPAP
VPALVVPQIVPDQLSAEARHLRDFRNYNPSTFDGSLEDATRAQLWLSSLKTIFRYMKCPKDQKVQCAVFMLTDRGTAWWETTERMLEGDEFLNLEQGDMTVEQYDAEFDM
LSRFTSEMIAIETARADKFVRGLRLDIQGLVRAFRPATHVDALRLAVDLSLQEKANSSKVAGRGSTSGQKRKTEQQPIPVPQRNFRPGADRCPMRLTGNAQNQGEGAPHQ
GKVFATNKTEAERAATVVTGTLPVLGHYALVLFDSGSSHSFISSVFVLHVRLEVEPLHHVLSVSTPSEECMLAKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWL
AANHASIDCSRKEVAFNPPSMASLKFKGEGSRSLPQVISAIRTSKLLSQGTWGILASVVDTREVDVSLSSEPVVGDYPDIFPEELPGLPSHREVEFAIELESGTVPIFRA
PYRMAPVELKELKVQLQKVFDKGFIRPSVSPWGAPVLFVKKKDGSMRLCIDYRDLNKVTVKNRYPLSRIDDFFDQLQGATVFSKIDFRSGYHQLRIKDGDVPKTAFRSRY
EHYEFIVMSFGLTNAPTVFMDLMNRVFREFLDTFVIVFIDDILIYFKTEAEHEEHLRMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEALTSWTRP
STVSKVRSFLGLAGYYRRFVENFSCIATPLPQLTRKGAPFVWSKACKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVLVGAVTMQLAQLTVQSTLRQRIIDAQS
NDLSLVEKRGLAEAAQAVEFSISSDGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQLVKAPRQKPAGLLQPLSIPEWKWENVSMDFITRQPRTLRGFTVIWVVVDRLT
KSAHFVPGKSTYTTSKWAQLYMSEIVRLHGVPVSIVSDRDAHFTSNFGRVCRLLWARGSWDFHLHLMEFAYNNSYQATIGMAPFEALYDKCCRSPVCWGEVGEQRLMGPE
LVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFDVGDKVFLKVAPIRGVLRFEKRGKLSPHFVGSFEILERIDPVAYRLALPPSLSTVHDVFHVSMLRKYMPDPSHV
VDYEPLEIDENLSYIEQPVKVLTREVNMLRNREIPLVKSYGGITGWKRLHGSEKMT