; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0011239 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0011239
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionReverse transcriptase
Genome locationchr08:30273648..30278172
RNA-Seq ExpressionIVF0011239
SyntenyIVF0011239
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0004518 - nuclease activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR043502 - DNA/RNA polymerase superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR041588 - Integrase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR012337 - Ribonuclease H-like superfamily
IPR001969 - Aspartic peptidase, active site
IPR001878 - Zinc finger, CCHC-type
IPR000477 - Reverse transcriptase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040689.1 pol protein [Cucumis melo var. makuwa]0.069.17Show/hide
Query:  MPPRRGARRGGREGRGRGAGRVQLEVQPVAQATNPAVPVTHADLAAMEQRFRDLIMQMREQQQPAPPTPVPVPV----------------VLDQLSVEAK
        MPPRRGARRGGR GRGRGAGRVQ EVQPVAQA +PA PVTHADLAAMEQRFRDLIMQMREQQ+PA PTP P P                 V DQLS EAK
Subjt:  MPPRRGARRGGREGRGRGAGRVQLEVQPVAQATNPAVPVTHADLAAMEQRFRDLIMQMREQQQPAPPTPVPVPV----------------VLDQLSVEAK

Query:  HLRDFRKYNPTTFDRSLEDPTRAQIWLSYLETIFRYMKCPEDQKVQCAVFMLTDR---------------------------------------------
        HLRDFRKYNPTTFD SLEDPTRAQ+WLS LETIFRYMKCPEDQKVQCAVFMLTDR                                             
Subjt:  HLRDFRKYNPTTFDRSLEDPTRAQIWLSYLETIFRYMKCPEDQKVQCAVFMLTDR---------------------------------------------

Query:  ---------VEQYDAEFDMLCRFVPEMIATEAARADKFVRGLRLDIQGW---FEPS---DPSLMPIHCAWQ-------------WISVYRRGLTRP----
                 VEQYDAEFDML RF PEMIATEAARADKFVRGLRLDIQG    F P+   D   + +  + Q                  R+   +P    
Subjt:  ---------VEQYDAEFDMLCRFVPEMIATEAARADKFVRGLRLDIQGW---FEPS---DPSLMPIHCAWQ-------------WISVYRRGLTRP----

Query:  ----RSGGESRRFQQKPFEAEETARGKPLCTTCGKHHLGRCLFGTKTCFKCRQEGHTADRCSMRLTGNVQNQGACALHQVKVFATNNTEAERAGMVVTGT
            RSGGE RRFQQKPFEA E ARGKPLCTTCGKHHLGRCLFGT+TCFKCRQEGHTADRC +RLTGN QNQ A A HQ +VFATN TEAE+AG VVTGT
Subjt:  ----RSGGESRRFQQKPFEAEETARGKPLCTTCGKHHLGRCLFGTKTCFKCRQEGHTADRCSMRLTGNVQNQGACALHQVKVFATNNTEAERAGMVVTGT

Query:  LPVLGHYDLVLFDSGSSHSFISSAFVLHARLEVEPLHHVLSVSTPSGECMWSKGKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHARIDCSR
        LPVLGHY LVLFDSGSSHSFIS AFVLHARLEVEPLHHVLSVSTPSGECM SK K+KACQIEIAGHVIEVTL+VLDMLDF+VILG              R
Subjt:  LPVLGHYDLVLFDSGSSHSFISSAFVLHARLEVEPLHHVLSVSTPSGECMWSKGKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHARIDCSR

Query:  KEVAFNPPSLASFKFKGEGSRPLSKVISAMRASKLLSQGTWSILASMVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREIEFAIELEPGTVTISRAP
         +V                   +++VISA+RASKLLSQGT  ILAS+VDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHRE+EFAIELEPGTV ISRAP
Subjt:  KEVAFNPPSLASFKFKGEGSRPLSKVISAMRASKLLSQGTWSILASMVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREIEFAIELEPGTVTISRAP

Query:  YRMAPAELKELKVQLQELLDKGFIRPSMSPWGAPVLFVKKKDGSMRLCIDHRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDV
        YRMAPAELKELKVQLQELLDKGFIRPSMSPWGAPVLFVKKKDGSMRLCID+RELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DV
Subjt:  YRMAPAELKELKVQLQELLDKGFIRPSMSPWGAPVLFVKKKDGSMRLCIDHRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDV

Query:  PKTTFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDIFVIVFIDDILIYSKTEAEHEEHLHMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSK
        PKT FRS+YGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLD FVIVFIDDILIYSKTEAEHEEHL MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSK
Subjt:  PKTTFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDIFVIVFIDDILIYSKTEAEHEEHLHMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSK

Query:  VGVSVNPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKRAPFVWSKAC-----------------------------------
         GVSV+P KIEAVT W RPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRK APFVWSKAC                                   
Subjt:  VGVSVNPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKRAPFVWSKAC-----------------------------------

Query:  ----------------------------------------------------------------------------------------------KANVVA
                                                                                                      KANVVA
Subjt:  ----------------------------------------------------------------------------------------------KANVVA

Query:  DALSRKVSHSAALITRQAPLHRDLERAEISVSVGAVTMQLAQLTVQPTLR--IIDAQSNNPYLVDKRGLAKAWQAVEFSISYDGGLLFERRLCVPSDSAV
        DALSRKVSHSAALITRQAPLHRDLERAEI+VSVGAVTMQLAQLTVQPTLR  IIDAQSN+PYLV+KRGLA+A QAVEFSIS DGGLLFERRLCVPSDSA+
Subjt:  DALSRKVSHSAALITRQAPLHRDLERAEISVSVGAVTMQLAQLTVQPTLR--IIDAQSNNPYLVDKRGLAKAWQAVEFSISYDGGLLFERRLCVPSDSAV

Query:  KTKLLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQA------------------------------------------------
        KT+LLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS+CLVCQQ                                                 
Subjt:  KTKLLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQA------------------------------------------------

Query:  ---------------------------------AMDTRLDFSTAFHTQTDGQTERLNQVLETMLRACALEFPGSWDSYLHLMEFAYNNSFQATISMAPFE
                                         AM TRLDFSTAFH QT+GQTERLNQVLE MLRACALEFPGSWDS+LHLMEFAYNNS+QATI MAPFE
Subjt:  ---------------------------------AMDTRLDFSTAFHTQTDGQTERLNQVLETMLRACALEFPGSWDSYLHLMEFAYNNSFQATISMAPFE

Query:  ALYGKCFRSPVCWGEVGEQRLMGHELVQSTNEAIQKIRSRMQTAQSRQKSYADVRRKDLEFDVWDKVFLKVAPMKGVLRFERRGKLSPCFVGPFEILERI
        ALYGKC RSPVCWGEVGEQRLMG ELV+STNEAIQKIRSRM TAQSRQKSYADVRRKDLEF+V DKVFLKVAPM+GVLRFERRGKLSP FVGPFEILERI
Subjt:  ALYGKCFRSPVCWGEVGEQRLMGHELVQSTNEAIQKIRSRMQTAQSRQKSYADVRRKDLEFDVWDKVFLKVAPMKGVLRFERRGKLSPCFVGPFEILERI

Query:  GPIAYRLALPPSLSVVHDVLHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKVLRNREIPLVKVLWQNHRIEEATWEREDDMRARYPKLF
        GP+AYRLALPPSLS VHDV HVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVK LRN+EIP+VKVLW+NHR+ EATWEREDDMR+RYP+LF
Subjt:  GPIAYRLALPPSLSVVHDVLHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKVLRNREIPLVKVLWQNHRIEEATWEREDDMRARYPKLF

Query:  EE
        EE
Subjt:  EE

KAA0040871.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]0.070.96Show/hide
Query:  MPPRRGARRGGREGRGRGAGRVQLEVQPVAQATNPAVPVTHADLAAMEQRFRDLIMQMREQQQPAPPTPVP------------VPV----VLDQLSVEAK
        MPPRRGARRGGR GRGR AGRVQ EVQPVAQAT+PA PVTHADLAAMEQRFRDLIMQMREQQQ APP P P            VPV    V DQLS EAK
Subjt:  MPPRRGARRGGREGRGRGAGRVQLEVQPVAQATNPAVPVTHADLAAMEQRFRDLIMQMREQQQPAPPTPVP------------VPV----VLDQLSVEAK

Query:  HLRDFRKYNPTTFDRSLEDPTRAQIWLSYLETIFRYMKCPEDQKVQCAVFMLTDR---------------------------------------------
        HLRDFRKYNPTTFD SLEDPTRAQ+WLS LETIFRYMKCPEDQKVQCAVFM TDR                                             
Subjt:  HLRDFRKYNPTTFDRSLEDPTRAQIWLSYLETIFRYMKCPEDQKVQCAVFMLTDR---------------------------------------------

Query:  ---------VEQYDAEFDMLCRFVPEMIATEAARADKFVRGLRLDIQGWFEPSDPSLMPIHCAWQWISVYRRGLTRPRSGGESRRFQQKPFEAEETARGK
                 VEQYDAEFDML RF PEMIATEAARA+    G+             S        Q + V +R     RSGGE RRFQQKPFEA E ARGK
Subjt:  ---------VEQYDAEFDMLCRFVPEMIATEAARADKFVRGLRLDIQGWFEPSDPSLMPIHCAWQWISVYRRGLTRPRSGGESRRFQQKPFEAEETARGK

Query:  PLCTTCGKHHLGRCLFGTKTCFKCRQEGHTADRCSMRLTGNVQNQGACALHQVKVFATNNTEAERAGMVVTGTLPVLGHYDLVLFDSGSSHSFISSAFVL
         LCTTCGKHHLGRCLFGT+TCFKCRQEGHTADRC +RLTGN QNQGA A HQ +VFATN TEAERAG VVTGTLPVLGHY LVLFDSGSSHSFISSAFVL
Subjt:  PLCTTCGKHHLGRCLFGTKTCFKCRQEGHTADRCSMRLTGNVQNQGACALHQVKVFATNNTEAERAGMVVTGTLPVLGHYDLVLFDSGSSHSFISSAFVL

Query:  HARLEVEPLHHVLSVSTPSGECMWSKGKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHARIDCSRKEVAFNPPSLASFKFKGEGSRPLSKVI
        HARLEVEPLHHVLSVSTPSGECM SK KVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHA IDCSRKEV FNP S+ASFKFKGEGSR L +VI
Subjt:  HARLEVEPLHHVLSVSTPSGECMWSKGKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHARIDCSRKEVAFNPPSLASFKFKGEGSRPLSKVI

Query:  SAMRASKLLSQGTWSILASMVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREIEFAIELEPGTVTISRAPYRMAPAELKELKVQLQELLDKGFIRPS
        SA+RASKLLSQGTW ILAS+VDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHRE+EFAIELEPGTV ISRAPY+MAPAELKELKVQLQELLDKGFIRPS
Subjt:  SAMRASKLLSQGTWSILASMVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREIEFAIELEPGTVTISRAPYRMAPAELKELKVQLQELLDKGFIRPS

Query:  MSPWGAPVLFVKKKDGSMRLCIDHRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTTFRSKYGHYEFIVMSFGLTNAPAV
        +SPWGAPVLFVKKKDGSMRLCID+RELNKVTVKN+YPLP+IDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKT FRS+YGHYEFIVMSFGLTNAPAV
Subjt:  MSPWGAPVLFVKKKDGSMRLCIDHRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTTFRSKYGHYEFIVMSFGLTNAPAV

Query:  FMDLMNRVFREFLDIFVIVFIDDILIYSKTEAEHEEHLHMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVGVSVNPAKIEAVTSWPRPSTVSEVRS
        FMDLMNRVFREFLD FVIVFIDDILIYSKTEAEHEEHL +VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSK GVSV+PAKIEAVT W RPSTVSEVRS
Subjt:  FMDLMNRVFREFLDIFVIVFIDDILIYSKTEAEHEEHLHMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVGVSVNPAKIEAVTSWPRPSTVSEVRS

Query:  FLGLAGYYRRFVENFSRIATPLTQLTRKRAPFVWSKAC--------------------------------------------------------------
        FLGLAGYYRRFVENFSRIATPLTQLTRK APFVWSKAC                                                              
Subjt:  FLGLAGYYRRFVENFSRIATPLTQLTRKRAPFVWSKAC--------------------------------------------------------------

Query:  -------------------------------------------------------------------KANVVADALSRKVSHSAALITRQAPLHRDLERA
                                                                           KANVVADALSRKVSHSAALITRQAPLHRDLERA
Subjt:  -------------------------------------------------------------------KANVVADALSRKVSHSAALITRQAPLHRDLERA

Query:  EISVSVGAVTMQLAQLTVQPTLR--IIDAQSNNPYLVDKRGLAKAWQAVEFSISYDGGLLFERRLCVPSDSAVKTKLLSEAHSSPFSMHPGSTKMYQDL-
        EI+VSVGAVTMQLAQLTVQPTLR  IIDAQ N+ YLV+KRGLA+A QAVEFS S DGGLLFERRLCVPSDSAVKT+LLSEAHSSP SMHPG+        
Subjt:  EISVSVGAVTMQLAQLTVQPTLR--IIDAQSNNPYLVDKRGLAKAWQAVEFSISYDGGLLFERRLCVPSDSAVKTKLLSEAHSSPFSMHPGSTKMYQDL-

Query:  -----------KRVY----------------------------------------WWRNMKREV---------------AEFVSKCLVCQQAAMDTRLDF
                   KRV+                                        W +    E+               A F SK     Q AM TRLDF
Subjt:  -----------KRVY----------------------------------------WWRNMKREV---------------AEFVSKCLVCQQAAMDTRLDF

Query:  STAFHTQTDGQTERLNQVLETMLRACALEFPGSWDSYLHLMEFAYNNSFQATISMAPFEALYGKCFRSPVCWGEVGEQRLMGHELVQSTNEAIQKIRSRM
        STAFH QTDGQTERLNQVLE MLRACALEFPGSWDS+LHLMEFAYNNS+QATI MAPFEALYGKC RSPVCWGEVGEQRLMG ELV STNEAIQKIRSRM
Subjt:  STAFHTQTDGQTERLNQVLETMLRACALEFPGSWDSYLHLMEFAYNNSFQATISMAPFEALYGKCFRSPVCWGEVGEQRLMGHELVQSTNEAIQKIRSRM

Query:  QTAQSRQKSYADVRRKDLEFDVWDKVFLKVAPMKGVLRFERRGKLSPCFVGPFEILERIGPIAYRLALPPSLSVVHDVLHVSMLRKYVPDPSHVVDYEPL
         TAQSRQKSYADVRRKDLEF+V DKVFLKVAPM+GVLRFERRGKLSP FVGPFEILERIGP+AYRLALPPSLS VHDV HVSMLRKYV DPSHVVDYEPL
Subjt:  QTAQSRQKSYADVRRKDLEFDVWDKVFLKVAPMKGVLRFERRGKLSPCFVGPFEILERIGPIAYRLALPPSLSVVHDVLHVSMLRKYVPDPSHVVDYEPL

Query:  EIDENLSYTEQPVEVLAREVKVLRNREIPLVKVLWQNHRIEEATWEREDDMRARYPKLFE
        EIDENLSYTEQPVEVLAREVK LRN+EIPLVKVLW+NHR+EEATWEREDDMR+RYP+LFE
Subjt:  EIDENLSYTEQPVEVLAREVKVLRNREIPLVKVLWQNHRIEEATWEREDDMRARYPKLFE

KAA0048687.1 pol protein [Cucumis melo var. makuwa]0.068.2Show/hide
Query:  MPPRRGARRGGREGRGRGAGRVQLEVQPVAQATNPAVPVTHADLAAMEQRFRDLIMQMREQQQPAPPTPVPVPV--------------------VLDQLS
        MPPRRGARRGGR GRGRGAGRVQ EVQPVAQA +PA PVTHADLAAMEQRFRDLIMQMREQQ+PA PTP P P                     V DQLS
Subjt:  MPPRRGARRGGREGRGRGAGRVQLEVQPVAQATNPAVPVTHADLAAMEQRFRDLIMQMREQQQPAPPTPVPVPV--------------------VLDQLS

Query:  VEAKHLRDFRKYNPTTFDRSLEDPTRAQIWLSYLETIFRYMKCPEDQKVQCAVFMLTDR-----------------------------------------
         EAKHLRDFRKYNPTTFD SLEDPTRAQ+WLS LETIFRYMKCPEDQKVQCAVFMLTDR                                         
Subjt:  VEAKHLRDFRKYNPTTFDRSLEDPTRAQIWLSYLETIFRYMKCPEDQKVQCAVFMLTDR-----------------------------------------

Query:  -------------VEQYDAEFDMLCRFVPEMIATEAARADKFVRGLRLDIQGW---FEPS---DPSLMPIHCAWQWIS----VYRRGLT---------RP
                     VEQYDAEFDML RF PEMIATEAARADKFVRGLRLDIQG    F P+   D   + +  + Q  +       RGLT         +P
Subjt:  -------------VEQYDAEFDMLCRFVPEMIATEAARADKFVRGLRLDIQGW---FEPS---DPSLMPIHCAWQWIS----VYRRGLT---------RP

Query:  --------RSGGESRRFQQKPFEAEETARGKPLCTTCGKHHLGRCLFGTKTCFKCRQEGHTADRCSMRLTGNVQNQGACALHQVKVFATNNTEAERAGMV
                RSGGE RRFQQKPFEA E AR KPLCT CGKHHLGRCLFGT+TCFKCRQEGHTADRC +RLTGN QNQGA A HQ +VFATN TEAE+AG V
Subjt:  --------RSGGESRRFQQKPFEAEETARGKPLCTTCGKHHLGRCLFGTKTCFKCRQEGHTADRCSMRLTGNVQNQGACALHQVKVFATNNTEAERAGMV

Query:  VTGTLPVLGHYDLVLFDSGSSHSFISSAFVLHARLEVEPLHHVLSVSTPSGECMWSKGKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHARI
        VTGTLPVLGHY LVLFDS                           VSTPSGECM SK KVK CQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAA+HA I
Subjt:  VTGTLPVLGHYDLVLFDSGSSHSFISSAFVLHARLEVEPLHHVLSVSTPSGECMWSKGKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHARI

Query:  DCSRKEVAFNPPSLASFKFKGEGSRPLSKVISAMRASKLLSQGTWSILASMVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREIEFAIELEPGTVTI
        DCSRKEV FNPPS ASFKFKG GSR L +VISA+RASKLLSQGTW ILAS+VDTRE DVSLSSEPVVRDYPDVFPEELPGLPPHRE+EFAIELEPGTV I
Subjt:  DCSRKEVAFNPPSLASFKFKGEGSRPLSKVISAMRASKLLSQGTWSILASMVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREIEFAIELEPGTVTI

Query:  SRAPYRMAPAELKELKVQLQELLDKGFIRPSMSPWGAPVLFVKKKDGSMRLCIDHRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK
        SRAPYRMAPAELKELKVQLQELLDKGFIRPS+SPWGAPVLFVKKKDGSMRLCID+RELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK
Subjt:  SRAPYRMAPAELKELKVQLQELLDKGFIRPSMSPWGAPVLFVKKKDGSMRLCIDHRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK

Query:  DGDVPKTTFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDIFVIVFIDDILIYSKTEAEHEEHLHMVLQTLRDNKLYAKFSKCEFWLKQVSFLGH
        D DVPKT FRS+YGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLD FVIVFIDDILIYSKTEAEHEEHL MVLQTLRDNKLYAKFSKCEFWLKQVSFLGH
Subjt:  DGDVPKTTFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDIFVIVFIDDILIYSKTEAEHEEHLHMVLQTLRDNKLYAKFSKCEFWLKQVSFLGH

Query:  VVSKVGVSVNPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKRAPFVWSKAC-------------------------------
        VVSK GVSV+PAKIEAVT W RPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRK APFVWSKAC                               
Subjt:  VVSKVGVSVNPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKRAPFVWSKAC-------------------------------

Query:  --------------------------------------------------------------------------------------------------KA
                                                                                                          KA
Subjt:  --------------------------------------------------------------------------------------------------KA

Query:  NVVADALSRKVSHSAALITRQAPLHRDLERAEISVSVGAVTMQLAQLTVQPTLR--IIDAQSNNPYLVDKRGLAKAWQAVEFSISYDGGLLFERRLCVPS
        NVVADALSRKVSHSAALITRQAPLHRDLERAEI+VSVGAVTMQLAQLTVQPTLR  IIDAQSN+PYLV+KRGLA+A QAVEFS+S DGGLLFERRLCVPS
Subjt:  NVVADALSRKVSHSAALITRQAPLHRDLERAEISVSVGAVTMQLAQLTVQPTLR--IIDAQSNNPYLVDKRGLAKAWQAVEFSISYDGGLLFERRLCVPS

Query:  DSAVKTKLLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQA--------------------------------------------
        DS VKT+LLSEAHSSPFSMHPGSTKMY+D+KRVYWWRNMKREVAEFVS+CLVCQQ                                             
Subjt:  DSAVKTKLLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQA--------------------------------------------

Query:  -----------------------------------------------------------AMDTRLDFSTAFHTQTDGQTERLNQVLETMLRACALEFPGS
                                                                   AM TRLDFSTAFH QTDGQTERLNQVLE MLRACALEFPGS
Subjt:  -----------------------------------------------------------AMDTRLDFSTAFHTQTDGQTERLNQVLETMLRACALEFPGS

Query:  WDSYLHLMEFAYNNSFQATISMAPFEALYGKCFRSPVCWGEVGEQRLMGHELVQSTNEAIQKIRSRMQTAQSRQKSYADVRRKDLEFDVWDKVFLKVAPM
        WDS+LHLMEF YNNS+QATI MAPFEALYGKC RSPVCWGEVGEQRLMG ELVQSTNEAIQKIRSRM TAQSRQKSYADVRRKDLEF+V DKVFLKVAPM
Subjt:  WDSYLHLMEFAYNNSFQATISMAPFEALYGKCFRSPVCWGEVGEQRLMGHELVQSTNEAIQKIRSRMQTAQSRQKSYADVRRKDLEFDVWDKVFLKVAPM

Query:  KGVLRFERRGKLSPCFVGPFEILERIGPIAYRLALPPSLSVVHDVLHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKVLRNREIPLVKV
        +GVLRFERRGKLSP F+GPFEILERIGP+AYRLALPPSLS VHDV HVSMLRKYVPDPSHVVDY+PL+IDENLSYTEQPVEVLAREVK LRN+EIPLVKV
Subjt:  KGVLRFERRGKLSPCFVGPFEILERIGPIAYRLALPPSLSVVHDVLHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKVLRNREIPLVKV

Query:  LWQNHRIEEATWEREDDMRARYPKL
        LW+NHR+EEATWEREDDM++RYP+L
Subjt:  LWQNHRIEEATWEREDDMRARYPKL

KAA0062141.1 pol protein [Cucumis melo var. makuwa]0.069.82Show/hide
Query:  MPPRRGARRGGREGRGRGAGRVQLEVQPVAQATNPAVPVTHADLAAMEQRFRDLIMQMREQQQPAPPTPVPVP--------------------VVLDQLS
        MPPRRGARRGGR GRGRGAGRVQ EVQPVAQAT+PA PVTHADLAAMEQRFRDLIMQMREQQQPAPP P P P                    VV DQLS
Subjt:  MPPRRGARRGGREGRGRGAGRVQLEVQPVAQATNPAVPVTHADLAAMEQRFRDLIMQMREQQQPAPPTPVPVP--------------------VVLDQLS

Query:  VEAKHLRDFRKYNPTTFDRSLEDPTRAQIWLSYLETIFRYMKCPEDQKVQCAVFMLTDR-----------------------------------------
         EAKHLRDFRKYNPTTFD SLEDPTRAQ+WLS LETIF+YMKCPEDQKVQCA+FMLTDR                                         
Subjt:  VEAKHLRDFRKYNPTTFDRSLEDPTRAQIWLSYLETIFRYMKCPEDQKVQCAVFMLTDR-----------------------------------------

Query:  -------------VEQYDAEFDMLCRFVPEMIATEAARADKFVRGLRLDIQGW---FEPS---DPSLMPIHCAWQ-------------WISVYRRGLTRP
                     VEQYDAEFDML RF PEMIATEAARADKFVRGLRLDIQG    F P+   D   + +  + Q                  R+   +P
Subjt:  -------------VEQYDAEFDMLCRFVPEMIATEAARADKFVRGLRLDIQGW---FEPS---DPSLMPIHCAWQ-------------WISVYRRGLTRP

Query:  --------RSGGESRRFQQKPFEAEETARGKPLCTTCGKHHLGRCLFGTKTCFKCRQEGHTADRCSMRLTGNVQNQGACALHQVKVFATNNTEAERAGMV
                RSGGE RRFQQKPFEA E AR KPLCTTCGKHHLGRCLFGT+TCFKCRQEGHTADRC +RLTGN QNQGA   HQ +VFATN TEAERAG V
Subjt:  --------RSGGESRRFQQKPFEAEETARGKPLCTTCGKHHLGRCLFGTKTCFKCRQEGHTADRCSMRLTGNVQNQGACALHQVKVFATNNTEAERAGMV

Query:  VTGTLPVLGHYDLVLFDSGSSHSFISSAFVLHARLEVEPLHHVLSVSTPSGECMWSKGKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHARI
        VTGTLPVLGHY LVLFDSGSSHSFISSAFVLHARLE                            IEIAGHVI+VTLLVLDMLDFDVILGMDWLAANHA I
Subjt:  VTGTLPVLGHYDLVLFDSGSSHSFISSAFVLHARLEVEPLHHVLSVSTPSGECMWSKGKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHARI

Query:  DCSRKEVAFNPPSLASFKFKGEGSRPLSKVISAMRASKLLSQGTWSILASMVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREIEFAIELEPGTVTI
        DCSRKEV FNPPS+ASFKFK  GSR L +VISA+RASKLLSQGTW ILAS+VDTREVDVSLSSEPV+RDYPDVFPEELPGLPPHRE+EFAIELE GTV I
Subjt:  DCSRKEVAFNPPSLASFKFKGEGSRPLSKVISAMRASKLLSQGTWSILASMVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREIEFAIELEPGTVTI

Query:  SRAPYRMAPAELKELKVQLQELLDKGFIRPSMSPWGAPVLFVKKKDGSMRLCIDHRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK
        SRAPYRMAP ELKELKVQLQELLDKGFIRPS+SPWGAPVLFVKKKDGSMRLCID+RELNKVT+KNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK
Subjt:  SRAPYRMAPAELKELKVQLQELLDKGFIRPSMSPWGAPVLFVKKKDGSMRLCIDHRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK

Query:  DGDVPKTTFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDIFVIVFIDDILIYSKTEAEHEEHLHMVLQTLRDNKLYAKFSKCEFWLKQVSFLGH
        DGDVPKT F S+YGH EFIVMSFGLTNAPAVFMDLMNRVFR+FLD FVIVFIDDILIYSKTEAEHEEHL +VLQTLRDNKLYAKFSKCEFWLKQVSFLGH
Subjt:  DGDVPKTTFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDIFVIVFIDDILIYSKTEAEHEEHLHMVLQTLRDNKLYAKFSKCEFWLKQVSFLGH

Query:  VVSKVGVSVNPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKRAPFVWSKAC-------------------------------
        VVSK GVSV+PAKIEAVT W RPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRK APFVWSKAC                               
Subjt:  VVSKVGVSVNPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKRAPFVWSKAC-------------------------------

Query:  ------------------------------------------------------KANVVADALSRKVSHSAALITRQAPLHRDLERAEISVSVGAVTMQL
                                                              KANVVAD LSRKVSHSAALITRQAPLHRDLERAEI+VS+GAVTMQL
Subjt:  ------------------------------------------------------KANVVADALSRKVSHSAALITRQAPLHRDLERAEISVSVGAVTMQL

Query:  AQLTVQPTLR--IIDAQSNNPYLVDKRGLAKAWQAVEFSISYDGGLLFERRLCVPSDSAVKTKLLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVA
        AQLTVQPTLR  IIDAQSN+PYLV+KRGLA+  QAVEFSIS DGGLLFERRLCVPSDSAVKT+LLSEAHSSPFSMHPGS KMYQ+LKRVYWWRNMKREVA
Subjt:  AQLTVQPTLR--IIDAQSNNPYLVDKRGLAKAWQAVEFSISYDGGLLFERRLCVPSDSAVKTKLLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVA

Query:  EFVSKCLVCQQA----------------------------------------------------------------------------------------
        EFVS+CLVCQQ                                                                                         
Subjt:  EFVSKCLVCQQA----------------------------------------------------------------------------------------

Query:  ---------------AMDTRLDFSTAFHTQTDGQTERLNQVLETMLRACALEFPGSWDSYLHLMEFAYNNSFQATISMAPFEALYGKCFRSPVCWGEVGE
                       AM TRLDFSTAFH QTDGQTERLNQVLE MLRACALEFPGSWDS+LHLMEFAYNNS+QATI MAPFE LYGKC RSPVCWGEVGE
Subjt:  ---------------AMDTRLDFSTAFHTQTDGQTERLNQVLETMLRACALEFPGSWDSYLHLMEFAYNNSFQATISMAPFEALYGKCFRSPVCWGEVGE

Query:  QRLMGHELVQSTNEAIQKIRSRMQTAQSRQKSYADVRRKDLEFDVWDKVFLKVAPMKGVLRFERRGKLSPCFVGPFEILERIGPIAYRLALPPSLSVVHD
        QRLMG ELVQSTNEAIQKIRSRM TAQSRQKSYADVRRKDLEF+V DKVFLKVAPM+GVLRFERRGKLSP FVGPFEILERIGP+AYRLALPPSLS VHD
Subjt:  QRLMGHELVQSTNEAIQKIRSRMQTAQSRQKSYADVRRKDLEFDVWDKVFLKVAPMKGVLRFERRGKLSPCFVGPFEILERIGPIAYRLALPPSLSVVHD

Query:  VLHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKVLRNREIPLVKVLWQNHRIEEATWEREDDMRARYPKL
        V HVSMLRKYVPDPSHVVDYEPLEIDEN SYTEQPVEVLAREVK LRN+EIPLVKVLW+NHR+EEATWEREDDMR+R  +L
Subjt:  VLHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKVLRNREIPLVKVLWQNHRIEEATWEREDDMRARYPKL

KAA0062245.1 pol protein [Cucumis melo var. makuwa]0.073.11Show/hide
Query:  MPPRRGARRGGREGRGRGAGRVQLEVQPVAQATNPAVPVTHADLAAMEQRFRDLIMQMREQQQPAPPTPVPVPV----------------VLDQLSVEAK
        MPPRRGARRGGR GRGRGAGRVQLEVQPVAQA +PA PVTHADLAAMEQRFRDLIMQMREQQ+PA PTP P P                 V DQLS EAK
Subjt:  MPPRRGARRGGREGRGRGAGRVQLEVQPVAQATNPAVPVTHADLAAMEQRFRDLIMQMREQQQPAPPTPVPVPV----------------VLDQLSVEAK

Query:  HLRDFRKYNPTTFDRSLEDPTRAQIWLSYLETIFRYMKCPEDQKVQCAVFMLTDR--------------------------VEQYDAEFDMLCRFVPEMI
        HLRDFRKYNPTTFD SLEDPTRAQ+WLS LETIFRYMKCPE+QKVQCAVFMLTDR                          VEQYDAEFDML RF PEMI
Subjt:  HLRDFRKYNPTTFDRSLEDPTRAQIWLSYLETIFRYMKCPEDQKVQCAVFMLTDR--------------------------VEQYDAEFDMLCRFVPEMI

Query:  ATEAARADKFVRGLRLDIQGW---FEPS---DPSLMPIHCAWQ-------------WISVYRRGLTRP--------RSGGESRRFQQKPFEAEETARGKP
        ATEAA ADKFVRGLRLDIQG    F P+   D   + +  + Q                  R+   +P        RSGGE  RFQQKPFEA E ARGKP
Subjt:  ATEAARADKFVRGLRLDIQGW---FEPS---DPSLMPIHCAWQ-------------WISVYRRGLTRP--------RSGGESRRFQQKPFEAEETARGKP

Query:  LCTTCGKHHLGRCLFGTKTCFKCRQEGHTADRCSMRLTGNVQNQGACALHQVKVFATNNTEAERAGMVVTGTLPVLGHYDLVLFDSGSSHSFISSAFVLH
        LCTTCGKHHLGRCLFGT+TCFKCRQEGHTADRC +RLTGN QNQG  A HQ +VFATN TEAE+AG VVTGTLPVLGHY LVLFDSGSSHSFISSAFVLH
Subjt:  LCTTCGKHHLGRCLFGTKTCFKCRQEGHTADRCSMRLTGNVQNQGACALHQVKVFATNNTEAERAGMVVTGTLPVLGHYDLVLFDSGSSHSFISSAFVLH

Query:  ARLEVEPLHHVLSVSTPSGECMWSKGKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHARIDCSRKEVAFNPPSLASFKFKGEGSRPLSKVIS
        ARLEVEPLHHVLSVSTPSGECM SK KVKACQIEIA HVIEVTL+VLDMLDFDVILGMDWL ANHA IDCSRKEV FNPPS+ASF+ KG GS+ L +VIS
Subjt:  ARLEVEPLHHVLSVSTPSGECMWSKGKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHARIDCSRKEVAFNPPSLASFKFKGEGSRPLSKVIS

Query:  AMRASKLLSQGTWSILASMVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREIEFAIELEPGTVTISRAPYRMAPAELKELKVQLQELLDKGFIRPSM
        A+RASKLLSQGTW IL S+VDTRE DVSLSSEPVVRDYPDVFPEELPGLP HRE+EFAIELEPGTV ISRAPYRMAPAELKELKVQLQELLDKGFIRPS+
Subjt:  AMRASKLLSQGTWSILASMVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREIEFAIELEPGTVTISRAPYRMAPAELKELKVQLQELLDKGFIRPSM

Query:  SPWGAPVLFVKKKDGSMRLCIDHRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTTFRSKYGHYEFIVMSFGLTNAPAVF
        SPWGAPVLFVKKKDGSMRLCID+RELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPKT FRS+YGHYEFIVMSFGLTNAPAVF
Subjt:  SPWGAPVLFVKKKDGSMRLCIDHRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTTFRSKYGHYEFIVMSFGLTNAPAVF

Query:  MDLMNRVFREFLDIFVIVFIDDILIYSKTEAEHEEHLHMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVGVSVNPAKIEAVTSWPRPSTVSEVRSF
        MDLMNRVFREFLD FVIVFIDDILIYSKTEAEHEEHL MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSK GVSV+PAKIEAVT W RPST+SEVRSF
Subjt:  MDLMNRVFREFLDIFVIVFIDDILIYSKTEAEHEEHLHMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVGVSVNPAKIEAVTSWPRPSTVSEVRSF

Query:  LGLAGYYRRFVENFSRIATPLTQLTRKRAPFVWSKAC---------------------------------------------------------------
        LGLAGYYRRFVENFSRIATPLTQLTRK APFVWSKAC                                                               
Subjt:  LGLAGYYRRFVENFSRIATPLTQLTRKRAPFVWSKAC---------------------------------------------------------------

Query:  ----------------------------KANVVADALSRKVSHSAALITRQAPLHRDLERAEISVSVGAVTMQLAQLTVQPTLR--IIDAQSNNPYLVDK
                                    KANVVADALSRKVSHSAALITRQAPLHRDLERAEI+VSVGAVTMQLAQLTVQPTLR  IIDAQSN+PYLV+K
Subjt:  ----------------------------KANVVADALSRKVSHSAALITRQAPLHRDLERAEISVSVGAVTMQLAQLTVQPTLR--IIDAQSNNPYLVDK

Query:  RGLAKAWQAVEFSISYDGGLLFERRLCVPSDSAVKTKLLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQA--------------
        RGLA+A QA EFS+S DGGLLFERRLCVPSDSAVKT+LLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQ               
Subjt:  RGLAKAWQAVEFSISYDGGLLFERRLCVPSDSAVKTKLLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQA--------------

Query:  -----------------------------------------------------------------------------------------AMDTRLDFSTA
                                                                                                 AM TRLDFSTA
Subjt:  -----------------------------------------------------------------------------------------AMDTRLDFSTA

Query:  FHTQTDGQTERLNQVLETMLRACALEFPGSWDSYLHLMEFAYNNSFQATISMAPFEALYGKCFRSPVCWGEVGEQRLMGHELVQSTNEAIQKIRSRMQTA
        FH QTDGQTERLNQVLE MLRACALEFPGSWDS+LHLMEFAYNNS+QATI MAPFEALYGKC +SPVCWGEVGEQRLMG ELVQSTNEAIQKIRSRM TA
Subjt:  FHTQTDGQTERLNQVLETMLRACALEFPGSWDSYLHLMEFAYNNSFQATISMAPFEALYGKCFRSPVCWGEVGEQRLMGHELVQSTNEAIQKIRSRMQTA

Query:  QSRQKSYADVRRKDLEFDVWDKVFLKVAPMKGVLRFERRGKLSPCFVGPFEILERIGPIAYRLALPPSLSVVHDVLHVSMLRKYVPDPSHVVDYEPLEID
        QSRQKSYADVRRKDLEF+V DKVFLKVAPM+GVLRFERRGKLSP FVGPFEILERIGPIAYRLALPPSLS VHDV HVSMLRKYVPDPSHVVDYEPLEID
Subjt:  QSRQKSYADVRRKDLEFDVWDKVFLKVAPMKGVLRFERRGKLSPCFVGPFEILERIGPIAYRLALPPSLSVVHDVLHVSMLRKYVPDPSHVVDYEPLEID

Query:  ENLSYTEQPVEVLAREVKVLRNREIPLVKVLWQNHRIEEATWEREDDMRARYPKLFEE
        ENLSY EQPVEVLAREVK LRN+EIPLVKVLW+NHR+EEATWEREDDMR+RYP LFEE
Subjt:  ENLSYTEQPVEVLAREVKVLRNREIPLVKVLWQNHRIEEATWEREDDMRARYPKLFEE

TrEMBL top hitse value%identityAlignment
A0A5A7TGS7 Reverse transcriptase0.0e+0070.96Show/hide
Query:  MPPRRGARRGGREGRGRGAGRVQLEVQPVAQATNPAVPVTHADLAAMEQRFRDLIMQMREQQQPAPPTP------------VPVP----VVLDQLSVEAK
        MPPRRGARRGGR GRGR AGRVQ EVQPVAQAT+PA PVTHADLAAMEQRFRDLIMQMREQQQ APP P             PVP    VV DQLS EAK
Subjt:  MPPRRGARRGGREGRGRGAGRVQLEVQPVAQATNPAVPVTHADLAAMEQRFRDLIMQMREQQQPAPPTP------------VPVP----VVLDQLSVEAK

Query:  HLRDFRKYNPTTFDRSLEDPTRAQIWLSYLETIFRYMKCPEDQKVQCAVFMLTDR---------------------------------------------
        HLRDFRKYNPTTFD SLEDPTRAQ+WLS LETIFRYMKCPEDQKVQCAVFM TDR                                             
Subjt:  HLRDFRKYNPTTFDRSLEDPTRAQIWLSYLETIFRYMKCPEDQKVQCAVFMLTDR---------------------------------------------

Query:  ---------VEQYDAEFDMLCRFVPEMIATEAARADKFVRGLRLDIQGWFEPSDPSLMPIHCAWQWISVYRRGLTRPRSGGESRRFQQKPFEAEETARGK
                 VEQYDAEFDML RF PEMIATEAARA+    G+             S        Q + V +R     RSGGE RRFQQKPFEA E ARGK
Subjt:  ---------VEQYDAEFDMLCRFVPEMIATEAARADKFVRGLRLDIQGWFEPSDPSLMPIHCAWQWISVYRRGLTRPRSGGESRRFQQKPFEAEETARGK

Query:  PLCTTCGKHHLGRCLFGTKTCFKCRQEGHTADRCSMRLTGNVQNQGACALHQVKVFATNNTEAERAGMVVTGTLPVLGHYDLVLFDSGSSHSFISSAFVL
         LCTTCGKHHLGRCLFGT+TCFKCRQEGHTADRC +RLTGN QNQGA A HQ +VFATN TEAERAG VVTGTLPVLGHY LVLFDSGSSHSFISSAFVL
Subjt:  PLCTTCGKHHLGRCLFGTKTCFKCRQEGHTADRCSMRLTGNVQNQGACALHQVKVFATNNTEAERAGMVVTGTLPVLGHYDLVLFDSGSSHSFISSAFVL

Query:  HARLEVEPLHHVLSVSTPSGECMWSKGKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHARIDCSRKEVAFNPPSLASFKFKGEGSRPLSKVI
        HARLEVEPLHHVLSVSTPSGECM SK KVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHA IDCSRKEV FNP S+ASFKFKGEGSR L +VI
Subjt:  HARLEVEPLHHVLSVSTPSGECMWSKGKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHARIDCSRKEVAFNPPSLASFKFKGEGSRPLSKVI

Query:  SAMRASKLLSQGTWSILASMVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREIEFAIELEPGTVTISRAPYRMAPAELKELKVQLQELLDKGFIRPS
        SA+RASKLLSQGTW ILAS+VDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHRE+EFAIELEPGTV ISRAPY+MAPAELKELKVQLQELLDKGFIRPS
Subjt:  SAMRASKLLSQGTWSILASMVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREIEFAIELEPGTVTISRAPYRMAPAELKELKVQLQELLDKGFIRPS

Query:  MSPWGAPVLFVKKKDGSMRLCIDHRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTTFRSKYGHYEFIVMSFGLTNAPAV
        +SPWGAPVLFVKKKDGSMRLCID+RELNKVTVKN+YPLP+IDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKT FRS+YGHYEFIVMSFGLTNAPAV
Subjt:  MSPWGAPVLFVKKKDGSMRLCIDHRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTTFRSKYGHYEFIVMSFGLTNAPAV

Query:  FMDLMNRVFREFLDIFVIVFIDDILIYSKTEAEHEEHLHMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVGVSVNPAKIEAVTSWPRPSTVSEVRS
        FMDLMNRVFREFLD FVIVFIDDILIYSKTEAEHEEHL +VLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSK GVSV+PAKIEAVT W RPSTVSEVRS
Subjt:  FMDLMNRVFREFLDIFVIVFIDDILIYSKTEAEHEEHLHMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVGVSVNPAKIEAVTSWPRPSTVSEVRS

Query:  FLGLAGYYRRFVENFSRIATPLTQLTRKRAPFVWSKAC--------------------------------------------------------------
        FLGLAGYYRRFVENFSRIATPLTQLTRK APFVWSKAC                                                              
Subjt:  FLGLAGYYRRFVENFSRIATPLTQLTRKRAPFVWSKAC--------------------------------------------------------------

Query:  -------------------------------------------------------------------KANVVADALSRKVSHSAALITRQAPLHRDLERA
                                                                           KANVVADALSRKVSHSAALITRQAPLHRDLERA
Subjt:  -------------------------------------------------------------------KANVVADALSRKVSHSAALITRQAPLHRDLERA

Query:  EISVSVGAVTMQLAQLTVQPTL--RIIDAQSNNPYLVDKRGLAKAWQAVEFSISYDGGLLFERRLCVPSDSAVKTKLLSEAHSSPFSMHPGSTKMYQDL-
        EI+VSVGAVTMQLAQLTVQPTL  RIIDAQ N+ YLV+KRGLA+A QAVEFS S DGGLLFERRLCVPSDSAVKT+LLSEAHSSP SMHPG+        
Subjt:  EISVSVGAVTMQLAQLTVQPTL--RIIDAQSNNPYLVDKRGLAKAWQAVEFSISYDGGLLFERRLCVPSDSAVKTKLLSEAHSSPFSMHPGSTKMYQDL-

Query:  -----------KRVY----------------------------------------WWRNMKREV---------------AEFVSKCLVCQQAAMDTRLDF
                   KRV+                                        W +    E+               A F SK     Q AM TRLDF
Subjt:  -----------KRVY----------------------------------------WWRNMKREV---------------AEFVSKCLVCQQAAMDTRLDF

Query:  STAFHTQTDGQTERLNQVLETMLRACALEFPGSWDSYLHLMEFAYNNSFQATISMAPFEALYGKCFRSPVCWGEVGEQRLMGHELVQSTNEAIQKIRSRM
        STAFH QTDGQTERLNQVLE MLRACALEFPGSWDS+LHLMEFAYNNS+QATI MAPFEALYGKC RSPVCWGEVGEQRLMG ELV STNEAIQKIRSRM
Subjt:  STAFHTQTDGQTERLNQVLETMLRACALEFPGSWDSYLHLMEFAYNNSFQATISMAPFEALYGKCFRSPVCWGEVGEQRLMGHELVQSTNEAIQKIRSRM

Query:  QTAQSRQKSYADVRRKDLEFDVWDKVFLKVAPMKGVLRFERRGKLSPCFVGPFEILERIGPIAYRLALPPSLSVVHDVLHVSMLRKYVPDPSHVVDYEPL
         TAQSRQKSYADVRRKDLEF+V DKVFLKVAPM+GVLRFERRGKLSP FVGPFEILERIGP+AYRLALPPSLS VHDV HVSMLRKYV DPSHVVDYEPL
Subjt:  QTAQSRQKSYADVRRKDLEFDVWDKVFLKVAPMKGVLRFERRGKLSPCFVGPFEILERIGPIAYRLALPPSLSVVHDVLHVSMLRKYVPDPSHVVDYEPL

Query:  EIDENLSYTEQPVEVLAREVKVLRNREIPLVKVLWQNHRIEEATWEREDDMRARYPKLFE
        EIDENLSYTEQPVEVLAREVK LRN+EIPLVKVLW+NHR+EEATWEREDDMR+RYP+LFE
Subjt:  EIDENLSYTEQPVEVLAREVKVLRNREIPLVKVLWQNHRIEEATWEREDDMRARYPKLFE

A0A5A7THE6 Reverse transcriptase0.0e+0069.04Show/hide
Query:  MPPRRGARRGGREGRGRGAGRVQLEVQPVAQATNPAVPVTHADLAAMEQRFRDLIMQMREQQQPAPPTPVPVPV----------------VLDQLSVEAK
        MPPRRGARRGGR GRGRGAGRVQ EVQPVAQA +PA PVTHADLAAMEQRFRDLIMQMREQQ+PA PTP P P                 V DQLS EAK
Subjt:  MPPRRGARRGGREGRGRGAGRVQLEVQPVAQATNPAVPVTHADLAAMEQRFRDLIMQMREQQQPAPPTPVPVPV----------------VLDQLSVEAK

Query:  HLRDFRKYNPTTFDRSLEDPTRAQIWLSYLETIFRYMKCPEDQKVQCAVFMLTDR---------------------------------------------
        HLRDFRKYNPTTFD SLEDPTRAQ+WLS LETIFRYMKCPEDQKVQCAVFMLTDR                                             
Subjt:  HLRDFRKYNPTTFDRSLEDPTRAQIWLSYLETIFRYMKCPEDQKVQCAVFMLTDR---------------------------------------------

Query:  ---------VEQYDAEFDMLCRFVPEMIATEAARADKFVRGLRLDIQGW---FEP---SDPSLMPIHCAWQ-------------WISVYRRGLTRP----
                 VEQYDAEFDML RF PEMIATEAARADKFVRGLRLDIQG    F P   +D   + +  + Q                  R+   +P    
Subjt:  ---------VEQYDAEFDMLCRFVPEMIATEAARADKFVRGLRLDIQGW---FEP---SDPSLMPIHCAWQ-------------WISVYRRGLTRP----

Query:  ----RSGGESRRFQQKPFEAEETARGKPLCTTCGKHHLGRCLFGTKTCFKCRQEGHTADRCSMRLTGNVQNQGACALHQVKVFATNNTEAERAGMVVTGT
            RSGGE RRFQQKPFEA E ARGKPLCTTCGKHHLGRCLFGT+TCFKCRQEGHTADRC +RLTGN QNQ A A HQ +VFATN TEAE+AG VVTGT
Subjt:  ----RSGGESRRFQQKPFEAEETARGKPLCTTCGKHHLGRCLFGTKTCFKCRQEGHTADRCSMRLTGNVQNQGACALHQVKVFATNNTEAERAGMVVTGT

Query:  LPVLGHYDLVLFDSGSSHSFISSAFVLHARLEVEPLHHVLSVSTPSGECMWSKGKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHARIDCSR
        LPVLGHY LVLFDSGSSHSFIS AFVLHARLEVEPLHHVLSVSTPSGECM SK K+KACQIEIAGHVIEVTL+VLDMLDF+VIL                
Subjt:  LPVLGHYDLVLFDSGSSHSFISSAFVLHARLEVEPLHHVLSVSTPSGECMWSKGKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHARIDCSR

Query:  KEVAFNPPSLASFKFKGEGSRPLSKVISAMRASKLLSQGTWSILASMVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREIEFAIELEPGTVTISRAP
                        G   + +++VISA+RASKLLSQGT  ILAS+VDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHRE+EFAIELEPGTV ISRAP
Subjt:  KEVAFNPPSLASFKFKGEGSRPLSKVISAMRASKLLSQGTWSILASMVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREIEFAIELEPGTVTISRAP

Query:  YRMAPAELKELKVQLQELLDKGFIRPSMSPWGAPVLFVKKKDGSMRLCIDHRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDV
        YRMAPAELKELKVQLQELLDKGFIRPSMSPWGAPVLFVKKKDGSMRLCID+RELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DV
Subjt:  YRMAPAELKELKVQLQELLDKGFIRPSMSPWGAPVLFVKKKDGSMRLCIDHRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDV

Query:  PKTTFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDIFVIVFIDDILIYSKTEAEHEEHLHMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSK
        PKT FRS+YGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLD FVIVFIDDILIYSKTEAEHEEHL MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSK
Subjt:  PKTTFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDIFVIVFIDDILIYSKTEAEHEEHLHMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSK

Query:  VGVSVNPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKRAPFVWSKAC-----------------------------------
         GVSV+P KIEAVT W RPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRK APFVWSKAC                                   
Subjt:  VGVSVNPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKRAPFVWSKAC-----------------------------------

Query:  ----------------------------------------------------------------------------------------------KANVVA
                                                                                                      KANVVA
Subjt:  ----------------------------------------------------------------------------------------------KANVVA

Query:  DALSRKVSHSAALITRQAPLHRDLERAEISVSVGAVTMQLAQLTVQPTL--RIIDAQSNNPYLVDKRGLAKAWQAVEFSISYDGGLLFERRLCVPSDSAV
        DALSRKVSHSAALITRQAPLHRDLERAEI+VSVGAVTMQLAQLTVQPTL  RIIDAQSN+PYLV+KRGLA+A QAVEFSIS DGGLLFERRLCVPSDSA+
Subjt:  DALSRKVSHSAALITRQAPLHRDLERAEISVSVGAVTMQLAQLTVQPTL--RIIDAQSNNPYLVDKRGLAKAWQAVEFSISYDGGLLFERRLCVPSDSAV

Query:  KTKLLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQ--------------------------------------------------
        KT+LLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS+CLVCQ                                                  
Subjt:  KTKLLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQ--------------------------------------------------

Query:  -------------------------------QAAMDTRLDFSTAFHTQTDGQTERLNQVLETMLRACALEFPGSWDSYLHLMEFAYNNSFQATISMAPFE
                                       Q AM TRLDFSTAFH QT+GQTERLNQVLE MLRACALEFPGSWDS+LHLMEFAYNNS+QATI MAPFE
Subjt:  -------------------------------QAAMDTRLDFSTAFHTQTDGQTERLNQVLETMLRACALEFPGSWDSYLHLMEFAYNNSFQATISMAPFE

Query:  ALYGKCFRSPVCWGEVGEQRLMGHELVQSTNEAIQKIRSRMQTAQSRQKSYADVRRKDLEFDVWDKVFLKVAPMKGVLRFERRGKLSPCFVGPFEILERI
        ALYGKC RSPVCWGEVGEQRLMG ELV+STNEAIQKIRSRM TAQSRQKSYADVRRKDLEF+V DKVFLKVAPM+GVLRFERRGKLSP FVGPFEILERI
Subjt:  ALYGKCFRSPVCWGEVGEQRLMGHELVQSTNEAIQKIRSRMQTAQSRQKSYADVRRKDLEFDVWDKVFLKVAPMKGVLRFERRGKLSPCFVGPFEILERI

Query:  GPIAYRLALPPSLSVVHDVLHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKVLRNREIPLVKVLWQNHRIEEATWEREDDMRARYPKLF
        GP+AYRLALPPSLS VHDV HVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVK LRN+EIP+VKVLW+NHR+ EATWEREDDMR+RYP+LF
Subjt:  GPIAYRLALPPSLSVVHDVLHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKVLRNREIPLVKVLWQNHRIEEATWEREDDMRARYPKLF

Query:  EE
        EE
Subjt:  EE

A0A5A7U330 Reverse transcriptase0.0e+0068.13Show/hide
Query:  MPPRRGARRGGREGRGRGAGRVQLEVQPVAQATNPAVPVTHADLAAMEQRFRDLIMQMREQQQPAPPTPVPVPV--------------------VLDQLS
        MPPRRGARRGGR GRGRGAGRVQ EVQPVAQA +PA PVTHADLAAMEQRFRDLIMQMREQQ+PA PTP P P                     V DQLS
Subjt:  MPPRRGARRGGREGRGRGAGRVQLEVQPVAQATNPAVPVTHADLAAMEQRFRDLIMQMREQQQPAPPTPVPVPV--------------------VLDQLS

Query:  VEAKHLRDFRKYNPTTFDRSLEDPTRAQIWLSYLETIFRYMKCPEDQKVQCAVFMLTDR-----------------------------------------
         EAKHLRDFRKYNPTTFD SLEDPTRAQ+WLS LETIFRYMKCPEDQKVQCAVFMLTDR                                         
Subjt:  VEAKHLRDFRKYNPTTFDRSLEDPTRAQIWLSYLETIFRYMKCPEDQKVQCAVFMLTDR-----------------------------------------

Query:  -------------VEQYDAEFDMLCRFVPEMIATEAARADKFVRGLRLDIQGW---FEP---SDPSLMPIHCAWQ----WISVYRRGLT---------RP
                     VEQYDAEFDML RF PEMIATEAARADKFVRGLRLDIQG    F P   +D   + +  + Q          RGLT         +P
Subjt:  -------------VEQYDAEFDMLCRFVPEMIATEAARADKFVRGLRLDIQGW---FEP---SDPSLMPIHCAWQ----WISVYRRGLT---------RP

Query:  --------RSGGESRRFQQKPFEAEETARGKPLCTTCGKHHLGRCLFGTKTCFKCRQEGHTADRCSMRLTGNVQNQGACALHQVKVFATNNTEAERAGMV
                RSGGE RRFQQKPFEA E AR KPLCT CGKHHLGRCLFGT+TCFKCRQEGHTADRC +RLTGN QNQGA A HQ +VFATN TEAE+AG V
Subjt:  --------RSGGESRRFQQKPFEAEETARGKPLCTTCGKHHLGRCLFGTKTCFKCRQEGHTADRCSMRLTGNVQNQGACALHQVKVFATNNTEAERAGMV

Query:  VTGTLPVLGHYDLVLFDSGSSHSFISSAFVLHARLEVEPLHHVLSVSTPSGECMWSKGKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHARI
        VTGTLPVLGHY LVLFD                           SVSTPSGECM SK KVK CQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAA+HA I
Subjt:  VTGTLPVLGHYDLVLFDSGSSHSFISSAFVLHARLEVEPLHHVLSVSTPSGECMWSKGKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHARI

Query:  DCSRKEVAFNPPSLASFKFKGEGSRPLSKVISAMRASKLLSQGTWSILASMVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREIEFAIELEPGTVTI
        DCSRKEV FNPPS ASFKFKG GSR L +VISA+RASKLLSQGTW ILAS+VDTRE DVSLSSEPVVRDYPDVFPEELPGLPPHRE+EFAIELEPGTV I
Subjt:  DCSRKEVAFNPPSLASFKFKGEGSRPLSKVISAMRASKLLSQGTWSILASMVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREIEFAIELEPGTVTI

Query:  SRAPYRMAPAELKELKVQLQELLDKGFIRPSMSPWGAPVLFVKKKDGSMRLCIDHRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK
        SRAPYRMAPAELKELKVQLQELLDKGFIRPS+SPWGAPVLFVKKKDGSMRLCID+RELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK
Subjt:  SRAPYRMAPAELKELKVQLQELLDKGFIRPSMSPWGAPVLFVKKKDGSMRLCIDHRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK

Query:  DGDVPKTTFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDIFVIVFIDDILIYSKTEAEHEEHLHMVLQTLRDNKLYAKFSKCEFWLKQVSFLGH
        D DVPKT FRS+YGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLD FVIVFIDDILIYSKTEAEHEEHL MVLQTLRDNKLYAKFSKCEFWLKQVSFLGH
Subjt:  DGDVPKTTFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDIFVIVFIDDILIYSKTEAEHEEHLHMVLQTLRDNKLYAKFSKCEFWLKQVSFLGH

Query:  VVSKVGVSVNPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKRAPFVWSKAC-------------------------------
        VVSK GVSV+PAKIEAVT W RPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRK APFVWSKAC                               
Subjt:  VVSKVGVSVNPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKRAPFVWSKAC-------------------------------

Query:  --------------------------------------------------------------------------------------------------KA
                                                                                                          KA
Subjt:  --------------------------------------------------------------------------------------------------KA

Query:  NVVADALSRKVSHSAALITRQAPLHRDLERAEISVSVGAVTMQLAQLTVQPTL--RIIDAQSNNPYLVDKRGLAKAWQAVEFSISYDGGLLFERRLCVPS
        NVVADALSRKVSHSAALITRQAPLHRDLERAEI+VSVGAVTMQLAQLTVQPTL  RIIDAQSN+PYLV+KRGLA+A QAVEFS+S DGGLLFERRLCVPS
Subjt:  NVVADALSRKVSHSAALITRQAPLHRDLERAEISVSVGAVTMQLAQLTVQPTL--RIIDAQSNNPYLVDKRGLAKAWQAVEFSISYDGGLLFERRLCVPS

Query:  DSAVKTKLLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQ----------------------------------------------
        DS VKT+LLSEAHSSPFSMHPGSTKMY+D+KRVYWWRNMKREVAEFVS+CLVCQ                                              
Subjt:  DSAVKTKLLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQ----------------------------------------------

Query:  ---------------------------------------------------------QAAMDTRLDFSTAFHTQTDGQTERLNQVLETMLRACALEFPGS
                                                                 Q AM TRLDFSTAFH QTDGQTERLNQVLE MLRACALEFPGS
Subjt:  ---------------------------------------------------------QAAMDTRLDFSTAFHTQTDGQTERLNQVLETMLRACALEFPGS

Query:  WDSYLHLMEFAYNNSFQATISMAPFEALYGKCFRSPVCWGEVGEQRLMGHELVQSTNEAIQKIRSRMQTAQSRQKSYADVRRKDLEFDVWDKVFLKVAPM
        WDS+LHLMEF YNNS+QATI MAPFEALYGKC RSPVCWGEVGEQRLMG ELVQSTNEAIQKIRSRM TAQSRQKSYADVRRKDLEF+V DKVFLKVAPM
Subjt:  WDSYLHLMEFAYNNSFQATISMAPFEALYGKCFRSPVCWGEVGEQRLMGHELVQSTNEAIQKIRSRMQTAQSRQKSYADVRRKDLEFDVWDKVFLKVAPM

Query:  KGVLRFERRGKLSPCFVGPFEILERIGPIAYRLALPPSLSVVHDVLHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKVLRNREIPLVKV
        +GVLRFERRGKLSP F+GPFEILERIGP+AYRLALPPSLS VHDV HVSMLRKYVPDPSHVVDY+PL+IDENLSYTEQPVEVLAREVK LRN+EIPLVKV
Subjt:  KGVLRFERRGKLSPCFVGPFEILERIGPIAYRLALPPSLSVVHDVLHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKVLRNREIPLVKV

Query:  LWQNHRIEEATWEREDDMRARYPKLFEE
        LW+NHR+EEATWEREDDM++RYP+L  E
Subjt:  LWQNHRIEEATWEREDDMRARYPKLFEE

A0A5A7V8L8 Pol protein0.0e+0073.11Show/hide
Query:  MPPRRGARRGGREGRGRGAGRVQLEVQPVAQATNPAVPVTHADLAAMEQRFRDLIMQMREQQQPAPPTPVPVPV----------------VLDQLSVEAK
        MPPRRGARRGGR GRGRGAGRVQLEVQPVAQA +PA PVTHADLAAMEQRFRDLIMQMREQQ+PA PTP P P                 V DQLS EAK
Subjt:  MPPRRGARRGGREGRGRGAGRVQLEVQPVAQATNPAVPVTHADLAAMEQRFRDLIMQMREQQQPAPPTPVPVPV----------------VLDQLSVEAK

Query:  HLRDFRKYNPTTFDRSLEDPTRAQIWLSYLETIFRYMKCPEDQKVQCAVFMLTDR--------------------------VEQYDAEFDMLCRFVPEMI
        HLRDFRKYNPTTFD SLEDPTRAQ+WLS LETIFRYMKCPE+QKVQCAVFMLTDR                          VEQYDAEFDML RF PEMI
Subjt:  HLRDFRKYNPTTFDRSLEDPTRAQIWLSYLETIFRYMKCPEDQKVQCAVFMLTDR--------------------------VEQYDAEFDMLCRFVPEMI

Query:  ATEAARADKFVRGLRLDIQGW---FEP---SDPSLMPIHCAWQ-------------WISVYRRGLTRP--------RSGGESRRFQQKPFEAEETARGKP
        ATEAA ADKFVRGLRLDIQG    F P   +D   + +  + Q                  R+   +P        RSGGE  RFQQKPFEA E ARGKP
Subjt:  ATEAARADKFVRGLRLDIQGW---FEP---SDPSLMPIHCAWQ-------------WISVYRRGLTRP--------RSGGESRRFQQKPFEAEETARGKP

Query:  LCTTCGKHHLGRCLFGTKTCFKCRQEGHTADRCSMRLTGNVQNQGACALHQVKVFATNNTEAERAGMVVTGTLPVLGHYDLVLFDSGSSHSFISSAFVLH
        LCTTCGKHHLGRCLFGT+TCFKCRQEGHTADRC +RLTGN QNQG  A HQ +VFATN TEAE+AG VVTGTLPVLGHY LVLFDSGSSHSFISSAFVLH
Subjt:  LCTTCGKHHLGRCLFGTKTCFKCRQEGHTADRCSMRLTGNVQNQGACALHQVKVFATNNTEAERAGMVVTGTLPVLGHYDLVLFDSGSSHSFISSAFVLH

Query:  ARLEVEPLHHVLSVSTPSGECMWSKGKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHARIDCSRKEVAFNPPSLASFKFKGEGSRPLSKVIS
        ARLEVEPLHHVLSVSTPSGECM SK KVKACQIEIA HVIEVTL+VLDMLDFDVILGMDWL ANHA IDCSRKEV FNPPS+ASF+ KG GS+ L +VIS
Subjt:  ARLEVEPLHHVLSVSTPSGECMWSKGKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHARIDCSRKEVAFNPPSLASFKFKGEGSRPLSKVIS

Query:  AMRASKLLSQGTWSILASMVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREIEFAIELEPGTVTISRAPYRMAPAELKELKVQLQELLDKGFIRPSM
        A+RASKLLSQGTW IL S+VDTRE DVSLSSEPVVRDYPDVFPEELPGLP HRE+EFAIELEPGTV ISRAPYRMAPAELKELKVQLQELLDKGFIRPS+
Subjt:  AMRASKLLSQGTWSILASMVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREIEFAIELEPGTVTISRAPYRMAPAELKELKVQLQELLDKGFIRPSM

Query:  SPWGAPVLFVKKKDGSMRLCIDHRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTTFRSKYGHYEFIVMSFGLTNAPAVF
        SPWGAPVLFVKKKDGSMRLCID+RELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKD DVPKT FRS+YGHYEFIVMSFGLTNAPAVF
Subjt:  SPWGAPVLFVKKKDGSMRLCIDHRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTTFRSKYGHYEFIVMSFGLTNAPAVF

Query:  MDLMNRVFREFLDIFVIVFIDDILIYSKTEAEHEEHLHMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVGVSVNPAKIEAVTSWPRPSTVSEVRSF
        MDLMNRVFREFLD FVIVFIDDILIYSKTEAEHEEHL MVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSK GVSV+PAKIEAVT W RPST+SEVRSF
Subjt:  MDLMNRVFREFLDIFVIVFIDDILIYSKTEAEHEEHLHMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVGVSVNPAKIEAVTSWPRPSTVSEVRSF

Query:  LGLAGYYRRFVENFSRIATPLTQLTRKRAPFVWSKAC---------------------------------------------------------------
        LGLAGYYRRFVENFSRIATPLTQLTRK APFVWSKAC                                                               
Subjt:  LGLAGYYRRFVENFSRIATPLTQLTRKRAPFVWSKAC---------------------------------------------------------------

Query:  ----------------------------KANVVADALSRKVSHSAALITRQAPLHRDLERAEISVSVGAVTMQLAQLTVQPTL--RIIDAQSNNPYLVDK
                                    KANVVADALSRKVSHSAALITRQAPLHRDLERAEI+VSVGAVTMQLAQLTVQPTL  RIIDAQSN+PYLV+K
Subjt:  ----------------------------KANVVADALSRKVSHSAALITRQAPLHRDLERAEISVSVGAVTMQLAQLTVQPTL--RIIDAQSNNPYLVDK

Query:  RGLAKAWQAVEFSISYDGGLLFERRLCVPSDSAVKTKLLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQ----------------
        RGLA+A QA EFS+S DGGLLFERRLCVPSDSAVKT+LLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQ                
Subjt:  RGLAKAWQAVEFSISYDGGLLFERRLCVPSDSAVKTKLLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQ----------------

Query:  ---------------------------------------------------------------------------------------QAAMDTRLDFSTA
                                                                                               Q AM TRLDFSTA
Subjt:  ---------------------------------------------------------------------------------------QAAMDTRLDFSTA

Query:  FHTQTDGQTERLNQVLETMLRACALEFPGSWDSYLHLMEFAYNNSFQATISMAPFEALYGKCFRSPVCWGEVGEQRLMGHELVQSTNEAIQKIRSRMQTA
        FH QTDGQTERLNQVLE MLRACALEFPGSWDS+LHLMEFAYNNS+QATI MAPFEALYGKC +SPVCWGEVGEQRLMG ELVQSTNEAIQKIRSRM TA
Subjt:  FHTQTDGQTERLNQVLETMLRACALEFPGSWDSYLHLMEFAYNNSFQATISMAPFEALYGKCFRSPVCWGEVGEQRLMGHELVQSTNEAIQKIRSRMQTA

Query:  QSRQKSYADVRRKDLEFDVWDKVFLKVAPMKGVLRFERRGKLSPCFVGPFEILERIGPIAYRLALPPSLSVVHDVLHVSMLRKYVPDPSHVVDYEPLEID
        QSRQKSYADVRRKDLEF+V DKVFLKVAPM+GVLRFERRGKLSP FVGPFEILERIGPIAYRLALPPSLS VHDV HVSMLRKYVPDPSHVVDYEPLEID
Subjt:  QSRQKSYADVRRKDLEFDVWDKVFLKVAPMKGVLRFERRGKLSPCFVGPFEILERIGPIAYRLALPPSLSVVHDVLHVSMLRKYVPDPSHVVDYEPLEID

Query:  ENLSYTEQPVEVLAREVKVLRNREIPLVKVLWQNHRIEEATWEREDDMRARYPKLFEE
        ENLSY EQPVEVLAREVK LRN+EIPLVKVLW+NHR+EEATWEREDDMR+RYP LFEE
Subjt:  ENLSYTEQPVEVLAREVKVLRNREIPLVKVLWQNHRIEEATWEREDDMRARYPKLFEE

A0A5A7V8X5 Pol protein0.0e+0069.82Show/hide
Query:  MPPRRGARRGGREGRGRGAGRVQLEVQPVAQATNPAVPVTHADLAAMEQRFRDLIMQMREQQQPAPPTPVPVP--------------------VVLDQLS
        MPPRRGARRGGR GRGRGAGRVQ EVQPVAQAT+PA PVTHADLAAMEQRFRDLIMQMREQQQPAPP P P P                    VV DQLS
Subjt:  MPPRRGARRGGREGRGRGAGRVQLEVQPVAQATNPAVPVTHADLAAMEQRFRDLIMQMREQQQPAPPTPVPVP--------------------VVLDQLS

Query:  VEAKHLRDFRKYNPTTFDRSLEDPTRAQIWLSYLETIFRYMKCPEDQKVQCAVFMLTDR-----------------------------------------
         EAKHLRDFRKYNPTTFD SLEDPTRAQ+WLS LETIF+YMKCPEDQKVQCA+FMLTDR                                         
Subjt:  VEAKHLRDFRKYNPTTFDRSLEDPTRAQIWLSYLETIFRYMKCPEDQKVQCAVFMLTDR-----------------------------------------

Query:  -------------VEQYDAEFDMLCRFVPEMIATEAARADKFVRGLRLDIQGW---FEP---SDPSLMPIHCAWQ-------------WISVYRRGLTRP
                     VEQYDAEFDML RF PEMIATEAARADKFVRGLRLDIQG    F P   +D   + +  + Q                  R+   +P
Subjt:  -------------VEQYDAEFDMLCRFVPEMIATEAARADKFVRGLRLDIQGW---FEP---SDPSLMPIHCAWQ-------------WISVYRRGLTRP

Query:  --------RSGGESRRFQQKPFEAEETARGKPLCTTCGKHHLGRCLFGTKTCFKCRQEGHTADRCSMRLTGNVQNQGACALHQVKVFATNNTEAERAGMV
                RSGGE RRFQQKPFEA E AR KPLCTTCGKHHLGRCLFGT+TCFKCRQEGHTADRC +RLTGN QNQGA   HQ +VFATN TEAERAG V
Subjt:  --------RSGGESRRFQQKPFEAEETARGKPLCTTCGKHHLGRCLFGTKTCFKCRQEGHTADRCSMRLTGNVQNQGACALHQVKVFATNNTEAERAGMV

Query:  VTGTLPVLGHYDLVLFDSGSSHSFISSAFVLHARLEVEPLHHVLSVSTPSGECMWSKGKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHARI
        VTGTLPVLGHY LVLFDSGSSHSFISSAFVLHARLE                            IEIAGHVI+VTLLVLDMLDFDVILGMDWLAANHA I
Subjt:  VTGTLPVLGHYDLVLFDSGSSHSFISSAFVLHARLEVEPLHHVLSVSTPSGECMWSKGKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHARI

Query:  DCSRKEVAFNPPSLASFKFKGEGSRPLSKVISAMRASKLLSQGTWSILASMVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREIEFAIELEPGTVTI
        DCSRKEV FNPPS+ASFKFK  GSR L +VISA+RASKLLSQGTW ILAS+VDTREVDVSLSSEPV+RDYPDVFPEELPGLPPHRE+EFAIELE GTV I
Subjt:  DCSRKEVAFNPPSLASFKFKGEGSRPLSKVISAMRASKLLSQGTWSILASMVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREIEFAIELEPGTVTI

Query:  SRAPYRMAPAELKELKVQLQELLDKGFIRPSMSPWGAPVLFVKKKDGSMRLCIDHRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK
        SRAPYRMAP ELKELKVQLQELLDKGFIRPS+SPWGAPVLFVKKKDGSMRLCID+RELNKVT+KNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK
Subjt:  SRAPYRMAPAELKELKVQLQELLDKGFIRPSMSPWGAPVLFVKKKDGSMRLCIDHRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIK

Query:  DGDVPKTTFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDIFVIVFIDDILIYSKTEAEHEEHLHMVLQTLRDNKLYAKFSKCEFWLKQVSFLGH
        DGDVPKT F S+YGH EFIVMSFGLTNAPAVFMDLMNRVFR+FLD FVIVFIDDILIYSKTEAEHEEHL +VLQTLRDNKLYAKFSKCEFWLKQVSFLGH
Subjt:  DGDVPKTTFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDIFVIVFIDDILIYSKTEAEHEEHLHMVLQTLRDNKLYAKFSKCEFWLKQVSFLGH

Query:  VVSKVGVSVNPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKRAPFVWSKAC-------------------------------
        VVSK GVSV+PAKIEAVT W RPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRK APFVWSKAC                               
Subjt:  VVSKVGVSVNPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKRAPFVWSKAC-------------------------------

Query:  ------------------------------------------------------KANVVADALSRKVSHSAALITRQAPLHRDLERAEISVSVGAVTMQL
                                                              KANVVAD LSRKVSHSAALITRQAPLHRDLERAEI+VS+GAVTMQL
Subjt:  ------------------------------------------------------KANVVADALSRKVSHSAALITRQAPLHRDLERAEISVSVGAVTMQL

Query:  AQLTVQPTL--RIIDAQSNNPYLVDKRGLAKAWQAVEFSISYDGGLLFERRLCVPSDSAVKTKLLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVA
        AQLTVQPTL  RIIDAQSN+PYLV+KRGLA+  QAVEFSIS DGGLLFERRLCVPSDSAVKT+LLSEAHSSPFSMHPGS KMYQ+LKRVYWWRNMKREVA
Subjt:  AQLTVQPTL--RIIDAQSNNPYLVDKRGLAKAWQAVEFSISYDGGLLFERRLCVPSDSAVKTKLLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVA

Query:  EFVSKCLVCQ------------------------------------------------------------------------------------------
        EFVS+CLVCQ                                                                                          
Subjt:  EFVSKCLVCQ------------------------------------------------------------------------------------------

Query:  -------------QAAMDTRLDFSTAFHTQTDGQTERLNQVLETMLRACALEFPGSWDSYLHLMEFAYNNSFQATISMAPFEALYGKCFRSPVCWGEVGE
                     Q AM TRLDFSTAFH QTDGQTERLNQVLE MLRACALEFPGSWDS+LHLMEFAYNNS+QATI MAPFE LYGKC RSPVCWGEVGE
Subjt:  -------------QAAMDTRLDFSTAFHTQTDGQTERLNQVLETMLRACALEFPGSWDSYLHLMEFAYNNSFQATISMAPFEALYGKCFRSPVCWGEVGE

Query:  QRLMGHELVQSTNEAIQKIRSRMQTAQSRQKSYADVRRKDLEFDVWDKVFLKVAPMKGVLRFERRGKLSPCFVGPFEILERIGPIAYRLALPPSLSVVHD
        QRLMG ELVQSTNEAIQKIRSRM TAQSRQKSYADVRRKDLEF+V DKVFLKVAPM+GVLRFERRGKLSP FVGPFEILERIGP+AYRLALPPSLS VHD
Subjt:  QRLMGHELVQSTNEAIQKIRSRMQTAQSRQKSYADVRRKDLEFDVWDKVFLKVAPMKGVLRFERRGKLSPCFVGPFEILERIGPIAYRLALPPSLSVVHD

Query:  VLHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKVLRNREIPLVKVLWQNHRIEEATWEREDDMRARYPKL
        V HVSMLRKYVPDPSHVVDYEPLEIDEN SYTEQPVEVLAREVK LRN+EIPLVKVLW+NHR+EEATWEREDDMR+R  +L
Subjt:  VLHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKVLRNREIPLVKVLWQNHRIEEATWEREDDMRARYPKL

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein3.2e-6623.8Show/hide
Query:  EELPGLPPHREIEFAIELEPGTVTISRAPYRMAPAELKELKVQLQELLDKGFIRPSMSPWGAPVLFVKKKDGSMRLCIDHRELNKVTVKNRYPLPRIDDL
        E+LP   P + +EF +EL      +    Y + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +D++ LNK    N YPLP I+ L
Subjt:  EELPGLPPHREIEFAIELEPGTVTISRAPYRMAPAELKELKVQLQELLDKGFIRPSMSPWGAPVLFVKKKDGSMRLCIDHRELNKVTVKNRYPLPRIDDL

Query:  FDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTTFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDIFVIVFIDDILIYSKTEAEHEEHLHMVLQT
          ++QG+T+F+K+DL+S YH +R++ GD  K  FR   G +E++VM +G++ APA F   +N +  E  +  V+ ++DDILI+SK+E+EH +H+  VLQ 
Subjt:  FDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTTFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDIFVIVFIDDILIYSKTEAEHEEHLHMVLQT

Query:  LRDNKLYAKFSKCEFWLKQVSFLGHVVSKVGVSVNPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKRAPFVWS-------KA
        L++  L    +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+       + 
Subjt:  LRDNKLYAKFSKCEFWLKQVSFLGHVVSKVGVSVNPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKRAPFVWS-------KA

Query:  CKANVVADALSRKVSHSAALITRQ-----------APLHRD------------LERAEISVSVGAVTM-----------QLAQLTVQP------------
         K  +V+  + R    S  ++              +  H D            + +A+++ SV    M              + T++P            
Subjt:  CKANVVADALSRKVSHSAALITRQ-----------APLHRD------------LERAEISVSVGAVTM-----------QLAQLTVQP------------

Query:  ----------------------------------------TLRII--------DAQSNNPYLVDKRGLAKAW----------------------QAVEFS
                                                  RI+        D++ N+   V++  +   +                      + VE +
Subjt:  ----------------------------------------TLRII--------DAQSNNPYLVDKRGLAKAW----------------------QAVEFS

Query:  ISYDGGLLFERR--LCVPSDSAVKTKLLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQ---------------------------
        I    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ                           
Subjt:  ISYDGGLLFERR--LCVPSDSAVKTKLLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQ---------------------------

Query:  ---------------------------------------QAA--MDTR----------------------------------LDFSTAFHTQTDGQTERL
                                               Q A   D R                                  + FS  +  QTDGQTER 
Subjt:  ---------------------------------------QAA--MDTR----------------------------------LDFSTAFHTQTDGQTERL

Query:  NQVLETMLRACALEFPGSWDSYLHLMEFAYNNSFQATISMAPFEALYGKCFRSPVCWGEVGEQRLMGHELVQSTNEAIQKIRSRMQTAQSRQKSYADVRR
        NQ +E +LR      P +W  ++ L++ +YNN+  +   M PFE ++   +   +   E+        E  Q T +  Q ++  + T   + K Y D++ 
Subjt:  NQVLETMLRACALEFPGSWDSYLHLMEFAYNNSFQATISMAPFEALYGKCFRSPVCWGEVGEQRLMGHELVQSTNEAIQKIRSRMQTAQSRQKSYADVRR

Query:  KDL-EFDVWDKVFLKVAPMKGVLRFERRGKLSPCFVGPFEILERIGPIAYRLALPPSLS-VVHDVLHVSMLRKY
        +++ EF   D V +K     G L   +  KL+P F GPF +L++ GP  Y L LP S+  +     HVS L KY
Subjt:  KDL-EFDVWDKVFLKVAPMKGVLRFERRGKLSPCFVGPFEILERIGPIAYRLALPPSLS-VVHDVLHVSMLRKY

P0CT35 Transposon Tf2-2 polyprotein3.2e-6623.8Show/hide
Query:  EELPGLPPHREIEFAIELEPGTVTISRAPYRMAPAELKELKVQLQELLDKGFIRPSMSPWGAPVLFVKKKDGSMRLCIDHRELNKVTVKNRYPLPRIDDL
        E+LP   P + +EF +EL      +    Y + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +D++ LNK    N YPLP I+ L
Subjt:  EELPGLPPHREIEFAIELEPGTVTISRAPYRMAPAELKELKVQLQELLDKGFIRPSMSPWGAPVLFVKKKDGSMRLCIDHRELNKVTVKNRYPLPRIDDL

Query:  FDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTTFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDIFVIVFIDDILIYSKTEAEHEEHLHMVLQT
          ++QG+T+F+K+DL+S YH +R++ GD  K  FR   G +E++VM +G++ APA F   +N +  E  +  V+ ++DDILI+SK+E+EH +H+  VLQ 
Subjt:  FDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTTFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDIFVIVFIDDILIYSKTEAEHEEHLHMVLQT

Query:  LRDNKLYAKFSKCEFWLKQVSFLGHVVSKVGVSVNPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKRAPFVWS-------KA
        L++  L    +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+       + 
Subjt:  LRDNKLYAKFSKCEFWLKQVSFLGHVVSKVGVSVNPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKRAPFVWS-------KA

Query:  CKANVVADALSRKVSHSAALITRQ-----------APLHRD------------LERAEISVSVGAVTM-----------QLAQLTVQP------------
         K  +V+  + R    S  ++              +  H D            + +A+++ SV    M              + T++P            
Subjt:  CKANVVADALSRKVSHSAALITRQ-----------APLHRD------------LERAEISVSVGAVTM-----------QLAQLTVQP------------

Query:  ----------------------------------------TLRII--------DAQSNNPYLVDKRGLAKAW----------------------QAVEFS
                                                  RI+        D++ N+   V++  +   +                      + VE +
Subjt:  ----------------------------------------TLRII--------DAQSNNPYLVDKRGLAKAW----------------------QAVEFS

Query:  ISYDGGLLFERR--LCVPSDSAVKTKLLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQ---------------------------
        I    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ                           
Subjt:  ISYDGGLLFERR--LCVPSDSAVKTKLLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQ---------------------------

Query:  ---------------------------------------QAA--MDTR----------------------------------LDFSTAFHTQTDGQTERL
                                               Q A   D R                                  + FS  +  QTDGQTER 
Subjt:  ---------------------------------------QAA--MDTR----------------------------------LDFSTAFHTQTDGQTERL

Query:  NQVLETMLRACALEFPGSWDSYLHLMEFAYNNSFQATISMAPFEALYGKCFRSPVCWGEVGEQRLMGHELVQSTNEAIQKIRSRMQTAQSRQKSYADVRR
        NQ +E +LR      P +W  ++ L++ +YNN+  +   M PFE ++   +   +   E+        E  Q T +  Q ++  + T   + K Y D++ 
Subjt:  NQVLETMLRACALEFPGSWDSYLHLMEFAYNNSFQATISMAPFEALYGKCFRSPVCWGEVGEQRLMGHELVQSTNEAIQKIRSRMQTAQSRQKSYADVRR

Query:  KDL-EFDVWDKVFLKVAPMKGVLRFERRGKLSPCFVGPFEILERIGPIAYRLALPPSLS-VVHDVLHVSMLRKY
        +++ EF   D V +K     G L   +  KL+P F GPF +L++ GP  Y L LP S+  +     HVS L KY
Subjt:  KDL-EFDVWDKVFLKVAPMKGVLRFERRGKLSPCFVGPFEILERIGPIAYRLALPPSLS-VVHDVLHVSMLRKY

P0CT36 Transposon Tf2-3 polyprotein3.2e-6623.8Show/hide
Query:  EELPGLPPHREIEFAIELEPGTVTISRAPYRMAPAELKELKVQLQELLDKGFIRPSMSPWGAPVLFVKKKDGSMRLCIDHRELNKVTVKNRYPLPRIDDL
        E+LP   P + +EF +EL      +    Y + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +D++ LNK    N YPLP I+ L
Subjt:  EELPGLPPHREIEFAIELEPGTVTISRAPYRMAPAELKELKVQLQELLDKGFIRPSMSPWGAPVLFVKKKDGSMRLCIDHRELNKVTVKNRYPLPRIDDL

Query:  FDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTTFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDIFVIVFIDDILIYSKTEAEHEEHLHMVLQT
          ++QG+T+F+K+DL+S YH +R++ GD  K  FR   G +E++VM +G++ APA F   +N +  E  +  V+ ++DDILI+SK+E+EH +H+  VLQ 
Subjt:  FDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTTFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDIFVIVFIDDILIYSKTEAEHEEHLHMVLQT

Query:  LRDNKLYAKFSKCEFWLKQVSFLGHVVSKVGVSVNPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKRAPFVWS-------KA
        L++  L    +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+       + 
Subjt:  LRDNKLYAKFSKCEFWLKQVSFLGHVVSKVGVSVNPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKRAPFVWS-------KA

Query:  CKANVVADALSRKVSHSAALITRQ-----------APLHRD------------LERAEISVSVGAVTM-----------QLAQLTVQP------------
         K  +V+  + R    S  ++              +  H D            + +A+++ SV    M              + T++P            
Subjt:  CKANVVADALSRKVSHSAALITRQ-----------APLHRD------------LERAEISVSVGAVTM-----------QLAQLTVQP------------

Query:  ----------------------------------------TLRII--------DAQSNNPYLVDKRGLAKAW----------------------QAVEFS
                                                  RI+        D++ N+   V++  +   +                      + VE +
Subjt:  ----------------------------------------TLRII--------DAQSNNPYLVDKRGLAKAW----------------------QAVEFS

Query:  ISYDGGLLFERR--LCVPSDSAVKTKLLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQ---------------------------
        I    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ                           
Subjt:  ISYDGGLLFERR--LCVPSDSAVKTKLLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQ---------------------------

Query:  ---------------------------------------QAA--MDTR----------------------------------LDFSTAFHTQTDGQTERL
                                               Q A   D R                                  + FS  +  QTDGQTER 
Subjt:  ---------------------------------------QAA--MDTR----------------------------------LDFSTAFHTQTDGQTERL

Query:  NQVLETMLRACALEFPGSWDSYLHLMEFAYNNSFQATISMAPFEALYGKCFRSPVCWGEVGEQRLMGHELVQSTNEAIQKIRSRMQTAQSRQKSYADVRR
        NQ +E +LR      P +W  ++ L++ +YNN+  +   M PFE ++   +   +   E+        E  Q T +  Q ++  + T   + K Y D++ 
Subjt:  NQVLETMLRACALEFPGSWDSYLHLMEFAYNNSFQATISMAPFEALYGKCFRSPVCWGEVGEQRLMGHELVQSTNEAIQKIRSRMQTAQSRQKSYADVRR

Query:  KDL-EFDVWDKVFLKVAPMKGVLRFERRGKLSPCFVGPFEILERIGPIAYRLALPPSLS-VVHDVLHVSMLRKY
        +++ EF   D V +K     G L   +  KL+P F GPF +L++ GP  Y L LP S+  +     HVS L KY
Subjt:  KDL-EFDVWDKVFLKVAPMKGVLRFERRGKLSPCFVGPFEILERIGPIAYRLALPPSLS-VVHDVLHVSMLRKY

P0CT37 Transposon Tf2-4 polyprotein3.2e-6623.8Show/hide
Query:  EELPGLPPHREIEFAIELEPGTVTISRAPYRMAPAELKELKVQLQELLDKGFIRPSMSPWGAPVLFVKKKDGSMRLCIDHRELNKVTVKNRYPLPRIDDL
        E+LP   P + +EF +EL      +    Y + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +D++ LNK    N YPLP I+ L
Subjt:  EELPGLPPHREIEFAIELEPGTVTISRAPYRMAPAELKELKVQLQELLDKGFIRPSMSPWGAPVLFVKKKDGSMRLCIDHRELNKVTVKNRYPLPRIDDL

Query:  FDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTTFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDIFVIVFIDDILIYSKTEAEHEEHLHMVLQT
          ++QG+T+F+K+DL+S YH +R++ GD  K  FR   G +E++VM +G++ APA F   +N +  E  +  V+ ++DDILI+SK+E+EH +H+  VLQ 
Subjt:  FDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTTFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDIFVIVFIDDILIYSKTEAEHEEHLHMVLQT

Query:  LRDNKLYAKFSKCEFWLKQVSFLGHVVSKVGVSVNPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKRAPFVWS-------KA
        L++  L    +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+       + 
Subjt:  LRDNKLYAKFSKCEFWLKQVSFLGHVVSKVGVSVNPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKRAPFVWS-------KA

Query:  CKANVVADALSRKVSHSAALITRQ-----------APLHRD------------LERAEISVSVGAVTM-----------QLAQLTVQP------------
         K  +V+  + R    S  ++              +  H D            + +A+++ SV    M              + T++P            
Subjt:  CKANVVADALSRKVSHSAALITRQ-----------APLHRD------------LERAEISVSVGAVTM-----------QLAQLTVQP------------

Query:  ----------------------------------------TLRII--------DAQSNNPYLVDKRGLAKAW----------------------QAVEFS
                                                  RI+        D++ N+   V++  +   +                      + VE +
Subjt:  ----------------------------------------TLRII--------DAQSNNPYLVDKRGLAKAW----------------------QAVEFS

Query:  ISYDGGLLFERR--LCVPSDSAVKTKLLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQ---------------------------
        I    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ                           
Subjt:  ISYDGGLLFERR--LCVPSDSAVKTKLLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQ---------------------------

Query:  ---------------------------------------QAA--MDTR----------------------------------LDFSTAFHTQTDGQTERL
                                               Q A   D R                                  + FS  +  QTDGQTER 
Subjt:  ---------------------------------------QAA--MDTR----------------------------------LDFSTAFHTQTDGQTERL

Query:  NQVLETMLRACALEFPGSWDSYLHLMEFAYNNSFQATISMAPFEALYGKCFRSPVCWGEVGEQRLMGHELVQSTNEAIQKIRSRMQTAQSRQKSYADVRR
        NQ +E +LR      P +W  ++ L++ +YNN+  +   M PFE ++   +   +   E+        E  Q T +  Q ++  + T   + K Y D++ 
Subjt:  NQVLETMLRACALEFPGSWDSYLHLMEFAYNNSFQATISMAPFEALYGKCFRSPVCWGEVGEQRLMGHELVQSTNEAIQKIRSRMQTAQSRQKSYADVRR

Query:  KDL-EFDVWDKVFLKVAPMKGVLRFERRGKLSPCFVGPFEILERIGPIAYRLALPPSLS-VVHDVLHVSMLRKY
        +++ EF   D V +K     G L   +  KL+P F GPF +L++ GP  Y L LP S+  +     HVS L KY
Subjt:  KDL-EFDVWDKVFLKVAPMKGVLRFERRGKLSPCFVGPFEILERIGPIAYRLALPPSLS-VVHDVLHVSMLRKY

P0CT41 Transposon Tf2-12 polyprotein3.2e-6623.8Show/hide
Query:  EELPGLPPHREIEFAIELEPGTVTISRAPYRMAPAELKELKVQLQELLDKGFIRPSMSPWGAPVLFVKKKDGSMRLCIDHRELNKVTVKNRYPLPRIDDL
        E+LP   P + +EF +EL      +    Y + P +++ +  ++ + L  G IR S +    PV+FV KK+G++R+ +D++ LNK    N YPLP I+ L
Subjt:  EELPGLPPHREIEFAIELEPGTVTISRAPYRMAPAELKELKVQLQELLDKGFIRPSMSPWGAPVLFVKKKDGSMRLCIDHRELNKVTVKNRYPLPRIDDL

Query:  FDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTTFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDIFVIVFIDDILIYSKTEAEHEEHLHMVLQT
          ++QG+T+F+K+DL+S YH +R++ GD  K  FR   G +E++VM +G++ APA F   +N +  E  +  V+ ++DDILI+SK+E+EH +H+  VLQ 
Subjt:  FDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTTFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDIFVIVFIDDILIYSKTEAEHEEHLHMVLQT

Query:  LRDNKLYAKFSKCEFWLKQVSFLGHVVSKVGVSVNPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKRAPFVWS-------KA
        L++  L    +KCEF   QV F+G+ +S+ G +     I+ V  W +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+       + 
Subjt:  LRDNKLYAKFSKCEFWLKQVSFLGHVVSKVGVSVNPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKRAPFVWS-------KA

Query:  CKANVVADALSRKVSHSAALITRQ-----------APLHRD------------LERAEISVSVGAVTM-----------QLAQLTVQP------------
         K  +V+  + R    S  ++              +  H D            + +A+++ SV    M              + T++P            
Subjt:  CKANVVADALSRKVSHSAALITRQ-----------APLHRD------------LERAEISVSVGAVTM-----------QLAQLTVQP------------

Query:  ----------------------------------------TLRII--------DAQSNNPYLVDKRGLAKAW----------------------QAVEFS
                                                  RI+        D++ N+   V++  +   +                      + VE +
Subjt:  ----------------------------------------TLRII--------DAQSNNPYLVDKRGLAKAW----------------------QAVEFS

Query:  ISYDGGLLFERR--LCVPSDSAVKTKLLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQ---------------------------
        I    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ                           
Subjt:  ISYDGGLLFERR--LCVPSDSAVKTKLLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQ---------------------------

Query:  ---------------------------------------QAA--MDTR----------------------------------LDFSTAFHTQTDGQTERL
                                               Q A   D R                                  + FS  +  QTDGQTER 
Subjt:  ---------------------------------------QAA--MDTR----------------------------------LDFSTAFHTQTDGQTERL

Query:  NQVLETMLRACALEFPGSWDSYLHLMEFAYNNSFQATISMAPFEALYGKCFRSPVCWGEVGEQRLMGHELVQSTNEAIQKIRSRMQTAQSRQKSYADVRR
        NQ +E +LR      P +W  ++ L++ +YNN+  +   M PFE ++   +   +   E+        E  Q T +  Q ++  + T   + K Y D++ 
Subjt:  NQVLETMLRACALEFPGSWDSYLHLMEFAYNNSFQATISMAPFEALYGKCFRSPVCWGEVGEQRLMGHELVQSTNEAIQKIRSRMQTAQSRQKSYADVRR

Query:  KDL-EFDVWDKVFLKVAPMKGVLRFERRGKLSPCFVGPFEILERIGPIAYRLALPPSLS-VVHDVLHVSMLRKY
        +++ EF   D V +K     G L   +  KL+P F GPF +L++ GP  Y L LP S+  +     HVS L KY
Subjt:  KDL-EFDVWDKVFLKVAPMKGVLRFERRGKLSPCFVGPFEILERIGPIAYRLALPPSLS-VVHDVLHVSMLRKY

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein1.5e-2148.96Show/hide
Query:  HLHMVLQTLRDNKLYAKFSKCEFWLKQVSFLG--HVVSKVGVSVNPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKRA
        HL MVLQ    ++ YA   KC F   Q+++LG  H++S  GVS +PAK+EA+  WP P   +E+R FLGL GYYRRFV+N+ +I  PLT+L +K +
Subjt:  HLHMVLQTLRDNKLYAKFSKCEFWLKQVSFLG--HVVSKVGVSVNPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKRA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGCCAAGGAGAGGTGCACGTAGGGGTGGCCGAGAAGGTCGAGGGAGGGGAGCAGGACGTGTTCAACTTGAGGTGCAGCCTGTAGCCCAAGCCACCAACCCGGCTGT
GCCAGTTACTCATGCGGACCTCGCTGCTATGGAGCAGAGGTTTAGGGATTTGATTATGCAGATGCGGGAGCAGCAGCAGCCTGCCCCGCCAACTCCTGTTCCAGTTCCAG
TCGTGCTGGATCAGTTGTCGGTAGAGGCCAAGCACTTGAGGGATTTCAGGAAGTATAACCCCACGACATTCGATAGGTCTTTGGAGGACCCCACCAGGGCTCAGATTTGG
TTATCTTATTTGGAGACCATATTTCGGTATATGAAGTGCCCTGAGGATCAGAAAGTTCAGTGTGCTGTTTTCATGTTGACAGACAGAGTAGAGCAGTATGATGCGGAGTT
TGACATGTTATGCCGTTTCGTTCCCGAGATGATAGCGACTGAGGCAGCCAGGGCTGATAAGTTTGTTAGAGGCCTCAGGCTAGACATCCAGGGTTGGTTCGAGCCTTCCG
ACCCGTCACTCATGCCAATACACTGCGCTTGGCAGTGGATCTCAGTTTACAGGAGAGGGCTAACTCGTCCAAGATCAGGTGGTGAGTCTCGCCGTTTCCAGCAGAAACCT
TTTGAGGCAGAGGAAACTGCCAGAGGGAAGCCGTTGTGTACCACTTGCGGGAAGCACCATCTGGGCCGTTGTTTATTTGGGACCAAGACTTGCTTCAAGTGTAGGCAAGA
GGGGCATACCGCTGACAGATGCTCGATGAGGCTTACCGGAAATGTACAGAATCAGGGAGCATGTGCTCTACATCAGGTTAAAGTCTTTGCTACCAATAATACTGAGGCTG
AGAGAGCAGGCATGGTAGTGACAGGTACGCTTCCAGTGTTGGGGCATTATGATCTAGTTTTGTTTGATTCGGGTTCGTCACATTCCTTTATCTCTTCTGCATTTGTGTTG
CATGCCCGCTTAGAGGTAGAGCCCCTACACCATGTTTTATCAGTATCTACTCCTTCTGGGGAGTGTATGTGGTCGAAGGGAAAGGTGAAAGCATGCCAGATTGAGATAGC
AGGCCATGTGATTGAAGTAACGTTGTTGGTCCTGGACATGCTCGACTTTGATGTAATTCTGGGTATGGATTGGTTAGCCGCTAACCATGCCAGAATAGATTGTTCCCGTA
AGGAGGTAGCGTTTAACCCTCCCTCGTTGGCCAGTTTTAAATTTAAGGGAGAAGGGTCAAGACCGTTGTCTAAGGTAATCTCAGCCATGAGGGCCAGCAAATTGCTTAGT
CAGGGTACTTGGAGTATCTTAGCGAGCATGGTGGATACTAGAGAGGTTGATGTATCTCTGTCATCAGAACCAGTGGTAAGGGACTATCCGGATGTCTTTCCTGAAGAACT
TCCAGGGTTACCTCCTCACAGAGAGATTGAGTTTGCCATAGAGCTGGAGCCAGGCACGGTTACTATATCCAGAGCCCCATACAGAATGGCCCCAGCAGAATTGAAAGAGC
TGAAAGTGCAGTTACAGGAGTTGCTTGATAAAGGCTTCATTCGACCGAGTATGTCGCCTTGGGGTGCGCCAGTTTTATTTGTTAAGAAGAAGGATGGATCGATGCGCCTA
TGTATTGACCATAGGGAGTTGAATAAGGTAACCGTTAAGAACAGATATCCCTTGCCCAGGATCGACGATCTGTTTGACCAGTTACAAGGAGCTACAGTGTTCTCTAAGAT
TGATCTTCGGTCGGGATATCATCAACTGAGGATTAAGGATGGTGATGTACCGAAGACAACCTTTCGATCCAAATACGGACACTATGAGTTTATTGTGATGTCTTTTGGTT
TGACAAATGCTCCGGCAGTGTTTATGGACTTGATGAACAGAGTGTTTAGGGAGTTCCTAGACATTTTTGTGATCGTGTTTATTGATGATATTTTGATATATTCCAAGACA
GAGGCCGAGCATGAGGAGCATTTACATATGGTTCTGCAAACCCTTCGGGATAATAAATTGTATGCAAAGTTCTCGAAATGTGAGTTTTGGTTGAAGCAGGTGTCCTTTCT
AGGCCATGTGGTTTCTAAGGTTGGAGTTTCTGTGAATCCAGCAAAGATAGAGGCAGTCACTAGTTGGCCCCGACCTTCCACAGTCAGTGAGGTTCGTAGCTTTTTAGGTT
TAGCAGGTTATTATCGACGGTTTGTGGAGAACTTTTCCCGTATAGCTACTCCTCTTACTCAGTTGACCAGGAAGAGAGCTCCTTTTGTTTGGAGCAAGGCATGCAAGGCA
AATGTGGTAGCTGATGCTCTTAGTAGAAAGGTATCACATTCAGCAGCACTTATTACCCGACAGGCCCCCTTGCATCGAGATCTTGAGAGGGCTGAGATTTCAGTGTCAGT
AGGGGCAGTCACTATGCAGTTAGCCCAGTTGACAGTACAACCGACATTGAGGATCATTGATGCTCAGAGTAACAATCCTTATTTGGTTGATAAGCGTGGCCTAGCAAAGG
CATGGCAAGCTGTTGAGTTCTCTATATCGTATGATGGTGGACTGTTGTTTGAGAGGCGTCTCTGTGTGCCATCAGATAGTGCGGTTAAAACAAAATTATTATCTGAGGCT
CACAGTTCCCCATTTTCTATGCACCCGGGTAGTACGAAGATGTATCAGGACCTAAAGCGGGTTTATTGGTGGCGTAATATGAAGAGAGAGGTGGCAGAATTTGTTAGTAA
ATGCTTGGTGTGTCAGCAGGCTGCTATGGACACGAGGCTAGACTTTAGTACAGCTTTCCACACACAGACTGATGGTCAGACTGAGCGTCTGAACCAAGTTTTAGAGACTA
TGTTGCGAGCTTGTGCATTAGAATTTCCAGGTAGTTGGGACTCCTACTTGCATTTGATGGAATTTGCTTATAATAACAGTTTTCAGGCTACCATTAGCATGGCACCATTT
GAGGCCTTGTACGGAAAATGTTTTAGATCCCCTGTGTGCTGGGGTGAGGTAGGTGAGCAGAGATTGATGGGTCATGAATTAGTTCAGTCTACTAACGAAGCGATACAGAA
AATTAGGTCACGTATGCAGACCGCACAGAGTAGGCAGAAGAGTTATGCGGATGTGAGACGGAAGGATCTTGAATTTGATGTGTGGGACAAAGTGTTCTTGAAGGTAGCAC
CTATGAAAGGTGTCTTACGATTTGAAAGGAGAGGAAAGTTGAGTCCCTGTTTTGTTGGGCCGTTTGAGATTCTGGAGCGAATTGGCCCTATAGCGTATCGCTTGGCATTG
CCACCATCACTCTCGGTAGTTCATGATGTGTTACATGTTTCTATGTTGAGGAAGTACGTGCCGGATCCATCCCATGTAGTGGATTATGAGCCACTAGAGATTGATGAAAA
CTTGAGCTATACAGAACAACCCGTCGAGGTGTTGGCTAGGGAGGTGAAGGTGTTGAGGAATAGAGAGATTCCTTTGGTAAAGGTCTTATGGCAGAATCACAGAATTGAAG
AAGCTACATGGGAGAGAGAAGATGACATGAGAGCTCGTTATCCTAAATTGTTCGAGGAATAA
mRNA sequenceShow/hide mRNA sequence
ATGCCGCCAAGGAGAGGTGCACGTAGGGGTGGCCGAGAAGGTCGAGGGAGGGGAGCAGGACGTGTTCAACTTGAGGTGCAGCCTGTAGCCCAAGCCACCAACCCGGCTGT
GCCAGTTACTCATGCGGACCTCGCTGCTATGGAGCAGAGGTTTAGGGATTTGATTATGCAGATGCGGGAGCAGCAGCAGCCTGCCCCGCCAACTCCTGTTCCAGTTCCAG
TCGTGCTGGATCAGTTGTCGGTAGAGGCCAAGCACTTGAGGGATTTCAGGAAGTATAACCCCACGACATTCGATAGGTCTTTGGAGGACCCCACCAGGGCTCAGATTTGG
TTATCTTATTTGGAGACCATATTTCGGTATATGAAGTGCCCTGAGGATCAGAAAGTTCAGTGTGCTGTTTTCATGTTGACAGACAGAGTAGAGCAGTATGATGCGGAGTT
TGACATGTTATGCCGTTTCGTTCCCGAGATGATAGCGACTGAGGCAGCCAGGGCTGATAAGTTTGTTAGAGGCCTCAGGCTAGACATCCAGGGTTGGTTCGAGCCTTCCG
ACCCGTCACTCATGCCAATACACTGCGCTTGGCAGTGGATCTCAGTTTACAGGAGAGGGCTAACTCGTCCAAGATCAGGTGGTGAGTCTCGCCGTTTCCAGCAGAAACCT
TTTGAGGCAGAGGAAACTGCCAGAGGGAAGCCGTTGTGTACCACTTGCGGGAAGCACCATCTGGGCCGTTGTTTATTTGGGACCAAGACTTGCTTCAAGTGTAGGCAAGA
GGGGCATACCGCTGACAGATGCTCGATGAGGCTTACCGGAAATGTACAGAATCAGGGAGCATGTGCTCTACATCAGGTTAAAGTCTTTGCTACCAATAATACTGAGGCTG
AGAGAGCAGGCATGGTAGTGACAGGTACGCTTCCAGTGTTGGGGCATTATGATCTAGTTTTGTTTGATTCGGGTTCGTCACATTCCTTTATCTCTTCTGCATTTGTGTTG
CATGCCCGCTTAGAGGTAGAGCCCCTACACCATGTTTTATCAGTATCTACTCCTTCTGGGGAGTGTATGTGGTCGAAGGGAAAGGTGAAAGCATGCCAGATTGAGATAGC
AGGCCATGTGATTGAAGTAACGTTGTTGGTCCTGGACATGCTCGACTTTGATGTAATTCTGGGTATGGATTGGTTAGCCGCTAACCATGCCAGAATAGATTGTTCCCGTA
AGGAGGTAGCGTTTAACCCTCCCTCGTTGGCCAGTTTTAAATTTAAGGGAGAAGGGTCAAGACCGTTGTCTAAGGTAATCTCAGCCATGAGGGCCAGCAAATTGCTTAGT
CAGGGTACTTGGAGTATCTTAGCGAGCATGGTGGATACTAGAGAGGTTGATGTATCTCTGTCATCAGAACCAGTGGTAAGGGACTATCCGGATGTCTTTCCTGAAGAACT
TCCAGGGTTACCTCCTCACAGAGAGATTGAGTTTGCCATAGAGCTGGAGCCAGGCACGGTTACTATATCCAGAGCCCCATACAGAATGGCCCCAGCAGAATTGAAAGAGC
TGAAAGTGCAGTTACAGGAGTTGCTTGATAAAGGCTTCATTCGACCGAGTATGTCGCCTTGGGGTGCGCCAGTTTTATTTGTTAAGAAGAAGGATGGATCGATGCGCCTA
TGTATTGACCATAGGGAGTTGAATAAGGTAACCGTTAAGAACAGATATCCCTTGCCCAGGATCGACGATCTGTTTGACCAGTTACAAGGAGCTACAGTGTTCTCTAAGAT
TGATCTTCGGTCGGGATATCATCAACTGAGGATTAAGGATGGTGATGTACCGAAGACAACCTTTCGATCCAAATACGGACACTATGAGTTTATTGTGATGTCTTTTGGTT
TGACAAATGCTCCGGCAGTGTTTATGGACTTGATGAACAGAGTGTTTAGGGAGTTCCTAGACATTTTTGTGATCGTGTTTATTGATGATATTTTGATATATTCCAAGACA
GAGGCCGAGCATGAGGAGCATTTACATATGGTTCTGCAAACCCTTCGGGATAATAAATTGTATGCAAAGTTCTCGAAATGTGAGTTTTGGTTGAAGCAGGTGTCCTTTCT
AGGCCATGTGGTTTCTAAGGTTGGAGTTTCTGTGAATCCAGCAAAGATAGAGGCAGTCACTAGTTGGCCCCGACCTTCCACAGTCAGTGAGGTTCGTAGCTTTTTAGGTT
TAGCAGGTTATTATCGACGGTTTGTGGAGAACTTTTCCCGTATAGCTACTCCTCTTACTCAGTTGACCAGGAAGAGAGCTCCTTTTGTTTGGAGCAAGGCATGCAAGGCA
AATGTGGTAGCTGATGCTCTTAGTAGAAAGGTATCACATTCAGCAGCACTTATTACCCGACAGGCCCCCTTGCATCGAGATCTTGAGAGGGCTGAGATTTCAGTGTCAGT
AGGGGCAGTCACTATGCAGTTAGCCCAGTTGACAGTACAACCGACATTGAGGATCATTGATGCTCAGAGTAACAATCCTTATTTGGTTGATAAGCGTGGCCTAGCAAAGG
CATGGCAAGCTGTTGAGTTCTCTATATCGTATGATGGTGGACTGTTGTTTGAGAGGCGTCTCTGTGTGCCATCAGATAGTGCGGTTAAAACAAAATTATTATCTGAGGCT
CACAGTTCCCCATTTTCTATGCACCCGGGTAGTACGAAGATGTATCAGGACCTAAAGCGGGTTTATTGGTGGCGTAATATGAAGAGAGAGGTGGCAGAATTTGTTAGTAA
ATGCTTGGTGTGTCAGCAGGCTGCTATGGACACGAGGCTAGACTTTAGTACAGCTTTCCACACACAGACTGATGGTCAGACTGAGCGTCTGAACCAAGTTTTAGAGACTA
TGTTGCGAGCTTGTGCATTAGAATTTCCAGGTAGTTGGGACTCCTACTTGCATTTGATGGAATTTGCTTATAATAACAGTTTTCAGGCTACCATTAGCATGGCACCATTT
GAGGCCTTGTACGGAAAATGTTTTAGATCCCCTGTGTGCTGGGGTGAGGTAGGTGAGCAGAGATTGATGGGTCATGAATTAGTTCAGTCTACTAACGAAGCGATACAGAA
AATTAGGTCACGTATGCAGACCGCACAGAGTAGGCAGAAGAGTTATGCGGATGTGAGACGGAAGGATCTTGAATTTGATGTGTGGGACAAAGTGTTCTTGAAGGTAGCAC
CTATGAAAGGTGTCTTACGATTTGAAAGGAGAGGAAAGTTGAGTCCCTGTTTTGTTGGGCCGTTTGAGATTCTGGAGCGAATTGGCCCTATAGCGTATCGCTTGGCATTG
CCACCATCACTCTCGGTAGTTCATGATGTGTTACATGTTTCTATGTTGAGGAAGTACGTGCCGGATCCATCCCATGTAGTGGATTATGAGCCACTAGAGATTGATGAAAA
CTTGAGCTATACAGAACAACCCGTCGAGGTGTTGGCTAGGGAGGTGAAGGTGTTGAGGAATAGAGAGATTCCTTTGGTAAAGGTCTTATGGCAGAATCACAGAATTGAAG
AAGCTACATGGGAGAGAGAAGATGACATGAGAGCTCGTTATCCTAAATTGTTCGAGGAATAA
Protein sequenceShow/hide protein sequence
MPPRRGARRGGREGRGRGAGRVQLEVQPVAQATNPAVPVTHADLAAMEQRFRDLIMQMREQQQPAPPTPVPVPVVLDQLSVEAKHLRDFRKYNPTTFDRSLEDPTRAQIW
LSYLETIFRYMKCPEDQKVQCAVFMLTDRVEQYDAEFDMLCRFVPEMIATEAARADKFVRGLRLDIQGWFEPSDPSLMPIHCAWQWISVYRRGLTRPRSGGESRRFQQKP
FEAEETARGKPLCTTCGKHHLGRCLFGTKTCFKCRQEGHTADRCSMRLTGNVQNQGACALHQVKVFATNNTEAERAGMVVTGTLPVLGHYDLVLFDSGSSHSFISSAFVL
HARLEVEPLHHVLSVSTPSGECMWSKGKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHARIDCSRKEVAFNPPSLASFKFKGEGSRPLSKVISAMRASKLLS
QGTWSILASMVDTREVDVSLSSEPVVRDYPDVFPEELPGLPPHREIEFAIELEPGTVTISRAPYRMAPAELKELKVQLQELLDKGFIRPSMSPWGAPVLFVKKKDGSMRL
CIDHRELNKVTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTTFRSKYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDIFVIVFIDDILIYSKT
EAEHEEHLHMVLQTLRDNKLYAKFSKCEFWLKQVSFLGHVVSKVGVSVNPAKIEAVTSWPRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKRAPFVWSKACKA
NVVADALSRKVSHSAALITRQAPLHRDLERAEISVSVGAVTMQLAQLTVQPTLRIIDAQSNNPYLVDKRGLAKAWQAVEFSISYDGGLLFERRLCVPSDSAVKTKLLSEA
HSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQAAMDTRLDFSTAFHTQTDGQTERLNQVLETMLRACALEFPGSWDSYLHLMEFAYNNSFQATISMAPF
EALYGKCFRSPVCWGEVGEQRLMGHELVQSTNEAIQKIRSRMQTAQSRQKSYADVRRKDLEFDVWDKVFLKVAPMKGVLRFERRGKLSPCFVGPFEILERIGPIAYRLAL
PPSLSVVHDVLHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKVLRNREIPLVKVLWQNHRIEEATWEREDDMRARYPKLFEE