; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh07G009980 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh07G009980
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionReverse transcriptase
Genome locationCmo_Chr07:4902984..4905935
RNA-Seq ExpressionCmoCh07G009980
SyntenyCmoCh07G009980
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR021109 - Aspartic peptidase domain superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022154605.1 uncharacterized protein LOC111021829 [Momordica charantia]1.8e-18259.75Show/hide
Query:  DLDARYQGHIREIQTLQWKVD-PKEIIEKGLMFVDATINSRPSKSTLIDSGAIHNFIADQEARRLGLTIENDPEKMKAVNSEALHVVEVSKRVPFKLGTW
        D +    G ++ +  +Q +V+ PK   EKGLMFVDA IN  P+KS ++DSGA HNFI++QEA RL LTIE D  KMKAVNSEAL +V VSKRV  KLGTW
Subjt:  DLDARYQGHIREIQTLQWKVD-PKEIIEKGLMFVDATINSRPSKSTLIDSGAIHNFIADQEARRLGLTIENDPEKMKAVNSEALHVVEVSKRVPFKLGTW

Query:  TGEMDIVVVRMDDFDVVLGMDFLLEHKVIPMPLAKCLVITDHNPTVIPASIKQPGNLRMISAIQLKRGFAREEPTFI-----------------------
        TG  D VVVRMDDFDVVLGM+FL+EHKVIPMPLAKC+++T ++PTV+ ASIKQPG +RMISA+QLK+G  REEPTF+                       
Subjt:  TGEMDIVVVRMDDFDVVLGMDFLLEHKVIPMPLAKCLVITDHNPTVIPASIKQPGNLRMISAIQLKRGFAREEPTFI-----------------------

Query:  ----------------------------------------------------GFIRPAKAPYGASVLFQKKKDGTLRLCIDYRALNKVTIRNKYPLPIIF
                                                            GFIRPAKAP+GASVLFQKKKDGTLRLCIDYRALNKVT+RNKYPLPII 
Subjt:  ----------------------------------------------------GFIRPAKAPYGASVLFQKKKDGTLRLCIDYRALNKVTIRNKYPLPIIF

Query:  DLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCITRYEAFEYLVMPFGLTNAPATFCTLMNQVFHEFLDQFVVVYVDNIVIYNTTLEEHKVHLKLA-
        DLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTC+TRY AFE+LVMPFGLTNAPATFCT+MNQVFHE+LDQFVVVY+D+IV+Y+ TL+EH++HL+L  
Subjt:  DLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCITRYEAFEYLVMPFGLTNAPATFCTLMNQVFHEFLDQFVVVYVDNIVIYNTTLEEHKVHLKLA-

Query:  ------------------------------ELVISMVSDKIKTIQEWKIPTLVSNVRSFLGLTNYYRQFVEGFSRRAAPLTELLKKDHPWSWSNECQMAF
                                      +  ISM +DK+KTIQEW++PT V+ +RSFLGL NYYR+F+EGFSRRA P+TELL K     WS E Q AF
Subjt:  ------------------------------ELVISMVSDKIKTIQEWKIPTLVSNVRSFLGLTNYYRQFVEGFSRRAAPLTELLKKDHPWSWSNECQMAF

Query:  EDLKAIMTKGSVLGLVDVTKPFEIEMDASDFALGGILIQEGHPIAYESRKLNDTERRYTLFEKK
        EDLKA M KG VLGL DVTKPFE+E DASD+ALGG+L+Q+ HPIAYESRKLN+ ERRYT+ EK+
Subjt:  EDLKAIMTKGSVLGLVDVTKPFEIEMDASDFALGGILIQEGHPIAYESRKLNDTERRYTLFEKK

XP_022154605.1 uncharacterized protein LOC111021829 [Momordica charantia]3.8e-0750.82Show/hide
Query:  MADMNTRIGVTMKAMENVTARQTHIRSNKLKFPDPRPFKENRDAKELENFIFDVKQYFKAT
        M +++TR+ +TM+A+ N    Q ++  NKLK P+P+PF  NR  K+LENF FDV+QYFK T
Subjt:  MADMNTRIGVTMKAMENVTARQTHIRSNKLKFPDPRPFKENRDAKELENFIFDVKQYFKAT

XP_022154605.1 uncharacterized protein LOC111021829 [Momordica charantia]4.0e-18251.09Show/hide
Query:  MADMNTRIGVTMKAMENVTARQTHIRSNKLKFPDPRPFKENRDAKELENFIFDVKQYFKATT-----------DLYL-----------------------
        MA M TRI VTMKA+ENVTA QT+  SNKL+FP+PR FK NRDAKELENFIFDV+QYFKATT            +YL                       
Subjt:  MADMNTRIGVTMKAMENVTARQTHIRSNKLKFPDPRPFKENRDAKELENFIFDVKQYFKATT-----------DLYL-----------------------

Query:  ------RHKSDSSL--NASHRRCKTLVAYK--------------------GAKHRE----------------------------------LIGHHRLVGR
              +   D  L  NA H   + LVA K                    G   ++                                  L+ +    G 
Subjt:  ------RHKSDSSL--NASHRRCKTLVAYK--------------------GAKHRE----------------------------------LIGHHRLVGR

Query:  PQERVEEPIL---PRKRRTYSNGKTNNPE------TNWKHKGLYQTVLDLDARYQGHIREIQT--LQWKV--DPKEII----EKGLMFVDATI-NSRPSK
         +     P     P K  ++ NG  N P       + W  +          +R   H R   T  LQ  +   P ++        L  +  +I  S  +K
Subjt:  PQERVEEPIL---PRKRRTYSNGKTNNPE------TNWKHKGLYQTVLDLDARYQGHIREIQT--LQWKV--DPKEII----EKGLMFVDATI-NSRPSK

Query:  -STLIDSGAIHNFIADQEARRLGLTIENDPEKMKAVNSEALHVVEVSKRVPFKLGTWTGEMDIVVVRMDDFDVVLGMDFLLEHKVIPMPLAKCLVITDHN
          T++D         DQ+  R+G        + K    E   +VE    VPFK+G WTGE+D+VV RMDDFDVVLGM+FLLEHKVIPMPLAKCLV+TD N
Subjt:  -STLIDSGAIHNFIADQEARRLGLTIENDPEKMKAVNSEALHVVEVSKRVPFKLGTWTGEMDIVVVRMDDFDVVLGMDFLLEHKVIPMPLAKCLVITDHN

Query:  PTVIPASIKQPGNLRMISAIQLKRGFAREEPTFI------------------------------------------------------------------
        PTVIPASIKQPGNLRMIS IQLKRG AREE TF+                                                                  
Subjt:  PTVIPASIKQPGNLRMISAIQLKRGFAREEPTFI------------------------------------------------------------------

Query:  GFIRPAKAPYGASVLFQKKKDGTLRLCIDYRALNKVTIRNKYPLPIIFDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCITRYEAFEYLVMPFGL
        GFIRP KAPY A VLFQKKKDGTL LCIDYRALNKVT+RNK+PLPII DLFDQLHGAKYFTKLDL SGYYQVRIAEGDEP+TTC+TRY AFE+LVMPF L
Subjt:  GFIRPAKAPYGASVLFQKKKDGTLRLCIDYRALNKVTIRNKYPLPIIFDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCITRYEAFEYLVMPFGL

Query:  TNAPATFCTLMNQVFHEFLDQFVVVYVDNIVIYNTTLEEHKVHLKLAELVISMVSDKIKTIQEWKIPTLVSNVRSFLGLTNYYRQFVEGFSRRAAPLTEL
        TNAPATFCTLMNQVF+E+LDQFV+VY+D+IV+Y+TTLEEHKVHLKL    ISM SDKIK IQEWK+PT VS++RSFLGL NYYR+FVEGFSRRAAPL EL
Subjt:  TNAPATFCTLMNQVFHEFLDQFVVVYVDNIVIYNTTLEEHKVHLKLAELVISMVSDKIKTIQEWKIPTLVSNVRSFLGLTNYYRQFVEGFSRRAAPLTEL

Query:  LKKDHPWSWSNECQMAFEDLKAIMTKGSVLGLVDVTKPFEIEMDASDFALGGILIQEGHPIAYESRKLNDTERRYTLFEKK
        LKKDHPW WSN+CQMAFEDLK  M  G VLGLVDVTKPFEIE DASDFALGG+LIQEGHPIA+ESRKLND ERRY + EKK
Subjt:  LKKDHPWSWSNECQMAFEDLKAIMTKGSVLGLVDVTKPFEIEMDASDFALGGILIQEGHPIAYESRKLNDTERRYTLFEKK

XP_023524533.1 uncharacterized protein LOC111788429 [Cucurbita pepo subsp. pepo]2.2e-21753.1Show/hide
Query:  MADMNTRIGVTMKAMENVTARQTHIRSNKLKFPDPRPFKENRDAKELENFIFDVKQYFKATT-----------DLYL-----------------------
        M+ M+TRI VTMKA+E VTA QT   SNKL+FPDPR FK NRDAKELENFIFDV+QYFKATT            +YL                       
Subjt:  MADMNTRIGVTMKAMENVTARQTHIRSNKLKFPDPRPFKENRDAKELENFIFDVKQYFKATT-----------DLYL-----------------------

Query:  ------RHKSDSSL--NASHRRCKTLVAYK--------------------GAKHRELI----------------------------------------GH
              R   +  L  NA H   + +VA K                    G   ++ +                                        G 
Subjt:  ------RHKSDSSL--NASHRRCKTLVAYK--------------------GAKHRELI----------------------------------------GH

Query:  HRLVGRPQERVEEPILPRKRR----TYSNGKTNNPETN--------------------------WKHKGLY----------------------QTVLDL-
         R          +P  P   R      SNG T+ P  N                            HK  Y                      +T+LD  
Subjt:  HRLVGRPQERVEEPILPRKRR----TYSNGKTNNPETN--------------------------WKHKGLY----------------------QTVLDL-

Query:  ---DARYQGHIREIQTLQWKVDPKEIIEKGLMFVDATINSRPSKSTLIDSGAIHNFIADQEARRLGLTIENDPEKMKAVNSEALHVVEVSKRVPFKLGTW
           D    G ++ +  LQ KV+PKEIIEKGLMFVDATINSR SKSTLIDSGA HNFIADQEARRLGLTI  DP KMKAVNSEAL +V VSK VPFK+G W
Subjt:  ---DARYQGHIREIQTLQWKVDPKEIIEKGLMFVDATINSRPSKSTLIDSGAIHNFIADQEARRLGLTIENDPEKMKAVNSEALHVVEVSKRVPFKLGTW

Query:  TGEMDIVVVRMDDFDVVLGMDFLLEHKVIPMPLAKCLVITDHNPTVIPASIKQPGNLRMISAIQLKRGFAREEPTFI-----------------------
        TGE+D+VVVRMDDFDVVLGM+FLLEHKVIPMPLAKCLVITD NPTVIPASIKQPGNLRMISAIQLKRG AREEPTF+                       
Subjt:  TGEMDIVVVRMDDFDVVLGMDFLLEHKVIPMPLAKCLVITDHNPTVIPASIKQPGNLRMISAIQLKRGFAREEPTFI-----------------------

Query:  ----------------------------------------------------------GFIRPAKAPYGASVLFQKKKDGTLRLCIDYRALNKVTIRNKY
                                                                  GFIRPAKAPYGA VLFQKKKDGTLRLCIDYRALNKVT+RNKY
Subjt:  ----------------------------------------------------------GFIRPAKAPYGASVLFQKKKDGTLRLCIDYRALNKVTIRNKY

Query:  PLPIIFDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCITRYEAFEYLVMPFGLTNAPATFCTLMNQVFHEFLDQFVVVYVDNIVIYNTTLEEHKV
        PLPII DLFDQLHGAKYFTKLDLRSGYYQVRI EGDEPKTTC+TRY AFE+LVMPFGLTNAPATF TLMNQVF+E+LDQFV+VY+D+IV+Y+TTLEEHKV
Subjt:  PLPIIFDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCITRYEAFEYLVMPFGLTNAPATFCTLMNQVFHEFLDQFVVVYVDNIVIYNTTLEEHKV

Query:  HLK---------------------------LAELV----ISMVSDKIKTIQEWKIPTLVSNVRSFLGLTNYYRQFVEGFSRRAAPLTELLKKDHPWSWSN
        HLK                           L  +V    ISM SDKIK IQEWK+PT VS +RSFLGL NYYR+FVEGFSRRAAPLTELLKKDHPWSWSN
Subjt:  HLK---------------------------LAELV----ISMVSDKIKTIQEWKIPTLVSNVRSFLGLTNYYRQFVEGFSRRAAPLTELLKKDHPWSWSN

Query:  ECQMAFEDLKAIMTKGSVLGLVDVTKPFEIEMDASDFALGGILIQEGHPIAYESRKLNDTERRYTLFEKK
        +CQMAFE+LK  MT+G VLGLVDVTKPFE+E DASDFALGG+LIQEGHPIAYESRKLND ERRYT+ EK+
Subjt:  ECQMAFEDLKAIMTKGSVLGLVDVTKPFEIEMDASDFALGGILIQEGHPIAYESRKLNDTERRYTLFEKK

XP_023526180.1 uncharacterized protein LOC111789739 [Cucurbita pepo subsp. pepo]9.2e-18765.74Show/hide
Query:  KGLYQTVLDLDARYQGHIREIQTLQWKVDPKEIIEKGLMFVDATINSRPSKSTLIDSGAIHNFIADQEARRLGLTIENDPEKMKAVNSEALHVVEVSKRV
        KG ++T   +    +   + +  LQ KV+PKEIIEKGLMFVDATINSR SKSTLIDSGA HNFIADQEARRLGLTI  DP KMKAVNSEAL +V VSK V
Subjt:  KGLYQTVLDLDARYQGHIREIQTLQWKVDPKEIIEKGLMFVDATINSRPSKSTLIDSGAIHNFIADQEARRLGLTIENDPEKMKAVNSEALHVVEVSKRV

Query:  PFKLGTWTGEMDIVVVRMDDFDVVLGMDFLLEHKVIPMPLAKCLVITDHNPTVIPASIKQPGNLRMISAIQLKRGFAREEPTFI----------------
        PFK+G WTGE+D+VVVRMDDFDVVLGM+FLLEHKVIPMPLAKCLVITD NP VIPASIKQPGNLRMISAIQLKRG AREEPTF+                
Subjt:  PFKLGTWTGEMDIVVVRMDDFDVVLGMDFLLEHKVIPMPLAKCLVITDHNPTVIPASIKQPGNLRMISAIQLKRGFAREEPTFI----------------

Query:  -----------------------------------------------------------------GFIRPAKAPYGASVLFQKKKDGTLRLCIDYRALNK
                                                                         GFIRPAKAPYGA VLFQKKKDGTLRLCIDYRALNK
Subjt:  -----------------------------------------------------------------GFIRPAKAPYGASVLFQKKKDGTLRLCIDYRALNK

Query:  VTIRNKYPLPIIFDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCITRYEAFEYLVMPFGLTNAPATFCTLMNQVFHEFLDQFVVVYVDNIVIYNT
        VT+RNKYPLPII DLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTC+TRY AFE+LVMPFGLTNAPATFCTLMNQVF+E+LDQFV+VY+D+IV+Y+T
Subjt:  VTIRNKYPLPIIFDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCITRYEAFEYLVMPFGLTNAPATFCTLMNQVFHEFLDQFVVVYVDNIVIYNT

Query:  TLEEHKVHLK---------------------------LAELV----ISMVSDKIKTIQEWKIPTLVSNVRSFLGLTNYYRQFVEGFSRRAAPLTELLKKD
        TLEEHKVHLK                           L  +V    ISM SDKIK IQEWK+PT VS +RSFLGL NYYR+FVEGFSRRAAPLTELLKKD
Subjt:  TLEEHKVHLK---------------------------LAELV----ISMVSDKIKTIQEWKIPTLVSNVRSFLGLTNYYRQFVEGFSRRAAPLTELLKKD

Query:  HPWSWSNECQMAFEDLKAIMTKGSVLGLVDVTKPFEI
        HPWSWSN+CQMAFE+LK  MT+G VLGLVD+TKPFE+
Subjt:  HPWSWSNECQMAFEDLKAIMTKGSVLGLVDVTKPFEI

XP_023537907.1 uncharacterized protein LOC111798805 [Cucurbita pepo subsp. pepo]4.1e-21953.33Show/hide
Query:  MADMNTRIGVTMKAMENVTARQTHIRSNKLKFPDPRPFKENRDAKELENFIFDVKQYFKATT-----------DLYL-----------------------
        M+ M+TRI VTMKA+E VTA QT   SNKL+FPDPR FK NRDAKELENFIFDV+QYFKATT            +YL                       
Subjt:  MADMNTRIGVTMKAMENVTARQTHIRSNKLKFPDPRPFKENRDAKELENFIFDVKQYFKATT-----------DLYL-----------------------

Query:  ------RHKSDSSL--NASHRRCKTLVAYK--------------------GAKHRELI----------------------------------------GH
              R   +  L  NA H   + +VA K                    G   ++ +                                        G 
Subjt:  ------RHKSDSSL--NASHRRCKTLVAYK--------------------GAKHRELI----------------------------------------GH

Query:  HRLVGRPQERVEEPILPRKRR----TYSNGKTNNPETN--------------------------WKHKGLY----------------------QTVLDL-
         R          +P  P   R      SNG T+ P  N                            HK  Y                      +T+LD  
Subjt:  HRLVGRPQERVEEPILPRKRR----TYSNGKTNNPETN--------------------------WKHKGLY----------------------QTVLDL-

Query:  ---DARYQGHIREIQTLQWKVDPKEIIEKGLMFVDATINSRPSKSTLIDSGAIHNFIADQEARRLGLTIENDPEKMKAVNSEALHVVEVSKRVPFKLGTW
           D    G ++ +  LQ KV+PKEIIEKGLMFVDATINSR SKSTLIDSGA HNFIADQEARRLGLTI  DP KMKAVNSEAL +V VSK VPFK+G W
Subjt:  ---DARYQGHIREIQTLQWKVDPKEIIEKGLMFVDATINSRPSKSTLIDSGAIHNFIADQEARRLGLTIENDPEKMKAVNSEALHVVEVSKRVPFKLGTW

Query:  TGEMDIVVVRMDDFDVVLGMDFLLEHKVIPMPLAKCLVITDHNPTVIPASIKQPGNLRMISAIQLKRGFAREEPTFI-----------------------
        TGE+D+VVVRMDDFDVVLGM+FLLEHKVIPMPLAKCLVITD NPTVIPASIKQPGNLRMISAIQLKRG AREEPTF+                       
Subjt:  TGEMDIVVVRMDDFDVVLGMDFLLEHKVIPMPLAKCLVITDHNPTVIPASIKQPGNLRMISAIQLKRGFAREEPTFI-----------------------

Query:  ----------------------------------------------------------GFIRPAKAPYGASVLFQKKKDGTLRLCIDYRALNKVTIRNKY
                                                                  GFIRPAKAPYGA VLFQKKKDGTLRLCIDYRALNKVT+RNKY
Subjt:  ----------------------------------------------------------GFIRPAKAPYGASVLFQKKKDGTLRLCIDYRALNKVTIRNKY

Query:  PLPIIFDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCITRYEAFEYLVMPFGLTNAPATFCTLMNQVFHEFLDQFVVVYVDNIVIYNTTLEEHKV
        PLPII DLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTC+TRY AFE+LVMPFGLTNAPATFCTLMNQVF+E+LDQFV+VY+D+IV+Y+TTLEEHKV
Subjt:  PLPIIFDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCITRYEAFEYLVMPFGLTNAPATFCTLMNQVFHEFLDQFVVVYVDNIVIYNTTLEEHKV

Query:  HLK---------------------------LAELV----ISMVSDKIKTIQEWKIPTLVSNVRSFLGLTNYYRQFVEGFSRRAAPLTELLKKDHPWSWSN
        HLK                           L  +V    ISM SDKIK IQEWK+PT VS +RSFLGL NYYR+FVEGFSRRAAPLTELLKKDHPWSWSN
Subjt:  HLK---------------------------LAELV----ISMVSDKIKTIQEWKIPTLVSNVRSFLGLTNYYRQFVEGFSRRAAPLTELLKKDHPWSWSN

Query:  ECQMAFEDLKAIMTKGSVLGLVDVTKPFEIEMDASDFALGGILIQEGHPIAYESRKLNDTERRYTLFEKK
        +CQMAFE+LK  MT+G VLGLVDVTKPFE+E DASDFALGG+LIQEGHPIAYESRKLND ERRYT+ EK+
Subjt:  ECQMAFEDLKAIMTKGSVLGLVDVTKPFEIEMDASDFALGGILIQEGHPIAYESRKLNDTERRYTLFEKK

TrEMBL top hitse value%identityAlignment
A0A5A7UZE9 Reverse transcriptase2.7e-16843.89Show/hide
Query:  MADMNTRIGVTMKAMENVTARQTHIRSNKLKFPDPRPFKENRDAKELENFIFDVKQYFKAT--------TDLYLRHKSDSS--------LNASHRRC---
        +AD+N R+ +TM+AM N       I  +++K P+P+PF   RDAK LEN+IFD++QYF+AT          L   H S+ +        ++    RC   
Subjt:  MADMNTRIGVTMKAMENVTARQTHIRSNKLKFPDPRPFKENRDAKELENFIFDVKQYFKAT--------TDLYLRHKSDSS--------LNASHRRC---

Query:  ------KTLVAYKGAKHRELIGHHRL-----VGRPQERVE----------------------EPILP-RKRRTY--------------------------
              + L +    ++ E++   +L      G  +E V+                      E + P  K + Y                          
Subjt:  ------KTLVAYKGAKHRELIGHHRL-----VGRPQERVE----------------------EPILP-RKRRTY--------------------------

Query:  --------SNGKTNN----PET--------------------NWKHKGLY---------QTVLDLDARYQ------GHIREIQTLQWKV-DPKEIIEKGL
                S G  NN    P+T                    +W+   ++         QT  + D   +      G ++ + +LQ KV +    +E+GL
Subjt:  --------SNGKTNN----PET--------------------NWKHKGLY---------QTVLDLDARYQ------GHIREIQTLQWKV-DPKEIIEKGL

Query:  MFVDATINSRPSKSTLIDSGAIHNFIADQEARRLGLTIENDPEKMKAVNSEALHVVEVSKRVPFKLGTWTGEMDIVVVRMDDFDVVLGMDFLLEHKVIPM
        M+VD  IN +P+KST++DSGA HNFI + EA+RL L  E D  +MKAVNS AL ++ + KR   +LG W+G +D VVV+MDDFDVVLGM+FLLEH+VIPM
Subjt:  MFVDATINSRPSKSTLIDSGAIHNFIADQEARRLGLTIENDPEKMKAVNSEALHVVEVSKRVPFKLGTWTGEMDIVVVRMDDFDVVLGMDFLLEHKVIPM

Query:  PLAKCLVITDHNPTVIPASIKQPGNLRMISAIQLKRGFAREEPTFI------------------------------------------------------
        PLAKCLVIT    +V+   ++QP  L+MISA+QLK+G +R+EPTF+                                                      
Subjt:  PLAKCLVITDHNPTVIPASIKQPGNLRMISAIQLKRGFAREEPTFI------------------------------------------------------

Query:  ---------------------------GFIRPAKAPYGASVLFQKKKDGTLRLCIDYRALNKVTIRNKYPLPIIFDLFDQLHGAKYFTKLDLRSGYYQVR
                                   GFIRPAKAPYGA +LFQKKKDG+LRLCIDYRALNK+T+RNKYPLPII DLFD+LHGAKYF+KLDLRSGYYQVR
Subjt:  ---------------------------GFIRPAKAPYGASVLFQKKKDGTLRLCIDYRALNKVTIRNKYPLPIIFDLFDQLHGAKYFTKLDLRSGYYQVR

Query:  IAEGDEPKTTCITRYEAFEYLVMPFGLTNAPATFCTLMNQVFHEFLDQFVVVYVDNIVIYNTTLEEHKVHLK------LAELV----ISMVSDKIKTIQE
        IAEGDEPKTTC+TRY AFE+LVMPFGLTNAPATF TLMN+VFHE+LD+FVVVY+D+IV+Y+TT+EEH+ HL+      L  ++    I M   KI  I++
Subjt:  IAEGDEPKTTCITRYEAFEYLVMPFGLTNAPATFCTLMNQVFHEFLDQFVVVYVDNIVIYNTTLEEHKVHLK------LAELV----ISMVSDKIKTIQE

Query:  WKIPTLVSNVRSFLGLTNYYRQFVEGFSRRAAPLTELLKKDHPWSWSNECQMAFEDLKAIMTKGSVLGLVDVTKPFEIEMDASDFALGGILIQEGHPIAY
        W +P  VS +RSFLGL NYY +FVEGFS+RA+PLTELLKKD  W+W  ECQ AF+ LK  M +G +LG+ DVTKPFE+E DASD+ALGG+L+Q GHPIAY
Subjt:  WKIPTLVSNVRSFLGLTNYYRQFVEGFSRRAAPLTELLKKDHPWSWSNECQMAFEDLKAIMTKGSVLGLVDVTKPFEIEMDASDFALGGILIQEGHPIAY

Query:  ESRKLNDTERRYTLFEKK
        ESRKLN  ERRYT+FEK+
Subjt:  ESRKLNDTERRYTLFEKK

A0A6J1D906 Reverse transcriptase4.0e-18059.18Show/hide
Query:  IREIQTLQWKVD-PKEIIEKGLMFVDATINSRPSKSTLIDSGAIHNFIADQEARRLGLTIENDPEKMKAVNSEALHVVEVSKRVPFKLGTWTGEMDIVVV
        ++ +  +Q +V+ PK   EKGLMFVDATIN   +KST++DSGA HNFI++QEARRL LTIE D  KMK VN EAL +V VSKRV  KLGTWTG +D VVV
Subjt:  IREIQTLQWKVD-PKEIIEKGLMFVDATINSRPSKSTLIDSGAIHNFIADQEARRLGLTIENDPEKMKAVNSEALHVVEVSKRVPFKLGTWTGEMDIVVV

Query:  RMDDFDVVLGMDFLLEHKVIPMPLAKCLVITDHNPTVIPASIKQPGNLRMISAIQLKRGFAREEPTFI--------------------------------
        RMDDFDVVLGM+FL+EHKVIPMPLAKC+++T ++PTV+  SIKQPG +RMISA+QLK+G  REEPTF+                                
Subjt:  RMDDFDVVLGMDFLLEHKVIPMPLAKCLVITDHNPTVIPASIKQPGNLRMISAIQLKRGFAREEPTFI--------------------------------

Query:  -------------------------------------------------GFIRPAKAPYGASVLFQKKKDGTLRLCIDYRALNKVTIRNKYPLPIIFDLF
                                                         GFIRPAKAP+GA VLFQKKKDGTLRLCIDYRALNKVT+RNKYPLPII DLF
Subjt:  -------------------------------------------------GFIRPAKAPYGASVLFQKKKDGTLRLCIDYRALNKVTIRNKYPLPIIFDLF

Query:  DQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCITRYEAFEYLVMPFGLTNAPATFCTLMNQVFHEFLDQFVVVYVDNIVIYNTTLEEHKVHLKLA----
        DQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTC+TRY AFE+LVM FGLTNAPATFCT+MNQVFHE+LDQFVVVY+D+IV+Y+ TL+EH++HL+L     
Subjt:  DQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCITRYEAFEYLVMPFGLTNAPATFCTLMNQVFHEFLDQFVVVYVDNIVIYNTTLEEHKVHLKLA----

Query:  ---------------------------ELVISMVSDKIKTIQEWKIPTLVSNVRSFLGLTNYYRQFVEGFSRRAAPLTELLKKDHPWSWSNECQMAFEDL
                                   +  ISM +DK+K IQEW++PT V  +RSFLGL NYYR+F+EGFSRRA P+TELLKK   W WS E Q AFEDL
Subjt:  ---------------------------ELVISMVSDKIKTIQEWKIPTLVSNVRSFLGLTNYYRQFVEGFSRRAAPLTELLKKDHPWSWSNECQMAFEDL

Query:  KAIMTKGSVLGLVDVTKPFEIEMDASDFALGGILIQEGHPIAYESRKLNDTERRYTLFEKK
        KA M KG VLGL DVTKPFE+E DASD+ALGG+L+Q+ HPI YESRKLN+ ERRYT+ EK+
Subjt:  KAIMTKGSVLGLVDVTKPFEIEMDASDFALGGILIQEGHPIAYESRKLNDTERRYTLFEKK

A0A6J1D906 Reverse transcriptase6.8e-1055.74Show/hide
Query:  MADMNTRIGVTMKAMENVTARQTHIRSNKLKFPDPRPFKENRDAKELENFIFDVKQYFKAT
        M +++TR+ +TM+A+ N    Q ++  NKLK P+P+PF  NRDAK+LENF+FDV+QYFKAT
Subjt:  MADMNTRIGVTMKAMENVTARQTHIRSNKLKFPDPRPFKENRDAKELENFIFDVKQYFKAT

A0A6J1D906 Reverse transcriptase3.2e-17750.52Show/hide
Query:  MADMNTRIGVTMKAMENVTARQTHIRSNKLKFPDPRPFKENRDAKELENFIFDVKQYFKATT-----------DLYL-----------------------
        MA M TRI VTMKA+ENVTA QT+  SNKL+FP+PR FK N+DAKELENFIFDV+QYFKATT            +YL                       
Subjt:  MADMNTRIGVTMKAMENVTARQTHIRSNKLKFPDPRPFKENRDAKELENFIFDVKQYFKATT-----------DLYL-----------------------

Query:  ------RHKSDSSL--NASHRRCKTLVAYK--------------------GAKHRE----------------------------------LIGHHRLVGR
              +   D  L  NA H   + LVA K                    G   ++                                  L+ +    G 
Subjt:  ------RHKSDSSL--NASHRRCKTLVAYK--------------------GAKHRE----------------------------------LIGHHRLVGR

Query:  PQERVEEPIL---PRKRRTYSNGKTNNPE------TNWKHKGLYQTVLDLDARYQGHIREIQT--LQWKV--DPKEII----EKGLMFVDATI-NSRPSK
         +     P     P K  ++ NG  N P       + W  +          +R   H R   T  LQ  +   P ++        L  +  +I  S  +K
Subjt:  PQERVEEPIL---PRKRRTYSNGKTNNPE------TNWKHKGLYQTVLDLDARYQGHIREIQT--LQWKV--DPKEII----EKGLMFVDATI-NSRPSK

Query:  -STLIDSGAIHNFIADQEARRLGLTIENDPEKMKAVNSEALHVVEVSKRVPFKLGTWTGEMDIVVVRMDDFDVVLGMDFLLEHKVIPMPLAKCLVITDHN
          T++D         DQ+  R+G        + K    E   +VE    VPFK+G WTGE+D+VV RMDDFDVVLGM+FLLEHKVIPMPLAKCLVITD N
Subjt:  -STLIDSGAIHNFIADQEARRLGLTIENDPEKMKAVNSEALHVVEVSKRVPFKLGTWTGEMDIVVVRMDDFDVVLGMDFLLEHKVIPMPLAKCLVITDHN

Query:  PTVIPASIKQP--------------------------------------GNLRMISAIQ--LKRGFAREEPTF------------IGFIRPAKAPYGASV
        PTVIPASIKQP                                        + +I  ++   K  +    P               GFIRPAKAPYGA V
Subjt:  PTVIPASIKQP--------------------------------------GNLRMISAIQ--LKRGFAREEPTF------------IGFIRPAKAPYGASV

Query:  LFQKKKDGTLRLCIDYRALNKVTIRNKYPLPIIFDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCITRYEAFEYLVMPFGLTNAPATFCTLMNQV
        LFQKKKDGTLRLCIDYRALNKVT+RNKYPLPII DLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTC+TRY AFE+LVMPFGLTNAPATFCTLMNQV
Subjt:  LFQKKKDGTLRLCIDYRALNKVTIRNKYPLPIIFDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCITRYEAFEYLVMPFGLTNAPATFCTLMNQV

Query:  FHEFLDQFVVVYVDNIVIYNTTLEEHKVHLKLA-------ELV--ISMVSDKIKTIQEWKIPTLVSNVRSFLGLTNYYRQFVEGFSRRAAPLTELLKKDH
        F+E+LDQFV+VY+D+IV+Y+TTLEEHKVHLKL        +L   ISM SDKIK IQEWK+PT VS++RSFLGL NYYR+FVEGFSRRAAPLTELLKKDH
Subjt:  FHEFLDQFVVVYVDNIVIYNTTLEEHKVHLKLA-------ELV--ISMVSDKIKTIQEWKIPTLVSNVRSFLGLTNYYRQFVEGFSRRAAPLTELLKKDH

Query:  PWSWSNECQMAFEDLKAIMTKGSVLGLVDVTKPFEIEMDASDFALGGILIQEGHPIAYESRKLNDTERRYTLFEKK
         WSWS++CQMAFEDLK  MT+G VLGLVDVTKPFEIE DASDFALGG+LIQEGHPIA+ESRKLND ERRYT+ EK+
Subjt:  PWSWSNECQMAFEDLKAIMTKGSVLGLVDVTKPFEIEMDASDFALGGILIQEGHPIAYESRKLNDTERRYTLFEKK

A0A6J1DK29 uncharacterized protein LOC1110218298.7e-18359.75Show/hide
Query:  DLDARYQGHIREIQTLQWKVD-PKEIIEKGLMFVDATINSRPSKSTLIDSGAIHNFIADQEARRLGLTIENDPEKMKAVNSEALHVVEVSKRVPFKLGTW
        D +    G ++ +  +Q +V+ PK   EKGLMFVDA IN  P+KS ++DSGA HNFI++QEA RL LTIE D  KMKAVNSEAL +V VSKRV  KLGTW
Subjt:  DLDARYQGHIREIQTLQWKVD-PKEIIEKGLMFVDATINSRPSKSTLIDSGAIHNFIADQEARRLGLTIENDPEKMKAVNSEALHVVEVSKRVPFKLGTW

Query:  TGEMDIVVVRMDDFDVVLGMDFLLEHKVIPMPLAKCLVITDHNPTVIPASIKQPGNLRMISAIQLKRGFAREEPTFI-----------------------
        TG  D VVVRMDDFDVVLGM+FL+EHKVIPMPLAKC+++T ++PTV+ ASIKQPG +RMISA+QLK+G  REEPTF+                       
Subjt:  TGEMDIVVVRMDDFDVVLGMDFLLEHKVIPMPLAKCLVITDHNPTVIPASIKQPGNLRMISAIQLKRGFAREEPTFI-----------------------

Query:  ----------------------------------------------------GFIRPAKAPYGASVLFQKKKDGTLRLCIDYRALNKVTIRNKYPLPIIF
                                                            GFIRPAKAP+GASVLFQKKKDGTLRLCIDYRALNKVT+RNKYPLPII 
Subjt:  ----------------------------------------------------GFIRPAKAPYGASVLFQKKKDGTLRLCIDYRALNKVTIRNKYPLPIIF

Query:  DLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCITRYEAFEYLVMPFGLTNAPATFCTLMNQVFHEFLDQFVVVYVDNIVIYNTTLEEHKVHLKLA-
        DLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTC+TRY AFE+LVMPFGLTNAPATFCT+MNQVFHE+LDQFVVVY+D+IV+Y+ TL+EH++HL+L  
Subjt:  DLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCITRYEAFEYLVMPFGLTNAPATFCTLMNQVFHEFLDQFVVVYVDNIVIYNTTLEEHKVHLKLA-

Query:  ------------------------------ELVISMVSDKIKTIQEWKIPTLVSNVRSFLGLTNYYRQFVEGFSRRAAPLTELLKKDHPWSWSNECQMAF
                                      +  ISM +DK+KTIQEW++PT V+ +RSFLGL NYYR+F+EGFSRRA P+TELL K     WS E Q AF
Subjt:  ------------------------------ELVISMVSDKIKTIQEWKIPTLVSNVRSFLGLTNYYRQFVEGFSRRAAPLTELLKKDHPWSWSNECQMAF

Query:  EDLKAIMTKGSVLGLVDVTKPFEIEMDASDFALGGILIQEGHPIAYESRKLNDTERRYTLFEKK
        EDLKA M KG VLGL DVTKPFE+E DASD+ALGG+L+Q+ HPIAYESRKLN+ ERRYT+ EK+
Subjt:  EDLKAIMTKGSVLGLVDVTKPFEIEMDASDFALGGILIQEGHPIAYESRKLNDTERRYTLFEKK

A0A6J1DK29 uncharacterized protein LOC1110218291.8e-0750.82Show/hide
Query:  MADMNTRIGVTMKAMENVTARQTHIRSNKLKFPDPRPFKENRDAKELENFIFDVKQYFKAT
        M +++TR+ +TM+A+ N    Q ++  NKLK P+P+PF  NR  K+LENF FDV+QYFK T
Subjt:  MADMNTRIGVTMKAMENVTARQTHIRSNKLKFPDPRPFKENRDAKELENFIFDVKQYFKAT

A0A6J1DK29 uncharacterized protein LOC1110218291.9e-18251.09Show/hide
Query:  MADMNTRIGVTMKAMENVTARQTHIRSNKLKFPDPRPFKENRDAKELENFIFDVKQYFKATT-----------DLYL-----------------------
        MA M TRI VTMKA+ENVTA QT+  SNKL+FP+PR FK NRDAKELENFIFDV+QYFKATT            +YL                       
Subjt:  MADMNTRIGVTMKAMENVTARQTHIRSNKLKFPDPRPFKENRDAKELENFIFDVKQYFKATT-----------DLYL-----------------------

Query:  ------RHKSDSSL--NASHRRCKTLVAYK--------------------GAKHRE----------------------------------LIGHHRLVGR
              +   D  L  NA H   + LVA K                    G   ++                                  L+ +    G 
Subjt:  ------RHKSDSSL--NASHRRCKTLVAYK--------------------GAKHRE----------------------------------LIGHHRLVGR

Query:  PQERVEEPIL---PRKRRTYSNGKTNNPE------TNWKHKGLYQTVLDLDARYQGHIREIQT--LQWKV--DPKEII----EKGLMFVDATI-NSRPSK
         +     P     P K  ++ NG  N P       + W  +          +R   H R   T  LQ  +   P ++        L  +  +I  S  +K
Subjt:  PQERVEEPIL---PRKRRTYSNGKTNNPE------TNWKHKGLYQTVLDLDARYQGHIREIQT--LQWKV--DPKEII----EKGLMFVDATI-NSRPSK

Query:  -STLIDSGAIHNFIADQEARRLGLTIENDPEKMKAVNSEALHVVEVSKRVPFKLGTWTGEMDIVVVRMDDFDVVLGMDFLLEHKVIPMPLAKCLVITDHN
          T++D         DQ+  R+G        + K    E   +VE    VPFK+G WTGE+D+VV RMDDFDVVLGM+FLLEHKVIPMPLAKCLV+TD N
Subjt:  -STLIDSGAIHNFIADQEARRLGLTIENDPEKMKAVNSEALHVVEVSKRVPFKLGTWTGEMDIVVVRMDDFDVVLGMDFLLEHKVIPMPLAKCLVITDHN

Query:  PTVIPASIKQPGNLRMISAIQLKRGFAREEPTFI------------------------------------------------------------------
        PTVIPASIKQPGNLRMIS IQLKRG AREE TF+                                                                  
Subjt:  PTVIPASIKQPGNLRMISAIQLKRGFAREEPTFI------------------------------------------------------------------

Query:  GFIRPAKAPYGASVLFQKKKDGTLRLCIDYRALNKVTIRNKYPLPIIFDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCITRYEAFEYLVMPFGL
        GFIRP KAPY A VLFQKKKDGTL LCIDYRALNKVT+RNK+PLPII DLFDQLHGAKYFTKLDL SGYYQVRIAEGDEP+TTC+TRY AFE+LVMPF L
Subjt:  GFIRPAKAPYGASVLFQKKKDGTLRLCIDYRALNKVTIRNKYPLPIIFDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCITRYEAFEYLVMPFGL

Query:  TNAPATFCTLMNQVFHEFLDQFVVVYVDNIVIYNTTLEEHKVHLKLAELVISMVSDKIKTIQEWKIPTLVSNVRSFLGLTNYYRQFVEGFSRRAAPLTEL
        TNAPATFCTLMNQVF+E+LDQFV+VY+D+IV+Y+TTLEEHKVHLKL    ISM SDKIK IQEWK+PT VS++RSFLGL NYYR+FVEGFSRRAAPL EL
Subjt:  TNAPATFCTLMNQVFHEFLDQFVVVYVDNIVIYNTTLEEHKVHLKLAELVISMVSDKIKTIQEWKIPTLVSNVRSFLGLTNYYRQFVEGFSRRAAPLTEL

Query:  LKKDHPWSWSNECQMAFEDLKAIMTKGSVLGLVDVTKPFEIEMDASDFALGGILIQEGHPIAYESRKLNDTERRYTLFEKK
        LKKDHPW WSN+CQMAFEDLK  M  G VLGLVDVTKPFEIE DASDFALGG+LIQEGHPIA+ESRKLND ERRY + EKK
Subjt:  LKKDHPWSWSNECQMAFEDLKAIMTKGSVLGLVDVTKPFEIEMDASDFALGGILIQEGHPIAYESRKLNDTERRYTLFEKK

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.66.2e-5334.09Show/hide
Query:  SIKQPGNLRMISAIQLKRGFAREEPTFI------GFIRPAKAPYGASV-LFQKKKDGT----LRLCIDYRALNKVTIRNKYPLPIIFDLFDQLHGAKYFT
        +I    NL + S     + + +E  + I      G IR + +PY + + +  KK+D +     R+ IDYR LN++T+ +++P+P + ++  +L    YFT
Subjt:  SIKQPGNLRMISAIQLKRGFAREEPTFI------GFIRPAKAPYGASV-LFQKKKDGT----LRLCIDYRALNKVTIRNKYPLPIIFDLFDQLHGAKYFT

Query:  KLDLRSGYYQVRIAEGDEPKTTCITRYEAFEYLVMPFGLTNAPATFCTLMNQVFHEFLDQFVVVYVDNIVIYNTTLEEHKVHL-----KLAELVISMVSD
         +DL  G++Q+ +      KT   T++  +EYL MPFGL NAPATF   MN +    L++  +VY+D+I++++T+L+EH   L     KLA+  + +  D
Subjt:  KLDLRSGYYQVRIAEGDEPKTTCITRYEAFEYLVMPFGLTNAPATFCTLMNQVFHEFLDQFVVVYVDNIVIYNTTLEEHKVHL-----KLAELVISMVSD

Query:  --------------------------KIKTIQEWKIPTLVSNVRSFLGLTNYYRQFVEGFSRRAAPLTELLKKDHPWSWSN-ECQMAFEDLKAIMTKGSV
                                  KI+ IQ++ IPT    +++FLGLT YYR+F+  F+  A P+T+ LKK+     +N E   AF+ LK ++++  +
Subjt:  --------------------------KIKTIQEWKIPTLVSNVRSFLGLTNYYRQFVEGFSRRAAPLTELLKKDHPWSWSN-ECQMAFEDLKAIMTKGSV

Query:  LGLVDVTKPFEIEMDASDFALGGILIQEGHPIAYESRKLNDTERRYTLFEKK
        L + D TK F +  DASD ALG +L Q+GHP++Y SR LN+ E  Y+  EK+
Subjt:  LGLVDVTKPFEIEMDASDFALGGILIQEGHPIAYESRKLNDTERRYTLFEKK

P0CT42 Transposon Tf2-7 polyprotein1.2e-4835.96Show/hide
Query:  GFIRPAKAPYGASVLFQKKKDGTLRLCIDYRALNKVTIRNKYPLPIIFDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCITRYEAFEYLVMPFGL
        G IR +KA     V+F  KK+GTLR+ +DY+ LNK    N YPLP+I  L  ++ G+  FTKLDL+S Y+ +R+ +GDE K         FEYLVMP+G+
Subjt:  GFIRPAKAPYGASVLFQKKKDGTLRLCIDYRALNKVTIRNKYPLPIIFDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCITRYEAFEYLVMPFGL

Query:  TNAPATFCTLMNQVFHEFLDQFVVVYVDNIVIYNTTLEEHKVH-------LKLAELVISM----------------VSDK--------IKTIQEWKIPTL
        + APA F   +N +  E  +  VV Y+DNI+I++ +  EH  H       LK A L+I+                 +S+K        I  + +WK P  
Subjt:  TNAPATFCTLMNQVFHEFLDQFVVVYVDNIVIYNTTLEEHKVH-------LKLAELVISM----------------VSDK--------IKTIQEWKIPTL

Query:  VSNVRSFLGLTNYYRQFVEGFSRRAAPLTELLKKDHPWSWSNECQMAFEDLKAIMTKGSVLGLVDVTKPFEIEMDASDFALGGILIQEG-----HPIAYE
           +R FLG  NY R+F+   S+   PL  LLKKD  W W+     A E++K  +    VL   D +K   +E DASD A+G +L Q+      +P+ Y 
Subjt:  VSNVRSFLGLTNYYRQFVEGFSRRAAPLTELLKKDHPWSWSNECQMAFEDLKAIMTKGSVLGLVDVTKPFEIEMDASDFALGGILIQEG-----HPIAYE

Query:  SRKLNDTERRYTLFEKK
        S K++  +  Y++ +K+
Subjt:  SRKLNDTERRYTLFEKK

P0CT43 Transposon Tf2-8 polyprotein1.2e-4835.96Show/hide
Query:  GFIRPAKAPYGASVLFQKKKDGTLRLCIDYRALNKVTIRNKYPLPIIFDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCITRYEAFEYLVMPFGL
        G IR +KA     V+F  KK+GTLR+ +DY+ LNK    N YPLP+I  L  ++ G+  FTKLDL+S Y+ +R+ +GDE K         FEYLVMP+G+
Subjt:  GFIRPAKAPYGASVLFQKKKDGTLRLCIDYRALNKVTIRNKYPLPIIFDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCITRYEAFEYLVMPFGL

Query:  TNAPATFCTLMNQVFHEFLDQFVVVYVDNIVIYNTTLEEHKVH-------LKLAELVISM----------------VSDK--------IKTIQEWKIPTL
        + APA F   +N +  E  +  VV Y+DNI+I++ +  EH  H       LK A L+I+                 +S+K        I  + +WK P  
Subjt:  TNAPATFCTLMNQVFHEFLDQFVVVYVDNIVIYNTTLEEHKVH-------LKLAELVISM----------------VSDK--------IKTIQEWKIPTL

Query:  VSNVRSFLGLTNYYRQFVEGFSRRAAPLTELLKKDHPWSWSNECQMAFEDLKAIMTKGSVLGLVDVTKPFEIEMDASDFALGGILIQEG-----HPIAYE
           +R FLG  NY R+F+   S+   PL  LLKKD  W W+     A E++K  +    VL   D +K   +E DASD A+G +L Q+      +P+ Y 
Subjt:  VSNVRSFLGLTNYYRQFVEGFSRRAAPLTELLKKDHPWSWSNECQMAFEDLKAIMTKGSVLGLVDVTKPFEIEMDASDFALGGILIQEG-----HPIAYE

Query:  SRKLNDTERRYTLFEKK
        S K++  +  Y++ +K+
Subjt:  SRKLNDTERRYTLFEKK

P20825 Retrovirus-related Pol polyprotein from transposon 2971.2e-5130.7Show/hide
Query:  KGLMFVDATINSRPSKSTLIDSGAIHNFIADQEARRLGLTIENDPEKMKAVNSEALHVVEVSKRVPFKLGTWTGEMDIVVVRMDDFDVVLGMDFLLEHKV
        K L    + IN +    TL D    +  I  +  R   L I+  PE + + + E++  ++ S+   F+L     E    +  +   +    +++    K+
Subjt:  KGLMFVDATINSRPSKSTLIDSGAIHNFIADQEARRLGLTIENDPEKMKAVNSEALHVVEVSKRVPFKLGTWTGEMDIVVVRMDDFDVVLGMDFLLEHKV

Query:  IPMPLAKCLVITDHNPTVIPAS--IKQPGNLRMISAIQLKRGFAREEPTFIGFIRPAKAPYGASV-LFQKKKDGT----LRLCIDYRALNKVTIRNKYPL
              K ++ T HN  +      + Q   + + + +Q        E    G IR + +PY +   +  KK D +     R+ IDYR LN++TI ++YP+
Subjt:  IPMPLAKCLVITDHNPTVIPAS--IKQPGNLRMISAIQLKRGFAREEPTFIGFIRPAKAPYGASV-LFQKKKDGT----LRLCIDYRALNKVTIRNKYPL

Query:  PIIFDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCITRYEAFEYLVMPFGLTNAPATFCTLMNQVFHEFLDQFVVVYVDNIVIYNTTLEEH----
        P + ++  +L   +YFT +DL  G++Q+ + E    KT   T+   +EYL MPFGL NAPATF   MN +    L++  +VY+D+I+I++T+L EH    
Subjt:  PIIFDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCITRYEAFEYLVMPFGLTNAPATFCTLMNQVFHEFLDQFVVVYVDNIVIYNTTLEEH----

Query:  -KVHLKLAELVISMVSD--------------------------KIKTIQEWKIPTLVSNVRSFLGLTNYYRQFVEGFSRRAAPLTELLKKDHPWSWSN-E
          V  KLA+  + +  D                          K+K I  + IPT    +R+FLGLT YYR+F+  ++  A P+T  LKK         E
Subjt:  -KVHLKLAELVISMVSD--------------------------KIKTIQEWKIPTLVSNVRSFLGLTNYYRQFVEGFSRRAAPLTELLKKDHPWSWSN-E

Query:  CQMAFEDLKAIMTKGSVLGLVDVTKPFEIEMDASDFALGGILIQEGHPIAYESRKLNDTERRYTLFEKK
           AFE LKA++ +  +L L D  K F +  DAS+ ALG +L Q GHPI++ SR LND E  Y+  EK+
Subjt:  CQMAFEDLKAIMTKGSVLGLVDVTKPFEIEMDASDFALGGILIQEGHPIAYESRKLNDTERRYTLFEKK

Q9UR07 Transposon Tf2-11 polyprotein1.2e-4835.96Show/hide
Query:  GFIRPAKAPYGASVLFQKKKDGTLRLCIDYRALNKVTIRNKYPLPIIFDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCITRYEAFEYLVMPFGL
        G IR +KA     V+F  KK+GTLR+ +DY+ LNK    N YPLP+I  L  ++ G+  FTKLDL+S Y+ +R+ +GDE K         FEYLVMP+G+
Subjt:  GFIRPAKAPYGASVLFQKKKDGTLRLCIDYRALNKVTIRNKYPLPIIFDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCITRYEAFEYLVMPFGL

Query:  TNAPATFCTLMNQVFHEFLDQFVVVYVDNIVIYNTTLEEHKVH-------LKLAELVISM----------------VSDK--------IKTIQEWKIPTL
        + APA F   +N +  E  +  VV Y+DNI+I++ +  EH  H       LK A L+I+                 +S+K        I  + +WK P  
Subjt:  TNAPATFCTLMNQVFHEFLDQFVVVYVDNIVIYNTTLEEHKVH-------LKLAELVISM----------------VSDK--------IKTIQEWKIPTL

Query:  VSNVRSFLGLTNYYRQFVEGFSRRAAPLTELLKKDHPWSWSNECQMAFEDLKAIMTKGSVLGLVDVTKPFEIEMDASDFALGGILIQEG-----HPIAYE
           +R FLG  NY R+F+   S+   PL  LLKKD  W W+     A E++K  +    VL   D +K   +E DASD A+G +L Q+      +P+ Y 
Subjt:  VSNVRSFLGLTNYYRQFVEGFSRRAAPLTELLKKDHPWSWSNECQMAFEDLKAIMTKGSVLGLVDVTKPFEIEMDASDFALGGILIQEG-----HPIAYE

Query:  SRKLNDTERRYTLFEKK
        S K++  +  Y++ +K+
Subjt:  SRKLNDTERRYTLFEKK

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein3.5e-1140.96Show/hide
Query:  KIKTIQEWKIPTLVSNVRSFLGLTNYYRQFVEGFSRRAAPLTELLKKDHPWSWSNECQMAFEDLKAIMTKGSVLGLVDVTKPF
        K++ +  W  P   + +R FLGLT YYR+FV+ + +   PLTELLKK +   W+    +AF+ LK  +T   VL L D+  PF
Subjt:  KIKTIQEWKIPTLVSNVRSFLGLTNYYRQFVEGFSRRAAPLTELLKKDHPWSWSNECQMAFEDLKAIMTKGSVLGLVDVTKPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGACATGAACACCCGAATTGGGGTGACCATGAAAGCCATGGAGAACGTCACGGCTAGGCAAACCCATATAAGATCCAACAAACTGAAGTTCCCAGACCCCAGACC
CTTTAAAGAGAACCGGGACGCCAAAGAGTTGGAGAACTTCATCTTTGATGTCAAACAGTACTTCAAAGCCACGACTGACTTGTACCTACGACATAAAAGTGACAGTAGCC
TCAATGCATCTCATAGACGATGCAAAACTTTGGTGGCGTACAAAGGTGCAAAACATCGAGAATTGATTGGGCACCATCGACTCGTGGGAAGACCTCAAGAGAGAGTTGAG
GAACCAATTCTTCCCCGAAAACGTAGAACATATAGCAATGGAAAAACTAATAACCCTGAAACAAACTGGAAGCATAAGGGACTATATCAGACAGTTCTCGACCTTGATGC
TAGATATCAGGGGCACATCAGAGAAATACAAACCCTCCAATGGAAGGTCGACCCAAAGGAGATAATAGAGAAAGGACTCATGTTTGTAGATGCAACAATAAACTCCCGAC
CAAGCAAGAGCACCTTGATAGACTCAGGTGCGATCCACAACTTCATCGCCGACCAAGAAGCACGAAGATTGGGACTCACCATAGAAAATGACCCCGAAAAAATGAAAGCT
GTCAACTCCGAGGCCTTGCATGTTGTGGAAGTTTCTAAAAGAGTCCCCTTCAAATTAGGGACATGGACAGGAGAAATGGATATTGTCGTGGTTCGCATGGACGACTTCGA
CGTGGTACTTGGGATGGACTTCCTCCTAGAACACAAAGTTATTCCAATGCCACTGGCGAAATGCTTGGTGATCACCGACCACAACCCCACAGTGATACCTGCAAGCATCA
AGCAACCAGGCAATCTTAGAATGATCTCGGCCATACAATTAAAAAGAGGATTCGCACGAGAGGAACCTACATTTATAGGATTCATCCGTCCAGCAAAGGCACCTTACGGA
GCCTCCGTATTATTTCAGAAAAAGAAGGATGGGACGTTGCGTCTGTGCATTGATTATAGAGCCTTAAACAAGGTGACGATACGCAACAAATACCCACTGCCGATAATATT
CGACTTATTTGACCAGCTACACGGGGCCAAATACTTCACAAAGTTGGACTTACGATCAGGGTACTACCAAGTACGTATCGCCGAAGGGGACGAGCCCAAGACGACGTGTA
TAACAAGATATGAGGCCTTTGAATACCTGGTAATGCCCTTTGGTTTGACAAATGCCCCAGCTACGTTTTGCACGTTAATGAACCAGGTTTTCCACGAATTCTTGGATCAG
TTCGTCGTAGTATACGTTGACAACATAGTTATATACAACACAACCTTGGAGGAACACAAAGTGCACCTGAAGCTAGCAGAATTAGTTATCAGTATGGTTAGTGATAAGAT
AAAAACTATCCAAGAGTGGAAAATCCCTACTTTAGTATCTAATGTGCGATCCTTCCTAGGATTGACAAACTACTATAGGCAGTTCGTCGAAGGATTTTCACGACGAGCCG
CCCCATTAACAGAACTCTTGAAGAAAGACCACCCTTGGTCGTGGTCAAATGAATGTCAAATGGCCTTTGAAGATCTGAAGGCAATCATGACGAAGGGTTCTGTCCTCGGA
TTAGTAGATGTTACTAAACCATTTGAAATAGAGATGGATGCTTCCGACTTTGCCCTAGGCGGTATCCTTATTCAAGAAGGCCACCCCATCGCTTATGAAAGTCGAAAGCT
CAATGATACTGAAAGGAGGTACACTCTTTTCGAAAAGAAATGTCGGATGTGGTCCATTGCCTACGAGTCTGGAGACAGTACCTCTTGGGATCATAGTTCGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGTGACGGAATTGAATGGAAAAGTCGTACAAATCGACGCAATGGACAACTGCCTGGATGGGTTGCCAATTGCAGAACTAATGTTTTGAGTGGCCTCGCTCGAAGAAA
GAGTTACTCCTACGATCAGCTCAAGACCCTTTGAACAACGATTGATGTCATCCAAGAAAAGATGGCCGACATGAACACCCGAATTGGGGTGACCATGAAAGCCATGGAGA
ACGTCACGGCTAGGCAAACCCATATAAGATCCAACAAACTGAAGTTCCCAGACCCCAGACCCTTTAAAGAGAACCGGGACGCCAAAGAGTTGGAGAACTTCATCTTTGAT
GTCAAACAGTACTTCAAAGCCACGACTGACTTGTACCTACGACATAAAAGTGACAGTAGCCTCAATGCATCTCATAGACGATGCAAAACTTTGGTGGCGTACAAAGGTGC
AAAACATCGAGAATTGATTGGGCACCATCGACTCGTGGGAAGACCTCAAGAGAGAGTTGAGGAACCAATTCTTCCCCGAAAACGTAGAACATATAGCAATGGAAAAACTA
ATAACCCTGAAACAAACTGGAAGCATAAGGGACTATATCAGACAGTTCTCGACCTTGATGCTAGATATCAGGGGCACATCAGAGAAATACAAACCCTCCAATGGAAGGTC
GACCCAAAGGAGATAATAGAGAAAGGACTCATGTTTGTAGATGCAACAATAAACTCCCGACCAAGCAAGAGCACCTTGATAGACTCAGGTGCGATCCACAACTTCATCGC
CGACCAAGAAGCACGAAGATTGGGACTCACCATAGAAAATGACCCCGAAAAAATGAAAGCTGTCAACTCCGAGGCCTTGCATGTTGTGGAAGTTTCTAAAAGAGTCCCCT
TCAAATTAGGGACATGGACAGGAGAAATGGATATTGTCGTGGTTCGCATGGACGACTTCGACGTGGTACTTGGGATGGACTTCCTCCTAGAACACAAAGTTATTCCAATG
CCACTGGCGAAATGCTTGGTGATCACCGACCACAACCCCACAGTGATACCTGCAAGCATCAAGCAACCAGGCAATCTTAGAATGATCTCGGCCATACAATTAAAAAGAGG
ATTCGCACGAGAGGAACCTACATTTATAGGATTCATCCGTCCAGCAAAGGCACCTTACGGAGCCTCCGTATTATTTCAGAAAAAGAAGGATGGGACGTTGCGTCTGTGCA
TTGATTATAGAGCCTTAAACAAGGTGACGATACGCAACAAATACCCACTGCCGATAATATTCGACTTATTTGACCAGCTACACGGGGCCAAATACTTCACAAAGTTGGAC
TTACGATCAGGGTACTACCAAGTACGTATCGCCGAAGGGGACGAGCCCAAGACGACGTGTATAACAAGATATGAGGCCTTTGAATACCTGGTAATGCCCTTTGGTTTGAC
AAATGCCCCAGCTACGTTTTGCACGTTAATGAACCAGGTTTTCCACGAATTCTTGGATCAGTTCGTCGTAGTATACGTTGACAACATAGTTATATACAACACAACCTTGG
AGGAACACAAAGTGCACCTGAAGCTAGCAGAATTAGTTATCAGTATGGTTAGTGATAAGATAAAAACTATCCAAGAGTGGAAAATCCCTACTTTAGTATCTAATGTGCGA
TCCTTCCTAGGATTGACAAACTACTATAGGCAGTTCGTCGAAGGATTTTCACGACGAGCCGCCCCATTAACAGAACTCTTGAAGAAAGACCACCCTTGGTCGTGGTCAAA
TGAATGTCAAATGGCCTTTGAAGATCTGAAGGCAATCATGACGAAGGGTTCTGTCCTCGGATTAGTAGATGTTACTAAACCATTTGAAATAGAGATGGATGCTTCCGACT
TTGCCCTAGGCGGTATCCTTATTCAAGAAGGCCACCCCATCGCTTATGAAAGTCGAAAGCTCAATGATACTGAAAGGAGGTACACTCTTTTCGAAAAGAAATGTCGGATG
TGGTCCATTGCCTACGAGTCTGGAGACAGTACCTCTTGGGATCATAGTTCGTAG
Protein sequenceShow/hide protein sequence
MADMNTRIGVTMKAMENVTARQTHIRSNKLKFPDPRPFKENRDAKELENFIFDVKQYFKATTDLYLRHKSDSSLNASHRRCKTLVAYKGAKHRELIGHHRLVGRPQERVE
EPILPRKRRTYSNGKTNNPETNWKHKGLYQTVLDLDARYQGHIREIQTLQWKVDPKEIIEKGLMFVDATINSRPSKSTLIDSGAIHNFIADQEARRLGLTIENDPEKMKA
VNSEALHVVEVSKRVPFKLGTWTGEMDIVVVRMDDFDVVLGMDFLLEHKVIPMPLAKCLVITDHNPTVIPASIKQPGNLRMISAIQLKRGFAREEPTFIGFIRPAKAPYG
ASVLFQKKKDGTLRLCIDYRALNKVTIRNKYPLPIIFDLFDQLHGAKYFTKLDLRSGYYQVRIAEGDEPKTTCITRYEAFEYLVMPFGLTNAPATFCTLMNQVFHEFLDQ
FVVVYVDNIVIYNTTLEEHKVHLKLAELVISMVSDKIKTIQEWKIPTLVSNVRSFLGLTNYYRQFVEGFSRRAAPLTELLKKDHPWSWSNECQMAFEDLKAIMTKGSVLG
LVDVTKPFEIEMDASDFALGGILIQEGHPIAYESRKLNDTERRYTLFEKKCRMWSIAYESGDSTSWDHSS