; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0021324 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0021324
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr02:7989271..7991851
RNA-Seq ExpressionPay0021324
SyntenyPay0021324
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN66690.1 hypothetical protein VITISV_023209 [Vitis vinifera]5.2e-17147.21Show/hide
Query:  MASKNFVQPAIPRFDGHYDHLKMLMENFLRSKEYWEVVSDGM--PEITAAMTDEQKTEVEGTKLKDLKAKNYLFQAIDRSILETILYKDTSKQIWDSMKN
        MAS+NFVQPAIPRFDGHYD+  MLMENFLRSKEYW+VVS G+  P   + MTD QKTE+EG +LKDLKAKN LFQ IDRSILETIL KDTS+QIWDSMK 
Subjt:  MASKNFVQPAIPRFDGHYDHLKMLMENFLRSKEYWEVVSDGM--PEITAAMTDEQKTEVEGTKLKDLKAKNYLFQAIDRSILETILYKDTSKQIWDSMKN

Query:  KYEGSAKAKRQQLKTLCTEFENLKMKSGESIAIYFSRVMEITNKMRMFRDKSEDIIIIEKIFRSLTPKFNFVVCAIEESKNIDDLSLDELQSSLLVHERK
        KY+GS + KRQ L+ L +EFE L+MK GES++ YFSR M I NKMR+  +K ED+ +IEKI  S+TPKFN+VVC+IEESK++D+LS+DELQ SLLVHE+K
Subjt:  KYEGSAKAKRQQLKTLCTEFENLKMKSGESIAIYFSRVMEITNKMRMFRDKSEDIIIIEKIFRSLTPKFNFVVCAIEESKNIDDLSLDELQSSLLVHERK

Query:  LKQQDNEEQALKA------------------------SIETPSSSKGGRGR-------------------------------------------------
        + Q+D EEQALKA                         +      +GGRG                                                  
Subjt:  LKQQDNEEQALKA------------------------SIETPSSSKGGRGR-------------------------------------------------

Query:  ----GKGKITLQTKGDIIHTISNDLFIPDLKTNLLSVGQLQEKGYEIFIKNG--------------------------------------------WTQN
            GKG I ++TK   + TISN  ++PDLK+NL S GQL EKGY I I+ G                                            W   
Subjt:  ----GKGKITLQTKGDIIHTISNDLFIPDLKTNLLSVGQLQEKGYEIFIKNG--------------------------------------------WTQN

Query:  IAAEKYG--------------DTIKSHSEICEDCVVGKQHRDSFPTGKSWRAKHVFELIHSDLCGPINPTSNVFKSFKAKVEKEAGMAIKILHSDRGCEY
             +G                I   S++CE+CVVGKQHR  FP GKS RAK+V EL  S+         + FKSFKA+VEKE G +IKIL +DRG EY
Subjt:  IAAEKYG--------------DTIKSHSEICEDCVVGKQHRDSFPTGKSWRAKHVFELIHSDLCGPINPTSNVFKSFKAKVEKEAGMAIKILHSDRGCEY

Query:  NSQEFENFYEEHDIKRQLTTAYTPQQNDISERKNRTIMNMVRCLLKSSEVQKTSWPEAVNWSIHVLNRSPTFAVQNQTPEEAWSGQKPNIDHFRIFGYIA
         S EFE+F ++  I+R+LT AYTPQQN +SERKNRTI+NMVR LL   ++ K+ WPEAVNWSIHVLNRSPTF VQN TPEEAWSG KP +DHF+IFG IA
Subjt:  NSQEFENFYEEHDIKRQLTTAYTPQQNDISERKNRTIMNMVRCLLKSSEVQKTSWPEAVNWSIHVLNRSPTFAVQNQTPEEAWSGQKPNIDHFRIFGYIA

Query:  YAHVPNKEDHNVVFNEAEFWNHEECK-PDQRIQVDFENGDEGTRQQI-------------ETNDEATI--TQEIPAVSGV---ERAHRVKRKPAWMEDYV
        YAHVP+++   +          E+C    Q  QV F+N  E  RQQ+               ND  T   T    A S V    R  RV+++PAWM+D+ 
Subjt:  YAHVPNKEDHNVVFNEAEFWNHEECK-PDQRIQVDFENGDEGTRQQI-------------ETNDEATI--TQEIPAVSGV---ERAHRVKRKPAWMEDYV

Query:  VTGMDHSD-DPVVHFALFADCDPVTFEEAVQKPKWQKTMNDEIAAIERNNSW
        VTG+   + D + H+AL +DCDP+TF+EA++  KW K MN+EI +IE+NNSW
Subjt:  VTGMDHSD-DPVVHFALFADCDPVTFEEAVQKPKWQKTMNDEIAAIERNNSW

CAN74536.1 hypothetical protein VITISV_023111 [Vitis vinifera]2.4e-16043.27Show/hide
Query:  MASKNFVQPAIPRFDGHYDHLKMLMENFLRSKEYWEVVSDGM--PEITAAMTDEQKTEVEGTKLKDLKAKNYLFQAIDRSILETILYKDTSKQIWDSMKN
        MAS+NFVQPAIPRFDGHYD+  MLMENFLRSKEYW+VVS G+  P   + MTD QKTE+EG +LKDLKAKNYLFQAIDRSILETIL KDTS+QIWDSMK 
Subjt:  MASKNFVQPAIPRFDGHYDHLKMLMENFLRSKEYWEVVSDGM--PEITAAMTDEQKTEVEGTKLKDLKAKNYLFQAIDRSILETILYKDTSKQIWDSMKN

Query:  KYEGSAKAKRQQLKTLCTEFENLKMKSGESIAIYFSRVMEITNKMRMFRDKSEDIIIIEKIFRSLTPKFNFVVCAIEESKNIDDLSLDELQSSLLVHERK
        KY+GS + KRQQL+ L +EFE L+MK GES++ YFSR M I NKMR+  +K ED+ +IEKI RS+TPKFN+VVC+IEESK++D+LS+DELQ SLLVHE+K
Subjt:  KYEGSAKAKRQQLKTLCTEFENLKMKSGESIAIYFSRVMEITNKMRMFRDKSEDIIIIEKIFRSLTPKFNFVVCAIEESKNIDDLSLDELQSSLLVHERK

Query:  LKQQDNEEQALKASIETPSSS-------------------KGGRGR------------------------------------------------------
        + Q+D EEQALKAS    + +                    GGRGR                                                      
Subjt:  LKQQDNEEQALKASIETPSSS-------------------KGGRGR------------------------------------------------------

Query:  ---------------------------------------GKGKITLQTKGDIIHTISNDLFIPDLKTNLLSVGQLQEKGYEIFIKNG-------------
                                               GKG I ++TK   + TIS   ++PDLK+NLLS GQLQEKGY I I+ G             
Subjt:  ---------------------------------------GKGKITLQTKGDIIHTISNDLFIPDLKTNLLSVGQLQEKGYEIFIKNG-------------

Query:  -------------------------------WTQNIAAEKYG--------------DTIKSHSEICEDCVVGKQHRDSFPTGKSWRAKHVFELIHSDLCG
                                       W        +G                I   S++CE+CVVGKQHR  FP GKS RAK+           
Subjt:  -------------------------------WTQNIAAEKYG--------------DTIKSHSEICEDCVVGKQHRDSFPTGKSWRAKHVFELIHSDLCG

Query:  PINPTSNVFKSFKAKVEKEAGMAIKILHSDRGCEYNSQEFENFYEEHDIKRQLTTAYTPQQNDISERKNRTIMNMVRCLLKSSEVQKTSWPEAVNWSIHV
          +   + FKSFKA+VEKE G +IKIL +DRG EY S EFE+F ++  I+R+LT AYTPQQN +SERKNRTI+NMVR LL   ++ K+ WP AVNWSIHV
Subjt:  PINPTSNVFKSFKAKVEKEAGMAIKILHSDRGCEYNSQEFENFYEEHDIKRQLTTAYTPQQNDISERKNRTIMNMVRCLLKSSEVQKTSWPEAVNWSIHV

Query:  LNRSPTFAVQNQTPEEAWSGQKPNIDH----------------FRIFGYIAYAHVPNKEDHNVVFNEAEFWNHEECKPDQRIQVDFENGDEGTRQQI---
        LNRSPTF+VQN TPEEAW+ ++  +D                 +++F  +    V +++   V+F E   WN     P    QV F+N  E  RQQ+   
Subjt:  LNRSPTFAVQNQTPEEAWSGQKPNIDH----------------FRIFGYIAYAHVPNKEDHNVVFNEAEFWNHEECKPDQRIQVDFENGDEGTRQQI---

Query:  ----------ETNDEATITQ--EIPAVSGV---ERAHRVKRKPAWMEDYVVTGMDHSD-DPVVHFALFADCDPVTFEEAVQKPKWQKTMNDEIAAIERNN
                    ND +T T+    PA S V    R  RV+++PAWM+D+ VTG+   + D + H+AL +DCDP+TF+EA++  KW K MN+EI +IE+NN
Subjt:  ----------ETNDEATITQ--EIPAVSGV---ERAHRVKRKPAWMEDYVVTGMDHSD-DPVVHFALFADCDPVTFEEAVQKPKWQKTMNDEIAAIERNN

Query:  SW
        SW
Subjt:  SW

RVW63136.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]7.8e-15143.19Show/hide
Query:  MASKNFVQPAIPRFDGHYDHLKMLMENFLRSKEYWEVVSDGM--PEITAAMTDEQKTEVEGTKLKDLKAKNYLFQAIDRSILETILYKDTSKQIWDSMKN
        MAS+NFVQPAIPRFDGHYD+  MLMENFLRSKEYW+VVS G+  P   + MTD QKTE+EG +LKDLKAKNYLFQAIDRSILETIL KDTS+QIWDSMK 
Subjt:  MASKNFVQPAIPRFDGHYDHLKMLMENFLRSKEYWEVVSDGM--PEITAAMTDEQKTEVEGTKLKDLKAKNYLFQAIDRSILETILYKDTSKQIWDSMKN

Query:  KYEGSAKAKRQQLKTLCTEFENLKMKSGESIAIYFSRVMEITNKMRMFRDKSEDIIIIEKIFRSLTPKFNFVVCAIEESKNIDDLSLDELQSSLLVHERK
        KY+GS + KRQQL+ L +EFE L+MK GES++ YFSR M I NKMR+  +K ED+ +IEKI RS+TPKFN+VVC+IEESK++D+LS+DELQ SLLVHE+K
Subjt:  KYEGSAKAKRQQLKTLCTEFENLKMKSGESIAIYFSRVMEITNKMRMFRDKSEDIIIIEKIFRSLTPKFNFVVCAIEESKNIDDLSLDELQSSLLVHERK

Query:  LKQQDNEEQALKASI-----------------------------ETPSSSKGGRGR--------------------------------------------
        + Q+D EEQALKAS                              + P     GRGR                                            
Subjt:  LKQQDNEEQALKASI-----------------------------ETPSSSKGGRGR--------------------------------------------

Query:  -----------------------------------------------------------GKGKITLQTKGDIIHTISNDLFIPDLKTNLLSVGQLQEKGY
                                                                   GKG I ++TK   + TISN  ++ DLK+NLLS GQLQEKGY
Subjt:  -----------------------------------------------------------GKGKITLQTKGDIIHTISNDLFIPDLKTNLLSVGQLQEKGY

Query:  EIFIKNG--------------------------------------------WTQNIAAEKYG--------------DTIKSHSEICEDCVVGKQHRDSFP
         I I+ G                                            W        +G                I   S++CE+CV+GKQHR  FP
Subjt:  EIFIKNG--------------------------------------------WTQNIAAEKYG--------------DTIKSHSEICEDCVVGKQHRDSFP

Query:  TGKSWRAKHVFELIHSDLCGPINPTSN------------------------------VFKSFKAKVEKEAGMAIKILHSDRGCEYNSQEFENFYEEHDIK
         GKS RAK+V EL+HSD+CGPINPTSN                               FKSFKA+VEKE G +IKIL +DRG EY S EFE+F ++  I+
Subjt:  TGKSWRAKHVFELIHSDLCGPINPTSN------------------------------VFKSFKAKVEKEAGMAIKILHSDRGCEYNSQEFENFYEEHDIK

Query:  RQLTTAYTPQQNDISERKNRTIMNMVRCLLKSSEVQKTSWPEAVNWSIHVLNRSPTFAVQNQTPEEAWSGQKPNIDHFRIFGYIAYAHVPNKE-------
        R+LT AYTPQQN +SERKNRTI+NMVR LL   ++ K+ WP+AVNWSIHVLNRSPTF+VQN TPEEAWSG+KP +DHF+IFG IAYAHVP+++       
Subjt:  RQLTTAYTPQQNDISERKNRTIMNMVRCLLKSSEVQKTSWPEAVNWSIHVLNRSPTFAVQNQTPEEAWSGQKPNIDHFRIFGYIAYAHVPNKE-------

Query:  ----------------------------DHNVVFNEAEFWNHEECKPDQRIQVDFENGDEGTRQ
                                      +V+F+E   WN    +P    QV F+N  E  RQ
Subjt:  ----------------------------DHNVVFNEAEFWNHEECKPDQRIQVDFENGDEGTRQ

RVW92024.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]3.7e-18544.15Show/hide
Query:  MASKNFVQPAIPRFDGHYDHLKMLMENFLRSKEYWEVVSDGMPEIT--AAMTDEQKTEVEGTKLKDLKAKNYLFQAIDRSILETILYKDTSKQIWDSMKN
        MAS+ FVQPAIPRFDGHYDH  MLMENFLRSKEYW VVS+G+ E T  A MT  Q+TE++G +LKDLKAKNYLFQAIDRSILETIL KDTSK IWDSMK 
Subjt:  MASKNFVQPAIPRFDGHYDHLKMLMENFLRSKEYWEVVSDGMPEIT--AAMTDEQKTEVEGTKLKDLKAKNYLFQAIDRSILETILYKDTSKQIWDSMKN

Query:  KYEGSAKAKRQQLKTLCTEFENLKMKSGESIAIYFSRVMEITNKMRMFRDKSEDIIIIEKIFRSLTPKFNFVVCAIEESKNIDDLSLDELQSSLLVHERK
        KY+GSA+AKRQQL+ L TEFE L+M+SGES+  YFSR M I NKMR+  DK+ED+ I+EKI RS+TP FNFVVC+IEES +ID+LS+DELQSSLLVHERK
Subjt:  KYEGSAKAKRQQLKTLCTEFENLKMKSGESIAIYFSRVMEITNKMRMFRDKSEDIIIIEKIFRSLTPKFNFVVCAIEESKNIDDLSLDELQSSLLVHERK

Query:  LKQQDNEEQALKASIETPSSSKGGRGRGK-----------------------------------------------------------------------
          QQ+ EEQALKAS E   +++G RGRG+                                                                       
Subjt:  LKQQDNEEQALKASIETPSSSKGGRGRGK-----------------------------------------------------------------------

Query:  --------------------------------------------------------------GKITLQTKGDI-IHT-------ISNDLFIPDLKTNLLS
                                                                       K+++  KG + IH+       ISN  F+PDLKTNLLS
Subjt:  --------------------------------------------------------------GKITLQTKGDI-IHT-------ISNDLFIPDLKTNLLS

Query:  VGQLQEKGYEIFIKNG-----------------------------WTQNIAAEKYGD------------------------------TIKSHSEICEDCV
        VGQLQEKGYEIFIK+G                              TQN  + K  D                               I++ S+ICE+CV
Subjt:  VGQLQEKGYEIFIKNG-----------------------------WTQNIAAEKYGD------------------------------TIKSHSEICEDCV

Query:  VGKQHRDSFPTGKSWRAKHVFELIHSDLCGPINPTSN------------------------------VFKSFKAKVEKEAGMAIKILHSDRGCEYNSQEF
        VGKQHR  FP GKSWRA  V EL+HSD+CGPINPTSN                               FKSFK  VEKEAG  IKI  SDRG EY SQEF
Subjt:  VGKQHRDSFPTGKSWRAKHVFELIHSDLCGPINPTSN------------------------------VFKSFKAKVEKEAGMAIKILHSDRGCEYNSQEF

Query:  ENFYEEHDIKRQLTTAYTPQQNDISERKNRTIMNMVRCLLKSSEVQKTSWPEAVNWSIHVLNRSPTFAVQNQTPEEAWSGQKPNIDHFRIFGYIAYAHVP
         NF E H I++QLT AY+PQQN +SERKNRTI+NMVR +L    + ++ WPEAV WSIH+LNRSPT  VQN TPEEAW+G+KP+++HFRIFG IAYAH+P
Subjt:  ENFYEEHDIKRQLTTAYTPQQNDISERKNRTIMNMVRCLLKSSEVQKTSWPEAVNWSIHVLNRSPTFAVQNQTPEEAWSGQKPNIDHFRIFGYIAYAHVP

Query:  NKE-----------------------------------DHNVVFNEAEFWNHEECKPDQRIQVDFENGDEGTRQQ----------IETNDEATITQEIPA
        +++                                     +++F+E  FW  ++    Q+IQ DF+  +E  RQQ          I  N+  T  +  P 
Subjt:  NKE-----------------------------------DHNVVFNEAEFWNHEECKPDQRIQVDFENGDEGTRQQ----------IETNDEATITQEIPA

Query:  VSGVER---------AHRVKRKPAWMEDYVVTGMDHSDDPVVHFALFADCDPVTFEEAVQKPKWQKTMNDEIAAIERNNSW
            +          +HRV+++PAWM DY VTG+D S+DP+ HFALF+DCDP TFE AV++ KW+K M+ EIAAIERN++W
Subjt:  VSGVER---------AHRVKRKPAWMEDYVVTGMDHSDDPVVHFALFADCDPVTFEEAVQKPKWQKTMNDEIAAIERNNSW

XP_016900876.1 PREDICTED: uncharacterized protein LOC107991076 [Cucumis melo]0.0e+00100Show/hide
Query:  MASKNFVQPAIPRFDGHYDHLKMLMENFLRSKEYWEVVSDGMPEITAAMTDEQKTEVEGTKLKDLKAKNYLFQAIDRSILETILYKDTSKQIWDSMKNKY
        MASKNFVQPAIPRFDGHYDHLKMLMENFLRSKEYWEVVSDGMPEITAAMTDEQKTEVEGTKLKDLKAKNYLFQAIDRSILETILYKDTSKQIWDSMKNKY
Subjt:  MASKNFVQPAIPRFDGHYDHLKMLMENFLRSKEYWEVVSDGMPEITAAMTDEQKTEVEGTKLKDLKAKNYLFQAIDRSILETILYKDTSKQIWDSMKNKY

Query:  EGSAKAKRQQLKTLCTEFENLKMKSGESIAIYFSRVMEITNKMRMFRDKSEDIIIIEKIFRSLTPKFNFVVCAIEESKNIDDLSLDELQSSLLVHERKLK
        EGSAKAKRQQLKTLCTEFENLKMKSGESIAIYFSRVMEITNKMRMFRDKSEDIIIIEKIFRSLTPKFNFVVCAIEESKNIDDLSLDELQSSLLVHERKLK
Subjt:  EGSAKAKRQQLKTLCTEFENLKMKSGESIAIYFSRVMEITNKMRMFRDKSEDIIIIEKIFRSLTPKFNFVVCAIEESKNIDDLSLDELQSSLLVHERKLK

Query:  QQDNEEQALKASIETPSSSKGGRGRGKGKITLQTKGDIIHTISNDLFIPDLKTNLLSVGQLQEKGYEIFIKNGWTQNIAAEKYGDTIKSHSEICEDCVVG
        QQDNEEQALKASIETPSSSKGGRGRGKGKITLQTKGDIIHTISNDLFIPDLKTNLLSVGQLQEKGYEIFIKNGWTQNIAAEKYGDTIKSHSEICEDCVVG
Subjt:  QQDNEEQALKASIETPSSSKGGRGRGKGKITLQTKGDIIHTISNDLFIPDLKTNLLSVGQLQEKGYEIFIKNGWTQNIAAEKYGDTIKSHSEICEDCVVG

Query:  KQHRDSFPTGKSWRAKHVFELIHSDLCGPINPTSNVFKSFKAKVEKEAGMAIKILHSDRGCEYNSQEFENFYEEHDIKRQLTTAYTPQQNDISERKNRTI
        KQHRDSFPTGKSWRAKHVFELIHSDLCGPINPTSNVFKSFKAKVEKEAGMAIKILHSDRGCEYNSQEFENFYEEHDIKRQLTTAYTPQQNDISERKNRTI
Subjt:  KQHRDSFPTGKSWRAKHVFELIHSDLCGPINPTSNVFKSFKAKVEKEAGMAIKILHSDRGCEYNSQEFENFYEEHDIKRQLTTAYTPQQNDISERKNRTI

Query:  MNMVRCLLKSSEVQKTSWPEAVNWSIHVLNRSPTFAVQNQTPEEAWSGQKPNIDHFRIFGYIAYAHVPNKEDHNVVFNEAEFWNHEECKPDQRIQVDFEN
        MNMVRCLLKSSEVQKTSWPEAVNWSIHVLNRSPTFAVQNQTPEEAWSGQKPNIDHFRIFGYIAYAHVPNKEDHNVVFNEAEFWNHEECKPDQRIQVDFEN
Subjt:  MNMVRCLLKSSEVQKTSWPEAVNWSIHVLNRSPTFAVQNQTPEEAWSGQKPNIDHFRIFGYIAYAHVPNKEDHNVVFNEAEFWNHEECKPDQRIQVDFEN

Query:  GDEGTRQQIETNDEATITQEIPAVSGVERAHRVKRKPAWMEDYVVTGMDHSDDPVVHFALFADCDPVTFEEAVQKPKWQKTMNDEIAAIERNNSWGID
        GDEGTRQQIETNDEATITQEIPAVSGVERAHRVKRKPAWMEDYVVTGMDHSDDPVVHFALFADCDPVTFEEAVQKPKWQKTMNDEIAAIERNNSWGID
Subjt:  GDEGTRQQIETNDEATITQEIPAVSGVERAHRVKRKPAWMEDYVVTGMDHSDDPVVHFALFADCDPVTFEEAVQKPKWQKTMNDEIAAIERNNSWGID

TrEMBL top hitse value%identityAlignment
A0A1S4DY17 uncharacterized protein LOC1079910760.0e+00100Show/hide
Query:  MASKNFVQPAIPRFDGHYDHLKMLMENFLRSKEYWEVVSDGMPEITAAMTDEQKTEVEGTKLKDLKAKNYLFQAIDRSILETILYKDTSKQIWDSMKNKY
        MASKNFVQPAIPRFDGHYDHLKMLMENFLRSKEYWEVVSDGMPEITAAMTDEQKTEVEGTKLKDLKAKNYLFQAIDRSILETILYKDTSKQIWDSMKNKY
Subjt:  MASKNFVQPAIPRFDGHYDHLKMLMENFLRSKEYWEVVSDGMPEITAAMTDEQKTEVEGTKLKDLKAKNYLFQAIDRSILETILYKDTSKQIWDSMKNKY

Query:  EGSAKAKRQQLKTLCTEFENLKMKSGESIAIYFSRVMEITNKMRMFRDKSEDIIIIEKIFRSLTPKFNFVVCAIEESKNIDDLSLDELQSSLLVHERKLK
        EGSAKAKRQQLKTLCTEFENLKMKSGESIAIYFSRVMEITNKMRMFRDKSEDIIIIEKIFRSLTPKFNFVVCAIEESKNIDDLSLDELQSSLLVHERKLK
Subjt:  EGSAKAKRQQLKTLCTEFENLKMKSGESIAIYFSRVMEITNKMRMFRDKSEDIIIIEKIFRSLTPKFNFVVCAIEESKNIDDLSLDELQSSLLVHERKLK

Query:  QQDNEEQALKASIETPSSSKGGRGRGKGKITLQTKGDIIHTISNDLFIPDLKTNLLSVGQLQEKGYEIFIKNGWTQNIAAEKYGDTIKSHSEICEDCVVG
        QQDNEEQALKASIETPSSSKGGRGRGKGKITLQTKGDIIHTISNDLFIPDLKTNLLSVGQLQEKGYEIFIKNGWTQNIAAEKYGDTIKSHSEICEDCVVG
Subjt:  QQDNEEQALKASIETPSSSKGGRGRGKGKITLQTKGDIIHTISNDLFIPDLKTNLLSVGQLQEKGYEIFIKNGWTQNIAAEKYGDTIKSHSEICEDCVVG

Query:  KQHRDSFPTGKSWRAKHVFELIHSDLCGPINPTSNVFKSFKAKVEKEAGMAIKILHSDRGCEYNSQEFENFYEEHDIKRQLTTAYTPQQNDISERKNRTI
        KQHRDSFPTGKSWRAKHVFELIHSDLCGPINPTSNVFKSFKAKVEKEAGMAIKILHSDRGCEYNSQEFENFYEEHDIKRQLTTAYTPQQNDISERKNRTI
Subjt:  KQHRDSFPTGKSWRAKHVFELIHSDLCGPINPTSNVFKSFKAKVEKEAGMAIKILHSDRGCEYNSQEFENFYEEHDIKRQLTTAYTPQQNDISERKNRTI

Query:  MNMVRCLLKSSEVQKTSWPEAVNWSIHVLNRSPTFAVQNQTPEEAWSGQKPNIDHFRIFGYIAYAHVPNKEDHNVVFNEAEFWNHEECKPDQRIQVDFEN
        MNMVRCLLKSSEVQKTSWPEAVNWSIHVLNRSPTFAVQNQTPEEAWSGQKPNIDHFRIFGYIAYAHVPNKEDHNVVFNEAEFWNHEECKPDQRIQVDFEN
Subjt:  MNMVRCLLKSSEVQKTSWPEAVNWSIHVLNRSPTFAVQNQTPEEAWSGQKPNIDHFRIFGYIAYAHVPNKEDHNVVFNEAEFWNHEECKPDQRIQVDFEN

Query:  GDEGTRQQIETNDEATITQEIPAVSGVERAHRVKRKPAWMEDYVVTGMDHSDDPVVHFALFADCDPVTFEEAVQKPKWQKTMNDEIAAIERNNSWGID
        GDEGTRQQIETNDEATITQEIPAVSGVERAHRVKRKPAWMEDYVVTGMDHSDDPVVHFALFADCDPVTFEEAVQKPKWQKTMNDEIAAIERNNSWGID
Subjt:  GDEGTRQQIETNDEATITQEIPAVSGVERAHRVKRKPAWMEDYVVTGMDHSDDPVVHFALFADCDPVTFEEAVQKPKWQKTMNDEIAAIERNNSWGID

A0A438FT48 Retrovirus-related Pol polyprotein from transposon TNT 1-943.8e-15143.19Show/hide
Query:  MASKNFVQPAIPRFDGHYDHLKMLMENFLRSKEYWEVVSDGM--PEITAAMTDEQKTEVEGTKLKDLKAKNYLFQAIDRSILETILYKDTSKQIWDSMKN
        MAS+NFVQPAIPRFDGHYD+  MLMENFLRSKEYW+VVS G+  P   + MTD QKTE+EG +LKDLKAKNYLFQAIDRSILETIL KDTS+QIWDSMK 
Subjt:  MASKNFVQPAIPRFDGHYDHLKMLMENFLRSKEYWEVVSDGM--PEITAAMTDEQKTEVEGTKLKDLKAKNYLFQAIDRSILETILYKDTSKQIWDSMKN

Query:  KYEGSAKAKRQQLKTLCTEFENLKMKSGESIAIYFSRVMEITNKMRMFRDKSEDIIIIEKIFRSLTPKFNFVVCAIEESKNIDDLSLDELQSSLLVHERK
        KY+GS + KRQQL+ L +EFE L+MK GES++ YFSR M I NKMR+  +K ED+ +IEKI RS+TPKFN+VVC+IEESK++D+LS+DELQ SLLVHE+K
Subjt:  KYEGSAKAKRQQLKTLCTEFENLKMKSGESIAIYFSRVMEITNKMRMFRDKSEDIIIIEKIFRSLTPKFNFVVCAIEESKNIDDLSLDELQSSLLVHERK

Query:  LKQQDNEEQALKASI-----------------------------ETPSSSKGGRGR--------------------------------------------
        + Q+D EEQALKAS                              + P     GRGR                                            
Subjt:  LKQQDNEEQALKASI-----------------------------ETPSSSKGGRGR--------------------------------------------

Query:  -----------------------------------------------------------GKGKITLQTKGDIIHTISNDLFIPDLKTNLLSVGQLQEKGY
                                                                   GKG I ++TK   + TISN  ++ DLK+NLLS GQLQEKGY
Subjt:  -----------------------------------------------------------GKGKITLQTKGDIIHTISNDLFIPDLKTNLLSVGQLQEKGY

Query:  EIFIKNG--------------------------------------------WTQNIAAEKYG--------------DTIKSHSEICEDCVVGKQHRDSFP
         I I+ G                                            W        +G                I   S++CE+CV+GKQHR  FP
Subjt:  EIFIKNG--------------------------------------------WTQNIAAEKYG--------------DTIKSHSEICEDCVVGKQHRDSFP

Query:  TGKSWRAKHVFELIHSDLCGPINPTSN------------------------------VFKSFKAKVEKEAGMAIKILHSDRGCEYNSQEFENFYEEHDIK
         GKS RAK+V EL+HSD+CGPINPTSN                               FKSFKA+VEKE G +IKIL +DRG EY S EFE+F ++  I+
Subjt:  TGKSWRAKHVFELIHSDLCGPINPTSN------------------------------VFKSFKAKVEKEAGMAIKILHSDRGCEYNSQEFENFYEEHDIK

Query:  RQLTTAYTPQQNDISERKNRTIMNMVRCLLKSSEVQKTSWPEAVNWSIHVLNRSPTFAVQNQTPEEAWSGQKPNIDHFRIFGYIAYAHVPNKE-------
        R+LT AYTPQQN +SERKNRTI+NMVR LL   ++ K+ WP+AVNWSIHVLNRSPTF+VQN TPEEAWSG+KP +DHF+IFG IAYAHVP+++       
Subjt:  RQLTTAYTPQQNDISERKNRTIMNMVRCLLKSSEVQKTSWPEAVNWSIHVLNRSPTFAVQNQTPEEAWSGQKPNIDHFRIFGYIAYAHVPNKE-------

Query:  ----------------------------DHNVVFNEAEFWNHEECKPDQRIQVDFENGDEGTRQ
                                      +V+F+E   WN    +P    QV F+N  E  RQ
Subjt:  ----------------------------DHNVVFNEAEFWNHEECKPDQRIQVDFENGDEGTRQ

A0A438I5N6 Retrovirus-related Pol polyprotein from transposon TNT 1-941.8e-18544.15Show/hide
Query:  MASKNFVQPAIPRFDGHYDHLKMLMENFLRSKEYWEVVSDGMPEIT--AAMTDEQKTEVEGTKLKDLKAKNYLFQAIDRSILETILYKDTSKQIWDSMKN
        MAS+ FVQPAIPRFDGHYDH  MLMENFLRSKEYW VVS+G+ E T  A MT  Q+TE++G +LKDLKAKNYLFQAIDRSILETIL KDTSK IWDSMK 
Subjt:  MASKNFVQPAIPRFDGHYDHLKMLMENFLRSKEYWEVVSDGMPEIT--AAMTDEQKTEVEGTKLKDLKAKNYLFQAIDRSILETILYKDTSKQIWDSMKN

Query:  KYEGSAKAKRQQLKTLCTEFENLKMKSGESIAIYFSRVMEITNKMRMFRDKSEDIIIIEKIFRSLTPKFNFVVCAIEESKNIDDLSLDELQSSLLVHERK
        KY+GSA+AKRQQL+ L TEFE L+M+SGES+  YFSR M I NKMR+  DK+ED+ I+EKI RS+TP FNFVVC+IEES +ID+LS+DELQSSLLVHERK
Subjt:  KYEGSAKAKRQQLKTLCTEFENLKMKSGESIAIYFSRVMEITNKMRMFRDKSEDIIIIEKIFRSLTPKFNFVVCAIEESKNIDDLSLDELQSSLLVHERK

Query:  LKQQDNEEQALKASIETPSSSKGGRGRGK-----------------------------------------------------------------------
          QQ+ EEQALKAS E   +++G RGRG+                                                                       
Subjt:  LKQQDNEEQALKASIETPSSSKGGRGRGK-----------------------------------------------------------------------

Query:  --------------------------------------------------------------GKITLQTKGDI-IHT-------ISNDLFIPDLKTNLLS
                                                                       K+++  KG + IH+       ISN  F+PDLKTNLLS
Subjt:  --------------------------------------------------------------GKITLQTKGDI-IHT-------ISNDLFIPDLKTNLLS

Query:  VGQLQEKGYEIFIKNG-----------------------------WTQNIAAEKYGD------------------------------TIKSHSEICEDCV
        VGQLQEKGYEIFIK+G                              TQN  + K  D                               I++ S+ICE+CV
Subjt:  VGQLQEKGYEIFIKNG-----------------------------WTQNIAAEKYGD------------------------------TIKSHSEICEDCV

Query:  VGKQHRDSFPTGKSWRAKHVFELIHSDLCGPINPTSN------------------------------VFKSFKAKVEKEAGMAIKILHSDRGCEYNSQEF
        VGKQHR  FP GKSWRA  V EL+HSD+CGPINPTSN                               FKSFK  VEKEAG  IKI  SDRG EY SQEF
Subjt:  VGKQHRDSFPTGKSWRAKHVFELIHSDLCGPINPTSN------------------------------VFKSFKAKVEKEAGMAIKILHSDRGCEYNSQEF

Query:  ENFYEEHDIKRQLTTAYTPQQNDISERKNRTIMNMVRCLLKSSEVQKTSWPEAVNWSIHVLNRSPTFAVQNQTPEEAWSGQKPNIDHFRIFGYIAYAHVP
         NF E H I++QLT AY+PQQN +SERKNRTI+NMVR +L    + ++ WPEAV WSIH+LNRSPT  VQN TPEEAW+G+KP+++HFRIFG IAYAH+P
Subjt:  ENFYEEHDIKRQLTTAYTPQQNDISERKNRTIMNMVRCLLKSSEVQKTSWPEAVNWSIHVLNRSPTFAVQNQTPEEAWSGQKPNIDHFRIFGYIAYAHVP

Query:  NKE-----------------------------------DHNVVFNEAEFWNHEECKPDQRIQVDFENGDEGTRQQ----------IETNDEATITQEIPA
        +++                                     +++F+E  FW  ++    Q+IQ DF+  +E  RQQ          I  N+  T  +  P 
Subjt:  NKE-----------------------------------DHNVVFNEAEFWNHEECKPDQRIQVDFENGDEGTRQQ----------IETNDEATITQEIPA

Query:  VSGVER---------AHRVKRKPAWMEDYVVTGMDHSDDPVVHFALFADCDPVTFEEAVQKPKWQKTMNDEIAAIERNNSW
            +          +HRV+++PAWM DY VTG+D S+DP+ HFALF+DCDP TFE AV++ KW+K M+ EIAAIERN++W
Subjt:  VSGVER---------AHRVKRKPAWMEDYVVTGMDHSDDPVVHFALFADCDPVTFEEAVQKPKWQKTMNDEIAAIERNNSW

A5AFP3 Integrase catalytic domain-containing protein2.5e-17147.21Show/hide
Query:  MASKNFVQPAIPRFDGHYDHLKMLMENFLRSKEYWEVVSDGM--PEITAAMTDEQKTEVEGTKLKDLKAKNYLFQAIDRSILETILYKDTSKQIWDSMKN
        MAS+NFVQPAIPRFDGHYD+  MLMENFLRSKEYW+VVS G+  P   + MTD QKTE+EG +LKDLKAKN LFQ IDRSILETIL KDTS+QIWDSMK 
Subjt:  MASKNFVQPAIPRFDGHYDHLKMLMENFLRSKEYWEVVSDGM--PEITAAMTDEQKTEVEGTKLKDLKAKNYLFQAIDRSILETILYKDTSKQIWDSMKN

Query:  KYEGSAKAKRQQLKTLCTEFENLKMKSGESIAIYFSRVMEITNKMRMFRDKSEDIIIIEKIFRSLTPKFNFVVCAIEESKNIDDLSLDELQSSLLVHERK
        KY+GS + KRQ L+ L +EFE L+MK GES++ YFSR M I NKMR+  +K ED+ +IEKI  S+TPKFN+VVC+IEESK++D+LS+DELQ SLLVHE+K
Subjt:  KYEGSAKAKRQQLKTLCTEFENLKMKSGESIAIYFSRVMEITNKMRMFRDKSEDIIIIEKIFRSLTPKFNFVVCAIEESKNIDDLSLDELQSSLLVHERK

Query:  LKQQDNEEQALKA------------------------SIETPSSSKGGRGR-------------------------------------------------
        + Q+D EEQALKA                         +      +GGRG                                                  
Subjt:  LKQQDNEEQALKA------------------------SIETPSSSKGGRGR-------------------------------------------------

Query:  ----GKGKITLQTKGDIIHTISNDLFIPDLKTNLLSVGQLQEKGYEIFIKNG--------------------------------------------WTQN
            GKG I ++TK   + TISN  ++PDLK+NL S GQL EKGY I I+ G                                            W   
Subjt:  ----GKGKITLQTKGDIIHTISNDLFIPDLKTNLLSVGQLQEKGYEIFIKNG--------------------------------------------WTQN

Query:  IAAEKYG--------------DTIKSHSEICEDCVVGKQHRDSFPTGKSWRAKHVFELIHSDLCGPINPTSNVFKSFKAKVEKEAGMAIKILHSDRGCEY
             +G                I   S++CE+CVVGKQHR  FP GKS RAK+V EL  S+         + FKSFKA+VEKE G +IKIL +DRG EY
Subjt:  IAAEKYG--------------DTIKSHSEICEDCVVGKQHRDSFPTGKSWRAKHVFELIHSDLCGPINPTSNVFKSFKAKVEKEAGMAIKILHSDRGCEY

Query:  NSQEFENFYEEHDIKRQLTTAYTPQQNDISERKNRTIMNMVRCLLKSSEVQKTSWPEAVNWSIHVLNRSPTFAVQNQTPEEAWSGQKPNIDHFRIFGYIA
         S EFE+F ++  I+R+LT AYTPQQN +SERKNRTI+NMVR LL   ++ K+ WPEAVNWSIHVLNRSPTF VQN TPEEAWSG KP +DHF+IFG IA
Subjt:  NSQEFENFYEEHDIKRQLTTAYTPQQNDISERKNRTIMNMVRCLLKSSEVQKTSWPEAVNWSIHVLNRSPTFAVQNQTPEEAWSGQKPNIDHFRIFGYIA

Query:  YAHVPNKEDHNVVFNEAEFWNHEECK-PDQRIQVDFENGDEGTRQQI-------------ETNDEATI--TQEIPAVSGV---ERAHRVKRKPAWMEDYV
        YAHVP+++   +          E+C    Q  QV F+N  E  RQQ+               ND  T   T    A S V    R  RV+++PAWM+D+ 
Subjt:  YAHVPNKEDHNVVFNEAEFWNHEECK-PDQRIQVDFENGDEGTRQQI-------------ETNDEATI--TQEIPAVSGV---ERAHRVKRKPAWMEDYV

Query:  VTGMDHSD-DPVVHFALFADCDPVTFEEAVQKPKWQKTMNDEIAAIERNNSW
        VTG+   + D + H+AL +DCDP+TF+EA++  KW K MN+EI +IE+NNSW
Subjt:  VTGMDHSD-DPVVHFALFADCDPVTFEEAVQKPKWQKTMNDEIAAIERNNSW

A5BGM4 Integrase catalytic domain-containing protein1.2e-16043.27Show/hide
Query:  MASKNFVQPAIPRFDGHYDHLKMLMENFLRSKEYWEVVSDGM--PEITAAMTDEQKTEVEGTKLKDLKAKNYLFQAIDRSILETILYKDTSKQIWDSMKN
        MAS+NFVQPAIPRFDGHYD+  MLMENFLRSKEYW+VVS G+  P   + MTD QKTE+EG +LKDLKAKNYLFQAIDRSILETIL KDTS+QIWDSMK 
Subjt:  MASKNFVQPAIPRFDGHYDHLKMLMENFLRSKEYWEVVSDGM--PEITAAMTDEQKTEVEGTKLKDLKAKNYLFQAIDRSILETILYKDTSKQIWDSMKN

Query:  KYEGSAKAKRQQLKTLCTEFENLKMKSGESIAIYFSRVMEITNKMRMFRDKSEDIIIIEKIFRSLTPKFNFVVCAIEESKNIDDLSLDELQSSLLVHERK
        KY+GS + KRQQL+ L +EFE L+MK GES++ YFSR M I NKMR+  +K ED+ +IEKI RS+TPKFN+VVC+IEESK++D+LS+DELQ SLLVHE+K
Subjt:  KYEGSAKAKRQQLKTLCTEFENLKMKSGESIAIYFSRVMEITNKMRMFRDKSEDIIIIEKIFRSLTPKFNFVVCAIEESKNIDDLSLDELQSSLLVHERK

Query:  LKQQDNEEQALKASIETPSSS-------------------KGGRGR------------------------------------------------------
        + Q+D EEQALKAS    + +                    GGRGR                                                      
Subjt:  LKQQDNEEQALKASIETPSSS-------------------KGGRGR------------------------------------------------------

Query:  ---------------------------------------GKGKITLQTKGDIIHTISNDLFIPDLKTNLLSVGQLQEKGYEIFIKNG-------------
                                               GKG I ++TK   + TIS   ++PDLK+NLLS GQLQEKGY I I+ G             
Subjt:  ---------------------------------------GKGKITLQTKGDIIHTISNDLFIPDLKTNLLSVGQLQEKGYEIFIKNG-------------

Query:  -------------------------------WTQNIAAEKYG--------------DTIKSHSEICEDCVVGKQHRDSFPTGKSWRAKHVFELIHSDLCG
                                       W        +G                I   S++CE+CVVGKQHR  FP GKS RAK+           
Subjt:  -------------------------------WTQNIAAEKYG--------------DTIKSHSEICEDCVVGKQHRDSFPTGKSWRAKHVFELIHSDLCG

Query:  PINPTSNVFKSFKAKVEKEAGMAIKILHSDRGCEYNSQEFENFYEEHDIKRQLTTAYTPQQNDISERKNRTIMNMVRCLLKSSEVQKTSWPEAVNWSIHV
          +   + FKSFKA+VEKE G +IKIL +DRG EY S EFE+F ++  I+R+LT AYTPQQN +SERKNRTI+NMVR LL   ++ K+ WP AVNWSIHV
Subjt:  PINPTSNVFKSFKAKVEKEAGMAIKILHSDRGCEYNSQEFENFYEEHDIKRQLTTAYTPQQNDISERKNRTIMNMVRCLLKSSEVQKTSWPEAVNWSIHV

Query:  LNRSPTFAVQNQTPEEAWSGQKPNIDH----------------FRIFGYIAYAHVPNKEDHNVVFNEAEFWNHEECKPDQRIQVDFENGDEGTRQQI---
        LNRSPTF+VQN TPEEAW+ ++  +D                 +++F  +    V +++   V+F E   WN     P    QV F+N  E  RQQ+   
Subjt:  LNRSPTFAVQNQTPEEAWSGQKPNIDH----------------FRIFGYIAYAHVPNKEDHNVVFNEAEFWNHEECKPDQRIQVDFENGDEGTRQQI---

Query:  ----------ETNDEATITQ--EIPAVSGV---ERAHRVKRKPAWMEDYVVTGMDHSD-DPVVHFALFADCDPVTFEEAVQKPKWQKTMNDEIAAIERNN
                    ND +T T+    PA S V    R  RV+++PAWM+D+ VTG+   + D + H+AL +DCDP+TF+EA++  KW K MN+EI +IE+NN
Subjt:  ----------ETNDEATITQ--EIPAVSGV---ERAHRVKRKPAWMEDYVVTGMDHSD-DPVVHFALFADCDPVTFEEAVQKPKWQKTMNDEIAAIERNN

Query:  SW
        SW
Subjt:  SW

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.1e-3634.02Show/hide
Query:  VGQLQEKGYEIFIKNGWTQNIAAEKYGDTIKSHSEICEDCVVGKQHRDSFPTGKSWRAKHVFELIHSDLCGPINPTS-----------------------
        +G + EKG +I  K    +++ +   G T+K     C+ C+ GKQHR SF T  S R  ++ +L++SD+CGP+   S                       
Subjt:  VGQLQEKGYEIFIKNGWTQNIAAEKYGDTIKSHSEICEDCVVGKQHRDSFPTGKSWRAKHVFELIHSDLCGPINPTS-----------------------

Query:  -------NVFKSFKAKVEKEAGMAIKILHSDRGCEYNSQEFENFYEEHDIKRQLTTAYTPQQNDISERKNRTIMNMVRCLLKSSEVQKTSWPEAVNWSIH
                VF+ F A VE+E G  +K L SD G EY S+EFE +   H I+ + T   TPQ N ++ER NRTI+  VR +L+ +++ K+ W EAV  + +
Subjt:  -------NVFKSFKAKVEKEAGMAIKILHSDRGCEYNSQEFENFYEEHDIKRQLTTAYTPQQNDISERKNRTIMNMVRCLLKSSEVQKTSWPEAVNWSIH

Query:  VLNRSPTFAVQNQTPEEAWSGQKPNIDHFRIFGYIAYAHVPNKE
        ++NRSP+  +  + PE  W+ ++ +  H ++FG  A+AHVP ++
Subjt:  VLNRSPTFAVQNQTPEEAWSGQKPNIDHFRIFGYIAYAHVPNKE

P25384 Transposon Ty2-C Gag-Pol polyprotein1.2e-0825Show/hide
Query:  CEDCVVGKQHRDSFPTG---KSWRAKHVFELIHSDLCGPIN--PTS------------------------------NVFKSFKAKVEKEAGMAIKILHSD
        C DC++GK  +     G   K   +   F+ +H+D+ GP++  P S                              NVF S  A ++ +    + ++  D
Subjt:  CEDCVVGKQHRDSFPTG---KSWRAKHVFELIHSDLCGPIN--PTS------------------------------NVFKSFKAKVEKEAGMAIKILHSD

Query:  RGCEYNSQEFENFYEEHDIKRQLTTAYTPQQNDISERKNRTIMNMVRCLLKSSEVQKTSWPEAVNWSIHVLN
        RG EY ++    F+    I    TT    + + ++ER NRT++N  R LL  S +    W  AV +S  + N
Subjt:  RGCEYNSQEFENFYEEHDIKRQLTTAYTPQQNDISERKNRTIMNMVRCLLKSSEVQKTSWPEAVNWSIHVLN

Q12491 Transposon Ty2-B Gag-Pol polyprotein1.2e-0825Show/hide
Query:  CEDCVVGKQHRDSFPTG---KSWRAKHVFELIHSDLCGPIN--PTS------------------------------NVFKSFKAKVEKEAGMAIKILHSD
        C DC++GK  +     G   K   +   F+ +H+D+ GP++  P S                              NVF S  A ++ +    + ++  D
Subjt:  CEDCVVGKQHRDSFPTG---KSWRAKHVFELIHSDLCGPIN--PTS------------------------------NVFKSFKAKVEKEAGMAIKILHSD

Query:  RGCEYNSQEFENFYEEHDIKRQLTTAYTPQQNDISERKNRTIMNMVRCLLKSSEVQKTSWPEAVNWSIHVLN
        RG EY ++    F+    I    TT    + + ++ER NRT++N  R LL  S +    W  AV +S  + N
Subjt:  RGCEYNSQEFENFYEEHDIKRQLTTAYTPQQNDISERKNRTIMNMVRCLLKSSEVQKTSWPEAVNWSIHVLN

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE17.2e-1425.59Show/hide
Query:  CEDCVVGKQHRDSFPTGKSWRAKHVFELIHSDLCG--------------------------PINPTSNV---FKSFKAKVEKEAGMAIKILHSDRGCEYN
        C DC++ K ++  F +  +  +    E I+SD+                            P+   S V   F +FK  +E      I   +SD G E+ 
Subjt:  CEDCVVGKQHRDSFPTGKSWRAKHVFELIHSDLCG--------------------------PINPTSNV---FKSFKAKVEKEAGMAIKILHSDRGCEYN

Query:  SQEFENFYEEHDIKRQLTTAYTPQQNDISERKNRTIMNMVRCLLKSSEVQKTSWPEAVNWSIHVLNRSPTFAVQNQTPEEAWSGQKPNIDHFRIFGYIAY
        +     ++ +H I    +  +TP+ N +SERK+R I+     LL  + + KT WP A   +++++NR PT  +Q ++P +   G  PN D  R+FG   Y
Subjt:  SQEFENFYEEHDIKRQLTTAYTPQQNDISERKNRTIMNMVRCLLKSSEVQKTSWPEAVNWSIHVLNRSPTFAVQNQTPEEAWSGQKPNIDHFRIFGYIAY

Query:  AHVPNKEDHNV
          +     H +
Subjt:  AHVPNKEDHNV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.0e-1629.5Show/hide
Query:  CEDCVVGKQHRDSFPTGKSWRAKHVFELIHSDLCG--------------------------PINPTSNV---FKSFKAKVEKEAGMAIKILHSDRGCEYN
        C DC + K H+  F       +K + E I+SD+                            P+   S V   F  FK+ VE      I  L+SD G E+ 
Subjt:  CEDCVVGKQHRDSFPTGKSWRAKHVFELIHSDLCG--------------------------PINPTSNV---FKSFKAKVEKEAGMAIKILHSDRGCEYN

Query:  SQEFENFYEEHDIKRQLTTAYTPQQNDISERKNRTIMNMVRCLLKSSEVQKTSWPEAVNWSIHVLNRSPTFAVQNQTPEEAWSGQKPNIDHFRIFGYIAY
             ++  +H I    +  +TP+ N +SERK+R I+ M   LL  + V KT WP A + +++++NR PT  +Q Q+P +   GQ PN +  ++FG   Y
Subjt:  SQEFENFYEEHDIKRQLTTAYTPQQNDISERKNRTIMNMVRCLLKSSEVQKTSWPEAVNWSIHVLNRSPTFAVQNQTPEEAWSGQKPNIDHFRIFGYIAY

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein6.2e-0530.99Show/hide
Query:  VGQLQEKGYEIFIKNGWTQNIAAEKYGDTIKSHSEICEDCVVGKQHRDSFPTGKSWRAKHVFELIHSDLCG
        +  + ++G E+ +K G+  +           S  + CEDC+ GK HR +F TG+    K+  + +HSDL G
Subjt:  VGQLQEKGYEIFIKNGWTQNIAAEKYGDTIKSHSEICEDCVVGKQHRDSFPTGKSWRAKHVFELIHSDLCG

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein3.5e-0835.71Show/hide
Query:  NRTIMNMVRCLLKSSEVQKTSWPEAVNWSIHVLNRSPTFAVQNQTPEEAWSGQKPNIDHFRIFGYIAYAH
        NRTI+  VR +L    + KT   +A N ++H++N+ P+ A+    P+E W    P   + R FG +AY H
Subjt:  NRTIMNMVRCLLKSSEVQKTSWPEAVNWSIHVLNRSPTFAVQNQTPEEAWSGQKPNIDHFRIFGYIAYAH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCTCTAAAAACTTTGTGCAACCAGCGATTCCTCGTTTCGATGGTCATTACGATCATTTAAAAATGTTGATGGAGAATTTTCTGAGATCAAAAGAATATTGG
GAAGTTGTGTCTGATGGAATGCCAGAAATAACAGCTGCAATGACAGATGAACAAAAGACTGAAGTTGAAGGAACGAAATTGAAGGATCTGAAGGCCAAAAATTAC
TTGTTTCAGGCTATTGATCGTTCAATCTTGGAGACAATTCTTTACAAAGATACCTCCAAGCAGATTTGGGATTCAATGAAGAATAAATATGAAGGCTCGGCAAAA
GCAAAGAGGCAGCAACTTAAAACACTATGTACCGAGTTTGAAAACCTTAAGATGAAGTCCGGTGAGTCGATTGCTATTTATTTCTCACGAGTCATGGAAATCACC
AACAAAATGCGGATGTTCAGGGACAAGTCAGAGGATATCATCATCATTGAGAAGATATTTAGATCCTTAACACCAAAATTCAATTTTGTTGTTTGTGCAATTGAA
GAGTCTAAGAATATTGATGATCTCTCACTTGATGAGCTGCAAAGTTCTTTACTAGTACATGAACGGAAGCTCAAGCAACAAGACAATGAGGAGCAAGCTTTGAAA
GCTTCAATAGAAACTCCATCGTCGTCAAAAGGAGGTAGAGGCAGAGGGAAAGGAAAAATCACTCTCCAAACCAAAGGTGATATCATCCATACTATTTCAAATGAC
CTCTTTATTCCAGATTTGAAGACCAACTTACTAAGTGTGGGTCAATTGCAAGAGAAGGGATATGAGATCTTCATAAAAAATGGTTGGACTCAAAACATTGCAGCA
GAAAAATATGGTGACACGATTAAAAGTCATTCTGAGATTTGTGAAGATTGTGTGGTTGGGAAACAACACAGAGATAGTTTTCCAACAGGAAAATCATGGAGAGCA
AAGCATGTTTTTGAGCTTATTCACTCTGATCTTTGTGGACCCATAAATCCGACATCAAATGTTTTTAAAAGCTTCAAAGCAAAGGTTGAAAAGGAAGCAGGCATG
GCAATAAAGATTCTTCATAGTGATCGTGGATGTGAGTACAACTCGCAAGAATTTGAAAATTTTTATGAGGAGCATGACATTAAAAGGCAACTTACAACAGCATAT
ACGCCACAACAAAATGACATTTCAGAGAGGAAAAATCGCACAATCATGAACATGGTACGGTGTCTATTAAAGAGTAGTGAAGTTCAGAAAACTAGTTGGCCTGAA
GCTGTCAATTGGAGTATCCATGTGCTGAATAGAAGTCCCACATTTGCTGTTCAGAATCAGACACCAGAAGAAGCTTGGAGTGGACAAAAACCAAATATAGATCAT
TTTAGAATTTTTGGCTACATAGCATATGCACATGTTCCAAACAAAGAGGACCATAATGTTGTTTTCAATGAAGCAGAATTCTGGAATCATGAAGAATGCAAGCCT
GACCAGAGAATCCAGGTTGATTTTGAAAATGGAGATGAAGGGACAAGGCAACAAATTGAAACAAATGATGAAGCAACTATCACTCAAGAAATTCCAGCTGTCAGT
GGCGTAGAAAGAGCTCATCGAGTCAAAAGAAAGCCTGCTTGGATGGAAGACTATGTGGTAACTGGAATGGATCATTCTGATGATCCAGTTGTTCATTTTGCTTTG
TTTGCAGATTGTGATCCAGTAACCTTTGAAGAAGCTGTCCAAAAACCGAAATGGCAAAAGACAATGAATGATGAGATAGCAGCAATTGAAAGAAACAACAGCTGG
GGAATTGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCTCTAAAAACTTTGTGCAACCAGCGATTCCTCGTTTCGATGGTCATTACGATCATTTAAAAATGTTGATGGAGAATTTTCTGAGATCAAAAGAATATTGG
GAAGTTGTGTCTGATGGAATGCCAGAAATAACAGCTGCAATGACAGATGAACAAAAGACTGAAGTTGAAGGAACGAAATTGAAGGATCTGAAGGCCAAAAATTAC
TTGTTTCAGGCTATTGATCGTTCAATCTTGGAGACAATTCTTTACAAAGATACCTCCAAGCAGATTTGGGATTCAATGAAGAATAAATATGAAGGCTCGGCAAAA
GCAAAGAGGCAGCAACTTAAAACACTATGTACCGAGTTTGAAAACCTTAAGATGAAGTCCGGTGAGTCGATTGCTATTTATTTCTCACGAGTCATGGAAATCACC
AACAAAATGCGGATGTTCAGGGACAAGTCAGAGGATATCATCATCATTGAGAAGATATTTAGATCCTTAACACCAAAATTCAATTTTGTTGTTTGTGCAATTGAA
GAGTCTAAGAATATTGATGATCTCTCACTTGATGAGCTGCAAAGTTCTTTACTAGTACATGAACGGAAGCTCAAGCAACAAGACAATGAGGAGCAAGCTTTGAAA
GCTTCAATAGAAACTCCATCGTCGTCAAAAGGAGGTAGAGGCAGAGGGAAAGGAAAAATCACTCTCCAAACCAAAGGTGATATCATCCATACTATTTCAAATGAC
CTCTTTATTCCAGATTTGAAGACCAACTTACTAAGTGTGGGTCAATTGCAAGAGAAGGGATATGAGATCTTCATAAAAAATGGTTGGACTCAAAACATTGCAGCA
GAAAAATATGGTGACACGATTAAAAGTCATTCTGAGATTTGTGAAGATTGTGTGGTTGGGAAACAACACAGAGATAGTTTTCCAACAGGAAAATCATGGAGAGCA
AAGCATGTTTTTGAGCTTATTCACTCTGATCTTTGTGGACCCATAAATCCGACATCAAATGTTTTTAAAAGCTTCAAAGCAAAGGTTGAAAAGGAAGCAGGCATG
GCAATAAAGATTCTTCATAGTGATCGTGGATGTGAGTACAACTCGCAAGAATTTGAAAATTTTTATGAGGAGCATGACATTAAAAGGCAACTTACAACAGCATAT
ACGCCACAACAAAATGACATTTCAGAGAGGAAAAATCGCACAATCATGAACATGGTACGGTGTCTATTAAAGAGTAGTGAAGTTCAGAAAACTAGTTGGCCTGAA
GCTGTCAATTGGAGTATCCATGTGCTGAATAGAAGTCCCACATTTGCTGTTCAGAATCAGACACCAGAAGAAGCTTGGAGTGGACAAAAACCAAATATAGATCAT
TTTAGAATTTTTGGCTACATAGCATATGCACATGTTCCAAACAAAGAGGACCATAATGTTGTTTTCAATGAAGCAGAATTCTGGAATCATGAAGAATGCAAGCCT
GACCAGAGAATCCAGGTTGATTTTGAAAATGGAGATGAAGGGACAAGGCAACAAATTGAAACAAATGATGAAGCAACTATCACTCAAGAAATTCCAGCTGTCAGT
GGCGTAGAAAGAGCTCATCGAGTCAAAAGAAAGCCTGCTTGGATGGAAGACTATGTGGTAACTGGAATGGATCATTCTGATGATCCAGTTGTTCATTTTGCTTTG
TTTGCAGATTGTGATCCAGTAACCTTTGAAGAAGCTGTCCAAAAACCGAAATGGCAAAAGACAATGAATGATGAGATAGCAGCAATTGAAAGAAACAACAGCTGG
GGAATTGACTGA
Protein sequenceShow/hide protein sequence
MASKNFVQPAIPRFDGHYDHLKMLMENFLRSKEYWEVVSDGMPEITAAMTDEQKTEVEGTKLKDLKAKNYLFQAIDRSILETILYKDTSKQIWDSMKNKYEGSAK
AKRQQLKTLCTEFENLKMKSGESIAIYFSRVMEITNKMRMFRDKSEDIIIIEKIFRSLTPKFNFVVCAIEESKNIDDLSLDELQSSLLVHERKLKQQDNEEQALK
ASIETPSSSKGGRGRGKGKITLQTKGDIIHTISNDLFIPDLKTNLLSVGQLQEKGYEIFIKNGWTQNIAAEKYGDTIKSHSEICEDCVVGKQHRDSFPTGKSWRA
KHVFELIHSDLCGPINPTSNVFKSFKAKVEKEAGMAIKILHSDRGCEYNSQEFENFYEEHDIKRQLTTAYTPQQNDISERKNRTIMNMVRCLLKSSEVQKTSWPE
AVNWSIHVLNRSPTFAVQNQTPEEAWSGQKPNIDHFRIFGYIAYAHVPNKEDHNVVFNEAEFWNHEECKPDQRIQVDFENGDEGTRQQIETNDEATITQEIPAVS
GVERAHRVKRKPAWMEDYVVTGMDHSDDPVVHFALFADCDPVTFEEAVQKPKWQKTMNDEIAAIERNNSWGID