; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001120 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001120
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr4:24755265..24761313
RNA-Seq ExpressionLag0001120
SyntenyLag0001120
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0038009.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]4.9e-5432.14Show/hide
Query:  KFELEKFDGKGDFSLWRKKLRAMLVQLKVARILDKPEDLPEPLSEPKRKEMDEIAFSTMILYLSDSVLRQVDK-INSVVELWKKLESMYLNKYLTNKILL
        +FE+ KF+  GDF+LWRKK+RA+LVQ KVA+ILD+   LP  ++E ++++MDE+A+ST+++YLS  VLR VD+   +  ELWKKLES+YL K L NKI +
Subjt:  KFELEKFDGKGDFSLWRKKLRAMLVQLKVARILDKPEDLPEPLSEPKRKEMDEIAFSTMILYLSDSVLRQVDK-INSVVELWKKLESMYLNKYLTNKILL

Query:  KERLFGYRMDPSKPLEDNLDEFVKKFLDSDTAASPLPKSRRAAAVWSHTARSRVPLSLPGVPLSRGPPSLSTPSFWSRRPALAVAIEAQVRRCLSSYSAA
        KE+ FGY+MD SK LE+NL+EF K  +D +     +    +A  + +                       S P  +         ++A ++    S + +
Subjt:  KERLFGYRMDPSKPLEDNLDEFVKKFLDSDTAASPLPKSRRAAAVWSHTARSRVPLSLPGVPLSRGPPSLSTPSFWSRRPALAVAIEAQVRRCLSSYSAA

Query:  IACVLFPHRFILYPSRVSRILKYRSSRPRLCPVACSVFSVSFGVFAPSKCSIEFDTLQLEHSLPKDCSSASLELRITRLALKRVCL--GKEVVFAALGLL
        I        F    +R  +I K R     L                          +    S  K         R+      R C    KE  F     L
Subjt:  IACVLFPHRFILYPSRVSRILKYRSSRPRLCPVACSVFSVSFGVFAPSKCSIEFDTLQLEHSLPKDCSSASLELRITRLALKRVCL--GKEVVFAALGLL

Query:  ELLRSAIAERDAEITWVVSAERDAKITYRDDLDLGGEEESYEEKSLKWWCPCGYVFLVVLADASRIACTFVNELSLSSAAWIHIIAAEVLTTSDLSPTDE
           R A +  +A +T        A ITY  D      E  YE                                           +AEVL  S     D 
Subjt:  ELLRSAIAERDAEITWVVSAERDAKITYRDDLDLGGEEESYEEKSLKWWCPCGYVFLVVLADASRIACTFVNELSLSSAAWIHIIAAEVLTTSDLSPTDE

Query:  WILDSGCSYHMTPHKHYFLDLKEQDGGRVLMGNNQQCLIKGIGSIKLSLADGTNKILPAVRYVPDLKRNPISLGTLDKAGYRYKSEGGVMKISKGSLLMM
        WI+DSGC+YHMTP++ + ++ ++ DGG+VL+G+N  C +KG GS+ ++  DG  ++L  VRYVP+LKRN ISL  LD++ Y  K E G+MK++KGSL+ +
Subjt:  WILDSGCSYHMTPHKHYFLDLKEQDGGRVLMGNNQQCLIKGIGSIKLSLADGTNKILPAVRYVPDLKRNPISLGTLDKAGYRYKSEGGVMKISKGSLLMM

Query:  AGSLKNGLHTLNGSSTLNTANVAPSETNSEVV
         G+LKN L+ L G++  ++A  A  +   ++V
Subjt:  AGSLKNGLHTLNGSSTLNTANVAPSETNSEVV

KAA0046503.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]3.4e-5532.31Show/hide
Query:  SFTVAISIWAKFELEKFD-GKGDFSLWRKKLRAMLVQLKVARILDKPEDLPEPLSEPKRKEMDEIAFSTMILYLSDSVLRQVDKINSVVELWKKLESMYL
        +F   I  W + +L+    G GDF+LWRKK+RA+LVQ KVA+ILD+   +P  ++E ++++MDE+A+ST++LYLSD VLR VD+  +  ELWKKLES+YL
Subjt:  SFTVAISIWAKFELEKFD-GKGDFSLWRKKLRAMLVQLKVARILDKPEDLPEPLSEPKRKEMDEIAFSTMILYLSDSVLRQVDKINSVVELWKKLESMYL

Query:  NKYLTNKILLKERLFGYRMDPSKPLEDNLDEFVKKFLDSDTAASPLPKSRRAAAVWSHTARSRVPLSLPGVPLSRGPPSLSTPSFWSRRPALAVAIEAQV
         K L NKI +KE+ FGY++D SK LE+NL+EF K  +D +     +    +   + +   ++   +      +  G  SL+          +++ ++A  
Subjt:  NKYLTNKILLKERLFGYRMDPSKPLEDNLDEFVKKFLDSDTAASPLPKSRRAAAVWSHTARSRVPLSLPGVPLSRGPPSLSTPSFWSRRPALAVAIEAQV

Query:  RRCLSSYSAAIACVLFPHRFILYPSRVSRILKYRSSRPRLCPVACSVFSVSFGVFAPSKCSIEFDTLQLEHSLPKDCSSASLELRITRLALKRVCLGKEV
         R L                I    +   +L  R    +                              ++   K+ SS    +     A K     KE 
Subjt:  RRCLSSYSAAIACVLFPHRFILYPSRVSRILKYRSSRPRLCPVACSVFSVSFGVFAPSKCSIEFDTLQLEHSLPKDCSSASLELRITRLALKRVCLGKEV

Query:  VFAALGLLELLRSAIAERDAEITWVVSAERDAKITYRDDLDLGGEEESYEEKSLKWWCPCGYVFLVVLADASRIACTFVNELSLSSAAWIHIIAAEVLTT
         F     L   R A+       T V      AKIT  D  D    E  YE                                           +AEVL  
Subjt:  VFAALGLLELLRSAIAERDAEITWVVSAERDAKITYRDDLDLGGEEESYEEKSLKWWCPCGYVFLVVLADASRIACTFVNELSLSSAAWIHIIAAEVLTT

Query:  SDLSPTDEWILDSGCSYHMTPHKHYFLDLKEQDGGRVLMGNNQQCLIKGIGSIKLSLADGTNKILPAVRYVPDLKRNPISLGTLDKAGYRYKSEGGVMKI
        S     D WI DSGC+YHMTP++ + ++ ++ DGG+VL+G+N  C +KG GS+ ++  DG  ++L  VRYVP+LKRN ISLG LD++GY  K E G+MK+
Subjt:  SDLSPTDEWILDSGCSYHMTPHKHYFLDLKEQDGGRVLMGNNQQCLIKGIGSIKLSLADGTNKILPAVRYVPDLKRNPISLGTLDKAGYRYKSEGGVMKI

Query:  SKGSLLMMAGSLKNGLHTLNGSS
        +KGSL+ + G+L+NGL+ L G++
Subjt:  SKGSLLMMAGSLKNGLHTLNGSS

KAA0047995.1 retrotransposon protein, putative, Ty1-copia sub-class [Cucumis melo var. makuwa]9.5e-5833.27Show/hide
Query:  KFELEKFDGKGDFSLWRKKLRAMLVQLKVARILDKPEDLPEPLSEPKRKEMDEIAFSTMILYLSDSVLRQVDKINSVVELWKKLESMYLNKYLTNKILLK
        +FE+ KF+G GDF+LWRKK+RA+LVQ KVA+ILD+ E LP+ ++E ++++MDE+A+ T++LYLSD VLR VD+  +  ELWKKLES+YL K L NKI +K
Subjt:  KFELEKFDGKGDFSLWRKKLRAMLVQLKVARILDKPEDLPEPLSEPKRKEMDEIAFSTMILYLSDSVLRQVDKINSVVELWKKLESMYLNKYLTNKILLK

Query:  ERLFGYRMDPSKPLEDNLDEFVKKFLDSDTAASPLPKSRRAAAVWSHTARSRVPLSLPGVPLSRGPPSLSTPSFWSRRPALAVAIEAQVRRCLSSYSAAI
        E+ FGY+MD SK LE+NLDEF K  +D +     +    +A  + +                       S P  +         ++A ++    S + +I
Subjt:  ERLFGYRMDPSKPLEDNLDEFVKKFLDSDTAASPLPKSRRAAAVWSHTARSRVPLSLPGVPLSRGPPSLSTPSFWSRRPALAVAIEAQVRRCLSSYSAAI

Query:  ACVLFPHRFILYPSRVSRILKYRSSRPRLCPVACSVFSVSFGVFAPSKCSIEFDTLQLEHSLPKDCSSASLELRITRLALKRVCL--GKEVVFAALGLLE
               R +        I K R     L                          +    S  K         R       R C    KE  F     L 
Subjt:  ACVLFPHRFILYPSRVSRILKYRSSRPRLCPVACSVFSVSFGVFAPSKCSIEFDTLQLEHSLPKDCSSASLELRITRLALKRVCL--GKEVVFAALGLLE

Query:  LLRSAIAERDAEITWVVSAERDAKITYRDDLDLGGEEESYEEKSLKWWCPCGYVFLVVLADASRIACTFVNELSLSSAAWIHIIAAEVLTTSDLSPTDEW
          R A +  +A +T        A+IT  D  D    E  YE                                           +AEVL  S     D W
Subjt:  LLRSAIAERDAEITWVVSAERDAKITYRDDLDLGGEEESYEEKSLKWWCPCGYVFLVVLADASRIACTFVNELSLSSAAWIHIIAAEVLTTSDLSPTDEW

Query:  ILDSGCSYHMTPHKHYFLDLKEQDGGRVLMGNNQQCLIKGIGSIKLSLADGTNKILPAVRYVPDLKRNPISLGTLDKAGYRYKSEGGVMKISKGSLLMMA
        I+DSGC++HMTPH+ +  + ++ DGG+VL+G+N  C +KG GS++++  DG  +IL  VRYVP LKRN ISLG LD++G   KSE GVMK++KGSL+ + 
Subjt:  ILDSGCSYHMTPHKHYFLDLKEQDGGRVLMGNNQQCLIKGIGSIKLSLADGTNKILPAVRYVPDLKRNPISLGTLDKAGYRYKSEGGVMKISKGSLLMMA

Query:  GSLKNGLHTLNGSSTLNTANVAPSETNSEVVL
        G+L++GL+ L G++   +A +A  +  +  +L
Subjt:  GSLKNGLHTLNGSSTLNTANVAPSETNSEVVL

KAA0050719.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]6.6e-5933.65Show/hide
Query:  KFELEKFDGKGDFSLWRKKLRAMLVQLKVARILDKPEDLPEPLSEPKRKEMDEIAFSTMILYLSDSVLRQVDKINSVVELWKKLESMYLNKYLTNKILLK
        +FE+ KF+G GDFSLWRKK+RA+LVQ KVA+ILD+ E LP+ ++E ++++MDE+A+ST++LYLSD VLR VD+  +  ELWKKLES+YL K L NKI +K
Subjt:  KFELEKFDGKGDFSLWRKKLRAMLVQLKVARILDKPEDLPEPLSEPKRKEMDEIAFSTMILYLSDSVLRQVDKINSVVELWKKLESMYLNKYLTNKILLK

Query:  ERLFGYRMDPSKPLEDNLDEFVKKFLDSDTAASPLPKSRRAAAVWSHTARSRVPLSLPGVPLSRGPPSLSTPSFWSRRPALAVAIEAQVRRCLSSYSAAI
        E+ FGY+MD SK LE+NLDEF K  +D +     +    +A  + +    +   +      +  G  SL+          +++ ++A   R L       
Subjt:  ERLFGYRMDPSKPLEDNLDEFVKKFLDSDTAASPLPKSRRAAAVWSHTARSRVPLSLPGVPLSRGPPSLSTPSFWSRRPALAVAIEAQVRRCLSSYSAAI

Query:  ACVLFPHRFILYPSRVSRILKYRSSRPRLCPVACSVFSVSFGVFAPSKCSIEFDTLQLEHSLPKDCSSASLELRITRLALKRVCL--GKEVVFAALGLLE
                          I K R     L                          +    S  K         R       R C    KE  F     L 
Subjt:  ACVLFPHRFILYPSRVSRILKYRSSRPRLCPVACSVFSVSFGVFAPSKCSIEFDTLQLEHSLPKDCSSASLELRITRLALKRVCL--GKEVVFAALGLLE

Query:  LLRSAIAERDAEITWVVSAERDAKITYRDDLDLGGEEESYEEKSLKWWCPCGYVFLVVLADASRIACTFVNELSLSSAAWIHIIAAEVLTTSDLSPTDEW
          R A +  +A +T        A+IT  D  D    E  YE                                           +AEVL  S     D W
Subjt:  LLRSAIAERDAEITWVVSAERDAKITYRDDLDLGGEEESYEEKSLKWWCPCGYVFLVVLADASRIACTFVNELSLSSAAWIHIIAAEVLTTSDLSPTDEW

Query:  ILDSGCSYHMTPHKHYFLDLKEQDGGRVLMGNNQQCLIKGIGSIKLSLADGTNKILPAVRYVPDLKRNPISLGTLDKAGYRYKSEGGVMKISKGSLLMMA
        I+DSGC++HMTPH+ +  + ++ DGG+VL+G+N  C +KG GS++++  DG  +IL  VRYVP LKRN ISLG LD++G   KSE GVMK++KGSL+ + 
Subjt:  ILDSGCSYHMTPHKHYFLDLKEQDGGRVLMGNNQQCLIKGIGSIKLSLADGTNKILPAVRYVPDLKRNPISLGTLDKAGYRYKSEGGVMKISKGSLLMMA

Query:  GSLKNGLHTLNGSSTLNTANVAPSETNSEVVL
        G+L++GL+ L G++   +A +A  +     +L
Subjt:  GSLKNGLHTLNGSSTLNTANVAPSETNSEVVL

TYK25306.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]1.5e-5833.46Show/hide
Query:  KFELEKFDGKGDFSLWRKKLRAMLVQLKVARILDKPEDLPEPLSEPKRKEMDEIAFSTMILYLSDSVLRQVDKINSVVELWKKLESMYLNKYLTNKILLK
        +FE+ KF+G GDF+LWRKK+RA+LVQ KVA+ILD+ E LP+ ++E ++++MDE+A+ST++LYLSD VLR VD+  +  ELWKKLES+YL K L NKI +K
Subjt:  KFELEKFDGKGDFSLWRKKLRAMLVQLKVARILDKPEDLPEPLSEPKRKEMDEIAFSTMILYLSDSVLRQVDKINSVVELWKKLESMYLNKYLTNKILLK

Query:  ERLFGYRMDPSKPLEDNLDEFVKKFLDSDTAASPLPKSRRAAAVWSHTARSRVPLSLPGVPLSRGPPSLSTPSFWSRRPALAVAIEAQVRRCLSSYSAAI
        E+ FGY+MD SK LE+NLDEF K  +D +     +    +A  + +    +   +      +  G  SL+          +++ ++A   R L       
Subjt:  ERLFGYRMDPSKPLEDNLDEFVKKFLDSDTAASPLPKSRRAAAVWSHTARSRVPLSLPGVPLSRGPPSLSTPSFWSRRPALAVAIEAQVRRCLSSYSAAI

Query:  ACVLFPHRFILYPSRVSRILKYRSSRPRLCPVACSVFSVSFGVFAPSKCSIEFDTLQLEHSLPKDCSSASLELRITRLALKRVCL--GKEVVFAALGLLE
                          I K R     L                          +    S  K         R       R C    KE  F     L 
Subjt:  ACVLFPHRFILYPSRVSRILKYRSSRPRLCPVACSVFSVSFGVFAPSKCSIEFDTLQLEHSLPKDCSSASLELRITRLALKRVCL--GKEVVFAALGLLE

Query:  LLRSAIAERDAEITWVVSAERDAKITYRDDLDLGGEEESYEEKSLKWWCPCGYVFLVVLADASRIACTFVNELSLSSAAWIHIIAAEVLTTSDLSPTDEW
          R A +  +A +T        A+IT  D  D    E  YE                                           +AEVL  S     D W
Subjt:  LLRSAIAERDAEITWVVSAERDAKITYRDDLDLGGEEESYEEKSLKWWCPCGYVFLVVLADASRIACTFVNELSLSSAAWIHIIAAEVLTTSDLSPTDEW

Query:  ILDSGCSYHMTPHKHYFLDLKEQDGGRVLMGNNQQCLIKGIGSIKLSLADGTNKILPAVRYVPDLKRNPISLGTLDKAGYRYKSEGGVMKISKGSLLMMA
        I+DSGC++HMTPH+ +  + ++ DGG+VL+G+N  C +KG GS++++  DG  +IL  VRYVP LKRN ISLG LD++G   KSE GVMK++KGSL+ + 
Subjt:  ILDSGCSYHMTPHKHYFLDLKEQDGGRVLMGNNQQCLIKGIGSIKLSLADGTNKILPAVRYVPDLKRNPISLGTLDKAGYRYKSEGGVMKISKGSLLMMA

Query:  GSLKNGLHTLNGSSTLNTANVAPSETNSEVVL
        G+L++GL+ L G++   +A +A  +     +L
Subjt:  GSLKNGLHTLNGSSTLNTANVAPSETNSEVVL

TrEMBL top hitse value%identityAlignment
A0A5A7U2U7 Retrotransposon protein, putative, Ty1-copia sub-class4.6e-5833.27Show/hide
Query:  KFELEKFDGKGDFSLWRKKLRAMLVQLKVARILDKPEDLPEPLSEPKRKEMDEIAFSTMILYLSDSVLRQVDKINSVVELWKKLESMYLNKYLTNKILLK
        +FE+ KF+G GDF+LWRKK+RA+LVQ KVA+ILD+ E LP+ ++E ++++MDE+A+ T++LYLSD VLR VD+  +  ELWKKLES+YL K L NKI +K
Subjt:  KFELEKFDGKGDFSLWRKKLRAMLVQLKVARILDKPEDLPEPLSEPKRKEMDEIAFSTMILYLSDSVLRQVDKINSVVELWKKLESMYLNKYLTNKILLK

Query:  ERLFGYRMDPSKPLEDNLDEFVKKFLDSDTAASPLPKSRRAAAVWSHTARSRVPLSLPGVPLSRGPPSLSTPSFWSRRPALAVAIEAQVRRCLSSYSAAI
        E+ FGY+MD SK LE+NLDEF K  +D +     +    +A  + +                       S P  +         ++A ++    S + +I
Subjt:  ERLFGYRMDPSKPLEDNLDEFVKKFLDSDTAASPLPKSRRAAAVWSHTARSRVPLSLPGVPLSRGPPSLSTPSFWSRRPALAVAIEAQVRRCLSSYSAAI

Query:  ACVLFPHRFILYPSRVSRILKYRSSRPRLCPVACSVFSVSFGVFAPSKCSIEFDTLQLEHSLPKDCSSASLELRITRLALKRVCL--GKEVVFAALGLLE
               R +        I K R     L                          +    S  K         R       R C    KE  F     L 
Subjt:  ACVLFPHRFILYPSRVSRILKYRSSRPRLCPVACSVFSVSFGVFAPSKCSIEFDTLQLEHSLPKDCSSASLELRITRLALKRVCL--GKEVVFAALGLLE

Query:  LLRSAIAERDAEITWVVSAERDAKITYRDDLDLGGEEESYEEKSLKWWCPCGYVFLVVLADASRIACTFVNELSLSSAAWIHIIAAEVLTTSDLSPTDEW
          R A +  +A +T        A+IT  D  D    E  YE                                           +AEVL  S     D W
Subjt:  LLRSAIAERDAEITWVVSAERDAKITYRDDLDLGGEEESYEEKSLKWWCPCGYVFLVVLADASRIACTFVNELSLSSAAWIHIIAAEVLTTSDLSPTDEW

Query:  ILDSGCSYHMTPHKHYFLDLKEQDGGRVLMGNNQQCLIKGIGSIKLSLADGTNKILPAVRYVPDLKRNPISLGTLDKAGYRYKSEGGVMKISKGSLLMMA
        I+DSGC++HMTPH+ +  + ++ DGG+VL+G+N  C +KG GS++++  DG  +IL  VRYVP LKRN ISLG LD++G   KSE GVMK++KGSL+ + 
Subjt:  ILDSGCSYHMTPHKHYFLDLKEQDGGRVLMGNNQQCLIKGIGSIKLSLADGTNKILPAVRYVPDLKRNPISLGTLDKAGYRYKSEGGVMKISKGSLLMMA

Query:  GSLKNGLHTLNGSSTLNTANVAPSETNSEVVL
        G+L++GL+ L G++   +A +A  +  +  +L
Subjt:  GSLKNGLHTLNGSSTLNTANVAPSETNSEVVL

A0A5A7UB25 Putative gag-pol polyprotein3.2e-5933.65Show/hide
Query:  KFELEKFDGKGDFSLWRKKLRAMLVQLKVARILDKPEDLPEPLSEPKRKEMDEIAFSTMILYLSDSVLRQVDKINSVVELWKKLESMYLNKYLTNKILLK
        +FE+ KF+G GDFSLWRKK+RA+LVQ KVA+ILD+ E LP+ ++E ++++MDE+A+ST++LYLSD VLR VD+  +  ELWKKLES+YL K L NKI +K
Subjt:  KFELEKFDGKGDFSLWRKKLRAMLVQLKVARILDKPEDLPEPLSEPKRKEMDEIAFSTMILYLSDSVLRQVDKINSVVELWKKLESMYLNKYLTNKILLK

Query:  ERLFGYRMDPSKPLEDNLDEFVKKFLDSDTAASPLPKSRRAAAVWSHTARSRVPLSLPGVPLSRGPPSLSTPSFWSRRPALAVAIEAQVRRCLSSYSAAI
        E+ FGY+MD SK LE+NLDEF K  +D +     +    +A  + +    +   +      +  G  SL+          +++ ++A   R L       
Subjt:  ERLFGYRMDPSKPLEDNLDEFVKKFLDSDTAASPLPKSRRAAAVWSHTARSRVPLSLPGVPLSRGPPSLSTPSFWSRRPALAVAIEAQVRRCLSSYSAAI

Query:  ACVLFPHRFILYPSRVSRILKYRSSRPRLCPVACSVFSVSFGVFAPSKCSIEFDTLQLEHSLPKDCSSASLELRITRLALKRVCL--GKEVVFAALGLLE
                          I K R     L                          +    S  K         R       R C    KE  F     L 
Subjt:  ACVLFPHRFILYPSRVSRILKYRSSRPRLCPVACSVFSVSFGVFAPSKCSIEFDTLQLEHSLPKDCSSASLELRITRLALKRVCL--GKEVVFAALGLLE

Query:  LLRSAIAERDAEITWVVSAERDAKITYRDDLDLGGEEESYEEKSLKWWCPCGYVFLVVLADASRIACTFVNELSLSSAAWIHIIAAEVLTTSDLSPTDEW
          R A +  +A +T        A+IT  D  D    E  YE                                           +AEVL  S     D W
Subjt:  LLRSAIAERDAEITWVVSAERDAKITYRDDLDLGGEEESYEEKSLKWWCPCGYVFLVVLADASRIACTFVNELSLSSAAWIHIIAAEVLTTSDLSPTDEW

Query:  ILDSGCSYHMTPHKHYFLDLKEQDGGRVLMGNNQQCLIKGIGSIKLSLADGTNKILPAVRYVPDLKRNPISLGTLDKAGYRYKSEGGVMKISKGSLLMMA
        I+DSGC++HMTPH+ +  + ++ DGG+VL+G+N  C +KG GS++++  DG  +IL  VRYVP LKRN ISLG LD++G   KSE GVMK++KGSL+ + 
Subjt:  ILDSGCSYHMTPHKHYFLDLKEQDGGRVLMGNNQQCLIKGIGSIKLSLADGTNKILPAVRYVPDLKRNPISLGTLDKAGYRYKSEGGVMKISKGSLLMMA

Query:  GSLKNGLHTLNGSSTLNTANVAPSETNSEVVL
        G+L++GL+ L G++   +A +A  +     +L
Subjt:  GSLKNGLHTLNGSSTLNTANVAPSETNSEVVL

A0A5D3BRB2 Retrovirus-related Pol polyprotein from transposon TNT 1-942.4e-5432.14Show/hide
Query:  KFELEKFDGKGDFSLWRKKLRAMLVQLKVARILDKPEDLPEPLSEPKRKEMDEIAFSTMILYLSDSVLRQVDK-INSVVELWKKLESMYLNKYLTNKILL
        +FE+ KF+  GDF+LWRKK+RA+LVQ KVA+ILD+   LP  ++E ++++MDE+A+ST+++YLS  VLR VD+   +  ELWKKLES+YL K L NKI +
Subjt:  KFELEKFDGKGDFSLWRKKLRAMLVQLKVARILDKPEDLPEPLSEPKRKEMDEIAFSTMILYLSDSVLRQVDK-INSVVELWKKLESMYLNKYLTNKILL

Query:  KERLFGYRMDPSKPLEDNLDEFVKKFLDSDTAASPLPKSRRAAAVWSHTARSRVPLSLPGVPLSRGPPSLSTPSFWSRRPALAVAIEAQVRRCLSSYSAA
        KE+ FGY+MD SK LE+NL+EF K  +D +     +    +A  + +                       S P  +         ++A ++    S + +
Subjt:  KERLFGYRMDPSKPLEDNLDEFVKKFLDSDTAASPLPKSRRAAAVWSHTARSRVPLSLPGVPLSRGPPSLSTPSFWSRRPALAVAIEAQVRRCLSSYSAA

Query:  IACVLFPHRFILYPSRVSRILKYRSSRPRLCPVACSVFSVSFGVFAPSKCSIEFDTLQLEHSLPKDCSSASLELRITRLALKRVCL--GKEVVFAALGLL
        I        F    +R  +I K R     L                          +    S  K         R+      R C    KE  F     L
Subjt:  IACVLFPHRFILYPSRVSRILKYRSSRPRLCPVACSVFSVSFGVFAPSKCSIEFDTLQLEHSLPKDCSSASLELRITRLALKRVCL--GKEVVFAALGLL

Query:  ELLRSAIAERDAEITWVVSAERDAKITYRDDLDLGGEEESYEEKSLKWWCPCGYVFLVVLADASRIACTFVNELSLSSAAWIHIIAAEVLTTSDLSPTDE
           R A +  +A +T        A ITY  D      E  YE                                           +AEVL  S     D 
Subjt:  ELLRSAIAERDAEITWVVSAERDAKITYRDDLDLGGEEESYEEKSLKWWCPCGYVFLVVLADASRIACTFVNELSLSSAAWIHIIAAEVLTTSDLSPTDE

Query:  WILDSGCSYHMTPHKHYFLDLKEQDGGRVLMGNNQQCLIKGIGSIKLSLADGTNKILPAVRYVPDLKRNPISLGTLDKAGYRYKSEGGVMKISKGSLLMM
        WI+DSGC+YHMTP++ + ++ ++ DGG+VL+G+N  C +KG GS+ ++  DG  ++L  VRYVP+LKRN ISL  LD++ Y  K E G+MK++KGSL+ +
Subjt:  WILDSGCSYHMTPHKHYFLDLKEQDGGRVLMGNNQQCLIKGIGSIKLSLADGTNKILPAVRYVPDLKRNPISLGTLDKAGYRYKSEGGVMKISKGSLLMM

Query:  AGSLKNGLHTLNGSSTLNTANVAPSETNSEVV
         G+LKN L+ L G++  ++A  A  +   ++V
Subjt:  AGSLKNGLHTLNGSSTLNTANVAPSETNSEVV

A0A5D3BRV3 Retrovirus-related Pol polyprotein from transposon TNT 1-941.6e-5532.31Show/hide
Query:  SFTVAISIWAKFELEKFD-GKGDFSLWRKKLRAMLVQLKVARILDKPEDLPEPLSEPKRKEMDEIAFSTMILYLSDSVLRQVDKINSVVELWKKLESMYL
        +F   I  W + +L+    G GDF+LWRKK+RA+LVQ KVA+ILD+   +P  ++E ++++MDE+A+ST++LYLSD VLR VD+  +  ELWKKLES+YL
Subjt:  SFTVAISIWAKFELEKFD-GKGDFSLWRKKLRAMLVQLKVARILDKPEDLPEPLSEPKRKEMDEIAFSTMILYLSDSVLRQVDKINSVVELWKKLESMYL

Query:  NKYLTNKILLKERLFGYRMDPSKPLEDNLDEFVKKFLDSDTAASPLPKSRRAAAVWSHTARSRVPLSLPGVPLSRGPPSLSTPSFWSRRPALAVAIEAQV
         K L NKI +KE+ FGY++D SK LE+NL+EF K  +D +     +    +   + +   ++   +      +  G  SL+          +++ ++A  
Subjt:  NKYLTNKILLKERLFGYRMDPSKPLEDNLDEFVKKFLDSDTAASPLPKSRRAAAVWSHTARSRVPLSLPGVPLSRGPPSLSTPSFWSRRPALAVAIEAQV

Query:  RRCLSSYSAAIACVLFPHRFILYPSRVSRILKYRSSRPRLCPVACSVFSVSFGVFAPSKCSIEFDTLQLEHSLPKDCSSASLELRITRLALKRVCLGKEV
         R L                I    +   +L  R    +                              ++   K+ SS    +     A K     KE 
Subjt:  RRCLSSYSAAIACVLFPHRFILYPSRVSRILKYRSSRPRLCPVACSVFSVSFGVFAPSKCSIEFDTLQLEHSLPKDCSSASLELRITRLALKRVCLGKEV

Query:  VFAALGLLELLRSAIAERDAEITWVVSAERDAKITYRDDLDLGGEEESYEEKSLKWWCPCGYVFLVVLADASRIACTFVNELSLSSAAWIHIIAAEVLTT
         F     L   R A+       T V      AKIT  D  D    E  YE                                           +AEVL  
Subjt:  VFAALGLLELLRSAIAERDAEITWVVSAERDAKITYRDDLDLGGEEESYEEKSLKWWCPCGYVFLVVLADASRIACTFVNELSLSSAAWIHIIAAEVLTT

Query:  SDLSPTDEWILDSGCSYHMTPHKHYFLDLKEQDGGRVLMGNNQQCLIKGIGSIKLSLADGTNKILPAVRYVPDLKRNPISLGTLDKAGYRYKSEGGVMKI
        S     D WI DSGC+YHMTP++ + ++ ++ DGG+VL+G+N  C +KG GS+ ++  DG  ++L  VRYVP+LKRN ISLG LD++GY  K E G+MK+
Subjt:  SDLSPTDEWILDSGCSYHMTPHKHYFLDLKEQDGGRVLMGNNQQCLIKGIGSIKLSLADGTNKILPAVRYVPDLKRNPISLGTLDKAGYRYKSEGGVMKI

Query:  SKGSLLMMAGSLKNGLHTLNGSS
        +KGSL+ + G+L+NGL+ L G++
Subjt:  SKGSLLMMAGSLKNGLHTLNGSS

A0A5D3DNU1 Putative gag-pol polyprotein7.1e-5933.46Show/hide
Query:  KFELEKFDGKGDFSLWRKKLRAMLVQLKVARILDKPEDLPEPLSEPKRKEMDEIAFSTMILYLSDSVLRQVDKINSVVELWKKLESMYLNKYLTNKILLK
        +FE+ KF+G GDF+LWRKK+RA+LVQ KVA+ILD+ E LP+ ++E ++++MDE+A+ST++LYLSD VLR VD+  +  ELWKKLES+YL K L NKI +K
Subjt:  KFELEKFDGKGDFSLWRKKLRAMLVQLKVARILDKPEDLPEPLSEPKRKEMDEIAFSTMILYLSDSVLRQVDKINSVVELWKKLESMYLNKYLTNKILLK

Query:  ERLFGYRMDPSKPLEDNLDEFVKKFLDSDTAASPLPKSRRAAAVWSHTARSRVPLSLPGVPLSRGPPSLSTPSFWSRRPALAVAIEAQVRRCLSSYSAAI
        E+ FGY+MD SK LE+NLDEF K  +D +     +    +A  + +    +   +      +  G  SL+          +++ ++A   R L       
Subjt:  ERLFGYRMDPSKPLEDNLDEFVKKFLDSDTAASPLPKSRRAAAVWSHTARSRVPLSLPGVPLSRGPPSLSTPSFWSRRPALAVAIEAQVRRCLSSYSAAI

Query:  ACVLFPHRFILYPSRVSRILKYRSSRPRLCPVACSVFSVSFGVFAPSKCSIEFDTLQLEHSLPKDCSSASLELRITRLALKRVCL--GKEVVFAALGLLE
                          I K R     L                          +    S  K         R       R C    KE  F     L 
Subjt:  ACVLFPHRFILYPSRVSRILKYRSSRPRLCPVACSVFSVSFGVFAPSKCSIEFDTLQLEHSLPKDCSSASLELRITRLALKRVCL--GKEVVFAALGLLE

Query:  LLRSAIAERDAEITWVVSAERDAKITYRDDLDLGGEEESYEEKSLKWWCPCGYVFLVVLADASRIACTFVNELSLSSAAWIHIIAAEVLTTSDLSPTDEW
          R A +  +A +T        A+IT  D  D    E  YE                                           +AEVL  S     D W
Subjt:  LLRSAIAERDAEITWVVSAERDAKITYRDDLDLGGEEESYEEKSLKWWCPCGYVFLVVLADASRIACTFVNELSLSSAAWIHIIAAEVLTTSDLSPTDEW

Query:  ILDSGCSYHMTPHKHYFLDLKEQDGGRVLMGNNQQCLIKGIGSIKLSLADGTNKILPAVRYVPDLKRNPISLGTLDKAGYRYKSEGGVMKISKGSLLMMA
        I+DSGC++HMTPH+ +  + ++ DGG+VL+G+N  C +KG GS++++  DG  +IL  VRYVP LKRN ISLG LD++G   KSE GVMK++KGSL+ + 
Subjt:  ILDSGCSYHMTPHKHYFLDLKEQDGGRVLMGNNQQCLIKGIGSIKLSLADGTNKILPAVRYVPDLKRNPISLGTLDKAGYRYKSEGGVMKISKGSLLMMA

Query:  GSLKNGLHTLNGSSTLNTANVAPSETNSEVVL
        G+L++GL+ L G++   +A +A  +     +L
Subjt:  GSLKNGLHTLNGSSTLNTANVAPSETNSEVVL

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.2e-0424.48Show/hide
Query:  AKFELEKFDGKGDFSLWRKKLRAMLVQLKVARILD--KPEDLPEPLSEPKRKEMDEIAFSTMILYLSDSVLRQVDKINSVVELWKKLESMYLNKYLTNKI
        AK  ++ FDG+  +++W+ ++RA+L +  V +++D   P ++     +   K+ +  A ST+I YLSDS L       +  ++ + L+++Y  K L +++
Subjt:  AKFELEKFDGKGDFSLWRKKLRAMLVQLKVARILD--KPEDLPEPLSEPKRKEMDEIAFSTMILYLSDSVLRQVDKINSVVELWKKLESMYLNKYLTNKI

Query:  LLKERLFGYRMDPSKPLEDN---LDEFVKKFLDSDTAASPLPK
         L++RL   ++     L  +    DE + + L +      + K
Subjt:  LLKERLFGYRMDPSKPLEDN---LDEFVKKFLDSDTAASPLPK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.7e-1521.21Show/hide
Query:  KFELEKFDGKGDFSLWRKKLRAMLVQLKVARILDKPEDLPEPLSEPKRKEMDEIAFSTMILYLSDSVLRQVDKINSVVELWKKLESMYLNKYLTNKILLK
        K+E+ KF+G   FS W++++R +L+Q  + ++LD     P+ +      ++DE A S + L+LSD V+  +   ++   +W +LES+Y++K LTNK+ LK
Subjt:  KFELEKFDGKGDFSLWRKKLRAMLVQLKVARILDKPEDLPEPLSEPKRKEMDEIAFSTMILYLSDSVLRQVDKINSVVELWKKLESMYLNKYLTNKILLK

Query:  ERLFGYRMDPSKPLEDNLDEFVKKFLDSDTAASPLPKSRRAAAVWSHTARSRVPLSLPGVPLSRGPPSLSTPSFWSRRPALAVAIEAQVRRCLSSYSAAI
        ++L+   M        +L+ F             + +  +A  + +    S   L+     +  G  ++      S     A+ +  ++R+   +   A+
Subjt:  ERLFGYRMDPSKPLEDNLDEFVKKFLDSDTAASPLPKSRRAAAVWSHTARSRVPLSLPGVPLSRGPPSLSTPSFWSRRPALAVAIEAQVRRCLSSYSAAI

Query:  ACVLFPHRFILYPSRVSRILKYRSSRPRLCPVACSVFSVSFGVFAPSKCSIEFDTLQLEHSLPKDCSSASLELRITRLALKRVCLGKEVVFAALGLLELL
                         R   Y+ S            S ++G     +      +     S  ++C + +          KR C                
Subjt:  ACVLFPHRFILYPSRVSRILKYRSSRPRLCPVACSVFSVSFGVFAPSKCSIEFDTLQLEHSLPKDCSSASLELRITRLALKRVCLGKEVVFAALGLLELL

Query:  RSAIAERDAEITWVVSAERDAKITYRDDLDLGGEEESYEEKSLKWWCPCGYVFLVVLADASRIACTFVNELSLSSAAWIHIIAAEVLTTSDLSPTDEWIL
             +   E +   + +  A +   +D                          VVL         F+NE        +H+            P  EW++
Subjt:  RSAIAERDAEITWVVSAERDAKITYRDDLDLGGEEESYEEKSLKWWCPCGYVFLVVLADASRIACTFVNELSLSSAAWIHIIAAEVLTTSDLSPTDEWIL

Query:  DSGCSYHMTPHKHYFLDLKEQDGGRVLMGNNQQCLIKGIGSIKLSLADGTNKILPAVRYVPDLKRNPISLGTLDKAGYRYKSEGGVMKISKGSLLMMAGS
        D+  S+H TP +  F      D G V MGN     I GIG I +    G   +L  VR+VPDL+ N IS   LD+ GY         +++KGSL++  G 
Subjt:  DSGCSYHMTPHKHYFLDLKEQDGGRVLMGNNQQCLIKGIGSIKLSLADGTNKILPAVRYVPDLKRNPISLGTLDKAGYRYKSEGGVMKISKGSLLMMAGS

Query:  LKNGLHTLNGSSTLNTANVAPSETNSEV
         +  L+  N        N A  E + ++
Subjt:  LKNGLHTLNGSSTLNTANVAPSETNSEV

Arabidopsis top hitse value%identityAlignment
AT3G21000.1 Gag-Pol-related retrotransposon family protein2.0e-0530.12Show/hide
Query:  DEWILDSGCSYHMTPHKHYFLDLKEQDGGRVLMGNNQQCLIKGIGSIKLSLADGTNKILPAVRYVPDLKRNPISLGTLDKAGY
        D WI+      +MTP+  YF  L       V   +    L++G G +K+ + +G  K +  V +VP L RN +S G +    Y
Subjt:  DEWILDSGCSYHMTPHKHYFLDLKEQDGGRVLMGNNQQCLIKGIGSIKLSLADGTNKILPAVRYVPDLKRNPISLGTLDKAGY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTATCAGAGCAGCGATTTGGTGATGGAGTATGTAGATCAGTAGAGATTAGCTTTACTGTTGCAATCTCTATCTGGGCTAAATTCGAATTAGAGAAATTTGACGGGAA
AGGGGATTTTAGTCTCTGGAGGAAGAAGCTCAGGGCAATGCTAGTTCAACTAAAAGTTGCTAGGATTTTAGATAAACCAGAAGATCTTCCTGAGCCCCTATCTGAACCAA
AACGAAAAGAGATGGATGAGATAGCTTTTAGTACAATGATTCTGTACTTGTCTGACTCTGTTCTTCGGCAAGTTGACAAAATTAACAGTGTTGTAGAGTTGTGGAAAAAG
CTTGAATCAATGTATCTTAACAAATATTTGACAAATAAAATTCTCCTAAAAGAACGACTATTCGGTTATAGGATGGACCCTTCGAAACCTCTTGAAGATAATTTGGATGA
ATTTGTCAAAAAATTTTTAGATAGTGACACCGCCGCCAGCCCCTTGCCGAAATCGCGTCGTGCCGCCGCCGTCTGGAGTCACACAGCTCGTTCGCGCGTCCCTCTCTCTC
TCCCGGGTGTCCCTCTCTCCCGTGGACCTCCCTCCCTCTCTACGCCGTCATTCTGGTCGCGCCGCCCAGCCCTTGCAGTCGCCATCGAAGCCCAAGTCCGCCGCTGCCTA
AGCTCGTACAGCGCCGCCATCGCGTGTGTTCTCTTCCCACATCGTTTCATTCTCTATCCATCTCGCGTGTCTCGCATCTTGAAGTATCGATCCTCGCGTCCTCGCCTCTG
TCCAGTAGCGTGTTCGGTTTTTTCAGTGTCTTTTGGCGTTTTCGCACCGTCTAAGTGTTCGATTGAGTTCGATACACTTCAACTCGAACACTCACTGCCCAAGGATTGTT
CTAGCGCGTCGTTAGAGCTTAGGATAACCCGTCTTGCGTTGAAACGGGTTTGTTTGGGTAAGGAGGTTGTCTTCGCTGCTTTAGGCTTGTTGGAGTTGCTTAGGAGCGCC
ATAGCGGAGCGTGACGCGGAAATCACGTGGGTTGTGAGTGCGGAGCGTGACGCGAAAATCACGTATCGGGATGATCTGGACCTTGGTGGCGAGGAGGAGAGCTACGAGGA
AAAATCCCTAAAGTGGTGGTGTCCATGTGGTTATGTTTTCCTAGTTGTTTTAGCAGATGCTAGTCGAATAGCTTGCACTTTCGTTAATGAGTTGTCTTTAAGTTCCGCTG
CGTGGATTCATATTATTGCAGCTGAGGTGCTAACCACTTCAGATCTTAGCCCTACAGATGAATGGATCCTTGATTCCGGTTGCTCCTACCATATGACTCCACATAAGCAT
TACTTTTTAGATTTAAAAGAACAAGATGGTGGGAGAGTCCTCATGGGTAATAATCAGCAGTGTTTAATTAAGGGAATAGGTTCTATAAAACTTAGTCTAGCCGATGGAAC
CAATAAAATTTTACCTGCTGTAAGATATGTTCCAGATTTAAAGAGAAACCCCATCTCATTAGGAACATTAGATAAAGCAGGCTATAGATATAAATCTGAAGGTGGAGTTA
TGAAAATTTCAAAAGGATCCTTACTAATGATGGCCGGTAGCTTGAAAAATGGTCTACATACCTTAAATGGATCATCTACCTTAAACACTGCAAATGTAGCTCCTTCTGAA
ACCAATTCTGAAGTTGTACTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTATCAGAGCAGCGATTTGGTGATGGAGTATGTAGATCAGTAGAGATTAGCTTTACTGTTGCAATCTCTATCTGGGCTAAATTCGAATTAGAGAAATTTGACGGGAA
AGGGGATTTTAGTCTCTGGAGGAAGAAGCTCAGGGCAATGCTAGTTCAACTAAAAGTTGCTAGGATTTTAGATAAACCAGAAGATCTTCCTGAGCCCCTATCTGAACCAA
AACGAAAAGAGATGGATGAGATAGCTTTTAGTACAATGATTCTGTACTTGTCTGACTCTGTTCTTCGGCAAGTTGACAAAATTAACAGTGTTGTAGAGTTGTGGAAAAAG
CTTGAATCAATGTATCTTAACAAATATTTGACAAATAAAATTCTCCTAAAAGAACGACTATTCGGTTATAGGATGGACCCTTCGAAACCTCTTGAAGATAATTTGGATGA
ATTTGTCAAAAAATTTTTAGATAGTGACACCGCCGCCAGCCCCTTGCCGAAATCGCGTCGTGCCGCCGCCGTCTGGAGTCACACAGCTCGTTCGCGCGTCCCTCTCTCTC
TCCCGGGTGTCCCTCTCTCCCGTGGACCTCCCTCCCTCTCTACGCCGTCATTCTGGTCGCGCCGCCCAGCCCTTGCAGTCGCCATCGAAGCCCAAGTCCGCCGCTGCCTA
AGCTCGTACAGCGCCGCCATCGCGTGTGTTCTCTTCCCACATCGTTTCATTCTCTATCCATCTCGCGTGTCTCGCATCTTGAAGTATCGATCCTCGCGTCCTCGCCTCTG
TCCAGTAGCGTGTTCGGTTTTTTCAGTGTCTTTTGGCGTTTTCGCACCGTCTAAGTGTTCGATTGAGTTCGATACACTTCAACTCGAACACTCACTGCCCAAGGATTGTT
CTAGCGCGTCGTTAGAGCTTAGGATAACCCGTCTTGCGTTGAAACGGGTTTGTTTGGGTAAGGAGGTTGTCTTCGCTGCTTTAGGCTTGTTGGAGTTGCTTAGGAGCGCC
ATAGCGGAGCGTGACGCGGAAATCACGTGGGTTGTGAGTGCGGAGCGTGACGCGAAAATCACGTATCGGGATGATCTGGACCTTGGTGGCGAGGAGGAGAGCTACGAGGA
AAAATCCCTAAAGTGGTGGTGTCCATGTGGTTATGTTTTCCTAGTTGTTTTAGCAGATGCTAGTCGAATAGCTTGCACTTTCGTTAATGAGTTGTCTTTAAGTTCCGCTG
CGTGGATTCATATTATTGCAGCTGAGGTGCTAACCACTTCAGATCTTAGCCCTACAGATGAATGGATCCTTGATTCCGGTTGCTCCTACCATATGACTCCACATAAGCAT
TACTTTTTAGATTTAAAAGAACAAGATGGTGGGAGAGTCCTCATGGGTAATAATCAGCAGTGTTTAATTAAGGGAATAGGTTCTATAAAACTTAGTCTAGCCGATGGAAC
CAATAAAATTTTACCTGCTGTAAGATATGTTCCAGATTTAAAGAGAAACCCCATCTCATTAGGAACATTAGATAAAGCAGGCTATAGATATAAATCTGAAGGTGGAGTTA
TGAAAATTTCAAAAGGATCCTTACTAATGATGGCCGGTAGCTTGAAAAATGGTCTACATACCTTAAATGGATCATCTACCTTAAACACTGCAAATGTAGCTCCTTCTGAA
ACCAATTCTGAAGTTGTACTTTGA
Protein sequenceShow/hide protein sequence
MVSEQRFGDGVCRSVEISFTVAISIWAKFELEKFDGKGDFSLWRKKLRAMLVQLKVARILDKPEDLPEPLSEPKRKEMDEIAFSTMILYLSDSVLRQVDKINSVVELWKK
LESMYLNKYLTNKILLKERLFGYRMDPSKPLEDNLDEFVKKFLDSDTAASPLPKSRRAAAVWSHTARSRVPLSLPGVPLSRGPPSLSTPSFWSRRPALAVAIEAQVRRCL
SSYSAAIACVLFPHRFILYPSRVSRILKYRSSRPRLCPVACSVFSVSFGVFAPSKCSIEFDTLQLEHSLPKDCSSASLELRITRLALKRVCLGKEVVFAALGLLELLRSA
IAERDAEITWVVSAERDAKITYRDDLDLGGEEESYEEKSLKWWCPCGYVFLVVLADASRIACTFVNELSLSSAAWIHIIAAEVLTTSDLSPTDEWILDSGCSYHMTPHKH
YFLDLKEQDGGRVLMGNNQQCLIKGIGSIKLSLADGTNKILPAVRYVPDLKRNPISLGTLDKAGYRYKSEGGVMKISKGSLLMMAGSLKNGLHTLNGSSTLNTANVAPSE
TNSEVVL