; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0028128 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0028128
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPolyprotein
Genome locationchr8:13824755..13828151
RNA-Seq ExpressionLag0028128
SyntenyLag0028128
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025139.1 Transposon Ty3-G Gag-Pol polyprotein [Cucumis melo var. makuwa]3.8e-3825.9Show/hide
Query:  KRAVILGKTRGMAQ-----RQLEDRLAESEKNLEGMK---EKLPEIEKSVAELNRSMEKLFNSVEDQRQLSLENQQALANLVHGGFQ-----RKEHRG--
        K+A  +G  +G  Q      Q+      +E+ LE ++   ++LP IE+++A L R++ ++ + ++ Q Q     QQ +   + G  +     RK   G  
Subjt:  KRAVILGKTRGMAQ-----RQLEDRLAESEKNLEGMK---EKLPEIEKSVAELNRSMEKLFNSVEDQRQLSLENQQALANLVHGGFQ-----RKEHRG--

Query:  ------------------------EEQFHERHKFKKVEMPAFDGDQSDDWLSSAERYLDIHQLTDQGKITVTAISFTEAALRWYRWAGCRQPFSGWRDLK
                                EE+  +R KFKKVEMP FDG   D WL  A+RY  IH L+D  K+TV  ISF   AL WYR    R+ F+GW DLK
Subjt:  ------------------------EEQFHERHKFKKVEMPAFDGDQSDDWLSSAERYLDIHQLTDQGKITVTAISFTEAALRWYRWAGCRQPFSGWRDLK

Query:  LRILERFRPYRRDRYVPIFLRL--EAMMKATQRIEDK--------NSALLTKTGSGGTRSWKPSPPRVVGPTN---------------------ALGPSY
         ++L RFR  R    V  FL +  E  ++  +   DK         + +L +T   G   W  S    + P                        L  +Y
Subjt:  LRILERFRPYRRDRYVPIFLRL--EAMMKATQRIEDK--------NSALLTKTGSGGTRSWKPSPPRVVGPTN---------------------ALGPSY

Query:  TPKLLEPN--------------------PIKTITLPNI-TSHQRKEAPLKLVSD--------------------------------------DQDGKDEA
          K    N                    P++TITL  + T   R+E P K +SD                                       + G++  
Subjt:  TPKLLEPN--------------------PIKTITLPNI-TSHQRKEAPLKLVSD--------------------------------------DQDGKDEA

Query:  IGNQN--PRECQVEEVEESSAEMDTAELSLNTVVGFSSPGS-----------------------------VSRMNI------------------------
        I  +     E  +++VE  +AE    ELSLN+VVG ++PG+                             V+++ +                        
Subjt:  IGNQN--PRECQVEEVEESSAEMDTAELSLNTVVGFSSPGS-----------------------------VSRMNI------------------------

Query:  -----------------------------------------------------------------------NQLMPDHTDPHEFVLKSGSDG--------
                                                                                 L P H+  H   LKSG+D         
Subjt:  -----------------------------------------------------------------------NQLMPDHTDPHEFVLKSGSDG--------

Query:  -----------------------------------KKEGRNWRFCVDYKALNEATVPDKFPIPIIEELLDELHGARVFLKLDLRSRYHKIRVHSDDIHKT
                                           +K+  +WRFCVDY+ALN  T+PDKFPIP+IEEL DEL GA VF K+DL++ YH+IR+  +DI KT
Subjt:  -----------------------------------KKEGRNWRFCVDYKALNEATVPDKFPIPIIEELLDELHGARVFLKLDLRSRYHKIRVHSDDIHKT

Query:  AFRTHDDHYEFLVMPFGLTNAP
        AFRTH+ HYEFLVMPFGLTNAP
Subjt:  AFRTHDDHYEFLVMPFGLTNAP

KAA0035327.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]7.7e-3929.16Show/hide
Query:  EEQFHERHKFKKVEMPAFDGDQSDDWLSSAERYLDIHQLTDQGKITVTAISFTEAALRWYRWAGCRQPFSGWRDLKLRILERFRPYRRDRYVPIFLRL--
        +++  +R KFKKVEM  F G   D WL  A+RY  IH L +  K++V  ISF   AL WYR    R+ F  W DLK ++L RFR  R    V  FL +  
Subjt:  EEQFHERHKFKKVEMPAFDGDQSDDWLSSAERYLDIHQLTDQGKITVTAISFTEAALRWYRWAGCRQPFSGWRDLKLRILERFRPYRRDRYVPIFLRL--

Query:  EAMMKATQRIEDK--------NSALLTKTGSGGTRSWKPSPPRVVGPTN---------------------ALGPSY-------TPKLLE-----------
        E M++  Q   DK         + +L +T   G   W  +   V+ P                        L   Y        PK  E           
Subjt:  EAMMKATQRIEDK--------NSALLTKTGSGGTRSWKPSPPRVVGPTN---------------------ALGPSY-------TPKLLE-----------

Query:  --PNPIKTITLPNI-TSHQRKEAPLK---------LVSDDQDGKDEAIGNQNPRECQVEEV-EESSAEMDTAELSLNTVVGFSSPGSVSRMN--------
            P++T+TL  +  +  R+E P K         LV  +   + E IG     E   E V E  + E    ELS+N++VG ++ G++            
Subjt:  --PNPIKTITLPNI-TSHQRKEAPLK---------LVSDDQDGKDEAIGNQNPRECQVEEV-EESSAEMDTAELSLNTVVGFSSPGSVSRMN--------

Query:  ------------------------INQLMP----------------------------DHTDPHEFVLKSGSD---------------------------
                                ++  +P                               D H + LK G++                           
Subjt:  ------------------------INQLMP----------------------------DHTDPHEFVLKSGSD---------------------------

Query:  -----------------GKKEGRNWRFCVDYKALNEATVPDKFPIPIIEELLDELHGARVFLKLDLRSRYHKIRVHSDDIHKTAFRTHDDHYEFLVMPFG
                          KK+G +WRFCVDY+ALN  T+PDK PIP+IEEL D L+GA +F K+DL++ Y +IR+H +D+ KT FRTH  HYEFLVMPFG
Subjt:  -----------------GKKEGRNWRFCVDYKALNEATVPDKFPIPIIEELLDELHGARVFLKLDLRSRYHKIRVHSDDIHKTAFRTHDDHYEFLVMPFG

Query:  LTNAPYNSSTI
        LTNAP    T+
Subjt:  LTNAPYNSSTI

KAA0045216.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]5.1e-3524.35Show/hide
Query:  AVILGKTRGMAQRQLEDRLAESEKNLEGMK---EKLPEIEKSVAELNRSMEKLFNSVEDQRQLSLENQQALANLVHGGFQ-----RKEHRG---------
        A+++     +   Q+      +++ LE ++   ++LP IE+++A L +S+ ++ + ++ Q Q     QQ +   + G  +     R+   G         
Subjt:  AVILGKTRGMAQRQLEDRLAESEKNLEGMK---EKLPEIEKSVAELNRSMEKLFNSVEDQRQLSLENQQALANLVHGGFQ-----RKEHRG---------

Query:  -----------------EEQFHERHKFKKVEMPAFDGDQSDDWLSSAERYLDIHQLTDQGKITVTAISFTEAALRWYRWAGCRQPFSGWRDLKLRILERF
                         EE+  +R KFKKVEMP FDG   D WL  A+RY  IH LTD  ++TV  ISF   AL WYR    R+ F+GW DLKL++L RF
Subjt:  -----------------EEQFHERHKFKKVEMPAFDGDQSDDWLSSAERYLDIHQLTDQGKITVTAISFTEAALRWYRWAGCRQPFSGWRDLKLRILERF

Query:  RPYRRDRYVPIFLRL--EAMMKATQRIEDK--------NSALLTKTGSGGTRSWKPSPPRVVGPTN---------------------ALGPSYTPKLLEP
        R  R    V  FL +  E+ ++  +   DK         + +L +T   G   W  S    + P                        L  +Y  K    
Subjt:  RPYRRDRYVPIFLRL--EAMMKATQRIEDK--------NSALLTKTGSGGTRSWKPSPPRVVGPTN---------------------ALGPSYTPKLLEP

Query:  N--------------------PIKTITLPNI-TSHQRKEAPLKLVSD--------------------------------------DQDGKDEAIGNQN--
                             P++TITL  + T   R+E P K +SD                                       + G++  I  +   
Subjt:  N--------------------PIKTITLPNI-TSHQRKEAPLKLVSD--------------------------------------DQDGKDEAIGNQN--

Query:  PRECQVEEVEESSAEMDTAELSLNTVVGFSSPGSV-----------------------------------------------------------------
          E ++++VE  + E    ELS+N+VVG ++PG++                                                                 
Subjt:  PRECQVEEVEESSAEMDTAELSLNTVVGFSSPGSV-----------------------------------------------------------------

Query:  ------------------SRMNINQLM---------------------------------------------------------PDHTDPHEFVLKSGSD
                          +R+++  L+                                                         P  +  H   LKSG+D
Subjt:  ------------------SRMNINQLM---------------------------------------------------------PDHTDPHEFVLKSGSD

Query:  G-------------------------------------------KKEGRNWRFCVDYKALNEATVPDKFPIPIIEELLDELHGARVFLKLDLRSRYHKIR
                                                    +K+  +WRFCVDY+ALN  T+PDKFPIP+IEEL DEL GA VF KLDL++ YH+IR
Subjt:  G-------------------------------------------KKEGRNWRFCVDYKALNEATVPDKFPIPIIEELLDELHGARVFLKLDLRSRYHKIR

Query:  VHSDDIHKTAFRTHDDHYEFLVMPFGLTNAP
        +  +DI KTAFRTH+ HYEFLVM FGLTNAP
Subjt:  VHSDDIHKTAFRTHDDHYEFLVMPFGLTNAP

KAA0065906.1 transposon Tf2-1 polyprotein isoform X1 [Cucumis melo var. makuwa]5.0e-3832.57Show/hide
Query:  EEQFHERHKFKKVEMPAFDGDQSDDWLSSAERYLDIHQLTDQGKITVTAISFTEAALRWYRWAGCRQPFSGWRDLKLRILERFRPYRRDRYVPIFLRL--
        +++  +R KFKKVEMP F+G   D WL  A+RY  IH L +  K+ V  ISF +  L WYR    R+ F  W DLK ++L RF+  R    V  FL +  
Subjt:  EEQFHERHKFKKVEMPAFDGDQSDDWLSSAERYLDIHQLTDQGKITVTAISFTEAALRWYRWAGCRQPFSGWRDLKLRILERFRPYRRDRYVPIFLRL--

Query:  EAMMKATQRIEDK--------NSALLTKTGSGGTRSWKPSPPRVVGPTNALGPSYTPKLLEPNPIKTITLPNITSHQRKEAPLKLVSDDQDGKDEAIGNQ
        E  ++  +   DK         + +L +T   G   W  +   V+ P   +                         Q  +  LK+       K E++G  
Subjt:  EAMMKATQRIEDK--------NSALLTKTGSGGTRSWKPSPPRVVGPTNALGPSYTPKLLEPNPIKTITLPNITSHQRKEAPLKLVSDDQDGKDEAIGNQ

Query:  NPRECQVEEVEESSAEMDTAELSLNTVVGFSSPGSVSRMNINQLMPDHTDPHEFVLKSGSDGKKEGRNWRFCVDYKALNEATVPDKFPIPIIEELLDELH
                          T+ L +                                      +K+ R+WRFCVDY+ALN  T+ DKFPIPIIE+L +EL+
Subjt:  NPRECQVEEVEESSAEMDTAELSLNTVVGFSSPGSVSRMNINQLMPDHTDPHEFVLKSGSDGKKEGRNWRFCVDYKALNEATVPDKFPIPIIEELLDELH

Query:  GARVFLKLDLRSRYHKIRVHSDDIHKTAFRTHDDHYEFLVMPFGLTNAPY
        GA +F ++DL++ YH+IR++  D+ KTAFRTH+ HY+FLVMPF LTNAP+
Subjt:  GARVFLKLDLRSRYHKIRVHSDDIHKTAFRTHDDHYEFLVMPFGLTNAPY

TYK14310.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]7.7e-3929.16Show/hide
Query:  EEQFHERHKFKKVEMPAFDGDQSDDWLSSAERYLDIHQLTDQGKITVTAISFTEAALRWYRWAGCRQPFSGWRDLKLRILERFRPYRRDRYVPIFLRL--
        +++  +R KFKKVEM  F G   D WL  A+RY  IH L +  K++V  ISF   AL WYR    R+ F  W DLK ++L RFR  R    V  FL +  
Subjt:  EEQFHERHKFKKVEMPAFDGDQSDDWLSSAERYLDIHQLTDQGKITVTAISFTEAALRWYRWAGCRQPFSGWRDLKLRILERFRPYRRDRYVPIFLRL--

Query:  EAMMKATQRIEDK--------NSALLTKTGSGGTRSWKPSPPRVVGPTN---------------------ALGPSY-------TPKLLE-----------
        E M++  Q   DK         + +L +T   G   W  +   V+ P                        L   Y        PK  E           
Subjt:  EAMMKATQRIEDK--------NSALLTKTGSGGTRSWKPSPPRVVGPTN---------------------ALGPSY-------TPKLLE-----------

Query:  --PNPIKTITLPNI-TSHQRKEAPLK---------LVSDDQDGKDEAIGNQNPRECQVEEV-EESSAEMDTAELSLNTVVGFSSPGSVSRMN--------
            P++T+TL  +  +  R+E P K         LV  +   + E IG     E   E V E  + E    ELS+N++VG ++ G++            
Subjt:  --PNPIKTITLPNI-TSHQRKEAPLK---------LVSDDQDGKDEAIGNQNPRECQVEEV-EESSAEMDTAELSLNTVVGFSSPGSVSRMN--------

Query:  ------------------------INQLMP----------------------------DHTDPHEFVLKSGSD---------------------------
                                ++  +P                               D H + LK G++                           
Subjt:  ------------------------INQLMP----------------------------DHTDPHEFVLKSGSD---------------------------

Query:  -----------------GKKEGRNWRFCVDYKALNEATVPDKFPIPIIEELLDELHGARVFLKLDLRSRYHKIRVHSDDIHKTAFRTHDDHYEFLVMPFG
                          KK+G +WRFCVDY+ALN  T+PDK PIP+IEEL D L+GA +F K+DL++ Y +IR+H +D+ KT FRTH  HYEFLVMPFG
Subjt:  -----------------GKKEGRNWRFCVDYKALNEATVPDKFPIPIIEELLDELHGARVFLKLDLRSRYHKIRVHSDDIHKTAFRTHDDHYEFLVMPFG

Query:  LTNAPYNSSTI
        LTNAP    T+
Subjt:  LTNAPYNSSTI

TrEMBL top hitse value%identityAlignment
A0A2N9EGK1 Uncharacterized protein7.7e-3729.3Show/hide
Query:  MAQRQLEDRLAESEKNLEGMKEKLPEIEKSVAELNRSMEKLFNSVEDQRQLSLENQQALANLVHGGFQRKEHRGEEQFHE---RHKFKKVEMPAFDGDQS
        ++   L  +L E  + + G+++ +  + ++VA+LNR              L+  N     N  HGGF    + G    +    + +F  ++ P F+GD  
Subjt:  MAQRQLEDRLAESEKNLEGMKEKLPEIEKSVAELNRSMEKLFNSVEDQRQLSLENQQALANLVHGGFQRKEHRGEEQFHE---RHKFKKVEMPAFDGDQS

Query:  DDWLSSAERYLDIHQLTDQGKITVTAISFTEAALRWYRWAGCRQPFSGWRDLKLRILERFRPYRRDRYVPIFLRLEAMMKATQRIEDKNSALLTKTGSGG
           +  A+++        Q K+ + +    + AL+WY+W    QP   W +    +  RF P   + +     RL    K  +  + +   L  +     
Subjt:  DDWLSSAERYLDIHQLTDQGKITVTAISFTEAALRWYRWAGCRQPFSGWRDLKLRILERFRPYRRDRYVPIFLRLEAMMKATQRIEDKNSALLTKTGSGG

Query:  TRSWKPSPPRVVGPTNALGPSYTPKLLEPNPIKTITLPNITSHQRKEAPLKLVSDDQDGKDEAIGNQNPRECQVEEVEESSAEMDTAELSLNTVVGFSSP
         + W         P  AL   Y   L E          +I +  +   P  L+     G+ E+  N+  +    EE  E++ +    E+SL+T+ G    
Subjt:  TRSWKPSPPRVVGPTNALGPSYTPKLLEPNPIKTITLPNITSHQRKEAPLKLVSDDQDGKDEAIGNQNPRECQVEEVEESSAEMDTAELSLNTVVGFSSP

Query:  GSVSRMNINQLMPDHTDPHEFVLKSGSDGKKEGRNWRFCVDYKALNEATVPDKFPIPIIEELLDELHGARVFLKLDLRSRYHKIRVHSDDIHKTAFRTHD
         ++  M +                     KK+  +WRFCVDYKALN   + D++PIP+I+ELLDELHG   F KLDL+S YH+IRV  +D+HKTAFRTHD
Subjt:  GSVSRMNINQLMPDHTDPHEFVLKSGSDGKKEGRNWRFCVDYKALNEATVPDKFPIPIIEELLDELHGARVFLKLDLRSRYHKIRVHSDDIHKTAFRTHD

Query:  DHYEFLVMPFGLTNAPYNSSTIGMGNLFGT
         HYE LVMPFGLTNAP    ++ M ++F T
Subjt:  DHYEFLVMPFGLTNAPYNSSTIGMGNLFGT

A0A5A7T158 Ty3-gypsy retrotransposon protein3.7e-3929.16Show/hide
Query:  EEQFHERHKFKKVEMPAFDGDQSDDWLSSAERYLDIHQLTDQGKITVTAISFTEAALRWYRWAGCRQPFSGWRDLKLRILERFRPYRRDRYVPIFLRL--
        +++  +R KFKKVEM  F G   D WL  A+RY  IH L +  K++V  ISF   AL WYR    R+ F  W DLK ++L RFR  R    V  FL +  
Subjt:  EEQFHERHKFKKVEMPAFDGDQSDDWLSSAERYLDIHQLTDQGKITVTAISFTEAALRWYRWAGCRQPFSGWRDLKLRILERFRPYRRDRYVPIFLRL--

Query:  EAMMKATQRIEDK--------NSALLTKTGSGGTRSWKPSPPRVVGPTN---------------------ALGPSY-------TPKLLE-----------
        E M++  Q   DK         + +L +T   G   W  +   V+ P                        L   Y        PK  E           
Subjt:  EAMMKATQRIEDK--------NSALLTKTGSGGTRSWKPSPPRVVGPTN---------------------ALGPSY-------TPKLLE-----------

Query:  --PNPIKTITLPNI-TSHQRKEAPLK---------LVSDDQDGKDEAIGNQNPRECQVEEV-EESSAEMDTAELSLNTVVGFSSPGSVSRMN--------
            P++T+TL  +  +  R+E P K         LV  +   + E IG     E   E V E  + E    ELS+N++VG ++ G++            
Subjt:  --PNPIKTITLPNI-TSHQRKEAPLK---------LVSDDQDGKDEAIGNQNPRECQVEEV-EESSAEMDTAELSLNTVVGFSSPGSVSRMN--------

Query:  ------------------------INQLMP----------------------------DHTDPHEFVLKSGSD---------------------------
                                ++  +P                               D H + LK G++                           
Subjt:  ------------------------INQLMP----------------------------DHTDPHEFVLKSGSD---------------------------

Query:  -----------------GKKEGRNWRFCVDYKALNEATVPDKFPIPIIEELLDELHGARVFLKLDLRSRYHKIRVHSDDIHKTAFRTHDDHYEFLVMPFG
                          KK+G +WRFCVDY+ALN  T+PDK PIP+IEEL D L+GA +F K+DL++ Y +IR+H +D+ KT FRTH  HYEFLVMPFG
Subjt:  -----------------GKKEGRNWRFCVDYKALNEATVPDKFPIPIIEELLDELHGARVFLKLDLRSRYHKIRVHSDDIHKTAFRTHDDHYEFLVMPFG

Query:  LTNAPYNSSTI
        LTNAP    T+
Subjt:  LTNAPYNSSTI

A0A5A7VHR5 Transposon Tf2-1 polyprotein isoform X12.4e-3832.57Show/hide
Query:  EEQFHERHKFKKVEMPAFDGDQSDDWLSSAERYLDIHQLTDQGKITVTAISFTEAALRWYRWAGCRQPFSGWRDLKLRILERFRPYRRDRYVPIFLRL--
        +++  +R KFKKVEMP F+G   D WL  A+RY  IH L +  K+ V  ISF +  L WYR    R+ F  W DLK ++L RF+  R    V  FL +  
Subjt:  EEQFHERHKFKKVEMPAFDGDQSDDWLSSAERYLDIHQLTDQGKITVTAISFTEAALRWYRWAGCRQPFSGWRDLKLRILERFRPYRRDRYVPIFLRL--

Query:  EAMMKATQRIEDK--------NSALLTKTGSGGTRSWKPSPPRVVGPTNALGPSYTPKLLEPNPIKTITLPNITSHQRKEAPLKLVSDDQDGKDEAIGNQ
        E  ++  +   DK         + +L +T   G   W  +   V+ P   +                         Q  +  LK+       K E++G  
Subjt:  EAMMKATQRIEDK--------NSALLTKTGSGGTRSWKPSPPRVVGPTNALGPSYTPKLLEPNPIKTITLPNITSHQRKEAPLKLVSDDQDGKDEAIGNQ

Query:  NPRECQVEEVEESSAEMDTAELSLNTVVGFSSPGSVSRMNINQLMPDHTDPHEFVLKSGSDGKKEGRNWRFCVDYKALNEATVPDKFPIPIIEELLDELH
                          T+ L +                                      +K+ R+WRFCVDY+ALN  T+ DKFPIPIIE+L +EL+
Subjt:  NPRECQVEEVEESSAEMDTAELSLNTVVGFSSPGSVSRMNINQLMPDHTDPHEFVLKSGSDGKKEGRNWRFCVDYKALNEATVPDKFPIPIIEELLDELH

Query:  GARVFLKLDLRSRYHKIRVHSDDIHKTAFRTHDDHYEFLVMPFGLTNAPY
        GA +F ++DL++ YH+IR++  D+ KTAFRTH+ HY+FLVMPF LTNAP+
Subjt:  GARVFLKLDLRSRYHKIRVHSDDIHKTAFRTHDDHYEFLVMPFGLTNAPY

A0A5D3CSC2 Ty3-gypsy retrotransposon protein3.7e-3929.16Show/hide
Query:  EEQFHERHKFKKVEMPAFDGDQSDDWLSSAERYLDIHQLTDQGKITVTAISFTEAALRWYRWAGCRQPFSGWRDLKLRILERFRPYRRDRYVPIFLRL--
        +++  +R KFKKVEM  F G   D WL  A+RY  IH L +  K++V  ISF   AL WYR    R+ F  W DLK ++L RFR  R    V  FL +  
Subjt:  EEQFHERHKFKKVEMPAFDGDQSDDWLSSAERYLDIHQLTDQGKITVTAISFTEAALRWYRWAGCRQPFSGWRDLKLRILERFRPYRRDRYVPIFLRL--

Query:  EAMMKATQRIEDK--------NSALLTKTGSGGTRSWKPSPPRVVGPTN---------------------ALGPSY-------TPKLLE-----------
        E M++  Q   DK         + +L +T   G   W  +   V+ P                        L   Y        PK  E           
Subjt:  EAMMKATQRIEDK--------NSALLTKTGSGGTRSWKPSPPRVVGPTN---------------------ALGPSY-------TPKLLE-----------

Query:  --PNPIKTITLPNI-TSHQRKEAPLK---------LVSDDQDGKDEAIGNQNPRECQVEEV-EESSAEMDTAELSLNTVVGFSSPGSVSRMN--------
            P++T+TL  +  +  R+E P K         LV  +   + E IG     E   E V E  + E    ELS+N++VG ++ G++            
Subjt:  --PNPIKTITLPNI-TSHQRKEAPLK---------LVSDDQDGKDEAIGNQNPRECQVEEV-EESSAEMDTAELSLNTVVGFSSPGSVSRMN--------

Query:  ------------------------INQLMP----------------------------DHTDPHEFVLKSGSD---------------------------
                                ++  +P                               D H + LK G++                           
Subjt:  ------------------------INQLMP----------------------------DHTDPHEFVLKSGSD---------------------------

Query:  -----------------GKKEGRNWRFCVDYKALNEATVPDKFPIPIIEELLDELHGARVFLKLDLRSRYHKIRVHSDDIHKTAFRTHDDHYEFLVMPFG
                          KK+G +WRFCVDY+ALN  T+PDK PIP+IEEL D L+GA +F K+DL++ Y +IR+H +D+ KT FRTH  HYEFLVMPFG
Subjt:  -----------------GKKEGRNWRFCVDYKALNEATVPDKFPIPIIEELLDELHGARVFLKLDLRSRYHKIRVHSDDIHKTAFRTHDDHYEFLVMPFG

Query:  LTNAPYNSSTI
        LTNAP    T+
Subjt:  LTNAPYNSSTI

A0A5D3DIM6 Transposon Ty3-G Gag-Pol polyprotein1.8e-3825.9Show/hide
Query:  KRAVILGKTRGMAQ-----RQLEDRLAESEKNLEGMK---EKLPEIEKSVAELNRSMEKLFNSVEDQRQLSLENQQALANLVHGGFQ-----RKEHRG--
        K+A  +G  +G  Q      Q+      +E+ LE ++   ++LP IE+++A L R++ ++ + ++ Q Q     QQ +   + G  +     RK   G  
Subjt:  KRAVILGKTRGMAQ-----RQLEDRLAESEKNLEGMK---EKLPEIEKSVAELNRSMEKLFNSVEDQRQLSLENQQALANLVHGGFQ-----RKEHRG--

Query:  ------------------------EEQFHERHKFKKVEMPAFDGDQSDDWLSSAERYLDIHQLTDQGKITVTAISFTEAALRWYRWAGCRQPFSGWRDLK
                                EE+  +R KFKKVEMP FDG   D WL  A+RY  IH L+D  K+TV  ISF   AL WYR    R+ F+GW DLK
Subjt:  ------------------------EEQFHERHKFKKVEMPAFDGDQSDDWLSSAERYLDIHQLTDQGKITVTAISFTEAALRWYRWAGCRQPFSGWRDLK

Query:  LRILERFRPYRRDRYVPIFLRL--EAMMKATQRIEDK--------NSALLTKTGSGGTRSWKPSPPRVVGPTN---------------------ALGPSY
         ++L RFR  R    V  FL +  E  ++  +   DK         + +L +T   G   W  S    + P                        L  +Y
Subjt:  LRILERFRPYRRDRYVPIFLRL--EAMMKATQRIEDK--------NSALLTKTGSGGTRSWKPSPPRVVGPTN---------------------ALGPSY

Query:  TPKLLEPN--------------------PIKTITLPNI-TSHQRKEAPLKLVSD--------------------------------------DQDGKDEA
          K    N                    P++TITL  + T   R+E P K +SD                                       + G++  
Subjt:  TPKLLEPN--------------------PIKTITLPNI-TSHQRKEAPLKLVSD--------------------------------------DQDGKDEA

Query:  IGNQN--PRECQVEEVEESSAEMDTAELSLNTVVGFSSPGS-----------------------------VSRMNI------------------------
        I  +     E  +++VE  +AE    ELSLN+VVG ++PG+                             V+++ +                        
Subjt:  IGNQN--PRECQVEEVEESSAEMDTAELSLNTVVGFSSPGS-----------------------------VSRMNI------------------------

Query:  -----------------------------------------------------------------------NQLMPDHTDPHEFVLKSGSDG--------
                                                                                 L P H+  H   LKSG+D         
Subjt:  -----------------------------------------------------------------------NQLMPDHTDPHEFVLKSGSDG--------

Query:  -----------------------------------KKEGRNWRFCVDYKALNEATVPDKFPIPIIEELLDELHGARVFLKLDLRSRYHKIRVHSDDIHKT
                                           +K+  +WRFCVDY+ALN  T+PDKFPIP+IEEL DEL GA VF K+DL++ YH+IR+  +DI KT
Subjt:  -----------------------------------KKEGRNWRFCVDYKALNEATVPDKFPIPIIEELLDELHGARVFLKLDLRSRYHKIRVHSDDIHKT

Query:  AFRTHDDHYEFLVMPFGLTNAP
        AFRTH+ HYEFLVMPFGLTNAP
Subjt:  AFRTHDDHYEFLVMPFGLTNAP

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein2.9e-1750.57Show/hide
Query:  KKEGRNWRFCVDYKALNEATVPDKFPIPIIEELLDELHGARVFLKLDLRSRYHKIRVHSDDIHKTAFRTHDDHYEFLVMPFGLTNAP
        KKEG   R  VDYK LN+   P+ +P+P+IE+LL ++ G+ +F KLDL+S YH IRV   D HK AFR     +E+LVMP+G++ AP
Subjt:  KKEGRNWRFCVDYKALNEATVPDKFPIPIIEELLDELHGARVFLKLDLRSRYHKIRVHSDDIHKTAFRTHDDHYEFLVMPFGLTNAP

P0CT35 Transposon Tf2-2 polyprotein2.9e-1750.57Show/hide
Query:  KKEGRNWRFCVDYKALNEATVPDKFPIPIIEELLDELHGARVFLKLDLRSRYHKIRVHSDDIHKTAFRTHDDHYEFLVMPFGLTNAP
        KKEG   R  VDYK LN+   P+ +P+P+IE+LL ++ G+ +F KLDL+S YH IRV   D HK AFR     +E+LVMP+G++ AP
Subjt:  KKEGRNWRFCVDYKALNEATVPDKFPIPIIEELLDELHGARVFLKLDLRSRYHKIRVHSDDIHKTAFRTHDDHYEFLVMPFGLTNAP

P0CT36 Transposon Tf2-3 polyprotein2.9e-1750.57Show/hide
Query:  KKEGRNWRFCVDYKALNEATVPDKFPIPIIEELLDELHGARVFLKLDLRSRYHKIRVHSDDIHKTAFRTHDDHYEFLVMPFGLTNAP
        KKEG   R  VDYK LN+   P+ +P+P+IE+LL ++ G+ +F KLDL+S YH IRV   D HK AFR     +E+LVMP+G++ AP
Subjt:  KKEGRNWRFCVDYKALNEATVPDKFPIPIIEELLDELHGARVFLKLDLRSRYHKIRVHSDDIHKTAFRTHDDHYEFLVMPFGLTNAP

P0CT37 Transposon Tf2-4 polyprotein2.9e-1750.57Show/hide
Query:  KKEGRNWRFCVDYKALNEATVPDKFPIPIIEELLDELHGARVFLKLDLRSRYHKIRVHSDDIHKTAFRTHDDHYEFLVMPFGLTNAP
        KKEG   R  VDYK LN+   P+ +P+P+IE+LL ++ G+ +F KLDL+S YH IRV   D HK AFR     +E+LVMP+G++ AP
Subjt:  KKEGRNWRFCVDYKALNEATVPDKFPIPIIEELLDELHGARVFLKLDLRSRYHKIRVHSDDIHKTAFRTHDDHYEFLVMPFGLTNAP

P0CT41 Transposon Tf2-12 polyprotein2.9e-1750.57Show/hide
Query:  KKEGRNWRFCVDYKALNEATVPDKFPIPIIEELLDELHGARVFLKLDLRSRYHKIRVHSDDIHKTAFRTHDDHYEFLVMPFGLTNAP
        KKEG   R  VDYK LN+   P+ +P+P+IE+LL ++ G+ +F KLDL+S YH IRV   D HK AFR     +E+LVMP+G++ AP
Subjt:  KKEGRNWRFCVDYKALNEATVPDKFPIPIIEELLDELHGARVFLKLDLRSRYHKIRVHSDDIHKTAFRTHDDHYEFLVMPFGLTNAP

Arabidopsis top hitse value%identityAlignment
AT1G67020.1 unknown protein2.8e-0724.8Show/hide
Query:  KKVEMPAFDGDQSDDWLSSAERYLDIHQLTDQGKITVTAISFTEAALRWYRWAGCRQPFSGWRDLKLRILERFRPYRRDRYVPIFLRLEAMMKATQRIED
        +++EMP FDG    +W S  ER+  + +  D  K+ + A+S    AL+W+        F  W   + R+L RF P +      + +    +       E 
Subjt:  KKVEMPAFDGDQSDDWLSSAERYLDIHQLTDQGKITVTAISFTEAALRWYRWAGCRQPFSGWRDLKLRILERFRPYRRDRYVPIFLRLEAMMKATQRIED

Query:  KNSALLTKTGSGGTRSWKPSPPRVV
        + +  L +      R  K S P +V
Subjt:  KNSALLTKTGSGGTRSWKPSPPRVV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGTCGAAGGACGACGATTTCGTTCGCCAATGGACCATTTTGCTTGCAATCGTTGCCAAGGCCTTGACCGTCTCCTCAGAAGTTTGCGAAAGCTTCGCGCGGGTTCC
TTTCGATCGTAGTTTAGCGTTACCCTGGCCTAAGGGGATGGAGCAGAGTCTGTCGAAGCACGTCTGTAGCCATGGAAGAAGAGACTTTGCAAAAGATAAGTGTTGGCCAT
GGTCGCTGGTGTCGAGGGTCATGGTGATGGGTTGGAGAAGGATCGGCGGGGGCTATGGTGGTATGGAGAAGCGAGCGGTAATCCTGGGAAAGACACGAGGAATGGCGCAG
AGACAGTTGGAGGATCGTTTGGCAGAGTCTGAGAAGAACTTAGAAGGGATGAAGGAAAAATTACCTGAGATAGAGAAGTCGGTTGCGGAACTCAATCGAAGTATGGAGAA
ATTGTTCAATAGTGTAGAAGATCAGAGACAACTTTCTTTAGAGAATCAACAAGCGTTAGCTAATCTCGTCCATGGCGGATTCCAACGGAAGGAACACAGAGGAGAGGAGC
AGTTCCACGAGCGTCATAAGTTTAAGAAGGTGGAGATGCCGGCGTTTGATGGCGACCAATCGGACGACTGGCTCTCTAGTGCGGAAAGATACTTAGATATTCATCAGTTA
ACAGATCAAGGGAAGATAACGGTGACAGCAATCAGCTTCACCGAAGCCGCGCTAAGATGGTACCGATGGGCTGGTTGTAGACAACCTTTTTCTGGATGGAGGGATCTGAA
ACTCCGAATCCTGGAACGATTTAGACCTTACAGGAGGGATCGTTATGTGCCCATTTTCTTGCGCCTTGAGGCCATGATGAAAGCAACCCAGCGCATCGAAGATAAGAACT
CGGCCCTATTGACTAAAACAGGGTCAGGAGGAACCCGATCGTGGAAACCTTCTCCACCAAGAGTTGTTGGGCCAACCAACGCCCTTGGGCCTTCTTATACACCAAAATTG
CTCGAGCCCAACCCAATCAAAACAATTACTCTGCCCAACATTACATCACATCAACGGAAGGAGGCGCCACTTAAGCTTGTATCTGACGATCAAGACGGGAAGGACGAAGC
AATTGGAAATCAAAACCCACGAGAGTGCCAGGTCGAAGAAGTAGAGGAGTCTTCTGCCGAAATGGATACTGCCGAGTTATCCTTAAACACTGTGGTGGGATTTTCATCGC
CAGGATCTGTCTCAAGGATGAACATCAACCAGTTAATGCCCGACCATACAGACCCACACGAGTTCGTTCTCAAGTCTGGTTCTGATGGTAAAAAAGAAGGACGGAATTGG
CGGTTTTGTGTCGACTACAAGGCGCTCAATGAAGCAACAGTGCCTGATAAGTTCCCTATTCCCATTATAGAGGAACTGTTAGACGAGTTACACGGGGCTCGGGTATTTTT
AAAGCTAGACCTTCGGTCGAGATACCACAAAATCCGAGTACACTCCGATGATATCCACAAAACCGCTTTCCGCACGCATGACGACCATTACGAATTTCTTGTAATGCCTT
TCGGCTTGACTAACGCACCATACAATTCATCAACCATAGGAATGGGGAACTTATTTGGTACTGTGACCTTATTGAATGCTCAATATTCAATACAGAAACGTCAGCTCCCA
TCTTTCTTCTTAACCAACAGAATAAGGCTGGAAAATGCACTCGAGCTAGGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGACGTCGAAGGACGACGATTTCGTTCGCCAATGGACCATTTTGCTTGCAATCGTTGCCAAGGCCTTGACCGTCTCCTCAGAAGTTTGCGAAAGCTTCGCGCGGGTTCC
TTTCGATCGTAGTTTAGCGTTACCCTGGCCTAAGGGGATGGAGCAGAGTCTGTCGAAGCACGTCTGTAGCCATGGAAGAAGAGACTTTGCAAAAGATAAGTGTTGGCCAT
GGTCGCTGGTGTCGAGGGTCATGGTGATGGGTTGGAGAAGGATCGGCGGGGGCTATGGTGGTATGGAGAAGCGAGCGGTAATCCTGGGAAAGACACGAGGAATGGCGCAG
AGACAGTTGGAGGATCGTTTGGCAGAGTCTGAGAAGAACTTAGAAGGGATGAAGGAAAAATTACCTGAGATAGAGAAGTCGGTTGCGGAACTCAATCGAAGTATGGAGAA
ATTGTTCAATAGTGTAGAAGATCAGAGACAACTTTCTTTAGAGAATCAACAAGCGTTAGCTAATCTCGTCCATGGCGGATTCCAACGGAAGGAACACAGAGGAGAGGAGC
AGTTCCACGAGCGTCATAAGTTTAAGAAGGTGGAGATGCCGGCGTTTGATGGCGACCAATCGGACGACTGGCTCTCTAGTGCGGAAAGATACTTAGATATTCATCAGTTA
ACAGATCAAGGGAAGATAACGGTGACAGCAATCAGCTTCACCGAAGCCGCGCTAAGATGGTACCGATGGGCTGGTTGTAGACAACCTTTTTCTGGATGGAGGGATCTGAA
ACTCCGAATCCTGGAACGATTTAGACCTTACAGGAGGGATCGTTATGTGCCCATTTTCTTGCGCCTTGAGGCCATGATGAAAGCAACCCAGCGCATCGAAGATAAGAACT
CGGCCCTATTGACTAAAACAGGGTCAGGAGGAACCCGATCGTGGAAACCTTCTCCACCAAGAGTTGTTGGGCCAACCAACGCCCTTGGGCCTTCTTATACACCAAAATTG
CTCGAGCCCAACCCAATCAAAACAATTACTCTGCCCAACATTACATCACATCAACGGAAGGAGGCGCCACTTAAGCTTGTATCTGACGATCAAGACGGGAAGGACGAAGC
AATTGGAAATCAAAACCCACGAGAGTGCCAGGTCGAAGAAGTAGAGGAGTCTTCTGCCGAAATGGATACTGCCGAGTTATCCTTAAACACTGTGGTGGGATTTTCATCGC
CAGGATCTGTCTCAAGGATGAACATCAACCAGTTAATGCCCGACCATACAGACCCACACGAGTTCGTTCTCAAGTCTGGTTCTGATGGTAAAAAAGAAGGACGGAATTGG
CGGTTTTGTGTCGACTACAAGGCGCTCAATGAAGCAACAGTGCCTGATAAGTTCCCTATTCCCATTATAGAGGAACTGTTAGACGAGTTACACGGGGCTCGGGTATTTTT
AAAGCTAGACCTTCGGTCGAGATACCACAAAATCCGAGTACACTCCGATGATATCCACAAAACCGCTTTCCGCACGCATGACGACCATTACGAATTTCTTGTAATGCCTT
TCGGCTTGACTAACGCACCATACAATTCATCAACCATAGGAATGGGGAACTTATTTGGTACTGTGACCTTATTGAATGCTCAATATTCAATACAGAAACGTCAGCTCCCA
TCTTTCTTCTTAACCAACAGAATAAGGCTGGAAAATGCACTCGAGCTAGGCTGA
Protein sequenceShow/hide protein sequence
MTSKDDDFVRQWTILLAIVAKALTVSSEVCESFARVPFDRSLALPWPKGMEQSLSKHVCSHGRRDFAKDKCWPWSLVSRVMVMGWRRIGGGYGGMEKRAVILGKTRGMAQ
RQLEDRLAESEKNLEGMKEKLPEIEKSVAELNRSMEKLFNSVEDQRQLSLENQQALANLVHGGFQRKEHRGEEQFHERHKFKKVEMPAFDGDQSDDWLSSAERYLDIHQL
TDQGKITVTAISFTEAALRWYRWAGCRQPFSGWRDLKLRILERFRPYRRDRYVPIFLRLEAMMKATQRIEDKNSALLTKTGSGGTRSWKPSPPRVVGPTNALGPSYTPKL
LEPNPIKTITLPNITSHQRKEAPLKLVSDDQDGKDEAIGNQNPRECQVEEVEESSAEMDTAELSLNTVVGFSSPGSVSRMNINQLMPDHTDPHEFVLKSGSDGKKEGRNW
RFCVDYKALNEATVPDKFPIPIIEELLDELHGARVFLKLDLRSRYHKIRVHSDDIHKTAFRTHDDHYEFLVMPFGLTNAPYNSSTIGMGNLFGTVTLLNAQYSIQKRQLP
SFFLTNRIRLENALELG