; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0017271 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0017271
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTransposon TX1 uncharacterized 149 kDa protein
Genome locationchr5:1601554..1602502
RNA-Seq ExpressionLag0017271
SyntenyLag0017271
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR005135 - Endonuclease/exonuclease/phosphatase
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN78049.1 hypothetical protein VITISV_015861 [Vitis vinifera]9.1e-4637.77Show/hide
Query:  NPSVVILQETKTSSIDRGFIKSIWSSRFIGWSSIDAIGSSGGILVMWNEIILNIIEVVKGSYSLTLNLSLADGYNLWITGVYGPNSSSERKWFWQEMFDL
        NP VV+LQETK    DR  + S+W  + + W ++ A  +SGGI+++W+ I     E V GS+S+T+ L+  +    W+T VYGPN +  RK FW E+ DL
Subjt:  NPSVVILQETKTSSIDRGFIKSIWSSRFIGWSSIDAIGSSGGILVMWNEIILNIIEVVKGSYSLTLNLSLADGYNLWITGVYGPNSSSERKWFWQEMFDL

Query:  SSLCDPNWILGGDFNITRWSWEKSNQRLPTKGMKNFNKFIDMVELLDIPLQHGKYTWTSSRANSLIDRF----LITDNCTQKFGNAIMLQNWWNNHPLEG
          L  P W +GGDFN+ R   EK      T  M+ F++FI    LLD PL++  +TW++ + + +  RF    L+     +KF      ++WW    +EG
Subjt:  SSLCDPNWILGGDFNITRWSWEKSNQRLPTKGMKNFNKFIDMVELLDIPLQHGKYTWTSSRANSLIDRF----LITDNCTQKFGNAIMLQNWWNNHPLEG

Query:  WPGHDFMKKLKAFKPFIKEWNINTFGNKDSVKHDLMNELNDIDAKEESGSLD-DYTSKRRLSIKVDLLTLAARDDALW
        W GH FM+KLK  K  +KEWNI  FG+    K  ++ +L  ID  E+ G+L+ D  S+R L  K +L  L  +++  W
Subjt:  WPGHDFMKKLKAFKPFIKEWNINTFGNKDSVKHDLMNELNDIDAKEESGSLD-DYTSKRRLSIKVDLLTLAARDDALW

RVW94236.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]9.1e-4634.59Show/hide
Query:  NPSVVILQETKTSSIDRGFIKSIWSSRFIGWSSIDAIGSSGGILVMWNEIILNIIEVVKGSYSLTLNLSLADGYNLWITGVYGPNSSSERKWFWQEMFDL
        NP VV++QETK  + DR F+ S+W+ R   W ++ A G+SGGIL++W+   L+  EVV GS+S+++  SL     LWI+ VYGPNS S RK FW E+FD+
Subjt:  NPSVVILQETKTSSIDRGFIKSIWSSRFIGWSSIDAIGSSGGILVMWNEIILNIIEVVKGSYSLTLNLSLADGYNLWITGVYGPNSSSERKWFWQEMFDL

Query:  SSLCDPNWILGGDFNITRWSWEKSNQRLPTKGMKNFNKFIDMVELLDIPLQHGKYTWTSSRANSL---IDRFLITDNCTQKFGNAIM-------------
          L  P W +GGDFN+ R S EK      T  M++F+ FI   ELLD PL++  +TW++ + + +   +DRFL ++     F   +              
Subjt:  SSLCDPNWILGGDFNITRWSWEKSNQRLPTKGMKNFNKFIDMVELLDIPLQHGKYTWTSSRANSL---IDRFLITDNCTQKFGNAIM-------------

Query:  -----------------------------LQNWWNNHPLEGWPGHDFMKKLKAFKPFIKEWNINTFGNKDSVKHDLMNELNDIDAKEESGSLDDYTSKRR
                                      ++WW+     GW GH FM++L+  K  +KEWN  +FG     K  ++N+L + DA E+ G L+     +R
Subjt:  -----------------------------LQNWWNNHPLEGWPGHDFMKKLKAFKPFIKEWNINTFGNKDSVKHDLMNELNDIDAKEESGSLDDYTSKRR

Query:  LSIKVDLLTLAARDDALW
         S K +L  L  R++  W
Subjt:  LSIKVDLLTLAARDDALW

RVX05281.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]8.2e-4738.83Show/hide
Query:  NPSVVILQETKTSSIDRGFIKSIWSSRFIGWSSIDAIGSSGGILVMWNEIILNIIEVVKGSYSLTLNLSLADGYNLWITGVYGPNSSSERKWFWQEMFDL
        NP VV++QETK    DR F+ S+W+ R   W ++ A G+S GIL++W+  IL+  EVV  S+S+++  SL     LWI+ VYGPNS S RK FW E+FD+
Subjt:  NPSVVILQETKTSSIDRGFIKSIWSSRFIGWSSIDAIGSSGGILVMWNEIILNIIEVVKGSYSLTLNLSLADGYNLWITGVYGPNSSSERKWFWQEMFDL

Query:  SSLCDPNWILGGDFNITRWSWEKSNQRLPTKGMKNFNKFIDMVELLDIPLQHGKYTWTSSRANSLIDRFLITDNCTQKFGNAIMLQNWWNNHPLEGWPGH
          L  P W +GGDFN+ R S EK      T  M++F+ FI   ELLD PL++  +TW++ + + +  R      C          ++WW+     GW GH
Subjt:  SSLCDPNWILGGDFNITRWSWEKSNQRLPTKGMKNFNKFIDMVELLDIPLQHGKYTWTSSRANSLIDRFLITDNCTQKFGNAIMLQNWWNNHPLEGWPGH

Query:  DFMKKLKAFKPFIKEWNINTFGNKDSVKHDLMNELNDIDAKEESGSLDDYTSKRRLSIKVDLLTLAARDDALW
         FM++L+  K  +KEWN  +FG     K  ++N+L   DA E+ G L+     +R S K +L  L  R++  W
Subjt:  DFMKKLKAFKPFIKEWNINTFGNKDSVKHDLMNELNDIDAKEESGSLDDYTSKRRLSIKVDLLTLAARDDALW

XP_022145142.1 uncharacterized protein LOC111014657 [Momordica charantia]2.8e-4737.33Show/hide
Query:  PSVVILQETKTSSIDRGFIKSIWSSRFIGWSSIDAIGSSGGILVMWNEIILNIIEVVKGSYSLTLNLSLADGYNLWITGVYGPNSSSERKWFWQEMFDLS
        P +VIL ETK+SSI+  FIKS+WSS  I W+S+DA G+SGGI+++W+++  + +EV+ G +S++++  LAD +  W+TGVY P    +RK FWQE+FDL+
Subjt:  PSVVILQETKTSSIDRGFIKSIWSSRFIGWSSIDAIGSSGGILVMWNEIILNIIEVVKGSYSLTLNLSLADGYNLWITGVYGPNSSSERKWFWQEMFDLS

Query:  SLCDPNWILGGDFNITRWSWEKSNQRLPTKGMKNFNKFIDMVELLDIPLQHGKYTWTSSRAN---SLIDRFLITDNCTQKFGN--------------AIM
         LC P W+LG DFNI RWS E S+   P  GM  FN FID   L+D  + +G+YTW++ R +   S I+RFL +   + KF +               I+
Subjt:  SLCDPNWILGGDFNITRWSWEKSNQRLPTKGMKNFNKFIDMVELLDIPLQHGKYTWTSSRAN---SLIDRFLITDNCTQKFGN--------------AIM

Query:  L----QNW------------------------WNNHPLEGWPGHDFMKKLKAFKPFIKEWNINTFGNKDSVKHDLMNELNDIDAKEESGSLD
        L    Q W                        W++   +G+ G+  +KKL      IK    N   N    K  +M ++  ID +EE+G +D
Subjt:  L----QNW------------------------WNNHPLEGWPGHDFMKKLKAFKPFIKEWNINTFGNKDSVKHDLMNELNDIDAKEESGSLD

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia]1.2e-6641.46Show/hide
Query:  MNPSVVILQETKTSSIDRGFIKSIWSSRFIGWSSIDAIGSSGGILVMWNEIILNIIEVVKGSYSLTLNLSLADGYNLWITGVYGPNSSSERKWFWQEMFD
        +NP+VVILQETK S +D   +KS+WS+  I WS++DA G + GIL++WN+  L   E+++G +SLT+N  L+DG+  W++G+YGP+++     FWQE+ D
Subjt:  MNPSVVILQETKTSSIDRGFIKSIWSSRFIGWSSIDAIGSSGGILVMWNEIILNIIEVVKGSYSLTLNLSLADGYNLWITGVYGPNSSSERKWFWQEMFD

Query:  LSSLCDPNWILGGDFNITRWSWEKSNQRLPTKGMKNFNKFIDMVELLDIPLQHGKYTWTSSRANSLIDRFLITDNCTQKFGNAI----------------
        LS LC+ +WIL GDFN+TRWSWEKSN R  TK M  FN FI+   L+D+PL +G++TW+ + + SLID FL+T+ C  K G  I                
Subjt:  LSSLCDPNWILGGDFNITRWSWEKSNQRLPTKGMKNFNKFIDMVELLDIPLQHGKYTWTSSRANSLIDRFLITDNCTQKFGNAI----------------

Query:  --------------------------MLQNWWNNHPLEGWPGHDFMKKLKAFKPFIKEWNINTFGNKDSVKHDLMNELNDIDAKEESGSLDDYTSKRRLS
                                   L+ WW N PL GWPGH  M KLK+ K  IK W    F    S K DL N +N +D  E S  +    S+ R+ 
Subjt:  --------------------------MLQNWWNNHPLEGWPGHDFMKKLKAFKPFIKEWNINTFGNKDSVKHDLMNELNDIDAKEESGSLDDYTSKRRLS

Query:  IKVDLLTLAARDDALW
         K DLL++ A+++A W
Subjt:  IKVDLLTLAARDDALW

TrEMBL top hitse value%identityAlignment
A0A438IBZ1 LINE-1 retrotransposable element ORF2 protein4.4e-4634.59Show/hide
Query:  NPSVVILQETKTSSIDRGFIKSIWSSRFIGWSSIDAIGSSGGILVMWNEIILNIIEVVKGSYSLTLNLSLADGYNLWITGVYGPNSSSERKWFWQEMFDL
        NP VV++QETK  + DR F+ S+W+ R   W ++ A G+SGGIL++W+   L+  EVV GS+S+++  SL     LWI+ VYGPNS S RK FW E+FD+
Subjt:  NPSVVILQETKTSSIDRGFIKSIWSSRFIGWSSIDAIGSSGGILVMWNEIILNIIEVVKGSYSLTLNLSLADGYNLWITGVYGPNSSSERKWFWQEMFDL

Query:  SSLCDPNWILGGDFNITRWSWEKSNQRLPTKGMKNFNKFIDMVELLDIPLQHGKYTWTSSRANSL---IDRFLITDNCTQKFGNAIM-------------
          L  P W +GGDFN+ R S EK      T  M++F+ FI   ELLD PL++  +TW++ + + +   +DRFL ++     F   +              
Subjt:  SSLCDPNWILGGDFNITRWSWEKSNQRLPTKGMKNFNKFIDMVELLDIPLQHGKYTWTSSRANSL---IDRFLITDNCTQKFGNAIM-------------

Query:  -----------------------------LQNWWNNHPLEGWPGHDFMKKLKAFKPFIKEWNINTFGNKDSVKHDLMNELNDIDAKEESGSLDDYTSKRR
                                      ++WW+     GW GH FM++L+  K  +KEWN  +FG     K  ++N+L + DA E+ G L+     +R
Subjt:  -----------------------------LQNWWNNHPLEGWPGHDFMKKLKAFKPFIKEWNINTFGNKDSVKHDLMNELNDIDAKEESGSLDDYTSKRR

Query:  LSIKVDLLTLAARDDALW
         S K +L  L  R++  W
Subjt:  LSIKVDLLTLAARDDALW

A0A438J8L0 Transposon TX1 uncharacterized 149 kDa protein4.0e-4738.83Show/hide
Query:  NPSVVILQETKTSSIDRGFIKSIWSSRFIGWSSIDAIGSSGGILVMWNEIILNIIEVVKGSYSLTLNLSLADGYNLWITGVYGPNSSSERKWFWQEMFDL
        NP VV++QETK    DR F+ S+W+ R   W ++ A G+S GIL++W+  IL+  EVV  S+S+++  SL     LWI+ VYGPNS S RK FW E+FD+
Subjt:  NPSVVILQETKTSSIDRGFIKSIWSSRFIGWSSIDAIGSSGGILVMWNEIILNIIEVVKGSYSLTLNLSLADGYNLWITGVYGPNSSSERKWFWQEMFDL

Query:  SSLCDPNWILGGDFNITRWSWEKSNQRLPTKGMKNFNKFIDMVELLDIPLQHGKYTWTSSRANSLIDRFLITDNCTQKFGNAIMLQNWWNNHPLEGWPGH
          L  P W +GGDFN+ R S EK      T  M++F+ FI   ELLD PL++  +TW++ + + +  R      C          ++WW+     GW GH
Subjt:  SSLCDPNWILGGDFNITRWSWEKSNQRLPTKGMKNFNKFIDMVELLDIPLQHGKYTWTSSRANSLIDRFLITDNCTQKFGNAIMLQNWWNNHPLEGWPGH

Query:  DFMKKLKAFKPFIKEWNINTFGNKDSVKHDLMNELNDIDAKEESGSLDDYTSKRRLSIKVDLLTLAARDDALW
         FM++L+  K  +KEWN  +FG     K  ++N+L   DA E+ G L+     +R S K +L  L  R++  W
Subjt:  DFMKKLKAFKPFIKEWNINTFGNKDSVKHDLMNELNDIDAKEESGSLDDYTSKRRLSIKVDLLTLAARDDALW

A0A6J1CVN2 uncharacterized protein LOC1110146571.4e-4737.33Show/hide
Query:  PSVVILQETKTSSIDRGFIKSIWSSRFIGWSSIDAIGSSGGILVMWNEIILNIIEVVKGSYSLTLNLSLADGYNLWITGVYGPNSSSERKWFWQEMFDLS
        P +VIL ETK+SSI+  FIKS+WSS  I W+S+DA G+SGGI+++W+++  + +EV+ G +S++++  LAD +  W+TGVY P    +RK FWQE+FDL+
Subjt:  PSVVILQETKTSSIDRGFIKSIWSSRFIGWSSIDAIGSSGGILVMWNEIILNIIEVVKGSYSLTLNLSLADGYNLWITGVYGPNSSSERKWFWQEMFDLS

Query:  SLCDPNWILGGDFNITRWSWEKSNQRLPTKGMKNFNKFIDMVELLDIPLQHGKYTWTSSRAN---SLIDRFLITDNCTQKFGN--------------AIM
         LC P W+LG DFNI RWS E S+   P  GM  FN FID   L+D  + +G+YTW++ R +   S I+RFL +   + KF +               I+
Subjt:  SLCDPNWILGGDFNITRWSWEKSNQRLPTKGMKNFNKFIDMVELLDIPLQHGKYTWTSSRAN---SLIDRFLITDNCTQKFGN--------------AIM

Query:  L----QNW------------------------WNNHPLEGWPGHDFMKKLKAFKPFIKEWNINTFGNKDSVKHDLMNELNDIDAKEESGSLD
        L    Q W                        W++   +G+ G+  +KKL      IK    N   N    K  +M ++  ID +EE+G +D
Subjt:  L----QNW------------------------WNNHPLEGWPGHDFMKKLKAFKPFIKEWNINTFGNKDSVKHDLMNELNDIDAKEESGSLD

A0A6J1E2G6 uncharacterized protein LOC1110254055.9e-6741.46Show/hide
Query:  MNPSVVILQETKTSSIDRGFIKSIWSSRFIGWSSIDAIGSSGGILVMWNEIILNIIEVVKGSYSLTLNLSLADGYNLWITGVYGPNSSSERKWFWQEMFD
        +NP+VVILQETK S +D   +KS+WS+  I WS++DA G + GIL++WN+  L   E+++G +SLT+N  L+DG+  W++G+YGP+++     FWQE+ D
Subjt:  MNPSVVILQETKTSSIDRGFIKSIWSSRFIGWSSIDAIGSSGGILVMWNEIILNIIEVVKGSYSLTLNLSLADGYNLWITGVYGPNSSSERKWFWQEMFD

Query:  LSSLCDPNWILGGDFNITRWSWEKSNQRLPTKGMKNFNKFIDMVELLDIPLQHGKYTWTSSRANSLIDRFLITDNCTQKFGNAI----------------
        LS LC+ +WIL GDFN+TRWSWEKSN R  TK M  FN FI+   L+D+PL +G++TW+ + + SLID FL+T+ C  K G  I                
Subjt:  LSSLCDPNWILGGDFNITRWSWEKSNQRLPTKGMKNFNKFIDMVELLDIPLQHGKYTWTSSRANSLIDRFLITDNCTQKFGNAI----------------

Query:  --------------------------MLQNWWNNHPLEGWPGHDFMKKLKAFKPFIKEWNINTFGNKDSVKHDLMNELNDIDAKEESGSLDDYTSKRRLS
                                   L+ WW N PL GWPGH  M KLK+ K  IK W    F    S K DL N +N +D  E S  +    S+ R+ 
Subjt:  --------------------------MLQNWWNNHPLEGWPGHDFMKKLKAFKPFIKEWNINTFGNKDSVKHDLMNELNDIDAKEESGSLDDYTSKRRLS

Query:  IKVDLLTLAARDDALW
         K DLL++ A+++A W
Subjt:  IKVDLLTLAARDDALW

A5ALP8 Reverse transcriptase domain-containing protein4.4e-4637.77Show/hide
Query:  NPSVVILQETKTSSIDRGFIKSIWSSRFIGWSSIDAIGSSGGILVMWNEIILNIIEVVKGSYSLTLNLSLADGYNLWITGVYGPNSSSERKWFWQEMFDL
        NP VV+LQETK    DR  + S+W  + + W ++ A  +SGGI+++W+ I     E V GS+S+T+ L+  +    W+T VYGPN +  RK FW E+ DL
Subjt:  NPSVVILQETKTSSIDRGFIKSIWSSRFIGWSSIDAIGSSGGILVMWNEIILNIIEVVKGSYSLTLNLSLADGYNLWITGVYGPNSSSERKWFWQEMFDL

Query:  SSLCDPNWILGGDFNITRWSWEKSNQRLPTKGMKNFNKFIDMVELLDIPLQHGKYTWTSSRANSLIDRF----LITDNCTQKFGNAIMLQNWWNNHPLEG
          L  P W +GGDFN+ R   EK      T  M+ F++FI    LLD PL++  +TW++ + + +  RF    L+     +KF      ++WW    +EG
Subjt:  SSLCDPNWILGGDFNITRWSWEKSNQRLPTKGMKNFNKFIDMVELLDIPLQHGKYTWTSSRANSLIDRF----LITDNCTQKFGNAIMLQNWWNNHPLEG

Query:  WPGHDFMKKLKAFKPFIKEWNINTFGNKDSVKHDLMNELNDIDAKEESGSLD-DYTSKRRLSIKVDLLTLAARDDALW
        W GH FM+KLK  K  +KEWNI  FG+    K  ++ +L  ID  E+ G+L+ D  S+R L  K +L  L  +++  W
Subjt:  WPGHDFMKKLKAFKPFIKEWNINTFGNKDSVKHDLMNELNDIDAKEESGSLD-DYTSKRRLSIKVDLLTLAARDDALW

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCCATCAGTTGTCATCCTTCAAGAAACAAAAACATCATCTATTGACAGGGGTTTTATCAAATCCATATGGAGCTCTCGCTTTATTGGATGGTCCTCCATTGACGC
TATTGGATCATCGGGTGGCATTCTCGTTATGTGGAATGAAATTATCCTTAACATTATCGAGGTGGTTAAAGGTTCTTACTCTCTCACTTTGAATCTATCTTTGGCTGATG
GTTATAATTTATGGATTACAGGTGTATATGGTCCCAATTCTTCTTCAGAGAGAAAATGGTTCTGGCAGGAGATGTTTGACCTCTCAAGTTTGTGTGATCCAAACTGGATT
TTGGGGGGTGATTTCAACATCACAAGATGGTCTTGGGAAAAATCCAATCAGAGGCTTCCTACCAAAGGCATGAAAAATTTCAACAAATTTATAGATATGGTGGAGCTTCT
GGACATCCCGTTACAACATGGTAAATACACATGGACTAGTAGCCGGGCAAATTCCCTCATTGATCGATTCTTGATTACAGACAACTGCACTCAGAAATTCGGTAATGCTA
TTATGCTTCAGAATTGGTGGAACAATCACCCATTGGAAGGTTGGCCAGGTCACGATTTTATGAAAAAGCTCAAAGCCTTCAAACCTTTTATCAAAGAGTGGAATATCAAC
ACCTTTGGTAATAAGGATTCTGTCAAGCATGATCTAATGAACGAGCTTAATGACATTGATGCCAAGGAAGAGTCGGGTTCATTGGATGATTATACGTCCAAGCGTAGGCT
ATCCATAAAAGTCGACCTTTTAACCTTGGCAGCCCGAGATGATGCTCTGTGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAATCCATCAGTTGTCATCCTTCAAGAAACAAAAACATCATCTATTGACAGGGGTTTTATCAAATCCATATGGAGCTCTCGCTTTATTGGATGGTCCTCCATTGACGC
TATTGGATCATCGGGTGGCATTCTCGTTATGTGGAATGAAATTATCCTTAACATTATCGAGGTGGTTAAAGGTTCTTACTCTCTCACTTTGAATCTATCTTTGGCTGATG
GTTATAATTTATGGATTACAGGTGTATATGGTCCCAATTCTTCTTCAGAGAGAAAATGGTTCTGGCAGGAGATGTTTGACCTCTCAAGTTTGTGTGATCCAAACTGGATT
TTGGGGGGTGATTTCAACATCACAAGATGGTCTTGGGAAAAATCCAATCAGAGGCTTCCTACCAAAGGCATGAAAAATTTCAACAAATTTATAGATATGGTGGAGCTTCT
GGACATCCCGTTACAACATGGTAAATACACATGGACTAGTAGCCGGGCAAATTCCCTCATTGATCGATTCTTGATTACAGACAACTGCACTCAGAAATTCGGTAATGCTA
TTATGCTTCAGAATTGGTGGAACAATCACCCATTGGAAGGTTGGCCAGGTCACGATTTTATGAAAAAGCTCAAAGCCTTCAAACCTTTTATCAAAGAGTGGAATATCAAC
ACCTTTGGTAATAAGGATTCTGTCAAGCATGATCTAATGAACGAGCTTAATGACATTGATGCCAAGGAAGAGTCGGGTTCATTGGATGATTATACGTCCAAGCGTAGGCT
ATCCATAAAAGTCGACCTTTTAACCTTGGCAGCCCGAGATGATGCTCTGTGGTGA
Protein sequenceShow/hide protein sequence
MNPSVVILQETKTSSIDRGFIKSIWSSRFIGWSSIDAIGSSGGILVMWNEIILNIIEVVKGSYSLTLNLSLADGYNLWITGVYGPNSSSERKWFWQEMFDLSSLCDPNWI
LGGDFNITRWSWEKSNQRLPTKGMKNFNKFIDMVELLDIPLQHGKYTWTSSRANSLIDRFLITDNCTQKFGNAIMLQNWWNNHPLEGWPGHDFMKKLKAFKPFIKEWNIN
TFGNKDSVKHDLMNELNDIDAKEESGSLDDYTSKRRLSIKVDLLTLAARDDALW