CuGenDBv2

Lag0006694 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0006694
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: chr6:45002649..45003353
RNA-Seq Expression: Lag0006694
Synteny: Lag0006694
Gene Ontology terms:
    GO:0006281 - DNA repair (biological process)
    GO:0110165 - cellular anatomical structure (cellular component)
    GO:0004518 - nuclease activity (molecular function)
    GO:0140097 - catalytic activity, acting on DNA (molecular function)
InterPro domains:
    IPR004808 - AP endonuclease 1
    IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
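
The genome location field uses a `chromosome:start..end` notation. As a minimal sketch of how such a string could be split into its parts, the Python below parses it and reports the span length. The names `Interval` and `parse_location` are hypothetical helpers introduced here for illustration, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re
from typing import NamedTuple


class Interval(NamedTuple):
    """A genomic interval; coordinates are ASSUMED 1-based and inclusive."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count toward the length.
        return self.end - self.start + 1


def parse_location(text: str) -> Interval:
    """Parse a 'chrom:start..end' string into an Interval (hypothetical helper)."""
    match = re.fullmatch(r"(\S+?):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start {start} is greater than end {end}")
    return Interval(chrom, start, end)


loc = parse_location("chr6:45002649..45003353")
print(loc.chrom, loc.start, loc.end, loc.length)  # chr6 45002649 45003353 705
```

Under the inclusive-coordinate assumption the locus spans 705 bp on chr6.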


Homology

GenBank top hits (e value, % identity, alignment)
RVX11275.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera] | e value: 2.1e-35 | % identity: 42.22
Query:  GPYRLSYGKVDCFLMIFISWNVCGMGSWRKRALIKDFLSSQNPTLIIIQETKMVGINRKIVKSIWSSRSIAWASIDVVGSAGGIVILWNESSFVVKEIVE
        GP    + KV  F M  ISWN  G+GS +KR ++K+FLSS+ P +++IQETK    +R++V S+WS R+  WA++   G++GGI+I+W+      +E+V 
Subjt:  GPYRLSYGKVDCFLMIFISWNVCGMGSWRKRALIKDFLSSQNPTLIIIQETKMVGINRKIVKSIWSSRSIAWASIDVVGSAGGIVILWNESSFVVKEIVE

Query:  GNYSLSIHLSLVDGFSFWITGIYGPNSSKERSMLWKELADLQALCLPNWILGGDFNVIRWTWEKSTYTAPTRAMKKFNRF
        G++S+SI  ++ +  S W++ +YGPN+S  R   W EL+D+  L  P W +GGDFNVIR + EK   +  T  MK F+ F
Subjt:  GNYSLSIHLSLVDGFSFWITGIYGPNSSKERSMLWKELADLQALCLPNWILGGDFNVIRWTWEKSTYTAPTRAMKKFNRF

RVX11280.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera] | e value: 4.7e-35 | % identity: 40.5
Query:  RRRSNGPRSCKTFIPQLITTGPYRLSYGKVDCFLMIFISWNVCGMGSWRKRALIKDFLSSQNPTLIIIQETKMVGINRKIVKSIWSSRSIAWASIDVVGS
        RR+S GP   +T    L   G          CF M  ISWNV G+GS  KR ++KDFL S+NP +++IQETK    +R+ V S+W+ R+  W ++   G+
Subjt:  RRRSNGPRSCKTFIPQLITTGPYRLSYGKVDCFLMIFISWNVCGMGSWRKRALIKDFLSSQNPTLIIIQETKMVGINRKIVKSIWSSRSIAWASIDVVGS

Query:  AGGIVILWNESSFVVKEIVEGNYSLSIHLSLVDGFSFWITGIYGPNSSKERSMLWKELADLQALCLPNWILGGDFNVIRWTWEKSTYTAPTRAMKKFNRF
        +GGI+I+W+  +   +E+V G++S+S+  SL      WI+ +YGPNS   R   W EL D+  L  P W +GGDFNVIR + EK   ++ T +M+ F+ F
Subjt:  AGGIVILWNESSFVVKEIVEGNYSLSIHLSLVDGFSFWITGIYGPNSSKERSMLWKELADLQALCLPNWILGGDFNVIRWTWEKSTYTAPTRAMKKFNRF

XP_022145142.1 uncharacterized protein LOC111014657 [Momordica charantia] | e value: 2.8e-43 | % identity: 48.19
Query:  MIFISWNVCGMGSWRKRALIKDFLSSQNPTLIIIQETKMVGINRKIVKSIWSSRSIAWASIDVVGSAGGIVILWNESSFVVKEIVEGNYSLSIHLSLVDG
        M  ++WNV G+GS  KRA IKD ++S  P ++I+ ETK   IN K +KS+WSS SIAWAS+D  G++GGI++LW++ S    E++ G++S+S+H  L D 
Subjt:  MIFISWNVCGMGSWRKRALIKDFLSSQNPTLIIIQETKMVGINRKIVKSIWSSRSIAWASIDVVGSAGGIVILWNESSFVVKEIVEGNYSLSIHLSLVDG

Query:  FSFWITGIYGPNSSKERSMLWKELADLQALCLPNWILGGDFNVIRWTWEKSTYTAPTRAMKKFNRF
        F++W+TG+Y P   K+R + W+EL DL  LC P W+LG DFN+ RW+ E S+   P   M KFN F
Subjt:  FSFWITGIYGPNSSKERSMLWKELADLQALCLPNWILGGDFNVIRWTWEKSTYTAPTRAMKKFNRF

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia] | e value: 2.8e-43 | % identity: 49.4
Query:  MIFISWNVCGMGSWRKRALIKDFLSSQNPTLIIIQETKMVGINRKIVKSIWSSRSIAWASIDVVGSAGGIVILWNESSFVVKEIVEGNYSLSIHLSLVDG
        M F++WNV G+ SW+K ALIK F+S  NP ++I+QETK+  ++  IVKS+WS+  I W+++D  G A GI+ILWN+      E++EG +SL+I+  L DG
Subjt:  MIFISWNVCGMGSWRKRALIKDFLSSQNPTLIIIQETKMVGINRKIVKSIWSSRSIAWASIDVVGSAGGIVILWNESSFVVKEIVEGNYSLSIHLSLVDG

Query:  FSFWITGIYGPNSSKERSMLWKELADLQALCLPNWILGGDFNVIRWTWEKSTYTAPTRAMKKFNRF
        F FW++GIYGP++++   + W+EL DL  LC  +WIL GDFNV RW+WEKS     T++M  FN F
Subjt:  FSFWITGIYGPNSSKERSMLWKELADLQALCLPNWILGGDFNVIRWTWEKSTYTAPTRAMKKFNRF

XP_031739979.1 uncharacterized protein LOC116403332 [Cucumis sativus] | e value: 1.9e-36 | % identity: 47.62
Query:  RALIKDFLSSQNPTLIIIQETKMVGINRKIVKSIWSSRSIAWASIDVVGSAGGIVILWNESSFVVKEIVEGNYSLSIHLSLVDGFSFWITGIYGPNSSKE
        +A++K+ L   NP ++I+Q++K+  +NR +VKS+WSS  + WA+++  GS+GGI+ILW E S  V + ++G +S+SIH     GFS WITG+YGP+S + 
Subjt:  RALIKDFLSSQNPTLIIIQETKMVGINRKIVKSIWSSRSIAWASIDVVGSAGGIVILWNESSFVVKEIVEGNYSLSIHLSLVDGFSFWITGIYGPNSSKE

Query:  RSMLWKELADLQALCLPNWILGGDFNVIRWTWEKSTYTAPTRAMKKF
        R   W EL+ L  LC  NW +GGDFNV+RW  EKS+   PTR+M +F
Subjt:  RSMLWKELADLQALCLPNWILGGDFNVIRWTWEKSTYTAPTRAMKKF

TrEMBL top hits (e value, % identity, alignment)
A0A438BTW6 LINE-1 retrotransposable element ORF2 protein | e value: 3.9e-35 | % identity: 43.02
Query:  KVDCFLMIFISWNVCGMGSWRKRALIKDFLSSQNPTLIIIQETKMVGINRKIVKSIWSSRSIAWASIDVVGSAGGIVILWNESSFVVKEIVEGNYSLSIH
        +V CF M  ISWNV G+GS  KR ++KDFL S+NP +++IQETK    +R+ V S+W+ R+  W ++   G++GGI+I+W+  +   +E+V G++S+S+ 
Subjt:  KVDCFLMIFISWNVCGMGSWRKRALIKDFLSSQNPTLIIIQETKMVGINRKIVKSIWSSRSIAWASIDVVGSAGGIVILWNESSFVVKEIVEGNYSLSIH

Query:  LSLVDGFSFWITGIYGPNSSKERSMLWKELADLQALCLPNWILGGDFNVIRWTWEKSTYTAPTRAMKKFNRF
         SL      WI+ +YGPNS   R   W EL D+  L  P W +GGDFNVIR + EK   ++ T +M+ F+ F
Subjt:  LSLVDGFSFWITGIYGPNSSKERSMLWKELADLQALCLPNWILGGDFNVIRWTWEKSTYTAPTRAMKKFNRF

A0A438JQP8 LINE-1 retrotransposable element ORF2 protein | e value: 2.3e-35 | % identity: 40.5
Query:  RRRSNGPRSCKTFIPQLITTGPYRLSYGKVDCFLMIFISWNVCGMGSWRKRALIKDFLSSQNPTLIIIQETKMVGINRKIVKSIWSSRSIAWASIDVVGS
        RR+S GP   +T    L   G          CF M  ISWNV G+GS  KR ++KDFL S+NP +++IQETK    +R+ V S+W+ R+  W ++   G+
Subjt:  RRRSNGPRSCKTFIPQLITTGPYRLSYGKVDCFLMIFISWNVCGMGSWRKRALIKDFLSSQNPTLIIIQETKMVGINRKIVKSIWSSRSIAWASIDVVGS

Query:  AGGIVILWNESSFVVKEIVEGNYSLSIHLSLVDGFSFWITGIYGPNSSKERSMLWKELADLQALCLPNWILGGDFNVIRWTWEKSTYTAPTRAMKKFNRF
        +GGI+I+W+  +   +E+V G++S+S+  SL      WI+ +YGPNS   R   W EL D+  L  P W +GGDFNVIR + EK   ++ T +M+ F+ F
Subjt:  AGGIVILWNESSFVVKEIVEGNYSLSIHLSLVDGFSFWITGIYGPNSSKERSMLWKELADLQALCLPNWILGGDFNVIRWTWEKSTYTAPTRAMKKFNRF

A0A438JQQ0 LINE-1 retrotransposable element ORF2 protein | e value: 1.0e-35 | % identity: 42.22
Query:  GPYRLSYGKVDCFLMIFISWNVCGMGSWRKRALIKDFLSSQNPTLIIIQETKMVGINRKIVKSIWSSRSIAWASIDVVGSAGGIVILWNESSFVVKEIVE
        GP    + KV  F M  ISWN  G+GS +KR ++K+FLSS+ P +++IQETK    +R++V S+WS R+  WA++   G++GGI+I+W+      +E+V 
Subjt:  GPYRLSYGKVDCFLMIFISWNVCGMGSWRKRALIKDFLSSQNPTLIIIQETKMVGINRKIVKSIWSSRSIAWASIDVVGSAGGIVILWNESSFVVKEIVE

Query:  GNYSLSIHLSLVDGFSFWITGIYGPNSSKERSMLWKELADLQALCLPNWILGGDFNVIRWTWEKSTYTAPTRAMKKFNRF
        G++S+SI  ++ +  S W++ +YGPN+S  R   W EL+D+  L  P W +GGDFNVIR + EK   +  T  MK F+ F
Subjt:  GNYSLSIHLSLVDGFSFWITGIYGPNSSKERSMLWKELADLQALCLPNWILGGDFNVIRWTWEKSTYTAPTRAMKKFNRF

A0A6J1CVN2 uncharacterized protein LOC111014657 | e value: 1.3e-43 | % identity: 48.19
Query:  MIFISWNVCGMGSWRKRALIKDFLSSQNPTLIIIQETKMVGINRKIVKSIWSSRSIAWASIDVVGSAGGIVILWNESSFVVKEIVEGNYSLSIHLSLVDG
        M  ++WNV G+GS  KRA IKD ++S  P ++I+ ETK   IN K +KS+WSS SIAWAS+D  G++GGI++LW++ S    E++ G++S+S+H  L D 
Subjt:  MIFISWNVCGMGSWRKRALIKDFLSSQNPTLIIIQETKMVGINRKIVKSIWSSRSIAWASIDVVGSAGGIVILWNESSFVVKEIVEGNYSLSIHLSLVDG

Query:  FSFWITGIYGPNSSKERSMLWKELADLQALCLPNWILGGDFNVIRWTWEKSTYTAPTRAMKKFNRF
        F++W+TG+Y P   K+R + W+EL DL  LC P W+LG DFN+ RW+ E S+   P   M KFN F
Subjt:  FSFWITGIYGPNSSKERSMLWKELADLQALCLPNWILGGDFNVIRWTWEKSTYTAPTRAMKKFNRF

A0A6J1E2G6 uncharacterized protein LOC111025405 | e value: 1.3e-43 | % identity: 49.4
Query:  MIFISWNVCGMGSWRKRALIKDFLSSQNPTLIIIQETKMVGINRKIVKSIWSSRSIAWASIDVVGSAGGIVILWNESSFVVKEIVEGNYSLSIHLSLVDG
        M F++WNV G+ SW+K ALIK F+S  NP ++I+QETK+  ++  IVKS+WS+  I W+++D  G A GI+ILWN+      E++EG +SL+I+  L DG
Subjt:  MIFISWNVCGMGSWRKRALIKDFLSSQNPTLIIIQETKMVGINRKIVKSIWSSRSIAWASIDVVGSAGGIVILWNESSFVVKEIVEGNYSLSIHLSLVDG

Query:  FSFWITGIYGPNSSKERSMLWKELADLQALCLPNWILGGDFNVIRWTWEKSTYTAPTRAMKKFNRF
        F FW++GIYGP++++   + W+EL DL  LC  +WIL GDFNV RW+WEKS     T++M  FN F
Subjt:  FSFWITGIYGPNSSKERSMLWKELADLQALCLPNWILGGDFNVIRWTWEKSTYTAPTRAMKKFNRF

SwissProt top hits (e value, % identity, alignment)
P11369 LINE-1 retrotransposable element ORF2 protein | e value: 3.1e-05 | % identity: 29.58
Query:  ISWNVCGMGSWRKRALIKDFLSSQNPTLIIIQETKMVGINRKIVKSIWSSRSIAWASIDVVGSAGGIVILWNESSF---VVKEIVEGNYSLSIHLSLVDG
        IS N+ G+ S  KR  + D+L  Q+PT   +QET +   +R  ++ +   ++I  A+  +   AG  +++ ++  F   V+K+  EG++ L     L + 
Subjt:  ISWNVCGMGSWRKRALIKDFLSSQNPTLIIIQETKMVGINRKIVKSIWSSRSIAWASIDVVGSAGGIVILWNESSF---VVKEIVEGNYSLSIHLSLVDG

Query:  FSFWITGIYGPNSSKERSMLWKELADLQALCLPNWILGGDFN
         S  I  IY PN ++  + +   L  L+A   P+ I+ GDFN
Subjt:  FSFWITGIYGPNSSKERSMLWKELADLQALCLPNWILGGDFN

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences

CDS sequence
ATGGTATGTGTATCATGCCAATGCCAGGAAGGCAAAAGACATCCAACGCCACCAAGAAGAAGATCAAATGGGCCAAGGAGTTGCAAAACCTTCATACCTCAGTTAATTAC
AACAGGACCCTACAGGTTATCTTATGGGAAGGTCGACTGTTTCCTCATGATTTTTATCTCATGGAATGTGTGTGGTATGGGCTCATGGAGAAAGAGAGCCCTTATTAAAG
ATTTTCTCTCCTCTCAGAATCCCACTCTGATTATTATCCAAGAAACAAAGATGGTCGGTATTAATAGGAAGATTGTTAAATCTATATGGAGCTCTAGGAGCATCGCATGG
GCTTCCATTGATGTAGTCGGTTCCGCTGGAGGGATTGTGATCCTTTGGAATGAATCCTCTTTTGTCGTTAAGGAGATTGTTGAAGGTAATTACTCGCTCTCCATCCATCT
CTCTTTAGTTGATGGTTTCTCTTTTTGGATCACAGGAATATATGGCCCCAATTCCTCCAAGGAAAGGAGTATGTTGTGGAAGGAGTTAGCGGATCTACAGGCTCTTTGTC
TTCCCAATTGGATTTTGGGTGGCGACTTTAATGTCATCCGTTGGACTTGGGAAAAATCCACTTACACAGCTCCTACCCGAGCCATGAAGAAATTCAACCGTTTCCAATGG
ATTGCAAGACATACCCCTCACCAATGGCAAGTTCACTTGGTCTAG
mRNA sequence
ATGGTATGTGTATCATGCCAATGCCAGGAAGGCAAAAGACATCCAACGCCACCAAGAAGAAGATCAAATGGGCCAAGGAGTTGCAAAACCTTCATACCTCAGTTAATTAC
AACAGGACCCTACAGGTTATCTTATGGGAAGGTCGACTGTTTCCTCATGATTTTTATCTCATGGAATGTGTGTGGTATGGGCTCATGGAGAAAGAGAGCCCTTATTAAAG
ATTTTCTCTCCTCTCAGAATCCCACTCTGATTATTATCCAAGAAACAAAGATGGTCGGTATTAATAGGAAGATTGTTAAATCTATATGGAGCTCTAGGAGCATCGCATGG
GCTTCCATTGATGTAGTCGGTTCCGCTGGAGGGATTGTGATCCTTTGGAATGAATCCTCTTTTGTCGTTAAGGAGATTGTTGAAGGTAATTACTCGCTCTCCATCCATCT
CTCTTTAGTTGATGGTTTCTCTTTTTGGATCACAGGAATATATGGCCCCAATTCCTCCAAGGAAAGGAGTATGTTGTGGAAGGAGTTAGCGGATCTACAGGCTCTTTGTC
TTCCCAATTGGATTTTGGGTGGCGACTTTAATGTCATCCGTTGGACTTGGGAAAAATCCACTTACACAGCTCCTACCCGAGCCATGAAGAAATTCAACCGTTTCCAATGG
ATTGCAAGACATACCCCTCACCAATGGCAAGTTCACTTGGTCTAG
Protein sequence
MVCVSCQCQEGKRHPTPPRRRSNGPRSCKTFIPQLITTGPYRLSYGKVDCFLMIFISWNVCGMGSWRKRALIKDFLSSQNPTLIIIQETKMVGINRKIVKSIWSSRSIAW
ASIDVVGSAGGIVILWNESSFVVKEIVEGNYSLSIHLSLVDGFSFWITGIYGPNSSKERSMLWKELADLQALCLPNWILGGDFNVIRWTWEKSTYTAPTRAMKKFNRFQW
IARHTPHQWQVHLV
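
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch using the standard genetic code, which is an assumption here because the record does not name the translation table it used. `translate` is a hypothetical helper, not part of any CuGenDB tooling, and the example translates only the first 30 and last 45 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code (assumed), listed in TCAG order for all three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing anything
    other than A, C, G or T are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 45 nt of the CDS sequence shown in this record.
cds_head = "ATGGTATGTGTATCATGCCAATGCCAGGAA"
cds_tail = "ATTGCAAGACATACCCCTCACCAATGGCAAGTTCACTTGGTCTAG"

print(translate(cds_head))  # MVCVSCQCQE
print(translate(cds_tail))  # IARHTPHQWQVHLV
```

Both fragments agree with the start and end of the protein sequence listed above. Passing the full CDS, with its line breaks, to `translate` would allow the same comparison over the whole sequence.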