CuGenDBv2

Lag0035539 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0035539
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: chr3:23706702..23707182
RNA-Seq Expression: Lag0035539
Synteny: Lag0035539
Gene Ontology terms:
  GO:0006281 - DNA repair (biological process)
  GO:0110165 - cellular anatomical structure (cellular component)
  GO:0004518 - nuclease activity (molecular function)
InterPro domains:
  IPR004808 - AP endonuclease 1
  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
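
As a quick sanity check on the genome location field, the Python sketch below parses a `chrom:start..end` string and reports the span length. The names `Location` and `parse_location` are hypothetical, and the 1-based, inclusive coordinate convention is an assumption that this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (not stated in the record).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'chrom:start..end' string such as 'chr3:23706702..23707182'."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end = match.groups()
    return Location(chrom, int(start), int(end))


loc = parse_location("chr3:23706702..23707182")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under that assumption the gene spans 481 bp, which is longer than the 399-nt CDS listed under Sequences. That would be consistent with some non-coding sequence (for example an intron) inside the gene model, although the record itself does not say so.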


Homology
GenBank top hits | E-value | % identity
KAE8647279.1 hypothetical protein Csa_002929 [Cucumis sativus] | 1.5e-14 | 49.41
Query:  MKFLSWNVCGLGSWKKRALIKNTSQQHNPNFVLIQETKRSTITSYLIKSIWSSSHIGWSSLDFVGAAGGILILWSDLDFTIKETI
        MK +SWN  GL S KKRAL+K   QQ NP+ VL++ETK     S++IK IWS   I W++LD +   GG+LI+W   DFT+ E +
Subjt:  MKFLSWNVCGLGSWKKRALIKNTSQQHNPNFVLIQETKRSTITSYLIKSIWSSSHIGWSSLDFVGAAGGILILWSDLDFTIKETI

TYK17414.1 hypothetical protein E5676_scaffold434G002760 [Cucumis melo var. makuwa] | 4.4e-14 | 52.33
Query:  MKFLSWNVCGLGSWKKRALIKNTSQQHNPNFVLIQETKRSTITSYLIKSIWSSSHIGWSSLDFVGAAGGILILWSDLDFTIKETIQ
        MK  SWNV GL  WKKR ++K   QQHNP+ VL+QETK     S+L+KSI S + IGWS +D +  + G LIL S  DFT+ + IQ
Subjt:  MKFLSWNVCGLGSWKKRALIKNTSQQHNPNFVLIQETKRSTITSYLIKSIWSSSHIGWSSLDFVGAAGGILILWSDLDFTIKETIQ

XP_022145142.1 uncharacterized protein LOC111014657 [Momordica charantia] | 1.3e-13 | 50.59
Query:  MKFLSWNVCGLGSWKKRALIKNTSQQHNPNFVLIQETKRSTITSYLIKSIWSSSHIGWSSLDFVGAAGGILILWSDLDFTIKETI
        M  L+WNV GLGS  KRA IK+T     P+ V++ ETK S+I +  IKS+WSS  I W+SLD  GA+GGI++LW  L  +  E I
Subjt:  MKFLSWNVCGLGSWKKRALIKNTSQQHNPNFVLIQETKRSTITSYLIKSIWSSSHIGWSSLDFVGAAGGILILWSDLDFTIKETI

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia] | 1.6e-16 | 54.65
Query:  MKFLSWNVCGLGSWKKRALIKNTSQQHNPNFVLIQETKRSTITSYLIKSIWSSSHIGWSSLDFVGAAGGILILWSDLDFTIKETIQ
        MKFL+WNV GL SWKK ALIK    + NPN V++QETK S +   ++KS+WS+  I WS+LD  G A GILILW+D D    E I+
Subjt:  MKFLSWNVCGLGSWKKRALIKNTSQQHNPNFVLIQETKRSTITSYLIKSIWSSSHIGWSSLDFVGAAGGILILWSDLDFTIKETIQ

XP_028066189.1 uncharacterized protein LOC114269125 [Camellia sinensis] | 5.4e-12 | 44.19
Query:  MKFLSWNVCGLGSWKKRALIKNTSQQHNPNFVLIQETKRSTITSYLIKSIWSSSHIGWSSLDFVGAAGGILILWSDLDFTIKETIQ
        MK +SWN+ GLG  +KR  +KN   QH  +F+L+QETK +TIT++LI SIW  +  G++++   G++GGI+ +W    F++ E IQ
Subjt:  MKFLSWNVCGLGSWKKRALIKNTSQQHNPNFVLIQETKRSTITSYLIKSIWSSSHIGWSSLDFVGAAGGILILWSDLDFTIKETIQ
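
The % identity values can be cross-checked against the alignment midlines. The sketch below is a minimal illustration, assuming the usual BLAST midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap). The helper name `percent_identity` is hypothetical. It works from the midline alone because the Subjt rows shown on this page repeat the query string and so cannot be compared against the query directly.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Percent identity from a BLAST midline.

    Letters in the midline mark identical residues; '+' marks a
    conservative substitution and a space marks a mismatch or gap.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identities / aligned_length, 2)


# First GenBank hit (KAE8647279.1); the query row contains no gap characters,
# so its length is taken as the aligned length.
query = "MKFLSWNVCGLGSWKKRALIKNTSQQHNPNFVLIQETKRSTITSYLIKSIWSSSHIGWSSLDFVGAAGGILILWSDLDFTIKETI"
midline = "MK +SWN  GL S KKRAL+K   QQ NP+ VL++ETK     S++IK IWS   I W++LD +   GG+LI+W   DFT+ E +"

print(percent_identity(midline, len(query)))
```

For alignments that contain gaps (shown as `-` in the Query or Subjt rows), the aligned length should include the gap columns.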

TrEMBL top hits | E-value | % identity
A0A0A0KDG4 Uncharacterized protein | 7.4e-15 | 49.41
Query:  MKFLSWNVCGLGSWKKRALIKNTSQQHNPNFVLIQETKRSTITSYLIKSIWSSSHIGWSSLDFVGAAGGILILWSDLDFTIKETI
        MK +SWN  GL S KKRAL+K   QQ NP+ VL++ETK     S++IK IWS   I W++LD +   GG+LI+W   DFT+ E +
Subjt:  MKFLSWNVCGLGSWKKRALIKNTSQQHNPNFVLIQETKRSTITSYLIKSIWSSSHIGWSSLDFVGAAGGILILWSDLDFTIKETI

A0A438C8T7 DUF4283 domain-containing protein | 2.6e-12 | 39.42
Query:  MKFLSWNVCGLGSWKKRALIKNTSQQHNPNFVLIQETKRSTITSYLIKSIWSSSHIGWSSLDFVGAAGGILILWSDLDFTIKETI----------QSDMR
        MK LSWN  GLGS KKR  ++      NP+ V++QETKR      L+ SIW    + W +L   GA+GGI+ILW  + F   E +           SD  
Subjt:  MKFLSWNVCGLGSWKKRALIKNTSQQHNPNFVLIQETKRSTITSYLIKSIWSSSHIGWSSLDFVGAAGGILILWSDLDFTIKETI----------QSDMR

Query:  DEFW
        + FW
Subjt:  DEFW

A0A5D3D2A8 Uncharacterized protein | 2.1e-14 | 52.33
Query:  MKFLSWNVCGLGSWKKRALIKNTSQQHNPNFVLIQETKRSTITSYLIKSIWSSSHIGWSSLDFVGAAGGILILWSDLDFTIKETIQ
        MK  SWNV GL  WKKR ++K   QQHNP+ VL+QETK     S+L+KSI S + IGWS +D +  + G LIL S  DFT+ + IQ
Subjt:  MKFLSWNVCGLGSWKKRALIKNTSQQHNPNFVLIQETKRSTITSYLIKSIWSSSHIGWSSLDFVGAAGGILILWSDLDFTIKETIQ

A0A6J1CVN2 uncharacterized protein LOC111014657 | 6.2e-14 | 50.59
Query:  MKFLSWNVCGLGSWKKRALIKNTSQQHNPNFVLIQETKRSTITSYLIKSIWSSSHIGWSSLDFVGAAGGILILWSDLDFTIKETI
        M  L+WNV GLGS  KRA IK+T     P+ V++ ETK S+I +  IKS+WSS  I W+SLD  GA+GGI++LW  L  +  E I
Subjt:  MKFLSWNVCGLGSWKKRALIKNTSQQHNPNFVLIQETKRSTITSYLIKSIWSSSHIGWSSLDFVGAAGGILILWSDLDFTIKETI

A0A6J1E2G6 uncharacterized protein LOC111025405 | 7.9e-17 | 54.65
Query:  MKFLSWNVCGLGSWKKRALIKNTSQQHNPNFVLIQETKRSTITSYLIKSIWSSSHIGWSSLDFVGAAGGILILWSDLDFTIKETIQ
        MKFL+WNV GL SWKK ALIK    + NPN V++QETK S +   ++KS+WS+  I WS+LD  G A GILILW+D D    E I+
Subjt:  MKFLSWNVCGLGSWKKRALIKNTSQQHNPNFVLIQETKRSTITSYLIKSIWSSSHIGWSSLDFVGAAGGILILWSDLDFTIKETIQ

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGAAATTCCTCTCATGGAACGTATGTGGTTTGGGTTCATGGAAGAAAAGGGCTTTAATTAAGAATACCAGTCAACAGCATAATCCAAATTTTGTTCTTATTCAGGAAAC
CAAGAGATCAACTATCACTAGTTACCTGATTAAATCTATTTGGAGCTCTTCTCACATTGGTTGGAGTTCACTAGATTTCGTGGGAGCAGCAGGAGGTATCCTGATTCTTT
GGAGTGACCTAGATTTCACTATCAAAGAAACAATTCAGAGTGATATGCGGGATGAATTCTGGCAAGATTACATGACTTGGCAGGCCTGGGAAGAGATAGATGGATTCTTG
GAGGCGACTTCAATGTCACTTGTTGGTCTTGGGAGAAATCTAGCGATCAACCAGTCACTAGAAGCATGA
mRNA sequence
ATGAAATTCCTCTCATGGAACGTATGTGGTTTGGGTTCATGGAAGAAAAGGGCTTTAATTAAGAATACCAGTCAACAGCATAATCCAAATTTTGTTCTTATTCAGGAAAC
CAAGAGATCAACTATCACTAGTTACCTGATTAAATCTATTTGGAGCTCTTCTCACATTGGTTGGAGTTCACTAGATTTCGTGGGAGCAGCAGGAGGTATCCTGATTCTTT
GGAGTGACCTAGATTTCACTATCAAAGAAACAATTCAGAGTGATATGCGGGATGAATTCTGGCAAGATTACATGACTTGGCAGGCCTGGGAAGAGATAGATGGATTCTTG
GAGGCGACTTCAATGTCACTTGTTGGTCTTGGGAGAAATCTAGCGATCAACCAGTCACTAGAAGCATGA
Protein sequence
MKFLSWNVCGLGSWKKRALIKNTSQQHNPNFVLIQETKRSTITSYLIKSIWSSSHIGWSSLDFVGAAGGILILWSDLDFTIKETIQSDMRDEFWQDYMTWQAWEEIDGFL
EATSMSLVGLGRNLAINQSLEA
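
To confirm that the CDS and protein sequences above agree, the sketch below translates the CDS and compares the result with the listed protein. It assumes the standard nuclear genetic code (NCBI translation table 1), which the page does not state explicitly. The helper name `translate` is hypothetical.

```python
# Standard genetic code (NCBI table 1), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Sequences copied from the record for Lag0035539.
CDS = (
    "ATGAAATTCCTCTCATGGAACGTATGTGGTTTGGGTTCATGGAAGAAAAGGGCTTTAATTAAGAATACCAGTCAACAGCATAATCCAAATTTTGTTCTTATTCAGGAAAC"
    "CAAGAGATCAACTATCACTAGTTACCTGATTAAATCTATTTGGAGCTCTTCTCACATTGGTTGGAGTTCACTAGATTTCGTGGGAGCAGCAGGAGGTATCCTGATTCTTT"
    "GGAGTGACCTAGATTTCACTATCAAAGAAACAATTCAGAGTGATATGCGGGATGAATTCTGGCAAGATTACATGACTTGGCAGGCCTGGGAAGAGATAGATGGATTCTTG"
    "GAGGCGACTTCAATGTCACTTGTTGGTCTTGGGAGAAATCTAGCGATCAACCAGTCACTAGAAGCATGA"
)
PROTEIN = (
    "MKFLSWNVCGLGSWKKRALIKNTSQQHNPNFVLIQETKRSTITSYLIKSIWSSSHIGWSSLDFVGAAGGILILWSDLDFTIKETIQSDMRDEFWQDYMTWQAWEEIDGFL"
    "EATSMSLVGLGRNLAINQSLEA"
)

print(len(CDS), len(PROTEIN), translate(CDS) == PROTEIN)
```

The CDS is 399 nt long (132 codons plus a TGA stop codon), matching the 132-residue protein.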