; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg037697 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg037697
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationscaffold1:33819334..33824893
RNA-Seq ExpressionSpg037697
SyntenySpg037697
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0050789 - regulation of biological process (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0004518 - nuclease activity (molecular function)
GO:0097159 - organic cyclic compound binding (molecular function)
GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domainsIPR004808 - AP endonuclease 1
IPR008491 - CDK5 regulatory subunit-associated protein 3
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVW25035.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]2.4e-1846.53Show/hide
Query:  LGSWKKRALIEDFIRLHNLSLIILQETKSQAIDRRFVKSLWSSRNIAWASIDAVGASGGIGILWKESSFNVLEVVEGSFSLSIHLSLADGFSFWITGVYG
        LGS KKR ++  F+   N  +++LQETK +  DRRFV S+W+ + + W ++ A GASGGI ILW  S F   E V GSFS+++  +  +  SFW+T VYG
Subjt:  LGSWKKRALIEDFIRLHNLSLIILQETKSQAIDRRFVKSLWSSRNIAWASIDAVGASGGIGILWKESSFNVLEVVEGSFSLSIHLSLADGFSFWITGVYG

Query:  P
        P
Subjt:  P

RVW92839.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]7.4e-2042.86Show/hide
Query:  RTPQQQRKE-LNGSLGSWKKRALIEDFIRLHNLSLIILQETKSQAIDRRFVKSLWSSRNIAWASIDAVGASGGIGILWKESSFNVLEVVEGSFSLSIHLS
        R PQQ+ ++     LGS KKR +++DF+RL    ++++QETK    DRRFV S+W++RN  WA + A GASGGI ++W     +  EVV GSFS+S+  +
Subjt:  RTPQQQRKE-LNGSLGSWKKRALIEDFIRLHNLSLIILQETKSQAIDRRFVKSLWSSRNIAWASIDAVGASGGIGILWKESSFNVLEVVEGSFSLSIHLS

Query:  LADGFSFWITGVYGPEAAS
        +     FW++ VYGP + +
Subjt:  LADGFSFWITGVYGPEAAS

RVX15530.1 putative ribonuclease H protein [Vitis vinifera]1.1e-1847.52Show/hide
Query:  LGSWKKRALIEDFIRLHNLSLIILQETKSQAIDRRFVKSLWSSRNIAWASIDAVGASGGIGILWKESSFNVLEVVEGSFSLSIHLSLADGFSFWITGVYG
        LGS KKR ++  F+   N  +++LQETK +  DRRFV S+W  + + WA++ A GASGGI ILW  S F   E V GSFS+++  +  +  SFW+T VYG
Subjt:  LGSWKKRALIEDFIRLHNLSLIILQETKSQAIDRRFVKSLWSSRNIAWASIDAVGASGGIGILWKESSFNVLEVVEGSFSLSIHLSLADGFSFWITGVYG

Query:  P
        P
Subjt:  P

XP_022145142.1 uncharacterized protein LOC111014657 [Momordica charantia]2.7e-2255.45Show/hide
Query:  LGSWKKRALIEDFIRLHNLSLIILQETKSQAIDRRFVKSLWSSRNIAWASIDAVGASGGIGILWKESSFNVLEVVEGSFSLSIHLSLADGFSFWITGVYG
        LGS  KRA I+D I      ++IL ETKS +I+ +F+KSLWSS +IAWAS+DA GASGGI +LW + S + +EV+ G FS+S+H  LAD F++W+TGVY 
Subjt:  LGSWKKRALIEDFIRLHNLSLIILQETKSQAIDRRFVKSLWSSRNIAWASIDAVGASGGIGILWKESSFNVLEVVEGSFSLSIHLSLADGFSFWITGVYG

Query:  P
        P
Subjt:  P

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia]8.8e-2144.19Show/hide
Query:  LGSWKKRALIEDFIRLHNLSLIILQETKSQAIDRRFVKSLWSSRNIAWASIDAVGASGGIGILWKESSFNVLEVVEGSFSLSIHLSLADGFSFWITGVYG
        L SWKK ALI+ FI   N +++ILQETK   +D   VKSLWS+  I W+++DA G + GI ILW +      E++EG FSL+I+  L+DGF FW++G+YG
Subjt:  LGSWKKRALIEDFIRLHNLSLIILQETKSQAIDRRFVKSLWSSRNIAWASIDAVGASGGIGILWKESSFNVLEVVEGSFSLSIHLSLADGFSFWITGVYG

Query:  PEAASGIIHTVLLEAALTKTRELKKLCES
        P       H +  +  L    +L  LCE+
Subjt:  PEAASGIIHTVLLEAALTKTRELKKLCES

TrEMBL top hitse value%identityAlignment
A0A438CP96 LINE-1 retrotransposable element ORF2 protein1.2e-1846.53Show/hide
Query:  LGSWKKRALIEDFIRLHNLSLIILQETKSQAIDRRFVKSLWSSRNIAWASIDAVGASGGIGILWKESSFNVLEVVEGSFSLSIHLSLADGFSFWITGVYG
        LGS KKR ++  F+   N  +++LQETK +  DRRFV S+W+ + + W ++ A GASGGI ILW  S F   E V GSFS+++  +  +  SFW+T VYG
Subjt:  LGSWKKRALIEDFIRLHNLSLIILQETKSQAIDRRFVKSLWSSRNIAWASIDAVGASGGIGILWKESSFNVLEVVEGSFSLSIHLSLADGFSFWITGVYG

Query:  P
        P
Subjt:  P

A0A438I862 LINE-1 retrotransposable element ORF2 protein3.6e-2042.86Show/hide
Query:  RTPQQQRKE-LNGSLGSWKKRALIEDFIRLHNLSLIILQETKSQAIDRRFVKSLWSSRNIAWASIDAVGASGGIGILWKESSFNVLEVVEGSFSLSIHLS
        R PQQ+ ++     LGS KKR +++DF+RL    ++++QETK    DRRFV S+W++RN  WA + A GASGGI ++W     +  EVV GSFS+S+  +
Subjt:  RTPQQQRKE-LNGSLGSWKKRALIEDFIRLHNLSLIILQETKSQAIDRRFVKSLWSSRNIAWASIDAVGASGGIGILWKESSFNVLEVVEGSFSLSIHLS

Query:  LADGFSFWITGVYGPEAAS
        +     FW++ VYGP + +
Subjt:  LADGFSFWITGVYGPEAAS

A0A438K2W1 Putative ribonuclease H protein5.2e-1947.52Show/hide
Query:  LGSWKKRALIEDFIRLHNLSLIILQETKSQAIDRRFVKSLWSSRNIAWASIDAVGASGGIGILWKESSFNVLEVVEGSFSLSIHLSLADGFSFWITGVYG
        LGS KKR ++  F+   N  +++LQETK +  DRRFV S+W  + + WA++ A GASGGI ILW  S F   E V GSFS+++  +  +  SFW+T VYG
Subjt:  LGSWKKRALIEDFIRLHNLSLIILQETKSQAIDRRFVKSLWSSRNIAWASIDAVGASGGIGILWKESSFNVLEVVEGSFSLSIHLSLADGFSFWITGVYG

Query:  P
        P
Subjt:  P

A0A6J1CVN2 uncharacterized protein LOC1110146571.3e-2255.45Show/hide
Query:  LGSWKKRALIEDFIRLHNLSLIILQETKSQAIDRRFVKSLWSSRNIAWASIDAVGASGGIGILWKESSFNVLEVVEGSFSLSIHLSLADGFSFWITGVYG
        LGS  KRA I+D I      ++IL ETKS +I+ +F+KSLWSS +IAWAS+DA GASGGI +LW + S + +EV+ G FS+S+H  LAD F++W+TGVY 
Subjt:  LGSWKKRALIEDFIRLHNLSLIILQETKSQAIDRRFVKSLWSSRNIAWASIDAVGASGGIGILWKESSFNVLEVVEGSFSLSIHLSLADGFSFWITGVYG

Query:  P
        P
Subjt:  P

A0A6J1E2G6 uncharacterized protein LOC1110254054.2e-2144.19Show/hide
Query:  LGSWKKRALIEDFIRLHNLSLIILQETKSQAIDRRFVKSLWSSRNIAWASIDAVGASGGIGILWKESSFNVLEVVEGSFSLSIHLSLADGFSFWITGVYG
        L SWKK ALI+ FI   N +++ILQETK   +D   VKSLWS+  I W+++DA G + GI ILW +      E++EG FSL+I+  L+DGF FW++G+YG
Subjt:  LGSWKKRALIEDFIRLHNLSLIILQETKSQAIDRRFVKSLWSSRNIAWASIDAVGASGGIGILWKESSFNVLEVVEGSFSLSIHLSLADGFSFWITGVYG

Query:  PEAASGIIHTVLLEAALTKTRELKKLCES
        P       H +  +  L    +L  LCE+
Subjt:  PEAASGIIHTVLLEAALTKTRELKKLCES

SwissProt top hitse value%identityAlignment
Q9FG23 CDK5RAP3-like protein1.7e-1179.07Show/hide
Query:  EAALTKTRELKKLCESTLSSMFDGRPVNIIGEINTVLTSGLSA
        EAAL+KTRELK+LCE++LSSMFDGRPVNI GEINT+L +G+SA
Subjt:  EAALTKTRELKKLCESTLSSMFDGRPVNIIGEINTVLTSGLSA

Arabidopsis top hitse value%identityAlignment
AT5G06830.1 unknown protein1.2e-1279.07Show/hide
Query:  EAALTKTRELKKLCESTLSSMFDGRPVNIIGEINTVLTSGLSA
        EAAL+KTRELK+LCE++LSSMFDGRPVNI GEINT+L +G+SA
Subjt:  EAALTKTRELKKLCESTLSSMFDGRPVNIIGEINTVLTSGLSA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGATGTGTATTATGCCTCTTCCTAATAAACAAAAGAACTCCACAGCAGCAAAGAAAAGAATTAAATGGGAGTCTTGGCTCTTGGAAGAAGAGAGCCCTTATAGAGGA
TTTCATCCGATTACACAATCTGTCCCTCATCATATTGCAAGAAACAAAATCGCAAGCCATTGACAGAAGATTTGTAAAATCTCTGTGGAGTTCTAGAAATATTGCTTGGG
CATCCATTGACGCGGTGGGTGCTTCAGGAGGGATTGGCATCCTCTGGAAAGAATCCTCCTTCAATGTCCTGGAAGTGGTAGAAGGTTCTTTTTCTCTTTCTATTCATCTC
TCCCTCGCTGATGGTTTCTCTTTTTGGATCACAGGAGTTTATGGCCCAGAGGCAGCATCAGGTATAATACATACGGTGCTGCTCGAAGCAGCTTTGACGAAAACTAGGGA
GTTGAAGAAGTTGTGTGAAAGCACTTTATCATCCATGTTTGATGGAAGGCCCGTCAACATTATTGGCGAGATAAATACTGTATTAACAAGTGGTCTCAGTGCGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGATGTGTATTATGCCTCTTCCTAATAAACAAAAGAACTCCACAGCAGCAAAGAAAAGAATTAAATGGGAGTCTTGGCTCTTGGAAGAAGAGAGCCCTTATAGAGGA
TTTCATCCGATTACACAATCTGTCCCTCATCATATTGCAAGAAACAAAATCGCAAGCCATTGACAGAAGATTTGTAAAATCTCTGTGGAGTTCTAGAAATATTGCTTGGG
CATCCATTGACGCGGTGGGTGCTTCAGGAGGGATTGGCATCCTCTGGAAAGAATCCTCCTTCAATGTCCTGGAAGTGGTAGAAGGTTCTTTTTCTCTTTCTATTCATCTC
TCCCTCGCTGATGGTTTCTCTTTTTGGATCACAGGAGTTTATGGCCCAGAGGCAGCATCAGGTATAATACATACGGTGCTGCTCGAAGCAGCTTTGACGAAAACTAGGGA
GTTGAAGAAGTTGTGTGAAAGCACTTTATCATCCATGTTTGATGGAAGGCCCGTCAACATTATTGGCGAGATAAATACTGTATTAACAAGTGGTCTCAGTGCGTAA
Protein sequenceShow/hide protein sequence
MGCVLCLFLINKRTPQQQRKELNGSLGSWKKRALIEDFIRLHNLSLIILQETKSQAIDRRFVKSLWSSRNIAWASIDAVGASGGIGILWKESSFNVLEVVEGSFSLSIHL
SLADGFSFWITGVYGPEAASGIIHTVLLEAALTKTRELKKLCESTLSSMFDGRPVNIIGEINTVLTSGLSA