; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0012114 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0012114
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTransposon TX1 uncharacterized 149 kDa protein
Genome locationchr1:37487287..37488188
RNA-Seq ExpressionLag0012114
SyntenyLag0012114
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0004518 - nuclease activity (molecular function)
InterPro domainsIPR004808 - AP endonuclease 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVW25035.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]1.6e-1540.95Show/hide
Query:  QETKLAKVDRRLVKSIWSSRHVAWLSLDANNAAGGIILMWKENHIDVVNSFLGAFSIIIQCMFQRQKEGWITGVYGPCDYQGRKRFLLELYDLQGLCQGI
        QETK    DRR V S+W+ + V W++L A  A+GGI+++W  +  +     LG+FS+ ++     +   W+T VYGP +   RK F LEL DL GL    
Subjt:  QETKLAKVDRRLVKSIWSSRHVAWLSLDANNAAGGIILMWKENHIDVVNSFLGAFSIIIQCMFQRQKEGWITGVYGPCDYQGRKRFLLELYDLQGLCQGI

Query:  WCVVG
        WCV G
Subjt:  WCVVG

RVW50736.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]2.1e-1543.81Show/hide
Query:  QETKLAKVDRRLVKSIWSSRHVAWLSLDANNAAGGIILMWKENHIDVVNSFLGAFSIIIQCMFQRQKEGWITGVYGPCDYQGRKRFLLELYDLQGLCQGI
        QETK  K DRRLV S+W+ R+  W+ L A  A+GGI+ +W    +      LG+FSI ++   +     WI+ VYGP     RK FL+ELYD+ GL   +
Subjt:  QETKLAKVDRRLVKSIWSSRHVAWLSLDANNAAGGIILMWKENHIDVVNSFLGAFSIIIQCMFQRQKEGWITGVYGPCDYQGRKRFLLELYDLQGLCQGI

Query:  WCVVG
        WCV G
Subjt:  WCVVG

XP_021820446.1 uncharacterized protein LOC110762145 [Prunus avium]1.4e-1639.05Show/hide
Query:  QETKLAKVDRRLVKSIWSSRHVAWLSLDANNAAGGIILMWKENHIDVVNSFLGAFSIIIQCMFQRQKEGWITGVYGPCDYQGRKRFLLELYDLQGLCQGI
        QETK  ++DRRLV S+W SR   W+ + +   +GGI+++W    + V++S +  FS+ I+       + W++G+YGPC  + R RF +EL  L GLC   
Subjt:  QETKLAKVDRRLVKSIWSSRHVAWLSLDANNAAGGIILMWKENHIDVVNSFLGAFSIIIQCMFQRQKEGWITGVYGPCDYQGRKRFLLELYDLQGLCQGI

Query:  WCVVG
        WC+ G
Subjt:  WCVVG

XP_022145142.1 uncharacterized protein LOC111014657 [Momordica charantia]3.2e-1644Show/hide
Query:  ETKLAKVDRRLVKSIWSSRHVAWLSLDANNAAGGIILMWKENHIDVVNSFLGAFSIIIQCMFQRQKEGWITGVYGPCDYQGRKRFLLELYDLQGLCQGIW
        ETK + ++ + +KS+WSS  +AW SLDA+ A+GGIIL+W +     V    G FSI +          W+TGVY P   + RK F  EL+DL GLC  IW
Subjt:  ETKLAKVDRRLVKSIWSSRHVAWLSLDANNAAGGIILMWKENHIDVVNSFLGAFSIIIQCMFQRQKEGWITGVYGPCDYQGRKRFLLELYDLQGLCQGIW

XP_031739979.1 uncharacterized protein LOC116403332 [Cucumis sativus]2.0e-2150.48Show/hide
Query:  QETKLAKVDRRLVKSIWSSRHVAWLSLDANNAAGGIILMWKENHIDVVNSFLGAFSIIIQCMFQRQKEGWITGVYGPCDYQGRKRFLLELYDLQGLCQGI
        Q++K++ V+R LVKS+WSS  V W +L+A  ++GGI+++WKE+ I VV+S  G FSI I   F     GWITGVYGP  Y+ R +F  EL  L GLC   
Subjt:  QETKLAKVDRRLVKSIWSSRHVAWLSLDANNAAGGIILMWKENHIDVVNSFLGAFSIIIQCMFQRQKEGWITGVYGPCDYQGRKRFLLELYDLQGLCQGI

Query:  WCVVG
        WCV G
Subjt:  WCVVG

TrEMBL top hitse value%identityAlignment
A0A6J1CVN2 uncharacterized protein LOC1110146571.6e-1644Show/hide
Query:  ETKLAKVDRRLVKSIWSSRHVAWLSLDANNAAGGIILMWKENHIDVVNSFLGAFSIIIQCMFQRQKEGWITGVYGPCDYQGRKRFLLELYDLQGLCQGIW
        ETK + ++ + +KS+WSS  +AW SLDA+ A+GGIIL+W +     V    G FSI +          W+TGVY P   + RK F  EL+DL GLC  IW
Subjt:  ETKLAKVDRRLVKSIWSSRHVAWLSLDANNAAGGIILMWKENHIDVVNSFLGAFSIIIQCMFQRQKEGWITGVYGPCDYQGRKRFLLELYDLQGLCQGIW

A0A6P5T1U8 uncharacterized protein LOC1107621457.0e-1739.05Show/hide
Query:  QETKLAKVDRRLVKSIWSSRHVAWLSLDANNAAGGIILMWKENHIDVVNSFLGAFSIIIQCMFQRQKEGWITGVYGPCDYQGRKRFLLELYDLQGLCQGI
        QETK  ++DRRLV S+W SR   W+ + +   +GGI+++W    + V++S +  FS+ I+       + W++G+YGPC  + R RF +EL  L GLC   
Subjt:  QETKLAKVDRRLVKSIWSSRHVAWLSLDANNAAGGIILMWKENHIDVVNSFLGAFSIIIQCMFQRQKEGWITGVYGPCDYQGRKRFLLELYDLQGLCQGI

Query:  WCVVG
        WC+ G
Subjt:  WCVVG

A0A803P8A0 Uncharacterized protein2.0e-1643.81Show/hide
Query:  QETKLAKVDRRLVKSIWSSRHVAWLSLDANNAAGGIILMWKENHIDVVNSFLGAFSIIIQCMFQRQKEGWITGVYGPCDYQGRKRFLLELYDLQGLCQGI
        QE K A VDRR + SIW SR  AW+ L A   +GG +L+W    I V++S +G FSI +    + ++  W +GVYGPC Y+ R  F  EL  L  +C   
Subjt:  QETKLAKVDRRLVKSIWSSRHVAWLSLDANNAAGGIILMWKENHIDVVNSFLGAFSIIIQCMFQRQKEGWITGVYGPCDYQGRKRFLLELYDLQGLCQGI

Query:  WCVVG
        WCV G
Subjt:  WCVVG

A0A803QEA6 Uncharacterized protein5.4e-1742.86Show/hide
Query:  QETKLAKVDRRLVKSIWSSRHVAWLSLDANNAAGGIILMWKENHIDVVNSFLGAFSIIIQCMFQRQKEGWITGVYGPCDYQGRKRFLLELYDLQGLCQGI
        QE K A VDRR + SIW SR  AW+ + A   +GG +L+W    I V++S +G FSI +    + ++  W +GVYGPC Y+ R  F  EL  L  +C   
Subjt:  QETKLAKVDRRLVKSIWSSRHVAWLSLDANNAAGGIILMWKENHIDVVNSFLGAFSIIIQCMFQRQKEGWITGVYGPCDYQGRKRFLLELYDLQGLCQGI

Query:  WCVVG
        WCV G
Subjt:  WCVVG

A0A803QI00 Uncharacterized protein4.5e-1641.9Show/hide
Query:  QETKLAKVDRRLVKSIWSSRHVAWLSLDANNAAGGIILMWKENHIDVVNSFLGAFSIIIQCMFQRQKEGWITGVYGPCDYQGRKRFLLELYDLQGLCQGI
        QE K   VDRR + SIW SR  AW+ + A   +GG +L+W    I V++S +G FSI +    + +   W +GVYGPC Y+ R  F  EL  L  +C   
Subjt:  QETKLAKVDRRLVKSIWSSRHVAWLSLDANNAAGGIILMWKENHIDVVNSFLGAFSIIIQCMFQRQKEGWITGVYGPCDYQGRKRFLLELYDLQGLCQGI

Query:  WCVVG
        WCV G
Subjt:  WCVVG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTCTTGACCGACAAGGGGAAGAAGGAAACGAAGGTCCTGACCATGTGGCCGAAACTCCAGTCGGAGACGAAGAATATACTCTACGAGAGATAAGAATTCAT
GAGGAAGAACCCATTGCCTCTGAAACCGTTGTTGAGCAGTCGATAGTTAATCCCCAAGCCTTGCCAGTCTTTTCGCCAACGAACACAACGCACAATAGCTCCTTA
GATGGATTTTCGATTAGCAAAGAGGTTGTGATTACTCTTTGGAAAAATAACCTCTGCATTAGGCCTATCTCTGGGTCGAATATGAAGAAGGGAAGCACAGCTCAA
AAAAAGAGGAATAGGGAGATGACGAGCCTCCTTAGAACTTGGGAAAAAGAGGTCGAAGATACCAATGATAATGGATGCTCTGAAGATTGTGAAAATGCCTTCGAT
AGCTTAGAAAGGGCCAAACAGGAGACTAAGCTTGCTAAGGTAGATAGGCGTTTGGTTAAGTCCATTTGGAGCTCTAGGCATGTTGCGTGGTTGTCCCTAGACGCT
AATAATGCAGCGGGGGGTATCATTCTGATGTGGAAAGAGAACCATATTGATGTGGTTAACTCGTTTTTGGGGGCCTTTTCTATCATCATTCAATGCATGTTCCAA
AGGCAGAAAGAAGGATGGATTACTGGTGTTTACGGCCCGTGTGACTACCAAGGTAGAAAACGTTTCCTTCTAGAATTGTACGATTTACAGGGTTTATGTCAAGGC
ATTTGGTGTGTGGTGGGGACTTCAATATGGTTAGATGGTCGGAAGAAAGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGCTTCTTGACCGACAAGGGGAAGAAGGAAACGAAGGTCCTGACCATGTGGCCGAAACTCCAGTCGGAGACGAAGAATATACTCTACGAGAGATAAGAATTCAT
GAGGAAGAACCCATTGCCTCTGAAACCGTTGTTGAGCAGTCGATAGTTAATCCCCAAGCCTTGCCAGTCTTTTCGCCAACGAACACAACGCACAATAGCTCCTTA
GATGGATTTTCGATTAGCAAAGAGGTTGTGATTACTCTTTGGAAAAATAACCTCTGCATTAGGCCTATCTCTGGGTCGAATATGAAGAAGGGAAGCACAGCTCAA
AAAAAGAGGAATAGGGAGATGACGAGCCTCCTTAGAACTTGGGAAAAAGAGGTCGAAGATACCAATGATAATGGATGCTCTGAAGATTGTGAAAATGCCTTCGAT
AGCTTAGAAAGGGCCAAACAGGAGACTAAGCTTGCTAAGGTAGATAGGCGTTTGGTTAAGTCCATTTGGAGCTCTAGGCATGTTGCGTGGTTGTCCCTAGACGCT
AATAATGCAGCGGGGGGTATCATTCTGATGTGGAAAGAGAACCATATTGATGTGGTTAACTCGTTTTTGGGGGCCTTTTCTATCATCATTCAATGCATGTTCCAA
AGGCAGAAAGAAGGATGGATTACTGGTGTTTACGGCCCGTGTGACTACCAAGGTAGAAAACGTTTCCTTCTAGAATTGTACGATTTACAGGGTTTATGTCAAGGC
ATTTGGTGTGTGGTGGGGACTTCAATATGGTTAGATGGTCGGAAGAAAGAATGA
Protein sequenceShow/hide protein sequence
MLLDRQGEEGNEGPDHVAETPVGDEEYTLREIRIHEEEPIASETVVEQSIVNPQALPVFSPTNTTHNSSLDGFSISKEVVITLWKNNLCIRPISGSNMKKGSTAQ
KKRNREMTSLLRTWEKEVEDTNDNGCSEDCENAFDSLERAKQETKLAKVDRRLVKSIWSSRHVAWLSLDANNAAGGIILMWKENHIDVVNSFLGAFSIIIQCMFQ
RQKEGWITGVYGPCDYQGRKRFLLELYDLQGLCQGIWCVVGTSIWLDGRKKE