CuGenDBv2

Spg029334 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg029334
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: scaffold12:36294227..36299003
RNA-Seq Expression: Spg029334
Synteny: Spg029334
Gene Ontology terms:
  GO:0006281 - DNA repair (biological process)
  GO:0004518 - nuclease activity (molecular function)
InterPro domains:
  IPR004808 - AP endonuclease 1
  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
RVW25035.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera] (e value: 5.8e-11; % identity: 27.03)
Query:  IGESSLFSDLSSPQRVIQPV--KAFKVTQSS--PPLREVESDGESEVSLS-SIEANLQPMNNDSDDF---------PEETCVDGLG---------LLFQD
        + ESS+ S + S      P+  ++  V+Q S  P +  VE      VS S  +E  L+  N +S+D+         PE+ CV G              +D
Subjt:  IGESSLFSDLSSPQRVIQPV--KAFKVTQSS--PPLREVESDGESEVSLS-SIEANLQPMNNDSDDF---------PEETCVDGLG---------LLFQD

Query:  YGKVVCSPPRLDIINSKSEVLSF---ASNSVPNKFSSLFEAYGLQIISWITRGLGDRSKSIDLKKFIQHYRPDLVLIQEMKMVSFDNRIIKSLWSSRDIS
        +     +P ++  + S  E L       N    +  +   ++  +I+SW TRGLG + K   +++F+    PD+V++QE K  ++D R + S+W+ + + 
Subjt:  YGKVVCSPPRLDIINSKSEVLSF---ASNSVPNKFSSLFEAYGLQIISWITRGLGDRSKSIDLKKFIQHYRPDLVLIQEMKMVSFDNRIIKSLWSSRDIS

Query:  WVSVDSIGRFGDMLIMWVDSKF
        WV++ + G  G ++I+W  SKF
Subjt:  WVSVDSIGRFGDMLIMWVDSKF

RVW67200.1 hypothetical protein CK203_065147 [Vitis vinifera] (e value: 3.4e-11; % identity: 39.76)
Query:  LQIISWITRGLGDRSKSIDLKKFIQHYRPDLVLIQEMKMVSFDNRIIKSLWSSRDISWVSVDSIGRFGDMLIMWVDSKFRSYG
        ++I+SW TRGLG R K   +++F+    PD+VL+QE K   +D R + S+W  + + WV++ + G FG ++I+W   KF  YG
Subjt:  LQIISWITRGLGDRSKSIDLKKFIQHYRPDLVLIQEMKMVSFDNRIIKSLWSSRDISWVSVDSIGRFGDMLIMWVDSKFRSYG

RVW77727.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera] (e value: 3.4e-11; % identity: 39.02)
Query:  QIISWITRGLGDRSKSIDLKKFIQHYRPDLVLIQEMKMVSFDNRIIKSLWSSRDISWVSVDSIGRFGDMLIMWVDSKFRSYG
        +I+SW TRGLG R K   +++F+    PD+V++QE K V +D R + S+W  R + W ++ + G  G ++I+W   KF+ YG
Subjt:  QIISWITRGLGDRSKSIDLKKFIQHYRPDLVLIQEMKMVSFDNRIIKSLWSSRDISWVSVDSIGRFGDMLIMWVDSKFRSYG

RVW96808.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera] (e value: 5.8e-11; % identity: 28.39)
Query:  PMNNDSDDFPEETCVDGLG---------LLFQDYGKVVCSPPRLDIINSKSEVLSF---ASNSVPNKFSSLFEAYGLQIISWITRGLGDRSKSIDLKKFI
        P+  DS   PE+ CV G           L  +D+     +P ++  + S  E L       N    +  +   ++  +I+SW TRGLG + K   +++F+
Subjt:  PMNNDSDDFPEETCVDGLG---------LLFQDYGKVVCSPPRLDIINSKSEVLSF---ASNSVPNKFSSLFEAYGLQIISWITRGLGDRSKSIDLKKFI

Query:  QHYRPDLVLIQEMKMVSFDNRIIKSLWSSRDISWVSVDSIGRFGDMLIMWVDSKF
            P++V++QE K  ++D R + S+W+ R + WV++ + G  G ++I+W  SKF
Subjt:  QHYRPDLVLIQEMKMVSFDNRIIKSLWSSRDISWVSVDSIGRFGDMLIMWVDSKF

XP_038876676.1 uncharacterized protein LOC120069076 [Benincasa hispida] (e value: 1.2e-11; % identity: 35)
Query:  LQIISWITRGLGDRSKSIDLKKFIQHYRPDLVLIQEMKMVSFDNRIIKSLWSSRDISWVSVDSIGRFGDMLIMWVDSKF--RSYGWTGYVIALKLRNLKV
        ++I++W TRGLGD SK + LK+F++   PD+VLIQE K    +   IKSLWSS+++    V++ G+ G +L +W DSK    S     + +++K + +  
Subjt:  LQIISWITRGLGDRSKSIDLKKFIQHYRPDLVLIQEMKMVSFDNRIIKSLWSSRDISWVSVDSIGRFGDMLIMWVDSKF--RSYGWTGYVIALKLRNLKV

Query:  VKLWFVDFETKTKSQDKRFL
           W  +       Q++R L
Subjt:  VKLWFVDFETKTKSQDKRFL

TrEMBL top hits (e value, % identity, alignment)
A0A438CP96 LINE-1 retrotransposable element ORF2 protein (e value: 2.8e-11; % identity: 27.03)
Query:  IGESSLFSDLSSPQRVIQPV--KAFKVTQSS--PPLREVESDGESEVSLS-SIEANLQPMNNDSDDF---------PEETCVDGLG---------LLFQD
        + ESS+ S + S      P+  ++  V+Q S  P +  VE      VS S  +E  L+  N +S+D+         PE+ CV G              +D
Subjt:  IGESSLFSDLSSPQRVIQPV--KAFKVTQSS--PPLREVESDGESEVSLS-SIEANLQPMNNDSDDF---------PEETCVDGLG---------LLFQD

Query:  YGKVVCSPPRLDIINSKSEVLSF---ASNSVPNKFSSLFEAYGLQIISWITRGLGDRSKSIDLKKFIQHYRPDLVLIQEMKMVSFDNRIIKSLWSSRDIS
        +     +P ++  + S  E L       N    +  +   ++  +I+SW TRGLG + K   +++F+    PD+V++QE K  ++D R + S+W+ + + 
Subjt:  YGKVVCSPPRLDIINSKSEVLSF---ASNSVPNKFSSLFEAYGLQIISWITRGLGDRSKSIDLKKFIQHYRPDLVLIQEMKMVSFDNRIIKSLWSSRDIS

Query:  WVSVDSIGRFGDMLIMWVDSKF
        WV++ + G  G ++I+W  SKF
Subjt:  WVSVDSIGRFGDMLIMWVDSKF

A0A438FW30 Transposon TX1 uncharacterized 149 kDa protein (e value: 4.8e-11; % identity: 37.82)
Query:  PRLDIINSKSEVLS--FASNSVPNKFSSLFEA----YGLQIISWITRGLGDRSKSIDLKKFIQHYRPDLVLIQEMKMVSFDNRIIKSLWSSRDISWVSVD
        P+  I+ S  EVL         P K + + EA    + ++IISW  RGLG R+K   +K F++   PD+V+IQE K  + D R + S+W+ R+  WV++ 
Subjt:  PRLDIINSKSEVLS--FASNSVPNKFSSLFEA----YGLQIISWITRGLGDRSKSIDLKKFIQHYRPDLVLIQEMKMVSFDNRIIKSLWSSRDISWVSVD

Query:  SIGRFGDMLIMWVDSKFRS
        + G  G +LI+W DSK  S
Subjt:  SIGRFGDMLIMWVDSKFRS

A0A438G4S3 Uncharacterized protein (e value: 1.6e-11; % identity: 39.76)
Query:  LQIISWITRGLGDRSKSIDLKKFIQHYRPDLVLIQEMKMVSFDNRIIKSLWSSRDISWVSVDSIGRFGDMLIMWVDSKFRSYG
        ++I+SW TRGLG R K   +++F+    PD+VL+QE K   +D R + S+W  + + WV++ + G FG ++I+W   KF  YG
Subjt:  LQIISWITRGLGDRSKSIDLKKFIQHYRPDLVLIQEMKMVSFDNRIIKSLWSSRDISWVSVDSIGRFGDMLIMWVDSKFRSYG

A0A438H059 Transposon TX1 uncharacterized 149 kDa protein (e value: 1.6e-11; % identity: 39.02)
Query:  QIISWITRGLGDRSKSIDLKKFIQHYRPDLVLIQEMKMVSFDNRIIKSLWSSRDISWVSVDSIGRFGDMLIMWVDSKFRSYG
        +I+SW TRGLG R K   +++F+    PD+V++QE K V +D R + S+W  R + W ++ + G  G ++I+W   KF+ YG
Subjt:  QIISWITRGLGDRSKSIDLKKFIQHYRPDLVLIQEMKMVSFDNRIIKSLWSSRDISWVSVDSIGRFGDMLIMWVDSKFRSYG

A0A6P5T1U8 uncharacterized protein LOC110762145 (e value: 4.8e-11; % identity: 33.08)
Query:  LQIISWITRGLGDRSKSIDLKKFIQHYRPDLVLIQEMKMVSFDNRIIKSLWSSRDISWVSVDSIGRFGDMLIMW-------VDSKFRSYGWTGYVIALKL
        ++IISW  RGLG + K + LK+ +   RPD+V++QE K    D R++ S+W SR   WV V S GR G ++I+W       +DS+   +      +++K+
Subjt:  LQIISWITRGLGDRSKSIDLKKFIQHYRPDLVLIQEMKMVSFDNRIIKSLWSSRDISWVSVDSIGRFGDMLIMW-------VDSKFRSYGWTGYVIALKL

Query:  RNLKVVKLWFVDFETKTKSQDK-RFLAEIA
        R       W        + +D+ RF  E+A
Subjt:  RNLKVVKLWFVDFETKTKSQDK-RFLAEIA

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGAAGGAAGATGTTGACGTTTCAGTATCCAATAGAAAGGAGATTGATTTTGAACATTCAGTATTAATTAAGGCGTCTGATCTTCCAAATTTATCTCAAGCATTGAAAGC
AATTAATCATGATAATCCTCTTTATCAAGAATCTTCTGTTGTTTCCAAAAAGCCCATTGGTGAATCCTCTTTATTTTCAGATTTGTCAAGCCCCCAAAGAGTTATTCAAC
CTGTCAAAGCCTTTAAAGTTACTCAATCTTCCCCTCCCTTGCGAGAGGTTGAGTCTGATGGTGAATCTGAAGTAAGTTTGAGTAGCATTGAAGCAAATTTACAACCTATG
AATAATGATTCAGATGATTTCCCTGAAGAGACTTGTGTTGATGGTTTGGGTTTGCTTTTCCAGGATTATGGAAAAGTGGTATGTTCTCCTCCAAGATTAGATATTATCAA
TTCTAAGTCAGAAGTTTTGTCTTTTGCTTCTAATTCTGTACCAAACAAGTTCTCATCTCTTTTTGAAGCTTATGGCCTTCAAATTATTTCCTGGATTACTAGAGGCCTTG
GTGATCGTTCTAAAAGTATAGATTTGAAGAAATTTATTCAACACTATAGGCCAGATTTGGTGTTAATTCAGGAGATGAAGATGGTGTCTTTTGATAATCGGATAATCAAG
TCACTGTGGAGTTCTAGAGACATTAGTTGGGTCAGTGTTGATTCCATTGGAAGATTTGGAGACATGCTAATTATGTGGGTTGATAGTAAATTTAGATCATATGGTTGGAC
TGGTTATGTGATTGCTTTGAAGCTTAGAAATCTGAAAGTTGTAAAACTCTGGTTTGTTGATTTTGAAACAAAAACAAAGAGTCAGGATAAGAGGTTTCTAGCGGAAATTG
CTAGGGAACTTCTGATTTCTGAACTAAGTAATGCAGATGGGATTTCTCTTCTTTTCCTTGGGGAGGTTGAAGCTGAAGTTCTAAGATGTATTGACTTTGCAATTGCAAAC
AACAGTGTTCCAACTAAAGCTCAGGGTTTACCCAGTCTCTTGAAACAGCGTATGTGGGGCAAAGATCTTGGGCAAAAGGTTGCATTTGGATTTGGAAGATGGGTTGTCAT
CCTCTCTCTGTAA
mRNA sequence
ATGAAGGAAGATGTTGACGTTTCAGTATCCAATAGAAAGGAGATTGATTTTGAACATTCAGTATTAATTAAGGCGTCTGATCTTCCAAATTTATCTCAAGCATTGAAAGC
AATTAATCATGATAATCCTCTTTATCAAGAATCTTCTGTTGTTTCCAAAAAGCCCATTGGTGAATCCTCTTTATTTTCAGATTTGTCAAGCCCCCAAAGAGTTATTCAAC
CTGTCAAAGCCTTTAAAGTTACTCAATCTTCCCCTCCCTTGCGAGAGGTTGAGTCTGATGGTGAATCTGAAGTAAGTTTGAGTAGCATTGAAGCAAATTTACAACCTATG
AATAATGATTCAGATGATTTCCCTGAAGAGACTTGTGTTGATGGTTTGGGTTTGCTTTTCCAGGATTATGGAAAAGTGGTATGTTCTCCTCCAAGATTAGATATTATCAA
TTCTAAGTCAGAAGTTTTGTCTTTTGCTTCTAATTCTGTACCAAACAAGTTCTCATCTCTTTTTGAAGCTTATGGCCTTCAAATTATTTCCTGGATTACTAGAGGCCTTG
GTGATCGTTCTAAAAGTATAGATTTGAAGAAATTTATTCAACACTATAGGCCAGATTTGGTGTTAATTCAGGAGATGAAGATGGTGTCTTTTGATAATCGGATAATCAAG
TCACTGTGGAGTTCTAGAGACATTAGTTGGGTCAGTGTTGATTCCATTGGAAGATTTGGAGACATGCTAATTATGTGGGTTGATAGTAAATTTAGATCATATGGTTGGAC
TGGTTATGTGATTGCTTTGAAGCTTAGAAATCTGAAAGTTGTAAAACTCTGGTTTGTTGATTTTGAAACAAAAACAAAGAGTCAGGATAAGAGGTTTCTAGCGGAAATTG
CTAGGGAACTTCTGATTTCTGAACTAAGTAATGCAGATGGGATTTCTCTTCTTTTCCTTGGGGAGGTTGAAGCTGAAGTTCTAAGATGTATTGACTTTGCAATTGCAAAC
AACAGTGTTCCAACTAAAGCTCAGGGTTTACCCAGTCTCTTGAAACAGCGTATGTGGGGCAAAGATCTTGGGCAAAAGGTTGCATTTGGATTTGGAAGATGGGTTGTCAT
CCTCTCTCTGTAA
Protein sequence
MKEDVDVSVSNRKEIDFEHSVLIKASDLPNLSQALKAINHDNPLYQESSVVSKKPIGESSLFSDLSSPQRVIQPVKAFKVTQSSPPLREVESDGESEVSLSSIEANLQPM
NNDSDDFPEETCVDGLGLLFQDYGKVVCSPPRLDIINSKSEVLSFASNSVPNKFSSLFEAYGLQIISWITRGLGDRSKSIDLKKFIQHYRPDLVLIQEMKMVSFDNRIIK
SLWSSRDISWVSVDSIGRFGDMLIMWVDSKFRSYGWTGYVIALKLRNLKVVKLWFVDFETKTKSQDKRFLAEIARELLISELSNADGISLLFLGEVEAEVLRCIDFAIAN
NSVPTKAQGLPSLLKQRMWGKDLGQKVAFGFGRWVVILSL