; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg019797 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg019797
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionTransposon TX1 uncharacterized 149 kDa protein
Genome locationscaffold5:30671530..30674244
RNA-Seq ExpressionSpg019797
SyntenySpg019797
Gene Ontology termsGO:0006284 - base-excision repair (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016020 - membrane (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003906 - DNA-(apurinic or apyrimidinic site) endonuclease activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0008081 - phosphoric diester hydrolase activity (molecular function)
GO:0008311 - double-stranded DNA 3'-5' exodeoxyribonuclease activity (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR004808 - AP endonuclease 1
IPR020847 - AP endonuclease 1, binding site
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
RVW35852.1 hypothetical protein CK203_084592 [Vitis vinifera]6.7e-1539.74Show/hide
Query:  QNITKEGNEEGG---------PDKAGEENRGALTPTTQTESSKGNEGKVRDRPK--------EMTPTALAVILRRH--MIISWNVRGLGARPKRALIKDL
        Q I+K   E+GG         PD+  EEN+ AL+    TES    +  VR   +         ++P  +A   R +  + +SWNVRGLG+R KR ++KD 
Subjt:  QNITKEGNEEGG---------PDKAGEENRGALTPTTQTESSKGNEGKVRDRPK--------EMTPTALAVILRRH--MIISWNVRGLGARPKRALIKDL

Query:  LSRENPDLVILQESKLTKVDRGMIKSVWSSRHVGWVTLEAMGSVGGILIMW
        L  ENPD+V++QE+K    DR  + SVW++R+  WV L A G+ GGILI+W
Subjt:  LSRENPDLVILQESKLTKVDRGMIKSVWSSRHVGWVTLEAMGSVGGILIMW

RVW64166.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]6.7e-1538.96Show/hide
Query:  NEEGGPDKAGEENRGALTPTTQTESSKGNEGKVRDR-------------PKEMTPTALA-VILRRHMIISWNVRGLGARPKRALIKDLLSRENPDLVILQ
        N    PD+  EEN+ AL+    +ES    +  VR               P++M     A V +    IISWNVRGLG+R KR +IKD L  ENPD+V++Q
Subjt:  NEEGGPDKAGEENRGALTPTTQTESSKGNEGKVRDR-------------PKEMTPTALA-VILRRHMIISWNVRGLGARPKRALIKDLLSRENPDLVILQ

Query:  ESKLTKVDRGMIKSVWSSRHVGWVTLEAMGSVGGILIMWKESCIEVVDSGEIVL
        E+K    DR  + SVW+ R+  WV L A G+ GGILI+W     +++   E+V+
Subjt:  ESKLTKVDRGMIKSVWSSRHVGWVTLEAMGSVGGILIMWKESCIEVVDSGEIVL

XP_010263157.1 PREDICTED: uncharacterized protein LOC104601500 [Nelumbo nucifera]8.8e-1549.38Show/hide
Query:  IISWNVRGLGARPKRALIKDLLSRENPDLVILQESKLTKVDRGMIKSVWSSRHVGWVTLEAMGSVGGILIMWKESCIEVVD
        I+SWNVRGLG   KRALIK++L +ENPD+ ++QESKL  +D+  ++SVW +  + WV   + GS GGI+ +WK+  +E V+
Subjt:  IISWNVRGLGARPKRALIKDLLSRENPDLVILQESKLTKVDRGMIKSVWSSRHVGWVTLEAMGSVGGILIMWKESCIEVVD

XP_010269625.1 PREDICTED: uncharacterized protein LOC104606223 [Nelumbo nucifera]1.6e-1655.56Show/hide
Query:  IISWNVRGLGARPKRALIKDLLSRENPDLVILQESKLTKVDRGMIKSVWSSRHVGWVTLEAMGSVGGILIMWKESCIEVVD
        IISWNVRGLG+  KR +IKDLL RE PD+V+LQESKL  +D   ++S W SR +GW    + G+ GGI+ +WKE  +EVV+
Subjt:  IISWNVRGLGARPKRALIKDLLSRENPDLVILQESKLTKVDRGMIKSVWSSRHVGWVTLEAMGSVGGILIMWKESCIEVVD

XP_022149859.1 uncharacterized protein LOC111018186 [Momordica charantia]4.1e-2836.16Show/hide
Query:  EVRRINWNEVIVLTKRDFHDDWGRILEILQHQLETTLIINPFQPDKALMKYPNKDLADLL----------------------------------------
        EVRR+NW E IV+T+RDFHDDW RIL  ++ Q E++ IINPFQ DKALMK P+KDLA LL                                        
Subjt:  EVRRINWNEVIVLTKRDFHDDWGRILEILQHQLETTLIINPFQPDKALMKYPNKDLADLL----------------------------------------

Query:  ----------TKKQRVECLGGFIDYAELNSLLIDCIEVGIRISNNYCD------------------------------RAARIHGTFSSEAAHAFHRGPH
                  T K     LGGFIDY + NS  I+C +V I++ +NYC                               +   IHG FSSEAA +FH+G  
Subjt:  ----------TKKQRVECLGGFIDYAELNSLLIDCIEVGIRISNNYCD------------------------------RAARIHGTFSSEAAHAFHRGPH

Query:  DSCFNPVDKWQIENALNRPVVIIQ
        +   N +D+W++EN  N P V IQ
Subjt:  DSCFNPVDKWQIENALNRPVVIIQ

TrEMBL top hitse value%identityAlignment
A0A1U8A916 uncharacterized protein LOC1046015004.2e-1549.38Show/hide
Query:  IISWNVRGLGARPKRALIKDLLSRENPDLVILQESKLTKVDRGMIKSVWSSRHVGWVTLEAMGSVGGILIMWKESCIEVVD
        I+SWNVRGLG   KRALIK++L +ENPD+ ++QESKL  +D+  ++SVW +  + WV   + GS GGI+ +WK+  +E V+
Subjt:  IISWNVRGLGARPKRALIKDLLSRENPDLVILQESKLTKVDRGMIKSVWSSRHVGWVTLEAMGSVGGILIMWKESCIEVVD

A0A1U8B190 uncharacterized protein LOC1046062237.7e-1755.56Show/hide
Query:  IISWNVRGLGARPKRALIKDLLSRENPDLVILQESKLTKVDRGMIKSVWSSRHVGWVTLEAMGSVGGILIMWKESCIEVVD
        IISWNVRGLG+  KR +IKDLL RE PD+V+LQESKL  +D   ++S W SR +GW    + G+ GGI+ +WKE  +EVV+
Subjt:  IISWNVRGLGARPKRALIKDLLSRENPDLVILQESKLTKVDRGMIKSVWSSRHVGWVTLEAMGSVGGILIMWKESCIEVVD

A0A438DK43 Uncharacterized protein3.3e-1539.74Show/hide
Query:  QNITKEGNEEGG---------PDKAGEENRGALTPTTQTESSKGNEGKVRDRPK--------EMTPTALAVILRRH--MIISWNVRGLGARPKRALIKDL
        Q I+K   E+GG         PD+  EEN+ AL+    TES    +  VR   +         ++P  +A   R +  + +SWNVRGLG+R KR ++KD 
Subjt:  QNITKEGNEEGG---------PDKAGEENRGALTPTTQTESSKGNEGKVRDRPK--------EMTPTALAVILRRH--MIISWNVRGLGARPKRALIKDL

Query:  LSRENPDLVILQESKLTKVDRGMIKSVWSSRHVGWVTLEAMGSVGGILIMW
        L  ENPD+V++QE+K    DR  + SVW++R+  WV L A G+ GGILI+W
Subjt:  LSRENPDLVILQESKLTKVDRGMIKSVWSSRHVGWVTLEAMGSVGGILIMW

A0A438FW30 Transposon TX1 uncharacterized 149 kDa protein3.3e-1538.96Show/hide
Query:  NEEGGPDKAGEENRGALTPTTQTESSKGNEGKVRDR-------------PKEMTPTALA-VILRRHMIISWNVRGLGARPKRALIKDLLSRENPDLVILQ
        N    PD+  EEN+ AL+    +ES    +  VR               P++M     A V +    IISWNVRGLG+R KR +IKD L  ENPD+V++Q
Subjt:  NEEGGPDKAGEENRGALTPTTQTESSKGNEGKVRDR-------------PKEMTPTALA-VILRRHMIISWNVRGLGARPKRALIKDLLSRENPDLVILQ

Query:  ESKLTKVDRGMIKSVWSSRHVGWVTLEAMGSVGGILIMWKESCIEVVDSGEIVL
        E+K    DR  + SVW+ R+  WV L A G+ GGILI+W     +++   E+V+
Subjt:  ESKLTKVDRGMIKSVWSSRHVGWVTLEAMGSVGGILIMWKESCIEVVDSGEIVL

A0A6J1D6X4 uncharacterized protein LOC1110181862.0e-2836.16Show/hide
Query:  EVRRINWNEVIVLTKRDFHDDWGRILEILQHQLETTLIINPFQPDKALMKYPNKDLADLL----------------------------------------
        EVRR+NW E IV+T+RDFHDDW RIL  ++ Q E++ IINPFQ DKALMK P+KDLA LL                                        
Subjt:  EVRRINWNEVIVLTKRDFHDDWGRILEILQHQLETTLIINPFQPDKALMKYPNKDLADLL----------------------------------------

Query:  ----------TKKQRVECLGGFIDYAELNSLLIDCIEVGIRISNNYCD------------------------------RAARIHGTFSSEAAHAFHRGPH
                  T K     LGGFIDY + NS  I+C +V I++ +NYC                               +   IHG FSSEAA +FH+G  
Subjt:  ----------TKKQRVECLGGFIDYAELNSLLIDCIEVGIRISNNYCD------------------------------RAARIHGTFSSEAAHAFHRGPH

Query:  DSCFNPVDKWQIENALNRPVVIIQ
        +   N +D+W++EN  N P V IQ
Subjt:  DSCFNPVDKWQIENALNRPVVIIQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGAAAATATCGAACAAACGGGGAAGCTTCTTGGAGATCACAAAAGTAGCTAACTTGGGCGGCAAACATAATCTGGTTGTCCCAGCGGGAGTGGACTTCAGGGGGTG
GAAGGATTTTTTGCTCCTTTTGAGAAGTTTTGTTGATGGAAAATCTACGGAGGATGCCAACTTAGACAAGGATGAGGAACCCAAAAGGAAGGCTGGAAGGAAATCCTTCG
CAGATGCTTTAAAAGGTCCCATTAATAGAGATACAAAATCACCCCAAAACAGAGGAATGAGACCAAAAGAGAACCTCTATATGGACGCCTCGAAAACCTGCGTTAATGAA
GTGAGAAGAATTAACTGGAATGAGGTGATTGTGTTAACAAAGAGAGATTTTCATGATGATTGGGGTCGAATCCTAGAAATCTTGCAACATCAATTAGAGACCACCCTAAT
TATAAACCCATTCCAGCCAGACAAAGCCCTTATGAAATACCCGAACAAGGACCTTGCGGATTTGTTGACAAAAAAACAGAGGGTGGAATGCTTGGGAGGCTTCATTGATT
ACGCGGAACTAAACTCTTTGCTCATTGATTGCATTGAGGTGGGAATCAGAATAAGCAATAACTATTGCGATAGGGCTGCCAGAATCCATGGAACTTTCTCGTCGGAAGCT
GCCCATGCGTTTCACAGAGGTCCCCATGACTCGTGCTTCAATCCAGTGGATAAATGGCAAATTGAAAACGCTCTGAATCGCCCAGTGGTTATTATCCAGGGGTTGTGCAT
GGACAACGAGATATCGGATGTCCAAGGGGGGAAATCTCGAAGAATTAATTTTGAATTACCCCGCCAAAAAGATGATATGGAAAAAAGGAAATCTGACGACCCCGAAGCTG
AGATGGAAAAAAAGATGTATGATGGCCCAAACCTGATACAGAAACCCACAAATGAGGCCCACAAGTTGAAAGGAAAGGAAGCCCACCAAAAAGAACCTGAAGGAAAACAA
AAAAGGAAAATTGCAGATGGATATGCGTTGACTCCCACTTGGCAAAATATTACTAAAGAAGGAAATGAAGAGGGCGGACCCGATAAAGCAGGGGAGGAAAACAGAGGTGC
GTTGACCCCCACTACCCAGACTGAATCTTCGAAGGGAAATGAAGGGAAAGTTAGAGATAGGCCCAAGGAGATGACTCCGACAGCCTTGGCAGTTATCCTCCGACGACACA
TGATTATCTCTTGGAATGTTAGAGGCTTGGGAGCTCGACCGAAAAGAGCTTTAATCAAAGATTTGCTTAGTAGGGAGAATCCAGACCTGGTGATTCTCCAAGAATCCAAA
TTGACCAAGGTTGACAGGGGGATGATTAAATCAGTCTGGAGCTCTAGACATGTTGGTTGGGTGACTCTAGAGGCAATGGGATCCGTGGGAGGTATCCTTATCATGTGGAA
AGAGAGCTGCATCGAGGTGGTTGATTCAGGGGAAATTGTGCTAGATGGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGCAGAAAATATCGAACAAACGGGGAAGCTTCTTGGAGATCACAAAAGTAGCTAACTTGGGCGGCAAACATAATCTGGTTGTCCCAGCGGGAGTGGACTTCAGGGGGTG
GAAGGATTTTTTGCTCCTTTTGAGAAGTTTTGTTGATGGAAAATCTACGGAGGATGCCAACTTAGACAAGGATGAGGAACCCAAAAGGAAGGCTGGAAGGAAATCCTTCG
CAGATGCTTTAAAAGGTCCCATTAATAGAGATACAAAATCACCCCAAAACAGAGGAATGAGACCAAAAGAGAACCTCTATATGGACGCCTCGAAAACCTGCGTTAATGAA
GTGAGAAGAATTAACTGGAATGAGGTGATTGTGTTAACAAAGAGAGATTTTCATGATGATTGGGGTCGAATCCTAGAAATCTTGCAACATCAATTAGAGACCACCCTAAT
TATAAACCCATTCCAGCCAGACAAAGCCCTTATGAAATACCCGAACAAGGACCTTGCGGATTTGTTGACAAAAAAACAGAGGGTGGAATGCTTGGGAGGCTTCATTGATT
ACGCGGAACTAAACTCTTTGCTCATTGATTGCATTGAGGTGGGAATCAGAATAAGCAATAACTATTGCGATAGGGCTGCCAGAATCCATGGAACTTTCTCGTCGGAAGCT
GCCCATGCGTTTCACAGAGGTCCCCATGACTCGTGCTTCAATCCAGTGGATAAATGGCAAATTGAAAACGCTCTGAATCGCCCAGTGGTTATTATCCAGGGGTTGTGCAT
GGACAACGAGATATCGGATGTCCAAGGGGGGAAATCTCGAAGAATTAATTTTGAATTACCCCGCCAAAAAGATGATATGGAAAAAAGGAAATCTGACGACCCCGAAGCTG
AGATGGAAAAAAAGATGTATGATGGCCCAAACCTGATACAGAAACCCACAAATGAGGCCCACAAGTTGAAAGGAAAGGAAGCCCACCAAAAAGAACCTGAAGGAAAACAA
AAAAGGAAAATTGCAGATGGATATGCGTTGACTCCCACTTGGCAAAATATTACTAAAGAAGGAAATGAAGAGGGCGGACCCGATAAAGCAGGGGAGGAAAACAGAGGTGC
GTTGACCCCCACTACCCAGACTGAATCTTCGAAGGGAAATGAAGGGAAAGTTAGAGATAGGCCCAAGGAGATGACTCCGACAGCCTTGGCAGTTATCCTCCGACGACACA
TGATTATCTCTTGGAATGTTAGAGGCTTGGGAGCTCGACCGAAAAGAGCTTTAATCAAAGATTTGCTTAGTAGGGAGAATCCAGACCTGGTGATTCTCCAAGAATCCAAA
TTGACCAAGGTTGACAGGGGGATGATTAAATCAGTCTGGAGCTCTAGACATGTTGGTTGGGTGACTCTAGAGGCAATGGGATCCGTGGGAGGTATCCTTATCATGTGGAA
AGAGAGCTGCATCGAGGTGGTTGATTCAGGGGAAATTGTGCTAGATGGCTAG
Protein sequenceShow/hide protein sequence
MQKISNKRGSFLEITKVANLGGKHNLVVPAGVDFRGWKDFLLLLRSFVDGKSTEDANLDKDEEPKRKAGRKSFADALKGPINRDTKSPQNRGMRPKENLYMDASKTCVNE
VRRINWNEVIVLTKRDFHDDWGRILEILQHQLETTLIINPFQPDKALMKYPNKDLADLLTKKQRVECLGGFIDYAELNSLLIDCIEVGIRISNNYCDRAARIHGTFSSEA
AHAFHRGPHDSCFNPVDKWQIENALNRPVVIIQGLCMDNEISDVQGGKSRRINFELPRQKDDMEKRKSDDPEAEMEKKMYDGPNLIQKPTNEAHKLKGKEAHQKEPEGKQ
KRKIADGYALTPTWQNITKEGNEEGGPDKAGEENRGALTPTTQTESSKGNEGKVRDRPKEMTPTALAVILRRHMIISWNVRGLGARPKRALIKDLLSRENPDLVILQESK
LTKVDRGMIKSVWSSRHVGWVTLEAMGSVGGILIMWKESCIEVVDSGEIVLDG