; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0015805 (gene) of Snake gourd v1 genome

Gene IDTan0015805
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionENDO3c domain-containing protein
Genome locationLG05:1232822..1234814
RNA-Seq ExpressionTan0015805
SyntenyTan0015805
Gene Ontology termsGO:0000278 - mitotic cell cycle (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0080111 - DNA demethylation (biological process)
GO:0006272 - leading strand elongation (biological process)
GO:0006287 - base-excision repair, gap-filling (biological process)
GO:0006297 - nucleotide-excision repair, DNA gap filling (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0045004 - DNA replication proofreading (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0008622 - epsilon DNA polymerase complex (cellular component)
GO:0000166 - nucleotide binding (molecular function)
GO:0019104 - DNA N-glycosylase activity (molecular function)
GO:0035514 - DNA demethylase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0051539 - 4 iron, 4 sulfur cluster binding (molecular function)
GO:0008310 - single-stranded DNA 3'-5' exodeoxyribonuclease activity (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0003677 - DNA binding (molecular function)
InterPro domainsIPR044811 - DNA glycosylase, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN46937.2 hypothetical protein Csa_020879 [Cucumis sativus]4.2e-3579.57Show/hide
Query:  MKQEIDWNSLRGKWDSMRRVYSGCEPRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVK
        MK E+DWNSLR KWDS+RR +  CEPRS D+MDSVDWEAVR AEPT IADAIKERGQHNIIAGRIK+FL R AR+HG IDLEWLR APP DVK
Subjt:  MKQEIDWNSLRGKWDSMRRVYSGCEPRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVK

XP_008458622.1 PREDICTED: protein ROS1-like isoform X1 [Cucumis melo]6.5e-3681.72Show/hide
Query:  MKQEIDWNSLRGKWDSMRRVYSGCEPRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVK
        MK E+DWNSLR KWDSMRR +  CEPRS D+MDSVDWEAVR AEPT IADAIKERGQHNIIAGRIKEFL R AR+HG IDLEWLR APP DVK
Subjt:  MKQEIDWNSLRGKWDSMRRVYSGCEPRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVK

XP_011658415.1 uncharacterized protein LOC101216331 isoform X1 [Cucumis sativus]4.2e-3579.57Show/hide
Query:  MKQEIDWNSLRGKWDSMRRVYSGCEPRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVK
        MK E+DWNSLR KWDS+RR +  CEPRS D+MDSVDWEAVR AEPT IADAIKERGQHNIIAGRIK+FL R AR+HG IDLEWLR APP DVK
Subjt:  MKQEIDWNSLRGKWDSMRRVYSGCEPRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVK

XP_016902242.1 PREDICTED: protein ROS1-like isoform X2 [Cucumis melo]6.5e-3681.72Show/hide
Query:  MKQEIDWNSLRGKWDSMRRVYSGCEPRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVK
        MK E+DWNSLR KWDSMRR +  CEPRS D+MDSVDWEAVR AEPT IADAIKERGQHNIIAGRIKEFL R AR+HG IDLEWLR APP DVK
Subjt:  MKQEIDWNSLRGKWDSMRRVYSGCEPRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVK

XP_031742557.1 uncharacterized protein LOC101216331 isoform X2 [Cucumis sativus]4.2e-3579.57Show/hide
Query:  MKQEIDWNSLRGKWDSMRRVYSGCEPRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVK
        MK E+DWNSLR KWDS+RR +  CEPRS D+MDSVDWEAVR AEPT IADAIKERGQHNIIAGRIK+FL R AR+HG IDLEWLR APP DVK
Subjt:  MKQEIDWNSLRGKWDSMRRVYSGCEPRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVK

TrEMBL top hitse value%identityAlignment
A0A0A0KB10 ENDO3c domain-containing protein2.0e-3579.57Show/hide
Query:  MKQEIDWNSLRGKWDSMRRVYSGCEPRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVK
        MK E+DWNSLR KWDS+RR +  CEPRS D+MDSVDWEAVR AEPT IADAIKERGQHNIIAGRIK+FL R AR+HG IDLEWLR APP DVK
Subjt:  MKQEIDWNSLRGKWDSMRRVYSGCEPRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVK

A0A1S3C8A0 protein ROS1-like isoform X13.1e-3681.72Show/hide
Query:  MKQEIDWNSLRGKWDSMRRVYSGCEPRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVK
        MK E+DWNSLR KWDSMRR +  CEPRS D+MDSVDWEAVR AEPT IADAIKERGQHNIIAGRIKEFL R AR+HG IDLEWLR APP DVK
Subjt:  MKQEIDWNSLRGKWDSMRRVYSGCEPRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVK

A0A1S4E1Z0 protein ROS1-like isoform X23.1e-3681.72Show/hide
Query:  MKQEIDWNSLRGKWDSMRRVYSGCEPRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVK
        MK E+DWNSLR KWDSMRR +  CEPRS D+MDSVDWEAVR AEPT IADAIKERGQHNIIAGRIKEFL R AR+HG IDLEWLR APP DVK
Subjt:  MKQEIDWNSLRGKWDSMRRVYSGCEPRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVK

A0A5A7SVP4 Protein ROS1-like isoform X13.1e-3681.72Show/hide
Query:  MKQEIDWNSLRGKWDSMRRVYSGCEPRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVK
        MK E+DWNSLR KWDSMRR +  CEPRS D+MDSVDWEAVR AEPT IADAIKERGQHNIIAGRIKEFL R AR+HG IDLEWLR APP DVK
Subjt:  MKQEIDWNSLRGKWDSMRRVYSGCEPRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVK

A0A6J1H5D5 protein ROS1-like1.2e-3276.34Show/hide
Query:  MKQEIDWNSLRGKWDSMRRVYSGCEPRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVK
        MK E+DWNSL+ KWDSMRR YS  EPRS D+MDSVDWEAV SA+P  IA AIKERGQHN IA RIKEF++R AR+HG IDLEWLR APPNDVK
Subjt:  MKQEIDWNSLRGKWDSMRRVYSGCEPRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVK

SwissProt top hitse value%identityAlignment
B8YIE8 Protein ROS1C2.0e-1649.44Show/hide
Query:  NSLRGKWDSMRRVYSG---CEPRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVK
        NS    WD +RR   G    + R  D  DSVDWEAVR A+   I+ AI+ERG +N++A RI++FL+R+   HG IDLEWLR  PP+  K
Subjt:  NSLRGKWDSMRRVYSG---CEPRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVK

C7IW64 Protein ROS1A1.7e-1549.4Show/hide
Query:  WDSMRR--VYS-GCEPRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVK
        WD +R+  +YS G + RS +  DS+DWE +R AE   I+D I+ERG +N++A RIK+FL+R+ R HG IDLEWLR    +  K
Subjt:  WDSMRR--VYS-GCEPRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVK

Q8LK56 Transcriptional activator DEMETER1.8e-1751.19Show/hide
Query:  KWDSMRRVYSGCE---PRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVK
        +WDS+R+   G E    R+ + MDS+D+EA+R A  + I++AIKERG +N++A RIK+FL RI + HG IDLEWLR++PP+  K
Subjt:  KWDSMRRVYSGCE---PRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVK

Q9SJQ6 DNA glycosylase/AP lyase ROS13.3e-1440.2Show/hide
Query:  KQEIDWNSLRGKWDSMRRVYSGCEPRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVKWQSPSLLL
        K+  DW+ LR +     +  +G   ++   MD+VDW+A+R+A+   +A+ IK RG ++ +A RI+ FL R+   HG IDLEWLR  PP+  K      LL
Subjt:  KQEIDWNSLRGKWDSMRRVYSGCEPRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVKWQSPSLLL

Query:  SF
        SF
Subjt:  SF

Q9SR66 DEMETER-like protein 25.0e-1544.57Show/hide
Query:  KQEIDWNSLRGKWDSMRRVYSGCEPRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVK
        K+ +DW+SLR + +S  R       R+   MD+VDW+A+R  +   IA+ I +RG +N++A RIK FL+R+ + HG IDLEWLR  PP+  K
Subjt:  KQEIDWNSLRGKWDSMRRVYSGCEPRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVK

Arabidopsis top hitse value%identityAlignment
AT2G36490.1 demeter-like 12.3e-1540.2Show/hide
Query:  KQEIDWNSLRGKWDSMRRVYSGCEPRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVKWQSPSLLL
        K+  DW+ LR +     +  +G   ++   MD+VDW+A+R+A+   +A+ IK RG ++ +A RI+ FL R+   HG IDLEWLR  PP+  K      LL
Subjt:  KQEIDWNSLRGKWDSMRRVYSGCEPRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVKWQSPSLLL

Query:  SF
        SF
Subjt:  SF

AT3G10010.1 demeter-like 23.6e-1644.57Show/hide
Query:  KQEIDWNSLRGKWDSMRRVYSGCEPRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVK
        K+ +DW+SLR + +S  R       R+   MD+VDW+A+R  +   IA+ I +RG +N++A RIK FL+R+ + HG IDLEWLR  PP+  K
Subjt:  KQEIDWNSLRGKWDSMRRVYSGCEPRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVK

AT4G34060.1 demeter-like protein 34.1e-1242.5Show/hide
Query:  WDSMRRVYSGCEPRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVK
        W+++RR+Y+    R   +MDSV+W  VR +    +   IK+RGQ  I++ RI +FL+     +G IDLEWLR AP + VK
Subjt:  WDSMRRVYSGCEPRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVK

AT5G04560.1 HhH-GPD base excision DNA repair family protein1.3e-1851.19Show/hide
Query:  KWDSMRRVYSGCE---PRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVK
        +WDS+R+   G E    R+ + MDS+D+EA+R A  + I++AIKERG +N++A RIK+FL RI + HG IDLEWLR++PP+  K
Subjt:  KWDSMRRVYSGCE---PRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVK

AT5G04560.2 HhH-GPD base excision DNA repair family protein1.3e-1851.19Show/hide
Query:  KWDSMRRVYSGCE---PRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVK
        +WDS+R+   G E    R+ + MDS+D+EA+R A  + I++AIKERG +N++A RIK+FL RI + HG IDLEWLR++PP+  K
Subjt:  KWDSMRRVYSGCE---PRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAACAGGAAATTGACTGGAATAGTTTGAGGGGAAAATGGGACAGCATGAGGAGAGTCTATTCTGGTTGTGAGCCAAGAAGTAGCGATTACATGGATTCTGTAGATTG
GGAAGCAGTTCGTTCTGCAGAACCTACCACGATTGCAGATGCCATCAAGGAACGTGGCCAGCATAACATCATAGCAGGAAGAATCAAGGAATTTCTTCATCGAATAGCTA
GAATACATGGTCGCATTGACCTTGAATGGCTTAGAAAGGCTCCTCCAAATGATGTTAAGTGGCAGTCACCGTCTCTCCTCCTTTCCTTCCTCTCGTCTTCTTTCACCGTC
TCATCTTCCGGTAAGACACGGCGCCGGCCAGAAGGTCCGAATTCTGGCCGGAGAAGATGGGATGGGACAGTTACTGTCGCGGTTGCCGAACTCCCTTCTTCTCTCTCTCC
AAATTCCTTTGTTGACGGCCATGGGATAATTTATTTATGA
mRNA sequenceShow/hide mRNA sequence
ATGAAACAGGAAATTGACTGGAATAGTTTGAGGGGAAAATGGGACAGCATGAGGAGAGTCTATTCTGGTTGTGAGCCAAGAAGTAGCGATTACATGGATTCTGTAGATTG
GGAAGCAGTTCGTTCTGCAGAACCTACCACGATTGCAGATGCCATCAAGGAACGTGGCCAGCATAACATCATAGCAGGAAGAATCAAGGAATTTCTTCATCGAATAGCTA
GAATACATGGTCGCATTGACCTTGAATGGCTTAGAAAGGCTCCTCCAAATGATGTTAAGTGGCAGTCACCGTCTCTCCTCCTTTCCTTCCTCTCGTCTTCTTTCACCGTC
TCATCTTCCGGTAAGACACGGCGCCGGCCAGAAGGTCCGAATTCTGGCCGGAGAAGATGGGATGGGACAGTTACTGTCGCGGTTGCCGAACTCCCTTCTTCTCTCTCTCC
AAATTCCTTTGTTGACGGCCATGGGATAATTTATTTATGA
Protein sequenceShow/hide protein sequence
MKQEIDWNSLRGKWDSMRRVYSGCEPRSSDYMDSVDWEAVRSAEPTTIADAIKERGQHNIIAGRIKEFLHRIARIHGRIDLEWLRKAPPNDVKWQSPSLLLSFLSSSFTV
SSSGKTRRRPEGPNSGRRRWDGTVTVAVAELPSSLSPNSFVDGHGIIYL