; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10003652 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10003652
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionCCHC-type domain-containing protein
Genome locationChr08:4904169..4906059
RNA-Seq ExpressionHG10003652
SyntenyHG10003652
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU41525.1 hypothetical protein TSUD_140560 [Trifolium subterraneum]2.3e-3136.46Show/hide
Query:  AIRNAFINAWKMNRGFSVENIGKNLFLFKFCKQVDRLWVLKSGPWLFDKFLMVMEEPSIEKKPSRMGFHKVAFWVRFLDLPFGFQNRMMAKLLGDTIGDF
        A +     AW++     V+++ +NLFLF+F  + D   VL++GPW FD+ L+++   S E++PS +  H V FWVR  DLPF  ++  MAK LG+ +G+F
Subjt:  AIRNAFINAWKMNRGFSVENIGKNLFLFKFCKQVDRLWVLKSGPWLFDKFLMVMEEPSIEKKPSRMGFHKVAFWVRFLDLPFGFQNRMMAKLLGDTIGDF

Query:  VDVDCDQGIKIQRF--------------------SHNETRWIDIRYERLPEFCYACGYIGHAVKECV----VPLPSLAEDRKKPYQYGPWLR
         +VD     +  RF                      ++  W+D +YERLP FC+ACG IGH +KEC     V   + ++  +K   YGPWLR
Subjt:  VDVDCDQGIKIQRF--------------------SHNETRWIDIRYERLPEFCYACGYIGHAVKECV----VPLPSLAEDRKKPYQYGPWLR

XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia]5.1e-3134.8Show/hide
Query:  MESEGLAEALKNFELTSEEDSGPVEIVPGA--------------------------IRNAFINAWKMN-RGFSVENIGKNLFLFKFCKQVDRLWVLKSGP
        M +  L E  KNF+LTSEED   V+I   A                          ++N    AWK++ + FSV+ IG N+FLF F +  DR  +L+ GP
Subjt:  MESEGLAEALKNFELTSEEDSGPVEIVPGA--------------------------IRNAFINAWKMN-RGFSVENIGKNLFLFKFCKQVDRLWVLKSGP

Query:  WLFDKFLMVMEEPSIEKKPSRMGFHKVAFWVRFLDLPFGFQNRMMAKLLGDTIGDFVDVDCD-----------------------QGIKIQRFSHNETRW
        W FD+ L++++ P    KP  M F  V+ WV F DL     N+ MA  LG+ IG F DV+ +                       +GIK+         W
Subjt:  WLFDKFLMVMEEPSIEKKPSRMGFHKVAFWVRFLDLPFGFQNRMMAKLLGDTIGDFVDVDCD-----------------------QGIKIQRFSHNETRW

Query:  IDIRYERLPEFCYACGYIGHAVKECVVPLPSLAEDRKKPYQYGPWLRYQG
        I I+YERLP+F Y CG + H +K+C        +   K  QYGPWLR+QG
Subjt:  IDIRYERLPEFCYACGYIGHAVKECVVPLPSLAEDRKKPYQYGPWLRYQG

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia]4.6e-4036.8Show/hide
Query:  MESEGLAEALKNFELTSEEDSGPVEIVPGAIRNA--------------------------FINAWKMNRGFSVENIGKNLFLFKFCKQVDRLWVLKSGPW
        M+ E L    + F+LTSEED   +++   A++ A                           + AWK+    +VE+IGKNLFLF FC++ D   V+K+GPW
Subjt:  MESEGLAEALKNFELTSEEDSGPVEIVPGAIRNA--------------------------FINAWKMNRGFSVENIGKNLFLFKFCKQVDRLWVLKSGPW

Query:  LFDKFLMVMEEPSIEKKPSRMGFHKVAFWVRFLDLPFGFQNRMMAKLLGDTIGDFVDVDCDQ-----------------------GIKIQRFSHNETRWI
         FDK L+V+++P   K  S + F++VAFW+   DLP  + N+ MA  LG+ IG+FVDVDC++                       GIKI         WI
Subjt:  LFDKFLMVMEEPSIEKKPSRMGFHKVAFWVRFLDLPFGFQNRMMAKLLGDTIGDFVDVDCDQ-----------------------GIKIQRFSHNETRWI

Query:  DIRYERLPEFCYACGYIGHAVKECVVPLPSLAEDRKKPYQYGPWLRYQGS
         I+YERLP+FCY CG IGH+  +C     +  +D +   +YGPWLR+ GS
Subjt:  DIRYERLPEFCYACGYIGHAVKECVVPLPSLAEDRKKPYQYGPWLRYQGS

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]2.7e-3232.43Show/hide
Query:  MESEGLAEALKNFELTSEEDSGPVEIVPGA--------------------------IRNAFINAWKM-NRGFSVENIGKNLFLFKFCKQVDRLWVLKSGP
        M +  L E  KNF+LTSEE+   +++   A                          ++N    AWK+ N  F V+++G NLFLF F + +DR  + KSGP
Subjt:  MESEGLAEALKNFELTSEEDSGPVEIVPGA--------------------------IRNAFINAWKM-NRGFSVENIGKNLFLFKFCKQVDRLWVLKSGP

Query:  WLFDKFLMVMEEPSIEKKPSRMGFHKVAFWVRFLDLPFGFQNRMMAKLLGDTIGDFVDVDCD-----------------------QGIKIQRFSHNETRW
        W FD+ L+++ +P     PS + F K+  WVRF DLP G   R MA  LG+ +G F + DCD                       +GIK+         W
Subjt:  WLFDKFLMVMEEPSIEKKPSRMGFHKVAFWVRFLDLPFGFQNRMMAKLLGDTIGDFVDVDCD-----------------------QGIKIQRFSHNETRW

Query:  IDIRYERLPEFCYACGYIGHAVKECVVPLPSLAEDRKKPYQYGPWLRYQGSGGLDQSLPLKKTDVEN----EGFVSITPQSSGILKGDEFLGRSPS
        I I+YERLP+FCY CG               L+  RKK +QYG WLRYQG+  +  ++P  K   E+     G  S +  +S +  G + +  +P+
Subjt:  IDIRYERLPEFCYACGYIGHAVKECVVPLPSLAEDRKKPYQYGPWLRYQGSGGLDQSLPLKKTDVEN----EGFVSITPQSSGILKGDEFLGRSPS

XP_024642377.1 uncharacterized protein LOC112422874 [Medicago truncatula]1.7e-2931.89Show/hide
Query:  AIRNAFINAWKMNRGFSVENIGKNLFLFKFCKQVDRLWVLKSGPWLFDKFLMVMEEPSIEKKPSRMGFHKVAFWVRFLDLPFGFQNRMMAKLLGDTIGDF
        A +   I AW++     ++++ KNLFLFKF  + D   +L SGPW FD+ L++++  S E++PS +  H V FWVR  DLP   ++ +MA+ LGD +G F
Subjt:  AIRNAFINAWKMNRGFSVENIGKNLFLFKFCKQVDRLWVLKSGPWLFDKFLMVMEEPSIEKKPSRMGFHKVAFWVRFLDLPFGFQNRMMAKLLGDTIGDF

Query:  VDVD----------------------CDQGIKIQRFSHNETRWIDIRYERLPEFCYACGYIGHAVKECVVPLPSLAED----RKKPYQYGPWLRYQGSGG
        ++VD                        +GI ++      + WI  +YERLP FCY CG IGH +KEC        E      ++   YG W+R      
Subjt:  VDVD----------------------CDQGIKIQRFSHNETRWIDIRYERLPEFCYACGYIGHAVKECVVPLPSLAED----RKKPYQYGPWLRYQGSGG

Query:  LDQSLPLKKTDVE----------NEGFVSITPQSSGILKGDEFLGRSPSVESVE
           + PL K  VE          ++   S T  S G   G+E   +  +V   E
Subjt:  LDQSLPLKKTDVE----------NEGFVSITPQSSGILKGDEFLGRSPSVESVE

TrEMBL top hitse value%identityAlignment
A0A2K3PII7 Cysteine desulfurase mitochondrial-like2.6e-2832.26Show/hide
Query:  AIRNAFINAWKMNRGFSVENIGKNLFLFKFCKQVDRLWVLKSGPWLFDKFLMVMEEPSIEKKPSRMGFHKVAFWVRFLDLPFGFQNRMMAKLLGDTIGDF
        A +   I AW++     VE++ KNLFLF+F  + D   VLK+GPW FD+ L+++   S E++P+ +  +KVAFWVR  +LP   ++  MAK LG+ IG F
Subjt:  AIRNAFINAWKMNRGFSVENIGKNLFLFKFCKQVDRLWVLKSGPWLFDKFLMVMEEPSIEKKPSRMGFHKVAFWVRFLDLPFGFQNRMMAKLLGDTIGDF

Query:  VDVDCDQGIKIQRFSHNETRWIDIR---------------------YERLPEFCYACGYIGHAVKECVVPLPSLAE------DRKKPYQYGPWLRYQGSG
         + D  +  +  RF   +   ID+R                     YERLP FC+ CG IGH +KEC      L E      + +K + +GPWLR     
Subjt:  VDVDCDQGIKIQRFSHNETRWIDIR---------------------YERLPEFCYACGYIGHAVKECVVPLPSLAE------DRKKPYQYGPWLRYQGSG

Query:  GLDQSLPLKKTDVENEGFVSITPQSSGILKGDEFLGRSPSVESVEGGTSARKLKVWKRKLKLETADNSKADLLVGGSRI
            + PL K   E          SSG      F   S S     G   + +++V ++K KL   + SK  +++    I
Subjt:  GLDQSLPLKKTDVENEGFVSITPQSSGILKGDEFLGRSPSVESVEGGTSARKLKVWKRKLKLETADNSKADLLVGGSRI

A0A2Z6NZV1 Uncharacterized protein1.1e-3136.46Show/hide
Query:  AIRNAFINAWKMNRGFSVENIGKNLFLFKFCKQVDRLWVLKSGPWLFDKFLMVMEEPSIEKKPSRMGFHKVAFWVRFLDLPFGFQNRMMAKLLGDTIGDF
        A +     AW++     V+++ +NLFLF+F  + D   VL++GPW FD+ L+++   S E++PS +  H V FWVR  DLPF  ++  MAK LG+ +G+F
Subjt:  AIRNAFINAWKMNRGFSVENIGKNLFLFKFCKQVDRLWVLKSGPWLFDKFLMVMEEPSIEKKPSRMGFHKVAFWVRFLDLPFGFQNRMMAKLLGDTIGDF

Query:  VDVDCDQGIKIQRF--------------------SHNETRWIDIRYERLPEFCYACGYIGHAVKECV----VPLPSLAEDRKKPYQYGPWLR
         +VD     +  RF                      ++  W+D +YERLP FC+ACG IGH +KEC     V   + ++  +K   YGPWLR
Subjt:  VDVDCDQGIKIQRF--------------------SHNETRWIDIRYERLPEFCYACGYIGHAVKECV----VPLPSLAEDRKKPYQYGPWLR

A0A6J1BSZ1 uncharacterized protein LOC1110054812.5e-3134.8Show/hide
Query:  MESEGLAEALKNFELTSEEDSGPVEIVPGA--------------------------IRNAFINAWKMN-RGFSVENIGKNLFLFKFCKQVDRLWVLKSGP
        M +  L E  KNF+LTSEED   V+I   A                          ++N    AWK++ + FSV+ IG N+FLF F +  DR  +L+ GP
Subjt:  MESEGLAEALKNFELTSEEDSGPVEIVPGA--------------------------IRNAFINAWKMN-RGFSVENIGKNLFLFKFCKQVDRLWVLKSGP

Query:  WLFDKFLMVMEEPSIEKKPSRMGFHKVAFWVRFLDLPFGFQNRMMAKLLGDTIGDFVDVDCD-----------------------QGIKIQRFSHNETRW
        W FD+ L++++ P    KP  M F  V+ WV F DL     N+ MA  LG+ IG F DV+ +                       +GIK+         W
Subjt:  WLFDKFLMVMEEPSIEKKPSRMGFHKVAFWVRFLDLPFGFQNRMMAKLLGDTIGDFVDVDCD-----------------------QGIKIQRFSHNETRW

Query:  IDIRYERLPEFCYACGYIGHAVKECVVPLPSLAEDRKKPYQYGPWLRYQG
        I I+YERLP+F Y CG + H +K+C        +   K  QYGPWLR+QG
Subjt:  IDIRYERLPEFCYACGYIGHAVKECVVPLPSLAEDRKKPYQYGPWLRYQG

A0A6J1DU55 uncharacterized protein LOC1110231352.2e-4036.8Show/hide
Query:  MESEGLAEALKNFELTSEEDSGPVEIVPGAIRNA--------------------------FINAWKMNRGFSVENIGKNLFLFKFCKQVDRLWVLKSGPW
        M+ E L    + F+LTSEED   +++   A++ A                           + AWK+    +VE+IGKNLFLF FC++ D   V+K+GPW
Subjt:  MESEGLAEALKNFELTSEEDSGPVEIVPGAIRNA--------------------------FINAWKMNRGFSVENIGKNLFLFKFCKQVDRLWVLKSGPW

Query:  LFDKFLMVMEEPSIEKKPSRMGFHKVAFWVRFLDLPFGFQNRMMAKLLGDTIGDFVDVDCDQ-----------------------GIKIQRFSHNETRWI
         FDK L+V+++P   K  S + F++VAFW+   DLP  + N+ MA  LG+ IG+FVDVDC++                       GIKI         WI
Subjt:  LFDKFLMVMEEPSIEKKPSRMGFHKVAFWVRFLDLPFGFQNRMMAKLLGDTIGDFVDVDCDQ-----------------------GIKIQRFSHNETRWI

Query:  DIRYERLPEFCYACGYIGHAVKECVVPLPSLAEDRKKPYQYGPWLRYQGS
         I+YERLP+FCY CG IGH+  +C     +  +D +   +YGPWLR+ GS
Subjt:  DIRYERLPEFCYACGYIGHAVKECVVPLPSLAEDRKKPYQYGPWLRYQGS

A0A6J1DX30 uncharacterized protein LOC1110248741.3e-3232.43Show/hide
Query:  MESEGLAEALKNFELTSEEDSGPVEIVPGA--------------------------IRNAFINAWKM-NRGFSVENIGKNLFLFKFCKQVDRLWVLKSGP
        M +  L E  KNF+LTSEE+   +++   A                          ++N    AWK+ N  F V+++G NLFLF F + +DR  + KSGP
Subjt:  MESEGLAEALKNFELTSEEDSGPVEIVPGA--------------------------IRNAFINAWKM-NRGFSVENIGKNLFLFKFCKQVDRLWVLKSGP

Query:  WLFDKFLMVMEEPSIEKKPSRMGFHKVAFWVRFLDLPFGFQNRMMAKLLGDTIGDFVDVDCD-----------------------QGIKIQRFSHNETRW
        W FD+ L+++ +P     PS + F K+  WVRF DLP G   R MA  LG+ +G F + DCD                       +GIK+         W
Subjt:  WLFDKFLMVMEEPSIEKKPSRMGFHKVAFWVRFLDLPFGFQNRMMAKLLGDTIGDFVDVDCD-----------------------QGIKIQRFSHNETRW

Query:  IDIRYERLPEFCYACGYIGHAVKECVVPLPSLAEDRKKPYQYGPWLRYQGSGGLDQSLPLKKTDVEN----EGFVSITPQSSGILKGDEFLGRSPS
        I I+YERLP+FCY CG               L+  RKK +QYG WLRYQG+  +  ++P  K   E+     G  S +  +S +  G + +  +P+
Subjt:  IDIRYERLPEFCYACGYIGHAVKECVVPLPSLAEDRKKPYQYGPWLRYQGSGGLDQSLPLKKTDVEN----EGFVSITPQSSGILKGDEFLGRSPS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTCTGAAGGTCTTGCTGAAGCCTTGAAGAATTTTGAGTTGACTTCGGAGGAAGATTCGGGTCCAGTGGAAATTGTGCCTGGGGCTATTCGGAATGCTTTTATTAA
CGCCTGGAAGATGAATCGTGGCTTCAGCGTCGAAAATATTGGCAAGAACTTATTCCTCTTCAAATTCTGTAAACAGGTGGATAGGTTATGGGTCTTAAAGTCTGGACCGT
GGCTTTTTGACAAGTTTCTTATGGTGATGGAAGAACCCAGCATTGAGAAAAAACCTTCAAGAATGGGTTTTCATAAAGTGGCGTTCTGGGTTAGATTCTTGGATCTCCCT
TTTGGATTCCAAAATAGGATGATGGCAAAATTACTAGGTGATACAATTGGTGATTTTGTCGATGTAGATTGTGATCAAGGTATTAAAATTCAACGCTTCAGCCATAATGA
AACAAGGTGGATTGATATTCGATACGAACGATTACCTGAATTTTGTTATGCGTGTGGCTATATTGGGCACGCAGTTAAGGAGTGTGTTGTTCCTCTCCCTTCTTTGGCTG
AGGATCGTAAGAAGCCTTATCAGTATGGGCCTTGGCTGAGATACCAAGGAAGTGGTGGCCTTGACCAATCGTTACCTTTGAAGAAAACTGATGTGGAGAATGAGGGGTTT
GTTTCTATTACTCCTCAAAGCTCTGGGATTCTTAAAGGCGATGAGTTCTTGGGTAGAAGTCCGTCGGTTGAGAGTGTTGAGGGTGGCACCTCAGCCAGAAAGTTAAAAGT
CTGGAAAAGAAAATTGAAGCTGGAAACAGCTGATAATTCTAAGGCTGATTTGTTGGTTGGGGGATCTAGGATTCTCTGGTTCCCCCTCTACATGCCATGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGTCTGAAGGTCTTGCTGAAGCCTTGAAGAATTTTGAGTTGACTTCGGAGGAAGATTCGGGTCCAGTGGAAATTGTGCCTGGGGCTATTCGGAATGCTTTTATTAA
CGCCTGGAAGATGAATCGTGGCTTCAGCGTCGAAAATATTGGCAAGAACTTATTCCTCTTCAAATTCTGTAAACAGGTGGATAGGTTATGGGTCTTAAAGTCTGGACCGT
GGCTTTTTGACAAGTTTCTTATGGTGATGGAAGAACCCAGCATTGAGAAAAAACCTTCAAGAATGGGTTTTCATAAAGTGGCGTTCTGGGTTAGATTCTTGGATCTCCCT
TTTGGATTCCAAAATAGGATGATGGCAAAATTACTAGGTGATACAATTGGTGATTTTGTCGATGTAGATTGTGATCAAGGTATTAAAATTCAACGCTTCAGCCATAATGA
AACAAGGTGGATTGATATTCGATACGAACGATTACCTGAATTTTGTTATGCGTGTGGCTATATTGGGCACGCAGTTAAGGAGTGTGTTGTTCCTCTCCCTTCTTTGGCTG
AGGATCGTAAGAAGCCTTATCAGTATGGGCCTTGGCTGAGATACCAAGGAAGTGGTGGCCTTGACCAATCGTTACCTTTGAAGAAAACTGATGTGGAGAATGAGGGGTTT
GTTTCTATTACTCCTCAAAGCTCTGGGATTCTTAAAGGCGATGAGTTCTTGGGTAGAAGTCCGTCGGTTGAGAGTGTTGAGGGTGGCACCTCAGCCAGAAAGTTAAAAGT
CTGGAAAAGAAAATTGAAGCTGGAAACAGCTGATAATTCTAAGGCTGATTTGTTGGTTGGGGGATCTAGGATTCTCTGGTTCCCCCTCTACATGCCATGGTAA
Protein sequenceShow/hide protein sequence
MESEGLAEALKNFELTSEEDSGPVEIVPGAIRNAFINAWKMNRGFSVENIGKNLFLFKFCKQVDRLWVLKSGPWLFDKFLMVMEEPSIEKKPSRMGFHKVAFWVRFLDLP
FGFQNRMMAKLLGDTIGDFVDVDCDQGIKIQRFSHNETRWIDIRYERLPEFCYACGYIGHAVKECVVPLPSLAEDRKKPYQYGPWLRYQGSGGLDQSLPLKKTDVENEGF
VSITPQSSGILKGDEFLGRSPSVESVEGGTSARKLKVWKRKLKLETADNSKADLLVGGSRILWFPLYMPW