CuGenDBv2

HG10014416 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10014416
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Protein of unknown function, DUF538
Genome location: Chr02:11108276..11109903
RNA-Seq Expression: HG10014416
Synteny: HG10014416
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0008233 - peptidase activity (molecular function)
InterPro domains:
IPR007493 - Protein of unknown function DUF538
IPR036758 - At5g01610-like superfamily


Homology
GenBank top hits (columns: e value, % identity, alignment)
XP_004151974.1 uncharacterized protein LOC101218420 [Cucumis sativus] | e value: 3.0e-71 | % identity: 85.47
Query:  MAALFGSWNLRTRISILIILLVGLCCSCFCSNSGEDSPSIHSLLRSMGLPAGLVPKQVKSYT-HENGRLEVYLDAPCMAKYENRVIFDTVFSANLSYGSL
        M  LFG    R +ISILIILL+GLC +C      EDS SIHSLLRSMG PAGLVPKQ KSYT  ENGRLEVYLDAPCMAKYENRVIFDTVFSANLSYGSL
Subjt:  MAALFGSWNLRTRISILIILLVGLCCSCFCSNSGEDSPSIHSLLRSMGLPAGLVPKQVKSYT-HENGRLEVYLDAPCMAKYENRVIFDTVFSANLSYGSL

Query:  IGVEGMSQEELFLWLPVKDIIVNYPTSGVILIDIGVAHKQLSLSLFEDPPDCKPQVTLRNPLRKQRCFESLR
        IGVEGMSQEELFLWLPVKDIIVNYPTSGVILIDIGVAHKQLSLSLFEDPPDC PQVTLRNPLR+QR FESLR
Subjt:  IGVEGMSQEELFLWLPVKDIIVNYPTSGVILIDIGVAHKQLSLSLFEDPPDCKPQVTLRNPLRKQRCFESLR

XP_022937405.1 uncharacterized protein LOC111443705 [Cucurbita moschata] | e value: 1.4e-73 | % identity: 85.47
Query:  MAALFGSWNLRTRISILIILLVGLCCSCFCSNSGEDSPSIHSLLRSMGLPAGLVPKQVKSYT-HENGRLEVYLDAPCMAKYENRVIFDTVFSANLSYGSL
        MA LFGSWN R RISI IIL VG C SC CSNS +DS SIHSLLRSMGLPAGLVPKQVKSYT  EN RLEVYLD PCMAKYENRVIF++VFSANLSYGSL
Subjt:  MAALFGSWNLRTRISILIILLVGLCCSCFCSNSGEDSPSIHSLLRSMGLPAGLVPKQVKSYT-HENGRLEVYLDAPCMAKYENRVIFDTVFSANLSYGSL

Query:  IGVEGMSQEELFLWLPVKDIIVNYPTSGVILIDIGVAHKQLSLSLFEDPPDCKPQVTLRNPLRKQRCFESLR
        IGV+GMSQEELFLWLPVKDIIVN P+SGVILIDIGVAHKQLSLSLFEDPPDC PQ  LRNPLRK+R FESLR
Subjt:  IGVEGMSQEELFLWLPVKDIIVNYPTSGVILIDIGVAHKQLSLSLFEDPPDCKPQVTLRNPLRKQRCFESLR

XP_022976596.1 uncharacterized protein LOC111476944 [Cucurbita maxima] | e value: 6.4e-74 | % identity: 86.05
Query:  MAALFGSWNLRTRISILIILLVGLCCSCFCSNSGEDSPSIHSLLRSMGLPAGLVPKQVKSYT-HENGRLEVYLDAPCMAKYENRVIFDTVFSANLSYGSL
        MA LFGSWN R RISI IIL VG C SC CSNS +DS SIHSLLRSMGLPAGLVPKQVKSYT  EN RLEVYLD PCMAKYENRVIF++VFSANLSYGSL
Subjt:  MAALFGSWNLRTRISILIILLVGLCCSCFCSNSGEDSPSIHSLLRSMGLPAGLVPKQVKSYT-HENGRLEVYLDAPCMAKYENRVIFDTVFSANLSYGSL

Query:  IGVEGMSQEELFLWLPVKDIIVNYPTSGVILIDIGVAHKQLSLSLFEDPPDCKPQVTLRNPLRKQRCFESLR
        IGVEGMSQEELFLWLPVKDIIVN P+SGVILIDIGVAHKQLSLSLFEDPPDC PQ  LRNPLRK+R FESLR
Subjt:  IGVEGMSQEELFLWLPVKDIIVNYPTSGVILIDIGVAHKQLSLSLFEDPPDCKPQVTLRNPLRKQRCFESLR

XP_023535139.1 uncharacterized protein LOC111796654 [Cucurbita pepo subsp. pepo] | e value: 3.2e-73 | % identity: 85.47
Query:  MAALFGSWNLRTRISILIILLVGLCCSCFCSNSGEDSPSIHSLLRSMGLPAGLVPKQVKSYT-HENGRLEVYLDAPCMAKYENRVIFDTVFSANLSYGSL
        MA LFGSWN R RISI IIL VG C SC CSNS +DS SIHSLLRSMGLPAGLVPKQVKSYT  EN RLEVYLD PCMAKYENRVIF++VFSANLSYGSL
Subjt:  MAALFGSWNLRTRISILIILLVGLCCSCFCSNSGEDSPSIHSLLRSMGLPAGLVPKQVKSYT-HENGRLEVYLDAPCMAKYENRVIFDTVFSANLSYGSL

Query:  IGVEGMSQEELFLWLPVKDIIVNYPTSGVILIDIGVAHKQLSLSLFEDPPDCKPQVTLRNPLRKQRCFESLR
        IGVEGMSQEELFLWLPVKDIIVN P+SGVILIDIGVAHKQLSLSLFEDPPDC PQ  L NPLRK+R FESLR
Subjt:  IGVEGMSQEELFLWLPVKDIIVNYPTSGVILIDIGVAHKQLSLSLFEDPPDCKPQVTLRNPLRKQRCFESLR

XP_038900271.1 uncharacterized protein LOC120087352 [Benincasa hispida] | e value: 4.3e-78 | % identity: 90.12
Query:  MAALFGSWNLRTRISILIILLVGLCCSCFCSNSGEDSPSIHSLLRSMGLPAGLVPKQVKSYT-HENGRLEVYLDAPCMAKYENRVIFDTVFSANLSYGSL
        MAALFGSWNLR  ISI+ ILL+GLC +C C   GEDS SIHSLLRSMGLPAGLVPKQ KSYT  ENGRLEVYLDAPCMAKYENRVIFDTVFSANLSYGSL
Subjt:  MAALFGSWNLRTRISILIILLVGLCCSCFCSNSGEDSPSIHSLLRSMGLPAGLVPKQVKSYT-HENGRLEVYLDAPCMAKYENRVIFDTVFSANLSYGSL

Query:  IGVEGMSQEELFLWLPVKDIIVNYPTSGVILIDIGVAHKQLSLSLFEDPPDCKPQVTLRNPLRKQRCFESLR
        IGVEGMSQEELFLWLPVKDIIVNYPTSGVILIDIGVAHKQLSLSLFEDPPDC PQVTLRNPLRKQR FESLR
Subjt:  IGVEGMSQEELFLWLPVKDIIVNYPTSGVILIDIGVAHKQLSLSLFEDPPDCKPQVTLRNPLRKQRCFESLR
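The % identity values reported for these hits can be reproduced from the alignment rows, assuming the usual BLAST convention that the middle row shows a letter for each identical position, `+` for a conservative substitution, and a space otherwise. The sketch below is only an illustration under that assumption, not the database's own code; the function name `percent_identity` is hypothetical. It uses the Benincasa hispida hit (XP_038900271.1) as input.

```python
def percent_identity(blocks):
    """Percent identity from BLAST-style alignment blocks.

    blocks: list of (query_row, middle_row) pairs, one per alignment block.
    Assumption: letters in the middle row mark identical positions;
    '+' and spaces are non-identical columns.
    """
    # Alignment length = number of query columns, gap characters included.
    columns = sum(len(query) for query, _ in blocks)
    # Identities = letters in the middle row ('+' and ' ' are not letters).
    identical = sum(ch.isalpha() for _, middle in blocks for ch in middle)
    return round(100.0 * identical / columns, 2)


# Query and middle rows copied from the XP_038900271.1 alignment.
BENINCASA_BLOCKS = [
    (
        "MAALFGSWNLRTRISILIILLVGLCCSCFCSNSGEDSPSIHSLLRSMGLPAGLVPKQVKSYT-HENGRLEVYLDAPCMAKYENRVIFDTVFSANLSYGSL",
        "MAALFGSWNLR  ISI+ ILL+GLC +C C   GEDS SIHSLLRSMGLPAGLVPKQ KSYT  ENGRLEVYLDAPCMAKYENRVIFDTVFSANLSYGSL",
    ),
    (
        "IGVEGMSQEELFLWLPVKDIIVNYPTSGVILIDIGVAHKQLSLSLFEDPPDCKPQVTLRNPLRKQRCFESLR",
        "IGVEGMSQEELFLWLPVKDIIVNYPTSGVILIDIGVAHKQLSLSLFEDPPDC PQVTLRNPLRKQR FESLR",
    ),
]

if __name__ == "__main__":
    print(percent_identity(BENINCASA_BLOCKS))
```

Under this convention the hit has 155 identical positions over 172 alignment columns, which matches the 90.12 listed above.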

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0L7K3 Uncharacterized protein | e value: 1.4e-71 | % identity: 85.47
Query:  MAALFGSWNLRTRISILIILLVGLCCSCFCSNSGEDSPSIHSLLRSMGLPAGLVPKQVKSYT-HENGRLEVYLDAPCMAKYENRVIFDTVFSANLSYGSL
        M  LFG    R +ISILIILL+GLC +C      EDS SIHSLLRSMG PAGLVPKQ KSYT  ENGRLEVYLDAPCMAKYENRVIFDTVFSANLSYGSL
Subjt:  MAALFGSWNLRTRISILIILLVGLCCSCFCSNSGEDSPSIHSLLRSMGLPAGLVPKQVKSYT-HENGRLEVYLDAPCMAKYENRVIFDTVFSANLSYGSL

Query:  IGVEGMSQEELFLWLPVKDIIVNYPTSGVILIDIGVAHKQLSLSLFEDPPDCKPQVTLRNPLRKQRCFESLR
        IGVEGMSQEELFLWLPVKDIIVNYPTSGVILIDIGVAHKQLSLSLFEDPPDC PQVTLRNPLR+QR FESLR
Subjt:  IGVEGMSQEELFLWLPVKDIIVNYPTSGVILIDIGVAHKQLSLSLFEDPPDCKPQVTLRNPLRKQRCFESLR

A0A1S3BZT2 uncharacterized protein LOC103494976 | e value: 4.2e-71 | % identity: 84.3
Query:  MAALFGSWNLRTRISILIILLVGLCCSCFCSNSGEDSPSIHSLLRSMGLPAGLVPKQVKSYT-HENGRLEVYLDAPCMAKYENRVIFDTVFSANLSYGSL
        M  LFG    R +ISILIILL+GLC +C      ++S SIHSLLRSMGLPAGLVPKQ KSYT  +NGRLEVYLDAPCMAKYENRVIFDTVFSANLSYGSL
Subjt:  MAALFGSWNLRTRISILIILLVGLCCSCFCSNSGEDSPSIHSLLRSMGLPAGLVPKQVKSYT-HENGRLEVYLDAPCMAKYENRVIFDTVFSANLSYGSL

Query:  IGVEGMSQEELFLWLPVKDIIVNYPTSGVILIDIGVAHKQLSLSLFEDPPDCKPQVTLRNPLRKQRCFESLR
        IGVEGMSQEELFLWLPVKDIIVNYPTSGVILIDIGVAHKQLSLSLFEDPPDC PQVTLRNPLR+QR FESLR
Subjt:  IGVEGMSQEELFLWLPVKDIIVNYPTSGVILIDIGVAHKQLSLSLFEDPPDCKPQVTLRNPLRKQRCFESLR

A0A6J1FGJ2 uncharacterized protein LOC111443705 | e value: 7.0e-74 | % identity: 85.47
Query:  MAALFGSWNLRTRISILIILLVGLCCSCFCSNSGEDSPSIHSLLRSMGLPAGLVPKQVKSYT-HENGRLEVYLDAPCMAKYENRVIFDTVFSANLSYGSL
        MA LFGSWN R RISI IIL VG C SC CSNS +DS SIHSLLRSMGLPAGLVPKQVKSYT  EN RLEVYLD PCMAKYENRVIF++VFSANLSYGSL
Subjt:  MAALFGSWNLRTRISILIILLVGLCCSCFCSNSGEDSPSIHSLLRSMGLPAGLVPKQVKSYT-HENGRLEVYLDAPCMAKYENRVIFDTVFSANLSYGSL

Query:  IGVEGMSQEELFLWLPVKDIIVNYPTSGVILIDIGVAHKQLSLSLFEDPPDCKPQVTLRNPLRKQRCFESLR
        IGV+GMSQEELFLWLPVKDIIVN P+SGVILIDIGVAHKQLSLSLFEDPPDC PQ  LRNPLRK+R FESLR
Subjt:  IGVEGMSQEELFLWLPVKDIIVNYPTSGVILIDIGVAHKQLSLSLFEDPPDCKPQVTLRNPLRKQRCFESLR

A0A6J1FNJ8 uncharacterized protein LOC111445879 | e value: 2.4e-66 | % identity: 81.29
Query:  ALFGSWNLRTRISILIILLVGLCCSCFCSNSGEDSPSIHSLLRSMGLPAGLVPKQVKSYT-HENGRLEVYLDAPCMAKYENRVIFDTVFSANLSYGSLIG
        ALF S NL   IS  +IL +G  CSCFCSNS +DS SIH LLRSMGLPAGLVPKQVKSYT  ENGRLEV+LD PCMAKYENRVIF++VFSANLSYGSLIG
Subjt:  ALFGSWNLRTRISILIILLVGLCCSCFCSNSGEDSPSIHSLLRSMGLPAGLVPKQVKSYT-HENGRLEVYLDAPCMAKYENRVIFDTVFSANLSYGSLIG

Query:  VEGMSQEELFLWLPVKDIIVNYPTSGVILIDIGVAHKQLSLSLFEDPPDCK-PQVTLRNPLRKQRCFESLR
        V+GMSQEELFLWLPVKDIIVNYPTSGV+LIDIGVAHKQLSLSLFEDPPDC  PQ   RN LR QR FESLR
Subjt:  VEGMSQEELFLWLPVKDIIVNYPTSGVILIDIGVAHKQLSLSLFEDPPDCK-PQVTLRNPLRKQRCFESLR

A0A6J1IHB3 uncharacterized protein LOC111476944 | e value: 3.1e-74 | % identity: 86.05
Query:  MAALFGSWNLRTRISILIILLVGLCCSCFCSNSGEDSPSIHSLLRSMGLPAGLVPKQVKSYT-HENGRLEVYLDAPCMAKYENRVIFDTVFSANLSYGSL
        MA LFGSWN R RISI IIL VG C SC CSNS +DS SIHSLLRSMGLPAGLVPKQVKSYT  EN RLEVYLD PCMAKYENRVIF++VFSANLSYGSL
Subjt:  MAALFGSWNLRTRISILIILLVGLCCSCFCSNSGEDSPSIHSLLRSMGLPAGLVPKQVKSYT-HENGRLEVYLDAPCMAKYENRVIFDTVFSANLSYGSL

Query:  IGVEGMSQEELFLWLPVKDIIVNYPTSGVILIDIGVAHKQLSLSLFEDPPDCKPQVTLRNPLRKQRCFESLR
        IGVEGMSQEELFLWLPVKDIIVN P+SGVILIDIGVAHKQLSLSLFEDPPDC PQ  LRNPLRK+R FESLR
Subjt:  IGVEGMSQEELFLWLPVKDIIVNYPTSGVILIDIGVAHKQLSLSLFEDPPDCKPQVTLRNPLRKQRCFESLR

SwissProt top hits
No hits found
Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G61667.1 Protein of unknown function, DUF538 | e value: 5.5e-39 | % identity: 53.95
Query:  ILIILLVGLCCSCFCSNSGEDSPSIHSLLRSMGLPAGLVPKQVKSYT--HENGRLEVYLDAPCMAKYENRVIFDTVFSANLSYGSLIGVEGMSQEELFLW
        +L++LLV      F + S +   SI +LL + GLP GL P  V+SY+   + G LEV L  PC A++ENRV FD V  ANLSYG L+G+EG++QEELFLW
Subjt:  ILIILLVGLCCSCFCSNSGEDSPSIHSLLRSMGLPAGLVPKQVKSYT--HENGRLEVYLDAPCMAKYENRVIFDTVFSANLSYGSLIGVEGMSQEELFLW

Query:  LPVKDIIVNYPTSGVILIDIGVAHKQLSLSLFEDPPDCKPQVTLRNPLRKQR
        LPVK I VN P+SG++L DIGVAHKQ+S SLFEDPP C P  ++   L K +
Subjt:  LPVKDIIVNYPTSGVILIDIGVAHKQLSLSLFEDPPDCKPQVTLRNPLRKQR

AT3G07460.1 Protein of unknown function, DUF538 | e value: 2.4e-26 | % identity: 43.97
Query:  RISILIILLVGLCCSCFCSNSGEDSPSIHSLLRSMGLPAGLVPKQVKSYT--HENGRLEVYLDAPCMAKYENRVIFDTVFSANLSYGSLIGVEGMSQEEL
        RI  + +L   L      S    ++ SI  +L + GLP GL PK VK +T   E GR  VYL+  C AKYE  + +D + S  + Y  +  + G+S +EL
Subjt:  RISILIILLVGLCCSCFCSNSGEDSPSIHSLLRSMGLPAGLVPKQVKSYT--HENGRLEVYLDAPCMAKYENRVIFDTVFSANLSYGSLIGVEGMSQEEL

Query:  FLWLPVKDIIVNYPTSGVILIDIGVAHKQLSLSLFEDPPDC
        FLWL VK I V+ P+SG+I  D+GV  KQ SLSLFE P DC
Subjt:  FLWLPVKDIIVNYPTSGVILIDIGVAHKQLSLSLFEDPPDC

AT3G07460.2 Protein of unknown function, DUF538 | e value: 2.4e-26 | % identity: 43.97
Query:  RISILIILLVGLCCSCFCSNSGEDSPSIHSLLRSMGLPAGLVPKQVKSYT--HENGRLEVYLDAPCMAKYENRVIFDTVFSANLSYGSLIGVEGMSQEEL
        RI  + +L   L      S    ++ SI  +L + GLP GL PK VK +T   E GR  VYL+  C AKYE  + +D + S  + Y  +  + G+S +EL
Subjt:  RISILIILLVGLCCSCFCSNSGEDSPSIHSLLRSMGLPAGLVPKQVKSYT--HENGRLEVYLDAPCMAKYENRVIFDTVFSANLSYGSLIGVEGMSQEEL

Query:  FLWLPVKDIIVNYPTSGVILIDIGVAHKQLSLSLFEDPPDC
        FLWL VK I V+ P+SG+I  D+GV  KQ SLSLFE P DC
Subjt:  FLWLPVKDIIVNYPTSGVILIDIGVAHKQLSLSLFEDPPDC

AT3G07470.1 Protein of unknown function, DUF538 | e value: 5.3e-26 | % identity: 40.56
Query:  RISILIILLVGLCCSCFCSNSGEDSPSIHSLLRSMGLPAGLVPKQVKSYTH--ENGRLEVYLDAPCMAKYENRVIFDTVFSANLSYGSLIGVEGMSQEEL
        RI  +  L + L      S +  ++ +I+ +L + GLP+G+ PK V+ +T   E GR  VYL+  C AKYE  + +D   +  +    +  + G+S +EL
Subjt:  RISILIILLVGLCCSCFCSNSGEDSPSIHSLLRSMGLPAGLVPKQVKSYTH--ENGRLEVYLDAPCMAKYENRVIFDTVFSANLSYGSLIGVEGMSQEEL

Query:  FLWLPVKDIIVNYPTSGVILIDIGVAHKQLSLSLFEDPPDCKP
        FLW PVK I V+ P+SG+I  D+GV  KQ SLSLFE P DC P
Subjt:  FLWLPVKDIIVNYPTSGVILIDIGVAHKQLSLSLFEDPPDCKP

AT5G54530.1 Protein of unknown function, DUF538 | e value: 4.6e-46 | % identity: 59.87
Query:  ISILIILLVGLCCSCFCSNSGEDSPSIHSLLRSMGLPAGLVPKQVKSY-THENGRLEVYLDAPCMAKYENRVIFDTVFSANLSYGSLIGVEGMSQEELFL
        IS +I+LL  L  S   S S    P++H +LRS GLPAGL+P++V SY  H +GRLEV+L APC AK+E  V F+ V   NLSYGSL+GVEG+SQ+ELFL
Subjt:  ISILIILLVGLCCSCFCSNSGEDSPSIHSLLRSMGLPAGLVPKQVKSY-THENGRLEVYLDAPCMAKYENRVIFDTVFSANLSYGSLIGVEGMSQEELFL

Query:  WLPVKDIIVNYPTSGVILIDIGVAHKQLSLSLFEDPPDCKPQVTLRNPLRKQRCFES
        WL VKDI+V  P SGVI+ DIGVA KQLSLSLFEDPP CKP   L+  +R+ R FE+
Subjt:  WLPVKDIIVNYPTSGVILIDIGVAHKQLSLSLFEDPPDCKPQVTLRNPLRKQRCFES


Sequences
CDS sequence
ATGGCCGCCCTGTTTGGGAGCTGGAATTTGAGGACTCGGATCTCAATACTCATCATCTTGTTGGTGGGTTTGTGTTGTTCTTGTTTCTGCAGCAATTCTGGCGAAGATTC
TCCTTCGATTCATTCGCTTTTGCGGTCGATGGGTCTTCCAGCAGGGCTGGTGCCGAAGCAAGTGAAATCTTACACACATGAAAATGGTCGGTTGGAAGTGTATTTGGATG
CTCCATGTATGGCGAAATATGAGAACAGAGTGATTTTCGACACTGTTTTTAGTGCTAATCTTAGCTATGGCAGCTTGATTGGAGTGGAGGGTATGTCTCAAGAGGAGCTT
TTTCTATGGCTCCCTGTTAAAGATATCATTGTTAATTACCCTACTTCTGGTGTCATTCTTATTGACATTGGTGTTGCTCATAAACAACTCTCTTTGTCTCTCTTTGAAGA
TCCTCCTGATTGTAAACCTCAAGTTACATTGAGGAATCCTCTGAGGAAGCAAAGATGCTTTGAATCTCTAAGATAA
mRNA sequence
ATGGCCGCCCTGTTTGGGAGCTGGAATTTGAGGACTCGGATCTCAATACTCATCATCTTGTTGGTGGGTTTGTGTTGTTCTTGTTTCTGCAGCAATTCTGGCGAAGATTC
TCCTTCGATTCATTCGCTTTTGCGGTCGATGGGTCTTCCAGCAGGGCTGGTGCCGAAGCAAGTGAAATCTTACACACATGAAAATGGTCGGTTGGAAGTGTATTTGGATG
CTCCATGTATGGCGAAATATGAGAACAGAGTGATTTTCGACACTGTTTTTAGTGCTAATCTTAGCTATGGCAGCTTGATTGGAGTGGAGGGTATGTCTCAAGAGGAGCTT
TTTCTATGGCTCCCTGTTAAAGATATCATTGTTAATTACCCTACTTCTGGTGTCATTCTTATTGACATTGGTGTTGCTCATAAACAACTCTCTTTGTCTCTCTTTGAAGA
TCCTCCTGATTGTAAACCTCAAGTTACATTGAGGAATCCTCTGAGGAAGCAAAGATGCTTTGAATCTCTAAGATAA
Protein sequence
MAALFGSWNLRTRISILIILLVGLCCSCFCSNSGEDSPSIHSLLRSMGLPAGLVPKQVKSYTHENGRLEVYLDAPCMAKYENRVIFDTVFSANLSYGSLIGVEGMSQEEL
FLWLPVKDIIVNYPTSGVILIDIGVAHKQLSLSLFEDPPDCKPQVTLRNPLRKQRCFESLR
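The protein sequence corresponds to the CDS read codon by codon. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1), the snippet below translates the first and last in-frame codons of the CDS listed above; `translate` and `CODON_TABLE` are illustrative names and not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in
# T, C, A, G order at each position: TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Codons with ambiguous or unexpected characters become 'X'.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 nt of the CDS: the start of the protein sequence.
    print(translate("ATGGCCGCCCTGTTTGGGAGCTGGAATTTG"))
    # Last 21 nt of the CDS, ending in the TAA stop codon.
    print(translate("TGCTTTGAATCTCTAAGATAA"))
```

The two fragments give `MAALFGSWNL` and `CFESLR`, which are the first ten and last six residues of the protein sequence above.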