; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10015058 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10015058
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProtein of unknown function (DUF810)
Genome locationChr02:23483839..23492457
RNA-Seq ExpressionHG10015058
SyntenyHG10015058
Gene Ontology termsGO:0000413 - protein peptidyl-prolyl isomerization (biological process)
GO:0006457 - protein folding (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0003755 - peptidyl-prolyl cis-trans isomerase activity (molecular function)
InterPro domainsIPR008528 - Protein unc-13 homologue
IPR014770 - Munc13 homology 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004142381.1 protein unc-13 homolog isoform X1 [Cucumis sativus]4.2e-7097.24Show/hide
Query:  KVLHSVEKSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRDAQATIISASLLHKLYGYKLKPFLDGVEHLTEDVVSVFPAANSLEEYILTLITSACEE
        +VLHSVEKSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRD QATI+SASLLHKLYGYKLKPFLDG+EHLTEDVVSVFPAANSLEEYILTLITSACEE
Subjt:  KVLHSVEKSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRDAQATIISASLLHKLYGYKLKPFLDGVEHLTEDVVSVFPAANSLEEYILTLITSACEE

Query:  MGAEIHIRKLALYQIESISGTLVLRWVNSQLGRILGWVERAIQQE
        MGAEIHIRKLALYQIESISGTLVLRWVNSQLGRILGWVERAIQQE
Subjt:  MGAEIHIRKLALYQIESISGTLVLRWVNSQLGRILGWVERAIQQE

XP_008447022.1 PREDICTED: uncharacterized protein LOC103489571 [Cucumis melo]9.3e-7096.55Show/hide
Query:  KVLHSVEKSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRDAQATIISASLLHKLYGYKLKPFLDGVEHLTEDVVSVFPAANSLEEYILTLITSACEE
        +VLHSVEKSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRD QATI+SASLLHKLYGYKLKPFLDG+EHLTEDVVSVFPAANSLEEYILTLITSACEE
Subjt:  KVLHSVEKSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRDAQATIISASLLHKLYGYKLKPFLDGVEHLTEDVVSVFPAANSLEEYILTLITSACEE

Query:  MGAEIHIRKLALYQIESISGTLVLRWVNSQLGRILGWVERAIQQE
        MGAEIHIRKL+LYQIESISGTLVLRWVNSQLGRILGWVERAIQQE
Subjt:  MGAEIHIRKLALYQIESISGTLVLRWVNSQLGRILGWVERAIQQE

XP_022139020.1 uncharacterized protein LOC111010053 [Momordica charantia]8.7e-6893.79Show/hide
Query:  KVLHSVEKSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRDAQATIISASLLHKLYGYKLKPFLDGVEHLTEDVVSVFPAANSLEEYILTLITSACEE
        ++LHS+EKSETNHEH LALLAEETKKLLKRDSSLFIPILSQRDAQATI+SASLLHKLYGY+LKPFLDGVEHLTEDVVSVFPAANSLEEYIL LITSACEE
Subjt:  KVLHSVEKSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRDAQATIISASLLHKLYGYKLKPFLDGVEHLTEDVVSVFPAANSLEEYILTLITSACEE

Query:  MGAEIHIRKLALYQIESISGTLVLRWVNSQLGRILGWVERAIQQE
         GAEIHIRKLALYQIESISGTLVLRWVNSQLGRI+GWVERAIQQE
Subjt:  MGAEIHIRKLALYQIESISGTLVLRWVNSQLGRILGWVERAIQQE

XP_031742236.1 protein unc-13 homolog isoform X2 [Cucumis sativus]4.2e-7097.24Show/hide
Query:  KVLHSVEKSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRDAQATIISASLLHKLYGYKLKPFLDGVEHLTEDVVSVFPAANSLEEYILTLITSACEE
        +VLHSVEKSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRD QATI+SASLLHKLYGYKLKPFLDG+EHLTEDVVSVFPAANSLEEYILTLITSACEE
Subjt:  KVLHSVEKSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRDAQATIISASLLHKLYGYKLKPFLDGVEHLTEDVVSVFPAANSLEEYILTLITSACEE

Query:  MGAEIHIRKLALYQIESISGTLVLRWVNSQLGRILGWVERAIQQE
        MGAEIHIRKLALYQIESISGTLVLRWVNSQLGRILGWVERAIQQE
Subjt:  MGAEIHIRKLALYQIESISGTLVLRWVNSQLGRILGWVERAIQQE

XP_031742237.1 protein unc-13 homolog isoform X3 [Cucumis sativus]4.2e-7097.24Show/hide
Query:  KVLHSVEKSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRDAQATIISASLLHKLYGYKLKPFLDGVEHLTEDVVSVFPAANSLEEYILTLITSACEE
        +VLHSVEKSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRD QATI+SASLLHKLYGYKLKPFLDG+EHLTEDVVSVFPAANSLEEYILTLITSACEE
Subjt:  KVLHSVEKSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRDAQATIISASLLHKLYGYKLKPFLDGVEHLTEDVVSVFPAANSLEEYILTLITSACEE

Query:  MGAEIHIRKLALYQIESISGTLVLRWVNSQLGRILGWVERAIQQE
        MGAEIHIRKLALYQIESISGTLVLRWVNSQLGRILGWVERAIQQE
Subjt:  MGAEIHIRKLALYQIESISGTLVLRWVNSQLGRILGWVERAIQQE

TrEMBL top hitse value%identityAlignment
A0A0A0KRS3 Uncharacterized protein3.5e-7097.92Show/hide
Query:  VLHSVEKSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRDAQATIISASLLHKLYGYKLKPFLDGVEHLTEDVVSVFPAANSLEEYILTLITSACEEM
        VLHSVEKSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRD QATI+SASLLHKLYGYKLKPFLDG+EHLTEDVVSVFPAANSLEEYILTLITSACEEM
Subjt:  VLHSVEKSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRDAQATIISASLLHKLYGYKLKPFLDGVEHLTEDVVSVFPAANSLEEYILTLITSACEEM

Query:  GAEIHIRKLALYQIESISGTLVLRWVNSQLGRILGWVERAIQQE
        GAEIHIRKLALYQIESISGTLVLRWVNSQLGRILGWVERAIQQE
Subjt:  GAEIHIRKLALYQIESISGTLVLRWVNSQLGRILGWVERAIQQE

A0A1S3BH15 uncharacterized protein LOC1034895714.5e-7096.55Show/hide
Query:  KVLHSVEKSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRDAQATIISASLLHKLYGYKLKPFLDGVEHLTEDVVSVFPAANSLEEYILTLITSACEE
        +VLHSVEKSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRD QATI+SASLLHKLYGYKLKPFLDG+EHLTEDVVSVFPAANSLEEYILTLITSACEE
Subjt:  KVLHSVEKSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRDAQATIISASLLHKLYGYKLKPFLDGVEHLTEDVVSVFPAANSLEEYILTLITSACEE

Query:  MGAEIHIRKLALYQIESISGTLVLRWVNSQLGRILGWVERAIQQE
        MGAEIHIRKL+LYQIESISGTLVLRWVNSQLGRILGWVERAIQQE
Subjt:  MGAEIHIRKLALYQIESISGTLVLRWVNSQLGRILGWVERAIQQE

A0A6J1CB47 uncharacterized protein LOC1110100534.2e-6893.79Show/hide
Query:  KVLHSVEKSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRDAQATIISASLLHKLYGYKLKPFLDGVEHLTEDVVSVFPAANSLEEYILTLITSACEE
        ++LHS+EKSETNHEH LALLAEETKKLLKRDSSLFIPILSQRDAQATI+SASLLHKLYGY+LKPFLDGVEHLTEDVVSVFPAANSLEEYIL LITSACEE
Subjt:  KVLHSVEKSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRDAQATIISASLLHKLYGYKLKPFLDGVEHLTEDVVSVFPAANSLEEYILTLITSACEE

Query:  MGAEIHIRKLALYQIESISGTLVLRWVNSQLGRILGWVERAIQQE
         GAEIHIRKLALYQIESISGTLVLRWVNSQLGRI+GWVERAIQQE
Subjt:  MGAEIHIRKLALYQIESISGTLVLRWVNSQLGRILGWVERAIQQE

A0A6J1HTL3 uncharacterized protein LOC111466644 isoform X28.0e-6792.41Show/hide
Query:  KVLHSVEKSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRDAQATIISASLLHKLYGYKLKPFLDGVEHLTEDVVSVFPAANSLEEYILTLITSACEE
        +VLHSVEKS+ NHEH LALLAEETKKLLKRDSSLFIPILSQRDAQA+I+SASLLHKLYG++LKPFLDGVEHLTEDVVSVFPAANSLEEYILTLITSACEE
Subjt:  KVLHSVEKSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRDAQATIISASLLHKLYGYKLKPFLDGVEHLTEDVVSVFPAANSLEEYILTLITSACEE

Query:  MGAEIHIRKLALYQIESISGTLVLRWVNSQLGRILGWVERAIQQE
        +GA+IHIRKLALYQIESISGTLVLRWVNSQLGRILGWVERA QQE
Subjt:  MGAEIHIRKLALYQIESISGTLVLRWVNSQLGRILGWVERAIQQE

A0A6J1HVX9 uncharacterized protein LOC111466644 isoform X18.0e-6792.41Show/hide
Query:  KVLHSVEKSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRDAQATIISASLLHKLYGYKLKPFLDGVEHLTEDVVSVFPAANSLEEYILTLITSACEE
        +VLHSVEKS+ NHEH LALLAEETKKLLKRDSSLFIPILSQRDAQA+I+SASLLHKLYG++LKPFLDGVEHLTEDVVSVFPAANSLEEYILTLITSACEE
Subjt:  KVLHSVEKSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRDAQATIISASLLHKLYGYKLKPFLDGVEHLTEDVVSVFPAANSLEEYILTLITSACEE

Query:  MGAEIHIRKLALYQIESISGTLVLRWVNSQLGRILGWVERAIQQE
        +GA+IHIRKLALYQIESISGTLVLRWVNSQLGRILGWVERA QQE
Subjt:  MGAEIHIRKLALYQIESISGTLVLRWVNSQLGRILGWVERAIQQE

SwissProt top hitse value%identityAlignment
Q8RX56 Protein unc-13 homolog2.1e-4867.38Show/hide
Query:  SVEKSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRDAQATIISASLLHKLYGYKLKPFLDGVEHLTEDVVSVFPAANSLEEYILTLITSACEEMGAE
        ++++S+ N+EH LALLAEETKKL+K+DS++F+PILSQR  QA   SASL+HKLYG KLKPFLDG EHLTED VSVFPAA+SLE+Y+L L+TS C E  + 
Subjt:  SVEKSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRDAQATIISASLLHKLYGYKLKPFLDGVEHLTEDVVSVFPAANSLEEYILTLITSACEEMGAE

Query:  IHIRKLALYQIESISGTLVLRWVNSQLGRILGWVERAIQQE
         + +KL  Y++ES+SGTLVLRW+NSQLGRIL WVERA +QE
Subjt:  IHIRKLALYQIESISGTLVLRWVNSQLGRILGWVERAIQQE

Arabidopsis top hitse value%identityAlignment
AT2G20010.1 Protein of unknown function (DUF810)1.4e-1531.61Show/hide
Query:  RRRERKVLHSVE---KSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRDAQATIISASLLHKLYGYKLKPFLDGVEHLTEDVVSVFPAANSLEEYILT
        ++ +R V HS +   +  TN+  +LA+LAE+   L   + ++F PIL      A  ++A+ LH  YG +LK F+ G+  LT D + V  AA+ LE+ ++ 
Subjt:  RRRERKVLHSVE---KSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRDAQATIISASLLHKLYGYKLKPFLDGVEHLTEDVVSVFPAANSLEEYILT

Query:  LIT--SACEEMGAEIHIRKLALYQIESISGTLVLRWVNSQLGRILGWVERAIQQE
        +    +   E G +  IR++  ++ E + G LV  W+  ++ R+  W++R +QQE
Subjt:  LIT--SACEEMGAEIHIRKLALYQIESISGTLVLRWVNSQLGRILGWVERAIQQE

AT2G20010.2 Protein of unknown function (DUF810)1.4e-1531.61Show/hide
Query:  RRRERKVLHSVE---KSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRDAQATIISASLLHKLYGYKLKPFLDGVEHLTEDVVSVFPAANSLEEYILT
        ++ +R V HS +   +  TN+  +LA+LAE+   L   + ++F PIL      A  ++A+ LH  YG +LK F+ G+  LT D + V  AA+ LE+ ++ 
Subjt:  RRRERKVLHSVE---KSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRDAQATIISASLLHKLYGYKLKPFLDGVEHLTEDVVSVFPAANSLEEYILT

Query:  LIT--SACEEMGAEIHIRKLALYQIESISGTLVLRWVNSQLGRILGWVERAIQQE
        +    +   E G +  IR++  ++ E + G LV  W+  ++ R+  W++R +QQE
Subjt:  LIT--SACEEMGAEIHIRKLALYQIESISGTLVLRWVNSQLGRILGWVERAIQQE

AT2G25800.1 Protein of unknown function (DUF810)5.0e-1329.3Show/hide
Query:  KAGTRRRRERKVLHSVEKSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRDAQATIISASLLHKLYGYKLKPFLDGVEHLTEDVVSVFPAANSLEEYI
        KA + RR  R        ++ N    LA+LA++  +L  ++  +F PIL +    A  ++ + LH  YG ++K F+ G+  LT D V +  AA+ LE+ +
Subjt:  KAGTRRRRERKVLHSVEKSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRDAQATIISASLLHKLYGYKLKPFLDGVEHLTEDVVSVFPAANSLEEYI

Query:  LTLIT--SACEEMGAEIHIRKLALYQIESISGTLVLRWVNSQLGRILGWVERAIQQE
        + +    S   + G +  IR++  ++ E++   LV  W+ +++ R+  WV+R +QQE
Subjt:  LTLIT--SACEEMGAEIHIRKLALYQIESISGTLVLRWVNSQLGRILGWVERAIQQE

AT4G11670.1 Protein of unknown function (DUF810)2.6e-1737.68Show/hide
Query:  KSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRDAQATIISASLLHKLYGYKLKPFLDGVEHLTEDVVSVFPAANSLEEYILTLITSACEEMGAEIHI
        KS     H+LALLA E   + K + + F+P+ S+   +  +ISA LLH+ YG +L PFL+GV  L+ DV  V PAA  L+E +  L     +    + + 
Subjt:  KSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRDAQATIISASLLHKLYGYKLKPFLDGVEHLTEDVVSVFPAANSLEEYILTLITSACEEMGAEIHI

Query:  RKLALYQIESISGTLVLRWVNSQLGRILGWVERAIQQE
         KL  Y+IE     ++L W+ SQ   IL W  RA + E
Subjt:  RKLALYQIESISGTLVLRWVNSQLGRILGWVERAIQQE

AT5G06970.1 Protein of unknown function (DUF810)1.5e-4967.38Show/hide
Query:  SVEKSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRDAQATIISASLLHKLYGYKLKPFLDGVEHLTEDVVSVFPAANSLEEYILTLITSACEEMGAE
        ++++S+ N+EH LALLAEETKKL+K+DS++F+PILSQR  QA   SASL+HKLYG KLKPFLDG EHLTED VSVFPAA+SLE+Y+L L+TS C E  + 
Subjt:  SVEKSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRDAQATIISASLLHKLYGYKLKPFLDGVEHLTEDVVSVFPAANSLEEYILTLITSACEEMGAE

Query:  IHIRKLALYQIESISGTLVLRWVNSQLGRILGWVERAIQQE
         + +KL  Y++ES+SGTLVLRW+NSQLGRIL WVERA +QE
Subjt:  IHIRKLALYQIESISGTLVLRWVNSQLGRILGWVERAIQQE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGAGAACAGATAAAGAGCAGATTGAATTCTATATATCATCATCTCTTAAGAATGCATTCTCCAGGATCCTTCATACTCAAGCTATGATAGATCCACTGACGCTGGA
GAAGAAGTCCCCGCCTGCATCAGACGAGCAAATTAGCCAGACTATCCCAGAGGGTAGTGTGAACACTACCAAACAAGCCAAAGAGACACATTTAATGAGGAGAGAGAAGA
ACAATGGGAAAGATGGAGCAACAAAACATTCTGGGTTAGCTAAAAATACTACAATGGAAATAAAAAGGACCCTTAAGATGGGTCGAGCAAACTTTTTTCACACAAAAAAG
GAGGAAGATAAATGGAAGAGCAGGAGAGGAAGATTAACAGGGAAGGCTGGTACAAGGAGGAGAAGAGAGAGAAAGGTTTTGCATTCTGTGGAGAAATCTGAAACAAATCA
TGAACATTCTCTAGCCTTGCTTGCTGAAGAAACAAAAAAACTTCTAAAGAGGGATTCATCTCTTTTCATACCAATCTTATCTCAAAGGGACGCTCAAGCAACCATCATTT
CAGCATCACTTCTCCATAAACTTTATGGTTACAAACTGAAGCCCTTCCTTGATGGAGTTGAACATCTGACCGAGGATGTTGTCTCTGTGTTTCCGGCAGCCAATAGTCTT
GAGGAGTATATATTGACCCTTATCACATCCGCATGTGAAGAGATGGGTGCTGAGATTCACATCAGAAAACTTGCTCTATACCAGATTGAATCTATATCTGGAACCCTGGT
GCTGCGGTGGGTCAACTCGCAACTTGGACGGATCTTAGGATGGGTGGAAAGGGCTATTCAACAAGAGGCAAGTACATAA
mRNA sequenceShow/hide mRNA sequence
ATGTCGAGAACAGATAAAGAGCAGATTGAATTCTATATATCATCATCTCTTAAGAATGCATTCTCCAGGATCCTTCATACTCAAGCTATGATAGATCCACTGACGCTGGA
GAAGAAGTCCCCGCCTGCATCAGACGAGCAAATTAGCCAGACTATCCCAGAGGGTAGTGTGAACACTACCAAACAAGCCAAAGAGACACATTTAATGAGGAGAGAGAAGA
ACAATGGGAAAGATGGAGCAACAAAACATTCTGGGTTAGCTAAAAATACTACAATGGAAATAAAAAGGACCCTTAAGATGGGTCGAGCAAACTTTTTTCACACAAAAAAG
GAGGAAGATAAATGGAAGAGCAGGAGAGGAAGATTAACAGGGAAGGCTGGTACAAGGAGGAGAAGAGAGAGAAAGGTTTTGCATTCTGTGGAGAAATCTGAAACAAATCA
TGAACATTCTCTAGCCTTGCTTGCTGAAGAAACAAAAAAACTTCTAAAGAGGGATTCATCTCTTTTCATACCAATCTTATCTCAAAGGGACGCTCAAGCAACCATCATTT
CAGCATCACTTCTCCATAAACTTTATGGTTACAAACTGAAGCCCTTCCTTGATGGAGTTGAACATCTGACCGAGGATGTTGTCTCTGTGTTTCCGGCAGCCAATAGTCTT
GAGGAGTATATATTGACCCTTATCACATCCGCATGTGAAGAGATGGGTGCTGAGATTCACATCAGAAAACTTGCTCTATACCAGATTGAATCTATATCTGGAACCCTGGT
GCTGCGGTGGGTCAACTCGCAACTTGGACGGATCTTAGGATGGGTGGAAAGGGCTATTCAACAAGAGGCAAGTACATAA
Protein sequenceShow/hide protein sequence
MSRTDKEQIEFYISSSLKNAFSRILHTQAMIDPLTLEKKSPPASDEQISQTIPEGSVNTTKQAKETHLMRREKNNGKDGATKHSGLAKNTTMEIKRTLKMGRANFFHTKK
EEDKWKSRRGRLTGKAGTRRRRERKVLHSVEKSETNHEHSLALLAEETKKLLKRDSSLFIPILSQRDAQATIISASLLHKLYGYKLKPFLDGVEHLTEDVVSVFPAANSL
EEYILTLITSACEEMGAEIHIRKLALYQIESISGTLVLRWVNSQLGRILGWVERAIQQEAST