; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10004069 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10004069
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionEmbryogenesis-like protein
Genome locationChr08:13353509..13353988
RNA-Seq ExpressionHG10004069
SyntenyHG10004069
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6602205.1 hypothetical protein SDJN03_07438, partial [Cucurbita argyrosperma subsp. sororia]1.4e-5979.76Show/hide
Query:  MSSRLRLPLYRLLVGKCQASISSSSCSFQRILDFTVKPKSEFSNEWRADDLHFSYLMRSN---------RRYSGGSGTSEPEFDQVREVDRINLKFAEAR
        MS RLRL L RLL GK QAS SSSSCS +RILD ++ P SEF  + R DDLHFSY+MRSN         RRYSGGSG SEPEFDQ+REVDRINLKFAEAR
Subjt:  MSSRLRLPLYRLLVGKCQASISSSSCSFQRILDFTVKPKSEFSNEWRADDLHFSYLMRSN---------RRYSGGSGTSEPEFDQVREVDRINLKFAEAR

Query:  EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE
        EEIESAMESKETVYFDEEAECARDAVKEVL+MYEGLLAKL E+ERKALQRSMGLKIEQLKAEL+QLDE
Subjt:  EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE

KAG7032887.1 hypothetical protein SDJN02_06937, partial [Cucurbita argyrosperma subsp. argyrosperma]7.1e-5979.17Show/hide
Query:  MSSRLRLPLYRLLVGKCQASISSSSCSFQRILDFTVKPKSEFSNEWRADDLHFSYLMRSN---------RRYSGGSGTSEPEFDQVREVDRINLKFAEAR
        MS RLRL L RLL GK QAS SSSSCS +RILD ++ P SEF  + R DDLHF Y+MRSN         RRYSGGSG SEPEFDQ+REVDRINLKFAEAR
Subjt:  MSSRLRLPLYRLLVGKCQASISSSSCSFQRILDFTVKPKSEFSNEWRADDLHFSYLMRSN---------RRYSGGSGTSEPEFDQVREVDRINLKFAEAR

Query:  EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE
        EEIESAMESKETVYFDEEAECARDAVKEVL+MYEGLLAKL E+ERKALQRSMGLKIEQLKAEL+QLDE
Subjt:  EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE

XP_022964579.1 uncharacterized protein LOC111464557 [Cucurbita moschata]5.4e-5979.17Show/hide
Query:  MSSRLRLPLYRLLVGKCQASISSSSCSFQRILDFTVKPKSEFSNEWRADDLHFSYLMRSN---------RRYSGGSGTSEPEFDQVREVDRINLKFAEAR
        MS RLRL L RLL GK QAS SSSSCS +RILD ++ P SEF  + R DDLHFSY+MRSN         RRYSGGSG SEPEFDQ+REVDRINLKFAEAR
Subjt:  MSSRLRLPLYRLLVGKCQASISSSSCSFQRILDFTVKPKSEFSNEWRADDLHFSYLMRSN---------RRYSGGSGTSEPEFDQVREVDRINLKFAEAR

Query:  EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE
        EEIESAMESKETVYF+EEAECARDAVKEVL+MYEGLLAKL E+ERKALQRSMGLKIEQLKAEL+QLDE
Subjt:  EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE

XP_022990474.1 uncharacterized protein LOC111487326 [Cucurbita maxima]3.8e-6080.95Show/hide
Query:  MSSRLRLPLYRLLVGKCQASISSSSCSFQRILDFTVKPKSEFSNEWRADDLHFSYLMRSN---------RRYSGGSGTSEPEFDQVREVDRINLKFAEAR
        MS RLRL L RLL GK QAS SSSSCS +RILD  + P SEF  + R DDLHFSYLMRSN         RRYSGGSG SEPEFDQ+REVDRINLKFAEAR
Subjt:  MSSRLRLPLYRLLVGKCQASISSSSCSFQRILDFTVKPKSEFSNEWRADDLHFSYLMRSN---------RRYSGGSGTSEPEFDQVREVDRINLKFAEAR

Query:  EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE
        EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKL E+ERKALQRSMGLKIEQLKAEL+QLDE
Subjt:  EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE

XP_038885743.1 uncharacterized protein LOC120076030 [Benincasa hispida]3.6e-7193.71Show/hide
Query:  MSSRLRLPLYRLLVGKCQASISSSSCSFQRILDFTVKPKSEFSNEWRADDLHFSYLMRSNRRYSGGSGTSEPEFDQVREVDRINLKFAEAREEIESAMES
        M+ RLRLP YRLLVGK QASISSSSCSF+RI DFTVKPKSEF NE RADDLHFSYLM SNR YSGGSGTSEPEFDQVREVDRINLKFAEAREEIESAMES
Subjt:  MSSRLRLPLYRLLVGKCQASISSSSCSFQRILDFTVKPKSEFSNEWRADDLHFSYLMRSNRRYSGGSGTSEPEFDQVREVDRINLKFAEAREEIESAMES

Query:  KETVYFDEEAECARDAVKEVLEMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE
        KETVYFDEEAECARDAVKEVLEMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE
Subjt:  KETVYFDEEAECARDAVKEVLEMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE

TrEMBL top hitse value%identityAlignment
A0A1S3C573 uncharacterized protein LOC1034965822.7e-5676.79Show/hide
Query:  MSSRLRLPLYRLLVGKCQASISSSSCSFQRILDFTVKPKSEFSNEWRADDLHFSYLMRSN---------RRYSGGSGTSEPEFDQVREVDRINLKFAEAR
        MS R RLPLYR L+G  Q SISSSS  F+RILDF+ KP       WRADDLHFSYLM SN         R +S  SG+SEP+FDQVREVDRINLKFAEAR
Subjt:  MSSRLRLPLYRLLVGKCQASISSSSCSFQRILDFTVKPKSEFSNEWRADDLHFSYLMRSN---------RRYSGGSGTSEPEFDQVREVDRINLKFAEAR

Query:  EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE
        EEIESAMESKETVYFDEEAECARDAVKEVLE+YEGLLAKL +SERKALQRSMGLKIEQLKAEL QLDE
Subjt:  EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE

A0A5D3BXL9 Uncharacterized protein2.7e-5676.79Show/hide
Query:  MSSRLRLPLYRLLVGKCQASISSSSCSFQRILDFTVKPKSEFSNEWRADDLHFSYLMRSN---------RRYSGGSGTSEPEFDQVREVDRINLKFAEAR
        MS R RLPLYR L+G  Q SISSSS  F+RILDF+ KP       WRADDLHFSYLM SN         R +S  SG+SEP+FDQVREVDRINLKFAEAR
Subjt:  MSSRLRLPLYRLLVGKCQASISSSSCSFQRILDFTVKPKSEFSNEWRADDLHFSYLMRSN---------RRYSGGSGTSEPEFDQVREVDRINLKFAEAR

Query:  EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE
        EEIESAMESKETVYFDEEAECARDAVKEVLE+YEGLLAKL +SERKALQRSMGLKIEQLKAEL QLDE
Subjt:  EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE

A0A6J1BV77 uncharacterized protein LOC1110060461.6e-5677.98Show/hide
Query:  MSSRLRLPLYRLLVGKCQASISSSSCSFQRILDFTVKPKSEFSNEWRADDLHFSYLMRSN---------RRYSGGSGTSEPEFDQVREVDRINLKFAEAR
        M+ R RL L R L+GK QAS SSSS S +RI D  + PKSEF N WRA D HFS LMR N         RRYSG S  SEPEFDQVREVD INLKFAEAR
Subjt:  MSSRLRLPLYRLLVGKCQASISSSSCSFQRILDFTVKPKSEFSNEWRADDLHFSYLMRSN---------RRYSGGSGTSEPEFDQVREVDRINLKFAEAR

Query:  EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE
        EEIESAMESKETVYFDEEAECARDAVKEVLEM+EGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE
Subjt:  EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE

A0A6J1HL89 uncharacterized protein LOC1114645572.6e-5979.17Show/hide
Query:  MSSRLRLPLYRLLVGKCQASISSSSCSFQRILDFTVKPKSEFSNEWRADDLHFSYLMRSN---------RRYSGGSGTSEPEFDQVREVDRINLKFAEAR
        MS RLRL L RLL GK QAS SSSSCS +RILD ++ P SEF  + R DDLHFSY+MRSN         RRYSGGSG SEPEFDQ+REVDRINLKFAEAR
Subjt:  MSSRLRLPLYRLLVGKCQASISSSSCSFQRILDFTVKPKSEFSNEWRADDLHFSYLMRSN---------RRYSGGSGTSEPEFDQVREVDRINLKFAEAR

Query:  EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE
        EEIESAMESKETVYF+EEAECARDAVKEVL+MYEGLLAKL E+ERKALQRSMGLKIEQLKAEL+QLDE
Subjt:  EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE

A0A6J1JQ75 uncharacterized protein LOC1114873261.8e-6080.95Show/hide
Query:  MSSRLRLPLYRLLVGKCQASISSSSCSFQRILDFTVKPKSEFSNEWRADDLHFSYLMRSN---------RRYSGGSGTSEPEFDQVREVDRINLKFAEAR
        MS RLRL L RLL GK QAS SSSSCS +RILD  + P SEF  + R DDLHFSYLMRSN         RRYSGGSG SEPEFDQ+REVDRINLKFAEAR
Subjt:  MSSRLRLPLYRLLVGKCQASISSSSCSFQRILDFTVKPKSEFSNEWRADDLHFSYLMRSN---------RRYSGGSGTSEPEFDQVREVDRINLKFAEAR

Query:  EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE
        EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKL E+ERKALQRSMGLKIEQLKAEL+QLDE
Subjt:  EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE

SwissProt top hitse value%identityAlignment
Q9M9H3 Embryogenesis-like protein2.5e-3054.73Show/hide
Query:  LLVGKCQASISSSSCSFQRILDFTVKPKSEFSNEWRADDLHFSYLMRSNRRYSGGSGTSEPEFDQVREVDRINLKFAEAREEIESAMESKETVYFDEEAE
        LL+G     I  SS S       +    S  S+       + S+ + + RRYS GS +  P  D  + VD INLKFAEAREEIE AM++KETVYF+EEAE
Subjt:  LLVGKCQASISSSSCSFQRILDFTVKPKSEFSNEWRADDLHFSYLMRSNRRYSGGSGTSEPEFDQVREVDRINLKFAEAREEIESAMESKETVYFDEEAE

Query:  CARDAVKEVLEMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE
        CARDAV EVLEM++GLL K+ E E+ +LQRSMGLKIEQLKAEL+QL+E
Subjt:  CARDAVKEVLEMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE

Arabidopsis top hitse value%identityAlignment
AT1G71730.1 unknown protein1.8e-3154.73Show/hide
Query:  LLVGKCQASISSSSCSFQRILDFTVKPKSEFSNEWRADDLHFSYLMRSNRRYSGGSGTSEPEFDQVREVDRINLKFAEAREEIESAMESKETVYFDEEAE
        LL+G     I  SS S       +    S  S+       + S+ + + RRYS GS +  P  D  + VD INLKFAEAREEIE AM++KETVYF+EEAE
Subjt:  LLVGKCQASISSSSCSFQRILDFTVKPKSEFSNEWRADDLHFSYLMRSNRRYSGGSGTSEPEFDQVREVDRINLKFAEAREEIESAMESKETVYFDEEAE

Query:  CARDAVKEVLEMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE
        CARDAV EVLEM++GLL K+ E E+ +LQRSMGLKIEQLKAEL+QL+E
Subjt:  CARDAVKEVLEMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCTCTCGTTTACGCCTTCCACTCTACAGATTGCTTGTCGGAAAATGTCAGGCTTCAATTTCTTCATCTTCGTGTAGCTTTCAGCGAATTCTGGACTTCACT
GTGAAGCCAAAATCTGAGTTTTCTAACGAATGGCGGGCAGATGATCTCCATTTTTCGTATCTAATGAGGTCGAATCGGAGGTACAGTGGTGGTTCGGGGACATCG
GAGCCCGAATTCGATCAGGTTAGAGAGGTGGACAGGATCAATCTCAAGTTCGCCGAAGCTAGGGAGGAGATAGAGTCAGCTATGGAGTCTAAAGAGACCGTATAT
TTTGATGAAGAAGCCGAGTGTGCTCGGGATGCTGTGAAAGAAGTTTTAGAAATGTACGAGGGGCTTCTGGCGAAGTTGCCCGAGAGCGAAAGGAAGGCGTTGCAG
AGGTCAATGGGGCTTAAGATTGAGCAGCTGAAGGCTGAGCTTAAACAGCTTGACGAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGCTCTCGTTTACGCCTTCCACTCTACAGATTGCTTGTCGGAAAATGTCAGGCTTCAATTTCTTCATCTTCGTGTAGCTTTCAGCGAATTCTGGACTTCACT
GTGAAGCCAAAATCTGAGTTTTCTAACGAATGGCGGGCAGATGATCTCCATTTTTCGTATCTAATGAGGTCGAATCGGAGGTACAGTGGTGGTTCGGGGACATCG
GAGCCCGAATTCGATCAGGTTAGAGAGGTGGACAGGATCAATCTCAAGTTCGCCGAAGCTAGGGAGGAGATAGAGTCAGCTATGGAGTCTAAAGAGACCGTATAT
TTTGATGAAGAAGCCGAGTGTGCTCGGGATGCTGTGAAAGAAGTTTTAGAAATGTACGAGGGGCTTCTGGCGAAGTTGCCCGAGAGCGAAAGGAAGGCGTTGCAG
AGGTCAATGGGGCTTAAGATTGAGCAGCTGAAGGCTGAGCTTAAACAGCTTGACGAGTAA
Protein sequenceShow/hide protein sequence
MSSRLRLPLYRLLVGKCQASISSSSCSFQRILDFTVKPKSEFSNEWRADDLHFSYLMRSNRRYSGGSGTSEPEFDQVREVDRINLKFAEAREEIESAMESKETVY
FDEEAECARDAVKEVLEMYEGLLAKLPESERKALQRSMGLKIEQLKAELKQLDE