; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10023482 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10023482
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionUnknown protein
Genome locationChr05:34594782..34598276
RNA-Seq ExpressionHG10023482
SyntenyHG10023482
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575480.1 hypothetical protein SDJN03_26119, partial [Cucurbita argyrosperma subsp. sororia]3.4e-7387.91Show/hide
Query:  MSKEEPPKLYANKPKKAQVKQFQEQHKV-SDASSSSPAPPSSSMSSASSSPSPPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEEAG
        MS EEPPKLYANKPKKAQVKQFQEQHKV S +SSSSPAPP+SS ++++SS S PQPPKESFARRYKFLWPMLLTVNLAVGAYL MRTKKQDE V EEEA 
Subjt:  MSKEEPPKLYANKPKKAQVKQFQEQHKV-SDASSSSPAPPSSSMSSASSSPSPPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEEAG

Query:  PDSAKITKIAAPVVEESLARPTIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIPNV
        PDSAK  KIAAPVVEES A+P IVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIPN+
Subjt:  PDSAKITKIAAPVVEESLARPTIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIPNV

XP_004141680.1 uncharacterized protein LOC101218777 isoform X2 [Cucumis sativus]1.2e-7387.29Show/hide
Query:  MSKEEPPKLYANKPKKAQVKQFQEQHKVSDASSSSPAPPSSSMSSASSSPSPPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEEAGP
        MS+E  PKLYANKP KAQ+KQFQE+HK  DASSS+    SS+M+SASSSP PPQPPKESFARRYKFLWPMLLTVNLAVGAY+FMRTKKQDEHVAEEEA P
Subjt:  MSKEEPPKLYANKPKKAQVKQFQEQHKVSDASSSSPAPPSSSMSSASSSPSPPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEEAGP

Query:  DSAKITKIAAPVVEESLARPTIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIPNV
        DSAK TKIAAPVVEESLARP +VEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIP++
Subjt:  DSAKITKIAAPVVEESLARPTIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIPNV

XP_008462359.1 PREDICTED: uncharacterized protein LOC103500733 isoform X2 [Cucumis melo]2.0e-7388.46Show/hide
Query:  MSKEEPPKLYANKPKKAQVKQFQEQHKVSDASSSSPAPPSSSMSSA-SSSPSPPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEEAG
        MS+E  PKLYANKP KAQ+KQFQEQHK  DASSS+    SSSM+SA SSSP PPQPPKESFARRYKFLWPMLLTVNLAVGAY+FMRTKKQDEHVAEEEA 
Subjt:  MSKEEPPKLYANKPKKAQVKQFQEQHKVSDASSSSPAPPSSSMSSA-SSSPSPPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEEAG

Query:  PDSAKITKIAAPVVEESLARPTIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIPNV
        PDSAK TKIAAPVVEESLA+P IVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIPN+
Subjt:  PDSAKITKIAAPVVEESLARPTIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIPNV

XP_022954191.1 uncharacterized protein LOC111456527 isoform X1 [Cucurbita moschata]4.4e-7388.11Show/hide
Query:  MSKEEPPKLYANKPKKAQVKQFQEQHKV-SDASSSSPAPPSSS---MSSASSSPSPPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEE
        MS EEPPKLYANKPKKAQVKQFQEQHKV S +SSSSPAPP+SS    +S+SSS S PQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDE V EE
Subjt:  MSKEEPPKLYANKPKKAQVKQFQEQHKV-SDASSSSPAPPSSS---MSSASSSPSPPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEE

Query:  EAGPDSAKITKIAAPVVEESLARPTIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIPNV
        EA PDSAK  KIAAPVVEES A+P IVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIPN+
Subjt:  EAGPDSAKITKIAAPVVEESLARPTIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIPNV

XP_022992414.1 uncharacterized protein LOC111488728 isoform X1 [Cucurbita maxima]5.2e-7487.63Show/hide
Query:  MSKEEPPKLYANKPKKAQVKQFQEQHKVSDASSSSPAPPSSS-----MSSASSSPSPPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAE
        MS EEPPKLYANKPKKAQVKQFQEQHKV  ASSSSPAPP+SS      SS+SSS S PQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDE V E
Subjt:  MSKEEPPKLYANKPKKAQVKQFQEQHKVSDASSSSPAPPSSS-----MSSASSSPSPPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAE

Query:  EEAGPDSAKITKIAAPVVEESLARPTIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIPNV
        EEA PDSAK  KIAAPVVEES A+P IVEPVKVREPIPVDQQRELFKWILEEKRKIKPKD EEKKRIDEEKAILKEFIRAKSIPN+
Subjt:  EEAGPDSAKITKIAAPVVEESLARPTIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIPNV

TrEMBL top hitse value%identityAlignment
A0A0A0KAZ6 Uncharacterized protein5.6e-7487.29Show/hide
Query:  MSKEEPPKLYANKPKKAQVKQFQEQHKVSDASSSSPAPPSSSMSSASSSPSPPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEEAGP
        MS+E  PKLYANKP KAQ+KQFQE+HK  DASSS+    SS+M+SASSSP PPQPPKESFARRYKFLWPMLLTVNLAVGAY+FMRTKKQDEHVAEEEA P
Subjt:  MSKEEPPKLYANKPKKAQVKQFQEQHKVSDASSSSPAPPSSSMSSASSSPSPPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEEAGP

Query:  DSAKITKIAAPVVEESLARPTIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIPNV
        DSAK TKIAAPVVEESLARP +VEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIP++
Subjt:  DSAKITKIAAPVVEESLARPTIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIPNV

A0A1S3CGT4 uncharacterized protein LOC103500733 isoform X29.6e-7488.46Show/hide
Query:  MSKEEPPKLYANKPKKAQVKQFQEQHKVSDASSSSPAPPSSSMSSA-SSSPSPPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEEAG
        MS+E  PKLYANKP KAQ+KQFQEQHK  DASSS+    SSSM+SA SSSP PPQPPKESFARRYKFLWPMLLTVNLAVGAY+FMRTKKQDEHVAEEEA 
Subjt:  MSKEEPPKLYANKPKKAQVKQFQEQHKVSDASSSSPAPPSSSMSSA-SSSPSPPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEEAG

Query:  PDSAKITKIAAPVVEESLARPTIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIPNV
        PDSAK TKIAAPVVEESLA+P IVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIPN+
Subjt:  PDSAKITKIAAPVVEESLARPTIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIPNV

A0A6J1GQ86 uncharacterized protein LOC111456527 isoform X12.1e-7388.11Show/hide
Query:  MSKEEPPKLYANKPKKAQVKQFQEQHKV-SDASSSSPAPPSSS---MSSASSSPSPPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEE
        MS EEPPKLYANKPKKAQVKQFQEQHKV S +SSSSPAPP+SS    +S+SSS S PQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDE V EE
Subjt:  MSKEEPPKLYANKPKKAQVKQFQEQHKV-SDASSSSPAPPSSS---MSSASSSPSPPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEE

Query:  EAGPDSAKITKIAAPVVEESLARPTIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIPNV
        EA PDSAK  KIAAPVVEES A+P IVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIPN+
Subjt:  EAGPDSAKITKIAAPVVEESLARPTIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIPNV

A0A6J1JXH4 uncharacterized protein LOC111488728 isoform X12.5e-7487.63Show/hide
Query:  MSKEEPPKLYANKPKKAQVKQFQEQHKVSDASSSSPAPPSSS-----MSSASSSPSPPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAE
        MS EEPPKLYANKPKKAQVKQFQEQHKV  ASSSSPAPP+SS      SS+SSS S PQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDE V E
Subjt:  MSKEEPPKLYANKPKKAQVKQFQEQHKVSDASSSSPAPPSSS-----MSSASSSPSPPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAE

Query:  EEAGPDSAKITKIAAPVVEESLARPTIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIPNV
        EEA PDSAK  KIAAPVVEES A+P IVEPVKVREPIPVDQQRELFKWILEEKRKIKPKD EEKKRIDEEKAILKEFIRAKSIPN+
Subjt:  EEAGPDSAKITKIAAPVVEESLARPTIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIPNV

E5GCA6 Uncharacterized protein9.6e-7488.46Show/hide
Query:  MSKEEPPKLYANKPKKAQVKQFQEQHKVSDASSSSPAPPSSSMSSA-SSSPSPPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEEAG
        MS+E  PKLYANKP KAQ+KQFQEQHK  DASSS+    SSSM+SA SSSP PPQPPKESFARRYKFLWPMLLTVNLAVGAY+FMRTKKQDEHVAEEEA 
Subjt:  MSKEEPPKLYANKPKKAQVKQFQEQHKVSDASSSSPAPPSSSMSSA-SSSPSPPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEEAG

Query:  PDSAKITKIAAPVVEESLARPTIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIPNV
        PDSAK TKIAAPVVEESLA+P IVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIPN+
Subjt:  PDSAKITKIAAPVVEESLARPTIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIPNV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G55160.1 unknown protein1.5e-3956.91Show/hide
Query:  EEPPKLYANKPKK----AQVKQFQEQ-HKVSDASSSSPAPPSSSMSS---ASSSPSPPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQD-EHVA
        EE PKL+ NKPKK    AQ+K  +   +  +   SS P+P +++ +S      S  PP PPKESFARRYK++WP+LLTVNLAVG YLF RTKK+D + V 
Subjt:  EEPPKLYANKPKK----AQVKQFQEQ-HKVSDASSSSPAPPSSSMSS---ASSSPSPPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQD-EHVA

Query:  EEEAGPDSAKITKIAAPV-VEESLARPTIVEPV--KVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIP
        EE A    AK + +AAPV VE++L+   + EPV  K REPIP  QQRELFKW+LEEKRK+ PK+ EEKKR DEEKAILK+FI +K+IP
Subjt:  EEEAGPDSAKITKIAAPV-VEESLARPTIVEPV--KVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIP

AT1G55160.2 unknown protein4.2e-3765.93Show/hide
Query:  SPSPPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQD-EHVAEEEAGPDSAKITKIAAPV-VEESLARPTIVEPV--KVREPIPVDQQRELFKWI
        S  PP PPKESFARRYK++WP+LLTVNLAVG YLF RTKK+D + V EE A    AK + +AAPV VE++L+   + EPV  K REPIP  QQRELFKW+
Subjt:  SPSPPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQD-EHVAEEEAGPDSAKITKIAAPV-VEESLARPTIVEPV--KVREPIPVDQQRELFKWI

Query:  LEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIP
        LEEKRK+ PK+ EEKKR DEEKAILK+FI +K+IP
Subjt:  LEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIP

AT1G55160.3 unknown protein1.8e-3550.23Show/hide
Query:  EEPPKLYANKPKK----AQVKQFQEQ-HKVSDASSSSPAPPSSSMSS---ASSSPSPPQPPKESFARRYKFLWPMLLTVNLAVG----------------
        EE PKL+ NKPKK    AQ+K  +   +  +   SS P+P +++ +S      S  PP PPKESFARRYK++WP+LLTVNLAVG                
Subjt:  EEPPKLYANKPKK----AQVKQFQEQ-HKVSDASSSSPAPPSSSMSS---ASSSPSPPQPPKESFARRYKFLWPMLLTVNLAVG----------------

Query:  ---------AYLFMRTKKQD-EHVAEEEAGPDSAKITKIAAPV-VEESLARPTIVEPV--KVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEK
                 +YLF RTKK+D + V EE A    AK + +AAPV VE++L+   + EPV  K REPIP  QQRELFKW+LEEKRK+ PK+ EEKKR DEEK
Subjt:  ---------AYLFMRTKKQD-EHVAEEEAGPDSAKITKIAAPV-VEESLARPTIVEPV--KVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEK

Query:  AILKEFIRAKSIP
        AILK+FI +K+IP
Subjt:  AILKEFIRAKSIP

AT2G19530.1 unknown protein4.3e-1838.67Show/hide
Query:  SSPSPPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAE------------------------EEAG---PDSAKITKIAAPVVEE-----
        SSPS  +PP++  ++  K  W   +  NL   AY+F   +++D    E                        E+ G    D AK  + A P  EE     
Subjt:  SSPSPPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAE------------------------EEAG---PDSAKITKIAAPVVEE-----

Query:  --------------SLARPTIVEPVKV-REPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIPNV
                      S+ +    E VKV R+PIP D+Q+ELFKWILEEKRKI+PKDR+EKK+IDEEKAILK+FIRA+ IP +
Subjt:  --------------SLARPTIVEPVKV-REPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIPNV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCAAAGAAGAACCTCCCAAGCTCTACGCCAACAAACCCAAGAAAGCCCAGGTCAAACAATTTCAAGAACAGCACAAAGTCAGCGACGCTTCTTCTTCTTCACCGGC
GCCACCATCATCGAGCATGTCATCCGCGTCTTCTTCTCCTTCACCGCCGCAGCCTCCGAAGGAATCATTTGCAAGGCGATATAAGTTCTTATGGCCCATGCTTTTGACTG
TCAACCTTGCTGTTGGAGCTTATCTGTTTATGAGAACAAAAAAGCAAGATGAACATGTAGCTGAAGAAGAGGCTGGCCCGGATTCAGCCAAAATCACCAAGATTGCTGCT
CCTGTTGTTGAGGAATCATTGGCCAGACCAACCATTGTGGAGCCTGTGAAGGTAAGAGAACCAATTCCGGTGGACCAGCAGCGTGAACTGTTCAAGTGGATTTTGGAAGA
GAAGCGCAAGATAAAGCCAAAGGACCGTGAAGAGAAAAAACGCATTGATGAAGAGAAAGCAATTCTCAAAGAGTTCATCCGAGCAAAATCTATTCCTAATGTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGCAAAGAAGAACCTCCCAAGCTCTACGCCAACAAACCCAAGAAAGCCCAGGTCAAACAATTTCAAGAACAGCACAAAGTCAGCGACGCTTCTTCTTCTTCACCGGC
GCCACCATCATCGAGCATGTCATCCGCGTCTTCTTCTCCTTCACCGCCGCAGCCTCCGAAGGAATCATTTGCAAGGCGATATAAGTTCTTATGGCCCATGCTTTTGACTG
TCAACCTTGCTGTTGGAGCTTATCTGTTTATGAGAACAAAAAAGCAAGATGAACATGTAGCTGAAGAAGAGGCTGGCCCGGATTCAGCCAAAATCACCAAGATTGCTGCT
CCTGTTGTTGAGGAATCATTGGCCAGACCAACCATTGTGGAGCCTGTGAAGGTAAGAGAACCAATTCCGGTGGACCAGCAGCGTGAACTGTTCAAGTGGATTTTGGAAGA
GAAGCGCAAGATAAAGCCAAAGGACCGTGAAGAGAAAAAACGCATTGATGAAGAGAAAGCAATTCTCAAAGAGTTCATCCGAGCAAAATCTATTCCTAATGTTTAA
Protein sequenceShow/hide protein sequence
MSKEEPPKLYANKPKKAQVKQFQEQHKVSDASSSSPAPPSSSMSSASSSPSPPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEEAGPDSAKITKIAA
PVVEESLARPTIVEPVKVREPIPVDQQRELFKWILEEKRKIKPKDREEKKRIDEEKAILKEFIRAKSIPNV