; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10023052 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10023052
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPolyketide cyclase/dehydrase and lipid transporter
Genome locationChr05:30806490..30809448
RNA-Seq ExpressionHG10023052
SyntenyHG10023052
Gene Ontology termsNA
InterPro domainsIPR023393 - START-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008461396.1 PREDICTED: uncharacterized protein LOC103499992 isoform X1 [Cucumis melo]7.9e-8978.57Show/hide
Query:  MSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MSIA LT NSTPN L   HRSIRRRNGILFMAIPT RSINSRS SLP+ VFKIPR SSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFFPKGPSSCLIELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVK                                PT NQKIHWRSLEGLPNRGVVRF+PKGPSSCL+ELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFFPKGPSSCLIELTVSYEVPPLLSPVASALQ

Query:  PLLERLLQQGLKSFATFAKKYETA
        PLLERLLQ+GLKSFATFAKKY+TA
Subjt:  PLLERLLQQGLKSFATFAKKYETA

XP_022148316.1 uncharacterized protein LOC111016984 isoform X3 [Momordica charantia]3.9e-8885.94Show/hide
Query:  MSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS AA+TCNS PN  N SH SI+RRNG+L MAIPT RSINS+S SLPE VF+IPRGS KRS NP  PR+KFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKPTLNQKIHWRSLEGLPNRGVVRFFPKGPSSCLIELTVSYEVPPLLSPVASALQPLLERLLQQGLKSFATFAKKYETA
        REAIPKWMPFISSVKPTLNQKIHWRSLEGLPNRGVVRF+PKG SSCL+ELTVSYEVPPLLSPVASALQPLLERLL++GL+SFATFAKKY+TA
Subjt:  REAIPKWMPFISSVKPTLNQKIHWRSLEGLPNRGVVRFFPKGPSSCLIELTVSYEVPPLLSPVASALQPLLERLLQQGLKSFATFAKKYETA

XP_022960550.1 uncharacterized protein LOC111461253 [Cucurbita moschata]3.3e-8775.89Show/hide
Query:  MSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS+AALTCNS PN LN SHRSIRR NG+LFMAIPT RSINSRSLSLP+ +F+IPR SSK  RNPI PR+KFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFFPKGPSSCLIELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVK                                PTLNQKIHWRSLEGLPNRGVVRF+PKGPSSCL+ELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFFPKGPSSCLIELTVSYEVPPLLSPVASALQ

Query:  PLLERLLQQGLKSFATFAKKYETA
        PLLERLL++GL+SFATFAKKY+TA
Subjt:  PLLERLLQQGLKSFATFAKKYETA

XP_023514604.1 uncharacterized protein LOC111778852 [Cucurbita pepo subsp. pepo]1.5e-8775.89Show/hide
Query:  MSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS+AALTCNS PN LN SHRSIRR NG+LFMAIPT R+INSRSLSLP+ +F+IPRGSSK  RNPI PR+KFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFFPKGPSSCLIELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVK                                PTLNQKIHWRSLEGLPNRGVVRF+PKGPSSCL+ELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFFPKGPSSCLIELTVSYEVPPLLSPVASALQ

Query:  PLLERLLQQGLKSFATFAKKYETA
        PLLERLL++GL+SFATFAKKY+TA
Subjt:  PLLERLLQQGLKSFATFAKKYETA

XP_038898800.1 uncharacterized protein LOC120086302 [Benincasa hispida]1.1e-8777.68Show/hide
Query:  MSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        M+IAALTCNSTPN L  +HRSIRRRNGILFMAIPTCRSI+SRSLSLPE VFKIPR SSKR RNPICP LK VSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFFPKGPSSCLIELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVK                                PT NQKIHWRSLEGLPNRGVVRF+PKGPSSCL+ELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFFPKGPSSCLIELTVSYEVPPLLSPVASALQ

Query:  PLLERLLQQGLKSFATFAKKYETA
        PLLERLLQ+GLKSFA FAKKY+T+
Subjt:  PLLERLLQQGLKSFATFAKKYETA

TrEMBL top hitse value%identityAlignment
A0A0A0K6F1 Polyketide_cyc domain-containing protein8.2e-8475.89Show/hide
Query:  MSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MSIA LT NS+PN L   HRSIRRRNGI+FMAIPT RSINSRS SLP+ VFKIPR SSKRSRNP    LKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFFPKGPSSCLIELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVK                                PT NQKIHWRSLEGLPNRGVVRF+PKGPSSCL+ELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFFPKGPSSCLIELTVSYEVPPLLSPVASALQ

Query:  PLLERLLQQGLKSFATFAKKYETA
        PLLERLLQ+GLKSFATFAKKY+TA
Subjt:  PLLERLLQQGLKSFATFAKKYETA

A0A1S3CEM7 uncharacterized protein LOC103499992 isoform X13.8e-8978.57Show/hide
Query:  MSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MSIA LT NSTPN L   HRSIRRRNGILFMAIPT RSINSRS SLP+ VFKIPR SSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFFPKGPSSCLIELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVK                                PT NQKIHWRSLEGLPNRGVVRF+PKGPSSCL+ELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFFPKGPSSCLIELTVSYEVPPLLSPVASALQ

Query:  PLLERLLQQGLKSFATFAKKYETA
        PLLERLLQ+GLKSFATFAKKY+TA
Subjt:  PLLERLLQQGLKSFATFAKKYETA

A0A6J1D507 uncharacterized protein LOC111016984 isoform X31.9e-8885.94Show/hide
Query:  MSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS AA+TCNS PN  N SH SI+RRNG+L MAIPT RSINS+S SLPE VF+IPRGS KRS NP  PR+KFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKPTLNQKIHWRSLEGLPNRGVVRFFPKGPSSCLIELTVSYEVPPLLSPVASALQPLLERLLQQGLKSFATFAKKYETA
        REAIPKWMPFISSVKPTLNQKIHWRSLEGLPNRGVVRF+PKG SSCL+ELTVSYEVPPLLSPVASALQPLLERLL++GL+SFATFAKKY+TA
Subjt:  REAIPKWMPFISSVKPTLNQKIHWRSLEGLPNRGVVRFFPKGPSSCLIELTVSYEVPPLLSPVASALQPLLERLLQQGLKSFATFAKKYETA

A0A6J1H9D8 uncharacterized protein LOC1114612531.6e-8775.89Show/hide
Query:  MSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS+AALTCNS PN LN SHRSIRR NG+LFMAIPT RSINSRSLSLP+ +F+IPR SSK  RNPI PR+KFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFFPKGPSSCLIELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVK                                PTLNQKIHWRSLEGLPNRGVVRF+PKGPSSCL+ELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFFPKGPSSCLIELTVSYEVPPLLSPVASALQ

Query:  PLLERLLQQGLKSFATFAKKYETA
        PLLERLL++GL+SFATFAKKY+TA
Subjt:  PLLERLLQQGLKSFATFAKKYETA

A0A6J1KR05 uncharacterized protein LOC1114978788.0e-8775.45Show/hide
Query:  MSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS+AALTCNS PN LN SHRSIRR NG+LFMAIPT RSIN RSLSLP+ +F+IPR SSK  RNPI PR+KFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIPRGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFFPKGPSSCLIELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVK                                PTLNQKIHWRSLEGLPNRGVVRF+PKGPSSCL+ELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFFPKGPSSCLIELTVSYEVPPLLSPVASALQ

Query:  PLLERLLQQGLKSFATFAKKYETA
        PLLERLL++GL+SFATFAKKY+TA
Subjt:  PLLERLLQQGLKSFATFAKKYETA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G02470.1 Polyketide cyclase/dehydrase and lipid transport superfamily protein5.3e-3548.63Show/hide
Query:  PVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSV--------------------------------KPTLNQKIHWRSLEGLPNRGVVRFFPK
        PVM+WQ+ T KM VD PASVAYK Y+DRE  PKWMPF+SSV                                +P  ++KIHWRS+EG  NRG VRFFP+
Subjt:  PVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSV--------------------------------KPTLNQKIHWRSLEGLPNRGVVRFFPK

Query:  GPSSCLIELTVSYEVPPLLSPVASALQPLLERLLQQGLKSFATFAK
        GPSSCL+E++ SYEVP   +PVA A++P +E++++ GL+ FA F K
Subjt:  GPSSCLIELTVSYEVPPLLSPVASALQPLLERLLQQGLKSFATFAK

AT1G02470.2 Polyketide cyclase/dehydrase and lipid transport superfamily protein1.3e-3348.3Show/hide
Query:  PVMEWQNCT-AKMEVDIPASVAYKCYSDREAIPKWMPFISSV--------------------------------KPTLNQKIHWRSLEGLPNRGVVRFFP
        PVM+WQ+ T  KM VD PASVAYK Y+DRE  PKWMPF+SSV                                +P  ++KIHWRS+EG  NRG VRFFP
Subjt:  PVMEWQNCT-AKMEVDIPASVAYKCYSDREAIPKWMPFISSV--------------------------------KPTLNQKIHWRSLEGLPNRGVVRFFP

Query:  KGPSSCLIELTVSYEVPPLLSPVASALQPLLERLLQQGLKSFATFAK
        +GPSSCL+E++ SYEVP   +PVA A++P +E++++ GL+ FA F K
Subjt:  KGPSSCLIELTVSYEVPPLLSPVASALQPLLERLLQQGLKSFATFAK

AT1G02475.1 Polyketide cyclase/dehydrase and lipid transport superfamily protein2.0e-4255.56Show/hide
Query:  SKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSV--------------------------------KPTLNQKIHWR
        S+RSR  I P+ +  S  MEWQ+C+ KMEVD+P SVAY  Y DRE+ PKWMPFISSV                                +PT NQKIHWR
Subjt:  SKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSV--------------------------------KPTLNQKIHWR

Query:  SLEGLPNRGVVRFFPKGPSSCLIELTVSYEVPPLLSPVASALQPLLERLLQQGLKSFATFAK
        SLEGLPN+G VRFFPKGPSSC++ELTVSYEVP LL+PVAS L+P +E LL+ GL+ FA  AK
Subjt:  SLEGLPNRGVVRFFPKGPSSCLIELTVSYEVPPLLSPVASALQPLLERLLQQGLKSFATFAK

AT4G01883.1 Polyketide cyclase / dehydrase and lipid transport protein2.4e-3551.01Show/hide
Query:  VMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFFPKG
        +MEWQ C  KM+V++P SVAY  YS+RE+IPKWM FISSVK                                P  NQKIHW SLEGLPN+G VRFFP G
Subjt:  VMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVK--------------------------------PTLNQKIHWRSLEGLPNRGVVRFFPKG

Query:  PSSCLIELTVSYEVPPLLSPVASALQPLLERLLQQGLKSFATFAKKYET
        PSSC +ELT +YEVP LL P A+ALQPL++ L++  L+ FA  AK  +T
Subjt:  PSSCLIELTVSYEVPPLLSPVASALQPLLERLLQQGLKSFATFAKKYET


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTGTGCAAGAAGAACAGCCTCCGAAAATTTCTTTTTCCAGAGCCACAGTAATATCAAAGGGCTGTTCTTGCAGTTTGGTGCAAGACTGCAAGTCAAATCTCTGCT
CGCAACTCCCTTTAATGTCCCTTTGCCGTTTTCTTCTTCACTAATCCCAACACCCACTATGTCAATTGCAGCACTCACCTGCAATTCAACTCCAAACCCTCTGAATCCCA
GTCATCGATCGATCAGAAGGAGAAATGGCATCTTATTCATGGCGATTCCCACTTGCAGAAGCATCAATTCCAGGTCACTGTCTCTACCCGAGTTCGTCTTCAAGATCCCA
CGCGGTTCTTCGAAGCGCAGCAGAAACCCCATTTGCCCTCGACTCAAATTCGTCTCCCCTGTGATGGAATGGCAGAACTGCACGGCTAAGATGGAAGTTGACATACCTGC
TTCGGTTGCCTATAAATGCTACTCAGATCGTGAAGCTATTCCCAAATGGATGCCATTCATTTCATCTGTGAAGCCGACCCTAAATCAAAAAATTCATTGGCGGTCACTTG
AAGGTCTTCCAAACAGAGGTGTTGTACGTTTTTTTCCAAAGGGCCCCTCATCTTGCCTTATAGAACTGACAGTCTCCTATGAAGTTCCTCCTCTTTTGTCTCCAGTGGCA
TCCGCACTGCAACCTTTGCTTGAGAGATTACTTCAACAAGGTCTTAAAAGCTTTGCCACGTTTGCCAAGAAATACGAAACGGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTTGTGCAAGAAGAACAGCCTCCGAAAATTTCTTTTTCCAGAGCCACAGTAATATCAAAGGGCTGTTCTTGCAGTTTGGTGCAAGACTGCAAGTCAAATCTCTGCT
CGCAACTCCCTTTAATGTCCCTTTGCCGTTTTCTTCTTCACTAATCCCAACACCCACTATGTCAATTGCAGCACTCACCTGCAATTCAACTCCAAACCCTCTGAATCCCA
GTCATCGATCGATCAGAAGGAGAAATGGCATCTTATTCATGGCGATTCCCACTTGCAGAAGCATCAATTCCAGGTCACTGTCTCTACCCGAGTTCGTCTTCAAGATCCCA
CGCGGTTCTTCGAAGCGCAGCAGAAACCCCATTTGCCCTCGACTCAAATTCGTCTCCCCTGTGATGGAATGGCAGAACTGCACGGCTAAGATGGAAGTTGACATACCTGC
TTCGGTTGCCTATAAATGCTACTCAGATCGTGAAGCTATTCCCAAATGGATGCCATTCATTTCATCTGTGAAGCCGACCCTAAATCAAAAAATTCATTGGCGGTCACTTG
AAGGTCTTCCAAACAGAGGTGTTGTACGTTTTTTTCCAAAGGGCCCCTCATCTTGCCTTATAGAACTGACAGTCTCCTATGAAGTTCCTCCTCTTTTGTCTCCAGTGGCA
TCCGCACTGCAACCTTTGCTTGAGAGATTACTTCAACAAGGTCTTAAAAGCTTTGCCACGTTTGCCAAGAAATACGAAACGGCTTGA
Protein sequenceShow/hide protein sequence
MVCARRTASENFFFQSHSNIKGLFLQFGARLQVKSLLATPFNVPLPFSSSLIPTPTMSIAALTCNSTPNPLNPSHRSIRRRNGILFMAIPTCRSINSRSLSLPEFVFKIP
RGSSKRSRNPICPRLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVKPTLNQKIHWRSLEGLPNRGVVRFFPKGPSSCLIELTVSYEVPPLLSPVA
SALQPLLERLLQQGLKSFATFAKKYETA