; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10004552 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10004552
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProtein of unknown function (DUF1677)
Genome locationChr08:18246830..18248275
RNA-Seq ExpressionHG10004552
SyntenyHG10004552
Gene Ontology termsNA
InterPro domainsIPR012876 - Protein of unknown function DUF1677, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008445002.1 PREDICTED: uncharacterized protein LOC103488172 isoform X1 [Cucumis melo]2.1e-7584.7Show/hide
Query:  MAPNKLETKVEV-QRP----SMEGLQRTISDICSELTKEDLAVAG-NLPLPPITEVEAAACECCGLSEECTAEYIGRVRDKFMGKLICGLCAEAVNEGME
        MAPNKLETKVEV Q+P    SMEGLQRTISDI  ELTKE LAVAG N+PLPPI+EVEAAACECCGLSE+CTAEYIGRV+DKFMGKLICGLCAEAVNE ME
Subjt:  MAPNKLETKVEV-QRP----SMEGLQRTISDICSELTKEDLAVAG-NLPLPPITEVEAAACECCGLSEECTAEYIGRVRDKFMGKLICGLCAEAVNEGME

Query:  KKGWKREEALKEHMSACAKFNRIGRAYPVLYQAEAIKHILK--KTSTKGIKSTVAATHRNGRIGRTSSCIPAITRDVCDPTIV
        KKGWKREEALKEHMSACAKFNRIGR YPVLYQAEAIK ILK  KTS  GIKS V A HR GRIGRTSSCIPA+TRD+CDPT+V
Subjt:  KKGWKREEALKEHMSACAKFNRIGRAYPVLYQAEAIKHILK--KTSTKGIKSTVAATHRNGRIGRTSSCIPAITRDVCDPTIV

XP_016899975.1 PREDICTED: uncharacterized protein LOC103488172 isoform X2 [Cucumis melo]1.9e-7384.15Show/hide
Query:  MAPNKLETKVEV-QRP----SMEGLQRTISDICSELTKEDLAVAG-NLPLPPITEVEAAACECCGLSEECTAEYIGRVRDKFMGKLICGLCAEAVNEGME
        MAPNKLETKVEV Q+P    SMEGLQRTISDI  ELTKE LAVAG N+PLPPI+EVEAAACECCGLSE+CTAEYIGRV+DKFMGKLICGLCAEAVNE ME
Subjt:  MAPNKLETKVEV-QRP----SMEGLQRTISDICSELTKEDLAVAG-NLPLPPITEVEAAACECCGLSEECTAEYIGRVRDKFMGKLICGLCAEAVNEGME

Query:  KKGWKREEALKEHMSACAKFNRIGRAYPVLYQAEAIKHILK--KTSTKGIKSTVAATHRNGRIGRTSSCIPAITRDVCDPTIV
         KGWKREEALKEHMSACAKFNRIGR YPVLYQAEAIK ILK  KTS  GIKS V A HR GRIGRTSSCIPA+TRD+CDPT+V
Subjt:  KKGWKREEALKEHMSACAKFNRIGRAYPVLYQAEAIKHILK--KTSTKGIKSTVAATHRNGRIGRTSSCIPAITRDVCDPTIV

XP_023547063.1 uncharacterized protein LOC111805983 [Cucurbita pepo subsp. pepo]8.4e-6976.92Show/hide
Query:  MAPNKLETKVEVQRP---SMEGLQRTISDICSELTKEDLAVAGNLPLPPITEVEAAACECCGLSEECTAEYIGRVRDKFMGKLICGLCAEAVNEGMEKKG
        MAPN+LETKVEVQ+P   SMEGLQRTISDI SELTK  L  A   PLP I+EVEAA CECCGLSEECTAEYI RVR +FMGK+ICGLCAEAVNE M K+G
Subjt:  MAPNKLETKVEVQRP---SMEGLQRTISDICSELTKEDLAVAGNLPLPPITEVEAAACECCGLSEECTAEYIGRVRDKFMGKLICGLCAEAVNEGMEKKG

Query:  WKREEALKEHMSACAKFNRIGRAYPVLYQAEAIKHILKKTSTKGIKST---VAATHRNGRIGRTSSCIPAITRDVCDPTIVK
        WK+EE+LKEHM+ACAKFNR GR YPVLYQAEAIK ILKK+ST+G   +     A HRNGRIGRTSSCIPAITRDVCDPT+VK
Subjt:  WKREEALKEHMSACAKFNRIGRAYPVLYQAEAIKHILKKTSTKGIKST---VAATHRNGRIGRTSSCIPAITRDVCDPTIVK

XP_038885709.1 uncharacterized protein LOC120076007 isoform X1 [Benincasa hispida]1.6e-7284.88Show/hide
Query:  LETKVEVQRPSMEGLQRTISDICSELTKEDLAVAGNLPLPPITEVEAAACECCGLSEECTAEYIGRVRDKFMGKLICGLCAEAVNEGMEKKGWKREEALK
        + +KVEVQ+PSMEGLQRTISDI SELTKE LAV GNLPLP I+EVEAAACECCGLSE+CTAEYIG VRDKFMGKLICGLCAEAVNE MEKK W REEALK
Subjt:  LETKVEVQRPSMEGLQRTISDICSELTKEDLAVAGNLPLPPITEVEAAACECCGLSEECTAEYIGRVRDKFMGKLICGLCAEAVNEGMEKKGWKREEALK

Query:  EHMSACAKFNRIGRAYPVLYQAEAIKHILKKTSTKGIKSTVAATHRNGRIGRTSSCIPAITRDVCD-PTIVK
        EHM+ACAKFN+IGRAYPVLYQAEAIK ILKKTS    KS V+A HRNGRIGRTSSCIPAITRDVCD PT+VK
Subjt:  EHMSACAKFNRIGRAYPVLYQAEAIKHILKKTSTKGIKSTVAATHRNGRIGRTSSCIPAITRDVCD-PTIVK

XP_038885710.1 uncharacterized protein LOC120076007 isoform X2 [Benincasa hispida]1.5e-7084.3Show/hide
Query:  LETKVEVQRPSMEGLQRTISDICSELTKEDLAVAGNLPLPPITEVEAAACECCGLSEECTAEYIGRVRDKFMGKLICGLCAEAVNEGMEKKGWKREEALK
        + +KVEVQ+PSMEGLQRTISDI SELTKE LAV GNLPLP I+EVEAAACECCGLSE+CTAEYIG VRDKFMGKLICGLCAEAVNE ME K W REEALK
Subjt:  LETKVEVQRPSMEGLQRTISDICSELTKEDLAVAGNLPLPPITEVEAAACECCGLSEECTAEYIGRVRDKFMGKLICGLCAEAVNEGMEKKGWKREEALK

Query:  EHMSACAKFNRIGRAYPVLYQAEAIKHILKKTSTKGIKSTVAATHRNGRIGRTSSCIPAITRDVCD-PTIVK
        EHM+ACAKFN+IGRAYPVLYQAEAIK ILKKTS    KS V+A HRNGRIGRTSSCIPAITRDVCD PT+VK
Subjt:  EHMSACAKFNRIGRAYPVLYQAEAIKHILKKTSTKGIKSTVAATHRNGRIGRTSSCIPAITRDVCD-PTIVK

TrEMBL top hitse value%identityAlignment
A0A0A0LLX3 Uncharacterized protein3.0e-6479.5Show/hide
Query:  MEGLQRTISDICSELTKEDLAV-AGNLPLPPITEVEAAACECCGLSEECTAEYIGRVRDKFMGKLICGLCAEAVNEGMEKKGWKREEALKEHMSACAKFN
        MEGLQRTISDI  EL+KE L V A N+PLPPI+EVEAAACECCGLSE+CTAEYIG V+DKFMGKLICGLCAEAVNE MEK GWKREEALKEHMSACAKFN
Subjt:  MEGLQRTISDICSELTKEDLAV-AGNLPLPPITEVEAAACECCGLSEECTAEYIGRVRDKFMGKLICGLCAEAVNEGMEKKGWKREEALKEHMSACAKFN

Query:  RIGRAYPVLYQAEAIKHILKKTSTKGIKSTVAATHRNGRIGRTSSCIPAITRDVCDPTIVK
        RIGR YPVLYQAEAIK ILKKT          + HR GRIGR+SSCIPA+ RDVCDPT+VK
Subjt:  RIGRAYPVLYQAEAIKHILKKTSTKGIKSTVAATHRNGRIGRTSSCIPAITRDVCDPTIVK

A0A1S3BBP3 uncharacterized protein LOC103488172 isoform X11.0e-7584.7Show/hide
Query:  MAPNKLETKVEV-QRP----SMEGLQRTISDICSELTKEDLAVAG-NLPLPPITEVEAAACECCGLSEECTAEYIGRVRDKFMGKLICGLCAEAVNEGME
        MAPNKLETKVEV Q+P    SMEGLQRTISDI  ELTKE LAVAG N+PLPPI+EVEAAACECCGLSE+CTAEYIGRV+DKFMGKLICGLCAEAVNE ME
Subjt:  MAPNKLETKVEV-QRP----SMEGLQRTISDICSELTKEDLAVAG-NLPLPPITEVEAAACECCGLSEECTAEYIGRVRDKFMGKLICGLCAEAVNEGME

Query:  KKGWKREEALKEHMSACAKFNRIGRAYPVLYQAEAIKHILK--KTSTKGIKSTVAATHRNGRIGRTSSCIPAITRDVCDPTIV
        KKGWKREEALKEHMSACAKFNRIGR YPVLYQAEAIK ILK  KTS  GIKS V A HR GRIGRTSSCIPA+TRD+CDPT+V
Subjt:  KKGWKREEALKEHMSACAKFNRIGRAYPVLYQAEAIKHILK--KTSTKGIKSTVAATHRNGRIGRTSSCIPAITRDVCDPTIV

A0A1S4DW91 uncharacterized protein LOC103488172 isoform X29.3e-7484.15Show/hide
Query:  MAPNKLETKVEV-QRP----SMEGLQRTISDICSELTKEDLAVAG-NLPLPPITEVEAAACECCGLSEECTAEYIGRVRDKFMGKLICGLCAEAVNEGME
        MAPNKLETKVEV Q+P    SMEGLQRTISDI  ELTKE LAVAG N+PLPPI+EVEAAACECCGLSE+CTAEYIGRV+DKFMGKLICGLCAEAVNE ME
Subjt:  MAPNKLETKVEV-QRP----SMEGLQRTISDICSELTKEDLAVAG-NLPLPPITEVEAAACECCGLSEECTAEYIGRVRDKFMGKLICGLCAEAVNEGME

Query:  KKGWKREEALKEHMSACAKFNRIGRAYPVLYQAEAIKHILK--KTSTKGIKSTVAATHRNGRIGRTSSCIPAITRDVCDPTIV
         KGWKREEALKEHMSACAKFNRIGR YPVLYQAEAIK ILK  KTS  GIKS V A HR GRIGRTSSCIPA+TRD+CDPT+V
Subjt:  KKGWKREEALKEHMSACAKFNRIGRAYPVLYQAEAIKHILK--KTSTKGIKSTVAATHRNGRIGRTSSCIPAITRDVCDPTIV

A0A5A7VHU7 DUF1677 domain-containing protein1.8e-6482.1Show/hide
Query:  MEGLQRTISDICSELTKEDLAVAG-NLPLPPITEVEAAACECCGLSEECTAEYIGRVRDKFMGKLICGLCAEAVNEGMEKKGWKREEALKEHMSACAKFN
        MEGLQRTISDI  ELTKE L VAG N+PLPPI+EVEAAACECCGLSE+CTAEYIGRV+DKFMGKLICGLCAEA       KGWKREEALKEHMSACAKFN
Subjt:  MEGLQRTISDICSELTKEDLAVAG-NLPLPPITEVEAAACECCGLSEECTAEYIGRVRDKFMGKLICGLCAEAVNEGMEKKGWKREEALKEHMSACAKFN

Query:  RIGRAYPVLYQAEAIKHILK--KTSTKGIKSTVAATHRNGRIGRTSSCIPAITRDVCDPTIV
        RIGR YPVLYQAEAIK ILK  KTS  GIKS V A HR GRIGRTSSCIPA+TRD+CDPT+V
Subjt:  RIGRAYPVLYQAEAIKHILK--KTSTKGIKSTVAATHRNGRIGRTSSCIPAITRDVCDPTIV

A0A6J1K942 uncharacterized protein LOC1114917772.6e-6875.82Show/hide
Query:  MAPNKLETKVEVQRP---SMEGLQRTISDICSELTKEDLAVAGNLPLPPITEVEAAACECCGLSEECTAEYIGRVRDKFMGKLICGLCAEAVNEGMEKKG
        MAPN+ ETKVEV++P   SMEGLQRTISDI SELTK  L  A   PLPPI+EVEAA CECCGLSEECTAEYI RVR +FMGK+ICGLCAEAVNE M K+G
Subjt:  MAPNKLETKVEVQRP---SMEGLQRTISDICSELTKEDLAVAGNLPLPPITEVEAAACECCGLSEECTAEYIGRVRDKFMGKLICGLCAEAVNEGMEKKG

Query:  WKREEALKEHMSACAKFNRIGRAYPVLYQAEAIKHILKKTSTKGIKST---VAATHRNGRIGRTSSCIPAITRDVCDPTIVK
        W++EE+LKEHMSACAKFNR GR YPVLYQAEAIK ILKK+ST+G   +     A HRNGRIGRTSSCIPAIT+DVCDPT+VK
Subjt:  WKREEALKEHMSACAKFNRIGRAYPVLYQAEAIKHILKKTSTKGIKST---VAATHRNGRIGRTSSCIPAITRDVCDPTIVK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G79770.1 Protein of unknown function (DUF1677)7.2e-3451.63Show/hide
Query:  EGLQRTISDICSELTKEDLAVAGNLPLPPITEVEAAACECCGLSEECTAEYIGRVRDKFMGKLICGLCAEAVNEGMEKKGWK-REEALKEHMSACAKFNR
        + L R++SDI  +  +E+    G      I EVE A CECCG+ EECT EYI RVR+KF GK ICGLC+EAV E  +K+  +  E ALKEHMSAC +FN+
Subjt:  EGLQRTISDICSELTKEDLAVAGNLPLPPITEVEAAACECCGLSEECTAEYIGRVRDKFMGKLICGLCAEAVNEGMEKKGWK-REEALKEHMSACAKFNR

Query:  IGRAYPVLYQAEAIKHILKKTSTKGIKSTVAATHRNGRIGRTSSCIPAITRDV
        +GR YP L+QA+A++ +L++ ST+G +S +        I RTSSCIPAITRD+
Subjt:  IGRAYPVLYQAEAIKHILKKTSTKGIKSTVAATHRNGRIGRTSSCIPAITRDV

AT2G25780.1 Protein of unknown function (DUF1677)9.7e-1531.54Show/hide
Query:  LQRTISDICSELTKEDLAVAGNLPLPPITEVEAAACECCGLSEECTAEYIGRVRDKFMGKLICGLCAEAVNEGMEKK---GWKREEALKEHMSACAKFNR
        L++  SD+  E  K +++           EV    C+CCG+ EECT +YI +VR+ + G  +CGLC E V E + K        +EA   H   C  FN 
Subjt:  LQRTISDICSELTKEDLAVAGNLPLPPITEVEAAACECCGLSEECTAEYIGRVRDKFMGKLICGLCAEAVNEGMEKK---GWKREEALKEHMSACAKFNR

Query:  IGRAYPVLYQAEAIKHILKKTSTKGIKSTVAATHRNGRIGRTSSCIPAI
          R  P L    +++ I K++S   + S  +      +I RT SC P +
Subjt:  IGRAYPVLYQAEAIKHILKKTSTKGIKSTVAATHRNGRIGRTSSCIPAI

AT3G22540.1 Protein of unknown function (DUF1677)5.7e-1540.66Show/hide
Query:  EVEAAACECCGLSEECTAEYIGRVRDKFMGKLICGLCAEAV-NEGMEKKGWKREEALKEHMSACAKFNRIGRAYPVLYQAEAIKHILKKTS
        E+E+  CECCGL E+CT +YI  V+  F  K +CGLC+EAV +E   +K    +EA+K H+S C KF +     P ++ A+ ++ +L++ S
Subjt:  EVEAAACECCGLSEECTAEYIGRVRDKFMGKLICGLCAEAV-NEGMEKKGWKREEALKEHMSACAKFNRIGRAYPVLYQAEAIKHILKKTS

AT4G14819.1 Protein of unknown function (DUF1677)6.1e-1743.33Show/hide
Query:  EVEAAACECCGLSEECTAEYIGRVRDKFMGKLICGLCAEAVNEGMEKKGWKREEALKEHMSACAKFNRIGRAYPVLYQAEAIKHILKKTS
        E+E+  CECCGL E+CT  YI +V+  F GK +CGLC+EAV++   +     EEA+  HMS C KFN    A P    A+ ++ +L++ S
Subjt:  EVEAAACECCGLSEECTAEYIGRVRDKFMGKLICGLCAEAVNEGMEKKGWKREEALKEHMSACAKFNRIGRAYPVLYQAEAIKHILKKTS

AT5G25840.1 Protein of unknown function (DUF1677)8.7e-4053.14Show/hide
Query:  MEGLQRTISDICSE------LTKEDLAVAGN---LPLPPITEVEAAACECCGLSEECTAEYIGRVRDKFMGKLICGLCAEAVNEGMEKKG-------WKR
        M  LQRTISDI  +      LTKE    A     L L  I+EVE A CECCG+SEECT EYI RVR KF GKLICGLC +AV   MEK          +R
Subjt:  MEGLQRTISDICSE------LTKEDLAVAGN---LPLPPITEVEAAACECCGLSEECTAEYIGRVRDKFMGKLICGLCAEAVNEGMEKKG-------WKR

Query:  EEALKEHMSACAKFNRIGRAYPVLYQAEAIKHILKKTSTKGIKSTVAATHRNGRIGRTSSCIPAITRDVCDPTIV
        EEA+K HMSAC++FNR+GR+YPVLYQAEA+K +LKK S K + +T       G + R+SSC+PA+ +++ D T+V
Subjt:  EEALKEHMSACAKFNRIGRAYPVLYQAEAIKHILKKTSTKGIKSTVAATHRNGRIGRTSSCIPAITRDVCDPTIV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCCCAATAAATTGGAAACAAAAGTGGAAGTGCAAAGGCCATCAATGGAGGGTCTCCAACGAACCATATCCGACATCTGCTCGGAGTTAACGAAAGAAGATCTTGC
AGTTGCCGGAAACCTACCCCTGCCGCCCATCACCGAGGTGGAGGCCGCCGCTTGTGAGTGTTGCGGCCTCTCTGAGGAATGCACGGCGGAGTACATCGGTCGTGTTCGTG
ACAAGTTCATGGGAAAGCTTATTTGTGGCCTCTGCGCCGAGGCCGTAAACGAGGGGATGGAGAAGAAGGGTTGGAAAAGAGAGGAGGCATTGAAGGAACATATGAGCGCT
TGTGCAAAGTTCAATAGGATCGGAAGGGCATATCCGGTGTTGTATCAAGCGGAAGCAATTAAACACATTCTCAAGAAAACCTCCACCAAAGGAATTAAGTCCACCGTCGC
CGCCACCCACCGCAATGGCAGGATTGGTCGGACCTCCAGTTGCATTCCGGCCATCACTAGAGACGTTTGTGATCCAACCATAGTTAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTCCCAATAAATTGGAAACAAAAGTGGAAGTGCAAAGGCCATCAATGGAGGGTCTCCAACGAACCATATCCGACATCTGCTCGGAGTTAACGAAAGAAGATCTTGC
AGTTGCCGGAAACCTACCCCTGCCGCCCATCACCGAGGTGGAGGCCGCCGCTTGTGAGTGTTGCGGCCTCTCTGAGGAATGCACGGCGGAGTACATCGGTCGTGTTCGTG
ACAAGTTCATGGGAAAGCTTATTTGTGGCCTCTGCGCCGAGGCCGTAAACGAGGGGATGGAGAAGAAGGGTTGGAAAAGAGAGGAGGCATTGAAGGAACATATGAGCGCT
TGTGCAAAGTTCAATAGGATCGGAAGGGCATATCCGGTGTTGTATCAAGCGGAAGCAATTAAACACATTCTCAAGAAAACCTCCACCAAAGGAATTAAGTCCACCGTCGC
CGCCACCCACCGCAATGGCAGGATTGGTCGGACCTCCAGTTGCATTCCGGCCATCACTAGAGACGTTTGTGATCCAACCATAGTTAAATGA
Protein sequenceShow/hide protein sequence
MAPNKLETKVEVQRPSMEGLQRTISDICSELTKEDLAVAGNLPLPPITEVEAAACECCGLSEECTAEYIGRVRDKFMGKLICGLCAEAVNEGMEKKGWKREEALKEHMSA
CAKFNRIGRAYPVLYQAEAIKHILKKTSTKGIKSTVAATHRNGRIGRTSSCIPAITRDVCDPTIVK