; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10013501 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10013501
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProtein of unknown function, DUF599
Genome locationChr02:2116218..2116712
RNA-Seq ExpressionHG10013501
SyntenyHG10013501
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR006747 - Protein of unknown function DUF599


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0038728.1 uncharacterized protein E6C27_scaffold92G002740 [Cucumis melo var. makuwa]3.9e-7693.94Show/hide
Query:  MGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPGFAAVTSDYISDLLDKGFIL
        MGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPG A++T++YISDLLDKGFIL
Subjt:  MGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPGFAAVTSDYISDLLDKGFIL

Query:  NTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRK-TIVAGEGNGELSIV
        NTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCS ATGK+K TIVAG GNGELS V
Subjt:  NTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRK-TIVAGEGNGELSIV

XP_004136437.1 uncharacterized protein LOC101209101 [Cucumis sativus]2.1e-7794.51Show/hide
Query:  MGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPGFAAVTSDYISDLLDKGFIL
        MGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPG A++T+DYISDLLDKGFIL
Subjt:  MGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPGFAAVTSDYISDLLDKGFIL

Query:  NTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRKTIVAGEGNGELSIV
        NTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCS  T KRKTIVAG GNGELS V
Subjt:  NTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRKTIVAGEGNGELSIV

XP_008466268.2 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103503728 [Cucumis melo]1.2e-7492.73Show/hide
Query:  MGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPGFAAVTSDYISDLLDKGFIL
        MGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPP   ++T++YISDLLDKGFIL
Subjt:  MGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPGFAAVTSDYISDLLDKGFIL

Query:  NTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRK-TIVAGEGNGELSIV
        NTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCS ATGK+K TIVAG GNGELS V
Subjt:  NTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRK-TIVAGEGNGELSIV

XP_023536044.1 uncharacterized protein LOC111797297 [Cucurbita pepo subsp. pepo]2.5e-7592.68Show/hide
Query:  MGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPGFAAVTSDYISDLLDKGFIL
        MGCTLMATTSILLC GLAAVLSSTYSIKKPLNDAVYGAHGDFML LKYVTLLTLFLFSFFCHSLSIRFINQ NILINIPPG AAVTSDYISDLLDKGFIL
Subjt:  MGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPGFAAVTSDYISDLLDKGFIL

Query:  NTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRKTIVAGEGNGELSIV
        NTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLD VCS A GKRKT+ AG+ NGELS V
Subjt:  NTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRKTIVAGEGNGELSIV

XP_038899190.1 uncharacterized protein LOC120086553 [Benincasa hispida]1.2e-7795.73Show/hide
Query:  MGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPGFAAVTSDYISDLLDKGFIL
        MGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIP G AAVTSDYISDLLDKGFIL
Subjt:  MGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPGFAAVTSDYISDLLDKGFIL

Query:  NTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRKTIVAGEGNGELSIV
        N VGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCS+AT KRKTIVAGEGNGEL+ V
Subjt:  NTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRKTIVAGEGNGELSIV

TrEMBL top hitse value%identityAlignment
A0A0A0LEJ6 Uncharacterized protein9.9e-7894.51Show/hide
Query:  MGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPGFAAVTSDYISDLLDKGFIL
        MGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPG A++T+DYISDLLDKGFIL
Subjt:  MGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPGFAAVTSDYISDLLDKGFIL

Query:  NTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRKTIVAGEGNGELSIV
        NTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCS  T KRKTIVAG GNGELS V
Subjt:  NTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRKTIVAGEGNGELSIV

A0A1S3CQV1 LOW QUALITY PROTEIN: uncharacterized protein LOC1035037286.0e-7592.73Show/hide
Query:  MGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPGFAAVTSDYISDLLDKGFIL
        MGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPP   ++T++YISDLLDKGFIL
Subjt:  MGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPGFAAVTSDYISDLLDKGFIL

Query:  NTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRK-TIVAGEGNGELSIV
        NTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCS ATGK+K TIVAG GNGELS V
Subjt:  NTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRK-TIVAGEGNGELSIV

A0A5A7T794 Uncharacterized protein1.9e-7693.94Show/hide
Query:  MGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPGFAAVTSDYISDLLDKGFIL
        MGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPG A++T++YISDLLDKGFIL
Subjt:  MGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPGFAAVTSDYISDLLDKGFIL

Query:  NTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRK-TIVAGEGNGELSIV
        NTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCS ATGK+K TIVAG GNGELS V
Subjt:  NTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRK-TIVAGEGNGELSIV

A0A6J1FE98 uncharacterized protein LOC1114432817.9e-7592.07Show/hide
Query:  MGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPGFAAVTSDYISDLLDKGFIL
        MGCTLMATTSILLC GLAAVLSSTYSIKKPLNDAVYGAHGDFML LKYVTLLTLFLFSFFCHSLSIRFINQ NILINIPPG AAVTS YISDLLDKGFIL
Subjt:  MGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPGFAAVTSDYISDLLDKGFIL

Query:  NTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRKTIVAGEGNGELSIV
        NTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLD VCS A GKRKT+ AG+ NGELS V
Subjt:  NTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRKTIVAGEGNGELSIV

A0A6J1IF19 uncharacterized protein LOC1114760396.7e-7490.85Show/hide
Query:  MGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPGFAAVTSDYISDLLDKGFIL
        MGCTLMATTSILLC GLAAVLSSTYSIKKPLND VYGAHGDFML LKYVTLLTLFLFSFFCHSLSIRFINQ NILINIPPG  AVTS YISDLLDKGFIL
Subjt:  MGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPGFAAVTSDYISDLLDKGFIL

Query:  NTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRKTIVAGEGNGELSIV
        NTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLD VCS A GKRKT+ AG+ NGELS V
Subjt:  NTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRKTIVAGEGNGELSIV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G18215.1 Protein of unknown function, DUF5991.1e-1533.33Show/hide
Query:  MGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPGFA-AVTSDYISDLLDKGFI
        M  TL+ATT+I LC+ +   +S++ S K    + +YG+    +   K   +L  FL +F C+  SIR+   V+ L+ +P         +Y+S  L++   
Subjt:  MGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPGFA-AVTSDYISDLLDKGFI

Query:  LNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCS
          ++G R FY + P+ LW FGP+ +FVC   M  +LY LD   S
Subjt:  LNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCS

AT4G31330.1 Protein of unknown function, DUF5997.6e-5473.29Show/hide
Query:  MGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIP-------PGFAAVTSDYISDL
        MG TLMATTSILLC GLAAVLSSTY++KKPLNDAV+GA G+FM+ LKYVT+LT+FLFSFF HSLSIRFINQVNILIN P               +Y+++L
Subjt:  MGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIP-------PGFAAVTSDYISDL

Query:  LDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLD
        L++GFILNTVGNRLFYAALP++LWIFGPVLVF+CSV MVP+LYNLD
Subjt:  LDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLD

AT5G10580.1 Protein of unknown function, DUF5997.1e-5271.33Show/hide
Query:  MGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPG---------FAAVTSDYIS
        MG TLMATT ILLC GLAAVLSSTYSIKKPLNDAVYGAHGDF + LKYVT+LT+FLF+FF HSLSIRFINQVNILIN P            + VT +Y+S
Subjt:  MGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPG---------FAAVTSDYIS

Query:  DLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVV
        +LL+K F+LNTVGNRLFY  LP++LWIFGPVLVF+ S  ++PVLYNLD V
Subjt:  DLLDKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVV

AT5G10580.2 Protein of unknown function, DUF5991.6e-3570.54Show/hide
Query:  MGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPG---------FAAVTSDYIS
        MG TLMATT ILLC GLAAVLSSTYSIKKPLNDAVYGAHGDF + LKYVT+LT+FLF+FF HSLSIRFINQVNILIN P            + VT +Y+S
Subjt:  MGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPG---------FAAVTSDYIS

Query:  DLLDKGFILNTV
        +LL+K F+LNT+
Subjt:  DLLDKGFILNTV

AT5G24790.1 Protein of unknown function, DUF5991.5e-4664.63Show/hide
Query:  MGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINI------PPGFAAVTSDYISDLL
        MG TLMATT +LLC GLAAVLSSTYSIKKPLNDAV+GAHGDF + +KY+T+LT+F+FSFF HSLSIRF+NQV IL+NI      P G   +TS+++S++ 
Subjt:  MGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINI------PPGFAAVTSDYISDLL

Query:  DKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVV
        +KG  LNTVGNRLFYA   ++LWIFGP+LVF   + MV VL +LD V
Subjt:  DKGFILNTVGNRLFYAALPMLLWIFGPVLVFVCSVSMVPVLYNLDVV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGATGCACCCTAATGGCCACCACTTCGATTCTCCTCTGTACGGGTCTAGCGGCAGTATTGAGTAGCACATATAGCATAAAAAAGCCCCTAAACGACGCCGTTTACGG
GGCCCACGGCGATTTCATGTTGGGCCTTAAATACGTGACTTTACTCACACTATTCCTCTTCTCCTTCTTCTGCCATTCCCTCTCCATTAGATTCATAAATCAGGTTAATA
TTCTCATCAACATTCCCCCCGGCTTCGCCGCCGTCACCTCCGATTACATTTCCGATCTTCTTGACAAAGGATTTATTCTCAACACCGTCGGCAACCGCCTCTTCTACGCC
GCCCTGCCGATGCTCCTCTGGATTTTTGGCCCTGTTTTGGTCTTCGTTTGCTCTGTTTCTATGGTCCCTGTGCTTTATAATCTTGACGTCGTTTGCTCCGCCGCCACCGG
AAAAAGGAAGACGATCGTCGCCGGTGAGGGTAATGGTGAGCTCAGTATTGTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGATGCACCCTAATGGCCACCACTTCGATTCTCCTCTGTACGGGTCTAGCGGCAGTATTGAGTAGCACATATAGCATAAAAAAGCCCCTAAACGACGCCGTTTACGG
GGCCCACGGCGATTTCATGTTGGGCCTTAAATACGTGACTTTACTCACACTATTCCTCTTCTCCTTCTTCTGCCATTCCCTCTCCATTAGATTCATAAATCAGGTTAATA
TTCTCATCAACATTCCCCCCGGCTTCGCCGCCGTCACCTCCGATTACATTTCCGATCTTCTTGACAAAGGATTTATTCTCAACACCGTCGGCAACCGCCTCTTCTACGCC
GCCCTGCCGATGCTCCTCTGGATTTTTGGCCCTGTTTTGGTCTTCGTTTGCTCTGTTTCTATGGTCCCTGTGCTTTATAATCTTGACGTCGTTTGCTCCGCCGCCACCGG
AAAAAGGAAGACGATCGTCGCCGGTGAGGGTAATGGTGAGCTCAGTATTGTTTAG
Protein sequenceShow/hide protein sequence
MGCTLMATTSILLCTGLAAVLSSTYSIKKPLNDAVYGAHGDFMLGLKYVTLLTLFLFSFFCHSLSIRFINQVNILINIPPGFAAVTSDYISDLLDKGFILNTVGNRLFYA
ALPMLLWIFGPVLVFVCSVSMVPVLYNLDVVCSAATGKRKTIVAGEGNGELSIV