CuGenDBv2

HG10016147 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10016147
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Protein of unknown function (DUF1218)
Genome location: Chr03:3160017..3160823
RNA-Seq Expression: HG10016147
Synteny: HG10016147
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR009606 - Modifying wall lignin-1/2


Homology
GenBank top hits | e value | % identity
XP_004134744.2 uncharacterized protein LOC101214369 [Cucumis sativus] | 1.4e-82 | 87.36
Query:  MQRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVF
        M++TYG VVCIL+II+DVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANL GGCI VRSA +YKGL+ NKQLA GSL+F
Subjt:  MQRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVF

Query:  AWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV
        AWIALVVGFSLLISGAMYNTRSRKSCGLAHN+LLSIGGIVCFVHGLFAVAYYVS TA  RE+ K    PPPQGNPAGATGHV
Subjt:  AWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV

XP_008439942.1 PREDICTED: uncharacterized protein LOC103484578 [Cucumis melo] | 1.4e-82 | 87.36
Query:  MQRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVF
        M++TYG VVCIL+II+DVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANL GGCIFVRSA +YKGL+ NKQLAAGSL+F
Subjt:  MQRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVF

Query:  AWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV
        AWIAL+VGFSLLISGAMYNTRSRKSCGLAHN+LLSIGGIVCFVHGLFAVAYYVS TA  RE+ K    PPPQGNPA ATGHV
Subjt:  AWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV

XP_022950216.1 uncharacterized protein LOC111453374 [Cucurbita moschata] | 1.1e-82 | 87.36
Query:  MQRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVF
        MQ+  GFVVCILVI+MDVTAGILGIQAE+AQNKVNHFKMWIFECKDPSYNAFKLGLAAAILL LAHAIANL GGC++VRSA EYKGLT+NKQLAAGSLVF
Subjt:  MQRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVF

Query:  AWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV
        AWIA+VVGFSLLISGAMYNTRSRKSCGLAHN LLSIGGI CFVHGLFAVAYYVSATA +REE +    PPPQG+PAGATGHV
Subjt:  AWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV

XP_022977369.1 uncharacterized protein LOC111477717 [Cucurbita maxima] | 6.2e-83 | 87.91
Query:  MQRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVF
        MQ+  GFVVCILVI+MDVTAGILGIQAE+AQNKVNHFKMWIFECKDPSYNAFKLGLAAAILL LAHAIANL GGC++VRSA EYKGLT+NKQLAAGSLVF
Subjt:  MQRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVF

Query:  AWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV
        AWIALVVGFSLLISGAMYNTRSRKSCGLAH+ LLSIGGIVCFVHGLFAVAYYVSATA HRE  +    PPPQG+PAGATGHV
Subjt:  AWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV

XP_038881391.1 uncharacterized protein LOC120072925 [Benincasa hispida] | 4.3e-84 | 90.11
Query:  MQRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVF
        MQ+TYGF+VC+L+IIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILL LAHAIANLAGG IFVRSA EYKGLT NKQLA GSLVF
Subjt:  MQRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVF

Query:  AWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV
        AWIALVVGFSLLISGAMYNTRSRKSCGLAHN+LLSIGGIVCFVHGLFAVAYYVSATAEHREE K    PPPQGNPA A G V
Subjt:  AWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV

TrEMBL top hits | e value | % identity
A0A0A0KLA9 Uncharacterized protein | 6.7e-83 | 87.36
Query:  MQRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVF
        M++TYG VVCIL+II+DVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANL GGCI VRSA +YKGL+ NKQLA GSL+F
Subjt:  MQRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVF

Query:  AWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV
        AWIALVVGFSLLISGAMYNTRSRKSCGLAHN+LLSIGGIVCFVHGLFAVAYYVS TA  RE+ K    PPPQGNPAGATGHV
Subjt:  AWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV

A0A1S3AZY5 uncharacterized protein LOC103484578 | 6.7e-83 | 87.36
Query:  MQRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVF
        M++TYG VVCIL+II+DVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANL GGCIFVRSA +YKGL+ NKQLAAGSL+F
Subjt:  MQRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVF

Query:  AWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV
        AWIAL+VGFSLLISGAMYNTRSRKSCGLAHN+LLSIGGIVCFVHGLFAVAYYVS TA  RE+ K    PPPQGNPA ATGHV
Subjt:  AWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV

A0A5D3CMA3 Uncharacterized protein | 6.7e-83 | 87.36
Query:  MQRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVF
        M++TYG VVCIL+II+DVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANL GGCIFVRSA +YKGL+ NKQLAAGSL+F
Subjt:  MQRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVF

Query:  AWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV
        AWIAL+VGFSLLISGAMYNTRSRKSCGLAHN+LLSIGGIVCFVHGLFAVAYYVS TA  RE+ K    PPPQGNPA ATGHV
Subjt:  AWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV

A0A6J1GE80 uncharacterized protein LOC111453374 | 5.1e-83 | 87.36
Query:  MQRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVF
        MQ+  GFVVCILVI+MDVTAGILGIQAE+AQNKVNHFKMWIFECKDPSYNAFKLGLAAAILL LAHAIANL GGC++VRSA EYKGLT+NKQLAAGSLVF
Subjt:  MQRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVF

Query:  AWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV
        AWIA+VVGFSLLISGAMYNTRSRKSCGLAHN LLSIGGI CFVHGLFAVAYYVSATA +REE +    PPPQG+PAGATGHV
Subjt:  AWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV

A0A6J1IR67 uncharacterized protein LOC111477717 | 3.0e-83 | 87.91
Query:  MQRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVF
        MQ+  GFVVCILVI+MDVTAGILGIQAE+AQNKVNHFKMWIFECKDPSYNAFKLGLAAAILL LAHAIANL GGC++VRSA EYKGLT+NKQLAAGSLVF
Subjt:  MQRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVF

Query:  AWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV
        AWIALVVGFSLLISGAMYNTRSRKSCGLAH+ LLSIGGIVCFVHGLFAVAYYVSATA HRE  +    PPPQG+PAGATGHV
Subjt:  AWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT1G05291.1 Protein of unknown function (DUF1218) | 1.4e-19 | 36.57
Query:  VVCILVII-MDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNK---QLAAGSLVFAWI
        +VCI++ + +D+ AG +G+QA+ AQ  V H K+   ECK PS  AF LG+ A   LA AH  AN+  GC           L  NK         L   W+
Subjt:  VVCILVII-MDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNK---QLAAGSLVFAWI

Query:  ALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVS---ATAEHREEMKPPPPPPPQGNP
          + G  +L +G   NT SR  C   +N + SIGG VCF+H + +  YY+S   A A H    KP    P +  P
Subjt:  ALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVS---ATAEHREEMKPPPPPPPQGNP

AT1G11500.1 Protein of unknown function (DUF1218) | 3.4e-31 | 42.01
Query:  MQRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFE------C-KDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQL
        M+   GF+V +++I  D+TA +LGI+AEIAQ+K  H             C + PS  AF  G+AA +LL + H +AN+ GGC ++RS  ++K  T NK L
Subjt:  MQRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFE------C-KDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQL

Query:  AAGSLVFAWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREE
        A   LV +WI  VV +S L+ G + N+R+ + C L H     IGGI C  HG+   AYYVSA A  +E+
Subjt:  AAGSLVFAWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREE

AT2G32280.1 Protein of unknown function (DUF1218) | 2.9e-46 | 50.31
Query:  MQRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVF
        M +  G +VC++++ +DV A ILGIQAE+AQN+V H ++W+FEC++PS +AF+LGL AA +L +AH + NL GGC+ + S DE++  ++ +Q++   LV 
Subjt:  MQRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVF

Query:  AWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHRE
         WI   VGF  ++ G M N++SR SCG  H+  LSIGGI+CF+H LF VAYYVSATA   E
Subjt:  AWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHRE

AT4G21310.1 Protein of unknown function (DUF1218) | 1.1e-58 | 65.43
Query:  MQRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVF
        M R  GF +CIL++ MDV+AGILGI+AEIAQNKV H KMWIFEC+DPSY AFK GLAA ILL LAH  AN  GGC+ V S  + +  + NKQLA  SL+F
Subjt:  MQRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVF

Query:  AWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREE
         WI L + FS+LI G M N+RSRK+CG++H+R+LSIGGI+CFVHGLFAVAYY+SATA  RE+
Subjt:  AWIALVVGFSLLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREE
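The % identity values listed with each hit can be approximated from the middle line of a BLAST-style alignment, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The following is a minimal sketch under that assumption; the function name `midline_identity` and its `aligned_length` parameter are hypothetical and not part of CuGenDB or BLAST.

```python
from typing import Optional


def midline_identity(midline: str, aligned_length: Optional[int] = None) -> float:
    """Return percent identity estimated from a BLAST-style alignment midline.

    Letters in the midline are counted as identical positions. Because trailing
    spaces are often stripped when a page is copied, the true alignment length
    can be passed explicitly; otherwise the midline length is used.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    length = aligned_length if aligned_length is not None else len(midline)
    if length <= 0:
        raise ValueError("alignment length must be positive")
    return 100.0 * identical / length


# Toy midline: 6 identical positions out of 9 aligned columns.
print(round(midline_identity("M++TYG VV"), 2))
```

For a multi-block alignment such as those above, the midlines of all blocks would be concatenated first, and the query length or full alignment length supplied as `aligned_length`.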


Sequences
CDS sequence
ATGCAGAGAACTTATGGTTTTGTTGTCTGCATCTTGGTTATTATTATGGATGTTACAGCCGGAATTCTTGGAATTCAAGCTGAAATAGCTCAAAACAAGGTGAACCATTT
CAAAATGTGGATATTTGAGTGTAAAGACCCAAGCTATAATGCTTTCAAGCTAGGTTTGGCTGCAGCCATACTGCTTGCCCTTGCTCACGCCATTGCCAACCTGGCTGGTG
GGTGCATTTTTGTTCGATCTGCTGACGAATACAAAGGATTAACCACTAACAAGCAACTTGCTGCGGGTTCACTCGTCTTTGCATGGATTGCGCTAGTGGTCGGATTCTCC
TTGCTTATTTCCGGGGCGATGTATAACACAAGGTCGAGAAAATCATGCGGACTTGCTCACAATCGTCTTCTGTCTATAGGGGGGATTGTGTGCTTCGTACACGGCTTGTT
TGCAGTTGCATATTACGTTTCTGCCACAGCTGAACACAGGGAGGAGATGAAGCCGCCGCCGCCGCCACCACCACAGGGGAATCCTGCTGGGGCTACAGGCCATGTTTAG
mRNA sequence
ATGCAGAGAACTTATGGTTTTGTTGTCTGCATCTTGGTTATTATTATGGATGTTACAGCCGGAATTCTTGGAATTCAAGCTGAAATAGCTCAAAACAAGGTGAACCATTT
CAAAATGTGGATATTTGAGTGTAAAGACCCAAGCTATAATGCTTTCAAGCTAGGTTTGGCTGCAGCCATACTGCTTGCCCTTGCTCACGCCATTGCCAACCTGGCTGGTG
GGTGCATTTTTGTTCGATCTGCTGACGAATACAAAGGATTAACCACTAACAAGCAACTTGCTGCGGGTTCACTCGTCTTTGCATGGATTGCGCTAGTGGTCGGATTCTCC
TTGCTTATTTCCGGGGCGATGTATAACACAAGGTCGAGAAAATCATGCGGACTTGCTCACAATCGTCTTCTGTCTATAGGGGGGATTGTGTGCTTCGTACACGGCTTGTT
TGCAGTTGCATATTACGTTTCTGCCACAGCTGAACACAGGGAGGAGATGAAGCCGCCGCCGCCGCCACCACCACAGGGGAATCCTGCTGGGGCTACAGGCCATGTTTAG
Protein sequence
MQRTYGFVVCILVIIMDVTAGILGIQAEIAQNKVNHFKMWIFECKDPSYNAFKLGLAAAILLALAHAIANLAGGCIFVRSADEYKGLTTNKQLAAGSLVFAWIALVVGFS
LLISGAMYNTRSRKSCGLAHNRLLSIGGIVCFVHGLFAVAYYVSATAEHREEMKPPPPPPPQGNPAGATGHV
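
The protein sequence above is the conceptual translation of the CDS sequence. As an illustrative sketch (not the pipeline used by the database), the relationship can be checked with the standard genetic code; the names `CODON_TABLE` and `translate` are hypothetical helpers introduced here.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS (DNA, frame 0) into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGCAGAGAACTTATGGTTTTGTTGTCTGC"))  # MQRTYGFVVC
```

Passing the full CDS, with its line breaks left in, should reproduce the protein sequence shown, since `translate` strips whitespace before reading codons.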