CuGenDBv2

HG10020802 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10020802
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Protein of unknown function (DUF1118)
Genome location: Chr05:2555441..2556131
RNA-Seq Expression: HG10020802
Synteny: HG10020802
Gene Ontology terms:
GO:0032774 - RNA biosynthetic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003899 - DNA-directed 5'-3' RNA polymerase activity (molecular function)
InterPro domains: IPR009500 - Protein of unknown function DUF1118
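
The genome location above is written as `chromosome:start..end`. A minimal Python sketch for splitting it into its parts and computing the span is given here. The function name `parse_location` is hypothetical, and the length calculation assumes 1-based inclusive coordinates, which this page does not state.

```python
import re

def parse_location(location):
    """Split a string such as 'Chr05:2555441..2556131' into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

chrom, start, end = parse_location("Chr05:2555441..2556131")
# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_length = end - start + 1
print(chrom, start, end, span_length)
```

Under that assumption the gene spans 691 bp on Chr05.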


Homology
GenBank top hits (e value, % identity, alignment)
XP_004152441.1 uncharacterized protein LOC101214681 [Cucumis sativus]  e value: 1.3e-81  % identity: 89.81
Query:  MAVTSQSPSSSAATAAPSHLPNRFSISRFRHVSNHAR-----PSRSNPLTIVAMAPQKKVNKYDGAWEKKWFGAGIFYESAEDVEVDVFKKLEKKKVLSN
        MAVTSQSPSSS  TA  SHL NRFSISRF H+SN+AR      SRSNPLTI AMAPQKKVNKYD AWEKKWFGAGIFYESAEDVEVDVFKKLE KKVLSN
Subjt:  MAVTSQSPSSSAATAAPSHLPNRFSISRFRHVSNHAR-----PSRSNPLTIVAMAPQKKVNKYDGAWEKKWFGAGIFYESAEDVEVDVFKKLEKKKVLSN

Query:  VEKAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALAAIVIIPDDSVALVVLQAVVGGGLALGAAGLFVGSVVLG
        VEKAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAAL AIV+IPDDSVALV LQAVVGGGLALGAAGL VGSVVLG
Subjt:  VEKAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALAAIVIIPDDSVALVVLQAVVGGGLALGAAGLFVGSVVLG

Query:  GLQEAD
        GLQEAD
Subjt:  GLQEAD

XP_008437595.1 PREDICTED: uncharacterized protein LOC103482961 [Cucumis melo]  e value: 1.2e-79  % identity: 88.67
Query:  VTSQSPSSSAATAAPSHLPNRFSISRFRHVSNHAR----PSRSNPLTIVAMAPQKKVNKYDGAWEKKWFGAGIFYESAEDVEVDVFKKLEKKKVLSNVEK
        +TS SPSSS  TA+ SH PNRFSISR  H+SNH R     SRSNPLTIVAMAPQKKVNKYD AWEKKWFGAGIFYESAEDVEVDVFKKLE KKVLSNVEK
Subjt:  VTSQSPSSSAATAAPSHLPNRFSISRFRHVSNHAR----PSRSNPLTIVAMAPQKKVNKYDGAWEKKWFGAGIFYESAEDVEVDVFKKLEKKKVLSNVEK

Query:  AGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALAAIVIIPDDSVALVVLQAVVGGGLALGAAGLFVGSVVLGGLQ
        AGLLSKAEELG TLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAAL AIV+IPDDSVALV LQAVVGGGLALGAAGL VGSVVLGGLQ
Subjt:  AGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALAAIVIIPDDSVALVVLQAVVGGGLALGAAGLFVGSVVLGGLQ

Query:  EAD
        EAD
Subjt:  EAD

XP_022137512.1 uncharacterized protein LOC111008941 [Momordica charantia]  e value: 8.4e-81  % identity: 87.68
Query:  MAVTSQSPSSSAATAAPSHLPNRFSISRFRHVSNHAR--PSRSNPLTIVAMAPQKKVNKYDGAWEKKWFGAGIFYESAEDVEVDVFKKLEKKKVLSNVEK
        MAV S S +++AATAAPSHLPN+F  S+FRH  NHAR  PSRSN LTIVAMAP+KKVNKYD  W+KKWFGAGIFYES+EDVEVDVFKKLE KKVLSNVEK
Subjt:  MAVTSQSPSSSAATAAPSHLPNRFSISRFRHVSNHAR--PSRSNPLTIVAMAPQKKVNKYDGAWEKKWFGAGIFYESAEDVEVDVFKKLEKKKVLSNVEK

Query:  AGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALAAIVIIPDDSVALVVLQAVVGGGLALGAAGLFVGSVVLGGLQ
        AGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALAAIV+IPDDS ALV LQAVVGGGLALGAAGL VGSVVLGGLQ
Subjt:  AGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALAAIVIIPDDSVALVVLQAVVGGGLALGAAGLFVGSVVLGGLQ

Query:  EAD
        EAD
Subjt:  EAD

XP_023000865.1 uncharacterized protein LOC111495182 [Cucurbita maxima]  e value: 1.6e-79  % identity: 87.75
Query:  MAVTSQSPSSSAATAAPSHLPNRFSISRFRHVSNHARP---SRSNPLTIVAMAPQKKVNKYDGAWEKKWFGAGIFYESAEDVEVDVFKKLEKKKVLSNVE
        MAVT+QSPSSS   AAPS LPNRFSISRFRH SNHARP   SRSNPL I+AMA QKKVNKYD  WEKKWFGAGIFYES+EDVEVDVFKKLE KKVLSNVE
Subjt:  MAVTSQSPSSSAATAAPSHLPNRFSISRFRHVSNHARP---SRSNPLTIVAMAPQKKVNKYDGAWEKKWFGAGIFYESAEDVEVDVFKKLEKKKVLSNVE

Query:  KAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALAAIVIIPDDSVALVVLQAVVGGGLALGAAGLFVGSVVLGGL
        KAGLLSKAEELGFTLSSIEKLGVFSKAEE GLLSLLEKVAS+SPS LASLALPILVAALAAIV+IPDDS  LV LQAVV GGL LGAAGLFVGSVVLGGL
Subjt:  KAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALAAIVIIPDDSVALVVLQAVVGGGLALGAAGLFVGSVVLGGL

Query:  QEAD
        QEAD
Subjt:  QEAD

XP_038895665.1 uncharacterized protein LOC120083851 [Benincasa hispida]  e value: 2.3e-86  % identity: 92.65
Query:  MAVTSQSPSSSAATAAPSHLPNRFSISRFRHVSNHARP---SRSNPLTIVAMAPQKKVNKYDGAWEKKWFGAGIFYESAEDVEVDVFKKLEKKKVLSNVE
        MAVTSQSPSSSAATAAPSHL NRFSI RFRHVSNH+RP   S  NPLTIVAMAPQKKVNKYD AWEKKWFGAGIFYESAEDVEVDVFKKLE KKVLSNVE
Subjt:  MAVTSQSPSSSAATAAPSHLPNRFSISRFRHVSNHARP---SRSNPLTIVAMAPQKKVNKYDGAWEKKWFGAGIFYESAEDVEVDVFKKLEKKKVLSNVE

Query:  KAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALAAIVIIPDDSVALVVLQAVVGGGLALGAAGLFVGSVVLGGL
        KAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEK+ SSSPSALASLALPILV ALAAIV+IPDDSVALVVLQAVVGGGLALGAAGL VGSVVLGGL
Subjt:  KAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALAAIVIIPDDSVALVVLQAVVGGGLALGAAGLFVGSVVLGGL

Query:  QEAD
        QEAD
Subjt:  QEAD

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LW40 Uncharacterized protein  e value: 6.2e-82  % identity: 89.81
Query:  MAVTSQSPSSSAATAAPSHLPNRFSISRFRHVSNHAR-----PSRSNPLTIVAMAPQKKVNKYDGAWEKKWFGAGIFYESAEDVEVDVFKKLEKKKVLSN
        MAVTSQSPSSS  TA  SHL NRFSISRF H+SN+AR      SRSNPLTI AMAPQKKVNKYD AWEKKWFGAGIFYESAEDVEVDVFKKLE KKVLSN
Subjt:  MAVTSQSPSSSAATAAPSHLPNRFSISRFRHVSNHAR-----PSRSNPLTIVAMAPQKKVNKYDGAWEKKWFGAGIFYESAEDVEVDVFKKLEKKKVLSN

Query:  VEKAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALAAIVIIPDDSVALVVLQAVVGGGLALGAAGLFVGSVVLG
        VEKAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAAL AIV+IPDDSVALV LQAVVGGGLALGAAGL VGSVVLG
Subjt:  VEKAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALAAIVIIPDDSVALVVLQAVVGGGLALGAAGLFVGSVVLG

Query:  GLQEAD
        GLQEAD
Subjt:  GLQEAD

A0A1S3AV09 uncharacterized protein LOC103482961  e value: 5.8e-80  % identity: 88.67
Query:  VTSQSPSSSAATAAPSHLPNRFSISRFRHVSNHAR----PSRSNPLTIVAMAPQKKVNKYDGAWEKKWFGAGIFYESAEDVEVDVFKKLEKKKVLSNVEK
        +TS SPSSS  TA+ SH PNRFSISR  H+SNH R     SRSNPLTIVAMAPQKKVNKYD AWEKKWFGAGIFYESAEDVEVDVFKKLE KKVLSNVEK
Subjt:  VTSQSPSSSAATAAPSHLPNRFSISRFRHVSNHAR----PSRSNPLTIVAMAPQKKVNKYDGAWEKKWFGAGIFYESAEDVEVDVFKKLEKKKVLSNVEK

Query:  AGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALAAIVIIPDDSVALVVLQAVVGGGLALGAAGLFVGSVVLGGLQ
        AGLLSKAEELG TLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAAL AIV+IPDDSVALV LQAVVGGGLALGAAGL VGSVVLGGLQ
Subjt:  AGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALAAIVIIPDDSVALVVLQAVVGGGLALGAAGLFVGSVVLGGLQ

Query:  EAD
        EAD
Subjt:  EAD

A0A5D3BL78 DUF1118 domain-containing protein  e value: 5.8e-80  % identity: 88.67
Query:  VTSQSPSSSAATAAPSHLPNRFSISRFRHVSNHAR----PSRSNPLTIVAMAPQKKVNKYDGAWEKKWFGAGIFYESAEDVEVDVFKKLEKKKVLSNVEK
        +TS SPSSS  TA+ SH PNRFSISR  H+SNH R     SRSNPLTIVAMAPQKKVNKYD AWEKKWFGAGIFYESAEDVEVDVFKKLE KKVLSNVEK
Subjt:  VTSQSPSSSAATAAPSHLPNRFSISRFRHVSNHAR----PSRSNPLTIVAMAPQKKVNKYDGAWEKKWFGAGIFYESAEDVEVDVFKKLEKKKVLSNVEK

Query:  AGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALAAIVIIPDDSVALVVLQAVVGGGLALGAAGLFVGSVVLGGLQ
        AGLLSKAEELG TLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAAL AIV+IPDDSVALV LQAVVGGGLALGAAGL VGSVVLGGLQ
Subjt:  AGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALAAIVIIPDDSVALVVLQAVVGGGLALGAAGLFVGSVVLGGLQ

Query:  EAD
        EAD
Subjt:  EAD

A0A6J1C8G3 uncharacterized protein LOC111008941  e value: 4.0e-81  % identity: 87.68
Query:  MAVTSQSPSSSAATAAPSHLPNRFSISRFRHVSNHAR--PSRSNPLTIVAMAPQKKVNKYDGAWEKKWFGAGIFYESAEDVEVDVFKKLEKKKVLSNVEK
        MAV S S +++AATAAPSHLPN+F  S+FRH  NHAR  PSRSN LTIVAMAP+KKVNKYD  W+KKWFGAGIFYES+EDVEVDVFKKLE KKVLSNVEK
Subjt:  MAVTSQSPSSSAATAAPSHLPNRFSISRFRHVSNHAR--PSRSNPLTIVAMAPQKKVNKYDGAWEKKWFGAGIFYESAEDVEVDVFKKLEKKKVLSNVEK

Query:  AGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALAAIVIIPDDSVALVVLQAVVGGGLALGAAGLFVGSVVLGGLQ
        AGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALAAIV+IPDDS ALV LQAVVGGGLALGAAGL VGSVVLGGLQ
Subjt:  AGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALAAIVIIPDDSVALVVLQAVVGGGLALGAAGLFVGSVVLGGLQ

Query:  EAD
        EAD
Subjt:  EAD

A0A6J1KJH4 uncharacterized protein LOC111495182  e value: 7.6e-80  % identity: 87.75
Query:  MAVTSQSPSSSAATAAPSHLPNRFSISRFRHVSNHARP---SRSNPLTIVAMAPQKKVNKYDGAWEKKWFGAGIFYESAEDVEVDVFKKLEKKKVLSNVE
        MAVT+QSPSSS   AAPS LPNRFSISRFRH SNHARP   SRSNPL I+AMA QKKVNKYD  WEKKWFGAGIFYES+EDVEVDVFKKLE KKVLSNVE
Subjt:  MAVTSQSPSSSAATAAPSHLPNRFSISRFRHVSNHARP---SRSNPLTIVAMAPQKKVNKYDGAWEKKWFGAGIFYESAEDVEVDVFKKLEKKKVLSNVE

Query:  KAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALAAIVIIPDDSVALVVLQAVVGGGLALGAAGLFVGSVVLGGL
        KAGLLSKAEELGFTLSSIEKLGVFSKAEE GLLSLLEKVAS+SPS LASLALPILVAALAAIV+IPDDS  LV LQAVV GGL LGAAGLFVGSVVLGGL
Subjt:  KAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALAAIVIIPDDSVALVVLQAVVGGGLALGAAGLFVGSVVLGGL

Query:  QEAD
        QEAD
Subjt:  QEAD

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G74730.1 Protein of unknown function (DUF1118)  e value: 6.7e-52  % identity: 61.19
Query:  MAVTSQSPSSSAATAAPSHLPNRFSISRFRHVSNHARPSRSNPLTIVAMAPQKKVNKYDGAWEKKWFGAGIFYESAEDVEVDVFKKLEKKKVLSNVEKAG
        MAV     SS AA      L N   + RFR   +  +   +   ++VAMAPQKKVNKYD  W+K+W+GAG+F+E +E + VDVFKKLEK+KVLSNVEK+G
Subjt:  MAVTSQSPSSSAATAAPSHLPNRFSISRFRHVSNHARPSRSNPLTIVAMAPQKKVNKYDGAWEKKWFGAGIFYESAEDVEVDVFKKLEKKKVLSNVEKAG

Query:  LLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALAAIVIIPDDSVALVVLQAVVGGGLALGAAGLFVGSVVLGGLQEA
        LLSKAE LG TLSS+EKL VFSKAE+LGLLSLLE +A +SP+ LAS ALP L AA+ A+V+IPDDS  LVV QAV+ G LAL    L VGSVVL GLQEA
Subjt:  LLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALAAIVIIPDDSVALVVLQAVVGGGLALGAAGLFVGSVVLGGLQEA

Query:  D
        D
Subjt:  D

AT5G08050.1 Protein of unknown function (DUF1118)  e value: 3.7e-10  % identity: 36.72
Query:  ESAEDVEVDVFKKLEKKKVLSNVEKAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALAAIVIIPDDSVALVVLQ
        +S    +V +  ++E+ K+L+  EKAGLLS AE+ GF+LS+IE+LG+ +KAEE G+LS        +P  L +L+L +L+       ++P+D    VV+Q
Subjt:  ESAEDVEVDVFKKLEKKKVLSNVEKAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALAAIVIIPDDSVALVVLQ

Query:  AVVGGGLALGAAGLFVGSVVLGGLQEAD
         +V     LG +  F  S  +  LQ++D
Subjt:  AVVGGGLALGAAGLFVGSVVLGGLQEAD
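
The % identity values listed with each hit appear to follow the usual BLAST convention: identical columns divided by alignment length. As a rough sketch, the figure for one alignment block can be recomputed from its query line and midline, where a letter in the midline marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The function name `percent_identity` is hypothetical, and the sketch assumes the two strings have already been extracted with equal lengths and intact spacing.

```python
def percent_identity(query, midline):
    """Return identical columns / alignment length * 100 for one alignment block."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have equal length")
    if not query:
        raise ValueError("empty alignment")
    # A letter in the midline (not '+' and not a space) marks an identical column.
    identical = sum(1 for symbol in midline if symbol.isalpha())
    return 100.0 * identical / len(query)

# Final alignment block of the AT5G08050.1 hit, copied from the listing above.
query = "AVVGGGLALGAAGLFVGSVVLGGLQEAD"
midline = " +V     LG +  F  S  +  LQ++D"
print(round(percent_identity(query, midline), 2))
```

The value for a single block will generally differ from the figure reported for the hit, which is computed over the whole alignment rather than one block.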


Sequences
CDS sequence
ATGGCGGTCACTTCACAATCTCCATCTTCCTCCGCTGCTACTGCTGCTCCCTCTCACCTTCCCAACCGCTTCTCCATTTCCAGATTCCGCCACGTCTCTAACCATGCCCG
CCCTTCCAGATCCAATCCACTCACCATCGTCGCCATGGCCCCTCAGAAGAAGGTGAACAAATACGACGGCGCTTGGGAGAAAAAATGGTTTGGAGCTGGAATCTTTTACG
AGAGCGCGGAGGATGTGGAAGTCGATGTGTTCAAGAAGCTTGAGAAGAAGAAAGTGCTTAGCAATGTCGAAAAAGCAGGGCTGTTATCGAAGGCGGAGGAATTAGGGTTT
ACGCTTTCTTCGATTGAGAAATTGGGGGTTTTCTCAAAAGCTGAGGAGTTAGGGCTTCTTAGCTTGCTTGAGAAAGTTGCGAGCTCTTCTCCGTCGGCTTTGGCTTCTCT
TGCTCTTCCCATTCTCGTGGCGGCACTGGCGGCGATTGTTATCATTCCCGATGACTCGGTGGCGCTTGTAGTGTTGCAGGCGGTGGTCGGTGGCGGACTGGCGCTTGGGG
CTGCCGGATTGTTCGTTGGATCGGTGGTGCTGGGTGGGTTGCAGGAAGCTGATTGA
mRNA sequence
ATGGCGGTCACTTCACAATCTCCATCTTCCTCCGCTGCTACTGCTGCTCCCTCTCACCTTCCCAACCGCTTCTCCATTTCCAGATTCCGCCACGTCTCTAACCATGCCCG
CCCTTCCAGATCCAATCCACTCACCATCGTCGCCATGGCCCCTCAGAAGAAGGTGAACAAATACGACGGCGCTTGGGAGAAAAAATGGTTTGGAGCTGGAATCTTTTACG
AGAGCGCGGAGGATGTGGAAGTCGATGTGTTCAAGAAGCTTGAGAAGAAGAAAGTGCTTAGCAATGTCGAAAAAGCAGGGCTGTTATCGAAGGCGGAGGAATTAGGGTTT
ACGCTTTCTTCGATTGAGAAATTGGGGGTTTTCTCAAAAGCTGAGGAGTTAGGGCTTCTTAGCTTGCTTGAGAAAGTTGCGAGCTCTTCTCCGTCGGCTTTGGCTTCTCT
TGCTCTTCCCATTCTCGTGGCGGCACTGGCGGCGATTGTTATCATTCCCGATGACTCGGTGGCGCTTGTAGTGTTGCAGGCGGTGGTCGGTGGCGGACTGGCGCTTGGGG
CTGCCGGATTGTTCGTTGGATCGGTGGTGCTGGGTGGGTTGCAGGAAGCTGATTGA
Protein sequence
MAVTSQSPSSSAATAAPSHLPNRFSISRFRHVSNHARPSRSNPLTIVAMAPQKKVNKYDGAWEKKWFGAGIFYESAEDVEVDVFKKLEKKKVLSNVEKAGLLSKAEELGF
TLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALAAIVIIPDDSVALVVLQAVVGGGLALGAAGLFVGSVVLGGLQEAD
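
The protein sequence above should correspond to a translation of the CDS. A minimal sketch using the standard genetic code is given here, assuming the CDS is in frame from its first base and ends with a stop codon. The names `translate` and `CODON_TABLE` are hypothetical, and only the first eight and last six codons of the CDS are translated for brevity.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

cds_start = "ATGGCGGTCACTTCACAATCTCCA"  # first 24 nt of the CDS above
cds_end = "TTGCAGGAAGCTGATTGA"          # last 18 nt of the CDS above
print(translate(cds_start), translate(cds_end))
```

The prefix translates to MAVTSQSP and the tail to LQEAD, matching the start and end of the protein sequence listed above. Joining the CDS lines into a single string and passing it to `translate` would check the full sequence in the same way.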