CuGenDBv2

CsGy1G007770 (gene) of Cucumber (Gy14) v2.1 genome

Gene ID: CsGy1G007770
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: Protein of unknown function (DUF1118)
Genome location: Gy14Chr1:4987912..4989175
RNA-Seq Expression: CsGy1G007770
Synteny: CsGy1G007770
Gene Ontology terms:
  GO:0032774 - RNA biosynthetic process (biological process)
  GO:0016021 - integral component of membrane (cellular component)
  GO:0003899 - DNA-directed 5'-3' RNA polymerase activity (molecular function)
InterPro domains: IPR009500 - Protein of unknown function DUF1118


Homology
GenBank top hits (e value, % identity, alignment)
KAG6584260.1 hypothetical protein SDJN03_20192, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.45e-102 | % identity: 87.25
Query:  MAVTSQSPSSSTAPLSHLHNRFSISRFHHISNYARRRPFSSSRSNPLTIFAMAPQKKVNKYDDAWEKKWFGAGIFYESAEDVEVDVFKKLETKKVLSNVE
        MAVT+QSPSSS AP S LHNRFSISRF H SN+AR RP  SS SN L I AMAPQKKVNKYDD WEKKWFGAGIFYES+EDVEVDVFKKLETKKVLSNVE
Subjt:  MAVTSQSPSSSTAPLSHLHNRFSISRFHHISNYARRRPFSSSRSNPLTIFAMAPQKKVNKYDDAWEKKWFGAGIFYESAEDVEVDVFKKLETKKVLSNVE

Query:  KAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALVAIVVIPDDSVALVALQAVVGGGLALGAAGLLVGSVVLGGL
        KAGLLSKAEELGFTLSSIEKLGVFSKAEE GLLSLLEKVAS+SPS LASLALPILVAAL AIV+IPDDS  LVALQAVV GGL LGAAGL VGSVVLGGL
Subjt:  KAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALVAIVVIPDDSVALVALQAVVGGGLALGAAGLLVGSVVLGGL

Query:  QEAD
        QEAD
Subjt:  QEAD

XP_004152441.1 uncharacterized protein LOC101214681 [Cucumis sativus] | e value: 1.37e-126 | % identity: 100
Query:  MAVTSQSPSSSTAPLSHLHNRFSISRFHHISNYARRRPFSSSRSNPLTIFAMAPQKKVNKYDDAWEKKWFGAGIFYESAEDVEVDVFKKLETKKVLSNVE
        MAVTSQSPSSSTAPLSHLHNRFSISRFHHISNYARRRPFSSSRSNPLTIFAMAPQKKVNKYDDAWEKKWFGAGIFYESAEDVEVDVFKKLETKKVLSNVE
Subjt:  MAVTSQSPSSSTAPLSHLHNRFSISRFHHISNYARRRPFSSSRSNPLTIFAMAPQKKVNKYDDAWEKKWFGAGIFYESAEDVEVDVFKKLETKKVLSNVE

Query:  KAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALVAIVVIPDDSVALVALQAVVGGGLALGAAGLLVGSVVLGGL
        KAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALVAIVVIPDDSVALVALQAVVGGGLALGAAGLLVGSVVLGGL
Subjt:  KAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALVAIVVIPDDSVALVALQAVVGGGLALGAAGLLVGSVVLGGL

Query:  QEAD
        QEAD
Subjt:  QEAD

XP_008437595.1 PREDICTED: uncharacterized protein LOC103482961 [Cucumis melo] | e value: 1.01e-112 | % identity: 94.06
Query:  VTSQSPSSSTAPLSHLHNRFSISRFHHISNYARRRPFSSSRSNPLTIFAMAPQKKVNKYDDAWEKKWFGAGIFYESAEDVEVDVFKKLETKKVLSNVEKA
        +TS SPSSSTA LSH  NRFSISR HHISN+ RR PF SSRSNPLTI AMAPQKKVNKYDDAWEKKWFGAGIFYESAEDVEVDVFKKLETKKVLSNVEKA
Subjt:  VTSQSPSSSTAPLSHLHNRFSISRFHHISNYARRRPFSSSRSNPLTIFAMAPQKKVNKYDDAWEKKWFGAGIFYESAEDVEVDVFKKLETKKVLSNVEKA

Query:  GLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALVAIVVIPDDSVALVALQAVVGGGLALGAAGLLVGSVVLGGLQE
        GLLSKAEELG TLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALVAIVVIPDDSVALVALQAVVGGGLALGAAGLLVGSVVLGGLQE
Subjt:  GLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALVAIVVIPDDSVALVALQAVVGGGLALGAAGLLVGSVVLGGLQE

Query:  AD
        AD
Subjt:  AD

XP_023000865.1 uncharacterized protein LOC111495182 [Cucurbita maxima] | e value: 4.16e-102 | % identity: 87.25
Query:  MAVTSQSPSSSTAPLSHLHNRFSISRFHHISNYARRRPFSSSRSNPLTIFAMAPQKKVNKYDDAWEKKWFGAGIFYESAEDVEVDVFKKLETKKVLSNVE
        MAVT+QSPSSS AP S L NRFSISRF H SN+AR RP  SSRSNPL I AMA QKKVNKYDD WEKKWFGAGIFYES+EDVEVDVFKKLETKKVLSNVE
Subjt:  MAVTSQSPSSSTAPLSHLHNRFSISRFHHISNYARRRPFSSSRSNPLTIFAMAPQKKVNKYDDAWEKKWFGAGIFYESAEDVEVDVFKKLETKKVLSNVE

Query:  KAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALVAIVVIPDDSVALVALQAVVGGGLALGAAGLLVGSVVLGGL
        KAGLLSKAEELGFTLSSIEKLGVFSKAEE GLLSLLEKVAS+SPS LASLALPILVAAL AIV+IPDDS  LVALQAVV GGL LGAAGL VGSVVLGGL
Subjt:  KAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALVAIVVIPDDSVALVALQAVVGGGLALGAAGLLVGSVVLGGL

Query:  QEAD
        QEAD
Subjt:  QEAD

XP_038895665.1 uncharacterized protein LOC120083851 [Benincasa hispida] | e value: 1.52e-107 | % identity: 89.32
Query:  MAVTSQSPSSS--TAPLSHLHNRFSISRFHHISNYARRRPFSSSRSNPLTIFAMAPQKKVNKYDDAWEKKWFGAGIFYESAEDVEVDVFKKLETKKVLSN
        MAVTSQSPSSS  TA  SHL NRFSI RF H+SN++R  PF SS  NPLTI AMAPQKKVNKYDDAWEKKWFGAGIFYESAEDVEVDVFKKLETKKVLSN
Subjt:  MAVTSQSPSSS--TAPLSHLHNRFSISRFHHISNYARRRPFSSSRSNPLTIFAMAPQKKVNKYDDAWEKKWFGAGIFYESAEDVEVDVFKKLETKKVLSN

Query:  VEKAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALVAIVVIPDDSVALVALQAVVGGGLALGAAGLLVGSVVLG
        VEKAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEK+ SSSPSALASLALPILV AL AIV+IPDDSVALV LQAVVGGGLALGAAGLLVGSVVLG
Subjt:  VEKAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALVAIVVIPDDSVALVALQAVVGGGLALGAAGLLVGSVVLG

Query:  GLQEAD
        GLQEAD
Subjt:  GLQEAD

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LW40 Uncharacterized protein | e value: 6.61e-127 | % identity: 100
Query:  MAVTSQSPSSSTAPLSHLHNRFSISRFHHISNYARRRPFSSSRSNPLTIFAMAPQKKVNKYDDAWEKKWFGAGIFYESAEDVEVDVFKKLETKKVLSNVE
        MAVTSQSPSSSTAPLSHLHNRFSISRFHHISNYARRRPFSSSRSNPLTIFAMAPQKKVNKYDDAWEKKWFGAGIFYESAEDVEVDVFKKLETKKVLSNVE
Subjt:  MAVTSQSPSSSTAPLSHLHNRFSISRFHHISNYARRRPFSSSRSNPLTIFAMAPQKKVNKYDDAWEKKWFGAGIFYESAEDVEVDVFKKLETKKVLSNVE

Query:  KAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALVAIVVIPDDSVALVALQAVVGGGLALGAAGLLVGSVVLGGL
        KAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALVAIVVIPDDSVALVALQAVVGGGLALGAAGLLVGSVVLGGL
Subjt:  KAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALVAIVVIPDDSVALVALQAVVGGGLALGAAGLLVGSVVLGGL

Query:  QEAD
        QEAD
Subjt:  QEAD

A0A1S3AV09 uncharacterized protein LOC103482961 | e value: 4.87e-113 | % identity: 94.06
Query:  VTSQSPSSSTAPLSHLHNRFSISRFHHISNYARRRPFSSSRSNPLTIFAMAPQKKVNKYDDAWEKKWFGAGIFYESAEDVEVDVFKKLETKKVLSNVEKA
        +TS SPSSSTA LSH  NRFSISR HHISN+ RR PF SSRSNPLTI AMAPQKKVNKYDDAWEKKWFGAGIFYESAEDVEVDVFKKLETKKVLSNVEKA
Subjt:  VTSQSPSSSTAPLSHLHNRFSISRFHHISNYARRRPFSSSRSNPLTIFAMAPQKKVNKYDDAWEKKWFGAGIFYESAEDVEVDVFKKLETKKVLSNVEKA

Query:  GLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALVAIVVIPDDSVALVALQAVVGGGLALGAAGLLVGSVVLGGLQE
        GLLSKAEELG TLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALVAIVVIPDDSVALVALQAVVGGGLALGAAGLLVGSVVLGGLQE
Subjt:  GLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALVAIVVIPDDSVALVALQAVVGGGLALGAAGLLVGSVVLGGLQE

Query:  AD
        AD
Subjt:  AD

A0A5D3BL78 DUF1118 domain-containing protein | e value: 4.87e-113 | % identity: 94.06
Query:  VTSQSPSSSTAPLSHLHNRFSISRFHHISNYARRRPFSSSRSNPLTIFAMAPQKKVNKYDDAWEKKWFGAGIFYESAEDVEVDVFKKLETKKVLSNVEKA
        +TS SPSSSTA LSH  NRFSISR HHISN+ RR PF SSRSNPLTI AMAPQKKVNKYDDAWEKKWFGAGIFYESAEDVEVDVFKKLETKKVLSNVEKA
Subjt:  VTSQSPSSSTAPLSHLHNRFSISRFHHISNYARRRPFSSSRSNPLTIFAMAPQKKVNKYDDAWEKKWFGAGIFYESAEDVEVDVFKKLETKKVLSNVEKA

Query:  GLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALVAIVVIPDDSVALVALQAVVGGGLALGAAGLLVGSVVLGGLQE
        GLLSKAEELG TLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALVAIVVIPDDSVALVALQAVVGGGLALGAAGLLVGSVVLGGLQE
Subjt:  GLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALVAIVVIPDDSVALVALQAVVGGGLALGAAGLLVGSVVLGGLQE

Query:  AD
        AD
Subjt:  AD

A0A6J1E7H0 uncharacterized protein LOC111431427 | e value: 2.34e-101 | % identity: 86.76
Query:  MAVTSQSPSSSTAPLSHLHNRFSISRFHHISNYARRRPFSSSRSNPLTIFAMAPQKKVNKYDDAWEKKWFGAGIFYESAEDVEVDVFKKLETKKVLSNVE
        MAVT+QSPSSS AP S L NRFSISRF H SN+AR RP  SS SN L I AMAPQKKVNKYDD WEKKWFGAGIFYES+EDVEVDVFKKLETKKVLSNVE
Subjt:  MAVTSQSPSSSTAPLSHLHNRFSISRFHHISNYARRRPFSSSRSNPLTIFAMAPQKKVNKYDDAWEKKWFGAGIFYESAEDVEVDVFKKLETKKVLSNVE

Query:  KAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALVAIVVIPDDSVALVALQAVVGGGLALGAAGLLVGSVVLGGL
        KAGLLSKAEELGFTLSSIEKLGVFSKAEE GLLSLLEKVAS+SPS LASLALPILVAAL AIV+IPDDS  LVALQAVV GGL LGAAGL VGSVVLGGL
Subjt:  KAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALVAIVVIPDDSVALVALQAVVGGGLALGAAGLLVGSVVLGGL

Query:  QEAD
        QEAD
Subjt:  QEAD

A0A6J1KJH4 uncharacterized protein LOC111495182 | e value: 2.01e-102 | % identity: 87.25
Query:  MAVTSQSPSSSTAPLSHLHNRFSISRFHHISNYARRRPFSSSRSNPLTIFAMAPQKKVNKYDDAWEKKWFGAGIFYESAEDVEVDVFKKLETKKVLSNVE
        MAVT+QSPSSS AP S L NRFSISRF H SN+AR RP  SSRSNPL I AMA QKKVNKYDD WEKKWFGAGIFYES+EDVEVDVFKKLETKKVLSNVE
Subjt:  MAVTSQSPSSSTAPLSHLHNRFSISRFHHISNYARRRPFSSSRSNPLTIFAMAPQKKVNKYDDAWEKKWFGAGIFYESAEDVEVDVFKKLETKKVLSNVE

Query:  KAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALVAIVVIPDDSVALVALQAVVGGGLALGAAGLLVGSVVLGGL
        KAGLLSKAEELGFTLSSIEKLGVFSKAEE GLLSLLEKVAS+SPS LASLALPILVAAL AIV+IPDDS  LVALQAVV GGL LGAAGL VGSVVLGGL
Subjt:  KAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALVAIVVIPDDSVALVALQAVVGGGLALGAAGLLVGSVVLGGL

Query:  QEAD
        QEAD
Subjt:  QEAD

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G74730.1 Protein of unknown function (DUF1118) | e value: 6.8e-52 | % identity: 67.65
Query:  RRPFSSSRS-NPLTIFAMAPQKKVNKYDDAWEKKWFGAGIFYESAEDVEVDVFKKLETKKVLSNVEKAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLS
        RR FS+ +S    ++ AMAPQKKVNKYD  W+K+W+GAG+F+E +E + VDVFKKLE +KVLSNVEK+GLLSKAE LG TLSS+EKL VFSKAE+LGLLS
Subjt:  RRPFSSSRS-NPLTIFAMAPQKKVNKYDDAWEKKWFGAGIFYESAEDVEVDVFKKLETKKVLSNVEKAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLS

Query:  LLEKVASSSPSALASLALPILVAALVAIVVIPDDSVALVALQAVVGGGLALGAAGLLVGSVVLGGLQEAD
        LLE +A +SP+ LAS ALP L AA+VA+V+IPDDS  LV  QAV+ G LAL    LLVGSVVL GLQEAD
Subjt:  LLEKVASSSPSALASLALPILVAALVAIVVIPDDSVALVALQAVVGGGLALGAAGLLVGSVVLGGLQEAD

AT5G08050.1 Protein of unknown function (DUF1118) | e value: 6.4e-10 | % identity: 39.67
Query:  ESAEDVEVDVFKKLETKKVLSNVEKAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALVAIVVIPDDSV------
        +S    +V +  ++E  K+L+  EKAGLLS AE+ GF+LS+IE+LG+ +KAEE G+LS        +P  L +L+L +L+   V   V+P+D        
Subjt:  ESAEDVEVDVFKKLETKKVLSNVEKAGLLSKAEELGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALVAIVVIPDDSV------

Query:  ALVALQAVVGGGLALGAAGLL
         LVAL +V+GG  A  A+G +
Subjt:  ALVALQAVVGGGLALGAAGLL
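
The % identity reported for each hit can be related to the middle line of its alignment. In BLAST-style output, a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The Python sketch below assumes that convention and assumes identity is computed as identical columns divided by alignment length. The function name `percent_identity` is illustrative and not part of CuGenDB.

```python
def percent_identity(midlines):
    """Return (identical, aligned_length, percent) for BLAST-style midlines.

    `midlines` holds one middle-line string per alignment block, each
    trimmed to exactly the columns spanned by its Query line.
    Assumption: letters mark identities; '+' and spaces do not count.
    """
    identical = sum(ch.isalpha() for line in midlines for ch in line)
    length = sum(len(line) for line in midlines)
    if length == 0:
        raise ValueError("no aligned columns")
    return identical, length, 100.0 * identical / length


# Middle line of the final block of the AT5G08050.1 alignment (21 columns).
last_block = " LVAL +V+GG  A  A+G +"
identical, length, pct = percent_identity([last_block])
print(identical, length, round(pct, 2))
```

To check a whole hit, pass the middle lines of all of its blocks. This only works if the column spacing of the page text has been preserved, since collapsed spaces would shift the count.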


Sequences
CDS sequence
ATGGCGGTCACTTCACAATCTCCATCTTCCTCCACTGCTCCTCTCTCTCACCTTCACAACCGTTTCTCCATTTCCAGATTCCACCACATCTCTAACTATGCCCGCCGCCG
CCCCTTCTCTTCTTCCAGATCCAATCCACTCACCATCTTCGCCATGGCCCCTCAGAAGAAGGTGAACAAATACGACGATGCTTGGGAGAAGAAATGGTTTGGAGCTGGAA
TTTTTTACGAGAGTGCGGAGGATGTGGAAGTGGATGTGTTCAAGAAGCTTGAGACTAAGAAAGTTCTTAGCAATGTTGAGAAAGCAGGGTTGTTATCGAAGGCGGAGGAA
TTAGGGTTTACGCTTTCCTCCATTGAGAAGTTAGGGGTTTTCTCTAAAGCTGAAGAATTAGGGCTTCTTAGCTTGCTTGAGAAAGTTGCGAGTTCTTCTCCCTCCGCTTT
GGCTTCTCTTGCACTTCCCATTTTGGTTGCGGCGCTGGTGGCCATTGTTGTAATTCCCGATGATTCGGTGGCCCTTGTGGCGTTGCAGGCGGTGGTCGGCGGTGGATTGG
CGCTTGGGGCTGCCGGATTGTTAGTTGGATCGGTGGTGCTGGGTGGGTTGCAAGAAGCTGATTGA
mRNA sequence
CACTTGTTTTTGTTGAGAGTGTGGGAGGAGAGAAAAAGGAGGCATTGTTCAGGTTCATTAAATGTGGGTTTTGGACCTTACCGTTGGGCCGTTTAGGCCTTGAATCTGAC
AGGCCGAGAGTATTCGTCATCGGTGCATATTAATTGGGCCTGTTCCTTTCATCGGCCCAATACAAAATCTCTTCAATTTGATTTGAAATTGAGATAAAGCAATGGAGGAT
ATGATTGGTGGAGATGAATGGACAGCAGATTACACTTGGAGATCTCTCTCTATCATACAATGTCAAAACTTCTCCTCTCTCCACTAATTCGTGGCTGAAGCTGAACAACA
CCTAAGCTTCCTTTCCTCTTCAACAATGGCGGTCACTTCACAATCTCCATCTTCCTCCACTGCTCCTCTCTCTCACCTTCACAACCGTTTCTCCATTTCCAGATTCCACC
ACATCTCTAACTATGCCCGCCGCCGCCCCTTCTCTTCTTCCAGATCCAATCCACTCACCATCTTCGCCATGGCCCCTCAGAAGAAGGTGAACAAATACGACGATGCTTGG
GAGAAGAAATGGTTTGGAGCTGGAATTTTTTACGAGAGTGCGGAGGATGTGGAAGTGGATGTGTTCAAGAAGCTTGAGACTAAGAAAGTTCTTAGCAATGTTGAGAAAGC
AGGGTTGTTATCGAAGGCGGAGGAATTAGGGTTTACGCTTTCCTCCATTGAGAAGTTAGGGGTTTTCTCTAAAGCTGAAGAATTAGGGCTTCTTAGCTTGCTTGAGAAAG
TTGCGAGTTCTTCTCCCTCCGCTTTGGCTTCTCTTGCACTTCCCATTTTGGTTGCGGCGCTGGTGGCCATTGTTGTAATTCCCGATGATTCGGTGGCCCTTGTGGCGTTG
CAGGCGGTGGTCGGCGGTGGATTGGCGCTTGGGGCTGCCGGATTGTTAGTTGGATCGGTGGTGCTGGGTGGGTTGCAAGAAGCTGATTGAAGAAGGTGTGTTATGATAGT
TTTAAATGTCTTTTTTGTGTAATTATGAAAATAAATATGGTGTTAATTAAATCAGCTTTATAATGTGTATTTCCCTCTTTATCTAGGTGATTAGAGATATCAAAGATTGG
AAAAAATTAAAAGATAGAATTTAGTCTATATAATTTGGTAAAATTTTCTATACAATTTAATAACATGGTAG
Protein sequence
MAVTSQSPSSSTAPLSHLHNRFSISRFHHISNYARRRPFSSSRSNPLTIFAMAPQKKVNKYDDAWEKKWFGAGIFYESAEDVEVDVFKKLETKKVLSNVEKAGLLSKAEE
LGFTLSSIEKLGVFSKAEELGLLSLLEKVASSSPSALASLALPILVAALVAIVVIPDDSVALVALQAVVGGGLALGAAGLLVGSVVLGGLQEAD
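
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The Python sketch below assumes the standard genetic code (NCBI translation table 1), which the page does not state. The names `CODON_TABLE` and `translate` are illustrative.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS (line breaks allowed) up to the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First ten codons of the CDS sequence listed above.
print(translate("ATGGCGGTCACTTCACAATCTCCATCTTCC"))  # MAVTSQSPSS
```

Passing the full CDS text and comparing the result with the protein sequence, after removing line breaks from both, gives a quick consistency check on the record.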