CuGenDBv2

CSPI02G02220 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI02G02220
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Protein of unknown function (DUF1068)
Genome location: Chr2:1527832..1531226
RNA-Seq Expression: CSPI02G02220
Synteny: CSPI02G02220
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR010471 - Protein of unknown function DUF1068
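
The genome location string can be split into its parts to get the length of the gene span. The following is a minimal sketch, not part of the database record: the helper name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("Chr2:1527832..1531226")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 3395 bp on Chr2.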


Homology
GenBank top hits (e value, % identity, alignment)
XP_004139277.1 uncharacterized protein LOC101212944 [Cucumis sativus] | e value: 5.9e-91 | % identity: 100
Query:  MSRRSGACLRCCLVFFAVVSALAVCGPALYWRFKKALQLGDSKASCPPCICDCPPPLSLLKISPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSGACLRCCLVFFAVVSALAVCGPALYWRFKKALQLGDSKASCPPCICDCPPPLSLLKISPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQ
Subjt:  MSRRSGACLRCCLVFFAVVSALAVCGPALYWRFKKALQLGDSKASCPPCICDCPPPLSLLKISPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKVTSLWERRARQMGWEGE
        EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKVTSLWERRARQMGWEGE
Subjt:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKVTSLWERRARQMGWEGE

XP_008457359.1 PREDICTED: uncharacterized protein LOC103497063 [Cucumis melo] | e value: 1.3e-90 | % identity: 99.42
Query:  MSRRSGACLRCCLVFFAVVSALAVCGPALYWRFKKALQLGDSKASCPPCICDCPPPLSLLKISPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSGACLRCCLVFFAVVSALAVCGPALYWRFKKALQLGDSKASCPPCICDCPPPLSLLKISPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQ
Subjt:  MSRRSGACLRCCLVFFAVVSALAVCGPALYWRFKKALQLGDSKASCPPCICDCPPPLSLLKISPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKVTSLWERRARQMGWEGE
        EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERK+TSLWERRARQMGWEGE
Subjt:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKVTSLWERRARQMGWEGE

XP_022143173.1 uncharacterized protein LOC111013108 [Momordica charantia] | e value: 6.1e-88 | % identity: 96.51
Query:  MSRRSGACLRCCLVFFAVVSALAVCGPALYWRFKKALQLGDSKASCPPCICDCPPPLSLLKISPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSGACLRCCLV FAVVSALAVCGPALYWRFKKA QLGDSK SCPPCICDCPPPLSLLKI+PGLANLSVTDCGSNDPDLK EMEKQFVDLLTEELKLQ
Subjt:  MSRRSGACLRCCLVFFAVVSALAVCGPALYWRFKKALQLGDSKASCPPCICDCPPPLSLLKISPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKVTSLWERRARQMGWEGE
        EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERK+TSLWERRARQMGWEGE
Subjt:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKVTSLWERRARQMGWEGE

XP_022938183.1 uncharacterized protein LOC111444344 [Cucurbita moschata] | e value: 2.4e-84 | % identity: 92.4
Query:  MSRRSGACLRCCLVFFAVVSALAVCGPALYWRFKKALQLGDSKASCPPCICDCPPPLSLLKISPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSGACLRCCLVFFAVVSALAVCGPAL+WRFKKAL LGDSK SC PCICDCPPPLSLLKI+PGLANLSVTDCG NDPDLK+EMEKQFVDLLTEELKLQ
Subjt:  MSRRSGACLRCCLVFFAVVSALAVCGPALYWRFKKALQLGDSKASCPPCICDCPPPLSLLKISPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKVTSLWERRARQMGWEG
        EAV+GEHTRHMNIT+FEAKRAASQYQREAEKCI+ATETCEEARERAEAL IKERK+TSLWERRARQMGW+G
Subjt:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKVTSLWERRARQMGWEG

XP_038890751.1 uncharacterized protein LOC120080238 [Benincasa hispida] | e value: 8.5e-90 | % identity: 98.26
Query:  MSRRSGACLRCCLVFFAVVSALAVCGPALYWRFKKALQLGDSKASCPPCICDCPPPLSLLKISPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSGACLRCCLV FAVVSALAVCGPALYWRFKKALQLGDSKASCPPCICDCPPPLSLLKISPGLANLS+TDCGSNDPDLKQEMEKQFVDLLTEELKLQ
Subjt:  MSRRSGACLRCCLVFFAVVSALAVCGPALYWRFKKALQLGDSKASCPPCICDCPPPLSLLKISPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKVTSLWERRARQMGWEGE
        EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERK+TSLWERRARQMGWEGE
Subjt:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKVTSLWERRARQMGWEGE
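
The % identity values can be cross-checked against the alignment midlines, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The sketch below is an illustration under stated assumptions, not the database's own calculation: the function name `percent_identity` is hypothetical, identity is taken as identical columns divided by total alignment columns, and each midline row is assumed to be padded to the full width of its block. The two rows are the midlines of the Cucumis melo hit (XP_008457359.1).

```python
def percent_identity(midline_rows: list[str]) -> float:
    """Percent of alignment columns whose midline character is a letter."""
    midline = "".join(midline_rows)
    if not midline:
        raise ValueError("empty alignment")
    # Letters mark identical residues; '+' and ' ' are non-identical columns.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)


# Midline rows of the Cucumis melo hit, copied from the alignment above.
melo_midline = [
    "MSRRSGACLRCCLVFFAVVSALAVCGPALYWRFKKALQLGDSKASCPPCICDCPPPLSLLKISPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQ",
    "EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERK+TSLWERRARQMGWEGE",
]
melo_identity = percent_identity(melo_midline)
print(round(melo_identity, 2))
```

With one `+` in 172 aligned columns, this should agree with the 99.42 reported for that hit (171/172).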

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LG38 Uncharacterized protein | e value: 2.8e-91 | % identity: 100
Query:  MSRRSGACLRCCLVFFAVVSALAVCGPALYWRFKKALQLGDSKASCPPCICDCPPPLSLLKISPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSGACLRCCLVFFAVVSALAVCGPALYWRFKKALQLGDSKASCPPCICDCPPPLSLLKISPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQ
Subjt:  MSRRSGACLRCCLVFFAVVSALAVCGPALYWRFKKALQLGDSKASCPPCICDCPPPLSLLKISPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKVTSLWERRARQMGWEGE
        EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKVTSLWERRARQMGWEGE
Subjt:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKVTSLWERRARQMGWEGE

A0A1S3C6L6 uncharacterized protein LOC103497063 | e value: 6.3e-91 | % identity: 99.42
Query:  MSRRSGACLRCCLVFFAVVSALAVCGPALYWRFKKALQLGDSKASCPPCICDCPPPLSLLKISPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSGACLRCCLVFFAVVSALAVCGPALYWRFKKALQLGDSKASCPPCICDCPPPLSLLKISPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQ
Subjt:  MSRRSGACLRCCLVFFAVVSALAVCGPALYWRFKKALQLGDSKASCPPCICDCPPPLSLLKISPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKVTSLWERRARQMGWEGE
        EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERK+TSLWERRARQMGWEGE
Subjt:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKVTSLWERRARQMGWEGE

A0A5A7VC79 DUF1068 domain-containing protein | e value: 6.3e-91 | % identity: 99.42
Query:  MSRRSGACLRCCLVFFAVVSALAVCGPALYWRFKKALQLGDSKASCPPCICDCPPPLSLLKISPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSGACLRCCLVFFAVVSALAVCGPALYWRFKKALQLGDSKASCPPCICDCPPPLSLLKISPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQ
Subjt:  MSRRSGACLRCCLVFFAVVSALAVCGPALYWRFKKALQLGDSKASCPPCICDCPPPLSLLKISPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKVTSLWERRARQMGWEGE
        EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERK+TSLWERRARQMGWEGE
Subjt:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKVTSLWERRARQMGWEGE

A0A6J1CQ21 uncharacterized protein LOC111013108 | e value: 2.9e-88 | % identity: 96.51
Query:  MSRRSGACLRCCLVFFAVVSALAVCGPALYWRFKKALQLGDSKASCPPCICDCPPPLSLLKISPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSGACLRCCLV FAVVSALAVCGPALYWRFKKA QLGDSK SCPPCICDCPPPLSLLKI+PGLANLSVTDCGSNDPDLK EMEKQFVDLLTEELKLQ
Subjt:  MSRRSGACLRCCLVFFAVVSALAVCGPALYWRFKKALQLGDSKASCPPCICDCPPPLSLLKISPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKVTSLWERRARQMGWEGE
        EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERK+TSLWERRARQMGWEGE
Subjt:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKVTSLWERRARQMGWEGE

A0A6J1FI57 uncharacterized protein LOC111444344 | e value: 1.2e-84 | % identity: 92.4
Query:  MSRRSGACLRCCLVFFAVVSALAVCGPALYWRFKKALQLGDSKASCPPCICDCPPPLSLLKISPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQ
        MSRRSGACLRCCLVFFAVVSALAVCGPAL+WRFKKAL LGDSK SC PCICDCPPPLSLLKI+PGLANLSVTDCG NDPDLK+EMEKQFVDLLTEELKLQ
Subjt:  MSRRSGACLRCCLVFFAVVSALAVCGPALYWRFKKALQLGDSKASCPPCICDCPPPLSLLKISPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQ

Query:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKVTSLWERRARQMGWEG
        EAV+GEHTRHMNIT+FEAKRAASQYQREAEKCI+ATETCEEARERAEAL IKERK+TSLWERRARQMGW+G
Subjt:  EAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKVTSLWERRARQMGWEG

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G05070.1 Protein of unknown function (DUF1068) | e value: 3.9e-32 | % identity: 44.58
Query:  RSGACLRCCLVFFAVVSALAVCGPALYWRFKKALQLGDSKASCPPCICDCPPPLSLLKISPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQEAV
        R  A L+  L    +  A  + GP LYW   +AL    S +SCP C C+C    S + I   L+N S  DC  +DP++ ++ EK + +LLTEELKL+EA 
Subjt:  RSGACLRCCLVFFAVVSALAVCGPALYWRFKKALQLGDSKASCPPCICDCPPPLSLLKISPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQEAV

Query:  SGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKVTSLWERRARQMGW
        S E  +  ++ L EAK+  S YQ+EA+KC +  ETCEEARE+AE  + +++K+TS WE RARQ GW
Subjt:  SGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKVTSLWERRARQMGW

AT2G24290.1 Protein of unknown function (DUF1068) | e value: 2.6e-68 | % identity: 72.57
Query:  MSRRSGACLRCCLVFFAVVSALAVCGPALYWRFKKALQLGDSKAS---CPPCICDCPPPLSLLKISPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEEL
        M+RRSG C+R CLV F+VVSAL VCGPALYW+  K   +G ++++   CPPC+CD PPPLSLL+I+PGLANLS+T CGS+DP+LK+EMEK FVDLLTEEL
Subjt:  MSRRSGACLRCCLVFFAVVSALAVCGPALYWRFKKALQLGDSKAS---CPPCICDCPPPLSLLKISPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEEL

Query:  KLQEAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKVTSLWERRARQMGWEGE
        KLQEAV+ EH+RHMN+TL EAKR ASQYQ+EAEKC AATE CE ARERA+AL++KERK+T LWERRARQ+GWEGE
Subjt:  KLQEAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKVTSLWERRARQMGWEGE

AT2G32580.1 Protein of unknown function (DUF1068) | e value: 1.2e-33 | % identity: 45.73
Query:  ACLRCCLVFFAVVSALAVCGPALYWRFKKALQLGDSKASCPPCICDCPPPLSLLKISPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQEAVSGE
        A L+  L   A+     + GP LYW   +AL +  S  SC  C+CDC   L LL I  GL+N S TDC   DP++ ++ EK + +LLTEELK +EA S E
Subjt:  ACLRCCLVFFAVVSALAVCGPALYWRFKKALQLGDSKASCPPCICDCPPPLSLLKISPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQEAVSGE

Query:  HTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKVTSLWERRARQMGWE
          + ++  L EAK+  S YQ+EA+KC +  ETCEEARE+AE  +++++K+TS+WE+RARQ G++
Subjt:  HTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKVTSLWERRARQMGWE

AT4G04360.1 Protein of unknown function (DUF1068) | e value: 2.6e-28 | % identity: 41.52
Query:  MSRRSGACLRCCLVFFAVVSALAVCGPALYWRFKKALQLGDS-KASCPPCICDCPPPLSLLKISPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKL
        M+RR     +   V   +     + GP+LYW   +   + DS  +SCPPC+CDC     LL I  GL+N S  DC  ++ +  +E E  F +++ EELKL
Subjt:  MSRRSGACLRCCLVFFAVVSALAVCGPALYWRFKKALQLGDS-KASCPPCICDCPPPLSLLKISPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKL

Query:  QEAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKVTSLWERRARQMGWE
        +EA + E     +  L +AK+AASQYQ+EA+KC    ETCE ARE+AEA + ++R+++ +WE RARQ GW+
Subjt:  QEAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKVTSLWERRARQMGWE

AT4G30996.1 Protein of unknown function (DUF1068) | e value: 2.0e-73 | % identity: 78.16
Query:  MSRRSGACLRCCLVFFAVVSALAVCGPALYWRFKKALQLGDSKAS--CPPCICDCPPPLSLLKISPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELK
        M RRSG C+R CLV FAVVSAL VCGPALYW+F K   +G ++A+  CPPC+CDCPPPLSLL+I+PGLANLS+TDCGS+DP+LKQEMEKQFVDLLTEELK
Subjt:  MSRRSGACLRCCLVFFAVVSALAVCGPALYWRFKKALQLGDSKAS--CPPCICDCPPPLSLLKISPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELK

Query:  LQEAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKVTSLWERRARQMGWEGE
        LQEAV+ EH+RHMN+TL EAKR ASQYQ+EAEKC AATE CE ARERAEAL+IKERK+TSLWE+RARQ GWEGE
Subjt:  LQEAVSGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKVTSLWERRARQMGWEGE


Sequences
CDS sequence
ATGTCACGCCGATCTGGGGCTTGCTTAAGGTGTTGTCTCGTGTTTTTTGCTGTAGTTTCTGCTTTGGCTGTTTGTGGACCAGCTTTGTATTGGAGATTCAAGAAGGCTTT
GCAATTGGGAGATTCAAAAGCCTCCTGTCCTCCTTGCATCTGCGATTGTCCACCCCCATTATCCCTTTTGAAGATTTCCCCTGGTCTGGCCAATCTCTCCGTTACAGACT
GTGGGAGTAATGACCCAGATCTCAAGCAGGAGATGGAAAAACAATTTGTGGACCTTTTGACAGAAGAATTGAAGCTTCAAGAAGCAGTTTCTGGCGAGCACACTCGCCAT
ATGAACATCACTTTGTTTGAGGCAAAAAGAGCTGCTTCTCAGTATCAGAGAGAGGCTGAAAAGTGCATTGCTGCCACAGAAACTTGTGAAGAAGCCCGAGAACGTGCTGA
AGCATTGATGATCAAGGAGAGGAAGGTTACATCATTGTGGGAGCGACGAGCCCGCCAAATGGGTTGGGAAGGGGAATAA
mRNA sequence
ATTCAACTGTGGGGGGAAAAAATGGAAACTTTTTAGATTTTATGTTGTTTTGTTACATTACAATACAACCCCATTAAATTTTTCAATACACCGACTCCTCCAGAAGAAAT
TTTGTTAGCTAAAGTTTTTTTATTTTGTTGGTTTTTTCCATTCATTTCATTGGTTTCTTTCAGACTTTCACTGCTGCAATTAGATCTCAGTTCCTCACTTTCTCAGTAAC
GCGGAAAGAGAGAGGAAAATCACGACGAAAAAAGAAAGAAAAACAAGAATCAAAGAGGAAGATCAAGGTGAGAAATGTCACGCCGATCTGGGGCTTGCTTAAGGTGTTGT
CTCGTGTTTTTTGCTGTAGTTTCTGCTTTGGCTGTTTGTGGACCAGCTTTGTATTGGAGATTCAAGAAGGCTTTGCAATTGGGAGATTCAAAAGCCTCCTGTCCTCCTTG
CATCTGCGATTGTCCACCCCCATTATCCCTTTTGAAGATTTCCCCTGGTCTGGCCAATCTCTCCGTTACAGACTGTGGGAGTAATGACCCAGATCTCAAGCAGGAGATGG
AAAAACAATTTGTGGACCTTTTGACAGAAGAATTGAAGCTTCAAGAAGCAGTTTCTGGCGAGCACACTCGCCATATGAACATCACTTTGTTTGAGGCAAAAAGAGCTGCT
TCTCAGTATCAGAGAGAGGCTGAAAAGTGCATTGCTGCCACAGAAACTTGTGAAGAAGCCCGAGAACGTGCTGAAGCATTGATGATCAAGGAGAGGAAGGTTACATCATT
GTGGGAGCGACGAGCCCGCCAAATGGGTTGGGAAGGGGAATAAATCTATAATTTCAAAGGCAACCTCTTAGTTCAACTAGTCGTCAAGTTATTCAATCCACAAGATCAGT
AGAAACTGGCAAGCGAAGTGCTTGGTTCTTCAGTCTGTCAAAGGCATAATGTTCTTCTCTTTTCCTCCATTGATTGTATTGCACCTTTTCTGGAAACATGTCAATTCTAC
ATCATAATTCATACATGACAATGTTATAGATCCCACTCTTCTGCATTTGCTAATATATAATATATATGATTATCGCCTCTTCAACCGTAGCTTTGTGACTAGTTGGCTAG
ATGTTTCTGAGTCATCCACAAGCTCGAATAGACCCCATTGTTCTCATTCTCATATTGGAAAAACCATGCTGTTCTCAGTTTTCGCACCAAATAATTGATGTTAGTAAAAT
CTGATGTTCACGTTTGCATACTGGGAATTTTCAGCTTCATTACCTGCTTTAAAGAAAAGTGGATATTTGATGGTAACGGCT
Protein sequence
MSRRSGACLRCCLVFFAVVSALAVCGPALYWRFKKALQLGDSKASCPPCICDCPPPLSLLKISPGLANLSVTDCGSNDPDLKQEMEKQFVDLLTEELKLQEAVSGEHTRH
MNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKVTSLWERRARQMGWEGE
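
The protein sequence is the translation of the CDS sequence. The sketch below illustrates that relationship on two short fragments copied from the CDS (its first 30 and last 18 nucleotides). It assumes the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are hypothetical helpers, not part of the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGTCACGCCGATCTGGGGCTTGCTTAAGG"  # first 30 nt of the CDS
cds_end = "GGTTGGGAAGGGGAATAA"  # last 18 nt of the CDS, including the stop codon
print(translate(cds_start))  # MSRRSGACLR
print(translate(cds_end))    # GWEGE
```

The two outputs correspond to the first ten and last five residues of the protein sequence above. Joining the wrapped CDS lines into a single string and passing it to `translate` would check the whole sequence the same way.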