; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G5156 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G5156
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionProtein of unknown function (DUF1068)
Genome locationctg1227:4723571..4727377
RNA-Seq ExpressionCucsat.G5156
SyntenyCucsat.G5156
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR010471 - Protein of unknown function DUF1068


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8652279.1 hypothetical protein Csa_022101 [Cucumis sativus]2.11e-167100Show/hide
Query:  MKLKVNKSNIVFNPKFNIYSTFLSTKYKHIFPQRVGSHFPFLDYTKVLKMAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLS
        MKLKVNKSNIVFNPKFNIYSTFLSTKYKHIFPQRVGSHFPFLDYTKVLKMAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLS
Subjt:  MKLKVNKSNIVFNPKFNIYSTFLSTKYKHIFPQRVGSHFPFLDYTKVLKMAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLS

Query:  TCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLSEELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARER
        TCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLSEELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARER
Subjt:  TCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLSEELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARER

Query:  AEATLASQKRLTALWETRARQRGWRDNIVTSRGTIQGS
        AEATLASQKRLTALWETRARQRGWRDNIVTSRGTIQGS
Subjt:  AEATLASQKRLTALWETRARQRGWRDNIVTSRGTIQGS

XP_004151898.1 uncharacterized protein LOC101219040 [Cucumis sativus]3.81e-129100Show/hide
Query:  MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS
        MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS
Subjt:  MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS

Query:  EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTSRGTIQGS
        EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTSRGTIQGS
Subjt:  EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTSRGTIQGS

XP_008455917.1 PREDICTED: uncharacterized protein LOC103495987 isoform X1 [Cucumis melo]4.34e-11993.65Show/hide
Query:  MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS
        MA KPVGS SPGLTKVGL  MA+CIAAYILGPPLYWHF EGL AFSSSSLSTCPPCFCDCSSLTDFAFTEEL+NTTFRDCVKHDSGMNEETEKNFAELLS
Subjt:  MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS

Query:  EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTSRGTIQGS
        EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLT LWETRARQRGWRD+IVTSRGT+Q S
Subjt:  EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTSRGTIQGS

XP_008455918.1 PREDICTED: uncharacterized protein LOC103495987 isoform X2 [Cucumis melo]1.58e-11089.95Show/hide
Query:  MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS
        MA KPVGS SPGLTKVGL  MA+CIAAYILGPPLYWHF EGL AFSSSSLSTCPPCFCDCSSLTDFAFTE        DCVKHDSGMNEETEKNFAELLS
Subjt:  MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS

Query:  EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTSRGTIQGS
        EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLT LWETRARQRGWRD+IVTSRGT+Q S
Subjt:  EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTSRGTIQGS

XP_023534557.1 uncharacterized protein LOC111796098 isoform X1 [Cucurbita pepo subsp. pepo]3.29e-11090.11Show/hide
Query:  MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS
        MAVKP GSCSPGLTKVGL  +ALCIAAYILGPPLYWHFMEGL   SSSS STCPPCFCDCSS TDFAFT+E ENTTFRDCVKHDSGMNEETE++FAELLS
Subjt:  MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS

Query:  EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTS
        EELKLREAEA+E HRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQK+LTALWE RARQRGWRD+IVTS
Subjt:  EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTS

TrEMBL top hitse value%identityAlignment
A0A0A0LQC2 Uncharacterized protein1.85e-129100Show/hide
Query:  MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS
        MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS
Subjt:  MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS

Query:  EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTSRGTIQGS
        EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTSRGTIQGS
Subjt:  EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTSRGTIQGS

A0A1S3C1Z9 uncharacterized protein LOC103495987 isoform X12.10e-11993.65Show/hide
Query:  MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS
        MA KPVGS SPGLTKVGL  MA+CIAAYILGPPLYWHF EGL AFSSSSLSTCPPCFCDCSSLTDFAFTEEL+NTTFRDCVKHDSGMNEETEKNFAELLS
Subjt:  MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS

Query:  EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTSRGTIQGS
        EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLT LWETRARQRGWRD+IVTSRGT+Q S
Subjt:  EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTSRGTIQGS

A0A1S3C2Q4 uncharacterized protein LOC103495987 isoform X27.63e-11189.95Show/hide
Query:  MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS
        MA KPVGS SPGLTKVGL  MA+CIAAYILGPPLYWHF EGL AFSSSSLSTCPPCFCDCSSLTDFAFTE        DCVKHDSGMNEETEKNFAELLS
Subjt:  MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS

Query:  EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTSRGTIQGS
        EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLT LWETRARQRGWRD+IVTSRGT+Q S
Subjt:  EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTSRGTIQGS

A0A6J1G5W1 uncharacterized protein LOC111451120 isoform X15.30e-10989.01Show/hide
Query:  MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS
        MAVKP GSCSPGLTKVGL  +ALCIAAYILGPPLYWHF+EGL   SSSS STCPPCFCDCSS TDFAFT+E ENTTFRDCVKHDSGMNEETE+ FAELLS
Subjt:  MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS

Query:  EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTS
        EELKLREAEA+E HRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQK+LTALWE RARQRGWRD+IV S
Subjt:  EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTS

A0A6J1I131 uncharacterized protein LOC111469880 isoform X18.75e-10887.91Show/hide
Query:  MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS
        MAVKP GSCSPGLTKVGL  +ALCIAAYILGPPLYWHFMEGL   SSS  STCPPCFCDCSS TDFAFT+E ENTTFRDCVKHDSGMNEETE++FAELLS
Subjt:  MAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLS

Query:  EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTS
        E+LKLREA+A+E HRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQK+LTALWE RARQRGWRD+IV S
Subjt:  EELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05070.1 Protein of unknown function (DUF1068)9.9e-5565.03Show/hide
Query:  KVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLSEELKLREAEALENH
        K+GL L+ L +A YILGPPLYWH  E L A S+SS   CP C C+CS+ +     +EL N +F DC KHD  +NE+TEKN+AELL+EELKLREAE+LE H
Subjt:  KVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLSEELKLREAEALENH

Query:  RRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRD
        +RAD+ LLEAKK+TS YQKEADKCNSGMETCE ARE+AE  LA QK+LT+ WE RARQ+GWR+
Subjt:  RRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRD

AT2G32580.1 Protein of unknown function (DUF1068)1.1e-4858.93Show/hide
Query:  KVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLSEELKLREAEALENH
        KVGL L+AL +  YILGPPLYWH  E L    + S ++C  C CDCSSL        L N +F DC K D  +NE+TEKN+AELL+EELK REA ++E H
Subjt:  KVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLSEELKLREAEALENH

Query:  RRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTS
        +R D  LLEAKK+TS YQKEADKCNSGMETCE ARE+AE  L  QK+LT++WE RARQ+G++D    S
Subjt:  RRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTS

AT2G32580.2 Protein of unknown function (DUF1068)6.9e-3264.42Show/hide
Query:  DCVKHDSGMNEETEKNFAELLSEELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDN
        +C K D  +NE+TEKN+AELL+EELK REA ++E H+R D  LLEAKK+TS YQKEADKCNSGMETCE ARE+AE  L  QK+LT++WE RARQ+G++D 
Subjt:  DCVKHDSGMNEETEKNFAELLSEELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDN

Query:  IVTS
           S
Subjt:  IVTS

AT4G04360.1 Protein of unknown function (DUF1068)1.3e-4355.36Show/hide
Query:  KVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLSEELKLREAEALENH
        KV   +M LCI AYI GP LYWH  E     + S  S+CPPC CDCSS    +  + L N +F DC++H+ G +EE+E +F E+++EELKLREA+A E+ 
Subjt:  KVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCSSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLSEELKLREAEALENH

Query:  RRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTS
         RAD  LL+AKK  SQYQKEADKC+ GMETCE ARE+AEA L  Q+RL+ +WE RARQ GW++  V S
Subjt:  RRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGWRDNIVTS

AT4G30996.1 Protein of unknown function (DUF1068)4.8e-3347.8Show/hide
Query:  LFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDC-SSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLSEELKLREAEALENHRR
        L + A+  A  + GP LYW F +G    S+ + S CPPC CDC   L+       L N +  DC   D  + +E EK F +LL+EELKL+EA A E+ R 
Subjt:  LFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDC-SSLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLSEELKLREAEALENHRR

Query:  ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGW
         +++L EAK++ SQYQKEA+KCN+  E CE+ARERAEA L  ++++T+LWE RARQ GW
Subjt:  ADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRARQRGW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGCTGAAGGTAAATAAATCAAACATAGTATTTAATCCAAAGTTTAATATTTATTCCACTTTCCTCTCTACTAAGTATAAACACATCTTTCCCCAACGGGTTGGTTC
CCATTTTCCATTTCTGGACTACACCAAGGTCTTGAAAATGGCAGTGAAGCCGGTGGGTTCCTGCTCTCCGGGCCTGACGAAGGTGGGATTGTTTTTAATGGCTCTTTGTA
TAGCAGCCTACATTCTGGGTCCGCCTCTATACTGGCATTTCATGGAGGGCTTGCCCGCTTTCTCTTCTTCCTCTCTCTCAACTTGCCCACCTTGCTTTTGTGACTGTTCT
TCTCTCACTGACTTTGCCTTCACTGAAGAGCTCGAAAATACCACTTTTAGAGATTGTGTGAAACATGACTCTGGCATGAATGAGGAAACAGAAAAGAATTTTGCAGAGTT
ATTGTCTGAGGAACTGAAACTGAGGGAAGCTGAAGCTTTGGAGAATCATCGGCGTGCCGACATATCTCTACTAGAAGCAAAGAAGATGACATCTCAATATCAGAAAGAAG
CAGACAAGTGCAATTCAGGCATGGAAACATGTGAAGCAGCAAGGGAAAGAGCTGAAGCTACATTAGCTTCACAAAAGAGGCTAACAGCATTATGGGAAACTAGGGCTCGC
CAAAGAGGATGGAGAGACAACATTGTTACATCCCGTGGTACCATCCAAGGCTCATAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGCTGAAGGTAAATAAATCAAACATAGTATTTAATCCAAAGTTTAATATTTATTCCACTTTCCTCTCTACTAAGTATAAACACATCTTTCCCCAACGGGTTGGTTC
CCATTTTCCATTTCTGGACTACACCAAGGTCTTGAAAATGGCAGTGAAGCCGGTGGGTTCCTGCTCTCCGGGCCTGACGAAGGTGGGATTGTTTTTAATGGCTCTTTGTA
TAGCAGCCTACATTCTGGGTCCGCCTCTATACTGGCATTTCATGGAGGGCTTGCCCGCTTTCTCTTCTTCCTCTCTCTCAACTTGCCCACCTTGCTTTTGTGACTGTTCT
TCTCTCACTGACTTTGCCTTCACTGAAGAGCTCGAAAATACCACTTTTAGAGATTGTGTGAAACATGACTCTGGCATGAATGAGGAAACAGAAAAGAATTTTGCAGAGTT
ATTGTCTGAGGAACTGAAACTGAGGGAAGCTGAAGCTTTGGAGAATCATCGGCGTGCCGACATATCTCTACTAGAAGCAAAGAAGATGACATCTCAATATCAGAAAGAAG
CAGACAAGTGCAATTCAGGCATGGAAACATGTGAAGCAGCAAGGGAAAGAGCTGAAGCTACATTAGCTTCACAAAAGAGGCTAACAGCATTATGGGAAACTAGGGCTCGC
CAAAGAGGATGGAGAGACAACATTGTTACATCCCGTGGTACCATCCAAGGCTCATAA
Protein sequenceShow/hide protein sequence
MKLKVNKSNIVFNPKFNIYSTFLSTKYKHIFPQRVGSHFPFLDYTKVLKMAVKPVGSCSPGLTKVGLFLMALCIAAYILGPPLYWHFMEGLPAFSSSSLSTCPPCFCDCS
SLTDFAFTEELENTTFRDCVKHDSGMNEETEKNFAELLSEELKLREAEALENHRRADISLLEAKKMTSQYQKEADKCNSGMETCEAARERAEATLASQKRLTALWETRAR
QRGWRDNIVTSRGTIQGS