; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G31210 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G31210
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionMyb/SANT-like DNA-binding domain protein
Genome locationChr1:25726714..25727749
RNA-Seq ExpressionCSPI01G31210
SyntenyCSPI01G31210
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8653537.1 hypothetical protein Csa_006853 [Cucumis sativus]8.3e-8886.89Show/hide
Query:  MCGNSKPSQLRITNYELCMIFDNKEKTEGWSIVEKHDKDCTLNNHNHTESQVGISDDDAGGGNGSSGSDSTEASSQQTGTRPSSSSHSRKSLKRRCSDDL
        MCGNSKPSQLRITNYELCMIFDNKEKTEGWSIVEKHDKD TLNNHNHTESQVGISDDDAGGGNGSSGSDSTEASSQQTGTRPSSSSHSRKSLKRRCSDDL
Subjt:  MCGNSKPSQLRITNYELCMIFDNKEKTEGWSIVEKHDKDCTLNNHNHTESQVGISDDDAGGGNGSSGSDSTEASSQQTGTRPSSSSHSRKSLKRRCSDDL

Query:  IVQIVSVMAANVARIADALSDRPTCLDQVFDVVQTMPGLDEDLILDAYRLAWLQTGHLIGYMRGEIEIAYRKERVLDHALGSCYFAQSFLG---ATLLDV
        IVQIVSVMAANVARIADALSDRPTCLDQVFDVVQTMPGLDEDLILDAYRLAWLQTGHLIGYMR EIEIAYR    L HA  S  +   FLG    TL D+
Subjt:  IVQIVSVMAANVARIADALSDRPTCLDQVFDVVQTMPGLDEDLILDAYRLAWLQTGHLIGYMRGEIEIAYRKERVLDHALGSCYFAQSFLG---ATLLDV

Query:  ARGKEY
         + + +
Subjt:  ARGKEY

KAF8395743.1 hypothetical protein HHK36_019694 [Tetracentron sinense]1.0e-1340Show/hide
Query:  NSKPSQLR-ITNY-ELCMIFDNKEKTEGWSIV--EKHDKDCTLNNHNHTESQVGISDDDAGGGNGSSGSDSTEASSQQTGTRPSSSSHSRKSLKRRCSDD
        +++P + R I NY ELC+I  + ++   WS +  E   K  T N H   E+++ ++DD+    +     D  EASSQQT +RP+SSSHS++  K++   D
Subjt:  NSKPSQLR-ITNY-ELCMIFDNKEKTEGWSIV--EKHDKDCTLNNHNHTESQVGISDDDAGGGNGSSGSDSTEASSQQTGTRPSSSSHSRKSLKRRCSDD

Query:  LIVQIVSVMAANVARIADAL--SDRPTCLDQVFDVVQTMPGLDEDLILDA
         +++ +S +AAN+ RIADAL  S+R   LD++F++VQ +PG D+DLI++A
Subjt:  LIVQIVSVMAANVARIADAL--SDRPTCLDQVFDVVQTMPGLDEDLILDA

XP_022153203.1 uncharacterized protein At2g29880-like isoform X1 [Momordica charantia]1.9e-1567.47Show/hide
Query:  ITNY-ELCMIFDNKEKTEGWSIVEKHDKDCTLNNHNHTESQVGISDDDAGGGNGSSGSDSTEASSQQTGTRPSSSSHSRKSLK
        I NY ELCMIF N+++T GWSI  KH+KD  LN+ +HT  +VGISDD A  G+ SSG+DS EASS+QT TRPSSSSHSRKSLK
Subjt:  ITNY-ELCMIFDNKEKTEGWSIVEKHDKDCTLNNHNHTESQVGISDDDAGGGNGSSGSDSTEASSQQTGTRPSSSSHSRKSLK

XP_022153211.1 uncharacterized protein At2g29880-like isoform X2 [Momordica charantia]1.9e-1567.47Show/hide
Query:  ITNY-ELCMIFDNKEKTEGWSIVEKHDKDCTLNNHNHTESQVGISDDDAGGGNGSSGSDSTEASSQQTGTRPSSSSHSRKSLK
        I NY ELCMIF N+++T GWSI  KH+KD  LN+ +HT  +VGISDD A  G+ SSG+DS EASS+QT TRPSSSSHSRKSLK
Subjt:  ITNY-ELCMIFDNKEKTEGWSIVEKHDKDCTLNNHNHTESQVGISDDDAGGGNGSSGSDSTEASSQQTGTRPSSSSHSRKSLK

XP_023877154.1 uncharacterized protein LOC111989590 [Quercus suber]2.7e-1435.96Show/hide
Query:  ITNYELCMIFDNKEKTEG-W---SIVEKHDKDCTLNNHNHTESQVGISDDDAGGGNGSSGSDSTEASSQQTGTRPSSSSHSRKSLKRRCSDDLIVQIVSV
        I NY+   I    E  +G W       +   + T N+  H E+ V +  ++      +  SD  + SSQQT  RPSSSSHS++ LKRR S D++++++S 
Subjt:  ITNYELCMIFDNKEKTEG-W---SIVEKHDKDCTLNNHNHTESQVGISDDDAGGGNGSSGSDSTEASSQQTGTRPSSSSHSRKSLKRRCSDDLIVQIVSV

Query:  MAANVARIADALSD--RPTCLDQVFDVVQTMPGLDEDLILDAYRLAWLQTGHLIGYMRGEIEIAYRKERVLDHALGSC
        MAA++ RIADAL++  +  CLD++F++VQT+PG D+DLI++A           I +M+  +    RK+ +L    G C
Subjt:  MAANVARIADALSD--RPTCLDQVFDVVQTMPGLDEDLILDAYRLAWLQTGHLIGYMRGEIEIAYRKERVLDHALGSC

TrEMBL top hitse value%identityAlignment
A0A0A0M2Z2 Uncharacterized protein5.3e-7299.32Show/hide
Query:  MCGNSKPSQLRITNYELCMIFDNKEKTEGWSIVEKHDKDCTLNNHNHTESQVGISDDDAGGGNGSSGSDSTEASSQQTGTRPSSSSHSRKSLKRRCSDDL
        MCGNSKPSQLRITNYELCMIFDNKEKTEGWSIVEKHDKD TLNNHNHTESQVGISDDDAGGGNGSSGSDSTEASSQQTGTRPSSSSHSRKSLKRRCSDDL
Subjt:  MCGNSKPSQLRITNYELCMIFDNKEKTEGWSIVEKHDKDCTLNNHNHTESQVGISDDDAGGGNGSSGSDSTEASSQQTGTRPSSSSHSRKSLKRRCSDDL

Query:  IVQIVSVMAANVARIADALSDRPTCLDQVFDVVQTMPGLDEDLILDA
        IVQIVSVMAANVARIADALSDRPTCLDQVFDVVQTMPGLDEDLILDA
Subjt:  IVQIVSVMAANVARIADALSDRPTCLDQVFDVVQTMPGLDEDLILDA

A0A5B7BRF2 Uncharacterized protein5.9e-1544.53Show/hide
Query:  ELCMIFDNKEKTEGWSIV-EKHDKDCTLNNHNHTES--QVGISDDDAGGGNGSSGSDSTEASSQQTGTRPSSSSHSRKSLKRRCSDDLIVQIVSVMAANV
        EL +I DN   T  + +   K D + T NN  H E+  Q    D++    N ++G   T+ SSQQT  RPSSSSHS++  K+R   DL+V+++S MAAN+
Subjt:  ELCMIFDNKEKTEGWSIV-EKHDKDCTLNNHNHTES--QVGISDDDAGGGNGSSGSDSTEASSQQTGTRPSSSSHSRKSLKRRCSDDLIVQIVSVMAANV

Query:  ARIADAL--SDRPTCLDQVFDVVQTMPGLDEDLILDA
         RIADAL  S++  CLD++F++VQ +PG D+DLI++A
Subjt:  ARIADAL--SDRPTCLDQVFDVVQTMPGLDEDLILDA

A0A6J1DIA7 uncharacterized protein At2g29880-like isoform X19.1e-1667.47Show/hide
Query:  ITNY-ELCMIFDNKEKTEGWSIVEKHDKDCTLNNHNHTESQVGISDDDAGGGNGSSGSDSTEASSQQTGTRPSSSSHSRKSLK
        I NY ELCMIF N+++T GWSI  KH+KD  LN+ +HT  +VGISDD A  G+ SSG+DS EASS+QT TRPSSSSHSRKSLK
Subjt:  ITNY-ELCMIFDNKEKTEGWSIVEKHDKDCTLNNHNHTESQVGISDDDAGGGNGSSGSDSTEASSQQTGTRPSSSSHSRKSLK

A0A6J1DJY9 uncharacterized protein At2g29880-like isoform X29.1e-1667.47Show/hide
Query:  ITNY-ELCMIFDNKEKTEGWSIVEKHDKDCTLNNHNHTESQVGISDDDAGGGNGSSGSDSTEASSQQTGTRPSSSSHSRKSLK
        I NY ELCMIF N+++T GWSI  KH+KD  LN+ +HT  +VGISDD A  G+ SSG+DS EASS+QT TRPSSSSHSRKSLK
Subjt:  ITNY-ELCMIFDNKEKTEGWSIVEKHDKDCTLNNHNHTESQVGISDDDAGGGNGSSGSDSTEASSQQTGTRPSSSSHSRKSLK

A0A7N2KMQ1 Uncharacterized protein2.5e-1340.14Show/hide
Query:  ITNYELCMIFDNKEKTEG-W---SIVEKHDKDCTLNNHNHTESQVGISDDDAGGGNGSSGSDSTEASSQQTGTRPSSSSHSRKSLKRRCSDDLIVQIVSV
        I NY+   I    E  +G W       + + + T N+  H E+ V +  ++      +  SD  + SSQQT  RPSSSSHS++ LKRR S D++++++S 
Subjt:  ITNYELCMIFDNKEKTEG-W---SIVEKHDKDCTLNNHNHTESQVGISDDDAGGGNGSSGSDSTEASSQQTGTRPSSSSHSRKSLKRRCSDDLIVQIVSV

Query:  MAANVARIADALSD--RPTCLDQVFDVVQTMPGLDEDLILDA
        MAA++ RIADAL++  +  CLD++F++VQT+PG D+DLI++A
Subjt:  MAANVARIADALSD--RPTCLDQVFDVVQTMPGLDEDLILDA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGCGGGAATTCCAAGCCAAGTCAATTGAGAATTACAAATTATGAGCTGTGTATGATTTTTGATAACAAGGAGAAAACTGAAGGATGGTCAATAGTTGAAAAACACGA
TAAGGACTGTACTTTGAACAACCACAACCATACAGAATCCCAAGTAGGGATATCAGATGATGATGCAGGGGGAGGTAATGGTTCCAGTGGTTCTGATAGCACGGAGGCTT
CATCTCAACAAACAGGAACTAGACCATCCTCCTCTTCACATTCACGAAAGTCTTTAAAGAGAAGATGCAGCGATGATCTCATTGTGCAAATAGTGAGTGTCATGGCTGCT
AACGTTGCTCGAATAGCTGATGCATTGTCAGACAGGCCAACTTGCTTAGATCAAGTGTTTGATGTTGTTCAAACCATGCCTGGGTTGGACGAGGATCTGATCCTCGACGC
ATATAGACTGGCTTGGCTGCAGACCGGTCATTTGATTGGATATATGAGAGGGGAAATTGAGATTGCTTACAGGAAAGAGAGAGTCTTGGATCATGCCCTTGGGAGTTGTT
ATTTTGCTCAATCATTTCTTGGAGCTACTTTGTTAGATGTTGCTAGAGGCAAGGAGTATAGGACGATAATGAAGGAGTTGCTTCTGTATCAGTGGTTGAGGGAGATTTTT
GTGACAAGTAGTGCACGTACAATTTGTAGGGTCTGGAAGGGTGGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTGCGGGAATTCCAAGCCAAGTCAATTGAGAATTACAAATTATGAGCTGTGTATGATTTTTGATAACAAGGAGAAAACTGAAGGATGGTCAATAGTTGAAAAACACGA
TAAGGACTGTACTTTGAACAACCACAACCATACAGAATCCCAAGTAGGGATATCAGATGATGATGCAGGGGGAGGTAATGGTTCCAGTGGTTCTGATAGCACGGAGGCTT
CATCTCAACAAACAGGAACTAGACCATCCTCCTCTTCACATTCACGAAAGTCTTTAAAGAGAAGATGCAGCGATGATCTCATTGTGCAAATAGTGAGTGTCATGGCTGCT
AACGTTGCTCGAATAGCTGATGCATTGTCAGACAGGCCAACTTGCTTAGATCAAGTGTTTGATGTTGTTCAAACCATGCCTGGGTTGGACGAGGATCTGATCCTCGACGC
ATATAGACTGGCTTGGCTGCAGACCGGTCATTTGATTGGATATATGAGAGGGGAAATTGAGATTGCTTACAGGAAAGAGAGAGTCTTGGATCATGCCCTTGGGAGTTGTT
ATTTTGCTCAATCATTTCTTGGAGCTACTTTGTTAGATGTTGCTAGAGGCAAGGAGTATAGGACGATAATGAAGGAGTTGCTTCTGTATCAGTGGTTGAGGGAGATTTTT
GTGACAAGTAGTGCACGTACAATTTGTAGGGTCTGGAAGGGTGGTTGA
Protein sequenceShow/hide protein sequence
MCGNSKPSQLRITNYELCMIFDNKEKTEGWSIVEKHDKDCTLNNHNHTESQVGISDDDAGGGNGSSGSDSTEASSQQTGTRPSSSSHSRKSLKRRCSDDLIVQIVSVMAA
NVARIADALSDRPTCLDQVFDVVQTMPGLDEDLILDAYRLAWLQTGHLIGYMRGEIEIAYRKERVLDHALGSCYFAQSFLGATLLDVARGKEYRTIMKELLLYQWLREIF
VTSSARTICRVWKGG