; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G06530 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G06530
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionDihydroxy-acid dehydratase
Genome locationChr4:4534299..4535231
RNA-Seq ExpressionCSPI04G06530
SyntenyCSPI04G06530
Gene Ontology termsNA
InterPro domainsIPR025322 - Protein of unknown function DUF4228, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8649212.1 hypothetical protein Csa_014712 [Cucumis sativus]2.7e-11199.5Show/hide
Query:  MSSAINEVEEKALKIAMGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPISEPLDAGKSYFLLPLSQSTNDGE
        MSSAINEVEEKALKIAMGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLP+SEPLDAGKSYFLLPLSQSTNDGE
Subjt:  MSSAINEVEEKALKIAMGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPISEPLDAGKSYFLLPLSQSTNDGE

Query:  SPLPVPPPSKDVGSESGLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQM
        SPLPVPPPSKDVGSESGLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQM
Subjt:  SPLPVPPPSKDVGSESGLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQM

Query:  K
        K
Subjt:  K

XP_008451927.1 PREDICTED: uncharacterized protein LOC103493080 [Cucumis melo]2.6e-9391.01Show/hide
Query:  MGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPISEPLDAGKSYFLLPLSQSTNDGESPLPVPPPSKDVGSES
        MGNCSLKGMAVDC KPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLP SEPLDAGKSYFLLPLSQ TND ES  P P PSKD+GSES
Subjt:  MGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPISEPLDAGKSYFLLPLSQSTNDGESPLPVPPPSKDVGSES

Query:  GLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQMKEFNS
        GLEVLPA GNGVWRVKLVIDTKQLGEILAEEGNTEALIER+RAAAATAAVQSPRRGKI GWKPMWGNW KFFP+D GN+NKAQ+KEFNS
Subjt:  GLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQMKEFNS

XP_011653223.1 uncharacterized protein LOC105435189 [Cucumis sativus]1.9e-11299.02Show/hide
Query:  MSSAINEVEEKALKIAMGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPISEPLDAGKSYFLLPLSQSTNDGE
        MSSAINEVEEKALKIAMGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLP+SEPLDAGKSYFLLPLSQSTNDGE
Subjt:  MSSAINEVEEKALKIAMGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPISEPLDAGKSYFLLPLSQSTNDGE

Query:  SPLPVPPPSKDVGSESGLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQM
        SPLPVPPPSKDVGSESGLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQM
Subjt:  SPLPVPPPSKDVGSESGLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQM

Query:  KEFN
        K FN
Subjt:  KEFN

XP_022985422.1 uncharacterized protein LOC111483432 [Cucurbita maxima]5.9e-8280.95Show/hide
Query:  MGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPISEPLDAGKSYFLLPLSQSTNDGESPLPVPPPSKDVGSES
        MGNCSLKG+A DCEKPIRILTDSG+IINFHGPKQV QIL NYPPG+YGVFRRPNLSSPLPISEPLDAGKSYFLLPLS++     S       + D+ + S
Subjt:  MGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPISEPLDAGKSYFLLPLSQSTNDGESPLPVPPPSKDVGSES

Query:  GLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQMKEFNS
        GLEVLP GG+G+WRVKLVIDTKQLGEILAE+GNTEALIERMRAAAATAAVQSPRR KIGGWKP WGNW KFFPIDVGN+NKAQMK+F+S
Subjt:  GLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQMKEFNS

XP_038907067.1 uncharacterized protein LOC120092893 [Benincasa hispida]7.0e-8381.44Show/hide
Query:  MGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPISEPLDAGKSYFLLPLSQ-----STNDGESPLPVPPPSKD
        MGNCSLKGM VDCEKPIRILTDSG+IINFHGPKQVHQILNNYPPG+YGVFRRPNLSSPLPISEPLDAGKSYFLLPLS+       +DGE     PPP K+
Subjt:  MGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPISEPLDAGKSYFLLPLSQ-----STNDGESPLPVPPPSKD

Query:  VGSESGLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQMKEFNS
        +GS SGLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAA    +SP+RGKIGGWK  WGN  KFFPIDVGN+NKAQ+K+F++
Subjt:  VGSESGLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQMKEFNS

TrEMBL top hitse value%identityAlignment
A0A0A0KYF7 Uncharacterized protein9.2e-11399.02Show/hide
Query:  MSSAINEVEEKALKIAMGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPISEPLDAGKSYFLLPLSQSTNDGE
        MSSAINEVEEKALKIAMGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLP+SEPLDAGKSYFLLPLSQSTNDGE
Subjt:  MSSAINEVEEKALKIAMGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPISEPLDAGKSYFLLPLSQSTNDGE

Query:  SPLPVPPPSKDVGSESGLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQM
        SPLPVPPPSKDVGSESGLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQM
Subjt:  SPLPVPPPSKDVGSESGLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQM

Query:  KEFN
        K FN
Subjt:  KEFN

A0A1S3BTS2 uncharacterized protein LOC1034930801.2e-9391.01Show/hide
Query:  MGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPISEPLDAGKSYFLLPLSQSTNDGESPLPVPPPSKDVGSES
        MGNCSLKGMAVDC KPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLP SEPLDAGKSYFLLPLSQ TND ES  P P PSKD+GSES
Subjt:  MGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPISEPLDAGKSYFLLPLSQSTNDGESPLPVPPPSKDVGSES

Query:  GLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQMKEFNS
        GLEVLPA GNGVWRVKLVIDTKQLGEILAEEGNTEALIER+RAAAATAAVQSPRRGKI GWKPMWGNW KFFP+D GN+NKAQ+KEFNS
Subjt:  GLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQMKEFNS

A0A5A7TTS6 Uncharacterized protein1.2e-9391.01Show/hide
Query:  MGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPISEPLDAGKSYFLLPLSQSTNDGESPLPVPPPSKDVGSES
        MGNCSLKGMAVDC KPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLP SEPLDAGKSYFLLPLSQ TND ES  P P PSKD+GSES
Subjt:  MGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPISEPLDAGKSYFLLPLSQSTNDGESPLPVPPPSKDVGSES

Query:  GLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQMKEFNS
        GLEVLPA GNGVWRVKLVIDTKQLGEILAEEGNTEALIER+RAAAATAAVQSPRRGKI GWKPMWGNW KFFP+D GN+NKAQ+KEFNS
Subjt:  GLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQMKEFNS

A0A6J1EZE8 uncharacterized protein LOC1114376145.4e-8180.42Show/hide
Query:  MGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPISEPLDAGKSYFLLPLSQSTNDGESPLPVPPPSKDVGSES
        MGNCSLKG+A DCEKPIRILTDSG+IINFHGPKQV QIL NYPPG+YGVFRRPNLSSPLPISE LDAGKSYFLLPLS++     S       ++D+ S S
Subjt:  MGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPISEPLDAGKSYFLLPLSQSTNDGESPLPVPPPSKDVGSES

Query:  GLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQMKEFNS
        GLEVLP GG+G+WRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRR KIGGWKP WGNW KF PIDVGN+NKAQ+K+F+S
Subjt:  GLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQMKEFNS

A0A6J1J4V0 uncharacterized protein LOC1114834322.9e-8280.95Show/hide
Query:  MGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPISEPLDAGKSYFLLPLSQSTNDGESPLPVPPPSKDVGSES
        MGNCSLKG+A DCEKPIRILTDSG+IINFHGPKQV QIL NYPPG+YGVFRRPNLSSPLPISEPLDAGKSYFLLPLS++     S       + D+ + S
Subjt:  MGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPISEPLDAGKSYFLLPLSQSTNDGESPLPVPPPSKDVGSES

Query:  GLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQMKEFNS
        GLEVLP GG+G+WRVKLVIDTKQLGEILAE+GNTEALIERMRAAAATAAVQSPRR KIGGWKP WGNW KFFPIDVGN+NKAQMK+F+S
Subjt:  GLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQMKEFNS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G10120.1 unknown protein1.6e-0529.03Show/hide
Query:  MGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPISEPLDAGKSYFLLPLSQSTND--------GESPLPVPPP
        MGNC      V  +K I+I+ + G ++ + GP +VH IL  + P  Y +F     +  L     L  G+ Y+LLP  Q TN          +     P  
Subjt:  MGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPISEPLDAGKSYFLLPLSQSTND--------GESPLPVPPP

Query:  SKDVGSESGL----EVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRA
         K+   E  L    +      NGV RVK+V+  ++L E L + G+   ++ R  A
Subjt:  SKDVGSESGL----EVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRA

AT3G61920.1 unknown protein1.9e-1435.67Show/hide
Query:  MGNCSLKG------MAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPG-IYGVFRRPNLSSPLPISEPLDAGKSYFLLPLSQSTN-----DGESPLP
        MGNC  KG      +    +  I+++T +G ++  H P     I N +P   I+      + S PL   E L  G  Y+LLPLS S       D    L 
Subjt:  MGNCSLKG------MAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPG-IYGVFRRPNLSSPLPISEPLDAGKSYFLLPLSQSTN-----DGESPLP

Query:  VPPPSKDVGSESGLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAA
          P     G    +  L  GG GVW+V+LVI  +QL EILAE+  TEAL+E +R  A
Subjt:  VPPPSKDVGSESGLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAA

AT4G10910.1 unknown protein4.2e-0958.82Show/hide
Query:  LEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQS
        ++V P   NGVW+ K+VI +KQL EILA EGNT ALI+++R AAA A V S
Subjt:  LEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCTCCGCCATTAACGAGGTGGAAGAAAAAGCTTTGAAAATCGCAATGGGAAACTGTTCTCTCAAAGGAATGGCGGTGGATTGCGAGAAGCCCATCAGAATCTTAAC
GGATTCCGGCCACATAATCAACTTCCACGGCCCTAAACAAGTTCATCAAATCCTCAACAACTATCCTCCCGGCATCTATGGCGTTTTCCGGCGACCCAATCTATCTTCTC
CCTTACCCATCTCCGAGCCCCTTGACGCCGGAAAATCCTACTTTCTTCTCCCGCTTTCCCAATCCACTAACGACGGAGAGTCACCGCTGCCGGTGCCGCCGCCGTCGAAG
GATGTGGGAAGTGAGTCGGGTCTGGAAGTGCTCCCGGCGGGTGGCAACGGCGTTTGGAGGGTGAAATTGGTGATCGATACGAAGCAGTTGGGGGAAATTTTGGCGGAGGA
AGGGAATACGGAGGCGTTGATTGAGAGGATGAGAGCGGCAGCGGCGACGGCGGCGGTGCAAAGTCCACGGCGGGGGAAGATCGGAGGGTGGAAGCCGATGTGGGGGAATT
GGTTCAAATTTTTTCCAATTGACGTTGGAAATAGTAATAAAGCACAAATGAAAGAATTTAATTCTTGA
mRNA sequenceShow/hide mRNA sequence
AAAATTTAAAAAAACAAAGTTCAAATTTTCACTATATATATAAAGAAAAGCCACTGATGAGCTCCGCCATTAACGAGGTGGAAGAAAAAGCTTTGAAAATCGCAATGGGA
AACTGTTCTCTCAAAGGAATGGCGGTGGATTGCGAGAAGCCCATCAGAATCTTAACGGATTCCGGCCACATAATCAACTTCCACGGCCCTAAACAAGTTCATCAAATCCT
CAACAACTATCCTCCCGGCATCTATGGCGTTTTCCGGCGACCCAATCTATCTTCTCCCTTACCCATCTCCGAGCCCCTTGACGCCGGAAAATCCTACTTTCTTCTCCCGC
TTTCCCAATCCACTAACGACGGAGAGTCACCGCTGCCGGTGCCGCCGCCGTCGAAGGATGTGGGAAGTGAGTCGGGTCTGGAAGTGCTCCCGGCGGGTGGCAACGGCGTT
TGGAGGGTGAAATTGGTGATCGATACGAAGCAGTTGGGGGAAATTTTGGCGGAGGAAGGGAATACGGAGGCGTTGATTGAGAGGATGAGAGCGGCAGCGGCGACGGCGGC
GGTGCAAAGTCCACGGCGGGGGAAGATCGGAGGGTGGAAGCCGATGTGGGGGAATTGGTTCAAATTTTTTCCAATTGACGTTGGAAATAGTAATAAAGCACAAATGAAAG
AATTTAATTCTTGAAATGGGTGTTCATATTCAACATAAACCCTAAGTATTTTCTTTCTAATTTGATTATTAGGAGAAAGTTTGTGTAAAATGTAATTTTTGAGCTTGTTT
GGTTGTATATAAGCTTGTATGTAAGGTGTGAGTTTTTAGGGCTCTCTTAATAGAGTTATAAAGTTATTATTATGTTGTGCTCTCAATTTTTTTATTTAATTGAAGACTTT
GTGAAAATATACAAAAGGATAATAATACCAGGGCCCTACCGGGTTCGAACCGG
Protein sequenceShow/hide protein sequence
MSSAINEVEEKALKIAMGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPISEPLDAGKSYFLLPLSQSTNDGESPLPVPPPSK
DVGSESGLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQMKEFNS