; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy4G006460 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy4G006460
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionDihydroxy-acid dehydratase
Genome locationGy14Chr4:4661056..4661846
RNA-Seq ExpressionCsGy4G006460
SyntenyCsGy4G006460
Gene Ontology termsNA
InterPro domainsIPR025322 - Protein of unknown function DUF4228, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8649212.1 hypothetical protein Csa_014712 [Cucumis sativus]9.88e-14599.5Show/hide
Query:  MSSAINEVEEKALKIAMGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPVSEPLDAGKSYFLLPLSQSTNDGE
        MSSAINEVEEKALKIAMGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPVSEPLDAGKSYFLLPLSQSTNDGE
Subjt:  MSSAINEVEEKALKIAMGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPVSEPLDAGKSYFLLPLSQSTNDGE

Query:  SPLPVPPPSKDVGSESGLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQM
        SPLPVPPPSKDVGSESGLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQM
Subjt:  SPLPVPPPSKDVGSESGLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQM

Query:  KV
        K 
Subjt:  KV

XP_008451927.1 PREDICTED: uncharacterized protein LOC103493080 [Cucumis melo]6.81e-11990.43Show/hide
Query:  MGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPVSEPLDAGKSYFLLPLSQSTNDGESPLPVPPPSKDVGSES
        MGNCSLKGMAVDC KPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLP SEPLDAGKSYFLLPLSQ TND ES  P P PSKD+GSES
Subjt:  MGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPVSEPLDAGKSYFLLPLSQSTNDGESPLPVPPPSKDVGSES

Query:  GLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQMKVFN
        GLEVLPA GNGVWRVKLVIDTKQLGEILAEEGNTEALIER+RAAAATAAVQSPRRGKI GWKPMWGNW KFFP+D GN+NKAQ+K FN
Subjt:  GLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQMKVFN

XP_011653223.1 uncharacterized protein LOC105435189 [Cucumis sativus]4.69e-148100Show/hide
Query:  MSSAINEVEEKALKIAMGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPVSEPLDAGKSYFLLPLSQSTNDGE
        MSSAINEVEEKALKIAMGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPVSEPLDAGKSYFLLPLSQSTNDGE
Subjt:  MSSAINEVEEKALKIAMGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPVSEPLDAGKSYFLLPLSQSTNDGE

Query:  SPLPVPPPSKDVGSESGLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQM
        SPLPVPPPSKDVGSESGLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQM
Subjt:  SPLPVPPPSKDVGSESGLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQM

Query:  KVFNY
        KVFNY
Subjt:  KVFNY

XP_022985422.1 uncharacterized protein LOC111483432 [Cucurbita maxima]1.65e-10480.32Show/hide
Query:  MGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPVSEPLDAGKSYFLLPLSQSTNDGESPLPVPPPSKDVGSES
        MGNCSLKG+A DCEKPIRILTDSG+IINFHGPKQV QIL NYPPG+YGVFRRPNLSSPLP+SEPLDAGKSYFLLPLS++     S       + D+ + S
Subjt:  MGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPVSEPLDAGKSYFLLPLSQSTNDGESPLPVPPPSKDVGSES

Query:  GLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQMKVFN
        GLEVLP GG+G+WRVKLVIDTKQLGEILAE+GNTEALIERMRAAAATAAVQSPRR KIGGWKP WGNW KFFPIDVGN+NKAQMK F+
Subjt:  GLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQMKVFN

XP_038907067.1 uncharacterized protein LOC120092893 [Benincasa hispida]7.51e-10681.35Show/hide
Query:  MGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPVSEPLDAGKSYFLLPLSQ-----STNDGESPLPVPPPSKD
        MGNCSLKGM VDCEKPIRILTDSG+IINFHGPKQVHQILNNYPPG+YGVFRRPNLSSPLP+SEPLDAGKSYFLLPLS+       +DGE P    PP K+
Subjt:  MGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPVSEPLDAGKSYFLLPLSQ-----STNDGESPLPVPPPSKD

Query:  VGSESGLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQMKVFN
        +GS SGLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAA    +SP+RGKIGGWK  WGN  KFFPIDVGN+NKAQ+K F+
Subjt:  VGSESGLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQMKVFN

TrEMBL top hitse value%identityAlignment
A0A0A0KYF7 Uncharacterized protein2.27e-148100Show/hide
Query:  MSSAINEVEEKALKIAMGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPVSEPLDAGKSYFLLPLSQSTNDGE
        MSSAINEVEEKALKIAMGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPVSEPLDAGKSYFLLPLSQSTNDGE
Subjt:  MSSAINEVEEKALKIAMGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPVSEPLDAGKSYFLLPLSQSTNDGE

Query:  SPLPVPPPSKDVGSESGLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQM
        SPLPVPPPSKDVGSESGLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQM
Subjt:  SPLPVPPPSKDVGSESGLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQM

Query:  KVFNY
        KVFNY
Subjt:  KVFNY

A0A1S3BTS2 uncharacterized protein LOC1034930803.30e-11990.43Show/hide
Query:  MGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPVSEPLDAGKSYFLLPLSQSTNDGESPLPVPPPSKDVGSES
        MGNCSLKGMAVDC KPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLP SEPLDAGKSYFLLPLSQ TND ES  P P PSKD+GSES
Subjt:  MGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPVSEPLDAGKSYFLLPLSQSTNDGESPLPVPPPSKDVGSES

Query:  GLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQMKVFN
        GLEVLPA GNGVWRVKLVIDTKQLGEILAEEGNTEALIER+RAAAATAAVQSPRRGKI GWKPMWGNW KFFP+D GN+NKAQ+K FN
Subjt:  GLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQMKVFN

A0A5A7TTS6 Uncharacterized protein3.30e-11990.43Show/hide
Query:  MGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPVSEPLDAGKSYFLLPLSQSTNDGESPLPVPPPSKDVGSES
        MGNCSLKGMAVDC KPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLP SEPLDAGKSYFLLPLSQ TND ES  P P PSKD+GSES
Subjt:  MGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPVSEPLDAGKSYFLLPLSQSTNDGESPLPVPPPSKDVGSES

Query:  GLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQMKVFN
        GLEVLPA GNGVWRVKLVIDTKQLGEILAEEGNTEALIER+RAAAATAAVQSPRRGKI GWKPMWGNW KFFP+D GN+NKAQ+K FN
Subjt:  GLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQMKVFN

A0A6J1EZE8 uncharacterized protein LOC1114376143.77e-10379.79Show/hide
Query:  MGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPVSEPLDAGKSYFLLPLSQSTNDGESPLPVPPPSKDVGSES
        MGNCSLKG+A DCEKPIRILTDSG+IINFHGPKQV QIL NYPPG+YGVFRRPNLSSPLP+SE LDAGKSYFLLPLS++     S       ++D+ S S
Subjt:  MGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPVSEPLDAGKSYFLLPLSQSTNDGESPLPVPPPSKDVGSES

Query:  GLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQMKVFN
        GLEVLP GG+G+WRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRR KIGGWKP WGNW KF PIDVGN+NKAQ+K F+
Subjt:  GLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQMKVFN

A0A6J1J4V0 uncharacterized protein LOC1114834327.96e-10580.32Show/hide
Query:  MGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPVSEPLDAGKSYFLLPLSQSTNDGESPLPVPPPSKDVGSES
        MGNCSLKG+A DCEKPIRILTDSG+IINFHGPKQV QIL NYPPG+YGVFRRPNLSSPLP+SEPLDAGKSYFLLPLS++     S       + D+ + S
Subjt:  MGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPVSEPLDAGKSYFLLPLSQSTNDGESPLPVPPPSKDVGSES

Query:  GLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQMKVFN
        GLEVLP GG+G+WRVKLVIDTKQLGEILAE+GNTEALIERMRAAAATAAVQSPRR KIGGWKP WGNW KFFPIDVGN+NKAQMK F+
Subjt:  GLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQMKVFN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G10120.1 unknown protein1.3e-0529.03Show/hide
Query:  MGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPVSEPLDAGKSYFLLPLSQSTND--------GESPLPVPPP
        MGNC      V  +K I+I+ + G ++ + GP +VH IL  + P  Y +F     +  L     L  G+ Y+LLP  Q TN          +     P  
Subjt:  MGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPVSEPLDAGKSYFLLPLSQSTND--------GESPLPVPPP

Query:  SKDVGSESGL----EVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRA
         K+   E  L    +      NGV RVK+V+  ++L E L + G+   ++ R  A
Subjt:  SKDVGSESGL----EVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRA

AT3G61920.1 unknown protein1.9e-1435.67Show/hide
Query:  MGNCSLKG------MAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPG-IYGVFRRPNLSSPLPVSEPLDAGKSYFLLPLSQSTN-----DGESPLP
        MGNC  KG      +    +  I+++T +G ++  H P     I N +P   I+      + S PL   E L  G  Y+LLPLS S       D    L 
Subjt:  MGNCSLKG------MAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPG-IYGVFRRPNLSSPLPVSEPLDAGKSYFLLPLSQSTN-----DGESPLP

Query:  VPPPSKDVGSESGLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAA
          P     G    +  L  GG GVW+V+LVI  +QL EILAE+  TEAL+E +R  A
Subjt:  VPPPSKDVGSESGLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAA

AT4G10910.1 unknown protein4.2e-0958.82Show/hide
Query:  LEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQS
        ++V P   NGVW+ K+VI +KQL EILA EGNT ALI+++R AAA A V S
Subjt:  LEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCTCCGCCATTAACGAGGTGGAAGAAAAAGCTTTGAAAATCGCAATGGGAAACTGTTCTCTCAAAGGAATGGCGGTGGATTGCGAGAAGCCCATCAGAATCTTAAC
GGATTCCGGCCATATAATCAACTTCCACGGCCCTAAACAAGTTCATCAAATCCTCAACAACTATCCTCCCGGTATCTATGGCGTTTTCCGGCGACCCAATCTATCTTCTC
CCTTACCCGTCTCCGAGCCCCTTGACGCCGGAAAATCCTACTTTCTTCTCCCGCTTTCCCAATCCACTAACGACGGAGAGTCACCGCTGCCGGTGCCGCCGCCGTCGAAG
GATGTGGGAAGTGAGTCGGGTCTGGAAGTGCTCCCGGCGGGTGGCAACGGCGTTTGGAGGGTGAAATTGGTGATCGATACGAAGCAGTTGGGGGAAATTTTGGCGGAGGA
AGGGAATACGGAGGCGTTGATTGAGAGGATGAGAGCAGCAGCGGCGACGGCGGCGGTGCAAAGTCCACGGCGGGGGAAGATCGGAGGGTGGAAGCCGATGTGGGGGAATT
GGTTCAAATTTTTTCCAATTGATGTTGGAAATAGTAATAAAGCACAAATGAAAGTATTTAATTATTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGCTCCGCCATTAACGAGGTGGAAGAAAAAGCTTTGAAAATCGCAATGGGAAACTGTTCTCTCAAAGGAATGGCGGTGGATTGCGAGAAGCCCATCAGAATCTTAAC
GGATTCCGGCCATATAATCAACTTCCACGGCCCTAAACAAGTTCATCAAATCCTCAACAACTATCCTCCCGGTATCTATGGCGTTTTCCGGCGACCCAATCTATCTTCTC
CCTTACCCGTCTCCGAGCCCCTTGACGCCGGAAAATCCTACTTTCTTCTCCCGCTTTCCCAATCCACTAACGACGGAGAGTCACCGCTGCCGGTGCCGCCGCCGTCGAAG
GATGTGGGAAGTGAGTCGGGTCTGGAAGTGCTCCCGGCGGGTGGCAACGGCGTTTGGAGGGTGAAATTGGTGATCGATACGAAGCAGTTGGGGGAAATTTTGGCGGAGGA
AGGGAATACGGAGGCGTTGATTGAGAGGATGAGAGCAGCAGCGGCGACGGCGGCGGTGCAAAGTCCACGGCGGGGGAAGATCGGAGGGTGGAAGCCGATGTGGGGGAATT
GGTTCAAATTTTTTCCAATTGATGTTGGAAATAGTAATAAAGCACAAATGAAAGTATTTAATTATTGAAATGGGTGTTCATATTCAACATAAACCCTAAGTATTTTCTTT
CTAATTTGATTATTAGGAGAAAGTTTGTGTAAAATGTAATTTTTGAGCTTGTTTGGTTGTATATAAGCTTGTATGTAAGGTGTGAGTTTTTAGGGCTCTCTTAATAGAGT
TATAAAGTTATTATTATGTTG
Protein sequenceShow/hide protein sequence
MSSAINEVEEKALKIAMGNCSLKGMAVDCEKPIRILTDSGHIINFHGPKQVHQILNNYPPGIYGVFRRPNLSSPLPVSEPLDAGKSYFLLPLSQSTNDGESPLPVPPPSK
DVGSESGLEVLPAGGNGVWRVKLVIDTKQLGEILAEEGNTEALIERMRAAAATAAVQSPRRGKIGGWKPMWGNWFKFFPIDVGNSNKAQMKVFNY