; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy3G024230 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy3G024230
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionGBBH-like_N domain-containing protein
Genome locationGy14Chr3:24252573..24254261
RNA-Seq ExpressionCsGy3G024230
SyntenyCsGy3G024230
Gene Ontology termsGO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR010376 - Gamma-butyrobetaine hydroxylase-like, N-terminal
IPR038492 - GBBH-like, N-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8650784.1 hypothetical protein Csa_017608 [Cucumis sativus]8.54e-139100Show/hide
Query:  MCGSRLMHVVERGMFARNGGKYTKLGKRVRYCTDTIAFAQLCGTVLMELVQFVWGNSYWIWASHKGKATSLGVRACVRTPTRFVVEVKFADGSVFNLSAE
        MCGSRLMHVVERGMFARNGGKYTKLGKRVRYCTDTIAFAQLCGTVLMELVQFVWGNSYWIWASHKGKATSLGVRACVRTPTRFVVEVKFADGSVFNLSAE
Subjt:  MCGSRLMHVVERGMFARNGGKYTKLGKRVRYCTDTIAFAQLCGTVLMELVQFVWGNSYWIWASHKGKATSLGVRACVRTPTRFVVEVKFADGSVFNLSAE

Query:  FLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYFFHLGSNKFTLLRNYVKTLKKHGLSRDPPKRK
        FLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYFFHLGSNKFTLLRNYVKTLKKHGLSRDPPKRK
Subjt:  FLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYFFHLGSNKFTLLRNYVKTLKKHGLSRDPPKRK

KAG7017806.1 hypothetical protein SDJN02_19672 [Cucurbita argyrosperma subsp. argyrosperma]1.94e-6897.17Show/hide
Query:  VEVKFADGSVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYFFHLGSNKFTLLRNYVKTLKKHGLSR
        VEVKFADGSVFNLSAEFLRVYSPAADAK+RSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYF+HLGSNKFT LRNYVKTLKKHGLSR
Subjt:  VEVKFADGSVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYFFHLGSNKFTLLRNYVKTLKKHGLSR

Query:  DPPKRK
        DPPKRK
Subjt:  DPPKRK

XP_022935010.1 uncharacterized protein LOC111442002 isoform X1 [Cucurbita moschata]1.94e-6897.17Show/hide
Query:  VEVKFADGSVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYFFHLGSNKFTLLRNYVKTLKKHGLSR
        VEVKFADGSVFNLSAEFLRVYSPAADAK+RSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYF+HLGSNKFT LRNYVKTLKKHGLSR
Subjt:  VEVKFADGSVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYFFHLGSNKFTLLRNYVKTLKKHGLSR

Query:  DPPKRK
        DPPKRK
Subjt:  DPPKRK

XP_022984013.1 uncharacterized protein LOC111482460 isoform X1 [Cucurbita maxima]1.94e-6897.17Show/hide
Query:  VEVKFADGSVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYFFHLGSNKFTLLRNYVKTLKKHGLSR
        VEVKFADGSVFNLSAEFLRVYSPAADAK+RSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYF+HLGSNKFT LRNYVKTLKKHGLSR
Subjt:  VEVKFADGSVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYFFHLGSNKFTLLRNYVKTLKKHGLSR

Query:  DPPKRK
        DPPKRK
Subjt:  DPPKRK

XP_031737737.1 uncharacterized protein LOC101219823 isoform X1 [Cucumis sativus]1.07e-68100Show/hide
Query:  VEVKFADGSVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYFFHLGSNKFTLLRNYVKTLKKHGLSR
        VEVKFADGSVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYFFHLGSNKFTLLRNYVKTLKKHGLSR
Subjt:  VEVKFADGSVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYFFHLGSNKFTLLRNYVKTLKKHGLSR

Query:  DPPKRK
        DPPKRK
Subjt:  DPPKRK

TrEMBL top hitse value%identityAlignment
A0A0A0LB75 GBBH-like_N domain-containing protein3.48e-9982.61Show/hide
Query:  MHVVERGMFARNGGKYTKLGKRVRYCTDTIAFAQLCGTVLMELVQFVWGNSYWIWASHKGKATSLGVRACVRTPTRFVVEVKFADGSVFNLSAEFLRVYS
        MHVVERGMFARNGGKYTKLGKRVRYCTDTIAFAQLCGTVLMEL +               + +S  V +C+       VEVKFADGSVFNLSAEFLRVYS
Subjt:  MHVVERGMFARNGGKYTKLGKRVRYCTDTIAFAQLCGTVLMELVQFVWGNSYWIWASHKGKATSLGVRACVRTPTRFVVEVKFADGSVFNLSAEFLRVYS

Query:  PAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYFFHLGSNKFTLLRNYVKTLKKHGLSRDPPKRK
        PAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYFFHLGSNKFTLLRNYVKTLKKHGLSRDPPKRK
Subjt:  PAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYFFHLGSNKFTLLRNYVKTLKKHGLSRDPPKRK

A0A1S3BNF3 uncharacterized protein LOC103491759 isoform X15.55e-6896.23Show/hide
Query:  VEVKFADGSVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYFFHLGSNKFTLLRNYVKTLKKHGLSR
        VEVKFADGSVFNLSAEFLR+YSPAADAK+RSIGGEKVI GRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYF+HLGSNKFTLLRNYVKTLKKHGLSR
Subjt:  VEVKFADGSVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYFFHLGSNKFTLLRNYVKTLKKHGLSR

Query:  DPPKRK
        DPPKRK
Subjt:  DPPKRK

A0A6J1DIS9 uncharacterized protein LOC111021468 isoform X13.51e-6691.51Show/hide
Query:  VEVKFADGSVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYFFHLGSNKFTLLRNYVKTLKKHGLSR
        VEVKFADGS+FNLSAEFLR+YSPA DAK+RSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLH+TGIYSWDYF+HLGSNKFTLLRNYV+TL+KHGLSR
Subjt:  VEVKFADGSVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYFFHLGSNKFTLLRNYVKTLKKHGLSR

Query:  DPPKRK
        DPP+RK
Subjt:  DPPKRK

A0A6J1F9C8 uncharacterized protein LOC111442002 isoform X19.40e-6997.17Show/hide
Query:  VEVKFADGSVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYFFHLGSNKFTLLRNYVKTLKKHGLSR
        VEVKFADGSVFNLSAEFLRVYSPAADAK+RSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYF+HLGSNKFT LRNYVKTLKKHGLSR
Subjt:  VEVKFADGSVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYFFHLGSNKFTLLRNYVKTLKKHGLSR

Query:  DPPKRK
        DPPKRK
Subjt:  DPPKRK

A0A6J1J9C4 uncharacterized protein LOC111482460 isoform X19.40e-6997.17Show/hide
Query:  VEVKFADGSVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYFFHLGSNKFTLLRNYVKTLKKHGLSR
        VEVKFADGSVFNLSAEFLRVYSPAADAK+RSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYF+HLGSNKFT LRNYVKTLKKHGLSR
Subjt:  VEVKFADGSVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYFFHLGSNKFTLLRNYVKTLKKHGLSR

Query:  DPPKRK
        DPPKRK
Subjt:  DPPKRK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G27340.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: oxidation reduction; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Gamma-butyrobetaine dioxygenase/Trimethyllysine dioxygenase, N-terminal (InterPro:IPR010376); Has 1035 Blast hits to 1035 proteins in 399 species: Archae - 0; Bacteria - 765; Metazoa - 0; Fungi - 0; Plants - 39; Viruses - 0; Other Eukaryotes - 231 (source: NCBI BLink).8.8e-4678.64Show/hide
Query:  VEVKFADGSVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYFFHLGSNKFTLLRNYVKTLKKHGLSR
        VEV++ADG+ FN S+EFLR++SPAAD K+RSIGGEKVISGRR+VGIMSAEPVGNYGVR++FDDLHRTGIY WDYF+ LGSNKF L+RNY+KTL+KH LSR
Subjt:  VEVKFADGSVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYFFHLGSNKFTLLRNYVKTLKKHGLSR

Query:  DPP
        +PP
Subjt:  DPP

AT3G27340.2 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: oxidation reduction; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Gamma-butyrobetaine dioxygenase/Trimethyllysine dioxygenase, N-terminal (InterPro:IPR010376); Has 693 Blast hits to 693 proteins in 282 species: Archae - 0; Bacteria - 530; Metazoa - 0; Fungi - 0; Plants - 39; Viruses - 0; Other Eukaryotes - 124 (source: NCBI BLink).3.1e-2281.36Show/hide
Query:  VEVKFADGSVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRI
        VEV++ADG+ FN S+EFLR++SPAAD K+RSIGGEKVISGRR+VGIMSAEPVGNYGVRI
Subjt:  VEVKFADGSVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRI

AT3G27340.3 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: oxidation reduction; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Gamma-butyrobetaine dioxygenase/Trimethyllysine dioxygenase, N-terminal (InterPro:IPR010376); Has 945 Blast hits to 945 proteins in 390 species: Archae - 0; Bacteria - 710; Metazoa - 0; Fungi - 0; Plants - 39; Viruses - 0; Other Eukaryotes - 196 (source: NCBI BLink).4.7e-3972.82Show/hide
Query:  VEVKFADGSVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYFFHLGSNKFTLLRNYVKTLKKHGLSR
        VEV++ADG+ FN S+EFLR++SPAAD K+RSIGGEKVISGRR+VGIMSAEPVGNYGV        RTGIY WDYF+ LGSNKF L+RNY+KTL+KH LSR
Subjt:  VEVKFADGSVFNLSAEFLRVYSPAADAKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYFFHLGSNKFTLLRNYVKTLKKHGLSR

Query:  DPP
        +PP
Subjt:  DPP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTGGGAGCAGATTGATGCATGTGGTAGAGAGGGGGATGTTTGCTAGGAATGGGGGAAAGTACACCAAGCTTGGTAAGAGGGTTAGGTACTGCACTGATACAATTGC
CTTTGCGCAGCTCTGTGGCACAGTTTTGATGGAGCTGGTGCAGTTTGTGTGGGGGAATAGCTATTGGATTTGGGCAAGTCATAAAGGCAAAGCCACTTCTTTGGGAGTGC
GCGCATGTGTTAGAACTCCAACAAGATTTGTGGTAGAGGTTAAATTTGCAGATGGAAGTGTGTTCAACTTGTCAGCTGAGTTCTTGAGAGTATATAGTCCAGCTGCTGAT
GCTAAAATCCGGTCAATTGGGGGTGAAAAGGTAATCTCTGGACGACGTCATGTAGGTATTATGTCTGCTGAACCAGTTGGGAACTATGGGGTAAGGATCCTTTTTGACGA
TTTGCATAGAACTGGGATTTATTCGTGGGATTATTTCTTTCATCTTGGGAGCAACAAATTCACTCTCTTGAGAAATTATGTTAAAACACTGAAGAAGCATGGGCTGAGCC
GAGATCCACCAAAGAGAAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGTGTGGGAGCAGATTGATGCATGTGGTAGAGAGGGGGATGTTTGCTAGGAATGGGGGAAAGTACACCAAGCTTGGTAAGAGGGTTAGGTACTGCACTGATACAATTGC
CTTTGCGCAGCTCTGTGGCACAGTTTTGATGGAGCTGGTGCAGTTTGTGTGGGGGAATAGCTATTGGATTTGGGCAAGTCATAAAGGCAAAGCCACTTCTTTGGGAGTGC
GCGCATGTGTTAGAACTCCAACAAGATTTGTGGTAGAGGTTAAATTTGCAGATGGAAGTGTGTTCAACTTGTCAGCTGAGTTCTTGAGAGTATATAGTCCAGCTGCTGAT
GCTAAAATCCGGTCAATTGGGGGTGAAAAGGTAATCTCTGGACGACGTCATGTAGGTATTATGTCTGCTGAACCAGTTGGGAACTATGGGGTAAGGATCCTTTTTGACGA
TTTGCATAGAACTGGGATTTATTCGTGGGATTATTTCTTTCATCTTGGGAGCAACAAATTCACTCTCTTGAGAAATTATGTTAAAACACTGAAGAAGCATGGGCTGAGCC
GAGATCCACCAAAGAGAAAATGA
Protein sequenceShow/hide protein sequence
MCGSRLMHVVERGMFARNGGKYTKLGKRVRYCTDTIAFAQLCGTVLMELVQFVWGNSYWIWASHKGKATSLGVRACVRTPTRFVVEVKFADGSVFNLSAEFLRVYSPAAD
AKIRSIGGEKVISGRRHVGIMSAEPVGNYGVRILFDDLHRTGIYSWDYFFHLGSNKFTLLRNYVKTLKKHGLSRDPPKRK