CuGenDBv2

CsGy4G008640 (gene) of Cucumber (Gy14) v2.1 genome

Gene ID: CsGy4G008640
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: GTP cyclohydrolase II isoform 1
Genome location: Gy14Chr4:6641033..6641647
RNA-Seq Expression: CsGy4G008640
Synteny: CsGy4G008640
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
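The genome location above can be read programmatically. The following is a minimal sketch, assuming the `Chrom:start..end` string uses 1-based inclusive coordinates (an assumption, not something the page states); the helper names `parse_location` and `span_length` are hypothetical and introduced only for illustration.

```python
import re


def parse_location(loc: str):
    """Split 'Chrom:start..end' into (chrom, start, end).

    Assumes 1-based, inclusive coordinates (not confirmed by the page).
    """
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(loc: str) -> int:
    """Number of bases covered by the location, inclusive of both ends."""
    _, start, end = parse_location(loc)
    return end - start + 1


chrom, start, end = parse_location("Gy14Chr4:6641033..6641647")
print(chrom, start, end, span_length("Gy14Chr4:6641033..6641647"))
```

Under the inclusive-coordinate assumption the span is 615 bases, which appears to agree with the length of the mRNA sequence listed in the Sequences section.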


Homology
GenBank top hits (e value, % identity, alignment)
KAA0045323.1 uncharacterized protein E6C27_scaffold316G00950 [Cucumis melo var. makuwa] | e value: 4.19e-88 | % identity: 89.1
Query:  MKAGTSNNQKALDSRDMVNRGGVELIRNCDLPPPQKVFKSGMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELR
        MKA TSNN+++LDSRDMV R GVELIRNCDLPPPQKVFKSGMEEK+ELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEAR++FCYRQSLKLLE+R
Subjt:  MKAGTSNNQKALDSRDMVNRGGVELIRNCDLPPPQKVFKSGMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELR

Query:  VFKLQKQEEEE--EEEEEEESEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS
        V KLQKQE EE  EEEEEEE ++NGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS
Subjt:  VFKLQKQEEEE--EEEEEEESEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS

KAG6600670.1 hypothetical protein SDJN03_05903, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.84e-60 | % identity: 66.86
Query:  NQKALDSRDMV----NRGGVELIRNCDLPPPQKVFKS-------------------GMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEA
        N +ALDS DM+     + GV+LIRNCDLPPPQK+F +                   GMEEK+ELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEA
Subjt:  NQKALDSRDMV----NRGGVELIRNCDLPPPQKVFKS-------------------GMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEA

Query:  RMVFCYRQSLKLLELRVFKLQKQEEEEEEEEEEESEKNG-GNGGMKWVWALAICLSVVGVGFLLGYTCN
        R++FCYRQSLKL+ELRV KL+K++    EEEE+    NG GN G+KWVWALAICLSVVGVG LLGYTC+
Subjt:  RMVFCYRQSLKLLELRVFKLQKQEEEEEEEEEEESEKNG-GNGGMKWVWALAICLSVVGVGFLLGYTCN

XP_008455680.1 PREDICTED: uncharacterized protein LOC103495794 [Cucumis melo] | e value: 2.07e-88 | % identity: 89.74
Query:  MKAGTSNNQKALDSRDMVNRGGVELIRNCDLPPPQKVFKSGMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELR
        MKA TSNN+++LDSRDMV R GVELIRNCDLPPPQKVFKSGMEEK+ELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEAR++FCYRQSLKLLELR
Subjt:  MKAGTSNNQKALDSRDMVNRGGVELIRNCDLPPPQKVFKSGMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELR

Query:  VFKLQKQEEEE--EEEEEEESEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS
        V KLQKQE EE  EEEEEEE ++NGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS
Subjt:  VFKLQKQEEEE--EEEEEEESEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS

XP_031739981.1 uncharacterized protein LOC105435202 [Cucumis sativus] | e value: 3.53e-100 | % identity: 99.35
Query:  MKAGTSNNQKALDSRDMVNRGGVELIRNCDLPPPQKVFKSGMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELR
        MKAGTSNNQKALDSRDMVNRGGVELIRNCDLPPPQKVFKSGMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELR
Subjt:  MKAGTSNNQKALDSRDMVNRGGVELIRNCDLPPPQKVFKSGMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELR

Query:  VFKLQKQEEEEEEEEEEESEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS
        VFKLQKQEEEEEEEEEEES+KNGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS
Subjt:  VFKLQKQEEEEEEEEEEESEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS

XP_038904267.1 uncharacterized protein LOC120090620 [Benincasa hispida] | e value: 3.39e-75 | % identity: 80.65
Query:  MKAGTSNNQKALDSR-DMVNRGGVELIRNCDLPPPQKVFKSGMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLEL
        MKA  S N KALDS  +MV+R GV+L+RNCDLPPPQKV +SGMEEK+ELLKALRLSQTRAREAERKAAKLMEE+DCISRAFEDEAR++FCYRQS+KLL+L
Subjt:  MKAGTSNNQKALDSR-DMVNRGGVELIRNCDLPPPQKVFKSGMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLEL

Query:  RVFKLQKQEEEEEEEEEEESEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS
        RV KLQK  EEEEEEEEEE + NGG+ GMKWVWALAICLSVVGVG LL YTCN S
Subjt:  RVFKLQKQEEEEEEEEEEESEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KZ57 Uncharacterized protein | e value: 8.64e-92 | % identity: 78.06
Query:  MKAGTSNNQKALDSRDMVNRGGVELIRNCDLPPPQKVFKSGMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELR
        MKAGTSNNQKALDSRDMVNRGGVELIRNCDLPPPQKVFKSGMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELR
Subjt:  MKAGTSNNQKALDSRDMVNRGGVELIRNCDLPPPQKVFKSGMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELR

Query:  VFKLQKQEEEEEEEEEEE------------------------------------------SEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS
        VFKLQKQEEEEEEEEEEE                                          S+KNGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS
Subjt:  VFKLQKQEEEEEEEEEEE------------------------------------------SEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS

A0A1S3C2P7 uncharacterized protein LOC103495794 | e value: 1.00e-88 | % identity: 89.74
Query:  MKAGTSNNQKALDSRDMVNRGGVELIRNCDLPPPQKVFKSGMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELR
        MKA TSNN+++LDSRDMV R GVELIRNCDLPPPQKVFKSGMEEK+ELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEAR++FCYRQSLKLLELR
Subjt:  MKAGTSNNQKALDSRDMVNRGGVELIRNCDLPPPQKVFKSGMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELR

Query:  VFKLQKQEEEE--EEEEEEESEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS
        V KLQKQE EE  EEEEEEE ++NGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS
Subjt:  VFKLQKQEEEE--EEEEEEESEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS

A0A5A7TP56 Uncharacterized protein | e value: 2.03e-88 | % identity: 89.1
Query:  MKAGTSNNQKALDSRDMVNRGGVELIRNCDLPPPQKVFKSGMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELR
        MKA TSNN+++LDSRDMV R GVELIRNCDLPPPQKVFKSGMEEK+ELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEAR++FCYRQSLKLLE+R
Subjt:  MKAGTSNNQKALDSRDMVNRGGVELIRNCDLPPPQKVFKSGMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELR

Query:  VFKLQKQEEEE--EEEEEEESEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS
        V KLQKQE EE  EEEEEEE ++NGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS
Subjt:  VFKLQKQEEEE--EEEEEEESEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS

A0A5D3BEN7 Uncharacterized protein | e value: 1.00e-88 | % identity: 89.74
Query:  MKAGTSNNQKALDSRDMVNRGGVELIRNCDLPPPQKVFKSGMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELR
        MKA TSNN+++LDSRDMV R GVELIRNCDLPPPQKVFKSGMEEK+ELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEAR++FCYRQSLKLLELR
Subjt:  MKAGTSNNQKALDSRDMVNRGGVELIRNCDLPPPQKVFKSGMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELR

Query:  VFKLQKQEEEE--EEEEEEESEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS
        V KLQKQE EE  EEEEEEE ++NGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS
Subjt:  VFKLQKQEEEE--EEEEEEESEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS

A0A6J1HY55 uncharacterized protein LOC111469054 | e value: 2.13e-60 | % identity: 66.07
Query:  NQKALDSRDMV----NRGGVELIRNCDLPPPQKVFKS-------------------GMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEA
        N +ALDS DM+     + GV+LIRNCDLPPPQK+F +                   GMEEK+ELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEA
Subjt:  NQKALDSRDMV----NRGGVELIRNCDLPPPQKVFKS-------------------GMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEA

Query:  RMVFCYRQSLKLLELRVFKLQKQEEEEEEEEEEESEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCN
        R++FCYRQSLKL+ELRV KL+K++      EEEE   NG  GG+KWVWALAICLSVVGVG LLGY C+
Subjt:  RMVFCYRQSLKLLELRVFKLQKQEEEEEEEEEEESEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCN

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G01240.1 unknown protein | e value: 1.2e-11 | % identity: 32.8
Query:  IRNCDLPPPQKVFKS----------------------------------GMEE----------------KMELLKALRLSQTRAREAERKAAKLMEERDC
        I+NCDLPPPQK+ KS                                  G  E                K +LL+ALR SQTRAREAER A +   E+D 
Subjt:  IRNCDLPPPQKVFKS----------------------------------GMEE----------------KMELLKALRLSQTRAREAERKAAKLMEERDC

Query:  ISRAFEDEARMVFCYRQSLKLLELRVFKLQKQEEEEEEEEEE-----------ESEKNGGNGGMKWVWALAICLSVVGVGFLLGYT
        +      +A  +  Y+Q LKLLE+    LQ ++EEE+EE+ +           E +K G  G  +++ A A+  S++G G LLG+T
Subjt:  ISRAFEDEARMVFCYRQSLKLLELRVFKLQKQEEEEEEEEEE-----------ESEKNGGNGGMKWVWALAICLSVVGVGFLLGYT

AT1G01240.2 unknown protein | e value: 1.2e-11 | % identity: 32.8
Query:  IRNCDLPPPQKVFKS----------------------------------GMEE----------------KMELLKALRLSQTRAREAERKAAKLMEERDC
        I+NCDLPPPQK+ KS                                  G  E                K +LL+ALR SQTRAREAER A +   E+D 
Subjt:  IRNCDLPPPQKVFKS----------------------------------GMEE----------------KMELLKALRLSQTRAREAERKAAKLMEERDC

Query:  ISRAFEDEARMVFCYRQSLKLLELRVFKLQKQEEEEEEEEEE-----------ESEKNGGNGGMKWVWALAICLSVVGVGFLLGYT
        +      +A  +  Y+Q LKLLE+    LQ ++EEE+EE+ +           E +K G  G  +++ A A+  S++G G LLG+T
Subjt:  ISRAFEDEARMVFCYRQSLKLLELRVFKLQKQEEEEEEEEEE-----------ESEKNGGNGGMKWVWALAICLSVVGVGFLLGYT

AT1G01240.3 unknown protein | e value: 1.2e-11 | % identity: 32.8
Query:  IRNCDLPPPQKVFKS----------------------------------GMEE----------------KMELLKALRLSQTRAREAERKAAKLMEERDC
        I+NCDLPPPQK+ KS                                  G  E                K +LL+ALR SQTRAREAER A +   E+D 
Subjt:  IRNCDLPPPQKVFKS----------------------------------GMEE----------------KMELLKALRLSQTRAREAERKAAKLMEERDC

Query:  ISRAFEDEARMVFCYRQSLKLLELRVFKLQKQEEEEEEEEEE-----------ESEKNGGNGGMKWVWALAICLSVVGVGFLLGYT
        +      +A  +  Y+Q LKLLE+    LQ ++EEE+EE+ +           E +K G  G  +++ A A+  S++G G LLG+T
Subjt:  ISRAFEDEARMVFCYRQSLKLLELRVFKLQKQEEEEEEEEEE-----------ESEKNGGNGGMKWVWALAICLSVVGVGFLLGYT

AT2G46550.1 unknown protein | e value: 2.3e-07 | % identity: 34.92
Query:  KMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELRVFKLQKQEEE--------------------EEEEEEEESEKNGG
        K ELL+ALR SQTRAREAE  A +   E++ + +    +A  +F Y+Q L+LL+L    LQ + +E                      +E  +   K G 
Subjt:  KMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELRVFKLQKQEEE--------------------EEEEEEEESEKNGG

Query:  NGGMKWVWALAICLSVVGVGFLLGYT
          G K+   LA+ +S+VG G LLG+T
Subjt:  NGGMKWVWALAICLSVVGVGFLLGYT

AT2G46550.2 unknown protein | e value: 2.3e-07 | % identity: 34.92
Query:  KMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELRVFKLQKQEEE--------------------EEEEEEEESEKNGG
        K ELL+ALR SQTRAREAE  A +   E++ + +    +A  +F Y+Q L+LL+L    LQ + +E                      +E  +   K G 
Subjt:  KMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELRVFKLQKQEEE--------------------EEEEEEEESEKNGG

Query:  NGGMKWVWALAICLSVVGVGFLLGYT
          G K+   LA+ +S+VG G LLG+T
Subjt:  NGGMKWVWALAICLSVVGVGFLLGYT


Sequences
CDS sequence
ATGAAGGCAGGGACGAGCAACAATCAGAAAGCCCTAGATTCGAGAGACATGGTCAACAGAGGCGGGGTTGAGCTAATTAGGAATTGCGATCTTCCGCCGCCGCAGAAGGT
ATTCAAGAGTGGGATGGAGGAAAAAATGGAGCTGTTGAAGGCTCTGAGACTGTCGCAAACGAGGGCAAGAGAGGCGGAGAGGAAGGCGGCAAAATTGATGGAGGAAAGGG
ATTGTATTAGTAGGGCTTTTGAAGACGAGGCCAGAATGGTGTTCTGTTATCGACAATCCCTCAAATTGCTGGAGCTCAGGGTTTTCAAGTTGCAGAAACAAGAAGAGGAA
GAAGAGGAAGAAGAGGAAGAAGAAAGTGAAAAGAATGGTGGAAATGGAGGAATGAAATGGGTTTGGGCTTTGGCTATTTGTTTGAGTGTTGTTGGAGTTGGCTTTCTTTT
GGGCTATACATGTAATCCTTCATGA
mRNA sequence
GGAAGTTTGTCTGAAGAAGCAAGCAAAGAATTGTTTGCCAATAAATAAGTTCAGTCAATTTCAATCTTTCTGTATCGATCTTTCCCATACCCATCCACAATCCCTTTCGT
TCCAAAATTCGATCGCACTCGACCGGCCACCCGCGAAATCATGAAGGCAGGGACGAGCAACAATCAGAAAGCCCTAGATTCGAGAGACATGGTCAACAGAGGCGGGGTTG
AGCTAATTAGGAATTGCGATCTTCCGCCGCCGCAGAAGGTATTCAAGAGTGGGATGGAGGAAAAAATGGAGCTGTTGAAGGCTCTGAGACTGTCGCAAACGAGGGCAAGA
GAGGCGGAGAGGAAGGCGGCAAAATTGATGGAGGAAAGGGATTGTATTAGTAGGGCTTTTGAAGACGAGGCCAGAATGGTGTTCTGTTATCGACAATCCCTCAAATTGCT
GGAGCTCAGGGTTTTCAAGTTGCAGAAACAAGAAGAGGAAGAAGAGGAAGAAGAGGAAGAAGAAAGTGAAAAGAATGGTGGAAATGGAGGAATGAAATGGGTTTGGGCTT
TGGCTATTTGTTTGAGTGTTGTTGGAGTTGGCTTTCTTTTGGGCTATACATGTAATCCTTCATGA
Protein sequence
MKAGTSNNQKALDSRDMVNRGGVELIRNCDLPPPQKVFKSGMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELRVFKLQKQEEE
EEEEEEEESEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS
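
The relationship between the CDS and protein sequences above can be spot-checked by translation. This is a minimal sketch, assuming the standard nuclear genetic code applies; the names `CODON_TABLE` and `translate` are hypothetical and introduced only for illustration. To keep the example short it translates the first 30 and last 12 nucleotides of the CDS rather than the full sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the CDS: should give the start of the protein sequence.
print(translate("ATGAAGGCAGGGACGAGCAACAATCAGAAA"))
# Last 12 nt of the CDS (in frame), ending in the TGA stop codon.
print(translate("AATCCTTCATGA"))
```

The two excerpts translate to `MKAGTSNNQK` and `NPS`, matching the beginning and end of the protein sequence listed above.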