CuGenDBv2

CSPI04G08820 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI04G08820
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: GTP cyclohydrolase II isoform 1
Genome location: Chr4:6567954..6568748
RNA-Seq Expression: CSPI04G08820
Synteny: CSPI04G08820
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA


Homology
GenBank top hits    e value    % identity    Alignment
KAA0045323.1 uncharacterized protein E6C27_scaffold316G00950 [Cucumis melo var. makuwa]    1.8e-67    89.1
Query:  MKAGTSNNQKALDSRDMVHRGGVELIRNCDLPPPQKVFKSGMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELR
        MKA TSNN+++LDSRDMV R GVELIRNCDLPPPQKVFKSGMEEK+ELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEAR++FCYRQSLKLLE+R
Subjt:  MKAGTSNNQKALDSRDMVHRGGVELIRNCDLPPPQKVFKSGMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELR

Query:  VFKLQKQ--EEEEEEEEEEESEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS
        V KLQKQ  EE EEEEEEEE ++NGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS
Subjt:  VFKLQKQ--EEEEEEEEEEESEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS

XP_008455680.1 PREDICTED: uncharacterized protein LOC103495794 [Cucumis melo]    1.1e-67    89.74
Query:  MKAGTSNNQKALDSRDMVHRGGVELIRNCDLPPPQKVFKSGMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELR
        MKA TSNN+++LDSRDMV R GVELIRNCDLPPPQKVFKSGMEEK+ELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEAR++FCYRQSLKLLELR
Subjt:  MKAGTSNNQKALDSRDMVHRGGVELIRNCDLPPPQKVFKSGMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELR

Query:  VFKLQKQ--EEEEEEEEEEESEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS
        V KLQKQ  EE EEEEEEEE ++NGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS
Subjt:  VFKLQKQ--EEEEEEEEEEESEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS

XP_022970087.1 uncharacterized protein LOC111469054 [Cucurbita maxima]    2.7e-47    65.68
Query:  NNQKALDSRDMV----HRGGVELIRNCDLPPPQKVFKS-------------------GMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDE
        +N +ALDS DM+    ++ GV+LIRNCDLPPPQK+F +                   GMEEK+ELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDE
Subjt:  NNQKALDSRDMV----HRGGVELIRNCDLPPPQKVFKS-------------------GMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDE

Query:  ARMVFCYRQSLKLLELRVFKLQKQEEEEEEEEEEESEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCN
        AR++FCYRQSLKL+ELRV KL+K++EEEE+        NG  GG+KWVWALAICLSVVGVG LLGY C+
Subjt:  ARMVFCYRQSLKLLELRVFKLQKQEEEEEEEEEEESEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCN

XP_031739981.1 uncharacterized protein LOC105435202 [Cucumis sativus]    4.8e-76    98.7
Query:  MKAGTSNNQKALDSRDMVHRGGVELIRNCDLPPPQKVFKSGMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELR
        MKAGTSNNQKALDSRDMV+RGGVELIRNCDLPPPQKVFKSGMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELR
Subjt:  MKAGTSNNQKALDSRDMVHRGGVELIRNCDLPPPQKVFKSGMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELR

Query:  VFKLQKQEEEEEEEEEEESEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS
        VFKLQKQEEEEEEEEEEES+KNGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS
Subjt:  VFKLQKQEEEEEEEEEEESEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS

XP_038904267.1 uncharacterized protein LOC120090620 [Benincasa hispida]    2.0e-58    81.29
Query:  MKAGTSNNQKALDS-RDMVHRGGVELIRNCDLPPPQKVFKSGMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLEL
        MKA  S N KALDS  +MVHR GV+L+RNCDLPPPQKV +SGMEEK+ELLKALRLSQTRAREAERKAAKLMEE+DCISRAFEDEAR++FCYRQS+KLL+L
Subjt:  MKAGTSNNQKALDS-RDMVHRGGVELIRNCDLPPPQKVFKSGMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLEL

Query:  RVFKLQKQEEEEEEEEEEESEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS
        RV KLQK  EEEEEEEEEE + NGG+ GMKWVWALAICLSVVGVG LL YTCN S
Subjt:  RVFKLQKQEEEEEEEEEEESEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS

TrEMBL top hits    e value    % identity    Alignment
A0A0A0KZ57 Uncharacterized protein    3.2e-70    77.55
Query:  MKAGTSNNQKALDSRDMVHRGGVELIRNCDLPPPQKVFKSGMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELR
        MKAGTSNNQKALDSRDMV+RGGVELIRNCDLPPPQKVFKSGMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELR
Subjt:  MKAGTSNNQKALDSRDMVHRGGVELIRNCDLPPPQKVFKSGMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELR

Query:  VFKLQKQ------------------------------------------EEEEEEEEEEESEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS
        VFKLQKQ                                          EEEEEEEEEEES+KNGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS
Subjt:  VFKLQKQ------------------------------------------EEEEEEEEEEESEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS

A0A1S3C2P7 uncharacterized protein LOC103495794    5.2e-68    89.74
Query:  MKAGTSNNQKALDSRDMVHRGGVELIRNCDLPPPQKVFKSGMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELR
        MKA TSNN+++LDSRDMV R GVELIRNCDLPPPQKVFKSGMEEK+ELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEAR++FCYRQSLKLLELR
Subjt:  MKAGTSNNQKALDSRDMVHRGGVELIRNCDLPPPQKVFKSGMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELR

Query:  VFKLQKQ--EEEEEEEEEEESEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS
        V KLQKQ  EE EEEEEEEE ++NGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS
Subjt:  VFKLQKQ--EEEEEEEEEEESEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS

A0A5A7TP56 Uncharacterized protein    8.8e-68    89.1
Query:  MKAGTSNNQKALDSRDMVHRGGVELIRNCDLPPPQKVFKSGMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELR
        MKA TSNN+++LDSRDMV R GVELIRNCDLPPPQKVFKSGMEEK+ELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEAR++FCYRQSLKLLE+R
Subjt:  MKAGTSNNQKALDSRDMVHRGGVELIRNCDLPPPQKVFKSGMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELR

Query:  VFKLQKQ--EEEEEEEEEEESEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS
        V KLQKQ  EE EEEEEEEE ++NGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS
Subjt:  VFKLQKQ--EEEEEEEEEEESEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS

A0A5D3BEN7 Uncharacterized protein    5.2e-68    89.74
Query:  MKAGTSNNQKALDSRDMVHRGGVELIRNCDLPPPQKVFKSGMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELR
        MKA TSNN+++LDSRDMV R GVELIRNCDLPPPQKVFKSGMEEK+ELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEAR++FCYRQSLKLLELR
Subjt:  MKAGTSNNQKALDSRDMVHRGGVELIRNCDLPPPQKVFKSGMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELR

Query:  VFKLQKQ--EEEEEEEEEEESEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS
        V KLQKQ  EE EEEEEEEE ++NGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS
Subjt:  VFKLQKQ--EEEEEEEEEEESEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS

A0A6J1HY55 uncharacterized protein LOC111469054    1.3e-47    65.68
Query:  NNQKALDSRDMV----HRGGVELIRNCDLPPPQKVFKS-------------------GMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDE
        +N +ALDS DM+    ++ GV+LIRNCDLPPPQK+F +                   GMEEK+ELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDE
Subjt:  NNQKALDSRDMV----HRGGVELIRNCDLPPPQKVFKS-------------------GMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDE

Query:  ARMVFCYRQSLKLLELRVFKLQKQEEEEEEEEEEESEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCN
        AR++FCYRQSLKL+ELRV KL+K++EEEE+        NG  GG+KWVWALAICLSVVGVG LLGY C+
Subjt:  ARMVFCYRQSLKLLELRVFKLQKQEEEEEEEEEEESEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCN

SwissProt top hits    e value    % identity    Alignment
No hits found
Arabidopsis top hits    e value    % identity    Alignment
AT1G01240.1 unknown protein    1.2e-11    32.8
Query:  IRNCDLPPPQKVFKS----------------------------------GMEE----------------KMELLKALRLSQTRAREAERKAAKLMEERDC
        I+NCDLPPPQK+ KS                                  G  E                K +LL+ALR SQTRAREAER A +   E+D 
Subjt:  IRNCDLPPPQKVFKS----------------------------------GMEE----------------KMELLKALRLSQTRAREAERKAAKLMEERDC

Query:  ISRAFEDEARMVFCYRQSLKLLELRVFKLQKQEEEEEEEEEE-----------ESEKNGGNGGMKWVWALAICLSVVGVGFLLGYT
        +      +A  +  Y+Q LKLLE+    LQ ++EEE+EE+ +           E +K G  G  +++ A A+  S++G G LLG+T
Subjt:  ISRAFEDEARMVFCYRQSLKLLELRVFKLQKQEEEEEEEEEE-----------ESEKNGGNGGMKWVWALAICLSVVGVGFLLGYT

AT1G01240.2 unknown protein    1.2e-11    32.8
Query:  IRNCDLPPPQKVFKS----------------------------------GMEE----------------KMELLKALRLSQTRAREAERKAAKLMEERDC
        I+NCDLPPPQK+ KS                                  G  E                K +LL+ALR SQTRAREAER A +   E+D 
Subjt:  IRNCDLPPPQKVFKS----------------------------------GMEE----------------KMELLKALRLSQTRAREAERKAAKLMEERDC

Query:  ISRAFEDEARMVFCYRQSLKLLELRVFKLQKQEEEEEEEEEE-----------ESEKNGGNGGMKWVWALAICLSVVGVGFLLGYT
        +      +A  +  Y+Q LKLLE+    LQ ++EEE+EE+ +           E +K G  G  +++ A A+  S++G G LLG+T
Subjt:  ISRAFEDEARMVFCYRQSLKLLELRVFKLQKQEEEEEEEEEE-----------ESEKNGGNGGMKWVWALAICLSVVGVGFLLGYT

AT1G01240.3 unknown protein    1.2e-11    32.8
Query:  IRNCDLPPPQKVFKS----------------------------------GMEE----------------KMELLKALRLSQTRAREAERKAAKLMEERDC
        I+NCDLPPPQK+ KS                                  G  E                K +LL+ALR SQTRAREAER A +   E+D 
Subjt:  IRNCDLPPPQKVFKS----------------------------------GMEE----------------KMELLKALRLSQTRAREAERKAAKLMEERDC

Query:  ISRAFEDEARMVFCYRQSLKLLELRVFKLQKQEEEEEEEEEE-----------ESEKNGGNGGMKWVWALAICLSVVGVGFLLGYT
        +      +A  +  Y+Q LKLLE+    LQ ++EEE+EE+ +           E +K G  G  +++ A A+  S++G G LLG+T
Subjt:  ISRAFEDEARMVFCYRQSLKLLELRVFKLQKQEEEEEEEEEE-----------ESEKNGGNGGMKWVWALAICLSVVGVGFLLGYT

AT2G46550.1 unknown protein    2.3e-07    34.92
Query:  KMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELRVFKLQKQEEE--------------------EEEEEEEESEKNGG
        K ELL+ALR SQTRAREAE  A +   E++ + +    +A  +F Y+Q L+LL+L    LQ + +E                      +E  +   K G 
Subjt:  KMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELRVFKLQKQEEE--------------------EEEEEEEESEKNGG

Query:  NGGMKWVWALAICLSVVGVGFLLGYT
          G K+   LA+ +S+VG G LLG+T
Subjt:  NGGMKWVWALAICLSVVGVGFLLGYT

AT2G46550.2 unknown protein    2.3e-07    34.92
Query:  KMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELRVFKLQKQEEE--------------------EEEEEEEESEKNGG
        K ELL+ALR SQTRAREAE  A +   E++ + +    +A  +F Y+Q L+LL+L    LQ + +E                      +E  +   K G 
Subjt:  KMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELRVFKLQKQEEE--------------------EEEEEEEESEKNGG

Query:  NGGMKWVWALAICLSVVGVGFLLGYT
          G K+   LA+ +S+VG G LLG+T
Subjt:  NGGMKWVWALAICLSVVGVGFLLGYT
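
The % identity column can be cross-checked against the alignment rows. The sketch below assumes the unlabeled middle row of each block follows the usual BLAST midline convention: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. It also assumes identity is counted over all aligned columns, gap columns included. The function name `percent_identity` is hypothetical and is not part of CuGenDB.

```python
def percent_identity(query_rows, midline_rows):
    """Percent identity over all aligned columns, gap columns included."""
    columns = 0
    identities = 0
    for query, midline in zip(query_rows, midline_rows):
        # Pad the midline in case trailing spaces were stripped on export.
        midline = midline.ljust(len(query))
        columns += len(query)
        # Only letters in the midline count as identities ('+' and ' ' do not).
        identities += sum(ch.isalpha() for ch in midline[:len(query)])
    return 100.0 * identities / columns


# Seven-column excerpt from the XP_031739981.1 alignment: one '+' column.
print(round(percent_identity(["ESEKNGG"], ["ES+KNGG"]), 2))  # 85.71
```

Passing every Query row and midline row of one hit (with the `Query:` prefix and leading padding removed) gives a value to compare with the reported figure. For XP_031739981.1, two `+` columns out of 154 would give about 98.7.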


Sequences
CDS sequence
ATGAAGGCAGGGACGAGCAACAATCAGAAAGCCCTAGATTCGAGAGACATGGTCCACAGAGGCGGGGTTGAGCTAATTAGGAATTGCGATCTTCCGCCGCCGCAGAAGGT
ATTCAAGAGTGGGATGGAGGAAAAAATGGAGCTGTTGAAGGCTCTGAGACTGTCGCAAACGAGGGCAAGAGAGGCGGAGAGGAAGGCGGCAAAATTGATGGAGGAAAGGG
ATTGTATTAGTAGGGCTTTTGAAGACGAGGCCAGAATGGTGTTCTGTTATCGACAATCCCTCAAATTGCTGGAGCTCAGGGTTTTCAAGTTGCAGAAACAAGAAGAGGAA
GAAGAGGAAGAAGAGGAAGAAGAAAGTGAAAAGAATGGTGGAAATGGAGGAATGAAATGGGTTTGGGCTTTGGCTATTTGTTTGAGTGTTGTTGGAGTTGGCTTTCTTTT
GGGCTATACATGTAATCCTTCATGA
mRNA sequence
AAGCCATATTATAATGTGATTAAAGAAAAGTTGTGAGATGCAATATAAAAAAGGAGTAACAATATTGAAAGGAATTTGAATGAGTGCCTTGAAGCAAAAATAAACCGGAA
GTTTGTCTGAAGAAGCAAGCAAAGAATTGTTTGCCAATAAATAAGTTCAGTCAATTTCAATCTTTCTGTATCGATCTTTCCCATACCCATCCACAATCCCTTTCGTTCCA
AAATTCGATCGCACTCGACCGGCCATCCGCGAAATCATGAAGGCAGGGACGAGCAACAATCAGAAAGCCCTAGATTCGAGAGACATGGTCCACAGAGGCGGGGTTGAGCT
AATTAGGAATTGCGATCTTCCGCCGCCGCAGAAGGTATTCAAGAGTGGGATGGAGGAAAAAATGGAGCTGTTGAAGGCTCTGAGACTGTCGCAAACGAGGGCAAGAGAGG
CGGAGAGGAAGGCGGCAAAATTGATGGAGGAAAGGGATTGTATTAGTAGGGCTTTTGAAGACGAGGCCAGAATGGTGTTCTGTTATCGACAATCCCTCAAATTGCTGGAG
CTCAGGGTTTTCAAGTTGCAGAAACAAGAAGAGGAAGAAGAGGAAGAAGAGGAAGAAGAAAGTGAAAAGAATGGTGGAAATGGAGGAATGAAATGGGTTTGGGCTTTGGC
TATTTGTTTGAGTGTTGTTGGAGTTGGCTTTCTTTTGGGCTATACATGTAATCCTTCATGAACCATACATTATTACCTTTTTACCCCCTACATCTTTTACCCAATTTTAC
TCTTTTACTTTCTTAAATAAGAGTA
Protein sequence
MKAGTSNNQKALDSRDMVHRGGVELIRNCDLPPPQKVFKSGMEEKMELLKALRLSQTRAREAERKAAKLMEERDCISRAFEDEARMVFCYRQSLKLLELRVFKLQKQEEE
EEEEEEEESEKNGGNGGMKWVWALAICLSVVGVGFLLGYTCNPS
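
The protein sequence should be the translation of the CDS sequence under the standard genetic code. A minimal sketch of that check follows, using only the Python standard library. The helper name `translate` is hypothetical, and the two test strings are the first six and last four codons of the CDS listed above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAAGGCAGGGACGAGC"))  # MKAGTS, start of the protein
print(translate("AATCCTTCATGA"))        # NPS, end of the protein before TGA
```

Running `translate` on the full CDS, with its lines joined, and comparing the result with the protein sequence is a quick way to confirm the two sequences were copied intact.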