; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0007649 (gene) of Snake gourd v1 genome

Gene IDTan0007649
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGTP cyclohydrolase II isoform 1
Genome locationLG01:14412943..14413722
RNA-Seq ExpressionTan0007649
SyntenyTan0007649
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600670.1 hypothetical protein SDJN03_05903, partial [Cucurbita argyrosperma subsp. sororia]7.7e-6273.06Show/hide
Query:  MKPSTIAAWPTKPN---PNHKALDST----------GVDLIRNCDLPPPQKVFTAM---AWVGRGRGREEVESGGMEEKLELLKALRLSQTRAREAERKA
        MK STIAAW    N    N +ALDST          GVDLIRNCDLPPPQK+FTA    A  GRGRGREEVE+GGMEEKLELLKALRLSQTRAREAERKA
Subjt:  MKPSTIAAWPTKPN---PNHKALDST----------GVDLIRNCDLPPPQKVFTAM---AWVGRGRGREEVESGGMEEKLELLKALRLSQTRAREAERKA

Query:  AKLMKERDCISRAFEDEARLIFCYKQWLKLMELRVSKLQKGKEEEEEEEEEENCNGGEMKWVWALAICLSVVGVGFLLGYTC-NVDEHPFLIN
        AKLM+ERDCISRAFEDEARLIFCY+Q LKLMELRVSKL+K KEEEE+           +KWVWALAICLSVVGVG LLGYTC NVDE PF++N
Subjt:  AKLMKERDCISRAFEDEARLIFCYKQWLKLMELRVSKLQKGKEEEEEEEEEENCNGGEMKWVWALAICLSVVGVGFLLGYTC-NVDEHPFLIN

KAG7031309.1 hypothetical protein SDJN02_05349, partial [Cucurbita argyrosperma subsp. argyrosperma]7.7e-6273.71Show/hide
Query:  MKPSTIAAWPTKPN---PNHKALDST----------GVDLIRNCDLPPPQKVFTAM---AWVGRGRGREEVESGGMEEKLELLKALRLSQTRAREAERKA
        MK STIAAW    N    N +ALDST          GVDLIRNCDLPPPQK+FTA    A  GRGRGREEVE+GGMEEKLELLKALRLSQTRAREAERKA
Subjt:  MKPSTIAAWPTKPN---PNHKALDST----------GVDLIRNCDLPPPQKVFTAM---AWVGRGRGREEVESGGMEEKLELLKALRLSQTRAREAERKA

Query:  AKLMKERDCISRAFEDEARLIFCYKQWLKLMELRVSKLQKGKEEEEEEEEEENCNGGE-MKWVWALAICLSVVGVGFLLGYTC-NVDEHPFLIN
        AKLM+ERDCISRAFEDEARLIFCY+Q LKLMELRVSKL+K K     EEEE+N NG   +KWVWALAICLSVVGVG LLGYTC N DE PF++N
Subjt:  AKLMKERDCISRAFEDEARLIFCYKQWLKLMELRVSKLQKGKEEEEEEEEEENCNGGE-MKWVWALAICLSVVGVGFLLGYTC-NVDEHPFLIN

XP_022136673.1 uncharacterized protein LOC111008325 [Momordica charantia]1.0e-5066.67Show/hide
Query:  NHKALDST-------------GVDLIRNCDLPPPQKVFTAMAWVGRGRGREEVESGGMEEKLELLKALRLSQTRAREAERKAAKLMKERDCISRAFEDEA
        N KA+DS              GVDLIRNCDLPPPQK+FT MA V + R R+E ESGG+EEKLELLKALRLSQTRAREAERKAAKLM+ERDCISRAFEDEA
Subjt:  NHKALDST-------------GVDLIRNCDLPPPQKVFTAMAWVGRGRGREEVESGGMEEKLELLKALRLSQTRAREAERKAAKLMKERDCISRAFEDEA

Query:  RLIFCYKQWLKLMELRVSKLQKGKEEEEEEEEEEN-------CNGGE-MKWVWALAICLSVVGVGFLLGYTCNVDEHPFL
        RLIF Y+Q +KL++LR+S LQK  +EEE     ++         GGE MKWVWALAIC +VVGVGFL GYTCNVDE P L
Subjt:  RLIFCYKQWLKLMELRVSKLQKGKEEEEEEEEEEN-------CNGGE-MKWVWALAICLSVVGVGFLLGYTCNVDEHPFL

XP_022943244.1 uncharacterized protein LOC111448032 [Cucurbita moschata]1.6e-5972.16Show/hide
Query:  MKPSTIAAWPTKPN---PNHKALDSTG----------VDLIRNCDLPPPQKVFTAM---AWVGRGRGREEVESGGMEEKLELLKALRLSQTRAREAERKA
        MK STIA W    N    N +ALDST           VDLIRNCDLPPPQK+FTA    A   RGRGREEVE+GGMEEKLELLKALRLSQTRAREAERKA
Subjt:  MKPSTIAAWPTKPN---PNHKALDSTG----------VDLIRNCDLPPPQKVFTAM---AWVGRGRGREEVESGGMEEKLELLKALRLSQTRAREAERKA

Query:  AKLMKERDCISRAFEDEARLIFCYKQWLKLMELRVSKLQKGKEEEEEEEEEENCNGGE-MKWVWALAICLSVVGVGFLLGYTC-NVDEHPFLIN
        AKLM+ERDCISRAFEDEARLIFCY+Q LKLMELRVSKL+K K     EEEE+N NG   +KWVWALAICLSVVGVG LLGYTC NV E PF++N
Subjt:  AKLMKERDCISRAFEDEARLIFCYKQWLKLMELRVSKLQKGKEEEEEEEEEENCNGGE-MKWVWALAICLSVVGVGFLLGYTC-NVDEHPFLIN

XP_022970087.1 uncharacterized protein LOC111469054 [Cucurbita maxima]1.2e-6274.09Show/hide
Query:  MKPSTIAAWPTKPN---PNHKALDST----------GVDLIRNCDLPPPQKVFTAM---AWVGRGRGREEVESGGMEEKLELLKALRLSQTRAREAERKA
        MK STIAAW    N    N +ALDST          GVDLIRNCDLPPPQK+FTA    A   RGRGREEVE+GGMEEKLELLKALRLSQTRAREAERKA
Subjt:  MKPSTIAAWPTKPN---PNHKALDST----------GVDLIRNCDLPPPQKVFTAM---AWVGRGRGREEVESGGMEEKLELLKALRLSQTRAREAERKA

Query:  AKLMKERDCISRAFEDEARLIFCYKQWLKLMELRVSKLQKGKEEEEEEEEEENCNGGEMKWVWALAICLSVVGVGFLLGYTC-NVDEHPFLIN
        AKLM+ERDCISRAFEDEARLIFCY+Q LKLMELRVSKL+K K   EEEE+  N NGG +KWVWALAICLSVVGVG LLGY C NVDE PF++N
Subjt:  AKLMKERDCISRAFEDEARLIFCYKQWLKLMELRVSKLQKGKEEEEEEEEEENCNGGEMKWVWALAICLSVVGVGFLLGYTC-NVDEHPFLIN

TrEMBL top hitse value%identityAlignment
A0A1S3C2P7 uncharacterized protein LOC1034957943.1e-4869.94Show/hide
Query:  NHKALDS------TGVDLIRNCDLPPPQKVFTAMAWVGRGRGREEVESGGMEEKLELLKALRLSQTRAREAERKAAKLMKERDCISRAFEDEARLIFCYK
        N ++LDS       GV+LIRNCDLPPPQKVF                  GMEEK+ELLKALRLSQTRAREAERKAAKLM+ERDCISRAFEDEARL+FCY+
Subjt:  NHKALDS------TGVDLIRNCDLPPPQKVFTAMAWVGRGRGREEVESGGMEEKLELLKALRLSQTRAREAERKAAKLMKERDCISRAFEDEARLIFCYK

Query:  QWLKLMELRVSKLQKG-----KEEEEEEEEEENCNGGEMKWVWALAICLSVVGVGFLLGYTCN
        Q LKL+ELRV KLQK      +EEEEEEE +EN   G MKWVWALAICLSVVGVGFLLGYTCN
Subjt:  QWLKLMELRVSKLQKG-----KEEEEEEEEEENCNGGEMKWVWALAICLSVVGVGFLLGYTCN

A0A5D3BEN7 Uncharacterized protein3.1e-4869.94Show/hide
Query:  NHKALDS------TGVDLIRNCDLPPPQKVFTAMAWVGRGRGREEVESGGMEEKLELLKALRLSQTRAREAERKAAKLMKERDCISRAFEDEARLIFCYK
        N ++LDS       GV+LIRNCDLPPPQKVF                  GMEEK+ELLKALRLSQTRAREAERKAAKLM+ERDCISRAFEDEARL+FCY+
Subjt:  NHKALDS------TGVDLIRNCDLPPPQKVFTAMAWVGRGRGREEVESGGMEEKLELLKALRLSQTRAREAERKAAKLMKERDCISRAFEDEARLIFCYK

Query:  QWLKLMELRVSKLQKG-----KEEEEEEEEEENCNGGEMKWVWALAICLSVVGVGFLLGYTCN
        Q LKL+ELRV KLQK      +EEEEEEE +EN   G MKWVWALAICLSVVGVGFLLGYTCN
Subjt:  QWLKLMELRVSKLQKG-----KEEEEEEEEEENCNGGEMKWVWALAICLSVVGVGFLLGYTCN

A0A6J1C873 uncharacterized protein LOC1110083255.0e-5166.67Show/hide
Query:  NHKALDST-------------GVDLIRNCDLPPPQKVFTAMAWVGRGRGREEVESGGMEEKLELLKALRLSQTRAREAERKAAKLMKERDCISRAFEDEA
        N KA+DS              GVDLIRNCDLPPPQK+FT MA V + R R+E ESGG+EEKLELLKALRLSQTRAREAERKAAKLM+ERDCISRAFEDEA
Subjt:  NHKALDST-------------GVDLIRNCDLPPPQKVFTAMAWVGRGRGREEVESGGMEEKLELLKALRLSQTRAREAERKAAKLMKERDCISRAFEDEA

Query:  RLIFCYKQWLKLMELRVSKLQKGKEEEEEEEEEEN-------CNGGE-MKWVWALAICLSVVGVGFLLGYTCNVDEHPFL
        RLIF Y+Q +KL++LR+S LQK  +EEE     ++         GGE MKWVWALAIC +VVGVGFL GYTCNVDE P L
Subjt:  RLIFCYKQWLKLMELRVSKLQKGKEEEEEEEEEEN-------CNGGE-MKWVWALAICLSVVGVGFLLGYTCNVDEHPFL

A0A6J1FTQ9 uncharacterized protein LOC1114480327.8e-6072.16Show/hide
Query:  MKPSTIAAWPTKPN---PNHKALDSTG----------VDLIRNCDLPPPQKVFTAM---AWVGRGRGREEVESGGMEEKLELLKALRLSQTRAREAERKA
        MK STIA W    N    N +ALDST           VDLIRNCDLPPPQK+FTA    A   RGRGREEVE+GGMEEKLELLKALRLSQTRAREAERKA
Subjt:  MKPSTIAAWPTKPN---PNHKALDSTG----------VDLIRNCDLPPPQKVFTAM---AWVGRGRGREEVESGGMEEKLELLKALRLSQTRAREAERKA

Query:  AKLMKERDCISRAFEDEARLIFCYKQWLKLMELRVSKLQKGKEEEEEEEEEENCNGGE-MKWVWALAICLSVVGVGFLLGYTC-NVDEHPFLIN
        AKLM+ERDCISRAFEDEARLIFCY+Q LKLMELRVSKL+K K     EEEE+N NG   +KWVWALAICLSVVGVG LLGYTC NV E PF++N
Subjt:  AKLMKERDCISRAFEDEARLIFCYKQWLKLMELRVSKLQKGKEEEEEEEEEENCNGGE-MKWVWALAICLSVVGVGFLLGYTC-NVDEHPFLIN

A0A6J1HY55 uncharacterized protein LOC1114690545.7e-6374.09Show/hide
Query:  MKPSTIAAWPTKPN---PNHKALDST----------GVDLIRNCDLPPPQKVFTAM---AWVGRGRGREEVESGGMEEKLELLKALRLSQTRAREAERKA
        MK STIAAW    N    N +ALDST          GVDLIRNCDLPPPQK+FTA    A   RGRGREEVE+GGMEEKLELLKALRLSQTRAREAERKA
Subjt:  MKPSTIAAWPTKPN---PNHKALDST----------GVDLIRNCDLPPPQKVFTAM---AWVGRGRGREEVESGGMEEKLELLKALRLSQTRAREAERKA

Query:  AKLMKERDCISRAFEDEARLIFCYKQWLKLMELRVSKLQKGKEEEEEEEEEENCNGGEMKWVWALAICLSVVGVGFLLGYTC-NVDEHPFLIN
        AKLM+ERDCISRAFEDEARLIFCY+Q LKLMELRVSKL+K K   EEEE+  N NGG +KWVWALAICLSVVGVG LLGY C NVDE PF++N
Subjt:  AKLMKERDCISRAFEDEARLIFCYKQWLKLMELRVSKLQKGKEEEEEEEEEENCNGGEMKWVWALAICLSVVGVGFLLGYTC-NVDEHPFLIN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G01240.1 unknown protein1.3e-1433.7Show/hide
Query:  IRNCDLPPPQKVFTAM---------------AW---VGRGRGREEVESGGMEE----------------KLELLKALRLSQTRAREAERKAAKLMKERDC
        I+NCDLPPPQK+  ++                W   V + R    +   G  E                K +LL+ALR SQTRAREAER A +   E+D 
Subjt:  IRNCDLPPPQKVFTAM---------------AW---VGRGRGREEVESGGMEE----------------KLELLKALRLSQTRAREAERKAAKLMKERDC

Query:  ISRAFEDEARLIFCYKQWLKLMELRVSKLQKGKEEEEEEE------------EEENCNGGEMKWVWALAICLSVVGVGFLLGYT
        +      +A  +  YKQWLKL+E+    LQ  KEEE+EE+             E+   G   +++ A A+  S++G G LLG+T
Subjt:  ISRAFEDEARLIFCYKQWLKLMELRVSKLQKGKEEEEEEE------------EEENCNGGEMKWVWALAICLSVVGVGFLLGYT

AT1G01240.2 unknown protein1.3e-1433.7Show/hide
Query:  IRNCDLPPPQKVFTAM---------------AW---VGRGRGREEVESGGMEE----------------KLELLKALRLSQTRAREAERKAAKLMKERDC
        I+NCDLPPPQK+  ++                W   V + R    +   G  E                K +LL+ALR SQTRAREAER A +   E+D 
Subjt:  IRNCDLPPPQKVFTAM---------------AW---VGRGRGREEVESGGMEE----------------KLELLKALRLSQTRAREAERKAAKLMKERDC

Query:  ISRAFEDEARLIFCYKQWLKLMELRVSKLQKGKEEEEEEE------------EEENCNGGEMKWVWALAICLSVVGVGFLLGYT
        +      +A  +  YKQWLKL+E+    LQ  KEEE+EE+             E+   G   +++ A A+  S++G G LLG+T
Subjt:  ISRAFEDEARLIFCYKQWLKLMELRVSKLQKGKEEEEEEE------------EEENCNGGEMKWVWALAICLSVVGVGFLLGYT

AT1G01240.3 unknown protein1.3e-1433.7Show/hide
Query:  IRNCDLPPPQKVFTAM---------------AW---VGRGRGREEVESGGMEE----------------KLELLKALRLSQTRAREAERKAAKLMKERDC
        I+NCDLPPPQK+  ++                W   V + R    +   G  E                K +LL+ALR SQTRAREAER A +   E+D 
Subjt:  IRNCDLPPPQKVFTAM---------------AW---VGRGRGREEVESGGMEE----------------KLELLKALRLSQTRAREAERKAAKLMKERDC

Query:  ISRAFEDEARLIFCYKQWLKLMELRVSKLQKGKEEEEEEE------------EEENCNGGEMKWVWALAICLSVVGVGFLLGYT
        +      +A  +  YKQWLKL+E+    LQ  KEEE+EE+             E+   G   +++ A A+  S++G G LLG+T
Subjt:  ISRAFEDEARLIFCYKQWLKLMELRVSKLQKGKEEEEEEE------------EEENCNGGEMKWVWALAICLSVVGVGFLLGYT

AT2G46550.1 unknown protein3.0e-1131.22Show/hide
Query:  VDLIRNCDLPPPQKVFTAMAWVGRG-------------------------RGREEVESGGMEEKLELLKALRLSQTRAREAERKAAKLMKERDCISRAFE
        +D + NCDLP PQK+  +     RG                         + R E  S     K ELL+ALR SQTRAREAE  A +   E++ + +   
Subjt:  VDLIRNCDLPPPQKVFTAMAWVGRG-------------------------RGREEVESGGMEEKLELLKALRLSQTRAREAERKAAKLMKERDCISRAFE

Query:  DEARLIFCYKQWLKLMELRVSKLQ-KGKEEEEEEEEEEN------CNG----------------GEMKWVWALAICLSVVGVGFLLGYT
         +A  +F YKQWL+L++L    LQ K KE + +  ++         NG                   K+   LA+ +S+VG G LLG+T
Subjt:  DEARLIFCYKQWLKLMELRVSKLQ-KGKEEEEEEEEEEN------CNG----------------GEMKWVWALAICLSVVGVGFLLGYT

AT2G46550.2 unknown protein3.0e-1131.22Show/hide
Query:  VDLIRNCDLPPPQKVFTAMAWVGRG-------------------------RGREEVESGGMEEKLELLKALRLSQTRAREAERKAAKLMKERDCISRAFE
        +D + NCDLP PQK+  +     RG                         + R E  S     K ELL+ALR SQTRAREAE  A +   E++ + +   
Subjt:  VDLIRNCDLPPPQKVFTAMAWVGRG-------------------------RGREEVESGGMEEKLELLKALRLSQTRAREAERKAAKLMKERDCISRAFE

Query:  DEARLIFCYKQWLKLMELRVSKLQ-KGKEEEEEEEEEEN------CNG----------------GEMKWVWALAICLSVVGVGFLLGYT
         +A  +F YKQWL+L++L    LQ K KE + +  ++         NG                   K+   LA+ +S+VG G LLG+T
Subjt:  DEARLIFCYKQWLKLMELRVSKLQ-KGKEEEEEEEEEEN------CNG----------------GEMKWVWALAICLSVVGVGFLLGYT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGCCGTCGACGATCGCCGCCTGGCCGACCAAGCCCAATCCAAATCACAAAGCCCTAGATTCCACCGGCGTTGACCTAATTCGGAATTGCGATCTCCCGCCGCCGCA
GAAGGTATTCACGGCGATGGCGTGGGTGGGTAGAGGTAGAGGTCGGGAAGAGGTGGAATCGGGGGGGATGGAGGAGAAATTGGAGCTGTTGAAGGCTCTGAGACTGTCGC
AAACGAGGGCGAGAGAAGCGGAGAGAAAGGCGGCGAAATTGATGAAGGAGAGGGATTGTATAAGTAGGGCTTTTGAAGATGAGGCCAGATTGATCTTCTGTTACAAACAG
TGGCTGAAATTGATGGAGCTTAGGGTTTCGAAGTTGCAGAAGGGCAAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAAATTGCAATGGCGGAGAAATGAAATGGGTTTG
GGCATTGGCGATTTGTCTGAGTGTTGTGGGAGTGGGCTTTCTCTTGGGCTATACATGTAATGTCGATGAACATCCATTTCTTATCAACAATACCTGA
mRNA sequenceShow/hide mRNA sequence
AAAAAAAAGAGGCCAATTTTGAAAGGAATCAGTGTCTTGAAGCAAAAATAAACCGCAAGTTTATGAGAAGAAGCAAGCAAAGAATTCTTTCACAATAAATACCTTTTTGT
CATTTTCCTGCTCTCTCTAAAAATCCAATCCCCTTCCTTCAGAATTCCGAGATCATGAAGCCGTCGACGATCGCCGCCTGGCCGACCAAGCCCAATCCAAATCACAAAGC
CCTAGATTCCACCGGCGTTGACCTAATTCGGAATTGCGATCTCCCGCCGCCGCAGAAGGTATTCACGGCGATGGCGTGGGTGGGTAGAGGTAGAGGTCGGGAAGAGGTGG
AATCGGGGGGGATGGAGGAGAAATTGGAGCTGTTGAAGGCTCTGAGACTGTCGCAAACGAGGGCGAGAGAAGCGGAGAGAAAGGCGGCGAAATTGATGAAGGAGAGGGAT
TGTATAAGTAGGGCTTTTGAAGATGAGGCCAGATTGATCTTCTGTTACAAACAGTGGCTGAAATTGATGGAGCTTAGGGTTTCGAAGTTGCAGAAGGGCAAAGAAGAAGA
AGAAGAAGAAGAAGAAGAAGAAAATTGCAATGGCGGAGAAATGAAATGGGTTTGGGCATTGGCGATTTGTCTGAGTGTTGTGGGAGTGGGCTTTCTCTTGGGCTATACAT
GTAATGTCGATGAACATCCATTTCTTATCAACAATACCTGACTATATTACCTTTTCCCCCTTTATATATATATTTTTTTTCTTTTTCCAACCATGTATTTATTTATTGCG
AAAATTTCTC
Protein sequenceShow/hide protein sequence
MKPSTIAAWPTKPNPNHKALDSTGVDLIRNCDLPPPQKVFTAMAWVGRGRGREEVESGGMEEKLELLKALRLSQTRAREAERKAAKLMKERDCISRAFEDEARLIFCYKQ
WLKLMELRVSKLQKGKEEEEEEEEEENCNGGEMKWVWALAICLSVVGVGFLLGYTCNVDEHPFLINNT