CuGenDBv2

Tan0020984 (gene) of Snake gourd v1 genome

Gene ID: Tan0020984
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Cysteine dioxygenase
Genome location: LG02:92687051..92689202
RNA-Seq Expression: Tan0020984
Synteny: Tan0020984
Gene Ontology terms: GO:0017172 - cysteine dioxygenase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domains: IPR011051 - RmlC-like cupin domain superfamily
IPR012864 - Cysteine oxygenase/2-aminoethanethiol dioxygenase


Homology
GenBank top hits (columns: e-value, % identity, alignment)
KAE8056931.1 hypothetical protein FH972_013663 [Carpinus fangiana]; e-value 1.1e-32; identity 71.13%
Query:  GLGGKVIDRVWSAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTLPGYSMLEEINQPNDLYVVGAPYLGSSIVTKS
        GL G V+D +  AP EA  LFP SGGNIHSF A++ CAILDVLSPPYSEELGRPSTYFSDFPIP+LPGY++LEE + P+DL V+GAPYLG  IVT++
Subjt:  GLGGKVIDRVWSAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTLPGYSMLEEINQPNDLYVVGAPYLGSSIVTKS

KAF8376816.1 hypothetical protein HHK36_031515 [Tetracentron sinense]; e-value 8.1e-33; identity 71%
Query:  AVGLGGKVIDRVWSAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTLPGYSMLEEINQPNDLYVVGAPYLGSSIVTKSD
        A+GL GKV+D ++ AP EA  LFPRSGGNIHSF A++ CAILDVLSPPYSEE GRPSTYFSD PIP LPGY++LEE + P DL V GAPYLG  IVT  D
Subjt:  AVGLGGKVIDRVWSAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTLPGYSMLEEINQPNDLYVVGAPYLGSSIVTKSD

XP_003544221.1 plant cysteine oxidase 4 [Glycine max]; e-value 4.0e-32; identity 69.7%
Query:  VGLGGKVIDRVWSAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTLPGYSMLEEINQPNDLYVVGAPYLGSSIVTKSD
        +GL G+V+D V+ AP E   LFPRSGGNIHSF A++ CAILDVLSPPYSEE GRPSTYFSD PIP+L GY++LEE   P+DL V GAPYLG SIVT  D
Subjt:  VGLGGKVIDRVWSAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTLPGYSMLEEINQPNDLYVVGAPYLGSSIVTKSD

XP_028198585.1 plant cysteine oxidase 4-like [Glycine soja]; e-value 4.0e-32; identity 69.7%
Query:  VGLGGKVIDRVWSAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTLPGYSMLEEINQPNDLYVVGAPYLGSSIVTKSD
        +GL G+V+D V+ AP E   LFPRSGGNIHSF A++ CAILDVLSPPYSEE GRPSTYFSD PIP+L GY++LEE   P+DL V GAPYLG SIVT  D
Subjt:  VGLGGKVIDRVWSAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTLPGYSMLEEINQPNDLYVVGAPYLGSSIVTKSD

XP_038886772.1 plant cysteine oxidase 4-like [Benincasa hispida]; e-value 1.9e-42; identity 85.15%
Query:  GLGGKVIDRVWSAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTLPGYSMLEEINQPNDLYVVGAPYLGSSIVTKSDET
        GLG KV+D+VWSAPSEARALFP SGGNIHSFRA S CAILDVLSPPYS+ LGRPSTYFSDFP+PTLP  +MLEEI QP DLYVVGAPYLGSSIVTK DET
Subjt:  GLGGKVIDRVWSAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTLPGYSMLEEINQPNDLYVVGAPYLGSSIVTKSDET

Query:  Y
        Y
Subjt:  Y
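
The % identity column can be checked against an alignment by hand. The following is a minimal sketch, assuming the usual BLAST-style convention that a letter in the middle row marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch. It uses the Query row and middle row of the KAE8056931.1 (Carpinus fangiana) hit, which contains no gaps; the function name `percent_identity` is illustrative and is not part of any database tooling.

```python
# Rows copied from the KAE8056931.1 (Carpinus fangiana) alignment.
QUERY_ROW = (
    "GLGGKVIDRVWSAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTLPGYSMLEEINQPNDLYVVGAPYLGSSIVTKS"
)
MIDLINE_ROW = (
    "GL G V+D +  AP EA  LFP SGGNIHSF A++ CAILDVLSPPYSEELGRPSTYFSDFPIP+LPGY++LEE + P+DL V+GAPYLG  IVT++"
)


def percent_identity(query_row: str, midline_row: str) -> float:
    """Return identical columns as a percentage of aligned columns.

    Assumption: letters in the middle row are identities; '+' and spaces
    are not. The alignment length is taken from the query row, which is
    only valid when the alignment has no gap characters.
    """
    identical = sum(1 for ch in midline_row if ch.isalpha())
    aligned_columns = len(query_row)
    return round(100.0 * identical / aligned_columns, 2)


identity = percent_identity(QUERY_ROW, MIDLINE_ROW)
print(identity)  # 69 identical columns out of 97
```

For gapped alignments the denominator would need to count alignment columns including gap characters, so this sketch should not be applied to them unchanged.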

TrEMBL top hits (columns: e-value, % identity, alignment)
A0A151TZZ4 Cysteine dioxygenase; e-value 1.9e-32; identity 69.7%
Query:  VGLGGKVIDRVWSAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTLPGYSMLEEINQPNDLYVVGAPYLGSSIVTKSD
        +GLGGKV+D V  AP E   LFPRSGGNIHSF A++ CAILDVLSPPYSE+ GRPSTY+SD PIP+L GY++LEE   P DL V GAPYLG SIVT  D
Subjt:  VGLGGKVIDRVWSAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTLPGYSMLEEINQPNDLYVVGAPYLGSSIVTKSD

A0A371E730 Cysteine dioxygenase (Fragment); e-value 1.9e-32; identity 65.71%
Query:  VGLGGKVIDRVWSAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTLPGYSMLEEINQPNDLYVVGAPYLGSSIVTKSDE
        +GL G+V+D V  AP E   LFPRSGGNIHSF A++ CAILDVLSPPYSE+ GRPSTY+SD PIP+L GYS+LEE   PNDL V GAPYLG SIVT  D 
Subjt:  VGLGGKVIDRVWSAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTLPGYSMLEEINQPNDLYVVGAPYLGSSIVTKSDE

Query:  TYEKL
         + ++
Subjt:  TYEKL

A0A5N6R8F2 Cysteine dioxygenase; e-value 5.1e-33; identity 71.13%
Query:  GLGGKVIDRVWSAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTLPGYSMLEEINQPNDLYVVGAPYLGSSIVTKS
        GL G V+D +  AP EA  LFP SGGNIHSF A++ CAILDVLSPPYSEELGRPSTYFSDFPIP+LPGY++LEE + P+DL V+GAPYLG  IVT++
Subjt:  GLGGKVIDRVWSAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTLPGYSMLEEINQPNDLYVVGAPYLGSSIVTKS

K7M7P6 Cysteine dioxygenase; e-value 1.9e-32; identity 69.7%
Query:  VGLGGKVIDRVWSAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTLPGYSMLEEINQPNDLYVVGAPYLGSSIVTKSD
        +GL G+V+D V+ AP E   LFPRSGGNIHSF A++ CAILDVLSPPYSEE GRPSTYFSD PIP+L GY++LEE   P+DL V GAPYLG SIVT  D
Subjt:  VGLGGKVIDRVWSAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTLPGYSMLEEINQPNDLYVVGAPYLGSSIVTKSD

K7M7P7 Cysteine dioxygenase; e-value 1.9e-32; identity 69.7%
Query:  VGLGGKVIDRVWSAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTLPGYSMLEEINQPNDLYVVGAPYLGSSIVTKSD
        +GL G+V+D V+ AP E   LFPRSGGNIHSF A++ CAILDVLSPPYSEE GRPSTYFSD PIP+L GY++LEE   P+DL V GAPYLG SIVT  D
Subjt:  VGLGGKVIDRVWSAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTLPGYSMLEEINQPNDLYVVGAPYLGSSIVTKSD

SwissProt top hits (columns: e-value, % identity, alignment)
Q1G3U6 Plant cysteine oxidase 3; e-value 2.1e-15; identity 38.24%
Query:  VIDRVWSAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTL--------------PGYSMLEEINQPNDLYVVGAPYLGS
        V D+V +  SE  AL+P++GGN+H F A++ CA+LD+LSPPY E +GR  +Y+ D+P  T                 Y+ L +I+ P+DL++    Y G 
Subjt:  VIDRVWSAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTL--------------PGYSMLEEINQPNDLYVVGAPYLGS

Query:  SI
        +I
Subjt:  SI

Q8LGJ5 Plant cysteine oxidase 2; e-value 9.6e-13; identity 37%
Query:  IDRVWSAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTL------------PGYSMLEEINQ-PNDLYVVGAPYLGSSI
        +D  ++AP +   L+P  GGN+H F A + CA+LDV+ PPYS+  GR  TY+ D+P  +              GY+ L+E  + P DL V    Y G +I
Subjt:  IDRVWSAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTL------------PGYSMLEEINQ-PNDLYVVGAPYLGSSI

Q9LXG9 Plant cysteine oxidase 1; e-value 1.9e-13; identity 39.81%
Query:  IDRVWSAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTL--------------PGYSMLEE--INQPNDLYVVGAPYLG
        +D  ++AP  A  L+P  GGN+H F AI+ CA+LDVL PPY    GR  TYF +FP+  L               GY+ L+E   N  +   VVGA Y G
Subjt:  IDRVWSAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTL--------------PGYSMLEE--INQPNDLYVVGAPYLG

Query:  SSI
          +
Subjt:  SSI

Q9LXT4 Plant cysteine oxidase 5; e-value 1.5e-13; identity 41.49%
Query:  SAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTLPG------------YSMLEEINQPNDLYVVGAPYLGSSI
        ++PS A  L+P +GGNIH F+AI+ CAI D+LSPPYS   GR   YF   P+  LPG             + LEE   P++  +   PY G  I
Subjt:  SAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTLPG------------YSMLEEINQPNDLYVVGAPYLGSSI

Q9SJI9 Plant cysteine oxidase 4; e-value 6.7e-14; identity 42.57%
Query:  VIDRVWSAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTLPG-----------YSMLEEINQPNDLYVVGAPYLGSSIV
        V D   +A S    L+P+SGGNIH F+AI+ CAILD+L+PPYS E  R  TYF       LPG            + LEE   P+D  +   PY G  I 
Subjt:  VIDRVWSAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTLPG-----------YSMLEEINQPNDLYVVGAPYLGSSIV

Query:  T
        T
Subjt:  T

Arabidopsis top hits (columns: e-value, % identity, alignment)
AT1G18490.1 Protein of unknown function (DUF1637); e-value 1.5e-16; identity 38.24%
Query:  VIDRVWSAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTL--------------PGYSMLEEINQPNDLYVVGAPYLGS
        V D+V +  SE  AL+P++GGN+H F A++ CA+LD+LSPPY E +GR  +Y+ D+P  T                 Y+ L +I+ P+DL++    Y G 
Subjt:  VIDRVWSAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTL--------------PGYSMLEEINQPNDLYVVGAPYLGS

Query:  SI
        +I
Subjt:  SI

AT2G42670.1 Protein of unknown function (DUF1637); e-value 4.7e-15; identity 42.57%
Query:  VIDRVWSAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTLPG-----------YSMLEEINQPNDLYVVGAPYLGSSIV
        V D   +A S    L+P+SGGNIH F+AI+ CAILD+L+PPYS E  R  TYF       LPG            + LEE   P+D  +   PY G  I 
Subjt:  VIDRVWSAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTLPG-----------YSMLEEINQPNDLYVVGAPYLGSSIV

Query:  T
        T
Subjt:  T

AT2G42670.2 Protein of unknown function (DUF1637); e-value 4.7e-15; identity 42.57%
Query:  VIDRVWSAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTLPG-----------YSMLEEINQPNDLYVVGAPYLGSSIV
        V D   +A S    L+P+SGGNIH F+AI+ CAILD+L+PPYS E  R  TYF       LPG            + LEE   P+D  +   PY G  I 
Subjt:  VIDRVWSAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTLPG-----------YSMLEEINQPNDLYVVGAPYLGSSIV

Query:  T
        T
Subjt:  T

AT3G58670.1 Protein of unknown function (DUF1637); e-value 1.1e-14; identity 41.49%
Query:  SAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTLPG------------YSMLEEINQPNDLYVVGAPYLGSSI
        ++PS A  L+P +GGNIH F+AI+ CAI D+LSPPYS   GR   YF   P+  LPG             + LEE   P++  +   PY G  I
Subjt:  SAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTLPG------------YSMLEEINQPNDLYVVGAPYLGSSI

AT3G58670.2 Protein of unknown function (DUF1637); e-value 1.1e-14; identity 41.49%
Query:  SAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTLPG------------YSMLEEINQPNDLYVVGAPYLGSSI
        ++PS A  L+P +GGNIH F+AI+ CAI D+LSPPYS   GR   YF   P+  LPG             + LEE   P++  +   PY G  I
Subjt:  SAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTLPG------------YSMLEEINQPNDLYVVGAPYLGSSI


Sequences
CDS sequence
ATGGTTGCAGTTGGGTTGGGAGGGAAGGTGATTGATAGAGTTTGGAGTGCGCCAAGTGAAGCTAGGGCTTTGTTTCCAAGAAGTGGAGGGAACATTCATTCGTTTAGGGC
AATTTCAGAATGTGCCATTTTGGATGTGTTGTCTCCGCCATATTCTGAAGAGCTTGGAAGACCTTCCACTTACTTCTCCGACTTCCCCATTCCAACTCTTCCTGGTTACT
CTATGCTAGAGGAGATAAATCAGCCCAACGATCTGTATGTAGTAGGAGCACCGTATCTTGGCTCTTCAATAGTTACAAAAAGTGATGAAACTTATGAAAAGTTACCTTCT
TTCATGTAA
mRNA sequence
ATGGTTGCAGTTGGGTTGGGAGGGAAGGTGATTGATAGAGTTTGGAGTGCGCCAAGTGAAGCTAGGGCTTTGTTTCCAAGAAGTGGAGGGAACATTCATTCGTTTAGGGC
AATTTCAGAATGTGCCATTTTGGATGTGTTGTCTCCGCCATATTCTGAAGAGCTTGGAAGACCTTCCACTTACTTCTCCGACTTCCCCATTCCAACTCTTCCTGGTTACT
CTATGCTAGAGGAGATAAATCAGCCCAACGATCTGTATGTAGTAGGAGCACCGTATCTTGGCTCTTCAATAGTTACAAAAAGTGATGAAACTTATGAAAAGTTACCTTCT
TTCATGTAA
Protein sequence
MVAVGLGGKVIDRVWSAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTLPGYSMLEEINQPNDLYVVGAPYLGSSIVTKSDETYEKLPS
FM
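
The CDS and protein records can be cross-checked by translating the CDS and comparing the result with the listed protein. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1) and that translation stops at the first in-frame stop codon. The names `CODON_TABLE` and `translate` are illustrative, and the sequences are copied from this record with line breaks removed.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# CDS of Tan0020984, joined from the wrapped lines of the record.
CDS = (
    "ATGGTTGCAGTTGGGTTGGGAGGGAAGGTGATTGATAGAGTTTGGAGTGCGCCAAGTGAAGCTAGGGCTTTGTTTCCAAGAAGTGGAGGGAACATTCATTCGTTTAGGGC"
    "AATTTCAGAATGTGCCATTTTGGATGTGTTGTCTCCGCCATATTCTGAAGAGCTTGGAAGACCTTCCACTTACTTCTCCGACTTCCCCATTCCAACTCTTCCTGGTTACT"
    "CTATGCTAGAGGAGATAAATCAGCCCAACGATCTGTATGTAGTAGGAGCACCGTATCTTGGCTCTTCAATAGTTACAAAAAGTGATGAAACTTATGAAAAGTTACCTTCT"
    "TTCATGTAA"
)

# Protein sequence of Tan0020984, joined the same way.
PROTEIN = (
    "MVAVGLGGKVIDRVWSAPSEARALFPRSGGNIHSFRAISECAILDVLSPPYSEELGRPSTYFSDFPIPTLPGYSMLEEINQPNDLYVVGAPYLGSSIVTKSDETYEKLPS"
    "FM"
)


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    residues = []
    for start in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[start:start + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


print(len(CDS), len(PROTEIN))     # nucleotides, residues
print(translate(CDS) == PROTEIN)  # True if the two records agree
```

The 339-nucleotide CDS corresponds to 112 codons plus a TAA stop codon, which matches the 112-residue protein.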