CuGenDBv2

Tan0018088 (gene) of Snake gourd v1 genome

Gene ID: Tan0018088
Organism: Trichosanthes anguina (Snake gourd v1)
Description: GATA transcription factor 29-like
Genome location: LG07:10857864..10861430
RNA-Seq Expression: Tan0018088
Synteny: Tan0018088
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0009908 - flower development (biological process)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR000679 - Zinc finger, GATA-type
IPR013088 - Zinc finger, NHR/GATA-type
IPR044272 - GATA transcription factor 18/19/20


Homology
GenBank top hits (e value, % identity, alignment)
KAG7010246.1 GATA transcription factor 29, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 5.9e-73 | % identity: 61.71
Query:  MKGHEYDHRRFRHSETTTHDHDQVDLTLRLGFPNG--------------------DAGETLPDIDHNQPLFSTALHNINHHATLQTHQVMV-NNHGMASC
        MK  EYDH RF       H   QVDLTLRLGFPNG                    +AGE  PDIDH+ P FS  LH+INHH  LQ HQVMV NNHG+AS 
Subjt:  MKGHEYDHRRFRHSETTTHDHDQVDLTLRLGFPNG--------------------DAGETLPDIDHNQPLFSTALHNINHHATLQTHQVMV-NNHGMASC

Query:  GGRNGEMMMSEGNYNPHQIVWSKELKTYVLNSNNYEQLMP-----------IGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDDQQEADASS
        G RNGE MMSE NYNPHQIVWSKELKTYVL SNN+ QL P           IGSTAN+       ++PLQ   N NND YTLLNPA R TDD +   +S 
Subjt:  GGRNGEMMMSEGNYNPHQIVWSKELKTYVLNSNNYEQLMP-----------IGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDDQQEADASS

Query:  SARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSH
          RR+G +RRRVS+CNE E+RCTNYNCNT FTPMWRKGPLGPKSLCNACGIRYRKE +N+EAMAAEN+H
Subjt:  SARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSH

XP_022943672.1 GATA zinc finger domain-containing protein 8-like [Cucurbita moschata] | e value: 9.1e-74 | % identity: 61.62
Query:  MKGHEYDHRRFRHSETTTHDHDQVDLTLRLGFPNG----------------------DAGETLPDIDHNQPLFSTALHNINHHATLQTHQVMV-NNHGMA
        MK  EYDH RF       H   QVDLTLRLGFPNG                      +AGE  PDIDH++P FS  LH+INHHA LQ HQVMV NNHG+A
Subjt:  MKGHEYDHRRFRHSETTTHDHDQVDLTLRLGFPNG----------------------DAGETLPDIDHNQPLFSTALHNINHHATLQTHQVMV-NNHGMA

Query:  SCGGRNGEMMMSEGNYNPHQIVWSKELKTYVLNSNNYEQLMP-----------IGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDDQQEADA
        S GGR+GE MMSE NYNPHQIVWSKELKTYVL SNN+ QL P           IGSTAN        ++PLQ   N NND YTLLNPA R TDD +   +
Subjt:  SCGGRNGEMMMSEGNYNPHQIVWSKELKTYVLNSNNYEQLMP-----------IGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDDQQEADA

Query:  SSSARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSH
        S   RR+G +RRRVS+CNE E+RCTNYNCNT FTPMWRKGPLGPKSLCNACGIRYRKE +N+EAMAAEN+H
Subjt:  SSSARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSH

XP_022985849.1 GATA zinc finger domain-containing protein 8-like [Cucurbita maxima] | e value: 1.2e-76 | % identity: 64.21
Query:  MKGHEYDHRRFRHSETTTHDHDQVDLTLRLGFPNG----------------------DAGETLPDIDHNQPLFSTALHNINHHATLQTHQVMV-NNHGMA
        MK HEYDH RF       H   QVDLTLRLGFPNG                      +AGE  PDIDH++PLFS  LH+INHHA LQ HQVMV NNHG+A
Subjt:  MKGHEYDHRRFRHSETTTHDHDQVDLTLRLGFPNG----------------------DAGETLPDIDHNQPLFSTALHNINHHATLQTHQVMV-NNHGMA

Query:  SCGGRNGEMMMSEGNYNPHQIVWSKELKTYVLNSNNYEQLMP-----------IGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDDQQEADA
        S GGRNGE MMSE NYNPHQIVWSKELKTYVL SNN+ QL P           +GSTAN+    ANL  PLQ   N NND YTLLNPA R  DD  EA+A
Subjt:  SCGGRNGEMMMSEGNYNPHQIVWSKELKTYVLNSNNYEQLMP-----------IGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDDQQEADA

Query:  SSSARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSH
        SS +RR+G +RRRVS+CNE E+RCTNYNCNT FTPMWRKGPLGPKSLCNACGIRYRKE +N+EA+AAEN+H
Subjt:  SSSARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSH

XP_023513295.1 putative GATA transcription factor 22 [Cucurbita pepo subsp. pepo] | e value: 8.5e-72 | % identity: 61.25
Query:  MKGHEYDHRRFRHSETTTHDHDQVDLTLRLGFPNG----------------------DAGETLPDIDHNQPLFSTALHNINHHATLQTHQVMV-NNHGMA
        MK  E+D  RF       H   QVDLTLRLGFPNG                      +AGE  PDIDH++P FS  +H+INHHA LQ HQVMV NNHG+A
Subjt:  MKGHEYDHRRFRHSETTTHDHDQVDLTLRLGFPNG----------------------DAGETLPDIDHNQPLFSTALHNINHHATLQTHQVMV-NNHGMA

Query:  SCGGRNGEMMMSEGNYNPHQIVWSKELKTYVLNSNNYEQLMP-----------IGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDDQQEADA
        S GGR+GE MMSE NYNPHQIVWSKELKTYVL SNN+ QL P           IGSTAN+    ANL  PLQ   N NND YTLLNPA R TDD +   +
Subjt:  SCGGRNGEMMMSEGNYNPHQIVWSKELKTYVLNSNNYEQLMP-----------IGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDDQQEADA

Query:  SSSARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSH
        S   RR+G +RRRVS+CNE E+RCTNYNCNT FTPMWRKGPLGPKSLCNACGIRYRKE +N+EA+AAEN+H
Subjt:  SSSARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSH

XP_038902274.1 putative GATA transcription factor 22 [Benincasa hispida] | e value: 8.5e-80 | % identity: 71.08
Query:  MKGHEYD-HRRFRHSETTTHDHDQVDLTLRLGFPNGDAGETLPDID-HNQPLFSTALHNINHHATLQTHQVMVNNHGMASCGGRNGEMMMSEGNYNPHQI
        MK HEYD HRR  HSE      +QVDLTLRLGFP+G A E  PDID HN P FSTA HNINHH  LQTHQVMV+NHG AS GGRNGE MMSE NYNPHQI
Subjt:  MKGHEYD-HRRFRHSETTTHDHDQVDLTLRLGFPNGDAGETLPDID-HNQPLFSTALHNINHHATLQTHQVMVNNHGMASCGGRNGEMMMSEGNYNPHQI

Query:  VWSKELKTYVLNSNNYEQL-----------MPIGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDDQQEADASSSARRKGTRRRRVSSCNEAE
        VWSKELKTYVL SN++ QL            PI +T NN   + NL +P       NNDLYTLLNPASRITDD  EA+ASS  RR+G+RRRRVS+ N+AE
Subjt:  VWSKELKTYVLNSNNYEQL-----------MPIGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDDQQEADASSSARRKGTRRRRVSSCNEAE

Query:  KRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENS
        +RCTNYNCNT FTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENS
Subjt:  KRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L0J0 GATA-type domain-containing protein | e value: 6.4e-49 | % identity: 63.78
Query:  HQVMVNNHGMASCGGRNGEMMMSEGNY-NPHQIVWSKELKTYVLNSN-------------NYEQLMPIGSTANNNMIAANLHIPLQKNNNNNNDLYTLLN
        +QVM NN G+     RN E MMSE NY NPHQIVWSK+LKTYVL SN             N  Q  PI ++  NN    NL +P      +NNDLYTLLN
Subjt:  HQVMVNNHGMASCGGRNGEMMMSEGNY-NPHQIVWSKELKTYVLNSN-------------NYEQLMPIGSTANNNMIAANLHIPLQKNNNNNNDLYTLLN

Query:  PASRITDDQQEADASSSARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSH
        PASR+ DD  EA+ASS+ RRKG+RRRRVS+ N+ E+RCTNYNCNT FTPMWRKGPLGPKSLCNACGIRYRKET+N+EAMAAENS+
Subjt:  PASRITDDQQEADASSSARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSH

A0A5A7TDI3 GATA transcription factor 29-like | e value: 8.6e-70 | % identity: 63.35
Query:  MKGHEYDHRRFRHSETTTHDHDQVDLTLRLGFPNGDAGETLPDIDH---NQPLFSTALHNINHHATLQTHQVMVNNHGMASCGGRNGEMMMSEGNYNPHQ
        MK HEYDH R   SE+      QVDLTLRLGFPNG AGE  PDIDH   N P FST  H       LQTHQ+  NN G+A    RN E MMSE NYNPHQ
Subjt:  MKGHEYDHRRFRHSETTTHDHDQVDLTLRLGFPNGDAGETLPDIDH---NQPLFSTALHNINHHATLQTHQVMVNNHGMASCGGRNGEMMMSEGNYNPHQ

Query:  IVWSKELKTYVLNSNNYEQLMPIG-----------STANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDDQQEADASSSARRKGTRRRRVSSCNEA
        IVWSK+LKTYVL SNNY QL P             +T+  N    NL +P      +NNDLYTLLNPASRI+DD  EA+ASS  RRKG+RRRR S+ N+ 
Subjt:  IVWSKELKTYVLNSNNYEQLMPIG-----------STANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDDQQEADASSSARRKGTRRRRVSSCNEA

Query:  EKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSH
        E+RCTNYNCNT FTPMWRKGPLGPKSLCNACGIRYRKET+N+EAMAAENS+
Subjt:  EKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSH

A0A6J1DIM0 GATA transcription factor 29 | e value: 5.6e-61 | % identity: 59.75
Query:  RRFRHSETTTHDHDQVDLTLRLGFPNGDAGETLPDIDHNQPLFSTALHNINHHATLQTHQVMV--NNHGMASCGGRNGEMMMSEGNYNPHQIVWSKELKT
        RRF  SE+      QVDLTLRLGFPNG+ G      DHN+PLFS  LH++      QTHQV    NN G+ SCGG      MSE NYNP+QI+WSKELKT
Subjt:  RRFRHSETTTHDHDQVDLTLRLGFPNGDAGETLPDIDHNQPLFSTALHNINHHATLQTHQVMV--NNHGMASCGGRNGEMMMSEGNYNPHQIVWSKELKT

Query:  YVLNSNNYEQL---------MPIGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDDQQEADASSSARRKGTRRRRVSSCNEAEKRCTNYNCNT
        YVL SNN   L         + IG+ +  N +A   ++PLQ    N+NDLYTLLNPASR TDD ++ + SS +RRK +R RRVS  +E EKRCTNYNCNT
Subjt:  YVLNSNNYEQL---------MPIGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDDQQEADASSSARRKGTRRRRVSSCNEAEKRCTNYNCNT

Query:  TFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSHS
         FTPMWRKGPLGPKSLCNACGIRYRKE +NREAM AENSH+
Subjt:  TFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSHS

A0A6J1FTP0 GATA zinc finger domain-containing protein 8-like | e value: 4.4e-74 | % identity: 61.62
Query:  MKGHEYDHRRFRHSETTTHDHDQVDLTLRLGFPNG----------------------DAGETLPDIDHNQPLFSTALHNINHHATLQTHQVMV-NNHGMA
        MK  EYDH RF       H   QVDLTLRLGFPNG                      +AGE  PDIDH++P FS  LH+INHHA LQ HQVMV NNHG+A
Subjt:  MKGHEYDHRRFRHSETTTHDHDQVDLTLRLGFPNG----------------------DAGETLPDIDHNQPLFSTALHNINHHATLQTHQVMV-NNHGMA

Query:  SCGGRNGEMMMSEGNYNPHQIVWSKELKTYVLNSNNYEQLMP-----------IGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDDQQEADA
        S GGR+GE MMSE NYNPHQIVWSKELKTYVL SNN+ QL P           IGSTAN        ++PLQ   N NND YTLLNPA R TDD +   +
Subjt:  SCGGRNGEMMMSEGNYNPHQIVWSKELKTYVLNSNNYEQLMP-----------IGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDDQQEADA

Query:  SSSARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSH
        S   RR+G +RRRVS+CNE E+RCTNYNCNT FTPMWRKGPLGPKSLCNACGIRYRKE +N+EAMAAEN+H
Subjt:  SSSARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSH

A0A6J1JEF0 GATA zinc finger domain-containing protein 8-like | e value: 5.6e-77 | % identity: 64.21
Query:  MKGHEYDHRRFRHSETTTHDHDQVDLTLRLGFPNG----------------------DAGETLPDIDHNQPLFSTALHNINHHATLQTHQVMV-NNHGMA
        MK HEYDH RF       H   QVDLTLRLGFPNG                      +AGE  PDIDH++PLFS  LH+INHHA LQ HQVMV NNHG+A
Subjt:  MKGHEYDHRRFRHSETTTHDHDQVDLTLRLGFPNG----------------------DAGETLPDIDHNQPLFSTALHNINHHATLQTHQVMV-NNHGMA

Query:  SCGGRNGEMMMSEGNYNPHQIVWSKELKTYVLNSNNYEQLMP-----------IGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDDQQEADA
        S GGRNGE MMSE NYNPHQIVWSKELKTYVL SNN+ QL P           +GSTAN+    ANL  PLQ   N NND YTLLNPA R  DD  EA+A
Subjt:  SCGGRNGEMMMSEGNYNPHQIVWSKELKTYVLNSNNYEQLMP-----------IGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDDQQEADA

Query:  SSSARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSH
        SS +RR+G +RRRVS+CNE E+RCTNYNCNT FTPMWRKGPLGPKSLCNACGIRYRKE +N+EA+AAEN+H
Subjt:  SSSARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSH

SwissProt top hits (e value, % identity, alignment)
B8AX51 GATA transcription factor 15 | e value: 6.3e-09 | % identity: 68.42
Query:  EKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKE
        ++RC   NC T  TP+WR GP GPKSLCNACGIRY+KE
Subjt:  EKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKE

Q6L5E5 GATA transcription factor 15 | e value: 6.3e-09 | % identity: 68.42
Query:  EKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKE
        ++RC   NC T  TP+WR GP GPKSLCNACGIRY+KE
Subjt:  EKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKE

Q6QPM2 GATA transcription factor 19 | e value: 7.4e-10 | % identity: 60
Query:  NEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSHS
        N   +RC   NC+TT TP+WR GP GPKSLCNACGIR++KE   R A  A NS S
Subjt:  NEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSHS

Q8LC79 GATA transcription factor 18 | e value: 6.3e-09 | % identity: 70.27
Query:  KRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKE
        +RC   NC+TT TP+WR GP GPKSLCNACGIR++KE
Subjt:  KRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKE

Q9LT45 GATA transcription factor 29 | e value: 1.8e-11 | % identity: 43.18
Query:  NDLYTLLN-PASRITDDQQEADASSSARRKGTRRRRVSSCN-------EAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKE
        +D Y L++ PA R    +  +   +++ ++    +R+  C        E  K+CTN NCN   TPMWR+GPLGPKSLCNACGI++RKE
Subjt:  NDLYTLLN-PASRITDDQQEADASSSARRKGTRRRRVSSCN-------EAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKE
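
The % identity figures reported for each hit can be recovered from the alignment midline (the unlabeled row between Query and Subjt). The sketch below is a minimal illustration, assuming the usual BLAST convention that a letter in the midline marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The function name `percent_identity` is hypothetical; the input is the first SwissProt alignment (B8AX51) copied from above.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity over all aligned columns, rounded to two decimals.

    Assumes BLAST-style midline: letters = identical residues,
    '+' = conservative substitution, ' ' = mismatch or gap.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must cover the same columns")
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100 * identical / len(query), 2)


# Query row and midline of the B8AX51 alignment shown above.
query = "EKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKE"
midline = "++RC   NC T  TP+WR GP GPKSLCNACGIRY+KE"

print(percent_identity(query, midline))  # 68.42 (26 identities over 38 columns)
```

The result agrees with the 68.42 listed for that hit. The midline must be copied with its internal spacing intact, since each space stands for one alignment column.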

Arabidopsis top hits (e value, % identity, alignment)
AT3G20750.1 GATA transcription factor 29 | e value: 1.3e-12 | % identity: 43.18
Query:  NDLYTLLN-PASRITDDQQEADASSSARRKGTRRRRVSSCN-------EAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKE
        +D Y L++ PA R    +  +   +++ ++    +R+  C        E  K+CTN NCN   TPMWR+GPLGPKSLCNACGI++RKE
Subjt:  NDLYTLLN-PASRITDDQQEADASSSARRKGTRRRRVSSCN-------EAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKE

AT3G50870.1 GATA type zinc finger transcription factor family protein | e value: 4.4e-10 | % identity: 70.27
Query:  KRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKE
        +RC   NC+TT TP+WR GP GPKSLCNACGIR++KE
Subjt:  KRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKE

AT4G36620.1 GATA transcription factor 19 | e value: 5.3e-11 | % identity: 60
Query:  NEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSHS
        N   +RC   NC+TT TP+WR GP GPKSLCNACGIR++KE   R A  A NS S
Subjt:  NEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKETLNREAMAAENSHS

AT5G49300.1 GATA transcription factor 16 | e value: 4.9e-09 | % identity: 59.09
Query:  SSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKE
        +S N+ +K C   +C T+ TP+WR GP+GPKSLCNACGIR RK+
Subjt:  SSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIRYRKE

AT5G56860.1 GATA type zinc finger transcription factor family protein | e value: 7.6e-10 | % identity: 42.15
Query:  NYEQLMPIGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDDQQEADASSSARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKS
        N +QL  I  T NNN   ++ H PL    N + D +  LN  + +T   ++  A+++  R  T      S N    R  + +CNTT TP+WR GP GPKS
Subjt:  NYEQLMPIGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDDQQEADASSSARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKS

Query:  LCNACGIRYRKETLNREAMAA
        LCNACGIR RK    R AMAA
Subjt:  LCNACGIRYRKETLNREAMAA


Sequences
CDS sequence
ATGAAAGGCCATGAATATGATCATCGTCGGTTTCGTCATTCAGAAACTACTACTCATGATCATGACCAGGTCGATTTGACGCTCCGACTTGGCTTCCCAAATGGCGATGC
CGGTGAAACTTTACCCGATATTGATCATAATCAACCATTATTCTCCACAGCTCTTCATAATATCAACCATCATGCTACTCTTCAAACTCATCAGGTGATGGTGAACAATC
ATGGGATGGCGTCGTGCGGAGGCAGAAATGGTGAGATGATGATGAGTGAAGGAAATTACAACCCTCACCAAATTGTATGGTCAAAAGAGCTGAAAACCTATGTTCTTAAT
TCAAATAATTACGAGCAGCTGATGCCCATTGGCAGCACTGCAAATAACAACATGATAGCAGCCAATCTTCATATTCCTCTTCAAAAGAACAATAATAATAATAATGACCT
CTACACACTCCTCAACCCTGCTTCTAGAATCACCGATGATCAACAAGAAGCCGATGCTTCCTCCAGTGCCCGCCGGAAAGGTACACGGCGGCGCCGAGTTTCCTCCTGCA
ACGAGGCCGAGAAGAGGTGCACCAATTACAATTGCAACACCACTTTTACACCCATGTGGCGTAAAGGCCCTCTTGGTCCCAAGAGCCTTTGCAATGCATGCGGCATAAGG
TACAGAAAGGAAACGTTGAATCGGGAAGCAATGGCGGCAGAAAACAGCCACAGCCACAGCCACAGCTAG
mRNA sequence
ATGAAAGGCCATGAATATGATCATCGTCGGTTTCGTCATTCAGAAACTACTACTCATGATCATGACCAGGTCGATTTGACGCTCCGACTTGGCTTCCCAAATGGCGATGC
CGGTGAAACTTTACCCGATATTGATCATAATCAACCATTATTCTCCACAGCTCTTCATAATATCAACCATCATGCTACTCTTCAAACTCATCAGGTGATGGTGAACAATC
ATGGGATGGCGTCGTGCGGAGGCAGAAATGGTGAGATGATGATGAGTGAAGGAAATTACAACCCTCACCAAATTGTATGGTCAAAAGAGCTGAAAACCTATGTTCTTAAT
TCAAATAATTACGAGCAGCTGATGCCCATTGGCAGCACTGCAAATAACAACATGATAGCAGCCAATCTTCATATTCCTCTTCAAAAGAACAATAATAATAATAATGACCT
CTACACACTCCTCAACCCTGCTTCTAGAATCACCGATGATCAACAAGAAGCCGATGCTTCCTCCAGTGCCCGCCGGAAAGGTACACGGCGGCGCCGAGTTTCCTCCTGCA
ACGAGGCCGAGAAGAGGTGCACCAATTACAATTGCAACACCACTTTTACACCCATGTGGCGTAAAGGCCCTCTTGGTCCCAAGAGCCTTTGCAATGCATGCGGCATAAGG
TACAGAAAGGAAACGTTGAATCGGGAAGCAATGGCGGCAGAAAACAGCCACAGCCACAGCCACAGCTAG
Protein sequence
MKGHEYDHRRFRHSETTTHDHDQVDLTLRLGFPNGDAGETLPDIDHNQPLFSTALHNINHHATLQTHQVMVNNHGMASCGGRNGEMMMSEGNYNPHQIVWSKELKTYVLN
SNNYEQLMPIGSTANNNMIAANLHIPLQKNNNNNNDLYTLLNPASRITDDQQEADASSSARRKGTRRRRVSSCNEAEKRCTNYNCNTTFTPMWRKGPLGPKSLCNACGIR
YRKETLNREAMAAENSHSHSHS
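
The protein sequence is the conceptual translation of the CDS listed above. The following is a minimal sketch of that translation, assuming the standard nuclear genetic code (NCBI translation table 1). The helper name `translate` is illustrative, and only the first 27 and last 18 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated
# with bases in the order T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 27 and last 18 nucleotides of the CDS above (both in frame).
cds_start = "ATGAAAGGCCATGAATATGATCATCGT"
cds_end = "AGCCACAGCCACAGCTAG"

print(translate(cds_start))  # MKGHEYDHR
print(translate(cds_end))    # SHSHS*
```

The two outputs match the start (MKGHEYDHR) and end (SHSHS) of the protein sequence, with the trailing `*` corresponding to the TAG stop codon. To translate the full CDS, join its lines into one string before calling `translate`.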