; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0022804 (gene) of Snake gourd v1 genome

Gene IDTan0022804
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionBZIP domain-containing protein
Genome locationLG05:14214402..14215997
RNA-Seq ExpressionTan0022804
SyntenyTan0022804
Gene Ontology termsGO:0009744 - response to sucrose (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0080149 - sucrose induced translational repression (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0046982 - protein heterodimerization activity (molecular function)
InterPro domainsIPR004827 - Basic-leucine zipper domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008460967.1 PREDICTED: bZIP transcription factor 53-like [Cucumis melo]2.1e-5677.71Show/hide
Query:  MASSSGTSSTSSSMEEGELAALMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELAHRLQS
        MASSSGTSSTSSSMEEGELAALMEQRKRKRM+SNRESARRSRMRKQKHLDDLMAMV QLRKDNQQIVANL VT QHYAAVE ENSIL+AQAAEL+HRLQS
Subjt:  MASSSGTSSTSSSMEEGELAALMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELAHRLQS

Query:  LNEIVAFLNPCDGVFEDACDSYGGGEDGGNGGAGGGYFCADPIEVGGGGEIFNPLQTAFYMSQPLMAAADVFEQY
        LNEI+AFLNP DGVF+D  D+YG      NGG G           GGGG  FNPLQ AFYMSQPL A++DVF++Y
Subjt:  LNEIVAFLNPCDGVFEDACDSYGGGEDGGNGGAGGGYFCADPIEVGGGGEIFNPLQTAFYMSQPLMAAADVFEQY

XP_011649241.1 bZIP transcription factor 11 [Cucumis sativus]8.1e-5676.57Show/hide
Query:  MASSSGTSSTSSSMEEGELAALMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELAHRLQS
        MASSSGTSSTSSSMEEGELAALMEQRKRKRM+SNRESARRSRMRKQKHLDDLMAMV QL+KDNQQIVANL VT QHYAAVE ENSIL+AQAAEL+HRLQS
Subjt:  MASSSGTSSTSSSMEEGELAALMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELAHRLQS

Query:  LNEIVAFLNPCDGVFEDACDSYGGGEDGGNGGAGGGYFCADPIEVGGGGEIFNPLQTAFYMSQPLMAAADVFEQY
        LNEI+AFLNP DGVF+D  D+Y     G NGG  G          GGGG  FNPLQ AF+MSQPL+A++DVF++Y
Subjt:  LNEIVAFLNPCDGVFEDACDSYGGGEDGGNGGAGGGYFCADPIEVGGGGEIFNPLQTAFYMSQPLMAAADVFEQY

XP_022947249.1 bZIP transcription factor 11-like [Cucurbita moschata]1.4e-5573.86Show/hide
Query:  MASSSGTSSTSSSMEEGELAALMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELAHRLQS
        MASS  TSST SSMEEGE  ALMEQRKRKRM+SNRESARRSRMRKQKHL+DL+ MV +L ++ QQIVANL +TAQ+YAAVET NSILRAQA ELAHRLQS
Subjt:  MASSSGTSSTSSSMEEGELAALMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELAHRLQS

Query:  LNEIVAFLNPCDGVFEDACDSYGGGEDGGNGGAGGGYFCADPIEVGGGGE-IFNPLQTAFYMSQPLMAAADVFEQY
        L EI+AFLNPCDGV+ DAC+SY G  DGG GG+  G+FCADPI++GGGG+  FNPL+  F  SQP+MA+ADVFE+Y
Subjt:  LNEIVAFLNPCDGVFEDACDSYGGGEDGGNGGAGGGYFCADPIEVGGGGE-IFNPLQTAFYMSQPLMAAADVFEQY

XP_022986630.1 bZIP transcription factor 11-like [Cucurbita maxima]1.1e-5578.44Show/hide
Query:  MASSSGTSSTSSSMEEGELAALMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELAHRLQS
        MASSSGTSSTSSSMEEGELAALMEQRKRKRM+SNRESARRSRMRKQKHLD+LMAMV QLRKDNQQIVANLGVT QHYAAVE ENSILRAQAAEL HRLQS
Subjt:  MASSSGTSSTSSSMEEGELAALMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELAHRLQS

Query:  LNEIVAFLNPCDGVFEDAC-DSYGGGEDGGNGGAGGGYFCADPIEVGGGGEIFNPLQTAFYMSQPLM
        LNEI++FLNP DGVFEDAC DSYG                   +  GGGG  FNPLQ AF+MSQPLM
Subjt:  LNEIVAFLNPCDGVFEDAC-DSYGGGEDGGNGGAGGGYFCADPIEVGGGGEIFNPLQTAFYMSQPLM

XP_038902907.1 bZIP transcription factor 11-like [Benincasa hispida]8.1e-5677.84Show/hide
Query:  MASSSGTSSTSSSMEEGELAALMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELAHRLQS
        MASSSGTSSTSSSMEEGELAALMEQRKRKRM+SNRESARRSRMRKQKHLDDLM MV QLRKDNQQIVANL VT QHYA VE ENSIL+AQAAEL+HRLQS
Subjt:  MASSSGTSSTSSSMEEGELAALMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELAHRLQS

Query:  LNEIVAFLNP-CDGVFEDACDSYGGGEDGGNGGAGGGYFCADPIEVGGGGEIFNPLQTAFYMSQPLMAAADVFEQY
        LNEIVAFLNP  DGVFED  D+YG   DG +              VGGGG  FNPLQ AFYMSQPLMA+ADVF++Y
Subjt:  LNEIVAFLNP-CDGVFEDACDSYGGGEDGGNGGAGGGYFCADPIEVGGGGEIFNPLQTAFYMSQPLMAAADVFEQY

TrEMBL top hitse value%identityAlignment
A0A0A0LIT2 BZIP domain-containing protein3.9e-5676.57Show/hide
Query:  MASSSGTSSTSSSMEEGELAALMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELAHRLQS
        MASSSGTSSTSSSMEEGELAALMEQRKRKRM+SNRESARRSRMRKQKHLDDLMAMV QL+KDNQQIVANL VT QHYAAVE ENSIL+AQAAEL+HRLQS
Subjt:  MASSSGTSSTSSSMEEGELAALMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELAHRLQS

Query:  LNEIVAFLNPCDGVFEDACDSYGGGEDGGNGGAGGGYFCADPIEVGGGGEIFNPLQTAFYMSQPLMAAADVFEQY
        LNEI+AFLNP DGVF+D  D+Y     G NGG  G          GGGG  FNPLQ AF+MSQPL+A++DVF++Y
Subjt:  LNEIVAFLNPCDGVFEDACDSYGGGEDGGNGGAGGGYFCADPIEVGGGGEIFNPLQTAFYMSQPLMAAADVFEQY

A0A1S3CDN8 bZIP transcription factor 53-like1.0e-5677.71Show/hide
Query:  MASSSGTSSTSSSMEEGELAALMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELAHRLQS
        MASSSGTSSTSSSMEEGELAALMEQRKRKRM+SNRESARRSRMRKQKHLDDLMAMV QLRKDNQQIVANL VT QHYAAVE ENSIL+AQAAEL+HRLQS
Subjt:  MASSSGTSSTSSSMEEGELAALMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELAHRLQS

Query:  LNEIVAFLNPCDGVFEDACDSYGGGEDGGNGGAGGGYFCADPIEVGGGGEIFNPLQTAFYMSQPLMAAADVFEQY
        LNEI+AFLNP DGVF+D  D+YG      NGG G           GGGG  FNPLQ AFYMSQPL A++DVF++Y
Subjt:  LNEIVAFLNPCDGVFEDACDSYGGGEDGGNGGAGGGYFCADPIEVGGGGEIFNPLQTAFYMSQPLMAAADVFEQY

A0A5D3DZI5 BZIP transcription factor 53-like1.0e-5677.71Show/hide
Query:  MASSSGTSSTSSSMEEGELAALMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELAHRLQS
        MASSSGTSSTSSSMEEGELAALMEQRKRKRM+SNRESARRSRMRKQKHLDDLMAMV QLRKDNQQIVANL VT QHYAAVE ENSIL+AQAAEL+HRLQS
Subjt:  MASSSGTSSTSSSMEEGELAALMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELAHRLQS

Query:  LNEIVAFLNPCDGVFEDACDSYGGGEDGGNGGAGGGYFCADPIEVGGGGEIFNPLQTAFYMSQPLMAAADVFEQY
        LNEI+AFLNP DGVF+D  D+YG      NGG G           GGGG  FNPLQ AFYMSQPL A++DVF++Y
Subjt:  LNEIVAFLNPCDGVFEDACDSYGGGEDGGNGGAGGGYFCADPIEVGGGGEIFNPLQTAFYMSQPLMAAADVFEQY

A0A6J1G689 bZIP transcription factor 11-like6.7e-5673.86Show/hide
Query:  MASSSGTSSTSSSMEEGELAALMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELAHRLQS
        MASS  TSST SSMEEGE  ALMEQRKRKRM+SNRESARRSRMRKQKHL+DL+ MV +L ++ QQIVANL +TAQ+YAAVET NSILRAQA ELAHRLQS
Subjt:  MASSSGTSSTSSSMEEGELAALMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELAHRLQS

Query:  LNEIVAFLNPCDGVFEDACDSYGGGEDGGNGGAGGGYFCADPIEVGGGGE-IFNPLQTAFYMSQPLMAAADVFEQY
        L EI+AFLNPCDGV+ DAC+SY G  DGG GG+  G+FCADPI++GGGG+  FNPL+  F  SQP+MA+ADVFE+Y
Subjt:  LNEIVAFLNPCDGVFEDACDSYGGGEDGGNGGAGGGYFCADPIEVGGGGE-IFNPLQTAFYMSQPLMAAADVFEQY

A0A6J1JEK0 bZIP transcription factor 11-like5.1e-5678.44Show/hide
Query:  MASSSGTSSTSSSMEEGELAALMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELAHRLQS
        MASSSGTSSTSSSMEEGELAALMEQRKRKRM+SNRESARRSRMRKQKHLD+LMAMV QLRKDNQQIVANLGVT QHYAAVE ENSILRAQAAEL HRLQS
Subjt:  MASSSGTSSTSSSMEEGELAALMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELAHRLQS

Query:  LNEIVAFLNPCDGVFEDAC-DSYGGGEDGGNGGAGGGYFCADPIEVGGGGEIFNPLQTAFYMSQPLM
        LNEI++FLNP DGVFEDAC DSYG                   +  GGGG  FNPLQ AF+MSQPLM
Subjt:  LNEIVAFLNPCDGVFEDAC-DSYGGGEDGGNGGAGGGYFCADPIEVGGGGEIFNPLQTAFYMSQPLM

SwissProt top hitse value%identityAlignment
C0Z2L5 bZIP transcription factor 445.3e-2650Show/hide
Query:  SSSGTSSTSSSMEEGELAA--LMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELAHRLQS
        SS  T+  ++S  E +L    L+++RKRKR  SNRESARRSRMRKQKHLDDL A V  LRK+N QIVA + VT QHY  +E EN ILRAQ  EL HRLQS
Subjt:  SSSGTSSTSSSMEEGELAA--LMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELAHRLQS

Query:  LNEIVAFLNPCDGVFEDACDSYGGGEDGGNGGAGGGYFCADPIEVGGGGEIFNPLQTAFYMSQPLMAAA----DVF
        LNEIV F+            S G G + G G   GG F            + NP+   FY +QP+MA+A    DVF
Subjt:  LNEIVAFLNPCDGVFEDACDSYGGGEDGGNGGAGGGYFCADPIEVGGGGEIFNPLQTAFYMSQPLMAAA----DVF

O65683 bZIP transcription factor 111.7e-2747.65Show/hide
Query:  ASSSGTSSTSSSMEEGELAALMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELAHRLQSL
        +SSSGT+S++     G   +LMEQRKRKRMLSNRESARRSRM+KQK LDDL A V  L+K+N +IV ++ +T QHY  VE ENS+LRAQ  EL HRLQSL
Subjt:  ASSSGTSSTSSSMEEGELAALMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELAHRLQSL

Query:  NEIVAFLNPCDGVFEDACDSYGGGEDGGNGGAGGGYFCADPIEVGGGGEIF-NPLQTAFYMSQPLMAAAD
        N+I+ FL+                 +  N   G    C++P+      + F N +  ++ M+QPLMA++D
Subjt:  NEIVAFLNPCDGVFEDACDSYGGGEDGGNGGAGGGYFCADPIEVGGGGEIF-NPLQTAFYMSQPLMAAAD

P24068 Ocs element-binding factor 15.5e-1547.86Show/hide
Query:  SSSGTSSTS--SSMEEGELAALMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELAHRLQS
        SSS  S T+  +S  +G+ AA    R+ KR LSNRESARRSR+RKQ+HLD+L+  V +L+ DN ++ A     A  Y  VE EN++LRA+AAEL  RL+S
Subjt:  SSSGTSSTS--SSMEEGELAALMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELAHRLQS

Query:  LNEIVAFLNPCDGVFED
        +NE++  +    GV  D
Subjt:  LNEIVAFLNPCDGVFED

Q9LZP8 bZIP transcription factor 531.6e-1444.76Show/hide
Query:  STSSSMEEGELAALMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELAHRLQSLNEIVAFL
        ++  S  +   A + ++RKRKRM+SNRESARRSRMRKQK L DL+  V  L+ DN +I   +   ++ Y  +E++N++LRAQA+EL  RL+SLN ++  +
Subjt:  STSSSMEEGELAALMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELAHRLQSLNEIVAFL

Query:  NPCDG
            G
Subjt:  NPCDG

Q9SI15 bZIP transcription factor 27.4e-2046.82Show/hide
Query:  MASSSGTSSTSSSMEEG-----ELAALMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELA
        MASSS T  +SSS + G     +    +++RKRKRMLSNRESARRSRMRKQKH+DDL A + QL  DN+QI+ +L VT+Q Y  ++ ENS+L AQ  EL+
Subjt:  MASSSGTSSTSSSMEEG-----ELAALMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELA

Query:  HRLQSLNEIVAFLNPCDGVF-EDACDSYGGGEDGGNGGAGGGYFCADPIEVGGGGEIFNPLQTAFYMSQPLMA
         RLQSLNEIV  +      F  D  D  G G D    G  G Y   D + +       N    + Y +QP+MA
Subjt:  HRLQSLNEIVAFLNPCDGVF-EDACDSYGGGEDGGNGGAGGGYFCADPIEVGGGGEIFNPLQTAFYMSQPLMA

Arabidopsis top hitse value%identityAlignment
AT1G75390.1 basic leucine-zipper 443.8e-2750Show/hide
Query:  SSSGTSSTSSSMEEGELAA--LMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELAHRLQS
        SS  T+  ++S  E +L    L+++RKRKR  SNRESARRSRMRKQKHLDDL A V  LRK+N QIVA + VT QHY  +E EN ILRAQ  EL HRLQS
Subjt:  SSSGTSSTSSSMEEGELAA--LMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELAHRLQS

Query:  LNEIVAFLNPCDGVFEDACDSYGGGEDGGNGGAGGGYFCADPIEVGGGGEIFNPLQTAFYMSQPLMAAA----DVF
        LNEIV F+            S G G + G G   GG F            + NP+   FY +QP+MA+A    DVF
Subjt:  LNEIVAFLNPCDGVFEDACDSYGGGEDGGNGGAGGGYFCADPIEVGGGGEIFNPLQTAFYMSQPLMAAA----DVF

AT1G75390.2 basic leucine-zipper 444.2e-1861.11Show/hide
Query:  SSSGTSSTSSSMEEGELAA--LMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQ
        SS  T+  ++S  E +L    L+++RKRKR  SNRESARRSRMRKQKHLDDL A V  LRK+N QIVA + VT QHY  +E EN ILRAQ
Subjt:  SSSGTSSTSSSMEEGELAA--LMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQ

AT2G18160.1 basic leucine-zipper 25.3e-2146.82Show/hide
Query:  MASSSGTSSTSSSMEEG-----ELAALMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELA
        MASSS T  +SSS + G     +    +++RKRKRMLSNRESARRSRMRKQKH+DDL A + QL  DN+QI+ +L VT+Q Y  ++ ENS+L AQ  EL+
Subjt:  MASSSGTSSTSSSMEEG-----ELAALMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELA

Query:  HRLQSLNEIVAFLNPCDGVF-EDACDSYGGGEDGGNGGAGGGYFCADPIEVGGGGEIFNPLQTAFYMSQPLMA
         RLQSLNEIV  +      F  D  D  G G D    G  G Y   D + +       N    + Y +QP+MA
Subjt:  HRLQSLNEIVAFLNPCDGVF-EDACDSYGGGEDGGNGGAGGGYFCADPIEVGGGGEIFNPLQTAFYMSQPLMA

AT3G62420.1 basic region/leucine zipper motif 531.1e-1544.76Show/hide
Query:  STSSSMEEGELAALMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELAHRLQSLNEIVAFL
        ++  S  +   A + ++RKRKRM+SNRESARRSRMRKQK L DL+  V  L+ DN +I   +   ++ Y  +E++N++LRAQA+EL  RL+SLN ++  +
Subjt:  STSSSMEEGELAALMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELAHRLQSLNEIVAFL

Query:  NPCDG
            G
Subjt:  NPCDG

AT4G34590.1 G-box binding factor 61.2e-2847.65Show/hide
Query:  ASSSGTSSTSSSMEEGELAALMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELAHRLQSL
        +SSSGT+S++     G   +LMEQRKRKRMLSNRESARRSRM+KQK LDDL A V  L+K+N +IV ++ +T QHY  VE ENS+LRAQ  EL HRLQSL
Subjt:  ASSSGTSSTSSSMEEGELAALMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELAHRLQSL

Query:  NEIVAFLNPCDGVFEDACDSYGGGEDGGNGGAGGGYFCADPIEVGGGGEIF-NPLQTAFYMSQPLMAAAD
        N+I+ FL+                 +  N   G    C++P+      + F N +  ++ M+QPLMA++D
Subjt:  NEIVAFLNPCDGVFEDACDSYGGGEDGGNGGAGGGYFCADPIEVGGGGEIF-NPLQTAFYMSQPLMAAAD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCGTCTAGTGGCACATCTTCGACTTCCTCGTCAATGGAAGAAGGGGAATTGGCGGCGTTGATGGAGCAGAGGAAGAGGAAGAGGATGCTTTCGAATCGGGAATC
GGCGAGGAGATCGAGGATGAGGAAGCAGAAGCATTTGGACGATTTAATGGCGATGGTGGGGCAGTTGAGGAAGGACAATCAACAAATTGTTGCGAATCTCGGCGTTACGG
CGCAGCACTACGCCGCCGTGGAGACTGAGAATTCGATTCTCAGAGCTCAGGCGGCGGAGCTGGCCCACCGGTTGCAGTCCTTGAATGAGATCGTTGCTTTCTTGAACCCC
TGTGATGGGGTTTTTGAAGATGCCTGTGATTCCTACGGCGGTGGCGAAGATGGCGGCAACGGTGGTGCTGGTGGTGGGTACTTTTGTGCTGACCCGATTGAAGTCGGCGG
CGGCGGGGAGATTTTTAATCCTCTGCAAACGGCTTTTTATATGAGCCAGCCTCTAATGGCTGCTGCCGATGTGTTTGAACAGTACTGA
mRNA sequenceShow/hide mRNA sequence
TTTTCCTATAAATTATCCCCCAATCCCTTCTAAGCCTCTTCCCCCTTCCCTTCTTCACCAATCACCATTACTCAAATAACTCTGAGTTCAAGCTTTCTTCTTCTTCTCCA
TTTTTCCCATCTGGGTTTTCTTCTTTCTCACGAAGTTGAGATTTTTCTCTTCTGGGTTTTCATAATTTTGGCCACCGAGCTTCATCTTGATGTTCATCACTCTGTTCTTG
ATGTTTGAAATTCATGCATCTAATTCTCAGTGAACAAACCCTCCGTTCTGGTTGTATGATAATAAACTCCACATTCCGCCGCCATCACCTCGTTCAATCCTTCTCTGTTG
TTTTTCTCTACTTTCTCTACTACGTTTCATGATATAACATCAAAAAACACGAAATCCCAAAATCCAAATTACAAAAATATAACTAGTCCCTATTAGTAATAATATTGCTC
TGTTTTACTCTGTTTTTGTTGCTTTTTTTTTTCTTTTTCTTTTTCTTTTGTTGGGTCTTTTTTAATGGCTTCGTCTAGTGGCACATCTTCGACTTCCTCGTCAATGGAAG
AAGGGGAATTGGCGGCGTTGATGGAGCAGAGGAAGAGGAAGAGGATGCTTTCGAATCGGGAATCGGCGAGGAGATCGAGGATGAGGAAGCAGAAGCATTTGGACGATTTA
ATGGCGATGGTGGGGCAGTTGAGGAAGGACAATCAACAAATTGTTGCGAATCTCGGCGTTACGGCGCAGCACTACGCCGCCGTGGAGACTGAGAATTCGATTCTCAGAGC
TCAGGCGGCGGAGCTGGCCCACCGGTTGCAGTCCTTGAATGAGATCGTTGCTTTCTTGAACCCCTGTGATGGGGTTTTTGAAGATGCCTGTGATTCCTACGGCGGTGGCG
AAGATGGCGGCAACGGTGGTGCTGGTGGTGGGTACTTTTGTGCTGACCCGATTGAAGTCGGCGGCGGCGGGGAGATTTTTAATCCTCTGCAAACGGCTTTTTATATGAGC
CAGCCTCTAATGGCTGCTGCCGATGTGTTTGAACAGTACTGATTTGGGATTTATCTTTCTCTCTCTTTCTATCCTACAAGTTATGAATTTTCTTGGCATCCAAACAAGGA
GAGAGAAAGAGAATGGTAGTGTTGTGTGTCTAGAGTTGGAAGTGTCTTGATTGATTATGATTTATGACCAAGAATAAAAGGGATTTAAGGTGTTATCTTTTGTAAAACCT
ATGGACTAAGAGAAATGGAAGATGTGAATTTTGGGGGCTTTACCCTTTTTAAGGTAATTTTACCTTATTAGTCGACTCCAACCTTACTTTACTATTGAGACATTAGTGAC
TTTATCTCTACATTGAGATTTTTCATTTGAGTTCAACAACCTATAGCGATAGAGATTCGAACTTTTGACCTTTAGCTTGGTTATATAATGCCTAAACTAATTGAGTGTTT
ACTAAAATGTAATTAATGTATATGGGTGTAATAGGATTATTGTGGGGTAATGGTAATAGAATTGTGGGGTAAAAATAATGACATGTAATGGAGAGAATTGTATTTGATAT
TGTCTACTAAAGGTAATCGCAATTCAAACTTCCCAAATAGGTATGATTACTATTAC
Protein sequenceShow/hide protein sequence
MASSSGTSSTSSSMEEGELAALMEQRKRKRMLSNRESARRSRMRKQKHLDDLMAMVGQLRKDNQQIVANLGVTAQHYAAVETENSILRAQAAELAHRLQSLNEIVAFLNP
CDGVFEDACDSYGGGEDGGNGGAGGGYFCADPIEVGGGGEIFNPLQTAFYMSQPLMAAADVFEQY