CuGenDBv2

CSPI03G00230 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI03G00230
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: C2H2-type domain-containing protein
Genome location: Chr3:197978..198706
RNA-Seq Expression: CSPI03G00230
Synteny: CSPI03G00230
Gene Ontology terms: NA
InterPro domains: IPR013087 - Zinc finger C2H2-type
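
The genome location above can be turned into a feature length with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for records of this kind, but not stated on this page); the function names `parse_location` and `span_length` are hypothetical helpers introduced only for illustration.

```python
import re


def parse_location(loc):
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(start, end):
    """Length of a 1-based, inclusive interval (assumed convention)."""
    return end - start + 1


chrom, start, end = parse_location("Chr3:197978..198706")
length = span_length(start, end)
print(chrom, start, end, length)
```

Under that assumption the span is 729 bp, which is consistent with a 242-residue protein plus one stop codon (243 × 3 = 729).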


Homology
GenBank top hits: hit, e-value, % identity, alignment
XP_008449156.1 PREDICTED: uncharacterized protein LOC103491103 [Cucumis melo], e-value 3.8e-120, identity 95.04%
Query:  MEKNTNTNANTNININIDAVSETSPDQRHRRERSPMAASPPPPARIIDDINNPPTDVIEPSSSIVVAAPESVTPGDVRRRQSVNAMFDVGTSSHEHVGGS
        MEKNTNTNANTNININIDAVSETSPDQRH  ERSPMAASP PPAR IDDINNPPT VIEPSSS +VAAPES TPGDVRRRQSV+AMFDVGTSSHEHVGGS
Subjt:  MEKNTNTNANTNININIDAVSETSPDQRHRRERSPMAASPPPPARIIDDINNPPTDVIEPSSSIVVAAPESVTPGDVRRRQSVNAMFDVGTSSHEHVGGS

Query:  SDVEAGKKRGRGDGGEQQQQVKAAKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIRRCQQQLASTLLTVAQQVSAS
        SD EAGKKRGRGDGGEQQQQVKAAKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIRRCQQQLASTLLTVAQQV+A+
Subjt:  SDVEAGKKRGRGDGGEQQQQVKAAKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIRRCQQQLASTLLTVAQQVSAS

Query:  RRGLDIDLNQPSTADDGDSPDNTRDAGFDLNLEPPPESDDEK
        RRGLDIDLNQPSTADDGDSPDNTRDAGFDLNLEPPPESDDEK
Subjt:  RRGLDIDLNQPSTADDGDSPDNTRDAGFDLNLEPPPESDDEK

XP_011652703.1 uncharacterized protein LOC105435052 [Cucumis sativus], e-value 6.5e-128, identity 100%
Query:  MEKNTNTNANTNININIDAVSETSPDQRHRRERSPMAASPPPPARIIDDINNPPTDVIEPSSSIVVAAPESVTPGDVRRRQSVNAMFDVGTSSHEHVGGS
        MEKNTNTNANTNININIDAVSETSPDQRHRRERSPMAASPPPPARIIDDINNPPTDVIEPSSSIVVAAPESVTPGDVRRRQSVNAMFDVGTSSHEHVGGS
Subjt:  MEKNTNTNANTNININIDAVSETSPDQRHRRERSPMAASPPPPARIIDDINNPPTDVIEPSSSIVVAAPESVTPGDVRRRQSVNAMFDVGTSSHEHVGGS

Query:  SDVEAGKKRGRGDGGEQQQQVKAAKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIRRCQQQLASTLLTVAQQVSAS
        SDVEAGKKRGRGDGGEQQQQVKAAKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIRRCQQQLASTLLTVAQQVSAS
Subjt:  SDVEAGKKRGRGDGGEQQQQVKAAKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIRRCQQQLASTLLTVAQQVSAS

Query:  RRGLDIDLNQPSTADDGDSPDNTRDAGFDLNLEPPPESDDEK
        RRGLDIDLNQPSTADDGDSPDNTRDAGFDLNLEPPPESDDEK
Subjt:  RRGLDIDLNQPSTADDGDSPDNTRDAGFDLNLEPPPESDDEK

XP_022923503.1 uncharacterized protein LOC111431177 [Cucurbita moschata], e-value 1.6e-78, identity 70.2%
Query:  MEKNTNTNANTNININID---AVSETSPDQRHRRERSPMAASPPPPARIIDDINNPPTDVIEPSSSIVVAAPESVTPGDVRRRQSVNAMFDVGTSSHEHV
        MEKNTN NAN NIN++ D       TSPDQRH RE SPMA SPPP +  IDD NNP   VI  SS  V  APE  T GDV    S         S+H HV
Subjt:  MEKNTNTNANTNININID---AVSETSPDQRHRRERSPMAASPPPPARIIDDINNPPTDVIEPSSSIVVAAPESVTPGDVRRRQSVNAMFDVGTSSHEHV

Query:  GGSSDVEAGKKRGRGDGGEQQQQVKAAKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIRRCQQQLASTLLTVAQQV
        GG S+ E GKKRGRGDGGEQQQQVKAAKKKGELTEVPKG+P+CATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIR CQQQ ASTLLTVAQ V
Subjt:  GGSSDVEAGKKRGRGDGGEQQQQVKAAKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIRRCQQQLASTLLTVAQQV

Query:  SASRRGLDIDLNQPSTADDGDSPDNTRDAGFDLNLEPPPESDDEK
        +ASRRGLDIDLNQPS A++ +SP+ +   GFDLN+E PPESD+++
Subjt:  SASRRGLDIDLNQPSTADDGDSPDNTRDAGFDLNLEPPPESDDEK

XP_023552101.1 uncharacterized protein LOC111809870 [Cucurbita pepo subsp. pepo], e-value 3.6e-78, identity 69.8%
Query:  MEKNTNTNANTNININID---AVSETSPDQRHRRERSPMAASPPPPARIIDDINNPPTDVIEPSSSIVVAAPESVTPGDVRRRQSVNAMFDVGTSSHEHV
        MEKNTN NAN NIN++ D       TSPDQRH RE SPMA SPPP +  IDD N+P   VI  SS  V  APE  T GDV    S        +S+H HV
Subjt:  MEKNTNTNANTNININID---AVSETSPDQRHRRERSPMAASPPPPARIIDDINNPPTDVIEPSSSIVVAAPESVTPGDVRRRQSVNAMFDVGTSSHEHV

Query:  GGSSDVEAGKKRGRGDGGEQQQQVKAAKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIRRCQQQLASTLLTVAQQV
        GG S+ E GKKRGRGDGGEQQQQVKAAKKKGELTEVPKG+P+CATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIR CQQQ ASTLLTVAQ V
Subjt:  GGSSDVEAGKKRGRGDGGEQQQQVKAAKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIRRCQQQLASTLLTVAQQV

Query:  SASRRGLDIDLNQPSTADDGDSPDNTRDAGFDLNLEPPPESDDEK
        +ASRRGLDIDLNQPS A++ +SP+ +   GFDLN+E PPESD+++
Subjt:  SASRRGLDIDLNQPSTADDGDSPDNTRDAGFDLNLEPPPESDDEK

XP_038876670.1 uncharacterized protein LOC120069065 [Benincasa hispida], e-value 3.8e-88, identity 77.55%
Query:  MEKNTNTNANTNININIDAVSETSPDQRHRRERSPM-AASPPPPARIIDDINNP-PTDVIEPSSSIVVAAPESVTPGDVRRRQSVNA-MFDVGTSSHEHV
        MEKNTNTNAN N N N   VSETSPDQ H  ER+P+ A SP PPAR  D++NNP PT VIE SSSIVVAAPES T GD RRRQ V+A + DV TSSHE++
Subjt:  MEKNTNTNANTNININIDAVSETSPDQRHRRERSPM-AASPPPPARIIDDINNP-PTDVIEPSSSIVVAAPESVTPGDVRRRQSVNA-MFDVGTSSHEHV

Query:  GGSSDVEAGKKRGRGDGGEQQQQVKAAKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIRRCQQQLASTLLTVAQQV
        GGSS++E GKKRG+GDG EQQQQV+ AKKKGELTEVPKGEP+CATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDI    QQLASTLLTVAQQV
Subjt:  GGSSDVEAGKKRGRGDGGEQQQQVKAAKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIRRCQQQLASTLLTVAQQV

Query:  SASRRGLDIDLNQPSTADDGDSPDNTRDAGFDLNLEPPPESDDEK
        +ASRRGLDIDLNQPSTADDGD P+ T   GFDLN++PP +SDDEK
Subjt:  SASRRGLDIDLNQPSTADDGDSPDNTRDAGFDLNLEPPPESDDEK

TrEMBL top hits: hit, e-value, % identity, alignment
A0A0A0L0X7 C2H2-type domain-containing protein, e-value 3.1e-128, identity 100%
Query:  MEKNTNTNANTNININIDAVSETSPDQRHRRERSPMAASPPPPARIIDDINNPPTDVIEPSSSIVVAAPESVTPGDVRRRQSVNAMFDVGTSSHEHVGGS
        MEKNTNTNANTNININIDAVSETSPDQRHRRERSPMAASPPPPARIIDDINNPPTDVIEPSSSIVVAAPESVTPGDVRRRQSVNAMFDVGTSSHEHVGGS
Subjt:  MEKNTNTNANTNININIDAVSETSPDQRHRRERSPMAASPPPPARIIDDINNPPTDVIEPSSSIVVAAPESVTPGDVRRRQSVNAMFDVGTSSHEHVGGS

Query:  SDVEAGKKRGRGDGGEQQQQVKAAKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIRRCQQQLASTLLTVAQQVSAS
        SDVEAGKKRGRGDGGEQQQQVKAAKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIRRCQQQLASTLLTVAQQVSAS
Subjt:  SDVEAGKKRGRGDGGEQQQQVKAAKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIRRCQQQLASTLLTVAQQVSAS

Query:  RRGLDIDLNQPSTADDGDSPDNTRDAGFDLNLEPPPESDDEK
        RRGLDIDLNQPSTADDGDSPDNTRDAGFDLNLEPPPESDDEK
Subjt:  RRGLDIDLNQPSTADDGDSPDNTRDAGFDLNLEPPPESDDEK

A0A1S3BLF2 uncharacterized protein LOC103491103, e-value 1.8e-120, identity 95.04%
Query:  MEKNTNTNANTNININIDAVSETSPDQRHRRERSPMAASPPPPARIIDDINNPPTDVIEPSSSIVVAAPESVTPGDVRRRQSVNAMFDVGTSSHEHVGGS
        MEKNTNTNANTNININIDAVSETSPDQRH  ERSPMAASP PPAR IDDINNPPT VIEPSSS +VAAPES TPGDVRRRQSV+AMFDVGTSSHEHVGGS
Subjt:  MEKNTNTNANTNININIDAVSETSPDQRHRRERSPMAASPPPPARIIDDINNPPTDVIEPSSSIVVAAPESVTPGDVRRRQSVNAMFDVGTSSHEHVGGS

Query:  SDVEAGKKRGRGDGGEQQQQVKAAKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIRRCQQQLASTLLTVAQQVSAS
        SD EAGKKRGRGDGGEQQQQVKAAKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIRRCQQQLASTLLTVAQQV+A+
Subjt:  SDVEAGKKRGRGDGGEQQQQVKAAKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIRRCQQQLASTLLTVAQQVSAS

Query:  RRGLDIDLNQPSTADDGDSPDNTRDAGFDLNLEPPPESDDEK
        RRGLDIDLNQPSTADDGDSPDNTRDAGFDLNLEPPPESDDEK
Subjt:  RRGLDIDLNQPSTADDGDSPDNTRDAGFDLNLEPPPESDDEK

A0A5D3CLS0 Zinc finger family protein, e-value 1.8e-120, identity 95.04%
Query:  MEKNTNTNANTNININIDAVSETSPDQRHRRERSPMAASPPPPARIIDDINNPPTDVIEPSSSIVVAAPESVTPGDVRRRQSVNAMFDVGTSSHEHVGGS
        MEKNTNTNANTNININIDAVSETSPDQRH  ERSPMAASP PPAR IDDINNPPT VIEPSSS +VAAPES TPGDVRRRQSV+AMFDVGTSSHEHVGGS
Subjt:  MEKNTNTNANTNININIDAVSETSPDQRHRRERSPMAASPPPPARIIDDINNPPTDVIEPSSSIVVAAPESVTPGDVRRRQSVNAMFDVGTSSHEHVGGS

Query:  SDVEAGKKRGRGDGGEQQQQVKAAKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIRRCQQQLASTLLTVAQQVSAS
        SD EAGKKRGRGDGGEQQQQVKAAKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIRRCQQQLASTLLTVAQQV+A+
Subjt:  SDVEAGKKRGRGDGGEQQQQVKAAKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIRRCQQQLASTLLTVAQQVSAS

Query:  RRGLDIDLNQPSTADDGDSPDNTRDAGFDLNLEPPPESDDEK
        RRGLDIDLNQPSTADDGDSPDNTRDAGFDLNLEPPPESDDEK
Subjt:  RRGLDIDLNQPSTADDGDSPDNTRDAGFDLNLEPPPESDDEK

A0A6J1E9T9 uncharacterized protein LOC111431177, e-value 7.8e-79, identity 70.2%
Query:  MEKNTNTNANTNININID---AVSETSPDQRHRRERSPMAASPPPPARIIDDINNPPTDVIEPSSSIVVAAPESVTPGDVRRRQSVNAMFDVGTSSHEHV
        MEKNTN NAN NIN++ D       TSPDQRH RE SPMA SPPP +  IDD NNP   VI  SS  V  APE  T GDV    S         S+H HV
Subjt:  MEKNTNTNANTNININID---AVSETSPDQRHRRERSPMAASPPPPARIIDDINNPPTDVIEPSSSIVVAAPESVTPGDVRRRQSVNAMFDVGTSSHEHV

Query:  GGSSDVEAGKKRGRGDGGEQQQQVKAAKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIRRCQQQLASTLLTVAQQV
        GG S+ E GKKRGRGDGGEQQQQVKAAKKKGELTEVPKG+P+CATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIR CQQQ ASTLLTVAQ V
Subjt:  GGSSDVEAGKKRGRGDGGEQQQQVKAAKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIRRCQQQLASTLLTVAQQV

Query:  SASRRGLDIDLNQPSTADDGDSPDNTRDAGFDLNLEPPPESDDEK
        +ASRRGLDIDLNQPS A++ +SP+ +   GFDLN+E PPESD+++
Subjt:  SASRRGLDIDLNQPSTADDGDSPDNTRDAGFDLNLEPPPESDDEK

A0A6J1L5G4 uncharacterized protein LOC111500193, e-value 8.9e-75, identity 69.42%
Query:  MEKNTNTNANTNININIDAVSETSPDQRHRRERSPMAASPPPPARIIDDINNPPTDVIEPSSSIVVAAPESVTPGDVRRRQSVNAMFDVGTSSHEHVGGS
        MEKNTN NA+ N          TSPDQRH  ERSPMA SPPP +  IDD NNP   VI  SS  V  APE  T GDV    S         SSH HVGG 
Subjt:  MEKNTNTNANTNININIDAVSETSPDQRHRRERSPMAASPPPPARIIDDINNPPTDVIEPSSSIVVAAPESVTPGDVRRRQSVNAMFDVGTSSHEHVGGS

Query:  SDVEAGKKRGRGDGGEQQQQVKAAKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIRRCQQQLASTLLTVAQQVSAS
        S+ E GKKRGRGDGGEQQQQVKAAKKKGELTEVPKG+P+CATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIR CQQQ ASTLLTVAQ V+AS
Subjt:  SDVEAGKKRGRGDGGEQQQQVKAAKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIRRCQQQLASTLLTVAQQVSAS

Query:  RRGLDIDLNQPSTADDGDSPDNTRDAGFDLNLEPPPESDDEK
        RRGLDIDLNQPS A++  SP+ +   GFDLN+E PPES++++
Subjt:  RRGLDIDLNQPSTADDGDSPDNTRDAGFDLNLEPPPESDDEK

SwissProt top hits: hit, e-value, % identity, alignment
O65499 Zinc finger protein ZAT3, e-value 1.1e-05, identity 40%
Query:  VGGSSDVEAGKKRGRGDGGEQQQQVKAAKK----KGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPP
        +  S D    KKR +          K+A K    K      PK    C  C + F SWKALFGH+R HPER +RG  PPP
Subjt:  VGGSSDVEAGKKRGRGDGGEQQQQVKAAKK----KGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPP

Q9SIJ0 Zinc finger protein ZAT2, e-value 9.4e-05, identity 62.5%
Query:  CATCNKVFKSWKALFGHLRSHPERTYRGALPP
        C  C K F S KALFGH+R HPER +RG  PP
Subjt:  CATCNKVFKSWKALFGHLRSHPERTYRGALPP
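
The % identity column can be reproduced from the middle line of an alignment block. The sketch below assumes the usual BLAST-style convention, in which a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch, and it assumes identity is reported over the full alignment length. The function name `percent_identity` is a hypothetical helper, not part of any tool named on this page.

```python
def percent_identity(query, midline):
    """Percent of alignment columns whose midline character is a letter.

    Assumes BLAST-style midlines: letters are identities, '+' and ' '
    are non-identical columns. The denominator is the alignment length.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)


# Query and middle line of the Q9SIJ0 (ZAT2) alignment shown above.
query = "CATCNKVFKSWKALFGHLRSHPERTYRGALPP"
midline = "C  C K F S KALFGH+R HPER +RG  PP"

print(percent_identity(query, midline))
```

For this block, 20 of 32 columns are identities, which gives the 62.5 listed for the ZAT2 hit.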

Arabidopsis top hits: hit, e-value, % identity, alignment
AT2G26940.1 C2H2-type zinc finger family protein, e-value 1.9e-05, identity 28.43%
Query:  SSHEHVGGSSDVEAGKKRGRGDGGEQQQQVKAAKKKGELTEVPK--------------GEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAEL
        SS +  GG   + + +   +G   + ++     KKK ++  V K              G+ RC  C K F++  +LFGH+R HP+RT++G  PPP + + 
Subjt:  SSHEHVGGSSDVEAGKKRGRGDGGEQQQQVKAAKKKGELTEVPK--------------GEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAEL

Query:  DI
        ++
Subjt:  DI

AT4G35280.1 C2H2-like zinc finger protein, e-value 7.9e-07, identity 40%
Query:  VGGSSDVEAGKKRGRGDGGEQQQQVKAAKK----KGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPP
        +  S D    KKR +          K+A K    K      PK    C  C + F SWKALFGH+R HPER +RG  PPP
Subjt:  VGGSSDVEAGKKRGRGDGGEQQQQVKAAKK----KGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPP

AT4G35610.1 zinc finger (C2H2 type) family protein, e-value 5.6e-13, identity 36.67%
Query:  GKKRGR----------GDGGEQQQQVKAAKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIRRCQQQLASTLLTVAQ
        GKK+ R          GDG  + +  K  KK  ELT  PKG P C  C + F SWKA+FGH+R+H +R Y+G LPPPT +    R     L S   T+A 
Subjt:  GKKRGR----------GDGGEQQQQVKAAKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIRRCQQQLASTLLTVAQ

Query:  QVSASRRGLDI---DLNQPSTADDGDSP--DNTRDAGFDLNLEPPPESDD
            S  G+ +        S +  GD+P  +  R +G DLN+EP  + ++
Subjt:  QVSASRRGLDI---DLNQPSTADDGDSP--DNTRDAGFDLNLEPPPESDD

AT4G35700.1 zinc finger (C2H2 type) family protein, e-value 4.3e-13, identity 37.75%
Query:  VKAAKKKG--ELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPT--AAELDI----RRCQQQLASTLLTVAQQVSASRRG---------
        VK  +KKG  +LT +P+G P C  C K F SWKA+FGHLR H +R Y G LPPPT  AAE               +T ++V +  +AS  G         
Subjt:  VKAAKKKG--ELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPT--AAELDI----RRCQQQLASTLLTVAQQVSASRRG---------

Query:  ---------LDIDLNQPSTADDGDSPDNTRDAG----FDLNLEPPPESDDE
                   IDLN    AD  +  D     G    FDLN  PPP+ D+E
Subjt:  ---------LDIDLNQPSTADDGDSPDNTRDAG----FDLNLEPPPESDDE

AT5G56200.1 C2H2 type zinc finger transcription factor family, e-value 1.6e-04, identity 30.07%
Query:  EAGKK--RGRGDGGEQQQQVKAAKK---------KGELTEVPKG--EPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIRRCQQQLASTL
        E GK+   G+  GG ++  V   +K          G +    +G  E  C  C K F S KAL+GH+R HP+R ++G LPPP      +       +S+ 
Subjt:  EAGKK--RGRGDGGEQQQQVKAAKK---------KGELTEVPKG--EPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIRRCQQQLASTL

Query:  LTVAQQVSASRRGLDIDLNQPSTADDGDSPDNTRDAGFDLNLE
        L    +  +S    D D +     DD D  D+     +D NLE
Subjt:  LTVAQQVSASRRGLDIDLNQPSTADDGDSPDNTRDAGFDLNLE


Sequences
CDS sequence
ATGGAGAAGAACACCAACACAAATGCCAATACCAATATCAATATCAACATTGATGCGGTATCCGAGACCTCTCCCGACCAACGCCACCGTCGAGAAAGGTCTCCAATGGC
AGCTTCTCCACCACCACCTGCTCGCATTATCGATGACATAAATAACCCTCCCACTGATGTCATCGAACCCTCGTCCTCGATAGTAGTGGCGGCCCCGGAAAGCGTTACAC
CAGGCGACGTAAGACGGCGCCAAAGTGTAAACGCAATGTTCGACGTCGGAACATCATCACACGAGCACGTTGGAGGAAGTTCCGACGTCGAAGCGGGGAAAAAAAGAGGA
CGAGGGGATGGAGGAGAGCAGCAGCAGCAGGTGAAAGCTGCAAAGAAAAAAGGAGAGCTAACGGAGGTTCCAAAGGGTGAGCCAAGATGTGCAACATGTAACAAAGTGTT
CAAATCGTGGAAAGCACTATTTGGACACTTAAGGTCTCACCCTGAACGGACCTACCGTGGAGCTCTTCCTCCGCCAACCGCCGCCGAACTTGACATTCGCCGTTGTCAGC
AGCAGCTCGCTTCCACTTTGCTGACAGTAGCTCAGCAAGTGTCAGCGTCCAGAAGAGGGCTGGATATTGATCTCAACCAACCCTCTACTGCTGACGACGGTGACTCGCCG
GACAACACCAGAGACGCCGGGTTTGATCTGAACCTCGAACCCCCGCCGGAGAGTGACGACGAGAAGTGA
mRNA sequence
ATGGAGAAGAACACCAACACAAATGCCAATACCAATATCAATATCAACATTGATGCGGTATCCGAGACCTCTCCCGACCAACGCCACCGTCGAGAAAGGTCTCCAATGGC
AGCTTCTCCACCACCACCTGCTCGCATTATCGATGACATAAATAACCCTCCCACTGATGTCATCGAACCCTCGTCCTCGATAGTAGTGGCGGCCCCGGAAAGCGTTACAC
CAGGCGACGTAAGACGGCGCCAAAGTGTAAACGCAATGTTCGACGTCGGAACATCATCACACGAGCACGTTGGAGGAAGTTCCGACGTCGAAGCGGGGAAAAAAAGAGGA
CGAGGGGATGGAGGAGAGCAGCAGCAGCAGGTGAAAGCTGCAAAGAAAAAAGGAGAGCTAACGGAGGTTCCAAAGGGTGAGCCAAGATGTGCAACATGTAACAAAGTGTT
CAAATCGTGGAAAGCACTATTTGGACACTTAAGGTCTCACCCTGAACGGACCTACCGTGGAGCTCTTCCTCCGCCAACCGCCGCCGAACTTGACATTCGCCGTTGTCAGC
AGCAGCTCGCTTCCACTTTGCTGACAGTAGCTCAGCAAGTGTCAGCGTCCAGAAGAGGGCTGGATATTGATCTCAACCAACCCTCTACTGCTGACGACGGTGACTCGCCG
GACAACACCAGAGACGCCGGGTTTGATCTGAACCTCGAACCCCCGCCGGAGAGTGACGACGAGAAGTGA
Protein sequence
MEKNTNTNANTNININIDAVSETSPDQRHRRERSPMAASPPPPARIIDDINNPPTDVIEPSSSIVVAAPESVTPGDVRRRQSVNAMFDVGTSSHEHVGGSSDVEAGKKRG
RGDGGEQQQQVKAAKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIRRCQQQLASTLLTVAQQVSASRRGLDIDLNQPSTADDGDSP
DNTRDAGFDLNLEPPPESDDEK
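
The relationship between the CDS and the protein sequence can be checked by translating codons. The following is a minimal sketch, assuming the standard genetic code; the `translate` function and the codon-table construction are written for illustration and are not taken from the database. It is applied to the last line of the CDS above and to its first ten codons.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and final 69 nt of the CDS listed above.
cds_start = "ATGGAGAAGAACACCAACACAAATGCCAAT"
cds_end = "GACAACACCAGAGACGCCGGGTTTGATCTGAACCTCGAACCCCCGCCGGAGAGTGACGACGAGAAGTGA"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the beginning and the final line of the protein sequence, with the terminal TGA read as the stop codon.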