CuGenDBv2

MS020966 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS020966
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: C2H2-type domain-containing protein
Genome location: scaffold290:245154..245861
RNA-Seq Expression: MS020966
Synteny: MS020966
Gene Ontology terms: NA
InterPro domains: IPR013087 - Zinc finger C2H2-type
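
The genome location above is given in a scaffold:start..end form. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; the page does not state the convention), the span could be parsed and measured as follows. The helper names `parse_location` and `span_length` are hypothetical and not part of CuGenDB.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(location: str) -> int:
    """Length of the span in bp, assuming 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return end - start + 1


if __name__ == "__main__":
    # Location string copied from the record above.
    print(span_length("scaffold290:245154..245861"))
```

Under that assumption the span works out to 708 bp, a multiple of three, which can be compared against the lengths of the CDS and protein sequences listed under Sequences.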


Homology
GenBank top hits | e value | % identity
KAG6596278.1 Zinc finger protein ZAT3, partial [Cucurbita argyrosperma subsp. sororia] | 6.6e-61 | 64.03
Query:  MEKNTNTNANVNSSTAPQTSPDQRHDGERSPV--TSPSLAHNVDINNPPPPATIVVIELSSSSDTPPPPPVVAPAADTGRHSSATTIDASVDVGTSYG--
        MEKN NTN        P+TS DQRH GERSP   TS    HN D NNP P       E SS SD  P P    PAA+T R       DA +DVGTSY   
Subjt:  MEKNTNTNANVNSSTAPQTSPDQRHDGERSPV--TSPSLAHNVDINNPPPPATIVVIELSSSSDTPPPPPVVAPAADTGRHSSATTIDASVDVGTSYG--

Query:  -GGSDIDQVKKRGRSDVNN-------PDQAVPKASKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIHRCQQQLAST
         G S+ +  KKRGR D          P Q   KA+KKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDI RCQQQLAST
Subjt:  -GGSDIDQVKKRGRSDVNN-------PDQAVPKASKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIHRCQQQLAST

Query:  LLTAAQ-AAASRRRLDIDLNQPSAADE----EDRTGVGFDLNAEPPPESDDKD
        LLT AQ  AASRR LDIDLNQPS AD+    E   GVGFDLNA+PPPESDD+D
Subjt:  LLTAAQ-AAASRRRLDIDLNQPSAADE----EDRTGVGFDLNAEPPPESDDKD

KAG7027829.1 Zinc finger protein ZAT3, partial [Cucurbita argyrosperma subsp. argyrosperma] | 8.6e-61 | 63.78
Query:  MEKNTNTNANVNSSTAPQTSPDQRHDGERSPV--TSPSLAHNVDINNPPPPATIVVIELSSSSDTPPPPPVVAPAADTGRHSSATTIDASVDVGTSYG--
        MEKN NTN        P+TS DQRH GERSP   TS    HN D NNP P       E SS SD  P P    PAA+T R       DA +DVGTSY   
Subjt:  MEKNTNTNANVNSSTAPQTSPDQRHDGERSPV--TSPSLAHNVDINNPPPPATIVVIELSSSSDTPPPPPVVAPAADTGRHSSATTIDASVDVGTSYG--

Query:  -GGSDIDQVKKRGRSDVNN--------PDQAVPKASKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIHRCQQQLAS
         G S+ +  KKRGR D           P Q   KA+KKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDI RCQQQLAS
Subjt:  -GGSDIDQVKKRGRSDVNN--------PDQAVPKASKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIHRCQQQLAS

Query:  TLLTAAQ-AAASRRRLDIDLNQPSAADE----EDRTGVGFDLNAEPPPESDDKD
        TLLT AQ  AASRR LDIDLNQPS AD+    E   GVGFDLNA+PPPESDD+D
Subjt:  TLLTAAQ-AAASRRRLDIDLNQPSAADE----EDRTGVGFDLNAEPPPESDDKD

XP_008449156.1 PREDICTED: uncharacterized protein LOC103491103 [Cucumis melo] | 4.1e-63 | 65.34
Query:  MEKNTNTNA----NVNSSTAPQTSPDQRHDGERSPVTSPSL--AHNV-DINNPPPPATIVVIELSSSSDTPPPPPVVAPAADTGRHSSATTIDASVDVGT
        MEKNTNTNA    N+N     +TSPDQRH GERSP+ +  L  A N+ DINNPP     VVIE SSS  T    P  A   D  R  S   +DA  DVGT
Subjt:  MEKNTNTNA----NVNSSTAPQTSPDQRHDGERSPVTSPSL--AHNV-DINNPPPPATIVVIELSSSSDTPPPPPVVAPAADTGRHSSATTIDASVDVGT

Query:  S----YGGGSDIDQVKKRGRSDVNNPDQAVPKASKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIHRCQQQLASTL
        S     GG SD +  KKRGR D     Q V KA+KKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDI RCQQQLASTL
Subjt:  S----YGGGSDIDQVKKRGRSDVNNPDQAVPKASKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIHRCQQQLASTL

Query:  LTAA-QAAASRRRLDIDLNQPSAADEED----RTGVGFDLNAEPPPESDDK
        LT A Q AA+RR LDIDLNQPS AD+ D        GFDLN EPPPESDD+
Subjt:  LTAA-QAAASRRRLDIDLNQPSAADEED----RTGVGFDLNAEPPPESDDK

XP_022962440.1 uncharacterized protein LOC111462854 [Cucurbita moschata] | 8.6e-61 | 64.29
Query:  MEKNTNTNANVNSSTAPQTSPDQRHDGERSPV--TSPSLAHNVDINNPPPPATIVVIELSSSSDTPPPPPVVAPAADTGRHSSATTIDASVDVGTSYG--
        MEKN NTN         +TS DQRH GERSP   TSP   HN D NNP P       E SS SDT   P    PAA+T R       DA +DVGTSY   
Subjt:  MEKNTNTNANVNSSTAPQTSPDQRHDGERSPV--TSPSLAHNVDINNPPPPATIVVIELSSSSDTPPPPPVVAPAADTGRHSSATTIDASVDVGTSYG--

Query:  -GGSDIDQVKKRGRSD------VNNPDQAVPKASKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIHRCQQQLASTL
         G S+ +  KKRGR D         P Q   KA+KKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDI RCQQQLASTL
Subjt:  -GGSDIDQVKKRGRSD------VNNPDQAVPKASKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIHRCQQQLASTL

Query:  LTAAQ-AAASRRRLDIDLNQPSAADEEDRT----GVGFDLNAEPPPESDDKD
        LT AQ  AASRR LDIDLNQPS AD+ D      GVGFDLNA+PPPESDD+D
Subjt:  LTAAQ-AAASRRRLDIDLNQPSAADEEDRT----GVGFDLNAEPPPESDDKD

XP_022971305.1 uncharacterized protein LOC111470063 [Cucurbita maxima] | 3.3e-60 | 64.29
Query:  MEKNTNTNANVNSSTAPQTSPDQRHDGERSPV--TSPSLAHNVDINNPPPPATIVVIELSSSSDTPPPPPVVAPAADTGRHSSATTIDASVDVGTSYG--
        MEKN NTN        P+TS DQ H GERSP   TSP   HN D NNP P         SS SDT     +V PAADT R       DA +D+GTSY   
Subjt:  MEKNTNTNANVNSSTAPQTSPDQRHDGERSPV--TSPSLAHNVDINNPPPPATIVVIELSSSSDTPPPPPVVAPAADTGRHSSATTIDASVDVGTSYG--

Query:  -GGSDIDQVKKRGRSD------VNNPDQAVPKASKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIHRCQQQLASTL
         G SD +  KKRGR D         P Q V KA+KKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDI RCQQQLASTL
Subjt:  -GGSDIDQVKKRGRSD------VNNPDQAVPKASKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIHRCQQQLASTL

Query:  LTAAQ-AAASRRRLDIDLNQPSAADE----EDRTGVGFDLNAEPPPESDDKD
        LT AQ  AASRR LDIDLNQPS AD+    E   GVGFDLNA+PPP SDD+D
Subjt:  LTAAQ-AAASRRRLDIDLNQPSAADE----EDRTGVGFDLNAEPPPESDDKD

TrEMBL top hits | e value | % identity
A0A0A0L0X7 C2H2-type domain-containing protein | 2.7e-60 | 63.35
Query:  MEKNTNTNA----NVNSSTAPQTSPDQRHDGERSPVTS--PSLAHNV-DINNPPPPATIVVIELSSSSDTPPPPPVVAPAADTGRHSSATTIDASVDVGT
        MEKNTNTNA    N+N     +TSPDQRH  ERSP+ +  P  A  + DINNPP      VIE SSS     P  V     D  R  S   ++A  DVGT
Subjt:  MEKNTNTNA----NVNSSTAPQTSPDQRHDGERSPVTS--PSLAHNV-DINNPPPPATIVVIELSSSSDTPPPPPVVAPAADTGRHSSATTIDASVDVGT

Query:  S----YGGGSDIDQVKKRGRSDVNNPDQAVPKASKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIHRCQQQLASTL
        S     GG SD++  KKRGR D     Q V KA+KKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDI RCQQQLASTL
Subjt:  S----YGGGSDIDQVKKRGRSDVNNPDQAVPKASKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIHRCQQQLASTL

Query:  LTAA-QAAASRRRLDIDLNQPSAADEED----RTGVGFDLNAEPPPESDDK
        LT A Q +ASRR LDIDLNQPS AD+ D        GFDLN EPPPESDD+
Subjt:  LTAA-QAAASRRRLDIDLNQPSAADEED----RTGVGFDLNAEPPPESDDK

A0A1S3BLF2 uncharacterized protein LOC103491103 | 2.0e-63 | 65.34
Query:  MEKNTNTNA----NVNSSTAPQTSPDQRHDGERSPVTSPSL--AHNV-DINNPPPPATIVVIELSSSSDTPPPPPVVAPAADTGRHSSATTIDASVDVGT
        MEKNTNTNA    N+N     +TSPDQRH GERSP+ +  L  A N+ DINNPP     VVIE SSS  T    P  A   D  R  S   +DA  DVGT
Subjt:  MEKNTNTNA----NVNSSTAPQTSPDQRHDGERSPVTSPSL--AHNV-DINNPPPPATIVVIELSSSSDTPPPPPVVAPAADTGRHSSATTIDASVDVGT

Query:  S----YGGGSDIDQVKKRGRSDVNNPDQAVPKASKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIHRCQQQLASTL
        S     GG SD +  KKRGR D     Q V KA+KKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDI RCQQQLASTL
Subjt:  S----YGGGSDIDQVKKRGRSDVNNPDQAVPKASKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIHRCQQQLASTL

Query:  LTAA-QAAASRRRLDIDLNQPSAADEED----RTGVGFDLNAEPPPESDDK
        LT A Q AA+RR LDIDLNQPS AD+ D        GFDLN EPPPESDD+
Subjt:  LTAA-QAAASRRRLDIDLNQPSAADEED----RTGVGFDLNAEPPPESDDK

A0A5D3CLS0 Zinc finger family protein | 2.0e-63 | 65.34
Query:  MEKNTNTNA----NVNSSTAPQTSPDQRHDGERSPVTSPSL--AHNV-DINNPPPPATIVVIELSSSSDTPPPPPVVAPAADTGRHSSATTIDASVDVGT
        MEKNTNTNA    N+N     +TSPDQRH GERSP+ +  L  A N+ DINNPP     VVIE SSS  T    P  A   D  R  S   +DA  DVGT
Subjt:  MEKNTNTNA----NVNSSTAPQTSPDQRHDGERSPVTSPSL--AHNV-DINNPPPPATIVVIELSSSSDTPPPPPVVAPAADTGRHSSATTIDASVDVGT

Query:  S----YGGGSDIDQVKKRGRSDVNNPDQAVPKASKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIHRCQQQLASTL
        S     GG SD +  KKRGR D     Q V KA+KKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDI RCQQQLASTL
Subjt:  S----YGGGSDIDQVKKRGRSDVNNPDQAVPKASKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIHRCQQQLASTL

Query:  LTAA-QAAASRRRLDIDLNQPSAADEED----RTGVGFDLNAEPPPESDDK
        LT A Q AA+RR LDIDLNQPS AD+ D        GFDLN EPPPESDD+
Subjt:  LTAA-QAAASRRRLDIDLNQPSAADEED----RTGVGFDLNAEPPPESDDK

A0A6J1HF26 uncharacterized protein LOC111462854 | 4.2e-61 | 64.29
Query:  MEKNTNTNANVNSSTAPQTSPDQRHDGERSPV--TSPSLAHNVDINNPPPPATIVVIELSSSSDTPPPPPVVAPAADTGRHSSATTIDASVDVGTSYG--
        MEKN NTN         +TS DQRH GERSP   TSP   HN D NNP P       E SS SDT   P    PAA+T R       DA +DVGTSY   
Subjt:  MEKNTNTNANVNSSTAPQTSPDQRHDGERSPV--TSPSLAHNVDINNPPPPATIVVIELSSSSDTPPPPPVVAPAADTGRHSSATTIDASVDVGTSYG--

Query:  -GGSDIDQVKKRGRSD------VNNPDQAVPKASKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIHRCQQQLASTL
         G S+ +  KKRGR D         P Q   KA+KKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDI RCQQQLASTL
Subjt:  -GGSDIDQVKKRGRSD------VNNPDQAVPKASKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIHRCQQQLASTL

Query:  LTAAQ-AAASRRRLDIDLNQPSAADEEDRT----GVGFDLNAEPPPESDDKD
        LT AQ  AASRR LDIDLNQPS AD+ D      GVGFDLNA+PPPESDD+D
Subjt:  LTAAQ-AAASRRRLDIDLNQPSAADEEDRT----GVGFDLNAEPPPESDDKD

A0A6J1I5D3 uncharacterized protein LOC111470063 | 1.6e-60 | 64.29
Query:  MEKNTNTNANVNSSTAPQTSPDQRHDGERSPV--TSPSLAHNVDINNPPPPATIVVIELSSSSDTPPPPPVVAPAADTGRHSSATTIDASVDVGTSYG--
        MEKN NTN        P+TS DQ H GERSP   TSP   HN D NNP P         SS SDT     +V PAADT R       DA +D+GTSY   
Subjt:  MEKNTNTNANVNSSTAPQTSPDQRHDGERSPV--TSPSLAHNVDINNPPPPATIVVIELSSSSDTPPPPPVVAPAADTGRHSSATTIDASVDVGTSYG--

Query:  -GGSDIDQVKKRGRSD------VNNPDQAVPKASKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIHRCQQQLASTL
         G SD +  KKRGR D         P Q V KA+KKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDI RCQQQLASTL
Subjt:  -GGSDIDQVKKRGRSD------VNNPDQAVPKASKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIHRCQQQLASTL

Query:  LTAAQ-AAASRRRLDIDLNQPSAADE----EDRTGVGFDLNAEPPPESDDKD
        LT AQ  AASRR LDIDLNQPS AD+    E   GVGFDLNA+PPP SDD+D
Subjt:  LTAAQ-AAASRRRLDIDLNQPSAADE----EDRTGVGFDLNAEPPPESDDKD

SwissProt top hits | e value | % identity
O65499 Zinc finger protein ZAT3 | 3.3e-07 | 42.25
Query:  KKRGRSDVNNPDQAVPKASKK----KGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPP
        KK+    V +   + PK++ K    K      PK    C  C + F SWKALFGH+R HPER +RG  PPP
Subjt:  KKRGRSDVNNPDQAVPKASKK----KGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPP

Arabidopsis top hits | e value | % identity
AT2G26940.1 C2H2-type zinc finger family protein | 1.7e-06 | 29
Query:  GTSYGGGSDIDQVKKRGRSDVNNPDQAVPKASK-------KKGELTEVP------KGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDI
        G +YGG   I  ++   +  +    +   K  K       KK +  E+       +G+ RC  C K F++  +LFGH+R HP+RT++G  PPP + + ++
Subjt:  GTSYGGGSDIDQVKKRGRSDVNNPDQAVPKASK-------KKGELTEVP------KGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDI

AT4G35280.1 C2H2-like zinc finger protein | 2.4e-08 | 42.25
Query:  KKRGRSDVNNPDQAVPKASKK----KGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPP
        KK+    V +   + PK++ K    K      PK    C  C + F SWKALFGH+R HPER +RG  PPP
Subjt:  KKRGRSDVNNPDQAVPKASKK----KGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPP

AT4G35610.1 zinc finger (C2H2 type) family protein | 1.9e-10 | 30.69
Query:  DASVDVGTSYGGGSDIDQVKKRGRSDVNNPDQA-------VPKASKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDI
        + +V  G + G GS   + K R R      D+          K  KK  ELT  PKG P C  C + F SWKA+FGH+R+H +R Y+G LPPPT +    
Subjt:  DASVDVGTSYGGGSDIDQVKKRGRSDVNNPDQA-------VPKASKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDI

Query:  HRCQQQLASTLLTA---------------------------AQAAASRRRLDIDLNQPSAADEEDRTGVG----FDLNAEPPPESDDKD
              L S    A                           A  +   R   IDLN       E+ T  G    FDLN  PP + +++D
Subjt:  HRCQQQLASTLLTA---------------------------AQAAASRRRLDIDLNQPSAADEEDRTGVG----FDLNAEPPPESDDKD

AT4G35700.1 zinc finger (C2H2 type) family protein | 3.5e-12 | 52.38
Query:  VNNPDQAVPKASKKKG--ELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPT
        V   +  V K  +KKG  +LT +P+G P C  C K F SWKA+FGHLR H +R Y G LPPPT
Subjt:  VNNPDQAVPKASKKKG--ELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPT

AT5G56200.1 C2H2 type zinc finger transcription factor family | 1.1e-05 | 35.87
Query:  EPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIHRCQQQLASTLLTAAQAAASRRRLDIDLNQPSAADEEDRTGVGFDLNAE
        E  C  C K F S KAL+GH+R HP+R ++G LPPP    L  H      +STL    +  +S    D D +     D++D     +D N E
Subjt:  EPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIHRCQQQLASTLLTAAQAAASRRRLDIDLNQPSAADEEDRTGVGFDLNAE


Sequences
CDS sequence
ATGGAGAAGAACACGAATACCAATGCCAATGTCAATTCCAGTACTGCTCCACAGACCTCTCCCGACCAACGCCACGACGGGGAAAGGTCTCCGGTGACTTCTCCATCACT
CGCACACAACGTGGATATCAATAACCCTCCTCCTCCCGCCACCATTGTCGTCATCGAACTCTCCTCCTCCTCCGATACTCCTCCGCCGCCACCTGTAGTAGCTCCTGCAG
CCGACACAGGACGACACTCCTCCGCCACCACTATAGACGCTTCGGTCGATGTCGGGACGTCGTACGGAGGCGGTTCCGACATCGACCAAGTGAAAAAGAGAGGGCGGAGT
GATGTGAATAACCCGGATCAGGCCGTTCCCAAAGCCTCCAAAAAGAAGGGAGAGCTGACGGAGGTTCCAAAAGGGGAGCCGAGATGTGCCACGTGCAACAAAGTGTTCAA
ATCGTGGAAGGCACTTTTCGGGCACTTGAGGTCTCACCCGGAACGAACCTACCGTGGAGCTCTTCCGCCACCCACGGCGGCAGAGCTTGACATCCACCGGTGCCAGCAGC
AGCTGGCTTCCACTTTGCTGACGGCGGCTCAGGCGGCGGCTTCCAGAAGGAGACTGGACATCGACCTGAACCAGCCGTCAGCCGCCGACGAGGAGGACCGCACCGGCGTC
GGCTTCGATTTGAACGCCGAGCCCCCGCCGGAGAGCGATGACAAGGAT
mRNA sequence
ATGGAGAAGAACACGAATACCAATGCCAATGTCAATTCCAGTACTGCTCCACAGACCTCTCCCGACCAACGCCACGACGGGGAAAGGTCTCCGGTGACTTCTCCATCACT
CGCACACAACGTGGATATCAATAACCCTCCTCCTCCCGCCACCATTGTCGTCATCGAACTCTCCTCCTCCTCCGATACTCCTCCGCCGCCACCTGTAGTAGCTCCTGCAG
CCGACACAGGACGACACTCCTCCGCCACCACTATAGACGCTTCGGTCGATGTCGGGACGTCGTACGGAGGCGGTTCCGACATCGACCAAGTGAAAAAGAGAGGGCGGAGT
GATGTGAATAACCCGGATCAGGCCGTTCCCAAAGCCTCCAAAAAGAAGGGAGAGCTGACGGAGGTTCCAAAAGGGGAGCCGAGATGTGCCACGTGCAACAAAGTGTTCAA
ATCGTGGAAGGCACTTTTCGGGCACTTGAGGTCTCACCCGGAACGAACCTACCGTGGAGCTCTTCCGCCACCCACGGCGGCAGAGCTTGACATCCACCGGTGCCAGCAGC
AGCTGGCTTCCACTTTGCTGACGGCGGCTCAGGCGGCGGCTTCCAGAAGGAGACTGGACATCGACCTGAACCAGCCGTCAGCCGCCGACGAGGAGGACCGCACCGGCGTC
GGCTTCGATTTGAACGCCGAGCCCCCGCCGGAGAGCGATGACAAGGAT
Protein sequence
MEKNTNTNANVNSSTAPQTSPDQRHDGERSPVTSPSLAHNVDINNPPPPATIVVIELSSSSDTPPPPPVVAPAADTGRHSSATTIDASVDVGTSYGGGSDIDQVKKRGRS
DVNNPDQAVPKASKKKGELTEVPKGEPRCATCNKVFKSWKALFGHLRSHPERTYRGALPPPTAAELDIHRCQQQLASTLLTAAQAAASRRRLDIDLNQPSAADEEDRTGV
GFDLNAEPPPESDDKD