; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g11240 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g11240
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionDUF4216 domain-containing protein
Genome locationchr5:8758380..8777036
RNA-Seq ExpressionMoc05g11240
SyntenyMoc05g11240
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR025312 - Domain of unknown function DUF4216


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046182.1 zf-CCHC domain-containing protein/UBN2 domain-containing protein [Cucumis melo var. makuwa]2.7e-7671.55Show/hide
Query:  KKPDHIRTDCPLLKSSKKSKKKAMKTTRDDSDESGSESENEEVANFCFMAHSDKEDEQDDEVNLDPLSYDELFEAFENMQNDLEKLGSKYVILKKKCNVL
        KK  HIRTDCP LKSSKKSK+KAMK T DDS E  SESE EE AN   M  SDKEDE DDEV L+P S +ELFE FEN+QNDLEKL SKYV+LKKK NVL
Subjt:  KKPDHIRTDCPLLKSSKKSKKKAMKTTRDDSDESGSESENEEVANFCFMAHSDKEDEQDDEVNLDPLSYDELFEAFENMQNDLEKLGSKYVILKKKCNVL

Query:  TSENKSLLDGIACLKKN---EHDVVNISCNKHVLDCDEKNVLLDKIRFLEHDGCEKDNLIKLLKKNESNALVELDKAKDSIKKLTIGAQRLDKIIEVGKP
        +SENKSLLD IAC K+N   + + +N+S +KH+ DC+EK+ LLDK+RFLEHD CEKDNLIK+LK+NE N L +LDKAK++IKKLTIGAQRLDKIIEVGK 
Subjt:  TSENKSLLDGIACLKKN---EHDVVNISCNKHVLDCDEKNVLLDKIRFLEHDGCEKDNLIKLLKKNESNALVELDKAKDSIKKLTIGAQRLDKIIEVGKP

Query:  YGDKRGLGYIDECSTPSSTKTIFVKESPNMPK
        YGDKR LGYIDE ST SS KT FVK SP +PK
Subjt:  YGDKRGLGYIDECSTPSSTKTIFVKESPNMPK

XP_022148890.1 uncharacterized protein LOC111017452 [Momordica charantia]4.7e-8167.07Show/hide
Query:  GSNGITEMLDDLTNEEYHDQDYSTGIGSSDNSQSGIVNELYALVCGSDCQVNSYQGCVTNGVRFNTNERDGCCTTQNSGVCVFGGDDNEISDFYGIIKEV
        G++G    L ++   E   +  S+ + S+ N    +VNELYAL CG D +VNSYQ CVTNGVRFNTNERD   TTQNSGVCVFGGDDNE SDFYGIIKEV
Subjt:  GSNGITEMLDDLTNEEYHDQDYSTGIGSSDNSQSGIVNELYALVCGSDCQVNSYQGCVTNGVRFNTNERDGCCTTQNSGVCVFGGDDNEISDFYGIIKEV

Query:  IELKYIKDKQVLLFSCDRYDK-------------TSINTCHLWYKDDQFILVSQAQQVFYVDDLQLGNGWKVAQRIQHRHLWDVPKVEEIDLIETDINQC
        IELKYIKDK+VLLF CD YD              TSINTCHLWYKDD FILVSQAQQVFYV DLQLGNGWKVAQ+IQHRHLWDVP+VEEIDL+E DINQC
Subjt:  IELKYIKDKQVLLFSCDRYDK-------------TSINTCHLWYKDDQFILVSQAQQVFYVDDLQLGNGWKVAQRIQHRHLWDVPKVEEIDLIETDINQC

Query:  AVDEVDLETQTFHRPDIDPSILSDNT---------DRMSSDSEGED
         VDEV+LETQTFHR DIDPSI+SDNT         D +  + E ED
Subjt:  AVDEVDLETQTFHRPDIDPSILSDNT---------DRMSSDSEGED

XP_022156978.1 uncharacterized protein LOC111023806 [Momordica charantia]3.7e-8677.02Show/hide
Query:  RTDCPLLKSSKKSKKKAMKTTRDDSDESGSESENEEVANFCFMAHSDKEDEQDDEVNLDPLSYDELFEAFENMQNDLEKLGSKYVILKKKCNVLTSENKS
        +TDCPLLKSSKKSKKKAMK T DDSDESGSESENEEVANFCFMAHSDKEDEQDDEVNLDPLSYDELFEAFENMQN+LEKLGSKYV+LK KCNV TSENKS
Subjt:  RTDCPLLKSSKKSKKKAMKTTRDDSDESGSESENEEVANFCFMAHSDKEDEQDDEVNLDPLSYDELFEAFENMQNDLEKLGSKYVILKKKCNVLTSENKS

Query:  LLDGIACLKKNEHDVVNISCNKHVLDCDEKNVLLDKIRFLEHDGCEKDNLIKLLKKNESNALVELDKAKDSIKKLTIGAQRLDKIIEVGKPYGDKRGLGY
        L D IACLKKNEHDV                                DNLIKLLKKNES+ALVELDKAKD IK+LTIGAQRLDKIIE GKPYGDKRGLGY
Subjt:  LLDGIACLKKNEHDVVNISCNKHVLDCDEKNVLLDKIRFLEHDGCEKDNLIKLLKKNESNALVELDKAKDSIKKLTIGAQRLDKIIEVGKPYGDKRGLGY

Query:  IDECSTPSSTKTIFVKESPNMPKLVAPKVVSEHAK
        I+EC+TPSS+KTIFVK SPNMPKLVAPKV  + +K
Subjt:  IDECSTPSSTKTIFVKESPNMPKLVAPKVVSEHAK

XP_022158792.1 uncharacterized protein LOC111025259 [Momordica charantia]1.1e-8554.97Show/hide
Query:  KKPDHIRTDCPLLKSSKKSKKKAMKTTRDDSDESGSESENEEVANFCFMAHSDKEDEQDDEVNLDPLSYDELFEAFENMQNDLEKLGSKYVILKKKCNVL
        KKP HIRTDCP LKSSKKSKKKAMK T DDSDESG+ESENEEVANFCFMAHSDKEDE+DDE+ LDPLSYDELFEAFENMQNDLEK               
Subjt:  KKPDHIRTDCPLLKSSKKSKKKAMKTTRDDSDESGSESENEEVANFCFMAHSDKEDEQDDEVNLDPLSYDELFEAFENMQNDLEKLGSKYVILKKKCNVL

Query:  TSENKSLLDGIACLKKNEHDVVNISCNKHVLDCDEKNVLLDKIRFLEHDGCEKDNLIKLLKKNESNALVELDKAKDSIKKLTIGAQRLDKIIEVGKPYGD
                                                                           LVELDKAKDSIKKLTIGAQRLDKIIE+GKPYGD
Subjt:  TSENKSLLDGIACLKKNEHDVVNISCNKHVLDCDEKNVLLDKIRFLEHDGCEKDNLIKLLKKNESNALVELDKAKDSIKKLTIGAQRLDKIIEVGKPYGD

Query:  KRGLGYIDECSTPSSTKTIFVKESPNMPKLVAPKVVSEHAKINFVPICHYCGVESHVRPKCFKLK-----YAQTTSSRRNFSQRAKFHNAPRKNFSKKSR
        KRGLGYIDECSTPSS+K IFVK SPNMPKLVAPKVV                        C K       Y  +  SR     ++KF       FSKK  
Subjt:  KRGLGYIDECSTPSSTKTIFVKESPNMPKLVAPKVVSEHAKINFVPICHYCGVESHVRPKCFKLK-----YAQTTSSRRNFSQRAKFHNAPRKNFSKKSR

Query:  MHKFVVKDNSLHNVVCFSCSNDFLERNFGDLLISDKSKEIASSKQEVSIDENKVDGFSSMPKGWKYAPSHPKDLILGDPEQG
                      V F  S    ERNFGDLL+SDKSKEI SSKQEVSI+ENKVDGFSSMPK WKYAPSHPKDLILGDPEQG
Subjt:  MHKFVVKDNSLHNVVCFSCSNDFLERNFGDLLISDKSKEIASSKQEVSIDENKVDGFSSMPKGWKYAPSHPKDLILGDPEQG

XP_031741720.1 uncharacterized protein LOC116403915 [Cucumis sativus]9.7e-11973.9Show/hide
Query:  KKPDHIRTDCPLLKSSKKSKKKAMKTTRDDSDESGSESENEEVANFCFMAHSDKEDEQDDEVNLDPLSYDELFEAFENMQNDLEKLGSKYVILKKKCNVL
        K+  HIRTDCPLLKSSKKSKKKAMK T DDS E  SESE EE+AN   MAHSDK+DE DD+V L+PLS DELFE FE+MQNDLEKL SKYV+LKKK NVL
Subjt:  KKPDHIRTDCPLLKSSKKSKKKAMKTTRDDSDESGSESENEEVANFCFMAHSDKEDEQDDEVNLDPLSYDELFEAFENMQNDLEKLGSKYVILKKKCNVL

Query:  TSENKSLLDGIACLKKNEH----DVVNISCNKHVLDCDEKNVLLDKIRFLEHDGCEKDNLIKLLKKNESNALVELDKAKDSIKKLTIGAQRLDKIIEVGK
         SENKSLLD IAC K+NE+    + +N+S +KHV  C EK+ LLDK+RFLEHD CEKDNLIK+LK+NE + L ELDKAK++IKKLTIGAQRLDKIIEVGK
Subjt:  TSENKSLLDGIACLKKNEH----DVVNISCNKHVLDCDEKNVLLDKIRFLEHDGCEKDNLIKLLKKNESNALVELDKAKDSIKKLTIGAQRLDKIIEVGK

Query:  PYGDKRGLGYIDECSTPSSTKTIFVKESPNMPKLVAPKVVSEHAKINFVPICHYCGVESHVRPKCFKLKYAQTTSSRRNFSQRAKFHNAPRKNFSKKSRM
         YGDKRGLGYIDE STPSS+KT FVK SP +PK      VS H K +FVPICH CGVE H+RPKCFKLKYAQ T SRRNFSQRAKF+ APRKNFS KSR+
Subjt:  PYGDKRGLGYIDECSTPSSTKTIFVKESPNMPKLVAPKVVSEHAKINFVPICHYCGVESHVRPKCFKLKYAQTTSSRRNFSQRAKFHNAPRKNFSKKSRM

Query:  HKFVVKDNSLHNVVCFSC
        HKFV+K+ SLHNVVCFSC
Subjt:  HKFVVKDNSLHNVVCFSC

TrEMBL top hitse value%identityAlignment
A0A5A7TRZ7 Zf-CCHC domain-containing protein/UBN2 domain-containing protein1.3e-7671.55Show/hide
Query:  KKPDHIRTDCPLLKSSKKSKKKAMKTTRDDSDESGSESENEEVANFCFMAHSDKEDEQDDEVNLDPLSYDELFEAFENMQNDLEKLGSKYVILKKKCNVL
        KK  HIRTDCP LKSSKKSK+KAMK T DDS E  SESE EE AN   M  SDKEDE DDEV L+P S +ELFE FEN+QNDLEKL SKYV+LKKK NVL
Subjt:  KKPDHIRTDCPLLKSSKKSKKKAMKTTRDDSDESGSESENEEVANFCFMAHSDKEDEQDDEVNLDPLSYDELFEAFENMQNDLEKLGSKYVILKKKCNVL

Query:  TSENKSLLDGIACLKKN---EHDVVNISCNKHVLDCDEKNVLLDKIRFLEHDGCEKDNLIKLLKKNESNALVELDKAKDSIKKLTIGAQRLDKIIEVGKP
        +SENKSLLD IAC K+N   + + +N+S +KH+ DC+EK+ LLDK+RFLEHD CEKDNLIK+LK+NE N L +LDKAK++IKKLTIGAQRLDKIIEVGK 
Subjt:  TSENKSLLDGIACLKKN---EHDVVNISCNKHVLDCDEKNVLLDKIRFLEHDGCEKDNLIKLLKKNESNALVELDKAKDSIKKLTIGAQRLDKIIEVGKP

Query:  YGDKRGLGYIDECSTPSSTKTIFVKESPNMPK
        YGDKR LGYIDE ST SS KT FVK SP +PK
Subjt:  YGDKRGLGYIDECSTPSSTKTIFVKESPNMPK

A0A6J1D6Q8 uncharacterized protein LOC1110174522.3e-8167.07Show/hide
Query:  GSNGITEMLDDLTNEEYHDQDYSTGIGSSDNSQSGIVNELYALVCGSDCQVNSYQGCVTNGVRFNTNERDGCCTTQNSGVCVFGGDDNEISDFYGIIKEV
        G++G    L ++   E   +  S+ + S+ N    +VNELYAL CG D +VNSYQ CVTNGVRFNTNERD   TTQNSGVCVFGGDDNE SDFYGIIKEV
Subjt:  GSNGITEMLDDLTNEEYHDQDYSTGIGSSDNSQSGIVNELYALVCGSDCQVNSYQGCVTNGVRFNTNERDGCCTTQNSGVCVFGGDDNEISDFYGIIKEV

Query:  IELKYIKDKQVLLFSCDRYDK-------------TSINTCHLWYKDDQFILVSQAQQVFYVDDLQLGNGWKVAQRIQHRHLWDVPKVEEIDLIETDINQC
        IELKYIKDK+VLLF CD YD              TSINTCHLWYKDD FILVSQAQQVFYV DLQLGNGWKVAQ+IQHRHLWDVP+VEEIDL+E DINQC
Subjt:  IELKYIKDKQVLLFSCDRYDK-------------TSINTCHLWYKDDQFILVSQAQQVFYVDDLQLGNGWKVAQRIQHRHLWDVPKVEEIDLIETDINQC

Query:  AVDEVDLETQTFHRPDIDPSILSDNT---------DRMSSDSEGED
         VDEV+LETQTFHR DIDPSI+SDNT         D +  + E ED
Subjt:  AVDEVDLETQTFHRPDIDPSILSDNT---------DRMSSDSEGED

A0A6J1DS74 uncharacterized protein LOC1110238061.8e-8677.02Show/hide
Query:  RTDCPLLKSSKKSKKKAMKTTRDDSDESGSESENEEVANFCFMAHSDKEDEQDDEVNLDPLSYDELFEAFENMQNDLEKLGSKYVILKKKCNVLTSENKS
        +TDCPLLKSSKKSKKKAMK T DDSDESGSESENEEVANFCFMAHSDKEDEQDDEVNLDPLSYDELFEAFENMQN+LEKLGSKYV+LK KCNV TSENKS
Subjt:  RTDCPLLKSSKKSKKKAMKTTRDDSDESGSESENEEVANFCFMAHSDKEDEQDDEVNLDPLSYDELFEAFENMQNDLEKLGSKYVILKKKCNVLTSENKS

Query:  LLDGIACLKKNEHDVVNISCNKHVLDCDEKNVLLDKIRFLEHDGCEKDNLIKLLKKNESNALVELDKAKDSIKKLTIGAQRLDKIIEVGKPYGDKRGLGY
        L D IACLKKNEHDV                                DNLIKLLKKNES+ALVELDKAKD IK+LTIGAQRLDKIIE GKPYGDKRGLGY
Subjt:  LLDGIACLKKNEHDVVNISCNKHVLDCDEKNVLLDKIRFLEHDGCEKDNLIKLLKKNESNALVELDKAKDSIKKLTIGAQRLDKIIEVGKPYGDKRGLGY

Query:  IDECSTPSSTKTIFVKESPNMPKLVAPKVVSEHAK
        I+EC+TPSS+KTIFVK SPNMPKLVAPKV  + +K
Subjt:  IDECSTPSSTKTIFVKESPNMPKLVAPKVVSEHAK

A0A6J1DY46 uncharacterized protein LOC1110252595.2e-8654.97Show/hide
Query:  KKPDHIRTDCPLLKSSKKSKKKAMKTTRDDSDESGSESENEEVANFCFMAHSDKEDEQDDEVNLDPLSYDELFEAFENMQNDLEKLGSKYVILKKKCNVL
        KKP HIRTDCP LKSSKKSKKKAMK T DDSDESG+ESENEEVANFCFMAHSDKEDE+DDE+ LDPLSYDELFEAFENMQNDLEK               
Subjt:  KKPDHIRTDCPLLKSSKKSKKKAMKTTRDDSDESGSESENEEVANFCFMAHSDKEDEQDDEVNLDPLSYDELFEAFENMQNDLEKLGSKYVILKKKCNVL

Query:  TSENKSLLDGIACLKKNEHDVVNISCNKHVLDCDEKNVLLDKIRFLEHDGCEKDNLIKLLKKNESNALVELDKAKDSIKKLTIGAQRLDKIIEVGKPYGD
                                                                           LVELDKAKDSIKKLTIGAQRLDKIIE+GKPYGD
Subjt:  TSENKSLLDGIACLKKNEHDVVNISCNKHVLDCDEKNVLLDKIRFLEHDGCEKDNLIKLLKKNESNALVELDKAKDSIKKLTIGAQRLDKIIEVGKPYGD

Query:  KRGLGYIDECSTPSSTKTIFVKESPNMPKLVAPKVVSEHAKINFVPICHYCGVESHVRPKCFKLK-----YAQTTSSRRNFSQRAKFHNAPRKNFSKKSR
        KRGLGYIDECSTPSS+K IFVK SPNMPKLVAPKVV                        C K       Y  +  SR     ++KF       FSKK  
Subjt:  KRGLGYIDECSTPSSTKTIFVKESPNMPKLVAPKVVSEHAKINFVPICHYCGVESHVRPKCFKLK-----YAQTTSSRRNFSQRAKFHNAPRKNFSKKSR

Query:  MHKFVVKDNSLHNVVCFSCSNDFLERNFGDLLISDKSKEIASSKQEVSIDENKVDGFSSMPKGWKYAPSHPKDLILGDPEQG
                      V F  S    ERNFGDLL+SDKSKEI SSKQEVSI+ENKVDGFSSMPK WKYAPSHPKDLILGDPEQG
Subjt:  MHKFVVKDNSLHNVVCFSCSNDFLERNFGDLLISDKSKEIASSKQEVSIDENKVDGFSSMPKGWKYAPSHPKDLILGDPEQG

A5BJF7 DUF4216 domain-containing protein2.9e-4431.56Show/hide
Query:  NELYALVCGSDCQVNSYQGCVTNGVRFNTNERDGCCTTQNSGVCVFGGDDNEISDFYGIIKEVIELKYIKDKQVLLFSCDRYDK-------------TSI
        +ELY+L  G D +V+ Y  CV NG+RF+T +RD   TTQNSGV V G    ++ DFYG++  V+ L YI + QV+LF C+ +D                I
Subjt:  NELYALVCGSDCQVNSYQGCVTNGVRFNTNERDGCCTTQNSGVCVFGGDDNEISDFYGIIKEVIELKYIKDKQVLLFSCDRYDK-------------TSI

Query:  NTCHLWYKDDQFILVSQAQQVFYVDDLQLGNGWKVAQRIQHRHLWDVPKVEEIDLIETDINQCAVDEVDLETQTFHRPDIDPSILSDNTDRMSSDSEGED
        N  + WY++D ++L SQAQQ+FYV+D +LG+ WKV Q++ HRH++DV +    +  E D     ++E   E              +D+TD + S ++   
Subjt:  NTCHLWYKDDQFILVSQAQQVFYVDDLQLGNGWKVAQRIQHRHLWDVPKVEEIDLIETDINQCAVDEVDLETQTFHRPDIDPSILSDNTDRMSSDSEGED

Query:  TIAGITRTRGETGGEFLLRVVNVVGKIK-------LEWTEQQDRPIGPGRSLLSIYLSHMCINIFYTRLELDSKIIERNYIVITKSHVIQHVLAKSRIKT
            + + +   G    + + NVV   +         +   QD         L  Y S    N      + DS          + S      L K R  T
Subjt:  TIAGITRTRGETGGEFLLRVVNVVGKIK-------LEWTEQQDRPIGPGRSLLSIYLSHMCINIFYTRLELDSKIIERNYIVITKSHVIQHVLAKSRIKT

Query:  ENKKTGTYCVIDENLLHGRNKMALSKLRFNHRGGPKPF---QCIEKIRKEDGTYLSLIHIFFNMHYSEEKGWINEEARNSYEEMITLKAYHASQGDEKTE
          K T T   I    ++  N     KL FNHRGG KPF   +    +    G     + +F   HY+E KGWIN+ A++ YEEM  L+      GD    
Subjt:  ENKKTGTYCVIDENLLHGRNKMALSKLRFNHRGGPKPF---QCIEKIRKEDGTYLSLIHIFFNMHYSEEKGWINEEARNSYEEMITLKAYHASQGDEKTE

Query:  E-EIMETVLGRRSNYIIGMRYGPKPTRNKGSSSKYSDEYVESLEARLQKHEEELATQRKANEDQQIATQ
        + EI + VLG+RS+YI G+ YG +  R+  +++++ DE +E L  ++ K EE   T     E+ +   Q
Subjt:  E-EIMETVLGRRSNYIIGMRYGPKPTRNKGSSSKYSDEYVESLEARLQKHEEELATQRKANEDQQIATQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G05360.1 Zinc knuckle (CCHC-type) family protein9.0e-0626.19Show/hide
Query:  KHVLDCDEKNVLLDKIRFLEHDGCEKDNLIKLLKKNESNALVELDKAKDSIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDECSTPSSTKTIF-------
        +H +   E+N +L K +        K       K+ E++ L E  K   +++ L  G ++L  I+ +GK   DK GLG+      PS +  +F       
Subjt:  KHVLDCDEKNVLLDKIRFLEHDGCEKDNLIKLLKKNESNALVELDKAKDSIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDECSTPSSTKTIF-------

Query:  -----VKESPNMPKLVA-------------------PKVVSEHAKINFVPICHYCGVESHVRPKCFKL
             VKE+  + ++ +                    K+ SE  ++ F P+CH+CGV  H+RP+CF+L
Subjt:  -----VKESPNMPKLVA-------------------PKVVSEHAKINFVPICHYCGVESHVRPKCFKL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAAAACCGGATCATATTAGAACCGATTGTCCTCTTCTTAAATCATCCAAGAAATCTAAGAAGAAAGCAATGAAGACTACTCGGGATGATAGTGATGAAAGTGGAAG
TGAAAGTGAGAATGAAGAAGTGGCCAACTTTTGCTTCATGGCTCACAGTGACAAAGAGGATGAACAAGATGATGAGGTAAATCTTGACCCCCTTTCTTATGATGAGTTGT
TTGAAGCTTTTGAGAATATGCAAAATGATTTAGAAAAACTTGGTTCTAAATATGTTATACTTAAAAAGAAATGTAATGTTTTAACTAGTGAAAATAAGTCTTTACTTGAT
GGTATTGCTTGCTTAAAGAAAAATGAGCATGATGTTGTAAATATCTCTTGTAATAAGCATGTTCTTGATTGTGATGAGAAAAATGTCTTACTTGATAAAATTAGATTTCT
TGAGCATGATGGTTGTGAAAAAGATAATTTGATTAAATTGCTTAAGAAAAATGAATCAAATGCTTTAGTGGAACTTGATAAGGCTAAAGATTCTATTAAAAAGTTAACAA
TAGGTGCTCAAAGGTTGGACAAGATAATTGAAGTAGGTAAGCCTTATGGTGATAAAAGAGGTTTAGGCTATATTGATGAATGCTCTACTCCCTCAAGTACTAAAACTATC
TTTGTTAAAGAATCTCCTAATATGCCTAAGCTTGTTGCTCCTAAGGTTGTATCTGAACATGCTAAAATTAACTTTGTGCCTATATGTCATTATTGTGGTGTTGAAAGTCA
TGTAAGACCCAAGTGCTTTAAATTAAAATATGCTCAAACTACTTCTTCTAGAAGAAATTTCTCTCAAAGGGCAAAGTTTCACAATGCTCCAAGAAAGAATTTCTCCAAGA
AAAGTAGGATGCATAAATTTGTTGTAAAAGATAATTCATTGCATAATGTTGTTTGCTTTTCATGTAGCAATGATTTTTTAGAAAGAAATTTTGGGGATTTACTTATTAGT
GACAAAAGCAAAGAGATTGCTTCAAGTAAGCAAGAAGTGAGCATCGACGAAAATAAGGTCGACGGTTTTTCATCCATGCCTAAGGGGTGGAAGTATGCTCCATCCCATCC
TAAGGATTTAATTCTTGGTGATCCCGAACAAGGCTGTGCTGAAAATCTTGGTAGATATGTGATTGCCCACTCAATGAGGTATCTTTATTGCTGCCGCAACCGCCTCCCTC
TACCGCCGGTGTCGTCACTTTCTCTGCCGTCCATTGTCGTCGTCGCTTTCAGTATTGTCGTCGCCATGAATAGAAACTTTGCATGTGTTTCTGCTAAGGAAGAAGCATTT
GAGTGTTTTTCTTTGTCGTCTCCTGATCCAATTCAGGGATCAAATGGAATTACTGAAATGTTAGACGATTTGACTAATGAGGAATATCATGATCAAGATTACTCAACTGG
GATTGGATCAAGTGATAATAGTCAAAGTGGAATTGTGAATGAGTTGTATGCACTTGTCTGTGGCTCGGACTGTCAAGTAAATTCATATCAAGGGTGTGTTACTAATGGGG
TTCGGTTCAACACAAACGAGAGGGATGGCTGTTGCACCACTCAAAATAGTGGAGTCTGTGTATTTGGTGGAGATGATAATGAGATATCTGACTTCTACGGTATTATTAAG
GAAGTGATTGAATTGAAGTACATTAAAGACAAACAAGTTCTTCTTTTTAGTTGCGATCGGTATGATAAAACAAGCATAAACACGTGTCACTTATGGTATAAGGATGATCA
GTTTATACTTGTTTCTCAAGCACAACAAGTATTTTATGTTGATGATCTCCAATTAGGTAATGGATGGAAAGTAGCTCAAAGAATTCAACACAGACATTTATGGGATGTGC
CTAAAGTAGAAGAAATTGATTTAATCGAAACCGATATCAATCAATGTGCAGTGGATGAAGTGGATCTCGAGACACAAACATTTCATAGACCAGACATAGATCCAAGTATA
TTATCTGACAATACTGATAGAATGTCATCAGACAGCGAAGGTGAGGATACCATCGCTGGAATCACACGAACTCGAGGGGAAACTGGTGGCGAATTCTTGTTACGAGTCGT
AAATGTAGTTGGAAAAATCAAGCTCGAATGGACGGAGCAACAAGATAGACCAATTGGACCTGGGAGAAGTTTGTTGTCGATATATCTCAGCCACATGTGCATAAATATAT
TTTATACGAGATTGGAACTCGATTCAAAGATTATCGAGCGAAACTACATCGTCATTACAAAAAGTCACGTGATCCAGCACGTGCTTGCCAAAAGCCGTATAAAGACAGAA
AACAAGAAGACTGGAACATATTGTGTGATAGATGAGAATCTCCTGCATGGAAGAAATAAGATGGCACTTAGTAAATTGAGGTTCAATCATCGAGGTGGACCAAAACCATT
TCAGTGCATCGAGAAGATTCGAAAAGAAGATGGGACATATTTAAGTCTCATTCATATATTCTTCAATATGCATTATTCGGAGGAAAAAGGTTGGATTAACGAGGAAGCGA
GGAATTCATATGAAGAAATGATTACCTTAAAGGCGTATCATGCTTCTCAAGGAGACGAAAAAACAGAAGAAGAAATCATGGAGACGGTCCTTGGGAGAAGATCAAATTAT
ATAATAGGAATGAGATATGGACCGAAACCCACCAGAAACAAAGGATCATCATCAAAATACTCTGATGAATATGTGGAGTCATTAGAGGCTCGTCTTCAGAAGCATGAGGA
AGAATTGGCAACTCAACGAAAAGCGAACGAAGACCAACAAATTGCCACCCAAGAACAAATACAAAAAGCATTTGAGGCTCAAAGCCAGGAGTTTTCAAAGCCAGGAAAAG
GGAACGACGAAACCTTGTTTTCGTCCCTTCCGTGGAGACGAAAAACGTTTCGTCGCTTATGCGGCGACGAAAGGAAAGCGTCGCTAAGGCGACATTCTACCGTTTCGTCG
CTAAAATGGGGTTTCGTCGCTGAAGAATTTAGGATGATTAAGCTGAGGGCTGCAAAGACTGCTGAATTGTTGTTGGAACCTGAGATGAAGAGTTTCTATGCTGGAGCCTG
TGGGAACCTGTCGAATGCTCTAACATATGTGACTGCAGTAACTTATTGCCCCAGTTTTAACCGTGTTCGTACAGCTTCCAACTTGTTCAACAGATCGGTGACCTCATTTA
TGTGGGAGTTCACTGAGGTACTCTTCTCCATGTGGATGATGAAGTATCTTGTACTCGCCATGAGACTTGCCACATTCATCATCAAGCACAACATGATGATTGCTACCGCT
TGTTCATCCATCTCTGTCCACTCTTTATCTGACATCCAGTGGTCGGTCGTTCCTTCAAGATTTATGAATCTTCTTGTATGCTCAGTTTTGACTGCACCAATAGTAACTGA
TCAACAACTGCTCAGCCCAAACCAAGCTTCGTTTGTGATTGCTCCAAGATTGAGCAATCCTACGTGGTATTGCCTCCCAGTTGCTAGAGACACAGTCACGTGCTCTCACT
CGATCAAAGTGGTCCTTGATGCTTCCTTCTTCACTCACACCCAACAACTCCTAAAGAAAGATTTCTCTTTTGATGACTTGGCACTCTCACCCAAGTCTGTTGCCGCTCTA
GAGACATTGTCCTTGACTTCAGCACCAATAGTAGTCTCCTCCCTTGATGCACTCTTTCACCTACGCGACTTTATCAGCTCGCCAAACCGTTGGACAACTGGGGCTCGAAA
CTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAAAACCGGATCATATTAGAACCGATTGTCCTCTTCTTAAATCATCCAAGAAATCTAAGAAGAAAGCAATGAAGACTACTCGGGATGATAGTGATGAAAGTGGAAG
TGAAAGTGAGAATGAAGAAGTGGCCAACTTTTGCTTCATGGCTCACAGTGACAAAGAGGATGAACAAGATGATGAGGTAAATCTTGACCCCCTTTCTTATGATGAGTTGT
TTGAAGCTTTTGAGAATATGCAAAATGATTTAGAAAAACTTGGTTCTAAATATGTTATACTTAAAAAGAAATGTAATGTTTTAACTAGTGAAAATAAGTCTTTACTTGAT
GGTATTGCTTGCTTAAAGAAAAATGAGCATGATGTTGTAAATATCTCTTGTAATAAGCATGTTCTTGATTGTGATGAGAAAAATGTCTTACTTGATAAAATTAGATTTCT
TGAGCATGATGGTTGTGAAAAAGATAATTTGATTAAATTGCTTAAGAAAAATGAATCAAATGCTTTAGTGGAACTTGATAAGGCTAAAGATTCTATTAAAAAGTTAACAA
TAGGTGCTCAAAGGTTGGACAAGATAATTGAAGTAGGTAAGCCTTATGGTGATAAAAGAGGTTTAGGCTATATTGATGAATGCTCTACTCCCTCAAGTACTAAAACTATC
TTTGTTAAAGAATCTCCTAATATGCCTAAGCTTGTTGCTCCTAAGGTTGTATCTGAACATGCTAAAATTAACTTTGTGCCTATATGTCATTATTGTGGTGTTGAAAGTCA
TGTAAGACCCAAGTGCTTTAAATTAAAATATGCTCAAACTACTTCTTCTAGAAGAAATTTCTCTCAAAGGGCAAAGTTTCACAATGCTCCAAGAAAGAATTTCTCCAAGA
AAAGTAGGATGCATAAATTTGTTGTAAAAGATAATTCATTGCATAATGTTGTTTGCTTTTCATGTAGCAATGATTTTTTAGAAAGAAATTTTGGGGATTTACTTATTAGT
GACAAAAGCAAAGAGATTGCTTCAAGTAAGCAAGAAGTGAGCATCGACGAAAATAAGGTCGACGGTTTTTCATCCATGCCTAAGGGGTGGAAGTATGCTCCATCCCATCC
TAAGGATTTAATTCTTGGTGATCCCGAACAAGGCTGTGCTGAAAATCTTGGTAGATATGTGATTGCCCACTCAATGAGGTATCTTTATTGCTGCCGCAACCGCCTCCCTC
TACCGCCGGTGTCGTCACTTTCTCTGCCGTCCATTGTCGTCGTCGCTTTCAGTATTGTCGTCGCCATGAATAGAAACTTTGCATGTGTTTCTGCTAAGGAAGAAGCATTT
GAGTGTTTTTCTTTGTCGTCTCCTGATCCAATTCAGGGATCAAATGGAATTACTGAAATGTTAGACGATTTGACTAATGAGGAATATCATGATCAAGATTACTCAACTGG
GATTGGATCAAGTGATAATAGTCAAAGTGGAATTGTGAATGAGTTGTATGCACTTGTCTGTGGCTCGGACTGTCAAGTAAATTCATATCAAGGGTGTGTTACTAATGGGG
TTCGGTTCAACACAAACGAGAGGGATGGCTGTTGCACCACTCAAAATAGTGGAGTCTGTGTATTTGGTGGAGATGATAATGAGATATCTGACTTCTACGGTATTATTAAG
GAAGTGATTGAATTGAAGTACATTAAAGACAAACAAGTTCTTCTTTTTAGTTGCGATCGGTATGATAAAACAAGCATAAACACGTGTCACTTATGGTATAAGGATGATCA
GTTTATACTTGTTTCTCAAGCACAACAAGTATTTTATGTTGATGATCTCCAATTAGGTAATGGATGGAAAGTAGCTCAAAGAATTCAACACAGACATTTATGGGATGTGC
CTAAAGTAGAAGAAATTGATTTAATCGAAACCGATATCAATCAATGTGCAGTGGATGAAGTGGATCTCGAGACACAAACATTTCATAGACCAGACATAGATCCAAGTATA
TTATCTGACAATACTGATAGAATGTCATCAGACAGCGAAGGTGAGGATACCATCGCTGGAATCACACGAACTCGAGGGGAAACTGGTGGCGAATTCTTGTTACGAGTCGT
AAATGTAGTTGGAAAAATCAAGCTCGAATGGACGGAGCAACAAGATAGACCAATTGGACCTGGGAGAAGTTTGTTGTCGATATATCTCAGCCACATGTGCATAAATATAT
TTTATACGAGATTGGAACTCGATTCAAAGATTATCGAGCGAAACTACATCGTCATTACAAAAAGTCACGTGATCCAGCACGTGCTTGCCAAAAGCCGTATAAAGACAGAA
AACAAGAAGACTGGAACATATTGTGTGATAGATGAGAATCTCCTGCATGGAAGAAATAAGATGGCACTTAGTAAATTGAGGTTCAATCATCGAGGTGGACCAAAACCATT
TCAGTGCATCGAGAAGATTCGAAAAGAAGATGGGACATATTTAAGTCTCATTCATATATTCTTCAATATGCATTATTCGGAGGAAAAAGGTTGGATTAACGAGGAAGCGA
GGAATTCATATGAAGAAATGATTACCTTAAAGGCGTATCATGCTTCTCAAGGAGACGAAAAAACAGAAGAAGAAATCATGGAGACGGTCCTTGGGAGAAGATCAAATTAT
ATAATAGGAATGAGATATGGACCGAAACCCACCAGAAACAAAGGATCATCATCAAAATACTCTGATGAATATGTGGAGTCATTAGAGGCTCGTCTTCAGAAGCATGAGGA
AGAATTGGCAACTCAACGAAAAGCGAACGAAGACCAACAAATTGCCACCCAAGAACAAATACAAAAAGCATTTGAGGCTCAAAGCCAGGAGTTTTCAAAGCCAGGAAAAG
GGAACGACGAAACCTTGTTTTCGTCCCTTCCGTGGAGACGAAAAACGTTTCGTCGCTTATGCGGCGACGAAAGGAAAGCGTCGCTAAGGCGACATTCTACCGTTTCGTCG
CTAAAATGGGGTTTCGTCGCTGAAGAATTTAGGATGATTAAGCTGAGGGCTGCAAAGACTGCTGAATTGTTGTTGGAACCTGAGATGAAGAGTTTCTATGCTGGAGCCTG
TGGGAACCTGTCGAATGCTCTAACATATGTGACTGCAGTAACTTATTGCCCCAGTTTTAACCGTGTTCGTACAGCTTCCAACTTGTTCAACAGATCGGTGACCTCATTTA
TGTGGGAGTTCACTGAGGTACTCTTCTCCATGTGGATGATGAAGTATCTTGTACTCGCCATGAGACTTGCCACATTCATCATCAAGCACAACATGATGATTGCTACCGCT
TGTTCATCCATCTCTGTCCACTCTTTATCTGACATCCAGTGGTCGGTCGTTCCTTCAAGATTTATGAATCTTCTTGTATGCTCAGTTTTGACTGCACCAATAGTAACTGA
TCAACAACTGCTCAGCCCAAACCAAGCTTCGTTTGTGATTGCTCCAAGATTGAGCAATCCTACGTGGTATTGCCTCCCAGTTGCTAGAGACACAGTCACGTGCTCTCACT
CGATCAAAGTGGTCCTTGATGCTTCCTTCTTCACTCACACCCAACAACTCCTAAAGAAAGATTTCTCTTTTGATGACTTGGCACTCTCACCCAAGTCTGTTGCCGCTCTA
GAGACATTGTCCTTGACTTCAGCACCAATAGTAGTCTCCTCCCTTGATGCACTCTTTCACCTACGCGACTTTATCAGCTCGCCAAACCGTTGGACAACTGGGGCTCGAAA
CTGA
Protein sequenceShow/hide protein sequence
MKKPDHIRTDCPLLKSSKKSKKKAMKTTRDDSDESGSESENEEVANFCFMAHSDKEDEQDDEVNLDPLSYDELFEAFENMQNDLEKLGSKYVILKKKCNVLTSENKSLLD
GIACLKKNEHDVVNISCNKHVLDCDEKNVLLDKIRFLEHDGCEKDNLIKLLKKNESNALVELDKAKDSIKKLTIGAQRLDKIIEVGKPYGDKRGLGYIDECSTPSSTKTI
FVKESPNMPKLVAPKVVSEHAKINFVPICHYCGVESHVRPKCFKLKYAQTTSSRRNFSQRAKFHNAPRKNFSKKSRMHKFVVKDNSLHNVVCFSCSNDFLERNFGDLLIS
DKSKEIASSKQEVSIDENKVDGFSSMPKGWKYAPSHPKDLILGDPEQGCAENLGRYVIAHSMRYLYCCRNRLPLPPVSSLSLPSIVVVAFSIVVAMNRNFACVSAKEEAF
ECFSLSSPDPIQGSNGITEMLDDLTNEEYHDQDYSTGIGSSDNSQSGIVNELYALVCGSDCQVNSYQGCVTNGVRFNTNERDGCCTTQNSGVCVFGGDDNEISDFYGIIK
EVIELKYIKDKQVLLFSCDRYDKTSINTCHLWYKDDQFILVSQAQQVFYVDDLQLGNGWKVAQRIQHRHLWDVPKVEEIDLIETDINQCAVDEVDLETQTFHRPDIDPSI
LSDNTDRMSSDSEGEDTIAGITRTRGETGGEFLLRVVNVVGKIKLEWTEQQDRPIGPGRSLLSIYLSHMCINIFYTRLELDSKIIERNYIVITKSHVIQHVLAKSRIKTE
NKKTGTYCVIDENLLHGRNKMALSKLRFNHRGGPKPFQCIEKIRKEDGTYLSLIHIFFNMHYSEEKGWINEEARNSYEEMITLKAYHASQGDEKTEEEIMETVLGRRSNY
IIGMRYGPKPTRNKGSSSKYSDEYVESLEARLQKHEEELATQRKANEDQQIATQEQIQKAFEAQSQEFSKPGKGNDETLFSSLPWRRKTFRRLCGDERKASLRRHSTVSS
LKWGFVAEEFRMIKLRAAKTAELLLEPEMKSFYAGACGNLSNALTYVTAVTYCPSFNRVRTASNLFNRSVTSFMWEFTEVLFSMWMMKYLVLAMRLATFIIKHNMMIATA
CSSISVHSLSDIQWSVVPSRFMNLLVCSVLTAPIVTDQQLLSPNQASFVIAPRLSNPTWYCLPVARDTVTCSHSIKVVLDASFFTHTQQLLKKDFSFDDLALSPKSVAAL
ETLSLTSAPIVVSSLDALFHLRDFISSPNRWTTGARN