; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr020832 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr020832
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description14 kDa zinc-binding protein-like
Genome locationtig00153574:407070..421946
RNA-Seq ExpressionSgr020832
SyntenySgr020832
Gene Ontology termsGO:0005737 - cytoplasm (cellular component)
GO:0047627 - adenylylsulfatase activity (molecular function)
InterPro domainsIPR001310 - Histidine triad (HIT) protein
IPR011146 - HIT-like domain
IPR019808 - Histidine triad, conserved site
IPR036265 - HIT-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004145437.1 14 kDa zinc-binding protein isoform X2 [Cucumis sativus]2.2e-6592.09Show/hide
Query:  LTMRASITNNEEAAAKAATTDAGSGAPTIFDKIIAKEIPSNIVYEDEKVLAFRDVNPQAPVHILIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEK
        LT+RA ITNNEE AAKAA +DA SGAPTIFDKII+KEIPSNIVYED+KVLAFRD+NPQAPVH+LIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEK
Subjt:  LTMRASITNNEEAAAKAATTDAGSGAPTIFDKIIAKEIPSNIVYEDEKVLAFRDVNPQAPVHILIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEK

Query:  EGILEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
        EGI+EGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
Subjt:  EGILEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG

XP_011660307.1 14 kDa zinc-binding protein isoform X1 [Cucumis sativus]2.2e-6592.09Show/hide
Query:  LTMRASITNNEEAAAKAATTDAGSGAPTIFDKIIAKEIPSNIVYEDEKVLAFRDVNPQAPVHILIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEK
        LT+RA ITNNEE AAKAA +DA SGAPTIFDKII+KEIPSNIVYED+KVLAFRD+NPQAPVH+LIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEK
Subjt:  LTMRASITNNEEAAAKAATTDAGSGAPTIFDKIIAKEIPSNIVYEDEKVLAFRDVNPQAPVHILIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEK

Query:  EGILEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
        EGI+EGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
Subjt:  EGILEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG

XP_022142542.1 14 kDa zinc-binding protein-like isoform X1 [Momordica charantia]3.8e-6592.81Show/hide
Query:  LTMRASITNNEEAAAKAATTDAGSGAPTIFDKIIAKEIPSNIVYEDEKVLAFRDVNPQAPVHILIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEK
        L +RASITNNEE AAKAA +DA SGAPTIFDKIIAKEIPSNIVYEDEKVLAFRD+NPQAPVHILIIPKLRDGLTELGKAEARHGEILG+LLYAAKIVAEK
Subjt:  LTMRASITNNEEAAAKAATTDAGSGAPTIFDKIIAKEIPSNIVYEDEKVLAFRDVNPQAPVHILIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEK

Query:  EGILEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
        EGI+EGFRVVINSGA+ACQSVYHLHLHVLGGRQMKWPPG
Subjt:  EGILEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG

XP_022927152.1 14 kDa zinc-binding protein-like [Cucurbita moschata]1.7e-6593.53Show/hide
Query:  LTMRASITNNEEAAAKAATTDAGSGAPTIFDKIIAKEIPSNIVYEDEKVLAFRDVNPQAPVHILIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEK
        LT+RAS TNNEE AAKAA +DA SGAPTIFDKIIAKEIPSNIVYEDEKVLAFRD+NPQAPVHILIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEK
Subjt:  LTMRASITNNEEAAAKAATTDAGSGAPTIFDKIIAKEIPSNIVYEDEKVLAFRDVNPQAPVHILIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEK

Query:  EGILEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
        EGI+EGFRVVINSGASACQSVYHLHLHVLGGRQ+KWPPG
Subjt:  EGILEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG

XP_023519175.1 14 kDa zinc-binding protein-like [Cucurbita pepo subsp. pepo]2.2e-6592.81Show/hide
Query:  LTMRASITNNEEAAAKAATTDAGSGAPTIFDKIIAKEIPSNIVYEDEKVLAFRDVNPQAPVHILIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEK
        LT+RAS TNNEE AAKAA +DA SGAPTIFDKIIAKEIPSNIVYEDEKVLAFRD+NPQAPVHILIIPKLRDGLTELGKAEARHGEILGQLLYAAK+VAEK
Subjt:  LTMRASITNNEEAAAKAATTDAGSGAPTIFDKIIAKEIPSNIVYEDEKVLAFRDVNPQAPVHILIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEK

Query:  EGILEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
        EGI+EGFRVVINSGASACQSVYHLHLHVLGGRQ+KWPPG
Subjt:  EGILEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG

TrEMBL top hitse value%identityAlignment
A0A0A0M163 HIT domain-containing protein1.1e-6592.09Show/hide
Query:  LTMRASITNNEEAAAKAATTDAGSGAPTIFDKIIAKEIPSNIVYEDEKVLAFRDVNPQAPVHILIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEK
        LT+RA ITNNEE AAKAA +DA SGAPTIFDKII+KEIPSNIVYED+KVLAFRD+NPQAPVH+LIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEK
Subjt:  LTMRASITNNEEAAAKAATTDAGSGAPTIFDKIIAKEIPSNIVYEDEKVLAFRDVNPQAPVHILIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEK

Query:  EGILEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
        EGI+EGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
Subjt:  EGILEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG

A0A5A7TIN3 14 kDa zinc-binding protein isoform X12.4e-6591.37Show/hide
Query:  LTMRASITNNEEAAAKAATTDAGSGAPTIFDKIIAKEIPSNIVYEDEKVLAFRDVNPQAPVHILIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEK
        LT+RA ITNNEE AAKAA +DA SGAPTIFDKII+KEIPSNIVYED+KVLAFRD+NPQAPVH+LIIPKLRDGLTELGKAEARHGEILG+LLYAAKIVAEK
Subjt:  LTMRASITNNEEAAAKAATTDAGSGAPTIFDKIIAKEIPSNIVYEDEKVLAFRDVNPQAPVHILIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEK

Query:  EGILEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
        EGI+EGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
Subjt:  EGILEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG

A0A5D3CK60 14 kDa zinc-binding protein isoform X22.4e-6591.37Show/hide
Query:  LTMRASITNNEEAAAKAATTDAGSGAPTIFDKIIAKEIPSNIVYEDEKVLAFRDVNPQAPVHILIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEK
        LT+RA ITNNEE AAKAA +DA SGAPTIFDKII+KEIPSNIVYED+KVLAFRD+NPQAPVH+LIIPKLRDGLTELGKAEARHGEILG+LLYAAKIVAEK
Subjt:  LTMRASITNNEEAAAKAATTDAGSGAPTIFDKIIAKEIPSNIVYEDEKVLAFRDVNPQAPVHILIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEK

Query:  EGILEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
        EGI+EGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
Subjt:  EGILEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG

A0A6J1CN26 14 kDa zinc-binding protein-like isoform X11.8e-6592.81Show/hide
Query:  LTMRASITNNEEAAAKAATTDAGSGAPTIFDKIIAKEIPSNIVYEDEKVLAFRDVNPQAPVHILIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEK
        L +RASITNNEE AAKAA +DA SGAPTIFDKIIAKEIPSNIVYEDEKVLAFRD+NPQAPVHILIIPKLRDGLTELGKAEARHGEILG+LLYAAKIVAEK
Subjt:  LTMRASITNNEEAAAKAATTDAGSGAPTIFDKIIAKEIPSNIVYEDEKVLAFRDVNPQAPVHILIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEK

Query:  EGILEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
        EGI+EGFRVVINSGA+ACQSVYHLHLHVLGGRQMKWPPG
Subjt:  EGILEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG

A0A6J1EGC5 14 kDa zinc-binding protein-like8.2e-6693.53Show/hide
Query:  LTMRASITNNEEAAAKAATTDAGSGAPTIFDKIIAKEIPSNIVYEDEKVLAFRDVNPQAPVHILIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEK
        LT+RAS TNNEE AAKAA +DA SGAPTIFDKIIAKEIPSNIVYEDEKVLAFRD+NPQAPVHILIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEK
Subjt:  LTMRASITNNEEAAAKAATTDAGSGAPTIFDKIIAKEIPSNIVYEDEKVLAFRDVNPQAPVHILIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEK

Query:  EGILEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
        EGI+EGFRVVINSGASACQSVYHLHLHVLGGRQ+KWPPG
Subjt:  EGILEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG

SwissProt top hitse value%identityAlignment
P42855 14 kDa zinc-binding protein (Fragment)2.5e-4371.43Show/hide
Query:  TIFDKIIAKEIPSNIVYEDEKVLAFRDVNPQAPVHILIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGILEGFRVVINSGASACQSVYHLHLH
        TIF KII+KEIPS +VYED+KVLAFRD+ PQ PVHIL+IPK+RDGLT L KAE RH +ILG+LLY AK+VA++EG+ EGFR+VIN G   CQSVYH+H+H
Subjt:  TIFDKIIAKEIPSNIVYEDEKVLAFRDVNPQAPVHILIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGILEGFRVVINSGASACQSVYHLHLH

Query:  VLGGRQMKWPPG
        ++GGRQM WPPG
Subjt:  VLGGRQMKWPPG

P42856 14 kDa zinc-binding protein1.8e-4675.44Show/hide
Query:  APTIFDKIIAKEIPSNIVYEDEKVLAFRDVNPQAPVHILIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGILEGFRVVINSGASACQSVYHLH
        +PTIFDKII KEIPS +VYEDEKVLAFRD+NPQAP HILIIPK++DGLT L KAE RH EILG LLY AK+VA++EG+ +G+RVVIN G S CQSVYH+H
Subjt:  APTIFDKIIAKEIPSNIVYEDEKVLAFRDVNPQAPVHILIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGILEGFRVVINSGASACQSVYHLH

Query:  LHVLGGRQMKWPPG
        +H+LGGRQM WPPG
Subjt:  LHVLGGRQMKWPPG

Q8GUN2 Adenylylsulfatase HINT16.3e-4761.18Show/hide
Query:  SHSLSQVDNSFSILTMRASITNNEEAAAKAATTDAGSGAPTIFDKIIAKEIPSNIVYEDEKVLAFRDVNPQAPVHILIIPKLRDGLTELGKAEARHGEIL
        SH +S + + FS     +++  +E+ AA AAT    S +PTIFDKII+KEIPS +V+ED+KVLAFRD+ PQ PVHIL+IPK+RDGLT L KAE RH +IL
Subjt:  SHSLSQVDNSFSILTMRASITNNEEAAAKAATTDAGSGAPTIFDKIIAKEIPSNIVYEDEKVLAFRDVNPQAPVHILIIPKLRDGLTELGKAEARHGEIL

Query:  GQLLYAAKIVAEKEGILEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
        G+LLY AK+VA++EG+ EGFR+VIN G   CQSVYH+H+H++GGRQM WPPG
Subjt:  GQLLYAAKIVAEKEGILEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG

Q8SQ21 Histidine triad nucleotide-binding protein 2, mitochondrial2.0e-3251.11Show/hide
Query:  ASITNNEEAAAKAATTDAGSGAPTIFDKIIAKEIPSNIVYEDEKVLAFRDVNPQAPVHILIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIL
        A +T+ +E  AKA     G  APTIF +I+ + +P++I+YED++ LAFRDV PQAPVH L+IPK    +  + +AE    ++LG LL  AK  A+ EG+ 
Subjt:  ASITNNEEAAAKAATTDAGSGAPTIFDKIIAKEIPSNIVYEDEKVLAFRDVNPQAPVHILIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIL

Query:  EGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
        +G+R+VIN G    QSVYHLH+HVLGGRQ++WPPG
Subjt:  EGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG

Q9BX68 Histidine triad nucleotide-binding protein 2, mitochondrial3.4e-3251.11Show/hide
Query:  ASITNNEEAAAKAATTDAGSGAPTIFDKIIAKEIPSNIVYEDEKVLAFRDVNPQAPVHILIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIL
        A +T+  E  AKA     G  APTIF +I+ K +P++I+YED++ L FRDV PQAPVH L+IPK    +  + +AE    ++LG LL  AK  A+ EG+ 
Subjt:  ASITNNEEAAAKAATTDAGSGAPTIFDKIIAKEIPSNIVYEDEKVLAFRDVNPQAPVHILIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGIL

Query:  EGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
        +G+R+VIN G    QSVYHLH+HVLGGRQ++WPPG
Subjt:  EGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG

Arabidopsis top hitse value%identityAlignment
AT1G31160.1 HISTIDINE TRIAD NUCLEOTIDE-BINDING 21.1e-5983.97Show/hide
Query:  NNEEAAAKAATTDAGSGAPTIFDKIIAKEIPSNIVYEDEKVLAFRDVNPQAPVHILIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGILEGFR
        +NEEAAAKAA + A +GAPTIFDKIIAKEIPS+IVYEDE VLAFRD+NPQAPVH+L+IPKLRDGLT LGKAE RH E+LGQLL+A+KIVAEKEGIL+GFR
Subjt:  NNEEAAAKAATTDAGSGAPTIFDKIIAKEIPSNIVYEDEKVLAFRDVNPQAPVHILIIPKLRDGLTELGKAEARHGEILGQLLYAAKIVAEKEGILEGFR

Query:  VVINSGASACQSVYHLHLHVLGGRQMKWPPG
        VVIN+G  ACQSVYHLHLHVLGGRQMKWPPG
Subjt:  VVINSGASACQSVYHLHLHVLGGRQMKWPPG

AT3G56490.1 HIS triad family protein 34.5e-4861.18Show/hide
Query:  SHSLSQVDNSFSILTMRASITNNEEAAAKAATTDAGSGAPTIFDKIIAKEIPSNIVYEDEKVLAFRDVNPQAPVHILIIPKLRDGLTELGKAEARHGEIL
        SH +S + + FS     +++  +E+ AA AAT    S +PTIFDKII+KEIPS +V+ED+KVLAFRD+ PQ PVHIL+IPK+RDGLT L KAE RH +IL
Subjt:  SHSLSQVDNSFSILTMRASITNNEEAAAKAATTDAGSGAPTIFDKIIAKEIPSNIVYEDEKVLAFRDVNPQAPVHILIIPKLRDGLTELGKAEARHGEIL

Query:  GQLLYAAKIVAEKEGILEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG
        G+LLY AK+VA++EG+ EGFR+VIN G   CQSVYH+H+H++GGRQM WPPG
Subjt:  GQLLYAAKIVAEKEGILEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPG

AT4G16566.1 histidine triad nucleotide-binding 42.3e-0427.78Show/hide
Query:  AGSGAPTIFDKIIAKEIPSNIVYEDEKVLAFRDVNPQAPVHILIIPKLRDGLTELGKAEARHGE--ILGQLLYAAKIVAEKEGILEGFRVVINSGASACQ
        AG     IF +I+     + +++ DEKV+AF+D+ P A  H L+IPK  + +  +   + R  +  ++  +L   + + +K+      R   +       
Subjt:  AGSGAPTIFDKIIAKEIPSNIVYEDEKVLAFRDVNPQAPVHILIIPKLRDGLTELGKAEARHGE--ILGQLLYAAKIVAEKEGILEGFRVVINSGASACQ

Query:  SVYHLHLH
        SV HLHLH
Subjt:  SVYHLHLH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATGGCTACGACTACAAGAACGATGGTGACTGCGAAATTGAAACATTCCAGCAACCACTTCACGAGCTTTCTCCTCCCACTTCGCACTCGCTCTCGCAGGTAGACAA
CTCGTTTTCTATATTGACGATGCGTGCTAGTATCACAAACAACGAAGAAGCTGCTGCTAAAGCAGCTACAACTGATGCAGGCAGTGGAGCTCCGACCATATTTGACAAAA
TAATAGCTAAGGAAATTCCTTCAAACATTGTATATGAGGATGAGAAAGTCCTAGCATTTCGAGACGTAAATCCACAGGCTCCTGTTCATATATTAATCATTCCAAAGCTC
AGAGATGGATTGACAGAGCTGGGGAAGGCTGAAGCAAGACATGGGGAGATACTGGGTCAGCTCCTCTATGCAGCCAAAATAGTGGCTGAGAAAGAGGGTATTCTTGAAGG
ATTTCGTGTAGTCATCAACAGTGGTGCAAGTGCCTGCCAATCTGTATATCATCTTCACTTGCATGTCCTGGGTGGGAGGCAGATGAAGTGGCCTCCAGGTCGGATAGGGT
TCACGATCAAGCTTAGGGACGAGAAGAAAGAACAGAAACCATTCCGCCATGTGCTGGTGGCCTGGATACTTAAGCTCCGTGACTTTGCCGACGACTTTCCCTCCGCGCGC
TATGCCGTGGTGCTCAATTACACCGTACGTGAATCTCTTCGGCACATCGTAGAGGTAAATCTTACAGGCGAAACCAATTTCCCTGAACTATTGAGGAGATCACTATCTGG
GGAACCAAACTCATCCGAAACCATAACAGAATTGGCAGTGCCGAAGGGGAAGGAGCGGTCAAGCTTGAGAGTGGCCGTGGAGCTGATGAAGGTGTTGACAATAGCATATA
GGGCGAGAATGAAACACAGAGAAGCTAGGGTTCGCTTCAGGAGAGAGGATTTTCGCGCCATTGTTGGAAATAGAACTCGGCTTGCAGAATTTTGCAGGACGACAGATAAG
TTTAGGACTGAGGAAGCGATACTTGGTTTAATGGCGCAATGTGAGGCAGGGCGCGTGAACCACACGTCCAATGGTAGACGAGGAGGAGGAAGGAGCTCTGGCTTGACAAC
GGAACCTTCTCGATTGGCATTTCGGAAAAACGAGCTCACTGTGGACTCGGCTTCTTCATCTTCTTTCGGCCCGAAGTTCACTGAGCGAGCATCAAATTTCAACTCGACAG
GGCAGAGACCATCATACAGTTCTCTACTAGTGTCATTTTCAGACTTGTCGCTGGATGTAGGCTTCCCCCCTTCAAATTTGAATATCAAGGAATCAAATACATCAGCACGA
TTCATCAAGTTTGCTTGTGTATTATTCGAAAGGAATCACCAGCTAAACCCGAGGCCGCAAGGGGTTAACGGCAATCCCAAAAATCTGCGGAGAGTGGAAGCCGCATCGCC
GTCGCGTTGCTGCTCGCCGAAGAGTGAAGCGACAGAGGAAGAAGTAGAGGGAACGATGGGCGACGAAGTGAGAAGAAATCTGCGGAGATTGAGAATGGGAGGAGGAGAGA
GAGAAGGCAACAGAGAAGGCGATGAAAAGAGGCGGGTGAGGGAGGTGGGAACAGGGTTGAAGACGAGTGGACCGATCGAATCTTTAACAGAAGGACCGGAGATTATGAGG
GGGGACTTGAAAACTGCGCCGATTGGGAAGAGGGACTTCCATTCTTCTTCGGGCATATTGCGAACTAACCTTTGGAAGTTGCAGAAGCCACCGAGATCGAAGATGAAAAC
ATAG
mRNA sequenceShow/hide mRNA sequence
ATGTATGGCTACGACTACAAGAACGATGGTGACTGCGAAATTGAAACATTCCAGCAACCACTTCACGAGCTTTCTCCTCCCACTTCGCACTCGCTCTCGCAGGTAGACAA
CTCGTTTTCTATATTGACGATGCGTGCTAGTATCACAAACAACGAAGAAGCTGCTGCTAAAGCAGCTACAACTGATGCAGGCAGTGGAGCTCCGACCATATTTGACAAAA
TAATAGCTAAGGAAATTCCTTCAAACATTGTATATGAGGATGAGAAAGTCCTAGCATTTCGAGACGTAAATCCACAGGCTCCTGTTCATATATTAATCATTCCAAAGCTC
AGAGATGGATTGACAGAGCTGGGGAAGGCTGAAGCAAGACATGGGGAGATACTGGGTCAGCTCCTCTATGCAGCCAAAATAGTGGCTGAGAAAGAGGGTATTCTTGAAGG
ATTTCGTGTAGTCATCAACAGTGGTGCAAGTGCCTGCCAATCTGTATATCATCTTCACTTGCATGTCCTGGGTGGGAGGCAGATGAAGTGGCCTCCAGGTCGGATAGGGT
TCACGATCAAGCTTAGGGACGAGAAGAAAGAACAGAAACCATTCCGCCATGTGCTGGTGGCCTGGATACTTAAGCTCCGTGACTTTGCCGACGACTTTCCCTCCGCGCGC
TATGCCGTGGTGCTCAATTACACCGTACGTGAATCTCTTCGGCACATCGTAGAGGTAAATCTTACAGGCGAAACCAATTTCCCTGAACTATTGAGGAGATCACTATCTGG
GGAACCAAACTCATCCGAAACCATAACAGAATTGGCAGTGCCGAAGGGGAAGGAGCGGTCAAGCTTGAGAGTGGCCGTGGAGCTGATGAAGGTGTTGACAATAGCATATA
GGGCGAGAATGAAACACAGAGAAGCTAGGGTTCGCTTCAGGAGAGAGGATTTTCGCGCCATTGTTGGAAATAGAACTCGGCTTGCAGAATTTTGCAGGACGACAGATAAG
TTTAGGACTGAGGAAGCGATACTTGGTTTAATGGCGCAATGTGAGGCAGGGCGCGTGAACCACACGTCCAATGGTAGACGAGGAGGAGGAAGGAGCTCTGGCTTGACAAC
GGAACCTTCTCGATTGGCATTTCGGAAAAACGAGCTCACTGTGGACTCGGCTTCTTCATCTTCTTTCGGCCCGAAGTTCACTGAGCGAGCATCAAATTTCAACTCGACAG
GGCAGAGACCATCATACAGTTCTCTACTAGTGTCATTTTCAGACTTGTCGCTGGATGTAGGCTTCCCCCCTTCAAATTTGAATATCAAGGAATCAAATACATCAGCACGA
TTCATCAAGTTTGCTTGTGTATTATTCGAAAGGAATCACCAGCTAAACCCGAGGCCGCAAGGGGTTAACGGCAATCCCAAAAATCTGCGGAGAGTGGAAGCCGCATCGCC
GTCGCGTTGCTGCTCGCCGAAGAGTGAAGCGACAGAGGAAGAAGTAGAGGGAACGATGGGCGACGAAGTGAGAAGAAATCTGCGGAGATTGAGAATGGGAGGAGGAGAGA
GAGAAGGCAACAGAGAAGGCGATGAAAAGAGGCGGGTGAGGGAGGTGGGAACAGGGTTGAAGACGAGTGGACCGATCGAATCTTTAACAGAAGGACCGGAGATTATGAGG
GGGGACTTGAAAACTGCGCCGATTGGGAAGAGGGACTTCCATTCTTCTTCGGGCATATTGCGAACTAACCTTTGGAAGTTGCAGAAGCCACCGAGATCGAAGATGAAAAC
ATAG
Protein sequenceShow/hide protein sequence
MYGYDYKNDGDCEIETFQQPLHELSPPTSHSLSQVDNSFSILTMRASITNNEEAAAKAATTDAGSGAPTIFDKIIAKEIPSNIVYEDEKVLAFRDVNPQAPVHILIIPKL
RDGLTELGKAEARHGEILGQLLYAAKIVAEKEGILEGFRVVINSGASACQSVYHLHLHVLGGRQMKWPPGRIGFTIKLRDEKKEQKPFRHVLVAWILKLRDFADDFPSAR
YAVVLNYTVRESLRHIVEVNLTGETNFPELLRRSLSGEPNSSETITELAVPKGKERSSLRVAVELMKVLTIAYRARMKHREARVRFRREDFRAIVGNRTRLAEFCRTTDK
FRTEEAILGLMAQCEAGRVNHTSNGRRGGGRSSGLTTEPSRLAFRKNELTVDSASSSSFGPKFTERASNFNSTGQRPSYSSLLVSFSDLSLDVGFPPSNLNIKESNTSAR
FIKFACVLFERNHQLNPRPQGVNGNPKNLRRVEAASPSRCCSPKSEATEEEVEGTMGDEVRRNLRRLRMGGGEREGNREGDEKRRVREVGTGLKTSGPIESLTEGPEIMR
GDLKTAPIGKRDFHSSSGILRTNLWKLQKPPRSKMKT