; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr029129 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr029129
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein LSM12-like protein A
Genome locationtig00153210:3478630..3486693
RNA-Seq ExpressionSgr029129
SyntenySgr029129
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0030198 - extracellular matrix organization (biological process)
GO:0030574 - collagen catabolic process (biological process)
GO:0031012 - extracellular matrix (cellular component)
GO:0004222 - metalloendopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR019181 - Anticodon-binding domain
IPR039683 - Protein Lsm12-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0034336.1 protein LSM12-like protein A [Cucumis melo var. makuwa]2.7e-8289.56Show/hide
Query:  MALDGSGNGDDFAVGSFFSIKTTLGDEFQGQVITFDRPSNILEGSKPGPRWNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQAEAE
        MALDGSGNGDDF+VGSFFSIKTTLGDEFQGQVITFDRPSNILEGSKPGPR NIRLLKANYIKEFSFLGHGEDPLDLK CYLDLNTLRAREELAIRQAEAE
Subjt:  MALDGSGNGDDFAVGSFFSIKTTLGDEFQGQVITFDRPSNILEGSKPGPRWNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQAEAE

Query:  AERIGVGVTSEAQSIFDALSKTLPVRWDKTVIVVMNEVRVGSPYLPESVTGGTPAANERVKKVSNKADHLHLNEANLQVSRG
        AERIGVGVTSEAQSIFDALSKTLPVRWDKTVIVVMNEVRV SPYLP+SV+GGTPAANERVKKV      L L    LQV  G
Subjt:  AERIGVGVTSEAQSIFDALSKTLPVRWDKTVIVVMNEVRVGSPYLPESVTGGTPAANERVKKVSNKADHLHLNEANLQVSRG

KGN52415.2 hypothetical protein Csa_008581 [Cucumis sativus]5.9e-8296.93Show/hide
Query:  MALDGSGNGDDFAVGSFFSIKTTLGDEFQGQVITFDRPSNILEGSKPGPRWNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQAEAE
        MALDGSGNGDDF+VGSFFSIKTTLGDEFQGQVITFDRPSNILEGSKPGPR NIRLLKANYIKEFSFLGHGEDPLDLK CYLDLNTLRAREELAIRQAEAE
Subjt:  MALDGSGNGDDFAVGSFFSIKTTLGDEFQGQVITFDRPSNILEGSKPGPRWNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQAEAE

Query:  AERIGVGVTSEAQSIFDALSKTLPVRWDKTVIVVMNEVRVGSPYLPESVTGGTPAANERVKKV
        AERIGVGVTSEAQSIFDALSKTLPVRWDKTVIVVMNEVRV SPYLPESV+GGTPAANERVKKV
Subjt:  AERIGVGVTSEAQSIFDALSKTLPVRWDKTVIVVMNEVRVGSPYLPESVTGGTPAANERVKKV

TYK15583.1 protein LSM12-like protein A [Cucumis melo var. makuwa]5.9e-8290.5Show/hide
Query:  MALDGSGNGDDFAVGSFFSIKTTLGDEFQGQVITFDRPSNILEGSKPGPRWNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQAEAE
        MALDGSGNGDDF+VGSFFSIKTTLGDEFQGQVITFDRPSNILEGSKPGPR NIRLLKANYIKEFSFLGHGEDPLDLK CYLDLNTLRAREELAIRQAEAE
Subjt:  MALDGSGNGDDFAVGSFFSIKTTLGDEFQGQVITFDRPSNILEGSKPGPRWNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQAEAE

Query:  AERIGVGVTSEAQSIFDALSKTLPVRWDKTVIVVMNEVRVGSPYLPESVTGGTPAANERVKKVSNKADHLHLNEANLQV
        AERIGVGVTSEAQSIFDALSKTLPVRWDKTVIVVMNEVRV SPYLP+SV+GGTPAANERVKKV      L L    LQV
Subjt:  AERIGVGVTSEAQSIFDALSKTLPVRWDKTVIVVMNEVRVGSPYLPESVTGGTPAANERVKKVSNKADHLHLNEANLQV

XP_004135229.1 protein LSM12 homolog A [Cucumis sativus]2.5e-8095.18Show/hide
Query:  MALDGSGNGDDFAVGSFFSIKTTLGDEFQGQVITFDRPSNIL---EGSKPGPRWNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQA
        MALDGSGNGDDF+VGSFFSIKTTLGDEFQGQVITFDRPSNIL   EGSKPGPR NIRLLKANYIKEFSFLGHGEDPLDLK CYLDLNTLRAREELAIRQA
Subjt:  MALDGSGNGDDFAVGSFFSIKTTLGDEFQGQVITFDRPSNIL---EGSKPGPRWNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQA

Query:  EAEAERIGVGVTSEAQSIFDALSKTLPVRWDKTVIVVMNEVRVGSPYLPESVTGGTPAANERVKKV
        EAEAERIGVGVTSEAQSIFDALSKTLPVRWDKTVIVVMNEVRV SPYLPESV+GGTPAANERVKKV
Subjt:  EAEAERIGVGVTSEAQSIFDALSKTLPVRWDKTVIVVMNEVRVGSPYLPESVTGGTPAANERVKKV

XP_038892233.1 protein LSM12 homolog A-like [Benincasa hispida]1.1e-8089.56Show/hide
Query:  MALDGSGNGDDFAVGSFFSIKTTLGDEFQGQVITFDRPSNIL---EGSKPGPRWNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQA
        MALDGSGNGDDF+VGSFFSIKTTLGDEFQGQVITFDRPSNIL   EGSKPGPR NIRLLKANYIKEFSFLGHGEDPLDLK CYLDLNTLRAREELAIRQA
Subjt:  MALDGSGNGDDFAVGSFFSIKTTLGDEFQGQVITFDRPSNIL---EGSKPGPRWNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQA

Query:  EAEAERIGVGVTSEAQSIFDALSKTLPVRWDKTVIVVMNEVRVGSPYLPESVTGGTPAANERVKKVSNKADHLHLNEANLQV
        EAEAERIGVGVTSEAQSIFDALSKTLPVRWDKTVIVVMNEVRV SPYLPESV+GGTPAANERVKKV      L L    LQV
Subjt:  EAEAERIGVGVTSEAQSIFDALSKTLPVRWDKTVIVVMNEVRVGSPYLPESVTGGTPAANERVKKVSNKADHLHLNEANLQV

TrEMBL top hitse value%identityAlignment
A0A0A0KQA9 AD domain-containing protein1.2e-8095.18Show/hide
Query:  MALDGSGNGDDFAVGSFFSIKTTLGDEFQGQVITFDRPSNIL---EGSKPGPRWNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQA
        MALDGSGNGDDF+VGSFFSIKTTLGDEFQGQVITFDRPSNIL   EGSKPGPR NIRLLKANYIKEFSFLGHGEDPLDLK CYLDLNTLRAREELAIRQA
Subjt:  MALDGSGNGDDFAVGSFFSIKTTLGDEFQGQVITFDRPSNIL---EGSKPGPRWNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQA

Query:  EAEAERIGVGVTSEAQSIFDALSKTLPVRWDKTVIVVMNEVRVGSPYLPESVTGGTPAANERVKKV
        EAEAERIGVGVTSEAQSIFDALSKTLPVRWDKTVIVVMNEVRV SPYLPESV+GGTPAANERVKKV
Subjt:  EAEAERIGVGVTSEAQSIFDALSKTLPVRWDKTVIVVMNEVRVGSPYLPESVTGGTPAANERVKKV

A0A1S3BGE7 protein LSM12 homolog A1.5e-7879.13Show/hide
Query:  MALDGSGNGDDFAVGSFFSIKTTLGDEFQGQVITFDRPSNIL------------------------EGSKPGPRWNIRLLKANYIKEFSFLGHGEDPLDL
        MALDGSGNGDDF+VGSFFSIKTTLGDEFQGQVITFDRPSNIL                        EGSKPGPR NIRLLKANYIKEFSFLGHGEDPLDL
Subjt:  MALDGSGNGDDFAVGSFFSIKTTLGDEFQGQVITFDRPSNIL------------------------EGSKPGPRWNIRLLKANYIKEFSFLGHGEDPLDL

Query:  KKCYLDLNTLRAREELAIRQAEAEAERIGVGVTSEAQSIFDALSKTLPVRWDKTVIVVMNEVRVGSPYLPESVTGGTPAANERVKKVSNKADHLHLNEAN
        K CYLDLNTLRAREELAIRQAEAEAERIGVGVTSEAQSIFDALSKTLPVRWDKTVIVVMNEVRV SPYLP+SV+GGTPAANERVKKV      L L    
Subjt:  KKCYLDLNTLRAREELAIRQAEAEAERIGVGVTSEAQSIFDALSKTLPVRWDKTVIVVMNEVRVGSPYLPESVTGGTPAANERVKKVSNKADHLHLNEAN

Query:  LQVSRG
        LQV  G
Subjt:  LQVSRG

A0A5A7SWT7 Protein LSM12-like protein A1.3e-8289.56Show/hide
Query:  MALDGSGNGDDFAVGSFFSIKTTLGDEFQGQVITFDRPSNILEGSKPGPRWNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQAEAE
        MALDGSGNGDDF+VGSFFSIKTTLGDEFQGQVITFDRPSNILEGSKPGPR NIRLLKANYIKEFSFLGHGEDPLDLK CYLDLNTLRAREELAIRQAEAE
Subjt:  MALDGSGNGDDFAVGSFFSIKTTLGDEFQGQVITFDRPSNILEGSKPGPRWNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQAEAE

Query:  AERIGVGVTSEAQSIFDALSKTLPVRWDKTVIVVMNEVRVGSPYLPESVTGGTPAANERVKKVSNKADHLHLNEANLQVSRG
        AERIGVGVTSEAQSIFDALSKTLPVRWDKTVIVVMNEVRV SPYLP+SV+GGTPAANERVKKV      L L    LQV  G
Subjt:  AERIGVGVTSEAQSIFDALSKTLPVRWDKTVIVVMNEVRVGSPYLPESVTGGTPAANERVKKVSNKADHLHLNEANLQVSRG

A0A5D3CUL6 Protein LSM12-like protein A2.9e-8290.5Show/hide
Query:  MALDGSGNGDDFAVGSFFSIKTTLGDEFQGQVITFDRPSNILEGSKPGPRWNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQAEAE
        MALDGSGNGDDF+VGSFFSIKTTLGDEFQGQVITFDRPSNILEGSKPGPR NIRLLKANYIKEFSFLGHGEDPLDLK CYLDLNTLRAREELAIRQAEAE
Subjt:  MALDGSGNGDDFAVGSFFSIKTTLGDEFQGQVITFDRPSNILEGSKPGPRWNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQAEAE

Query:  AERIGVGVTSEAQSIFDALSKTLPVRWDKTVIVVMNEVRVGSPYLPESVTGGTPAANERVKKVSNKADHLHLNEANLQV
        AERIGVGVTSEAQSIFDALSKTLPVRWDKTVIVVMNEVRV SPYLP+SV+GGTPAANERVKKV      L L    LQV
Subjt:  AERIGVGVTSEAQSIFDALSKTLPVRWDKTVIVVMNEVRVGSPYLPESVTGGTPAANERVKKVSNKADHLHLNEANLQV

A0A6J1DDK3 protein LSM12 homolog2.1e-8088.11Show/hide
Query:  MALDGSGNGDDFAVGSFFSIKTTLGDEFQGQVITFDRPSNIL---EGSKPGPRWNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQA
        MALDGSGNGDDFAVGSFFSIKTTLGDEFQGQVITFDRPSNIL   EGSKPGP  NIRLLKANYIKEF+FLGHGEDPLDLKKCYLDLN+LRAREELAIRQA
Subjt:  MALDGSGNGDDFAVGSFFSIKTTLGDEFQGQVITFDRPSNIL---EGSKPGPRWNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQA

Query:  EAEAERIGVGVTSEAQSIFDALSKTLPVRWDKTVIVVMNEVRVGSPYLPESVTGGTPAANERVKKVSNKADHLHLNEANLQVSRG
        EAEAERIGVGVTSEAQSIFDALSKTLPVRWDKTVIVVMNEVRV SPYL ESVTGGTPAANERVKKV      L L    LQV  G
Subjt:  EAEAERIGVGVTSEAQSIFDALSKTLPVRWDKTVIVVMNEVRVGSPYLPESVTGGTPAANERVKKVSNKADHLHLNEANLQVSRG

SwissProt top hitse value%identityAlignment
Q5ZML5 Protein LSM12 homolog1.7e-1030.12Show/hide
Query:  GDDFAVGSFFSIKTTLGDEFQGQVITFDRPSNIL------EGSKPGPRWNIRLLKANYIKEFSFLG-HGEDPLDLKKCYLDLNTLRAREELAIRQAEAEA
        G+ F+VGS  S +T      QG+V+ FD PS +L         KP    +I L+   Y+ E   +    E P  L    L+++ L  +  +   +  ++A
Subjt:  GDDFAVGSFFSIKTTLGDEFQGQVITFDRPSNIL------EGSKPGPRWNIRLLKANYIKEFSFLG-HGEDPLDLKKCYLDLNTLRAREELAIRQAEAEA

Query:  ERIGVGVTSEAQSIFDALSKTL-PVRWDKTVIVVMNEVRVGSPYLPESVTGGTPAANERVKKVSNK
          I  GV+ E Q +F  + KT+   +W +  IVVM EV +  PY  E+  G   +A   V+K+  K
Subjt:  ERIGVGVTSEAQSIFDALSKTL-PVRWDKTVIVVMNEVRVGSPYLPESVTGGTPAANERVKKVSNK

Q6GP89 Protein LSM12 homolog3.0e-1229.47Show/hide
Query:  SGNGDDFAVGSFFSIKTTLGDEFQGQVITFDRPSNIL-----EGSKPGPRWNIRLLKANYIKEFSFLG-HGEDPLDLKKCYLDLNTLRAREELAIRQAEA
        +G G+ FA+G++ S +T      QG+V+ FD PS +L       S      +I LL  +Y+ +   +    + P  L    L++  L +R  L   +  +
Subjt:  SGNGDDFAVGSFFSIKTTLGDEFQGQVITFDRPSNIL-----EGSKPGPRWNIRLLKANYIKEFSFLG-HGEDPLDLKKCYLDLNTLRAREELAIRQAEA

Query:  EAERIGVGVTSEAQSIFDALSKTL-PVRWDKTVIVVMNEVRVGSPYLPESVTGGTPAANERVKKVSNKADHLHLNEANLQVSRGPVLGGQ
        +A  I  GV+ + Q +F  + KT+   +W +  IVVM+EV +  PY  E+  G    A   V K+  K  H    E    V R PV   Q
Subjt:  EAERIGVGVTSEAQSIFDALSKTL-PVRWDKTVIVVMNEVRVGSPYLPESVTGGTPAANERVKKVSNKADHLHLNEANLQVSRGPVLGGQ

Q6NSN1 Protein LSM12 homolog B6.7e-1229.67Show/hide
Query:  GNGDDFAVGSFFSIKTTLGDEFQGQVITFDRPSNIL------EGSKPGPRWNIRLLKANYIKEFSFL-GHGEDPLDLKKCYLDLNTLRAREELAIRQAEA
        G G+ F+VGS  S  T LG   QG+V+ FD PS +L         KP    ++ L+   Y+ E   +    E P  L     +    RAR E   + + A
Subjt:  GNGDDFAVGSFFSIKTTLGDEFQGQVITFDRPSNIL------EGSKPGPRWNIRLLKANYIKEFSFL-GHGEDPLDLKKCYLDLNTLRAREELAIRQAEA

Query:  EAERIGVGVTSEAQSIFDALSKTL-PVRWDKTVIVVMNEVRVGSPYLPESVTGGTPAANERVKKVSNKADHLHLNEANLQVS
         A  +  GV+ E Q +F  + KT+   +W +  I+VM++V +  PY  ++  G   +A   V+K+  K    H  +   Q+S
Subjt:  EAERIGVGVTSEAQSIFDALSKTL-PVRWDKTVIVVMNEVRVGSPYLPESVTGGTPAANERVKKVSNKADHLHLNEANLQVS

Q6P833 Protein LSM12 homolog1.7e-1028.8Show/hide
Query:  SGNGDDFAVGSFFSIKTTLGDEFQGQVITFDRPSNIL------EGSKPGPRWNIRLLKANYIKEFSFLG-HGEDPLDLKKCYLDLNTLRAREELAIRQAE
        +G G+ FA+G++ S +T      QG+V+ FD PS +L         KP    +I LL  +Y+ +   +    + P  L    L++  L +R  L   +  
Subjt:  SGNGDDFAVGSFFSIKTTLGDEFQGQVITFDRPSNIL------EGSKPGPRWNIRLLKANYIKEFSFLG-HGEDPLDLKKCYLDLNTLRAREELAIRQAE

Query:  AEAERIGVGVTSEAQSIFDALSKTL-PVRWDKTVIVVMNEVRVGSPYLPESVTGGTPAANERVKKVSNKADHLHLNEANLQVSRGPVLGGQ
        ++A  I  GV+ + Q +F  + KT+   +W +  IVVM E  +  PY  E+  G    A   V K+  K  H    E    V R P    Q
Subjt:  AEAERIGVGVTSEAQSIFDALSKTL-PVRWDKTVIVVMNEVRVGSPYLPESVTGGTPAANERVKKVSNKADHLHLNEANLQVSRGPVLGGQ

Q6PBA2 Protein LSM12 homolog A3.0e-1230.95Show/hide
Query:  GNGDDFAVGSFFSIKTTLGDEFQGQVITFDRPSNIL------EGSKPGPRWNIRLLKANYIKEFSFLG-HGEDPLDLKKCYLDLNTLRAREELAIRQAEA
        G G+ F+VGS  S  T LG   QG+V+ FD PS +L         KP    ++ L+   Y+ E   +    E P  L    +     RAR E   + ++A
Subjt:  GNGDDFAVGSFFSIKTTLGDEFQGQVITFDRPSNIL------EGSKPGPRWNIRLLKANYIKEFSFLG-HGEDPLDLKKCYLDLNTLRAREELAIRQAEA

Query:  EAERIGVGVTSEAQSIFDALSKTL-PVRWDKTVIVVMNEVRVGSPYLPESVTGGTPAANERVKKVSNK
         A  I  GV+ E Q +F  + KT+   +W +  I+VM++V +  PY  E+  G   +A   ++K+  K
Subjt:  EAERIGVGVTSEAQSIFDALSKTL-PVRWDKTVIVVMNEVRVGSPYLPESVTGGTPAANERVKKVSNK

Arabidopsis top hitse value%identityAlignment
AT1G24050.1 RNA-processing, Lsm domain1.4e-4457.06Show/hide
Query:  AKMALDGSGNGDDFAVGSFFSIKTTLGDEFQGQVITFDRPSNIL---EGSKPGPRW--NIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELA
        A +A    G G+ FAVG+ +S+K   GDEF+G V+ +D   N +   EG+KP P    N R++ A++I   S+LG  EDPLD     +DLN LRA+E LA
Subjt:  AKMALDGSGNGDDFAVGSFFSIKTTLGDEFQGQVITFDRPSNIL---EGSKPGPRW--NIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELA

Query:  IRQAEAEAERIGVGVTSEAQSIFDALSKTLPVRWDKTVIVVMNEVRVGSPYLPESVTGGTPAANERVKKV
        IRQAEA+AER+GVGVT+EAQSIFDALSKTLPV+W+ + I+VM EVRV SPYL + V GGT AAN RVKKV
Subjt:  IRQAEAEAERIGVGVTSEAQSIFDALSKTLPVRWDKTVIVVMNEVRVGSPYLPESVTGGTPAANERVKKV

AT1G70220.1 RNA-processing, Lsm domain2.8e-2131.78Show/hide
Query:  PRKKAKMALDGSGNGDDFAVGSFFSIKTTLGDEFQGQVITFDRPSN-------------------------------------------ILEGSKPGP--
        P       +D  G  + F VG  +++K T GD+F G V+ +D   N                                           + EG+KP P  
Subjt:  PRKKAKMALDGSGNGDDFAVGSFFSIKTTLGDEFQGQVITFDRPSN-------------------------------------------ILEGSKPGP--

Query:  RWNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQAEAEAERIGVGVTSEAQSIFDALSKTLPVRWDKTVIVVMNEVRVGSPYLPESV
          ++R++  NYI E   LG  ++ L  K   ++L+ L  +E  AI    +  E+IG GVT+E Q IFDA+SKTLP+RW    ++VM +V + SPY  + V
Subjt:  RWNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRAREELAIRQAEAEAERIGVGVTSEAQSIFDALSKTLPVRWDKTVIVVMNEVRVGSPYLPESV

Query:  TGGTPAANERVKKV
         GG    NERVK V
Subjt:  TGGTPAANERVKKV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGCAGTCATTCTGAACACGATCTCCAACGCCATTAGCGTATGGAAATGGATGCATGTAAGGACCCGGGTGCCCATCGGGCCTCAACAGCGATATCTTCGTCACGT
CCAAGGCCTCGATTCTCAGCCCCCCGACTTGCTTCGCTCTCGCCTTTGCCTCTTCCACTTCCTCCACCTCTATCTCCCTTATCTCTGCGTCCATTCCCTCCACTTTCCTC
TCGTTTTCCTCGTAAGGCTTCGTCTTGGGGCACGCGCCTGCTTTATCCCACTCTCCCTCGAAGTGCGAAGGCGAAAATGTCGCCAGAAATACGTCCATTTCATTGCTGCT
CGGACCCCTTCTGTCGATTATGGTCTGAAATGTCGTTCTCAAGCCCCTCCTCAGGCGTCTGCCGTCGACGAGAAACGGCGACCAGTACACCGAAACCGTGAGATTATGAG
AGGGGAAGTTCCATCTCCGGAACTTGTTGTCCTCGCCGTCTCTCTGGAGAAAGACGTCGGGGTCGAACATTGGGAGGCTGCATTTGCTGGGCTTCCAGCGCCAATAGAGA
TAACCCAAGAGAGGCAGCGAGATTCCTGGTGGATGGCTCATGGAGTCACATTTGCAGGCAAGGACCCCATTTACCTATGGGTTTCGGTTTGGTGCCCAAAACCCGCTACA
CGTGGTGCCTTCTCATTGGAGGAGGAAGCGACGGTTCCAACTTTTATACCGAGGAAGAAGGCGAAAATGGCACTGGACGGTAGCGGCAATGGGGATGACTTCGCAGTTGG
GTCCTTCTTCTCCATTAAGACGACCTTAGGCGATGAATTTCAAGGACAAGTCATTACCTTTGACCGCCCCTCCAACATCCTCGAGGGTTCGAAGCCAGGACCTCGTTGGA
ACATAAGGCTGCTGAAGGCCAATTATATAAAGGAGTTTTCGTTTTTGGGACATGGCGAAGATCCTCTTGATCTCAAAAAGTGTTACCTCGATCTCAATACTCTCCGTGCT
CGAGAGGAACTGGCCATTAGGCAGGCAGAGGCAGAGGCGGAGAGGATAGGAGTGGGTGTGACCAGCGAGGCTCAGAGTATTTTTGACGCCTTATCTAAAACGCTTCCGGT
TCGCTGGGACAAGACTGTCATAGTTGTAATGAATGAAGTACGTGTTGGCAGTCCCTACCTACCAGAATCCGTTACTGGAGGCACCCCTGCTGCCAATGAACGGGTGAAGA
AAGTGAGTAACAAAGCTGATCATCTTCATCTGAACGAAGCTAATCTGCAAGTTTCCAGAGGACCTGTTCTTGGTGGGCAAACCAGAATCCTCGCTGACCTGCTGCAATCT
CAGTCGCTGAAACAATTCTCTGAAACAAGTCTCCTGGCAAGAGGACTTGTCTACATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGACGCAGTCATTCTGAACACGATCTCCAACGCCATTAGCGTATGGAAATGGATGCATGTAAGGACCCGGGTGCCCATCGGGCCTCAACAGCGATATCTTCGTCACGT
CCAAGGCCTCGATTCTCAGCCCCCCGACTTGCTTCGCTCTCGCCTTTGCCTCTTCCACTTCCTCCACCTCTATCTCCCTTATCTCTGCGTCCATTCCCTCCACTTTCCTC
TCGTTTTCCTCGTAAGGCTTCGTCTTGGGGCACGCGCCTGCTTTATCCCACTCTCCCTCGAAGTGCGAAGGCGAAAATGTCGCCAGAAATACGTCCATTTCATTGCTGCT
CGGACCCCTTCTGTCGATTATGGTCTGAAATGTCGTTCTCAAGCCCCTCCTCAGGCGTCTGCCGTCGACGAGAAACGGCGACCAGTACACCGAAACCGTGAGATTATGAG
AGGGGAAGTTCCATCTCCGGAACTTGTTGTCCTCGCCGTCTCTCTGGAGAAAGACGTCGGGGTCGAACATTGGGAGGCTGCATTTGCTGGGCTTCCAGCGCCAATAGAGA
TAACCCAAGAGAGGCAGCGAGATTCCTGGTGGATGGCTCATGGAGTCACATTTGCAGGCAAGGACCCCATTTACCTATGGGTTTCGGTTTGGTGCCCAAAACCCGCTACA
CGTGGTGCCTTCTCATTGGAGGAGGAAGCGACGGTTCCAACTTTTATACCGAGGAAGAAGGCGAAAATGGCACTGGACGGTAGCGGCAATGGGGATGACTTCGCAGTTGG
GTCCTTCTTCTCCATTAAGACGACCTTAGGCGATGAATTTCAAGGACAAGTCATTACCTTTGACCGCCCCTCCAACATCCTCGAGGGTTCGAAGCCAGGACCTCGTTGGA
ACATAAGGCTGCTGAAGGCCAATTATATAAAGGAGTTTTCGTTTTTGGGACATGGCGAAGATCCTCTTGATCTCAAAAAGTGTTACCTCGATCTCAATACTCTCCGTGCT
CGAGAGGAACTGGCCATTAGGCAGGCAGAGGCAGAGGCGGAGAGGATAGGAGTGGGTGTGACCAGCGAGGCTCAGAGTATTTTTGACGCCTTATCTAAAACGCTTCCGGT
TCGCTGGGACAAGACTGTCATAGTTGTAATGAATGAAGTACGTGTTGGCAGTCCCTACCTACCAGAATCCGTTACTGGAGGCACCCCTGCTGCCAATGAACGGGTGAAGA
AAGTGAGTAACAAAGCTGATCATCTTCATCTGAACGAAGCTAATCTGCAAGTTTCCAGAGGACCTGTTCTTGGTGGGCAAACCAGAATCCTCGCTGACCTGCTGCAATCT
CAGTCGCTGAAACAATTCTCTGAAACAAGTCTCCTGGCAAGAGGACTTGTCTACATTTGA
Protein sequenceShow/hide protein sequence
MDAVILNTISNAISVWKWMHVRTRVPIGPQQRYLRHVQGLDSQPPDLLRSRLCLFHFLHLYLPYLCVHSLHFPLVFLVRLRLGARACFIPLSLEVRRRKCRQKYVHFIAA
RTPSVDYGLKCRSQAPPQASAVDEKRRPVHRNREIMRGEVPSPELVVLAVSLEKDVGVEHWEAAFAGLPAPIEITQERQRDSWWMAHGVTFAGKDPIYLWVSVWCPKPAT
RGAFSLEEEATVPTFIPRKKAKMALDGSGNGDDFAVGSFFSIKTTLGDEFQGQVITFDRPSNILEGSKPGPRWNIRLLKANYIKEFSFLGHGEDPLDLKKCYLDLNTLRA
REELAIRQAEAEAERIGVGVTSEAQSIFDALSKTLPVRWDKTVIVVMNEVRVGSPYLPESVTGGTPAANERVKKVSNKADHLHLNEANLQVSRGPVLGGQTRILADLLQS
QSLKQFSETSLLARGLVYI