; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr023836 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr023836
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationtig00000892:7351727..7352828
RNA-Seq ExpressionSgr023836
SyntenySgr023836
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PQM35718.1 pentatricopeptide repeat-containing protein [Prunus yedoensis var. nudiflora]3.5e-3451.5Show/hide
Query:  GALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFK--ADKE----------------CRDAIDLFNKMINFGIKPDERTMTAVLSACRN
        GAL +GRRVH +VE   +G K+NV +ALID+Y+KCGCI SA+Q+F    DK+                C+DAIDLF+KM  FGIKPDERTMTAVLSACRN
Subjt:  GALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFK--ADKE----------------CRDAIDLFNKMINFGIKPDERTMTAVLSACRN

Query:  A------------ELQKYGVKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEMLIETSVCLNIFSSLK
        A               +YGV+PTI H GC+VDLLARAGH+KEAE FI  + +  +  +C N+  + K
Subjt:  A------------ELQKYGVKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEMLIETSVCLNIFSSLK

XP_008236936.1 PREDICTED: pentatricopeptide repeat-containing protein At4g21065-like [Prunus mume]3.1e-3547.72Show/hide
Query:  GALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFK--ADKE----------------CRDAIDLFNKMINFGIKPDERTMTAVLSACRN
        GAL +GRRVH +VE   +G K+NV +ALID+YAKCGCI SA+Q+F    DK+                C+DAIDLF+KM  FGIKPDERTMTAVLSACRN
Subjt:  GALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFK--ADKE----------------CRDAIDLFNKMINFGIKPDERTMTAVLSACRN

Query:  AE------------LQKYGVKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEMLIETSVCLNI-------------------FSSLKKDVSDCGSYIL
        A               +YGV+PTI H GC+VDLLARAGH+KEAE FI  M +  +  +C N+                      LK   +D GSY+L
Subjt:  AE------------LQKYGVKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEMLIETSVCLNI-------------------FSSLKKDVSDCGSYIL

XP_021827919.1 pentatricopeptide repeat-containing protein At4g21065-like [Prunus avium]1.6e-3447.52Show/hide
Query:  GALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFK--ADKE----------------CRDAIDLFNKMINFGIKPDERTMTAVLSACRN
        GAL +GRRVH +VE   +G K+NV +ALID+YAKCGCI SA+Q+F    DK+                C+DAIDLF KM  FGIKPDERTMTAVLSACRN
Subjt:  GALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFK--ADKE----------------CRDAIDLFNKMINFGIKPDERTMTAVLSACRN

Query:  A------------ELQKYGVKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEMLIETSVCLNIFSSLK--KDVSDCGSYI--LAELVVLRGLPQAYELISK
        A               +YGV+P I H GC+VDLLARAGH+KEAE FI  M +  +  +C N+  + K  KD       I  L +L +      +Y LI  
Subjt:  A------------ELQKYGVKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEMLIETSVCLNIFSSLK--KDVSDCGSYI--LAELVVLRGLPQAYELISK

Query:  IY
        +Y
Subjt:  IY

XP_034228644.1 pentatricopeptide repeat-containing protein At4g21065-like [Prunus dulcis]1.6e-3452.1Show/hide
Query:  GALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFK--ADKE----------------CRDAIDLFNKMINFGIKPDERTMTAVLSACRN
        GAL +GRRVH +VE   +G K+NV +ALID+YAKCGCI SA+Q+     DK+                C+DAIDLF+KM  FGIKPDERTMTAVLSACRN
Subjt:  GALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFK--ADKE----------------CRDAIDLFNKMINFGIKPDERTMTAVLSACRN

Query:  A------------ELQKYGVKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEMLIETSVCLNIFSSLK
        A               +YGV+PTI H GC+VDLLARAGH+KEAE FI  M +  +  +C N+  + K
Subjt:  A------------ELQKYGVKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEMLIETSVCLNIFSSLK

XP_034228706.1 pentatricopeptide repeat-containing protein At4g21065-like [Prunus dulcis]1.6e-3452.1Show/hide
Query:  GALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFK--ADKE----------------CRDAIDLFNKMINFGIKPDERTMTAVLSACRN
        GAL +GRRVH +VE   +G K+NV +ALID+YAKCGCI SA+Q+     DK+                C+DAIDLF+KM  FGIKPDERTMTAVLSACRN
Subjt:  GALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFK--ADKE----------------CRDAIDLFNKMINFGIKPDERTMTAVLSACRN

Query:  A------------ELQKYGVKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEMLIETSVCLNIFSSLK
        A               +YGV+PTI H GC+VDLLARAGH+KEAE FI  M +  +  +C N+  + K
Subjt:  A------------ELQKYGVKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEMLIETSVCLNIFSSLK

TrEMBL top hitse value%identityAlignment
A0A314UFA2 Pentatricopeptide repeat-containing protein1.7e-3451.5Show/hide
Query:  GALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFK--ADKE----------------CRDAIDLFNKMINFGIKPDERTMTAVLSACRN
        GAL +GRRVH +VE   +G K+NV +ALID+Y+KCGCI SA+Q+F    DK+                C+DAIDLF+KM  FGIKPDERTMTAVLSACRN
Subjt:  GALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFK--ADKE----------------CRDAIDLFNKMINFGIKPDERTMTAVLSACRN

Query:  A------------ELQKYGVKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEMLIETSVCLNIFSSLK
        A               +YGV+PTI H GC+VDLLARAGH+KEAE FI  + +  +  +C N+  + K
Subjt:  A------------ELQKYGVKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEMLIETSVCLNIFSSLK

A0A5E4GB73 PREDICTED: pentatricopeptide repeat-containing7.5e-3552.1Show/hide
Query:  GALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFK--ADKE----------------CRDAIDLFNKMINFGIKPDERTMTAVLSACRN
        GAL +GRRVH +VE   +G K+NV +ALID+YAKCGCI SA+Q+     DK+                C+DAIDLF+KM  FGIKPDERTMTAVLSACRN
Subjt:  GALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFK--ADKE----------------CRDAIDLFNKMINFGIKPDERTMTAVLSACRN

Query:  A------------ELQKYGVKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEMLIETSVCLNIFSSLK
        A               +YGV+PTI H GC+VDLLARAGH+KEAE FI  M +  +  +C N+  + K
Subjt:  A------------ELQKYGVKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEMLIETSVCLNIFSSLK

A0A5E4GBA1 PREDICTED: pentatricopeptide repeat-containing7.5e-3552.1Show/hide
Query:  GALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFK--ADKE----------------CRDAIDLFNKMINFGIKPDERTMTAVLSACRN
        GAL +GRRVH +VE   +G K+NV +ALID+YAKCGCI SA+Q+     DK+                C+DAIDLF+KM  FGIKPDERTMTAVLSACRN
Subjt:  GALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFK--ADKE----------------CRDAIDLFNKMINFGIKPDERTMTAVLSACRN

Query:  A------------ELQKYGVKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEMLIETSVCLNIFSSLK
        A               +YGV+PTI H GC+VDLLARAGH+KEAE FI  M +  +  +C N+  + K
Subjt:  A------------ELQKYGVKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEMLIETSVCLNIFSSLK

A0A6P5TLJ5 pentatricopeptide repeat-containing protein At4g21065-like7.5e-3547.52Show/hide
Query:  GALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFK--ADKE----------------CRDAIDLFNKMINFGIKPDERTMTAVLSACRN
        GAL +GRRVH +VE   +G K+NV +ALID+YAKCGCI SA+Q+F    DK+                C+DAIDLF KM  FGIKPDERTMTAVLSACRN
Subjt:  GALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFK--ADKE----------------CRDAIDLFNKMINFGIKPDERTMTAVLSACRN

Query:  A------------ELQKYGVKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEMLIETSVCLNIFSSLK--KDVSDCGSYI--LAELVVLRGLPQAYELISK
        A               +YGV+P I H GC+VDLLARAGH+KEAE FI  M +  +  +C N+  + K  KD       I  L +L +      +Y LI  
Subjt:  A------------ELQKYGVKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEMLIETSVCLNIFSSLK--KDVSDCGSYI--LAELVVLRGLPQAYELISK

Query:  IY
        +Y
Subjt:  IY

M5VNU1 DYW_deaminase domain-containing protein3.7e-3452.1Show/hide
Query:  GALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFK--ADKE----------------CRDAIDLFNKMINFGIKPDERTMTAVLSACRN
        GAL +GRRVH +VE   +G K+NV +ALID+YAKC CI SA Q+F    DK+                C+DAIDLF+KM  FGIKPDERTMTAVLSACRN
Subjt:  GALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFK--ADKE----------------CRDAIDLFNKMINFGIKPDERTMTAVLSACRN

Query:  A------------ELQKYGVKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEMLIETSVCLNIFSSLK
        A               +YGV+PTI H GC+VDLLARAGH+KEAE FI  M +  +  +C N+  + K
Subjt:  A------------ELQKYGVKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEMLIETSVCLNIFSSLK

SwissProt top hitse value%identityAlignment
Q9FJY7 Pentatricopeptide repeat-containing protein At5g665201.4e-1737.91Show/hide
Query:  LGALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFKADKE------------------CRDAIDLFNKMINFGIKPDERTMTAVLSACR
        LGAL  G+ +H  + K ++   S +   LID+YAKCG +  A ++FK  K+                   R+AI  F +M   GIKP+  T TAVL+AC 
Subjt:  LGALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFKADKE------------------CRDAIDLFNKMINFGIKPDERTMTAVLSACR

Query:  NAEL------------QKYGVKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEM
           L            + Y +KPTI H GCIVDLL RAG + EA+ FI  M +
Subjt:  NAEL------------QKYGVKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEM

Q9LS72 Pentatricopeptide repeat-containing protein At3g292302.1e-1832Show/hide
Query:  GALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFK------------------ADKECRDAIDLFNKMINFGIKPDERTMTAVLSACRN
        G L LG R+H I+++  LGS + V  AL+D+YAKCG +  A  +F                        ++AI+LF++M   GI+PD+ T  AVL +C +
Subjt:  GALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFK------------------ADKECRDAIDLFNKMINFGIKPDERTMTAVLSACRN

Query:  AEL------------QKYGVKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEMLIETSVCLNIFSSLK-KDVSDCGSYILAELVVLRGL-PQAYELISKIY
        A L            + Y + P + H GC+VDLL R G +KEA   +  M M     +   +  + +  +  D    +L  LV L    P  Y L+S IY
Subjt:  AEL------------QKYGVKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEMLIETSVCLNIFSSLK-KDVSDCGSYILAELVVLRGL-PQAYELISKIY

Q9SMZ2 Pentatricopeptide repeat-containing protein At4g331709.5e-1931.84Show/hide
Query:  LGALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFKADK------------------ECRDAIDLFNKMINFGIKPDERTMTAVLSACR
        L AL  GR++H    K    +   V T+L+D+YAKCG I+ A  +FK  +                  E ++ + LF +M + GIKPD+ T   VLSAC 
Subjt:  LGALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFKADK------------------ECRDAIDLFNKMINFGIKPDERTMTAVLSACR

Query:  NAELQK------------YGVKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEMLIETSVCLNIFSSLK-KDVSDCGSYILAELVVLRGL-PQAYELISKI
        ++ L              YG+KP I H  C+ D L RAG +K+AE  I +M M    S+   + ++ + +  ++ G  +  +L+ L  L   AY L+S +
Subjt:  NAELQK------------YGVKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEMLIETSVCLNIFSSLK-KDVSDCGSYILAELVVLRGL-PQAYELISKI

Query:  Y
        Y
Subjt:  Y

Q9SN85 Pentatricopeptide repeat-containing protein At3g475303.0e-2030.16Show/hide
Query:  LGALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFKADKE------------------CRDAIDLFNKMINFGIKPDERTMTAVLSACR
        LGAL  G++VH+ +++  L    N+   L+ +Y++CG ++ A Q+F   +E                   ++AI+ FN+M+ FGI P+E+T+T +LSAC 
Subjt:  LGALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFKADKE------------------CRDAIDLFNKMINFGIKPDERTMTAVLSACR

Query:  NAELQKYG-------------VKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEMLIETSVCLNIFSSLK--KDVSDCGSYILAELVVLR
        ++ L   G             +KP +HH GC+VDLL RA  + +A   I +MEM  ++++   +  + +   DV + G  +++ L+ L+
Subjt:  NAELQKYG-------------VKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEMLIETSVCLNIFSSLK--KDVSDCGSYILAELVVLR

Q9SUH6 Pentatricopeptide repeat-containing protein At4g307001.1e-1735.85Show/hide
Query:  LGALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFK------------------ADKECRDAIDLFNKMINFGIKPDERTMTAVLSACR
        LGAL LG+ VH++V      S   V TALI +YAKCG I  A+++F                      + ++A+++F +M+N GI P   T   VL AC 
Subjt:  LGALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFK------------------ADKECRDAIDLFNKMINFGIKPDERTMTAVLSACR

Query:  NAELQK------------YGVKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEMLIETSV
        +A L K            YG +P++ H  C+VD+L RAGH++ A  FI AM +   +SV
Subjt:  NAELQK------------YGVKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEMLIETSV

Arabidopsis top hitse value%identityAlignment
AT1G13410.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.7e-2033Show/hide
Query:  LGALMLGRRVHEIVEKWKLGSKS-NVRTALIDVYAKCGCINSAKQIFKADK------------------ECRDAIDLFNKMINFGIKPDERTMTAVLSAC
        LGA   G+ VH+  E         NV+ ALID+Y KCG I  A ++FK  K                     +A++LF++M N GI PD+ T   VL AC
Subjt:  LGALMLGRRVHEIVEKWKLGSKS-NVRTALIDVYAKCGCINSAKQIFKADK------------------ECRDAIDLFNKMINFGIKPDERTMTAVLSAC

Query:  R------------NAELQKYGVKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEMLIETSVCLNIFSSLK--KDVSDCGSYILAELVVLRGL-PQAYELIS
        +            N+    + + P I HCGC+VDLL+RAG + +A  FI  M +  +  +   +  + K  K V D G   L EL+ L    P  + ++S
Subjt:  R------------NAELQKYGVKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEMLIETSVCLNIFSSLK--KDVSDCGSYILAELVVLRGL-PQAYELIS

Query:  KIY
         IY
Subjt:  KIY

AT3G29230.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.5e-1932Show/hide
Query:  GALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFK------------------ADKECRDAIDLFNKMINFGIKPDERTMTAVLSACRN
        G L LG R+H I+++  LGS + V  AL+D+YAKCG +  A  +F                        ++AI+LF++M   GI+PD+ T  AVL +C +
Subjt:  GALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFK------------------ADKECRDAIDLFNKMINFGIKPDERTMTAVLSACRN

Query:  AEL------------QKYGVKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEMLIETSVCLNIFSSLK-KDVSDCGSYILAELVVLRGL-PQAYELISKIY
        A L            + Y + P + H GC+VDLL R G +KEA   +  M M     +   +  + +  +  D    +L  LV L    P  Y L+S IY
Subjt:  AEL------------QKYGVKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEMLIETSVCLNIFSSLK-KDVSDCGSYILAELVVLRGL-PQAYELISKIY

AT3G47530.1 Pentatricopeptide repeat (PPR) superfamily protein2.1e-2130.16Show/hide
Query:  LGALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFKADKE------------------CRDAIDLFNKMINFGIKPDERTMTAVLSACR
        LGAL  G++VH+ +++  L    N+   L+ +Y++CG ++ A Q+F   +E                   ++AI+ FN+M+ FGI P+E+T+T +LSAC 
Subjt:  LGALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFKADKE------------------CRDAIDLFNKMINFGIKPDERTMTAVLSACR

Query:  NAELQKYG-------------VKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEMLIETSVCLNIFSSLK--KDVSDCGSYILAELVVLR
        ++ L   G             +KP +HH GC+VDLL RA  + +A   I +MEM  ++++   +  + +   DV + G  +++ L+ L+
Subjt:  NAELQKYG-------------VKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEMLIETSVCLNIFSSLK--KDVSDCGSYILAELVVLR

AT4G30700.1 Pentatricopeptide repeat (PPR) superfamily protein7.5e-1935.85Show/hide
Query:  LGALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFK------------------ADKECRDAIDLFNKMINFGIKPDERTMTAVLSACR
        LGAL LG+ VH++V      S   V TALI +YAKCG I  A+++F                      + ++A+++F +M+N GI P   T   VL AC 
Subjt:  LGALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFK------------------ADKECRDAIDLFNKMINFGIKPDERTMTAVLSACR

Query:  NAELQK------------YGVKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEMLIETSV
        +A L K            YG +P++ H  C+VD+L RAGH++ A  FI AM +   +SV
Subjt:  NAELQK------------YGVKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEMLIETSV

AT4G33170.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.7e-2031.84Show/hide
Query:  LGALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFKADK------------------ECRDAIDLFNKMINFGIKPDERTMTAVLSACR
        L AL  GR++H    K    +   V T+L+D+YAKCG I+ A  +FK  +                  E ++ + LF +M + GIKPD+ T   VLSAC 
Subjt:  LGALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFKADK------------------ECRDAIDLFNKMINFGIKPDERTMTAVLSACR

Query:  NAELQK------------YGVKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEMLIETSVCLNIFSSLK-KDVSDCGSYILAELVVLRGL-PQAYELISKI
        ++ L              YG+KP I H  C+ D L RAG +K+AE  I +M M    S+   + ++ + +  ++ G  +  +L+ L  L   AY L+S +
Subjt:  NAELQK------------YGVKPTIHHCGCIVDLLARAGHIKEAEGFIWAMEMLIETSVCLNIFSSLK-KDVSDCGSYILAELVVLRGL-PQAYELISKI

Query:  Y
        Y
Subjt:  Y


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCGGGGCGTTAATGTTGGGAAGGAGAGTGCATGAGATTGTTGAAAAGTGGAAGCTTGGTTCAAAATCTAATGTGAGAACTGCGCTCATTGATGTGTACGCAAAATG
CGGTTGCATAAACAGCGCAAAGCAGATTTTCAAAGCTGACAAAGAATGCCGAGATGCGATTGATCTATTTAACAAGATGATCAATTTTGGAATTAAGCCTGATGAGAGAA
CAATGACTGCAGTTTTATCAGCATGTAGGAATGCAGAACTACAGAAGTATGGAGTTAAGCCAACCATTCACCATTGTGGATGTATAGTGGACCTTCTTGCCAGGGCAGGG
CATATAAAGGAAGCTGAGGGGTTTATATGGGCAATGGAGATGTTAATCGAGACGAGCGTTTGCTTGAACATCTTTAGCTCCTTAAAGAAGGATGTAAGTGATTGTGGGAG
CTATATACTTGCAGAACTTGTGGTCCTGCGAGGACTGCCACAAGCTTACGAACTGATCTCAAAGATTTACCAAAGAGAGACCATAATGAGGGATAGATTCGCTTCCACCA
TTTCAGAATTGGTCATTGCTCTTGCAAGGACTACTGGTAGCAGAGGGCAATACAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGCTCGGGGCGTTAATGTTGGGAAGGAGAGTGCATGAGATTGTTGAAAAGTGGAAGCTTGGTTCAAAATCTAATGTGAGAACTGCGCTCATTGATGTGTACGCAAAATG
CGGTTGCATAAACAGCGCAAAGCAGATTTTCAAAGCTGACAAAGAATGCCGAGATGCGATTGATCTATTTAACAAGATGATCAATTTTGGAATTAAGCCTGATGAGAGAA
CAATGACTGCAGTTTTATCAGCATGTAGGAATGCAGAACTACAGAAGTATGGAGTTAAGCCAACCATTCACCATTGTGGATGTATAGTGGACCTTCTTGCCAGGGCAGGG
CATATAAAGGAAGCTGAGGGGTTTATATGGGCAATGGAGATGTTAATCGAGACGAGCGTTTGCTTGAACATCTTTAGCTCCTTAAAGAAGGATGTAAGTGATTGTGGGAG
CTATATACTTGCAGAACTTGTGGTCCTGCGAGGACTGCCACAAGCTTACGAACTGATCTCAAAGATTTACCAAAGAGAGACCATAATGAGGGATAGATTCGCTTCCACCA
TTTCAGAATTGGTCATTGCTCTTGCAAGGACTACTGGTAGCAGAGGGCAATACAGATGA
Protein sequenceShow/hide protein sequence
MLGALMLGRRVHEIVEKWKLGSKSNVRTALIDVYAKCGCINSAKQIFKADKECRDAIDLFNKMINFGIKPDERTMTAVLSACRNAELQKYGVKPTIHHCGCIVDLLARAG
HIKEAEGFIWAMEMLIETSVCLNIFSSLKKDVSDCGSYILAELVVLRGLPQAYELISKIYQRETIMRDRFASTISELVIALARTTGSRGQYR