; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr022754 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr022754
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationtig00000589:1481887..1482420
RNA-Seq ExpressionSgr022754
SyntenySgr022754
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6580575.1 ABC transporter G family member 20, partial [Cucurbita argyrosperma subsp. sororia]1.4e-6886.45Show/hide
Query:  ALQAIRRADGMNYGAYGRLIQRCTDHLFVPLGKQLHARLVLFSVSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFISYTLHNMHTDMLK
        ALQ IRR+DGMNYGAYGRLIQ CTD  F  LGKQLHARLVL SV+PDNFLGSKLIA YSKSGSLRDAYNVF +I+HKNIFSWNALFISYTLHNMH DMLK
Subjt:  ALQAIRRADGMNYGAYGRLIQRCTDHLFVPLGKQLHARLVLFSVSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFISYTLHNMHTDMLK

Query:  LFSSLVNSNATDVKPDKFTITCVLKALASLFSNSILAKEVHCFILRRGLESDILL
        LFSSLVN N+TDVKPDKFT+TCVLKALASLF+NSILAKEVHCF+LRRGLESDI +
Subjt:  LFSSLVNSNATDVKPDKFTITCVLKALASLFSNSILAKEVHCFILRRGLESDILL

KAG7017327.1 ABC transporter G family member 20, partial [Cucurbita argyrosperma subsp. argyrosperma]1.4e-6886.45Show/hide
Query:  ALQAIRRADGMNYGAYGRLIQRCTDHLFVPLGKQLHARLVLFSVSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFISYTLHNMHTDMLK
        ALQ IRR+DGMNYGAYGRLIQ CTD  F  LGKQLHARLVL SV+PDNFLGSKLIA YSKSGSLRDAYNVF +I+HKNIFSWNALFISYTLHNMH DMLK
Subjt:  ALQAIRRADGMNYGAYGRLIQRCTDHLFVPLGKQLHARLVLFSVSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFISYTLHNMHTDMLK

Query:  LFSSLVNSNATDVKPDKFTITCVLKALASLFSNSILAKEVHCFILRRGLESDILL
        LFSSLVN N+TDVKPDKFT+TCVLKALASLF+NSILAKEVHCF+LRRGLESDI +
Subjt:  LFSSLVNSNATDVKPDKFTITCVLKALASLFSNSILAKEVHCFILRRGLESDILL

XP_022145703.1 pentatricopeptide repeat-containing protein At2g37310 [Momordica charantia]3.2e-7687.43Show/hide
Query:  QISIPAGAIFPWALQAIRRADGMNYGAYGRLIQRCTDHLFVPLGKQLHARLVLFSVSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFIS
        QISIPAGA+ PWALQAIRRADGMNY AYGRLIQ C D  F+ LGKQLHARLVL SV+PDNFLGSKLIAFYSKSGSLRDAYNVF NI+HKNIFSWNALFIS
Subjt:  QISIPAGAIFPWALQAIRRADGMNYGAYGRLIQRCTDHLFVPLGKQLHARLVLFSVSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFIS

Query:  YTLHNMHTDMLKLFSSLVNSNATDVKPDKFTITCVLKALASLFSNSILAKEVHCFILRRGLESDILL
        YTLHNMH+DMLKLFSSLVNSNA DVKPDKFTITCVLKALAS F++SILAKEVHCF+LRRGLESDI +
Subjt:  YTLHNMHTDMLKLFSSLVNSNATDVKPDKFTITCVLKALASLFSNSILAKEVHCFILRRGLESDILL

XP_022983956.1 pentatricopeptide repeat-containing protein At2g37310 [Cucurbita maxima]6.3e-6486.9Show/hide
Query:  MNYGAYGRLIQRCTDHLFVPLGKQLHARLVLFSVSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFISYTLHNMHTDMLKLFSSLVNSNA
        MNYGAYGRLIQ CTD  F  LGKQLHARLVL SV+PDNFLGSKLIA YSKSGSLRDAYNVF +I+HKNIFSWNALFISYTLHNMH DMLKLFSSLVN N+
Subjt:  MNYGAYGRLIQRCTDHLFVPLGKQLHARLVLFSVSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFISYTLHNMHTDMLKLFSSLVNSNA

Query:  TDVKPDKFTITCVLKALASLFSNSILAKEVHCFILRRGLESDILL
        TDVKPDKFT+TCVLKALASLF+NSILAKEVHCF+LRRGLESDI +
Subjt:  TDVKPDKFTITCVLKALASLFSNSILAKEVHCFILRRGLESDILL

XP_038905794.1 pentatricopeptide repeat-containing protein At2g37310 [Benincasa hispida]3.9e-7482.18Show/hide
Query:  MRSPQSLQISIPAGAIFPWALQAIRRADGMNYGAYGRLIQRCTDHLFVPLGKQLHARLVLFSVSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFS
        MRSP++ +  +PA     WALQA+RR D MNYGAYGRLIQ CTD LFV LGKQLHARLVL SV+PDNFLGSKLIAFYSKSGSLRDAYNVF NI+HKNIF+
Subjt:  MRSPQSLQISIPAGAIFPWALQAIRRADGMNYGAYGRLIQRCTDHLFVPLGKQLHARLVLFSVSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFS

Query:  WNALFISYTLHNMHTDMLKLFSSLVNSNATDVKPDKFTITCVLKALASLFSNSILAKEVHCFILRRGLESDILL
        WNALFISYTLHNMH DML+LFSSLVNSN+TDVKPDKFTITCVLKALASLFSNS+LAKEVHCFILRR LE DI +
Subjt:  WNALFISYTLHNMHTDMLKLFSSLVNSNATDVKPDKFTITCVLKALASLFSNSILAKEVHCFILRRGLESDILL

TrEMBL top hitse value%identityAlignment
A0A1S4DUQ6 pentatricopeptide repeat-containing protein At2g373106.8e-6486.21Show/hide
Query:  MNYGAYGRLIQRCTDHLFVPLGKQLHARLVLFSVSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFISYTLHNMHTDMLKLFSSLVNSNA
        MNYGAYGRLIQ CTDHLF  +GKQLHARLVL SV+PDNFLGSKLI+FYSKSGSLRDAYNVF  I  KNIFSWNALFISYTLHNMHTD+LKLF SLVNSN+
Subjt:  MNYGAYGRLIQRCTDHLFVPLGKQLHARLVLFSVSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFISYTLHNMHTDMLKLFSSLVNSNA

Query:  TDVKPDKFTITCVLKALASLFSNSILAKEVHCFILRRGLESDILL
        TDVKPD+FT+TCVLKALASLFSNS+LAKEVHCFILRR LESDI +
Subjt:  TDVKPDKFTITCVLKALASLFSNSILAKEVHCFILRRGLESDILL

A0A5A7TRM4 Pentatricopeptide repeat-containing protein6.8e-6486.21Show/hide
Query:  MNYGAYGRLIQRCTDHLFVPLGKQLHARLVLFSVSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFISYTLHNMHTDMLKLFSSLVNSNA
        MNYGAYGRLIQ CTDHLF  +GKQLHARLVL SV+PDNFLGSKLI+FYSKSGSLRDAYNVF  I  KNIFSWNALFISYTLHNMHTD+LKLF SLVNSN+
Subjt:  MNYGAYGRLIQRCTDHLFVPLGKQLHARLVLFSVSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFISYTLHNMHTDMLKLFSSLVNSNA

Query:  TDVKPDKFTITCVLKALASLFSNSILAKEVHCFILRRGLESDILL
        TDVKPD+FT+TCVLKALASLFSNS+LAKEVHCFILRR LESDI +
Subjt:  TDVKPDKFTITCVLKALASLFSNSILAKEVHCFILRRGLESDILL

A0A6J1CWN9 pentatricopeptide repeat-containing protein At2g373101.5e-7687.43Show/hide
Query:  QISIPAGAIFPWALQAIRRADGMNYGAYGRLIQRCTDHLFVPLGKQLHARLVLFSVSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFIS
        QISIPAGA+ PWALQAIRRADGMNY AYGRLIQ C D  F+ LGKQLHARLVL SV+PDNFLGSKLIAFYSKSGSLRDAYNVF NI+HKNIFSWNALFIS
Subjt:  QISIPAGAIFPWALQAIRRADGMNYGAYGRLIQRCTDHLFVPLGKQLHARLVLFSVSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFIS

Query:  YTLHNMHTDMLKLFSSLVNSNATDVKPDKFTITCVLKALASLFSNSILAKEVHCFILRRGLESDILL
        YTLHNMH+DMLKLFSSLVNSNA DVKPDKFTITCVLKALAS F++SILAKEVHCF+LRRGLESDI +
Subjt:  YTLHNMHTDMLKLFSSLVNSNATDVKPDKFTITCVLKALASLFSNSILAKEVHCFILRRGLESDILL

A0A6J1F110 pentatricopeptide repeat-containing protein At2g373103.0e-6486.9Show/hide
Query:  MNYGAYGRLIQRCTDHLFVPLGKQLHARLVLFSVSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFISYTLHNMHTDMLKLFSSLVNSNA
        MNYGAYGRLIQ CTD  F  LGKQLHARLVL SV+PDNFLGSKLIA YSKSGSLRDAYNVF +I+HKNIFSWNALFISYTLHNMH DMLKLFSSLVN N+
Subjt:  MNYGAYGRLIQRCTDHLFVPLGKQLHARLVLFSVSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFISYTLHNMHTDMLKLFSSLVNSNA

Query:  TDVKPDKFTITCVLKALASLFSNSILAKEVHCFILRRGLESDILL
        TDVKPDKFT+TCVLKALASLF+NSILAKEVHCF+LRRGLESDI +
Subjt:  TDVKPDKFTITCVLKALASLFSNSILAKEVHCFILRRGLESDILL

A0A6J1J0S5 pentatricopeptide repeat-containing protein At2g373103.0e-6486.9Show/hide
Query:  MNYGAYGRLIQRCTDHLFVPLGKQLHARLVLFSVSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFISYTLHNMHTDMLKLFSSLVNSNA
        MNYGAYGRLIQ CTD  F  LGKQLHARLVL SV+PDNFLGSKLIA YSKSGSLRDAYNVF +I+HKNIFSWNALFISYTLHNMH DMLKLFSSLVN N+
Subjt:  MNYGAYGRLIQRCTDHLFVPLGKQLHARLVLFSVSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFISYTLHNMHTDMLKLFSSLVNSNA

Query:  TDVKPDKFTITCVLKALASLFSNSILAKEVHCFILRRGLESDILL
        TDVKPDKFT+TCVLKALASLF+NSILAKEVHCF+LRRGLESDI +
Subjt:  TDVKPDKFTITCVLKALASLFSNSILAKEVHCFILRRGLESDILL

SwissProt top hitse value%identityAlignment
Q0WN60 Pentatricopeptide repeat-containing protein At1g184857.5e-1231.47Show/hide
Query:  AYGRLIQRCTDHLFVPLGKQLHARLVLFS--VSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFISYTLHNMHTDMLKLFSSLVNSNATD
        A G L+Q       + +G+++H +LV  S  +  D+ L +++I  Y+  GS  D+  VF  +  KN+F WNA+  SY+ + ++ ++L+ F  ++++  TD
Subjt:  AYGRLIQRCTDHLFVPLGKQLHARLVLFS--VSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFISYTLHNMHTDMLKLFSSLVNSNATD

Query:  VKPDKFTITCVLKALASLFSNSILAKEVHCFILRRGLESDILL
        + PD FT  CV+KA A + S+  +   VH  +++ GL  D+ +
Subjt:  VKPDKFTITCVLKALASLFSNSILAKEVHCFILRRGLESDILL

Q9FXH1 Pentatricopeptide repeat-containing protein At1g197206.8e-1331.62Show/hide
Query:  YGRLIQRCTDHLFVPLGKQLHARLVLFSVSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFISYTLHNMHTDMLKLFSSLVNSNATDVKP
        Y +L++ C D   + LG+ LHAR  LF+  PD F+ +KL++ Y+K G + DA  VF ++  +N+F+W+A+  +Y+  N   ++ KLF  ++      V P
Subjt:  YGRLIQRCTDHLFVPLGKQLHARLVLFSVSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFISYTLHNMHTDMLKLFSSLVNSNATDVKP

Query:  DKFTITCVLKALASLFSNSILAKEVHCFILRRGLES
        D F    +L+  A+   +    K +H  +++ G+ S
Subjt:  DKFTITCVLKALASLFSNSILAKEVHCFILRRGLES

Q9STE1 Pentatricopeptide repeat-containing protein At4g213002.2e-1129.32Show/hide
Query:  CTDHLFVPLGKQLHARLVLFSVSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFISYTLHNMHTDMLKLFSSLVNSNATDVKPDKFTITC
        C   L + LG QLH  +V+  V  +  + + L++ YSK G   DA  +F  ++  +  +WN +   Y    +  + L  F  +++S    V PD  T + 
Subjt:  CTDHLFVPLGKQLHARLVLFSVSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFISYTLHNMHTDMLKLFSSLVNSNATDVKPDKFTITC

Query:  VLKALASLFSNSILAKEVHCFILRRGLESDILL
        +L ++ S F N    K++HC+I+R  +  DI L
Subjt:  VLKALASLFSNSILAKEVHCFILRRGLESDILL

Q9SY02 Pentatricopeptide repeat-containing protein At4g027503.4e-1232Show/hide
Query:  LQAIRRADGMNYGAYGRLIQRCTDHLFVPLGKQLHARLVLFSVSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFISYTLHNMHTDMLKL
        +Q  R    +N  ++   +  C D + + LGKQLH RLV        F+G+ L+  Y K GS+ +A ++F  +  K+I SWN +   Y+ H      L+ 
Subjt:  LQAIRRADGMNYGAYGRLIQRCTDHLFVPLGKQLHARLVLFSVSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFISYTLHNMHTDMLKL

Query:  FSSLVNSNATDVKPDKFTITCVLKA
        F S+       +KPD  T+  VL A
Subjt:  FSSLVNSNATDVKPDKFTITCVLKA

Q9ZUT5 Pentatricopeptide repeat-containing protein At2g373103.4e-2844.38Show/hide
Query:  ALQAIRRADGMNYGAYGRLIQRCTDHLFVPLGKQLHARLVLFSVSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFISYTLHNMHTDMLK
        ALQ +     ++ GAYG LIQ  T H       QLHAR+V+FS+ PDNFL SKLI+FY++    R A +VF  IT +N FS+NAL I+YT   M+ D   
Subjt:  ALQAIRRADGMNYGAYGRLIQRCTDHLFVPLGKQLHARLVLFSVSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFISYTLHNMHTDMLK

Query:  LFSSLVNS---NATDVKPDKFTITCVLKALASL--FSNSILAKEVHCFILRRGLESDILL
        LF S + S   ++   +PD  +I+CVLKAL+    F    LA++VH F++R G +SD+ +
Subjt:  LFSSLVNS---NATDVKPDKFTITCVLKALASL--FSNSILAKEVHCFILRRGLESDILL

Arabidopsis top hitse value%identityAlignment
AT1G18485.1 Pentatricopeptide repeat (PPR) superfamily protein5.4e-1331.47Show/hide
Query:  AYGRLIQRCTDHLFVPLGKQLHARLVLFS--VSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFISYTLHNMHTDMLKLFSSLVNSNATD
        A G L+Q       + +G+++H +LV  S  +  D+ L +++I  Y+  GS  D+  VF  +  KN+F WNA+  SY+ + ++ ++L+ F  ++++  TD
Subjt:  AYGRLIQRCTDHLFVPLGKQLHARLVLFS--VSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFISYTLHNMHTDMLKLFSSLVNSNATD

Query:  VKPDKFTITCVLKALASLFSNSILAKEVHCFILRRGLESDILL
        + PD FT  CV+KA A + S+  +   VH  +++ GL  D+ +
Subjt:  VKPDKFTITCVLKALASLFSNSILAKEVHCFILRRGLESDILL

AT1G19720.1 Pentatricopeptide repeat (PPR-like) superfamily protein4.8e-1431.62Show/hide
Query:  YGRLIQRCTDHLFVPLGKQLHARLVLFSVSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFISYTLHNMHTDMLKLFSSLVNSNATDVKP
        Y +L++ C D   + LG+ LHAR  LF+  PD F+ +KL++ Y+K G + DA  VF ++  +N+F+W+A+  +Y+  N   ++ KLF  ++      V P
Subjt:  YGRLIQRCTDHLFVPLGKQLHARLVLFSVSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFISYTLHNMHTDMLKLFSSLVNSNATDVKP

Query:  DKFTITCVLKALASLFSNSILAKEVHCFILRRGLES
        D F    +L+  A+   +    K +H  +++ G+ S
Subjt:  DKFTITCVLKALASLFSNSILAKEVHCFILRRGLES

AT2G37310.1 Pentatricopeptide repeat (PPR) superfamily protein2.4e-2944.38Show/hide
Query:  ALQAIRRADGMNYGAYGRLIQRCTDHLFVPLGKQLHARLVLFSVSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFISYTLHNMHTDMLK
        ALQ +     ++ GAYG LIQ  T H       QLHAR+V+FS+ PDNFL SKLI+FY++    R A +VF  IT +N FS+NAL I+YT   M+ D   
Subjt:  ALQAIRRADGMNYGAYGRLIQRCTDHLFVPLGKQLHARLVLFSVSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFISYTLHNMHTDMLK

Query:  LFSSLVNS---NATDVKPDKFTITCVLKALASL--FSNSILAKEVHCFILRRGLESDILL
        LF S + S   ++   +PD  +I+CVLKAL+    F    LA++VH F++R G +SD+ +
Subjt:  LFSSLVNS---NATDVKPDKFTITCVLKALASL--FSNSILAKEVHCFILRRGLESDILL

AT4G02750.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.4e-1332Show/hide
Query:  LQAIRRADGMNYGAYGRLIQRCTDHLFVPLGKQLHARLVLFSVSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFISYTLHNMHTDMLKL
        +Q  R    +N  ++   +  C D + + LGKQLH RLV        F+G+ L+  Y K GS+ +A ++F  +  K+I SWN +   Y+ H      L+ 
Subjt:  LQAIRRADGMNYGAYGRLIQRCTDHLFVPLGKQLHARLVLFSVSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFISYTLHNMHTDMLKL

Query:  FSSLVNSNATDVKPDKFTITCVLKA
        F S+       +KPD  T+  VL A
Subjt:  FSSLVNSNATDVKPDKFTITCVLKA

AT4G21300.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.6e-1229.32Show/hide
Query:  CTDHLFVPLGKQLHARLVLFSVSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFISYTLHNMHTDMLKLFSSLVNSNATDVKPDKFTITC
        C   L + LG QLH  +V+  V  +  + + L++ YSK G   DA  +F  ++  +  +WN +   Y    +  + L  F  +++S    V PD  T + 
Subjt:  CTDHLFVPLGKQLHARLVLFSVSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFISYTLHNMHTDMLKLFSSLVNSNATDVKPDKFTITC

Query:  VLKALASLFSNSILAKEVHCFILRRGLESDILL
        +L ++ S F N    K++HC+I+R  +  DI L
Subjt:  VLKALASLFSNSILAKEVHCFILRRGLESDILL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATGGAATGCGAAGCCCACAAAGCCTTCAAATCTCAATTCCCGCCGGCGCCATTTTTCCATGGGCTCTGCAAGCGATCCGCCGCGCCGACGGGATGAACTACGGCGC
TTATGGCCGCCTTATCCAGCGGTGCACCGACCACCTCTTCGTCCCCCTCGGTAAGCAGCTTCACGCCCGTCTTGTTCTATTTTCCGTCTCTCCCGACAACTTCCTCGGAT
CGAAGCTCATCGCCTTCTACTCAAAATCCGGCAGCCTTCGGGATGCCTACAATGTGTTCGTTAACATTACTCATAAGAACATTTTCTCATGGAATGCTTTGTTCATCAGC
TACACTCTTCACAATATGCACACTGATATGCTGAAACTGTTTTCGTCTTTGGTTAATTCAAATGCGACGGATGTCAAACCTGATAAGTTTACGATCACTTGTGTTTTGAA
AGCGTTGGCGTCGTTGTTTTCCAATTCGATTTTGGCTAAGGAAGTTCATTGTTTCATTCTTCGACGAGGACTTGAGTCTGATATTTTGTTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAATGGAATGCGAAGCCCACAAAGCCTTCAAATCTCAATTCCCGCCGGCGCCATTTTTCCATGGGCTCTGCAAGCGATCCGCCGCGCCGACGGGATGAACTACGGCGC
TTATGGCCGCCTTATCCAGCGGTGCACCGACCACCTCTTCGTCCCCCTCGGTAAGCAGCTTCACGCCCGTCTTGTTCTATTTTCCGTCTCTCCCGACAACTTCCTCGGAT
CGAAGCTCATCGCCTTCTACTCAAAATCCGGCAGCCTTCGGGATGCCTACAATGTGTTCGTTAACATTACTCATAAGAACATTTTCTCATGGAATGCTTTGTTCATCAGC
TACACTCTTCACAATATGCACACTGATATGCTGAAACTGTTTTCGTCTTTGGTTAATTCAAATGCGACGGATGTCAAACCTGATAAGTTTACGATCACTTGTGTTTTGAA
AGCGTTGGCGTCGTTGTTTTCCAATTCGATTTTGGCTAAGGAAGTTCATTGTTTCATTCTTCGACGAGGACTTGAGTCTGATATTTTGTTGTGA
Protein sequenceShow/hide protein sequence
MNGMRSPQSLQISIPAGAIFPWALQAIRRADGMNYGAYGRLIQRCTDHLFVPLGKQLHARLVLFSVSPDNFLGSKLIAFYSKSGSLRDAYNVFVNITHKNIFSWNALFIS
YTLHNMHTDMLKLFSSLVNSNATDVKPDKFTITCVLKALASLFSNSILAKEVHCFILRRGLESDILL