; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr007540 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr007540
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionproline-rich protein 1
Genome locationtig00005754:22048..23698
RNA-Seq ExpressionSgr007540
SyntenySgr007540
Gene Ontology termsGO:0071944 - cell periphery (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575813.1 Proline-rich protein 3, partial [Cucurbita argyrosperma subsp. sororia]3.9e-7071.22Show/hide
Query:  MALARQL-SATPVLLLWLLAAGAVSSSADYNGFSDREDYAPPKPGGGADDNGVPFPTTLPIYEKP-PPYGHEM-PPLTIAVEGVVSCKNGSKYSPLKGVV
        MALAR+L +ATPV+LLWLLAA +VSS+ADY  F D ++Y      GGA DNG+      PIYEKP P YG E+  PL IAVEGVVSC+N ++Y PLKGVV
Subjt:  MALARQL-SATPVLLLWLLAAGAVSSSADYNGFSDREDYAPPKPGGGADDNGVPFPTTLPIYEKP-PPYGHEM-PPLTIAVEGVVSCKNGSKYSPLKGVV

Query:  ARITCLALNEKGNEAAPFSFSSFPTDEHGYFLAILSPSKLKGKAKVTQCKAFLPPSSPCEYCKYLTDINNGVAGALLRSFRILTKNKMKLYSVGPFFYTS
        ARITC+ LNEKGNE APFSFSSFP+DEHGYFLA LS SKLKGKAKVT+CKAFLPPSSPCE CKYLT++NNGV GAL RSFRIL+   MKLYSVGPF Y+S
Subjt:  ARITCLALNEKGNEAAPFSFSSFPTDEHGYFLAILSPSKLKGKAKVTQCKAFLPPSSPCEYCKYLTDINNGVAGALLRSFRILTKNKMKLYSVGPFFYTS

Query:  QPNTL
        Q N L
Subjt:  QPNTL

XP_022152066.1 proline-rich protein 1 [Momordica charantia]7.1e-8080.88Show/hide
Query:  MALARQLSATPVLLLWLLAAGAVSSSADYNGFSDREDYAPPKP--GGGADDNGVPFPTTLPIYEKPPP-YGHEMP-PLTIAVEGVVSCKNGSKYSPLKGV
        MAL RQ SATPVLLL LL   AVSS+AD  G  D EDYAPPK   G G  D   PFPTTLPIYEKPPP YG E P PLTIAVEGVVSCK GSKYSPLKGV
Subjt:  MALARQLSATPVLLLWLLAAGAVSSSADYNGFSDREDYAPPKP--GGGADDNGVPFPTTLPIYEKPPP-YGHEMP-PLTIAVEGVVSCKNGSKYSPLKGV

Query:  VARITCLALNEKGNEAAPFSFSSFPTDEHGYFLAILSPSKLKGKAKVTQCKAFLPPSSPCEYCKYLTDINNGVAGALLRSFRILTKNKMKLYSVGPFFYT
        VARITC+ALNEKGNE APFSFSSFPTDEHGYFLA LSPSKLKGKAKVTQCK FLPP SPCE CKY TDINNGV GAL  SFRILT  KMKLYSVGPFFYT
Subjt:  VARITCLALNEKGNEAAPFSFSSFPTDEHGYFLAILSPSKLKGKAKVTQCKAFLPPSSPCEYCKYLTDINNGVAGALLRSFRILTKNKMKLYSVGPFFYT

Query:  SQPN
        S+PN
Subjt:  SQPN

XP_022953718.1 proline-rich protein 3-like [Cucurbita moschata]1.6e-6871.5Show/hide
Query:  MALARQL-SATPVLLLWLLAAGAVSSSADYNGFSDREDYAPPKPGGGADDNGVPFPTTLPIYEKP-PPYGHEM-PPLTIAVEGVVSCKNGSKYSPLKGVV
        MALAR+L +ATPV+LLWLLAA +VSS+ADY  F D +++      GGA DNG+      PIYEKP P YG E+  PL IAVEGVVSC+N ++Y PLKGVV
Subjt:  MALARQL-SATPVLLLWLLAAGAVSSSADYNGFSDREDYAPPKPGGGADDNGVPFPTTLPIYEKP-PPYGHEM-PPLTIAVEGVVSCKNGSKYSPLKGVV

Query:  ARITCLALNEKGNEAAPFSFSSFPTDEHGYFLAILSPSKLKGKAKVTQCKAFLPPSSPCEYCKYLTDINNGVAGALLRSFRILTKNKMKLYSVGPFFYTS
        ARITC+ LNEKGNE APFSFSSFP+DEHGYFLA LS SKLKGKAKVT+CKAFLPPSSPCE CKYLT++NNGV GAL RSFRILT   MKLYSVGPF Y+S
Subjt:  ARITCLALNEKGNEAAPFSFSSFPTDEHGYFLAILSPSKLKGKAKVTQCKAFLPPSSPCEYCKYLTDINNGVAGALLRSFRILTKNKMKLYSVGPFFYTS

XP_022991244.1 proline-rich protein 3-like [Cucurbita maxima]2.8e-6870.73Show/hide
Query:  MALARQL-SATPVLLLWLLAAGAVSSSADYNGFSDREDYAPPKPGGGADDNGVPFPTTLPIYEKP-PPYGHEM-PPLTIAVEGVVSCKNGSKYSPLKGVV
        MA AR+L +ATPV+LL LLAA +VSS+ADY  F D ++Y      GGA DNG+      PIYEKP P YG E+  PL IAVEGVVSCKN ++Y PLKGVV
Subjt:  MALARQL-SATPVLLLWLLAAGAVSSSADYNGFSDREDYAPPKPGGGADDNGVPFPTTLPIYEKP-PPYGHEM-PPLTIAVEGVVSCKNGSKYSPLKGVV

Query:  ARITCLALNEKGNEAAPFSFSSFPTDEHGYFLAILSPSKLKGKAKVTQCKAFLPPSSPCEYCKYLTDINNGVAGALLRSFRILTKNKMKLYSVGPFFYTS
        ARI C+ LNEKGNE APFSFSSFP+DEHGYFLA LS SKLKGKAKVT+CKAFLPPSSPCE CKYLT++NNGV GAL RSFRILT   MKLYSVGPF Y+S
Subjt:  ARITCLALNEKGNEAAPFSFSSFPTDEHGYFLAILSPSKLKGKAKVTQCKAFLPPSSPCEYCKYLTDINNGVAGALLRSFRILTKNKMKLYSVGPFFYTS

Query:  QPNTL
        Q N L
Subjt:  QPNTL

XP_023549401.1 proline-rich protein 3-like [Cucurbita pepo subsp. pepo]2.3e-7071.71Show/hide
Query:  MALARQL-SATPVLLLWLLAAGAVSSSADYNGFSDREDYAPPKPGGGADDNGVPFPTTLPIYEKP-PPYGHEM-PPLTIAVEGVVSCKNGSKYSPLKGVV
        MAL R+L +ATPV+LLWLLAA +VSS+ADY  F D ++Y      GGA DNG+      PIYEKP P YG E+  PL IAVEGVVSCKN ++Y PLKGVV
Subjt:  MALARQL-SATPVLLLWLLAAGAVSSSADYNGFSDREDYAPPKPGGGADDNGVPFPTTLPIYEKP-PPYGHEM-PPLTIAVEGVVSCKNGSKYSPLKGVV

Query:  ARITCLALNEKGNEAAPFSFSSFPTDEHGYFLAILSPSKLKGKAKVTQCKAFLPPSSPCEYCKYLTDINNGVAGALLRSFRILTKNKMKLYSVGPFFYTS
        ARITC+ LNEKGNE APFSFSSFP+DEHGYFLA LS SKLKGKAKVT+CKAFLPPSSPCE CKYLT++NNGV GAL RSFRILT   MKLYSVGPF Y+S
Subjt:  ARITCLALNEKGNEAAPFSFSSFPTDEHGYFLAILSPSKLKGKAKVTQCKAFLPPSSPCEYCKYLTDINNGVAGALLRSFRILTKNKMKLYSVGPFFYTS

Query:  QPNTL
        Q N L
Subjt:  QPNTL

TrEMBL top hitse value%identityAlignment
A0A1S3BRP9 proline-rich protein 3-like1.2e-6468.63Show/hide
Query:  MALARQL--SATPVLLLWLLAAGAVSSSADYNGFSDREDYAPPKPGGGADDNGVPFPTTLPIYEKP-PPYGHE-MPPLTIAVEGVVSCKNGSKYSPLKGV
        MALARQL  SA PVLLLWLLA+  VSS+ADY  F D              D GV      PIY+K  P YG + M PL IAVEGVVSCKNG+KY PLKG+
Subjt:  MALARQL--SATPVLLLWLLAAGAVSSSADYNGFSDREDYAPPKPGGGADDNGVPFPTTLPIYEKP-PPYGHE-MPPLTIAVEGVVSCKNGSKYSPLKGV

Query:  VARITCLALNEKGNEAAPFSFSSFPTDEHGYFLAILSPSKLKGKAKVTQCKAFLPPSSPCEYCKYLTDINNGVAGALLRSFRILTKNKMKLYSVGPFFYT
        VAR TC+ALNEKG E APFSFSSFP+D +GYFLA LS S+LKGKAKVTQCKAFLPP SPCE CKYLT++N+GVAGAL RSFRILT  KMKLYS+G FFY+
Subjt:  VARITCLALNEKGNEAAPFSFSSFPTDEHGYFLAILSPSKLKGKAKVTQCKAFLPPSSPCEYCKYLTDINNGVAGALLRSFRILTKNKMKLYSVGPFFYT

Query:  SQPN
        SQPN
Subjt:  SQPN

A0A5A7VJJ1 Proline-rich protein 3-like1.2e-6468.63Show/hide
Query:  MALARQL--SATPVLLLWLLAAGAVSSSADYNGFSDREDYAPPKPGGGADDNGVPFPTTLPIYEKP-PPYGHE-MPPLTIAVEGVVSCKNGSKYSPLKGV
        MALARQL  SA PVLLLWLLA+  VSS+ADY  F D              D GV      PIY+K  P YG + M PL IAVEGVVSCKNG+KY PLKG+
Subjt:  MALARQL--SATPVLLLWLLAAGAVSSSADYNGFSDREDYAPPKPGGGADDNGVPFPTTLPIYEKP-PPYGHE-MPPLTIAVEGVVSCKNGSKYSPLKGV

Query:  VARITCLALNEKGNEAAPFSFSSFPTDEHGYFLAILSPSKLKGKAKVTQCKAFLPPSSPCEYCKYLTDINNGVAGALLRSFRILTKNKMKLYSVGPFFYT
        VAR TC+ALNEKG E APFSFSSFP+D +GYFLA LS S+LKGKAKVTQCKAFLPP SPCE CKYLT++N+GVAGAL RSFRILT  KMKLYS+G FFY+
Subjt:  VARITCLALNEKGNEAAPFSFSSFPTDEHGYFLAILSPSKLKGKAKVTQCKAFLPPSSPCEYCKYLTDINNGVAGALLRSFRILTKNKMKLYSVGPFFYT

Query:  SQPN
        SQPN
Subjt:  SQPN

A0A6J1DGI4 proline-rich protein 13.4e-8080.88Show/hide
Query:  MALARQLSATPVLLLWLLAAGAVSSSADYNGFSDREDYAPPKP--GGGADDNGVPFPTTLPIYEKPPP-YGHEMP-PLTIAVEGVVSCKNGSKYSPLKGV
        MAL RQ SATPVLLL LL   AVSS+AD  G  D EDYAPPK   G G  D   PFPTTLPIYEKPPP YG E P PLTIAVEGVVSCK GSKYSPLKGV
Subjt:  MALARQLSATPVLLLWLLAAGAVSSSADYNGFSDREDYAPPKP--GGGADDNGVPFPTTLPIYEKPPP-YGHEMP-PLTIAVEGVVSCKNGSKYSPLKGV

Query:  VARITCLALNEKGNEAAPFSFSSFPTDEHGYFLAILSPSKLKGKAKVTQCKAFLPPSSPCEYCKYLTDINNGVAGALLRSFRILTKNKMKLYSVGPFFYT
        VARITC+ALNEKGNE APFSFSSFPTDEHGYFLA LSPSKLKGKAKVTQCK FLPP SPCE CKY TDINNGV GAL  SFRILT  KMKLYSVGPFFYT
Subjt:  VARITCLALNEKGNEAAPFSFSSFPTDEHGYFLAILSPSKLKGKAKVTQCKAFLPPSSPCEYCKYLTDINNGVAGALLRSFRILTKNKMKLYSVGPFFYT

Query:  SQPN
        S+PN
Subjt:  SQPN

A0A6J1GP42 proline-rich protein 3-like8.0e-6971.5Show/hide
Query:  MALARQL-SATPVLLLWLLAAGAVSSSADYNGFSDREDYAPPKPGGGADDNGVPFPTTLPIYEKP-PPYGHEM-PPLTIAVEGVVSCKNGSKYSPLKGVV
        MALAR+L +ATPV+LLWLLAA +VSS+ADY  F D +++      GGA DNG+      PIYEKP P YG E+  PL IAVEGVVSC+N ++Y PLKGVV
Subjt:  MALARQL-SATPVLLLWLLAAGAVSSSADYNGFSDREDYAPPKPGGGADDNGVPFPTTLPIYEKP-PPYGHEM-PPLTIAVEGVVSCKNGSKYSPLKGVV

Query:  ARITCLALNEKGNEAAPFSFSSFPTDEHGYFLAILSPSKLKGKAKVTQCKAFLPPSSPCEYCKYLTDINNGVAGALLRSFRILTKNKMKLYSVGPFFYTS
        ARITC+ LNEKGNE APFSFSSFP+DEHGYFLA LS SKLKGKAKVT+CKAFLPPSSPCE CKYLT++NNGV GAL RSFRILT   MKLYSVGPF Y+S
Subjt:  ARITCLALNEKGNEAAPFSFSSFPTDEHGYFLAILSPSKLKGKAKVTQCKAFLPPSSPCEYCKYLTDINNGVAGALLRSFRILTKNKMKLYSVGPFFYTS

A0A6J1JUA4 proline-rich protein 3-like1.4e-6870.73Show/hide
Query:  MALARQL-SATPVLLLWLLAAGAVSSSADYNGFSDREDYAPPKPGGGADDNGVPFPTTLPIYEKP-PPYGHEM-PPLTIAVEGVVSCKNGSKYSPLKGVV
        MA AR+L +ATPV+LL LLAA +VSS+ADY  F D ++Y      GGA DNG+      PIYEKP P YG E+  PL IAVEGVVSCKN ++Y PLKGVV
Subjt:  MALARQL-SATPVLLLWLLAAGAVSSSADYNGFSDREDYAPPKPGGGADDNGVPFPTTLPIYEKP-PPYGHEM-PPLTIAVEGVVSCKNGSKYSPLKGVV

Query:  ARITCLALNEKGNEAAPFSFSSFPTDEHGYFLAILSPSKLKGKAKVTQCKAFLPPSSPCEYCKYLTDINNGVAGALLRSFRILTKNKMKLYSVGPFFYTS
        ARI C+ LNEKGNE APFSFSSFP+DEHGYFLA LS SKLKGKAKVT+CKAFLPPSSPCE CKYLT++NNGV GAL RSFRILT   MKLYSVGPF Y+S
Subjt:  ARITCLALNEKGNEAAPFSFSSFPTDEHGYFLAILSPSKLKGKAKVTQCKAFLPPSSPCEYCKYLTDINNGVAGALLRSFRILTKNKMKLYSVGPFFYTS

Query:  QPNTL
        Q N L
Subjt:  QPNTL

SwissProt top hitse value%identityAlignment
O81417 Protein SEED AND ROOT HAIR PROTECTIVE PROTEIN1.3e-2846.38Show/hide
Query:  YGHEMPPLT-----IAVEGVVSCKNGSKYSPLKGVVARITCLALNEKGNEAAPFSFSSFPTDEHGYFLAILSPSKLKGKAKVTQCKAFLPPSSPCEYCKY
        YG+E  P T     IAVEG++ CK+G K  P++G  ARI C+ ++  GNE  P S  S  TD  GYF+A + PS+L+    VT+CK +L   SP   C +
Subjt:  YGHEMPPLT-----IAVEGVVSCKNGSKYSPLKGVVARITCLALNEKGNEAAPFSFSSFPTDEHGYFLAILSPSKLKGKAKVTQCKAFLPPSSPCEYCKY

Query:  LTDINNGVAGALLRSFRILTKNKMKLYSVGPFFYTSQP
         TD+N GV G  L ++RIL     KLY  GPFFYTS+P
Subjt:  LTDINNGVAGALLRSFRILTKNKMKLYSVGPFFYTSQP

Q9FZ35 Proline-rich protein 14.6e-1330.87Show/hide
Query:  PIYEKPPPYGHE----------MPPLTIAVEGVVSCKNGSKYSPLKGVVARITCLALNEKGNEAAPFSFSSFPTDEHGYFLAILSPSKLKGKAKVTQCKA
        P Y  PPP+  +          +P +  AV G++ CKNG +  P++G  A+I C                S PTD  GYF  +L+  K      ++ C+ 
Subjt:  PIYEKPPPYGHE----------MPPLTIAVEGVVSCKNGSKYSPLKGVVARITCLALNEKGNEAAPFSFSSFPTDEHGYFLAILSPSKLKGKAKVTQCKA

Query:  FLPPSSPCEYCKYLTDINNGVAGALLRSFRILTKNKMKLYSVGPFFYTS
         L  +SP E CK  T++N G+ G     F + +   +KL++VGPF++T+
Subjt:  FLPPSSPCEYCKYLTDINNGVAGALLRSFRILTKNKMKLYSVGPFFYTS

Q9LZJ7 Proline-rich protein 37.8e-1331.85Show/hide
Query:  PIYEKPPPYGHEMPP-----------------LTIAVEGVVSCKNGSKYSPLKGVVARITCLALNEKGNEAAPFSFSSFPTDEHGYFLAILSPSKLKGKA
        P+Y+K P Y    PP                 +  AV+G++ CKNG +  P+ G   +I C      G         S PTD  GYF   L+  K     
Subjt:  PIYEKPPPYGHEMPP-----------------LTIAVEGVVSCKNGSKYSPLKGVVARITCLALNEKGNEAAPFSFSSFPTDEHGYFLAILSPSKLKGKA

Query:  KVTQCKAFLPPSSPCEYCKYLTDINNGVAGA--LLRSFRILTKNKMKLYSVGPFFYT
        +V   K +L   SP E CK  T++N G+ G    L  +R      ++L+SVGPF+YT
Subjt:  KVTQCKAFLPPSSPCEYCKYLTDINNGVAGA--LLRSFRILTKNKMKLYSVGPFFYT

Arabidopsis top hitse value%identityAlignment
AT1G54970.1 proline-rich protein 13.2e-1430.87Show/hide
Query:  PIYEKPPPYGHE----------MPPLTIAVEGVVSCKNGSKYSPLKGVVARITCLALNEKGNEAAPFSFSSFPTDEHGYFLAILSPSKLKGKAKVTQCKA
        P Y  PPP+  +          +P +  AV G++ CKNG +  P++G  A+I C                S PTD  GYF  +L+  K      ++ C+ 
Subjt:  PIYEKPPPYGHE----------MPPLTIAVEGVVSCKNGSKYSPLKGVVARITCLALNEKGNEAAPFSFSSFPTDEHGYFLAILSPSKLKGKAKVTQCKA

Query:  FLPPSSPCEYCKYLTDINNGVAGALLRSFRILTKNKMKLYSVGPFFYTS
         L  +SP E CK  T++N G+ G     F + +   +KL++VGPF++T+
Subjt:  FLPPSSPCEYCKYLTDINNGVAGALLRSFRILTKNKMKLYSVGPFFYTS

AT2G47530.1 Pollen Ole e 1 allergen and extensin family protein4.1e-1734.59Show/hide
Query:  LWLLAAGAVSSSADYNGFSDREDYAPPKPGGGADDNGVPFPTTLPIYEKPPPYGHEMPPLTIAVEGVVSCKNGSKYSPLKGVVARITCLALNEKGNEAAP
        L LLA   V ++ADY        YA P+P        VP PTT        PY  +  P  IA+EG + CK+G K  P++G   ++ C  ++  G   A 
Subjt:  LWLLAAGAVSSSADYNGFSDREDYAPPKPGGGADDNGVPFPTTLPIYEKPPPYGHEMPPLTIAVEGVVSCKNGSKYSPLKGVVARITCLALNEKGNEAAP

Query:  FSFSSFPTDEHGYFLAILSPSKLKGKAKVTQCKAFLPPSSPCEYCKYLTDINNGVAGALLR--SFRILTKNKMKLYSVGPFFYTS
         + SS+PTD  GYF  I      K    ++ CK  L  SSP   CK  T++N GV GA L   + + L+ + + LY++ PF+++S
Subjt:  FSFSSFPTDEHGYFLAILSPSKLKGKAKVTQCKAFLPPSSPCEYCKYLTDINNGVAGALLR--SFRILTKNKMKLYSVGPFFYTS

AT2G47540.1 Pollen Ole e 1 allergen and extensin family protein1.1e-2546.46Show/hide
Query:  IAVEGVVSCKNGSKYSPLKGVVARITCLALNEKGNEAAPFSFSSFPTDEHGYFLAILSPSKLKGKAKV---TQCKAFLPPSSPCEYCKYLTDINNGVAGA
        I V+G++ CK GSK +P++G VAR+TC   +E G EA   +  S  TD  GYFLA LS S++K   KV    +C+AFL   SP + C + T+IN G++GA
Subjt:  IAVEGVVSCKNGSKYSPLKGVVARITCLALNEKGNEAAPFSFSSFPTDEHGYFLAILSPSKLKGKAKV---TQCKAFLPPSSPCEYCKYLTDINNGVAGA

Query:  LLRSFRIL-TKNKMKLYSVGPFFYTSQ
        +L+++R+L  K KMKL++VGPF ++S+
Subjt:  LLRSFRIL-TKNKMKLYSVGPFFYTSQ

AT3G62680.1 proline-rich protein 35.5e-1431.85Show/hide
Query:  PIYEKPPPYGHEMPP-----------------LTIAVEGVVSCKNGSKYSPLKGVVARITCLALNEKGNEAAPFSFSSFPTDEHGYFLAILSPSKLKGKA
        P+Y+K P Y    PP                 +  AV+G++ CKNG +  P+ G   +I C      G         S PTD  GYF   L+  K     
Subjt:  PIYEKPPPYGHEMPP-----------------LTIAVEGVVSCKNGSKYSPLKGVVARITCLALNEKGNEAAPFSFSSFPTDEHGYFLAILSPSKLKGKA

Query:  KVTQCKAFLPPSSPCEYCKYLTDINNGVAGA--LLRSFRILTKNKMKLYSVGPFFYT
        +V   K +L   SP E CK  T++N G+ G    L  +R      ++L+SVGPF+YT
Subjt:  KVTQCKAFLPPSSPCEYCKYLTDINNGVAGA--LLRSFRILTKNKMKLYSVGPFFYT

AT4G02270.1 root hair specific 139.4e-3046.38Show/hide
Query:  YGHEMPPLT-----IAVEGVVSCKNGSKYSPLKGVVARITCLALNEKGNEAAPFSFSSFPTDEHGYFLAILSPSKLKGKAKVTQCKAFLPPSSPCEYCKY
        YG+E  P T     IAVEG++ CK+G K  P++G  ARI C+ ++  GNE  P S  S  TD  GYF+A + PS+L+    VT+CK +L   SP   C +
Subjt:  YGHEMPPLT-----IAVEGVVSCKNGSKYSPLKGVVARITCLALNEKGNEAAPFSFSSFPTDEHGYFLAILSPSKLKGKAKVTQCKAFLPPSSPCEYCKY

Query:  LTDINNGVAGALLRSFRILTKNKMKLYSVGPFFYTSQP
         TD+N GV G  L ++RIL     KLY  GPFFYTS+P
Subjt:  LTDINNGVAGALLRSFRILTKNKMKLYSVGPFFYTSQP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCTCGCAAGGCAACTCTCTGCAACTCCGGTGCTGCTGCTGTGGCTGCTGGCCGCCGGCGCCGTTTCTTCCTCCGCTGATTATAATGGCTTTTCCGACAGGGAAGA
TTACGCGCCACCGAAGCCTGGAGGTGGAGCAGACGACAACGGAGTGCCGTTTCCGACGACCTTGCCAATCTACGAGAAACCGCCCCCGTACGGCCATGAAATGCCGCCGC
TCACCATCGCCGTTGAAGGAGTTGTTTCTTGTAAAAATGGCTCCAAATATTCTCCACTCAAAGGAGTTGTAGCAAGAATTACATGCTTAGCTTTGAACGAGAAAGGCAAC
GAAGCGGCTCCCTTCTCCTTCTCCAGCTTTCCAACAGACGAGCATGGGTACTTCTTGGCGATATTGTCACCTTCCAAGCTCAAGGGCAAGGCCAAGGTGACACAGTGCAA
GGCCTTCCTTCCCCCTTCGTCGCCATGCGAGTACTGCAAATACCTTACCGACATCAACAATGGCGTCGCGGGTGCTCTGCTTCGTTCTTTTCGCATTCTCACGAAGAACA
AGATGAAGTTGTATTCCGTTGGACCTTTCTTCTACACCTCACAACCAAACACCTTATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTCTCGCAAGGCAACTCTCTGCAACTCCGGTGCTGCTGCTGTGGCTGCTGGCCGCCGGCGCCGTTTCTTCCTCCGCTGATTATAATGGCTTTTCCGACAGGGAAGA
TTACGCGCCACCGAAGCCTGGAGGTGGAGCAGACGACAACGGAGTGCCGTTTCCGACGACCTTGCCAATCTACGAGAAACCGCCCCCGTACGGCCATGAAATGCCGCCGC
TCACCATCGCCGTTGAAGGAGTTGTTTCTTGTAAAAATGGCTCCAAATATTCTCCACTCAAAGGAGTTGTAGCAAGAATTACATGCTTAGCTTTGAACGAGAAAGGCAAC
GAAGCGGCTCCCTTCTCCTTCTCCAGCTTTCCAACAGACGAGCATGGGTACTTCTTGGCGATATTGTCACCTTCCAAGCTCAAGGGCAAGGCCAAGGTGACACAGTGCAA
GGCCTTCCTTCCCCCTTCGTCGCCATGCGAGTACTGCAAATACCTTACCGACATCAACAATGGCGTCGCGGGTGCTCTGCTTCGTTCTTTTCGCATTCTCACGAAGAACA
AGATGAAGTTGTATTCCGTTGGACCTTTCTTCTACACCTCACAACCAAACACCTTATGA
Protein sequenceShow/hide protein sequence
MALARQLSATPVLLLWLLAAGAVSSSADYNGFSDREDYAPPKPGGGADDNGVPFPTTLPIYEKPPPYGHEMPPLTIAVEGVVSCKNGSKYSPLKGVVARITCLALNEKGN
EAAPFSFSSFPTDEHGYFLAILSPSKLKGKAKVTQCKAFLPPSSPCEYCKYLTDINNGVAGALLRSFRILTKNKMKLYSVGPFFYTSQPNTL