; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr012591 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr012591
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description14 kDa proline-rich protein DC2.15-like
Genome locationtig00153449:74518..74928
RNA-Seq ExpressionSgr012591
SyntenySgr012591
Gene Ontology termsNA
InterPro domainsIPR016140 - Bifunctional inhibitor/plant lipid transfer protein/seed storage helical domain
IPR027923 - Hydrophobic seed protein domain
IPR036312 - Bifunctional inhibitor/plant lipid transfer protein/seed storage helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8650064.1 hypothetical protein Csa_010277 [Cucumis sativus]4.5e-5480.15Show/hide
Query:  MASKVTSATALLVLLNLLFFSLVSSTYVPCPPPPPKPRKGGYPKQPAPQPKCPTDALKLGVCADVLDGLVHVVVGKPPKTPCCSLIQGLADLEAALCLCT
        MASK++S T+LL+LLNLLFF++V+STYVPCPPPP K  KG   KQP PQPKCP D LKLGVCAD+LDGLVHVV+G PPKTPCC+LIQ L DLEAALCLCT
Subjt:  MASKVTSATALLVLLNLLFFSLVSSTYVPCPPPPPKPRKGGYPKQPAPQPKCPTDALKLGVCADVLDGLVHVVVGKPPKTPCCSLIQGLADLEAALCLCT

Query:  AIKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQCPA
        A+KAKALGL IDLSVSLSLLLNYCGKKVP GF+CPA
Subjt:  AIKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQCPA

XP_004147199.1 14 kDa proline-rich protein DC2.15 [Cucumis sativus]4.5e-5480.15Show/hide
Query:  MASKVTSATALLVLLNLLFFSLVSSTYVPCPPPPPKPRKGGYPKQPAPQPKCPTDALKLGVCADVLDGLVHVVVGKPPKTPCCSLIQGLADLEAALCLCT
        MASK++S T+LL+LLNLLFF++V+STYVPCPPPP K  KG   KQP PQPKCP D LKLGVCAD+LDGLVHVV+G PPKTPCC+LIQ L DLEAALCLCT
Subjt:  MASKVTSATALLVLLNLLFFSLVSSTYVPCPPPPPKPRKGGYPKQPAPQPKCPTDALKLGVCADVLDGLVHVVVGKPPKTPCCSLIQGLADLEAALCLCT

Query:  AIKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQCPA
        A+KAKALGL IDLSVSLSLLLNYCGKKVP GF+CPA
Subjt:  AIKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQCPA

XP_008448867.1 PREDICTED: 14 kDa proline-rich protein DC2.15-like [Cucumis melo]2.2e-5379.41Show/hide
Query:  MASKVTSATALLVLLNLLFFSLVSSTYVPCPPPPPKPRKGGYPKQPAPQPKCPTDALKLGVCADVLDGLVHVVVGKPPKTPCCSLIQGLADLEAALCLCT
        MASK++S T+LL+LLNLLFF++V+STYVPCPPPP K  KG   KQP PQPKCP D LKLGVCAD+LDGLV VV+G PPKTPCC+LI+ L DLEAALCLCT
Subjt:  MASKVTSATALLVLLNLLFFSLVSSTYVPCPPPPPKPRKGGYPKQPAPQPKCPTDALKLGVCADVLDGLVHVVVGKPPKTPCCSLIQGLADLEAALCLCT

Query:  AIKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQCPA
        A+KAKALGLNIDLSVSLSLLLNYCGKKVP GF+CPA
Subjt:  AIKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQCPA

XP_022154029.1 14 kDa proline-rich protein DC2.15-like [Momordica charantia]6.1e-5986.03Show/hide
Query:  MASKVTSATALLVLLNLLFFSLVSSTYVPCPPPPPKPRKGGYPKQPAPQPKCPTDALKLGVCADVLDGLVHVVVGKPPKTPCCSLIQGLADLEAALCLCT
        MASK+TS+T+ L+LLNLLFF+LV+ST VPCPPPP KP KGG PK  APQP+CPTD LKLGVCADVLDGLVHVV+G PPKTPCC+LIQGLADLEAA+CLCT
Subjt:  MASKVTSATALLVLLNLLFFSLVSSTYVPCPPPPPKPRKGGYPKQPAPQPKCPTDALKLGVCADVLDGLVHVVVGKPPKTPCCSLIQGLADLEAALCLCT

Query:  AIKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQCPA
        AIKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQCPA
Subjt:  AIKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQCPA

XP_038905266.1 14 kDa proline-rich protein DC2.15-like [Benincasa hispida]3.3e-5782.35Show/hide
Query:  MASKVTSATALLVLLNLLFFSLVSSTYVPCPPPPPKPRKGGYPKQPAPQPKCPTDALKLGVCADVLDGLVHVVVGKPPKTPCCSLIQGLADLEAALCLCT
        MASK++SAT+LL+L NLLFF++V+STYVPCPPPPPK  KG   KQPAPQP+CP D LKLGVCAD+LDGLVH+V+GKPPKTPCC+LIQ +ADLEAA+CLCT
Subjt:  MASKVTSATALLVLLNLLFFSLVSSTYVPCPPPPPKPRKGGYPKQPAPQPKCPTDALKLGVCADVLDGLVHVVVGKPPKTPCCSLIQGLADLEAALCLCT

Query:  AIKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQCPA
         IKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQCPA
Subjt:  AIKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQCPA

TrEMBL top hitse value%identityAlignment
A0A0A0L2A4 Proline-rich protein DC2.152.2e-5480.15Show/hide
Query:  MASKVTSATALLVLLNLLFFSLVSSTYVPCPPPPPKPRKGGYPKQPAPQPKCPTDALKLGVCADVLDGLVHVVVGKPPKTPCCSLIQGLADLEAALCLCT
        MASK++S T+LL+LLNLLFF++V+STYVPCPPPP K  KG   KQP PQPKCP D LKLGVCAD+LDGLVHVV+G PPKTPCC+LIQ L DLEAALCLCT
Subjt:  MASKVTSATALLVLLNLLFFSLVSSTYVPCPPPPPKPRKGGYPKQPAPQPKCPTDALKLGVCADVLDGLVHVVVGKPPKTPCCSLIQGLADLEAALCLCT

Query:  AIKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQCPA
        A+KAKALGL IDLSVSLSLLLNYCGKKVP GF+CPA
Subjt:  AIKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQCPA

A0A1S3BKP9 14 kDa proline-rich protein DC2.15-like1.1e-5379.41Show/hide
Query:  MASKVTSATALLVLLNLLFFSLVSSTYVPCPPPPPKPRKGGYPKQPAPQPKCPTDALKLGVCADVLDGLVHVVVGKPPKTPCCSLIQGLADLEAALCLCT
        MASK++S T+LL+LLNLLFF++V+STYVPCPPPP K  KG   KQP PQPKCP D LKLGVCAD+LDGLV VV+G PPKTPCC+LI+ L DLEAALCLCT
Subjt:  MASKVTSATALLVLLNLLFFSLVSSTYVPCPPPPPKPRKGGYPKQPAPQPKCPTDALKLGVCADVLDGLVHVVVGKPPKTPCCSLIQGLADLEAALCLCT

Query:  AIKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQCPA
        A+KAKALGLNIDLSVSLSLLLNYCGKKVP GF+CPA
Subjt:  AIKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQCPA

A0A5A7TDQ2 14 kDa proline-rich protein DC2.15-like1.1e-5379.41Show/hide
Query:  MASKVTSATALLVLLNLLFFSLVSSTYVPCPPPPPKPRKGGYPKQPAPQPKCPTDALKLGVCADVLDGLVHVVVGKPPKTPCCSLIQGLADLEAALCLCT
        MASK++S T+LL+LLNLLFF++V+STYVPCPPPP K  KG   KQP PQPKCP D LKLGVCAD+LDGLV VV+G PPKTPCC+LI+ L DLEAALCLCT
Subjt:  MASKVTSATALLVLLNLLFFSLVSSTYVPCPPPPPKPRKGGYPKQPAPQPKCPTDALKLGVCADVLDGLVHVVVGKPPKTPCCSLIQGLADLEAALCLCT

Query:  AIKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQCPA
        A+KAKALGLNIDLSVSLSLLLNYCGKKVP GF+CPA
Subjt:  AIKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQCPA

A0A6J1DKU8 14 kDa proline-rich protein DC2.15-like2.9e-5986.03Show/hide
Query:  MASKVTSATALLVLLNLLFFSLVSSTYVPCPPPPPKPRKGGYPKQPAPQPKCPTDALKLGVCADVLDGLVHVVVGKPPKTPCCSLIQGLADLEAALCLCT
        MASK+TS+T+ L+LLNLLFF+LV+ST VPCPPPP KP KGG PK  APQP+CPTD LKLGVCADVLDGLVHVV+G PPKTPCC+LIQGLADLEAA+CLCT
Subjt:  MASKVTSATALLVLLNLLFFSLVSSTYVPCPPPPPKPRKGGYPKQPAPQPKCPTDALKLGVCADVLDGLVHVVVGKPPKTPCCSLIQGLADLEAALCLCT

Query:  AIKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQCPA
        AIKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQCPA
Subjt:  AIKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQCPA

A0A6J1FXW6 14 kDa proline-rich protein DC2.15-like1.8e-5378.68Show/hide
Query:  MASKVTSATALLVLLNLLFFSLVSSTYVPCPPPPPKPRKGGYPKQPAPQPKCPTDALKLGVCADVLDGLVHVVVGKPPKTPCCSLIQGLADLEAALCLCT
        MASK++S TALL+LLNLLFF++VSSTYVPCPPPPP+  K    +QP   PKCP D LKLGVCAD+LDGLVHVV+G PPK+PCC+LIQGL DLEAA+CLCT
Subjt:  MASKVTSATALLVLLNLLFFSLVSSTYVPCPPPPPKPRKGGYPKQPAPQPKCPTDALKLGVCADVLDGLVHVVVGKPPKTPCCSLIQGLADLEAALCLCT

Query:  AIKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQCPA
        A+KA+ALGLNIDLSVSLSLLLNYCGKKVP GFQCPA
Subjt:  AIKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQCPA

SwissProt top hitse value%identityAlignment
P14009 14 kDa proline-rich protein DC2.158.3e-3557.14Show/hide
Query:  MASKVTSATALLVLLNLLFFSLVSST------YVPCPPPPPKPRKGGYPKQPAPQPKCPTDALKLGVCADVLDGLVHVVVGKPPKTPCCSLIQGLADLEA
        M SK +++ AL   LN+LFF+LVSST      Y P P P PKP    YP       KCP DALKLGVCADVL+ + +VV+G PP  PCCSL++GL +LEA
Subjt:  MASKVTSATALLVLLNLLFFSLVSST------YVPCPPPPPKPRKGGYPKQPAPQPKCPTDALKLGVCADVLDGLVHVVVGKPPKTPCCSLIQGLADLEA

Query:  ALCLCTAIKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQC
        A+CLCTAIKA  LG N++L ++LSL+LN CGK+VP GF+C
Subjt:  ALCLCTAIKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQC

Q39176 Lipid transfer protein EARLI 13.6e-3044.31Show/hide
Query:  MASKVTSATALLVLLNLLFFSLVSST---------YVPCPPPPPKPRKGGYPKQ-----------PAPQPK-------------CPTDALKLGVCADVLD
        MASK +++ AL   LN++FF+L ++T         + P P P PKP     PK            P+P P+             CP DAL+LGVCA+VL 
Subjt:  MASKVTSATALLVLLNLLFFSLVSST---------YVPCPPPPPKPRKGGYPKQ-----------PAPQPK-------------CPTDALKLGVCADVLD

Query:  GLVHVVVGKPPKTPCCSLIQGLADLEAALCLCTAIKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQC
         L+++ +G+P   PCCSLIQGL DL+AA+CLCTA++A  LG+N+++ +SLS+LLN C +KVP GFQC
Subjt:  GLVHVVVGKPPKTPCCSLIQGLADLEAALCLCTAIKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQC

Q9SU33 pEARLI1-like lipid transfer protein 31.1e-2640.34Show/hide
Query:  MASKVTSATALLVLLNLLFFSLVSST-----------------------------------------YVPCPP-PPPKPRKGGYPKQPAPQPKCPTDALK
        MASK +++ AL   LN+LFF+L ++T                                          VP P  P P P     P+ P     CP DAL+
Subjt:  MASKVTSATALLVLLNLLFFSLVSST-----------------------------------------YVPCPP-PPPKPRKGGYPKQPAPQPKCPTDALK

Query:  LGVCADVLDGLVHVVVGKPPKTPCCSLIQGLADLEAALCLCTAIKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQC
        LGVCA+VL GL++V +G+P   PCCSLIQGL DL+AA+CLCTA++A  LG+N+++ +SLS+LLN C +++P  FQC
Subjt:  LGVCADVLDGLVHVVVGKPPKTPCCSLIQGLADLEAALCLCTAIKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQC

Q9SU34 pEARLI1-like lipid transfer protein 22.9e-2740.11Show/hide
Query:  MASKVTSATALLVLLNLLFFSLVSSTYVPCPPPPPKPRKGGYPKQPAPQ------------------------------------------------PKC
        MASK +++ AL   LN+LFF+L + T   C  P PKPR    PK P+P+                                                  C
Subjt:  MASKVTSATALLVLLNLLFFSLVSSTYVPCPPPPPKPRKGGYPKQPAPQ------------------------------------------------PKC

Query:  PTDALKLGVCADVLDGLVHVVVGKPPKTPCCSLIQGLADLEAALCLCTAIKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQC
        P DAL+LGVCA+VL GL++V +G+P   PCCSLIQGL DL+AA+CLCTA++A  LG+N+++ +SLS+LLN C +++P  FQC
Subjt:  PTDALKLGVCADVLDGLVHVVVGKPPKTPCCSLIQGLADLEAALCLCTAIKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQC

Q9SU35 pEARLI1-like lipid transfer protein 11.8e-2945.62Show/hide
Query:  MASKVTSATALLVLLNLLFFSLVSSTYVPC-PPPPPKPRKGGYPKQ-----------PAPQPK--------------CPTDALKLGVCADVLDGLVHVVV
        MASK +++ AL   LN+LFF+L  +T   C P P PKP     PK            P+P P+              CP DALKLGVCA+VL  L+++ +
Subjt:  MASKVTSATALLVLLNLLFFSLVSSTYVPC-PPPPPKPRKGGYPKQ-----------PAPQPK--------------CPTDALKLGVCADVLDGLVHVVV

Query:  GKPPKTPCCSLIQGLADLEAALCLCTAIKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQC
        G+P    CCSLIQGL D++AA+CLCTA++A  LG+N+++ +SLS+LLN C +K+P GFQC
Subjt:  GKPPKTPCCSLIQGLADLEAALCLCTAIKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQC

Arabidopsis top hitse value%identityAlignment
AT1G12090.1 extensin-like protein5.0e-3556.3Show/hide
Query:  TSATALLVLLNLLFFSLVSS--TYVP----CPPPPPKPRKGGYPKQPAPQPKCPTDALKLGVCADVLDGLVHVVVGKPPKTPCCSLIQGLADLEAALCLC
        +S+ AL + LNLLFF+ +S+  +  P    CP P PKP     P   +   KCP D LKLGVCA+VL+GL+ + +GKPP  PCCSLIQGLAD+EAA+CLC
Subjt:  TSATALLVLLNLLFFSLVSS--TYVP----CPPPPPKPRKGGYPKQPAPQPKCPTDALKLGVCADVLDGLVHVVVGKPPKTPCCSLIQGLADLEAALCLC

Query:  TAIKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQC
        TA+KA  LG+N++L +SLSLLLN C K++P GFQC
Subjt:  TAIKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQC

AT1G62510.1 Bifunctional inhibitor/lipid-transfer protein/seed storage 2S albumin superfamily protein2.4e-3755.7Show/hide
Query:  MASKVTSATALLVLLNLLFFSLVSS-----------TYVPCPPPPPKPRKGGYP-KQPAPQP---KCPTDALKLGVCADVLDGLVHVVVGKPPKTPCCSL
        MAS+ T + AL ++LN LFF+ +S+            + P P P PKP     P   P+P P   KCP DALKLGVCA+VL+GL++V +GKPP  PCC+L
Subjt:  MASKVTSATALLVLLNLLFFSLVSS-----------TYVPCPPPPPKPRKGGYP-KQPAPQP---KCPTDALKLGVCADVLDGLVHVVVGKPPKTPCCSL

Query:  IQGLADLEAALCLCTAIKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQC
        IQGLADLEAA CLCTA+KA  LG+N+++ +SLSLLLN C KKVP GFQC
Subjt:  IQGLADLEAALCLCTAIKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQC

AT2G45180.1 Bifunctional inhibitor/lipid-transfer protein/seed storage 2S albumin superfamily protein5.0e-4366.18Show/hide
Query:  MASKVTSATALLVLLNLLFFSLVSSTYVPCPPPPPKPRK--GGYPKQPAPQPKCPTDALKLGVCADVLDGLVHVVVGKPPKTPCCSLIQGLADLEAALCL
        MASK  + TALL+ LNLLFF+ V+ST   CPP  PKP K      K PA +P CPTD LKLGVCAD+L GLV+VVVG PPKTPCC+L+QGLA+LEAA+CL
Subjt:  MASKVTSATALLVLLNLLFFSLVSSTYVPCPPPPPKPRK--GGYPKQPAPQPKCPTDALKLGVCADVLDGLVHVVVGKPPKTPCCSLIQGLADLEAALCL

Query:  CTAIKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQC
        CTA+KA  LG+N+++ + L+LLLNYCGKKVP+GFQC
Subjt:  CTAIKAKALGLNIDLSVSLSLLLNYCGKKVPYGFQC

AT5G46890.1 Bifunctional inhibitor/lipid-transfer protein/seed storage 2S albumin superfamily protein4.7e-3357.69Show/hide
Query:  SATALLVLLNLLFFSLVSSTYVPCPPPPPKPRKGGYPKQPAPQPKCPT--DALKLGVCADVLDGLVHVVVGKPPKTPCCSLIQGLADLEAALCLCTAIKA
        S  ALL++ N++FF+ VSST VPCPPPPPK     Y K+PA     PT  DALKL VCA+VLD    V V  PP + CC+LI+GL DLEAA+CLCTA+KA
Subjt:  SATALLVLLNLLFFSLVSSTYVPCPPPPPKPRKGGYPKQPAPQPKCPT--DALKLGVCADVLDGLVHVVVGKPPKTPCCSLIQGLADLEAALCLCTAIKA

Query:  KALGLNIDLSVSLSLLLNYCGKKVPYGFQC
          LG+N+++ +SL+++LN+CGKKVP GF+C
Subjt:  KALGLNIDLSVSLSLLLNYCGKKVPYGFQC

AT5G46900.1 Bifunctional inhibitor/lipid-transfer protein/seed storage 2S albumin superfamily protein2.5e-3458.59Show/hide
Query:  SATALLVLLNLLFFSLVSSTYVPCPPPPPKPRKGGYPKQPAPQPKCPTDALKLGVCADVLDGLVHVVVGKPPKTPCCSLIQGLADLEAALCLCTAIKAKA
        S  ALL++ N++FF+LVSST VPCPPPPPK      P  P+P+P C  DALKL VCA+VLD    V V  PP + CC+LI+GL DLEAA+CLCTA+KA  
Subjt:  SATALLVLLNLLFFSLVSSTYVPCPPPPPKPRKGGYPKQPAPQPKCPTDALKLGVCADVLDGLVHVVVGKPPKTPCCSLIQGLADLEAALCLCTAIKAKA

Query:  LGLNIDLSVSLSLLLNYCGKKVPYGFQC
        LG+N+++ +SL+++LN+CGKKVP GF+C
Subjt:  LGLNIDLSVSLSLLLNYCGKKVPYGFQC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCAAAGTCACATCAGCCACTGCCCTTCTTGTCCTCCTCAACCTCCTGTTCTTCTCTCTCGTCAGCTCCACCTACGTCCCTTGCCCACCGCCGCCGCCAAAGCC
CCGCAAGGGTGGCTACCCGAAGCAGCCTGCGCCGCAGCCCAAATGCCCCACCGACGCCCTCAAGCTGGGCGTCTGTGCCGACGTGCTTGACGGTCTGGTACATGTGGTGG
TCGGCAAGCCGCCAAAAACCCCATGCTGCAGCCTGATCCAGGGGTTGGCTGATCTCGAAGCTGCTCTTTGCCTTTGCACCGCCATTAAAGCAAAAGCTTTGGGCCTGAAC
ATTGACCTCTCTGTTTCCCTCAGCTTGCTTCTGAACTACTGTGGAAAGAAAGTTCCATATGGCTTCCAATGCCCAGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCCAAAGTCACATCAGCCACTGCCCTTCTTGTCCTCCTCAACCTCCTGTTCTTCTCTCTCGTCAGCTCCACCTACGTCCCTTGCCCACCGCCGCCGCCAAAGCC
CCGCAAGGGTGGCTACCCGAAGCAGCCTGCGCCGCAGCCCAAATGCCCCACCGACGCCCTCAAGCTGGGCGTCTGTGCCGACGTGCTTGACGGTCTGGTACATGTGGTGG
TCGGCAAGCCGCCAAAAACCCCATGCTGCAGCCTGATCCAGGGGTTGGCTGATCTCGAAGCTGCTCTTTGCCTTTGCACCGCCATTAAAGCAAAAGCTTTGGGCCTGAAC
ATTGACCTCTCTGTTTCCCTCAGCTTGCTTCTGAACTACTGTGGAAAGAAAGTTCCATATGGCTTCCAATGCCCAGCTTGA
Protein sequenceShow/hide protein sequence
MASKVTSATALLVLLNLLFFSLVSSTYVPCPPPPPKPRKGGYPKQPAPQPKCPTDALKLGVCADVLDGLVHVVVGKPPKTPCCSLIQGLADLEAALCLCTAIKAKALGLN
IDLSVSLSLLLNYCGKKVPYGFQCPA