CuGenDBv2

CSPI07G20300 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI07G20300
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: Chr7:17661850..17662476
RNA-Seq Expression: CSPI07G20300
Synteny: CSPI07G20300
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat; IPR011990 - Tetratricopeptide-like helical domain superfamily
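
The genome location string can be split into its parts programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention); `Locus` and `parse_location` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive span, so both end points are counted.
        return self.end - self.start + 1


def parse_location(text: str) -> Locus:
    """Parse a string such as 'Chr7:17661850..17662476'."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Locus(m.group(1), int(m.group(2)), int(m.group(3)))


locus = parse_location("Chr7:17661850..17662476")
print(locus.chrom, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption, the span for this gene is 627 bp.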


Homology
GenBank top hits (e value, % identity, alignment)
KAE8646431.1 hypothetical protein Csa_016697 [Cucumis sativus] (e value: 3.0e-74; % identity: 100)
Query:  MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH
        MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH
Subjt:  MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH

Query:  VILDQMLKYCPPNVITYHCFFRSLEKLKEILMLSTG
        VILDQMLKYCPPNVITYHCFFRSLEKLKEILMLSTG
Subjt:  VILDQMLKYCPPNVITYHCFFRSLEKLKEILMLSTG

KAG6575615.1 Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.2e-59; % identity: 85.07)
Query:  MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH
        M KKGVRKDL SY +YMDIQCK+GK W+AVKLY EMK KGMKLDVVAYNT IHA+GISE VDF +RVFHEMK MGCKPNVVTCNTIIKLFCENGRFKDAH
Subjt:  MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH

Query:  VILDQMLKY-CPPNVITYHCFFRSLEKLKEILML
        ++LDQMLK  CPPNVITYHCFFRSLEK KEILML
Subjt:  VILDQMLKY-CPPNVITYHCFFRSLEKLKEILML

KAG7014158.1 Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 1.2e-59; % identity: 85.07)
Query:  MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH
        M KKGVRKDL SY +YMDIQCK+GK W+AVKLY EMK KGMKLDVVAYNT IHA+GISE VDF +RVFHEMK MGCKPNVVTCNTIIKLFCENGRFKDAH
Subjt:  MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH

Query:  VILDQMLKY-CPPNVITYHCFFRSLEKLKEILML
        ++LDQMLK  CPPNVITYHCFFRSLEK KEILML
Subjt:  VILDQMLKY-CPPNVITYHCFFRSLEKLKEILML

XP_022953157.1 pentatricopeptide repeat-containing protein At1g80550, mitochondrial [Cucurbita moschata] (e value: 1.8e-58; % identity: 84.33)
Query:  MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH
        M KKGVRKDL SY +YMDIQCK+GK W+AVKLY EMK KGMKLDVVAYNT IHA+GISE VDF +RVFHEMK MGCKPNVVTCNTIIKLFCENGRFKDAH
Subjt:  MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH

Query:  VILDQMLKY-CPPNVITYHCFFRSLEKLKEILML
        ++LDQMLK  C PNVITYHCFFRSLEK KEILML
Subjt:  VILDQMLKY-CPPNVITYHCFFRSLEKLKEILML

XP_031745027.1 LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At1g80550, mitochondrial [Cucumis sativus] (e value: 2.8e-72; % identity: 100)
Query:  MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH
        MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH
Subjt:  MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH

Query:  VILDQMLKYCPPNVITYHCFFRSLEKLKEILML
        VILDQMLKYCPPNVITYHCFFRSLEKLKEILML
Subjt:  VILDQMLKYCPPNVITYHCFFRSLEKLKEILML

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K6U8 Uncharacterized protein (e value: 1.5e-74; % identity: 100)
Query:  MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH
        MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH
Subjt:  MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH

Query:  VILDQMLKYCPPNVITYHCFFRSLEKLKEILMLSTG
        VILDQMLKYCPPNVITYHCFFRSLEKLKEILMLSTG
Subjt:  VILDQMLKYCPPNVITYHCFFRSLEKLKEILMLSTG

A0A0A0KBR1 Uncharacterized protein (e value: 1.5e-58; % identity: 83.58)
Query:  MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH
        M KKGVRKDL SY +YMDIQCK+GK W+AVKLY EMK KGMKLDVVAYNT IHAVGISE VDF +RVFHEMK MGCKPNVVTCNT+IKLFCENGRFKDAH
Subjt:  MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH

Query:  VILDQMLKY-CPPNVITYHCFFRSLEKLKEILML
        ++LDQMLK  C PNVITYHCFFRSLEK KEIL+L
Subjt:  VILDQMLKY-CPPNVITYHCFFRSLEKLKEILML

A0A1S3CFA1 pentatricopeptide repeat-containing protein At1g80550, mitochondrial (e value: 1.5e-58; % identity: 83.58)
Query:  MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH
        M KKGVRKDL SY +YMDIQCK+GK W+AVKLY EMK KGMKLDVVAYNT IHAVGISE VDF +RVFHEMK MGCKPNVVTCNT+IKLFCENGRFKDAH
Subjt:  MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH

Query:  VILDQMLKY-CPPNVITYHCFFRSLEKLKEILML
        ++LDQMLK  C PNVITYHCFFRSLEK KEIL+L
Subjt:  VILDQMLKY-CPPNVITYHCFFRSLEKLKEILML

A0A6J1D599 pentatricopeptide repeat-containing protein At1g80550, mitochondrial (e value: 1.1e-58; % identity: 83.58)
Query:  MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH
        M  KGVRKDL SY +YMDIQCK+GK W+AVKLY EMK KGMKLDVVAYNT IHA+GISE VDF +RVFHEMK MGCKPNVVTCNTIIKLFCENGRFKDAH
Subjt:  MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH

Query:  VILDQMLKY-CPPNVITYHCFFRSLEKLKEILML
        V+LDQMLK  CPPNVITYHCFFRS EK  EILML
Subjt:  VILDQMLKY-CPPNVITYHCFFRSLEKLKEILML

A0A6J1GNU3 pentatricopeptide repeat-containing protein At1g80550, mitochondrial (e value: 8.6e-59; % identity: 84.33)
Query:  MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH
        M KKGVRKDL SY +YMDIQCK+GK W+AVKLY EMK KGMKLDVVAYNT IHA+GISE VDF +RVFHEMK MGCKPNVVTCNTIIKLFCENGRFKDAH
Subjt:  MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH

Query:  VILDQMLKY-CPPNVITYHCFFRSLEKLKEILML
        ++LDQMLK  C PNVITYHCFFRSLEK KEILML
Subjt:  VILDQMLKY-CPPNVITYHCFFRSLEKLKEILML

SwissProt top hits (e value, % identity, alignment)
Q9FIX3 Pentatricopeptide repeat-containing protein At5g39710 (e value: 4.8e-14; % identity: 33.58)
Query:  MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH
        M   G    + +Y   ++  C TGK  +A+ +  +MK KG+  DVV+Y+T +     S DVD   RV  EM   G KP+ +T +++I+ FCE  R K+A 
Subjt:  MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH

Query:  VILDQMLKY-CPPNVITY------HCFFRSLEKLKEI
         + ++ML+   PP+  TY      +C    LEK  ++
Subjt:  VILDQMLKY-CPPNVITY------HCFFRSLEKLKEI

Q9FZ19 Putative pentatricopeptide repeat-containing protein At1g02420 (e value: 4.3e-15; % identity: 34.35)
Query:  MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH
        M  KG++ D+ +Y   +D+ CK  +  +A KL  +M+ +    DV+ Y T I  +G+    D    V  EMK  GC P+V   N  I+ FC   R  DA 
Subjt:  MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH

Query:  VILDQMLKY-CPPNVITYHCFFRSLEKLKEI
         ++D+M+K    PN  TY+ FFR L    ++
Subjt:  VILDQMLKY-CPPNVITYHCFFRSLEKLKEI

Q9M8M3 Pentatricopeptide repeat-containing protein At1g80550, mitochondrial (e value: 5.1e-40; % identity: 61.19)
Query:  MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH
        M  +GV KDL SY +YMDI CK+GK W+AVKLY EMK++ MKLDVVAYNT I A+G S+ V+F  RVF EM+  GC+PNV T NTIIKL CE+GR +DA+
Subjt:  MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH

Query:  VILDQMLKY-CPPNVITYHCFFRSLEKLKEILML
         +LD+M K  C P+ ITY C F  LEK  EIL L
Subjt:  VILDQMLKY-CPPNVITYHCFFRSLEKLKEILML

Q9S7R4 Pentatricopeptide repeat-containing protein At1g74900, mitochondrial (e value: 6.9e-13; % identity: 29.6)
Query:  MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH
        M ++G+  +L +Y   +    + G+   A + ++EMK +  ++DVV Y T +H  G++ ++     VF EM   G  P+V T N +I++ C+    ++A 
Subjt:  MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH

Query:  VILDQMLKY-CPPNVITYHCFFRSL
        V+ ++M++    PNV TY+   R L
Subjt:  VILDQMLKY-CPPNVITYHCFFRSL

Q9SZ52 Pentatricopeptide repeat-containing protein At4g31850, chloroplastic (e value: 2.8e-14; % identity: 34.88)
Query:  GVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAHVILD
        G+  D  +Y + M    K G+  EA+KL  EM   G + DV+  N+ I+ +  ++ VD   ++F  MK M  KP VVT NT++    +NG+ ++A  + +
Subjt:  GVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAHVILD

Query:  QML-KYCPPNVITYHCFFRSLEKLKEILM
         M+ K CPPN IT++  F  L K  E+ +
Subjt:  QML-KYCPPNVITYHCFFRSLEKLKEILM

Arabidopsis top hits (e value, % identity, alignment)
AT1G02420.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 3.1e-16; % identity: 34.35)
Query:  MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH
        M  KG++ D+ +Y   +D+ CK  +  +A KL  +M+ +    DV+ Y T I  +G+    D    V  EMK  GC P+V   N  I+ FC   R  DA 
Subjt:  MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH

Query:  VILDQMLKY-CPPNVITYHCFFRSLEKLKEI
         ++D+M+K    PN  TY+ FFR L    ++
Subjt:  VILDQMLKY-CPPNVITYHCFFRSLEKLKEI

AT1G74900.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 4.9e-14; % identity: 29.6)
Query:  MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH
        M ++G+  +L +Y   +    + G+   A + ++EMK +  ++DVV Y T +H  G++ ++     VF EM   G  P+V T N +I++ C+    ++A 
Subjt:  MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH

Query:  VILDQMLKY-CPPNVITYHCFFRSL
        V+ ++M++    PNV TY+   R L
Subjt:  VILDQMLKY-CPPNVITYHCFFRSL

AT1G80550.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 3.6e-41; % identity: 61.19)
Query:  MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH
        M  +GV KDL SY +YMDI CK+GK W+AVKLY EMK++ MKLDVVAYNT I A+G S+ V+F  RVF EM+  GC+PNV T NTIIKL CE+GR +DA+
Subjt:  MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH

Query:  VILDQMLKY-CPPNVITYHCFFRSLEKLKEILML
         +LD+M K  C P+ ITY C F  LEK  EIL L
Subjt:  VILDQMLKY-CPPNVITYHCFFRSLEKLKEILML

AT4G31850.1 proton gradient regulation 3 (e value: 2.0e-15; % identity: 34.88)
Query:  GVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAHVILD
        G+  D  +Y + M    K G+  EA+KL  EM   G + DV+  N+ I+ +  ++ VD   ++F  MK M  KP VVT NT++    +NG+ ++A  + +
Subjt:  GVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAHVILD

Query:  QML-KYCPPNVITYHCFFRSLEKLKEILM
         M+ K CPPN IT++  F  L K  E+ +
Subjt:  QML-KYCPPNVITYHCFFRSLEKLKEILM

AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 3.4e-15; % identity: 33.58)
Query:  MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH
        M   G    + +Y   ++  C TGK  +A+ +  +MK KG+  DVV+Y+T +     S DVD   RV  EM   G KP+ +T +++I+ FCE  R K+A 
Subjt:  MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH

Query:  VILDQMLKY-CPPNVITY------HCFFRSLEKLKEI
         + ++ML+   PP+  TY      +C    LEK  ++
Subjt:  VILDQMLKY-CPPNVITY------HCFFRSLEKLKEI
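
The % identity values listed with each hit can be checked against the alignment text itself. The sketch below assumes the common BLAST-style definition (identical positions divided by the total number of alignment columns, gaps included); the page does not state how its values were computed. It reads identities from the unlabeled middle line of each block, where a letter marks an identical residue and `+` or a space marks a non-identical position, because the `Subjt` lines as captured here repeat the query. `percent_identity` is a hypothetical helper name, and the strings are copied from the KAG6575615.1 alignment above.

```python
def percent_identity(blocks):
    """Return percent identity for a list of (query_line, match_line) pairs.

    Identical positions are the alphabetic characters on the match line;
    the column count is taken from the query line (gap characters included).
    """
    identical = sum(ch.isalpha() for _, match in blocks for ch in match)
    columns = sum(len(query) for query, _ in blocks)
    return 100.0 * identical / columns


# Query and match lines copied from the KAG6575615.1 hit.
blocks = [
    (
        "MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAH",
        "M KKGVRKDL SY +YMDIQCK+GK W+AVKLY EMK KGMKLDVVAYNT IHA+GISE VDF +RVFHEMK MGCKPNVVTCNTIIKLFCENGRFKDAH",
    ),
    (
        "VILDQMLKY-CPPNVITYHCFFRSLEKLKEILML",
        "++LDQMLK  CPPNVITYHCFFRSLEK KEILML",
    ),
]

print(round(percent_identity(blocks), 2))  # 85.07
```

With 114 identical positions over 134 alignment columns, the result agrees with the 85.07 listed for this hit.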


Sequences
CDS sequence
ATGGGTAAGAAGGGTGTTCGTAAGGATTTGCAATCGTACTTGTTATATATGGATATACAATGCAAAACTGGGAAGTCTTGGGAGGCTGTTAAATTGTACATGGAGATGAA
AAATAAGGGAATGAAATTGGATGTTGTGGCCTATAATACAGCGATTCATGCAGTTGGGATTTCGGAAGATGTCGATTTCACCAACAGAGTGTTTCATGAGATGAAGGGAA
TGGGGTGTAAGCCTAACGTTGTGACTTGCAATACTATTATTAAGCTATTTTGTGAGAATGGAAGATTCAAGGATGCTCATGTGATTCTCGACCAAATGCTCAAATACTGT
CCACCGAATGTTATCACCTATCATTGTTTTTTCAGGTCTCTTGAAAAGCTGAAAGAGATTCTCATGTTATCGACAGGATGA
mRNA sequence
ATGGGTAAGAAGGGTGTTCGTAAGGATTTGCAATCGTACTTGTTATATATGGATATACAATGCAAAACTGGGAAGTCTTGGGAGGCTGTTAAATTGTACATGGAGATGAA
AAATAAGGGAATGAAATTGGATGTTGTGGCCTATAATACAGCGATTCATGCAGTTGGGATTTCGGAAGATGTCGATTTCACCAACAGAGTGTTTCATGAGATGAAGGGAA
TGGGGTGTAAGCCTAACGTTGTGACTTGCAATACTATTATTAAGCTATTTTGTGAGAATGGAAGATTCAAGGATGCTCATGTGATTCTCGACCAAATGCTCAAATACTGT
CCACCGAATGTTATCACCTATCATTGTTTTTTCAGGTCTCTTGAAAAGCTGAAAGAGATTCTCATGTTATCGACAGGATGATTAAACTTGGGGTTTATCCAAAAATGGAT
GCTGATGTGATGCTCATGAGTAAGTTTGGAAGGGGAGAGACAAATAGAGCACACAAGAACACATTGTTAAACATCTGGCAAACATGATTAACACGATATTACTAGTTAAG
CTTTAATTTTGAATGTTGAAGTTTGTTCTTTTTTTTTAAAAAAAATAAAATTCATTCAGATTAGAGATATCATTGGC
Protein sequence
MGKKGVRKDLQSYLLYMDIQCKTGKSWEAVKLYMEMKNKGMKLDVVAYNTAIHAVGISEDVDFTNRVFHEMKGMGCKPNVVTCNTIIKLFCENGRFKDAHVILDQMLKYC
PPNVITYHCFFRSLEKLKEILMLSTG
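
The relationship between the CDS and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1); `translate` and `CODON_TABLE` are hypothetical names introduced for illustration, and the two inputs are fragments copied from the start and end of the CDS above rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the third position varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGGTAAGAAGGGTGTTCGTAAGGATTTG"  # first ten codons of the CDS
cds_end = "TCGACAGGATGA"                     # last four codons, including TGA

print(translate(cds_start))  # MGKKGVRKDL
print(translate(cds_end))    # STG
```

Both fragments agree with the corresponding ends of the protein sequence listed above; joining the wrapped CDS lines and passing the whole string to `translate` would extend the same check to the full sequence.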