; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G06020 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G06020
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptionprotein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic
Genome locationClcChr09:4726808..4729002
RNA-Seq ExpressionClc09G06020
SyntenyClc09G06020
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR003425 - CCB3/YggT


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575811.1 Protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]7.2e-9087.2Show/hide
Query:  MAATAACSSLSSIRVIGIGLFPLIPNNGNSSFSRGFKYHSQRNPNCRFQAIKCSLSLLDSFTSSKIHLSLAYATPPLKPAAAYEAASTIPFALQDLSMAA
        MAAT ACSSLS+IRVIG+G  PLIPN+GNSSF RGF YH Q NPNCRFQA KCS SLL SFTSSKIHL L YATPPLKP     AA TIPFALQD S+AA
Subjt:  MAATAACSSLSSIRVIGIGLFPLIPNNGNSSFSRGFKYHSQRNPNCRFQAIKCSLSLLDSFTSSKIHLSLAYATPPLKPAAAYEAASTIPFALQDLSMAA

Query:  -DFMKNVTLADLDPGTAKLAISFLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQ
         DFM NV+LADLDPGTAKLAI FLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQ
Subjt:  -DFMKNVTLADLDPGTAKLAISFLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQ

Query:  GLLVLLSQQVS
        GLLVLLSQQVS
Subjt:  GLLVLLSQQVS

XP_004136016.2 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X1 [Cucumis sativus]5.0e-9187.2Show/hide
Query:  MAATAACSSLSSIRVIGIGLFPLIPNNGNSSFSRGFKYHSQRNPNCRFQAIKCSLSLLDSFTSSKIHLSLAYATPPLKPAAAYEAASTIPFALQDLSMAA
        MAAT ACSSLSSIRVIGIG  PL PN+GNS+F RGFKYH QRNPNC+FQAIKCS SLL SFTSSK  LSLAY  PPLKPAAAYEAA TIPF LQD SMAA
Subjt:  MAATAACSSLSSIRVIGIGLFPLIPNNGNSSFSRGFKYHSQRNPNCRFQAIKCSLSLLDSFTSSKIHLSLAYATPPLKPAAAYEAASTIPFALQDLSMAA

Query:  -DFMKNVTLADLDPGTAKLAISFLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQ
         DF+ ++TLADLDPGTAKLAISFLGP LS FSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLL+ATRKVIPPLGGVDVTPVVWFGL+SFLNEILLGPQ
Subjt:  -DFMKNVTLADLDPGTAKLAISFLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQ

Query:  GLLVLLSQQVS
        GLLVLLSQQVS
Subjt:  GLLVLLSQQVS

XP_022991242.1 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X1 [Cucurbita maxima]2.7e-8484.83Show/hide
Query:  MAATAACSSLSSIRVIGIGLFPLIPNNGNSSFSRGFKYHSQRNPNCRFQAIKCSLSLLDSFTSSKIHLSLAYATPPLKPAAAYEAASTIPFALQDLSMAA
        MAAT ACSSLS+IRVIG+G   LIPN+GNSS  RGF YH Q NPNCRFQA KCS SLL SFTSSKI L L  ATP LKP     AA TIPFALQD SMAA
Subjt:  MAATAACSSLSSIRVIGIGLFPLIPNNGNSSFSRGFKYHSQRNPNCRFQAIKCSLSLLDSFTSSKIHLSLAYATPPLKPAAAYEAASTIPFALQDLSMAA

Query:  -DFMKNVTLADLDPGTAKLAISFLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQ
         DF  NV LADLDPGTAKLAI FLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQ
Subjt:  -DFMKNVTLADLDPGTAKLAISFLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQ

Query:  GLLVLLSQQVS
        GLLVLLSQQVS
Subjt:  GLLVLLSQQVS

XP_023548987.1 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X1 [Cucurbita pepo subsp. pepo]1.6e-8987.2Show/hide
Query:  MAATAACSSLSSIRVIGIGLFPLIPNNGNSSFSRGFKYHSQRNPNCRFQAIKCSLSLLDSFTSSKIHLSLAYATPPLKPAAAYEAASTIPFALQDLSMAA
        MAAT ACSSLS+IRVIG+G  PLIPN+GNSSF RGF YH Q NPNCRFQA KCS SLL SFTSSKIHL L YATPPLKP     AA TIPFALQD SMAA
Subjt:  MAATAACSSLSSIRVIGIGLFPLIPNNGNSSFSRGFKYHSQRNPNCRFQAIKCSLSLLDSFTSSKIHLSLAYATPPLKPAAAYEAASTIPFALQDLSMAA

Query:  -DFMKNVTLADLDPGTAKLAISFLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQ
         DFM NV LADLDPGTAKLAI  LGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQ
Subjt:  -DFMKNVTLADLDPGTAKLAISFLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQ

Query:  GLLVLLSQQVS
        GLLVLLSQQVS
Subjt:  GLLVLLSQQVS

XP_038896182.1 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X1 [Benincasa hispida]2.0e-9287.79Show/hide
Query:  IAMAATAACSSLSSIRVIGIGLFPLIPNNGNSSFSRGFKYHSQRNPNCRFQAIKCSLSLLDSFTSSKIHLSLAYATPPLKPAAAYEAASTIPFALQDLSM
        +A    AACSSLS IRVIG    PLIPN GNSSFSRGFKYH QRNPNCRFQA KCS S+LDSFT+SK+HLSLAYAT PLKPAAAYEAA TIPFALQD SM
Subjt:  IAMAATAACSSLSSIRVIGIGLFPLIPNNGNSSFSRGFKYHSQRNPNCRFQAIKCSLSLLDSFTSSKIHLSLAYATPPLKPAAAYEAASTIPFALQDLSM

Query:  -AADFMKNVTLADLDPGTAKLAISFLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLG
         A+DFM NV LADLDPG AKLAI FLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLG
Subjt:  -AADFMKNVTLADLDPGTAKLAISFLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLG

Query:  PQGLLVLLSQQVS
        PQGLLVLLSQQVS
Subjt:  PQGLLVLLSQQVS

TrEMBL top hitse value%identityAlignment
A0A0A0K9A7 Uncharacterized protein2.4e-9187.2Show/hide
Query:  MAATAACSSLSSIRVIGIGLFPLIPNNGNSSFSRGFKYHSQRNPNCRFQAIKCSLSLLDSFTSSKIHLSLAYATPPLKPAAAYEAASTIPFALQDLSMAA
        MAAT ACSSLSSIRVIGIG  PL PN+GNS+F RGFKYH QRNPNC+FQAIKCS SLL SFTSSK  LSLAY  PPLKPAAAYEAA TIPF LQD SMAA
Subjt:  MAATAACSSLSSIRVIGIGLFPLIPNNGNSSFSRGFKYHSQRNPNCRFQAIKCSLSLLDSFTSSKIHLSLAYATPPLKPAAAYEAASTIPFALQDLSMAA

Query:  -DFMKNVTLADLDPGTAKLAISFLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQ
         DF+ ++TLADLDPGTAKLAISFLGP LS FSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLL+ATRKVIPPLGGVDVTPVVWFGL+SFLNEILLGPQ
Subjt:  -DFMKNVTLADLDPGTAKLAISFLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQ

Query:  GLLVLLSQQVS
        GLLVLLSQQVS
Subjt:  GLLVLLSQQVS

A0A1S3BSI7 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic1.7e-8182.46Show/hide
Query:  MAATAACSSLSSIRVIGIGLFPLIPNNGNSSFSRGFKYHSQRNPNCRFQAIKCSLSLLDSFTSSKIHLSLAYATPPLKPAAAYEAASTIPFALQDLSMAA
        MAATAACSSLSSIR                   RGFKYH QRNPN +FQAIKCS SLL SFTSSKI  SLAY  PPLKPAAAYEAA TIPFALQD SMAA
Subjt:  MAATAACSSLSSIRVIGIGLFPLIPNNGNSSFSRGFKYHSQRNPNCRFQAIKCSLSLLDSFTSSKIHLSLAYATPPLKPAAAYEAASTIPFALQDLSMAA

Query:  -DFMKNVTLADLDPGTAKLAISFLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQ
         DF+ ++TLADLDPGTAKLAISFLGP LS FSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQ
Subjt:  -DFMKNVTLADLDPGTAKLAISFLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQ

Query:  GLLVLLSQQVS
        GLLVLLSQQVS
Subjt:  GLLVLLSQQVS

A0A5A7VQJ0 Protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB34.0e-7889.89Show/hide
Query:  RGFKYHSQRNPNCRFQAIKCSLSLLDSFTSSKIHLSLAYATPPLKPAAAYEAASTIPFALQDLSMAA-DFMKNVTLADLDPGTAKLAISFLGPFLSAFSF
        RGFKYH QRNPN +FQAIKCS SLL SFTSSKI  SLAY  PPLKPAAAYEAA TIPFALQD SMAA DF+ ++TLADLDPGTAKLAISFLGP LS FSF
Subjt:  RGFKYHSQRNPNCRFQAIKCSLSLLDSFTSSKIHLSLAYATPPLKPAAAYEAASTIPFALQDLSMAA-DFMKNVTLADLDPGTAKLAISFLGPFLSAFSF

Query:  LFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
        LFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS
Subjt:  LFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS

A0A6J1GQG0 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic1.8e-7880.57Show/hide
Query:  MAATAACSSLSSIRVIGIGLFPLIPNNGNSSFSRGFKYHSQRNPNCRFQAIKCSLSLLDSFTSSKIHLSLAYATPPLKPAAAYEAASTIPFALQDLSMAA
        MAAT ACSSLS+IR                   RGF YH Q NPNCRFQA KCS SLL SFTSSKIHL L YATPPLKP     AA TIPFALQD S+AA
Subjt:  MAATAACSSLSSIRVIGIGLFPLIPNNGNSSFSRGFKYHSQRNPNCRFQAIKCSLSLLDSFTSSKIHLSLAYATPPLKPAAAYEAASTIPFALQDLSMAA

Query:  -DFMKNVTLADLDPGTAKLAISFLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQ
         DFM NV+LADLDPGTAKLAI FLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQ
Subjt:  -DFMKNVTLADLDPGTAKLAISFLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQ

Query:  GLLVLLSQQVS
        GLLVLLSQQVS
Subjt:  GLLVLLSQQVS

A0A6J1JL88 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X11.3e-8484.83Show/hide
Query:  MAATAACSSLSSIRVIGIGLFPLIPNNGNSSFSRGFKYHSQRNPNCRFQAIKCSLSLLDSFTSSKIHLSLAYATPPLKPAAAYEAASTIPFALQDLSMAA
        MAAT ACSSLS+IRVIG+G   LIPN+GNSS  RGF YH Q NPNCRFQA KCS SLL SFTSSKI L L  ATP LKP     AA TIPFALQD SMAA
Subjt:  MAATAACSSLSSIRVIGIGLFPLIPNNGNSSFSRGFKYHSQRNPNCRFQAIKCSLSLLDSFTSSKIHLSLAYATPPLKPAAAYEAASTIPFALQDLSMAA

Query:  -DFMKNVTLADLDPGTAKLAISFLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQ
         DF  NV LADLDPGTAKLAI FLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQ
Subjt:  -DFMKNVTLADLDPGTAKLAISFLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVVWFGLVSFLNEILLGPQ

Query:  GLLVLLSQQVS
        GLLVLLSQQVS
Subjt:  GLLVLLSQQVS

SwissProt top hitse value%identityAlignment
Q8RWM7 Protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic4.3e-4572.8Show/hide
Query:  EAASTIPFALQDLSMAADFMKNVTLADLDPGTAKLAISFLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVV
        E A+T    ++  +  ++ ++N++LADLDPGTAKLAI  LGP LSAF FLFI RIVMSWYPKLPV KFPYV+AYAPTEP+L+ TRKVIPPL GVDVTPVV
Subjt:  EAASTIPFALQDLSMAADFMKNVTLADLDPGTAKLAISFLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVV

Query:  WFGLVSFLNEILLGPQGLLVLLSQQ
        WFGLVSFL+EIL+GPQGLLVL+SQQ
Subjt:  WFGLVSFLNEILLGPQGLLVLLSQQ

Arabidopsis top hitse value%identityAlignment
AT5G36120.1 cofactor assembly, complex C (B6F)3.1e-4672.8Show/hide
Query:  EAASTIPFALQDLSMAADFMKNVTLADLDPGTAKLAISFLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVV
        E A+T    ++  +  ++ ++N++LADLDPGTAKLAI  LGP LSAF FLFI RIVMSWYPKLPV KFPYV+AYAPTEP+L+ TRKVIPPL GVDVTPVV
Subjt:  EAASTIPFALQDLSMAADFMKNVTLADLDPGTAKLAISFLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVDVTPVV

Query:  WFGLVSFLNEILLGPQGLLVLLSQQ
        WFGLVSFL+EIL+GPQGLLVL+SQQ
Subjt:  WFGLVSFLNEILLGPQGLLVLLSQQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGAGTTTCATTTTTGTTCGAGTACCCAAAAGAAAATCTCTTTCCCAAACCCAATTTTGCCCATTGATGATAAAGAGATAGCCTTCTCCTTCTTCCTCCACCGGAG
CTCCTCTGCGATAGCCATGGCCGCCACCGCCGCCTGCTCCTCTCTCAGCTCCATTCGAGTAATAGGTATAGGATTGTTCCCTTTGATTCCCAACAATGGAAATTCAAGCT
TCTCTAGAGGCTTCAAGTACCATTCTCAAAGAAATCCAAACTGCAGATTCCAAGCAATCAAATGTAGCTTATCTTTGTTGGATTCTTTTACCTCTTCCAAGATTCATCTG
TCACTGGCCTATGCCACCCCTCCATTAAAGCCAGCTGCTGCATATGAAGCTGCAAGTACTATCCCCTTTGCCTTGCAAGATTTATCGATGGCTGCTGATTTCATGAAGAA
TGTCACCTTGGCCGACCTCGACCCAGGAACGGCAAAGCTTGCGATCAGTTTTCTGGGACCATTTCTCTCGGCGTTTTCGTTTTTGTTTATAGCGAGAATCGTAATGTCTT
GGTATCCCAAGTTGCCTGTGGGGAAGTTTCCATATGTTATAGCTTATGCCCCCACTGAACCACTTCTGATTGCAACAAGGAAGGTGATACCTCCTCTCGGTGGAGTTGAC
GTAACGCCAGTCGTCTGGTTTGGATTGGTTAGTTTCCTCAACGAGATATTGCTTGGTCCCCAAGGGCTGCTTGTCCTCCTTTCTCAACAGGTCAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGAGTTTCATTTTTGTTCGAGTACCCAAAAGAAAATCTCTTTCCCAAACCCAATTTTGCCCATTGATGATAAAGAGATAGCCTTCTCCTTCTTCCTCCACCGGAG
CTCCTCTGCGATAGCCATGGCCGCCACCGCCGCCTGCTCCTCTCTCAGCTCCATTCGAGTAATAGGTATAGGATTGTTCCCTTTGATTCCCAACAATGGAAATTCAAGCT
TCTCTAGAGGCTTCAAGTACCATTCTCAAAGAAATCCAAACTGCAGATTCCAAGCAATCAAATGTAGCTTATCTTTGTTGGATTCTTTTACCTCTTCCAAGATTCATCTG
TCACTGGCCTATGCCACCCCTCCATTAAAGCCAGCTGCTGCATATGAAGCTGCAAGTACTATCCCCTTTGCCTTGCAAGATTTATCGATGGCTGCTGATTTCATGAAGAA
TGTCACCTTGGCCGACCTCGACCCAGGAACGGCAAAGCTTGCGATCAGTTTTCTGGGACCATTTCTCTCGGCGTTTTCGTTTTTGTTTATAGCGAGAATCGTAATGTCTT
GGTATCCCAAGTTGCCTGTGGGGAAGTTTCCATATGTTATAGCTTATGCCCCCACTGAACCACTTCTGATTGCAACAAGGAAGGTGATACCTCCTCTCGGTGGAGTTGAC
GTAACGCCAGTCGTCTGGTTTGGATTGGTTAGTTTCCTCAACGAGATATTGCTTGGTCCCCAAGGGCTGCTTGTCCTCCTTTCTCAACAGGTCAGCTGAATCTTGATGAG
AATGCTTCAGAGAAACATTTTTGTTTTTTTTGTTCTGTATATGTAAATTGATCTTCTTAGCTGAGCCTCTTTGGTTCCAAAAGGCAAAGAATGTGAGAACTTTAAATTCC
TAATGAAAGAAATTTCCCCATCCTAGAGAGTGTAATCGGTTCGACTTCGGGTTGGTTTTGAGCTAAGACCAACGTGAACTATGTCATTTCGAGAAAATTGAACATCGTCT
GAAACTTTGATCTTTCGAGTACCGAGCTAATCAAACTTCCCAGCTACAAAAGGTGGCTGAGATTACATTGCCATTGGCTTGAAGATGATCTCATTCCAAATAATACAAAG
CTTCTTTTGTTAATCTGTATAATTTAGTTAGAAGGATATTTTTGAAGGATATTTTACCTGTACTATCAATTTTTTCTAGTCTATTTTGATCTCATTTCATGATAAAATGT
AAAATACAATAATAAGTTAATATCTTTGTCTGTG
Protein sequenceShow/hide protein sequence
MAEFHFCSSTQKKISFPNPILPIDDKEIAFSFFLHRSSSAIAMAATAACSSLSSIRVIGIGLFPLIPNNGNSSFSRGFKYHSQRNPNCRFQAIKCSLSLLDSFTSSKIHL
SLAYATPPLKPAAAYEAASTIPFALQDLSMAADFMKNVTLADLDPGTAKLAISFLGPFLSAFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLIATRKVIPPLGGVD
VTPVVWFGLVSFLNEILLGPQGLLVLLSQQVS