; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G2933 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G2933
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Descriptionprotein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic
Genome locationctg1041:5439349..5441028
RNA-Seq ExpressionCucsat.G2933
SyntenyCucsat.G2933
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR003425 - CCB3/YggT


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575811.1 Protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]1.74e-12085.71Show/hide
Query:  MAATACSSLSSIRVIGIGSSPLTPNHGNSNFCRGFKYHPQRNPNCKFQAIKCSSSLLGSFTSSKSLLSLAYVIPPLKPAAAYEAARTIPFTLQDASMAAS
        MAATACSSLS+IRVIG+GSSPL PNHGNS+FCRGF YHPQ NPNC+FQA KCSSSLLGSFTSSK  L L Y  PPLKPAA     RTIPF LQDAS+AAS
Subjt:  MAATACSSLSSIRVIGIGSSPLTPNHGNSNFCRGFKYHPQRNPNCKFQAIKCSSSLLGSFTSSKSLLSLAYVIPPLKPAAAYEAARTIPFTLQDASMAAS

Query:  DFVNSMTLADLDPGTAKLAISFLGPSLSVFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLVATRKVIPPLGGVDVTPVVWFGLISFLNEILLGPQG
        DF+N+++LADLDPGTAKLAI FLGP LS FSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLL+ATRKVIPPLGGVDVTPVVWFGL+SFLNEILLGPQG
Subjt:  DFVNSMTLADLDPGTAKLAISFLGPSLSVFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLVATRKVIPPLGGVDVTPVVWFGLISFLNEILLGPQG

Query:  LLVLLSQQVS
        LLVLLSQQVS
Subjt:  LLVLLSQQVS

XP_004136016.2 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X1 [Cucumis sativus]9.61e-142100Show/hide
Query:  MAATACSSLSSIRVIGIGSSPLTPNHGNSNFCRGFKYHPQRNPNCKFQAIKCSSSLLGSFTSSKSLLSLAYVIPPLKPAAAYEAARTIPFTLQDASMAAS
        MAATACSSLSSIRVIGIGSSPLTPNHGNSNFCRGFKYHPQRNPNCKFQAIKCSSSLLGSFTSSKSLLSLAYVIPPLKPAAAYEAARTIPFTLQDASMAAS
Subjt:  MAATACSSLSSIRVIGIGSSPLTPNHGNSNFCRGFKYHPQRNPNCKFQAIKCSSSLLGSFTSSKSLLSLAYVIPPLKPAAAYEAARTIPFTLQDASMAAS

Query:  DFVNSMTLADLDPGTAKLAISFLGPSLSVFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLVATRKVIPPLGGVDVTPVVWFGLISFLNEILLGPQG
        DFVNSMTLADLDPGTAKLAISFLGPSLSVFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLVATRKVIPPLGGVDVTPVVWFGLISFLNEILLGPQG
Subjt:  DFVNSMTLADLDPGTAKLAISFLGPSLSVFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLVATRKVIPPLGGVDVTPVVWFGLISFLNEILLGPQG

Query:  LLVLLSQQVS
        LLVLLSQQVS
Subjt:  LLVLLSQQVS

XP_004136017.2 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X3 [Cucumis sativus]1.83e-12190.95Show/hide
Query:  MAATACSSLSSIRVIGIGSSPLTPNHGNSNFCRGFKYHPQRNPNCKFQAIKCSSSLLGSFTSSKSLLSLAYVIPPLKPAAAYEAARTIPFTLQDASMAAS
        MAATACSSLSSIR                   RGFKYHPQRNPNCKFQAIKCSSSLLGSFTSSKSLLSLAYVIPPLKPAAAYEAARTIPFTLQDASMAAS
Subjt:  MAATACSSLSSIRVIGIGSSPLTPNHGNSNFCRGFKYHPQRNPNCKFQAIKCSSSLLGSFTSSKSLLSLAYVIPPLKPAAAYEAARTIPFTLQDASMAAS

Query:  DFVNSMTLADLDPGTAKLAISFLGPSLSVFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLVATRKVIPPLGGVDVTPVVWFGLISFLNEILLGPQG
        DFVNSMTLADLDPGTAKLAISFLGPSLSVFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLVATRKVIPPLGGVDVTPVVWFGLISFLNEILLGPQG
Subjt:  DFVNSMTLADLDPGTAKLAISFLGPSLSVFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLVATRKVIPPLGGVDVTPVVWFGLISFLNEILLGPQG

Query:  LLVLLSQQVS
        LLVLLSQQVS
Subjt:  LLVLLSQQVS

XP_023548987.1 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X1 [Cucurbita pepo subsp. pepo]4.97e-12085.71Show/hide
Query:  MAATACSSLSSIRVIGIGSSPLTPNHGNSNFCRGFKYHPQRNPNCKFQAIKCSSSLLGSFTSSKSLLSLAYVIPPLKPAAAYEAARTIPFTLQDASMAAS
        MAATACSSLS+IRVIG+GSSPL PNHGNS+FCRGF YHPQ NPNC+FQA KCSSSLLGSFTSSK  L L Y  PPLKPAA     RTIPF LQDASMAAS
Subjt:  MAATACSSLSSIRVIGIGSSPLTPNHGNSNFCRGFKYHPQRNPNCKFQAIKCSSSLLGSFTSSKSLLSLAYVIPPLKPAAAYEAARTIPFTLQDASMAAS

Query:  DFVNSMTLADLDPGTAKLAISFLGPSLSVFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLVATRKVIPPLGGVDVTPVVWFGLISFLNEILLGPQG
        DF+N++ LADLDPGTAKLAI  LGP LS FSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLL+ATRKVIPPLGGVDVTPVVWFGL+SFLNEILLGPQG
Subjt:  DFVNSMTLADLDPGTAKLAISFLGPSLSVFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLVATRKVIPPLGGVDVTPVVWFGLISFLNEILLGPQG

Query:  LLVLLSQQVS
        LLVLLSQQVS
Subjt:  LLVLLSQQVS

XP_031745003.1 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X2 [Cucumis sativus]4.66e-12291.43Show/hide
Query:  MAATACSSLSSIRVIGIGSSPLTPNHGNSNFCRGFKYHPQRNPNCKFQAIKCSSSLLGSFTSSKSLLSLAYVIPPLKPAAAYEAARTIPFTLQDASMAAS
        MAATACSSLSSIRVIG                  FKYHPQRNPNCKFQAIKCSSSLLGSFTSSKSLLSLAYVIPPLKPAAAYEAARTIPFTLQDASMAAS
Subjt:  MAATACSSLSSIRVIGIGSSPLTPNHGNSNFCRGFKYHPQRNPNCKFQAIKCSSSLLGSFTSSKSLLSLAYVIPPLKPAAAYEAARTIPFTLQDASMAAS

Query:  DFVNSMTLADLDPGTAKLAISFLGPSLSVFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLVATRKVIPPLGGVDVTPVVWFGLISFLNEILLGPQG
        DFVNSMTLADLDPGTAKLAISFLGPSLSVFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLVATRKVIPPLGGVDVTPVVWFGLISFLNEILLGPQG
Subjt:  DFVNSMTLADLDPGTAKLAISFLGPSLSVFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLVATRKVIPPLGGVDVTPVVWFGLISFLNEILLGPQG

Query:  LLVLLSQQVS
        LLVLLSQQVS
Subjt:  LLVLLSQQVS

TrEMBL top hitse value%identityAlignment
A0A0A0K9A7 Uncharacterized protein4.65e-142100Show/hide
Query:  MAATACSSLSSIRVIGIGSSPLTPNHGNSNFCRGFKYHPQRNPNCKFQAIKCSSSLLGSFTSSKSLLSLAYVIPPLKPAAAYEAARTIPFTLQDASMAAS
        MAATACSSLSSIRVIGIGSSPLTPNHGNSNFCRGFKYHPQRNPNCKFQAIKCSSSLLGSFTSSKSLLSLAYVIPPLKPAAAYEAARTIPFTLQDASMAAS
Subjt:  MAATACSSLSSIRVIGIGSSPLTPNHGNSNFCRGFKYHPQRNPNCKFQAIKCSSSLLGSFTSSKSLLSLAYVIPPLKPAAAYEAARTIPFTLQDASMAAS

Query:  DFVNSMTLADLDPGTAKLAISFLGPSLSVFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLVATRKVIPPLGGVDVTPVVWFGLISFLNEILLGPQG
        DFVNSMTLADLDPGTAKLAISFLGPSLSVFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLVATRKVIPPLGGVDVTPVVWFGLISFLNEILLGPQG
Subjt:  DFVNSMTLADLDPGTAKLAISFLGPSLSVFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLVATRKVIPPLGGVDVTPVVWFGLISFLNEILLGPQG

Query:  LLVLLSQQVS
        LLVLLSQQVS
Subjt:  LLVLLSQQVS

A0A1S3BSI7 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic3.27e-11587.68Show/hide
Query:  MAATA-CSSLSSIRVIGIGSSPLTPNHGNSNFCRGFKYHPQRNPNCKFQAIKCSSSLLGSFTSSKSLLSLAYVIPPLKPAAAYEAARTIPFTLQDASMAA
        MAATA CSSLSSIR                   RGFKYHPQRNPN KFQAIKCSSSLLGSFTSSK L SLAYVIPPLKPAAAYEAARTIPF LQDASMAA
Subjt:  MAATA-CSSLSSIRVIGIGSSPLTPNHGNSNFCRGFKYHPQRNPNCKFQAIKCSSSLLGSFTSSKSLLSLAYVIPPLKPAAAYEAARTIPFTLQDASMAA

Query:  SDFVNSMTLADLDPGTAKLAISFLGPSLSVFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLVATRKVIPPLGGVDVTPVVWFGLISFLNEILLGPQ
        SDFVNSMTLADLDPGTAKLAISFLGPSLSVFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLL+ATRKVIPPLGGVDVTPVVWFGL+SFLNEILLGPQ
Subjt:  SDFVNSMTLADLDPGTAKLAISFLGPSLSVFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLVATRKVIPPLGGVDVTPVVWFGLISFLNEILLGPQ

Query:  GLLVLLSQQVS
        GLLVLLSQQVS
Subjt:  GLLVLLSQQVS

A0A5A7VQJ0 Protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB36.07e-11396.63Show/hide
Query:  RGFKYHPQRNPNCKFQAIKCSSSLLGSFTSSKSLLSLAYVIPPLKPAAAYEAARTIPFTLQDASMAASDFVNSMTLADLDPGTAKLAISFLGPSLSVFSF
        RGFKYHPQRNPN KFQAIKCSSSLLGSFTSSK L SLAYVIPPLKPAAAYEAARTIPF LQDASMAASDFVNSMTLADLDPGTAKLAISFLGPSLSVFSF
Subjt:  RGFKYHPQRNPNCKFQAIKCSSSLLGSFTSSKSLLSLAYVIPPLKPAAAYEAARTIPFTLQDASMAASDFVNSMTLADLDPGTAKLAISFLGPSLSVFSF

Query:  LFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLVATRKVIPPLGGVDVTPVVWFGLISFLNEILLGPQGLLVLLSQQVS
        LFIARIVMSWYPKLPVGKFPYVIAYAPTEPLL+ATRKVIPPLGGVDVTPVVWFGL+SFLNEILLGPQGLLVLLSQQVS
Subjt:  LFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLVATRKVIPPLGGVDVTPVVWFGLISFLNEILLGPQGLLVLLSQQVS

A0A5D3D3A7 Protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB32.47e-11296.07Show/hide
Query:  RGFKYHPQRNPNCKFQAIKCSSSLLGSFTSSKSLLSLAYVIPPLKPAAAYEAARTIPFTLQDASMAASDFVNSMTLADLDPGTAKLAISFLGPSLSVFSF
        RGFKYHPQRNPN KFQAIKCSSSLLGSFTSSK L SLAYVIPPLKPAAAYEAARTIPF L+DASMAASDFVNSMTLADLDPGTAKLAISFLGPSLSVFSF
Subjt:  RGFKYHPQRNPNCKFQAIKCSSSLLGSFTSSKSLLSLAYVIPPLKPAAAYEAARTIPFTLQDASMAASDFVNSMTLADLDPGTAKLAISFLGPSLSVFSF

Query:  LFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLVATRKVIPPLGGVDVTPVVWFGLISFLNEILLGPQGLLVLLSQQVS
        LFIARIVMSWYPKLPVGKFPYVIAYAPTEPLL+ATRKVIPPLGGVDVTPVVWFGL+SFLNEILLGPQGLLVLLSQQVS
Subjt:  LFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLVATRKVIPPLGGVDVTPVVWFGLISFLNEILLGPQGLLVLLSQQVS

A0A6J1JL88 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic isoform X18.93e-11684.29Show/hide
Query:  MAATACSSLSSIRVIGIGSSPLTPNHGNSNFCRGFKYHPQRNPNCKFQAIKCSSSLLGSFTSSKSLLSLAYVIPPLKPAAAYEAARTIPFTLQDASMAAS
        MAATACSSLS+IRVIG+GSS L PNHGNS+ CRGF YHPQ NPNC+FQA KCSSSLLGSFTSSK  L L    P LKPAA     RTIPF LQDASMAAS
Subjt:  MAATACSSLSSIRVIGIGSSPLTPNHGNSNFCRGFKYHPQRNPNCKFQAIKCSSSLLGSFTSSKSLLSLAYVIPPLKPAAAYEAARTIPFTLQDASMAAS

Query:  DFVNSMTLADLDPGTAKLAISFLGPSLSVFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLVATRKVIPPLGGVDVTPVVWFGLISFLNEILLGPQG
        DF N++ LADLDPGTAKLAI FLGP LS FSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLL+ATRKVIPPLGGVDVTPVVWFGL+SFLNEILLGPQG
Subjt:  DFVNSMTLADLDPGTAKLAISFLGPSLSVFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLVATRKVIPPLGGVDVTPVVWFGLISFLNEILLGPQG

Query:  LLVLLSQQVS
        LLVLLSQQVS
Subjt:  LLVLLSQQVS

SwissProt top hitse value%identityAlignment
Q8RWM7 Protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB3, chloroplastic9.8e-4473.81Show/hide
Query:  EAARTIPFTLQDASMAASDFVNSMTLADLDPGTAKLAISFLGPSLSVFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLVATRKVIPPLGGVDVTPV
        EAA T     Q  S+  S+ + +++LADLDPGTAKLAI  LGP+LS F FLFI RIVMSWYPKLPV KFPYV+AYAPTEP+LV TRKVIPPL GVDVTPV
Subjt:  EAARTIPFTLQDASMAASDFVNSMTLADLDPGTAKLAISFLGPSLSVFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLVATRKVIPPLGGVDVTPV

Query:  VWFGLISFLNEILLGPQGLLVLLSQQ
        VWFGL+SFL+EIL+GPQGLLVL+SQQ
Subjt:  VWFGLISFLNEILLGPQGLLVLLSQQ

Arabidopsis top hitse value%identityAlignment
AT3G07430.1 YGGT family protein1.2e-0425.64Show/hide
Query:  SSSLLGSFTSSKSLLSLAYVIPPLKPAAAYEAARTIPFTLQDA---SMAASDFVNSMTLADLDPGTAKLAISFLGPS----LSVFSFLFIARIVMSWYPK
        SS+L GS  S  +L +LA  +  +       A +T    + D    S++ +  V   +L D  PG     ++ +       L ++S + + R+++SW+P 
Subjt:  SSSLLGSFTSSKSLLSLAYVIPPLKPAAAYEAARTIPFTLQDA---SMAASDFVNSMTLADLDPGTAKLAISFLGPS----LSVFSFLFIARIVMSWYPK

Query:  LPVGKFPYVIAYAPTEPLLVATRKVIPPL-GGVDVTPVVWFGLISFLNEILLGPQG
        +P  + P        +P L   R +IPP+   +DV+P++ F ++  L  I+ G  G
Subjt:  LPVGKFPYVIAYAPTEPLLVATRKVIPPL-GGVDVTPVVWFGLISFLNEILLGPQG

AT4G27990.1 YGGT family protein7.8e-0429.33Show/hide
Query:  LSVFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLVATRKVIPPL-GGVDVTPVVWFGLISFLNEILLGPQG
        L ++S + + R+++SW+P +P  + P        +P L   R +IPP+   +DV+P++ F ++  L  IL   +G
Subjt:  LSVFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLVATRKVIPPL-GGVDVTPVVWFGLISFLNEILLGPQG

AT5G36120.1 cofactor assembly, complex C (B6F)6.9e-4573.81Show/hide
Query:  EAARTIPFTLQDASMAASDFVNSMTLADLDPGTAKLAISFLGPSLSVFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLVATRKVIPPLGGVDVTPV
        EAA T     Q  S+  S+ + +++LADLDPGTAKLAI  LGP+LS F FLFI RIVMSWYPKLPV KFPYV+AYAPTEP+LV TRKVIPPL GVDVTPV
Subjt:  EAARTIPFTLQDASMAASDFVNSMTLADLDPGTAKLAISFLGPSLSVFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLVATRKVIPPLGGVDVTPV

Query:  VWFGLISFLNEILLGPQGLLVLLSQQ
        VWFGL+SFL+EIL+GPQGLLVL+SQQ
Subjt:  VWFGLISFLNEILLGPQGLLVLLSQQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
CCTATTGATGATAAAGGGATAGCCTTCTCATTCTTTCTCCACCGGAGCTCCTCTGTGAAACCCATGGCTGCCACCGCCTGCTCCTCTCTCAGCTCCATTCGAGTAATAGG
TATAGGATCATCCCCTTTGACTCCCAACCATGGAAATTCAAATTTCTGTAGAGGCTTCAAGTACCATCCTCAAAGAAATCCAAACTGCAAATTCCAAGCAATCAAATGTA
GCTCATCTTTGTTGGGTTCTTTTACCTCTTCCAAGAGTCTTCTGTCATTGGCCTATGTCATCCCTCCATTAAAGCCAGCTGCTGCTTACGAAGCTGCAAGGACTATCCCC
TTTACCTTGCAAGATGCATCAATGGCTGCCTCTGATTTTGTGAACAGTATGACTTTGGCTGACCTCGACCCAGGAACGGCAAAGCTTGCGATCAGCTTTCTGGGGCCGTC
TCTCTCGGTGTTTTCGTTTTTGTTTATTGCTAGAATAGTAATGTCTTGGTATCCCAAGTTGCCTGTGGGGAAGTTTCCATATGTCATAGCTTATGCCCCCACTGAACCAC
TTCTAGTTGCAACAAGGAAGGTGATCCCTCCTCTCGGCGGAGTTGACGTAACACCAGTTGTCTGGTTCGGATTGATTAGTTTCCTCAATGAGATATTGCTCGGTCCCCAA
GGGCTCCTTGTCCTCCTTTCTCAACAAGTCAGCTAA
mRNA sequenceShow/hide mRNA sequence
CCTATTGATGATAAAGGGATAGCCTTCTCATTCTTTCTCCACCGGAGCTCCTCTGTGAAACCCATGGCTGCCACCGCCTGCTCCTCTCTCAGCTCCATTCGAGTAATAGG
TATAGGATCATCCCCTTTGACTCCCAACCATGGAAATTCAAATTTCTGTAGAGGCTTCAAGTACCATCCTCAAAGAAATCCAAACTGCAAATTCCAAGCAATCAAATGTA
GCTCATCTTTGTTGGGTTCTTTTACCTCTTCCAAGAGTCTTCTGTCATTGGCCTATGTCATCCCTCCATTAAAGCCAGCTGCTGCTTACGAAGCTGCAAGGACTATCCCC
TTTACCTTGCAAGATGCATCAATGGCTGCCTCTGATTTTGTGAACAGTATGACTTTGGCTGACCTCGACCCAGGAACGGCAAAGCTTGCGATCAGCTTTCTGGGGCCGTC
TCTCTCGGTGTTTTCGTTTTTGTTTATTGCTAGAATAGTAATGTCTTGGTATCCCAAGTTGCCTGTGGGGAAGTTTCCATATGTCATAGCTTATGCCCCCACTGAACCAC
TTCTAGTTGCAACAAGGAAGGTGATCCCTCCTCTCGGCGGAGTTGACGTAACACCAGTTGTCTGGTTCGGATTGATTAGTTTCCTCAATGAGATATTGCTCGGTCCCCAA
GGGCTCCTTGTCCTCCTTTCTCAACAAGTCAGCTAA
Protein sequenceShow/hide protein sequence
PIDDKGIAFSFFLHRSSSVKPMAATACSSLSSIRVIGIGSSPLTPNHGNSNFCRGFKYHPQRNPNCKFQAIKCSSSLLGSFTSSKSLLSLAYVIPPLKPAAAYEAARTIP
FTLQDASMAASDFVNSMTLADLDPGTAKLAISFLGPSLSVFSFLFIARIVMSWYPKLPVGKFPYVIAYAPTEPLLVATRKVIPPLGGVDVTPVVWFGLISFLNEILLGPQ
GLLVLLSQQVS