; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G23355 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G23355
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionGPI-anchored protein LLG1
Genome locationClcChr09:36724432..36730323
RNA-Seq ExpressionClc09G23355
SyntenyClc09G23355
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR039307 - GPI-anchored protein LORELEI-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF3565159.1 hypothetical protein DY000_02017361, partial [Brassica cretica]9.8e-6565.19Show/hide
Query:  CPVNFEFLNYTIITSKCKGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPALPPATSADINWGSITHC
        CPVNFEF+NYTIITS+CKGP+YPP  CCSA KEFACPYAD LNDL NDCA+TMFSYINLYGKYPPGLF++ C+E  +G                     C
Subjt:  CPVNFEFLNYTIITSKCKGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPALPPATSADINWGSITHC

Query:  PVSFEFLNYTIITSKCKGPLYSPKLCCSALTELVCPYVDVMNDMTTDCASTMFSNINLYGKYPPGLFSSQCREGVKGLACP
        PV+FEF+NYTIITS+CKGP Y PK CCSA  E  CPY D +ND+  DCA+TMFS INLYGKYPPGLF++ C+E   GL CP
Subjt:  PVSFEFLNYTIITSKCKGPLYSPKLCCSALTELVCPYVDVMNDMTTDCASTMFSNINLYGKYPPGLFSSQCREGVKGLACP

OVA01383.1 hypothetical protein BVC80_437g4 [Macleaya cordata]4.0e-6646.18Show/hide
Query:  ACPVNFEFLNYTIITSKCKGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPALPPATSADINWGSIT-
        ACPVNFEF NYTIITS+CKGP+YPP LCC A KEFACP+ D LNDL+NDCASTMFSYINLYGKYPPGLF+SEC+EGK GL CPA P  TSA++N GSI+ 
Subjt:  ACPVNFEFLNYTIITSKCKGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPALPPATSADINWGSIT-

Query:  ---------------------------------------------------------------------------------HCPVSFEFLNYTIITSKCK
                                                                                          CPV  E  NYTIITSKC 
Subjt:  ---------------------------------------------------------------------------------HCPVSFEFLNYTIITSKCK

Query:  GPLYSPKLCCSALTELVCPYVDVMNDMTTDCASTMFSNINLYGKYPPGLFSSQCREGVKGLACP-PLPPPS---DSNSALLLKRSSPSIIIASAGVVLLL
        GP Y PK CCS   ELVCP+ +  ND+T +CAST+F  + +YG YP GLFS+ CREG  GL CP P P  S   D+N    + R+  S++I +  + LL+
Subjt:  GPLYSPKLCCSALTELVCPYVDVMNDMTTDCASTMFSNINLYGKYPPGLFSSQCREGVKGLACP-PLPPPS---DSNSALLLKRSSPSIIIASAGVVLLL

Query:  L
        L
Subjt:  L

TXG68121.1 hypothetical protein EZV62_009396 [Acer yangbiense]4.3e-6050.82Show/hide
Query:  KGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPALPPATSAD---------------------INWGS
        KGP++P   CC+A K+FACPYAD++NDLT DCASTMFSYINLYGKYPPGLF+++C++GK GLECPA  P+ S +                     +   S
Subjt:  KGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPALPPATSAD---------------------INWGS

Query:  ITH----------------------CPVSFEFLNYTIITSKCKGPLYSPKLCCSALTELVCPYVDVMNDMTTDCASTMFSNINLYGKYPPGLFSSQCREG
         T                       CPV+FEFLNYTIITSKCKGP +  + CC+AL +  CPY D +ND+TTDCASTMFS INLYGKYPPGLF+S+CREG
Subjt:  ITH----------------------CPVSFEFLNYTIITSKCKGPLYSPKLCCSALTELVCPYVDVMNDMTTDCASTMFSNINLYGKYPPGLFSSQCREG

Query:  VKGLACPPLPPPSDSNSALLLKRSSPSII--IASAGVVLLLLSL
          GL CP   P    +++     S PS++  I S  +VLL  SL
Subjt:  VKGLACPPLPPPSDSNSALLLKRSSPSII--IASAGVVLLLLSL

XP_002531773.2 GPI-anchored protein LLG1 [Ricinus communis]1.8e-7965.6Show/hide
Query:  ACPVNFEFLNYTIITSKCKGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPALPPATSADINWGSITH
        ACPVNFEF NYTIITS+CKGP+YPP  CC+A KEFACP+AD LNDLTN+CASTMFSYINLYGKYPPGLF+S C+EGK+GL CPA PP  S D        
Subjt:  ACPVNFEFLNYTIITSKCKGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPALPPATSADINWGSITH

Query:  CPVSFEFLNYTIITSKCKGPLYSPKLCCSALTELVCPYVDVMNDMTTDCASTMFSNINLYGKYPPGLFSSQCREGVKGLACPPLPPPS----DSNSALLL
        CPV+FEF NYT+ITS+CKGP Y+P+ CC+A  E  CP+ DV+ND+T DCASTMFS INLYGKYPPGLF+S+CREG +GL C P PPPS    D N  ++ 
Subjt:  CPVSFEFLNYTIITSKCKGPLYSPKLCCSALTELVCPYVDVMNDMTTDCASTMFSNINLYGKYPPGLFSSQCREGVKGLACPPLPPPS----DSNSALLL

Query:  KRSSPSIIIASAGVVLLL
           +PS+++ S  +VLL+
Subjt:  KRSSPSIIIASAGVVLLL

XP_031105626.1 GPI-anchored protein LLG1-like [Ipomoea triloba]1.7e-7256.96Show/hide
Query:  ACPVNFEFLNYTIITSKCKGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPALPPATSAD------IN
        ACP+NFEF NYT+ITS+CKGP+YPP LCC A  EFACPYA+ LNDLTN CA+TMFSYINLYGKYPPGLF++ECK  K GLECPAL P+  ++      IN
Subjt:  ACPVNFEFLNYTIITSKCKGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPALPPATSAD------IN

Query:  WGSITH--------------CPVSFEFLNYTIITSKCKGPLYSPKLCCSALTELVCPYVDVMNDMTTDCASTMFSNINLYGKYPPGLFSSQCREGVKGLA
        +G                  CP++FEF NYT+ITS+CKGP Y PK CC A T+  CPY + +ND+T DCASTMFS INLYGKYPPGLF+++C+   KGL 
Subjt:  WGSITH--------------CPVSFEFLNYTIITSKCKGPLYSPKLCCSALTELVCPYVDVMNDMTTDCASTMFSNINLYGKYPPGLFSSQCREGVKGLA

Query:  CPPLPPPSDSNS-ALLLKRSSPSIIIASAGVVLLLLS
        CP L P   SN  +       PS ++  A + L+LL+
Subjt:  CPPLPPPSDSNS-ALLLKRSSPSIIIASAGVVLLLLS

TrEMBL top hitse value%identityAlignment
A0A0A0K362 Uncharacterized protein1.9e-5089.22Show/hide
Query:  ACPVNFEFLNYTIITSKCKGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPALPPATSADINWGSITH
        ACPVNFEFLNYTIITSKCKGPRYPP  CCSALKEFACPY +DLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGK+GLECPALPP+TSAD++WGS T 
Subjt:  ACPVNFEFLNYTIITSKCKGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPALPPATSADINWGSITH

Query:  CP
         P
Subjt:  CP

A0A200PT35 Uncharacterized protein1.9e-6646.18Show/hide
Query:  ACPVNFEFLNYTIITSKCKGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPALPPATSADINWGSIT-
        ACPVNFEF NYTIITS+CKGP+YPP LCC A KEFACP+ D LNDL+NDCASTMFSYINLYGKYPPGLF+SEC+EGK GL CPA P  TSA++N GSI+ 
Subjt:  ACPVNFEFLNYTIITSKCKGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPALPPATSADINWGSIT-

Query:  ---------------------------------------------------------------------------------HCPVSFEFLNYTIITSKCK
                                                                                          CPV  E  NYTIITSKC 
Subjt:  ---------------------------------------------------------------------------------HCPVSFEFLNYTIITSKCK

Query:  GPLYSPKLCCSALTELVCPYVDVMNDMTTDCASTMFSNINLYGKYPPGLFSSQCREGVKGLACP-PLPPPS---DSNSALLLKRSSPSIIIASAGVVLLL
        GP Y PK CCS   ELVCP+ +  ND+T +CAST+F  + +YG YP GLFS+ CREG  GL CP P P  S   D+N    + R+  S++I +  + LL+
Subjt:  GPLYSPKLCCSALTELVCPYVDVMNDMTTDCASTMFSNINLYGKYPPGLFSSQCREGVKGLACP-PLPPPS---DSNSALLLKRSSPSIIIASAGVVLLL

Query:  L
        L
Subjt:  L

A0A5C7IG61 CRAL-TRIO domain-containing protein2.1e-6050.82Show/hide
Query:  KGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPALPPATSAD---------------------INWGS
        KGP++P   CC+A K+FACPYAD++NDLT DCASTMFSYINLYGKYPPGLF+++C++GK GLECPA  P+ S +                     +   S
Subjt:  KGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPALPPATSAD---------------------INWGS

Query:  ITH----------------------CPVSFEFLNYTIITSKCKGPLYSPKLCCSALTELVCPYVDVMNDMTTDCASTMFSNINLYGKYPPGLFSSQCREG
         T                       CPV+FEFLNYTIITSKCKGP +  + CC+AL +  CPY D +ND+TTDCASTMFS INLYGKYPPGLF+S+CREG
Subjt:  ITH----------------------CPVSFEFLNYTIITSKCKGPLYSPKLCCSALTELVCPYVDVMNDMTTDCASTMFSNINLYGKYPPGLFSSQCREG

Query:  VKGLACPPLPPPSDSNSALLLKRSSPSII--IASAGVVLLLLSL
          GL CP   P    +++     S PS++  I S  +VLL  SL
Subjt:  VKGLACPPLPPPSDSNSALLLKRSSPSII--IASAGVVLLLLSL

A0A6J1F8E1 GPI-anchored protein LLG15.6e-5081.74Show/hide
Query:  LSRFQIEVRKLCSACPVNFEFLNYTIITSKCKGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPALPP
        + R  ++ RK   ACPVNFEFLNYTIITSKCKGPRYPP  CC AL EFACPYA DLNDLTNDCASTMFSYINLYGKYPPGLFSSECK GKQGLECPALPP
Subjt:  LSRFQIEVRKLCSACPVNFEFLNYTIITSKCKGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPALPP

Query:  ATSADINWGSITHCP
        +TSAD+NWGSI   P
Subjt:  ATSADINWGSITHCP

A0A6J1IFC4 GPI-anchored protein LLG1-like5.6e-5081.74Show/hide
Query:  LSRFQIEVRKLCSACPVNFEFLNYTIITSKCKGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPALPP
        + R  ++ RK   ACPVNFEFLNYTIITSKCKGPRYPP  CC AL EFACPYA DLNDLTNDCASTMFSYINLYGKYPPGLFSSECK GKQGLECPALPP
Subjt:  LSRFQIEVRKLCSACPVNFEFLNYTIITSKCKGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPALPP

Query:  ATSADINWGSITHCP
        +TSAD+NWGSI   P
Subjt:  ATSADINWGSITHCP

SwissProt top hitse value%identityAlignment
B3GS44 GPI-anchored protein LORELEI9.6e-3154.46Show/hide
Query:  CPVNFEFLNYTIITSKCKGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPALPPATSADINWGSITHC
        C VNFE+++Y ++T +CKGP +P   CCSA KEFACPY   +ND+ +DCA TMFSY+N+YG YP GLF++EC+E K GL CP LPP  S ++N  +    
Subjt:  CPVNFEFLNYTIITSKCKGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPALPPATSADINWGSITHC

Query:  P
        P
Subjt:  P

Q6NLF4 GPI-anchored protein LLG22.3e-3264.13Show/hide
Query:  SACPVNFEFLNYTIITSKCKGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPALPPATSA
        + C  +F   NYTIITS+CKGP YP N+CCSA K+FACP+A+ LND  NDCASTMFSYINLYG+YPPG+F++ CKEGK+GL+C  +  + SA
Subjt:  SACPVNFEFLNYTIITSKCKGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPALPPATSA

Q9FKT1 GPI-anchored protein LLG13.8e-4369.64Show/hide
Query:  LNLSRFQIEVRKLCSACPVNFEFLNYTIITSKCKGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPA-
        L L R  ++ +K    CPVNFEF+NYTIITSKCKGP+YPP  CC A K+FACPY D LNDL++DCA+TMFSYINLYGKYPPGLF+++CKEGK+GLECPA 
Subjt:  LNLSRFQIEVRKLCSACPVNFEFLNYTIITSKCKGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPA-

Query:  --LPPATSADIN
          LPP TSA++N
Subjt:  --LPPATSADIN

Q9M0I0 GPI-anchored protein LLG35.1e-3263.04Show/hide
Query:  SACPVNFEFLNYTIITSKCKGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPALPPATSA
        + C  +F   NYTIITSKCKGP YP  +CCSA K+FACP+A+ LND   DCASTMFSYINLYG+YPPG+F++ CKEGK+GL+C  + P +S+
Subjt:  SACPVNFEFLNYTIITSKCKGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPALPPATSA

Arabidopsis top hitse value%identityAlignment
AT2G20700.1 LORELEI-LIKE-GPI ANCHORED PROTEIN 21.6e-3364.13Show/hide
Query:  SACPVNFEFLNYTIITSKCKGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPALPPATSA
        + C  +F   NYTIITS+CKGP YP N+CCSA K+FACP+A+ LND  NDCASTMFSYINLYG+YPPG+F++ CKEGK+GL+C  +  + SA
Subjt:  SACPVNFEFLNYTIITSKCKGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPALPPATSA

AT4G26466.1 lorelei6.8e-3254.46Show/hide
Query:  CPVNFEFLNYTIITSKCKGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPALPPATSADINWGSITHC
        C VNFE+++Y ++T +CKGP +P   CCSA KEFACPY   +ND+ +DCA TMFSY+N+YG YP GLF++EC+E K GL CP LPP  S ++N  +    
Subjt:  CPVNFEFLNYTIITSKCKGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPALPPATSADINWGSITHC

Query:  P
        P
Subjt:  P

AT4G28280.1 LORELEI-LIKE-GPI ANCHORED PROTEIN 34.3e-3461.86Show/hide
Query:  VRKLCSACPVNFEFLNYTIITSKCKGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPALPPATSA
        V  + +AC  +F   NYTIITSKCKGP YP  +CCSA K+FACP+A+ LND   DCASTMFSYINLYG+YPPG+F++ CKEGK+GL+C  + P +S+
Subjt:  VRKLCSACPVNFEFLNYTIITSKCKGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPALPPATSA

AT4G28280.2 LORELEI-LIKE-GPI ANCHORED PROTEIN 33.6e-3363.04Show/hide
Query:  SACPVNFEFLNYTIITSKCKGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPALPPATSA
        + C  +F   NYTIITSKCKGP YP  +CCSA K+FACP+A+ LND   DCASTMFSYINLYG+YPPG+F++ CKEGK+GL+C  + P +S+
Subjt:  SACPVNFEFLNYTIITSKCKGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPALPPATSA

AT5G56170.1 LORELEI-LIKE-GPI-ANCHORED PROTEIN 12.7e-4469.64Show/hide
Query:  LNLSRFQIEVRKLCSACPVNFEFLNYTIITSKCKGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPA-
        L L R  ++ +K    CPVNFEF+NYTIITSKCKGP+YPP  CC A K+FACPY D LNDL++DCA+TMFSYINLYGKYPPGLF+++CKEGK+GLECPA 
Subjt:  LNLSRFQIEVRKLCSACPVNFEFLNYTIITSKCKGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPA-

Query:  --LPPATSADIN
          LPP TSA++N
Subjt:  --LPPATSADIN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTAACCTCTCCAGATTTCAAATAGAAGTTCGAAAACTTTGTAGCGCTTGTCCAGTGAACTTTGAATTCTTGAACTACACCATCATCACAAGCAAGTGCAAGGGGCC
TCGATATCCTCCCAACCTGTGTTGTTCAGCTTTGAAAGAGTTTGCTTGCCCTTATGCCGACGATCTCAACGATTTAACAAATGATTGCGCTTCAACCATGTTCAGTTACA
TTAACCTGTATGGAAAATACCCTCCTGGTCTCTTCTCTAGCGAGTGCAAGGAGGGGAAGCAAGGTCTAGAATGCCCTGCGCTCCCACCCGCCACTTCAGCTGATATAAAT
TGGGGCTCGATAACACATTGCCCTGTTAGCTTTGAATTTTTGAACTACACAATCATCACAAGCAAATGCAAAGGGCCTCTATATTCTCCAAAGCTATGTTGTTCAGCTCT
AACCGAACTTGTTTGCCCTTATGTTGATGTTATGAATGATATGACAACTGATTGTGCTTCAACCATGTTTAGCAATATTAACCTCTATGGAAAGTATCCACCTGGCCTTT
TCTCGAGCCAGTGTCGCGAAGGAGTCAAAGGTCTTGCCTGCCCTCCATTGCCACCTCCATCTGACTCGAACTCAGCTTTGCTCTTGAAACGTTCGTCTCCTTCAATCATC
ATTGCCTCAGCTGGAGTTGTACTTCTGCTTTTATCGCTTTTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTTAACCTCTCCAGATTTCAAATAGAAGTTCGAAAACTTTGTAGCGCTTGTCCAGTGAACTTTGAATTCTTGAACTACACCATCATCACAAGCAAGTGCAAGGGGCC
TCGATATCCTCCCAACCTGTGTTGTTCAGCTTTGAAAGAGTTTGCTTGCCCTTATGCCGACGATCTCAACGATTTAACAAATGATTGCGCTTCAACCATGTTCAGTTACA
TTAACCTGTATGGAAAATACCCTCCTGGTCTCTTCTCTAGCGAGTGCAAGGAGGGGAAGCAAGGTCTAGAATGCCCTGCGCTCCCACCCGCCACTTCAGCTGATATAAAT
TGGGGCTCGATAACACATTGCCCTGTTAGCTTTGAATTTTTGAACTACACAATCATCACAAGCAAATGCAAAGGGCCTCTATATTCTCCAAAGCTATGTTGTTCAGCTCT
AACCGAACTTGTTTGCCCTTATGTTGATGTTATGAATGATATGACAACTGATTGTGCTTCAACCATGTTTAGCAATATTAACCTCTATGGAAAGTATCCACCTGGCCTTT
TCTCGAGCCAGTGTCGCGAAGGAGTCAAAGGTCTTGCCTGCCCTCCATTGCCACCTCCATCTGACTCGAACTCAGCTTTGCTCTTGAAACGTTCGTCTCCTTCAATCATC
ATTGCCTCAGCTGGAGTTGTACTTCTGCTTTTATCGCTTTTGTGA
Protein sequenceShow/hide protein sequence
MLNLSRFQIEVRKLCSACPVNFEFLNYTIITSKCKGPRYPPNLCCSALKEFACPYADDLNDLTNDCASTMFSYINLYGKYPPGLFSSECKEGKQGLECPALPPATSADIN
WGSITHCPVSFEFLNYTIITSKCKGPLYSPKLCCSALTELVCPYVDVMNDMTTDCASTMFSNINLYGKYPPGLFSSQCREGVKGLACPPLPPPSDSNSALLLKRSSPSII
IASAGVVLLLLSLL