CuGenDBv2

Cmc09g0242771 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc09g0242771
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: CASP-like protein
Genome location: CMiso1.1chr09:5045452..5049420
RNA-Seq Expression: Cmc09g0242771
Synteny: Cmc09g0242771
Gene Ontology terms: GO:0005886 - plasma membrane (cellular component)
                     GO:0051539 - 4 iron, 4 sulfur cluster binding (molecular function)
InterPro domains: IPR006702 - Casparian strip membrane protein domain
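The genome location field above can be split into its parts with a short script. This is a minimal sketch, assuming the field follows a `<sequence id>:<start>..<end>` layout and that the coordinates are 1-based and inclusive (the record itself does not state the convention); `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

# Assumed layout: "<sequence id>:<start>..<end>", e.g. "CMiso1.1chr09:5045452..5049420".
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a location string into (sequence id, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


seqid, start, end = parse_location("CMiso1.1chr09:5045452..5049420")

# Span length under the assumption of 1-based, inclusive coordinates.
span = end - start + 1
print(seqid, start, end, span)
```

Under that inclusive-coordinate assumption the gene region spans 3,969 bp, which is longer than the mRNA sequence listed further down; that is expected if the region contains introns.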


Homology
GenBank top hits | e-value | % identity
XP_008458887.1 PREDICTED: CASP-like protein 4D1 [Cucumis melo] | 2.4e-72 | 100
Query:  MEAKMATKIASFVLRVLTFVFLLVSIIVLGTNSKTIGNDEVHFHNVNSYRYAMATIIIGGAFNLLQIALALYRLVTKTDGSILFDFYGDKVLSYFLLAGA
        MEAKMATKIASFVLRVLTFVFLLVSIIVLGTNSKTIGNDEVHFHNVNSYRYAMATIIIGGAFNLLQIALALYRLVTKTDGSILFDFYGDKVLSYFLLAGA
Subjt:  MEAKMATKIASFVLRVLTFVFLLVSIIVLGTNSKTIGNDEVHFHNVNSYRYAMATIIIGGAFNLLQIALALYRLVTKTDGSILFDFYGDKVLSYFLLAGA

Query:  AAALGSSVDLKANMDMLNSFFDQGNAAAALLLLAFLCSAIISVLSSLALSNKPN
        AAALGSSVDLKANMDMLNSFFDQGNAAAALLLLAFLCSAIISVLSSLALSNKPN
Subjt:  AAALGSSVDLKANMDMLNSFFDQGNAAAALLLLAFLCSAIISVLSSLALSNKPN

XP_022939465.1 CASP-like protein 4D1 [Cucurbita moschata] | 1.8e-62 | 85.62
Query:  MEAKMATKIASFVLRVLTFVFLLVSIIVLGTNSKTIGNDEVHFHNVNSYRYAMATIIIGGAFNLLQIALALYRLVTKTDGSILFDFYGDKVLSYFLLAGA
        MEAKM TKIASFVLRVLTFVFLL+SIIVLGT SKT+G++++ FHNVNSYR+AMATIIIGGAFNLLQIALALYRL+TK+DG +LFDF+GDKVLSYFLLAGA
Subjt:  MEAKMATKIASFVLRVLTFVFLLVSIIVLGTNSKTIGNDEVHFHNVNSYRYAMATIIIGGAFNLLQIALALYRLVTKTDGSILFDFYGDKVLSYFLLAGA

Query:  AAALGSSVDLKANMDMLNSFFDQGNAAAALLLLAFLCSAIISVLSSLALSNKP
        AAALGS++DLKAN+D LNSFFDQGNAAAALLLLAFLCSAI+SVLSS ALS+KP
Subjt:  AAALGSSVDLKANMDMLNSFFDQGNAAAALLLLAFLCSAIISVLSSLALSNKP

XP_022992769.1 CASP-like protein 4D1 [Cucurbita maxima] | 7.9e-63 | 85.62
Query:  MEAKMATKIASFVLRVLTFVFLLVSIIVLGTNSKTIGNDEVHFHNVNSYRYAMATIIIGGAFNLLQIALALYRLVTKTDGSILFDFYGDKVLSYFLLAGA
        MEAKM TKIASFVLRVLTFVFLL+SIIVLGT SKT+G++++ FHNVNSYR+AMATIIIGGAFNLLQIALALYRL+TKTDG +LFDF+GDKVLSYFLLAGA
Subjt:  MEAKMATKIASFVLRVLTFVFLLVSIIVLGTNSKTIGNDEVHFHNVNSYRYAMATIIIGGAFNLLQIALALYRLVTKTDGSILFDFYGDKVLSYFLLAGA

Query:  AAALGSSVDLKANMDMLNSFFDQGNAAAALLLLAFLCSAIISVLSSLALSNKP
        AAALGS++DLKAN+D LNSFFDQGNAAAALLLLAFLCSA++SVLSS ALS+KP
Subjt:  AAALGSSVDLKANMDMLNSFFDQGNAAAALLLLAFLCSAIISVLSSLALSNKP

XP_023551429.1 CASP-like protein 4D1 [Cucurbita pepo subsp. pepo] | 4.6e-63 | 86.27
Query:  MEAKMATKIASFVLRVLTFVFLLVSIIVLGTNSKTIGNDEVHFHNVNSYRYAMATIIIGGAFNLLQIALALYRLVTKTDGSILFDFYGDKVLSYFLLAGA
        ME KM TKIASFVLRVLTFVFLL+SIIVLGT SKT+G+D+V FHNVNSYR+AMATIIIGGAFNLLQIALALYRL+TKTDG +LFDF+GDKVLSYFLLAGA
Subjt:  MEAKMATKIASFVLRVLTFVFLLVSIIVLGTNSKTIGNDEVHFHNVNSYRYAMATIIIGGAFNLLQIALALYRLVTKTDGSILFDFYGDKVLSYFLLAGA

Query:  AAALGSSVDLKANMDMLNSFFDQGNAAAALLLLAFLCSAIISVLSSLALSNKP
        AAALGS++DLKAN+D LNSFFDQGNAAAALLLLAFLCSA++SVLSS ALS+KP
Subjt:  AAALGSSVDLKANMDMLNSFFDQGNAAAALLLLAFLCSAIISVLSSLALSNKP

XP_031741109.1 CASP-like protein 4D1 isoform X1 [Cucumis sativus] | 3.2e-64 | 88.31
Query:  MEAKMATKIASFVLRVLTFVFLLVSIIVLGTNSKTIGNDEVHFHNVNSYRYAMATIIIGGAFNLLQIALALYRLVTKTDGSILFDFYGDKVLSYFLLAGA
        MEAKM TKIASFVLRVL F FLL+SIIVLGTNSKT+GN E HFHNVNSYR+AMATII+GGAFNLLQIAL+LYRL+TKTDGSILFDFYGDK+LSY LLAGA
Subjt:  MEAKMATKIASFVLRVLTFVFLLVSIIVLGTNSKTIGNDEVHFHNVNSYRYAMATIIIGGAFNLLQIALALYRLVTKTDGSILFDFYGDKVLSYFLLAGA

Query:  AAALGSSVDLKANMDMLNSFFDQGNAAAALLLLAFLCSAIISVLSSLALSNKPN
        AAALGSS+DLKANM   +SFFDQGNAAAALLLLAFLCSAIISVLSSLALSNKPN
Subjt:  AAALGSSVDLKANMDMLNSFFDQGNAAAALLLLAFLCSAIISVLSSLALSNKPN

TrEMBL top hits | e-value | % identity
A0A1S3C911 CASP-like protein | 1.2e-72 | 100
Query:  MEAKMATKIASFVLRVLTFVFLLVSIIVLGTNSKTIGNDEVHFHNVNSYRYAMATIIIGGAFNLLQIALALYRLVTKTDGSILFDFYGDKVLSYFLLAGA
        MEAKMATKIASFVLRVLTFVFLLVSIIVLGTNSKTIGNDEVHFHNVNSYRYAMATIIIGGAFNLLQIALALYRLVTKTDGSILFDFYGDKVLSYFLLAGA
Subjt:  MEAKMATKIASFVLRVLTFVFLLVSIIVLGTNSKTIGNDEVHFHNVNSYRYAMATIIIGGAFNLLQIALALYRLVTKTDGSILFDFYGDKVLSYFLLAGA

Query:  AAALGSSVDLKANMDMLNSFFDQGNAAAALLLLAFLCSAIISVLSSLALSNKPN
        AAALGSSVDLKANMDMLNSFFDQGNAAAALLLLAFLCSAIISVLSSLALSNKPN
Subjt:  AAALGSSVDLKANMDMLNSFFDQGNAAAALLLLAFLCSAIISVLSSLALSNKPN

A0A6J1BZ60 CASP-like protein | 1.3e-47 | 72.67
Query:  MATKIASFVLRVLTFVFLLVSIIVLGTNSKTIGNDEVHFHNVNSYRYAMATIIIGGAFNLLQIALALYRLVTKTDGSILFDFYGDKVLSYFLLAGAAAAL
        M +KIAS VLRV+TFVFL +SIIVL TNS+  GN  +HFHNVNSYR+AMATIIIGGAFNLLQIAL+LY +++K+ G++LF FYGDKVLSY LL+GAAA L
Subjt:  MATKIASFVLRVLTFVFLLVSIIVLGTNSKTIGNDEVHFHNVNSYRYAMATIIIGGAFNLLQIALALYRLVTKTDGSILFDFYGDKVLSYFLLAGAAAAL

Query:  GSSVDLKANMDMLNSFFDQGNAAAALLLLAFLCSAIISVLSSLALSNKPN
        G  +DL AN++ + SFF +GNAAAALLL+AFLCSA IS+LSSLALSNKPN
Subjt:  GSSVDLKANMDMLNSFFDQGNAAAALLLLAFLCSAIISVLSSLALSNKPN

A0A6J1C1A4 CASP-like protein | 9.8e-35 | 50.92
Query:  MEAKMATKIASFVLRVLTFVFLLVSIIVLGTNSKTI-----GNDEVHFHNVNSYRYAMATIIIGGAFNLLQIALALYRLVTKTDGSILFDFYGDKVLSYF
        M A   ++IAS +LR+LTFV + +S++++ TNSKT+        +V F +V SYRY  A  +IG A +LLQIAL LY +VTK+DG+  FD + DK+L+Y 
Subjt:  MEAKMATKIASFVLRVLTFVFLLVSIIVLGTNSKTI-----GNDEVHFHNVNSYRYAMATIIIGGAFNLLQIALALYRLVTKTDGSILFDFYGDKVLSYF

Query:  LLAGAAAALGSSVDLKANM-----DMLNSFFDQGNAAAALLLLAFLCSAIISVLSSLALSNKP
        LL+GA+A LG+ +DL++N      ++ NSFFD+G+A+AA+LLLAF+CSA++S+LSSLAL  KP
Subjt:  LLAGAAAALGSSVDLKANM-----DMLNSFFDQGNAAAALLLLAFLCSAIISVLSSLALSNKP

A0A6J1FMS6 CASP-like protein | 8.5e-63 | 85.62
Query:  MEAKMATKIASFVLRVLTFVFLLVSIIVLGTNSKTIGNDEVHFHNVNSYRYAMATIIIGGAFNLLQIALALYRLVTKTDGSILFDFYGDKVLSYFLLAGA
        MEAKM TKIASFVLRVLTFVFLL+SIIVLGT SKT+G++++ FHNVNSYR+AMATIIIGGAFNLLQIALALYRL+TK+DG +LFDF+GDKVLSYFLLAGA
Subjt:  MEAKMATKIASFVLRVLTFVFLLVSIIVLGTNSKTIGNDEVHFHNVNSYRYAMATIIIGGAFNLLQIALALYRLVTKTDGSILFDFYGDKVLSYFLLAGA

Query:  AAALGSSVDLKANMDMLNSFFDQGNAAAALLLLAFLCSAIISVLSSLALSNKP
        AAALGS++DLKAN+D LNSFFDQGNAAAALLLLAFLCSAI+SVLSS ALS+KP
Subjt:  AAALGSSVDLKANMDMLNSFFDQGNAAAALLLLAFLCSAIISVLSSLALSNKP

A0A6J1JYE7 CASP-like protein | 3.8e-63 | 85.62
Query:  MEAKMATKIASFVLRVLTFVFLLVSIIVLGTNSKTIGNDEVHFHNVNSYRYAMATIIIGGAFNLLQIALALYRLVTKTDGSILFDFYGDKVLSYFLLAGA
        MEAKM TKIASFVLRVLTFVFLL+SIIVLGT SKT+G++++ FHNVNSYR+AMATIIIGGAFNLLQIALALYRL+TKTDG +LFDF+GDKVLSYFLLAGA
Subjt:  MEAKMATKIASFVLRVLTFVFLLVSIIVLGTNSKTIGNDEVHFHNVNSYRYAMATIIIGGAFNLLQIALALYRLVTKTDGSILFDFYGDKVLSYFLLAGA

Query:  AAALGSSVDLKANMDMLNSFFDQGNAAAALLLLAFLCSAIISVLSSLALSNKP
        AAALGS++DLKAN+D LNSFFDQGNAAAALLLLAFLCSA++SVLSS ALS+KP
Subjt:  AAALGSSVDLKANMDMLNSFFDQGNAAAALLLLAFLCSAIISVLSSLALSNKP

SwissProt top hits | e-value | % identity
A1XGB4 CASP-like protein PIMP1 | 9.1e-22 | 42.11
Query:  SFVLRVLTFVFLLVSIIVLGTNSKTI----GNDEVHFHNVNSYRYAMATIIIGGAFNLLQIALALYRLVTKT----DGSILFDFYGDKVLSYFLLAGAAA
        S ++R+LT + LL+S IV+ TN++T+    G+ ++ F +  +YRY +AT+IIG A+ LLQIA ++  L T      +G +LFDFYGDK +SYFL+ GAAA
Subjt:  SFVLRVLTFVFLLVSIIVLGTNSKTI----GNDEVHFHNVNSYRYAMATIIIGGAFNLLQIALALYRLVTKT----DGSILFDFYGDKVLSYFLLAGAAA

Query:  ALGSSVDLK--ANMDMLNSFFDQGNAAAALLLLAFLCSAIISVLSSLALSNK
        + G + DLK     D  + F +  NAAA+L L+ F  +   S+ SS  L  +
Subjt:  ALGSSVDLK--ANMDMLNSFFDQGNAAAALLLLAFLCSAIISVLSSLALSNK

B9NBE5 CASP-like protein 4D1 | 1.4e-25 | 46.88
Query:  MATKIASFVLRVLTFVFLLVSIIVLGTNSKT--IGNDE--VHFHNVNSYRYAMATIIIGGAFNLLQIALALYRL------VTKTDGSILFDFYGDKVLSY
        M +++ +  LRVLTF FL+VS++++ TN+ T  IG DE  V   +  SYRY +A I  G  + +LQIAL L  +       T  DG+++FDFYGDKV+SY
Subjt:  MATKIASFVLRVLTFVFLLVSIIVLGTNSKT--IGNDE--VHFHNVNSYRYAMATIIIGGAFNLLQIALALYRL------VTKTDGSILFDFYGDKVLSY

Query:  FLLAGAAAALGSSVDLKANMDML--NSFFDQGNAAAALLLLAFLCSAIISVLSSLALSNK
         L  GAAAA G++ +LK  +  L  + FF++G A+A+LLLL F+C+AI+SV SS AL  K
Subjt:  FLLAGAAAALGSSVDLKANMDML--NSFFDQGNAAAALLLLAFLCSAIISVLSSLALSNK

B9SXY8 CASP-like protein 4D1 | 8.5e-28 | 44.59
Query:  MATKIASFVLRVLTFVFLLVSIIVLGTNSKTIGND----EVHFHNVNSYRYAMATIIIGGAFNLLQIALALYRLVT-----KTDGSILFDFYGDKVLSYF
        +A+++A+ +LR+LTF+FL+ S+++L TN+ T+  D    +VHF +V +YRY +ATI+IG A+ +LQIA  LY + T       DG++ FDF+GDKV+SY 
Subjt:  MATKIASFVLRVLTFVFLLVSIIVLGTNSKTIGND----EVHFHNVNSYRYAMATIIIGGAFNLLQIALALYRLVT-----KTDGSILFDFYGDKVLSYF

Query:  LLAGAAAALGSSVDLK---ANMDMLNSFFDQGNAAAALLLLAFLCSAIISVLSSLAL
        L+ GAAA   S+ D+K   +     ++F ++G A+A+LLL+ F+C+A++SV SS AL
Subjt:  LLAGAAAALGSSVDLK---ANMDMLNSFFDQGNAAAALLLLAFLCSAIISVLSSLAL

C6T1Z6 CASP-like protein 4D1 | 6.8e-25 | 45.58
Query:  VLRVLTFVFLLVSIIVLGTNSKTIGND--EVHFHNVNSYRYAMATIIIGGAFNLLQIALALYRLVT-----KTDGSILFDFYGDKVLSYFLLAGAAAALG
        +LRVLTFVFLL+++I++    +T      E+ F+++ +YRY ++TIIIG A+NLLQ+AL+++ +V+       DG  LFDF+GDK++SY L++G+AA  G
Subjt:  VLRVLTFVFLLVSIIVLGTNSKTIGND--EVHFHNVNSYRYAMATIIIGGAFNLLQIALALYRLVT-----KTDGSILFDFYGDKVLSYFLLAGAAAALG

Query:  SSVDLKANMDMLNSFFDQGNAAAALLLLAFLCSAIISVLSSLALSNK
         +V+L   +   NSF D+ NA+A+LLL+AFL +A+ S  +S AL  K
Subjt:  SSVDLKANMDMLNSFFDQGNAAAALLLLAFLCSAIISVLSSLALSNK

D7LD59 CASP-like protein 4D2 | 5.0e-20 | 37.42
Query:  MATKIASFVLRVLTFVFLLVSIIVLGTNSKTIGND----EVHFHNVNSYRYAMATIIIGGAFNLLQIALALYRLVT--KTDGSILFDFYGDKVLSYFLLA
        ++ K++  +LRVLT VFL++++I+L TNS TI +     + HF +V +YRY ++  +IG  + ++Q+   +    T  K   +   DFYGDK++SY +  
Subjt:  MATKIASFVLRVLTFVFLLVSIIVLGTNSKTIGND----EVHFHNVNSYRYAMATIIIGGAFNLLQIALALYRLVT--KTDGSILFDFYGDKVLSYFLLA

Query:  GAAAALGSSVDLK---------ANMDMLNSFFDQGNAAAALLLLAFLCSAIISVLSSLALSNK
        G+AA  G S DLK          + D ++ FF +G A+A+LLL +F+C A++SV SSLA++ +
Subjt:  GAAAALGSSVDLK---------ANMDMLNSFFDQGNAAAALLLLAFLCSAIISVLSSLALSNK

Arabidopsis top hits | e-value | % identity
AT2G39518.1 Uncharacterised protein family (UPF0497) | 1.4e-20 | 36.81
Query:  MATKIASFVLRVLTFVFLLVSIIVLGTNSKTIGND----EVHFHNVNSYRYAMATIIIGGAFNLLQIALALYRLVT--KTDGSILFDFYGDKVLSYFLLA
        ++ K+   +LRVLT VFL++++I+L TNS TI +     + HF +V +YRY ++  +IG  + ++Q+   +    T  K   +   DFYGDK++SY +  
Subjt:  MATKIASFVLRVLTFVFLLVSIIVLGTNSKTIGND----EVHFHNVNSYRYAMATIIIGGAFNLLQIALALYRLVT--KTDGSILFDFYGDKVLSYFLLA

Query:  GAAAALGSSVDLK---------ANMDMLNSFFDQGNAAAALLLLAFLCSAIISVLSSLALSNK
        G+AA  G + DLK          + D ++ FF +G A+A+LLL AF+C A++SV SS A++ +
Subjt:  GAAAALGSSVDLK---------ANMDMLNSFFDQGNAAAALLLLAFLCSAIISVLSSLALSNK

AT2G39530.1 Uncharacterised protein family (UPF0497) | 1.8e-20 | 40.38
Query:  VLRVLTFVFLLVSIIVLGTNSKTI----GNDEVHFHNVNSYRYAMATIIIGGAFNLLQIALALYRLVT-KTDG-SILFDFYGDKVLSYFLLAGAAAALGS
        +LRVLT  FLL++++++ TN+ T+     + ++ F++V +YRY ++  +IG  + ++Q+ L + +  T KT   +  FDFYGDKV+SY L  G+AA  G 
Subjt:  VLRVLTFVFLLVSIIVLGTNSKTI----GNDEVHFHNVNSYRYAMATIIIGGAFNLLQIALALYRLVT-KTDG-SILFDFYGDKVLSYFLLAGAAAALGS

Query:  SVDLK---------ANMDMLNSFFDQGNAAAALLLLAFLCSAIISVLSSLALSNKP
        S DLK          + D ++ FF +G A+A+LLL AF+  A++SV SSLALS +P
Subjt:  SVDLK---------ANMDMLNSFFDQGNAAAALLLLAFLCSAIISVLSSLALSNKP

AT4G25830.1 Uncharacterised protein family (UPF0497) | 8.8e-04 | 25.16
Query:  VLRVLTFVFLLVSIIVLGTNSKTIGNDEVHFHNVNSYRYAMA------TIIIGGAFNLLQIALALYRLVTKTDGSILFDFYGDKVLSYFLLAGAAAALGS
        +LR+    FLL++  ++G +S+T   +  + H   S+RY +A        ++  A+NL+Q+ L  Y +  KT     F +  D+  +Y + AG +AA   
Subjt:  VLRVLTFVFLLVSIIVLGTNSKTIGNDEVHFHNVNSYRYAMA------TIIIGGAFNLLQIALALYRLVTKTDGSILFDFYGDKVLSYFLLAGAAAALGS

Query:  SVDLKANMDMLNSF----------FDQGNAAAALLLLAFLCSAIISVLSSLALSN
        S+ +      L             F  G+A    ++L ++ +A++ +LSS++  N
Subjt:  SVDLKANMDMLNSF----------FDQGNAAAALLLLAFLCSAIISVLSSLALSN


Sequences
CDS sequence
ATGGAAGCAAAAATGGCTACAAAAATTGCTTCTTTTGTGCTTAGAGTTCTTACTTTTGTTTTTCTTTTAGTTTCCATCATAGTTCTGGGTACCAATTCCAAAACTATAGG
AAATGATGAAGTTCATTTCCATAATGTCAATTCCTATAGGTATGCAATGGCAACCATCATAATCGGAGGCGCATTCAATCTCTTGCAAATTGCCTTAGCCCTTTATCGTC
TTGTAACCAAGACCGATGGCAGTATTTTATTCGACTTCTATGGTGATAAGGTATTGTCGTACTTTTTGTTGGCGGGAGCAGCGGCGGCATTGGGTTCTAGCGTAGATTTG
AAGGCGAATATGGACATGTTGAACTCGTTTTTTGACCAAGGCAATGCTGCTGCAGCGCTTCTTCTTCTTGCTTTTTTATGTAGTGCTATCATTTCTGTACTTTCTTCTTT
AGCTCTTTCTAACAAACCCAATTAG
mRNA sequence
TTCTCTATTATTTTCTTTGTCGGAATTAAAACCAAAACAAAAGTTAGGCATAAATTCTAATCTTTACCAAAAGAAAAAAGAAAAAAAAAGTCTACTTTCTTTTGGGGCTA
TAAAAGCTAAACAAAAAATTGAGATCGAGAAGTAAAAGGTATATTAGATTCATTATCAAATATTAGAAAGAAAGAAAGAAAAAAATGGAAGCAAAAATGGCTACAAAAAT
TGCTTCTTTTGTGCTTAGAGTTCTTACTTTTGTTTTTCTTTTAGTTTCCATCATAGTTCTGGGTACCAATTCCAAAACTATAGGAAATGATGAAGTTCATTTCCATAATG
TCAATTCCTATAGGTATGCAATGGCAACCATCATAATCGGAGGCGCATTCAATCTCTTGCAAATTGCCTTAGCCCTTTATCGTCTTGTAACCAAGACCGATGGCAGTATT
TTATTCGACTTCTATGGTGATAAGGTATTGTCGTACTTTTTGTTGGCGGGAGCAGCGGCGGCATTGGGTTCTAGCGTAGATTTGAAGGCGAATATGGACATGTTGAACTC
GTTTTTTGACCAAGGCAATGCTGCTGCAGCGCTTCTTCTTCTTGCTTTTTTATGTAGTGCTATCATTTCTGTACTTTCTTCTTTAGCTCTTTCTAACAAACCCAATTAGT
TTCTTTTCTTAGGAAATTTATACAACAATCCTTTCTCTCTTTTTATGAAATTATTTGGTTGCGTCATTGTAATGTAGAATAATTAGGAGTTTGGAGTTTTGTATCAGTTT
TTATGCTCTTGGTTAGCAGTAGACTCTTTTTTTTGTCTGGTTAATTGATGTTTGAACATTATATACATAATAATTTTATTGTG
Protein sequence
MEAKMATKIASFVLRVLTFVFLLVSIIVLGTNSKTIGNDEVHFHNVNSYRYAMATIIIGGAFNLLQIALALYRLVTKTDGSILFDFYGDKVLSYFLLAGAAAALGSSVDL
KANMDMLNSFFDQGNAAAALLLLAFLCSAIISVLSSLALSNKPN
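
The relationship between the CDS and protein sequences above can be checked by translating codons with the standard genetic code. The following is a minimal sketch, assuming the standard nuclear code applies to this gene; `translate` is a hypothetical helper written for illustration, and only the first 30 and last 21 nucleotides of the CDS are used as inputs.

```python
# Standard genetic code, built from the conventional T, C, A, G codon ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 30 nt and last 21 nt of the CDS sequence listed in this record.
cds_start = "ATGGAAGCAAAAATGGCTACAAAAATTGCT"
cds_end = "CTTTCTAACAAACCCAATTAG"

print(translate(cds_start))  # MEAKMATKIA
print(translate(cds_end))    # LSNKPN
```

Both fragments agree with the corresponding ends of the protein sequence in this record. Pasting the full CDS into `translate` should give the complete protein if the same genetic code holds throughout.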