CuGenDBv2

Clc01G02400 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc01G02400
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: late embryogenesis abundant protein At1g64065
Genome location: ClcChr01:2102660..2103274
RNA-Seq Expression: Clc01G02400
Synteny: Clc01G02400
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
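
The genome location field above can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("ClcChr01:2102660..2103274")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 615 bp on ClcChr01.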


Homology
GenBank top hits (e value, % identity, alignment)
KAE8647845.1 hypothetical protein Csa_000600 [Cucumis sativus] (e value: 1.2e-79; identity: 80.39%)
Query:  MAADEISGVRPATLRSRRKASEKCLNTFCICLFAAAAVACIAALTLGLVVLRVKIPTVKLTSVAVKNLHYGFSPTPFMEATLMGEITMENPNFGEFKYEE
        MAAD+I G R A  RSRRK+ +KCLNTFCICLF AAA ACIAALT GLVVLRVK PTVKLTSVAVK+L YGFSPTPFMEATL GE+TMENPN+G F+YE 
Subjt:  MAADEISGVRPATLRSRRKASEKCLNTFCICLFAAAAVACIAALTLGLVVLRVKIPTVKLTSVAVKNLHYGFSPTPFMEATLMGEITMENPNFGEFKYEE

Query:  VRNVTLIYYGVAVGIGEVKRVDVNAKSIEKTKFIVKVKPNSSFVDVDYFSDDLARLKTMNMSCIAEFKGRIRLLKLFKEKKISVLKCSMSLNLTSHALQN
        V NVTLIYYGV VGIGEVKR+ VNAKSIEK +F VKVKPN  FV+VDYFSDDL RLKTMNMS  AEF+G+I LLKLFKEKKISVLKCS SLNLTSH +QN
Subjt:  VRNVTLIYYGVAVGIGEVKRVDVNAKSIEKTKFIVKVKPNSSFVDVDYFSDDLARLKTMNMSCIAEFKGRIRLLKLFKEKKISVLKCSMSLNLTSHALQN

Query:  LACL
        LACL
Subjt:  LACL

XP_008437351.1 PREDICTED: late embryogenesis abundant protein At1g64065 [Cucumis melo] (e value: 9.7e-85; identity: 83.82%)
Query:  MAADEISGVRPATLRSRRKASEKCLNTFCICLFAAAAVACIAALTLGLVVLRVKIPTVKLTSVAVKNLHYGFSPTPFMEATLMGEITMENPNFGEFKYEE
        MAAD+I G RP   RSRRK+S+KCLN FCICLFAAAA ACIAALT GLVVLRVK PTVKLTSVAVKNLHYGFSPTPFMEATL GEITMENPN+G F+YE 
Subjt:  MAADEISGVRPATLRSRRKASEKCLNTFCICLFAAAAVACIAALTLGLVVLRVKIPTVKLTSVAVKNLHYGFSPTPFMEATLMGEITMENPNFGEFKYEE

Query:  VRNVTLIYYGVAVGIGEVKRVDVNAKSIEKTKFIVKVKPNSSFVDVDYFSDDLARLKTMNMSCIAEFKGRIRLLKLFKEKKISVLKCSMSLNLTSHALQN
        VRNVTLIYYGV VGIGEVKR+ VNAKSIEK KFIVKVKPN  FV+VDYFSDDLARLKTMNMS  AEF+G+I LLKLFKEKKISV+KCS SLNLTSH +QN
Subjt:  VRNVTLIYYGVAVGIGEVKRVDVNAKSIEKTKFIVKVKPNSSFVDVDYFSDDLARLKTMNMSCIAEFKGRIRLLKLFKEKKISVLKCSMSLNLTSHALQN

Query:  LACL
        LACL
Subjt:  LACL

XP_022929131.1 late embryogenesis abundant protein At1g64065 [Cucurbita moschata] (e value: 1.3e-84; identity: 83.74%)
Query:  MAADEISGVRPATLRSRRKASEKCLNTFCICLFAAAAVACIAALTLGLVVLRVKIPTVKLTSVAVKNLHYGFSPTPFMEATLMGEITMENPNFGEFKYEE
        MAA E    RPATLRSRRKASEKC+NTFCI LFA AAVACIAAL LGLVV+RVK PTVKLTSV VKNLHYGFSPTPFM+ATL+ EITMENPNFGEFKYEE
Subjt:  MAADEISGVRPATLRSRRKASEKCLNTFCICLFAAAAVACIAALTLGLVVLRVKIPTVKLTSVAVKNLHYGFSPTPFMEATLMGEITMENPNFGEFKYEE

Query:  VRNVTLIYYGVAVGIGEVKRVDVNAKSIEKTKFIVKVKPNSSFVDVDYFSDDLARLKTMNMSCIAEFKGRIRLLKLFKEKKISVLKCSMSLNLTSHALQN
        V N TLIYYGVAVGIGEVK V VNAKS +K  F VKVKPNSSFVDVDYFS DLA LKTMNMSCIAEFKGR+RLLKLFKEKK+S+LKC+MSLNL SH +QN
Subjt:  VRNVTLIYYGVAVGIGEVKRVDVNAKSIEKTKFIVKVKPNSSFVDVDYFSDDLARLKTMNMSCIAEFKGRIRLLKLFKEKKISVLKCSMSLNLTSHALQN

Query:  LAC
        LAC
Subjt:  LAC

XP_022970100.1 late embryogenesis abundant protein At1g64065 [Cucurbita maxima] (e value: 2.2e-84; identity: 83.25%)
Query:  MAADEISGVRPATLRSRRKASEKCLNTFCICLFAAAAVACIAALTLGLVVLRVKIPTVKLTSVAVKNLHYGFSPTPFMEATLMGEITMENPNFGEFKYEE
        MAA E    RPATLRSRRKASEKC+NTFCI LFA AAVACIAAL LGLVV+RVK PTVKLTSV VKNLHYGFSPTPFM+ATL+ EITMENPNFGEFKYEE
Subjt:  MAADEISGVRPATLRSRRKASEKCLNTFCICLFAAAAVACIAALTLGLVVLRVKIPTVKLTSVAVKNLHYGFSPTPFMEATLMGEITMENPNFGEFKYEE

Query:  VRNVTLIYYGVAVGIGEVKRVDVNAKSIEKTKFIVKVKPNSSFVDVDYFSDDLARLKTMNMSCIAEFKGRIRLLKLFKEKKISVLKCSMSLNLTSHALQN
        V N TLIYYGVAVGIGEVK V VNAKS +   F VKVKPNSSFVDVDYFS DLA LKTMNMSCIAEFKGR+RLLKLFKEKK+S+LKC+MSLNL+SH +QN
Subjt:  VRNVTLIYYGVAVGIGEVKRVDVNAKSIEKTKFIVKVKPNSSFVDVDYFSDDLARLKTMNMSCIAEFKGRIRLLKLFKEKKISVLKCSMSLNLTSHALQN

Query:  LAC
        LAC
Subjt:  LAC

XP_023550526.1 late embryogenesis abundant protein At1g64065 [Cucurbita pepo subsp. pepo] (e value: 1.3e-84; identity: 83.25%)
Query:  MAADEISGVRPATLRSRRKASEKCLNTFCICLFAAAAVACIAALTLGLVVLRVKIPTVKLTSVAVKNLHYGFSPTPFMEATLMGEITMENPNFGEFKYEE
        MAA +    RPATLRSRRKASEKC+NTFCI LFA AAVACIAAL LGLVV+RVK PTVKLTSV VKNLHYGFSPTPFM+ATL+ EITMENPNFGEFKYEE
Subjt:  MAADEISGVRPATLRSRRKASEKCLNTFCICLFAAAAVACIAALTLGLVVLRVKIPTVKLTSVAVKNLHYGFSPTPFMEATLMGEITMENPNFGEFKYEE

Query:  VRNVTLIYYGVAVGIGEVKRVDVNAKSIEKTKFIVKVKPNSSFVDVDYFSDDLARLKTMNMSCIAEFKGRIRLLKLFKEKKISVLKCSMSLNLTSHALQN
        V N TLIYYGVAVGIGEVK V VNAKS +K  F VKVKPNSSFVDVDYFS DLA LKTMNMSCIAEFKGR+RLLKLFKEKK+S+LKC+MSLNL+SH +QN
Subjt:  VRNVTLIYYGVAVGIGEVKRVDVNAKSIEKTKFIVKVKPNSSFVDVDYFSDDLARLKTMNMSCIAEFKGRIRLLKLFKEKKISVLKCSMSLNLTSHALQN

Query:  LAC
        LAC
Subjt:  LAC

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KK54 Uncharacterized protein (e value: 5.9e-80; identity: 80.39%)
Query:  MAADEISGVRPATLRSRRKASEKCLNTFCICLFAAAAVACIAALTLGLVVLRVKIPTVKLTSVAVKNLHYGFSPTPFMEATLMGEITMENPNFGEFKYEE
        MAAD+I G R A  RSRRK+ +KCLNTFCICLF AAA ACIAALT GLVVLRVK PTVKLTSVAVK+L YGFSPTPFMEATL GE+TMENPN+G F+YE 
Subjt:  MAADEISGVRPATLRSRRKASEKCLNTFCICLFAAAAVACIAALTLGLVVLRVKIPTVKLTSVAVKNLHYGFSPTPFMEATLMGEITMENPNFGEFKYEE

Query:  VRNVTLIYYGVAVGIGEVKRVDVNAKSIEKTKFIVKVKPNSSFVDVDYFSDDLARLKTMNMSCIAEFKGRIRLLKLFKEKKISVLKCSMSLNLTSHALQN
        V NVTLIYYGV VGIGEVKR+ VNAKSIEK +F VKVKPN  FV+VDYFSDDL RLKTMNMS  AEF+G+I LLKLFKEKKISVLKCS SLNLTSH +QN
Subjt:  VRNVTLIYYGVAVGIGEVKRVDVNAKSIEKTKFIVKVKPNSSFVDVDYFSDDLARLKTMNMSCIAEFKGRIRLLKLFKEKKISVLKCSMSLNLTSHALQN

Query:  LACL
        LACL
Subjt:  LACL

A0A1S3AUE7 late embryogenesis abundant protein At1g64065 (e value: 4.7e-85; identity: 83.82%)
Query:  MAADEISGVRPATLRSRRKASEKCLNTFCICLFAAAAVACIAALTLGLVVLRVKIPTVKLTSVAVKNLHYGFSPTPFMEATLMGEITMENPNFGEFKYEE
        MAAD+I G RP   RSRRK+S+KCLN FCICLFAAAA ACIAALT GLVVLRVK PTVKLTSVAVKNLHYGFSPTPFMEATL GEITMENPN+G F+YE 
Subjt:  MAADEISGVRPATLRSRRKASEKCLNTFCICLFAAAAVACIAALTLGLVVLRVKIPTVKLTSVAVKNLHYGFSPTPFMEATLMGEITMENPNFGEFKYEE

Query:  VRNVTLIYYGVAVGIGEVKRVDVNAKSIEKTKFIVKVKPNSSFVDVDYFSDDLARLKTMNMSCIAEFKGRIRLLKLFKEKKISVLKCSMSLNLTSHALQN
        VRNVTLIYYGV VGIGEVKR+ VNAKSIEK KFIVKVKPN  FV+VDYFSDDLARLKTMNMS  AEF+G+I LLKLFKEKKISV+KCS SLNLTSH +QN
Subjt:  VRNVTLIYYGVAVGIGEVKRVDVNAKSIEKTKFIVKVKPNSSFVDVDYFSDDLARLKTMNMSCIAEFKGRIRLLKLFKEKKISVLKCSMSLNLTSHALQN

Query:  LACL
        LACL
Subjt:  LACL

A0A5A7TMT1 Late embryogenesis abundant protein (e value: 4.7e-85; identity: 83.82%)
Query:  MAADEISGVRPATLRSRRKASEKCLNTFCICLFAAAAVACIAALTLGLVVLRVKIPTVKLTSVAVKNLHYGFSPTPFMEATLMGEITMENPNFGEFKYEE
        MAAD+I G RP   RSRRK+S+KCLN FCICLFAAAA ACIAALT GLVVLRVK PTVKLTSVAVKNLHYGFSPTPFMEATL GEITMENPN+G F+YE 
Subjt:  MAADEISGVRPATLRSRRKASEKCLNTFCICLFAAAAVACIAALTLGLVVLRVKIPTVKLTSVAVKNLHYGFSPTPFMEATLMGEITMENPNFGEFKYEE

Query:  VRNVTLIYYGVAVGIGEVKRVDVNAKSIEKTKFIVKVKPNSSFVDVDYFSDDLARLKTMNMSCIAEFKGRIRLLKLFKEKKISVLKCSMSLNLTSHALQN
        VRNVTLIYYGV VGIGEVKR+ VNAKSIEK KFIVKVKPN  FV+VDYFSDDLARLKTMNMS  AEF+G+I LLKLFKEKKISV+KCS SLNLTSH +QN
Subjt:  VRNVTLIYYGVAVGIGEVKRVDVNAKSIEKTKFIVKVKPNSSFVDVDYFSDDLARLKTMNMSCIAEFKGRIRLLKLFKEKKISVLKCSMSLNLTSHALQN

Query:  LACL
        LACL
Subjt:  LACL

A0A6J1EM85 late embryogenesis abundant protein At1g64065 (e value: 6.1e-85; identity: 83.74%)
Query:  MAADEISGVRPATLRSRRKASEKCLNTFCICLFAAAAVACIAALTLGLVVLRVKIPTVKLTSVAVKNLHYGFSPTPFMEATLMGEITMENPNFGEFKYEE
        MAA E    RPATLRSRRKASEKC+NTFCI LFA AAVACIAAL LGLVV+RVK PTVKLTSV VKNLHYGFSPTPFM+ATL+ EITMENPNFGEFKYEE
Subjt:  MAADEISGVRPATLRSRRKASEKCLNTFCICLFAAAAVACIAALTLGLVVLRVKIPTVKLTSVAVKNLHYGFSPTPFMEATLMGEITMENPNFGEFKYEE

Query:  VRNVTLIYYGVAVGIGEVKRVDVNAKSIEKTKFIVKVKPNSSFVDVDYFSDDLARLKTMNMSCIAEFKGRIRLLKLFKEKKISVLKCSMSLNLTSHALQN
        V N TLIYYGVAVGIGEVK V VNAKS +K  F VKVKPNSSFVDVDYFS DLA LKTMNMSCIAEFKGR+RLLKLFKEKK+S+LKC+MSLNL SH +QN
Subjt:  VRNVTLIYYGVAVGIGEVKRVDVNAKSIEKTKFIVKVKPNSSFVDVDYFSDDLARLKTMNMSCIAEFKGRIRLLKLFKEKKISVLKCSMSLNLTSHALQN

Query:  LAC
        LAC
Subjt:  LAC

A0A6J1I1W2 late embryogenesis abundant protein At1g64065 (e value: 1.0e-84; identity: 83.25%)
Query:  MAADEISGVRPATLRSRRKASEKCLNTFCICLFAAAAVACIAALTLGLVVLRVKIPTVKLTSVAVKNLHYGFSPTPFMEATLMGEITMENPNFGEFKYEE
        MAA E    RPATLRSRRKASEKC+NTFCI LFA AAVACIAAL LGLVV+RVK PTVKLTSV VKNLHYGFSPTPFM+ATL+ EITMENPNFGEFKYEE
Subjt:  MAADEISGVRPATLRSRRKASEKCLNTFCICLFAAAAVACIAALTLGLVVLRVKIPTVKLTSVAVKNLHYGFSPTPFMEATLMGEITMENPNFGEFKYEE

Query:  VRNVTLIYYGVAVGIGEVKRVDVNAKSIEKTKFIVKVKPNSSFVDVDYFSDDLARLKTMNMSCIAEFKGRIRLLKLFKEKKISVLKCSMSLNLTSHALQN
        V N TLIYYGVAVGIGEVK V VNAKS +   F VKVKPNSSFVDVDYFS DLA LKTMNMSCIAEFKGR+RLLKLFKEKK+S+LKC+MSLNL+SH +QN
Subjt:  VRNVTLIYYGVAVGIGEVKRVDVNAKSIEKTKFIVKVKPNSSFVDVDYFSDDLARLKTMNMSCIAEFKGRIRLLKLFKEKKISVLKCSMSLNLTSHALQN

Query:  LAC
        LAC
Subjt:  LAC

SwissProt top hits (e value, % identity, alignment)
Q6DST1 Late embryogenesis abundant protein At1g64065 (e value: 6.7e-12; identity: 32.31%)
Query:  RRKASE---KCLNTFCICLFAAAAVACIAALTLGLVVLRVKIPTVKLTSVAVKNLHYGFSPT-PFMEATLMGEITMENPNFGEFKYEEVRNVTLIYYGVA
        RRK  E   KCL  + + +       C   L L  + LR+  P ++  S++ ++L  G + T P+  ATL+ +I++ N NFG F++E+   + ++Y    
Subjt:  RRKASE---KCLNTFCICLFAAAAVACIAALTLGLVVLRVKIPTVKLTSVAVKNLHYGFSPT-PFMEATLMGEITMENPNFGEFKYEEVRNVTLIYYGVA

Query:  VGIGEVK----RVDVNAKSIEKTKFIVKVKPNSSFVDVDYFSDDLARLKTMNMSCIAEFKGRIRLLKLFKEKKISVLKCSMSLNLTSHALQNLAC
        V +GE K    RV+ + K++  T  +V++  +   +D      DL RL  + +  +AE +GRI++L   K  K+SV+ C+M LNLT   +QNL C
Subjt:  VGIGEVK----RVDVNAKSIEKTKFIVKVKPNSSFVDVDYFSDDLARLKTMNMSCIAEFKGRIRLLKLFKEKKISVLKCSMSLNLTSHALQNLAC

Arabidopsis top hits (e value, % identity, alignment)
AT1G64065.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value: 4.7e-13; identity: 32.31%)
Query:  RRKASE---KCLNTFCICLFAAAAVACIAALTLGLVVLRVKIPTVKLTSVAVKNLHYGFSPT-PFMEATLMGEITMENPNFGEFKYEEVRNVTLIYYGVA
        RRK  E   KCL  + + +       C   L L  + LR+  P ++  S++ ++L  G + T P+  ATL+ +I++ N NFG F++E+   + ++Y    
Subjt:  RRKASE---KCLNTFCICLFAAAAVACIAALTLGLVVLRVKIPTVKLTSVAVKNLHYGFSPT-PFMEATLMGEITMENPNFGEFKYEEVRNVTLIYYGVA

Query:  VGIGEVK----RVDVNAKSIEKTKFIVKVKPNSSFVDVDYFSDDLARLKTMNMSCIAEFKGRIRLLKLFKEKKISVLKCSMSLNLTSHALQNLAC
        V +GE K    RV+ + K++  T  +V++  +   +D      DL RL  + +  +AE +GRI++L   K  K+SV+ C+M LNLT   +QNL C
Subjt:  VGIGEVK----RVDVNAKSIEKTKFIVKVKPNSSFVDVDYFSDDLARLKTMNMSCIAEFKGRIRLLKLFKEKKISVLKCSMSLNLTSHALQNLAC

AT2G44000.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value: 3.9e-07; identity: 29.8%)
Query:  TVKLTSVAVKNLHY----GFSPTPFMEATLMGEITMENPNFGEFKYEEVRNVTLIYYGVAVGIGEVKRVDVNAKSIEKTKFIVKVKPNSSFVDVDYFSDD
        T  LTSV V+NL Y      S + +  ATL  EI +ENPN G F++   R   ++Y G  VG   +    V +    +T+   +V    +     +  +D
Subjt:  TVKLTSVAVKNLHY----GFSPTPFMEATLMGEITMENPNFGEFKYEEVRNVTLIYYGVAVGIGEVKRVDVNAKSIEKTKFIVKVKPNSSFVDVDYFSDD

Query:  LARLKTMNMSCIAEFKGRIRLLKLFKEKKISVLKCSMSLNLTSHALQNLAC
        + R + + +   A+ +G + L  L   K+   LKC M LNL+   +  L C
Subjt:  LARLKTMNMSCIAEFKGRIRLLKLFKEKKISVLKCSMSLNLTSHALQNLAC

AT2G46150.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (e value: 4.3e-06; identity: 25.36%)
Query:  ADEISGVRPATLRSRRKASEKCLNTFCICLFAAAAVACIAALTLGLVVLRVKIPTVKLTSVAVKNLHYGFSPTPFMEATLMG-------EITMENPNFGE
        +DE +     T RSR +   KC     IC+ A + +     LTL   V RVK P +K+  V V  L    S T   +  L+G       +++++NPN   
Subjt:  ADEISGVRPATLRSRRKASEKCLNTFCICLFAAAAVACIAALTLGLVVLRVKIPTVKLTSVAVKNLHYGFSPTPFMEATLMG-------EITMENPNFGE

Query:  FKYEEVRNVTLIYYGVAVGIGEVKRVDVNAKSIEKTKFIVKVKPNSSFVDVD-YFSDDLARLKTMNMSCIAEFKGRIRLLKLFKEKKISVLKCSMSLNLT
        FKY    N T   Y     +GE   +   A+    ++  V V      +  D     +++R   +N+       G+++++ + K+     + C+M++N+T
Subjt:  FKYEEVRNVTLIYYGVAVGIGEVKRVDVNAKSIEKTKFIVKVKPNSSFVDVD-YFSDDLARLKTMNMSCIAEFKGRIRLLKLFKEKKISVLKCSMSLNLT

Query:  SHALQNLAC
          A+Q++ C
Subjt:  SHALQNLAC


Sequences
CDS sequence
ATGGCGGCCGACGAAATTTCCGGCGTACGGCCGGCGACCCTCCGATCAAGGCGTAAAGCATCAGAAAAATGCCTAAACACCTTCTGCATTTGCCTCTTCGCCGCTGCCGC
CGTCGCCTGCATCGCGGCTCTAACCCTCGGCCTGGTCGTCCTCCGGGTAAAAATCCCGACCGTAAAATTAACCTCCGTCGCGGTAAAAAATCTCCACTACGGCTTCTCGC
CGACCCCTTTCATGGAGGCCACATTAATGGGGGAAATAACGATGGAAAATCCGAATTTCGGGGAGTTCAAGTACGAGGAAGTAAGGAACGTGACATTGATTTACTACGGC
GTGGCAGTGGGAATCGGCGAGGTGAAAAGAGTGGATGTAAATGCGAAGAGTATTGAAAAAACGAAGTTTATAGTGAAAGTGAAACCCAATTCGAGCTTTGTTGATGTGGA
TTATTTCAGCGATGATTTGGCGAGGTTGAAGACGATGAATATGAGTTGCATCGCCGAGTTTAAGGGAAGGATTCGTTTGTTGAAACTGTTTAAAGAGAAGAAAATTTCTG
TGTTGAAATGTAGTATGAGCTTGAATTTGACCTCCCATGCCCTCCAAAATCTTGCTTGCCTATAG
mRNA sequence
ATGGCGGCCGACGAAATTTCCGGCGTACGGCCGGCGACCCTCCGATCAAGGCGTAAAGCATCAGAAAAATGCCTAAACACCTTCTGCATTTGCCTCTTCGCCGCTGCCGC
CGTCGCCTGCATCGCGGCTCTAACCCTCGGCCTGGTCGTCCTCCGGGTAAAAATCCCGACCGTAAAATTAACCTCCGTCGCGGTAAAAAATCTCCACTACGGCTTCTCGC
CGACCCCTTTCATGGAGGCCACATTAATGGGGGAAATAACGATGGAAAATCCGAATTTCGGGGAGTTCAAGTACGAGGAAGTAAGGAACGTGACATTGATTTACTACGGC
GTGGCAGTGGGAATCGGCGAGGTGAAAAGAGTGGATGTAAATGCGAAGAGTATTGAAAAAACGAAGTTTATAGTGAAAGTGAAACCCAATTCGAGCTTTGTTGATGTGGA
TTATTTCAGCGATGATTTGGCGAGGTTGAAGACGATGAATATGAGTTGCATCGCCGAGTTTAAGGGAAGGATTCGTTTGTTGAAACTGTTTAAAGAGAAGAAAATTTCTG
TGTTGAAATGTAGTATGAGCTTGAATTTGACCTCCCATGCCCTCCAAAATCTTGCTTGCCTATAG
Protein sequence
MAADEISGVRPATLRSRRKASEKCLNTFCICLFAAAAVACIAALTLGLVVLRVKIPTVKLTSVAVKNLHYGFSPTPFMEATLMGEITMENPNFGEFKYEEVRNVTLIYYG
VAVGIGEVKRVDVNAKSIEKTKFIVKVKPNSSFVDVDYFSDDLARLKTMNMSCIAEFKGRIRLLKLFKEKKISVLKCSMSLNLTSHALQNLACL
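
The relationship between the CDS and the protein sequence listed above can be checked by translation. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the record does not say which table was used, and the `translate` helper is hypothetical. Only the first 21 and the last 15 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), ASSUMED to apply to this nuclear gene.
# Codons are enumerated in TCAG order, with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any incomplete trailing codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 and last 15 nucleotides of the CDS sequence in this record.
print(translate("ATGGCGGCCGACGAAATTTCC"))  # start of the protein
print(translate("CTTGCTTGCCTATAG"))        # end of the protein, before the TAG stop
```

The two fragments translate to MAADEIS and LACL, which agree with the start and end of the protein sequence in the record.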