; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC02G044560 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC02G044560
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionLate embryogenesis abundant protein
Genome locationCiama_Chr02:32422086..32422658
RNA-Seq ExpressionCaUC02G044560
SyntenyCaUC02G044560
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033488.1 putative Harpin-induced 1 [Cucumis melo var. makuwa]6.9e-6975.26Show/hide
Query:  MAAPSTKLLRNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLLDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAP
        MAAP++KLLRN CI ++L LIL V+L+L+LAFTVFKP+RPII VDSVSLLDLNV+L  GVDLNLS+ VDL+VENPNKVAFEYS+STAVV YRGE+VGEAP
Subjt:  MAAPSTKLLRNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLLDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAP

Query:  IPAGRLSAEGAEKMNLTLTMMADRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL
        IP GRL  +G +KMNLTLT+M +RML +SEVFSDVVSGQL IST ARL+GKVKV+GV KIHVVAS+SCDL ID+ NGS GDQ CQ+RT++
Subjt:  IPAGRLSAEGAEKMNLTLTMMADRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL

XP_008465308.1 PREDICTED: uncharacterized protein LOC103502964 [Cucumis melo]6.9e-6975.26Show/hide
Query:  MAAPSTKLLRNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLLDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAP
        MAAP++KLLRN CI ++L LIL V+L+L+LAFTVFKP+RPII VDSVSLLDLNV+L  GVDLNLS+ VDL+VENPNKVAFEYS+STAVV YRGE+VGEAP
Subjt:  MAAPSTKLLRNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLLDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAP

Query:  IPAGRLSAEGAEKMNLTLTMMADRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL
        IP GRL  +G +KMNLTLT+M +RML +SEVFSDVVSGQL IST ARL+GKVKV+GV KIHVVAS+SCDL ID+ NGS GDQ CQ+RT++
Subjt:  IPAGRLSAEGAEKMNLTLTMMADRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL

XP_011658424.1 uncharacterized protein LOC105435999 [Cucumis sativus]3.2e-6675.13Show/hide
Query:  AAPSTKLLRNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLLDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPI
        AAP++KLL NIC+ + L LIL ++  LILAFTVFKPK+PII VDSVSLLDLNVS+  GV L+LSL VDL+V+NPNKV FEYS+STAVV YRGE+VGEAPI
Subjt:  AAPSTKLLRNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLLDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPI

Query:  PAGRLSAEGAEKMNLTLTMMADRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL
        P GRL  +G EKMNLTLT+M DRML KSEVFSDVVSGQLPISTFARL GKVKV+ V KIHVVAS+SCDL ID+ N S GDQ CQYRT L
Subjt:  PAGRLSAEGAEKMNLTLTMMADRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL

XP_023007254.1 uncharacterized protein LOC111499794 [Cucurbita maxima]1.2e-6575.65Show/hide
Query:  MAAPSTKLLRNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLLDLNVSLVS---GVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVG
        MAA + K  RNICIA+LL LI+ VILILILAFTVFKPK+P I VDSVSLLDLN+SL +   GVDLNL+L+V L+VENPNKVAF++S  TAVVSYRGEEV 
Subjt:  MAAPSTKLLRNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLLDLNVSLVS---GVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVG

Query:  EAPIPAGRLSAEGAEKMNLTLTMMADRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL
        EAPIP+GRLS +G EKMNLTLTMMADR+LAKSE+FSDV++G+LPISTFARL+GK+ VIGVFKI VVA SSCDLTIDI N S+ DQ+C+YRTKL
Subjt:  EAPIPAGRLSAEGAEKMNLTLTMMADRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL

XP_023534551.1 uncharacterized protein LOC111796093 [Cucurbita pepo subsp. pepo]6.1e-6575.13Show/hide
Query:  MAAPSTKLLRNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLLDLNVSLVS---GVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVG
        MAA + K  RNICIA+LL LIL VI ILILAFTVFKPK+P I VDS+SLLDLN+SL +   GVDLNL+L+V L++ENPNKVAF++S  TAVVSYRGEEV 
Subjt:  MAAPSTKLLRNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLLDLNVSLVS---GVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVG

Query:  EAPIPAGRLSAEGAEKMNLTLTMMADRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL
        EAPIP+GRLSA+G EKMNLTLTMMADRMLAKSE+FSDV++G+LPISTFARL+GKV VIGVFKI VVA SSCDLTI+I N ++ DQ+C+YRTKL
Subjt:  EAPIPAGRLSAEGAEKMNLTLTMMADRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL

TrEMBL top hitse value%identityAlignment
A0A0A0KBX8 LEA_2 domain-containing protein1.6e-6675.13Show/hide
Query:  AAPSTKLLRNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLLDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPI
        AAP++KLL NIC+ + L LIL ++  LILAFTVFKPK+PII VDSVSLLDLNVS+  GV L+LSL VDL+V+NPNKV FEYS+STAVV YRGE+VGEAPI
Subjt:  AAPSTKLLRNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLLDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPI

Query:  PAGRLSAEGAEKMNLTLTMMADRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL
        P GRL  +G EKMNLTLT+M DRML KSEVFSDVVSGQLPISTFARL GKVKV+ V KIHVVAS+SCDL ID+ N S GDQ CQYRT L
Subjt:  PAGRLSAEGAEKMNLTLTMMADRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL

A0A1S3CNL0 uncharacterized protein LOC1035029643.4e-6975.26Show/hide
Query:  MAAPSTKLLRNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLLDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAP
        MAAP++KLLRN CI ++L LIL V+L+L+LAFTVFKP+RPII VDSVSLLDLNV+L  GVDLNLS+ VDL+VENPNKVAFEYS+STAVV YRGE+VGEAP
Subjt:  MAAPSTKLLRNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLLDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAP

Query:  IPAGRLSAEGAEKMNLTLTMMADRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL
        IP GRL  +G +KMNLTLT+M +RML +SEVFSDVVSGQL IST ARL+GKVKV+GV KIHVVAS+SCDL ID+ NGS GDQ CQ+RT++
Subjt:  IPAGRLSAEGAEKMNLTLTMMADRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL

A0A5A7SSE6 Putative Harpin-induced 13.4e-6975.26Show/hide
Query:  MAAPSTKLLRNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLLDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAP
        MAAP++KLLRN CI ++L LIL V+L+L+LAFTVFKP+RPII VDSVSLLDLNV+L  GVDLNLS+ VDL+VENPNKVAFEYS+STAVV YRGE+VGEAP
Subjt:  MAAPSTKLLRNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLLDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAP

Query:  IPAGRLSAEGAEKMNLTLTMMADRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL
        IP GRL  +G +KMNLTLT+M +RML +SEVFSDVVSGQL IST ARL+GKVKV+GV KIHVVAS+SCDL ID+ NGS GDQ CQ+RT++
Subjt:  IPAGRLSAEGAEKMNLTLTMMADRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL

A0A6J1G8C1 uncharacterized protein LOC1114518006.1e-6374.09Show/hide
Query:  MAAPSTKLLRNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLLDLNVSLVSG---VDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVG
        MAA + K  RNICIA+LL LIL VI ILILAFTVFKPK+P I VDS+SLLDLN+SL +    VDLNL+L+V L+VENPNKVAF++S  TAVVSYRGEEV 
Subjt:  MAAPSTKLLRNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLLDLNVSLVSG---VDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVG

Query:  EAPIPAGRLSAEGAEKMNLTLTMMADRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL
        EAPIP+GRLSA+G EKMNLTLTMMADR+LAKSE+ SDV++G+LPISTFARL GKV VIGVFKI VVA SSCDLTIDI   ++ DQ+C+YRTKL
Subjt:  EAPIPAGRLSAEGAEKMNLTLTMMADRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL

A0A6J1L4F9 uncharacterized protein LOC1114997945.9e-6675.65Show/hide
Query:  MAAPSTKLLRNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLLDLNVSLVS---GVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVG
        MAA + K  RNICIA+LL LI+ VILILILAFTVFKPK+P I VDSVSLLDLN+SL +   GVDLNL+L+V L+VENPNKVAF++S  TAVVSYRGEEV 
Subjt:  MAAPSTKLLRNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLLDLNVSLVS---GVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVG

Query:  EAPIPAGRLSAEGAEKMNLTLTMMADRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL
        EAPIP+GRLS +G EKMNLTLTMMADR+LAKSE+FSDV++G+LPISTFARL+GK+ VIGVFKI VVA SSCDLTIDI N S+ DQ+C+YRTKL
Subjt:  EAPIPAGRLSAEGAEKMNLTLTMMADRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G64450.1 Glycine-rich protein family1.1e-0633.33Show/hide
Query:  CIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLLDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPIPAGRLSAEGAE
        C    + L++ ++++L++ FTVFKPK P I+V++V L       VS    N S    ++V NPN+  F +  S+  + Y G +VG   IPAG++ +   +
Subjt:  CIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLLDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPIPAGRLSAEGAE

Query:  KMNLTLTM
         M  T T+
Subjt:  KMNLTLTM

AT1G64450.1 Glycine-rich protein family1.9e-0337.74Show/hide
Query:  FSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQC
        + + V   + I +   L+G+VKV+ VF  HVVA S C +T+ I +GS+    C
Subjt:  FSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQC

AT2G46150.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family1.3e-2032.97Show/hide
Query:  NICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLLDLN----VSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPIPAGRL
        +IC+     ++ T++L L+  FTVF+ K PII ++ V +  L+     + V  +  N+S++VD+SV+NPN  +F+YS +T  + Y+G  VGEA    G+ 
Subjt:  NICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLLDLN----VSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPIPAGRL

Query:  SAEGAEKMNLTLTMMADRMLAKSEVFSDVV-SGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL
              +MN+T+ +M DR+L+   +  ++  SG + + ++ R+ GKVK++G+ K HV    +C + ++IT  +I D  C+ +  L
Subjt:  SAEGAEKMNLTLTMMADRMLAKSEVFSDVV-SGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL

AT3G05975.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family5.9e-1830.11Show/hide
Query:  RNICIAI--LLCLILTVILILILAFTVFKPKRPIIAV--DSVSLLDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPIPAGR
        R IC  +  ++ ++  + +  ++   VFKPK PI+     +V  +  N+SL   V LN +L +++ ++NPN   FEY     +V YR   VG   +P+  
Subjt:  RNICIAI--LLCLILTVILILILAFTVFKPKRPIIAV--DSVSLLDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPIPAGR

Query:  LSAEGAEKMNLTLTMMADRMLAK-SEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL
        L A+G+  +   L +  D+ +A   ++  DV+ G++ + T A++ GK+ ++G+FKI + + S C+L +   +  + DQ C  +TKL
Subjt:  LSAEGAEKMNLTLTMMADRMLAK-SEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL

AT3G54200.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family4.5e-4248.35Show/hide
Query:  ICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLLDLNVS---LVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPIPAGRLSA
        IC  ILL L++ ++ I+ILAFT+FKPKRP   +DSV++  L  S   L+  V LNL+L VDLS++NPN++ F Y  S+A+++YRG+ +GEAP+PA R++A
Subjt:  ICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLLDLNVS---LVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPIPAGRLSA

Query:  EGAEKMNLTLTMMADRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL
             +N+TLT+MADR+L+++++ SDV++G +P++TF +++GKV V+ +FKI V +SSSCDL+I +++ ++  Q C+Y TKL
Subjt:  EGAEKMNLTLTMMADRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL

AT4G23930.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family8.6e-0925.56Show/hide
Query:  CIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLLDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPIPAGRLSAEGAE
        C    L ++  +I  L +  TVF+P+ P I+V SV +   +V+  S   ++ +     +V NPN+ AF +  +   + Y G  +G   +PAG + +   +
Subjt:  CIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLLDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPIPAGRLSAEGAE

Query:  KMNLTLTMMADRMLAKSE--------VFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQC
        +M  T ++ +  + A S           SD     + I +   ++G+V+V+G+F   + A  +C + I  ++GSI   +C
Subjt:  KMNLTLTMMADRMLAKSE--------VFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGCTCCCAGTACAAAATTACTGCGAAACATATGCATAGCCATATTGCTTTGTCTAATTCTTACCGTAATTTTGATCCTCATTTTAGCCTTTACTGTTTTCAAGCC
CAAGCGGCCTATCATCGCCGTCGATTCAGTTTCTCTACTCGATCTGAACGTTTCTCTGGTTAGCGGCGTCGATCTGAACCTATCTCTCATGGTGGATCTATCCGTCGAGA
ATCCGAATAAGGTCGCCTTTGAATACTCTCAAAGCACCGCCGTTGTGAGTTACAGAGGCGAAGAAGTCGGAGAAGCGCCGATTCCAGCTGGCCGATTATCAGCCGAAGGG
GCTGAGAAAATGAACCTAACGCTGACGATGATGGCGGACCGGATGCTGGCGAAGTCGGAGGTGTTTTCCGACGTGGTTTCCGGTCAACTTCCGATCAGTACTTTCGCTCG
GTTGTCCGGGAAAGTGAAGGTGATCGGTGTTTTCAAGATTCATGTTGTGGCGTCGTCGTCTTGTGATCTCACCATTGATATTACAAATGGAAGCATTGGAGATCAGCAAT
GCCAATACCGGACGAAGCTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGCTCCCAGTACAAAATTACTGCGAAACATATGCATAGCCATATTGCTTTGTCTAATTCTTACCGTAATTTTGATCCTCATTTTAGCCTTTACTGTTTTCAAGCC
CAAGCGGCCTATCATCGCCGTCGATTCAGTTTCTCTACTCGATCTGAACGTTTCTCTGGTTAGCGGCGTCGATCTGAACCTATCTCTCATGGTGGATCTATCCGTCGAGA
ATCCGAATAAGGTCGCCTTTGAATACTCTCAAAGCACCGCCGTTGTGAGTTACAGAGGCGAAGAAGTCGGAGAAGCGCCGATTCCAGCTGGCCGATTATCAGCCGAAGGG
GCTGAGAAAATGAACCTAACGCTGACGATGATGGCGGACCGGATGCTGGCGAAGTCGGAGGTGTTTTCCGACGTGGTTTCCGGTCAACTTCCGATCAGTACTTTCGCTCG
GTTGTCCGGGAAAGTGAAGGTGATCGGTGTTTTCAAGATTCATGTTGTGGCGTCGTCGTCTTGTGATCTCACCATTGATATTACAAATGGAAGCATTGGAGATCAGCAAT
GCCAATACCGGACGAAGCTCTGA
Protein sequenceShow/hide protein sequence
MAAPSTKLLRNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLLDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPIPAGRLSAEG
AEKMNLTLTMMADRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL