; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC02G036600 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC02G036600
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionLate embryogenesis abundant protein
Genome locationCicolChr02:32380484..32381056
RNA-Seq ExpressionCcUC02G036600
SyntenyCcUC02G036600
Gene Ontology termsGO:0009269 - response to desiccation (biological process)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004864 - Late embryogenesis abundant protein, LEA_2 subgroup
IPR013990 - Water stress and hypersensitive response domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033488.1 putative Harpin-induced 1 [Cucumis melo var. makuwa]1.2e-6875.26Show/hide
Query:  MAAPCTKLLRNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLIDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAP
        MAAP +KLLRN CI ++L LIL V+L+L+LAFTVFKP+RPII VDSVSL+DLNV+L  GVDLNLS+ VDL+VENPNKVAFEYS+STAVV YRGE+VGEAP
Subjt:  MAAPCTKLLRNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLIDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAP

Query:  IPAGRLSAEGTEKMNLTLTMMANRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL
        IP GRL  +GT+KMNLTLT+M  RML +SEVFSDVVSGQL IST ARL+GKVKV+GV KIHVVAS+SCDL ID+ NGS GDQ CQ+RT++
Subjt:  IPAGRLSAEGTEKMNLTLTMMANRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL

XP_008465308.1 PREDICTED: uncharacterized protein LOC103502964 [Cucumis melo]1.2e-6875.26Show/hide
Query:  MAAPCTKLLRNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLIDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAP
        MAAP +KLLRN CI ++L LIL V+L+L+LAFTVFKP+RPII VDSVSL+DLNV+L  GVDLNLS+ VDL+VENPNKVAFEYS+STAVV YRGE+VGEAP
Subjt:  MAAPCTKLLRNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLIDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAP

Query:  IPAGRLSAEGTEKMNLTLTMMANRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL
        IP GRL  +GT+KMNLTLT+M  RML +SEVFSDVVSGQL IST ARL+GKVKV+GV KIHVVAS+SCDL ID+ NGS GDQ CQ+RT++
Subjt:  IPAGRLSAEGTEKMNLTLTMMANRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL

XP_011658424.1 uncharacterized protein LOC105435999 [Cucumis sativus]1.6e-6574.6Show/hide
Query:  AAPCTKLLRNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLIDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPI
        AAP +KLL NIC+ + L LIL ++  LILAFTVFKPK+PII VDSVSL+DLNVS+  GV L+LSL VDL+V+NPNKV FEYS+STAVV YRGE+VGEAPI
Subjt:  AAPCTKLLRNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLIDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPI

Query:  PAGRLSAEGTEKMNLTLTMMANRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL
        P GRL  +GTEKMNLTLT+M +RML KSEVFSDVVSGQLPISTFARL GKVKV+ V KIHVVAS+SCDL ID+ N S GDQ CQYRT L
Subjt:  PAGRLSAEGTEKMNLTLTMMANRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL

XP_023007254.1 uncharacterized protein LOC111499794 [Cucurbita maxima]2.7e-6576.63Show/hide
Query:  RNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLIDLNVSLVS---GVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPIPAGRL
        RNICIA+LL LI+ VILILILAFTVFKPK+P I VDSVSL+DLN+SL +   GVDLNL+L+V L+VENPNKVAF++S  TAVVSYRGEEV EAPIP+GRL
Subjt:  RNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLIDLNVSLVS---GVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPIPAGRL

Query:  SAEGTEKMNLTLTMMANRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL
        S +GTEKMNLTLTMMA+R+LAKSE+FSDV++G+LPISTFARL+GK+ VIGVFKI VVA SSCDLTIDI N S+ DQ+C+YRTKL
Subjt:  SAEGTEKMNLTLTMMANRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL

XP_023534551.1 uncharacterized protein LOC111796093 [Cucurbita pepo subsp. pepo]1.4e-6476.09Show/hide
Query:  RNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLIDLNVSLVS---GVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPIPAGRL
        RNICIA+LL LIL VI ILILAFTVFKPK+P I VDS+SL+DLN+SL +   GVDLNL+L+V L++ENPNKVAF++S  TAVVSYRGEEV EAPIP+GRL
Subjt:  RNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLIDLNVSLVS---GVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPIPAGRL

Query:  SAEGTEKMNLTLTMMANRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL
        SA+GTEKMNLTLTMMA+RMLAKSE+FSDV++G+LPISTFARL+GKV VIGVFKI VVA SSCDLTI+I N ++ DQ+C+YRTKL
Subjt:  SAEGTEKMNLTLTMMANRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL

TrEMBL top hitse value%identityAlignment
A0A0A0KBX8 LEA_2 domain-containing protein7.7e-6674.6Show/hide
Query:  AAPCTKLLRNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLIDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPI
        AAP +KLL NIC+ + L LIL ++  LILAFTVFKPK+PII VDSVSL+DLNVS+  GV L+LSL VDL+V+NPNKV FEYS+STAVV YRGE+VGEAPI
Subjt:  AAPCTKLLRNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLIDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPI

Query:  PAGRLSAEGTEKMNLTLTMMANRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL
        P GRL  +GTEKMNLTLT+M +RML KSEVFSDVVSGQLPISTFARL GKVKV+ V KIHVVAS+SCDL ID+ N S GDQ CQYRT L
Subjt:  PAGRLSAEGTEKMNLTLTMMANRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL

A0A1S3CNL0 uncharacterized protein LOC1035029645.7e-6975.26Show/hide
Query:  MAAPCTKLLRNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLIDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAP
        MAAP +KLLRN CI ++L LIL V+L+L+LAFTVFKP+RPII VDSVSL+DLNV+L  GVDLNLS+ VDL+VENPNKVAFEYS+STAVV YRGE+VGEAP
Subjt:  MAAPCTKLLRNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLIDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAP

Query:  IPAGRLSAEGTEKMNLTLTMMANRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL
        IP GRL  +GT+KMNLTLT+M  RML +SEVFSDVVSGQL IST ARL+GKVKV+GV KIHVVAS+SCDL ID+ NGS GDQ CQ+RT++
Subjt:  IPAGRLSAEGTEKMNLTLTMMANRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL

A0A5A7SSE6 Putative Harpin-induced 15.7e-6975.26Show/hide
Query:  MAAPCTKLLRNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLIDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAP
        MAAP +KLLRN CI ++L LIL V+L+L+LAFTVFKP+RPII VDSVSL+DLNV+L  GVDLNLS+ VDL+VENPNKVAFEYS+STAVV YRGE+VGEAP
Subjt:  MAAPCTKLLRNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLIDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAP

Query:  IPAGRLSAEGTEKMNLTLTMMANRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL
        IP GRL  +GT+KMNLTLT+M  RML +SEVFSDVVSGQL IST ARL+GKVKV+GV KIHVVAS+SCDL ID+ NGS GDQ CQ+RT++
Subjt:  IPAGRLSAEGTEKMNLTLTMMANRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL

A0A6J1G8C1 uncharacterized protein LOC1114518001.4e-6275Show/hide
Query:  RNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLIDLNVSLVSG---VDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPIPAGRL
        RNICIA+LL LIL VI ILILAFTVFKPK+P I VDS+SL+DLN+SL +    VDLNL+L+V L+VENPNKVAF++S  TAVVSYRGEEV EAPIP+GRL
Subjt:  RNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLIDLNVSLVSG---VDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPIPAGRL

Query:  SAEGTEKMNLTLTMMANRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL
        SA+GTEKMNLTLTMMA+R+LAKSE+ SDV++G+LPISTFARL GKV VIGVFKI VVA SSCDLTIDI   ++ DQ+C+YRTKL
Subjt:  SAEGTEKMNLTLTMMANRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL

A0A6J1L4F9 uncharacterized protein LOC1114997941.3e-6576.63Show/hide
Query:  RNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLIDLNVSLVS---GVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPIPAGRL
        RNICIA+LL LI+ VILILILAFTVFKPK+P I VDSVSL+DLN+SL +   GVDLNL+L+V L+VENPNKVAF++S  TAVVSYRGEEV EAPIP+GRL
Subjt:  RNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLIDLNVSLVS---GVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPIPAGRL

Query:  SAEGTEKMNLTLTMMANRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL
        S +GTEKMNLTLTMMA+R+LAKSE+FSDV++G+LPISTFARL+GK+ VIGVFKI VVA SSCDLTIDI N S+ DQ+C+YRTKL
Subjt:  SAEGTEKMNLTLTMMANRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G64450.1 Glycine-rich protein family1.4e-0633.33Show/hide
Query:  CIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLIDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPIPAGRLSAEGTE
        C    + L++ ++++L++ FTVFKPK P I+V++V L       VS    N S    ++V NPN+  F +  S+  + Y G +VG   IPAG++ +   +
Subjt:  CIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLIDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPIPAGRLSAEGTE

Query:  KMNLTLTM
         M  T T+
Subjt:  KMNLTLTM

AT1G64450.1 Glycine-rich protein family1.9e-0337.74Show/hide
Query:  FSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQC
        + + V   + I +   L+G+VKV+ VF  HVVA S C +T+ I +GS+    C
Subjt:  FSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQC

AT2G46150.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family1.7e-2032.97Show/hide
Query:  NICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLIDLN----VSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPIPAGRL
        +IC+     ++ T++L L+  FTVF+ K PII ++ V +  L+     + V  +  N+S++VD+SV+NPN  +F+YS +T  + Y+G  VGEA    G+ 
Subjt:  NICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLIDLN----VSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPIPAGRL

Query:  SAEGTEKMNLTLTMMANRMLAKSEVFSDVV-SGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL
            T +MN+T+ +M +R+L+   +  ++  SG + + ++ R+ GKVK++G+ K HV    +C + ++IT  +I D  C+ +  L
Subjt:  SAEGTEKMNLTLTMMANRMLAKSEVFSDVV-SGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL

AT3G05975.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family1.3e-1730.11Show/hide
Query:  RNICIAI--LLCLILTVILILILAFTVFKPKRPIIAV--DSVSLIDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPIPAGR
        R IC  +  ++ ++  + +  ++   VFKPK PI+     +V  I  N+SL   V LN +L +++ ++NPN   FEY     +V YR   VG   +P+  
Subjt:  RNICIAI--LLCLILTVILILILAFTVFKPKRPIIAV--DSVSLIDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPIPAGR

Query:  LSAEGTEKMNLTLTMMANRMLAK-SEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL
        L A+G+  +   L +  ++ +A   ++  DV+ G++ + T A++ GK+ ++G+FKI + + S C+L +   +  + DQ C  +TKL
Subjt:  LSAEGTEKMNLTLTMMANRMLAK-SEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL

AT3G54200.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family4.5e-4248.35Show/hide
Query:  ICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLIDLNVS---LVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPIPAGRLSA
        IC  ILL L++ ++ I+ILAFT+FKPKRP   +DSV++  L  S   L+  V LNL+L VDLS++NPN++ F Y  S+A+++YRG+ +GEAP+PA R++A
Subjt:  ICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLIDLNVS---LVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPIPAGRLSA

Query:  EGTEKMNLTLTMMANRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL
          T  +N+TLT+MA+R+L+++++ SDV++G +P++TF +++GKV V+ +FKI V +SSSCDL+I +++ ++  Q C+Y TKL
Subjt:  EGTEKMNLTLTMMANRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL

AT4G23930.1 Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family3.0e-0926.04Show/hide
Query:  MAAPCTKLLRNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLIDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAP
        M+  C+ L    C    L ++  +I  L +  TVF+P+ P I+V SV +   +V+  S   ++ +     +V NPN+ AF +  +   + Y G  +G   
Subjt:  MAAPCTKLLRNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLIDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAP

Query:  IPAGRLSAEGTEKMNLTLTMMANRMLAKSE--------VFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQC
        +PAG + +  T++M  T ++ +  + A S           SD     + I +   ++G+V+V+G+F   + A  +C + I  ++GSI   +C
Subjt:  IPAGRLSAEGTEKMNLTLTMMANRMLAKSE--------VFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGCTCCCTGTACAAAATTACTGCGAAACATTTGCATAGCCATATTGCTTTGTCTAATTCTTACCGTAATTTTGATCCTCATTTTAGCCTTTACTGTTTTCAAGCC
CAAGCGGCCTATCATCGCCGTCGATTCAGTTTCTCTAATCGATCTGAACGTTTCTCTGGTTAGCGGCGTCGATCTGAACCTATCTCTCATGGTGGATCTATCCGTCGAGA
ATCCGAATAAGGTCGCCTTTGAATACTCTCAAAGCACCGCCGTTGTGAGTTACAGAGGCGAAGAAGTCGGAGAAGCGCCGATTCCAGCTGGCCGATTATCAGCCGAAGGG
ACTGAGAAAATGAACCTAACGCTGACGATGATGGCGAACCGGATGCTGGCAAAGTCGGAGGTGTTTTCCGACGTGGTTTCCGGTCAACTTCCGATCAGTACTTTTGCTCG
GTTGTCCGGGAAAGTGAAGGTGATCGGTGTTTTCAAGATTCATGTTGTGGCGTCGTCGTCTTGTGATCTCACCATTGATATTACAAATGGAAGCATTGGAGATCAGCAAT
GCCAATACCGGACGAAGCTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGCTCCCTGTACAAAATTACTGCGAAACATTTGCATAGCCATATTGCTTTGTCTAATTCTTACCGTAATTTTGATCCTCATTTTAGCCTTTACTGTTTTCAAGCC
CAAGCGGCCTATCATCGCCGTCGATTCAGTTTCTCTAATCGATCTGAACGTTTCTCTGGTTAGCGGCGTCGATCTGAACCTATCTCTCATGGTGGATCTATCCGTCGAGA
ATCCGAATAAGGTCGCCTTTGAATACTCTCAAAGCACCGCCGTTGTGAGTTACAGAGGCGAAGAAGTCGGAGAAGCGCCGATTCCAGCTGGCCGATTATCAGCCGAAGGG
ACTGAGAAAATGAACCTAACGCTGACGATGATGGCGAACCGGATGCTGGCAAAGTCGGAGGTGTTTTCCGACGTGGTTTCCGGTCAACTTCCGATCAGTACTTTTGCTCG
GTTGTCCGGGAAAGTGAAGGTGATCGGTGTTTTCAAGATTCATGTTGTGGCGTCGTCGTCTTGTGATCTCACCATTGATATTACAAATGGAAGCATTGGAGATCAGCAAT
GCCAATACCGGACGAAGCTCTGA
Protein sequenceShow/hide protein sequence
MAAPCTKLLRNICIAILLCLILTVILILILAFTVFKPKRPIIAVDSVSLIDLNVSLVSGVDLNLSLMVDLSVENPNKVAFEYSQSTAVVSYRGEEVGEAPIPAGRLSAEG
TEKMNLTLTMMANRMLAKSEVFSDVVSGQLPISTFARLSGKVKVIGVFKIHVVASSSCDLTIDITNGSIGDQQCQYRTKL