CuGenDBv2

MS001550 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS001550
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Extracellular ligand-binding receptor
Genome location: scaffold119:175196..176745
RNA-Seq Expression: MS001550
Synteny: MS001550
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
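
The genome location field above follows a `scaffold:start..end` pattern. As a minimal sketch, it could be split into its parts as shown here. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based coordinates with both ends included, which the record itself does not state.

```python
import re


def parse_location(location):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)


def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


scaffold, start, end = parse_location("scaffold119:175196..176745")
print(scaffold, start, end, span_length(start, end))
```

Under that coordinate assumption the gene spans 1,550 bp on scaffold119.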


Homology
GenBank top hits (e value, % identity, alignment)
KAG6576012.1 hypothetical protein SDJN03_26651, partial [Cucurbita argyrosperma subsp. sororia]; e value 1.9e-57; identity 52.74%
Query:  SSSPLSSKASTFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISF---IKLPAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSFV
        SSS +   AST LL LLL+I SLK       LL NG NSLPT +F LFFK+  CVL+S    ++ PA+A L A Q LA A++ V + T+EMGLGII SF+
Subjt:  SSSPLSSKASTFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISF---IKLPAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSFV

Query:  LMVLEFLKNAVFGSILESGSIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALAG
        + VL  LKN VFGS  E GS FG L++K K+S  E S+++QVREII  +S KI+D  LETASS AG +F F   +I  LLN+P SA+G+LV  +K  L G
Subjt:  LMVLEFLKNAVFGSILESGSIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALAG

Query:  S-SAMDGVREIVQNFLSKMIGAGSGVVSSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG
        + S+M GVR IV++ + K+   G GV SSSASG FE VK A+ L +ESG TVGGL+EK K +LEVL ME LR +I+S++++ +++I SY  G
Subjt:  S-SAMDGVREIVQNFLSKMIGAGSGVVSSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG

XP_011659765.2 uncharacterized protein LOC105436268 [Cucumis sativus]; e value 4.6e-56; identity 49.5%
Query:  MSCCSSSP-LSSKASTFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISF---IKLPAQALLSAFQQLAQAMRAVLMYTIEMGLGI
        M C SSS  +S   STFLL LLL+I SLKI       + NGFNSL T    LF K+ PC+++SF   IKLPA+A LSAFQ L +A++++ + TIEMG GI
Subjt:  MSCCSSSP-LSSKASTFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISF---IKLPAQALLSAFQQLAQAMRAVLMYTIEMGLGI

Query:  ITSFVLMVLEFLKNAVFGSILE-SGSIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKM
        I+SFV+ VLE + N VFGS +E S S FGGL+E TK S+   ++ +QVR IIES  + ++    E A+SFAG MF+F    +  + NEP S IG LVE +
Subjt:  ITSFVLMVLEFLKNAVFGSILE-SGSIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKM

Query:  KEALAG--SSAMDGVREIVQNFLSKMIGAGSGVVSSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG
        K +L     S M+GV+ IV+  + KM+   S V +SS  GLFE VK   +L V+SG +VGGL+EK +  LE+L ME LRGII +I+KI ++++ +YLFG
Subjt:  KEALAG--SSAMDGVREIVQNFLSKMIGAGSGVVSSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG

XP_022156757.1 uncharacterized protein LOC111023598 [Momordica charantia]; e value 8.0e-141; identity 99.32%
Query:  MSCCSSSPLSSKASTFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISFIKLPAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSF
        MSCCSSSPLSSKASTFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISFIKLPAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSF
Subjt:  MSCCSSSPLSSKASTFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISFIKLPAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSF

Query:  VLMVLEFLKNAVFGSILESGSIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALA
        VLMVLEFLKNAVFGSILESGSIFGGLVEKTKSSFMESSM DQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALA
Subjt:  VLMVLEFLKNAVFGSILESGSIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALA

Query:  GSSAMDGVREIVQNFLSKMIGAGSGVVSSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG
        GSSAMDGVREIVQNFLSKMIGAGSGV SSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG
Subjt:  GSSAMDGVREIVQNFLSKMIGAGSGVVSSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG

XP_022954027.1 uncharacterized protein LOC111456411 [Cucurbita moschata]; e value 5.5e-57; identity 51.71%
Query:  SSSPLSSKASTFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISF---IKLPAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSFV
        SSS +   AST LL LLL+I SLK       LL NG NSLPT +F LFFK+ PCVL+S    ++ PA+A L A Q LA A++ V + ++EMGLGII SF+
Subjt:  SSSPLSSKASTFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISF---IKLPAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSFV

Query:  LMVLEFLKNAVFGSILESGSIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALAG
        + VL  LKN VFGS  E GS+FGGL++K ++S  E S+++ VREII  +S KI+D  LETA+S AG +F F   +I  LLN+P SA+G+LV  +K  L G
Subjt:  LMVLEFLKNAVFGSILESGSIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALAG

Query:  S-SAMDGVREIVQNFLSKMIGAGSGVVSSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG
        + S+M GVR IV++ + K+   G GV SSSASG FE VK A+ L +ESG TVGGL+EK K +LEVL ME LR +I+S++++ +++  SY  G
Subjt:  S-SAMDGVREIVQNFLSKMIGAGSGVVSSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG

XP_022991981.1 uncharacterized protein LOC111488468 [Cucurbita maxima]; e value 1.3e-58; identity 52.74%
Query:  SSSPLSSKASTFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISF---IKLPAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSFV
        SSS +   AST LL LLL+I SLK       LL NG NSLPT +F LFFK+ PCVL+S    ++ PA+A L A Q LA A++ V + T+EMGLGII SF+
Subjt:  SSSPLSSKASTFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISF---IKLPAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSFV

Query:  LMVLEFLKNAVFGSILESGSIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALAG
        + VL  LKN VFGS  E GS FGGL++K K+S  E S+++QVREII  +S KI+D  LETA+S AG +F F   +I   LN+P SA+G+LV  +K +L G
Subjt:  LMVLEFLKNAVFGSILESGSIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALAG

Query:  S-SAMDGVREIVQNFLSKMIGAGSGVVSSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG
        + S+M GVR IV++ + K+   G GV SSSASG FE VK A+ L +ESG TVGGL+EK K +LEVL ME LR +I+S++++ +++I SY  G
Subjt:  S-SAMDGVREIVQNFLSKMIGAGSGVVSSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K6V2 Uncharacterized protein; e value 3.2e-55; identity 49.82%
Query:  STFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISF---IKLPAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSFVLMVLEFLKN
        STFLL LLL+I SLKI       + NGFNSL T    LF K+ PC+++SF   IKLPA+A LSAFQ L +A++++ + TIEMG GII+SFV+ VLE + N
Subjt:  STFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISF---IKLPAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSFVLMVLEFLKN

Query:  AVFGSILE-SGSIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALAG--SSAMDG
         VFGS +E S S FGGL+E TK S+   ++ +QVR IIES  + ++    E A+SFAG MF+F    +  + NEP S IG LVE +K +L     S M+G
Subjt:  AVFGSILE-SGSIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALAG--SSAMDG

Query:  VREIVQNFLSKMIGAGSGVVSSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG
        V+ IV+  + KM+   S V +SS  GLFE VK   +L V+SG +VGGL+EK +  LE+L ME LRGII +I+KI ++++ +YLFG
Subjt:  VREIVQNFLSKMIGAGSGVVSSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG

A0A5D3BCI5 Uncharacterized protein; e value 3.2e-34; identity 50%
Query:  VLEFLKNAVFGSILESG-SIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEAL--A
        VLE + N VFGS +ES  S FGGL+E T   F   ++ +QVR IIES  K ++    E A+SFAG MF+F    I  + NEPSSA+G LV  +K++L   
Subjt:  VLEFLKNAVFGSILESG-SIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEAL--A

Query:  GSSAMDGVREIVQNFLSKMIGAGSGVVSSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG
          S+M+GVR IV++F+ KM+ A S VVSSSA GLFE VK  ++L V+SG +VGGL+EK + +LE+L ME LR II +I+ I +++I +YL G
Subjt:  GSSAMDGVREIVQNFLSKMIGAGSGVVSSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG

A0A6J1DST0 uncharacterized protein LOC111023598; e value 3.9e-141; identity 99.32%
Query:  MSCCSSSPLSSKASTFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISFIKLPAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSF
        MSCCSSSPLSSKASTFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISFIKLPAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSF
Subjt:  MSCCSSSPLSSKASTFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISFIKLPAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSF

Query:  VLMVLEFLKNAVFGSILESGSIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALA
        VLMVLEFLKNAVFGSILESGSIFGGLVEKTKSSFMESSM DQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALA
Subjt:  VLMVLEFLKNAVFGSILESGSIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALA

Query:  GSSAMDGVREIVQNFLSKMIGAGSGVVSSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG
        GSSAMDGVREIVQNFLSKMIGAGSGV SSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG
Subjt:  GSSAMDGVREIVQNFLSKMIGAGSGVVSSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG

A0A6J1GPW1 uncharacterized protein LOC111456411; e value 2.7e-57; identity 51.71%
Query:  SSSPLSSKASTFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISF---IKLPAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSFV
        SSS +   AST LL LLL+I SLK       LL NG NSLPT +F LFFK+ PCVL+S    ++ PA+A L A Q LA A++ V + ++EMGLGII SF+
Subjt:  SSSPLSSKASTFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISF---IKLPAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSFV

Query:  LMVLEFLKNAVFGSILESGSIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALAG
        + VL  LKN VFGS  E GS+FGGL++K ++S  E S+++ VREII  +S KI+D  LETA+S AG +F F   +I  LLN+P SA+G+LV  +K  L G
Subjt:  LMVLEFLKNAVFGSILESGSIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALAG

Query:  S-SAMDGVREIVQNFLSKMIGAGSGVVSSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG
        + S+M GVR IV++ + K+   G GV SSSASG FE VK A+ L +ESG TVGGL+EK K +LEVL ME LR +I+S++++ +++  SY  G
Subjt:  S-SAMDGVREIVQNFLSKMIGAGSGVVSSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG

A0A6J1JSB1 uncharacterized protein LOC111488468; e value 6.3e-59; identity 52.74%
Query:  SSSPLSSKASTFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISF---IKLPAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSFV
        SSS +   AST LL LLL+I SLK       LL NG NSLPT +F LFFK+ PCVL+S    ++ PA+A L A Q LA A++ V + T+EMGLGII SF+
Subjt:  SSSPLSSKASTFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISF---IKLPAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSFV

Query:  LMVLEFLKNAVFGSILESGSIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALAG
        + VL  LKN VFGS  E GS FGGL++K K+S  E S+++QVREII  +S KI+D  LETA+S AG +F F   +I   LN+P SA+G+LV  +K +L G
Subjt:  LMVLEFLKNAVFGSILESGSIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALAG

Query:  S-SAMDGVREIVQNFLSKMIGAGSGVVSSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG
        + S+M GVR IV++ + K+   G GV SSSASG FE VK A+ L +ESG TVGGL+EK K +LEVL ME LR +I+S++++ +++I SY  G
Subjt:  S-SAMDGVREIVQNFLSKMIGAGSGVVSSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGTCTTGTTGTTCTTCTTCTCCCCTTTCTTCCAAGGCTTCAACATTTCTCCTCTACCTCCTCCTTGTAATCACATCTCTAAAAATCCTTAATCTCACTTTAAATCTCCT
TTCCAATGGCTTCAACAGCTTGCCCACTTTTATTTTCTCCCTATTCTTCAAATCCACACCTTGTGTTCTTATTTCCTTTATAAAGCTCCCTGCCCAAGCCCTCCTCAGCG
CATTTCAGCAGCTTGCACAAGCCATGAGGGCTGTTCTCATGTACACAATAGAGATGGGTTTGGGAATCATAACTTCTTTTGTGCTTATGGTGCTCGAGTTTCTCAAGAAT
GCCGTTTTCGGTTCGATTTTGGAGTCTGGTTCGATCTTTGGTGGCCTGGTGGAGAAGACAAAGTCGTCGTTTATGGAGAGTTCGATGATGGATCAAGTGCGGGAGATTAT
CGAGAGCATTTCGAAGAAGATCATCGACATGGCTCTGGAAACAGCAAGCTCTTTTGCAGGTAGCATGTTCGACTTTGTGAAGGACACCATCCTTGAGTTGTTGAATGAGC
CTAGTTCAGCCATCGGAGAGCTGGTAGAGAAGATGAAGGAGGCATTGGCAGGCAGCTCGGCCATGGACGGGGTGCGGGAGATTGTTCAAAACTTCTTAAGCAAGATGATC
GGTGCGGGATCGGGAGTGGTGAGTTCTTCGGCCAGTGGTTTGTTTGAATTTGTGAAGAATGCTTTGAGTTTGGCTGTTGAATCTGGTCTCACTGTTGGAGGCTTATTGGA
GAAGATGAAGGAATCTTTGGAGGTTCTAAGTATGGAAGGACTACGAGGGATAATTGAGAGTATTTCCAAGATAATTTTGGATGTCATTTTTAGCTATTTATTTGGC
mRNA sequence
ATGTCTTGTTGTTCTTCTTCTCCCCTTTCTTCCAAGGCTTCAACATTTCTCCTCTACCTCCTCCTTGTAATCACATCTCTAAAAATCCTTAATCTCACTTTAAATCTCCT
TTCCAATGGCTTCAACAGCTTGCCCACTTTTATTTTCTCCCTATTCTTCAAATCCACACCTTGTGTTCTTATTTCCTTTATAAAGCTCCCTGCCCAAGCCCTCCTCAGCG
CATTTCAGCAGCTTGCACAAGCCATGAGGGCTGTTCTCATGTACACAATAGAGATGGGTTTGGGAATCATAACTTCTTTTGTGCTTATGGTGCTCGAGTTTCTCAAGAAT
GCCGTTTTCGGTTCGATTTTGGAGTCTGGTTCGATCTTTGGTGGCCTGGTGGAGAAGACAAAGTCGTCGTTTATGGAGAGTTCGATGATGGATCAAGTGCGGGAGATTAT
CGAGAGCATTTCGAAGAAGATCATCGACATGGCTCTGGAAACAGCAAGCTCTTTTGCAGGTAGCATGTTCGACTTTGTGAAGGACACCATCCTTGAGTTGTTGAATGAGC
CTAGTTCAGCCATCGGAGAGCTGGTAGAGAAGATGAAGGAGGCATTGGCAGGCAGCTCGGCCATGGACGGGGTGCGGGAGATTGTTCAAAACTTCTTAAGCAAGATGATC
GGTGCGGGATCGGGAGTGGTGAGTTCTTCGGCCAGTGGTTTGTTTGAATTTGTGAAGAATGCTTTGAGTTTGGCTGTTGAATCTGGTCTCACTGTTGGAGGCTTATTGGA
GAAGATGAAGGAATCTTTGGAGGTTCTAAGTATGGAAGGACTACGAGGGATAATTGAGAGTATTTCCAAGATAATTTTGGATGTCATTTTTAGCTATTTATTTGGC
Protein sequence
MSCCSSSPLSSKASTFLLYLLLVITSLKILNLTLNLLSNGFNSLPTFIFSLFFKSTPCVLISFIKLPAQALLSAFQQLAQAMRAVLMYTIEMGLGIITSFVLMVLEFLKN
AVFGSILESGSIFGGLVEKTKSSFMESSMMDQVREIIESISKKIIDMALETASSFAGSMFDFVKDTILELLNEPSSAIGELVEKMKEALAGSSAMDGVREIVQNFLSKMI
GAGSGVVSSSASGLFEFVKNALSLAVESGLTVGGLLEKMKESLEVLSMEGLRGIIESISKIILDVIFSYLFG
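
The protein sequence above can be related back to the CDS by codon-wise translation. The sketch below is illustrative only: it assumes the standard nuclear genetic code, and the names `CODON_TABLE` and `translate` are hypothetical rather than part of any CuGenDB tooling. The demonstration input is the first 24 nucleotides of the CDS listed above.

```python
# Standard genetic code, laid out in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence codon by codon ('*' marks a stop codon)."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Codons containing anything other than A, C, G, T become 'X'.
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# First eight codons of the CDS sequence listed above.
print(translate("ATGTCTTGTTGTTCTTCTTCTCCC"))  # MSCCSSSP
```

The printed peptide matches the first eight residues of the protein sequence. Passing the full CDS, with its line breaks, to `translate` allows the same comparison over the whole record.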