CuGenDBv2

Cmc02g0055031 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc02g0055031
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Stress up-regulated Nod 19
Genome location: CMiso1.1chr02:21956582..21957304
RNA-Seq Expression: Cmc02g0055031
Synteny: Cmc02g0055031
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: IPR011692 - Stress up-regulated Nod 19


Homology
GenBank top hits | e-value | % identity | Alignment
KAE8645835.1 hypothetical protein Csa_017353 [Cucumis sativus] | e-value: 2.7e-78 | % identity: 57.98
Query:  MLFHH----WFLQFTLIVTLFLNLEAIQIN----HQTLKTQSFLSPLFTLTPGLVIEKFYYNLNFPKGHIAIKSFDAEIVDEQGNPVSLFDTYPHHWTLV
        ML HH    WFL    I  +   LE + IN    HQ +KT+++++P FTL PG V+E+F+YN NFP+GHIAIKSFD E+VDE+ NP+ LF+TY HHW ++
Subjt:  MLFHH----WFLQFTLIVTLFLNLEAIQIN----HQTLKTQSFLSPLFTLTPGLVIEKFYYNLNFPKGHIAIKSFDAEIVDEQGNPVSLFDTYPHHWTLV

Query:  RYYQHKKTTTTNHMTNDDFAN----SIIIAGNNGVCQPHSLPYFYGMGTESRKTSNLLPDLYVIEVGNEKEVPLGYEEKWVLNVHAIDTRGVEDRIGCLE
        RYYQHK T   +  TN  F      + IIAGN+GVCQ H+LP F+G G +SRKTS+ LP  Y IEVGNEKEVPLGYEEKWVLN+HAIDTRGVEDRIGC+E
Subjt:  RYYQHKKTTTTNHMTNDDFAN----SIIIAGNNGVCQPHSLPYFYGMGTESRKTSNLLPDLYVIEVGNEKEVPLGYEEKWVLNVHAIDTRGVEDRIGCLE

Query:  CKCNLYDVNKD----KLDDDYKGGFKCCYDKAQCKVKEGFNGE-----EMNLYMKYT
        CK +LY+V KD     L+DDY GG +CCYD+ QCKVK+G++ E     + NLY +YT
Subjt:  CKCNLYDVNKD----KLDDDYKGGFKCCYDKAQCKVKEGFNGE-----EMNLYMKYT

XP_004137324.1 uncharacterized protein LOC101215981 [Cucumis sativus] | e-value: 8.9e-122 | % identity: 89.03
Query:  MLFHHWFLQFTLIVTLFLNLEAIQINHQTLKTQSFLSPLFTLTPGLVIEKFYYNLNFPKGHIAIKSFDAEIVDEQGNPVSLFDTYPHHWTLVRYYQHKKT
        MLFHHWFLQFTLIVTLFLNLEAIQINHQTLKTQSFLSPLFTLTPG VIEKFYYNLNFPKGHIAIKSFDAE+VDEQGNPVSLFDTY HHWTLVRYYQH KT
Subjt:  MLFHHWFLQFTLIVTLFLNLEAIQINHQTLKTQSFLSPLFTLTPGLVIEKFYYNLNFPKGHIAIKSFDAEIVDEQGNPVSLFDTYPHHWTLVRYYQHKKT

Query:  TTTNHMTNDDFANSIIIAGNNGVCQPHSLPYFYGMGTESRKTSNLLPDLYVIEVGNEKEVPLGYEEKWVLNVHAIDTRGVEDRIGCLECKCNLYDVNKDK
        TTTNH TN    NSIIIAGNNGVCQP++L YFYGMGTE+RKTSN LPD Y IEVGNEKEVPLG+EEKWVLNVHAIDTRGV DR+GCLECKC+LYDV+KD+
Subjt:  TTTNHMTNDDFANSIIIAGNNGVCQPHSLPYFYGMGTESRKTSNLLPDLYVIEVGNEKEVPLGYEEKWVLNVHAIDTRGVEDRIGCLECKCNLYDVNKDK

Query:  LDDDYKGGFKCCYDKAQCKVKEGFNGEEMNLYMKYTV
        LDDDY GGFKCCYDKAQCKV+EG+NGEE NLYMKYTV
Subjt:  LDDDYKGGFKCCYDKAQCKVKEGFNGEEMNLYMKYTV

XP_008453444.1 PREDICTED: uncharacterized protein LOC103494150 [Cucumis melo] | e-value: 1.2e-123 | % identity: 89.92
Query:  MLFHHWFLQFTLIVTLFLNLEAIQINHQTLKTQSFLSPLFTLTPGLVIEKFYYNLNFPKGHIAIKSFDAEIVDEQGNPVSLFDTYPHHWTLVRYYQ-HKK
        MLFHHWFLQFTLIVTLFLNLEAIQINHQTLKTQSFLSPLFTLTPG VIEKFYYNLNFPKGHIAIKSFDAE+VDEQGNPVSLFDTY HHWT+VRYYQ H K
Subjt:  MLFHHWFLQFTLIVTLFLNLEAIQINHQTLKTQSFLSPLFTLTPGLVIEKFYYNLNFPKGHIAIKSFDAEIVDEQGNPVSLFDTYPHHWTLVRYYQ-HKK

Query:  TTTTNHMTNDDFANSIIIAGNNGVCQPHSLPYFYGMGTESRKTSNLLPDLYVIEVGNEKEVPLGYEEKWVLNVHAIDTRGVEDRIGCLECKCNLYDVNKD
        T TTNH  N D+ANSIIIAGNNGVCQ H+L YFYGMGTE+RKTSN LPD Y IEVGNEKEVPLGYEEKWVLNVHAIDTRGVEDRIGCLECKC+LYDV+KD
Subjt:  TTTTNHMTNDDFANSIIIAGNNGVCQPHSLPYFYGMGTESRKTSNLLPDLYVIEVGNEKEVPLGYEEKWVLNVHAIDTRGVEDRIGCLECKCNLYDVNKD

Query:  KLDDDYKGGFKCCYDKAQCKVKEGFNGEEMNLYMKYTV
        +LDDDYKGGFKCCYDKAQCKV+EG+NGEE NLYMKYTV
Subjt:  KLDDDYKGGFKCCYDKAQCKVKEGFNGEEMNLYMKYTV

XP_038887541.1 uncharacterized protein LOC120077659 [Benincasa hispida] | e-value: 2.6e-81 | % identity: 61.25
Query:  WFLQFTLIVTLFLNLEAIQINHQTLKTQSFLSPLFTLTPGLVIEKFYYNLNFPKGHIAIKSFDAEIVDEQGNPVSLFDTYPHHWTLVRYYQHKKTTTTNH
        WFL     ++L + +  ++  +Q +KT+++++PLFTL PG V+E+FYYN NFPKGHIA+KSFD E+VDE GNP+ LF+TY HHW ++RYYQHK T   N 
Subjt:  WFLQFTLIVTLFLNLEAIQINHQTLKTQSFLSPLFTLTPGLVIEKFYYNLNFPKGHIAIKSFDAEIVDEQGNPVSLFDTYPHHWTLVRYYQHKKTTTTNH

Query:  MTNDDFAN----SIIIAGNNGVCQPHSLPYFYGMGTESRKTSNLLPDLYVIEVGNEKEVPLGYEEKWVLNVHAIDTRGVEDRIGCLECKCNLYDVNKD--
         TN  F      + IIA NNGVCQ ++LP F+G G +SRKTS+ LP+ Y IEVGNEKEVPLGYEEKWVLN+HAIDTRGVEDRIGC+ECK +LY+V KD  
Subjt:  MTNDDFAN----SIIIAGNNGVCQPHSLPYFYGMGTESRKTSNLLPDLYVIEVGNEKEVPLGYEEKWVLNVHAIDTRGVEDRIGCLECKCNLYDVNKD--

Query:  --KLDDDYKGGFKCCYDKAQCKVKEGFNGEEMNLYMKYTV
           L+DDY GG +CCYD+ QCKVKEG+ GEE NLY++YTV
Subjt:  --KLDDDYKGGFKCCYDKAQCKVKEGFNGEEMNLYMKYTV

XP_038905678.1 uncharacterized protein LOC120091645 [Benincasa hispida] | e-value: 3.2e-111 | % identity: 80.17
Query:  MLFHHWFLQFTLIVTLFLNLEAIQINHQTLKTQSFLSPLFTLTPGLVIEKFYYNLNFPKGHIAIKSFDAEIVDEQGNPVSLFDTYPHHWTLVRYYQHKKT
        MLFHHW L FTLI TLFLNLEAI++NHQT+KTQ+FLSPLFTLTPG V+EKF+YNLNFPKGHIAIKSFDAE++DE+ NPVSLFD Y HHWTLVRYYQH K 
Subjt:  MLFHHWFLQFTLIVTLFLNLEAIQINHQTLKTQSFLSPLFTLTPGLVIEKFYYNLNFPKGHIAIKSFDAEIVDEQGNPVSLFDTYPHHWTLVRYYQHKKT

Query:  TTTNHMTNDDFANSIIIAGNNGVCQPHSLPYFYGMGTESRKTSNLLPDLYVIEVGNEKEVPLGYEEKWVLNVHAIDTRGVEDRIGCLECKCNLYDVNKDK
        T TNH   +D A+SIIIAGN+G CQPH+L YFYGMGTESRKTSN LP+ Y IEVGNEKEVPLGYEEKW+LNVH IDTRGVEDRIGCLEC+C+LY+VNKD+
Subjt:  TTTNHMTNDDFANSIIIAGNNGVCQPHSLPYFYGMGTESRKTSNLLPDLYVIEVGNEKEVPLGYEEKWVLNVHAIDTRGVEDRIGCLECKCNLYDVNKDK

Query:  LDDDYKGGFKCCYDKAQCKVKEGFNGEEMNLYMKYTV
        L++DYKGGFKCCYD AQCKV+EG+ GEE NLY+KYTV
Subjt:  LDDDYKGGFKCCYDKAQCKVKEGFNGEEMNLYMKYTV

TrEMBL top hits | e-value | % identity | Alignment
A0A0A0K2U1 Uncharacterized protein | e-value: 1.0e-78 | % identity: 57.75
Query:  MLFHH----WFLQFTLIVTLFLNLEAIQIN----HQTLKTQSFLSPLFTLTPGLVIEKFYYNLNFPKGHIAIKSFDAEIVDEQGNPVSLFDTYPHHWTLV
        ML HH    WFL    I  +   LE + IN    HQ +KT+++++P FTL PG V+E+F+YN NFP+GHIAIKSFD E+VDE+ NP+ LF+TY HHW + 
Subjt:  MLFHH----WFLQFTLIVTLFLNLEAIQIN----HQTLKTQSFLSPLFTLTPGLVIEKFYYNLNFPKGHIAIKSFDAEIVDEQGNPVSLFDTYPHHWTLV

Query:  RYYQHKKTTTTNHMTNDDFAN----SIIIAGNNGVCQPHSLPYFYGMGTESRKTSNLLPDLYVIEVGNEKEVPLGYEEKWVLNVHAIDTRGVEDRIGCLE
        RYYQHK +   N   N  F      + IIAGN+GVCQ H+LP+F+G G ESRKTS+ LP  Y IEVGNEKEVPLGYEEKWVLN+HAIDTRGVEDRIGC+E
Subjt:  RYYQHKKTTTTNHMTNDDFAN----SIIIAGNNGVCQPHSLPYFYGMGTESRKTSNLLPDLYVIEVGNEKEVPLGYEEKWVLNVHAIDTRGVEDRIGCLE

Query:  CKCNLYDVNKD----KLDDDYKGGFKCCYDKAQCKVKEGF-----NGEEMNLYMKYTV
        CK +LY+V KD     L+DDY GG +CCYD+ QCKVK+G+     + ++ NLY++YTV
Subjt:  CKCNLYDVNKD----KLDDDYKGGFKCCYDKAQCKVKEGF-----NGEEMNLYMKYTV

A0A1S3BVP7 uncharacterized protein LOC103494150 | e-value: 6.0e-124 | % identity: 89.92
Query:  MLFHHWFLQFTLIVTLFLNLEAIQINHQTLKTQSFLSPLFTLTPGLVIEKFYYNLNFPKGHIAIKSFDAEIVDEQGNPVSLFDTYPHHWTLVRYYQ-HKK
        MLFHHWFLQFTLIVTLFLNLEAIQINHQTLKTQSFLSPLFTLTPG VIEKFYYNLNFPKGHIAIKSFDAE+VDEQGNPVSLFDTY HHWT+VRYYQ H K
Subjt:  MLFHHWFLQFTLIVTLFLNLEAIQINHQTLKTQSFLSPLFTLTPGLVIEKFYYNLNFPKGHIAIKSFDAEIVDEQGNPVSLFDTYPHHWTLVRYYQ-HKK

Query:  TTTTNHMTNDDFANSIIIAGNNGVCQPHSLPYFYGMGTESRKTSNLLPDLYVIEVGNEKEVPLGYEEKWVLNVHAIDTRGVEDRIGCLECKCNLYDVNKD
        T TTNH  N D+ANSIIIAGNNGVCQ H+L YFYGMGTE+RKTSN LPD Y IEVGNEKEVPLGYEEKWVLNVHAIDTRGVEDRIGCLECKC+LYDV+KD
Subjt:  TTTTNHMTNDDFANSIIIAGNNGVCQPHSLPYFYGMGTESRKTSNLLPDLYVIEVGNEKEVPLGYEEKWVLNVHAIDTRGVEDRIGCLECKCNLYDVNKD

Query:  KLDDDYKGGFKCCYDKAQCKVKEGFNGEEMNLYMKYTV
        +LDDDYKGGFKCCYDKAQCKV+EG+NGEE NLYMKYTV
Subjt:  KLDDDYKGGFKCCYDKAQCKVKEGFNGEEMNLYMKYTV

A0A1S3C023 uncharacterized protein LOC103495342 | e-value: 8.5e-78 | % identity: 56.92
Query:  MLFHH----WFLQFTLIVTLFLNLEAIQIN-----HQT-LKTQSFLSPLFTLTPGLVIEKFYYNLNFPKGHIAIKSFDAEIVDEQGNPVSLFDTYPHHWT
        ML HH    W L F+ +  +   LE +  N     HQ  +KT+++ +PLFTL PG V+E+F+YN NFPKGHIA+KSFD E+VDE+ NP+ LF+TY HHW 
Subjt:  MLFHH----WFLQFTLIVTLFLNLEAIQIN-----HQT-LKTQSFLSPLFTLTPGLVIEKFYYNLNFPKGHIAIKSFDAEIVDEQGNPVSLFDTYPHHWT

Query:  LVRYYQHKKTTTTNHMTNDDFAN----SIIIAGNNGVCQPHSLPYFYGMGTESRKTSNLLPDLYVIEVGNEKEVPLGYEEKWVLNVHAIDTRGVEDRIGC
        + RYYQHK T   N   N  F      + I+AGNNGVCQ H+LP F+G G +SRKTS+ LP+ Y IEVGNEKEVPLGYEEKWVLN+HAIDTRGVEDRIGC
Subjt:  LVRYYQHKKTTTTNHMTNDDFAN----SIIIAGNNGVCQPHSLPYFYGMGTESRKTSNLLPDLYVIEVGNEKEVPLGYEEKWVLNVHAIDTRGVEDRIGC

Query:  LECKCNLYDVNKD----KLDDDYKGGFKCCYDKAQCKVKEGFNGE-----EMNLYMKYTV
        +ECK +LY+V KD     L+DDY GG +CCYD+ QCK+K+G+  E     + NLY++YTV
Subjt:  LECKCNLYDVNKD----KLDDDYKGGFKCCYDKAQCKVKEGFNGE-----EMNLYMKYTV

A0A5D3C682 SURNod19 domain-containing protein | e-value: 8.5e-78 | % identity: 56.92
Query:  MLFHH----WFLQFTLIVTLFLNLEAIQIN-----HQT-LKTQSFLSPLFTLTPGLVIEKFYYNLNFPKGHIAIKSFDAEIVDEQGNPVSLFDTYPHHWT
        ML HH    W L F+ +  +   LE +  N     HQ  +KT+++ +PLFTL PG V+E+F+YN NFPKGHIA+KSFD E+VDE+ NP+ LF+TY HHW 
Subjt:  MLFHH----WFLQFTLIVTLFLNLEAIQIN-----HQT-LKTQSFLSPLFTLTPGLVIEKFYYNLNFPKGHIAIKSFDAEIVDEQGNPVSLFDTYPHHWT

Query:  LVRYYQHKKTTTTNHMTNDDFAN----SIIIAGNNGVCQPHSLPYFYGMGTESRKTSNLLPDLYVIEVGNEKEVPLGYEEKWVLNVHAIDTRGVEDRIGC
        + RYYQHK T   N   N  F      + I+AGNNGVCQ H+LP F+G G +SRKTS+ LP+ Y IEVGNEKEVPLGYEEKWVLN+HAIDTRGVEDRIGC
Subjt:  LVRYYQHKKTTTTNHMTNDDFAN----SIIIAGNNGVCQPHSLPYFYGMGTESRKTSNLLPDLYVIEVGNEKEVPLGYEEKWVLNVHAIDTRGVEDRIGC

Query:  LECKCNLYDVNKD----KLDDDYKGGFKCCYDKAQCKVKEGFNGE-----EMNLYMKYTV
        +ECK +LY+V KD     L+DDY GG +CCYD+ QCK+K+G+  E     + NLY++YTV
Subjt:  LECKCNLYDVNKD----KLDDDYKGGFKCCYDKAQCKVKEGFNGE-----EMNLYMKYTV

A0A6J1E2R7 uncharacterized protein LOC111430047 | e-value: 1.4e-75 | % identity: 58.33
Query:  LQFTLIVTLFLNLEAIQIN-HQTLKTQSFLSPLFTLTPGLVIEKFYYNLNFPKGHIAIKSFDAEIVDEQGNPVSLFDTYPHHWTLVRYYQHKKTTTTNHM
        L   LI+ +   L    IN +Q +KT+SFL+P FT+TPG V+E+FYY+ NFPK HIA+K FD E+VD+ GNPV LF+TY HHW ++RYYQHK     N  
Subjt:  LQFTLIVTLFLNLEAIQIN-HQTLKTQSFLSPLFTLTPGLVIEKFYYNLNFPKGHIAIKSFDAEIVDEQGNPVSLFDTYPHHWTLVRYYQHKKTTTTNHM

Query:  TNDDFAN----SIIIAGNNGVCQPHSLPYFYGMGTESRKTSNLLPDLYVIEVGNEKEVPLGYEEKWVLNVHAIDTRGVEDRIGCLECKCNLYDVNKD---
        TN  F      + +IAGNNGVCQ H LP+FYG G +SR+TS+ LP+ Y IEVGNE EVPLGYEEKWVL +HAIDTRGVEDR+GC+EC+ +LY+V KD   
Subjt:  TNDDFAN----SIIIAGNNGVCQPHSLPYFYGMGTESRKTSNLLPDLYVIEVGNEKEVPLGYEEKWVLNVHAIDTRGVEDRIGCLECKCNLYDVNKD---

Query:  -KLDDDYKGGFKCCYDKAQCKVKEGFNG-EEMNLYMKYTV
          L+ DYKGG +CCYDK +CK++E + G EE +LY++YTV
Subjt:  -KLDDDYKGGFKCCYDKAQCKVKEGFNG-EEMNLYMKYTV

SwissProt top hits | e-value | % identity | Alignment
No hits found
Arabidopsis top hits | e-value | % identity | Alignment
AT5G61820.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: vacuole; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; CONTAINS InterPro DOMAIN/s: Stress up-regulated Nod 19 (InterPro:IPR011692); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). | e-value: 2.0e-58 | % identity: 47.16
Query:  QTLKTQSFLSPLFTLTPGLVIEKFYYNLNFPKGHIAIKSFDAEIVDEQGNPVSLFDTYPHHWTLVRYYQHK------KTTTTNH-------MTNDDFANS
        + +K+  F SP   + PG V   + ++++FP+GHI +K+FDAE+VDE G PV L +TY HHW +  YY  K      +    NH        +N D  + 
Subjt:  QTLKTQSFLSPLFTLTPGLVIEKFYYNLNFPKGHIAIKSFDAEIVDEQGNPVSLFDTYPHHWTLVRYYQHK------KTTTTNH-------MTNDDFANS

Query:  IIIAGNNGVCQPHSLPYFYGMGTESRKTSNLLPDLYVIEVGNEKEVPLGYEEKWVLNVHAIDTRGVEDRIGCLECKCNLYDVNKDK----LDDDYKGGFK
        II+  N G+C+  +L +F+G+G+E+R+TS  +PD Y IE+GN +E P GYE KW+LN+HAIDTRGVED+ GC+EC C+LY+V  D+    +   YKGG  
Subjt:  IIIAGNNGVCQPHSLPYFYGMGTESRKTSNLLPDLYVIEVGNEKEVPLGYEEKWVLNVHAIDTRGVEDRIGCLECKCNLYDVNKDK----LDDDYKGGFK

Query:  CCYDKAQCKVKEGF-NGEE-MNLYMKYTV
        CCYDK QC+VK GF NGE+   LY+KYTV
Subjt:  CCYDKAQCKVKEGF-NGEE-MNLYMKYTV
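
In the alignment blocks above, the unlabeled line under each Query line is the BLAST midline: a letter marks an identical residue, `+` a conservative substitution, and a blank a mismatch or gap. The following is a minimal sketch of tallying one displayed block, assuming the midline is copied with its spaces intact. `midline_stats` is a hypothetical helper, not part of CuGenDB, and a single block will not generally reproduce the whole-alignment % identity reported on the hit line.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one BLAST alignment block.

    Hypothetical helper: assumes `query` and `midline` are the two
    aligned strings of a single block, with the label prefix removed.
    """
    if not midline or len(query) != len(midline):
        raise ValueError("query and midline must be non-empty and equal in length")
    # Letters on the midline are identical residues.
    identities = sum(1 for ch in midline if ch.isalpha())
    # '+' marks a conservative (positive-scoring) substitution.
    conservative = midline.count("+")
    return {
        "length": len(midline),
        "identities": identities,
        "positives": identities + conservative,
        "percent_identity": round(100.0 * identities / len(midline), 2),
    }


# Last displayed block of the XP_008453444.1 hit.
query = "KLDDDYKGGFKCCYDKAQCKVKEGFNGEEMNLYMKYTV"
midline = "+LDDDYKGGFKCCYDKAQCKV+EG+NGEE NLYMKYTV"
stats = midline_stats(query, midline)
print(stats)
```

Counting from the midline alone keeps the sketch independent of the Subjt lines.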


Sequences
CDS sequence
ATGTTATTCCATCATTGGTTTCTCCAATTTACACTCATAGTGACACTATTCCTAAACTTGGAAGCCATTCAAATAAACCACCAAACCCTAAAAACCCAATCTTTTCTCTC
CCCATTATTCACTTTAACTCCTGGCTTAGTAATTGAAAAATTCTACTACAATCTCAACTTTCCCAAAGGCCATATTGCAATAAAGAGCTTCGATGCCGAAATCGTCGACG
AACAAGGTAATCCCGTCTCGCTTTTTGACACATATCCCCATCATTGGACACTCGTGAGATATTACCAACACAAGAAAACTACAACAACGAATCACATGACGAATGACGAT
TTTGCAAACTCTATCATTATTGCGGGTAATAACGGAGTTTGCCAACCACATTCATTGCCGTATTTTTATGGTATGGGAACCGAATCGAGAAAAACATCAAATCTTCTTCC
AGACCTGTATGTAATTGAAGTTGGAAATGAAAAGGAAGTTCCTTTAGGGTATGAAGAGAAATGGGTTCTTAATGTTCATGCCATTGATACTAGAGGTGTGGAGGATAGAA
TTGGATGTCTTGAGTGTAAATGTAATTTGTATGATGTTAACAAAGATAAATTGGATGATGATTACAAAGGAGGATTTAAATGTTGCTATGATAAAGCTCAATGTAAAGTG
AAGGAAGGTTTTAACGGAGAGGAAATGAATTTGTATATGAAATATACGGTGCCAATGGGTTGA
mRNA sequence
ATGTTATTCCATCATTGGTTTCTCCAATTTACACTCATAGTGACACTATTCCTAAACTTGGAAGCCATTCAAATAAACCACCAAACCCTAAAAACCCAATCTTTTCTCTC
CCCATTATTCACTTTAACTCCTGGCTTAGTAATTGAAAAATTCTACTACAATCTCAACTTTCCCAAAGGCCATATTGCAATAAAGAGCTTCGATGCCGAAATCGTCGACG
AACAAGGTAATCCCGTCTCGCTTTTTGACACATATCCCCATCATTGGACACTCGTGAGATATTACCAACACAAGAAAACTACAACAACGAATCACATGACGAATGACGAT
TTTGCAAACTCTATCATTATTGCGGGTAATAACGGAGTTTGCCAACCACATTCATTGCCGTATTTTTATGGTATGGGAACCGAATCGAGAAAAACATCAAATCTTCTTCC
AGACCTGTATGTAATTGAAGTTGGAAATGAAAAGGAAGTTCCTTTAGGGTATGAAGAGAAATGGGTTCTTAATGTTCATGCCATTGATACTAGAGGTGTGGAGGATAGAA
TTGGATGTCTTGAGTGTAAATGTAATTTGTATGATGTTAACAAAGATAAATTGGATGATGATTACAAAGGAGGATTTAAATGTTGCTATGATAAAGCTCAATGTAAAGTG
AAGGAAGGTTTTAACGGAGAGGAAATGAATTTGTATATGAAATATACGGTGCCAATGGGTTGA
Protein sequence
MLFHHWFLQFTLIVTLFLNLEAIQINHQTLKTQSFLSPLFTLTPGLVIEKFYYNLNFPKGHIAIKSFDAEIVDEQGNPVSLFDTYPHHWTLVRYYQHKKTTTTNHMTNDD
FANSIIIAGNNGVCQPHSLPYFYGMGTESRKTSNLLPDLYVIEVGNEKEVPLGYEEKWVLNVHAIDTRGVEDRIGCLECKCNLYDVNKDKLDDDYKGGFKCCYDKAQCKV
KEGFNGEEMNLYMKYTVPMG
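
The genome location and the protein length listed on this page can be cross-checked against each other. The following is a minimal sketch, assuming the location string has the form `seqid:start..end` with 1-based inclusive coordinates, and assuming a single-exon gene without UTRs (suggested here by the identical CDS and mRNA sequences). `parse_location` is a hypothetical helper, not a CuGenDB function.

```python
import re


def parse_location(location: str):
    """Split 'seqid:start..end' into (seqid, start, end).

    Hypothetical helper; the format is inferred from the
    'Genome location' field on this page.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("CMiso1.1chr02:21956582..21957304")

# Assumption: coordinates are 1-based and inclusive.
span = end - start + 1

# Under the single-exon assumption the span is the CDS length,
# so it should divide evenly into codons.
codons, remainder = divmod(span, 3)

# The final codon is the stop codon and encodes no residue.
protein_length = codons - 1

print(seqid, span, codons, remainder, protein_length)
```

Under these assumptions the span is 723 nt, that is 241 codons including the stop, which agrees with the 240-residue protein sequence listed above.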