CuGenDBv2

Moc08g37880 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g37880
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Protein of unknown function (DUF1997)
Genome location: chr8:28222298..28224800
RNA-Seq Expression: Moc08g37880
Synteny: Moc08g37880
Gene Ontology terms: NA
InterPro domains: IPR018971 - Protein of unknown function DUF1997
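
The genome location above uses a `chromosome:start..end` form. As a minimal sketch, the string can be split into its parts and the genomic span computed. The function name `parse_location` is hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(loc):
    """Split 'chr:start..end' into (chrom, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so
    span = end - start + 1. The source page does not confirm this.
    """
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom = m.group(1)
    start = int(m.group(2))
    end = int(m.group(3))
    return chrom, start, end, end - start + 1


if __name__ == "__main__":
    print(parse_location("chr8:28222298..28224800"))
```

Under that assumption the gene spans 2503 bp on chr8, longer than the 591 nt CDS listed further down, so the locus includes sequence beyond the coding region.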


Homology
GenBank top hits (e value, % identity, alignment)
XP_008443384.1 PREDICTED: uncharacterized protein LOC103486982 [Cucumis melo] | e value: 1.2e-65 | % identity: 53.7
Query:  MGHTLI-VSLQFPNFIFSPHLRTRTHTSRFHHENK------QTFLCFAL------------NGDQNPPLFSLEFSRRHPLFESSKASFDEYIEDEVRVLR
        M H L+ VSLQ P  I +P+ +  T     HH+ K        F+CFAL            N +QNPP+FSL+FS  HPL ES +ASFD+YIEDE R+LR
Subjt:  MGHTLI-VSLQFPNFIFSPHLRTRTHTSRFHHENK------QTFLCFAL------------NGDQNPPLFSLEFSRRHPLFESSKASFDEYIEDEVRVLR

Query:  ATFAGKSEQLADEDEWRIHMPSFQLLFFKVNPVVDVRFICRSSAKDYPIHIPPHISKFLELQLMRWELNGLGGDFKSQSFKISVKGALYAERRESKSKLR
        ATFAGKSE+++ +D WR+ MP+FQ+LF KV+PV DVR  C+S  KD PIHIP ++SKF++LQLM WEL GL  DFK    +I+VKGA+YAER +SKS L 
Subjt:  ATFAGKSEQLADEDEWRIHMPSFQLLFFKVNPVVDVRFICRSSAKDYPIHIPPHISKFLELQLMRWELNGLGGDFKSQSFKISVKGALYAERRESKSKLR

Query:  YYLVLNLHSFAAPTPLAFIPQDVFLALAQKGLKGMMEEAMDDFSEKLLLDYTKFKEK
          L+LNL++ A P P+ F  QD    LA+KGLKGMMEE M +F+E LLLDY K+K++
Subjt:  YYLVLNLHSFAAPTPLAFIPQDVFLALAQKGLKGMMEEAMDDFSEKLLLDYTKFKEK

XP_022144448.1 uncharacterized protein LOC111014131 [Momordica charantia] | e value: 1.2e-134 | % identity: 100
Query:  MGHTLIVSLQFPNFIFSPHLRTRTHTSRFHHENKQTFLCFALNGDQNPPLFSLEFSRRHPLFESSKASFDEYIEDEVRVLRATFAGKSEQLADEDEWRIH
        MGHTLIVSLQFPNFIFSPHLRTRTHTSRFHHENKQTFLCFALNGDQNPPLFSLEFSRRHPLFESSKASFDEYIEDEVRVLRATFAGKSEQLADEDEWRIH
Subjt:  MGHTLIVSLQFPNFIFSPHLRTRTHTSRFHHENKQTFLCFALNGDQNPPLFSLEFSRRHPLFESSKASFDEYIEDEVRVLRATFAGKSEQLADEDEWRIH

Query:  MPSFQLLFFKVNPVVDVRFICRSSAKDYPIHIPPHISKFLELQLMRWELNGLGGDFKSQSFKISVKGALYAERRESKSKLRYYLVLNLHSFAAPTPLAFI
        MPSFQLLFFKVNPVVDVRFICRSSAKDYPIHIPPHISKFLELQLMRWELNGLGGDFKSQSFKISVKGALYAERRESKSKLRYYLVLNLHSFAAPTPLAFI
Subjt:  MPSFQLLFFKVNPVVDVRFICRSSAKDYPIHIPPHISKFLELQLMRWELNGLGGDFKSQSFKISVKGALYAERRESKSKLRYYLVLNLHSFAAPTPLAFI

Query:  PQDVFLALAQKGLKGMMEEAMDDFSEKLLLDYTKFKEKLKE
        PQDVFLALAQKGLKGMMEEAMDDFSEKLLLDYTKFKEKLKE
Subjt:  PQDVFLALAQKGLKGMMEEAMDDFSEKLLLDYTKFKEKLKE

XP_022983867.1 uncharacterized protein LOC111482352 [Cucurbita maxima] | e value: 1.2e-65 | % identity: 57.55
Query:  MGHTL-IVSLQFPNFIFSPHLRTRTHTSRFHHENKQTFLCFAL----NGDQNPPLFSLEFSRRHPLFESSKASFDEYIEDEVRVLRATFAGKSEQLADED
        M H L  VS  FP  I           SR  H+ + +F  FA+    N  QNPP+FSL FS  HPLFES  ASFDEYI DE R+LRATF+GKSE+L ++ 
Subjt:  MGHTL-IVSLQFPNFIFSPHLRTRTHTSRFHHENKQTFLCFAL----NGDQNPPLFSLEFSRRHPLFESSKASFDEYIEDEVRVLRATFAGKSEQLADED

Query:  EWRIHMPSFQLLFFKVNPVVDVRFICRSSAKDYPIHIPPHISKFLELQLMRWELNGLGGDFKSQSFKISVKGALYAER--RESKSKLRYYLVLNLHSFAA
        EWR+ MPSFQLLF K++P+VDVR  CRS AKDYPIHIP H+SKFL+LQ+MRWE+ G+G DFKSQ F+ISVKGA YA R   ESKS LR +L+L+LHSF  
Subjt:  EWRIHMPSFQLLFFKVNPVVDVRFICRSSAKDYPIHIPPHISKFLELQLMRWELNGLGGDFKSQSFKISVKGALYAER--RESKSKLRYYLVLNLHSFAA

Query:  PTPLAFIPQDVFLALAQKGLKGMMEEAMDDFSEKLLLDYTKFKEK
            + IP D     A+KGLKGMM+E+M DF++ L+LDYTK+K++
Subjt:  PTPLAFIPQDVFLALAQKGLKGMMEEAMDDFSEKLLLDYTKFKEK

XP_023528168.1 uncharacterized protein LOC111791159 [Cucurbita pepo subsp. pepo] | e value: 1.2e-65 | % identity: 57.14
Query:  MGHTL-IVSLQFPNFIFSPHLRTRTHTSRFHHENKQTFLCFAL----NGDQNPPLFSLEFSRRHPLFESSKASFDEYIEDEVRVLRATFAGKSEQLADED
        M H L  VS  FP  I           +R  H+ +  F  FA+    N  QNPP+FSL FS  HPLFES  ASFDEYI DE R+LRATF+GKSE+L+ + 
Subjt:  MGHTL-IVSLQFPNFIFSPHLRTRTHTSRFHHENKQTFLCFAL----NGDQNPPLFSLEFSRRHPLFESSKASFDEYIEDEVRVLRATFAGKSEQLADED

Query:  EWRIHMPSFQLLFFKVNPVVDVRFICRSSAKDYPIHIPPHISKFLELQLMRWELNGLGGDFKSQSFKISVKGALYAERRESKSK--LRYYLVLNLHSFAA
        EWR+ MPSFQLLF K++PVVDVR  CRSSAKDYPIHIP H++KFL+LQ+MRWE+ G+G DFK Q F+ISVKGA YA R ES+SK  LR +L+L+LHSF +
Subjt:  EWRIHMPSFQLLFFKVNPVVDVRFICRSSAKDYPIHIPPHISKFLELQLMRWELNGLGGDFKSQSFKISVKGALYAERRESKSK--LRYYLVLNLHSFAA

Query:  PTPLAFIPQDVFLALAQKGLKGMMEEAMDDFSEKLLLDYTKFKEK
        P     IP D     A+KGL+GMM+E+M DF++ L+LDYTK+K++
Subjt:  PTPLAFIPQDVFLALAQKGLKGMMEEAMDDFSEKLLLDYTKFKEK

XP_038905853.1 uncharacterized protein LOC120091799 isoform X1 [Benincasa hispida] | e value: 4.4e-68 | % identity: 57.03
Query:  MGHTLI-VSLQFPNFIFSPHLRTRTHTSRFHHENK--QTFLCFALNGD------QNPPLFSLEFSRRHPLFESSKASFDEYIEDEVRVLRATFAGKSEQL
        M H L+ VS Q P  I + + R++ +    HH+ K    F+CFA+  +      QNPP+FSL+FS  HPL ES +ASFD+YIEDE R+LR TF+GKSE++
Subjt:  MGHTLI-VSLQFPNFIFSPHLRTRTHTSRFHHENK--QTFLCFALNGD------QNPPLFSLEFSRRHPLFESSKASFDEYIEDEVRVLRATFAGKSEQL

Query:  ADEDEWRIHMPSFQLLFFKVNPVVDVRFICRS--SAKDYPIHIPPHISKFLELQLMRWELNGLGGDFKSQSFKISVKGALYAERRESKSKLRYYLVLNLH
         ++DEWRI MPSFQL F +V+ V DVR  CRS  + +DYPIHIP H+SKF++LQLMRWEL GLG +FK Q F I+V+GALYAER ESKS L    VLNLH
Subjt:  ADEDEWRIHMPSFQLLFFKVNPVVDVRFICRS--SAKDYPIHIPPHISKFLELQLMRWELNGLGGDFKSQSFKISVKGALYAERRESKSKLRYYLVLNLH

Query:  SFAAPTPLAFIPQDVFLALAQKGLKGMMEEAMDDFSEKLLLDYTKFKEK
        +FAAPTP  F  QD     A+KGLKGMMEE M++F+E LLLDY+K+K++
Subjt:  SFAAPTPLAFIPQDVFLALAQKGLKGMMEEAMDDFSEKLLLDYTKFKEK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LC26 Uncharacterized protein | e value: 3.7e-65 | % identity: 51.97
Query:  MGHTLI-VSLQFPNFIFSPHLRTRT-----HTSRFHHENKQTFLCFAL-------NGDQNPPLFSLEFSRRHPLFESSKASFDEYIEDEVRVLRATFAGK
        M H+++ VSLQ P  + +P+ +  +     H  + ++     F+CFAL       N  QNPP+FSL+FS   PL ES +ASFD+YIEDE R+LRATF+GK
Subjt:  MGHTLI-VSLQFPNFIFSPHLRTRT-----HTSRFHHENKQTFLCFAL-------NGDQNPPLFSLEFSRRHPLFESSKASFDEYIEDEVRVLRATFAGK

Query:  SEQLADEDEWRIHMPSFQLLFFKVNPVVDVRFICRSSAKDYPIHIPPHISKFLELQLMRWELNGLGGDFKSQSFKISVKGALYAERRESKSKLRYYLVLN
        SE++ ++D+WR+ MPSFQ+LF KV+PV DVR  C+SS KD PIHIP ++SKF++LQLM WEL GL  DFK+   KI+VKGA+YAER +SKS L   L+LN
Subjt:  SEQLADEDEWRIHMPSFQLLFFKVNPVVDVRFICRSSAKDYPIHIPPHISKFLELQLMRWELNGLGGDFKSQSFKISVKGALYAERRESKSKLRYYLVLN

Query:  LHSFAAPTPLAFIPQDVFLALAQKGLKGMMEEAMDDFSEKLLLDYTKFKEKLKE
        L++ A   P+ F  QD    L +KGLKGMMEE M +F+E LLLDY K+K++ ++
Subjt:  LHSFAAPTPLAFIPQDVFLALAQKGLKGMMEEAMDDFSEKLLLDYTKFKEKLKE

A0A1S3B8N8 uncharacterized protein LOC103486982 | e value: 5.8e-66 | % identity: 53.7
Query:  MGHTLI-VSLQFPNFIFSPHLRTRTHTSRFHHENK------QTFLCFAL------------NGDQNPPLFSLEFSRRHPLFESSKASFDEYIEDEVRVLR
        M H L+ VSLQ P  I +P+ +  T     HH+ K        F+CFAL            N +QNPP+FSL+FS  HPL ES +ASFD+YIEDE R+LR
Subjt:  MGHTLI-VSLQFPNFIFSPHLRTRTHTSRFHHENK------QTFLCFAL------------NGDQNPPLFSLEFSRRHPLFESSKASFDEYIEDEVRVLR

Query:  ATFAGKSEQLADEDEWRIHMPSFQLLFFKVNPVVDVRFICRSSAKDYPIHIPPHISKFLELQLMRWELNGLGGDFKSQSFKISVKGALYAERRESKSKLR
        ATFAGKSE+++ +D WR+ MP+FQ+LF KV+PV DVR  C+S  KD PIHIP ++SKF++LQLM WEL GL  DFK    +I+VKGA+YAER +SKS L 
Subjt:  ATFAGKSEQLADEDEWRIHMPSFQLLFFKVNPVVDVRFICRSSAKDYPIHIPPHISKFLELQLMRWELNGLGGDFKSQSFKISVKGALYAERRESKSKLR

Query:  YYLVLNLHSFAAPTPLAFIPQDVFLALAQKGLKGMMEEAMDDFSEKLLLDYTKFKEK
          L+LNL++ A P P+ F  QD    LA+KGLKGMMEE M +F+E LLLDY K+K++
Subjt:  YYLVLNLHSFAAPTPLAFIPQDVFLALAQKGLKGMMEEAMDDFSEKLLLDYTKFKEK

A0A6J1CT99 uncharacterized protein LOC111014131 | e value: 5.8e-135 | % identity: 100
Query:  MGHTLIVSLQFPNFIFSPHLRTRTHTSRFHHENKQTFLCFALNGDQNPPLFSLEFSRRHPLFESSKASFDEYIEDEVRVLRATFAGKSEQLADEDEWRIH
        MGHTLIVSLQFPNFIFSPHLRTRTHTSRFHHENKQTFLCFALNGDQNPPLFSLEFSRRHPLFESSKASFDEYIEDEVRVLRATFAGKSEQLADEDEWRIH
Subjt:  MGHTLIVSLQFPNFIFSPHLRTRTHTSRFHHENKQTFLCFALNGDQNPPLFSLEFSRRHPLFESSKASFDEYIEDEVRVLRATFAGKSEQLADEDEWRIH

Query:  MPSFQLLFFKVNPVVDVRFICRSSAKDYPIHIPPHISKFLELQLMRWELNGLGGDFKSQSFKISVKGALYAERRESKSKLRYYLVLNLHSFAAPTPLAFI
        MPSFQLLFFKVNPVVDVRFICRSSAKDYPIHIPPHISKFLELQLMRWELNGLGGDFKSQSFKISVKGALYAERRESKSKLRYYLVLNLHSFAAPTPLAFI
Subjt:  MPSFQLLFFKVNPVVDVRFICRSSAKDYPIHIPPHISKFLELQLMRWELNGLGGDFKSQSFKISVKGALYAERRESKSKLRYYLVLNLHSFAAPTPLAFI

Query:  PQDVFLALAQKGLKGMMEEAMDDFSEKLLLDYTKFKEKLKE
        PQDVFLALAQKGLKGMMEEAMDDFSEKLLLDYTKFKEKLKE
Subjt:  PQDVFLALAQKGLKGMMEEAMDDFSEKLLLDYTKFKEKLKE

A0A6J1F2K4 uncharacterized protein LOC111441814 isoform X2 | e value: 2.9e-65 | % identity: 56.33
Query:  MGHTL-IVSLQFPNFIFSPHLRTRTHTSRFHHENKQTFLCFAL----NGDQNPPLFSLEFSRRHPLFESSKASFDEYIEDEVRVLRATFAGKSEQLADED
        M H L  VS  FP  I           +R  H+ +  F  FA+    N  QNPP+FSL FS  HPLFES  ASFDEYI DE R+LRATF+GKSE+L ++ 
Subjt:  MGHTL-IVSLQFPNFIFSPHLRTRTHTSRFHHENKQTFLCFAL----NGDQNPPLFSLEFSRRHPLFESSKASFDEYIEDEVRVLRATFAGKSEQLADED

Query:  EWRIHMPSFQLLFFKVNPVVDVRFICRSSAKDYPIHIPPHISKFLELQLMRWELNGLGGDFKSQSFKISVKGALYAERRESKSK--LRYYLVLNLHSFAA
        EWR+ MPSFQLLF K++PVVDVR  C+SS KDYPIHIP H+SKFL+LQ+MRWE+ G+G DFK Q F+ISVKG +YA R ES+SK  LR +L+L+LHSF +
Subjt:  EWRIHMPSFQLLFFKVNPVVDVRFICRSSAKDYPIHIPPHISKFLELQLMRWELNGLGGDFKSQSFKISVKGALYAERRESKSK--LRYYLVLNLHSFAA

Query:  PTPLAFIPQDVFLALAQKGLKGMMEEAMDDFSEKLLLDYTKFKEK
        P     IP D     A+KGL+GMM+E+M DF++ L+LDYTK+K++
Subjt:  PTPLAFIPQDVFLALAQKGLKGMMEEAMDDFSEKLLLDYTKFKEK

A0A6J1J0I3 uncharacterized protein LOC111482352 | e value: 5.8e-66 | % identity: 57.55
Query:  MGHTL-IVSLQFPNFIFSPHLRTRTHTSRFHHENKQTFLCFAL----NGDQNPPLFSLEFSRRHPLFESSKASFDEYIEDEVRVLRATFAGKSEQLADED
        M H L  VS  FP  I           SR  H+ + +F  FA+    N  QNPP+FSL FS  HPLFES  ASFDEYI DE R+LRATF+GKSE+L ++ 
Subjt:  MGHTL-IVSLQFPNFIFSPHLRTRTHTSRFHHENKQTFLCFAL----NGDQNPPLFSLEFSRRHPLFESSKASFDEYIEDEVRVLRATFAGKSEQLADED

Query:  EWRIHMPSFQLLFFKVNPVVDVRFICRSSAKDYPIHIPPHISKFLELQLMRWELNGLGGDFKSQSFKISVKGALYAER--RESKSKLRYYLVLNLHSFAA
        EWR+ MPSFQLLF K++P+VDVR  CRS AKDYPIHIP H+SKFL+LQ+MRWE+ G+G DFKSQ F+ISVKGA YA R   ESKS LR +L+L+LHSF  
Subjt:  EWRIHMPSFQLLFFKVNPVVDVRFICRSSAKDYPIHIPPHISKFLELQLMRWELNGLGGDFKSQSFKISVKGALYAER--RESKSKLRYYLVLNLHSFAA

Query:  PTPLAFIPQDVFLALAQKGLKGMMEEAMDDFSEKLLLDYTKFKEK
            + IP D     A+KGLKGMM+E+M DF++ L+LDYTK+K++
Subjt:  PTPLAFIPQDVFLALAQKGLKGMMEEAMDDFSEKLLLDYTKFKEK

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT5G39520.1 Protein of unknown function (DUF1997) | e value: 2.0e-31 | % identity: 39.06
Query:  FSLEFSRRHPLFESSKASFDEYIEDEVRVLRATFAGKSEQL-ADEDEWRIHMPSFQLLFFKVNPVVDVRFICRSSAKDYPIHIPPHISKFLELQLMRWEL
        +S + S    L ES +A FDEY+ED+ RV  A F  K +    +E+EWRI M   +  F    PVV +R  C+S+ +DYP  +P HI+K LEL + +WEL
Subjt:  FSLEFSRRHPLFESSKASFDEYIEDEVRVLRATFAGKSEQL-ADEDEWRIHMPSFQLLFFKVNPVVDVRFICRSSAKDYPIHIPPHISKFLELQLMRWEL

Query:  NGLGGDFKSQSFKISVKGALYAERRESKSKLRYYLVLNLHSFAAPTPLAFIPQDVFLALAQKGLKGMMEEAMDDFSEKLLLDYTKFKEKLKE
         GL    +   F + VKGALY +RR   ++L+  L   + SF  P+ LA +P+DV   +A   L G+++       E L+ DY+KFK + K+
Subjt:  NGLGGDFKSQSFKISVKGALYAERRESKSKLRYYLVLNLHSFAAPTPLAFIPQDVFLALAQKGLKGMMEEAMDDFSEKLLLDYTKFKEKLKE

AT5G39530.1 Protein of unknown function (DUF1997) | e value: 8.9e-35 | % identity: 42.27
Query:  PPLFSLEFSRRHPLFESSKASFDEYIEDEVRVLRATFAGK-SEQLADEDEWRIHMPSFQLLFFKVNPVVDVRFICRSSAKDYPIHIPPHISKFLELQLMR
        P  +S   S   PL ES +A FDEY+ED+ RV  A F  K      +E+EWRI M     LF  V PVVD+R  C+S+ +DYP  +P  I+K LEL +MR
Subjt:  PPLFSLEFSRRHPLFESSKASFDEYIEDEVRVLRATFAGK-SEQLADEDEWRIHMPSFQLLFFKVNPVVDVRFICRSSAKDYPIHIPPHISKFLELQLMR

Query:  WELNGLGGDFKSQSFKISVKGALYAERRESKSKLRYYLVLNLHSFAAPTPLAFIPQDVFLALAQKGLKGMMEEAMDDFSEKLLLDYTKFKEKLK
        W+L GL    +   F + VKGALY +RR   ++LR  L +N+ SF  P  L  +P+DV   LA   L G++E      +  LL DY++FK + K
Subjt:  WELNGLGGDFKSQSFKISVKGALYAERRESKSKLRYYLVLNLHSFAAPTPLAFIPQDVFLALAQKGLKGMMEEAMDDFSEKLLLDYTKFKEKLK


Sequences
CDS sequence
ATGGGTCACACTTTGATTGTTTCTCTCCAGTTTCCAAACTTTATTTTCTCTCCCCATCTCAGAACACGTACACACACATCCAGATTCCATCACGAAAACAAGCAA
ACTTTCCTCTGCTTTGCCCTCAATGGGGATCAAAATCCTCCACTCTTCTCTCTCGAATTCTCCCGTCGTCATCCACTCTTCGAGTCTTCCAAGGCTTCGTTTGAT
GAATACATTGAAGATGAAGTTAGAGTACTCAGAGCAACGTTTGCGGGAAAAAGTGAACAACTAGCTGATGAGGATGAATGGAGAATTCACATGCCATCTTTCCAA
TTGCTATTCTTCAAGGTCAACCCTGTTGTTGACGTAAGATTCATTTGCAGAAGCTCCGCCAAAGATTACCCTATTCATATTCCTCCCCATATCTCCAAATTTCTT
GAGCTTCAACTGATGAGATGGGAGCTGAATGGATTGGGTGGGGATTTTAAATCCCAAAGCTTCAAAATTAGTGTAAAAGGAGCTTTGTATGCTGAGAGAAGAGAA
TCAAAAAGTAAGCTCAGATATTATTTAGTGCTCAATCTTCACAGCTTTGCTGCCCCCACACCCCTTGCCTTCATTCCACAAGATGTTTTTCTTGCCCTTGCACAA
AAGGGGTTGAAGGGAATGATGGAGGAAGCCATGGATGACTTTTCAGAAAAATTACTTTTGGATTACACCAAATTCAAGGAGAAGCTAAAAGAATGA
mRNA sequence
ATGGGTCACACTTTGATTGTTTCTCTCCAGTTTCCAAACTTTATTTTCTCTCCCCATCTCAGAACACGTACACACACATCCAGATTCCATCACGAAAACAAGCAA
ACTTTCCTCTGCTTTGCCCTCAATGGGGATCAAAATCCTCCACTCTTCTCTCTCGAATTCTCCCGTCGTCATCCACTCTTCGAGTCTTCCAAGGCTTCGTTTGAT
GAATACATTGAAGATGAAGTTAGAGTACTCAGAGCAACGTTTGCGGGAAAAAGTGAACAACTAGCTGATGAGGATGAATGGAGAATTCACATGCCATCTTTCCAA
TTGCTATTCTTCAAGGTCAACCCTGTTGTTGACGTAAGATTCATTTGCAGAAGCTCCGCCAAAGATTACCCTATTCATATTCCTCCCCATATCTCCAAATTTCTT
GAGCTTCAACTGATGAGATGGGAGCTGAATGGATTGGGTGGGGATTTTAAATCCCAAAGCTTCAAAATTAGTGTAAAAGGAGCTTTGTATGCTGAGAGAAGAGAA
TCAAAAAGTAAGCTCAGATATTATTTAGTGCTCAATCTTCACAGCTTTGCTGCCCCCACACCCCTTGCCTTCATTCCACAAGATGTTTTTCTTGCCCTTGCACAA
AAGGGGTTGAAGGGAATGATGGAGGAAGCCATGGATGACTTTTCAGAAAAATTACTTTTGGATTACACCAAATTCAAGGAGAAGCTAAAAGAATGA
Protein sequence
MGHTLIVSLQFPNFIFSPHLRTRTHTSRFHHENKQTFLCFALNGDQNPPLFSLEFSRRHPLFESSKASFDEYIEDEVRVLRATFAGKSEQLADEDEWRIHMPSFQ
LLFFKVNPVVDVRFICRSSAKDYPIHIPPHISKFLELQLMRWELNGLGGDFKSQSFKISVKGALYAERRESKSKLRYYLVLNLHSFAAPTPLAFIPQDVFLALAQ
KGLKGMMEEAMDDFSEKLLLDYTKFKEKLKE
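
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); the function name `translate` and its `to_stop` parameter are hypothetical, and the page does not say which tool produced the protein sequence.

```python
# Standard genetic code (NCBI table 1), with codons ordered T, C, A, G
# at each of the three positions. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds, to_stop=True):
    """Translate a DNA coding sequence in frame 1.

    Whitespace and line breaks are ignored, so a sequence pasted over
    several lines can be passed directly. Codons containing characters
    other than A/C/G/T become 'X'. A trailing partial codon is dropped.
    If to_stop is True, translation ends before the first stop codon.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 18 nucleotides of the CDS listed above.
    print(translate("ATGGGTCACACTTTGATT"))  # prints MGHTLI
```

Passing the full CDS to `translate` and comparing the result with the listed protein sequence (with line breaks removed) gives a quick consistency check between the two records.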