CuGenDBv2

MC08g0675 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC08g0675
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: HAD-superfamily hydrolase isoform 1
Genome location: MC08:5450300..5454906
RNA-Seq Expression: MC08g0675
Synteny: MC08g0675
Gene Ontology terms: NA
InterPro domains: NA
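
The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch of reading it programmatically, the helper below splits such a string into its parts and reports the span. The names `GenomeLocation` and `parse_location` are hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that this page does not confirm.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'chromosome:start..end' location (hypothetical helper type)."""
    chromosome: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a string such as 'MC08:5450300..5454906'."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chromosome, start, end = match.groups()
    return GenomeLocation(chromosome, int(start), int(end))


if __name__ == "__main__":
    loc = parse_location("MC08:5450300..5454906")
    print(loc.chromosome, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the gene region listed here spans 4607 bases.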


Homology
GenBank top hits (e value, % identity, alignment)
XP_004152696.1 uncharacterized protein LOC101216764 isoform X1 [Cucumis sativus] | e value: 2.72e-119 | % identity: 81.74
Query:  MTTIGFRLISVPSAAATAVSPLNPVSQNNDEFRESQLQWHRLPI--NRRGFTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLKEQKKKASKFLLAPIDA
        MTTIGFRLISV    + AVSP +  S NND+F+E Q   H+LPI  +RRG TMLSF+SA+PSLFLPAPA+AFDIGISGPKDWLKEQKKKASKFLLAPI+A
Subjt:  MTTIGFRLISVPSAAATAVSPLNPVSQNNDEFRESQLQWHRLPI--NRRGFTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLKEQKKKASKFLLAPIDA

Query:  SRGSLRAVYLLLTSDSDYSSKDLEEVQRLLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLVVKNASSLLGNKDPIKLEAESRLKDLVSSFTSLNSLAYE
        SR SL+AVYLLL++DSDYSSKD+E+VQRLLKSAARDCVLKDRNSFVQFQASTGVEVCTFQL+VKNA+SLLGN+DPIKLEAES LKDLVSSFTSLNSL YE
Subjt:  SRGSLRAVYLLLTSDSDYSSKDLEEVQRLLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLVVKNASSLLGNKDPIKLEAESRLKDLVSSFTSLNSLAYE

Query:  TDIQVDSNRQKVLDALNDTIASLDKFEKDV
        TDIQV+SNRQKVLDALNDT+ SLDKFEK +
Subjt:  TDIQVDSNRQKVLDALNDTIASLDKFEKDV

XP_008444717.1 PREDICTED: uncharacterized protein LOC103487971 [Cucumis melo] | e value: 5.48e-119 | % identity: 80.87
Query:  MTTIGFRLISVPSAAATAVSPLNPVSQNNDEFRESQLQWHRLPI--NRRGFTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLKEQKKKASKFLLAPIDA
        MTTIGFRL SVP+A    VSP +  S N+D+F+ESQ   H+LPI  +RRG TMLSF+SAMPSLFLPAPA+AFDIGISGPKDWLKEQKKK+SKFLLAPI+A
Subjt:  MTTIGFRLISVPSAAATAVSPLNPVSQNNDEFRESQLQWHRLPI--NRRGFTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLKEQKKKASKFLLAPIDA

Query:  SRGSLRAVYLLLTSDSDYSSKDLEEVQRLLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLVVKNASSLLGNKDPIKLEAESRLKDLVSSFTSLNSLAYE
        SR SL+ VYLLL++DSDYS+KD+E+VQRLLKSAARDCVLKDR+SFVQFQASTGVEVCTFQL+VKNA+SLLGNKDPIKLEAE+ LKDLVSSFTSLNSLAYE
Subjt:  SRGSLRAVYLLLTSDSDYSSKDLEEVQRLLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLVVKNASSLLGNKDPIKLEAESRLKDLVSSFTSLNSLAYE

Query:  TDIQVDSNRQKVLDALNDTIASLDKFEKDV
        TDIQV+SNRQKVLDALNDT+ SLDKFEK +
Subjt:  TDIQVDSNRQKVLDALNDTIASLDKFEKDV

XP_022144262.1 uncharacterized protein LOC111013988 [Momordica charantia] | e value: 1.42e-152 | % identity: 99.12
Query:  MTTIGFRLISVPSAAATAVSPLNPVSQNNDEFRESQLQWHRLPINRRGFTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLKEQKKKASKFLLAPIDASR
        MTTIGFRLISVPSAAATAVSPLNPVSQNNDEFRESQLQWHRLPINRRGFTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLKEQKKKASKFLLAPIDASR
Subjt:  MTTIGFRLISVPSAAATAVSPLNPVSQNNDEFRESQLQWHRLPINRRGFTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLKEQKKKASKFLLAPIDASR

Query:  GSLRAVYLLLTSDSDYSSKDLEEVQRLLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLVVKNASSLLGNKDPIKLEAESRLKDLVSSFTSLNSLAYETD
        GSLRAVYLLLTSDSDYSSKDLEEVQRLLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLVVKNASSLLGNKDPIKLEAESRLKDLVSSFTSLNSLAYETD
Subjt:  GSLRAVYLLLTSDSDYSSKDLEEVQRLLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLVVKNASSLLGNKDPIKLEAESRLKDLVSSFTSLNSLAYETD

Query:  IQVDSNRQKVLDALNDTIASLDKFEKDV
        IQVDSNRQKVLDALNDTIASLDKFEK +
Subjt:  IQVDSNRQKVLDALNDTIASLDKFEKDV

XP_023545902.1 uncharacterized protein LOC111805195 isoform X1 [Cucurbita pepo subsp. pepo] | e value: 6.70e-120 | % identity: 81.9
Query:  MTTIGFRLISVPSAAATAVSPLNPVSQ--NNDEFRESQLQWHRLPI--NRRGFTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLKEQKKKASKFLLAPI
        MTTIGFRLISVPSA    VSPL+ +S   NND+   SQ   H LPI  +RRG TMLSFLSAMPSLFLPAPATAFDIGISGPKDWL+EQK+KAS+FLLAPI
Subjt:  MTTIGFRLISVPSAAATAVSPLNPVSQ--NNDEFRESQLQWHRLPI--NRRGFTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLKEQKKKASKFLLAPI

Query:  DASRGSLRAVYLLLTSDSDYSSKDLEEVQRLLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLVVKNASSLLGNKDPIKLEAESRLKDLVSSFTSLNSLA
        +ASRGSL AVYL+L++DSDYS+KD+E+VQR+LKSAARDCVLKDRNSFVQFQASTGVEVCTFQL+VKNA+SLLGN+DPIKLEAESRLKDLVSSFTSLNSLA
Subjt:  DASRGSLRAVYLLLTSDSDYSSKDLEEVQRLLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLVVKNASSLLGNKDPIKLEAESRLKDLVSSFTSLNSLA

Query:  YETDIQVDSNRQKVLDALNDTIASLDKFEKDV
        YETDIQV+SNRQKVLDA+NDTI SLDKFEK +
Subjt:  YETDIQVDSNRQKVLDALNDTIASLDKFEKDV

XP_038884855.1 uncharacterized protein LOC120075490 isoform X1 [Benincasa hispida] | e value: 2.47e-118 | % identity: 80.6
Query:  MTTIGFRLISVPSAAATAVSPLNPVSQN--NDEFRESQLQWHRLPI--NRRGFTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLKEQKKKASKFLLAPI
        MTTIGFRL  V +A    VSP +P+SQN  ND F+ESQ   H+LPI  +RRG TMLSF+SA+PSLFLPAPATAF+IGISGPKDWLKEQKKKASKFLLAPI
Subjt:  MTTIGFRLISVPSAAATAVSPLNPVSQN--NDEFRESQLQWHRLPI--NRRGFTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLKEQKKKASKFLLAPI

Query:  DASRGSLRAVYLLLTSDSDYSSKDLEEVQRLLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLVVKNASSLLGNKDPIKLEAESRLKDLVSSFTSLNSLA
        +ASR SL+ V+LLL++DSDYS+KD+E+V RLLKSAARDCVLKDRNSFVQFQASTGVEVCTFQL+VKNASSLLGNKDPIKLEAES LKDLVSSFT LNSLA
Subjt:  DASRGSLRAVYLLLTSDSDYSSKDLEEVQRLLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLVVKNASSLLGNKDPIKLEAESRLKDLVSSFTSLNSLA

Query:  YETDIQVDSNRQKVLDALNDTIASLDKFEKDV
        YETDIQVDSNRQ+VLDALNDT++SLDKFEK +
Subjt:  YETDIQVDSNRQKVLDALNDTIASLDKFEKDV

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LPA8 Uncharacterized protein | e value: 1.32e-119 | % identity: 81.74
Query:  MTTIGFRLISVPSAAATAVSPLNPVSQNNDEFRESQLQWHRLPI--NRRGFTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLKEQKKKASKFLLAPIDA
        MTTIGFRLISV    + AVSP +  S NND+F+E Q   H+LPI  +RRG TMLSF+SA+PSLFLPAPA+AFDIGISGPKDWLKEQKKKASKFLLAPI+A
Subjt:  MTTIGFRLISVPSAAATAVSPLNPVSQNNDEFRESQLQWHRLPI--NRRGFTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLKEQKKKASKFLLAPIDA

Query:  SRGSLRAVYLLLTSDSDYSSKDLEEVQRLLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLVVKNASSLLGNKDPIKLEAESRLKDLVSSFTSLNSLAYE
        SR SL+AVYLLL++DSDYSSKD+E+VQRLLKSAARDCVLKDRNSFVQFQASTGVEVCTFQL+VKNA+SLLGN+DPIKLEAES LKDLVSSFTSLNSL YE
Subjt:  SRGSLRAVYLLLTSDSDYSSKDLEEVQRLLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLVVKNASSLLGNKDPIKLEAESRLKDLVSSFTSLNSLAYE

Query:  TDIQVDSNRQKVLDALNDTIASLDKFEKDV
        TDIQV+SNRQKVLDALNDT+ SLDKFEK +
Subjt:  TDIQVDSNRQKVLDALNDTIASLDKFEKDV

A0A1S3BAY8 uncharacterized protein LOC103487971 | e value: 2.65e-119 | % identity: 80.87
Query:  MTTIGFRLISVPSAAATAVSPLNPVSQNNDEFRESQLQWHRLPI--NRRGFTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLKEQKKKASKFLLAPIDA
        MTTIGFRL SVP+A    VSP +  S N+D+F+ESQ   H+LPI  +RRG TMLSF+SAMPSLFLPAPA+AFDIGISGPKDWLKEQKKK+SKFLLAPI+A
Subjt:  MTTIGFRLISVPSAAATAVSPLNPVSQNNDEFRESQLQWHRLPI--NRRGFTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLKEQKKKASKFLLAPIDA

Query:  SRGSLRAVYLLLTSDSDYSSKDLEEVQRLLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLVVKNASSLLGNKDPIKLEAESRLKDLVSSFTSLNSLAYE
        SR SL+ VYLLL++DSDYS+KD+E+VQRLLKSAARDCVLKDR+SFVQFQASTGVEVCTFQL+VKNA+SLLGNKDPIKLEAE+ LKDLVSSFTSLNSLAYE
Subjt:  SRGSLRAVYLLLTSDSDYSSKDLEEVQRLLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLVVKNASSLLGNKDPIKLEAESRLKDLVSSFTSLNSLAYE

Query:  TDIQVDSNRQKVLDALNDTIASLDKFEKDV
        TDIQV+SNRQKVLDALNDT+ SLDKFEK +
Subjt:  TDIQVDSNRQKVLDALNDTIASLDKFEKDV

A0A5A7VAP0 Chloroplast thylakoid membrane | e value: 2.65e-119 | % identity: 80.87
Query:  MTTIGFRLISVPSAAATAVSPLNPVSQNNDEFRESQLQWHRLPI--NRRGFTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLKEQKKKASKFLLAPIDA
        MTTIGFRL SVP+A    VSP +  S N+D+F+ESQ   H+LPI  +RRG TMLSF+SAMPSLFLPAPA+AFDIGISGPKDWLKEQKKK+SKFLLAPI+A
Subjt:  MTTIGFRLISVPSAAATAVSPLNPVSQNNDEFRESQLQWHRLPI--NRRGFTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLKEQKKKASKFLLAPIDA

Query:  SRGSLRAVYLLLTSDSDYSSKDLEEVQRLLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLVVKNASSLLGNKDPIKLEAESRLKDLVSSFTSLNSLAYE
        SR SL+ VYLLL++DSDYS+KD+E+VQRLLKSAARDCVLKDR+SFVQFQASTGVEVCTFQL+VKNA+SLLGNKDPIKLEAE+ LKDLVSSFTSLNSLAYE
Subjt:  SRGSLRAVYLLLTSDSDYSSKDLEEVQRLLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLVVKNASSLLGNKDPIKLEAESRLKDLVSSFTSLNSLAYE

Query:  TDIQVDSNRQKVLDALNDTIASLDKFEKDV
        TDIQV+SNRQKVLDALNDT+ SLDKFEK +
Subjt:  TDIQVDSNRQKVLDALNDTIASLDKFEKDV

A0A6J1CT65 uncharacterized protein LOC111013988 | e value: 6.87e-153 | % identity: 99.12
Query:  MTTIGFRLISVPSAAATAVSPLNPVSQNNDEFRESQLQWHRLPINRRGFTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLKEQKKKASKFLLAPIDASR
        MTTIGFRLISVPSAAATAVSPLNPVSQNNDEFRESQLQWHRLPINRRGFTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLKEQKKKASKFLLAPIDASR
Subjt:  MTTIGFRLISVPSAAATAVSPLNPVSQNNDEFRESQLQWHRLPINRRGFTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLKEQKKKASKFLLAPIDASR

Query:  GSLRAVYLLLTSDSDYSSKDLEEVQRLLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLVVKNASSLLGNKDPIKLEAESRLKDLVSSFTSLNSLAYETD
        GSLRAVYLLLTSDSDYSSKDLEEVQRLLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLVVKNASSLLGNKDPIKLEAESRLKDLVSSFTSLNSLAYETD
Subjt:  GSLRAVYLLLTSDSDYSSKDLEEVQRLLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLVVKNASSLLGNKDPIKLEAESRLKDLVSSFTSLNSLAYETD

Query:  IQVDSNRQKVLDALNDTIASLDKFEKDV
        IQVDSNRQKVLDALNDTIASLDKFEK +
Subjt:  IQVDSNRQKVLDALNDTIASLDKFEKDV

A0A6J1HBI9 uncharacterized protein LOC111462505 isoform X1 | e value: 2.51e-117 | % identity: 80.43
Query:  MTTIGFRLISVPSAAATAVSPLNPVSQNNDEFRESQLQWHRLPI--NRRGFTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLKEQKKKASKFLLAPIDA
        MTTIGFRLISVPSA    VSPL+ +S    +  +SQ   H LP+  +RRG TMLSFLSAMPSLFLPAPATAFDIGISGPKDWL+EQKKKAS+FLLAPI+A
Subjt:  MTTIGFRLISVPSAAATAVSPLNPVSQNNDEFRESQLQWHRLPI--NRRGFTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLKEQKKKASKFLLAPIDA

Query:  SRGSLRAVYLLLTSDSDYSSKDLEEVQRLLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLVVKNASSLLGNKDPIKLEAESRLKDLVSSFTSLNSLAYE
        SR SL AVYL+L++ SDYS+KD+E+VQR+LKSAARDCVLKDRNSFVQFQASTGVEVCTFQL+VKNA+SLLGN+DPIKLEAESRLKDLVSSFTSLNSLAYE
Subjt:  SRGSLRAVYLLLTSDSDYSSKDLEEVQRLLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLVVKNASSLLGNKDPIKLEAESRLKDLVSSFTSLNSLAYE

Query:  TDIQVDSNRQKVLDALNDTIASLDKFEKDV
        TDIQV+SNRQKVLDA+NDTI SLDKFEK +
Subjt:  TDIQVDSNRQKVLDALNDTIASLDKFEKDV

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT5G42765.1 INVOLVED IN: biological_process unknown; LOCATED IN: thylakoid, chloroplast thylakoid membrane, chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Twin-arginine translocation pathway, signal sequence (InterPro:IPR006311); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). | e value: 1.8e-53 | % identity: 59.79
Query:  INRRGFTMLSFLSAMPSLFLPA----PATAFDIGISGPKDWLKEQKKKASKFLLAPIDASRGSLRAVYLLLTSDSDYSSKDLEEVQRLLKSAARDCVLKD
        I+RR  T+L  LS+  SL L      PA AF +GISGPK+WLK+QKKK+S+FLLAPIDA+R +L++ YL LTS+SDY+ KDLE +Q L KS+ARDCV K+
Subjt:  INRRGFTMLSFLSAMPSLFLPA----PATAFDIGISGPKDWLKEQKKKASKFLLAPIDASRGSLRAVYLLLTSDSDYSSKDLEEVQRLLKSAARDCVLKD

Query:  RNSFVQFQASTGVEVCTFQLVVKNASSLLGNKDPIKLEAESRLKDLVSSFTSLNSLAYETDIQVDSNRQKVLDALNDTIASLDKFEKDV
        R+S V FQ+ +GVEVCTF+LVVKNA+SLL +KDP+KLEAE+ L DLV SF SL  L    D+ + S+R+KV D + +TI+ LDKFEK V
Subjt:  RNSFVQFQASTGVEVCTFQLVVKNASSLLGNKDPIKLEAESRLKDLVSSFTSLNSLAYETDIQVDSNRQKVLDALNDTIASLDKFEKDV
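
In the alignment blocks above, the unlabelled middle line marks each column: a letter where the two sequences are identical, `+` for a conservative substitution, and a space for a mismatch or gap. The sketch below shows one way to turn a query line and its middle line into a percent identity. The function name `percent_identity` is hypothetical, and using the number of alignment columns as the denominator is an assumption; the values listed on this page cover the whole alignment and may be computed differently, so a single block need not reproduce them.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline repeats the query residue.

    Both strings must already be cut to the aligned region, i.e. with the
    'Query:' label and its padding removed, so that columns line up.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    if not query:
        raise ValueError("empty alignment")
    # A column is an identity when the midline shows the same letter as the
    # query; '+' (similar), ' ' (mismatch) and gap columns do not count.
    identities = sum(
        1 for q, m in zip(query, midline) if q == m and q not in "- "
    )
    return 100.0 * identities / len(query)


if __name__ == "__main__":
    # Toy input, not taken from the hits above.
    print(percent_identity("MTTIG", "MT+ G"))
```

When working from the scraped text, the middle line has to be sliced at the same offset as the sequence on the `Query:` line before it is passed in, because its leading spaces are padding rather than mismatches.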


Sequences
CDS sequence
ATGACAACGATCGGATTCCGCCTTATCTCTGTTCCATCCGCCGCCGCAACCGCCGTCTCACCGCTCAATCCGGTTTCTCAGAACAATGATGAATTCCGAGAAAGCCAACT
GCAGTGGCATCGACTTCCGATTAATCGTAGAGGCTTCACTATGCTCTCGTTTCTCTCCGCGATGCCGTCTCTGTTTCTACCTGCTCCCGCCACTGCTTTCGATATCGGCA
TTTCAGGACCGAAGGATTGGTTAAAGGAGCAAAAGAAGAAGGCCTCCAAGTTTCTCTTGGCACCAATTGATGCCTCCAGAGGCAGCCTTCGCGCTGTTTACCTTTTGCTT
ACCAGTGATTCCGATTACTCAAGCAAGGATCTGGAGGAAGTTCAAAGACTGCTTAAATCTGCTGCAAGGGATTGCGTTCTGAAGGATAGGAACTCGTTTGTCCAATTTCA
AGCGAGCACAGGAGTCGAGGTCTGTACATTTCAACTGGTTGTGAAAAATGCATCCTCCTTGCTTGGAAATAAAGATCCTATAAAACTAGAAGCTGAAAGTCGGCTAAAAG
ATCTAGTGAGCTCTTTCACTTCACTCAATAGTCTCGCATATGAAACTGATATTCAAGTCGATTCCAACAGACAAAAAGTACTGGATGCACTAAATGATACCATAGCTTCC
CTTGACAAATTTGAGAAGGATGTGGTGAACCAGAAGCCTGTACCAAATACAAGCGGCAGCAAGGTTTTCCATCATCAGGGTGAGTTTCTGAGGCAGGCCTCTCCTGCAAG
TAACCCTGCTAGTGGATCATGGATCACAGACTCCAAAGCGCTCTTCCTGCTGCCATTTGACATGCCGAGCGTTGGAGGTAATCCCTTCGGCCTCAATTCTTGCATGGGCT
TCACGTACGCTATTGGCCAACAACATATCTGGAAGCTT
mRNA sequence
AAAAAATTGGCATACCTCACTGTAAAAATTGATTTTTTTCCCCATTATTTTCCATGAAAAAATTCCGGGAACATAACTCAGAAGCATCTCCGTTCACTAAACATGACAAC
GATCGGATTCCGCCTTATCTCTGTTCCATCCGCCGCCGCAACCGCCGTCTCACCGCTCAATCCGGTTTCTCAGAACAATGATGAATTCCGAGAAAGCCAACTGCAGTGGC
ATCGACTTCCGATTAATCGTAGAGGCTTCACTATGCTCTCGTTTCTCTCCGCGATGCCGTCTCTGTTTCTACCTGCTCCCGCCACTGCTTTCGATATCGGCATTTCAGGA
CCGAAGGATTGGTTAAAGGAGCAAAAGAAGAAGGCCTCCAAGTTTCTCTTGGCACCAATTGATGCCTCCAGAGGCAGCCTTCGCGCTGTTTACCTTTTGCTTACCAGTGA
TTCCGATTACTCAAGCAAGGATCTGGAGGAAGTTCAAAGACTGCTTAAATCTGCTGCAAGGGATTGCGTTCTGAAGGATAGGAACTCGTTTGTCCAATTTCAAGCGAGCA
CAGGAGTCGAGGTCTGTACATTTCAACTGGTTGTGAAAAATGCATCCTCCTTGCTTGGAAATAAAGATCCTATAAAACTAGAAGCTGAAAGTCGGCTAAAAGATCTAGTG
AGCTCTTTCACTTCACTCAATAGTCTCGCATATGAAACTGATATTCAAGTCGATTCCAACAGACAAAAAGTACTGGATGCACTAAATGATACCATAGCTTCCCTTGACAA
ATTTGAGAAGGATGTGGTGAACCAGAAGCCTGTACCAAATACAAGCGGCAGCAAGGTTTTCCATCATCAGGGTGAGTTTCTGAGGCAGGCCTCTCCTGCAAGTAACCCTG
CTAGTGGATCATGGATCACAGACTCCAAAGCGCTCTTCCTGCTGCCATTTGACATGCCGAGCGTTGGAGGTAATCCCTTCGGCCTCAATTCTTGCATGGGCTTCACGTAC
GCTATTGGCCAACAACATATCTGGAAGCTT
Protein sequence
MTTIGFRLISVPSAAATAVSPLNPVSQNNDEFRESQLQWHRLPINRRGFTMLSFLSAMPSLFLPAPATAFDIGISGPKDWLKEQKKKASKFLLAPIDASRGSLRAVYLLL
TSDSDYSSKDLEEVQRLLKSAARDCVLKDRNSFVQFQASTGVEVCTFQLVVKNASSLLGNKDPIKLEAESRLKDLVSSFTSLNSLAYETDIQVDSNRQKVLDALNDTIAS
LDKFEKDVVNQKPVPNTSGSKVFHHQGEFLRQASPASNPASGSWITDSKALFLLPFDMPSVGGNPFGLNSCMGFTYAIGQQHIWKL
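
The CDS and protein sequences listed above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch using only the Python standard library; the names `CODON_TABLE` and `translate` are hypothetical, and the use of the standard nuclear codon table is an assumption about how this record was annotated.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence; whitespace and line breaks are ignored.

    Stop codons are written as '*'; codons with unexpected letters as 'X'.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )


if __name__ == "__main__":
    # First six codons of the CDS listed above.
    print(translate("ATGACAACGATCGGATTC"))
```

The first six codons of the listed CDS (ATG ACA ACG ATC GGA TTC) translate to MTTIGF, which matches the start of the listed protein. To check the full record, join the wrapped CDS lines into one string, pass it to `translate`, and compare the result with the joined protein lines.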