; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh02G005500 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh02G005500
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionGlycoside hydrolase, family 43
Genome locationCmo_Chr02:3115294..3122221
RNA-Seq ExpressionCmoCh02G005500
SyntenyCmoCh02G005500
Gene Ontology termsGO:0005975 - carbohydrate metabolic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004553 - hydrolase activity, hydrolyzing O-glycosyl compounds (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605231.1 hypothetical protein SDJN03_02548, partial [Cucurbita argyrosperma subsp. sororia]1.0e-10999.55Show/hide
Query:  MGDQRNGGYGGDSSSGEEDGDAQWRAAIDSVTTTSVFLSSLTNGLPATSATTASNSDDDSELNLGPQPPKQYQIKAQKMLENILETTLEVVEHSTAIPCD
        MGDQ NGGYGGDSSSGEEDGDAQWRAAIDSVTTTSVFLSSLTNGLPATSATTASNSDDDSELNLGPQPPKQYQIKAQKMLENILETTLEVVEHSTAIPCD
Subjt:  MGDQRNGGYGGDSSSGEEDGDAQWRAAIDSVTTTSVFLSSLTNGLPATSATTASNSDDDSELNLGPQPPKQYQIKAQKMLENILETTLEVVEHSTAIPCD

Query:  DSKTSEGGIRLFKNAPVGVVFDHIDELQRPTKKPKILPGKEINEKSKKFKQRIQSVAVEGKDIIAAGNRAREKSIARLEAKEAAAKAAAKREEERVAELK
        DSKTSEGGIRLFKNAPVGVVFDHIDELQRPTKKPKILPGKEINEKSKKFKQRIQSVAVEGKDIIAAGNRAREKSIARLEAKEAAAKAAAKREEERVAELK
Subjt:  DSKTSEGGIRLFKNAPVGVVFDHIDELQRPTKKPKILPGKEINEKSKKFKQRIQSVAVEGKDIIAAGNRAREKSIARLEAKEAAAKAAAKREEERVAELK

Query:  KVRGEKWLPSIAREMKLQAQR
        KVRGEKWLPSIAREMKLQAQR
Subjt:  KVRGEKWLPSIAREMKLQAQR

KAG7035197.1 hypothetical protein SDJN02_01992 [Cucurbita argyrosperma subsp. argyrosperma]2.5e-10898.64Show/hide
Query:  MGDQRNGGYGGDSSSGEEDGDAQWRAAIDSVTTTSVFLSSLTNGLPATSATTASNSDDDSELNLGPQPPKQYQIKAQKMLENILETTLEVVEHSTAIPCD
        MGD  NGGYGGDSSSGEEDGDAQWRAAIDSVTTTSVFLSSLTNGLPATSATTASNSD DSELNLGPQPPKQYQIKAQKMLENILETTLEVVEHSTAIPCD
Subjt:  MGDQRNGGYGGDSSSGEEDGDAQWRAAIDSVTTTSVFLSSLTNGLPATSATTASNSDDDSELNLGPQPPKQYQIKAQKMLENILETTLEVVEHSTAIPCD

Query:  DSKTSEGGIRLFKNAPVGVVFDHIDELQRPTKKPKILPGKEINEKSKKFKQRIQSVAVEGKDIIAAGNRAREKSIARLEAKEAAAKAAAKREEERVAELK
        DSKTSEGGIRLFKNAPVGVVFDHIDELQRPTKKPKILPGKEINEKSKKFKQRIQSVAVEGKDIIAAGNRAREKSIARLEAKEAAAKAAAKREEERVAELK
Subjt:  DSKTSEGGIRLFKNAPVGVVFDHIDELQRPTKKPKILPGKEINEKSKKFKQRIQSVAVEGKDIIAAGNRAREKSIARLEAKEAAAKAAAKREEERVAELK

Query:  KVRGEKWLPSIAREMKLQAQR
        KVRGEKWLPSIAREMKLQAQR
Subjt:  KVRGEKWLPSIAREMKLQAQR

XP_022946955.1 uncharacterized protein LOC111450982 [Cucurbita moschata]6.3e-120100Show/hide
Query:  MQSRISVTCPGLRNGMGDQRNGGYGGDSSSGEEDGDAQWRAAIDSVTTTSVFLSSLTNGLPATSATTASNSDDDSELNLGPQPPKQYQIKAQKMLENILE
        MQSRISVTCPGLRNGMGDQRNGGYGGDSSSGEEDGDAQWRAAIDSVTTTSVFLSSLTNGLPATSATTASNSDDDSELNLGPQPPKQYQIKAQKMLENILE
Subjt:  MQSRISVTCPGLRNGMGDQRNGGYGGDSSSGEEDGDAQWRAAIDSVTTTSVFLSSLTNGLPATSATTASNSDDDSELNLGPQPPKQYQIKAQKMLENILE

Query:  TTLEVVEHSTAIPCDDSKTSEGGIRLFKNAPVGVVFDHIDELQRPTKKPKILPGKEINEKSKKFKQRIQSVAVEGKDIIAAGNRAREKSIARLEAKEAAA
        TTLEVVEHSTAIPCDDSKTSEGGIRLFKNAPVGVVFDHIDELQRPTKKPKILPGKEINEKSKKFKQRIQSVAVEGKDIIAAGNRAREKSIARLEAKEAAA
Subjt:  TTLEVVEHSTAIPCDDSKTSEGGIRLFKNAPVGVVFDHIDELQRPTKKPKILPGKEINEKSKKFKQRIQSVAVEGKDIIAAGNRAREKSIARLEAKEAAA

Query:  KAAAKREEERVAELKKVRGEKWLPSIAREMKLQAQR
        KAAAKREEERVAELKKVRGEKWLPSIAREMKLQAQR
Subjt:  KAAAKREEERVAELKKVRGEKWLPSIAREMKLQAQR

XP_023007376.1 uncharacterized protein LOC111499889 [Cucurbita maxima]1.2e-11898.73Show/hide
Query:  MQSRISVTCPGLRNGMGDQRNGGYGGDSSSGEEDGDAQWRAAIDSVTTTSVFLSSLTNGLPATSATTASNSDDDSELNLGPQPPKQYQIKAQKMLENILE
        MQSRISVTCPGLRNGMGDQRNGGYGGDSSSGEEDGDAQWRAAIDSVTTTSVFLSSLTNGLPATS TTASNSDDDSELNLGPQPPKQYQIKAQKMLENILE
Subjt:  MQSRISVTCPGLRNGMGDQRNGGYGGDSSSGEEDGDAQWRAAIDSVTTTSVFLSSLTNGLPATSATTASNSDDDSELNLGPQPPKQYQIKAQKMLENILE

Query:  TTLEVVEHSTAIPCDDSKTSEGGIRLFKNAPVGVVFDHIDELQRPTKKPKILPGKEINEKSKKFKQRIQSVAVEGKDIIAAGNRAREKSIARLEAKEAAA
        TTLEVVEHSTAIPCDDSKTSEGGIRLFKNAPVGVVFDHIDELQRPTKKPKILPGKEINEKSKKFKQRIQSVAVEG+DIIAAGNR REKSIARLEAKEAAA
Subjt:  TTLEVVEHSTAIPCDDSKTSEGGIRLFKNAPVGVVFDHIDELQRPTKKPKILPGKEINEKSKKFKQRIQSVAVEGKDIIAAGNRAREKSIARLEAKEAAA

Query:  KAAAKREEERVAELKKVRGEKWLPSIAREMKLQAQR
        KAAAKREEERVAELKKVRGEKWLPSIAREMKLQAQR
Subjt:  KAAAKREEERVAELKKVRGEKWLPSIAREMKLQAQR

XP_023532576.1 uncharacterized protein LOC111794698 [Cucurbita pepo subsp. pepo]3.5e-11898.31Show/hide
Query:  MQSRISVTCPGLRNGMGDQRNGGYGGDSSSGEEDGDAQWRAAIDSVTTTSVFLSSLTNGLPATSATTASNSDDDSELNLGPQPPKQYQIKAQKMLENILE
        MQSRISVTCPGLRNGMGDQRNGGYGGDSSSGEEDGDAQWRAAIDSVTTTSVFLSSLTNGLPATS TTASNSDDDSELNLGPQPPKQYQIKAQKMLENILE
Subjt:  MQSRISVTCPGLRNGMGDQRNGGYGGDSSSGEEDGDAQWRAAIDSVTTTSVFLSSLTNGLPATSATTASNSDDDSELNLGPQPPKQYQIKAQKMLENILE

Query:  TTLEVVEHSTAIPCDDSKTSEGGIRLFKNAPVGVVFDHIDELQRPTKKPKILPGKEINEKSKKFKQRIQSVAVEGKDIIAAGNRAREKSIARLEAKEAAA
        TTLEVVEHSTAIPCDDSKTSEGGIRLFKNAPVGVVFDHIDELQRPTKKPKILPGKEINEKSKKFKQRIQS+A+EGKDIIAAGNRAREKSIARLEAKEAAA
Subjt:  TTLEVVEHSTAIPCDDSKTSEGGIRLFKNAPVGVVFDHIDELQRPTKKPKILPGKEINEKSKKFKQRIQSVAVEGKDIIAAGNRAREKSIARLEAKEAAA

Query:  KAAAKREEERVAELKKVRGEKWLPSIAREMKLQAQR
        KAAAK EEERVAELKKVRGEKWLPSIAREMKLQAQR
Subjt:  KAAAKREEERVAELKKVRGEKWLPSIAREMKLQAQR

TrEMBL top hitse value%identityAlignment
A0A0A0LMM8 Uncharacterized protein4.9e-8679.28Show/hide
Query:  MGDQRNGGYGGDSSSGEEDGDAQWRAAIDSVTTTSVFLSSLTNGLPATSATTASNSDDDSELNLGPQPPKQYQIKAQKMLENILETTLEVVEHSTAIPC-
        MGD+R+  +GGDSSSGEEDGDAQWR+AIDSV  +SVF+SSLTNG+PATS  T S+SD+  E NL  QPPKQYQIKAQK+L+NILETTLE+VEHS ++PC 
Subjt:  MGDQRNGGYGGDSSSGEEDGDAQWRAAIDSVTTTSVFLSSLTNGLPATSATTASNSDDDSELNLGPQPPKQYQIKAQKMLENILETTLEVVEHSTAIPC-

Query:  DDSKTSEGGIRLFKNAPVGVVFDHIDELQRPTKKPKILPGKEINEKSKKFKQRIQSVAVEGKDIIAAGNRAREKSIARLEAKEAAAKAAAKREEERVAEL
        DDSK+SEGGIRLFKNAPVGVVFDH+DEL RPTK+PKILPGKEINEKSKKFKQ+++SVAVEG+DII A  R  EKSIARLEAKEAA KAAAKREE+RVA+L
Subjt:  DDSKTSEGGIRLFKNAPVGVVFDHIDELQRPTKKPKILPGKEINEKSKKFKQRIQSVAVEGKDIIAAGNRAREKSIARLEAKEAAAKAAAKREEERVAEL

Query:  KKVRGEKWLPSIAREMKLQAQR
        KKVRGEKWLPSIAREMKLQ+Q+
Subjt:  KKVRGEKWLPSIAREMKLQAQR

A0A1S3C6L4 uncharacterized protein LOC1034974291.2e-8781.45Show/hide
Query:  MGDQRNGGYGGDSSSGEEDGDAQWRAAIDSVTTTSVFLSSLTNGLPATSATTASNSDDDSELNLGPQPPKQYQIKAQKMLENILETTLEVVEHSTAIPC-
        MGD+R+  +GGDSSSGEEDGDA+WRAAIDSVT +SVF+SSLTNG+PATS  T S  DDD ELNL  QPPK YQIKAQK+L+NILETTLE+VEHS ++PC 
Subjt:  MGDQRNGGYGGDSSSGEEDGDAQWRAAIDSVTTTSVFLSSLTNGLPATSATTASNSDDDSELNLGPQPPKQYQIKAQKMLENILETTLEVVEHSTAIPC-

Query:  DDSKTSEGGIRLFKNAPVGVVFDHIDELQRPTKKPKILPGKEINEKSKKFKQRIQSVAVEGKDIIAAGNRAREKSIARLEAKEAAAKAAAKREEERVAEL
        DDSK+SEGGIRLFKNAPVGVVFDH+DEL RPTKKPKILPGKEINEKSKKFKQ+++SVAVEG+DII A  R  EKSIARLEAKEAA KAAAKREEERVA+L
Subjt:  DDSKTSEGGIRLFKNAPVGVVFDHIDELQRPTKKPKILPGKEINEKSKKFKQRIQSVAVEGKDIIAAGNRAREKSIARLEAKEAAAKAAAKREEERVAEL

Query:  KKVRGEKWLPSIAREMKLQAQ
        KKVRGEKWLPSIAREMKLQ+Q
Subjt:  KKVRGEKWLPSIAREMKLQAQ

A0A6J1D727 uncharacterized protein LOC1110178724.7e-8980.91Show/hide
Query:  MGDQRNGGYGGDSSSGEEDGDAQWRAAIDSVTTTSVFLSSLTNGLPATSATTASNSDDDSELNLGPQPPKQYQIKAQKMLENILETTLEVVEHSTAIPCD
        M D+RNGGYGGDSSSGEEDGDAQWRAAIDSV T+SVF+SSLTNG+P TS T AS S+DDSELNL   PPKQYQIKA+K+LENILETTLEVVEH  ++P D
Subjt:  MGDQRNGGYGGDSSSGEEDGDAQWRAAIDSVTTTSVFLSSLTNGLPATSATTASNSDDDSELNLGPQPPKQYQIKAQKMLENILETTLEVVEHSTAIPCD

Query:  DSKTSEGGIRLFKNAPVGVVFDHIDELQRPTKKPKILPGKEINEKSKKFKQRIQSVAVEGKDIIAAGNRAREKSIARLEAKEAAAKAAAKREEERVAELK
        DSK   GGIRLFKNAP+GVVFDH+DEL+RPTK+PKI+PGKEINEKSKKFKQR+QSVAV+G+DIIA+  RA EKS+ RLEA+EAAAKAAAKREEERVAELK
Subjt:  DSKTSEGGIRLFKNAPVGVVFDHIDELQRPTKKPKILPGKEINEKSKKFKQRIQSVAVEGKDIIAAGNRAREKSIARLEAKEAAAKAAAKREEERVAELK

Query:  KVRGEKWLPSIAREMKLQAQ
        KVRGEKWLPSIAREMKL ++
Subjt:  KVRGEKWLPSIAREMKLQAQ

A0A6J1G5G9 uncharacterized protein LOC1114509823.0e-120100Show/hide
Query:  MQSRISVTCPGLRNGMGDQRNGGYGGDSSSGEEDGDAQWRAAIDSVTTTSVFLSSLTNGLPATSATTASNSDDDSELNLGPQPPKQYQIKAQKMLENILE
        MQSRISVTCPGLRNGMGDQRNGGYGGDSSSGEEDGDAQWRAAIDSVTTTSVFLSSLTNGLPATSATTASNSDDDSELNLGPQPPKQYQIKAQKMLENILE
Subjt:  MQSRISVTCPGLRNGMGDQRNGGYGGDSSSGEEDGDAQWRAAIDSVTTTSVFLSSLTNGLPATSATTASNSDDDSELNLGPQPPKQYQIKAQKMLENILE

Query:  TTLEVVEHSTAIPCDDSKTSEGGIRLFKNAPVGVVFDHIDELQRPTKKPKILPGKEINEKSKKFKQRIQSVAVEGKDIIAAGNRAREKSIARLEAKEAAA
        TTLEVVEHSTAIPCDDSKTSEGGIRLFKNAPVGVVFDHIDELQRPTKKPKILPGKEINEKSKKFKQRIQSVAVEGKDIIAAGNRAREKSIARLEAKEAAA
Subjt:  TTLEVVEHSTAIPCDDSKTSEGGIRLFKNAPVGVVFDHIDELQRPTKKPKILPGKEINEKSKKFKQRIQSVAVEGKDIIAAGNRAREKSIARLEAKEAAA

Query:  KAAAKREEERVAELKKVRGEKWLPSIAREMKLQAQR
        KAAAKREEERVAELKKVRGEKWLPSIAREMKLQAQR
Subjt:  KAAAKREEERVAELKKVRGEKWLPSIAREMKLQAQR

A0A6J1L0C9 uncharacterized protein LOC1114998895.7e-11998.73Show/hide
Query:  MQSRISVTCPGLRNGMGDQRNGGYGGDSSSGEEDGDAQWRAAIDSVTTTSVFLSSLTNGLPATSATTASNSDDDSELNLGPQPPKQYQIKAQKMLENILE
        MQSRISVTCPGLRNGMGDQRNGGYGGDSSSGEEDGDAQWRAAIDSVTTTSVFLSSLTNGLPATS TTASNSDDDSELNLGPQPPKQYQIKAQKMLENILE
Subjt:  MQSRISVTCPGLRNGMGDQRNGGYGGDSSSGEEDGDAQWRAAIDSVTTTSVFLSSLTNGLPATSATTASNSDDDSELNLGPQPPKQYQIKAQKMLENILE

Query:  TTLEVVEHSTAIPCDDSKTSEGGIRLFKNAPVGVVFDHIDELQRPTKKPKILPGKEINEKSKKFKQRIQSVAVEGKDIIAAGNRAREKSIARLEAKEAAA
        TTLEVVEHSTAIPCDDSKTSEGGIRLFKNAPVGVVFDHIDELQRPTKKPKILPGKEINEKSKKFKQRIQSVAVEG+DIIAAGNR REKSIARLEAKEAAA
Subjt:  TTLEVVEHSTAIPCDDSKTSEGGIRLFKNAPVGVVFDHIDELQRPTKKPKILPGKEINEKSKKFKQRIQSVAVEGKDIIAAGNRAREKSIARLEAKEAAA

Query:  KAAAKREEERVAELKKVRGEKWLPSIAREMKLQAQR
        KAAAKREEERVAELKKVRGEKWLPSIAREMKLQAQR
Subjt:  KAAAKREEERVAELKKVRGEKWLPSIAREMKLQAQR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G49890.1 unknown protein6.6e-4349.06Show/hide
Query:  GGDSSSGEEDGDAQWRAAIDSVTTTSVFLSSLTNGLPATSATTASNSDDDSELNLGPQPPKQYQIKAQKMLENILETTLEVVEHSTAIPCDDSKTSEGGI
        GGDSSS  ED D +WRAAI+S+ TT+V+ +S T   PA     A+ S +  +  L P+     QIK + +L  ++E TL+ VE    IP +D   ++ G+
Subjt:  GGDSSSGEEDGDAQWRAAIDSVTTTSVFLSSLTNGLPATSATTASNSDDDSELNLGPQPPKQYQIKAQKMLENILETTLEVVEHSTAIPCDDSKTSEGGI

Query:  RLFKNAPVGVVFDHIDELQRPTKKPKILPGKEINEKSKKFKQRIQSVAVEGKDIIAAGNRAREKSIARLEAKEAAAKAAAKREEERVAELKKVRGEKWLP
        RLFK    G+VFDH+DE++ P KKP + P K +   SK+FK+R++S+AV+G DI+ A   A +K+ ARL+AKE AAK  AK+EEER+AELKKVRGEKWLP
Subjt:  RLFKNAPVGVVFDHIDELQRPTKKPKILPGKEINEKSKKFKQRIQSVAVEGKDIIAAGNRAREKSIARLEAKEAAAKAAAKREEERVAELKKVRGEKWLP

Query:  SIAREMKLQAQR
        SI R MK + +R
Subjt:  SIAREMKLQAQR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAAGCCGGATATCTGTTACATGTCCAGGCTTGAGAAACGGAATGGGTGATCAGAGAAACGGCGGCTATGGTGGCGACAGCAGCAGCGGTGAGGAGGACGGCGACGC
CCAATGGAGAGCCGCAATCGACTCTGTCACTACCACGTCTGTGTTTCTCTCGTCGTTAACTAATGGTCTTCCTGCTACTTCTGCAACCACTGCTTCAAACTCGGACGATG
ACTCCGAGCTTAATCTCGGTCCTCAACCGCCCAAGCAATATCAAATCAAGGCACAGAAGATGTTGGAGAATATTTTGGAAACTACTCTGGAGGTGGTAGAACATTCCACA
GCTATTCCTTGTGATGATTCCAAAACCAGTGAAGGTGGAATTCGTTTGTTTAAAAATGCTCCCGTTGGAGTTGTGTTTGATCACATCGATGAGCTTCAACGCCCCACAAA
GAAACCAAAAATTCTTCCAGGGAAGGAAATCAACGAGAAATCAAAGAAGTTCAAACAGCGTATCCAATCTGTGGCCGTTGAAGGAAAAGATATAATCGCTGCTGGAAACC
GTGCGCGCGAGAAGTCAATTGCTAGGCTTGAAGCGAAAGAAGCCGCAGCCAAAGCAGCTGCTAAAAGAGAGGAAGAAAGGGTAGCAGAACTGAAAAAGGTAAGAGGAGAG
AAATGGTTGCCATCCATTGCTAGGGAAATGAAGTTACAAGCTCAACGTTGA
mRNA sequenceShow/hide mRNA sequence
TGGGCTTTTGAATGCAAAGCCGGATATCTGTTACATGTCCAGGCTTGAGAAACGGAATGGGTGATCAGAGAAACGGCGGCTATGGTGGCGACAGCAGCAGCGGTGAGGAG
GACGGCGACGCCCAATGGAGAGCCGCAATCGACTCTGTCACTACCACGTCTGTGTTTCTCTCGTCGTTAACTAATGGTCTTCCTGCTACTTCTGCAACCACTGCTTCAAA
CTCGGACGATGACTCCGAGCTTAATCTCGGTCCTCAACCGCCCAAGCAATATCAAATCAAGGCACAGAAGATGTTGGAGAATATTTTGGAAACTACTCTGGAGGTGGTAG
AACATTCCACAGCTATTCCTTGTGATGATTCCAAAACCAGTGAAGGTGGAATTCGTTTGTTTAAAAATGCTCCCGTTGGAGTTGTGTTTGATCACATCGATGAGCTTCAA
CGCCCCACAAAGAAACCAAAAATTCTTCCAGGGAAGGAAATCAACGAGAAATCAAAGAAGTTCAAACAGCGTATCCAATCTGTGGCCGTTGAAGGAAAAGATATAATCGC
TGCTGGAAACCGTGCGCGCGAGAAGTCAATTGCTAGGCTTGAAGCGAAAGAAGCCGCAGCCAAAGCAGCTGCTAAAAGAGAGGAAGAAAGGGTAGCAGAACTGAAAAAGG
TAAGAGGAGAGAAATGGTTGCCATCCATTGCTAGGGAAATGAAGTTACAAGCTCAACGTTGATGTATGGGGGTTTAGATGGCAATTACAACCCTATTCTCAGTGATTATT
ATGGAGATGATGTTTTATATGAATGCATCAGCTCATATTCAGTTCAGAGAATTAAGGTTTGTTTTTTTTGCCCATAAAATTACCTTAAAAATTTTGAACCTCTTCCTCTT
GCATATGATTTTCTAATCATTTTTCGTCTATTGCATGCTCCATTTTGGCTTAAAATCATTTTCCATCGGTACCAATGTTTTGCAATTGATCTGTAATGAAAGGTTTATTT
GATCTGAAAGGCTTACAAATTTTGTAAGGTTAAGTTACCACTTTCGAATTTTGTTTCCTTACTGATCTGCCATGAAACTAGAATTCTTTCATTGACTTTTTACTTTATTA
TTCCATTTGTTATATGCACTCATTGATCAATGAAAGTCCAAAACAATTGGGTAATGTATTCTATATAACATAATCATCTGTATTTCTTCATTTGTTTGTTAGTTTAGTTT
GTAATTAACGATATTTATATTATTTGAAGGACTTGAACTACTACATTCTTTAGGGTTATATGAGTTTTGTTTATATGTTCTGTTAGTTGCAC
Protein sequenceShow/hide protein sequence
MQSRISVTCPGLRNGMGDQRNGGYGGDSSSGEEDGDAQWRAAIDSVTTTSVFLSSLTNGLPATSATTASNSDDDSELNLGPQPPKQYQIKAQKMLENILETTLEVVEHST
AIPCDDSKTSEGGIRLFKNAPVGVVFDHIDELQRPTKKPKILPGKEINEKSKKFKQRIQSVAVEGKDIIAAGNRAREKSIARLEAKEAAAKAAAKREEERVAELKKVRGE
KWLPSIAREMKLQAQR