CuGenDBv2

MC01g1371 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC01g1371
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Mucin-4 beta chain like
Genome location: MC01:18364068..18364796
RNA-Seq Expression: MC01g1371
Synteny: MC01g1371
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
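As a rough check on the genome location above, the span can be parsed and its length computed. The sketch below assumes the coordinates are 1-based and inclusive (the page does not state its convention); `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'CHR:START..END' string into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(location: str) -> int:
    """Length of the span in base pairs."""
    # Assumption: coordinates are 1-based and inclusive on both ends.
    _, start, end = parse_location(location)
    return end - start + 1


print(span_length("MC01:18364068..18364796"))  # 729
```

Under that assumption the span is 729 bp, which is three times the 243 residues of the protein sequence listed on this page.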


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6594629.1 hypothetical protein SDJN03_11182, partial [Cucurbita argyrosperma subsp. sororia] | e-value: 3.11e-86 | % identity: 67.35
Query:  TISCFSDPLFSTLLTFFALILLSFPRIFWPLLFSPVPLLTGLLLLSLLRLGATQR-FPRHIKDKN-QIESEIGSRAAAVVPPEEEDRGRDDAVVSKLGGG
        +IS  SDPLFSTLL+FFAL LL FPR+ W +LFSP+ LLTG LLLSLLRLGATQR F R I D N QIE E  +   A V  E        AV   + GG
Subjt:  TISCFSDPLFSTLLTFFALILLSFPRIFWPLLFSPVPLLTGLLLLSLLRLGATQR-FPRHIKDKN-QIESEIGSRAAAVVPPEEEDRGRDDAVVSKLGGG

Query:  GGICDIVSCACFEESFVQWNVRAPLEVIYEEYEGED-DKDVSNESDPFTETEPRPVPVRIERYPSLSLYYPETDSDSSSDDEFPIPGAWDSLD-SLCFMW
            D VSCACFEESFVQWNVRAPLEVIYEEYEGED DK+ SNESDP    EPR V + +ERYPSLSLYYPETDSDSSS+D FP  G WDSLD +LC MW
Subjt:  GGICDIVSCACFEESFVQWNVRAPLEVIYEEYEGED-DKDVSNESDPFTETEPRPVPVRIERYPSLSLYYPETDSDSSSDDEFPIPGAWDSLD-SLCFMW

Query:  DDGEDRDELIEIALDKTTNKMKASSELFQLEEENLIEI--DISPT
        D+ EDRDELIEIALDKT  K   +SE F LEE+NLIEI  DISPT
Subjt:  DDGEDRDELIEIALDKTTNKMKASSELFQLEEENLIEI--DISPT

KAG7026597.1 hypothetical protein SDJN02_10599, partial [Cucurbita argyrosperma subsp. argyrosperma] | e-value: 3.55e-86 | % identity: 67.35
Query:  TISCFSDPLFSTLLTFFALILLSFPRIFWPLLFSPVPLLTGLLLLSLLRLGATQR-FPRHIKDKN-QIESEIGSRAAAVVPPEEEDRGRDDAVVSKLGGG
        +IS  SDPLFSTLL+FFAL LL FPR+ W +LFSP+ LLTG LLLSLLRLGATQR F R I D N QIE E  +   A V  E        AV   + GG
Subjt:  TISCFSDPLFSTLLTFFALILLSFPRIFWPLLFSPVPLLTGLLLLSLLRLGATQR-FPRHIKDKN-QIESEIGSRAAAVVPPEEEDRGRDDAVVSKLGGG

Query:  GGICDIVSCACFEESFVQWNVRAPLEVIYEEYEGED-DKDVSNESDPFTETEPRPVPVRIERYPSLSLYYPETDSDSSSDDEFPIPGAWDSLD-SLCFMW
            D VSCACFEESFVQWNVRAPLEVIYEEYEGED DK+ SNESDP    EPR V + +ERYPSLSLYYPETDSDSSS+D FP  G WDSLD +LC MW
Subjt:  GGICDIVSCACFEESFVQWNVRAPLEVIYEEYEGED-DKDVSNESDPFTETEPRPVPVRIERYPSLSLYYPETDSDSSSDDEFPIPGAWDSLD-SLCFMW

Query:  DDGEDRDELIEIALDKTTNKMKASSELFQLEEENLIEI--DISPT
        D+ EDRDELIEIALDKT  K   +SE F LEE+NLIEI  DISPT
Subjt:  DDGEDRDELIEIALDKTTNKMKASSELFQLEEENLIEI--DISPT

XP_022142309.1 uncharacterized protein LOC111012458 [Momordica charantia] | e-value: 5.87e-170 | % identity: 100
Query:  SCKRTISCFSDPLFSTLLTFFALILLSFPRIFWPLLFSPVPLLTGLLLLSLLRLGATQRFPRHIKDKNQIESEIGSRAAAVVPPEEEDRGRDDAVVSKLG
        SCKRTISCFSDPLFSTLLTFFALILLSFPRIFWPLLFSPVPLLTGLLLLSLLRLGATQRFPRHIKDKNQIESEIGSRAAAVVPPEEEDRGRDDAVVSKLG
Subjt:  SCKRTISCFSDPLFSTLLTFFALILLSFPRIFWPLLFSPVPLLTGLLLLSLLRLGATQRFPRHIKDKNQIESEIGSRAAAVVPPEEEDRGRDDAVVSKLG

Query:  GGGGICDIVSCACFEESFVQWNVRAPLEVIYEEYEGEDDKDVSNESDPFTETEPRPVPVRIERYPSLSLYYPETDSDSSSDDEFPIPGAWDSLDSLCFMW
        GGGGICDIVSCACFEESFVQWNVRAPLEVIYEEYEGEDDKDVSNESDPFTETEPRPVPVRIERYPSLSLYYPETDSDSSSDDEFPIPGAWDSLDSLCFMW
Subjt:  GGGGICDIVSCACFEESFVQWNVRAPLEVIYEEYEGEDDKDVSNESDPFTETEPRPVPVRIERYPSLSLYYPETDSDSSSDDEFPIPGAWDSLDSLCFMW

Query:  DDGEDRDELIEIALDKTTNKMKASSELFQLEEENLIEIDISPT
        DDGEDRDELIEIALDKTTNKMKASSELFQLEEENLIEIDISPT
Subjt:  DDGEDRDELIEIALDKTTNKMKASSELFQLEEENLIEIDISPT

XP_022926379.1 uncharacterized protein LOC111433544 [Cucurbita moschata] | e-value: 4.93e-86 | % identity: 66.94
Query:  TISCFSDPLFSTLLTFFALILLSFPRIFWPLLFSPVPLLTGLLLLSLLRLGATQR-FPRHIKDKN-QIESEIGSRAAAVVPPEEEDRGRDDAVVSKLGGG
        +IS  SDPLFSTLL+FFAL LL FPR+ W +LFSP+ LLTG LLLSLLRLGATQR F R   D N QIE E  + + A V  E        AV   + GG
Subjt:  TISCFSDPLFSTLLTFFALILLSFPRIFWPLLFSPVPLLTGLLLLSLLRLGATQR-FPRHIKDKN-QIESEIGSRAAAVVPPEEEDRGRDDAVVSKLGGG

Query:  GGICDIVSCACFEESFVQWNVRAPLEVIYEEYEGED-DKDVSNESDPFTETEPRPVPVRIERYPSLSLYYPETDSDSSSDDEFPIPGAWDSLD-SLCFMW
            D VSCACFEESFVQWNVRAPLEVIYEEYEGED DK+ SNESDP    EPR V + +ERYPSLSLYYPETDSDSSS+D FP  G WDSLD +LC MW
Subjt:  GGICDIVSCACFEESFVQWNVRAPLEVIYEEYEGED-DKDVSNESDPFTETEPRPVPVRIERYPSLSLYYPETDSDSSSDDEFPIPGAWDSLD-SLCFMW

Query:  DDGEDRDELIEIALDKTTNKMKASSELFQLEEENLIEI--DISPT
        D+ EDRDELIEIALDKT  K   +SE F LEE+NLIEI  DISPT
Subjt:  DDGEDRDELIEIALDKTTNKMKASSELFQLEEENLIEI--DISPT

XP_023518306.1 uncharacterized protein LOC111781826 [Cucurbita pepo subsp. pepo] | e-value: 1.51e-85 | % identity: 66.94
Query:  TISCFSDPLFSTLLTFFALILLSFPRIFWPLLFSPVPLLTGLLLLSLLRLGATQR-FPRHIKDKN-QIESEIGSRAAAVVPPEEEDRGRDDAVVSKLGGG
        +IS  SDPLFSTLL+FFAL LL FPR+ W +LFSP+ LLTG LLLSLLRLGATQR F R   D N QIE E  +   A V  E        AV   + GG
Subjt:  TISCFSDPLFSTLLTFFALILLSFPRIFWPLLFSPVPLLTGLLLLSLLRLGATQR-FPRHIKDKN-QIESEIGSRAAAVVPPEEEDRGRDDAVVSKLGGG

Query:  GGICDIVSCACFEESFVQWNVRAPLEVIYEEYEGED-DKDVSNESDPFTETEPRPVPVRIERYPSLSLYYPETDSDSSSDDEFPIPGAWDSLD-SLCFMW
            D VSCACFEESFVQWNVRAPLEVIYEEYEGED DK+ SNESDP    EPR V + +ERYPSLSLYYPETDSDSSS+D FP  G WDSLD +LC MW
Subjt:  GGICDIVSCACFEESFVQWNVRAPLEVIYEEYEGED-DKDVSNESDPFTETEPRPVPVRIERYPSLSLYYPETDSDSSSDDEFPIPGAWDSLD-SLCFMW

Query:  DDGEDRDELIEIALDKTTNKMKASSELFQLEEENLIEI--DISPT
        D+ EDRDELIEIALDKT  K   +SE F LEE+NLIEI  DISPT
Subjt:  DDGEDRDELIEIALDKTTNKMKASSELFQLEEENLIEI--DISPT

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KLD5 Uncharacterized protein | e-value: 1.22e-54 | % identity: 50.59
Query:  SCKRTISCFSDPLFSTLLTFFALILLSFPRIFWPLLFSPVPLLTGLLLLSLLRLGATQRFPRHIKDKNQIESEIGSRAAAVVPPEEEDRGRDDAVVSKLG
        S   ++S FS PLFS LL+FF LILL FP   W  L SP  +L G L LSLLRLGATQR  +    ++Q  +    +      P E   G          
Subjt:  SCKRTISCFSDPLFSTLLTFFALILLSFPRIFWPLLFSPVPLLTGLLLLSLLRLGATQRFPRHIKDKNQIESEIGSRAAAVVPPEEEDRGRDDAVVSKLG

Query:  GGGGICDIVSCACFEESFVQWNVRAPLEVIYEEYEGEDDKDVSNESDPFTETEPRP---------VPVRI---ERYPSLSLYYPETDSDSSSDDEFPIPG
             C  +S +CFEESFVQWNVRAPLEVIYE+YE ED+++   E     E E            + +R+   ERYPSLSLYYPETDSDS S       G
Subjt:  GGGGICDIVSCACFEESFVQWNVRAPLEVIYEEYEGEDDKDVSNESDPFTETEPRP---------VPVRI---ERYPSLSLYYPETDSDSSSDDEFPIPG

Query:  AWDSLDSLCFMWDDGEDRDELIEIALDKTTNKMKASSELFQLEEENLIEIDISPT
        A D ++++CFMWDD EDRDELIEIALDK     + SSE FQLEEENLIEIDISPT
Subjt:  AWDSLDSLCFMWDDGEDRDELIEIALDKTTNKMKASSELFQLEEENLIEIDISPT

A0A6J1CME6 uncharacterized protein LOC111012458 | e-value: 2.84e-170 | % identity: 100
Query:  SCKRTISCFSDPLFSTLLTFFALILLSFPRIFWPLLFSPVPLLTGLLLLSLLRLGATQRFPRHIKDKNQIESEIGSRAAAVVPPEEEDRGRDDAVVSKLG
        SCKRTISCFSDPLFSTLLTFFALILLSFPRIFWPLLFSPVPLLTGLLLLSLLRLGATQRFPRHIKDKNQIESEIGSRAAAVVPPEEEDRGRDDAVVSKLG
Subjt:  SCKRTISCFSDPLFSTLLTFFALILLSFPRIFWPLLFSPVPLLTGLLLLSLLRLGATQRFPRHIKDKNQIESEIGSRAAAVVPPEEEDRGRDDAVVSKLG

Query:  GGGGICDIVSCACFEESFVQWNVRAPLEVIYEEYEGEDDKDVSNESDPFTETEPRPVPVRIERYPSLSLYYPETDSDSSSDDEFPIPGAWDSLDSLCFMW
        GGGGICDIVSCACFEESFVQWNVRAPLEVIYEEYEGEDDKDVSNESDPFTETEPRPVPVRIERYPSLSLYYPETDSDSSSDDEFPIPGAWDSLDSLCFMW
Subjt:  GGGGICDIVSCACFEESFVQWNVRAPLEVIYEEYEGEDDKDVSNESDPFTETEPRPVPVRIERYPSLSLYYPETDSDSSSDDEFPIPGAWDSLDSLCFMW

Query:  DDGEDRDELIEIALDKTTNKMKASSELFQLEEENLIEIDISPT
        DDGEDRDELIEIALDKTTNKMKASSELFQLEEENLIEIDISPT
Subjt:  DDGEDRDELIEIALDKTTNKMKASSELFQLEEENLIEIDISPT

A0A6J1EEY1 uncharacterized protein LOC111433544 | e-value: 2.38e-86 | % identity: 66.94
Query:  TISCFSDPLFSTLLTFFALILLSFPRIFWPLLFSPVPLLTGLLLLSLLRLGATQR-FPRHIKDKN-QIESEIGSRAAAVVPPEEEDRGRDDAVVSKLGGG
        +IS  SDPLFSTLL+FFAL LL FPR+ W +LFSP+ LLTG LLLSLLRLGATQR F R   D N QIE E  + + A V  E        AV   + GG
Subjt:  TISCFSDPLFSTLLTFFALILLSFPRIFWPLLFSPVPLLTGLLLLSLLRLGATQR-FPRHIKDKN-QIESEIGSRAAAVVPPEEEDRGRDDAVVSKLGGG

Query:  GGICDIVSCACFEESFVQWNVRAPLEVIYEEYEGED-DKDVSNESDPFTETEPRPVPVRIERYPSLSLYYPETDSDSSSDDEFPIPGAWDSLD-SLCFMW
            D VSCACFEESFVQWNVRAPLEVIYEEYEGED DK+ SNESDP    EPR V + +ERYPSLSLYYPETDSDSSS+D FP  G WDSLD +LC MW
Subjt:  GGICDIVSCACFEESFVQWNVRAPLEVIYEEYEGED-DKDVSNESDPFTETEPRPVPVRIERYPSLSLYYPETDSDSSSDDEFPIPGAWDSLD-SLCFMW

Query:  DDGEDRDELIEIALDKTTNKMKASSELFQLEEENLIEI--DISPT
        D+ EDRDELIEIALDKT  K   +SE F LEE+NLIEI  DISPT
Subjt:  DDGEDRDELIEIALDKTTNKMKASSELFQLEEENLIEI--DISPT

A0A6J1KY66 uncharacterized protein LOC111497433 | e-value: 1.81e-83 | % identity: 65.98
Query:  TISCFSDPLFSTLLTFFALILLSFPRIFWPLLFSPVPLLTGLLLLSLLRLGATQR-FPRHIKDKN-QIESEIGSRAAAVVPPEEEDRGRDDAVVSKLGGG
        +IS  SDPL STLL+FFAL LL FPR+ W +LFSP+ LLTG LLLSLLRLGATQR F R   D N QIE E  +   A V  E        AV   + GG
Subjt:  TISCFSDPLFSTLLTFFALILLSFPRIFWPLLFSPVPLLTGLLLLSLLRLGATQR-FPRHIKDKN-QIESEIGSRAAAVVPPEEEDRGRDDAVVSKLGGG

Query:  GGICDIVSCACFEESFVQWNVRAPLEVIYEEYEGED-DKDVSNESDPFTETEPRPVPVRIERYPSLSLYYPETDSDSSSDDEFPIPGAWDSLDS-LCFMW
            D VSC CFEESFVQWNVRAPLEVIYEEYEGED DK+ SNESDP    EPR V + +ERYPSLSLYYPETDSDSSS+D FP  G WDSLD  LC MW
Subjt:  GGICDIVSCACFEESFVQWNVRAPLEVIYEEYEGED-DKDVSNESDPFTETEPRPVPVRIERYPSLSLYYPETDSDSSSDDEFPIPGAWDSLDS-LCFMW

Query:  DDGEDRDELIEIALDKTTNKMKASSELFQLEEENLIEI--DISP
        D+ EDRDELIEIALDKT  K   +SE F LEE+NLIEI  DISP
Subjt:  DDGEDRDELIEIALDKTTNKMKASSELFQLEEENLIEI--DISP

A0A7J0E0A7 Uncharacterized protein | e-value: 1.23e-52 | % identity: 49.79
Query:  RTISCFSDPLFSTLLTFFALILLSFPRIFWPLLFSPVPLLTGLLLLSLLRLGATQRFPRHIKDKNQIESEIGSRAAAVVPPEEEDRGRDDAVVSKLGGGG
        +T    SDPLFS ++T + LI L FPR+F  ++FSPV + TG+LLL+LLRLGA QR          IE E  S       P  E +  D   VS     G
Subjt:  RTISCFSDPLFSTLLTFFALILLSFPRIFWPLLFSPVPLLTGLLLLSLLRLGATQRFPRHIKDKNQIESEIGSRAAAVVPPEEEDRGRDDAVVSKLGGGG

Query:  GICDI--VSCACFEESFVQWNVRAPLEVIYEEYEGEDDKDVSNESDPFTETEPRPVPVRIERYPSLSLYYPETDSDSSSDDEFPIPGAWDSLDSLCFMWD
           ++       + ESFV+WNVRAPLEVIYEEYEGE+ +      D F    P      IERYPSLSLYYPE+D +SSSD +FP+ G WDS +S+CF WD
Subjt:  GICDI--VSCACFEESFVQWNVRAPLEVIYEEYEGEDDKDVSNESDPFTETEPRPVPVRIERYPSLSLYYPETDSDSSSDDEFPIPGAWDSLDSLCFMWD

Query:  DGEDRDELIEIALDKTTNKMKASSELFQLEEENLIEIDISP
        + EDR+ LIEI LD      K   ELF  EEENLIEIDISP
Subjt:  DGEDRDELIEIALDKTTNKMKASSELFQLEEENLIEIDISP

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT4G05018.1 unknown protein | e-value: 2.6e-10 | % identity: 31.11
Query:  FSDPLFSTLLTFFALILLSFPRIFWPLLFSPVPLLTGLLLLSLLRLGATQRFPRHIKDKNQIESEIGSRAAAVVPPEEEDRGRDDAVVSKLGGGGGICDI
        FS PLFS  ++ + + LL FP+    +L S VPLL G  L+S L LG+T++                    ++  P       +D++  +L  G G+   
Subjt:  FSDPLFSTLLTFFALILLSFPRIFWPLLFSPVPLLTGLLLLSLLRLGATQRFPRHIKDKNQIESEIGSRAAAVVPPEEEDRGRDDAVVSKLGGGGGICDI

Query:  VSCACFEESFVQWNVR--APLEVIYEEYEGEDDKDVSNESDPFTETEPRPVPVRIERYPSLSLYYPETDSDSSSDDEFPI
                 F++WN +  APLEVI++E   E+++ ++   DP          V I+R+PSLS++ PE  SDS  D +FP+
Subjt:  VSCACFEESFVQWNVR--APLEVIYEEYEGEDDKDVSNESDPFTETEPRPVPVRIERYPSLSLYYPETDSDSSSDDEFPI

AT4G21500.1 unknown protein | e-value: 1.1e-27 | % identity: 41.39
Query:  FSDPLFSTLLTFFALILLSFPRIFWPLLFSPVPLLTGLLLLSLLRLGATQRFPRHIKDKNQIESEIGSRAAAVVPPEEEDRGRDDAVVSKLGGGGGICDI
        FS PLFS  +T ++LIL+ FP     +  SPV L++G LLLSLLRLG+T R P    DK+  ESE       V    E D          L GG      
Subjt:  FSDPLFSTLLTFFALILLSFPRIFWPLLFSPVPLLTGLLLLSLLRLGATQRFPRHIKDKNQIESEIGSRAAAVVPPEEEDRGRDDAVVSKLGGGGGICDI

Query:  VSCACFEESFVQWNVRAPLEVIYEEYEGEDDKDVSNESDPFTETEPRPVPVRIERYPSLSLYYPETDSD----SSSDDEFPIPGAWDSLDSLCFMWDDGE
                 FV+WN+RAPLEVI+E YE E++++   E DP   T  R    ++ER+PSLSL YPE+DS+    SSS+  FP  G W+S +++ F W++ E
Subjt:  VSCACFEESFVQWNVRAPLEVIYEEYEGEDDKDVSNESDPFTETEPRPVPVRIERYPSLSLYYPETDSD----SSSDDEFPIPGAWDSLDSLCFMWDDGE

Query:  DR---DELIEIALDKTTN--KMKASSEL-FQLEEENLIEIDISP
             + LIEI LD  ++  KM + +E+ F  EE+ LIEID+ P
Subjt:  DR---DELIEIALDKTTN--KMKASSEL-FQLEEENLIEIDISP
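
The middle line of each alignment block above can be turned into an identity count. The sketch below assumes the usual BLASTP midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and that percent identity is taken over the full alignment length including gaps. `midline_identity` is an illustrative helper, and the example uses only the last block of the KAG6594629.1 alignment, so it does not reproduce the whole-alignment figure.

```python
def midline_identity(midline: str) -> tuple[int, int]:
    """Return (identical positions, aligned positions) for one midline."""
    # Assumption: letters mark identities; '+' and spaces do not count.
    identities = sum(1 for ch in midline if ch.isalpha())
    return identities, len(midline)


# Last block of the KAG6594629.1 alignment (query segment and its midline).
query_segment = "DDGEDRDELIEIALDKTTNKMKASSELFQLEEENLIEI--DISPT"
midline = "D+ EDRDELIEIALDKT  K   +SE F LEE+NLIEI  DISPT"

identities, length = midline_identity(midline)
print(f"{identities}/{length} identical = {100 * identities / length:.2f}%")
```

Summing the identity counts and lengths over all blocks of a hit, then dividing, gives the figure for the whole alignment under these assumptions.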


Sequences
CDS sequence
TCTTGTAAGCGTACGATTTCCTGTTTCTCCGATCCTCTCTTTTCTACTCTCCTCACTTTCTTCGCTCTCATTCTTCTCTCATTCCCTCGCATTTTCTGGCCACTCCTCTT
CTCCCCGGTCCCTCTCCTCACCGGACTCCTCCTCCTCTCTCTGCTCCGCCTCGGCGCAACTCAGAGATTTCCACGCCACATTAAGGACAAGAATCAAATAGAATCCGAAA
TCGGAAGCAGAGCCGCCGCTGTAGTACCGCCGGAGGAAGAAGATCGCGGCCGGGACGACGCCGTAGTAAGTAAATTAGGAGGCGGAGGAGGAATTTGTGATATTGTTTCG
TGCGCGTGCTTCGAGGAATCGTTCGTGCAGTGGAATGTGAGGGCGCCGTTGGAGGTTATCTACGAGGAGTACGAAGGGGAAGACGATAAGGACGTTTCGAACGAGAGCGA
TCCGTTTACGGAAACGGAGCCCCGACCGGTACCGGTACGCATCGAGCGGTATCCATCGCTGTCGCTGTACTATCCCGAGACCGACTCCGATAGCTCTTCGGACGACGAGT
TTCCGATTCCCGGAGCGTGGGATTCACTTGACAGTCTGTGCTTCATGTGGGACGACGGTGAAGACAGGGATGAACTGATCGAGATTGCGTTGGATAAGACGACGAACAAG
ATGAAGGCATCATCGGAATTATTTCAGTTGGAGGAGGAGAATTTGATAGAGATCGATATTTCCCCAACG
mRNA sequence
TCTTGTAAGCGTACGATTTCCTGTTTCTCCGATCCTCTCTTTTCTACTCTCCTCACTTTCTTCGCTCTCATTCTTCTCTCATTCCCTCGCATTTTCTGGCCACTCCTCTT
CTCCCCGGTCCCTCTCCTCACCGGACTCCTCCTCCTCTCTCTGCTCCGCCTCGGCGCAACTCAGAGATTTCCACGCCACATTAAGGACAAGAATCAAATAGAATCCGAAA
TCGGAAGCAGAGCCGCCGCTGTAGTACCGCCGGAGGAAGAAGATCGCGGCCGGGACGACGCCGTAGTAAGTAAATTAGGAGGCGGAGGAGGAATTTGTGATATTGTTTCG
TGCGCGTGCTTCGAGGAATCGTTCGTGCAGTGGAATGTGAGGGCGCCGTTGGAGGTTATCTACGAGGAGTACGAAGGGGAAGACGATAAGGACGTTTCGAACGAGAGCGA
TCCGTTTACGGAAACGGAGCCCCGACCGGTACCGGTACGCATCGAGCGGTATCCATCGCTGTCGCTGTACTATCCCGAGACCGACTCCGATAGCTCTTCGGACGACGAGT
TTCCGATTCCCGGAGCGTGGGATTCACTTGACAGTCTGTGCTTCATGTGGGACGACGGTGAAGACAGGGATGAACTGATCGAGATTGCGTTGGATAAGACGACGAACAAG
ATGAAGGCATCATCGGAATTATTTCAGTTGGAGGAGGAGAATTTGATAGAGATCGATATTTCCCCAACG
Protein sequence
SCKRTISCFSDPLFSTLLTFFALILLSFPRIFWPLLFSPVPLLTGLLLLSLLRLGATQRFPRHIKDKNQIESEIGSRAAAVVPPEEEDRGRDDAVVSKLGGGGGICDIVS
CACFEESFVQWNVRAPLEVIYEEYEGEDDKDVSNESDPFTETEPRPVPVRIERYPSLSLYYPETDSDSSSDDEFPIPGAWDSLDSLCFMWDDGEDRDELIEIALDKTTNK
MKASSELFQLEEENLIEIDISPT
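
The protein sequence can be checked against the CDS by translating it codon by codon. This is a minimal sketch, assuming the standard genetic code and a reading frame that starts at the first base; `translate` is an illustrative helper. For brevity only the final 69 bases of the CDS are used (the preceding 660 bases are a multiple of three, so this fragment is in frame).

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# Last line of the CDS sequence on this page.
cds_tail = "ATGAAGGCATCATCGGAATTATTTCAGTTGGAGGAGGAGAATTTGATAGAGATCGATATTTCCCCAACG"
print(translate(cds_tail))  # MKASSELFQLEEENLIEIDISPT
```

The output matches the last line of the protein sequence; passing the full 729-base CDS to the same function gives the complete comparison.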