; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC06g0942 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC06g0942
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionPeptidase_S15 domain-containing protein
Genome locationMC06:8058891..8062943
RNA-Seq ExpressionMC06g0942
SyntenyMC06g0942
Gene Ontology termsGO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR000383 - Xaa-Pro dipeptidyl-peptidase-like domain
IPR029058 - Alpha/Beta hydrolase fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7022069.1 hypothetical protein SDJN02_15798 [Cucurbita argyrosperma subsp. argyrosperma]8.16e-14491.93Show/hide
Query:  MSNCAVESCKVETSDGVKLHARVFKPKDEEARENLVVVLVHPYSVLGGCQGLLRGIAAGLAERGFRAVTFDMRGAGKSSGKASLTGFAEIKDVTAVCKWV
        M+NC V+SCKVETSDGVKLH R+FKP DEE RENLVVVLVHPYSVLGGCQGLLRGIA GLAERGFRAVTFDMRGAGKSSG+ SLTGFAEIKDVTAVC WV
Subjt:  MSNCAVESCKVETSDGVKLHARVFKPKDEEARENLVVVLVHPYSVLGGCQGLLRGIAAGLAERGFRAVTFDMRGAGKSSGKASLTGFAEIKDVTAVCKWV

Query:  CENLSVDRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLKNKLSSAVGRVETHLIEGVSH
        CENLSV+RILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQL+NKL SA GR ETHLIEGVSH
Subjt:  CENLSVDRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLKNKLSSAVGRVETHLIEGVSH

Query:  FEMEGPSYDAQMVNLILHFISSL
        FEMEGP+YDAQMVNLIL FISSL
Subjt:  FEMEGPSYDAQMVNLILHFISSL

XP_022154093.1 uncharacterized protein LOC111021430 [Momordica charantia]5.19e-156100Show/hide
Query:  MSNCAVESCKVETSDGVKLHARVFKPKDEEARENLVVVLVHPYSVLGGCQGLLRGIAAGLAERGFRAVTFDMRGAGKSSGKASLTGFAEIKDVTAVCKWV
        MSNCAVESCKVETSDGVKLHARVFKPKDEEARENLVVVLVHPYSVLGGCQGLLRGIAAGLAERGFRAVTFDMRGAGKSSGKASLTGFAEIKDVTAVCKWV
Subjt:  MSNCAVESCKVETSDGVKLHARVFKPKDEEARENLVVVLVHPYSVLGGCQGLLRGIAAGLAERGFRAVTFDMRGAGKSSGKASLTGFAEIKDVTAVCKWV

Query:  CENLSVDRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLKNKLSSAVGRVETHLIEGVSH
        CENLSVDRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLKNKLSSAVGRVETHLIEGVSH
Subjt:  CENLSVDRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLKNKLSSAVGRVETHLIEGVSH

Query:  FEMEGPSYDAQMVNLILHFISSL
        FEMEGPSYDAQMVNLILHFISSL
Subjt:  FEMEGPSYDAQMVNLILHFISSL

XP_022933651.1 uncharacterized protein LOC111441007 [Cucurbita moschata]5.74e-14492.38Show/hide
Query:  MSNCAVESCKVETSDGVKLHARVFKPKDEEARENLVVVLVHPYSVLGGCQGLLRGIAAGLAERGFRAVTFDMRGAGKSSGKASLTGFAEIKDVTAVCKWV
        M+NC V+SCKVETSDGVKLH RVFKP DEE RENLVVVLVHPYSVLGGCQGLLRGIA GLAERGFRAVTFDMRGAGKSSG+ SLTGFAEIKDVTAVC WV
Subjt:  MSNCAVESCKVETSDGVKLHARVFKPKDEEARENLVVVLVHPYSVLGGCQGLLRGIAAGLAERGFRAVTFDMRGAGKSSGKASLTGFAEIKDVTAVCKWV

Query:  CENLSVDRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLKNKLSSAVGRVETHLIEGVSH
        CENLSV+RILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQL+NKL SA GR ETHLIEGVSH
Subjt:  CENLSVDRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLKNKLSSAVGRVETHLIEGVSH

Query:  FEMEGPSYDAQMVNLILHFISSL
        FEMEGP+YDAQMVNLIL FISSL
Subjt:  FEMEGPSYDAQMVNLILHFISSL

XP_023530620.1 uncharacterized protein LOC111793117 [Cucurbita pepo subsp. pepo]4.04e-14492.38Show/hide
Query:  MSNCAVESCKVETSDGVKLHARVFKPKDEEARENLVVVLVHPYSVLGGCQGLLRGIAAGLAERGFRAVTFDMRGAGKSSGKASLTGFAEIKDVTAVCKWV
        M+NC VESCKVETSDGVKLH RVFKP DEE RENLVVVLVHPYSVLGGCQGLLRGIA GLA+RGFRAVTFDMRGAGKSSG+ SLTGFAEIKDVTAVC WV
Subjt:  MSNCAVESCKVETSDGVKLHARVFKPKDEEARENLVVVLVHPYSVLGGCQGLLRGIAAGLAERGFRAVTFDMRGAGKSSGKASLTGFAEIKDVTAVCKWV

Query:  CENLSVDRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLKNKLSSAVGRVETHLIEGVSH
        C NLSV+RILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQL+NKL SA GRVETHLIEGVSH
Subjt:  CENLSVDRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLKNKLSSAVGRVETHLIEGVSH

Query:  FEMEGPSYDAQMVNLILHFISSL
        FEMEGP+YDAQMVNLIL FISSL
Subjt:  FEMEGPSYDAQMVNLILHFISSL

XP_038878932.1 uncharacterized protein LOC120071019 [Benincasa hispida]2.44e-14592.83Show/hide
Query:  MSNCAVESCKVETSDGVKLHARVFKPKDEEARENLVVVLVHPYSVLGGCQGLLRGIAAGLAERGFRAVTFDMRGAGKSSGKASLTGFAEIKDVTAVCKWV
        MSN  VESCKVETSDGVKLH RVFKPKDEEARENL VVLVHPYS+LGGCQGLLRGIAAGLAERG+RAVTFDMRGAGKSSG+ASLTGFAEIKDV AVCKWV
Subjt:  MSNCAVESCKVETSDGVKLHARVFKPKDEEARENLVVVLVHPYSVLGGCQGLLRGIAAGLAERGFRAVTFDMRGAGKSSGKASLTGFAEIKDVTAVCKWV

Query:  CENLSVDRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLKNKLSSAVGRVETHLIEGVSH
        CE LSV RILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQL+NKL SA GR+ETHLIEGVSH
Subjt:  CENLSVDRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLKNKLSSAVGRVETHLIEGVSH

Query:  FEMEGPSYDAQMVNLILHFISSL
        FEMEGP+YDAQMVNLI HFISSL
Subjt:  FEMEGPSYDAQMVNLILHFISSL

TrEMBL top hitse value%identityAlignment
A0A0A0M2M4 Peptidase_S15 domain-containing protein1.22e-14391.56Show/hide
Query:  MSNCAVESCKVETSDGVKLHARVFKPKDEEARE--NLVVVLVHPYSVLGGCQGLLRGIAAGLAERGFRAVTFDMRGAGKSSGKASLTGFAEIKDVTAVCK
        MSNC VESCKVETSDGVKLH RVFKPKDEEA+E  NL VVLVHPYS+LGGCQGLLRGIAAGLAERG++AVTFDMRGAGKSSG+ASLTGFAEIKDV AVCK
Subjt:  MSNCAVESCKVETSDGVKLHARVFKPKDEEARE--NLVVVLVHPYSVLGGCQGLLRGIAAGLAERGFRAVTFDMRGAGKSSGKASLTGFAEIKDVTAVCK

Query:  WVCENLSVDRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLKNKLSSAVGRVETHLIEGV
        WVCENLSV RILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGL ASILFGRHHKAIL SPKPKLFVMGTRDGFTSVKQL+NKL SA GRVE+HLIEGV
Subjt:  WVCENLSVDRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLKNKLSSAVGRVETHLIEGV

Query:  SHFEMEGPSYDAQMVNLILHFISSL
        SHFEMEGP+YDAQMVNLILHFISSL
Subjt:  SHFEMEGPSYDAQMVNLILHFISSL

A0A5A7TJY6 Abhydrolase_5 domain-containing protein9.99e-14391.11Show/hide
Query:  MSNCAVESCKVETSDGVKLHARVFKPKDEEARE--NLVVVLVHPYSVLGGCQGLLRGIAAGLAERGFRAVTFDMRGAGKSSGKASLTGFAEIKDVTAVCK
        MSNC VESCKVETSDGVKLH RVFKPKDE A+E  NL VVLVHPYS+LGGCQGLLRGIAAGLAE+G++AVTFDMRGAGKSSG+ASLTGFAEIKDV AVCK
Subjt:  MSNCAVESCKVETSDGVKLHARVFKPKDEEARE--NLVVVLVHPYSVLGGCQGLLRGIAAGLAERGFRAVTFDMRGAGKSSGKASLTGFAEIKDVTAVCK

Query:  WVCENLSVDRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLKNKLSSAVGRVETHLIEGV
        WVCENLSV RILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGL ASILFGRHHKAIL SPKPKLFVMGTRDGFTSVKQL+NKL SA GRVETHLIEGV
Subjt:  WVCENLSVDRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLKNKLSSAVGRVETHLIEGV

Query:  SHFEMEGPSYDAQMVNLILHFISSL
        SHFEMEGP+YDAQMVNLILHFISSL
Subjt:  SHFEMEGPSYDAQMVNLILHFISSL

A0A6J1DJB7 uncharacterized protein LOC1110214302.51e-156100Show/hide
Query:  MSNCAVESCKVETSDGVKLHARVFKPKDEEARENLVVVLVHPYSVLGGCQGLLRGIAAGLAERGFRAVTFDMRGAGKSSGKASLTGFAEIKDVTAVCKWV
        MSNCAVESCKVETSDGVKLHARVFKPKDEEARENLVVVLVHPYSVLGGCQGLLRGIAAGLAERGFRAVTFDMRGAGKSSGKASLTGFAEIKDVTAVCKWV
Subjt:  MSNCAVESCKVETSDGVKLHARVFKPKDEEARENLVVVLVHPYSVLGGCQGLLRGIAAGLAERGFRAVTFDMRGAGKSSGKASLTGFAEIKDVTAVCKWV

Query:  CENLSVDRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLKNKLSSAVGRVETHLIEGVSH
        CENLSVDRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLKNKLSSAVGRVETHLIEGVSH
Subjt:  CENLSVDRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLKNKLSSAVGRVETHLIEGVSH

Query:  FEMEGPSYDAQMVNLILHFISSL
        FEMEGPSYDAQMVNLILHFISSL
Subjt:  FEMEGPSYDAQMVNLILHFISSL

A0A6J1EZN6 uncharacterized protein LOC1114410072.78e-14492.38Show/hide
Query:  MSNCAVESCKVETSDGVKLHARVFKPKDEEARENLVVVLVHPYSVLGGCQGLLRGIAAGLAERGFRAVTFDMRGAGKSSGKASLTGFAEIKDVTAVCKWV
        M+NC V+SCKVETSDGVKLH RVFKP DEE RENLVVVLVHPYSVLGGCQGLLRGIA GLAERGFRAVTFDMRGAGKSSG+ SLTGFAEIKDVTAVC WV
Subjt:  MSNCAVESCKVETSDGVKLHARVFKPKDEEARENLVVVLVHPYSVLGGCQGLLRGIAAGLAERGFRAVTFDMRGAGKSSGKASLTGFAEIKDVTAVCKWV

Query:  CENLSVDRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLKNKLSSAVGRVETHLIEGVSH
        CENLSV+RILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQL+NKL SA GR ETHLIEGVSH
Subjt:  CENLSVDRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLKNKLSSAVGRVETHLIEGVSH

Query:  FEMEGPSYDAQMVNLILHFISSL
        FEMEGP+YDAQMVNLIL FISSL
Subjt:  FEMEGPSYDAQMVNLILHFISSL

A0A6J1HXF4 uncharacterized protein LOC1114675269.30e-14391.48Show/hide
Query:  MSNCAVESCKVETSDGVKLHARVFKPKDEEARENLVVVLVHPYSVLGGCQGLLRGIAAGLAERGFRAVTFDMRGAGKSSGKASLTGFAEIKDVTAVCKWV
        M NC V+SCKVETS+GVKLH RVFKP DEE RENLVVVLVHPYSVLGGCQGLLRGIA GLAERGFRAVTFDMRGAGKSSG+ SLTGFAEIKDVTAVC WV
Subjt:  MSNCAVESCKVETSDGVKLHARVFKPKDEEARENLVVVLVHPYSVLGGCQGLLRGIAAGLAERGFRAVTFDMRGAGKSSGKASLTGFAEIKDVTAVCKWV

Query:  CENLSVDRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLKNKLSSAVGRVETHLIEGVSH
        CENLSV+RILLVGSSAGAPIAGSSVDLIEQVVGY+SLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQL+NKL SA GR ETHLIEGVSH
Subjt:  CENLSVDRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLKNKLSSAVGRVETHLIEGVSH

Query:  FEMEGPSYDAQMVNLILHFISSL
        FEMEGP+YDAQMVNLIL FISSL
Subjt:  FEMEGPSYDAQMVNLILHFISSL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G19630.1 alpha/beta-Hydrolases superfamily protein2.3e-9372.57Show/hide
Query:  SNCAVESCKVETSDGVKLHARVFKPKDEEAR----ENLVVVLVHPYSVLGGCQGLLRGIAAGLAERGFRAVTFDMRGAGKSSGKASLTGFAEIKDVTAVC
        SN  VESC V++ +GVKLH R+FKP++E       ENLV+VLVHP+S+LGGCQ LL+GIA+ LA +GF++VTFD RGAGKS+G+A+LTGFAE+KDV AVC
Subjt:  SNCAVESCKVETSDGVKLHARVFKPKDEEAR----ENLVVVLVHPYSVLGGCQGLLRGIAAGLAERGFRAVTFDMRGAGKSSGKASLTGFAEIKDVTAVC

Query:  KWVCENLSVDRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLKNKLSSAVGRVETHLIEG
        +W+C+N+   RILLVGSSAGAPIAGS+V+ +EQVVGYVSLGYPFGL ASILFGRHHKAIL SPKPKLFVMGT+DGFTSV QLK KL SAVGR ETHLIEG
Subjt:  KWVCENLSVDRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLKNKLSSAVGRVETHLIEG

Query:  VSHFEMEGPSYDAQMVNLILHFISSL
        VSHF+MEGP YD+Q+ ++I  FISSL
Subjt:  VSHFEMEGPSYDAQMVNLILHFISSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGAACTGCGCCGTGGAGTCGTGTAAAGTCGAGACCAGTGATGGGGTGAAGCTCCACGCGAGGGTTTTCAAGCCGAAAGATGAGGAGGCGAGGGAAAATCTGGTGGT
TGTTCTGGTCCATCCCTATTCCGTTTTGGGTGGTTGTCAAGGGCTCTTGAGAGGAATAGCCGCTGGATTGGCGGAGAGGGGTTTTAGGGCTGTGACTTTTGACATGAGGG
GTGCTGGGAAATCGTCTGGAAAGGCTTCTCTCACTGGATTTGCGGAAATTAAGGATGTCACTGCTGTCTGCAAGTGGGTCTGTGAGAATTTGTCTGTTGATCGAATCTTG
TTGGTGGGTTCATCTGCAGGCGCCCCCATTGCTGGCTCATCTGTGGATTTGATAGAACAAGTGGTAGGCTATGTTAGCCTTGGCTACCCGTTCGGTCTAACAGCCTCGAT
TCTTTTCGGTAGACATCACAAAGCCATTTTACAGTCTCCAAAACCAAAGCTTTTCGTAATGGGGACACGGGACGGATTCACAAGTGTGAAGCAATTGAAGAACAAGTTAA
GTTCTGCAGTGGGACGTGTCGAAACACATCTAATAGAAGGTGTGAGCCACTTTGAAATGGAAGGCCCTTCATATGATGCTCAAATGGTGAATCTTATCCTTCATTTTATA
TCTTCTTTGTAG
mRNA sequenceShow/hide mRNA sequence
AAAAAAAAAATCTGACCTTAAACCATTTTAAAAACACTCAAATGGGAATTTGGAAAAAAAAAAAAAAGTTTTCCCAACTTCTAGTGTTTTGTTCTCCATTCGTTGGAATG
AAAAACAAATAAGCTCCATTTTGATTCTGCTCTCAAATCTGTCAATTCCGCTCGAAGAAACCGAATTTTGGTACCAAATTAATCGTCGGAACCCTCTTCTCCGACCACGT
CTCCGTCAAAATTGGTTATCTCCCAACCAGAATCATTAATTCCTCCGTTTCCCCACAAGGATTAAATTCCAGTTGCAGCCTCCGATCCAGTTCATAGCCCTCAGTTCCTC
TTCGTCCGCCTGTTTTTGAACTGTCGGCGATGTCGAACTGCGCCGTGGAGTCGTGTAAAGTCGAGACCAGTGATGGGGTGAAGCTCCACGCGAGGGTTTTCAAGCCGAAA
GATGAGGAGGCGAGGGAAAATCTGGTGGTTGTTCTGGTCCATCCCTATTCCGTTTTGGGTGGTTGTCAAGGGCTCTTGAGAGGAATAGCCGCTGGATTGGCGGAGAGGGG
TTTTAGGGCTGTGACTTTTGACATGAGGGGTGCTGGGAAATCGTCTGGAAAGGCTTCTCTCACTGGATTTGCGGAAATTAAGGATGTCACTGCTGTCTGCAAGTGGGTCT
GTGAGAATTTGTCTGTTGATCGAATCTTGTTGGTGGGTTCATCTGCAGGCGCCCCCATTGCTGGCTCATCTGTGGATTTGATAGAACAAGTGGTAGGCTATGTTAGCCTT
GGCTACCCGTTCGGTCTAACAGCCTCGATTCTTTTCGGTAGACATCACAAAGCCATTTTACAGTCTCCAAAACCAAAGCTTTTCGTAATGGGGACACGGGACGGATTCAC
AAGTGTGAAGCAATTGAAGAACAAGTTAAGTTCTGCAGTGGGACGTGTCGAAACACATCTAATAGAAGGTGTGAGCCACTTTGAAATGGAAGGCCCTTCATATGATGCTC
AAATGGTGAATCTTATCCTTCATTTTATATCTTCTTTGTAGGATTCATACTTTTGTTCAGAATATTTGTGGGAACAAACCCACCCTTTGTTGCCATTGTGGATCAGGCTC
TGTAATATTAACTTCTTGTGTGTACTGCAGGAAAATTAGAGGTTTTGGCTCAACAAATTTTGCAATTATTACAGTGAAAAAACAAAACTAAGCATCTAAATGAGATTCTA
TTGACTCTGGAATAGAGTTTTACAGGAAGAACATTTCTACTGAGTTAAAGCTAGATACCAAAATAAAACAAGAATTCTTAGTACAATCAATAAACAGAAGCCATGACTTC
ATTAACTGCACAGTTGATCACAAGAGTTCAAAAATGCTCGAAAACGCCATAACGGAGCTGAGCACACTGCAGAAATTTTTCCATAGCAAGTCTGCAAGTTGTAGAATTTT
TGCCATTTTGGAGCTTCAGCCACTTTACATCATCATCATACACGAACCATTGATTGCCCTTGAGAATATGGTTCTCTCCCATCTGCTTGAATCCCAAAAAAGAAAAAGAG
GAAGAAATATGATCTTCTCACTACCAAGTAACCCAATTTAACCGCTCTTAATTCTTTATATTAAAATATATATATATATAATCCATTGAAATTATGACGATGAAATTATA
CACCCAAATTTAATAATCAGATGGGTTTTTAGCCTTTCACCTTGCTAATGTTTTGTGAGAAGAAGCTATCACAAATATATTAATAATAAATACCAGGCAAACTTTTGGAT
CCATAAATCAGTACTGAAGGAGAAGTCTAATAATTGAAGTGAACAATTTTACAAAATTGGGAGACCGAGAACTGAAGTTGATAAAAAGAAAGCACAGTAAAAGAAAATGA
AGGTTTGGTATGTTTTCAAGCACAATGTACATATACAGAACCCAATGCATATCATTTAAATTAAGTACTGTACACCAGTAAGCATGTTTAATTAGTTATTACCACTTTCA
CCATCGGATGCATCTTCAAGCCTTGCGATCGCGTCGACGAGGGATTGTTCGTGCTCCTGCATTCACACAGATCCGAAATAAGTTAGATAAGTAAATGCACATAGAAAAGT
TAAGTCGGAGCTTAGGATGAAGAAAGATTCATACTTTCAGCACTTTCTTAGCCTTTTCGATCTCCATCGGGTCGGGATGATTGGCACCAAATACTTTTTCCACCTGTTCG
AGCAACTCAAATAACAAGGGAAAAACAAAGGTCATCACTTGCATTAACCAAAACAATAAAGAACAGAAAATGAATGATAAAAAGAAATCATATCATATCTTGCACACCTC
CTTAATCAGCGTGTCGGTGTGAAGTATTTCAATATCGCCCGTAGCCTGCTTCCTAGAACCATTTTGAGAGACAGGTAAGTCTTTCCTGGATTGGCCCTTTGTGGTCCCCC
GACCCCTTCCACCAGCTGCAACTGCACCACCCCGTGTCATGGACTTCTTATTGCCCCTCCCGTGTCCAGGTCGAGCGCCACTGCGAGGTAAATCGCCACGTTCCCACCTA
ATATCTTCAGGAGGTATCTATAACAGGAGAGGTGTATAGATAACTCATAAGTGACAATATTCCACTTTAGATCAGGATTTCAAATAGAAAAATGACTACTGAATTTATGG
CTGGTCATCTCATCATCCAATTAATAGTCTGATCTTTAAGTGAAAAAAGAAGCAACATTTTCAGACACACTACAGTCATGTGGTTTACAATCAACAGCAATCACTGTTAG
TGGTTTTCTATTTGAGCAGAAGACATTACAATTCTTGAACAACTCCCCTGGTATATTCTTTCATTTTTACTTGATTAAAGCTTGGTTTCTTATTAGGGAAAAAAAATCTT
GAAC
Protein sequenceShow/hide protein sequence
MSNCAVESCKVETSDGVKLHARVFKPKDEEARENLVVVLVHPYSVLGGCQGLLRGIAAGLAERGFRAVTFDMRGAGKSSGKASLTGFAEIKDVTAVCKWVCENLSVDRIL
LVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLKNKLSSAVGRVETHLIEGVSHFEMEGPSYDAQMVNLILHFI
SSL