CuGenDBv2

Lsi06G004750 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi06G004750
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Peptidase_S15 domain-containing protein
Genome location: chr06:5478495..5480939
RNA-Seq Expression: Lsi06G004750
Synteny: Lsi06G004750
Gene Ontology terms: GO:0016787 - hydrolase activity (molecular function)
InterPro domains: IPR000383 - Xaa-Pro dipeptidyl-peptidase-like domain
                  IPR029058 - Alpha/Beta hydrolase fold
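
The genome location field can be split into its parts programmatically. The following is a minimal sketch, assuming the `chrom:start..end` layout shown in this record and assuming 1-based, inclusive coordinates (the record does not state the convention); the function names `parse_location` and `span_length` are hypothetical helpers, not part of any database API.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr06:5478495..5480939")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 2445 bp on chr06; with a half-open convention the figure would be one less.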


Homology
GenBank top hits (e value; % identity; alignment)
XP_004149026.1 uncharacterized protein LOC101217852 [Cucumis sativus]; e value 5.8e-115; identity 95.09%
Query:  MSNCTVESCKVETSDGVKLHTRVFKPNDEEAR--ENLAVVLVHPFSILGGCQGLLRGIAAGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVIAVCK
        MSNCTVESCKVETSDGVKLHTRVFKP DEEA+  ENL VVLVHP+SILGGCQGLLRGIAAGLAERGY+AVTFDMRGAGKSSGRASLTGFAEIKDVIAVCK
Subjt:  MSNCTVESCKVETSDGVKLHTRVFKPNDEEAR--ENLAVVLVHPFSILGGCQGLLRGIAAGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVIAVCK

Query:  WVCENLSVHRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLQNKLKSAAGRIETHLIEGV
        WVCENLSVHRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGL ASILFGRHHKAIL SPKPKLFVMGTRDGFTSVKQLQNKLKSAAGR+E+HLIEGV
Subjt:  WVCENLSVHRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLQNKLKSAAGRIETHLIEGV

Query:  SHFEMEGPAYDAQMVNLILHFISS
        SHFEMEGPAYDAQMVNLILHFISS
Subjt:  SHFEMEGPAYDAQMVNLILHFISS

XP_008451125.1 PREDICTED: uncharacterized protein LOC103492503 [Cucumis melo]; e value 2.9e-114; identity 94.64%
Query:  MSNCTVESCKVETSDGVKLHTRVFKPNDEEAR--ENLAVVLVHPFSILGGCQGLLRGIAAGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVIAVCK
        MSNCTVESCKVETSDGVKLHTRVFKP DE A+  ENL VVLVHP+SILGGCQGLLRGIAAGLAE+GY+AVTFDMRGAGKSSGRASLTGFAEIKDVIAVCK
Subjt:  MSNCTVESCKVETSDGVKLHTRVFKPNDEEAR--ENLAVVLVHPFSILGGCQGLLRGIAAGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVIAVCK

Query:  WVCENLSVHRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLQNKLKSAAGRIETHLIEGV
        WVCENLSVHRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGL ASILFGRHHKAIL SPKPKLFVMGTRDGFTSVKQLQNKLKSAAGR+ETHLIEGV
Subjt:  WVCENLSVHRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLQNKLKSAAGRIETHLIEGV

Query:  SHFEMEGPAYDAQMVNLILHFISS
        SHFEMEGPAYDAQMVNLILHFISS
Subjt:  SHFEMEGPAYDAQMVNLILHFISS

XP_022154093.1 uncharacterized protein LOC111021430 [Momordica charantia]; e value 3.2e-113; identity 93.24%
Query:  MSNCTVESCKVETSDGVKLHTRVFKPNDEEARENLAVVLVHPFSILGGCQGLLRGIAAGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVIAVCKWV
        MSNC VESCKVETSDGVKLH RVFKP DEEARENL VVLVHP+S+LGGCQGLLRGIAAGLAERG+RAVTFDMRGAGKSSG+ASLTGFAEIKDV AVCKWV
Subjt:  MSNCTVESCKVETSDGVKLHTRVFKPNDEEARENLAVVLVHPFSILGGCQGLLRGIAAGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVIAVCKWV

Query:  CENLSVHRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLQNKLKSAAGRIETHLIEGVSH
        CENLSV RILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQL+NKL SA GR+ETHLIEGVSH
Subjt:  CENLSVHRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLQNKLKSAAGRIETHLIEGVSH

Query:  FEMEGPAYDAQMVNLILHFISS
        FEMEGP+YDAQMVNLILHFISS
Subjt:  FEMEGPAYDAQMVNLILHFISS

XP_022933651.1 uncharacterized protein LOC111441007 [Cucurbita moschata]; e value 4.6e-112; identity 93.24%
Query:  MSNCTVESCKVETSDGVKLHTRVFKPNDEEARENLAVVLVHPFSILGGCQGLLRGIAAGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVIAVCKWV
        M+NCTV+SCKVETSDGVKLHTRVFKP DEE RENL VVLVHP+S+LGGCQGLLRGIA GLAERG+RAVTFDMRGAGKSSGR SLTGFAEIKDV AVC WV
Subjt:  MSNCTVESCKVETSDGVKLHTRVFKPNDEEARENLAVVLVHPFSILGGCQGLLRGIAAGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVIAVCKWV

Query:  CENLSVHRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLQNKLKSAAGRIETHLIEGVSH
        CENLSV+RILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLQNKLKSAAGR ETHLIEGVSH
Subjt:  CENLSVHRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLQNKLKSAAGRIETHLIEGVSH

Query:  FEMEGPAYDAQMVNLILHFISS
        FEMEGPAYDAQMVNLIL FISS
Subjt:  FEMEGPAYDAQMVNLILHFISS

XP_038878932.1 uncharacterized protein LOC120071019 [Benincasa hispida]; e value 5.2e-116; identity 97.3%
Query:  MSNCTVESCKVETSDGVKLHTRVFKPNDEEARENLAVVLVHPFSILGGCQGLLRGIAAGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVIAVCKWV
        MSN TVESCKVETSDGVKLHTRVFKP DEEARENLAVVLVHP+SILGGCQGLLRGIAAGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVIAVCKWV
Subjt:  MSNCTVESCKVETSDGVKLHTRVFKPNDEEARENLAVVLVHPFSILGGCQGLLRGIAAGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVIAVCKWV

Query:  CENLSVHRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLQNKLKSAAGRIETHLIEGVSH
        CE LSV RILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLQNKLKSAAGRIETHLIEGVSH
Subjt:  CENLSVHRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLQNKLKSAAGRIETHLIEGVSH

Query:  FEMEGPAYDAQMVNLILHFISS
        FEMEGPAYDAQMVNLI HFISS
Subjt:  FEMEGPAYDAQMVNLILHFISS

TrEMBL top hits (e value; % identity; alignment)
A0A0A0M2M4 Peptidase_S15 domain-containing protein; e value 2.8e-115; identity 95.09%
Query:  MSNCTVESCKVETSDGVKLHTRVFKPNDEEAR--ENLAVVLVHPFSILGGCQGLLRGIAAGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVIAVCK
        MSNCTVESCKVETSDGVKLHTRVFKP DEEA+  ENL VVLVHP+SILGGCQGLLRGIAAGLAERGY+AVTFDMRGAGKSSGRASLTGFAEIKDVIAVCK
Subjt:  MSNCTVESCKVETSDGVKLHTRVFKPNDEEAR--ENLAVVLVHPFSILGGCQGLLRGIAAGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVIAVCK

Query:  WVCENLSVHRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLQNKLKSAAGRIETHLIEGV
        WVCENLSVHRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGL ASILFGRHHKAIL SPKPKLFVMGTRDGFTSVKQLQNKLKSAAGR+E+HLIEGV
Subjt:  WVCENLSVHRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLQNKLKSAAGRIETHLIEGV

Query:  SHFEMEGPAYDAQMVNLILHFISS
        SHFEMEGPAYDAQMVNLILHFISS
Subjt:  SHFEMEGPAYDAQMVNLILHFISS

A0A1S3BQV5 uncharacterized protein LOC103492503; e value 1.4e-114; identity 94.64%
Query:  MSNCTVESCKVETSDGVKLHTRVFKPNDEEAR--ENLAVVLVHPFSILGGCQGLLRGIAAGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVIAVCK
        MSNCTVESCKVETSDGVKLHTRVFKP DE A+  ENL VVLVHP+SILGGCQGLLRGIAAGLAE+GY+AVTFDMRGAGKSSGRASLTGFAEIKDVIAVCK
Subjt:  MSNCTVESCKVETSDGVKLHTRVFKPNDEEAR--ENLAVVLVHPFSILGGCQGLLRGIAAGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVIAVCK

Query:  WVCENLSVHRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLQNKLKSAAGRIETHLIEGV
        WVCENLSVHRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGL ASILFGRHHKAIL SPKPKLFVMGTRDGFTSVKQLQNKLKSAAGR+ETHLIEGV
Subjt:  WVCENLSVHRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLQNKLKSAAGRIETHLIEGV

Query:  SHFEMEGPAYDAQMVNLILHFISS
        SHFEMEGPAYDAQMVNLILHFISS
Subjt:  SHFEMEGPAYDAQMVNLILHFISS

A0A5A7TJY6 Abhydrolase_5 domain-containing protein; e value 1.4e-114; identity 94.64%
Query:  MSNCTVESCKVETSDGVKLHTRVFKPNDEEAR--ENLAVVLVHPFSILGGCQGLLRGIAAGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVIAVCK
        MSNCTVESCKVETSDGVKLHTRVFKP DE A+  ENL VVLVHP+SILGGCQGLLRGIAAGLAE+GY+AVTFDMRGAGKSSGRASLTGFAEIKDVIAVCK
Subjt:  MSNCTVESCKVETSDGVKLHTRVFKPNDEEAR--ENLAVVLVHPFSILGGCQGLLRGIAAGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVIAVCK

Query:  WVCENLSVHRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLQNKLKSAAGRIETHLIEGV
        WVCENLSVHRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGL ASILFGRHHKAIL SPKPKLFVMGTRDGFTSVKQLQNKLKSAAGR+ETHLIEGV
Subjt:  WVCENLSVHRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLQNKLKSAAGRIETHLIEGV

Query:  SHFEMEGPAYDAQMVNLILHFISS
        SHFEMEGPAYDAQMVNLILHFISS
Subjt:  SHFEMEGPAYDAQMVNLILHFISS

A0A6J1DJB7 uncharacterized protein LOC111021430; e value 1.5e-113; identity 93.24%
Query:  MSNCTVESCKVETSDGVKLHTRVFKPNDEEARENLAVVLVHPFSILGGCQGLLRGIAAGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVIAVCKWV
        MSNC VESCKVETSDGVKLH RVFKP DEEARENL VVLVHP+S+LGGCQGLLRGIAAGLAERG+RAVTFDMRGAGKSSG+ASLTGFAEIKDV AVCKWV
Subjt:  MSNCTVESCKVETSDGVKLHTRVFKPNDEEARENLAVVLVHPFSILGGCQGLLRGIAAGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVIAVCKWV

Query:  CENLSVHRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLQNKLKSAAGRIETHLIEGVSH
        CENLSV RILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQL+NKL SA GR+ETHLIEGVSH
Subjt:  CENLSVHRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLQNKLKSAAGRIETHLIEGVSH

Query:  FEMEGPAYDAQMVNLILHFISS
        FEMEGP+YDAQMVNLILHFISS
Subjt:  FEMEGPAYDAQMVNLILHFISS

A0A6J1EZN6 uncharacterized protein LOC111441007; e value 2.2e-112; identity 93.24%
Query:  MSNCTVESCKVETSDGVKLHTRVFKPNDEEARENLAVVLVHPFSILGGCQGLLRGIAAGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVIAVCKWV
        M+NCTV+SCKVETSDGVKLHTRVFKP DEE RENL VVLVHP+S+LGGCQGLLRGIA GLAERG+RAVTFDMRGAGKSSGR SLTGFAEIKDV AVC WV
Subjt:  MSNCTVESCKVETSDGVKLHTRVFKPNDEEARENLAVVLVHPFSILGGCQGLLRGIAAGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVIAVCKWV

Query:  CENLSVHRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLQNKLKSAAGRIETHLIEGVSH
        CENLSV+RILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLQNKLKSAAGR ETHLIEGVSH
Subjt:  CENLSVHRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLQNKLKSAAGRIETHLIEGVSH

Query:  FEMEGPAYDAQMVNLILHFISS
        FEMEGPAYDAQMVNLIL FISS
Subjt:  FEMEGPAYDAQMVNLILHFISS

SwissProt top hits (e value; % identity; alignment)
No hits found

Arabidopsis top hits (e value; % identity; alignment)
AT2G26740.1 soluble epoxide hydrolase; e value 7.5e-04; identity 32.82%
Query:  DGVKLHTRVFKPNDEEARENLAVVLVHPFSILGGCQGLLRGIAAGLAERGYRAVTFDMRGAGKSSGRA---SLTGFAEIKDVIAVCKWVCENLSVHRILL
        +G+ +H  +  P+D        V+L+H F  L       R    GLA RGYRAV  D+RG G S   A   S T F  + D+IAV   +  +    ++ +
Subjt:  DGVKLHTRVFKPNDEEARENLAVVLVHPFSILGGCQGLLRGIAAGLAERGYRAVTFDMRGAGKSSGRA---SLTGFAEIKDVIAVCKWVCENLSVHRILL

Query:  VGSSAGAPIAG-SSVDLIEQVVGYVSLGYPF
        VG   GA IA    +   ++V   V+L  PF
Subjt:  VGSSAGAPIAG-SSVDLIEQVVGYVSLGYPF

AT2G26750.1 alpha/beta-Hydrolases superfamily protein; e value 3.4e-04; identity 33.64%
Query:  DGVKLHTRVFKPNDEEARENLAVVLVHPFSILGGCQGLLRGIAAGLAERGYRAVTFDMRGAGKSSGRA---SLTGFAEIKDVIAVCKWVCENLSVHRILL
        +G+ +H  +  P+D        V+L+H F  L       R   +GLA RGYRAV  D+RG G S   A   S T F  + D++AV   + +     ++ +
Subjt:  DGVKLHTRVFKPNDEEARENLAVVLVHPFSILGGCQGLLRGIAAGLAERGYRAVTFDMRGAGKSSGRA---SLTGFAEIKDVIAVCKWVCENLSVHRILL

Query:  VGSSAGAPIA
        VG   GA IA
Subjt:  VGSSAGAPIA

AT5G19630.1 alpha/beta-Hydrolases superfamily protein; e value 1.6e-94; identity 72.89%
Query:  SNCTVESCKVETSDGVKLHTRVFKPNDEEAR----ENLAVVLVHPFSILGGCQGLLRGIAAGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVIAVC
        SN  VESC V++ +GVKLHTR+FKP +E       ENL +VLVHPFS+LGGCQ LL+GIA+ LA +G+++VTFD RGAGKS+GRA+LTGFAE+KDV+AVC
Subjt:  SNCTVESCKVETSDGVKLHTRVFKPNDEEAR----ENLAVVLVHPFSILGGCQGLLRGIAAGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVIAVC

Query:  KWVCENLSVHRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLQNKLKSAAGRIETHLIEG
        +W+C+N+  HRILLVGSSAGAPIAGS+V+ +EQVVGYVSLGYPFGL ASILFGRHHKAIL SPKPKLFVMGT+DGFTSV QL+ KLKSA GR ETHLIEG
Subjt:  KWVCENLSVHRILLVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLQNKLKSAAGRIETHLIEG

Query:  VSHFEMEGPAYDAQMVNLILHFISS
        VSHF+MEGP YD+Q+ ++I  FISS
Subjt:  VSHFEMEGPAYDAQMVNLILHFISS
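
In the alignments above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts these for a single alignment block; `midline_stats` is a hypothetical helper, and the value it returns covers only the block passed in, not necessarily the whole-hit % identity reported in the hit line.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one BLAST-style alignment block.

    query   -- the aligned query row (may contain '-' gap characters)
    midline -- the row between Query and Subjt; trailing spaces may have
               been trimmed, so it is padded to the query length
    """
    midline = midline.ljust(len(query))
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    length = len(query)  # alignment columns, gaps included
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "percent_identity": round(100.0 * identities / length, 2),
    }


# Last block of the AT2G26750.1 alignment shown above.
print(midline_stats("VGSSAGAPIA", "VG   GA IA"))
```

For that ten-column block, six columns are identical, which gives 60% identity for the block alone.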


Sequences
CDS sequence
ATGTCGAACTGTACCGTGGAGTCTTGTAAAGTCGAAACTAGTGATGGGGTAAAGCTCCACACAAGGGTTTTCAAGCCAAATGATGAGGAAGCGAGGGAAAATCTGGCTGT
TGTTTTGGTCCATCCTTTTTCCATTTTGGGTGGTTGTCAAGGGCTTTTGAGAGGAATAGCCGCTGGGTTAGCGGAAAGAGGTTATAGAGCTGTGACTTTTGATATGAGGG
GTGCTGGGAAATCGTCTGGAAGGGCTTCTTTGACTGGATTTGCTGAAATTAAAGATGTGATTGCTGTTTGCAAATGGGTTTGTGAGAATTTGTCTGTTCATCGAATCTTG
TTGGTGGGTTCATCCGCAGGGGCTCCCATTGCAGGCTCGTCTGTGGATTTGATAGAACAAGTGGTAGGCTATGTTAGCCTTGGCTATCCTTTTGGCCTAACTGCCTCAAT
TCTTTTTGGAAGACACCATAAAGCCATTTTACAGTCTCCGAAACCAAAACTTTTCGTGATGGGTACACGGGACGGGTTCACAAGTGTAAAACAATTGCAGAACAAGCTAA
AATCTGCAGCAGGACGTATCGAAACACATCTAATAGAAGGTGTGAGTCACTTTGAAATGGAAGGCCCTGCATATGATGCTCAAATGGTGAATCTTATCCTTCATTTTATT
TCTTCTTTTTAG
mRNA sequence
CCCAACTTTTTTTTATCATTATGTTCTCCACAAATGCTTCAATTCTCAATTTGCTCTCAAATCTGTCATTTCCGCTTTTGAAAACTGAATTCCCCATCAAAATTAATCGT
CCTCTTAACTTTCTTCTTCCCAATCCCTACCAGTCTCCATCAGATTGTTGTAACAATCTGCTCCCCAACCAAAATCATTGAAAAATCCTCTTCTCTACAACCAATAAATT
CCACTTCGCAGCCTCCGATCCACTCCATAACCCTTCACTTCCTCTTCATCGCCGTTAACTGCCGGCGATGTCGAACTGTACCGTGGAGTCTTGTAAAGTCGAAACTAGTG
ATGGGGTAAAGCTCCACACAAGGGTTTTCAAGCCAAATGATGAGGAAGCGAGGGAAAATCTGGCTGTTGTTTTGGTCCATCCTTTTTCCATTTTGGGTGGTTGTCAAGGG
CTTTTGAGAGGAATAGCCGCTGGGTTAGCGGAAAGAGGTTATAGAGCTGTGACTTTTGATATGAGGGGTGCTGGGAAATCGTCTGGAAGGGCTTCTTTGACTGGATTTGC
TGAAATTAAAGATGTGATTGCTGTTTGCAAATGGGTTTGTGAGAATTTGTCTGTTCATCGAATCTTGTTGGTGGGTTCATCCGCAGGGGCTCCCATTGCAGGCTCGTCTG
TGGATTTGATAGAACAAGTGGTAGGCTATGTTAGCCTTGGCTATCCTTTTGGCCTAACTGCCTCAATTCTTTTTGGAAGACACCATAAAGCCATTTTACAGTCTCCGAAA
CCAAAACTTTTCGTGATGGGTACACGGGACGGGTTCACAAGTGTAAAACAATTGCAGAACAAGCTAAAATCTGCAGCAGGACGTATCGAAACACATCTAATAGAAGGTGT
GAGTCACTTTGAAATGGAAGGCCCTGCATATGATGCTCAAATGGTGAATCTTATCCTTCATTTTATTTCTTCTTTTTAGGATAATTAACTTTGCTCAGAATATTTGTGTG
GAAATAACTCTTATTACCATTGTGCATTAGGGTTTGTAATATTAACTTCTTGTGTGTTGTGTAGGAAATTTAGAGGTGTTGGCTCAACAAATTTTGTATTATCATAGTGA
AAAGCAAAACCAAACATCTGCCATAGTGTTGGATATGTATTAACACAGCATATATGAGAGGCAGCTGAGTGACTAATCTACATGAGATTCTATTGACTATAGAATAAAGT
TTTACAGAAAGATCATTTCTACTGAAGTTAAAGCTAAAGATTAAAATAAAACAACTCTTAACCATCAATAAACAAAAGCCAAGGCAAGGCACCAGTAACTGCACAGTTGG
TCATAAAAATTTGAAAACGCCACAACAGAG
Protein sequence
MSNCTVESCKVETSDGVKLHTRVFKPNDEEARENLAVVLVHPFSILGGCQGLLRGIAAGLAERGYRAVTFDMRGAGKSSGRASLTGFAEIKDVIAVCKWVCENLSVHRIL
LVGSSAGAPIAGSSVDLIEQVVGYVSLGYPFGLTASILFGRHHKAILQSPKPKLFVMGTRDGFTSVKQLQNKLKSAAGRIETHLIEGVSHFEMEGPAYDAQMVNLILHFI
SSF
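
The protein sequence can be checked against the CDS by translating it codon by codon. The following is a minimal sketch using the standard genetic code (an assumption that is reasonable for a nuclear plant gene); `translate` is a hypothetical helper, and the two inputs are the first eight and last seven codons of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGTCGAACTGTACCGTGGAGTCT"))  # start of the CDS
print(translate("CATTTTATTTCTTCTTTTTAG"))     # end of the CDS, including the stop codon
```

The two fragments translate to MSNCTVES and HFISSF, matching the start and end of the protein sequence above; passing the full CDS (with line breaks) works the same way.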