; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr027543 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr027543
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionCystatin domain-containing protein
Genome locationtig00153054:2376953..2401593
RNA-Seq ExpressionSgr027543
SyntenySgr027543
Gene Ontology termsGO:0010466 - negative regulation of peptidase activity (biological process)
GO:0004869 - cysteine-type endopeptidase inhibitor activity (molecular function)
InterPro domainsIPR000010 - Cystatin domain
IPR006525 - Cystatin-related, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137488.1 uncharacterized protein LOC111008920 isoform X1 [Momordica charantia]6.1e-3458.65Show/hide
Query:  STDSEHNSPYHSDGYYEDGVRSMNEEEEDEFYCAFQESEGFDVPTFPESYAFGLIVPVRDI-LLTPELQVCSEEAIKHYNNENGTNFEVVKIVKANQQAS
        S  + +N  Y SDGYY+DGVR MNEEE + +Y A QESEGFDVP+FP +YAF +I P+  I L++ E++ C+ +AIKHYNNENG +FE VK++KAN QA+
Subjt:  STDSEHNSPYHSDGYYEDGVRSMNEEEEDEFYCAFQESEGFDVPTFPESYAFGLIVPVRDI-LLTPELQVCSEEAIKHYNNENGTNFEVVKIVKANQQAS

Query:  CGILYFITFEAKQTGTPSEYPTTAFQARVLFGI
         G L+F+TF+ KQTG P + PTT  QARVL GI
Subjt:  CGILYFITFEAKQTGTPSEYPTTAFQARVLFGI

XP_022137526.1 uncharacterized protein LOC111008952 [Momordica charantia]6.5e-3660.31Show/hide
Query:  PSASAYHFDEYFADGLREMTDEEEMEYYLAFEKTQGFDMPTFPESYSMDRIEPVSERRLFSPELQECA--EEAIQHYNKENGTNFEFVKMVKANNGAACG
        P     H D YF +  REMT EEE+EYY+A +KT+GFDMP+FP+SY+  RIE ++  RL S ELQECA  ++A+ ++N++NGT+FEFVKMVKA N A  G
Subjt:  PSASAYHFDEYFADGLREMTDEEEMEYYLAFEKTQGFDMPTFPESYSMDRIEPVSERRLFSPELQECA--EEAIQHYNKENGTNFEFVKMVKANNGAACG

Query:  ILYYITFEVKQIGSPPNSPTTTFQAHVLFGI
        I+YY+TFEVKQ+GSPPNSPT T QA VL G+
Subjt:  ILYYITFEVKQIGSPPNSPTTTFQAHVLFGI

XP_022942206.1 uncharacterized protein LOC111447330 [Cucurbita moschata]3.2e-2752.03Show/hide
Query:  DGYYEDGVRSMNEEEEDEFYCAFQESEGFDVPTFPESYAFGLIVPVR--DIL-LTPELQVCSEEAIKHYNNENGTNFEVVKIVKANQQASCGILYFITFE
        D +++DG  ++ ++EE E+YCA +ES+GFDVP F   YAF LI P++  ++  +  E+Q+ + EAIKHYNNENGTNFEVV IVKAN    CG +Y+ITF 
Subjt:  DGYYEDGVRSMNEEEEDEFYCAFQESEGFDVPTFPESYAFGLIVPVR--DIL-LTPELQVCSEEAIKHYNNENGTNFEVVKIVKANQQASCGILYFITFE

Query:  AKQTGTPSEYPTTAFQARVLFGI
         K  GTP E+P+  FQA+V + I
Subjt:  AKQTGTPSEYPTTAFQARVLFGI

XP_023521681.1 UPF0725 protein At4g29550-like [Cucurbita pepo subsp. pepo]3.2e-2745.45Show/hide
Query:  MASSPKFCLSDSTDSECDSPCDSEEYYENGIRGMNKEEEEQYYRALEKSEGFDVPTFPESCAFGLIVPVC---DIPLTPGLRVCTEEAIKHYNNENGTNF
        MASS    LS+  + E DS    +E+Y++G   + K+EE +Y+RA+E+SEGFDVP F +  ++ LI P+    +  +   +R+   +AIKHYN ENGTNF
Subjt:  MASSPKFCLSDSTDSECDSPCDSEEYYENGIRGMNKEEEEQYYRALEKSEGFDVPTFPESCAFGLIVPVC---DIPLTPGLRVCTEEAIKHYNNENGTNF

Query:  EIVEIVKANQQGACGILYFITFEAKQIGTPPEYPTTTFQARVLFGILHKIEPITSSSLSDPPLKN
        E+VEIVKAN  G CG +Y+ITF  K IGT  E+P  TFQA+V + I   I+ I    L  P   N
Subjt:  EIVEIVKANQQGACGILYFITFEAKQIGTPPEYPTTTFQARVLFGILHKIEPITSSSLSDPPLKN

XP_023525925.1 uncharacterized protein LOC111789396 [Cucurbita pepo subsp. pepo]5.7e-3260.98Show/hide
Query:  DSEEYYENGIRGMNKEEEEQYYRALEKSEGFDVPTFPESCAFGLIVPVCDIPLT-PGLRVCTEEAIKHYNNENGTNFEIVEIVKANQQGACGILYFITFE
        DS+E YE+G+R MN+EE  +YY+ L  ++GFDVPTFP + A GLIVP+C+  L+ P LR C E+AI HYN  NGTNFE V+IVKANQQ   G  Y+ITF+
Subjt:  DSEEYYENGIRGMNKEEEEQYYRALEKSEGFDVPTFPESCAFGLIVPVCDIPLT-PGLRVCTEEAIKHYNNENGTNFEIVEIVKANQQGACGILYFITFE

Query:  AKQIGTPPEYPTTTFQARVLFGI
         KQIGT  E+PTTTF+A+VL GI
Subjt:  AKQIGTPPEYPTTTFQARVLFGI

TrEMBL top hitse value%identityAlignment
A0A6J1C8H7 uncharacterized protein LOC1110089523.1e-3660.31Show/hide
Query:  PSASAYHFDEYFADGLREMTDEEEMEYYLAFEKTQGFDMPTFPESYSMDRIEPVSERRLFSPELQECA--EEAIQHYNKENGTNFEFVKMVKANNGAACG
        P     H D YF +  REMT EEE+EYY+A +KT+GFDMP+FP+SY+  RIE ++  RL S ELQECA  ++A+ ++N++NGT+FEFVKMVKA N A  G
Subjt:  PSASAYHFDEYFADGLREMTDEEEMEYYLAFEKTQGFDMPTFPESYSMDRIEPVSERRLFSPELQECA--EEAIQHYNKENGTNFEFVKMVKANNGAACG

Query:  ILYYITFEVKQIGSPPNSPTTTFQAHVLFGI
        I+YY+TFEVKQ+GSPPNSPT T QA VL G+
Subjt:  ILYYITFEVKQIGSPPNSPTTTFQAHVLFGI

A0A6J1CAH1 uncharacterized protein LOC111008920 isoform X12.9e-3458.65Show/hide
Query:  STDSEHNSPYHSDGYYEDGVRSMNEEEEDEFYCAFQESEGFDVPTFPESYAFGLIVPVRDI-LLTPELQVCSEEAIKHYNNENGTNFEVVKIVKANQQAS
        S  + +N  Y SDGYY+DGVR MNEEE + +Y A QESEGFDVP+FP +YAF +I P+  I L++ E++ C+ +AIKHYNNENG +FE VK++KAN QA+
Subjt:  STDSEHNSPYHSDGYYEDGVRSMNEEEEDEFYCAFQESEGFDVPTFPESYAFGLIVPVRDI-LLTPELQVCSEEAIKHYNNENGTNFEVVKIVKANQQAS

Query:  CGILYFITFEAKQTGTPSEYPTTAFQARVLFGI
         G L+F+TF+ KQTG P + PTT  QARVL GI
Subjt:  CGILYFITFEAKQTGTPSEYPTTAFQARVLFGI

A0A6J1FN74 uncharacterized protein LOC1114473301.6e-2752.03Show/hide
Query:  DGYYEDGVRSMNEEEEDEFYCAFQESEGFDVPTFPESYAFGLIVPVR--DIL-LTPELQVCSEEAIKHYNNENGTNFEVVKIVKANQQASCGILYFITFE
        D +++DG  ++ ++EE E+YCA +ES+GFDVP F   YAF LI P++  ++  +  E+Q+ + EAIKHYNNENGTNFEVV IVKAN    CG +Y+ITF 
Subjt:  DGYYEDGVRSMNEEEEDEFYCAFQESEGFDVPTFPESYAFGLIVPVR--DIL-LTPELQVCSEEAIKHYNNENGTNFEVVKIVKANQQASCGILYFITFE

Query:  AKQTGTPSEYPTTAFQARVLFGI
         K  GTP E+P+  FQA+V + I
Subjt:  AKQTGTPSEYPTTAFQARVLFGI

A0A6J1FPD6 UPF0725 protein At4g29550-like2.3e-2644.59Show/hide
Query:  LSDSTDSECDSPCDSEEYYENGIRGMNKEEEEQYYRALEKSEGFDVPTFPESCAFGLIVPVC---DIPLTPGLRVCTEEAIKHYNNENGTNFEIVEIVKA
        LS+  + E DS    +E+Y++G   + K+EE +Y+RA+E+SEGFDVP F +  ++ LI P+    D  +   +R+   +AIK+YN ENGTNFE+VEIVKA
Subjt:  LSDSTDSECDSPCDSEEYYENGIRGMNKEEEEQYYRALEKSEGFDVPTFPESCAFGLIVPVC---DIPLTPGLRVCTEEAIKHYNNENGTNFEIVEIVKA

Query:  NQQGACGILYFITFEAKQIGTPPEYPTTTFQARVLFGILHKIEPITSSSLSDPPLKN
        N  G CG +Y+ITF  K IGT  E+   TFQA+V + I   I+ I    L  P   N
Subjt:  NQQGACGILYFITFEAKQIGTPPEYPTTTFQARVLFGILHKIEPITSSSLSDPPLKN

A0A6J1IJT3 uncharacterized protein LOC1114753201.7e-2651.22Show/hide
Query:  DGYYEDGVRSMNEEEEDEFYCAFQESEGFDVPTFPESYAFGLIVPVR---DILLTPELQVCSEEAIKHYNNENGTNFEVVKIVKANQQASCGILYFITFE
        D +++DG  ++ ++EE E++CA  ES+GFDVP F   YAFGLI P++      L  E+Q+ + EAIKHYN+ENGTNFEVV IVKAN +  CG +Y+ITF 
Subjt:  DGYYEDGVRSMNEEEEDEFYCAFQESEGFDVPTFPESYAFGLIVPVR---DILLTPELQVCSEEAIKHYNNENGTNFEVVKIVKANQQASCGILYFITFE

Query:  AKQTGTPSEYPTTAFQARVLFGI
         K  GT +E+P+  FQA+V + I
Subjt:  AKQTGTPSEYPTTAFQARVLFGI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G50690.1 Cystatin/monellin superfamily protein1.7e-0528Show/hide
Query:  YEDGVRSMNEEEEDEFYCAFQESEGFDVPTFPES---YAFGLIVPVRDILLTPE-----LQVCSEEAIKHYNNENGTNFEVVKIVKANQQASCGILYFIT
        Y +   S + E+E      +Q S  +D  T  +    + + +I    D+   PE     +   S+ A++ YN++   N E+V+ VKAN++   G +++IT
Subjt:  YEDGVRSMNEEEEDEFYCAFQESEGFDVPTFPES---YAFGLIVPVRDILLTPE-----LQVCSEEAIKHYNNENGTNFEVVKIVKANQQASCGILYFIT

Query:  FEAKQTGTPSEYPTTAFQARVLFGI
        FEAK   + ++  T     R L GI
Subjt:  FEAKQTGTPSEYPTTAFQARVLFGI

AT1G63190.1 Cystatin/monellin superfamily protein9.7e-0643.33Show/hide
Query:  MDRIEPVSERRLFSPELQECAEEAIQHYNKENGTNFEFVKMVKANNGAACGILYYITFEV
        +D  E V E       L+  + +A+  YN+E+ T FEFVK+VKAN    C I++ ITFEV
Subjt:  MDRIEPVSERRLFSPELQECAEEAIQHYNKENGTNFEFVKMVKANNGAACGILYYITFEV

AT1G63200.1 Cystatin/monellin superfamily protein1.5e-0641.54Show/hide
Query:  LQECAEEAIQHYNKENGTNFEFVKMVKANNGAACGILYYITFEVKQIGSPPNSPTTTFQAHVLFG
        L+  A+EA+  +N  +GT +EFVK+VKAN   AC +++ ITF+VK    P +     FQ  V  G
Subjt:  LQECAEEAIQHYNKENGTNFEFVKMVKANNGAACGILYYITFEVKQIGSPPNSPTTTFQAHVLFG

AT1G63205.1 Cystatin/monellin superfamily protein2.1e-0831.21Show/hide
Query:  EEEEQYYRALEKSEGFDVPTFPESCAFGL-IVPVCDIPL-------TPG--LRVCTEEAIKHYNNENGTN-FEIVEIVKANQQGACGILYFITFEAKQIG
        EE       + KSEGFD+      C F   +V   D          T G  ++  ++E++K YN+E GTN +E  E+VKAN  G+CG ++ ITF   Q+ 
Subjt:  EEEEQYYRALEKSEGFDVPTFPESCAFGL-IVPVCDIPL-------TPG--LRVCTEEAIKHYNNENGTN-FEIVEIVKANQQGACGILYFITFEAKQIG

Query:  TPPEYPTTTFQARVLFGILHKIEPITSSSLSDPPLKNYTTS
         P +    TFQAR+ +   +  E +      +P + ++ T+
Subjt:  TPPEYPTTTFQARVLFGILHKIEPITSSSLSDPPLKNYTTS

AT2G37435.1 Cystatin/monellin superfamily protein1.4e-0729.6Show/hide
Query:  EKEYY---RALEETQGFDVPTFPKSYAFGLIMPAPVELFSQKL-----------QACAEEAIKHYNKENDTNFEFVKIVKANHRAARGILYFITFEVKQI
        E+EYY   + +E+++GFD+        F      PV+L   +L                ++++H+N+ + T +EFV+ +KANH  + G++YFITFE K +
Subjt:  EKEYY---RALEETQGFDVPTFPKSYAFGLIMPAPVELFSQKL-----------QACAEEAIKHYNKENDTNFEFVKIVKANHRAARGILYFITFEVKQI

Query:  GTPLEFPTTTFQARV--LSGIPDTI
            +  +  FQA++    G P+ I
Subjt:  GTPLEFPTTTFQARV--LSGIPDTI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTGGTCTTTGGCTTTGGTCTCTATGCGGTTGGTTCTTCTATACGCTGTCTGGTTACTTTAGTCTGCTTGTTAATTTGCTTTCTCATTTTCTGCTTATTGGTTGGTT
GACTCTCGACTGTGTGACACATTTTGAAGCAGGGGACTTTCTTAGGGTGGCGTTGATGGCTTCTTCACCCGAATTCAGCTTCTCTAACATATCCTCACCTGGCTCTGCAC
ACAATTCTCCAAATTATTTGGATGGATATGATGGAGACGGTATCCATGAGATGACTGAGGAAGAGGAGAAGGAATACTATCGTGCCTTAGAAGAAACCCAGGGTTTTGAT
GTACCGACTTTTCCTAAATCCTATGCTTTTGGTCTTATTATGCCTGCACCTGTGGAGCTATTTTCACAAAAGCTTCAAGCATGCGCAGAGGAAGCCATTAAACATTACAA
CAAGGAAAATGATACAAATTTTGAGTTTGTGAAAATTGTGAAGGCGAATCATCGAGCTGCTCGTGGTATCTTGTATTTCATCACTTTCGAGGTTAAGCAAATTGGAACAC
CTCTAGAGTTCCCAACCACAACCTTCCAAGCTCGGGTGCTATCTGGTATTCCTGATACTATAGAGCTTAGCTTCCCCCGGCCCTCTTCCTCCAACTCTACTGCTTTCATT
TCCTTGCGGGAGTCAGTTTGTTGGTGTTCCTTCGGCCTTGGTCTCTATGAGTATGATGTTGGTTCTTCTATTCTATTATATGCTGCTCTGCTATGGTTCCTTGCTTGTTA
TTGGCTGATTGTGTCACACATCACAAATACACCTTACACCCTAACAAATACTTTTGCATTTACTTTGAATCAATATTGTTTCTGCCTTCTTTTTTCCTTTTTCTTTTTTT
TTTTGTTTTTTTTTCCTGAGATCATCAAACAAAGGGTGGGGGATTCGGGAGGTTACCATACTGATGCCTTAACCGGGGACTTTCTTGCGTTGATGGCGTTTTCACCCAAA
CTCAGCCCATCTGCCTCTGCATATCATTTTGATGAATATTTTGCAGACGGTCTACGTGAGATGACTGATGAAGAGGAGATGGAATACTATCTTGCCTTTGAAAAAACCCA
GGGTTTTGATATGCCCACTTTTCCTGAATCCTATTCTATGGATCGTATTGAGCCTGTATCTGAGCGTCGATTATTTTCACCAGAGCTTCAAGAATGCGCAGAGGAAGCTA
TTCAACATTACAACAAGGAAAATGGTACAAATTTTGAATTTGTGAAGATGGTGAAGGCAAATAATGGAGCTGCTTGTGGTATATTATATTATATCACCTTCGAAGTCAAG
CAAATTGGATCACCTCCAAACTCCCCCACCACAACATTCCAAGCTCATGTGCTGTTTGGTATTGATGGTATAGAGGGTTTCTCCAATTTCTCAAGAGATAGAACTATCAA
CAAGATAAGAGACCTCGGACATCAGAAGGAGTCTGCTATTTATGCGAAGCAACTTCCGTTAGGGTTTTGTCCAATTTCTGAAAAGAGCTTCATTAATGTAGCTATCTTCT
TGCATGGATTATTTCTACTGACAGATTTTGAAGCTGGGGACTTTCTAGGGGCGGCTTTCATGGCTTCTTCACCCAAATTCTGTTTCTCTGACTCAACTGACTCTGAACAC
AATTCTCCGTATCACTCTGATGGATATTACGAAGACGGTGTCCGTTCGATGAATGAAGAAGAAGAAGACGAATTTTATTGTGCCTTCCAAGAAAGCGAGGGTTTTGATGT
ACCGACTTTTCCTGAATCCTATGCTTTTGGTCTTATTGTGCCTGTACGTGACATTCTACTTACCCCAGAGCTTCAAGTATGCTCAGAAGAAGCCATTAAACATTACAACA
ATGAAAATGGTACAAATTTTGAAGTTGTGAAGATTGTGAAGGCGAATCAACAAGCTTCTTGTGGTATTTTGTATTTCATTACCTTTGAGGCCAAACAAACTGGAACACCC
TCAGAGTACCCAACCACAGCCTTCCAAGCTCGAGTTCTTTTTGGAATTTTTCATAAAATAGAGGGGGTGGCTTTCATGGCTTCTTCACCCAAATTCTGCCTCTCTGACTC
AACTGACTCTGAATGCGATTCCCCATGTGATTCTGAAGAATATTATGAAAATGGTATCCGTGGGATGAATAAGGAAGAGGAAGAGCAATATTATCGTGCATTAGAAAAAA
GCGAGGGTTTTGATGTACCAACTTTTCCAGAATCCTGTGCTTTTGGTCTTATTGTTCCTGTCTGCGACATTCCACTTACCCCAGGACTTCGAGTGTGCACAGAGGAAGCT
ATTAAACATTACAACAACGAAAATGGTACAAATTTTGAAATTGTGGAGATTGTGAAGGCAAATCAACAAGGTGCGTGTGGTATTTTGTATTTCATCACTTTCGAGGCCAA
ACAAATTGGAACACCTCCAGAGTATCCAACCACAACCTTCCAAGCTCGAGTGCTGTTTGGAATTCTTCATAAAATAGAGCCTATAACGTCTTCTTCTCTCTCTGACCCAC
CACTGAAAAACTACACCACCTCCATTTTTTCTCTCAAGGTTTCTCCTTCTCTCACTCTTCTGTCTCTCTATCACTCTGTTTCTATGGCTACCTCCTCCAAGACATCAAGA
AATTCAAGATTTCAAATTCACCGGCAGCTTCTCCCGAAAGCTCCGATTCATATCTCCAGCTCTACAGCCAGCGGCTTAACAAGTAGAGGAATAGTGACTAGTCGGAGAGT
GAACGCTGATTCCAAAGCGACTTCGAGGTTGGTGGTCTGCTCGGTAGCCTCCGAGTCTATCAAGACAATATTGCGACGGCACATCAATGACAAACTGAATGAGGAGGCGG
CAACCAAGGAAATGGGAAAGTTGTTGCTGCGGCCATCAAATTACCACAGCGATCAGAACTTGAGCAGATTTTCTACTTACAGTTTACAGTATTTTGAAGAGCAGGGGACT
TTGTTGGTGGTGGTTTTGATGGCTTCTTATCCACCCGAATTCTACTTCTCCGACCACTCATCTGCCTCTGCAGTTGCACACAATTCTAATCCATGTGATTATGATTCTGA
TGGATGGATATTATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTGGTCTTTGGCTTTGGTCTCTATGCGGTTGGTTCTTCTATACGCTGTCTGGTTACTTTAGTCTGCTTGTTAATTTGCTTTCTCATTTTCTGCTTATTGGTTGGTT
GACTCTCGACTGTGTGACACATTTTGAAGCAGGGGACTTTCTTAGGGTGGCGTTGATGGCTTCTTCACCCGAATTCAGCTTCTCTAACATATCCTCACCTGGCTCTGCAC
ACAATTCTCCAAATTATTTGGATGGATATGATGGAGACGGTATCCATGAGATGACTGAGGAAGAGGAGAAGGAATACTATCGTGCCTTAGAAGAAACCCAGGGTTTTGAT
GTACCGACTTTTCCTAAATCCTATGCTTTTGGTCTTATTATGCCTGCACCTGTGGAGCTATTTTCACAAAAGCTTCAAGCATGCGCAGAGGAAGCCATTAAACATTACAA
CAAGGAAAATGATACAAATTTTGAGTTTGTGAAAATTGTGAAGGCGAATCATCGAGCTGCTCGTGGTATCTTGTATTTCATCACTTTCGAGGTTAAGCAAATTGGAACAC
CTCTAGAGTTCCCAACCACAACCTTCCAAGCTCGGGTGCTATCTGGTATTCCTGATACTATAGAGCTTAGCTTCCCCCGGCCCTCTTCCTCCAACTCTACTGCTTTCATT
TCCTTGCGGGAGTCAGTTTGTTGGTGTTCCTTCGGCCTTGGTCTCTATGAGTATGATGTTGGTTCTTCTATTCTATTATATGCTGCTCTGCTATGGTTCCTTGCTTGTTA
TTGGCTGATTGTGTCACACATCACAAATACACCTTACACCCTAACAAATACTTTTGCATTTACTTTGAATCAATATTGTTTCTGCCTTCTTTTTTCCTTTTTCTTTTTTT
TTTTGTTTTTTTTTCCTGAGATCATCAAACAAAGGGTGGGGGATTCGGGAGGTTACCATACTGATGCCTTAACCGGGGACTTTCTTGCGTTGATGGCGTTTTCACCCAAA
CTCAGCCCATCTGCCTCTGCATATCATTTTGATGAATATTTTGCAGACGGTCTACGTGAGATGACTGATGAAGAGGAGATGGAATACTATCTTGCCTTTGAAAAAACCCA
GGGTTTTGATATGCCCACTTTTCCTGAATCCTATTCTATGGATCGTATTGAGCCTGTATCTGAGCGTCGATTATTTTCACCAGAGCTTCAAGAATGCGCAGAGGAAGCTA
TTCAACATTACAACAAGGAAAATGGTACAAATTTTGAATTTGTGAAGATGGTGAAGGCAAATAATGGAGCTGCTTGTGGTATATTATATTATATCACCTTCGAAGTCAAG
CAAATTGGATCACCTCCAAACTCCCCCACCACAACATTCCAAGCTCATGTGCTGTTTGGTATTGATGGTATAGAGGGTTTCTCCAATTTCTCAAGAGATAGAACTATCAA
CAAGATAAGAGACCTCGGACATCAGAAGGAGTCTGCTATTTATGCGAAGCAACTTCCGTTAGGGTTTTGTCCAATTTCTGAAAAGAGCTTCATTAATGTAGCTATCTTCT
TGCATGGATTATTTCTACTGACAGATTTTGAAGCTGGGGACTTTCTAGGGGCGGCTTTCATGGCTTCTTCACCCAAATTCTGTTTCTCTGACTCAACTGACTCTGAACAC
AATTCTCCGTATCACTCTGATGGATATTACGAAGACGGTGTCCGTTCGATGAATGAAGAAGAAGAAGACGAATTTTATTGTGCCTTCCAAGAAAGCGAGGGTTTTGATGT
ACCGACTTTTCCTGAATCCTATGCTTTTGGTCTTATTGTGCCTGTACGTGACATTCTACTTACCCCAGAGCTTCAAGTATGCTCAGAAGAAGCCATTAAACATTACAACA
ATGAAAATGGTACAAATTTTGAAGTTGTGAAGATTGTGAAGGCGAATCAACAAGCTTCTTGTGGTATTTTGTATTTCATTACCTTTGAGGCCAAACAAACTGGAACACCC
TCAGAGTACCCAACCACAGCCTTCCAAGCTCGAGTTCTTTTTGGAATTTTTCATAAAATAGAGGGGGTGGCTTTCATGGCTTCTTCACCCAAATTCTGCCTCTCTGACTC
AACTGACTCTGAATGCGATTCCCCATGTGATTCTGAAGAATATTATGAAAATGGTATCCGTGGGATGAATAAGGAAGAGGAAGAGCAATATTATCGTGCATTAGAAAAAA
GCGAGGGTTTTGATGTACCAACTTTTCCAGAATCCTGTGCTTTTGGTCTTATTGTTCCTGTCTGCGACATTCCACTTACCCCAGGACTTCGAGTGTGCACAGAGGAAGCT
ATTAAACATTACAACAACGAAAATGGTACAAATTTTGAAATTGTGGAGATTGTGAAGGCAAATCAACAAGGTGCGTGTGGTATTTTGTATTTCATCACTTTCGAGGCCAA
ACAAATTGGAACACCTCCAGAGTATCCAACCACAACCTTCCAAGCTCGAGTGCTGTTTGGAATTCTTCATAAAATAGAGCCTATAACGTCTTCTTCTCTCTCTGACCCAC
CACTGAAAAACTACACCACCTCCATTTTTTCTCTCAAGGTTTCTCCTTCTCTCACTCTTCTGTCTCTCTATCACTCTGTTTCTATGGCTACCTCCTCCAAGACATCAAGA
AATTCAAGATTTCAAATTCACCGGCAGCTTCTCCCGAAAGCTCCGATTCATATCTCCAGCTCTACAGCCAGCGGCTTAACAAGTAGAGGAATAGTGACTAGTCGGAGAGT
GAACGCTGATTCCAAAGCGACTTCGAGGTTGGTGGTCTGCTCGGTAGCCTCCGAGTCTATCAAGACAATATTGCGACGGCACATCAATGACAAACTGAATGAGGAGGCGG
CAACCAAGGAAATGGGAAAGTTGTTGCTGCGGCCATCAAATTACCACAGCGATCAGAACTTGAGCAGATTTTCTACTTACAGTTTACAGTATTTTGAAGAGCAGGGGACT
TTGTTGGTGGTGGTTTTGATGGCTTCTTATCCACCCGAATTCTACTTCTCCGACCACTCATCTGCCTCTGCAGTTGCACACAATTCTAATCCATGTGATTATGATTCTGA
TGGATGGATATTATGA
Protein sequenceShow/hide protein sequence
MVGLWLWSLCGWFFYTLSGYFSLLVNLLSHFLLIGWLTLDCVTHFEAGDFLRVALMASSPEFSFSNISSPGSAHNSPNYLDGYDGDGIHEMTEEEEKEYYRALEETQGFD
VPTFPKSYAFGLIMPAPVELFSQKLQACAEEAIKHYNKENDTNFEFVKIVKANHRAARGILYFITFEVKQIGTPLEFPTTTFQARVLSGIPDTIELSFPRPSSSNSTAFI
SLRESVCWCSFGLGLYEYDVGSSILLYAALLWFLACYWLIVSHITNTPYTLTNTFAFTLNQYCFCLLFSFFFFFLFFFPEIIKQRVGDSGGYHTDALTGDFLALMAFSPK
LSPSASAYHFDEYFADGLREMTDEEEMEYYLAFEKTQGFDMPTFPESYSMDRIEPVSERRLFSPELQECAEEAIQHYNKENGTNFEFVKMVKANNGAACGILYYITFEVK
QIGSPPNSPTTTFQAHVLFGIDGIEGFSNFSRDRTINKIRDLGHQKESAIYAKQLPLGFCPISEKSFINVAIFLHGLFLLTDFEAGDFLGAAFMASSPKFCFSDSTDSEH
NSPYHSDGYYEDGVRSMNEEEEDEFYCAFQESEGFDVPTFPESYAFGLIVPVRDILLTPELQVCSEEAIKHYNNENGTNFEVVKIVKANQQASCGILYFITFEAKQTGTP
SEYPTTAFQARVLFGIFHKIEGVAFMASSPKFCLSDSTDSECDSPCDSEEYYENGIRGMNKEEEEQYYRALEKSEGFDVPTFPESCAFGLIVPVCDIPLTPGLRVCTEEA
IKHYNNENGTNFEIVEIVKANQQGACGILYFITFEAKQIGTPPEYPTTTFQARVLFGILHKIEPITSSSLSDPPLKNYTTSIFSLKVSPSLTLLSLYHSVSMATSSKTSR
NSRFQIHRQLLPKAPIHISSSTASGLTSRGIVTSRRVNADSKATSRLVVCSVASESIKTILRRHINDKLNEEAATKEMGKLLLRPSNYHSDQNLSRFSTYSLQYFEEQGT
LLVVVLMASYPPEFYFSDHSSASAVAHNSNPCDYDSDGWIL