; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr005801 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr005801
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionULP_PROTEASE domain-containing protein
Genome locationtig00004052:51206..53857
RNA-Seq ExpressionSgr005801
SyntenySgr005801
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0055303.1 putative ubiquitin-like-specific protease 2A [Cucumis melo var. makuwa]3.0e-7352.9Show/hide
Query:  MGKRQRKLQQSITYIEIDPPSTGHAFRNDLEEAENVKNFQHVPPYMLDVYTFPRRKRSIRKFGCNSAVPSRKKKLDSRAFEYCFQNLWRSFPEEKKIQFT
        MGKR+R  QQ + +I+++ P TGH+   +LEE+ENVK FQ V P +       RR++  +K GCN A+P RK+KLDSRAFEYCFQNLWRS PEEKKIQFT
Subjt:  MGKRQRKLQQSITYIEIDPPSTGHAFRNDLEEAENVKNFQHVPPYMLDVYTFPRRKRSIRKFGCNSAVPSRKKKLDSRAFEYCFQNLWRSFPEEKKIQFT

Query:  YLDCLWFTLYLKAAHKRK--------------------ICVGH-----------------------------------------------FKEDGRRENL
        YLDCLWF LYLKA+H+RK                    +C  H                                               FKEDG+ +NL
Subjt:  YLDCLWFTLYLKAAHKRK--------------------ICVGH-----------------------------------------------FKEDGRRENL

Query:  NLICKIPLMVPKVPQQKNGDECGKFVLYFIHLFMEAAPENFSIKDYPYFMRENWFTPEGVCQFYKTLDHFEDDICL
        N+ICKIPLMVPKVPQQKNGDECGKFVLYFIHLFMEAAP NF IKDYPYFM+ENWFT EGVCQFYKT  + ++D  L
Subjt:  NLICKIPLMVPKVPQQKNGDECGKFVLYFIHLFMEAAPENFSIKDYPYFMRENWFTPEGVCQFYKTLDHFEDDICL

XP_004152511.1 probable ubiquitin-like-specific protease 2A isoform X1 [Cucumis sativus]3.0e-7352.94Show/hide
Query:  QRKLQQSITYIEIDPPSTGHAFRNDLEEAENVKNFQHVPPYMLDVYTFPRRKRSIRKFGCNSAVPSRKKKLDSRAFEYCFQNLWRSFPEEKKIQFTYLDC
        +RK QQ + +I+++ P TGH+   +LEE ENVKN Q V P +  +    RR++  +K G N A+P RK+KLDSRAFEYCFQNLWRS PEEKKIQFTYLDC
Subjt:  QRKLQQSITYIEIDPPSTGHAFRNDLEEAENVKNFQHVPPYMLDVYTFPRRKRSIRKFGCNSAVPSRKKKLDSRAFEYCFQNLWRSFPEEKKIQFTYLDC

Query:  LWFTLYLKAAHKRK--------------------ICVGH-----------------------------------------------FKEDGRRENLNLIC
        LWF LYLKA+H+RK                    +C  H                                               FKEDG+ +NLN+IC
Subjt:  LWFTLYLKAAHKRK--------------------ICVGH-----------------------------------------------FKEDGRRENLNLIC

Query:  KIPLMVPKVPQQKNGDECGKFVLYFIHLFMEAAPENFSIKDYPYFMRENWFTPEGVCQFYKTLDHFEDDICL
        KIPLMVPKVPQQKNGDECGKFVLYFIHLFMEAAP NF IKDYPYFM+ENWFT EGVCQFYKT  H ++D CL
Subjt:  KIPLMVPKVPQQKNGDECGKFVLYFIHLFMEAAPENFSIKDYPYFMRENWFTPEGVCQFYKTLDHFEDDICL

XP_008439347.1 PREDICTED: probable ubiquitin-like-specific protease 2A [Cucumis melo]3.0e-7352.9Show/hide
Query:  MGKRQRKLQQSITYIEIDPPSTGHAFRNDLEEAENVKNFQHVPPYMLDVYTFPRRKRSIRKFGCNSAVPSRKKKLDSRAFEYCFQNLWRSFPEEKKIQFT
        MGKR+R  QQ + +I+++ P TGH+   +LEE+ENVK FQ V P +       RR++  +K GCN A+P RK+KLDSRAFEYCFQNLWRS PEEKKIQFT
Subjt:  MGKRQRKLQQSITYIEIDPPSTGHAFRNDLEEAENVKNFQHVPPYMLDVYTFPRRKRSIRKFGCNSAVPSRKKKLDSRAFEYCFQNLWRSFPEEKKIQFT

Query:  YLDCLWFTLYLKAAHKRK--------------------ICVGH-----------------------------------------------FKEDGRRENL
        YLDCLWF LYLKA+H+RK                    +C  H                                               FKEDG+ +NL
Subjt:  YLDCLWFTLYLKAAHKRK--------------------ICVGH-----------------------------------------------FKEDGRRENL

Query:  NLICKIPLMVPKVPQQKNGDECGKFVLYFIHLFMEAAPENFSIKDYPYFMRENWFTPEGVCQFYKTLDHFEDDICL
        N+ICKIPLMVPKVPQQKNGDECGKFVLYFIHLFMEAAP NF IKDYPYFM+ENWFT EGVCQFYKT  + ++D  L
Subjt:  NLICKIPLMVPKVPQQKNGDECGKFVLYFIHLFMEAAPENFSIKDYPYFMRENWFTPEGVCQFYKTLDHFEDDICL

XP_022137751.1 probable ubiquitin-like-specific protease 2A [Momordica charantia]1.0e-7353.4Show/hide
Query:  MGKRQRKLQQSITYIEIDPPST----------------GHAFRNDLEEAENVKNFQHVPPYMLDVYTFPRRKR--SIRKFGCNSAVPSRKKKLDSRAFEY
        MGK  RKLQ  I  I++D  +T                   +RN L+++ENV+ FQ VP Y++     PRRKR  S RK GCNSAVPSRKKKLDSRAFEY
Subjt:  MGKRQRKLQQSITYIEIDPPST----------------GHAFRNDLEEAENVKNFQHVPPYMLDVYTFPRRKR--SIRKFGCNSAVPSRKKKLDSRAFEY

Query:  CFQNLWRSFPEEKKIQFTYLDCLWFTLYLKAAHKRK--------------------ICVGH---------------------------------------
        CFQNLWRS PEEKKIQFTYLDCLWF+LYLKAAH+RK                    +C GH                                       
Subjt:  CFQNLWRSFPEEKKIQFTYLDCLWFTLYLKAAHKRK--------------------ICVGH---------------------------------------

Query:  --------FKEDGRRENLNLICKIPLMVPKVPQQKNGDECGKFVLYFIHLFMEAAPENFSIKDYPYFMRENWFTPEGVCQFYKTLDHFEDDICL
                FKE GR +NLN +CKIPLMVPKVPQQKNG+ECGKFVLYFIHLFMEAAP+NFSIKDYPYFM+E WFTPEG  QFY+ LDH EDDI L
Subjt:  --------FKEDGRRENLNLICKIPLMVPKVPQQKNGDECGKFVLYFIHLFMEAAPENFSIKDYPYFMRENWFTPEGVCQFYKTLDHFEDDICL

XP_038895554.1 probable ubiquitin-like-specific protease 2A [Benincasa hispida]1.8e-7353.99Show/hide
Query:  MGKRQRKLQQSITYIEIDPPSTGHAFRNDLEEAENVKNFQHVPPYMLDVYTFPRRKRSIRKFGCNSAVPSRKKKLDSRAFEYCFQNLWRSFPEEKKIQFT
        MGKR+R  QQ +  I+ID  +TGH  RN+LEE+ENVKNFQ V P M       RRKR+ RK GCNSA+P +K+KLDS AFEYCFQNLWRS PEEKKI FT
Subjt:  MGKRQRKLQQSITYIEIDPPSTGHAFRNDLEEAENVKNFQHVPPYMLDVYTFPRRKRSIRKFGCNSAVPSRKKKLDSRAFEYCFQNLWRSFPEEKKIQFT

Query:  YLDCLWFTLYLKAAHKRK--------------------ICVGH-----------------------------------------------FKEDGRRENL
        YLDCLWF+LYLKA+H+RK                    +C  H                                               FKEDG+ +NL
Subjt:  YLDCLWFTLYLKAAHKRK--------------------ICVGH-----------------------------------------------FKEDGRRENL

Query:  NLICKIPLMVPKVPQQKNGDECGKFVLYFIHLFMEAAPENFSIKDYPYFMRENWFTPEGVCQFYKTLDHFEDDICL
        N+ICKIPLMVPKVPQQKNGDECGKFVLYFIHLFMEAAP  F IKDYPYFM+ENWFT +GVCQFYK     ++ +CL
Subjt:  NLICKIPLMVPKVPQQKNGDECGKFVLYFIHLFMEAAPENFSIKDYPYFMRENWFTPEGVCQFYKTLDHFEDDICL

TrEMBL top hitse value%identityAlignment
A0A0A0LUA4 ULP_PROTEASE domain-containing protein1.4e-7352.94Show/hide
Query:  QRKLQQSITYIEIDPPSTGHAFRNDLEEAENVKNFQHVPPYMLDVYTFPRRKRSIRKFGCNSAVPSRKKKLDSRAFEYCFQNLWRSFPEEKKIQFTYLDC
        +RK QQ + +I+++ P TGH+   +LEE ENVKN Q V P +  +    RR++  +K G N A+P RK+KLDSRAFEYCFQNLWRS PEEKKIQFTYLDC
Subjt:  QRKLQQSITYIEIDPPSTGHAFRNDLEEAENVKNFQHVPPYMLDVYTFPRRKRSIRKFGCNSAVPSRKKKLDSRAFEYCFQNLWRSFPEEKKIQFTYLDC

Query:  LWFTLYLKAAHKRK--------------------ICVGH-----------------------------------------------FKEDGRRENLNLIC
        LWF LYLKA+H+RK                    +C  H                                               FKEDG+ +NLN+IC
Subjt:  LWFTLYLKAAHKRK--------------------ICVGH-----------------------------------------------FKEDGRRENLNLIC

Query:  KIPLMVPKVPQQKNGDECGKFVLYFIHLFMEAAPENFSIKDYPYFMRENWFTPEGVCQFYKTLDHFEDDICL
        KIPLMVPKVPQQKNGDECGKFVLYFIHLFMEAAP NF IKDYPYFM+ENWFT EGVCQFYKT  H ++D CL
Subjt:  KIPLMVPKVPQQKNGDECGKFVLYFIHLFMEAAPENFSIKDYPYFMRENWFTPEGVCQFYKTLDHFEDDICL

A0A5A7UH72 Putative ubiquitin-like-specific protease 2A1.4e-7352.9Show/hide
Query:  MGKRQRKLQQSITYIEIDPPSTGHAFRNDLEEAENVKNFQHVPPYMLDVYTFPRRKRSIRKFGCNSAVPSRKKKLDSRAFEYCFQNLWRSFPEEKKIQFT
        MGKR+R  QQ + +I+++ P TGH+   +LEE+ENVK FQ V P +       RR++  +K GCN A+P RK+KLDSRAFEYCFQNLWRS PEEKKIQFT
Subjt:  MGKRQRKLQQSITYIEIDPPSTGHAFRNDLEEAENVKNFQHVPPYMLDVYTFPRRKRSIRKFGCNSAVPSRKKKLDSRAFEYCFQNLWRSFPEEKKIQFT

Query:  YLDCLWFTLYLKAAHKRK--------------------ICVGH-----------------------------------------------FKEDGRRENL
        YLDCLWF LYLKA+H+RK                    +C  H                                               FKEDG+ +NL
Subjt:  YLDCLWFTLYLKAAHKRK--------------------ICVGH-----------------------------------------------FKEDGRRENL

Query:  NLICKIPLMVPKVPQQKNGDECGKFVLYFIHLFMEAAPENFSIKDYPYFMRENWFTPEGVCQFYKTLDHFEDDICL
        N+ICKIPLMVPKVPQQKNGDECGKFVLYFIHLFMEAAP NF IKDYPYFM+ENWFT EGVCQFYKT  + ++D  L
Subjt:  NLICKIPLMVPKVPQQKNGDECGKFVLYFIHLFMEAAPENFSIKDYPYFMRENWFTPEGVCQFYKTLDHFEDDICL

A0A5D3BJD5 Putative ubiquitin-like-specific protease 2A1.4e-7352.9Show/hide
Query:  MGKRQRKLQQSITYIEIDPPSTGHAFRNDLEEAENVKNFQHVPPYMLDVYTFPRRKRSIRKFGCNSAVPSRKKKLDSRAFEYCFQNLWRSFPEEKKIQFT
        MGKR+R  QQ + +I+++ P TGH+   +LEE+ENVK FQ V P +       RR++  +K GCN A+P RK+KLDSRAFEYCFQNLWRS PEEKKIQFT
Subjt:  MGKRQRKLQQSITYIEIDPPSTGHAFRNDLEEAENVKNFQHVPPYMLDVYTFPRRKRSIRKFGCNSAVPSRKKKLDSRAFEYCFQNLWRSFPEEKKIQFT

Query:  YLDCLWFTLYLKAAHKRK--------------------ICVGH-----------------------------------------------FKEDGRRENL
        YLDCLWF LYLKA+H+RK                    +C  H                                               FKEDG+ +NL
Subjt:  YLDCLWFTLYLKAAHKRK--------------------ICVGH-----------------------------------------------FKEDGRRENL

Query:  NLICKIPLMVPKVPQQKNGDECGKFVLYFIHLFMEAAPENFSIKDYPYFMRENWFTPEGVCQFYKTLDHFEDDICL
        N+ICKIPLMVPKVPQQKNGDECGKFVLYFIHLFMEAAP NF IKDYPYFM+ENWFT EGVCQFYKT  + ++D  L
Subjt:  NLICKIPLMVPKVPQQKNGDECGKFVLYFIHLFMEAAPENFSIKDYPYFMRENWFTPEGVCQFYKTLDHFEDDICL

A0A6J1C7K2 probable ubiquitin-like-specific protease 2A5.0e-7453.4Show/hide
Query:  MGKRQRKLQQSITYIEIDPPST----------------GHAFRNDLEEAENVKNFQHVPPYMLDVYTFPRRKR--SIRKFGCNSAVPSRKKKLDSRAFEY
        MGK  RKLQ  I  I++D  +T                   +RN L+++ENV+ FQ VP Y++     PRRKR  S RK GCNSAVPSRKKKLDSRAFEY
Subjt:  MGKRQRKLQQSITYIEIDPPST----------------GHAFRNDLEEAENVKNFQHVPPYMLDVYTFPRRKR--SIRKFGCNSAVPSRKKKLDSRAFEY

Query:  CFQNLWRSFPEEKKIQFTYLDCLWFTLYLKAAHKRK--------------------ICVGH---------------------------------------
        CFQNLWRS PEEKKIQFTYLDCLWF+LYLKAAH+RK                    +C GH                                       
Subjt:  CFQNLWRSFPEEKKIQFTYLDCLWFTLYLKAAHKRK--------------------ICVGH---------------------------------------

Query:  --------FKEDGRRENLNLICKIPLMVPKVPQQKNGDECGKFVLYFIHLFMEAAPENFSIKDYPYFMRENWFTPEGVCQFYKTLDHFEDDICL
                FKE GR +NLN +CKIPLMVPKVPQQKNG+ECGKFVLYFIHLFMEAAP+NFSIKDYPYFM+E WFTPEG  QFY+ LDH EDDI L
Subjt:  --------FKEDGRRENLNLICKIPLMVPKVPQQKNGDECGKFVLYFIHLFMEAAPENFSIKDYPYFMRENWFTPEGVCQFYKTLDHFEDDICL

E5GBW7 Sentrin/sumo-specific protease1.4e-7352.9Show/hide
Query:  MGKRQRKLQQSITYIEIDPPSTGHAFRNDLEEAENVKNFQHVPPYMLDVYTFPRRKRSIRKFGCNSAVPSRKKKLDSRAFEYCFQNLWRSFPEEKKIQFT
        MGKR+R  QQ + +I+++ P TGH+   +LEE+ENVK FQ V P +       RR++  +K GCN A+P RK+KLDSRAFEYCFQNLWRS PEEKKIQFT
Subjt:  MGKRQRKLQQSITYIEIDPPSTGHAFRNDLEEAENVKNFQHVPPYMLDVYTFPRRKRSIRKFGCNSAVPSRKKKLDSRAFEYCFQNLWRSFPEEKKIQFT

Query:  YLDCLWFTLYLKAAHKRK--------------------ICVGH-----------------------------------------------FKEDGRRENL
        YLDCLWF LYLKA+H+RK                    +C  H                                               FKEDG+ +NL
Subjt:  YLDCLWFTLYLKAAHKRK--------------------ICVGH-----------------------------------------------FKEDGRRENL

Query:  NLICKIPLMVPKVPQQKNGDECGKFVLYFIHLFMEAAPENFSIKDYPYFMRENWFTPEGVCQFYKTLDHFEDDICL
        N+ICKIPLMVPKVPQQKNGDECGKFVLYFIHLFMEAAP NF IKDYPYFM+ENWFT EGVCQFYKT  + ++D  L
Subjt:  NLICKIPLMVPKVPQQKNGDECGKFVLYFIHLFMEAAPENFSIKDYPYFMRENWFTPEGVCQFYKTLDHFEDDICL

SwissProt top hitse value%identityAlignment
Q2PS26 Ubiquitin-like-specific protease 1D1.9e-0646.55Show/hide
Query:  NLICKIPLMVPKVPQQKNGDECGKFVLYFIHLFMEAAPENFSIKDYPYFMRENWFTPE
        NL  +I   V +VPQQKN  +CG FVL+FI  F+E AP+    KD   F ++ WF P+
Subjt:  NLICKIPLMVPKVPQQKNGDECGKFVLYFIHLFMEAAPENFSIKDYPYFMRENWFTPE

Q8L7S0 Probable ubiquitin-like-specific protease 2B3.1e-0442.86Show/hide
Query:  KVPQQKNGDECGKFVLYFIHLFMEAAPENFS---IKDYPYFMRENWFTP
        ++PQQ+N  +CG F+L+++ LF+  AP NFS   I +   F+  NWF P
Subjt:  KVPQQKNGDECGKFVLYFIHLFMEAAPENFS---IKDYPYFMRENWFTP

Q8RWN0 Ubiquitin-like-specific protease 1C1.6e-0544.68Show/hide
Query:  KVPQQKNGDECGKFVLYFIHLFMEAAPENFSIKDYPYFMRENWFTPE
        +VPQQKN  +CG F+L+FI  F+E AP+  +++D    + + WF PE
Subjt:  KVPQQKNGDECGKFVLYFIHLFMEAAPENFSIKDYPYFMRENWFTPE

Arabidopsis top hitse value%identityAlignment
AT1G09730.1 Cysteine proteinases superfamily protein2.2e-0542.86Show/hide
Query:  KVPQQKNGDECGKFVLYFIHLFMEAAPENFS---IKDYPYFMRENWFTP
        ++PQQ+N  +CG F+L+++ LF+  AP NFS   I +   F+  NWF P
Subjt:  KVPQQKNGDECGKFVLYFIHLFMEAAPENFS---IKDYPYFMRENWFTP

AT1G10570.1 Cysteine proteinases superfamily protein1.2e-0644.68Show/hide
Query:  KVPQQKNGDECGKFVLYFIHLFMEAAPENFSIKDYPYFMRENWFTPE
        +VPQQKN  +CG F+L+FI  F+E AP+  +++D    + + WF PE
Subjt:  KVPQQKNGDECGKFVLYFIHLFMEAAPENFSIKDYPYFMRENWFTPE

AT1G10570.2 Cysteine proteinases superfamily protein1.2e-0644.68Show/hide
Query:  KVPQQKNGDECGKFVLYFIHLFMEAAPENFSIKDYPYFMRENWFTPE
        +VPQQKN  +CG F+L+FI  F+E AP+  +++D    + + WF PE
Subjt:  KVPQQKNGDECGKFVLYFIHLFMEAAPENFSIKDYPYFMRENWFTPE

AT1G60220.1 UB-like protease 1D1.4e-0746.55Show/hide
Query:  NLICKIPLMVPKVPQQKNGDECGKFVLYFIHLFMEAAPENFSIKDYPYFMRENWFTPE
        NL  +I   V +VPQQKN  +CG FVL+FI  F+E AP+    KD   F ++ WF P+
Subjt:  NLICKIPLMVPKVPQQKNGDECGKFVLYFIHLFMEAAPENFSIKDYPYFMRENWFTPE

AT3G48480.1 Cysteine proteinases superfamily protein8.2e-2930.2Show/hide
Query:  EEAENVKNFQHVPPYMLDVYTFPRRKRSIRKFGCNSAVPSRKKKLDSRAFEYCFQNLWRSFPEEKKIQFTYLDCLWFTLYLKAAHK--------------
        +  + ++ F+   P   D  T  RR RS R+  C       +KKL+S+AF    +++WR F +EKK  F YLDCLWF++Y    H               
Subjt:  EEAENVKNFQHVPPYMLDVYTFPRRKRSIRKFGCNSAVPSRKKKLDSRAFEYCFQNLWRSFPEEKKIQFTYLDCLWFTLYLKAAHK--------------

Query:  ------------------------------------------------------RKICVGHFKEDGRRENLNLICKIPLMVPKVPQQKNGDECGKFVLYF
                                                              RK  +  ++ +GR E+ +L+ +IP  VP VPQQ N  ECG FVLY+
Subjt:  ------------------------------------------------------RKICVGHFKEDGRRENLNLICKIPLMVPKVPQQKNGDECGKFVLYF

Query:  IHLFMEAAPENFSIKDYPYFMRENWFTPEGVCQFYKTLDHFEDDI
        IH F+E APENF+++D PYF++E+WF+       +K L+ F D++
Subjt:  IHLFMEAAPENFSIKDYPYFMRENWFTPEGVCQFYKTLDHFEDDI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTAAGCGTCAGCGAAAGCTTCAACAATCGATAACCTACATCGAAATTGATCCACCGAGCACAGGGCATGCTTTCAGGAATGATTTAGAAGAAGCTGAGAATGTTAA
AAATTTTCAGCATGTACCCCCCTATATGTTAGATGTTTATACCTTTCCTCGTCGCAAGCGATCAATACGGAAATTTGGTTGCAATAGTGCAGTTCCTAGTCGGAAAAAGA
AGCTTGACTCTAGAGCTTTTGAATATTGTTTTCAGAATTTATGGAGGAGCTTTCCAGAAGAGAAGAAGATTCAGTTTACATACCTTGACTGCTTATGGTTTACCTTGTAC
TTGAAAGCAGCCCACAAAAGAAAGATTTGTGTTGGACATTTCAAAGAAGATGGCAGACGCGAAAACTTGAATTTAATTTGCAAAATTCCTCTCATGGTGCCGAAGGTGCC
ACAACAGAAAAATGGTGATGAATGTGGCAAATTTGTTTTGTACTTCATTCATTTGTTCATGGAGGCTGCTCCAGAGAATTTTAGCATCAAGGACTACCCTTACTTTATGA
GAGAGAATTGGTTTACGCCAGAAGGTGTTTGCCAATTCTACAAGACACTCGACCATTTCGAAGACGATATCTGTCTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGTAAGCGTCAGCGAAAGCTTCAACAATCGATAACCTACATCGAAATTGATCCACCGAGCACAGGGCATGCTTTCAGGAATGATTTAGAAGAAGCTGAGAATGTTAA
AAATTTTCAGCATGTACCCCCCTATATGTTAGATGTTTATACCTTTCCTCGTCGCAAGCGATCAATACGGAAATTTGGTTGCAATAGTGCAGTTCCTAGTCGGAAAAAGA
AGCTTGACTCTAGAGCTTTTGAATATTGTTTTCAGAATTTATGGAGGAGCTTTCCAGAAGAGAAGAAGATTCAGTTTACATACCTTGACTGCTTATGGTTTACCTTGTAC
TTGAAAGCAGCCCACAAAAGAAAGATTTGTGTTGGACATTTCAAAGAAGATGGCAGACGCGAAAACTTGAATTTAATTTGCAAAATTCCTCTCATGGTGCCGAAGGTGCC
ACAACAGAAAAATGGTGATGAATGTGGCAAATTTGTTTTGTACTTCATTCATTTGTTCATGGAGGCTGCTCCAGAGAATTTTAGCATCAAGGACTACCCTTACTTTATGA
GAGAGAATTGGTTTACGCCAGAAGGTGTTTGCCAATTCTACAAGACACTCGACCATTTCGAAGACGATATCTGTCTTTAA
Protein sequenceShow/hide protein sequence
MGKRQRKLQQSITYIEIDPPSTGHAFRNDLEEAENVKNFQHVPPYMLDVYTFPRRKRSIRKFGCNSAVPSRKKKLDSRAFEYCFQNLWRSFPEEKKIQFTYLDCLWFTLY
LKAAHKRKICVGHFKEDGRRENLNLICKIPLMVPKVPQQKNGDECGKFVLYFIHLFMEAAPENFSIKDYPYFMRENWFTPEGVCQFYKTLDHFEDDICL