CuGenDBv2

Spg025398 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg025398
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Ulp1-like peptidase
Genome location: scaffold13:42184735..42186780
RNA-Seq Expression: Spg025398
Synteny: Spg025398
Gene Ontology terms: GO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domains: IPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily
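The genome location field above can be split into scaffold, start, and end for downstream use. The following is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'seqid:start..end' string into its three parts."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the genomic span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("scaffold13:42184735..42186780")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 2046 bp on scaffold13.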


Homology
GenBank top hits (e value, % identity, alignment)
XP_022148308.1 uncharacterized protein LOC111016993 [Momordica charantia] (e value: 2.2e-40; identity: 42.25%)
Query:  GSERKTVYAYRSKQWFQTLLKPSHWMSDEVIDSLFLFIRKKMDTRPDLCHRNFVTADVFVREFLRREDVYEELLRSDHGNDT-----FDW-SRFKTITNY
        G+ RKTVY+ ++K WF+ LL+P +W + EV+D LF+ +RKK++ RPDLC R F T D+ +  + RR+D     ++SD    +     +DW  R ++I  Y
Subjt:  GSERKTVYAYRSKQWFQTLLKPSHWMSDEVIDSLFLFIRKKMDTRPDLCHRNFVTADVFVREFLRREDVYEELLRSDHGNDT-----FDW-SRFKTITNY

Query:  VMGEHTDYGVPWSSVDAVYMPFNLGRNHWVLLCADFETGEFVLTDSLTALNSDADIAKQMNTVCTIFPRLLLRCDVMKDKPSLPTHP
          G HTDY + W  VDA+Y+PFN+   HWV++C D E GE V+ DSL ++ +D  +   +  + TI P +L +CDVMK +P+LP  P
Subjt:  VMGEHTDYGVPWSSVDAVYMPFNLGRNHWVLLCADFETGEFVLTDSLTALNSDADIAKQMNTVCTIFPRLLLRCDVMKDKPSLPTHP

XP_022154364.1 uncharacterized protein LOC111021646 [Momordica charantia] (e value: 6.8e-42; identity: 39.52%)
Query:  LFIRKKMDTRPDLCHRNFVTADVFVREFLRREDVYEELLRS-----DHGNDTFDW-SRFKTITNYVMGEHTDYGVPWSSVDAVYMPFNLGRNHWVLLCAD
        +F+  K+  RP+LC R F T DV +  FLR  D    +++S           +DW  R  ++ +Y+ G H+D    W  VDAVY+P+N+G  HW+++C D
Subjt:  LFIRKKMDTRPDLCHRNFVTADVFVREFLRREDVYEELLRS-----DHGNDTFDW-SRFKTITNYVMGEHTDYGVPWSSVDAVYMPFNLGRNHWVLLCAD

Query:  FETGEFVLTDSLTALNSDADIAKQMNTVCTIFPRLLLRCDVMKDKPSLPTHPWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKIEFCRR
        F+ GE ++ DS   +     + +++  + TI P L+ R  V   KP++P  PWR RR +  PQQ   GDCG+F + F EYDVT     +L+Q ++ F RR
Subjt:  FETGEFVLTDSLTALNSDADIAKQMNTVCTIFPRLLLRCDVMKDKPSLPTHPWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKIEFCRR

Query:  QFAVQLWANR
        QFAVQLWAN+
Subjt:  QFAVQLWANR

XP_022158807.1 uncharacterized protein LOC111025273 [Momordica charantia] (e value: 1.0e-45; identity: 39.04%)
Query:  LEDPLSDGSERKTVYAYRSKQWFQTLLKPSHWMSDEVIDSLFLFIRKKMDTRPDLCHRNFVTADVFVREFLRREDVYEELLRSD--HGNDTFDWSRFKTI
        ++DP +D + R T    + K WF  LL P   + DE IDSL +   +K++    L    F   DV +   LRR D     ++        T+DW + +TI
Subjt:  LEDPLSDGSERKTVYAYRSKQWFQTLLKPSHWMSDEVIDSLFLFIRKKMDTRPDLCHRNFVTADVFVREFLRREDVYEELLRSD--HGNDTFDWSRFKTI

Query:  TNYVMGEHTDYGVPWSSVDAVYMPFNLGRNHWVLLCADFETGEFVLTDSLTALNSDADIAKQMNTVCTIFPRLLLRCDVMKDKPSLPTHPWRFRRKTQVP
          YV+G  +DY   WS  D VY   N+G NHWV++  D   G+  + DSL A+    D+ K +  +CTI P +L    ++  +P+LP  PWR RR T VP
Subjt:  TNYVMGEHTDYGVPWSSVDAVYMPFNLGRNHWVLLCADFETGEFVLTDSLTALNSDADIAKQMNTVCTIFPRLLLRCDVMKDKPSLPTHPWRFRRKTQVP

Query:  QQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKIEFCRRQFAVQLWANRPFF
        QQ    DC +F V+F EYDV  S + +L Q  I   RRQ+AVQ+WA RPFF
Subjt:  QQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKIEFCRRQFAVQLWANRPFF

XP_031738492.1 uncharacterized protein LOC116402715 [Cucumis sativus] (e value: 5.7e-41; identity: 33.33%)
Query:  YDPMRTIPEEYETKFQKWLEDPLSDGSERKTVYAYRSKQWFQTLLKPSHWMSDEVIDSLFLFIRKKMDTRPDLCHRNFVTAD-VFVREFLRREDVYEELL
        YDPM  IP+ +  + + W+ D  +    R+T +  +SK++F+ L     W++DE +D+LFL IR  + T      +NF T D +F+R  + +   Y+E +
Subjt:  YDPMRTIPEEYETKFQKWLEDPLSDGSERKTVYAYRSKQWFQTLLKPSHWMSDEVIDSLFLFIRKKMDTRPDLCHRNFVTAD-VFVREFLRREDVYEELL

Query:  RSDHGNDTFDWSRFKTITNYVMGEHTDYGVPWSSVDAVYMPFNLGRNHWVLLCADFETGEFVLTDSLTALNSDADIAKQMNTVCTIFPRLLLRCDVMKDK
        +    N  F W     + +YV+G   D   PW +VD +Y PFN+  NHW+LLC D    +  + DSL +L S  D+   +  +  + P LL        +
Subjt:  RSDHGNDTFDWSRFKTITNYVMGEHTDYGVPWSSVDAVYMPFNLGRNHWVLLCADFETGEFVLTDSLTALNSDADIAKQMNTVCTIFPRLLLRCDVMKDK

Query:  PSLPTH--PWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKIEFCRRQFAVQLWANRPFF
            TH  PW       +P Q+++ DCGVFT+K+ EY+ +  D+ +L QE + + R+Q A QLW N P +
Subjt:  PSLPTH--PWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKIEFCRRQFAVQLWANRPFF

XP_038882332.1 uncharacterized protein LOC120073583 [Benincasa hispida] (e value: 6.8e-42; identity: 51.74%)
Query:  EELLRSDHGNDTFDWSRFKTITNYVMGEHTDYGVPWSSVDAVYMPFNLGRNHWVLLCADFETGEFVLTDSLTALNSDADIAKQMNTVCTIFPRLLLRCDV
        E  LR D    T DWS  K +  YV G+HTDY VPWS+VDAVYMPFNL   HWVL+CADF+  E ++ DSL AL+ +AD+  +M  VC  FP LL+   V
Subjt:  EELLRSDHGNDTFDWSRFKTITNYVMGEHTDYGVPWSSVDAVYMPFNLGRNHWVLLCADFETGEFVLTDSLTALNSDADIAKQMNTVCTIFPRLLLRCDV

Query:  MKDKPSLPTHPWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKIEFCRRQFAVQLWANRPFF
        M +  +L    W  RR     QQ +SGDCG+FT KF EYDVT S +G+L+Q++ ++ RRQ+A+Q+WANR  F
Subjt:  MKDKPSLPTHPWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKIEFCRRQFAVQLWANRPFF

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TD74 Ulp1-like peptidase (e value: 1.1e-40; identity: 34.07%)
Query:  YDPMRTIPEEYETKFQKWLEDPLSDGSERKTVYAYRSKQWFQTLLKPSHWMSDEVIDSLFLFIRKKMDTRPDLCHRNFVTAD-VFVREFLRREDVYEELL
        YDPM  I + +  + + W+ D  +D   R+T +  +SK +F+ L     W++DE +D+LFLFIR K+        +NF TAD +F    + +  +Y+E +
Subjt:  YDPMRTIPEEYETKFQKWLEDPLSDGSERKTVYAYRSKQWFQTLLKPSHWMSDEVIDSLFLFIRKKMDTRPDLCHRNFVTAD-VFVREFLRREDVYEELL

Query:  RSDHGNDTFDWSRFKTITNYVMGEHTDYGVPWSSVDAVYMPFNLGRNHWVLLCADFETGEFVLTDSLTALNSDADIAKQMNTVCTIFPRLLLRCDVM--K
        +    N  FDW     + +YV+G   D+  PW+SVD VY PFN+  NHWVLLC D  + +  + DSL +L +  ++   +  +  + P+LL        +
Subjt:  RSDHGNDTFDWSRFKTITNYVMGEHTDYGVPWSSVDAVYMPFNLGRNHWVLLCADFETGEFVLTDSLTALNSDADIAKQMNTVCTIFPRLLLRCDVM--K

Query:  DKPSLPTHPWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKIEFCRRQFAVQLWANRPFF
         + S    PW       +P Q+++ DCGVFT+K+ EY      L +L QE + + R+Q A QLW N P +
Subjt:  DKPSLPTHPWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKIEFCRRQFAVQLWANRPFF

A0A5A7TSP7 Ulp1-like peptidase (e value: 3.5e-36; identity: 28.27%)
Query:  EDKDKWREKEEEVEKEDESREKKKKMMMKQTFCLTFVNFQGKFVDLKKYFGS-KDGPDDVGG-PSKGPDDVGGPSKGPDDNEKDGKEKDVDEAYDIEHIM
        + K+   ++  EV  +++  +K KK   K        N Q +   ++    + K   D++ G  S     +G   KG + + K      V E  + EHI 
Subjt:  EDKDKWREKEEEVEKEDESREKKKKMMMKQTFCLTFVNFQGKFVDLKKYFGS-KDGPDDVGG-PSKGPDDVGGPSKGPDDNEKDGKEKDVDEAYDIEHIM

Query:  ELESQPTTDLESHSITDVESQPTTDPVELIAPKAEPVELGDE----EVEKVIGQHELVKRRGKRTRQISWKLRSPWA-------DTRPGGKRRKVKQYDP
                +++     DV  +     +E      EP+++ D+    E+E  + Q   V  R  R ++ S  L +P+         +       +   YDP
Subjt:  ELESQPTTDLESHSITDVESQPTTDPVELIAPKAEPVELGDE----EVEKVIGQHELVKRRGKRTRQISWKLRSPWA-------DTRPGGKRRKVKQYDP

Query:  MRTIPEEYETKFQKWLEDPLSDGSERKTVYAYRSKQWFQTLLKPSHWMSDEVIDSLFLFIRKKMDTRPDLCHRNFVTAD-VFVREFLRREDVYEELLRSD
        M  I + +  + + W+ D  +D   R+T +  +SK +F+ L     W+SDE +D+LFLFIR K+        +NF TAD +F+R  + +  +Y+E ++  
Subjt:  MRTIPEEYETKFQKWLEDPLSDGSERKTVYAYRSKQWFQTLLKPSHWMSDEVIDSLFLFIRKKMDTRPDLCHRNFVTAD-VFVREFLRREDVYEELLRSD

Query:  HGNDTFDWSRFKTITNYVMGEHTDYGVPWSSVDAVYMPFNLGRNHWVLLCADFETGEFVLTDSLTALNSDADIAKQMNTVCTIFPRLLLRCDVM--KDKP
          N  FDW     + +YV+G   D+  PW+SVD VY PFN+  NHWVLLC D  + +  + DSL +L +  ++   +  +  + P+LL        + + 
Subjt:  HGNDTFDWSRFKTITNYVMGEHTDYGVPWSSVDAVYMPFNLGRNHWVLLCADFETGEFVLTDSLTALNSDADIAKQMNTVCTIFPRLLLRCDVM--KDKP

Query:  SLPTHPWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKIEFCRRQFAVQLWANRPFF
        S    PW       +P Q+++ DCGVFT+K+ EY  T   L +L QE + + R+Q A QLW N P +
Subjt:  SLPTHPWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKIEFCRRQFAVQLWANRPFF

A0A5A7UQZ5 Ulp1-like peptidase (e value: 3.4e-39; identity: 30.63%)
Query:  EPVELGDE----EVEKVIGQHELVKRRGKRTRQISWKLRSPW-------ADTRPGGKRRKVKQYDPMRTIPEEYETKFQKWLEDPLSDGSERKTVYAYRS
        EP+++ D+    E+E  + Q   V  R  R ++ S  L +P+         +     + +   YDPM  I + +  + + W+ D  +D   R+  +  +S
Subjt:  EPVELGDE----EVEKVIGQHELVKRRGKRTRQISWKLRSPW-------ADTRPGGKRRKVKQYDPMRTIPEEYETKFQKWLEDPLSDGSERKTVYAYRS

Query:  KQWFQTLLKPSHWMSDEVIDSLFLFIRKKMDTRPDLCHRNFVTA-DVFVREFLRREDVYEELLRSDHGNDTFDWSRFKTITNYVMGEHTDYGVPWSSVDA
        K +F+ L     W++DE +D+LFLFIR K+        +NF TA  +F+R  + +  +Y+E ++    N  FDW     + +YV+G   D+  PW+SVD 
Subjt:  KQWFQTLLKPSHWMSDEVIDSLFLFIRKKMDTRPDLCHRNFVTA-DVFVREFLRREDVYEELLRSDHGNDTFDWSRFKTITNYVMGEHTDYGVPWSSVDA

Query:  VYMPFNLGRNHWVLLCADFETGEFVLTDSLTALNSDADIAKQMNTVCTIFPRLLLRCDVM--KDKPSLPTHPWRFRRKTQVPQQQDSGDCGVFTVKFLEY
        VY PFN+  NHWVLLC D  + +  + DSL +L +  ++   +  +  + P+LL        + + S    PW       +P Q+++ DCG+FT+K+ EY
Subjt:  VYMPFNLGRNHWVLLCADFETGEFVLTDSLTALNSDADIAKQMNTVCTIFPRLLLRCDVM--KDKPSLPTHPWRFRRKTQVPQQQDSGDCGVFTVKFLEY

Query:  DVTRSDLGSLSQEKIEFCRRQFAVQLWANRPFF
              L +L QE + + R+Q A QLW N P +
Subjt:  DVTRSDLGSLSQEKIEFCRRQFAVQLWANRPFF

A0A6J1DLV0 uncharacterized protein LOC111021646 (e value: 3.3e-42; identity: 39.52%)
Query:  LFIRKKMDTRPDLCHRNFVTADVFVREFLRREDVYEELLRS-----DHGNDTFDW-SRFKTITNYVMGEHTDYGVPWSSVDAVYMPFNLGRNHWVLLCAD
        +F+  K+  RP+LC R F T DV +  FLR  D    +++S           +DW  R  ++ +Y+ G H+D    W  VDAVY+P+N+G  HW+++C D
Subjt:  LFIRKKMDTRPDLCHRNFVTADVFVREFLRREDVYEELLRS-----DHGNDTFDW-SRFKTITNYVMGEHTDYGVPWSSVDAVYMPFNLGRNHWVLLCAD

Query:  FETGEFVLTDSLTALNSDADIAKQMNTVCTIFPRLLLRCDVMKDKPSLPTHPWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKIEFCRR
        F+ GE ++ DS   +     + +++  + TI P L+ R  V   KP++P  PWR RR +  PQQ   GDCG+F + F EYDVT     +L+Q ++ F RR
Subjt:  FETGEFVLTDSLTALNSDADIAKQMNTVCTIFPRLLLRCDVMKDKPSLPTHPWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKIEFCRR

Query:  QFAVQLWANR
        QFAVQLWAN+
Subjt:  QFAVQLWANR

A0A6J1DY60 uncharacterized protein LOC111025273 (e value: 4.9e-46; identity: 39.04%)
Query:  LEDPLSDGSERKTVYAYRSKQWFQTLLKPSHWMSDEVIDSLFLFIRKKMDTRPDLCHRNFVTADVFVREFLRREDVYEELLRSD--HGNDTFDWSRFKTI
        ++DP +D + R T    + K WF  LL P   + DE IDSL +   +K++    L    F   DV +   LRR D     ++        T+DW + +TI
Subjt:  LEDPLSDGSERKTVYAYRSKQWFQTLLKPSHWMSDEVIDSLFLFIRKKMDTRPDLCHRNFVTADVFVREFLRREDVYEELLRSD--HGNDTFDWSRFKTI

Query:  TNYVMGEHTDYGVPWSSVDAVYMPFNLGRNHWVLLCADFETGEFVLTDSLTALNSDADIAKQMNTVCTIFPRLLLRCDVMKDKPSLPTHPWRFRRKTQVP
          YV+G  +DY   WS  D VY   N+G NHWV++  D   G+  + DSL A+    D+ K +  +CTI P +L    ++  +P+LP  PWR RR T VP
Subjt:  TNYVMGEHTDYGVPWSSVDAVYMPFNLGRNHWVLLCADFETGEFVLTDSLTALNSDADIAKQMNTVCTIFPRLLLRCDVMKDKPSLPTHPWRFRRKTQVP

Query:  QQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKIEFCRRQFAVQLWANRPFF
        QQ    DC +F V+F EYDV  S + +L Q  I   RRQ+AVQ+WA RPFF
Subjt:  QQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKIEFCRRQFAVQLWANRPFF

SwissProt top hits (e value, % identity, alignment)
Q94F30 Ubiquitin-like-specific protease ESD4 (e value: 3.1e-05; identity: 22.48%)
Query:  LKPSHWMSDEVIDSLFLFIRKKMDTRPDLCHRNFVTADVFVREFLRREDVYEELLRSDHGNDTFDWSRFKTITNYVMGEHTDYGVPWSSVDAVYMPFNLG
        L PS W++DEVI +++L + K+ +TR     + ++    F   F ++       L SD G +      FK +  +       Y +     D +++P + G
Subjt:  LKPSHWMSDEVIDSLFLFIRKKMDTRPDLCHRNFVTADVFVREFLRREDVYEELLRSDHGNDTFDWSRFKTITNYVMGEHTDYGVPWSSVDAVYMPFNLG

Query:  RNHWVLLCADFETGEFVLTDSLTALNSDA--DIAKQMNTVCTIFPRLLLRCDVMKDKPSLPTHPWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVTRSDLG
          HW L   +    + +  DSL  ++      +AK M                 K    +  + W       +PQQ++  DCG+F +K++++  +R    
Subjt:  RNHWVLLCADFETGEFVLTDSLTALNSDA--DIAKQMNTVCTIFPRLLLRCDVMKDKPSLPTHPWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVTRSDLG

Query:  SLSQEKIEFCRRQFAVQL
          SQE + + R + A ++
Subjt:  SLSQEKIEFCRRQFAVQL

Arabidopsis top hits (e value, % identity, alignment)
AT2G07240.1 cysteine-type peptidases;cysteine-type peptidases (e value: 4.0e-08; identity: 30.3%)
Query:  WSSVDAVYMPFNLGRNHWVLLCADFETGEFVLTDSLTALNSDADIAKQMNTVCTIFPRLLLRCDVMKDKPSLPTHPWRFRRKTQVPQQQDSGDCGVFTV
        ++  D VYMPFN  + HWV LC D +  +  + DS   L  DA +  ++  +  + P L  +        SL   P+   R   +PQ     D GV +V
Subjt:  WSSVDAVYMPFNLGRNHWVLLCADFETGEFVLTDSLTALNSDADIAKQMNTVCTIFPRLLLRCDVMKDKPSLPTHPWRFRRKTQVPQQQDSGDCGVFTV

AT4G15880.1 Cysteine proteinases superfamily protein (e value: 2.2e-06; identity: 22.48%)
Query:  LKPSHWMSDEVIDSLFLFIRKKMDTRPDLCHRNFVTADVFVREFLRREDVYEELLRSDHGNDTFDWSRFKTITNYVMGEHTDYGVPWSSVDAVYMPFNLG
        L PS W++DEVI +++L + K+ +TR     + ++    F   F ++       L SD G +      FK +  +       Y +     D +++P + G
Subjt:  LKPSHWMSDEVIDSLFLFIRKKMDTRPDLCHRNFVTADVFVREFLRREDVYEELLRSDHGNDTFDWSRFKTITNYVMGEHTDYGVPWSSVDAVYMPFNLG

Query:  RNHWVLLCADFETGEFVLTDSLTALNSDA--DIAKQMNTVCTIFPRLLLRCDVMKDKPSLPTHPWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVTRSDLG
          HW L   +    + +  DSL  ++      +AK M                 K    +  + W       +PQQ++  DCG+F +K++++  +R    
Subjt:  RNHWVLLCADFETGEFVLTDSLTALNSDA--DIAKQMNTVCTIFPRLLLRCDVMKDKPSLPTHPWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVTRSDLG

Query:  SLSQEKIEFCRRQFAVQL
          SQE + + R + A ++
Subjt:  SLSQEKIEFCRRQFAVQL

AT5G45570.1 Ulp1 protease family protein (e value: 7.3e-10; identity: 27.91%)
Query:  VDAVYMPFNLGRNHWVLLCADFETGEFVLTDSLTALNSDADIAKQMNTVCTIFPRLLLRCDVMKD-KPSLPTHPWRFRRKTQVPQQQDSGDCGVFTVKFL
        VD +Y    +  NHWV L  D       + DS+ +L +D ++A Q   V T+ P +L      K  + S     W  +R T++P+  D GDC ++++K++
Subjt:  VDAVYMPFNLGRNHWVLLCADFETGEFVLTDSLTALNSDADIAKQMNTVCTIFPRLLLRCDVMKD-KPSLPTHPWRFRRKTQVPQQQDSGDCGVFTVKFL

Query:  EYDVTRSDLGSLSQEKIEFCRRQFAVQLW
        E          L  E ++  R + AV+++
Subjt:  EYDVTRSDLGSLSQEKIEFCRRQFAVQLW


Sequences
CDS sequence
ATGAATCGAGTGATGGAGCCTCCACGCGTGCCAGAGCCAGTGCATGAGCCAGTACTCGAGTCAGGGCAAGAGCCAGATGAAGAACGAGAGAGTCTACCTACTGTA
TCCGAGGCTAGACTACCTGATGTAGAGGAGGCTACTGTAGGCGACACTGAGATGGTAGATGTCATTGATGCAAATAAGAGAGGAATGGAAAAAGAGGACAAAGAC
AAATGGAGAGAGAAAGAGGAGGAAGTAGAAAAAGAGGATGAGAGCAGGGAGAAGAAGAAGAAGATGATGATGAAGCAGACCTTTTGTCTAACATTTGTAAACTTT
CAGGGAAAATTCGTGGACCTTAAGAAGTACTTTGGATCGAAAGATGGTCCGGATGATGTGGGTGGTCCATCGAAAGGACCGGATGACGTGGGTGGTCCATCGAAA
GGACCGGATGACAATGAGAAGGACGGAAAGGAGAAGGACGTTGATGAGGCGTACGACATAGAGCATATTATGGAGTTGGAGTCTCAACCAACCACTGACTTAGAG
TCTCACTCAATTACTGACGTGGAGTCTCAACCAACTACAGACCCAGTCGAACTAATTGCACCTAAGGCTGAGCCTGTTGAGTTAGGTGATGAGGAGGTTGAAAAA
GTAATCGGGCAGCATGAATTGGTAAAAAGACGAGGAAAGCGGACCCGACAAATTTCTTGGAAGCTTCGGTCTCCATGGGCTGACACCAGGCCGGGCGGCAAAAGG
CGAAAAGTTAAGCAATACGATCCCATGCGTACCATTCCTGAGGAATACGAGACCAAGTTTCAGAAATGGTTGGAAGACCCATTGTCTGACGGATCGGAGCGTAAG
ACGGTATATGCCTACAGAAGCAAGCAGTGGTTTCAGACATTACTCAAACCGTCTCATTGGATGAGTGATGAGGTGATTGACTCTCTCTTCCTCTTTATTCGGAAG
AAGATGGATACCCGTCCTGACTTATGTCATCGAAACTTTGTCACGGCGGATGTATTTGTAAGAGAATTTTTGAGGCGCGAGGATGTGTACGAAGAACTCCTTCGT
AGTGACCATGGGAACGACACGTTCGATTGGAGCAGATTCAAGACGATCACTAACTACGTAATGGGAGAACACACAGATTACGGCGTTCCTTGGAGTTCCGTTGAT
GCTGTCTACATGCCCTTCAACTTAGGTAGAAACCATTGGGTTCTACTGTGCGCTGACTTTGAAACGGGCGAATTTGTGTTGACAGACTCCCTAACGGCACTGAAT
TCAGATGCAGACATAGCCAAGCAGATGAATACGGTATGCACCATTTTTCCTAGGCTGCTATTAAGGTGCGACGTTATGAAGGACAAGCCGTCTCTTCCAACACAT
CCATGGCGATTCAGAAGGAAGACCCAAGTGCCACAACAACAAGATAGTGGGGATTGTGGGGTTTTCACTGTAAAGTTTTTGGAATATGATGTAACTAGATCAGAT
TTAGGTAGTCTTAGTCAGGAGAAAATTGAGTTTTGCAGGCGTCAATTTGCTGTACAACTTTGGGCCAATAGGCCGTTCTTTTAG
mRNA sequence
ATGAATCGAGTGATGGAGCCTCCACGCGTGCCAGAGCCAGTGCATGAGCCAGTACTCGAGTCAGGGCAAGAGCCAGATGAAGAACGAGAGAGTCTACCTACTGTA
TCCGAGGCTAGACTACCTGATGTAGAGGAGGCTACTGTAGGCGACACTGAGATGGTAGATGTCATTGATGCAAATAAGAGAGGAATGGAAAAAGAGGACAAAGAC
AAATGGAGAGAGAAAGAGGAGGAAGTAGAAAAAGAGGATGAGAGCAGGGAGAAGAAGAAGAAGATGATGATGAAGCAGACCTTTTGTCTAACATTTGTAAACTTT
CAGGGAAAATTCGTGGACCTTAAGAAGTACTTTGGATCGAAAGATGGTCCGGATGATGTGGGTGGTCCATCGAAAGGACCGGATGACGTGGGTGGTCCATCGAAA
GGACCGGATGACAATGAGAAGGACGGAAAGGAGAAGGACGTTGATGAGGCGTACGACATAGAGCATATTATGGAGTTGGAGTCTCAACCAACCACTGACTTAGAG
TCTCACTCAATTACTGACGTGGAGTCTCAACCAACTACAGACCCAGTCGAACTAATTGCACCTAAGGCTGAGCCTGTTGAGTTAGGTGATGAGGAGGTTGAAAAA
GTAATCGGGCAGCATGAATTGGTAAAAAGACGAGGAAAGCGGACCCGACAAATTTCTTGGAAGCTTCGGTCTCCATGGGCTGACACCAGGCCGGGCGGCAAAAGG
CGAAAAGTTAAGCAATACGATCCCATGCGTACCATTCCTGAGGAATACGAGACCAAGTTTCAGAAATGGTTGGAAGACCCATTGTCTGACGGATCGGAGCGTAAG
ACGGTATATGCCTACAGAAGCAAGCAGTGGTTTCAGACATTACTCAAACCGTCTCATTGGATGAGTGATGAGGTGATTGACTCTCTCTTCCTCTTTATTCGGAAG
AAGATGGATACCCGTCCTGACTTATGTCATCGAAACTTTGTCACGGCGGATGTATTTGTAAGAGAATTTTTGAGGCGCGAGGATGTGTACGAAGAACTCCTTCGT
AGTGACCATGGGAACGACACGTTCGATTGGAGCAGATTCAAGACGATCACTAACTACGTAATGGGAGAACACACAGATTACGGCGTTCCTTGGAGTTCCGTTGAT
GCTGTCTACATGCCCTTCAACTTAGGTAGAAACCATTGGGTTCTACTGTGCGCTGACTTTGAAACGGGCGAATTTGTGTTGACAGACTCCCTAACGGCACTGAAT
TCAGATGCAGACATAGCCAAGCAGATGAATACGGTATGCACCATTTTTCCTAGGCTGCTATTAAGGTGCGACGTTATGAAGGACAAGCCGTCTCTTCCAACACAT
CCATGGCGATTCAGAAGGAAGACCCAAGTGCCACAACAACAAGATAGTGGGGATTGTGGGGTTTTCACTGTAAAGTTTTTGGAATATGATGTAACTAGATCAGAT
TTAGGTAGTCTTAGTCAGGAGAAAATTGAGTTTTGCAGGCGTCAATTTGCTGTACAACTTTGGGCCAATAGGCCGTTCTTTTAG
Protein sequence
MNRVMEPPRVPEPVHEPVLESGQEPDEERESLPTVSEARLPDVEEATVGDTEMVDVIDANKRGMEKEDKDKWREKEEEVEKEDESREKKKKMMMKQTFCLTFVNF
QGKFVDLKKYFGSKDGPDDVGGPSKGPDDVGGPSKGPDDNEKDGKEKDVDEAYDIEHIMELESQPTTDLESHSITDVESQPTTDPVELIAPKAEPVELGDEEVEK
VIGQHELVKRRGKRTRQISWKLRSPWADTRPGGKRRKVKQYDPMRTIPEEYETKFQKWLEDPLSDGSERKTVYAYRSKQWFQTLLKPSHWMSDEVIDSLFLFIRK
KMDTRPDLCHRNFVTADVFVREFLRREDVYEELLRSDHGNDTFDWSRFKTITNYVMGEHTDYGVPWSSVDAVYMPFNLGRNHWVLLCADFETGEFVLTDSLTALN
SDADIAKQMNTVCTIFPRLLLRCDVMKDKPSLPTHPWRFRRKTQVPQQQDSGDCGVFTVKFLEYDVTRSDLGSLSQEKIEFCRRQFAVQLWANRPFF