; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc11g28030 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc11g28030
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionULP_PROTEASE domain-containing protein
Genome locationchr11:20466013..20470558
RNA-Seq ExpressionMoc11g28030
SyntenyMoc11g28030
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0019784 - NEDD8-specific protease activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily
IPR044613 - NEDD8-specific protease 1/2-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031708.1 hypothetical protein E6C27_scaffold139G004940 [Cucumis melo var. makuwa]1.1e-6340Show/hide
Query:  VCVHLKILDKEETLEFKKGTYCRLVLGSIDNVVAAGTIFESRRNDGNVKVSIDVVVDDDSRLPIPTHGGNDILSQEIGSHILWPQSLVISDNEK------
        VC  ++ L K      K GT CRL +G+ DNVV AGTIF+   +  NVKVS+D+V D +  +P+PT  G  +LSQE+GS +LWP+ LVI  +EK      
Subjt:  VCVHLKILDKEETLEFKKGTYCRLVLGSIDNVVAAGTIFESRRNDGNVKVSIDVVVDDDSRLPIPTHGGNDILSQEIGSHILWPQSLVISDNEK------

Query:  -------------------------INYIGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYTMFLHRSLGNENESSPYKFLDAGATSIT
                                 ++YIG  IQ+ +   VFG+E K  I +E +Q+   M+P +T CIDA+   L++ +        YKF DAG+ S+ 
Subjt:  -------------------------INYIGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYTMFLHRSLGNENESSPYKFLDAGATSIT

Query:  NLSKENRVQVLTKRLSELELNQLLMFRYHSGDHWTLIVISPAKNMTFFLDSMRNRMTDDILSVITMAVRNIQKKPFALKRVPCPKQPNAVECGYYVMWLM
         +SKE+R Q+L  RL   +  Q+LMF Y+SG+HW LI I  ++   +++D +RNR+ +D   V+ MA     KK    + + CPKQ   VECGYYVM  M
Subjt:  NLSKENRVQVLTKRLSELELNQLLMFRYHSGDHWTLIVISPAKNMTFFLDSMRNRMTDDILSVITMAVRNIQKKPFALKRVPCPKQPNAVECGYYVMWLM

Query:  RDIVFARNTTIPECMKGAQQYYTQEHLDVIRRELAEFILSHIYFS
        RDI+ + N TI E M+G+   Y+Q+ LDV+R E AEF+  +I+ S
Subjt:  RDIVFARNTTIPECMKGAQQYYTQEHLDVIRRELAEFILSHIYFS

KAA0035941.1 uncharacterized protein E6C27_scaffold56G001300 [Cucumis melo var. makuwa]7.2e-6339.71Show/hide
Query:  VCVHLKILDKEETLEFKKGTYCRLVLGSIDNVVAAGTIFESRRNDGNVKVSIDVVVDDDSRLPIPTHGGNDILSQEIGSHILWPQSLVISDNEK------
        VC  ++ L K      K GT CRL +G+ DNVV AGTI +   +  NVKVS+D+V D +  +PIPT  G  +LSQE+GS +LWP+ LVI  +EK      
Subjt:  VCVHLKILDKEETLEFKKGTYCRLVLGSIDNVVAAGTIFESRRNDGNVKVSIDVVVDDDSRLPIPTHGGNDILSQEIGSHILWPQSLVISDNEK------

Query:  -------------------------INYIGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYTMFLHRSLGNENESSPYKFLDAGATSIT
                                 ++YIG  IQ+ +   VFG+E K  I +E +Q+   M+P +T CIDA+   L++ +        YKF DAG+ S+ 
Subjt:  -------------------------INYIGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYTMFLHRSLGNENESSPYKFLDAGATSIT

Query:  NLSKENRVQVLTKRLSELELNQLLMFRYHSGDHWTLIVISPAKNMTFFLDSMRNRMTDDILSVITMAVRNIQKKPFALKRVPCPKQPNAVECGYYVMWLM
         +SKE+R Q+L  RL   +  Q+L+F Y+SG+HW LI I+ ++   +++D +RNR+ +D   V+ MA     KK    + + CPKQ   VECGYYVM  M
Subjt:  NLSKENRVQVLTKRLSELELNQLLMFRYHSGDHWTLIVISPAKNMTFFLDSMRNRMTDDILSVITMAVRNIQKKPFALKRVPCPKQPNAVECGYYVMWLM

Query:  RDIVFARNTTIPECMKGAQQYYTQEHLDVIRRELAEFILSHIYFS
        RDI+ + N TI E M+G+   Y+Q+ LDV+R E AEF+  +I+ S
Subjt:  RDIVFARNTTIPECMKGAQQYYTQEHLDVIRRELAEFILSHIYFS

TYJ96009.1 uncharacterized protein E5676_scaffold2612G00150 [Cucumis melo var. makuwa]2.9e-6442.6Show/hide
Query:  TLE-FKKGTYCRLVLGSIDNVVAAGTIFESRRNDGNVKVSIDVVVDDDSRLPIPTHGGNDILSQEIGSHILWPQSLVISDNEKINY--------------
        TLE  K GT C+L   + D+VVA GTI +S     NVKV+IDVVVD D  +PIP+  G   +SQE+GSHILWP+ LVI++N K++Y              
Subjt:  TLE-FKKGTYCRLVLGSIDNVVAAGTIFESRRNDGNVKVSIDVVVDDDSRLPIPTHGGNDILSQEIGSHILWPQSLVISDNEKINY--------------

Query:  -----------------IGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYTMFLHRSLGNENESSPYKFLDAGATSITNLSKENRVQVL
                         +G AIQ+    DVFG   K  IM+E ++    M P  T C+DAY M+L+  + +    + YKFLDAG+ S  + SKE RVQ+L
Subjt:  -----------------IGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYTMFLHRSLGNENESSPYKFLDAGATSITNLSKENRVQVL

Query:  TKRLSELELNQLLMFRYHSGDHWTLIVISPAKNMTFFLDSMRNRMTDDILSVITMAVRNIQKKPFALKRVPCPKQPNAVECGYYVMWLMRDIVFARNTTI
        T RL   + +QLL+F Y+SG+HWTL+VI+  K   F++D ++NR+  D+  V+  +   + KK  A + V CPKQ   VECGYYVM  MRDI+ + +T+I
Subjt:  TKRLSELELNQLLMFRYHSGDHWTLIVISPAKNMTFFLDSMRNRMTDDILSVITMAVRNIQKKPFALKRVPCPKQPNAVECGYYVMWLMRDIVFARNTTI

Query:  PECMKGAQQYYTQEHLDVIRRELAEFILSHI
         + MK + + YTQ+ +D IR E AEF+  H+
Subjt:  PECMKGAQQYYTQEHLDVIRRELAEFILSHI

TYK08419.1 uncharacterized protein E5676_scaffold654G00340 [Cucumis melo var. makuwa]2.9e-6442.6Show/hide
Query:  TLE-FKKGTYCRLVLGSIDNVVAAGTIFESRRNDGNVKVSIDVVVDDDSRLPIPTHGGNDILSQEIGSHILWPQSLVISDNEKINY--------------
        TLE  K GT C+L   + D+VVA GTI +S     NVKV+IDVVVD D  +PIP+  G   +SQE+GSHILWP+ LVI++N K++Y              
Subjt:  TLE-FKKGTYCRLVLGSIDNVVAAGTIFESRRNDGNVKVSIDVVVDDDSRLPIPTHGGNDILSQEIGSHILWPQSLVISDNEKINY--------------

Query:  -----------------IGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYTMFLHRSLGNENESSPYKFLDAGATSITNLSKENRVQVL
                         +G AIQ+    DVFG   K  IM+E ++    M P  T C+DAY M+L+  + +    + YKFLDAG+ S  + SKE RVQ+L
Subjt:  -----------------IGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYTMFLHRSLGNENESSPYKFLDAGATSITNLSKENRVQVL

Query:  TKRLSELELNQLLMFRYHSGDHWTLIVISPAKNMTFFLDSMRNRMTDDILSVITMAVRNIQKKPFALKRVPCPKQPNAVECGYYVMWLMRDIVFARNTTI
        T RL   + +QLL+F Y+SG+HWTL+VI+  K   F++D ++NR+  D+  V+  +   + KK  A + V CPKQ   VECGYYVM  MRDI+ + +T+I
Subjt:  TKRLSELELNQLLMFRYHSGDHWTLIVISPAKNMTFFLDSMRNRMTDDILSVITMAVRNIQKKPFALKRVPCPKQPNAVECGYYVMWLMRDIVFARNTTI

Query:  PECMKGAQQYYTQEHLDVIRRELAEFILSHI
         + MK + + YTQ+ +D IR E AEF+  H+
Subjt:  PECMKGAQQYYTQEHLDVIRRELAEFILSHI

XP_022156878.1 uncharacterized protein LOC111023711 [Momordica charantia]2.3e-13873.64Show/hide
Query:  SYRYLTDVTVCVHLKILDKEETLEFKKGTYCRLVLGSIDNVVAAGTIFESRRNDGNVKVSIDVVVDDDSRLPIPTHGGNDILSQEIGSHILWPQSLVISD
        SYRYLTDVTVCVHLKILDKEETLEFK+GT+CRL LGSIDNVVAA TIFES R DGNVKVSIDVVVDDDSRLPIPTHGGN+ILSQEIGSHILWPQ+LVISD
Subjt:  SYRYLTDVTVCVHLKILDKEETLEFKKGTYCRLVLGSIDNVVAAGTIFESRRNDGNVKVSIDVVVDDDSRLPIPTHGGNDILSQEIGSHILWPQSLVISD

Query:  NEK-------------------------------INYIGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYTMFLHRSLGNENESSPYKF
        NEK                               INYIGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYT FLHRSLGNE ESSPYKF
Subjt:  NEK-------------------------------INYIGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYTMFLHRSLGNENESSPYKF

Query:  LDAGATSITNLSKENRVQVLTKRLSELELNQLLMFRYHSGDHWTLIVISPAKNMTFFLDSMRNRMTDDILSVITMAVRNIQKKPFALKRVP---------
        LDAGATSITNLSKENRVQVLTKRLSELELNQLLMF YHSG                                   AVRNIQKKPFALKRVP         
Subjt:  LDAGATSITNLSKENRVQVLTKRLSELELNQLLMFRYHSGDHWTLIVISPAKNMTFFLDSMRNRMTDDILSVITMAVRNIQKKPFALKRVP---------

Query:  -----CPKQPNAVECGYYVMWLMRDIVFARNTTIPECMKGAQQYYTQEHLDVIRRELAEFILSHIYFS
             CPKQPNAVECGYYV+  MRDIVFARNTTIPECMKGA Q YTQEHLDVIRRELAEF LSHIYFS
Subjt:  -----CPKQPNAVECGYYVMWLMRDIVFARNTTIPECMKGAQQYYTQEHLDVIRRELAEFILSHIYFS

TrEMBL top hitse value%identityAlignment
A0A5A7SM56 ULP_PROTEASE domain-containing protein5.4e-6440Show/hide
Query:  VCVHLKILDKEETLEFKKGTYCRLVLGSIDNVVAAGTIFESRRNDGNVKVSIDVVVDDDSRLPIPTHGGNDILSQEIGSHILWPQSLVISDNEK------
        VC  ++ L K      K GT CRL +G+ DNVV AGTIF+   +  NVKVS+D+V D +  +P+PT  G  +LSQE+GS +LWP+ LVI  +EK      
Subjt:  VCVHLKILDKEETLEFKKGTYCRLVLGSIDNVVAAGTIFESRRNDGNVKVSIDVVVDDDSRLPIPTHGGNDILSQEIGSHILWPQSLVISDNEK------

Query:  -------------------------INYIGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYTMFLHRSLGNENESSPYKFLDAGATSIT
                                 ++YIG  IQ+ +   VFG+E K  I +E +Q+   M+P +T CIDA+   L++ +        YKF DAG+ S+ 
Subjt:  -------------------------INYIGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYTMFLHRSLGNENESSPYKFLDAGATSIT

Query:  NLSKENRVQVLTKRLSELELNQLLMFRYHSGDHWTLIVISPAKNMTFFLDSMRNRMTDDILSVITMAVRNIQKKPFALKRVPCPKQPNAVECGYYVMWLM
         +SKE+R Q+L  RL   +  Q+LMF Y+SG+HW LI I  ++   +++D +RNR+ +D   V+ MA     KK    + + CPKQ   VECGYYVM  M
Subjt:  NLSKENRVQVLTKRLSELELNQLLMFRYHSGDHWTLIVISPAKNMTFFLDSMRNRMTDDILSVITMAVRNIQKKPFALKRVPCPKQPNAVECGYYVMWLM

Query:  RDIVFARNTTIPECMKGAQQYYTQEHLDVIRRELAEFILSHIYFS
        RDI+ + N TI E M+G+   Y+Q+ LDV+R E AEF+  +I+ S
Subjt:  RDIVFARNTTIPECMKGAQQYYTQEHLDVIRRELAEFILSHIYFS

A0A5A7T2U8 ULP_PROTEASE domain-containing protein3.5e-6339.71Show/hide
Query:  VCVHLKILDKEETLEFKKGTYCRLVLGSIDNVVAAGTIFESRRNDGNVKVSIDVVVDDDSRLPIPTHGGNDILSQEIGSHILWPQSLVISDNEK------
        VC  ++ L K      K GT CRL +G+ DNVV AGTI +   +  NVKVS+D+V D +  +PIPT  G  +LSQE+GS +LWP+ LVI  +EK      
Subjt:  VCVHLKILDKEETLEFKKGTYCRLVLGSIDNVVAAGTIFESRRNDGNVKVSIDVVVDDDSRLPIPTHGGNDILSQEIGSHILWPQSLVISDNEK------

Query:  -------------------------INYIGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYTMFLHRSLGNENESSPYKFLDAGATSIT
                                 ++YIG  IQ+ +   VFG+E K  I +E +Q+   M+P +T CIDA+   L++ +        YKF DAG+ S+ 
Subjt:  -------------------------INYIGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYTMFLHRSLGNENESSPYKFLDAGATSIT

Query:  NLSKENRVQVLTKRLSELELNQLLMFRYHSGDHWTLIVISPAKNMTFFLDSMRNRMTDDILSVITMAVRNIQKKPFALKRVPCPKQPNAVECGYYVMWLM
         +SKE+R Q+L  RL   +  Q+L+F Y+SG+HW LI I+ ++   +++D +RNR+ +D   V+ MA     KK    + + CPKQ   VECGYYVM  M
Subjt:  NLSKENRVQVLTKRLSELELNQLLMFRYHSGDHWTLIVISPAKNMTFFLDSMRNRMTDDILSVITMAVRNIQKKPFALKRVPCPKQPNAVECGYYVMWLM

Query:  RDIVFARNTTIPECMKGAQQYYTQEHLDVIRRELAEFILSHIYFS
        RDI+ + N TI E M+G+   Y+Q+ LDV+R E AEF+  +I+ S
Subjt:  RDIVFARNTTIPECMKGAQQYYTQEHLDVIRRELAEFILSHIYFS

A0A5D3CDJ5 ULP_PROTEASE domain-containing protein1.4e-6442.6Show/hide
Query:  TLE-FKKGTYCRLVLGSIDNVVAAGTIFESRRNDGNVKVSIDVVVDDDSRLPIPTHGGNDILSQEIGSHILWPQSLVISDNEKINY--------------
        TLE  K GT C+L   + D+VVA GTI +S     NVKV+IDVVVD D  +PIP+  G   +SQE+GSHILWP+ LVI++N K++Y              
Subjt:  TLE-FKKGTYCRLVLGSIDNVVAAGTIFESRRNDGNVKVSIDVVVDDDSRLPIPTHGGNDILSQEIGSHILWPQSLVISDNEKINY--------------

Query:  -----------------IGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYTMFLHRSLGNENESSPYKFLDAGATSITNLSKENRVQVL
                         +G AIQ+    DVFG   K  IM+E ++    M P  T C+DAY M+L+  + +    + YKFLDAG+ S  + SKE RVQ+L
Subjt:  -----------------IGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYTMFLHRSLGNENESSPYKFLDAGATSITNLSKENRVQVL

Query:  TKRLSELELNQLLMFRYHSGDHWTLIVISPAKNMTFFLDSMRNRMTDDILSVITMAVRNIQKKPFALKRVPCPKQPNAVECGYYVMWLMRDIVFARNTTI
        T RL   + +QLL+F Y+SG+HWTL+VI+  K   F++D ++NR+  D+  V+  +   + KK  A + V CPKQ   VECGYYVM  MRDI+ + +T+I
Subjt:  TKRLSELELNQLLMFRYHSGDHWTLIVISPAKNMTFFLDSMRNRMTDDILSVITMAVRNIQKKPFALKRVPCPKQPNAVECGYYVMWLMRDIVFARNTTI

Query:  PECMKGAQQYYTQEHLDVIRRELAEFILSHI
         + MK + + YTQ+ +D IR E AEF+  H+
Subjt:  PECMKGAQQYYTQEHLDVIRRELAEFILSHI

A0A5D3D5Q6 ULP_PROTEASE domain-containing protein1.4e-6442.6Show/hide
Query:  TLE-FKKGTYCRLVLGSIDNVVAAGTIFESRRNDGNVKVSIDVVVDDDSRLPIPTHGGNDILSQEIGSHILWPQSLVISDNEKINY--------------
        TLE  K GT C+L   + D+VVA GTI +S     NVKV+IDVVVD D  +PIP+  G   +SQE+GSHILWP+ LVI++N K++Y              
Subjt:  TLE-FKKGTYCRLVLGSIDNVVAAGTIFESRRNDGNVKVSIDVVVDDDSRLPIPTHGGNDILSQEIGSHILWPQSLVISDNEKINY--------------

Query:  -----------------IGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYTMFLHRSLGNENESSPYKFLDAGATSITNLSKENRVQVL
                         +G AIQ+    DVFG   K  IM+E ++    M P  T C+DAY M+L+  + +    + YKFLDAG+ S  + SKE RVQ+L
Subjt:  -----------------IGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYTMFLHRSLGNENESSPYKFLDAGATSITNLSKENRVQVL

Query:  TKRLSELELNQLLMFRYHSGDHWTLIVISPAKNMTFFLDSMRNRMTDDILSVITMAVRNIQKKPFALKRVPCPKQPNAVECGYYVMWLMRDIVFARNTTI
        T RL   + +QLL+F Y+SG+HWTL+VI+  K   F++D ++NR+  D+  V+  +   + KK  A + V CPKQ   VECGYYVM  MRDI+ + +T+I
Subjt:  TKRLSELELNQLLMFRYHSGDHWTLIVISPAKNMTFFLDSMRNRMTDDILSVITMAVRNIQKKPFALKRVPCPKQPNAVECGYYVMWLMRDIVFARNTTI

Query:  PECMKGAQQYYTQEHLDVIRRELAEFILSHI
         + MK + + YTQ+ +D IR E AEF+  H+
Subjt:  PECMKGAQQYYTQEHLDVIRRELAEFILSHI

A0A6J1DRT3 uncharacterized protein LOC1110237111.1e-13873.64Show/hide
Query:  SYRYLTDVTVCVHLKILDKEETLEFKKGTYCRLVLGSIDNVVAAGTIFESRRNDGNVKVSIDVVVDDDSRLPIPTHGGNDILSQEIGSHILWPQSLVISD
        SYRYLTDVTVCVHLKILDKEETLEFK+GT+CRL LGSIDNVVAA TIFES R DGNVKVSIDVVVDDDSRLPIPTHGGN+ILSQEIGSHILWPQ+LVISD
Subjt:  SYRYLTDVTVCVHLKILDKEETLEFKKGTYCRLVLGSIDNVVAAGTIFESRRNDGNVKVSIDVVVDDDSRLPIPTHGGNDILSQEIGSHILWPQSLVISD

Query:  NEK-------------------------------INYIGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYTMFLHRSLGNENESSPYKF
        NEK                               INYIGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYT FLHRSLGNE ESSPYKF
Subjt:  NEK-------------------------------INYIGRAIQMIISKDVFGHEHKLFIMVEDVQKLFHMEPTTTPCIDAYTMFLHRSLGNENESSPYKF

Query:  LDAGATSITNLSKENRVQVLTKRLSELELNQLLMFRYHSGDHWTLIVISPAKNMTFFLDSMRNRMTDDILSVITMAVRNIQKKPFALKRVP---------
        LDAGATSITNLSKENRVQVLTKRLSELELNQLLMF YHSG                                   AVRNIQKKPFALKRVP         
Subjt:  LDAGATSITNLSKENRVQVLTKRLSELELNQLLMFRYHSGDHWTLIVISPAKNMTFFLDSMRNRMTDDILSVITMAVRNIQKKPFALKRVP---------

Query:  -----CPKQPNAVECGYYVMWLMRDIVFARNTTIPECMKGAQQYYTQEHLDVIRRELAEFILSHIYFS
             CPKQPNAVECGYYV+  MRDIVFARNTTIPECMKGA Q YTQEHLDVIRRELAEF LSHIYFS
Subjt:  -----CPKQPNAVECGYYVMWLMRDIVFARNTTIPECMKGAQQYYTQEHLDVIRRELAEFILSHIYFS

SwissProt top hitse value%identityAlignment
O13612 NEDD8-specific protease 23.2e-0535.71Show/hide
Query:  SGDHWTLIVISPAKNMTFFLDSMRNRMTDDILSVITMAVRN---IQKKPFALKRVPCPKQPNAVECGYYV
        SG HW+L+V+S  K + ++ DSM N  T+D      +A++N   + KK F ++ +  P+Q N  +CG +V
Subjt:  SGDHWTLIVISPAKNMTFFLDSMRNRMTDDILSVITMAVRN---IQKKPFALKRVPCPKQPNAVECGYYV

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTAATAGTTTTCCCAAATACTCCATGAAACAAGTAGAAGATGGTATTTGTTATAGTGATCCAAACACTACTGCTGATGGACTGCCCCAACAAGATCTGAATGCTGC
CGCCACCTTCGTTCTCCCAAATCCGAAAGCTGCTACCGCCACTCTACTCTCAGATCCAAAGCTGATCGTGCCGCCCTTAGTTCATTCTCGAGGTCTGTTACCCACACCCT
TCACGAAATCGCGTGGGGATCTCATTTTATCACAAGAGAAAACAAGTTACCGATACTTAACTGATGTGACTGTATGTGTTCACTTGAAGATATTGGACAAGGAAGAAACT
TTGGAGTTCAAGAAGGGAACTTATTGTCGTCTGGTACTTGGGTCCATCGATAATGTTGTCGCTGCAGGCACTATATTTGAATCTAGGAGGAATGATGGAAACGTGAAAGT
GTCCATAGACGTGGTGGTTGATGACGACTCTCGACTTCCAATTCCGACACATGGAGGAAATGATATTCTCTCGCAAGAAATAGGTTCACATATATTATGGCCTCAAAGTC
TAGTCATATCCGATAATGAGAAGATAAACTACATTGGAAGGGCAATTCAAATGATTATATCGAAGGATGTGTTCGGTCATGAACATAAGTTGTTTATCATGGTGGAGGAT
GTACAGAAGTTGTTTCATATGGAACCGACAACTACTCCGTGCATTGATGCCTACACGATGTTCTTACATAGATCGTTGGGCAATGAAAATGAATCAAGCCCGTACAAGTT
TCTAGATGCTGGGGCCACTTCCATAACTAATCTATCTAAAGAAAACCGCGTGCAAGTATTGACTAAAAGACTCTCAGAATTGGAGTTGAACCAACTGCTGATGTTTCGAT
ATCATTCCGGAGATCATTGGACGTTGATAGTGATATCTCCCGCAAAGAATATGACATTTTTTCTTGACTCGATGAGAAATCGCATGACAGATGATATTCTTAGTGTCATC
ACCATGGCTGTAAGAAATATACAGAAAAAACCATTTGCTCTGAAGCGTGTACCGTGCCCAAAACAACCGAATGCAGTAGAATGTGGATACTATGTCATGTGGTTAATGCG
TGACATAGTCTTCGCTCGTAACACAACAATCCCAGAATGCATGAAAGGGGCACAGCAATATTACACACAGGAGCACTTGGATGTGATTAGGAGGGAGTTGGCAGAGTTCA
TACTCTCGCACATATACTTTTCGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCTTAATAGTTTTCCCAAATACTCCATGAAACAAGTAGAAGATGGTATTTGTTATAGTGATCCAAACACTACTGCTGATGGACTGCCCCAACAAGATCTGAATGCTGC
CGCCACCTTCGTTCTCCCAAATCCGAAAGCTGCTACCGCCACTCTACTCTCAGATCCAAAGCTGATCGTGCCGCCCTTAGTTCATTCTCGAGGTCTGTTACCCACACCCT
TCACGAAATCGCGTGGGGATCTCATTTTATCACAAGAGAAAACAAGTTACCGATACTTAACTGATGTGACTGTATGTGTTCACTTGAAGATATTGGACAAGGAAGAAACT
TTGGAGTTCAAGAAGGGAACTTATTGTCGTCTGGTACTTGGGTCCATCGATAATGTTGTCGCTGCAGGCACTATATTTGAATCTAGGAGGAATGATGGAAACGTGAAAGT
GTCCATAGACGTGGTGGTTGATGACGACTCTCGACTTCCAATTCCGACACATGGAGGAAATGATATTCTCTCGCAAGAAATAGGTTCACATATATTATGGCCTCAAAGTC
TAGTCATATCCGATAATGAGAAGATAAACTACATTGGAAGGGCAATTCAAATGATTATATCGAAGGATGTGTTCGGTCATGAACATAAGTTGTTTATCATGGTGGAGGAT
GTACAGAAGTTGTTTCATATGGAACCGACAACTACTCCGTGCATTGATGCCTACACGATGTTCTTACATAGATCGTTGGGCAATGAAAATGAATCAAGCCCGTACAAGTT
TCTAGATGCTGGGGCCACTTCCATAACTAATCTATCTAAAGAAAACCGCGTGCAAGTATTGACTAAAAGACTCTCAGAATTGGAGTTGAACCAACTGCTGATGTTTCGAT
ATCATTCCGGAGATCATTGGACGTTGATAGTGATATCTCCCGCAAAGAATATGACATTTTTTCTTGACTCGATGAGAAATCGCATGACAGATGATATTCTTAGTGTCATC
ACCATGGCTGTAAGAAATATACAGAAAAAACCATTTGCTCTGAAGCGTGTACCGTGCCCAAAACAACCGAATGCAGTAGAATGTGGATACTATGTCATGTGGTTAATGCG
TGACATAGTCTTCGCTCGTAACACAACAATCCCAGAATGCATGAAAGGGGCACAGCAATATTACACACAGGAGCACTTGGATGTGATTAGGAGGGAGTTGGCAGAGTTCA
TACTCTCGCACATATACTTTTCGTAG
Protein sequenceShow/hide protein sequence
MLNSFPKYSMKQVEDGICYSDPNTTADGLPQQDLNAAATFVLPNPKAATATLLSDPKLIVPPLVHSRGLLPTPFTKSRGDLILSQEKTSYRYLTDVTVCVHLKILDKEET
LEFKKGTYCRLVLGSIDNVVAAGTIFESRRNDGNVKVSIDVVVDDDSRLPIPTHGGNDILSQEIGSHILWPQSLVISDNEKINYIGRAIQMIISKDVFGHEHKLFIMVED
VQKLFHMEPTTTPCIDAYTMFLHRSLGNENESSPYKFLDAGATSITNLSKENRVQVLTKRLSELELNQLLMFRYHSGDHWTLIVISPAKNMTFFLDSMRNRMTDDILSVI
TMAVRNIQKKPFALKRVPCPKQPNAVECGYYVMWLMRDIVFARNTTIPECMKGAQQYYTQEHLDVIRRELAEFILSHIYFS