; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0010824 (gene) of Snake gourd v1 genome

Gene IDTan0010824
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionULP_PROTEASE domain-containing protein
Genome locationLG09:53862839..53866819
RNA-Seq ExpressionTan0010824
SyntenyTan0010824
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040673.1 putative serine/threonine-protein kinase nek2 [Cucumis melo var. makuwa]1.1e-2828.54Show/hide
Query:  MLQLAGIRNEADRRVIEYNAHGQWIGRNVTKMISYIGTCVRHHIPITYDSWPEVPRELKEKIYSCI----------------------------------
        M  +  I++  ++ V+EYN +G  IG N  K+ S+IG+CV HHIPIT+  W  VP ELKEKI S +                                  
Subjt:  MLQLAGIRNEADRRVIEYNAHGQWIGRNVTKMISYIGTCVRHHIPITYDSWPEVPRELKEKIYSCI----------------------------------

Query:  -----EDQLAKDQENAHDDILAEAL-----------------GTPEHS-GRVRGGKSCDLAV--GNINNIVAS-----GMVYERLSSHE-----------
             E QL  D  + +  I+ + +                    EH  GR    K        G IN  V         + ++  + E           
Subjt:  -----EDQLAKDQENAHDDILAEAL-----------------GTPEHS-GRVRGGKSCDLAV--GNINNIVAS-----GMVYERLSSHE-----------

Query:  -VVYGVPLTSNDVRVLNTLVSDFNAPLPILI------AEK---QYAFLQPSLISYAYGPEEQRCRFLCNRLRETK-NKLLICPCNSGHHWLLVVISLRTS
           YG  + S+        V+   + LPI +      AE+   +Y F+ PSLIS  +   E R R LC+RL  +K ++L++ PCN G HW LV+I+    
Subjt:  -VVYGVPLTSNDVRVLNTLVSDFNAPLPILI------AEK---QYAFLQPSLISYAYGPEEQRCRFLCNRLRETK-NKLLICPCNSGHHWLLVVISLRTS

Query:  TIWSIDSIGHGVRDYVK------------------NIV----NTTECGYYTMKFIRDIVTQRSRVITDVLTRKDKYTQDELDEIRVELCEFVNRYL
         ++ +DS+    RD +K                  NI+     T ECGYY M+++R+I+++ + +ITD +  ++ Y+Q ELDE+RVE  EF++RY+
Subjt:  TIWSIDSIGHGVRDYVK------------------NIV----NTTECGYYTMKFIRDIVTQRSRVITDVLTRKDKYTQDELDEIRVELCEFVNRYL

KAA0064126.1 uncharacterized protein E6C27_scaffold548G00390 [Cucumis melo var. makuwa]1.9e-3329.92Show/hide
Query:  MLQLAGIRNEADRRVIEYNAHGQWIGRNVTKMISYIGTCVRHHIPITYDSWPEVPRELKEKIYSCIEDQLAKDQEN--AHDDILAEALGTPEHSGR----
        M  +  I+   +  ++EYN +G  IG N  K+ S+IG+CV HHIPIT+ S   V  ELKEK+ + +ED++     N  + +D L++ALGTPE+ GR    
Subjt:  MLQLAGIRNEADRRVIEYNAHGQWIGRNVTKMISYIGTCVRHHIPITYDSWPEVPRELKEKIYSCIEDQLAKDQEN--AHDDILAEALGTPEHSGR----

Query:  --VRGGKSCDLAVGNINNIVASGMVYER--------LSSHEVVYGVPLTSNDVRVLNTLVSDFNAPLPI-------------------------------
          V+  K    +V  I+      +V E         +S +EV    P+T  +V V N      N P+ +                               
Subjt:  --VRGGKSCDLAVGNINNIVASGMVYER--------LSSHEVVYGVPLTSNDVRVLNTLVSDFNAPLPI-------------------------------

Query:  ----LIAEKQYAFLQPSLISYAYGPEEQRCRFLCNRLRETK-NKLLICPCNSGHHWLLVVISLRTSTIWSIDSIGHGVRDYVKNIVN-------------
            L    +Y F+ PSLIS  +   E R R LC+RL  +K ++L++ P N G HW LVVI+     ++ +DS+    RD +K + N             
Subjt:  ----LIAEKQYAFLQPSLISYAYGPEEQRCRFLCNRLRETK-NKLLICPCNSGHHWLLVVISLRTSTIWSIDSIGHGVRDYVKNIVN-------------

Query:  ------------------TTECGYYTMKFIRDIVTQRSRVITDVLTRKDKYTQDELDEIRVELCEFVNRYL
                            ECGYY M+++ +IV++ + +ITD +  ++ Y+Q +LDEIRVE  +F+  Y+
Subjt:  ------------------TTECGYYTMKFIRDIVTQRSRVITDVLTRKDKYTQDELDEIRVELCEFVNRYL

TYJ97927.1 uncharacterized protein E5676_scaffold234G00170 [Cucumis melo var. makuwa]1.4e-2830.99Show/hide
Query:  MLQLAGIRNEADRRVIEYNAHGQWIGRNVTKMISYIGTCVRHHIPITYDSWPEVPRELKEKIYSCIEDQLAKDQENAHDDILAEALGTPEHSGRVRGGKS
        M ++   R+E  + VI+YN  GQ IG+N TK+ S+IGT VR H+PI Y  WP VP+E+K+KI+  IE     D  +    I    + TP     VR  + 
Subjt:  MLQLAGIRNEADRRVIEYNAHGQWIGRNVTKMISYIGTCVRHHIPITYDSWPEVPRELKEKIYSCIEDQLAKDQENAHDDILAEALGTPEHSGRVRGGKS

Query:  CDLAVGNINNIVASGMVYERLSSHEVVYGVPLTSNDVRVLNTLVSDFNAPLPILIAEKQYAFLQPSLISYAYGPEEQRCRFLCNRLRETK-NKLLICPCN
        C + + ++ +  +   +        ++Y +       R LN                  Y FL    IS     +E+R + L  RL  T  ++LL+ P N
Subjt:  CDLAVGNINNIVASGMVYERLSSHEVVYGVPLTSNDVRVLNTLVSDFNAPLPILIAEKQYAFLQPSLISYAYGPEEQRCRFLCNRLRETK-NKLLICPCN

Query:  SGHHWLLVVISLRTSTIWSIDSIGHGVRDYVKNIV-----------------------NTTECGYYTMKFIRDIVTQRSRVITDVLTRKDK-YTQDELDE
        SG+HW LVVI+L     + ID + + +   V  +V                          ECGYY M+F+RDI+   S  I  ++    + YTQDE+D 
Subjt:  SGHHWLLVVISLRTSTIWSIDSIGHGVRDYVKNIV-----------------------NTTECGYYTMKFIRDIVTQRSRVITDVLTRKDK-YTQDELDE

Query:  IRVELCEFVNRYL
        IR E  EFV +++
Subjt:  IRVELCEFVNRYL

TYK13949.1 transposase [Cucumis melo var. makuwa]2.4e-2826.76Show/hide
Query:  MLQLAGIRNEADRRVIEYNAHGQWIGRNVTKMISYIGTCVRHHIPITYDSWPEVPRELKEKIYSCIE---------------------------------
        M  +  +R+E +R+++EYN  G  IG N  K+ S+I +CV +HIPI Y +W +V  ELKEKIY+ +E                                 
Subjt:  MLQLAGIRNEADRRVIEYNAHGQWIGRNVTKMISYIGTCVRHHIPITYDSWPEVPRELKEKIYSCIE---------------------------------

Query:  ---------------------------------------------------------------DQLAK---DQENAHDDILAEALGTPEHSGRVRG----
                                                                       D+++K   D+E + +D+  +AL T E SGRVRG    
Subjt:  ---------------------------------------------------------------DQLAK---DQENAHDDILAEALGTPEHSGRVRG----

Query:  --GKSCDLAVGNINNIVASGMVYERLSSHEVVYGVPLTSN------DVRVLNTLVSDFNAPLPILIAE-----------KQYAFLQPSLISYAYGPEEQR
            +          I       E+L   EV   V + ++      +VR    +V + N  +P+ +               Y F+ PSLIS  +  +E  
Subjt:  --GKSCDLAVGNINNIVASGMVYERLSSHEVVYGVPLTSN------DVRVLNTLVSDFNAPLPILIAE-----------KQYAFLQPSLISYAYGPEEQR

Query:  CRFLCNRLRETK-NKLLICPCNSGHHWLLVVISLRTSTIWSIDSIGHGVRDYVKNIVNTTECGYYTMKFIRDIVTQRSRVITDVLTRKDKYTQDELDEIR
         R L NRL  +K N+L++ P N G HW L+ I+    TI+ +DS+    R  +K  V   ECGYY M++IRDI+T  S V+TD++  +  Y+Q EL E+R
Subjt:  CRFLCNRLRETK-NKLLICPCNSGHHWLLVVISLRTSTIWSIDSIGHGVRDYVKNIVNTTECGYYTMKFIRDIVTQRSRVITDVLTRKDKYTQDELDEIR

Query:  VELCEFVNRYL
        +EL + +  ++
Subjt:  VELCEFVNRYL

TYK28614.1 uncharacterized protein E5676_scaffold2030G00320 [Cucumis melo var. makuwa]1.9e-3329.92Show/hide
Query:  MLQLAGIRNEADRRVIEYNAHGQWIGRNVTKMISYIGTCVRHHIPITYDSWPEVPRELKEKIYSCIEDQLAKDQEN--AHDDILAEALGTPEHSGR----
        M  +  I+   +  ++EYN +G  IG N  K+ S+IG+CV HHIPIT+ S   V  ELKEK+ + +ED++     N  + +D L++ALGTPE+ GR    
Subjt:  MLQLAGIRNEADRRVIEYNAHGQWIGRNVTKMISYIGTCVRHHIPITYDSWPEVPRELKEKIYSCIEDQLAKDQEN--AHDDILAEALGTPEHSGR----

Query:  --VRGGKSCDLAVGNINNIVASGMVYER--------LSSHEVVYGVPLTSNDVRVLNTLVSDFNAPLPI-------------------------------
          V+  K    +V  I+      +V E         +S +EV    P+T  +V V N      N P+ +                               
Subjt:  --VRGGKSCDLAVGNINNIVASGMVYER--------LSSHEVVYGVPLTSNDVRVLNTLVSDFNAPLPI-------------------------------

Query:  ----LIAEKQYAFLQPSLISYAYGPEEQRCRFLCNRLRETK-NKLLICPCNSGHHWLLVVISLRTSTIWSIDSIGHGVRDYVKNIVN-------------
            L    +Y F+ PSLIS  +   E R R LC+RL  +K ++L++ P N G HW LVVI+     ++ +DS+    RD +K + N             
Subjt:  ----LIAEKQYAFLQPSLISYAYGPEEQRCRFLCNRLRETK-NKLLICPCNSGHHWLLVVISLRTSTIWSIDSIGHGVRDYVKNIVN-------------

Query:  ------------------TTECGYYTMKFIRDIVTQRSRVITDVLTRKDKYTQDELDEIRVELCEFVNRYL
                            ECGYY M+++ +IV++ + +ITD +  ++ Y+Q +LDEIRVE  +F+  Y+
Subjt:  ------------------TTECGYYTMKFIRDIVTQRSRVITDVLTRKDKYTQDELDEIRVELCEFVNRYL

TrEMBL top hitse value%identityAlignment
A0A5A7TC55 Putative serine/threonine-protein kinase nek25.2e-2928.54Show/hide
Query:  MLQLAGIRNEADRRVIEYNAHGQWIGRNVTKMISYIGTCVRHHIPITYDSWPEVPRELKEKIYSCI----------------------------------
        M  +  I++  ++ V+EYN +G  IG N  K+ S+IG+CV HHIPIT+  W  VP ELKEKI S +                                  
Subjt:  MLQLAGIRNEADRRVIEYNAHGQWIGRNVTKMISYIGTCVRHHIPITYDSWPEVPRELKEKIYSCI----------------------------------

Query:  -----EDQLAKDQENAHDDILAEAL-----------------GTPEHS-GRVRGGKSCDLAV--GNINNIVAS-----GMVYERLSSHE-----------
             E QL  D  + +  I+ + +                    EH  GR    K        G IN  V         + ++  + E           
Subjt:  -----EDQLAKDQENAHDDILAEAL-----------------GTPEHS-GRVRGGKSCDLAV--GNINNIVAS-----GMVYERLSSHE-----------

Query:  -VVYGVPLTSNDVRVLNTLVSDFNAPLPILI------AEK---QYAFLQPSLISYAYGPEEQRCRFLCNRLRETK-NKLLICPCNSGHHWLLVVISLRTS
           YG  + S+        V+   + LPI +      AE+   +Y F+ PSLIS  +   E R R LC+RL  +K ++L++ PCN G HW LV+I+    
Subjt:  -VVYGVPLTSNDVRVLNTLVSDFNAPLPILI------AEK---QYAFLQPSLISYAYGPEEQRCRFLCNRLRETK-NKLLICPCNSGHHWLLVVISLRTS

Query:  TIWSIDSIGHGVRDYVK------------------NIV----NTTECGYYTMKFIRDIVTQRSRVITDVLTRKDKYTQDELDEIRVELCEFVNRYL
         ++ +DS+    RD +K                  NI+     T ECGYY M+++R+I+++ + +ITD +  ++ Y+Q ELDE+RVE  EF++RY+
Subjt:  TIWSIDSIGHGVRDYVK------------------NIV----NTTECGYYTMKFIRDIVTQRSRVITDVLTRKDKYTQDELDEIRVELCEFVNRYL

A0A5A7VCM5 ULP_PROTEASE domain-containing protein9.1e-3429.92Show/hide
Query:  MLQLAGIRNEADRRVIEYNAHGQWIGRNVTKMISYIGTCVRHHIPITYDSWPEVPRELKEKIYSCIEDQLAKDQEN--AHDDILAEALGTPEHSGR----
        M  +  I+   +  ++EYN +G  IG N  K+ S+IG+CV HHIPIT+ S   V  ELKEK+ + +ED++     N  + +D L++ALGTPE+ GR    
Subjt:  MLQLAGIRNEADRRVIEYNAHGQWIGRNVTKMISYIGTCVRHHIPITYDSWPEVPRELKEKIYSCIEDQLAKDQEN--AHDDILAEALGTPEHSGR----

Query:  --VRGGKSCDLAVGNINNIVASGMVYER--------LSSHEVVYGVPLTSNDVRVLNTLVSDFNAPLPI-------------------------------
          V+  K    +V  I+      +V E         +S +EV    P+T  +V V N      N P+ +                               
Subjt:  --VRGGKSCDLAVGNINNIVASGMVYER--------LSSHEVVYGVPLTSNDVRVLNTLVSDFNAPLPI-------------------------------

Query:  ----LIAEKQYAFLQPSLISYAYGPEEQRCRFLCNRLRETK-NKLLICPCNSGHHWLLVVISLRTSTIWSIDSIGHGVRDYVKNIVN-------------
            L    +Y F+ PSLIS  +   E R R LC+RL  +K ++L++ P N G HW LVVI+     ++ +DS+    RD +K + N             
Subjt:  ----LIAEKQYAFLQPSLISYAYGPEEQRCRFLCNRLRETK-NKLLICPCNSGHHWLLVVISLRTSTIWSIDSIGHGVRDYVKNIVN-------------

Query:  ------------------TTECGYYTMKFIRDIVTQRSRVITDVLTRKDKYTQDELDEIRVELCEFVNRYL
                            ECGYY M+++ +IV++ + +ITD +  ++ Y+Q +LDEIRVE  +F+  Y+
Subjt:  ------------------TTECGYYTMKFIRDIVTQRSRVITDVLTRKDKYTQDELDEIRVELCEFVNRYL

A0A5D3BFM5 ULP_PROTEASE domain-containing protein6.7e-2930.99Show/hide
Query:  MLQLAGIRNEADRRVIEYNAHGQWIGRNVTKMISYIGTCVRHHIPITYDSWPEVPRELKEKIYSCIEDQLAKDQENAHDDILAEALGTPEHSGRVRGGKS
        M ++   R+E  + VI+YN  GQ IG+N TK+ S+IGT VR H+PI Y  WP VP+E+K+KI+  IE     D  +    I    + TP     VR  + 
Subjt:  MLQLAGIRNEADRRVIEYNAHGQWIGRNVTKMISYIGTCVRHHIPITYDSWPEVPRELKEKIYSCIEDQLAKDQENAHDDILAEALGTPEHSGRVRGGKS

Query:  CDLAVGNINNIVASGMVYERLSSHEVVYGVPLTSNDVRVLNTLVSDFNAPLPILIAEKQYAFLQPSLISYAYGPEEQRCRFLCNRLRETK-NKLLICPCN
        C + + ++ +  +   +        ++Y +       R LN                  Y FL    IS     +E+R + L  RL  T  ++LL+ P N
Subjt:  CDLAVGNINNIVASGMVYERLSSHEVVYGVPLTSNDVRVLNTLVSDFNAPLPILIAEKQYAFLQPSLISYAYGPEEQRCRFLCNRLRETK-NKLLICPCN

Query:  SGHHWLLVVISLRTSTIWSIDSIGHGVRDYVKNIV-----------------------NTTECGYYTMKFIRDIVTQRSRVITDVLTRKDK-YTQDELDE
        SG+HW LVVI+L     + ID + + +   V  +V                          ECGYY M+F+RDI+   S  I  ++    + YTQDE+D 
Subjt:  SGHHWLLVVISLRTSTIWSIDSIGHGVRDYVKNIV-----------------------NTTECGYYTMKFIRDIVTQRSRVITDVLTRKDK-YTQDELDE

Query:  IRVELCEFVNRYL
        IR E  EFV +++
Subjt:  IRVELCEFVNRYL

A0A5D3CUP0 Transposase1.1e-2826.76Show/hide
Query:  MLQLAGIRNEADRRVIEYNAHGQWIGRNVTKMISYIGTCVRHHIPITYDSWPEVPRELKEKIYSCIE---------------------------------
        M  +  +R+E +R+++EYN  G  IG N  K+ S+I +CV +HIPI Y +W +V  ELKEKIY+ +E                                 
Subjt:  MLQLAGIRNEADRRVIEYNAHGQWIGRNVTKMISYIGTCVRHHIPITYDSWPEVPRELKEKIYSCIE---------------------------------

Query:  ---------------------------------------------------------------DQLAK---DQENAHDDILAEALGTPEHSGRVRG----
                                                                       D+++K   D+E + +D+  +AL T E SGRVRG    
Subjt:  ---------------------------------------------------------------DQLAK---DQENAHDDILAEALGTPEHSGRVRG----

Query:  --GKSCDLAVGNINNIVASGMVYERLSSHEVVYGVPLTSN------DVRVLNTLVSDFNAPLPILIAE-----------KQYAFLQPSLISYAYGPEEQR
            +          I       E+L   EV   V + ++      +VR    +V + N  +P+ +               Y F+ PSLIS  +  +E  
Subjt:  --GKSCDLAVGNINNIVASGMVYERLSSHEVVYGVPLTSN------DVRVLNTLVSDFNAPLPILIAE-----------KQYAFLQPSLISYAYGPEEQR

Query:  CRFLCNRLRETK-NKLLICPCNSGHHWLLVVISLRTSTIWSIDSIGHGVRDYVKNIVNTTECGYYTMKFIRDIVTQRSRVITDVLTRKDKYTQDELDEIR
         R L NRL  +K N+L++ P N G HW L+ I+    TI+ +DS+    R  +K  V   ECGYY M++IRDI+T  S V+TD++  +  Y+Q EL E+R
Subjt:  CRFLCNRLRETK-NKLLICPCNSGHHWLLVVISLRTSTIWSIDSIGHGVRDYVKNIVNTTECGYYTMKFIRDIVTQRSRVITDVLTRKDKYTQDELDEIR

Query:  VELCEFVNRYL
        +EL + +  ++
Subjt:  VELCEFVNRYL

A0A5D3DYN7 ULP_PROTEASE domain-containing protein9.1e-3429.92Show/hide
Query:  MLQLAGIRNEADRRVIEYNAHGQWIGRNVTKMISYIGTCVRHHIPITYDSWPEVPRELKEKIYSCIEDQLAKDQEN--AHDDILAEALGTPEHSGR----
        M  +  I+   +  ++EYN +G  IG N  K+ S+IG+CV HHIPIT+ S   V  ELKEK+ + +ED++     N  + +D L++ALGTPE+ GR    
Subjt:  MLQLAGIRNEADRRVIEYNAHGQWIGRNVTKMISYIGTCVRHHIPITYDSWPEVPRELKEKIYSCIEDQLAKDQEN--AHDDILAEALGTPEHSGR----

Query:  --VRGGKSCDLAVGNINNIVASGMVYER--------LSSHEVVYGVPLTSNDVRVLNTLVSDFNAPLPI-------------------------------
          V+  K    +V  I+      +V E         +S +EV    P+T  +V V N      N P+ +                               
Subjt:  --VRGGKSCDLAVGNINNIVASGMVYER--------LSSHEVVYGVPLTSNDVRVLNTLVSDFNAPLPI-------------------------------

Query:  ----LIAEKQYAFLQPSLISYAYGPEEQRCRFLCNRLRETK-NKLLICPCNSGHHWLLVVISLRTSTIWSIDSIGHGVRDYVKNIVN-------------
            L    +Y F+ PSLIS  +   E R R LC+RL  +K ++L++ P N G HW LVVI+     ++ +DS+    RD +K + N             
Subjt:  ----LIAEKQYAFLQPSLISYAYGPEEQRCRFLCNRLRETK-NKLLICPCNSGHHWLLVVISLRTSTIWSIDSIGHGVRDYVKNIVN-------------

Query:  ------------------TTECGYYTMKFIRDIVTQRSRVITDVLTRKDKYTQDELDEIRVELCEFVNRYL
                            ECGYY M+++ +IV++ + +ITD +  ++ Y+Q +LDEIRVE  +F+  Y+
Subjt:  ------------------TTECGYYTMKFIRDIVTQRSRVITDVLTRKDKYTQDELDEIRVELCEFVNRYL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCCAGCTTGCAGGAATAAGAAATGAAGCTGATCGAAGAGTGATCGAATACAATGCTCATGGACAGTGGATTGGTAGGAATGTAACGAAAATGATAAGCTATATTGG
AACGTGTGTGCGACACCACATTCCAATTACATATGACTCGTGGCCTGAAGTACCAAGAGAATTGAAAGAAAAGATTTATTCTTGCATTGAGGACCAACTAGCAAAAGATC
AAGAGAATGCGCATGATGATATTCTGGCAGAGGCATTAGGCACTCCTGAACACTCAGGACGTGTAAGAGGTGGAAAATCTTGCGATTTAGCTGTTGGAAATATAAACAAT
ATTGTGGCATCAGGAATGGTTTATGAGAGGTTAAGTTCGCATGAAGTTGTTTACGGTGTGCCTCTCACATCGAACGATGTGAGAGTACTTAACACTCTTGTATCTGACTT
CAATGCTCCATTGCCTATATTGATAGCTGAGAAACAATATGCATTCCTTCAACCATCCTTGATTTCATATGCCTATGGACCTGAAGAGCAACGTTGTCGATTCTTATGTA
ATAGACTACGAGAGACCAAGAATAAACTATTGATTTGTCCTTGTAATTCAGGACACCATTGGTTGTTGGTGGTTATCTCATTGAGAACATCTACAATTTGGTCGATTGAC
TCCATAGGACATGGCGTTCGCGATTACGTGAAAAACATAGTTAATACTACAGAGTGTGGGTACTATACTATGAAGTTTATTCGAGATATAGTAACACAGAGAAGTCGAGT
GATAACGGACGTGTTGACGAGAAAAGATAAGTACACCCAAGATGAGTTAGACGAGATACGAGTTGAGTTATGTGAATTTGTAAATCGATACTTATGA
mRNA sequenceShow/hide mRNA sequence
ATGCTCCAGCTTGCAGGAATAAGAAATGAAGCTGATCGAAGAGTGATCGAATACAATGCTCATGGACAGTGGATTGGTAGGAATGTAACGAAAATGATAAGCTATATTGG
AACGTGTGTGCGACACCACATTCCAATTACATATGACTCGTGGCCTGAAGTACCAAGAGAATTGAAAGAAAAGATTTATTCTTGCATTGAGGACCAACTAGCAAAAGATC
AAGAGAATGCGCATGATGATATTCTGGCAGAGGCATTAGGCACTCCTGAACACTCAGGACGTGTAAGAGGTGGAAAATCTTGCGATTTAGCTGTTGGAAATATAAACAAT
ATTGTGGCATCAGGAATGGTTTATGAGAGGTTAAGTTCGCATGAAGTTGTTTACGGTGTGCCTCTCACATCGAACGATGTGAGAGTACTTAACACTCTTGTATCTGACTT
CAATGCTCCATTGCCTATATTGATAGCTGAGAAACAATATGCATTCCTTCAACCATCCTTGATTTCATATGCCTATGGACCTGAAGAGCAACGTTGTCGATTCTTATGTA
ATAGACTACGAGAGACCAAGAATAAACTATTGATTTGTCCTTGTAATTCAGGACACCATTGGTTGTTGGTGGTTATCTCATTGAGAACATCTACAATTTGGTCGATTGAC
TCCATAGGACATGGCGTTCGCGATTACGTGAAAAACATAGTTAATACTACAGAGTGTGGGTACTATACTATGAAGTTTATTCGAGATATAGTAACACAGAGAAGTCGAGT
GATAACGGACGTGTTGACGAGAAAAGATAAGTACACCCAAGATGAGTTAGACGAGATACGAGTTGAGTTATGTGAATTTGTAAATCGATACTTATGA
Protein sequenceShow/hide protein sequence
MLQLAGIRNEADRRVIEYNAHGQWIGRNVTKMISYIGTCVRHHIPITYDSWPEVPRELKEKIYSCIEDQLAKDQENAHDDILAEALGTPEHSGRVRGGKSCDLAVGNINN
IVASGMVYERLSSHEVVYGVPLTSNDVRVLNTLVSDFNAPLPILIAEKQYAFLQPSLISYAYGPEEQRCRFLCNRLRETKNKLLICPCNSGHHWLLVVISLRTSTIWSID
SIGHGVRDYVKNIVNTTECGYYTMKFIRDIVTQRSRVITDVLTRKDKYTQDELDEIRVELCEFVNRYL