; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0020207 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0020207
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionUbiquitin-like-specific protease ESD4 isoform X2
Genome locationchr11:23736301..23738799
RNA-Seq ExpressionPI0020207
SyntenyPI0020207
Gene Ontology termsGO:0043229 - intracellular organelle (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031317.1 uncharacterized protein E6C27_scaffold139G00700 [Cucumis melo var. makuwa]1.8e-3737.92Show/hide
Query:  MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNKPSS----------IFLQGFPLALVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHIN
        MLDD+EKF++YPWGR+ F LT ++ +  V +K  S           FLQGFP+ L YWA+E++P+L++   G+  RI     PRI+ WES E  DWQ + 
Subjt:  MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNKPSS----------IFLQGFPLALVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHIN

Query:  NSIFKASGFSVVPFIPTEKELQSSYYAYFVEQSLKEMEMEKEQEGEKRS-MGRKIASLEEGIENLKQGQEEIKKHINENHKEILDMLGYIKEAINDRLPP
        ++IF     SVVP + +E+E++S Y+ YF+E  L+  ++++E    K S M  +I+SL E I++LK+  +E  + + +  +EI+ +L  + E +N RL  
Subjt:  NSIFKASGFSVVPFIPTEKELQSSYYAYFVEQSLKEMEMEKEQEGEKRS-MGRKIASLEEGIENLKQGQEEIKKHINENHKEILDMLGYIKEAINDRLPP

Query:  KEEGGEKQFQ---EKNKAPSSSLELLNANASLECMHAEGKETECVIPEDNEDEKNEDNEDDVENEEENLEEGKNEQGKGKKKVVVKIIKSRAKDSIDEL-
        K E  EKQ Q   EKN  P  SLE+++A+          KE E    EDNE+ + +DN    EN+ + + +  N+         VK+++ +AK  I E+ 
Subjt:  KEEGGEKQFQ---EKNKAPSSSLELLNANASLECMHAEGKETECVIPEDNEDEKNEDNEDDVENEEENLEEGKNEQGKGKKKVVVKIIKSRAKDSIDEL-

Query:  --VGDKRPKRQTRPTKKVLENVNNPKN
          +   RPKRQ +P+KKVLENV   KN
Subjt:  --VGDKRPKRQTRPTKKVLENVNNPKN

TYK08201.1 40S ribosomal protein S15-4 [Cucumis melo var. makuwa]1.8e-3737.21Show/hide
Query:  MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNKPSS----------IFLQGFPLALVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHIN
        MLDD+EKF++YPWGR+ F LT ++ +  V +K  S           FLQGFP+ L YWA+E++P+L++   G+  RI     PRI+ WES E  DWQ + 
Subjt:  MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNKPSS----------IFLQGFPLALVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHIN

Query:  NSIFKASGFSVVPFIPTEKELQSSYYAYFVEQSLKEMEMEKEQEGEKRS-MGRKIASLEEGIENLKQGQEEIKKHINENHKEILDMLGYIKEAINDRLPP
        ++IF     SVVP + +E+E++S Y+ YF+E  L+  ++++E    K S M  +I+SL E I++LK+  +E  + + +  +EI+ +L  + E +N RL  
Subjt:  NSIFKASGFSVVPFIPTEKELQSSYYAYFVEQSLKEMEMEKEQEGEKRS-MGRKIASLEEGIENLKQGQEEIKKHINENHKEILDMLGYIKEAINDRLPP

Query:  KEEGGEKQFQ---EKNKAPSSSLELLNANASLECMHAEGKETECVIPEDNEDEKNEDNEDDVENEEENLEEGKNEQGKGKKKVVVKIIKSRAKDSIDEL-
        K E  EKQ Q   EKN  P  SLE+++A+          KE E    EDNE+ + +DN    EN+ + + +  N+         VK+++ +AK  I E+ 
Subjt:  KEEGGEKQFQ---EKNKAPSSSLELLNANASLECMHAEGKETECVIPEDNEDEKNEDNEDDVENEEENLEEGKNEQGKGKKKVVVKIIKSRAKDSIDEL-

Query:  --VGDKRPKRQTRPTKKVLENVNNPKN--DKKRCLPTKLQDIPQ
          +   RPKRQ +P+KKVLENV   KN   KK   P   +  P+
Subjt:  --VGDKRPKRQTRPTKKVLENVNNPKN--DKKRCLPTKLQDIPQ

TYK22453.1 uncharacterized protein E5676_scaffold3009G00010 [Cucumis melo var. makuwa]1.8e-3737.92Show/hide
Query:  MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNKPSS----------IFLQGFPLALVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHIN
        MLDD+EKF++YPWGR+ F LT ++ +  V +K  S           FLQGFP+ L YWA+E++P+L++   G+  RI     PRI+ WES E  DWQ + 
Subjt:  MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNKPSS----------IFLQGFPLALVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHIN

Query:  NSIFKASGFSVVPFIPTEKELQSSYYAYFVEQSLKEMEMEKEQEGEKRS-MGRKIASLEEGIENLKQGQEEIKKHINENHKEILDMLGYIKEAINDRLPP
        ++IF     SVVP + +E+E++S Y+ YF+E  L+  ++++E    K S M  +I+SL E I++LK+  +E  + + +  +EI+ +L  + E +N RL  
Subjt:  NSIFKASGFSVVPFIPTEKELQSSYYAYFVEQSLKEMEMEKEQEGEKRS-MGRKIASLEEGIENLKQGQEEIKKHINENHKEILDMLGYIKEAINDRLPP

Query:  KEEGGEKQFQ---EKNKAPSSSLELLNANASLECMHAEGKETECVIPEDNEDEKNEDNEDDVENEEENLEEGKNEQGKGKKKVVVKIIKSRAKDSIDEL-
        K E  EKQ Q   EKN  P  SLE+++A+          KE E    EDNE+ + +DN    EN+ + + +  N+         VK+++ +AK  I E+ 
Subjt:  KEEGGEKQFQ---EKNKAPSSSLELLNANASLECMHAEGKETECVIPEDNEDEKNEDNEDDVENEEENLEEGKNEQGKGKKKVVVKIIKSRAKDSIDEL-

Query:  --VGDKRPKRQTRPTKKVLENVNNPKN
          +   RPKRQ +P+KKVLENV   KN
Subjt:  --VGDKRPKRQTRPTKKVLENVNNPKN

XP_038880505.1 glutamic acid-rich protein-like [Benincasa hispida]2.9e-4052.21Show/hide
Query:  FSVVPFIPTEKELQSSYYAYFVEQSLKEMEMEKEQEGEKRS--MGRKIASLEEGIENLKQGQEEIKKHINENHKEILDMLGYIKEAINDRLPPKEEGGEK
        F+V  F+PTE+ELQS+YYAYF E   +EM++EKE+EG +++  +GR+IASL+E IE LK+GQE+IKK INE HKEILD+LG+IKE+I+D+LP KE+  ++
Subjt:  FSVVPFIPTEKELQSSYYAYFVEQSLKEMEMEKEQEGEKRS--MGRKIASLEEGIENLKQGQEEIKKHINENHKEILDMLGYIKEAINDRLPPKEEGGEK

Query:  QFQEKNKAPSSSLELLNANASLECMHAE---GKETECVIPEDNEDEKNEDNEDDVENEEENLEEGKNEQGKGKKKVVVKIIKS-----RAKDSIDELVGD
           EKNKAPSSSLELL  +ASLE + AE    K+ E    +D E+++ E   DD E+EEE+ EE K+++   KK V+ K +K      R K  I  +  +
Subjt:  QFQEKNKAPSSSLELLNANASLECMHAE---GKETECVIPEDNEDEKNEDNEDDVENEEENLEEGKNEQGKGKKKVVVKIIKS-----RAKDSIDELVGD

Query:  KRPKRQTRPTKKVLENVNNPKNDKKR
        KRPKRQ +PTKK+LEN    K DKK+
Subjt:  KRPKRQTRPTKKVLENVNNPKNDKKR

XP_038891747.1 pescadillo homolog [Benincasa hispida]5.3e-7453.66Show/hide
Query:  MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNKPSSIFLQGFPLALVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHINNSIFKASGFS
        MLDDEE+F+SYPWGRVSFELT+E+FKK V+NKPSSIFLQGFPL LVYWAFE+IP+LSNPT+GFA+RI+SD GPR+ QWESQEP DWQHINN+IFKA+G  
Subjt:  MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNKPSSIFLQGFPLALVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHINNSIFKASGFS

Query:  VVPFIPTEKELQSSYYAYFVEQSLKEMEMEKEQEGEKRSMGRKIASLEEGIENLKQGQEEIKKHINENHKEILDMLGYIKEAINDRLPPKEEGGEKQFQ-
                                                          IE LK+GQE+IKK INE HKEILD+LG+IK+AI+D+LP KEEGGEKQ Q 
Subjt:  VVPFIPTEKELQSSYYAYFVEQSLKEMEMEKEQEGEKRSMGRKIASLEEGIENLKQGQEEIKKHINENHKEILDMLGYIKEAINDRLPPKEEGGEKQFQ-

Query:  --EKNKAPSSSLELLNANASLECMHAE---GKETECVIPEDNEDEKNEDNEDDVENEEENLEEGKNEQGKGKKKVVVKIIKS-----RAKDSIDELVGDK
          EKNKAPSSSLELL  +ASLE + AE    K+ E    +D E+++ E   DD E+EEE+ EE K+++   KK V+ K +K      R K  I+ +  +K
Subjt:  --EKNKAPSSSLELLNANASLECMHAE---GKETECVIPEDNEDEKNEDNEDDVENEEENLEEGKNEQGKGKKKVVVKIIKS-----RAKDSIDELVGDK

Query:  RPKRQTRPTKKVLENVNNPKNDKKRCLP
        RPKRQ +PTKK+LEN    K DKK+  P
Subjt:  RPKRQTRPTKKVLENVNNPKNDKKRCLP

TrEMBL top hitse value%identityAlignment
A0A5A7VIA9 Uncharacterized protein8.6e-3837.92Show/hide
Query:  MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNKPSS----------IFLQGFPLALVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHIN
        MLDD+EKF++YPWGR+ F LT ++ +  V +K  S           FLQGFP+ L YWA+E++P+L++   G+  RI     PRI+ WES E  DWQ + 
Subjt:  MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNKPSS----------IFLQGFPLALVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHIN

Query:  NSIFKASGFSVVPFIPTEKELQSSYYAYFVEQSLKEMEMEKEQEGEKRS-MGRKIASLEEGIENLKQGQEEIKKHINENHKEILDMLGYIKEAINDRLPP
        ++IF     SVVP + +E+E++S Y+ YF+E  L+  ++++E    K S M  +I+SL E I++LK+  +E  + + +  +EI+ +L  + E +N RL  
Subjt:  NSIFKASGFSVVPFIPTEKELQSSYYAYFVEQSLKEMEMEKEQEGEKRS-MGRKIASLEEGIENLKQGQEEIKKHINENHKEILDMLGYIKEAINDRLPP

Query:  KEEGGEKQFQ---EKNKAPSSSLELLNANASLECMHAEGKETECVIPEDNEDEKNEDNEDDVENEEENLEEGKNEQGKGKKKVVVKIIKSRAKDSIDEL-
        K E  EKQ Q   EKN  P  SLE+++A+          KE E    EDNE+ + +DN    EN+ + + +  N+         VK+++ +AK  I E+ 
Subjt:  KEEGGEKQFQ---EKNKAPSSSLELLNANASLECMHAEGKETECVIPEDNEDEKNEDNEDDVENEEENLEEGKNEQGKGKKKVVVKIIKSRAKDSIDEL-

Query:  --VGDKRPKRQTRPTKKVLENVNNPKN
          +   RPKRQ +P+KKVLENV   KN
Subjt:  --VGDKRPKRQTRPTKKVLENVNNPKN

A0A5A7VKZ2 Uncharacterized protein4.0e-2731.58Show/hide
Query:  MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNKPSSIFLQGFPLALVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHINNSIFKASGFS
        M+DDEE   ++PWGR+S  LT+EY +K   +   +  LQGFP  LV WA E+IPKLS    G A RI+    PRI+ W+ ++   W ++  + FK+S F+
Subjt:  MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNKPSSIFLQGFPLALVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHINNSIFKASGFS

Query:  VVPFIPTEKELQSSYYAYFVEQSLKEMEMEKEQEGEKRS---------MGRKIASLEEGIENLKQGQEEIKKHINENHKEILDMLGYIKEAINDRLPPKE
        V+PF PT  EL S  + +F+  +  E+E E+ +E E  +               SL + I+ +K+  E +K  +     E+L++L  I   IN+R+P   
Subjt:  VVPFIPTEKELQSSYYAYFVEQSLKEMEMEKEQEGEKRS---------MGRKIASLEEGIENLKQGQEEIKKHINENHKEILDMLGYIKEAINDRLPPKE

Query:  EGGEKQFQ---EKNKAPSSSLELLNANASLECMHAEGKETECVIPEDNEDEKNEDNEDDVENEEENLE--------EGKNEQGKGKKKVVVKIIKSRAKD
           EKQ Q    +N AP+ SL+ L    ++  +            E+ ED K   NE+++EN+E N E        E KN +   ++K + ++ K     
Subjt:  EGGEKQFQ---EKNKAPSSSLELLNANASLECMHAEGKETECVIPEDNEDEKNEDNEDDVENEEENLE--------EGKNEQGKGKKKVVVKIIKSRAKD

Query:  SIDELVGDKRPKRQTRPTKKVLE
        +I E   +++P  +    K+++E
Subjt:  SIDELVGDKRPKRQTRPTKKVLE

A0A5D3C8N2 40S ribosomal protein S15-48.6e-3837.21Show/hide
Query:  MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNKPSS----------IFLQGFPLALVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHIN
        MLDD+EKF++YPWGR+ F LT ++ +  V +K  S           FLQGFP+ L YWA+E++P+L++   G+  RI     PRI+ WES E  DWQ + 
Subjt:  MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNKPSS----------IFLQGFPLALVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHIN

Query:  NSIFKASGFSVVPFIPTEKELQSSYYAYFVEQSLKEMEMEKEQEGEKRS-MGRKIASLEEGIENLKQGQEEIKKHINENHKEILDMLGYIKEAINDRLPP
        ++IF     SVVP + +E+E++S Y+ YF+E  L+  ++++E    K S M  +I+SL E I++LK+  +E  + + +  +EI+ +L  + E +N RL  
Subjt:  NSIFKASGFSVVPFIPTEKELQSSYYAYFVEQSLKEMEMEKEQEGEKRS-MGRKIASLEEGIENLKQGQEEIKKHINENHKEILDMLGYIKEAINDRLPP

Query:  KEEGGEKQFQ---EKNKAPSSSLELLNANASLECMHAEGKETECVIPEDNEDEKNEDNEDDVENEEENLEEGKNEQGKGKKKVVVKIIKSRAKDSIDEL-
        K E  EKQ Q   EKN  P  SLE+++A+          KE E    EDNE+ + +DN    EN+ + + +  N+         VK+++ +AK  I E+ 
Subjt:  KEEGGEKQFQ---EKNKAPSSSLELLNANASLECMHAEGKETECVIPEDNEDEKNEDNEDDVENEEENLEEGKNEQGKGKKKVVVKIIKSRAKDSIDEL-

Query:  --VGDKRPKRQTRPTKKVLENVNNPKN--DKKRCLPTKLQDIPQ
          +   RPKRQ +P+KKVLENV   KN   KK   P   +  P+
Subjt:  --VGDKRPKRQTRPTKKVLENVNNPKN--DKKRCLPTKLQDIPQ

A0A5D3CVR6 Uncharacterized protein3.1e-2730.88Show/hide
Query:  MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNKPSSIFLQGFPLALVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHINNSIFKASGFS
        M+DDEE   ++PWGR+SF LT+EY +K   +   +  LQGFP  LV WA E+IPKLS    G A RI      RI+ W+ ++   W ++  + FK+S F+
Subjt:  MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNKPSSIFLQGFPLALVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHINNSIFKASGFS

Query:  VVPFIPTEKELQSSYYAYFVEQSLKEMEMEKEQEGEKRS---------MGRKIASLEEGIENLKQGQEEIKKHINENHKEILDMLGYIKEAINDRLPPKE
        V+PF PT  EL S  + +F+  +  E+E E+ +E E  +               SL + I+ +K+  E +K  +     E+L++L  I   IN+R+P   
Subjt:  VVPFIPTEKELQSSYYAYFVEQSLKEMEMEKEQEGEKRS---------MGRKIASLEEGIENLKQGQEEIKKHINENHKEILDMLGYIKEAINDRLPPKE

Query:  EGGEKQFQ---EKNKAPSSSLELLNANASLECMHAEGKETECVIPEDNEDE-----KNED------------NEDDVENEEENLE--------EGKNEQG
           EKQ Q    +N AP+ SL+ L    ++  +       E +  E N +E     KN D            NE+++EN+E N E        E KN + 
Subjt:  EGGEKQFQ---EKNKAPSSSLELLNANASLECMHAEGKETECVIPEDNEDE-----KNED------------NEDDVENEEENLE--------EGKNEQG

Query:  KGKKKVVVKIIKSRAKDSIDELVGDKRPKRQTRPTKKVLE
          ++K + ++ K     +I E   +++P  +    K+++E
Subjt:  KGKKKVVVKIIKSRAKDSIDELVGDKRPKRQTRPTKKVLE

A0A5D3DFX6 Uncharacterized protein8.6e-3837.92Show/hide
Query:  MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNKPSS----------IFLQGFPLALVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHIN
        MLDD+EKF++YPWGR+ F LT ++ +  V +K  S           FLQGFP+ L YWA+E++P+L++   G+  RI     PRI+ WES E  DWQ + 
Subjt:  MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNKPSS----------IFLQGFPLALVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHIN

Query:  NSIFKASGFSVVPFIPTEKELQSSYYAYFVEQSLKEMEMEKEQEGEKRS-MGRKIASLEEGIENLKQGQEEIKKHINENHKEILDMLGYIKEAINDRLPP
        ++IF     SVVP + +E+E++S Y+ YF+E  L+  ++++E    K S M  +I+SL E I++LK+  +E  + + +  +EI+ +L  + E +N RL  
Subjt:  NSIFKASGFSVVPFIPTEKELQSSYYAYFVEQSLKEMEMEKEQEGEKRS-MGRKIASLEEGIENLKQGQEEIKKHINENHKEILDMLGYIKEAINDRLPP

Query:  KEEGGEKQFQ---EKNKAPSSSLELLNANASLECMHAEGKETECVIPEDNEDEKNEDNEDDVENEEENLEEGKNEQGKGKKKVVVKIIKSRAKDSIDEL-
        K E  EKQ Q   EKN  P  SLE+++A+          KE E    EDNE+ + +DN    EN+ + + +  N+         VK+++ +AK  I E+ 
Subjt:  KEEGGEKQFQ---EKNKAPSSSLELLNANASLECMHAEGKETECVIPEDNEDEKNEDNEDDVENEEENLEEGKNEQGKGKKKVVVKIIKSRAKDSIDEL-

Query:  --VGDKRPKRQTRPTKKVLENVNNPKN
          +   RPKRQ +P+KKVLENV   KN
Subjt:  --VGDKRPKRQTRPTKKVLENVNNPKN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G32960.1 Domain of unknown function (DUF1985)1.1e-0529.17Show/hide
Query:  DEEKFKSYPWGRVSFELTLEYFKKGV---LNKPSSIFLQGFPLALVYWAFEVIPKL----------SNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHIN
        D EK  +YPWG  +F + +   KK V   + KP    + GFPLAL  W  E IP L            PT    ++  S   P++ Q ++ E        
Subjt:  DEEKFKSYPWGRVSFELTLEYFKKGV---LNKPSSIFLQGFPLALVYWAFEVIPKL----------SNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHIN

Query:  NSIFKASGFSVVPFIPTEKE
         + +    F ++P IP + E
Subjt:  NSIFKASGFSVVPFIPTEKE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGGATGATGAAGAAAAATTTAAATCTTATCCATGGGGTAGAGTGTCTTTTGAACTAACTTTAGAGTATTTTAAGAAGGGTGTCCTCAACAAACCATCATCGATTTT
TCTACAAGGATTTCCTCTTGCCCTGGTTTATTGGGCTTTTGAGGTAATACCAAAACTATCAAATCCTACTGTTGGATTTGCAAAAAGAATACAAAGTGATCATGGGCCAA
GAATTGTTCAATGGGAGTCGCAAGAACCGGAAGATTGGCAACACATCAATAACAGCATCTTCAAAGCCTCTGGTTTTTCTGTTGTTCCGTTTATTCCAACGGAAAAGGAA
TTACAATCAAGCTATTATGCCTATTTTGTGGAGCAATCATTGAAGGAGATGGAGATGGAAAAGGAGCAAGAAGGAGAAAAAAGATCAATGGGAAGAAAAATAGCTTCATT
AGAGGAAGGCATTGAAAATTTGAAACAAGGACAAGAAGAGATCAAGAAGCACATCAATGAAAATCATAAAGAGATACTTGACATGTTGGGCTACATTAAAGAAGCCATCA
ACGATAGACTTCCGCCAAAAGAAGAAGGTGGCGAGAAACAATTTCAAGAAAAGAACAAAGCCCCATCATCCAGCTTGGAGCTTTTAAATGCAAATGCAAGTTTAGAATGT
ATGCATGCTGAAGGCAAAGAGACTGAATGTGTAATTCCCGAGGACAACGAGGATGAAAAAAATGAGGACAATGAAGATGATGTGGAAAATGAAGAGGAAAATCTTGAAGA
AGGAAAAAATGAACAAGGAAAGGGCAAAAAGAAGGTTGTTGTAAAGATAATTAAAAGTAGAGCAAAAGATAGCATAGATGAGCTTGTTGGTGACAAGAGGCCAAAGAGAC
AAACAAGACCAACCAAAAAAGTGTTGGAGAATGTAAACAATCCAAAGAATGACAAAAAAAGGTGTCTCCCAACAAAGCTACAAGACATTCCCCAAGGTGGAATGATATTG
CTCCTCCAAGCTTCGATCTCAAAATATCACAAATTGATGGGAGTCATTACTGATAGCATAATATCAGTGATGGATATCATAACTGATAGGATTATATCAGTGATGGATAT
CATCACTGATAGCATTATATCAATGATGAAAGTCATCATTGAAGTGATGTTAGTCATCACTGATAACATGATATCAGTGACAGAACAAGTTACAAGGCAATATCTCAATA
TCATTGACAACAAACACTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTGGATGATGAAGAAAAATTTAAATCTTATCCATGGGGTAGAGTGTCTTTTGAACTAACTTTAGAGTATTTTAAGAAGGGTGTCCTCAACAAACCATCATCGATTTT
TCTACAAGGATTTCCTCTTGCCCTGGTTTATTGGGCTTTTGAGGTAATACCAAAACTATCAAATCCTACTGTTGGATTTGCAAAAAGAATACAAAGTGATCATGGGCCAA
GAATTGTTCAATGGGAGTCGCAAGAACCGGAAGATTGGCAACACATCAATAACAGCATCTTCAAAGCCTCTGGTTTTTCTGTTGTTCCGTTTATTCCAACGGAAAAGGAA
TTACAATCAAGCTATTATGCCTATTTTGTGGAGCAATCATTGAAGGAGATGGAGATGGAAAAGGAGCAAGAAGGAGAAAAAAGATCAATGGGAAGAAAAATAGCTTCATT
AGAGGAAGGCATTGAAAATTTGAAACAAGGACAAGAAGAGATCAAGAAGCACATCAATGAAAATCATAAAGAGATACTTGACATGTTGGGCTACATTAAAGAAGCCATCA
ACGATAGACTTCCGCCAAAAGAAGAAGGTGGCGAGAAACAATTTCAAGAAAAGAACAAAGCCCCATCATCCAGCTTGGAGCTTTTAAATGCAAATGCAAGTTTAGAATGT
ATGCATGCTGAAGGCAAAGAGACTGAATGTGTAATTCCCGAGGACAACGAGGATGAAAAAAATGAGGACAATGAAGATGATGTGGAAAATGAAGAGGAAAATCTTGAAGA
AGGAAAAAATGAACAAGGAAAGGGCAAAAAGAAGGTTGTTGTAAAGATAATTAAAAGTAGAGCAAAAGATAGCATAGATGAGCTTGTTGGTGACAAGAGGCCAAAGAGAC
AAACAAGACCAACCAAAAAAGTGTTGGAGAATGTAAACAATCCAAAGAATGACAAAAAAAGGTGTCTCCCAACAAAGCTACAAGACATTCCCCAAGGTGGAATGATATTG
CTCCTCCAAGCTTCGATCTCAAAATATCACAAATTGATGGGAGTCATTACTGATAGCATAATATCAGTGATGGATATCATAACTGATAGGATTATATCAGTGATGGATAT
CATCACTGATAGCATTATATCAATGATGAAAGTCATCATTGAAGTGATGTTAGTCATCACTGATAACATGATATCAGTGACAGAACAAGTTACAAGGCAATATCTCAATA
TCATTGACAACAAACACTAG
Protein sequenceShow/hide protein sequence
MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNKPSSIFLQGFPLALVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHINNSIFKASGFSVVPFIPTEKE
LQSSYYAYFVEQSLKEMEMEKEQEGEKRSMGRKIASLEEGIENLKQGQEEIKKHINENHKEILDMLGYIKEAINDRLPPKEEGGEKQFQEKNKAPSSSLELLNANASLEC
MHAEGKETECVIPEDNEDEKNEDNEDDVENEEENLEEGKNEQGKGKKKVVVKIIKSRAKDSIDELVGDKRPKRQTRPTKKVLENVNNPKNDKKRCLPTKLQDIPQGGMIL
LLQASISKYHKLMGVITDSIISVMDIITDRIISVMDIITDSIISMMKVIIEVMLVITDNMISVTEQVTRQYLNIIDNKH