; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0027159 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0027159
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionUbiquitin-like-specific protease ESD4 isoform X2
Genome locationchr04:25414537..25416118
RNA-Seq ExpressionPI0027159
SyntenyPI0027159
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0043229 - intracellular organelle (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031317.1 uncharacterized protein E6C27_scaffold139G00700 [Cucumis melo var. makuwa]8.9e-2834.23Show/hide
Query:  MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNKPSS----------IFLQGFPLVLVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHIN
        MLDD+EKF++YPWGR+ F LT ++ +  V +K  S           FLQGFP+VL YWA+E++P+L++   G+  RI     PRI+ WES E  DWQ + 
Subjt:  MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNKPSS----------IFLQGFPLVLVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHIN

Query:  NSIF----------KASGQSLK--------EMEMEKEQEG----EKRSMGRKIASLEEGIEILKQDKKRS----RSTSMKIIKRYLTCWAILKKPSTIDF
        ++IF           AS + +K        E+E +K  E     ++ +M  +I+SL E I+ LK+  K      +    +I++  L+   ++ +  +   
Subjt:  NSIF----------KASGQSLK--------EMEMEKEQEG----EKRSMGRKIASLEEGIEILKQDKKRS----RSTSMKIIKRYLTCWAILKKPSTIDF

Query:  RQKKKTECVIPEDNED-------EKNEDNEDDVENEEENLEEGKNEQGKGK----KKVVVKIIKSRAKDSIDEL---VGDKRPKRQTRPTKKVLENVNNP
          +K+++  + ++          + + D E ++E+ EE  E+  + + K K        VK+++ +AK  I E+   +   RPKRQ +P+KKVLENV   
Subjt:  RQKKKTECVIPEDNED-------EKNEDNEDDVENEEENLEEGKNEQGKGK----KKVVVKIIKSRAKDSIDEL---VGDKRPKRQTRPTKKVLENVNNP

Query:  KN--DKKKVSPNKATRHSPRWNDIAPPSFDLKISQM
        KN   KK  SP K TR+SPR  D+A PSFDL+ISQ+
Subjt:  KN--DKKKVSPNKATRHSPRWNDIAPPSFDLKISQM

KAA0059898.1 uncharacterized protein E6C27_scaffold108G001930 [Cucumis melo var. makuwa]5.4e-2535.03Show/hide
Query:  MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNKPSS----------IFLQGFPLVLVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHIN
        MLDD+EKF++YPWGR+ F LT ++ +  V +K  S           FLQGFP+VL YWA+E++P+L++   G+  RI     PRI+ WES E  DWQ + 
Subjt:  MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNKPSS----------IFLQGFPLVLVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHIN

Query:  NSIFKASGQSL-----KEMEMEKE----------QEGEKRSMGRKIASLEEGIEILKQDKKRSRSTSMKIIKRYLTCWAILKKPSTIDFRQKKKTECVIP
        ++IF     S+      E EM+ E           + ++ +   K +++ E I  L +D +  + T+ +  +       +L++ +  D      T+    
Subjt:  NSIFKASGQSL-----KEMEMEKE----------QEGEKRSMGRKIASLEEGIEILKQDKKRSRSTSMKIIKRYLTCWAILKKPSTIDFRQKKKTECVIP

Query:  EDNEDEKNEDNEDDVENEEENLEEGKNEQGKGKKKVVVKIIKSRAKDSIDEL---VGDKRPKRQTRPTKKVLENVNNPKN--DKKKVSPNKATRHSPRWN
         DNE+ + +DN    EN+ + + +  N+         VK+++ +AK  I E+   +   RPKRQ +P+KKVLENV   KN   KK  SP K TR+SPR  
Subjt:  EDNEDEKNEDNEDDVENEEENLEEGKNEQGKGKKKVVVKIIKSRAKDSIDEL---VGDKRPKRQTRPTKKVLENVNNPKN--DKKKVSPNKATRHSPRWN

Query:  DIAPPSFDLKISQM
        D+A PSFDL+ISQ+
Subjt:  DIAPPSFDLKISQM

TYK08201.1 40S ribosomal protein S15-4 [Cucumis melo var. makuwa]1.9e-2533.44Show/hide
Query:  MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNKPSS----------IFLQGFPLVLVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHIN
        MLDD+EKF++YPWGR+ F LT ++ +  V +K  S           FLQGFP+VL YWA+E++P+L++   G+  RI     PRI+ WES E  DWQ + 
Subjt:  MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNKPSS----------IFLQGFPLVLVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHIN

Query:  NSIF----------KASGQSLK--------EMEMEKEQEG----EKRSMGRKIASLEEGIEILKQDKKRS----RSTSMKIIKRYLTCWAILKKPSTIDF
        ++IF           AS + +K        E+E +K  E     ++ +M  +I+SL E I+ LK+  K      +    +I++  L+   ++ +  +   
Subjt:  NSIF----------KASGQSLK--------EMEMEKEQEG----EKRSMGRKIASLEEGIEILKQDKKRS----RSTSMKIIKRYLTCWAILKKPSTIDF

Query:  RQKKKTECVIPEDNED-------EKNEDNEDDVENEEENLEEGKNEQGKGK----KKVVVKIIKSRAKDSIDEL---VGDKRPKRQTRPTKKVLENVNNP
          +K+++  + ++          + + D E ++E+ EE  E+  + + K K        VK+++ +AK  I E+   +   RPKRQ +P+KKVLENV   
Subjt:  RQKKKTECVIPEDNED-------EKNEDNEDDVENEEENLEEGKNEQGKGK----KKVVVKIIKSRAKDSIDEL---VGDKRPKRQTRPTKKVLENVNNP

Query:  KN--DKKKVSPNKATRHSPRWNDIAP
        KN   KK  SP K TRHSPR  D+AP
Subjt:  KN--DKKKVSPNKATRHSPRWNDIAP

TYK22453.1 uncharacterized protein E5676_scaffold3009G00010 [Cucumis melo var. makuwa]8.9e-2834.23Show/hide
Query:  MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNKPSS----------IFLQGFPLVLVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHIN
        MLDD+EKF++YPWGR+ F LT ++ +  V +K  S           FLQGFP+VL YWA+E++P+L++   G+  RI     PRI+ WES E  DWQ + 
Subjt:  MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNKPSS----------IFLQGFPLVLVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHIN

Query:  NSIF----------KASGQSLK--------EMEMEKEQEG----EKRSMGRKIASLEEGIEILKQDKKRS----RSTSMKIIKRYLTCWAILKKPSTIDF
        ++IF           AS + +K        E+E +K  E     ++ +M  +I+SL E I+ LK+  K      +    +I++  L+   ++ +  +   
Subjt:  NSIF----------KASGQSLK--------EMEMEKEQEG----EKRSMGRKIASLEEGIEILKQDKKRS----RSTSMKIIKRYLTCWAILKKPSTIDF

Query:  RQKKKTECVIPEDNED-------EKNEDNEDDVENEEENLEEGKNEQGKGK----KKVVVKIIKSRAKDSIDEL---VGDKRPKRQTRPTKKVLENVNNP
          +K+++  + ++          + + D E ++E+ EE  E+  + + K K        VK+++ +AK  I E+   +   RPKRQ +P+KKVLENV   
Subjt:  RQKKKTECVIPEDNED-------EKNEDNEDDVENEEENLEEGKNEQGKGK----KKVVVKIIKSRAKDSIDEL---VGDKRPKRQTRPTKKVLENVNNP

Query:  KN--DKKKVSPNKATRHSPRWNDIAPPSFDLKISQM
        KN   KK  SP K TR+SPR  D+A PSFDL+ISQ+
Subjt:  KN--DKKKVSPNKATRHSPRWNDIAPPSFDLKISQM

XP_038891747.1 pescadillo homolog [Benincasa hispida]1.2e-6150.48Show/hide
Query:  MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNKPSSIFLQGFPLVLVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHINNSIFKASG--
        MLDDEE+F+SYPWGRVSFELT+E+FKK V+NKPSSIFLQGFPL LVYWAFE+IP+LSNPT+GFA+RI+SD GPR+ QWESQEP DWQHINN+IFKA+G  
Subjt:  MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNKPSSIFLQGFPLVLVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHINNSIFKASG--

Query:  -------QSLKEMEMEKEQE------GEKRSMGRKIASLEEGIEILKQ---DKKRSRSTSMKIIKRYLTCWAILKKPSTIDFRQKKKTECVIPEDNEDEK
               + +K+   EK +E        K+++  K+   EEG E   Q   +K ++ S+S++++   L    +L+  + ++  + K+ +    +D E+++
Subjt:  -------QSLKEMEMEKEQE------GEKRSMGRKIASLEEGIEILKQ---DKKRSRSTSMKIIKRYLTCWAILKKPSTIDFRQKKKTECVIPEDNEDEK

Query:  NEDNEDDVENEEENLEEGKNEQGKGKKKVVVKIIKS-----RAKDSIDELVGDKRPKRQTRPTKKVLENVNNPKNDKKKVSPN--------KATRHSPRW
         E   DD E+EEE+ EE K+++   KK V+ K +K      R K  I+ +  +KRPKRQ +PTKK+LEN    K DKKKVSPN        KATR SPRW
Subjt:  NEDNEDDVENEEENLEEGKNEQGKGKKKVVVKIIKS-----RAKDSIDELVGDKRPKRQTRPTKKVLENVNNPKNDKKKVSPN--------KATRHSPRW

Query:  NDIAPPSFDLKISQM
        ND A PSFDLKISQ+
Subjt:  NDIAPPSFDLKISQM

TrEMBL top hitse value%identityAlignment
A0A5A7UXF1 Uncharacterized protein2.6e-2535.03Show/hide
Query:  MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNKPSS----------IFLQGFPLVLVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHIN
        MLDD+EKF++YPWGR+ F LT ++ +  V +K  S           FLQGFP+VL YWA+E++P+L++   G+  RI     PRI+ WES E  DWQ + 
Subjt:  MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNKPSS----------IFLQGFPLVLVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHIN

Query:  NSIFKASGQSL-----KEMEMEKE----------QEGEKRSMGRKIASLEEGIEILKQDKKRSRSTSMKIIKRYLTCWAILKKPSTIDFRQKKKTECVIP
        ++IF     S+      E EM+ E           + ++ +   K +++ E I  L +D +  + T+ +  +       +L++ +  D      T+    
Subjt:  NSIFKASGQSL-----KEMEMEKE----------QEGEKRSMGRKIASLEEGIEILKQDKKRSRSTSMKIIKRYLTCWAILKKPSTIDFRQKKKTECVIP

Query:  EDNEDEKNEDNEDDVENEEENLEEGKNEQGKGKKKVVVKIIKSRAKDSIDEL---VGDKRPKRQTRPTKKVLENVNNPKN--DKKKVSPNKATRHSPRWN
         DNE+ + +DN    EN+ + + +  N+         VK+++ +AK  I E+   +   RPKRQ +P+KKVLENV   KN   KK  SP K TR+SPR  
Subjt:  EDNEDEKNEDNEDDVENEEENLEEGKNEQGKGKKKVVVKIIKSRAKDSIDEL---VGDKRPKRQTRPTKKVLENVNNPKN--DKKKVSPNKATRHSPRWN

Query:  DIAPPSFDLKISQM
        D+A PSFDL+ISQ+
Subjt:  DIAPPSFDLKISQM

A0A5A7VHY6 Ubiquitin-like-specific protease ESD4 isoform X21.0e-2135.55Show/hide
Query:  MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNK----------PSSIFLQGFPLVLVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHIN
        MLDD+EKF++YPWGR+ F LT ++ +  V +K           +  FLQGFP+VL YWA+E++P+L++   G+  RI     PRI+ WES        +N
Subjt:  MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNK----------PSSIFLQGFPLVLVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHIN

Query:  NSIFK-ASGQSLKEMEMEKEQEGEKRS-MGRKIASLEEGIEILKQDKKRSRSTSMKIIKRYLTCWAILKKPSTIDFRQKKKTECVIPEDNEDEKNEDNED
        NSI K     S  +  +++E    K S M  +I+SL E I+ LK+  K   +   K  +  +    +L     ++ R   K E         EKN     
Subjt:  NSIFK-ASGQSLKEMEMEKEQEGEKRS-MGRKIASLEEGIEILKQDKKRSRSTSMKIIKRYLTCWAILKKPSTIDFRQKKKTECVIPEDNEDEKNEDNED

Query:  DVENEEENLEEGKNEQGKGKKKVVVKIIKSRAKDSIDEL---VGDKRPKRQTRPTKKVLENVNNPKN--DKKKVSPNKATRHSPRWNDIAPPSFDLKISQ
         +E  + ++++ K            ++++ +AK  I E+   +   RPKRQ +P+KKVLENV   KN   KK  SP K TRHSPR  D+A PSF L+ISQ
Subjt:  DVENEEENLEEGKNEQGKGKKKVVVKIIKSRAKDSIDEL---VGDKRPKRQTRPTKKVLENVNNPKN--DKKKVSPNKATRHSPRWNDIAPPSFDLKISQ

Query:  M
        +
Subjt:  M

A0A5A7VIA9 Uncharacterized protein4.3e-2834.23Show/hide
Query:  MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNKPSS----------IFLQGFPLVLVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHIN
        MLDD+EKF++YPWGR+ F LT ++ +  V +K  S           FLQGFP+VL YWA+E++P+L++   G+  RI     PRI+ WES E  DWQ + 
Subjt:  MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNKPSS----------IFLQGFPLVLVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHIN

Query:  NSIF----------KASGQSLK--------EMEMEKEQEG----EKRSMGRKIASLEEGIEILKQDKKRS----RSTSMKIIKRYLTCWAILKKPSTIDF
        ++IF           AS + +K        E+E +K  E     ++ +M  +I+SL E I+ LK+  K      +    +I++  L+   ++ +  +   
Subjt:  NSIF----------KASGQSLK--------EMEMEKEQEG----EKRSMGRKIASLEEGIEILKQDKKRS----RSTSMKIIKRYLTCWAILKKPSTIDF

Query:  RQKKKTECVIPEDNED-------EKNEDNEDDVENEEENLEEGKNEQGKGK----KKVVVKIIKSRAKDSIDEL---VGDKRPKRQTRPTKKVLENVNNP
          +K+++  + ++          + + D E ++E+ EE  E+  + + K K        VK+++ +AK  I E+   +   RPKRQ +P+KKVLENV   
Subjt:  RQKKKTECVIPEDNED-------EKNEDNEDDVENEEENLEEGKNEQGKGK----KKVVVKIIKSRAKDSIDEL---VGDKRPKRQTRPTKKVLENVNNP

Query:  KN--DKKKVSPNKATRHSPRWNDIAPPSFDLKISQM
        KN   KK  SP K TR+SPR  D+A PSFDL+ISQ+
Subjt:  KN--DKKKVSPNKATRHSPRWNDIAPPSFDLKISQM

A0A5D3C8N2 40S ribosomal protein S15-49.0e-2633.44Show/hide
Query:  MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNKPSS----------IFLQGFPLVLVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHIN
        MLDD+EKF++YPWGR+ F LT ++ +  V +K  S           FLQGFP+VL YWA+E++P+L++   G+  RI     PRI+ WES E  DWQ + 
Subjt:  MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNKPSS----------IFLQGFPLVLVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHIN

Query:  NSIF----------KASGQSLK--------EMEMEKEQEG----EKRSMGRKIASLEEGIEILKQDKKRS----RSTSMKIIKRYLTCWAILKKPSTIDF
        ++IF           AS + +K        E+E +K  E     ++ +M  +I+SL E I+ LK+  K      +    +I++  L+   ++ +  +   
Subjt:  NSIF----------KASGQSLK--------EMEMEKEQEG----EKRSMGRKIASLEEGIEILKQDKKRS----RSTSMKIIKRYLTCWAILKKPSTIDF

Query:  RQKKKTECVIPEDNED-------EKNEDNEDDVENEEENLEEGKNEQGKGK----KKVVVKIIKSRAKDSIDEL---VGDKRPKRQTRPTKKVLENVNNP
          +K+++  + ++          + + D E ++E+ EE  E+  + + K K        VK+++ +AK  I E+   +   RPKRQ +P+KKVLENV   
Subjt:  RQKKKTECVIPEDNED-------EKNEDNEDDVENEEENLEEGKNEQGKGK----KKVVVKIIKSRAKDSIDEL---VGDKRPKRQTRPTKKVLENVNNP

Query:  KN--DKKKVSPNKATRHSPRWNDIAP
        KN   KK  SP K TRHSPR  D+AP
Subjt:  KN--DKKKVSPNKATRHSPRWNDIAP

A0A5D3DFX6 Uncharacterized protein4.3e-2834.23Show/hide
Query:  MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNKPSS----------IFLQGFPLVLVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHIN
        MLDD+EKF++YPWGR+ F LT ++ +  V +K  S           FLQGFP+VL YWA+E++P+L++   G+  RI     PRI+ WES E  DWQ + 
Subjt:  MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNKPSS----------IFLQGFPLVLVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHIN

Query:  NSIF----------KASGQSLK--------EMEMEKEQEG----EKRSMGRKIASLEEGIEILKQDKKRS----RSTSMKIIKRYLTCWAILKKPSTIDF
        ++IF           AS + +K        E+E +K  E     ++ +M  +I+SL E I+ LK+  K      +    +I++  L+   ++ +  +   
Subjt:  NSIF----------KASGQSLK--------EMEMEKEQEG----EKRSMGRKIASLEEGIEILKQDKKRS----RSTSMKIIKRYLTCWAILKKPSTIDF

Query:  RQKKKTECVIPEDNED-------EKNEDNEDDVENEEENLEEGKNEQGKGK----KKVVVKIIKSRAKDSIDEL---VGDKRPKRQTRPTKKVLENVNNP
          +K+++  + ++          + + D E ++E+ EE  E+  + + K K        VK+++ +AK  I E+   +   RPKRQ +P+KKVLENV   
Subjt:  RQKKKTECVIPEDNED-------EKNEDNEDDVENEEENLEEGKNEQGKGK----KKVVVKIIKSRAKDSIDEL---VGDKRPKRQTRPTKKVLENVNNP

Query:  KN--DKKKVSPNKATRHSPRWNDIAPPSFDLKISQM
        KN   KK  SP K TR+SPR  D+A PSFDL+ISQ+
Subjt:  KN--DKKKVSPNKATRHSPRWNDIAPPSFDLKISQM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGGATGATGAAGAAAAATTTAAATCTTATCCATGGGGTAGAGTGTCTTTTGAACTAACTTTAGAGTATTTTAAGAAGGGTGTCCTCAACAAACCATCATCGATTTT
TCTACAAGGATTTCCTCTTGTCCTGGTTTATTGGGCTTTTGAGGTAATACCAAAACTATCAAATCCTACTGTTGGATTTGCAAAAAGAATACAAAGTGATCATGGGCCAA
GAATTGTTCAATGGGAGTCGCAAGAACCGGAAGATTGGCAACACATCAATAACAGCATCTTCAAAGCCTCTGGTCAATCATTGAAGGAGATGGAGATGGAAAAGGAGCAA
GAAGGAGAAAAAAGATCAATGGGAAGAAAAATAGCTTCATTAGAGGAAGGCATTGAAATTTTGAAACAAGACAAGAAGAGATCAAGAAGCACATCAATGAAAATCATAAA
GAGATACTTGACATGTTGGGCTATATTAAAGAAGCCATCAACGATAGACTTCCGCCAAAAGAAGAAGACTGAATGTGTAATTCCCGAGGACAACGAGGATGAAAAAAATG
AGGACAATGAAGATGATGTGGAAAATGAAGAGGAAAATCTTGAAGAAGGAAAAAATGAACAAGGAAAGGGCAAAAAGAAGGTTGTTGTAAAGATAATTAAAAGTAGAGCA
AAAGATAGCATAGATGAGCTTGTTGGTGACAAGAGGCCAAAGAGACAAACAAGACCAACCAAAAAAGTGTTGGAGAATGTAAACAATCCAAAGAATGACAAAAAAAAGGT
GTCTCCCAACAAAGCTACAAGACATTCCCCAAGGTGGAATGATATTGCTCCTCCAAGCTTCGATCTCAAAATATCACAAATGTAA
mRNA sequenceShow/hide mRNA sequence
ATGTTGGATGATGAAGAAAAATTTAAATCTTATCCATGGGGTAGAGTGTCTTTTGAACTAACTTTAGAGTATTTTAAGAAGGGTGTCCTCAACAAACCATCATCGATTTT
TCTACAAGGATTTCCTCTTGTCCTGGTTTATTGGGCTTTTGAGGTAATACCAAAACTATCAAATCCTACTGTTGGATTTGCAAAAAGAATACAAAGTGATCATGGGCCAA
GAATTGTTCAATGGGAGTCGCAAGAACCGGAAGATTGGCAACACATCAATAACAGCATCTTCAAAGCCTCTGGTCAATCATTGAAGGAGATGGAGATGGAAAAGGAGCAA
GAAGGAGAAAAAAGATCAATGGGAAGAAAAATAGCTTCATTAGAGGAAGGCATTGAAATTTTGAAACAAGACAAGAAGAGATCAAGAAGCACATCAATGAAAATCATAAA
GAGATACTTGACATGTTGGGCTATATTAAAGAAGCCATCAACGATAGACTTCCGCCAAAAGAAGAAGACTGAATGTGTAATTCCCGAGGACAACGAGGATGAAAAAAATG
AGGACAATGAAGATGATGTGGAAAATGAAGAGGAAAATCTTGAAGAAGGAAAAAATGAACAAGGAAAGGGCAAAAAGAAGGTTGTTGTAAAGATAATTAAAAGTAGAGCA
AAAGATAGCATAGATGAGCTTGTTGGTGACAAGAGGCCAAAGAGACAAACAAGACCAACCAAAAAAGTGTTGGAGAATGTAAACAATCCAAAGAATGACAAAAAAAAGGT
GTCTCCCAACAAAGCTACAAGACATTCCCCAAGGTGGAATGATATTGCTCCTCCAAGCTTCGATCTCAAAATATCACAAATGTAA
Protein sequenceShow/hide protein sequence
MLDDEEKFKSYPWGRVSFELTLEYFKKGVLNKPSSIFLQGFPLVLVYWAFEVIPKLSNPTVGFAKRIQSDHGPRIVQWESQEPEDWQHINNSIFKASGQSLKEMEMEKEQ
EGEKRSMGRKIASLEEGIEILKQDKKRSRSTSMKIIKRYLTCWAILKKPSTIDFRQKKKTECVIPEDNEDEKNEDNEDDVENEEENLEEGKNEQGKGKKKVVVKIIKSRA
KDSIDELVGDKRPKRQTRPTKKVLENVNNPKNDKKKVSPNKATRHSPRWNDIAPPSFDLKISQM