; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0012653 (gene) of Chayote v1 genome

Gene IDSed0012653
OrganismSechium edule (Chayote v1)
DescriptionULP_PROTEASE domain-containing protein
Genome locationLG03:17188932..17199713
RNA-Seq ExpressionSed0012653
SyntenySed0012653
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsIPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8649224.1 hypothetical protein Csa_014966 [Cucumis sativus]1.8e-3034.77Show/hide
Query:  MPCQLAVEYEGQTVVIGTMYESSGSSVRVHRHLLGKHNVRVSVDLIFEGNEDVDLPIPINEELQYLGQVIKSFVPWPRSLIRTPTTTKKHYGPIKATSSK
        +PC LA+      V IG M+ES      +H   LG  N+RV+VD+I    EDV LPIP+  E++ L Q I +FV WPR L+     T++   P  A +  
Subjt:  MPCQLAVEYEGQTVVIGTMYESSGSSVRVHRHLLGKHNVRVSVDLIFEGNEDVDLPIPINEELQYLGQVIKSFVPWPRSLIRTPTTTKKHYGPIKATSSK

Query:  KVVSDSHSHQLNTYGPTMVKVVHNYARDKLGPDDLLGIPMARNMLGVDAPEYFYIAQEEILQYCNMEEIGYTPILYYICYLWSTCDQETLGNYLLVDGSE
           S  ++    T     +K+++ YA   +   D++ I +  ++ G +  +  Y+  ++I+QYC M EIGY+ IL YI  LW+ CD E    ++LVD + 
Subjt:  KVVSDSHSHQLNTYGPTMVKVVHNYARDKLGPDDLLGIPMARNMLGVDAPEYFYIAQEEILQYCNMEEIGYTPILYYICYLWSTCDQETLGNYLLVDGSE

Query:  FTTALDSTNEEDRVSCLANRLDKLTHLKQRVFFPFNSGN-HWIFLVLIPAENTLYV
         ++ + S  +E+R   L +RL+ + +L Q V  P+N+G  HWI +V+   EN +YV
Subjt:  FTTALDSTNEEDRVSCLANRLDKLTHLKQRVFFPFNSGN-HWIFLVLIPAENTLYV

XP_008451868.1 PREDICTED: uncharacterized protein LOC103493028 isoform X1 [Cucumis melo]2.1e-3133.59Show/hide
Query:  MPCQLAVEYEGQTVVIGTMYESSGSSVRVHRHLLGKHNVRVSVDLIFEGNEDVDLPIPINEELQYLGQVIKSFVPWPRSLIRTPTTTKKHYGPIKATSSK
        +PC LA+      V +G M+ES      +H   LG  N+RV+VD+     EDV LPIP+  +++ L Q I +FV WPR L+     TK+   P    S  
Subjt:  MPCQLAVEYEGQTVVIGTMYESSGSSVRVHRHLLGKHNVRVSVDLIFEGNEDVDLPIPINEELQYLGQVIKSFVPWPRSLIRTPTTTKKHYGPIKATSSK

Query:  KVVSDSHSHQLNTYGPTMVKVVHNYARDKLGPDDLLGIPMARNMLGVDAPEYFYIAQEEILQYCNMEEIGYTPILYYICYLWSTCDQETLGNYLLVDGSE
           S  ++    T     +K+++ YA   +  +D++ I ++ ++ G +  +  Y+ +++I+QYC M EIGY+ IL YI  LW+ C+ E    ++LVD + 
Subjt:  KVVSDSHSHQLNTYGPTMVKVVHNYARDKLGPDDLLGIPMARNMLGVDAPEYFYIAQEEILQYCNMEEIGYTPILYYICYLWSTCDQETLGNYLLVDGSE

Query:  FTTALDSTNEEDRVSCLANRLDKLTHLKQRVFFPFNSGN-HWIFLVLIPAENTLYV
         ++ + S  +E+R   L NRL+ + +L Q V  P+N+G  HWI +++   EN +YV
Subjt:  FTTALDSTNEEDRVSCLANRLDKLTHLKQRVFFPFNSGN-HWIFLVLIPAENTLYV

XP_031740251.1 uncharacterized protein LOC101213947 [Cucumis sativus]1.8e-3034.77Show/hide
Query:  MPCQLAVEYEGQTVVIGTMYESSGSSVRVHRHLLGKHNVRVSVDLIFEGNEDVDLPIPINEELQYLGQVIKSFVPWPRSLIRTPTTTKKHYGPIKATSSK
        +PC LA+      V IG M+ES      +H   LG  N+RV+VD+I    EDV LPIP+  E++ L Q I +FV WPR L+     T++   P  A +  
Subjt:  MPCQLAVEYEGQTVVIGTMYESSGSSVRVHRHLLGKHNVRVSVDLIFEGNEDVDLPIPINEELQYLGQVIKSFVPWPRSLIRTPTTTKKHYGPIKATSSK

Query:  KVVSDSHSHQLNTYGPTMVKVVHNYARDKLGPDDLLGIPMARNMLGVDAPEYFYIAQEEILQYCNMEEIGYTPILYYICYLWSTCDQETLGNYLLVDGSE
           S  ++    T     +K+++ YA   +   D++ I +  ++ G +  +  Y+  ++I+QYC M EIGY+ IL YI  LW+ CD E    ++LVD + 
Subjt:  KVVSDSHSHQLNTYGPTMVKVVHNYARDKLGPDDLLGIPMARNMLGVDAPEYFYIAQEEILQYCNMEEIGYTPILYYICYLWSTCDQETLGNYLLVDGSE

Query:  FTTALDSTNEEDRVSCLANRLDKLTHLKQRVFFPFNSGN-HWIFLVLIPAENTLYV
         ++ + S  +E+R   L +RL+ + +L Q V  P+N+G  HWI +V+   EN +YV
Subjt:  FTTALDSTNEEDRVSCLANRLDKLTHLKQRVFFPFNSGN-HWIFLVLIPAENTLYV

XP_038895921.1 uncharacterized protein LOC120084092 isoform X1 [Benincasa hispida]2.5e-3235.16Show/hide
Query:  MPCQLAVEYEGQTVVIGTMYESSGSSVRVHRHLLGKHNVRVSVDLIFEGNEDVDLPIPINEELQYLGQVIKSFVPWPRSLIRTPTTTKKHYGPIKATSSK
        +PC LA+      V +GTM+ES      ++   LG  NVR  VD++    EDV LPIP  ++++ L Q I +FV WPR L+    TTK+   P   T+SK
Subjt:  MPCQLAVEYEGQTVVIGTMYESSGSSVRVHRHLLGKHNVRVSVDLIFEGNEDVDLPIPINEELQYLGQVIKSFVPWPRSLIRTPTTTKKHYGPIKATSSK

Query:  KVVSDSHSHQLNTYGPTMVKVVHNYARDKLGPDDLLGIPMARNMLGVDAPEYFYIAQEEILQYCNMEEIGYTPILYYICYLWSTCDQETLGNYLLVDGSE
         +   S    ++      +K+++ YA   +  DD++ I ++  +LG +  +  Y+ +++I+QYC M EIGY+ IL YI  LW+ CD E    +++VD  +
Subjt:  KVVSDSHSHQLNTYGPTMVKVVHNYARDKLGPDDLLGIPMARNMLGVDAPEYFYIAQEEILQYCNMEEIGYTPILYYICYLWSTCDQETLGNYLLVDGSE

Query:  FTTALDSTNEEDRVSCLANRLDKLTHLKQRVFFPFNSGN-HWIFLVLIPAENTLYV
         T +     +E R   L NRL+ +  L Q V  P+N+G+ HWI +++   EN +YV
Subjt:  FTTALDSTNEEDRVSCLANRLDKLTHLKQRVFFPFNSGN-HWIFLVLIPAENTLYV

XP_038895930.1 uncharacterized protein LOC120084092 isoform X2 [Benincasa hispida]2.9e-3335.29Show/hide
Query:  MPCQLAVEYEGQTVVIGTMYESSGSSVRVHRHLLGKHNVRVSVDLIFEGNEDVDLPIPINEELQYLGQVIKSFVPWPRSLIRTPTTTKKHYGPIKATSSK
        +PC LA+      V +GTM+ES      ++   LG  NVR  VD++    EDV LPIP  ++++ L Q I +FV WPR L+    TTK+   P   T+SK
Subjt:  MPCQLAVEYEGQTVVIGTMYESSGSSVRVHRHLLGKHNVRVSVDLIFEGNEDVDLPIPINEELQYLGQVIKSFVPWPRSLIRTPTTTKKHYGPIKATSSK

Query:  KVVSDSHSHQLNTYGPTMVKVVHNYARDKLGPDDLLGIPMARNMLGVDAPEYFYIAQEEILQYCNMEEIGYTPILYYICYLWSTCDQETLGNYLLVDGSE
         +   S    ++      +K+++ YA   +  DD++ I ++  +LG +  +  Y+ +++I+QYC M EIGY+ IL YI  LW+ CD E    +++VD  +
Subjt:  KVVSDSHSHQLNTYGPTMVKVVHNYARDKLGPDDLLGIPMARNMLGVDAPEYFYIAQEEILQYCNMEEIGYTPILYYICYLWSTCDQETLGNYLLVDGSE

Query:  FTTALDSTNEEDRVSCLANRLDKLTHLKQRVFFPFNSGNHWIFLVLIPAENTLYV
         T +     +E R   L NRL+ +  L Q V  P+N+G HWI +++   EN +YV
Subjt:  FTTALDSTNEEDRVSCLANRLDKLTHLKQRVFFPFNSGNHWIFLVLIPAENTLYV

TrEMBL top hitse value%identityAlignment
A0A1S3BRX5 uncharacterized protein LOC103493028 isoform X11.0e-3133.59Show/hide
Query:  MPCQLAVEYEGQTVVIGTMYESSGSSVRVHRHLLGKHNVRVSVDLIFEGNEDVDLPIPINEELQYLGQVIKSFVPWPRSLIRTPTTTKKHYGPIKATSSK
        +PC LA+      V +G M+ES      +H   LG  N+RV+VD+     EDV LPIP+  +++ L Q I +FV WPR L+     TK+   P    S  
Subjt:  MPCQLAVEYEGQTVVIGTMYESSGSSVRVHRHLLGKHNVRVSVDLIFEGNEDVDLPIPINEELQYLGQVIKSFVPWPRSLIRTPTTTKKHYGPIKATSSK

Query:  KVVSDSHSHQLNTYGPTMVKVVHNYARDKLGPDDLLGIPMARNMLGVDAPEYFYIAQEEILQYCNMEEIGYTPILYYICYLWSTCDQETLGNYLLVDGSE
           S  ++    T     +K+++ YA   +  +D++ I ++ ++ G +  +  Y+ +++I+QYC M EIGY+ IL YI  LW+ C+ E    ++LVD + 
Subjt:  KVVSDSHSHQLNTYGPTMVKVVHNYARDKLGPDDLLGIPMARNMLGVDAPEYFYIAQEEILQYCNMEEIGYTPILYYICYLWSTCDQETLGNYLLVDGSE

Query:  FTTALDSTNEEDRVSCLANRLDKLTHLKQRVFFPFNSGN-HWIFLVLIPAENTLYV
         ++ + S  +E+R   L NRL+ + +L Q V  P+N+G  HWI +++   EN +YV
Subjt:  FTTALDSTNEEDRVSCLANRLDKLTHLKQRVFFPFNSGN-HWIFLVLIPAENTLYV

A0A5D3CYL9 ULP_PROTEASE domain-containing protein1.0e-3133.59Show/hide
Query:  MPCQLAVEYEGQTVVIGTMYESSGSSVRVHRHLLGKHNVRVSVDLIFEGNEDVDLPIPINEELQYLGQVIKSFVPWPRSLIRTPTTTKKHYGPIKATSSK
        +PC LA+      V +G M+ES      +H   LG  N+RV+VD+     EDV LPIP+  +++ L Q I +FV WPR L+     TK+   P    S  
Subjt:  MPCQLAVEYEGQTVVIGTMYESSGSSVRVHRHLLGKHNVRVSVDLIFEGNEDVDLPIPINEELQYLGQVIKSFVPWPRSLIRTPTTTKKHYGPIKATSSK

Query:  KVVSDSHSHQLNTYGPTMVKVVHNYARDKLGPDDLLGIPMARNMLGVDAPEYFYIAQEEILQYCNMEEIGYTPILYYICYLWSTCDQETLGNYLLVDGSE
           S  ++    T     +K+++ YA   +  +D++ I ++ ++ G +  +  Y+ +++I+QYC M EIGY+ IL YI  LW+ C+ E    ++LVD + 
Subjt:  KVVSDSHSHQLNTYGPTMVKVVHNYARDKLGPDDLLGIPMARNMLGVDAPEYFYIAQEEILQYCNMEEIGYTPILYYICYLWSTCDQETLGNYLLVDGSE

Query:  FTTALDSTNEEDRVSCLANRLDKLTHLKQRVFFPFNSGN-HWIFLVLIPAENTLYV
         ++ + S  +E+R   L NRL+ + +L Q V  P+N+G  HWI +++   EN +YV
Subjt:  FTTALDSTNEEDRVSCLANRLDKLTHLKQRVFFPFNSGN-HWIFLVLIPAENTLYV

A0A6J1C2H7 uncharacterized protein LOC111007859 isoform X11.6e-2932.69Show/hide
Query:  PCQLAVEYEGQTVVIGTMYESSGSSVRVHRHLLGKHNVRVSVDLIFEGNEDVDLPIPINEELQYLGQVIKSFVPWPRSLIRTPTTTKKHYGPIKATSSKK
        PC LAVE     V +GT+++++     VH   LG  NVRV VD++ +  E   +PIP+  E++ L Q I  FV WPR L+                S +K
Subjt:  PCQLAVEYEGQTVVIGTMYESSGSSVRVHRHLLGKHNVRVSVDLIFEGNEDVDLPIPINEELQYLGQVIKSFVPWPRSLIRTPTTTKKHYGPIKATSSKK

Query:  VVSDSHSHQLNTY------GPTMVKVVHNYARDKLGPDDLLGIPMARNMLGVDAPEYFYIAQEEILQYCNMEEIGYTPILYYICYLWSTCDQETLGNYLL
         +S S + Q  T           +K+++ Y    +  +D + I +++++ G +  +  Y+ + +I+QYC M EIGY+ IL YI YLW   + E    +L+
Subjt:  VVSDSHSHQLNTY------GPTMVKVVHNYARDKLGPDDLLGIPMARNMLGVDAPEYFYIAQEEILQYCNMEEIGYTPILYYICYLWSTCDQETLGNYLL

Query:  VDGSEFTTALDSTNEEDRVSCLANRLDKLTHLKQRVFFPFNSGNHWIFLVLIPAENTLYV
        VD +  +  + S  +E R   LANRL+ + +L+Q V  P+ SG HW+ +++   EN +YV
Subjt:  VDGSEFTTALDSTNEEDRVSCLANRLDKLTHLKQRVFFPFNSGNHWIFLVLIPAENTLYV

A0A6J1C2V2 uncharacterized protein LOC111007859 isoform X41.6e-2932.69Show/hide
Query:  PCQLAVEYEGQTVVIGTMYESSGSSVRVHRHLLGKHNVRVSVDLIFEGNEDVDLPIPINEELQYLGQVIKSFVPWPRSLIRTPTTTKKHYGPIKATSSKK
        PC LAVE     V +GT+++++     VH   LG  NVRV VD++ +  E   +PIP+  E++ L Q I  FV WPR L+                S +K
Subjt:  PCQLAVEYEGQTVVIGTMYESSGSSVRVHRHLLGKHNVRVSVDLIFEGNEDVDLPIPINEELQYLGQVIKSFVPWPRSLIRTPTTTKKHYGPIKATSSKK

Query:  VVSDSHSHQLNTY------GPTMVKVVHNYARDKLGPDDLLGIPMARNMLGVDAPEYFYIAQEEILQYCNMEEIGYTPILYYICYLWSTCDQETLGNYLL
         +S S + Q  T           +K+++ Y    +  +D + I +++++ G +  +  Y+ + +I+QYC M EIGY+ IL YI YLW   + E    +L+
Subjt:  VVSDSHSHQLNTY------GPTMVKVVHNYARDKLGPDDLLGIPMARNMLGVDAPEYFYIAQEEILQYCNMEEIGYTPILYYICYLWSTCDQETLGNYLL

Query:  VDGSEFTTALDSTNEEDRVSCLANRLDKLTHLKQRVFFPFNSGNHWIFLVLIPAENTLYV
        VD +  +  + S  +E R   LANRL+ + +L+Q V  P+ SG HW+ +++   EN +YV
Subjt:  VDGSEFTTALDSTNEEDRVSCLANRLDKLTHLKQRVFFPFNSGNHWIFLVLIPAENTLYV

A0A6J1C4J7 uncharacterized protein LOC111007859 isoform X21.6e-2932.69Show/hide
Query:  PCQLAVEYEGQTVVIGTMYESSGSSVRVHRHLLGKHNVRVSVDLIFEGNEDVDLPIPINEELQYLGQVIKSFVPWPRSLIRTPTTTKKHYGPIKATSSKK
        PC LAVE     V +GT+++++     VH   LG  NVRV VD++ +  E   +PIP+  E++ L Q I  FV WPR L+                S +K
Subjt:  PCQLAVEYEGQTVVIGTMYESSGSSVRVHRHLLGKHNVRVSVDLIFEGNEDVDLPIPINEELQYLGQVIKSFVPWPRSLIRTPTTTKKHYGPIKATSSKK

Query:  VVSDSHSHQLNTY------GPTMVKVVHNYARDKLGPDDLLGIPMARNMLGVDAPEYFYIAQEEILQYCNMEEIGYTPILYYICYLWSTCDQETLGNYLL
         +S S + Q  T           +K+++ Y    +  +D + I +++++ G +  +  Y+ + +I+QYC M EIGY+ IL YI YLW   + E    +L+
Subjt:  VVSDSHSHQLNTY------GPTMVKVVHNYARDKLGPDDLLGIPMARNMLGVDAPEYFYIAQEEILQYCNMEEIGYTPILYYICYLWSTCDQETLGNYLL

Query:  VDGSEFTTALDSTNEEDRVSCLANRLDKLTHLKQRVFFPFNSGNHWIFLVLIPAENTLYV
        VD +  +  + S  +E R   LANRL+ + +L+Q V  P+ SG HW+ +++   EN +YV
Subjt:  VDGSEFTTALDSTNEEDRVSCLANRLDKLTHLKQRVFFPFNSGNHWIFLVLIPAENTLYV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCATGTCAACTTGCCGTTGAGTATGAAGGTCAAACTGTTGTCATTGGTACCATGTATGAATCGAGTGGCTCATCTGTTAGGGTACATAGACACCTTTTAGGGAAACA
TAATGTGCGGGTCTCGGTCGATTTGATCTTCGAAGGCAATGAAGACGTAGATCTACCTATTCCCATAAACGAGGAACTTCAGTACCTTGGACAAGTTATAAAATCTTTTG
TGCCATGGCCAAGATCATTGATCAGAACACCTACAACAACCAAGAAGCACTATGGACCAATAAAAGCTACAAGCAGTAAGAAGGTGGTATCAGATTCACACTCTCATCAA
TTGAACACGTATGGTCCAACCATGGTCAAGGTTGTACATAATTATGCAAGAGATAAACTGGGTCCTGATGACTTACTAGGTATACCAATGGCAAGAAACATGTTAGGTGT
TGACGCGCCCGAATATTTTTATATTGCTCAAGAAGAGATTCTACAATATTGTAATATGGAAGAAATAGGCTACACTCCAATACTCTATTACATCTGTTACTTATGGTCTA
CTTGTGATCAAGAGACACTGGGCAATTACTTATTAGTGGACGGTAGCGAGTTCACAACGGCTTTGGATAGCACAAATGAAGAAGATCGAGTTTCATGTCTAGCTAATAGG
TTAGACAAGCTGACACATCTTAAACAACGAGTATTCTTTCCCTTCAATTCTGGGAACCACTGGATTTTTCTTGTCCTAATTCCAGCTGAAAACACATTGTATGTTTTCAC
TCACTCTGTCAAAATCCGTTGA
mRNA sequenceShow/hide mRNA sequence
CTCTCTCTCAATAGTCTTTCCCCTTTTCTCTCTACTTTTCTTTCTCCAACGATAAAACCCTCTTTGCTGCTCTATTTTACCACTAATTTCTCACCACCCTCTTCTATTTC
ACTTGACCTAAATACTTTAGAAACCTCATTGTCGAGAAAATAAAGAAAATCGTAGTTCGAATTGTCAACAATCGTTGTCATTATACTCGACGAAGTCCTTGGATTGTCAA
TTTCGTACCTCTTCTTCTTCTACGTCTATTCTCTCTTATGATTCTTGTTATTATTTCTTCTTTTTAATGCCTAAAATTTAATTGGAACACAATTTTGCCTAATTTACTCT
TTTGAGTTTCATATTTGAGAACGTCATGGACCTCAGATTTTGTTTCCCTTTTCATTTTTGTTGGACATAAGAAAAACATATTATTGTTGTATGTATTTTTTTACTTCTGT
GTTTCAGTAGTAGAATTACATGAATAAGTGGCGGGCTAGAATCGAGAATTTTAAAAAACGCGGAGGGAATTATTGATGGCGGGCCATGACCGCCATTGAAAAATGCGTGT
TTTGAGTTTTGGTGGCGGTCCGTGCCCGCTGTCCATTCATTTAGGTGGCGGGCCGTGCCCGCCTTTGTATATAGCGTGTTATTAATTTCGGTGGCGGGTCGTGCCCGCCA
TCAATAATTTGTGCATTTTCGTCTTCACAACTGGACAGTGTCCTCATTTCAACATTTCCTCATTTCCTCCGCCAAGTGACCAAACCCTTTCGACCCAAAACCCATCTCCA
TCGCCGGCGCCGCTAGTTCTTCGACAAAACTTCTCCATTGTCATTAGTGGGTCGAAAAACCATCTTCGCCGCCGCCATTAACCTCCGGTTTTCATTTCTTCCTCTCCGCC
GCCGCTAGGTTTTGGCATTTCTCCACCTTTTGGGCATTATTTCTTCACAAACTTGAACTTGTTGATGCAAAACTGTGTCCTACCTGTGGAACATCGAGATGGAAGTTCAG
AAGAATATGAAAGAAGGCGGGTAGCGATGATGATGATGATGTTCCCATGAGCGGATGTGCTCGAAGCAGAAATACTAGAGGCGTGATAGTCTTGCGTGAGCTTGCTCAAG
AAAGGATGGCTGGAGAACGACGCACTCTTAGTATAATTCCATGGGTCAAGATGTGGGTGAAGCTTAAAAAAAGTTTCGTAGTTTTATTGGTGTATGTGTTAGAACACATA
TACCTATTATGTACGATTCTTGGAAAACTGTCCCTGAGCAACTAAAAGAGAAGATTTGGGATTCTATAGAGATGTCGTTTGAGATGGATCCAATGTCGAAACATAATGTT
ATGTTATCTATGTCCACCGCATTTAGGACGTTTAGATACAATCTGAACAGAAAATACATCCAGCCATTTCTGGACCAACCTGAGATATTAAGATCTCCACCCGTTAAATA
CTCTTTCATCACACAAGAGCAATGAGATTATTTTGTGAATGTCCGATTTTCAGAAGAATTTATGAGAATTAGTGCAGAGAAAAAAGAGTTGCAAGCTAAACAAAAGTGTC
ATCATCATATGGCTAGAAAGGGATATGGTCAACTGGCTCATAAACTAGCCGATTAGAACTGTGGAAAGAAGGTCGAAAGAAGAAGTCTAACGATCCAAAGAGGAAGAGTA
AAAAGTTCGTCGACGAAGACACAATTCAAACAGCCAACCAAATTATAAGTTGTTGGTTCATTTCTTTTGCCAAAAGATAACTTGAATCTTAAAATCTTTGGTTGTGAATG
TAAAAATCTTTTACCACAAGATGAATTGCAACGTATAAAAGAAGGTGAAGATATTTTGGTCGATGCATTGGGAACGCCAGAACATTGTGGGCGTGTTAAGGGAGTAGGTC
GATTTGTATCTCCATCAATGTTCTTTAGGATGGCTCATCCGAAGTCTAAGATGGGCCAATAACCAACTGAATGAGAAAATCAAGTCCATTACTCTCAACAATTAGACTCC
CCATGTGAGGGGAGTAATAGAGATTCCATAAGGGAAGAAAGTTTTATATTATGATTTATTTTTTTAAAGGTTATGTCATATCAACTAATTATGATCTGTTTCTTTTAAGG
GTATGCCATGTCAACTTGCCGTTGAGTATGAAGGTCAAACTGTTGTCATTGGTACCATGTATGAATCGAGTGGCTCATCTGTTAGGGTACATAGACACCTTTTAGGGAAA
CATAATGTGCGGGTCTCGGTCGATTTGATCTTCGAAGGCAATGAAGACGTAGATCTACCTATTCCCATAAACGAGGAACTTCAGTACCTTGGACAAGTTATAAAATCTTT
TGTGCCATGGCCAAGATCATTGATCAGAACACCTACAACAACCAAGAAGCACTATGGACCAATAAAAGCTACAAGCAGTAAGAAGGTGGTATCAGATTCACACTCTCATC
AATTGAACACGTATGGTCCAACCATGGTCAAGGTTGTACATAATTATGCAAGAGATAAACTGGGTCCTGATGACTTACTAGGTATACCAATGGCAAGAAACATGTTAGGT
GTTGACGCGCCCGAATATTTTTATATTGCTCAAGAAGAGATTCTACAATATTGTAATATGGAAGAAATAGGCTACACTCCAATACTCTATTACATCTGTTACTTATGGTC
TACTTGTGATCAAGAGACACTGGGCAATTACTTATTAGTGGACGGTAGCGAGTTCACAACGGCTTTGGATAGCACAAATGAAGAAGATCGAGTTTCATGTCTAGCTAATA
GGTTAGACAAGCTGACACATCTTAAACAACGAGTATTCTTTCCCTTCAATTCTGGGAACCACTGGATTTTTCTTGTCCTAATTCCAGCTGAAAACACATTGTATGTTTTC
ACTCACTCTGTCAAAATCCGTTGAACCAGAGTTTAGCTCGTGTCATAAATACTGCCTACCGTGTATGACAATTAAGACACACGCCAAAGGCTTTAAAATTAAATTGCAAA
TGGGTGAAGTGCCCTCAACAAACTGGATCAATAGTATGTGGATTTTACGTCCAACTATTCATACGGAAGATGATGCACAACACATCCACGCACCAATTGAAACTTTTTAG
CGTTGGCCCGACCTCGTTCACTCAAGACCAGATTGATGAGAGGACTGAAGTCTTTAATTGATAGAGAAGTCTGTGGCTTTATGATATCCTACAAAAACTCCACCAATATC
GACACCCTTATTGACGGGCTCATAAAGCATTCAAAGTCGAAAAATTGAGCTACGTTGTCTACAATTTCAGCCATTCCAACCTTTGTCGAGCAATGTTTGTTTGACTTTTG
GGGGCACTTCTTAGAGAGATCCATCATGCCTGCTTTTCTAATGGTTGAGGTTTTGCAAGGTTTTGCATAAACTCAACCAAATGATAGATCTTCAGAGATGAGCATTAATA
TATTATTCTTTTTGTATAAGTTAGTG
Protein sequenceShow/hide protein sequence
MPCQLAVEYEGQTVVIGTMYESSGSSVRVHRHLLGKHNVRVSVDLIFEGNEDVDLPIPINEELQYLGQVIKSFVPWPRSLIRTPTTTKKHYGPIKATSSKKVVSDSHSHQ
LNTYGPTMVKVVHNYARDKLGPDDLLGIPMARNMLGVDAPEYFYIAQEEILQYCNMEEIGYTPILYYICYLWSTCDQETLGNYLLVDGSEFTTALDSTNEEDRVSCLANR
LDKLTHLKQRVFFPFNSGNHWIFLVLIPAENTLYVFTHSVKIR