; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G12850 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G12850
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionUlp1-like peptidase
Genome locationClcChr01:24580521..24586852
RNA-Seq ExpressionClc01G12850
SyntenyClc01G12850
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022141784.1 uncharacterized protein LOC111012067 [Momordica charantia]7.9e-1841.18Show/hide
Query:  VLICANFKTKELIVFDSMVVLHSNSDLEHELRMICTNFPSLLAVDGVIESN-IIALDQWRIRRDATVPQQDEGGDCGIFACKFFEYDITGSNMDTITQDR
        V+IC +F   E++V+DS+  +  ++ LE +L ++ T  PSLL    VI  +  + +  WRIRR  + PQQ    DC IF  K+FEYD+TG++++T+ Q+ 
Subjt:  VLICANFKTKELIVFDSMVVLHSNSDLEHELRMICTNFPSLLAVDGVIESN-IIALDQWRIRRDATVPQQDEGGDCGIFACKFFEYDITGSNMDTITQDR

Query:  MSYFRKQYAIQIWANHALF
        M YFR+Q+A Q+W+N A++
Subjt:  MSYFRKQYAIQIWANHALF

XP_022154364.1 uncharacterized protein LOC111021646 [Momordica charantia]5.9e-2146.61Show/hide
Query:  VLICANFKTKELIVFDSMVVLHSNSDLEHELRMICTNFPSLLAVDGV-IESNIIALDQWRIRRDATVPQQDEGGDCGIFACKFFEYDITGSNMDTITQDR
        ++IC +F   ELIV+DS + +     LE EL+ + T  P+L+   GV +    I L  WRIRR ++ PQQ   GDCGIF   FFEYD+T  + DT+TQ R
Subjt:  VLICANFKTKELIVFDSMVVLHSNSDLEHELRMICTNFPSLLAVDGV-IESNIIALDQWRIRRDATVPQQDEGGDCGIFACKFFEYDITGSNMDTITQDR

Query:  MSYFRKQYAIQIWANHAL
        MS+FR+Q+A+Q+WAN ++
Subjt:  MSYFRKQYAIQIWANHAL

XP_022156568.1 uncharacterized protein LOC111023442 [Momordica charantia]8.5e-2042.86Show/hide
Query:  VLICANFKTKELIVFDSMVVLHSNSDLEHELRMICTNFPSLLAVDGVIESN-IIALDQWRIRRDATVPQQDEGGDCGIFACKFFEYDITGSNMDTITQDR
        V+IC +F   E++V+DS+  + S + LE +L+++ T  PSLL    VI     + +  WRIRR  + P+Q   GDCGIF  K+FEYD+T ++++T+ Q+ 
Subjt:  VLICANFKTKELIVFDSMVVLHSNSDLEHELRMICTNFPSLLAVDGVIESN-IIALDQWRIRRDATVPQQDEGGDCGIFACKFFEYDITGSNMDTITQDR

Query:  MSYFRKQYAIQIWANHALF
        MSYFR+Q+A Q+W+N A++
Subjt:  MSYFRKQYAIQIWANHALF

XP_038882332.1 uncharacterized protein LOC120073583 [Benincasa hispida]2.2e-3661.86Show/hide
Query:  VLICANFKTKELIVFDSMVVLHSNSDLEHELRMICTNFPSLLAVDGVIESNIIALDQWRIRRDATVPQQDEGGDCGIFACKFFEYDITGSNMDTITQDRM
        VL+CA+F+ +ELI+FDS++ LH N+DLE E+R++C NFP LL V  V+ES+ + +D+W +RRDA   QQ E GDCG+F  KFFEYD+TGS M T+TQDR 
Subjt:  VLICANFKTKELIVFDSMVVLHSNSDLEHELRMICTNFPSLLAVDGVIESNIIALDQWRIRRDATVPQQDEGGDCGIFACKFFEYDITGSNMDTITQDRM

Query:  SYFRKQYAIQIWANHALF
         YFR+QYAIQIWAN ALF
Subjt:  SYFRKQYAIQIWANHALF

XP_038885861.1 sentrin-specific protease [Benincasa hispida]1.8e-3863.56Show/hide
Query:  VLICANFKTKELIVFDSMVVLHSNSDLEHELRMICTNFPSLLAVDGVIESNIIALDQWRIRRDATVPQQDEGGDCGIFACKFFEYDITGSNMDTITQDRM
        VL+C +F+ +ELIVFDS++VLH N+DLEHE+R +C NF  LL    V+ES+ + +D+W +RRDA VPQQD+ GDCG+F CKFFEYD+TGS MDT+TQDRM
Subjt:  VLICANFKTKELIVFDSMVVLHSNSDLEHELRMICTNFPSLLAVDGVIESNIIALDQWRIRRDATVPQQDEGGDCGIFACKFFEYDITGSNMDTITQDRM

Query:  SYFRKQYAIQIWANHALF
         Y+R+QYAIQI AN  LF
Subjt:  SYFRKQYAIQIWANHALF

TrEMBL top hitse value%identityAlignment
A0A6J1CJT2 uncharacterized protein LOC1110120673.8e-1841.18Show/hide
Query:  VLICANFKTKELIVFDSMVVLHSNSDLEHELRMICTNFPSLLAVDGVIESN-IIALDQWRIRRDATVPQQDEGGDCGIFACKFFEYDITGSNMDTITQDR
        V+IC +F   E++V+DS+  +  ++ LE +L ++ T  PSLL    VI  +  + +  WRIRR  + PQQ    DC IF  K+FEYD+TG++++T+ Q+ 
Subjt:  VLICANFKTKELIVFDSMVVLHSNSDLEHELRMICTNFPSLLAVDGVIESN-IIALDQWRIRRDATVPQQDEGGDCGIFACKFFEYDITGSNMDTITQDR

Query:  MSYFRKQYAIQIWANHALF
        M YFR+Q+A Q+W+N A++
Subjt:  MSYFRKQYAIQIWANHALF

A0A6J1DID7 uncharacterized protein LOC1110207825.6e-1743.7Show/hide
Query:  VLICANFKTKELIVFDSMVVLHSNSDLEHELRMICTNFPSLLAVDGVIESN-IIALDQWRIRRDATVPQQDEGGDCGIFACKFFEYDITGSNMDTITQDR
        V+I  +    +L V+DS+  +    DLE  L+ +CT  P +L   G++    I+    WR+RR  TVPQQ    DCGIF  +FFEYD+TGS MDT+ Q  
Subjt:  VLICANFKTKELIVFDSMVVLHSNSDLEHELRMICTNFPSLLAVDGVIESN-IIALDQWRIRRDATVPQQDEGGDCGIFACKFFEYDITGSNMDTITQDR

Query:  MSYFRKQYAIQIWANHALF
        +S FR+QYA+Q+WA    F
Subjt:  MSYFRKQYAIQIWANHALF

A0A6J1DLV0 uncharacterized protein LOC1110216462.8e-2146.61Show/hide
Query:  VLICANFKTKELIVFDSMVVLHSNSDLEHELRMICTNFPSLLAVDGV-IESNIIALDQWRIRRDATVPQQDEGGDCGIFACKFFEYDITGSNMDTITQDR
        ++IC +F   ELIV+DS + +     LE EL+ + T  P+L+   GV +    I L  WRIRR ++ PQQ   GDCGIF   FFEYD+T  + DT+TQ R
Subjt:  VLICANFKTKELIVFDSMVVLHSNSDLEHELRMICTNFPSLLAVDGV-IESNIIALDQWRIRRDATVPQQDEGGDCGIFACKFFEYDITGSNMDTITQDR

Query:  MSYFRKQYAIQIWANHAL
        MS+FR+Q+A+Q+WAN ++
Subjt:  MSYFRKQYAIQIWANHAL

A0A6J1DPE8 uncharacterized protein LOC1110222961.0e-1539.29Show/hide
Query:  VLICANFKTKELIVFDSMVVLHSNSDLEHELRMICTNFPSLLAVDGVIESN-IIALDQWRIRRDATVPQQDEGGDCGIFACKFFEYDITGSNMDTITQDR
        V+IC +F   E++V+DS++ +  ++ LE +L ++    P LL     I     + +  WRI R  + PQQ   GDCGIF  K+FEYD+TG++++T+ Q+ 
Subjt:  VLICANFKTKELIVFDSMVVLHSNSDLEHELRMICTNFPSLLAVDGVIESN-IIALDQWRIRRDATVPQQDEGGDCGIFACKFFEYDITGSNMDTITQDR

Query:  MSYFRKQYAIQI
        MSYFR+Q+A ++
Subjt:  MSYFRKQYAIQI

A0A6J1DQZ3 uncharacterized protein LOC1110234424.1e-2042.86Show/hide
Query:  VLICANFKTKELIVFDSMVVLHSNSDLEHELRMICTNFPSLLAVDGVIESN-IIALDQWRIRRDATVPQQDEGGDCGIFACKFFEYDITGSNMDTITQDR
        V+IC +F   E++V+DS+  + S + LE +L+++ T  PSLL    VI     + +  WRIRR  + P+Q   GDCGIF  K+FEYD+T ++++T+ Q+ 
Subjt:  VLICANFKTKELIVFDSMVVLHSNSDLEHELRMICTNFPSLLAVDGVIESN-IIALDQWRIRRDATVPQQDEGGDCGIFACKFFEYDITGSNMDTITQDR

Query:  MSYFRKQYAIQIWANHALF
        MSYFR+Q+A Q+W+N A++
Subjt:  MSYFRKQYAIQIWANHALF

SwissProt top hitse value%identityAlignment
P59110 Sentrin-specific protease 13.0e-0428.68Show/hide
Query:  FSITDLLVTV-------LICANFKTKELIVFDSMVVLHSNSDLEHELRMICTNFPSLLAVDGVIESNIIALDQWRI--RRDATVPQQDEGGDCGIFACKF
        FS+  LLV +       L   +F+ K +  +DSM        + +E   I   +    +VD   +      + W++  ++   +PQQ  G DCG+FACK+
Subjt:  FSITDLLVTV-------LICANFKTKELIVFDSMVVLHSNSDLEHELRMICTNFPSLLAVDGVIESNIIALDQWRI--RRDATVPQQDEGGDCGIFACKF

Query:  FEYDITGSNMDTITQDRMSYFRKQYAIQI
         +  IT       TQ  M YFRK+   +I
Subjt:  FEYDITGSNMDTITQDRMSYFRKQYAIQI

Q09353 Sentrin-specific protease8.9e-0435.19Show/hide
Query:  WRIRRDATVPQQDEGGDCGIFACKFFEYDITGSNMDTITQDRMSYFRKQYAIQI
        W I++   +P+Q  G DCG+F+C+F E+  +       TQ  M Y+RK+   +I
Subjt:  WRIRRDATVPQQDEGGDCGIFACKFFEYDITGSNMDTITQDRMSYFRKQYAIQI

Q5RBB1 Sentrin-specific protease 14.0e-0427.91Show/hide
Query:  FSITDLLVTV-------LICANFKTKELIVFDSMVVLHSNSDLEHELRMICTNFPSLLAVDGVIESNIIALDQWRI--RRDATVPQQDEGGDCGIFACKF
        FS+  LLV +       L   +F+ K +  +DSM        + +E   I   +    ++D   +      + W++  ++   +PQQ  G DCG+FACK+
Subjt:  FSITDLLVTV-------LICANFKTKELIVFDSMVVLHSNSDLEHELRMICTNFPSLLAVDGVIESNIIALDQWRI--RRDATVPQQDEGGDCGIFACKF

Query:  FEYDITGSNMDTITQDRMSYFRKQYAIQI
         +  IT       TQ  M YFRK+   +I
Subjt:  FEYDITGSNMDTITQDRMSYFRKQYAIQI

Q9P0U3 Sentrin-specific protease 14.0e-0427.91Show/hide
Query:  FSITDLLVTV-------LICANFKTKELIVFDSMVVLHSNSDLEHELRMICTNFPSLLAVDGVIESNIIALDQWRI--RRDATVPQQDEGGDCGIFACKF
        FS+  LLV +       L   +F+ K +  +DSM        + +E   I   +    ++D   +      + W++  ++   +PQQ  G DCG+FACK+
Subjt:  FSITDLLVTV-------LICANFKTKELIVFDSMVVLHSNSDLEHELRMICTNFPSLLAVDGVIESNIIALDQWRI--RRDATVPQQDEGGDCGIFACKF

Query:  FEYDITGSNMDTITQDRMSYFRKQYAIQI
         +  IT       TQ  M YFRK+   +I
Subjt:  FEYDITGSNMDTITQDRMSYFRKQYAIQI

Arabidopsis top hitse value%identityAlignment
AT3G06910.1 UB-like protease 1A9.1e-0434.29Show/hide
Query:  VDGVIESNIIALD--QWRIRRDATVPQQDEGGDCGIFACKFFEYDITGSNMDTITQDRMSYFRKQYAIQI
        VD V + + + LD  +WR      +P Q  G DCG+F  K+ ++   G ++   TQ++M YFR + A +I
Subjt:  VDGVIESNIIALD--QWRIRRDATVPQQDEGGDCGIFACKFFEYDITGSNMDTITQDRMSYFRKQYAIQI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTATATCCGAACTTGTGGCGCCCTTCAAGCACGAGTTGGAGCGAGACTCAGAGCGAGACTTGCGAGATTAGCAGCGAGACTCGGAGCAAGACTGCGATTCGTAGCGA
CATTCAGGAGCGATACTTGGATCGAGACTCAGTCTTGCAGAATCGAATGCGGCTTCACCATTCCTCACTCCGAAAGAGGGCGGCGGCAACTTCTGGGTTGAGTGATCAGA
GGTCGTGGTCGACGGCGGCTTGGATGGTTGACTCGGAGGTAGCGGCGACGGCGGGTTCTGTCAGCAACAGTGGCAGCAGCAGATCTTGCCTAGGGGTATTTTCGTCGACG
ACGGCTAGGGTTTCTAGGGTAAAGACGGACGGTGGCGATCACTTCGACGGGCGACACAGCAGTGAGAAGGTAAGGCCAAGTGAAGCCCTTGGCGATGGTCTTAGACGACC
CTCGATGGCGGATTTAAAAGATAGGCAGCAAATCGAAGGAGAGACACCATTCATCGGCCATATTCTCGCTTGTCCCCGTCACTCAGTCGCAGCGCTCCAGCCCTCACGGT
GTGGCCTTAGTCGCAGTGCTCAACGTTGTGGTATCGTCATAGTTGCCACGCCGCAGCGTCTTGGTCGTCGCACCGCAGCGTTCTGGTCATCATTAGCCTTCGATGCATCA
TCAGTCCGCGTCGCCGCATCATTAGTCTCGACGCATCGTCAGTCCGCGTTGCCGCATCATTATCCGCGTTGCCGCATCATTAGCCCTCGAGCATCATCAGTCCGTGTCGC
CCCGTCACCCGTCCTCTCCGCGACGCATTGTCAGCCTGCATCGTCGCGTCGCTTGTCCTCCCCGCGACGCACTGTCAAAGCCTGCGTAGCGCGTTGCCGCATTGTAAGGA
AGTCGCTACCTTGCGGACACGCTCAGTATCGCTCGATAACGTTGGTAACGCTCAGTATCGCTCGTAACGCTAAGTATCGCTCGTATCGTTCAGTGTCGCTCAGTGTCGAT
CCTTTTCGCTCTCGCAGGTTGCCATGTTGTTGTGCCCCGTACCCTCACTGTACTATGCTACGAAGGGTGTCCCTCGGGATCACCACCTATGTTCACGTTATAATGTTAGT
TAATGCGTACCCTCGGGACGCTTCCTATGCTTATGCCGGTACACCCACGCAAATCTCCAAAAGCATCCATTCTTCTTTTCTTACCGTCTTAGCTTCCCATAGATTCCTTG
CATTTCTTCAACAAAAGGAGGCCCAGTCGGGCTTCATCCTAGGTATTGACCGGGCTTCTGTTGAGGAAGACAAGAGATCACCCGATCTCAACCCCTGCCGGAAAACATGT
GTATCATTGTGTTATACTGTTGCTGGTTCAAGTGGTCATGATGGTTCAAGTGGTCAAAATGACCACGAACATGAAGGTCTAGTAGAGCAAAGAGAACATTTGGATGGTGA
CAATCAGGTCAACATAGTTATGCGCGACTCGATGGTTGCTTTAGACGAAACATCGGGTACACATGATGTCCGGGACGAGGGTAAATCTGTGGATGCGCAAACTCATTTGC
CTAAGAATGATGATGCCCATGATGGTCCAAGTGGTCAAAATGACCACGAACATAAAGGTTTAATAAAGCAAAGAGTACATTTGGATGGTGACAATCAAGTCAACATAGTT
TTGCGCGACTCGATGGTTGCTTTAGACGAAACATTGGGTACACATGATGTCCGAGACGAGAAAGACACAAGAAACGATGGTAAGAGACAAAAGATTAAACACTACAATCC
CATCATCAACATACCAGAAGAAATTGAAGTGAAATTCAATAAGTGGATGATCAACACCGACATGACTACTGCAGTACGAAGGAATAATTATGCTTATTTGGATATAACAT
GGTTTCATAGTCTCCAGACGGCGTATAAATGGTTGAACGACGAGGTGAAAGACACTATATTTATGTTTATTAAGAAAAAGTTGGAAATCTGCCCAAACCTGTGCCGTCGA
GAATTTTCCATAACTGACCTTCTAGTGACGGTGCTTATATGTGCAAACTTCAAGACCAAAGAGTTGATCGTTTTTGACTCGATGGTTGTCCTTCACTCTAACTCTGATTT
GGAACATGAGCTAAGAATGATATGTACAAATTTTCCAAGTCTACTAGCTGTCGATGGAGTTATAGAGTCTAATATCATAGCTCTAGATCAGTGGAGGATCCGACGAGACG
CTACCGTGCCTCAACAAGATGAAGGTGGTGATTGTGGTATATTTGCATGCAAATTTTTTGAATATGATATAACTGGTTCCAACATGGATACCATAACACAGGATAGGATG
AGTTATTTTCGTAAGCAGTATGCTATTCAAATTTGGGCCAATCATGCACTATTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGCTATATCCGAACTTGTGGCGCCCTTCAAGCACGAGTTGGAGCGAGACTCAGAGCGAGACTTGCGAGATTAGCAGCGAGACTCGGAGCAAGACTGCGATTCGTAGCGA
CATTCAGGAGCGATACTTGGATCGAGACTCAGTCTTGCAGAATCGAATGCGGCTTCACCATTCCTCACTCCGAAAGAGGGCGGCGGCAACTTCTGGGTTGAGTGATCAGA
GGTCGTGGTCGACGGCGGCTTGGATGGTTGACTCGGAGGTAGCGGCGACGGCGGGTTCTGTCAGCAACAGTGGCAGCAGCAGATCTTGCCTAGGGGTATTTTCGTCGACG
ACGGCTAGGGTTTCTAGGGTAAAGACGGACGGTGGCGATCACTTCGACGGGCGACACAGCAGTGAGAAGGTAAGGCCAAGTGAAGCCCTTGGCGATGGTCTTAGACGACC
CTCGATGGCGGATTTAAAAGATAGGCAGCAAATCGAAGGAGAGACACCATTCATCGGCCATATTCTCGCTTGTCCCCGTCACTCAGTCGCAGCGCTCCAGCCCTCACGGT
GTGGCCTTAGTCGCAGTGCTCAACGTTGTGGTATCGTCATAGTTGCCACGCCGCAGCGTCTTGGTCGTCGCACCGCAGCGTTCTGGTCATCATTAGCCTTCGATGCATCA
TCAGTCCGCGTCGCCGCATCATTAGTCTCGACGCATCGTCAGTCCGCGTTGCCGCATCATTATCCGCGTTGCCGCATCATTAGCCCTCGAGCATCATCAGTCCGTGTCGC
CCCGTCACCCGTCCTCTCCGCGACGCATTGTCAGCCTGCATCGTCGCGTCGCTTGTCCTCCCCGCGACGCACTGTCAAAGCCTGCGTAGCGCGTTGCCGCATTGTAAGGA
AGTCGCTACCTTGCGGACACGCTCAGTATCGCTCGATAACGTTGGTAACGCTCAGTATCGCTCGTAACGCTAAGTATCGCTCGTATCGTTCAGTGTCGCTCAGTGTCGAT
CCTTTTCGCTCTCGCAGGTTGCCATGTTGTTGTGCCCCGTACCCTCACTGTACTATGCTACGAAGGGTGTCCCTCGGGATCACCACCTATGTTCACGTTATAATGTTAGT
TAATGCGTACCCTCGGGACGCTTCCTATGCTTATGCCGGTACACCCACGCAAATCTCCAAAAGCATCCATTCTTCTTTTCTTACCGTCTTAGCTTCCCATAGATTCCTTG
CATTTCTTCAACAAAAGGAGGCCCAGTCGGGCTTCATCCTAGGTATTGACCGGGCTTCTGTTGAGGAAGACAAGAGATCACCCGATCTCAACCCCTGCCGGAAAACATGT
GTATCATTGTGTTATACTGTTGCTGGTTCAAGTGGTCATGATGGTTCAAGTGGTCAAAATGACCACGAACATGAAGGTCTAGTAGAGCAAAGAGAACATTTGGATGGTGA
CAATCAGGTCAACATAGTTATGCGCGACTCGATGGTTGCTTTAGACGAAACATCGGGTACACATGATGTCCGGGACGAGGGTAAATCTGTGGATGCGCAAACTCATTTGC
CTAAGAATGATGATGCCCATGATGGTCCAAGTGGTCAAAATGACCACGAACATAAAGGTTTAATAAAGCAAAGAGTACATTTGGATGGTGACAATCAAGTCAACATAGTT
TTGCGCGACTCGATGGTTGCTTTAGACGAAACATTGGGTACACATGATGTCCGAGACGAGAAAGACACAAGAAACGATGGTAAGAGACAAAAGATTAAACACTACAATCC
CATCATCAACATACCAGAAGAAATTGAAGTGAAATTCAATAAGTGGATGATCAACACCGACATGACTACTGCAGTACGAAGGAATAATTATGCTTATTTGGATATAACAT
GGTTTCATAGTCTCCAGACGGCGTATAAATGGTTGAACGACGAGGTGAAAGACACTATATTTATGTTTATTAAGAAAAAGTTGGAAATCTGCCCAAACCTGTGCCGTCGA
GAATTTTCCATAACTGACCTTCTAGTGACGGTGCTTATATGTGCAAACTTCAAGACCAAAGAGTTGATCGTTTTTGACTCGATGGTTGTCCTTCACTCTAACTCTGATTT
GGAACATGAGCTAAGAATGATATGTACAAATTTTCCAAGTCTACTAGCTGTCGATGGAGTTATAGAGTCTAATATCATAGCTCTAGATCAGTGGAGGATCCGACGAGACG
CTACCGTGCCTCAACAAGATGAAGGTGGTGATTGTGGTATATTTGCATGCAAATTTTTTGAATATGATATAACTGGTTCCAACATGGATACCATAACACAGGATAGGATG
AGTTATTTTCGTAAGCAGTATGCTATTCAAATTTGGGCCAATCATGCACTATTTTAG
Protein sequenceShow/hide protein sequence
MLYPNLWRPSSTSWSETQSETCEISSETRSKTAIRSDIQERYLDRDSVLQNRMRLHHSSLRKRAAATSGLSDQRSWSTAAWMVDSEVAATAGSVSNSGSSRSCLGVFSST
TARVSRVKTDGGDHFDGRHSSEKVRPSEALGDGLRRPSMADLKDRQQIEGETPFIGHILACPRHSVAALQPSRCGLSRSAQRCGIVIVATPQRLGRRTAAFWSSLAFDAS
SVRVAASLVSTHRQSALPHHYPRCRIISPRASSVRVAPSPVLSATHCQPASSRRLSSPRRTVKACVARCRIVRKSLPCGHAQYRSITLVTLSIARNAKYRSYRSVSLSVD
PFRSRRLPCCCAPYPHCTMLRRVSLGITTYVHVIMLVNAYPRDASYAYAGTPTQISKSIHSSFLTVLASHRFLAFLQQKEAQSGFILGIDRASVEEDKRSPDLNPCRKTC
VSLCYTVAGSSGHDGSSGQNDHEHEGLVEQREHLDGDNQVNIVMRDSMVALDETSGTHDVRDEGKSVDAQTHLPKNDDAHDGPSGQNDHEHKGLIKQRVHLDGDNQVNIV
LRDSMVALDETLGTHDVRDEKDTRNDGKRQKIKHYNPIINIPEEIEVKFNKWMINTDMTTAVRRNNYAYLDITWFHSLQTAYKWLNDEVKDTIFMFIKKKLEICPNLCRR
EFSITDLLVTVLICANFKTKELIVFDSMVVLHSNSDLEHELRMICTNFPSLLAVDGVIESNIIALDQWRIRRDATVPQQDEGGDCGIFACKFFEYDITGSNMDTITQDRM
SYFRKQYAIQIWANHALF