CuGenDBv2

CaUC01G011200 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene ID: CaUC01G011200
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: Ulp1-like peptidase
Genome location: Ciama_Chr01:18014142..18016427
RNA-Seq Expression: CaUC01G011200
Synteny: CaUC01G011200
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domains:
  IPR003653 - Ulp1 protease family, C-terminal catalytic domain
  IPR038765 - Papain-like cysteine peptidase superfamily
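
The genome location field follows a `seqid:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention), the string can be split and the genomic span computed as below. The function name `parse_location` is hypothetical and is not part of any CuGenDB tooling.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string and compute the span length."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
    return seqid, start, end, end - start + 1


print(parse_location("Ciama_Chr01:18014142..18016427"))
```

Under that assumption the span covers the whole gene model on the chromosome, so it can be longer than the CDS listed further down.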


Homology
GenBank top hits (e-value, % identity, alignment)
XP_038874902.1 uncharacterized protein LOC120067405 [Benincasa hispida] (e-value: 3.5e-51; identity: 45.83%)
Query:  MMSWIVDKNTDVDIRHNSYLPLSKKFFQQLTKPSSWVECDTVNIMFRFICDKFHNRPDMCMNKFTVLPTAIMGHLNLRSGIYKQIKNKPTLNAAMVWEYE
        MM WI D N D ++R NS+  L+K+FF++L  P+S VEC                                               NK  L AA+ WE E
Subjt:  MMSWIVDKNTDVDIRHNSYLPLSKKFFQQLTKPSSWVECDTVNIMFRFICDKFHNRPDMCMNKFTVLPTAIMGHLNLRSGIYKQIKNKPTLNAAMVWEYE

Query:  DTYKDYVMGKFDPHNMGWADVDFVYSSVNTGEHWVVVALDMNRGRVFVFDSLPSITSKKKLDYILEPMTLTLPSLLNYCDMKRLKPDLNLGRWRVFRPTS
        DTYKDYV GKFD   + WADVDFVY+++N  +HW+V+A+D+ RG +FVFDSL S T   KL   LE +T+T+ SLL+YCD+ R K DL   RW++ RP +
Subjt:  DTYKDYVMGKFDPHNMGWADVDFVYSSVNTGEHWVVVALDMNRGRVFVFDSLPSITSKKKLDYILEPMTLTLPSLLNYCDMKRLKPDLNLGRWRVFRPTS

Query:  KNIQHGSLDCGIFAVKFLEHLITGADSSIITQEKMIDYRM
         NIQHGSLDCGIF +K L+HL+T  D S+ITQEK+ +YRM
Subjt:  KNIQHGSLDCGIFAVKFLEHLITGADSSIITQEKMIDYRM

XP_038875042.1 uncharacterized protein LOC120067568 [Benincasa hispida] (e-value: 1.1e-49; identity: 59.74%)
Query:  EDTYKDYVMGKFDPHNMGWADVDFVYSSVNTGEHWVVVALDMNRGRVFVFDSLPSITSKKKLDYILEPMTLTLPSLLNYCDMKRLKPDLNLGRWRVFRPT
        ++   DYVMGKFD   + W+DVDFVY++ N  +HW+++A ++NR ++FVFDSLPS+TSKKKL+  LEP+T TLPSLL+YCD+KR KPD+   RW++ +PT
Subjt:  EDTYKDYVMGKFDPHNMGWADVDFVYSSVNTGEHWVVVALDMNRGRVFVFDSLPSITSKKKLDYILEPMTLTLPSLLNYCDMKRLKPDLNLGRWRVFRPT

Query:  SKNIQHGSLDCGIFAVKFLEHLITGADSSIITQEKMIDYRMQLACQLWANMPFF
        S N Q+G LDCGIF VK LEHL+TG   S ITQ+K+ DYRM+LACQLW N P++
Subjt:  SKNIQHGSLDCGIFAVKFLEHLITGADSSIITQEKMIDYRMQLACQLWANMPFF

XP_038899753.1 uncharacterized protein LOC120086987 [Benincasa hispida] (e-value: 1.4e-60; identity: 52.71%)
Query:  MMSWIVDKNTDVDIRHNSYLPLSKKFFQQLTKPSSWVECDTVNIMFRFICDKFHNRPDMCMNKFTVLPTAIMGHLNLRSGIYKQIKNKPTLNAAMVWEYE
        M+ WI D NTD ++R NS+ PL+K+FF++L   SSWV+CD + + F F+ +K + +PDMCM+KFT+L   +  HLN   G++++IKNK  L AA+ WE E
Subjt:  MMSWIVDKNTDVDIRHNSYLPLSKKFFQQLTKPSSWVECDTVNIMFRFICDKFHNRPDMCMNKFTVLPTAIMGHLNLRSGIYKQIKNKPTLNAAMVWEYE

Query:  DTYKDYVMGKFDPHNMGWADVDFVYSSVNTGEHWVVVALDMNRGRVFVFDSLPSITSKKKLDYILEPMTLTLPSLLNYCDMKRLKPDLNLGRWRVFRPTS
        DTYK+YV+GKFD   + WADVDF+Y+++N  EHW+V+A+D+NRG +FVFDSLPS T   KL   LEP+T+T+PSLL+YCD+ R K DL+  RW+  RP +
Subjt:  DTYKDYVMGKFDPHNMGWADVDFVYSSVNTGEHWVVVALDMNRGRVFVFDSLPSITSKKKLDYILEPMTLTLPSLLNYCDMKRLKPDLNLGRWRVFRPTS

Query:  KNI
         +I
Subjt:  KNI

XP_038902498.1 uncharacterized protein LOC120089158 [Benincasa hispida] (e-value: 2.8e-72; identity: 44.17%)
Query:  PTPPLPPPRSPNPLMLSPPPLLLSSPPRSPDPLAPDTTTSSGPLPLPT-LETTTTPPALPLPPSDTTTTSIDLELQTLEPLPLASQDSLVTCKSARPKHK
        P PP PPPRSP P            PP+SP          + PLP P  L  TT+   L L                         DS V  +S +   K
Subjt:  PTPPLPPPRSPNPLMLSPPPLLLSSPPRSPDPLAPDTTTSSGPLPLPT-LETTTTPPALPLPPSDTTTTSIDLELQTLEPLPLASQDSLVTCKSARPKHK

Query:  RKGIES-TPNKEPKKRAKEEKNSASSTQPKVEQPVEPKTKQTLKYPLPGVSTPIFRARVTYNLAHDIPSNIFSEMMSWIVDKNTDVDIRHNSYLPLSKKF
        RK ++  TP  EP K+  +EK      + +V+ P++   +  +KY LPGV    F+A + Y+L H++P     EM++WI D+NT+ +++ N+  PLSK F
Subjt:  RKGIES-TPNKEPKKRAKEEKNSASSTQPKVEQPVEPKTKQTLKYPLPGVSTPIFRARVTYNLAHDIPSNIFSEMMSWIVDKNTDVDIRHNSYLPLSKKF

Query:  FQQLTKPSSWVECDTVNIMFRFICDKFHNRPDMCMNKFTVLPTAIMGHLNLRSGIYKQIKNKPTLNAAMVWEYEDTYKDYVMGKFDPHNMGWADVDFVYS
        FQ+LT+PSSWVECDT+N++F F+ +KF+ +PDMCM +FT+LP  I  HLN R G+YK +KNK TL AA+ W  ++T  DYVMGKFD   + W DVDF+Y+
Subjt:  FQQLTKPSSWVECDTVNIMFRFICDKFHNRPDMCMNKFTVLPTAIMGHLNLRSGIYKQIKNKPTLNAAMVWEYEDTYKDYVMGKFDPHNMGWADVDFVYS

Query:  SVNTGEHWVVVALDMNRGRVFVFDSLPSITSKKKLDYILEPMTLTLPSLLNYCDMKRLKPDLNLGRWRV
        + N  +HW++VA D+NRGR+FVFDSLPS+TSKKKL+  LE +T TLPSLL+YCD+K  KPD+   +W++
Subjt:  SVNTGEHWVVVALDMNRGRVFVFDSLPSITSKKKLDYILEPMTLTLPSLLNYCDMKRLKPDLNLGRWRV

XP_038906451.1 uncharacterized protein LOC120092435 [Benincasa hispida] (e-value: 2.1e-43; identity: 48.19%)
Query:  LKYPLPGVSTPIFRARVTYNLAHDIPSNIFSEMMSWIVDKNTDVDIRHNSYLPLSKKFFQQLTKPSSWVECDTVNIMFRFICDKFHNRPDMCMNKFTVLP
        +K+PL GV+T + RA V YN+AH++P NIF++MM WI  +NTD  +R NS+ PL+K+FF++L  PSSWVECD+                           
Subjt:  LKYPLPGVSTPIFRARVTYNLAHDIPSNIFSEMMSWIVDKNTDVDIRHNSYLPLSKKFFQQLTKPSSWVECDTVNIMFRFICDKFHNRPDMCMNKFTVLP

Query:  TAIMGHLNLRSGIYKQIKNKPTLNAAMVWEYEDTYKDYVMGKFDPHNMGWADVDFVYSSVNTGEHWVVVALDMNRGRVFVFDSLPSITSKKKL
             HLN R  ++K+IKNK  L AA+ WE ED YKD+V GK D   + WA VDFVY  +N  EHW+V+A+DMN G +FVFDSLPS T  KKL
Subjt:  TAIMGHLNLRSGIYKQIKNKPTLNAAMVWEYEDTYKDYVMGKFDPHNMGWADVDFVYSSVNTGEHWVVVALDMNRGRVFVFDSLPSITSKKKL
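
In BLAST-style pairwise output, the unlabeled middle line of each block shows a letter where the two residues are identical and a `+` where the substitution is scored as conservative. As a rough sketch, assuming the blocks above follow that convention, the helper below counts both for a single block. The name `midline_counts` is hypothetical, and the result for one block will not necessarily reproduce the % identity reported for a whole hit, which is computed over the full alignment.

```python
def midline_counts(midline):
    """Count identities and positives in one BLAST-style midline string."""
    # Letters on the midline mark identical residues.
    identities = sum(1 for ch in midline if ch.isalpha())
    # '+' marks a conservative substitution; positives include identities.
    conservative = midline.count("+")
    return identities, identities + conservative


# Short final block of the XP_038899753.1 alignment (query "KNI").
query = "KNI"
midline = " +I"
identities, positives = midline_counts(midline)
print(identities, positives, len(query))
```

The leading space of the midline matters here, because each midline character corresponds to one alignment column.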

TrEMBL top hits (e-value, % identity, alignment)
A0A5A7TN23 Ulp1-like peptidase (e-value: 1.2e-31; identity: 33.09%)
Query:  VTYNLAHDIPSNIFSEMMSWIVDKNTDVDIRHNSYLPLSKKFFQQLTKPSSWVECDTVNIMFRFICDKFHNRPDMCMNKFTVLPTAIMGHLNLRSGIYKQ
        + Y+  H I       + +WI DK TD ++R   +   SK FF+ L     W+  + ++ +F FIC K           FT + T  M     RSG Y  
Subjt:  VTYNLAHDIPSNIFSEMMSWIVDKNTDVDIRHNSYLPLSKKFFQQLTKPSSWVECDTVNIMFRFICDKFHNRPDMCMNKFTVLPTAIMGHLNLRSGIYKQ

Query:  IKNKPTLNAAMVWEYEDTYKDYVMGKFDPHNMGWADVDFVYSSVNT-GEHWVVVALDMNRGRVFVFDSLPSITSKKKLDYILEPMTLTLPSLLNYCDM--
         KN    N    W  E    DYV+G  +     WA VD+VYS  N  G HWV++ LD+   +V V+DSLPS+++ +++  IL P+   +P LL+      
Subjt:  IKNKPTLNAAMVWEYEDTYKDYVMGKFDPHNMGWADVDFVYSSVNT-GEHWVVVALDMNRGRVFVFDSLPSITSKKKLDYILEPMTLTLPSLLNYCDM--

Query:  KRLKPDLNLGRWRVFRPTSKNIQHGSLDCGIFAVKFLEHLITGADSSIITQEKMIDYRMQLACQLWANMPFF
        +R +       W V    S  +Q  + DCGIF +K+ E++  G     + QE M  +R QLA QLW N P +
Subjt:  KRLKPDLNLGRWRVFRPTSKNIQHGSLDCGIFAVKFLEHLITGADSSIITQEKMIDYRMQLACQLWANMPFF

A0A5A7TVI1 Ulp1-like peptidase (e-value: 2.6e-31; identity: 31.87%)
Query:  VTYNLAHDIPSNIFSEMMSWIVDKNTDVDIRHNSYLPLSKKFFQQLTKPSSWVECDTVNIMFRFICDKFHNRPDMCMNKFTVLPTAIMGHLNLRSGIYKQ
        + Y+  H I       + +WI DK TD ++R   +   SK FF+ L     W+  + ++ +F FIC K           FT   T  M  L  +  +YK+
Subjt:  VTYNLAHDIPSNIFSEMMSWIVDKNTDVDIRHNSYLPLSKKFFQQLTKPSSWVECDTVNIMFRFICDKFHNRPDMCMNKFTVLPTAIMGHLNLRSGIYKQ

Query:  IKNKPTLNAAMVWEYEDTYKDYVMGKFDPHNMGWADVDFVYSSVNT-GEHWVVVALDMNRGRVFVFDSLPSITSKKKLDYILEPMTLTLPSLL---NYCD
           +   N    W+ E    DYV+G  +     WA VD+VYS  N  G HWV++ LD+   +V V+DSLPS+T+ +++  IL P+   +P LL    + D
Subjt:  IKNKPTLNAAMVWEYEDTYKDYVMGKFDPHNMGWADVDFVYSSVNT-GEHWVVVALDMNRGRVFVFDSLPSITSKKKLDYILEPMTLTLPSLL---NYCD

Query:  MKRLKPDLNLGRWRVFRPTSKNIQHGSLDCGIFAVKFLEHLITGADSSIITQEKMIDYRMQLACQLWANMPFF
         +R +       W V    S   Q  + DCG+F +K+ E+++ G     + QE M  +R Q A QLW N P +
Subjt:  MKRLKPDLNLGRWRVFRPTSKNIQHGSLDCGIFAVKFLEHLITGADSSIITQEKMIDYRMQLACQLWANMPFF

A0A5D3DIC1 Ulp1-like peptidase (e-value: 4.4e-31; identity: 31.99%)
Query:  VTYNLAHDIPSNIFSEMMSWIVDKNTDVDIRHNSYLPLSKKFFQQLTKPSSWVECDTVNIMFRFICDKFHNRPDMCMNKFTVLPTAIMGHLNLRSGIYKQ
        + Y+  H I       + +WI DK TD ++R   +   SK FF+ L     W+  + ++ +F FI  K           FT   T  M  L  +  +YK+
Subjt:  VTYNLAHDIPSNIFSEMMSWIVDKNTDVDIRHNSYLPLSKKFFQQLTKPSSWVECDTVNIMFRFICDKFHNRPDMCMNKFTVLPTAIMGHLNLRSGIYKQ

Query:  IKNKPTLNAAMVWEYEDTYKDYVMGKFDPHNMGWADVDFVYSSVNT-GEHWVVVALDMNRGRVFVFDSLPSITSKKKLDYILEPMTLTLPSLLNYCDM--
           +   N    W+ E    DYV+G        WA VD+VYS  N  G HWV++ LD+   +V V+DSLPS+T+ +++  IL P+   +P LL+      
Subjt:  IKNKPTLNAAMVWEYEDTYKDYVMGKFDPHNMGWADVDFVYSSVNT-GEHWVVVALDMNRGRVFVFDSLPSITSKKKLDYILEPMTLTLPSLLNYCDM--

Query:  KRLKPDLNLGRWRVFRPTSKNIQHGSLDCGIFAVKFLEHLITGADSSIITQEKMIDYRMQLACQLWANMPFF
        +R +       W V    S  +Q  + DCG+FA+K+ E++  G     + QE M  +R QLA QLW N P +
Subjt:  KRLKPDLNLGRWRVFRPTSKNIQHGSLDCGIFAVKFLEHLITGADSSIITQEKMIDYRMQLACQLWANMPFF

A0A5D3DU78 Ulp1-like peptidase (e-value: 1.2e-31; identity: 31.99%)
Query:  VTYNLAHDIPSNIFSEMMSWIVDKNTDVDIRHNSYLPLSKKFFQQLTKPSSWVECDTVNIMFRFICDKFHNRPDMCMNKFTVLPTAIMGHLNLRSGIYKQ
        + Y+  H I       + +WI DK TD ++R   +   SK FF+ L     W+  + ++ +F FI  K           FT   T  M  L  +  +YK+
Subjt:  VTYNLAHDIPSNIFSEMMSWIVDKNTDVDIRHNSYLPLSKKFFQQLTKPSSWVECDTVNIMFRFICDKFHNRPDMCMNKFTVLPTAIMGHLNLRSGIYKQ

Query:  IKNKPTLNAAMVWEYEDTYKDYVMGKFDPHNMGWADVDFVYSSVNT-GEHWVVVALDMNRGRVFVFDSLPSITSKKKLDYILEPMTLTLPSLLNYCDM--
           +   N    W+ E    DYV+G  +     WA VD+VYS  N  G HWV++ LD+   +V V+DSLPS+T+ +++  IL P+   +P LL+      
Subjt:  IKNKPTLNAAMVWEYEDTYKDYVMGKFDPHNMGWADVDFVYSSVNT-GEHWVVVALDMNRGRVFVFDSLPSITSKKKLDYILEPMTLTLPSLLNYCDM--

Query:  KRLKPDLNLGRWRVFRPTSKNIQHGSLDCGIFAVKFLEHLITGADSSIITQEKMIDYRMQLACQLWANMPFF
        +R +       W V    S  +Q  + DCG+FA+K+ E++  G     + QE M  +R QLA QLW N P +
Subjt:  KRLKPDLNLGRWRVFRPTSKNIQHGSLDCGIFAVKFLEHLITGADSSIITQEKMIDYRMQLACQLWANMPFF

A0A5D3DYH3 Ulp1-like peptidase (e-value: 4.4e-31; identity: 31.99%)
Query:  VTYNLAHDIPSNIFSEMMSWIVDKNTDVDIRHNSYLPLSKKFFQQLTKPSSWVECDTVNIMFRFICDKFHNRPDMCMNKFTVLPTAIMGHLNLRSGIYKQ
        + Y+  H I       + +WI DK TD ++R   +   SK FF+ L     W+  + ++ +F FI  K           FT   T  M  L  +  +YK+
Subjt:  VTYNLAHDIPSNIFSEMMSWIVDKNTDVDIRHNSYLPLSKKFFQQLTKPSSWVECDTVNIMFRFICDKFHNRPDMCMNKFTVLPTAIMGHLNLRSGIYKQ

Query:  IKNKPTLNAAMVWEYEDTYKDYVMGKFDPHNMGWADVDFVYSSVNT-GEHWVVVALDMNRGRVFVFDSLPSITSKKKLDYILEPMTLTLPSLLNYCDM--
           +   N    W+ E    DYV+G        WA VD+VYS  N  G HWV++ LD+   +V V+DSLPS+T+ +++  IL P+   +P LL+      
Subjt:  IKNKPTLNAAMVWEYEDTYKDYVMGKFDPHNMGWADVDFVYSSVNT-GEHWVVVALDMNRGRVFVFDSLPSITSKKKLDYILEPMTLTLPSLLNYCDM--

Query:  KRLKPDLNLGRWRVFRPTSKNIQHGSLDCGIFAVKFLEHLITGADSSIITQEKMIDYRMQLACQLWANMPFF
        +R +       W V    S  +Q  + DCG+FA+K+ E++  G     + QE M  +R QLA QLW N P +
Subjt:  KRLKPDLNLGRWRVFRPTSKNIQHGSLDCGIFAVKFLEHLITGADSSIITQEKMIDYRMQLACQLWANMPFF

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGTCTGATACTGAGAAGGAATTTCGAGACACTCCAGTTGATAGACGACCCGTTATTAGTCTACACGAGGATGGGGATAGTTCGCAAAGCGCAGATAGTACCCAT
GGTTCAAATAGTTCGGATGGTGGGGAAGATGGCGATGAGGGCGATGAGCATAACAAGACTAGACGAAATGGGGATGATGGTGATGACCATAACGGGATTGGAGGC
AACGAGTCTGAGGATGATCATTGTCATGATCGATTTCATGAGACGATGATGGGCCACACTGAAGAGGAAAATGAGGTGAATCCATCTATGCCGTTGCCCACCGGG
AGGGAGGAGGAGCGCCCTGGGCCCTCAAACTTTGCCCAACGGGAGGGAGCCAATGCTGCCCCAAGTATTTATTCTTACTTGCAAAGAATTGACAGCTCTATGTCG
AGGATGGATGGTCGAGTTTTAAGGCTTGAGGAAGATATGGGTTATGTCAAAGCCCAACTGTCGACCATTATGTCATTGTTACAGACGTTGTGCAAGGATCTCACC
ATGGTAGCTGAGGTGAGGTCACCCACCCCACCCCTTGAGCCGGATTCTAGGAGGTCACCCACCCAACCCCGGGATTCGGATTCCAGGAGGTCACCCACACCACAT
CGGGATCCAGATTCCAGGAGGTCACCCACACCATACCATGATCAAGATTCTAGGAGGTCACCCACCCCATCCCAGGATCCAAATTCTAGGAGGTCACCCACTCCA
CCTCTACCTCCACCTCGATCTCCTAATCCCCTGATGTTGTCACCTCCCCCACTTCTACTTTCATCTCCACCTCGATCTCCCGATCCCCTAGCTCCTGATACCACT
ACCTCATCGGGTCCACTTCCACTCCCGACTCTTGAAACCACTACCACGCCTCCCGCACTTCCACTCCCACCTTCGGATACCACTACCACATCTATTGATCTGGAG
TTACAGACATTGGAGCCACTTCCCCTCGCATCTCAAGATTCATTGGTAACTTGCAAGTCTGCACGACCAAAACATAAGAGAAAGGGTATTGAGAGTACTCCCAAT
AAAGAACCGAAGAAGAGGGCCAAAGAGGAAAAAAACTCGGCCTCATCAACACAACCGAAGGTAGAGCAACCTGTAGAGCCAAAGACGAAGCAAACGCTGAAGTAT
CCTCTTCCTGGAGTTTCTACGCCCATCTTCCGAGCGAGGGTCACATACAATTTAGCACATGACATTCCATCCAACATCTTTTCAGAGATGATGTCCTGGATTGTT
GATAAGAACACAGATGTGGACATTCGACATAATTCATACCTGCCGTTGTCGAAGAAATTTTTTCAGCAACTAACAAAACCGAGTTCCTGGGTTGAGTGTGATACT
GTGAACATCATGTTCCGATTTATTTGCGACAAGTTTCACAACCGACCAGACATGTGCATGAACAAGTTCACTGTCCTCCCAACAGCAATAATGGGTCACCTCAAT
CTTCGATCTGGGATATACAAACAGATCAAGAATAAGCCAACATTGAACGCAGCGATGGTATGGGAGTACGAAGACACATACAAGGACTACGTTATGGGGAAATTC
GACCCTCACAACATGGGGTGGGCAGATGTAGACTTCGTGTACAGCTCAGTTAACACCGGTGAACATTGGGTGGTGGTTGCATTAGATATGAACAGAGGTCGAGTG
TTCGTGTTTGACTCACTCCCATCCATAACATCCAAGAAGAAGTTGGATTACATACTAGAGCCCATGACGTTGACCCTACCATCTCTTTTGAATTACTGTGACATG
AAACGCCTCAAGCCGGATTTAAACTTAGGTCGATGGCGCGTCTTTCGACCGACAAGCAAAAACATACAGCATGGGTCACTAGACTGTGGTATCTTTGCAGTGAAG
TTTTTAGAGCATTTAATTACTGGTGCTGATAGTAGTATAATAACTCAAGAAAAGATGATTGACTATAGGATGCAGTTAGCATGTCAGTTGTGGGCAAACATGCCT
TTTTTCTAA
mRNA sequence
ATGTCTGATACTGAGAAGGAATTTCGAGACACTCCAGTTGATAGACGACCCGTTATTAGTCTACACGAGGATGGGGATAGTTCGCAAAGCGCAGATAGTACCCAT
GGTTCAAATAGTTCGGATGGTGGGGAAGATGGCGATGAGGGCGATGAGCATAACAAGACTAGACGAAATGGGGATGATGGTGATGACCATAACGGGATTGGAGGC
AACGAGTCTGAGGATGATCATTGTCATGATCGATTTCATGAGACGATGATGGGCCACACTGAAGAGGAAAATGAGGTGAATCCATCTATGCCGTTGCCCACCGGG
AGGGAGGAGGAGCGCCCTGGGCCCTCAAACTTTGCCCAACGGGAGGGAGCCAATGCTGCCCCAAGTATTTATTCTTACTTGCAAAGAATTGACAGCTCTATGTCG
AGGATGGATGGTCGAGTTTTAAGGCTTGAGGAAGATATGGGTTATGTCAAAGCCCAACTGTCGACCATTATGTCATTGTTACAGACGTTGTGCAAGGATCTCACC
ATGGTAGCTGAGGTGAGGTCACCCACCCCACCCCTTGAGCCGGATTCTAGGAGGTCACCCACCCAACCCCGGGATTCGGATTCCAGGAGGTCACCCACACCACAT
CGGGATCCAGATTCCAGGAGGTCACCCACACCATACCATGATCAAGATTCTAGGAGGTCACCCACCCCATCCCAGGATCCAAATTCTAGGAGGTCACCCACTCCA
CCTCTACCTCCACCTCGATCTCCTAATCCCCTGATGTTGTCACCTCCCCCACTTCTACTTTCATCTCCACCTCGATCTCCCGATCCCCTAGCTCCTGATACCACT
ACCTCATCGGGTCCACTTCCACTCCCGACTCTTGAAACCACTACCACGCCTCCCGCACTTCCACTCCCACCTTCGGATACCACTACCACATCTATTGATCTGGAG
TTACAGACATTGGAGCCACTTCCCCTCGCATCTCAAGATTCATTGGTAACTTGCAAGTCTGCACGACCAAAACATAAGAGAAAGGGTATTGAGAGTACTCCCAAT
AAAGAACCGAAGAAGAGGGCCAAAGAGGAAAAAAACTCGGCCTCATCAACACAACCGAAGGTAGAGCAACCTGTAGAGCCAAAGACGAAGCAAACGCTGAAGTAT
CCTCTTCCTGGAGTTTCTACGCCCATCTTCCGAGCGAGGGTCACATACAATTTAGCACATGACATTCCATCCAACATCTTTTCAGAGATGATGTCCTGGATTGTT
GATAAGAACACAGATGTGGACATTCGACATAATTCATACCTGCCGTTGTCGAAGAAATTTTTTCAGCAACTAACAAAACCGAGTTCCTGGGTTGAGTGTGATACT
GTGAACATCATGTTCCGATTTATTTGCGACAAGTTTCACAACCGACCAGACATGTGCATGAACAAGTTCACTGTCCTCCCAACAGCAATAATGGGTCACCTCAAT
CTTCGATCTGGGATATACAAACAGATCAAGAATAAGCCAACATTGAACGCAGCGATGGTATGGGAGTACGAAGACACATACAAGGACTACGTTATGGGGAAATTC
GACCCTCACAACATGGGGTGGGCAGATGTAGACTTCGTGTACAGCTCAGTTAACACCGGTGAACATTGGGTGGTGGTTGCATTAGATATGAACAGAGGTCGAGTG
TTCGTGTTTGACTCACTCCCATCCATAACATCCAAGAAGAAGTTGGATTACATACTAGAGCCCATGACGTTGACCCTACCATCTCTTTTGAATTACTGTGACATG
AAACGCCTCAAGCCGGATTTAAACTTAGGTCGATGGCGCGTCTTTCGACCGACAAGCAAAAACATACAGCATGGGTCACTAGACTGTGGTATCTTTGCAGTGAAG
TTTTTAGAGCATTTAATTACTGGTGCTGATAGTAGTATAATAACTCAAGAAAAGATGATTGACTATAGGATGCAGTTAGCATGTCAGTTGTGGGCAAACATGCCT
TTTTTCTAA
Protein sequence
MSDTEKEFRDTPVDRRPVISLHEDGDSSQSADSTHGSNSSDGGEDGDEGDEHNKTRRNGDDGDDHNGIGGNESEDDHCHDRFHETMMGHTEEENEVNPSMPLPTG
REEERPGPSNFAQREGANAAPSIYSYLQRIDSSMSRMDGRVLRLEEDMGYVKAQLSTIMSLLQTLCKDLTMVAEVRSPTPPLEPDSRRSPTQPRDSDSRRSPTPH
RDPDSRRSPTPYHDQDSRRSPTPSQDPNSRRSPTPPLPPPRSPNPLMLSPPPLLLSSPPRSPDPLAPDTTTSSGPLPLPTLETTTTPPALPLPPSDTTTTSIDLE
LQTLEPLPLASQDSLVTCKSARPKHKRKGIESTPNKEPKKRAKEEKNSASSTQPKVEQPVEPKTKQTLKYPLPGVSTPIFRARVTYNLAHDIPSNIFSEMMSWIV
DKNTDVDIRHNSYLPLSKKFFQQLTKPSSWVECDTVNIMFRFICDKFHNRPDMCMNKFTVLPTAIMGHLNLRSGIYKQIKNKPTLNAAMVWEYEDTYKDYVMGKF
DPHNMGWADVDFVYSSVNTGEHWVVVALDMNRGRVFVFDSLPSITSKKKLDYILEPMTLTLPSLLNYCDMKRLKPDLNLGRWRVFRPTSKNIQHGSLDCGIFAVK
FLEHLITGADSSIITQEKMIDYRMQLACQLWANMPFF
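
The protein sequence above should correspond to a translation of the CDS. A minimal sketch of that check follows, assuming the standard nuclear genetic code (NCBI translation table 1), which this record does not state explicitly. The function name `translate` is illustrative only, and the input used here is just the first 30 nucleotides of the CDS.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        # A codon with characters other than A, C, G, T raises KeyError.
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGTCTGATACTGAGAAGGAATTTCGAGAC"))
```

To check the whole record, the CDS lines can be joined into one string, passed to `translate`, and the result compared with the protein lines joined the same way.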