; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g31440 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g31440
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUlp1-like peptidase
Genome locationchr5:23599568..23603249
RNA-Seq ExpressionMoc05g31440
SyntenyMoc05g31440
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR015410 - Domain of unknown function DUF1985
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022154364.1 uncharacterized protein LOC111021646 [Momordica charantia]8.3e-6856.31Show/hide
Query:  MFTRNKLEQRHDLCSRRFTTGDI---NFLRRTDGLYQRMTAPNAVPARVATEYDWAGKCKTILSYMDGTHTDYQTRWLDLDAIYLPYNIGGNHWIMLHID
        MF  NKL+ R +LC R+FTTGD+   NFLR TDG+Y  M +PN + +RVA++YDW G+  ++LSY+DGTH+D  TRW+D+DA+YLPYNIGG HWI++ ID
Subjt:  MFTRNKLEQRHDLCSRRFTTGDI---NFLRRTDGLYQRMTAPNAVPARVATEYDWAGKCKTILSYMDGTHTDYQTRWLDLDAIYLPYNIGGNHWIMLHID

Query:  LQEGEIIVWDSIRSMTPFPALESELRPMTVVLPTFMHRAGVQILRPTLPNTPWRIRQVTSAPQQSESGDCGMFCVKYFEYDVTGSNMTSLTQDNISFFRE
          EGE+IVWDS  +MTP P LE EL+PM  ++PT + R GV + +P +P TPWRIR+V+SAPQQ   GDCG+FC+ +FEYDVT  +  +LTQ  +SFFR 
Subjt:  LQEGEIIVWDSIRSMTPFPALESELRPMTVVLPTFMHRAGVQILRPTLPNTPWRIRQVTSAPQQSESGDCGMFCVKYFEYDVTGSNMTSLTQDNISFFRE

Query:  KLAIEM
        + A+++
Subjt:  KLAIEM

XP_022154561.1 uncharacterized protein LOC111021802 [Momordica charantia]7.0e-8351.52Show/hide
Query:  LGEIKDSTPDTISFNLFGSKVSFGRREFDIISGLKYDRGLVRKDTSPHRLRALYFNDSNEVLLSEFEKIYLAARFEDDFDAVKVSIVYLVELVLLGRERT
        L E+++STP+TISFNLF  ++SF R +F +ISGLKY R  VR++T PHRL  LYFND  +++LS+FEK+Y AARFEDD+D VKV IVY+V + LLGRER 
Subjt:  LGEIKDSTPDTISFNLFGSKVSFGRREFDIISGLKYDRGLVRKDTSPHRLRALYFNDSNEVLLSEFEKIYLAARFEDDFDAVKVSIVYLVELVLLGRERT

Query:  LKYDYTLLGIVDDWETCCNHDWGMLSFDKTIYSLKRGPTKRSKDGGFRKSYSLYGFPWAFQVNS-----------------DVVPRILRWRCGHSTAWHV
        +K+D+TLLGIVDDWE CCN++W  LSF+KTI SL+RGP K SKDG  RKSYSLYGFPW FQV +                 D VP I +WR  HSTAWHV
Subjt:  LKYDYTLLGIVDDWETCCNHDWGMLSFDKTIYSLKRGPTKRSKDGGFRKSYSLYGFPWAFQVNS-----------------DVVPRILRWRCGHSTAWHV

Query:  LDREIFRSSTGRTRSIEATDAETTFLDRTFEPPEPEDEDEVSRENNATEPSSACAGPEKDDGRQGGMSTKVLEK----------------TSMKRLRRKR
        LDR+IF S+ GRTR+++ TD ET+FL+R+F+PP  +D+D +    +   PS+   G + DD  +G    +++EK                 S  RL+R  
Subjt:  LDREIFRSSTGRTRSIEATDAETTFLDRTFEPPEPEDEDEVSRENNATEPSSACAGPEKDDGRQGGMSTKVLEK----------------TSMKRLRRKR

Query:  RMDVRMDKRMDDGFEGIKTELKYIRKFL
        +    MDKRMD+    I+ ELK I+KFL
Subjt:  RMDVRMDKRMDDGFEGIKTELKYIRKFL

XP_022156465.1 uncharacterized protein LOC111023353 [Momordica charantia]3.2e-6758.06Show/hide
Query:  LGEIKDSTPDTISFNLFGSKVSFGRREFDIISGLKYDRGLVRKDTSPHRLRALYFNDSNEVLLSEFEKIYLAARFEDDFDAVKVSIVYLVELVLLGRERT
        L E++DSTP+TISFNLFG +VSFGRREFD+ISGL YDR  VRK T  H+LR LYFND    +LS+F K+Y+AA F+DDFD +KVSI+Y+VELVLLGRE T
Subjt:  LGEIKDSTPDTISFNLFGSKVSFGRREFDIISGLKYDRGLVRKDTSPHRLRALYFNDSNEVLLSEFEKIYLAARFEDDFDAVKVSIVYLVELVLLGRERT

Query:  LKYDYTLLGIVDDWETCCNHDWGMLSFDKTIYSLKRGPTKRSKDGGFRKSYSLYGFPWAFQVNSDVVPRILRWRCGHSTAWHVLDREIFRSSTGRTRSIE
        +K+D  LLG+VDDWE CCNHD   LSFDKTI SL RGPT  +KD G RKSYSLYGFPW FQV                  W         +   RTR +E
Subjt:  LKYDYTLLGIVDDWETCCNHDWGMLSFDKTIYSLKRGPTKRSKDGGFRKSYSLYGFPWAFQVNSDVVPRILRWRCGHSTAWHVLDREIFRSSTGRTRSIE

Query:  ATDAETTFLDRTFEPPEPEDEDEVSRENNATEPSSACAGPEKDDGRQG
        ATDAET F+ RTFEPPEPED+D   R+ +A  PS+   G +  D  +G
Subjt:  ATDAETTFLDRTFEPPEPEDEDEVSRENNATEPSSACAGPEKDDGRQG

XP_022158744.1 uncharacterized protein LOC111025209 [Momordica charantia]4.6e-6667.18Show/hide
Query:  LFGSKVSFGRREFDIISGLKYDRGLVRKDTSPHRLRALYFNDSNEVLLSEFEKIYLAARFEDDFDAVKVSIVYLVELVLLGRERTLKYDYTLLGIVDDWE
        L G+KVSFGRREFDIISGLKY R  VRK T P R   LYFN+S ++LLSE EK+Y + RFEDD DAVKV +VY VELVLLGRER+ K+D+ LLGIVDDWE
Subjt:  LFGSKVSFGRREFDIISGLKYDRGLVRKDTSPHRLRALYFNDSNEVLLSEFEKIYLAARFEDDFDAVKVSIVYLVELVLLGRERTLKYDYTLLGIVDDWE

Query:  TCCNHDWGMLSFDKTIYSLKRGPTKRSKDGGFRKSYSLYGFPWAFQ-----------------VNSDVVPRILRWRCGHSTAWHVLDREIFRSST
         CCNHDW +LSFDKTIYSL+RG + +SK+GG RKSYSLYGFPWAFQ                 V+ DVVPRIL+WR  HSTA+H+L REIFRSST
Subjt:  TCCNHDWGMLSFDKTIYSLKRGPTKRSKDGGFRKSYSLYGFPWAFQ-----------------VNSDVVPRILRWRCGHSTAWHVLDREIFRSST

XP_022159253.1 uncharacterized protein LOC111025666 [Momordica charantia]1.5e-7287.34Show/hide
Query:  LGEIKD-STPDTISFNLFGSKVSFGRREFDIISGLKYDRGLVRKDTSPHRLRALYFNDSNEVLLSEFEKIYLAARFEDDFDAVKVSIVYLVELVLLGRER
        L EI+D STP+TISFNLFGSKV F RREFDIISGLKYDR  VRKDTSPHRLRALYFNDSN++LLS+FEKIY+  RFEDDFDA K+SIVYL+ELVLLGRER
Subjt:  LGEIKD-STPDTISFNLFGSKVSFGRREFDIISGLKYDRGLVRKDTSPHRLRALYFNDSNEVLLSEFEKIYLAARFEDDFDAVKVSIVYLVELVLLGRER

Query:  TLKYDYTLLGIVDDWETCCNHDWGMLSFDKTIYSLKRGPTKRSKDGGFRKSYSLYGFP
        TLKYDYTLLGIVDD ETCCNHDWGM+SFDKTIYSLKRGPTKRSKDGGFRK YSLYGFP
Subjt:  TLKYDYTLLGIVDDWETCCNHDWGMLSFDKTIYSLKRGPTKRSKDGGFRKSYSLYGFP

TrEMBL top hitse value%identityAlignment
A0A6J1DLV0 uncharacterized protein LOC1110216464.0e-6856.31Show/hide
Query:  MFTRNKLEQRHDLCSRRFTTGDI---NFLRRTDGLYQRMTAPNAVPARVATEYDWAGKCKTILSYMDGTHTDYQTRWLDLDAIYLPYNIGGNHWIMLHID
        MF  NKL+ R +LC R+FTTGD+   NFLR TDG+Y  M +PN + +RVA++YDW G+  ++LSY+DGTH+D  TRW+D+DA+YLPYNIGG HWI++ ID
Subjt:  MFTRNKLEQRHDLCSRRFTTGDI---NFLRRTDGLYQRMTAPNAVPARVATEYDWAGKCKTILSYMDGTHTDYQTRWLDLDAIYLPYNIGGNHWIMLHID

Query:  LQEGEIIVWDSIRSMTPFPALESELRPMTVVLPTFMHRAGVQILRPTLPNTPWRIRQVTSAPQQSESGDCGMFCVKYFEYDVTGSNMTSLTQDNISFFRE
          EGE+IVWDS  +MTP P LE EL+PM  ++PT + R GV + +P +P TPWRIR+V+SAPQQ   GDCG+FC+ +FEYDVT  +  +LTQ  +SFFR 
Subjt:  LQEGEIIVWDSIRSMTPFPALESELRPMTVVLPTFMHRAGVQILRPTLPNTPWRIRQVTSAPQQSESGDCGMFCVKYFEYDVTGSNMTSLTQDNISFFRE

Query:  KLAIEM
        + A+++
Subjt:  KLAIEM

A0A6J1DP34 uncharacterized protein LOC1110218023.4e-8351.52Show/hide
Query:  LGEIKDSTPDTISFNLFGSKVSFGRREFDIISGLKYDRGLVRKDTSPHRLRALYFNDSNEVLLSEFEKIYLAARFEDDFDAVKVSIVYLVELVLLGRERT
        L E+++STP+TISFNLF  ++SF R +F +ISGLKY R  VR++T PHRL  LYFND  +++LS+FEK+Y AARFEDD+D VKV IVY+V + LLGRER 
Subjt:  LGEIKDSTPDTISFNLFGSKVSFGRREFDIISGLKYDRGLVRKDTSPHRLRALYFNDSNEVLLSEFEKIYLAARFEDDFDAVKVSIVYLVELVLLGRERT

Query:  LKYDYTLLGIVDDWETCCNHDWGMLSFDKTIYSLKRGPTKRSKDGGFRKSYSLYGFPWAFQVNS-----------------DVVPRILRWRCGHSTAWHV
        +K+D+TLLGIVDDWE CCN++W  LSF+KTI SL+RGP K SKDG  RKSYSLYGFPW FQV +                 D VP I +WR  HSTAWHV
Subjt:  LKYDYTLLGIVDDWETCCNHDWGMLSFDKTIYSLKRGPTKRSKDGGFRKSYSLYGFPWAFQVNS-----------------DVVPRILRWRCGHSTAWHV

Query:  LDREIFRSSTGRTRSIEATDAETTFLDRTFEPPEPEDEDEVSRENNATEPSSACAGPEKDDGRQGGMSTKVLEK----------------TSMKRLRRKR
        LDR+IF S+ GRTR+++ TD ET+FL+R+F+PP  +D+D +    +   PS+   G + DD  +G    +++EK                 S  RL+R  
Subjt:  LDREIFRSSTGRTRSIEATDAETTFLDRTFEPPEPEDEDEVSRENNATEPSSACAGPEKDDGRQGGMSTKVLEK----------------TSMKRLRRKR

Query:  RMDVRMDKRMDDGFEGIKTELKYIRKFL
        +    MDKRMD+    I+ ELK I+KFL
Subjt:  RMDVRMDKRMDDGFEGIKTELKYIRKFL

A0A6J1DQC8 uncharacterized protein LOC1110233531.5e-6758.06Show/hide
Query:  LGEIKDSTPDTISFNLFGSKVSFGRREFDIISGLKYDRGLVRKDTSPHRLRALYFNDSNEVLLSEFEKIYLAARFEDDFDAVKVSIVYLVELVLLGRERT
        L E++DSTP+TISFNLFG +VSFGRREFD+ISGL YDR  VRK T  H+LR LYFND    +LS+F K+Y+AA F+DDFD +KVSI+Y+VELVLLGRE T
Subjt:  LGEIKDSTPDTISFNLFGSKVSFGRREFDIISGLKYDRGLVRKDTSPHRLRALYFNDSNEVLLSEFEKIYLAARFEDDFDAVKVSIVYLVELVLLGRERT

Query:  LKYDYTLLGIVDDWETCCNHDWGMLSFDKTIYSLKRGPTKRSKDGGFRKSYSLYGFPWAFQVNSDVVPRILRWRCGHSTAWHVLDREIFRSSTGRTRSIE
        +K+D  LLG+VDDWE CCNHD   LSFDKTI SL RGPT  +KD G RKSYSLYGFPW FQV                  W         +   RTR +E
Subjt:  LKYDYTLLGIVDDWETCCNHDWGMLSFDKTIYSLKRGPTKRSKDGGFRKSYSLYGFPWAFQVNSDVVPRILRWRCGHSTAWHVLDREIFRSSTGRTRSIE

Query:  ATDAETTFLDRTFEPPEPEDEDEVSRENNATEPSSACAGPEKDDGRQG
        ATDAET F+ RTFEPPEPED+D   R+ +A  PS+   G +  D  +G
Subjt:  ATDAETTFLDRTFEPPEPEDEDEVSRENNATEPSSACAGPEKDDGRQG

A0A6J1DYB1 uncharacterized protein LOC1110256667.1e-7387.34Show/hide
Query:  LGEIKD-STPDTISFNLFGSKVSFGRREFDIISGLKYDRGLVRKDTSPHRLRALYFNDSNEVLLSEFEKIYLAARFEDDFDAVKVSIVYLVELVLLGRER
        L EI+D STP+TISFNLFGSKV F RREFDIISGLKYDR  VRKDTSPHRLRALYFNDSN++LLS+FEKIY+  RFEDDFDA K+SIVYL+ELVLLGRER
Subjt:  LGEIKD-STPDTISFNLFGSKVSFGRREFDIISGLKYDRGLVRKDTSPHRLRALYFNDSNEVLLSEFEKIYLAARFEDDFDAVKVSIVYLVELVLLGRER

Query:  TLKYDYTLLGIVDDWETCCNHDWGMLSFDKTIYSLKRGPTKRSKDGGFRKSYSLYGFP
        TLKYDYTLLGIVDD ETCCNHDWGM+SFDKTIYSLKRGPTKRSKDGGFRK YSLYGFP
Subjt:  TLKYDYTLLGIVDDWETCCNHDWGMLSFDKTIYSLKRGPTKRSKDGGFRKSYSLYGFP

A0A6J1E0A9 uncharacterized protein LOC1110252092.2e-6667.18Show/hide
Query:  LFGSKVSFGRREFDIISGLKYDRGLVRKDTSPHRLRALYFNDSNEVLLSEFEKIYLAARFEDDFDAVKVSIVYLVELVLLGRERTLKYDYTLLGIVDDWE
        L G+KVSFGRREFDIISGLKY R  VRK T P R   LYFN+S ++LLSE EK+Y + RFEDD DAVKV +VY VELVLLGRER+ K+D+ LLGIVDDWE
Subjt:  LFGSKVSFGRREFDIISGLKYDRGLVRKDTSPHRLRALYFNDSNEVLLSEFEKIYLAARFEDDFDAVKVSIVYLVELVLLGRERTLKYDYTLLGIVDDWE

Query:  TCCNHDWGMLSFDKTIYSLKRGPTKRSKDGGFRKSYSLYGFPWAFQ-----------------VNSDVVPRILRWRCGHSTAWHVLDREIFRSST
         CCNHDW +LSFDKTIYSL+RG + +SK+GG RKSYSLYGFPWAFQ                 V+ DVVPRIL+WR  HSTA+H+L REIFRSST
Subjt:  TCCNHDWGMLSFDKTIYSLKRGPTKRSKDGGFRKSYSLYGFPWAFQ-----------------VNSDVVPRILRWRCGHSTAWHVLDREIFRSST

SwissProt top hitse value%identityAlignment
Q94F30 Ubiquitin-like-specific protease ESD46.1e-0524.81Show/hide
Query:  LDLDAIYLPYNIGGNHWIMLHIDLQEGEIIVWDSIRSMTPFPALESELRPMTVVLPTFMHRAGVQILRPTLPNTPWRIRQVTSAPQQSESGDCGMFCVKY
        +D D I++P +  G HW +  I+ +E +++  DS+  + P          +   L  +M     +     +    W +  V   PQQ    DCGMF +KY
Subjt:  LDLDAIYLPYNIGGNHWIMLHIDLQEGEIIVWDSIRSMTPFPALESELRPMTVVLPTFMHRAGVQILRPTLPNTPWRIRQVTSAPQQSESGDCGMFCVKY

Query:  FEYDVTGSNMTSLTQDNISFFREKLAIEM
         ++   G  +   +Q+++ +FR + A E+
Subjt:  FEYDVTGSNMTSLTQDNISFFREKLAIEM

Arabidopsis top hitse value%identityAlignment
AT4G15880.1 Cysteine proteinases superfamily protein4.3e-0624.81Show/hide
Query:  LDLDAIYLPYNIGGNHWIMLHIDLQEGEIIVWDSIRSMTPFPALESELRPMTVVLPTFMHRAGVQILRPTLPNTPWRIRQVTSAPQQSESGDCGMFCVKY
        +D D I++P +  G HW +  I+ +E +++  DS+  + P          +   L  +M     +     +    W +  V   PQQ    DCGMF +KY
Subjt:  LDLDAIYLPYNIGGNHWIMLHIDLQEGEIIVWDSIRSMTPFPALESELRPMTVVLPTFMHRAGVQILRPTLPNTPWRIRQVTSAPQQSESGDCGMFCVKY

Query:  FEYDVTGSNMTSLTQDNISFFREKLAIEM
         ++   G  +   +Q+++ +FR + A E+
Subjt:  FEYDVTGSNMTSLTQDNISFFREKLAIEM

AT5G45570.1 Ulp1 protease family protein4.0e-1228.24Show/hide
Query:  WLDLDAIYLPYNIGGNHWIMLHIDLQEGEIIVWDSIRSMTPFPALESELRPMTVVLPTFMHR-AGVQILRPTLPNTPWRIRQVTSAPQQSESGDCGMFCV
        ++D+D +Y    + GNHW+ L IDL    + V+DSI S+T    +  +   +  ++P  +      +  R +     W  +++T  P+  + GDC ++ +
Subjt:  WLDLDAIYLPYNIGGNHWIMLHIDLQEGEIIVWDSIRSMTPFPALESELRPMTVVLPTFMHR-AGVQILRPTLPNTPWRIRQVTSAPQQSESGDCGMFCV

Query:  KYFEYDVTGSNMTSLTQDNISFFREKLAIEM
        KY E    G +   L  +N+   R KLA+EM
Subjt:  KYFEYDVTGSNMTSLTQDNISFFREKLAIEM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCTTCAAGAACCTACATCACATTCTCCAAAAACCGACTCCCACAGCACAAAATCCTCGAAAACACTAACTGAAACTCAAGATCGAAATCGAATTTCAGACTCAGG
CCTAGGGGAGATCAAGGATAGTACACCTGACACCATTAGCTTCAACCTGTTTGGGAGTAAGGTGTCATTTGGGCGGAGGGAGTTCGACATTATTTCTGGGCTTAAGTATG
ATAGGGGTCTAGTTAGAAAAGACACATCTCCCCACAGACTTAGGGCTCTTTACTTTAATGATAGCAACGAAGTTCTGTTGAGTGAATTTGAGAAGATTTATTTAGCCGCA
CGGTTCGAGGATGACTTCGACGCGGTCAAGGTATCTATTGTATACTTAGTAGAGTTAGTTCTGTTGGGGAGGGAGAGGACCCTGAAGTACGACTATACATTGCTGGGAAT
AGTCGATGATTGGGAAACGTGCTGCAACCACGACTGGGGGATGTTGTCCTTTGATAAGACTATATATAGTCTGAAGCGCGGCCCGACGAAGAGGTCGAAGGATGGCGGGT
TCAGGAAATCGTACAGTCTCTACGGTTTCCCTTGGGCGTTCCAGGTAAATTCGGACGTCGTGCCACGGATTCTTCGATGGAGGTGCGGCCATTCAACTGCATGGCATGTG
CTTGATAGAGAGATTTTTCGATCTAGCACAGGAAGAACTCGATCAATAGAAGCAACGGATGCTGAGACGACCTTCCTAGATAGGACGTTCGAACCACCGGAGCCCGAAGA
TGAGGACGAAGTCTCGCGCGAGAACAATGCTACTGAACCATCAAGTGCATGTGCAGGACCAGAGAAGGACGACGGGAGACAAGGAGGAATGTCGACAAAGGTGTTAGAGA
AGACGTCCATGAAGAGGCTGAGGAGAAAGAGGAGAATGGACGTGCGCATGGACAAGCGCATGGACGACGGGTTCGAAGGTATTAAGACTGAATTAAAATACATCCGGAAG
TTCTTGCGAAGAATCGCTAAGGGTTTGCCTGTCGACCCGAATGACATGAGAAGAGGCAGCAACCGTGATGGTACTGGACAAGGAGATGGTCCGAGTGATGGTCCTGGGCC
AGGAGATGGTCCGAGTGATGCACCCAGTGGGGGACCTAGTGATCGAGGTGATGGTCCCAAGGATGGCAATGGTTCTACACCATCTCCTAGGGACGTGGACGACACAGATG
ACATGATCATCAATCCCCCCCCATGTGATCGCGCAGGAGACAAAGGAACATCAGTCCGGCCAAACGACCGGGAACCAGAGGACGTGCACAAGGAAGATATTGGTCTACAT
GCGCGTTGTGAGGCTGCCCCTCTCGAGCAGACTCCTGTTCAGAGCAGACAGGTGGACCATATTACGATCGATTCGCATCCGTTAGAGTCATCTATGGACGACGAGGACGA
ATACGCCGAGGACTTCACGGACTCTGATGCGAAAGAGCCGGGAGCGACGTCGCAACCAGACCTGGATGAGCTTCGCGGGTCGTTCAATGTCATGGTGGACGGGAAGCGGA
AGAAGGTAATGCGATATGACCCACTAGTCCACGTCCCCTCTGAACAAGTCCAGAAGTTTCATGCTTGGCTGGCGAACCCTAACACCGACCGCGCCACTCGCAAATCATGC
TACGGTGATCGAGGAAAGACATGGTTTCGTGACCTTATCAACTCGGGCAAGTGGATGACGAGTGAGGTGATCGATTCGTTGTTCATGTTCACGCGGAACAAACTCGAGCA
ACGGCATGACTTGTGTTCTCGAAGATTCACCACCGGTGACATTAACTTTCTTCGACGAACAGACGGACTATATCAACGCATGACAGCCCCGAACGCTGTACCTGCGAGAG
TTGCAACGGAATACGATTGGGCTGGGAAGTGTAAGACCATCCTGAGCTATATGGACGGGACGCACACAGACTATCAAACACGATGGCTTGATCTGGATGCTATTTACCTG
CCATACAACATCGGTGGCAACCATTGGATTATGCTACACATCGATCTGCAGGAGGGTGAGATCATTGTGTGGGATTCTATAAGGTCGATGACACCCTTTCCAGCTCTGGA
GTCCGAGTTGAGGCCAATGACTGTTGTCCTACCAACGTTTATGCACAGGGCCGGTGTTCAGATACTGAGGCCGACACTACCGAATACGCCATGGCGCATTCGTCAAGTAA
CGTCCGCGCCCCAGCAAAGCGAGTCTGGTGACTGTGGAATGTTTTGCGTTAAATATTTCGAGTATGATGTAACAGGGTCAAATATGACCAGCTTAACTCAGGACAACATT
AGTTTTTTTAGGGAGAAATTGGCCATAGAAATGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGCTTCAAGAACCTACATCACATTCTCCAAAAACCGACTCCCACAGCACAAAATCCTCGAAAACACTAACTGAAACTCAAGATCGAAATCGAATTTCAGACTCAGG
CCTAGGGGAGATCAAGGATAGTACACCTGACACCATTAGCTTCAACCTGTTTGGGAGTAAGGTGTCATTTGGGCGGAGGGAGTTCGACATTATTTCTGGGCTTAAGTATG
ATAGGGGTCTAGTTAGAAAAGACACATCTCCCCACAGACTTAGGGCTCTTTACTTTAATGATAGCAACGAAGTTCTGTTGAGTGAATTTGAGAAGATTTATTTAGCCGCA
CGGTTCGAGGATGACTTCGACGCGGTCAAGGTATCTATTGTATACTTAGTAGAGTTAGTTCTGTTGGGGAGGGAGAGGACCCTGAAGTACGACTATACATTGCTGGGAAT
AGTCGATGATTGGGAAACGTGCTGCAACCACGACTGGGGGATGTTGTCCTTTGATAAGACTATATATAGTCTGAAGCGCGGCCCGACGAAGAGGTCGAAGGATGGCGGGT
TCAGGAAATCGTACAGTCTCTACGGTTTCCCTTGGGCGTTCCAGGTAAATTCGGACGTCGTGCCACGGATTCTTCGATGGAGGTGCGGCCATTCAACTGCATGGCATGTG
CTTGATAGAGAGATTTTTCGATCTAGCACAGGAAGAACTCGATCAATAGAAGCAACGGATGCTGAGACGACCTTCCTAGATAGGACGTTCGAACCACCGGAGCCCGAAGA
TGAGGACGAAGTCTCGCGCGAGAACAATGCTACTGAACCATCAAGTGCATGTGCAGGACCAGAGAAGGACGACGGGAGACAAGGAGGAATGTCGACAAAGGTGTTAGAGA
AGACGTCCATGAAGAGGCTGAGGAGAAAGAGGAGAATGGACGTGCGCATGGACAAGCGCATGGACGACGGGTTCGAAGGTATTAAGACTGAATTAAAATACATCCGGAAG
TTCTTGCGAAGAATCGCTAAGGGTTTGCCTGTCGACCCGAATGACATGAGAAGAGGCAGCAACCGTGATGGTACTGGACAAGGAGATGGTCCGAGTGATGGTCCTGGGCC
AGGAGATGGTCCGAGTGATGCACCCAGTGGGGGACCTAGTGATCGAGGTGATGGTCCCAAGGATGGCAATGGTTCTACACCATCTCCTAGGGACGTGGACGACACAGATG
ACATGATCATCAATCCCCCCCCATGTGATCGCGCAGGAGACAAAGGAACATCAGTCCGGCCAAACGACCGGGAACCAGAGGACGTGCACAAGGAAGATATTGGTCTACAT
GCGCGTTGTGAGGCTGCCCCTCTCGAGCAGACTCCTGTTCAGAGCAGACAGGTGGACCATATTACGATCGATTCGCATCCGTTAGAGTCATCTATGGACGACGAGGACGA
ATACGCCGAGGACTTCACGGACTCTGATGCGAAAGAGCCGGGAGCGACGTCGCAACCAGACCTGGATGAGCTTCGCGGGTCGTTCAATGTCATGGTGGACGGGAAGCGGA
AGAAGGTAATGCGATATGACCCACTAGTCCACGTCCCCTCTGAACAAGTCCAGAAGTTTCATGCTTGGCTGGCGAACCCTAACACCGACCGCGCCACTCGCAAATCATGC
TACGGTGATCGAGGAAAGACATGGTTTCGTGACCTTATCAACTCGGGCAAGTGGATGACGAGTGAGGTGATCGATTCGTTGTTCATGTTCACGCGGAACAAACTCGAGCA
ACGGCATGACTTGTGTTCTCGAAGATTCACCACCGGTGACATTAACTTTCTTCGACGAACAGACGGACTATATCAACGCATGACAGCCCCGAACGCTGTACCTGCGAGAG
TTGCAACGGAATACGATTGGGCTGGGAAGTGTAAGACCATCCTGAGCTATATGGACGGGACGCACACAGACTATCAAACACGATGGCTTGATCTGGATGCTATTTACCTG
CCATACAACATCGGTGGCAACCATTGGATTATGCTACACATCGATCTGCAGGAGGGTGAGATCATTGTGTGGGATTCTATAAGGTCGATGACACCCTTTCCAGCTCTGGA
GTCCGAGTTGAGGCCAATGACTGTTGTCCTACCAACGTTTATGCACAGGGCCGGTGTTCAGATACTGAGGCCGACACTACCGAATACGCCATGGCGCATTCGTCAAGTAA
CGTCCGCGCCCCAGCAAAGCGAGTCTGGTGACTGTGGAATGTTTTGCGTTAAATATTTCGAGTATGATGTAACAGGGTCAAATATGACCAGCTTAACTCAGGACAACATT
AGTTTTTTTAGGGAGAAATTGGCCATAGAAATGTGA
Protein sequenceShow/hide protein sequence
MELQEPTSHSPKTDSHSTKSSKTLTETQDRNRISDSGLGEIKDSTPDTISFNLFGSKVSFGRREFDIISGLKYDRGLVRKDTSPHRLRALYFNDSNEVLLSEFEKIYLAA
RFEDDFDAVKVSIVYLVELVLLGRERTLKYDYTLLGIVDDWETCCNHDWGMLSFDKTIYSLKRGPTKRSKDGGFRKSYSLYGFPWAFQVNSDVVPRILRWRCGHSTAWHV
LDREIFRSSTGRTRSIEATDAETTFLDRTFEPPEPEDEDEVSRENNATEPSSACAGPEKDDGRQGGMSTKVLEKTSMKRLRRKRRMDVRMDKRMDDGFEGIKTELKYIRK
FLRRIAKGLPVDPNDMRRGSNRDGTGQGDGPSDGPGPGDGPSDAPSGGPSDRGDGPKDGNGSTPSPRDVDDTDDMIINPPPCDRAGDKGTSVRPNDREPEDVHKEDIGLH
ARCEAAPLEQTPVQSRQVDHITIDSHPLESSMDDEDEYAEDFTDSDAKEPGATSQPDLDELRGSFNVMVDGKRKKVMRYDPLVHVPSEQVQKFHAWLANPNTDRATRKSC
YGDRGKTWFRDLINSGKWMTSEVIDSLFMFTRNKLEQRHDLCSRRFTTGDINFLRRTDGLYQRMTAPNAVPARVATEYDWAGKCKTILSYMDGTHTDYQTRWLDLDAIYL
PYNIGGNHWIMLHIDLQEGEIIVWDSIRSMTPFPALESELRPMTVVLPTFMHRAGVQILRPTLPNTPWRIRQVTSAPQQSESGDCGMFCVKYFEYDVTGSNMTSLTQDNI
SFFREKLAIEM