; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g32410 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g32410
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUlp1-like peptidase
Genome locationchr8:23437680..23442927
RNA-Seq ExpressionMoc08g32410
SyntenyMoc08g32410
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022154364.1 uncharacterized protein LOC111021646 [Momordica charantia]1.2e-6657.43Show/hide
Query:  RPDLCSRRFTTGDILLANFFRRTDGIYQRLIAPNVVPAIVSAEYDWESRYKTIMSYVDGTHTNYETRWLDLDAVYLPYNIGGNHWIMVYIDMQEGEIIVW
        RP+LC R+FTTGD+L++NF R TDG+Y  + +PNV+ + V+++YDWE R  +++SY+DGTH++ +TRW+D+DAVYLPYNIGG HWI++ ID  EGE+IVW
Subjt:  RPDLCSRRFTTGDILLANFFRRTDGIYQRLIAPNVVPAIVSAEYDWESRYKTIMSYVDGTHTNYETRWLDLDAVYLPYNIGGNHWIMVYIDMQEGEIIVW

Query:  DSMMAITTPATLEEELKPMSVILPALMCRAGVRVLRPTIPTVPWRIRRVTGAPQQTGSGDCGIFCVKFFEYDVTGSNPATLTQERIPFFREKLAIEFDPK
        DS M +T    LE+ELKPM  I+P L+CR GV + +P IP  PWRIRRV+ APQQ   GDCGIFC+ FFEYDVT  +  TLTQ R+ FFR + A++    
Subjt:  DSMMAITTPATLEEELKPMSVILPALMCRAGVRVLRPTIPTVPWRIRRVTGAPQQTGSGDCGIFCVKFFEYDVTGSNPATLTQERIPFFREKLAIEFDPK

Query:  KS
        KS
Subjt:  KS

XP_022154561.1 uncharacterized protein LOC111021802 [Momordica charantia]1.8e-7850.4Show/hide
Query:  MFRKTIFGHLLNVDLVFNGSLVHNILLREVEDSTTDSISFNLFGRKVSFGRREFDLISGLKYDGSLVKKDTHVHRLRALYFNDRFDLVLSDLEDLYEAAQ
        MFRKT F HLL+VDLVFNGSL+HNILLREVE+ST ++ISFNLF R++SF R +F LISGLKY  + V+++T  HRL  LYFND+ DLVLSD E +Y AA+
Subjt:  MFRKTIFGHLLNVDLVFNGSLVHNILLREVEDSTTDSISFNLFGRKVSFGRREFDLISGLKYDGSLVKKDTHVHRLRALYFNDRFDLVLSDLEDLYEAAQ

Query:  FQDDFDAVK-------------------------------EVCCNYDWASLSFEKTIGSLHRGPTKMAKDGGLRKSYSLYGFPWVFQVWAYEVISSLSGR
        F+DD+D VK                               EVCCNY+WASLSFEKTI SL RGP KM+KDG LRKSYSLYGFPWVFQVWAY+ ISSLS R
Subjt:  FQDDFDAVK-------------------------------EVCCNYDWASLSFEKTIGSLHRGPTKMAKDGGLRKSYSLYGFPWVFQVWAYEVISSLSGR

Query:  VANKVLRMQC-HV------SSDGGVAIRLHGMCSTERFFGLQRLEASDAEMSFMRRAFEPPEADDDDAMDVDAGQAGVREGIEDPADGRGDDARPQSGVC
        VANKVL     H+       S     +     CST+     + L+ +D E SF+ R+F+PP +DDDD M                 + RGD+A P S V 
Subjt:  VANKVLRMQC-HV------SSDGGVAIRLHGMCSTERFFGLQRLEASDAEMSFMRRAFEPPEADDDDAMDVDAGQAGVREGIEDPADGRGDDARPQSGVC

Query:  EGPQEPDVGQGDAVGPSAVREGRGISADIADSKGQNNVRTSSMRLRKVEKRLKKMDKRIGERMSGMEAELKAIRKYL
        EG Q  D  +G  V    V +   +  +  ++KG+N V  S+ RL++VEK LK MDKR+ ERM  +EAELK+I+K+L
Subjt:  EGPQEPDVGQGDAVGPSAVREGRGISADIADSKGQNNVRTSSMRLRKVEKRLKKMDKRIGERMSGMEAELKAIRKYL

XP_022156465.1 uncharacterized protein LOC111023353 [Momordica charantia]9.1e-7556.42Show/hide
Query:  MFRKTIFGHLLNVDLVFNGSLVHNILLREVEDSTTDSISFNLFGRKVSFGRREFDLISGLKYDGSLVKKDTHVHRLRALYFNDRFDLVLSDLEDLYEAAQ
        MFRKTIFGHLL+VDLVFNG L+HNILLREVEDST ++ISFNLFGR+VSFGRREFDLISGL YD S V+K TH H+LR LYFNDR + VLSD   LY AA 
Subjt:  MFRKTIFGHLLNVDLVFNGSLVHNILLREVEDSTTDSISFNLFGRKVSFGRREFDLISGLKYDGSLVKKDTHVHRLRALYFNDRFDLVLSDLEDLYEAAQ

Query:  FQDDFDAVK-------------------------------EVCCNYDWASLSFEKTIGSLHRGPTKMAKDGGLRKSYSLYGFPWVFQVWAYEVISSLSGR
        F+DDFD +K                               E+CCN+D ASLSF+KTI SLHRGPT MAKD GLRKSYSLYGFPWVFQVW YE  +     
Subjt:  FQDDFDAVK-------------------------------EVCCNYDWASLSFEKTIGSLHRGPTKMAKDGGLRKSYSLYGFPWVFQVWAYEVISSLSGR

Query:  VANKVLRMQCHVSSDGGVAIRLHGMCSTERFFGLQRLEASDAEMSFMRRAFEPPEADDDDAMDVDAGQAGVREGIEDPADGRGDDARPQSGVCEGP
                                          +RLEA+DAE +FMRR FEPPE +DDD  D DAG + VREG ++P  GRGDDA P S V EGP
Subjt:  VANKVLRMQCHVSSDGGVAIRLHGMCSTERFFGLQRLEASDAEMSFMRRAFEPPEADDDDAMDVDAGQAGVREGIEDPADGRGDDARPQSGVCEGP

XP_022158083.1 uncharacterized protein LOC111024651 [Momordica charantia]2.1e-7179.68Show/hide
Query:  MELRPKIDPAIYASAKVSCLSHLAKTVTAIKGKLAPRQLSMFRKTIFGHLLNVDLVFNGSLVHNILLREVEDSTTDSISFNLFGRKVSFGRREFDLISGL
        MELRPKIDPAIYASA VSCLSHLAKTVTAIKGKLAPRQL+MFRKTIF HLL+VDLVFNG LVHNILLREVEDSTTDSISFNLFGRKVSFGRREFDLISGL
Subjt:  MELRPKIDPAIYASAKVSCLSHLAKTVTAIKGKLAPRQLSMFRKTIFGHLLNVDLVFNGSLVHNILLREVEDSTTDSISFNLFGRKVSFGRREFDLISGL

Query:  KYDGSLVKKDTHVHRLRALYFNDRFDLVLSDLEDLYEAAQFQDDFDAVK-------------------------------EVCCNYD
        KYDGSLV+KDTHVHRLRALYFNDR DLVLSDLEDLYEAAQFQDDFDAVK                               EVCCNYD
Subjt:  KYDGSLVKKDTHVHRLRALYFNDRFDLVLSDLEDLYEAAQFQDDFDAVK-------------------------------EVCCNYD

XP_022158660.1 uncharacterized protein LOC111025123 [Momordica charantia]6.9e-75100Show/hide
Query:  MELRPKIDPAIYASAKVSCLSHLAKTVTAIKGKLAPRQLSMFRKTIFGHLLNVDLVFNGSLVHNILLREVEDSTTDSISFNLFGRKVSFGRREFDLISGL
        MELRPKIDPAIYASAKVSCLSHLAKTVTAIKGKLAPRQLSMFRKTIFGHLLNVDLVFNGSLVHNILLREVEDSTTDSISFNLFGRKVSFGRREFDLISGL
Subjt:  MELRPKIDPAIYASAKVSCLSHLAKTVTAIKGKLAPRQLSMFRKTIFGHLLNVDLVFNGSLVHNILLREVEDSTTDSISFNLFGRKVSFGRREFDLISGL

Query:  KYDGSLVKKDTHVHRLRALYFNDRFDLVLSDLEDLYEAAQFQDDFDAVK
        KYDGSLVKKDTHVHRLRALYFNDRFDLVLSDLEDLYEAAQFQDDFDAVK
Subjt:  KYDGSLVKKDTHVHRLRALYFNDRFDLVLSDLEDLYEAAQFQDDFDAVK

TrEMBL top hitse value%identityAlignment
A0A6J1DLV0 uncharacterized protein LOC1110216465.7e-6757.43Show/hide
Query:  RPDLCSRRFTTGDILLANFFRRTDGIYQRLIAPNVVPAIVSAEYDWESRYKTIMSYVDGTHTNYETRWLDLDAVYLPYNIGGNHWIMVYIDMQEGEIIVW
        RP+LC R+FTTGD+L++NF R TDG+Y  + +PNV+ + V+++YDWE R  +++SY+DGTH++ +TRW+D+DAVYLPYNIGG HWI++ ID  EGE+IVW
Subjt:  RPDLCSRRFTTGDILLANFFRRTDGIYQRLIAPNVVPAIVSAEYDWESRYKTIMSYVDGTHTNYETRWLDLDAVYLPYNIGGNHWIMVYIDMQEGEIIVW

Query:  DSMMAITTPATLEEELKPMSVILPALMCRAGVRVLRPTIPTVPWRIRRVTGAPQQTGSGDCGIFCVKFFEYDVTGSNPATLTQERIPFFREKLAIEFDPK
        DS M +T    LE+ELKPM  I+P L+CR GV + +P IP  PWRIRRV+ APQQ   GDCGIFC+ FFEYDVT  +  TLTQ R+ FFR + A++    
Subjt:  DSMMAITTPATLEEELKPMSVILPALMCRAGVRVLRPTIPTVPWRIRRVTGAPQQTGSGDCGIFCVKFFEYDVTGSNPATLTQERIPFFREKLAIEFDPK

Query:  KS
        KS
Subjt:  KS

A0A6J1DP34 uncharacterized protein LOC1110218028.5e-7950.4Show/hide
Query:  MFRKTIFGHLLNVDLVFNGSLVHNILLREVEDSTTDSISFNLFGRKVSFGRREFDLISGLKYDGSLVKKDTHVHRLRALYFNDRFDLVLSDLEDLYEAAQ
        MFRKT F HLL+VDLVFNGSL+HNILLREVE+ST ++ISFNLF R++SF R +F LISGLKY  + V+++T  HRL  LYFND+ DLVLSD E +Y AA+
Subjt:  MFRKTIFGHLLNVDLVFNGSLVHNILLREVEDSTTDSISFNLFGRKVSFGRREFDLISGLKYDGSLVKKDTHVHRLRALYFNDRFDLVLSDLEDLYEAAQ

Query:  FQDDFDAVK-------------------------------EVCCNYDWASLSFEKTIGSLHRGPTKMAKDGGLRKSYSLYGFPWVFQVWAYEVISSLSGR
        F+DD+D VK                               EVCCNY+WASLSFEKTI SL RGP KM+KDG LRKSYSLYGFPWVFQVWAY+ ISSLS R
Subjt:  FQDDFDAVK-------------------------------EVCCNYDWASLSFEKTIGSLHRGPTKMAKDGGLRKSYSLYGFPWVFQVWAYEVISSLSGR

Query:  VANKVLRMQC-HV------SSDGGVAIRLHGMCSTERFFGLQRLEASDAEMSFMRRAFEPPEADDDDAMDVDAGQAGVREGIEDPADGRGDDARPQSGVC
        VANKVL     H+       S     +     CST+     + L+ +D E SF+ R+F+PP +DDDD M                 + RGD+A P S V 
Subjt:  VANKVLRMQC-HV------SSDGGVAIRLHGMCSTERFFGLQRLEASDAEMSFMRRAFEPPEADDDDAMDVDAGQAGVREGIEDPADGRGDDARPQSGVC

Query:  EGPQEPDVGQGDAVGPSAVREGRGISADIADSKGQNNVRTSSMRLRKVEKRLKKMDKRIGERMSGMEAELKAIRKYL
        EG Q  D  +G  V    V +   +  +  ++KG+N V  S+ RL++VEK LK MDKR+ ERM  +EAELK+I+K+L
Subjt:  EGPQEPDVGQGDAVGPSAVREGRGISADIADSKGQNNVRTSSMRLRKVEKRLKKMDKRIGERMSGMEAELKAIRKYL

A0A6J1DQC8 uncharacterized protein LOC1110233534.4e-7556.42Show/hide
Query:  MFRKTIFGHLLNVDLVFNGSLVHNILLREVEDSTTDSISFNLFGRKVSFGRREFDLISGLKYDGSLVKKDTHVHRLRALYFNDRFDLVLSDLEDLYEAAQ
        MFRKTIFGHLL+VDLVFNG L+HNILLREVEDST ++ISFNLFGR+VSFGRREFDLISGL YD S V+K TH H+LR LYFNDR + VLSD   LY AA 
Subjt:  MFRKTIFGHLLNVDLVFNGSLVHNILLREVEDSTTDSISFNLFGRKVSFGRREFDLISGLKYDGSLVKKDTHVHRLRALYFNDRFDLVLSDLEDLYEAAQ

Query:  FQDDFDAVK-------------------------------EVCCNYDWASLSFEKTIGSLHRGPTKMAKDGGLRKSYSLYGFPWVFQVWAYEVISSLSGR
        F+DDFD +K                               E+CCN+D ASLSF+KTI SLHRGPT MAKD GLRKSYSLYGFPWVFQVW YE  +     
Subjt:  FQDDFDAVK-------------------------------EVCCNYDWASLSFEKTIGSLHRGPTKMAKDGGLRKSYSLYGFPWVFQVWAYEVISSLSGR

Query:  VANKVLRMQCHVSSDGGVAIRLHGMCSTERFFGLQRLEASDAEMSFMRRAFEPPEADDDDAMDVDAGQAGVREGIEDPADGRGDDARPQSGVCEGP
                                          +RLEA+DAE +FMRR FEPPE +DDD  D DAG + VREG ++P  GRGDDA P S V EGP
Subjt:  VANKVLRMQCHVSSDGGVAIRLHGMCSTERFFGLQRLEASDAEMSFMRRAFEPPEADDDDAMDVDAGQAGVREGIEDPADGRGDDARPQSGVCEGP

A0A6J1DV44 uncharacterized protein LOC1110246511.0e-7179.68Show/hide
Query:  MELRPKIDPAIYASAKVSCLSHLAKTVTAIKGKLAPRQLSMFRKTIFGHLLNVDLVFNGSLVHNILLREVEDSTTDSISFNLFGRKVSFGRREFDLISGL
        MELRPKIDPAIYASA VSCLSHLAKTVTAIKGKLAPRQL+MFRKTIF HLL+VDLVFNG LVHNILLREVEDSTTDSISFNLFGRKVSFGRREFDLISGL
Subjt:  MELRPKIDPAIYASAKVSCLSHLAKTVTAIKGKLAPRQLSMFRKTIFGHLLNVDLVFNGSLVHNILLREVEDSTTDSISFNLFGRKVSFGRREFDLISGL

Query:  KYDGSLVKKDTHVHRLRALYFNDRFDLVLSDLEDLYEAAQFQDDFDAVK-------------------------------EVCCNYD
        KYDGSLV+KDTHVHRLRALYFNDR DLVLSDLEDLYEAAQFQDDFDAVK                               EVCCNYD
Subjt:  KYDGSLVKKDTHVHRLRALYFNDRFDLVLSDLEDLYEAAQFQDDFDAVK-------------------------------EVCCNYD

A0A6J1DWG2 uncharacterized protein LOC1110251233.4e-75100Show/hide
Query:  MELRPKIDPAIYASAKVSCLSHLAKTVTAIKGKLAPRQLSMFRKTIFGHLLNVDLVFNGSLVHNILLREVEDSTTDSISFNLFGRKVSFGRREFDLISGL
        MELRPKIDPAIYASAKVSCLSHLAKTVTAIKGKLAPRQLSMFRKTIFGHLLNVDLVFNGSLVHNILLREVEDSTTDSISFNLFGRKVSFGRREFDLISGL
Subjt:  MELRPKIDPAIYASAKVSCLSHLAKTVTAIKGKLAPRQLSMFRKTIFGHLLNVDLVFNGSLVHNILLREVEDSTTDSISFNLFGRKVSFGRREFDLISGL

Query:  KYDGSLVKKDTHVHRLRALYFNDRFDLVLSDLEDLYEAAQFQDDFDAVK
        KYDGSLVKKDTHVHRLRALYFNDRFDLVLSDLEDLYEAAQFQDDFDAVK
Subjt:  KYDGSLVKKDTHVHRLRALYFNDRFDLVLSDLEDLYEAAQFQDDFDAVK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G45570.1 Ulp1 protease family protein1.3e-1026.43Show/hide
Query:  WLDLDAVYLPYNIGGNHWIMVYIDMQEGEIIVWDSMMAITTPATLEEELKPMSVILPALMCR-AGVRVLRPTIPTVPWRIRRVTGAPQQTGSGDCGIFCV
        ++D+D +Y    + GNHW+ + ID+    + V+DS+ ++TT   +  +   +  ++PA++      +  R +   + W  +R+T  P+    GDC I+ +
Subjt:  WLDLDAVYLPYNIGGNHWIMVYIDMQEGEIIVWDSMMAITTPATLEEELKPMSVILPALMCR-AGVRVLRPTIPTVPWRIRRVTGAPQQTGSGDCGIFCV

Query:  KFFEYDVTGSNPATLTQERIPFFREKLAIEFDPKKSDNYR
        K+ E    G +   L  E +   R KLA+E   +  +N R
Subjt:  KFFEYDVTGSNPATLTQERIPFFREKLAIEFDPKKSDNYR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTTTCAACGCCTCGCGGATTGTTCCCGTTGAACGCCTTCACCTGCACCTCGCCCGCGAAACCAAGGGTTTTAGTTTGAACAATTATGTGAAATATTTATGTCTTTC
GGGCGAATTAGGCGAGATATCGCCCCGAGATATTATACAGCTTCACATTTCTGTCGGCTGCTTTCACGTTTTATCTCGAGCGCTAAGTGTGATATGTTATTTCTCGGTGG
TATCTCGTCTTGGATTGGCACGAGAAATTGAACATAAATTACGCTCGAGTATGGAATTGAGACCGAAAATTGACCCTGCAATCTATGCATCTGCAAAGGTGTCCTGTTTA
TCGCATCTAGCGAAGACAGTGACTGCTATTAAGGGAAAATTGGCCCCTAGACAGCTATCCATGTTTAGGAAAACCATATTCGGTCATTTGCTGAACGTGGACCTCGTTTT
TAACGGGTCATTGGTACACAATATATTACTTAGGGAGGTTGAGGATAGTACGACGGACAGTATTAGTTTCAACCTGTTTGGGAGGAAAGTGTCGTTCGGACGGAGGGAGT
TTGACCTTATTAGTGGCCTTAAGTATGACGGGAGCCTAGTTAAGAAAGATACTCATGTTCATAGACTTAGGGCTCTGTACTTTAACGATAGGTTTGACCTTGTCTTGAGT
GATTTAGAAGACCTATATGAAGCCGCCCAGTTTCAGGATGACTTTGATGCGGTTAAGGAGGTCTGCTGCAACTACGACTGGGCGTCCCTATCGTTCGAGAAGACGATAGG
TAGTCTTCATCGTGGCCCAACCAAGATGGCAAAGGATGGAGGGTTGAGGAAATCATATAGCCTGTATGGTTTCCCCTGGGTGTTCCAGGTGTGGGCGTACGAGGTGATAT
CTTCCCTATCTGGCCGGGTCGCAAATAAAGTTTTAAGGATGCAGTGCCACGTATCCTCCGATGGAGGTGTGGCCATTCGACTGCATGGCATGTGCTCGACCGAGAGATTT
TTCGGTCTACAACGATTAGAGGCAAGTGATGCTGAGATGAGCTTCATGAGGCGAGCCTTCGAACCACCTGAAGCAGATGACGACGACGCCATGGATGTTGATGCTGGGCA
AGCGGGTGTACGTGAGGGCATAGAGGATCCTGCTGATGGTCGAGGCGATGATGCTCGACCACAGTCAGGTGTATGTGAGGGCCCACAGGAGCCTGATGTGGGCCAAGGTG
ATGCTGTTGGACCATCAGCTGTGCGTGAGGGACGAGGCATCAGTGCGGACATTGCCGATTCTAAAGGTCAGAATAATGTTCGCACATCTAGCATGCGCTTGAGGAAGGTT
GAGAAGCGCTTGAAGAAGATGGACAAGCGTATTGGCGAGCGTATGTCTGGCATGGAGGCTGAATTGAAAGCAATCAGGAAGTATTTGAGGCGACTTGCTAAGAGCTTACC
TATTGACGCGGATGACATGAGGAGAAGAAAAGGTACCGACCCGGGCGGTGGTGCTGGCCCGAGAGATGGTGATGAGCCGGGAGATGGTACAAATTCGAGGGATCGTGGTG
AGCCGAGGGATGGTGGTGGACCGGGAGAGAGCACCGAGGCGGATGCCGGTACTGGGCTGGGAGATCCTATCGAGCAGGATGCCGGTGCTGGGTTGGTTGATTGTACCGCG
CCTAATGTTGGTATTGCAGATGGGTCGGGAGATGTGGGTGCCGGCGGTGATGACAAGATGGTCGATACTGTAAAGAAACATGATGAAGCAGTCGAAGAACTTCCCCCGAC
TCACGATCGGGTCATTCTAGGACCCGACTTGGGGGGACATCCTACACCGAAGCGTTTGCAACACGAATCTGGCAAGGATAAAGTCGAAGAATGTGTCGAGTCAACATCGG
TGGAGCACATTATTATTGATTCGCAATTAACCGATTCCTCAACCGACAAGGAACATTACATGGACGATTTCACGGATTCCGACGCGGAGGAAACGAGGGAGGTTAAATCA
CACGTGGATGGAGATGAGGTTCGTGTGCTATCGCAGCCAGTTGCACCACCGAATCCGCGACGTGGCTCTCGGAAGAGGAAGGCATCGTGGAAGCTCCGTGGATCGTTTGT
GGTCATGGAGAACGGGAGGAAGAAGAAGGCTATCCAGTATGATCCACTAGTACGGATCCCTCCTGAGCAGGTCACCAAGTTCCACAATTGGATGAGTAGCCCTACCACGA
AGCATGCGAAGAGGAAATCGTGCTATGGTAAGAAGGACAAGACGTGGTTTCGTGACCTTTTAACGTCGGACAAGTGGCTGACCAGCGAGCGTCCCGACTTGTGCTCTAGG
AGATTCACGACTGGTGACATATTGCTTGCGAACTTCTTCCGCCGGACCGATGGTATATACCAGAGATTGATCGCCCCCAATGTCGTTCCAGCAATAGTGTCGGCAGAATA
TGACTGGGAGAGTCGATACAAAACGATCATGAGCTATGTCGACGGCACTCACACAAACTATGAGACACGGTGGCTGGACCTCGATGCTGTATACTTGCCGTACAACATCG
GTGGAAATCATTGGATCATGGTTTACATCGACATGCAAGAAGGTGAGATCATCGTATGGGATTCTATGATGGCAATCACAACACCGGCTACTCTGGAGGAGGAGTTGAAG
CCGATGAGCGTCATTCTCCCAGCGTTGATGTGTAGAGCCGGAGTTAGGGTTTTGAGGCCTACCATACCCACTGTACCATGGCGCATCCGTCGAGTAACCGGGGCTCCACA
GCAGACTGGTTCAGGTGATTGTGGTATTTTCTGTGTTAAGTTTTTTGAGTACGATGTAACGGGTTCCAATCCCGCGACTCTAACTCAAGAGAGGATCCCATTTTTTAGGG
AGAAACTCGCCATTGAATTTGATCCCAAAAAGTCAGATAATTATAGGGATGATGACCCATCTACCGTTCAAAGGTTCAACCTCATCAAAGGCTCCAAATCGTTTGTTCAA
ATGCTTCAAGCTACAGGCCACTCCGTTGTCCCGCTTGAGGGCACTAGGTTGACATTCTACTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTTTTCAACGCCTCGCGGATTGTTCCCGTTGAACGCCTTCACCTGCACCTCGCCCGCGAAACCAAGGGTTTTAGTTTGAACAATTATGTGAAATATTTATGTCTTTC
GGGCGAATTAGGCGAGATATCGCCCCGAGATATTATACAGCTTCACATTTCTGTCGGCTGCTTTCACGTTTTATCTCGAGCGCTAAGTGTGATATGTTATTTCTCGGTGG
TATCTCGTCTTGGATTGGCACGAGAAATTGAACATAAATTACGCTCGAGTATGGAATTGAGACCGAAAATTGACCCTGCAATCTATGCATCTGCAAAGGTGTCCTGTTTA
TCGCATCTAGCGAAGACAGTGACTGCTATTAAGGGAAAATTGGCCCCTAGACAGCTATCCATGTTTAGGAAAACCATATTCGGTCATTTGCTGAACGTGGACCTCGTTTT
TAACGGGTCATTGGTACACAATATATTACTTAGGGAGGTTGAGGATAGTACGACGGACAGTATTAGTTTCAACCTGTTTGGGAGGAAAGTGTCGTTCGGACGGAGGGAGT
TTGACCTTATTAGTGGCCTTAAGTATGACGGGAGCCTAGTTAAGAAAGATACTCATGTTCATAGACTTAGGGCTCTGTACTTTAACGATAGGTTTGACCTTGTCTTGAGT
GATTTAGAAGACCTATATGAAGCCGCCCAGTTTCAGGATGACTTTGATGCGGTTAAGGAGGTCTGCTGCAACTACGACTGGGCGTCCCTATCGTTCGAGAAGACGATAGG
TAGTCTTCATCGTGGCCCAACCAAGATGGCAAAGGATGGAGGGTTGAGGAAATCATATAGCCTGTATGGTTTCCCCTGGGTGTTCCAGGTGTGGGCGTACGAGGTGATAT
CTTCCCTATCTGGCCGGGTCGCAAATAAAGTTTTAAGGATGCAGTGCCACGTATCCTCCGATGGAGGTGTGGCCATTCGACTGCATGGCATGTGCTCGACCGAGAGATTT
TTCGGTCTACAACGATTAGAGGCAAGTGATGCTGAGATGAGCTTCATGAGGCGAGCCTTCGAACCACCTGAAGCAGATGACGACGACGCCATGGATGTTGATGCTGGGCA
AGCGGGTGTACGTGAGGGCATAGAGGATCCTGCTGATGGTCGAGGCGATGATGCTCGACCACAGTCAGGTGTATGTGAGGGCCCACAGGAGCCTGATGTGGGCCAAGGTG
ATGCTGTTGGACCATCAGCTGTGCGTGAGGGACGAGGCATCAGTGCGGACATTGCCGATTCTAAAGGTCAGAATAATGTTCGCACATCTAGCATGCGCTTGAGGAAGGTT
GAGAAGCGCTTGAAGAAGATGGACAAGCGTATTGGCGAGCGTATGTCTGGCATGGAGGCTGAATTGAAAGCAATCAGGAAGTATTTGAGGCGACTTGCTAAGAGCTTACC
TATTGACGCGGATGACATGAGGAGAAGAAAAGGTACCGACCCGGGCGGTGGTGCTGGCCCGAGAGATGGTGATGAGCCGGGAGATGGTACAAATTCGAGGGATCGTGGTG
AGCCGAGGGATGGTGGTGGACCGGGAGAGAGCACCGAGGCGGATGCCGGTACTGGGCTGGGAGATCCTATCGAGCAGGATGCCGGTGCTGGGTTGGTTGATTGTACCGCG
CCTAATGTTGGTATTGCAGATGGGTCGGGAGATGTGGGTGCCGGCGGTGATGACAAGATGGTCGATACTGTAAAGAAACATGATGAAGCAGTCGAAGAACTTCCCCCGAC
TCACGATCGGGTCATTCTAGGACCCGACTTGGGGGGACATCCTACACCGAAGCGTTTGCAACACGAATCTGGCAAGGATAAAGTCGAAGAATGTGTCGAGTCAACATCGG
TGGAGCACATTATTATTGATTCGCAATTAACCGATTCCTCAACCGACAAGGAACATTACATGGACGATTTCACGGATTCCGACGCGGAGGAAACGAGGGAGGTTAAATCA
CACGTGGATGGAGATGAGGTTCGTGTGCTATCGCAGCCAGTTGCACCACCGAATCCGCGACGTGGCTCTCGGAAGAGGAAGGCATCGTGGAAGCTCCGTGGATCGTTTGT
GGTCATGGAGAACGGGAGGAAGAAGAAGGCTATCCAGTATGATCCACTAGTACGGATCCCTCCTGAGCAGGTCACCAAGTTCCACAATTGGATGAGTAGCCCTACCACGA
AGCATGCGAAGAGGAAATCGTGCTATGGTAAGAAGGACAAGACGTGGTTTCGTGACCTTTTAACGTCGGACAAGTGGCTGACCAGCGAGCGTCCCGACTTGTGCTCTAGG
AGATTCACGACTGGTGACATATTGCTTGCGAACTTCTTCCGCCGGACCGATGGTATATACCAGAGATTGATCGCCCCCAATGTCGTTCCAGCAATAGTGTCGGCAGAATA
TGACTGGGAGAGTCGATACAAAACGATCATGAGCTATGTCGACGGCACTCACACAAACTATGAGACACGGTGGCTGGACCTCGATGCTGTATACTTGCCGTACAACATCG
GTGGAAATCATTGGATCATGGTTTACATCGACATGCAAGAAGGTGAGATCATCGTATGGGATTCTATGATGGCAATCACAACACCGGCTACTCTGGAGGAGGAGTTGAAG
CCGATGAGCGTCATTCTCCCAGCGTTGATGTGTAGAGCCGGAGTTAGGGTTTTGAGGCCTACCATACCCACTGTACCATGGCGCATCCGTCGAGTAACCGGGGCTCCACA
GCAGACTGGTTCAGGTGATTGTGGTATTTTCTGTGTTAAGTTTTTTGAGTACGATGTAACGGGTTCCAATCCCGCGACTCTAACTCAAGAGAGGATCCCATTTTTTAGGG
AGAAACTCGCCATTGAATTTGATCCCAAAAAGTCAGATAATTATAGGGATGATGACCCATCTACCGTTCAAAGGTTCAACCTCATCAAAGGCTCCAAATCGTTTGTTCAA
ATGCTTCAAGCTACAGGCCACTCCGTTGTCCCGCTTGAGGGCACTAGGTTGACATTCTACTAA
Protein sequenceShow/hide protein sequence
MLFNASRIVPVERLHLHLARETKGFSLNNYVKYLCLSGELGEISPRDIIQLHISVGCFHVLSRALSVICYFSVVSRLGLAREIEHKLRSSMELRPKIDPAIYASAKVSCL
SHLAKTVTAIKGKLAPRQLSMFRKTIFGHLLNVDLVFNGSLVHNILLREVEDSTTDSISFNLFGRKVSFGRREFDLISGLKYDGSLVKKDTHVHRLRALYFNDRFDLVLS
DLEDLYEAAQFQDDFDAVKEVCCNYDWASLSFEKTIGSLHRGPTKMAKDGGLRKSYSLYGFPWVFQVWAYEVISSLSGRVANKVLRMQCHVSSDGGVAIRLHGMCSTERF
FGLQRLEASDAEMSFMRRAFEPPEADDDDAMDVDAGQAGVREGIEDPADGRGDDARPQSGVCEGPQEPDVGQGDAVGPSAVREGRGISADIADSKGQNNVRTSSMRLRKV
EKRLKKMDKRIGERMSGMEAELKAIRKYLRRLAKSLPIDADDMRRRKGTDPGGGAGPRDGDEPGDGTNSRDRGEPRDGGGPGESTEADAGTGLGDPIEQDAGAGLVDCTA
PNVGIADGSGDVGAGGDDKMVDTVKKHDEAVEELPPTHDRVILGPDLGGHPTPKRLQHESGKDKVEECVESTSVEHIIIDSQLTDSSTDKEHYMDDFTDSDAEETREVKS
HVDGDEVRVLSQPVAPPNPRRGSRKRKASWKLRGSFVVMENGRKKKAIQYDPLVRIPPEQVTKFHNWMSSPTTKHAKRKSCYGKKDKTWFRDLLTSDKWLTSERPDLCSR
RFTTGDILLANFFRRTDGIYQRLIAPNVVPAIVSAEYDWESRYKTIMSYVDGTHTNYETRWLDLDAVYLPYNIGGNHWIMVYIDMQEGEIIVWDSMMAITTPATLEEELK
PMSVILPALMCRAGVRVLRPTIPTVPWRIRRVTGAPQQTGSGDCGIFCVKFFEYDVTGSNPATLTQERIPFFREKLAIEFDPKKSDNYRDDDPSTVQRFNLIKGSKSFVQ
MLQATGHSVVPLEGTRLTFY