; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g20740 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g20740
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUlp1-like peptidase
Genome locationchr1:14474733..14477928
RNA-Seq ExpressionMoc01g20740
SyntenyMoc01g20740
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR015410 - Domain of unknown function DUF1985
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022153201.1 uncharacterized protein LOC111020757 [Momordica charantia]3.5e-10650.51Show/hide
Query:  MNMTLKINQDDWFPTALSNLAHVGKTCSRLKTRLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLLKEVEEPRDDLISFNLFGNRVSFGKREFDLITGL
        M++ L I+++DWFP  L+NLAH+ KT +R+K RLTP+QLDMF QTCFGPIL ++VVFNGPL+HHLLL+EVEEPR D+ISF+LFG RVSFGKREFDLITGL
Subjt:  MNMTLKINQDDWFPTALSNLAHVGKTCSRLKTRLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLLKEVEEPRDDLISFNLFGNRVSFGKREFDLITGL

Query:  RHTMNRVDEDVRNRRLRILYFQDKASVKCSELEKIFLEHTFENDEDAVKIAIVYFIELAMMSKERKQKMDTSLLGIVDRWEVFCNYDWSSMIFERTLWSL
         H MNRVD  +  RRLR  YF+D   VKCSELEKIFLE  F +DED VK+ IVYFIELAMM KERKQ +DT+LLG+VDRWEVFCNYDWSSMIF+RT+WSL
Subjt:  RHTMNRVDEDVRNRRLRILYFQDKASVKCSELEKIFLEHTFENDEDAVKIAIVYFIELAMMSKERKQKMDTSLLGIVDRWEVFCNYDWSSMIFERTLWSL

Query:  KNALKDK----------------------------VWAYETISTLSTRVALRVNDDAIPRLLRWSCTYSRVFNVLEREVFENVKVSTTSH-GSRYASTSG
        KNALKDK                            VWAYETISTLS        DDAIPRLLRWSC YS  F VL  EVF+N +     H  +  A    
Subjt:  KNALKDK----------------------------VWAYETISTLSTRVALRVNDDAIPRLLRWSCTYSRVFNVLEREVFENVKVSTTSH-GSRYASTSG

Query:  PCRTSCTTELATEP-------LATTSTAQKSPVTGEVGDPVELDDVAKDASPMVDHVTEDIIGTDGGQDQLLPQKGTEKKKKRSKHK--WSRELRRLGDR
          R     E+   P        A       SP    V DP    DV  +  P+ D V  D    D  +      +G EK+ K++K K   SR L+RL + 
Subjt:  PCRTSCTTELATEP-------LATTSTAQKSPVTGEVGDPVELDDVAKDASPMVDHVTEDIIGTDGGQDQLLPQKGTEKKKKRSKHK--WSRELRRLGDR

Query:  VMAIETTLTGMTTDIKDIKKFMKRLTKVMSKGQNKYDRRGGGPDQDGSS-----------GGR---DPSGRNEEDMDMDEDPKTGEEPKTG
        V AIE  L      +K I+ ++K+L K      +KY   GGGPD DG S           GGR   D   R++ED   DED +T +EP +G
Subjt:  VMAIETTLTGMTTDIKDIKKFMKRLTKVMSKGQNKYDRRGGGPDQDGSS-----------GGR---DPSGRNEEDMDMDEDPKTGEEPKTG

XP_022154965.1 uncharacterized protein LOC111022110 [Momordica charantia]6.9e-14779.67Show/hide
Query:  MMSKERKQKMDTSLLGIVDRWEVFCNYDWSSMIFERTLWSLKNALKDK----------------------------VWAYETISTLSTRVALRVNDDAIP
        MM KERKQKMDTSLLGIVDRWEVFC+YD SSMIFERTLWSLKNALKDK                            VWAYETISTLSTRVALR+NDDAIP
Subjt:  MMSKERKQKMDTSLLGIVDRWEVFCNYDWSSMIFERTLWSLKNALKDK----------------------------VWAYETISTLSTRVALRVNDDAIP

Query:  RLLRWSCTYSRVFNVLEREVFENVKVSTT----------SHGSRYASTS-GPCRTSCTTELATEPLATTSTAQKSPVTGEVGDPVELDDVAKDASPMVDH
        RLLRWSCTYSR FNVLEREVFENVK               H +R       P      TELATEPLATTSTAQKSPVT EVGD VELDDVAKDASP+VD 
Subjt:  RLLRWSCTYSRVFNVLEREVFENVKVSTT----------SHGSRYASTS-GPCRTSCTTELATEPLATTSTAQKSPVTGEVGDPVELDDVAKDASPMVDH

Query:  VTEDIIGTDGGQDQLLPQKGTEKKKKRSKHKWSRELRRLGDRVMAIETTLTGMTTDIKDIKKFMKRLTKVMSKGQNKYDRRGGGPDQDGSSGGRDPSGRN
        VTEDIIGTDGGQDQLLPQKGTEKKKK+SKHKWSRELRRLGDRV AIETTLTGMTTDIKDIKKFMKRLTKVMSKGQNKYDRRGG PDQDGSSGGRDPSGRN
Subjt:  VTEDIIGTDGGQDQLLPQKGTEKKKKRSKHKWSRELRRLGDRVMAIETTLTGMTTDIKDIKKFMKRLTKVMSKGQNKYDRRGGGPDQDGSSGGRDPSGRN

Query:  EEDMDMDEDPKTGEEPKTGDEPRMDEDPKNCEEPADVTESDVEMDHAPTIVGATQEVPSGHPSPVDVIE
        EEDMDMDEDPKTG+EPKTGDEPRMDEDPK CEEP DV ESDVEMDHAPTIVGATQEVPSGH SPVDVIE
Subjt:  EEDMDMDEDPKTGEEPKTGDEPRMDEDPKNCEEPADVTESDVEMDHAPTIVGATQEVPSGHPSPVDVIE

XP_022154967.1 uncharacterized protein LOC111022112 [Momordica charantia]1.5e-9391.44Show/hide
Query:  MNMTLKINQDDWFPTALSNLAHVGKTCSRLKTRLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLLKEVEEPRDDLISFNLFGNRVSFGKREFDLITGL
        M+MT+KINQDDWFP ALSNL+HVGKT SRLK RLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLL+EVEEPRD+LISFNLFGN VSFGKREFDLITGL
Subjt:  MNMTLKINQDDWFPTALSNLAHVGKTCSRLKTRLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLLKEVEEPRDDLISFNLFGNRVSFGKREFDLITGL

Query:  RHTMNRVDEDVRNRRLRILYFQDKASVKCSELEKIFLEHTFENDEDAVKIAIVYFIELAMMSKERKQKMDTSLLGIVDRWEVFCNYD
        RHTMNRVD DV NRRLRILYFQD ASVKC ELEKIFLEHTFENDEDAVKIAIVY+IELAMM KERKQKMDTSLLGIVDRWEV+CNYD
Subjt:  RHTMNRVDEDVRNRRLRILYFQDKASVKCSELEKIFLEHTFENDEDAVKIAIVYFIELAMMSKERKQKMDTSLLGIVDRWEVFCNYD

XP_022154995.1 uncharacterized protein LOC111022139 [Momordica charantia]4.3e-9691.37Show/hide
Query:  MNMTLKINQDDWFPTALSNLAHVGKTCSRLKTRLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLLKEVEEPRDDLISFNLFGNRVSFGKREFDLITGL
        M+MTLKINQDD FP ALSNLAHVGKT SRLK RLTPSQLDMFSQTCFG ILGMN VFN  LLHHLLL+EVEEPRDDLISFNLFGNRVSFGKREFDLITGL
Subjt:  MNMTLKINQDDWFPTALSNLAHVGKTCSRLKTRLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLLKEVEEPRDDLISFNLFGNRVSFGKREFDLITGL

Query:  RHTMNRVDEDVRNRRLRILYFQDKASVKCSELEKIFLEHTFENDEDAVKIAIVYFIELAMMSKERKQKMDTSLLGIVDRWEVFCNYDWSSMIFERTL
        RHTMNRV +DV NRRLRILYFQDKASVKCSELEKIFLEHTF+NDEDAVKIAIVYFIELAMM KERKQKMDTSLLGIVDRWEVFCNYDWSSMI E TL
Subjt:  RHTMNRVDEDVRNRRLRILYFQDKASVKCSELEKIFLEHTFENDEDAVKIAIVYFIELAMMSKERKQKMDTSLLGIVDRWEVFCNYDWSSMIFERTL

XP_022157020.1 uncharacterized protein LOC111023847 [Momordica charantia]7.9e-13586.97Show/hide
Query:  MNMTLKINQDDWFPTALSNLAHVGKTCSRLKTRLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLLKEVEEPRDDLISFNLFGNRVSFGKREFDLITGL
        MNMTLKINQDDWFP ALSNLAHVGKT SRLK RLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLL+EVEEP+DDLISFNLFGNRVSFGKREFDLITGL
Subjt:  MNMTLKINQDDWFPTALSNLAHVGKTCSRLKTRLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLLKEVEEPRDDLISFNLFGNRVSFGKREFDLITGL

Query:  RHTMNRVDEDVRNRRLRILYFQDKASVKCSELEKIFLEHTFENDEDAVKIAIVYFIELAMMSKERKQKMDTSLLGIVDRWEVFCNYDWSSMIFERTLWSL
        RHTMNRVDEDVRNRRLRILYFQDKASVKCSELEKIFLEHTFENDEDAVKIAIVYFIELAMM KERK KMDTSLLGIVDRWEVFCNYDWSSMIFERTLWSL
Subjt:  RHTMNRVDEDVRNRRLRILYFQDKASVKCSELEKIFLEHTFENDEDAVKIAIVYFIELAMMSKERKQKMDTSLLGIVDRWEVFCNYDWSSMIFERTLWSL

Query:  KNALKDK----------------------------VWAYETISTLSTRVALRVNDDAIPRLLRWSCTYSRVFNVLEREVFENVK
        KNALKDK                            VWAYETISTLSTRVALR+NDDAIPRLLRWSCTYSR FNVLEREVFENVK
Subjt:  KNALKDK----------------------------VWAYETISTLSTRVALRVNDDAIPRLLRWSCTYSRVFNVLEREVFENVK

TrEMBL top hitse value%identityAlignment
A0A6J1DJX9 uncharacterized protein LOC1110207571.7e-10650.51Show/hide
Query:  MNMTLKINQDDWFPTALSNLAHVGKTCSRLKTRLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLLKEVEEPRDDLISFNLFGNRVSFGKREFDLITGL
        M++ L I+++DWFP  L+NLAH+ KT +R+K RLTP+QLDMF QTCFGPIL ++VVFNGPL+HHLLL+EVEEPR D+ISF+LFG RVSFGKREFDLITGL
Subjt:  MNMTLKINQDDWFPTALSNLAHVGKTCSRLKTRLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLLKEVEEPRDDLISFNLFGNRVSFGKREFDLITGL

Query:  RHTMNRVDEDVRNRRLRILYFQDKASVKCSELEKIFLEHTFENDEDAVKIAIVYFIELAMMSKERKQKMDTSLLGIVDRWEVFCNYDWSSMIFERTLWSL
         H MNRVD  +  RRLR  YF+D   VKCSELEKIFLE  F +DED VK+ IVYFIELAMM KERKQ +DT+LLG+VDRWEVFCNYDWSSMIF+RT+WSL
Subjt:  RHTMNRVDEDVRNRRLRILYFQDKASVKCSELEKIFLEHTFENDEDAVKIAIVYFIELAMMSKERKQKMDTSLLGIVDRWEVFCNYDWSSMIFERTLWSL

Query:  KNALKDK----------------------------VWAYETISTLSTRVALRVNDDAIPRLLRWSCTYSRVFNVLEREVFENVKVSTTSH-GSRYASTSG
        KNALKDK                            VWAYETISTLS        DDAIPRLLRWSC YS  F VL  EVF+N +     H  +  A    
Subjt:  KNALKDK----------------------------VWAYETISTLSTRVALRVNDDAIPRLLRWSCTYSRVFNVLEREVFENVKVSTTSH-GSRYASTSG

Query:  PCRTSCTTELATEP-------LATTSTAQKSPVTGEVGDPVELDDVAKDASPMVDHVTEDIIGTDGGQDQLLPQKGTEKKKKRSKHK--WSRELRRLGDR
          R     E+   P        A       SP    V DP    DV  +  P+ D V  D    D  +      +G EK+ K++K K   SR L+RL + 
Subjt:  PCRTSCTTELATEP-------LATTSTAQKSPVTGEVGDPVELDDVAKDASPMVDHVTEDIIGTDGGQDQLLPQKGTEKKKKRSKHK--WSRELRRLGDR

Query:  VMAIETTLTGMTTDIKDIKKFMKRLTKVMSKGQNKYDRRGGGPDQDGSS-----------GGR---DPSGRNEEDMDMDEDPKTGEEPKTG
        V AIE  L      +K I+ ++K+L K      +KY   GGGPD DG S           GGR   D   R++ED   DED +T +EP +G
Subjt:  VMAIETTLTGMTTDIKDIKKFMKRLTKVMSKGQNKYDRRGGGPDQDGSS-----------GGR---DPSGRNEEDMDMDEDPKTGEEPKTG

A0A6J1DL40 uncharacterized protein LOC1110221103.4e-14779.67Show/hide
Query:  MMSKERKQKMDTSLLGIVDRWEVFCNYDWSSMIFERTLWSLKNALKDK----------------------------VWAYETISTLSTRVALRVNDDAIP
        MM KERKQKMDTSLLGIVDRWEVFC+YD SSMIFERTLWSLKNALKDK                            VWAYETISTLSTRVALR+NDDAIP
Subjt:  MMSKERKQKMDTSLLGIVDRWEVFCNYDWSSMIFERTLWSLKNALKDK----------------------------VWAYETISTLSTRVALRVNDDAIP

Query:  RLLRWSCTYSRVFNVLEREVFENVKVSTT----------SHGSRYASTS-GPCRTSCTTELATEPLATTSTAQKSPVTGEVGDPVELDDVAKDASPMVDH
        RLLRWSCTYSR FNVLEREVFENVK               H +R       P      TELATEPLATTSTAQKSPVT EVGD VELDDVAKDASP+VD 
Subjt:  RLLRWSCTYSRVFNVLEREVFENVKVSTT----------SHGSRYASTS-GPCRTSCTTELATEPLATTSTAQKSPVTGEVGDPVELDDVAKDASPMVDH

Query:  VTEDIIGTDGGQDQLLPQKGTEKKKKRSKHKWSRELRRLGDRVMAIETTLTGMTTDIKDIKKFMKRLTKVMSKGQNKYDRRGGGPDQDGSSGGRDPSGRN
        VTEDIIGTDGGQDQLLPQKGTEKKKK+SKHKWSRELRRLGDRV AIETTLTGMTTDIKDIKKFMKRLTKVMSKGQNKYDRRGG PDQDGSSGGRDPSGRN
Subjt:  VTEDIIGTDGGQDQLLPQKGTEKKKKRSKHKWSRELRRLGDRVMAIETTLTGMTTDIKDIKKFMKRLTKVMSKGQNKYDRRGGGPDQDGSSGGRDPSGRN

Query:  EEDMDMDEDPKTGEEPKTGDEPRMDEDPKNCEEPADVTESDVEMDHAPTIVGATQEVPSGHPSPVDVIE
        EEDMDMDEDPKTG+EPKTGDEPRMDEDPK CEEP DV ESDVEMDHAPTIVGATQEVPSGH SPVDVIE
Subjt:  EEDMDMDEDPKTGEEPKTGDEPRMDEDPKNCEEPADVTESDVEMDHAPTIVGATQEVPSGHPSPVDVIE

A0A6J1DL69 uncharacterized protein LOC1110221392.1e-9691.37Show/hide
Query:  MNMTLKINQDDWFPTALSNLAHVGKTCSRLKTRLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLLKEVEEPRDDLISFNLFGNRVSFGKREFDLITGL
        M+MTLKINQDD FP ALSNLAHVGKT SRLK RLTPSQLDMFSQTCFG ILGMN VFN  LLHHLLL+EVEEPRDDLISFNLFGNRVSFGKREFDLITGL
Subjt:  MNMTLKINQDDWFPTALSNLAHVGKTCSRLKTRLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLLKEVEEPRDDLISFNLFGNRVSFGKREFDLITGL

Query:  RHTMNRVDEDVRNRRLRILYFQDKASVKCSELEKIFLEHTFENDEDAVKIAIVYFIELAMMSKERKQKMDTSLLGIVDRWEVFCNYDWSSMIFERTL
        RHTMNRV +DV NRRLRILYFQDKASVKCSELEKIFLEHTF+NDEDAVKIAIVYFIELAMM KERKQKMDTSLLGIVDRWEVFCNYDWSSMI E TL
Subjt:  RHTMNRVDEDVRNRRLRILYFQDKASVKCSELEKIFLEHTFENDEDAVKIAIVYFIELAMMSKERKQKMDTSLLGIVDRWEVFCNYDWSSMIFERTL

A0A6J1DN46 uncharacterized protein LOC1110221124.3e-9491.98Show/hide
Query:  MNMTLKINQDDWFPTALSNLAHVGKTCSRLKTRLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLLKEVEEPRDDLISFNLFGNRVSFGKREFDLITGL
        M+MT+KINQDDWFP ALSNL+HVGKT SRLK RLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLL+EVEEPRDDLISFNLFGN VSFGKREFDLITGL
Subjt:  MNMTLKINQDDWFPTALSNLAHVGKTCSRLKTRLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLLKEVEEPRDDLISFNLFGNRVSFGKREFDLITGL

Query:  RHTMNRVDEDVRNRRLRILYFQDKASVKCSELEKIFLEHTFENDEDAVKIAIVYFIELAMMSKERKQKMDTSLLGIVDRWEVFCNYD
        RHTMNRVD DV NRRLRILYFQD ASVKC ELEKIFLEHTFENDEDAVKIAIVY+IELAMM KERKQKMDTSLLGIVDRWEV+CNYD
Subjt:  RHTMNRVDEDVRNRRLRILYFQDKASVKCSELEKIFLEHTFENDEDAVKIAIVYFIELAMMSKERKQKMDTSLLGIVDRWEVFCNYD

A0A6J1DRZ7 uncharacterized protein LOC1110238473.8e-13586.97Show/hide
Query:  MNMTLKINQDDWFPTALSNLAHVGKTCSRLKTRLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLLKEVEEPRDDLISFNLFGNRVSFGKREFDLITGL
        MNMTLKINQDDWFP ALSNLAHVGKT SRLK RLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLL+EVEEP+DDLISFNLFGNRVSFGKREFDLITGL
Subjt:  MNMTLKINQDDWFPTALSNLAHVGKTCSRLKTRLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLLKEVEEPRDDLISFNLFGNRVSFGKREFDLITGL

Query:  RHTMNRVDEDVRNRRLRILYFQDKASVKCSELEKIFLEHTFENDEDAVKIAIVYFIELAMMSKERKQKMDTSLLGIVDRWEVFCNYDWSSMIFERTLWSL
        RHTMNRVDEDVRNRRLRILYFQDKASVKCSELEKIFLEHTFENDEDAVKIAIVYFIELAMM KERK KMDTSLLGIVDRWEVFCNYDWSSMIFERTLWSL
Subjt:  RHTMNRVDEDVRNRRLRILYFQDKASVKCSELEKIFLEHTFENDEDAVKIAIVYFIELAMMSKERKQKMDTSLLGIVDRWEVFCNYDWSSMIFERTLWSL

Query:  KNALKDK----------------------------VWAYETISTLSTRVALRVNDDAIPRLLRWSCTYSRVFNVLEREVFENVK
        KNALKDK                            VWAYETISTLSTRVALR+NDDAIPRLLRWSCTYSR FNVLEREVFENVK
Subjt:  KNALKDK----------------------------VWAYETISTLSTRVALRVNDDAIPRLLRWSCTYSRVFNVLEREVFENVK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATATGACACTTAAGATCAACCAAGACGACTGGTTCCCGACCGCGCTGTCAAATCTCGCTCACGTAGGGAAAACCTGTTCTCGTCTTAAGACTAGGTTAACTCCCTC
TCAGTTAGACATGTTCAGTCAAACATGTTTTGGTCCGATTTTAGGGATGAACGTTGTATTTAACGGTCCGTTGCTCCATCACCTGTTGCTTAAAGAGGTGGAGGAACCTA
GAGACGACCTCATTAGCTTTAACCTATTCGGGAATAGGGTCTCTTTTGGGAAGCGGGAGTTCGACCTAATAACCGGTCTTAGACACACCATGAATAGGGTAGATGAGGAT
GTTCGTAACCGGAGACTTAGAATTCTGTATTTTCAAGACAAGGCGAGTGTGAAGTGTTCGGAGTTGGAGAAAATTTTTTTAGAACACACATTCGAAAATGACGAGGACGC
TGTGAAGATTGCTATAGTGTACTTCATAGAGCTTGCCATGATGAGCAAGGAAAGGAAGCAGAAAATGGACACGAGCCTCCTTGGGATTGTGGATCGGTGGGAAGTTTTCT
GTAATTATGACTGGAGTTCAATGATTTTTGAAAGGACTCTCTGGAGCTTGAAGAACGCTCTGAAGGACAAGGTTTGGGCATACGAGACAATATCAACCTTGTCGACTCGA
GTAGCATTGAGGGTGAATGACGATGCTATTCCTCGTTTACTTAGATGGTCCTGCACCTATTCACGTGTTTTTAATGTTTTGGAGCGAGAGGTCTTCGAGAACGTCAAGGT
GAGTACAACATCACATGGCTCGCGTTATGCATCCACCAGTGGCCCCTGTCGGACCTCCTGCACCACAGAACTTGCTACAGAACCACTGGCTACTACTTCCACCGCTCAGA
AGTCTCCCGTTACTGGTGAGGTTGGGGATCCAGTTGAGCTCGATGATGTAGCAAAGGATGCTTCCCCAATGGTTGATCATGTAACAGAAGATATTATTGGGACCGATGGA
GGACAAGATCAATTGTTGCCACAGAAAGGGACGGAGAAGAAGAAGAAGAGGTCGAAGCATAAGTGGAGTCGGGAGCTGCGGAGGCTCGGCGACAGAGTGATGGCCATTGA
GACAACTCTGACGGGCATGACGACTGACATAAAGGACATAAAGAAGTTTATGAAGAGGCTAACAAAGGTTATGTCAAAGGGCCAGAATAAATATGATAGAAGGGGCGGTG
GGCCGGATCAAGATGGTTCTTCGGGCGGACGTGATCCGAGTGGGCGTAACGAGGAGGATATGGACATGGATGAGGATCCGAAGACAGGGGAAGAGCCGAAGACAGGGGAC
GAGCCGAGGATGGACGAGGATCCGAAGAATTGTGAAGAACCCGCCGACGTCACCGAGAGTGACGTGGAGATGGATCACGCTCCTACCATTGTTGGAGCTACCCAGGAGGT
CCCAAGTGGCCACCCTAGTCCGGTCGACGTAATTGAGGATCTTACTCTAGGTAAGTGCGCCAGTGACGGGGAGGCAAGTAAGGGGCAGCTGGTTAACGTACCGACACCGC
AACCCGCAGGGCCACCGAGAAAGCAAACTGATAGGACAGAAAGTCGACCCCTACCCTTATCACATGGAGGAACTCCACACCTAACCGTTGTTAAGGGTCTTCGGAAGAGG
AAATATCCGTGGAAGTTGCGGGCCATATACACGCCCACCGGCCAACGTGGTATCAAAGTTCAAGCGTACGACCCTACATGCCCCATCCCACCGCTCCTGGACGAGGGGTT
CCAGAAATGGATGGACGACCCATCAACCGACGGCAATTCCCGTTCAACGTCCGTCGGGATCAAATACAAGAGTTGGTTTGGCCTGCTCCTGGATCCTGAGTTTCAACTCA
ACGACGAGGTTGAGAAGTGTAAACATCTATTGCGCGTGCGATTCGCAATAGGCGACGTACTTCTATCCGTAAACCTTGCTACGACGTACAGACGGGCCATATGCAGCTAT
GAAGCCGGGTGTCCTACCGTCGAAATGTACGTCATGATCGGGATTGATCTTGTGGAGGGTGATTTAACCGTATGGGACTCACTGCAGTCGATCACCCCGTTGGATGATCT
CGAGAAGGCGCTCAAGCCAATGTGCACGATAATCCCGGCGATTCTTCATTGGAGCGGGATGCTCGCAATTCGGCCTAACCTGCCCACGGTGTCGTGGAGGGTCCGAAGAC
GTATTGTACCTCAGCAAGCCGGGTTCACAGATTGCGACATATTTTGTGTTAGATTTTTCGAGTACGATGTAACTGGGTCAAAGATGGACACTTTGACTCAAAGTAACGTT
TCTTTATTTCGTCGTCAATATGCTGTACAAATGTGGGCTCGCAGACCCTTTTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAATATGACACTTAAGATCAACCAAGACGACTGGTTCCCGACCGCGCTGTCAAATCTCGCTCACGTAGGGAAAACCTGTTCTCGTCTTAAGACTAGGTTAACTCCCTC
TCAGTTAGACATGTTCAGTCAAACATGTTTTGGTCCGATTTTAGGGATGAACGTTGTATTTAACGGTCCGTTGCTCCATCACCTGTTGCTTAAAGAGGTGGAGGAACCTA
GAGACGACCTCATTAGCTTTAACCTATTCGGGAATAGGGTCTCTTTTGGGAAGCGGGAGTTCGACCTAATAACCGGTCTTAGACACACCATGAATAGGGTAGATGAGGAT
GTTCGTAACCGGAGACTTAGAATTCTGTATTTTCAAGACAAGGCGAGTGTGAAGTGTTCGGAGTTGGAGAAAATTTTTTTAGAACACACATTCGAAAATGACGAGGACGC
TGTGAAGATTGCTATAGTGTACTTCATAGAGCTTGCCATGATGAGCAAGGAAAGGAAGCAGAAAATGGACACGAGCCTCCTTGGGATTGTGGATCGGTGGGAAGTTTTCT
GTAATTATGACTGGAGTTCAATGATTTTTGAAAGGACTCTCTGGAGCTTGAAGAACGCTCTGAAGGACAAGGTTTGGGCATACGAGACAATATCAACCTTGTCGACTCGA
GTAGCATTGAGGGTGAATGACGATGCTATTCCTCGTTTACTTAGATGGTCCTGCACCTATTCACGTGTTTTTAATGTTTTGGAGCGAGAGGTCTTCGAGAACGTCAAGGT
GAGTACAACATCACATGGCTCGCGTTATGCATCCACCAGTGGCCCCTGTCGGACCTCCTGCACCACAGAACTTGCTACAGAACCACTGGCTACTACTTCCACCGCTCAGA
AGTCTCCCGTTACTGGTGAGGTTGGGGATCCAGTTGAGCTCGATGATGTAGCAAAGGATGCTTCCCCAATGGTTGATCATGTAACAGAAGATATTATTGGGACCGATGGA
GGACAAGATCAATTGTTGCCACAGAAAGGGACGGAGAAGAAGAAGAAGAGGTCGAAGCATAAGTGGAGTCGGGAGCTGCGGAGGCTCGGCGACAGAGTGATGGCCATTGA
GACAACTCTGACGGGCATGACGACTGACATAAAGGACATAAAGAAGTTTATGAAGAGGCTAACAAAGGTTATGTCAAAGGGCCAGAATAAATATGATAGAAGGGGCGGTG
GGCCGGATCAAGATGGTTCTTCGGGCGGACGTGATCCGAGTGGGCGTAACGAGGAGGATATGGACATGGATGAGGATCCGAAGACAGGGGAAGAGCCGAAGACAGGGGAC
GAGCCGAGGATGGACGAGGATCCGAAGAATTGTGAAGAACCCGCCGACGTCACCGAGAGTGACGTGGAGATGGATCACGCTCCTACCATTGTTGGAGCTACCCAGGAGGT
CCCAAGTGGCCACCCTAGTCCGGTCGACGTAATTGAGGATCTTACTCTAGGTAAGTGCGCCAGTGACGGGGAGGCAAGTAAGGGGCAGCTGGTTAACGTACCGACACCGC
AACCCGCAGGGCCACCGAGAAAGCAAACTGATAGGACAGAAAGTCGACCCCTACCCTTATCACATGGAGGAACTCCACACCTAACCGTTGTTAAGGGTCTTCGGAAGAGG
AAATATCCGTGGAAGTTGCGGGCCATATACACGCCCACCGGCCAACGTGGTATCAAAGTTCAAGCGTACGACCCTACATGCCCCATCCCACCGCTCCTGGACGAGGGGTT
CCAGAAATGGATGGACGACCCATCAACCGACGGCAATTCCCGTTCAACGTCCGTCGGGATCAAATACAAGAGTTGGTTTGGCCTGCTCCTGGATCCTGAGTTTCAACTCA
ACGACGAGGTTGAGAAGTGTAAACATCTATTGCGCGTGCGATTCGCAATAGGCGACGTACTTCTATCCGTAAACCTTGCTACGACGTACAGACGGGCCATATGCAGCTAT
GAAGCCGGGTGTCCTACCGTCGAAATGTACGTCATGATCGGGATTGATCTTGTGGAGGGTGATTTAACCGTATGGGACTCACTGCAGTCGATCACCCCGTTGGATGATCT
CGAGAAGGCGCTCAAGCCAATGTGCACGATAATCCCGGCGATTCTTCATTGGAGCGGGATGCTCGCAATTCGGCCTAACCTGCCCACGGTGTCGTGGAGGGTCCGAAGAC
GTATTGTACCTCAGCAAGCCGGGTTCACAGATTGCGACATATTTTGTGTTAGATTTTTCGAGTACGATGTAACTGGGTCAAAGATGGACACTTTGACTCAAAGTAACGTT
TCTTTATTTCGTCGTCAATATGCTGTACAAATGTGGGCTCGCAGACCCTTTTTTTAG
Protein sequenceShow/hide protein sequence
MNMTLKINQDDWFPTALSNLAHVGKTCSRLKTRLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLLKEVEEPRDDLISFNLFGNRVSFGKREFDLITGLRHTMNRVDED
VRNRRLRILYFQDKASVKCSELEKIFLEHTFENDEDAVKIAIVYFIELAMMSKERKQKMDTSLLGIVDRWEVFCNYDWSSMIFERTLWSLKNALKDKVWAYETISTLSTR
VALRVNDDAIPRLLRWSCTYSRVFNVLEREVFENVKVSTTSHGSRYASTSGPCRTSCTTELATEPLATTSTAQKSPVTGEVGDPVELDDVAKDASPMVDHVTEDIIGTDG
GQDQLLPQKGTEKKKKRSKHKWSRELRRLGDRVMAIETTLTGMTTDIKDIKKFMKRLTKVMSKGQNKYDRRGGGPDQDGSSGGRDPSGRNEEDMDMDEDPKTGEEPKTGD
EPRMDEDPKNCEEPADVTESDVEMDHAPTIVGATQEVPSGHPSPVDVIEDLTLGKCASDGEASKGQLVNVPTPQPAGPPRKQTDRTESRPLPLSHGGTPHLTVVKGLRKR
KYPWKLRAIYTPTGQRGIKVQAYDPTCPIPPLLDEGFQKWMDDPSTDGNSRSTSVGIKYKSWFGLLLDPEFQLNDEVEKCKHLLRVRFAIGDVLLSVNLATTYRRAICSY
EAGCPTVEMYVMIGIDLVEGDLTVWDSLQSITPLDDLEKALKPMCTIIPAILHWSGMLAIRPNLPTVSWRVRRRIVPQQAGFTDCDIFCVRFFEYDVTGSKMDTLTQSNV
SLFRRQYAVQMWARRPFF