; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g14330 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g14330
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionULP_PROTEASE domain-containing protein
Genome locationchr3:9654289..9661847
RNA-Seq ExpressionMoc03g14330
SyntenyMoc03g14330
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025510.1 uncharacterized protein E6C27_scaffold417G001180 [Cucumis melo var. makuwa]7.0e-4731.45Show/hide
Query:  KNGHKLSSWIGTCVRHHVPIIYDHWKLVPIETKDKIYELVKGGFVVDRRAKKDILSCVSTLFRQYKWRLTATYINPYRDDPTKLKRIAG--NACVQPH--
        +N  KL S+IGT VR HVPIIY  W  VP E KDKI+EL++ GFVVD R+KK I+      FRQ+K+RLT TY+ P+ DD  KL   +   ++  Q H  
Subjt:  KNGHKLSSWIGTCVRHHVPIIYDHWKLVPIETKDKIYELVKGGFVVDRRAKKDILSCVSTLFRQYKWRLTATYINPYRDDPTKLKRIAG--NACVQPH--

Query:  -VVMKGVRESLRRIGPEGS--------RHAMKKARMDKDGNINEECTRVVVERMEEIKVTQCQDGDNNSPEEDALSFPLGTQDHPGRLTSVGKYITKTQY
              ++E  ++    G          H   +  MD+ G I +E T+ VV  ++E+  T  Q+  N   EED L+  LG +D PG L  VGKY+TK +Y
Subjt:  -VVMKGVRESLRRIGPEGS--------RHAMKKARMDKDGNINEECTRVVVERMEEIKVTQCQDGDNNSPEEDALSFPLGTQDHPGRLTSVGKYITKTQY

Query:  YHKPRKRSTSKFEDEKVCLNLEDRWETMKREKEE---KMKEGEKGEDSKEGEKTNNIREVEIGKKEEKIKKGEIGGEAMEGEKG-DDVFE----------
        +H   ++ T++ ED+K      DR     +E EE   KMKE +    + + E     +E    +  E +   E     +E EK  +DV E          
Subjt:  YHKPRKRSTSKFEDEKVCLNLEDRWETMKREKEE---KMKEGEKGEDSKEGEKTNNIREVEIGKKEEKIKKGEIGGEAMEGEKG-DDVFE----------

Query:  --------VKTREEKEGDSCQLSFGSIDNIVALVTIF-----GGNVKE-------------------------------------VATREEVLELYPFEY
                V   + K G  C+L+F   D++VA  TI      G NVK                                      V T    ++   F  
Subjt:  --------VKTREEKEGDSCQLSFGSIDNIVALVTIF-----GGNVKE-------------------------------------VATREEVLELYPFEY

Query:  DRTT---------PIALRCLLREMKMCKSQVKLPVTEEVFGCKRICKVWTDVVQTFYEMKPITDPCMDAYIIYYKNLVTSHWILIIIDYSKTMVYSINPL
        D +T         P+AL  LLR ++   S +++    +VFG +R C +  + ++ F  M+PI   C+DAYI+       +HW L++I+ +K   + I+PL
Subjt:  DRTT---------PIALRCLLREMKMCKSQVKLPVTEEVFGCKRICKVWTDVVQTFYEMKPITDPCMDAYIIYYKNLVTSHWILIIIDYSKTMVYSINPL

Query:  RNRLDNDIMDVV
        +NR+D D+++VV
Subjt:  RNRLDNDIMDVV

TYK00356.1 uncharacterized protein E5676_scaffold1112G00200 [Cucumis melo var. makuwa]8.3e-4831.64Show/hide
Query:  KNGHKLSSWIGTCVRHHVPIIYDHWKLVPIETKDKIYELVKGGFVVDRRAKKDILSCVSTLFRQYKWRLTATYINPYRDDPTKLKRIAG--NACVQPH--
        +N  KL S+IGT VR HVPIIY  W  VP E KDKI+EL++ GFVVD R+KK I+      FRQ+K+RLT TY+ P+ DD  KL   +   ++  Q H  
Subjt:  KNGHKLSSWIGTCVRHHVPIIYDHWKLVPIETKDKIYELVKGGFVVDRRAKKDILSCVSTLFRQYKWRLTATYINPYRDDPTKLKRIAG--NACVQPH--

Query:  -VVMKGVRESLRRIGPEGS--------RHAMKKARMDKDGNINEECTRVVVERMEEIKVTQCQDGDNNSPEEDALSFPLGTQDHPGRLTSVGKYITKTQY
              ++E  ++    G          H   +  MD+ G I +E T+ VV  ++E+  T  Q+  N   EED L+  LG +D PG L  VGKY+TK +Y
Subjt:  -VVMKGVRESLRRIGPEGS--------RHAMKKARMDKDGNINEECTRVVVERMEEIKVTQCQDGDNNSPEEDALSFPLGTQDHPGRLTSVGKYITKTQY

Query:  YHKPRKRSTSKFEDEKVCLNLEDRWETMKREKEE---KMKEGEKGEDSKEGEKTNNIREVEIGKKEEKIKKGEIGGEAMEGEKG-DDVFE----------
        +H   ++ T++ ED+K      DR     +E EE   KMKE +    + + E     +E    +  E +   E     +E EK  +DV E          
Subjt:  YHKPRKRSTSKFEDEKVCLNLEDRWETMKREKEE---KMKEGEKGEDSKEGEKTNNIREVEIGKKEEKIKKGEIGGEAMEGEKG-DDVFE----------

Query:  --------VKTREEKEGDSCQLSFGSIDNIVALVTIF-----GGNVKE-------------------------------------VATREEVLELYPFEY
                V   + K G  C+L+F   D++VA  TI      G NVK                                      V T    ++   F  
Subjt:  --------VKTREEKEGDSCQLSFGSIDNIVALVTIF-----GGNVKE-------------------------------------VATREEVLELYPFEY

Query:  DRTT---------PIALRCLLREMKMCKSQVKLPVTEEVFGCKRICKVWTDVVQTFYEMKPITDPCMDAYIIYYKNLVTSHWILIIIDYSKTMVYSINPL
        D +T         P+ALR LLR ++   S +++    +VFG +R C +  + ++ F  M+PI   C+DAYI+       +HW L++I+ +K   + I+PL
Subjt:  DRTT---------PIALRCLLREMKMCKSQVKLPVTEEVFGCKRICKVWTDVVQTFYEMKPITDPCMDAYIIYYKNLVTSHWILIIIDYSKTMVYSINPL

Query:  RNRLDNDIMDVV
        +NR+D D+++VV
Subjt:  RNRLDNDIMDVV

TYK08419.1 uncharacterized protein E5676_scaffold654G00340 [Cucumis melo var. makuwa]8.0e-4329.3Show/hide
Query:  KNGHKLSSWIGTCVRHHVPIIYDHWKLVPIETKDKIYELVKGGFVVDRRAKKDILSCVSTLFRQYKWRLTATYINPYRDDPTKLKRIAGNACV--QPH--
        +N  KL S+IGT VR HVPIIY  W  VP E KDKI+EL++ GFVVD R+KK I+      FRQ+K+RLT TY+ P+ DD  KLK          Q H  
Subjt:  KNGHKLSSWIGTCVRHHVPIIYDHWKLVPIETKDKIYELVKGGFVVDRRAKKDILSCVSTLFRQYKWRLTATYINPYRDDPTKLKRIAGNACV--QPH--

Query:  -VVMKGVRESLRRIGPEG----SRH------------------------------AMKKARMDKDGNINEECTRVVVERMEEIKVTQCQDGDNNSPEEDA
          V   ++E  ++    G     +H                                K+ARMD+ G I +E T+ VV  ++E+  T  Q+  N   EED 
Subjt:  -VVMKGVRESLRRIGPEG----SRH------------------------------AMKKARMDKDGNINEECTRVVVERMEEIKVTQCQDGDNNSPEEDA

Query:  LSFPLGTQDHPGRLTSVGKYITKTQYYHKPRKRSTSKFEDEKVCLNLEDRWETMKREKEE---KMKEGEKGEDSKEGEKTNNIREVEIGKKEEKIKKGEI
        L+  LG +D PG L  VGKY+TK +Y+H   ++ T++ ED+K      DR     +E EE   KMKE +    + + E     +E    + +E +   E 
Subjt:  LSFPLGTQDHPGRLTSVGKYITKTQYYHKPRKRSTSKFEDEKVCLNLEDRWETMKREKEE---KMKEGEKGEDSKEGEKTNNIREVEIGKKEEKIKKGEI

Query:  GGEAMEGEKG-DDVFE------------------VKTREEKEGDSCQLSFGSIDNIVALVTIF-----GGNVKE--------------------------
            +E EK  +DV E                  V   + K G  C+L+F + D++VA  TI      G NVK                           
Subjt:  GGEAMEGEKG-DDVFE------------------VKTREEKEGDSCQLSFGSIDNIVALVTIF-----GGNVKE--------------------------

Query:  -----------VATREEVLELYPFEYDRTT---------PIALRCLLREMKMCKSQVKLPVTEEVFGCKRICKVWTDVVQTFYEMKPITDPCMDAYIIY-
                   V T    ++   F  D +T         P+ALR LLR ++   S +++    +VFG +R C +  + ++ F  M+PI   C+DAYI+Y 
Subjt:  -----------VATREEVLELYPFEYDRTT---------PIALRCLLREMKMCKSQVKLPVTEEVFGCKRICKVWTDVVQTFYEMKPITDPCMDAYIIY-

Query:  ------------YK-----------------NLVT-------------------SHWILIIIDYSKTMVYSINPLRNRLDNDIMDVV
                    YK                  L+T                   +HW L++I+ +K   + I+PL+NR+D D+ +VV
Subjt:  ------------YK-----------------NLVT-------------------SHWILIIIDYSKTMVYSINPLRNRLDNDIMDVV

XP_022156813.1 uncharacterized protein LOC111023653 [Momordica charantia]5.6e-4460.81Show/hide
Query:  CGNLIYKDIGTVRDHLYSNGIDQSYRVWFWHGEDLGNITVEQTYNNDALKSHEDDLDFFEVGGLVQYVQDEFSDVPKSFETIFHNAKYPLYPGCGKFTNL
        CG  I KD  T+R+HLY +GIDQSYRVWFWHGE+    T E   N++  K+HEDD D F+V  +VQ V DE S VPKSFE +F NAK PLYPGC KFT L
Subjt:  CGNLIYKDIGTVRDHLYSNGIDQSYRVWFWHGEDLGNITVEQTYNNDALKSHEDDLDFFEVGGLVQYVQDEFSDVPKSFETIFHNAKYPLYPGCGKFTNL

Query:  STLLRLYNLKVRFGWSNSSLSELLSITSDILPEPNEMPNSIWRKAKSI
        S L+RLYNLKVR+ W+NSS SELLS+ SD LPE  EMP S++   K++
Subjt:  STLLRLYNLKVRFGWSNSSLSELLSITSDILPEPNEMPNSIWRKAKSI

XP_022159378.1 uncharacterized protein LOC111025795 [Momordica charantia]2.2e-6481.76Show/hide
Query:  CGNLIYKDIGTVRDHLYSNGIDQSYRVWFWHGEDLGNITVEQTYNNDALKSHEDDLDFFEVGGLVQYVQDEFSDVPKSFETIFHNAKYPLYPGCGKFTNL
        CGNLI KDI  VRDHLYSNGIDQSY VWFWHGEDLGNI  EQTYNNDAL+S EDDLDFFEVG LVQYVQDEFSDVPKSFET+FHNAK PLYPGCGKFT +
Subjt:  CGNLIYKDIGTVRDHLYSNGIDQSYRVWFWHGEDLGNITVEQTYNNDALKSHEDDLDFFEVGGLVQYVQDEFSDVPKSFETIFHNAKYPLYPGCGKFTNL

Query:  STLLRLYNLKVRFGWSNSSLSELLSITSDILPEPNEMPNSIWRKAKSI
        S L+RLYNLKVRFGWSNSS SELLSI SDILP+PNEM  S++   K++
Subjt:  STLLRLYNLKVRFGWSNSSLSELLSITSDILPEPNEMPNSIWRKAKSI

TrEMBL top hitse value%identityAlignment
A0A5A7SH89 ULP_PROTEASE domain-containing protein3.4e-4731.45Show/hide
Query:  KNGHKLSSWIGTCVRHHVPIIYDHWKLVPIETKDKIYELVKGGFVVDRRAKKDILSCVSTLFRQYKWRLTATYINPYRDDPTKLKRIAG--NACVQPH--
        +N  KL S+IGT VR HVPIIY  W  VP E KDKI+EL++ GFVVD R+KK I+      FRQ+K+RLT TY+ P+ DD  KL   +   ++  Q H  
Subjt:  KNGHKLSSWIGTCVRHHVPIIYDHWKLVPIETKDKIYELVKGGFVVDRRAKKDILSCVSTLFRQYKWRLTATYINPYRDDPTKLKRIAG--NACVQPH--

Query:  -VVMKGVRESLRRIGPEGS--------RHAMKKARMDKDGNINEECTRVVVERMEEIKVTQCQDGDNNSPEEDALSFPLGTQDHPGRLTSVGKYITKTQY
              ++E  ++    G          H   +  MD+ G I +E T+ VV  ++E+  T  Q+  N   EED L+  LG +D PG L  VGKY+TK +Y
Subjt:  -VVMKGVRESLRRIGPEGS--------RHAMKKARMDKDGNINEECTRVVVERMEEIKVTQCQDGDNNSPEEDALSFPLGTQDHPGRLTSVGKYITKTQY

Query:  YHKPRKRSTSKFEDEKVCLNLEDRWETMKREKEE---KMKEGEKGEDSKEGEKTNNIREVEIGKKEEKIKKGEIGGEAMEGEKG-DDVFE----------
        +H   ++ T++ ED+K      DR     +E EE   KMKE +    + + E     +E    +  E +   E     +E EK  +DV E          
Subjt:  YHKPRKRSTSKFEDEKVCLNLEDRWETMKREKEE---KMKEGEKGEDSKEGEKTNNIREVEIGKKEEKIKKGEIGGEAMEGEKG-DDVFE----------

Query:  --------VKTREEKEGDSCQLSFGSIDNIVALVTIF-----GGNVKE-------------------------------------VATREEVLELYPFEY
                V   + K G  C+L+F   D++VA  TI      G NVK                                      V T    ++   F  
Subjt:  --------VKTREEKEGDSCQLSFGSIDNIVALVTIF-----GGNVKE-------------------------------------VATREEVLELYPFEY

Query:  DRTT---------PIALRCLLREMKMCKSQVKLPVTEEVFGCKRICKVWTDVVQTFYEMKPITDPCMDAYIIYYKNLVTSHWILIIIDYSKTMVYSINPL
        D +T         P+AL  LLR ++   S +++    +VFG +R C +  + ++ F  M+PI   C+DAYI+       +HW L++I+ +K   + I+PL
Subjt:  DRTT---------PIALRCLLREMKMCKSQVKLPVTEEVFGCKRICKVWTDVVQTFYEMKPITDPCMDAYIIYYKNLVTSHWILIIIDYSKTMVYSINPL

Query:  RNRLDNDIMDVV
        +NR+D D+++VV
Subjt:  RNRLDNDIMDVV

A0A5D3BQ34 ULP_PROTEASE domain-containing protein4.0e-4831.64Show/hide
Query:  KNGHKLSSWIGTCVRHHVPIIYDHWKLVPIETKDKIYELVKGGFVVDRRAKKDILSCVSTLFRQYKWRLTATYINPYRDDPTKLKRIAG--NACVQPH--
        +N  KL S+IGT VR HVPIIY  W  VP E KDKI+EL++ GFVVD R+KK I+      FRQ+K+RLT TY+ P+ DD  KL   +   ++  Q H  
Subjt:  KNGHKLSSWIGTCVRHHVPIIYDHWKLVPIETKDKIYELVKGGFVVDRRAKKDILSCVSTLFRQYKWRLTATYINPYRDDPTKLKRIAG--NACVQPH--

Query:  -VVMKGVRESLRRIGPEGS--------RHAMKKARMDKDGNINEECTRVVVERMEEIKVTQCQDGDNNSPEEDALSFPLGTQDHPGRLTSVGKYITKTQY
              ++E  ++    G          H   +  MD+ G I +E T+ VV  ++E+  T  Q+  N   EED L+  LG +D PG L  VGKY+TK +Y
Subjt:  -VVMKGVRESLRRIGPEGS--------RHAMKKARMDKDGNINEECTRVVVERMEEIKVTQCQDGDNNSPEEDALSFPLGTQDHPGRLTSVGKYITKTQY

Query:  YHKPRKRSTSKFEDEKVCLNLEDRWETMKREKEE---KMKEGEKGEDSKEGEKTNNIREVEIGKKEEKIKKGEIGGEAMEGEKG-DDVFE----------
        +H   ++ T++ ED+K      DR     +E EE   KMKE +    + + E     +E    +  E +   E     +E EK  +DV E          
Subjt:  YHKPRKRSTSKFEDEKVCLNLEDRWETMKREKEE---KMKEGEKGEDSKEGEKTNNIREVEIGKKEEKIKKGEIGGEAMEGEKG-DDVFE----------

Query:  --------VKTREEKEGDSCQLSFGSIDNIVALVTIF-----GGNVKE-------------------------------------VATREEVLELYPFEY
                V   + K G  C+L+F   D++VA  TI      G NVK                                      V T    ++   F  
Subjt:  --------VKTREEKEGDSCQLSFGSIDNIVALVTIF-----GGNVKE-------------------------------------VATREEVLELYPFEY

Query:  DRTT---------PIALRCLLREMKMCKSQVKLPVTEEVFGCKRICKVWTDVVQTFYEMKPITDPCMDAYIIYYKNLVTSHWILIIIDYSKTMVYSINPL
        D +T         P+ALR LLR ++   S +++    +VFG +R C +  + ++ F  M+PI   C+DAYI+       +HW L++I+ +K   + I+PL
Subjt:  DRTT---------PIALRCLLREMKMCKSQVKLPVTEEVFGCKRICKVWTDVVQTFYEMKPITDPCMDAYIIYYKNLVTSHWILIIIDYSKTMVYSINPL

Query:  RNRLDNDIMDVV
        +NR+D D+++VV
Subjt:  RNRLDNDIMDVV

A0A5D3D5Q6 ULP_PROTEASE domain-containing protein3.9e-4329.3Show/hide
Query:  KNGHKLSSWIGTCVRHHVPIIYDHWKLVPIETKDKIYELVKGGFVVDRRAKKDILSCVSTLFRQYKWRLTATYINPYRDDPTKLKRIAGNACV--QPH--
        +N  KL S+IGT VR HVPIIY  W  VP E KDKI+EL++ GFVVD R+KK I+      FRQ+K+RLT TY+ P+ DD  KLK          Q H  
Subjt:  KNGHKLSSWIGTCVRHHVPIIYDHWKLVPIETKDKIYELVKGGFVVDRRAKKDILSCVSTLFRQYKWRLTATYINPYRDDPTKLKRIAGNACV--QPH--

Query:  -VVMKGVRESLRRIGPEG----SRH------------------------------AMKKARMDKDGNINEECTRVVVERMEEIKVTQCQDGDNNSPEEDA
          V   ++E  ++    G     +H                                K+ARMD+ G I +E T+ VV  ++E+  T  Q+  N   EED 
Subjt:  -VVMKGVRESLRRIGPEG----SRH------------------------------AMKKARMDKDGNINEECTRVVVERMEEIKVTQCQDGDNNSPEEDA

Query:  LSFPLGTQDHPGRLTSVGKYITKTQYYHKPRKRSTSKFEDEKVCLNLEDRWETMKREKEE---KMKEGEKGEDSKEGEKTNNIREVEIGKKEEKIKKGEI
        L+  LG +D PG L  VGKY+TK +Y+H   ++ T++ ED+K      DR     +E EE   KMKE +    + + E     +E    + +E +   E 
Subjt:  LSFPLGTQDHPGRLTSVGKYITKTQYYHKPRKRSTSKFEDEKVCLNLEDRWETMKREKEE---KMKEGEKGEDSKEGEKTNNIREVEIGKKEEKIKKGEI

Query:  GGEAMEGEKG-DDVFE------------------VKTREEKEGDSCQLSFGSIDNIVALVTIF-----GGNVKE--------------------------
            +E EK  +DV E                  V   + K G  C+L+F + D++VA  TI      G NVK                           
Subjt:  GGEAMEGEKG-DDVFE------------------VKTREEKEGDSCQLSFGSIDNIVALVTIF-----GGNVKE--------------------------

Query:  -----------VATREEVLELYPFEYDRTT---------PIALRCLLREMKMCKSQVKLPVTEEVFGCKRICKVWTDVVQTFYEMKPITDPCMDAYIIY-
                   V T    ++   F  D +T         P+ALR LLR ++   S +++    +VFG +R C +  + ++ F  M+PI   C+DAYI+Y 
Subjt:  -----------VATREEVLELYPFEYDRTT---------PIALRCLLREMKMCKSQVKLPVTEEVFGCKRICKVWTDVVQTFYEMKPITDPCMDAYIIY-

Query:  ------------YK-----------------NLVT-------------------SHWILIIIDYSKTMVYSINPLRNRLDNDIMDVV
                    YK                  L+T                   +HW L++I+ +K   + I+PL+NR+D D+ +VV
Subjt:  ------------YK-----------------NLVT-------------------SHWILIIIDYSKTMVYSINPLRNRLDNDIMDVV

A0A6J1DRM7 uncharacterized protein LOC1110236532.7e-4460.81Show/hide
Query:  CGNLIYKDIGTVRDHLYSNGIDQSYRVWFWHGEDLGNITVEQTYNNDALKSHEDDLDFFEVGGLVQYVQDEFSDVPKSFETIFHNAKYPLYPGCGKFTNL
        CG  I KD  T+R+HLY +GIDQSYRVWFWHGE+    T E   N++  K+HEDD D F+V  +VQ V DE S VPKSFE +F NAK PLYPGC KFT L
Subjt:  CGNLIYKDIGTVRDHLYSNGIDQSYRVWFWHGEDLGNITVEQTYNNDALKSHEDDLDFFEVGGLVQYVQDEFSDVPKSFETIFHNAKYPLYPGCGKFTNL

Query:  STLLRLYNLKVRFGWSNSSLSELLSITSDILPEPNEMPNSIWRKAKSI
        S L+RLYNLKVR+ W+NSS SELLS+ SD LPE  EMP S++   K++
Subjt:  STLLRLYNLKVRFGWSNSSLSELLSITSDILPEPNEMPNSIWRKAKSI

A0A6J1DYM6 uncharacterized protein LOC1110257951.1e-6481.76Show/hide
Query:  CGNLIYKDIGTVRDHLYSNGIDQSYRVWFWHGEDLGNITVEQTYNNDALKSHEDDLDFFEVGGLVQYVQDEFSDVPKSFETIFHNAKYPLYPGCGKFTNL
        CGNLI KDI  VRDHLYSNGIDQSY VWFWHGEDLGNI  EQTYNNDAL+S EDDLDFFEVG LVQYVQDEFSDVPKSFET+FHNAK PLYPGCGKFT +
Subjt:  CGNLIYKDIGTVRDHLYSNGIDQSYRVWFWHGEDLGNITVEQTYNNDALKSHEDDLDFFEVGGLVQYVQDEFSDVPKSFETIFHNAKYPLYPGCGKFTNL

Query:  STLLRLYNLKVRFGWSNSSLSELLSITSDILPEPNEMPNSIWRKAKSI
        S L+RLYNLKVRFGWSNSS SELLSI SDILP+PNEM  S++   K++
Subjt:  STLLRLYNLKVRFGWSNSSLSELLSITSDILPEPNEMPNSIWRKAKSI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCCGGTGGAATCAAGCGCTACTGTCGAATCAAAGGCCTCCGGTGGTGGAATCGAACGTTGTCGGTTGAATCCAACACCGGTGAGGGTCAAAACCGCTCAGTTGTG
TGGGAATCTTATATACAAGGACATAGGAACCGTTAGAGATCATTTATATTCAAATGGTATTGACCAAAGTTATAGGGTATGGTTTTGGCACGGTGAAGACTTAGGAAATA
TCACGGTCGAACAGACGTACAATAATGATGCACTCAAGAGCCATGAAGATGATCTCGACTTTTTTGAGGTGGGTGGCTTGGTTCAATATGTTCAGGATGAATTCTCTGAT
GTACCAAAATCATTTGAAACTATTTTTCATAATGCTAAGTATCCATTGTACCCCGGATGTGGAAAGTTTACAAATTTATCAACTCTTCTAAGGTTATACAATTTGAAGGT
GAGATTTGGGTGGAGTAACTCGAGTCTTTCTGAACTTCTGTCCATAACAAGTGATATATTACCAGAACCTAATGAAATGCCGAATTCTATATGGAGGAAGGCCAAGTCAA
TTGAAGATAATAAGCAGCCTGCCGGTGAGGATGTAAAGGGAGAGTCCAGCAAACGAAAGACCCGTGGACGTAAGAATGGGCATAAGTTGAGTAGCTGGATTGGGACGTGC
GTTCGTCACCATGTCCCAATAATATATGACCATTGGAAGCTGGTTCCGATCGAGACAAAAGATAAGATATATGAGTTGGTTAAGGGTGGATTCGTTGTGGATCGGAGGGC
AAAGAAAGATATCTTGAGCTGTGTTAGTACTTTATTTCGTCAATATAAATGGAGGTTAACTGCCACCTATATAAACCCTTATCGCGATGATCCAACAAAGTTAAAGAGAA
TTGCGGGGAATGCATGTGTACAACCACATGTTGTCATGAAAGGGGTACGCGAATCTCTCAGAAGAATTGGACCCGAGGGATCGAGGCACGCTATGAAGAAAGCGCGTATG
GACAAAGATGGAAATATCAACGAAGAATGTACTAGAGTCGTAGTCGAACGCATGGAGGAGATAAAAGTTACACAGTGTCAAGATGGTGACAACAACTCTCCAGAAGAAGA
TGCACTATCATTTCCTTTGGGTACACAAGACCATCCTGGACGACTTACGAGTGTGGGAAAGTATATAACGAAGACTCAATACTATCATAAACCAAGGAAACGGTCGACAT
CAAAGTTTGAAGATGAGAAGGTTTGTCTGAATCTCGAAGATAGGTGGGAGACAATGAAACGAGAAAAGGAAGAGAAGATGAAGGAAGGAGAAAAAGGAGAGGACAGTAAG
GAAGGAGAAAAAACAAATAATATAAGAGAAGTAGAAATAGGAAAAAAAGAAGAGAAGATAAAGAAAGGAGAAATAGGAGGGGAGGCTATGGAAGGAGAAAAAGGAGACGA
TGTTTTTGAAGTGAAGACAAGGGAAGAAAAGGAGGGAGATTCTTGTCAACTGTCATTCGGGTCAATCGACAACATCGTTGCATTAGTAACGATATTTGGAGGCAATGTGA
AGGAGGTTGCAACGAGAGAAGAAGTGCTCGAGCTATACCCATTCGAATATGATAGAACGACCCCAATTGCTCTGCGATGCCTACTTCGTGAAATGAAGATGTGTAAATCT
CAGGTAAAATTGCCAGTGACTGAAGAAGTGTTCGGATGTAAACGCATCTGTAAGGTATGGACGGACGTCGTACAAACATTTTATGAGATGAAGCCGATAACTGATCCGTG
CATGGATGCGTACATCATATACTACAAGAACTTGGTGACATCTCATTGGATATTGATCATCATTGATTACTCGAAGACCATGGTGTATTCAATAAACCCTTTGAGAAATC
GTCTGGACAATGATATCATGGATGTTGTCAACTGCTTACTATCTGCGAAATGTCTTGTTGTATGCATAGTGTCTAAGACAACCTGGATCGACAGAATGTGGGCATTATGT
GATGCGATTCATGTGGGTCATAGTCAGCCAAAAGAGCACTTCGATCCCAGATGTAGTAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTCCGGTGGAATCAAGCGCTACTGTCGAATCAAAGGCCTCCGGTGGTGGAATCGAACGTTGTCGGTTGAATCCAACACCGGTGAGGGTCAAAACCGCTCAGTTGTG
TGGGAATCTTATATACAAGGACATAGGAACCGTTAGAGATCATTTATATTCAAATGGTATTGACCAAAGTTATAGGGTATGGTTTTGGCACGGTGAAGACTTAGGAAATA
TCACGGTCGAACAGACGTACAATAATGATGCACTCAAGAGCCATGAAGATGATCTCGACTTTTTTGAGGTGGGTGGCTTGGTTCAATATGTTCAGGATGAATTCTCTGAT
GTACCAAAATCATTTGAAACTATTTTTCATAATGCTAAGTATCCATTGTACCCCGGATGTGGAAAGTTTACAAATTTATCAACTCTTCTAAGGTTATACAATTTGAAGGT
GAGATTTGGGTGGAGTAACTCGAGTCTTTCTGAACTTCTGTCCATAACAAGTGATATATTACCAGAACCTAATGAAATGCCGAATTCTATATGGAGGAAGGCCAAGTCAA
TTGAAGATAATAAGCAGCCTGCCGGTGAGGATGTAAAGGGAGAGTCCAGCAAACGAAAGACCCGTGGACGTAAGAATGGGCATAAGTTGAGTAGCTGGATTGGGACGTGC
GTTCGTCACCATGTCCCAATAATATATGACCATTGGAAGCTGGTTCCGATCGAGACAAAAGATAAGATATATGAGTTGGTTAAGGGTGGATTCGTTGTGGATCGGAGGGC
AAAGAAAGATATCTTGAGCTGTGTTAGTACTTTATTTCGTCAATATAAATGGAGGTTAACTGCCACCTATATAAACCCTTATCGCGATGATCCAACAAAGTTAAAGAGAA
TTGCGGGGAATGCATGTGTACAACCACATGTTGTCATGAAAGGGGTACGCGAATCTCTCAGAAGAATTGGACCCGAGGGATCGAGGCACGCTATGAAGAAAGCGCGTATG
GACAAAGATGGAAATATCAACGAAGAATGTACTAGAGTCGTAGTCGAACGCATGGAGGAGATAAAAGTTACACAGTGTCAAGATGGTGACAACAACTCTCCAGAAGAAGA
TGCACTATCATTTCCTTTGGGTACACAAGACCATCCTGGACGACTTACGAGTGTGGGAAAGTATATAACGAAGACTCAATACTATCATAAACCAAGGAAACGGTCGACAT
CAAAGTTTGAAGATGAGAAGGTTTGTCTGAATCTCGAAGATAGGTGGGAGACAATGAAACGAGAAAAGGAAGAGAAGATGAAGGAAGGAGAAAAAGGAGAGGACAGTAAG
GAAGGAGAAAAAACAAATAATATAAGAGAAGTAGAAATAGGAAAAAAAGAAGAGAAGATAAAGAAAGGAGAAATAGGAGGGGAGGCTATGGAAGGAGAAAAAGGAGACGA
TGTTTTTGAAGTGAAGACAAGGGAAGAAAAGGAGGGAGATTCTTGTCAACTGTCATTCGGGTCAATCGACAACATCGTTGCATTAGTAACGATATTTGGAGGCAATGTGA
AGGAGGTTGCAACGAGAGAAGAAGTGCTCGAGCTATACCCATTCGAATATGATAGAACGACCCCAATTGCTCTGCGATGCCTACTTCGTGAAATGAAGATGTGTAAATCT
CAGGTAAAATTGCCAGTGACTGAAGAAGTGTTCGGATGTAAACGCATCTGTAAGGTATGGACGGACGTCGTACAAACATTTTATGAGATGAAGCCGATAACTGATCCGTG
CATGGATGCGTACATCATATACTACAAGAACTTGGTGACATCTCATTGGATATTGATCATCATTGATTACTCGAAGACCATGGTGTATTCAATAAACCCTTTGAGAAATC
GTCTGGACAATGATATCATGGATGTTGTCAACTGCTTACTATCTGCGAAATGTCTTGTTGTATGCATAGTGTCTAAGACAACCTGGATCGACAGAATGTGGGCATTATGT
GATGCGATTCATGTGGGTCATAGTCAGCCAAAAGAGCACTTCGATCCCAGATGTAGTAAGTAA
Protein sequenceShow/hide protein sequence
MAPVESSATVESKASGGGIERCRLNPTPVRVKTAQLCGNLIYKDIGTVRDHLYSNGIDQSYRVWFWHGEDLGNITVEQTYNNDALKSHEDDLDFFEVGGLVQYVQDEFSD
VPKSFETIFHNAKYPLYPGCGKFTNLSTLLRLYNLKVRFGWSNSSLSELLSITSDILPEPNEMPNSIWRKAKSIEDNKQPAGEDVKGESSKRKTRGRKNGHKLSSWIGTC
VRHHVPIIYDHWKLVPIETKDKIYELVKGGFVVDRRAKKDILSCVSTLFRQYKWRLTATYINPYRDDPTKLKRIAGNACVQPHVVMKGVRESLRRIGPEGSRHAMKKARM
DKDGNINEECTRVVVERMEEIKVTQCQDGDNNSPEEDALSFPLGTQDHPGRLTSVGKYITKTQYYHKPRKRSTSKFEDEKVCLNLEDRWETMKREKEEKMKEGEKGEDSK
EGEKTNNIREVEIGKKEEKIKKGEIGGEAMEGEKGDDVFEVKTREEKEGDSCQLSFGSIDNIVALVTIFGGNVKEVATREEVLELYPFEYDRTTPIALRCLLREMKMCKS
QVKLPVTEEVFGCKRICKVWTDVVQTFYEMKPITDPCMDAYIIYYKNLVTSHWILIIIDYSKTMVYSINPLRNRLDNDIMDVVNCLLSAKCLVVCIVSKTTWIDRMWALC
DAIHVGHSQPKEHFDPRCSK