; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g33350 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g33350
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUlp1-like peptidase
Genome locationchr8:24219500..24222701
RNA-Seq ExpressionMoc08g33350
SyntenyMoc08g33350
Gene Ontology termsNA
InterPro domainsIPR015410 - Domain of unknown function DUF1985


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022146372.1 uncharacterized protein LOC111015600 [Momordica charantia]6.1e-9563.36Show/hide
Query:  MDMTLKINQDDWFPAALSNLAHVGKTSSRLKARLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLLREVEEPRDDLISFNLFGNRVCFGKREFDLITGL
        MD+ L ++++DWFPA L+NLAHV KT++R+KARLTP+QLDMF QTCFGPIL M VVFNGPL+HHLLL EVEEPR D+ISF+LF  RV FGKREFDLITGL
Subjt:  MDMTLKINQDDWFPAALSNLAHVGKTSSRLKARLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLLREVEEPRDDLISFNLFGNRVCFGKREFDLITGL

Query:  RHTMNRVDEDVRNRRLRILYFQDKASVKCSESEKIFLEHTFENDEDAVKIAIVYFIELAMMSKERKQKMDTSLLGIVDRWEVLCNYDWSSMIFERTLWSL
         H MNRV+  +  RRLR  YF+D   VKCSE EKIFLE  F +DED VK+ IVYFIELAMM KERKQ +DT  +G+VDRWE  CN DWSSMIF+RT+WSL
Subjt:  RHTMNRVDEDVRNRRLRILYFQDKASVKCSESEKIFLEHTFENDEDAVKIAIVYFIELAMMSKERKQKMDTSLLGIVDRWEVLCNYDWSSMIFERTLWSL

Query:  KNALKDKVEAYKQKVAMDSSHVETYSLYGFPYAFQ---------------SKVVVRLEATDVERQHMARVMHPPVAPV--GPPAPTELATEP
        KN LKDK+ AY+QK   D +HVETYSLYGFPY                  SKV   L ATD E QHM RV+ PP   V   PPA  + A  P
Subjt:  KNALKDKVEAYKQKVAMDSSHVETYSLYGFPYAFQ---------------SKVVVRLEATDVERQHMARVMHPPVAPV--GPPAPTELATEP

XP_022153201.1 uncharacterized protein LOC111020757 [Momordica charantia]3.9e-11050.92Show/hide
Query:  MDMTLKINQDDWFPAALSNLAHVGKTSSRLKARLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLLREVEEPRDDLISFNLFGNRVCFGKREFDLITGL
        MD+ L I+++DWFPA L+NLAH+ KTS+R+KARLTP+QLDMF QTCFGPIL ++VVFNGPL+HHLLLREVEEPR D+ISF+LFG RV FGKREFDLITGL
Subjt:  MDMTLKINQDDWFPAALSNLAHVGKTSSRLKARLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLLREVEEPRDDLISFNLFGNRVCFGKREFDLITGL

Query:  RHTMNRVDEDVRNRRLRILYFQDKASVKCSESEKIFLEHTFENDEDAVKIAIVYFIELAMMSKERKQKMDTSLLGIVDRWEVLCNYDWSSMIFERTLWSL
         H MNRVD  +  RRLR  YF+D   VKCSE EKIFLE  F +DED VK+ IVYFIELAMM KERKQ +DT+LLG+VDRWEV CNYDWSSMIF+RT+WSL
Subjt:  RHTMNRVDEDVRNRRLRILYFQDKASVKCSESEKIFLEHTFENDEDAVKIAIVYFIELAMMSKERKQKMDTSLLGIVDRWEVLCNYDWSSMIFERTLWSL

Query:  KNALKDKVEAYKQKVAMDSSHVETYSLYGFPYAFQ-----------------------------------------SKVVVRLEATDVERQHMARVMHPP
        KNALKDK+  Y+QK   D SHVETYSLYGFPYAFQ                                         SKV   L ATD + QHM RV+ PP
Subjt:  KNALKDKVEAYKQKVAMDSSHVETYSLYGFPYAFQ-----------------------------------------SKVVVRLEATDVERQHMARVMHPP

Query:  VAPVGPPAPTELATEPLATTSTAQKSPVTGEVGDPVELDDVAKDASPVVDHVTEDIIGTDGGQDQLLPQKGTEKKKKRSKHK--WSRELRRLGDRVTAIE
           V P  P   A    A       SP    V DP    ++     PVV     D    D  +      +G EK+ K++K K   SR L+RL + V AIE
Subjt:  VAPVGPPAPTELATEPLATTSTAQKSPVTGEVGDPVELDDVAKDASPVVDHVTEDIIGTDGGQDQLLPQKGTEKKKKRSKHK--WSRELRRLGDRVTAIE

Query:  TTLTGMPTDKKDIKKFMKRLTNVMSKGQNKYDRRGGGPDQDGSS-----------GGR---DPSGRNEEDMDMDEDPKTGEEPKTGN
          L       K I+ ++K+L        +KY   GGGPD DG S           GGR   D   R++ED   DED +T +EP +G+
Subjt:  TTLTGMPTDKKDIKKFMKRLTNVMSKGQNKYDRRGGGPDQDGSS-----------GGR---DPSGRNEEDMDMDEDPKTGEEPKTGN

XP_022154965.1 uncharacterized protein LOC111022110 [Momordica charantia]8.4e-15381.3Show/hide
Query:  MMSKERKQKMDTSLLGIVDRWEVLCNYDWSSMIFERTLWSLKNALKDKVEAYKQKVAMDSSHVETYSLYGFPYAFQ------------------------
        MM KERKQKMDTSLLGIVDRWEV C+YD SSMIFERTLWSLKNALKDKVEAYKQKVA+DSSHVETYSLYGFPYAFQ                        
Subjt:  MMSKERKQKMDTSLLGIVDRWEVLCNYDWSSMIFERTLWSLKNALKDKVEAYKQKVAMDSSHVETYSLYGFPYAFQ------------------------

Query:  -------------------------SKVVVRLEATDVERQHMARVMHPPVAPVGPPAPTELATEPLATTSTAQKSPVTGEVGDPVELDDVAKDASPVVDH
                                 SKVVVRLEATDVERQHMARVMHPPVAPVGPPAPTELATEPLATTSTAQKSPVT EVGD VELDDVAKDASP+VD 
Subjt:  -------------------------SKVVVRLEATDVERQHMARVMHPPVAPVGPPAPTELATEPLATTSTAQKSPVTGEVGDPVELDDVAKDASPVVDH

Query:  VTEDIIGTDGGQDQLLPQKGTEKKKKRSKHKWSRELRRLGDRVTAIETTLTGMPTDKKDIKKFMKRLTNVMSKGQNKYDRRGGGPDQDGSSGGRDPSGRN
        VTEDIIGTDGGQDQLLPQKGTEKKKK+SKHKWSRELRRLGDRVTAIETTLTGM TD KDIKKFMKRLT VMSKGQNKYDRRGG PDQDGSSGGRDPSGRN
Subjt:  VTEDIIGTDGGQDQLLPQKGTEKKKKRSKHKWSRELRRLGDRVTAIETTLTGMPTDKKDIKKFMKRLTNVMSKGQNKYDRRGGGPDQDGSSGGRDPSGRN

Query:  EEDMDMDEDPKTGEEPKTGNEPRMDEDPKNCEEPADVTESDVEMDHAPTIVGATQEVPSGHPSPVDVIE
        EEDMDMDEDPKTG+EPKTG+EPRMDEDPK CEEP DV ESDVEMDHAPTIVGATQEVPSGH SPVDVIE
Subjt:  EEDMDMDEDPKTGEEPKTGNEPRMDEDPKNCEEPADVTESDVEMDHAPTIVGATQEVPSGHPSPVDVIE

XP_022154995.1 uncharacterized protein LOC111022139 [Momordica charantia]1.1e-9692.39Show/hide
Query:  MDMTLKINQDDWFPAALSNLAHVGKTSSRLKARLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLLREVEEPRDDLISFNLFGNRVCFGKREFDLITGL
        MDMTLKINQDD FPAALSNLAHVGKTSSRLKARLTPSQLDMFSQTCFG ILGMN VFN  LLHHLLLREVEEPRDDLISFNLFGNRV FGKREFDLITGL
Subjt:  MDMTLKINQDDWFPAALSNLAHVGKTSSRLKARLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLLREVEEPRDDLISFNLFGNRVCFGKREFDLITGL

Query:  RHTMNRVDEDVRNRRLRILYFQDKASVKCSESEKIFLEHTFENDEDAVKIAIVYFIELAMMSKERKQKMDTSLLGIVDRWEVLCNYDWSSMIFERTL
        RHTMNRV +DV NRRLRILYFQDKASVKCSE EKIFLEHTF+NDEDAVKIAIVYFIELAMM KERKQKMDTSLLGIVDRWEV CNYDWSSMI E TL
Subjt:  RHTMNRVDEDVRNRRLRILYFQDKASVKCSESEKIFLEHTFENDEDAVKIAIVYFIELAMMSKERKQKMDTSLLGIVDRWEVLCNYDWSSMIFERTL

XP_022157020.1 uncharacterized protein LOC111023847 [Momordica charantia]6.7e-12680.54Show/hide
Query:  MDMTLKINQDDWFPAALSNLAHVGKTSSRLKARLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLLREVEEPRDDLISFNLFGNRVCFGKREFDLITGL
        M+MTLKINQDDWFPAALSNLAHVGKTSSRLKARLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLLREVEEP+DDLISFNLFGNRV FGKREFDLITGL
Subjt:  MDMTLKINQDDWFPAALSNLAHVGKTSSRLKARLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLLREVEEPRDDLISFNLFGNRVCFGKREFDLITGL

Query:  RHTMNRVDEDVRNRRLRILYFQDKASVKCSESEKIFLEHTFENDEDAVKIAIVYFIELAMMSKERKQKMDTSLLGIVDRWEVLCNYDWSSMIFERTLWSL
        RHTMNRVDEDVRNRRLRILYFQDKASVKCSE EKIFLEHTFENDEDAVKIAIVYFIELAMM KERK KMDTSLLGIVDRWEV CNYDWSSMIFERTLWSL
Subjt:  RHTMNRVDEDVRNRRLRILYFQDKASVKCSESEKIFLEHTFENDEDAVKIAIVYFIELAMMSKERKQKMDTSLLGIVDRWEVLCNYDWSSMIFERTLWSL

Query:  KNALKDKVEAYKQKVAMDSSHVETYSLYGFPYAFQ-------------------------------------------------SKVVVRLEATDVER
        KNALKDKVE YKQKVAMDSSHVETYSLY FPYAFQ                                                 SKVVVRLEATDVER
Subjt:  KNALKDKVEAYKQKVAMDSSHVETYSLYGFPYAFQ-------------------------------------------------SKVVVRLEATDVER

TrEMBL top hitse value%identityAlignment
A0A6J1CZE8 uncharacterized protein LOC1110156003.0e-9563.36Show/hide
Query:  MDMTLKINQDDWFPAALSNLAHVGKTSSRLKARLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLLREVEEPRDDLISFNLFGNRVCFGKREFDLITGL
        MD+ L ++++DWFPA L+NLAHV KT++R+KARLTP+QLDMF QTCFGPIL M VVFNGPL+HHLLL EVEEPR D+ISF+LF  RV FGKREFDLITGL
Subjt:  MDMTLKINQDDWFPAALSNLAHVGKTSSRLKARLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLLREVEEPRDDLISFNLFGNRVCFGKREFDLITGL

Query:  RHTMNRVDEDVRNRRLRILYFQDKASVKCSESEKIFLEHTFENDEDAVKIAIVYFIELAMMSKERKQKMDTSLLGIVDRWEVLCNYDWSSMIFERTLWSL
         H MNRV+  +  RRLR  YF+D   VKCSE EKIFLE  F +DED VK+ IVYFIELAMM KERKQ +DT  +G+VDRWE  CN DWSSMIF+RT+WSL
Subjt:  RHTMNRVDEDVRNRRLRILYFQDKASVKCSESEKIFLEHTFENDEDAVKIAIVYFIELAMMSKERKQKMDTSLLGIVDRWEVLCNYDWSSMIFERTLWSL

Query:  KNALKDKVEAYKQKVAMDSSHVETYSLYGFPYAFQ---------------SKVVVRLEATDVERQHMARVMHPPVAPV--GPPAPTELATEP
        KN LKDK+ AY+QK   D +HVETYSLYGFPY                  SKV   L ATD E QHM RV+ PP   V   PPA  + A  P
Subjt:  KNALKDKVEAYKQKVAMDSSHVETYSLYGFPYAFQ---------------SKVVVRLEATDVERQHMARVMHPPVAPV--GPPAPTELATEP

A0A6J1DJX9 uncharacterized protein LOC1110207571.9e-11050.92Show/hide
Query:  MDMTLKINQDDWFPAALSNLAHVGKTSSRLKARLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLLREVEEPRDDLISFNLFGNRVCFGKREFDLITGL
        MD+ L I+++DWFPA L+NLAH+ KTS+R+KARLTP+QLDMF QTCFGPIL ++VVFNGPL+HHLLLREVEEPR D+ISF+LFG RV FGKREFDLITGL
Subjt:  MDMTLKINQDDWFPAALSNLAHVGKTSSRLKARLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLLREVEEPRDDLISFNLFGNRVCFGKREFDLITGL

Query:  RHTMNRVDEDVRNRRLRILYFQDKASVKCSESEKIFLEHTFENDEDAVKIAIVYFIELAMMSKERKQKMDTSLLGIVDRWEVLCNYDWSSMIFERTLWSL
         H MNRVD  +  RRLR  YF+D   VKCSE EKIFLE  F +DED VK+ IVYFIELAMM KERKQ +DT+LLG+VDRWEV CNYDWSSMIF+RT+WSL
Subjt:  RHTMNRVDEDVRNRRLRILYFQDKASVKCSESEKIFLEHTFENDEDAVKIAIVYFIELAMMSKERKQKMDTSLLGIVDRWEVLCNYDWSSMIFERTLWSL

Query:  KNALKDKVEAYKQKVAMDSSHVETYSLYGFPYAFQ-----------------------------------------SKVVVRLEATDVERQHMARVMHPP
        KNALKDK+  Y+QK   D SHVETYSLYGFPYAFQ                                         SKV   L ATD + QHM RV+ PP
Subjt:  KNALKDKVEAYKQKVAMDSSHVETYSLYGFPYAFQ-----------------------------------------SKVVVRLEATDVERQHMARVMHPP

Query:  VAPVGPPAPTELATEPLATTSTAQKSPVTGEVGDPVELDDVAKDASPVVDHVTEDIIGTDGGQDQLLPQKGTEKKKKRSKHK--WSRELRRLGDRVTAIE
           V P  P   A    A       SP    V DP    ++     PVV     D    D  +      +G EK+ K++K K   SR L+RL + V AIE
Subjt:  VAPVGPPAPTELATEPLATTSTAQKSPVTGEVGDPVELDDVAKDASPVVDHVTEDIIGTDGGQDQLLPQKGTEKKKKRSKHK--WSRELRRLGDRVTAIE

Query:  TTLTGMPTDKKDIKKFMKRLTNVMSKGQNKYDRRGGGPDQDGSS-----------GGR---DPSGRNEEDMDMDEDPKTGEEPKTGN
          L       K I+ ++K+L        +KY   GGGPD DG S           GGR   D   R++ED   DED +T +EP +G+
Subjt:  TTLTGMPTDKKDIKKFMKRLTNVMSKGQNKYDRRGGGPDQDGSS-----------GGR---DPSGRNEEDMDMDEDPKTGEEPKTGN

A0A6J1DL40 uncharacterized protein LOC1110221104.1e-15381.3Show/hide
Query:  MMSKERKQKMDTSLLGIVDRWEVLCNYDWSSMIFERTLWSLKNALKDKVEAYKQKVAMDSSHVETYSLYGFPYAFQ------------------------
        MM KERKQKMDTSLLGIVDRWEV C+YD SSMIFERTLWSLKNALKDKVEAYKQKVA+DSSHVETYSLYGFPYAFQ                        
Subjt:  MMSKERKQKMDTSLLGIVDRWEVLCNYDWSSMIFERTLWSLKNALKDKVEAYKQKVAMDSSHVETYSLYGFPYAFQ------------------------

Query:  -------------------------SKVVVRLEATDVERQHMARVMHPPVAPVGPPAPTELATEPLATTSTAQKSPVTGEVGDPVELDDVAKDASPVVDH
                                 SKVVVRLEATDVERQHMARVMHPPVAPVGPPAPTELATEPLATTSTAQKSPVT EVGD VELDDVAKDASP+VD 
Subjt:  -------------------------SKVVVRLEATDVERQHMARVMHPPVAPVGPPAPTELATEPLATTSTAQKSPVTGEVGDPVELDDVAKDASPVVDH

Query:  VTEDIIGTDGGQDQLLPQKGTEKKKKRSKHKWSRELRRLGDRVTAIETTLTGMPTDKKDIKKFMKRLTNVMSKGQNKYDRRGGGPDQDGSSGGRDPSGRN
        VTEDIIGTDGGQDQLLPQKGTEKKKK+SKHKWSRELRRLGDRVTAIETTLTGM TD KDIKKFMKRLT VMSKGQNKYDRRGG PDQDGSSGGRDPSGRN
Subjt:  VTEDIIGTDGGQDQLLPQKGTEKKKKRSKHKWSRELRRLGDRVTAIETTLTGMPTDKKDIKKFMKRLTNVMSKGQNKYDRRGGGPDQDGSSGGRDPSGRN

Query:  EEDMDMDEDPKTGEEPKTGNEPRMDEDPKNCEEPADVTESDVEMDHAPTIVGATQEVPSGHPSPVDVIE
        EEDMDMDEDPKTG+EPKTG+EPRMDEDPK CEEP DV ESDVEMDHAPTIVGATQEVPSGH SPVDVIE
Subjt:  EEDMDMDEDPKTGEEPKTGNEPRMDEDPKNCEEPADVTESDVEMDHAPTIVGATQEVPSGHPSPVDVIE

A0A6J1DL69 uncharacterized protein LOC1110221395.4e-9792.39Show/hide
Query:  MDMTLKINQDDWFPAALSNLAHVGKTSSRLKARLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLLREVEEPRDDLISFNLFGNRVCFGKREFDLITGL
        MDMTLKINQDD FPAALSNLAHVGKTSSRLKARLTPSQLDMFSQTCFG ILGMN VFN  LLHHLLLREVEEPRDDLISFNLFGNRV FGKREFDLITGL
Subjt:  MDMTLKINQDDWFPAALSNLAHVGKTSSRLKARLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLLREVEEPRDDLISFNLFGNRVCFGKREFDLITGL

Query:  RHTMNRVDEDVRNRRLRILYFQDKASVKCSESEKIFLEHTFENDEDAVKIAIVYFIELAMMSKERKQKMDTSLLGIVDRWEVLCNYDWSSMIFERTL
        RHTMNRV +DV NRRLRILYFQDKASVKCSE EKIFLEHTF+NDEDAVKIAIVYFIELAMM KERKQKMDTSLLGIVDRWEV CNYDWSSMI E TL
Subjt:  RHTMNRVDEDVRNRRLRILYFQDKASVKCSESEKIFLEHTFENDEDAVKIAIVYFIELAMMSKERKQKMDTSLLGIVDRWEVLCNYDWSSMIFERTL

A0A6J1DRZ7 uncharacterized protein LOC1110238473.2e-12680.54Show/hide
Query:  MDMTLKINQDDWFPAALSNLAHVGKTSSRLKARLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLLREVEEPRDDLISFNLFGNRVCFGKREFDLITGL
        M+MTLKINQDDWFPAALSNLAHVGKTSSRLKARLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLLREVEEP+DDLISFNLFGNRV FGKREFDLITGL
Subjt:  MDMTLKINQDDWFPAALSNLAHVGKTSSRLKARLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLLREVEEPRDDLISFNLFGNRVCFGKREFDLITGL

Query:  RHTMNRVDEDVRNRRLRILYFQDKASVKCSESEKIFLEHTFENDEDAVKIAIVYFIELAMMSKERKQKMDTSLLGIVDRWEVLCNYDWSSMIFERTLWSL
        RHTMNRVDEDVRNRRLRILYFQDKASVKCSE EKIFLEHTFENDEDAVKIAIVYFIELAMM KERK KMDTSLLGIVDRWEV CNYDWSSMIFERTLWSL
Subjt:  RHTMNRVDEDVRNRRLRILYFQDKASVKCSESEKIFLEHTFENDEDAVKIAIVYFIELAMMSKERKQKMDTSLLGIVDRWEVLCNYDWSSMIFERTLWSL

Query:  KNALKDKVEAYKQKVAMDSSHVETYSLYGFPYAFQ-------------------------------------------------SKVVVRLEATDVER
        KNALKDKVE YKQKVAMDSSHVETYSLY FPYAFQ                                                 SKVVVRLEATDVER
Subjt:  KNALKDKVEAYKQKVAMDSSHVETYSLYGFPYAFQ-------------------------------------------------SKVVVRLEATDVER

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATATGACACTTAAGATCAACCAAGACGACTGGTTCCCGGCCGCTCTGTCAAATCTCGCTCACGTAGGGAAAACCTCTTCTCGTCTTAAGGCTAGGTTAACT
CCCTCTCAGTTAGACATGTTCAGTCAAACATGTTTTGGTCCGATTTTAGGGATGAACGTTGTATTTAACGGTCCTTTGCTCCATCACCTGTTGCTTAGAGAGGTG
GAGGAACCTAGAGACGACCTCATTAGCTTTAACCTATTCGGGAATAGGGTCTGTTTTGGGAAGCGGGAGTTCGACCTAATAACCGGTCTTAGACACACCATGAAT
AGGGTAGATGAGGATGTTCGTAACCGGAGACTTAGAATTCTGTATTTTCAAGACAAGGCGAGTGTGAAGTGTTCGGAGTCGGAGAAAATTTTTTTAGAACACACA
TTCGAAAATGACGAGGACGCTGTGAAGATTGCTATAGTGTACTTCATAGAGCTTGCCATGATGAGCAAGGAAAGGAAGCAGAAAATGGACACGAGCCTCCTTGGG
ATTGTGGATCGGTGGGAAGTTTTATGTAATTATGACTGGAGTTCAATGATTTTTGAAAGGACTCTCTGGAGCTTGAAGAACGCTCTGAAGGACAAGGTCGAGGCG
TACAAACAGAAGGTCGCTATGGACTCAAGCCATGTTGAGACGTATAGCTTGTATGGGTTTCCATACGCTTTTCAGTCGAAGGTTGTAGTTCGTTTGGAGGCGACT
GATGTCGAACGACAGCACATGGCTCGCGTTATGCATCCACCAGTGGCCCCTGTCGGACCTCCTGCACCCACAGAACTTGCTACAGAACCACTGGCTACTACTTCC
ACCGCTCAGAAGTCTCCCGTTACTGGTGAGGTTGGGGATCCAGTTGAGCTCGATGATGTAGCAAAGGATGCTTCCCCAGTGGTTGATCATGTAACAGAAGATATT
ATTGGGACCGATGGAGGACAAGATCAATTGTTGCCACAAAAAGGGACGGAGAAGAAGAAGAAGAGGTCGAAGCATAAGTGGAGTCGGGAGCTACGGAGGCTCGGC
GACAGAGTGACGGCCATTGAGACAACTCTGACGGGCATGCCGACTGACAAAAAGGACATAAAGAAGTTTATGAAGAGGCTAACAAATGTTATGTCGAAGGGCCAG
AATAAATATGATAGAAGGGGCGGTGGGCCGGATCAGGATGGTTCTTCGGGCGGACGTGATCCGAGTGGGCGTAACGAGGAGGATATGGATATGGATGAGGATCCG
AAGACAGGGGAAGAGCCGAAGACAGGGAACGAGCCGAGGATGGACGAGGATCCGAAGAATTGTGAAGAACCCGCCGACGTCACCGAGAGTGACGTGGAGATGGAT
CACGCTCCTACCATTGTTGGAGCTACCCAGGAGGTCCCAAGTGGCCACCCTAGTCCGGTCGACGTAATTGAGGATCTTACTTTAGGTAAGTGCGCCAGTGACGGG
GAGGCAAGTAAGGGGCAGCTGGTTAACGTATCGACACCGCAACCCGCAGGGCCACCGAGAAAGCAAACTGATAGGACAGAAAGTCGACCCCTACCCTTATCACAT
GGAGGAACTCCACACCTGACCGTTGTTAAGGTTGAACCTGAGTTTATTGAAGGCCCACTTAGCCAGGGTCTTCGGAAGAGGAAATATCCGTGGAAGTTGCGGGCC
ATATACACGCCCACCGGCCAACGTGGTATCAAAGTTCAAGCGTACGACCCTACATGCCCCATCCCACCGCTCCTGGACGAAGGGTTCCAGAAATGGATGGGTGAC
CCATCAACCGACGGCAATTCCCGTTCAACGTCCGTCGGGATCAAATACAAGAGTTGGTTTGGCCTGCTCCTGGATCTTGAGTTTCAACTCAACGACGAGGTTGAG
AAGTGTAAACATCTATTGCGCGTGCGATTCGCAATAGGCGACGTACTTCTATCCGTAAAAGTTGCTACGACGAACAGACGGGCCATATGCAGCTATGAAGCTGGG
TGTCCTACCGTCGAAATGTACGTAAGATTGGAGGCAAGAGCGTACCATCTTTCGTTGATCACCCCGTTGGATGATCTCGAGAAGGCGTTCAAGCCAATGTGCACG
ATAATCCCGGCGATTCTTCATTGGAGCGGGATGCTCGCAGTTCAGCCTAACCTGCCCACGGTGCCGTGGAGGGTCCGAAGATGTACTGTATCTCAGCAAGCCGGG
TTCACAGATTGCGACATATTTTGTATTAGATTTTTCGAGTACGATGTAACTGGGTCAAAGATGGACACTTTGACTCAAAGTAACATTTCTTTATTTCGTCGTCAA
TATGCTGTACAAATGTGGGCTCGCAGACCCTTTTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGATATGACACTTAAGATCAACCAAGACGACTGGTTCCCGGCCGCTCTGTCAAATCTCGCTCACGTAGGGAAAACCTCTTCTCGTCTTAAGGCTAGGTTAACT
CCCTCTCAGTTAGACATGTTCAGTCAAACATGTTTTGGTCCGATTTTAGGGATGAACGTTGTATTTAACGGTCCTTTGCTCCATCACCTGTTGCTTAGAGAGGTG
GAGGAACCTAGAGACGACCTCATTAGCTTTAACCTATTCGGGAATAGGGTCTGTTTTGGGAAGCGGGAGTTCGACCTAATAACCGGTCTTAGACACACCATGAAT
AGGGTAGATGAGGATGTTCGTAACCGGAGACTTAGAATTCTGTATTTTCAAGACAAGGCGAGTGTGAAGTGTTCGGAGTCGGAGAAAATTTTTTTAGAACACACA
TTCGAAAATGACGAGGACGCTGTGAAGATTGCTATAGTGTACTTCATAGAGCTTGCCATGATGAGCAAGGAAAGGAAGCAGAAAATGGACACGAGCCTCCTTGGG
ATTGTGGATCGGTGGGAAGTTTTATGTAATTATGACTGGAGTTCAATGATTTTTGAAAGGACTCTCTGGAGCTTGAAGAACGCTCTGAAGGACAAGGTCGAGGCG
TACAAACAGAAGGTCGCTATGGACTCAAGCCATGTTGAGACGTATAGCTTGTATGGGTTTCCATACGCTTTTCAGTCGAAGGTTGTAGTTCGTTTGGAGGCGACT
GATGTCGAACGACAGCACATGGCTCGCGTTATGCATCCACCAGTGGCCCCTGTCGGACCTCCTGCACCCACAGAACTTGCTACAGAACCACTGGCTACTACTTCC
ACCGCTCAGAAGTCTCCCGTTACTGGTGAGGTTGGGGATCCAGTTGAGCTCGATGATGTAGCAAAGGATGCTTCCCCAGTGGTTGATCATGTAACAGAAGATATT
ATTGGGACCGATGGAGGACAAGATCAATTGTTGCCACAAAAAGGGACGGAGAAGAAGAAGAAGAGGTCGAAGCATAAGTGGAGTCGGGAGCTACGGAGGCTCGGC
GACAGAGTGACGGCCATTGAGACAACTCTGACGGGCATGCCGACTGACAAAAAGGACATAAAGAAGTTTATGAAGAGGCTAACAAATGTTATGTCGAAGGGCCAG
AATAAATATGATAGAAGGGGCGGTGGGCCGGATCAGGATGGTTCTTCGGGCGGACGTGATCCGAGTGGGCGTAACGAGGAGGATATGGATATGGATGAGGATCCG
AAGACAGGGGAAGAGCCGAAGACAGGGAACGAGCCGAGGATGGACGAGGATCCGAAGAATTGTGAAGAACCCGCCGACGTCACCGAGAGTGACGTGGAGATGGAT
CACGCTCCTACCATTGTTGGAGCTACCCAGGAGGTCCCAAGTGGCCACCCTAGTCCGGTCGACGTAATTGAGGATCTTACTTTAGGTAAGTGCGCCAGTGACGGG
GAGGCAAGTAAGGGGCAGCTGGTTAACGTATCGACACCGCAACCCGCAGGGCCACCGAGAAAGCAAACTGATAGGACAGAAAGTCGACCCCTACCCTTATCACAT
GGAGGAACTCCACACCTGACCGTTGTTAAGGTTGAACCTGAGTTTATTGAAGGCCCACTTAGCCAGGGTCTTCGGAAGAGGAAATATCCGTGGAAGTTGCGGGCC
ATATACACGCCCACCGGCCAACGTGGTATCAAAGTTCAAGCGTACGACCCTACATGCCCCATCCCACCGCTCCTGGACGAAGGGTTCCAGAAATGGATGGGTGAC
CCATCAACCGACGGCAATTCCCGTTCAACGTCCGTCGGGATCAAATACAAGAGTTGGTTTGGCCTGCTCCTGGATCTTGAGTTTCAACTCAACGACGAGGTTGAG
AAGTGTAAACATCTATTGCGCGTGCGATTCGCAATAGGCGACGTACTTCTATCCGTAAAAGTTGCTACGACGAACAGACGGGCCATATGCAGCTATGAAGCTGGG
TGTCCTACCGTCGAAATGTACGTAAGATTGGAGGCAAGAGCGTACCATCTTTCGTTGATCACCCCGTTGGATGATCTCGAGAAGGCGTTCAAGCCAATGTGCACG
ATAATCCCGGCGATTCTTCATTGGAGCGGGATGCTCGCAGTTCAGCCTAACCTGCCCACGGTGCCGTGGAGGGTCCGAAGATGTACTGTATCTCAGCAAGCCGGG
TTCACAGATTGCGACATATTTTGTATTAGATTTTTCGAGTACGATGTAACTGGGTCAAAGATGGACACTTTGACTCAAAGTAACATTTCTTTATTTCGTCGTCAA
TATGCTGTACAAATGTGGGCTCGCAGACCCTTTTTTTAG
Protein sequenceShow/hide protein sequence
MDMTLKINQDDWFPAALSNLAHVGKTSSRLKARLTPSQLDMFSQTCFGPILGMNVVFNGPLLHHLLLREVEEPRDDLISFNLFGNRVCFGKREFDLITGLRHTMN
RVDEDVRNRRLRILYFQDKASVKCSESEKIFLEHTFENDEDAVKIAIVYFIELAMMSKERKQKMDTSLLGIVDRWEVLCNYDWSSMIFERTLWSLKNALKDKVEA
YKQKVAMDSSHVETYSLYGFPYAFQSKVVVRLEATDVERQHMARVMHPPVAPVGPPAPTELATEPLATTSTAQKSPVTGEVGDPVELDDVAKDASPVVDHVTEDI
IGTDGGQDQLLPQKGTEKKKKRSKHKWSRELRRLGDRVTAIETTLTGMPTDKKDIKKFMKRLTNVMSKGQNKYDRRGGGPDQDGSSGGRDPSGRNEEDMDMDEDP
KTGEEPKTGNEPRMDEDPKNCEEPADVTESDVEMDHAPTIVGATQEVPSGHPSPVDVIEDLTLGKCASDGEASKGQLVNVSTPQPAGPPRKQTDRTESRPLPLSH
GGTPHLTVVKVEPEFIEGPLSQGLRKRKYPWKLRAIYTPTGQRGIKVQAYDPTCPIPPLLDEGFQKWMGDPSTDGNSRSTSVGIKYKSWFGLLLDLEFQLNDEVE
KCKHLLRVRFAIGDVLLSVKVATTNRRAICSYEAGCPTVEMYVRLEARAYHLSLITPLDDLEKAFKPMCTIIPAILHWSGMLAVQPNLPTVPWRVRRCTVSQQAG
FTDCDIFCIRFFEYDVTGSKMDTLTQSNISLFRRQYAVQMWARRPFF