CuGenDBv2

Moc02g14190 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc02g14190
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Ulp1-like peptidase
Genome location: chr2:10489702..10492231
RNA-Seq Expression: Moc02g14190
Synteny: Moc02g14190
Gene Ontology terms: NA
InterPro domains: IPR015410 - Domain of unknown function DUF1985
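
The genome location above is given as a chromosome name followed by a start..end range. As a minimal sketch, the snippet below splits that string into its parts and computes the span of the locus. The function names parse_location and span_length are hypothetical helpers, not part of CuGenDB, and the length calculation assumes the coordinates are 1-based and inclusive at both ends, which this record does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval in base pairs.

    Assumption: coordinates are 1-based and inclusive on both ends.
    """
    return end - start + 1


chrom, start, end = parse_location("chr2:10489702..10492231")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the locus spans 2530 bp; with a half-open convention the figure would be one less.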


Homology
GenBank top hits (e value, % identity, alignment)
XP_022146372.1 uncharacterized protein LOC111015600 [Momordica charantia]; e value: 4.0e-95; % identity: 59.31
Query:  MSMTLKINRDDWFPAALSNLAHVGKTSSRLKARLTSSQLDMFSRTCFGPILGMDVVFNGPLLHHLLLREVEEPREDLISFNLFGNRVSFGKREFDLITGL
        M + L ++R+DWFPA L+NLAHV KT++R+KARLT +QLDMF +TCFGPIL M VVFNGPL+HHLLL EVEEPR+D+ISF+LF  RVSFGKREFDLITGL
Subjt:  MSMTLKINRDDWFPAALSNLAHVGKTSSRLKARLTSSQLDMFSRTCFGPILGMDVVFNGPLLHHLLLREVEEPREDLISFNLFGNRVSFGKREFDLITGL

Query:  RHTMNRVDEDVRNRRLKILYFADKAGVKCSELEKIFLEHLFENDEDAVKVAIVYFIELAMMRKERKQKMDTSLFGIVNRWEVFCNYDWSSMIFERTLWSL
         H MNRV+  +  RRL+  YF D   VKCSELEKIFLE +F +DED VKV IVYFIELAMM KERKQ +DT   G+V+RWE FCN DWSSMIF+RT+WSL
Subjt:  RHTMNRVDEDVRNRRLKILYFADKAGVKCSELEKIFLEHLFENDEDAVKVAIVYFIELAMMRKERKQKMDTSLFGIVNRWEVFCNYDWSSMIFERTLWSL

Query:  KNALKDKVEAYKQKVAMDSSHVETYSLYGFPYAFQVWAYETISTLSTRVALRLNDDAIPRLLRWSCTYSRAFNVLEREVFENPKSKVVVRLEATDVERQH
        KN LKDK+ AY+QK   D +HVETYSLYGFPY                                     R   VL  EVF+N  SKV   L ATD E QH
Subjt:  KNALKDKVEAYKQKVAMDSSHVETYSLYGFPYAFQVWAYETISTLSTRVALRLNDDAIPRLLRWSCTYSRAFNVLEREVFENPKSKVVVRLEATDVERQH

Query:  MARIMHPPVAHVRPLAP
        M R++ PP   V P  P
Subjt:  MARIMHPPVAHVRPLAP

XP_022153201.1 uncharacterized protein LOC111020757 [Momordica charantia]; e value: 5.4e-132; % identity: 57.52
Query:  MSMTLKINRDDWFPAALSNLAHVGKTSSRLKARLTSSQLDMFSRTCFGPILGMDVVFNGPLLHHLLLREVEEPREDLISFNLFGNRVSFGKREFDLITGL
        M + L I+R+DWFPA L+NLAH+ KTS+R+KARLT +QLDMF +TCFGPIL +DVVFNGPL+HHLLLREVEEPR+D+ISF+LFG RVSFGKREFDLITGL
Subjt:  MSMTLKINRDDWFPAALSNLAHVGKTSSRLKARLTSSQLDMFSRTCFGPILGMDVVFNGPLLHHLLLREVEEPREDLISFNLFGNRVSFGKREFDLITGL

Query:  RHTMNRVDEDVRNRRLKILYFADKAGVKCSELEKIFLEHLFENDEDAVKVAIVYFIELAMMRKERKQKMDTSLFGIVNRWEVFCNYDWSSMIFERTLWSL
         H MNRVD  +  RRL+  YF D   VKCSELEKIFLE +F +DED VKV IVYFIELAMM KERKQ +DT+L G+V+RWEVFCNYDWSSMIF+RT+WSL
Subjt:  RHTMNRVDEDVRNRRLKILYFADKAGVKCSELEKIFLEHLFENDEDAVKVAIVYFIELAMMRKERKQKMDTSLFGIVNRWEVFCNYDWSSMIFERTLWSL

Query:  KNALKDKVEAYKQKVAMDSSHVETYSLYGFPYAFQVWAYETISTLSTRVALRLNDDAIPRLLRWSCTYSRAFNVLEREVFENPKSKVVVRLEATDVERQH
        KNALKDK+  Y+QK   D SHVETYSLYGFPYAFQVWAYETISTLS        DDAIPRLLRWSC YS  F VL  EVF+N +SKV   L ATD + QH
Subjt:  KNALKDKVEAYKQKVAMDSSHVETYSLYGFPYAFQVWAYETISTLSTRVALRLNDDAIPRLLRWSCTYSRAFNVLEREVFENPKSKVVVRLEATDVERQH

Query:  MARIMHPPVAHVRPLAPTELATEPLASTSTAQKSPITGEVGDPVELDDVAENIIGEDASPVVD-HATEDI--IGNDGGQDQLLPQKGTKKKKKKSKHK--
        M R++ PP   V P  P   A    A       SP    V DP      A+  +G    PVVD HA ++     NDG        +G +K+ KK+K K  
Subjt:  MARIMHPPVAHVRPLAPTELATEPLASTSTAQKSPITGEVGDPVELDDVAENIIGEDASPVVD-HATEDI--IGNDGGQDQLLPQKGTKKKKKKSKHK--

Query:  WSRELWRLGDRVTAIETTLTSMTTDLKDIKKYMKRLTKVMAKGQNKSERRGGGPDQDGSS-----------GGR---DPSGRNEEDTDMDED
         SR L RL + V AIE  L      LK I+ Y+K+L K      +K    GGGPD DG S           GGR   D   R++ED   DED
Subjt:  WSRELWRLGDRVTAIETTLTSMTTDLKDIKKYMKRLTKVMAKGQNKSERRGGGPDQDGSS-----------GGR---DPSGRNEEDTDMDED

XP_022154965.1 uncharacterized protein LOC111022110 [Momordica charantia]; e value: 1.1e-148; % identity: 88.92
Query:  MMRKERKQKMDTSLFGIVNRWEVFCNYDWSSMIFERTLWSLKNALKDKVEAYKQKVAMDSSHVETYSLYGFPYAFQVWAYETISTLSTRVALRLNDDAIP
        MM KERKQKMDTSL GIV+RWEVFC+YD SSMIFERTLWSLKNALKDKVEAYKQKVA+DSSHVETYSLYGFPYAFQVWAYETISTLSTRVALRLNDDAIP
Subjt:  MMRKERKQKMDTSLFGIVNRWEVFCNYDWSSMIFERTLWSLKNALKDKVEAYKQKVAMDSSHVETYSLYGFPYAFQVWAYETISTLSTRVALRLNDDAIP

Query:  RLLRWSCTYSRAFNVLEREVFENPKSKVVVRLEATDVERQHMARIMHPPVAHVRPLAPTELATEPLASTSTAQKSPITGEVGDPVELDDVAENIIGEDAS
        RLLRWSCTYSRAFNVLEREVFEN KSKVVVRLEATDVERQHMAR+MHPPVA V P APTELATEPLA+TSTAQKSP+T EVGD VELDDVA     +DAS
Subjt:  RLLRWSCTYSRAFNVLEREVFENPKSKVVVRLEATDVERQHMARIMHPPVAHVRPLAPTELATEPLASTSTAQKSPITGEVGDPVELDDVAENIIGEDAS

Query:  PVVDHATEDIIGNDGGQDQLLPQKGTKKKKKKSKHKWSRELWRLGDRVTAIETTLTSMTTDLKDIKKYMKRLTKVMAKGQNKSERRGGGPDQDGSSGGRD
        P+VD  TEDIIG DGGQDQLLPQKGT+KKKKKSKHKWSREL RLGDRVTAIETTLT MTTD+KDIKK+MKRLTKVM+KGQNK +RRGG PDQDGSSGGRD
Subjt:  PVVDHATEDIIGNDGGQDQLLPQKGTKKKKKKSKHKWSRELWRLGDRVTAIETTLTSMTTDLKDIKKYMKRLTKVMAKGQNKSERRGGGPDQDGSSGGRD

Query:  PSGRNEEDTDMDEDPK
        PSGRNEED DMDEDPK
Subjt:  PSGRNEEDTDMDEDPK

XP_022154995.1 uncharacterized protein LOC111022139 [Momordica charantia]; e value: 9.3e-100; % identity: 80.75
Query:  NRDEARDGFQKVHPSGISFLDSIPGFNPVR-----------DMSMTLKINRDDWFPAALSNLAHVGKTSSRLKARLTSSQLDMFSRTCFGPILGMDVVFN
        +RDEARD  QKV      FLDSIP FNPVR           DM MTLKIN+DD FPAALSNLAHVGKTSSRLKARLT SQLDMFS+TCFG ILGM+ VFN
Subjt:  NRDEARDGFQKVHPSGISFLDSIPGFNPVR-----------DMSMTLKINRDDWFPAALSNLAHVGKTSSRLKARLTSSQLDMFSRTCFGPILGMDVVFN

Query:  GPLLHHLLLREVEEPREDLISFNLFGNRVSFGKREFDLITGLRHTMNRVDEDVRNRRLKILYFADKAGVKCSELEKIFLEHLFENDEDAVKVAIVYFIEL
          LLHHLLLREVEEPR+DLISFNLFGNRVSFGKREFDLITGLRHTMNRV +DV NRRL+ILYF DKA VKCSELEKIFLEH F+NDEDAVK+AIVYFIEL
Subjt:  GPLLHHLLLREVEEPREDLISFNLFGNRVSFGKREFDLITGLRHTMNRVDEDVRNRRLKILYFADKAGVKCSELEKIFLEHLFENDEDAVKVAIVYFIEL

Query:  AMMRKERKQKMDTSLFGIVNRWEVFCNYDWSSMIFERTL
        AMM KERKQKMDTSL GIV+RWEVFCNYDWSSMI E TL
Subjt:  AMMRKERKQKMDTSLFGIVNRWEVFCNYDWSSMIFERTL

XP_022157020.1 uncharacterized protein LOC111023847 [Momordica charantia]; e value: 6.3e-157; % identity: 93.62
Query:  MSMTLKINRDDWFPAALSNLAHVGKTSSRLKARLTSSQLDMFSRTCFGPILGMDVVFNGPLLHHLLLREVEEPREDLISFNLFGNRVSFGKREFDLITGL
        M+MTLKIN+DDWFPAALSNLAHVGKTSSRLKARLT SQLDMFS+TCFGPILGM+VVFNGPLLHHLLLREVEEP++DLISFNLFGNRVSFGKREFDLITGL
Subjt:  MSMTLKINRDDWFPAALSNLAHVGKTSSRLKARLTSSQLDMFSRTCFGPILGMDVVFNGPLLHHLLLREVEEPREDLISFNLFGNRVSFGKREFDLITGL

Query:  RHTMNRVDEDVRNRRLKILYFADKAGVKCSELEKIFLEHLFENDEDAVKVAIVYFIELAMMRKERKQKMDTSLFGIVNRWEVFCNYDWSSMIFERTLWSL
        RHTMNRVDEDVRNRRL+ILYF DKA VKCSELEKIFLEH FENDEDAVK+AIVYFIELAMM KERK KMDTSL GIV+RWEVFCNYDWSSMIFERTLWSL
Subjt:  RHTMNRVDEDVRNRRLKILYFADKAGVKCSELEKIFLEHLFENDEDAVKVAIVYFIELAMMRKERKQKMDTSLFGIVNRWEVFCNYDWSSMIFERTLWSL

Query:  KNALKDKVEAYKQKVAMDSSHVETYSLYGFPYAFQVWAYETISTLSTRVALRLNDDAIPRLLRWSCTYSRAFNVLEREVFENPKSKVVVRLEATDVER
        KNALKDKVE YKQKVAMDSSHVETYSLY FPYAFQVWAYETISTLSTRVALRLNDDAIPRLLRWSCTYSRAFNVLEREVFEN KSKVVVRLEATDVER
Subjt:  KNALKDKVEAYKQKVAMDSSHVETYSLYGFPYAFQVWAYETISTLSTRVALRLNDDAIPRLLRWSCTYSRAFNVLEREVFENPKSKVVVRLEATDVER

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CZE8 uncharacterized protein LOC111015600; e value: 2.0e-95; % identity: 59.31
Query:  MSMTLKINRDDWFPAALSNLAHVGKTSSRLKARLTSSQLDMFSRTCFGPILGMDVVFNGPLLHHLLLREVEEPREDLISFNLFGNRVSFGKREFDLITGL
        M + L ++R+DWFPA L+NLAHV KT++R+KARLT +QLDMF +TCFGPIL M VVFNGPL+HHLLL EVEEPR+D+ISF+LF  RVSFGKREFDLITGL
Subjt:  MSMTLKINRDDWFPAALSNLAHVGKTSSRLKARLTSSQLDMFSRTCFGPILGMDVVFNGPLLHHLLLREVEEPREDLISFNLFGNRVSFGKREFDLITGL

Query:  RHTMNRVDEDVRNRRLKILYFADKAGVKCSELEKIFLEHLFENDEDAVKVAIVYFIELAMMRKERKQKMDTSLFGIVNRWEVFCNYDWSSMIFERTLWSL
         H MNRV+  +  RRL+  YF D   VKCSELEKIFLE +F +DED VKV IVYFIELAMM KERKQ +DT   G+V+RWE FCN DWSSMIF+RT+WSL
Subjt:  RHTMNRVDEDVRNRRLKILYFADKAGVKCSELEKIFLEHLFENDEDAVKVAIVYFIELAMMRKERKQKMDTSLFGIVNRWEVFCNYDWSSMIFERTLWSL

Query:  KNALKDKVEAYKQKVAMDSSHVETYSLYGFPYAFQVWAYETISTLSTRVALRLNDDAIPRLLRWSCTYSRAFNVLEREVFENPKSKVVVRLEATDVERQH
        KN LKDK+ AY+QK   D +HVETYSLYGFPY                                     R   VL  EVF+N  SKV   L ATD E QH
Subjt:  KNALKDKVEAYKQKVAMDSSHVETYSLYGFPYAFQVWAYETISTLSTRVALRLNDDAIPRLLRWSCTYSRAFNVLEREVFENPKSKVVVRLEATDVERQH

Query:  MARIMHPPVAHVRPLAP
        M R++ PP   V P  P
Subjt:  MARIMHPPVAHVRPLAP

A0A6J1DJX9 uncharacterized protein LOC111020757; e value: 2.6e-132; % identity: 57.52
Query:  MSMTLKINRDDWFPAALSNLAHVGKTSSRLKARLTSSQLDMFSRTCFGPILGMDVVFNGPLLHHLLLREVEEPREDLISFNLFGNRVSFGKREFDLITGL
        M + L I+R+DWFPA L+NLAH+ KTS+R+KARLT +QLDMF +TCFGPIL +DVVFNGPL+HHLLLREVEEPR+D+ISF+LFG RVSFGKREFDLITGL
Subjt:  MSMTLKINRDDWFPAALSNLAHVGKTSSRLKARLTSSQLDMFSRTCFGPILGMDVVFNGPLLHHLLLREVEEPREDLISFNLFGNRVSFGKREFDLITGL

Query:  RHTMNRVDEDVRNRRLKILYFADKAGVKCSELEKIFLEHLFENDEDAVKVAIVYFIELAMMRKERKQKMDTSLFGIVNRWEVFCNYDWSSMIFERTLWSL
         H MNRVD  +  RRL+  YF D   VKCSELEKIFLE +F +DED VKV IVYFIELAMM KERKQ +DT+L G+V+RWEVFCNYDWSSMIF+RT+WSL
Subjt:  RHTMNRVDEDVRNRRLKILYFADKAGVKCSELEKIFLEHLFENDEDAVKVAIVYFIELAMMRKERKQKMDTSLFGIVNRWEVFCNYDWSSMIFERTLWSL

Query:  KNALKDKVEAYKQKVAMDSSHVETYSLYGFPYAFQVWAYETISTLSTRVALRLNDDAIPRLLRWSCTYSRAFNVLEREVFENPKSKVVVRLEATDVERQH
        KNALKDK+  Y+QK   D SHVETYSLYGFPYAFQVWAYETISTLS        DDAIPRLLRWSC YS  F VL  EVF+N +SKV   L ATD + QH
Subjt:  KNALKDKVEAYKQKVAMDSSHVETYSLYGFPYAFQVWAYETISTLSTRVALRLNDDAIPRLLRWSCTYSRAFNVLEREVFENPKSKVVVRLEATDVERQH

Query:  MARIMHPPVAHVRPLAPTELATEPLASTSTAQKSPITGEVGDPVELDDVAENIIGEDASPVVD-HATEDI--IGNDGGQDQLLPQKGTKKKKKKSKHK--
        M R++ PP   V P  P   A    A       SP    V DP      A+  +G    PVVD HA ++     NDG        +G +K+ KK+K K  
Subjt:  MARIMHPPVAHVRPLAPTELATEPLASTSTAQKSPITGEVGDPVELDDVAENIIGEDASPVVD-HATEDI--IGNDGGQDQLLPQKGTKKKKKKSKHK--

Query:  WSRELWRLGDRVTAIETTLTSMTTDLKDIKKYMKRLTKVMAKGQNKSERRGGGPDQDGSS-----------GGR---DPSGRNEEDTDMDED
         SR L RL + V AIE  L      LK I+ Y+K+L K      +K    GGGPD DG S           GGR   D   R++ED   DED
Subjt:  WSRELWRLGDRVTAIETTLTSMTTDLKDIKKYMKRLTKVMAKGQNKSERRGGGPDQDGSS-----------GGR---DPSGRNEEDTDMDED

A0A6J1DL40 uncharacterized protein LOC111022110; e value: 5.2e-149; % identity: 88.92
Query:  MMRKERKQKMDTSLFGIVNRWEVFCNYDWSSMIFERTLWSLKNALKDKVEAYKQKVAMDSSHVETYSLYGFPYAFQVWAYETISTLSTRVALRLNDDAIP
        MM KERKQKMDTSL GIV+RWEVFC+YD SSMIFERTLWSLKNALKDKVEAYKQKVA+DSSHVETYSLYGFPYAFQVWAYETISTLSTRVALRLNDDAIP
Subjt:  MMRKERKQKMDTSLFGIVNRWEVFCNYDWSSMIFERTLWSLKNALKDKVEAYKQKVAMDSSHVETYSLYGFPYAFQVWAYETISTLSTRVALRLNDDAIP

Query:  RLLRWSCTYSRAFNVLEREVFENPKSKVVVRLEATDVERQHMARIMHPPVAHVRPLAPTELATEPLASTSTAQKSPITGEVGDPVELDDVAENIIGEDAS
        RLLRWSCTYSRAFNVLEREVFEN KSKVVVRLEATDVERQHMAR+MHPPVA V P APTELATEPLA+TSTAQKSP+T EVGD VELDDVA     +DAS
Subjt:  RLLRWSCTYSRAFNVLEREVFENPKSKVVVRLEATDVERQHMARIMHPPVAHVRPLAPTELATEPLASTSTAQKSPITGEVGDPVELDDVAENIIGEDAS

Query:  PVVDHATEDIIGNDGGQDQLLPQKGTKKKKKKSKHKWSRELWRLGDRVTAIETTLTSMTTDLKDIKKYMKRLTKVMAKGQNKSERRGGGPDQDGSSGGRD
        P+VD  TEDIIG DGGQDQLLPQKGT+KKKKKSKHKWSREL RLGDRVTAIETTLT MTTD+KDIKK+MKRLTKVM+KGQNK +RRGG PDQDGSSGGRD
Subjt:  PVVDHATEDIIGNDGGQDQLLPQKGTKKKKKKSKHKWSRELWRLGDRVTAIETTLTSMTTDLKDIKKYMKRLTKVMAKGQNKSERRGGGPDQDGSSGGRD

Query:  PSGRNEEDTDMDEDPK
        PSGRNEED DMDEDPK
Subjt:  PSGRNEEDTDMDEDPK

A0A6J1DL69 uncharacterized protein LOC111022139; e value: 4.5e-100; % identity: 80.75
Query:  NRDEARDGFQKVHPSGISFLDSIPGFNPVR-----------DMSMTLKINRDDWFPAALSNLAHVGKTSSRLKARLTSSQLDMFSRTCFGPILGMDVVFN
        +RDEARD  QKV      FLDSIP FNPVR           DM MTLKIN+DD FPAALSNLAHVGKTSSRLKARLT SQLDMFS+TCFG ILGM+ VFN
Subjt:  NRDEARDGFQKVHPSGISFLDSIPGFNPVR-----------DMSMTLKINRDDWFPAALSNLAHVGKTSSRLKARLTSSQLDMFSRTCFGPILGMDVVFN

Query:  GPLLHHLLLREVEEPREDLISFNLFGNRVSFGKREFDLITGLRHTMNRVDEDVRNRRLKILYFADKAGVKCSELEKIFLEHLFENDEDAVKVAIVYFIEL
          LLHHLLLREVEEPR+DLISFNLFGNRVSFGKREFDLITGLRHTMNRV +DV NRRL+ILYF DKA VKCSELEKIFLEH F+NDEDAVK+AIVYFIEL
Subjt:  GPLLHHLLLREVEEPREDLISFNLFGNRVSFGKREFDLITGLRHTMNRVDEDVRNRRLKILYFADKAGVKCSELEKIFLEHLFENDEDAVKVAIVYFIEL

Query:  AMMRKERKQKMDTSLFGIVNRWEVFCNYDWSSMIFERTL
        AMM KERKQKMDTSL GIV+RWEVFCNYDWSSMI E TL
Subjt:  AMMRKERKQKMDTSLFGIVNRWEVFCNYDWSSMIFERTL

A0A6J1DRZ7 uncharacterized protein LOC111023847; e value: 3.1e-157; % identity: 93.62
Query:  MSMTLKINRDDWFPAALSNLAHVGKTSSRLKARLTSSQLDMFSRTCFGPILGMDVVFNGPLLHHLLLREVEEPREDLISFNLFGNRVSFGKREFDLITGL
        M+MTLKIN+DDWFPAALSNLAHVGKTSSRLKARLT SQLDMFS+TCFGPILGM+VVFNGPLLHHLLLREVEEP++DLISFNLFGNRVSFGKREFDLITGL
Subjt:  MSMTLKINRDDWFPAALSNLAHVGKTSSRLKARLTSSQLDMFSRTCFGPILGMDVVFNGPLLHHLLLREVEEPREDLISFNLFGNRVSFGKREFDLITGL

Query:  RHTMNRVDEDVRNRRLKILYFADKAGVKCSELEKIFLEHLFENDEDAVKVAIVYFIELAMMRKERKQKMDTSLFGIVNRWEVFCNYDWSSMIFERTLWSL
        RHTMNRVDEDVRNRRL+ILYF DKA VKCSELEKIFLEH FENDEDAVK+AIVYFIELAMM KERK KMDTSL GIV+RWEVFCNYDWSSMIFERTLWSL
Subjt:  RHTMNRVDEDVRNRRLKILYFADKAGVKCSELEKIFLEHLFENDEDAVKVAIVYFIELAMMRKERKQKMDTSLFGIVNRWEVFCNYDWSSMIFERTLWSL

Query:  KNALKDKVEAYKQKVAMDSSHVETYSLYGFPYAFQVWAYETISTLSTRVALRLNDDAIPRLLRWSCTYSRAFNVLEREVFENPKSKVVVRLEATDVER
        KNALKDKVE YKQKVAMDSSHVETYSLY FPYAFQVWAYETISTLSTRVALRLNDDAIPRLLRWSCTYSRAFNVLEREVFEN KSKVVVRLEATDVER
Subjt:  KNALKDKVEAYKQKVAMDSSHVETYSLYGFPYAFQVWAYETISTLSTRVALRLNDDAIPRLLRWSCTYSRAFNVLEREVFENPKSKVVVRLEATDVER

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGGACTTTCTGGGTTCCATCTCGGGACTCATATCGGTTCAAAATGGACCGGGACAAGACCGAGATGGTATCCAGAAAGTCTTTCTGGATTCCATCTTCCATCTC
GGGCCTCGTCTCGGTCCATTTCGAACCGGGATGTGTCCCAAGATGGAACCCAGAAAGTCTTTTGGATTCCATCTCGGGACACATCTCGGTTCATTTCGAACCGGG
ATGAGTCCCAAGATGGAATCTAGGAAATCCATCCTTGTGGAATGGACTTTCTGGATTCCATCTTGGGACTCATCCCGGTCCATTTCGAACCGAGATGAGGCCCGA
GATGGATTCCAGAAAGTCCATCCTTCTGGGATTTCCTTTCTGGATTCCATCCCAGGCTTCAACCCGGTCCGAGACATGAGTATGACACTTAAGATCAACCGAGAC
GACTGGTTTCCGGCCGCACTGTCAAACCTCGCTCACGTAGGGAAAACCTCTTCTCGTCTTAAGGCTAGGTTAACTTCCTCTCAGTTAGACATGTTTAGTCGAACA
TGTTTTGGTCCGATTTTAGGGATGGACGTCGTATTTAACGGTCCGTTGCTCCATCACCTGTTGCTTAGAGAGGTGGAGGAACCTAGAGAGGACCTCATTAGTTTT
AACCTATTCGGGAATAGGGTCTCTTTTGGGAAGCGGGAGTTCGACCTAATAACCGGTCTTAGACACACCATGAATAGGGTAGATGAGGATGTTCGTAACCGAAGA
CTTAAGATTCTGTATTTTGCAGACAAGGCGGGTGTGAAGTGTTCGGAGTTGGAGAAAATTTTTTTAGAACACTTATTCGAAAATGACGAGGACGCTGTGAAGGTT
GCTATAGTGTACTTCATAGAGCTTGCCATGATGAGAAAGGAAAGGAAACAGAAAATGGACACAAGCCTCTTTGGGATTGTGAATCGGTGGGAAGTTTTCTGTAAT
TATGACTGGAGTTCAATGATTTTTGAAAGAACTCTCTGGAGCTTGAAGAACGCTCTGAAGGACAAGGTCGAGGCGTACAAACAGAAGGTCGCTATGGACTCCAGC
CATGTTGAGACGTATAGCTTGTATGGGTTTCCATATGCTTTTCAGGTTTGGGCATACGAGACAATATCAACCTTGTCGACTCGAGTAGCATTGAGGCTAAATGAC
GATGCTATTCCTCGTCTACTTAGATGGTCCTGCACCTATTCGCGTGCTTTTAATGTTTTGGAGCGAGAGGTCTTCGAGAATCCCAAGTCGAAGGTTGTAGTTCGT
TTGGAGGCGACTGATGTCGAACGACAGCACATGGCTCGCATTATGCATCCACCGGTGGCCCATGTCAGACCTCTTGCACCCACAGAACTTGCTACAGAACCACTG
GCTTCTACTTCCACCGCTCAGAAGTCTCCCATTACTGGTGAGGTTGGGGATCCAGTTGAGCTCGATGATGTAGCAGAAAATATTATTGGTGAGGATGCTTCCCCA
GTGGTTGATCATGCAACAGAAGATATTATAGGGAACGATGGAGGACAAGATCAATTGTTGCCACAGAAAGGGACGAAGAAGAAGAAGAAGAAGTCGAAGCATAAG
TGGAGTCGGGAGCTGTGGAGGCTCGGCGACAGAGTGACGGCCATTGAGACAACTCTGACGAGCATGACGACTGACCTAAAGGACATAAAGAAGTATATGAAGAGG
CTAACAAAGGTTATGGCGAAGGGCCAGAATAAATCTGAAAGAAGGGGCGGTGGGCCGGATCAGGATGGTTCTTCGGGCGGACGTGATCCGAGTGGGCGTAACGAG
GAGGATACGGACATGGATGAGGATCCGAAGTTGCGGGCCAGAATAATTGAGGATATTACTCTAGGTGGGTGCGCCAGTGACAGGGAGGCAAGTAATGGGCGGATG
GTTAACGTAACTCATATTTACACCGTACTCCTCACGAATATCTTGGATGACGTCCTTCGGCCTGTTGCAATCTCTAATCTTAGTGGCTCTAAGTCGCCAAGTGCA
AGTAGGATCGACGCATCGCAGCAAGTATAG
mRNA sequence
ATGGACTTTCTGGGTTCCATCTCGGGACTCATATCGGTTCAAAATGGACCGGGACAAGACCGAGATGGTATCCAGAAAGTCTTTCTGGATTCCATCTTCCATCTC
GGGCCTCGTCTCGGTCCATTTCGAACCGGGATGTGTCCCAAGATGGAACCCAGAAAGTCTTTTGGATTCCATCTCGGGACACATCTCGGTTCATTTCGAACCGGG
ATGAGTCCCAAGATGGAATCTAGGAAATCCATCCTTGTGGAATGGACTTTCTGGATTCCATCTTGGGACTCATCCCGGTCCATTTCGAACCGAGATGAGGCCCGA
GATGGATTCCAGAAAGTCCATCCTTCTGGGATTTCCTTTCTGGATTCCATCCCAGGCTTCAACCCGGTCCGAGACATGAGTATGACACTTAAGATCAACCGAGAC
GACTGGTTTCCGGCCGCACTGTCAAACCTCGCTCACGTAGGGAAAACCTCTTCTCGTCTTAAGGCTAGGTTAACTTCCTCTCAGTTAGACATGTTTAGTCGAACA
TGTTTTGGTCCGATTTTAGGGATGGACGTCGTATTTAACGGTCCGTTGCTCCATCACCTGTTGCTTAGAGAGGTGGAGGAACCTAGAGAGGACCTCATTAGTTTT
AACCTATTCGGGAATAGGGTCTCTTTTGGGAAGCGGGAGTTCGACCTAATAACCGGTCTTAGACACACCATGAATAGGGTAGATGAGGATGTTCGTAACCGAAGA
CTTAAGATTCTGTATTTTGCAGACAAGGCGGGTGTGAAGTGTTCGGAGTTGGAGAAAATTTTTTTAGAACACTTATTCGAAAATGACGAGGACGCTGTGAAGGTT
GCTATAGTGTACTTCATAGAGCTTGCCATGATGAGAAAGGAAAGGAAACAGAAAATGGACACAAGCCTCTTTGGGATTGTGAATCGGTGGGAAGTTTTCTGTAAT
TATGACTGGAGTTCAATGATTTTTGAAAGAACTCTCTGGAGCTTGAAGAACGCTCTGAAGGACAAGGTCGAGGCGTACAAACAGAAGGTCGCTATGGACTCCAGC
CATGTTGAGACGTATAGCTTGTATGGGTTTCCATATGCTTTTCAGGTTTGGGCATACGAGACAATATCAACCTTGTCGACTCGAGTAGCATTGAGGCTAAATGAC
GATGCTATTCCTCGTCTACTTAGATGGTCCTGCACCTATTCGCGTGCTTTTAATGTTTTGGAGCGAGAGGTCTTCGAGAATCCCAAGTCGAAGGTTGTAGTTCGT
TTGGAGGCGACTGATGTCGAACGACAGCACATGGCTCGCATTATGCATCCACCGGTGGCCCATGTCAGACCTCTTGCACCCACAGAACTTGCTACAGAACCACTG
GCTTCTACTTCCACCGCTCAGAAGTCTCCCATTACTGGTGAGGTTGGGGATCCAGTTGAGCTCGATGATGTAGCAGAAAATATTATTGGTGAGGATGCTTCCCCA
GTGGTTGATCATGCAACAGAAGATATTATAGGGAACGATGGAGGACAAGATCAATTGTTGCCACAGAAAGGGACGAAGAAGAAGAAGAAGAAGTCGAAGCATAAG
TGGAGTCGGGAGCTGTGGAGGCTCGGCGACAGAGTGACGGCCATTGAGACAACTCTGACGAGCATGACGACTGACCTAAAGGACATAAAGAAGTATATGAAGAGG
CTAACAAAGGTTATGGCGAAGGGCCAGAATAAATCTGAAAGAAGGGGCGGTGGGCCGGATCAGGATGGTTCTTCGGGCGGACGTGATCCGAGTGGGCGTAACGAG
GAGGATACGGACATGGATGAGGATCCGAAGTTGCGGGCCAGAATAATTGAGGATATTACTCTAGGTGGGTGCGCCAGTGACAGGGAGGCAAGTAATGGGCGGATG
GTTAACGTAACTCATATTTACACCGTACTCCTCACGAATATCTTGGATGACGTCCTTCGGCCTGTTGCAATCTCTAATCTTAGTGGCTCTAAGTCGCCAAGTGCA
AGTAGGATCGACGCATCGCAGCAAGTATAG
Protein sequence
MDFLGSISGLISVQNGPGQDRDGIQKVFLDSIFHLGPRLGPFRTGMCPKMEPRKSFGFHLGTHLGSFRTGMSPKMESRKSILVEWTFWIPSWDSSRSISNRDEAR
DGFQKVHPSGISFLDSIPGFNPVRDMSMTLKINRDDWFPAALSNLAHVGKTSSRLKARLTSSQLDMFSRTCFGPILGMDVVFNGPLLHHLLLREVEEPREDLISF
NLFGNRVSFGKREFDLITGLRHTMNRVDEDVRNRRLKILYFADKAGVKCSELEKIFLEHLFENDEDAVKVAIVYFIELAMMRKERKQKMDTSLFGIVNRWEVFCN
YDWSSMIFERTLWSLKNALKDKVEAYKQKVAMDSSHVETYSLYGFPYAFQVWAYETISTLSTRVALRLNDDAIPRLLRWSCTYSRAFNVLEREVFENPKSKVVVR
LEATDVERQHMARIMHPPVAHVRPLAPTELATEPLASTSTAQKSPITGEVGDPVELDDVAENIIGEDASPVVDHATEDIIGNDGGQDQLLPQKGTKKKKKKSKHK
WSRELWRLGDRVTAIETTLTSMTTDLKDIKKYMKRLTKVMAKGQNKSERRGGGPDQDGSSGGRDPSGRNEEDTDMDEDPKLRARIIEDITLGGCASDREASNGRM
VNVTHIYTVLLTNILDDVLRPVAISNLSGSKSPSASRIDASQQV