; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC10g0182 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC10g0182
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionPeptidase_M48 domain-containing protein
Genome locationMC10:1342900..1349193
RNA-Seq ExpressionMC10g0182
SyntenyMC10g0182
Gene Ontology termsGO:0051603 - proteolysis involved in cellular protein catabolic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004222 - metalloendopeptidase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001915 - Peptidase M48


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576765.1 Embryo-specific protein ATS3B, partial [Cucurbita argyrosperma subsp. sororia]8.23e-28288.14Show/hide
Query:  MSCYRRSKFSFDAFRNLSSKIFPKDIIRDHPRSRISQKGSSFTAGKQSNSYGFQPASPIIRRFGQQVGENRRLYNPFLGDSKRFYYVDRYRVQHFKPRGP
        MSCYR+SK +FDAFRNLSSKIFP ++IRD+ +SRIS  G SFTAG+ SNSYGFQ +SPII+RFG+QV ENRRLYNPF GDSKRFYYVD YRVQHFKPRGP
Subjt:  MSCYRRSKFSFDAFRNLSSKIFPKDIIRDHPRSRISQKGSSFTAGKQSNSYGFQPASPIIRRFGQQVGENRRLYNPFLGDSKRFYYVDRYRVQHFKPRGP

Query:  RRWFQDPKTVMIVVFVGSGVFVTVYYGNLETIPYTKRRHFVILSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWS
        RRWFQDP+TV++VVF GSGVF+TVYYGNLETIPYTKRRHFV+LSRAMER LGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDII+ALQRGLKQENVWS
Subjt:  RRWFQDPKTVMIVVFVGSGVFVTVYYGNLETIPYTKRRHFVILSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWS

Query:  DLGYASEAVIGAPEGSGHETLMALSDAGAERVESKWSREDEILDDKWVERSRKKGQEQGSQADTSHLEGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLE
        DLGYASEA IGAPEGSG+ETLMAL D+GA ++E KW REDEILDDKWVERSRKKGQ+QGSQAD SHL+GL WEVLVVNE VVNAFCLPGGKIVVFTGLLE
Subjt:  DLGYASEAVIGAPEGSGHETLMALSDAGAERVESKWSREDEILDDKWVERSRKKGQEQGSQADTSHLEGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLE

Query:  HFRSDAEIATIIGHEVGHAVARHAAEGITKNLGFAILQIILYQFVMPDIVNTMSALFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPTVYEMLGKVTG
        HFRSDAEIATIIGHE+GHAVARHAAEGITKNL FA+LQ+ILYQFV PDIVNTMS LFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPTVYE LGKV+G
Subjt:  HFRSDAEIATIIGHEVGHAVARHAAEGITKNLGFAILQIILYQFVMPDIVNTMSALFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPTVYEMLGKVTG

Query:  DSALRDYLSTHPSGKKRAQLLAQAKVMEEALSVYREVRAGGGVEGFL
        DSALRDYLSTHPSGKKRAQLLA+AKVMEEALSVYREVRAG GVEGFL
Subjt:  DSALRDYLSTHPSGKKRAQLLAQAKVMEEALSVYREVRAGGGVEGFL

KAG7014804.1 Embryo-specific protein ATS3B [Cucurbita argyrosperma subsp. argyrosperma]3.15e-28188.14Show/hide
Query:  MSCYRRSKFSFDAFRNLSSKIFPKDIIRDHPRSRISQKGSSFTAGKQSNSYGFQPASPIIRRFGQQVGENRRLYNPFLGDSKRFYYVDRYRVQHFKPRGP
        MSCYR+SK +FDAFRNLSSKIFP ++IRD+ +SRIS  G SFTAG+ SNSYGFQ +SPII+RFG+QV ENRRLYNPF GDSKRFYYVD YRVQHFKPRGP
Subjt:  MSCYRRSKFSFDAFRNLSSKIFPKDIIRDHPRSRISQKGSSFTAGKQSNSYGFQPASPIIRRFGQQVGENRRLYNPFLGDSKRFYYVDRYRVQHFKPRGP

Query:  RRWFQDPKTVMIVVFVGSGVFVTVYYGNLETIPYTKRRHFVILSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWS
        RRWFQDP+TV++VVF GSGVF+TVYYGNLETIPYTKRRHFV+LSRAMER LGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDII+ALQRGLKQENVWS
Subjt:  RRWFQDPKTVMIVVFVGSGVFVTVYYGNLETIPYTKRRHFVILSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWS

Query:  DLGYASEAVIGAPEGSGHETLMALSDAGAERVESKWSREDEILDDKWVERSRKKGQEQGSQADTSHLEGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLE
        DLGYASEA IGAPEGSG+ETLMAL D+GA ++E KW REDEILDDKWVERSRKKGQ+QGSQAD SHL+GL WEVLVVNE VVNAFCLPGGKIVVFTGLLE
Subjt:  DLGYASEAVIGAPEGSGHETLMALSDAGAERVESKWSREDEILDDKWVERSRKKGQEQGSQADTSHLEGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLE

Query:  HFRSDAEIATIIGHEVGHAVARHAAEGITKNLGFAILQIILYQFVMPDIVNTMSALFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPTVYEMLGKVTG
        HFRSDAEIATIIGHE+GHAVARHAAEGITKNL FA+LQ+ILYQFV PDIVNTMS LFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPTVYE LGKV+G
Subjt:  HFRSDAEIATIIGHEVGHAVARHAAEGITKNLGFAILQIILYQFVMPDIVNTMSALFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPTVYEMLGKVTG

Query:  DSALRDYLSTHPSGKKRAQLLAQAKVMEEALSVYREVRAGGGVEGFL
        DSALRDYLSTHPSGKKRAQLLA+AKVMEEALSVYREVRAG GVEGFL
Subjt:  DSALRDYLSTHPSGKKRAQLLAQAKVMEEALSVYREVRAGGGVEGFL

XP_022140981.1 uncharacterized protein LOC111011501 [Momordica charantia]0.0100Show/hide
Query:  MSCYRRSKFSFDAFRNLSSKIFPKDIIRDHPRSRISQKGSSFTAGKQSNSYGFQPASPIIRRFGQQVGENRRLYNPFLGDSKRFYYVDRYRVQHFKPRGP
        MSCYRRSKFSFDAFRNLSSKIFPKDIIRDHPRSRISQKGSSFTAGKQSNSYGFQPASPIIRRFGQQVGENRRLYNPFLGDSKRFYYVDRYRVQHFKPRGP
Subjt:  MSCYRRSKFSFDAFRNLSSKIFPKDIIRDHPRSRISQKGSSFTAGKQSNSYGFQPASPIIRRFGQQVGENRRLYNPFLGDSKRFYYVDRYRVQHFKPRGP

Query:  RRWFQDPKTVMIVVFVGSGVFVTVYYGNLETIPYTKRRHFVILSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWS
        RRWFQDPKTVMIVVFVGSGVFVTVYYGNLETIPYTKRRHFVILSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWS
Subjt:  RRWFQDPKTVMIVVFVGSGVFVTVYYGNLETIPYTKRRHFVILSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWS

Query:  DLGYASEAVIGAPEGSGHETLMALSDAGAERVESKWSREDEILDDKWVERSRKKGQEQGSQADTSHLEGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLE
        DLGYASEAVIGAPEGSGHETLMALSDAGAERVESKWSREDEILDDKWVERSRKKGQEQGSQADTSHLEGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLE
Subjt:  DLGYASEAVIGAPEGSGHETLMALSDAGAERVESKWSREDEILDDKWVERSRKKGQEQGSQADTSHLEGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLE

Query:  HFRSDAEIATIIGHEVGHAVARHAAEGITKNLGFAILQIILYQFVMPDIVNTMSALFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPTVYEMLGKVTG
        HFRSDAEIATIIGHEVGHAVARHAAEGITKNLGFAILQIILYQFVMPDIVNTMSALFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPTVYEMLGKVTG
Subjt:  HFRSDAEIATIIGHEVGHAVARHAAEGITKNLGFAILQIILYQFVMPDIVNTMSALFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPTVYEMLGKVTG

Query:  DSALRDYLSTHPSGKKRAQLLAQAKVMEEALSVYREVRAGGGVEGFL
        DSALRDYLSTHPSGKKRAQLLAQAKVMEEALSVYREVRAGGGVEGFL
Subjt:  DSALRDYLSTHPSGKKRAQLLAQAKVMEEALSVYREVRAGGGVEGFL

XP_022922484.1 uncharacterized protein LOC111430479 isoform X1 [Cucurbita moschata]3.80e-28287.75Show/hide
Query:  MSCYRRS--KFSFDAFRNLSSKIFPKDIIRDHPRSRISQKGSSFTAGKQSNSYGFQPASPIIRRFGQQVGENRRLYNPFLGDSKRFYYVDRYRVQHFKPR
        MSCYR+S  K +FDAFRNLSSKIFP ++IRD  +SRIS  G SFTAG+ SNSYGFQ +SPII+RFG+QV ENRRLYNPF GDSKRFYYVD YRVQHFKPR
Subjt:  MSCYRRS--KFSFDAFRNLSSKIFPKDIIRDHPRSRISQKGSSFTAGKQSNSYGFQPASPIIRRFGQQVGENRRLYNPFLGDSKRFYYVDRYRVQHFKPR

Query:  GPRRWFQDPKTVMIVVFVGSGVFVTVYYGNLETIPYTKRRHFVILSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENV
        GPRRWFQDP+TV++VVF GSGVF+TVYYGNLETIPYTKRRHFV+LSRAMER LGESQFEQMKAAFKGKILPAVHPESVRVRLIAKD+I+ALQRGLKQENV
Subjt:  GPRRWFQDPKTVMIVVFVGSGVFVTVYYGNLETIPYTKRRHFVILSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENV

Query:  WSDLGYASEAVIGAPEGSGHETLMALSDAGAERVESKWSREDEILDDKWVERSRKKGQEQGSQADTSHLEGLNWEVLVVNEPVVNAFCLPGGKIVVFTGL
        WSDLGYASEA IGAPEGSG+ETLMAL D+GA ++E KW REDEILDDKWVERSRKKG++QGSQAD SHL+GL WEVLVVNE VVNAFCLPGGKIVVFTGL
Subjt:  WSDLGYASEAVIGAPEGSGHETLMALSDAGAERVESKWSREDEILDDKWVERSRKKGQEQGSQADTSHLEGLNWEVLVVNEPVVNAFCLPGGKIVVFTGL

Query:  LEHFRSDAEIATIIGHEVGHAVARHAAEGITKNLGFAILQIILYQFVMPDIVNTMSALFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPTVYEMLGKV
        LEHFRSDAEIATIIGHE+GHAVARHAAEGITKNL FA+LQ+ILYQFVMPDIVNTMS LFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPTVYE LGKV
Subjt:  LEHFRSDAEIATIIGHEVGHAVARHAAEGITKNLGFAILQIILYQFVMPDIVNTMSALFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPTVYEMLGKV

Query:  TGDSALRDYLSTHPSGKKRAQLLAQAKVMEEALSVYREVRAGGGVEGFL
        +GDSALRDYLSTHPSGKKRAQLLAQAKVMEEALSVYREVRAG GVEGFL
Subjt:  TGDSALRDYLSTHPSGKKRAQLLAQAKVMEEALSVYREVRAGGGVEGFL

XP_038877447.1 mitochondrial metalloendopeptidase OMA1 [Benincasa hispida]3.66e-28286.13Show/hide
Query:  MSCYRRSKFSFDAFRNLSSKIFPKDIIRDHPRSRISQKGSSFTAGKQSNSYGFQPASPIIRRFGQQVGENRRLYNPFLGDSKRFYYVDRYRVQHFKPRGP
        M+CYR+SKF+FDAFRN SSKIFPKD I+   RSRIS  G SF +GK SNS+GFQ  SPII+RFG+QVGE RR YNPF GDSKRFYYVDRYRVQHFKPRGP
Subjt:  MSCYRRSKFSFDAFRNLSSKIFPKDIIRDHPRSRISQKGSSFTAGKQSNSYGFQPASPIIRRFGQQVGENRRLYNPFLGDSKRFYYVDRYRVQHFKPRGP

Query:  RRWFQDPKTVMIVVFVGSGVFVTVYYGNLETIPYTKRRHFVILSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWS
        RRWFQDP+TV+IVV  GSGVF+TVYYGNLET+PYTKRRHFV+LSR MERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGL+QENVWS
Subjt:  RRWFQDPKTVMIVVFVGSGVFVTVYYGNLETIPYTKRRHFVILSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWS

Query:  DLGYASEAVIGAPEGSGHETLMALSDAGAERVESKWSREDEILDDKWVERSRKKGQEQGSQADTSHLEGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLE
        DLGYASEAVIG PEGSG ETLMAL D+GAE++E KW REDEI DDKWVE SRKKGQE+GSQA+TSHL+GLNWE+LVVNEPVVNAFCLPGGKIV+FTGLLE
Subjt:  DLGYASEAVIGAPEGSGHETLMALSDAGAERVESKWSREDEILDDKWVERSRKKGQEQGSQADTSHLEGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLE

Query:  HFRSDAEIATIIGHEVGHAVARHAAEGITKNLGFAILQIILYQFVMPDIVNTMSALFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPTVYEMLGKVTG
        HFRSDAEIATIIGHE+GHAVARH AEG+TKNLGF+ILQ+ILYQFVMPDIVN MS LFLRLPFSRRME+EADYIGLLLIASAGYDPR+APTVYE LGK+TG
Subjt:  HFRSDAEIATIIGHEVGHAVARHAAEGITKNLGFAILQIILYQFVMPDIVNTMSALFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPTVYEMLGKVTG

Query:  DSALRDYLSTHPSGKKRAQLLAQAKVMEEALSVYREVRAGGGVEGFL
        +SALRDYLSTHPSGKKRAQLLAQAKVMEEAL++YREVRAG GVEGFL
Subjt:  DSALRDYLSTHPSGKKRAQLLAQAKVMEEALSVYREVRAGGGVEGFL

TrEMBL top hitse value%identityAlignment
A0A0A0LB58 Peptidase_M48 domain-containing protein5.01e-27684.79Show/hide
Query:  MSCYRRSKFSFDAFRNLSSKIFPKDIIRDHPRSRISQKGSSFTAGKQSNSYGFQPASPIIRRFGQQVGENRRLYNPFLGDSKRFYYVDRYRVQHFKPRGP
        M+C+R+SKF FDAFRN SSKIFPKD+I+   RSRIS  G SF++GK SNS+GFQ  SPI+RRFG+ +G   R YNPF GDSKRFYYVDRYR+QHFKPRGP
Subjt:  MSCYRRSKFSFDAFRNLSSKIFPKDIIRDHPRSRISQKGSSFTAGKQSNSYGFQPASPIIRRFGQQVGENRRLYNPFLGDSKRFYYVDRYRVQHFKPRGP

Query:  RRWFQDPKTVMIVVFVGSGVFVTVYYGNLETIPYTKRRHFVILSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWS
        RRWFQDP+T++IVV +GSGVF+TVYYGNLET+PYTKRRHFV+LS+ MER++GES+FEQMKAAFKGKILPA+HPESVRVRLIAKDIIEALQRGL+QENVW+
Subjt:  RRWFQDPKTVMIVVFVGSGVFVTVYYGNLETIPYTKRRHFVILSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWS

Query:  DLGYASEAVIGAPEGSGHETLMALSDAGAERVESKWSREDEILDDKWVERSRKKGQEQGSQADTSHLEGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLE
        DLGYASEAVIGAPEGSGHETLMAL D+G+E++E KW REDEILDDKWVE SRKKGQ  GSQA+TSHL+GLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLE
Subjt:  DLGYASEAVIGAPEGSGHETLMALSDAGAERVESKWSREDEILDDKWVERSRKKGQEQGSQADTSHLEGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLE

Query:  HFRSDAEIATIIGHEVGHAVARHAAEGITKNLGFAILQIILYQFVMPDIVNTMSALFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPTVYEMLGKVTG
        HFRSDAEIATIIGHEV HAVARH+AEGITKNLGFA+LQ+ILYQF+MPDIVNTMS LFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAP VYE LGKVTG
Subjt:  HFRSDAEIATIIGHEVGHAVARHAAEGITKNLGFAILQIILYQFVMPDIVNTMSALFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPTVYEMLGKVTG

Query:  DSALRDYLSTHPSGKKRAQLLAQAKVMEEALSVYREVRAGGGVEGFL
        DSALRDYLSTHPSGKKRAQLLAQAKVMEEALS+YREVRAG G+EGFL
Subjt:  DSALRDYLSTHPSGKKRAQLLAQAKVMEEALSVYREVRAGGGVEGFL

A0A5D3E3G6 Putative peptidase5.58e-27384.79Show/hide
Query:  MSCYRRSKFSFDAFRNLSSKIFPKDIIRDHPRSRISQKGSSFTAGKQSNSYGFQPASPIIRRFGQQVGENRRLYNPFLGDSKRFYYVDRYRVQHFKPRGP
        M+C R+SKF FDAFRNLSSKIFPKD+I+   RSRIS  G SF +GK SNS+GFQ  SPI++RFG    E RR YNPF GDSKRFYYVDRYRVQHFKPRGP
Subjt:  MSCYRRSKFSFDAFRNLSSKIFPKDIIRDHPRSRISQKGSSFTAGKQSNSYGFQPASPIIRRFGQQVGENRRLYNPFLGDSKRFYYVDRYRVQHFKPRGP

Query:  RRWFQDPKTVMIVVFVGSGVFVTVYYGNLETIPYTKRRHFVILSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWS
        RRWFQDP+T++IVV  GSGVF+TVYYGNLETIPYTKRRHFV+LS+ MER++GES+FEQMKAAFKGKILPA+HPESVR+RLIAKDIIEALQRGL+QENVWS
Subjt:  RRWFQDPKTVMIVVFVGSGVFVTVYYGNLETIPYTKRRHFVILSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWS

Query:  DLGYASEAVIGAPEGSGHETLMALSDAGAERVESKWSREDEILDDKWVERSRKKGQEQGSQADTSHLEGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLE
        DLGYASEAVIGAPEGSGHETL+AL D+G E++E KW REDEILDDKWVE SRKKGQ  GSQ +TSHL+GLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLE
Subjt:  DLGYASEAVIGAPEGSGHETLMALSDAGAERVESKWSREDEILDDKWVERSRKKGQEQGSQADTSHLEGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLE

Query:  HFRSDAEIATIIGHEVGHAVARHAAEGITKNLGFAILQIILYQFVMPDIVNTMSALFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPTVYEMLGKVTG
        HF +DAEIATIIGHEV HAVARHAAEGITKNLGFA+LQIILYQFVMPDIVNTMS LFLRLPFSRRMEMEADYIGLLL+ASAGYDPRVAP VYE LGKVTG
Subjt:  HFRSDAEIATIIGHEVGHAVARHAAEGITKNLGFAILQIILYQFVMPDIVNTMSALFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPTVYEMLGKVTG

Query:  DSALRDYLSTHPSGKKRAQLLAQAKVMEEALSVYREVRAGGGVEGFL
        +SALRDYLSTHPSGKKRAQLLAQAKVMEEALS+YREVRAG GV+GFL
Subjt:  DSALRDYLSTHPSGKKRAQLLAQAKVMEEALSVYREVRAGGGVEGFL

A0A6J1CHM3 uncharacterized protein LOC1110115010.0100Show/hide
Query:  MSCYRRSKFSFDAFRNLSSKIFPKDIIRDHPRSRISQKGSSFTAGKQSNSYGFQPASPIIRRFGQQVGENRRLYNPFLGDSKRFYYVDRYRVQHFKPRGP
        MSCYRRSKFSFDAFRNLSSKIFPKDIIRDHPRSRISQKGSSFTAGKQSNSYGFQPASPIIRRFGQQVGENRRLYNPFLGDSKRFYYVDRYRVQHFKPRGP
Subjt:  MSCYRRSKFSFDAFRNLSSKIFPKDIIRDHPRSRISQKGSSFTAGKQSNSYGFQPASPIIRRFGQQVGENRRLYNPFLGDSKRFYYVDRYRVQHFKPRGP

Query:  RRWFQDPKTVMIVVFVGSGVFVTVYYGNLETIPYTKRRHFVILSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWS
        RRWFQDPKTVMIVVFVGSGVFVTVYYGNLETIPYTKRRHFVILSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWS
Subjt:  RRWFQDPKTVMIVVFVGSGVFVTVYYGNLETIPYTKRRHFVILSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWS

Query:  DLGYASEAVIGAPEGSGHETLMALSDAGAERVESKWSREDEILDDKWVERSRKKGQEQGSQADTSHLEGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLE
        DLGYASEAVIGAPEGSGHETLMALSDAGAERVESKWSREDEILDDKWVERSRKKGQEQGSQADTSHLEGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLE
Subjt:  DLGYASEAVIGAPEGSGHETLMALSDAGAERVESKWSREDEILDDKWVERSRKKGQEQGSQADTSHLEGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLE

Query:  HFRSDAEIATIIGHEVGHAVARHAAEGITKNLGFAILQIILYQFVMPDIVNTMSALFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPTVYEMLGKVTG
        HFRSDAEIATIIGHEVGHAVARHAAEGITKNLGFAILQIILYQFVMPDIVNTMSALFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPTVYEMLGKVTG
Subjt:  HFRSDAEIATIIGHEVGHAVARHAAEGITKNLGFAILQIILYQFVMPDIVNTMSALFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPTVYEMLGKVTG

Query:  DSALRDYLSTHPSGKKRAQLLAQAKVMEEALSVYREVRAGGGVEGFL
        DSALRDYLSTHPSGKKRAQLLAQAKVMEEALSVYREVRAGGGVEGFL
Subjt:  DSALRDYLSTHPSGKKRAQLLAQAKVMEEALSVYREVRAGGGVEGFL

A0A6J1E3I5 uncharacterized protein LOC111430479 isoform X11.84e-28287.75Show/hide
Query:  MSCYRRS--KFSFDAFRNLSSKIFPKDIIRDHPRSRISQKGSSFTAGKQSNSYGFQPASPIIRRFGQQVGENRRLYNPFLGDSKRFYYVDRYRVQHFKPR
        MSCYR+S  K +FDAFRNLSSKIFP ++IRD  +SRIS  G SFTAG+ SNSYGFQ +SPII+RFG+QV ENRRLYNPF GDSKRFYYVD YRVQHFKPR
Subjt:  MSCYRRS--KFSFDAFRNLSSKIFPKDIIRDHPRSRISQKGSSFTAGKQSNSYGFQPASPIIRRFGQQVGENRRLYNPFLGDSKRFYYVDRYRVQHFKPR

Query:  GPRRWFQDPKTVMIVVFVGSGVFVTVYYGNLETIPYTKRRHFVILSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENV
        GPRRWFQDP+TV++VVF GSGVF+TVYYGNLETIPYTKRRHFV+LSRAMER LGESQFEQMKAAFKGKILPAVHPESVRVRLIAKD+I+ALQRGLKQENV
Subjt:  GPRRWFQDPKTVMIVVFVGSGVFVTVYYGNLETIPYTKRRHFVILSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENV

Query:  WSDLGYASEAVIGAPEGSGHETLMALSDAGAERVESKWSREDEILDDKWVERSRKKGQEQGSQADTSHLEGLNWEVLVVNEPVVNAFCLPGGKIVVFTGL
        WSDLGYASEA IGAPEGSG+ETLMAL D+GA ++E KW REDEILDDKWVERSRKKG++QGSQAD SHL+GL WEVLVVNE VVNAFCLPGGKIVVFTGL
Subjt:  WSDLGYASEAVIGAPEGSGHETLMALSDAGAERVESKWSREDEILDDKWVERSRKKGQEQGSQADTSHLEGLNWEVLVVNEPVVNAFCLPGGKIVVFTGL

Query:  LEHFRSDAEIATIIGHEVGHAVARHAAEGITKNLGFAILQIILYQFVMPDIVNTMSALFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPTVYEMLGKV
        LEHFRSDAEIATIIGHE+GHAVARHAAEGITKNL FA+LQ+ILYQFVMPDIVNTMS LFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPTVYE LGKV
Subjt:  LEHFRSDAEIATIIGHEVGHAVARHAAEGITKNLGFAILQIILYQFVMPDIVNTMSALFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPTVYEMLGKV

Query:  TGDSALRDYLSTHPSGKKRAQLLAQAKVMEEALSVYREVRAGGGVEGFL
        +GDSALRDYLSTHPSGKKRAQLLAQAKVMEEALSVYREVRAG GVEGFL
Subjt:  TGDSALRDYLSTHPSGKKRAQLLAQAKVMEEALSVYREVRAGGGVEGFL

A0A6J1G2G4 uncharacterized protein LOC1114500443.68e-27585.91Show/hide
Query:  MSCYRRSKFSFDAFRNLSSKIFPKDIIRDHPRSRISQKGSSFTAGKQSNSYGFQPASPIIRRFGQQVGENRRLYNPFLGDSKRFYYVDRYRVQHFKPRGP
        MSCYR+SKF+ DAFR+LSSKIFPKD IRD   SRIS   +SFTA         Q ASPII+RFG+QVGENR+L NPFLG SKRFYYVDRYRV+HFKPRGP
Subjt:  MSCYRRSKFSFDAFRNLSSKIFPKDIIRDHPRSRISQKGSSFTAGKQSNSYGFQPASPIIRRFGQQVGENRRLYNPFLGDSKRFYYVDRYRVQHFKPRGP

Query:  RRWFQDPKTVMIVVFVGSGVFVTVYYGNLETIPYTKRRHFVILSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWS
        RRWF+DP+ ++IVV  GSGV VTVYYGNLETIPYTKRRHFV+LSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWS
Subjt:  RRWFQDPKTVMIVVFVGSGVFVTVYYGNLETIPYTKRRHFVILSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWS

Query:  DLGYASEAVIGAPEGSGHETLMALSDAGAERVESKWSREDEILDDKWVERSRKKGQEQGSQADTSHLEGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLE
        DLGYASEAV+GAPEGSGHETLMAL  +GAE++E KW REDE+LDDKWVE SRKKGQE+GSQA+TSHL+GLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLE
Subjt:  DLGYASEAVIGAPEGSGHETLMALSDAGAERVESKWSREDEILDDKWVERSRKKGQEQGSQADTSHLEGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLE

Query:  HFRSDAEIATIIGHEVGHAVARHAAEGITKNLGFAILQIILYQFVMPDIVNTMSALFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPTVYEMLGKVTG
        HFR DAEIATIIGHEVGHAVARH+AEGITKNLGFA+LQ+ILYQF+MPDIVNTMS LFLRLPFSR+MEMEADYIGLLLIASAGYDPRVAP VYE LGKVTG
Subjt:  HFRSDAEIATIIGHEVGHAVARHAAEGITKNLGFAILQIILYQFVMPDIVNTMSALFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPTVYEMLGKVTG

Query:  DSALRDYLSTHPSGKKRAQLLAQAKVMEEALSVYREVRAGGGVEGFL
        +SALRDYLSTHPSGKKRAQLLAQAKVMEEAL+VYRE RAG GVEGFL
Subjt:  DSALRDYLSTHPSGKKRAQLLAQAKVMEEALSVYREVRAGGGVEGFL

SwissProt top hitse value%identityAlignment
E9QBI7 Metalloendopeptidase OMA1, mitochondrial5.5e-1931.72Show/hide
Query:  DTSHLEGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRSDAEIATIIGHEVGHAVARHAAEGITKNLGFAILQIILYQFVMPD------------IV
        D + +  + W V VV+ P +NAF LP G+I VFTG+L       ++  I+GHE+ HA+  HAAE  + +    +L ++L   +               I 
Subjt:  DTSHLEGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRSDAEIATIIGHEVGHAVARHAAEGITKNLGFAILQIILYQFVMPD------------IV

Query:  NTMSALFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPTVYEML---GKVTGDSALRDYLSTHPSGKKRAQLLAQAKVMEEALSV
          +       PFSR++E EAD +GL + A A  D R  P  +E +    +++G   + ++LSTHPS + R + L   +++ EAL +
Subjt:  NTMSALFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPTVYEML---GKVTGDSALRDYLSTHPSGKKRAQLLAQAKVMEEALSV

P36163 Mitochondrial metalloendopeptidase OMA12.2e-2827.25Show/hide
Query:  MIVVFVGSGVFVTVYYGNLETIPYTKRRHFVILSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWSDLGYASEAVI
        + ++F G  +F   YY +L+  P + R  F+ +SR +E  +G   ++ +    + +ILP  HP S+++  I   I+EA               Y   +V 
Subjt:  MIVVFVGSGVFVTVYYGNLETIPYTKRRHFVILSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWSDLGYASEAVI

Query:  GAPEGSGHETLMALSDAGAERVESKWSREDEILDDKWVERSRKKGQEQGSQADTSHLEGLNWEVLVVNEPVV--NAFCLPGGKIVVFTGLLEHFRSDAEI
                                                            D S L+G+ WE+ VVN+P    NAF LPGGK+ +F+ +L    +D  I
Subjt:  GAPEGSGHETLMALSDAGAERVESKWSREDEILDDKWVERSRKKGQEQGSQADTSHLEGLNWEVLVVNEPVV--NAFCLPGGKIVVFTGLLEHFRSDAEI

Query:  ATIIGHEVGHAVARHAAEGITKNLGFAILQIILYQFVMPDIVNT-MSALFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPTVYEMLGKV-----TGDS
        AT++ HE  H +ARH AE ++K   +++L ++LY       +N  +   FLR+P SR+ME EADYIGL++++ A + P+ +  V+E +         G  
Subjt:  ATIIGHEVGHAVARHAAEGITKNLGFAILQIILYQFVMPDIVNT-MSALFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPTVYEMLGKV-----TGDS

Query:  ALRDYLSTHPSGKKRAQLLAQAKVMEEALSVYRE
           ++LSTHP+  +R + +  +K + +A  +Y +
Subjt:  ALRDYLSTHPSGKKRAQLLAQAKVMEEALSVYRE

Q5A663 Mitochondrial metalloendopeptidase OMA12.3e-1722.64Show/hide
Query:  VFVGSGVFVTVYYGNLETIPYTKRRHFVILSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWSDLGYASEAVIGAP
        +++G G+ +  Y  NL   PYT R  F+ +   +E ++G+  + Q+   F+ +ILP  +P   RV  I   +++                          
Subjt:  VFVGSGVFVTVYYGNLETIPYTKRRHFVILSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWSDLGYASEAVIGAP

Query:  EGSGHETLMALSDAGAERVESKWSREDEILDDKWVERSRKKGQEQGSQADTSHLEGLNWEVLVVNEPVV--NAFCLPGGKIVVFTGLLEHFRSDAEIATI
                +AL+D   + + +++                            +HL+ L WE+ ++    +  NAF LP GKI +F+ ++   +++  +AT+
Subjt:  EGSGHETLMALSDAGAERVESKWSREDEILDDKWVERSRKKGQEQGSQADTSHLEGLNWEVLVVNEPVV--NAFCLPGGKIVVFTGLLEHFRSDAEIATI

Query:  IGHEVGHAVARHAAEGITKNLGFAILQIILYQFVMPDIVNTMSAL-FLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPTVYEMLGKVTGDSA-------
        + HE+ H +A+H++E ++K   + +L  ILY        N +     L +  SR ME EAD+IG  L+A A ++P+ +   +  + +    +A       
Subjt:  IGHEVGHAVARHAAEGITKNLGFAILQIILYQFVMPDIVNTMSAL-FLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPTVYEMLGKVTGDSA-------

Query:  ----LRDYLSTHPSGKKR
              ++ STHP+  +R
Subjt:  ----LRDYLSTHPSGKKR

Q96E52 Metalloendopeptidase OMA1, mitochondrial1.2e-1832.8Show/hide
Query:  DTSHLEGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRSDAEIATIIGHEVGHAVARHAAE--GITKNLGF--AILQIILYQFVMPD--------IV
        D   +  +NW + VV+ P++NAF LP G++ VFTG L       +++ ++GHE+ HAV  HAAE  G+   L F   I   +++     D        I 
Subjt:  DTSHLEGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRSDAEIATIIGHEVGHAVARHAAE--GITKNLGF--AILQIILYQFVMPD--------IV

Query:  NTMSALFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPTVY---EMLGKVTGDSALRDYLSTHPSGKKRAQLLAQAKVMEEALSV
        + +       P+SR++E EAD IGLLL A A  D R +   +   E +  + G   + ++LSTHPS   R + L   +++ +AL +
Subjt:  NTMSALFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPTVY---EMLGKVTGDSALRDYLSTHPSGKKRAQLLAQAKVMEEALSV

Q9P7G4 Mitochondrial metalloendopeptidase OMA12.3e-2538.67Show/hide
Query:  SHLEGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRSDAEIATIIGHEVGHAVARHAAEGITKNLGFAILQIILYQFVMPDIVNTMSALFLR----L
        S +  L WE+ V+ +P  NAF LPGGK+ VF G+L   + +  +A ++ HE  H VARH+AE I      A+  I+       D+   +S   L     L
Subjt:  SHLEGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRSDAEIATIIGHEVGHAVARHAAEGITKNLGFAILQIILYQFVMPDIVNTMSALFLR----L

Query:  PFSRRMEMEADYIGLLLIASAGYDPRVAPTVYEMLGKVTGD-SALRDYLSTHPSGKKR----AQLLAQAKVMEEALSVYRE
        PFSR+ME EADYIGL+L++ A +DP  A T++E +    G       + STHPS KKR     + L +A+V  E    Y E
Subjt:  PFSRRMEMEADYIGLLLIASAGYDPRVAPTVYEMLGKVTGD-SALRDYLSTHPSGKKR----AQLLAQAKVMEEALSVYRE

Arabidopsis top hitse value%identityAlignment
AT5G51740.1 Peptidase family M48 family protein2.0e-15764.52Show/hide
Query:  MSCYRRSKFSFDAF-RNLSSKIFPKDIIRDHPRSRISQK-GSSFTAGKQSNSYGFQPASPIIRRFGQQVGENRRLYNPFLGDSKRFYYVDRYRVQHFKPR
        MS YRR+K  FD+  RN++ KI P    R H  SRI+   GSS  + K S+    +         G+    NR  YNPFL   KR+YYVDRY+V+HFKPR
Subjt:  MSCYRRSKFSFDAF-RNLSSKIFPKDIIRDHPRSRISQK-GSSFTAGKQSNSYGFQPASPIIRRFGQQVGENRRLYNPFLGDSKRFYYVDRYRVQHFKPR

Query:  GPRRWFQDPKTVMIVVFVGSGVFVTVYYGNLETIPYTKRRHFVILSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENV
        GP RWFQ+P+TV  VV VGS   +T+  GN ETIPYTKR HF++LS+ ME+ LGE+QFEQ+K  ++GKILPA HPES+RVRLIAK++I+ALQRGL  E V
Subjt:  GPRRWFQDPKTVMIVVFVGSGVFVTVYYGNLETIPYTKRRHFVILSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENV

Query:  WSDLGYAS-EAVIGAPEGSGHETLMALSDAGAERVESKWSREDEILDDKWVERSRKKGQEQGSQADTSHLEGLNWEVLVVNEPVVNAFCLPGGKIVVFTG
        WSDLGYAS E+ +G     G + +           + KWS+ED++LDD+W+++SRKK  +  + A TSHLEG++WEVLVVNEP+VNAFCLP GKIVVFTG
Subjt:  WSDLGYAS-EAVIGAPEGSGHETLMALSDAGAERVESKWSREDEILDDKWVERSRKKGQEQGSQADTSHLEGLNWEVLVVNEPVVNAFCLPGGKIVVFTG

Query:  LLEHFRSDAEIATIIGHEVGHAVARHAAEGITKNLGFAILQIILYQFVMPDIVNTMSALFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPTVYEMLGK
        LL HF+SDAE+AT+IGHEVGHAVARH AEGITKNL FAILQ++LYQFVMPD+VNTMSALFLRLPFSR+ME+EADYIGLLL+ASAGYDPRVAPTVYE LGK
Subjt:  LLEHFRSDAEIATIIGHEVGHAVARHAAEGITKNLGFAILQIILYQFVMPDIVNTMSALFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPTVYEMLGK

Query:  VTGDSALRDYLSTHPSGKKRAQLLAQAKVMEEALSVYREVRAG-GGVEGFL
        + GD AL DYLSTHPSGKKR++LLAQA VMEEAL +YREV+AG  GVEGFL
Subjt:  VTGDSALRDYLSTHPSGKKRAQLLAQAKVMEEALSVYREVRAG-GGVEGFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTGCTACAGAAGATCCAAGTTTTCATTTGATGCATTTCGGAACCTGTCCTCAAAGATTTTTCCCAAGGATATAATTCGAGATCATCCTAGGTCAAGAATTTCCCA
AAAGGGCTCTTCGTTTACGGCTGGGAAACAGTCTAATTCTTATGGTTTCCAACCGGCTTCTCCAATAATACGAAGATTTGGTCAGCAAGTTGGAGAGAATAGGAGGCTAT
ACAATCCCTTCTTGGGTGATTCCAAGAGATTTTACTATGTTGATCGCTACCGCGTCCAGCATTTTAAGCCCAGAGGACCTCGGCGATGGTTTCAAGATCCAAAAACTGTA
ATGATAGTCGTGTTTGTGGGTTCTGGAGTATTTGTCACTGTGTATTATGGGAATTTAGAAACCATACCTTATACAAAACGAAGGCATTTTGTGATCTTGTCCAGAGCTAT
GGAGAGGAGGCTGGGAGAGTCGCAATTTGAGCAGATGAAGGCAGCTTTTAAGGGTAAAATATTGCCGGCCGTACACCCAGAAAGTGTGAGGGTAAGATTGATAGCTAAGG
ATATAATTGAGGCATTACAAAGAGGGTTGAAGCAAGAGAATGTGTGGAGTGATTTGGGCTATGCATCAGAGGCTGTGATAGGAGCACCTGAAGGCAGTGGCCATGAGACA
TTGATGGCGCTTAGCGATGCCGGGGCCGAGAGGGTGGAAAGTAAATGGTCCCGCGAAGATGAGATTCTTGATGACAAATGGGTTGAACGCAGTAGAAAGAAGGGTCAGGA
ACAGGGGTCCCAAGCAGATACCTCGCATTTGGAAGGATTGAATTGGGAAGTCTTGGTGGTCAATGAACCGGTTGTTAATGCATTCTGCTTGCCCGGTGGGAAGATCGTTG
TCTTCACTGGCTTGCTCGAGCACTTTAGAAGTGATGCAGAAATTGCAACTATTATTGGTCATGAGGTTGGGCATGCTGTGGCTCGACATGCTGCAGAGGGTATCACAAAG
AACCTGGGGTTTGCCATTTTACAAATTATCCTTTATCAGTTCGTCATGCCTGATATTGTCAACACTATGTCAGCTCTTTTCTTGAGGCTTCCATTCTCTAGACGGATGGA
AATGGAAGCGGATTACATTGGTCTGCTTTTGATAGCCTCCGCTGGATATGACCCTAGGGTTGCACCCACAGTATACGAGATGTTGGGTAAGGTAACCGGTGACTCGGCTC
TGAGAGATTATCTTTCTACTCATCCATCTGGAAAGAAGAGAGCTCAGTTGCTAGCTCAAGCCAAGGTTATGGAGGAAGCACTTAGCGTTTACAGAGAAGTAAGAGCCGGT
GGTGGGGTTGAAGGCTTCCTATAG
mRNA sequenceShow/hide mRNA sequence
CTGACACCGACCCTTCCAAGTTCCACGTGGACTAACGAGAGATTTTCAGAAAAAGAAAAAGAAAAAAAAAGGACGAACGATGGAGAGGGAGAGAGTTTTGATTCTTTGAA
TACAAGTGTGGCCTTCCACAAACCGCCTCGCGTACGATTTCTCTGCCTCTGTAATCTTCCGCGTGTTCGTCTTCTCCAGGAACTCCTCAGCGCCAAAGAAGCAGACGGAA
GAAGAAATGAAGAGACTACTCTGTTTTCTGCTCAGCTCCGTCTTCCTTTTCGCTCTTTCAGAGGCCACCCAATTGCTGCCTCAACCCGCTGAATCCTTCAATTTCAATTT
AACATATATTCAGCAAGCCGGGAGTTGCTCTTACTCGGTTGTTATATCGACGAGCTGTTCGTCACCTTCTTACACTAGGGATCACATCAGTCTTTCTTTCGGCGATGCTT
ATGGCAACCAGATTTATGTGCCAAGGATTGATGATCCATCCAGAAGAACATTTGAAAGATGTTCAACCGATACATTTCACATAAATGGACCTTGTGCTTACCAAATATGC
TACGTCTATCTTTATCGCTCTGGACCAGATGCCTGGATCCCAACAACAGTGAAAATCTCTGGTGATAATTCTAGACCCATCACATTTAACTACAACACTGCCATACCAAA
TGACGTATGGTTTGGGTTTAACTTGTGTGGACATGCTTCGTCTTCAAACCGTTTATCAAGTTGTATGTGGTTCATATACGTCATTGGAGTGTGGATTCTCGCTCTTCTCT
TTGCGCAGCAGCCCAATTACCCAAAAGAGATTGAAGAAACGAAAGAAATGAAGACTCGGGTAGAACTAGAAGCTTAGAAGATTCTCGACCTGAAGGCTGAATCTGATCCA
TTTCGAAGGCAAAATATTAAGATGATGCGTTCATCCATGCCCCAGCGGCGCCAAATCGTCGAAGTATCGGCTCGCCACCGTCGGGATGACCATGCAACCAAAAACTTCCA
CATTAATCCGGATTCCGATTTATCATCGTTCGATCCCTCTTTTTCTGTTTCTTCTTCCACTTCTCTTCCTCGCTTTTGATCCTACTTTCCCCTCGTCGGGAAGATTCTCA
ACAGATGAATTCGGATACTCTTTTTTGAGAGACTTATTTTCTTCTAATTGAATCTGATCTATTGGATTTCACATGTGTTTTTATTAATCTAGAGATATGAGTTGCTACAG
AAGATCCAAGTTTTCATTTGATGCATTTCGGAACCTGTCCTCAAAGATTTTTCCCAAGGATATAATTCGAGATCATCCTAGGTCAAGAATTTCCCAAAAGGGCTCTTCGT
TTACGGCTGGGAAACAGTCTAATTCTTATGGTTTCCAACCGGCTTCTCCAATAATACGAAGATTTGGTCAGCAAGTTGGAGAGAATAGGAGGCTATACAATCCCTTCTTG
GGTGATTCCAAGAGATTTTACTATGTTGATCGCTACCGCGTCCAGCATTTTAAGCCCAGAGGACCTCGGCGATGGTTTCAAGATCCAAAAACTGTAATGATAGTCGTGTT
TGTGGGTTCTGGAGTATTTGTCACTGTGTATTATGGGAATTTAGAAACCATACCTTATACAAAACGAAGGCATTTTGTGATCTTGTCCAGAGCTATGGAGAGGAGGCTGG
GAGAGTCGCAATTTGAGCAGATGAAGGCAGCTTTTAAGGGTAAAATATTGCCGGCCGTACACCCAGAAAGTGTGAGGGTAAGATTGATAGCTAAGGATATAATTGAGGCA
TTACAAAGAGGGTTGAAGCAAGAGAATGTGTGGAGTGATTTGGGCTATGCATCAGAGGCTGTGATAGGAGCACCTGAAGGCAGTGGCCATGAGACATTGATGGCGCTTAG
CGATGCCGGGGCCGAGAGGGTGGAAAGTAAATGGTCCCGCGAAGATGAGATTCTTGATGACAAATGGGTTGAACGCAGTAGAAAGAAGGGTCAGGAACAGGGGTCCCAAG
CAGATACCTCGCATTTGGAAGGATTGAATTGGGAAGTCTTGGTGGTCAATGAACCGGTTGTTAATGCATTCTGCTTGCCCGGTGGGAAGATCGTTGTCTTCACTGGCTTG
CTCGAGCACTTTAGAAGTGATGCAGAAATTGCAACTATTATTGGTCATGAGGTTGGGCATGCTGTGGCTCGACATGCTGCAGAGGGTATCACAAAGAACCTGGGGTTTGC
CATTTTACAAATTATCCTTTATCAGTTCGTCATGCCTGATATTGTCAACACTATGTCAGCTCTTTTCTTGAGGCTTCCATTCTCTAGACGGATGGAAATGGAAGCGGATT
ACATTGGTCTGCTTTTGATAGCCTCCGCTGGATATGACCCTAGGGTTGCACCCACAGTATACGAGATGTTGGGTAAGGTAACCGGTGACTCGGCTCTGAGAGATTATCTT
TCTACTCATCCATCTGGAAAGAAGAGAGCTCAGTTGCTAGCTCAAGCCAAGGTTATGGAGGAAGCACTTAGCGTTTACAGAGAAGTAAGAGCCGGTGGTGGGGTTGAAGG
CTTCCTATAGGACTCCCAAAACTGCCCTAGAAATTCAAAACAAGTTGATGCTGAGTGGAAGTAGATAGACTTTCCTCCTGTATATTGTGCTATTAATTGTTTAAGCATGA
TATTAGATTTATAATTTTTTTTTTTTATTTCAGCCATATGATCAAATGTTTCAGTGATATCTGGGAGTTGATTTACTTGCTTTAGACTTTTCTCCTGAAAACCCCTTTAC
CCTTCCACCAGGTTGAATTAGCGATATCCATAGAAACTTGTTATTTTAAAAGAATGAAACTACAATAACACTCACGGTGAATTTGGGATAGCTTTTGAGAAAAATGCTTT
CCAGTGAGGATCCTGCACGAGTTC
Protein sequenceShow/hide protein sequence
MSCYRRSKFSFDAFRNLSSKIFPKDIIRDHPRSRISQKGSSFTAGKQSNSYGFQPASPIIRRFGQQVGENRRLYNPFLGDSKRFYYVDRYRVQHFKPRGPRRWFQDPKTV
MIVVFVGSGVFVTVYYGNLETIPYTKRRHFVILSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWSDLGYASEAVIGAPEGSGHET
LMALSDAGAERVESKWSREDEILDDKWVERSRKKGQEQGSQADTSHLEGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRSDAEIATIIGHEVGHAVARHAAEGITK
NLGFAILQIILYQFVMPDIVNTMSALFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPTVYEMLGKVTGDSALRDYLSTHPSGKKRAQLLAQAKVMEEALSVYREVRAG
GGVEGFL