CuGenDBv2

Carg17223 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg17223
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: Peptidase_M48 domain-containing protein
Genome location: Carg_Chr18:11079525..11082313
RNA-Seq Expression: Carg17223
Synteny: Carg17223
Gene Ontology terms:
GO:0051603 - proteolysis involved in cellular protein catabolic process (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0003724 - RNA helicase activity (molecular function)
GO:0004222 - metalloendopeptidase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domains: IPR001915 - Peptidase M48
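
The genome location above follows a `seqid:start..end` pattern. As a minimal sketch, the string can be split into its parts and the genomic span computed. The function name `parse_location` is hypothetical, and the span assumes 1-based, inclusive coordinates, which this page does not state explicitly.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


seqid, start, end = parse_location("Carg_Chr18:11079525..11082313")
# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 2789 bp on Carg_Chr18.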


Homology
GenBank top hits (e value, % identity, alignment)
KAG6574129.1 hypothetical protein SDJN03_28016, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.1e-247 | % identity: 100
Query:  MSCYRKSKFAIDAFRSLSSKISPKDSIRDCGSRISRSGRSFTAQSASPIIQRFGRQVGENRKLCNPFLGGSKRFYYVDRYRVEHFKPRGPRRWFEDPRNL
        MSCYRKSKFAIDAFRSLSSKISPKDSIRDCGSRISRSGRSFTAQSASPIIQRFGRQVGENRKLCNPFLGGSKRFYYVDRYRVEHFKPRGPRRWFEDPRNL
Subjt:  MSCYRKSKFAIDAFRSLSSKISPKDSIRDCGSRISRSGRSFTAQSASPIIQRFGRQVGENRKLCNPFLGGSKRFYYVDRYRVEHFKPRGPRRWFEDPRNL

Query:  LIVVIAGSGVCVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWSDLGYASEAVM
        LIVVIAGSGVCVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWSDLGYASEAVM
Subjt:  LIVVIAGSGVCVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWSDLGYASEAVM

Query:  GAPEGSGHETLMALRGSGAEKMEDKWYREDEVLDDKWVESSRKKGQEKGSQANTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRKDAEIAT
        GAPEGSGHETLMALRGSGAEKMEDKWYREDEVLDDKWVESSRKKGQEKGSQANTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRKDAEIAT
Subjt:  GAPEGSGHETLMALRGSGAEKMEDKWYREDEVLDDKWVESSRKKGQEKGSQANTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRKDAEIAT

Query:  IIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRKMEMEADYIGLLLIASAGYDPRVAPAVYETLGKVTGESALRDYLST
        IIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRKMEMEADYIGLLLIASAGYDPRVAPAVYETLGKVTGESALRDYLST
Subjt:  IIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRKMEMEADYIGLLLIASAGYDPRVAPAVYETLGKVTGESALRDYLST

Query:  HPSGKKRAQLLAQAKVMEEALTVYREARAGHGVEGFL
        HPSGKKRAQLLAQAKVMEEALTVYREARAGHGVEGFL
Subjt:  HPSGKKRAQLLAQAKVMEEALTVYREARAGHGVEGFL

XP_022140981.1 uncharacterized protein LOC111011501 [Momordica charantia] | e value: 1.9e-215 | % identity: 85.91
Query:  MSCYRKSKFAIDAFRSLSSKISPKDSIRD-CGSRISRSGRSFTA---------QSASPIIQRFGRQVGENRKLCNPFLGGSKRFYYVDRYRVEHFKPRGP
        MSCYR+SKF+ DAFR+LSSKI PKD IRD   SRIS+ G SFTA         Q ASPII+RFG+QVGENR+L NPFLG SKRFYYVDRYRV+HFKPRGP
Subjt:  MSCYRKSKFAIDAFRSLSSKISPKDSIRD-CGSRISRSGRSFTA---------QSASPIIQRFGRQVGENRKLCNPFLGGSKRFYYVDRYRVEHFKPRGP

Query:  RRWFEDPRNLLIVVIAGSGVCVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWS
        RRWF+DP+ ++IVV  GSGV VTVYYGNLETIPYTKRRHFV+LSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWS
Subjt:  RRWFEDPRNLLIVVIAGSGVCVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWS

Query:  DLGYASEAVMGAPEGSGHETLMALRGSGAEKMEDKWYREDEVLDDKWVESSRKKGQEKGSQANTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLE
        DLGYASEAV+GAPEGSGHETLMAL  +GAE++E KW REDE+LDDKWVE SRKKGQE+GSQA+TSHL+GLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLE
Subjt:  DLGYASEAVMGAPEGSGHETLMALRGSGAEKMEDKWYREDEVLDDKWVESSRKKGQEKGSQANTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLE

Query:  HFRKDAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRKMEMEADYIGLLLIASAGYDPRVAPAVYETLGKVTG
        HFR DAEIATIIGHEVGHAVARH+AEGITKNLGFA+LQ+ILYQF+MPDIVNTMS LFLRLPFSR+MEMEADYIGLLLIASAGYDPRVAP VYE LGKVTG
Subjt:  HFRKDAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRKMEMEADYIGLLLIASAGYDPRVAPAVYETLGKVTG

Query:  ESALRDYLSTHPSGKKRAQLLAQAKVMEEALTVYREARAGHGVEGFL
        +SALRDYLSTHPSGKKRAQLLAQAKVMEEAL+VYRE RAG GVEGFL
Subjt:  ESALRDYLSTHPSGKKRAQLLAQAKVMEEALTVYREARAGHGVEGFL
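
The % identity column summarizes how many aligned positions are identical. The page does not say how the figure is computed. The sketch below assumes the usual BLAST-style convention, in which the middle line repeats the residue at identical positions and shows `+` or a space elsewhere, and applies it to the last block of the alignment above. The helper name `midline_identity` is hypothetical, and a single block will not reproduce the whole-alignment figure.

```python
def midline_identity(query: str, midline: str) -> tuple[int, int]:
    """Return (identical_positions, aligned_length) for one alignment block.

    A position counts as identical when the midline repeats the query residue;
    '+' (conservative substitution), spaces and gap columns do not count.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    return identical, len(query)


# Last block of the XP_022140981.1 alignment, copied from the lines above.
query = "ESALRDYLSTHPSGKKRAQLLAQAKVMEEALTVYREARAGHGVEGFL"
midline = "+SALRDYLSTHPSGKKRAQLLAQAKVMEEAL+VYRE RAG GVEGFL"

identical, length = midline_identity(query, midline)
percent = 100.0 * identical / length
print(f"{identical}/{length} identical ({percent:.2f}%)")
```

Summing the identical and aligned counts over every block of a hit, rather than averaging per-block percentages, gives the figure comparable to the reported column.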

XP_022945955.1 uncharacterized protein LOC111450044 [Cucurbita moschata] | e value: 6.8e-245 | % identity: 99.08
Query:  MSCYRKSKFAIDAFRSLSSKISPKDSIRDCGSRISRSGRSFTAQSASPIIQRFGRQVGENRKLCNPFLGGSKRFYYVDRYRVEHFKPRGPRRWFEDPRNL
        MSCYRKSKFAIDAFRSLSSKI PKDSIRDCGSRIS S  SFTAQSASPIIQRFGRQVGENRKLCNPFLGGSKRFYYVDRYRVEHFKPRGPRRWFEDPRNL
Subjt:  MSCYRKSKFAIDAFRSLSSKISPKDSIRDCGSRISRSGRSFTAQSASPIIQRFGRQVGENRKLCNPFLGGSKRFYYVDRYRVEHFKPRGPRRWFEDPRNL

Query:  LIVVIAGSGVCVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWSDLGYASEAVM
        LIVVIAGSGVCVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWSDLGYASEAVM
Subjt:  LIVVIAGSGVCVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWSDLGYASEAVM

Query:  GAPEGSGHETLMALRGSGAEKMEDKWYREDEVLDDKWVESSRKKGQEKGSQANTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRKDAEIAT
        GAPEGSGHETLMALRGSGAEKMEDKWYREDEVLDDKWVESSRKKGQEKGSQANTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRKDAEIAT
Subjt:  GAPEGSGHETLMALRGSGAEKMEDKWYREDEVLDDKWVESSRKKGQEKGSQANTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRKDAEIAT

Query:  IIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRKMEMEADYIGLLLIASAGYDPRVAPAVYETLGKVTGESALRDYLST
        IIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRKMEMEADYIGLLLIASAGYDPRVAPAVYETLGKVTGESALRDYLST
Subjt:  IIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRKMEMEADYIGLLLIASAGYDPRVAPAVYETLGKVTGESALRDYLST

Query:  HPSGKKRAQLLAQAKVMEEALTVYREARAGHGVEGFL
        HPSGKKRAQLLAQAKVMEEALTVYREARAGHGVEGFL
Subjt:  HPSGKKRAQLLAQAKVMEEALTVYREARAGHGVEGFL

XP_022968009.1 uncharacterized protein LOC111467384 [Cucurbita maxima] | e value: 1.2e-241 | % identity: 97.71
Query:  MSCYRKSKFAIDAFRSLSSKISPKDSIRDCGSRISRSGRSFTAQSASPIIQRFGRQVGENRKLCNPFLGGSKRFYYVDRYRVEHFKPRGPRRWFEDPRNL
        MSCY KSKFAIDAFRSLSSKI PKDS RDCGSRISRSG  F AQSASPIIQRFGRQVGENRKLCNPFLGGSKRFYYVDRYRVEHFKPRGPRRWFEDPRNL
Subjt:  MSCYRKSKFAIDAFRSLSSKISPKDSIRDCGSRISRSGRSFTAQSASPIIQRFGRQVGENRKLCNPFLGGSKRFYYVDRYRVEHFKPRGPRRWFEDPRNL

Query:  LIVVIAGSGVCVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWSDLGYASEAVM
        LIVVIAGSGVCVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGL+QENVWSDLGYASEAVM
Subjt:  LIVVIAGSGVCVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWSDLGYASEAVM

Query:  GAPEGSGHETLMALRGSGAEKMEDKWYREDEVLDDKWVESSRKKGQEKGSQANTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRKDAEIAT
        GAPEGSGHETLMALR SGAEKMEDKWY EDE+LDDKWVESSRKKGQEKGSQANTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRKDAEIAT
Subjt:  GAPEGSGHETLMALRGSGAEKMEDKWYREDEVLDDKWVESSRKKGQEKGSQANTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRKDAEIAT

Query:  IIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRKMEMEADYIGLLLIASAGYDPRVAPAVYETLGKVTGESALRDYLST
        IIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRKMEMEADYIGLLLIASAGYDPRVAPAVYETLGKVTGESALRDYLST
Subjt:  IIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRKMEMEADYIGLLLIASAGYDPRVAPAVYETLGKVTGESALRDYLST

Query:  HPSGKKRAQLLAQAKVMEEALTVYREARAGHGVEGFL
        HPSGKKRAQLLAQAKVMEEALTVYREARAGHGVEGFL
Subjt:  HPSGKKRAQLLAQAKVMEEALTVYREARAGHGVEGFL

XP_023542344.1 uncharacterized protein LOC111802274 [Cucurbita pepo subsp. pepo] | e value: 2.0e-244 | % identity: 98.86
Query:  MSCYRKSKFAIDAFRSLSSKISPKDSIRDCGSRISRSGRSFTAQSASPIIQRFGRQVGENRKLCNPFLGGSKRFYYVDRYRVEHFKPRGPRRWFEDPRNL
        MSCYRKSKFAIDAFRSLSSKI PKDSIRDCGSRIS SGRSFTAQSASPIIQRFGRQVGENRKLCNPFLGGSKRFYYVDRYRVEHFKPRGPRRWFEDPRNL
Subjt:  MSCYRKSKFAIDAFRSLSSKISPKDSIRDCGSRISRSGRSFTAQSASPIIQRFGRQVGENRKLCNPFLGGSKRFYYVDRYRVEHFKPRGPRRWFEDPRNL

Query:  LIVVIAGSGVCVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWSDLGYASEAVM
        LIVVI GSGVCVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPA HPESVRVRLIAKDIIEALQRGL+QENVWSDLGYASEAVM
Subjt:  LIVVIAGSGVCVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWSDLGYASEAVM

Query:  GAPEGSGHETLMALRGSGAEKMEDKWYREDEVLDDKWVESSRKKGQEKGSQANTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRKDAEIAT
        GAPEGSGHETLMALRGSGAEKMEDKWYREDEVLDDKWVESSRKKGQEKGSQANTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRKDAEIAT
Subjt:  GAPEGSGHETLMALRGSGAEKMEDKWYREDEVLDDKWVESSRKKGQEKGSQANTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRKDAEIAT

Query:  IIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRKMEMEADYIGLLLIASAGYDPRVAPAVYETLGKVTGESALRDYLST
        IIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRKMEMEADYIGLLLIASAGYDPRVAPAVYETLGKVTGESALRDYLST
Subjt:  IIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRKMEMEADYIGLLLIASAGYDPRVAPAVYETLGKVTGESALRDYLST

Query:  HPSGKKRAQLLAQAKVMEEALTVYREARAGHGVEGFL
        HPSGKKRAQLLAQAKVMEEALTVYREARAGHGVEGFL
Subjt:  HPSGKKRAQLLAQAKVMEEALTVYREARAGHGVEGFL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LB58 Peptidase_M48 domain-containing protein | e value: 9.4e-208 | % identity: 82.74
Query:  MSCYRKSKFAIDAFRSLSSKISPKDSIRDCGSRISRSGRSFTA---------QSASPIIQRFGRQVGENRKLCNPFLGGSKRFYYVDRYRVEHFKPRGPR
        M+C+RKSKF  DAFR+ SSKI PKD I+   SRIS +G SF++         QS SPI++RF    GE  +  NPF G SKRFYYVDRYR++HFKPRGPR
Subjt:  MSCYRKSKFAIDAFRSLSSKISPKDSIRDCGSRISRSGRSFTA---------QSASPIIQRFGRQVGENRKLCNPFLGGSKRFYYVDRYRVEHFKPRGPR

Query:  RWFEDPRNLLIVVIAGSGVCVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWSD
        RWF+DPR LLIVV+ GSGV +TVYYGNLET+PYTKRRHFVLLS+ MER++GES+FEQMKAAFKGKILPA+HPESVRVRLIAKDIIEALQRGL+QENVW+D
Subjt:  RWFEDPRNLLIVVIAGSGVCVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWSD

Query:  LGYASEAVMGAPEGSGHETLMALRGSGAEKMEDKWYREDEVLDDKWVESSRKKGQEKGSQANTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEH
        LGYASEAV+GAPEGSGHETLMAL+ SG+EK+E KWYREDE+LDDKWVE SRKKGQ  GSQANTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEH
Subjt:  LGYASEAVMGAPEGSGHETLMALRGSGAEKMEDKWYREDEVLDDKWVESSRKKGQEKGSQANTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEH

Query:  FRKDAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRKMEMEADYIGLLLIASAGYDPRVAPAVYETLGKVTGE
        FR DAEIATIIGHEV HAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSR+MEMEADYIGLLLIASAGYDPRVAP VYE LGKVTG+
Subjt:  FRKDAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRKMEMEADYIGLLLIASAGYDPRVAPAVYETLGKVTGE

Query:  SALRDYLSTHPSGKKRAQLLAQAKVMEEALTVYREARAGHGVEGFL
        SALRDYLSTHPSGKKRAQLLAQAKVMEEAL++YRE RAGHG+EGFL
Subjt:  SALRDYLSTHPSGKKRAQLLAQAKVMEEALTVYREARAGHGVEGFL

A0A6J1CHM3 uncharacterized protein LOC111011501 | e value: 9.4e-216 | % identity: 85.91
Query:  MSCYRKSKFAIDAFRSLSSKISPKDSIRD-CGSRISRSGRSFTA---------QSASPIIQRFGRQVGENRKLCNPFLGGSKRFYYVDRYRVEHFKPRGP
        MSCYR+SKF+ DAFR+LSSKI PKD IRD   SRIS+ G SFTA         Q ASPII+RFG+QVGENR+L NPFLG SKRFYYVDRYRV+HFKPRGP
Subjt:  MSCYRKSKFAIDAFRSLSSKISPKDSIRD-CGSRISRSGRSFTA---------QSASPIIQRFGRQVGENRKLCNPFLGGSKRFYYVDRYRVEHFKPRGP

Query:  RRWFEDPRNLLIVVIAGSGVCVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWS
        RRWF+DP+ ++IVV  GSGV VTVYYGNLETIPYTKRRHFV+LSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWS
Subjt:  RRWFEDPRNLLIVVIAGSGVCVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWS

Query:  DLGYASEAVMGAPEGSGHETLMALRGSGAEKMEDKWYREDEVLDDKWVESSRKKGQEKGSQANTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLE
        DLGYASEAV+GAPEGSGHETLMAL  +GAE++E KW REDE+LDDKWVE SRKKGQE+GSQA+TSHL+GLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLE
Subjt:  DLGYASEAVMGAPEGSGHETLMALRGSGAEKMEDKWYREDEVLDDKWVESSRKKGQEKGSQANTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLE

Query:  HFRKDAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRKMEMEADYIGLLLIASAGYDPRVAPAVYETLGKVTG
        HFR DAEIATIIGHEVGHAVARH+AEGITKNLGFA+LQ+ILYQF+MPDIVNTMS LFLRLPFSR+MEMEADYIGLLLIASAGYDPRVAP VYE LGKVTG
Subjt:  HFRKDAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRKMEMEADYIGLLLIASAGYDPRVAPAVYETLGKVTG

Query:  ESALRDYLSTHPSGKKRAQLLAQAKVMEEALTVYREARAGHGVEGFL
        +SALRDYLSTHPSGKKRAQLLAQAKVMEEAL+VYRE RAG GVEGFL
Subjt:  ESALRDYLSTHPSGKKRAQLLAQAKVMEEALTVYREARAGHGVEGFL

A0A6J1E3I5 uncharacterized protein LOC111430479 isoform X1 | e value: 1.7e-209 | % identity: 84.6
Query:  MSCYRK--SKFAIDAFRSLSSKISPKDSIRDCGSRISRSGRSFTA---------QSASPIIQRFGRQVGENRKLCNPFLGGSKRFYYVDRYRVEHFKPRG
        MSCYRK  SK A DAFR+LSSKI P + IRD  SRIS  G SFTA         QS+SPIIQRFGRQV ENR+L NPF G SKRFYYVD YRV+HFKPRG
Subjt:  MSCYRK--SKFAIDAFRSLSSKISPKDSIRDCGSRISRSGRSFTA---------QSASPIIQRFGRQVGENRKLCNPFLGGSKRFYYVDRYRVEHFKPRG

Query:  PRRWFEDPRNLLIVVIAGSGVCVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVW
        PRRWF+DPR +L+VV AGSGV +TVYYGNLETIPYTKRRHFVLLSRAMER LGESQFEQMKAAFKGKILPAVHPESVRVRLIAKD+I+ALQRGLKQENVW
Subjt:  PRRWFEDPRNLLIVVIAGSGVCVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVW

Query:  SDLGYASEAVMGAPEGSGHETLMALRGSGAEKMEDKWYREDEVLDDKWVESSRKKGQEKGSQANTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLL
        SDLGYASEA +GAPEGSG+ETLMALR SGA KME KWYREDE+LDDKWVE SRKKG+++GSQA+ SHLDGL WEVLVVNE VVNAFCLPGGKIVVFTGLL
Subjt:  SDLGYASEAVMGAPEGSGHETLMALRGSGAEKMEDKWYREDEVLDDKWVESSRKKGQEKGSQANTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLL

Query:  EHFRKDAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRKMEMEADYIGLLLIASAGYDPRVAPAVYETLGKVT
        EHFR DAEIATIIGHE+GHAVARH+AEGITKNL FAVLQLILYQF+MPDIVNTMSTLFLRLPFSR+MEMEADYIGLLLIASAGYDPRVAP VYE LGKV+
Subjt:  EHFRKDAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRKMEMEADYIGLLLIASAGYDPRVAPAVYETLGKVT

Query:  GESALRDYLSTHPSGKKRAQLLAQAKVMEEALTVYREARAGHGVEGFL
        G+SALRDYLSTHPSGKKRAQLLAQAKVMEEAL+VYRE RAG GVEGFL
Subjt:  GESALRDYLSTHPSGKKRAQLLAQAKVMEEALTVYREARAGHGVEGFL

A0A6J1G2G4 uncharacterized protein LOC111450044 | e value: 3.3e-245 | % identity: 99.08
Query:  MSCYRKSKFAIDAFRSLSSKISPKDSIRDCGSRISRSGRSFTAQSASPIIQRFGRQVGENRKLCNPFLGGSKRFYYVDRYRVEHFKPRGPRRWFEDPRNL
        MSCYRKSKFAIDAFRSLSSKI PKDSIRDCGSRIS S  SFTAQSASPIIQRFGRQVGENRKLCNPFLGGSKRFYYVDRYRVEHFKPRGPRRWFEDPRNL
Subjt:  MSCYRKSKFAIDAFRSLSSKISPKDSIRDCGSRISRSGRSFTAQSASPIIQRFGRQVGENRKLCNPFLGGSKRFYYVDRYRVEHFKPRGPRRWFEDPRNL

Query:  LIVVIAGSGVCVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWSDLGYASEAVM
        LIVVIAGSGVCVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWSDLGYASEAVM
Subjt:  LIVVIAGSGVCVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWSDLGYASEAVM

Query:  GAPEGSGHETLMALRGSGAEKMEDKWYREDEVLDDKWVESSRKKGQEKGSQANTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRKDAEIAT
        GAPEGSGHETLMALRGSGAEKMEDKWYREDEVLDDKWVESSRKKGQEKGSQANTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRKDAEIAT
Subjt:  GAPEGSGHETLMALRGSGAEKMEDKWYREDEVLDDKWVESSRKKGQEKGSQANTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRKDAEIAT

Query:  IIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRKMEMEADYIGLLLIASAGYDPRVAPAVYETLGKVTGESALRDYLST
        IIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRKMEMEADYIGLLLIASAGYDPRVAPAVYETLGKVTGESALRDYLST
Subjt:  IIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRKMEMEADYIGLLLIASAGYDPRVAPAVYETLGKVTGESALRDYLST

Query:  HPSGKKRAQLLAQAKVMEEALTVYREARAGHGVEGFL
        HPSGKKRAQLLAQAKVMEEALTVYREARAGHGVEGFL
Subjt:  HPSGKKRAQLLAQAKVMEEALTVYREARAGHGVEGFL

A0A6J1HYD1 uncharacterized protein LOC111467384 | e value: 5.8e-242 | % identity: 97.71
Query:  MSCYRKSKFAIDAFRSLSSKISPKDSIRDCGSRISRSGRSFTAQSASPIIQRFGRQVGENRKLCNPFLGGSKRFYYVDRYRVEHFKPRGPRRWFEDPRNL
        MSCY KSKFAIDAFRSLSSKI PKDS RDCGSRISRSG  F AQSASPIIQRFGRQVGENRKLCNPFLGGSKRFYYVDRYRVEHFKPRGPRRWFEDPRNL
Subjt:  MSCYRKSKFAIDAFRSLSSKISPKDSIRDCGSRISRSGRSFTAQSASPIIQRFGRQVGENRKLCNPFLGGSKRFYYVDRYRVEHFKPRGPRRWFEDPRNL

Query:  LIVVIAGSGVCVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWSDLGYASEAVM
        LIVVIAGSGVCVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGL+QENVWSDLGYASEAVM
Subjt:  LIVVIAGSGVCVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWSDLGYASEAVM

Query:  GAPEGSGHETLMALRGSGAEKMEDKWYREDEVLDDKWVESSRKKGQEKGSQANTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRKDAEIAT
        GAPEGSGHETLMALR SGAEKMEDKWY EDE+LDDKWVESSRKKGQEKGSQANTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRKDAEIAT
Subjt:  GAPEGSGHETLMALRGSGAEKMEDKWYREDEVLDDKWVESSRKKGQEKGSQANTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRKDAEIAT

Query:  IIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRKMEMEADYIGLLLIASAGYDPRVAPAVYETLGKVTGESALRDYLST
        IIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRKMEMEADYIGLLLIASAGYDPRVAPAVYETLGKVTGESALRDYLST
Subjt:  IIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRKMEMEADYIGLLLIASAGYDPRVAPAVYETLGKVTGESALRDYLST

Query:  HPSGKKRAQLLAQAKVMEEALTVYREARAGHGVEGFL
        HPSGKKRAQLLAQAKVMEEALTVYREARAGHGVEGFL
Subjt:  HPSGKKRAQLLAQAKVMEEALTVYREARAGHGVEGFL

SwissProt top hits (e value, % identity, alignment)
E9QBI7 Metalloendopeptidase OMA1, mitochondrial | e value: 2.1e-18 | % identity: 32.97
Query:  SHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRKDAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPD------------IVNT
        + +  + W V VV+ P +NAF LP G+I VFTG+L       ++  I+GHE+ HA+  H+AE  + +    +L L+L   I               I   
Subjt:  SHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRKDAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPD------------IVNT

Query:  MSTLFLRLPFSRKMEMEADYIGLLLIASAGYDPRVAPAVYETL---GKVTGESALRDYLSTHPSGKKRAQLLAQAKVMEEAL
        +       PFSRK+E EAD +GL + A A  D R  P  +E +    +++G+  + ++LSTHPS + R + L   +++ EAL
Subjt:  MSTLFLRLPFSRKMEMEADYIGLLLIASAGYDPRVAPAVYETL---GKVTGESALRDYLSTHPSGKKRAQLLAQAKVMEEAL

P36163 Mitochondrial metalloendopeptidase OMA1 | e value: 4.4e-29 | % identity: 27.83
Query:  GVCVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWSDLGYASEAVMGAPEGSGH
        G C   YY +L+  P + R  F+ +SR +E  +G   ++ +    + +ILP  HP S+++  I   I+EA                              
Subjt:  GVCVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWSDLGYASEAVMGAPEGSGH

Query:  ETLMALRGSGAEKMEDKWYREDEVLDDKWVESSRKKGQEKGSQANTSHLDGLNWEVLVVNEPVV--NAFCLPGGKIVVFTGLLEHFRKDAEIATIIGHEV
                          Y++  V                    + S LDG+ WE+ VVN+P    NAF LPGGK+ +F+ +L     D  IAT++ HE 
Subjt:  ETLMALRGSGAEKMEDKWYREDEVLDDKWVESSRKKGQEKGSQANTSHLDGLNWEVLVVNEPVV--NAFCLPGGKIVVFTGLLEHFRKDAEIATIIGHEV

Query:  GHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNT-MSTLFLRLPFSRKMEMEADYIGLLLIASAGYDPRVAPAVYETLGKV-----TGESALRDYLST
         H +ARH+AE ++K   +++L L+LY       +N  +   FLR+P SR+ME EADYIGL++++ A + P+ +  V+E +         G     ++LST
Subjt:  GHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNT-MSTLFLRLPFSRKMEMEADYIGLLLIASAGYDPRVAPAVYETLGKV-----TGESALRDYLST

Query:  HPSGKKRAQLLAQAKVMEEALTVYREA
        HP+  +R + +  +K + +A  +Y ++
Subjt:  HPSGKKRAQLLAQAKVMEEALTVYREA

Q5A663 Mitochondrial metalloendopeptidase OMA1 | e value: 2.3e-17 | % identity: 25.41
Query:  YYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWSDLGYASEAVMGAPEGSGHETLMAL
        Y  NL   PYT R  F+ +   +E ++G+  + Q+   F+ +ILP  +P   RV  I   +++                                  +AL
Subjt:  YYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWSDLGYASEAVMGAPEGSGHETLMAL

Query:  RGSGAEKMEDKWYREDEVLDDKWVESSRKKGQEKGSQANTSHLDGLNWEVLVVNEPVV--NAFCLPGGKIVVFTGLLEHFRKDAEIATIIGHEVGHAVAR
                       D + DD              +    +HL  L WE+ ++    +  NAF LP GKI +F+ ++   + +  +AT++ HE+ H +A+
Subjt:  RGSGAEKMEDKWYREDEVLDDKWVESSRKKGQEKGSQANTSHLDGLNWEVLVVNEPVV--NAFCLPGGKIVVFTGLLEHFRKDAEIATIIGHEVGHAVAR

Query:  HSAEGITKNLGFAVLQLILYQFIMPDIVN-TMSTLFLRLPFSRKMEMEADYIGLLLIASAGYDP--------RVAPAVYETLGKVTGESALR---DYLST
        HS+E ++K   + VL  ILY        N  +    L +  SR+ME EAD+IG  L+A A ++P        R++ A  +  G V+ E       ++ ST
Subjt:  HSAEGITKNLGFAVLQLILYQFIMPDIVN-TMSTLFLRLPFSRKMEMEADYIGLLLIASAGYDP--------RVAPAVYETLGKVTGESALR---DYLST

Query:  HPSGKKR
        HP+  +R
Subjt:  HPSGKKR

Q96E52 Metalloendopeptidase OMA1, mitochondrial | e value: 2.1e-18 | % identity: 35.33
Query:  LNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRKDAEIATIIGHEVGHAVARHSAE--GITKNLGF-AVLQLILYQFIMPD---------IVNTMSTLF
        +NW + VV+ P++NAF LP G++ VFTG L       +++ ++GHE+ HAV  H+AE  G+   L F  ++ L +   I P          I + +    
Subjt:  LNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRKDAEIATIIGHEVGHAVARHSAE--GITKNLGF-AVLQLILYQFIMPD---------IVNTMSTLF

Query:  LRLPFSRKMEMEADYIGLLLIASAGYDPRVAPAVY---ETLGKVTGESALRDYLSTHPSGKKRAQLL
           P+SRK+E EAD IGLLL A A  D R +   +   E +  + G+  + ++LSTHPS   R + L
Subjt:  LRLPFSRKMEMEADYIGLLLIASAGYDPRVAPAVY---ETLGKVTGESALRDYLSTHPSGKKRAQLL

Q9P7G4 Mitochondrial metalloendopeptidase OMA1 | e value: 3.0e-25 | % identity: 38.55
Query:  SHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRKDAEIATIIGHEVGHAVARHSAEGI--TKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPF
        S +  L WE+ V+ +P  NAF LPGGK+ VF G+L   + +  +A ++ HE  H VARHSAE I  T+ +   V        +   + + +    L LPF
Subjt:  SHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRKDAEIATIIGHEVGHAVARHSAEGI--TKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPF

Query:  SRKMEMEADYIGLLLIASAGYDPRVAPAVYETLGKVTGE-SALRDYLSTHPSGKKR----AQLLAQAKVMEEALTVYRE
        SRKME EADYIGL+L++ A +DP  A  ++E +    G+      + STHPS KKR     + L +A+V  E    Y E
Subjt:  SRKMEMEADYIGLLLIASAGYDPRVAPAVYETLGKVTGE-SALRDYLSTHPSGKKR----AQLLAQAKVMEEALTVYRE

Arabidopsis top hits (e value, % identity, alignment)
AT5G51740.1 Peptidase family M48 family protein | e value: 8.0e-151 | % identity: 62.58
Query:  MSCYRKSKFAIDAF-RSLSSKISPKDSIRDCGSRI------SRSGRSFTAQSASPIIQRFGRQVGE--NRKLCNPFLGGSKRFYYVDRYRVEHFKPRGPR
        MS YR++K   D+  R+++ KI P+  +    SRI      S     F++ S+  +  R    +G   NR   NPFL   KR+YYVDRY+V HFKPRGP 
Subjt:  MSCYRKSKFAIDAF-RSLSSKISPKDSIRDCGSRI------SRSGRSFTAQSASPIIQRFGRQVGE--NRKLCNPFLGGSKRFYYVDRYRVEHFKPRGPR

Query:  RWFEDPRNLLIVVIAGSGVCVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWSD
        RWF++PR +  VV+ GS   +T+  GN ETIPYTKR HF+LLS+ ME+ LGE+QFEQ+K  ++GKILPA HPES+RVRLIAK++I+ALQRGL  E VWSD
Subjt:  RWFEDPRNLLIVVIAGSGVCVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWSD

Query:  LGYAS-EAVMGAPEGSGHETLMALRGSGAEKMED-KWYREDEVLDDKWVESSRKKGQEKGSQANTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLL
        LGYAS E+ +G     G +  M +  SG + M D KW +ED+VLDD+W++ SRKK  +  + A TSHL+G++WEVLVVNEP+VNAFCLP GKIVVFTGLL
Subjt:  LGYAS-EAVMGAPEGSGHETLMALRGSGAEKMED-KWYREDEVLDDKWVESSRKKGQEKGSQANTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLL

Query:  EHFRKDAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRKMEMEADYIGLLLIASAGYDPRVAPAVYETLGKVT
         HF+ DAE+AT+IGHEVGHAVARH AEGITKNL FA+LQL+LYQF+MPD+VNTMS LFLRLPFSRKME+EADYIGLLL+ASAGYDPRVAP VYE LGK+ 
Subjt:  EHFRKDAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRKMEMEADYIGLLLIASAGYDPRVAPAVYETLGKVT

Query:  GESALRDYLSTHPSGKKRAQLLAQAKVMEEALTVYREARAGH-GVEGFL
        G+ AL DYLSTHPSGKKR++LLAQA VMEEAL +YRE +AG  GVEGFL
Subjt:  GESALRDYLSTHPSGKKRAQLLAQAKVMEEALTVYREARAGH-GVEGFL


Sequences
CDS sequence
ATGAGTTGCTATAGAAAATCGAAGTTTGCGATTGATGCCTTTCGGAGCTTGTCTTCGAAGATTTCCCCCAAAGATTCAATTCGTGATTGTGGATCAAGAATTTCCCGTAG
TGGCCGTTCGTTTACGGCTCAATCGGCTTCTCCAATCATACAAAGATTTGGAAGACAAGTTGGAGAGAATCGGAAGCTATGCAATCCCTTTTTGGGTGGTTCGAAGAGAT
TTTACTATGTCGATCGCTATCGCGTCGAGCATTTTAAGCCCAGAGGACCTCGGCGATGGTTTGAAGATCCGAGGAACTTATTGATAGTTGTGATTGCGGGTTCTGGAGTT
TGTGTGACAGTGTATTATGGGAATTTAGAAACCATACCTTATACCAAACGGAGGCATTTTGTGCTGTTATCGAGAGCTATGGAGAGGAGGCTTGGAGAGTCGCAATTTGA
GCAAATGAAGGCAGCCTTTAAGGGTAAAATATTGCCTGCTGTACACCCTGAAAGTGTGAGGGTAAGATTGATAGCTAAGGATATAATTGAGGCATTACAAAGAGGGTTGA
AGCAAGAGAATGTGTGGAGTGATTTAGGCTATGCATCAGAGGCTGTGATGGGAGCCCCTGAAGGTAGTGGCCATGAGACATTGATGGCGCTTAGGGGTTCTGGGGCTGAG
AAGATGGAAGATAAATGGTACCGCGAAGACGAGGTTCTTGATGACAAATGGGTTGAGAGCTCTAGGAAGAAGGGCCAGGAAAAGGGTTCCCAAGCAAATACTTCACATTT
GGATGGATTGAACTGGGAGGTTTTGGTGGTGAATGAACCGGTTGTTAATGCATTTTGCTTGCCTGGTGGGAAGATTGTTGTTTTCACTGGCTTGCTCGAGCACTTTAGAA
AGGATGCAGAAATTGCAACTATTATTGGTCACGAGGTTGGGCATGCTGTGGCTCGACATTCTGCTGAGGGTATTACGAAGAACCTGGGGTTTGCCGTTTTGCAACTTATC
CTTTATCAGTTCATCATGCCTGATATTGTCAACACTATGTCAACTCTTTTCTTGAGGCTTCCTTTCTCTAGAAAGATGGAAATGGAAGCAGATTACATTGGTTTGCTTTT
GATCGCCTCCGCTGGATATGACCCGAGGGTTGCACCTGCCGTATACGAGACGTTGGGTAAGGTCACTGGTGAGTCTGCATTAAGAGATTATCTCTCTACTCATCCATCTG
GAAAGAAGAGGGCACAGTTGCTAGCCCAAGCCAAGGTTATGGAGGAAGCTCTCACTGTTTACAGAGAAGCAAGAGCCGGACATGGGGTTGAAGGCTTCCTATAA
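
The CDS above should translate to the protein sequence listed on this page. As a minimal sketch, assuming the standard genetic code (the page does not name a translation table), the check can be done without external libraries. The `translate` helper is hypothetical, and only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequences
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        # Codons with ambiguous bases (e.g. N) are reported as 'X'.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten and last eight codons of the CDS listed above.
cds_start = "ATGAGTTGCTATAGAAAATCGAAGTTTGCG"
cds_end = "CATGGGGTTGAAGGCTTCCTATAA"
print(translate(cds_start), translate(cds_end))
```

Passing the full wrapped CDS to `translate` works the same way, since whitespace is stripped before the codons are read.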
mRNA sequence
ATGAGTTGCTATAGAAAATCGAAGTTTGCGATTGATGCCTTTCGGAGCTTGTCTTCGAAGATTTCCCCCAAAGATTCAATTCGTGATTGTGGATCAAGAATTTCCCGTAG
TGGCCGTTCGTTTACGGCTCAATCGGCTTCTCCAATCATACAAAGATTTGGAAGACAAGTTGGAGAGAATCGGAAGCTATGCAATCCCTTTTTGGGTGGTTCGAAGAGAT
TTTACTATGTCGATCGCTATCGCGTCGAGCATTTTAAGCCCAGAGGACCTCGGCGATGGTTTGAAGATCCGAGGAACTTATTGATAGTTGTGATTGCGGGTTCTGGAGTT
TGTGTGACAGTGTATTATGGGAATTTAGAAACCATACCTTATACCAAACGGAGGCATTTTGTGCTGTTATCGAGAGCTATGGAGAGGAGGCTTGGAGAGTCGCAATTTGA
GCAAATGAAGGCAGCCTTTAAGGGTAAAATATTGCCTGCTGTACACCCTGAAAGTGTGAGGGTAAGATTGATAGCTAAGGATATAATTGAGGCATTACAAAGAGGGTTGA
AGCAAGAGAATGTGTGGAGTGATTTAGGCTATGCATCAGAGGCTGTGATGGGAGCCCCTGAAGGTAGTGGCCATGAGACATTGATGGCGCTTAGGGGTTCTGGGGCTGAG
AAGATGGAAGATAAATGGTACCGCGAAGACGAGGTTCTTGATGACAAATGGGTTGAGAGCTCTAGGAAGAAGGGCCAGGAAAAGGGTTCCCAAGCAAATACTTCACATTT
GGATGGATTGAACTGGGAGGTTTTGGTGGTGAATGAACCGGTTGTTAATGCATTTTGCTTGCCTGGTGGGAAGATTGTTGTTTTCACTGGCTTGCTCGAGCACTTTAGAA
AGGATGCAGAAATTGCAACTATTATTGGTCACGAGGTTGGGCATGCTGTGGCTCGACATTCTGCTGAGGGTATTACGAAGAACCTGGGGTTTGCCGTTTTGCAACTTATC
CTTTATCAGTTCATCATGCCTGATATTGTCAACACTATGTCAACTCTTTTCTTGAGGCTTCCTTTCTCTAGAAAGATGGAAATGGAAGCAGATTACATTGGTTTGCTTTT
GATCGCCTCCGCTGGATATGACCCGAGGGTTGCACCTGCCGTATACGAGACGTTGGGTAAGGTCACTGGTGAGTCTGCATTAAGAGATTATCTCTCTACTCATCCATCTG
GAAAGAAGAGGGCACAGTTGCTAGCCCAAGCCAAGGTTATGGAGGAAGCTCTCACTGTTTACAGAGAAGCAAGAGCCGGACATGGGGTTGAAGGCTTCCTATAAGACTCC
CAAAACTGCGCTAGAAACTTTGAAATAAGTTCATATGGATTGGAAAGAGAATGGCTTCTCTCCTCCATTGTACTGTTTTTTAGCAATGCCGGAGTATTGCACTGTTAAAG
CATGATATCAAATTGTAATTCATTGATTTCAACCATGATCAAAATGTTTTAGTGAATTTTGGGTATTAATTTGCTTTGGACTCCTCTCTTGGAAACCCCTTCAAACGCCT
TCTAGGTTGAAACATAACGATTCTGGCCACTCGATGAGGAAGGTTTGGGGACGAGTGACAACCTTCATCGACTAGCGACAACCATTGACAATCATG
Protein sequence
MSCYRKSKFAIDAFRSLSSKISPKDSIRDCGSRISRSGRSFTAQSASPIIQRFGRQVGENRKLCNPFLGGSKRFYYVDRYRVEHFKPRGPRRWFEDPRNLLIVVIAGSGV
CVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLKQENVWSDLGYASEAVMGAPEGSGHETLMALRGSGAE
KMEDKWYREDEVLDDKWVESSRKKGQEKGSQANTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRKDAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLI
LYQFIMPDIVNTMSTLFLRLPFSRKMEMEADYIGLLLIASAGYDPRVAPAVYETLGKVTGESALRDYLSTHPSGKKRAQLLAQAKVMEEALTVYREARAGHGVEGFL