; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016890 (gene) of Snake gourd v1 genome

Gene IDTan0016890
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPeptidase_M48 domain-containing protein
Genome locationLG01:2923797..2926240
RNA-Seq ExpressionTan0016890
SyntenyTan0016890
Gene Ontology termsGO:0051603 - proteolysis involved in cellular protein catabolic process (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0003724 - RNA helicase activity (molecular function)
GO:0004222 - metalloendopeptidase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0016887 - ATPase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001915 - Peptidase M48


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576765.1 Embryo-specific protein ATS3B, partial [Cucurbita argyrosperma subsp. sororia]1.9e-22689.49Show/hide
Query:  DMSCFRKSKFAFDAFRNLSSKIFPRDLIPDSGSRISHSQYSFTAGKRSNSYGFQSASPIIQRFGRQVGENRRLYNPFLGNSKRFYYVDRYRVQHFKPRGP
        DMSC+RKSK AFDAFRNLSSKIFP +LI D  SRISH  YSFTAG+ SNSYGFQS+SPIIQRFGRQV ENRRLYNPF G+SKRFYYVD YRVQHFKPRGP
Subjt:  DMSCFRKSKFAFDAFRNLSSKIFPRDLIPDSGSRISHSQYSFTAGKRSNSYGFQSASPIIQRFGRQVGENRRLYNPFLGNSKRFYYVDRYRVQHFKPRGP

Query:  RRWFQDPRNVFIVVIAGSGVFVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLRQENVWS
        RRWFQDPR V +VV AGSGVF+TVYYGNLETIPYTKRRHFVLLSRAMER LGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDII+ALQRGL+QENVWS
Subjt:  RRWFQDPRNVFIVVIAGSGVFVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLRQENVWS

Query:  DLGYASEAVMGAPEGSGHETLMALRDAGTEKVEGKWYREDEILDDKWVERSRKKGQEKGSQADTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLE
        DLGYASEA +GAPEGSG+ETLMALRD+G  K+EGKWYREDEILDDKWVERSRKKGQ++GSQAD SHLDGL WEVLVVNE VVNAFCLPGGKIVVFTGLLE
Subjt:  DLGYASEAVMGAPEGSGHETLMALRDAGTEKVEGKWYREDEILDDKWVERSRKKGQEKGSQADTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLE

Query:  HFRSDAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPAVYEMLGKVTG
        HFRSDAEIATIIGHE+GHAVARH+AEGITKNL FAVLQLILYQF+ PDIVNTMSTLFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAP VYE LGKV+G
Subjt:  HFRSDAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPAVYEMLGKVTG

Query:  ESALRDYLSTHPSGKKRAQLLAQAKVMEEALNVYREVRAGRGVEGFL
        +SALRDYLSTHPSGKKRAQLLA+AKVMEEAL+VYREVRAGRGVEGFL
Subjt:  ESALRDYLSTHPSGKKRAQLLAQAKVMEEALNVYREVRAGRGVEGFL

KAG7014804.1 Embryo-specific protein ATS3B [Cucurbita argyrosperma subsp. argyrosperma]5.1e-22789.51Show/hide
Query:  RDMSCFRKSKFAFDAFRNLSSKIFPRDLIPDSGSRISHSQYSFTAGKRSNSYGFQSASPIIQRFGRQVGENRRLYNPFLGNSKRFYYVDRYRVQHFKPRG
        RDMSC+RKSK AFDAFRNLSSKIFP +LI D  SRISH  YSFTAG+ SNSYGFQS+SPIIQRFGRQV ENRRLYNPF G+SKRFYYVD YRVQHFKPRG
Subjt:  RDMSCFRKSKFAFDAFRNLSSKIFPRDLIPDSGSRISHSQYSFTAGKRSNSYGFQSASPIIQRFGRQVGENRRLYNPFLGNSKRFYYVDRYRVQHFKPRG

Query:  PRRWFQDPRNVFIVVIAGSGVFVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLRQENVW
        PRRWFQDPR V +VV AGSGVF+TVYYGNLETIPYTKRRHFVLLSRAMER LGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDII+ALQRGL+QENVW
Subjt:  PRRWFQDPRNVFIVVIAGSGVFVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLRQENVW

Query:  SDLGYASEAVMGAPEGSGHETLMALRDAGTEKVEGKWYREDEILDDKWVERSRKKGQEKGSQADTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLL
        SDLGYASEA +GAPEGSG+ETLMALRD+G  K+EGKWYREDEILDDKWVERSRKKGQ++GSQAD SHLDGL WEVLVVNE VVNAFCLPGGKIVVFTGLL
Subjt:  SDLGYASEAVMGAPEGSGHETLMALRDAGTEKVEGKWYREDEILDDKWVERSRKKGQEKGSQADTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLL

Query:  EHFRSDAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPAVYEMLGKVT
        EHFRSDAEIATIIGHE+GHAVARH+AEGITKNL FAVLQLILYQF+ PDIVNTMSTLFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAP VYE LGKV+
Subjt:  EHFRSDAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPAVYEMLGKVT

Query:  GESALRDYLSTHPSGKKRAQLLAQAKVMEEALNVYREVRAGRGVEGFL
        G+SALRDYLSTHPSGKKRAQLLA+AKVMEEAL+VYREVRAGRGVEGFL
Subjt:  GESALRDYLSTHPSGKKRAQLLAQAKVMEEALNVYREVRAGRGVEGFL

XP_022140981.1 uncharacterized protein LOC111011501 [Momordica charantia]3.8e-23090.6Show/hide
Query:  MSCFRKSKFAFDAFRNLSSKIFPRDLIPD-SGSRISHSQYSFTAGKRSNSYGFQSASPIIQRFGRQVGENRRLYNPFLGNSKRFYYVDRYRVQHFKPRGP
        MSC+R+SKF+FDAFRNLSSKIFP+D+I D   SRIS    SFTAGK+SNSYGFQ ASPII+RFG+QVGENRRLYNPFLG+SKRFYYVDRYRVQHFKPRGP
Subjt:  MSCFRKSKFAFDAFRNLSSKIFPRDLIPD-SGSRISHSQYSFTAGKRSNSYGFQSASPIIQRFGRQVGENRRLYNPFLGNSKRFYYVDRYRVQHFKPRGP

Query:  RRWFQDPRNVFIVVIAGSGVFVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLRQENVWS
        RRWFQDP+ V IVV  GSGVFVTVYYGNLETIPYTKRRHFV+LSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGL+QENVWS
Subjt:  RRWFQDPRNVFIVVIAGSGVFVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLRQENVWS

Query:  DLGYASEAVMGAPEGSGHETLMALRDAGTEKVEGKWYREDEILDDKWVERSRKKGQEKGSQADTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLE
        DLGYASEAV+GAPEGSGHETLMAL DAG E+VE KW REDEILDDKWVERSRKKGQE+GSQADTSHL+GLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLE
Subjt:  DLGYASEAVMGAPEGSGHETLMALRDAGTEKVEGKWYREDEILDDKWVERSRKKGQEKGSQADTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLE

Query:  HFRSDAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPAVYEMLGKVTG
        HFRSDAEIATIIGHEVGHAVARH+AEGITKNLGFA+LQ+ILYQF+MPDIVNTMS LFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAP VYEMLGKVTG
Subjt:  HFRSDAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPAVYEMLGKVTG

Query:  ESALRDYLSTHPSGKKRAQLLAQAKVMEEALNVYREVRAGRGVEGFL
        +SALRDYLSTHPSGKKRAQLLAQAKVMEEAL+VYREVRAG GVEGFL
Subjt:  ESALRDYLSTHPSGKKRAQLLAQAKVMEEALNVYREVRAGRGVEGFL

XP_022945955.1 uncharacterized protein LOC111450044 [Cucurbita moschata]6.6e-22790.81Show/hide
Query:  MSCFRKSKFAFDAFRNLSSKIFPRDLIPDSGSRISHSQYSFTAGKRSNSYGFQSASPIIQRFGRQVGENRRLYNPFLGNSKRFYYVDRYRVQHFKPRGPR
        MSC+RKSKFA DAFR+LSSKIFP+D I D GSRISHS+ SFTA         QSASPIIQRFGRQVGENR+L NPFLG SKRFYYVDRYRV+HFKPRGPR
Subjt:  MSCFRKSKFAFDAFRNLSSKIFPRDLIPDSGSRISHSQYSFTAGKRSNSYGFQSASPIIQRFGRQVGENRRLYNPFLGNSKRFYYVDRYRVQHFKPRGPR

Query:  RWFQDPRNVFIVVIAGSGVFVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLRQENVWSD
        RWF+DPRN+ IVVIAGSGV VTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGL+QENVWSD
Subjt:  RWFQDPRNVFIVVIAGSGVFVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLRQENVWSD

Query:  LGYASEAVMGAPEGSGHETLMALRDAGTEKVEGKWYREDEILDDKWVERSRKKGQEKGSQADTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEH
        LGYASEAVMGAPEGSGHETLMALR +G EK+E KWYREDE+LDDKWVE SRKKGQEKGSQA+TSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEH
Subjt:  LGYASEAVMGAPEGSGHETLMALRDAGTEKVEGKWYREDEILDDKWVERSRKKGQEKGSQADTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEH

Query:  FRSDAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPAVYEMLGKVTGE
        FR DAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSR+MEMEADYIGLLLIASAGYDPRVAPAVYE LGKVTGE
Subjt:  FRSDAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPAVYEMLGKVTGE

Query:  SALRDYLSTHPSGKKRAQLLAQAKVMEEALNVYREVRAGRGVEGFL
        SALRDYLSTHPSGKKRAQLLAQAKVMEEAL VYRE RAG GVEGFL
Subjt:  SALRDYLSTHPSGKKRAQLLAQAKVMEEALNVYREVRAGRGVEGFL

XP_038877447.1 mitochondrial metalloendopeptidase OMA1 [Benincasa hispida]3.5e-22888.79Show/hide
Query:  MSCFRKSKFAFDAFRNLSSKIFPRDLIPDSGSRISHSQYSFTAGKRSNSYGFQSASPIIQRFGRQVGENRRLYNPFLGNSKRFYYVDRYRVQHFKPRGPR
        M+C+RKSKFAFDAFRN SSKIFP+D I  S SRISHS YSF +GK SNS+GFQS SPIIQRFGRQVGE RR YNPF G+SKRFYYVDRYRVQHFKPRGPR
Subjt:  MSCFRKSKFAFDAFRNLSSKIFPRDLIPDSGSRISHSQYSFTAGKRSNSYGFQSASPIIQRFGRQVGENRRLYNPFLGNSKRFYYVDRYRVQHFKPRGPR

Query:  RWFQDPRNVFIVVIAGSGVFVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLRQENVWSD
        RWFQDPR V IVV+AGSGVF+TVYYGNLET+PYTKRRHFVLLSR MERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLRQENVWSD
Subjt:  RWFQDPRNVFIVVIAGSGVFVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLRQENVWSD

Query:  LGYASEAVMGAPEGSGHETLMALRDAGTEKVEGKWYREDEILDDKWVERSRKKGQEKGSQADTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEH
        LGYASEAV+G PEGSG ETLMALRD+G EK+EGKWYREDEI DDKWVE SRKKGQEKGSQA+TSHLDGLNWE+LVVNEPVVNAFCLPGGKIV+FTGLLEH
Subjt:  LGYASEAVMGAPEGSGHETLMALRDAGTEKVEGKWYREDEILDDKWVERSRKKGQEKGSQADTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEH

Query:  FRSDAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPAVYEMLGKVTGE
        FRSDAEIATIIGHE+GHAVARH AEG+TKNLGF++LQLILYQF+MPDIVN MSTLFLRLPFSRRME+EADYIGLLLIASAGYDPR+AP VYE LGK+TGE
Subjt:  FRSDAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPAVYEMLGKVTGE

Query:  SALRDYLSTHPSGKKRAQLLAQAKVMEEALNVYREVRAGRGVEGFL
        SALRDYLSTHPSGKKRAQLLAQAKVMEEAL +YREVRAG GVEGFL
Subjt:  SALRDYLSTHPSGKKRAQLLAQAKVMEEALNVYREVRAGRGVEGFL

TrEMBL top hitse value%identityAlignment
A0A0A0LB58 Peptidase_M48 domain-containing protein2.4e-22287.22Show/hide
Query:  MSCFRKSKFAFDAFRNLSSKIFPRDLIPDSGSRISHSQYSFTAGKRSNSYGFQSASPIIQRFGRQVGENRRLYNPFLGNSKRFYYVDRYRVQHFKPRGPR
        M+CFRKSKF FDAFRN SSKIFP+DLI  S SRISH+ YSF++GK SNS+GFQS SPI++RF    GE  R YNPF G+SKRFYYVDRYR+QHFKPRGPR
Subjt:  MSCFRKSKFAFDAFRNLSSKIFPRDLIPDSGSRISHSQYSFTAGKRSNSYGFQSASPIIQRFGRQVGENRRLYNPFLGNSKRFYYVDRYRVQHFKPRGPR

Query:  RWFQDPRNVFIVVIAGSGVFVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLRQENVWSD
        RWFQDPR + IVV+ GSGVF+TVYYGNLET+PYTKRRHFVLLS+ MER++GES+FEQMKAAFKGKILPA+HPESVRVRLIAKDIIEALQRGLRQENVW+D
Subjt:  RWFQDPRNVFIVVIAGSGVFVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLRQENVWSD

Query:  LGYASEAVMGAPEGSGHETLMALRDAGTEKVEGKWYREDEILDDKWVERSRKKGQEKGSQADTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEH
        LGYASEAV+GAPEGSGHETLMAL+D+G+EK+EGKWYREDEILDDKWVE SRKKGQ  GSQA+TSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEH
Subjt:  LGYASEAVMGAPEGSGHETLMALRDAGTEKVEGKWYREDEILDDKWVERSRKKGQEKGSQADTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEH

Query:  FRSDAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPAVYEMLGKVTGE
        FRSDAEIATIIGHEV HAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAP VYE LGKVTG+
Subjt:  FRSDAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPAVYEMLGKVTGE

Query:  SALRDYLSTHPSGKKRAQLLAQAKVMEEALNVYREVRAGRGVEGFL
        SALRDYLSTHPSGKKRAQLLAQAKVMEEAL++YREVRAG G+EGFL
Subjt:  SALRDYLSTHPSGKKRAQLLAQAKVMEEALNVYREVRAGRGVEGFL

A0A6J1CHM3 uncharacterized protein LOC1110115011.8e-23090.6Show/hide
Query:  MSCFRKSKFAFDAFRNLSSKIFPRDLIPD-SGSRISHSQYSFTAGKRSNSYGFQSASPIIQRFGRQVGENRRLYNPFLGNSKRFYYVDRYRVQHFKPRGP
        MSC+R+SKF+FDAFRNLSSKIFP+D+I D   SRIS    SFTAGK+SNSYGFQ ASPII+RFG+QVGENRRLYNPFLG+SKRFYYVDRYRVQHFKPRGP
Subjt:  MSCFRKSKFAFDAFRNLSSKIFPRDLIPD-SGSRISHSQYSFTAGKRSNSYGFQSASPIIQRFGRQVGENRRLYNPFLGNSKRFYYVDRYRVQHFKPRGP

Query:  RRWFQDPRNVFIVVIAGSGVFVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLRQENVWS
        RRWFQDP+ V IVV  GSGVFVTVYYGNLETIPYTKRRHFV+LSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGL+QENVWS
Subjt:  RRWFQDPRNVFIVVIAGSGVFVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLRQENVWS

Query:  DLGYASEAVMGAPEGSGHETLMALRDAGTEKVEGKWYREDEILDDKWVERSRKKGQEKGSQADTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLE
        DLGYASEAV+GAPEGSGHETLMAL DAG E+VE KW REDEILDDKWVERSRKKGQE+GSQADTSHL+GLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLE
Subjt:  DLGYASEAVMGAPEGSGHETLMALRDAGTEKVEGKWYREDEILDDKWVERSRKKGQEKGSQADTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLE

Query:  HFRSDAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPAVYEMLGKVTG
        HFRSDAEIATIIGHEVGHAVARH+AEGITKNLGFA+LQ+ILYQF+MPDIVNTMS LFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAP VYEMLGKVTG
Subjt:  HFRSDAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPAVYEMLGKVTG

Query:  ESALRDYLSTHPSGKKRAQLLAQAKVMEEALNVYREVRAGRGVEGFL
        +SALRDYLSTHPSGKKRAQLLAQAKVMEEAL+VYREVRAG GVEGFL
Subjt:  ESALRDYLSTHPSGKKRAQLLAQAKVMEEALNVYREVRAGRGVEGFL

A0A6J1E3I5 uncharacterized protein LOC111430479 isoform X11.4e-22589.29Show/hide
Query:  MSCFRK--SKFAFDAFRNLSSKIFPRDLIPDSGSRISHSQYSFTAGKRSNSYGFQSASPIIQRFGRQVGENRRLYNPFLGNSKRFYYVDRYRVQHFKPRG
        MSC+RK  SK AFDAFRNLSSKIFP +LI DS SRISH  YSFTAG+ SNSYGFQS+SPIIQRFGRQV ENRRLYNPF G+SKRFYYVD YRVQHFKPRG
Subjt:  MSCFRK--SKFAFDAFRNLSSKIFPRDLIPDSGSRISHSQYSFTAGKRSNSYGFQSASPIIQRFGRQVGENRRLYNPFLGNSKRFYYVDRYRVQHFKPRG

Query:  PRRWFQDPRNVFIVVIAGSGVFVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLRQENVW
        PRRWFQDPR V +VV AGSGVF+TVYYGNLETIPYTKRRHFVLLSRAMER LGESQFEQMKAAFKGKILPAVHPESVRVRLIAKD+I+ALQRGL+QENVW
Subjt:  PRRWFQDPRNVFIVVIAGSGVFVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLRQENVW

Query:  SDLGYASEAVMGAPEGSGHETLMALRDAGTEKVEGKWYREDEILDDKWVERSRKKGQEKGSQADTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLL
        SDLGYASEA +GAPEGSG+ETLMALRD+G  K+EGKWYREDEILDDKWVERSRKKG+++GSQAD SHLDGL WEVLVVNE VVNAFCLPGGKIVVFTGLL
Subjt:  SDLGYASEAVMGAPEGSGHETLMALRDAGTEKVEGKWYREDEILDDKWVERSRKKGQEKGSQADTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLL

Query:  EHFRSDAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPAVYEMLGKVT
        EHFRSDAEIATIIGHE+GHAVARH+AEGITKNL FAVLQLILYQF+MPDIVNTMSTLFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAP VYE LGKV+
Subjt:  EHFRSDAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPAVYEMLGKVT

Query:  GESALRDYLSTHPSGKKRAQLLAQAKVMEEALNVYREVRAGRGVEGFL
        G+SALRDYLSTHPSGKKRAQLLAQAKVMEEAL+VYREVRAGRGVEGFL
Subjt:  GESALRDYLSTHPSGKKRAQLLAQAKVMEEALNVYREVRAGRGVEGFL

A0A6J1G2G4 uncharacterized protein LOC1114500443.2e-22790.81Show/hide
Query:  MSCFRKSKFAFDAFRNLSSKIFPRDLIPDSGSRISHSQYSFTAGKRSNSYGFQSASPIIQRFGRQVGENRRLYNPFLGNSKRFYYVDRYRVQHFKPRGPR
        MSC+RKSKFA DAFR+LSSKIFP+D I D GSRISHS+ SFTA         QSASPIIQRFGRQVGENR+L NPFLG SKRFYYVDRYRV+HFKPRGPR
Subjt:  MSCFRKSKFAFDAFRNLSSKIFPRDLIPDSGSRISHSQYSFTAGKRSNSYGFQSASPIIQRFGRQVGENRRLYNPFLGNSKRFYYVDRYRVQHFKPRGPR

Query:  RWFQDPRNVFIVVIAGSGVFVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLRQENVWSD
        RWF+DPRN+ IVVIAGSGV VTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGL+QENVWSD
Subjt:  RWFQDPRNVFIVVIAGSGVFVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLRQENVWSD

Query:  LGYASEAVMGAPEGSGHETLMALRDAGTEKVEGKWYREDEILDDKWVERSRKKGQEKGSQADTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEH
        LGYASEAVMGAPEGSGHETLMALR +G EK+E KWYREDE+LDDKWVE SRKKGQEKGSQA+TSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEH
Subjt:  LGYASEAVMGAPEGSGHETLMALRDAGTEKVEGKWYREDEILDDKWVERSRKKGQEKGSQADTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEH

Query:  FRSDAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPAVYEMLGKVTGE
        FR DAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSR+MEMEADYIGLLLIASAGYDPRVAPAVYE LGKVTGE
Subjt:  FRSDAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPAVYEMLGKVTGE

Query:  SALRDYLSTHPSGKKRAQLLAQAKVMEEALNVYREVRAGRGVEGFL
        SALRDYLSTHPSGKKRAQLLAQAKVMEEAL VYRE RAG GVEGFL
Subjt:  SALRDYLSTHPSGKKRAQLLAQAKVMEEALNVYREVRAGRGVEGFL

A0A6J1HYD1 uncharacterized protein LOC1114673842.2e-22389.91Show/hide
Query:  MSCFRKSKFAFDAFRNLSSKIFPRDLIPDSGSRISHSQYSFTAGKRSNSYGFQSASPIIQRFGRQVGENRRLYNPFLGNSKRFYYVDRYRVQHFKPRGPR
        MSC+ KSKFA DAFR+LSSKIFP+D   D GSRIS S + F A         QSASPIIQRFGRQVGENR+L NPFLG SKRFYYVDRYRV+HFKPRGPR
Subjt:  MSCFRKSKFAFDAFRNLSSKIFPRDLIPDSGSRISHSQYSFTAGKRSNSYGFQSASPIIQRFGRQVGENRRLYNPFLGNSKRFYYVDRYRVQHFKPRGPR

Query:  RWFQDPRNVFIVVIAGSGVFVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLRQENVWSD
        RWF+DPRN+ IVVIAGSGV VTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLRQENVWSD
Subjt:  RWFQDPRNVFIVVIAGSGVFVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLRQENVWSD

Query:  LGYASEAVMGAPEGSGHETLMALRDAGTEKVEGKWYREDEILDDKWVERSRKKGQEKGSQADTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEH
        LGYASEAVMGAPEGSGHETLMALR +G EK+E KWY EDEILDDKWVE SRKKGQEKGSQA+TSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEH
Subjt:  LGYASEAVMGAPEGSGHETLMALRDAGTEKVEGKWYREDEILDDKWVERSRKKGQEKGSQADTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEH

Query:  FRSDAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPAVYEMLGKVTGE
        FR DAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSR+MEMEADYIGLLLIASAGYDPRVAPAVYE LGKVTGE
Subjt:  FRSDAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPAVYEMLGKVTGE

Query:  SALRDYLSTHPSGKKRAQLLAQAKVMEEALNVYREVRAGRGVEGFL
        SALRDYLSTHPSGKKRAQLLAQAKVMEEAL VYRE RAG GVEGFL
Subjt:  SALRDYLSTHPSGKKRAQLLAQAKVMEEALNVYREVRAGRGVEGFL

SwissProt top hitse value%identityAlignment
E9QBI7 Metalloendopeptidase OMA1, mitochondrial4.3e-1932.26Show/hide
Query:  DTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRSDAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPD------------IV
        D + +  + W V VV+ P +NAF LP G+I VFTG+L       ++  I+GHE+ HA+  H+AE  + +    +L L+L   I               I 
Subjt:  DTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRSDAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPD------------IV

Query:  NTMSTLFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPAVYEML---GKVTGESALRDYLSTHPSGKKRAQLLAQAKVMEEALNV
          +       PFSR++E EAD +GL + A A  D R  P  +E +    +++G+  + ++LSTHPS + R + L   +++ EAL +
Subjt:  NTMSTLFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPAVYEML---GKVTGESALRDYLSTHPSGKKRAQLLAQAKVMEEALNV

P36163 Mitochondrial metalloendopeptidase OMA12.7e-2927.3Show/hide
Query:  RNVFIVVIAGSGVFVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLRQENVWSDLGYASE
        R    ++  G  +F   YY +L+  P + R  F+ +SR +E  +G   ++ +    + +ILP  HP S+++  I   I+EA                   
Subjt:  RNVFIVVIAGSGVFVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLRQENVWSDLGYASE

Query:  AVMGAPEGSGHETLMALRDAGTEKVEGKWYREDEILDDKWVERSRKKGQEKGSQADTSHLDGLNWEVLVVNEPVV--NAFCLPGGKIVVFTGLLEHFRSD
                                                          K    D S LDG+ WE+ VVN+P    NAF LPGGK+ +F+ +L    +D
Subjt:  AVMGAPEGSGHETLMALRDAGTEKVEGKWYREDEILDDKWVERSRKKGQEKGSQADTSHLDGLNWEVLVVNEPVV--NAFCLPGGKIVVFTGLLEHFRSD

Query:  AEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNT-MSTLFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPAVYEMLGKV-----T
          IAT++ HE  H +ARH+AE ++K   +++L L+LY       +N  +   FLR+P SR+ME EADYIGL++++ A + P+ +  V+E +         
Subjt:  AEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNT-MSTLFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPAVYEMLGKV-----T

Query:  GESALRDYLSTHPSGKKRAQLLAQAKVMEEALNVYRE
        G     ++LSTHP+  +R + +  +K + +A  +Y +
Subjt:  GESALRDYLSTHPSGKKRAQLLAQAKVMEEALNVYRE

Q5A663 Mitochondrial metalloendopeptidase OMA18.1e-1824.6Show/hide
Query:  GVFVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLRQENVWSDLGYASEAVMGAPEGSGH
        G  +  Y  NL   PYT R  F+ +   +E ++G+  + Q+   F+ +ILP  +P   RV  I   +++                               
Subjt:  GVFVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLRQENVWSDLGYASEAVMGAPEGSGH

Query:  ETLMALRDAGTEKVEGKWYREDEILDDKWVERSRKKGQEKGSQADTSHLDGLNWEVLVVNEPVV--NAFCLPGGKIVVFTGLLEHFRSDAEIATIIGHEV
           +AL D   + +  ++                            +HL  L WE+ ++    +  NAF LP GKI +F+ ++   +++  +AT++ HE+
Subjt:  ETLMALRDAGTEKVEGKWYREDEILDDKWVERSRKKGQEKGSQADTSHLDGLNWEVLVVNEPVV--NAFCLPGGKIVVFTGLLEHFRSDAEIATIIGHEV

Query:  GHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVN-TMSTLFLRLPFSRRMEMEADYIGLLLIASAGYDP--------RVAPAVYEMLGKVTGESALR--
         H +A+HS+E ++K   + VL  ILY        N  +    L +  SR ME EAD+IG  L+A A ++P        R++ A  +  G V+ E      
Subjt:  GHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVN-TMSTLFLRLPFSRRMEMEADYIGLLLIASAGYDP--------RVAPAVYEMLGKVTGESALR--

Query:  -DYLSTHPSGKKR
         ++ STHP+  +R
Subjt:  -DYLSTHPSGKKR

Q96E52 Metalloendopeptidase OMA1, mitochondrial7.3e-1933.33Show/hide
Query:  DTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRSDAEIATIIGHEVGHAVARHSAE--GITKNLGF-AVLQLILYQFIMPD---------IV
        D   +  +NW + VV+ P++NAF LP G++ VFTG L       +++ ++GHE+ HAV  H+AE  G+   L F  ++ L +   I P          I 
Subjt:  DTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRSDAEIATIIGHEVGHAVARHSAE--GITKNLGF-AVLQLILYQFIMPD---------IV

Query:  NTMSTLFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPAVY---EMLGKVTGESALRDYLSTHPSGKKRAQ----LLAQAKVMEEALN
        + +       P+SR++E EAD IGLLL A A  D R +   +   E +  + G+  + ++LSTHPS   R +    L+ QA  + E  N
Subjt:  NTMSTLFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPAVY---EMLGKVTGESALRDYLSTHPSGKKRAQ----LLAQAKVMEEALN

Q9P7G4 Mitochondrial metalloendopeptidase OMA12.4e-2537.99Show/hide
Query:  SHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRSDAEIATIIGHEVGHAVARHSAEGI--TKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPF
        S +  L WE+ V+ +P  NAF LPGGK+ VF G+L   + +  +A ++ HE  H VARHSAE I  T+ +   V        +   + + +    L LPF
Subjt:  SHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRSDAEIATIIGHEVGHAVARHSAEGI--TKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPF

Query:  SRRMEMEADYIGLLLIASAGYDPRVAPAVYEMLGKVTGE-SALRDYLSTHPSGKKR----AQLLAQAKVMEEALNVYRE
        SR+ME EADYIGL+L++ A +DP  A  ++E +    G+      + STHPS KKR     + L +A+V  E  + Y E
Subjt:  SRRMEMEADYIGLLLIASAGYDPRVAPAVYEMLGKVTGE-SALRDYLSTHPSGKKR----AQLLAQAKVMEEALNVYRE

Arabidopsis top hitse value%identityAlignment
AT5G51740.1 Peptidase family M48 family protein3.6e-15462.31Show/hide
Query:  MSCFRKSKFAFDAF-RNLSSKIFPRDLIPDSGSRISHSQYSFTAGKRSNSYGFQSASPIIQRFGRQVG----------ENRRLYNPFLGNSKRFYYVDRY
        MS +R++K  FD+  RN++ KI PR  +    SRI++       G  + S  F S S       R+VG           NR  YNPFL   KR+YYVDRY
Subjt:  MSCFRKSKFAFDAF-RNLSSKIFPRDLIPDSGSRISHSQYSFTAGKRSNSYGFQSASPIIQRFGRQVG----------ENRRLYNPFLGNSKRFYYVDRY

Query:  RVQHFKPRGPRRWFQDPRNVFIVVIAGSGVFVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQ
        +V+HFKPRGP RWFQ+PR VF VV+ GS   +T+  GN ETIPYTKR HF+LLS+ ME+ LGE+QFEQ+K  ++GKILPA HPES+RVRLIAK++I+ALQ
Subjt:  RVQHFKPRGPRRWFQDPRNVFIVVIAGSGVFVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQ

Query:  RGLRQENVWSDLGYAS-EAVMGAPEGSGHETLMALRDAGTEKVEGKWYREDEILDDKWVERSRKKGQEKGSQADTSHLDGLNWEVLVVNEPVVNAFCLPG
        RGL  E VWSDLGYAS E+ +G     G + +           + KW +ED++LDD+W+++SRKK  +  + A TSHL+G++WEVLVVNEP+VNAFCLP 
Subjt:  RGLRQENVWSDLGYAS-EAVMGAPEGSGHETLMALRDAGTEKVEGKWYREDEILDDKWVERSRKKGQEKGSQADTSHLDGLNWEVLVVNEPVVNAFCLPG

Query:  GKIVVFTGLLEHFRSDAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAP
        GKIVVFTGLL HF+SDAE+AT+IGHEVGHAVARH AEGITKNL FA+LQL+LYQF+MPD+VNTMS LFLRLPFSR+ME+EADYIGLLL+ASAGYDPRVAP
Subjt:  GKIVVFTGLLEHFRSDAEIATIIGHEVGHAVARHSAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAP

Query:  AVYEMLGKVTGESALRDYLSTHPSGKKRAQLLAQAKVMEEALNVYREVRAGR-GVEGFL
         VYE LGK+ G+ AL DYLSTHPSGKKR++LLAQA VMEEAL +YREV+AGR GVEGFL
Subjt:  AVYEMLGKVTGESALRDYLSTHPSGKKRAQLLAQAKVMEEALNVYREVRAGR-GVEGFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGTTTTTATCAATCTAGAGATATGAGTTGCTTTCGAAAATCGAAGTTTGCATTTGATGCATTTCGGAACTTGTCTTCGAAGATTTTTCCCAGAGATTTAATTCCAGA
TTCTGGATCAAGAATTTCCCATAGTCAGTATTCGTTTACGGCTGGAAAAAGGTCAAATTCTTATGGGTTTCAATCGGCTTCTCCAATTATACAAAGATTTGGAAGACAAG
TTGGAGAGAATAGGAGGCTATATAATCCCTTTTTGGGTAATTCCAAGAGATTTTACTATGTCGATCGCTACCGCGTCCAGCATTTTAAGCCCAGAGGACCTCGGCGATGG
TTTCAAGATCCGAGAAACGTATTCATTGTTGTGATTGCGGGTTCTGGAGTTTTTGTCACTGTGTATTATGGGAATTTAGAAACCATACCTTATACAAAACGGAGGCATTT
TGTGCTCTTGTCCAGAGCTATGGAGAGGAGGCTCGGAGAGTCGCAATTTGAGCAAATGAAGGCTGCTTTTAAGGGTAAAATCTTGCCTGCTGTACACCCAGAAAGTGTGA
GAGTAAGATTGATAGCTAAGGATATAATTGAGGCATTACAAAGAGGGTTGAGGCAAGAGAATGTGTGGAGTGATTTAGGCTATGCATCAGAGGCTGTGATGGGAGCCCCT
GAAGGTAGTGGCCATGAGACATTGATGGCGCTTAGGGATGCTGGGACTGAGAAGGTTGAAGGTAAATGGTACCGCGAAGACGAGATTCTTGATGACAAATGGGTTGAACG
CAGTAGAAAGAAGGGTCAGGAAAAGGGGTCCCAAGCAGATACCTCGCATTTGGATGGATTGAATTGGGAGGTTTTGGTGGTTAATGAACCGGTTGTTAATGCATTTTGCT
TGCCTGGTGGGAAGATTGTTGTTTTCACTGGCTTGCTCGAGCACTTTAGAAGTGATGCAGAAATTGCAACTATCATTGGTCATGAGGTTGGGCATGCGGTGGCTCGACAT
TCTGCAGAGGGCATTACAAAGAACCTGGGGTTTGCCGTTTTACAACTGATCCTTTATCAGTTCATCATGCCTGATATTGTGAACACTATGTCAACACTTTTCTTGAGGCT
TCCTTTTTCTAGAAGGATGGAAATGGAAGCAGATTACATTGGTCTGCTGTTGATTGCCTCTGCTGGATACGATCCCAGAGTTGCACCGGCCGTATATGAGATGTTGGGTA
AGGTCACCGGTGAGTCGGCGCTGAGAGATTATCTTTCTACTCATCCATCAGGAAAGAAGAGAGCACAGTTGCTAGCTCAAGCTAAGGTTATGGAGGAAGCACTTAACGTT
TACAGAGAAGTAAGAGCCGGACGTGGGGTTGAAGGCTTCCTATAA
mRNA sequenceShow/hide mRNA sequence
GAAATAGTATTCTTCGGTTGATGGAATTGGGAACTTGGGTAGAAGATTCTCGACCTGAAAAGTGAAAAACCGATCCATTTCGTTTGAATTGGAAGCTCAAATATTCAGAT
GATGAGTCCACACATGCCCCAAATAATCGAAGCATCGACTCTCCACCGTAGGGTTCATCTTCATGCAACCAAAAACTTCCAGATAGGTCCGGATCAATCCATTTTACTTC
CCAGCTTTATTCCTCAATTGGTAATTGAAGGTCACTATTTATTATTGTTCAATGTTTCTTCTTCCACTTCTTTTCCCCGCTTTTGAATTTTAATCCTACTTTCCCCTCGT
CCAAAAATTGTCCGTGGAGTGACAGATGAGAGATTTATTTTCTGATTGAATCCGATCTATTGGATTTCACATGCGTTTTTATCAATCTAGAGATATGAGTTGCTTTCGAA
AATCGAAGTTTGCATTTGATGCATTTCGGAACTTGTCTTCGAAGATTTTTCCCAGAGATTTAATTCCAGATTCTGGATCAAGAATTTCCCATAGTCAGTATTCGTTTACG
GCTGGAAAAAGGTCAAATTCTTATGGGTTTCAATCGGCTTCTCCAATTATACAAAGATTTGGAAGACAAGTTGGAGAGAATAGGAGGCTATATAATCCCTTTTTGGGTAA
TTCCAAGAGATTTTACTATGTCGATCGCTACCGCGTCCAGCATTTTAAGCCCAGAGGACCTCGGCGATGGTTTCAAGATCCGAGAAACGTATTCATTGTTGTGATTGCGG
GTTCTGGAGTTTTTGTCACTGTGTATTATGGGAATTTAGAAACCATACCTTATACAAAACGGAGGCATTTTGTGCTCTTGTCCAGAGCTATGGAGAGGAGGCTCGGAGAG
TCGCAATTTGAGCAAATGAAGGCTGCTTTTAAGGGTAAAATCTTGCCTGCTGTACACCCAGAAAGTGTGAGAGTAAGATTGATAGCTAAGGATATAATTGAGGCATTACA
AAGAGGGTTGAGGCAAGAGAATGTGTGGAGTGATTTAGGCTATGCATCAGAGGCTGTGATGGGAGCCCCTGAAGGTAGTGGCCATGAGACATTGATGGCGCTTAGGGATG
CTGGGACTGAGAAGGTTGAAGGTAAATGGTACCGCGAAGACGAGATTCTTGATGACAAATGGGTTGAACGCAGTAGAAAGAAGGGTCAGGAAAAGGGGTCCCAAGCAGAT
ACCTCGCATTTGGATGGATTGAATTGGGAGGTTTTGGTGGTTAATGAACCGGTTGTTAATGCATTTTGCTTGCCTGGTGGGAAGATTGTTGTTTTCACTGGCTTGCTCGA
GCACTTTAGAAGTGATGCAGAAATTGCAACTATCATTGGTCATGAGGTTGGGCATGCGGTGGCTCGACATTCTGCAGAGGGCATTACAAAGAACCTGGGGTTTGCCGTTT
TACAACTGATCCTTTATCAGTTCATCATGCCTGATATTGTGAACACTATGTCAACACTTTTCTTGAGGCTTCCTTTTTCTAGAAGGATGGAAATGGAAGCAGATTACATT
GGTCTGCTGTTGATTGCCTCTGCTGGATACGATCCCAGAGTTGCACCGGCCGTATATGAGATGTTGGGTAAGGTCACCGGTGAGTCGGCGCTGAGAGATTATCTTTCTAC
TCATCCATCAGGAAAGAAGAGAGCACAGTTGCTAGCTCAAGCTAAGGTTATGGAGGAAGCACTTAACGTTTACAGAGAAGTAAGAGCCGGACGTGGGGTTGAAGGCTTCC
TATAACCTGTGATAGAAACTTGAAATCAGTTGATGTAATGTTAAAGCATGATATCAGATTGTAATTCCTTGATTTCAACCATGATCCAAATGTTCTAGTGATTTTTTGGT
GTTA
Protein sequenceShow/hide protein sequence
MRFYQSRDMSCFRKSKFAFDAFRNLSSKIFPRDLIPDSGSRISHSQYSFTAGKRSNSYGFQSASPIIQRFGRQVGENRRLYNPFLGNSKRFYYVDRYRVQHFKPRGPRRW
FQDPRNVFIVVIAGSGVFVTVYYGNLETIPYTKRRHFVLLSRAMERRLGESQFEQMKAAFKGKILPAVHPESVRVRLIAKDIIEALQRGLRQENVWSDLGYASEAVMGAP
EGSGHETLMALRDAGTEKVEGKWYREDEILDDKWVERSRKKGQEKGSQADTSHLDGLNWEVLVVNEPVVNAFCLPGGKIVVFTGLLEHFRSDAEIATIIGHEVGHAVARH
SAEGITKNLGFAVLQLILYQFIMPDIVNTMSTLFLRLPFSRRMEMEADYIGLLLIASAGYDPRVAPAVYEMLGKVTGESALRDYLSTHPSGKKRAQLLAQAKVMEEALNV
YREVRAGRGVEGFL