; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh03G009860 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh03G009860
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionOTU domain-containing protein 3 isoform X1
Genome locationCmo_Chr03:7419963..7432906
RNA-Seq ExpressionCmoCh03G009860
SyntenyCmoCh03G009860
Gene Ontology termsNA
InterPro domainsIPR003323 - OTU domain
IPR004027 - SEC-C motif
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6604170.1 OVARIAN TUMOR DOMAIN-containing deubiquitinating enzyme 7, partial [Cucurbita argyrosperma subsp. sororia]5.8e-24697.99Show/hide
Query:  MMRTKKREGSGYLRLNSGAQVRLSNEQFLEHRPQPEPFFDIQHNENTAVAVSPQAAESVVQDFVGMVKTKHQKSNPKRHPQNKKPGKSPDISQFRAQLDL
        MMRTKKREGSGYLRLNSGAQVRLSNEQFLEHRPQPEP FDIQHNENTAVAVSPQAAESVVQDFVGMVKTKHQKSNPKRHPQNKKPGKSPDISQFRAQLDL
Subjt:  MMRTKKREGSGYLRLNSGAQVRLSNEQFLEHRPQPEPFFDIQHNENTAVAVSPQAAESVVQDFVGMVKTKHQKSNPKRHPQNKKPGKSPDISQFRAQLDL

Query:  LNLKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCESMEKDGTWAGHLELQAASLVTHSNICIHRMSSPRWYI
        LNLKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRG FEPFIEDDVPF+EYCESMEKDGTWAGHLELQAASLVTHSNICIHRMSSPRWYI
Subjt:  LNLKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCESMEKDGTWAGHLELQAASLVTHSNICIHRMSSPRWYI

Query:  RNFEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLITIKGDTVPSASSLQAKVFSTSSQKRGQSAISLGNIKLVMAGSGCQNSKEVEKVLVQVDGDVD
        RNFEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLITIKGDTVPSASSLQAKVFSTSSQKRGQ+AISLGNIKLVMAGSGCQNSKEVEKVLVQVDGDVD
Subjt:  RNFEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLITIKGDTVPSASSLQAKVFSTSSQKRGQSAISLGNIKLVMAGSGCQNSKEVEKVLVQVDGDVD

Query:  AAIEFLVAEQAAEEHKEPTESTLCHIDSSFGSDETKDCKQLEERTEEKHDKVDSSNHNTKHSISNSSQSDDKKIPRNKVCPCGSKKKYKACCGSVAASSS
        AAIEFLVAEQAAEEHKEPTESTLCHIDS FGSDETKDCKQLEERTEEK+DKVDSSNHNTKHSIS+SSQSDDK+IPRNKVCPCGSKKKYKACCGSVAASSS
Subjt:  AAIEFLVAEQAAEEHKEPTESTLCHIDSSFGSDETKDCKQLEERTEEKHDKVDSSNHNTKHSISNSSQSDDKKIPRNKVCPCGSKKKYKACCGSVAASSS

Query:  GKFIVNKTIDSKKTRKERKTGKKGGPAKVEVSSGSDGLPHDLGALFCI
        GKFIVNKTIDSKKTRKERKTGKKGGPAKVEVSSGSDGLPHDLGAL CI
Subjt:  GKFIVNKTIDSKKTRKERKTGKKGGPAKVEVSSGSDGLPHDLGALFCI

KAG7034335.1 OTU domain-containing protein 3, partial [Cucurbita argyrosperma subsp. argyrosperma]5.1e-24294.81Show/hide
Query:  RTKKREGSGYLRLNSGAQVRLSNEQFLEHRPQPEPFFDIQHNENTAVAVSPQAAESVVQDFVGMVKTKHQKSNPKRHPQNKKPGKSPDISQFRAQLDLLN
        RTKKREGSGYLRLNSGAQVRLSNEQFLEHRPQPEP FDIQHNENTAVAVSPQAAESVVQDFVGMVKTKHQKSNPKRHPQNKKPGKSPDISQFRAQLDLLN
Subjt:  RTKKREGSGYLRLNSGAQVRLSNEQFLEHRPQPEPFFDIQHNENTAVAVSPQAAESVVQDFVGMVKTKHQKSNPKRHPQNKKPGKSPDISQFRAQLDLLN

Query:  LKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCESMEKDGTWAGHLELQAASLVTHSNICIHRMSSPRWYIRN
        LKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRG FEPFIEDDVPF+EYCESMEKDGTWAGHLELQAASLVTHSNICIHRMSSPRWYIRN
Subjt:  LKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCESMEKDGTWAGHLELQAASLVTHSNICIHRMSSPRWYIRN

Query:  FEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLITI----------------KGDTVPSASSLQAKVFSTSSQKRGQSAISLGNIKLVMAGSGCQNSK
        FEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLITI                KGDTVPSASSLQAKVFSTSSQKRGQSAISLGNIKLVMAGSGCQNSK
Subjt:  FEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLITI----------------KGDTVPSASSLQAKVFSTSSQKRGQSAISLGNIKLVMAGSGCQNSK

Query:  EVEKVLVQVDGDVDAAIEFLVAEQAAEEHKEPTESTLCHIDSSFGSDETKDCKQLEERTEEKHDKVDSSNHNTKHSISNSSQSDDKKIPRNKVCPCGSKK
        EVEKVLVQVDGDVDAAIEFLVAEQAAEEHKEPTESTLCHIDS FGSDETKDCKQLEERTEEK+DKVDSSNHNTKHSIS+SSQSDDK+IPRNKVCPCGSKK
Subjt:  EVEKVLVQVDGDVDAAIEFLVAEQAAEEHKEPTESTLCHIDSSFGSDETKDCKQLEERTEEKHDKVDSSNHNTKHSISNSSQSDDKKIPRNKVCPCGSKK

Query:  KYKACCGSVAASSSGKFIVNKTIDSKKTRKERKTGKKGGPAKVEVSSGSDGLPHDLGALFCI
        KYKACCGSVAASSSGKFIVNKTIDSKKTRKERKTGKKGGPAKVEVSSGSDGLPHDLGAL CI
Subjt:  KYKACCGSVAASSSGKFIVNKTIDSKKTRKERKTGKKGGPAKVEVSSGSDGLPHDLGALFCI

XP_022949748.1 OTU domain-containing protein 3 isoform X1 [Cucurbita moschata]6.0e-21199.22Show/hide
Query:  MVKTKHQKSNPKRHPQNKKPGKSPDISQFRAQLDLLNLKIVQVTADGNCFF--RALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCE
        MVKTKHQKSNPKRHPQNKKPGKSPDISQFRAQLDLLNLKIVQVTADGNCFF  RALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCE
Subjt:  MVKTKHQKSNPKRHPQNKKPGKSPDISQFRAQLDLLNLKIVQVTADGNCFF--RALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCE

Query:  SMEKDGTWAGHLELQAASLVTHSNICIHRMSSPRWYIRNFEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLITIKGDTVPSASSLQAKVFSTSSQKR
        SMEKDGTWAGHLELQAASLVTHSNICIHRMSSPRWYIRNFEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLITIKGDTVPSASSLQAKVFSTSSQKR
Subjt:  SMEKDGTWAGHLELQAASLVTHSNICIHRMSSPRWYIRNFEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLITIKGDTVPSASSLQAKVFSTSSQKR

Query:  GQSAISLGNIKLVMAGSGCQNSKEVEKVLVQVDGDVDAAIEFLVAEQAAEEHKEPTESTLCHIDSSFGSDETKDCKQLEERTEEKHDKVDSSNHNTKHSI
        GQSAISLGNIKLVMAGSGCQNSKEVEKVLVQVDGDVDAAIEFLVAEQAAEEHKEPTESTLCHIDSSFGSDETKDCKQLEERTEEKHDKVDSSNHNTKHSI
Subjt:  GQSAISLGNIKLVMAGSGCQNSKEVEKVLVQVDGDVDAAIEFLVAEQAAEEHKEPTESTLCHIDSSFGSDETKDCKQLEERTEEKHDKVDSSNHNTKHSI

Query:  SNSSQSDDKKIPRNKVCPCGSKKKYKACCGSVAASSSGKFIVNKTIDSKKTRKERKTGKKGGPAKVEVSSGSDGLPHDLGALFCI
        SNSSQSDDKKIPRNKVCPCGSKKKYKACCGSVAASSSGKFIVNKTIDSKKTRKERKTGKKGGPAKVEVSSGSDGLPHDLGAL CI
Subjt:  SNSSQSDDKKIPRNKVCPCGSKKKYKACCGSVAASSSGKFIVNKTIDSKKTRKERKTGKKGGPAKVEVSSGSDGLPHDLGALFCI

XP_022949750.1 OTU domain-containing protein 3 isoform X2 [Cucurbita moschata]1.9e-21299.74Show/hide
Query:  MVKTKHQKSNPKRHPQNKKPGKSPDISQFRAQLDLLNLKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCESM
        MVKTKHQKSNPKRHPQNKKPGKSPDISQFRAQLDLLNLKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCESM
Subjt:  MVKTKHQKSNPKRHPQNKKPGKSPDISQFRAQLDLLNLKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCESM

Query:  EKDGTWAGHLELQAASLVTHSNICIHRMSSPRWYIRNFEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLITIKGDTVPSASSLQAKVFSTSSQKRGQ
        EKDGTWAGHLELQAASLVTHSNICIHRMSSPRWYIRNFEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLITIKGDTVPSASSLQAKVFSTSSQKRGQ
Subjt:  EKDGTWAGHLELQAASLVTHSNICIHRMSSPRWYIRNFEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLITIKGDTVPSASSLQAKVFSTSSQKRGQ

Query:  SAISLGNIKLVMAGSGCQNSKEVEKVLVQVDGDVDAAIEFLVAEQAAEEHKEPTESTLCHIDSSFGSDETKDCKQLEERTEEKHDKVDSSNHNTKHSISN
        SAISLGNIKLVMAGSGCQNSKEVEKVLVQVDGDVDAAIEFLVAEQAAEEHKEPTESTLCHIDSSFGSDETKDCKQLEERTEEKHDKVDSSNHNTKHSISN
Subjt:  SAISLGNIKLVMAGSGCQNSKEVEKVLVQVDGDVDAAIEFLVAEQAAEEHKEPTESTLCHIDSSFGSDETKDCKQLEERTEEKHDKVDSSNHNTKHSISN

Query:  SSQSDDKKIPRNKVCPCGSKKKYKACCGSVAASSSGKFIVNKTIDSKKTRKERKTGKKGGPAKVEVSSGSDGLPHDLGALFCI
        SSQSDDKKIPRNKVCPCGSKKKYKACCGSVAASSSGKFIVNKTIDSKKTRKERKTGKKGGPAKVEVSSGSDGLPHDLGAL CI
Subjt:  SSQSDDKKIPRNKVCPCGSKKKYKACCGSVAASSSGKFIVNKTIDSKKTRKERKTGKKGGPAKVEVSSGSDGLPHDLGALFCI

XP_023543646.1 OTU domain-containing protein 3 isoform X2 [Cucurbita pepo subsp. pepo]1.8e-21098.69Show/hide
Query:  MVKTKHQKSNPKRHPQNKKPGKSPDISQFRAQLDLLNLKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCESM
        MVKTKHQKSNPKRHPQNKKPGKSPDISQFRAQLDLLNLKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRG FEPFIEDDVPFDEYCESM
Subjt:  MVKTKHQKSNPKRHPQNKKPGKSPDISQFRAQLDLLNLKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCESM

Query:  EKDGTWAGHLELQAASLVTHSNICIHRMSSPRWYIRNFEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLITIKGDTVPSASSLQAKVFSTSSQKRGQ
        EKDGTWAGHLELQAASLVTHSNICIHRMSSPRWYIRNFEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLITIKGDTVPSASSLQAKVFSTSSQKRGQ
Subjt:  EKDGTWAGHLELQAASLVTHSNICIHRMSSPRWYIRNFEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLITIKGDTVPSASSLQAKVFSTSSQKRGQ

Query:  SAISLGNIKLVMAGSGCQNSKEVEKVLVQVDGDVDAAIEFLVAEQAAEEHKEPTESTLCHIDSSFGSDETKDCKQLEERTEEKHDKVDSSNHNTKHSISN
        SAISLGNIKLVMAGSGCQNSKEVEKVLVQVDGDVDAAIEFLVAEQAAEEHKEPTESTLCHIDSSFGSDETKDCKQLEERTEEKH+KVDSSNHNTKHSIS+
Subjt:  SAISLGNIKLVMAGSGCQNSKEVEKVLVQVDGDVDAAIEFLVAEQAAEEHKEPTESTLCHIDSSFGSDETKDCKQLEERTEEKHDKVDSSNHNTKHSISN

Query:  SSQSDDKKIPRNKVCPCGSKKKYKACCGSVAASSSGKFIVNKTIDSKKTRKERKTGKKGGPAKVEVSSGSDGLPHDLGALFCI
        SSQSDDK+IPRNKVCPCGSKKKYKACCGSVAASSSGKFIVNKTIDSKKTRKERKTGKKGGPAKVEVSSGSDGLPHDLGAL CI
Subjt:  SSQSDDKKIPRNKVCPCGSKKKYKACCGSVAASSSGKFIVNKTIDSKKTRKERKTGKKGGPAKVEVSSGSDGLPHDLGALFCI

TrEMBL top hitse value%identityAlignment
A0A6J1GCW5 OTU domain-containing protein 3 isoform X32.0e-20497.39Show/hide
Query:  MVKTKHQKSNPKRHPQNKKPGKSPDISQFRAQLDLLNLKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCESM
        MVKTKHQKSNPKRHPQNKKPGKSPDISQFRAQLDLLNLKIVQVTAD         DQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCESM
Subjt:  MVKTKHQKSNPKRHPQNKKPGKSPDISQFRAQLDLLNLKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCESM

Query:  EKDGTWAGHLELQAASLVTHSNICIHRMSSPRWYIRNFEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLITIKGDTVPSASSLQAKVFSTSSQKRGQ
        EKDGTWAGHLELQAASLVTHSNICIHRMSSPRWYIRNFEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLITIKGDTVPSASSLQAKVFSTSSQKRGQ
Subjt:  EKDGTWAGHLELQAASLVTHSNICIHRMSSPRWYIRNFEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLITIKGDTVPSASSLQAKVFSTSSQKRGQ

Query:  SAISLGNIKLVMAGSGCQNSKEVEKVLVQVDGDVDAAIEFLVAEQAAEEHKEPTESTLCHIDSSFGSDETKDCKQLEERTEEKHDKVDSSNHNTKHSISN
        SAISLGNIKLVMAGSGCQNSKEVEKVLVQVDGDVDAAIEFLVAEQAAEEHKEPTESTLCHIDSSFGSDETKDCKQLEERTEEKHDKVDSSNHNTKHSISN
Subjt:  SAISLGNIKLVMAGSGCQNSKEVEKVLVQVDGDVDAAIEFLVAEQAAEEHKEPTESTLCHIDSSFGSDETKDCKQLEERTEEKHDKVDSSNHNTKHSISN

Query:  SSQSDDKKIPRNKVCPCGSKKKYKACCGSVAASSSGKFIVNKTIDSKKTRKERKTGKKGGPAKVEVSSGSDGLPHDLGALFCI
        SSQSDDKKIPRNKVCPCGSKKKYKACCGSVAASSSGKFIVNKTIDSKKTRKERKTGKKGGPAKVEVSSGSDGLPHDLGAL CI
Subjt:  SSQSDDKKIPRNKVCPCGSKKKYKACCGSVAASSSGKFIVNKTIDSKKTRKERKTGKKGGPAKVEVSSGSDGLPHDLGALFCI

A0A6J1GDN7 OTU domain-containing protein 3 isoform X29.1e-21399.74Show/hide
Query:  MVKTKHQKSNPKRHPQNKKPGKSPDISQFRAQLDLLNLKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCESM
        MVKTKHQKSNPKRHPQNKKPGKSPDISQFRAQLDLLNLKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCESM
Subjt:  MVKTKHQKSNPKRHPQNKKPGKSPDISQFRAQLDLLNLKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCESM

Query:  EKDGTWAGHLELQAASLVTHSNICIHRMSSPRWYIRNFEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLITIKGDTVPSASSLQAKVFSTSSQKRGQ
        EKDGTWAGHLELQAASLVTHSNICIHRMSSPRWYIRNFEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLITIKGDTVPSASSLQAKVFSTSSQKRGQ
Subjt:  EKDGTWAGHLELQAASLVTHSNICIHRMSSPRWYIRNFEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLITIKGDTVPSASSLQAKVFSTSSQKRGQ

Query:  SAISLGNIKLVMAGSGCQNSKEVEKVLVQVDGDVDAAIEFLVAEQAAEEHKEPTESTLCHIDSSFGSDETKDCKQLEERTEEKHDKVDSSNHNTKHSISN
        SAISLGNIKLVMAGSGCQNSKEVEKVLVQVDGDVDAAIEFLVAEQAAEEHKEPTESTLCHIDSSFGSDETKDCKQLEERTEEKHDKVDSSNHNTKHSISN
Subjt:  SAISLGNIKLVMAGSGCQNSKEVEKVLVQVDGDVDAAIEFLVAEQAAEEHKEPTESTLCHIDSSFGSDETKDCKQLEERTEEKHDKVDSSNHNTKHSISN

Query:  SSQSDDKKIPRNKVCPCGSKKKYKACCGSVAASSSGKFIVNKTIDSKKTRKERKTGKKGGPAKVEVSSGSDGLPHDLGALFCI
        SSQSDDKKIPRNKVCPCGSKKKYKACCGSVAASSSGKFIVNKTIDSKKTRKERKTGKKGGPAKVEVSSGSDGLPHDLGAL CI
Subjt:  SSQSDDKKIPRNKVCPCGSKKKYKACCGSVAASSSGKFIVNKTIDSKKTRKERKTGKKGGPAKVEVSSGSDGLPHDLGALFCI

A0A6J1GDU7 OTU domain-containing protein 3 isoform X12.9e-21199.22Show/hide
Query:  MVKTKHQKSNPKRHPQNKKPGKSPDISQFRAQLDLLNLKIVQVTADGNCFF--RALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCE
        MVKTKHQKSNPKRHPQNKKPGKSPDISQFRAQLDLLNLKIVQVTADGNCFF  RALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCE
Subjt:  MVKTKHQKSNPKRHPQNKKPGKSPDISQFRAQLDLLNLKIVQVTADGNCFF--RALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCE

Query:  SMEKDGTWAGHLELQAASLVTHSNICIHRMSSPRWYIRNFEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLITIKGDTVPSASSLQAKVFSTSSQKR
        SMEKDGTWAGHLELQAASLVTHSNICIHRMSSPRWYIRNFEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLITIKGDTVPSASSLQAKVFSTSSQKR
Subjt:  SMEKDGTWAGHLELQAASLVTHSNICIHRMSSPRWYIRNFEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLITIKGDTVPSASSLQAKVFSTSSQKR

Query:  GQSAISLGNIKLVMAGSGCQNSKEVEKVLVQVDGDVDAAIEFLVAEQAAEEHKEPTESTLCHIDSSFGSDETKDCKQLEERTEEKHDKVDSSNHNTKHSI
        GQSAISLGNIKLVMAGSGCQNSKEVEKVLVQVDGDVDAAIEFLVAEQAAEEHKEPTESTLCHIDSSFGSDETKDCKQLEERTEEKHDKVDSSNHNTKHSI
Subjt:  GQSAISLGNIKLVMAGSGCQNSKEVEKVLVQVDGDVDAAIEFLVAEQAAEEHKEPTESTLCHIDSSFGSDETKDCKQLEERTEEKHDKVDSSNHNTKHSI

Query:  SNSSQSDDKKIPRNKVCPCGSKKKYKACCGSVAASSSGKFIVNKTIDSKKTRKERKTGKKGGPAKVEVSSGSDGLPHDLGALFCI
        SNSSQSDDKKIPRNKVCPCGSKKKYKACCGSVAASSSGKFIVNKTIDSKKTRKERKTGKKGGPAKVEVSSGSDGLPHDLGAL CI
Subjt:  SNSSQSDDKKIPRNKVCPCGSKKKYKACCGSVAASSSGKFIVNKTIDSKKTRKERKTGKKGGPAKVEVSSGSDGLPHDLGALFCI

A0A6J1IIX6 uncharacterized protein LOC111477872 isoform X13.2e-20294.88Show/hide
Query:  MVKTKHQKSNPKRHPQNKKPGKSPDISQFRAQLDLLNLKIVQVTADGNCFF--RALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCE
        MVKTKHQKSN KR PQNKKPGKSPDISQFRAQLDLLNLKIVQVTADGNCFF  RALADQLEGDQEEHEKYRKMVVQYILKNRG FEPFIEDDVPFDEYCE
Subjt:  MVKTKHQKSNPKRHPQNKKPGKSPDISQFRAQLDLLNLKIVQVTADGNCFF--RALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCE

Query:  SMEKDGTWAGHLELQAASLVTHSNICIHRMSSPRWYIRNFEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLITIKGDTVPSASSLQAKVFSTSSQKR
        SMEKDGTWAGHLELQAASLVTHSNICIHRMSSPRWYIRNFEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLITIKGDTVPS  S+QAKVFSTSSQKR
Subjt:  SMEKDGTWAGHLELQAASLVTHSNICIHRMSSPRWYIRNFEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLITIKGDTVPSASSLQAKVFSTSSQKR

Query:  GQSAISLGNIKLVMAGSGCQNSKEVEKVLVQVDGDVDAAIEFLVAEQAAEEHKEPTESTLCHIDSSFGSDETKDCKQLE------ERTEEKHDKVDSSNH
        GQSAISLGNIKLVMAGSGCQNS+EVEKVLVQVDGD+DAAIEFLVAEQA+EEHKEPTESTLCHIDSSFGSDETKDCKQLE      ERTEEKHDKVDSSNH
Subjt:  GQSAISLGNIKLVMAGSGCQNSKEVEKVLVQVDGDVDAAIEFLVAEQAAEEHKEPTESTLCHIDSSFGSDETKDCKQLE------ERTEEKHDKVDSSNH

Query:  NTKHSISNSSQSDDKKIPRNKVCPCGSKKKYKACCGSVAASSSGKFIVNKTIDSKKTRKERKTGKKGGPAKVEVSSGSDGLPHDLGALFCI
        NTKHSIS SSQSDDK+IPRNKVCPCGSKKKYKACCGSVAASSSGKFIVNKTIDSKKTRKERKTGKKGGPAKVEVSSGSDGLPHDLGAL CI
Subjt:  NTKHSISNSSQSDDKKIPRNKVCPCGSKKKYKACCGSVAASSSGKFIVNKTIDSKKTRKERKTGKKGGPAKVEVSSGSDGLPHDLGALFCI

A0A6J1IQF8 uncharacterized protein LOC111477872 isoform X21.0e-20395.37Show/hide
Query:  MVKTKHQKSNPKRHPQNKKPGKSPDISQFRAQLDLLNLKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCESM
        MVKTKHQKSN KR PQNKKPGKSPDISQFRAQLDLLNLKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRG FEPFIEDDVPFDEYCESM
Subjt:  MVKTKHQKSNPKRHPQNKKPGKSPDISQFRAQLDLLNLKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCESM

Query:  EKDGTWAGHLELQAASLVTHSNICIHRMSSPRWYIRNFEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLITIKGDTVPSASSLQAKVFSTSSQKRGQ
        EKDGTWAGHLELQAASLVTHSNICIHRMSSPRWYIRNFEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLITIKGDTVPS  S+QAKVFSTSSQKRGQ
Subjt:  EKDGTWAGHLELQAASLVTHSNICIHRMSSPRWYIRNFEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLITIKGDTVPSASSLQAKVFSTSSQKRGQ

Query:  SAISLGNIKLVMAGSGCQNSKEVEKVLVQVDGDVDAAIEFLVAEQAAEEHKEPTESTLCHIDSSFGSDETKDCKQLE------ERTEEKHDKVDSSNHNT
        SAISLGNIKLVMAGSGCQNS+EVEKVLVQVDGD+DAAIEFLVAEQA+EEHKEPTESTLCHIDSSFGSDETKDCKQLE      ERTEEKHDKVDSSNHNT
Subjt:  SAISLGNIKLVMAGSGCQNSKEVEKVLVQVDGDVDAAIEFLVAEQAAEEHKEPTESTLCHIDSSFGSDETKDCKQLE------ERTEEKHDKVDSSNHNT

Query:  KHSISNSSQSDDKKIPRNKVCPCGSKKKYKACCGSVAASSSGKFIVNKTIDSKKTRKERKTGKKGGPAKVEVSSGSDGLPHDLGALFCI
        KHSIS SSQSDDK+IPRNKVCPCGSKKKYKACCGSVAASSSGKFIVNKTIDSKKTRKERKTGKKGGPAKVEVSSGSDGLPHDLGAL CI
Subjt:  KHSISNSSQSDDKKIPRNKVCPCGSKKKYKACCGSVAASSSGKFIVNKTIDSKKTRKERKTGKKGGPAKVEVSSGSDGLPHDLGALFCI

SwissProt top hitse value%identityAlignment
B1AZ99 OTU domain-containing protein 31.7e-3039.88Show/hide
Query:  KRHPQNKKPGKSPDISQFRAQLDLLNLKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCESMEKDGTWAGHLE
        +  P     G   +   F  QL  L LK+ +V  DGNC FRAL DQLEG    H K+R+  V Y+++ R  FEPF+EDD+PF+++  S+ K GT+AG+  
Subjt:  KRHPQNKKPGKSPDISQFRAQLDLLNLKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCESMEKDGTWAGHLE

Query:  LQAASLVTHSNICIHRMSSPRWYIRNFEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLIT
        + A +     N+ IH++++P W IR  +    R +H++Y   EHY+SVR   D    PA L+T
Subjt:  LQAASLVTHSNICIHRMSSPRWYIRNFEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLIT

F4K3M6 OVARIAN TUMOR DOMAIN-containing deubiquitinating enzyme 71.3e-10757.89Show/hide
Query:  MVKTKHQKSNPKRHPQNKKPGKSPDISQFRAQLDLLNLKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCESM
        M KTK QKS PK+ P ++K GK  D+SQFRAQLD L LKI+QVTADGNCFFRA+ADQLEG+++EH KYR M+V YI+KNR  FEPFIEDDVPF++YC++M
Subjt:  MVKTKHQKSNPKRHPQNKKPGKSPDISQFRAQLDLLNLKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCESM

Query:  EKDGTWAGHLELQAASLVTHSNICIHRMSSPRWYIRNFEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLITIKGDTVPSASSLQAKVF-STSSQKRG
        + DGTWAG++ELQAASLVT SNICIHR  SPRWYIRNFED   RM+HLSYHD EHYNSVR KED C GPAR + I+ D   SA+S QAK   S S  K  
Subjt:  EKDGTWAGHLELQAASLVTHSNICIHRMSSPRWYIRNFEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLITIKGDTVPSASSLQAKVF-STSSQKRG

Query:  QSAISLGNIKLVMAGSGCQNSKEVEKVLVQVDGDVDAAIEFLVAEQAAE---EHKEPTESTLCHIDSSFGSDETKDCKQLEERTEEKHDKVDSSNHNTKH
        +  ++ G IK+VM+GS C N+++ E+VL+QV+GDVDAAIEFL+A+Q  E   E+   T S    I+    SD   +    E+  EE  ++  +S +N++ 
Subjt:  QSAISLGNIKLVMAGSGCQNSKEVEKVLVQVDGDVDAAIEFLVAEQAAE---EHKEPTESTLCHIDSSFGSDETKDCKQLEERTEEKHDKVDSSNHNTKH

Query:  -SISNSSQSDDKKIPRNKVCPCGSKKKYKACCGSVAASSSGKFIVNKTIDSKKTRKERKTG
             ++Q+DDKKIPRNK CPCGSKKKYK+CCG+    SS K +V++T++SKK RK  + G
Subjt:  -SISNSSQSDDKKIPRNKVCPCGSKKKYKACCGSVAASSSGKFIVNKTIDSKKTRKERKTG

Q3U2S4 OTU domain-containing protein 53.2e-1332.82Show/hide
Query:  RAQLDLLNLKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCESMEKDGTWAGHLELQAASLVTHSNICIHRMS
        +A  D     I Q+  DG C FRA+ADQ+ GDQ+ HE  RK  + Y++KN   F  ++ +D  F  Y     K+     H+E+QA + + +  + +++ S
Subjt:  RAQLDLLNLKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCESMEKDGTWAGHLELQAASLVTHSNICIHRMS

Query:  S-PRWYIRNFEDQEARMVHLSYHDEEHYNSV
        + P          E   + +SYH   HYNSV
Subjt:  S-PRWYIRNFEDQEARMVHLSYHDEEHYNSV

Q5T2D3 OTU domain-containing protein 31.3e-2729.3Show/hide
Query:  GKSPDISQFRAQLDLLNLKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCESMEKDGTWAGHLELQAASLVTH
        G   +   F  QL  L LK+ +V  DGNC FRAL DQLEG    H K+R+  V Y++K R  FEPF+EDD+PF+++  S+ K GT+AG+  + A +    
Subjt:  GKSPDISQFRAQLDLLNLKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCESMEKDGTWAGHLELQAASLVTH

Query:  SNICIHRMSSPRWYIRNFEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLIT----IKGDTVPSASSLQAKVFSTSSQKRGQSAISLGNIKLVMAGSG
         N+ IH++++P W IR  E    R +H++Y   EHY+SVR   D    PA L T    +  D       ++ K   +    R +   +   ++ V   +G
Subjt:  SNICIHRMSSPRWYIRNFEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLIT----IKGDTVPSASSLQAKVFSTSSQKRGQSAISLGNIKLVMAGSG

Query:  CQNSKEVEKVLVQVDGDVDAAIEFLV-----AEQAAEEHKEPTESTLCHIDSSFGSDETKDCKQLEERTEEKHDKVDSSNHN---TKHSISNSSQSDDKK
        C +   + + L   + ++++AI  ++         AEE+ EP+   L            K C  L E          +   N   T+++ + +S S++ K
Subjt:  CQNSKEVEKVLVQVDGDVDAAIEFLV-----AEQAAEEHKEPTESTLCHIDSSFGSDETKDCKQLEERTEEKHDKVDSSNHN---TKHSISNSSQSDDKK

Query:  IPRNKVCPCGSKKK
          +N++    +K++
Subjt:  IPRNKVCPCGSKKK

Q9LZF7 OVARIAN TUMOR DOMAIN-containing deubiquitinating enzyme 106.4e-1431.47Show/hide
Query:  DISQFRAQLDLLNLKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCESMEKDGTWAGHLELQAASLVTHSNIC
        D  + R +L++ +   V+V  DGNC FRALADQL    + H+  R+ +V+ +     +++ ++  D  F +Y   M + G W  H+ LQAA+      I 
Subjt:  DISQFRAQLDLLNLKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCESMEKDGTWAGHLELQAASLVTHSNIC

Query:  IHRMSSPRWYIRNF-EDQEAR-MVHLSYHDEEHYNSVRLKEDT
        +        YI      QE++ ++ LS+  E HYN++ L  DT
Subjt:  IHRMSSPRWYIRNF-EDQEAR-MVHLSYHDEEHYNSVRLKEDT

Arabidopsis top hitse value%identityAlignment
AT3G22260.1 Cysteine proteinases superfamily protein1.1e-1332.67Show/hide
Query:  PDISQFRAQLDLLN-------LKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCESMEKDGTWAGHLELQAAS
        PDI+      +LL+       L  +Q+  DGNC FRALADQL  + + H+  RK VV+ + + R  +E ++   + +  Y   M+K G W  H+ LQAA+
Subjt:  PDISQFRAQLDLLN-------LKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCESMEKDGTWAGHLELQAAS

Query:  LVTHSNICIHRMSSPRWYIRNFEDQE--ARMVHLSYHDEEHYNSVRLKED
            + IC+      + YI      +   R   LS+  E HYNS+    D
Subjt:  LVTHSNICIHRMSSPRWYIRNFEDQE--ARMVHLSYHDEEHYNSVRLKED

AT5G03330.1 Cysteine proteinases superfamily protein4.6e-1531.47Show/hide
Query:  DISQFRAQLDLLNLKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCESMEKDGTWAGHLELQAASLVTHSNIC
        D  + R +L++ +   V+V  DGNC FRALADQL    + H+  R+ +V+ +     +++ ++  D  F +Y   M + G W  H+ LQAA+      I 
Subjt:  DISQFRAQLDLLNLKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCESMEKDGTWAGHLELQAASLVTHSNIC

Query:  IHRMSSPRWYIRNF-EDQEAR-MVHLSYHDEEHYNSVRLKEDT
        +        YI      QE++ ++ LS+  E HYN++ L  DT
Subjt:  IHRMSSPRWYIRNF-EDQEAR-MVHLSYHDEEHYNSVRLKEDT

AT5G03330.2 Cysteine proteinases superfamily protein4.6e-1531.47Show/hide
Query:  DISQFRAQLDLLNLKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCESMEKDGTWAGHLELQAASLVTHSNIC
        D  + R +L++ +   V+V  DGNC FRALADQL    + H+  R+ +V+ +     +++ ++  D  F +Y   M + G W  H+ LQAA+      I 
Subjt:  DISQFRAQLDLLNLKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCESMEKDGTWAGHLELQAASLVTHSNIC

Query:  IHRMSSPRWYIRNF-EDQEAR-MVHLSYHDEEHYNSVRLKEDT
        +        YI      QE++ ++ LS+  E HYN++ L  DT
Subjt:  IHRMSSPRWYIRNF-EDQEAR-MVHLSYHDEEHYNSVRLKEDT

AT5G67170.1 SEC-C motif-containing protein / OTU-like cysteine protease family protein9.3e-10957.89Show/hide
Query:  MVKTKHQKSNPKRHPQNKKPGKSPDISQFRAQLDLLNLKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCESM
        M KTK QKS PK+ P ++K GK  D+SQFRAQLD L LKI+QVTADGNCFFRA+ADQLEG+++EH KYR M+V YI+KNR  FEPFIEDDVPF++YC++M
Subjt:  MVKTKHQKSNPKRHPQNKKPGKSPDISQFRAQLDLLNLKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCESM

Query:  EKDGTWAGHLELQAASLVTHSNICIHRMSSPRWYIRNFEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLITIKGDTVPSASSLQAKVF-STSSQKRG
        + DGTWAG++ELQAASLVT SNICIHR  SPRWYIRNFED   RM+HLSYHD EHYNSVR KED C GPAR + I+ D   SA+S QAK   S S  K  
Subjt:  EKDGTWAGHLELQAASLVTHSNICIHRMSSPRWYIRNFEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLITIKGDTVPSASSLQAKVF-STSSQKRG

Query:  QSAISLGNIKLVMAGSGCQNSKEVEKVLVQVDGDVDAAIEFLVAEQAAE---EHKEPTESTLCHIDSSFGSDETKDCKQLEERTEEKHDKVDSSNHNTKH
        +  ++ G IK+VM+GS C N+++ E+VL+QV+GDVDAAIEFL+A+Q  E   E+   T S    I+    SD   +    E+  EE  ++  +S +N++ 
Subjt:  QSAISLGNIKLVMAGSGCQNSKEVEKVLVQVDGDVDAAIEFLVAEQAAE---EHKEPTESTLCHIDSSFGSDETKDCKQLEERTEEKHDKVDSSNHNTKH

Query:  -SISNSSQSDDKKIPRNKVCPCGSKKKYKACCGSVAASSSGKFIVNKTIDSKKTRKERKTG
             ++Q+DDKKIPRNK CPCGSKKKYK+CCG+    SS K +V++T++SKK RK  + G
Subjt:  -SISNSSQSDDKKIPRNKVCPCGSKKKYKACCGSVAASSSGKFIVNKTIDSKKTRKERKTG

AT5G67170.2 SEC-C motif-containing protein / OTU-like cysteine protease family protein4.6e-10857.62Show/hide
Query:  MVKTKHQKSNPKRHPQNKKPGKSPDISQFRAQLDLLNLKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCESM
        M KTK QKS PK+ P  +  GK  D+SQFRAQLD L LKI+QVTADGNCFFRA+ADQLEG+++EH KYR M+V YI+KNR  FEPFIEDDVPF++YC++M
Subjt:  MVKTKHQKSNPKRHPQNKKPGKSPDISQFRAQLDLLNLKIVQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCESM

Query:  EKDGTWAGHLELQAASLVTHSNICIHRMSSPRWYIRNFEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLITIKGDTVPSASSLQAKVF-STSSQKRG
        + DGTWAG++ELQAASLVT SNICIHR  SPRWYIRNFED   RM+HLSYHD EHYNSVR KED C GPAR + I+ D   SA+S QAK   S S  K  
Subjt:  EKDGTWAGHLELQAASLVTHSNICIHRMSSPRWYIRNFEDQEARMVHLSYHDEEHYNSVRLKEDTCAGPARLITIKGDTVPSASSLQAKVF-STSSQKRG

Query:  QSAISLGNIKLVMAGSGCQNSKEVEKVLVQVDGDVDAAIEFLVAEQAAE---EHKEPTESTLCHIDSSFGSDETKDCKQLEERTEEKHDKVDSSNHNTKH
        +  ++ G IK+VM+GS C N+++ E+VL+QV+GDVDAAIEFL+A+Q  E   E+   T S    I+    SD   +    E+  EE  ++  +S +N++ 
Subjt:  QSAISLGNIKLVMAGSGCQNSKEVEKVLVQVDGDVDAAIEFLVAEQAAE---EHKEPTESTLCHIDSSFGSDETKDCKQLEERTEEKHDKVDSSNHNTKH

Query:  -SISNSSQSDDKKIPRNKVCPCGSKKKYKACCGSVAASSSGKFIVNKTIDSKKTRKERKTG
             ++Q+DDKKIPRNK CPCGSKKKYK+CCG+    SS K +V++T++SKK RK  + G
Subjt:  -SISNSSQSDDKKIPRNKVCPCGSKKKYKACCGSVAASSSGKFIVNKTIDSKKTRKERKTG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGAGAACTAAGAAACGGGAGGGATCGGGGTATCTCCGTTTGAACAGCGGCGCGCAAGTCAGGCTCAGTAACGAACAATTCTTGGAACATAGGCCGCAGCCT
GAGCCGTTTTTTGATATCCAACACAACGAAAACACAGCCGTTGCTGTGAGTCCTCAAGCAGCGGAAAGCGTTGTTCAGGATTTTGTGGGAATGGTTAAAACTAAG
CACCAAAAGTCCAATCCTAAGCGACACCCGCAAAACAAAAAGCCTGGAAAGTCACCTGATATTTCGCAGTTTCGTGCTCAGCTTGACTTGTTGAACCTTAAAATT
GTACAAGTGACCGCAGATGGTAATTGTTTTTTCAGGGCCCTTGCAGATCAGTTAGAAGGGGATCAAGAGGAACACGAAAAGTACCGGAAGATGGTCGTACAATAT
ATTTTAAAAAACCGTGGAGCATTTGAGCCATTTATTGAGGATGATGTCCCATTTGACGAATATTGTGAGTCCATGGAAAAGGATGGTACCTGGGCTGGACATCTG
GAATTGCAGGCTGCTTCTCTTGTTACTCATAGTAATATATGCATTCACCGGATGTCATCACCTCGTTGGTACATACGAAATTTCGAGGATCAAGAAGCTCGTATG
GTCCACTTATCTTATCACGACGAGGAACATTACAATAGTGTGCGATTGAAGGAAGACACATGTGCTGGCCCAGCCAGGCTAATTACAATCAAAGGTGATACTGTT
CCCTCAGCAAGCTCACTTCAAGCAAAAGTTTTCTCTACGAGTTCTCAAAAGAGAGGTCAAAGTGCTATTAGTCTTGGAAATATCAAGTTAGTTATGGCAGGTAGT
GGTTGTCAAAATTCTAAAGAAGTTGAAAAGGTTTTGGTCCAAGTCGATGGGGATGTTGATGCTGCAATAGAGTTTCTTGTGGCAGAACAAGCAGCAGAAGAACAC
AAAGAGCCAACTGAATCAACTTTGTGTCATATAGATTCTTCTTTTGGTAGCGATGAAACAAAAGATTGTAAGCAATTGGAAGAGCGAACAGAAGAGAAGCACGAC
AAAGTTGATTCATCTAATCATAACACTAAACATTCTATTAGCAACAGTTCTCAATCAGACGACAAGAAGATCCCAAGGAATAAAGTCTGCCCATGTGGTTCAAAA
AAGAAATATAAAGCTTGTTGTGGATCAGTTGCTGCCAGTTCGTCTGGCAAGTTTATAGTGAACAAAACTATCGACTCTAAGAAGACTAGAAAGGAAAGGAAAACG
GGCAAGAAAGGTGGACCTGCTAAAGTTGAAGTGTCCTCTGGATCTGATGGATTGCCACATGACTTGGGAGCTCTTTTTTGCATTGAGAACAGCGTTCTATTTTGT
TTCAGGAGGAAGCATTTTAGCTGCAGCTTAAAGCGAGTTTGGAATCACAGATGCTGTAGTCTATCGCTGATTGGTGGTGGGATTTTCCCACTAGAATACCCCAGT
GGTTATTGGGAAAATGAAGGGCGGTTGTTTCCCCAACGTACGTCTATGCAAGGCTGGATCTTTGTTGAAGCCAACTGGACTTGGAAGTATAAATATAGAAGCAGC
AGCTCACAAAATCTGTCAGCCTATGATCAATGTCGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGATGAGAACTAAGAAACGGGAGGGATCGGGGTATCTCCGTTTGAACAGCGGCGCGCAAGTCAGGCTCAGTAACGAACAATTCTTGGAACATAGGCCGCAGCCT
GAGCCGTTTTTTGATATCCAACACAACGAAAACACAGCCGTTGCTGTGAGTCCTCAAGCAGCGGAAAGCGTTGTTCAGGATTTTGTGGGAATGGTTAAAACTAAG
CACCAAAAGTCCAATCCTAAGCGACACCCGCAAAACAAAAAGCCTGGAAAGTCACCTGATATTTCGCAGTTTCGTGCTCAGCTTGACTTGTTGAACCTTAAAATT
GTACAAGTGACCGCAGATGGTAATTGTTTTTTCAGGGCCCTTGCAGATCAGTTAGAAGGGGATCAAGAGGAACACGAAAAGTACCGGAAGATGGTCGTACAATAT
ATTTTAAAAAACCGTGGAGCATTTGAGCCATTTATTGAGGATGATGTCCCATTTGACGAATATTGTGAGTCCATGGAAAAGGATGGTACCTGGGCTGGACATCTG
GAATTGCAGGCTGCTTCTCTTGTTACTCATAGTAATATATGCATTCACCGGATGTCATCACCTCGTTGGTACATACGAAATTTCGAGGATCAAGAAGCTCGTATG
GTCCACTTATCTTATCACGACGAGGAACATTACAATAGTGTGCGATTGAAGGAAGACACATGTGCTGGCCCAGCCAGGCTAATTACAATCAAAGGTGATACTGTT
CCCTCAGCAAGCTCACTTCAAGCAAAAGTTTTCTCTACGAGTTCTCAAAAGAGAGGTCAAAGTGCTATTAGTCTTGGAAATATCAAGTTAGTTATGGCAGGTAGT
GGTTGTCAAAATTCTAAAGAAGTTGAAAAGGTTTTGGTCCAAGTCGATGGGGATGTTGATGCTGCAATAGAGTTTCTTGTGGCAGAACAAGCAGCAGAAGAACAC
AAAGAGCCAACTGAATCAACTTTGTGTCATATAGATTCTTCTTTTGGTAGCGATGAAACAAAAGATTGTAAGCAATTGGAAGAGCGAACAGAAGAGAAGCACGAC
AAAGTTGATTCATCTAATCATAACACTAAACATTCTATTAGCAACAGTTCTCAATCAGACGACAAGAAGATCCCAAGGAATAAAGTCTGCCCATGTGGTTCAAAA
AAGAAATATAAAGCTTGTTGTGGATCAGTTGCTGCCAGTTCGTCTGGCAAGTTTATAGTGAACAAAACTATCGACTCTAAGAAGACTAGAAAGGAAAGGAAAACG
GGCAAGAAAGGTGGACCTGCTAAAGTTGAAGTGTCCTCTGGATCTGATGGATTGCCACATGACTTGGGAGCTCTTTTTTGCATTGAGAACAGCGTTCTATTTTGT
TTCAGGAGGAAGCATTTTAGCTGCAGCTTAAAGCGAGTTTGGAATCACAGATGCTGTAGTCTATCGCTGATTGGTGGTGGGATTTTCCCACTAGAATACCCCAGT
GGTTATTGGGAAAATGAAGGGCGGTTGTTTCCCCAACGTACGTCTATGCAAGGCTGGATCTTTGTTGAAGCCAACTGGACTTGGAAGTATAAATATAGAAGCAGC
AGCTCACAAAATCTGTCAGCCTATGATCAATGTCGTTAGTTTATAGAAATGTAAAATGGAACAAATGAAATTAGCACCCTTTATATGCACACAAAATATAATCGC
AAGTTCTCTTGTAGAAGAATCATGGAAGTGGCAGATTTGAGAAAATGCAAAGACTTAATTGTATCAACCAAGAAAAATTTGATGAGATTTAGCTAAGATTACATG
TTTTCCCTAGTGTCTTGGACTATTGAGGAAGAAAAACATTTCGAAATTTGTTTGTTATGCACTTGCGTTCTGTAGTATTCGTGTAATTGCCTGCCGCGTGATCTT
TAAGTGGCTCCGTTATTCTTGCATTCAATTGAATTGTATTGAGCTATGATTCACAGGTCGTTTTTACAGTTCCCTTGGTGTATAAACATTCCCCTTGGTTTCTTG
GGTCACCGAGCCTCCAAGAAAAAGTATTGCTCTTTTACTCTTTCCAGGTTTATAGGGCTCCTGTAGAGCTTGATTTCTAAAACTTGGCGAGTCTTTGGAACCTAG
AAGACGCTACAGAAATCCTTTGGCTGGGCAAGAACTCTCTTCTATGAAAAGGCGGCATGTTCAGCATGCTGGGCTTGAAATATTTATTTCAAGCACGTGAGACAG
CCTTAAGATCCACCATGTCGGACCAATAAGGAAGCTGCCCTTTGCCTCCCTACAATAAGTTTGCTGTAAAGAATATAGTGCACGTATATGTCCGTCCATTCAGTA
ACAGATGTCAAAATTGCATCTTTTCGGGACCACAAAGTGATTGTGCAGTACTTGCTCTCGCTCTCAACCAATTCTTGGAAGGTCTAATCTGGGGAGAGAATCCTG
ATGATGCTGCAGCAGTTGCATGAACTTGCGATGGCCCCGATTCAACAACTTGATTCTGATGATCACTTAGAGATAGTGATAGATCACTTCCTATTATTCGGCCTC
CATTACTGCTGCCCGGAAACTGAATCTTCAACAACAATAACAACAAATCAAAATTTAAAAAACCAACACTTTTTTCTTCTCTAAATTATTAATTACACGTACTTG
ATAATTGAAGGTTGGATGATTTTGATTGTGAAGCAATATCTGGGAGTGGGGCTGATCTACACCACCCGCCTGCCGCCTGAATTGCCCATGTCTTTGCATTTCAAA
GGAAATTGGAGATTGAATGTGTGTTTGGCTTAACAGCTGCATAGTTTCACACTCCAGCAAATTTAGCTGCGCATGATACACATGCCACAAATTCGTGAACACCCA
TATTTCCCACGAGAAAAAAAAAAACATTTTTTTAAGGAAATTTGAAAGAACCTGCGGAGGCCTAAATCCTTGACTACGTTG
Protein sequenceShow/hide protein sequence
MMRTKKREGSGYLRLNSGAQVRLSNEQFLEHRPQPEPFFDIQHNENTAVAVSPQAAESVVQDFVGMVKTKHQKSNPKRHPQNKKPGKSPDISQFRAQLDLLNLKI
VQVTADGNCFFRALADQLEGDQEEHEKYRKMVVQYILKNRGAFEPFIEDDVPFDEYCESMEKDGTWAGHLELQAASLVTHSNICIHRMSSPRWYIRNFEDQEARM
VHLSYHDEEHYNSVRLKEDTCAGPARLITIKGDTVPSASSLQAKVFSTSSQKRGQSAISLGNIKLVMAGSGCQNSKEVEKVLVQVDGDVDAAIEFLVAEQAAEEH
KEPTESTLCHIDSSFGSDETKDCKQLEERTEEKHDKVDSSNHNTKHSISNSSQSDDKKIPRNKVCPCGSKKKYKACCGSVAASSSGKFIVNKTIDSKKTRKERKT
GKKGGPAKVEVSSGSDGLPHDLGALFCIENSVLFCFRRKHFSCSLKRVWNHRCCSLSLIGGGIFPLEYPSGYWENEGRLFPQRTSMQGWIFVEANWTWKYKYRSS
SSQNLSAYDQCR