; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0013074 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0013074
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionOTU domain-containing protein 3 isoform X1
Genome locationchr1:47177566..47182145
RNA-Seq ExpressionLag0013074
SyntenyLag0013074
Gene Ontology termsNA
InterPro domainsIPR003323 - OTU domain
IPR004027 - SEC-C motif
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6604170.1 OVARIAN TUMOR DOMAIN-containing deubiquitinating enzyme 7, partial [Cucurbita argyrosperma subsp. sororia]1.5e-17387.19Show/hide
Query:  KNKKPGKSPDISQFRTQLDLLGLQIVQVTADGNCFFRALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQAA
        +NKKPGKSPDISQFR QLDLL L+IVQVTADGNCFFRALADQL+GDQEEH KYRKMVVQYILKNR  FEPFIEDDVPF+EYCESME DGTWAGHLELQAA
Subjt:  KNKKPGKSPDISQFRTQLDLLGLQIVQVTADGNCFFRALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQAA

Query:  SLVTHSNICIHRMSSPRWYIRNFEDREACMVHLSYHDEEHYNSVRLKEDTCAGPARPIIIKGDAVPSASSLQAKVMATNSQKRGGSAITPGNVKLVMAGS
        SLVTHSNICIHRMSSPRWYIRNFED+EA MVHLSYHDEEHYNSVRLKEDTCAGPAR I IKGD VPSASSLQAKV +T+SQKRG +AI+ GN+KLVMAGS
Subjt:  SLVTHSNICIHRMSSPRWYIRNFEDREACMVHLSYHDEEHYNSVRLKEDTCAGPARPIIIKGDAVPSASSLQAKVMATNSQKRGGSAITPGNVKLVMAGS

Query:  GCQNPKKVEKVLVQVDGDVDSAIEFLVAEQAAEENEEPTESTLYHIDSSFGVDVKKGYEQLEDQTEEEYDKVDSSNHNTKHSNSSSSQSDDKRIPRNKVC
        GCQN K+VEKVLVQVDGDVD+AIEFLVAEQAAEE++EPTESTL HIDS FG D  K  +QLE++TEE+ DKVDSSNHNTKHS SSSSQSDDKRIPRNKVC
Subjt:  GCQNPKKVEKVLVQVDGDVDSAIEFLVAEQAAEENEEPTESTLYHIDSSFGVDVKKGYEQLEDQTEEEYDKVDSSNHNTKHSNSSSSQSDDKRIPRNKVC

Query:  PCGSKKKHKACCGSVAASSSGK---NKTIDSKKTRKERKMGKKGGPAKVEVSSASDGLPHDLGALCI
        PCGSKKK+KACCGSVAASSSGK   NKTIDSKKTRKERK GKKGGPAKVEVSS SDGLPHDLGALCI
Subjt:  PCGSKKKHKACCGSVAASSSGK---NKTIDSKKTRKERKMGKKGGPAKVEVSSASDGLPHDLGALCI

XP_022949748.1 OTU domain-containing protein 3 isoform X1 [Cucurbita moschata]2.6e-17386.99Show/hide
Query:  KNKKPGKSPDISQFRTQLDLLGLQIVQVTADGNCFF--RALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQ
        +NKKPGKSPDISQFR QLDLL L+IVQVTADGNCFF  RALADQL+GDQEEH KYRKMVVQYILKNR  FEPFIEDDVPFDEYCESME DGTWAGHLELQ
Subjt:  KNKKPGKSPDISQFRTQLDLLGLQIVQVTADGNCFF--RALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQ

Query:  AASLVTHSNICIHRMSSPRWYIRNFEDREACMVHLSYHDEEHYNSVRLKEDTCAGPARPIIIKGDAVPSASSLQAKVMATNSQKRGGSAITPGNVKLVMA
        AASLVTHSNICIHRMSSPRWYIRNFED+EA MVHLSYHDEEHYNSVRLKEDTCAGPAR I IKGD VPSASSLQAKV +T+SQKRG SAI+ GN+KLVMA
Subjt:  AASLVTHSNICIHRMSSPRWYIRNFEDREACMVHLSYHDEEHYNSVRLKEDTCAGPARPIIIKGDAVPSASSLQAKVMATNSQKRGGSAITPGNVKLVMA

Query:  GSGCQNPKKVEKVLVQVDGDVDSAIEFLVAEQAAEENEEPTESTLYHIDSSFGVDVKKGYEQLEDQTEEEYDKVDSSNHNTKHSNSSSSQSDDKRIPRNK
        GSGCQN K+VEKVLVQVDGDVD+AIEFLVAEQAAEE++EPTESTL HIDSSFG D  K  +QLE++TEE++DKVDSSNHNTKHS S+SSQSDDK+IPRNK
Subjt:  GSGCQNPKKVEKVLVQVDGDVDSAIEFLVAEQAAEENEEPTESTLYHIDSSFGVDVKKGYEQLEDQTEEEYDKVDSSNHNTKHSNSSSSQSDDKRIPRNK

Query:  VCPCGSKKKHKACCGSVAASSSGK---NKTIDSKKTRKERKMGKKGGPAKVEVSSASDGLPHDLGALCI
        VCPCGSKKK+KACCGSVAASSSGK   NKTIDSKKTRKERK GKKGGPAKVEVSS SDGLPHDLGALCI
Subjt:  VCPCGSKKKHKACCGSVAASSSGK---NKTIDSKKTRKERKMGKKGGPAKVEVSSASDGLPHDLGALCI

XP_022949750.1 OTU domain-containing protein 3 isoform X2 [Cucurbita moschata]8.1e-17587.47Show/hide
Query:  KNKKPGKSPDISQFRTQLDLLGLQIVQVTADGNCFFRALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQAA
        +NKKPGKSPDISQFR QLDLL L+IVQVTADGNCFFRALADQL+GDQEEH KYRKMVVQYILKNR  FEPFIEDDVPFDEYCESME DGTWAGHLELQAA
Subjt:  KNKKPGKSPDISQFRTQLDLLGLQIVQVTADGNCFFRALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQAA

Query:  SLVTHSNICIHRMSSPRWYIRNFEDREACMVHLSYHDEEHYNSVRLKEDTCAGPARPIIIKGDAVPSASSLQAKVMATNSQKRGGSAITPGNVKLVMAGS
        SLVTHSNICIHRMSSPRWYIRNFED+EA MVHLSYHDEEHYNSVRLKEDTCAGPAR I IKGD VPSASSLQAKV +T+SQKRG SAI+ GN+KLVMAGS
Subjt:  SLVTHSNICIHRMSSPRWYIRNFEDREACMVHLSYHDEEHYNSVRLKEDTCAGPARPIIIKGDAVPSASSLQAKVMATNSQKRGGSAITPGNVKLVMAGS

Query:  GCQNPKKVEKVLVQVDGDVDSAIEFLVAEQAAEENEEPTESTLYHIDSSFGVDVKKGYEQLEDQTEEEYDKVDSSNHNTKHSNSSSSQSDDKRIPRNKVC
        GCQN K+VEKVLVQVDGDVD+AIEFLVAEQAAEE++EPTESTL HIDSSFG D  K  +QLE++TEE++DKVDSSNHNTKHS S+SSQSDDK+IPRNKVC
Subjt:  GCQNPKKVEKVLVQVDGDVDSAIEFLVAEQAAEENEEPTESTLYHIDSSFGVDVKKGYEQLEDQTEEEYDKVDSSNHNTKHSNSSSSQSDDKRIPRNKVC

Query:  PCGSKKKHKACCGSVAASSSGK---NKTIDSKKTRKERKMGKKGGPAKVEVSSASDGLPHDLGALCI
        PCGSKKK+KACCGSVAASSSGK   NKTIDSKKTRKERK GKKGGPAKVEVSS SDGLPHDLGALCI
Subjt:  PCGSKKKHKACCGSVAASSSGK---NKTIDSKKTRKERKMGKKGGPAKVEVSSASDGLPHDLGALCI

XP_023543642.1 OTU domain-containing protein 3 isoform X1 [Cucurbita pepo subsp. pepo]2.0e-17387.26Show/hide
Query:  KNKKPGKSPDISQFRTQLDLLGLQIVQVTADGNCFF--RALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQ
        +NKKPGKSPDISQFR QLDLL L+IVQVTADGNCFF  RALADQL+GDQEEH KYRKMVVQYILKNR  FEPFIEDDVPFDEYCESME DGTWAGHLELQ
Subjt:  KNKKPGKSPDISQFRTQLDLLGLQIVQVTADGNCFF--RALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQ

Query:  AASLVTHSNICIHRMSSPRWYIRNFEDREACMVHLSYHDEEHYNSVRLKEDTCAGPARPIIIKGDAVPSASSLQAKVMATNSQKRGGSAITPGNVKLVMA
        AASLVTHSNICIHRMSSPRWYIRNFED+EA MVHLSYHDEEHYNSVRLKEDTCAGPAR I IKGD VPSASSLQAKV +T+SQKRG SAI+ GN+KLVMA
Subjt:  AASLVTHSNICIHRMSSPRWYIRNFEDREACMVHLSYHDEEHYNSVRLKEDTCAGPARPIIIKGDAVPSASSLQAKVMATNSQKRGGSAITPGNVKLVMA

Query:  GSGCQNPKKVEKVLVQVDGDVDSAIEFLVAEQAAEENEEPTESTLYHIDSSFGVDVKKGYEQLEDQTEEEYDKVDSSNHNTKHSNSSSSQSDDKRIPRNK
        GSGCQN K+VEKVLVQVDGDVD+AIEFLVAEQAAEE++EPTESTL HIDSSFG D  K  +QLE++TEE+++KVDSSNHNTKHS SSSSQSDDKRIPRNK
Subjt:  GSGCQNPKKVEKVLVQVDGDVDSAIEFLVAEQAAEENEEPTESTLYHIDSSFGVDVKKGYEQLEDQTEEEYDKVDSSNHNTKHSNSSSSQSDDKRIPRNK

Query:  VCPCGSKKKHKACCGSVAASSSGK---NKTIDSKKTRKERKMGKKGGPAKVEVSSASDGLPHDLGALCI
        VCPCGSKKK+KACCGSVAASSSGK   NKTIDSKKTRKERK GKKGGPAKVEVSS SDGLPHDLGALCI
Subjt:  VCPCGSKKKHKACCGSVAASSSGK---NKTIDSKKTRKERKMGKKGGPAKVEVSSASDGLPHDLGALCI

XP_023543646.1 OTU domain-containing protein 3 isoform X2 [Cucurbita pepo subsp. pepo]6.2e-17587.74Show/hide
Query:  KNKKPGKSPDISQFRTQLDLLGLQIVQVTADGNCFFRALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQAA
        +NKKPGKSPDISQFR QLDLL L+IVQVTADGNCFFRALADQL+GDQEEH KYRKMVVQYILKNR  FEPFIEDDVPFDEYCESME DGTWAGHLELQAA
Subjt:  KNKKPGKSPDISQFRTQLDLLGLQIVQVTADGNCFFRALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQAA

Query:  SLVTHSNICIHRMSSPRWYIRNFEDREACMVHLSYHDEEHYNSVRLKEDTCAGPARPIIIKGDAVPSASSLQAKVMATNSQKRGGSAITPGNVKLVMAGS
        SLVTHSNICIHRMSSPRWYIRNFED+EA MVHLSYHDEEHYNSVRLKEDTCAGPAR I IKGD VPSASSLQAKV +T+SQKRG SAI+ GN+KLVMAGS
Subjt:  SLVTHSNICIHRMSSPRWYIRNFEDREACMVHLSYHDEEHYNSVRLKEDTCAGPARPIIIKGDAVPSASSLQAKVMATNSQKRGGSAITPGNVKLVMAGS

Query:  GCQNPKKVEKVLVQVDGDVDSAIEFLVAEQAAEENEEPTESTLYHIDSSFGVDVKKGYEQLEDQTEEEYDKVDSSNHNTKHSNSSSSQSDDKRIPRNKVC
        GCQN K+VEKVLVQVDGDVD+AIEFLVAEQAAEE++EPTESTL HIDSSFG D  K  +QLE++TEE+++KVDSSNHNTKHS SSSSQSDDKRIPRNKVC
Subjt:  GCQNPKKVEKVLVQVDGDVDSAIEFLVAEQAAEENEEPTESTLYHIDSSFGVDVKKGYEQLEDQTEEEYDKVDSSNHNTKHSNSSSSQSDDKRIPRNKVC

Query:  PCGSKKKHKACCGSVAASSSGK---NKTIDSKKTRKERKMGKKGGPAKVEVSSASDGLPHDLGALCI
        PCGSKKK+KACCGSVAASSSGK   NKTIDSKKTRKERK GKKGGPAKVEVSS SDGLPHDLGALCI
Subjt:  PCGSKKKHKACCGSVAASSSGK---NKTIDSKKTRKERKMGKKGGPAKVEVSSASDGLPHDLGALCI

TrEMBL top hitse value%identityAlignment
A0A1S4DTN6 OTU domain-containing protein 3 isoform X34.2e-16984.47Show/hide
Query:  KNKKPGKSPDISQFRTQLDLLGLQIVQVTADGNCFFRALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQAA
        +NKKPGKSPDISQFR QLDLL LQIVQVTADGNCFFRALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYC+SME DGTWAGHLELQAA
Subjt:  KNKKPGKSPDISQFRTQLDLLGLQIVQVTADGNCFFRALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQAA

Query:  SLVTHSNICIHRMSSPRWYIRNFEDREACMVHLSYHDEEHYNSVRLKEDTCAGPARPIIIKGDAVPSASSLQAKVMATNSQKRGGSAITPGNVKLVMAGS
        SLVTH NICIHRMSSPRWYIRNFEDREA MVHLSYHDEEHYNSVR KEDTCAGPAR IIIKGD VPS  SLQ KV+ +NSQKRG +A +PGNVKLVMAGS
Subjt:  SLVTHSNICIHRMSSPRWYIRNFEDREACMVHLSYHDEEHYNSVRLKEDTCAGPARPIIIKGDAVPSASSLQAKVMATNSQKRGGSAITPGNVKLVMAGS

Query:  GCQNPKKVEKVLVQVDGDVDSAIEFLVAEQAAEENEEPTESTLYHIDSSFGVDVKKGYEQLEDQTEEEYDKVDSSNHNTKHSNSSSSQSDDKRIPRNKVC
        GCQN KKVEKVLVQV+GDVD+AIEFLVAEQAAEE+EEP+ESTL  +D SFG +  K YEQLE++ EE++ +VDSS+ NTKHSN SS Q DDKR+PRN++C
Subjt:  GCQNPKKVEKVLVQVDGDVDSAIEFLVAEQAAEENEEPTESTLYHIDSSFGVDVKKGYEQLEDQTEEEYDKVDSSNHNTKHSNSSSSQSDDKRIPRNKVC

Query:  PCGSKKKHKACCGSVAASSSGK---NKTIDSKKTRKERKMGKKGGPAKVEVSSASDGLPHDLGALCI
        PCGSKKKHKACCGSVAASSSGK   NKTIDSK+TRKERK  KKGGPAKVEVS+ SDGLPHDLGALCI
Subjt:  PCGSKKKHKACCGSVAASSSGK---NKTIDSKKTRKERKMGKKGGPAKVEVSSASDGLPHDLGALCI

A0A6J1GDN7 OTU domain-containing protein 3 isoform X23.9e-17587.47Show/hide
Query:  KNKKPGKSPDISQFRTQLDLLGLQIVQVTADGNCFFRALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQAA
        +NKKPGKSPDISQFR QLDLL L+IVQVTADGNCFFRALADQL+GDQEEH KYRKMVVQYILKNR  FEPFIEDDVPFDEYCESME DGTWAGHLELQAA
Subjt:  KNKKPGKSPDISQFRTQLDLLGLQIVQVTADGNCFFRALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQAA

Query:  SLVTHSNICIHRMSSPRWYIRNFEDREACMVHLSYHDEEHYNSVRLKEDTCAGPARPIIIKGDAVPSASSLQAKVMATNSQKRGGSAITPGNVKLVMAGS
        SLVTHSNICIHRMSSPRWYIRNFED+EA MVHLSYHDEEHYNSVRLKEDTCAGPAR I IKGD VPSASSLQAKV +T+SQKRG SAI+ GN+KLVMAGS
Subjt:  SLVTHSNICIHRMSSPRWYIRNFEDREACMVHLSYHDEEHYNSVRLKEDTCAGPARPIIIKGDAVPSASSLQAKVMATNSQKRGGSAITPGNVKLVMAGS

Query:  GCQNPKKVEKVLVQVDGDVDSAIEFLVAEQAAEENEEPTESTLYHIDSSFGVDVKKGYEQLEDQTEEEYDKVDSSNHNTKHSNSSSSQSDDKRIPRNKVC
        GCQN K+VEKVLVQVDGDVD+AIEFLVAEQAAEE++EPTESTL HIDSSFG D  K  +QLE++TEE++DKVDSSNHNTKHS S+SSQSDDK+IPRNKVC
Subjt:  GCQNPKKVEKVLVQVDGDVDSAIEFLVAEQAAEENEEPTESTLYHIDSSFGVDVKKGYEQLEDQTEEEYDKVDSSNHNTKHSNSSSSQSDDKRIPRNKVC

Query:  PCGSKKKHKACCGSVAASSSGK---NKTIDSKKTRKERKMGKKGGPAKVEVSSASDGLPHDLGALCI
        PCGSKKK+KACCGSVAASSSGK   NKTIDSKKTRKERK GKKGGPAKVEVSS SDGLPHDLGALCI
Subjt:  PCGSKKKHKACCGSVAASSSGK---NKTIDSKKTRKERKMGKKGGPAKVEVSSASDGLPHDLGALCI

A0A6J1GDU7 OTU domain-containing protein 3 isoform X11.3e-17386.99Show/hide
Query:  KNKKPGKSPDISQFRTQLDLLGLQIVQVTADGNCFF--RALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQ
        +NKKPGKSPDISQFR QLDLL L+IVQVTADGNCFF  RALADQL+GDQEEH KYRKMVVQYILKNR  FEPFIEDDVPFDEYCESME DGTWAGHLELQ
Subjt:  KNKKPGKSPDISQFRTQLDLLGLQIVQVTADGNCFF--RALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQ

Query:  AASLVTHSNICIHRMSSPRWYIRNFEDREACMVHLSYHDEEHYNSVRLKEDTCAGPARPIIIKGDAVPSASSLQAKVMATNSQKRGGSAITPGNVKLVMA
        AASLVTHSNICIHRMSSPRWYIRNFED+EA MVHLSYHDEEHYNSVRLKEDTCAGPAR I IKGD VPSASSLQAKV +T+SQKRG SAI+ GN+KLVMA
Subjt:  AASLVTHSNICIHRMSSPRWYIRNFEDREACMVHLSYHDEEHYNSVRLKEDTCAGPARPIIIKGDAVPSASSLQAKVMATNSQKRGGSAITPGNVKLVMA

Query:  GSGCQNPKKVEKVLVQVDGDVDSAIEFLVAEQAAEENEEPTESTLYHIDSSFGVDVKKGYEQLEDQTEEEYDKVDSSNHNTKHSNSSSSQSDDKRIPRNK
        GSGCQN K+VEKVLVQVDGDVD+AIEFLVAEQAAEE++EPTESTL HIDSSFG D  K  +QLE++TEE++DKVDSSNHNTKHS S+SSQSDDK+IPRNK
Subjt:  GSGCQNPKKVEKVLVQVDGDVDSAIEFLVAEQAAEENEEPTESTLYHIDSSFGVDVKKGYEQLEDQTEEEYDKVDSSNHNTKHSNSSSSQSDDKRIPRNK

Query:  VCPCGSKKKHKACCGSVAASSSGK---NKTIDSKKTRKERKMGKKGGPAKVEVSSASDGLPHDLGALCI
        VCPCGSKKK+KACCGSVAASSSGK   NKTIDSKKTRKERK GKKGGPAKVEVSS SDGLPHDLGALCI
Subjt:  VCPCGSKKKHKACCGSVAASSSGK---NKTIDSKKTRKERKMGKKGGPAKVEVSSASDGLPHDLGALCI

A0A6J1IIX6 uncharacterized protein LOC111477872 isoform X17.2e-16984.53Show/hide
Query:  KNKKPGKSPDISQFRTQLDLLGLQIVQVTADGNCFF--RALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQ
        +NKKPGKSPDISQFR QLDLL L+IVQVTADGNCFF  RALADQL+GDQEEH KYRKMVVQYILKNR  FEPFIEDDVPFDEYCESME DGTWAGHLELQ
Subjt:  KNKKPGKSPDISQFRTQLDLLGLQIVQVTADGNCFF--RALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQ

Query:  AASLVTHSNICIHRMSSPRWYIRNFEDREACMVHLSYHDEEHYNSVRLKEDTCAGPARPIIIKGDAVPSASSLQAKVMATNSQKRGGSAITPGNVKLVMA
        AASLVTHSNICIHRMSSPRWYIRNFED+EA MVHLSYHDEEHYNSVRLKEDTCAGPAR I IKGD VPS  S+QAKV +T+SQKRG SAI+ GN+KLVMA
Subjt:  AASLVTHSNICIHRMSSPRWYIRNFEDREACMVHLSYHDEEHYNSVRLKEDTCAGPARPIIIKGDAVPSASSLQAKVMATNSQKRGGSAITPGNVKLVMA

Query:  GSGCQNPKKVEKVLVQVDGDVDSAIEFLVAEQAAEENEEPTESTLYHIDSSFGVDVKKGYEQLED------QTEEEYDKVDSSNHNTKHSNSSSSQSDDK
        GSGCQN ++VEKVLVQVDGD+D+AIEFLVAEQA+EE++EPTESTL HIDSSFG D  K  +QLED      +TEE++DKVDSSNHNTKHS S SSQSDDK
Subjt:  GSGCQNPKKVEKVLVQVDGDVDSAIEFLVAEQAAEENEEPTESTLYHIDSSFGVDVKKGYEQLED------QTEEEYDKVDSSNHNTKHSNSSSSQSDDK

Query:  RIPRNKVCPCGSKKKHKACCGSVAASSSGK---NKTIDSKKTRKERKMGKKGGPAKVEVSSASDGLPHDLGALCI
        RIPRNKVCPCGSKKK+KACCGSVAASSSGK   NKTIDSKKTRKERK GKKGGPAKVEVSS SDGLPHDLGALCI
Subjt:  RIPRNKVCPCGSKKKHKACCGSVAASSSGK---NKTIDSKKTRKERKMGKKGGPAKVEVSSASDGLPHDLGALCI

A0A6J1IQF8 uncharacterized protein LOC111477872 isoform X22.2e-17084.99Show/hide
Query:  KNKKPGKSPDISQFRTQLDLLGLQIVQVTADGNCFFRALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQAA
        +NKKPGKSPDISQFR QLDLL L+IVQVTADGNCFFRALADQL+GDQEEH KYRKMVVQYILKNR  FEPFIEDDVPFDEYCESME DGTWAGHLELQAA
Subjt:  KNKKPGKSPDISQFRTQLDLLGLQIVQVTADGNCFFRALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQAA

Query:  SLVTHSNICIHRMSSPRWYIRNFEDREACMVHLSYHDEEHYNSVRLKEDTCAGPARPIIIKGDAVPSASSLQAKVMATNSQKRGGSAITPGNVKLVMAGS
        SLVTHSNICIHRMSSPRWYIRNFED+EA MVHLSYHDEEHYNSVRLKEDTCAGPAR I IKGD VPS  S+QAKV +T+SQKRG SAI+ GN+KLVMAGS
Subjt:  SLVTHSNICIHRMSSPRWYIRNFEDREACMVHLSYHDEEHYNSVRLKEDTCAGPARPIIIKGDAVPSASSLQAKVMATNSQKRGGSAITPGNVKLVMAGS

Query:  GCQNPKKVEKVLVQVDGDVDSAIEFLVAEQAAEENEEPTESTLYHIDSSFGVDVKKGYEQLED------QTEEEYDKVDSSNHNTKHSNSSSSQSDDKRI
        GCQN ++VEKVLVQVDGD+D+AIEFLVAEQA+EE++EPTESTL HIDSSFG D  K  +QLED      +TEE++DKVDSSNHNTKHS S SSQSDDKRI
Subjt:  GCQNPKKVEKVLVQVDGDVDSAIEFLVAEQAAEENEEPTESTLYHIDSSFGVDVKKGYEQLED------QTEEEYDKVDSSNHNTKHSNSSSSQSDDKRI

Query:  PRNKVCPCGSKKKHKACCGSVAASSSGK---NKTIDSKKTRKERKMGKKGGPAKVEVSSASDGLPHDLGALCI
        PRNKVCPCGSKKK+KACCGSVAASSSGK   NKTIDSKKTRKERK GKKGGPAKVEVSS SDGLPHDLGALCI
Subjt:  PRNKVCPCGSKKKHKACCGSVAASSSGK---NKTIDSKKTRKERKMGKKGGPAKVEVSSASDGLPHDLGALCI

SwissProt top hitse value%identityAlignment
B1AZ99 OTU domain-containing protein 31.5e-2526.05Show/hide
Query:  GKSPDISQFRTQLDLLGLQIVQVTADGNCFFRALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQAASLVTH
        G   +   F  QL  LGL++ +V  DGNC FRAL DQL+G    H K+R+  V Y+++ RE FEPF+EDD+PF+++  S+   GT+AG+  + A +    
Subjt:  GKSPDISQFRTQLDLLGLQIVQVTADGNCFFRALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQAASLVTH

Query:  SNICIHRMSSPRWYIRNFEDREACMVHLSYHDEEHYNSVRLKEDTCAGPARPI----IIKGDAVPSASSLQAKVMATNSQKRGGSAITPGNVKLVMAGSG
         N+ IH++++P W IR  +      +H++Y   EHY+SVR   D    PA  +    ++  D       ++ K +     K G        V  V + +G
Subjt:  SNICIHRMSSPRWYIRNFEDREACMVHLSYHDEEHYNSVRLKEDTCAGPARPI----IIKGDAVPSASSLQAKVMATNSQKRGGSAITPGNVKLVMAGSG

Query:  CQNPKKVEKVLVQVDGDVDSAIEFLVAEQAAEENEEPTESTLYHIDSSFGVDVKKGYEQLEDQTEEEYDKVDSSNHNTKHSNSSSSQSDDKRIPRNKVCP
        C +   + + L   + ++ SAI                 + L  ++   G D ++ +E   D+ ++     + +    + S +             +  P
Subjt:  CQNPKKVEKVLVQVDGDVDSAIEFLVAEQAAEENEEPTESTLYHIDSSFGVDVKKGYEQLEDQTEEEYDKVDSSNHNTKHSNSSSSQSDDKRIPRNKVCP

Query:  CGSKKKHKACCGSVAASSSGKNKTIDSKKTRKER
            K HK+    V      + + ++ KK ++ER
Subjt:  CGSKKKHKACCGSVAASSSGKNKTIDSKKTRKER

F4K3M6 OVARIAN TUMOR DOMAIN-containing deubiquitinating enzyme 71.3e-10354.35Show/hide
Query:  NKKPGKSPDISQFRTQLDLLGLQIVQVTADGNCFFRALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQAAS
        ++K GK  D+SQFR QLD LGL+I+QVTADGNCFFRA+ADQL+G+++EH KYR M+V YI+KNREMFEPFIEDDVPF++YC++M++DGTWAG++ELQAAS
Subjt:  NKKPGKSPDISQFRTQLDLLGLQIVQVTADGNCFFRALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQAAS

Query:  LVTHSNICIHRMSSPRWYIRNFEDREACMVHLSYHDEEHYNSVRLKEDTCAGPARPIIIKGDAVPSASSLQAKVMATNSQKRGGSA-ITPGNVKLVMAGS
        LVT SNICIHR  SPRWYIRNFED    M+HLSYHD EHYNSVR KED C GPARP++I+ DA  SA+S QAK   + S+ +     +  G +K+VM+GS
Subjt:  LVTHSNICIHRMSSPRWYIRNFEDREACMVHLSYHDEEHYNSVRLKEDTCAGPARPIIIKGDAVPSASSLQAKVMATNSQKRGGSA-ITPGNVKLVMAGS

Query:  GCQNPKKVEKVLVQVDGDVDSAIEFLVAEQAAE---ENEEPTESTLYHIDSSFGVDVKKGYEQLEDQTEEEYDKVDSSNHNTKHSNSS-SSQSDDKRIPR
         C N +K E+VL+QV+GDVD+AIEFL+A+Q  E   EN+  T S    I+     D     E  E   EE  ++  +S +N++   +  ++Q+DDK+IPR
Subjt:  GCQNPKKVEKVLVQVDGDVDSAIEFLVAEQAAE---ENEEPTESTLYHIDSSFGVDVKKGYEQLEDQTEEEYDKVDSSNHNTKHSNSS-SSQSDDKRIPR

Query:  NKVCPCGSKKKHKACCGSVAASSSGKNKTIDSKKTRKERKMGKKGGPAKVEVSSASDGLPHDLGALCI
        NK CPCGSKKK+K+CCG+    SS K     + +++K RK  ++G   +VE ++       D+GALCI
Subjt:  NKVCPCGSKKKHKACCGSVAASSSGKNKTIDSKKTRKERKMGKKGGPAKVEVSSASDGLPHDLGALCI

Q01804 OTU domain-containing protein 43.7e-1333.8Show/hide
Query:  GKSPDISQFRTQLDLLGLQIVQVTADGNCFFRALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQAASLVTH
        G   D +     L  LGL    V  DG+C FRA+A+Q+   Q  H + R   + Y+ +NRE FE FIE    F+EY + +EN   W G +E+ A SL+  
Subjt:  GKSPDISQFRTQLDLLGLQIVQVTADGNCFFRALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQAASLVTH

Query:  SNICIHR---MSSPRWYIRNFEDREACMVHLSYHDEEHYNSV
         +  I+R   +S  +    NF ++    V L + +  HY+ V
Subjt:  SNICIHR---MSSPRWYIRNFEDREACMVHLSYHDEEHYNSV

Q5T2D3 OTU domain-containing protein 35.9e-2729.47Show/hide
Query:  GKSPDISQFRTQLDLLGLQIVQVTADGNCFFRALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQAASLVTH
        G   +   F  QL  LGL++ +V  DGNC FRAL DQL+G    H K+R+  V Y++K RE FEPF+EDD+PF+++  S+   GT+AG+  + A +    
Subjt:  GKSPDISQFRTQLDLLGLQIVQVTADGNCFFRALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQAASLVTH

Query:  SNICIHRMSSPRWYIRNFEDREACMVHLSYHDEEHYNSVRLKEDTCAGPARP----IIIKGDAVPSASSLQAKVMATNSQKRGGSAITPGNVKLVMAGSG
         N+ IH++++P W IR  E      +H++Y   EHY+SVR   D    PA       ++  D       ++ K M +    R         V+ V   +G
Subjt:  SNICIHRMSSPRWYIRNFEDREACMVHLSYHDEEHYNSVRLKEDTCAGPARP----IIIKGDAVPSASSLQAKVMATNSQKRGGSAITPGNVKLVMAGSG

Query:  CQNPKKVEKVLVQVDGDVDSAIEFLV-----AEQAAEENEEPTESTLYHI-----DSSFGVDVKKGYEQLEDQTEEEYDKVDSSNHNTKHSNSSSSQSDD
        C +   + + L   + +++SAI  ++         AEEN EP+   L        +   G  +       E +TE    +   S  N  + N  +  ++ 
Subjt:  CQNPKKVEKVLVQVDGDVDSAIEFLV-----AEQAAEENEEPTESTLYHI-----DSSFGVDVKKGYEQLEDQTEEEYDKVDSSNHNTKHSNSSSSQSDD

Query:  KR
        +R
Subjt:  KR

Q9LZF7 OVARIAN TUMOR DOMAIN-containing deubiquitinating enzyme 104.8e-1330.07Show/hide
Query:  DISQFRTQLDLLGLQIVQVTADGNCFFRALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQAASLVTHSNIC
        D  + R +L++     V+V  DGNC FRALADQL    + H   R+ +V+ +    + ++ ++  D  F +Y   M   G W  H+ LQAA+      I 
Subjt:  DISQFRTQLDLLGLQIVQVTADGNCFFRALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQAASLVTHSNIC

Query:  IHRMSSPRWYIRNFEDREAC--MVHLSYHDEEHYNSVRLKEDT
        +        YI      +    ++ LS+  E HYN++ L  DT
Subjt:  IHRMSSPRWYIRNFEDREAC--MVHLSYHDEEHYNSVRLKEDT

Arabidopsis top hitse value%identityAlignment
AT3G22260.1 Cysteine proteinases superfamily protein1.3e-1332Show/hide
Query:  PDISQFRTQLDLL-------GLQIVQVTADGNCFFRALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQAAS
        PDI+      +LL       GL  +Q+  DGNC FRALADQL  + + H   RK VV+ + + R+++E ++   + +  Y   M+  G W  H+ LQAA+
Subjt:  PDISQFRTQLDLL-------GLQIVQVTADGNCFFRALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQAAS

Query:  LVTHSNICIHRMSSPRWYIRNFEDREACM--VHLSYHDEEHYNSVRLKED
            + IC+      + YI      +  +    LS+  E HYNS+    D
Subjt:  LVTHSNICIHRMSSPRWYIRNFEDREACM--VHLSYHDEEHYNSVRLKED

AT5G03330.1 Cysteine proteinases superfamily protein3.4e-1430.07Show/hide
Query:  DISQFRTQLDLLGLQIVQVTADGNCFFRALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQAASLVTHSNIC
        D  + R +L++     V+V  DGNC FRALADQL    + H   R+ +V+ +    + ++ ++  D  F +Y   M   G W  H+ LQAA+      I 
Subjt:  DISQFRTQLDLLGLQIVQVTADGNCFFRALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQAASLVTHSNIC

Query:  IHRMSSPRWYIRNFEDREAC--MVHLSYHDEEHYNSVRLKEDT
        +        YI      +    ++ LS+  E HYN++ L  DT
Subjt:  IHRMSSPRWYIRNFEDREAC--MVHLSYHDEEHYNSVRLKEDT

AT5G03330.2 Cysteine proteinases superfamily protein3.4e-1430.07Show/hide
Query:  DISQFRTQLDLLGLQIVQVTADGNCFFRALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQAASLVTHSNIC
        D  + R +L++     V+V  DGNC FRALADQL    + H   R+ +V+ +    + ++ ++  D  F +Y   M   G W  H+ LQAA+      I 
Subjt:  DISQFRTQLDLLGLQIVQVTADGNCFFRALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQAASLVTHSNIC

Query:  IHRMSSPRWYIRNFEDREAC--MVHLSYHDEEHYNSVRLKEDT
        +        YI      +    ++ LS+  E HYN++ L  DT
Subjt:  IHRMSSPRWYIRNFEDREAC--MVHLSYHDEEHYNSVRLKEDT

AT5G67170.1 SEC-C motif-containing protein / OTU-like cysteine protease family protein9.4e-10554.35Show/hide
Query:  NKKPGKSPDISQFRTQLDLLGLQIVQVTADGNCFFRALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQAAS
        ++K GK  D+SQFR QLD LGL+I+QVTADGNCFFRA+ADQL+G+++EH KYR M+V YI+KNREMFEPFIEDDVPF++YC++M++DGTWAG++ELQAAS
Subjt:  NKKPGKSPDISQFRTQLDLLGLQIVQVTADGNCFFRALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQAAS

Query:  LVTHSNICIHRMSSPRWYIRNFEDREACMVHLSYHDEEHYNSVRLKEDTCAGPARPIIIKGDAVPSASSLQAKVMATNSQKRGGSA-ITPGNVKLVMAGS
        LVT SNICIHR  SPRWYIRNFED    M+HLSYHD EHYNSVR KED C GPARP++I+ DA  SA+S QAK   + S+ +     +  G +K+VM+GS
Subjt:  LVTHSNICIHRMSSPRWYIRNFEDREACMVHLSYHDEEHYNSVRLKEDTCAGPARPIIIKGDAVPSASSLQAKVMATNSQKRGGSA-ITPGNVKLVMAGS

Query:  GCQNPKKVEKVLVQVDGDVDSAIEFLVAEQAAE---ENEEPTESTLYHIDSSFGVDVKKGYEQLEDQTEEEYDKVDSSNHNTKHSNSS-SSQSDDKRIPR
         C N +K E+VL+QV+GDVD+AIEFL+A+Q  E   EN+  T S    I+     D     E  E   EE  ++  +S +N++   +  ++Q+DDK+IPR
Subjt:  GCQNPKKVEKVLVQVDGDVDSAIEFLVAEQAAE---ENEEPTESTLYHIDSSFGVDVKKGYEQLEDQTEEEYDKVDSSNHNTKHSNSS-SSQSDDKRIPR

Query:  NKVCPCGSKKKHKACCGSVAASSSGKNKTIDSKKTRKERKMGKKGGPAKVEVSSASDGLPHDLGALCI
        NK CPCGSKKK+K+CCG+    SS K     + +++K RK  ++G   +VE ++       D+GALCI
Subjt:  NKVCPCGSKKKHKACCGSVAASSSGKNKTIDSKKTRKERKMGKKGGPAKVEVSSASDGLPHDLGALCI

AT5G67170.2 SEC-C motif-containing protein / OTU-like cysteine protease family protein4.7e-10454.28Show/hide
Query:  SKNKK----PGKSPDISQFRTQLDLLGLQIVQVTADGNCFFRALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHL
        SK KK     GK  D+SQFR QLD LGL+I+QVTADGNCFFRA+ADQL+G+++EH KYR M+V YI+KNREMFEPFIEDDVPF++YC++M++DGTWAG++
Subjt:  SKNKK----PGKSPDISQFRTQLDLLGLQIVQVTADGNCFFRALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHL

Query:  ELQAASLVTHSNICIHRMSSPRWYIRNFEDREACMVHLSYHDEEHYNSVRLKEDTCAGPARPIIIKGDAVPSASSLQAKVMATNSQKRGGSA-ITPGNVK
        ELQAASLVT SNICIHR  SPRWYIRNFED    M+HLSYHD EHYNSVR KED C GPARP++I+ DA  SA+S QAK   + S+ +     +  G +K
Subjt:  ELQAASLVTHSNICIHRMSSPRWYIRNFEDREACMVHLSYHDEEHYNSVRLKEDTCAGPARPIIIKGDAVPSASSLQAKVMATNSQKRGGSA-ITPGNVK

Query:  LVMAGSGCQNPKKVEKVLVQVDGDVDSAIEFLVAEQAAE---ENEEPTESTLYHIDSSFGVDVKKGYEQLEDQTEEEYDKVDSSNHNTKHSNSS-SSQSD
        +VM+GS C N +K E+VL+QV+GDVD+AIEFL+A+Q  E   EN+  T S    I+     D     E  E   EE  ++  +S +N++   +  ++Q+D
Subjt:  LVMAGSGCQNPKKVEKVLVQVDGDVDSAIEFLVAEQAAE---ENEEPTESTLYHIDSSFGVDVKKGYEQLEDQTEEEYDKVDSSNHNTKHSNSS-SSQSD

Query:  DKRIPRNKVCPCGSKKKHKACCGSVAASSSGKNKTIDSKKTRKERKMGKKGGPAKVEVSSASDGLPHDLGALCI
        DK+IPRNK CPCGSKKK+K+CCG+    SS K     + +++K RK  ++G   +VE ++       D+GALCI
Subjt:  DKRIPRNKVCPCGSKKKHKACCGSVAASSSGKNKTIDSKKTRKERKMGKKGGPAKVEVSSASDGLPHDLGALCI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGAAGAACAAAAAGCCTGGAAAGTCACCTGACATTTCTCAGTTTCGTACCCAGCTTGACTTATTGGGCCTCCAAATTGTTCAAGTGACTGCAGATGGTAATTGTTT
TTTCAGGGCCCTTGCAGATCAGTTAGATGGGGATCAAGAGGAACATGGAAAGTATCGGAAGATGGTTGTGCAATATATTTTGAAGAACCGTGAAATGTTTGAGCCATTTA
TTGAGGATGATGTCCCATTTGATGAATATTGCGAGTCCATGGAAAACGATGGTACCTGGGCTGGACATCTAGAATTGCAGGCTGCTTCTCTTGTTACTCATAGTAATATA
TGCATTCATCGGATGTCATCACCACGGTGGTACATACGTAATTTTGAGGATCGAGAAGCTTGTATGGTTCACTTGTCCTATCATGATGAGGAACATTACAATAGTGTGCG
GTTGAAGGAAGACACATGTGCCGGCCCAGCCAGGCCGATCATAATCAAAGGTGATGCTGTTCCTTCAGCCAGTTCACTTCAAGCAAAAGTTATGGCTACTAATTCTCAAA
AGAGAGGTGGAAGTGCTATTACTCCTGGAAATGTCAAATTAGTTATGGCAGGCAGTGGTTGTCAAAATCCTAAAAAAGTTGAAAAGGTTTTGGTCCAAGTCGATGGTGAT
GTTGATTCTGCAATAGAGTTTCTTGTGGCAGAACAAGCAGCAGAGGAAAATGAAGAGCCAACTGAATCAACTCTGTATCATATCGATTCTTCTTTTGGTGTTGATGTAAA
AAAAGGTTATGAGCAATTGGAAGACCAAACAGAAGAGGAATACGACAAAGTTGATTCATCTAATCATAACACTAAACATTCCAACTCTAGCAGCTCTCAATCCGATGACA
AGAGGATCCCAAGGAATAAAGTCTGCCCATGCGGTTCAAAAAAGAAACATAAAGCTTGCTGTGGATCAGTTGCTGCAAGTTCGTCTGGCAAGAACAAAACTATCGACTCT
AAGAAGACTAGAAAGGAAAGGAAGATGGGCAAGAAAGGTGGACCTGCTAAAGTTGAAGTGTCCTCCGCATCTGATGGACTGCCACATGACTTGGGAGCTCTTTGTATTTG
A
mRNA sequenceShow/hide mRNA sequence
ATGTCGAAGAACAAAAAGCCTGGAAAGTCACCTGACATTTCTCAGTTTCGTACCCAGCTTGACTTATTGGGCCTCCAAATTGTTCAAGTGACTGCAGATGGTAATTGTTT
TTTCAGGGCCCTTGCAGATCAGTTAGATGGGGATCAAGAGGAACATGGAAAGTATCGGAAGATGGTTGTGCAATATATTTTGAAGAACCGTGAAATGTTTGAGCCATTTA
TTGAGGATGATGTCCCATTTGATGAATATTGCGAGTCCATGGAAAACGATGGTACCTGGGCTGGACATCTAGAATTGCAGGCTGCTTCTCTTGTTACTCATAGTAATATA
TGCATTCATCGGATGTCATCACCACGGTGGTACATACGTAATTTTGAGGATCGAGAAGCTTGTATGGTTCACTTGTCCTATCATGATGAGGAACATTACAATAGTGTGCG
GTTGAAGGAAGACACATGTGCCGGCCCAGCCAGGCCGATCATAATCAAAGGTGATGCTGTTCCTTCAGCCAGTTCACTTCAAGCAAAAGTTATGGCTACTAATTCTCAAA
AGAGAGGTGGAAGTGCTATTACTCCTGGAAATGTCAAATTAGTTATGGCAGGCAGTGGTTGTCAAAATCCTAAAAAAGTTGAAAAGGTTTTGGTCCAAGTCGATGGTGAT
GTTGATTCTGCAATAGAGTTTCTTGTGGCAGAACAAGCAGCAGAGGAAAATGAAGAGCCAACTGAATCAACTCTGTATCATATCGATTCTTCTTTTGGTGTTGATGTAAA
AAAAGGTTATGAGCAATTGGAAGACCAAACAGAAGAGGAATACGACAAAGTTGATTCATCTAATCATAACACTAAACATTCCAACTCTAGCAGCTCTCAATCCGATGACA
AGAGGATCCCAAGGAATAAAGTCTGCCCATGCGGTTCAAAAAAGAAACATAAAGCTTGCTGTGGATCAGTTGCTGCAAGTTCGTCTGGCAAGAACAAAACTATCGACTCT
AAGAAGACTAGAAAGGAAAGGAAGATGGGCAAGAAAGGTGGACCTGCTAAAGTTGAAGTGTCCTCCGCATCTGATGGACTGCCACATGACTTGGGAGCTCTTTGTATTTG
A
Protein sequenceShow/hide protein sequence
MSKNKKPGKSPDISQFRTQLDLLGLQIVQVTADGNCFFRALADQLDGDQEEHGKYRKMVVQYILKNREMFEPFIEDDVPFDEYCESMENDGTWAGHLELQAASLVTHSNI
CIHRMSSPRWYIRNFEDREACMVHLSYHDEEHYNSVRLKEDTCAGPARPIIIKGDAVPSASSLQAKVMATNSQKRGGSAITPGNVKLVMAGSGCQNPKKVEKVLVQVDGD
VDSAIEFLVAEQAAEENEEPTESTLYHIDSSFGVDVKKGYEQLEDQTEEEYDKVDSSNHNTKHSNSSSSQSDDKRIPRNKVCPCGSKKKHKACCGSVAASSSGKNKTIDS
KKTRKERKMGKKGGPAKVEVSSASDGLPHDLGALCI