; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh17G003260 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh17G003260
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
Descriptionubiquitin-like-specific protease ESD4
Genome locationCma_Chr17:1767417..1777584
RNA-Seq ExpressionCmaCh17G003260
SyntenyCmaCh17G003260
Gene Ontology termsGO:0016926 - protein desumoylation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575066.1 hypothetical protein SDJN03_25705, partial [Cucurbita argyrosperma subsp. sororia]5.0e-30089.69Show/hide
Query:  QPLRHYLPETQTSFHKLPSPLVMAGTCGSAVSFSIHCNKFTICSTKPLLSVSASISISSRSRLTRRKNHLRIKILKTLTKPPPFTVSPIPPQIDSETPIV
        QPLRHYLPE QTSFHKLPSPLVMAGTCGSAVSFSIHCNKF ICSTKPLLSVSASISISSRSRLTRRKNHLRIKILKTLTKP PFTVSPIPPQIDSETPIV
Subjt:  QPLRHYLPETQTSFHKLPSPLVMAGTCGSAVSFSIHCNKFTICSTKPLLSVSASISISSRSRLTRRKNHLRIKILKTLTKPPPFTVSPIPPQIDSETPIV

Query:  LPEISGSAGVETEVLSPVECCPSSTTTDGESRLSESSDTASLLNFDVANFSLGSFVRFGVYLLALFAFQTICTVWVLDYGNSIKEDKNSDKDLSIRSKSG
         PEISG AGVETEVLSPVEC PSSTTTDGESRLSESSDTASL NFDVANFS GSFVRFGVYLLALFAFQTICTVWVLDYGNSIKEDKNSDKDLSIRSKS 
Subjt:  LPEISGSAGVETEVLSPVECCPSSTTTDGESRLSESSDTASLLNFDVANFSLGSFVRFGVYLLALFAFQTICTVWVLDYGNSIKEDKNSDKDLSIRSKSG

Query:  KEVLLNGNERIVLGNFGSKTNELVYLDESKMRDKIEEIRLLARKARKEEKYQKPDDVGEGDTEGSNVISRARIGIDKEIDARLVRLQKRLNSNKERIPDS
        KEVLLNGNERIVLGNFGSKTNELVYLDESKMRDKIEEIRLLARKARKEEKYQKPDD+GEGDTEG NVISRARIGIDKEIDARLV+LQKRLNSNK+RIPDS
Subjt:  KEVLLNGNERIVLGNFGSKTNELVYLDESKMRDKIEEIRLLARKARKEEKYQKPDDVGEGDTEGSNVISRARIGIDKEIDARLVRLQKRLNSNKERIPDS

Query:  PVNFLFKSENVEEAAKRNDFNDEEERNKSLIYKKKLQFRNSNGDRMKKPMGFQGFVSNGKKSGSNGKAI----------NKGVKDAEKRVANEIKYNDPK
        PVNFLFKSENVEEAAKRNDF DEEERNKSLIYKKKLQFRNSNGDRMKKP GFQGF SNGKKSGSNGKA           NKGVKDA+KRVANEIKY+DPK
Subjt:  PVNFLFKSENVEEAAKRNDFNDEEERNKSLIYKKKLQFRNSNGDRMKKPMGFQGFVSNGKKSGSNGKAI----------NKGVKDAEKRVANEIKYNDPK

Query:  MSKDDSTNLGSDKSVLVQKNDGTNLDFDMKVSSSKKKSSNGAVQETSSVEISKSPDLKDVMEKRSPSRADLWWMNLPYVLVIFMHRGSEDEEHGGLFTLR
        M +DDSTNLGS+KSVL+QKNDGTNLD D+KVSSSKKKSSNGAVQ TSSVEISKS +LKDVMEKRSPSRAD WWMNLPYVLVIFMHRGSEDEEHGGLFTLR
Subjt:  MSKDDSTNLGSDKSVLVQKNDGTNLDFDMKVSSSKKKSSNGAVQETSSVEISKSPDLKDVMEKRSPSRADLWWMNLPYVLVIFMHRGSEDEEHGGLFTLR

Query:  IPSKTRDIEEYTTYTVAFEHHVDANNFCYLLESFFEELDTFTTDVIPLPTKELE-VIKSHTSKIIVVKKGQLQLYAGQPFSDVEMAFVTDDLLQQMQPWN
        +PSKT+D EEYTTYTVAFEHHVDANNFCYLLESFFEELD FTTDV+PLPTKELE VIKSHTSK+IVVKKGQLQLYAG         F     L QMQ WN
Subjt:  IPSKTRDIEEYTTYTVAFEHHVDANNFCYLLESFFEELDTFTTDVIPLPTKELE-VIKSHTSKIIVVKKGQLQLYAGQPFSDVEMAFVTDDLLQQMQPWN

Query:  RTEDGENLQQKMELLQIWIVG
        RTEDGENLQQKMELLQIW +G
Subjt:  RTEDGENLQQKMELLQIWIVG

KAG7013639.1 hypothetical protein SDJN02_23806, partial [Cucurbita argyrosperma subsp. argyrosperma]8.6e-28488.8Show/hide
Query:  MGLRPIIGSKHKQPLRHYLPETQTSFHKLPSPLVMAGTCGSAVSFSIHCNKFTICSTKPLLSVSASISISSRSRLTRRKNHLRIKILKTLTKPPPFTVSP
        MGLRPIIGSKH QPLRHYLPE QTSFHKLPSPLVMAGTCGSAVSFSIHCNKF ICSTKPLLSVSASISISSRSRLTRRKNHLRIKILKTLTKP PFTVSP
Subjt:  MGLRPIIGSKHKQPLRHYLPETQTSFHKLPSPLVMAGTCGSAVSFSIHCNKFTICSTKPLLSVSASISISSRSRLTRRKNHLRIKILKTLTKPPPFTVSP

Query:  IPPQIDSETPIVLPEISGSAGVETEVLSPVECCPSSTTTDGESRLSESSDTASLLNFDVANFSLGSFVRFGVYLLALFAFQTICTVWVLDYGNSIKEDKN
        IPPQIDSETPIV PEISG AGVETEVLSPVECCPSSTTTDGESRLSESSDTASL NFDVANFS GSFVRFGVYLLALFAFQTICTVWVLDYGNSIKEDKN
Subjt:  IPPQIDSETPIVLPEISGSAGVETEVLSPVECCPSSTTTDGESRLSESSDTASLLNFDVANFSLGSFVRFGVYLLALFAFQTICTVWVLDYGNSIKEDKN

Query:  SDKDLSIRSKSGKEVLLNGNERIVLGNFGSKTNELVYLDESKMRDKIEEIRLLARKARKEEKYQKPDDVGEGDTEGSNVISRARIGIDKEIDARLVRLQK
        SDKDLSIRSKS KEVLLNGNERIVLGNFGSKTNELVYLDESKMRDKIEEIRLLARKARKEEKYQKPDD+GEGDTEG NVISRARIGIDKEIDARLV+LQK
Subjt:  SDKDLSIRSKSGKEVLLNGNERIVLGNFGSKTNELVYLDESKMRDKIEEIRLLARKARKEEKYQKPDDVGEGDTEGSNVISRARIGIDKEIDARLVRLQK

Query:  RLNSNKERIPDSPVNFLFKSENVEEAAKRNDFNDEEERNKSLIYKKKLQFRNSNGDRMKKPMGFQGFVSNGKKSGSNGKAI----------NKGVKDAEK
        RLNSNK+RIPDSPVNFLFKSENVEEAAKRNDF DEEERNKSLIYKKKLQFRNSNGDRMKKP GFQGF SNGKKSGSNGKA           NKGVKDA+K
Subjt:  RLNSNKERIPDSPVNFLFKSENVEEAAKRNDFNDEEERNKSLIYKKKLQFRNSNGDRMKKPMGFQGFVSNGKKSGSNGKAI----------NKGVKDAEK

Query:  RVANEIKYNDPKMSKDDSTNLGSDKSVLVQKNDGTNLDFDMKVSSSKKKSSNGAVQETSSVEISKSPDLKDVMEKRSPSRADLWWMNLPYVLVIFMHRGS
        RVANEIKY+DPKM +DDSTNLGS+KSVL+QKNDGTNLD D+KVSSSKKKSSNGAVQ TSSVEISKS +LKD                     VIFMHRGS
Subjt:  RVANEIKYNDPKMSKDDSTNLGSDKSVLVQKNDGTNLDFDMKVSSSKKKSSNGAVQETSSVEISKSPDLKDVMEKRSPSRADLWWMNLPYVLVIFMHRGS

Query:  EDEEHGGLFTLRIPSKTRDIEEYTTYTVAFEHHVDANNFCYLLESFFEELDTFTTDVIPLPTKELE-VIKSHTSKIIVVKKGQLQLYAGQPFSDVEMA
        EDEEHGGLFTLR+PSKT+D EEYTTYTVAFEHHVDANNFCYLLESFFEELD FTTDV+PLPTKELE VIKSHTSK+IVVKKGQLQLYAGQPF+DVEMA
Subjt:  EDEEHGGLFTLRIPSKTRDIEEYTTYTVAFEHHVDANNFCYLLESFFEELDTFTTDVIPLPTKELE-VIKSHTSKIIVVKKGQLQLYAGQPFSDVEMA

XP_023006301.1 uncharacterized protein LOC111499077 [Cucurbita maxima]7.5e-304100Show/hide
Query:  MAGTCGSAVSFSIHCNKFTICSTKPLLSVSASISISSRSRLTRRKNHLRIKILKTLTKPPPFTVSPIPPQIDSETPIVLPEISGSAGVETEVLSPVECCP
        MAGTCGSAVSFSIHCNKFTICSTKPLLSVSASISISSRSRLTRRKNHLRIKILKTLTKPPPFTVSPIPPQIDSETPIVLPEISGSAGVETEVLSPVECCP
Subjt:  MAGTCGSAVSFSIHCNKFTICSTKPLLSVSASISISSRSRLTRRKNHLRIKILKTLTKPPPFTVSPIPPQIDSETPIVLPEISGSAGVETEVLSPVECCP

Query:  SSTTTDGESRLSESSDTASLLNFDVANFSLGSFVRFGVYLLALFAFQTICTVWVLDYGNSIKEDKNSDKDLSIRSKSGKEVLLNGNERIVLGNFGSKTNE
        SSTTTDGESRLSESSDTASLLNFDVANFSLGSFVRFGVYLLALFAFQTICTVWVLDYGNSIKEDKNSDKDLSIRSKSGKEVLLNGNERIVLGNFGSKTNE
Subjt:  SSTTTDGESRLSESSDTASLLNFDVANFSLGSFVRFGVYLLALFAFQTICTVWVLDYGNSIKEDKNSDKDLSIRSKSGKEVLLNGNERIVLGNFGSKTNE

Query:  LVYLDESKMRDKIEEIRLLARKARKEEKYQKPDDVGEGDTEGSNVISRARIGIDKEIDARLVRLQKRLNSNKERIPDSPVNFLFKSENVEEAAKRNDFND
        LVYLDESKMRDKIEEIRLLARKARKEEKYQKPDDVGEGDTEGSNVISRARIGIDKEIDARLVRLQKRLNSNKERIPDSPVNFLFKSENVEEAAKRNDFND
Subjt:  LVYLDESKMRDKIEEIRLLARKARKEEKYQKPDDVGEGDTEGSNVISRARIGIDKEIDARLVRLQKRLNSNKERIPDSPVNFLFKSENVEEAAKRNDFND

Query:  EEERNKSLIYKKKLQFRNSNGDRMKKPMGFQGFVSNGKKSGSNGKAINKGVKDAEKRVANEIKYNDPKMSKDDSTNLGSDKSVLVQKNDGTNLDFDMKVS
        EEERNKSLIYKKKLQFRNSNGDRMKKPMGFQGFVSNGKKSGSNGKAINKGVKDAEKRVANEIKYNDPKMSKDDSTNLGSDKSVLVQKNDGTNLDFDMKVS
Subjt:  EEERNKSLIYKKKLQFRNSNGDRMKKPMGFQGFVSNGKKSGSNGKAINKGVKDAEKRVANEIKYNDPKMSKDDSTNLGSDKSVLVQKNDGTNLDFDMKVS

Query:  SSKKKSSNGAVQETSSVEISKSPDLKDVMEKRSPSRADLWWMNLPYVLVIFMHRGSEDEEHGGLFTLRIPSKTRDIEEYTTYTVAFEHHVDANNFCYLLE
        SSKKKSSNGAVQETSSVEISKSPDLKDVMEKRSPSRADLWWMNLPYVLVIFMHRGSEDEEHGGLFTLRIPSKTRDIEEYTTYTVAFEHHVDANNFCYLLE
Subjt:  SSKKKSSNGAVQETSSVEISKSPDLKDVMEKRSPSRADLWWMNLPYVLVIFMHRGSEDEEHGGLFTLRIPSKTRDIEEYTTYTVAFEHHVDANNFCYLLE

Query:  SFFEELDTFTTDVIPLPTKELEVIKSHTSKIIVVKKGQLQLYAGQPFSDVEMA
        SFFEELDTFTTDVIPLPTKELEVIKSHTSKIIVVKKGQLQLYAGQPFSDVEMA
Subjt:  SFFEELDTFTTDVIPLPTKELEVIKSHTSKIIVVKKGQLQLYAGQPFSDVEMA

XP_023006303.1 ubiquitin-like-specific protease ESD4 [Cucurbita maxima]1.4e-286100Show/hide
Query:  MGARTNNRKRDDECLSLNRSYSSLKSPDFHVSKKPKVSALSLDRPVSSSNYTVARLSRYPEETPQLRREVHGPCRIRKFGLWKSFYRHWESKNNCESSEQ
        MGARTNNRKRDDECLSLNRSYSSLKSPDFHVSKKPKVSALSLDRPVSSSNYTVARLSRYPEETPQLRREVHGPCRIRKFGLWKSFYRHWESKNNCESSEQ
Subjt:  MGARTNNRKRDDECLSLNRSYSSLKSPDFHVSKKPKVSALSLDRPVSSSNYTVARLSRYPEETPQLRREVHGPCRIRKFGLWKSFYRHWESKNNCESSEQ

Query:  YAAGNMLSYNYQIAKNRAIGALRSFPKDFINLDSDSETERGASGDSKNEDNIEAIEDDKPDQRPCEVNTMQELDAKMKNVHQPSTSVVVDLTNADSKVEN
        YAAGNMLSYNYQIAKNRAIGALRSFPKDFINLDSDSETERGASGDSKNEDNIEAIEDDKPDQRPCEVNTMQELDAKMKNVHQPSTSVVVDLTNADSKVEN
Subjt:  YAAGNMLSYNYQIAKNRAIGALRSFPKDFINLDSDSETERGASGDSKNEDNIEAIEDDKPDQRPCEVNTMQELDAKMKNVHQPSTSVVVDLTNADSKVEN

Query:  AEAMLGALSLDRDLSSVSAYKKLLQTVERRTSRLKSLDFEIELNEKRRSILQSRTPKKKPLDEIPQELFTPLTKEEEVQVERALSTNRRRVLVTHENSNI
        AEAMLGALSLDRDLSSVSAYKKLLQTVERRTSRLKSLDFEIELNEKRRSILQSRTPKKKPLDEIPQELFTPLTKEEEVQVERALSTNRRRVLVTHENSNI
Subjt:  AEAMLGALSLDRDLSSVSAYKKLLQTVERRTSRLKSLDFEIELNEKRRSILQSRTPKKKPLDEIPQELFTPLTKEEEVQVERALSTNRRRVLVTHENSNI

Query:  EITGETLQCLRPAAWLNDEVINLYLELLKERERREPEKYLKCHFFSTFFYKKLNGRNGYDYGSVRKWTNPKKLKYELIDCDKIFVPIHREIHWCLAVINK
        EITGETLQCLRPAAWLNDEVINLYLELLKERERREPEKYLKCHFFSTFFYKKLNGRNGYDYGSVRKWTNPKKLKYELIDCDKIFVPIHREIHWCLAVINK
Subjt:  EITGETLQCLRPAAWLNDEVINLYLELLKERERREPEKYLKCHFFSTFFYKKLNGRNGYDYGSVRKWTNPKKLKYELIDCDKIFVPIHREIHWCLAVINK

Query:  KEKKFQYLDSLKGMDSRVLKTLARYFVDEVKNKSGKDIDVSSWAQEFVEDLPEQENGFDCGMFMIKYADFYSRGLNLCFKQEHMPYFRRRTAKEILKLRA
        KEKKFQYLDSLKGMDSRVLKTLARYFVDEVKNKSGKDIDVSSWAQEFVEDLPEQENGFDCGMFMIKYADFYSRGLNLCFKQEHMPYFRRRTAKEILKLRA
Subjt:  KEKKFQYLDSLKGMDSRVLKTLARYFVDEVKNKSGKDIDVSSWAQEFVEDLPEQENGFDCGMFMIKYADFYSRGLNLCFKQEHMPYFRRRTAKEILKLRA

Query:  N
        N
Subjt:  N

XP_023549379.1 uncharacterized protein LOC111807740 [Cucurbita pepo subsp. pepo]8.4e-28794.95Show/hide
Query:  MAGTCGSAVSFSIHCNKFTICSTKPLLSVSASISISSRSRLTRRKNHLRIKILKTLTKPPPFTVSPIPPQIDSETPIVLPEISGSAGVETEVLSPVECCP
        MAGTCGSAVSFSIHCNKFTICSTKPLLSVSASISISSRSRLTRRKNHLRIKILKTLTKPPPFTVSPIPP IDSETPIV PEISG AGVETEVLSP ECCP
Subjt:  MAGTCGSAVSFSIHCNKFTICSTKPLLSVSASISISSRSRLTRRKNHLRIKILKTLTKPPPFTVSPIPPQIDSETPIVLPEISGSAGVETEVLSPVECCP

Query:  SSTTTDGESRLSESSDTASLLNFDVANFSLGSFVRFGVYLLALFAFQTICTVWVLDYGNSIKEDKNSDKDLSIRSKSGKEVLLNGNERIVLGNFGSKTNE
        SSTTTDGESRLSESSDTASLLNFDVANFSLGSFVRFGVYLLALFAFQTICTVWVLDYGNSIKEDKNSDKDLSIRSKS KEVLLNGNERIVLGNFGSKTNE
Subjt:  SSTTTDGESRLSESSDTASLLNFDVANFSLGSFVRFGVYLLALFAFQTICTVWVLDYGNSIKEDKNSDKDLSIRSKSGKEVLLNGNERIVLGNFGSKTNE

Query:  LVYLDESKMRDKIEEIRLLARKARKEEKYQKPDDVGEGDTEGSNVISRARIGIDKEIDARLVRLQKRLNSNKERIPDSPVNFLFKSENVEEAAKRNDFND
        LVYLDESKMRDKIEEIRLLARKARKEEKYQKPDD+GEGD EG NVISRARIGIDKEIDARLV+LQKRLNSNK+RIPDSPVNFLFKSENVEEAAKRNDFND
Subjt:  LVYLDESKMRDKIEEIRLLARKARKEEKYQKPDDVGEGDTEGSNVISRARIGIDKEIDARLVRLQKRLNSNKERIPDSPVNFLFKSENVEEAAKRNDFND

Query:  EEERNKSLIYKKKLQFRNSNGDRMKKPMGFQGFVSNGKKSGSNGKAINKGVKDAEKRVANEIKYNDPKMSKDDSTNLGSDKSVLVQKNDGTNLDFDMKVS
        EEERNKSLIYKKKLQFRNSNGDRMKKP GFQGF SNGKKSGSNGKAINKGVKDAEKRVAN IKY+DPKM +DDSTNLGSDKSVL+QKNDGTNLD D+K S
Subjt:  EEERNKSLIYKKKLQFRNSNGDRMKKPMGFQGFVSNGKKSGSNGKAINKGVKDAEKRVANEIKYNDPKMSKDDSTNLGSDKSVLVQKNDGTNLDFDMKVS

Query:  SSKKKSSNGAVQETSSVEISKSPDLKDVMEKRSPSRADLWWMNLPYVLVIFMHRGSEDEEHGGLFTLRIPSKTRDIEEYTTYTVAFEHHVDANNFCYLLE
        SSKKKSSNGAVQETSSVEISKS +LKDVMEKRSPSRAD WWMNLPYVLVIFMHRGSEDEEHGGLFTLRIPSKT+DIEEYTTYTVAFEHHVDANNFCYLLE
Subjt:  SSKKKSSNGAVQETSSVEISKSPDLKDVMEKRSPSRADLWWMNLPYVLVIFMHRGSEDEEHGGLFTLRIPSKTRDIEEYTTYTVAFEHHVDANNFCYLLE

Query:  SFFEELDTFTTDVIPLPTKELE-VIKSHTSKIIVVKKGQLQLYAGQPFSDVEMA
        SFFEELD FTTDV+PLPTKELE VIKSHTSK+IVVKKGQLQLYAGQPFSDVEMA
Subjt:  SFFEELDTFTTDVIPLPTKELE-VIKSHTSKIIVVKKGQLQLYAGQPFSDVEMA

TrEMBL top hitse value%identityAlignment
A0A0A0KEW0 ULP_PROTEASE domain-containing protein1.1e-23683.67Show/hide
Query:  MGARTNNRKRDDECLSLNRSYSSLKSPDFHVSKKPKVSALSLDRPVSSSNYTVARLSRYPEETPQLRREVHGPCRIRKFGLWKSFYRHWESKNNCESSEQ
        MGART+NRKRDDECLS+NRSYSSL+SPDFHVSKKPK S +S DRPV SSN TVARLSRYPEET  LRREVHGPCR+ KFGL +S  R WESKN+ + SEQ
Subjt:  MGARTNNRKRDDECLSLNRSYSSLKSPDFHVSKKPKVSALSLDRPVSSSNYTVARLSRYPEETPQLRREVHGPCRIRKFGLWKSFYRHWESKNNCESSEQ

Query:  YAAGNMLSYNYQIAKNRAIGALRSFPKDFINLDSDSETERGASGDSKNEDNIEAIEDDKPDQRPCEVNTMQELDAKMKNVHQPSTSV-VVDLTNADSKVE
           GN+LSYNYQ+AK+RAIG+LRSFP+D I LDSDS+TE+  SGDSKNED++E IED+  + R  EV TM+ELD K+ +VHQPS+S+ VVDLTN DSKVE
Subjt:  YAAGNMLSYNYQIAKNRAIGALRSFPKDFINLDSDSETERGASGDSKNEDNIEAIEDDKPDQRPCEVNTMQELDAKMKNVHQPSTSV-VVDLTNADSKVE

Query:  NAEAMLGALSLDRDLSSVSAYKKLLQTVERRTSRLKSLDFEIELNEKRRSILQSRTPKKKPLDEIPQELFTPLTKEEEVQVERALSTNRRRVLVTHENSN
        NAE MLGALSL+ D+SSV AYKKLLQ+VE+RTSRLKSLDFEIELNEKRRS+LQS TPKKKP+DEIPQELFTPLTKEEE +VERA S+NRRR+LV HENSN
Subjt:  NAEAMLGALSLDRDLSSVSAYKKLLQTVERRTSRLKSLDFEIELNEKRRSILQSRTPKKKPLDEIPQELFTPLTKEEEVQVERALSTNRRRVLVTHENSN

Query:  IEITGETLQCLRPAAWLNDEVINLYLELLKERERREPEKYLKCHFFSTFFYKKLNGRNGYDYGSVRKWTNPKKLKYELIDCDKIFVPIHREIHWCLAVIN
        IEITGETLQCLRPAAWLNDEVINLYLELLKERERREPEKYLKCHFF+TFFYKKLNGRNGYDY SV++WT+ +KLKYELIDCDKIFVPIHREIHWCLAVIN
Subjt:  IEITGETLQCLRPAAWLNDEVINLYLELLKERERREPEKYLKCHFFSTFFYKKLNGRNGYDYGSVRKWTNPKKLKYELIDCDKIFVPIHREIHWCLAVIN

Query:  KKEKKFQYLDSLKGMDSRVLKTLARYFVDEVKNKSGKDIDVSSWAQEFVEDLPEQENGFDCGMFMIKYADFYSRGLNLCFKQEHMPYFRRRTAKEILKLR
        KKEKKFQYLDSLKGMDSRVLKTLARYFVDEVK+KSGK+IDVSSWAQEFVEDLPEQENGFDCGMFMIKYADFYSRGLNLCFKQEHMPYFR RTAKEILKLR
Subjt:  KKEKKFQYLDSLKGMDSRVLKTLARYFVDEVKNKSGKDIDVSSWAQEFVEDLPEQENGFDCGMFMIKYADFYSRGLNLCFKQEHMPYFRRRTAKEILKLR

Query:  AN
        AN
Subjt:  AN

A0A6J1H4G3 ubiquitin-like-specific protease ESD43.7e-28098.2Show/hide
Query:  MGARTNNRKRDDECLSLNRSYSSLKSPDFHVSKKPKVSALSLDRPVSSSNYTVARLSRYPEETPQLRREVHGPCRIRKFGLWKSFYRHWESKNNCESSEQ
        MGARTNNRKRDDECLSLNRSYSSLKSPDFHVSKKPKVSALSLDRPVSSSNYTVARL+RYPEETPQLRREVHGPCRIRKFGL KSF RHWESKNNCESSEQ
Subjt:  MGARTNNRKRDDECLSLNRSYSSLKSPDFHVSKKPKVSALSLDRPVSSSNYTVARLSRYPEETPQLRREVHGPCRIRKFGLWKSFYRHWESKNNCESSEQ

Query:  YAAGNMLSYNYQIAKNRAIGALRSFPKDFINLDSDSETERGASGDSKNEDNIEAIEDDKPDQRPCEVNTMQELDAKMKNVHQPSTSVVVDLTNADSKVEN
        YAAGNMLSYNYQIAKNRAIGALRSFPKDFI+LDSDSETERGASGDSKNED +EAIEDDKPDQRP EVNT+QELDAKMKNVHQPSTSVVVDLTNADSKVEN
Subjt:  YAAGNMLSYNYQIAKNRAIGALRSFPKDFINLDSDSETERGASGDSKNEDNIEAIEDDKPDQRPCEVNTMQELDAKMKNVHQPSTSVVVDLTNADSKVEN

Query:  AEAMLGALSLDRDLSSVSAYKKLLQTVERRTSRLKSLDFEIELNEKRRSILQSRTPKKKPLDEIPQELFTPLTKEEEVQVERALSTNRRRVLVTHENSNI
        AEAMLGALSLDRDLSSVSAYKKLLQTVERRTSRLKSLDFEIELNEKRRSILQSRTPKKKPLDEIPQELFTPLTKEEE QVERALSTNRRRVLVTHENSNI
Subjt:  AEAMLGALSLDRDLSSVSAYKKLLQTVERRTSRLKSLDFEIELNEKRRSILQSRTPKKKPLDEIPQELFTPLTKEEEVQVERALSTNRRRVLVTHENSNI

Query:  EITGETLQCLRPAAWLNDEVINLYLELLKERERREPEKYLKCHFFSTFFYKKLNGRNGYDYGSVRKWTNPKKLKYELIDCDKIFVPIHREIHWCLAVINK
        EITGETLQCLRPAAWLNDEVINLYLELLKERERREPEKYLKCHFFSTFFYKKLNGRNGYDYGSVRKWTNPKKLKYELIDCDKIFVPIHREIHWCLAVINK
Subjt:  EITGETLQCLRPAAWLNDEVINLYLELLKERERREPEKYLKCHFFSTFFYKKLNGRNGYDYGSVRKWTNPKKLKYELIDCDKIFVPIHREIHWCLAVINK

Query:  KEKKFQYLDSLKGMDSRVLKTLARYFVDEVKNKSGKDIDVSSWAQEFVEDLPEQENGFDCGMFMIKYADFYSRGLNLCFKQEHMPYFRRRTAKEILKLRA
        KEKKFQYLDSLKGMDSRVLKTLARYFVDEVKNKSGKDIDVSSWAQEFVEDLPEQENGFDCGMFMIKYADFYSRGLNLCFKQEHMPYFRRRTAKEILKLRA
Subjt:  KEKKFQYLDSLKGMDSRVLKTLARYFVDEVKNKSGKDIDVSSWAQEFVEDLPEQENGFDCGMFMIKYADFYSRGLNLCFKQEHMPYFRRRTAKEILKLRA

Query:  N
        N
Subjt:  N

A0A6J1H4L2 uncharacterized protein LOC1114604285.1e-28293.5Show/hide
Query:  MAGTCGSAVSFSIHCNKFTICSTKPLLSVSASISISSRSRLTRRKNHLRIKILKTLTKPPPFTVSPIPPQIDSETPIVLPEISGSAGVETEVLSPVECCP
        MAGTCGSAVSFSIHCNKF ICSTKPLLSVSASISISSRSRLTRRKNHLRIKILKTLTKP  FTVSPIPPQIDSETPIV PEISG AGVETEVLSPVECCP
Subjt:  MAGTCGSAVSFSIHCNKFTICSTKPLLSVSASISISSRSRLTRRKNHLRIKILKTLTKPPPFTVSPIPPQIDSETPIVLPEISGSAGVETEVLSPVECCP

Query:  SSTTTDGESRLSESSDTASLLNFDVANFSLGSFVRFGVYLLALFAFQTICTVWVLDYGNSIKEDKNSDKDLSIRSKSGKEVLLNGNERIVLGNFGSKTNE
        SST+TDGESRLSESSDTASL NFDVANFS GSFVRFGVYLLALFAFQTICTVWVLDYGNSIKEDKNSDKDLSIRSKS KEVLLNGNERIVLGNFGSKTNE
Subjt:  SSTTTDGESRLSESSDTASLLNFDVANFSLGSFVRFGVYLLALFAFQTICTVWVLDYGNSIKEDKNSDKDLSIRSKSGKEVLLNGNERIVLGNFGSKTNE

Query:  LVYLDESKMRDKIEEIRLLARKARKEEKYQKPDDVGEGDTEGSNVISRARIGIDKEIDARLVRLQKRLNSNKERIPDSPVNFLFKSENVEEAAKRNDFND
        LVYLDESKMRDKIEEIRLLARKARKEEKYQKPDD+GEGD EG NVISRARIGIDKEIDARLV+LQKRLNSNK+R+PDSPVNFLFKSENVEEAAKRNDFND
Subjt:  LVYLDESKMRDKIEEIRLLARKARKEEKYQKPDDVGEGDTEGSNVISRARIGIDKEIDARLVRLQKRLNSNKERIPDSPVNFLFKSENVEEAAKRNDFND

Query:  EEERNKSLIYKKKLQFRNSNGDRMKKPMGFQGFVSNGKKSGSNGKAINKGVKDAEKRVANEIKYNDPKMSKDDSTNLGSDKSVLVQKNDGTNLDFDMKVS
        EEERNKSLIYKKKLQFRNSNGDRMKKP GFQGFVSN KKSGSNGKAINKGVKDAEKRVANEIKYNDPKM KDDSTNLGSDKSVL+QKNDGTNLD D+K S
Subjt:  EEERNKSLIYKKKLQFRNSNGDRMKKPMGFQGFVSNGKKSGSNGKAINKGVKDAEKRVANEIKYNDPKMSKDDSTNLGSDKSVLVQKNDGTNLDFDMKVS

Query:  SSKKKSSNGAVQETSSVEISKSPDLKDVMEKRSPSRADLWWMNLPYVLVIFMHRGSEDEEHGGLFTLRIPSKTRDIEEYTTYTVAFEHHVDANNFCYLLE
        SSKKKSSNGAVQ TSSVEISKS +LKDV+EKRSPSRAD WWMNLPYVLVIFMH GSEDEEHGGLFTLR+PSKT+D EEYTTYTVAFEHHVDANNFCYLLE
Subjt:  SSKKKSSNGAVQETSSVEISKSPDLKDVMEKRSPSRADLWWMNLPYVLVIFMHRGSEDEEHGGLFTLRIPSKTRDIEEYTTYTVAFEHHVDANNFCYLLE

Query:  SFFEELDTFTTDVIPLPTKELE-VIKSHTSKIIVVKKGQLQLYAGQPFSDVEMA
        SFFEELD FTTDV+PLPTKELE VIKSHTSK+IVVKKGQLQLYAGQPF+DVEMA
Subjt:  SFFEELDTFTTDVIPLPTKELE-VIKSHTSKIIVVKKGQLQLYAGQPFSDVEMA

A0A6J1KXF0 uncharacterized protein LOC1114990773.6e-304100Show/hide
Query:  MAGTCGSAVSFSIHCNKFTICSTKPLLSVSASISISSRSRLTRRKNHLRIKILKTLTKPPPFTVSPIPPQIDSETPIVLPEISGSAGVETEVLSPVECCP
        MAGTCGSAVSFSIHCNKFTICSTKPLLSVSASISISSRSRLTRRKNHLRIKILKTLTKPPPFTVSPIPPQIDSETPIVLPEISGSAGVETEVLSPVECCP
Subjt:  MAGTCGSAVSFSIHCNKFTICSTKPLLSVSASISISSRSRLTRRKNHLRIKILKTLTKPPPFTVSPIPPQIDSETPIVLPEISGSAGVETEVLSPVECCP

Query:  SSTTTDGESRLSESSDTASLLNFDVANFSLGSFVRFGVYLLALFAFQTICTVWVLDYGNSIKEDKNSDKDLSIRSKSGKEVLLNGNERIVLGNFGSKTNE
        SSTTTDGESRLSESSDTASLLNFDVANFSLGSFVRFGVYLLALFAFQTICTVWVLDYGNSIKEDKNSDKDLSIRSKSGKEVLLNGNERIVLGNFGSKTNE
Subjt:  SSTTTDGESRLSESSDTASLLNFDVANFSLGSFVRFGVYLLALFAFQTICTVWVLDYGNSIKEDKNSDKDLSIRSKSGKEVLLNGNERIVLGNFGSKTNE

Query:  LVYLDESKMRDKIEEIRLLARKARKEEKYQKPDDVGEGDTEGSNVISRARIGIDKEIDARLVRLQKRLNSNKERIPDSPVNFLFKSENVEEAAKRNDFND
        LVYLDESKMRDKIEEIRLLARKARKEEKYQKPDDVGEGDTEGSNVISRARIGIDKEIDARLVRLQKRLNSNKERIPDSPVNFLFKSENVEEAAKRNDFND
Subjt:  LVYLDESKMRDKIEEIRLLARKARKEEKYQKPDDVGEGDTEGSNVISRARIGIDKEIDARLVRLQKRLNSNKERIPDSPVNFLFKSENVEEAAKRNDFND

Query:  EEERNKSLIYKKKLQFRNSNGDRMKKPMGFQGFVSNGKKSGSNGKAINKGVKDAEKRVANEIKYNDPKMSKDDSTNLGSDKSVLVQKNDGTNLDFDMKVS
        EEERNKSLIYKKKLQFRNSNGDRMKKPMGFQGFVSNGKKSGSNGKAINKGVKDAEKRVANEIKYNDPKMSKDDSTNLGSDKSVLVQKNDGTNLDFDMKVS
Subjt:  EEERNKSLIYKKKLQFRNSNGDRMKKPMGFQGFVSNGKKSGSNGKAINKGVKDAEKRVANEIKYNDPKMSKDDSTNLGSDKSVLVQKNDGTNLDFDMKVS

Query:  SSKKKSSNGAVQETSSVEISKSPDLKDVMEKRSPSRADLWWMNLPYVLVIFMHRGSEDEEHGGLFTLRIPSKTRDIEEYTTYTVAFEHHVDANNFCYLLE
        SSKKKSSNGAVQETSSVEISKSPDLKDVMEKRSPSRADLWWMNLPYVLVIFMHRGSEDEEHGGLFTLRIPSKTRDIEEYTTYTVAFEHHVDANNFCYLLE
Subjt:  SSKKKSSNGAVQETSSVEISKSPDLKDVMEKRSPSRADLWWMNLPYVLVIFMHRGSEDEEHGGLFTLRIPSKTRDIEEYTTYTVAFEHHVDANNFCYLLE

Query:  SFFEELDTFTTDVIPLPTKELEVIKSHTSKIIVVKKGQLQLYAGQPFSDVEMA
        SFFEELDTFTTDVIPLPTKELEVIKSHTSKIIVVKKGQLQLYAGQPFSDVEMA
Subjt:  SFFEELDTFTTDVIPLPTKELEVIKSHTSKIIVVKKGQLQLYAGQPFSDVEMA

A0A6J1L4J0 ubiquitin-like-specific protease ESD46.9e-287100Show/hide
Query:  MGARTNNRKRDDECLSLNRSYSSLKSPDFHVSKKPKVSALSLDRPVSSSNYTVARLSRYPEETPQLRREVHGPCRIRKFGLWKSFYRHWESKNNCESSEQ
        MGARTNNRKRDDECLSLNRSYSSLKSPDFHVSKKPKVSALSLDRPVSSSNYTVARLSRYPEETPQLRREVHGPCRIRKFGLWKSFYRHWESKNNCESSEQ
Subjt:  MGARTNNRKRDDECLSLNRSYSSLKSPDFHVSKKPKVSALSLDRPVSSSNYTVARLSRYPEETPQLRREVHGPCRIRKFGLWKSFYRHWESKNNCESSEQ

Query:  YAAGNMLSYNYQIAKNRAIGALRSFPKDFINLDSDSETERGASGDSKNEDNIEAIEDDKPDQRPCEVNTMQELDAKMKNVHQPSTSVVVDLTNADSKVEN
        YAAGNMLSYNYQIAKNRAIGALRSFPKDFINLDSDSETERGASGDSKNEDNIEAIEDDKPDQRPCEVNTMQELDAKMKNVHQPSTSVVVDLTNADSKVEN
Subjt:  YAAGNMLSYNYQIAKNRAIGALRSFPKDFINLDSDSETERGASGDSKNEDNIEAIEDDKPDQRPCEVNTMQELDAKMKNVHQPSTSVVVDLTNADSKVEN

Query:  AEAMLGALSLDRDLSSVSAYKKLLQTVERRTSRLKSLDFEIELNEKRRSILQSRTPKKKPLDEIPQELFTPLTKEEEVQVERALSTNRRRVLVTHENSNI
        AEAMLGALSLDRDLSSVSAYKKLLQTVERRTSRLKSLDFEIELNEKRRSILQSRTPKKKPLDEIPQELFTPLTKEEEVQVERALSTNRRRVLVTHENSNI
Subjt:  AEAMLGALSLDRDLSSVSAYKKLLQTVERRTSRLKSLDFEIELNEKRRSILQSRTPKKKPLDEIPQELFTPLTKEEEVQVERALSTNRRRVLVTHENSNI

Query:  EITGETLQCLRPAAWLNDEVINLYLELLKERERREPEKYLKCHFFSTFFYKKLNGRNGYDYGSVRKWTNPKKLKYELIDCDKIFVPIHREIHWCLAVINK
        EITGETLQCLRPAAWLNDEVINLYLELLKERERREPEKYLKCHFFSTFFYKKLNGRNGYDYGSVRKWTNPKKLKYELIDCDKIFVPIHREIHWCLAVINK
Subjt:  EITGETLQCLRPAAWLNDEVINLYLELLKERERREPEKYLKCHFFSTFFYKKLNGRNGYDYGSVRKWTNPKKLKYELIDCDKIFVPIHREIHWCLAVINK

Query:  KEKKFQYLDSLKGMDSRVLKTLARYFVDEVKNKSGKDIDVSSWAQEFVEDLPEQENGFDCGMFMIKYADFYSRGLNLCFKQEHMPYFRRRTAKEILKLRA
        KEKKFQYLDSLKGMDSRVLKTLARYFVDEVKNKSGKDIDVSSWAQEFVEDLPEQENGFDCGMFMIKYADFYSRGLNLCFKQEHMPYFRRRTAKEILKLRA
Subjt:  KEKKFQYLDSLKGMDSRVLKTLARYFVDEVKNKSGKDIDVSSWAQEFVEDLPEQENGFDCGMFMIKYADFYSRGLNLCFKQEHMPYFRRRTAKEILKLRA

Query:  N
        N
Subjt:  N

SwissProt top hitse value%identityAlignment
O42957 Ubiquitin-like-specific protease 12.5e-3640.91Show/hide
Query:  NIEITGETLQCLRPAAWLNDEVINLYLELLKERERREPEKYLKCHFFSTFFYKKLNGRNGYDYGSVRKWTNPKKLKYELIDCDKIFVPIHREIHWCLAVI
        NI IT + L  LR   WLNDEVIN Y+ L+ ER + +     + H F+TFFY  L  R    Y  VR+W   KK +  + D D +F+P+H ++HWC+AVI
Subjt:  NIEITGETLQCLRPAAWLNDEVINLYLELLKERERREPEKYLKCHFFSTFFYKKLNGRNGYDYGSVRKWTNPKKLKYELIDCDKIFVPIHREIHWCLAVI

Query:  NKKEKKFQYLDSLKGMDSRVLKTLARYFVDEVKNKSGKDIDVSSWAQEFVEDLPEQENGFDCGMFMIKYADFYSRGLNLCFKQEHMPYFRRRTAKEIL
        NK +K+F+Y DSL G   +V   L  Y++ E K      +DVS W     ++ P Q NG DCG+F  K A+  SR + + F Q  MP  R + A  I+
Subjt:  NKKEKKFQYLDSLKGMDSRVLKTLARYFVDEVKNKSGKDIDVSSWAQEFVEDLPEQENGFDCGMFMIKYADFYSRGLNLCFKQEHMPYFRRRTAKEIL

O65278 Putative ubiquitin-like-specific protease 1B5.6e-8456.61Show/hide
Query:  SSVSAYKKLL------QTVERRTSRLKSLDFEIELNEKRRSILQSRTPKKKPLDEIPQELFTPLTKEEEVQVERALS-TNRRRVLVTHENSNIEITGETL
        SSVS  KK        Q V+    R++         +   + L S T  KK L     E F PL +EE   V  ALS  NR+++LV+H+NSNI+I+GETL
Subjt:  SSVSAYKKLL------QTVERRTSRLKSLDFEIELNEKRRSILQSRTPKKKPLDEIPQELFTPLTKEEEVQVERALS-TNRRRVLVTHENSNIEITGETL

Query:  QCLRPAAWLNDEVINLYLELLKERERREPEKYLKCHFFSTFFYKKLNGRNGYDYGSVRKWTNPKKLKYELIDCDKIFVPIHREIHWCLAVINKKEKKFQY
        QCLRP  WLND+V NLYLELLKER+ R+P+KY KCHFF+TFFY KL   +GY+Y +V +WT  +KL Y+LIDCD IFVPIH +IHW L VIN +E+KF Y
Subjt:  QCLRPAAWLNDEVINLYLELLKERERREPEKYLKCHFFSTFFYKKLNGRNGYDYGSVRKWTNPKKLKYELIDCDKIFVPIHREIHWCLAVINKKEKKFQY

Query:  LDSL-KGMDSRVLKTLARYFVDEVKNKSGKDIDVSSWAQEFVEDLPEQENGFDCGMFMIKYADFYSRGLNLCFKQEHMPYFRRRTAKEILKLRAN
        LDSL  G+   +L  +A+Y VDEVK KS K+IDVSSW  E+VE+ P+Q+NG+DCGMFM+KY DFYSRGL+L F Q+ MPYFR RTAKEIL+LRA+
Subjt:  LDSL-KGMDSRVLKTLARYFVDEVKNKSGKDIDVSSWAQEFVEDLPEQENGFDCGMFMIKYADFYSRGLNLCFKQEHMPYFRRRTAKEILKLRAN

P59110 Sentrin-specific protease 13.1e-3432.03Show/hide
Query:  SAYKKLLQTVERRTSRLKSLDFEIE---LNEKRRSILQS-----RTPKKKPLDEIP-----------------QELFTPLTKEEEVQVERALSTNRRRVL
        S Y    +   RR    K+L  +++   L E+  ++L S     R P +K   EIP                 ++ F  +T+E E +++     N  +  
Subjt:  SAYKKLLQTVERRTSRLKSLDFEIE---LNEKRRSILQS-----RTPKKKPLDEIP-----------------QELFTPLTKEEEVQVERALSTNRRRVL

Query:  VTHENSNIEITGETLQCLRPAAWLNDEVINLYLELLKERERREPEKYLKCHFFSTFFYKKLNGRNGYDYGSVRKWTNPKKLKYELIDCDKIFVPIHREIH
        V  E   + IT + +Q L    WLNDE+IN Y+ +L ER +   + +   H F+TFF+ KL       Y +V++WT     K ++   D + VPIH  +H
Subjt:  VTHENSNIEITGETLQCLRPAAWLNDEVINLYLELLKERERREPEKYLKCHFFSTFFYKKLNGRNGYDYGSVRKWTNPKKLKYELIDCDKIFVPIHREIH

Query:  WCLAVINKKEKKFQYLDSLKGMDSRVLKTLARYFVDEVKNKSGKDIDVSSWA--QEFVEDLPEQENGFDCGMFMIKYADFYSRGLNLCFKQEHMPYFRRR
        WCLAV++ + K   Y DS+ G+++   + L +Y   E  +K  K+ D + W    +  +++P+Q NG DCGMF  KYAD  ++   + F Q+HMPYFR+R
Subjt:  WCLAVINKKEKKFQYLDSLKGMDSRVLKTLARYFVDEVKNKSGKDIDVSSWA--QEFVEDLPEQENGFDCGMFMIKYADFYSRGLNLCFKQEHMPYFRRR

Query:  TAKEIL
           EIL
Subjt:  TAKEIL

Q8GYL3 Ubiquitin-like-specific protease 1A2.8e-10447.63Show/hide
Query:  SSNYTVARLSRYPEETPQLRREVHGPCRIRKFGLWKSFYRHWESKNNCESSEQYAAGNMLSYNYQIAKNRAIGALRSFPKDFINLDSDSETERGASGDSK
        SS      + RYPE    LRR+VH P RI   G  +S                   G  L+ N  + K  A+ +   +  D   +D D E      GD  
Subjt:  SSNYTVARLSRYPEETPQLRREVHGPCRIRKFGLWKSFYRHWESKNNCESSEQYAAGNMLSYNYQIAKNRAIGALRSFPKDFINLDSDSETERGASGDSK

Query:  NEDNIEAIEDDKPDQRPCEVNTMQELDAKMKNVHQPS--TSVVVDLTNADSKV-ENAEAMLGALSLDR---DLSSVSAYKKLLQTVERRTSRLKSLDFEI
            +E I DD    R    N   E+D   +     +   S V  L N   +V E ++A   +L ++R   D++S  AY+K+L++   RTS+LK   F  
Subjt:  NEDNIEAIEDDKPDQRPCEVNTMQELDAKMKNVHQPS--TSVVVDLTNADSKV-ENAEAMLGALSLDR---DLSSVSAYKKLLQTVERRTSRLKSLDFEI

Query:  ELNEKRRSILQSRT----PKKKPLDEIPQELFTPLTKEEEVQVERALSTNRRRVLVTHENSNIEITGETLQCLRPAAWLNDEVINLYLELLKERERREPE
           E+ R++L+S +      ++P++ + +E F PL++EEE  V RA S N   +LVTH+NSNI+ITG+ L+CL+P  WLNDEVINLY+ LLKERE REP+
Subjt:  ELNEKRRSILQSRT----PKKKPLDEIPQELFTPLTKEEEVQVERALSTNRRRVLVTHENSNIEITGETLQCLRPAAWLNDEVINLYLELLKERERREPE

Query:  KYLKCHFFSTFFYKKL-NGRNGYDYGSVRKWTNPKKLKYELIDCDKIFVPIHREIHWCLAVINKKEKKFQYLDSLKGMDSRVLKTLARYFVDEVKNKSGK
        K+LKCHFF+TFF+ KL N   GY+YG+VR+WT+ K+L Y L DCDKIF+PIH  IHW LAVIN K++KFQYLDS KG + ++L  LARYFVDEV++KS  
Subjt:  KYLKCHFFSTFFYKKL-NGRNGYDYGSVRKWTNPKKLKYELIDCDKIFVPIHREIHWCLAVINKKEKKFQYLDSLKGMDSRVLKTLARYFVDEVKNKSGK

Query:  DIDVSSWAQEFVEDLPEQENGFDCGMFMIKYADFYSRGLNLCFKQEHMPYFRRRTAKEILKLRA
        D+DVS W QEFV+DLP Q NGFDCGMFM+KY DFYSRGL+LCF QE MPYFR RTAKEIL+L+A
Subjt:  DIDVSSWAQEFVEDLPEQENGFDCGMFMIKYADFYSRGLNLCFKQEHMPYFRRRTAKEILKLRA

Q94F30 Ubiquitin-like-specific protease ESD41.2e-12651.74Show/hide
Query:  MGARTNNRKRDDECLSLNRSYSS---LKSPDFHVSKKPKVS-ALSLDR-PVSSSNYTVARLSRYPEETPQLRREVHGPCR-IRKFGLWKSFYRHWESKNN
        MGA   NRKR DE  +     S+     SP F  SKK + S A+S D    +SSN T++R+SRYP+    LRRE+H P R I ++G  K       S + 
Subjt:  MGARTNNRKRDDECLSLNRSYSS---LKSPDFHVSKKPKVS-ALSLDR-PVSSSNYTVARLSRYPEETPQLRREVHGPCR-IRKFGLWKSFYRHWESKNN

Query:  CESSEQYAAGNMLSYNYQIAKNRAIGALR--SFPKDFINLDSDSETERGASGDSKNEDNIEAIEDDKPDQRPCEVNTMQELDAKMKNVHQPSTSVVVDLT
        CE        N     Y  AK  A+ ALR  +  KDF++L  + E E   S DS    +++AIE    D            D + KN+    +S V D+ 
Subjt:  CESSEQYAAGNMLSYNYQIAKNRAIGALR--SFPKDFINLDSDSETERGASGDSKNEDNIEAIEDDKPDQRPCEVNTMQELDAKMKNVHQPSTSVVVDLT

Query:  NADS-KVENAEAMLGALSLDRDL----SSVSAYKKLLQTVERRTSRLKSLDFEIELNEKRRSILQSRTPKK-KPLDEIPQELFTPLTKEEEVQVERALS-
          ++ +VE+   ML +LSLDRD+    SS+ AY+KL+Q+ E+R S+L++L FEI LNEK+ S+L+   PK  +   E+P+E F PLT++EE +V RA S 
Subjt:  NADS-KVENAEAMLGALSLDRDL----SSVSAYKKLLQTVERRTSRLKSLDFEIELNEKRRSILQSRTPKK-KPLDEIPQELFTPLTKEEEVQVERALS-

Query:  TNRRRVLVTHENSNIEITGETLQCLRPAAWLNDEVINLYLELLKERERREPEKYLKCHFFSTFFYKKLNGRNGYDYGSVRKWTNPKKLKYELIDCDKIFV
         NRR+VL THENSNI+ITGE LQCL P+AWLNDEVIN+YLELLKERE REP+KYLKCH+F+TFFYKKL   +GY++ +VR+WT  +KL Y LIDCD IFV
Subjt:  TNRRRVLVTHENSNIEITGETLQCLRPAAWLNDEVINLYLELLKERERREPEKYLKCHFFSTFFYKKLNGRNGYDYGSVRKWTNPKKLKYELIDCDKIFV

Query:  PIHREIHWCLAVINKKEKKFQYLDSLKGMDSRVLKTLARYFVDEVKNKSGKDIDVSSWAQEFVEDLPEQENGFDCGMFMIKYADFYSRGLNLCFKQEHMP
        PIHR +HW LAVIN +E K  YLDSL G+D  +L  LA+Y  DE   KSGK ID +SW  EFVEDLP+Q+NG+DCGMFM+KY DF+SRGL LCF QEHMP
Subjt:  PIHREIHWCLAVINKKEKKFQYLDSLKGMDSRVLKTLARYFVDEVKNKSGKDIDVSSWAQEFVEDLPEQENGFDCGMFMIKYADFYSRGLNLCFKQEHMP

Query:  YFRRRTAKEILKLRAN
        YFR RTAKEIL+LRA+
Subjt:  YFRRRTAKEILKLRAN

Arabidopsis top hitse value%identityAlignment
AT1G10570.1 Cysteine proteinases superfamily protein3.9e-1626.29Show/hide
Query:  RRSILQSRTPKKKPLDEI----PQELFTPLTKEEEVQVERALSTNRRRVLVTHENSN--IEITGETLQCLRPAAWLNDEVINLYLELLKERERREPEKYL
        + S LQS + +KK  D++      E  +P+  EE  ++   L  +         +    ++++ + L+CL P  +L   VIN Y+  ++       +   
Subjt:  RRSILQSRTPKKKPLDEI----PQELFTPLTKEEEVQVERALSTNRRRVLVTHENSN--IEITGETLQCLRPAAWLNDEVINLYLELLKERERREPEKYL

Query:  KCHFFSTFFYKKL--------NGRNGYDYGSVRKWTNPKKLKYELIDCDKIFVPIHREIHWCLAVINKKEKKFQ------YLDSLKGMDSRVLKTLARYF
         CHFF+TFFYKKL        N R+ Y +   R+W       ++L     IF+PIH ++HW L +I   +K+ +      +LDSL      ++    + F
Subjt:  KCHFFSTFFYKKL--------NGRNGYDYGSVRKWTNPKKLKYELIDCDKIFVPIHREIHWCLAVINKKEKKFQ------YLDSLKGMDSRVLKTLARYF

Query:  VDEVKNKSGKDI------------DVSSWAQEFVEDLPEQENGFDCGMFMI
        + E  N   +D             D+ +   E    +P+Q+N FDCG+F++
Subjt:  VDEVKNKSGKDI------------DVSSWAQEFVEDLPEQENGFDCGMFMI

AT3G06910.1 UB-like protease 1A2.0e-10547.63Show/hide
Query:  SSNYTVARLSRYPEETPQLRREVHGPCRIRKFGLWKSFYRHWESKNNCESSEQYAAGNMLSYNYQIAKNRAIGALRSFPKDFINLDSDSETERGASGDSK
        SS      + RYPE    LRR+VH P RI   G  +S                   G  L+ N  + K  A+ +   +  D   +D D E      GD  
Subjt:  SSNYTVARLSRYPEETPQLRREVHGPCRIRKFGLWKSFYRHWESKNNCESSEQYAAGNMLSYNYQIAKNRAIGALRSFPKDFINLDSDSETERGASGDSK

Query:  NEDNIEAIEDDKPDQRPCEVNTMQELDAKMKNVHQPS--TSVVVDLTNADSKV-ENAEAMLGALSLDR---DLSSVSAYKKLLQTVERRTSRLKSLDFEI
            +E I DD    R    N   E+D   +     +   S V  L N   +V E ++A   +L ++R   D++S  AY+K+L++   RTS+LK   F  
Subjt:  NEDNIEAIEDDKPDQRPCEVNTMQELDAKMKNVHQPS--TSVVVDLTNADSKV-ENAEAMLGALSLDR---DLSSVSAYKKLLQTVERRTSRLKSLDFEI

Query:  ELNEKRRSILQSRT----PKKKPLDEIPQELFTPLTKEEEVQVERALSTNRRRVLVTHENSNIEITGETLQCLRPAAWLNDEVINLYLELLKERERREPE
           E+ R++L+S +      ++P++ + +E F PL++EEE  V RA S N   +LVTH+NSNI+ITG+ L+CL+P  WLNDEVINLY+ LLKERE REP+
Subjt:  ELNEKRRSILQSRT----PKKKPLDEIPQELFTPLTKEEEVQVERALSTNRRRVLVTHENSNIEITGETLQCLRPAAWLNDEVINLYLELLKERERREPE

Query:  KYLKCHFFSTFFYKKL-NGRNGYDYGSVRKWTNPKKLKYELIDCDKIFVPIHREIHWCLAVINKKEKKFQYLDSLKGMDSRVLKTLARYFVDEVKNKSGK
        K+LKCHFF+TFF+ KL N   GY+YG+VR+WT+ K+L Y L DCDKIF+PIH  IHW LAVIN K++KFQYLDS KG + ++L  LARYFVDEV++KS  
Subjt:  KYLKCHFFSTFFYKKL-NGRNGYDYGSVRKWTNPKKLKYELIDCDKIFVPIHREIHWCLAVINKKEKKFQYLDSLKGMDSRVLKTLARYFVDEVKNKSGK

Query:  DIDVSSWAQEFVEDLPEQENGFDCGMFMIKYADFYSRGLNLCFKQEHMPYFRRRTAKEILKLRA
        D+DVS W QEFV+DLP Q NGFDCGMFM+KY DFYSRGL+LCF QE MPYFR RTAKEIL+L+A
Subjt:  DIDVSSWAQEFVEDLPEQENGFDCGMFMIKYADFYSRGLNLCFKQEHMPYFRRRTAKEILKLRA

AT4G00690.1 UB-like protease 1B4.8e-8355.3Show/hide
Query:  SSVSAYKKLL------QTVERRTSRLKSLDFEIELNEKRRSILQSRTPKKKPLDEIPQELFTPLTKEEEVQVERALS-TNRRRVLVTHENSNIEITGETL
        SSVS  KK        Q V+    R++         +   + L S T  KK L     E F PL +EE   V  ALS  NR+++LV+H+NSNI+I+GETL
Subjt:  SSVSAYKKLL------QTVERRTSRLKSLDFEIELNEKRRSILQSRTPKKKPLDEIPQELFTPLTKEEEVQVERALS-TNRRRVLVTHENSNIEITGETL

Query:  QCLRPAAWLNDEVINLYLELLKERERREPEKYLKCHFFSTFFYKKLNGRNGYDYGSVRKWTNPKKLKYELIDCDKIFVPIHREIHWCLAVINKKEKKFQY
        QCLRP  WLND+V NLYLELLKER+ R+P+KY KCHFF+TFFY KL   +GY+Y +V +WT  +KL Y+LIDCD IFVPIH +IHW L VIN +E+KF Y
Subjt:  QCLRPAAWLNDEVINLYLELLKERERREPEKYLKCHFFSTFFYKKLNGRNGYDYGSVRKWTNPKKLKYELIDCDKIFVPIHREIHWCLAVINKKEKKFQY

Query:  LDSL-KGMDSRVLKTLARYFVDEVKNKSGKDIDVSSWAQEFVEDLPEQENGFDCGMFMIKYADFYSRGLNLCFKQ-------EHMPYFRRRTAKEILKLR
        LDSL  G+   +L  +A+Y VDEVK KS K+IDVSSW  E+VE+ P+Q+NG+DCGMFM+KY DFYSRGL+L F Q       + MPYFR RTAKEIL+LR
Subjt:  LDSL-KGMDSRVLKTLARYFVDEVKNKSGKDIDVSSWAQEFVEDLPEQENGFDCGMFMIKYADFYSRGLNLCFKQ-------EHMPYFRRRTAKEILKLR

Query:  AN
        A+
Subjt:  AN

AT4G15820.1 BEST Arabidopsis thaliana protein match is: embryo defective 1703 (TAIR:AT3G61780.1)6.6e-4031.4Show/hide
Query:  GVETEVLSPVECCPSSTTTDGES-RLSESSDTASLLNFDVANFSLGSFVRFGVYLLALFAFQTICTVWVLDYGNSIKEDKNSDKDLSIRSKSGKEVLLNG
        G+ET    P     S   +D E  ++  SS  ++ +N      S     ++G++L+ +FAFQT+C V  L  G+S K +K  +                 
Subjt:  GVETEVLSPVECCPSSTTTDGES-RLSESSDTASLLNFDVANFSLGSFVRFGVYLLALFAFQTICTVWVLDYGNSIKEDKNSDKDLSIRSKSGKEVLLNG

Query:  NERIVLGNFGSKTNELVYLDESKMRDKIEEIRLLARKARKEEKYQKPDDVGEGDTEGSNVISRARIGIDKEIDARLVRLQKRLNSNKERIPDSPVNFLFK
               +  S+ N LV L++ +M +KI EIR++AR+ARK E  Q+ DD                I I+KEI+ARL  ++KRLNS ++ +    V  L +
Subjt:  NERIVLGNFGSKTNELVYLDESKMRDKIEEIRLLARKARKEEKYQKPDDVGEGDTEGSNVISRARIGIDKEIDARLVRLQKRLNSNKERIPDSPVNFLFK

Query:  SENVEEAAKRNDFNDEEERNKSLIYKKKLQFRNSNGDRMKKPMG-FQGF---------VSNGKKSGSNGKAINK--GVKDAEKRVANEIKYNDPKMSKDD
        S N E               KSL+++KK +F+       K PMG  +GF         +S  +K+G NG A     G K+ E+++   + + D    + +
Subjt:  SENVEEAAKRNDFNDEEERNKSLIYKKKLQFRNSNGDRMKKPMG-FQGF---------VSNGKKSGSNGKAINK--GVKDAEKRVANEIKYNDPKMSKDD

Query:  STNLGSDKSVLVQKNDGTNLDFDMKVSSSKKKSSNGAVQETSSVEISKSPDLKDVMEKRSPSRADLWWMNLPYVLVIFMHRGSEDEEHGGLFTLRIPSKT
             ++     +  +      +MK  S    +S          ++ K   L+   EK+S     LWW+ LPYVL I M    + +   G FTLR  S  
Subjt:  STNLGSDKSVLVQKNDGTNLDFDMKVSSSKKKSSNGAVQETSSVEISKSPDLKDVMEKRSPSRADLWWMNLPYVLVIFMHRGSEDEEHGGLFTLRIPSKT

Query:  RDIEEYTTYTVAFEHHVDANNFCYLLESFFEELDTFTTDVIPLPTKEL-EVIKSHTSKIIVVKKGQLQLYAGQPFSDVEMAFVT
        ++ E   ++ +AFE   DA NF YLLES FE+LD F+ D+ P+ TK+L + + S    +IVV+K QL LYAGQPF DVE A  T
Subjt:  RDIEEYTTYTVAFEHHVDANNFCYLLESFFEELDTFTTDVIPLPTKEL-EVIKSHTSKIIVVKKGQLQLYAGQPFSDVEMAFVT

AT4G15880.1 Cysteine proteinases superfamily protein8.4e-12851.74Show/hide
Query:  MGARTNNRKRDDECLSLNRSYSS---LKSPDFHVSKKPKVS-ALSLDR-PVSSSNYTVARLSRYPEETPQLRREVHGPCR-IRKFGLWKSFYRHWESKNN
        MGA   NRKR DE  +     S+     SP F  SKK + S A+S D    +SSN T++R+SRYP+    LRRE+H P R I ++G  K       S + 
Subjt:  MGARTNNRKRDDECLSLNRSYSS---LKSPDFHVSKKPKVS-ALSLDR-PVSSSNYTVARLSRYPEETPQLRREVHGPCR-IRKFGLWKSFYRHWESKNN

Query:  CESSEQYAAGNMLSYNYQIAKNRAIGALR--SFPKDFINLDSDSETERGASGDSKNEDNIEAIEDDKPDQRPCEVNTMQELDAKMKNVHQPSTSVVVDLT
        CE        N     Y  AK  A+ ALR  +  KDF++L  + E E   S DS    +++AIE    D            D + KN+    +S V D+ 
Subjt:  CESSEQYAAGNMLSYNYQIAKNRAIGALR--SFPKDFINLDSDSETERGASGDSKNEDNIEAIEDDKPDQRPCEVNTMQELDAKMKNVHQPSTSVVVDLT

Query:  NADS-KVENAEAMLGALSLDRDL----SSVSAYKKLLQTVERRTSRLKSLDFEIELNEKRRSILQSRTPKK-KPLDEIPQELFTPLTKEEEVQVERALS-
          ++ +VE+   ML +LSLDRD+    SS+ AY+KL+Q+ E+R S+L++L FEI LNEK+ S+L+   PK  +   E+P+E F PLT++EE +V RA S 
Subjt:  NADS-KVENAEAMLGALSLDRDL----SSVSAYKKLLQTVERRTSRLKSLDFEIELNEKRRSILQSRTPKK-KPLDEIPQELFTPLTKEEEVQVERALS-

Query:  TNRRRVLVTHENSNIEITGETLQCLRPAAWLNDEVINLYLELLKERERREPEKYLKCHFFSTFFYKKLNGRNGYDYGSVRKWTNPKKLKYELIDCDKIFV
         NRR+VL THENSNI+ITGE LQCL P+AWLNDEVIN+YLELLKERE REP+KYLKCH+F+TFFYKKL   +GY++ +VR+WT  +KL Y LIDCD IFV
Subjt:  TNRRRVLVTHENSNIEITGETLQCLRPAAWLNDEVINLYLELLKERERREPEKYLKCHFFSTFFYKKLNGRNGYDYGSVRKWTNPKKLKYELIDCDKIFV

Query:  PIHREIHWCLAVINKKEKKFQYLDSLKGMDSRVLKTLARYFVDEVKNKSGKDIDVSSWAQEFVEDLPEQENGFDCGMFMIKYADFYSRGLNLCFKQEHMP
        PIHR +HW LAVIN +E K  YLDSL G+D  +L  LA+Y  DE   KSGK ID +SW  EFVEDLP+Q+NG+DCGMFM+KY DF+SRGL LCF QEHMP
Subjt:  PIHREIHWCLAVINKKEKKFQYLDSLKGMDSRVLKTLARYFVDEVKNKSGKDIDVSSWAQEFVEDLPEQENGFDCGMFMIKYADFYSRGLNLCFKQEHMP

Query:  YFRRRTAKEILKLRAN
        YFR RTAKEIL+LRA+
Subjt:  YFRRRTAKEILKLRAN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCCTCAGGCCCATCATAGGAAGCAAGCACAAGCAACCCTTGCGCCATTACTTACCAGAAACTCAAACCTCGTTTCACAAGCTTCCTTCCCCGCTGGTTATG
GCGGGTACGTGCGGTTCCGCTGTTTCTTTCTCCATTCATTGCAACAAATTCACCATTTGCAGCACCAAGCCATTGCTTTCAGTTTCAGCTTCCATTTCCATTTCA
TCTCGTTCAAGGCTTACAAGAAGAAAAAATCACTTGCGAATCAAAATCCTCAAAACCCTAACTAAACCTCCTCCGTTCACTGTCTCTCCCATTCCTCCGCAAATC
GACTCTGAAACTCCGATCGTATTGCCGGAAATCTCAGGTTCTGCGGGCGTCGAGACCGAGGTGTTATCTCCGGTGGAATGTTGCCCTTCCTCCACTACCACGGAC
GGTGAATCTCGACTCTCCGAGAGCTCGGACACTGCCTCGTTGCTTAATTTTGACGTTGCCAACTTTTCTTTGGGAAGTTTCGTCAGGTTTGGCGTTTACTTGCTT
GCTCTTTTTGCGTTTCAGACAATCTGTACTGTGTGGGTTTTAGATTATGGTAATTCAATTAAGGAAGATAAGAACTCAGATAAAGATTTGAGTATTAGAAGCAAA
AGCGGAAAAGAAGTGTTGTTGAATGGAAATGAGAGAATCGTTCTTGGAAATTTCGGGTCTAAAACGAACGAGTTGGTTTATTTAGACGAATCGAAGATGAGAGAC
AAAATTGAAGAGATCAGGTTGTTGGCTAGGAAAGCAAGGAAAGAGGAGAAATATCAAAAACCTGATGATGTAGGGGAGGGCGACACGGAGGGTAGCAATGTAATT
TCAAGGGCTAGGATTGGTATTGATAAAGAGATTGATGCTCGACTTGTTAGGTTACAGAAGAGGCTAAATTCTAACAAAGAAAGGATACCGGATTCACCAGTAAAT
TTTTTGTTCAAGTCTGAGAATGTTGAGGAAGCTGCTAAAAGGAACGATTTCAATGATGAAGAAGAACGGAATAAGAGTCTAATATATAAGAAAAAGTTGCAATTC
AGAAACTCTAATGGAGATAGAATGAAGAAGCCTATGGGGTTTCAAGGATTTGTTTCTAATGGTAAAAAAAGTGGCTCAAATGGCAAGGCTATTAACAAGGGAGTG
AAGGATGCTGAAAAGCGAGTAGCTAACGAAATCAAGTATAACGATCCCAAGATGTCCAAAGACGATAGCACAAATTTGGGCAGCGATAAGTCTGTATTGGTACAG
AAAAACGATGGAACCAATTTGGATTTCGATATGAAGGTTTCGAGTTCAAAGAAGAAATCAAGTAATGGTGCCGTTCAGGAGACTTCTTCAGTGGAGATCTCGAAG
TCACCGGATTTAAAAGATGTAATGGAGAAAAGATCTCCTTCGAGGGCTGATTTATGGTGGATGAATCTTCCTTATGTTCTAGTTATTTTTATGCATCGAGGTTCT
GAAGATGAAGAACATGGAGGACTTTTCACCTTAAGGATTCCTTCCAAGACACGGGATATTGAGGAATATACTACATATACAGTTGCTTTTGAACACCATGTTGAT
GCAAATAACTTCTGTTATCTTCTGGAATCATTTTTTGAAGAGCTCGACACTTTCACAACCGACGTCATTCCTCTGCCAACAAAAGAACTCGAGGTCATAAAATCA
CATACAAGTAAAATTATTGTTGTGAAGAAGGGGCAATTGCAGCTCTATGCTGGTCAACCGTTTTCCGATGTCGAGATGGCTTTTGTCACGGACGATCTGTTGCAA
CAAATGCAGCCATGGAATCGTACCGAAGATGGGGAAAATTTGCAACAGAAGATGGAGTTGTTGCAGATATGGATTGTAGGCGTTTCGTTTCAATCTCCGTGTCTA
CTAATGGGCGCCCGAACCAACAACCGTAAGCGCGACGACGAGTGTTTGAGCCTTAATCGTTCGTATTCATCTTTGAAATCACCGGATTTTCACGTCTCCAAGAAA
CCTAAAGTTTCTGCTTTGTCTTTGGATCGGCCAGTTTCCTCATCGAACTATACGGTGGCTAGGCTTTCCCGGTACCCTGAAGAAACTCCGCAATTGCGCCGAGAA
GTACACGGCCCTTGCAGAATTCGCAAATTCGGGCTTTGGAAGAGCTTCTATAGGCATTGGGAGTCAAAAAATAATTGCGAGTCGTCGGAACAGTACGCGGCGGGA
AATATGTTATCGTACAACTACCAGATTGCGAAGAATCGTGCAATTGGTGCATTGCGGTCTTTCCCGAAGGATTTCATTAACTTGGATTCAGACTCCGAAACCGAA
AGAGGCGCTTCTGGAGATTCGAAGAACGAAGACAACATCGAGGCAATTGAAGATGATAAACCGGACCAGAGGCCTTGTGAGGTTAATACAATGCAGGAATTGGAT
GCGAAGATGAAGAATGTTCATCAGCCGTCGACTTCGGTAGTTGTAGATTTGACAAATGCTGATTCGAAGGTCGAAAATGCTGAGGCGATGCTTGGTGCTTTGTCA
CTGGATCGAGACTTGTCCAGTGTTTCTGCTTACAAGAAGTTGCTTCAGACTGTGGAAAGGCGGACATCCAGATTGAAAAGCTTGGATTTCGAAATCGAGTTGAAC
GAGAAGCGCCGATCGATTCTTCAATCCCGAACTCCAAAGAAGAAGCCTCTTGATGAGATTCCACAGGAACTTTTTACTCCTCTTACAAAGGAAGAGGAGGTGCAG
GTTGAACGTGCGCTCTCTACGAACCGGAGGAGAGTTTTGGTTACTCATGAGAATTCAAATATTGAGATTACTGGGGAAACTTTGCAGTGTCTGAGACCAGCTGCC
TGGTTAAACGACGAGGTGATTAATTTGTATCTTGAGTTGCTGAAAGAAAGGGAAAGAAGGGAGCCAGAGAAGTATTTGAAATGCCATTTCTTTAGTACCTTCTTC
TATAAAAAGTTAAATGGAAGGAATGGCTACGATTACGGATCGGTTAGAAAATGGACCAACCCAAAGAAGTTGAAGTATGAACTCATTGATTGTGATAAAATATTT
GTCCCCATCCATAGAGAAATACACTGGTGCCTAGCAGTAATCAACAAAAAAGAGAAGAAGTTTCAGTATCTTGATTCTCTCAAGGGAATGGATTCTCGTGTTTTA
AAGACATTGGCGAGATATTTTGTAGATGAGGTGAAGAACAAGAGTGGAAAAGATATTGATGTAAGTTCTTGGGCACAAGAATTTGTCGAAGACCTTCCCGAGCAA
GAAAATGGGTTTGATTGTGGCATGTTTATGATCAAGTATGCTGATTTTTATAGCAGAGGCTTGAATCTCTGTTTCAAGCAGGAACACATGCCTTATTTCCGACGT
AGAACAGCCAAGGAAATTTTGAAATTGAGAGCTAACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGCCTCAGGCCCATCATAGGAAGCAAGCACAAGCAACCCTTGCGCCATTACTTACCAGAAACTCAAACCTCGTTTCACAAGCTTCCTTCCCCGCTGGTTATG
GCGGGTACGTGCGGTTCCGCTGTTTCTTTCTCCATTCATTGCAACAAATTCACCATTTGCAGCACCAAGCCATTGCTTTCAGTTTCAGCTTCCATTTCCATTTCA
TCTCGTTCAAGGCTTACAAGAAGAAAAAATCACTTGCGAATCAAAATCCTCAAAACCCTAACTAAACCTCCTCCGTTCACTGTCTCTCCCATTCCTCCGCAAATC
GACTCTGAAACTCCGATCGTATTGCCGGAAATCTCAGGTTCTGCGGGCGTCGAGACCGAGGTGTTATCTCCGGTGGAATGTTGCCCTTCCTCCACTACCACGGAC
GGTGAATCTCGACTCTCCGAGAGCTCGGACACTGCCTCGTTGCTTAATTTTGACGTTGCCAACTTTTCTTTGGGAAGTTTCGTCAGGTTTGGCGTTTACTTGCTT
GCTCTTTTTGCGTTTCAGACAATCTGTACTGTGTGGGTTTTAGATTATGGTAATTCAATTAAGGAAGATAAGAACTCAGATAAAGATTTGAGTATTAGAAGCAAA
AGCGGAAAAGAAGTGTTGTTGAATGGAAATGAGAGAATCGTTCTTGGAAATTTCGGGTCTAAAACGAACGAGTTGGTTTATTTAGACGAATCGAAGATGAGAGAC
AAAATTGAAGAGATCAGGTTGTTGGCTAGGAAAGCAAGGAAAGAGGAGAAATATCAAAAACCTGATGATGTAGGGGAGGGCGACACGGAGGGTAGCAATGTAATT
TCAAGGGCTAGGATTGGTATTGATAAAGAGATTGATGCTCGACTTGTTAGGTTACAGAAGAGGCTAAATTCTAACAAAGAAAGGATACCGGATTCACCAGTAAAT
TTTTTGTTCAAGTCTGAGAATGTTGAGGAAGCTGCTAAAAGGAACGATTTCAATGATGAAGAAGAACGGAATAAGAGTCTAATATATAAGAAAAAGTTGCAATTC
AGAAACTCTAATGGAGATAGAATGAAGAAGCCTATGGGGTTTCAAGGATTTGTTTCTAATGGTAAAAAAAGTGGCTCAAATGGCAAGGCTATTAACAAGGGAGTG
AAGGATGCTGAAAAGCGAGTAGCTAACGAAATCAAGTATAACGATCCCAAGATGTCCAAAGACGATAGCACAAATTTGGGCAGCGATAAGTCTGTATTGGTACAG
AAAAACGATGGAACCAATTTGGATTTCGATATGAAGGTTTCGAGTTCAAAGAAGAAATCAAGTAATGGTGCCGTTCAGGAGACTTCTTCAGTGGAGATCTCGAAG
TCACCGGATTTAAAAGATGTAATGGAGAAAAGATCTCCTTCGAGGGCTGATTTATGGTGGATGAATCTTCCTTATGTTCTAGTTATTTTTATGCATCGAGGTTCT
GAAGATGAAGAACATGGAGGACTTTTCACCTTAAGGATTCCTTCCAAGACACGGGATATTGAGGAATATACTACATATACAGTTGCTTTTGAACACCATGTTGAT
GCAAATAACTTCTGTTATCTTCTGGAATCATTTTTTGAAGAGCTCGACACTTTCACAACCGACGTCATTCCTCTGCCAACAAAAGAACTCGAGGTCATAAAATCA
CATACAAGTAAAATTATTGTTGTGAAGAAGGGGCAATTGCAGCTCTATGCTGGTCAACCGTTTTCCGATGTCGAGATGGCTTTTGTCACGGACGATCTGTTGCAA
CAAATGCAGCCATGGAATCGTACCGAAGATGGGGAAAATTTGCAACAGAAGATGGAGTTGTTGCAGATATGGATTGTAGGCGTTTCGTTTCAATCTCCGTGTCTA
CTAATGGGCGCCCGAACCAACAACCGTAAGCGCGACGACGAGTGTTTGAGCCTTAATCGTTCGTATTCATCTTTGAAATCACCGGATTTTCACGTCTCCAAGAAA
CCTAAAGTTTCTGCTTTGTCTTTGGATCGGCCAGTTTCCTCATCGAACTATACGGTGGCTAGGCTTTCCCGGTACCCTGAAGAAACTCCGCAATTGCGCCGAGAA
GTACACGGCCCTTGCAGAATTCGCAAATTCGGGCTTTGGAAGAGCTTCTATAGGCATTGGGAGTCAAAAAATAATTGCGAGTCGTCGGAACAGTACGCGGCGGGA
AATATGTTATCGTACAACTACCAGATTGCGAAGAATCGTGCAATTGGTGCATTGCGGTCTTTCCCGAAGGATTTCATTAACTTGGATTCAGACTCCGAAACCGAA
AGAGGCGCTTCTGGAGATTCGAAGAACGAAGACAACATCGAGGCAATTGAAGATGATAAACCGGACCAGAGGCCTTGTGAGGTTAATACAATGCAGGAATTGGAT
GCGAAGATGAAGAATGTTCATCAGCCGTCGACTTCGGTAGTTGTAGATTTGACAAATGCTGATTCGAAGGTCGAAAATGCTGAGGCGATGCTTGGTGCTTTGTCA
CTGGATCGAGACTTGTCCAGTGTTTCTGCTTACAAGAAGTTGCTTCAGACTGTGGAAAGGCGGACATCCAGATTGAAAAGCTTGGATTTCGAAATCGAGTTGAAC
GAGAAGCGCCGATCGATTCTTCAATCCCGAACTCCAAAGAAGAAGCCTCTTGATGAGATTCCACAGGAACTTTTTACTCCTCTTACAAAGGAAGAGGAGGTGCAG
GTTGAACGTGCGCTCTCTACGAACCGGAGGAGAGTTTTGGTTACTCATGAGAATTCAAATATTGAGATTACTGGGGAAACTTTGCAGTGTCTGAGACCAGCTGCC
TGGTTAAACGACGAGGTGATTAATTTGTATCTTGAGTTGCTGAAAGAAAGGGAAAGAAGGGAGCCAGAGAAGTATTTGAAATGCCATTTCTTTAGTACCTTCTTC
TATAAAAAGTTAAATGGAAGGAATGGCTACGATTACGGATCGGTTAGAAAATGGACCAACCCAAAGAAGTTGAAGTATGAACTCATTGATTGTGATAAAATATTT
GTCCCCATCCATAGAGAAATACACTGGTGCCTAGCAGTAATCAACAAAAAAGAGAAGAAGTTTCAGTATCTTGATTCTCTCAAGGGAATGGATTCTCGTGTTTTA
AAGACATTGGCGAGATATTTTGTAGATGAGGTGAAGAACAAGAGTGGAAAAGATATTGATGTAAGTTCTTGGGCACAAGAATTTGTCGAAGACCTTCCCGAGCAA
GAAAATGGGTTTGATTGTGGCATGTTTATGATCAAGTATGCTGATTTTTATAGCAGAGGCTTGAATCTCTGTTTCAAGCAGGAACACATGCCTTATTTCCGACGT
AGAACAGCCAAGGAAATTTTGAAATTGAGAGCTAACTGATTGATCACACATAAAGTCTCGAAATTTTCTTCGTGTTAGCAGCGTATAGATCCCGAGATTCTCTTG
CACGTTTGAGTGGCAGCAGCTCTAAATTGTATGTGTTTTGGATTGGTTTCGAGTAGAGAATATAGATGATGAGTACAAACAAATTTATACGGGAGAGGTTGCAGG
GCGAAGCATCATCTAGTATTTTTGGTAAGTGAATACTCTCCCCATCTCGACGTATTTATTTTAAATGGTTTAGTGATTCATTTCAAACTCTCTGTATTATTGAGT
TATTTCTCCCTCCCCCCCCCC
Protein sequenceShow/hide protein sequence
MGLRPIIGSKHKQPLRHYLPETQTSFHKLPSPLVMAGTCGSAVSFSIHCNKFTICSTKPLLSVSASISISSRSRLTRRKNHLRIKILKTLTKPPPFTVSPIPPQI
DSETPIVLPEISGSAGVETEVLSPVECCPSSTTTDGESRLSESSDTASLLNFDVANFSLGSFVRFGVYLLALFAFQTICTVWVLDYGNSIKEDKNSDKDLSIRSK
SGKEVLLNGNERIVLGNFGSKTNELVYLDESKMRDKIEEIRLLARKARKEEKYQKPDDVGEGDTEGSNVISRARIGIDKEIDARLVRLQKRLNSNKERIPDSPVN
FLFKSENVEEAAKRNDFNDEEERNKSLIYKKKLQFRNSNGDRMKKPMGFQGFVSNGKKSGSNGKAINKGVKDAEKRVANEIKYNDPKMSKDDSTNLGSDKSVLVQ
KNDGTNLDFDMKVSSSKKKSSNGAVQETSSVEISKSPDLKDVMEKRSPSRADLWWMNLPYVLVIFMHRGSEDEEHGGLFTLRIPSKTRDIEEYTTYTVAFEHHVD
ANNFCYLLESFFEELDTFTTDVIPLPTKELEVIKSHTSKIIVVKKGQLQLYAGQPFSDVEMAFVTDDLLQQMQPWNRTEDGENLQQKMELLQIWIVGVSFQSPCL
LMGARTNNRKRDDECLSLNRSYSSLKSPDFHVSKKPKVSALSLDRPVSSSNYTVARLSRYPEETPQLRREVHGPCRIRKFGLWKSFYRHWESKNNCESSEQYAAG
NMLSYNYQIAKNRAIGALRSFPKDFINLDSDSETERGASGDSKNEDNIEAIEDDKPDQRPCEVNTMQELDAKMKNVHQPSTSVVVDLTNADSKVENAEAMLGALS
LDRDLSSVSAYKKLLQTVERRTSRLKSLDFEIELNEKRRSILQSRTPKKKPLDEIPQELFTPLTKEEEVQVERALSTNRRRVLVTHENSNIEITGETLQCLRPAA
WLNDEVINLYLELLKERERREPEKYLKCHFFSTFFYKKLNGRNGYDYGSVRKWTNPKKLKYELIDCDKIFVPIHREIHWCLAVINKKEKKFQYLDSLKGMDSRVL
KTLARYFVDEVKNKSGKDIDVSSWAQEFVEDLPEQENGFDCGMFMIKYADFYSRGLNLCFKQEHMPYFRRRTAKEILKLRAN