CuGenDBv2

HG10020372 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10020372
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Ubiquitinyl hydrolase 1
Genome location: Chr04:31254375..31257085
RNA-Seq Expression: HG10020372
Synteny: HG10020372
Gene Ontology terms:
    GO:0016579 - protein deubiquitination (biological process)
    GO:0005634 - nucleus (cellular component)
    GO:0004843 - thiol-dependent ubiquitin-specific protease activity (molecular function)
InterPro domains:
    IPR006155 - Josephin domain
    IPR033865 - Machado-Joseph disease protein


Homology
GenBank top hits (e value, % identity, alignment)
KAG6580493.1 Ataxin-3-like protein, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.9e-184; identity: 81.22%)
Query:  MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPV
        MD A NGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALA DLDRKERQMML+GSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVI LNSP 
Subjt:  MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPV

Query:  AEPAQIDPELENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSSEASNGYGQWLSPEDAERITK
        AE AQIDPELE+AFICHLQ+HWFCIRKVNGEWYNFDSLYAAPQ+LSKFYLSAYLDSLKGFGWSIFIVRGNFP DFPIS SEASNGYGQWL+PEDA+RITK
Subjt:  AEPAQIDPELENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSSEASNGYGQWLSPEDAERITK

Query:  SCNSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDEDLKAAIAASLMDSSAIMAAGVSNPQNEPAVSSTQAASPQNVPVVSLETAKTKDVPVVSPKAS
        SCNSTQA  PPQ +NWT+ QDTFLS G+ EML+D+EDED KAAIAASLMDS A+MAAGV+NP NEPAVSSTQAASPQN            ++P V PKA 
Subjt:  SCNSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDEDLKAAIAASLMDSSAIMAAGVSNPQNEPAVSSTQAASPQNVPVVSLETAKTKDVPVVSPKAS

Query:  TLQEVPAVSPKPATLQDVPFISTKPASPQNVPFVSPEASTSQDVRALSPDAAATPQDLHAISTAKAATPNIESMVCTEVAVHQNESGNEAVGNADAAFCE
        T Q+VPAV+PK AT +DVP +STKPASPQ++P V+PEAS SQ+VRALSPDAAAT   LHA S AKA TPN    VCTEVAVHQNE  NE+VGNADAAF E
Subjt:  TLQEVPAVSPKPATLQDVPFISTKPASPQNVPFVSPEASTSQDVRALSPDAAATPQDLHAISTAKAATPNIESMVCTEVAVHQNESGNEAVGNADAAFCE

Query:  SGPADNAECGISSPRKKISRTNEGTA
        SG ADNAEC +SSPRKKISRTN G A
Subjt:  SGPADNAECGISSPRKKISRTNEGTA

XP_004137739.1 ataxin-3 homolog [Cucumis sativus] (e value: 1.3e-213; identity: 89.67%)
Query:  MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPV
        MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPV
Subjt:  MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPV

Query:  AEPAQIDPELENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSSEASNGYGQWLSPEDAERITK
        AEPAQIDP+LENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSSEASNGYGQWLSPEDAERITK
Subjt:  AEPAQIDPELENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSSEASNGYGQWLSPEDAERITK

Query:  SCNSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDEDLKAAIAASLMDSSAIMAAGVSNPQNEPAVSSTQAASPQNVPVVSLETAKTKDVPVVSPKAS
        SCNSTQAPPPPQR NWTEQQDTFLSSGETEMLIDMEDEDLKAAIAASLMDSSA+MAAGV+NP NEP VSSTQA SPQNVP V+LETA T+DV  VSP AS
Subjt:  SCNSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDEDLKAAIAASLMDSSAIMAAGVSNPQNEPAVSSTQAASPQNVPVVSLETAKTKDVPVVSPKAS

Query:  TLQEVPAVSPKPATLQDVPFISTKPASPQNVPFVSPEASTSQDVRALSPDAAATPQDLHAISTAKAATPNIESMVCTEVAVHQNESGNEAVGNADAAFCE
         L++VPAVSP+ ATLQDVP IS K ASPQN P VSPEASTSQDV  LSP+AA  PQDLH +STAKAA P  +S VCTEV+VHQNESGNE+VGNAD AFC+
Subjt:  TLQEVPAVSPKPATLQDVPFISTKPASPQNVPFVSPEASTSQDVRALSPDAAATPQDLHAISTAKAATPNIESMVCTEVAVHQNESGNEAVGNADAAFCE

Query:  SGPADNAECGISSPRKKISRTNEGTA
        SG ADN EC +SSPRKKISRTNEGTA
Subjt:  SGPADNAECGISSPRKKISRTNEGTA

XP_008442472.1 PREDICTED: ataxin-3 homolog [Cucumis melo] (e value: 3.2e-215; identity: 90.85%)
Query:  MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPV
        MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPV
Subjt:  MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPV

Query:  AEPAQIDPELENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSSEASNGYGQWLSPEDAERITK
        AEPAQIDP+LENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSSEASNGYGQWLSPEDAERITK
Subjt:  AEPAQIDPELENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSSEASNGYGQWLSPEDAERITK

Query:  SCNSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDEDLKAAIAASLMDSSAIMAAGVSNPQNEPAVSSTQAASPQNVPVVSLETAKTKDVPVVSPKAS
        SCNSTQAPPPPQR NWTEQQDTFLSSGETEMLIDMEDEDLKAAIAASL+DSSA+MAAGV+NP NEP VSSTQA SPQNVP VSLETA T+DV  VSP AS
Subjt:  SCNSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDEDLKAAIAASLMDSSAIMAAGVSNPQNEPAVSSTQAASPQNVPVVSLETAKTKDVPVVSPKAS

Query:  TLQEVPAVSPKPATLQDVPFISTKPASPQNVPFVSPEASTSQDVRALSPDAAATPQDLHAISTAKAATPNIESMVCTEVAVHQNESGNEAVGNADAAFCE
         LQ+V AVS K ATLQDVP IS K A PQNVP VSPEASTSQDVRAL PDAA  PQDLHA+ST KAATPN +S VCTEV VHQNESGNE+VGNAD AFCE
Subjt:  TLQEVPAVSPKPATLQDVPFISTKPASPQNVPFVSPEASTSQDVRALSPDAAATPQDLHAISTAKAATPNIESMVCTEVAVHQNESGNEAVGNADAAFCE

Query:  SGPADNAECGISSPRKKISRTNEGTA
        SG ADN EC ISSPRKKISRTNEG A
Subjt:  SGPADNAECGISSPRKKISRTNEGTA

XP_023527477.1 ataxin-3 homolog [Cucurbita pepo subsp. pepo] (e value: 2.9e-184; identity: 80.99%)
Query:  MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPV
        MD A NGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALA DLDRKERQMML+GSTTGDFLS+ESHNVSLDGDFSIQVLQKALEVWDLQVI LNSP 
Subjt:  MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPV

Query:  AEPAQIDPELENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSSEASNGYGQWLSPEDAERITK
        AE AQIDPELE+AFICHLQ+HWFCIRKVNGEWYNFDSLYAAPQ+LSKFYLSAYLDSLKGFGWSIFIVRGNFP DFPIS SEASNGYGQWL+PEDA+RITK
Subjt:  AEPAQIDPELENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSSEASNGYGQWLSPEDAERITK

Query:  SCNSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDEDLKAAIAASLMDSSAIMAAGVSNPQNEPAVSSTQAASPQNVPVVSLETAKTKDVPVVSPKAS
        SCNSTQA  PPQ +NWT+ QDTFLS G+ EML+D+EDED KAAIAASLMDS A+MAAGV+NP NEPAVSSTQAASPQN            ++P V PKA 
Subjt:  SCNSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDEDLKAAIAASLMDSSAIMAAGVSNPQNEPAVSSTQAASPQNVPVVSLETAKTKDVPVVSPKAS

Query:  TLQEVPAVSPKPATLQDVPFISTKPASPQNVPFVSPEASTSQDVRALSPDAAATPQDLHAISTAKAATPNIESMVCTEVAVHQNESGNEAVGNADAAFCE
        T Q+VPAV+PK ATL+DVP +STKPASPQ++P V+PEAS SQ+VRALSPDAAAT   LHA S AKA TPN    VCTEVAVH+NE  NE+VGNADAAF E
Subjt:  TLQEVPAVSPKPATLQDVPFISTKPASPQNVPFVSPEASTSQDVRALSPDAAATPQDLHAISTAKAATPNIESMVCTEVAVHQNESGNEAVGNADAAFCE

Query:  SGPADNAECGISSPRKKISRTNEGTA
        SG ADNAEC +SSPRKKISRTN G A
Subjt:  SGPADNAECGISSPRKKISRTNEGTA

XP_038904645.1 ataxin-3 homolog [Benincasa hispida] (e value: 3.0e-221; identity: 92.25%)
Query:  MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPV
        MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPV
Subjt:  MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPV

Query:  AEPAQIDPELENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSSEASNGYGQWLSPEDAERITK
        AEPAQIDPELENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSSEASNGYGQWLSPEDAERITK
Subjt:  AEPAQIDPELENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSSEASNGYGQWLSPEDAERITK

Query:  SCNSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDEDLKAAIAASLMDSSAIMAAGVSNPQNEPAVSSTQAASPQNVPVVSLETAKTKDVPVVSPKAS
        SCNSTQAPPPPQRVNWTEQ DTFLSSGETEMLIDMEDEDLKAAIAASL+DSSA+MAAG +N QNEPAVSSTQAASPQNVPVVSLETAKT+D P+VS KAS
Subjt:  SCNSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDEDLKAAIAASLMDSSAIMAAGVSNPQNEPAVSSTQAASPQNVPVVSLETAKTKDVPVVSPKAS

Query:  TLQEVPAVSPKPATLQDVPFISTKPASPQNVPFVSPEASTSQDVRALSPDAAATPQDLHAISTAKAATPNIESMVCTEVAVHQNESGNEAVGNADAAFCE
        TLQ+VPAV PK ATLQDVP +S KP+SPQNVPFVSP+ASTSQDVR LSPDA ATPQDLHA+ST K ATPN +  VCTEV+VHQNESGNE+VGNA+AAF E
Subjt:  TLQEVPAVSPKPATLQDVPFISTKPASPQNVPFVSPEASTSQDVRALSPDAAATPQDLHAISTAKAATPNIESMVCTEVAVHQNESGNEAVGNADAAFCE

Query:  SGPADNAECGISSPRKKISRTNEGTA
        SGPADNAECG+SSPRKKISRT+EGTA
Subjt:  SGPADNAECGISSPRKKISRTNEGTA

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L9Z8 Ubiquitinyl hydrolase 1 (e value: 6.5e-214; identity: 89.67%)
Query:  MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPV
        MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPV
Subjt:  MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPV

Query:  AEPAQIDPELENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSSEASNGYGQWLSPEDAERITK
        AEPAQIDP+LENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSSEASNGYGQWLSPEDAERITK
Subjt:  AEPAQIDPELENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSSEASNGYGQWLSPEDAERITK

Query:  SCNSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDEDLKAAIAASLMDSSAIMAAGVSNPQNEPAVSSTQAASPQNVPVVSLETAKTKDVPVVSPKAS
        SCNSTQAPPPPQR NWTEQQDTFLSSGETEMLIDMEDEDLKAAIAASLMDSSA+MAAGV+NP NEP VSSTQA SPQNVP V+LETA T+DV  VSP AS
Subjt:  SCNSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDEDLKAAIAASLMDSSAIMAAGVSNPQNEPAVSSTQAASPQNVPVVSLETAKTKDVPVVSPKAS

Query:  TLQEVPAVSPKPATLQDVPFISTKPASPQNVPFVSPEASTSQDVRALSPDAAATPQDLHAISTAKAATPNIESMVCTEVAVHQNESGNEAVGNADAAFCE
         L++VPAVSP+ ATLQDVP IS K ASPQN P VSPEASTSQDV  LSP+AA  PQDLH +STAKAA P  +S VCTEV+VHQNESGNE+VGNAD AFC+
Subjt:  TLQEVPAVSPKPATLQDVPFISTKPASPQNVPFVSPEASTSQDVRALSPDAAATPQDLHAISTAKAATPNIESMVCTEVAVHQNESGNEAVGNADAAFCE

Query:  SGPADNAECGISSPRKKISRTNEGTA
        SG ADN EC +SSPRKKISRTNEGTA
Subjt:  SGPADNAECGISSPRKKISRTNEGTA

A0A1S3B6I4 Ubiquitinyl hydrolase 1 (e value: 1.6e-215; identity: 90.85%)
Query:  MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPV
        MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPV
Subjt:  MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPV

Query:  AEPAQIDPELENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSSEASNGYGQWLSPEDAERITK
        AEPAQIDP+LENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSSEASNGYGQWLSPEDAERITK
Subjt:  AEPAQIDPELENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSSEASNGYGQWLSPEDAERITK

Query:  SCNSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDEDLKAAIAASLMDSSAIMAAGVSNPQNEPAVSSTQAASPQNVPVVSLETAKTKDVPVVSPKAS
        SCNSTQAPPPPQR NWTEQQDTFLSSGETEMLIDMEDEDLKAAIAASL+DSSA+MAAGV+NP NEP VSSTQA SPQNVP VSLETA T+DV  VSP AS
Subjt:  SCNSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDEDLKAAIAASLMDSSAIMAAGVSNPQNEPAVSSTQAASPQNVPVVSLETAKTKDVPVVSPKAS

Query:  TLQEVPAVSPKPATLQDVPFISTKPASPQNVPFVSPEASTSQDVRALSPDAAATPQDLHAISTAKAATPNIESMVCTEVAVHQNESGNEAVGNADAAFCE
         LQ+V AVS K ATLQDVP IS K A PQNVP VSPEASTSQDVRAL PDAA  PQDLHA+ST KAATPN +S VCTEV VHQNESGNE+VGNAD AFCE
Subjt:  TLQEVPAVSPKPATLQDVPFISTKPASPQNVPFVSPEASTSQDVRALSPDAAATPQDLHAISTAKAATPNIESMVCTEVAVHQNESGNEAVGNADAAFCE

Query:  SGPADNAECGISSPRKKISRTNEGTA
        SG ADN EC ISSPRKKISRTNEG A
Subjt:  SGPADNAECGISSPRKKISRTNEGTA

A0A5D3DN96 Ubiquitinyl hydrolase 1 (e value: 1.6e-215; identity: 90.85%)
Query:  MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPV
        MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPV
Subjt:  MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPV

Query:  AEPAQIDPELENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSSEASNGYGQWLSPEDAERITK
        AEPAQIDP+LENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSSEASNGYGQWLSPEDAERITK
Subjt:  AEPAQIDPELENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSSEASNGYGQWLSPEDAERITK

Query:  SCNSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDEDLKAAIAASLMDSSAIMAAGVSNPQNEPAVSSTQAASPQNVPVVSLETAKTKDVPVVSPKAS
        SCNSTQAPPPPQR NWTEQQDTFLSSGETEMLIDMEDEDLKAAIAASL+DSSA+MAAGV+NP NEP VSSTQA SPQNVP VSLETA T+DV  VSP AS
Subjt:  SCNSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDEDLKAAIAASLMDSSAIMAAGVSNPQNEPAVSSTQAASPQNVPVVSLETAKTKDVPVVSPKAS

Query:  TLQEVPAVSPKPATLQDVPFISTKPASPQNVPFVSPEASTSQDVRALSPDAAATPQDLHAISTAKAATPNIESMVCTEVAVHQNESGNEAVGNADAAFCE
         LQ+V AVS K ATLQDVP IS K A PQNVP VSPEASTSQDVRAL PDAA  PQDLHA+ST KAATPN +S VCTEV VHQNESGNE+VGNAD AFCE
Subjt:  TLQEVPAVSPKPATLQDVPFISTKPASPQNVPFVSPEASTSQDVRALSPDAAATPQDLHAISTAKAATPNIESMVCTEVAVHQNESGNEAVGNADAAFCE

Query:  SGPADNAECGISSPRKKISRTNEGTA
        SG ADN EC ISSPRKKISRTNEG A
Subjt:  SGPADNAECGISSPRKKISRTNEGTA

A0A6J1F1U3 Ubiquitinyl hydrolase 1 (e value: 4.6e-183; identity: 80.75%)
Query:  MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPV
        MD A NGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALA DLDRKERQMML+GSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVI LNSP 
Subjt:  MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPV

Query:  AEPAQIDPELENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSSEASNGYGQWLSPEDAERITK
        AE AQIDPELE+AFICHLQ+HWFCIRKVNGEWYNFDSLYAAPQ+LSKFYLSAYLDSLKGFGWSIFIVRGNFP DFPIS SEASNGYGQWL+PEDA+RITK
Subjt:  AEPAQIDPELENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSSEASNGYGQWLSPEDAERITK

Query:  SCNSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDEDLKAAIAASLMDSSAIMAAGVSNPQNEPAVSSTQAASPQNVPVVSLETAKTKDVPVVSPKAS
        SCNSTQA  PPQ +NWT+  DTFLS G+ EML+D+EDED KAAIAASLMDS A+MAAGV+NP NEPAVSSTQAASPQN            ++P V PKA 
Subjt:  SCNSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDEDLKAAIAASLMDSSAIMAAGVSNPQNEPAVSSTQAASPQNVPVVSLETAKTKDVPVVSPKAS

Query:  TLQEVPAVSPKPATLQDVPFISTKPASPQNVPFVSPEASTSQDVRALSPDAAATPQDLHAISTAKAATPNIESMVCTEVAVHQNESGNEAVGNADAAFCE
        T Q+VPAV+ K ATL+DVP ISTKPASP+++P V+PEAS SQ+VRALSPDAAAT   LH  S AKA TPN    VCTEVAVHQNE  NE+VGNADAAF E
Subjt:  TLQEVPAVSPKPATLQDVPFISTKPASPQNVPFVSPEASTSQDVRALSPDAAATPQDLHAISTAKAATPNIESMVCTEVAVHQNESGNEAVGNADAAFCE

Query:  SGPADNAECGISSPRKKISRTNEGTA
        SG ADNAEC +SSPRKKISRTN G A
Subjt:  SGPADNAECGISSPRKKISRTNEGTA

A0A6J1J5J2 Ubiquitinyl hydrolase 1 (e value: 1.9e-184; identity: 81.47%)
Query:  MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPV
        MD A NGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALA DLDRKERQMML+GSTTGDFLSE+SHNVSLDGDFSIQVLQKALEVWDLQVI LNSP 
Subjt:  MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPV

Query:  AEPAQIDPELENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSSEASNGYGQWLSPEDAERITK
        AE AQIDPELE+AFICHLQ+HWFCIRKVNGEWYNFDSLYAAPQ+LSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPIS SEASNGYGQWL+PEDA+RITK
Subjt:  AEPAQIDPELENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSSEASNGYGQWLSPEDAERITK

Query:  SCNSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDEDLKAAIAASLMDSSAIMAAGVSNPQNEPAVSSTQAASPQNVPVVSLETAKTKDVPVVSPKAS
        SCNSTQA  PPQ +NWT+ QDTFLS G+ EML+D+EDED KAAIAASLMDS A+MAAGV+NP NEPAVS TQAASPQN            ++P VSPKA 
Subjt:  SCNSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDEDLKAAIAASLMDSSAIMAAGVSNPQNEPAVSSTQAASPQNVPVVSLETAKTKDVPVVSPKAS

Query:  TLQEVPAVSPKPATLQDVPFISTKPASPQNVPFVSPEASTSQDVRALSPDAAATPQDLHAISTAKAATPNIESMVCTEVAVHQNESGNEAVGNADAAFCE
        T Q+VPAV+PK ATL+DVP +STKPASPQ++P V+PEAS SQ+VRALSPDAAAT   LHA+S AKA TPN    VCTEVAVH+NE  NE+VGNADAAF E
Subjt:  TLQEVPAVSPKPATLQDVPFISTKPASPQNVPFVSPEASTSQDVRALSPDAAATPQDLHAISTAKAATPNIESMVCTEVAVHQNESGNEAVGNADAAFCE

Query:  SGPADNAECGISSPRKKISRT
        SG ADNAEC +SSPRKKISRT
Subjt:  SGPADNAECGISSPRKKISRT

SwissProt top hits (e value, % identity, alignment)
O35815 Ataxin-3 (e value: 2.8e-36; identity: 34.15%)
Query:  LYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTTGD----FLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQ
        ++HE QE  LCA HC+N +LQG +FS  +L+++A  LD +ER  M  G  T +    FL + S N+   G FSIQV+  AL+VW L++I  NSP  +  +
Subjt:  LYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTTGD----FLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQ

Query:  IDPELENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFP---KDFPISSSEASNGYGQWLSPEDAERITKSC
        IDP  E +FIC+ ++HWF +RK+  +W+N +SL   P+ +S  YL+ +L  L+  G+SIF+V+G+ P    D  +   +    +   L  E+   + +  
Subjt:  IDPELENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFP---KDFPISSSEASNGYGQWLSPEDAERITKSC

Query:  -------NSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDE--DLKAAIAASLMDSSAIMAAGVSNPQNEPAVSSTQAASPQ
                  +A   P   +  ++ D   +   +   IDMEDE  DL+ AI  S+  SS  M       ++ P  SST  +S +
Subjt:  -------NSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDE--DLKAAIAASLMDSSAIMAAGVSNPQNEPAVSSTQAASPQ

P54252 Ataxin-3 (e value: 1.5e-37; identity: 37.19%)
Query:  LYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTTGD----FLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQ
        ++HE QE  LCA HC+N +LQG +FS  +L+++A  LD +ER  M  G  T +    FL + S N+   G FSIQV+  AL+VW L++I  NSP  +  +
Subjt:  LYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTTGD----FLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQ

Query:  IDPELENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSSEASNGYGQWLSPEDAERITKSCNST
        IDP  E +FIC+ ++HWF +RK+  +W+N +SL   P+ +S  YL+ +L  L+  G+SIF+V+G+ P        EA     Q +  +   R  K     
Subjt:  IDPELENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSSEASNGYGQWLSPEDAERITKSCNST

Query:  QAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDEDLKAAIAAS
         A    QRV+ T+  +  L + +   ++D ++EDL+ A+A S
Subjt:  QAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDEDLKAAIAAS

Q8LQ36 Putative ataxin-3 homolog (e value: 3.3e-90; identity: 57.1%)
Query:  ACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSG------STTGDFLS--EESHNVSLDGDFSIQVLQKALEVWDLQVIP
        A NGG+LYHEVQE KLCAVHCVNT LQGPFFSEFDL+ALA DLD++ERQ+M  G      +  GDFL+  E SHNVSL GDFSIQVLQKALEVWDLQVIP
Subjt:  ACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSG------STTGDFLS--EESHNVSLDGDFSIQVLQKALEVWDLQVIP

Query:  LNSPVAEPAQIDPELENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSSEASNGYGQWLSPEDA
        L+SP       DPELE AFICHLQDHWFCIRKVNGEWYNF+SLY AP+HLSKFYLSA++D+LKG GWSIF VRGNFPK+ P+ ++E SNG+GQWL+P+DA
Subjt:  LNSPVAEPAQIDPELENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSSEASNGYGQWLSPEDA

Query:  ERITKSCNSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDE-DLKAAIAASLMDSSA----IMAAGVSNPQNEPAVSSTQAASPQNVPVVSLETAKTK
         RIT SCN  Q P     V+    Q   +S  E +M+   ++E DL AAIAASLMD+        A   S  Q+  A+ ST     ++    +LE     
Subjt:  ERITKSCNSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDE-DLKAAIAASLMDSSA----IMAAGVSNPQNEPAVSSTQAASPQNVPVVSLETAKTK

Query:  DVPVVSPKASTLQEVPAVSPKPAT
              P +  ++     +PK  T
Subjt:  DVPVVSPKASTLQEVPAVSPKPAT

Q9CVD2 Ataxin-3 (e value: 5.6e-37; identity: 33.45%)
Query:  LYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTTGD----FLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQ
        ++HE QE  LCA HC+N +LQG +FS  +L+++A  LD +ER  M  G  T +    FL + S N+   G FSIQV+  AL+VW L++I  NSP  +  +
Subjt:  LYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTTGD----FLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQ

Query:  IDPELENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFP--------KDFPISSSEASNGYGQWLS------
        IDP  E +FIC+ ++HWF +RK+  +W+N +SL   P+ +S  YL+ +L  L+  G+SIF+V+G+ P        +   +         G+ L+      
Subjt:  IDPELENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFP--------KDFPISSSEASNGYGQWLS------

Query:  --PEDAERITKSCNSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDE--DLKAAIAASLMDSSAIMAAGVSNPQNEPAVSSTQAASPQ
            D ER+ ++ + +        +   ++ D   +   +   IDMEDE  DL+ AI  S+  SS  M       +N P  SS   +S +
Subjt:  --PEDAERITKSCNSTQAPPPPQRVNWTEQQDTFLSSGETEMLIDMEDE--DLKAAIAASLMDSSAIMAAGVSNPQNEPAVSSTQAASPQ

Q9M391 Ataxin-3 homolog (e value: 1.0e-102; identity: 70.76%)
Query:  MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTT-----GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIP
        M+   NGGMLYHEVQES LCAVHCVNTVLQGPFFSEFDLAA+A+DLD KERQ+ML G+       GDFL+EESHNVSL GDFSIQVLQKALEVWDLQVIP
Subjt:  MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTT-----GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIP

Query:  LNSPVAEPAQIDPELENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPI-SSSEASNGYGQWLSPED
        LN P AEPAQIDPELE+AFICHL DHWFCIRKVNGEWYNFDSL AAPQHLSKFYLSA+LDSLKG GWSIFIV+GNFP++ P+ SSSEASN +GQWLSPED
Subjt:  LNSPVAEPAQIDPELENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPI-SSSEASNGYGQWLSPED

Query:  AERITKSCNSTQAPPPPQRVNWTEQQ--DTFLSSGETEMLIDMEDEDLKAAIAASLMDSSA----IMAAGVSNPQNE
        AERI K+ +S  +    +  +   QQ  +  LS  E +   +MED+DLKAAIAASL+D+SA    + A G S  + E
Subjt:  AERITKSCNSTQAPPPPQRVNWTEQQ--DTFLSSGETEMLIDMEDEDLKAAIAASLMDSSA----IMAAGVSNPQNE

Arabidopsis top hits (e value, % identity, alignment)
AT2G29640.1 JOSEPHIN-like protein (e value: 4.3e-08; identity: 27.21%)
Query:  LYHEVQESKLCAVHCVNTVLQG-PFFSEFDLAALASDLDRKERQMMLSGSTTGDFLSEESHNVSLDGDFSIQVLQKALE------VWDLQVIPLNSPVAE
        +YHE Q  + C +HC+N + Q    F++  L ++A  L+  +        T   F+ +  HN ++ G++ + V+  ALE      VW  + I  +S   +
Subjt:  LYHEVQESKLCAVHCVNTVLQG-PFFSEFDLAALASDLDRKERQMMLSGSTTGDFLSEESHNVSLDGDFSIQVLQKALE------VWDLQVIPLNSPVAE

Query:  PAQ------IDPELENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQ
         A       ++  ++         HW  +RK+NG WYN DS    PQ
Subjt:  PAQ------IDPELENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQ

AT3G54130.1 Josephin family protein (e value: 7.1e-104; identity: 70.76%)
Query:  MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTT-----GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIP
        M+   NGGMLYHEVQES LCAVHCVNTVLQGPFFSEFDLAA+A+DLD KERQ+ML G+       GDFL+EESHNVSL GDFSIQVLQKALEVWDLQVIP
Subjt:  MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTT-----GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIP

Query:  LNSPVAEPAQIDPELENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPI-SSSEASNGYGQWLSPED
        LN P AEPAQIDPELE+AFICHL DHWFCIRKVNGEWYNFDSL AAPQHLSKFYLSA+LDSLKG GWSIFIV+GNFP++ P+ SSSEASN +GQWLSPED
Subjt:  LNSPVAEPAQIDPELENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPI-SSSEASNGYGQWLSPED

Query:  AERITKSCNSTQAPPPPQRVNWTEQQ--DTFLSSGETEMLIDMEDEDLKAAIAASLMDSSA----IMAAGVSNPQNE
        AERI K+ +S  +    +  +   QQ  +  LS  E +   +MED+DLKAAIAASL+D+SA    + A G S  + E
Subjt:  AERITKSCNSTQAPPPPQRVNWTEQQ--DTFLSSGETEMLIDMEDEDLKAAIAASLMDSSA----IMAAGVSNPQNE


Sequences
CDS sequence
ATGGACGGAGCCTGCAATGGAGGCATGTTGTATCACGAGGTTCAAGAATCCAAGCTCTGCGCTGTGCATTGCGTCAACACCGTCTTGCAAGGTCCCTTCTTCTCCGAATT
CGATTTGGCTGCTCTCGCTTCCGATCTTGACCGCAAAGAGCGCCAGATGATGCTTTCTGGTTCCACCACCGGTGATTTCCTCTCCGAGGAGTCTCACAATGTCTCCTTGG
ACGGTGATTTTAGCATCCAGGTCTTACAAAAGGCTTTGGAGGTATGGGATCTCCAAGTCATTCCCCTCAACTCACCAGTTGCTGAACCTGCGCAGATTGATCCTGAACTG
GAGAATGCATTTATATGTCACTTGCAAGATCATTGGTTTTGTATTAGGAAAGTGAATGGGGAGTGGTACAATTTCGATAGTCTATATGCAGCCCCTCAGCATCTTTCTAA
GTTTTACCTTTCAGCTTACTTGGACTCTCTAAAGGGCTTCGGTTGGAGCATTTTTATTGTTAGGGGTAACTTCCCTAAGGATTTTCCCATCTCATCCTCTGAAGCATCCA
ACGGTTATGGTCAGTGGCTTTCCCCTGAGGATGCTGAGAGGATAACCAAATCGTGCAACTCTACCCAAGCCCCCCCTCCTCCTCAAAGAGTAAACTGGACAGAGCAGCAA
GATACATTTCTTTCATCTGGAGAAACAGAAATGCTAATAGACATGGAGGATGAGGACTTGAAGGCTGCAATAGCTGCTAGCCTTATGGATTCCTCAGCAATCATGGCAGC
AGGAGTTTCTAACCCCCAAAATGAACCTGCAGTTTCCTCCACCCAAGCTGCCTCCCCCCAGAATGTACCTGTTGTTTCCCTTGAAACTGCCAAGACCAAAGATGTACCCG
TAGTTTCCCCGAAAGCTTCCACCCTCCAAGAGGTCCCTGCAGTCTCTCCAAAACCTGCCACCCTCCAAGATGTACCTTTCATTTCCACCAAACCTGCCTCCCCCCAAAAT
GTACCTTTTGTTTCCCCTGAAGCTTCCACCTCCCAGGATGTACGTGCACTTTCGCCTGATGCTGCTGCTACCCCTCAAGATTTACATGCCATTTCCACCGCCAAAGCTGC
CACCCCCAATATTGAATCAATGGTCTGCACAGAAGTTGCTGTTCATCAAAACGAGTCTGGAAATGAAGCTGTAGGCAATGCAGATGCTGCCTTCTGTGAAAGTGGACCTG
CAGATAATGCAGAATGTGGCATTTCCAGCCCTCGAAAGAAAATTAGTCGTACGAATGAGGGAACTGCGTGA
mRNA sequence
ATGGACGGAGCCTGCAATGGAGGCATGTTGTATCACGAGGTTCAAGAATCCAAGCTCTGCGCTGTGCATTGCGTCAACACCGTCTTGCAAGGTCCCTTCTTCTCCGAATT
CGATTTGGCTGCTCTCGCTTCCGATCTTGACCGCAAAGAGCGCCAGATGATGCTTTCTGGTTCCACCACCGGTGATTTCCTCTCCGAGGAGTCTCACAATGTCTCCTTGG
ACGGTGATTTTAGCATCCAGGTCTTACAAAAGGCTTTGGAGGTATGGGATCTCCAAGTCATTCCCCTCAACTCACCAGTTGCTGAACCTGCGCAGATTGATCCTGAACTG
GAGAATGCATTTATATGTCACTTGCAAGATCATTGGTTTTGTATTAGGAAAGTGAATGGGGAGTGGTACAATTTCGATAGTCTATATGCAGCCCCTCAGCATCTTTCTAA
GTTTTACCTTTCAGCTTACTTGGACTCTCTAAAGGGCTTCGGTTGGAGCATTTTTATTGTTAGGGGTAACTTCCCTAAGGATTTTCCCATCTCATCCTCTGAAGCATCCA
ACGGTTATGGTCAGTGGCTTTCCCCTGAGGATGCTGAGAGGATAACCAAATCGTGCAACTCTACCCAAGCCCCCCCTCCTCCTCAAAGAGTAAACTGGACAGAGCAGCAA
GATACATTTCTTTCATCTGGAGAAACAGAAATGCTAATAGACATGGAGGATGAGGACTTGAAGGCTGCAATAGCTGCTAGCCTTATGGATTCCTCAGCAATCATGGCAGC
AGGAGTTTCTAACCCCCAAAATGAACCTGCAGTTTCCTCCACCCAAGCTGCCTCCCCCCAGAATGTACCTGTTGTTTCCCTTGAAACTGCCAAGACCAAAGATGTACCCG
TAGTTTCCCCGAAAGCTTCCACCCTCCAAGAGGTCCCTGCAGTCTCTCCAAAACCTGCCACCCTCCAAGATGTACCTTTCATTTCCACCAAACCTGCCTCCCCCCAAAAT
GTACCTTTTGTTTCCCCTGAAGCTTCCACCTCCCAGGATGTACGTGCACTTTCGCCTGATGCTGCTGCTACCCCTCAAGATTTACATGCCATTTCCACCGCCAAAGCTGC
CACCCCCAATATTGAATCAATGGTCTGCACAGAAGTTGCTGTTCATCAAAACGAGTCTGGAAATGAAGCTGTAGGCAATGCAGATGCTGCCTTCTGTGAAAGTGGACCTG
CAGATAATGCAGAATGTGGCATTTCCAGCCCTCGAAAGAAAATTAGTCGTACGAATGAGGGAACTGCGTGA
Protein sequence
MDGACNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALASDLDRKERQMMLSGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIPLNSPVAEPAQIDPEL
ENAFICHLQDHWFCIRKVNGEWYNFDSLYAAPQHLSKFYLSAYLDSLKGFGWSIFIVRGNFPKDFPISSSEASNGYGQWLSPEDAERITKSCNSTQAPPPPQRVNWTEQQ
DTFLSSGETEMLIDMEDEDLKAAIAASLMDSSAIMAAGVSNPQNEPAVSSTQAASPQNVPVVSLETAKTKDVPVVSPKASTLQEVPAVSPKPATLQDVPFISTKPASPQN
VPFVSPEASTSQDVRALSPDAAATPQDLHAISTAKAATPNIESMVCTEVAVHQNESGNEAVGNADAAFCESGPADNAECGISSPRKKISRTNEGTA
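
The CDS and protein records above can be cross-checked against each other by translating the CDS. The following is a minimal sketch, not part of the database record: it assumes the standard genetic code, and the helper name `translate` is hypothetical. It translates only the first 60 nucleotides and the last 18 nucleotides of the CDS listed above.

```python
# Minimal sketch: translate a CDS fragment with the standard genetic code
# (assumed here) and compare it with the listed protein sequence.
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, enumerated with bases in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a nucleotide string codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 60 nt and last 18 nt of the CDS sequence for HG10020372.
cds_start = "ATGGACGGAGCCTGCAATGGAGGCATGTTGTATCACGAGGTTCAAGAATCCAAGCTCTGC"
cds_end = "AATGAGGGAACTGCGTGA"

print(translate(cds_start))  # start of the protein sequence
print(translate(cds_end))    # end of the protein sequence plus the stop codon
```

Passing the full CDS to the same function would allow a check of the whole protein sequence rather than only its two ends.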