; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg12499 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg12499
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionUbiquitinyl hydrolase 1
Genome locationCarg_Chr14:392487..398681
RNA-Seq ExpressionCarg12499
SyntenyCarg12499
Gene Ontology termsGO:0016579 - protein deubiquitination (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016020 - membrane (cellular component)
GO:0004843 - thiol-dependent ubiquitin-specific protease activity (molecular function)
InterPro domainsIPR003388 - Reticulon
IPR006155 - Josephin domain
IPR033865 - Machado-Joseph disease protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6580493.1 Ataxin-3-like protein, partial [Cucurbita argyrosperma subsp. sororia]1.4e-22798.29Show/hide
Query:  MDEASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLAGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIALNSPA
        MDEASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLAGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIALNSPA
Subjt:  MDEASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLAGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIALNSPA

Query:  AELAQIDPELESAFICHLQNHWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPNDFPISCSEASNGYGQWLTPEDADRITK
        AELAQIDPELESAFICHLQNHWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPNDFPISCSEASNGYGQWLTPEDADRITK
Subjt:  AELAQIDPELESAFICHLQNHWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPNDFPISCSEASNGYGQWLTPEDADRITK

Query:  SCNSTQARPPLGINWTKPQDTFLSYGDAEMLMDVEDEDFKAAIAASLMDSPAVMAAGVANPHNEPAVSSTQAASPQNNIPAVFPKAFTHQDVPAVAPKAA
        SCNSTQARPP GINWTKPQDTFLSYGDAEMLMDVEDEDFKAAIAASLMDSPAVMAAGVANPHNEPAVSSTQAASPQNNIPAVFPKAFTHQDVPAVAPKAA
Subjt:  SCNSTQARPPLGINWTKPQDTFLSYGDAEMLMDVEDEDFKAAIAASLMDSPAVMAAGVANPHNEPAVSSTQAASPQNNIPAVFPKAFTHQDVPAVAPKAA

Query:  TPKDVPVVSTKPASPQDMPVVTPEASISQNVRALSPDAAATLHADSAAKATTPNNLKVCTEVAVHQNEPANESVGNADAAFGESGLADNAECAVSSPRKK
        TPKDVPVVSTKPASPQDMPVVTPEASISQNVRALSPDAAATLHADSAAKATTPNNLKVCTEVAVHQNEPANESVGNADAAFGESGLADNAECAVSSPRKK
Subjt:  TPKDVPVVSTKPASPQDMPVVTPEASISQNVRALSPDAAATLHADSAAKATTPNNLKVCTEVAVHQNEPANESVGNADAAFGESGLADNAECAVSSPRKK

Query:  ISLLLMGSS
        IS   +G++
Subjt:  ISLLLMGSS

KAG7017246.1 Ataxin-3-like protein, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+00100Show/hide
Query:  MDEASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLAGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIALNSPA
        MDEASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLAGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIALNSPA
Subjt:  MDEASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLAGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIALNSPA

Query:  AELAQIDPELESAFICHLQNHWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPNDFPISCSEASNGYGQWLTPEDADRITK
        AELAQIDPELESAFICHLQNHWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPNDFPISCSEASNGYGQWLTPEDADRITK
Subjt:  AELAQIDPELESAFICHLQNHWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPNDFPISCSEASNGYGQWLTPEDADRITK

Query:  SCNSTQARPPLGINWTKPQDTFLSYGDAEMLMDVEDEDFKAAIAASLMDSPAVMAAGVANPHNEPAVSSTQAASPQNNIPAVFPKAFTHQDVPAVAPKAA
        SCNSTQARPPLGINWTKPQDTFLSYGDAEMLMDVEDEDFKAAIAASLMDSPAVMAAGVANPHNEPAVSSTQAASPQNNIPAVFPKAFTHQDVPAVAPKAA
Subjt:  SCNSTQARPPLGINWTKPQDTFLSYGDAEMLMDVEDEDFKAAIAASLMDSPAVMAAGVANPHNEPAVSSTQAASPQNNIPAVFPKAFTHQDVPAVAPKAA

Query:  TPKDVPVVSTKPASPQDMPVVTPEASISQNVRALSPDAAATLHADSAAKATTPNNLKVCTEVAVHQNEPANESVGNADAAFGESGLADNAECAVSSPRKK
        TPKDVPVVSTKPASPQDMPVVTPEASISQNVRALSPDAAATLHADSAAKATTPNNLKVCTEVAVHQNEPANESVGNADAAFGESGLADNAECAVSSPRKK
Subjt:  TPKDVPVVSTKPASPQDMPVVTPEASISQNVRALSPDAAATLHADSAAKATTPNNLKVCTEVAVHQNEPANESVGNADAAFGESGLADNAECAVSSPRKK

Query:  ISLLLMGSSDRLFCRQRTLHEIVGGGIVADVILWRRKDLTLWILSITVATWVVFERCGYTLLSLISSVLLLLVTIIFLWAKSASILNRPAPPLPELHLSE
        ISLLLMGSSDRLFCRQRTLHEIVGGGIVADVILWRRKDLTLWILSITVATWVVFERCGYTLLSLISSVLLLLVTIIFLWAKSASILNRPAPPLPELHLSE
Subjt:  ISLLLMGSSDRLFCRQRTLHEIVGGGIVADVILWRRKDLTLWILSITVATWVVFERCGYTLLSLISSVLLLLVTIIFLWAKSASILNRPAPPLPELHLSE

Query:  DMVNEAASFIRSHVNDFLSVSQDIAMGKNPRLFFKVAACLWVISVISGLTDLITLCYTSLLLVLTIPALYEKYEDYVDRQVVLMYRKLHQFYVKLDEKRV
        DMVNEAASFIRSHVNDFLSVSQDIAMGKNPRLFFKVAACLWVISVISGLTDLITLCYTSLLLVLTIPALYEKYEDYVDRQVVLMYRKLHQFYVKLDEKRV
Subjt:  DMVNEAASFIRSHVNDFLSVSQDIAMGKNPRLFFKVAACLWVISVISGLTDLITLCYTSLLLVLTIPALYEKYEDYVDRQVVLMYRKLHQFYVKLDEKRV

Query:  LTYQQWILEKEKLS
        LTYQQWILEKEKLS
Subjt:  LTYQQWILEKEKLS

XP_022934132.1 ataxin-3 homolog [Cucurbita moschata]3.5e-22396.58Show/hide
Query:  MDEASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLAGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIALNSPA
        MDEASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLAGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIALNSPA
Subjt:  MDEASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLAGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIALNSPA

Query:  AELAQIDPELESAFICHLQNHWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPNDFPISCSEASNGYGQWLTPEDADRITK
        AELAQIDPELESAFICHLQNHWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPNDFPISCSEASNGYGQWLTPEDADRITK
Subjt:  AELAQIDPELESAFICHLQNHWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPNDFPISCSEASNGYGQWLTPEDADRITK

Query:  SCNSTQARPPLGINWTKPQDTFLSYGDAEMLMDVEDEDFKAAIAASLMDSPAVMAAGVANPHNEPAVSSTQAASPQNNIPAVFPKAFTHQDVPAVAPKAA
        SCNSTQARPP GINWTKP DTFLSYGDAEMLMDVEDEDFKAAIAASLMDSPAVMAAGVANPHNEPAVSSTQAASPQNNIPAVFPKAFTHQDVPAVA KAA
Subjt:  SCNSTQARPPLGINWTKPQDTFLSYGDAEMLMDVEDEDFKAAIAASLMDSPAVMAAGVANPHNEPAVSSTQAASPQNNIPAVFPKAFTHQDVPAVAPKAA

Query:  TPKDVPVVSTKPASPQDMPVVTPEASISQNVRALSPDAAATLHADSAAKATTPNNLKVCTEVAVHQNEPANESVGNADAAFGESGLADNAECAVSSPRKK
        T KDVPV+STKPASP+DMPVVTPEASISQNVRALSPDAAATLH DSAAKATTPNNL VCTEVAVHQNEPANESVGNADAAFGESGLADNAECAVSSPRKK
Subjt:  TPKDVPVVSTKPASPQDMPVVTPEASISQNVRALSPDAAATLHADSAAKATTPNNLKVCTEVAVHQNEPANESVGNADAAFGESGLADNAECAVSSPRKK

Query:  ISLLLMGSS
        IS   +G++
Subjt:  ISLLLMGSS

XP_022983320.1 ataxin-3 homolog [Cucurbita maxima]6.2e-22097.26Show/hide
Query:  MDEASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLAGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIALNSPA
        MDEASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMML GSTTGDFLSE+SHNVSLDGDFSIQVLQKALEVWDLQVIALNSPA
Subjt:  MDEASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLAGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIALNSPA

Query:  AELAQIDPELESAFICHLQNHWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPNDFPISCSEASNGYGQWLTPEDADRITK
        AELAQIDPELESAFICHLQNHWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFP DFPISCSEASNGYGQWLTPEDADRITK
Subjt:  AELAQIDPELESAFICHLQNHWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPNDFPISCSEASNGYGQWLTPEDADRITK

Query:  SCNSTQARPPLGINWTKPQDTFLSYGDAEMLMDVEDEDFKAAIAASLMDSPAVMAAGVANPHNEPAVSSTQAASPQNNIPAVFPKAFTHQDVPAVAPKAA
        SCNSTQARPP GINWTKPQDTFLSYGDAEMLMDVEDEDFKAAIAASLMDSPAVMAAGVANPHNEPAVS TQAASPQNNIPAV PKAFTHQDVPAVAPKAA
Subjt:  SCNSTQARPPLGINWTKPQDTFLSYGDAEMLMDVEDEDFKAAIAASLMDSPAVMAAGVANPHNEPAVSSTQAASPQNNIPAVFPKAFTHQDVPAVAPKAA

Query:  TPKDVPVVSTKPASPQDMPVVTPEASISQNVRALSPDAAATLHADSAAKATTPNNLKVCTEVAVHQNEPANESVGNADAAFGESGLADNAECAVSSPRKK
        T KDVPVVSTKPASPQDMPVVTPEASISQNVRALSPDAAATLHA SAAKATTPNNL VCTEVAVH+NEPANESVGNADAAF ESGLADNAECAVSSPRKK
Subjt:  TPKDVPVVSTKPASPQDMPVVTPEASISQNVRALSPDAAATLHADSAAKATTPNNLKVCTEVAVHQNEPANESVGNADAAFGESGLADNAECAVSSPRKK

Query:  IS
        IS
Subjt:  IS

XP_023527477.1 ataxin-3 homolog [Cucurbita pepo subsp. pepo]6.4e-22597.31Show/hide
Query:  MDEASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLAGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIALNSPA
        MDEASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLAGSTTGDFLS+ESHNVSLDGDFSIQVLQKALEVWDLQVIALNSPA
Subjt:  MDEASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLAGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIALNSPA

Query:  AELAQIDPELESAFICHLQNHWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPNDFPISCSEASNGYGQWLTPEDADRITK
        AELAQIDPELESAFICHLQNHWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPNDFPISCSEASNGYGQWLTPEDADRITK
Subjt:  AELAQIDPELESAFICHLQNHWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPNDFPISCSEASNGYGQWLTPEDADRITK

Query:  SCNSTQARPPLGINWTKPQDTFLSYGDAEMLMDVEDEDFKAAIAASLMDSPAVMAAGVANPHNEPAVSSTQAASPQNNIPAVFPKAFTHQDVPAVAPKAA
        SCNSTQARPP GINWTKPQDTFLSYGDAEMLMDVEDEDFKAAIAASLMDSPAVMAAGVANPHNEPAVSSTQAASPQNNIPAVFPKAFTHQDVPAVAPKAA
Subjt:  SCNSTQARPPLGINWTKPQDTFLSYGDAEMLMDVEDEDFKAAIAASLMDSPAVMAAGVANPHNEPAVSSTQAASPQNNIPAVFPKAFTHQDVPAVAPKAA

Query:  TPKDVPVVSTKPASPQDMPVVTPEASISQNVRALSPDAAATLHADSAAKATTPNNLKVCTEVAVHQNEPANESVGNADAAFGESGLADNAECAVSSPRKK
        T KDVPVVSTKPASPQDMPVVTPEASISQNVRALSPDAAATLHADSAAKATTPNNL VCTEVAVH+NEPANESVGNADAAFGESGLADNAECAVSSPRKK
Subjt:  TPKDVPVVSTKPASPQDMPVVTPEASISQNVRALSPDAAATLHADSAAKATTPNNLKVCTEVAVHQNEPANESVGNADAAFGESGLADNAECAVSSPRKK

Query:  ISLLLMGSS
        IS   +G++
Subjt:  ISLLLMGSS

TrEMBL top hitse value%identityAlignment
A0A0A0L9Z8 Ubiquitinyl hydrolase 18.1e-17378.04Show/hide
Query:  MDEASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLAGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIALNSPA
        MD A NGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALA DLDRKERQMML+GSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVI LNSP 
Subjt:  MDEASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLAGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIALNSPA

Query:  AELAQIDPELESAFICHLQNHWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPNDFPISCSEASNGYGQWLTPEDADRITK
        AE AQIDP+LE+AFICHLQ+HWFCIRKVNGEWYNFDSLYAAPQ+LSKFYLSAYLDSLKGFGWSIFIVRGNFP DFPIS SEASNGYGQWL+PEDA+RITK
Subjt:  AELAQIDPELESAFICHLQNHWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPNDFPISCSEASNGYGQWLTPEDADRITK

Query:  SCNSTQA-RPPLGINWTKPQDTFLSYGDAEMLMDVEDEDFKAAIAASLMDSPAVMAAGVANPHNEPAVSSTQAASPQN------------NIPAVFPKAF
        SCNSTQA  PP   NWT+ QDTFLS G+ EML+D+EDED KAAIAASLMDS AVMAAGVANP NEP VSSTQA SPQN            ++ AV P A 
Subjt:  SCNSTQA-RPPLGINWTKPQDTFLSYGDAEMLMDVEDEDFKAAIAASLMDSPAVMAAGVANPHNEPAVSSTQAASPQN------------NIPAVFPKAF

Query:  THQDVPAVAPKAATPKDVPVVSTKPASPQDMPVVTPEASISQNVRALSPDAA---ATLHADSAAKATTPNNLK-VCTEVAVHQNEPANESVGNADAAFGE
          +DVPAV+P+AAT +DVP +S K ASPQ+ P V+PEAS SQ+V  LSP+AA     LH  S AKA  P N   VCTEV+VHQNE  NESVGNAD AF +
Subjt:  THQDVPAVAPKAATPKDVPVVSTKPASPQDMPVVTPEASISQNVRALSPDAA---ATLHADSAAKATTPNNLK-VCTEVAVHQNEPANESVGNADAAFGE

Query:  SGLADNAECAVSSPRKKIS
        SG ADN ECAVSSPRKKIS
Subjt:  SGLADNAECAVSSPRKKIS

A0A1S3B6I4 Ubiquitinyl hydrolase 11.6e-17378.28Show/hide
Query:  MDEASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLAGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIALNSPA
        MD A NGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALA DLDRKERQMML+GSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVI LNSP 
Subjt:  MDEASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLAGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIALNSPA

Query:  AELAQIDPELESAFICHLQNHWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPNDFPISCSEASNGYGQWLTPEDADRITK
        AE AQIDP+LE+AFICHLQ+HWFCIRKVNGEWYNFDSLYAAPQ+LSKFYLSAYLDSLKGFGWSIFIVRGNFP DFPIS SEASNGYGQWL+PEDA+RITK
Subjt:  AELAQIDPELESAFICHLQNHWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPNDFPISCSEASNGYGQWLTPEDADRITK

Query:  SCNSTQA-RPPLGINWTKPQDTFLSYGDAEMLMDVEDEDFKAAIAASLMDSPAVMAAGVANPHNEPAVSSTQAASPQN------------NIPAVFPKAF
        SCNSTQA  PP   NWT+ QDTFLS G+ EML+D+EDED KAAIAASL+DS AVMAAGVANP NEP VSSTQA SPQN            ++ AV P A 
Subjt:  SCNSTQA-RPPLGINWTKPQDTFLSYGDAEMLMDVEDEDFKAAIAASLMDSPAVMAAGVANPHNEPAVSSTQAASPQN------------NIPAVFPKAF

Query:  THQDVPAVAPKAATPKDVPVVSTKPASPQDMPVVTPEASISQNVRALSPDAA---ATLHADSAAKATTPNNLK-VCTEVAVHQNEPANESVGNADAAFGE
          QDV AV+ KAAT +DVP +S K A PQ++P V+PEAS SQ+VRAL PDAA     LHA S  KA TPN+   VCTEV VHQNE  NESVGNAD AF E
Subjt:  THQDVPAVAPKAATPKDVPVVSTKPASPQDMPVVTPEASISQNVRALSPDAA---ATLHADSAAKATTPNNLK-VCTEVAVHQNEPANESVGNADAAFGE

Query:  SGLADNAECAVSSPRKKIS
        SG ADN ECA+SSPRKKIS
Subjt:  SGLADNAECAVSSPRKKIS

A0A5D3DN96 Ubiquitinyl hydrolase 11.6e-17378.28Show/hide
Query:  MDEASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLAGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIALNSPA
        MD A NGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALA DLDRKERQMML+GSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVI LNSP 
Subjt:  MDEASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLAGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIALNSPA

Query:  AELAQIDPELESAFICHLQNHWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPNDFPISCSEASNGYGQWLTPEDADRITK
        AE AQIDP+LE+AFICHLQ+HWFCIRKVNGEWYNFDSLYAAPQ+LSKFYLSAYLDSLKGFGWSIFIVRGNFP DFPIS SEASNGYGQWL+PEDA+RITK
Subjt:  AELAQIDPELESAFICHLQNHWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPNDFPISCSEASNGYGQWLTPEDADRITK

Query:  SCNSTQA-RPPLGINWTKPQDTFLSYGDAEMLMDVEDEDFKAAIAASLMDSPAVMAAGVANPHNEPAVSSTQAASPQN------------NIPAVFPKAF
        SCNSTQA  PP   NWT+ QDTFLS G+ EML+D+EDED KAAIAASL+DS AVMAAGVANP NEP VSSTQA SPQN            ++ AV P A 
Subjt:  SCNSTQA-RPPLGINWTKPQDTFLSYGDAEMLMDVEDEDFKAAIAASLMDSPAVMAAGVANPHNEPAVSSTQAASPQN------------NIPAVFPKAF

Query:  THQDVPAVAPKAATPKDVPVVSTKPASPQDMPVVTPEASISQNVRALSPDAA---ATLHADSAAKATTPNNLK-VCTEVAVHQNEPANESVGNADAAFGE
          QDV AV+ KAAT +DVP +S K A PQ++P V+PEAS SQ+VRAL PDAA     LHA S  KA TPN+   VCTEV VHQNE  NESVGNAD AF E
Subjt:  THQDVPAVAPKAATPKDVPVVSTKPASPQDMPVVTPEASISQNVRALSPDAA---ATLHADSAAKATTPNNLK-VCTEVAVHQNEPANESVGNADAAFGE

Query:  SGLADNAECAVSSPRKKIS
        SG ADN ECA+SSPRKKIS
Subjt:  SGLADNAECAVSSPRKKIS

A0A6J1F1U3 Ubiquitinyl hydrolase 11.7e-22396.58Show/hide
Query:  MDEASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLAGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIALNSPA
        MDEASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLAGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIALNSPA
Subjt:  MDEASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLAGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIALNSPA

Query:  AELAQIDPELESAFICHLQNHWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPNDFPISCSEASNGYGQWLTPEDADRITK
        AELAQIDPELESAFICHLQNHWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPNDFPISCSEASNGYGQWLTPEDADRITK
Subjt:  AELAQIDPELESAFICHLQNHWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPNDFPISCSEASNGYGQWLTPEDADRITK

Query:  SCNSTQARPPLGINWTKPQDTFLSYGDAEMLMDVEDEDFKAAIAASLMDSPAVMAAGVANPHNEPAVSSTQAASPQNNIPAVFPKAFTHQDVPAVAPKAA
        SCNSTQARPP GINWTKP DTFLSYGDAEMLMDVEDEDFKAAIAASLMDSPAVMAAGVANPHNEPAVSSTQAASPQNNIPAVFPKAFTHQDVPAVA KAA
Subjt:  SCNSTQARPPLGINWTKPQDTFLSYGDAEMLMDVEDEDFKAAIAASLMDSPAVMAAGVANPHNEPAVSSTQAASPQNNIPAVFPKAFTHQDVPAVAPKAA

Query:  TPKDVPVVSTKPASPQDMPVVTPEASISQNVRALSPDAAATLHADSAAKATTPNNLKVCTEVAVHQNEPANESVGNADAAFGESGLADNAECAVSSPRKK
        T KDVPV+STKPASP+DMPVVTPEASISQNVRALSPDAAATLH DSAAKATTPNNL VCTEVAVHQNEPANESVGNADAAFGESGLADNAECAVSSPRKK
Subjt:  TPKDVPVVSTKPASPQDMPVVTPEASISQNVRALSPDAAATLHADSAAKATTPNNLKVCTEVAVHQNEPANESVGNADAAFGESGLADNAECAVSSPRKK

Query:  ISLLLMGSS
        IS   +G++
Subjt:  ISLLLMGSS

A0A6J1J5J2 Ubiquitinyl hydrolase 13.0e-22097.26Show/hide
Query:  MDEASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLAGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIALNSPA
        MDEASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMML GSTTGDFLSE+SHNVSLDGDFSIQVLQKALEVWDLQVIALNSPA
Subjt:  MDEASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLAGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIALNSPA

Query:  AELAQIDPELESAFICHLQNHWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPNDFPISCSEASNGYGQWLTPEDADRITK
        AELAQIDPELESAFICHLQNHWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFP DFPISCSEASNGYGQWLTPEDADRITK
Subjt:  AELAQIDPELESAFICHLQNHWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPNDFPISCSEASNGYGQWLTPEDADRITK

Query:  SCNSTQARPPLGINWTKPQDTFLSYGDAEMLMDVEDEDFKAAIAASLMDSPAVMAAGVANPHNEPAVSSTQAASPQNNIPAVFPKAFTHQDVPAVAPKAA
        SCNSTQARPP GINWTKPQDTFLSYGDAEMLMDVEDEDFKAAIAASLMDSPAVMAAGVANPHNEPAVS TQAASPQNNIPAV PKAFTHQDVPAVAPKAA
Subjt:  SCNSTQARPPLGINWTKPQDTFLSYGDAEMLMDVEDEDFKAAIAASLMDSPAVMAAGVANPHNEPAVSSTQAASPQNNIPAVFPKAFTHQDVPAVAPKAA

Query:  TPKDVPVVSTKPASPQDMPVVTPEASISQNVRALSPDAAATLHADSAAKATTPNNLKVCTEVAVHQNEPANESVGNADAAFGESGLADNAECAVSSPRKK
        T KDVPVVSTKPASPQDMPVVTPEASISQNVRALSPDAAATLHA SAAKATTPNNL VCTEVAVH+NEPANESVGNADAAF ESGLADNAECAVSSPRKK
Subjt:  TPKDVPVVSTKPASPQDMPVVTPEASISQNVRALSPDAAATLHADSAAKATTPNNLKVCTEVAVHQNEPANESVGNADAAFGESGLADNAECAVSSPRKK

Query:  IS
        IS
Subjt:  IS

SwissProt top hitse value%identityAlignment
O35815 Ataxin-34.7e-3742.86Show/hide
Query:  LYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLAGSTTGD----FLSEESHNVSLDGDFSIQVLQKALEVWDLQVIALNSPAAELAQ
        ++HE QE  LCA HC+N +LQG +FS  +L+++A+ LD +ER  M  G  T +    FL + S N+   G FSIQV+  AL+VW L++I  NSP  +  +
Subjt:  LYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLAGSTTGD----FLSEESHNVSLDGDFSIQVLQKALEVWDLQVIALNSPAAELAQ

Query:  IDPELESAFICHLQNHWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPN
        IDP  E +FIC+ + HWF +RK+  +W+N +SL   P+ +S  YL+ +L  L+  G+SIF+V+G+ P+
Subjt:  IDPELESAFICHLQNHWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPN

P54252 Ataxin-31.6e-3734.52Show/hide
Query:  LYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLAGSTTGD----FLSEESHNVSLDGDFSIQVLQKALEVWDLQVIALNSPAAELAQ
        ++HE QE  LCA HC+N +LQG +FS  +L+++A+ LD +ER  M  G  T +    FL + S N+   G FSIQV+  AL+VW L++I  NSP  +  +
Subjt:  LYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLAGSTTGD----FLSEESHNVSLDGDFSIQVLQKALEVWDLQVIALNSPAAELAQ

Query:  IDPELESAFICHLQNHWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPNDFPISCSEASNGYGQWLTPEDADRITKSCNST
        IDP  E +FIC+ + HWF +RK+  +W+N +SL   P+ +S  YL+ +L  L+  G+SIF+V+G+ P+     C              +AD++ +     
Subjt:  IDPELESAFICHLQNHWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPNDFPISCSEASNGYGQWLTPEDADRITKSCNST

Query:  QARPPLGINWTKPQ-----------DTFLSYGDAEMLMDVEDEDFKAAIAAS
        Q   P  I     Q           +  L   D   ++D ++ED + A+A S
Subjt:  QARPPLGINWTKPQ-----------DTFLSYGDAEMLMDVEDEDFKAAIAAS

Q8LQ36 Putative ataxin-3 homolog4.1e-8965.62Show/hide
Query:  ASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLAG------STTGDFLS--EESHNVSLDGDFSIQVLQKALEVWDLQVIA
        ASNGG+LYHEVQE KLCAVHCVNT LQGPFFSEFDL+ALA DLD++ERQ+M  G      +  GDFL+  E SHNVSL GDFSIQVLQKALEVWDLQVI 
Subjt:  ASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLAG------STTGDFLS--EESHNVSLDGDFSIQVLQKALEVWDLQVIA

Query:  LNSPAAELAQIDPELESAFICHLQNHWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPNDFPISCSEASNGYGQWLTPEDA
        L+SP       DPELE+AFICHLQ+HWFCIRKVNGEWYNF+SLY AP++LSKFYLSA++D+LKG GWSIF VRGNFP + P++ +E SNG+GQWLTP+DA
Subjt:  LNSPAAELAQIDPELESAFICHLQNHWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPNDFPISCSEASNGYGQWLTPEDA

Query:  DRITKSCNSTQA-RPPLGINWTKPQDTFLSYGDAEMLMDVEDEDFKAAIAASLMDS
         RIT SCN  Q      G++    Q   +S  D  +    E+ D  AAIAASLMD+
Subjt:  DRITKSCNSTQA-RPPLGINWTKPQDTFLSYGDAEMLMDVEDEDFKAAIAASLMDS

Q9M391 Ataxin-3 homolog1.4e-9769.43Show/hide
Query:  MDEASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLAGSTT-----GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIA
        M+  SNGGMLYHEVQES LCAVHCVNTVLQGPFFSEFDLAA+A DLD KERQ+ML G+       GDFL+EESHNVSL GDFSIQVLQKALEVWDLQVI 
Subjt:  MDEASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLAGSTT-----GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIA

Query:  LNSPAAELAQIDPELESAFICHLQNHWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPNDFPI-SCSEASNGYGQWLTPED
        LN P AE AQIDPELESAFICHL +HWFCIRKVNGEWYNFDSL AAPQ+LSKFYLSA+LDSLKG GWSIFIV+GNFP + P+ S SEASN +GQWL+PED
Subjt:  LNSPAAELAQIDPELESAFICHLQNHWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPNDFPI-SCSEASNGYGQWLTPED

Query:  ADRITKSCNSTQA----RPPLGINWTKPQDTFLSYGDAEMLMDVEDEDFKAAIAASLMDSPAVMA
        A+RI K+ +S  +    R    +N  + ++  LS  + +   ++ED+D KAAIAASL+D+ A  A
Subjt:  ADRITKSCNSTQA----RPPLGINWTKPQDTFLSYGDAEMLMDVEDEDFKAAIAASLMDSPAVMA

Q9M392 Reticulon-like protein B122.1e-5353.65Show/hide
Query:  LMGSSDRLFCRQRTLHEIVGGGIVADVILWRRKDLTLWILSITVATWVVFERCGYTLLSLISSVLLLLVTIIFLWAKSASILNRPAPPLPELHLSEDMVN
        +  SSD+LF R RT+HEI+GGGIVADV+LWR+K++++ I+++T+A+W+VFE   YT+ +LISSVLLLL++I+FLW+KSASILNRP+PPLPE  +SE M  
Subjt:  LMGSSDRLFCRQRTLHEIVGGGIVADVILWRRKDLTLWILSITVATWVVFERCGYTLLSLISSVLLLLVTIIFLWAKSASILNRPAPPLPELHLSEDMVN

Query:  EAASFIRSHVNDFLSVSQDIAMGKNPRLFFKVAACLWVISVISGLTDLITLCYTSLLLVLTIPALYEKYEDYVDRQVVLMYRKLHQFYVKLD
        EA+ ++R HVN  L VS DIAM ++  L+ KVA  L+++S+I  L D  TLC+TS+L+V+T+PA YE+YEDY+   +  +  K  + Y++L+
Subjt:  EAASFIRSHVNDFLSVSQDIAMGKNPRLFFKVAACLWVISVISGLTDLITLCYTSLLLVLTIPALYEKYEDYVDRQVVLMYRKLHQFYVKLD

Arabidopsis top hitse value%identityAlignment
AT3G10915.2 Reticulon family protein2.8e-3742.72Show/hide
Query:  SSD-RLFCRQRTLHEIVGGGIVADVILWRRKDLTLWILSITVATWVVFERCGYTLLSLISSVLLLLVTIIFLWAKSASILNRPAPPLPELHLSEDMVNEA
        SSD RL  RQ T+H+ +GGG  AD++LWRR+ L+L ++ I+   W++FE  G   LS+ S VLL+++ I F+ A+ ++  NR    LPEL LSE+MVN A
Subjt:  SSD-RLFCRQRTLHEIVGGGIVADVILWRRKDLTLWILSITVATWVVFERCGYTLLSLISSVLLLLVTIIFLWAKSASILNRPAPPLPELHLSEDMVNEA

Query:  ASFIRSHVNDFLSVSQDIAMGKNPRLFFKVAACLWVISVISGLTDLITLCYTSLLLVLTIPALYEKYEDYVDRQVVLMYRKLHQFYVKLDEKRVLTYQQW
        A+  R  +N  L ++ D+ +G + RLFFKV  CLW++S I     L TL Y   +L +TIPALY KY+  VD+    ++R+L   Y K+ ++ V++   W
Subjt:  ASFIRSHVNDFLSVSQDIAMGKNPRLFFKVAACLWVISVISGLTDLITLCYTSLLLVLTIPALYEKYEDYVDRQVVLMYRKLHQFYVKLDEKRVLTYQQW

Query:  ILEKEK
         L K+K
Subjt:  ILEKEK

AT3G10915.4 Reticulon family protein7.0e-3642.51Show/hide
Query:  SSD-RLFCRQRTLHEIVGGGIVADVILWRRKDLTLWILSITVATWVVFERCGYTLLSLISSVLLLLVTIIFLWAKSASILNRPAPPLPELHLSEDMVNEA
        SSD RL  RQ T+H+ +GGG  AD++LWRR+ L+L ++ I+   W++FE  G   LS+ S VLL+++ I F+ A+ ++  NR    LPEL LSE+MVN A
Subjt:  SSD-RLFCRQRTLHEIVGGGIVADVILWRRKDLTLWILSITVATWVVFERCGYTLLSLISSVLLLLVTIIFLWAKSASILNRPAPPLPELHLSEDMVNEA

Query:  ASFIRSHVNDFLSVSQDIAMGKNPRLFFK-VAACLWVISVISGLTDLITLCYTSLLLVLTIPALYEKYEDYVDRQVVLMYRKLHQFYVKLDEKRVLTYQQ
        A+  R  +N  L ++ D+ +G + RLFFK V  CLW++S I     L TL Y   +L +TIPALY KY+  VD+    ++R+L   Y K+ ++ V++   
Subjt:  ASFIRSHVNDFLSVSQDIAMGKNPRLFFK-VAACLWVISVISGLTDLITLCYTSLLLVLTIPALYEKYEDYVDRQVVLMYRKLHQFYVKLDEKRVLTYQQ

Query:  WILEKEK
        W L K+K
Subjt:  WILEKEK

AT3G19460.1 Reticulon family protein7.0e-3644.51Show/hide
Query:  TLHEIVGGGIVADVILWRRKDLTLWILSITVATWVVFERCGYTLLSLISSVLLLLVTIIFLWAKSASILNRPAPPLPELHLSEDMVNEAASFIRSHVNDF
        ++H+ +G G VAD++LWR +   + +L  +   W +FER GY LLS +S+VLLLLV I FLWAKSA++LNRP PP+P + + E+  N+AA  +R  +N  
Subjt:  TLHEIVGGGIVADVILWRRKDLTLWILSITVATWVVFERCGYTLLSLISSVLLLLVTIIFLWAKSASILNRPAPPLPELHLSEDMVNEAASFIRSHVNDF

Query:  LSVSQDIAMGKNPRLFFKVAACLWVISVISGLTDLITLCYTSLLLVLTIPALYEKYEDYVDRQV
        LS++ DI + +NP    +V+  LW IS +  L + +TL Y  +LL L+ P +YEKY+D++D +V
Subjt:  LSVSQDIAMGKNPRLFFKVAACLWVISVISGLTDLITLCYTSLLLVLTIPALYEKYEDYVDRQV

AT3G54120.1 Reticulon family protein1.5e-5453.65Show/hide
Query:  LMGSSDRLFCRQRTLHEIVGGGIVADVILWRRKDLTLWILSITVATWVVFERCGYTLLSLISSVLLLLVTIIFLWAKSASILNRPAPPLPELHLSEDMVN
        +  SSD+LF R RT+HEI+GGGIVADV+LWR+K++++ I+++T+A+W+VFE   YT+ +LISSVLLLL++I+FLW+KSASILNRP+PPLPE  +SE M  
Subjt:  LMGSSDRLFCRQRTLHEIVGGGIVADVILWRRKDLTLWILSITVATWVVFERCGYTLLSLISSVLLLLVTIIFLWAKSASILNRPAPPLPELHLSEDMVN

Query:  EAASFIRSHVNDFLSVSQDIAMGKNPRLFFKVAACLWVISVISGLTDLITLCYTSLLLVLTIPALYEKYEDYVDRQVVLMYRKLHQFYVKLD
        EA+ ++R HVN  L VS DIAM ++  L+ KVA  L+++S+I  L D  TLC+TS+L+V+T+PA YE+YEDY+   +  +  K  + Y++L+
Subjt:  EAASFIRSHVNDFLSVSQDIAMGKNPRLFFKVAACLWVISVISGLTDLITLCYTSLLLVLTIPALYEKYEDYVDRQVVLMYRKLHQFYVKLD

AT3G54130.1 Josephin family protein9.9e-9969.43Show/hide
Query:  MDEASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLAGSTT-----GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIA
        M+  SNGGMLYHEVQES LCAVHCVNTVLQGPFFSEFDLAA+A DLD KERQ+ML G+       GDFL+EESHNVSL GDFSIQVLQKALEVWDLQVI 
Subjt:  MDEASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLAGSTT-----GDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIA

Query:  LNSPAAELAQIDPELESAFICHLQNHWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPNDFPI-SCSEASNGYGQWLTPED
        LN P AE AQIDPELESAFICHL +HWFCIRKVNGEWYNFDSL AAPQ+LSKFYLSA+LDSLKG GWSIFIV+GNFP + P+ S SEASN +GQWL+PED
Subjt:  LNSPAAELAQIDPELESAFICHLQNHWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPNDFPI-SCSEASNGYGQWLTPED

Query:  ADRITKSCNSTQA----RPPLGINWTKPQDTFLSYGDAEMLMDVEDEDFKAAIAASLMDSPAVMA
        A+RI K+ +S  +    R    +N  + ++  LS  + +   ++ED+D KAAIAASL+D+ A  A
Subjt:  ADRITKSCNSTQA----RPPLGINWTKPQDTFLSYGDAEMLMDVEDEDFKAAIAASLMDSPAVMA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGAAGCCAGCAATGGAGGGATGTTGTATCATGAGGTACAAGAATCAAAGCTCTGCGCTGTGCATTGTGTCAACACCGTGTTGCAGGGTCCCTTCTTCTCCGAATT
CGATTTGGCTGCTCTGGCATACGATCTTGACCGAAAAGAACGCCAGATGATGCTTGCTGGTTCCACCACCGGTGATTTTCTCTCCGAGGAGTCTCACAATGTCTCCTTGG
ACGGTGATTTTAGCATCCAGGTATTACAAAAGGCTTTGGAGGTATGGGATCTCCAAGTCATTGCTCTCAACTCACCAGCTGCTGAACTTGCACAGATTGATCCTGAACTG
GAGAGTGCATTTATTTGTCACTTGCAAAACCATTGGTTTTGTATTAGGAAAGTGAATGGGGAGTGGTACAATTTTGATAGTCTATATGCAGCCCCACAGAATCTTTCTAA
GTTTTACCTTTCAGCTTACTTGGACTCTCTAAAGGGCTTCGGTTGGAGCATTTTTATTGTGAGGGGTAATTTCCCCAACGATTTCCCCATCTCATGCTCTGAAGCATCCA
ACGGTTATGGTCAATGGCTTACCCCTGAGGATGCTGATAGGATAACCAAATCTTGCAACTCAACCCAAGCCCGCCCTCCTCTAGGAATAAACTGGACAAAGCCGCAAGAT
ACATTTCTTTCATATGGAGATGCAGAAATGCTGATGGACGTGGAGGATGAGGACTTCAAGGCTGCAATAGCTGCTAGCTTGATGGATTCCCCAGCAGTCATGGCAGCAGG
AGTTGCTAACCCCCATAATGAACCTGCCGTTTCCTCCACCCAAGCTGCCTCCCCCCAGAATAATATACCTGCAGTTTTCCCAAAAGCTTTCACCCACCAAGATGTTCCTG
CAGTTGCTCCCAAAGCTGCCACCCCCAAAGATGTACCTGTAGTTTCCACCAAACCTGCCTCCCCCCAAGACATGCCTGTTGTTACCCCTGAAGCTTCCATCTCCCAGAAT
GTACGTGCACTTTCCCCCGATGCAGCTGCTACGTTACATGCAGATTCCGCTGCCAAAGCTACCACCCCCAATAATCTTAAGGTCTGCACTGAAGTTGCTGTTCATCAAAA
CGAGCCTGCAAATGAATCTGTAGGCAATGCAGACGCTGCCTTTGGTGAAAGTGGACTTGCAGATAATGCAGAATGCGCTGTTTCGAGCCCCCGAAAGAAAATTAGTCTTC
TTTTGATGGGTTCCTCCGATCGTTTGTTTTGTAGGCAGCGAACTCTTCACGAAATTGTTGGCGGCGGTATTGTTGCAGATGTAATTCTTTGGAGGCGGAAGGATCTAACT
CTGTGGATTCTGTCAATTACGGTGGCGACTTGGGTGGTGTTTGAGAGATGTGGTTACACCCTTTTGTCTTTAATTTCCAGTGTTCTACTTCTTCTTGTTACCATTATCTT
TCTCTGGGCCAAATCGGCGTCCATTCTCAACAGACCCGCGCCGCCTCTTCCTGAGTTGCATCTCTCAGAAGATATGGTGAACGAAGCTGCATCTTTCATTCGCTCCCATG
TGAATGATTTTCTGTCTGTTTCACAAGATATAGCCATGGGAAAAAACCCCAGATTGTTCTTCAAAGTAGCTGCTTGCCTGTGGGTGATTTCAGTTATCAGTGGCTTGACT
GATCTTATCACTTTATGTTACACCAGCCTTTTACTCGTGCTGACAATCCCTGCATTGTATGAGAAGTATGAAGATTACGTAGATAGGCAGGTCGTATTGATGTACAGAAA
ATTGCATCAGTTTTATGTGAAATTGGATGAGAAGCGAGTTCTGACATACCAACAATGGATTTTGGAGAAAGAAAAGCTGAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGACGAAGCCAGCAATGGAGGGATGTTGTATCATGAGGTACAAGAATCAAAGCTCTGCGCTGTGCATTGTGTCAACACCGTGTTGCAGGGTCCCTTCTTCTCCGAATT
CGATTTGGCTGCTCTGGCATACGATCTTGACCGAAAAGAACGCCAGATGATGCTTGCTGGTTCCACCACCGGTGATTTTCTCTCCGAGGAGTCTCACAATGTCTCCTTGG
ACGGTGATTTTAGCATCCAGGTATTACAAAAGGCTTTGGAGGTATGGGATCTCCAAGTCATTGCTCTCAACTCACCAGCTGCTGAACTTGCACAGATTGATCCTGAACTG
GAGAGTGCATTTATTTGTCACTTGCAAAACCATTGGTTTTGTATTAGGAAAGTGAATGGGGAGTGGTACAATTTTGATAGTCTATATGCAGCCCCACAGAATCTTTCTAA
GTTTTACCTTTCAGCTTACTTGGACTCTCTAAAGGGCTTCGGTTGGAGCATTTTTATTGTGAGGGGTAATTTCCCCAACGATTTCCCCATCTCATGCTCTGAAGCATCCA
ACGGTTATGGTCAATGGCTTACCCCTGAGGATGCTGATAGGATAACCAAATCTTGCAACTCAACCCAAGCCCGCCCTCCTCTAGGAATAAACTGGACAAAGCCGCAAGAT
ACATTTCTTTCATATGGAGATGCAGAAATGCTGATGGACGTGGAGGATGAGGACTTCAAGGCTGCAATAGCTGCTAGCTTGATGGATTCCCCAGCAGTCATGGCAGCAGG
AGTTGCTAACCCCCATAATGAACCTGCCGTTTCCTCCACCCAAGCTGCCTCCCCCCAGAATAATATACCTGCAGTTTTCCCAAAAGCTTTCACCCACCAAGATGTTCCTG
CAGTTGCTCCCAAAGCTGCCACCCCCAAAGATGTACCTGTAGTTTCCACCAAACCTGCCTCCCCCCAAGACATGCCTGTTGTTACCCCTGAAGCTTCCATCTCCCAGAAT
GTACGTGCACTTTCCCCCGATGCAGCTGCTACGTTACATGCAGATTCCGCTGCCAAAGCTACCACCCCCAATAATCTTAAGGTCTGCACTGAAGTTGCTGTTCATCAAAA
CGAGCCTGCAAATGAATCTGTAGGCAATGCAGACGCTGCCTTTGGTGAAAGTGGACTTGCAGATAATGCAGAATGCGCTGTTTCGAGCCCCCGAAAGAAAATTAGTCTTC
TTTTGATGGGTTCCTCCGATCGTTTGTTTTGTAGGCAGCGAACTCTTCACGAAATTGTTGGCGGCGGTATTGTTGCAGATGTAATTCTTTGGAGGCGGAAGGATCTAACT
CTGTGGATTCTGTCAATTACGGTGGCGACTTGGGTGGTGTTTGAGAGATGTGGTTACACCCTTTTGTCTTTAATTTCCAGTGTTCTACTTCTTCTTGTTACCATTATCTT
TCTCTGGGCCAAATCGGCGTCCATTCTCAACAGACCCGCGCCGCCTCTTCCTGAGTTGCATCTCTCAGAAGATATGGTGAACGAAGCTGCATCTTTCATTCGCTCCCATG
TGAATGATTTTCTGTCTGTTTCACAAGATATAGCCATGGGAAAAAACCCCAGATTGTTCTTCAAAGTAGCTGCTTGCCTGTGGGTGATTTCAGTTATCAGTGGCTTGACT
GATCTTATCACTTTATGTTACACCAGCCTTTTACTCGTGCTGACAATCCCTGCATTGTATGAGAAGTATGAAGATTACGTAGATAGGCAGGTCGTATTGATGTACAGAAA
ATTGCATCAGTTTTATGTGAAATTGGATGAGAAGCGAGTTCTGACATACCAACAATGGATTTTGGAGAAAGAAAAGCTGAGCTGATCAGCAATCTCCCTTCTTCATGACT
TTCTTATTTCATTTTGGATGTTTCATGGTGTTCTTTCAACCCACCTTGGCCAATGGCTACGACCCCTCATCAGTGACAGAAGCCCGAGAGATGATTGTATTTTGTTCATA
ACTTTGGTTTTCATTTTCTCTAATTCCTGTTAAAAAAAATGTATTTAGAATCGGCATATTATGAATTTATCTTCCCCCATTTGGGTGATTGGGGTTTTACAGGCAAGTTG
ATAACGTATTTACATATGCAGCCTAAGCAGCAGCTGCTGTAATTGGTATTTCAAACCTTTTAATGTCAAATAAAGATTGCAACATTTTCTTATAAATCTTCTTCATCAAG
CAATGTTAATGTCCTTCGTACTAAGGAATTATCAGAATTGATAAACAATAGGATGAGTTTAGTAACAACTTCTCAAAACACCTTCAATGATGATACTCTCATCTACTGTC
AGTATGGCATCATATGATATTGATCTCAACATGTCCTCTAAGGGAAAAACTTTCCCAATAAGCATCAAGTTCAAATATACAGACATTCTAACGCTGATGCAATTTTGATC
AAAACAATGGCTGGAAGTATACAGCAAAGGAAACACAGAACGAGCCATGGAAAAATCATCAGTCTTGGCCACTTCCAAAGCTATCTCGGCTATGGTTGTCATTCTGCCAT
CCAAGGTTTTGCCCATCATGATCCATATCTCCTGGATAAAACTTACCCCGCTTGTTATGCGACAAGAACGCTGGGGGTCGTGGGGGTGGGGGCAATTGTATTAGAGGCTG
TGGTGCCAATATACCTTTACCATCCCTTAAGTCCATCGTACTACCTGATGCATGCCCCTGATCCATCCCTTTGTTGGCCATAGACGAGACAGTAGAATTCACAGGCATGC
CTCCAGGGCTGCGTCTATAATCTCCTGGATAAAATCCATGTTTTCCTTGAGAATCAGGCGTATTAGCAAAAGCATCAGGCATTCCTTGACCTCCAGTAGCATGATCGTAA
GGAATACCGGCTTGTCCAGCATTCCAAG
Protein sequenceShow/hide protein sequence
MDEASNGGMLYHEVQESKLCAVHCVNTVLQGPFFSEFDLAALAYDLDRKERQMMLAGSTTGDFLSEESHNVSLDGDFSIQVLQKALEVWDLQVIALNSPAAELAQIDPEL
ESAFICHLQNHWFCIRKVNGEWYNFDSLYAAPQNLSKFYLSAYLDSLKGFGWSIFIVRGNFPNDFPISCSEASNGYGQWLTPEDADRITKSCNSTQARPPLGINWTKPQD
TFLSYGDAEMLMDVEDEDFKAAIAASLMDSPAVMAAGVANPHNEPAVSSTQAASPQNNIPAVFPKAFTHQDVPAVAPKAATPKDVPVVSTKPASPQDMPVVTPEASISQN
VRALSPDAAATLHADSAAKATTPNNLKVCTEVAVHQNEPANESVGNADAAFGESGLADNAECAVSSPRKKISLLLMGSSDRLFCRQRTLHEIVGGGIVADVILWRRKDLT
LWILSITVATWVVFERCGYTLLSLISSVLLLLVTIIFLWAKSASILNRPAPPLPELHLSEDMVNEAASFIRSHVNDFLSVSQDIAMGKNPRLFFKVAACLWVISVISGLT
DLITLCYTSLLLVLTIPALYEKYEDYVDRQVVLMYRKLHQFYVKLDEKRVLTYQQWILEKEKLS