CuGenDBv2

Sed0024074 (gene) of Chayote v1 genome

Gene ID: Sed0024074
Organism: Sechium edule (Chayote v1)
Description: Polyglutamine tract-binding protein 1
Genome location: LG12:7389916..7401905
RNA-Seq Expression: Sed0024074
Synteny: Sed0024074
Gene Ontology terms:
GO:0000380 - alternative mRNA splicing, via spliceosome (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0016604 - nuclear body (cellular component)
GO:0005515 - protein binding (molecular function)
GO:0043021 - ribonucleoprotein complex binding (molecular function)
InterPro domains:
IPR001202 - WW domain
IPR036020 - WW domain superfamily
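
The genome location field uses a `sequence:start..end` layout. As a minimal sketch, the string can be split into its parts and the length of the locus computed. The helper names `parse_location` and `span_length` are hypothetical, and the length assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re
from typing import Tuple


def parse_location(loc: str) -> Tuple[str, int, int]:
    """Split a 'SEQ:START..END' string into (seq_id, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    seq_id, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return seq_id, start, end


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seq_id, start, end = parse_location("LG12:7389916..7401905")
print(seq_id, start, end, span_length(start, end))
# LG12 7389916 7401905 11990
```

Under that assumption the locus spans 11,990 bp on LG12, which is longer than the mRNA listed further down the record.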


Homology
GenBank top hits | e value | % identity | Alignment
KAG6580903.1 Polyglutamine-binding protein 1, partial [Cucurbita argyrosperma subsp. sororia] | e value: 8.3e-251 | % identity: 77.31
Query:  MPTSTAAVSVSEDSSKTTIGSSL------EPGAAQSQSHALNEPRQLAMFAK-----QTGEARSSTA-------------DQNNVHHDGVSNTGVSSSTK
        MPTSTAA++   DSSKTTIGSS+      E G+AQSQS+A NE ++L  F       Q GE  SS               DQN V HDGV N  VSSS+K
Subjt:  MPTSTAAVSVSEDSSKTTIGSSL------EPGAAQSQSHALNEPRQLAMFAK-----QTGEARSSTA-------------DQNNVHHDGVSNTGVSSSTK

Query:  FGSHVGDSRDIDIAVQDAVLREQELATQNIIRSQRDSGGADGLPDERSDIFSERYDPSSIKEHLMKITSVHRAEMAMKRGKLNAPEEGNLEIGNGYGVPG
        FGSHV D+RDID AV+DAVLREQELATQNIIRS+RDS  ADGLP+ERSDIFSERYDPS++KEHL+KITS HRAEMAMKRGKLN PEEGNLEIGNGYGVPG
Subjt:  FGSHVGDSRDIDIAVQDAVLREQELATQNIIRSQRDSGGADGLPDERSDIFSERYDPSSIKEHLMKITSVHRAEMAMKRGKLNAPEEGNLEIGNGYGVPG

Query:  GCAFYGASKPGIATYGNNAIGQNIQEEVREAEQSSAVKALPEYLKQKLKARGILKEDTQHDHSKKSDAISNQPLQGKKLHHGWVEAKDPGSGVSYYYNES
        GCAFYGASKPGI T+GNN I Q IQ +VREAEQS + K LPEYLKQKLKARGILKED +H +S  SDAISNQ LQG+KL HGWVEAKDPGSGVSYYYNES
Subjt:  GCAFYGASKPGIATYGNNAIGQNIQEEVREAEQSSAVKALPEYLKQKLKARGILKEDTQHDHSKKSDAISNQPLQGKKLHHGWVEAKDPGSGVSYYYNES

Query:  TGKSQWERPSESSFSSQLSSAVPLPEDWMEAVDEATGLKYYYNVRTQVTQWERPVASHKVTSTHSTDNSPGSWNDQTSEQSKCVTCGRVMILTQGSRCCN
        TGKSQWERP+ESSF  QLSSAV LPEDWMEAVD+ TG KYYYN RTQVTQWE PVASH+ T  HS  ++PGSWN+QTS QSKCVTCG  M L QGSR CN
Subjt:  TGKSQWERPSESSFSSQLSSAVPLPEDWMEAVDEATGLKYYYNVRTQVTQWERPVASHKVTSTHSTDNSPGSWNDQTSEQSKCVTCGRVMILTQGSRCCN

Query:  DCISGVSTSSTDEKWQDQLSVPNKCMGCGGWGVGLVQAWGYCNHCTRILRLPQCQYLPTSSSNNQQKTENIKHSADTLIKKSATDRSKWKPPMGKGGKRE
         C SGVSTSST+ KWQDQLS  +KCMGCGGWG+GLVQAWGYCNHCTR L LPQCQYLPTS+  NQQKTENIK++AD  IKKSA+DRSK KPP+GKGGKRE
Subjt:  DCISGVSTSSTDEKWQDQLSVPNKCMGCGGWGVGLVQAWGYCNHCTRILRLPQCQYLPTSSSNNQQKTENIKHSADTLIKKSATDRSKWKPPMGKGGKRE

Query:  NKKRSYTEDDELDPMDPSAYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSNYAPISKRGDGSDGLGDAD
        ++KRS++EDDELDPMDPS+YSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSS+YAPISKRGDGSDGLGDAD
Subjt:  NKKRSYTEDDELDPMDPSAYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSNYAPISKRGDGSDGLGDAD

XP_022935213.1 uncharacterized protein LOC111442162 [Cucurbita moschata] | e value: 6.4e-251 | % identity: 77.31
Query:  MPTSTAAVSVSEDSSKTTIGSSL------EPGAAQSQSHALNEPRQLAMFAK-----QTGEARSSTA-------------DQNNVHHDGVSNTGVSSSTK
        MPTSTAA++   DSSKTTIGSS+      E G+AQSQS+A NE ++L  F       Q GE RSS               DQN V HDGV N  VSSS+K
Subjt:  MPTSTAAVSVSEDSSKTTIGSSL------EPGAAQSQSHALNEPRQLAMFAK-----QTGEARSSTA-------------DQNNVHHDGVSNTGVSSSTK

Query:  FGSHVGDSRDIDIAVQDAVLREQELATQNIIRSQRDSGGADGLPDERSDIFSERYDPSSIKEHLMKITSVHRAEMAMKRGKLNAPEEGNLEIGNGYGVPG
        FGSHV D+RDID AV+DAVLREQELATQNIIRS+RDS  ADGLP+ERSDIFSERYDPS++KEHL+KITS HRAEMAMKRGKLN PEEGNLEIGNGYGVPG
Subjt:  FGSHVGDSRDIDIAVQDAVLREQELATQNIIRSQRDSGGADGLPDERSDIFSERYDPSSIKEHLMKITSVHRAEMAMKRGKLNAPEEGNLEIGNGYGVPG

Query:  GCAFYGASKPGIATYGNNAIGQNIQEEVREAEQSSAVKALPEYLKQKLKARGILKEDTQHDHSKKSDAISNQPLQGKKLHHGWVEAKDPGSGVSYYYNES
        GCAFYGASKPGI T+GNN I Q IQ +VREAEQS + K LPEYLKQKLKARGILKED +H +S  SDAISNQ LQG+KL HGWVEAKDPGSGVSYYYNES
Subjt:  GCAFYGASKPGIATYGNNAIGQNIQEEVREAEQSSAVKALPEYLKQKLKARGILKEDTQHDHSKKSDAISNQPLQGKKLHHGWVEAKDPGSGVSYYYNES

Query:  TGKSQWERPSESSFSSQLSSAVPLPEDWMEAVDEATGLKYYYNVRTQVTQWERPVASHKVTSTHSTDNSPGSWNDQTSEQSKCVTCGRVMILTQGSRCCN
        TGKSQWERP+ESSF  QLSSAV LPEDWMEAVD+ TG +YYYN RTQVTQWE PVASH+ T  HST ++PGSWNDQTS QSKCVTCG  M L QG+R CN
Subjt:  TGKSQWERPSESSFSSQLSSAVPLPEDWMEAVDEATGLKYYYNVRTQVTQWERPVASHKVTSTHSTDNSPGSWNDQTSEQSKCVTCGRVMILTQGSRCCN

Query:  DCISGVSTSSTDEKWQDQLSVPNKCMGCGGWGVGLVQAWGYCNHCTRILRLPQCQYLPTSSSNNQQKTENIKHSADTLIKKSATDRSKWKPPMGKGGKRE
         C SGVSTSST+ KWQDQ S  +KCMGCGGWG+GLVQAWGYCNHCTR L LPQCQYLPTS+  NQQKTENIK++AD  IKKSA+DRSK KPP+GKGGKRE
Subjt:  DCISGVSTSSTDEKWQDQLSVPNKCMGCGGWGVGLVQAWGYCNHCTRILRLPQCQYLPTSSSNNQQKTENIKHSADTLIKKSATDRSKWKPPMGKGGKRE

Query:  NKKRSYTEDDELDPMDPSAYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSNYAPISKRGDGSDGLGDAD
        ++KRS++EDDELDPMDPS+YSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSS+YAPISKRGDGSDGLGDAD
Subjt:  NKKRSYTEDDELDPMDPSAYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSNYAPISKRGDGSDGLGDAD

XP_022983732.1 uncharacterized protein LOC111482260 [Cucurbita maxima] | e value: 4.6e-249 | % identity: 76.64
Query:  MPTSTAAVSVSEDSSKTTIGSSL------EPGAAQSQSHALNEPRQLAMFAK-----QTGEARSSTA-------------DQNNVHHDGVSNTGVSSSTK
        MPTSTAA++ S DSSKTTIGSS+      E G+AQSQS+A NE ++L  F       Q GE  SS               DQN V H GV N  VSSS+K
Subjt:  MPTSTAAVSVSEDSSKTTIGSSL------EPGAAQSQSHALNEPRQLAMFAK-----QTGEARSSTA-------------DQNNVHHDGVSNTGVSSSTK

Query:  FGSHVGDSRDIDIAVQDAVLREQELATQNIIRSQRDSGGADGLPDERSDIFSERYDPSSIKEHLMKITSVHRAEMAMKRGKLNAPEEGNLEIGNGYGVPG
        FGSHV D+RDID AV+DAVLREQELATQNIIRSQRDS GADGLP+ERSDIFSERYDPS++KEHL+KIT+ HRAEMAMKRGKLN PEEGNLEIGNGYGVPG
Subjt:  FGSHVGDSRDIDIAVQDAVLREQELATQNIIRSQRDSGGADGLPDERSDIFSERYDPSSIKEHLMKITSVHRAEMAMKRGKLNAPEEGNLEIGNGYGVPG

Query:  GCAFYGASKPGIATYGNNAIGQNIQEEVREAEQSSAVKALPEYLKQKLKARGILKEDTQHDHSKKSDAISNQPLQGKKLHHGWVEAKDPGSGVSYYYNES
        GCAFYGASKPGI T+GNN I Q IQ +VRE +QSS+ K LPEYLKQKLKARGILKED +H +S  +DAISNQ LQG+KL HGWVEAKDPGSG SYYYNES
Subjt:  GCAFYGASKPGIATYGNNAIGQNIQEEVREAEQSSAVKALPEYLKQKLKARGILKEDTQHDHSKKSDAISNQPLQGKKLHHGWVEAKDPGSGVSYYYNES

Query:  TGKSQWERPSESSFSSQLSSAVPLPEDWMEAVDEATGLKYYYNVRTQVTQWERPVASHKVTSTHSTDNSPGSWNDQTSEQSKCVTCGRVMILTQGSRCCN
        TGKSQWERP+ESSF  QLSSAV LPEDWMEAVD+ TG KYYYN RTQVTQWE P ASH+ T  HS   +PGSWNDQTS QSKCVTCG  M L QGSR CN
Subjt:  TGKSQWERPSESSFSSQLSSAVPLPEDWMEAVDEATGLKYYYNVRTQVTQWERPVASHKVTSTHSTDNSPGSWNDQTSEQSKCVTCGRVMILTQGSRCCN

Query:  DCISGVSTSSTDEKWQDQLSVPNKCMGCGGWGVGLVQAWGYCNHCTRILRLPQCQYLPTSSSNNQQKTENIKHSADTLIKKSATDRSKWKPPMGKGGKRE
         C SGVSTSST+ KWQDQ S  +KCMGCGGWG+GLVQAWGYCNHCTR L LPQCQYLPTS+ NNQ KTENIK+++D  IKKSA+DRSK KPP+GKGGKRE
Subjt:  DCISGVSTSSTDEKWQDQLSVPNKCMGCGGWGVGLVQAWGYCNHCTRILRLPQCQYLPTSSSNNQQKTENIKHSADTLIKKSATDRSKWKPPMGKGGKRE

Query:  NKKRSYTEDDELDPMDPSAYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSNYAPISKRGDGSDGLGDAD
        ++KRS++EDDELDPMDPS+YSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSS+YAPISKRGDGSDGLGDAD
Subjt:  NKKRSYTEDDELDPMDPSAYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSNYAPISKRGDGSDGLGDAD

XP_023527950.1 uncharacterized protein LOC111791012 [Cucurbita pepo subsp. pepo] | e value: 2.7e-249 | % identity: 77.31
Query:  MPTSTAAVSVSEDSSKTTIGSSL------EPGAAQSQSHALNEPRQLAMFAK-----QTGEARSSTA-------------DQNNVHHDGVSNTGVSSSTK
        MPTSTAA++ S DSSKTTIGSS+      E G+AQSQS A NE ++L  F       Q GE  SS               DQN V HDGV N  VS+S+K
Subjt:  MPTSTAAVSVSEDSSKTTIGSSL------EPGAAQSQSHALNEPRQLAMFAK-----QTGEARSSTA-------------DQNNVHHDGVSNTGVSSSTK

Query:  FGSHVGDSRDIDIAVQDAVLREQELATQNIIRSQRDSGGADGLPDERSDIFSERYDPSSIKEHLMKITSVHRAEMAMKRGKLNAPEEGNLEIGNGYGVPG
        FGSHV D+RDID AV+DAVLREQELATQNIIRS+RDS  ADGLP+ERSDIFSERYDPS++KEHL+KITS HRAEMAMKRGKLN PEEGNLEIGNGYGVPG
Subjt:  FGSHVGDSRDIDIAVQDAVLREQELATQNIIRSQRDSGGADGLPDERSDIFSERYDPSSIKEHLMKITSVHRAEMAMKRGKLNAPEEGNLEIGNGYGVPG

Query:  GCAFYGASKPGIATYGNNAIGQNIQEEVREAEQSSAVKALPEYLKQKLKARGILKEDTQHDHSKKSDAISNQPLQGKKLHHGWVEAKDPGSGVSYYYNES
        GCAFYGASKPGI T+GNN I Q IQ +VREAEQSS+ K LPEYLKQKLKARGILKED +H +S  SDAISNQ LQG+KL HGWVEAKDPGSGVSYYYNES
Subjt:  GCAFYGASKPGIATYGNNAIGQNIQEEVREAEQSSAVKALPEYLKQKLKARGILKEDTQHDHSKKSDAISNQPLQGKKLHHGWVEAKDPGSGVSYYYNES

Query:  TGKSQWERPSESSFSSQLSSAVPLPEDWMEAVDEATGLKYYYNVRTQVTQWERPVASHKVTSTHSTDNSPGSWNDQTSEQSKCVTCGRVMILTQGSRCCN
        TGKSQWERP+ESSF  QLSSAV LPEDWMEAVD+ TG KYYYN RTQVTQWE PVASH+ T  HS  ++PGSWNDQTS QSKCVTCG  M L QGSR CN
Subjt:  TGKSQWERPSESSFSSQLSSAVPLPEDWMEAVDEATGLKYYYNVRTQVTQWERPVASHKVTSTHSTDNSPGSWNDQTSEQSKCVTCGRVMILTQGSRCCN

Query:  DCISGVSTSSTDEKWQDQLSVPNKCMGCGGWGVGLVQAWGYCNHCTRILRLPQCQYLPTSSSNNQQKTENIKHSADTLIKKSATDRSKWKPPMGKGGKRE
         C SGVSTSST+  WQDQ S  +KCMGCGGWG+GLVQAWGYCNHCTR L LPQCQYLPTS+  NQQKTENIK++AD  IKKSA+DRSK KPP+GKGGKRE
Subjt:  DCISGVSTSSTDEKWQDQLSVPNKCMGCGGWGVGLVQAWGYCNHCTRILRLPQCQYLPTSSSNNQQKTENIKHSADTLIKKSATDRSKWKPPMGKGGKRE

Query:  NKKRSYTEDDELDPMDPSAYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSNYAPISKRGDGSDGLGDAD
        ++KRS++EDDELDPMDPS+YSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSS+YAPISKRGDGSDGLGDAD
Subjt:  NKKRSYTEDDELDPMDPSAYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSNYAPISKRGDGSDGLGDAD

XP_038906175.1 uncharacterized protein LOC120092051 isoform X3 [Benincasa hispida] | e value: 4.1e-250 | % identity: 78.1
Query:  MPTSTAAVSVSEDSSKTTIGSSLEP------GAAQSQSHALNEPRQLAMFAK-----QTGEA-------RSSTADQNNVHHDGVSNTGVSSSTKFGSHVG
        MPT+TAA++ S DSS T IGSS+E        AAQSQ H  NE ++L    K     Q GEA       RS   D + V HD V N  VSSS+KF SHV 
Subjt:  MPTSTAAVSVSEDSSKTTIGSSLEP------GAAQSQSHALNEPRQLAMFAK-----QTGEA-------RSSTADQNNVHHDGVSNTGVSSSTKFGSHVG

Query:  DSRDIDIAVQDAVLREQELATQNIIRSQRDSGGADGLPDERSDIFSERYDPSSIKEHLMKITSVHRAEMAMKRGKLNAPEEGNLEIGNGYGVPGGCAFYG
        D+RDID AVQDAVLREQELATQNIIRSQRDS GADGLP ERSDIFSERYDPS+IKEHL+KITS HRAEMAMKRGK N PEEGNLEIGNGYGVPGGCAFYG
Subjt:  DSRDIDIAVQDAVLREQELATQNIIRSQRDSGGADGLPDERSDIFSERYDPSSIKEHLMKITSVHRAEMAMKRGKLNAPEEGNLEIGNGYGVPGGCAFYG

Query:  ASKPGIATYGNNAIGQNIQEEVREAEQSSAVKALPEYLKQKLKARGILKEDTQHDHSKKSDAISNQPLQGKKLHHGWVEAKDPGSGVSYYYNESTGKSQW
        ASKPG+   GNN IGQ IQ +VRE EQSSAVKALPEYLKQKL+ARGILKE+ +H +S  SDAISNQ LQG+KL HGWVEAKDPGSGVSYYYNES+GKSQW
Subjt:  ASKPGIATYGNNAIGQNIQEEVREAEQSSAVKALPEYLKQKLKARGILKEDTQHDHSKKSDAISNQPLQGKKLHHGWVEAKDPGSGVSYYYNESTGKSQW

Query:  ERPSESSFSSQLSSAVPLPEDWMEAVDEATGLKYYYNVRTQVTQWERPVASHKVTSTHSTDNSPGSWNDQTSEQSKCVTCGRVMILTQGSRCCNDCISGV
        ERPSESS  +QLSSA  LPEDWMEA+D+ATGLKYYYN+RTQVTQWE PVASH+ T THS DN  GSWN+QT EQSKC+TCG  + L QGSR CN C SGV
Subjt:  ERPSESSFSSQLSSAVPLPEDWMEAVDEATGLKYYYNVRTQVTQWERPVASHKVTSTHSTDNSPGSWNDQTSEQSKCVTCGRVMILTQGSRCCNDCISGV

Query:  STSSTDEKWQDQLSVPNKCMGCGGWGVGLVQAWGYCNHCTRILRLPQCQYLPTSSSNNQQKTENIKHSADTLIKKSATDRSKWKPPMGKGGKRENKKRSY
        STSST+ +WQDQ S  NKCMGC GWG+GLVQAWGYCNHCTRIL LPQCQYLPTS+ +NQQKTENIKHSAD  IKKSATD SKWKPP+GKGGKRE++KRSY
Subjt:  STSSTDEKWQDQLSVPNKCMGCGGWGVGLVQAWGYCNHCTRILRLPQCQYLPTSSSNNQQKTENIKHSADTLIKKSATDRSKWKPPMGKGGKRENKKRSY

Query:  TEDDELDPMDPSAYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSNYAPISKRGDGSDGLGDAD
        +EDDELDPMDPSAYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSS+YAPISKRGDGSDGLGDAD
Subjt:  TEDDELDPMDPSAYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSNYAPISKRGDGSDGLGDAD

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LFL2 Polyglutamine tract-binding protein 1 | e value: 1.3e-244 | % identity: 75.17
Query:  MPTSTAAVSVSEDSSKTTIGSSL------EPGAAQSQSHALNEPRQLAMFAK-----QTGEA-------------RSSTADQNNV-HHDGVSNTGVSSST
        MPTSTA ++ S DSS T IGSS       E  AAQSQ  A NE ++L   +K     Q GEA             RSS  DQN V HH   +N  VSSS+
Subjt:  MPTSTAAVSVSEDSSKTTIGSSL------EPGAAQSQSHALNEPRQLAMFAK-----QTGEA-------------RSSTADQNNV-HHDGVSNTGVSSST

Query:  KFGSHVGDSRDIDIAVQDAVLREQELATQNIIRSQRDSGGADGLPDERSDIFSERYDPSSIKEHLMKITSVHRAEMAMKRGKLNAPEEGNLEIGNGYGVP
         F S+V D+RDIDIAVQDAVLREQELATQNIIRSQRDS GADGLP ERSDIFSERYDPSS+KEHL+KITS HRAEMA+KRGKLN PEEGNLEIGNGYGVP
Subjt:  KFGSHVGDSRDIDIAVQDAVLREQELATQNIIRSQRDSGGADGLPDERSDIFSERYDPSSIKEHLMKITSVHRAEMAMKRGKLNAPEEGNLEIGNGYGVP

Query:  GGCAFYGASKPGIATYGNNAIGQNIQEEVREAEQSSAVKALPEYLKQKLKARGILKEDTQHDHSKK----SDAISNQPLQGKKLHHGWVEAKDPGSGVSY
        GGCAFYGASKPGI   GNN  GQ IQ +++EAEQSSA KALPEYLKQKL+ARGILKED +H +S +    SDA+SN  LQG+KL HGWVEAKDP SGVSY
Subjt:  GGCAFYGASKPGIATYGNNAIGQNIQEEVREAEQSSAVKALPEYLKQKLKARGILKEDTQHDHSKK----SDAISNQPLQGKKLHHGWVEAKDPGSGVSY

Query:  YYNESTGKSQWERPSESSFSSQLSSAVPLPEDWMEAVDEATGLKYYYNVRTQVTQWERPVASHKVTSTHSTDNSPGSWNDQTSEQSKCVTCGRVMILTQG
        YYNES+GKSQWERPSE S ++QLSSAV LPEDWMEA+D+ +G+KYYYN+RT VTQWERPVASH+ T THS D  PG WNDQT EQSKC+TCG  M L QG
Subjt:  YYNESTGKSQWERPSESSFSSQLSSAVPLPEDWMEAVDEATGLKYYYNVRTQVTQWERPVASHKVTSTHSTDNSPGSWNDQTSEQSKCVTCGRVMILTQG

Query:  SRCCNDCISGVSTSSTDEKWQDQLSVPNKCMGCGGWGVGLVQAWGYCNHCTRILRLPQCQYLPTSSSNNQQKTENIKHSADTLIKKSATDRSKWKPPMGK
        SR CN C SGVSTSST+  WQDQ S  NKCMGCGGWG+GLVQAWGYC HCTRIL LPQCQYLPT++ +NQQK EN+KHSAD  IKKS TDRSKWKPP+GK
Subjt:  SRCCNDCISGVSTSSTDEKWQDQLSVPNKCMGCGGWGVGLVQAWGYCNHCTRILRLPQCQYLPTSSSNNQQKTENIKHSADTLIKKSATDRSKWKPPMGK

Query:  GGKRENKKRSYTEDDELDPMDPSAYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSNYAPISKRGDGSDGLGDAD
        GGKRE++KRSY+EDDELDPMDPS+YSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSS+YAPISKRGDGSDGLGDAD
Subjt:  GGKRENKKRSYTEDDELDPMDPSAYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSNYAPISKRGDGSDGLGDAD

A0A5D3DPP7 Polyglutamine tract-binding protein 1 | e value: 1.0e-246 | % identity: 75.59
Query:  MPTSTAAVSVSEDSSKTTIGSSLEPGAAQSQSHALNEPRQLAMFAK-----QTGEARSSTA-------------DQNNVHHDGV-SNTGVSSSTKFGSHV
        MPTST A++ S DSS T IGSS E  + +  + A NE ++L  F+K     Q GEA+ S A             DQN V H+GV +N  VS+S+ F S+V
Subjt:  MPTSTAAVSVSEDSSKTTIGSSLEPGAAQSQSHALNEPRQLAMFAK-----QTGEARSSTA-------------DQNNVHHDGV-SNTGVSSSTKFGSHV

Query:  GDSRDIDIAVQDAVLREQELATQNIIRSQRDSGGADGLPDERSDIFSERYDPSSIKEHLMKITSVHRAEMAMKRGKLNAPEEGNLEIGNGYGVPGGCAFY
         D RDI+IAVQDAVLREQELATQNIIRSQR+S GADGLP E+SDIFSERYDPS+IKEHL+KITS HRAEMAMKRGKLN PEEGNLEIGNGYGVPGGCA Y
Subjt:  GDSRDIDIAVQDAVLREQELATQNIIRSQRDSGGADGLPDERSDIFSERYDPSSIKEHLMKITSVHRAEMAMKRGKLNAPEEGNLEIGNGYGVPGGCAFY

Query:  GASKPGIATYGNNAIGQNIQEEVREAEQSSAVKALPEYLKQKLKARGILKEDTQHDHSKKSDAISNQPLQGKKLHHGWVEAKDPGSGVSYYYNESTGKSQ
        GASKPGI   GNN  GQ IQ +V+E EQSSA KALPEYLKQKL+ARGILKED +H +   SDA+SN  L G+KL HGWVEAKDP SGVSYYYNES+GKSQ
Subjt:  GASKPGIATYGNNAIGQNIQEEVREAEQSSAVKALPEYLKQKLKARGILKEDTQHDHSKKSDAISNQPLQGKKLHHGWVEAKDPGSGVSYYYNESTGKSQ

Query:  WERPSESSFSSQLSSAVPLPEDWMEAVDEATGLKYYYNVRTQVTQWERPVASHKVTSTHSTDNSPGSWNDQTSEQSKCVTCGRVMILTQGSRCCNDCISG
        WERPSE S  +QLSSAV LPEDWMEA+D+ +GLKYYYN+RT +TQWERPVASH+ T THS D  PG WNDQT EQSKC+TCG  M L QGSR CN C SG
Subjt:  WERPSESSFSSQLSSAVPLPEDWMEAVDEATGLKYYYNVRTQVTQWERPVASHKVTSTHSTDNSPGSWNDQTSEQSKCVTCGRVMILTQGSRCCNDCISG

Query:  VSTSSTDEKWQDQLSVPNKCMGCGGWGVGLVQAWGYCNHCTRILRLPQCQYLPTSSSNNQQKTENIKHSADTLIKKSATDRSKWKPPMGKGGKRENKKRS
        VSTSST+  WQDQ S  NKCMGCGGWG+GLVQAWGYCNHCTRIL LPQCQYLPT++ +NQQKTENIKHSAD  IKKSATDRSKWKPPMGKGGKRE++KRS
Subjt:  VSTSSTDEKWQDQLSVPNKCMGCGGWGVGLVQAWGYCNHCTRILRLPQCQYLPTSSSNNQQKTENIKHSADTLIKKSATDRSKWKPPMGKGGKRENKKRS

Query:  YTEDDELDPMDPSAYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSNYAPISKRGDGSDGLGDAD
        Y+EDDELDPMDPS+YSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSS+YAPISKRGDGSDGLGDAD
Subjt:  YTEDDELDPMDPSAYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSNYAPISKRGDGSDGLGDAD

A0A6J1DKA8 Polyglutamine tract-binding protein 1 | e value: 2.0e-242 | % identity: 72.76
Query:  MPTSTAAVSVSEDSSKTTIGSSL------EPGAAQSQSHALNEPRQLAMFAK-----QTGEARSSTA---------------------------------
        MPTS AA +VS DS  TTIGSS+      E GAAQSQS+A NE ++L    K     Q GEA+SS A                                 
Subjt:  MPTSTAAVSVSEDSSKTTIGSSL------EPGAAQSQSHALNEPRQLAMFAK-----QTGEARSSTA---------------------------------

Query:  -------DQNNVHHDGVSNTGVSSSTKFGSHVGDSRDIDIAVQDAVLREQELATQNIIRSQRDSGGADGLPDERSDIFSERYDPSSIKEHLMKITSVHRA
               DQNNV HDGV     SSS+KFGSHVGD+RDID AVQDAVLREQELATQNIIRSQR+S GADG P ERSDIFSERYDPS++KEHL+KITS HRA
Subjt:  -------DQNNVHHDGVSNTGVSSSTKFGSHVGDSRDIDIAVQDAVLREQELATQNIIRSQRDSGGADGLPDERSDIFSERYDPSSIKEHLMKITSVHRA

Query:  EMAMKRGKLNAPEEGNLEIGNGYGVPGGCAFYGASKPGIATYGNNAIGQNIQEEVREAEQSSAVKALPEYLKQKLKARGILKEDTQHDHSKKSDAISNQP
        EMAMKRGK N PEEGNLEIGNGYGVPGGCAFYGASKPGI T GNNAIG  IQ +V EAEQ+SA K LPEYLKQKL+ARGILKEDTQ  +S  SDAISNQP
Subjt:  EMAMKRGKLNAPEEGNLEIGNGYGVPGGCAFYGASKPGIATYGNNAIGQNIQEEVREAEQSSAVKALPEYLKQKLKARGILKEDTQHDHSKKSDAISNQP

Query:  LQGKKLHHGWVEAKDPGSGVSYYYNESTGKSQWERPSESSFSSQLSSAVPLPEDWMEAVDEATGLKYYYNVRTQVTQWERPVASHKVTSTHSTDNSPGSW
        +QG KL  GWVEAKDP SGV YYYNESTGKSQWERPS+SSF  QL SAV LPEDWMEA+DE TGLKYYYNVRT VTQWE PV+SH+ T THS  N PG W
Subjt:  LQGKKLHHGWVEAKDPGSGVSYYYNESTGKSQWERPSESSFSSQLSSAVPLPEDWMEAVDEATGLKYYYNVRTQVTQWERPVASHKVTSTHSTDNSPGSW

Query:  NDQTSEQSKCVTCGRVMILTQGSRCCNDCISGVSTSSTDEKWQDQLSVPNKCMGCGGWGVGLVQAWGYCNHCTRILRLPQCQYLPTSSSNN--QQKTENI
        N+QT EQ+KC+ CGR M L QGSR CN      STSST+  WQ+Q    NKCMGCGGWG+GLVQ+WGYCNHCTRILRLPQC+YLPTSS +N  QQKTE+I
Subjt:  NDQTSEQSKCVTCGRVMILTQGSRCCNDCISGVSTSSTDEKWQDQLSVPNKCMGCGGWGVGLVQAWGYCNHCTRILRLPQCQYLPTSSSNN--QQKTENI

Query:  KHSADTLIKKSATDRSKWKPPMGKGGKRENKKRSYTEDDELDPMDPSAYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQT
         HSAD  IKKSA DRSKWKPPMGKGGKRE++KRSY+EDDELDPMDPS+YSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQT
Subjt:  KHSADTLIKKSATDRSKWKPPMGKGGKRENKKRSYTEDDELDPMDPSAYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQT

Query:  KKGSSNYAPISKRGDGSDGLGDAD
        KKGSS+YAPISK+GDGSDGLGDAD
Subjt:  KKGSSNYAPISKRGDGSDGLGDAD

A0A6J1F9X9 Polyglutamine tract-binding protein 1 | e value: 3.1e-251 | % identity: 77.31
Query:  MPTSTAAVSVSEDSSKTTIGSSL------EPGAAQSQSHALNEPRQLAMFAK-----QTGEARSSTA-------------DQNNVHHDGVSNTGVSSSTK
        MPTSTAA++   DSSKTTIGSS+      E G+AQSQS+A NE ++L  F       Q GE RSS               DQN V HDGV N  VSSS+K
Subjt:  MPTSTAAVSVSEDSSKTTIGSSL------EPGAAQSQSHALNEPRQLAMFAK-----QTGEARSSTA-------------DQNNVHHDGVSNTGVSSSTK

Query:  FGSHVGDSRDIDIAVQDAVLREQELATQNIIRSQRDSGGADGLPDERSDIFSERYDPSSIKEHLMKITSVHRAEMAMKRGKLNAPEEGNLEIGNGYGVPG
        FGSHV D+RDID AV+DAVLREQELATQNIIRS+RDS  ADGLP+ERSDIFSERYDPS++KEHL+KITS HRAEMAMKRGKLN PEEGNLEIGNGYGVPG
Subjt:  FGSHVGDSRDIDIAVQDAVLREQELATQNIIRSQRDSGGADGLPDERSDIFSERYDPSSIKEHLMKITSVHRAEMAMKRGKLNAPEEGNLEIGNGYGVPG

Query:  GCAFYGASKPGIATYGNNAIGQNIQEEVREAEQSSAVKALPEYLKQKLKARGILKEDTQHDHSKKSDAISNQPLQGKKLHHGWVEAKDPGSGVSYYYNES
        GCAFYGASKPGI T+GNN I Q IQ +VREAEQS + K LPEYLKQKLKARGILKED +H +S  SDAISNQ LQG+KL HGWVEAKDPGSGVSYYYNES
Subjt:  GCAFYGASKPGIATYGNNAIGQNIQEEVREAEQSSAVKALPEYLKQKLKARGILKEDTQHDHSKKSDAISNQPLQGKKLHHGWVEAKDPGSGVSYYYNES

Query:  TGKSQWERPSESSFSSQLSSAVPLPEDWMEAVDEATGLKYYYNVRTQVTQWERPVASHKVTSTHSTDNSPGSWNDQTSEQSKCVTCGRVMILTQGSRCCN
        TGKSQWERP+ESSF  QLSSAV LPEDWMEAVD+ TG +YYYN RTQVTQWE PVASH+ T  HST ++PGSWNDQTS QSKCVTCG  M L QG+R CN
Subjt:  TGKSQWERPSESSFSSQLSSAVPLPEDWMEAVDEATGLKYYYNVRTQVTQWERPVASHKVTSTHSTDNSPGSWNDQTSEQSKCVTCGRVMILTQGSRCCN

Query:  DCISGVSTSSTDEKWQDQLSVPNKCMGCGGWGVGLVQAWGYCNHCTRILRLPQCQYLPTSSSNNQQKTENIKHSADTLIKKSATDRSKWKPPMGKGGKRE
         C SGVSTSST+ KWQDQ S  +KCMGCGGWG+GLVQAWGYCNHCTR L LPQCQYLPTS+  NQQKTENIK++AD  IKKSA+DRSK KPP+GKGGKRE
Subjt:  DCISGVSTSSTDEKWQDQLSVPNKCMGCGGWGVGLVQAWGYCNHCTRILRLPQCQYLPTSSSNNQQKTENIKHSADTLIKKSATDRSKWKPPMGKGGKRE

Query:  NKKRSYTEDDELDPMDPSAYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSNYAPISKRGDGSDGLGDAD
        ++KRS++EDDELDPMDPS+YSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSS+YAPISKRGDGSDGLGDAD
Subjt:  NKKRSYTEDDELDPMDPSAYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSNYAPISKRGDGSDGLGDAD

A0A6J1J063 Polyglutamine tract-binding protein 1 | e value: 2.2e-249 | % identity: 76.64
Query:  MPTSTAAVSVSEDSSKTTIGSSL------EPGAAQSQSHALNEPRQLAMFAK-----QTGEARSSTA-------------DQNNVHHDGVSNTGVSSSTK
        MPTSTAA++ S DSSKTTIGSS+      E G+AQSQS+A NE ++L  F       Q GE  SS               DQN V H GV N  VSSS+K
Subjt:  MPTSTAAVSVSEDSSKTTIGSSL------EPGAAQSQSHALNEPRQLAMFAK-----QTGEARSSTA-------------DQNNVHHDGVSNTGVSSSTK

Query:  FGSHVGDSRDIDIAVQDAVLREQELATQNIIRSQRDSGGADGLPDERSDIFSERYDPSSIKEHLMKITSVHRAEMAMKRGKLNAPEEGNLEIGNGYGVPG
        FGSHV D+RDID AV+DAVLREQELATQNIIRSQRDS GADGLP+ERSDIFSERYDPS++KEHL+KIT+ HRAEMAMKRGKLN PEEGNLEIGNGYGVPG
Subjt:  FGSHVGDSRDIDIAVQDAVLREQELATQNIIRSQRDSGGADGLPDERSDIFSERYDPSSIKEHLMKITSVHRAEMAMKRGKLNAPEEGNLEIGNGYGVPG

Query:  GCAFYGASKPGIATYGNNAIGQNIQEEVREAEQSSAVKALPEYLKQKLKARGILKEDTQHDHSKKSDAISNQPLQGKKLHHGWVEAKDPGSGVSYYYNES
        GCAFYGASKPGI T+GNN I Q IQ +VRE +QSS+ K LPEYLKQKLKARGILKED +H +S  +DAISNQ LQG+KL HGWVEAKDPGSG SYYYNES
Subjt:  GCAFYGASKPGIATYGNNAIGQNIQEEVREAEQSSAVKALPEYLKQKLKARGILKEDTQHDHSKKSDAISNQPLQGKKLHHGWVEAKDPGSGVSYYYNES

Query:  TGKSQWERPSESSFSSQLSSAVPLPEDWMEAVDEATGLKYYYNVRTQVTQWERPVASHKVTSTHSTDNSPGSWNDQTSEQSKCVTCGRVMILTQGSRCCN
        TGKSQWERP+ESSF  QLSSAV LPEDWMEAVD+ TG KYYYN RTQVTQWE P ASH+ T  HS   +PGSWNDQTS QSKCVTCG  M L QGSR CN
Subjt:  TGKSQWERPSESSFSSQLSSAVPLPEDWMEAVDEATGLKYYYNVRTQVTQWERPVASHKVTSTHSTDNSPGSWNDQTSEQSKCVTCGRVMILTQGSRCCN

Query:  DCISGVSTSSTDEKWQDQLSVPNKCMGCGGWGVGLVQAWGYCNHCTRILRLPQCQYLPTSSSNNQQKTENIKHSADTLIKKSATDRSKWKPPMGKGGKRE
         C SGVSTSST+ KWQDQ S  +KCMGCGGWG+GLVQAWGYCNHCTR L LPQCQYLPTS+ NNQ KTENIK+++D  IKKSA+DRSK KPP+GKGGKRE
Subjt:  DCISGVSTSSTDEKWQDQLSVPNKCMGCGGWGVGLVQAWGYCNHCTRILRLPQCQYLPTSSSNNQQKTENIKHSADTLIKKSATDRSKWKPPMGKGGKRE

Query:  NKKRSYTEDDELDPMDPSAYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSNYAPISKRGDGSDGLGDAD
        ++KRS++EDDELDPMDPS+YSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSS+YAPISKRGDGSDGLGDAD
Subjt:  NKKRSYTEDDELDPMDPSAYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSNYAPISKRGDGSDGLGDAD

SwissProt top hits | e value | % identity | Alignment
A1YFA7 Polyglutamine-binding protein 1 | e value: 6.0e-18 | % identity: 65.79
Query:  ENKKRSYTEDDELDPMDPSAYSDAPRGGWVVGL--KGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKK
        ++KK    +D+ELDPMDPS+YSDAPRG W  GL  +      ADTTA GPLFQQRPYPSPGAVLR NAE AS+TK+
Subjt:  ENKKRSYTEDDELDPMDPSAYSDAPRGGWVVGL--KGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKK

O60828 Polyglutamine-binding protein 1 | e value: 6.0e-18 | % identity: 65.79
Query:  ENKKRSYTEDDELDPMDPSAYSDAPRGGWVVGL--KGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKK
        ++KK    +D+ELDPMDPS+YSDAPRG W  GL  +      ADTTA GPLFQQRPYPSPGAVLR NAE AS+TK+
Subjt:  ENKKRSYTEDDELDPMDPSAYSDAPRGGWVVGL--KGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKK

Q2HJC9 Polyglutamine-binding protein 1 | e value: 2.7e-18 | % identity: 65.79
Query:  ENKKRSYTEDDELDPMDPSAYSDAPRGGWVVGL--KGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKK
        ++KK +  +D+ELDPMDPS+YSDAPRG W  GL  +      ADTTA GPLFQQRPYPSPGAVLR NAE AS+TK+
Subjt:  ENKKRSYTEDDELDPMDPSAYSDAPRGGWVVGL--KGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKK

Q6PCT5 Polyglutamine-binding protein 1 | e value: 2.1e-18 | % identity: 65.79
Query:  ENKKRSYTEDDELDPMDPSAYSDAPRGGWVVGL--KGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKK
        +NKK +  +D+ELDPMDPS+YSDAPRG W  GL  +      ADTTA GPLFQQRPYPSPGAVLR NAE AS++K+
Subjt:  ENKKRSYTEDDELDPMDPSAYSDAPRGGWVVGL--KGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKK

Q91VJ5 Polyglutamine-binding protein 1 | e value: 7.1e-19 | % identity: 67.11
Query:  ENKKRSYTEDDELDPMDPSAYSDAPRGGWVVGL--KGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKK
        +NKK +  +D+ELDPMDPS+YSDAPRG W  GL  +      ADTTA GPLFQQRPYPSPGAVLR NAE AS+TK+
Subjt:  ENKKRSYTEDDELDPMDPSAYSDAPRGGWVVGL--KGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKK

Arabidopsis top hits | e value | % identity | Alignment
AT2G41020.1 WW domain-containing protein | e value: 3.1e-110 | % identity: 46.45
Query:  SNTGVSSSTKFGSHVG--DSRDIDIAVQDAVLREQELATQNIIRSQRDSG-GADGLPDERSDIFSERYDPSSIKEHLMKITSVHRAEMAMKR-GKLNAPE
        + + V+S+  +GS +    S+DI+ A   A+LREQE+ TQ II+ QR++G    G     +DI  +R DP+++KEHL+K T+ HRAE A KR G ++   
Subjt:  SNTGVSSSTKFGSHVG--DSRDIDIAVQDAVLREQELATQNIIRSQRDSG-GADGLPDERSDIFSERYDPSSIKEHLMKITSVHRAEMAMKR-GKLNAPE

Query:  EGNLEIGNGYGVPGGCAFYGASKPGIATYGNNAIGQNIQEEVREAEQSSAVKALPEYLKQKLKARGILKEDTQHDHSKKSD--AIS-----NQPLQ--GK
        EGN+++GNGYG+PGG A+ G S                 E   + E ++A   LPEYLKQKLKARGIL++      S   D  A+S       P Q    
Subjt:  EGNLEIGNGYGVPGGCAFYGASKPGIATYGNNAIGQNIQEEVREAEQSSAVKALPEYLKQKLKARGILKEDTQHDHSKKSD--AIS-----NQPLQ--GK

Query:  KLHHGWVEAKDPGSGVSYYYNESTGKSQWERPSESSFSSQLSSAVPLPEDWMEAVDEATGLKYYYNVRTQVTQWERPVASHKVTSTHSTDNSPGSWNDQT
         L  GWV+AKDP SG +YYYN+ TG  QWERP E S+++  +  V   E+W+E  DEA+G KY+YN RT V+QWE P +  K  +T+S            
Subjt:  KLHHGWVEAKDPGSGVSYYYNESTGKSQWERPSESSFSSQLSSAVPLPEDWMEAVDEATGLKYYYNVRTQVTQWERPVASHKVTSTHSTDNSPGSWNDQT

Query:  SEQSKCVTCGRVMILTQGSRCCNDCISGVSTSSTDEKWQDQLSVPNKCMGCGGWGVGLVQAWGYCNHCTRILRLPQCQYLPTSSSNNQQKTENIKH--SA
                                  + V+ S+ + K +   S   +C GCGGWGVGLVQ WGYC HCTR+  LP+ Q+LP           ++ H  +A
Subjt:  SEQSKCVTCGRVMILTQGSRCCNDCISGVSTSSTDEKWQDQLSVPNKCMGCGGWGVGLVQAWGYCNHCTRILRLPQCQYLPTSSSNNQQKTENIKH--SA

Query:  DTLIKKSATDRSKWKPPMGKGGKRENKKRSYTEDDELDPMDPSAYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIA-SQTKKG
            +K    RS  KPPM    K   KKR++ EDDELDPMDPS+YSDAPRGGWVVGLKGVQPRAADTTA+GPLFQQRPYPSPGAVLR+NAE+A SQ KK 
Subjt:  DTLIKKSATDRSKWKPPMGKGGKRENKKRSYTEDDELDPMDPSAYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIA-SQTKKG

Query:  SSNYAPISKRGDGSDGLGDAD
        +S +  I+KRGDGSDGLGDAD
Subjt:  SSNYAPISKRGDGSDGLGDAD

AT2G41020.2 WW domain-containing protein | e value: 1.3e-65 | % identity: 38.86
Query:  SNTGVSSSTKFGSHVG--DSRDIDIAVQDAVLREQELATQNIIRSQRDSG-GADGLPDERSDIFSERYDPSSIKEHLMKITSVHRAEMAMKR-GKLNAPE
        + + V+S+  +GS +    S+DI+ A   A+LREQE+ TQ II+ QR++G    G     +DI  +R DP+++KEHL+K T+ HRAE A KR G ++   
Subjt:  SNTGVSSSTKFGSHVG--DSRDIDIAVQDAVLREQELATQNIIRSQRDSG-GADGLPDERSDIFSERYDPSSIKEHLMKITSVHRAEMAMKR-GKLNAPE

Query:  EGNLEIGNGYGVPGGCAFYGASKPGIATYGNNAIGQNIQEEVREAEQSSAVKALPEYLKQKLKARGILKEDTQHDHSKKSD--AIS-----NQPLQ--GK
        EGN+++GNGYG+PGG A+ G S                 E   + E ++A   LPEYLKQKLKARGIL++      S   D  A+S       P Q    
Subjt:  EGNLEIGNGYGVPGGCAFYGASKPGIATYGNNAIGQNIQEEVREAEQSSAVKALPEYLKQKLKARGILKEDTQHDHSKKSD--AIS-----NQPLQ--GK

Query:  KLHHGWVEAKDPGSGVSYYYNESTGKSQWERPSESSFSSQLSSAVPLPEDWMEAVDEATGLKYYYNVRTQVTQWERPVASHKVTSTHSTDNSPGSWNDQT
         L  GWV+AKDP SG +YYYN+ TG  QWERP E S+++  +  V   E+W+E  DEA+G KY+YN RT V+QWE P +  K  +T+S            
Subjt:  KLHHGWVEAKDPGSGVSYYYNESTGKSQWERPSESSFSSQLSSAVPLPEDWMEAVDEATGLKYYYNVRTQVTQWERPVASHKVTSTHSTDNSPGSWNDQT

Query:  SEQSKCVTCGRVMILTQGSRCCNDCISGVSTSSTDEKWQDQLSVPNKCMGCGGWGVGLVQAWGYCNHCTRILRLPQCQYLP------TSSSNNQQKTENI
                                  + V+ S+ + K +   S   +C GCGGWGVGLVQ WGYC HCTR+  LP+ Q+LP      T++ ++ QK  N 
Subjt:  SEQSKCVTCGRVMILTQGSRCCNDCISGVSTSSTDEKWQDQLSVPNKCMGCGGWGVGLVQAWGYCNHCTRILRLPQCQYLP------TSSSNNQQKTENI

Query:  KHSA
        ++++
Subjt:  KHSA

AT3G19670.1 pre-mRNA-processing protein 40B | e value: 6.6e-04 | % identity: 34.94
Query:  QPLQGKKLHHGWVEAKDPGSGVSYYYNESTGKSQWERPSESSFSSQLSSAVPLPEDWMEAVDEATGLKYYYNVRTQVTQWERP
        +PL  +K    WVE      G  Y++N+ T KS WE+P E     + + A     DW E      G KYYYN  T+ + W  P
Subjt:  QPLQGKKLHHGWVEAKDPGSGVSYYYNESTGKSQWERPSESSFSSQLSSAVPLPEDWMEAVDEATGLKYYYNVRTQVTQWERP

AT3G19840.1 pre-mRNA-processing protein 40C | e value: 1.7e-04 | % identity: 33.33
Query:  DAISNQPLQGKKLHHGWVEAKDPGSGVSYYYNESTGKSQWERPSESSFSSQLSSAVPLP--------EDWMEAVDEATGLKYYYNVRTQVTQWERP
        D  +   L G +L   W   K   +GV YYYN  TG+S +E+P             P+P         DW   V    G KYYYN +T+V+ W+ P
Subjt:  DAISNQPLQGKKLHHGWVEAKDPGSGVSYYYNESTGKSQWERPSESSFSSQLSSAVPLP--------EDWMEAVDEATGLKYYYNVRTQVTQWERP


Sequences
CDS sequence
ATGCCGACTTCTACTGCAGCAGTTTCTGTTTCGGAGGACTCGTCCAAAACTACGATTGGTTCAAGTCTGGAACCAGGCGCTGCTCAATCTCAATCTCATGCCCTAAATGA
ACCGCGACAACTTGCAATGTTTGCCAAACAAACGGGAGAAGCTCGGAGTTCTACAGCCGATCAGAACAATGTTCATCACGATGGTGTGTCTAACACTGGTGTTTCGTCTT
CTACCAAATTCGGGTCACATGTTGGCGATTCCAGAGATATTGACATCGCTGTTCAGGATGCTGTGTTGAGGGAACAGGAACTTGCTACCCAAAATATCATTCGTAGCCAA
AGAGACTCTGGAGGTGCAGATGGACTTCCTGATGAGCGATCAGATATCTTTTCAGAACGTTATGATCCAAGTTCTATTAAGGAGCATCTTATGAAGATTACTTCTGTACA
TCGTGCTGAAATGGCTATGAAAAGGGGAAAGTTGAATGCTCCAGAAGAAGGGAACTTGGAGATTGGAAATGGTTATGGTGTACCGGGTGGATGTGCTTTTTATGGTGCTT
CAAAGCCTGGAATTGCTACCTATGGAAATAATGCGATTGGCCAGAACATCCAGGAAGAGGTTAGGGAAGCAGAACAAAGTTCTGCTGTCAAAGCATTGCCCGAGTACCTC
AAGCAGAAGCTAAAAGCTAGGGGTATTCTTAAAGAAGATACACAACATGACCATTCTAAAAAATCTGATGCTATCTCAAATCAACCCTTGCAAGGAAAAAAGCTACATCA
TGGATGGGTGGAGGCTAAAGACCCTGGTAGTGGTGTTTCATATTATTATAATGAAAGTACTGGGAAAAGCCAATGGGAAAGGCCCTCTGAATCCTCTTTCAGTTCACAAC
TTTCATCAGCTGTACCCCTTCCAGAAGATTGGATGGAGGCAGTCGATGAAGCAACAGGTCTTAAATACTACTACAATGTGAGAACCCAGGTAACCCAATGGGAGCGGCCT
GTTGCATCTCATAAAGTAACTTCAACGCACTCAACTGATAATTCTCCTGGGTCTTGGAACGACCAAACTTCGGAGCAAAGTAAATGCGTCACATGTGGGAGGGTAATGAT
CCTCACGCAGGGTTCAAGATGCTGCAATGACTGTATAAGTGGGGTTTCTACAAGTTCAACCGATGAAAAGTGGCAGGACCAATTGTCTGTGCCAAATAAATGCATGGGAT
GTGGTGGTTGGGGAGTTGGCCTTGTGCAAGCTTGGGGTTACTGCAATCATTGTACGAGAATTCTCAGGCTTCCCCAGTGTCAGTACTTGCCAACCAGCAGTAGTAATAAT
CAGCAGAAGACTGAGAACATCAAGCATAGCGCTGACACTTTAATTAAAAAATCTGCCACGGACAGGTCCAAGTGGAAACCTCCAATGGGGAAAGGTGGAAAACGAGAAAA
TAAGAAGCGTTCCTACACGGAGGATGATGAATTGGATCCAATGGATCCTAGCGCGTATTCAGATGCTCCTCGTGGTGGCTGGGTTGTGGGTTTAAAAGGAGTGCAACCTC
GAGCAGCAGATACTACAGCTACAGGTCCTCTCTTTCAACAGCGGCCATACCCATCGCCCGGAGCTGTTCTGAGGAAGAATGCAGAAATTGCTTCACAGACGAAAAAGGGA
AGCTCTAACTACGCACCGATCTCCAAGAGAGGAGATGGAAGTGATGGCCTTGGTGATGCTGACTGA
mRNA sequence
ATTTTACGATTCAAAGAAAAAAGCGACGTTGCAGTCGAAAAATAGGAGCAGAAAAGTATTTTGAATTTGGGTGCTTCAACTTCATCGTTTTCCTTTGCTCGCCAACTGCG
AAACCCCAATCTGATTCTCTTCACAATTAGGGTTTTCCAATTCGTTGCTTCATTTCATCTCAAATCGTCTCTCCACACATCGCCGAATCCGCCATGTTCAAATTCGCCTG
AAATTCCCCACAAGCAGTCGATCAAGTGCAGCGAAGGTTTCAAAATCGAAACCCTAATCCGAGATGCCGACTTCTACTGCAGCAGTTTCTGTTTCGGAGGACTCGTCCAA
AACTACGATTGGTTCAAGTCTGGAACCAGGCGCTGCTCAATCTCAATCTCATGCCCTAAATGAACCGCGACAACTTGCAATGTTTGCCAAACAAACGGGAGAAGCTCGGA
GTTCTACAGCCGATCAGAACAATGTTCATCACGATGGTGTGTCTAACACTGGTGTTTCGTCTTCTACCAAATTCGGGTCACATGTTGGCGATTCCAGAGATATTGACATC
GCTGTTCAGGATGCTGTGTTGAGGGAACAGGAACTTGCTACCCAAAATATCATTCGTAGCCAAAGAGACTCTGGAGGTGCAGATGGACTTCCTGATGAGCGATCAGATAT
CTTTTCAGAACGTTATGATCCAAGTTCTATTAAGGAGCATCTTATGAAGATTACTTCTGTACATCGTGCTGAAATGGCTATGAAAAGGGGAAAGTTGAATGCTCCAGAAG
AAGGGAACTTGGAGATTGGAAATGGTTATGGTGTACCGGGTGGATGTGCTTTTTATGGTGCTTCAAAGCCTGGAATTGCTACCTATGGAAATAATGCGATTGGCCAGAAC
ATCCAGGAAGAGGTTAGGGAAGCAGAACAAAGTTCTGCTGTCAAAGCATTGCCCGAGTACCTCAAGCAGAAGCTAAAAGCTAGGGGTATTCTTAAAGAAGATACACAACA
TGACCATTCTAAAAAATCTGATGCTATCTCAAATCAACCCTTGCAAGGAAAAAAGCTACATCATGGATGGGTGGAGGCTAAAGACCCTGGTAGTGGTGTTTCATATTATT
ATAATGAAAGTACTGGGAAAAGCCAATGGGAAAGGCCCTCTGAATCCTCTTTCAGTTCACAACTTTCATCAGCTGTACCCCTTCCAGAAGATTGGATGGAGGCAGTCGAT
GAAGCAACAGGTCTTAAATACTACTACAATGTGAGAACCCAGGTAACCCAATGGGAGCGGCCTGTTGCATCTCATAAAGTAACTTCAACGCACTCAACTGATAATTCTCC
TGGGTCTTGGAACGACCAAACTTCGGAGCAAAGTAAATGCGTCACATGTGGGAGGGTAATGATCCTCACGCAGGGTTCAAGATGCTGCAATGACTGTATAAGTGGGGTTT
CTACAAGTTCAACCGATGAAAAGTGGCAGGACCAATTGTCTGTGCCAAATAAATGCATGGGATGTGGTGGTTGGGGAGTTGGCCTTGTGCAAGCTTGGGGTTACTGCAAT
CATTGTACGAGAATTCTCAGGCTTCCCCAGTGTCAGTACTTGCCAACCAGCAGTAGTAATAATCAGCAGAAGACTGAGAACATCAAGCATAGCGCTGACACTTTAATTAA
AAAATCTGCCACGGACAGGTCCAAGTGGAAACCTCCAATGGGGAAAGGTGGAAAACGAGAAAATAAGAAGCGTTCCTACACGGAGGATGATGAATTGGATCCAATGGATC
CTAGCGCGTATTCAGATGCTCCTCGTGGTGGCTGGGTTGTGGGTTTAAAAGGAGTGCAACCTCGAGCAGCAGATACTACAGCTACAGGTCCTCTCTTTCAACAGCGGCCA
TACCCATCGCCCGGAGCTGTTCTGAGGAAGAATGCAGAAATTGCTTCACAGACGAAAAAGGGAAGCTCTAACTACGCACCGATCTCCAAGAGAGGAGATGGAAGTGATGG
CCTTGGTGATGCTGACTGATCTTCATCTTCATTTCTGCTTCAACAATACAGCAAAACCATATTTCTGTAGAGCAGGAATTCACTTTCACATGAGATGAGACAGATATAGC
CTTCTCTGAAGTTTTGGGATCAATTGCACATGATTGCTGCTTACTTTCTGGAAGGGGAAGGGTTGTGGCTGTGTATTCAAGTGGGAGTTATTGAGCCATTTTAAAGCCTC
TACATTTGGGGTCAGCGTTTTTCCATGGTCGAGCTACTGCTCTTGTTGGCAATATGAAATCATTAGTATTTTGAATCTATTGTACATCTTTTCTTTTGTTTCTCTTTTCT
AATAATTTTGTTGTGTTTAGTTGGTTGGTTTCTTTTTCTTAACTTGGGGGATATTGTCAATCCTACAACTATGTGAACTCAATGTTTATCAAACATAGTTATACAAACTT
GAGTTCACCAATTCTAAATTTTATGAAGTTGGATTTGAATTCACGAA
Protein sequence
MPTSTAAVSVSEDSSKTTIGSSLEPGAAQSQSHALNEPRQLAMFAKQTGEARSSTADQNNVHHDGVSNTGVSSSTKFGSHVGDSRDIDIAVQDAVLREQELATQNIIRSQ
RDSGGADGLPDERSDIFSERYDPSSIKEHLMKITSVHRAEMAMKRGKLNAPEEGNLEIGNGYGVPGGCAFYGASKPGIATYGNNAIGQNIQEEVREAEQSSAVKALPEYL
KQKLKARGILKEDTQHDHSKKSDAISNQPLQGKKLHHGWVEAKDPGSGVSYYYNESTGKSQWERPSESSFSSQLSSAVPLPEDWMEAVDEATGLKYYYNVRTQVTQWERP
VASHKVTSTHSTDNSPGSWNDQTSEQSKCVTCGRVMILTQGSRCCNDCISGVSTSSTDEKWQDQLSVPNKCMGCGGWGVGLVQAWGYCNHCTRILRLPQCQYLPTSSSNN
QQKTENIKHSADTLIKKSATDRSKWKPPMGKGGKRENKKRSYTEDDELDPMDPSAYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKG
SSNYAPISKRGDGSDGLGDAD
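
The CDS and protein sequences above can be checked against each other by translation. The sketch below is illustrative only: it assumes the standard genetic code (NCBI translation table 1), the `translate` helper is hypothetical, and to stay short it translates only the first 24 and last 12 nucleotides of the CDS listed above, not the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and other whitespace
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGCCGACTTCTACTGCAGCAGTT"  # first 24 nt of the CDS above
cds_end = "GATGCTGACTGA"                # last 12 nt of the CDS above

print(translate(cds_start))  # MPTSTAAV
print(translate(cds_end))    # DAD
```

Both fragments agree with the start (`MPTSTAAV`) and end (`DAD`) of the protein sequence above. Because `translate` strips whitespace, the full multi-line CDS can be pasted in unchanged to check the entire protein.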