; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MELO3C009780 (gene) of Melon (DHL92) v4 genome

Gene IDMELO3C009780
OrganismCucumis melo DHL92 (Melon (DHL92) v4)
DescriptionPolyglutamine tract-binding protein 1
Genome locationchr04:28183879..28196815
RNA-Seq ExpressionMELO3C009780
SyntenyMELO3C009780
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR001202 - WW domain
IPR036020 - WW domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0053859.1 WW domain-containing protein [Cucumis melo var. makuwa]2.4e-30176.99Show/hide
Query:  MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV
        MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV
Subjt:  MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV

Query:  DDGRDIEIAVQDAVLREQRIDLDIFKFLNLYTPIMMINDMEGMRQNPFLLHAWLKFKLIMKTKYVSFPRESVGADGLPAEQSDIFSERYDPSTIKGMQEN
        DDGRDIEIAVQDAVLREQ                                     FKLIMKTKYVSFPRESVGADGLPAEQSDIFSERYDPSTIK     
Subjt:  DDGRDIEIAVQDAVLREQRIDLDIFKFLNLYTPIMMINDMEGMRQNPFLLHAWLKFKLIMKTKYVSFPRESVGADGLPAEQSDIFSERYDPSTIKGMQEN

Query:  VGAIRWRIWIPKLRFEALQMEHLLKITSEHRAEMAMKRGKLNLPEEDRWPLWDLNPCLKVPKFLHEALYLFVTFIIRPSRVNLSEKGNLEIGNGYGVPGG
                            EHLLKITSEHRAEMAMKRGKLNLPEE                                        GNLEIGNGYGVPGG
Subjt:  VGAIRWRIWIPKLRFEALQMEHLLKITSEHRAEMAMKRGKLNLPEEDRWPLWDLNPCLKVPKFLHEALYLFVTFIIRPSRVNLSEKGNLEIGNGYGVPGG

Query:  CASYGASKPGIVAN----------------------------GNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP----------
        CASYGASKPGIVAN                            GNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP          
Subjt:  CASYGASKPGIVAN----------------------------GNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP----------

Query:  --------------VEAKDPHSGVSYYYNESSGKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKV
                      VEAKDPHSGVSYYYNESSGKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKV
Subjt:  --------------VEAKDPHSGVSYYYNESSGKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKV

Query:  PGPWNDQTLEQSKCITCGSGMTLVQGSRYCNACTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPQCQYLPTNNISNQQKTE
        PGPWNDQTLEQSKCITCGSGMTLVQGSRYCNACTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPQCQYLPTNNISNQQKTE
Subjt:  PGPWNDQTLEQSKCITCGSGMTLVQGSRYCNACTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPQCQYLPTNNISNQQKTE

Query:  NIKHSADPSIKKSATDRSKWKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATLFAVATVSPYDGQLSFHGPLFQQR
        NIKHSADPSIKKSATDRSKWKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTAT                 GPLFQQR
Subjt:  NIKHSADPSIKKSATDRSKWKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATLFAVATVSPYDGQLSFHGPLFQQR

Query:  PYPSPGAVLRKNAEIASQTKKGSSHYAPISKRGDGSDGLGDAD
        PYPSPGAVLRKNAEIASQTKKGSSHYAPISKRGDGSDGLGDAD
Subjt:  PYPSPGAVLRKNAEIASQTKKGSSHYAPISKRGDGSDGLGDAD

TYK25544.1 uncharacterized protein E5676_scaffold352G006960 [Cucumis melo var. makuwa]4.3e-29878.9Show/hide
Query:  MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV
        MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV
Subjt:  MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV

Query:  DDGRDIEIAVQDAVLREQRIDLDIFKFLNLYTPIMMINDMEGMRQNPFLLHAWLKFKLIMKTKYVSFPRESVGADGLPAEQSDIFSERYDPSTIKGMQEN
        DDGRDIEIAVQDAVLREQ +                        QN                  +   RESVGADGLPAEQSDIFSERYDPSTIK     
Subjt:  DDGRDIEIAVQDAVLREQRIDLDIFKFLNLYTPIMMINDMEGMRQNPFLLHAWLKFKLIMKTKYVSFPRESVGADGLPAEQSDIFSERYDPSTIKGMQEN

Query:  VGAIRWRIWIPKLRFEALQMEHLLKITSEHRAEMAMKRGKLNLPEEDRWPLWDLNPCLKVPKFLHEALYLFVTFIIRPSRVNLSEKGNLEIGNGYGVPGG
                            EHLLKITSEHRAEMAMKRGKLNLPEE                                        GNLEIGNGYGVPGG
Subjt:  VGAIRWRIWIPKLRFEALQMEHLLKITSEHRAEMAMKRGKLNLPEEDRWPLWDLNPCLKVPKFLHEALYLFVTFIIRPSRVNLSEKGNLEIGNGYGVPGG

Query:  CASYGASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP--------------------VEAKDPHSGVSYYYNESS
        CASYGASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP                    VEAKDPHSGVSYYYNESS
Subjt:  CASYGASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP--------------------VEAKDPHSGVSYYYNESS

Query:  GKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQGSRYCNA
        GKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQGSRYCNA
Subjt:  GKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQGSRYCNA

Query:  CTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPQCQYLPTNNISNQQKTENIKHSADPSIKKSATDRSKWKPPMGKGGKRES
        CTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPQCQYLPTNNISNQQKTENIKHSADPSIKKSATDRSKWKPPMGKGGKRES
Subjt:  CTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPQCQYLPTNNISNQQKTENIKHSADPSIKKSATDRSKWKPPMGKGGKRES

Query:  RKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATLFAVATVSPYDGQLSFHGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHYAPISKR
        RKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTAT                 GPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHYAPISKR
Subjt:  RKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATLFAVATVSPYDGQLSFHGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHYAPISKR

Query:  GDGSDGLGDAD
        GDGSDGLGDAD
Subjt:  GDGSDGLGDAD

XP_004136655.1 uncharacterized protein LOC101203374 [Cucumis sativus]6.0e-27672.54Show/hide
Query:  MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAA------AQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSS
        MPTST  IAGSGDSSNTIIGSSAEDKSLKESAA      AQNEVQELEK SKQ+YPCQPGEAQ +VAI ADQETN+S GNDQNIVPH G FNNIAVS+SS
Subjt:  MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAA------AQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSS

Query:  NFRSNVDDGRDIEIAVQDAVLREQRIDLDIFKFLNLYTPIMMINDMEGMRQNPFLLHAWLKFKLIMKTKYVSFPRESVGADGLPAEQSDIFSERYDPSTI
        NFRSNVDD RDI+IAVQDAVLREQ +                        QN                  +   R+SVGADGLP E+SDIFSERYDPS++
Subjt:  NFRSNVDDGRDIEIAVQDAVLREQRIDLDIFKFLNLYTPIMMINDMEGMRQNPFLLHAWLKFKLIMKTKYVSFPRESVGADGLPAEQSDIFSERYDPSTI

Query:  KGMQENVGAIRWRIWIPKLRFEALQMEHLLKITSEHRAEMAMKRGKLNLPEEDRWPLWDLNPCLKVPKFLHEALYLFVTFIIRPSRVNLSEKGNLEIGNG
        K                         EHLLKITSEHRAEMA+KRGKLNLPEE                                        GNLEIGNG
Subjt:  KGMQENVGAIRWRIWIPKLRFEALQMEHLLKITSEHRAEMAMKRGKLNLPEEDRWPLWDLNPCLKVPKFLHEALYLFVTFIIRPSRVNLSEKGNLEIGNG

Query:  YGVPGGCASYGASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP------------------------VEAKDPHS
        YGVPGGCA YGASKPGIVANGNNVTGQKIQGQ+KE EQSSA+KALPEYLKQKLRARGILKEDAEHSN                         VEAKDPHS
Subjt:  YGVPGGCASYGASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP------------------------VEAKDPHS

Query:  GVSYYYNESSGKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMT
        GVSYYYNESSGKSQWERPSELSS+TQLSSAVSLPEDWMEAIDQTSG+KYYYNMRTH+TQWERPVASHQTTLTHSNDK PGPWNDQTLEQSKCITCGSGMT
Subjt:  GVSYYYNESSGKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMT

Query:  LVQGSRYCNACTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPQCQYLPTNNISNQQKTENIKHSADPSIKKSATDRSKWKP
        LVQGSRYCN+CTSGVSTSSTNG+WQDQ SEQNKCMGCGGWGLGLVQAWGYC HCTRIL LPQCQYLPTNNISNQQK EN+KHSADPSIKKS TDRSKWKP
Subjt:  LVQGSRYCNACTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPQCQYLPTNNISNQQKTENIKHSADPSIKKSATDRSKWKP

Query:  PMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATLFAVATVSPYDGQLSFHGPLFQQRPYPSPGAVLRKNAEIASQTKKG
        P+GKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTAT                 GPLFQQRPYPSPGAVLRKNAEIASQTKKG
Subjt:  PMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATLFAVATVSPYDGQLSFHGPLFQQRPYPSPGAVLRKNAEIASQTKKG

Query:  SSHYAPISKRGDGSDGLGDAD
        SSHYAPISKRGDGSDGLGDAD
Subjt:  SSHYAPISKRGDGSDGLGDAD

XP_038906174.1 uncharacterized protein LOC120092051 isoform X2 [Benincasa hispida]5.5e-25368.21Show/hide
Query:  MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAA------QNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSS
        MPT+T AIAGSGDSSNTIIGSS EDKSLKE AAA      QNEVQELEK  K IYPCQ GEAQ S      QETN+SFGND +IVPH+ VF NIAVS+SS
Subjt:  MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAA------QNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSS

Query:  NFRSNVDDGRDIEIAVQDAVLREQRIDLDIFKFLNLYTPIMMINDMEGMRQNPFLLHAWLKFKLIMKTKYVSFPRESVGADGLPAEQSDIFSERYDPSTI
         FRS+V+D RDI+ AVQDAVLREQ +                        QN                  +   R+SVGADGLP E+SDIFSERYDPSTI
Subjt:  NFRSNVDDGRDIEIAVQDAVLREQRIDLDIFKFLNLYTPIMMINDMEGMRQNPFLLHAWLKFKLIMKTKYVSFPRESVGADGLPAEQSDIFSERYDPSTI

Query:  K---------------GMQENVGAIR----WRIWIPKLRFEALQMEHLLKITSEHRAEMAMKRGKLNLPEEDRWPLWDLNPCLKVPKFLHEALYLFVTFI
        K               G   N  A R    W     ++    +  EHLLKITSEHRAEMAMKRGK NLPEE                             
Subjt:  K---------------GMQENVGAIR----WRIWIPKLRFEALQMEHLLKITSEHRAEMAMKRGKLNLPEEDRWPLWDLNPCLKVPKFLHEALYLFVTFI

Query:  IRPSRVNLSEKGNLEIGNGYGVPGGCASYGASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP-------------
                   GNLEIGNGYGVPGGCA YGASKPG+VA GNN  GQKIQGQV+E EQSSA KALPEYLKQKLRARGILKE+AEHSN              
Subjt:  IRPSRVNLSEKGNLEIGNGYGVPGGCASYGASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP-------------

Query:  -------VEAKDPHSGVSYYYNESSGKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQ
               VEAKDP SGVSYYYNESSGKSQWERPSE SSDTQLSSA SLPEDWMEA+DQ +GLKYYYNMRT +TQWE PVASHQTTLTHSND V G WN+Q
Subjt:  -------VEAKDPHSGVSYYYNESSGKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQ

Query:  TLEQSKCITCGSGMTLVQGSRYCNACTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPQCQYLPTNNISNQQKTENIKHSAD
        TLEQSKCITCGSG+TLVQGSRYCN CTSGVSTSSTNG WQDQSSEQNKCMGC GWGLGLVQAWGYCNHCTRIL LPQCQYLPT+NISNQQKTENIKHSAD
Subjt:  TLEQSKCITCGSGMTLVQGSRYCNACTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPQCQYLPTNNISNQQKTENIKHSAD

Query:  PSIKKSATDRSKWKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATLFAVATVSPYDGQLSFHGPLFQQRPYPSPGA
        PSIKKSATD SKWKPP+GKGGKRESRKRSYSEDDELDPMDPS+YSDAPRGGWVVGLKGVQPRAADTTAT                 GPLFQQRPYPSPGA
Subjt:  PSIKKSATDRSKWKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATLFAVATVSPYDGQLSFHGPLFQQRPYPSPGA

Query:  VLRKNAEIASQTKKGSSHYAPISKRGDGSDGLGDAD
        VLRKNAEIASQTKKGSSHYAPISKRGDGSDGLGDAD
Subjt:  VLRKNAEIASQTKKGSSHYAPISKRGDGSDGLGDAD

XP_038906175.1 uncharacterized protein LOC120092051 isoform X3 [Benincasa hispida]1.2e-25269.32Show/hide
Query:  MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAA------QNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSS
        MPT+T AIAGSGDSSNTIIGSS EDKSLKE AAA      QNEVQELEK  K IYPCQ GEAQ S      QETN+SFGND +IVPH+ VF NIAVS+SS
Subjt:  MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAA------QNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSS

Query:  NFRSNVDDGRDIEIAVQDAVLREQRIDLDIFKFLNLYTPIMMINDMEGMRQNPFLLHAWLKFKLIMKTKYVSFPRESVGADGLPAEQSDIFSERYDPSTI
         FRS+V+D RDI+ AVQDAVLREQ +                        QN                  +   R+SVGADGLP E+SDIFSERYDPSTI
Subjt:  NFRSNVDDGRDIEIAVQDAVLREQRIDLDIFKFLNLYTPIMMINDMEGMRQNPFLLHAWLKFKLIMKTKYVSFPRESVGADGLPAEQSDIFSERYDPSTI

Query:  KGMQENVGAIRWRIWIPKLRFEALQMEHLLKITSEHRAEMAMKRGKLNLPEEDRWPLWDLNPCLKVPKFLHEALYLFVTFIIRPSRVNLSEKGNLEIGNG
        K                         EHLLKITSEHRAEMAMKRGK NLPEE                                        GNLEIGNG
Subjt:  KGMQENVGAIRWRIWIPKLRFEALQMEHLLKITSEHRAEMAMKRGKLNLPEEDRWPLWDLNPCLKVPKFLHEALYLFVTFIIRPSRVNLSEKGNLEIGNG

Query:  YGVPGGCASYGASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP--------------------VEAKDPHSGVSY
        YGVPGGCA YGASKPG+VA GNN  GQKIQGQV+E EQSSA KALPEYLKQKLRARGILKE+AEHSN                     VEAKDP SGVSY
Subjt:  YGVPGGCASYGASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP--------------------VEAKDPHSGVSY

Query:  YYNESSGKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQG
        YYNESSGKSQWERPSE SSDTQLSSA SLPEDWMEA+DQ +GLKYYYNMRT +TQWE PVASHQTTLTHSND V G WN+QTLEQSKCITCGSG+TLVQG
Subjt:  YYNESSGKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQG

Query:  SRYCNACTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPQCQYLPTNNISNQQKTENIKHSADPSIKKSATDRSKWKPPMGK
        SRYCN CTSGVSTSSTNG WQDQSSEQNKCMGC GWGLGLVQAWGYCNHCTRIL LPQCQYLPT+NISNQQKTENIKHSADPSIKKSATD SKWKPP+GK
Subjt:  SRYCNACTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPQCQYLPTNNISNQQKTENIKHSADPSIKKSATDRSKWKPPMGK

Query:  GGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATLFAVATVSPYDGQLSFHGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHY
        GGKRESRKRSYSEDDELDPMDPS+YSDAPRGGWVVGLKGVQPRAADTTAT                 GPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHY
Subjt:  GGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATLFAVATVSPYDGQLSFHGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHY

Query:  APISKRGDGSDGLGDAD
        APISKRGDGSDGLGDAD
Subjt:  APISKRGDGSDGLGDAD

TrEMBL top hitse value%identityAlignment
A0A0A0LFL2 Polyglutamine tract-binding protein 19.1e-27866.34Show/hide
Query:  MVLKHSTKTFRI-----GRRSPSNI--NKGRRRRRQRIGERTGRRIRFVGRFIGSFRPPFRNKTRYSQFEASFLIAGSAIFKFSRIPTSSRSITGKLSK-
        M LKHSTKT R+      RR  SNI      R  ++       +  R V R + +    F+ + +     +   +     F+  +I    R    +L + 
Subjt:  MVLKHSTKTFRI-----GRRSPSNI--NKGRRRRRQRIGERTGRRIRFVGRFIGSFRPPFRNKTRYSQFEASFLIAGSAIFKFSRIPTSSRSITGKLSK-

Query:  ---LKVPAMPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAA------AQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFN
           LK+PAMPTST  IAGSGDSSNTIIGSSAEDKSLKESAA      AQNEVQELEK SKQ+YPCQPGEAQ +VAI ADQETN+S GNDQNIVPH G FN
Subjt:  ---LKVPAMPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAA------AQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFN

Query:  NIAVSTSSNFRSNVDDGRDIEIAVQDAVLREQRIDLDIFKFLNLYTPIMMINDMEGMRQNPFLLHAWLKFKLIMKTKYVSFPRESVGADGLPAEQSDIFS
        NIAVS+SSNFRSNVDD RDI+IAVQDAVLREQ +                        QN                  +   R+SVGADGLP E+SDIFS
Subjt:  NIAVSTSSNFRSNVDDGRDIEIAVQDAVLREQRIDLDIFKFLNLYTPIMMINDMEGMRQNPFLLHAWLKFKLIMKTKYVSFPRESVGADGLPAEQSDIFS

Query:  ERYDPSTIKGMQENVGAIRWRIWIPKLRFEALQMEHLLKITSEHRAEMAMKRGKLNLPEEDRWPLWDLNPCLKVPKFLHEALYLFVTFIIRPSRVNLSEK
        ERYDPS++K                         EHLLKITSEHRAEMA+KRGKLNLPEE                                        
Subjt:  ERYDPSTIKGMQENVGAIRWRIWIPKLRFEALQMEHLLKITSEHRAEMAMKRGKLNLPEEDRWPLWDLNPCLKVPKFLHEALYLFVTFIIRPSRVNLSEK

Query:  GNLEIGNGYGVPGGCASYGASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP------------------------
        GNLEIGNGYGVPGGCA YGASKPGIVANGNNVTGQKIQGQ+KE EQSSA+KALPEYLKQKLRARGILKEDAEHSN                         
Subjt:  GNLEIGNGYGVPGGCASYGASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP------------------------

Query:  VEAKDPHSGVSYYYNESSGKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKC
        VEAKDPHSGVSYYYNESSGKSQWERPSELSS+TQLSSAVSLPEDWMEAIDQTSG+KYYYNMRTH+TQWERPVASHQTTLTHSNDK PGPWNDQTLEQSKC
Subjt:  VEAKDPHSGVSYYYNESSGKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKC

Query:  ITCGSGMTLVQGSRYCNACTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPQCQYLPTNNISNQQKTENIKHSADPSIKKSA
        ITCGSGMTLVQGSRYCN+CTSGVSTSSTNG+WQDQ SEQNKCMGCGGWGLGLVQAWGYC HCTRIL LPQCQYLPTNNISNQQK EN+KHSADPSIKKS 
Subjt:  ITCGSGMTLVQGSRYCNACTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPQCQYLPTNNISNQQKTENIKHSADPSIKKSA

Query:  TDRSKWKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATLFAVATVSPYDGQLSFHGPLFQQRPYPSPGAVLRKNAE
        TDRSKWKPP+GKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTAT                 GPLFQQRPYPSPGAVLRKNAE
Subjt:  TDRSKWKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATLFAVATVSPYDGQLSFHGPLFQQRPYPSPGAVLRKNAE

Query:  IASQTKKGSSHYAPISKRGDGSDGLGDAD
        IASQTKKGSSHYAPISKRGDGSDGLGDAD
Subjt:  IASQTKKGSSHYAPISKRGDGSDGLGDAD

A0A5A7UK56 Polyglutamine tract-binding protein 11.2e-30176.99Show/hide
Query:  MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV
        MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV
Subjt:  MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV

Query:  DDGRDIEIAVQDAVLREQRIDLDIFKFLNLYTPIMMINDMEGMRQNPFLLHAWLKFKLIMKTKYVSFPRESVGADGLPAEQSDIFSERYDPSTIKGMQEN
        DDGRDIEIAVQDAVLREQ                                     FKLIMKTKYVSFPRESVGADGLPAEQSDIFSERYDPSTIK     
Subjt:  DDGRDIEIAVQDAVLREQRIDLDIFKFLNLYTPIMMINDMEGMRQNPFLLHAWLKFKLIMKTKYVSFPRESVGADGLPAEQSDIFSERYDPSTIKGMQEN

Query:  VGAIRWRIWIPKLRFEALQMEHLLKITSEHRAEMAMKRGKLNLPEEDRWPLWDLNPCLKVPKFLHEALYLFVTFIIRPSRVNLSEKGNLEIGNGYGVPGG
                            EHLLKITSEHRAEMAMKRGKLNLPEE                                        GNLEIGNGYGVPGG
Subjt:  VGAIRWRIWIPKLRFEALQMEHLLKITSEHRAEMAMKRGKLNLPEEDRWPLWDLNPCLKVPKFLHEALYLFVTFIIRPSRVNLSEKGNLEIGNGYGVPGG

Query:  CASYGASKPGIVAN----------------------------GNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP----------
        CASYGASKPGIVAN                            GNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP          
Subjt:  CASYGASKPGIVAN----------------------------GNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP----------

Query:  --------------VEAKDPHSGVSYYYNESSGKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKV
                      VEAKDPHSGVSYYYNESSGKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKV
Subjt:  --------------VEAKDPHSGVSYYYNESSGKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKV

Query:  PGPWNDQTLEQSKCITCGSGMTLVQGSRYCNACTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPQCQYLPTNNISNQQKTE
        PGPWNDQTLEQSKCITCGSGMTLVQGSRYCNACTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPQCQYLPTNNISNQQKTE
Subjt:  PGPWNDQTLEQSKCITCGSGMTLVQGSRYCNACTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPQCQYLPTNNISNQQKTE

Query:  NIKHSADPSIKKSATDRSKWKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATLFAVATVSPYDGQLSFHGPLFQQR
        NIKHSADPSIKKSATDRSKWKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTAT                 GPLFQQR
Subjt:  NIKHSADPSIKKSATDRSKWKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATLFAVATVSPYDGQLSFHGPLFQQR

Query:  PYPSPGAVLRKNAEIASQTKKGSSHYAPISKRGDGSDGLGDAD
        PYPSPGAVLRKNAEIASQTKKGSSHYAPISKRGDGSDGLGDAD
Subjt:  PYPSPGAVLRKNAEIASQTKKGSSHYAPISKRGDGSDGLGDAD

A0A5D3DPP7 Polyglutamine tract-binding protein 12.1e-29878.9Show/hide
Query:  MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV
        MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV
Subjt:  MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV

Query:  DDGRDIEIAVQDAVLREQRIDLDIFKFLNLYTPIMMINDMEGMRQNPFLLHAWLKFKLIMKTKYVSFPRESVGADGLPAEQSDIFSERYDPSTIKGMQEN
        DDGRDIEIAVQDAVLREQ +                        QN                  +   RESVGADGLPAEQSDIFSERYDPSTIK     
Subjt:  DDGRDIEIAVQDAVLREQRIDLDIFKFLNLYTPIMMINDMEGMRQNPFLLHAWLKFKLIMKTKYVSFPRESVGADGLPAEQSDIFSERYDPSTIKGMQEN

Query:  VGAIRWRIWIPKLRFEALQMEHLLKITSEHRAEMAMKRGKLNLPEEDRWPLWDLNPCLKVPKFLHEALYLFVTFIIRPSRVNLSEKGNLEIGNGYGVPGG
                            EHLLKITSEHRAEMAMKRGKLNLPEE                                        GNLEIGNGYGVPGG
Subjt:  VGAIRWRIWIPKLRFEALQMEHLLKITSEHRAEMAMKRGKLNLPEEDRWPLWDLNPCLKVPKFLHEALYLFVTFIIRPSRVNLSEKGNLEIGNGYGVPGG

Query:  CASYGASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP--------------------VEAKDPHSGVSYYYNESS
        CASYGASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP                    VEAKDPHSGVSYYYNESS
Subjt:  CASYGASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP--------------------VEAKDPHSGVSYYYNESS

Query:  GKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQGSRYCNA
        GKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQGSRYCNA
Subjt:  GKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQGSRYCNA

Query:  CTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPQCQYLPTNNISNQQKTENIKHSADPSIKKSATDRSKWKPPMGKGGKRES
        CTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPQCQYLPTNNISNQQKTENIKHSADPSIKKSATDRSKWKPPMGKGGKRES
Subjt:  CTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPQCQYLPTNNISNQQKTENIKHSADPSIKKSATDRSKWKPPMGKGGKRES

Query:  RKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATLFAVATVSPYDGQLSFHGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHYAPISKR
        RKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTAT                 GPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHYAPISKR
Subjt:  RKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATLFAVATVSPYDGQLSFHGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHYAPISKR

Query:  GDGSDGLGDAD
        GDGSDGLGDAD
Subjt:  GDGSDGLGDAD

A0A6J1F9X9 Polyglutamine tract-binding protein 12.9e-23965.41Show/hide
Query:  MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAA------AQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSS
        MPTST AIA  GDSS T IGSS ED SLKES +      AQNEVQELEKF  QI PCQPGE + SV IS+DQE   SFGNDQNIVPH+GVF NIAVS+SS
Subjt:  MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAA------AQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSS

Query:  NFRSNVDDGRDIEIAVQDAVLREQRIDLDIFKFLNLYTPIMMINDMEGMRQNPFLLHAWLKFKLIMKTKYVSFPRESVGADGLPAEQSDIFSERYDPSTI
         F S+V D RDI+ AV+DAVLREQ +                        QN            I++++     R+SV ADGLP E+SDIFSERYDPS +
Subjt:  NFRSNVDDGRDIEIAVQDAVLREQRIDLDIFKFLNLYTPIMMINDMEGMRQNPFLLHAWLKFKLIMKTKYVSFPRESVGADGLPAEQSDIFSERYDPSTI

Query:  KGMQENVGAIRWRIWIPKLRFEALQMEHLLKITSEHRAEMAMKRGKLNLPEEDRWPLWDLNPCLKVPKFLHEALYLFVTFIIRPSRVNLSEKGNLEIGNG
        K                         EHLLKITSEHRAEMAMKRGKLNLPEE                                        GNLEIGNG
Subjt:  KGMQENVGAIRWRIWIPKLRFEALQMEHLLKITSEHRAEMAMKRGKLNLPEEDRWPLWDLNPCLKVPKFLHEALYLFVTFIIRPSRVNLSEKGNLEIGNG

Query:  YGVPGGCASYGASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP--------------------VEAKDPHSGVSY
        YGVPGGCA YGASKPGIV +GNN   QKIQGQV+E EQS +AK LPEYLKQKL+ARGILKEDA+HSN                     VEAKDP SGVSY
Subjt:  YGVPGGCASYGASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP--------------------VEAKDPHSGVSY

Query:  YYNESSGKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQG
        YYNES+GKSQWERP+E S   QLSSAVSLPEDWMEA+DQT+G +YYYN RT +TQWE PVASHQ TL HS    PG WNDQT  QSKC+TCGSGMTLVQG
Subjt:  YYNESSGKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQG

Query:  SRYCNACTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPQCQYLPTNNISNQQKTENIKHSADPSIKKSATDRSKWKPPMGK
        +RYCN C SGVSTSSTNG WQDQ S+Q+KCMGCGGWGLGLVQAWGYCNHCTR L LPQCQYLPT+NI NQQKTENIK++ADPSIKKSA+DRSK KPP+GK
Subjt:  SRYCNACTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPQCQYLPTNNISNQQKTENIKHSADPSIKKSATDRSKWKPPMGK

Query:  GGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATLFAVATVSPYDGQLSFHGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHY
        GGKRESRKRS+SEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTAT                 GPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHY
Subjt:  GGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATLFAVATVSPYDGQLSFHGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHY

Query:  APISKRGDGSDGLGDAD
        APISKRGDGSDGLGDAD
Subjt:  APISKRGDGSDGLGDAD

A0A6J1J063 Polyglutamine tract-binding protein 13.2e-23864.99Show/hide
Query:  MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAA------AQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSS
        MPTST AIA SGDSS T IGSS ED SLKES +      AQNEVQELEKF  QI PCQPGE   SV I +DQE   SFGNDQNIVPH GVF NIAVS+SS
Subjt:  MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAA------AQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSS

Query:  NFRSNVDDGRDIEIAVQDAVLREQRIDLDIFKFLNLYTPIMMINDMEGMRQNPFLLHAWLKFKLIMKTKYVSFPRESVGADGLPAEQSDIFSERYDPSTI
         F S+V D RDI+ AV+DAVLREQ +                        QN                  +   R+SVGADGLP E+SDIFSERYDPST+
Subjt:  NFRSNVDDGRDIEIAVQDAVLREQRIDLDIFKFLNLYTPIMMINDMEGMRQNPFLLHAWLKFKLIMKTKYVSFPRESVGADGLPAEQSDIFSERYDPSTI

Query:  KGMQENVGAIRWRIWIPKLRFEALQMEHLLKITSEHRAEMAMKRGKLNLPEEDRWPLWDLNPCLKVPKFLHEALYLFVTFIIRPSRVNLSEKGNLEIGNG
        K                         EHLLKIT+EHRAEMAMKRGKLNLPEE                                        GNLEIGNG
Subjt:  KGMQENVGAIRWRIWIPKLRFEALQMEHLLKITSEHRAEMAMKRGKLNLPEEDRWPLWDLNPCLKVPKFLHEALYLFVTFIIRPSRVNLSEKGNLEIGNG

Query:  YGVPGGCASYGASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP--------------------VEAKDPHSGVSY
        YGVPGGCA YGASKPGIV +GNN   QKIQGQV+E +QSS+AK LPEYLKQKL+ARGILKEDA+HSN                     VEAKDP SG SY
Subjt:  YGVPGGCASYGASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP--------------------VEAKDPHSGVSY

Query:  YYNESSGKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQG
        YYNES+GKSQWERP+E S   QLSSAVSLPEDWMEA+DQ +G KYYYN RT +TQWE P ASHQ TL HSN   PG WNDQT  QSKC+TCGSGMTLVQG
Subjt:  YYNESSGKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQG

Query:  SRYCNACTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPQCQYLPTNNISNQQKTENIKHSADPSIKKSATDRSKWKPPMGK
        SRYCN C SGVSTSSTNG WQDQ S+ +KCMGCGGWGLGLVQAWGYCNHCTR L LPQCQYLPT+NI+NQ KTENIK+++DPSIKKSA+DRSK KPP+GK
Subjt:  SRYCNACTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPQCQYLPTNNISNQQKTENIKHSADPSIKKSATDRSKWKPPMGK

Query:  GGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATLFAVATVSPYDGQLSFHGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHY
        GGKRESRKRS+SEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTAT                 GPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHY
Subjt:  GGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATLFAVATVSPYDGQLSFHGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHY

Query:  APISKRGDGSDGLGDAD
        APISKRGDGSDGLGDAD
Subjt:  APISKRGDGSDGLGDAD

SwissProt top hitse value%identityAlignment
A1YFA7 Polyglutamine-binding protein 16.5e-1554.84Show/hide
Query:  ESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGL--KGVQPRAADTTATLFAVATVSPYDGQLSFHGPLFQQRPYPSPGAVLRKNAEIASQTKK
        +S+K    +D+ELDPMDPSSYSDAPRG W  GL  +      ADTTA                  GPLFQQRPYPSPGAVLR NAE AS+TK+
Subjt:  ESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGL--KGVQPRAADTTATLFAVATVSPYDGQLSFHGPLFQQRPYPSPGAVLRKNAEIASQTKK

A2T806 Polyglutamine-binding protein 16.5e-1554.84Show/hide
Query:  ESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGL--KGVQPRAADTTATLFAVATVSPYDGQLSFHGPLFQQRPYPSPGAVLRKNAEIASQTKK
        +S+K    +D+ELDPMDPSSYSDAPRG W  GL  +      ADTTA                  GPLFQQRPYPSPGAVLR NAE AS+TK+
Subjt:  ESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGL--KGVQPRAADTTATLFAVATVSPYDGQLSFHGPLFQQRPYPSPGAVLRKNAEIASQTKK

O60828 Polyglutamine-binding protein 16.5e-1554.84Show/hide
Query:  ESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGL--KGVQPRAADTTATLFAVATVSPYDGQLSFHGPLFQQRPYPSPGAVLRKNAEIASQTKK
        +S+K    +D+ELDPMDPSSYSDAPRG W  GL  +      ADTTA                  GPLFQQRPYPSPGAVLR NAE AS+TK+
Subjt:  ESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGL--KGVQPRAADTTATLFAVATVSPYDGQLSFHGPLFQQRPYPSPGAVLRKNAEIASQTKK

Q2HJC9 Polyglutamine-binding protein 12.9e-1554.84Show/hide
Query:  ESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGL--KGVQPRAADTTATLFAVATVSPYDGQLSFHGPLFQQRPYPSPGAVLRKNAEIASQTKK
        +S+K +  +D+ELDPMDPSSYSDAPRG W  GL  +      ADTTA                  GPLFQQRPYPSPGAVLR NAE AS+TK+
Subjt:  ESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGL--KGVQPRAADTTATLFAVATVSPYDGQLSFHGPLFQQRPYPSPGAVLRKNAEIASQTKK

Q91VJ5 Polyglutamine-binding protein 18.5e-1554.46Show/hide
Query:  PMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGL--KGVQPRAADTTATLFAVATVSPYDGQLSFHGPLFQQRPYPSPGAVLRKNAEIASQTK
        P  K  K  SRK     D+ELDPMDPSSYSDAPRG W  GL  +      ADTTA                  GPLFQQRPYPSPGAVLR NAE AS+TK
Subjt:  PMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGL--KGVQPRAADTTATLFAVATVSPYDGQLSFHGPLFQQRPYPSPGAVLRKNAEIASQTK

Query:  K
        +
Subjt:  K

Arabidopsis top hitse value%identityAlignment
AT2G41020.1 WW domain-containing protein5.4e-8941.95Show/hide
Query:  EHLLKITSEHRAEMAMKRGKLNLPEEDRWPLWDLNPCLKVPKFLHEALYLFVTFIIRPSRVNLSEKGNLEIGNGYGVPGGCASYGASKPGIVANGNNVTG
        EHLLK T+ HRAE A KRG                                         V+   +GN+++GNGYG+PGG A  G S+         ++G
Subjt:  EHLLKITSEHRAEMAMKRGKLNLPEEDRWPLWDLNPCLKVPKFLHEALYLFVTFIIRPSRVNLSEKGNLEIGNGYGVPGGCASYGASKPGIVANGNNVTG

Query:  QKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKE--DAEHSNP---------------------------VEAKDPHSGVSYYYNESSGKSQWERPSEL
        +         E ++A+  LPEYLKQKL+ARGIL++   A  SNP                           V+AKDP SG +YYYN+ +G  QWERP EL
Subjt:  QKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKE--DAEHSNP---------------------------VEAKDPHSGVSYYYNESSGKSQWERPSEL

Query:  SSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQGSRYCNACTSGVSTSSTN
        S  T  +  V   E+W+E  D+ SG KY+YN RTH++QWE P +  +   T+SN                                     + V+ S+ N
Subjt:  SSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQGSRYCNACTSGVSTSSTN

Query:  GMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPQCQYLPT--NNISNQQKTENIKHSADPSIKKSATDRSKWKPPMGKGGKRESRKRSYSEDD
        G  +   S+  +C GCGGWG+GLVQ WGYC HCTR+ +LP+ Q+LP   N+ +N          A  S +K    RS  KPPM    K   +KR+++EDD
Subjt:  GMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPQCQYLPT--NNISNQQKTENIKHSADPSIKKSATDRSKWKPPMGKGGKRESRKRSYSEDD

Query:  ELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATLFAVATVSPYDGQLSFHGPLFQQRPYPSPGAVLRKNAEIA-SQTKKGSSHYAPISKRGDGSDGLG
        ELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTA+                 GPLFQQRPYPSPGAVLR+NAE+A SQ KK +S +  I+KRGDGSDGLG
Subjt:  ELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATLFAVATVSPYDGQLSFHGPLFQQRPYPSPGAVLRKNAEIA-SQTKKGSSHYAPISKRGDGSDGLG

Query:  DAD
        DAD
Subjt:  DAD

AT2G41020.2 WW domain-containing protein1.9e-4633.88Show/hide
Query:  EHLLKITSEHRAEMAMKRGKLNLPEEDRWPLWDLNPCLKVPKFLHEALYLFVTFIIRPSRVNLSEKGNLEIGNGYGVPGGCASYGASKPGIVANGNNVTG
        EHLLK T+ HRAE A KRG                                         V+   +GN+++GNGYG+PGG A  G S+         ++G
Subjt:  EHLLKITSEHRAEMAMKRGKLNLPEEDRWPLWDLNPCLKVPKFLHEALYLFVTFIIRPSRVNLSEKGNLEIGNGYGVPGGCASYGASKPGIVANGNNVTG

Query:  QKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKE--DAEHSNP---------------------------VEAKDPHSGVSYYYNESSGKSQWERPSEL
        +         E ++A+  LPEYLKQKL+ARGIL++   A  SNP                           V+AKDP SG +YYYN+ +G  QWERP EL
Subjt:  QKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKE--DAEHSNP---------------------------VEAKDPHSGVSYYYNESSGKSQWERPSEL

Query:  SSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQGSRYCNACTSGVSTSSTN
        S  T  +  V   E+W+E  D+ SG KY+YN RTH++QWE P +  +   T+SN                                     + V+ S+ N
Subjt:  SSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQGSRYCNACTSGVSTSSTN

Query:  GMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPQCQYLP------TNNISNQQKTENIKHSA
        G  +   S+  +C GCGGWG+GLVQ WGYC HCTR+ +LP+ Q+LP      TN   + QK  N ++++
Subjt:  GMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPQCQYLP------TNNISNQQKTENIKHSA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGTTGAAGCACTCAACGAAAACCTTCCGAATTGGAAGAAGATCACCAAGCAACATCAATAAGGGACGGCGGCGGCGGCGGCAGAGGATAGGAGAGCGGACAGGACG
GCGAATTCGGTTTGTCGGTCGGTTTATCGGGTCGTTTCGACCACCTTTCAGAAATAAAACCAGGTATTCCCAATTCGAAGCGTCGTTTCTCATCGCCGGTTCCGCCATAT
TCAAATTCTCTCGAATTCCGACTAGCAGTCGATCAATTACAGGAAAGCTTTCGAAATTGAAAGTTCCGGCGATGCCGACTTCTACTACAGCAATTGCAGGTTCAGGAGAT
TCGTCCAATACCATAATTGGTTCTAGTGCCGAAGATAAATCCCTCAAGGAATCAGCTGCTGCTCAAAATGAAGTGCAAGAACTTGAAAAGTTTAGCAAACAAATTTATCC
TTGCCAACCGGGAGAAGCTCAGTGTTCTGTGGCAATATCTGCTGATCAAGAGACCAATCAAAGTTTTGGGAACGATCAGAACATTGTTCCCCATGAGGGTGTGTTTAACA
ACATTGCTGTCTCAACTTCTAGCAACTTCAGGTCGAATGTTGACGATGGCAGAGACATTGAGATTGCTGTTCAGGATGCCGTGTTGAGGGAACAGAGAATAGATTTGGAT
ATTTTCAAGTTTCTCAATCTATACACCCCTATTATGATGATAAACGATATGGAAGGAATGAGGCAGAATCCTTTTCTTTTGCACGCATGGCTAAAGTTCAAATTGATAAT
GAAAACTAAATATGTTTCCTTCCCCAGGGAATCTGTGGGTGCAGATGGACTTCCGGCTGAGCAATCAGATATCTTTTCAGAACGTTATGACCCGAGTACTATTAAAGGGA
TGCAAGAAAATGTTGGGGCCATCAGGTGGCGAATCTGGATTCCGAAGCTTAGGTTTGAGGCGTTACAGATGGAGCATCTTTTGAAGATTACTTCTGAACATCGTGCAGAA
ATGGCTATGAAAAGGGGAAAGTTGAATCTTCCAGAAGAAGATAGGTGGCCACTTTGGGACTTGAACCCTTGCCTCAAAGTCCCCAAGTTCCTTCACGAGGCCCTTTACCT
CTTTGTAACTTTCATCATCAGACCAAGTCGGGTCAATCTGAGTGAGAAAGGGAACTTGGAAATTGGAAATGGATACGGAGTACCTGGTGGATGTGCTTCCTATGGTGCTT
CAAAGCCTGGAATTGTTGCCAATGGAAATAATGTGACTGGCCAGAAAATCCAGGGACAAGTCAAGGAAGTCGAACAAAGTTCTGCTGCCAAAGCATTGCCAGAGTACCTC
AAGCAGAAGCTAAGAGCTAGGGGTATTCTTAAAGAAGATGCAGAACACAGTAATCCTGTGGAGGCTAAAGACCCTCACAGTGGCGTTTCATATTATTATAATGAAAGTAG
TGGGAAGAGTCAATGGGAAAGGCCCTCCGAACTTTCTTCTGATACGCAACTTTCATCAGCTGTATCCCTTCCAGAAGATTGGATGGAGGCAATTGATCAAACATCAGGCC
TTAAATACTATTACAATATGAGAACCCATATAACCCAGTGGGAGCGGCCTGTTGCATCTCATCAAACAACTTTGACACACTCGAATGATAAAGTTCCTGGGCCTTGGAAC
GACCAAACTTTGGAGCAAAGTAAATGCATCACATGTGGAAGTGGAATGACCCTCGTACAGGGTTCAAGATACTGCAACGCTTGTACAAGTGGGGTTTCTACAAGTTCAAC
CAATGGGATGTGGCAGGACCAATCGTCTGAGCAAAATAAATGCATGGGATGTGGTGGCTGGGGACTAGGCCTTGTGCAAGCTTGGGGTTACTGTAATCATTGCACACGAA
TTCTCAGTCTCCCCCAGTGTCAGTACTTGCCAACCAACAACATTAGTAATCAGCAGAAGACAGAGAACATCAAGCATAGTGCCGATCCCTCCATTAAAAAATCTGCGACA
GATAGGTCCAAATGGAAACCTCCAATGGGAAAAGGTGGAAAGCGAGAAAGTAGGAAGCGTTCCTACAGTGAGGATGATGAATTGGATCCAATGGATCCTAGCTCTTATTC
AGATGCTCCTCGTGGTGGCTGGGTTGTGGGTCTAAAAGGTGTGCAGCCTCGGGCAGCAGATACTACTGCTACACTGTTTGCTGTTGCAACCGTTTCACCTTATGATGGGC
AATTATCATTTCATGGTCCTCTCTTTCAGCAGCGGCCATATCCATCACCTGGAGCTGTTCTGAGGAAGAATGCTGAAATTGCTTCACAGACCAAGAAGGGAAGCTCTCAC
TATGCACCCATTTCCAAGAGGGGAGATGGAAGTGATGGCCTTGGTGATGCTGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTGTTGAAGCACTCAACGAAAACCTTCCGAATTGGAAGAAGATCACCAAGCAACATCAATAAGGGACGGCGGCGGCGGCGGCAGAGGATAGGAGAGCGGACAGGACG
GCGAATTCGGTTTGTCGGTCGGTTTATCGGGTCGTTTCGACCACCTTTCAGAAATAAAACCAGGTATTCCCAATTCGAAGCGTCGTTTCTCATCGCCGGTTCCGCCATAT
TCAAATTCTCTCGAATTCCGACTAGCAGTCGATCAATTACAGGAAAGCTTTCGAAATTGAAAGTTCCGGCGATGCCGACTTCTACTACAGCAATTGCAGGTTCAGGAGAT
TCGTCCAATACCATAATTGGTTCTAGTGCCGAAGATAAATCCCTCAAGGAATCAGCTGCTGCTCAAAATGAAGTGCAAGAACTTGAAAAGTTTAGCAAACAAATTTATCC
TTGCCAACCGGGAGAAGCTCAGTGTTCTGTGGCAATATCTGCTGATCAAGAGACCAATCAAAGTTTTGGGAACGATCAGAACATTGTTCCCCATGAGGGTGTGTTTAACA
ACATTGCTGTCTCAACTTCTAGCAACTTCAGGTCGAATGTTGACGATGGCAGAGACATTGAGATTGCTGTTCAGGATGCCGTGTTGAGGGAACAGAGAATAGATTTGGAT
ATTTTCAAGTTTCTCAATCTATACACCCCTATTATGATGATAAACGATATGGAAGGAATGAGGCAGAATCCTTTTCTTTTGCACGCATGGCTAAAGTTCAAATTGATAAT
GAAAACTAAATATGTTTCCTTCCCCAGGGAATCTGTGGGTGCAGATGGACTTCCGGCTGAGCAATCAGATATCTTTTCAGAACGTTATGACCCGAGTACTATTAAAGGGA
TGCAAGAAAATGTTGGGGCCATCAGGTGGCGAATCTGGATTCCGAAGCTTAGGTTTGAGGCGTTACAGATGGAGCATCTTTTGAAGATTACTTCTGAACATCGTGCAGAA
ATGGCTATGAAAAGGGGAAAGTTGAATCTTCCAGAAGAAGATAGGTGGCCACTTTGGGACTTGAACCCTTGCCTCAAAGTCCCCAAGTTCCTTCACGAGGCCCTTTACCT
CTTTGTAACTTTCATCATCAGACCAAGTCGGGTCAATCTGAGTGAGAAAGGGAACTTGGAAATTGGAAATGGATACGGAGTACCTGGTGGATGTGCTTCCTATGGTGCTT
CAAAGCCTGGAATTGTTGCCAATGGAAATAATGTGACTGGCCAGAAAATCCAGGGACAAGTCAAGGAAGTCGAACAAAGTTCTGCTGCCAAAGCATTGCCAGAGTACCTC
AAGCAGAAGCTAAGAGCTAGGGGTATTCTTAAAGAAGATGCAGAACACAGTAATCCTGTGGAGGCTAAAGACCCTCACAGTGGCGTTTCATATTATTATAATGAAAGTAG
TGGGAAGAGTCAATGGGAAAGGCCCTCCGAACTTTCTTCTGATACGCAACTTTCATCAGCTGTATCCCTTCCAGAAGATTGGATGGAGGCAATTGATCAAACATCAGGCC
TTAAATACTATTACAATATGAGAACCCATATAACCCAGTGGGAGCGGCCTGTTGCATCTCATCAAACAACTTTGACACACTCGAATGATAAAGTTCCTGGGCCTTGGAAC
GACCAAACTTTGGAGCAAAGTAAATGCATCACATGTGGAAGTGGAATGACCCTCGTACAGGGTTCAAGATACTGCAACGCTTGTACAAGTGGGGTTTCTACAAGTTCAAC
CAATGGGATGTGGCAGGACCAATCGTCTGAGCAAAATAAATGCATGGGATGTGGTGGCTGGGGACTAGGCCTTGTGCAAGCTTGGGGTTACTGTAATCATTGCACACGAA
TTCTCAGTCTCCCCCAGTGTCAGTACTTGCCAACCAACAACATTAGTAATCAGCAGAAGACAGAGAACATCAAGCATAGTGCCGATCCCTCCATTAAAAAATCTGCGACA
GATAGGTCCAAATGGAAACCTCCAATGGGAAAAGGTGGAAAGCGAGAAAGTAGGAAGCGTTCCTACAGTGAGGATGATGAATTGGATCCAATGGATCCTAGCTCTTATTC
AGATGCTCCTCGTGGTGGCTGGGTTGTGGGTCTAAAAGGTGTGCAGCCTCGGGCAGCAGATACTACTGCTACACTGTTTGCTGTTGCAACCGTTTCACCTTATGATGGGC
AATTATCATTTCATGGTCCTCTCTTTCAGCAGCGGCCATATCCATCACCTGGAGCTGTTCTGAGGAAGAATGCTGAAATTGCTTCACAGACCAAGAAGGGAAGCTCTCAC
TATGCACCCATTTCCAAGAGGGGAGATGGAAGTGATGGCCTTGGTGATGCTGACTGATCTCTTCTACTTTATTCTGCTACGACGATACATCGACCCCTATTTCTGTAGTG
TAGAAATGCACTTTTACTTGAGATGCGACATAGGTCGTGACATATATCTTCGACCGTATAGCCTTACATGAAGTTATGGATCAAATGCACGTCTCACTTTCTGAAGGGGA
GGATCGTGGCTGTACGATGGCACCTTCTTGTCTATATACTAGTTACTATACTTCCTTTATATGGAAAACTTGACTCGTGGACAAAACGGTCAATGAGGTAGATTTTTGAC
CTTCACATATGTACCGGAAGCGGAAGATGTGAATTACTAGTCAGGAGTAGAAAATTTGGATTTATTGTATTTTCTTTCTTCTTTTACTTCTCTCATCTAAGCATTTAGTT
CAAACATTACGATAATTTAACTTACCGTCACTTATACGACCTTAAGTTGGGCTGTACAATTGTGTCCTTTTAAATTCTAAAGAGTTGTTTATGTTAAGTATGTTAGTTTG
CTTGTCTGAACTTAGAAGCTTGTTAATCCTATAATCATGTAAACACTAAGCTTATCAATTCTAAATTTTATAATTTAGATGAATTTAAGTTCATAATACGAAGAGAGTTG
TTTGTGTTACACAGCCTTGTTGGGTTGCTTTTGGATGGTTCCTTGGGAGTTTAGCCAAAAGTGAGATGGCATTAGTTGTTGGGATTGTTGATGGAGAATCAATATTTCAA
ATAGAAATAAAGACTACA
Protein sequenceShow/hide protein sequence
MVLKHSTKTFRIGRRSPSNINKGRRRRRQRIGERTGRRIRFVGRFIGSFRPPFRNKTRYSQFEASFLIAGSAIFKFSRIPTSSRSITGKLSKLKVPAMPTSTTAIAGSGD
SSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNVDDGRDIEIAVQDAVLREQRIDLD
IFKFLNLYTPIMMINDMEGMRQNPFLLHAWLKFKLIMKTKYVSFPRESVGADGLPAEQSDIFSERYDPSTIKGMQENVGAIRWRIWIPKLRFEALQMEHLLKITSEHRAE
MAMKRGKLNLPEEDRWPLWDLNPCLKVPKFLHEALYLFVTFIIRPSRVNLSEKGNLEIGNGYGVPGGCASYGASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYL
KQKLRARGILKEDAEHSNPVEAKDPHSGVSYYYNESSGKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWN
DQTLEQSKCITCGSGMTLVQGSRYCNACTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPQCQYLPTNNISNQQKTENIKHSADPSIKKSAT
DRSKWKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATLFAVATVSPYDGQLSFHGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSH
YAPISKRGDGSDGLGDAD