; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC10G193650 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC10G193650
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionPolyglutamine tract-binding protein 1
Genome locationCiama_Chr10:28416107..28431873
RNA-Seq ExpressionCaUC10G193650
SyntenyCaUC10G193650
Gene Ontology termsGO:0005622 - intracellular (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR001202 - WW domain
IPR036020 - WW domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK25544.1 uncharacterized protein E5676_scaffold352G006960 [Cucumis melo var. makuwa]1.5e-24965.81Show/hide
Query:  MPTSTAAIAGSGDSSNTIVGSSVEDKSLKESAAAQSQDYHAQNEVQELEKFGKQIHPCQPGEAQGS------QETNRSFGNDQNIVPHDGVF-NIAVSSS
        MPTST AIAGSGDSSNTI+GSS EDKSLKESAA       AQNEVQELEKF KQI+PCQPGEAQ S      QETN+SFGNDQNIVPH+GVF NIAVS+S
Subjt:  MPTSTAAIAGSGDSSNTIVGSSVEDKSLKESAAAQSQDYHAQNEVQELEKFGKQIHPCQPGEAQGS------QETNRSFGNDQNIVPHDGVF-NIAVSSS

Query:  SKFRSHVDDTRDIDSAVQDAVLREQETALWKVVIAYEETALWRAVIASIYHCEPHGWMTCPSKGAIKARPWDLGFRRDIKDNKFDSWVGLGSMIDTIRLR
        S FRS+VDD RDI+ AVQDAVLRE                                                                            
Subjt:  SKFRSHVDDTRDIDSAVQDAVLREQETALWKVVIAYEETALWRAVIASIYHCEPHGWMTCPSKGAIKARPWDLGFRRDIKDNKFDSWVGLGSMIDTIRLR

Query:  DGIDKDCCSLENSESLSLLTKMTRIELNLYGETEAGWLLRISICIFRGRIPFFTWETRVSFSFPMVPQELATQNIIRSQRDSVGADGLPVERSDIFSERY
                                                                           QELATQNIIRSQR+SVGADGLP E+SDIFSERY
Subjt:  DGIDKDCCSLENSESLSLLTKMTRIELNLYGETEAGWLLRISICIFRGRIPFFTWETRVSFSFPMVPQELATQNIIRSQRDSVGADGLPVERSDIFSERY

Query:  DPSTLKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVANGNNAIGQKIQGQVREAEQSSVAKALPEYLKQKLRARGILK
        DPST+KEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCA YGASKPGIVANGNN  GQKIQGQV+E EQSS AKALPEYLKQKLRARGILK
Subjt:  DPSTLKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVANGNNAIGQKIQGQVREAEQSSVAKALPEYLKQKLRARGILK

Query:  EDADHNNSTNSDAISNQLLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDAQLPSAASLPEDWMEALDQTTGLKYYYNMRTQVTQWEPPV
        EDA+H+N TNSDA+SN  L GEKLPHGWVEAKDP SGVSYYYNESSGKSQWERPSE SSD QL SA SLPEDWMEA+DQT+GLKYYYNMRT +TQWE PV
Subjt:  EDADHNNSTNSDAISNQLLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDAQLPSAASLPEDWMEALDQTTGLKYYYNMRTQVTQWEPPV

Query:  ASHQTTLPHSNDNVPGSWNNQTLEQSKCITCGSGMTLVQGSRYCNGCTSGVSTSSTNGKWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILGLPQCQ
        ASHQTTL HSND VPG WN+QTLEQSKCITCGSGMTLVQGSRYCN CTSGVSTSSTNG WQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRIL LPQCQ
Subjt:  ASHQTTLPHSNDNVPGSWNNQTLEQSKCITCGSGMTLVQGSRYCNGCTSGVSTSSTNGKWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILGLPQCQ

Query:  YLSTNNISNQQKTENVKHSADPSIRKPATER------------------------------------------VVGLKGVQPRAADTTATGPLFQQRPYP
        YL TNNISNQQKTEN+KHSADPSI+K AT+R                                          VVGLKGVQPRAADTTATGPLFQQRPYP
Subjt:  YLSTNNISNQQKTENVKHSADPSIRKPATER------------------------------------------VVGLKGVQPRAADTTATGPLFQQRPYP

Query:  SPGAVLRKNAEIASQTKKGSSQYAPISKRGDGSDGLGDAD
        SPGAVLRKNAEIASQTKKGSS YAPISKRGDGSDGLGDAD
Subjt:  SPGAVLRKNAEIASQTKKGSSQYAPISKRGDGSDGLGDAD

XP_004136655.1 uncharacterized protein LOC101203374 [Cucumis sativus]1.1e-24965.99Show/hide
Query:  MPTSTAAIAGSGDSSNTIVGSSVEDKSLKESAAAQSQDYHAQNEVQELEKFGKQIHPCQPGEAQGS------QETNRSFGNDQNIVPHDGVF-NIAVSSS
        MPTSTA IAGSGDSSNTI+GSS EDKSLKESAAAQSQ Y AQNEVQELEK  KQ++PCQPGEAQG+      QETNRS GNDQNIVPH G F NIAVSSS
Subjt:  MPTSTAAIAGSGDSSNTIVGSSVEDKSLKESAAAQSQDYHAQNEVQELEKFGKQIHPCQPGEAQGS------QETNRSFGNDQNIVPHDGVF-NIAVSSS

Query:  SKFRSHVDDTRDIDSAVQDAVLREQETALWKVVIAYEETALWRAVIASIYHCEPHGWMTCPSKGAIKARPWDLGFRRDIKDNKFDSWVGLGSMIDTIRLR
        S FRS+VDD RDID AVQDAVLRE                                                                            
Subjt:  SKFRSHVDDTRDIDSAVQDAVLREQETALWKVVIAYEETALWRAVIASIYHCEPHGWMTCPSKGAIKARPWDLGFRRDIKDNKFDSWVGLGSMIDTIRLR

Query:  DGIDKDCCSLENSESLSLLTKMTRIELNLYGETEAGWLLRISICIFRGRIPFFTWETRVSFSFPMVPQELATQNIIRSQRDSVGADGLPVERSDIFSERY
                                                                           QELATQNIIRSQRDSVGADGLPVERSDIFSERY
Subjt:  DGIDKDCCSLENSESLSLLTKMTRIELNLYGETEAGWLLRISICIFRGRIPFFTWETRVSFSFPMVPQELATQNIIRSQRDSVGADGLPVERSDIFSERY

Query:  DPSTLKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVANGNNAIGQKIQGQVREAEQSSVAKALPEYLKQKLRARGILK
        DPS+LKEHLLKITSEHRAEMA+KRGKLNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVANGNN  GQKIQGQ++EAEQSS +KALPEYLKQKLRARGILK
Subjt:  DPSTLKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVANGNNAIGQKIQGQVREAEQSSVAKALPEYLKQKLRARGILK

Query:  EDADHNNS----TNSDAISNQLLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDAQLPSAASLPEDWMEALDQTTGLKYYYNMRTQVTQW
        EDA+H+NS    TNSDA+SN  LQGEKLPHGWVEAKDP SGVSYYYNESSGKSQWERPSE SS+ QL SA SLPEDWMEA+DQT+G+KYYYNMRT VTQW
Subjt:  EDADHNNS----TNSDAISNQLLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDAQLPSAASLPEDWMEALDQTTGLKYYYNMRTQVTQW

Query:  EPPVASHQTTLPHSNDNVPGSWNNQTLEQSKCITCGSGMTLVQGSRYCNGCTSGVSTSSTNGKWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILGL
        E PVASHQTTL HSND  PG WN+QTLEQSKCITCGSGMTLVQGSRYCN CTSGVSTSSTNG WQDQ SEQNKCMGCGGWGLGLVQAWGYC HCTRILGL
Subjt:  EPPVASHQTTLPHSNDNVPGSWNNQTLEQSKCITCGSGMTLVQGSRYCNGCTSGVSTSSTNGKWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILGL

Query:  PQCQYLSTNNISNQQKTENVKHSADPSIRKPATER------------------------------------------VVGLKGVQPRAADTTATGPLFQQ
        PQCQYL TNNISNQQK ENVKHSADPSI+K  T+R                                          VVGLKGVQPRAADTTATGPLFQQ
Subjt:  PQCQYLSTNNISNQQKTENVKHSADPSIRKPATER------------------------------------------VVGLKGVQPRAADTTATGPLFQQ

Query:  RPYPSPGAVLRKNAEIASQTKKGSSQYAPISKRGDGSDGLGDAD
        RPYPSPGAVLRKNAEIASQTKKGSS YAPISKRGDGSDGLGDAD
Subjt:  RPYPSPGAVLRKNAEIASQTKKGSSQYAPISKRGDGSDGLGDAD

XP_038906172.1 uncharacterized protein LOC120092051 isoform X1 [Benincasa hispida]2.0e-25162.76Show/hide
Query:  MPTSTAAIAGSGDSSNTIVGSSVEDKSLKESAAAQSQDYHAQNEVQELEKFGKQIHPCQPGEAQGSQETNRSFGNDQNIVPHDGVFNIAVSSSSKFRSHV
        MPT+TAAIAGSGDSSNTI+GSSVEDKSLKE AAAQSQ YH QNEVQELEK GK I+PCQ GEAQGSQETNRSFGND +IVPHD VFNIAVSSSSKFRSHV
Subjt:  MPTSTAAIAGSGDSSNTIVGSSVEDKSLKESAAAQSQDYHAQNEVQELEKFGKQIHPCQPGEAQGSQETNRSFGNDQNIVPHDGVFNIAVSSSSKFRSHV

Query:  DDTRDIDSAVQDAVLREQETALWKVVIAYEETALWRAVIASIYHCEPHGWMTCPSKGAIKARPWDLGFRRDIKDNKFDSWVGLGSMIDTIRLRDGIDKDC
        +DTRDIDSAVQDAVLRE                                                                                   
Subjt:  DDTRDIDSAVQDAVLREQETALWKVVIAYEETALWRAVIASIYHCEPHGWMTCPSKGAIKARPWDLGFRRDIKDNKFDSWVGLGSMIDTIRLRDGIDKDC

Query:  CSLENSESLSLLTKMTRIELNLYGETEAGWLLRISICIFRGRIPFFTWETRVSFSFPMVPQELATQNIIRSQRDSVGADGLPVERSDIFSERYDPSTLK-
                                                                    QELATQNIIRSQRDSVGADGLPVERSDIFSERYDPST+K 
Subjt:  CSLENSESLSLLTKMTRIELNLYGETEAGWLLRISICIFRGRIPFFTWETRVSFSFPMVPQELATQNIIRSQRDSVGADGLPVERSDIFSERYDPSTLK-

Query:  ---------------------------------------------------------------------EHLLKITSEHRAEMAMKRGKLNLPEEGNLEI
                                                                             EHLLKITSEHRAEMAMKRGK NLPEEGNLEI
Subjt:  ---------------------------------------------------------------------EHLLKITSEHRAEMAMKRGKLNLPEEGNLEI

Query:  GNGYGVPGGCAFYGASKPGIVANGNNAIGQKIQGQVREAEQSSVAKALPEYLKQKLRARGILKEDADHNNSTNSDAISNQLLQGEKLPHGWVEAKDPGSG
        GNGYGVPGGCAFYGASKPG+VA GNN IGQKIQGQVRE EQSS  KALPEYLKQKLRARGILKE+A+H+NST+SDAISNQ LQGEKLPHGWVEAKDPGSG
Subjt:  GNGYGVPGGCAFYGASKPGIVANGNNAIGQKIQGQVREAEQSSVAKALPEYLKQKLRARGILKEDADHNNSTNSDAISNQLLQGEKLPHGWVEAKDPGSG

Query:  VSYYYNESSGKSQWERPSESSSDAQLPSAASLPEDWMEALDQTTGLKYYYNMRTQVTQWEPPVASHQTTLPHSNDNVPGSWNNQTLEQSKCITCGSGMTL
        VSYYYNESSGKSQWERPSESSSD QL SA SLPEDWMEALDQ TGLKYYYNMRTQVTQWEPPVASHQTTL HSNDNV GSWNNQTLEQSKCITCGSG+TL
Subjt:  VSYYYNESSGKSQWERPSESSSDAQLPSAASLPEDWMEALDQTTGLKYYYNMRTQVTQWEPPVASHQTTLPHSNDNVPGSWNNQTLEQSKCITCGSGMTL

Query:  VQGSRYCNGCTSGVSTSSTNGKWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILGLPQCQYLSTNNISNQQKTENVKHSADPSIRKPATER------
        VQGSRYCN CTSGVSTSSTNG+WQDQSSEQNKCMGC GWGLGLVQAWGYCNHCTRILGLPQCQYL T+NISNQQKTEN+KHSADPSI+K AT+       
Subjt:  VQGSRYCNGCTSGVSTSSTNGKWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILGLPQCQYLSTNNISNQQKTENVKHSADPSIRKPATER------

Query:  ------------------------------------VVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSQYAPISKRGDGSDGLG
                                            VVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSS YAPISKRGDGSDGLG
Subjt:  ------------------------------------VVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSQYAPISKRGDGSDGLG

Query:  DAD
        DAD
Subjt:  DAD

XP_038906174.1 uncharacterized protein LOC120092051 isoform X2 [Benincasa hispida]2.0e-25464.86Show/hide
Query:  MPTSTAAIAGSGDSSNTIVGSSVEDKSLKESAAAQSQDYHAQNEVQELEKFGKQIHPCQPGEAQGSQETNRSFGNDQNIVPHDGVFNIAVSSSSKFRSHV
        MPT+TAAIAGSGDSSNTI+GSSVEDKSLKE AAAQSQ YH QNEVQELEK GK I+PCQ GEAQGSQETNRSFGND +IVPHD VFNIAVSSSSKFRSHV
Subjt:  MPTSTAAIAGSGDSSNTIVGSSVEDKSLKESAAAQSQDYHAQNEVQELEKFGKQIHPCQPGEAQGSQETNRSFGNDQNIVPHDGVFNIAVSSSSKFRSHV

Query:  DDTRDIDSAVQDAVLREQETALWKVVIAYEETALWRAVIASIYHCEPHGWMTCPSKGAIKARPWDLGFRRDIKDNKFDSWVGLGSMIDTIRLRDGIDKDC
        +DTRDIDSAVQDAVLRE                                                                                   
Subjt:  DDTRDIDSAVQDAVLREQETALWKVVIAYEETALWRAVIASIYHCEPHGWMTCPSKGAIKARPWDLGFRRDIKDNKFDSWVGLGSMIDTIRLRDGIDKDC

Query:  CSLENSESLSLLTKMTRIELNLYGETEAGWLLRISICIFRGRIPFFTWETRVSFSFPMVPQELATQNIIRSQRDSVGADGLPVERSDIFSERYDPST---
                                                                    QELATQNIIRSQRDSVGADGLPVERSDIFSERYDPST   
Subjt:  CSLENSESLSLLTKMTRIELNLYGETEAGWLLRISICIFRGRIPFFTWETRVSFSFPMVPQELATQNIIRSQRDSVGADGLPVERSDIFSERYDPST---

Query:  -----------------------------------------LKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVANGNN
                                                 +KEHLLKITSEHRAEMAMKRGK NLPEEGNLEIGNGYGVPGGCAFYGASKPG+VA GNN
Subjt:  -----------------------------------------LKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVANGNN

Query:  AIGQKIQGQVREAEQSSVAKALPEYLKQKLRARGILKEDADHNNSTNSDAISNQLLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDAQL
         IGQKIQGQVRE EQSS  KALPEYLKQKLRARGILKE+A+H+NST+SDAISNQ LQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSD QL
Subjt:  AIGQKIQGQVREAEQSSVAKALPEYLKQKLRARGILKEDADHNNSTNSDAISNQLLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDAQL

Query:  PSAASLPEDWMEALDQTTGLKYYYNMRTQVTQWEPPVASHQTTLPHSNDNVPGSWNNQTLEQSKCITCGSGMTLVQGSRYCNGCTSGVSTSSTNGKWQDQ
         SA SLPEDWMEALDQ TGLKYYYNMRTQVTQWEPPVASHQTTL HSNDNV GSWNNQTLEQSKCITCGSG+TLVQGSRYCN CTSGVSTSSTNG+WQDQ
Subjt:  PSAASLPEDWMEALDQTTGLKYYYNMRTQVTQWEPPVASHQTTLPHSNDNVPGSWNNQTLEQSKCITCGSGMTLVQGSRYCNGCTSGVSTSSTNGKWQDQ

Query:  SSEQNKCMGCGGWGLGLVQAWGYCNHCTRILGLPQCQYLSTNNISNQQKTENVKHSADPSIRKPATER--------------------------------
        SSEQNKCMGC GWGLGLVQAWGYCNHCTRILGLPQCQYL T+NISNQQKTEN+KHSADPSI+K AT+                                 
Subjt:  SSEQNKCMGCGGWGLGLVQAWGYCNHCTRILGLPQCQYLSTNNISNQQKTENVKHSADPSIRKPATER--------------------------------

Query:  ----------VVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSQYAPISKRGDGSDGLGDAD
                  VVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSS YAPISKRGDGSDGLGDAD
Subjt:  ----------VVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSQYAPISKRGDGSDGLGDAD

XP_038906175.1 uncharacterized protein LOC120092051 isoform X3 [Benincasa hispida]8.2e-26168.76Show/hide
Query:  MPTSTAAIAGSGDSSNTIVGSSVEDKSLKESAAAQSQDYHAQNEVQELEKFGKQIHPCQPGEAQGSQETNRSFGNDQNIVPHDGVFNIAVSSSSKFRSHV
        MPT+TAAIAGSGDSSNTI+GSSVEDKSLKE AAAQSQ YH QNEVQELEK GK I+PCQ GEAQGSQETNRSFGND +IVPHD VFNIAVSSSSKFRSHV
Subjt:  MPTSTAAIAGSGDSSNTIVGSSVEDKSLKESAAAQSQDYHAQNEVQELEKFGKQIHPCQPGEAQGSQETNRSFGNDQNIVPHDGVFNIAVSSSSKFRSHV

Query:  DDTRDIDSAVQDAVLREQETALWKVVIAYEETALWRAVIASIYHCEPHGWMTCPSKGAIKARPWDLGFRRDIKDNKFDSWVGLGSMIDTIRLRDGIDKDC
        +DTRDIDSAVQDAVLRE                                                                                   
Subjt:  DDTRDIDSAVQDAVLREQETALWKVVIAYEETALWRAVIASIYHCEPHGWMTCPSKGAIKARPWDLGFRRDIKDNKFDSWVGLGSMIDTIRLRDGIDKDC

Query:  CSLENSESLSLLTKMTRIELNLYGETEAGWLLRISICIFRGRIPFFTWETRVSFSFPMVPQELATQNIIRSQRDSVGADGLPVERSDIFSERYDPSTLKE
                                                                    QELATQNIIRSQRDSVGADGLPVERSDIFSERYDPST+KE
Subjt:  CSLENSESLSLLTKMTRIELNLYGETEAGWLLRISICIFRGRIPFFTWETRVSFSFPMVPQELATQNIIRSQRDSVGADGLPVERSDIFSERYDPSTLKE

Query:  HLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVANGNNAIGQKIQGQVREAEQSSVAKALPEYLKQKLRARGILKEDADHNN
        HLLKITSEHRAEMAMKRGK NLPEEGNLEIGNGYGVPGGCAFYGASKPG+VA GNN IGQKIQGQVRE EQSS  KALPEYLKQKLRARGILKE+A+H+N
Subjt:  HLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVANGNNAIGQKIQGQVREAEQSSVAKALPEYLKQKLRARGILKEDADHNN

Query:  STNSDAISNQLLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDAQLPSAASLPEDWMEALDQTTGLKYYYNMRTQVTQWEPPVASHQTTL
        ST+SDAISNQ LQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSD QL SA SLPEDWMEALDQ TGLKYYYNMRTQVTQWEPPVASHQTTL
Subjt:  STNSDAISNQLLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDAQLPSAASLPEDWMEALDQTTGLKYYYNMRTQVTQWEPPVASHQTTL

Query:  PHSNDNVPGSWNNQTLEQSKCITCGSGMTLVQGSRYCNGCTSGVSTSSTNGKWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILGLPQCQYLSTNNI
         HSNDNV GSWNNQTLEQSKCITCGSG+TLVQGSRYCN CTSGVSTSSTNG+WQDQSSEQNKCMGC GWGLGLVQAWGYCNHCTRILGLPQCQYL T+NI
Subjt:  PHSNDNVPGSWNNQTLEQSKCITCGSGMTLVQGSRYCNGCTSGVSTSSTNGKWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILGLPQCQYLSTNNI

Query:  SNQQKTENVKHSADPSIRKPATER------------------------------------------VVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLR
        SNQQKTEN+KHSADPSI+K AT+                                           VVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLR
Subjt:  SNQQKTENVKHSADPSIRKPATER------------------------------------------VVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLR

Query:  KNAEIASQTKKGSSQYAPISKRGDGSDGLGDAD
        KNAEIASQTKKGSS YAPISKRGDGSDGLGDAD
Subjt:  KNAEIASQTKKGSSQYAPISKRGDGSDGLGDAD

TrEMBL top hitse value%identityAlignment
A0A0A0LFL2 Polyglutamine tract-binding protein 15.4e-25065.99Show/hide
Query:  MPTSTAAIAGSGDSSNTIVGSSVEDKSLKESAAAQSQDYHAQNEVQELEKFGKQIHPCQPGEAQGS------QETNRSFGNDQNIVPHDGVF-NIAVSSS
        MPTSTA IAGSGDSSNTI+GSS EDKSLKESAAAQSQ Y AQNEVQELEK  KQ++PCQPGEAQG+      QETNRS GNDQNIVPH G F NIAVSSS
Subjt:  MPTSTAAIAGSGDSSNTIVGSSVEDKSLKESAAAQSQDYHAQNEVQELEKFGKQIHPCQPGEAQGS------QETNRSFGNDQNIVPHDGVF-NIAVSSS

Query:  SKFRSHVDDTRDIDSAVQDAVLREQETALWKVVIAYEETALWRAVIASIYHCEPHGWMTCPSKGAIKARPWDLGFRRDIKDNKFDSWVGLGSMIDTIRLR
        S FRS+VDD RDID AVQDAVLRE                                                                            
Subjt:  SKFRSHVDDTRDIDSAVQDAVLREQETALWKVVIAYEETALWRAVIASIYHCEPHGWMTCPSKGAIKARPWDLGFRRDIKDNKFDSWVGLGSMIDTIRLR

Query:  DGIDKDCCSLENSESLSLLTKMTRIELNLYGETEAGWLLRISICIFRGRIPFFTWETRVSFSFPMVPQELATQNIIRSQRDSVGADGLPVERSDIFSERY
                                                                           QELATQNIIRSQRDSVGADGLPVERSDIFSERY
Subjt:  DGIDKDCCSLENSESLSLLTKMTRIELNLYGETEAGWLLRISICIFRGRIPFFTWETRVSFSFPMVPQELATQNIIRSQRDSVGADGLPVERSDIFSERY

Query:  DPSTLKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVANGNNAIGQKIQGQVREAEQSSVAKALPEYLKQKLRARGILK
        DPS+LKEHLLKITSEHRAEMA+KRGKLNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVANGNN  GQKIQGQ++EAEQSS +KALPEYLKQKLRARGILK
Subjt:  DPSTLKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVANGNNAIGQKIQGQVREAEQSSVAKALPEYLKQKLRARGILK

Query:  EDADHNNS----TNSDAISNQLLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDAQLPSAASLPEDWMEALDQTTGLKYYYNMRTQVTQW
        EDA+H+NS    TNSDA+SN  LQGEKLPHGWVEAKDP SGVSYYYNESSGKSQWERPSE SS+ QL SA SLPEDWMEA+DQT+G+KYYYNMRT VTQW
Subjt:  EDADHNNS----TNSDAISNQLLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDAQLPSAASLPEDWMEALDQTTGLKYYYNMRTQVTQW

Query:  EPPVASHQTTLPHSNDNVPGSWNNQTLEQSKCITCGSGMTLVQGSRYCNGCTSGVSTSSTNGKWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILGL
        E PVASHQTTL HSND  PG WN+QTLEQSKCITCGSGMTLVQGSRYCN CTSGVSTSSTNG WQDQ SEQNKCMGCGGWGLGLVQAWGYC HCTRILGL
Subjt:  EPPVASHQTTLPHSNDNVPGSWNNQTLEQSKCITCGSGMTLVQGSRYCNGCTSGVSTSSTNGKWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILGL

Query:  PQCQYLSTNNISNQQKTENVKHSADPSIRKPATER------------------------------------------VVGLKGVQPRAADTTATGPLFQQ
        PQCQYL TNNISNQQK ENVKHSADPSI+K  T+R                                          VVGLKGVQPRAADTTATGPLFQQ
Subjt:  PQCQYLSTNNISNQQKTENVKHSADPSIRKPATER------------------------------------------VVGLKGVQPRAADTTATGPLFQQ

Query:  RPYPSPGAVLRKNAEIASQTKKGSSQYAPISKRGDGSDGLGDAD
        RPYPSPGAVLRKNAEIASQTKKGSS YAPISKRGDGSDGLGDAD
Subjt:  RPYPSPGAVLRKNAEIASQTKKGSSQYAPISKRGDGSDGLGDAD

A0A5A7UK56 Polyglutamine tract-binding protein 11.6e-23862.44Show/hide
Query:  MPTSTAAIAGSGDSSNTIVGSSVEDKSLKESAAAQSQDYHAQNEVQELEKFGKQIHPCQPGEAQGS------QETNRSFGNDQNIVPHDGVF-NIAVSSS
        MPTST AIAGSGDSSNTI+GSS EDKSLKESAA       AQNEVQELEKF KQI+PCQPGEAQ S      QETN+SFGNDQNIVPH+GVF NIAVS+S
Subjt:  MPTSTAAIAGSGDSSNTIVGSSVEDKSLKESAAAQSQDYHAQNEVQELEKFGKQIHPCQPGEAQGS------QETNRSFGNDQNIVPHDGVF-NIAVSSS

Query:  SKFRSHVDDTRDIDSAVQDAVLREQETALWKVVIAYEETALWRAVIASIYHCEPHGWMTCPSKGAIKARPWDLGFRRDIKDNKFDSWVGLGSMIDTIRLR
        S FRS+VDD RDI+ AVQDAVLREQ   + K                                                                     
Subjt:  SKFRSHVDDTRDIDSAVQDAVLREQETALWKVVIAYEETALWRAVIASIYHCEPHGWMTCPSKGAIKARPWDLGFRRDIKDNKFDSWVGLGSMIDTIRLR

Query:  DGIDKDCCSLENSESLSLLTKMTRIELNLYGETEAGWLLRISICIFRGRIPFFTWETRVSFSFPMVPQELATQNIIRSQRDSVGADGLPVERSDIFSERY
                           TK                                        SFP               R+SVGADGLP E+SDIFSERY
Subjt:  DGIDKDCCSLENSESLSLLTKMTRIELNLYGETEAGWLLRISICIFRGRIPFFTWETRVSFSFPMVPQELATQNIIRSQRDSVGADGLPVERSDIFSERY

Query:  DPSTLKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVAN----------------------------GNNAIGQKIQGQ
        DPST+KEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCA YGASKPGIVAN                            GNN  GQKIQGQ
Subjt:  DPSTLKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVAN----------------------------GNNAIGQKIQGQ

Query:  VREAEQSSVAKALPEYLKQKLRARGILKEDADHNN----STNSDAISNQLLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDAQLPSAAS
        V+E EQSS AKALPEYLKQKLRARGILKEDA+H+N     TNSDA+SN  L GEKLPHGWVEAKDP SGVSYYYNESSGKSQWERPSE SSD QL SA S
Subjt:  VREAEQSSVAKALPEYLKQKLRARGILKEDADHNN----STNSDAISNQLLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDAQLPSAAS

Query:  LPEDWMEALDQTTGLKYYYNMRTQVTQWEPPVASHQTTLPHSNDNVPGSWNNQTLEQSKCITCGSGMTLVQGSRYCNGCTSGVSTSSTNGKWQDQSSEQN
        LPEDWMEA+DQT+GLKYYYNMRT +TQWE PVASHQTTL HSND VPG WN+QTLEQSKCITCGSGMTLVQGSRYCN CTSGVSTSSTNG WQDQSSEQN
Subjt:  LPEDWMEALDQTTGLKYYYNMRTQVTQWEPPVASHQTTLPHSNDNVPGSWNNQTLEQSKCITCGSGMTLVQGSRYCNGCTSGVSTSSTNGKWQDQSSEQN

Query:  KCMGCGGWGLGLVQAWGYCNHCTRILGLPQCQYLSTNNISNQQKTENVKHSADPSIRKPATER-------------------------------------
        KCMGCGGWGLGLVQAWGYCNHCTRIL LPQCQYL TNNISNQQKTEN+KHSADPSI+K AT+R                                     
Subjt:  KCMGCGGWGLGLVQAWGYCNHCTRILGLPQCQYLSTNNISNQQKTENVKHSADPSIRKPATER-------------------------------------

Query:  -----VVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSQYAPISKRGDGSDGLGDAD
             VVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSS YAPISKRGDGSDGLGDAD
Subjt:  -----VVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSQYAPISKRGDGSDGLGDAD

A0A5D3DPP7 Polyglutamine tract-binding protein 17.0e-25065.81Show/hide
Query:  MPTSTAAIAGSGDSSNTIVGSSVEDKSLKESAAAQSQDYHAQNEVQELEKFGKQIHPCQPGEAQGS------QETNRSFGNDQNIVPHDGVF-NIAVSSS
        MPTST AIAGSGDSSNTI+GSS EDKSLKESAA       AQNEVQELEKF KQI+PCQPGEAQ S      QETN+SFGNDQNIVPH+GVF NIAVS+S
Subjt:  MPTSTAAIAGSGDSSNTIVGSSVEDKSLKESAAAQSQDYHAQNEVQELEKFGKQIHPCQPGEAQGS------QETNRSFGNDQNIVPHDGVF-NIAVSSS

Query:  SKFRSHVDDTRDIDSAVQDAVLREQETALWKVVIAYEETALWRAVIASIYHCEPHGWMTCPSKGAIKARPWDLGFRRDIKDNKFDSWVGLGSMIDTIRLR
        S FRS+VDD RDI+ AVQDAVLRE                                                                            
Subjt:  SKFRSHVDDTRDIDSAVQDAVLREQETALWKVVIAYEETALWRAVIASIYHCEPHGWMTCPSKGAIKARPWDLGFRRDIKDNKFDSWVGLGSMIDTIRLR

Query:  DGIDKDCCSLENSESLSLLTKMTRIELNLYGETEAGWLLRISICIFRGRIPFFTWETRVSFSFPMVPQELATQNIIRSQRDSVGADGLPVERSDIFSERY
                                                                           QELATQNIIRSQR+SVGADGLP E+SDIFSERY
Subjt:  DGIDKDCCSLENSESLSLLTKMTRIELNLYGETEAGWLLRISICIFRGRIPFFTWETRVSFSFPMVPQELATQNIIRSQRDSVGADGLPVERSDIFSERY

Query:  DPSTLKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVANGNNAIGQKIQGQVREAEQSSVAKALPEYLKQKLRARGILK
        DPST+KEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCA YGASKPGIVANGNN  GQKIQGQV+E EQSS AKALPEYLKQKLRARGILK
Subjt:  DPSTLKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVANGNNAIGQKIQGQVREAEQSSVAKALPEYLKQKLRARGILK

Query:  EDADHNNSTNSDAISNQLLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDAQLPSAASLPEDWMEALDQTTGLKYYYNMRTQVTQWEPPV
        EDA+H+N TNSDA+SN  L GEKLPHGWVEAKDP SGVSYYYNESSGKSQWERPSE SSD QL SA SLPEDWMEA+DQT+GLKYYYNMRT +TQWE PV
Subjt:  EDADHNNSTNSDAISNQLLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDAQLPSAASLPEDWMEALDQTTGLKYYYNMRTQVTQWEPPV

Query:  ASHQTTLPHSNDNVPGSWNNQTLEQSKCITCGSGMTLVQGSRYCNGCTSGVSTSSTNGKWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILGLPQCQ
        ASHQTTL HSND VPG WN+QTLEQSKCITCGSGMTLVQGSRYCN CTSGVSTSSTNG WQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRIL LPQCQ
Subjt:  ASHQTTLPHSNDNVPGSWNNQTLEQSKCITCGSGMTLVQGSRYCNGCTSGVSTSSTNGKWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILGLPQCQ

Query:  YLSTNNISNQQKTENVKHSADPSIRKPATER------------------------------------------VVGLKGVQPRAADTTATGPLFQQRPYP
        YL TNNISNQQKTEN+KHSADPSI+K AT+R                                          VVGLKGVQPRAADTTATGPLFQQRPYP
Subjt:  YLSTNNISNQQKTENVKHSADPSIRKPATER------------------------------------------VVGLKGVQPRAADTTATGPLFQQRPYP

Query:  SPGAVLRKNAEIASQTKKGSSQYAPISKRGDGSDGLGDAD
        SPGAVLRKNAEIASQTKKGSS YAPISKRGDGSDGLGDAD
Subjt:  SPGAVLRKNAEIASQTKKGSSQYAPISKRGDGSDGLGDAD

A0A6J1F9X9 Polyglutamine tract-binding protein 12.3e-24063.73Show/hide
Query:  MPTSTAAIAGSGDSSNTIVGSSVEDKSLKESAAAQSQDYHAQNEVQELEKFGKQIHPCQPGEAQGS------QETNRSFGNDQNIVPHDGVFNIAVSSSS
        MPTSTAAIA  GDSS T +GSSVED SLKES +AQSQ Y AQNEVQELEKFG QI PCQPGE + S      QE   SFGNDQNIVPHDGVFNIAVSSSS
Subjt:  MPTSTAAIAGSGDSSNTIVGSSVEDKSLKESAAAQSQDYHAQNEVQELEKFGKQIHPCQPGEAQGS------QETNRSFGNDQNIVPHDGVFNIAVSSSS

Query:  KFRSHVDDTRDIDSAVQDAVLREQETALWKVVIAYEETALWRAVIASIYHCEPHGWMTCPSKGAIKARPWDLGFRRDIKDNKFDSWVGLGSMIDTIRLRD
        KF SHV DTRDID+AV+DAVLRE                                                                             
Subjt:  KFRSHVDDTRDIDSAVQDAVLREQETALWKVVIAYEETALWRAVIASIYHCEPHGWMTCPSKGAIKARPWDLGFRRDIKDNKFDSWVGLGSMIDTIRLRD

Query:  GIDKDCCSLENSESLSLLTKMTRIELNLYGETEAGWLLRISICIFRGRIPFFTWETRVSFSFPMVPQELATQNIIRSQRDSVGADGLPVERSDIFSERYD
                                                                          QELATQNIIRS+RDSV ADGLP ERSDIFSERYD
Subjt:  GIDKDCCSLENSESLSLLTKMTRIELNLYGETEAGWLLRISICIFRGRIPFFTWETRVSFSFPMVPQELATQNIIRSQRDSVGADGLPVERSDIFSERYD

Query:  PSTLKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVANGNNAIGQKIQGQVREAEQSSVAKALPEYLKQKLRARGILKE
        PS LKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCAFYGASKPGIV +GNN I QKIQGQVREAEQS  AK LPEYLKQKL+ARGILKE
Subjt:  PSTLKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVANGNNAIGQKIQGQVREAEQSSVAKALPEYLKQKLRARGILKE

Query:  DADHNNSTNSDAISNQLLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDAQLPSAASLPEDWMEALDQTTGLKYYYNMRTQVTQWEPPVA
        DA H+NS NSDAISNQ+LQGEKLPHGWVEAKDPGSGVSYYYNES+GKSQWERP+ESS   QL SA SLPEDWMEA+DQTTG +YYYN RTQVTQWEPPVA
Subjt:  DADHNNSTNSDAISNQLLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDAQLPSAASLPEDWMEALDQTTGLKYYYNMRTQVTQWEPPVA

Query:  SHQTTLPHSNDNVPGSWNNQTLEQSKCITCGSGMTLVQGSRYCNGCTSGVSTSSTNGKWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILGLPQCQY
        SHQ TL HS  + PGSWN+QT  QSKC+TCGSGMTLVQG+RYCN C SGVSTSSTNGKWQDQ S+Q+KCMGCGGWGLGLVQAWGYCNHCTR LGLPQCQY
Subjt:  SHQTTLPHSNDNVPGSWNNQTLEQSKCITCGSGMTLVQGSRYCNGCTSGVSTSSTNGKWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILGLPQCQY

Query:  LSTNNISNQQKTENVKHSADPSIRKPATER------------------------------------------VVGLKGVQPRAADTTATGPLFQQRPYPS
        L T+NI NQQKTEN+K++ADPSI+K A++R                                          VVGLKGVQPRAADTTATGPLFQQRPYPS
Subjt:  LSTNNISNQQKTENVKHSADPSIRKPATER------------------------------------------VVGLKGVQPRAADTTATGPLFQQRPYPS

Query:  PGAVLRKNAEIASQTKKGSSQYAPISKRGDGSDGLGDAD
        PGAVLRKNAEIASQTKKGSS YAPISKRGDGSDGLGDAD
Subjt:  PGAVLRKNAEIASQTKKGSSQYAPISKRGDGSDGLGDAD

A0A6J1J063 Polyglutamine tract-binding protein 12.5e-23963.33Show/hide
Query:  MPTSTAAIAGSGDSSNTIVGSSVEDKSLKESAAAQSQDYHAQNEVQELEKFGKQIHPCQPGEAQGS------QETNRSFGNDQNIVPHDGVFNIAVSSSS
        MPTSTAAIA SGDSS T +GSSVED SLKES +AQSQ Y AQNEVQELEKFG QI PCQPGE   S      QE   SFGNDQNIVPH GVFNIAVSSSS
Subjt:  MPTSTAAIAGSGDSSNTIVGSSVEDKSLKESAAAQSQDYHAQNEVQELEKFGKQIHPCQPGEAQGS------QETNRSFGNDQNIVPHDGVFNIAVSSSS

Query:  KFRSHVDDTRDIDSAVQDAVLREQETALWKVVIAYEETALWRAVIASIYHCEPHGWMTCPSKGAIKARPWDLGFRRDIKDNKFDSWVGLGSMIDTIRLRD
        KF SHV DTRDID+AV+DAVLRE                                                                             
Subjt:  KFRSHVDDTRDIDSAVQDAVLREQETALWKVVIAYEETALWRAVIASIYHCEPHGWMTCPSKGAIKARPWDLGFRRDIKDNKFDSWVGLGSMIDTIRLRD

Query:  GIDKDCCSLENSESLSLLTKMTRIELNLYGETEAGWLLRISICIFRGRIPFFTWETRVSFSFPMVPQELATQNIIRSQRDSVGADGLPVERSDIFSERYD
                                                                          QELATQNIIRSQRDSVGADGLP ERSDIFSERYD
Subjt:  GIDKDCCSLENSESLSLLTKMTRIELNLYGETEAGWLLRISICIFRGRIPFFTWETRVSFSFPMVPQELATQNIIRSQRDSVGADGLPVERSDIFSERYD

Query:  PSTLKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVANGNNAIGQKIQGQVREAEQSSVAKALPEYLKQKLRARGILKE
        PSTLKEHLLKIT+EHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCAFYGASKPGIV +GNN I QKIQGQVRE +QSS AK LPEYLKQKL+ARGILKE
Subjt:  PSTLKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVANGNNAIGQKIQGQVREAEQSSVAKALPEYLKQKLRARGILKE

Query:  DADHNNSTNSDAISNQLLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDAQLPSAASLPEDWMEALDQTTGLKYYYNMRTQVTQWEPPVA
        DA H+NS N+DAISNQ+LQGEKLPHGWVEAKDPGSG SYYYNES+GKSQWERP+ESS   QL SA SLPEDWMEA+DQ TG KYYYN RTQVTQWEPP A
Subjt:  DADHNNSTNSDAISNQLLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSESSSDAQLPSAASLPEDWMEALDQTTGLKYYYNMRTQVTQWEPPVA

Query:  SHQTTLPHSNDNVPGSWNNQTLEQSKCITCGSGMTLVQGSRYCNGCTSGVSTSSTNGKWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILGLPQCQY
        SHQ TL HSN   PGSWN+QT  QSKC+TCGSGMTLVQGSRYCN C SGVSTSSTNGKWQDQ S+ +KCMGCGGWGLGLVQAWGYCNHCTR LGLPQCQY
Subjt:  SHQTTLPHSNDNVPGSWNNQTLEQSKCITCGSGMTLVQGSRYCNGCTSGVSTSSTNGKWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILGLPQCQY

Query:  LSTNNISNQQKTENVKHSADPSIRKPATER------------------------------------------VVGLKGVQPRAADTTATGPLFQQRPYPS
        L T+NI+NQ KTEN+K+++DPSI+K A++R                                          VVGLKGVQPRAADTTATGPLFQQRPYPS
Subjt:  LSTNNISNQQKTENVKHSADPSIRKPATER------------------------------------------VVGLKGVQPRAADTTATGPLFQQRPYPS

Query:  PGAVLRKNAEIASQTKKGSSQYAPISKRGDGSDGLGDAD
        PGAVLRKNAEIASQTKKGSS YAPISKRGDGSDGLGDAD
Subjt:  PGAVLRKNAEIASQTKKGSSQYAPISKRGDGSDGLGDAD

SwissProt top hitse value%identityAlignment
A1YFA7 Polyglutamine-binding protein 19.8e-0785.29Show/hide
Query:  ADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKK
        ADTTA GPLFQQRPYPSPGAVLR NAE AS+TK+
Subjt:  ADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKK

A2T806 Polyglutamine-binding protein 19.8e-0785.29Show/hide
Query:  ADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKK
        ADTTA GPLFQQRPYPSPGAVLR NAE AS+TK+
Subjt:  ADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKK

O60828 Polyglutamine-binding protein 19.8e-0785.29Show/hide
Query:  ADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKK
        ADTTA GPLFQQRPYPSPGAVLR NAE AS+TK+
Subjt:  ADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKK

Q2HJC9 Polyglutamine-binding protein 19.8e-0785.29Show/hide
Query:  ADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKK
        ADTTA GPLFQQRPYPSPGAVLR NAE AS+TK+
Subjt:  ADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKK

Q91VJ5 Polyglutamine-binding protein 19.8e-0785.29Show/hide
Query:  ADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKK
        ADTTA GPLFQQRPYPSPGAVLR NAE AS+TK+
Subjt:  ADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKK

Arabidopsis top hitse value%identityAlignment
AT2G41020.1 WW domain-containing protein8.0e-8942.62Show/hide
Query:  QELATQNIIRSQRDS-VGADGLPVERSDIFSERYDPSTLKEHLLKITSEHRAEMAMKR-GKLNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVANGNNAI
        QE+ TQ II+ QR++     G     +DI  +R DP+ LKEHLLK T+ HRAE A KR G ++   EGN+++GNGYG+PGG A+ G S            
Subjt:  QELATQNIIRSQRDS-VGADGLPVERSDIFSERYDPSTLKEHLLKITSEHRAEMAMKR-GKLNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVANGNNAI

Query:  GQKIQGQVREAEQSSVAKALPEYLKQKLRARGILKEDAD--HNNSTNSDAIS-NQ------LLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSE
          ++ G   + E ++ +  LPEYLKQKL+ARGIL++ A    +N  ++ A+S N+            LP GWV+AKDP SG +YYYN+ +G  QWERP E
Subjt:  GQKIQGQVREAEQSSVAKALPEYLKQKLRARGILKEDAD--HNNSTNSDAIS-NQ------LLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSE

Query:  SSSDAQLPSAASLPEDWMEALDQTTGLKYYYNMRTQVTQWEPPVASHQTTLPHSNDNVPGSWNNQTLEQSKCITCGSGMTLVQGSRYCNGCTSGVSTSST
         S            E+W+E  D+ +G KY+YN RT V+QWEPP +  +    +SN                                     + V+ S+ 
Subjt:  SSSDAQLPSAASLPEDWMEALDQTTGLKYYYNMRTQVTQWEPPVASHQTTLPHSNDNVPGSWNNQTLEQSKCITCGSGMTLVQGSRYCNGCTSGVSTSST

Query:  NGKWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILGLPQCQYLS------TNNISNQQKTENVKHSADPSIRK------------------------
        NGK +   S+  +C GCGGWG+GLVQ WGYC HCTR+  LP+ Q+L       TN   + QK  N + S+ P ++K                        
Subjt:  NGKWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILGLPQCQYLS------TNNISNQQKTENVKHSADPSIRK------------------------

Query:  -PATERVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIA-SQTKKGSSQYAPISKRGDGSDGLGDAD
         P    VVGLKGVQPRAADTTA+GPLFQQRPYPSPGAVLR+NAE+A SQ KK +SQ+  I+KRGDGSDGLGDAD
Subjt:  -PATERVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIA-SQTKKGSSQYAPISKRGDGSDGLGDAD

AT2G41020.2 WW domain-containing protein4.6e-6038.11Show/hide
Query:  QELATQNIIRSQRDS-VGADGLPVERSDIFSERYDPSTLKEHLLKITSEHRAEMAMKR-GKLNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVANGNNAI
        QE+ TQ II+ QR++     G     +DI  +R DP+ LKEHLLK T+ HRAE A KR G ++   EGN+++GNGYG+PGG A+ G S            
Subjt:  QELATQNIIRSQRDS-VGADGLPVERSDIFSERYDPSTLKEHLLKITSEHRAEMAMKR-GKLNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVANGNNAI

Query:  GQKIQGQVREAEQSSVAKALPEYLKQKLRARGILKEDAD--HNNSTNSDAIS-NQ------LLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSE
          ++ G   + E ++ +  LPEYLKQKL+ARGIL++ A    +N  ++ A+S N+            LP GWV+AKDP SG +YYYN+ +G  QWERP E
Subjt:  GQKIQGQVREAEQSSVAKALPEYLKQKLRARGILKEDAD--HNNSTNSDAIS-NQ------LLQGEKLPHGWVEAKDPGSGVSYYYNESSGKSQWERPSE

Query:  SSSDAQLPSAASLPEDWMEALDQTTGLKYYYNMRTQVTQWEPPVASHQTTLPHSNDNVPGSWNNQTLEQSKCITCGSGMTLVQGSRYCNGCTSGVSTSST
         S            E+W+E  D+ +G KY+YN RT V+QWEPP +  +    +SN                                     + V+ S+ 
Subjt:  SSSDAQLPSAASLPEDWMEALDQTTGLKYYYNMRTQVTQWEPPVASHQTTLPHSNDNVPGSWNNQTLEQSKCITCGSGMTLVQGSRYCNGCTSGVSTSST

Query:  NGKWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILGLPQCQYLS------TNNISNQQKTENVKHSA
        NGK +   S+  +C GCGGWG+GLVQ WGYC HCTR+  LP+ Q+L       TN   + QK  N ++++
Subjt:  NGKWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILGLPQCQYLS------TNNISNQQKTENVKHSA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGACTTCTACTGCAGCAATTGCAGGTTCAGGAGACTCGTCCAATACTATAGTTGGTTCTAGTGTTGAAGATAAATCCCTCAAGGAATCAGCTGCTGCTCAATCTCA
AGATTATCATGCCCAAAATGAAGTGCAAGAACTTGAGAAGTTTGGCAAACAAATTCATCCTTGCCAACCGGGAGAAGCTCAGGGTTCTCAAGAGACAAACCGAAGTTTTG
GGAATGATCAGAACATTGTTCCTCATGATGGTGTGTTTAACATTGCTGTCTCATCTTCCAGCAAATTCAGGTCACATGTTGACGATACCAGAGATATTGATAGTGCTGTT
CAGGATGCTGTGTTGAGGGAACAGGAAACTGCCTTGTGGAAAGTTGTGATTGCCTACGAAGAAACTGCCTTGTGGAGAGCTGTGATTGCAAGCATATACCACTGCGAACC
TCACGGGTGGATGACCTGTCCATCTAAAGGAGCCATCAAGGCTAGACCGTGGGACCTTGGTTTCAGAAGGGACATTAAGGACAACAAATTTGATAGCTGGGTTGGCCTTG
GCTCAATGATTGACACTATTAGATTGAGGGATGGCATCGATAAAGACTGTTGCTCCTTGGAAAATTCAGAGTCCCTTTCCTTGTTGACTAAGATGACCCGTATCGAGCTC
AACTTATATGGTGAAACGGAGGCTGGGTGGTTGTTAAGGATCTCTATTTGCATTTTTAGAGGCAGAATCCCTTTTTTTACATGGGAAACAAGAGTCTCTTTCAGCTTCCC
CATGGTGCCTCAAGAGCTTGCTACCCAAAATATCATTCGTAGCCAAAGAGACTCTGTGGGTGCAGATGGACTTCCTGTTGAGCGATCGGATATCTTTTCAGAACGTTATG
ACCCGAGTACACTTAAAGAGCATCTTTTGAAGATTACTTCTGAACATCGTGCAGAAATGGCTATGAAAAGGGGAAAGCTGAATCTGCCAGAAGAAGGGAACTTGGAAATT
GGAAATGGATATGGCGTACCTGGTGGATGTGCTTTCTACGGTGCTTCAAAGCCTGGAATTGTTGCCAATGGAAATAATGCGATTGGCCAGAAAATCCAGGGACAGGTTAG
GGAAGCAGAGCAAAGTTCTGTTGCCAAAGCATTGCCCGAGTACCTCAAGCAGAAGCTAAGAGCTAGGGGTATTCTTAAAGAAGATGCGGACCATAACAATTCTACAAATT
CTGATGCTATTTCAAATCAACTGTTGCAAGGAGAAAAGCTGCCTCATGGATGGGTTGAGGCTAAAGACCCTGGCAGTGGTGTTTCATATTATTATAATGAAAGTAGTGGG
AAGAGTCAATGGGAAAGGCCCTCTGAATCTTCTTCTGACGCGCAACTTCCATCAGCTGCATCCCTTCCAGAAGATTGGATGGAGGCACTCGATCAAACAACAGGCCTTAA
ATACTACTACAATATGAGAACCCAGGTAACCCAGTGGGAGCCGCCTGTTGCATCTCATCAGACAACTTTGCCACACTCGAATGATAATGTTCCTGGGTCTTGGAACAACC
AAACTTTGGAGCAAAGTAAATGCATCACATGTGGAAGTGGAATGACCCTCGTGCAGGGTTCAAGATACTGCAACGGTTGTACAAGTGGGGTTTCTACAAGTTCAACCAAT
GGGAAGTGGCAGGATCAATCGTCTGAGCAAAACAAATGCATGGGTTGCGGTGGCTGGGGACTAGGCCTTGTGCAAGCTTGGGGTTACTGTAATCATTGTACACGAATTCT
CGGCCTCCCCCAGTGTCAGTACTTGTCAACCAACAATATTAGTAATCAGCAGAAGACAGAGAATGTAAAGCATAGCGCTGATCCCTCCATTAGAAAACCTGCGACAGAGA
GGGTTGTGGGTCTAAAAGGTGTGCAACCTCGAGCAGCAGATACTACTGCTACAGGTCCTCTCTTTCAACAGCGGCCATATCCATCACCTGGAGCTGTTCTGAGGAAGAAT
GCTGAAATTGCTTCACAGACTAAGAAGGGAAGCTCTCAGTATGCACCTATTTCCAAGAGAGGAGATGGAAGTGATGGCCTTGGTGATGCTGACTGA
mRNA sequenceShow/hide mRNA sequence
GGCGAGGGCGGCGGTCTCGGTTGGTTGGTCGGTTTCCCGGATTGTTTTGATCACCCCTCAGAAATACAGCCAGGTTTTTCCAATTCGACGCTCTGATACATCCTTCATTT
CGACTCAAAGCGTCTCTTCGCATCGCTGAATCTGCCATATTCAGTTTTGTCTCAAATTATTACTACCAGTTGGTCAATTACGGCAAAACTTTCGAAATTGAAAACTTCCA
GGGATGCCGACTTCTACTGCAGCAATTGCAGGTTCAGGAGACTCGTCCAATACTATAGTTGGTTCTAGTGTTGAAGATAAATCCCTCAAGGAATCAGCTGCTGCTCAATC
TCAAGATTATCATGCCCAAAATGAAGTGCAAGAACTTGAGAAGTTTGGCAAACAAATTCATCCTTGCCAACCGGGAGAAGCTCAGGGTTCTCAAGAGACAAACCGAAGTT
TTGGGAATGATCAGAACATTGTTCCTCATGATGGTGTGTTTAACATTGCTGTCTCATCTTCCAGCAAATTCAGGTCACATGTTGACGATACCAGAGATATTGATAGTGCT
GTTCAGGATGCTGTGTTGAGGGAACAGGAAACTGCCTTGTGGAAAGTTGTGATTGCCTACGAAGAAACTGCCTTGTGGAGAGCTGTGATTGCAAGCATATACCACTGCGA
ACCTCACGGGTGGATGACCTGTCCATCTAAAGGAGCCATCAAGGCTAGACCGTGGGACCTTGGTTTCAGAAGGGACATTAAGGACAACAAATTTGATAGCTGGGTTGGCC
TTGGCTCAATGATTGACACTATTAGATTGAGGGATGGCATCGATAAAGACTGTTGCTCCTTGGAAAATTCAGAGTCCCTTTCCTTGTTGACTAAGATGACCCGTATCGAG
CTCAACTTATATGGTGAAACGGAGGCTGGGTGGTTGTTAAGGATCTCTATTTGCATTTTTAGAGGCAGAATCCCTTTTTTTACATGGGAAACAAGAGTCTCTTTCAGCTT
CCCCATGGTGCCTCAAGAGCTTGCTACCCAAAATATCATTCGTAGCCAAAGAGACTCTGTGGGTGCAGATGGACTTCCTGTTGAGCGATCGGATATCTTTTCAGAACGTT
ATGACCCGAGTACACTTAAAGAGCATCTTTTGAAGATTACTTCTGAACATCGTGCAGAAATGGCTATGAAAAGGGGAAAGCTGAATCTGCCAGAAGAAGGGAACTTGGAA
ATTGGAAATGGATATGGCGTACCTGGTGGATGTGCTTTCTACGGTGCTTCAAAGCCTGGAATTGTTGCCAATGGAAATAATGCGATTGGCCAGAAAATCCAGGGACAGGT
TAGGGAAGCAGAGCAAAGTTCTGTTGCCAAAGCATTGCCCGAGTACCTCAAGCAGAAGCTAAGAGCTAGGGGTATTCTTAAAGAAGATGCGGACCATAACAATTCTACAA
ATTCTGATGCTATTTCAAATCAACTGTTGCAAGGAGAAAAGCTGCCTCATGGATGGGTTGAGGCTAAAGACCCTGGCAGTGGTGTTTCATATTATTATAATGAAAGTAGT
GGGAAGAGTCAATGGGAAAGGCCCTCTGAATCTTCTTCTGACGCGCAACTTCCATCAGCTGCATCCCTTCCAGAAGATTGGATGGAGGCACTCGATCAAACAACAGGCCT
TAAATACTACTACAATATGAGAACCCAGGTAACCCAGTGGGAGCCGCCTGTTGCATCTCATCAGACAACTTTGCCACACTCGAATGATAATGTTCCTGGGTCTTGGAACA
ACCAAACTTTGGAGCAAAGTAAATGCATCACATGTGGAAGTGGAATGACCCTCGTGCAGGGTTCAAGATACTGCAACGGTTGTACAAGTGGGGTTTCTACAAGTTCAACC
AATGGGAAGTGGCAGGATCAATCGTCTGAGCAAAACAAATGCATGGGTTGCGGTGGCTGGGGACTAGGCCTTGTGCAAGCTTGGGGTTACTGTAATCATTGTACACGAAT
TCTCGGCCTCCCCCAGTGTCAGTACTTGTCAACCAACAATATTAGTAATCAGCAGAAGACAGAGAATGTAAAGCATAGCGCTGATCCCTCCATTAGAAAACCTGCGACAG
AGAGGGTTGTGGGTCTAAAAGGTGTGCAACCTCGAGCAGCAGATACTACTGCTACAGGTCCTCTCTTTCAACAGCGGCCATATCCATCACCTGGAGCTGTTCTGAGGAAG
AATGCTGAAATTGCTTCACAGACTAAGAAGGGAAGCTCTCAGTATGCACCTATTTCCAAGAGAGGAGATGGAAGTGATGGCCTTGGTGATGCTGACTGATCTCTTCTACT
TTTTTGAGCTTCGACTATACTGCAAGCTACTTCTGTAGCGTAGAAATGCACTTTTCACTTGAAATGCTCTGACCTTATAGCCTTACCTGAAGTTATGGATCAAATGCACC
TACGTTATGGCCACATATTCAACTGGGAGTTATTGAGGTCTTCTATTTATGAGTCTCTTTGTTTCTAGACGGCTCCATCTTGTATATATACTTCTATATGGAAGACTTGA
CTCGTGGACAACATTCCACAACATTTCAACCTTCCCATGTAGGGAAAGATGTGGTTACTGGACAGGTATAGAAAATTGGAACCTATTGTACCATTTTTTCTTTTGTTTTT
TTAATCCTTCACTTCTCTCATCTGAGTATTTAGTTCAAACGCTAAAGTTATGTAACTTTCCGTCGCTCATACGATTTTAACTCAACTGTATCTGTAAGTTCTAAAGAGTT
GCTCATGTCAAGTATGTTAGTTTGTCTATCTCAACCTCGAAGTTTGTTAATCTTTCAATTATGTAACCTCAAAGCTTCCATTTCATGCCACACTTTTACTCTTATAAATT
ATTATGATTTAAC
Protein sequenceShow/hide protein sequence
MPTSTAAIAGSGDSSNTIVGSSVEDKSLKESAAAQSQDYHAQNEVQELEKFGKQIHPCQPGEAQGSQETNRSFGNDQNIVPHDGVFNIAVSSSSKFRSHVDDTRDIDSAV
QDAVLREQETALWKVVIAYEETALWRAVIASIYHCEPHGWMTCPSKGAIKARPWDLGFRRDIKDNKFDSWVGLGSMIDTIRLRDGIDKDCCSLENSESLSLLTKMTRIEL
NLYGETEAGWLLRISICIFRGRIPFFTWETRVSFSFPMVPQELATQNIIRSQRDSVGADGLPVERSDIFSERYDPSTLKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEI
GNGYGVPGGCAFYGASKPGIVANGNNAIGQKIQGQVREAEQSSVAKALPEYLKQKLRARGILKEDADHNNSTNSDAISNQLLQGEKLPHGWVEAKDPGSGVSYYYNESSG
KSQWERPSESSSDAQLPSAASLPEDWMEALDQTTGLKYYYNMRTQVTQWEPPVASHQTTLPHSNDNVPGSWNNQTLEQSKCITCGSGMTLVQGSRYCNGCTSGVSTSSTN
GKWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILGLPQCQYLSTNNISNQQKTENVKHSADPSIRKPATERVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKN
AEIASQTKKGSSQYAPISKRGDGSDGLGDAD