; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS023813 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS023813
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionPolyglutamine tract-binding protein 1
Genome locationscaffold207:884079..909537
RNA-Seq ExpressionMS023813
SyntenyMS023813
Gene Ontology termsGO:0005622 - intracellular (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR001202 - WW domain
IPR036020 - WW domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6580903.1 Polyglutamine-binding protein 1, partial [Cucurbita argyrosperma subsp. sororia]1.3e-26576.06Show/hide
Query:  MPTSNAAAAVSGDSSITTIGSSVEDRPLKESGAAQSQSYAQNEVQELEKSGKQNSSSQPGEAQSSVAVSSDQDTFVEQQLGKSTATVDELLVQGNEKFQE
        MPTS AA A  GDSS TTIGSSVED  LKESG+AQSQSYAQNEVQELEK G Q S  QPGE  SSV +SSD                           QE
Subjt:  MPTSNAAAAVSGDSSITTIGSSVEDRPLKESGAAQSQSYAQNEVQELEKSGKQNSSSQPGEAQSSVAVSSDQDTFVEQQLGKSTATVDELLVQGNEKFQE

Query:  TEPSRGNDQNNVPHDGVFNIACSSSSKFGSHVGDTRDIDSAVQDAVLREQELATQNIIRSQRESLGADGPPSERSDIFSERYDPSTLKEHLLKITSDHRA
          PS GNDQN V HDGVFNIA SSSSKFGSHV DTRDID+AV+DAVLREQELATQNIIRS+R+S+ ADG P ERSDIFSERYDPS LKEHLLKITS+HRA
Subjt:  TEPSRGNDQNNVPHDGVFNIACSSSSKFGSHVGDTRDIDSAVQDAVLREQELATQNIIRSQRESLGADGPPSERSDIFSERYDPSTLKEHLLKITSDHRA

Query:  EMAMKRGKSNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVTPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSVRTDMPNFPI
        EMAMKRGK NLPEEGNLEIGNGYGVPGGCAFYGASKPGIVT GNN I  KIQGQV EAEQ+ + KELPEYLKQKL+ARGILKED +  NS          
Subjt:  EMAMKRGKSNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVTPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSVRTDMPNFPI

Query:  LCPSFSPTNSDAISNQPVQGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHPVS
                NSDAISNQ +QG+KLP GWVEAKDP SGV YYYNESTGKSQWERP++SSF LQL SAVSLPEDWMEA+D+TTG KYYYN RT VTQWE PV+
Subjt:  LCPSFSPTNSDAISNQPVQGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHPVS

Query:  SHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTLMQGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQAWGYCNHCTRILRLPQCEY
        SHQ TL HSN + PG WNNQT  Q+KC+ CG GMTL+QGSRYCN      STSSTNG WQ+Q  + +KCMGCGGWGLGLVQAWGYCNHCTR L LPQC+Y
Subjt:  SHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTLMQGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQAWGYCNHCTRILRLPQCEY

Query:  LPTSSVSNQQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPS
        LPTS++ NQQKTE+I ++AD SIKKSA DRSK KPP+GKGGKRESRKRS+SEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPS
Subjt:  LPTSSVSNQQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPS

Query:  PGAVLRKNAEIASQTKKGSSHYAPISKKGDGSDGLGDAD
        PGAVLRKNAEIASQTKKGSSHYAPISK+GDGSDGLGDAD
Subjt:  PGAVLRKNAEIASQTKKGSSHYAPISKKGDGSDGLGDAD

XP_004136655.1 uncharacterized protein LOC101203374 [Cucumis sativus]1.0e-26575.62Show/hide
Query:  MPTSNAAAAVSGDSSITTIGSSVEDRPLKESGAAQSQSYAQNEVQELEKSGKQNSSSQPGEAQSSVAVSSDQDTFVEQQLGKSTATVDELLVQGNEKFQE
        MPTS A  A SGDSS T IGSS ED+ LKES AAQSQ  AQNEVQELEKS KQ    QPGEAQ +VA+ +D                           QE
Subjt:  MPTSNAAAAVSGDSSITTIGSSVEDRPLKESGAAQSQSYAQNEVQELEKSGKQNSSSQPGEAQSSVAVSSDQDTFVEQQLGKSTATVDELLVQGNEKFQE

Query:  TEPSRGNDQNNVPHDGVF-NIACSSSSKFGSHVGDTRDIDSAVQDAVLREQELATQNIIRSQRESLGADGPPSERSDIFSERYDPSTLKEHLLKITSDHR
        T  S GNDQN VPH G F NIA SSSS F S+V D RDID AVQDAVLREQELATQNIIRSQR+S+GADG P ERSDIFSERYDPS+LKEHLLKITS+HR
Subjt:  TEPSRGNDQNNVPHDGVF-NIACSSSSKFGSHVGDTRDIDSAVQDAVLREQELATQNIIRSQRESLGADGPPSERSDIFSERYDPSTLKEHLLKITSDHR

Query:  AEMAMKRGKSNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVTPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSVRTDMPNFP
        AEMA+KRGK NLPEEGNLEIGNGYGVPGGCAFYGASKPGIV  GNN  G KIQGQ+ EAEQ+SA+K LPEYLKQKLRARGILKED +  NSVR D     
Subjt:  AEMAMKRGKSNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVTPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSVRTDMPNFP

Query:  ILCPSFSPTNSDAISNQPVQGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHPV
                TNSDA+SN  +QG+KLP GWVEAKDP SGV YYYNES+GKSQWERPS+ S + QL SAVSLPEDWMEA+D+T+G+KYYYN+RTHVTQWE PV
Subjt:  ILCPSFSPTNSDAISNQPVQGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHPV

Query:  SSHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTLMQGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQAWGYCNHCTRILRLPQCE
        +SHQ TLTHSN   PG WN+QT EQ+KCI CG GMTL+QGSRYCNS T   STSSTNG WQ+Q  E NKCMGCGGWGLGLVQAWGYC HCTRIL LPQC+
Subjt:  SSHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTLMQGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQAWGYCNHCTRILRLPQCE

Query:  YLPTSSVSNQQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYP
        YLPT+++SNQQK E++ HSAD SIKKS  DRSKWKPP+GKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYP
Subjt:  YLPTSSVSNQQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYP

Query:  SPGAVLRKNAEIASQTKKGSSHYAPISKKGDGSDGLGDAD
        SPGAVLRKNAEIASQTKKGSSHYAPISK+GDGSDGLGDAD
Subjt:  SPGAVLRKNAEIASQTKKGSSHYAPISKKGDGSDGLGDAD

XP_022154433.1 uncharacterized protein LOC111021704 [Momordica charantia]0.0e+0096.1Show/hide
Query:  MPTSNAAAAVSGDSSITTIGSSVEDRPLKESGAAQSQSYAQNEVQELEKSGKQNSSSQPGEAQSSVAVSSDQDTFVEQQLGKSTATVDELLVQGNEKFQE
        MPTSNAAAAVSGDS ITTIGSSVEDRPLKESGAAQSQSYAQNEVQEL KSGKQ+SSSQPGEAQSSVAVSSDQDTFVEQQLGKSTATVDELLVQGNEKFQE
Subjt:  MPTSNAAAAVSGDSSITTIGSSVEDRPLKESGAAQSQSYAQNEVQELEKSGKQNSSSQPGEAQSSVAVSSDQDTFVEQQLGKSTATVDELLVQGNEKFQE

Query:  TEPSRGNDQNNVPHDGVFNIACSSSSKFGSHVGDTRDIDSAVQDAVLREQELATQNIIRSQRESLGADGPPSERSDIFSERYDPSTLKEHLLKITSDHRA
        TEPSR NDQNNVPHDGVF IACSSSSKFGSHVGDTRDIDSAVQDAVLREQELATQNIIRSQRESLGADGPPSERSDIFSERYDPSTLKEHLLKITSDHRA
Subjt:  TEPSRGNDQNNVPHDGVFNIACSSSSKFGSHVGDTRDIDSAVQDAVLREQELATQNIIRSQRESLGADGPPSERSDIFSERYDPSTLKEHLLKITSDHRA

Query:  EMAMKRGKSNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVTPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSVRTDMPNFPI
        EMAMKRGKSNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVTPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNS          
Subjt:  EMAMKRGKSNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVTPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSVRTDMPNFPI

Query:  LCPSFSPTNSDAISNQPVQGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHPVS
               TNSDAISNQPVQGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHPVS
Subjt:  LCPSFSPTNSDAISNQPVQGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHPVS

Query:  SHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTLMQGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQAWGYCNHCTRILRLPQCEY
        SHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTLMQGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQ+WGYCNHCTRILRLPQCEY
Subjt:  SHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTLMQGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQAWGYCNHCTRILRLPQCEY

Query:  LPTSSVSN--QQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPY
        LPTSSVSN  QQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPY
Subjt:  LPTSSVSN--QQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPY

Query:  PSPGAVLRKNAEIASQTKKGSSHYAPISKKGDGSDGLGDAD
        PSPGAVLRKNAEIASQTKKGSSHYAPISKKGDGSDGLGDAD
Subjt:  PSPGAVLRKNAEIASQTKKGSSHYAPISKKGDGSDGLGDAD

XP_022935213.1 uncharacterized protein LOC111442162 [Cucurbita moschata]8.7e-26575.59Show/hide
Query:  MPTSNAAAAVSGDSSITTIGSSVEDRPLKESGAAQSQSYAQNEVQELEKSGKQNSSSQPGEAQSSVAVSSDQDTFVEQQLGKSTATVDELLVQGNEKFQE
        MPTS AA A  GDSS TTIGSSVED  LKESG+AQSQSYAQNEVQELEK G Q S  QPGE +SSV +SSD                           QE
Subjt:  MPTSNAAAAVSGDSSITTIGSSVEDRPLKESGAAQSQSYAQNEVQELEKSGKQNSSSQPGEAQSSVAVSSDQDTFVEQQLGKSTATVDELLVQGNEKFQE

Query:  TEPSRGNDQNNVPHDGVFNIACSSSSKFGSHVGDTRDIDSAVQDAVLREQELATQNIIRSQRESLGADGPPSERSDIFSERYDPSTLKEHLLKITSDHRA
          PS GNDQN VPHDGVFNIA SSSSKFGSHV DTRDID+AV+DAVLREQELATQNIIRS+R+S+ ADG P ERSDIFSERYDPS LKEHLLKITS+HRA
Subjt:  TEPSRGNDQNNVPHDGVFNIACSSSSKFGSHVGDTRDIDSAVQDAVLREQELATQNIIRSQRESLGADGPPSERSDIFSERYDPSTLKEHLLKITSDHRA

Query:  EMAMKRGKSNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVTPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSVRTDMPNFPI
        EMAMKRGK NLPEEGNLEIGNGYGVPGGCAFYGASKPGIVT GNN I  KIQGQV EAEQ+ + KELPEYLKQKL+ARGILKED +  NS          
Subjt:  EMAMKRGKSNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVTPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSVRTDMPNFPI

Query:  LCPSFSPTNSDAISNQPVQGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHPVS
                NSDAISNQ +QG+KLP GWVEAKDP SGV YYYNESTGKSQWERP++SSF LQL SAVSLPEDWMEA+D+TTG +YYYN RT VTQWE PV+
Subjt:  LCPSFSPTNSDAISNQPVQGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHPVS

Query:  SHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTLMQGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQAWGYCNHCTRILRLPQCEY
        SHQ TL HS  + PG WN+QT  Q+KC+ CG GMTL+QG+RYCN      STSSTNG WQ+Q  + +KCMGCGGWGLGLVQAWGYCNHCTR L LPQC+Y
Subjt:  SHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTLMQGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQAWGYCNHCTRILRLPQCEY

Query:  LPTSSVSNQQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPS
        LPTS++ NQQKTE+I ++AD SIKKSA DRSK KPP+GKGGKRESRKRS+SEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPS
Subjt:  LPTSSVSNQQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPS

Query:  PGAVLRKNAEIASQTKKGSSHYAPISKKGDGSDGLGDAD
        PGAVLRKNAEIASQTKKGSSHYAPISK+GDGSDGLGDAD
Subjt:  PGAVLRKNAEIASQTKKGSSHYAPISKKGDGSDGLGDAD

XP_022983732.1 uncharacterized protein LOC111482260 [Cucurbita maxima]2.5e-26475.27Show/hide
Query:  MPTSNAAAAVSGDSSITTIGSSVEDRPLKESGAAQSQSYAQNEVQELEKSGKQNSSSQPGEAQSSVAVSSDQDTFVEQQLGKSTATVDELLVQGNEKFQE
        MPTS AA A SGDSS TTIGSSVED  LKESG+AQSQSYAQNEVQELEK G Q S  QPGE  SSV + SD                           QE
Subjt:  MPTSNAAAAVSGDSSITTIGSSVEDRPLKESGAAQSQSYAQNEVQELEKSGKQNSSSQPGEAQSSVAVSSDQDTFVEQQLGKSTATVDELLVQGNEKFQE

Query:  TEPSRGNDQNNVPHDGVFNIACSSSSKFGSHVGDTRDIDSAVQDAVLREQELATQNIIRSQRESLGADGPPSERSDIFSERYDPSTLKEHLLKITSDHRA
          PS GNDQN VPH GVFNIA SSSSKFGSHV DTRDID+AV+DAVLREQELATQNIIRSQR+S+GADG P ERSDIFSERYDPSTLKEHLLKIT++HRA
Subjt:  TEPSRGNDQNNVPHDGVFNIACSSSSKFGSHVGDTRDIDSAVQDAVLREQELATQNIIRSQRESLGADGPPSERSDIFSERYDPSTLKEHLLKITSDHRA

Query:  EMAMKRGKSNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVTPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSVRTDMPNFPI
        EMAMKRGK NLPEEGNLEIGNGYGVPGGCAFYGASKPGIVT GNN I  KIQGQV E +Q+S+ KELPEYLKQKL+ARGILKED +  NS          
Subjt:  EMAMKRGKSNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVTPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSVRTDMPNFPI

Query:  LCPSFSPTNSDAISNQPVQGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHPVS
                N+DAISNQ +QG+KLP GWVEAKDP SG  YYYNESTGKSQWERP++SSF LQL SAVSLPEDWMEA+D+ TG KYYYN RT VTQWE P +
Subjt:  LCPSFSPTNSDAISNQPVQGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHPVS

Query:  SHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTLMQGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQAWGYCNHCTRILRLPQCEY
        SHQ TL HSN   PG WN+QT  Q+KC+ CG GMTL+QGSRYCN      STSSTNG WQ+Q  +L+KCMGCGGWGLGLVQAWGYCNHCTR L LPQC+Y
Subjt:  SHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTLMQGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQAWGYCNHCTRILRLPQCEY

Query:  LPTSSVSNQQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPS
        LPTS+++NQ KTE+I +++D SIKKSA DRSK KPP+GKGGKRESRKRS+SEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPS
Subjt:  LPTSSVSNQQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPS

Query:  PGAVLRKNAEIASQTKKGSSHYAPISKKGDGSDGLGDAD
        PGAVLRKNAEIASQTKKGSSHYAPISK+GDGSDGLGDAD
Subjt:  PGAVLRKNAEIASQTKKGSSHYAPISKKGDGSDGLGDAD

TrEMBL top hitse value%identityAlignment
A0A0A0LFL2 Polyglutamine tract-binding protein 15.0e-26675.62Show/hide
Query:  MPTSNAAAAVSGDSSITTIGSSVEDRPLKESGAAQSQSYAQNEVQELEKSGKQNSSSQPGEAQSSVAVSSDQDTFVEQQLGKSTATVDELLVQGNEKFQE
        MPTS A  A SGDSS T IGSS ED+ LKES AAQSQ  AQNEVQELEKS KQ    QPGEAQ +VA+ +D                           QE
Subjt:  MPTSNAAAAVSGDSSITTIGSSVEDRPLKESGAAQSQSYAQNEVQELEKSGKQNSSSQPGEAQSSVAVSSDQDTFVEQQLGKSTATVDELLVQGNEKFQE

Query:  TEPSRGNDQNNVPHDGVF-NIACSSSSKFGSHVGDTRDIDSAVQDAVLREQELATQNIIRSQRESLGADGPPSERSDIFSERYDPSTLKEHLLKITSDHR
        T  S GNDQN VPH G F NIA SSSS F S+V D RDID AVQDAVLREQELATQNIIRSQR+S+GADG P ERSDIFSERYDPS+LKEHLLKITS+HR
Subjt:  TEPSRGNDQNNVPHDGVF-NIACSSSSKFGSHVGDTRDIDSAVQDAVLREQELATQNIIRSQRESLGADGPPSERSDIFSERYDPSTLKEHLLKITSDHR

Query:  AEMAMKRGKSNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVTPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSVRTDMPNFP
        AEMA+KRGK NLPEEGNLEIGNGYGVPGGCAFYGASKPGIV  GNN  G KIQGQ+ EAEQ+SA+K LPEYLKQKLRARGILKED +  NSVR D     
Subjt:  AEMAMKRGKSNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVTPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSVRTDMPNFP

Query:  ILCPSFSPTNSDAISNQPVQGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHPV
                TNSDA+SN  +QG+KLP GWVEAKDP SGV YYYNES+GKSQWERPS+ S + QL SAVSLPEDWMEA+D+T+G+KYYYN+RTHVTQWE PV
Subjt:  ILCPSFSPTNSDAISNQPVQGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHPV

Query:  SSHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTLMQGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQAWGYCNHCTRILRLPQCE
        +SHQ TLTHSN   PG WN+QT EQ+KCI CG GMTL+QGSRYCNS T   STSSTNG WQ+Q  E NKCMGCGGWGLGLVQAWGYC HCTRIL LPQC+
Subjt:  SSHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTLMQGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQAWGYCNHCTRILRLPQCE

Query:  YLPTSSVSNQQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYP
        YLPT+++SNQQK E++ HSAD SIKKS  DRSKWKPP+GKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYP
Subjt:  YLPTSSVSNQQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYP

Query:  SPGAVLRKNAEIASQTKKGSSHYAPISKKGDGSDGLGDAD
        SPGAVLRKNAEIASQTKKGSSHYAPISK+GDGSDGLGDAD
Subjt:  SPGAVLRKNAEIASQTKKGSSHYAPISKKGDGSDGLGDAD

A0A5D3DPP7 Polyglutamine tract-binding protein 13.5e-26475.47Show/hide
Query:  MPTSNAAAAVSGDSSITTIGSSVEDRPLKESGAAQSQSYAQNEVQELEKSGKQNSSSQPGEAQSSVAVSSDQDTFVEQQLGKSTATVDELLVQGNEKFQE
        MPTS  A A SGDSS T IGSS ED+ LKES A      AQNEVQELEK  KQ    QPGEAQ SVA+S+D                           QE
Subjt:  MPTSNAAAAVSGDSSITTIGSSVEDRPLKESGAAQSQSYAQNEVQELEKSGKQNSSSQPGEAQSSVAVSSDQDTFVEQQLGKSTATVDELLVQGNEKFQE

Query:  TEPSRGNDQNNVPHDGVF-NIACSSSSKFGSHVGDTRDIDSAVQDAVLREQELATQNIIRSQRESLGADGPPSERSDIFSERYDPSTLKEHLLKITSDHR
        T  S GNDQN VPH+GVF NIA S+SS F S+V D RDI+ AVQDAVLREQELATQNIIRSQRES+GADG P+E+SDIFSERYDPST+KEHLLKITS+HR
Subjt:  TEPSRGNDQNNVPHDGVF-NIACSSSSKFGSHVGDTRDIDSAVQDAVLREQELATQNIIRSQRESLGADGPPSERSDIFSERYDPSTLKEHLLKITSDHR

Query:  AEMAMKRGKSNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVTPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSVRTDMPNFP
        AEMAMKRGK NLPEEGNLEIGNGYGVPGGCA YGASKPGIV  GNN  G KIQGQV E EQ+SA K LPEYLKQKLRARGILKED +  N          
Subjt:  AEMAMKRGKSNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVTPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSVRTDMPNFP

Query:  ILCPSFSPTNSDAISNQPVQGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHPV
               PTNSDA+SN  + G+KLP GWVEAKDP SGV YYYNES+GKSQWERPS+ S D QL SAVSLPEDWMEA+D+T+GLKYYYN+RTH+TQWE PV
Subjt:  ILCPSFSPTNSDAISNQPVQGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHPV

Query:  SSHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTLMQGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQAWGYCNHCTRILRLPQCE
        +SHQ TLTHSN  VPG WN+QT EQ+KCI CG GMTL+QGSRYCN+ T   STSSTNG WQ+QS E NKCMGCGGWGLGLVQAWGYCNHCTRIL LPQC+
Subjt:  SSHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTLMQGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQAWGYCNHCTRILRLPQCE

Query:  YLPTSSVSNQQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYP
        YLPT+++SNQQKTE+I HSAD SIKKSA DRSKWKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYP
Subjt:  YLPTSSVSNQQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYP

Query:  SPGAVLRKNAEIASQTKKGSSHYAPISKKGDGSDGLGDAD
        SPGAVLRKNAEIASQTKKGSSHYAPISK+GDGSDGLGDAD
Subjt:  SPGAVLRKNAEIASQTKKGSSHYAPISKKGDGSDGLGDAD

A0A6J1DKA8 Polyglutamine tract-binding protein 10.0e+0096.1Show/hide
Query:  MPTSNAAAAVSGDSSITTIGSSVEDRPLKESGAAQSQSYAQNEVQELEKSGKQNSSSQPGEAQSSVAVSSDQDTFVEQQLGKSTATVDELLVQGNEKFQE
        MPTSNAAAAVSGDS ITTIGSSVEDRPLKESGAAQSQSYAQNEVQEL KSGKQ+SSSQPGEAQSSVAVSSDQDTFVEQQLGKSTATVDELLVQGNEKFQE
Subjt:  MPTSNAAAAVSGDSSITTIGSSVEDRPLKESGAAQSQSYAQNEVQELEKSGKQNSSSQPGEAQSSVAVSSDQDTFVEQQLGKSTATVDELLVQGNEKFQE

Query:  TEPSRGNDQNNVPHDGVFNIACSSSSKFGSHVGDTRDIDSAVQDAVLREQELATQNIIRSQRESLGADGPPSERSDIFSERYDPSTLKEHLLKITSDHRA
        TEPSR NDQNNVPHDGVF IACSSSSKFGSHVGDTRDIDSAVQDAVLREQELATQNIIRSQRESLGADGPPSERSDIFSERYDPSTLKEHLLKITSDHRA
Subjt:  TEPSRGNDQNNVPHDGVFNIACSSSSKFGSHVGDTRDIDSAVQDAVLREQELATQNIIRSQRESLGADGPPSERSDIFSERYDPSTLKEHLLKITSDHRA

Query:  EMAMKRGKSNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVTPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSVRTDMPNFPI
        EMAMKRGKSNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVTPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNS          
Subjt:  EMAMKRGKSNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVTPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSVRTDMPNFPI

Query:  LCPSFSPTNSDAISNQPVQGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHPVS
               TNSDAISNQPVQGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHPVS
Subjt:  LCPSFSPTNSDAISNQPVQGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHPVS

Query:  SHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTLMQGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQAWGYCNHCTRILRLPQCEY
        SHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTLMQGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQ+WGYCNHCTRILRLPQCEY
Subjt:  SHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTLMQGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQAWGYCNHCTRILRLPQCEY

Query:  LPTSSVSN--QQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPY
        LPTSSVSN  QQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPY
Subjt:  LPTSSVSN--QQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPY

Query:  PSPGAVLRKNAEIASQTKKGSSHYAPISKKGDGSDGLGDAD
        PSPGAVLRKNAEIASQTKKGSSHYAPISKKGDGSDGLGDAD
Subjt:  PSPGAVLRKNAEIASQTKKGSSHYAPISKKGDGSDGLGDAD

A0A6J1F9X9 Polyglutamine tract-binding protein 14.2e-26575.59Show/hide
Query:  MPTSNAAAAVSGDSSITTIGSSVEDRPLKESGAAQSQSYAQNEVQELEKSGKQNSSSQPGEAQSSVAVSSDQDTFVEQQLGKSTATVDELLVQGNEKFQE
        MPTS AA A  GDSS TTIGSSVED  LKESG+AQSQSYAQNEVQELEK G Q S  QPGE +SSV +SSD                           QE
Subjt:  MPTSNAAAAVSGDSSITTIGSSVEDRPLKESGAAQSQSYAQNEVQELEKSGKQNSSSQPGEAQSSVAVSSDQDTFVEQQLGKSTATVDELLVQGNEKFQE

Query:  TEPSRGNDQNNVPHDGVFNIACSSSSKFGSHVGDTRDIDSAVQDAVLREQELATQNIIRSQRESLGADGPPSERSDIFSERYDPSTLKEHLLKITSDHRA
          PS GNDQN VPHDGVFNIA SSSSKFGSHV DTRDID+AV+DAVLREQELATQNIIRS+R+S+ ADG P ERSDIFSERYDPS LKEHLLKITS+HRA
Subjt:  TEPSRGNDQNNVPHDGVFNIACSSSSKFGSHVGDTRDIDSAVQDAVLREQELATQNIIRSQRESLGADGPPSERSDIFSERYDPSTLKEHLLKITSDHRA

Query:  EMAMKRGKSNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVTPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSVRTDMPNFPI
        EMAMKRGK NLPEEGNLEIGNGYGVPGGCAFYGASKPGIVT GNN I  KIQGQV EAEQ+ + KELPEYLKQKL+ARGILKED +  NS          
Subjt:  EMAMKRGKSNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVTPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSVRTDMPNFPI

Query:  LCPSFSPTNSDAISNQPVQGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHPVS
                NSDAISNQ +QG+KLP GWVEAKDP SGV YYYNESTGKSQWERP++SSF LQL SAVSLPEDWMEA+D+TTG +YYYN RT VTQWE PV+
Subjt:  LCPSFSPTNSDAISNQPVQGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHPVS

Query:  SHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTLMQGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQAWGYCNHCTRILRLPQCEY
        SHQ TL HS  + PG WN+QT  Q+KC+ CG GMTL+QG+RYCN      STSSTNG WQ+Q  + +KCMGCGGWGLGLVQAWGYCNHCTR L LPQC+Y
Subjt:  SHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTLMQGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQAWGYCNHCTRILRLPQCEY

Query:  LPTSSVSNQQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPS
        LPTS++ NQQKTE+I ++AD SIKKSA DRSK KPP+GKGGKRESRKRS+SEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPS
Subjt:  LPTSSVSNQQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPS

Query:  PGAVLRKNAEIASQTKKGSSHYAPISKKGDGSDGLGDAD
        PGAVLRKNAEIASQTKKGSSHYAPISK+GDGSDGLGDAD
Subjt:  PGAVLRKNAEIASQTKKGSSHYAPISKKGDGSDGLGDAD

A0A6J1J063 Polyglutamine tract-binding protein 11.2e-26475.27Show/hide
Query:  MPTSNAAAAVSGDSSITTIGSSVEDRPLKESGAAQSQSYAQNEVQELEKSGKQNSSSQPGEAQSSVAVSSDQDTFVEQQLGKSTATVDELLVQGNEKFQE
        MPTS AA A SGDSS TTIGSSVED  LKESG+AQSQSYAQNEVQELEK G Q S  QPGE  SSV + SD                           QE
Subjt:  MPTSNAAAAVSGDSSITTIGSSVEDRPLKESGAAQSQSYAQNEVQELEKSGKQNSSSQPGEAQSSVAVSSDQDTFVEQQLGKSTATVDELLVQGNEKFQE

Query:  TEPSRGNDQNNVPHDGVFNIACSSSSKFGSHVGDTRDIDSAVQDAVLREQELATQNIIRSQRESLGADGPPSERSDIFSERYDPSTLKEHLLKITSDHRA
          PS GNDQN VPH GVFNIA SSSSKFGSHV DTRDID+AV+DAVLREQELATQNIIRSQR+S+GADG P ERSDIFSERYDPSTLKEHLLKIT++HRA
Subjt:  TEPSRGNDQNNVPHDGVFNIACSSSSKFGSHVGDTRDIDSAVQDAVLREQELATQNIIRSQRESLGADGPPSERSDIFSERYDPSTLKEHLLKITSDHRA

Query:  EMAMKRGKSNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVTPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSVRTDMPNFPI
        EMAMKRGK NLPEEGNLEIGNGYGVPGGCAFYGASKPGIVT GNN I  KIQGQV E +Q+S+ KELPEYLKQKL+ARGILKED +  NS          
Subjt:  EMAMKRGKSNLPEEGNLEIGNGYGVPGGCAFYGASKPGIVTPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSVRTDMPNFPI

Query:  LCPSFSPTNSDAISNQPVQGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHPVS
                N+DAISNQ +QG+KLP GWVEAKDP SG  YYYNESTGKSQWERP++SSF LQL SAVSLPEDWMEA+D+ TG KYYYN RT VTQWE P +
Subjt:  LCPSFSPTNSDAISNQPVQGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHPVS

Query:  SHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTLMQGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQAWGYCNHCTRILRLPQCEY
        SHQ TL HSN   PG WN+QT  Q+KC+ CG GMTL+QGSRYCN      STSSTNG WQ+Q  +L+KCMGCGGWGLGLVQAWGYCNHCTR L LPQC+Y
Subjt:  SHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTLMQGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQAWGYCNHCTRILRLPQCEY

Query:  LPTSSVSNQQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPS
        LPTS+++NQ KTE+I +++D SIKKSA DRSK KPP+GKGGKRESRKRS+SEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPS
Subjt:  LPTSSVSNQQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPS

Query:  PGAVLRKNAEIASQTKKGSSHYAPISKKGDGSDGLGDAD
        PGAVLRKNAEIASQTKKGSSHYAPISK+GDGSDGLGDAD
Subjt:  PGAVLRKNAEIASQTKKGSSHYAPISKKGDGSDGLGDAD

SwissProt top hitse value%identityAlignment
A1YFA7 Polyglutamine-binding protein 13.9e-1867.11Show/hide
Query:  ESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGL--KGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKK
        +S+K    +D+ELDPMDPSSYSDAPRG W  GL  +      ADTTA GPLFQQRPYPSPGAVLR NAE AS+TK+
Subjt:  ESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGL--KGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKK

A2T806 Polyglutamine-binding protein 13.9e-1867.11Show/hide
Query:  ESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGL--KGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKK
        +S+K    +D+ELDPMDPSSYSDAPRG W  GL  +      ADTTA GPLFQQRPYPSPGAVLR NAE AS+TK+
Subjt:  ESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGL--KGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKK

O60828 Polyglutamine-binding protein 13.9e-1867.11Show/hide
Query:  ESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGL--KGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKK
        +S+K    +D+ELDPMDPSSYSDAPRG W  GL  +      ADTTA GPLFQQRPYPSPGAVLR NAE AS+TK+
Subjt:  ESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGL--KGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKK

Q2HJC9 Polyglutamine-binding protein 11.8e-1867.11Show/hide
Query:  ESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGL--KGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKK
        +S+K +  +D+ELDPMDPSSYSDAPRG W  GL  +      ADTTA GPLFQQRPYPSPGAVLR NAE AS+TK+
Subjt:  ESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGL--KGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKK

Q91VJ5 Polyglutamine-binding protein 16.7e-1831.34Show/hide
Query:  DSSFDLQLPSAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHPVSSHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTLMQGSRYCNSYTCE---AS
        D   D +      LP  W +  D + GL YY+NV T +  W   +S H      +  +   V NN    ++K     R +  +  +   +  + E    S
Subjt:  DSSFDLQLPSAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHPVSSHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTLMQGSRYCNSYTCE---AS

Query:  TSSTNGNWQEQSFELNKCMGCGGWGLGLVQAWGYCNHCTRILRLPQCEYLPTSSVSNQQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSYS
            + N ++   E                            R    + +      ++++  + D +     K     R +   P  K  K  SRK    
Subjt:  TSSTNGNWQEQSFELNKCMGCGGWGLGLVQAWGYCNHCTRILRLPQCEYLPTSSVSNQQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSYS

Query:  EDDELDPMDPSSYSDAPRGGWVVGL--KGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKK
         D+ELDPMDPSSYSDAPRG W  GL  +      ADTTA GPLFQQRPYPSPGAVLR NAE AS+TK+
Subjt:  EDDELDPMDPSSYSDAPRGGWVVGL--KGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKK

Arabidopsis top hitse value%identityAlignment
AT1G44910.1 pre-mRNA-processing protein 40A3.3e-0434.44Show/hide
Query:  AISNQPVQGDKLPRG---WVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHP
        A+S  P  G+  P+    W E    D G  YYYN+ T +S WE+P +    L+   A ++   W E      G KYYYN  T  ++W  P
Subjt:  AISNQPVQGDKLPRG---WVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHP

AT1G44910.2 pre-mRNA-processing protein 40A3.3e-0434.44Show/hide
Query:  AISNQPVQGDKLPRG---WVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHP
        A+S  P  G+  P+    W E    D G  YYYN+ T +S WE+P +    L+   A ++   W E      G KYYYN  T  ++W  P
Subjt:  AISNQPVQGDKLPRG---WVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHP

AT2G41020.1 WW domain-containing protein2.5e-11346.18Show/hide
Query:  SSSSKFGSHVG--DTRDIDSAVQDAVLREQELATQNIIRSQRES-LGADGPPSERSDIFSERYDPSTLKEHLLKITSDHRAEMAMKRGKS-NLPEEGNLE
        +S+  +GS +    ++DI+SA   A+LREQE+ TQ II+ QRE+     G     +DI  +R DP+ LKEHLLK T++HRAE A KRG S +   EGN++
Subjt:  SSSSKFGSHVG--DTRDIDSAVQDAVLREQELATQNIIRSQRES-LGADGPPSERSDIFSERYDPSTLKEHLLKITSDHRAEMAMKRGKS-NLPEEGNLE

Query:  IGNGYGVPGGCAFYGASKPGIVTPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSVRTDMPNFPILCPSFSPTNSDAISNQPV
        +GNGYG+PGG A+ G S              ++ G   + E  +A+  LPEYLKQKL+ARGIL++      S   D         S    N  A      
Subjt:  IGNGYGVPGGCAFYGASKPGIVTPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSVRTDMPNFPILCPSFSPTNSDAISNQPV

Query:  QGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHPVSSHQGTLTHSNGNVPGVWN
            LP GWV+AKDP SG  YYYN+ TG  QWERP + S+       V   E+W+E  DE +G KY+YN RTHV+QWE P S  +   T+SN  V     
Subjt:  QGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHPVSSHQGTLTHSNGNVPGVWN

Query:  NQTFEQNKCIACGRGMTLMQGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQAWGYCNHCTRILRLPQCEYLPTSSVSNQQKTESIDHS
                                        + S+ NG  +    +L +C GCGGWG+GLVQ WGYC HCTR+  LP+ ++LP            ++H 
Subjt:  NQTFEQNKCIACGRGMTLMQGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQAWGYCNHCTRILRLPQCEYLPTSSVSNQQKTESIDHS

Query:  ADA--SIKKSAMDRSKWKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIA-SQT
         +A  S +K    RS  KPPM    K   +KR+++EDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTA+GPLFQQRPYPSPGAVLR+NAE+A SQ 
Subjt:  ADA--SIKKSAMDRSKWKPPMGKGGKRESRKRSYSEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIA-SQT

Query:  KKGSSHYAPISKKGDGSDGLGDAD
        KK +S +  I+K+GDGSDGLGDAD
Subjt:  KKGSSHYAPISKKGDGSDGLGDAD

AT2G41020.2 WW domain-containing protein1.9e-6839.84Show/hide
Query:  SSSSKFGSHVG--DTRDIDSAVQDAVLREQELATQNIIRSQRES-LGADGPPSERSDIFSERYDPSTLKEHLLKITSDHRAEMAMKRGKS-NLPEEGNLE
        +S+  +GS +    ++DI+SA   A+LREQE+ TQ II+ QRE+     G     +DI  +R DP+ LKEHLLK T++HRAE A KRG S +   EGN++
Subjt:  SSSSKFGSHVG--DTRDIDSAVQDAVLREQELATQNIIRSQRES-LGADGPPSERSDIFSERYDPSTLKEHLLKITSDHRAEMAMKRGKS-NLPEEGNLE

Query:  IGNGYGVPGGCAFYGASKPGIVTPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSVRTDMPNFPILCPSFSPTNSDAISNQPV
        +GNGYG+PGG A+ G S              ++ G   + E  +A+  LPEYLKQKL+ARGIL++      S   D         S    N  A      
Subjt:  IGNGYGVPGGCAFYGASKPGIVTPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSVRTDMPNFPILCPSFSPTNSDAISNQPV

Query:  QGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHPVSSHQGTLTHSNGNVPGVWN
            LP GWV+AKDP SG  YYYN+ TG  QWERP + S+       V   E+W+E  DE +G KY+YN RTHV+QWE P S  +   T+SN  V     
Subjt:  QGDKLPRGWVEAKDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHPVSSHQGTLTHSNGNVPGVWN

Query:  NQTFEQNKCIACGRGMTLMQGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQAWGYCNHCTRILRLPQCEYLP
                                        + S+ NG  +    +L +C GCGGWG+GLVQ WGYC HCTR+  LP+ ++LP
Subjt:  NQTFEQNKCIACGRGMTLMQGSRYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQAWGYCNHCTRILRLPQCEYLP

AT3G19840.1 pre-mRNA-processing protein 40C3.0e-0538.16Show/hide
Query:  AKDPDSGVLYYYNESTGKSQWERP------SDSSFDLQLP-SAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHP
        A   ++GVLYYYN  TG+S +E+P       D      +P S  SLP      +    G KYYYN +T V+ W+ P
Subjt:  AKDPDSGVLYYYNESTGKSQWERP------SDSSFDLQLP-SAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGACCTCAAATGCAGCAGCTGCAGTTTCCGGAGACTCGTCCATAACTACAATTGGGTCCAGCGTTGAAGATAGACCCCTCAAGGAATCGGGCGCCGCTCAATCTCA
ATCTTACGCCCAGAATGAAGTGCAAGAACTTGAAAAGTCTGGCAAACAAAACTCCTCTAGCCAACCGGGAGAAGCGCAGAGTTCTGTAGCAGTGTCTTCCGATCAAGATA
CCTTTGTTGAGCAGCAACTTGGGAAGAGCACTGCAACTGTGGATGAACTGCTCGTGCAGGGGAATGAAAAGTTCCAGGAGACAGAACCAAGTCGTGGAAATGATCAGAAC
AATGTTCCCCATGACGGCGTGTTTAACATTGCTTGCTCGTCTTCTAGCAAATTCGGATCGCATGTTGGCGATACCAGGGACATTGACAGTGCTGTTCAGGATGCAGTGTT
GAGGGAACAGGAACTTGCAACCCAAAATATTATTCGCAGCCAAAGGGAGTCCTTGGGTGCAGATGGACCTCCTAGCGAAAGATCAGATATCTTTTCAGAACGTTATGACC
CAAGTACTCTTAAAGAGCATCTTTTGAAGATTACTTCTGATCATCGTGCTGAAATGGCTATGAAAAGAGGAAAGTCAAACCTTCCAGAAGAAGGGAACTTGGAAATTGGA
AATGGTTATGGCGTACCTGGTGGATGTGCTTTCTATGGTGCTTCAAAGCCTGGAATTGTAACCCCTGGAAATAATGCGATTGGCTGGAAAATCCAGGGACAGGTTACGGA
AGCAGAACAAAACTCTGCTACTAAAGAATTGCCCGAGTACCTCAAGCAGAAGCTAAGAGCCAGGGGTATTCTTAAAGAAGATACACAAAAAAGAAATTCTGTAAGAACTG
ATATGCCAAACTTCCCAATACTTTGTCCATCTTTTTCTCCTACAAATTCTGATGCTATTTCAAATCAACCAGTGCAAGGAGATAAGCTGCCTCGTGGATGGGTGGAGGCT
AAAGACCCTGATAGTGGTGTTTTGTATTATTATAATGAAAGTACTGGGAAGAGTCAATGGGAAAGGCCCTCTGACTCCTCTTTTGATCTGCAACTTCCATCAGCTGTATC
CCTTCCAGAAGATTGGATGGAGGCACTCGATGAAACAACAGGCCTTAAATACTACTACAATGTAAGAACCCACGTAACCCAGTGGGAGCATCCTGTTTCATCTCATCAGG
GAACTTTGACACACTCGAATGGTAACGTTCCTGGGGTTTGGAACAACCAAACTTTTGAACAAAATAAATGCATCGCATGTGGAAGAGGAATGACCCTCATGCAGGGTTCT
AGATACTGCAACAGTTATACATGTGAGGCTTCTACAAGTTCAACCAATGGGAATTGGCAGGAGCAATCGTTTGAGCTAAACAAATGCATGGGATGTGGCGGTTGGGGGCT
TGGCCTTGTGCAAGCTTGGGGTTACTGCAATCATTGTACACGAATTCTCAGGCTTCCCCAGTGTGAATACTTGCCAACCAGCAGTGTTAGTAATCAGCAGAAGACCGAGA
GTATCGATCACAGCGCTGATGCTTCCATTAAAAAGTCTGCTATGGATAGGTCCAAATGGAAACCTCCAATGGGGAAAGGTGGAAAACGAGAAAGTAGGAAGCGATCCTAC
AGCGAGGATGATGAATTGGATCCGATGGACCCTAGCTCCTATTCAGATGCTCCTCGTGGTGGCTGGGTTGTGGGTCTAAAAGGAGTACAACCTCGAGCAGCAGATACTAC
TGCTACAGGTCCTCTATTTCAACAGCGGCCATACCCATCACCTGGAGCTGTTCTGAGGAAGAATGCCGAAATTGCTTCACAAACCAAGAAGGGAAGCTCTCACTATGCAC
CTATTTCCAAGAAAGGAGATGGGAGTGATGGCCTTGGTGATGCTGAC
mRNA sequenceShow/hide mRNA sequence
ATGCCGACCTCAAATGCAGCAGCTGCAGTTTCCGGAGACTCGTCCATAACTACAATTGGGTCCAGCGTTGAAGATAGACCCCTCAAGGAATCGGGCGCCGCTCAATCTCA
ATCTTACGCCCAGAATGAAGTGCAAGAACTTGAAAAGTCTGGCAAACAAAACTCCTCTAGCCAACCGGGAGAAGCGCAGAGTTCTGTAGCAGTGTCTTCCGATCAAGATA
CCTTTGTTGAGCAGCAACTTGGGAAGAGCACTGCAACTGTGGATGAACTGCTCGTGCAGGGGAATGAAAAGTTCCAGGAGACAGAACCAAGTCGTGGAAATGATCAGAAC
AATGTTCCCCATGACGGCGTGTTTAACATTGCTTGCTCGTCTTCTAGCAAATTCGGATCGCATGTTGGCGATACCAGGGACATTGACAGTGCTGTTCAGGATGCAGTGTT
GAGGGAACAGGAACTTGCAACCCAAAATATTATTCGCAGCCAAAGGGAGTCCTTGGGTGCAGATGGACCTCCTAGCGAAAGATCAGATATCTTTTCAGAACGTTATGACC
CAAGTACTCTTAAAGAGCATCTTTTGAAGATTACTTCTGATCATCGTGCTGAAATGGCTATGAAAAGAGGAAAGTCAAACCTTCCAGAAGAAGGGAACTTGGAAATTGGA
AATGGTTATGGCGTACCTGGTGGATGTGCTTTCTATGGTGCTTCAAAGCCTGGAATTGTAACCCCTGGAAATAATGCGATTGGCTGGAAAATCCAGGGACAGGTTACGGA
AGCAGAACAAAACTCTGCTACTAAAGAATTGCCCGAGTACCTCAAGCAGAAGCTAAGAGCCAGGGGTATTCTTAAAGAAGATACACAAAAAAGAAATTCTGTAAGAACTG
ATATGCCAAACTTCCCAATACTTTGTCCATCTTTTTCTCCTACAAATTCTGATGCTATTTCAAATCAACCAGTGCAAGGAGATAAGCTGCCTCGTGGATGGGTGGAGGCT
AAAGACCCTGATAGTGGTGTTTTGTATTATTATAATGAAAGTACTGGGAAGAGTCAATGGGAAAGGCCCTCTGACTCCTCTTTTGATCTGCAACTTCCATCAGCTGTATC
CCTTCCAGAAGATTGGATGGAGGCACTCGATGAAACAACAGGCCTTAAATACTACTACAATGTAAGAACCCACGTAACCCAGTGGGAGCATCCTGTTTCATCTCATCAGG
GAACTTTGACACACTCGAATGGTAACGTTCCTGGGGTTTGGAACAACCAAACTTTTGAACAAAATAAATGCATCGCATGTGGAAGAGGAATGACCCTCATGCAGGGTTCT
AGATACTGCAACAGTTATACATGTGAGGCTTCTACAAGTTCAACCAATGGGAATTGGCAGGAGCAATCGTTTGAGCTAAACAAATGCATGGGATGTGGCGGTTGGGGGCT
TGGCCTTGTGCAAGCTTGGGGTTACTGCAATCATTGTACACGAATTCTCAGGCTTCCCCAGTGTGAATACTTGCCAACCAGCAGTGTTAGTAATCAGCAGAAGACCGAGA
GTATCGATCACAGCGCTGATGCTTCCATTAAAAAGTCTGCTATGGATAGGTCCAAATGGAAACCTCCAATGGGGAAAGGTGGAAAACGAGAAAGTAGGAAGCGATCCTAC
AGCGAGGATGATGAATTGGATCCGATGGACCCTAGCTCCTATTCAGATGCTCCTCGTGGTGGCTGGGTTGTGGGTCTAAAAGGAGTACAACCTCGAGCAGCAGATACTAC
TGCTACAGGTCCTCTATTTCAACAGCGGCCATACCCATCACCTGGAGCTGTTCTGAGGAAGAATGCCGAAATTGCTTCACAAACCAAGAAGGGAAGCTCTCACTATGCAC
CTATTTCCAAGAAAGGAGATGGGAGTGATGGCCTTGGTGATGCTGAC
Protein sequenceShow/hide protein sequence
MPTSNAAAAVSGDSSITTIGSSVEDRPLKESGAAQSQSYAQNEVQELEKSGKQNSSSQPGEAQSSVAVSSDQDTFVEQQLGKSTATVDELLVQGNEKFQETEPSRGNDQN
NVPHDGVFNIACSSSSKFGSHVGDTRDIDSAVQDAVLREQELATQNIIRSQRESLGADGPPSERSDIFSERYDPSTLKEHLLKITSDHRAEMAMKRGKSNLPEEGNLEIG
NGYGVPGGCAFYGASKPGIVTPGNNAIGWKIQGQVTEAEQNSATKELPEYLKQKLRARGILKEDTQKRNSVRTDMPNFPILCPSFSPTNSDAISNQPVQGDKLPRGWVEA
KDPDSGVLYYYNESTGKSQWERPSDSSFDLQLPSAVSLPEDWMEALDETTGLKYYYNVRTHVTQWEHPVSSHQGTLTHSNGNVPGVWNNQTFEQNKCIACGRGMTLMQGS
RYCNSYTCEASTSSTNGNWQEQSFELNKCMGCGGWGLGLVQAWGYCNHCTRILRLPQCEYLPTSSVSNQQKTESIDHSADASIKKSAMDRSKWKPPMGKGGKRESRKRSY
SEDDELDPMDPSSYSDAPRGGWVVGLKGVQPRAADTTATGPLFQQRPYPSPGAVLRKNAEIASQTKKGSSHYAPISKKGDGSDGLGDAD