CuGenDBv2

IVF0012064 (gene) of Melon (IVF77) v1 genome

Gene ID: IVF0012064
Organism: Cucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: Polyglutamine tract-binding protein 1
Genome location: chr04:27350138..27362739
RNA-Seq Expression: IVF0012064
Synteny: IVF0012064
Gene Ontology terms: GO:0005622 - intracellular (cellular component)
                     GO:0005515 - protein binding (molecular function)
InterPro domains: IPR001202 - WW domain
                  IPR036020 - WW domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA0053859.1 WW domain-containing protein [Cucumis melo var. makuwa] | e value: 5.72e-304 | % identity: 88.29
Query:  MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV
        MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV
Subjt:  MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV

Query:  DDGRDIEIAVQDAVLREQE--LATQNIIRSQRESVGADGLPAEQSDIFSERYDPSTIKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCA
        DDGRDIEIAVQDAVLREQ   +     +   RESVGADGLPAEQSDIFSERYDPSTIKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCA
Subjt:  DDGRDIEIAVQDAVLREQE--LATQNIIRSQRESVGADGLPAEQSDIFSERYDPSTIKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCA

Query:  SYGASKPGIVANG----------------------------NNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP----TNSDAVSN
        SYGASKPGIVANG                            NNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP    TNSDAVSN
Subjt:  SYGASKPGIVANG----------------------------NNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP----TNSDAVSN

Query:  TKLHGEKLPHGWVEAKDPHSGVSYYYNESSGKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPG
        TKLHGEKLPHGWVEAKDPHSGVSYYYNESSGKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPG
Subjt:  TKLHGEKLPHGWVEAKDPHSGVSYYYNESSGKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPG

Query:  PWNDQTLEQSKCITCGSGMTLVQGSRYCNACTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPSVSTCQ--PTTLVISRRQR
        PWNDQTLEQSKCITCGSGMTLVQGSRYCNACTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLP    CQ  PT   IS +Q+
Subjt:  PWNDQTLEQSKCITCGSGMTLVQGSRYCNACTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPSVSTCQ--PTTLVISRRQR

Query:  TSSI
        T +I
Subjt:  TSSI

TYK25544.1 uncharacterized protein E5676_scaffold352G006960 [Cucumis melo var. makuwa] | e value: 0.0 | % identity: 97.02
Query:  MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV
        MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV
Subjt:  MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV

Query:  DDGRDIEIAVQDAVLREQELATQNIIRSQRESVGADGLPAEQSDIFSERYDPSTIKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCASY
        DDGRDIEIAVQDAVLREQELATQNIIRSQRESVGADGLPAEQSDIFSERYDPSTIKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCASY
Subjt:  DDGRDIEIAVQDAVLREQELATQNIIRSQRESVGADGLPAEQSDIFSERYDPSTIKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCASY

Query:  GASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNPTNSDAVSNTKLHGEKLPHGWVEAKDPHSGVSYYYNESSGKSQ
        GASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNPTNSDAVSNTKLHGEKLPHGWVEAKDPHSGVSYYYNESSGKSQ
Subjt:  GASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNPTNSDAVSNTKLHGEKLPHGWVEAKDPHSGVSYYYNESSGKSQ

Query:  WERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQGSRYCNACTSG
        WERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQGSRYCNACTSG
Subjt:  WERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQGSRYCNACTSG

Query:  VSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPSVSTCQ--PTTLVISRRQRTSSI
        VSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLP    CQ  PT   IS +Q+T +I
Subjt:  VSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPSVSTCQ--PTTLVISRRQRTSSI

XP_016899708.1 PREDICTED: uncharacterized protein LOC103486911 isoform X1 [Cucumis melo] | e value: 0.0 | % identity: 96.2
Query:  MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV
        MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV
Subjt:  MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV

Query:  DDGRDIEIAVQDAVLREQELATQNIIRSQRESVGADGLPAEQSDIFSERYDPSTIKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCASY
        DDGRDIEIAVQDAVLREQELATQNIIRSQRESVGADGLPAEQSDIFSERYDPSTIKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCASY
Subjt:  DDGRDIEIAVQDAVLREQELATQNIIRSQRESVGADGLPAEQSDIFSERYDPSTIKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCASY

Query:  GASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP----TNSDAVSNTKLHGEKLPHGWVEAKDPHSGVSYYYNESS
        GASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP    TNSDAVSNTKLHGEKLPHGWVEAKDPHSGVSYYYNESS
Subjt:  GASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP----TNSDAVSNTKLHGEKLPHGWVEAKDPHSGVSYYYNESS

Query:  GKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQGSRYCNA
        GKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQGSRYCNA
Subjt:  GKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQGSRYCNA

Query:  CTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPSVSTCQ--PTTLVISRRQRTSSI
        CTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLP    CQ  PT   IS +Q+T +I
Subjt:  CTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPSVSTCQ--PTTLVISRRQRTSSI

XP_016899711.1 PREDICTED: uncharacterized protein LOC103486911 isoform X2 [Cucumis melo] | e value: 0.0 | % identity: 95.57
Query:  MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV
        MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV
Subjt:  MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV

Query:  DDGRDIEIAVQDAVLREQELATQNIIRSQRESVGADGLPAEQSDIFSERYDPSTIKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCASY
        DDGRDIEIAVQDAVLREQELATQNIIRSQRESVGADGLPAEQSDIFSERYDPSTIKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCASY
Subjt:  DDGRDIEIAVQDAVLREQELATQNIIRSQRESVGADGLPAEQSDIFSERYDPSTIKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCASY

Query:  GASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP----TNSDAVSNTKLHGEKLPHGWVEAKDPHSGVSYYYNESS
        GASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP    TNSDAVSNTKLHGEKLPHGWVEAKDPHSGVSYYYNESS
Subjt:  GASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP----TNSDAVSNTKLHGEKLPHGWVEAKDPHSGVSYYYNESS

Query:  GKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQGSRYCNA
        GKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQGSRYCNA
Subjt:  GKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQGSRYCNA

Query:  CTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPSVSTCQ--PTTLVISRRQRTSSI
        CTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTR   LP    CQ  PT   IS +Q+T +I
Subjt:  CTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPSVSTCQ--PTTLVISRRQRTSSI

XP_016899712.1 PREDICTED: uncharacterized protein LOC103486911 isoform X3 [Cucumis melo] | e value: 0.0 | % identity: 97.02
Query:  MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV
        MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV
Subjt:  MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV

Query:  DDGRDIEIAVQDAVLREQELATQNIIRSQRESVGADGLPAEQSDIFSERYDPSTIKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCASY
        DDGRDIEIAVQDAVLREQELATQNIIRSQRESVGADGLPAEQSDIFSERYDPSTIKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCASY
Subjt:  DDGRDIEIAVQDAVLREQELATQNIIRSQRESVGADGLPAEQSDIFSERYDPSTIKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCASY

Query:  GASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNPTNSDAVSNTKLHGEKLPHGWVEAKDPHSGVSYYYNESSGKSQ
        GASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNPTNSDAVSNTKLHGEKLPHGWVEAKDPHSGVSYYYNESSGKSQ
Subjt:  GASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNPTNSDAVSNTKLHGEKLPHGWVEAKDPHSGVSYYYNESSGKSQ

Query:  WERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQGSRYCNACTSG
        WERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQGSRYCNACTSG
Subjt:  WERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQGSRYCNACTSG

Query:  VSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPSVSTCQ--PTTLVISRRQRTSSI
        VSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLP    CQ  PT   IS +Q+T +I
Subjt:  VSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPSVSTCQ--PTTLVISRRQRTSSI

TrEMBL top hits (e value, % identity, alignment)
A0A1S4DUP7 uncharacterized protein LOC103486911 isoform X3 | e value: 1.9e-255 | % identity: 97.02
Query:  MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV
        MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV
Subjt:  MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV

Query:  DDGRDIEIAVQDAVLREQELATQNIIRSQRESVGADGLPAEQSDIFSERYDPSTIKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCASY
        DDGRDIEIAVQDAVLREQELATQNIIRSQRESVGADGLPAEQSDIFSERYDPSTIKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCASY
Subjt:  DDGRDIEIAVQDAVLREQELATQNIIRSQRESVGADGLPAEQSDIFSERYDPSTIKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCASY

Query:  GASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNPTNSDAVSNTKLHGEKLPHGWVEAKDPHSGVSYYYNESSGKSQ
        GASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNPTNSDAVSNTKLHGEKLPHGWVEAKDPHSGVSYYYNESSGKSQ
Subjt:  GASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNPTNSDAVSNTKLHGEKLPHGWVEAKDPHSGVSYYYNESSGKSQ

Query:  WERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQGSRYCNACTSG
        WERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQGSRYCNACTSG
Subjt:  WERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQGSRYCNACTSG

Query:  VSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPSVSTCQ--PTTLVISRRQRTSSI
        VSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLP    CQ  PT   IS +Q+T +I
Subjt:  VSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPSVSTCQ--PTTLVISRRQRTSSI

A0A1S4DUQ3 uncharacterized protein LOC103486911 isoform X2 | e value: 1.4e-250 | % identity: 95.15
Query:  MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV
        MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV
Subjt:  MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV

Query:  DDGRDIEIAVQDAVLREQELATQNIIRSQRESVGADGLPAEQSDIFSERYDPSTIKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCASY
        DDGRDIEIAVQDAVLREQELATQNIIRSQRESVGADGLPAEQSDIFSERYDPSTIKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCASY
Subjt:  DDGRDIEIAVQDAVLREQELATQNIIRSQRESVGADGLPAEQSDIFSERYDPSTIKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCASY

Query:  GASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP----TNSDAVSNTKLHGEKLPHGWVEAKDPHSGVSYYYNESS
        GASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP    TNSDAVSNTKLHGEKLPHGWVEAKDPHSGVSYYYNESS
Subjt:  GASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP----TNSDAVSNTKLHGEKLPHGWVEAKDPHSGVSYYYNESS

Query:  GKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQGSRYCNA
        GKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQGSRYCNA
Subjt:  GKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQGSRYCNA

Query:  CTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPSVSTCQ--PTTLVISRRQRTSSI
        CTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTR+        CQ  PT   IS +Q+T +I
Subjt:  CTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPSVSTCQ--PTTLVISRRQRTSSI

A0A1S4DVH1 uncharacterized protein LOC103486911 isoform X1 | e value: 1.0e-253 | % identity: 96.2
Query:  MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV
        MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV
Subjt:  MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV

Query:  DDGRDIEIAVQDAVLREQELATQNIIRSQRESVGADGLPAEQSDIFSERYDPSTIKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCASY
        DDGRDIEIAVQDAVLREQELATQNIIRSQRESVGADGLPAEQSDIFSERYDPSTIKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCASY
Subjt:  DDGRDIEIAVQDAVLREQELATQNIIRSQRESVGADGLPAEQSDIFSERYDPSTIKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCASY

Query:  GASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP----TNSDAVSNTKLHGEKLPHGWVEAKDPHSGVSYYYNESS
        GASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP    TNSDAVSNTKLHGEKLPHGWVEAKDPHSGVSYYYNESS
Subjt:  GASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP----TNSDAVSNTKLHGEKLPHGWVEAKDPHSGVSYYYNESS

Query:  GKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQGSRYCNA
        GKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQGSRYCNA
Subjt:  GKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQGSRYCNA

Query:  CTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPSVSTCQ--PTTLVISRRQRTSSI
        CTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLP    CQ  PT   IS +Q+T +I
Subjt:  CTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPSVSTCQ--PTTLVISRRQRTSSI

A0A5A7UK56 Polyglutamine tract-binding protein 1 | e value: 4.5e-241 | % identity: 88.29
Query:  MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV
        MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV
Subjt:  MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV

Query:  DDGRDIEIAVQDAVLREQ--ELATQNIIRSQRESVGADGLPAEQSDIFSERYDPSTIKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCA
        DDGRDIEIAVQDAVLREQ   +     +   RESVGADGLPAEQSDIFSERYDPSTIKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCA
Subjt:  DDGRDIEIAVQDAVLREQ--ELATQNIIRSQRESVGADGLPAEQSDIFSERYDPSTIKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCA

Query:  SYGASKPGIVAN----------------------------GNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP----TNSDAVSN
        SYGASKPGIVAN                            GNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP    TNSDAVSN
Subjt:  SYGASKPGIVAN----------------------------GNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNP----TNSDAVSN

Query:  TKLHGEKLPHGWVEAKDPHSGVSYYYNESSGKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPG
        TKLHGEKLPHGWVEAKDPHSGVSYYYNESSGKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPG
Subjt:  TKLHGEKLPHGWVEAKDPHSGVSYYYNESSGKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPG

Query:  PWNDQTLEQSKCITCGSGMTLVQGSRYCNACTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPSVSTCQ--PTTLVISRRQR
        PWNDQTLEQSKCITCGSGMTLVQGSRYCNACTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLP    CQ  PT   IS +Q+
Subjt:  PWNDQTLEQSKCITCGSGMTLVQGSRYCNACTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPSVSTCQ--PTTLVISRRQR

Query:  TSSI
        T +I
Subjt:  TSSI

A0A5D3DPP7 Polyglutamine tract-binding protein 1 | e value: 1.9e-255 | % identity: 97.02
Query:  MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV
        MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV
Subjt:  MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNV

Query:  DDGRDIEIAVQDAVLREQELATQNIIRSQRESVGADGLPAEQSDIFSERYDPSTIKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCASY
        DDGRDIEIAVQDAVLREQELATQNIIRSQRESVGADGLPAEQSDIFSERYDPSTIKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCASY
Subjt:  DDGRDIEIAVQDAVLREQELATQNIIRSQRESVGADGLPAEQSDIFSERYDPSTIKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCASY

Query:  GASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNPTNSDAVSNTKLHGEKLPHGWVEAKDPHSGVSYYYNESSGKSQ
        GASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNPTNSDAVSNTKLHGEKLPHGWVEAKDPHSGVSYYYNESSGKSQ
Subjt:  GASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNPTNSDAVSNTKLHGEKLPHGWVEAKDPHSGVSYYYNESSGKSQ

Query:  WERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQGSRYCNACTSG
        WERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQGSRYCNACTSG
Subjt:  WERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQGSRYCNACTSG

Query:  VSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPSVSTCQ--PTTLVISRRQRTSSI
        VSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLP    CQ  PT   IS +Q+T +I
Subjt:  VSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLPSVSTCQ--PTTLVISRRQRTSSI

SwissProt top hits (e value, % identity, alignment)
O75554 WW domain-binding protein 4 | e value: 1.2e-04 | % identity: 28.42
Query:  QKIQGQVKEVEQSS------AAKALPEYLKQKLRARGILKEDAEHS-----------NPTNSDAVSNTKLHGEKLPH--GWVEAKDPHSGVSYYYNESSG
        QK   + KE E++S       A AL  Y ++ L+  G+  E  E S           + +N       K   +K P    WVE      G  YYY+  SG
Subjt:  QKIQGQVKEVEQSS------AAKALPEYLKQKLRARGILKEDAEHS-----------NPTNSDAVSNTKLHGEKLPH--GWVEAKDPHSGVSYYYNESSG

Query:  KSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQ---TLEQSK
         SQWE+P     D + ++  ++   W+E + +  G  YYYN  T  ++WE+P       + H++D      N+    TL++SK
Subjt:  KSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQ---TLEQSK

P33203 Pre-mRNA-processing protein PRP40 | e value: 2.4e-05 | % identity: 44.44
Query:  WVEAKDPHSGVSYYYNESSGKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERP
        W EAKD  SG  YYYN  + KS WE+P EL S  +L     L E+  +A     G  YYYN  T  T W  P
Subjt:  WVEAKDPHSGVSYYYNESSGKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERP

Q59XV0 Histone-lysine N-methyltransferase, H3 lysine-36 specific | e value: 5.9e-04 | % identity: 32.67
Query:  EHSNPTNSDAVSNTKLHGEKLPHGWVEAKDPHSGVSYYYNESSGKSQWERPSELSSDTQLSSA--VSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVA
        + S PTN+ A + T         G   +  P   +S +  E  G +    PS  S   Q  ++    LPE+W  A D+ +G  YYYN+ T  T WERP+ 
Subjt:  EHSNPTNSDAVSNTKLHGEKLPHGWVEAKDPHSGVSYYYNESSGKSQWERPSELSSDTQLSSA--VSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVA

Query:  S
        S
Subjt:  S

Q5F457 WW domain-binding protein 4 | e value: 2.0e-07 | % identity: 33
Query:  KEDAEHSNPTNSDAVSNTKLHGEKLPHGWVEAKDPHSGVSYYYNESSGKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERP
        KE  E    T      + K      P  WV+   P  G +YYYN  +G+SQWE+P     +++ S   S+   W+E + +  G  YYYN +T ++ WE+P
Subjt:  KEDAEHSNPTNSDAVSNTKLHGEKLPHGWVEAKDPHSGVSYYYNESSGKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERP

Q61048 WW domain-binding protein 4 | e value: 2.7e-04 | % identity: 29.35
Query:  QKIQGQVKEVEQSS------AAKALPEYLKQKLRARGILKEDAEHSNPTNSDAVSNT---------------KLHGEKLPHGWVEAKDPHSGVSYYYNES
        QK   + KE E++S       A AL  Y ++ L+  G L   ++ S PT S  +S                 K   E    GWVE      G  YYY+  
Subjt:  QKIQGQVKEVEQSS------AAKALPEYLKQKLRARGILKEDAEHSNPTNSDAVSNT---------------KLHGEKLPHGWVEAKDPHSGVSYYYNES

Query:  SGKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERP--VASHQTTLTHSNDKVPGPWNDQTLEQSK
        +G SQWE+P     + + ++A ++   W+E + +  G  YYYN  T  ++WE+P     H   +  S D    P    TLE +K
Subjt:  SGKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERP--VASHQTTLTHSNDKVPGPWNDQTLEQSK

Arabidopsis top hits (e value, % identity, alignment)
AT2G41020.1 WW domain-containing protein | e value: 3.0e-67 | % identity: 40.53
Query:  NNIAVSTSSNFRSNV--DDGRDIEIAVQDAVLREQELATQNIIRSQRES-VGADGLPAEQSDIFSERYDPSTIKEHLLKITSEHRAEMAMKR-GKLNLPE
        N  +V+++  + S++  D  +DIE A   A+LREQE+ TQ II+ QRE+     G     +DI  +R DP+ +KEHLLK T+ HRAE A KR G ++   
Subjt:  NNIAVSTSSNFRSNV--DDGRDIEIAVQDAVLREQELATQNIIRSQRES-VGADGLPAEQSDIFSERYDPSTIKEHLLKITSEHRAEMAMKR-GKLNLPE

Query:  EGNLEIGNGYGVPGGCASYGASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKE--DAEHSNPTNSDAVSNTK-------LHGE
        EGN+++GNGYG+PGG A  G S+         ++G+         E ++A+  LPEYLKQKL+ARGIL++   A  SNP ++ AVS  +        +  
Subjt:  EGNLEIGNGYGVPGGCASYGASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKE--DAEHSNPTNSDAVSNTK-------LHGE

Query:  KLPHGWVEAKDPHSGVSYYYNESSGKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQT
         LP GWV+AKDP SG +YYYN+ +G  QWERP ELS  T  +  V   E+W+E  D+ SG KY+YN RTH++QWE P +  +   T+SN           
Subjt:  KLPHGWVEAKDPHSGVSYYYNESSGKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQT

Query:  LEQSKCITCGSGMTLVQGSRYCNACTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLP
                                  + V+ S+ NG  +   S+  +C GCGGWG+GLVQ WGYC HCTR+ +LP
Subjt:  LEQSKCITCGSGMTLVQGSRYCNACTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLP

AT2G41020.2 WW domain-containing protein | e value: 3.0e-67 | % identity: 40.53
Query:  NNIAVSTSSNFRSNV--DDGRDIEIAVQDAVLREQELATQNIIRSQRES-VGADGLPAEQSDIFSERYDPSTIKEHLLKITSEHRAEMAMKR-GKLNLPE
        N  +V+++  + S++  D  +DIE A   A+LREQE+ TQ II+ QRE+     G     +DI  +R DP+ +KEHLLK T+ HRAE A KR G ++   
Subjt:  NNIAVSTSSNFRSNV--DDGRDIEIAVQDAVLREQELATQNIIRSQRES-VGADGLPAEQSDIFSERYDPSTIKEHLLKITSEHRAEMAMKR-GKLNLPE

Query:  EGNLEIGNGYGVPGGCASYGASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKE--DAEHSNPTNSDAVSNTK-------LHGE
        EGN+++GNGYG+PGG A  G S+         ++G+         E ++A+  LPEYLKQKL+ARGIL++   A  SNP ++ AVS  +        +  
Subjt:  EGNLEIGNGYGVPGGCASYGASKPGIVANGNNVTGQKIQGQVKEVEQSSAAKALPEYLKQKLRARGILKE--DAEHSNPTNSDAVSNTK-------LHGE

Query:  KLPHGWVEAKDPHSGVSYYYNESSGKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQT
         LP GWV+AKDP SG +YYYN+ +G  QWERP ELS  T  +  V   E+W+E  D+ SG KY+YN RTH++QWE P +  +   T+SN           
Subjt:  KLPHGWVEAKDPHSGVSYYYNESSGKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQT

Query:  LEQSKCITCGSGMTLVQGSRYCNACTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLP
                                  + V+ S+ NG  +   S+  +C GCGGWG+GLVQ WGYC HCTR+ +LP
Subjt:  LEQSKCITCGSGMTLVQGSRYCNACTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHCTRILSLP

AT3G19840.1 pre-mRNA-processing protein 40C | e value: 3.6e-04 | % identity: 30.53
Query:  DAVSNTKLHGEKLPHGWVEAKDPHSGVSYYYNESSGKSQWERPSELSSDTQL-------SSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERP
        D  + ++L G +L   W   K   +GV YYYN  +G+S +E+P     +           S  SLP      +    G KYYYN +T ++ W+ P
Subjt:  DAVSNTKLHGEKLPHGWVEAKDPHSGVSYYYNESSGKSQWERPSELSSDTQL-------SSAVSLPEDWMEAIDQTSGLKYYYNMRTHITQWERP


Sequences
CDS sequence
ATGCCGACTTCTACTACAGCAATTGCAGGTTCAGGAGATTCGTCCAATACCATAATTGGTTCTAGTGCCGAAGATAAATCCCTCAAGGAATCAGCTGCTGCTCAAAATGA
AGTGCAAGAACTTGAAAAGTTTAGCAAACAAATTTATCCTTGCCAACCGGGAGAAGCTCAGTGTTCTGTGGCAATATCTGCTGATCAAGAGACCAATCAAAGTTTTGGGA
ACGATCAGAACATTGTTCCCCATGAGGGTGTGTTTAACAACATTGCTGTCTCAACTTCTAGCAACTTCAGGTCGAATGTTGACGATGGCAGAGACATTGAGATTGCTGTT
CAGGATGCCGTGTTGAGGGAACAGGAACTCGCTACCCAAAATATTATTCGTAGCCAAAGGGAATCTGTGGGTGCAGATGGACTTCCGGCTGAGCAATCAGATATCTTTTC
AGAACGTTATGACCCGAGTACTATTAAAGAGCATCTTTTGAAGATTACTTCTGAACATCGTGCAGAAATGGCTATGAAAAGGGGAAAGTTGAATCTTCCAGAAGAAGGGA
ACTTGGAAATTGGAAATGGATACGGAGTACCTGGTGGATGTGCTTCCTATGGTGCTTCAAAGCCTGGAATTGTTGCCAATGGAAATAATGTGACTGGCCAGAAAATCCAG
GGACAAGTCAAGGAAGTCGAACAAAGTTCTGCTGCCAAAGCATTGCCAGAGTACCTCAAGCAGAAGCTAAGAGCTAGGGGTATTCTTAAAGAAGATGCAGAACACAGTAA
TCCTACAAATTCTGACGCTGTTTCAAACACAAAGTTGCATGGAGAAAAGCTGCCTCATGGATGGGTGGAGGCTAAAGACCCTCACAGTGGCGTTTCATATTATTATAATG
AAAGTAGTGGGAAGAGTCAATGGGAAAGGCCCTCCGAACTTTCTTCTGATACGCAACTTTCATCAGCTGTATCCCTTCCAGAAGATTGGATGGAGGCAATTGATCAAACA
TCAGGCCTTAAATACTATTACAATATGAGAACCCATATAACCCAGTGGGAGCGGCCTGTTGCATCTCATCAAACAACTTTGACACACTCGAATGATAAAGTTCCTGGGCC
TTGGAACGACCAAACTTTGGAGCAAAGTAAATGCATCACATGTGGAAGTGGAATGACCCTCGTACAGGGTTCAAGATACTGCAACGCTTGTACAAGTGGGGTTTCTACAA
GTTCAACCAATGGGATGTGGCAGGACCAATCGTCTGAGCAAAATAAATGCATGGGATGTGGTGGCTGGGGACTAGGCCTTGTGCAAGCTTGGGGTTACTGTAATCATTGC
ACACGAATTCTCAGTCTCCCCAGTGTCAGTACTTGCCAACCAACAACATTAGTAATCAGCAGAAGACAGAGAACATCAAGCATAGTGCCGATCCCTCCATTAAAAAATCT
GCGACAGATAGGTCCAAATGGAAACCTCCAATGGGAAAAGGTGGAAAGCGAGAAAGTAGGAAGCGTTCCTACAGTGAGGATGATGAATTGGATCCAATGGATCCTAGCTC
TTATTCAGATGCTCCTCGTGGTGGCTGGGTTGTGGGTCTAA
mRNA sequence
TTTACTTTCATTTGTCATTTGGACTCTACCTCATCGTGTCCAAACTCAACAATATATTTACCAATTGAAACACTAAACAGCCTCGGAATCAACTAGGCCTTTGAGAAAAC
ACCACTCGCATTTATTCTACAAAATCTCAAAACAAAAAACAGGAAAGAAGATCAAATCAAATGAAATTGAAATTAGGACACATCGGTGATAAGCCAAAAGAAGACTAGTC
ACTACATTATGCTTGTAAACTTGTAAAGATAATACTCCAGTTTGTTTGTTTGGGCAGAAGGGACGGCGGCGGCGGCGGCAGAGGATAGGAGAGCGGACAGGACGGCGAAT
TCGGTTTGTCGGTCGGTTTATCGGGTCGTTTCGACCACCTTTCAGAAATAAAACCAGGTATTCCCAATTCGAAGCGTCGTTTCTCATCGCCGGTTCCGCCATATTCAAAT
TCTCTCGAATTCCGACTAGCAGTCGATCAATTACAGGAAAGCTTTCGAAATTGAAAGTTCCGGCGATGCCGACTTCTACTACAGCAATTGCAGGTTCAGGAGATTCGTCC
AATACCATAATTGGTTCTAGTGCCGAAGATAAATCCCTCAAGGAATCAGCTGCTGCTCAAAATGAAGTGCAAGAACTTGAAAAGTTTAGCAAACAAATTTATCCTTGCCA
ACCGGGAGAAGCTCAGTGTTCTGTGGCAATATCTGCTGATCAAGAGACCAATCAAAGTTTTGGGAACGATCAGAACATTGTTCCCCATGAGGGTGTGTTTAACAACATTG
CTGTCTCAACTTCTAGCAACTTCAGGTCGAATGTTGACGATGGCAGAGACATTGAGATTGCTGTTCAGGATGCCGTGTTGAGGGAACAGGAACTCGCTACCCAAAATATT
ATTCGTAGCCAAAGGGAATCTGTGGGTGCAGATGGACTTCCGGCTGAGCAATCAGATATCTTTTCAGAACGTTATGACCCGAGTACTATTAAAGAGCATCTTTTGAAGAT
TACTTCTGAACATCGTGCAGAAATGGCTATGAAAAGGGGAAAGTTGAATCTTCCAGAAGAAGGGAACTTGGAAATTGGAAATGGATACGGAGTACCTGGTGGATGTGCTT
CCTATGGTGCTTCAAAGCCTGGAATTGTTGCCAATGGAAATAATGTGACTGGCCAGAAAATCCAGGGACAAGTCAAGGAAGTCGAACAAAGTTCTGCTGCCAAAGCATTG
CCAGAGTACCTCAAGCAGAAGCTAAGAGCTAGGGGTATTCTTAAAGAAGATGCAGAACACAGTAATCCTACAAATTCTGACGCTGTTTCAAACACAAAGTTGCATGGAGA
AAAGCTGCCTCATGGATGGGTGGAGGCTAAAGACCCTCACAGTGGCGTTTCATATTATTATAATGAAAGTAGTGGGAAGAGTCAATGGGAAAGGCCCTCCGAACTTTCTT
CTGATACGCAACTTTCATCAGCTGTATCCCTTCCAGAAGATTGGATGGAGGCAATTGATCAAACATCAGGCCTTAAATACTATTACAATATGAGAACCCATATAACCCAG
TGGGAGCGGCCTGTTGCATCTCATCAAACAACTTTGACACACTCGAATGATAAAGTTCCTGGGCCTTGGAACGACCAAACTTTGGAGCAAAGTAAATGCATCACATGTGG
AAGTGGAATGACCCTCGTACAGGGTTCAAGATACTGCAACGCTTGTACAAGTGGGGTTTCTACAAGTTCAACCAATGGGATGTGGCAGGACCAATCGTCTGAGCAAAATA
AATGCATGGGATGTGGTGGCTGGGGACTAGGCCTTGTGCAAGCTTGGGGTTACTGTAATCATTGCACACGAATTCTCAGTCTCCCCAGTGTCAGTACTTGCCAACCAACA
ACATTAGTAATCAGCAGAAGACAGAGAACATCAAGCATAGTGCCGATCCCTCCATTAAAAAATCTGCGACAGATAGGTCCAAATGGAAACCTCCAATGGGAAAAGGTGGA
AAGCGAGAAAGTAGGAAGCGTTCCTACAGTGAGGATGATGAATTGGATCCAATGGATCCTAGCTCTTATTCAGATGCTCCTCGTGGTGGCTGGGTTGTGGGTCTAAAAGG
TGTGCAGCCTCGGGCAGCAGATACTACTGCTACAGGTCCTCTCTTTCAGCAGCGGCCATATCCATCACCTGGAGCTGTTCTGAGGAAGAATGCTGAAATTGCTTCACAGA
CAAGAAGGGAAGCTCTCACTATGCACCCATTTCCAAGAGGGGAGATGGAAGTGATGGCCTTGGTGATGCTGACTGATCTCTTCTACTTTATTCTGCTACGACGATACATC
GACCCCTATTTCTGTAGTGTAGAAATGCACTTTTACTTGAGATGCGACATAGGTCGTGACATATATCTTCGACCGTATAGCCTTACATGAAGTTATGGATCAAATGCACG
TCTCACTTTCTGAAGGGGAGGATCGTGGCTGTACGATGGCACCTTCTTGTCTATATACTAGTTACTATACTTCCTTTATATGGAAAACTTGACTCGTGGACAAAACGGTC
AATGAGGTAGATTTTGACCTTCACATATGTACCGGAAGCGGAAGATGTGAATTACTAGTCAGGAGTAGAAAATTTGGATTTATTGTATTTTCTTTCTTCTTTTACTTCTC
TCATCTAAGCATTTAGTTCAAACATTACGATAATTTAACTTACCGTCACTTATACGACCTTAAGTTGGGCTGTACAATTGTGTCCTTTTAAATTCTAAAGAGTTGTTTAT
GTTAAGTATGTTAGTTTGCTTGTCTGAACTTAGAAGCTTGTTAATCCTATAATCATGTAAACACTA
Protein sequence
MPTSTTAIAGSGDSSNTIIGSSAEDKSLKESAAAQNEVQELEKFSKQIYPCQPGEAQCSVAISADQETNQSFGNDQNIVPHEGVFNNIAVSTSSNFRSNVDDGRDIEIAV
QDAVLREQELATQNIIRSQRESVGADGLPAEQSDIFSERYDPSTIKEHLLKITSEHRAEMAMKRGKLNLPEEGNLEIGNGYGVPGGCASYGASKPGIVANGNNVTGQKIQ
GQVKEVEQSSAAKALPEYLKQKLRARGILKEDAEHSNPTNSDAVSNTKLHGEKLPHGWVEAKDPHSGVSYYYNESSGKSQWERPSELSSDTQLSSAVSLPEDWMEAIDQT
SGLKYYYNMRTHITQWERPVASHQTTLTHSNDKVPGPWNDQTLEQSKCITCGSGMTLVQGSRYCNACTSGVSTSSTNGMWQDQSSEQNKCMGCGGWGLGLVQAWGYCNHC
TRILSLPSVSTCQPTTLVISRRQRTSSIVPIPPLKNLRQIGPNGNLQWEKVESEKVGSVPTVRMMNWIQWILALIQMLLVVAGLWV