; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh16G004430 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh16G004430
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Descriptionprotein SAWADEE HOMEODOMAIN HOMOLOG 2-like
Genome locationCmo_Chr16:2121098..2125558
RNA-Seq ExpressionCmoCh16G004430
SyntenyCmoCh16G004430
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003682 - chromatin binding (molecular function)
InterPro domainsIPR001356 - Homeobox domain
IPR009057 - Homeobox-like domain superfamily
IPR032001 - SAWADEE domain
IPR039276 - Protein SAWADEE HOMEODOMAIN HOMOLOG 1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8650545.1 hypothetical protein Csa_011086 [Cucumis sativus]9.4e-17184.9Show/hide
Query:  MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN
        MGRPPSNGGPAFRFTA EVAEM+AILQ HNNTMPAREVLVALA+KFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKT+KAPGKLAVSPVVQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN

Query:  VPQTTVVPAPAPVGSAKSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLSTFCAFFIKNKHEVLVRFSGFGSEEDEWVNIRRNIRPRSFPCESSE
        VPQT VVPAPAPVGSAK A ENP  EFEAKSGRDGAWYDVATFLSHRSVESGD             EVLVRFSGFGSEEDEWVNIRRNIRPRS PCESSE
Subjt:  VPQTTVVPAPAPVGSAKSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLSTFCAFFIKNKHEVLVRFSGFGSEEDEWVNIRRNIRPRSFPCESSE

Query:  CVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASTEPSKSGVDSVLLSGQR
        CVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAAS EPSKSG+DSVLLSGQR
Subjt:  CVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASTEPSKSGVDSVLLSGQR

Query:  INFEATQ------------KQNANINVHTQTNTQEGRSTETNSAPTTLNSGNSAASSAFSSGIVTSNSVSGLSADNVSDGKLLS
        INFE +Q              N +IN H QT+TQE R+TETN+APTT NS N A SSAFSSGIVT N+VS  SADNVSDGKLLS
Subjt:  INFEATQ------------KQNANINVHTQTNTQEGRSTETNSAPTTLNSGNSAASSAFSSGIVTSNSVSGLSADNVSDGKLLS

XP_022923145.1 protein SAWADEE HOMEODOMAIN HOMOLOG 2-like [Cucurbita moschata]6.1e-19496.77Show/hide
Query:  MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN
        MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN

Query:  VPQTTVVPAPAPVGSAKSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLSTFCAFFIKNKHEVLVRFSGFGSEEDEWVNIRRNIRPRSFPCESSE
        VPQTTVVPAPAPVGSAKSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDL            EVLVRFSGFGSEEDEWVNIRRNIRPRSFPCESSE
Subjt:  VPQTTVVPAPAPVGSAKSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLSTFCAFFIKNKHEVLVRFSGFGSEEDEWVNIRRNIRPRSFPCESSE

Query:  CVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASTEPSKSGVDSVLLSGQR
        CVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASTEPSKSGVDSVLLSGQR
Subjt:  CVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASTEPSKSGVDSVLLSGQR

Query:  INFEATQKQNANINVHTQTNTQEGRSTETNSAPTTLNSGNSAASSAFSSGIVTSNSVSGLSADNVSDGKLLS
        INFEATQKQNANINVHTQTNTQEGRSTETNSAPTTLNSGNSAASSAFSSGIVTSNSVSGLSADNVSDGKLLS
Subjt:  INFEATQKQNANINVHTQTNTQEGRSTETNSAPTTLNSGNSAASSAFSSGIVTSNSVSGLSADNVSDGKLLS

XP_022984988.1 protein SAWADEE HOMEODOMAIN HOMOLOG 2-like isoform X1 [Cucurbita maxima]1.1e-19296.24Show/hide
Query:  MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN
        MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN

Query:  VPQTTVVPAPAPVGSAKSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLSTFCAFFIKNKHEVLVRFSGFGSEEDEWVNIRRNIRPRSFPCESSE
        VPQTTVVPAPAPVGSAKSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDL            EVLVRFSGFGSEEDEWVNIRRNIRPRSFPCESSE
Subjt:  VPQTTVVPAPAPVGSAKSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLSTFCAFFIKNKHEVLVRFSGFGSEEDEWVNIRRNIRPRSFPCESSE

Query:  CVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASTEPSKSGVDSVLLSGQR
        CVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASTEPSKSGVDSVLLSGQR
Subjt:  CVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASTEPSKSGVDSVLLSGQR

Query:  INFEATQKQNANINVHTQTNTQEGRSTETNSAPTTLNSGNSAASSAFSSGIVTSNSVSGLSADNVSDGKLLS
        INFEATQK NANINVH QTNTQEGRSTETNSAPTTLNSGNSAASSAFSSGIVTSNSVSGLSADNVSDGKLLS
Subjt:  INFEATQKQNANINVHTQTNTQEGRSTETNSAPTTLNSGNSAASSAFSSGIVTSNSVSGLSADNVSDGKLLS

XP_023553578.1 protein SAWADEE HOMEODOMAIN HOMOLOG 2-like [Cucurbita pepo subsp. pepo]3.3e-19295.7Show/hide
Query:  MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN
        MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSP+VQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN

Query:  VPQTTVVPAPAPVGSAKSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLSTFCAFFIKNKHEVLVRFSGFGSEEDEWVNIRRNIRPRSFPCESSE
        VPQTTVVPAPAPVGSAKSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDL            EVLVRFSGFGSEEDEWVNIRRNIRPRSFPCESSE
Subjt:  VPQTTVVPAPAPVGSAKSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLSTFCAFFIKNKHEVLVRFSGFGSEEDEWVNIRRNIRPRSFPCESSE

Query:  CVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASTEPSKSGVDSVLLSGQR
        CVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASTEPSKSG+DSVLLSGQR
Subjt:  CVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASTEPSKSGVDSVLLSGQR

Query:  INFEATQKQNANINVHTQTNTQEGRSTETNSAPTTLNSGNSAASSAFSSGIVTSNSVSGLSADNVSDGKLLS
        INFEATQK NANINVH QTNTQEGRSTETNSAPTTLNSGNSAASSAFSSGIVTSNSVSGLSADNVSDGKLLS
Subjt:  INFEATQKQNANINVHTQTNTQEGRSTETNSAPTTLNSGNSAASSAFSSGIVTSNSVSGLSADNVSDGKLLS

XP_038878066.1 protein SAWADEE HOMEODOMAIN HOMOLOG 2-like isoform X1 [Benincasa hispida]1.5e-17184.05Show/hide
Query:  MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN
        MGRPPSNGGPAFRFTA EVAEM+AILQ HNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSP+VQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN

Query:  VPQTTVVPAPAPVGSAKSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLSTFCAFFIKNKHEVLVRFSGFGSEEDEWVNIRRNIRPRSFPCESSE
        VPQT VVPAP PVGSAK A ENP  EFEAKSGRDGAWYDVATFLSHRSVESGD             EVLVRFSGFGSEEDEWVNIRRNIRPRS PCESSE
Subjt:  VPQTTVVPAPAPVGSAKSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLSTFCAFFIKNKHEVLVRFSGFGSEEDEWVNIRRNIRPRSFPCESSE

Query:  CVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASTEPSKSGVDSVLLSGQR
        CVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAAS EPSKS +DSVLLSGQR
Subjt:  CVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASTEPSKSGVDSVLLSGQR

Query:  INFEATQK------------QNANINVHTQTNTQEGRSTETN-----------SAPTTLNSGNSAASSAFSSGIVTSNSVSGLSADNVSDGKLLS
        INFE TQK             NANINVH QTNTQE R+TETN           SAPTT NSGN A  SAFS GIVT N+VSG SADNVSDGKLLS
Subjt:  INFEATQK------------QNANINVHTQTNTQEGRSTETN-----------SAPTTLNSGNSAASSAFSSGIVTSNSVSGLSADNVSDGKLLS

TrEMBL top hitse value%identityAlignment
A0A1S3C274 protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X13.6e-16882.28Show/hide
Query:  MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN
        MGRPPSNGGPAFRFTA EVAEM+ ILQ HNNTMPAREVLVALA+KFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKT+KAPGKLAVSPVVQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN

Query:  VPQTTVVPAPAPVGSAKSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLSTFCAFFIKNKHEVLVRFSGFGSEEDEWVNIRRNIRPRSFPCESSE
        VPQT VVPAP PVG+AKSA ENP  EFEAKSGRDGAWYDVATFLSHRSVESGD             EVLVRFSGFGSEEDEWVNIRRNIRPRS PCESSE
Subjt:  VPQTTVVPAPAPVGSAKSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLSTFCAFFIKNKHEVLVRFSGFGSEEDEWVNIRRNIRPRSFPCESSE

Query:  CVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASTEPSKSGVDSVLLSGQR
        CVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAAS EPSKSG+DSVLLSGQR
Subjt:  CVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASTEPSKSGVDSVLLSGQR

Query:  INFEATQ------------KQNANINVHTQTNTQEGRSTETN-----------SAPTTLNSGNSAASSAFSSGIVTSNSVSGLSADNVSDGKLLS
        INFE  Q              N +IN H QT+TQE R+TETN           +APTT NS N A SSAFSSGIVT N+VSG SADNVSDGKLLS
Subjt:  INFEATQ------------KQNANINVHTQTNTQEGRSTETN-----------SAPTTLNSGNSAASSAFSSGIVTSNSVSGLSADNVSDGKLLS

A0A5A7UUV7 SAWADEE HOMEODOMAIN-like protein 2 isoform X13.6e-16882.28Show/hide
Query:  MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN
        MGRPPSNGGPAFRFTA EVAEM+ ILQ HNNTMPAREVLVALA+KFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKT+KAPGKLAVSPVVQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN

Query:  VPQTTVVPAPAPVGSAKSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLSTFCAFFIKNKHEVLVRFSGFGSEEDEWVNIRRNIRPRSFPCESSE
        VPQT VVPAP PVG+AKSA ENP  EFEAKSGRDGAWYDVATFLSHRSVESGD             EVLVRFSGFGSEEDEWVNIRRNIRPRS PCESSE
Subjt:  VPQTTVVPAPAPVGSAKSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLSTFCAFFIKNKHEVLVRFSGFGSEEDEWVNIRRNIRPRSFPCESSE

Query:  CVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASTEPSKSGVDSVLLSGQR
        CVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAAS EPSKSG+DSVLLSGQR
Subjt:  CVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASTEPSKSGVDSVLLSGQR

Query:  INFEATQ------------KQNANINVHTQTNTQEGRSTETN-----------SAPTTLNSGNSAASSAFSSGIVTSNSVSGLSADNVSDGKLLS
        INFE  Q              N +IN H QT+TQE R+TETN           +APTT NS N A SSAFSSGIVT N+VSG SADNVSDGKLLS
Subjt:  INFEATQ------------KQNANINVHTQTNTQEGRSTETN-----------SAPTTLNSGNSAASSAFSSGIVTSNSVSGLSADNVSDGKLLS

A0A5D3CH38 Protein SAWADEE HOMEODOMAIN-like protein 2 isoform X15.2e-16781.77Show/hide
Query:  MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN
        MGRPPSNGGPAFRFTA EVAEM+ ILQ HNNTMPAREVLVALA+KFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKT+KAPGKLAVSPVVQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN

Query:  VPQTTVVPAPAPVGSAKSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLSTFCAFFIKNKHEVLVRFSGFGSEEDEWVNIRRNIRPRSFPCESSE
        VPQT VVPAP PVG+AKSA ENP  EFEAKSGRDGAWYDVATFLSHRSVESGD             EVLVRFSGFGSEEDEWVNIRRNIRPRS PCESSE
Subjt:  VPQTTVVPAPAPVGSAKSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLSTFCAFFIKNKHEVLVRFSGFGSEEDEWVNIRRNIRPRSFPCESSE

Query:  CVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASTEPSKSGVDSVLLSGQR
        CVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDH+QSEEIVQLRKICRRPETDYRLQQLHAVNEAAS EPSKSG+DSVLLSGQR
Subjt:  CVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASTEPSKSGVDSVLLSGQR

Query:  INFEATQ------------KQNANINVHTQTNTQEGRSTETN-----------SAPTTLNSGNSAASSAFSSGIVTSNSVSGLSADNVSDGKLLS
        INFE  Q              N + N H QT+TQE R+TETN           +APTT NS N A SSAFSSGIVT N+VSG SADNVSDGKLLS
Subjt:  INFEATQ------------KQNANINVHTQTNTQEGRSTETN-----------SAPTTLNSGNSAASSAFSSGIVTSNSVSGLSADNVSDGKLLS

A0A6J1EAV6 protein SAWADEE HOMEODOMAIN HOMOLOG 2-like2.9e-19496.77Show/hide
Query:  MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN
        MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN

Query:  VPQTTVVPAPAPVGSAKSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLSTFCAFFIKNKHEVLVRFSGFGSEEDEWVNIRRNIRPRSFPCESSE
        VPQTTVVPAPAPVGSAKSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDL            EVLVRFSGFGSEEDEWVNIRRNIRPRSFPCESSE
Subjt:  VPQTTVVPAPAPVGSAKSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLSTFCAFFIKNKHEVLVRFSGFGSEEDEWVNIRRNIRPRSFPCESSE

Query:  CVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASTEPSKSGVDSVLLSGQR
        CVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASTEPSKSGVDSVLLSGQR
Subjt:  CVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASTEPSKSGVDSVLLSGQR

Query:  INFEATQKQNANINVHTQTNTQEGRSTETNSAPTTLNSGNSAASSAFSSGIVTSNSVSGLSADNVSDGKLLS
        INFEATQKQNANINVHTQTNTQEGRSTETNSAPTTLNSGNSAASSAFSSGIVTSNSVSGLSADNVSDGKLLS
Subjt:  INFEATQKQNANINVHTQTNTQEGRSTETNSAPTTLNSGNSAASSAFSSGIVTSNSVSGLSADNVSDGKLLS

A0A6J1JC16 protein SAWADEE HOMEODOMAIN HOMOLOG 2-like isoform X15.6e-19396.24Show/hide
Query:  MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN
        MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN

Query:  VPQTTVVPAPAPVGSAKSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLSTFCAFFIKNKHEVLVRFSGFGSEEDEWVNIRRNIRPRSFPCESSE
        VPQTTVVPAPAPVGSAKSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDL            EVLVRFSGFGSEEDEWVNIRRNIRPRSFPCESSE
Subjt:  VPQTTVVPAPAPVGSAKSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLSTFCAFFIKNKHEVLVRFSGFGSEEDEWVNIRRNIRPRSFPCESSE

Query:  CVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASTEPSKSGVDSVLLSGQR
        CVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASTEPSKSGVDSVLLSGQR
Subjt:  CVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASTEPSKSGVDSVLLSGQR

Query:  INFEATQKQNANINVHTQTNTQEGRSTETNSAPTTLNSGNSAASSAFSSGIVTSNSVSGLSADNVSDGKLLS
        INFEATQK NANINVH QTNTQEGRSTETNSAPTTLNSGNSAASSAFSSGIVTSNSVSGLSADNVSDGKLLS
Subjt:  INFEATQKQNANINVHTQTNTQEGRSTETNSAPTTLNSGNSAASSAFSSGIVTSNSVSGLSADNVSDGKLLS

SwissProt top hitse value%identityAlignment
Q8RWJ7 Protein SAWADEE HOMEODOMAIN HOMOLOG 25.7e-10264.03Show/hide
Query:  MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIE-STPVR
        MGRPPSNGGPAFRF  PEV EM+AIL  HN  MP R +L ALA+KFSES ERKGK+ VQ KQ+WNWFQNRRYA+RA+  KAPGKL VS + +++    +R
Subjt:  MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIE-STPVR

Query:  NVPQTTVVP------------APAPVGS-----AKSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLSTFCAFFIKNKHEVLVRFSGFGSEEDEW
        +V Q   VP             PAP GS      +S S+N  LEFEAKS RDGAWYDV  FL+HR++E GD             EV VRF+GF  EEDEW
Subjt:  NVPQTTVVP------------APAPVGS-----AKSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLSTFCAFFIKNKHEVLVRFSGFGSEEDEW

Query:  VNIRRNIRPRSFPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLH-AVNEA
        +N+++++R RS PCE+SECVAVL GDL+LCFQEGK+QALYFDA VLD QRRRHDVRGCRCRFLVRY HDQSEEIV LRKICRRPETDYRLQQLH AVN+ 
Subjt:  VNIRRNIRPRSFPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLH-AVNEA

Query:  AST
        A++
Subjt:  AST

Q9XI47 Protein SAWADEE HOMEODOMAIN HOMOLOG 14.6e-4341.38Show/hide
Query:  FTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNR-RYAIRAKTTKAPGKLAVSPVVQIE-----STPVRNVPQTTVV
        FT  E+ +M+ + +   +    ++    +A  FS SV R GK ++  KQV  WFQ + ++  + K+   P     SP +QI      S+   N    T V
Subjt:  FTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNR-RYAIRAKTTKAPGKLAVSPVVQIE-----STPVRNVPQTTVV

Query:  PAPAPVGSAK-SASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLSTFCAFFIKNKHEVLVRFSGFGSEEDEWVNIRRNIRPRSFPCESSECVAVLP
             V + K  AS+   L FEAKS RD AWYDV++FL++R + +G+L            EV VRFSGF +  DEWVN++ ++R RS P E SEC  V  
Subjt:  PAPAPVGSAK-SASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLSTFCAFFIKNKHEVLVRFSGFGSEEDEWVNIRRNIRPRSFPCESSECVAVLP

Query:  GDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPE
        GDL+LCFQE ++QALY D HVL+ +R  HD   C C FLVRY+ D +EE + L +ICRRPE
Subjt:  GDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPE

Arabidopsis top hitse value%identityAlignment
AT1G15215.2 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors3.2e-4441.38Show/hide
Query:  FTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNR-RYAIRAKTTKAPGKLAVSPVVQIE-----STPVRNVPQTTVV
        FT  E+ +M+ + +   +    ++    +A  FS SV R GK ++  KQV  WFQ + ++  + K+   P     SP +QI      S+   N    T V
Subjt:  FTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNR-RYAIRAKTTKAPGKLAVSPVVQIE-----STPVRNVPQTTVV

Query:  PAPAPVGSAK-SASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLSTFCAFFIKNKHEVLVRFSGFGSEEDEWVNIRRNIRPRSFPCESSECVAVLP
             V + K  AS+   L FEAKS RD AWYDV++FL++R + +G+L            EV VRFSGF +  DEWVN++ ++R RS P E SEC  V  
Subjt:  PAPAPVGSAK-SASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLSTFCAFFIKNKHEVLVRFSGFGSEEDEWVNIRRNIRPRSFPCESSECVAVLP

Query:  GDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPE
        GDL+LCFQE ++QALY D HVL+ +R  HD   C C FLVRY+ D +EE + L +ICRRPE
Subjt:  GDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPE

AT1G15215.3 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors2.4e-3940.32Show/hide
Query:  FTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNR-RYAIRAKTTKAPGKLAVSPVVQIE-----STPVRNVPQTTVV
        FT  E+ +M+ + +   +    ++    +A  FS SV R GK ++  KQV  WFQ + ++  + K+   P     SP +QI      S+   N    T V
Subjt:  FTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNR-RYAIRAKTTKAPGKLAVSPVVQIE-----STPVRNVPQTTVV

Query:  PAPAPVGSAK-SASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLSTFCAFFIKNKHEVLVRFSGFGSEEDEWVNIRRNIRPRSFPCESSECVAVLP
             V + K  AS+   L FEAKS RD AWYDV++FL++R + +G+L            EV VRFSGF +  DEWVN++ ++R RS P E SEC  V  
Subjt:  PAPAPVGSAK-SASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLSTFCAFFIKNKHEVLVRFSGFGSEEDEWVNIRRNIRPRSFPCESSECVAVLP

Query:  GDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE
        GDL+LCFQE ++QALY D HVL+ +R  HD   C C FLVRY+ D +E
Subjt:  GDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE

AT3G18380.1 sequence-specific DNA binding transcription factors;sequence-specific DNA binding4.0e-10364.03Show/hide
Query:  MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIE-STPVR
        MGRPPSNGGPAFRF  PEV EM+AIL  HN  MP R +L ALA+KFSES ERKGK+ VQ KQ+WNWFQNRRYA+RA+  KAPGKL VS + +++    +R
Subjt:  MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIE-STPVR

Query:  NVPQTTVVP------------APAPVGS-----AKSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLSTFCAFFIKNKHEVLVRFSGFGSEEDEW
        +V Q   VP             PAP GS      +S S+N  LEFEAKS RDGAWYDV  FL+HR++E GD             EV VRF+GF  EEDEW
Subjt:  NVPQTTVVP------------APAPVGS-----AKSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLSTFCAFFIKNKHEVLVRFSGFGSEEDEW

Query:  VNIRRNIRPRSFPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLH-AVNEA
        +N+++++R RS PCE+SECVAVL GDL+LCFQEGK+QALYFDA VLD QRRRHDVRGCRCRFLVRY HDQSEEIV LRKICRRPETDYRLQQLH AVN+ 
Subjt:  VNIRRNIRPRSFPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLH-AVNEA

Query:  AST
        A++
Subjt:  AST

AT3G18380.2 sequence-specific DNA binding transcription factors;sequence-specific DNA binding9.9e-10263.82Show/hide
Query:  MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIE-STPVR
        MGRPPSNGGPAFRF  PEV EM+AIL  HN  MP R +L ALA+KFSES ERKGK+ VQ KQ+WNWFQNRRYA+RA+  KAPGKL VS + +++    +R
Subjt:  MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIE-STPVR

Query:  NVPQTTVVP------------APAPVGS-----AKSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLSTFCAFFIKNKHEVLVRFSGFGSEEDEW
        +V Q   VP             PAP GS      +S S+N  LEFEAKS RDGAWYDV  FL+HR++E GD             EV VRF+GF  EEDEW
Subjt:  NVPQTTVVP------------APAPVGS-----AKSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLSTFCAFFIKNKHEVLVRFSGFGSEEDEW

Query:  VNIRRNIRPRSFPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE-EIVQLRKICRRPETDYRLQQLH-AVNE
        +N+++++R RS PCE+SECVAVL GDL+LCFQEGK+QALYFDA VLD QRRRHDVRGCRCRFLVRY HDQSE EIV LRKICRRPETDYRLQQLH AVN+
Subjt:  VNIRRNIRPRSFPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE-EIVQLRKICRRPETDYRLQQLH-AVNE

Query:  AAST
         A++
Subjt:  AAST

AT3G18380.3 sequence-specific DNA binding transcription factors;sequence-specific DNA binding2.0e-10264.12Show/hide
Query:  MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIE-STPVR
        MGRPPSNGGPAFRF  PEV EM+AIL  HN  MP R +L ALA+KFSES ERKGK+ VQ KQ+WNWFQNRRYA+RA+  KAPGKL VS + +++    +R
Subjt:  MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIE-STPVR

Query:  ------NVPQTT--------VVPAPAPVGSAKSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLSTFCAFFIKNKHEVLVRFSGFGSEEDEWVNI
              +VP+TT        + PAP+  G  +S S+N  LEFEAKS RDGAWYDV  FL+HR++E GD             EV VRF+GF  EEDEW+N+
Subjt:  ------NVPQTT--------VVPAPAPVGSAKSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLSTFCAFFIKNKHEVLVRFSGFGSEEDEWVNI

Query:  RRNIRPRSFPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE-EIVQLRKICRRPETDYRLQQLH-AVNEAAS
        ++++R RS PCE+SECVAVL GDL+LCFQEGK+QALYFDA VLD QRRRHDVRGCRCRFLVRY HDQSE EIV LRKICRRPETDYRLQQLH AVN+ A+
Subjt:  RRNIRPRSFPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE-EIVQLRKICRRPETDYRLQQLH-AVNEAAS

Query:  T
        +
Subjt:  T


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTCGGCCTCCCAGCAATGGAGGCCCTGCCTTCCGTTTCACGGCTCCCGAGGTTGCGGAGATGGACGCTATATTGCAAGCACACAATAATACCATGCCAGCTCGGGA
AGTTCTTGTTGCCCTTGCTGAGAAGTTCAGTGAATCGGTAGAACGGAAAGGGAAGATTGCTGTGCAAATGAAACAAGTTTGGAATTGGTTCCAGAATAGACGATATGCTA
TAAGAGCGAAGACAACCAAGGCTCCTGGAAAGTTAGCTGTCTCTCCAGTTGTCCAAATCGAGTCAACTCCCGTGAGAAATGTGCCTCAAACCACAGTTGTTCCTGCTCCT
GCACCAGTAGGATCTGCAAAGAGTGCTTCAGAAAATCCATCGTTGGAGTTCGAAGCTAAGTCTGGGAGAGATGGTGCATGGTATGATGTTGCTACCTTTCTATCCCATAG
ATCTGTGGAAAGCGGTGACCTGTCTACTTTCTGTGCGTTTTTCATTAAAAACAAACATGAAGTACTAGTCAGATTTTCTGGTTTTGGATCGGAGGAGGACGAGTGGGTTA
ATATCCGAAGGAACATTAGACCTCGTTCTTTTCCTTGTGAATCATCAGAATGCGTGGCAGTTCTTCCGGGTGATCTCATCTTATGCTTTCAGGAGGGTAAAGAGCAGGCA
CTTTACTTTGATGCCCATGTGCTTGATACACAAAGAAGAAGACATGACGTTCGAGGTTGTCGCTGCAGGTTTTTGGTCCGTTATGATCACGATCAGTCTGAGGAAATCGT
CCAGTTGAGAAAGATTTGTCGTCGGCCCGAGACTGATTACAGGTTGCAACAGCTTCATGCTGTAAATGAAGCAGCATCCACTGAGCCCTCAAAGTCTGGCGTGGATTCTG
TACTGCTCAGCGGCCAGAGGATAAATTTTGAGGCAACACAAAAGCAAAATGCCAATATAAACGTCCATACCCAAACTAATACTCAGGAGGGAAGGAGTACTGAAACTAAC
AGTGCTCCAACCACACTCAACTCTGGTAATTCTGCAGCTAGCTCTGCATTCTCGAGTGGTATCGTGACGTCGAACTCTGTTTCTGGATTGTCGGCTGACAATGTGTCTGA
TGGGAAGTTACTTAGCTGA
mRNA sequenceShow/hide mRNA sequence
GAATTTTGGGCTTGAATGTTCAATCTTTATTGCATAATATATATATGTGTATATATATATATACATATACATTCATTTATATCCGCTCAAAGATTTCCGAAAATAACGAA
TTTCATTTCTGTAGACGGGATCTTCAATTCTGCTCTCTTTTTCCCGGCACTTTTTTTTTTCTTCTTCTCAGTTTCTCTCTCTGCTTCTGCGATAGCGACGCCGAAATCAG
AGAAACCACAGCGGAAGGTTTAGCTTATGGGTCGGCCTCCCAGCAATGGAGGCCCTGCCTTCCGTTTCACGGCTCCCGAGGTTGCGGAGATGGACGCTATATTGCAAGCA
CACAATAATACCATGCCAGCTCGGGAAGTTCTTGTTGCCCTTGCTGAGAAGTTCAGTGAATCGGTAGAACGGAAAGGGAAGATTGCTGTGCAAATGAAACAAGTTTGGAA
TTGGTTCCAGAATAGACGATATGCTATAAGAGCGAAGACAACCAAGGCTCCTGGAAAGTTAGCTGTCTCTCCAGTTGTCCAAATCGAGTCAACTCCCGTGAGAAATGTGC
CTCAAACCACAGTTGTTCCTGCTCCTGCACCAGTAGGATCTGCAAAGAGTGCTTCAGAAAATCCATCGTTGGAGTTCGAAGCTAAGTCTGGGAGAGATGGTGCATGGTAT
GATGTTGCTACCTTTCTATCCCATAGATCTGTGGAAAGCGGTGACCTGTCTACTTTCTGTGCGTTTTTCATTAAAAACAAACATGAAGTACTAGTCAGATTTTCTGGTTT
TGGATCGGAGGAGGACGAGTGGGTTAATATCCGAAGGAACATTAGACCTCGTTCTTTTCCTTGTGAATCATCAGAATGCGTGGCAGTTCTTCCGGGTGATCTCATCTTAT
GCTTTCAGGAGGGTAAAGAGCAGGCACTTTACTTTGATGCCCATGTGCTTGATACACAAAGAAGAAGACATGACGTTCGAGGTTGTCGCTGCAGGTTTTTGGTCCGTTAT
GATCACGATCAGTCTGAGGAAATCGTCCAGTTGAGAAAGATTTGTCGTCGGCCCGAGACTGATTACAGGTTGCAACAGCTTCATGCTGTAAATGAAGCAGCATCCACTGA
GCCCTCAAAGTCTGGCGTGGATTCTGTACTGCTCAGCGGCCAGAGGATAAATTTTGAGGCAACACAAAAGCAAAATGCCAATATAAACGTCCATACCCAAACTAATACTC
AGGAGGGAAGGAGTACTGAAACTAACAGTGCTCCAACCACACTCAACTCTGGTAATTCTGCAGCTAGCTCTGCATTCTCGAGTGGTATCGTGACGTCGAACTCTGTTTCT
GGATTGTCGGCTGACAATGTGTCTGATGGGAAGTTACTTAGCTGACTATGAAAACGAATTTCTCAATCAGTCTAATTTTAACTGAACGTATCAATTTAAAATTTTGCCTG
ACTCGTTTATTTAGGATGAGTAAATACGTAGCGAAGTCTGTTTTTTGCCACATGTTTCGAAGTTTTAGGTTCGAATCCACTCGTTGTGAATGCTGAATGTTCTTGACGGA
TAAAAAATGCAGGAGTCACGCCTCGAGTCTAACAGGTCAGAGGCATCATCTCTCTTGTTTTACTTCTTCTCTGCTTATCTAGACAGGGTTCTACACTTAGC
Protein sequenceShow/hide protein sequence
MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRNVPQTTVVPAP
APVGSAKSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLSTFCAFFIKNKHEVLVRFSGFGSEEDEWVNIRRNIRPRSFPCESSECVAVLPGDLILCFQEGKEQA
LYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASTEPSKSGVDSVLLSGQRINFEATQKQNANINVHTQTNTQEGRSTETN
SAPTTLNSGNSAASSAFSSGIVTSNSVSGLSADNVSDGKLLS