; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc06g0160801 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc06g0160801
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Descriptionprotein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X1
Genome locationCMiso1.1chr06:9156437..9162419
RNA-Seq ExpressionCmc06g0160801
SyntenyCmc06g0160801
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003682 - chromatin binding (molecular function)
InterPro domainsIPR001356 - Homeobox domain
IPR009057 - Homeobox-like domain superfamily
IPR032001 - SAWADEE domain
IPR039276 - Protein SAWADEE HOMEODOMAIN HOMOLOG 1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8650545.1 hypothetical protein Csa_011086 [Cucumis sativus]2.6e-20095.55Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRN
        MGRPPSNGGPAFRFTASEVAEME ILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRN

Query:  VPQTVVVPAPTPVGTAKSAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
        VPQTVVVPAP PVG+AK APENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
Subjt:  VPQTVVVPAPTPVGTAKSAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC

Query:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETPQNPLSK
        FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFET QNPLSK
Subjt:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETPQNPLSK

Query:  DAALVIPNANPHINAHAQTSTQEARNTETNTAPITFSSGNHNAPTTFNSANLAGSSAFSSGIVTNTVSGGSADNVSDGKLLS
        DAALVIPNANPHINAHAQTSTQEARNTETNT           APTTFNSANLAGSSAFSSGIVTNTVS GSADNVSDGKLLS
Subjt:  DAALVIPNANPHINAHAQTSTQEARNTETNTAPITFSSGNHNAPTTFNSANLAGSSAFSSGIVTNTVSGGSADNVSDGKLLS

TYK11257.1 protein SAWADEE HOMEODOMAIN-like protein 2 isoform X1 [Cucumis melo var. makuwa]6.6e-21299.48Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRN
        MGRPPSNGGPAFRFTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRN

Query:  VPQTVVVPAPTPVGTAKSAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
        VPQTVVVPAPTPVGTAKSAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
Subjt:  VPQTVVVPAPTPVGTAKSAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC

Query:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETPQNPLSK
        FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDH+QSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETPQNPLSK
Subjt:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETPQNPLSK

Query:  DAALVIPNANPHINAHAQTSTQEARNTETNTAPITFSSGNHNAPTTFNSANLAGSSAFSSGIVTNTVSGGSADNVSDGKLLS
        DAALVIPNANPH NAHAQTSTQEARNTETNTAPITFSSGNHNAPTTFNSANLAGSSAFSSGIVTNTVSGGSADNVSDGKLLS
Subjt:  DAALVIPNANPHINAHAQTSTQEARNTETNTAPITFSSGNHNAPTTFNSANLAGSSAFSSGIVTNTVSGGSADNVSDGKLLS

XP_008456010.1 PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X1 [Cucumis melo]4.6e-213100Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRN
        MGRPPSNGGPAFRFTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRN

Query:  VPQTVVVPAPTPVGTAKSAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
        VPQTVVVPAPTPVGTAKSAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
Subjt:  VPQTVVVPAPTPVGTAKSAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC

Query:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETPQNPLSK
        FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETPQNPLSK
Subjt:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETPQNPLSK

Query:  DAALVIPNANPHINAHAQTSTQEARNTETNTAPITFSSGNHNAPTTFNSANLAGSSAFSSGIVTNTVSGGSADNVSDGKLLS
        DAALVIPNANPHINAHAQTSTQEARNTETNTAPITFSSGNHNAPTTFNSANLAGSSAFSSGIVTNTVSGGSADNVSDGKLLS
Subjt:  DAALVIPNANPHINAHAQTSTQEARNTETNTAPITFSSGNHNAPTTFNSANLAGSSAFSSGIVTNTVSGGSADNVSDGKLLS

XP_008456011.1 PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X2 [Cucumis melo]3.4e-200100Show/hide
Query:  METILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRNVPQTVVVPAPTPVGTAKSAPE
        METILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRNVPQTVVVPAPTPVGTAKSAPE
Subjt:  METILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRNVPQTVVVPAPTPVGTAKSAPE

Query:  NPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRR
        NPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRR
Subjt:  NPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRR

Query:  RHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETPQNPLSKDAALVIPNANPHINAHAQTST
        RHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETPQNPLSKDAALVIPNANPHINAHAQTST
Subjt:  RHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETPQNPLSKDAALVIPNANPHINAHAQTST

Query:  QEARNTETNTAPITFSSGNHNAPTTFNSANLAGSSAFSSGIVTNTVSGGSADNVSDGKLLS
        QEARNTETNTAPITFSSGNHNAPTTFNSANLAGSSAFSSGIVTNTVSGGSADNVSDGKLLS
Subjt:  QEARNTETNTAPITFSSGNHNAPTTFNSANLAGSSAFSSGIVTNTVSGGSADNVSDGKLLS

XP_038878066.1 protein SAWADEE HOMEODOMAIN HOMOLOG 2-like isoform X1 [Benincasa hispida]2.4e-19892.93Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRN
        MGRPPSNGGPAFRFTASEVAEME ILQGHNNTMPAREVLVALA+KFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKT+KAPGKLAVSP+VQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRN

Query:  VPQTVVVPAPTPVGTAKSAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
        VPQT+VVPAP PVG+AK APENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
Subjt:  VPQTVVVPAPTPVGTAKSAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC

Query:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETPQNPLSK
        FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKS MDSVLLSGQRINFET Q PL+K
Subjt:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETPQNPLSK

Query:  DAALVIPNANPHINAHAQTSTQEARNTETNTAPITFSSGNHNAPTTFNSANLAGSSAFSSGIVTNTVSGGSADNVSDGKLLS
        D  LVIPNAN +IN HAQT+TQEARNTETN+AP TF+SG H+APTTFNS N AG SAFS GIVTNTVSGGSADNVSDGKLLS
Subjt:  DAALVIPNANPHINAHAQTSTQEARNTETNTAPITFSSGNHNAPTTFNSANLAGSSAFSSGIVTNTVSGGSADNVSDGKLLS

TrEMBL top hitse value%identityAlignment
A0A0A0LC67 SAWADEE domain-containing protein1.1e-19393.98Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRN
        MGRPPSNGGPAFRFTASEVAEME ILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMK      QNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRN

Query:  VPQTVVVPAPTPVGTAKSAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
        VPQTVVVPAP PVG+AK APENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
Subjt:  VPQTVVVPAPTPVGTAKSAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC

Query:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETPQNPLSK
        FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFET QNPLSK
Subjt:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETPQNPLSK

Query:  DAALVIPNANPHINAHAQTSTQEARNTETNTAPITFSSGNHNAPTTFNSANLAGSSAFSSGIVTNTVSGGSADNVSDGKLLS
        DAALVIPNANPHINAHAQTSTQEARNTETNT           APTTFNSANLAGSSAFSSGIVTNTVS GSADNVSDGKLLS
Subjt:  DAALVIPNANPHINAHAQTSTQEARNTETNTAPITFSSGNHNAPTTFNSANLAGSSAFSSGIVTNTVSGGSADNVSDGKLLS

A0A1S3C274 protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X12.2e-213100Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRN
        MGRPPSNGGPAFRFTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRN

Query:  VPQTVVVPAPTPVGTAKSAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
        VPQTVVVPAPTPVGTAKSAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
Subjt:  VPQTVVVPAPTPVGTAKSAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC

Query:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETPQNPLSK
        FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETPQNPLSK
Subjt:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETPQNPLSK

Query:  DAALVIPNANPHINAHAQTSTQEARNTETNTAPITFSSGNHNAPTTFNSANLAGSSAFSSGIVTNTVSGGSADNVSDGKLLS
        DAALVIPNANPHINAHAQTSTQEARNTETNTAPITFSSGNHNAPTTFNSANLAGSSAFSSGIVTNTVSGGSADNVSDGKLLS
Subjt:  DAALVIPNANPHINAHAQTSTQEARNTETNTAPITFSSGNHNAPTTFNSANLAGSSAFSSGIVTNTVSGGSADNVSDGKLLS

A0A1S3C2X6 protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X21.7e-200100Show/hide
Query:  METILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRNVPQTVVVPAPTPVGTAKSAPE
        METILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRNVPQTVVVPAPTPVGTAKSAPE
Subjt:  METILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRNVPQTVVVPAPTPVGTAKSAPE

Query:  NPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRR
        NPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRR
Subjt:  NPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRR

Query:  RHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETPQNPLSKDAALVIPNANPHINAHAQTST
        RHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETPQNPLSKDAALVIPNANPHINAHAQTST
Subjt:  RHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETPQNPLSKDAALVIPNANPHINAHAQTST

Query:  QEARNTETNTAPITFSSGNHNAPTTFNSANLAGSSAFSSGIVTNTVSGGSADNVSDGKLLS
        QEARNTETNTAPITFSSGNHNAPTTFNSANLAGSSAFSSGIVTNTVSGGSADNVSDGKLLS
Subjt:  QEARNTETNTAPITFSSGNHNAPTTFNSANLAGSSAFSSGIVTNTVSGGSADNVSDGKLLS

A0A5A7UUV7 SAWADEE HOMEODOMAIN-like protein 2 isoform X12.2e-213100Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRN
        MGRPPSNGGPAFRFTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRN

Query:  VPQTVVVPAPTPVGTAKSAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
        VPQTVVVPAPTPVGTAKSAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
Subjt:  VPQTVVVPAPTPVGTAKSAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC

Query:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETPQNPLSK
        FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETPQNPLSK
Subjt:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETPQNPLSK

Query:  DAALVIPNANPHINAHAQTSTQEARNTETNTAPITFSSGNHNAPTTFNSANLAGSSAFSSGIVTNTVSGGSADNVSDGKLLS
        DAALVIPNANPHINAHAQTSTQEARNTETNTAPITFSSGNHNAPTTFNSANLAGSSAFSSGIVTNTVSGGSADNVSDGKLLS
Subjt:  DAALVIPNANPHINAHAQTSTQEARNTETNTAPITFSSGNHNAPTTFNSANLAGSSAFSSGIVTNTVSGGSADNVSDGKLLS

A0A5D3CH38 Protein SAWADEE HOMEODOMAIN-like protein 2 isoform X13.2e-21299.48Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRN
        MGRPPSNGGPAFRFTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRN

Query:  VPQTVVVPAPTPVGTAKSAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
        VPQTVVVPAPTPVGTAKSAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
Subjt:  VPQTVVVPAPTPVGTAKSAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC

Query:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETPQNPLSK
        FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDH+QSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETPQNPLSK
Subjt:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETPQNPLSK

Query:  DAALVIPNANPHINAHAQTSTQEARNTETNTAPITFSSGNHNAPTTFNSANLAGSSAFSSGIVTNTVSGGSADNVSDGKLLS
        DAALVIPNANPH NAHAQTSTQEARNTETNTAPITFSSGNHNAPTTFNSANLAGSSAFSSGIVTNTVSGGSADNVSDGKLLS
Subjt:  DAALVIPNANPHINAHAQTSTQEARNTETNTAPITFSSGNHNAPTTFNSANLAGSSAFSSGIVTNTVSGGSADNVSDGKLLS

SwissProt top hitse value%identityAlignment
Q8RWJ7 Protein SAWADEE HOMEODOMAIN HOMOLOG 21.8e-10358Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIE-STPVR
        MGRPPSNGGPAFRF   EV EME IL  HN  MP R +L ALADKFSES ERKGK+ VQ KQ+WNWFQNRRYA+RA+ +KAPGKL VS + +++    +R
Subjt:  MGRPPSNGGPAFRFTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIE-STPVR

Query:  NVPQTVVVPAPTPV-----------------GTAKSAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSL
        +V Q + VP  T +                 G  +S  +N   EFEAKS RDGAWYDV  FL+HR++E GDPEV VRF+GF  EEDEW+N+++++R RSL
Subjt:  NVPQTVVVPAPTPV-----------------GTAKSAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSL

Query:  PCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLH-AVNEAA-SIEPSKSGMD
        PCE+SECVAVL GDL+LCFQEGK+QALYFDA VLD QRRRHDVRGCRCRFLVRY HDQSEEIV LRKICRRPETDYRLQQLH AVN+ A S +     +D
Subjt:  PCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLH-AVNEAA-SIEPSKSGMD

Query:  SVLLSGQRINFETPQNPLSKDAALVIPNA-NPHINAHAQTSTQEARNTET
        +          +TP +       +V P + +P ++A   T  Q + N  T
Subjt:  SVLLSGQRINFETPQNPLSKDAALVIPNA-NPHINAHAQTSTQEARNTET

Q9XI47 Protein SAWADEE HOMEODOMAIN HOMOLOG 13.8e-4543.55Show/hide
Query:  FTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIE-----STPVRNVPQTVVVP
        FT SE+ +ME + +   +    ++    +A  FS SV R GK ++  KQV  WFQ +        S+   K   SP +QI      S+   N      V 
Subjt:  FTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIE-----STPVRNVPQTVVVP

Query:  APTPVGTAK-SAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKEQ
          T V T K  A +     FEAKS RD AWYDV++FL++R + +G+ EV VRFSGF +  DEWVN++ ++R RS+P E SEC  V  GDL+LCFQE ++Q
Subjt:  APTPVGTAK-SAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKEQ

Query:  ALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPE
        ALY D HVL+ +R  HD   C C FLVRY+ D +EE + L +ICRRPE
Subjt:  ALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPE

Arabidopsis top hitse value%identityAlignment
AT1G15215.2 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors2.7e-4643.55Show/hide
Query:  FTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIE-----STPVRNVPQTVVVP
        FT SE+ +ME + +   +    ++    +A  FS SV R GK ++  KQV  WFQ +        S+   K   SP +QI      S+   N      V 
Subjt:  FTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIE-----STPVRNVPQTVVVP

Query:  APTPVGTAK-SAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKEQ
          T V T K  A +     FEAKS RD AWYDV++FL++R + +G+ EV VRFSGF +  DEWVN++ ++R RS+P E SEC  V  GDL+LCFQE ++Q
Subjt:  APTPVGTAK-SAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKEQ

Query:  ALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPE
        ALY D HVL+ +R  HD   C C FLVRY+ D +EE + L +ICRRPE
Subjt:  ALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPE

AT1G15215.3 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors1.5e-4142.55Show/hide
Query:  FTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIE-----STPVRNVPQTVVVP
        FT SE+ +ME + +   +    ++    +A  FS SV R GK ++  KQV  WFQ +        S+   K   SP +QI      S+   N      V 
Subjt:  FTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIE-----STPVRNVPQTVVVP

Query:  APTPVGTAK-SAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKEQ
          T V T K  A +     FEAKS RD AWYDV++FL++R + +G+ EV VRFSGF +  DEWVN++ ++R RS+P E SEC  V  GDL+LCFQE ++Q
Subjt:  APTPVGTAK-SAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKEQ

Query:  ALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE
        ALY D HVL+ +R  HD   C C FLVRY+ D +E
Subjt:  ALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE

AT3G18380.1 sequence-specific DNA binding transcription factors;sequence-specific DNA binding1.3e-10458Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIE-STPVR
        MGRPPSNGGPAFRF   EV EME IL  HN  MP R +L ALADKFSES ERKGK+ VQ KQ+WNWFQNRRYA+RA+ +KAPGKL VS + +++    +R
Subjt:  MGRPPSNGGPAFRFTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIE-STPVR

Query:  NVPQTVVVPAPTPV-----------------GTAKSAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSL
        +V Q + VP  T +                 G  +S  +N   EFEAKS RDGAWYDV  FL+HR++E GDPEV VRF+GF  EEDEW+N+++++R RSL
Subjt:  NVPQTVVVPAPTPV-----------------GTAKSAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSL

Query:  PCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLH-AVNEAA-SIEPSKSGMD
        PCE+SECVAVL GDL+LCFQEGK+QALYFDA VLD QRRRHDVRGCRCRFLVRY HDQSEEIV LRKICRRPETDYRLQQLH AVN+ A S +     +D
Subjt:  PCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLH-AVNEAA-SIEPSKSGMD

Query:  SVLLSGQRINFETPQNPLSKDAALVIPNA-NPHINAHAQTSTQEARNTET
        +          +TP +       +V P + +P ++A   T  Q + N  T
Subjt:  SVLLSGQRINFETPQNPLSKDAALVIPNA-NPHINAHAQTSTQEARNTET

AT3G18380.2 sequence-specific DNA binding transcription factors;sequence-specific DNA binding3.2e-10357.83Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIE-STPVR
        MGRPPSNGGPAFRF   EV EME IL  HN  MP R +L ALADKFSES ERKGK+ VQ KQ+WNWFQNRRYA+RA+ +KAPGKL VS + +++    +R
Subjt:  MGRPPSNGGPAFRFTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIE-STPVR

Query:  NVPQTVVVPAPTPV-----------------GTAKSAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSL
        +V Q + VP  T +                 G  +S  +N   EFEAKS RDGAWYDV  FL+HR++E GDPEV VRF+GF  EEDEW+N+++++R RSL
Subjt:  NVPQTVVVPAPTPV-----------------GTAKSAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSL

Query:  PCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE-EIVQLRKICRRPETDYRLQQLH-AVNEAA-SIEPSKSGM
        PCE+SECVAVL GDL+LCFQEGK+QALYFDA VLD QRRRHDVRGCRCRFLVRY HDQSE EIV LRKICRRPETDYRLQQLH AVN+ A S +     +
Subjt:  PCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE-EIVQLRKICRRPETDYRLQQLH-AVNEAA-SIEPSKSGM

Query:  DSVLLSGQRINFETPQNPLSKDAALVIPNA-NPHINAHAQTSTQEARNTET
        D+          +TP +       +V P + +P ++A   T  Q + N  T
Subjt:  DSVLLSGQRINFETPQNPLSKDAALVIPNA-NPHINAHAQTSTQEARNTET

AT3G18380.3 sequence-specific DNA binding transcription factors;sequence-specific DNA binding2.9e-10458.62Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIE-STPVR
        MGRPPSNGGPAFRF   EV EME IL  HN  MP R +L ALADKFSES ERKGK+ VQ KQ+WNWFQNRRYA+RA+ +KAPGKL VS + +++    +R
Subjt:  MGRPPSNGGPAFRFTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIE-STPVR

Query:  NVPQTVVV--------------PAPTPVGTAKSAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCE
        +V Q + V              PAP+  G  +S  +N   EFEAKS RDGAWYDV  FL+HR++E GDPEV VRF+GF  EEDEW+N+++++R RSLPCE
Subjt:  NVPQTVVV--------------PAPTPVGTAKSAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCE

Query:  SSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE-EIVQLRKICRRPETDYRLQQLH-AVNEAA-SIEPSKSGMDSV
        +SECVAVL GDL+LCFQEGK+QALYFDA VLD QRRRHDVRGCRCRFLVRY HDQSE EIV LRKICRRPETDYRLQQLH AVN+ A S +     +D+ 
Subjt:  SSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE-EIVQLRKICRRPETDYRLQQLH-AVNEAA-SIEPSKSGMDSV

Query:  LLSGQRINFETPQNPLSKDAALVIPNA-NPHINAHAQTSTQEARNTET
                 +TP +       +V P + +P ++A   T  Q + N  T
Subjt:  LLSGQRINFETPQNPLSKDAALVIPNA-NPHINAHAQTSTQEARNTET


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTCGGCCTCCCAGCAATGGAGGCCCTGCCTTCCGCTTCACGGCTTCCGAGGTCGCGGAGATGGAAACTATATTGCAAGGACACAATAATACCATGCCAGCTCGGGA
AGTTCTTGTTGCACTTGCTGATAAGTTCAGTGAATCAGTAGAACGGAAAGGGAAGATTGCTGTGCAAATGAAGCAAGTTTGGAATTGGTTCCAGAATAGACGATATGCTA
TCAGAGCAAAGACATCCAAGGCTCCTGGAAAGTTAGCTGTCTCTCCAGTTGTCCAAATTGAGTCAACCCCTGTGAGAAATGTGCCTCAAACCGTAGTTGTTCCTGCTCCC
ACACCAGTAGGCACCGCAAAGAGTGCTCCAGAAAATCCATTGTCGGAATTTGAAGCTAAATCTGGGAGGGATGGTGCATGGTATGACGTTGCTACCTTTTTATCGCATAG
ATCTGTGGAAAGTGGTGACCCGGAAGTACTAGTAAGATTTTCTGGTTTTGGATCCGAGGAGGATGAGTGGGTTAATATACGAAGGAACATTAGACCTCGTTCTCTACCTT
GTGAATCATCAGAATGTGTGGCGGTTCTTCCAGGCGATCTCATCTTATGCTTTCAGGAGGGAAAAGAGCAGGCACTTTACTTTGATGCCCACGTGCTTGATACACAACGA
AGAAGGCATGATGTACGAGGTTGTCGATGCAGGTTTTTGGTTCGTTATGATCATGATCAATCTGAGGAAATTGTTCAGTTGAGAAAGATTTGCCGTCGACCTGAGACTGA
TTACCGGTTGCAACAGCTTCACGCCGTAAATGAAGCAGCATCCATTGAGCCCTCAAAGTCTGGCATGGATTCTGTACTGCTCAGTGGTCAGAGGATAAATTTCGAAACAC
CACAAAATCCACTTAGCAAGGATGCAGCCTTGGTTATACCAAATGCAAATCCCCATATAAATGCCCATGCCCAAACTAGTACTCAGGAAGCAAGGAATACTGAAACTAAC
ACTGCTCCGATCACATTCAGTTCCGGTAATCACAATGCTCCAACCACATTCAACTCTGCTAATCTCGCAGGTAGCTCTGCATTCTCAAGTGGTATCGTGACAAACACTGT
TTCTGGTGGGTCAGCTGACAATGTGTCTGATGGGAAGTTACTTAGTTGA
mRNA sequenceShow/hide mRNA sequence
CATTTATACTGGCTCAAAGATTTCCCAAAATAACGAATTTCATTTCTGTAGACGGGATCTTCAATTCTGATTTCTTTTTCCCGGCACTTTTTTCTTTGTTTCTCTCTCTG
CTTCTTCGATAGCGACGCCGAAATCAAAGAAACACACCGAAAATTTTAGCTTATGGGTCGGCCTCCCAGCAATGGAGGCCCTGCCTTCCGCTTCACGGCTTCCGAGGTCG
CGGAGATGGAAACTATATTGCAAGGACACAATAATACCATGCCAGCTCGGGAAGTTCTTGTTGCACTTGCTGATAAGTTCAGTGAATCAGTAGAACGGAAAGGGAAGATT
GCTGTGCAAATGAAGCAAGTTTGGAATTGGTTCCAGAATAGACGATATGCTATCAGAGCAAAGACATCCAAGGCTCCTGGAAAGTTAGCTGTCTCTCCAGTTGTCCAAAT
TGAGTCAACCCCTGTGAGAAATGTGCCTCAAACCGTAGTTGTTCCTGCTCCCACACCAGTAGGCACCGCAAAGAGTGCTCCAGAAAATCCATTGTCGGAATTTGAAGCTA
AATCTGGGAGGGATGGTGCATGGTATGACGTTGCTACCTTTTTATCGCATAGATCTGTGGAAAGTGGTGACCCGGAAGTACTAGTAAGATTTTCTGGTTTTGGATCCGAG
GAGGATGAGTGGGTTAATATACGAAGGAACATTAGACCTCGTTCTCTACCTTGTGAATCATCAGAATGTGTGGCGGTTCTTCCAGGCGATCTCATCTTATGCTTTCAGGA
GGGAAAAGAGCAGGCACTTTACTTTGATGCCCACGTGCTTGATACACAACGAAGAAGGCATGATGTACGAGGTTGTCGATGCAGGTTTTTGGTTCGTTATGATCATGATC
AATCTGAGGAAATTGTTCAGTTGAGAAAGATTTGCCGTCGACCTGAGACTGATTACCGGTTGCAACAGCTTCACGCCGTAAATGAAGCAGCATCCATTGAGCCCTCAAAG
TCTGGCATGGATTCTGTACTGCTCAGTGGTCAGAGGATAAATTTCGAAACACCACAAAATCCACTTAGCAAGGATGCAGCCTTGGTTATACCAAATGCAAATCCCCATAT
AAATGCCCATGCCCAAACTAGTACTCAGGAAGCAAGGAATACTGAAACTAACACTGCTCCGATCACATTCAGTTCCGGTAATCACAATGCTCCAACCACATTCAACTCTG
CTAATCTCGCAGGTAGCTCTGCATTCTCAAGTGGTATCGTGACAAACACTGTTTCTGGTGGGTCAGCTGACAATGTGTCTGATGGGAAGTTACTTAGTTGATTATGGGGG
AAAAGTAAAATTCTCCATTAGTCTAATTTTAACAGAACCTATCAATTTAAAATTTTGCCTGACT
Protein sequenceShow/hide protein sequence
MGRPPSNGGPAFRFTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRNVPQTVVVPAP
TPVGTAKSAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQR
RRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETPQNPLSKDAALVIPNANPHINAHAQTSTQEARNTETN
TAPITFSSGNHNAPTTFNSANLAGSSAFSSGIVTNTVSGGSADNVSDGKLLS