; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10011049 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10011049
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionprotein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X1
Genome locationChr01:1865707..1872100
RNA-Seq ExpressionHG10011049
SyntenyHG10011049
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003682 - chromatin binding (molecular function)
InterPro domainsIPR001356 - Homeobox domain
IPR009057 - Homeobox-like domain superfamily
IPR032001 - SAWADEE domain
IPR039276 - Protein SAWADEE HOMEODOMAIN HOMOLOG 1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8650545.1 hypothetical protein Csa_011086 [Cucumis sativus]1.1e-19089.06Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPSREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTPKAPGKLAVSPIVQIESTPVRN
        MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMP+REVLVALA+KFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKT KAPGKLAVSP+VQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPSREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTPKAPGKLAVSPIVQIESTPVRN

Query:  VPQTIVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVASFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
        VPQT+VVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVA+FLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
Subjt:  VPQTIVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVASFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC

Query:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSVLLSGQRINFETTQKPLSK
        FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKS MDSVLLSGQRINFET+Q PLSK
Subjt:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSVLLSGQRINFETTQKPLSK

Query:  DGALVIPNANANMNVHAQTSTQEASNTETNSAPTTFNSGNHSAPTTFNSGNHSAPNTFNTGNPAGSSAFSSGIMMNTVSSGSADNVSDGKLLS
        D ALVIPNAN ++N HAQTSTQEA NTETN+APTTFNS                       N AGSSAFSSGI+ NTVS+GSADNVSDGKLLS
Subjt:  DGALVIPNANANMNVHAQTSTQEASNTETNSAPTTFNSGNHSAPTTFNSGNHSAPNTFNTGNPAGSSAFSSGIMMNTVSSGSADNVSDGKLLS

TYK11257.1 protein SAWADEE HOMEODOMAIN-like protein 2 isoform X1 [Cucumis melo var. makuwa]2.4e-19389.82Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPSREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTPKAPGKLAVSPIVQIESTPVRN
        MGRPPSNGGPAFRFTASEVAEME ILQGHNNTMP+REVLVALA+KFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKT KAPGKLAVSP+VQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPSREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTPKAPGKLAVSPIVQIESTPVRN

Query:  VPQTIVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVASFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
        VPQT+VVPAP PVG+AK APENPLSEFEAKSGRDGAWYDVA+FLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
Subjt:  VPQTIVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVASFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC

Query:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSVLLSGQRINFETTQKPLSK
        FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDH+QSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKS MDSVLLSGQRINFET Q PLSK
Subjt:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSVLLSGQRINFETTQKPLSK

Query:  DGALVIPNANANMNVHAQTSTQEASNTETNSAPTTFNSGNHSAPTTFNSGNHSAPNTFNTGNPAGSSAFSSGIMMNTVSSGSADNVSDGKLLS
        D ALVIPNAN + N HAQTSTQEA NTETN+AP TF+SGNH+APTTFNS            N AGSSAFSSGI+ NTVS GSADNVSDGKLLS
Subjt:  DGALVIPNANANMNVHAQTSTQEASNTETNSAPTTFNSGNHSAPTTFNSGNHSAPNTFNTGNPAGSSAFSSGIMMNTVSSGSADNVSDGKLLS

XP_008456010.1 PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X1 [Cucumis melo]3.8e-19490.08Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPSREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTPKAPGKLAVSPIVQIESTPVRN
        MGRPPSNGGPAFRFTASEVAEME ILQGHNNTMP+REVLVALA+KFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKT KAPGKLAVSP+VQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPSREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTPKAPGKLAVSPIVQIESTPVRN

Query:  VPQTIVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVASFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
        VPQT+VVPAP PVG+AK APENPLSEFEAKSGRDGAWYDVA+FLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
Subjt:  VPQTIVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVASFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC

Query:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSVLLSGQRINFETTQKPLSK
        FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKS MDSVLLSGQRINFET Q PLSK
Subjt:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSVLLSGQRINFETTQKPLSK

Query:  DGALVIPNANANMNVHAQTSTQEASNTETNSAPTTFNSGNHSAPTTFNSGNHSAPNTFNTGNPAGSSAFSSGIMMNTVSSGSADNVSDGKLLS
        D ALVIPNAN ++N HAQTSTQEA NTETN+AP TF+SGNH+APTTFNS            N AGSSAFSSGI+ NTVS GSADNVSDGKLLS
Subjt:  DGALVIPNANANMNVHAQTSTQEASNTETNSAPTTFNSGNHSAPTTFNSGNHSAPNTFNTGNPAGSSAFSSGIMMNTVSSGSADNVSDGKLLS

XP_038878066.1 protein SAWADEE HOMEODOMAIN HOMOLOG 2-like isoform X1 [Benincasa hispida]7.8e-20093.13Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPSREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTPKAPGKLAVSPIVQIESTPVRN
        MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMP+REVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKT KAPGKLAVSPIVQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPSREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTPKAPGKLAVSPIVQIESTPVRN

Query:  VPQTIVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVASFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
        VPQTIVVPAP PVGSAKGAPENPLSEFEAKSGRDGAWYDVA+FLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
Subjt:  VPQTIVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVASFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC

Query:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSVLLSGQRINFETTQKPLSK
        FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSVLLSGQRINFETTQKPL+K
Subjt:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSVLLSGQRINFETTQKPLSK

Query:  DGALVIPNANANMNVHAQTSTQEASNTETNSAPTTFNSGNHSAPTTFNSGNHSAPNTFNTGNPAGSSAFSSGIMMNTVSSGSADNVSDGKLLS
        D  LVIPNANAN+NVHAQT+TQEA NTETNSAPTTFNSG HSAPTTFNS           GNPAG SAFS GI+ NTVS GSADNVSDGKLLS
Subjt:  DGALVIPNANANMNVHAQTSTQEASNTETNSAPTTFNSGNHSAPTTFNSGNHSAPNTFNTGNPAGSSAFSSGIMMNTVSSGSADNVSDGKLLS

XP_038878067.1 protein SAWADEE HOMEODOMAIN HOMOLOG 2-like isoform X2 [Benincasa hispida]4.5e-18792.74Show/hide
Query:  MEAILQGHNNTMPSREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTPKAPGKLAVSPIVQIESTPVRNVPQTIVVPAPAPVGSAKGAPE
        MEAILQGHNNTMP+REVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKT KAPGKLAVSPIVQIESTPVRNVPQTIVVPAP PVGSAKGAPE
Subjt:  MEAILQGHNNTMPSREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTPKAPGKLAVSPIVQIESTPVRNVPQTIVVPAPAPVGSAKGAPE

Query:  NPLSEFEAKSGRDGAWYDVASFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRR
        NPLSEFEAKSGRDGAWYDVA+FLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRR
Subjt:  NPLSEFEAKSGRDGAWYDVASFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRR

Query:  RHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSVLLSGQRINFETTQKPLSKDGALVIPNANANMNVHAQTST
        RHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSVLLSGQRINFETTQKPL+KD  LVIPNANAN+NVHAQT+T
Subjt:  RHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSVLLSGQRINFETTQKPLSKDGALVIPNANANMNVHAQTST

Query:  QEASNTETNSAPTTFNSGNHSAPTTFNSGNHSAPNTFNTGNPAGSSAFSSGIMMNTVSSGSADNVSDGKLLS
        QEA NTETNSAPTTFNSG HSAPTTFNS           GNPAG SAFS GI+ NTVS GSADNVSDGKLLS
Subjt:  QEASNTETNSAPTTFNSGNHSAPTTFNSGNHSAPNTFNTGNPAGSSAFSSGIMMNTVSSGSADNVSDGKLLS

TrEMBL top hitse value%identityAlignment
A0A0A0LC67 SAWADEE domain-containing protein5.0e-18487.53Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPSREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTPKAPGKLAVSPIVQIESTPVRN
        MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMP+REVLVALA+KFSESVERKGKIAVQMK      QNRRYAIRAKT KAPGKLAVSP+VQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPSREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTPKAPGKLAVSPIVQIESTPVRN

Query:  VPQTIVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVASFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
        VPQT+VVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVA+FLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
Subjt:  VPQTIVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVASFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC

Query:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSVLLSGQRINFETTQKPLSK
        FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKS MDSVLLSGQRINFET+Q PLSK
Subjt:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSVLLSGQRINFETTQKPLSK

Query:  DGALVIPNANANMNVHAQTSTQEASNTETNSAPTTFNSGNHSAPTTFNSGNHSAPNTFNTGNPAGSSAFSSGIMMNTVSSGSADNVSDGKLLS
        D ALVIPNAN ++N HAQTSTQEA NTETN+APTTFNS                       N AGSSAFSSGI+ NTVS+GSADNVSDGKLLS
Subjt:  DGALVIPNANANMNVHAQTSTQEASNTETNSAPTTFNSGNHSAPTTFNSGNHSAPNTFNTGNPAGSSAFSSGIMMNTVSSGSADNVSDGKLLS

A0A1S3C274 protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X11.8e-19490.08Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPSREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTPKAPGKLAVSPIVQIESTPVRN
        MGRPPSNGGPAFRFTASEVAEME ILQGHNNTMP+REVLVALA+KFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKT KAPGKLAVSP+VQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPSREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTPKAPGKLAVSPIVQIESTPVRN

Query:  VPQTIVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVASFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
        VPQT+VVPAP PVG+AK APENPLSEFEAKSGRDGAWYDVA+FLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
Subjt:  VPQTIVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVASFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC

Query:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSVLLSGQRINFETTQKPLSK
        FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKS MDSVLLSGQRINFET Q PLSK
Subjt:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSVLLSGQRINFETTQKPLSK

Query:  DGALVIPNANANMNVHAQTSTQEASNTETNSAPTTFNSGNHSAPTTFNSGNHSAPNTFNTGNPAGSSAFSSGIMMNTVSSGSADNVSDGKLLS
        D ALVIPNAN ++N HAQTSTQEA NTETN+AP TF+SGNH+APTTFNS            N AGSSAFSSGI+ NTVS GSADNVSDGKLLS
Subjt:  DGALVIPNANANMNVHAQTSTQEASNTETNSAPTTFNSGNHSAPTTFNSGNHSAPNTFNTGNPAGSSAFSSGIMMNTVSSGSADNVSDGKLLS

A0A1S3C2X6 protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X21.0e-18189.52Show/hide
Query:  MEAILQGHNNTMPSREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTPKAPGKLAVSPIVQIESTPVRNVPQTIVVPAPAPVGSAKGAPE
        ME ILQGHNNTMP+REVLVALA+KFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKT KAPGKLAVSP+VQIESTPVRNVPQT+VVPAP PVG+AK APE
Subjt:  MEAILQGHNNTMPSREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTPKAPGKLAVSPIVQIESTPVRNVPQTIVVPAPAPVGSAKGAPE

Query:  NPLSEFEAKSGRDGAWYDVASFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRR
        NPLSEFEAKSGRDGAWYDVA+FLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRR
Subjt:  NPLSEFEAKSGRDGAWYDVASFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRR

Query:  RHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSVLLSGQRINFETTQKPLSKDGALVIPNANANMNVHAQTST
        RHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKS MDSVLLSGQRINFET Q PLSKD ALVIPNAN ++N HAQTST
Subjt:  RHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSVLLSGQRINFETTQKPLSKDGALVIPNANANMNVHAQTST

Query:  QEASNTETNSAPTTFNSGNHSAPTTFNSGNHSAPNTFNTGNPAGSSAFSSGIMMNTVSSGSADNVSDGKLLS
        QEA NTETN+AP TF+SGNH+APTTFNS            N AGSSAFSSGI+ NTVS GSADNVSDGKLLS
Subjt:  QEASNTETNSAPTTFNSGNHSAPTTFNSGNHSAPNTFNTGNPAGSSAFSSGIMMNTVSSGSADNVSDGKLLS

A0A5A7UUV7 SAWADEE HOMEODOMAIN-like protein 2 isoform X11.8e-19490.08Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPSREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTPKAPGKLAVSPIVQIESTPVRN
        MGRPPSNGGPAFRFTASEVAEME ILQGHNNTMP+REVLVALA+KFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKT KAPGKLAVSP+VQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPSREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTPKAPGKLAVSPIVQIESTPVRN

Query:  VPQTIVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVASFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
        VPQT+VVPAP PVG+AK APENPLSEFEAKSGRDGAWYDVA+FLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
Subjt:  VPQTIVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVASFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC

Query:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSVLLSGQRINFETTQKPLSK
        FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKS MDSVLLSGQRINFET Q PLSK
Subjt:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSVLLSGQRINFETTQKPLSK

Query:  DGALVIPNANANMNVHAQTSTQEASNTETNSAPTTFNSGNHSAPTTFNSGNHSAPNTFNTGNPAGSSAFSSGIMMNTVSSGSADNVSDGKLLS
        D ALVIPNAN ++N HAQTSTQEA NTETN+AP TF+SGNH+APTTFNS            N AGSSAFSSGI+ NTVS GSADNVSDGKLLS
Subjt:  DGALVIPNANANMNVHAQTSTQEASNTETNSAPTTFNSGNHSAPTTFNSGNHSAPNTFNTGNPAGSSAFSSGIMMNTVSSGSADNVSDGKLLS

A0A5D3CH38 Protein SAWADEE HOMEODOMAIN-like protein 2 isoform X11.2e-19389.82Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPSREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTPKAPGKLAVSPIVQIESTPVRN
        MGRPPSNGGPAFRFTASEVAEME ILQGHNNTMP+REVLVALA+KFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKT KAPGKLAVSP+VQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPSREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTPKAPGKLAVSPIVQIESTPVRN

Query:  VPQTIVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVASFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
        VPQT+VVPAP PVG+AK APENPLSEFEAKSGRDGAWYDVA+FLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
Subjt:  VPQTIVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVASFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC

Query:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSVLLSGQRINFETTQKPLSK
        FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDH+QSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKS MDSVLLSGQRINFET Q PLSK
Subjt:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSVLLSGQRINFETTQKPLSK

Query:  DGALVIPNANANMNVHAQTSTQEASNTETNSAPTTFNSGNHSAPTTFNSGNHSAPNTFNTGNPAGSSAFSSGIMMNTVSSGSADNVSDGKLLS
        D ALVIPNAN + N HAQTSTQEA NTETN+AP TF+SGNH+APTTFNS            N AGSSAFSSGI+ NTVS GSADNVSDGKLLS
Subjt:  DGALVIPNANANMNVHAQTSTQEASNTETNSAPTTFNSGNHSAPTTFNSGNHSAPNTFNTGNPAGSSAFSSGIMMNTVSSGSADNVSDGKLLS

SwissProt top hitse value%identityAlignment
Q8RWJ7 Protein SAWADEE HOMEODOMAIN HOMOLOG 24.2e-10358.64Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPSREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTPKAPGKLAVSPIVQIE-STPVR
        MGRPPSNGGPAFRF   EV EMEAIL  HN  MP R +L ALA+KFSES ERKGK+ VQ KQ+WNWFQNRRYA+RA+  KAPGKL VS + +++    +R
Subjt:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPSREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTPKAPGKLAVSPIVQIE-STPVR

Query:  NVPQTIVVP------------APAPVGS-----AKGAPENPLSEFEAKSGRDGAWYDVASFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSL
        +V Q + VP             PAP GS      +   +N   EFEAKS RDGAWYDV +FL+HR++E GDPEV VRF+GF  EEDEW+N+++++R RSL
Subjt:  NVPQTIVVP------------APAPVGS-----AKGAPENPLSEFEAKSGRDGAWYDVASFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSL

Query:  PCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLH-AVNEAA-SIEPSKSSMD
        PCE+SECVAVL GDL+LCFQEGK+QALYFDA VLD QRRRHDVRGCRCRFLVRY HDQSEEIV LRKICRRPETDYRLQQLH AVN+ A S +    ++D
Subjt:  PCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLH-AVNEAA-SIEPSKSSMD

Query:  SVLLSGQRINFETTQKPLSKDGA---LVIPNA-NANMNVHAQTSTQEASNTET
        +             + PLS  GA   +V P + + +++    T  Q +SN  T
Subjt:  SVLLSGQRINFETTQKPLSKDGA---LVIPNA-NANMNVHAQTSTQEASNTET

Q9XI47 Protein SAWADEE HOMEODOMAIN HOMOLOG 11.5e-4443.37Show/hide
Query:  FTASEVAEMEAILQGHNNTMPSREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNR-RYAIRAKTPKAPGKLAVSPIVQIE-----STPVRNVPQTIVV
        FT SE+ +ME + +   +    ++    +A  FS SV R GK ++  KQV  WFQ + ++  + K+   P     SP +QI      S+   N      V
Subjt:  FTASEVAEMEAILQGHNNTMPSREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNR-RYAIRAKTPKAPGKLAVSPIVQIE-----STPVRNVPQTIVV

Query:  PAPAPVGSAKG-APENPLSEFEAKSGRDGAWYDVASFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKE
             V + KG A +     FEAKS RD AWYDV+SFL++R + +G+ EV VRFSGF +  DEWVN++ ++R RS+P E SEC  V  GDL+LCFQE ++
Subjt:  PAPAPVGSAKG-APENPLSEFEAKSGRDGAWYDVASFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKE

Query:  QALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPE
        QALY D HVL+ +R  HD   C C FLVRY+ D +EE + L +ICRRPE
Subjt:  QALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPE

Arabidopsis top hitse value%identityAlignment
AT1G15215.2 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors1.1e-4543.37Show/hide
Query:  FTASEVAEMEAILQGHNNTMPSREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNR-RYAIRAKTPKAPGKLAVSPIVQIE-----STPVRNVPQTIVV
        FT SE+ +ME + +   +    ++    +A  FS SV R GK ++  KQV  WFQ + ++  + K+   P     SP +QI      S+   N      V
Subjt:  FTASEVAEMEAILQGHNNTMPSREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNR-RYAIRAKTPKAPGKLAVSPIVQIE-----STPVRNVPQTIVV

Query:  PAPAPVGSAKG-APENPLSEFEAKSGRDGAWYDVASFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKE
             V + KG A +     FEAKS RD AWYDV+SFL++R + +G+ EV VRFSGF +  DEWVN++ ++R RS+P E SEC  V  GDL+LCFQE ++
Subjt:  PAPAPVGSAKG-APENPLSEFEAKSGRDGAWYDVASFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKE

Query:  QALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPE
        QALY D HVL+ +R  HD   C C FLVRY+ D +EE + L +ICRRPE
Subjt:  QALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPE

AT1G15215.3 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors6.1e-4142.37Show/hide
Query:  FTASEVAEMEAILQGHNNTMPSREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNR-RYAIRAKTPKAPGKLAVSPIVQIE-----STPVRNVPQTIVV
        FT SE+ +ME + +   +    ++    +A  FS SV R GK ++  KQV  WFQ + ++  + K+   P     SP +QI      S+   N      V
Subjt:  FTASEVAEMEAILQGHNNTMPSREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNR-RYAIRAKTPKAPGKLAVSPIVQIE-----STPVRNVPQTIVV

Query:  PAPAPVGSAKG-APENPLSEFEAKSGRDGAWYDVASFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKE
             V + KG A +     FEAKS RD AWYDV+SFL++R + +G+ EV VRFSGF +  DEWVN++ ++R RS+P E SEC  V  GDL+LCFQE ++
Subjt:  PAPAPVGSAKG-APENPLSEFEAKSGRDGAWYDVASFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKE

Query:  QALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE
        QALY D HVL+ +R  HD   C C FLVRY+ D +E
Subjt:  QALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE

AT3G18380.1 sequence-specific DNA binding transcription factors;sequence-specific DNA binding3.0e-10458.64Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPSREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTPKAPGKLAVSPIVQIE-STPVR
        MGRPPSNGGPAFRF   EV EMEAIL  HN  MP R +L ALA+KFSES ERKGK+ VQ KQ+WNWFQNRRYA+RA+  KAPGKL VS + +++    +R
Subjt:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPSREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTPKAPGKLAVSPIVQIE-STPVR

Query:  NVPQTIVVP------------APAPVGS-----AKGAPENPLSEFEAKSGRDGAWYDVASFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSL
        +V Q + VP             PAP GS      +   +N   EFEAKS RDGAWYDV +FL+HR++E GDPEV VRF+GF  EEDEW+N+++++R RSL
Subjt:  NVPQTIVVP------------APAPVGS-----AKGAPENPLSEFEAKSGRDGAWYDVASFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSL

Query:  PCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLH-AVNEAA-SIEPSKSSMD
        PCE+SECVAVL GDL+LCFQEGK+QALYFDA VLD QRRRHDVRGCRCRFLVRY HDQSEEIV LRKICRRPETDYRLQQLH AVN+ A S +    ++D
Subjt:  PCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLH-AVNEAA-SIEPSKSSMD

Query:  SVLLSGQRINFETTQKPLSKDGA---LVIPNA-NANMNVHAQTSTQEASNTET
        +             + PLS  GA   +V P + + +++    T  Q +SN  T
Subjt:  SVLLSGQRINFETTQKPLSKDGA---LVIPNA-NANMNVHAQTSTQEASNTET

AT3G18380.2 sequence-specific DNA binding transcription factors;sequence-specific DNA binding7.3e-10358.47Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPSREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTPKAPGKLAVSPIVQIE-STPVR
        MGRPPSNGGPAFRF   EV EMEAIL  HN  MP R +L ALA+KFSES ERKGK+ VQ KQ+WNWFQNRRYA+RA+  KAPGKL VS + +++    +R
Subjt:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPSREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTPKAPGKLAVSPIVQIE-STPVR

Query:  NVPQTIVVP------------APAPVGS-----AKGAPENPLSEFEAKSGRDGAWYDVASFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSL
        +V Q + VP             PAP GS      +   +N   EFEAKS RDGAWYDV +FL+HR++E GDPEV VRF+GF  EEDEW+N+++++R RSL
Subjt:  NVPQTIVVP------------APAPVGS-----AKGAPENPLSEFEAKSGRDGAWYDVASFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSL

Query:  PCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE-EIVQLRKICRRPETDYRLQQLH-AVNEAA-SIEPSKSSM
        PCE+SECVAVL GDL+LCFQEGK+QALYFDA VLD QRRRHDVRGCRCRFLVRY HDQSE EIV LRKICRRPETDYRLQQLH AVN+ A S +    ++
Subjt:  PCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE-EIVQLRKICRRPETDYRLQQLH-AVNEAA-SIEPSKSSM

Query:  DSVLLSGQRINFETTQKPLSKDGA---LVIPNA-NANMNVHAQTSTQEASNTET
        D+             + PLS  GA   +V P + + +++    T  Q +SN  T
Subjt:  DSVLLSGQRINFETTQKPLSKDGA---LVIPNA-NANMNVHAQTSTQEASNTET

AT3G18380.3 sequence-specific DNA binding transcription factors;sequence-specific DNA binding7.3e-10358.4Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPSREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTPKAPGKLAVSPIVQIE-STPVR
        MGRPPSNGGPAFRF   EV EMEAIL  HN  MP R +L ALA+KFSES ERKGK+ VQ KQ+WNWFQNRRYA+RA+  KAPGKL VS + +++    +R
Subjt:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPSREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTPKAPGKLAVSPIVQIE-STPVR

Query:  NVPQTIVV--------------PAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVASFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCE
        +V Q + V              PAP+  G  +   +N   EFEAKS RDGAWYDV +FL+HR++E GDPEV VRF+GF  EEDEW+N+++++R RSLPCE
Subjt:  NVPQTIVV--------------PAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVASFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCE

Query:  SSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE-EIVQLRKICRRPETDYRLQQLH-AVNEAA-SIEPSKSSMDSV
        +SECVAVL GDL+LCFQEGK+QALYFDA VLD QRRRHDVRGCRCRFLVRY HDQSE EIV LRKICRRPETDYRLQQLH AVN+ A S +    ++D+ 
Subjt:  SSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE-EIVQLRKICRRPETDYRLQQLH-AVNEAA-SIEPSKSSMDSV

Query:  LLSGQRINFETTQKPLSKDGA---LVIPNA-NANMNVHAQTSTQEASNTET
                    + PLS  GA   +V P + + +++    T  Q +SN  T
Subjt:  LLSGQRINFETTQKPLSKDGA---LVIPNA-NANMNVHAQTSTQEASNTET


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTCGGCCTCCCAGCAATGGAGGCCCTGCCTTCCGCTTCACGGCTTCCGAGGTCGCGGAGATGGAAGCTATATTGCAAGGACACAATAATACCATGCCATCTCGGGA
AGTTCTCGTTGCCCTTGCTGAGAAGTTTAGTGAATCAGTAGAACGGAAAGGGAAGATTGCTGTGCAAATGAAGCAAGTTTGGAATTGGTTCCAGAATAGACGATATGCTA
TCAGAGCAAAGACACCCAAGGCTCCTGGAAAGTTAGCTGTCTCTCCAATTGTCCAAATTGAGTCAACTCCCGTGAGAAATGTGCCTCAAACCATAGTTGTTCCTGCTCCC
GCACCAGTAGGCTCTGCAAAGGGGGCTCCAGAAAATCCGTTGTCGGAATTTGAAGCTAAATCTGGGAGGGATGGTGCATGGTATGACGTTGCGTCCTTTTTATCCCATAG
ATCTGTGGAAAGTGGTGATCCGGAAGTACTAGTTAGATTTTCTGGTTTTGGATCCGAGGAGGATGAGTGGGTTAATATCCGAAGAAACATTAGACCTCGTTCTCTACCTT
GTGAATCATCAGAATGTGTGGCAGTTCTTCCAGGCGATCTCATCTTATGCTTTCAGGAGGGTAAAGAGCAGGCACTTTACTTTGATGCCCACGTGCTTGATACACAGCGA
AGAAGACATGATGTACGGGGTTGTCGCTGCAGGTTTTTGGTTCGTTATGATCACGATCAATCTGAGGAAATTGTTCAGTTGAGAAAGATTTGCCGTCGGCCTGAGACTGA
TTACCGGTTGCAACAGCTTCACGCGGTAAATGAAGCAGCATCCATTGAGCCCTCAAAGTCTAGCATGGATTCTGTATTGCTTAGTGGCCAGAGGATAAATTTCGAGACAA
CACAAAAGCCGCTTAGCAAGGATGGAGCCTTGGTTATACCAAATGCAAATGCCAATATGAATGTCCATGCCCAAACTAGTACTCAGGAAGCAAGCAATACTGAAACTAAC
AGTGCTCCAACCACATTCAACTCTGGTAATCACAGTGCTCCTACAACATTCAACTCTGGTAATCACAGTGCTCCAAACACATTCAACACTGGTAATCCAGCAGGTAGCTC
TGCCTTCTCGAGTGGTATCATGATGAACACTGTTTCTAGTGGGTCGGCTGACAATGTGTCTGATGGGAAGTTACTTAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGTCGGCCTCCCAGCAATGGAGGCCCTGCCTTCCGCTTCACGGCTTCCGAGGTCGCGGAGATGGAAGCTATATTGCAAGGACACAATAATACCATGCCATCTCGGGA
AGTTCTCGTTGCCCTTGCTGAGAAGTTTAGTGAATCAGTAGAACGGAAAGGGAAGATTGCTGTGCAAATGAAGCAAGTTTGGAATTGGTTCCAGAATAGACGATATGCTA
TCAGAGCAAAGACACCCAAGGCTCCTGGAAAGTTAGCTGTCTCTCCAATTGTCCAAATTGAGTCAACTCCCGTGAGAAATGTGCCTCAAACCATAGTTGTTCCTGCTCCC
GCACCAGTAGGCTCTGCAAAGGGGGCTCCAGAAAATCCGTTGTCGGAATTTGAAGCTAAATCTGGGAGGGATGGTGCATGGTATGACGTTGCGTCCTTTTTATCCCATAG
ATCTGTGGAAAGTGGTGATCCGGAAGTACTAGTTAGATTTTCTGGTTTTGGATCCGAGGAGGATGAGTGGGTTAATATCCGAAGAAACATTAGACCTCGTTCTCTACCTT
GTGAATCATCAGAATGTGTGGCAGTTCTTCCAGGCGATCTCATCTTATGCTTTCAGGAGGGTAAAGAGCAGGCACTTTACTTTGATGCCCACGTGCTTGATACACAGCGA
AGAAGACATGATGTACGGGGTTGTCGCTGCAGGTTTTTGGTTCGTTATGATCACGATCAATCTGAGGAAATTGTTCAGTTGAGAAAGATTTGCCGTCGGCCTGAGACTGA
TTACCGGTTGCAACAGCTTCACGCGGTAAATGAAGCAGCATCCATTGAGCCCTCAAAGTCTAGCATGGATTCTGTATTGCTTAGTGGCCAGAGGATAAATTTCGAGACAA
CACAAAAGCCGCTTAGCAAGGATGGAGCCTTGGTTATACCAAATGCAAATGCCAATATGAATGTCCATGCCCAAACTAGTACTCAGGAAGCAAGCAATACTGAAACTAAC
AGTGCTCCAACCACATTCAACTCTGGTAATCACAGTGCTCCTACAACATTCAACTCTGGTAATCACAGTGCTCCAAACACATTCAACACTGGTAATCCAGCAGGTAGCTC
TGCCTTCTCGAGTGGTATCATGATGAACACTGTTTCTAGTGGGTCGGCTGACAATGTGTCTGATGGGAAGTTACTTAGTTGA
Protein sequenceShow/hide protein sequence
MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPSREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTPKAPGKLAVSPIVQIESTPVRNVPQTIVVPAP
APVGSAKGAPENPLSEFEAKSGRDGAWYDVASFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQR
RRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSVLLSGQRINFETTQKPLSKDGALVIPNANANMNVHAQTSTQEASNTETN
SAPTTFNSGNHSAPTTFNSGNHSAPNTFNTGNPAGSSAFSSGIMMNTVSSGSADNVSDGKLLS