; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg004241 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg004241
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptionprotein SAWADEE HOMEODOMAIN HOMOLOG 2-like
Genome locationscaffold6:5905121..5911794
RNA-Seq ExpressionSpg004241
SyntenySpg004241
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003682 - chromatin binding (molecular function)
InterPro domainsIPR001356 - Homeobox domain
IPR009057 - Homeobox-like domain superfamily
IPR032001 - SAWADEE domain
IPR039276 - Protein SAWADEE HOMEODOMAIN HOMOLOG 1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8650545.1 hypothetical protein Csa_011086 [Cucumis sativus]2.0e-19087.69Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN
        MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALA+KFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKT+KAPGKLAVSPVVQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN

Query:  VPQTIVVPAPAPVGAAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSLESGDPEVLVRFAGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
        VPQT+VVPAPAPVG+AKGAPENPLSEFEAKSGRDGAWYDVATFLSHRS+ESGDPEVLVRF+GFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
Subjt:  VPQTIVVPAPAPVGAAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSLESGDPEVLVRFAGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC

Query:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSELISHEMKTLALDVEKSWKLRTLLMVAAEIVQLRKICRRPETDYRLQQLHAVNEAASIE
        FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE                           EIVQLRKICRRPETDYRLQQLHAVNEAASIE
Subjt:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSELISHEMKTLALDVEKSWKLRTLLMVAAEIVQLRKICRRPETDYRLQQLHAVNEAASIE

Query:  PSKSGMDSVLLSGQRINFEATQRPLNKDATIVIPNANANINVHAQTSTQEARNIETNSAPTTFNSGNPAGSSAFSSGIVTSSVSGGSADNVSDGKLLS
        PSKSGMDSVLLSGQRINFE +Q PL+KDA +VIPNAN +IN HAQTSTQEARN ETN+APTTFNS N AGSSAFSSGIVT++VS GSADNVSDGKLLS
Subjt:  PSKSGMDSVLLSGQRINFEATQRPLNKDATIVIPNANANINVHAQTSTQEARNIETNSAPTTFNSGNPAGSSAFSSGIVTSSVSGGSADNVSDGKLLS

TYK11257.1 protein SAWADEE HOMEODOMAIN-like protein 2 isoform X1 [Cucumis melo var. makuwa]1.9e-18584.35Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN
        MGRPPSNGGPAFRFTASEVAEME ILQGHNNTMPAREVLVALA+KFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKT+KAPGKLAVSPVVQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN

Query:  VPQTIVVPAPAPVGAAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSLESGDPEVLVRFAGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
        VPQT+VVPAP PVG AK APENPLSEFEAKSGRDGAWYDVATFLSHRS+ESGDPEVLVRF+GFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
Subjt:  VPQTIVVPAPAPVGAAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSLESGDPEVLVRFAGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC

Query:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSELISHEMKTLALDVEKSWKLRTLLMVAAEIVQLRKICRRPETDYRLQQLHAVNEAASIE
        FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDH+QSE                           EIVQLRKICRRPETDYRLQQLHAVNEAASIE
Subjt:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSELISHEMKTLALDVEKSWKLRTLLMVAAEIVQLRKICRRPETDYRLQQLHAVNEAASIE

Query:  PSKSGMDSVLLSGQRINFEATQRPLNKDATIVIPNANANINVHAQTSTQEARNIETN-----------SAPTTFNSGNPAGSSAFSSGIVTSSVSGGSAD
        PSKSGMDSVLLSGQRINFE  Q PL+KDA +VIPNAN + N HAQTSTQEARN ETN           +APTTFNS N AGSSAFSSGIVT++VSGGSAD
Subjt:  PSKSGMDSVLLSGQRINFEATQRPLNKDATIVIPNANANINVHAQTSTQEARNIETN-----------SAPTTFNSGNPAGSSAFSSGIVTSSVSGGSAD

Query:  NVSDGKLLS
        NVSDGKLLS
Subjt:  NVSDGKLLS

XP_008456010.1 PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X1 [Cucumis melo]1.3e-18684.84Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN
        MGRPPSNGGPAFRFTASEVAEME ILQGHNNTMPAREVLVALA+KFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKT+KAPGKLAVSPVVQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN

Query:  VPQTIVVPAPAPVGAAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSLESGDPEVLVRFAGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
        VPQT+VVPAP PVG AK APENPLSEFEAKSGRDGAWYDVATFLSHRS+ESGDPEVLVRF+GFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
Subjt:  VPQTIVVPAPAPVGAAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSLESGDPEVLVRFAGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC

Query:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSELISHEMKTLALDVEKSWKLRTLLMVAAEIVQLRKICRRPETDYRLQQLHAVNEAASIE
        FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE                           EIVQLRKICRRPETDYRLQQLHAVNEAASIE
Subjt:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSELISHEMKTLALDVEKSWKLRTLLMVAAEIVQLRKICRRPETDYRLQQLHAVNEAASIE

Query:  PSKSGMDSVLLSGQRINFEATQRPLNKDATIVIPNANANINVHAQTSTQEARNIETN-----------SAPTTFNSGNPAGSSAFSSGIVTSSVSGGSAD
        PSKSGMDSVLLSGQRINFE  Q PL+KDA +VIPNAN +IN HAQTSTQEARN ETN           +APTTFNS N AGSSAFSSGIVT++VSGGSAD
Subjt:  PSKSGMDSVLLSGQRINFEATQRPLNKDATIVIPNANANINVHAQTSTQEARNIETN-----------SAPTTFNSGNPAGSSAFSSGIVTSSVSGGSAD

Query:  NVSDGKLLS
        NVSDGKLLS
Subjt:  NVSDGKLLS

XP_022142790.1 protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X1 [Momordica charantia]4.2e-18584.67Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN
        MGRPPSNGGPAFRFTA+EVAEMEAILQGHNNTMPAREVLVALAEKFSES+ERKGKIAVQMKQVWNWFQNRRYAIRAK+TKAPGKLAVSP+VQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN

Query:  VPQTIVVPAPAPVGAAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSLESGDPEVLVRFAGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
        VPQ+IVVPAPAPVG+ KGAP+NPLSEFEAKS RDGAWYDVATFLSH+S+ESGDPEVLVRFAGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
Subjt:  VPQTIVVPAPAPVGAAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSLESGDPEVLVRFAGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC

Query:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSELISHEMKTLALDVEKSWKLRTLLMVAAEIVQLRKICRRPETDYRLQQLHAVNEAASIE
        FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE                           EIVQLRKICRRPETDYRLQQLHAVNEAAS+E
Subjt:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSELISHEMKTLALDVEKSWKLRTLLMVAAEIVQLRKICRRPETDYRLQQLHAVNEAASIE

Query:  PSKSGMDSVLLSGQRINFEATQRPLNKDATIVIPNANANINVHAQTSTQEARNIETNSAPTTFNSGNPAGSSAFSSGIVTSSVSGGSADNVSDGKLLS
        P KSGMDSVLLSG R+NFE TQ+PL KDAT+V PNAN N+NV AQT TQE RNIET+S P +FNSGNPAGSSAF SGI T+SVSGG  DNVSDGKLLS
Subjt:  PSKSGMDSVLLSGQRINFEATQRPLNKDATIVIPNANANINVHAQTSTQEARNIETNSAPTTFNSGNPAGSSAFSSGIVTSSVSGGSADNVSDGKLLS

XP_038878066.1 protein SAWADEE HOMEODOMAIN HOMOLOG 2-like isoform X1 [Benincasa hispida]6.1e-19286.8Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN
        MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSP+VQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN

Query:  VPQTIVVPAPAPVGAAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSLESGDPEVLVRFAGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
        VPQTIVVPAP PVG+AKGAPENPLSEFEAKSGRDGAWYDVATFLSHRS+ESGDPEVLVRF+GFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
Subjt:  VPQTIVVPAPAPVGAAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSLESGDPEVLVRFAGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC

Query:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSELISHEMKTLALDVEKSWKLRTLLMVAAEIVQLRKICRRPETDYRLQQLHAVNEAASIE
        FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE                           EIVQLRKICRRPETDYRLQQLHAVNEAASIE
Subjt:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSELISHEMKTLALDVEKSWKLRTLLMVAAEIVQLRKICRRPETDYRLQQLHAVNEAASIE

Query:  PSKSGMDSVLLSGQRINFEATQRPLNKDATIVIPNANANINVHAQTSTQEARNIETN-----------SAPTTFNSGNPAGSSAFSSGIVTSSVSGGSAD
        PSKS MDSVLLSGQRINFE TQ+PLNKD T+VIPNANANINVHAQT+TQEARN ETN           SAPTTFNSGNPAG SAFS GIVT++VSGGSAD
Subjt:  PSKSGMDSVLLSGQRINFEATQRPLNKDATIVIPNANANINVHAQTSTQEARNIETN-----------SAPTTFNSGNPAGSSAFSSGIVTSSVSGGSAD

Query:  NVSDGKLLS
        NVSDGKLLS
Subjt:  NVSDGKLLS

TrEMBL top hitse value%identityAlignment
A0A0A0LC67 SAWADEE domain-containing protein8.6e-18486.18Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN
        MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALA+KFSESVERKGKIAVQMK      QNRRYAIRAKT+KAPGKLAVSPVVQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN

Query:  VPQTIVVPAPAPVGAAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSLESGDPEVLVRFAGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
        VPQT+VVPAPAPVG+AKGAPENPLSEFEAKSGRDGAWYDVATFLSHRS+ESGDPEVLVRF+GFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
Subjt:  VPQTIVVPAPAPVGAAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSLESGDPEVLVRFAGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC

Query:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSELISHEMKTLALDVEKSWKLRTLLMVAAEIVQLRKICRRPETDYRLQQLHAVNEAASIE
        FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE                           EIVQLRKICRRPETDYRLQQLHAVNEAASIE
Subjt:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSELISHEMKTLALDVEKSWKLRTLLMVAAEIVQLRKICRRPETDYRLQQLHAVNEAASIE

Query:  PSKSGMDSVLLSGQRINFEATQRPLNKDATIVIPNANANINVHAQTSTQEARNIETNSAPTTFNSGNPAGSSAFSSGIVTSSVSGGSADNVSDGKLLS
        PSKSGMDSVLLSGQRINFE +Q PL+KDA +VIPNAN +IN HAQTSTQEARN ETN+APTTFNS N AGSSAFSSGIVT++VS GSADNVSDGKLLS
Subjt:  PSKSGMDSVLLSGQRINFEATQRPLNKDATIVIPNANANINVHAQTSTQEARNIETNSAPTTFNSGNPAGSSAFSSGIVTSSVSGGSADNVSDGKLLS

A0A1S3C274 protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X16.4e-18784.84Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN
        MGRPPSNGGPAFRFTASEVAEME ILQGHNNTMPAREVLVALA+KFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKT+KAPGKLAVSPVVQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN

Query:  VPQTIVVPAPAPVGAAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSLESGDPEVLVRFAGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
        VPQT+VVPAP PVG AK APENPLSEFEAKSGRDGAWYDVATFLSHRS+ESGDPEVLVRF+GFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
Subjt:  VPQTIVVPAPAPVGAAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSLESGDPEVLVRFAGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC

Query:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSELISHEMKTLALDVEKSWKLRTLLMVAAEIVQLRKICRRPETDYRLQQLHAVNEAASIE
        FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE                           EIVQLRKICRRPETDYRLQQLHAVNEAASIE
Subjt:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSELISHEMKTLALDVEKSWKLRTLLMVAAEIVQLRKICRRPETDYRLQQLHAVNEAASIE

Query:  PSKSGMDSVLLSGQRINFEATQRPLNKDATIVIPNANANINVHAQTSTQEARNIETN-----------SAPTTFNSGNPAGSSAFSSGIVTSSVSGGSAD
        PSKSGMDSVLLSGQRINFE  Q PL+KDA +VIPNAN +IN HAQTSTQEARN ETN           +APTTFNS N AGSSAFSSGIVT++VSGGSAD
Subjt:  PSKSGMDSVLLSGQRINFEATQRPLNKDATIVIPNANANINVHAQTSTQEARNIETN-----------SAPTTFNSGNPAGSSAFSSGIVTSSVSGGSAD

Query:  NVSDGKLLS
        NVSDGKLLS
Subjt:  NVSDGKLLS

A0A5A7UUV7 SAWADEE HOMEODOMAIN-like protein 2 isoform X16.4e-18784.84Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN
        MGRPPSNGGPAFRFTASEVAEME ILQGHNNTMPAREVLVALA+KFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKT+KAPGKLAVSPVVQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN

Query:  VPQTIVVPAPAPVGAAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSLESGDPEVLVRFAGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
        VPQT+VVPAP PVG AK APENPLSEFEAKSGRDGAWYDVATFLSHRS+ESGDPEVLVRF+GFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
Subjt:  VPQTIVVPAPAPVGAAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSLESGDPEVLVRFAGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC

Query:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSELISHEMKTLALDVEKSWKLRTLLMVAAEIVQLRKICRRPETDYRLQQLHAVNEAASIE
        FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE                           EIVQLRKICRRPETDYRLQQLHAVNEAASIE
Subjt:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSELISHEMKTLALDVEKSWKLRTLLMVAAEIVQLRKICRRPETDYRLQQLHAVNEAASIE

Query:  PSKSGMDSVLLSGQRINFEATQRPLNKDATIVIPNANANINVHAQTSTQEARNIETN-----------SAPTTFNSGNPAGSSAFSSGIVTSSVSGGSAD
        PSKSGMDSVLLSGQRINFE  Q PL+KDA +VIPNAN +IN HAQTSTQEARN ETN           +APTTFNS N AGSSAFSSGIVT++VSGGSAD
Subjt:  PSKSGMDSVLLSGQRINFEATQRPLNKDATIVIPNANANINVHAQTSTQEARNIETN-----------SAPTTFNSGNPAGSSAFSSGIVTSSVSGGSAD

Query:  NVSDGKLLS
        NVSDGKLLS
Subjt:  NVSDGKLLS

A0A5D3CH38 Protein SAWADEE HOMEODOMAIN-like protein 2 isoform X19.2e-18684.35Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN
        MGRPPSNGGPAFRFTASEVAEME ILQGHNNTMPAREVLVALA+KFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKT+KAPGKLAVSPVVQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN

Query:  VPQTIVVPAPAPVGAAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSLESGDPEVLVRFAGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
        VPQT+VVPAP PVG AK APENPLSEFEAKSGRDGAWYDVATFLSHRS+ESGDPEVLVRF+GFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
Subjt:  VPQTIVVPAPAPVGAAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSLESGDPEVLVRFAGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC

Query:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSELISHEMKTLALDVEKSWKLRTLLMVAAEIVQLRKICRRPETDYRLQQLHAVNEAASIE
        FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDH+QSE                           EIVQLRKICRRPETDYRLQQLHAVNEAASIE
Subjt:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSELISHEMKTLALDVEKSWKLRTLLMVAAEIVQLRKICRRPETDYRLQQLHAVNEAASIE

Query:  PSKSGMDSVLLSGQRINFEATQRPLNKDATIVIPNANANINVHAQTSTQEARNIETN-----------SAPTTFNSGNPAGSSAFSSGIVTSSVSGGSAD
        PSKSGMDSVLLSGQRINFE  Q PL+KDA +VIPNAN + N HAQTSTQEARN ETN           +APTTFNS N AGSSAFSSGIVT++VSGGSAD
Subjt:  PSKSGMDSVLLSGQRINFEATQRPLNKDATIVIPNANANINVHAQTSTQEARNIETN-----------SAPTTFNSGNPAGSSAFSSGIVTSSVSGGSAD

Query:  NVSDGKLLS
        NVSDGKLLS
Subjt:  NVSDGKLLS

A0A6J1CLX5 protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X12.0e-18584.67Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN
        MGRPPSNGGPAFRFTA+EVAEMEAILQGHNNTMPAREVLVALAEKFSES+ERKGKIAVQMKQVWNWFQNRRYAIRAK+TKAPGKLAVSP+VQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRN

Query:  VPQTIVVPAPAPVGAAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSLESGDPEVLVRFAGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
        VPQ+IVVPAPAPVG+ KGAP+NPLSEFEAKS RDGAWYDVATFLSH+S+ESGDPEVLVRFAGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC
Subjt:  VPQTIVVPAPAPVGAAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSLESGDPEVLVRFAGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILC

Query:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSELISHEMKTLALDVEKSWKLRTLLMVAAEIVQLRKICRRPETDYRLQQLHAVNEAASIE
        FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE                           EIVQLRKICRRPETDYRLQQLHAVNEAAS+E
Subjt:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSELISHEMKTLALDVEKSWKLRTLLMVAAEIVQLRKICRRPETDYRLQQLHAVNEAASIE

Query:  PSKSGMDSVLLSGQRINFEATQRPLNKDATIVIPNANANINVHAQTSTQEARNIETNSAPTTFNSGNPAGSSAFSSGIVTSSVSGGSADNVSDGKLLS
        P KSGMDSVLLSG R+NFE TQ+PL KDAT+V PNAN N+NV AQT TQE RNIET+S P +FNSGNPAGSSAF SGI T+SVSGG  DNVSDGKLLS
Subjt:  PSKSGMDSVLLSGQRINFEATQRPLNKDATIVIPNANANINVHAQTSTQEARNIETNSAPTTFNSGNPAGSSAFSSGIVTSSVSGGSADNVSDGKLLS

SwissProt top hitse value%identityAlignment
Q8RWJ7 Protein SAWADEE HOMEODOMAIN HOMOLOG 22.6e-10055.68Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIE-STPVR
        MGRPPSNGGPAFRF   EV EMEAIL  HN  MP R +L ALA+KFSES ERKGK+ VQ KQ+WNWFQNRRYA+RA+  KAPGKL VS + +++    +R
Subjt:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIE-STPVR

Query:  NVPQTIVVP------------APAPVGA-----AKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSLESGDPEVLVRFAGFGSEEDEWVNIRRNIRPRSL
        +V Q + VP             PAP G+      +   +N   EFEAKS RDGAWYDV  FL+HR+LE GDPEV VRFAGF  EEDEW+N+++++R RSL
Subjt:  NVPQTIVVP------------APAPVGA-----AKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSLESGDPEVLVRFAGFGSEEDEWVNIRRNIRPRSL

Query:  PCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSELISHEMKTLALDVEKSWKLRTLLMVAAEIVQLRKICRRPE
        PCE+SECVAVL GDL+LCFQEGK+QALYFDA VLD QRRRHDVRGCRCRFLVRY HDQSE                           EIV LRKICRRPE
Subjt:  PCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSELISHEMKTLALDVEKSWKLRTLLMVAAEIVQLRKICRRPE

Query:  TDYRLQQLH-AVNEAASIEPSK------SGMDSVLLSGQRINFEATQRPLNKD-------ATIVIPNANA
        TDYRLQQLH AVN+ A+    +      +    + L G  +   A   P +KD       AT+V P++NA
Subjt:  TDYRLQQLH-AVNEAASIEPSK------SGMDSVLLSGQRINFEATQRPLNKD-------ATIVIPNANA

Q9XI47 Protein SAWADEE HOMEODOMAIN HOMOLOG 11.1e-3938.77Show/hide
Query:  FTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNR-RYAIRAKTTKAPGKLAVSPVVQIE-----STPVRNVPQTIVV
        FT SE+ +ME + +   +    ++    +A  FS SV R GK ++  KQV  WFQ + ++  + K+   P     SP +QI      S+   N      V
Subjt:  FTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNR-RYAIRAKTTKAPGKLAVSPVVQIE-----STPVRNVPQTIVV

Query:  PAPAPVGAAKG-APENPLSEFEAKSGRDGAWYDVATFLSHRSLESGDPEVLVRFAGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKE
             V   KG A +     FEAKS RD AWYDV++FL++R L +G+ EV VRF+GF +  DEWVN++ ++R RS+P E SEC  V  GDL+LCFQE ++
Subjt:  PAPAPVGAAKG-APENPLSEFEAKSGRDGAWYDVATFLSHRSLESGDPEVLVRFAGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKE

Query:  QALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSELISHEMKTLALDVEKSWKLRTLLMVAAEIVQLRKICRRPE
        QALY D HVL+ +R  HD   C C FLVRY+ D +E                           E + L +ICRRPE
Subjt:  QALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSELISHEMKTLALDVEKSWKLRTLLMVAAEIVQLRKICRRPE

Arabidopsis top hitse value%identityAlignment
AT1G15215.2 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors8.0e-4138.77Show/hide
Query:  FTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNR-RYAIRAKTTKAPGKLAVSPVVQIE-----STPVRNVPQTIVV
        FT SE+ +ME + +   +    ++    +A  FS SV R GK ++  KQV  WFQ + ++  + K+   P     SP +QI      S+   N      V
Subjt:  FTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNR-RYAIRAKTTKAPGKLAVSPVVQIE-----STPVRNVPQTIVV

Query:  PAPAPVGAAKG-APENPLSEFEAKSGRDGAWYDVATFLSHRSLESGDPEVLVRFAGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKE
             V   KG A +     FEAKS RD AWYDV++FL++R L +G+ EV VRF+GF +  DEWVN++ ++R RS+P E SEC  V  GDL+LCFQE ++
Subjt:  PAPAPVGAAKG-APENPLSEFEAKSGRDGAWYDVATFLSHRSLESGDPEVLVRFAGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKE

Query:  QALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSELISHEMKTLALDVEKSWKLRTLLMVAAEIVQLRKICRRPE
        QALY D HVL+ +R  HD   C C FLVRY+ D +E                           E + L +ICRRPE
Subjt:  QALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSELISHEMKTLALDVEKSWKLRTLLMVAAEIVQLRKICRRPE

AT1G15215.3 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors1.4e-4041.95Show/hide
Query:  FTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNR-RYAIRAKTTKAPGKLAVSPVVQIE-----STPVRNVPQTIVV
        FT SE+ +ME + +   +    ++    +A  FS SV R GK ++  KQV  WFQ + ++  + K+   P     SP +QI      S+   N      V
Subjt:  FTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNR-RYAIRAKTTKAPGKLAVSPVVQIE-----STPVRNVPQTIVV

Query:  PAPAPVGAAKG-APENPLSEFEAKSGRDGAWYDVATFLSHRSLESGDPEVLVRFAGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKE
             V   KG A +     FEAKS RD AWYDV++FL++R L +G+ EV VRF+GF +  DEWVN++ ++R RS+P E SEC  V  GDL+LCFQE ++
Subjt:  PAPAPVGAAKG-APENPLSEFEAKSGRDGAWYDVATFLSHRSLESGDPEVLVRFAGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKE

Query:  QALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE
        QALY D HVL+ +R  HD   C C FLVRY+ D +E
Subjt:  QALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE

AT3G18380.1 sequence-specific DNA binding transcription factors;sequence-specific DNA binding1.8e-10155.68Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIE-STPVR
        MGRPPSNGGPAFRF   EV EMEAIL  HN  MP R +L ALA+KFSES ERKGK+ VQ KQ+WNWFQNRRYA+RA+  KAPGKL VS + +++    +R
Subjt:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIE-STPVR

Query:  NVPQTIVVP------------APAPVGA-----AKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSLESGDPEVLVRFAGFGSEEDEWVNIRRNIRPRSL
        +V Q + VP             PAP G+      +   +N   EFEAKS RDGAWYDV  FL+HR+LE GDPEV VRFAGF  EEDEW+N+++++R RSL
Subjt:  NVPQTIVVP------------APAPVGA-----AKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSLESGDPEVLVRFAGFGSEEDEWVNIRRNIRPRSL

Query:  PCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSELISHEMKTLALDVEKSWKLRTLLMVAAEIVQLRKICRRPE
        PCE+SECVAVL GDL+LCFQEGK+QALYFDA VLD QRRRHDVRGCRCRFLVRY HDQSE                           EIV LRKICRRPE
Subjt:  PCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSELISHEMKTLALDVEKSWKLRTLLMVAAEIVQLRKICRRPE

Query:  TDYRLQQLH-AVNEAASIEPSK------SGMDSVLLSGQRINFEATQRPLNKD-------ATIVIPNANA
        TDYRLQQLH AVN+ A+    +      +    + L G  +   A   P +KD       AT+V P++NA
Subjt:  TDYRLQQLH-AVNEAASIEPSK------SGMDSVLLSGQRINFEATQRPLNKD-------ATIVIPNANA

AT3G18380.2 sequence-specific DNA binding transcription factors;sequence-specific DNA binding1.8e-10155.68Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIE-STPVR
        MGRPPSNGGPAFRF   EV EMEAIL  HN  MP R +L ALA+KFSES ERKGK+ VQ KQ+WNWFQNRRYA+RA+  KAPGKL VS + +++    +R
Subjt:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIE-STPVR

Query:  NVPQTIVVP------------APAPVGA-----AKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSLESGDPEVLVRFAGFGSEEDEWVNIRRNIRPRSL
        +V Q + VP             PAP G+      +   +N   EFEAKS RDGAWYDV  FL+HR+LE GDPEV VRFAGF  EEDEW+N+++++R RSL
Subjt:  NVPQTIVVP------------APAPVGA-----AKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSLESGDPEVLVRFAGFGSEEDEWVNIRRNIRPRSL

Query:  PCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSELISHEMKTLALDVEKSWKLRTLLMVAAEIVQLRKICRRPE
        PCE+SECVAVL GDL+LCFQEGK+QALYFDA VLD QRRRHDVRGCRCRFLVRY HDQSE                           EIV LRKICRRPE
Subjt:  PCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSELISHEMKTLALDVEKSWKLRTLLMVAAEIVQLRKICRRPE

Query:  TDYRLQQLH-AVNEAASIEPSK------SGMDSVLLSGQRINFEATQRPLNKD-------ATIVIPNANA
        TDYRLQQLH AVN+ A+    +      +    + L G  +   A   P +KD       AT+V P++NA
Subjt:  TDYRLQQLH-AVNEAASIEPSK------SGMDSVLLSGQRINFEATQRPLNKD-------ATIVIPNANA

AT3G18380.3 sequence-specific DNA binding transcription factors;sequence-specific DNA binding4.8e-10255.86Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIE-STPVR
        MGRPPSNGGPAFRF   EV EMEAIL  HN  MP R +L ALA+KFSES ERKGK+ VQ KQ+WNWFQNRRYA+RA+  KAPGKL VS + +++    +R
Subjt:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIE-STPVR

Query:  NVPQTIVV--------------PAPAPVGAAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSLESGDPEVLVRFAGFGSEEDEWVNIRRNIRPRSLPCE
        +V Q + V              PAP+  G  +   +N   EFEAKS RDGAWYDV  FL+HR+LE GDPEV VRFAGF  EEDEW+N+++++R RSLPCE
Subjt:  NVPQTIVV--------------PAPAPVGAAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSLESGDPEVLVRFAGFGSEEDEWVNIRRNIRPRSLPCE

Query:  SSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSELISHEMKTLALDVEKSWKLRTLLMVAAEIVQLRKICRRPETDY
        +SECVAVL GDL+LCFQEGK+QALYFDA VLD QRRRHDVRGCRCRFLVRY HDQSE                           EIV LRKICRRPETDY
Subjt:  SSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSELISHEMKTLALDVEKSWKLRTLLMVAAEIVQLRKICRRPETDY

Query:  RLQQLH-AVNEAASIEPSK------SGMDSVLLSGQRINFEATQRPLNKD-------ATIVIPNANA
        RLQQLH AVN+ A+    +      +    + L G  +   A   P +KD       AT+V P++NA
Subjt:  RLQQLH-AVNEAASIEPSK------SGMDSVLLSGQRINFEATQRPLNKD-------ATIVIPNANA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTCGGCCTCCCAGCAATGGAGGCCCTGCCTTCCGCTTCACTGCTTCCGAGGTTGCAGAGATGGAAGCTATATTGCAAGGACATAATAATACAATGCCAGCTCGGGA
AGTTCTTGTTGCCCTTGCTGAGAAGTTCAGTGAATCAGTAGAACGGAAAGGGAAGATTGCTGTGCAAATGAAGCAAGTTTGGAATTGGTTCCAGAATAGACGATATGCTA
TCAGAGCAAAGACCACGAAGGCCCCTGGAAAGTTAGCCGTCTCTCCAGTTGTCCAAATTGAGTCAACTCCCGTGAGAAATGTGCCTCAAACCATTGTTGTTCCTGCTCCC
GCACCAGTAGGCGCTGCAAAGGGTGCTCCAGAAAATCCATTGTCGGAATTTGAAGCTAAATCTGGGAGGGATGGTGCATGGTATGATGTTGCTACCTTTTTATCCCATAG
ATCTCTGGAAAGTGGTGACCCGGAAGTACTAGTTAGATTTGCTGGTTTTGGATCAGAGGAGGATGAGTGGGTTAATATTCGAAGGAACATTAGACCTCGTTCTCTACCTT
GTGAATCATCAGAATGTGTGGCAGTTCTTCCAGGCGATCTCATCTTATGCTTTCAGGAGGGTAAAGAGCAGGCACTTTACTTTGATGCCCATGTGCTTGATACACAAAGA
AGAAGACATGATGTACGAGGTTGTCGCTGCAGGTTTTTGGTCCGTTATGATCACGATCAATCTGAGTTAATTTCACATGAGATGAAGACTTTAGCTCTTGATGTGGAGAA
AAGTTGGAAATTAAGAACTTTGTTAATGGTTGCTGCTGAAATTGTTCAGTTGAGAAAGATTTGTCGTCGGCCCGAGACTGATTACAGGTTGCAACAGCTTCATGCTGTAA
ATGAAGCAGCATCGATCGAGCCCTCAAAGTCTGGCATGGATTCTGTACTGCTCAGCGGTCAGAGGATAAATTTCGAGGCAACACAAAGACCACTCAACAAGGATGCAACC
ATCGTTATACCAAATGCAAATGCCAATATAAATGTCCATGCCCAAACTAGTACTCAGGAAGCAAGGAATATAGAAACTAACAGTGCTCCAACCACATTCAACTCCGGTAA
TCCCGCAGGTAGCTCTGCGTTCTCGAGTGGTATCGTGACGAGCTCTGTTTCTGGTGGGTCGGCAGACAATGTGTCTGATGGGAAATTACTTAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGTCGGCCTCCCAGCAATGGAGGCCCTGCCTTCCGCTTCACTGCTTCCGAGGTTGCAGAGATGGAAGCTATATTGCAAGGACATAATAATACAATGCCAGCTCGGGA
AGTTCTTGTTGCCCTTGCTGAGAAGTTCAGTGAATCAGTAGAACGGAAAGGGAAGATTGCTGTGCAAATGAAGCAAGTTTGGAATTGGTTCCAGAATAGACGATATGCTA
TCAGAGCAAAGACCACGAAGGCCCCTGGAAAGTTAGCCGTCTCTCCAGTTGTCCAAATTGAGTCAACTCCCGTGAGAAATGTGCCTCAAACCATTGTTGTTCCTGCTCCC
GCACCAGTAGGCGCTGCAAAGGGTGCTCCAGAAAATCCATTGTCGGAATTTGAAGCTAAATCTGGGAGGGATGGTGCATGGTATGATGTTGCTACCTTTTTATCCCATAG
ATCTCTGGAAAGTGGTGACCCGGAAGTACTAGTTAGATTTGCTGGTTTTGGATCAGAGGAGGATGAGTGGGTTAATATTCGAAGGAACATTAGACCTCGTTCTCTACCTT
GTGAATCATCAGAATGTGTGGCAGTTCTTCCAGGCGATCTCATCTTATGCTTTCAGGAGGGTAAAGAGCAGGCACTTTACTTTGATGCCCATGTGCTTGATACACAAAGA
AGAAGACATGATGTACGAGGTTGTCGCTGCAGGTTTTTGGTCCGTTATGATCACGATCAATCTGAGTTAATTTCACATGAGATGAAGACTTTAGCTCTTGATGTGGAGAA
AAGTTGGAAATTAAGAACTTTGTTAATGGTTGCTGCTGAAATTGTTCAGTTGAGAAAGATTTGTCGTCGGCCCGAGACTGATTACAGGTTGCAACAGCTTCATGCTGTAA
ATGAAGCAGCATCGATCGAGCCCTCAAAGTCTGGCATGGATTCTGTACTGCTCAGCGGTCAGAGGATAAATTTCGAGGCAACACAAAGACCACTCAACAAGGATGCAACC
ATCGTTATACCAAATGCAAATGCCAATATAAATGTCCATGCCCAAACTAGTACTCAGGAAGCAAGGAATATAGAAACTAACAGTGCTCCAACCACATTCAACTCCGGTAA
TCCCGCAGGTAGCTCTGCGTTCTCGAGTGGTATCGTGACGAGCTCTGTTTCTGGTGGGTCGGCAGACAATGTGTCTGATGGGAAATTACTTAGTTGA
Protein sequenceShow/hide protein sequence
MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPVVQIESTPVRNVPQTIVVPAP
APVGAAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSLESGDPEVLVRFAGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQR
RRHDVRGCRCRFLVRYDHDQSELISHEMKTLALDVEKSWKLRTLLMVAAEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFEATQRPLNKDAT
IVIPNANANINVHAQTSTQEARNIETNSAPTTFNSGNPAGSSAFSSGIVTSSVSGGSADNVSDGKLLS