; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G009190 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G009190
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Descriptionprotein SAWADEE HOMEODOMAIN HOMOLOG 2-like
Genome locationCG_Chr05:9981509..9987996
RNA-Seq ExpressionClCG05G009190
SyntenyClCG05G009190
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003682 - chromatin binding (molecular function)
InterPro domainsIPR001356 - Homeobox domain
IPR009057 - Homeobox-like domain superfamily
IPR032001 - SAWADEE domain
IPR039276 - Protein SAWADEE HOMEODOMAIN HOMOLOG 1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8650545.1 hypothetical protein Csa_011086 [Cucumis sativus]9.2e-18887.53Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIESTPVRN
        MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALA+KFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKT+KAPGKLAVSP+VQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIESTPVRN

Query:  VPQTIVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPVILRLLHGRSLIVVVSQEVLVRFSGFGSEEDEWVNIRRNIRPRSLP
        VPQT+VVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDP                 EVLVRFSGFGSEEDEWVNIRRNIRPRSLP
Subjt:  VPQTIVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPVILRLLHGRSLIVVVSQEVLVRFSGFGSEEDEWVNIRRNIRPRSLP

Query:  CESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSAQ
        CESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKS MDS  
Subjt:  CESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSAQ

Query:  LCGQRINFETTQKPLSKDAALVIPNANANANINVHAQTSTQEARNTETNSVPTTFNSGNHSAPTTFNSGNPASSSAFTSGIVTNTVSGGLADNVSDGKLL
        L GQRINFET+Q PLSKDAALVIP  NAN +IN HAQTSTQEARNTETN           +APTTFNS N A SSAF+SGIVTNTVS G ADNVSDGKLL
Subjt:  LCGQRINFETTQKPLSKDAALVIPNANANANINVHAQTSTQEARNTETNSVPTTFNSGNHSAPTTFNSGNPASSSAFTSGIVTNTVSGGLADNVSDGKLL

Query:  S
        S
Subjt:  S

TYK11257.1 protein SAWADEE HOMEODOMAIN-like protein 2 isoform X1 [Cucumis melo var. makuwa]6.1e-19288.03Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIESTPVRN
        MGRPPSNGGPAFRFTASEVAEME ILQGHNNTMPAREVLVALA+KFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKT+KAPGKLAVSP+VQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIESTPVRN

Query:  VPQTIVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPVILRLLHGRSLIVVVSQEVLVRFSGFGSEEDEWVNIRRNIRPRSLP
        VPQT+VVPAP PVG+AK APENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDP                 EVLVRFSGFGSEEDEWVNIRRNIRPRSLP
Subjt:  VPQTIVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPVILRLLHGRSLIVVVSQEVLVRFSGFGSEEDEWVNIRRNIRPRSLP

Query:  CESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSAQ
        CESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDH+QSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKS MDS  
Subjt:  CESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSAQ

Query:  LCGQRINFETTQKPLSKDAALVIPNANANANINVHAQTSTQEARNTETNSVPTTFNSGNHSAPTTFNSGNPASSSAFTSGIVTNTVSGGLADNVSDGKLL
        L GQRINFET Q PLSKDAALVIPNAN +   N HAQTSTQEARNTETN+ P TF+SGNH+APTTFNS N A SSAF+SGIVTNTVSGG ADNVSDGKLL
Subjt:  LCGQRINFETTQKPLSKDAALVIPNANANANINVHAQTSTQEARNTETNSVPTTFNSGNHSAPTTFNSGNPASSSAFTSGIVTNTVSGGLADNVSDGKLL

Query:  S
        S
Subjt:  S

XP_008456010.1 PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X1 [Cucumis melo]4.2e-19388.53Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIESTPVRN
        MGRPPSNGGPAFRFTASEVAEME ILQGHNNTMPAREVLVALA+KFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKT+KAPGKLAVSP+VQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIESTPVRN

Query:  VPQTIVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPVILRLLHGRSLIVVVSQEVLVRFSGFGSEEDEWVNIRRNIRPRSLP
        VPQT+VVPAP PVG+AK APENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDP                 EVLVRFSGFGSEEDEWVNIRRNIRPRSLP
Subjt:  VPQTIVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPVILRLLHGRSLIVVVSQEVLVRFSGFGSEEDEWVNIRRNIRPRSLP

Query:  CESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSAQ
        CESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKS MDS  
Subjt:  CESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSAQ

Query:  LCGQRINFETTQKPLSKDAALVIPNANANANINVHAQTSTQEARNTETNSVPTTFNSGNHSAPTTFNSGNPASSSAFTSGIVTNTVSGGLADNVSDGKLL
        L GQRINFET Q PLSKDAALVIP  NAN +IN HAQTSTQEARNTETN+ P TF+SGNH+APTTFNS N A SSAF+SGIVTNTVSGG ADNVSDGKLL
Subjt:  LCGQRINFETTQKPLSKDAALVIPNANANANINVHAQTSTQEARNTETNSVPTTFNSGNHSAPTTFNSGNPASSSAFTSGIVTNTVSGGLADNVSDGKLL

Query:  S
        S
Subjt:  S

XP_038878066.1 protein SAWADEE HOMEODOMAIN HOMOLOG 2-like isoform X1 [Benincasa hispida]5.2e-19991.52Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIESTPVRN
        MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIESTPVRN

Query:  VPQTIVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPVILRLLHGRSLIVVVSQEVLVRFSGFGSEEDEWVNIRRNIRPRSLP
        VPQTIVVPAP PVGSAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDP                 EVLVRFSGFGSEEDEWVNIRRNIRPRSLP
Subjt:  VPQTIVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPVILRLLHGRSLIVVVSQEVLVRFSGFGSEEDEWVNIRRNIRPRSLP

Query:  CESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSAQ
        CESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDS  
Subjt:  CESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSAQ

Query:  LCGQRINFETTQKPLSKDAALVIPNANANANINVHAQTSTQEARNTETNSVPTTFNSGNHSAPTTFNSGNPASSSAFTSGIVTNTVSGGLADNVSDGKLL
        L GQRINFETTQKPL+KD  LVIP  NANANINVHAQT+TQEARNTETNS PTTFNSG HSAPTTFNSGNPA  SAF+ GIVTNTVSGG ADNVSDGKLL
Subjt:  LCGQRINFETTQKPLSKDAALVIPNANANANINVHAQTSTQEARNTETNSVPTTFNSGNHSAPTTFNSGNPASSSAFTSGIVTNTVSGGLADNVSDGKLL

Query:  S
        S
Subjt:  S

XP_038878067.1 protein SAWADEE HOMEODOMAIN HOMOLOG 2-like isoform X2 [Benincasa hispida]3.9e-18691.05Show/hide
Query:  MEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIESTPVRNVPQTIVVPAPAPVGSAKGAPE
        MEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIESTPVRNVPQTIVVPAP PVGSAKGAPE
Subjt:  MEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIESTPVRNVPQTIVVPAPAPVGSAKGAPE

Query:  NPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPVILRLLHGRSLIVVVSQEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEG
        NPLSEFEAKSGRDGAWYDVATFLSHRSVESGDP                 EVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEG
Subjt:  NPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPVILRLLHGRSLIVVVSQEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEG

Query:  KEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSAQLCGQRINFETTQKPLSKDAAL
        KEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDS  L GQRINFETTQKPL+KD  L
Subjt:  KEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSAQLCGQRINFETTQKPLSKDAAL

Query:  VIPNANANANINVHAQTSTQEARNTETNSVPTTFNSGNHSAPTTFNSGNPASSSAFTSGIVTNTVSGGLADNVSDGKLLS
        VIP  NANANINVHAQT+TQEARNTETNS PTTFNSG HSAPTTFNSGNPA  SAF+ GIVTNTVSGG ADNVSDGKLLS
Subjt:  VIPNANANANINVHAQTSTQEARNTETNSVPTTFNSGNHSAPTTFNSGNPASSSAFTSGIVTNTVSGGLADNVSDGKLLS

TrEMBL top hitse value%identityAlignment
A0A0A0LC67 SAWADEE domain-containing protein4.0e-18186.03Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIESTPVRN
        MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALA+KFSESVERKGKIAVQMK      QNRRYAIRAKT+KAPGKLAVSP+VQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIESTPVRN

Query:  VPQTIVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPVILRLLHGRSLIVVVSQEVLVRFSGFGSEEDEWVNIRRNIRPRSLP
        VPQT+VVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDP                 EVLVRFSGFGSEEDEWVNIRRNIRPRSLP
Subjt:  VPQTIVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPVILRLLHGRSLIVVVSQEVLVRFSGFGSEEDEWVNIRRNIRPRSLP

Query:  CESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSAQ
        CESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKS MDS  
Subjt:  CESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSAQ

Query:  LCGQRINFETTQKPLSKDAALVIPNANANANINVHAQTSTQEARNTETNSVPTTFNSGNHSAPTTFNSGNPASSSAFTSGIVTNTVSGGLADNVSDGKLL
        L GQRINFET+Q PLSKDAALVIP  NAN +IN HAQTSTQEARNTETN           +APTTFNS N A SSAF+SGIVTNTVS G ADNVSDGKLL
Subjt:  LCGQRINFETTQKPLSKDAALVIPNANANANINVHAQTSTQEARNTETNSVPTTFNSGNHSAPTTFNSGNPASSSAFTSGIVTNTVSGGLADNVSDGKLL

Query:  S
        S
Subjt:  S

A0A1S3C274 protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X12.1e-19388.53Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIESTPVRN
        MGRPPSNGGPAFRFTASEVAEME ILQGHNNTMPAREVLVALA+KFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKT+KAPGKLAVSP+VQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIESTPVRN

Query:  VPQTIVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPVILRLLHGRSLIVVVSQEVLVRFSGFGSEEDEWVNIRRNIRPRSLP
        VPQT+VVPAP PVG+AK APENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDP                 EVLVRFSGFGSEEDEWVNIRRNIRPRSLP
Subjt:  VPQTIVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPVILRLLHGRSLIVVVSQEVLVRFSGFGSEEDEWVNIRRNIRPRSLP

Query:  CESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSAQ
        CESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKS MDS  
Subjt:  CESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSAQ

Query:  LCGQRINFETTQKPLSKDAALVIPNANANANINVHAQTSTQEARNTETNSVPTTFNSGNHSAPTTFNSGNPASSSAFTSGIVTNTVSGGLADNVSDGKLL
        L GQRINFET Q PLSKDAALVIP  NAN +IN HAQTSTQEARNTETN+ P TF+SGNH+APTTFNS N A SSAF+SGIVTNTVSGG ADNVSDGKLL
Subjt:  LCGQRINFETTQKPLSKDAALVIPNANANANINVHAQTSTQEARNTETNSVPTTFNSGNHSAPTTFNSGNPASSSAFTSGIVTNTVSGGLADNVSDGKLL

Query:  S
        S
Subjt:  S

A0A1S3C2X6 protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X21.5e-18087.89Show/hide
Query:  MEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIESTPVRNVPQTIVVPAPAPVGSAKGAPE
        ME ILQGHNNTMPAREVLVALA+KFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKT+KAPGKLAVSP+VQIESTPVRNVPQT+VVPAP PVG+AK APE
Subjt:  MEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIESTPVRNVPQTIVVPAPAPVGSAKGAPE

Query:  NPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPVILRLLHGRSLIVVVSQEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEG
        NPLSEFEAKSGRDGAWYDVATFLSHRSVESGDP                 EVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEG
Subjt:  NPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPVILRLLHGRSLIVVVSQEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEG

Query:  KEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSAQLCGQRINFETTQKPLSKDAAL
        KEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKS MDS  L GQRINFET Q PLSKDAAL
Subjt:  KEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSAQLCGQRINFETTQKPLSKDAAL

Query:  VIPNANANANINVHAQTSTQEARNTETNSVPTTFNSGNHSAPTTFNSGNPASSSAFTSGIVTNTVSGGLADNVSDGKLLS
        VIP  NAN +IN HAQTSTQEARNTETN+ P TF+SGNH+APTTFNS N A SSAF+SGIVTNTVSGG ADNVSDGKLLS
Subjt:  VIPNANANANINVHAQTSTQEARNTETNSVPTTFNSGNHSAPTTFNSGNPASSSAFTSGIVTNTVSGGLADNVSDGKLLS

A0A5A7UUV7 SAWADEE HOMEODOMAIN-like protein 2 isoform X12.1e-19388.53Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIESTPVRN
        MGRPPSNGGPAFRFTASEVAEME ILQGHNNTMPAREVLVALA+KFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKT+KAPGKLAVSP+VQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIESTPVRN

Query:  VPQTIVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPVILRLLHGRSLIVVVSQEVLVRFSGFGSEEDEWVNIRRNIRPRSLP
        VPQT+VVPAP PVG+AK APENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDP                 EVLVRFSGFGSEEDEWVNIRRNIRPRSLP
Subjt:  VPQTIVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPVILRLLHGRSLIVVVSQEVLVRFSGFGSEEDEWVNIRRNIRPRSLP

Query:  CESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSAQ
        CESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKS MDS  
Subjt:  CESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSAQ

Query:  LCGQRINFETTQKPLSKDAALVIPNANANANINVHAQTSTQEARNTETNSVPTTFNSGNHSAPTTFNSGNPASSSAFTSGIVTNTVSGGLADNVSDGKLL
        L GQRINFET Q PLSKDAALVIP  NAN +IN HAQTSTQEARNTETN+ P TF+SGNH+APTTFNS N A SSAF+SGIVTNTVSGG ADNVSDGKLL
Subjt:  LCGQRINFETTQKPLSKDAALVIPNANANANINVHAQTSTQEARNTETNSVPTTFNSGNHSAPTTFNSGNPASSSAFTSGIVTNTVSGGLADNVSDGKLL

Query:  S
        S
Subjt:  S

A0A5D3CH38 Protein SAWADEE HOMEODOMAIN-like protein 2 isoform X13.0e-19288.03Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIESTPVRN
        MGRPPSNGGPAFRFTASEVAEME ILQGHNNTMPAREVLVALA+KFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKT+KAPGKLAVSP+VQIESTPVRN
Subjt:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIESTPVRN

Query:  VPQTIVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPVILRLLHGRSLIVVVSQEVLVRFSGFGSEEDEWVNIRRNIRPRSLP
        VPQT+VVPAP PVG+AK APENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDP                 EVLVRFSGFGSEEDEWVNIRRNIRPRSLP
Subjt:  VPQTIVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPVILRLLHGRSLIVVVSQEVLVRFSGFGSEEDEWVNIRRNIRPRSLP

Query:  CESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSAQ
        CESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDH+QSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKS MDS  
Subjt:  CESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSAQ

Query:  LCGQRINFETTQKPLSKDAALVIPNANANANINVHAQTSTQEARNTETNSVPTTFNSGNHSAPTTFNSGNPASSSAFTSGIVTNTVSGGLADNVSDGKLL
        L GQRINFET Q PLSKDAALVIPNAN +   N HAQTSTQEARNTETN+ P TF+SGNH+APTTFNS N A SSAF+SGIVTNTVSGG ADNVSDGKLL
Subjt:  LCGQRINFETTQKPLSKDAALVIPNANANANINVHAQTSTQEARNTETNSVPTTFNSGNHSAPTTFNSGNPASSSAFTSGIVTNTVSGGLADNVSDGKLL

Query:  S
        S
Subjt:  S

SwissProt top hitse value%identityAlignment
Q8RWJ7 Protein SAWADEE HOMEODOMAIN HOMOLOG 23.0e-10157.98Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIE-STPVR
        MGRPPSNGGPAFRF   EV EMEAIL  HN  MP R +L ALA+KFSES ERKGK+ VQ KQ+WNWFQNRRYA+RA+  KAPGKL VS + +++    +R
Subjt:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIE-STPVR

Query:  NVPQTIVVP------------APAPVGS-----AKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPVILRLLHGRSLIVVVSQEVLVRFSGFGS
        +V Q + VP             PAP GS      +   +N   EFEAKS RDGAWYDV  FL+HR++E GDP                 EV VRF+GF  
Subjt:  NVPQTIVVP------------APAPVGS-----AKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPVILRLLHGRSLIVVVSQEVLVRFSGFGS

Query:  EEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLH-
        EEDEW+N+++++R RSLPCE+SECVAVL GDL+LCFQEGK+QALYFDA VLD QRRRHDVRGCRCRFLVRY HDQSEEIV LRKICRRPETDYRLQQLH 
Subjt:  EEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLH-

Query:  AVNEAA-SIEPSKSSMDSAQLCGQRINFETTQ--KPLSKD-------AALVIPNANA
        AVN+ A S +    ++D+A      +   T     P SKD       A LV P++NA
Subjt:  AVNEAA-SIEPSKSSMDSAQLCGQRINFETTQ--KPLSKD-------AALVIPNANA

Q9XI47 Protein SAWADEE HOMEODOMAIN HOMOLOG 12.7e-4140.23Show/hide
Query:  FTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNR-RYAIRAKTTKAPGKLAVSPIVQIE-----STPVRNVPQTIVV
        FT SE+ +ME + +   +    ++    +A  FS SV R GK ++  KQV  WFQ + ++  + K+   P     SP +QI      S+   N      V
Subjt:  FTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNR-RYAIRAKTTKAPGKLAVSPIVQIE-----STPVRNVPQTIVV

Query:  PAPAPVGSAKG-APENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPVILRLLHGRSLIVVVSQEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSEC
             V + KG A +     FEAKS RD AWYDV++FL++R + +G+                  EV VRFSGF +  DEWVN++ ++R RS+P E SEC
Subjt:  PAPAPVGSAKG-APENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPVILRLLHGRSLIVVVSQEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSEC

Query:  VAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPE
          V  GDL+LCFQE ++QALY D HVL+ +R  HD   C C FLVRY+ D +EE + L +ICRRPE
Subjt:  VAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPE

Arabidopsis top hitse value%identityAlignment
AT1G15215.2 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors1.9e-4240.23Show/hide
Query:  FTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNR-RYAIRAKTTKAPGKLAVSPIVQIE-----STPVRNVPQTIVV
        FT SE+ +ME + +   +    ++    +A  FS SV R GK ++  KQV  WFQ + ++  + K+   P     SP +QI      S+   N      V
Subjt:  FTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNR-RYAIRAKTTKAPGKLAVSPIVQIE-----STPVRNVPQTIVV

Query:  PAPAPVGSAKG-APENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPVILRLLHGRSLIVVVSQEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSEC
             V + KG A +     FEAKS RD AWYDV++FL++R + +G+                  EV VRFSGF +  DEWVN++ ++R RS+P E SEC
Subjt:  PAPAPVGSAKG-APENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPVILRLLHGRSLIVVVSQEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSEC

Query:  VAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPE
          V  GDL+LCFQE ++QALY D HVL+ +R  HD   C C FLVRY+ D +EE + L +ICRRPE
Subjt:  VAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPE

AT1G15215.3 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors1.4e-3739.13Show/hide
Query:  FTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNR-RYAIRAKTTKAPGKLAVSPIVQIE-----STPVRNVPQTIVV
        FT SE+ +ME + +   +    ++    +A  FS SV R GK ++  KQV  WFQ + ++  + K+   P     SP +QI      S+   N      V
Subjt:  FTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNR-RYAIRAKTTKAPGKLAVSPIVQIE-----STPVRNVPQTIVV

Query:  PAPAPVGSAKG-APENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPVILRLLHGRSLIVVVSQEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSEC
             V + KG A +     FEAKS RD AWYDV++FL++R + +G+                  EV VRFSGF +  DEWVN++ ++R RS+P E SEC
Subjt:  PAPAPVGSAKG-APENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPVILRLLHGRSLIVVVSQEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSEC

Query:  VAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE
          V  GDL+LCFQE ++QALY D HVL+ +R  HD   C C FLVRY+ D +E
Subjt:  VAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE

AT3G18380.1 sequence-specific DNA binding transcription factors;sequence-specific DNA binding2.2e-10257.98Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIE-STPVR
        MGRPPSNGGPAFRF   EV EMEAIL  HN  MP R +L ALA+KFSES ERKGK+ VQ KQ+WNWFQNRRYA+RA+  KAPGKL VS + +++    +R
Subjt:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIE-STPVR

Query:  NVPQTIVVP------------APAPVGS-----AKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPVILRLLHGRSLIVVVSQEVLVRFSGFGS
        +V Q + VP             PAP GS      +   +N   EFEAKS RDGAWYDV  FL+HR++E GDP                 EV VRF+GF  
Subjt:  NVPQTIVVP------------APAPVGS-----AKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPVILRLLHGRSLIVVVSQEVLVRFSGFGS

Query:  EEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLH-
        EEDEW+N+++++R RSLPCE+SECVAVL GDL+LCFQEGK+QALYFDA VLD QRRRHDVRGCRCRFLVRY HDQSEEIV LRKICRRPETDYRLQQLH 
Subjt:  EEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLH-

Query:  AVNEAA-SIEPSKSSMDSAQLCGQRINFETTQ--KPLSKD-------AALVIPNANA
        AVN+ A S +    ++D+A      +   T     P SKD       A LV P++NA
Subjt:  AVNEAA-SIEPSKSSMDSAQLCGQRINFETTQ--KPLSKD-------AALVIPNANA

AT3G18380.2 sequence-specific DNA binding transcription factors;sequence-specific DNA binding5.3e-10157.82Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIE-STPVR
        MGRPPSNGGPAFRF   EV EMEAIL  HN  MP R +L ALA+KFSES ERKGK+ VQ KQ+WNWFQNRRYA+RA+  KAPGKL VS + +++    +R
Subjt:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIE-STPVR

Query:  NVPQTIVVP------------APAPVGS-----AKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPVILRLLHGRSLIVVVSQEVLVRFSGFGS
        +V Q + VP             PAP GS      +   +N   EFEAKS RDGAWYDV  FL+HR++E GDP                 EV VRF+GF  
Subjt:  NVPQTIVVP------------APAPVGS-----AKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPVILRLLHGRSLIVVVSQEVLVRFSGFGS

Query:  EEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE-EIVQLRKICRRPETDYRLQQLH
        EEDEW+N+++++R RSLPCE+SECVAVL GDL+LCFQEGK+QALYFDA VLD QRRRHDVRGCRCRFLVRY HDQSE EIV LRKICRRPETDYRLQQLH
Subjt:  EEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE-EIVQLRKICRRPETDYRLQQLH

Query:  -AVNEAA-SIEPSKSSMDSAQLCGQRINFETTQ--KPLSKD-------AALVIPNANA
         AVN+ A S +    ++D+A      +   T     P SKD       A LV P++NA
Subjt:  -AVNEAA-SIEPSKSSMDSAQLCGQRINFETTQ--KPLSKD-------AALVIPNANA

AT3G18380.3 sequence-specific DNA binding transcription factors;sequence-specific DNA binding5.3e-10157.75Show/hide
Query:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIE-STPVR
        MGRPPSNGGPAFRF   EV EMEAIL  HN  MP R +L ALA+KFSES ERKGK+ VQ KQ+WNWFQNRRYA+RA+  KAPGKL VS + +++    +R
Subjt:  MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIE-STPVR

Query:  NVPQTIVV--------------PAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPVILRLLHGRSLIVVVSQEVLVRFSGFGSEED
        +V Q + V              PAP+  G  +   +N   EFEAKS RDGAWYDV  FL+HR++E GDP                 EV VRF+GF  EED
Subjt:  NVPQTIVV--------------PAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPVILRLLHGRSLIVVVSQEVLVRFSGFGSEED

Query:  EWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE-EIVQLRKICRRPETDYRLQQLH-AV
        EW+N+++++R RSLPCE+SECVAVL GDL+LCFQEGK+QALYFDA VLD QRRRHDVRGCRCRFLVRY HDQSE EIV LRKICRRPETDYRLQQLH AV
Subjt:  EWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE-EIVQLRKICRRPETDYRLQQLH-AV

Query:  NEAA-SIEPSKSSMDSAQLCGQRINFETTQ--KPLSKD-------AALVIPNANA
        N+ A S +    ++D+A      +   T     P SKD       A LV P++NA
Subjt:  NEAA-SIEPSKSSMDSAQLCGQRINFETTQ--KPLSKD-------AALVIPNANA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTCGGCCTCCCAGCAATGGAGGCCCTGCCTTCCGCTTCACGGCTTCGGAGGTTGCAGAGATGGAAGCTATATTGCAAGGACACAATAATACCATGCCAGCTCGGGA
AGTTCTTGTTGCCCTTGCTGAGAAGTTTAGTGAATCAGTAGAACGGAAAGGGAAGATTGCTGTGCAAATGAAGCAAGTTTGGAATTGGTTCCAGAATAGACGATACGCTA
TCAGAGCAAAGACAACCAAGGCCCCTGGAAAGTTAGCTGTCTCTCCAATTGTCCAAATTGAGTCAACTCCCGTGAGAAATGTGCCTCAAACCATAGTTGTTCCTGCTCCC
GCACCAGTAGGCTCTGCAAAGGGTGCTCCAGAAAATCCATTGTCGGAATTTGAAGCTAAATCTGGGAGGGATGGTGCATGGTATGACGTTGCTACCTTTTTATCCCATAG
ATCTGTGGAAAGTGGTGACCCGGTAATTTTGCGCCTCTTGCATGGAAGAAGCCTAATAGTAGTTGTTTCTCAGGAAGTACTGGTTAGATTTTCTGGTTTTGGATCCGAGG
AGGATGAGTGGGTTAATATCCGAAGGAACATTAGACCTCGTTCTCTACCTTGTGAATCATCGGAATGTGTGGCAGTTCTTCCAGGCGATCTCATCTTATGCTTTCAGGAG
GGTAAAGAGCAGGCACTTTACTTTGATGCCCATGTGCTTGATACACAACGAAGAAGACATGATGTACGAGGTTGTCGCTGCAGGTTTCTGGTTCGTTATGATCACGATCA
ATCAGAGGAAATTGTTCAGTTGAGAAAGATTTGCCGTCGGCCTGAGACTGATTACCGGTTGCAACAGCTTCACGCTGTAAATGAAGCAGCATCCATTGAGCCTTCAAAGT
CTAGCATGGATTCTGCACAGCTCTGTGGTCAGAGGATAAATTTTGAGACGACACAAAAGCCGCTCAGCAAGGATGCAGCCTTGGTTATACCAAATGCCAATGCAAATGCC
AATATAAATGTCCATGCCCAAACTAGTACTCAGGAAGCAAGGAACACTGAAACTAACAGTGTTCCAACCACATTCAACTCTGGCAATCACAGTGCCCCAACCACATTCAA
CTCTGGTAATCCCGCAAGTAGCTCTGCATTCACGAGTGGTATCGTGACGAACACTGTTTCTGGTGGGTTGGCTGACAATGTGTCTGATGGGAAGTTACTTAGTTGA
mRNA sequenceShow/hide mRNA sequence
AAAGTTTGATATTTGGTACGGAATCACCTGGGAATTTTGTTTTTGCCCCCCAAAGAAACGATGAATTTTTGGGCTTAAAGCTTCTATCTTTCTCTCCATCGTTATATATA
TATATGTATGTATATGTAATACATTCATTTATATTCGCTCAAAGATTTCCCAAAATAACGAATTTCATTTCTGTAGACGGGATCTTCAATTCTGATTTCTTTTTCCCGGC
ACTTTTTCTTTGTTTCTCTCTTGCTTCTTCGATAGCGACGCCGAAATCAGAGAAATACACCGAAAAACTTTAGCTTATGGGTCGGCCTCCCAGCAATGGAGGCCCTGCCT
TCCGCTTCACGGCTTCGGAGGTTGCAGAGATGGAAGCTATATTGCAAGGACACAATAATACCATGCCAGCTCGGGAAGTTCTTGTTGCCCTTGCTGAGAAGTTTAGTGAA
TCAGTAGAACGGAAAGGGAAGATTGCTGTGCAAATGAAGCAAGTTTGGAATTGGTTCCAGAATAGACGATACGCTATCAGAGCAAAGACAACCAAGGCCCCTGGAAAGTT
AGCTGTCTCTCCAATTGTCCAAATTGAGTCAACTCCCGTGAGAAATGTGCCTCAAACCATAGTTGTTCCTGCTCCCGCACCAGTAGGCTCTGCAAAGGGTGCTCCAGAAA
ATCCATTGTCGGAATTTGAAGCTAAATCTGGGAGGGATGGTGCATGGTATGACGTTGCTACCTTTTTATCCCATAGATCTGTGGAAAGTGGTGACCCGGTAATTTTGCGC
CTCTTGCATGGAAGAAGCCTAATAGTAGTTGTTTCTCAGGAAGTACTGGTTAGATTTTCTGGTTTTGGATCCGAGGAGGATGAGTGGGTTAATATCCGAAGGAACATTAG
ACCTCGTTCTCTACCTTGTGAATCATCGGAATGTGTGGCAGTTCTTCCAGGCGATCTCATCTTATGCTTTCAGGAGGGTAAAGAGCAGGCACTTTACTTTGATGCCCATG
TGCTTGATACACAACGAAGAAGACATGATGTACGAGGTTGTCGCTGCAGGTTTCTGGTTCGTTATGATCACGATCAATCAGAGGAAATTGTTCAGTTGAGAAAGATTTGC
CGTCGGCCTGAGACTGATTACCGGTTGCAACAGCTTCACGCTGTAAATGAAGCAGCATCCATTGAGCCTTCAAAGTCTAGCATGGATTCTGCACAGCTCTGTGGTCAGAG
GATAAATTTTGAGACGACACAAAAGCCGCTCAGCAAGGATGCAGCCTTGGTTATACCAAATGCCAATGCAAATGCCAATATAAATGTCCATGCCCAAACTAGTACTCAGG
AAGCAAGGAACACTGAAACTAACAGTGTTCCAACCACATTCAACTCTGGCAATCACAGTGCCCCAACCACATTCAACTCTGGTAATCCCGCAAGTAGCTCTGCATTCACG
AGTGGTATCGTGACGAACACTGTTTCTGGTGGGTTGGCTGACAATGTGTCTGATGGGAAGTTACTTAGTTGATTATGGGGAAAAATAAAATTCTCCATCAGTCTAATTTT
AACTGAACCTATCAATTTAAAATTTTGCCTGACTCGTTTATTTAGGATGAGTAAATAACTAGCGAAGTCTGTTCTTTTTGCCCCATGTTTCAAAGTTATAGGTTCCCATC
CTCTTGCTGTGATTGCTGAATGTTCTTGACGGATGAAAAATGGAGTCACACCTCGAGTGTAGTATCGAGCAGGTCAGAGGCATCATCTCTCTTGTTTTACCTTTTTCTTC
TCTGCTTAGCAAAACATGGCTCTACACTTGCAC
Protein sequenceShow/hide protein sequence
MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIESTPVRNVPQTIVVPAP
APVGSAKGAPENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPVILRLLHGRSLIVVVSQEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCFQE
GKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSSMDSAQLCGQRINFETTQKPLSKDAALVIPNANANA
NINVHAQTSTQEARNTETNSVPTTFNSGNHSAPTTFNSGNPASSSAFTSGIVTNTVSGGLADNVSDGKLLS