CuGenDBv2

Sed0014113 (gene) of Chayote v1 genome

Gene ID: Sed0014113
Organism: Sechium edule (Chayote v1)
Description: protein SAWADEE HOMEODOMAIN HOMOLOG 2-like
Genome location: LG04:30497036..30512029
RNA-Seq Expression: Sed0014113
Synteny: Sed0014113
Gene Ontology terms:
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003682 - chromatin binding (molecular function)
InterPro domains:
IPR001356 - Homeobox domain
IPR009057 - Homeobox-like domain superfamily
IPR032001 - SAWADEE domain
IPR039276 - Protein SAWADEE HOMEODOMAIN HOMOLOG 1/2
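
The genome location string above can be split into a sequence name and a coordinate pair. The following is a minimal Python sketch; the `Location` type and `parse_location` helper are hypothetical names introduced here, and treating the coordinates as 1-based and inclusive is an assumption that this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval (hypothetical helper type, not part of CuGenDB)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string of the form 'SEQID:START..END'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("LG04:30497036..30512029")
print(loc.seqid, loc.start, loc.end, loc.length)
```

The reported span covers the whole gene locus, including any introns and untranslated regions, so it is expected to be longer than the mRNA sequence listed further down.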


Homology
GenBank top hits (columns: e value, %identity, alignment)
KAE8650545.1 hypothetical protein Csa_011086 [Cucumis sativus]; e value 1.6e-173; identity 84.29%
Query:  MGRPPSNGGPVFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQIKQVWNWFQNRRYAIRAKTTKAPGKLAVSPTVQSGKLPVSP
        MGRPPSNGGP FRFTA EVAEM+AILQ HNNTMPAREVLVALA+KFSESVERKGKIAVQ+KQVWNWFQNRRYAIRAKT+KAPGKLA           VSP
Subjt:  MGRPPSNGGPVFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQIKQVWNWFQNRRYAIRAKTTKAPGKLAVSPTVQSGKLPVSP

Query:  IVQIESTPVRNQSHTTVAPAPTPVGSAKGATENLLSEFEAKSARDGAWYDVATLLSHRSVESGDPEVLVRFVGFGLEEDEWANIRRSIRPRSLPCESSEC
        +VQIESTPVRN   T V PAP PVGSAKGA EN LSEFEAKS RDGAWYDVAT LSHRSVESGDPEVLVRF GFG EEDEW NIRR+IRPRSLPCESSEC
Subjt:  IVQIESTPVRNQSHTTVAPAPTPVGSAKGATENLLSEFEAKSARDGAWYDVATLLSHRSVESGDPEVLVRFVGFGLEEDEWANIRRSIRPRSLPCESSEC

Query:  VAVLPGDLVLCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASAEPSKSGMDSVLLSSQRM
        VAVLPGDL+LCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAAS EPSKSGMDSVLLS QR+
Subjt:  VAVLPGDLVLCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASAEPSKSGMDSVLLSSQRM

Query:  SFETARK-LNKDATMVIPNANTNINVHAQTRTLEARNTETNNAPNAFNSSNPAGSSAFSSGIMMSSVSGGSADNVSDGKLLS
        +FET++  L+KDA +VIPNAN +IN HAQT T EARNTETN AP  FNS+N AGSSAFSSGI+ ++VS GSADNVSDGKLLS
Subjt:  SFETARK-LNKDATMVIPNANTNINVHAQTRTLEARNTETNNAPNAFNSSNPAGSSAFSSGIMMSSVSGGSADNVSDGKLLS

TYK11257.1 protein SAWADEE HOMEODOMAIN-like protein 2 isoform X1 [Cucumis melo var. makuwa]; e value 4.8e-170; identity 81.42%
Query:  MGRPPSNGGPVFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQIKQVWNWFQNRRYAIRAKTTKAPGKLAVSPTVQSGKLPVSP
        MGRPPSNGGP FRFTA EVAEM+ ILQ HNNTMPAREVLVALA+KFSESVERKGKIAVQ+KQVWNWFQNRRYAIRAKT+KAPGKLA           VSP
Subjt:  MGRPPSNGGPVFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQIKQVWNWFQNRRYAIRAKTTKAPGKLAVSPTVQSGKLPVSP

Query:  IVQIESTPVRNQSHTTVAPAPTPVGSAKGATENLLSEFEAKSARDGAWYDVATLLSHRSVESGDPEVLVRFVGFGLEEDEWANIRRSIRPRSLPCESSEC
        +VQIESTPVRN   T V PAPTPVG+AK A EN LSEFEAKS RDGAWYDVAT LSHRSVESGDPEVLVRF GFG EEDEW NIRR+IRPRSLPCESSEC
Subjt:  IVQIESTPVRNQSHTTVAPAPTPVGSAKGATENLLSEFEAKSARDGAWYDVATLLSHRSVESGDPEVLVRFVGFGLEEDEWANIRRSIRPRSLPCESSEC

Query:  VAVLPGDLVLCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASAEPSKSGMDSVLLSSQRM
        VAVLPGDL+LCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDH+QSEEIVQLRKICRRPETDYRLQQLHAVNEAAS EPSKSGMDSVLLS QR+
Subjt:  VAVLPGDLVLCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASAEPSKSGMDSVLLSSQRM

Query:  SFETARK-LNKDATMVIPNANTNINVHAQTRTLEARNTETN-----------NAPNAFNSSNPAGSSAFSSGIMMSSVSGGSADNVSDGKLLS
        +FET +  L+KDA +VIPNAN + N HAQT T EARNTETN           NAP  FNS+N AGSSAFSSGI+ ++VSGGSADNVSDGKLLS
Subjt:  SFETARK-LNKDATMVIPNANTNINVHAQTRTLEARNTETN-----------NAPNAFNSSNPAGSSAFSSGIMMSSVSGGSADNVSDGKLLS

XP_008456010.1 PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X1 [Cucumis melo]; e value 3.3e-171; identity 81.93%
Query:  MGRPPSNGGPVFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQIKQVWNWFQNRRYAIRAKTTKAPGKLAVSPTVQSGKLPVSP
        MGRPPSNGGP FRFTA EVAEM+ ILQ HNNTMPAREVLVALA+KFSESVERKGKIAVQ+KQVWNWFQNRRYAIRAKT+KAPGKLA           VSP
Subjt:  MGRPPSNGGPVFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQIKQVWNWFQNRRYAIRAKTTKAPGKLAVSPTVQSGKLPVSP

Query:  IVQIESTPVRNQSHTTVAPAPTPVGSAKGATENLLSEFEAKSARDGAWYDVATLLSHRSVESGDPEVLVRFVGFGLEEDEWANIRRSIRPRSLPCESSEC
        +VQIESTPVRN   T V PAPTPVG+AK A EN LSEFEAKS RDGAWYDVAT LSHRSVESGDPEVLVRF GFG EEDEW NIRR+IRPRSLPCESSEC
Subjt:  IVQIESTPVRNQSHTTVAPAPTPVGSAKGATENLLSEFEAKSARDGAWYDVATLLSHRSVESGDPEVLVRFVGFGLEEDEWANIRRSIRPRSLPCESSEC

Query:  VAVLPGDLVLCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASAEPSKSGMDSVLLSSQRM
        VAVLPGDL+LCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAAS EPSKSGMDSVLLS QR+
Subjt:  VAVLPGDLVLCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASAEPSKSGMDSVLLSSQRM

Query:  SFETARK-LNKDATMVIPNANTNINVHAQTRTLEARNTETN-----------NAPNAFNSSNPAGSSAFSSGIMMSSVSGGSADNVSDGKLLS
        +FET +  L+KDA +VIPNAN +IN HAQT T EARNTETN           NAP  FNS+N AGSSAFSSGI+ ++VSGGSADNVSDGKLLS
Subjt:  SFETARK-LNKDATMVIPNANTNINVHAQTRTLEARNTETN-----------NAPNAFNSSNPAGSSAFSSGIMMSSVSGGSADNVSDGKLLS

XP_022142790.1 protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X1 [Momordica charantia]; e value 4.1e-169; identity 82.46%
Query:  MGRPPSNGGPVFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQIKQVWNWFQNRRYAIRAKTTKAPGKLAVSPTVQSGKLPVSP
        MGRPPSNGGP FRFTA EVAEM+AILQ HNNTMPAREVLVALAEKFSES+ERKGKIAVQ+KQVWNWFQNRRYAIRAK+TKAPGKLA           VSP
Subjt:  MGRPPSNGGPVFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQIKQVWNWFQNRRYAIRAKTTKAPGKLAVSPTVQSGKLPVSP

Query:  IVQIESTPVRNQSHTTVAPAPTPVGSAKGATENLLSEFEAKSARDGAWYDVATLLSHRSVESGDPEVLVRFVGFGLEEDEWANIRRSIRPRSLPCESSEC
        IVQIESTPVRN   + V PAP PVGS KGA +N LSEFEAKSARDGAWYDVAT LSH+SVESGDPEVLVRF GFG EEDEW NIRR+IRPRSLPCESSEC
Subjt:  IVQIESTPVRNQSHTTVAPAPTPVGSAKGATENLLSEFEAKSARDGAWYDVATLLSHRSVESGDPEVLVRFVGFGLEEDEWANIRRSIRPRSLPCESSEC

Query:  VAVLPGDLVLCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASAEPSKSGMDSVLLSSQRM
        VAVLPGDL+LCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAAS EP KSGMDSVLLS  R+
Subjt:  VAVLPGDLVLCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASAEPSKSGMDSVLLSSQRM

Query:  SFE-TARKLNKDATMVIPNANTNINVHAQTRTLEARNTETNNAPNAFNSSNPAGSSAFSSGIMMSSVSGGSADNVSDGKLLS
        +FE T + L KDATMV PNAN N+NV AQT T E RN ET++ P +FNS NPAGSSAF SGI  +SVSGG  DNVSDGKLLS
Subjt:  SFE-TARKLNKDATMVIPNANTNINVHAQTRTLEARNTETNNAPNAFNSSNPAGSSAFSSGIMMSSVSGGSADNVSDGKLLS

XP_038878066.1 protein SAWADEE HOMEODOMAIN HOMOLOG 2-like isoform X1 [Benincasa hispida]; e value 3.2e-174; identity 83.46%
Query:  MGRPPSNGGPVFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQIKQVWNWFQNRRYAIRAKTTKAPGKLAVSPTVQSGKLPVSP
        MGRPPSNGGP FRFTA EVAEM+AILQ HNNTMPAREVLVALAEKFSESVERKGKIAVQ+KQVWNWFQNRRYAIRAKTTKAPGKLA           VSP
Subjt:  MGRPPSNGGPVFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQIKQVWNWFQNRRYAIRAKTTKAPGKLAVSPTVQSGKLPVSP

Query:  IVQIESTPVRNQSHTTVAPAPTPVGSAKGATENLLSEFEAKSARDGAWYDVATLLSHRSVESGDPEVLVRFVGFGLEEDEWANIRRSIRPRSLPCESSEC
        IVQIESTPVRN   T V PAP PVGSAKGA EN LSEFEAKS RDGAWYDVAT LSHRSVESGDPEVLVRF GFG EEDEW NIRR+IRPRSLPCESSEC
Subjt:  IVQIESTPVRNQSHTTVAPAPTPVGSAKGATENLLSEFEAKSARDGAWYDVATLLSHRSVESGDPEVLVRFVGFGLEEDEWANIRRSIRPRSLPCESSEC

Query:  VAVLPGDLVLCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASAEPSKSGMDSVLLSSQRM
        VAVLPGDL+LCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAAS EPSKS MDSVLLS QR+
Subjt:  VAVLPGDLVLCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASAEPSKSGMDSVLLSSQRM

Query:  SFETARK-LNKDATMVIPNANTNINVHAQTRTLEARNTETN-----------NAPNAFNSSNPAGSSAFSSGIMMSSVSGGSADNVSDGKLLS
        +FET +K LNKD T+VIPNAN NINVHAQT T EARNTETN           +AP  FNS NPAG SAFS GI+ ++VSGGSADNVSDGKLLS
Subjt:  SFETARK-LNKDATMVIPNANTNINVHAQTRTLEARNTETN-----------NAPNAFNSSNPAGSSAFSSGIMMSSVSGGSADNVSDGKLLS

TrEMBL top hits (columns: e value, %identity, alignment)
A0A0A0LC67 SAWADEE domain-containing protein; e value 7.0e-167; identity 82.72%
Query:  MGRPPSNGGPVFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQIKQVWNWFQNRRYAIRAKTTKAPGKLAVSPTVQSGKLPVSP
        MGRPPSNGGP FRFTA EVAEM+AILQ HNNTMPAREVLVALA+KFSESVERKGKIAVQ+K      QNRRYAIRAKT+KAPGKLA           VSP
Subjt:  MGRPPSNGGPVFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQIKQVWNWFQNRRYAIRAKTTKAPGKLAVSPTVQSGKLPVSP

Query:  IVQIESTPVRNQSHTTVAPAPTPVGSAKGATENLLSEFEAKSARDGAWYDVATLLSHRSVESGDPEVLVRFVGFGLEEDEWANIRRSIRPRSLPCESSEC
        +VQIESTPVRN   T V PAP PVGSAKGA EN LSEFEAKS RDGAWYDVAT LSHRSVESGDPEVLVRF GFG EEDEW NIRR+IRPRSLPCESSEC
Subjt:  IVQIESTPVRNQSHTTVAPAPTPVGSAKGATENLLSEFEAKSARDGAWYDVATLLSHRSVESGDPEVLVRFVGFGLEEDEWANIRRSIRPRSLPCESSEC

Query:  VAVLPGDLVLCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASAEPSKSGMDSVLLSSQRM
        VAVLPGDL+LCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAAS EPSKSGMDSVLLS QR+
Subjt:  VAVLPGDLVLCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASAEPSKSGMDSVLLSSQRM

Query:  SFETARK-LNKDATMVIPNANTNINVHAQTRTLEARNTETNNAPNAFNSSNPAGSSAFSSGIMMSSVSGGSADNVSDGKLLS
        +FET++  L+KDA +VIPNAN +IN HAQT T EARNTETN AP  FNS+N AGSSAFSSGI+ ++VS GSADNVSDGKLLS
Subjt:  SFETARK-LNKDATMVIPNANTNINVHAQTRTLEARNTETNNAPNAFNSSNPAGSSAFSSGIMMSSVSGGSADNVSDGKLLS

A0A1S3C274 protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X1; e value 1.6e-171; identity 81.93%
Query:  MGRPPSNGGPVFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQIKQVWNWFQNRRYAIRAKTTKAPGKLAVSPTVQSGKLPVSP
        MGRPPSNGGP FRFTA EVAEM+ ILQ HNNTMPAREVLVALA+KFSESVERKGKIAVQ+KQVWNWFQNRRYAIRAKT+KAPGKLA           VSP
Subjt:  MGRPPSNGGPVFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQIKQVWNWFQNRRYAIRAKTTKAPGKLAVSPTVQSGKLPVSP

Query:  IVQIESTPVRNQSHTTVAPAPTPVGSAKGATENLLSEFEAKSARDGAWYDVATLLSHRSVESGDPEVLVRFVGFGLEEDEWANIRRSIRPRSLPCESSEC
        +VQIESTPVRN   T V PAPTPVG+AK A EN LSEFEAKS RDGAWYDVAT LSHRSVESGDPEVLVRF GFG EEDEW NIRR+IRPRSLPCESSEC
Subjt:  IVQIESTPVRNQSHTTVAPAPTPVGSAKGATENLLSEFEAKSARDGAWYDVATLLSHRSVESGDPEVLVRFVGFGLEEDEWANIRRSIRPRSLPCESSEC

Query:  VAVLPGDLVLCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASAEPSKSGMDSVLLSSQRM
        VAVLPGDL+LCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAAS EPSKSGMDSVLLS QR+
Subjt:  VAVLPGDLVLCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASAEPSKSGMDSVLLSSQRM

Query:  SFETARK-LNKDATMVIPNANTNINVHAQTRTLEARNTETN-----------NAPNAFNSSNPAGSSAFSSGIMMSSVSGGSADNVSDGKLLS
        +FET +  L+KDA +VIPNAN +IN HAQT T EARNTETN           NAP  FNS+N AGSSAFSSGI+ ++VSGGSADNVSDGKLLS
Subjt:  SFETARK-LNKDATMVIPNANTNINVHAQTRTLEARNTETN-----------NAPNAFNSSNPAGSSAFSSGIMMSSVSGGSADNVSDGKLLS

A0A5A7UUV7 SAWADEE HOMEODOMAIN-like protein 2 isoform X1; e value 1.6e-171; identity 81.93%
Query:  MGRPPSNGGPVFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQIKQVWNWFQNRRYAIRAKTTKAPGKLAVSPTVQSGKLPVSP
        MGRPPSNGGP FRFTA EVAEM+ ILQ HNNTMPAREVLVALA+KFSESVERKGKIAVQ+KQVWNWFQNRRYAIRAKT+KAPGKLA           VSP
Subjt:  MGRPPSNGGPVFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQIKQVWNWFQNRRYAIRAKTTKAPGKLAVSPTVQSGKLPVSP

Query:  IVQIESTPVRNQSHTTVAPAPTPVGSAKGATENLLSEFEAKSARDGAWYDVATLLSHRSVESGDPEVLVRFVGFGLEEDEWANIRRSIRPRSLPCESSEC
        +VQIESTPVRN   T V PAPTPVG+AK A EN LSEFEAKS RDGAWYDVAT LSHRSVESGDPEVLVRF GFG EEDEW NIRR+IRPRSLPCESSEC
Subjt:  IVQIESTPVRNQSHTTVAPAPTPVGSAKGATENLLSEFEAKSARDGAWYDVATLLSHRSVESGDPEVLVRFVGFGLEEDEWANIRRSIRPRSLPCESSEC

Query:  VAVLPGDLVLCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASAEPSKSGMDSVLLSSQRM
        VAVLPGDL+LCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAAS EPSKSGMDSVLLS QR+
Subjt:  VAVLPGDLVLCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASAEPSKSGMDSVLLSSQRM

Query:  SFETARK-LNKDATMVIPNANTNINVHAQTRTLEARNTETN-----------NAPNAFNSSNPAGSSAFSSGIMMSSVSGGSADNVSDGKLLS
        +FET +  L+KDA +VIPNAN +IN HAQT T EARNTETN           NAP  FNS+N AGSSAFSSGI+ ++VSGGSADNVSDGKLLS
Subjt:  SFETARK-LNKDATMVIPNANTNINVHAQTRTLEARNTETN-----------NAPNAFNSSNPAGSSAFSSGIMMSSVSGGSADNVSDGKLLS

A0A5D3CH38 Protein SAWADEE HOMEODOMAIN-like protein 2 isoform X1; e value 2.3e-170; identity 81.42%
Query:  MGRPPSNGGPVFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQIKQVWNWFQNRRYAIRAKTTKAPGKLAVSPTVQSGKLPVSP
        MGRPPSNGGP FRFTA EVAEM+ ILQ HNNTMPAREVLVALA+KFSESVERKGKIAVQ+KQVWNWFQNRRYAIRAKT+KAPGKLA           VSP
Subjt:  MGRPPSNGGPVFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQIKQVWNWFQNRRYAIRAKTTKAPGKLAVSPTVQSGKLPVSP

Query:  IVQIESTPVRNQSHTTVAPAPTPVGSAKGATENLLSEFEAKSARDGAWYDVATLLSHRSVESGDPEVLVRFVGFGLEEDEWANIRRSIRPRSLPCESSEC
        +VQIESTPVRN   T V PAPTPVG+AK A EN LSEFEAKS RDGAWYDVAT LSHRSVESGDPEVLVRF GFG EEDEW NIRR+IRPRSLPCESSEC
Subjt:  IVQIESTPVRNQSHTTVAPAPTPVGSAKGATENLLSEFEAKSARDGAWYDVATLLSHRSVESGDPEVLVRFVGFGLEEDEWANIRRSIRPRSLPCESSEC

Query:  VAVLPGDLVLCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASAEPSKSGMDSVLLSSQRM
        VAVLPGDL+LCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDH+QSEEIVQLRKICRRPETDYRLQQLHAVNEAAS EPSKSGMDSVLLS QR+
Subjt:  VAVLPGDLVLCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASAEPSKSGMDSVLLSSQRM

Query:  SFETARK-LNKDATMVIPNANTNINVHAQTRTLEARNTETN-----------NAPNAFNSSNPAGSSAFSSGIMMSSVSGGSADNVSDGKLLS
        +FET +  L+KDA +VIPNAN + N HAQT T EARNTETN           NAP  FNS+N AGSSAFSSGI+ ++VSGGSADNVSDGKLLS
Subjt:  SFETARK-LNKDATMVIPNANTNINVHAQTRTLEARNTETN-----------NAPNAFNSSNPAGSSAFSSGIMMSSVSGGSADNVSDGKLLS

A0A6J1CLX5 protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X1; e value 2.0e-169; identity 82.46%
Query:  MGRPPSNGGPVFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQIKQVWNWFQNRRYAIRAKTTKAPGKLAVSPTVQSGKLPVSP
        MGRPPSNGGP FRFTA EVAEM+AILQ HNNTMPAREVLVALAEKFSES+ERKGKIAVQ+KQVWNWFQNRRYAIRAK+TKAPGKLA           VSP
Subjt:  MGRPPSNGGPVFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQIKQVWNWFQNRRYAIRAKTTKAPGKLAVSPTVQSGKLPVSP

Query:  IVQIESTPVRNQSHTTVAPAPTPVGSAKGATENLLSEFEAKSARDGAWYDVATLLSHRSVESGDPEVLVRFVGFGLEEDEWANIRRSIRPRSLPCESSEC
        IVQIESTPVRN   + V PAP PVGS KGA +N LSEFEAKSARDGAWYDVAT LSH+SVESGDPEVLVRF GFG EEDEW NIRR+IRPRSLPCESSEC
Subjt:  IVQIESTPVRNQSHTTVAPAPTPVGSAKGATENLLSEFEAKSARDGAWYDVATLLSHRSVESGDPEVLVRFVGFGLEEDEWANIRRSIRPRSLPCESSEC

Query:  VAVLPGDLVLCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASAEPSKSGMDSVLLSSQRM
        VAVLPGDL+LCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAAS EP KSGMDSVLLS  R+
Subjt:  VAVLPGDLVLCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASAEPSKSGMDSVLLSSQRM

Query:  SFE-TARKLNKDATMVIPNANTNINVHAQTRTLEARNTETNNAPNAFNSSNPAGSSAFSSGIMMSSVSGGSADNVSDGKLLS
        +FE T + L KDATMV PNAN N+NV AQT T E RN ET++ P +FNS NPAGSSAF SGI  +SVSGG  DNVSDGKLLS
Subjt:  SFE-TARKLNKDATMVIPNANTNINVHAQTRTLEARNTETNNAPNAFNSSNPAGSSAFSSGIMMSSVSGGSADNVSDGKLLS

SwissProt top hits (columns: e value, %identity, alignment)
Q8RWJ7 Protein SAWADEE HOMEODOMAIN HOMOLOG 2; e value 9.0e-103; identity 59.71%
Query:  MGRPPSNGGPVFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQIKQVWNWFQNRRYAIRAKTTKAPGKLAVSPTVQSGKLPVSP
        MGRPPSNGGP FRF  PEV EM+AIL  HN  MP R +L ALA+KFSES ERKGK+ VQ KQ+WNWFQNRRYA+RA+  KAPGKL VS ++    LP   
Subjt:  MGRPPSNGGPVFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQIKQVWNWFQNRRYAIRAKTTKAPGKLAVSPTVQSGKLPVSP

Query:  IVQIESTPVRNQSHTT-----VAPAPT---PVGSAKGATENLLSEFEAKSARDGAWYDVATLLSHRSVESGDPEVLVRFVGFGLEEDEWANIRRSIRPRS
           I+   V   +H T     + PAP+     G  +  ++N   EFEAKSARDGAWYDV   L+HR++E GDPEV VRF GF +EEDEW N+++ +R RS
Subjt:  IVQIESTPVRNQSHTT-----VAPAPT---PVGSAKGATENLLSEFEAKSARDGAWYDVATLLSHRSVESGDPEVLVRFVGFGLEEDEWANIRRSIRPRS

Query:  LPCESSECVAVLPGDLVLCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLH-AVNEAASAE----PSK
        LPCE+SECVAVL GDLVLCFQEGK+QALYFDA VLD QRRRHDVRGCRCRFLVRY HDQSEEIV LRKICRRPETDYRLQQLH AVN+ A++     P+ 
Subjt:  LPCESSECVAVLPGDLVLCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLH-AVNEAASAE----PSK

Query:  SGMDSVLLSSQRMSFETARKLNKD-------ATMVIPNAN
               LS    +       +KD       AT+V P++N
Subjt:  SGMDSVLLSSQRMSFETARKLNKD-------ATMVIPNAN

Q9XI47 Protein SAWADEE HOMEODOMAIN HOMOLOG 1; e value 2.7e-43; identity 41.57%
Query:  FTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQIKQVWNWFQNR-RYAIRAKTTKAPGKLAVSPTVQSGKLPVSPIVQIESTPVRNQ
        FT  E+ +M+ + +   +    ++    +A  FS SV R GK ++  KQV  WFQ + ++  + K+   P     SP +Q   L         S+   N 
Subjt:  FTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQIKQVWNWFQNR-RYAIRAKTTKAPGKLAVSPTVQSGKLPVSPIVQIESTPVRNQ

Query:  SHTTVAPAPTPVGSAKGATENLLS-EFEAKSARDGAWYDVATLLSHRSVESGDPEVLVRFVGFGLEEDEWANIRRSIRPRSLPCESSECVAVLPGDLVLC
        S+ T     T V + KG   +L    FEAKSARD AWYDV++ L++R + +G+ EV VRF GF    DEW N++ S+R RS+P E SEC  V  GDL+LC
Subjt:  SHTTVAPAPTPVGSAKGATENLLS-EFEAKSARDGAWYDVATLLSHRSVESGDPEVLVRFVGFGLEEDEWANIRRSIRPRSLPCESSECVAVLPGDLVLC

Query:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPE
        FQE ++QALY D HVL+ +R  HD   C C FLVRY+ D +EE + L +ICRRPE
Subjt:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPE

Arabidopsis top hits (columns: e value, %identity, alignment)
AT1G15215.2 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors; e value 2.0e-44; identity 41.57%
Query:  FTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQIKQVWNWFQNR-RYAIRAKTTKAPGKLAVSPTVQSGKLPVSPIVQIESTPVRNQ
        FT  E+ +M+ + +   +    ++    +A  FS SV R GK ++  KQV  WFQ + ++  + K+   P     SP +Q   L         S+   N 
Subjt:  FTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQIKQVWNWFQNR-RYAIRAKTTKAPGKLAVSPTVQSGKLPVSPIVQIESTPVRNQ

Query:  SHTTVAPAPTPVGSAKGATENLLS-EFEAKSARDGAWYDVATLLSHRSVESGDPEVLVRFVGFGLEEDEWANIRRSIRPRSLPCESSECVAVLPGDLVLC
        S+ T     T V + KG   +L    FEAKSARD AWYDV++ L++R + +G+ EV VRF GF    DEW N++ S+R RS+P E SEC  V  GDL+LC
Subjt:  SHTTVAPAPTPVGSAKGATENLLS-EFEAKSARDGAWYDVATLLSHRSVESGDPEVLVRFVGFGLEEDEWANIRRSIRPRSLPCESSECVAVLPGDLVLC

Query:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPE
        FQE ++QALY D HVL+ +R  HD   C C FLVRY+ D +EE + L +ICRRPE
Subjt:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPE

AT1G15215.3 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors; e value 1.1e-39; identity 40.5%
Query:  FTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQIKQVWNWFQNR-RYAIRAKTTKAPGKLAVSPTVQSGKLPVSPIVQIESTPVRNQ
        FT  E+ +M+ + +   +    ++    +A  FS SV R GK ++  KQV  WFQ + ++  + K+   P     SP +Q   L         S+   N 
Subjt:  FTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQIKQVWNWFQNR-RYAIRAKTTKAPGKLAVSPTVQSGKLPVSPIVQIESTPVRNQ

Query:  SHTTVAPAPTPVGSAKGATENLLS-EFEAKSARDGAWYDVATLLSHRSVESGDPEVLVRFVGFGLEEDEWANIRRSIRPRSLPCESSECVAVLPGDLVLC
        S+ T     T V + KG   +L    FEAKSARD AWYDV++ L++R + +G+ EV VRF GF    DEW N++ S+R RS+P E SEC  V  GDL+LC
Subjt:  SHTTVAPAPTPVGSAKGATENLLS-EFEAKSARDGAWYDVATLLSHRSVESGDPEVLVRFVGFGLEEDEWANIRRSIRPRSLPCESSECVAVLPGDLVLC

Query:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE
        FQE ++QALY D HVL+ +R  HD   C C FLVRY+ D +E
Subjt:  FQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE

AT3G18380.1 sequence-specific DNA binding transcription factors; sequence-specific DNA binding; e value 6.4e-104; identity 59.71%
Query:  MGRPPSNGGPVFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQIKQVWNWFQNRRYAIRAKTTKAPGKLAVSPTVQSGKLPVSP
        MGRPPSNGGP FRF  PEV EM+AIL  HN  MP R +L ALA+KFSES ERKGK+ VQ KQ+WNWFQNRRYA+RA+  KAPGKL VS ++    LP   
Subjt:  MGRPPSNGGPVFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQIKQVWNWFQNRRYAIRAKTTKAPGKLAVSPTVQSGKLPVSP

Query:  IVQIESTPVRNQSHTT-----VAPAPT---PVGSAKGATENLLSEFEAKSARDGAWYDVATLLSHRSVESGDPEVLVRFVGFGLEEDEWANIRRSIRPRS
           I+   V   +H T     + PAP+     G  +  ++N   EFEAKSARDGAWYDV   L+HR++E GDPEV VRF GF +EEDEW N+++ +R RS
Subjt:  IVQIESTPVRNQSHTT-----VAPAPT---PVGSAKGATENLLSEFEAKSARDGAWYDVATLLSHRSVESGDPEVLVRFVGFGLEEDEWANIRRSIRPRS

Query:  LPCESSECVAVLPGDLVLCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLH-AVNEAASAE----PSK
        LPCE+SECVAVL GDLVLCFQEGK+QALYFDA VLD QRRRHDVRGCRCRFLVRY HDQSEEIV LRKICRRPETDYRLQQLH AVN+ A++     P+ 
Subjt:  LPCESSECVAVLPGDLVLCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLH-AVNEAASAE----PSK

Query:  SGMDSVLLSSQRMSFETARKLNKD-------ATMVIPNAN
               LS    +       +KD       AT+V P++N
Subjt:  SGMDSVLLSSQRMSFETARKLNKD-------ATMVIPNAN

AT3G18380.2 sequence-specific DNA binding transcription factors; sequence-specific DNA binding; e value 1.6e-102; identity 59.53%
Query:  MGRPPSNGGPVFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQIKQVWNWFQNRRYAIRAKTTKAPGKLAVSPTVQSGKLPVSP
        MGRPPSNGGP FRF  PEV EM+AIL  HN  MP R +L ALA+KFSES ERKGK+ VQ KQ+WNWFQNRRYA+RA+  KAPGKL VS ++    LP   
Subjt:  MGRPPSNGGPVFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQIKQVWNWFQNRRYAIRAKTTKAPGKLAVSPTVQSGKLPVSP

Query:  IVQIESTPVRNQSHTT-----VAPAPT---PVGSAKGATENLLSEFEAKSARDGAWYDVATLLSHRSVESGDPEVLVRFVGFGLEEDEWANIRRSIRPRS
           I+   V   +H T     + PAP+     G  +  ++N   EFEAKSARDGAWYDV   L+HR++E GDPEV VRF GF +EEDEW N+++ +R RS
Subjt:  IVQIESTPVRNQSHTT-----VAPAPT---PVGSAKGATENLLSEFEAKSARDGAWYDVATLLSHRSVESGDPEVLVRFVGFGLEEDEWANIRRSIRPRS

Query:  LPCESSECVAVLPGDLVLCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE-EIVQLRKICRRPETDYRLQQLH-AVNEAASAE----PS
        LPCE+SECVAVL GDLVLCFQEGK+QALYFDA VLD QRRRHDVRGCRCRFLVRY HDQSE EIV LRKICRRPETDYRLQQLH AVN+ A++     P+
Subjt:  LPCESSECVAVLPGDLVLCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE-EIVQLRKICRRPETDYRLQQLH-AVNEAASAE----PS

Query:  KSGMDSVLLSSQRMSFETARKLNKD-------ATMVIPNAN
                LS    +       +KD       AT+V P++N
Subjt:  KSGMDSVLLSSQRMSFETARKLNKD-------ATMVIPNAN

AT3G18380.3 sequence-specific DNA binding transcription factors; sequence-specific DNA binding; e value 3.7e-104; identity 60.06%
Query:  MGRPPSNGGPVFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQIKQVWNWFQNRRYAIRAKTTKAPGKLAVSPTVQSGKLPVSP
        MGRPPSNGGP FRF  PEV EM+AIL  HN  MP R +L ALA+KFSES ERKGK+ VQ KQ+WNWFQNRRYA+RA+  KAPGKL VS ++    LP   
Subjt:  MGRPPSNGGPVFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQIKQVWNWFQNRRYAIRAKTTKAPGKLAVSPTVQSGKLPVSP

Query:  IVQIESTPVRNQSHTT-----VAPAPTPVGSAKGATENLLSEFEAKSARDGAWYDVATLLSHRSVESGDPEVLVRFVGFGLEEDEWANIRRSIRPRSLPC
           I+   V   +H T     + PAP+  G  +  ++N   EFEAKSARDGAWYDV   L+HR++E GDPEV VRF GF +EEDEW N+++ +R RSLPC
Subjt:  IVQIESTPVRNQSHTT-----VAPAPTPVGSAKGATENLLSEFEAKSARDGAWYDVATLLSHRSVESGDPEVLVRFVGFGLEEDEWANIRRSIRPRSLPC

Query:  ESSECVAVLPGDLVLCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE-EIVQLRKICRRPETDYRLQQLH-AVNEAASAE----PSKSG
        E+SECVAVL GDLVLCFQEGK+QALYFDA VLD QRRRHDVRGCRCRFLVRY HDQSE EIV LRKICRRPETDYRLQQLH AVN+ A++     P+   
Subjt:  ESSECVAVLPGDLVLCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSE-EIVQLRKICRRPETDYRLQQLH-AVNEAASAE----PSKSG

Query:  MDSVLLSSQRMSFETARKLNKD-------ATMVIPNAN
             LS    +       +KD       AT+V P++N
Subjt:  MDSVLLSSQRMSFETARKLNKD-------ATMVIPNAN
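
In the alignments above, the unlabelled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. As a rough Python sketch, identity for one alignment block can be estimated from that line alone. The function name `midline_identity` is hypothetical, and the value computed for a single block will not necessarily match the whole-alignment %identity reported by the database.

```python
from typing import Tuple


def midline_identity(query: str, midline: str) -> Tuple[int, int, float]:
    """Return (identical columns, alignment length, percent identity).

    `query` is the aligned query row (gaps included) and `midline` is the
    match line beneath it. Trailing spaces lost from the midline are
    restored by padding it to the query length.
    """
    length = len(query)
    if length == 0:
        raise ValueError("empty alignment row")
    padded = midline.ljust(length)[:length]
    # Letters mark identities; '+' and spaces are not counted.
    identical = sum(1 for ch in padded if ch.isalpha())
    return identical, length, 100.0 * identical / length


# The first 14 columns of the first GenBank alignment above.
ident, length, pct = midline_identity("MGRPPSNGGPVFRF", "MGRPPSNGGP FRF")
print(ident, length, round(pct, 2))
```

Gap columns count towards the alignment length here, which is the convention BLAST uses for its identity percentage.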


Sequences
CDS sequence
ATGGGTCGGCCTCCCAGCAATGGAGGCCCCGTCTTCCGCTTCACCGCTCCCGAGGTTGCTGAGATGGATGCTATATTGCAAGCACACAATAATACCATGCCGGCTCGAGA
AGTTCTTGTTGCCCTTGCGGAGAAGTTCAGTGAATCTGTAGAACGGAAGGGGAAGATTGCTGTGCAAATTAAGCAAGTTTGGAATTGGTTCCAGAATAGACGATATGCTA
TCAGAGCAAAGACAACTAAGGCTCCTGGAAAGCTAGCTGTCTCTCCAACTGTTCAAAGTGGAAAGCTACCTGTTTCTCCAATCGTCCAAATCGAGTCAACTCCCGTGAGA
AATCAGTCTCATACAACAGTTGCTCCTGCTCCTACACCAGTAGGCTCTGCAAAGGGTGCTACAGAAAATCTATTGTCGGAATTTGAAGCTAAATCTGCGAGGGATGGTGC
ATGGTATGACGTTGCTACACTTTTATCCCATAGATCTGTGGAAAGTGGTGACCCGGAAGTACTAGTTCGATTTGTTGGCTTTGGATTGGAGGAGGATGAGTGGGCTAATA
TTCGAAGAAGCATTAGACCTCGTTCTCTACCTTGCGAATCATCAGAATGTGTGGCAGTTCTTCCAGGCGACCTTGTTTTATGCTTTCAGGAGGGTAAAGAGCAGGCACTT
TACTTTGATGCCCATGTGCTTGATACACAAAGAAGAAGACATGATGTACGAGGTTGTCGCTGCAGGTTTTTGGTTCGTTATGATCATGATCAATCAGAGGAAATTGTTCA
GTTGAGAAAGATTTGCCGTCGGCCTGAGACTGACTACAGATTGCAACAGCTTCATGCTGTAAATGAAGCCGCATCGGCCGAGCCCTCAAAATCTGGCATGGATTCTGTAC
TGCTCAGCAGTCAAAGGATGAGTTTCGAAACAGCGCGAAAGCTGAACAAGGATGCAACTATGGTTATACCAAATGCAAATACCAATATAAATGTTCATGCCCAAACTAGA
ACTCTGGAAGCAAGGAATACTGAAACTAACAATGCCCCAAATGCATTCAACTCTAGTAATCCCGCGGGTAGCTCTGCATTCTCGAGCGGTATCATGATGAGCTCTGTTTC
TGGTGGGTCAGCTGACAATGTGTCTGATGGGAAGTTACTTAGTTGA
mRNA sequence
GTTTTGCCCCCAAAAGCTATGAATTTGGGCTTAAAACTTTCCATATTTTCCTCTCTCATCATCATATTCTCATATATATATATATACACACACATATTCATTTATAATCT
AGCGCAAACATTACCAAAAATAACCAATTTCAATACACAGGATCTTCAATTCTGACTTATCCCGGTCTCTGTCTCGGTTTTCTCTCTTCGATTCTGCGACAGCGACGGCG
AAATCAGAGAAAAACACATTCACAGTCGAAGCTTTAGTTTATGGGTCGGCCTCCCAGCAATGGAGGCCCCGTCTTCCGCTTCACCGCTCCCGAGGTTGCTGAGATGGATG
CTATATTGCAAGCACACAATAATACCATGCCGGCTCGAGAAGTTCTTGTTGCCCTTGCGGAGAAGTTCAGTGAATCTGTAGAACGGAAGGGGAAGATTGCTGTGCAAATT
AAGCAAGTTTGGAATTGGTTCCAGAATAGACGATATGCTATCAGAGCAAAGACAACTAAGGCTCCTGGAAAGCTAGCTGTCTCTCCAACTGTTCAAAGTGGAAAGCTACC
TGTTTCTCCAATCGTCCAAATCGAGTCAACTCCCGTGAGAAATCAGTCTCATACAACAGTTGCTCCTGCTCCTACACCAGTAGGCTCTGCAAAGGGTGCTACAGAAAATC
TATTGTCGGAATTTGAAGCTAAATCTGCGAGGGATGGTGCATGGTATGACGTTGCTACACTTTTATCCCATAGATCTGTGGAAAGTGGTGACCCGGAAGTACTAGTTCGA
TTTGTTGGCTTTGGATTGGAGGAGGATGAGTGGGCTAATATTCGAAGAAGCATTAGACCTCGTTCTCTACCTTGCGAATCATCAGAATGTGTGGCAGTTCTTCCAGGCGA
CCTTGTTTTATGCTTTCAGGAGGGTAAAGAGCAGGCACTTTACTTTGATGCCCATGTGCTTGATACACAAAGAAGAAGACATGATGTACGAGGTTGTCGCTGCAGGTTTT
TGGTTCGTTATGATCATGATCAATCAGAGGAAATTGTTCAGTTGAGAAAGATTTGCCGTCGGCCTGAGACTGACTACAGATTGCAACAGCTTCATGCTGTAAATGAAGCC
GCATCGGCCGAGCCCTCAAAATCTGGCATGGATTCTGTACTGCTCAGCAGTCAAAGGATGAGTTTCGAAACAGCGCGAAAGCTGAACAAGGATGCAACTATGGTTATACC
AAATGCAAATACCAATATAAATGTTCATGCCCAAACTAGAACTCTGGAAGCAAGGAATACTGAAACTAACAATGCCCCAAATGCATTCAACTCTAGTAATCCCGCGGGTA
GCTCTGCATTCTCGAGCGGTATCATGATGAGCTCTGTTTCTGGTGGGTCAGCTGACAATGTGTCTGATGGGAAGTTACTTAGTTGATTAAGGAAAAATTCTTCTTCATCA
GCTAATTTTAACTGAACCTATCAATTTACAATTTTGGCTGACTAGTTTATTTAGAATGAGTAAATACCTAGTGAAGTCTGTTTTTTGTTCCATGTTCCAAAGTTATAGGT
TCCCATCCACTTGCTCTGAATGCCGAATGTTCTTGACGGACGAAAAATGCAGGAGTTGAGCCTCGAATGTAGTATCGTCGAACAGGTCAGAGGCCTCATCTCTCTTGTTT
TACTTTTTTCTTCTCTGGTTAGCCTAACATGGGTCTACACTTCACATGCTGATTGATAATATATATAGTTTACTGCATAGTTTTTCCATAAAGCTTTGCTGACTACCAAA
TTTACTATGATACCC
Protein sequence
MGRPPSNGGPVFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQIKQVWNWFQNRRYAIRAKTTKAPGKLAVSPTVQSGKLPVSPIVQIESTPVR
NQSHTTVAPAPTPVGSAKGATENLLSEFEAKSARDGAWYDVATLLSHRSVESGDPEVLVRFVGFGLEEDEWANIRRSIRPRSLPCESSECVAVLPGDLVLCFQEGKEQAL
YFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASAEPSKSGMDSVLLSSQRMSFETARKLNKDATMVIPNANTNINVHAQTR
TLEARNTETNNAPNAFNSSNPAGSSAFSSGIMMSSVSGGSADNVSDGKLLS
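
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The sketch below is a minimal, dependency-free Python version; `CODON_TABLE` and `translate` are names introduced here for illustration, and the standard nuclear code is assumed to apply to this gene.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    # Remove line breaks and other whitespace from the wrapped sequence.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-ACGT characters
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGGGTCGGCCTCCCAGCAATGGAGGCCCC"))
```

Passing the full CDS, with its line breaks, to `translate` and comparing the result with the protein sequence is a quick way to confirm that the two listings are consistent.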