; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0013880 (gene) of Chayote v1 genome

Gene IDSed0013880
OrganismSechium edule (Chayote v1)
Descriptionhomeobox protein HAT3.1
Genome locationLG13:3144849..3150754
RNA-Seq ExpressionSed0013880
SyntenySed0013880
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0003677 - DNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001356 - Homeobox domain
IPR001965 - Zinc finger, PHD-type
IPR009057 - Homeobox-like domain superfamily
IPR011011 - Zinc finger, FYVE/PHD-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR017970 - Homeobox, conserved site
IPR019786 - Zinc finger, PHD-type, conserved site
IPR019787 - Zinc finger, PHD-finger


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600300.1 Homeobox protein HAZ1, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0074.26Show/hide
Query:  ILSLSFRSRHQGKFFMFFGDNMEERDEYTESRPDNYSEAIQEAKAS-EVEMRTCLSNMQMHSVPDY-ELGTTPDNIRRTSCSDNEKPGVQQNMEEKNKEL
        ++ +S     QG   +FFGDNMEERDEYTESR  N S A+QEAKA+ EVE+ T L+N QMHS P+Y ELGT  D   +T   D EKPGV+QNMEE  KEL
Subjt:  ILSLSFRSRHQGKFFMFFGDNMEERDEYTESRPDNYSEAIQEAKAS-EVEMRTCLSNMQMHSVPDY-ELGTTPDNIRRTSCSDNEKPGVQQNMEEKNKEL

Query:  GLGDAVRELSEKSKQTISNLADNDQDEARNLLSSDKDTENLILPIVVET-TILNKCSKLPTEDDNKNCIKQMNPLREDLTPNTSIHNLEFVPYDSQQLDH
        GLG+A   L E+S QTIS LADNDQ EA NLLSSDKDTENLILPI VET  +LN+CS+ PTEDDNKN I+Q NP  ED   NTSI NL  VP +S +L  
Subjt:  GLGDAVRELSEKSKQTISNLADNDQDEARNLLSSDKDTENLILPIVVET-TILNKCSKLPTEDDNKNCIKQMNPLREDLTPNTSIHNLEFVPYDSQQLDH

Query:  KDKKVLISKKKNCSLRSRVSSDRVLRSRTQEKSKAPKPSNDLNNFTAGEEGKRKKRKKKRNIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEG
        KDK+VL SKKKN  LRS VSSDRVLRSRTQ+K+KAP+PSNDL+N TAGEEGK KK+KK R IKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEG
Subjt:  KDKKVLISKKKNCSLRSRVSSDRVLRSRTQEKSKAPKPSNDLNNFTAGEEGKRKKRKKKRNIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEG

Query:  WKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDTLCAEGRLPQSLFDSEGQIDSEDIFCAKCGSKELSCPENDIILCDGVCERGFHLFCLDPPLQK
        WKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRID LC+EGR  ++LFDSEGQIDSEDIFC KCGSKELS  ENDIILCDGVC+RGFH FCL+PPL  
Subjt:  WKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDTLCAEGRLPQSLFDSEGQIDSEDIFCAKCGSKELSCPENDIILCDGVCERGFHLFCLDPPLQK

Query:  TDIPPDDEGWLCPGCDCKDDCMDVLNDFQGSNLSITDGWEKVYPE-AAPAAGQNSDHALGLPSDDSEDGDYDPDVPDIIDKDGD-----SSSDDSSSDKS
        +DIPPDDEGWLCPGCDCKDDC+D+LN+FQGSNLSITDGWEKV+PE AA AAG++SDH + LPSDDS+DGDYDPDVPD ID+DG+     SSSD SSSDKS
Subjt:  TDIPPDDEGWLCPGCDCKDDCMDVLNDFQGSNLSITDGWEKVYPE-AAPAAGQNSDHALGLPSDDSEDGDYDPDVPDIIDKDGD-----SSSDDSSSDKS

Query:  GY--VSASEELEAQPNDDQYLGLPSDDSEDDDYDPSAPERAESVGQESSNSGFTSDSDDLAALDDIGS----------LNGTMPVRNSNEKSSRCASNKN
        GY   SASEELEA PNDDQYLGLPSDDSEDDDYDP AP R E V QESS+S FTSDS+DLAAL D GS          LN T+PVRNSN +SS    NKN
Subjt:  GY--VSASEELEAQPNDDQYLGLPSDDSEDDDYDPSAPERAESVGQESSNSGFTSDSDDLAALDDIGS----------LNGTMPVRNSNEKSSRCASNKN

Query:  ALHNELSSLQESGPEKDGLELVSGRRHVERLDYKKLHDETYGNLPTDSSDDTYGSVSMDSSDDRGRGRSTRKKSFKNPVPESSKNGTDDLNDIKTKHNSK
        A HN+LSSL  SGP++ GLELVSGRRHVERLDYKKLHDET+GN+PTDSSDDTYGS S+DSSDDRGRGRSTRK S KNPVP  S+NGTDDL +IKTK +SK
Subjt:  ALHNELSSLQESGPEKDGLELVSGRRHVERLDYKKLHDETYGNLPTDSSDDTYGSVSMDSSDDRGRGRSTRKKSFKNPVPESSKNGTDDLNDIKTKHNSK

Query:  KTRQKPAAENMNNSVIKTPEYTSKSGSSVRRTTSSSQRRLNQPALERLIASFKENQYPTRATKESLAQELGLCPKQVSKWFQNTRWSTRHPYSETNKAKS
        +TRQKPAAENM+NSV KTPE T KS SSVRRTTSSS RRL+QP LERL+ASF+ENQYP RATKESLA+ELGL  KQVSKWF+NTRWSTRHP SE NKAKS
Subjt:  KTRQKPAAENMNNSVIKTPEYTSKSGSSVRRTTSSSQRRLNQPALERLIASFKENQYPTRATKESLAQELGLCPKQVSKWFQNTRWSTRHPYSETNKAKS

Query:  ATRMDIQSSQKSRKLPNPEKESSECLRNMESNGAHHQESPMPISVVAPCQSGDTGDDKLVKKKTKRPESTTVKSRKRKG----VASRSKHGKESQKRPSK
         +RM  QSS+ SRK P PE+ES  C R++ SNGA HQESP  ISVVAPCQSG TGDDKL  +K KRPEST  KSRKRKG    VASRSK  K+S+K P+K
Subjt:  ATRMDIQSSQKSRKLPNPEKESSECLRNMESNGAHHQESPMPISVVAPCQSGDTGDDKLVKKKTKRPESTTVKSRKRKG----VASRSKHGKESQKRPSK

Query:  SPKLNENQT
        S K++E QT
Subjt:  SPKLNENQT

KAG7030959.1 Homeobox protein HAZ1 [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0075.17Show/hide
Query:  MEERDEYTESRPDNYSEAIQEAKAS-EVEMRTCLSNMQMHSVPDY-ELGTTPDNIRRTSCSDNEKPGVQQNMEEKNKELGLGDAVRELSEKSKQTISNLA
        MEERDEYTESR  N S A+QEAKA+ EVE+ T L+N QMHS P+Y ELGT  D   +T   D EKPGV+QNMEE  KELGLG+A   L E+S QTIS LA
Subjt:  MEERDEYTESRPDNYSEAIQEAKAS-EVEMRTCLSNMQMHSVPDY-ELGTTPDNIRRTSCSDNEKPGVQQNMEEKNKELGLGDAVRELSEKSKQTISNLA

Query:  DNDQDEARNLLSSDKDTENLILPIVVET-TILNKCSKLPTEDDNKNCIKQMNPLREDLTPNTSIHNLEFVPYDSQQLDHKDKKVLISKKKNCSLRSRVSS
        DNDQ EA NLLSSDKDTENLILPI VET  +LN+CS+ PTEDDNKN I+Q NP  ED   NTSI NL  VP +S +L  KDK+VL SKKKN  LRS VSS
Subjt:  DNDQDEARNLLSSDKDTENLILPIVVET-TILNKCSKLPTEDDNKNCIKQMNPLREDLTPNTSIHNLEFVPYDSQQLDHKDKKVLISKKKNCSLRSRVSS

Query:  DRVLRSRTQEKSKAPKPSNDLNNFTAGEEGKRKKRKKKRNIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNE
        DRVLRSRTQ+K+KAP+PSNDL+N TAGEEGK KK+KK R IKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNE
Subjt:  DRVLRSRTQEKSKAPKPSNDLNNFTAGEEGKRKKRKKKRNIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNE

Query:  IMRRKLKIRDLFQRIDTLCAEGRLPQSLFDSEGQIDSEDIFCAKCGSKELSCPENDIILCDGVCERGFHLFCLDPPLQKTDIPPDDEGWLCPGCDCKDDC
        IMRRKLKIRDLFQRID LC+EGR  ++LFDSEGQIDSEDIFC KCGSKELS  ENDIILCDGVC+RGFH FCL+PPL  +DIPPDDEGWLCPGCDCKDDC
Subjt:  IMRRKLKIRDLFQRIDTLCAEGRLPQSLFDSEGQIDSEDIFCAKCGSKELSCPENDIILCDGVCERGFHLFCLDPPLQKTDIPPDDEGWLCPGCDCKDDC

Query:  MDVLNDFQGSNLSITDGWEKVYPE-AAPAAGQNSDHALGLPSDDSEDGDYDPDVPDIIDKDGD-----SSSDDSSSDKSGY--VSASEELEAQPNDDQYL
        +D+LN+FQGSNLSITDGWEKV+PE AA AAG++SDH + LPSDDS+DGDYDPDVPD ID+DG+     SSSD SSSDKSGY   SASEELEA PNDDQYL
Subjt:  MDVLNDFQGSNLSITDGWEKVYPE-AAPAAGQNSDHALGLPSDDSEDGDYDPDVPDIIDKDGD-----SSSDDSSSDKSGY--VSASEELEAQPNDDQYL

Query:  GLPSDDSEDDDYDPSAPERAESVGQESSNSGFTSDSDDLAALDDIGS----------LNGTMPVRNSNEKSSRCASNKNALHNELSSLQESGPEKDGLEL
        GLPSDDSEDDDYDP AP R E V QESS+S FTSDS+DLAAL D GS          LN T+PVRNSN +SS    NKNA HN+LSSL  SGP++ GLEL
Subjt:  GLPSDDSEDDDYDPSAPERAESVGQESSNSGFTSDSDDLAALDDIGS----------LNGTMPVRNSNEKSSRCASNKNALHNELSSLQESGPEKDGLEL

Query:  VSGRRHVERLDYKKLHDETYGNLPTDSSDDTYGSVSMDSSDDRGRGRSTRKKSFKNPVPESSKNGTDDLNDIKTKHNSKKTRQKPAAENMNNSVIKTPEY
        VSGRRHVERLDYKKLHDET+GN+PTDSSDDTYGS S+DSSDDRGRGRSTRK S KNPVP  S+NGTDDL +IKTK +SK+TRQKPAAENM+NSV KTPE 
Subjt:  VSGRRHVERLDYKKLHDETYGNLPTDSSDDTYGSVSMDSSDDRGRGRSTRKKSFKNPVPESSKNGTDDLNDIKTKHNSKKTRQKPAAENMNNSVIKTPEY

Query:  TSKSGSSVRRTTSSSQRRLNQPALERLIASFKENQYPTRATKESLAQELGLCPKQVSKWFQNTRWSTRHPYSETNKAKSATRMDIQSSQKSRKLPNPEKE
        T KS SSVRRTTSSS RRL+QP LERL+ASF+ENQYP RATKESLA+ELGL  KQVSKWF+NTRWSTRHP SE NKAKS +RM  QSSQ SRK P PE+E
Subjt:  TSKSGSSVRRTTSSSQRRLNQPALERLIASFKENQYPTRATKESLAQELGLCPKQVSKWFQNTRWSTRHPYSETNKAKSATRMDIQSSQKSRKLPNPEKE

Query:  SSECLRNMESNGAHHQESPMPISVVAPCQSGDTGDDKLVKKKTKRPESTTVKSRKRKG----VASRSKHGKESQKRPSKSPKLNENQTQTVDKVKKRRRK
        S  C R++ SNGA HQESP  ISVVAPCQSG TGDDKL  +K KRPEST  KSRKRKG    VASRSK  K+S+K P+KS K++E   QT DKVKKRRRK
Subjt:  SSECLRNMESNGAHHQESPMPISVVAPCQSGDTGDDKLVKKKTKRPESTTVKSRKRKG----VASRSKHGKESQKRPSKSPKLNENQTQTVDKVKKRRRK

Query:  SI
        S+
Subjt:  SI

XP_022942376.1 homeobox protein HAT3.1 [Cucurbita moschata]0.0e+0074.97Show/hide
Query:  MEERDEYTESRPDNYSEAIQEAKAS-EVEMRTCLSNMQMHSVPDY-ELGTTPDNIRRTSCSDNEKPGVQQNMEEKNKELGLGDAVRELSEKSKQTISNLA
        MEERDEYTESR  N S A+QEAKAS EVE+ T L+N QMHS P+Y ELGT  D   +T   D EKPGV+QNMEE  KELGLG+A R L EKS QTIS LA
Subjt:  MEERDEYTESRPDNYSEAIQEAKAS-EVEMRTCLSNMQMHSVPDY-ELGTTPDNIRRTSCSDNEKPGVQQNMEEKNKELGLGDAVRELSEKSKQTISNLA

Query:  DNDQDEARNLLSSDKDTENLILPIVVETT-ILNKCSKLPTEDDNKNCIKQMNPLREDLTPNTSIHNLEFVPYDSQQLDHKDKKVLISKKKNCSLRSRVSS
        DNDQDEA NLLSSDKDTENLILPI VETT +LN+CS+ PTED+NKN I+Q NP  ED   NTSI NL  VP +S ++  KDK+VL SKKKN  LRS +SS
Subjt:  DNDQDEARNLLSSDKDTENLILPIVVETT-ILNKCSKLPTEDDNKNCIKQMNPLREDLTPNTSIHNLEFVPYDSQQLDHKDKKVLISKKKNCSLRSRVSS

Query:  DRVLRSRTQEKSKAPKPSNDLNNFTAGEEGKRKKRKKKRNIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNE
        DRVLRSRTQ+K+KAP+PSNDL+N TAGEEGK K  KK R IKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNE
Subjt:  DRVLRSRTQEKSKAPKPSNDLNNFTAGEEGKRKKRKKKRNIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNE

Query:  IMRRKLKIRDLFQRIDTLCAEGRLPQSLFDSEGQIDSEDIFCAKCGSKELSCPENDIILCDGVCERGFHLFCLDPPLQKTDIPPDDEGWLCPGCDCKDDC
        IMRRKLKIRDLFQRID LC+EGR  ++LFDSEGQIDSEDIFC KCGSKELS  ENDIILCDGVC+RGFH FCL+PPL  +DIPPDDEGWLCPGCDCKDDC
Subjt:  IMRRKLKIRDLFQRIDTLCAEGRLPQSLFDSEGQIDSEDIFCAKCGSKELSCPENDIILCDGVCERGFHLFCLDPPLQKTDIPPDDEGWLCPGCDCKDDC

Query:  MDVLNDFQGSNLSITDGWEKVYPE-AAPAAGQNSDHALGLPSDDSEDGDYDPDVPDIIDKDGD----------SSSDDSSSDKSGY--VSASEELEAQPN
        +D+LN+FQGSNLSITDGWEKV+PE AA AAGQ+SDH + LPSDDS+DGDYDPDVPD ID+DG+          SSSD SSSDKSGY   SASEELEA PN
Subjt:  MDVLNDFQGSNLSITDGWEKVYPE-AAPAAGQNSDHALGLPSDDSEDGDYDPDVPDIIDKDGD----------SSSDDSSSDKSGY--VSASEELEAQPN

Query:  DDQYLGLPSDDSEDDDYDPSAPERAESVGQESSNSGFTSDSDDLAALDDIGS----------LNGTMPVRNSNEKSSRCASNKNALHNELSSLQESGPEK
        DDQYLGLPSDDSEDDDYDP AP R E VGQESS+S FTSDS+DLAAL D GS          LN T+PVRNS+ +SS    NKNA HN+LSSL  SGP++
Subjt:  DDQYLGLPSDDSEDDDYDPSAPERAESVGQESSNSGFTSDSDDLAALDDIGS----------LNGTMPVRNSNEKSSRCASNKNALHNELSSLQESGPEK

Query:  DGLELVSGRRHVERLDYKKLHDETYGNLPTDSSDDTYGSVSMDSSDDRGRGRSTRKKSFKNPVPESSKNGTDDLNDIKTKHNSKKTRQKPAAENMNNSVI
         GLELVSGRRHVERLDYKKLHDET+GN+PTDSSDDTYGS S+DSSDDRGRGRSTRK S KNPVP  S+NGTDDL +IKTK +SK+TRQKPAAENM+NSV 
Subjt:  DGLELVSGRRHVERLDYKKLHDETYGNLPTDSSDDTYGSVSMDSSDDRGRGRSTRKKSFKNPVPESSKNGTDDLNDIKTKHNSKKTRQKPAAENMNNSVI

Query:  KTPEYTSKSGSSVRRTTSSSQRRLNQPALERLIASFKENQYPTRATKESLAQELGLCPKQVSKWFQNTRWSTRHPYSETNKAKSATRMDIQSSQKSRKLP
        KTPE T KS SSVRRTTSSS RRL+QP LERL+ASF+ENQYP RATKESLA+ELGL  KQVSKWF+NTRWSTRHP SE NKAKSA+RM  QSSQ SRK P
Subjt:  KTPEYTSKSGSSVRRTTSSSQRRLNQPALERLIASFKENQYPTRATKESLAQELGLCPKQVSKWFQNTRWSTRHPYSETNKAKSATRMDIQSSQKSRKLP

Query:  NPEKESSECLRNMESNGAHHQESPMPISVVAPCQSGDTGDDKLVKKKTKRPESTTVKSRKRKG----VASRSKHGKESQKRPSKSPKLNENQTQTVDKVK
         PE+ES  C R+  SNGA HQESP  ISVVAPCQSG TGDDKL  +K KRPES   KSRKRKG    VASRSK  K+S+K P+KS K++E   QT DKVK
Subjt:  NPEKESSECLRNMESNGAHHQESPMPISVVAPCQSGDTGDDKLVKKKTKRPESTTVKSRKRKG----VASRSKHGKESQKRPSKSPKLNENQTQTVDKVK

Query:  KRRRKSI
        KRRRKS+
Subjt:  KRRRKSI

XP_022979028.1 homeobox protein HAT3.1-like [Cucurbita maxima]0.0e+0074.72Show/hide
Query:  MEERDEYTESRPDNYSEAIQEAKAS-EVEMRTCLSNMQMHSVPDY-ELGTTPDNIRRTSCSDNEKPGVQQNMEEKNKELGLGDAVRELSEKSKQTISNLA
        MEERDEYTESR  + S A+QEAKAS EVE+ T L+N Q+ S P+Y ELGT  D   +T   D EKPGV+QNMEE +KEL LG+A  EL EKS QTIS LA
Subjt:  MEERDEYTESRPDNYSEAIQEAKAS-EVEMRTCLSNMQMHSVPDY-ELGTTPDNIRRTSCSDNEKPGVQQNMEEKNKELGLGDAVRELSEKSKQTISNLA

Query:  DNDQDEARNLLSSDKDTENLILPIVVETT-ILNKCSKLPTEDDNKNCIKQMNPLREDLTPNTSIHNLEFVPYDSQQLDHKDKKVLISKKKNCSLRSRVSS
        +NDQ EA NLLSSDKDTENLILPI VETT +LN+CS+ PTEDDNKN I+Q NP  E    NTSI NL  VP +S +L  KDK+VL SKKKN  LRS VSS
Subjt:  DNDQDEARNLLSSDKDTENLILPIVVETT-ILNKCSKLPTEDDNKNCIKQMNPLREDLTPNTSIHNLEFVPYDSQQLDHKDKKVLISKKKNCSLRSRVSS

Query:  DRVLRSRTQEKSKAPKPSNDLNNFTAGEEGKRKKRKKKRNIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNE
        DRVLRSRTQ+K+KAP+PSNDL+N TAGEEGK KKRKK R IKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNE
Subjt:  DRVLRSRTQEKSKAPKPSNDLNNFTAGEEGKRKKRKKKRNIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNE

Query:  IMRRKLKIRDLFQRIDTLCAEGRLPQSLFDSEGQIDSEDIFCAKCGSKELSCPENDIILCDGVCERGFHLFCLDPPLQKTDIPPDDEGWLCPGCDCKDDC
        IMRRKLKIRDLFQRID LC+EGR  ++LFDSEGQIDSEDIFC KCGSKELS  ENDIILCDGVC+RGFH FCL+PPL  +DIPPDDEGWLCPGCDCKDDC
Subjt:  IMRRKLKIRDLFQRIDTLCAEGRLPQSLFDSEGQIDSEDIFCAKCGSKELSCPENDIILCDGVCERGFHLFCLDPPLQKTDIPPDDEGWLCPGCDCKDDC

Query:  MDVLNDFQGSNLSITDGWEKVYPE-AAPAAGQNSDHALGLPSDDSEDGDYDPDVPDIIDKDGD-----SSSDDSSSDKSGY--VSASEELEAQPNDDQYL
        +D+LN+FQGSNLSITDGWEKV+PE AA AAG++SDH + LPSDDS+DGDYDPDVPD ID+DG+     SSSD SSSDKSGY   SASEELEA PNDDQYL
Subjt:  MDVLNDFQGSNLSITDGWEKVYPE-AAPAAGQNSDHALGLPSDDSEDGDYDPDVPDIIDKDGD-----SSSDDSSSDKSGY--VSASEELEAQPNDDQYL

Query:  GLPSDDSEDDDYDPSAPERAESVGQESSNSGFTSDSDDLAALDDIGS----------LNGTMPVRNSNEKSSRCASNKNALHNELSSLQESGPEKDGLEL
        GLPSDDSEDDDYDP AP R E VGQESS+S FTSDS+DLAAL D GS          LN T PVRNSN +SS    NKNA HN+LSSL  SGP++ GLEL
Subjt:  GLPSDDSEDDDYDPSAPERAESVGQESSNSGFTSDSDDLAALDDIGS----------LNGTMPVRNSNEKSSRCASNKNALHNELSSLQESGPEKDGLEL

Query:  VSGRRHVERLDYKKLHDETYGNLPTDSSDDTYGSVSMDSSDDRGRGRSTRKKSFKNPVPESSKNGTDDLNDIKTKHNSKKTRQKPAAENMNNSVIKTPEY
        VSGRRHVERLDYKKLHDET+GN+P++SSDDTYGS S+DSSDDRGRGRSTRK S KN VP  S+NGTDD  +IKTK +S++TRQKPAAENM+NSV KTPE 
Subjt:  VSGRRHVERLDYKKLHDETYGNLPTDSSDDTYGSVSMDSSDDRGRGRSTRKKSFKNPVPESSKNGTDDLNDIKTKHNSKKTRQKPAAENMNNSVIKTPEY

Query:  TSKSGSSVRRTTSSSQRRLNQPALERLIASFKENQYPTRATKESLAQELGLCPKQVSKWFQNTRWSTRHPYSETNKAKSATRMDIQSSQKSRKLPNPEKE
        T KS SSVRRTTSSS RRL+QP LERL+ASF+ENQYP RATKESLA+ELGL  KQVSKWF+NTRWSTRHP SE NKAKSA+RM  QSSQ SRK P PE+E
Subjt:  TSKSGSSVRRTTSSSQRRLNQPALERLIASFKENQYPTRATKESLAQELGLCPKQVSKWFQNTRWSTRHPYSETNKAKSATRMDIQSSQKSRKLPNPEKE

Query:  SSECLRNMESNGAHHQESPMPISVVAPCQSGDTGDDKLVKKKTKRPESTTVKSRKRKG----VASRSKHGKESQKRPSKSPKLNENQTQTVDKVKKRRRK
        S  C R+  SNGA HQESP  I+VVAPCQSG TGDDKL   KTKRPEST  KSRKRKG    VASRSK+ K+S+K P+KS K++E   QT DKVKKRRRK
Subjt:  SSECLRNMESNGAHHQESPMPISVVAPCQSGDTGDDKLVKKKTKRPESTTVKSRKRKG----VASRSKHGKESQKRPSKSPKLNENQTQTVDKVKKRRRK

Query:  SI
        S+
Subjt:  SI

XP_023531864.1 homeobox protein HAT3.1 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0074.53Show/hide
Query:  MEERDEYTESRPDNYSEAIQEAKAS-EVEMRTCLSNMQMHSVPDY-ELGTTPDNIRRTSCSDNEKPGVQQNMEEKNKELGLGDAVRELSEKSKQTISNLA
        MEERDEYTESR  N S A+QEAKAS EVE+ T L+N QMHS P+Y ELGT  D   +T   D EKPGV+QNMEE  +ELGLG+A   L EKS QTIS LA
Subjt:  MEERDEYTESRPDNYSEAIQEAKAS-EVEMRTCLSNMQMHSVPDY-ELGTTPDNIRRTSCSDNEKPGVQQNMEEKNKELGLGDAVRELSEKSKQTISNLA

Query:  DNDQDEARNLLSSDKDTENLILPIVVETT-ILNKCSKLPTEDDNKNCIKQMNPLREDLTPNTSIHNLEFVPYDSQQLDHKDKKVLISKKKNCSLRSRVSS
        DNDQ EA NLLSSDKDTENLILPI VETT +LN+C + PTEDDNKN I+Q NP  ED   NT I NL  VP +S +L  KDK+VL SKKKN  LRS VSS
Subjt:  DNDQDEARNLLSSDKDTENLILPIVVETT-ILNKCSKLPTEDDNKNCIKQMNPLREDLTPNTSIHNLEFVPYDSQQLDHKDKKVLISKKKNCSLRSRVSS

Query:  DRVLRSRTQEKSKAPKPSNDLNNFTAGEEGKRKKRKKKRNIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNE
        DRVLRSRTQ+K+KAP+PSNDL+N TAGEEGK KK+KK R IKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNE
Subjt:  DRVLRSRTQEKSKAPKPSNDLNNFTAGEEGKRKKRKKKRNIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNE

Query:  IMRRKLKIRDLFQRIDTLCAEGRLPQSLFDSEGQIDSEDIFCAKCGSKELSCPENDIILCDGVCERGFHLFCLDPPLQKTDIPPDDEGWLCPGCDCKDDC
        IMRRKLKIRDLFQRID LC+EGR  ++LFDSEGQIDSEDIFC KCGSKELS  ENDIILCDGVC+RGFH FCL+PPL  +DIPPDDEGWLCPGCDCKDDC
Subjt:  IMRRKLKIRDLFQRIDTLCAEGRLPQSLFDSEGQIDSEDIFCAKCGSKELSCPENDIILCDGVCERGFHLFCLDPPLQKTDIPPDDEGWLCPGCDCKDDC

Query:  MDVLNDFQGSNLSITDGWEKVYPE-AAPAAGQNSDHALGLPSDDSEDGDYDPDVPDIIDKDGDSSSDD----------SSSDKSGY--VSASEELEAQPN
        +D+LN+FQGSNLSITDGWEKV+PE AA AAG++SDH + LPSDDS+DGDYDPDVPD ID+DG+SSSD           SSSDKSGY   SASEELEA PN
Subjt:  MDVLNDFQGSNLSITDGWEKVYPE-AAPAAGQNSDHALGLPSDDSEDGDYDPDVPDIIDKDGDSSSDD----------SSSDKSGY--VSASEELEAQPN

Query:  DDQYLGLPSDDSEDDDYDPSAPERAESVGQESSNSGFTSDSDDLAALDDIGS----------LNGTMPVRNSNEKSSRCASNKNALHNELSSLQESGPEK
        DDQYLGLPSDDSEDDDYDP AP R E VGQESS+S FTSDS+DLAAL + GS          LN T+PVRNSN +SS    NK+A HN+LSSL  SGP++
Subjt:  DDQYLGLPSDDSEDDDYDPSAPERAESVGQESSNSGFTSDSDDLAALDDIGS----------LNGTMPVRNSNEKSSRCASNKNALHNELSSLQESGPEK

Query:  DGLELVSGRRHVERLDYKKLHDETYGNLPTDSSDDTYGSVSMDSSDDRGRGRSTRKKSFKNPVPESSKNGTDDLNDIKTKHNSKKTRQKPAAENMNNSVI
         GLELVSGRRHVERLDYKKLHDET+GN+PTDSSDDTYGS S+DSSDDRGRGRSTRK S KNPVP  S+NGTDDL +IKTK +SK+TRQKPAAENM+NSV 
Subjt:  DGLELVSGRRHVERLDYKKLHDETYGNLPTDSSDDTYGSVSMDSSDDRGRGRSTRKKSFKNPVPESSKNGTDDLNDIKTKHNSKKTRQKPAAENMNNSVI

Query:  KTPEYTSKSGSSVRRTTSSSQRRLNQPALERLIASFKENQYPTRATKESLAQELGLCPKQVSKWFQNTRWSTRHPYSETNKAKSATRMDIQSSQKSRKLP
        KTPE T KS SSVRRTTSSS RRL+QP LERL+ASF+ENQYP RATKESLA+ELGL  KQVSKWF+NTRWSTRHP SE NKA+SA+RM  QSSQ SRK P
Subjt:  KTPEYTSKSGSSVRRTTSSSQRRLNQPALERLIASFKENQYPTRATKESLAQELGLCPKQVSKWFQNTRWSTRHPYSETNKAKSATRMDIQSSQKSRKLP

Query:  NPEKESSECLRNMESNGAHHQESPMPISVVAPCQSGDTGDDKLVKKKTKRPESTTVKSRKRKG----VASRSKHGKESQKRPSKSPKLNENQTQTVDKVK
         PE+ES  C R+  SNGA HQESP  ISVVAPCQSG TGDDK   ++TKRPEST  KSRKRKG    VASRSK  K+S+K P+KS K++E   QT DKVK
Subjt:  NPEKESSECLRNMESNGAHHQESPMPISVVAPCQSGDTGDDKLVKKKTKRPESTTVKSRKRKG----VASRSKHGKESQKRPSKSPKLNENQTQTVDKVK

Query:  KRRRKSI
        KRRRKS+
Subjt:  KRRRKSI

TrEMBL top hitse value%identityAlignment
A0A1S3C283 pathogenesis-related homeodomain protein0.0e+0070.43Show/hide
Query:  DNMEERDEY--TESRPDNYSEAIQEAKAS-EVEMRTCLSNMQMHSVPDYELGTTPDNIRRTSCSDNEKPGVQQNMEEKNKELGLGDAVRELSEKSKQTIS
        DNMEERDE   TESRP+  +EA+QEAKAS EVE+RTCLSN  M+S    ELGTTP+  R+T   D EK GVQQNM     ELG G  + ELSEK  QTIS
Subjt:  DNMEERDEY--TESRPDNYSEAIQEAKAS-EVEMRTCLSNMQMHSVPDYELGTTPDNIRRTSCSDNEKPGVQQNMEEKNKELGLGDAVRELSEKSKQTIS

Query:  NLADNDQDEARNLLSSDKDTENLILPIVVE-TTILNKCSKLPTEDDNKNCIKQMNPLREDLTPNTSIHNLEFVPYDSQQLDHKDKKVLISKKKNCSLRSR
        N ADNDQ EA N LS DKDT+NL L I  E TT+LN+CS+LP ED  KN I++MNP  EDLT  TSI +LE +P +SQQLDHKD++   SKKKN  LRS 
Subjt:  NLADNDQDEARNLLSSDKDTENLILPIVVE-TTILNKCSKLPTEDDNKNCIKQMNPLREDLTPNTSIHNLEFVPYDSQQLDHKDKKVLISKKKNCSLRSR

Query:  VSSDRVLRSRTQEKSKAPKPSNDLNNFTAGEEGKRKKRKKKRNIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRA
        VSSDRVLRSRTQEK+KAP+PSNDLNNFTA EEGKRKK KKKRNI+GKGARVDE+SSIRNHLRYL+NRI+YEQSLIEAYSSEGWKGFSSDKLKPEKELQRA
Subjt:  VSSDRVLRSRTQEKSKAPKPSNDLNNFTAGEEGKRKKRKKKRNIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRA

Query:  SNEIMRRKLKIRDLFQRIDTLCAEGRLPQSLFDSEGQIDSEDIFCAKCGSKELSCPENDIILCDGVCERGFHLFCLDPPLQKTDIPPDDEGWLCPGCDCK
        SNEIMRRKLKIRDLFQRIDTLCAEGRL +SLFDSEGQIDSEDIFCAKCGSKELS  ENDIILCDG+C+RGFH FCL+PPL  TDIPPDDEGWLCPGCDCK
Subjt:  SNEIMRRKLKIRDLFQRIDTLCAEGRLPQSLFDSEGQIDSEDIFCAKCGSKELSCPENDIILCDGVCERGFHLFCLDPPLQKTDIPPDDEGWLCPGCDCK

Query:  DDCMDVLNDFQGSNLSITDGWEKVYPEAAPAAGQNSDHALGLPSDDSEDGDYDPDVPDIIDKDGDSSSDDSSSDK-----SGYVSASEELEAQPNDDQYL
        DDC+D+LN+FQGSNLSITDGWEKVYPEAA AAG+NSD  LGLPSDDSEDGDYDPD+PD ID+D + SSD+SSSD+     SGY SASE LE  PNDDQYL
Subjt:  DDCMDVLNDFQGSNLSITDGWEKVYPEAAPAAGQNSDHALGLPSDDSEDGDYDPDVPDIIDKDGDSSSDDSSSDK-----SGYVSASEELEAQPNDDQYL

Query:  GLPSDDSEDDDYDPSAPERAESVGQESSNSGFTSDSDDLAALDD---------IGSLNGTMPVRNSNEKSSRCASNKNALHNELSSLQESGPEKDGLELV
        GLPSDDSED+DYDPS PE  E   QESS+S FTSDS+DLAAL++         + SLN T+PV+N+N +SS    +K+ LHNELSSL +SG +KDGLE +
Subjt:  GLPSDDSEDDDYDPSAPERAESVGQESSNSGFTSDSDDLAALDD---------IGSLNGTMPVRNSNEKSSRCASNKNALHNELSSLQESGPEKDGLELV

Query:  SGRRHVERLDYKKLHDETYGNLPTDSSDDTYGSVSMDSSDDRGRGRSTRKKSFKNPVPESSKNGT-DDLNDIKTKHNSK-KTRQKPAAENMNNSVIKTPE
        SGRR VERLDYKKLHDETYGN+PT+SSDDTYGS ++DSSDDRG    TRK+  K  V   S NG+ DDL ++KTK + K +TRQKP A N+NNSV +TP 
Subjt:  SGRRHVERLDYKKLHDETYGNLPTDSSDDTYGSVSMDSSDDRGRGRSTRKKSFKNPVPESSKNGT-DDLNDIKTKHNSK-KTRQKPAAENMNNSVIKTPE

Query:  YTSKSGSSVRRTTSSSQRRLNQPALERLIASFKENQYPTRATKESLAQELGLCPKQVSKWFQNTRWSTRHPYSETNKAKSATRMDIQSSQKSRKLPNPEK
         T+KS SSVR+ TSSS RRL+QPALERL ASF+EN+YP RATKESLAQELGL  KQVSKWF+NTRWSTRHP S   KAKS++RM I  SQ S +L   E+
Subjt:  YTSKSGSSVRRTTSSSQRRLNQPALERLIASFKENQYPTRATKESLAQELGLCPKQVSKWFQNTRWSTRHPYSETNKAKSATRMDIQSSQKSRKLPNPEK

Query:  ESSECLRNMESNGAHHQESPMPISVVAPCQSGDTGDDKLVKKKTKRPESTTVKSRKRKG----VASRSKHGKESQKRPSKSPKLNENQTQTVDKVKKRRR
        ES+ C R+ +SNGA HQ+ PM  SVVA CQSGDTGD KL  +KTKR ES+  KSRKRKG     AS SK  + S + P+KSPK+NE  TQT D+ K RRR
Subjt:  ESSECLRNMESNGAHHQESPMPISVVAPCQSGDTGDDKLVKKKTKRPESTTVKSRKRKG----VASRSKHGKESQKRPSKSPKLNENQTQTVDKVKKRRR

Query:  KSI
        +SI
Subjt:  KSI

A0A6J1D6Q5 homeobox protein HAT3.1 isoform X10.0e+0074.27Show/hide
Query:  MEERDEYTESRPDNYSEAIQEAKASEVEMRTCLSNMQMHSVPD-YELGTTPDNIRRTSCSDNEKPGVQQNMEEKNKELGLGDAVRELSEKSKQTISNLAD
        MEER EYTE RP+N  EA+QEAKAS VE+ TC SN QMHS+PD  ELGTTP+   +T+  D+EK GVQQNMEE+ KELG GD + EL EK+ QTIS LA+
Subjt:  MEERDEYTESRPDNYSEAIQEAKASEVEMRTCLSNMQMHSVPD-YELGTTPDNIRRTSCSDNEKPGVQQNMEEKNKELGLGDAVRELSEKSKQTISNLAD

Query:  NDQDEARNLLSSDKDTENLILPIVVETTILNKCSKLPTEDDNKNCIKQMNPLREDLTPNTSIHNLEFVPYD----SQQLDHKDKKVLISKKKNCSLRSRV
         DQ EA NLLSSD +TENLILPI +ETT LN+CS+LP ED NKN IKQ+NP  EDLT NTSI  LE VP      SQQL HKDKK+L SKKKN  LRS V
Subjt:  NDQDEARNLLSSDKDTENLILPIVVETTILNKCSKLPTEDDNKNCIKQMNPLREDLTPNTSIHNLEFVPYD----SQQLDHKDKKVLISKKKNCSLRSRV

Query:  SSDRVLRSRTQEKSKAPKPSNDLNNFTAGEEGKRKKRKKKRNIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRAS
        SSDRVLRSRTQEK+KAP+PSN+LN  TAG EGKRK  KKKRNIKGKGA  DEFSSIRN LRYLVNRIKYEQSLI+AYSSEGWKGFSSDKLKPEKELQRAS
Subjt:  SSDRVLRSRTQEKSKAPKPSNDLNNFTAGEEGKRKKRKKKRNIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRAS

Query:  NEIMRRKLKIRDLFQRIDTLCAEGRLPQSLFDSEGQIDSEDIFCAKCGSKELSCPENDIILCDGVCERGFHLFCLDPPLQKTDIPPDDEGWLCPGCDCKD
        +EIMR KLKIRDLFQ +D+LCAEGRL +SLFDSEGQIDSEDIFCAKCGSKELS  ENDIILCDG+C+RGFH FCL+PPL  TDIPPDDEGWLCPGCDCKD
Subjt:  NEIMRRKLKIRDLFQRIDTLCAEGRLPQSLFDSEGQIDSEDIFCAKCGSKELSCPENDIILCDGVCERGFHLFCLDPPLQKTDIPPDDEGWLCPGCDCKD

Query:  DCMDVLNDFQGSNLSITDGWEKVYPE-AAPAAGQNSDHALGLPSDDSEDGDYDPDVPDIIDKDGDSSSDDSSSDKSGYVSASEELEAQPNDDQYLGLPSD
        DC+D+LN+FQGSNLSITDGWEKVYPE AA AAGQNSDHALGLPSDDSEDGDYDPD PD I+++ +SSSD SSSD+SGY SASEELEA PNDDQYLGLPSD
Subjt:  DCMDVLNDFQGSNLSITDGWEKVYPE-AAPAAGQNSDHALGLPSDDSEDGDYDPDVPDIIDKDGDSSSDDSSSDKSGYVSASEELEAQPNDDQYLGLPSD

Query:  DSEDDDYDPSAPERAESVGQESSNSGFTSDSDDLAALDDIGSLNGTMPVRNSNEKSSRCASNKNALHNELSSLQESGPEKDGLELVSGRRHVERLDYKKL
        DSEDDDY+P APE  E V QESS S FTSDS+DLAALDD     GT PVRNSN + S C    + LHNEL SL ESGP+KDGLE VSGRR VERLDYKKL
Subjt:  DSEDDDYDPSAPERAESVGQESSNSGFTSDSDDLAALDDIGSLNGTMPVRNSNEKSSRCASNKNALHNELSSLQESGPEKDGLELVSGRRHVERLDYKKL

Query:  HDETYGNLPTDSSDDTYGSVSMDSSDDRGRGRSTRKKSFKNPVPESSKNGT-DDLNDIKTKHNSK-KTRQKPAAENMNNSVIKTPEYTSKSGSSVRRTTS
        HDETYGN+P+DSSDDT+GS+S+DSSDDRGRG  TRK+S KN VP  + NGT DDL + KTK + K +T QKP AENM NSV +TPE + KS SSVRRT S
Subjt:  HDETYGNLPTDSSDDTYGSVSMDSSDDRGRGRSTRKKSFKNPVPESSKNGT-DDLNDIKTKHNSK-KTRQKPAAENMNNSVIKTPEYTSKSGSSVRRTTS

Query:  SSQRRLNQPALERLIASFKENQYPTRATKESLAQELGLCPKQVSKWFQNTRWSTRHPYS-ETNKAKSATRMDIQSSQKSRKLPNPEKESSECLRNMESNG
        SS RRL+QPALERL+ASF+ENQYP RATKESLAQELGL  KQVSKWF+NTRWSTRHP S E+NKAKSA RM IQSS+ S KLP PE+ES  C R+ ++NG
Subjt:  SSQRRLNQPALERLIASFKENQYPTRATKESLAQELGLCPKQVSKWFQNTRWSTRHPYS-ETNKAKSATRMDIQSSQKSRKLPNPEKESSECLRNMESNG

Query:  AHHQESPMPISVVAPCQSGDTGDDKLVKKKTKRPESTTVKSRKRKG----VASRSKHGKESQKRPSKSPKLNENQTQTVDKVKKRRRKSI
        A HQ SP     VAPCQSGDT DDKL  +KT RPEST  KSRKRKG    VAS SK  KESQK P+KSPK+  NQ QT DKV+ RRR+SI
Subjt:  AHHQESPMPISVVAPCQSGDTGDDKLVKKKTKRPESTTVKSRKRKG----VASRSKHGKESQKRPSKSPKLNENQTQTVDKVKKRRRKSI

A0A6J1E4I6 homeobox protein HAT3.1-like3.9e-30769.97Show/hide
Query:  MEERDEYTESRPDNYSEAIQEAKAS-EVEMRTCLSNMQMHSVPDY-ELGTTPDNIRRTSCSDNEKPGVQQNMEEKNKELGLGDAVRELSEKSKQTISNLA
        MEERDEYTESR +N +EA+QEAK S E EMRTCLSN Q HSVPDY EL  TP    +T  SD EKP VQQNMEE+N+ELG GD + ELSEK  QT SNLA
Subjt:  MEERDEYTESRPDNYSEAIQEAKAS-EVEMRTCLSNMQMHSVPDY-ELGTTPDNIRRTSCSDNEKPGVQQNMEEKNKELGLGDAVRELSEKSKQTISNLA

Query:  DNDQDEARNLLSSDKDTENLILPIVVE-TTILNKCSKLPTEDDNKNCIKQMNPLREDLTPNTSIHNLEFVPYDSQQLDHKDKKVLISKKKNCSLRSRVSS
        DNDQ EA NLL  DKDTENLI+PI VE TT+L  CS+LP E  NKN I+QMNP  E LT NT   NLE VP +S+Q DHKDK++L S K N  LRS VSS
Subjt:  DNDQDEARNLLSSDKDTENLILPIVVE-TTILNKCSKLPTEDDNKNCIKQMNPLREDLTPNTSIHNLEFVPYDSQQLDHKDKKVLISKKKNCSLRSRVSS

Query:  DRVLRSRTQEKSKAPKPSNDLNNFTAGEEGKRKKRKKKRNIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNE
        DR LRS+TQEK K P+PSNDLNNFTA EEGK K  KK+RNI+GKGARVDEFSSIRNHLRYL+NRIKYEQ+LIEAYSSEGWKGFSSDKLKPEKELQRASNE
Subjt:  DRVLRSRTQEKSKAPKPSNDLNNFTAGEEGKRKKRKKKRNIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNE

Query:  IMRRKLKIRDLFQRIDTLCAEGRLPQSLFDSEGQIDSEDIFCAKCGSKELSCPENDIILCDGVCERGFHLFCLDPPLQKTDIPPDDEGWLCPGCDCKDDC
        IMRRKLKIRD+FQRID LC EG L +SLFDS+GQIDSEDIFCAKCGSKELS  ENDIILCDG+C+RGFH FCL+PPL  TDIPPDDEGWLCPGCDCKDDC
Subjt:  IMRRKLKIRDLFQRIDTLCAEGRLPQSLFDSEGQIDSEDIFCAKCGSKELSCPENDIILCDGVCERGFHLFCLDPPLQKTDIPPDDEGWLCPGCDCKDDC

Query:  MDVLNDFQGSNLSITDGWEKVYPE-AAPAAGQNSDHALGLPSDDS-EDGDYDPDVPDIIDKDGDSSSDDSSSDKSGYVSASEELEAQPNDDQYLGLPSDD
        +++LN+FQGS LSITDGWEKVYPE AA AAG+N DHA GLPSDDS +D DYDPDVPD I +D     D+SS + SGY SASEELE+ PN DQYLGLPSDD
Subjt:  MDVLNDFQGSNLSITDGWEKVYPE-AAPAAGQNSDHALGLPSDDS-EDGDYDPDVPDIIDKDGDSSSDDSSSDKSGYVSASEELEAQPNDDQYLGLPSDD

Query:  SEDDDYDPSAPERAESVGQESSNSGFTSDSDDLAALDD----------IGSLNGTMPVRNSNEKSSRCASNKNALHNELSSLQESGPEKDGLELVSGRRH
        SEDDDYDPSAPER E V QESS+S FTSDS+DLAALD             SLN T  ++N + +SS     K+AL+NELSSL ESGP+KDG E V GRR 
Subjt:  SEDDDYDPSAPERAESVGQESSNSGFTSDSDDLAALDD----------IGSLNGTMPVRNSNEKSSRCASNKNALHNELSSLQESGPEKDGLELVSGRRH

Query:  VERLDYKKLHDETYGNLPTDSSDDTYGSVSMDSSDDRGRGRSTRKKSFKNPVPESSKNGT-DDLNDIKTKHNSKK-TRQKPAAENMNNSVIKTPEYTSKS
        VERLDYKKLHDETYGN+PTDSSDDTY SVSMDSSDD+G   +TRK+S K  V       T DDL +IKTKH+SK+ TRQK  A NMN SV KTPE T K+
Subjt:  VERLDYKKLHDETYGNLPTDSSDDTYGSVSMDSSDDRGRGRSTRKKSFKNPVPESSKNGT-DDLNDIKTKHNSKK-TRQKPAAENMNNSVIKTPEYTSKS

Query:  GSSVRRTTSSSQRRLNQPALERLIASFKENQYPTRATKESLAQELGLCPKQVSKWFQNTRWSTRHPYS-ETNKAKSATRMDIQSSQKSRKLPNPEKESSE
         SSVRRTT SS RRL++ ALERL+ASF+ENQYP RATKESLAQELGL  KQVSKWF NTRWSTRHP S E NKAKS++RM I SSQ S +L  PE+E   
Subjt:  GSSVRRTTSSSQRRLNQPALERLIASFKENQYPTRATKESLAQELGLCPKQVSKWFQNTRWSTRHPYS-ETNKAKSATRMDIQSSQKSRKLPNPEKESSE

Query:  CLRNMESNGAHHQESPMPISVVAPCQSGDTGDDKLVKKKTKRPESTTVKSRKRKG----VASRSKHGKESQKRPSKSPKLNENQTQTVDKVKKRRRKSI
                GA HQE P   SVVAPCQSGDTGD KL  ++TKR E +  KSRKRKG     AS SK  KESQ+ P+KSPK+NE   QT   +K RRR S+
Subjt:  CLRNMESNGAHHQESPMPISVVAPCQSGDTGDDKLVKKKTKRPESTTVKSRKRKG----VASRSKHGKESQKRPSKSPKLNENQTQTVDKVKKRRRKSI

A0A6J1FNP3 homeobox protein HAT3.10.0e+0074.97Show/hide
Query:  MEERDEYTESRPDNYSEAIQEAKAS-EVEMRTCLSNMQMHSVPDY-ELGTTPDNIRRTSCSDNEKPGVQQNMEEKNKELGLGDAVRELSEKSKQTISNLA
        MEERDEYTESR  N S A+QEAKAS EVE+ T L+N QMHS P+Y ELGT  D   +T   D EKPGV+QNMEE  KELGLG+A R L EKS QTIS LA
Subjt:  MEERDEYTESRPDNYSEAIQEAKAS-EVEMRTCLSNMQMHSVPDY-ELGTTPDNIRRTSCSDNEKPGVQQNMEEKNKELGLGDAVRELSEKSKQTISNLA

Query:  DNDQDEARNLLSSDKDTENLILPIVVETT-ILNKCSKLPTEDDNKNCIKQMNPLREDLTPNTSIHNLEFVPYDSQQLDHKDKKVLISKKKNCSLRSRVSS
        DNDQDEA NLLSSDKDTENLILPI VETT +LN+CS+ PTED+NKN I+Q NP  ED   NTSI NL  VP +S ++  KDK+VL SKKKN  LRS +SS
Subjt:  DNDQDEARNLLSSDKDTENLILPIVVETT-ILNKCSKLPTEDDNKNCIKQMNPLREDLTPNTSIHNLEFVPYDSQQLDHKDKKVLISKKKNCSLRSRVSS

Query:  DRVLRSRTQEKSKAPKPSNDLNNFTAGEEGKRKKRKKKRNIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNE
        DRVLRSRTQ+K+KAP+PSNDL+N TAGEEGK K  KK R IKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNE
Subjt:  DRVLRSRTQEKSKAPKPSNDLNNFTAGEEGKRKKRKKKRNIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNE

Query:  IMRRKLKIRDLFQRIDTLCAEGRLPQSLFDSEGQIDSEDIFCAKCGSKELSCPENDIILCDGVCERGFHLFCLDPPLQKTDIPPDDEGWLCPGCDCKDDC
        IMRRKLKIRDLFQRID LC+EGR  ++LFDSEGQIDSEDIFC KCGSKELS  ENDIILCDGVC+RGFH FCL+PPL  +DIPPDDEGWLCPGCDCKDDC
Subjt:  IMRRKLKIRDLFQRIDTLCAEGRLPQSLFDSEGQIDSEDIFCAKCGSKELSCPENDIILCDGVCERGFHLFCLDPPLQKTDIPPDDEGWLCPGCDCKDDC

Query:  MDVLNDFQGSNLSITDGWEKVYPE-AAPAAGQNSDHALGLPSDDSEDGDYDPDVPDIIDKDGD----------SSSDDSSSDKSGY--VSASEELEAQPN
        +D+LN+FQGSNLSITDGWEKV+PE AA AAGQ+SDH + LPSDDS+DGDYDPDVPD ID+DG+          SSSD SSSDKSGY   SASEELEA PN
Subjt:  MDVLNDFQGSNLSITDGWEKVYPE-AAPAAGQNSDHALGLPSDDSEDGDYDPDVPDIIDKDGD----------SSSDDSSSDKSGY--VSASEELEAQPN

Query:  DDQYLGLPSDDSEDDDYDPSAPERAESVGQESSNSGFTSDSDDLAALDDIGS----------LNGTMPVRNSNEKSSRCASNKNALHNELSSLQESGPEK
        DDQYLGLPSDDSEDDDYDP AP R E VGQESS+S FTSDS+DLAAL D GS          LN T+PVRNS+ +SS    NKNA HN+LSSL  SGP++
Subjt:  DDQYLGLPSDDSEDDDYDPSAPERAESVGQESSNSGFTSDSDDLAALDDIGS----------LNGTMPVRNSNEKSSRCASNKNALHNELSSLQESGPEK

Query:  DGLELVSGRRHVERLDYKKLHDETYGNLPTDSSDDTYGSVSMDSSDDRGRGRSTRKKSFKNPVPESSKNGTDDLNDIKTKHNSKKTRQKPAAENMNNSVI
         GLELVSGRRHVERLDYKKLHDET+GN+PTDSSDDTYGS S+DSSDDRGRGRSTRK S KNPVP  S+NGTDDL +IKTK +SK+TRQKPAAENM+NSV 
Subjt:  DGLELVSGRRHVERLDYKKLHDETYGNLPTDSSDDTYGSVSMDSSDDRGRGRSTRKKSFKNPVPESSKNGTDDLNDIKTKHNSKKTRQKPAAENMNNSVI

Query:  KTPEYTSKSGSSVRRTTSSSQRRLNQPALERLIASFKENQYPTRATKESLAQELGLCPKQVSKWFQNTRWSTRHPYSETNKAKSATRMDIQSSQKSRKLP
        KTPE T KS SSVRRTTSSS RRL+QP LERL+ASF+ENQYP RATKESLA+ELGL  KQVSKWF+NTRWSTRHP SE NKAKSA+RM  QSSQ SRK P
Subjt:  KTPEYTSKSGSSVRRTTSSSQRRLNQPALERLIASFKENQYPTRATKESLAQELGLCPKQVSKWFQNTRWSTRHPYSETNKAKSATRMDIQSSQKSRKLP

Query:  NPEKESSECLRNMESNGAHHQESPMPISVVAPCQSGDTGDDKLVKKKTKRPESTTVKSRKRKG----VASRSKHGKESQKRPSKSPKLNENQTQTVDKVK
         PE+ES  C R+  SNGA HQESP  ISVVAPCQSG TGDDKL  +K KRPES   KSRKRKG    VASRSK  K+S+K P+KS K++E   QT DKVK
Subjt:  NPEKESSECLRNMESNGAHHQESPMPISVVAPCQSGDTGDDKLVKKKTKRPESTTVKSRKRKG----VASRSKHGKESQKRPSKSPKLNENQTQTVDKVK

Query:  KRRRKSI
        KRRRKS+
Subjt:  KRRRKSI

A0A6J1IPM8 homeobox protein HAT3.1-like0.0e+0074.72Show/hide
Query:  MEERDEYTESRPDNYSEAIQEAKAS-EVEMRTCLSNMQMHSVPDY-ELGTTPDNIRRTSCSDNEKPGVQQNMEEKNKELGLGDAVRELSEKSKQTISNLA
        MEERDEYTESR  + S A+QEAKAS EVE+ T L+N Q+ S P+Y ELGT  D   +T   D EKPGV+QNMEE +KEL LG+A  EL EKS QTIS LA
Subjt:  MEERDEYTESRPDNYSEAIQEAKAS-EVEMRTCLSNMQMHSVPDY-ELGTTPDNIRRTSCSDNEKPGVQQNMEEKNKELGLGDAVRELSEKSKQTISNLA

Query:  DNDQDEARNLLSSDKDTENLILPIVVETT-ILNKCSKLPTEDDNKNCIKQMNPLREDLTPNTSIHNLEFVPYDSQQLDHKDKKVLISKKKNCSLRSRVSS
        +NDQ EA NLLSSDKDTENLILPI VETT +LN+CS+ PTEDDNKN I+Q NP  E    NTSI NL  VP +S +L  KDK+VL SKKKN  LRS VSS
Subjt:  DNDQDEARNLLSSDKDTENLILPIVVETT-ILNKCSKLPTEDDNKNCIKQMNPLREDLTPNTSIHNLEFVPYDSQQLDHKDKKVLISKKKNCSLRSRVSS

Query:  DRVLRSRTQEKSKAPKPSNDLNNFTAGEEGKRKKRKKKRNIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNE
        DRVLRSRTQ+K+KAP+PSNDL+N TAGEEGK KKRKK R IKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNE
Subjt:  DRVLRSRTQEKSKAPKPSNDLNNFTAGEEGKRKKRKKKRNIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNE

Query:  IMRRKLKIRDLFQRIDTLCAEGRLPQSLFDSEGQIDSEDIFCAKCGSKELSCPENDIILCDGVCERGFHLFCLDPPLQKTDIPPDDEGWLCPGCDCKDDC
        IMRRKLKIRDLFQRID LC+EGR  ++LFDSEGQIDSEDIFC KCGSKELS  ENDIILCDGVC+RGFH FCL+PPL  +DIPPDDEGWLCPGCDCKDDC
Subjt:  IMRRKLKIRDLFQRIDTLCAEGRLPQSLFDSEGQIDSEDIFCAKCGSKELSCPENDIILCDGVCERGFHLFCLDPPLQKTDIPPDDEGWLCPGCDCKDDC

Query:  MDVLNDFQGSNLSITDGWEKVYPE-AAPAAGQNSDHALGLPSDDSEDGDYDPDVPDIIDKDGD-----SSSDDSSSDKSGY--VSASEELEAQPNDDQYL
        +D+LN+FQGSNLSITDGWEKV+PE AA AAG++SDH + LPSDDS+DGDYDPDVPD ID+DG+     SSSD SSSDKSGY   SASEELEA PNDDQYL
Subjt:  MDVLNDFQGSNLSITDGWEKVYPE-AAPAAGQNSDHALGLPSDDSEDGDYDPDVPDIIDKDGD-----SSSDDSSSDKSGY--VSASEELEAQPNDDQYL

Query:  GLPSDDSEDDDYDPSAPERAESVGQESSNSGFTSDSDDLAALDDIGS----------LNGTMPVRNSNEKSSRCASNKNALHNELSSLQESGPEKDGLEL
        GLPSDDSEDDDYDP AP R E VGQESS+S FTSDS+DLAAL D GS          LN T PVRNSN +SS    NKNA HN+LSSL  SGP++ GLEL
Subjt:  GLPSDDSEDDDYDPSAPERAESVGQESSNSGFTSDSDDLAALDDIGS----------LNGTMPVRNSNEKSSRCASNKNALHNELSSLQESGPEKDGLEL

Query:  VSGRRHVERLDYKKLHDETYGNLPTDSSDDTYGSVSMDSSDDRGRGRSTRKKSFKNPVPESSKNGTDDLNDIKTKHNSKKTRQKPAAENMNNSVIKTPEY
        VSGRRHVERLDYKKLHDET+GN+P++SSDDTYGS S+DSSDDRGRGRSTRK S KN VP  S+NGTDD  +IKTK +S++TRQKPAAENM+NSV KTPE 
Subjt:  VSGRRHVERLDYKKLHDETYGNLPTDSSDDTYGSVSMDSSDDRGRGRSTRKKSFKNPVPESSKNGTDDLNDIKTKHNSKKTRQKPAAENMNNSVIKTPEY

Query:  TSKSGSSVRRTTSSSQRRLNQPALERLIASFKENQYPTRATKESLAQELGLCPKQVSKWFQNTRWSTRHPYSETNKAKSATRMDIQSSQKSRKLPNPEKE
        T KS SSVRRTTSSS RRL+QP LERL+ASF+ENQYP RATKESLA+ELGL  KQVSKWF+NTRWSTRHP SE NKAKSA+RM  QSSQ SRK P PE+E
Subjt:  TSKSGSSVRRTTSSSQRRLNQPALERLIASFKENQYPTRATKESLAQELGLCPKQVSKWFQNTRWSTRHPYSETNKAKSATRMDIQSSQKSRKLPNPEKE

Query:  SSECLRNMESNGAHHQESPMPISVVAPCQSGDTGDDKLVKKKTKRPESTTVKSRKRKG----VASRSKHGKESQKRPSKSPKLNENQTQTVDKVKKRRRK
        S  C R+  SNGA HQESP  I+VVAPCQSG TGDDKL   KTKRPEST  KSRKRKG    VASRSK+ K+S+K P+KS K++E   QT DKVKKRRRK
Subjt:  SSECLRNMESNGAHHQESPMPISVVAPCQSGDTGDDKLVKKKTKRPESTTVKSRKRKG----VASRSKHGKESQKRPSKSPKLNENQTQTVDKVKKRRRK

Query:  SI
        S+
Subjt:  SI

SwissProt top hitse value%identityAlignment
P46605 Homeobox protein HOX1A5.6e-9337.83Show/hide
Query:  SLRSRVSSDRVLRSRTQEKSKAPKPSNDLNNFTAGEEGKRKKRKKKRNIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEK
        +L S  S  RVLRS +  K+ + +      +  A  +   K+RK  R      +  DEFS IR  +RY++NR+ YEQSLIEAY+SEGWK  S DK++PEK
Subjt:  SLRSRVSSDRVLRSRTQEKSKAPKPSNDLNNFTAGEEGKRKKRKKKRNIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEK

Query:  ELQRASNEIMRRKLKIRDLFQRIDTLCAEGRLPQSLFDSEGQIDSEDIFCAKCGSKELSCPENDIILCDGVCERGFHLFCLDPPLQKTDIPPDDEGWLCP
        EL+RA +EI+R KL+IR++F+ ID+L ++G++ ++LFDSEG+I  EDIFC+ CGS + +   NDIILCDG C+RGFH  CL+PPL+  DIP  DEGWLCP
Subjt:  ELQRASNEIMRRKLKIRDLFQRIDTLCAEGRLPQSLFDSEGQIDSEDIFCAKCGSKELSCPENDIILCDGVCERGFHLFCLDPPLQKTDIPPDDEGWLCP

Query:  GCDCKDDCMDVLNDFQGSNLSITDGWEKVYPEAAPAAGQN-SDHALGLPSDDSEDGDYDPDVPD--IIDKDGDSSSDD----SSSDKSGYVSASEELEA-
         CDCK DC+D++N+  GSN+SI D WEKV+P+AA  A  +  D A  LPSDDS+D D+DP++P+  ++ KD +SS +D    S SD S +++ S++ E  
Subjt:  GCDCKDDCMDVLNDFQGSNLSITDGWEKVYPEAAPAAGQN-SDHALGLPSDDSEDGDYDPDVPD--IIDKDGDSSSDD----SSSDKSGYVSASEELEA-

Query:  --QPNDDQYLGLPSDDSEDDDYDPSAPERAESVGQESSN--SGFTSDSDDL------AALDDIGSLNGTMPVRNSNEKSSRCASNKNALHNELSSLQESG
          +  DD  L LPS+DSEDDDYDP+ P+  + V ++SS+  S FTSDSDD       +  D++ S           EK +  A   ++  + + +  + G
Subjt:  --QPNDDQYLGLPSDDSEDDDYDPSAPERAESVGQESSN--SGFTSDSDDL------AALDDIGSLNGTMPVRNSNEKSSRCASNKNALHNELSSLQESG

Query:  PEKDGLELVSGRRHVERLDYKKLHDETYGNLPTDSSDDTYGSVSMDSSDDRGRGRSTRKKSFKNPVPESSKNGTDDLNDIK----TKHNSKKTRQ--KPA
                 S RR  ERLDYKKL+DE YG   +DSSD           D+   G++T       P+ +S++ G  +    K      HN + T Q  K +
Subjt:  PEKDGLELVSGRRHVERLDYKKLHDETYGNLPTDSSDDTYGSVSMDSSDDRGRGRSTRKKSFKNPVPESSKNGTDDLNDIK----TKHNSKKTRQ--KPA

Query:  AENMNNSVIKTPEYTSKSGSSVRRTTSSSQRRLNQPAL-ERLIASFKENQYPTRATKESLAQELGLCPKQVSKWFQNTRWSTRHPYSETNKAKSATRMDI
          +++ SV + P   + +GS+     S++++    P + ++L   FK   YP+R+ KESLA+ELGL  +QV+KWF+  R S R   + + K  S  +   
Subjt:  AENMNNSVIKTPEYTSKSGSSVRRTTSSSQRRLNQPAL-ERLIASFKENQYPTRATKESLAQELGLCPKQVSKWFQNTRWSTRHPYSETNKAKSATRMDI

Query:  QSSQK----SRKLPNPE----KESSECLRNMESNGAHHQESPMPISVVAPCQSGDTGDDKLVKKKTKRPESTTVKSRKRKGVASRSKHGK
        Q++      S +   PE    +ES+ CL    + G    +  +   V +     D G  K+   + + P     +  ++K +    +  K
Subjt:  QSSQK----SRKLPNPE----KESSECLRNMESNGAHHQESPMPISVVAPCQSGDTGDDKLVKKKTKRPESTTVKSRKRKGVASRSKHGK

P48785 Pathogenesis-related homeodomain protein3.1e-5130.58Show/hide
Query:  ISKKKNCSLRSRVSSDRVL-RSRTQEKSKAPKPSNDLNNFTAGEEGKRKKRKKKRNIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFS
        +S K+N     R + + +  +SRT++ S+      ++      +  K +KRK KR  K     VD+   ++   RYL+ ++K +Q+LI+AY++EGWKG S
Subjt:  ISKKKNCSLRSRVSSDRVL-RSRTQEKSKAPKPSNDLNNFTAGEEGKRKKRKKKRNIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFS

Query:  SDKLKPEKELQRASNEIMRRKLKIRDLFQRIDTLCAEGRLPQSLFDSEGQIDSEDIFCAKCGSKELSCPENDIILCDGVCERGFHLFCLDPPLQKTDIPP
         +K++P+KEL+RA  EI+  KL +RD  +++D L + G + + +  S+G I  + IFCA+C S+E + P+NDIILCDG C R FH  CLDPPL+   IPP
Subjt:  SDKLKPEKELQRASNEIMRRKLKIRDLFQRIDTLCAEGRLPQSLFDSEGQIDSEDIFCAKCGSKELSCPENDIILCDGVCERGFHLFCLDPPLQKTDIPP

Query:  DDEGWLCPGCDCKDDCMDVLNDFQGSNLSITDGWEKVYPEAA--PAAGQNS-DHALGLPSDDSEDGDYDPDVPDIIDKDGDSSSDDSSSDKSGYVSASEE
         D+GW C  CDCK + +D +N   G++  +   W+ ++ E A  P   + + ++    PSDDS+D DYDP++ +    +G  +S + S D  G       
Subjt:  DDEGWLCPGCDCKDDCMDVLNDFQGSNLSITDGWEKVYPEAA--PAAGQNS-DHALGLPSDDSEDGDYDPDVPDIIDKDGDSSSDDSSSDKSGYVSASEE

Query:  LEAQPNDDQYLGLPSDDSEDDDYDPSAPERAESVGQESSNSGFTSDSDDLAALDDIGSLNGTMPVRNSNEKSSRCASNKNALHNELSSLQESGPEKDGLE
                           D+D             +ES ++  +  SD +A     GS  G                      + LS++ E   E    E
Subjt:  LEAQPNDDQYLGLPSDDSEDDDYDPSAPERAESVGQESSNSGFTSDSDDLAALDDIGSLNGTMPVRNSNEKSSRCASNKNALHNELSSLQESGPEKDGLE

Query:  LVSGRRHVERLDYKKLHDETYGNLPTDSSDDTYGSVSMD--SSDDRGRGRSTRKKSFKNPVPESSKNGTDDLNDIKTKHNSKKTRQKPAAENMNNSVIKT
         V G R    +DY +L+ E +G    D+     GS   D   +D R R R +   S    + ESSK   D                          V++T
Subjt:  LVSGRRHVERLDYKKLHDETYGNLPTDSSDDTYGSVSMD--SSDDRGRGRSTRKKSFKNPVPESSKNGTDDLNDIKTKHNSKKTRQKPAAENMNNSVIKT

Query:  PEYTSKSGSSVRRTTSSSQR-RLNQPALERLIASFKENQYPTRATKESLAQELGLCPKQVSKWFQNTRW
         E + +   SV       +  RL + A+E+L   F E + P++A ++ LA+EL L P++V+KWF+NTR+
Subjt:  PEYTSKSGSSVRRTTSSSQR-RLNQPALERLIASFKENQYPTRATKESLAQELGLCPKQVSKWFQNTRW

P48786 Pathogenesis-related homeodomain protein8.9e-11546.43Show/hide
Query:  LISKKKNCSLRSRVSSDRVLRSRTQEKSKAPKPSNDLNNFTAGEEGKRKKRKKKRNIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFS
        ++ +K   S    V+S R LRSR+QEKS  P    D+NN  A E   R+K +KKR  + +  RVDEF  IR HLRYL++RIKYE++ ++AYS EGWKG S
Subjt:  LISKKKNCSLRSRVSSDRVLRSRTQEKSKAPKPSNDLNNFTAGEEGKRKKRKKKRNIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFS

Query:  SDKLKPEKELQRASNEIMRRKLKIRDLFQRIDTLCAEGRLPQSLFDSEGQIDSEDIFCAKCGSKELSCPENDIILCDGVCERGFHLFCLDPPLQKTDIPP
         DK+KPEKEL+RA  EI  RKLKIRDLFQR+D   +EGRLP+ LFDS G+IDSEDIFCAKCGSK+++   NDIILCDG C+RGFH FCLDPPL K  IPP
Subjt:  SDKLKPEKELQRASNEIMRRKLKIRDLFQRIDTLCAEGRLPQSLFDSEGQIDSEDIFCAKCGSKELSCPENDIILCDGVCERGFHLFCLDPPLQKTDIPP

Query:  DDEGWLCPGCDCKDDCMDVLNDFQGSNLSITDGWEKVYPE--AAPAAGQNSDHALGLPSDDSEDGDYDPDVPDIIDKDGDSSSDDSSSDKSGYVSASEEL
        DDEGWLCPGC+CK DC+ +LND Q +N+ + D WEKV+ E  AA A+G+N D   GLPSDDSED DYDP  PD+ +K      DDSS+D+S Y S S+++
Subjt:  DDEGWLCPGCDCKDDCMDVLNDFQGSNLSITDGWEKVYPE--AAPAAGQNSDHALGLPSDDSEDGDYDPDVPDIIDKDGDSSSDDSSSDKSGYVSASEEL

Query:  EAQPNDDQYLGLPSDDSEDDDYDPSAPERAESVGQESSNSGFTSDSDDLAALDDIGSLNGTMPVRNSNEKSSRCASNKNALHNELSSLQESGPEKDGLEL
        +     +   GLPSDDSEDD+YDPS     + + ++SS S FTSDS+D   + D          +++ +     AS  + + N         PE+     
Subjt:  EAQPNDDQYLGLPSDDSEDDDYDPSAPERAESVGQESSNSGFTSDSDDLAALDDIGSLNGTMPVRNSNEKSSRCASNKNALHNELSSLQESGPEKDGLEL

Query:  VSGRRHVERLDYKKLHDETYGN--------------LPTDSSDDTYGSVSMDSSDDRGRGRSTRKKSFKNPVPESSKNGTD--DLN-DIKTKHNSKKTR-
        +  RR VE LDYKKL+D  +                +    + + YG+ S DSSD+     S+  K+  +    + + G +  DL  D K + ++   R 
Subjt:  VSGRRHVERLDYKKLHDETYGN--------------LPTDSSDDTYGSVSMDSSDDRGRGRSTRKKSFKNPVPESSKNGTD--DLN-DIKTKHNSKKTR-

Query:  -QKPAAENMNNSVIKTPEYTSKSGSSVRRTTSSSQRRLNQPALERLIASFKENQYPTRATKESLAQELGLCPKQVSKWFQNTRWSTRH
         +K A E  ++ + ++ E    S + V  + S+S+    + A +RL+ SFKENQYP RA KESLA EL L  +QVS WF N RWS RH
Subjt:  -QKPAAENMNNSVIKTPEYTSKSGSSVRRTTSSSQRRLNQPALERLIASFKENQYPTRATKESLAQELGLCPKQVSKWFQNTRWSTRH

Q04996 Homeobox protein HAT3.11.3e-11345.67Show/hide
Query:  RTQEKSKAPKPSNDLNNFTAGEEGKRKKRKKKRNIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKL
        R Q   +   PS+ + N T    G+ KK+ K  N KG+    DE++ I+  LRY +NRI YEQSLI+AYS EGWKG S +K++PEKEL+RA+ EI+RRKL
Subjt:  RTQEKSKAPKPSNDLNNFTAGEEGKRKKRKKKRNIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKL

Query:  KIRDLFQRIDTLCAEGRLPQSLFDSEGQIDSEDIFCAKCGSKELSCPENDIILCDGVCERGFHLFCLDPPLQKTDIPPDDEGWLCPGCDCKDDCMDVLND
        KIRDLFQ +DTLCAEG LP+SLFD++G+I SEDIFCAKCGSK+LS  +NDIILCDG C+RGFH +CL+PPL+K DIPPDDEGWLCPGCDCKDD +D+LND
Subjt:  KIRDLFQRIDTLCAEGRLPQSLFDSEGQIDSEDIFCAKCGSKELSCPENDIILCDGVCERGFHLFCLDPPLQKTDIPPDDEGWLCPGCDCKDDCMDVLND

Query:  FQGSNLSITDGWEKVYPEAAPA---AGQNSDHALGLPSDDSEDGDYDPDVPDIIDKDGDSSSD-------DSSSDKSGYVSASEEL-----EAQPNDDQY
          G+  S++D WEK++PEAA A    GQN D    LPSDDS+D +YDPD  +  + D D S D       D SSD++ + SAS+E+     E +      
Subjt:  FQGSNLSITDGWEKVYPEAAPA---AGQNSDHALGLPSDDSEDGDYDPDVPDIIDKDGDSSSD-------DSSSDKSGYVSASEEL-----EAQPNDDQY

Query:  LGLPSDDSEDDDYDPSAPERAESVGQESSNSGFTSDSDDLAAL---DDIGSLNGTMPVRNSNEKSSRCASNKNALHNELSSLQESGPEKDGLELVSGRRH
        + LPSDDSEDDDYDP AP   +   +ESSNS  TSD++DL      D+        P+ +   ++S+   +        + L+      DG   VS RR+
Subjt:  LGLPSDDSEDDDYDPSAPERAESVGQESSNSGFTSDSDDLAAL---DDIGSLNGTMPVRNSNEKSSRCASNKNALHNELSSLQESGPEKDGLELVSGRRH

Query:  VERLDYKKLHDETYGNLPTDSSDDTYGSVSMDSSDDRGRGRSTRKKSFKNPVPESSKNGTDDLNDIKTKHNSKKTRQKPAAENMNNSVIKTPEYTSKSGS
        VERLDYKKL+DE Y N+PT SSDD       D +   G+   +  +   + VP    +  +D       H SKK  +K    +  +++    E   ++G 
Subjt:  VERLDYKKLHDETYGNLPTDSSDDTYGSVSMDSSDDRGRGRSTRKKSFKNPVPESSKNGTDDLNDIKTKHNSKKTRQKPAAENMNNSVIKTPEYTSKSGS

Query:  SVR-RTTSSSQRRLNQPALERLIASFKENQYPTRATKESLAQELGLCPKQVSKWFQNTRWS--TRHPYSETN--KAKSATRMDIQSSQKSRKLPNPEKES
        S     +SSS  +   P  +RL  SF+ENQYP +ATKESLA+EL +  KQV+ WF++ RWS  ++   SE N  K K+    + ++S         E ES
Subjt:  SVR-RTTSSSQRRLNQPALERLIASFKENQYPTRATKESLAQELGLCPKQVSKWFQNTRWS--TRHPYSETN--KAKSATRMDIQSSQKSRKLPNPEKES

Q8H991 Homeobox protein HAZ11.7e-9740.16Show/hide
Query:  ISKKKNCSLRSRVSSDRVLRSRTQEKSKAPKPSNDLNNFTAGEEGKRKKRKKKRNIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSS
        ++KK+  S   R +  RVLRS +++K+KA    N+L N  AG +   KKRK  R  KG G   D++  IR  +RY++NR+ YEQSLI+AY+SEGWKG S 
Subjt:  ISKKKNCSLRSRVSSDRVLRSRTQEKSKAPKPSNDLNNFTAGEEGKRKKRKKKRNIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSS

Query:  DKLKPEKELQRASNEIMRRKLKIRDLFQRIDTLCAEGRLPQSLFDSEGQIDSEDIFCAKCGSKELSCPENDIILCDGVCERGFHLFCLDPPLQKTDIPPD
        +K++PEKEL+RA  EI+R K +IR+ F+ +D+L +EG+L +S+FDS G+I SEDIFCA CGSK+++  +NDIILCDG+C+RGFH +CL+PPL   DIP  
Subjt:  DKLKPEKELQRASNEIMRRKLKIRDLFQRIDTLCAEGRLPQSLFDSEGQIDSEDIFCAKCGSKELSCPENDIILCDGVCERGFHLFCLDPPLQKTDIPPD

Query:  DEGWLCPGCDCKDDCMDVLNDFQGSNLSITDGWEKVYPEAAP-AAGQNSDHALGLPSDDSEDGDYDP----------------DVPDIIDKDGDSSSDDS
        DEGWLCP CDCK DC+DVLN+ QG  LSI D WEKV+PEAA    G     A  LPSDDS D DYDP                D  + +D D  SS D  
Subjt:  DEGWLCPGCDCKDDCMDVLNDFQGSNLSITDGWEKVYPEAAP-AAGQNSDHALGLPSDDSEDGDYDP----------------DVPDIIDKDGDSSSDDS

Query:  SSDKSGYVSASEELEAQPNDDQYLGLPSDDSEDDDYDPSAP-----ERAESVGQESSNSGFTSDSDDLAA-------LDDIGSLNGTMPVRNSNEKSSRC
        SS+K     +      +  DD  LGLPS+DSED D+DP+ P     +  ES   +S  S FTSDSDD  A        D+I   + +          S  
Subjt:  SSDKSGYVSASEELEAQPNDDQYLGLPSDDSEDDDYDPSAP-----ERAESVGQESSNSGFTSDSDDLAA-------LDDIGSLNGTMPVRNSNEKSSRC

Query:  ASNKNALHNELSSLQESGPEKDGLELVSGRRHVERLDYKKLHDETYGNLPTDSSDDT--YGSVSMDSSDDRGRGRSTRKKSFKNPVPESSKNGTDDLNDI
            NA ++ L +  E+  E+D +  +S +R VERLDYKKL++E YG   +DSSDD   YG    +S+ ++G    +   S    + ES + G       
Subjt:  ASNKNALHNELSSLQESGPEKDGLELVSGRRHVERLDYKKLHDETYGNLPTDSSDDT--YGSVSMDSSDDRGRGRSTRKKSFKNPVPESSKNGTDDLNDI

Query:  KTKHNSKKTRQKPAAENMNNSVIKTPEYTSKSGSSVRRTTSSSQRRLNQPALERLIASFKENQYPTRATKESLAQELGLCPKQVSKWFQNTRWSTRHPYS
          ++++ +   +      + S  +T    S S  S     ++  R       ++L A FKE+ YP+RATKE+LAQELGL   QV+KWF +TR   R   +
Subjt:  KTKHNSKKTRQKPAAENMNNSVIKTPEYTSKSGSSVRRTTSSSQRRLNQPALERLIASFKENQYPTRATKESLAQELGLCPKQVSKWFQNTRWSTRHPYS

Query:  ETNKAKSATRMDIQSSQKSRKLPNPEKESSECLRNMESNGAHHQE
         T K  +      +++  +  + + +   S  + +++ N    +E
Subjt:  ETNKAKSATRMDIQSSQKSRKLPNPEKESSECLRNMESNGAHHQE

Arabidopsis top hitse value%identityAlignment
AT3G19510.1 Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain9.2e-11545.67Show/hide
Query:  RTQEKSKAPKPSNDLNNFTAGEEGKRKKRKKKRNIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKL
        R Q   +   PS+ + N T    G+ KK+ K  N KG+    DE++ I+  LRY +NRI YEQSLI+AYS EGWKG S +K++PEKEL+RA+ EI+RRKL
Subjt:  RTQEKSKAPKPSNDLNNFTAGEEGKRKKRKKKRNIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKL

Query:  KIRDLFQRIDTLCAEGRLPQSLFDSEGQIDSEDIFCAKCGSKELSCPENDIILCDGVCERGFHLFCLDPPLQKTDIPPDDEGWLCPGCDCKDDCMDVLND
        KIRDLFQ +DTLCAEG LP+SLFD++G+I SEDIFCAKCGSK+LS  +NDIILCDG C+RGFH +CL+PPL+K DIPPDDEGWLCPGCDCKDD +D+LND
Subjt:  KIRDLFQRIDTLCAEGRLPQSLFDSEGQIDSEDIFCAKCGSKELSCPENDIILCDGVCERGFHLFCLDPPLQKTDIPPDDEGWLCPGCDCKDDCMDVLND

Query:  FQGSNLSITDGWEKVYPEAAPA---AGQNSDHALGLPSDDSEDGDYDPDVPDIIDKDGDSSSD-------DSSSDKSGYVSASEEL-----EAQPNDDQY
          G+  S++D WEK++PEAA A    GQN D    LPSDDS+D +YDPD  +  + D D S D       D SSD++ + SAS+E+     E +      
Subjt:  FQGSNLSITDGWEKVYPEAAPA---AGQNSDHALGLPSDDSEDGDYDPDVPDIIDKDGDSSSD-------DSSSDKSGYVSASEEL-----EAQPNDDQY

Query:  LGLPSDDSEDDDYDPSAPERAESVGQESSNSGFTSDSDDLAAL---DDIGSLNGTMPVRNSNEKSSRCASNKNALHNELSSLQESGPEKDGLELVSGRRH
        + LPSDDSEDDDYDP AP   +   +ESSNS  TSD++DL      D+        P+ +   ++S+   +        + L+      DG   VS RR+
Subjt:  LGLPSDDSEDDDYDPSAPERAESVGQESSNSGFTSDSDDLAAL---DDIGSLNGTMPVRNSNEKSSRCASNKNALHNELSSLQESGPEKDGLELVSGRRH

Query:  VERLDYKKLHDETYGNLPTDSSDDTYGSVSMDSSDDRGRGRSTRKKSFKNPVPESSKNGTDDLNDIKTKHNSKKTRQKPAAENMNNSVIKTPEYTSKSGS
        VERLDYKKL+DE Y N+PT SSDD       D +   G+   +  +   + VP    +  +D       H SKK  +K    +  +++    E   ++G 
Subjt:  VERLDYKKLHDETYGNLPTDSSDDTYGSVSMDSSDDRGRGRSTRKKSFKNPVPESSKNGTDDLNDIKTKHNSKKTRQKPAAENMNNSVIKTPEYTSKSGS

Query:  SVR-RTTSSSQRRLNQPALERLIASFKENQYPTRATKESLAQELGLCPKQVSKWFQNTRWS--TRHPYSETN--KAKSATRMDIQSSQKSRKLPNPEKES
        S     +SSS  +   P  +RL  SF+ENQYP +ATKESLA+EL +  KQV+ WF++ RWS  ++   SE N  K K+    + ++S         E ES
Subjt:  SVR-RTTSSSQRRLNQPALERLIASFKENQYPTRATKESLAQELGLCPKQVSKWFQNTRWS--TRHPYSETN--KAKSATRMDIQSSQKSRKLPNPEKES

AT4G29940.1 pathogenesis related homeodomain protein A2.2e-5230.58Show/hide
Query:  ISKKKNCSLRSRVSSDRVL-RSRTQEKSKAPKPSNDLNNFTAGEEGKRKKRKKKRNIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFS
        +S K+N     R + + +  +SRT++ S+      ++      +  K +KRK KR  K     VD+   ++   RYL+ ++K +Q+LI+AY++EGWKG S
Subjt:  ISKKKNCSLRSRVSSDRVL-RSRTQEKSKAPKPSNDLNNFTAGEEGKRKKRKKKRNIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFS

Query:  SDKLKPEKELQRASNEIMRRKLKIRDLFQRIDTLCAEGRLPQSLFDSEGQIDSEDIFCAKCGSKELSCPENDIILCDGVCERGFHLFCLDPPLQKTDIPP
         +K++P+KEL+RA  EI+  KL +RD  +++D L + G + + +  S+G I  + IFCA+C S+E + P+NDIILCDG C R FH  CLDPPL+   IPP
Subjt:  SDKLKPEKELQRASNEIMRRKLKIRDLFQRIDTLCAEGRLPQSLFDSEGQIDSEDIFCAKCGSKELSCPENDIILCDGVCERGFHLFCLDPPLQKTDIPP

Query:  DDEGWLCPGCDCKDDCMDVLNDFQGSNLSITDGWEKVYPEAA--PAAGQNS-DHALGLPSDDSEDGDYDPDVPDIIDKDGDSSSDDSSSDKSGYVSASEE
         D+GW C  CDCK + +D +N   G++  +   W+ ++ E A  P   + + ++    PSDDS+D DYDP++ +    +G  +S + S D  G       
Subjt:  DDEGWLCPGCDCKDDCMDVLNDFQGSNLSITDGWEKVYPEAA--PAAGQNS-DHALGLPSDDSEDGDYDPDVPDIIDKDGDSSSDDSSSDKSGYVSASEE

Query:  LEAQPNDDQYLGLPSDDSEDDDYDPSAPERAESVGQESSNSGFTSDSDDLAALDDIGSLNGTMPVRNSNEKSSRCASNKNALHNELSSLQESGPEKDGLE
                           D+D             +ES ++  +  SD +A     GS  G                      + LS++ E   E    E
Subjt:  LEAQPNDDQYLGLPSDDSEDDDYDPSAPERAESVGQESSNSGFTSDSDDLAALDDIGSLNGTMPVRNSNEKSSRCASNKNALHNELSSLQESGPEKDGLE

Query:  LVSGRRHVERLDYKKLHDETYGNLPTDSSDDTYGSVSMD--SSDDRGRGRSTRKKSFKNPVPESSKNGTDDLNDIKTKHNSKKTRQKPAAENMNNSVIKT
         V G R    +DY +L+ E +G    D+     GS   D   +D R R R +   S    + ESSK   D                          V++T
Subjt:  LVSGRRHVERLDYKKLHDETYGNLPTDSSDDTYGSVSMD--SSDDRGRGRSTRKKSFKNPVPESSKNGTDDLNDIKTKHNSKKTRQKPAAENMNNSVIKT

Query:  PEYTSKSGSSVRRTTSSSQR-RLNQPALERLIASFKENQYPTRATKESLAQELGLCPKQVSKWFQNTRW
         E + +   SV       +  RL + A+E+L   F E + P++A ++ LA+EL L P++V+KWF+NTR+
Subjt:  PEYTSKSGSSVRRTTSSSQR-RLNQPALERLIASFKENQYPTRATKESLAQELGLCPKQVSKWFQNTRW

AT4G29940.2 pathogenesis related homeodomain protein A2.2e-5230.58Show/hide
Query:  ISKKKNCSLRSRVSSDRVL-RSRTQEKSKAPKPSNDLNNFTAGEEGKRKKRKKKRNIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFS
        +S K+N     R + + +  +SRT++ S+      ++      +  K +KRK KR  K     VD+   ++   RYL+ ++K +Q+LI+AY++EGWKG S
Subjt:  ISKKKNCSLRSRVSSDRVL-RSRTQEKSKAPKPSNDLNNFTAGEEGKRKKRKKKRNIKGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFS

Query:  SDKLKPEKELQRASNEIMRRKLKIRDLFQRIDTLCAEGRLPQSLFDSEGQIDSEDIFCAKCGSKELSCPENDIILCDGVCERGFHLFCLDPPLQKTDIPP
         +K++P+KEL+RA  EI+  KL +RD  +++D L + G + + +  S+G I  + IFCA+C S+E + P+NDIILCDG C R FH  CLDPPL+   IPP
Subjt:  SDKLKPEKELQRASNEIMRRKLKIRDLFQRIDTLCAEGRLPQSLFDSEGQIDSEDIFCAKCGSKELSCPENDIILCDGVCERGFHLFCLDPPLQKTDIPP

Query:  DDEGWLCPGCDCKDDCMDVLNDFQGSNLSITDGWEKVYPEAA--PAAGQNS-DHALGLPSDDSEDGDYDPDVPDIIDKDGDSSSDDSSSDKSGYVSASEE
         D+GW C  CDCK + +D +N   G++  +   W+ ++ E A  P   + + ++    PSDDS+D DYDP++ +    +G  +S + S D  G       
Subjt:  DDEGWLCPGCDCKDDCMDVLNDFQGSNLSITDGWEKVYPEAA--PAAGQNS-DHALGLPSDDSEDGDYDPDVPDIIDKDGDSSSDDSSSDKSGYVSASEE

Query:  LEAQPNDDQYLGLPSDDSEDDDYDPSAPERAESVGQESSNSGFTSDSDDLAALDDIGSLNGTMPVRNSNEKSSRCASNKNALHNELSSLQESGPEKDGLE
                           D+D             +ES ++  +  SD +A     GS  G                      + LS++ E   E    E
Subjt:  LEAQPNDDQYLGLPSDDSEDDDYDPSAPERAESVGQESSNSGFTSDSDDLAALDDIGSLNGTMPVRNSNEKSSRCASNKNALHNELSSLQESGPEKDGLE

Query:  LVSGRRHVERLDYKKLHDETYGNLPTDSSDDTYGSVSMD--SSDDRGRGRSTRKKSFKNPVPESSKNGTDDLNDIKTKHNSKKTRQKPAAENMNNSVIKT
         V G R    +DY +L+ E +G    D+     GS   D   +D R R R +   S    + ESSK   D                          V++T
Subjt:  LVSGRRHVERLDYKKLHDETYGNLPTDSSDDTYGSVSMD--SSDDRGRGRSTRKKSFKNPVPESSKNGTDDLNDIKTKHNSKKTRQKPAAENMNNSVIKT

Query:  PEYTSKSGSSVRRTTSSSQR-RLNQPALERLIASFKENQYPTRATKESLAQELGLCPKQVSKWFQNTRW
         E + +   SV       +  RL + A+E+L   F E + P++A ++ LA+EL L P++V+KWF+NTR+
Subjt:  PEYTSKSGSSVRRTTSSSQR-RLNQPALERLIASFKENQYPTRATKESLAQELGLCPKQVSKWFQNTRW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAACGGAGCCCAGAACGAGAGGGGGGGCGGAGCGGCGGAGCGGAAGCCAGACGGTGGCCCAGATTTGGAAACAACCACCATCGCCAAAACCTCAACTCTTGTCTC
GCGAAACTCCATTAACGACCACCTTCAAGATCCAGAATCGCCTTCTTCCATGGACAATACCGTGCTCTTTCATCAACTCCAGCTTATGGTTAGGCTTTTTGTAATTCTCT
CCCTTTCCTTCCGAAGTCGTCATCAGGGAAAGTTTTTTATGTTCTTTGGGGACAATATGGAAGAAAGAGATGAATATACAGAATCAAGGCCTGATAATTACTCTGAAGCC
ATACAAGAAGCCAAGGCCAGCGAAGTTGAAATGCGAACTTGTCTTTCAAATATGCAAATGCATTCAGTTCCTGACTATGAATTGGGAACAACTCCAGATAATATCAGAAG
AACTAGCTGTTCAGATAATGAAAAACCAGGGGTCCAGCAGAATATGGAGGAAAAGAATAAGGAACTTGGTTTGGGAGATGCAGTCAGAGAGTTATCAGAAAAAAGTAAGC
AAACTATCTCTAACCTTGCTGATAATGATCAAGATGAAGCCAGAAATTTATTATCAAGTGATAAAGATACTGAAAATTTAATTTTACCCATTGTAGTTGAGACAACAATT
CTCAATAAGTGCTCCAAACTTCCAACTGAAGATGACAACAAAAATTGTATCAAACAGATGAACCCTCTTAGAGAAGATTTAACCCCAAATACTTCTATCCATAATTTAGA
ATTCGTCCCTTATGATTCTCAACAATTGGACCACAAGGATAAGAAAGTTTTGATATCAAAAAAGAAAAATTGTTCGTTAAGGTCTCGTGTAAGTAGTGACAGAGTTTTAC
GTTCAAGGACCCAAGAGAAATCTAAAGCACCCAAACCAAGTAATGACTTGAATAATTTTACTGCCGGAGAGGAGGGAAAAAGGAAGAAGAGGAAGAAGAAGAGAAATATT
AAGGGAAAGGGAGCAAGAGTTGATGAGTTTTCATCAATCAGAAATCATTTGAGATATTTAGTGAACCGCATTAAATATGAACAGAGCTTGATTGAAGCTTACTCTAGTGA
AGGCTGGAAAGGGTTCAGCTCAGATAAATTGAAGCCCGAAAAGGAACTTCAACGGGCATCAAATGAAATAATGCGACGCAAATTGAAAATAAGAGATCTATTTCAACGTA
TTGATACACTTTGTGCTGAAGGAAGGCTTCCTCAATCTCTATTCGATTCTGAAGGGCAGATAGATAGTGAGGATATATTCTGTGCAAAATGTGGATCCAAGGAACTGTCC
TGCCCTGAGAATGACATCATACTATGTGATGGTGTTTGTGAACGTGGGTTCCACCTATTCTGTTTAGATCCACCTTTGCAGAAAACAGACATTCCACCGGATGATGAGGG
ATGGCTATGCCCCGGATGTGATTGCAAAGATGACTGCATGGATGTGCTTAATGATTTTCAAGGATCAAATCTTTCTATTACTGATGGTTGGGAGAAAGTCTATCCTGAGG
CTGCACCAGCTGCTGGACAAAATTCTGATCACGCCTTGGGTCTTCCTTCAGATGATTCTGAAGATGGTGATTATGATCCTGATGTTCCAGATATTATTGACAAGGATGGT
GATTCGAGTTCCGATGATTCGAGTTCTGATAAATCTGGGTATGTTTCTGCTTCTGAGGAATTGGAGGCTCAACCTAATGATGACCAGTACTTAGGTCTCCCTTCAGATGA
CTCAGAGGATGATGATTATGATCCCAGTGCTCCAGAACGCGCGGAAAGTGTTGGACAGGAAAGTTCAAATTCTGGCTTCACATCTGATTCTGATGATCTAGCTGCACTTG
ATGATATCGGGTCTTTAAATGGTACTATGCCTGTGAGAAACTCTAATGAAAAAAGTTCTAGATGTGCTTCTAACAAGAATGCACTACATAACGAGTTATCAAGTCTACAA
GAGTCAGGTCCCGAGAAGGATGGTCTTGAACTAGTTTCGGGAAGAAGACATGTGGAACGGTTGGATTACAAGAAGCTCCATGACGAAACATATGGGAATCTTCCAACCGA
CTCAAGTGACGACACATATGGTAGTGTTTCCATGGATTCAAGTGATGATAGAGGCCGTGGCAGGAGTACAAGGAAGAAAAGTTTTAAAAACCCGGTTCCTGAATCGTCGA
AAAATGGAACTGATGATTTGAATGATATAAAAACTAAACACAATTCTAAGAAAACTCGTCAAAAGCCAGCTGCTGAAAATATGAATAATTCTGTGATTAAAACTCCTGAA
TACACTTCAAAATCTGGTTCTTCTGTTAGACGAACCACTTCATCATCCCAAAGAAGACTCAATCAACCTGCATTGGAGAGACTTATTGCATCATTCAAAGAAAATCAGTA
TCCTACACGAGCTACAAAGGAGAGTTTAGCCCAAGAACTAGGGCTTTGTCCGAAGCAGGTTAGTAAATGGTTTCAGAACACACGTTGGAGCACACGCCATCCCTACAGCG
AAACTAATAAAGCAAAGAGTGCTACAAGAATGGATATTCAATCATCTCAGAAAAGTAGAAAACTTCCCAATCCTGAGAAAGAATCTAGTGAATGTTTGAGAAATATGGAG
AGCAATGGTGCTCATCATCAAGAATCACCAATGCCAATTAGTGTTGTGGCCCCATGTCAGAGTGGAGATACAGGGGATGACAAGTTGGTGAAGAAGAAAACTAAAAGACC
AGAATCTACTACTGTGAAATCCAGAAAACGGAAGGGCGTGGCATCCCGTTCGAAGCACGGAAAGGAATCACAAAAGCGTCCATCTAAGTCACCAAAACTTAATGAAAATC
AAACTCAAACAGTAGATAAGGTTAAGAAAAGGAGGAGGAAATCCATTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGAACGGAGCCCAGAACGAGAGGGGGGGCGGAGCGGCGGAGCGGAAGCCAGACGGTGGCCCAGATTTGGAAACAACCACCATCGCCAAAACCTCAACTCTTGTCTC
GCGAAACTCCATTAACGACCACCTTCAAGATCCAGAATCGCCTTCTTCCATGGACAATACCGTGCTCTTTCATCAACTCCAGCTTATGGTTAGGCTTTTTGTAATTCTCT
CCCTTTCCTTCCGAAGTCGTCATCAGGGAAAGTTTTTTATGTTCTTTGGGGACAATATGGAAGAAAGAGATGAATATACAGAATCAAGGCCTGATAATTACTCTGAAGCC
ATACAAGAAGCCAAGGCCAGCGAAGTTGAAATGCGAACTTGTCTTTCAAATATGCAAATGCATTCAGTTCCTGACTATGAATTGGGAACAACTCCAGATAATATCAGAAG
AACTAGCTGTTCAGATAATGAAAAACCAGGGGTCCAGCAGAATATGGAGGAAAAGAATAAGGAACTTGGTTTGGGAGATGCAGTCAGAGAGTTATCAGAAAAAAGTAAGC
AAACTATCTCTAACCTTGCTGATAATGATCAAGATGAAGCCAGAAATTTATTATCAAGTGATAAAGATACTGAAAATTTAATTTTACCCATTGTAGTTGAGACAACAATT
CTCAATAAGTGCTCCAAACTTCCAACTGAAGATGACAACAAAAATTGTATCAAACAGATGAACCCTCTTAGAGAAGATTTAACCCCAAATACTTCTATCCATAATTTAGA
ATTCGTCCCTTATGATTCTCAACAATTGGACCACAAGGATAAGAAAGTTTTGATATCAAAAAAGAAAAATTGTTCGTTAAGGTCTCGTGTAAGTAGTGACAGAGTTTTAC
GTTCAAGGACCCAAGAGAAATCTAAAGCACCCAAACCAAGTAATGACTTGAATAATTTTACTGCCGGAGAGGAGGGAAAAAGGAAGAAGAGGAAGAAGAAGAGAAATATT
AAGGGAAAGGGAGCAAGAGTTGATGAGTTTTCATCAATCAGAAATCATTTGAGATATTTAGTGAACCGCATTAAATATGAACAGAGCTTGATTGAAGCTTACTCTAGTGA
AGGCTGGAAAGGGTTCAGCTCAGATAAATTGAAGCCCGAAAAGGAACTTCAACGGGCATCAAATGAAATAATGCGACGCAAATTGAAAATAAGAGATCTATTTCAACGTA
TTGATACACTTTGTGCTGAAGGAAGGCTTCCTCAATCTCTATTCGATTCTGAAGGGCAGATAGATAGTGAGGATATATTCTGTGCAAAATGTGGATCCAAGGAACTGTCC
TGCCCTGAGAATGACATCATACTATGTGATGGTGTTTGTGAACGTGGGTTCCACCTATTCTGTTTAGATCCACCTTTGCAGAAAACAGACATTCCACCGGATGATGAGGG
ATGGCTATGCCCCGGATGTGATTGCAAAGATGACTGCATGGATGTGCTTAATGATTTTCAAGGATCAAATCTTTCTATTACTGATGGTTGGGAGAAAGTCTATCCTGAGG
CTGCACCAGCTGCTGGACAAAATTCTGATCACGCCTTGGGTCTTCCTTCAGATGATTCTGAAGATGGTGATTATGATCCTGATGTTCCAGATATTATTGACAAGGATGGT
GATTCGAGTTCCGATGATTCGAGTTCTGATAAATCTGGGTATGTTTCTGCTTCTGAGGAATTGGAGGCTCAACCTAATGATGACCAGTACTTAGGTCTCCCTTCAGATGA
CTCAGAGGATGATGATTATGATCCCAGTGCTCCAGAACGCGCGGAAAGTGTTGGACAGGAAAGTTCAAATTCTGGCTTCACATCTGATTCTGATGATCTAGCTGCACTTG
ATGATATCGGGTCTTTAAATGGTACTATGCCTGTGAGAAACTCTAATGAAAAAAGTTCTAGATGTGCTTCTAACAAGAATGCACTACATAACGAGTTATCAAGTCTACAA
GAGTCAGGTCCCGAGAAGGATGGTCTTGAACTAGTTTCGGGAAGAAGACATGTGGAACGGTTGGATTACAAGAAGCTCCATGACGAAACATATGGGAATCTTCCAACCGA
CTCAAGTGACGACACATATGGTAGTGTTTCCATGGATTCAAGTGATGATAGAGGCCGTGGCAGGAGTACAAGGAAGAAAAGTTTTAAAAACCCGGTTCCTGAATCGTCGA
AAAATGGAACTGATGATTTGAATGATATAAAAACTAAACACAATTCTAAGAAAACTCGTCAAAAGCCAGCTGCTGAAAATATGAATAATTCTGTGATTAAAACTCCTGAA
TACACTTCAAAATCTGGTTCTTCTGTTAGACGAACCACTTCATCATCCCAAAGAAGACTCAATCAACCTGCATTGGAGAGACTTATTGCATCATTCAAAGAAAATCAGTA
TCCTACACGAGCTACAAAGGAGAGTTTAGCCCAAGAACTAGGGCTTTGTCCGAAGCAGGTTAGTAAATGGTTTCAGAACACACGTTGGAGCACACGCCATCCCTACAGCG
AAACTAATAAAGCAAAGAGTGCTACAAGAATGGATATTCAATCATCTCAGAAAAGTAGAAAACTTCCCAATCCTGAGAAAGAATCTAGTGAATGTTTGAGAAATATGGAG
AGCAATGGTGCTCATCATCAAGAATCACCAATGCCAATTAGTGTTGTGGCCCCATGTCAGAGTGGAGATACAGGGGATGACAAGTTGGTGAAGAAGAAAACTAAAAGACC
AGAATCTACTACTGTGAAATCCAGAAAACGGAAGGGCGTGGCATCCCGTTCGAAGCACGGAAAGGAATCACAAAAGCGTCCATCTAAGTCACCAAAACTTAATGAAAATC
AAACTCAAACAGTAGATAAGGTTAAGAAAAGGAGGAGGAAATCCATTTAG
Protein sequenceShow/hide protein sequence
MENGAQNERGGGAAERKPDGGPDLETTTIAKTSTLVSRNSINDHLQDPESPSSMDNTVLFHQLQLMVRLFVILSLSFRSRHQGKFFMFFGDNMEERDEYTESRPDNYSEA
IQEAKASEVEMRTCLSNMQMHSVPDYELGTTPDNIRRTSCSDNEKPGVQQNMEEKNKELGLGDAVRELSEKSKQTISNLADNDQDEARNLLSSDKDTENLILPIVVETTI
LNKCSKLPTEDDNKNCIKQMNPLREDLTPNTSIHNLEFVPYDSQQLDHKDKKVLISKKKNCSLRSRVSSDRVLRSRTQEKSKAPKPSNDLNNFTAGEEGKRKKRKKKRNI
KGKGARVDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDTLCAEGRLPQSLFDSEGQIDSEDIFCAKCGSKELS
CPENDIILCDGVCERGFHLFCLDPPLQKTDIPPDDEGWLCPGCDCKDDCMDVLNDFQGSNLSITDGWEKVYPEAAPAAGQNSDHALGLPSDDSEDGDYDPDVPDIIDKDG
DSSSDDSSSDKSGYVSASEELEAQPNDDQYLGLPSDDSEDDDYDPSAPERAESVGQESSNSGFTSDSDDLAALDDIGSLNGTMPVRNSNEKSSRCASNKNALHNELSSLQ
ESGPEKDGLELVSGRRHVERLDYKKLHDETYGNLPTDSSDDTYGSVSMDSSDDRGRGRSTRKKSFKNPVPESSKNGTDDLNDIKTKHNSKKTRQKPAAENMNNSVIKTPE
YTSKSGSSVRRTTSSSQRRLNQPALERLIASFKENQYPTRATKESLAQELGLCPKQVSKWFQNTRWSTRHPYSETNKAKSATRMDIQSSQKSRKLPNPEKESSECLRNME
SNGAHHQESPMPISVVAPCQSGDTGDDKLVKKKTKRPESTTVKSRKRKGVASRSKHGKESQKRPSKSPKLNENQTQTVDKVKKRRRKSI