; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0012074 (gene) of Chayote v1 genome

Gene IDSed0012074
OrganismSechium edule (Chayote v1)
Descriptionhomeobox protein HAT3.1
Genome locationLG07:9603352..9611606
RNA-Seq ExpressionSed0012074
SyntenySed0012074
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0003677 - DNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001356 - Homeobox domain
IPR001965 - Zinc finger, PHD-type
IPR009057 - Homeobox-like domain superfamily
IPR011011 - Zinc finger, FYVE/PHD-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR017970 - Homeobox, conserved site
IPR019786 - Zinc finger, PHD-type, conserved site
IPR019787 - Zinc finger, PHD-finger


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600300.1 Homeobox protein HAZ1, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0065.98Show/hide
Query:  MEERNGYTESRPINNSEAVQEAKACVEVEVFTCLSNEQMPSVPEYQESGITPDYTYQTGGSDEEKLEVQQKMEEENKELGLRNALHELSEKCNQATSTHQ
        MEER+ YTESR IN S AVQEAKA VEVEV T L+NEQM S P Y E G   D+T +TG  DEEK  V                                
Subjt:  MEERNGYTESRPINNSEAVQEAKACVEVEVFTCLSNEQMPSVPEYQESGITPDYTYQTGGSDEEKLEVQQKMEEENKELGLRNALHELSEKCNQATSTHQ

Query:  IGGSDEEKPVVQQNMEEENKELDLGDALSELSEKNNQANSIHQTGGSNEEKRGVQLYMEESNKQCMEESNKQCMEESNKQNIEEANKQNMEEDNKELGLG
                                                                                              KQNMEED KELGLG
Subjt:  IGGSDEEKPVVQQNMEEENKELDLGDALSELSEKNNQANSIHQTGGSNEEKRGVQLYMEESNKQCMEESNKQCMEESNKQNIEEANKQNMEEDNKELGLG

Query:  DAVSELSEKSNHAISTHQTGGLDEEKPRVQHYMEEANKHNMELGLGDALGELSEKSNQAISNHANNDQVEAGNLLSNDKDTEHLILPIEVETTDLLNECS
        +A S                                               L E+S+Q IS  A+NDQ EAGNLLS+DKDTE+LILPIEVET  LLNECS
Subjt:  DAVSELSEKSNHAISTHQTGGLDEEKPRVQHYMEEANKHNMELGLGDALGELSEKSNQAISNHANNDQVEAGNLLSNDKDTEHLILPIEVETTDLLNECS

Query:  EPPMEDDNKSYIEQMNPSIEDLTPNTSIQNLEMVPSSSQQLDHEDRRILKSRKKN-LLKSPVSTDRVLRSRTQEKSKDSEPSIDLKSKDSEPSNDLNNVT
        EPP EDDNK+YIEQ NP IED   NTSI+NL MVP +S +L  +D+R+LKS+KKN +L+S VS+DRVLRSRTQ+           K+K  EPSNDL+NVT
Subjt:  EPPMEDDNKSYIEQMNPSIEDLTPNTSIQNLEMVPSSSQQLDHEDRRILKSRKKN-LLKSPVSTDRVLRSRTQEKSKDSEPSIDLKSKDSEPSNDLNNVT

Query:  AGEERKRKRKK--RSIKGEGARVDEFSSIRKHLRYLMNRINYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCREGRLS
        AGEE K K+KK  R IKG+GARVDEFSSIR HLRYL+NRI YEQ+LIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALC EGR S
Subjt:  AGEERKRKRKK--RSIKGEGARVDEFSSIRKHLRYLMNRINYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCREGRLS

Query:  ESLFDSEGQIDSEDIFCAKCGSDELSLENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAA
        E+LFDSEGQIDSEDIFC KCGS ELSLENDIILCDGVCDRGFHQFCLEPPLL +DIPPDDEGWLCPGCDCKDDC+DLLNEFQGSNLSITDGWEKV+PEAA
Subjt:  ESLFDSEGQIDSEDIFCAKCGSDELSLENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAA

Query:  AAAAGQSSDNALGLPSDDSEDGDYDPNVPDVPDAIDQE-----DESSSEKSSSDQSGY--ASASEQLEALPNDDQYLGLPSDDSEDEDFDPSAPERVKEV
        AAAAG+SSD+ + LPSDDS+DGDYD   PDVPDAIDQ+     D SSS++SSSD+SGY  ASASE+LEA PNDDQYLGLPSDDSED+D+DP AP R + V
Subjt:  AAAAGQSSDNALGLPSDDSEDGDYDPNVPDVPDAIDQE-----DESSSEKSSSDQSGY--ASASEQLEALPNDDQYLGLPSDDSEDEDFDPSAPERVKEV

Query:  GQESSSSDFTSDSEDLAALGDNGSSKDDNLVSSSINSTIPVRNSNGKSSGCGPSKSALHNELSSILESGLDMDGLEPVSGRRHVERLDYKKLHDEAYGNV
         QESSSSDFTSDSEDLAAL DNGSSKDDN+ SS +N+T+PVRNSNG+SSG GP+K+A HN+LSS++ SG D  GLE VSGRRHVERLDYKKLHDE +GNV
Subjt:  GQESSSSDFTSDSEDLAALGDNGSSKDDNLVSSSINSTIPVRNSNGKSSGCGPSKSALHNELSSILESGLDMDGLEPVSGRRHVERLDYKKLHDEAYGNV

Query:  HTDSSDDTYGSDSMDSSDDRGHGRSTRKRSSKNPVPALSKNGTNNLKNKKTKRSSKITCQKPVAENTDNSVARTPDNTLKSSSSVRRTTSSSHRRLSQQA
         TDSSDDTYGSDS+DSSDDRG GRSTRK S KNPVPALS+NGT++LKN KTKRSSK T QKP AEN DNSV +TP+ TLKSSSSVRRTTSSSHRRLSQ  
Subjt:  HTDSSDDTYGSDSMDSSDDRGHGRSTRKRSSKNPVPALSKNGTNNLKNKKTKRSSKITCQKPVAENTDNSVARTPDNTLKSSSSVRRTTSSSHRRLSQQA

Query:  LERLLASFQENQYPERATKESLAEELRLSPKQVSKWFENTRWSTRHPSSEANKAKSASRTGIQSSQTSGKLPKPEQESGACFRDNDSNGAQHQESPKPIS
        LERLLASFQENQYPERATKESLA EL LS KQVSKWFENTRWSTRHPSSEANKAKS SR G QSS+TS K PKPEQESGACFRD  SNGAQHQESPK IS
Subjt:  LERLLASFQENQYPERATKESLAEELRLSPKQVSKWFENTRWSTRHPSSEANKAKSASRTGIQSSQTSGKLPKPEQESGACFRDNDSNGAQHQESPKPIS

Query:  VVFPCQSGDAGDDKLANPKTKRPESTSGKSRKRKGRPDHTASRSKDRKESQKPPAKSRKVNETQTADKAR
        VV PCQSG  GDDKLAN K KRPEST+ KSRKRKGR D  ASRSKDRK+S+KPPAKS KV+E QTADK R
Subjt:  VVFPCQSGDAGDDKLANPKTKRPESTSGKSRKRKGRPDHTASRSKDRKESQKPPAKSRKVNETQTADKAR

KAG7030959.1 Homeobox protein HAZ1 [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0065.92Show/hide
Query:  MEERNGYTESRPINNSEAVQEAKACVEVEVFTCLSNEQMPSVPEYQESGITPDYTYQTGGSDEEKLEVQQKMEEENKELGLRNALHELSEKCNQATSTHQ
        MEER+ YTESR IN S AVQEAKA VEVEV T L+NEQM S P Y E G   D+T +TG  DEEK  V                                
Subjt:  MEERNGYTESRPINNSEAVQEAKACVEVEVFTCLSNEQMPSVPEYQESGITPDYTYQTGGSDEEKLEVQQKMEEENKELGLRNALHELSEKCNQATSTHQ

Query:  IGGSDEEKPVVQQNMEEENKELDLGDALSELSEKNNQANSIHQTGGSNEEKRGVQLYMEESNKQCMEESNKQCMEESNKQNIEEANKQNMEEDNKELGLG
                                                                                              KQNMEED KELGLG
Subjt:  IGGSDEEKPVVQQNMEEENKELDLGDALSELSEKNNQANSIHQTGGSNEEKRGVQLYMEESNKQCMEESNKQCMEESNKQNIEEANKQNMEEDNKELGLG

Query:  DAVSELSEKSNHAISTHQTGGLDEEKPRVQHYMEEANKHNMELGLGDALGELSEKSNQAISNHANNDQVEAGNLLSNDKDTEHLILPIEVETTDLLNECS
        +A S                                               L E+S+Q IS  A+NDQ EAGNLLS+DKDTE+LILPIEVET  LLNECS
Subjt:  DAVSELSEKSNHAISTHQTGGLDEEKPRVQHYMEEANKHNMELGLGDALGELSEKSNQAISNHANNDQVEAGNLLSNDKDTEHLILPIEVETTDLLNECS

Query:  EPPMEDDNKSYIEQMNPSIEDLTPNTSIQNLEMVPSSSQQLDHEDRRILKSRKKN-LLKSPVSTDRVLRSRTQEKSKDSEPSIDLKSKDSEPSNDLNNVT
        EPP EDDNK+YIEQ NP IED   NTSI+NL MVP +S +L  +D+R+L+S+KKN +L+S VS+DRVLRSRTQ+           K+K  EPSNDL+NVT
Subjt:  EPPMEDDNKSYIEQMNPSIEDLTPNTSIQNLEMVPSSSQQLDHEDRRILKSRKKN-LLKSPVSTDRVLRSRTQEKSKDSEPSIDLKSKDSEPSNDLNNVT

Query:  AGEERKRKRKK--RSIKGEGARVDEFSSIRKHLRYLMNRINYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCREGRLS
        AGEE K K+KK  R IKG+GARVDEFSSIR HLRYL+NRI YEQ+LIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALC EGR S
Subjt:  AGEERKRKRKK--RSIKGEGARVDEFSSIRKHLRYLMNRINYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCREGRLS

Query:  ESLFDSEGQIDSEDIFCAKCGSDELSLENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAA
        E+LFDSEGQIDSEDIFC KCGS ELSLENDIILCDGVCDRGFHQFCLEPPLL +DIPPDDEGWLCPGCDCKDDC+DLLNEFQGSNLSITDGWEKV+PEAA
Subjt:  ESLFDSEGQIDSEDIFCAKCGSDELSLENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAA

Query:  AAAAGQSSDNALGLPSDDSEDGDYDPNVPDVPDAIDQE-----DESSSEKSSSDQSGY--ASASEQLEALPNDDQYLGLPSDDSEDEDFDPSAPERVKEV
        AAAAG+SSD+ + LPSDDS+DGDYD   PDVPDAIDQ+     D SSS++SSSD+SGY  ASASE+LEA PNDDQYLGLPSDDSED+D+DP AP R + V
Subjt:  AAAAGQSSDNALGLPSDDSEDGDYDPNVPDVPDAIDQE-----DESSSEKSSSDQSGY--ASASEQLEALPNDDQYLGLPSDDSEDEDFDPSAPERVKEV

Query:  GQESSSSDFTSDSEDLAALGDNGSSKDDNLVSSSINSTIPVRNSNGKSSGCGPSKSALHNELSSILESGLDMDGLEPVSGRRHVERLDYKKLHDEAYGNV
         QESSSSDFTSDSEDLAAL DNGSSKDDN+ SS +N+T+PVRNSNG+SSG GP+K+A HN+LSS++ SG D  GLE VSGRRHVERLDYKKLHDE +GNV
Subjt:  GQESSSSDFTSDSEDLAALGDNGSSKDDNLVSSSINSTIPVRNSNGKSSGCGPSKSALHNELSSILESGLDMDGLEPVSGRRHVERLDYKKLHDEAYGNV

Query:  HTDSSDDTYGSDSMDSSDDRGHGRSTRKRSSKNPVPALSKNGTNNLKNKKTKRSSKITCQKPVAENTDNSVARTPDNTLKSSSSVRRTTSSSHRRLSQQA
         TDSSDDTYGSDS+DSSDDRG GRSTRK S KNPVPALS+NGT++LKN KTKRSSK T QKP AEN DNSV +TP+ TLKSSSSVRRTTSSSHRRLSQ  
Subjt:  HTDSSDDTYGSDSMDSSDDRGHGRSTRKRSSKNPVPALSKNGTNNLKNKKTKRSSKITCQKPVAENTDNSVARTPDNTLKSSSSVRRTTSSSHRRLSQQA

Query:  LERLLASFQENQYPERATKESLAEELRLSPKQVSKWFENTRWSTRHPSSEANKAKSASRTGIQSSQTSGKLPKPEQESGACFRDNDSNGAQHQESPKPIS
        LERLLASFQENQYPERATKESLA EL LS KQVSKWFENTRWSTRHPSSEANKAKS SR G QSSQTS K PKPEQESGACFRD  SNGAQHQESPK IS
Subjt:  LERLLASFQENQYPERATKESLAEELRLSPKQVSKWFENTRWSTRHPSSEANKAKSASRTGIQSSQTSGKLPKPEQESGACFRDNDSNGAQHQESPKPIS

Query:  VVFPCQSGDAGDDKLANPKTKRPESTSGKSRKRKGRPDHTASRSKDRKESQKPPAKSRKVNETQTADKARTRRRKSI
        VV PCQSG  GDDKLAN K KRPEST+ KSRKRKGR D  ASRSKDRK+S+KPPAKS KV+E QTADK + RRRKS+
Subjt:  VVFPCQSGDAGDDKLANPKTKRPESTSGKSRKRKGRPDHTASRSKDRKESQKPPAKSRKVNETQTADKARTRRRKSI

XP_022942376.1 homeobox protein HAT3.1 [Cucurbita moschata]0.0e+0065.74Show/hide
Query:  MEERNGYTESRPINNSEAVQEAKACVEVEVFTCLSNEQMPSVPEYQESGITPDYTYQTGGSDEEKLEVQQKMEEENKELGLRNALHELSEKCNQATSTHQ
        MEER+ YTESR IN S AVQEAKA VEVEV T L+NEQM S P Y E G   D+T +TG  DEEK  V                                
Subjt:  MEERNGYTESRPINNSEAVQEAKACVEVEVFTCLSNEQMPSVPEYQESGITPDYTYQTGGSDEEKLEVQQKMEEENKELGLRNALHELSEKCNQATSTHQ

Query:  IGGSDEEKPVVQQNMEEENKELDLGDALSELSEKNNQANSIHQTGGSNEEKRGVQLYMEESNKQCMEESNKQCMEESNKQNIEEANKQNMEEDNKELGLG
                                                                                              KQNMEED KELGLG
Subjt:  IGGSDEEKPVVQQNMEEENKELDLGDALSELSEKNNQANSIHQTGGSNEEKRGVQLYMEESNKQCMEESNKQCMEESNKQNIEEANKQNMEEDNKELGLG

Query:  DAVSELSEKSNHAISTHQTGGLDEEKPRVQHYMEEANKHNMELGLGDALGELSEKSNQAISNHANNDQVEAGNLLSNDKDTEHLILPIEVETTDLLNECS
        +A              H+                                 L EKS+Q IS  A+NDQ EAGNLLS+DKDTE+LILPIEVETT LLNECS
Subjt:  DAVSELSEKSNHAISTHQTGGLDEEKPRVQHYMEEANKHNMELGLGDALGELSEKSNQAISNHANNDQVEAGNLLSNDKDTEHLILPIEVETTDLLNECS

Query:  EPPMEDDNKSYIEQMNPSIEDLTPNTSIQNLEMVPSSSQQLDHEDRRILKSRKKN-LLKSPVSTDRVLRSRTQEKSKDSEPSIDLKSKDSEPSNDLNNVT
        EPP ED+NK+YIEQ NP IED   NTSI NL MVP +S ++  +D+R+LKS+KKN +L+S +S+DRVLRSRTQ+           K+K  EPSNDL+NVT
Subjt:  EPPMEDDNKSYIEQMNPSIEDLTPNTSIQNLEMVPSSSQQLDHEDRRILKSRKKN-LLKSPVSTDRVLRSRTQEKSKDSEPSIDLKSKDSEPSNDLNNVT

Query:  AGEERKRKRKKRSIKGEGARVDEFSSIRKHLRYLMNRINYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCREGRLSES
        AGEE K K+K R IKG+GARVDEFSSIR HLRYL+NRI YEQ+LIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALC EGR SE+
Subjt:  AGEERKRKRKKRSIKGEGARVDEFSSIRKHLRYLMNRINYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCREGRLSES

Query:  LFDSEGQIDSEDIFCAKCGSDELSLENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAAAA
        LFDSEGQIDSEDIFC KCGS ELSLENDIILCDGVCDRGFHQFCLEPPLL +DIPPDDEGWLCPGCDCKDDC+DLLNEFQGSNLSITDGWEKV+PEAAAA
Subjt:  LFDSEGQIDSEDIFCAKCGSDELSLENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAAAA

Query:  AAGQSSDNALGLPSDDSEDGDYDPNVPDVPDAIDQE----------DESSSEKSSSDQSGY--ASASEQLEALPNDDQYLGLPSDDSEDEDFDPSAPERV
        AAGQSSD+ + LPSDDS+DGDYD   PDVPDAIDQ+          D+SSS+ SSSD+SGY  ASASE+LEA PNDDQYLGLPSDDSED+D+DP AP R 
Subjt:  AAGQSSDNALGLPSDDSEDGDYDPNVPDVPDAIDQE----------DESSSEKSSSDQSGY--ASASEQLEALPNDDQYLGLPSDDSEDEDFDPSAPERV

Query:  KEVGQESSSSDFTSDSEDLAALGDNGSSKDDNLVSSSINSTIPVRNSNGKSSGCGPSKSALHNELSSILESGLDMDGLEPVSGRRHVERLDYKKLHDEAY
        + VGQESSSSDFTSDSEDLAAL DNGSSKDDN+ SS +N+T+PVRNS+G+SSG GP+K+A HN+LSS++ SG D  GLE VSGRRHVERLDYKKLHDE +
Subjt:  KEVGQESSSSDFTSDSEDLAALGDNGSSKDDNLVSSSINSTIPVRNSNGKSSGCGPSKSALHNELSSILESGLDMDGLEPVSGRRHVERLDYKKLHDEAY

Query:  GNVHTDSSDDTYGSDSMDSSDDRGHGRSTRKRSSKNPVPALSKNGTNNLKNKKTKRSSKITCQKPVAENTDNSVARTPDNTLKSSSSVRRTTSSSHRRLS
        GNV TDSSDDTYGSDS+DSSDDRG GRSTRK S KNPVPALS+NGT++LKN KTKRSSK T QKP AEN DNSV +TP+ TLKSSSSVRRTTSSSHRRLS
Subjt:  GNVHTDSSDDTYGSDSMDSSDDRGHGRSTRKRSSKNPVPALSKNGTNNLKNKKTKRSSKITCQKPVAENTDNSVARTPDNTLKSSSSVRRTTSSSHRRLS

Query:  QQALERLLASFQENQYPERATKESLAEELRLSPKQVSKWFENTRWSTRHPSSEANKAKSASRTGIQSSQTSGKLPKPEQESGACFRDNDSNGAQHQESPK
        Q  LERLLASFQENQYPERATKESLA EL LS KQVSKWFENTRWSTRHPSSEANKAKSASR G QSSQTS K PKPEQESGACFRD  SNGAQHQESPK
Subjt:  QQALERLLASFQENQYPERATKESLAEELRLSPKQVSKWFENTRWSTRHPSSEANKAKSASRTGIQSSQTSGKLPKPEQESGACFRDNDSNGAQHQESPK

Query:  PISVVFPCQSGDAGDDKLANPKTKRPESTSGKSRKRKGRPDHTASRSKDRKESQKPPAKSRKVNETQTADKARTRRRKSI
         ISVV PCQSG  GDDKLAN K KRPES + KSRKRKGR D  ASRSKDRK+S+KPPAKS KV+E QTADK + RRRKS+
Subjt:  PISVVFPCQSGDAGDDKLANPKTKRPESTSGKSRKRKGRPDHTASRSKDRKESQKPPAKSRKVNETQTADKARTRRRKSI

XP_022979028.1 homeobox protein HAT3.1-like [Cucurbita maxima]0.0e+0065.18Show/hide
Query:  MEERNGYTESRPINNSEAVQEAKACVEVEVFTCLSNEQMPSVPEYQESGITPDYTYQTGGSDEEKLEVQQKMEEENKELGLRNALHELSEKCNQATSTHQ
        MEER+ YTESR I+ S AVQEAKA VEVEV T L+NEQ+ S P Y E G   D+T +T                                          
Subjt:  MEERNGYTESRPINNSEAVQEAKACVEVEVFTCLSNEQMPSVPEYQESGITPDYTYQTGGSDEEKLEVQQKMEEENKELGLRNALHELSEKCNQATSTHQ

Query:  IGGSDEEKPVVQQNMEEENKELDLGDALSELSEKNNQANSIHQTGGSNEEKRGVQLYMEESNKQCMEESNKQCMEESNKQNIEEANKQNMEEDNKELGLG
         G  DEEKP V+QNMEE++KEL LG+A S                                                                       
Subjt:  IGGSDEEKPVVQQNMEEENKELDLGDALSELSEKNNQANSIHQTGGSNEEKRGVQLYMEESNKQCMEESNKQCMEESNKQNIEEANKQNMEEDNKELGLG

Query:  DAVSELSEKSNHAISTHQTGGLDEEKPRVQHYMEEANKHNMELGLGDALGELSEKSNQAISNHANNDQVEAGNLLSNDKDTEHLILPIEVETTDLLNECS
                                                          EL EKS+Q IS  A NDQ EAGNLLS+DKDTE+LILPIEVETT LLNECS
Subjt:  DAVSELSEKSNHAISTHQTGGLDEEKPRVQHYMEEANKHNMELGLGDALGELSEKSNQAISNHANNDQVEAGNLLSNDKDTEHLILPIEVETTDLLNECS

Query:  EPPMEDDNKSYIEQMNPSIEDLTPNTSIQNLEMVPSSSQQLDHEDRRILKSRKKN-LLKSPVSTDRVLRSRTQEKSKDSEPSIDLKSKDSEPSNDLNNVT
        EPP EDDNK+YIEQ NP IE    NTSI+NL MVP +S +L  +D+R+LKS+KKN +L+S VS+DRVLRSRTQ+           K+K  EPSNDL+NVT
Subjt:  EPPMEDDNKSYIEQMNPSIEDLTPNTSIQNLEMVPSSSQQLDHEDRRILKSRKKN-LLKSPVSTDRVLRSRTQEKSKDSEPSIDLKSKDSEPSNDLNNVT

Query:  AGEERKRKRKK--RSIKGEGARVDEFSSIRKHLRYLMNRINYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCREGRLS
        AGEE K K++K  R IKG+GARVDEFSSIR HLRYL+NRI YEQ+LIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALC EGR S
Subjt:  AGEERKRKRKK--RSIKGEGARVDEFSSIRKHLRYLMNRINYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCREGRLS

Query:  ESLFDSEGQIDSEDIFCAKCGSDELSLENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAA
        E+LFDSEGQIDSEDIFC KCGS ELSLENDIILCDGVCDRGFHQFCLEPPLL +DIPPDDEGWLCPGCDCKDDC+DLLNEFQGSNLSITDGWEKV+PEAA
Subjt:  ESLFDSEGQIDSEDIFCAKCGSDELSLENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAA

Query:  AAAAGQSSDNALGLPSDDSEDGDYDPNVPDVPDAIDQE-----DESSSEKSSSDQSGY--ASASEQLEALPNDDQYLGLPSDDSEDEDFDPSAPERVKEV
        AAAAG+SSD+ + LPSDDS+DGDYD   PDVPDAIDQ+     D SSS++SSSD+SGY  ASASE+LEA PNDDQYLGLPSDDSED+D+DP AP R + V
Subjt:  AAAAGQSSDNALGLPSDDSEDGDYDPNVPDVPDAIDQE-----DESSSEKSSSDQSGY--ASASEQLEALPNDDQYLGLPSDDSEDEDFDPSAPERVKEV

Query:  GQESSSSDFTSDSEDLAALGDNGSSKDDNLVSSSINSTIPVRNSNGKSSGCGPSKSALHNELSSILESGLDMDGLEPVSGRRHVERLDYKKLHDEAYGNV
        GQESSSSDFTSDSEDLAAL DNGSSKDDN+ SS +N+T PVRNSNG+SSG GP+K+A HN+LSS++ SG D  GLE VSGRRHVERLDYKKLHDE +GNV
Subjt:  GQESSSSDFTSDSEDLAALGDNGSSKDDNLVSSSINSTIPVRNSNGKSSGCGPSKSALHNELSSILESGLDMDGLEPVSGRRHVERLDYKKLHDEAYGNV

Query:  HTDSSDDTYGSDSMDSSDDRGHGRSTRKRSSKNPVPALSKNGTNNLKNKKTKRSSKITCQKPVAENTDNSVARTPDNTLKSSSSVRRTTSSSHRRLSQQA
         ++SSDDTYGSDS+DSSDDRG GRSTRK S KN VPALS+NGT++ KN KTK SS+ T QKP AEN DNSV +TP+ TLKSSSSVRRTTSSSHRRLSQ  
Subjt:  HTDSSDDTYGSDSMDSSDDRGHGRSTRKRSSKNPVPALSKNGTNNLKNKKTKRSSKITCQKPVAENTDNSVARTPDNTLKSSSSVRRTTSSSHRRLSQQA

Query:  LERLLASFQENQYPERATKESLAEELRLSPKQVSKWFENTRWSTRHPSSEANKAKSASRTGIQSSQTSGKLPKPEQESGACFRDNDSNGAQHQESPKPIS
        LERLLASFQENQYPERATKESLA EL LS KQVSKWFENTRWSTRHPSSEANKAKSASR G QSSQTS K PKPEQESGACFRD  SNGAQHQESPK I+
Subjt:  LERLLASFQENQYPERATKESLAEELRLSPKQVSKWFENTRWSTRHPSSEANKAKSASRTGIQSSQTSGKLPKPEQESGACFRDNDSNGAQHQESPKPIS

Query:  VVFPCQSGDAGDDKLANPKTKRPESTSGKSRKRKGRPDHTASRSKDRKESQKPPAKSRKVNETQTADKARTRRRKSI
        VV PCQSG  GDDKLA  KTKRPEST+ KSRKRKGR D  ASRSK+RK+S+KPPAKS KV+E QTADK + RRRKS+
Subjt:  VVFPCQSGDAGDDKLANPKTKRPESTSGKSRKRKGRPDHTASRSKDRKESQKPPAKSRKVNETQTADKARTRRRKSI

XP_023531864.1 homeobox protein HAT3.1 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0065.62Show/hide
Query:  MEERNGYTESRPINNSEAVQEAKACVEVEVFTCLSNEQMPSVPEYQESGITPDYTYQTGGSDEEKLEVQQKMEEENKELGLRNALHELSEKCNQATSTHQ
        MEER+ YTESR IN S AVQEAKA VEVEV T L+NEQM S P Y E G   D+T +TG  DEEK  V                                
Subjt:  MEERNGYTESRPINNSEAVQEAKACVEVEVFTCLSNEQMPSVPEYQESGITPDYTYQTGGSDEEKLEVQQKMEEENKELGLRNALHELSEKCNQATSTHQ

Query:  IGGSDEEKPVVQQNMEEENKELDLGDALSELSEKNNQANSIHQTGGSNEEKRGVQLYMEESNKQCMEESNKQCMEESNKQNIEEANKQNMEEDNKELGLG
                                                                                              KQNMEED +ELGLG
Subjt:  IGGSDEEKPVVQQNMEEENKELDLGDALSELSEKNNQANSIHQTGGSNEEKRGVQLYMEESNKQCMEESNKQCMEESNKQNIEEANKQNMEEDNKELGLG

Query:  DAVSELSEKSNHAISTHQTGGLDEEKPRVQHYMEEANKHNMELGLGDALGELSEKSNQAISNHANNDQVEAGNLLSNDKDTEHLILPIEVETTDLLNECS
        +A S                                               L EKS+Q IS  A+NDQ EAGNLLS+DKDTE+LILPIEVETT LLNEC 
Subjt:  DAVSELSEKSNHAISTHQTGGLDEEKPRVQHYMEEANKHNMELGLGDALGELSEKSNQAISNHANNDQVEAGNLLSNDKDTEHLILPIEVETTDLLNECS

Query:  EPPMEDDNKSYIEQMNPSIEDLTPNTSIQNLEMVPSSSQQLDHEDRRILKSRKKN-LLKSPVSTDRVLRSRTQEKSKDSEPSIDLKSKDSEPSNDLNNVT
        EPP EDDNK+YIEQ NP IED   NT I+NL MVP +S +L  +D+R+LKS+KKN +L+S VS+DRVLRSRTQ+           K+K  EPSNDL+NVT
Subjt:  EPPMEDDNKSYIEQMNPSIEDLTPNTSIQNLEMVPSSSQQLDHEDRRILKSRKKN-LLKSPVSTDRVLRSRTQEKSKDSEPSIDLKSKDSEPSNDLNNVT

Query:  AGEERKRKRKK--RSIKGEGARVDEFSSIRKHLRYLMNRINYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCREGRLS
        AGEE K K+KK  R IKG+GARVDEFSSIR HLRYL+NRI YEQ+LIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALC EGR S
Subjt:  AGEERKRKRKK--RSIKGEGARVDEFSSIRKHLRYLMNRINYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCREGRLS

Query:  ESLFDSEGQIDSEDIFCAKCGSDELSLENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAA
        E+LFDSEGQIDSEDIFC KCGS ELSLENDIILCDGVCDRGFHQFCLEPPLL +DIPPDDEGWLCPGCDCKDDC+DLLNEFQGSNLSITDGWEKV+PEAA
Subjt:  ESLFDSEGQIDSEDIFCAKCGSDELSLENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAA

Query:  AAAAGQSSDNALGLPSDDSEDGDYDPNVPDVPDAIDQEDESSSEKSSSDQ----------SGY--ASASEQLEALPNDDQYLGLPSDDSEDEDFDPSAPE
        AAAAG+SSD+ + LPSDDS+DGDYD   PDVPDAIDQ+ ESSS+ SSSDQ          SGY  ASASE+LEA PNDDQYLGLPSDDSED+D+DP AP 
Subjt:  AAAAGQSSDNALGLPSDDSEDGDYDPNVPDVPDAIDQEDESSSEKSSSDQ----------SGY--ASASEQLEALPNDDQYLGLPSDDSEDEDFDPSAPE

Query:  RVKEVGQESSSSDFTSDSEDLAALGDNGSSKDDNLVSSSINSTIPVRNSNGKSSGCGPSKSALHNELSSILESGLDMDGLEPVSGRRHVERLDYKKLHDE
        R + VGQESSSSDFTSDSEDLAAL +NGSSKDDN+ SS +N+T+PVRNSNG+SSG GP+KSA HN+LSS++ SG D  GLE VSGRRHVERLDYKKLHDE
Subjt:  RVKEVGQESSSSDFTSDSEDLAALGDNGSSKDDNLVSSSINSTIPVRNSNGKSSGCGPSKSALHNELSSILESGLDMDGLEPVSGRRHVERLDYKKLHDE

Query:  AYGNVHTDSSDDTYGSDSMDSSDDRGHGRSTRKRSSKNPVPALSKNGTNNLKNKKTKRSSKITCQKPVAENTDNSVARTPDNTLKSSSSVRRTTSSSHRR
         +GNV TDSSDDTYGSDS+DSSDDRG GRSTRK S KNPVPALS+NGT++LKN KTK SSK T QKP AEN DNSV +TP+ TLKSSSSVRRTTSSSHRR
Subjt:  AYGNVHTDSSDDTYGSDSMDSSDDRGHGRSTRKRSSKNPVPALSKNGTNNLKNKKTKRSSKITCQKPVAENTDNSVARTPDNTLKSSSSVRRTTSSSHRR

Query:  LSQQALERLLASFQENQYPERATKESLAEELRLSPKQVSKWFENTRWSTRHPSSEANKAKSASRTGIQSSQTSGKLPKPEQESGACFRDNDSNGAQHQES
        LSQ  LERLLASFQENQYPERATKESLA EL LS KQVSKWFENTRWSTRHPSSEANKA+SASR G QSSQTS K PKPEQESGACFRD  SNGAQHQES
Subjt:  LSQQALERLLASFQENQYPERATKESLAEELRLSPKQVSKWFENTRWSTRHPSSEANKAKSASRTGIQSSQTSGKLPKPEQESGACFRDNDSNGAQHQES

Query:  PKPISVVFPCQSGDAGDDKLANPKTKRPESTSGKSRKRKGRPDHTASRSKDRKESQKPPAKSRKVNETQTADKARTRRRKSI
        PK ISVV PCQSG  GDDK AN +TKRPEST+ KSRKRKGR D  ASRSKDRK+S+KPPAKS KV+E QTADK + RRRKS+
Subjt:  PKPISVVFPCQSGDAGDDKLANPKTKRPESTSGKSRKRKGRPDHTASRSKDRKESQKPPAKSRKVNETQTADKARTRRRKSI

TrEMBL top hitse value%identityAlignment
A0A1S3C283 pathogenesis-related homeodomain protein0.0e+0061.68Show/hide
Query:  MEER--NGYTESRPINNSEAVQEAKACVEVEVFTCLSNEQMPSVPEYQESGITPDYTYQTGGSDEEKLEVQQKMEEENKELGLRNALHELSEKCNQATST
        MEER  N  TESRP NN+EAVQEAKA  ++               E ++     +          +K +    +EE +++    +  ++ +E   +A ++
Subjt:  MEER--NGYTESRPINNSEAVQEAKACVEVEVFTCLSNEQMPSVPEYQESGITPDYTYQTGGSDEEKLEVQQKMEEENKELGLRNALHELSEKCNQATST

Query:  HQIGGSDEEKPV--VQQNMEEENKELDLGDALSELSEK-------NNQANSIHQTGGS-NEEKRGVQLYMEESNKQCMEESNKQCMEESNKQNIEEANKQ
          +   DE      +  N  E  +E +  D + E  E        NN A ++ +   S N ++R    Y E         +N +  +E    +  E   +
Subjt:  HQIGGSDEEKPV--VQQNMEEENKELDLGDALSELSEK-------NNQANSIHQTGGS-NEEKRGVQLYMEESNKQCMEESNKQCMEESNKQNIEEANKQ

Query:  NMEEDNKELGLGDAVSELSEKSNHAIST---------------------HQTGGLDEEKPRVQHYMEEANKHNMELGLGDALGELSEKSNQAISNHANND
        N + +++   + +AV E        + T                      +T G DEEK  VQ         NMELG G  L ELSEK NQ ISNHA+ND
Subjt:  NMEEDNKELGLGDAVSELSEKSNHAIST---------------------HQTGGLDEEKPRVQHYMEEANKHNMELGLGDALGELSEKSNQAISNHANND

Query:  QVEAGNLLSNDKDTEHLILPIEVETTDLLNECSEPPMEDDNKSYIEQMNPSIEDLTPNTSIQNLEMVPSSSQQLDHEDRRILKSRKKNL-LKSPVSTDRV
        QVEAGN LS DKDT++L L IE ETT LLNECSE P+ED  K+YIE+MNP IEDLT  TSIQ+LE +PS+SQQLDH+D R  KS+KKN  L+S VS+DRV
Subjt:  QVEAGNLLSNDKDTEHLILPIEVETTDLLNECSEPPMEDDNKSYIEQMNPSIEDLTPNTSIQNLEMVPSSSQQLDHEDRRILKSRKKNL-LKSPVSTDRV

Query:  LRSRTQEKSKDSEPSIDLKSKDSEPSNDLNNVTAGEERKR-KRKKRSIKGEGARVDEFSSIRKHLRYLMNRINYEQNLIEAYSSEGWKGFSSDKLKPEKE
        LRSRTQE           K+K  EPSNDLNN TA EE KR K+KKR+I+G+GARVDE+SSIR HLRYL+NRI YEQ+LIEAYSSEGWKGFSSDKLKPEKE
Subjt:  LRSRTQEKSKDSEPSIDLKSKDSEPSNDLNNVTAGEERKR-KRKKRSIKGEGARVDEFSSIRKHLRYLMNRINYEQNLIEAYSSEGWKGFSSDKLKPEKE

Query:  LQRASNEIMRRKLKIRDLFQRIDALCREGRLSESLFDSEGQIDSEDIFCAKCGSDELSLENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWLCPGC
        LQRASNEIMRRKLKIRDLFQRID LC EGRLSESLFDSEGQIDSEDIFCAKCGS ELSLENDIILCDG+CDRGFHQFCLEPPLL TDIPPDDEGWLCPGC
Subjt:  LQRASNEIMRRKLKIRDLFQRIDALCREGRLSESLFDSEGQIDSEDIFCAKCGSDELSLENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWLCPGC

Query:  DCKDDCMDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQSSDNALGLPSDDSEDGDYDPNVPDVPDAIDQE-----DESSSEKSSSDQSGYASASEQLEAL
        DCKDDC+DLLNEFQGSNLSITDGWEKVYPE AAAAAG++SD+ LGLPSDDSEDGDYD   PD+PD IDQ+     DESSS++S+SD SGYASASE LE  
Subjt:  DCKDDCMDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQSSDNALGLPSDDSEDGDYDPNVPDVPDAIDQE-----DESSSEKSSSDQSGYASASEQLEAL

Query:  PNDDQYLGLPSDDSEDEDFDPSAPERVKEVGQESSSSDFTSDSEDLAALGDNGSSKDDNLVSSSINSTIPVRNSNGKSSGCGPSKSALHNELSSILESGL
        PNDDQYLGLPSDDSED D+DPS PE  +   QESSSSDFTSDSEDLAAL +N SSKDD+LV SS+N+T+PV+N+NG+SS  GPSKS LHNELSS+L+SGL
Subjt:  PNDDQYLGLPSDDSEDEDFDPSAPERVKEVGQESSSSDFTSDSEDLAALGDNGSSKDDNLVSSSINSTIPVRNSNGKSSGCGPSKSALHNELSSILESGL

Query:  DMDGLEPVSGRRHVERLDYKKLHDEAYGNVHTDSSDDTYGSDSMDSSDDRGHGRSTRKRSSKNPVPALSKNGTN-NLKNKKTKRSSK-ITCQKPVAENTD
        D DGLEP+SGRR VERLDYKKLHDE YGNV T+SSDDTYGS ++DSSDDRG    TRKR  K  V ALS NG+N +L N KTKRS K  T QKP A N +
Subjt:  DMDGLEPVSGRRHVERLDYKKLHDEAYGNVHTDSSDDTYGSDSMDSSDDRGHGRSTRKRSSKNPVPALSKNGTN-NLKNKKTKRSSK-ITCQKPVAENTD

Query:  NSVARTPDNTLKSSSSVRRTTSSSHRRLSQQALERLLASFQENQYPERATKESLAEELRLSPKQVSKWFENTRWSTRHPSSEANKAKSASRTGIQSSQTS
        NSV  TP +T KSSSSVR+ TSSS+RRLSQ ALERL ASFQEN+YP+RATKESLA+EL L+ KQVSKWFENTRWSTRHPSS   KAKS+SR  I  SQ S
Subjt:  NSVARTPDNTLKSSSSVRRTTSSSHRRLSQQALERLLASFQENQYPERATKESLAEELRLSPKQVSKWFENTRWSTRHPSSEANKAKSASRTGIQSSQTS

Query:  GKLPKPEQESGACFRDNDSNGAQHQESPKPISVVFPCQSGDAGDDKLANPKTKRPESTSGKSRKRKGRPDHTASRSKDRKESQKPPAKSRKVNETQTADK
        G+L K EQES  CFRD DSNGA+HQ+ P   SVV  CQSGD GD KL   KTKR ES++ KSRKRKGR D+TAS SKDR+ S +PPAKS KVNETQTAD+
Subjt:  GKLPKPEQESGACFRDNDSNGAQHQESPKPISVVFPCQSGDAGDDKLANPKTKRPESTSGKSRKRKGRPDHTASRSKDRKESQKPPAKSRKVNETQTADK

Query:  ARTRRRKSI
         +TRRR+SI
Subjt:  ARTRRRKSI

A0A6J1D6Q5 homeobox protein HAT3.1 isoform X10.0e+0063.91Show/hide
Query:  MEERNGYTESRPINNSEAVQEAKACVEVEVFTCLSNEQMPSVPEYQESGITPDYTYQTGGSDEEKLEVQQKMEEENKELGLRNALHELSEKCNQATSTHQ
        MEER+ YTE RP NN EAVQEAKA   VEV TC SNEQM S+P+ QE G TP+ T +T G D+EK  VQQ MEEE K                       
Subjt:  MEERNGYTESRPINNSEAVQEAKACVEVEVFTCLSNEQMPSVPEYQESGITPDYTYQTGGSDEEKLEVQQKMEEENKELGLRNALHELSEKCNQATSTHQ

Query:  IGGSDEEKPVVQQNMEEENKELDLGDALSELSEKNNQANSIHQTGGSNEEKRGVQLYMEESNKQCMEESNKQCMEESNKQNIEEANKQNMEEDNKELGLG
                                                                                                            
Subjt:  IGGSDEEKPVVQQNMEEENKELDLGDALSELSEKNNQANSIHQTGGSNEEKRGVQLYMEESNKQCMEESNKQCMEESNKQNIEEANKQNMEEDNKELGLG

Query:  DAVSELSEKSNHAISTHQTGGLDEEKPRVQHYMEEANKHNMELGLGDALGELSEKSNQAISNHANNDQVEAGNLLSNDKDTEHLILPIEVETTDLLNECS
                                                 ELG GD L EL EK+NQ IS  A  DQVEAGNLLS+D +TE+LILPIE+ETT  LNECS
Subjt:  DAVSELSEKSNHAISTHQTGGLDEEKPRVQHYMEEANKHNMELGLGDALGELSEKSNQAISNHANNDQVEAGNLLSNDKDTEHLILPIEVETTDLLNECS

Query:  EPPMEDDNKSYIEQMNPSIEDLTPNTSIQNLEMVP----SSSQQLDHEDRRILKSRKKN-LLKSPVSTDRVLRSRTQEKSKDSEPSIDLKSKDSEPSNDL
        E P ED NK+ I+Q+NP IEDLT NTSIQ LE VP    S SQQL H+D++ILKS+KKN +L+S VS+DRVLRSRTQE           K+K  EPSN+L
Subjt:  EPPMEDDNKSYIEQMNPSIEDLTPNTSIQNLEMVP----SSSQQLDHEDRRILKSRKKN-LLKSPVSTDRVLRSRTQEKSKDSEPSIDLKSKDSEPSNDL

Query:  NNVTAGEERKRKRKKRSIKGEGARVDEFSSIRKHLRYLMNRINYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCREGR
        N +TAG E KRK+KKR+IKG+GA  DEFSSIR  LRYL+NRI YEQ+LI+AYSSEGWKGFSSDKLKPEKELQRAS+EIMR KLKIRDLFQ +D+LC EGR
Subjt:  NNVTAGEERKRKRKKRSIKGEGARVDEFSSIRKHLRYLMNRINYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCREGR

Query:  LSESLFDSEGQIDSEDIFCAKCGSDELSLENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPE
        LSESLFDSEGQIDSEDIFCAKCGS ELSLENDIILCDG+CDRGFHQFCLEPPLL TDIPPDDEGWLCPGCDCKDDC+DLLNEFQGSNLSITDGWEKVYPE
Subjt:  LSESLFDSEGQIDSEDIFCAKCGSDELSLENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPE

Query:  AAAAAAGQSSDNALGLPSDDSEDGDYDPNVPDVPDAIDQEDESSSEKSSSDQSGYASASEQLEALPNDDQYLGLPSDDSEDEDFDPSAPERVKEVGQESS
        AAAAAAGQ+SD+ALGLPSDDSEDGDYD   PD PD I+QEDESSS++SSSD+SGYASASE+LEA PNDDQYLGLPSDDSED+D++P APE  + V QESS
Subjt:  AAAAAAGQSSDNALGLPSDDSEDGDYDPNVPDVPDAIDQEDESSSEKSSSDQSGYASASEQLEALPNDDQYLGLPSDDSEDEDFDPSAPERVKEVGQESS

Query:  SSDFTSDSEDLAALGDNGSSKDDNLVSSSINSTIPVRNSNGKSSGCGPSKSALHNELSSILESGLDMDGLEPVSGRRHVERLDYKKLHDEAYGNVHTDSS
         SDFTSDSEDLAAL D                T PVRNSNG+ SGCGP  S LHNEL S+LESG D DGLEPVSGRR VERLDYKKLHDE YGNV +DSS
Subjt:  SSDFTSDSEDLAALGDNGSSKDDNLVSSSINSTIPVRNSNGKSSGCGPSKSALHNELSSILESGLDMDGLEPVSGRRHVERLDYKKLHDEAYGNVHTDSS

Query:  DDTYGSDSMDSSDDRGHGRSTRKRSSKNPVPALSKNGTN-NLKNKKTKRSSK-ITCQKPVAENTDNSVARTPDNTLKSSSSVRRTTSSSHRRLSQQALER
        DDT+GS S+DSSDDRG G  TRKRS KN VPAL  NGTN +LKNKKTKRS K  T QKP AEN  NSV RTP++++KSSSSVRRT SSS+RRLSQ ALER
Subjt:  DDTYGSDSMDSSDDRGHGRSTRKRSSKNPVPALSKNGTN-NLKNKKTKRSSK-ITCQKPVAENTDNSVARTPDNTLKSSSSVRRTTSSSHRRLSQQALER

Query:  LLASFQENQYPERATKESLAEELRLSPKQVSKWFENTRWSTRHPSS-EANKAKSASRTGIQSSQTSGKLPKPEQESGACFRDNDSNGAQHQESPKPISVV
        LLASFQENQYP+RATKESLA+EL LS KQVSKWFENTRWSTRHPSS E+NKAKSA R GIQSS+TSGKLPKPEQESGACFRD D+NGAQHQ SP     V
Subjt:  LLASFQENQYPERATKESLAEELRLSPKQVSKWFENTRWSTRHPSS-EANKAKSASRTGIQSSQTSGKLPKPEQESGACFRDNDSNGAQHQESPKPISVV

Query:  FPCQSGDAGDDKLANPKTKRPESTSGKSRKRKGRPDHTASRSKDRKESQKPPAKSRKVNETQTADKARTRRRKSI
         PCQSGD  DDKLA  KT RPEST+ KSRKRKGR DH AS SKDRKESQKPPAKS KVN+ QTADK RTRRR+SI
Subjt:  FPCQSGDAGDDKLANPKTKRPESTSGKSRKRKGRPDHTASRSKDRKESQKPPAKSRKVNETQTADKARTRRRKSI

A0A6J1FNP3 homeobox protein HAT3.10.0e+0065.74Show/hide
Query:  MEERNGYTESRPINNSEAVQEAKACVEVEVFTCLSNEQMPSVPEYQESGITPDYTYQTGGSDEEKLEVQQKMEEENKELGLRNALHELSEKCNQATSTHQ
        MEER+ YTESR IN S AVQEAKA VEVEV T L+NEQM S P Y E G   D+T +TG  DEEK  V                                
Subjt:  MEERNGYTESRPINNSEAVQEAKACVEVEVFTCLSNEQMPSVPEYQESGITPDYTYQTGGSDEEKLEVQQKMEEENKELGLRNALHELSEKCNQATSTHQ

Query:  IGGSDEEKPVVQQNMEEENKELDLGDALSELSEKNNQANSIHQTGGSNEEKRGVQLYMEESNKQCMEESNKQCMEESNKQNIEEANKQNMEEDNKELGLG
                                                                                              KQNMEED KELGLG
Subjt:  IGGSDEEKPVVQQNMEEENKELDLGDALSELSEKNNQANSIHQTGGSNEEKRGVQLYMEESNKQCMEESNKQCMEESNKQNIEEANKQNMEEDNKELGLG

Query:  DAVSELSEKSNHAISTHQTGGLDEEKPRVQHYMEEANKHNMELGLGDALGELSEKSNQAISNHANNDQVEAGNLLSNDKDTEHLILPIEVETTDLLNECS
        +A              H+                                 L EKS+Q IS  A+NDQ EAGNLLS+DKDTE+LILPIEVETT LLNECS
Subjt:  DAVSELSEKSNHAISTHQTGGLDEEKPRVQHYMEEANKHNMELGLGDALGELSEKSNQAISNHANNDQVEAGNLLSNDKDTEHLILPIEVETTDLLNECS

Query:  EPPMEDDNKSYIEQMNPSIEDLTPNTSIQNLEMVPSSSQQLDHEDRRILKSRKKN-LLKSPVSTDRVLRSRTQEKSKDSEPSIDLKSKDSEPSNDLNNVT
        EPP ED+NK+YIEQ NP IED   NTSI NL MVP +S ++  +D+R+LKS+KKN +L+S +S+DRVLRSRTQ+           K+K  EPSNDL+NVT
Subjt:  EPPMEDDNKSYIEQMNPSIEDLTPNTSIQNLEMVPSSSQQLDHEDRRILKSRKKN-LLKSPVSTDRVLRSRTQEKSKDSEPSIDLKSKDSEPSNDLNNVT

Query:  AGEERKRKRKKRSIKGEGARVDEFSSIRKHLRYLMNRINYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCREGRLSES
        AGEE K K+K R IKG+GARVDEFSSIR HLRYL+NRI YEQ+LIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALC EGR SE+
Subjt:  AGEERKRKRKKRSIKGEGARVDEFSSIRKHLRYLMNRINYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCREGRLSES

Query:  LFDSEGQIDSEDIFCAKCGSDELSLENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAAAA
        LFDSEGQIDSEDIFC KCGS ELSLENDIILCDGVCDRGFHQFCLEPPLL +DIPPDDEGWLCPGCDCKDDC+DLLNEFQGSNLSITDGWEKV+PEAAAA
Subjt:  LFDSEGQIDSEDIFCAKCGSDELSLENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAAAA

Query:  AAGQSSDNALGLPSDDSEDGDYDPNVPDVPDAIDQE----------DESSSEKSSSDQSGY--ASASEQLEALPNDDQYLGLPSDDSEDEDFDPSAPERV
        AAGQSSD+ + LPSDDS+DGDYD   PDVPDAIDQ+          D+SSS+ SSSD+SGY  ASASE+LEA PNDDQYLGLPSDDSED+D+DP AP R 
Subjt:  AAGQSSDNALGLPSDDSEDGDYDPNVPDVPDAIDQE----------DESSSEKSSSDQSGY--ASASEQLEALPNDDQYLGLPSDDSEDEDFDPSAPERV

Query:  KEVGQESSSSDFTSDSEDLAALGDNGSSKDDNLVSSSINSTIPVRNSNGKSSGCGPSKSALHNELSSILESGLDMDGLEPVSGRRHVERLDYKKLHDEAY
        + VGQESSSSDFTSDSEDLAAL DNGSSKDDN+ SS +N+T+PVRNS+G+SSG GP+K+A HN+LSS++ SG D  GLE VSGRRHVERLDYKKLHDE +
Subjt:  KEVGQESSSSDFTSDSEDLAALGDNGSSKDDNLVSSSINSTIPVRNSNGKSSGCGPSKSALHNELSSILESGLDMDGLEPVSGRRHVERLDYKKLHDEAY

Query:  GNVHTDSSDDTYGSDSMDSSDDRGHGRSTRKRSSKNPVPALSKNGTNNLKNKKTKRSSKITCQKPVAENTDNSVARTPDNTLKSSSSVRRTTSSSHRRLS
        GNV TDSSDDTYGSDS+DSSDDRG GRSTRK S KNPVPALS+NGT++LKN KTKRSSK T QKP AEN DNSV +TP+ TLKSSSSVRRTTSSSHRRLS
Subjt:  GNVHTDSSDDTYGSDSMDSSDDRGHGRSTRKRSSKNPVPALSKNGTNNLKNKKTKRSSKITCQKPVAENTDNSVARTPDNTLKSSSSVRRTTSSSHRRLS

Query:  QQALERLLASFQENQYPERATKESLAEELRLSPKQVSKWFENTRWSTRHPSSEANKAKSASRTGIQSSQTSGKLPKPEQESGACFRDNDSNGAQHQESPK
        Q  LERLLASFQENQYPERATKESLA EL LS KQVSKWFENTRWSTRHPSSEANKAKSASR G QSSQTS K PKPEQESGACFRD  SNGAQHQESPK
Subjt:  QQALERLLASFQENQYPERATKESLAEELRLSPKQVSKWFENTRWSTRHPSSEANKAKSASRTGIQSSQTSGKLPKPEQESGACFRDNDSNGAQHQESPK

Query:  PISVVFPCQSGDAGDDKLANPKTKRPESTSGKSRKRKGRPDHTASRSKDRKESQKPPAKSRKVNETQTADKARTRRRKSI
         ISVV PCQSG  GDDKLAN K KRPES + KSRKRKGR D  ASRSKDRK+S+KPPAKS KV+E QTADK + RRRKS+
Subjt:  PISVVFPCQSGDAGDDKLANPKTKRPESTSGKSRKRKGRPDHTASRSKDRKESQKPPAKSRKVNETQTADKARTRRRKSI

A0A6J1IPM8 homeobox protein HAT3.1-like0.0e+0065.18Show/hide
Query:  MEERNGYTESRPINNSEAVQEAKACVEVEVFTCLSNEQMPSVPEYQESGITPDYTYQTGGSDEEKLEVQQKMEEENKELGLRNALHELSEKCNQATSTHQ
        MEER+ YTESR I+ S AVQEAKA VEVEV T L+NEQ+ S P Y E G   D+T +T                                          
Subjt:  MEERNGYTESRPINNSEAVQEAKACVEVEVFTCLSNEQMPSVPEYQESGITPDYTYQTGGSDEEKLEVQQKMEEENKELGLRNALHELSEKCNQATSTHQ

Query:  IGGSDEEKPVVQQNMEEENKELDLGDALSELSEKNNQANSIHQTGGSNEEKRGVQLYMEESNKQCMEESNKQCMEESNKQNIEEANKQNMEEDNKELGLG
         G  DEEKP V+QNMEE++KEL LG+A S                                                                       
Subjt:  IGGSDEEKPVVQQNMEEENKELDLGDALSELSEKNNQANSIHQTGGSNEEKRGVQLYMEESNKQCMEESNKQCMEESNKQNIEEANKQNMEEDNKELGLG

Query:  DAVSELSEKSNHAISTHQTGGLDEEKPRVQHYMEEANKHNMELGLGDALGELSEKSNQAISNHANNDQVEAGNLLSNDKDTEHLILPIEVETTDLLNECS
                                                          EL EKS+Q IS  A NDQ EAGNLLS+DKDTE+LILPIEVETT LLNECS
Subjt:  DAVSELSEKSNHAISTHQTGGLDEEKPRVQHYMEEANKHNMELGLGDALGELSEKSNQAISNHANNDQVEAGNLLSNDKDTEHLILPIEVETTDLLNECS

Query:  EPPMEDDNKSYIEQMNPSIEDLTPNTSIQNLEMVPSSSQQLDHEDRRILKSRKKN-LLKSPVSTDRVLRSRTQEKSKDSEPSIDLKSKDSEPSNDLNNVT
        EPP EDDNK+YIEQ NP IE    NTSI+NL MVP +S +L  +D+R+LKS+KKN +L+S VS+DRVLRSRTQ+           K+K  EPSNDL+NVT
Subjt:  EPPMEDDNKSYIEQMNPSIEDLTPNTSIQNLEMVPSSSQQLDHEDRRILKSRKKN-LLKSPVSTDRVLRSRTQEKSKDSEPSIDLKSKDSEPSNDLNNVT

Query:  AGEERKRKRKK--RSIKGEGARVDEFSSIRKHLRYLMNRINYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCREGRLS
        AGEE K K++K  R IKG+GARVDEFSSIR HLRYL+NRI YEQ+LIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALC EGR S
Subjt:  AGEERKRKRKK--RSIKGEGARVDEFSSIRKHLRYLMNRINYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCREGRLS

Query:  ESLFDSEGQIDSEDIFCAKCGSDELSLENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAA
        E+LFDSEGQIDSEDIFC KCGS ELSLENDIILCDGVCDRGFHQFCLEPPLL +DIPPDDEGWLCPGCDCKDDC+DLLNEFQGSNLSITDGWEKV+PEAA
Subjt:  ESLFDSEGQIDSEDIFCAKCGSDELSLENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAA

Query:  AAAAGQSSDNALGLPSDDSEDGDYDPNVPDVPDAIDQE-----DESSSEKSSSDQSGY--ASASEQLEALPNDDQYLGLPSDDSEDEDFDPSAPERVKEV
        AAAAG+SSD+ + LPSDDS+DGDYD   PDVPDAIDQ+     D SSS++SSSD+SGY  ASASE+LEA PNDDQYLGLPSDDSED+D+DP AP R + V
Subjt:  AAAAGQSSDNALGLPSDDSEDGDYDPNVPDVPDAIDQE-----DESSSEKSSSDQSGY--ASASEQLEALPNDDQYLGLPSDDSEDEDFDPSAPERVKEV

Query:  GQESSSSDFTSDSEDLAALGDNGSSKDDNLVSSSINSTIPVRNSNGKSSGCGPSKSALHNELSSILESGLDMDGLEPVSGRRHVERLDYKKLHDEAYGNV
        GQESSSSDFTSDSEDLAAL DNGSSKDDN+ SS +N+T PVRNSNG+SSG GP+K+A HN+LSS++ SG D  GLE VSGRRHVERLDYKKLHDE +GNV
Subjt:  GQESSSSDFTSDSEDLAALGDNGSSKDDNLVSSSINSTIPVRNSNGKSSGCGPSKSALHNELSSILESGLDMDGLEPVSGRRHVERLDYKKLHDEAYGNV

Query:  HTDSSDDTYGSDSMDSSDDRGHGRSTRKRSSKNPVPALSKNGTNNLKNKKTKRSSKITCQKPVAENTDNSVARTPDNTLKSSSSVRRTTSSSHRRLSQQA
         ++SSDDTYGSDS+DSSDDRG GRSTRK S KN VPALS+NGT++ KN KTK SS+ T QKP AEN DNSV +TP+ TLKSSSSVRRTTSSSHRRLSQ  
Subjt:  HTDSSDDTYGSDSMDSSDDRGHGRSTRKRSSKNPVPALSKNGTNNLKNKKTKRSSKITCQKPVAENTDNSVARTPDNTLKSSSSVRRTTSSSHRRLSQQA

Query:  LERLLASFQENQYPERATKESLAEELRLSPKQVSKWFENTRWSTRHPSSEANKAKSASRTGIQSSQTSGKLPKPEQESGACFRDNDSNGAQHQESPKPIS
        LERLLASFQENQYPERATKESLA EL LS KQVSKWFENTRWSTRHPSSEANKAKSASR G QSSQTS K PKPEQESGACFRD  SNGAQHQESPK I+
Subjt:  LERLLASFQENQYPERATKESLAEELRLSPKQVSKWFENTRWSTRHPSSEANKAKSASRTGIQSSQTSGKLPKPEQESGACFRDNDSNGAQHQESPKPIS

Query:  VVFPCQSGDAGDDKLANPKTKRPESTSGKSRKRKGRPDHTASRSKDRKESQKPPAKSRKVNETQTADKARTRRRKSI
        VV PCQSG  GDDKLA  KTKRPEST+ KSRKRKGR D  ASRSK+RK+S+KPPAKS KV+E QTADK + RRRKS+
Subjt:  VVFPCQSGDAGDDKLANPKTKRPESTSGKSRKRKGRPDHTASRSKDRKESQKPPAKSRKVNETQTADKARTRRRKSI

A0A6J1J9X9 homeobox protein HAT3.1-like1.8e-30569.42Show/hide
Query:  ESNKQCMEESNKQNIEEANKQNMEEDNKELGLGDAVSELSEKSNHAIS-----THQTGGLDEEKPRVQHYMEEANKHNMELGLGDALGELSEKSNQAISN
        E   +C E  +  N E        ++ K     +  + LS +  H +      T++TGG DEEKP VQ  MEE NK   ELG GD L ELSEK NQ  SN
Subjt:  ESNKQCMEESNKQNIEEANKQNMEEDNKELGLGDAVSELSEKSNHAIS-----THQTGGLDEEKPRVQHYMEEANKHNMELGLGDALGELSEKSNQAISN

Query:  HANNDQVEAGNLLSNDKDTEHLILPIEVETTDLLNECSEPPMEDDNKSYIEQMNPSIEDLTPNTSIQNLEMVPSSSQQLDHEDRRILKSRKKN-LLKSPV
         A+NDQVEAGNLL  DKDTE+LI+PIEVETT LL +CSE P E  NK+YIEQMNP IE+LT NT  Q LE VPS+S+Q DH+D+RILKS K N +L+S V
Subjt:  HANNDQVEAGNLLSNDKDTEHLILPIEVETTDLLNECSEPPMEDDNKSYIEQMNPSIEDLTPNTSIQNLEMVPSSSQQLDHEDRRILKSRKKN-LLKSPV

Query:  STDRVLRSRTQEKSKDSEPSIDLKSKDSEPSNDLNNVTAGEERKRKRKKRSIKGEGARVDEFSSIRKHLRYLMNRINYEQNLIEAYSSEGWKGFSSDKLK
        S+DR +RS+TQE           K KD EPSNDLNN TA EE K K+K+R+I+G+GARVDEFSSIR HLRYL+NRINYEQNLIEAYSSEGWKGFSSDKLK
Subjt:  STDRVLRSRTQEKSKDSEPSIDLKSKDSEPSNDLNNVTAGEERKRKRKKRSIKGEGARVDEFSSIRKHLRYLMNRINYEQNLIEAYSSEGWKGFSSDKLK

Query:  PEKELQRASNEIMRRKLKIRDLFQRIDALCREGRLSESLFDSEGQIDSEDIFCAKCGSDELSLENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWL
        PEKELQRASNEIMRRKLKIRD+FQRIDALC EG LS+SLFDS+GQI SEDIFCAKCGS ELSLENDIILCDG+CDRGFHQFCLEPPLL TDIP DDEGWL
Subjt:  PEKELQRASNEIMRRKLKIRDLFQRIDALCREGRLSESLFDSEGQIDSEDIFCAKCGSDELSLENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWL

Query:  CPGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQSSDNALGLPSDDSEDGDYDPNVPDVPDAIDQEDESSSEKSSSDQSGYASASEQLEALP
        CPGCDCKDDC++LLNEFQGS LSITDGWEKVYPEAAA+AAG++ D+ALGLPSDDSED DYD   PDVPD I Q+D     KSSS+ SGYASASE+LE+ P
Subjt:  CPGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQSSDNALGLPSDDSEDGDYDPNVPDVPDAIDQEDESSSEKSSSDQSGYASASEQLEALP

Query:  NDDQYLGLPSDDSEDEDFDPSAPERVKEVGQESSSSDFTSDSEDLAALGDNGSSKDDNLVSSSINSTIPVRNSNGKSSGCGPSKSALHNELSSILESGLD
        N DQYLGLPSDDSED+D+DPSAPER ++V QESSSSDFTSDSEDLAAL  N SSK DNLVSSS+N+T  ++N +G+SSG GP KS+L+NELSS+LESG D
Subjt:  NDDQYLGLPSDDSEDEDFDPSAPERVKEVGQESSSSDFTSDSEDLAALGDNGSSKDDNLVSSSINSTIPVRNSNGKSSGCGPSKSALHNELSSILESGLD

Query:  MDGLEPVSGRRHVERLDYKKLHDEAYGNVHTDSSDDTYGSDSMDSSDDRGHGRSTRKRSSKNPVPALSKNGTN-NLKNKKTKRSSKI-TCQKPVAENTDN
         DG EPV GRR VERLDYKKLHDE YGNV TDSSDDTY S SMDSSDD+G   +TRKRS K  V AL     N +L N KTK SSK  T QK  A N + 
Subjt:  MDGLEPVSGRRHVERLDYKKLHDEAYGNVHTDSSDDTYGSDSMDSSDDRGHGRSTRKRSSKNPVPALSKNGTN-NLKNKKTKRSSKI-TCQKPVAENTDN

Query:  SVARTPDNTLKSSSSVRRTTSSSHRRLSQQALERLLASFQENQYPERATKESLAEELRLSPKQVSKWFENTRWSTRHPSS-EANKAKSASRTGIQSSQTS
        SV +TP++T K+SSSVRRTTSSS+RRLSQ ALERLLASFQENQYPERATKESLA+EL LS KQV+KWF NTRWSTRHPSS E NKAKS+SR GI SSQ S
Subjt:  SVARTPDNTLKSSSSVRRTTSSSHRRLSQQALERLLASFQENQYPERATKESLAEELRLSPKQVSKWFENTRWSTRHPSS-EANKAKSASRTGIQSSQTS

Query:  GKLPKPEQESGACFRDNDSNGAQHQESPKPISVVFPCQSGDAGDDKLANPKTKRPESTSGKSRKRKGRPDHTASRSKDRKESQKPPAKSRKVNETQTADK
        G+L +PE+E           GAQHQE P   SVV PCQSGD GD KLA   TKR E ++ KSRKRKGR DH ASRSKD KESQ+PPAKS KVNE QTA  
Subjt:  GKLPKPEQESGACFRDNDSNGAQHQESPKPISVVFPCQSGDAGDDKLANPKTKRPESTSGKSRKRKGRPDHTASRSKDRKESQKPPAKSRKVNETQTADK

Query:  ARTRRRKSI
         +TRRR S+
Subjt:  ARTRRRKSI

SwissProt top hitse value%identityAlignment
P46605 Homeobox protein HOX1A2.6e-9939.94Show/hide
Query:  LKSKDSEPSNDLNNVTAGEERKRKRKKRSIKGEGARVDEFSSIRKHLRYLMNRINYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDL
        L+S  S  +    +V A  +   KR+K S     +  DEFS IRK +RY++NR+NYEQ+LIEAY+SEGWK  S DK++PEKEL+RA +EI+R KL+IR++
Subjt:  LKSKDSEPSNDLNNVTAGEERKRKRKKRSIKGEGARVDEFSSIRKHLRYLMNRINYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDL

Query:  FQRIDALCREGRLSESLFDSEGQIDSEDIFCAKCGSDELSLENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWLCPGCDCKDDCMDLLNEFQGSNL
        F+ ID+L  +G++ E+LFDSEG+I  EDIFC+ CGS++ +L NDIILCDG CDRGFHQ CL PPL   DIP  DEGWLCP CDCK DC+DL+NE  GSN+
Subjt:  FQRIDALCREGRLSESLFDSEGQIDSEDIFCAKCGSDELSLENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWLCPGCDCKDDCMDLLNEFQGSNL

Query:  SITDGWEKVYPEAAAAAAGQSSDNALGLPSDDSEDGDYDPNVPDVPDAIDQEDESSSEK----SSSDQSGYASASEQLEALPN---DDQYLGLPSDDSED
        SI D WEKV+P+AAA A     D+A  LPSDDS+D D+DPN+P+    + +++ESS E     S SD S + + S+  E L +   DD  L LPS+DSED
Subjt:  SITDGWEKVYPEAAAAAAGQSSDNALGLPSDDSEDGDYDPNVPDVPDAIDQEDESSSEK----SSSDQSGYASASEQLEALPN---DDQYLGLPSDDSED

Query:  EDFDPSAPERVKEVGQESSS--SDFTSDSEDLAALGDNGSSKDDNLVSSSINSTIPVRNSNGKSSGCGPSKSALHNELSSILESGLDMDGLEPVSGRRHV
        +D+DP+ P+  K+V ++SSS  SDFTSDS+D        S    + VSS +     V +    ++    + SA        +E+ +D   + P S RR  
Subjt:  EDFDPSAPERVKEVGQESSS--SDFTSDSEDLAALGDNGSSKDDNLVSSSINSTIPVRNSNGKSSGCGPSKSALHNELSSILESGLDMDGLEPVSGRRHV

Query:  ERLDYKKLHDEAYGNVHTDSSDDTYGSDS---MDSSDDRGHGRSTRKRSSKNPVPALSKNGTNNLKNKKTKRSSKITCQKPVAENTDNSVARTPDNTLKS
        ERLDYKKL+DEAYG   +DSSDD   S     +  S++ G   S   + S+        +  + L  + TK+S           +   SV   P +   +
Subjt:  ERLDYKKLHDEAYGNVHTDSSDDTYGSDS---MDSSDDRGHGRSTRKRSSKNPVPALSKNGTNNLKNKKTKRSSKITCQKPVAENTDNSVARTPDNTLKS

Query:  SSSVRRTTSSSHRRLSQQALERLLASFQENQYPERATKESLAEELRLSPKQVSKWFENTRWSTRHPSSEANKAKSASRTGIQSSQTSGKLPKPEQESGAC
         S+   T    H        ++L   F+   YP R+ KESLAEEL L+ +QV+KWFE  R S R  SS    +         +SQ +  + +P++  G  
Subjt:  SSSVRRTTSSSHRRLSQQALERLLASFQENQYPERATKESLAEELRLSPKQVSKWFENTRWSTRHPSSEANKAKSASRTGIQSSQTSGKLPKPEQESGAC

Query:  FRDND---SNGAQHQESPKPISVVFPCQSGDAGDDKLANPKTKRPESTSGKSRKRK
          +++   + G    +      V       D G  K+ + + + P     +  ++K
Subjt:  FRDND---SNGAQHQESPKPISVVFPCQSGDAGDDKLANPKTKRPESTSGKSRKRK

P48785 Pathogenesis-related homeodomain protein7.2e-4930.02Show/hide
Query:  RSRTQEKSKDSEPSIDLKSKDSEPSNDLNNVTAGEERK----RKRK-KRSIKGEGARVDEFSSIRKHLRYLMNRINYEQNLIEAYSSEGWKGFSSDKLKP
        +  +++  + +E  +  KS+  + S         EE K    RKRK KR  K     VD+   +++  RYL+ ++  +QNLI+AY++EGWKG S +K++P
Subjt:  RSRTQEKSKDSEPSIDLKSKDSEPSNDLNNVTAGEERK----RKRK-KRSIKGEGARVDEFSSIRKHLRYLMNRINYEQNLIEAYSSEGWKGFSSDKLKP

Query:  EKELQRASNEIMRRKLKIRDLFQRIDALCREGRLSESLFDSEGQIDSEDIFCAKCGSDELSLENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWLC
        +KEL+RA  EI+  KL +RD  +++D L   G + E +  S+G I  + IFCA+C S E   +NDIILCDG C+R FHQ CL+PPL    IPP D+GW C
Subjt:  EKELQRASNEIMRRKLKIRDLFQRIDALCREGRLSESLFDSEGQIDSEDIFCAKCGSDELSLENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWLC

Query:  PGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQSSDNALGLPSDDSEDGDYDPNVPDVPDAIDQEDESSSEKSSSDQSGYASASEQLEALPN
          CDCK + +D +N   G++  +   W+ ++ E A+   G                                     SE + ++++ +            
Subjt:  PGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQSSDNALGLPSDDSEDGDYDPNVPDVPDAIDQEDESSSEKSSSDQSGYASASEQLEALPN

Query:  DDQYLGLPSDDSEDEDFDPSAPERVKEVGQESSSSDFTSDSEDLAALGDNGSSKDDNLVSSSIN-STIPVRNSNGKSSGCGPSKSALHNELSSILESGLD
               PSDDS+D+D+DP     ++E G  +SS+            GD G   D+  +S+S++ S+  V  S G   G         + LS+++E   +
Subjt:  DDQYLGLPSDDSEDEDFDPSAPERVKEVGQESSSSDFTSDSEDLAALGDNGSSKDDNLVSSSIN-STIPVRNSNGKSSGCGPSKSALHNELSSILESGLD

Query:  MDGLEPVSGRRHVERLDYKKLHDEAYGNVHTDSSDDTYGSDSMD-SSDDRGHGRSTRKRSSKNPVPALSKNGTNNLKNKKTKRSSKIT-CQKPVAENTDN
            E V G R    +DY +L+ E +G    D+     GS+  D   +DR      RKR                 K +    S+ +T C+   +   D 
Subjt:  MDGLEPVSGRRHVERLDYKKLHDEAYGNVHTDSSDDTYGSDSMD-SSDDRGHGRSTRKRSSKNPVPALSKNGTNNLKNKKTKRSSKIT-CQKPVAENTDN

Query:  SVARTPDNTLKSSSSVRRTTSSSHR-RLSQQALERLLASFQENQYPERATKESLAEELRLSPKQVSKWFENTRW-STRHPSSEANKAKSASRT
         V  T + + + S SV          RL + A+E+L   F E + P +A ++ LA+EL L P++V+KWF+NTR+ + R+  +E+ K    S+T
Subjt:  SVARTPDNTLKSSSSVRRTTSSSHR-RLSQQALERLLASFQENQYPERATKESLAEELRLSPKQVSKWFENTRW-STRHPSSEANKAKSASRT

P48786 Pathogenesis-related homeodomain protein3.7e-11442.51Show/hide
Query:  EMVPSSSQQLDHEDRRILKSRKKNLLKSPVSTDRVLR------SRTQEKSKDSEPSIDLKSKDSEPSNDLNNVTA--GEERKRKRKKRSIKGEGARVDEF
        E    SS QL    +R  + R + +  SP S    ++        +QE S +S  S+  +S++     D+NN+ A  G +R++ RKKR  + E  RVDEF
Subjt:  EMVPSSSQQLDHEDRRILKSRKKNLLKSPVSTDRVLR------SRTQEKSKDSEPSIDLKSKDSEPSNDLNNVTA--GEERKRKRKKRSIKGEGARVDEF

Query:  SSIRKHLRYLMNRINYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCREGRLSESLFDSEGQIDSEDIFCAKCGSDELS
          IR HLRYL++RI YE+N ++AYS EGWKG S DK+KPEKEL+RA  EI  RKLKIRDLFQR+D    EGRL E LFDS G+IDSEDIFCAKCGS +++
Subjt:  SSIRKHLRYLMNRINYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCREGRLSESLFDSEGQIDSEDIFCAKCGSDELS

Query:  LENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSITDGWEKVY-PEAAAAAAGQSSDNALGLPSDDSEDGDYD
        L NDIILCDG CDRGFHQFCL+PPLLK  IPPDDEGWLCPGC+CK DC+ LLN+ Q +N+ + D WEKV+  EAAAAA+G++ D+  GLPSDDSED DYD
Subjt:  LENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSITDGWEKVY-PEAAAAAAGQSSDNALGLPSDDSEDGDYD

Query:  PNVPDVPDAIDQEDESSSEKSSSDQSGYASASEQLEALPNDDQYLGLPSDDSEDEDFDPSAPERVKEVGQESSSSDFTSDSEDLAALGDNGSSKDDNLVS
        P  PD+ + +  +D      SS+D+S Y S S+ ++ +   +   GLPSDDSED+++DPS      ++ ++SS SDFTSDSED   + D+   KD     
Subjt:  PNVPDVPDAIDQEDESSSEKSSSDQSGYASASEQLEALPNDDQYLGLPSDDSEDEDFDPSAPERVKEVGQESSSSDFTSDSEDLAALGDNGSSKDDNLVS

Query:  SSINSTIP-VRNSNGKSSGCGPSKSALHNELSSILESGLDMDGLEPVSGRRHVERLDYKKLHD--------------------------EAYGNVHTDSS
          + ST   VRN+     GCG      H E               P+  RR VE LDYKKL+D                          E YGN  +DSS
Subjt:  SSINSTIP-VRNSNGKSSGCGPSKSALHNELSSILESGLDMDGLEPVSGRRHVERLDYKKLHD--------------------------EAYGNVHTDSS

Query:  DDTY---GSDSMDSSDDRGHGRSTRKRSSKNPVPALSKNGTNNLKNKKTKRSSKITCQKPVAENTDNSVARTPDNTLKSSSSVRRTTSSSHRRLSQQALE
        D+ Y    S   ++SD         + S    +   ++  T+N +  K          K   E TD+ ++R+ +++    +  + T+ + H    + A +
Subjt:  DDTY---GSDSMDSSDDRGHGRSTRKRSSKNPVPALSKNGTNNLKNKKTKRSSKITCQKPVAENTDNSVARTPDNTLKSSSSVRRTTSSSHRRLSQQALE

Query:  RLLASFQENQYPERATKESLAEELRLSPKQVSKWFENTRWSTRH-----------PSSEANKAKSASRTG-----IQSSQTSGKLPKPEQESGA
        RLL SF+ENQYP+RA KESLA EL LS +QVS WF N RWS RH            S++  + KS   +G     +  S T  ++ K EQ++ +
Subjt:  RLLASFQENQYPERATKESLAEELRLSPKQVSKWFENTRWSTRH-----------PSSEANKAKSASRTG-----IQSSQTSGKLPKPEQESGA

Q04996 Homeobox protein HAT3.11.1e-11345.74Show/hide
Query:  KDSEPSNDLNNVTAGEERKRKRKKRSIKGEGARVDEFSSIRKHLRYLMNRINYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQR
        +D+ PS+ + N T    R +K+ K   KG+    DE++ I+K LRY +NRINYEQ+LI+AYS EGWKG S +K++PEKEL+RA+ EI+RRKLKIRDLFQ 
Subjt:  KDSEPSNDLNNVTAGEERKRKRKKRSIKGEGARVDEFSSIRKHLRYLMNRINYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQR

Query:  IDALCREGRLSESLFDSEGQIDSEDIFCAKCGSDELSLENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSIT
        +D LC EG L ESLFD++G+I SEDIFCAKCGS +LS++NDIILCDG CDRGFHQ+CLEPPL K DIPPDDEGWLCPGCDCKDD +DLLN+  G+  S++
Subjt:  IDALCREGRLSESLFDSEGQIDSEDIFCAKCGSDELSLENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSIT

Query:  DGWEKVYPEAAAAAAGQSSDNALGLPSDDSEDGDYDPNV----PDVPDAIDQEDESSSEKSSSDQSGYASASEQL-----EALPNDDQYLGLPSDDSEDE
        D WEK++PEAAAA  G   +    LPSDDS+D +YDP+      +  D  D  +ES +E  SSD++ + SAS+++     E        + LPSDDSED+
Subjt:  DGWEKVYPEAAAAAAGQSSDNALGLPSDDSEDGDYDPNV----PDVPDAIDQEDESSSEKSSSDQSGYASASEQL-----EALPNDDQYLGLPSDDSEDE

Query:  DFDPSAPERVKEVGQESSSSDFTSDSEDLAALGDNGSSKDDNLVSSSINSTIPVRNSNGKSSGCGPSKSALHNELSSILES--GLDMDGLEPVSGRRHVE
        D+DP AP    +  +ESS+SD TSD+EDL       S K D     + ++ +             P +     +  +ILES  GLD DG   VS RR+VE
Subjt:  DFDPSAPERVKEVGQESSSSDFTSDSEDLAALGDNGSSKDDNLVSSSINSTIPVRNSNGKSSGCGPSKSALHNELSSILES--GLDMDGLEPVSGRRHVE

Query:  RLDYKKLHDEAYGNVHTDSSDDTYGSDSMDSSDDRGHGRSTRKRSSKNPVPALSKNGTNNLKNKKTKRSSKITCQKPVAENTDNSVARTPDNTLKSSSSV
        RLDYKKL+DE Y NV T SSDD    D  D +   G    +      + VP    +   +  +KK  R SK   +K   E                S  +
Subjt:  RLDYKKLHDEAYGNVHTDSSDDTYGSDSMDSSDDRGHGRSTRKRSSKNPVPALSKNGTNNLKNKKTKRSSKITCQKPVAENTDNSVARTPDNTLKSSSSV

Query:  RRTTSSSHRRLSQQALERLLASFQENQYPERATKESLAEELRLSPKQVSKWFENTRWS--TRHPSSEAN--KAKSASRTGIQSSQTSGKLPKPEQESGA
         +++SS+ ++   +  +RL  SFQENQYP++ATKESLA+EL+++ KQV+ WF++ RWS  ++   SE N  K K+      ++S         E ES A
Subjt:  RRTTSSSHRRLSQQALERLLASFQENQYPERATKESLAEELRLSPKQVSKWFENTRWS--TRHPSSEAN--KAKSASRTGIQSSQTSGKLPKPEQESGA

Q8H991 Homeobox protein HAZ11.3e-10341.87Show/hide
Query:  EMVPSSSQQL--DHEDRRILKSRKKNLLKSPV-------STDRVLRSRTQEKSKDSEPSIDLKS---KDSEPSNDLNNVTAGEERKRKRKK--RSIKGEG
        ++ PS +  L   H + R+L+S  K   ++P+       +  RV + R + K     PS  L+S   K ++  N+L N  AG +   K++K  R  KG G
Subjt:  EMVPSSSQQL--DHEDRRILKSRKKNLLKSPV-------STDRVLRSRTQEKSKDSEPSIDLKS---KDSEPSNDLNNVTAGEERKRKRKK--RSIKGEG

Query:  ARVDEFSSIRKHLRYLMNRINYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCREGRLSESLFDSEGQIDSEDIFCAKC
           D++  IRK +RY++NR+NYEQ+LI+AY+SEGWKG S +K++PEKEL+RA  EI+R K +IR+ F+ +D+L  EG+L ES+FDS G+I SEDIFCA C
Subjt:  ARVDEFSSIRKHLRYLMNRINYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCREGRLSESLFDSEGQIDSEDIFCAKC

Query:  GSDELSLENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQSSDNALGLPSDDSE
        GS +++L+NDIILCDG+CDRGFHQ+CL PPLL  DIP  DEGWLCP CDCK DC+D+LNE QG  LSI D WEKV+PEAA+   G    +A  LPSDDS 
Subjt:  GSDELSLENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQSSDNALGLPSDDSE

Query:  DGDYDPNVPDVPDAIDQE-------------DESSSEKSSSDQSGYASASEQLEALPNDDQYLGLPSDDSEDEDFDPSAPERVKEVGQESSS-----SDF
        D DYDP +      +D+E             D+SSSE S S +   +  S+    +  DD  LGLPS+DSED DFDP+ P+  KE   ES+S     SDF
Subjt:  DGDYDPNVPDVPDAIDQE-------------DESSSEKSSSDQSGYASASEQLEALPNDDQYLGLPSDDSEDEDFDPSAPERVKEVGQESSS-----SDF

Query:  TSDSEDLAALGDNGSSKDDNLVSSSINSTI-PVRNSNGKSSGCGPSKSALHNELSSILESGLDMDGLEPVSGRRHVERLDYKKLHDEAYGNVHTDSSDDT
        TSDS+D  A       +D+  +S   +S I  V  ++G  SG     +A ++ L + +E+ L+ D + P+S +R VERLDYKKL++EAYG   +DSSDD 
Subjt:  TSDSEDLAALGDNGSSKDDNLVSSSINSTI-PVRNSNGKSSGCGPSKSALHNELSSILESGLDMDGLEPVSGRRHVERLDYKKLHDEAYGNVHTDSSDDT

Query:  --YG-------------SDSMDSSDDRGHGRSTRKRSSKNPVPALSKNGTNNLKNKKTKRSSKITCQKPVAENTDNSVARTPDNTLKSSSSVRRTTSSSH
          YG             +DS+  S   G G S R        P    N  +  +N +   S      + +  N++ S A+                   +
Subjt:  --YG-------------SDSMDSSDDRGHGRSTRKRSSKNPVPALSKNGTNNLKNKKTKRSSKITCQKPVAENTDNSVARTPDNTLKSSSSVRRTTSSSH

Query:  RRLSQQALERLLASFQENQYPERATKESLAEELRLSPKQVSKWFENTRWSTR
        R       ++L A F+E+ YP RATKE+LA+EL L+  QV+KWF +TR   R
Subjt:  RRLSQQALERLLASFQENQYPERATKESLAEELRLSPKQVSKWFENTRWSTR

Arabidopsis top hitse value%identityAlignment
AT3G19510.1 Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain7.7e-11545.74Show/hide
Query:  KDSEPSNDLNNVTAGEERKRKRKKRSIKGEGARVDEFSSIRKHLRYLMNRINYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQR
        +D+ PS+ + N T    R +K+ K   KG+    DE++ I+K LRY +NRINYEQ+LI+AYS EGWKG S +K++PEKEL+RA+ EI+RRKLKIRDLFQ 
Subjt:  KDSEPSNDLNNVTAGEERKRKRKKRSIKGEGARVDEFSSIRKHLRYLMNRINYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQR

Query:  IDALCREGRLSESLFDSEGQIDSEDIFCAKCGSDELSLENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSIT
        +D LC EG L ESLFD++G+I SEDIFCAKCGS +LS++NDIILCDG CDRGFHQ+CLEPPL K DIPPDDEGWLCPGCDCKDD +DLLN+  G+  S++
Subjt:  IDALCREGRLSESLFDSEGQIDSEDIFCAKCGSDELSLENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSIT

Query:  DGWEKVYPEAAAAAAGQSSDNALGLPSDDSEDGDYDPNV----PDVPDAIDQEDESSSEKSSSDQSGYASASEQL-----EALPNDDQYLGLPSDDSEDE
        D WEK++PEAAAA  G   +    LPSDDS+D +YDP+      +  D  D  +ES +E  SSD++ + SAS+++     E        + LPSDDSED+
Subjt:  DGWEKVYPEAAAAAAGQSSDNALGLPSDDSEDGDYDPNV----PDVPDAIDQEDESSSEKSSSDQSGYASASEQL-----EALPNDDQYLGLPSDDSEDE

Query:  DFDPSAPERVKEVGQESSSSDFTSDSEDLAALGDNGSSKDDNLVSSSINSTIPVRNSNGKSSGCGPSKSALHNELSSILES--GLDMDGLEPVSGRRHVE
        D+DP AP    +  +ESS+SD TSD+EDL       S K D     + ++ +             P +     +  +ILES  GLD DG   VS RR+VE
Subjt:  DFDPSAPERVKEVGQESSSSDFTSDSEDLAALGDNGSSKDDNLVSSSINSTIPVRNSNGKSSGCGPSKSALHNELSSILES--GLDMDGLEPVSGRRHVE

Query:  RLDYKKLHDEAYGNVHTDSSDDTYGSDSMDSSDDRGHGRSTRKRSSKNPVPALSKNGTNNLKNKKTKRSSKITCQKPVAENTDNSVARTPDNTLKSSSSV
        RLDYKKL+DE Y NV T SSDD    D  D +   G    +      + VP    +   +  +KK  R SK   +K   E                S  +
Subjt:  RLDYKKLHDEAYGNVHTDSSDDTYGSDSMDSSDDRGHGRSTRKRSSKNPVPALSKNGTNNLKNKKTKRSSKITCQKPVAENTDNSVARTPDNTLKSSSSV

Query:  RRTTSSSHRRLSQQALERLLASFQENQYPERATKESLAEELRLSPKQVSKWFENTRWS--TRHPSSEAN--KAKSASRTGIQSSQTSGKLPKPEQESGA
         +++SS+ ++   +  +RL  SFQENQYP++ATKESLA+EL+++ KQV+ WF++ RWS  ++   SE N  K K+      ++S         E ES A
Subjt:  RRTTSSSHRRLSQQALERLLASFQENQYPERATKESLAEELRLSPKQVSKWFENTRWS--TRHPSSEAN--KAKSASRTGIQSSQTSGKLPKPEQESGA

AT4G29940.1 pathogenesis related homeodomain protein A5.1e-5030.02Show/hide
Query:  RSRTQEKSKDSEPSIDLKSKDSEPSNDLNNVTAGEERK----RKRK-KRSIKGEGARVDEFSSIRKHLRYLMNRINYEQNLIEAYSSEGWKGFSSDKLKP
        +  +++  + +E  +  KS+  + S         EE K    RKRK KR  K     VD+   +++  RYL+ ++  +QNLI+AY++EGWKG S +K++P
Subjt:  RSRTQEKSKDSEPSIDLKSKDSEPSNDLNNVTAGEERK----RKRK-KRSIKGEGARVDEFSSIRKHLRYLMNRINYEQNLIEAYSSEGWKGFSSDKLKP

Query:  EKELQRASNEIMRRKLKIRDLFQRIDALCREGRLSESLFDSEGQIDSEDIFCAKCGSDELSLENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWLC
        +KEL+RA  EI+  KL +RD  +++D L   G + E +  S+G I  + IFCA+C S E   +NDIILCDG C+R FHQ CL+PPL    IPP D+GW C
Subjt:  EKELQRASNEIMRRKLKIRDLFQRIDALCREGRLSESLFDSEGQIDSEDIFCAKCGSDELSLENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWLC

Query:  PGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQSSDNALGLPSDDSEDGDYDPNVPDVPDAIDQEDESSSEKSSSDQSGYASASEQLEALPN
          CDCK + +D +N   G++  +   W+ ++ E A+   G                                     SE + ++++ +            
Subjt:  PGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQSSDNALGLPSDDSEDGDYDPNVPDVPDAIDQEDESSSEKSSSDQSGYASASEQLEALPN

Query:  DDQYLGLPSDDSEDEDFDPSAPERVKEVGQESSSSDFTSDSEDLAALGDNGSSKDDNLVSSSIN-STIPVRNSNGKSSGCGPSKSALHNELSSILESGLD
               PSDDS+D+D+DP     ++E G  +SS+            GD G   D+  +S+S++ S+  V  S G   G         + LS+++E   +
Subjt:  DDQYLGLPSDDSEDEDFDPSAPERVKEVGQESSSSDFTSDSEDLAALGDNGSSKDDNLVSSSIN-STIPVRNSNGKSSGCGPSKSALHNELSSILESGLD

Query:  MDGLEPVSGRRHVERLDYKKLHDEAYGNVHTDSSDDTYGSDSMD-SSDDRGHGRSTRKRSSKNPVPALSKNGTNNLKNKKTKRSSKIT-CQKPVAENTDN
            E V G R    +DY +L+ E +G    D+     GS+  D   +DR      RKR                 K +    S+ +T C+   +   D 
Subjt:  MDGLEPVSGRRHVERLDYKKLHDEAYGNVHTDSSDDTYGSDSMD-SSDDRGHGRSTRKRSSKNPVPALSKNGTNNLKNKKTKRSSKIT-CQKPVAENTDN

Query:  SVARTPDNTLKSSSSVRRTTSSSHR-RLSQQALERLLASFQENQYPERATKESLAEELRLSPKQVSKWFENTRW-STRHPSSEANKAKSASRT
         V  T + + + S SV          RL + A+E+L   F E + P +A ++ LA+EL L P++V+KWF+NTR+ + R+  +E+ K    S+T
Subjt:  SVARTPDNTLKSSSSVRRTTSSSHR-RLSQQALERLLASFQENQYPERATKESLAEELRLSPKQVSKWFENTRW-STRHPSSEANKAKSASRT

AT4G29940.2 pathogenesis related homeodomain protein A5.1e-5030.02Show/hide
Query:  RSRTQEKSKDSEPSIDLKSKDSEPSNDLNNVTAGEERK----RKRK-KRSIKGEGARVDEFSSIRKHLRYLMNRINYEQNLIEAYSSEGWKGFSSDKLKP
        +  +++  + +E  +  KS+  + S         EE K    RKRK KR  K     VD+   +++  RYL+ ++  +QNLI+AY++EGWKG S +K++P
Subjt:  RSRTQEKSKDSEPSIDLKSKDSEPSNDLNNVTAGEERK----RKRK-KRSIKGEGARVDEFSSIRKHLRYLMNRINYEQNLIEAYSSEGWKGFSSDKLKP

Query:  EKELQRASNEIMRRKLKIRDLFQRIDALCREGRLSESLFDSEGQIDSEDIFCAKCGSDELSLENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWLC
        +KEL+RA  EI+  KL +RD  +++D L   G + E +  S+G I  + IFCA+C S E   +NDIILCDG C+R FHQ CL+PPL    IPP D+GW C
Subjt:  EKELQRASNEIMRRKLKIRDLFQRIDALCREGRLSESLFDSEGQIDSEDIFCAKCGSDELSLENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWLC

Query:  PGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQSSDNALGLPSDDSEDGDYDPNVPDVPDAIDQEDESSSEKSSSDQSGYASASEQLEALPN
          CDCK + +D +N   G++  +   W+ ++ E A+   G                                     SE + ++++ +            
Subjt:  PGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQSSDNALGLPSDDSEDGDYDPNVPDVPDAIDQEDESSSEKSSSDQSGYASASEQLEALPN

Query:  DDQYLGLPSDDSEDEDFDPSAPERVKEVGQESSSSDFTSDSEDLAALGDNGSSKDDNLVSSSIN-STIPVRNSNGKSSGCGPSKSALHNELSSILESGLD
               PSDDS+D+D+DP     ++E G  +SS+            GD G   D+  +S+S++ S+  V  S G   G         + LS+++E   +
Subjt:  DDQYLGLPSDDSEDEDFDPSAPERVKEVGQESSSSDFTSDSEDLAALGDNGSSKDDNLVSSSIN-STIPVRNSNGKSSGCGPSKSALHNELSSILESGLD

Query:  MDGLEPVSGRRHVERLDYKKLHDEAYGNVHTDSSDDTYGSDSMD-SSDDRGHGRSTRKRSSKNPVPALSKNGTNNLKNKKTKRSSKIT-CQKPVAENTDN
            E V G R    +DY +L+ E +G    D+     GS+  D   +DR      RKR                 K +    S+ +T C+   +   D 
Subjt:  MDGLEPVSGRRHVERLDYKKLHDEAYGNVHTDSSDDTYGSDSMD-SSDDRGHGRSTRKRSSKNPVPALSKNGTNNLKNKKTKRSSKIT-CQKPVAENTDN

Query:  SVARTPDNTLKSSSSVRRTTSSSHR-RLSQQALERLLASFQENQYPERATKESLAEELRLSPKQVSKWFENTRW-STRHPSSEANKAKSASRT
         V  T + + + S SV          RL + A+E+L   F E + P +A ++ LA+EL L P++V+KWF+NTR+ + R+  +E+ K    S+T
Subjt:  SVARTPDNTLKSSSSVRRTTSSSHR-RLSQQALERLLASFQENQYPERATKESLAEELRLSPKQVSKWFENTRW-STRHPSSEANKAKSASRT

AT5G09790.1 ARABIDOPSIS TRITHORAX-RELATED PROTEIN 56.5e-0538.24Show/hide
Query:  DSEGQIDSEDIFCAKCGSDELSLENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWLCPGCDCKD
        + E +    ++ C KCGS E   +++++LCD  CDRGFH  CL P +++  I      WLC   DC D
Subjt:  DSEGQIDSEDIFCAKCGSDELSLENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWLCPGCDCKD

AT5G09790.2 ARABIDOPSIS TRITHORAX-RELATED PROTEIN 56.5e-0538.24Show/hide
Query:  DSEGQIDSEDIFCAKCGSDELSLENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWLCPGCDCKD
        + E +    ++ C KCGS E   +++++LCD  CDRGFH  CL P +++  I      WLC   DC D
Subjt:  DSEGQIDSEDIFCAKCGSDELSLENDIILCDGVCDRGFHQFCLEPPLLKTDIPPDDEGWLCPGCDCKD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAAAGAAATGGATATACAGAATCAAGGCCTATTAATAACTCTGAAGCAGTACAAGAAGCTAAGGCATGTGTTGAAGTTGAAGTATTTACTTGTCTTTCAAATGA
GCAAATGCCTTCCGTTCCTGAATATCAGGAATCGGGAATAACCCCAGATTATACTTACCAAACTGGCGGTTCAGATGAAGAAAAACTTGAGGTCCAACAGAAAATGGAAG
AGGAGAATAAGGAGCTTGGTTTGCGAAATGCACTCCATGAGTTATCAGAAAAATGTAATCAAGCTACCTCTACCCACCAAATTGGCGGTTCAGATGAAGAAAAACCTGTG
GTCCAACAAAATATGGAGGAAGAGAATAAGGAGCTTGATTTGGGAGATGCGCTCAGTGAGTTATCAGAAAAAAATAATCAGGCTAACTCTATCCACCAAACTGGTGGTTC
AAATGAAGAAAAACGTGGGGTCCAGCTATATATGGAGGAATCGAATAAGCAGTGTATGGAGGAATCGAATAAGCAGTGTATGGAGGAATCAAATAAGCAGAATATAGAGG
AAGCAAATAAGCAAAATATGGAGGAAGATAATAAAGAGCTTGGTTTGGGAGATGCAGTTAGTGAGCTATCAGAAAAAAGTAATCATGCTATCTCTACCCACCAAACTGGC
GGTTTAGATGAAGAAAAACCTAGGGTCCAACACTATATGGAGGAAGCAAATAAGCATAATATGGAGCTTGGTTTGGGAGATGCGCTCGGTGAGTTATCAGAAAAAAGTAA
TCAAGCTATCTCTAACCATGCTAATAATGATCAAGTTGAAGCTGGAAATTTATTATCCAACGATAAAGATACTGAACATTTAATATTACCCATTGAAGTTGAGACAACGG
ATCTTCTTAATGAGTGCTCTGAACCTCCAATGGAAGATGACAACAAAAGTTACATCGAACAAATGAACCCTTCTATTGAAGATTTAACCCCAAACACTTCTATCCAAAAT
TTAGAAATGGTCCCCTCTAGTTCTCAACAGTTGGATCACGAGGATAGGAGAATTTTGAAATCAAGAAAGAAAAATTTGTTAAAGTCCCCTGTAAGTACCGACAGAGTTTT
GCGTTCAAGGACCCAAGAGAAATCTAAAGATTCTGAACCTAGTATTGACTTGAAATCTAAAGATTCTGAACCTAGTAATGACTTGAATAATGTTACTGCTGGTGAGGAGA
GGAAAAGGAAGAGGAAGAAGAGAAGTATAAAGGGAGAGGGAGCAAGAGTTGATGAGTTTTCATCAATCAGGAAACATTTGAGATATTTAATGAACCGCATCAATTATGAA
CAGAACTTGATTGAAGCTTATTCTAGTGAAGGCTGGAAAGGGTTCAGCTCAGATAAACTGAAGCCCGAAAAGGAACTTCAACGCGCATCAAATGAAATAATGCGACGCAA
ATTGAAAATAAGAGACCTATTTCAACGTATTGATGCACTTTGTCGTGAAGGAAGGCTTTCTGAATCTTTATTCGACTCTGAAGGGCAGATAGACAGTGAGGATATATTCT
GTGCAAAATGTGGATCCGATGAGCTATCCCTTGAGAATGACATCATACTCTGTGATGGTGTCTGTGATCGTGGGTTCCACCAGTTCTGTTTAGAACCACCTTTGCTAAAA
ACCGACATTCCACCAGATGATGAGGGATGGCTATGCCCAGGATGTGATTGCAAAGATGACTGCATGGATCTGCTTAATGAATTTCAAGGATCAAATCTTTCTATTACTGA
TGGTTGGGAGAAAGTCTATCCTGAGGCTGCAGCAGCAGCAGCTGGGCAGAGTTCTGATAACGCCTTGGGTCTTCCTTCAGATGATTCTGAAGATGGTGATTATGATCCTA
ATGTTCCAGATGTTCCAGATGCTATTGACCAGGAGGATGAATCAAGTTCAGAGAAATCGAGTTCTGATCAATCTGGGTATGCTTCTGCTTCTGAGCAATTGGAGGCTCTA
CCCAATGATGACCAGTACTTAGGTCTCCCTTCAGATGACTCGGAGGATGAGGATTTTGATCCCAGTGCACCAGAACGTGTCAAAGAAGTTGGGCAGGAAAGTTCAAGTTC
TGATTTTACATCTGATTCTGAGGATCTAGCTGCACTGGGCGATAATGGGTCTTCTAAAGATGATAACCTTGTGTCTTCTTCAATAAACAGTACTATTCCCGTGAGAAACT
CTAATGGGAAAAGTTCTGGTTGCGGTCCTAGCAAGAGTGCACTGCATAATGAGTTATCAAGTATACTAGAGTCGGGTCTCGATATGGATGGTCTTGAACCAGTTTCTGGA
AGAAGACATGTGGAACGGTTAGATTACAAGAAGCTCCATGATGAAGCATATGGGAATGTTCATACCGACTCAAGTGATGACACATATGGTAGTGATTCTATGGATTCAAG
TGATGACAGAGGTCATGGTAGGAGTACAAGGAAGAGAAGTTCTAAAAACCCAGTTCCTGCATTGTCAAAAAATGGAACTAATAATTTGAAAAACAAAAAAACTAAACGCA
GTTCTAAGATAACTTGTCAAAAGCCAGTTGCTGAAAATACGGATAATTCTGTGGCTAGAACTCCAGACAACACTTTGAAATCTAGTTCTTCTGTCAGACGAACTACTTCT
TCATCACATAGAAGGCTCAGTCAACAAGCATTGGAGAGACTTCTTGCATCATTCCAAGAAAATCAGTATCCTGAGCGTGCAACAAAGGAGAGTTTGGCAGAAGAACTGAG
GCTCAGTCCGAAGCAGGTTAGTAAATGGTTTGAGAACACACGTTGGAGCACTCGCCATCCCTCCAGTGAAGCTAATAAAGCAAAGAGTGCTTCAAGAACGGGGATTCAAT
CATCTCAGACAAGTGGAAAACTTCCTAAGCCTGAGCAAGAATCTGGTGCATGTTTCAGAGATAACGATAGCAATGGTGCTCAACATCAAGAATCACCAAAGCCAATTAGT
GTTGTGTTCCCATGTCAGAGTGGAGATGCAGGGGATGACAAATTGGCAAATCCAAAAACTAAAAGACCAGAATCTACTTCTGGAAAATCCAGAAAACGGAAGGGCAGGCC
AGACCACACGGCATCTCGTTCGAAGGACAGAAAGGAATCACAGAAGCCTCCAGCGAAGTCACGAAAAGTTAATGAAACTCAAACAGCGGATAAGGCTAGGACAAGGAGGA
GGAAATCCATTTAG
mRNA sequenceShow/hide mRNA sequence
GGGGGAGGGTGGCCCAGATTTAGAGGACTCCCAAAGTCTCTCACACAAACCCCCGCTTTTCTCTCTCTCCGACGAACAACCACCACCACCGTCGCCGGATTCATCGTACT
ATTCTCTCTCAAACCTCCATAATCAGCCGCCCACCCACCAAGATCCAACCCACCAGATAGTCGAGGCTGATGAATTTACGTGAAGAAGTTTTTTGAGGAGATTGCTTGAG
GTCGTCCAGGAAAGGTTTGTGGATGAATCTTCGGCACGCACAAAAAAATTGTCATCAGGAAAGTAAGGAAAAATTCATGAAGGGGACAATTGAAACAACCTTTGAAGTTG
TCATCAGGAAAGGTTTGTGGCTTCCTTTATTATAATACATTAGATGTGGTGTATTAATTTCATTCAAACATCTGTGGCCAAGCACTTCCAAAATGATTGATTATGAAGTG
GCTATTTAGTCTCTGGAAGCGATAGGCAAGATTCTACATTATTTTAGTGGTTGCTATTTTAGCCTAAAAGTCTGAGCAACCCCTAGTTGAAGTAAGCAGGAATAGGAGTA
GTTGATATTCTTAGGGGACAATATGGAAGAAAGAAATGGATATACAGAATCAAGGCCTATTAATAACTCTGAAGCAGTACAAGAAGCTAAGGCATGTGTTGAAGTTGAAG
TATTTACTTGTCTTTCAAATGAGCAAATGCCTTCCGTTCCTGAATATCAGGAATCGGGAATAACCCCAGATTATACTTACCAAACTGGCGGTTCAGATGAAGAAAAACTT
GAGGTCCAACAGAAAATGGAAGAGGAGAATAAGGAGCTTGGTTTGCGAAATGCACTCCATGAGTTATCAGAAAAATGTAATCAAGCTACCTCTACCCACCAAATTGGCGG
TTCAGATGAAGAAAAACCTGTGGTCCAACAAAATATGGAGGAAGAGAATAAGGAGCTTGATTTGGGAGATGCGCTCAGTGAGTTATCAGAAAAAAATAATCAGGCTAACT
CTATCCACCAAACTGGTGGTTCAAATGAAGAAAAACGTGGGGTCCAGCTATATATGGAGGAATCGAATAAGCAGTGTATGGAGGAATCGAATAAGCAGTGTATGGAGGAA
TCAAATAAGCAGAATATAGAGGAAGCAAATAAGCAAAATATGGAGGAAGATAATAAAGAGCTTGGTTTGGGAGATGCAGTTAGTGAGCTATCAGAAAAAAGTAATCATGC
TATCTCTACCCACCAAACTGGCGGTTTAGATGAAGAAAAACCTAGGGTCCAACACTATATGGAGGAAGCAAATAAGCATAATATGGAGCTTGGTTTGGGAGATGCGCTCG
GTGAGTTATCAGAAAAAAGTAATCAAGCTATCTCTAACCATGCTAATAATGATCAAGTTGAAGCTGGAAATTTATTATCCAACGATAAAGATACTGAACATTTAATATTA
CCCATTGAAGTTGAGACAACGGATCTTCTTAATGAGTGCTCTGAACCTCCAATGGAAGATGACAACAAAAGTTACATCGAACAAATGAACCCTTCTATTGAAGATTTAAC
CCCAAACACTTCTATCCAAAATTTAGAAATGGTCCCCTCTAGTTCTCAACAGTTGGATCACGAGGATAGGAGAATTTTGAAATCAAGAAAGAAAAATTTGTTAAAGTCCC
CTGTAAGTACCGACAGAGTTTTGCGTTCAAGGACCCAAGAGAAATCTAAAGATTCTGAACCTAGTATTGACTTGAAATCTAAAGATTCTGAACCTAGTAATGACTTGAAT
AATGTTACTGCTGGTGAGGAGAGGAAAAGGAAGAGGAAGAAGAGAAGTATAAAGGGAGAGGGAGCAAGAGTTGATGAGTTTTCATCAATCAGGAAACATTTGAGATATTT
AATGAACCGCATCAATTATGAACAGAACTTGATTGAAGCTTATTCTAGTGAAGGCTGGAAAGGGTTCAGCTCAGATAAACTGAAGCCCGAAAAGGAACTTCAACGCGCAT
CAAATGAAATAATGCGACGCAAATTGAAAATAAGAGACCTATTTCAACGTATTGATGCACTTTGTCGTGAAGGAAGGCTTTCTGAATCTTTATTCGACTCTGAAGGGCAG
ATAGACAGTGAGGATATATTCTGTGCAAAATGTGGATCCGATGAGCTATCCCTTGAGAATGACATCATACTCTGTGATGGTGTCTGTGATCGTGGGTTCCACCAGTTCTG
TTTAGAACCACCTTTGCTAAAAACCGACATTCCACCAGATGATGAGGGATGGCTATGCCCAGGATGTGATTGCAAAGATGACTGCATGGATCTGCTTAATGAATTTCAAG
GATCAAATCTTTCTATTACTGATGGTTGGGAGAAAGTCTATCCTGAGGCTGCAGCAGCAGCAGCTGGGCAGAGTTCTGATAACGCCTTGGGTCTTCCTTCAGATGATTCT
GAAGATGGTGATTATGATCCTAATGTTCCAGATGTTCCAGATGCTATTGACCAGGAGGATGAATCAAGTTCAGAGAAATCGAGTTCTGATCAATCTGGGTATGCTTCTGC
TTCTGAGCAATTGGAGGCTCTACCCAATGATGACCAGTACTTAGGTCTCCCTTCAGATGACTCGGAGGATGAGGATTTTGATCCCAGTGCACCAGAACGTGTCAAAGAAG
TTGGGCAGGAAAGTTCAAGTTCTGATTTTACATCTGATTCTGAGGATCTAGCTGCACTGGGCGATAATGGGTCTTCTAAAGATGATAACCTTGTGTCTTCTTCAATAAAC
AGTACTATTCCCGTGAGAAACTCTAATGGGAAAAGTTCTGGTTGCGGTCCTAGCAAGAGTGCACTGCATAATGAGTTATCAAGTATACTAGAGTCGGGTCTCGATATGGA
TGGTCTTGAACCAGTTTCTGGAAGAAGACATGTGGAACGGTTAGATTACAAGAAGCTCCATGATGAAGCATATGGGAATGTTCATACCGACTCAAGTGATGACACATATG
GTAGTGATTCTATGGATTCAAGTGATGACAGAGGTCATGGTAGGAGTACAAGGAAGAGAAGTTCTAAAAACCCAGTTCCTGCATTGTCAAAAAATGGAACTAATAATTTG
AAAAACAAAAAAACTAAACGCAGTTCTAAGATAACTTGTCAAAAGCCAGTTGCTGAAAATACGGATAATTCTGTGGCTAGAACTCCAGACAACACTTTGAAATCTAGTTC
TTCTGTCAGACGAACTACTTCTTCATCACATAGAAGGCTCAGTCAACAAGCATTGGAGAGACTTCTTGCATCATTCCAAGAAAATCAGTATCCTGAGCGTGCAACAAAGG
AGAGTTTGGCAGAAGAACTGAGGCTCAGTCCGAAGCAGGTTAGTAAATGGTTTGAGAACACACGTTGGAGCACTCGCCATCCCTCCAGTGAAGCTAATAAAGCAAAGAGT
GCTTCAAGAACGGGGATTCAATCATCTCAGACAAGTGGAAAACTTCCTAAGCCTGAGCAAGAATCTGGTGCATGTTTCAGAGATAACGATAGCAATGGTGCTCAACATCA
AGAATCACCAAAGCCAATTAGTGTTGTGTTCCCATGTCAGAGTGGAGATGCAGGGGATGACAAATTGGCAAATCCAAAAACTAAAAGACCAGAATCTACTTCTGGAAAAT
CCAGAAAACGGAAGGGCAGGCCAGACCACACGGCATCTCGTTCGAAGGACAGAAAGGAATCACAGAAGCCTCCAGCGAAGTCACGAAAAGTTAATGAAACTCAAACAGCG
GATAAGGCTAGGACAAGGAGGAGGAAATCCATTTAGATTCTGAAGTTTTGATCGTTCCTGGAATTTTATTCGGCTACTTATCCTCTCTTCCAAGTACCACATTAGATCAG
ATGCTGGTGATTGGAGGTAATTCCATGAACAAATCCAATTGTTTGTAGTTCATTGTGATCACATTAAAATTCCTCTTTCTAATCCTTTACACATAAATCATTAGTTTTAT
TAGCTTAAATCTAGGAGCTTCTTGTTAAATATTTATTTACATTTTCAAGTGAATGTTCTCGTCAGACCCTTGTCTTGATATGTGTTAAACAATCAATATCCAGCTATCCA
TTTAGTTGTTGACATCGTTCTGTTAAACAGCTTAGTTTTTTGCTGTTTGAGGTCACATGCGAATTCATTTCAGCTGTGGCAGTAGTCCATTTCAATGCTCTGCAGGGTCA
TCTAATTTGATTTTTGGAAGCTATGCGAACTTCAACACCCCGAAAAAGGAAAAAAAAAGATAAAGAAGCCAAACAAATAGAACATGAGAGCTACGTAGCACCCTTCAGCC
TCGAG
Protein sequenceShow/hide protein sequence
MEERNGYTESRPINNSEAVQEAKACVEVEVFTCLSNEQMPSVPEYQESGITPDYTYQTGGSDEEKLEVQQKMEEENKELGLRNALHELSEKCNQATSTHQIGGSDEEKPV
VQQNMEEENKELDLGDALSELSEKNNQANSIHQTGGSNEEKRGVQLYMEESNKQCMEESNKQCMEESNKQNIEEANKQNMEEDNKELGLGDAVSELSEKSNHAISTHQTG
GLDEEKPRVQHYMEEANKHNMELGLGDALGELSEKSNQAISNHANNDQVEAGNLLSNDKDTEHLILPIEVETTDLLNECSEPPMEDDNKSYIEQMNPSIEDLTPNTSIQN
LEMVPSSSQQLDHEDRRILKSRKKNLLKSPVSTDRVLRSRTQEKSKDSEPSIDLKSKDSEPSNDLNNVTAGEERKRKRKKRSIKGEGARVDEFSSIRKHLRYLMNRINYE
QNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCREGRLSESLFDSEGQIDSEDIFCAKCGSDELSLENDIILCDGVCDRGFHQFCLEPPLLK
TDIPPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQSSDNALGLPSDDSEDGDYDPNVPDVPDAIDQEDESSSEKSSSDQSGYASASEQLEAL
PNDDQYLGLPSDDSEDEDFDPSAPERVKEVGQESSSSDFTSDSEDLAALGDNGSSKDDNLVSSSINSTIPVRNSNGKSSGCGPSKSALHNELSSILESGLDMDGLEPVSG
RRHVERLDYKKLHDEAYGNVHTDSSDDTYGSDSMDSSDDRGHGRSTRKRSSKNPVPALSKNGTNNLKNKKTKRSSKITCQKPVAENTDNSVARTPDNTLKSSSSVRRTTS
SSHRRLSQQALERLLASFQENQYPERATKESLAEELRLSPKQVSKWFENTRWSTRHPSSEANKAKSASRTGIQSSQTSGKLPKPEQESGACFRDNDSNGAQHQESPKPIS
VVFPCQSGDAGDDKLANPKTKRPESTSGKSRKRKGRPDHTASRSKDRKESQKPPAKSRKVNETQTADKARTRRRKSI