; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0017210 (gene) of Snake gourd v1 genome

Gene IDTan0017210
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionhomeobox protein HAT3.1
Genome locationLG01:6296594..6305950
RNA-Seq ExpressionTan0017210
SyntenyTan0017210
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0003677 - DNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001356 - Homeobox domain
IPR001965 - Zinc finger, PHD-type
IPR009057 - Homeobox-like domain superfamily
IPR011011 - Zinc finger, FYVE/PHD-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR017970 - Homeobox, conserved site
IPR019786 - Zinc finger, PHD-type, conserved site
IPR019787 - Zinc finger, PHD-finger


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600300.1 Homeobox protein HAZ1, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0075.82Show/hide
Query:  MSGGSGRQTVRVKAGQKSALFSLRAERRRRRRRRRTTTTTTTTTTTIAKNFTVSQNSINDHQQDPKPPSFTDNTMLFRQHQDSMDVTISTLGIPSLDVEV
        MSGGSG +TVRV+AGQ+  + ++    R  R RR     +      +++N +          Q  K   F D                            
Subjt:  MSGGSGRQTVRVKAGQKSALFSLRAERRRRRRRRRTTTTTTTTTTTIAKNFTVSQNSINDHQQDPKPPSFTDNTMLFRQHQDSMDVTISTLGIPSLDVEV

Query:  RASWLQTLTNIKKNKNRNNMEERDEYTESRSINNSEGVQEAKASVEVEVLTCLSNEQRHSVSDYQELGTTPDHTSKTGSPDEEKSGVQHNIEEENKELGL
                          NMEERDEYTESR+IN S  VQEAKA+VEVEVLT L+NEQ HS  +Y ELGT  D TSKTGSPDEEK GV+ N+EE+ KELGL
Subjt:  RASWLQTLTNIKKNKNRNNMEERDEYTESRSINNSEGVQEAKASVEVEVLTCLSNEQRHSVSDYQELGTTPDHTSKTGSPDEEKSGVQHNIEEENKELGL

Query:  GDVLSELSEKSNQTISNLADNDQVEAGNLSSSDKDTENLILPIEVETTTTLNECLELPTEDDTKNYIEQTNPSIEDLTQNASIQNLEMVPNNSQQLGHKD
        G+  S L E+S+QTIS LADNDQ EAGNL SSDKDTENLILPIEVET   LNEC E PTEDD KNYIEQ NP IED  QN SI+NL MVP+NS +LG KD
Subjt:  GDVLSELSEKSNQTISNLADNDQVEAGNLSSSDKDTENLILPIEVETTTTLNECLELPTEDDTKNYIEQTNPSIEDLTQNASIQNLEMVPNNSQQLGHKD

Query:  KRIRKSKNKNFMLRSFVSSDRVLRSRTQEKAKAPEPSNDLNNFTAGEEGKRKKKKRN--IKGKGTRIDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWK
        KR+ KSK KN++LRS VSSDRVLRSRTQ+KAKAPEPSNDL+N TAGEEGK KKKK+N  IKGKG R+DEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWK
Subjt:  KRIRKSKNKNFMLRSFVSSDRVLRSRTQEKAKAPEPSNDLNNFTAGEEGKRKKKKRN--IKGKGTRIDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWK

Query:  GFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGVCDRGFHLFCLEPPLLKSDI
        GFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALC+EGR SE+LFDSEGQIDSEDIFC KCGSKELSLENDIILCDGVCDRGFH FCLEPPLL SDI
Subjt:  GFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGVCDRGFHLFCLEPPLLKSDI

Query:  PPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDVPDTIDHGDESSSDGSSSDQSHSDKSGY-
        PPDDEGWLCPGCDCKDDC+DLLNEFQGSNLSITDGWEKV+PEAAAAAAGR+SDH + LPSDDS+DGDYDPDVPD ID   ESSSD SSSDQS SDKSGY 
Subjt:  PPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDVPDTIDHGDESSSDGSSSDQSHSDKSGY-

Query:  -ASASEELEVLANDDQYLGLPSDDSEDDDFDPSAPERDEGVGQENSSSDFTSDSEDLAALDDNGSSKDDNLMSSSLNNTMPARNSYGKSSGCGPSKSALH
         ASASEELE   NDDQYLGLPSDDSEDDD+DP AP RDEGV QE+SSSDFTSDSEDLAAL DNGSSKDDN+ SS LNNT+P RNS G+SSG GP+K+A H
Subjt:  -ASASEELEVLANDDQYLGLPSDDSEDDDFDPSAPERDEGVGQENSSSDFTSDSEDLAALDDNGSSKDDNLMSSSLNNTMPARNSYGKSSGCGPSKSALH

Query:  NELSSLIESGLDKDGLEPVSGRRHVERLDYKKLHDETYGNVPTDSSDDTYGTVSMDSSDDRGHGRSTRKRSPKNLVPALSKNGTDDLNNIKTKRSSKRTR
        N+LSSL+ SG D+ GLE VSGRRHVERLDYKKLHDET+GNVPTDSSDDTYG+ S+DSSDDRG GRSTRK SPKN VPALS+NGTDDL NIKTKRSSKRTR
Subjt:  NELSSLIESGLDKDGLEPVSGRRHVERLDYKKLHDETYGNVPTDSSDDTYGTVSMDSSDDRGHGRSTRKRSPKNLVPALSKNGTDDLNNIKTKRSSKRTR

Query:  QKPAGGNMNNSVTKTPKDTLKSSSSVRRTTSSSHRRLSQPALERLLTSFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSEGNKAKSASR
        QKPA  NM+NSVTKTP+ TLKSSSSVRRTTSSSHRRLSQP LERLL SFQENQYPERATKESLA+ELGLSLKQVSKWFENTRWSTRHPSSE NKAKS SR
Subjt:  QKPAGGNMNNSVTKTPKDTLKSSSSVRRTTSSSHRRLSQPALERLLTSFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSEGNKAKSASR

Query:  MGIQSSQTSGKLPKPEQESGACFRDTDSNGAQHQELLKAISVVAPCQSGDTGDDKLANQKTKRPESTAAKSRKRKGRSDHMASRSKDRKESQKRPAKSPK
        MG QSS+TS K PKPEQESGACFRD  SNGAQHQE  KAISVVAPCQSG TGDDKLANQK KRPESTA KSRKRKGRSD +ASRSKDRK+S+K PAKS K
Subjt:  MGIQSSQTSGKLPKPEQESGACFRDTDSNGAQHQELLKAISVVAPCQSGDTGDDKLANQKTKRPESTAAKSRKRKGRSDHMASRSKDRKESQKRPAKSPK

Query:  VNEIQTADK
        V+EIQTADK
Subjt:  VNEIQTADK

KAG7030959.1 Homeobox protein HAZ1 [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0083.11Show/hide
Query:  MEERDEYTESRSINNSEGVQEAKASVEVEVLTCLSNEQRHSVSDYQELGTTPDHTSKTGSPDEEKSGVQHNIEEENKELGLGDVLSELSEKSNQTISNLA
        MEERDEYTESR+IN S  VQEAKA+VEVEVLT L+NEQ HS  +Y ELGT  D TSKTGSPDEEK GV+ N+EE+ KELGLG+  S L E+S+QTIS LA
Subjt:  MEERDEYTESRSINNSEGVQEAKASVEVEVLTCLSNEQRHSVSDYQELGTTPDHTSKTGSPDEEKSGVQHNIEEENKELGLGDVLSELSEKSNQTISNLA

Query:  DNDQVEAGNLSSSDKDTENLILPIEVETTTTLNECLELPTEDDTKNYIEQTNPSIEDLTQNASIQNLEMVPNNSQQLGHKDKRIRKSKNKNFMLRSFVSS
        DNDQ EAGNL SSDKDTENLILPIEVET   LNEC E PTEDD KNYIEQ NP IED  QN SI+NL MVP+NS +LG KDKR+ +SK KN++LRS VSS
Subjt:  DNDQVEAGNLSSSDKDTENLILPIEVETTTTLNECLELPTEDDTKNYIEQTNPSIEDLTQNASIQNLEMVPNNSQQLGHKDKRIRKSKNKNFMLRSFVSS

Query:  DRVLRSRTQEKAKAPEPSNDLNNFTAGEEGKRKKKKRN--IKGKGTRIDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNE
        DRVLRSRTQ+KAKAPEPSNDL+N TAGEEGK KKKK+N  IKGKG R+DEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNE
Subjt:  DRVLRSRTQEKAKAPEPSNDLNNFTAGEEGKRKKKKRN--IKGKGTRIDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNE

Query:  IMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGVCDRGFHLFCLEPPLLKSDIPPDDEGWLCPGCDCKDDCM
        IMRRKLKIRDLFQRIDALC+EGR SE+LFDSEGQIDSEDIFC KCGSKELSLENDIILCDGVCDRGFH FCLEPPLL SDIPPDDEGWLCPGCDCKDDC+
Subjt:  IMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGVCDRGFHLFCLEPPLLKSDIPPDDEGWLCPGCDCKDDCM

Query:  DLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDVPDTIDHGDESSSDGSSSDQSHSDKSGY--ASASEELEVLANDDQYLG
        DLLNEFQGSNLSITDGWEKV+PEAAAAAAGR+SDH + LPSDDS+DGDYDPDVPD ID   ESSSD SSSDQS SDKSGY  ASASEELE   NDDQYLG
Subjt:  DLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDVPDTIDHGDESSSDGSSSDQSHSDKSGY--ASASEELEVLANDDQYLG

Query:  LPSDDSEDDDFDPSAPERDEGVGQENSSSDFTSDSEDLAALDDNGSSKDDNLMSSSLNNTMPARNSYGKSSGCGPSKSALHNELSSLIESGLDKDGLEPV
        LPSDDSEDDD+DP AP RDEGV QE+SSSDFTSDSEDLAAL DNGSSKDDN+ SS LNNT+P RNS G+SSG GP+K+A HN+LSSL+ SG D+ GLE V
Subjt:  LPSDDSEDDDFDPSAPERDEGVGQENSSSDFTSDSEDLAALDDNGSSKDDNLMSSSLNNTMPARNSYGKSSGCGPSKSALHNELSSLIESGLDKDGLEPV

Query:  SGRRHVERLDYKKLHDETYGNVPTDSSDDTYGTVSMDSSDDRGHGRSTRKRSPKNLVPALSKNGTDDLNNIKTKRSSKRTRQKPAGGNMNNSVTKTPKDT
        SGRRHVERLDYKKLHDET+GNVPTDSSDDTYG+ S+DSSDDRG GRSTRK SPKN VPALS+NGTDDL NIKTKRSSKRTRQKPA  NM+NSVTKTP+ T
Subjt:  SGRRHVERLDYKKLHDETYGNVPTDSSDDTYGTVSMDSSDDRGHGRSTRKRSPKNLVPALSKNGTDDLNNIKTKRSSKRTRQKPAGGNMNNSVTKTPKDT

Query:  LKSSSSVRRTTSSSHRRLSQPALERLLTSFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSEGNKAKSASRMGIQSSQTSGKLPKPEQES
        LKSSSSVRRTTSSSHRRLSQP LERLL SFQENQYPERATKESLA+ELGLSLKQVSKWFENTRWSTRHPSSE NKAKS SRMG QSSQTS K PKPEQES
Subjt:  LKSSSSVRRTTSSSHRRLSQPALERLLTSFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSEGNKAKSASRMGIQSSQTSGKLPKPEQES

Query:  GACFRDTDSNGAQHQELLKAISVVAPCQSGDTGDDKLANQKTKRPESTAAKSRKRKGRSDHMASRSKDRKESQKRPAKSPKVNEIQTADKKIKTRRRKSI
        GACFRD  SNGAQHQE  KAISVVAPCQSG TGDDKLANQK KRPESTA KSRKRKGRSD +ASRSKDRK+S+K PAKS KV+EIQTAD K+K RRRKS+
Subjt:  GACFRDTDSNGAQHQELLKAISVVAPCQSGDTGDDKLANQKTKRPESTAAKSRKRKGRSDHMASRSKDRKESQKRPAKSPKVNEIQTADKKIKTRRRKSI

XP_022942376.1 homeobox protein HAT3.1 [Cucurbita moschata]0.0e+0082.72Show/hide
Query:  MEERDEYTESRSINNSEGVQEAKASVEVEVLTCLSNEQRHSVSDYQELGTTPDHTSKTGSPDEEKSGVQHNIEEENKELGLGDVLSELSEKSNQTISNLA
        MEERDEYTESR+IN S  VQEAKASVEVEVLT L+NEQ HS  +Y ELGT  D TSKTGSPDEEK GV+ N+EE+ KELGLG+    L EKS+QTIS LA
Subjt:  MEERDEYTESRSINNSEGVQEAKASVEVEVLTCLSNEQRHSVSDYQELGTTPDHTSKTGSPDEEKSGVQHNIEEENKELGLGDVLSELSEKSNQTISNLA

Query:  DNDQVEAGNLSSSDKDTENLILPIEVETTTTLNECLELPTEDDTKNYIEQTNPSIEDLTQNASIQNLEMVPNNSQQLGHKDKRIRKSKNKNFMLRSFVSS
        DNDQ EAGNL SSDKDTENLILPIEVETT  LNEC E PTED+ KNYIEQ NP IED  QN SI NL MVP+NS ++G KDKR+ KSK KN++LRS +SS
Subjt:  DNDQVEAGNLSSSDKDTENLILPIEVETTTTLNECLELPTEDDTKNYIEQTNPSIEDLTQNASIQNLEMVPNNSQQLGHKDKRIRKSKNKNFMLRSFVSS

Query:  DRVLRSRTQEKAKAPEPSNDLNNFTAGEEGKRKKKKRNIKGKGTRIDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIM
        DRVLRSRTQ+KAKAPEPSNDL+N TAGEEGK KKK R IKGKG R+DEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIM
Subjt:  DRVLRSRTQEKAKAPEPSNDLNNFTAGEEGKRKKKKRNIKGKGTRIDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIM

Query:  RRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGVCDRGFHLFCLEPPLLKSDIPPDDEGWLCPGCDCKDDCMDL
        RRKLKIRDLFQRIDALC+EGR SE+LFDSEGQIDSEDIFC KCGSKELSLENDIILCDGVCDRGFH FCLEPPLL SDIPPDDEGWLCPGCDCKDDC+DL
Subjt:  RRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGVCDRGFHLFCLEPPLLKSDIPPDDEGWLCPGCDCKDDCMDL

Query:  LNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDVPDTID-----HGDESSSDGSSSDQSHSDKSGY--ASASEELEVLANDDQ
        LNEFQGSNLSITDGWEKV+PEAAAAAAG++SDH + LPSDDS+DGDYDPDVPD ID       D SSSD SSSD S SDKSGY  ASASEELE   NDDQ
Subjt:  LNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDVPDTID-----HGDESSSDGSSSDQSHSDKSGY--ASASEELEVLANDDQ

Query:  YLGLPSDDSEDDDFDPSAPERDEGVGQENSSSDFTSDSEDLAALDDNGSSKDDNLMSSSLNNTMPARNSYGKSSGCGPSKSALHNELSSLIESGLDKDGL
        YLGLPSDDSEDDD+DP AP RDEGVGQE+SSSDFTSDSEDLAAL DNGSSKDDN+ SS LNNT+P RNS G+SSG GP+K+A HN+LSSL+ SG D+ GL
Subjt:  YLGLPSDDSEDDDFDPSAPERDEGVGQENSSSDFTSDSEDLAALDDNGSSKDDNLMSSSLNNTMPARNSYGKSSGCGPSKSALHNELSSLIESGLDKDGL

Query:  EPVSGRRHVERLDYKKLHDETYGNVPTDSSDDTYGTVSMDSSDDRGHGRSTRKRSPKNLVPALSKNGTDDLNNIKTKRSSKRTRQKPAGGNMNNSVTKTP
        E VSGRRHVERLDYKKLHDET+GNVPTDSSDDTYG+ S+DSSDDRG GRSTRK SPKN VPALS+NGTDDL NIKTKRSSKRTRQKPA  NM+NSVTKTP
Subjt:  EPVSGRRHVERLDYKKLHDETYGNVPTDSSDDTYGTVSMDSSDDRGHGRSTRKRSPKNLVPALSKNGTDDLNNIKTKRSSKRTRQKPAGGNMNNSVTKTP

Query:  KDTLKSSSSVRRTTSSSHRRLSQPALERLLTSFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSEGNKAKSASRMGIQSSQTSGKLPKPE
        + TLKSSSSVRRTTSSSHRRLSQP LERLL SFQENQYPERATKESLA+ELGLSLKQVSKWFENTRWSTRHPSSE NKAKSASRMG QSSQTS K PKPE
Subjt:  KDTLKSSSSVRRTTSSSHRRLSQPALERLLTSFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSEGNKAKSASRMGIQSSQTSGKLPKPE

Query:  QESGACFRDTDSNGAQHQELLKAISVVAPCQSGDTGDDKLANQKTKRPESTAAKSRKRKGRSDHMASRSKDRKESQKRPAKSPKVNEIQTADKKIKTRRR
        QESGACFRDT SNGAQHQE  KAISVVAPCQSG TGDDKLANQK KRPES A KSRKRKGRSD +ASRSKDRK+S+K PAKS KV+EIQTAD K+K RRR
Subjt:  QESGACFRDTDSNGAQHQELLKAISVVAPCQSGDTGDDKLANQKTKRPESTAAKSRKRKGRSDHMASRSKDRKESQKRPAKSPKVNEIQTADKKIKTRRR

Query:  KSI
        KS+
Subjt:  KSI

XP_022979028.1 homeobox protein HAT3.1-like [Cucurbita maxima]0.0e+0082.56Show/hide
Query:  MEERDEYTESRSINNSEGVQEAKASVEVEVLTCLSNEQRHSVSDYQELGTTPDHTSKTGSPDEEKSGVQHNIEEENKELGLGDVLSELSEKSNQTISNLA
        MEERDEYTESR+I+ S  VQEAKASVEVEVLT L+NEQ  S  +Y ELGT  D TSKTGSPDEEK GV+ N+EE++KEL LG+  SEL EKS+QTIS LA
Subjt:  MEERDEYTESRSINNSEGVQEAKASVEVEVLTCLSNEQRHSVSDYQELGTTPDHTSKTGSPDEEKSGVQHNIEEENKELGLGDVLSELSEKSNQTISNLA

Query:  DNDQVEAGNLSSSDKDTENLILPIEVETTTTLNECLELPTEDDTKNYIEQTNPSIEDLTQNASIQNLEMVPNNSQQLGHKDKRIRKSKNKNFMLRSFVSS
        +NDQ EAGNL SSDKDTENLILPIEVETT  LNEC E PTEDD KNYIEQ NP IE   QN SI+NL MVP+NS +LG KDKR+ KSK KN++LRS VSS
Subjt:  DNDQVEAGNLSSSDKDTENLILPIEVETTTTLNECLELPTEDDTKNYIEQTNPSIEDLTQNASIQNLEMVPNNSQQLGHKDKRIRKSKNKNFMLRSFVSS

Query:  DRVLRSRTQEKAKAPEPSNDLNNFTAGEEGKRKKKKRN--IKGKGTRIDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNE
        DRVLRSRTQ+KAKAPEPSNDL+N TAGEEGK KK+K+N  IKGKG R+DEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNE
Subjt:  DRVLRSRTQEKAKAPEPSNDLNNFTAGEEGKRKKKKRN--IKGKGTRIDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNE

Query:  IMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGVCDRGFHLFCLEPPLLKSDIPPDDEGWLCPGCDCKDDCM
        IMRRKLKIRDLFQRIDALC+EGR SE+LFDSEGQIDSEDIFC KCGSKELSLENDIILCDGVCDRGFH FCLEPPLL SDIPPDDEGWLCPGCDCKDDC+
Subjt:  IMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGVCDRGFHLFCLEPPLLKSDIPPDDEGWLCPGCDCKDDCM

Query:  DLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDVPDTIDHGDESSSDGSSSDQSHSDKSGY--ASASEELEVLANDDQYLG
        DLLNEFQGSNLSITDGWEKV+PEAAAAAAGR+SDH + LPSDDS+DGDYDPDVPD ID   ESSSD SSSDQS SDKSGY  ASASEELE   NDDQYLG
Subjt:  DLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDVPDTIDHGDESSSDGSSSDQSHSDKSGY--ASASEELEVLANDDQYLG

Query:  LPSDDSEDDDFDPSAPERDEGVGQENSSSDFTSDSEDLAALDDNGSSKDDNLMSSSLNNTMPARNSYGKSSGCGPSKSALHNELSSLIESGLDKDGLEPV
        LPSDDSEDDD+DP AP RDEGVGQE+SSSDFTSDSEDLAAL DNGSSKDDN+ SS LNNT P RNS G+SSG GP+K+A HN+LSSL+ SG D+ GLE V
Subjt:  LPSDDSEDDDFDPSAPERDEGVGQENSSSDFTSDSEDLAALDDNGSSKDDNLMSSSLNNTMPARNSYGKSSGCGPSKSALHNELSSLIESGLDKDGLEPV

Query:  SGRRHVERLDYKKLHDETYGNVPTDSSDDTYGTVSMDSSDDRGHGRSTRKRSPKNLVPALSKNGTDDLNNIKTKRSSKRTRQKPAGGNMNNSVTKTPKDT
        SGRRHVERLDYKKLHDET+GNVP++SSDDTYG+ S+DSSDDRG GRSTRK SPKNLVPALS+NGTDD  NIKTK SS+RTRQKPA  NM+NSVTKTP+ T
Subjt:  SGRRHVERLDYKKLHDETYGNVPTDSSDDTYGTVSMDSSDDRGHGRSTRKRSPKNLVPALSKNGTDDLNNIKTKRSSKRTRQKPAGGNMNNSVTKTPKDT

Query:  LKSSSSVRRTTSSSHRRLSQPALERLLTSFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSEGNKAKSASRMGIQSSQTSGKLPKPEQES
        LKSSSSVRRTTSSSHRRLSQP LERLL SFQENQYPERATKESLA+ELGLSLKQVSKWFENTRWSTRHPSSE NKAKSASRMG QSSQTS K PKPEQES
Subjt:  LKSSSSVRRTTSSSHRRLSQPALERLLTSFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSEGNKAKSASRMGIQSSQTSGKLPKPEQES

Query:  GACFRDTDSNGAQHQELLKAISVVAPCQSGDTGDDKLANQKTKRPESTAAKSRKRKGRSDHMASRSKDRKESQKRPAKSPKVNEIQTADKKIKTRRRKSI
        GACFRDT SNGAQHQE  KAI+VVAPCQSG TGDDKLA  KTKRPESTA KSRKRKGRSD +ASRSK+RK+S+K PAKS KV+EIQTAD K+K RRRKS+
Subjt:  GACFRDTDSNGAQHQELLKAISVVAPCQSGDTGDDKLANQKTKRPESTAAKSRKRKGRSDHMASRSKDRKESQKRPAKSPKVNEIQTADKKIKTRRRKSI

XP_023531864.1 homeobox protein HAT3.1 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0082.98Show/hide
Query:  MEERDEYTESRSINNSEGVQEAKASVEVEVLTCLSNEQRHSVSDYQELGTTPDHTSKTGSPDEEKSGVQHNIEEENKELGLGDVLSELSEKSNQTISNLA
        MEERDEYTESR+IN S  VQEAKASVEVEVLT L+NEQ HS  +Y ELGT  D TSKTGSPDEEK GV+ N+EE+ +ELGLG+  S L EKS+QTIS LA
Subjt:  MEERDEYTESRSINNSEGVQEAKASVEVEVLTCLSNEQRHSVSDYQELGTTPDHTSKTGSPDEEKSGVQHNIEEENKELGLGDVLSELSEKSNQTISNLA

Query:  DNDQVEAGNLSSSDKDTENLILPIEVETTTTLNECLELPTEDDTKNYIEQTNPSIEDLTQNASIQNLEMVPNNSQQLGHKDKRIRKSKNKNFMLRSFVSS
        DNDQ EAGNL SSDKDTENLILPIEVETT  LNECLE PTEDD KNYIEQ NP IED  QN  I+NL MVP+NS +LG KDKR+ KSK KN++LRS VSS
Subjt:  DNDQVEAGNLSSSDKDTENLILPIEVETTTTLNECLELPTEDDTKNYIEQTNPSIEDLTQNASIQNLEMVPNNSQQLGHKDKRIRKSKNKNFMLRSFVSS

Query:  DRVLRSRTQEKAKAPEPSNDLNNFTAGEEGKRKKKKRN--IKGKGTRIDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNE
        DRVLRSRTQ+KAKAPEPSNDL+N TAGEEGK KKKK+N  IKGKG R+DEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNE
Subjt:  DRVLRSRTQEKAKAPEPSNDLNNFTAGEEGKRKKKKRN--IKGKGTRIDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNE

Query:  IMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGVCDRGFHLFCLEPPLLKSDIPPDDEGWLCPGCDCKDDCM
        IMRRKLKIRDLFQRIDALC+EGR SE+LFDSEGQIDSEDIFC KCGSKELSLENDIILCDGVCDRGFH FCLEPPLL SDIPPDDEGWLCPGCDCKDDC+
Subjt:  IMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGVCDRGFHLFCLEPPLLKSDIPPDDEGWLCPGCDCKDDCM

Query:  DLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDVPDTIDHGDESSSDGSSSDQ-----SHSDKSGY--ASASEELEVLAND
        DLLNEFQGSNLSITDGWEKV+PEAAAAAAGR+SDH + LPSDDS+DGDYDPDVPD ID   ESSSD SSSDQ     S SDKSGY  ASASEELE   ND
Subjt:  DLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDVPDTIDHGDESSSDGSSSDQ-----SHSDKSGY--ASASEELEVLAND

Query:  DQYLGLPSDDSEDDDFDPSAPERDEGVGQENSSSDFTSDSEDLAALDDNGSSKDDNLMSSSLNNTMPARNSYGKSSGCGPSKSALHNELSSLIESGLDKD
        DQYLGLPSDDSEDDD+DP AP RDEGVGQE+SSSDFTSDSEDLAAL +NGSSKDDN+ SS LNNT+P RNS G+SSG GP+KSA HN+LSSL+ SG D+ 
Subjt:  DQYLGLPSDDSEDDDFDPSAPERDEGVGQENSSSDFTSDSEDLAALDDNGSSKDDNLMSSSLNNTMPARNSYGKSSGCGPSKSALHNELSSLIESGLDKD

Query:  GLEPVSGRRHVERLDYKKLHDETYGNVPTDSSDDTYGTVSMDSSDDRGHGRSTRKRSPKNLVPALSKNGTDDLNNIKTKRSSKRTRQKPAGGNMNNSVTK
        GLE VSGRRHVERLDYKKLHDET+GNVPTDSSDDTYG+ S+DSSDDRG GRSTRK SPKN VPALS+NGTDDL NIKTK SSKRTRQKPA  NM+NSVTK
Subjt:  GLEPVSGRRHVERLDYKKLHDETYGNVPTDSSDDTYGTVSMDSSDDRGHGRSTRKRSPKNLVPALSKNGTDDLNNIKTKRSSKRTRQKPAGGNMNNSVTK

Query:  TPKDTLKSSSSVRRTTSSSHRRLSQPALERLLTSFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSEGNKAKSASRMGIQSSQTSGKLPK
        TP+ TLKSSSSVRRTTSSSHRRLSQP LERLL SFQENQYPERATKESLA+ELGLSLKQVSKWFENTRWSTRHPSSE NKA+SASRMG QSSQTS K PK
Subjt:  TPKDTLKSSSSVRRTTSSSHRRLSQPALERLLTSFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSEGNKAKSASRMGIQSSQTSGKLPK

Query:  PEQESGACFRDTDSNGAQHQELLKAISVVAPCQSGDTGDDKLANQKTKRPESTAAKSRKRKGRSDHMASRSKDRKESQKRPAKSPKVNEIQTADKKIKTR
        PEQESGACFRDT SNGAQHQE  KAISVVAPCQSG TGDDK ANQ+TKRPESTA KSRKRKGRSD +ASRSKDRK+S+K PAKS KV+EIQTAD K+K R
Subjt:  PEQESGACFRDTDSNGAQHQELLKAISVVAPCQSGDTGDDKLANQKTKRPESTAAKSRKRKGRSDHMASRSKDRKESQKRPAKSPKVNEIQTADKKIKTR

Query:  RRKSI
        RRKS+
Subjt:  RRKSI

TrEMBL top hitse value%identityAlignment
A0A1S3C283 pathogenesis-related homeodomain protein0.0e+0078.44Show/hide
Query:  KKNKNRNNMEERDEYTESRSINN--SEGVQEAKASVEVEVLTCLSNEQRHSVSDYQELGTTPDHTSKTGSPDEEKSGVQHNIEEENKELGLGDVLSELSE
        ++ K  +NMEERDE T++ S  N  +E VQEAKASVEVEV TCLSNE  +  S YQELGTTP+ + KT  PDEEK+GVQ     +N ELG G +LSELSE
Subjt:  KKNKNRNNMEERDEYTESRSINN--SEGVQEAKASVEVEVLTCLSNEQRHSVSDYQELGTTPDHTSKTGSPDEEKSGVQHNIEEENKELGLGDVLSELSE

Query:  KSNQTISNLADNDQVEAGNLSSSDKDTENLILPIEVETTTTLNECLELPTEDDTKNYIEQTNPSIEDLTQNASIQNLEMVPNNSQQLGHKDKRIRKSKNK
        K NQTISN ADNDQVEAGN  S DKDT+NL L IE ETTT LNEC ELP ED TKNYIE+ NP IEDLTQ  SIQ+LE +P+NSQQL HKD+R  KSK K
Subjt:  KSNQTISNLADNDQVEAGNLSSSDKDTENLILPIEVETTTTLNECLELPTEDDTKNYIEQTNPSIEDLTQNASIQNLEMVPNNSQQLGHKDKRIRKSKNK

Query:  NFMLRSFVSSDRVLRSRTQEKAKAPEPSNDLNNFTAGEEGKR-KKKKRNIKGKGTRIDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPE
        N+ LRS VSSDRVLRSRTQEKAKAPEPSNDLNNFTA EEGKR KKKKRNI+GKG R+DE+SSIRNHLRYL+NRI+YEQSLIEAYSSEGWKGFSSDKLKPE
Subjt:  NFMLRSFVSSDRVLRSRTQEKAKAPEPSNDLNNFTAGEEGKR-KKKKRNIKGKGTRIDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPE

Query:  KELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGVCDRGFHLFCLEPPLLKSDIPPDDEGWLCP
        KELQRASNEIMRRKLKIRDLFQRID LCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDG+CDRGFH FCLEPPLL +DIPPDDEGWLCP
Subjt:  KELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGVCDRGFHLFCLEPPLLKSDIPPDDEGWLCP

Query:  GCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDVPDTIDHGDESSSDGSSSDQSHSDKSGYASASEELEVLA
        GCDCKDDC+DLLNEFQGSNLSITDGWEKVYPE AAAAAGRNSD  LGLPSDDSEDGDYDPD+PDTID  +E SSD SSSDQS+SD SGYASASE LEV  
Subjt:  GCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDVPDTIDHGDESSSDGSSSDQSHSDKSGYASASEELEVLA

Query:  NDDQYLGLPSDDSEDDDFDPSAPERDEGVGQENSSSDFTSDSEDLAALDDNGSSKDDNLMSSSLNNTMPARNSYGKSSGCGPSKSALHNELSSLIESGLD
        NDDQYLGLPSDDSED+D+DPS PE DEG  QE+SSSDFTSDSEDLAAL++N SSKDD+L+ SSLNNT+P +N+ G+SS  GPSKS LHNELSSL++SGLD
Subjt:  NDDQYLGLPSDDSEDDDFDPSAPERDEGVGQENSSSDFTSDSEDLAALDDNGSSKDDNLMSSSLNNTMPARNSYGKSSGCGPSKSALHNELSSLIESGLD

Query:  KDGLEPVSGRRHVERLDYKKLHDETYGNVPTDSSDDTYGTVSMDSSDDRGHGRSTRKRSPKNLVPALSKNGT-DDLNNIKTKRSSK-RTRQKPAGGNMNN
        KDGLEP+SGRR VERLDYKKLHDETYGNVPT+SSDDTYG+ ++DSSDDRG    TRKR PK LV ALS NG+ DDL N+KTKRS K RTRQKP   N+NN
Subjt:  KDGLEPVSGRRHVERLDYKKLHDETYGNVPTDSSDDTYGTVSMDSSDDRGHGRSTRKRSPKNLVPALSKNGT-DDLNNIKTKRSSK-RTRQKPAGGNMNN

Query:  SVTKTPKDTLKSSSSVRRTTSSSHRRLSQPALERLLTSFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSEGNKAKSASRMGIQSSQTSG
        SVT+TP DT KSSSSVR+ TSSS+RRLSQPALERL  SFQEN+YP+RATKESLAQELGL+LKQVSKWFENTRWSTRHPSS G KAKS+SRM I  SQ SG
Subjt:  SVTKTPKDTLKSSSSVRRTTSSSHRRLSQPALERLLTSFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSEGNKAKSASRMGIQSSQTSG

Query:  KLPKPEQESGACFRDTDSNGAQHQELLKAISVVAPCQSGDTGDDKLANQKTKRPESTAAKSRKRKGRSDHMASRSKDRKESQKRPAKSPKVNEIQTADKK
        +L K EQES  CFRDTDSNGA+HQ+L  A SVVA CQSGDTGD KL  +KTKR ES+A KSRKRKGRSD+ AS SKDR+ S + PAKSPKVNE QTAD +
Subjt:  KLPKPEQESGACFRDTDSNGAQHQELLKAISVVAPCQSGDTGDDKLANQKTKRPESTAAKSRKRKGRSDHMASRSKDRKESQKRPAKSPKVNEIQTADKK

Query:  IKTRRRKSI
         KTRRR+SI
Subjt:  IKTRRRKSI

A0A6J1D6Q5 homeobox protein HAT3.1 isoform X10.0e+0080.51Show/hide
Query:  MEERDEYTESRSINNSEGVQEAKASVEVEVLTCLSNEQRHSVSDYQELGTTPDHTSKTGSPDEEKSGVQHNIEEENKELGLGDVLSELSEKSNQTISNLA
        MEER EYTE R  NN E VQEAKAS  VEVLTC SNEQ HS+ D QELGTTP+ TSKT  PD+EKSGVQ N+EEE KELG GDVLSEL EK+NQTIS LA
Subjt:  MEERDEYTESRSINNSEGVQEAKASVEVEVLTCLSNEQRHSVSDYQELGTTPDHTSKTGSPDEEKSGVQHNIEEENKELGLGDVLSELSEKSNQTISNLA

Query:  DNDQVEAGNLSSSDKDTENLILPIEVETTTTLNECLELPTEDDTKNYIEQTNPSIEDLTQNASIQNLEMVP----NNSQQLGHKDKRIRKSKNKNFMLRS
        + DQVEAGNL SSD +TENLILPIE+E TTTLNEC ELP ED  KN I+Q NP IEDLTQN SIQ LE VP    + SQQLGHKDK+I KSK KN+MLRS
Subjt:  DNDQVEAGNLSSSDKDTENLILPIEVETTTTLNECLELPTEDDTKNYIEQTNPSIEDLTQNASIQNLEMVP----NNSQQLGHKDKRIRKSKNKNFMLRS

Query:  FVSSDRVLRSRTQEKAKAPEPSNDLNNFTAGEEGKRKKKKRNIKGKGTRIDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRAS
         VSSDRVLRSRTQEKAKAPEPSN+LN  TAG EGKRKKKKRNIKGKG   DEFSSIRN LRYLVNRIKYEQSLI+AYSSEGWKGFSSDKLKPEKELQRAS
Subjt:  FVSSDRVLRSRTQEKAKAPEPSNDLNNFTAGEEGKRKKKKRNIKGKGTRIDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRAS

Query:  NEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGVCDRGFHLFCLEPPLLKSDIPPDDEGWLCPGCDCKDD
        +EIMR KLKIRDLFQ +D+LCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDG+CDRGFH FCLEPPLL +DIPPDDEGWLCPGCDCKDD
Subjt:  NEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGVCDRGFHLFCLEPPLLKSDIPPDDEGWLCPGCDCKDD

Query:  CMDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDVPDTIDHGDESSSDGSSSDQSHSDKSGYASASEELEVLANDDQYLG
        C+DLLNEFQGSNLSITDGWEKVYPEAAAAAAG+NSDHALGLPSDDSEDGDYDPD PDTI+  DE     SSSDQS SD+SGYASASEELE   NDDQYLG
Subjt:  CMDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDVPDTIDHGDESSSDGSSSDQSHSDKSGYASASEELEVLANDDQYLG

Query:  LPSDDSEDDDFDPSAPERDEGVGQENSSSDFTSDSEDLAALDDNGSSKDDNLMSSSLNNTMPARNSYGKSSGCGPSKSALHNELSSLIESGLDKDGLEPV
        LPSDDSEDDD++P APE DEGV QE+S SDFTSDSEDLAALDD                T P RNS G+ SGCGP  S LHNEL SL+ESG DKDGLEPV
Subjt:  LPSDDSEDDDFDPSAPERDEGVGQENSSSDFTSDSEDLAALDDNGSSKDDNLMSSSLNNTMPARNSYGKSSGCGPSKSALHNELSSLIESGLDKDGLEPV

Query:  SGRRHVERLDYKKLHDETYGNVPTDSSDDTYGTVSMDSSDDRGHGRSTRKRSPKNLVPALSKNGT-DDLNNIKTKRSSK-RTRQKPAGGNMNNSVTKTPK
        SGRR VERLDYKKLHDETYGNVP+DSSDDT+G++S+DSSDDRG G  TRKRSPKNLVPAL  NGT DDL N KTKRS K RT QKP   NM NSVT+TP+
Subjt:  SGRRHVERLDYKKLHDETYGNVPTDSSDDTYGTVSMDSSDDRGHGRSTRKRSPKNLVPALSKNGT-DDLNNIKTKRSSK-RTRQKPAGGNMNNSVTKTPK

Query:  DTLKSSSSVRRTTSSSHRRLSQPALERLLTSFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSS-EGNKAKSASRMGIQSSQTSGKLPKPE
        D++KSSSSVRRT SSS+RRLSQPALERLL SFQENQYP+RATKESLAQELGLSLKQVSKWFENTRWSTRHPSS E NKAKSA RMGIQSS+TSGKLPKPE
Subjt:  DTLKSSSSVRRTTSSSHRRLSQPALERLLTSFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSS-EGNKAKSASRMGIQSSQTSGKLPKPE

Query:  QESGACFRDTDSNGAQHQELLKAISVVAPCQSGDTGDDKLANQKTKRPESTAAKSRKRKGRSDHMASRSKDRKESQKRPAKSPKVNEIQTADKKIKTRRR
        QESGACFRDTD+NGAQHQ        VAPCQSGDT DDKLA QKT RPESTA KSRKRKGRSDH+AS SKDRKESQK PAKSPKVN+IQTAD K++TRRR
Subjt:  QESGACFRDTDSNGAQHQELLKAISVVAPCQSGDTGDDKLANQKTKRPESTAAKSRKRKGRSDHMASRSKDRKESQKRPAKSPKVNEIQTADKKIKTRRR

Query:  KSI
        +SI
Subjt:  KSI

A0A6J1E4I6 homeobox protein HAT3.1-like0.0e+0077.33Show/hide
Query:  MEERDEYTESRSINNSEGVQEAKASVEVEVLTCLSNEQRHSVSDYQELGTTPDHTSKTGSPDEEKSGVQHNIEEENKELGLGDVLSELSEKSNQTISNLA
        MEERDEYTESRS NN+E VQEAK SVE E+ TCLSNEQ+HSV DY EL  TP +++KTG  DEEK  VQ N+EEEN+ELG GDVL ELSEK NQT SNLA
Subjt:  MEERDEYTESRSINNSEGVQEAKASVEVEVLTCLSNEQRHSVSDYQELGTTPDHTSKTGSPDEEKSGVQHNIEEENKELGLGDVLSELSEKSNQTISNLA

Query:  DNDQVEAGNLSSSDKDTENLILPIEVETTTTLNECLELPTEDDTKNYIEQTNPSIEDLTQNASIQNLEMVPNNSQQLGHKDKRIRKSKNKNFMLRSFVSS
        DNDQVEAGNL   DKDTENLI+PIEVETTT L +C ELP E   KNYIEQ NP  E LTQN   QNLE VP+NS+Q  HKDKRI KS   N +LRS VSS
Subjt:  DNDQVEAGNLSSSDKDTENLILPIEVETTTTLNECLELPTEDDTKNYIEQTNPSIEDLTQNASIQNLEMVPNNSQQLGHKDKRIRKSKNKNFMLRSFVSS

Query:  DRVLRSRTQEKAKAPEPSNDLNNFTAGEEGKRKKKKRNIKGKGTRIDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIM
        DR LRS+TQEK K PEPSNDLNNFTA EEGK KKK+RNI+GKG R+DEFSSIRNHLRYL+NRIKYEQ+LIEAYSSEGWKGFSSDKLKPEKELQRASNEIM
Subjt:  DRVLRSRTQEKAKAPEPSNDLNNFTAGEEGKRKKKKRNIKGKGTRIDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIM

Query:  RRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGVCDRGFHLFCLEPPLLKSDIPPDDEGWLCPGCDCKDDCMDL
        RRKLKIRD+FQRIDALC EG LS+SLFDS+GQIDSEDIFCAKCGSKELS ENDIILCDG+CDRGFH FCLEPPLL +DIPPDDEGWLCPGCDCKDDC++L
Subjt:  RRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGVCDRGFHLFCLEPPLLKSDIPPDDEGWLCPGCDCKDDCMDL

Query:  LNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDS-EDGDYDPDVPDTIDHGDESSSDGSSSDQSHSDKSGYASASEELEVLANDDQYLGLPS
        LNEFQGS LSITDGWEKVYPEAAA+AAGRN DHA GLPSDDS +D DYDPDVPDTI   DESS           + SGYASASEELE   N DQYLGLPS
Subjt:  LNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDS-EDGDYDPDVPDTIDHGDESSSDGSSSDQSHSDKSGYASASEELEVLANDDQYLGLPS

Query:  DDSEDDDFDPSAPERDEGVGQENSSSDFTSDSEDLAALDDNGSSKDDNLMSSSLNNTMPARNSYGKSSGCGPSKSALHNELSSLIESGLDKDGLEPVSGR
        DDSEDDD+DPSAPERDE V QE+SSSDFTSDSEDLAALD N SSK DNL+S SLNNT   +N  G+SSG GP KSAL+NELSSL+ESG DKDG EPV GR
Subjt:  DDSEDDDFDPSAPERDEGVGQENSSSDFTSDSEDLAALDDNGSSKDDNLMSSSLNNTMPARNSYGKSSGCGPSKSALHNELSSLIESGLDKDGLEPVSGR

Query:  RHVERLDYKKLHDETYGNVPTDSSDDTYGTVSMDSSDDRGHGRSTRKRSPKNLVPALSKNGT-DDLNNIKTKRSSKR-TRQKPAGGNMNNSVTKTPKDTL
        R VERLDYKKLHDETYGNVPTDSSDDTY +VSMDSSDD+G   +TRKRSPK LV AL    T DDL NIKTK SSKR TRQK    NMN SV+KTP+DT 
Subjt:  RHVERLDYKKLHDETYGNVPTDSSDDTYGTVSMDSSDDRGHGRSTRKRSPKNLVPALSKNGT-DDLNNIKTKRSSKR-TRQKPAGGNMNNSVTKTPKDTL

Query:  KSSSSVRRTTSSSHRRLSQPALERLLTSFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSS-EGNKAKSASRMGIQSSQTSGKLPKPEQES
        K+SSSVRRTT SS+RRLS+ ALERLL SFQENQYPERATKESLAQELGLS+KQVSKWF NTRWSTRHPSS EGNKAKS+SRMGI SSQ SG+L +PEQE 
Subjt:  KSSSSVRRTTSSSHRRLSQPALERLLTSFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSS-EGNKAKSASRMGIQSSQTSGKLPKPEQES

Query:  GACFRDTDSNGAQHQELLKAISVVAPCQSGDTGDDKLANQKTKRPESTAAKSRKRKGRSDHMASRSKDRKESQKRPAKSPKVNEIQTADKKIKTRRRKSI
                  GAQHQEL  A SVVAPCQSGDTGD KLA Q+TKR E +A KSRKRKGRSDH AS SKD KESQ+ PAKSPKVNEIQTA   IKTRRR S+
Subjt:  GACFRDTDSNGAQHQELLKAISVVAPCQSGDTGDDKLANQKTKRPESTAAKSRKRKGRSDHMASRSKDRKESQKRPAKSPKVNEIQTADKKIKTRRRKSI

A0A6J1FNP3 homeobox protein HAT3.10.0e+0082.72Show/hide
Query:  MEERDEYTESRSINNSEGVQEAKASVEVEVLTCLSNEQRHSVSDYQELGTTPDHTSKTGSPDEEKSGVQHNIEEENKELGLGDVLSELSEKSNQTISNLA
        MEERDEYTESR+IN S  VQEAKASVEVEVLT L+NEQ HS  +Y ELGT  D TSKTGSPDEEK GV+ N+EE+ KELGLG+    L EKS+QTIS LA
Subjt:  MEERDEYTESRSINNSEGVQEAKASVEVEVLTCLSNEQRHSVSDYQELGTTPDHTSKTGSPDEEKSGVQHNIEEENKELGLGDVLSELSEKSNQTISNLA

Query:  DNDQVEAGNLSSSDKDTENLILPIEVETTTTLNECLELPTEDDTKNYIEQTNPSIEDLTQNASIQNLEMVPNNSQQLGHKDKRIRKSKNKNFMLRSFVSS
        DNDQ EAGNL SSDKDTENLILPIEVETT  LNEC E PTED+ KNYIEQ NP IED  QN SI NL MVP+NS ++G KDKR+ KSK KN++LRS +SS
Subjt:  DNDQVEAGNLSSSDKDTENLILPIEVETTTTLNECLELPTEDDTKNYIEQTNPSIEDLTQNASIQNLEMVPNNSQQLGHKDKRIRKSKNKNFMLRSFVSS

Query:  DRVLRSRTQEKAKAPEPSNDLNNFTAGEEGKRKKKKRNIKGKGTRIDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIM
        DRVLRSRTQ+KAKAPEPSNDL+N TAGEEGK KKK R IKGKG R+DEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIM
Subjt:  DRVLRSRTQEKAKAPEPSNDLNNFTAGEEGKRKKKKRNIKGKGTRIDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIM

Query:  RRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGVCDRGFHLFCLEPPLLKSDIPPDDEGWLCPGCDCKDDCMDL
        RRKLKIRDLFQRIDALC+EGR SE+LFDSEGQIDSEDIFC KCGSKELSLENDIILCDGVCDRGFH FCLEPPLL SDIPPDDEGWLCPGCDCKDDC+DL
Subjt:  RRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGVCDRGFHLFCLEPPLLKSDIPPDDEGWLCPGCDCKDDCMDL

Query:  LNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDVPDTID-----HGDESSSDGSSSDQSHSDKSGY--ASASEELEVLANDDQ
        LNEFQGSNLSITDGWEKV+PEAAAAAAG++SDH + LPSDDS+DGDYDPDVPD ID       D SSSD SSSD S SDKSGY  ASASEELE   NDDQ
Subjt:  LNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDVPDTID-----HGDESSSDGSSSDQSHSDKSGY--ASASEELEVLANDDQ

Query:  YLGLPSDDSEDDDFDPSAPERDEGVGQENSSSDFTSDSEDLAALDDNGSSKDDNLMSSSLNNTMPARNSYGKSSGCGPSKSALHNELSSLIESGLDKDGL
        YLGLPSDDSEDDD+DP AP RDEGVGQE+SSSDFTSDSEDLAAL DNGSSKDDN+ SS LNNT+P RNS G+SSG GP+K+A HN+LSSL+ SG D+ GL
Subjt:  YLGLPSDDSEDDDFDPSAPERDEGVGQENSSSDFTSDSEDLAALDDNGSSKDDNLMSSSLNNTMPARNSYGKSSGCGPSKSALHNELSSLIESGLDKDGL

Query:  EPVSGRRHVERLDYKKLHDETYGNVPTDSSDDTYGTVSMDSSDDRGHGRSTRKRSPKNLVPALSKNGTDDLNNIKTKRSSKRTRQKPAGGNMNNSVTKTP
        E VSGRRHVERLDYKKLHDET+GNVPTDSSDDTYG+ S+DSSDDRG GRSTRK SPKN VPALS+NGTDDL NIKTKRSSKRTRQKPA  NM+NSVTKTP
Subjt:  EPVSGRRHVERLDYKKLHDETYGNVPTDSSDDTYGTVSMDSSDDRGHGRSTRKRSPKNLVPALSKNGTDDLNNIKTKRSSKRTRQKPAGGNMNNSVTKTP

Query:  KDTLKSSSSVRRTTSSSHRRLSQPALERLLTSFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSEGNKAKSASRMGIQSSQTSGKLPKPE
        + TLKSSSSVRRTTSSSHRRLSQP LERLL SFQENQYPERATKESLA+ELGLSLKQVSKWFENTRWSTRHPSSE NKAKSASRMG QSSQTS K PKPE
Subjt:  KDTLKSSSSVRRTTSSSHRRLSQPALERLLTSFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSEGNKAKSASRMGIQSSQTSGKLPKPE

Query:  QESGACFRDTDSNGAQHQELLKAISVVAPCQSGDTGDDKLANQKTKRPESTAAKSRKRKGRSDHMASRSKDRKESQKRPAKSPKVNEIQTADKKIKTRRR
        QESGACFRDT SNGAQHQE  KAISVVAPCQSG TGDDKLANQK KRPES A KSRKRKGRSD +ASRSKDRK+S+K PAKS KV+EIQTAD K+K RRR
Subjt:  QESGACFRDTDSNGAQHQELLKAISVVAPCQSGDTGDDKLANQKTKRPESTAAKSRKRKGRSDHMASRSKDRKESQKRPAKSPKVNEIQTADKKIKTRRR

Query:  KSI
        KS+
Subjt:  KSI

A0A6J1IPM8 homeobox protein HAT3.1-like0.0e+0082.56Show/hide
Query:  MEERDEYTESRSINNSEGVQEAKASVEVEVLTCLSNEQRHSVSDYQELGTTPDHTSKTGSPDEEKSGVQHNIEEENKELGLGDVLSELSEKSNQTISNLA
        MEERDEYTESR+I+ S  VQEAKASVEVEVLT L+NEQ  S  +Y ELGT  D TSKTGSPDEEK GV+ N+EE++KEL LG+  SEL EKS+QTIS LA
Subjt:  MEERDEYTESRSINNSEGVQEAKASVEVEVLTCLSNEQRHSVSDYQELGTTPDHTSKTGSPDEEKSGVQHNIEEENKELGLGDVLSELSEKSNQTISNLA

Query:  DNDQVEAGNLSSSDKDTENLILPIEVETTTTLNECLELPTEDDTKNYIEQTNPSIEDLTQNASIQNLEMVPNNSQQLGHKDKRIRKSKNKNFMLRSFVSS
        +NDQ EAGNL SSDKDTENLILPIEVETT  LNEC E PTEDD KNYIEQ NP IE   QN SI+NL MVP+NS +LG KDKR+ KSK KN++LRS VSS
Subjt:  DNDQVEAGNLSSSDKDTENLILPIEVETTTTLNECLELPTEDDTKNYIEQTNPSIEDLTQNASIQNLEMVPNNSQQLGHKDKRIRKSKNKNFMLRSFVSS

Query:  DRVLRSRTQEKAKAPEPSNDLNNFTAGEEGKRKKKKRN--IKGKGTRIDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNE
        DRVLRSRTQ+KAKAPEPSNDL+N TAGEEGK KK+K+N  IKGKG R+DEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNE
Subjt:  DRVLRSRTQEKAKAPEPSNDLNNFTAGEEGKRKKKKRN--IKGKGTRIDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNE

Query:  IMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGVCDRGFHLFCLEPPLLKSDIPPDDEGWLCPGCDCKDDCM
        IMRRKLKIRDLFQRIDALC+EGR SE+LFDSEGQIDSEDIFC KCGSKELSLENDIILCDGVCDRGFH FCLEPPLL SDIPPDDEGWLCPGCDCKDDC+
Subjt:  IMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGVCDRGFHLFCLEPPLLKSDIPPDDEGWLCPGCDCKDDCM

Query:  DLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDVPDTIDHGDESSSDGSSSDQSHSDKSGY--ASASEELEVLANDDQYLG
        DLLNEFQGSNLSITDGWEKV+PEAAAAAAGR+SDH + LPSDDS+DGDYDPDVPD ID   ESSSD SSSDQS SDKSGY  ASASEELE   NDDQYLG
Subjt:  DLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDVPDTIDHGDESSSDGSSSDQSHSDKSGY--ASASEELEVLANDDQYLG

Query:  LPSDDSEDDDFDPSAPERDEGVGQENSSSDFTSDSEDLAALDDNGSSKDDNLMSSSLNNTMPARNSYGKSSGCGPSKSALHNELSSLIESGLDKDGLEPV
        LPSDDSEDDD+DP AP RDEGVGQE+SSSDFTSDSEDLAAL DNGSSKDDN+ SS LNNT P RNS G+SSG GP+K+A HN+LSSL+ SG D+ GLE V
Subjt:  LPSDDSEDDDFDPSAPERDEGVGQENSSSDFTSDSEDLAALDDNGSSKDDNLMSSSLNNTMPARNSYGKSSGCGPSKSALHNELSSLIESGLDKDGLEPV

Query:  SGRRHVERLDYKKLHDETYGNVPTDSSDDTYGTVSMDSSDDRGHGRSTRKRSPKNLVPALSKNGTDDLNNIKTKRSSKRTRQKPAGGNMNNSVTKTPKDT
        SGRRHVERLDYKKLHDET+GNVP++SSDDTYG+ S+DSSDDRG GRSTRK SPKNLVPALS+NGTDD  NIKTK SS+RTRQKPA  NM+NSVTKTP+ T
Subjt:  SGRRHVERLDYKKLHDETYGNVPTDSSDDTYGTVSMDSSDDRGHGRSTRKRSPKNLVPALSKNGTDDLNNIKTKRSSKRTRQKPAGGNMNNSVTKTPKDT

Query:  LKSSSSVRRTTSSSHRRLSQPALERLLTSFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSEGNKAKSASRMGIQSSQTSGKLPKPEQES
        LKSSSSVRRTTSSSHRRLSQP LERLL SFQENQYPERATKESLA+ELGLSLKQVSKWFENTRWSTRHPSSE NKAKSASRMG QSSQTS K PKPEQES
Subjt:  LKSSSSVRRTTSSSHRRLSQPALERLLTSFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSEGNKAKSASRMGIQSSQTSGKLPKPEQES

Query:  GACFRDTDSNGAQHQELLKAISVVAPCQSGDTGDDKLANQKTKRPESTAAKSRKRKGRSDHMASRSKDRKESQKRPAKSPKVNEIQTADKKIKTRRRKSI
        GACFRDT SNGAQHQE  KAI+VVAPCQSG TGDDKLA  KTKRPESTA KSRKRKGRSD +ASRSK+RK+S+K PAKS KV+EIQTAD K+K RRRKS+
Subjt:  GACFRDTDSNGAQHQELLKAISVVAPCQSGDTGDDKLANQKTKRPESTAAKSRKRKGRSDHMASRSKDRKESQKRPAKSPKVNEIQTADKKIKTRRRKSI

SwissProt top hitse value%identityAlignment
P46605 Homeobox protein HOX1A3.5e-10137.99Show/hide
Query:  EDLTQNASIQNLEMVPNNSQQLGHKDKRIRKSKNKN-----------FMLRSFVSSDRVLRSRTQEKAKAPEPSNDLNNFTAGEEGKRKKKKRNIKGKGT
        E L  +  +   + +PNN   LG + K  +++ N+            + L S  S  RVLRS +  K  + E      +  A  +   K++K +     +
Subjt:  EDLTQNASIQNLEMVPNNSQQLGHKDKRIRKSKNKN-----------FMLRSFVSSDRVLRSRTQEKAKAPEPSNDLNNFTAGEEGKRKKKKRNIKGKGT

Query:  RIDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCG
          DEFS IR  +RY++NR+ YEQSLIEAY+SEGWK  S DK++PEKEL+RA +EI+R KL+IR++F+ ID+L ++G++ E+LFDSEG+I  EDIFC+ CG
Subjt:  RIDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCG

Query:  SKELSLENDIILCDGVCDRGFHLFCLEPPLLKSDIPPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSED
        S + +L NDIILCDG CDRGFH  CL PPL   DIP  DEGWLCP CDCK DC+DL+NE  GSN+SI D WEKV+P+AAA A     D A  LPSDDS+D
Subjt:  SKELSLENDIILCDGVCDRGFHLFCLEPPLLKSDIPPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSED

Query:  GDYDPDVPDTIDHG-DESSSDGSSSDQSHSDKSGYASASEELEVLAN---DDQYLGLPSDDSEDDDFDPSAPERDEGVGQENSS--SDFTSDSEDLAALD
         D+DP++P+    G DE SS+      S SD S + + S++ E L +   DD  L LPS+DSEDDD+DP+ P+ D+ V +++SS  SDFTSDS+D     
Subjt:  GDYDPDVPDTIDHG-DESSSDGSSSDQSHSDKSGYASASEELEVLAN---DDQYLGLPSDDSEDDDFDPSAPERDEGVGQENSS--SDFTSDSEDLAALD

Query:  DNGSSKDDNLMSSSLNNTMPARNSYGKSSGCGPSKSALHNELSSLIESGLDKDGLEPVSGRRHVERLDYKKLHDETYGNVPTDSSDD---TYGTVSMDSS
        +   S  D + S  L +         K +    + S+  + + + I+ G+    + P S RR  ERLDYKKL+DE YG   +DSSDD   +     +  S
Subjt:  DNGSSKDDNLMSSSLNNTMPARNSYGKSSGCGPSKSALHNELSSLIESGLDKDGLEPVSGRRHVERLDYKKLHDETYGNVPTDSSDD---TYGTVSMDSS

Query:  DDRGHGRSTRKRSPKNLVPALSKNGTDDLNNIKTKRSSKRTRQKPAGGNMNNSVTKTPKDTLKSSSSVRRTTSSSHRRLSQPAL-ERLLTSFQENQYPER
        ++ G   S   +  + +      +  D+L    TK+S           +++ SV + P D   + S+     S++ +    P + ++L   F+   YP R
Subjt:  DDRGHGRSTRKRSPKNLVPALSKNGTDDLNNIKTKRSSKRTRQKPAGGNMNNSVTKTPKDTLKSSSSVRRTTSSSHRRLSQPAL-ERLLTSFQENQYPER

Query:  ATKESLAQELGLSLKQVSKWFENTRWSTRHPSS-EGNKAKSASRMGIQSSQTSGKLPK-PE----QESGACFRDTDSNGAQHQELLKAISVVAPCQSGDT
        + KESLA+ELGL+ +QV+KWFE  R S R  SS +G      S     S  T+   PK PE    +ES  C     + G    +   +  V +     D 
Subjt:  ATKESLAQELGLSLKQVSKWFENTRWSTRHPSS-EGNKAKSASRMGIQSSQTSGKLPK-PE----QESGACFRDTDSNGAQHQELLKAISVVAPCQSGDT

Query:  GDDKLANQKTKRPESTAAKSRKRKGRSDHMASRSKDR
        G  K+ + + + P    A+  ++K     +  +   R
Subjt:  GDDKLANQKTKRPESTAAKSRKRKGRSDHMASRSKDR

P48785 Pathogenesis-related homeodomain protein3.7e-4727.67Show/hide
Query:  KDKRIRKSKNKNFMLRSFVSSDRVLRSRTQEKAKAPEPSNDLNNFTAGEEGKRKKKKRNIKGKGTRIDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWK
        K K +   +N     R     +   +SRT++ ++      ++      +  KRK K++  K     +D+   ++   RYL+ ++K +Q+LI+AY++EGWK
Subjt:  KDKRIRKSKNKNFMLRSFVSSDRVLRSRTQEKAKAPEPSNDLNNFTAGEEGKRKKKKRNIKGKGTRIDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWK

Query:  GFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGVCDRGFHLFCLEPPLLKSDI
        G S +K++P+KEL+RA  EI+  KL +RD  +++D L + G + E +  S+G I  + IFCA+C S+E   +NDIILCDG C+R FH  CL+PPL    I
Subjt:  GFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGVCDRGFHLFCLEPPLLKSDI

Query:  PPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDVPDTIDHGDESSSDGSSSDQSHSDKSGYA
        PP D+GW C  CDCK + +D +N   G++  +   W+ ++ E A+   G                                                  A
Subjt:  PPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDVPDTIDHGDESSSDGSSSDQSHSDKSGYA

Query:  SASEELEVLANDDQYLGLPSDDSEDDDFDPSAPERDEGVGQENSSSDFTSDSEDLAALDDNGSSKDDNLMSSSLNNTMPARNSYGKSSGCGPSKSALHNE
        + + E +           PSDDS+DDD+DP    R+ G G  ++ S             D G   D+  +S+SL     + +S G +   G  +    + 
Subjt:  SASEELEVLANDDQYLGLPSDDSEDDDFDPSAPERDEGVGQENSSSDFTSDSEDLAALDDNGSSKDDNLMSSSLNNTMPARNSYGKSSGCGPSKSALHNE

Query:  LSSLIESGLDKDGLEPVSGRRHVERLDYKKLHDETYGNVPTDSSDDTYGTVSMDSSDDRGHGRSTRKRSPKNLVPALSKNGTDDLNNIKTKRSSKRTRQK
        LS+++E   +    E V G R    +DY +L+ E +G             +    S+D   G + R++          K  +D  + + T   S +  Q 
Subjt:  LSSLIESGLDKDGLEPVSGRRHVERLDYKKLHDETYGNVPTDSSDDTYGTVSMDSSDDRGHGRSTRKRSPKNLVPALSKNGTDDLNNIKTKRSSKRTRQK

Query:  PAGGNMNNSVTKTPKDTLKSSSSVRRTTSSSHR-RLSQPALERLLTSFQENQYPERATKESLAQELGLSLKQVSKWFENTRW-STRHPSSEGNKAKSASR
                 V +T + + + S SV          RL + A+E+L   F E + P +A ++ LA+EL L  ++V+KWF+NTR+ + R+  +E  K    S+
Subjt:  PAGGNMNNSVTKTPKDTLKSSSSVRRTTSSSHR-RLSQPALERLLTSFQENQYPERATKESLAQELGLSLKQVSKWFENTRW-STRHPSSEGNKAKSASR

P48786 Pathogenesis-related homeodomain protein1.7e-11644.2Show/hide
Query:  VSSDRVLRSRTQEKAKAPEPSNDLNNFTAGEEGKRKK--KKRNIKGKGTRIDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRA
        V+S R LRSR+QEK+  P    D+NN  A E   R+K  KKR  + +  R+DEF  IR HLRYL++RIKYE++ ++AYS EGWKG S DK+KPEKEL+RA
Subjt:  VSSDRVLRSRTQEKAKAPEPSNDLNNFTAGEEGKRKK--KKRNIKGKGTRIDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRA

Query:  SNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGVCDRGFHLFCLEPPLLKSDIPPDDEGWLCPGCDCKD
          EI  RKLKIRDLFQR+D   +EGRL E LFDS G+IDSEDIFCAKCGSK+++L NDIILCDG CDRGFH FCL+PPLLK  IPPDDEGWLCPGC+CK 
Subjt:  SNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGVCDRGFHLFCLEPPLLKSDIPPDDEGWLCPGCDCKD

Query:  DCMDLLNEFQGSNLSITDGWEKVY-PEAAAAAAGRNSDHALGLPSDDSEDGDYDPDVPDTIDHGDESSSDGSSSDQSHSDKSGYASASEELEVLANDDQY
        DC+ LLN+ Q +N+ + D WEKV+  EAAAAA+G+N D   GLPSDDSED DYDP  PD          +    D S +D+S Y S S++++V+   +  
Subjt:  DCMDLLNEFQGSNLSITDGWEKVY-PEAAAAAAGRNSDHALGLPSDDSEDGDYDPDVPDTIDHGDESSSDGSSSDQSHSDKSGYASASEELEVLANDDQY

Query:  LGLPSDDSEDDDFDPSAPERDEGVGQENSSSDFTSDSEDLAALDDNGSSKDDNLMSSSLNNTMP-ARNSYGKSSGCGPSKSALHNELSSLIESGLDKDGL
         GLPSDDSEDD++DPS    D+ + +++S SDFTSDSED   + D+   KD       L +T    RN+     GCG      H E          +   
Subjt:  LGLPSDDSEDDDFDPSAPERDEGVGQENSSSDFTSDSEDLAALDDNGSSKDDNLMSSSLNNTMP-ARNSYGKSSGCGPSKSALHNELSSLIESGLDKDGL

Query:  EPVSGRRHVERLDYKKLHD--------------------------ETYGNVPTDSSDDTYGTVSMDSSDDRGHGRSTRKRSPKNLVPALSKNGTDDLNNI
         P+  RR VE LDYKKL+D                          E YGN  +DSSD+ Y   S    ++     +  +R  ++    L +   +  +N 
Subjt:  EPVSGRRHVERLDYKKLHD--------------------------ETYGNVPTDSSDDTYGTVSMDSSDDRGHGRSTRKRSPKNLVPALSKNGTDDLNNI

Query:  KTKRSSKRTRQKPAGGNMNNSVTKTPKDTLKSSSSVRRTTSSSHRRLSQPALERLLTSFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSS
              +R  +K A    ++ ++++ +D+    +  + T+ + H    + A +RLL SF+ENQYP+RA KESLA EL LS++QVS WF N RWS RH S 
Subjt:  KTKRSSKRTRQKPAGGNMNNSVTKTPKDTLKSSSSVRRTTSSSHRRLSQPALERLLTSFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSS

Query:  EGN-----------KAKSASRMG-----IQSSQTSGKLPKPEQESGA
         G+           + KS    G     +  S T  ++ K EQ++ +
Subjt:  EGN-----------KAKSASRMG-----IQSSQTSGKLPKPEQESGA

Q04996 Homeobox protein HAT3.13.4e-11745.03Show/hide
Query:  NNSQQLGHKDKRIRKSKNKNFMLRSFVSSDRVLRSRTQEKAKAPEPSNDLNNFTAGEEGKRKKKKRNIKGKGTRIDEFSSIRNHLRYLVNRIKYEQSLIE
        +N  +     KR+     +N +  S          R Q   +   PS+ + N T     K+K K  N KG+    DE++ I+  LRY +NRI YEQSLI+
Subjt:  NNSQQLGHKDKRIRKSKNKNFMLRSFVSSDRVLRSRTQEKAKAPEPSNDLNNFTAGEEGKRKKKKRNIKGKGTRIDEFSSIRNHLRYLVNRIKYEQSLIE

Query:  AYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGVCDRGFHLFCLE
        AYS EGWKG S +K++PEKEL+RA+ EI+RRKLKIRDLFQ +D LCAEG L ESLFD++G+I SEDIFCAKCGSK+LS++NDIILCDG CDRGFH +CLE
Subjt:  AYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGVCDRGFHLFCLE

Query:  PPLLKSDIPPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDVPDTIDHGDESSSDG--SSSD
        PPL K DIPPDDEGWLCPGCDCKDD +DLLN+  G+  S++D WEK++PEAAAA  G   +    LPSDDS+D +YDPD  +  ++ ++ S D   S ++
Subjt:  PPLLKSDIPPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDVPDTIDHGDESSSDG--SSSD

Query:  QSHSDKSGYASASEELEVLANDDQ-----YLGLPSDDSEDDDFDPSAPERDEGVGQENSSSDFTSDSEDLAALDDNGSSKDDNLMSSSLNNTMPARNSYG
           SD++ + SAS+E+     + +      + LPSDDSEDDD+DP AP  D+   +E+S+SD TSD+EDL       S K D     + +  +       
Subjt:  QSHSDKSGYASASEELEVLANDDQ-----YLGLPSDDSEDDDFDPSAPERDEGVGQENSSSDFTSDSEDLAALDDNGSSKDDNLMSSSLNNTMPARNSYG

Query:  KSSGCGPSKSALHNELSSLIES--GLDKDGLEPVSGRRHVERLDYKKLHDETYGNVPTDSSDDTYGTVSMDSSDDRGHGRSTRKRSPKNLVPALSKNGTD
              P +     +  +++ES  GLD DG   VS RR+VERLDYKKL+DE Y NVPT SSDD       D +   G    +      + VP    +  +
Subjt:  KSSGCGPSKSALHNELSSLIES--GLDKDGLEPVSGRRHVERLDYKKLHDETYGNVPTDSSDDTYGTVSMDSSDDRGHGRSTRKRSPKNLVPALSKNGTD

Query:  DLNNIKTKRSSKR-----TRQKPAGGNMNNSVTKTPKDTLKSSSSVRRTTSSSHRRLSQPALERLLTSFQENQYPERATKESLAQELGLSLKQVSKWFEN
        D  + K  R SKR     T + P  G   N            S  + +++SS+ ++ + P  +RL  SFQENQYP++ATKESLA+EL +++KQV+ WF++
Subjt:  DLNNIKTKRSSKR-----TRQKPAGGNMNNSVTKTPKDTLKSSSSVRRTTSSSHRRLSQPALERLLTSFQENQYPERATKESLAQELGLSLKQVSKWFEN

Query:  TRWS
         RWS
Subjt:  TRWS

Q8H991 Homeobox protein HAZ11.3e-10342.28Show/hide
Query:  TKNYIEQTNPSIEDLTQNASIQNLE--MVPNNSQQLGHKDKRIRKSKNKNFMLRSFVSSDRVLRSRTQEKAKAPEPSNDLNNFTAGEEGKRKKKK--RNI
        +K Y  +++ S   + ++AS +  E  +VP N        +R+ K + ++  LR   +  RVLRS +++K KA    N+L N  AG +   KK+K  R  
Subjt:  TKNYIEQTNPSIEDLTQNASIQNLE--MVPNNSQQLGHKDKRIRKSKNKNFMLRSFVSSDRVLRSRTQEKAKAPEPSNDLNNFTAGEEGKRKKKK--RNI

Query:  KGKGTRIDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIF
        KG GT  D++  IR  +RY++NR+ YEQSLI+AY+SEGWKG S +K++PEKEL+RA  EI+R K +IR+ F+ +D+L +EG+L ES+FDS G+I SEDIF
Subjt:  KGKGTRIDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIF

Query:  CAKCGSKELSLENDIILCDGVCDRGFHLFCLEPPLLKSDIPPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPS
        CA CGSK+++L+NDIILCDG+CDRGFH +CL PPLL  DIP  DEGWLCP CDCK DC+D+LNE QG  LSI D WEKV+PEAA+   G     A  LPS
Subjt:  CAKCGSKELSLENDIILCDGVCDRGFHLFCLEPPLLKSDIPPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPS

Query:  DDSEDGDYDP----------DVPDTIDHGDESSSDGSSSDQSHSDKSGYASASEELEVLANDDQYLGLPSDDSEDDDFDPSAPERDEGVGQENSS-----
        DDS D DYDP          +     D G+   SD SSS+ S S +   +  S+    +  DD  LGLPS+DSED DFDP+ P+ D+    E++S     
Subjt:  DDSEDGDYDP----------DVPDTIDHGDESSSDGSSSDQSHSDKSGYASASEELEVLANDDQYLGLPSDDSEDDDFDPSAPERDEGVGQENSS-----

Query:  SDFTSDSEDLAALDDNGSSKDDNLMSSSLNNTMPARNSYGKSSGCGPSKSALHNELSSLIESGLDKDGLEPVSGRRHVERLDYKKLHDETYGNVPTDSSD
        SDFTSDS+D  A       +D+    SS       R      SG     +A ++ L + +E+ L++D + P+S +R VERLDYKKL++E YG   +DSSD
Subjt:  SDFTSDSEDLAALDDNGSSKDDNLMSSSLNNTMPARNSYGKSSGCGPSKSALHNELSSLIESGLDKDGLEPVSGRRHVERLDYKKLHDETYGNVPTDSSD

Query:  DT--YGTVSMDSSDDRGHGRSTRKRSPKNLVPALSKNGTDDLNNIKTKRSSKRTRQKPAGGNMNNSVTKTPKDTLKSSSSVRRTTSSSHRRLSQPALERL
        D   YG  + +  +       +   SP+       +      NN  T ++      +P G     SV+    + L S+S+    +++ +R       ++L
Subjt:  DT--YGTVSMDSSDDRGHGRSTRKRSPKNLVPALSKNGTDDLNNIKTKRSSKRTRQKPAGGNMNNSVTKTPKDTLKSSSSVRRTTSSSHRRLSQPALERL

Query:  LTSFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTR
           F+E+ YP RATKE+LAQELGL+  QV+KWF +TR   R
Subjt:  LTSFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTR

Arabidopsis top hitse value%identityAlignment
AT3G19510.1 Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain2.4e-11845.03Show/hide
Query:  NNSQQLGHKDKRIRKSKNKNFMLRSFVSSDRVLRSRTQEKAKAPEPSNDLNNFTAGEEGKRKKKKRNIKGKGTRIDEFSSIRNHLRYLVNRIKYEQSLIE
        +N  +     KR+     +N +  S          R Q   +   PS+ + N T     K+K K  N KG+    DE++ I+  LRY +NRI YEQSLI+
Subjt:  NNSQQLGHKDKRIRKSKNKNFMLRSFVSSDRVLRSRTQEKAKAPEPSNDLNNFTAGEEGKRKKKKRNIKGKGTRIDEFSSIRNHLRYLVNRIKYEQSLIE

Query:  AYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGVCDRGFHLFCLE
        AYS EGWKG S +K++PEKEL+RA+ EI+RRKLKIRDLFQ +D LCAEG L ESLFD++G+I SEDIFCAKCGSK+LS++NDIILCDG CDRGFH +CLE
Subjt:  AYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGVCDRGFHLFCLE

Query:  PPLLKSDIPPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDVPDTIDHGDESSSDG--SSSD
        PPL K DIPPDDEGWLCPGCDCKDD +DLLN+  G+  S++D WEK++PEAAAA  G   +    LPSDDS+D +YDPD  +  ++ ++ S D   S ++
Subjt:  PPLLKSDIPPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDVPDTIDHGDESSSDG--SSSD

Query:  QSHSDKSGYASASEELEVLANDDQ-----YLGLPSDDSEDDDFDPSAPERDEGVGQENSSSDFTSDSEDLAALDDNGSSKDDNLMSSSLNNTMPARNSYG
           SD++ + SAS+E+     + +      + LPSDDSEDDD+DP AP  D+   +E+S+SD TSD+EDL       S K D     + +  +       
Subjt:  QSHSDKSGYASASEELEVLANDDQ-----YLGLPSDDSEDDDFDPSAPERDEGVGQENSSSDFTSDSEDLAALDDNGSSKDDNLMSSSLNNTMPARNSYG

Query:  KSSGCGPSKSALHNELSSLIES--GLDKDGLEPVSGRRHVERLDYKKLHDETYGNVPTDSSDDTYGTVSMDSSDDRGHGRSTRKRSPKNLVPALSKNGTD
              P +     +  +++ES  GLD DG   VS RR+VERLDYKKL+DE Y NVPT SSDD       D +   G    +      + VP    +  +
Subjt:  KSSGCGPSKSALHNELSSLIES--GLDKDGLEPVSGRRHVERLDYKKLHDETYGNVPTDSSDDTYGTVSMDSSDDRGHGRSTRKRSPKNLVPALSKNGTD

Query:  DLNNIKTKRSSKR-----TRQKPAGGNMNNSVTKTPKDTLKSSSSVRRTTSSSHRRLSQPALERLLTSFQENQYPERATKESLAQELGLSLKQVSKWFEN
        D  + K  R SKR     T + P  G   N            S  + +++SS+ ++ + P  +RL  SFQENQYP++ATKESLA+EL +++KQV+ WF++
Subjt:  DLNNIKTKRSSKR-----TRQKPAGGNMNNSVTKTPKDTLKSSSSVRRTTSSSHRRLSQPALERLLTSFQENQYPERATKESLAQELGLSLKQVSKWFEN

Query:  TRWS
         RWS
Subjt:  TRWS

AT4G29940.1 pathogenesis related homeodomain protein A2.7e-4827.67Show/hide
Query:  KDKRIRKSKNKNFMLRSFVSSDRVLRSRTQEKAKAPEPSNDLNNFTAGEEGKRKKKKRNIKGKGTRIDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWK
        K K +   +N     R     +   +SRT++ ++      ++      +  KRK K++  K     +D+   ++   RYL+ ++K +Q+LI+AY++EGWK
Subjt:  KDKRIRKSKNKNFMLRSFVSSDRVLRSRTQEKAKAPEPSNDLNNFTAGEEGKRKKKKRNIKGKGTRIDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWK

Query:  GFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGVCDRGFHLFCLEPPLLKSDI
        G S +K++P+KEL+RA  EI+  KL +RD  +++D L + G + E +  S+G I  + IFCA+C S+E   +NDIILCDG C+R FH  CL+PPL    I
Subjt:  GFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGVCDRGFHLFCLEPPLLKSDI

Query:  PPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDVPDTIDHGDESSSDGSSSDQSHSDKSGYA
        PP D+GW C  CDCK + +D +N   G++  +   W+ ++ E A+   G                                                  A
Subjt:  PPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDVPDTIDHGDESSSDGSSSDQSHSDKSGYA

Query:  SASEELEVLANDDQYLGLPSDDSEDDDFDPSAPERDEGVGQENSSSDFTSDSEDLAALDDNGSSKDDNLMSSSLNNTMPARNSYGKSSGCGPSKSALHNE
        + + E +           PSDDS+DDD+DP    R+ G G  ++ S             D G   D+  +S+SL     + +S G +   G  +    + 
Subjt:  SASEELEVLANDDQYLGLPSDDSEDDDFDPSAPERDEGVGQENSSSDFTSDSEDLAALDDNGSSKDDNLMSSSLNNTMPARNSYGKSSGCGPSKSALHNE

Query:  LSSLIESGLDKDGLEPVSGRRHVERLDYKKLHDETYGNVPTDSSDDTYGTVSMDSSDDRGHGRSTRKRSPKNLVPALSKNGTDDLNNIKTKRSSKRTRQK
        LS+++E   +    E V G R    +DY +L+ E +G             +    S+D   G + R++          K  +D  + + T   S +  Q 
Subjt:  LSSLIESGLDKDGLEPVSGRRHVERLDYKKLHDETYGNVPTDSSDDTYGTVSMDSSDDRGHGRSTRKRSPKNLVPALSKNGTDDLNNIKTKRSSKRTRQK

Query:  PAGGNMNNSVTKTPKDTLKSSSSVRRTTSSSHR-RLSQPALERLLTSFQENQYPERATKESLAQELGLSLKQVSKWFENTRW-STRHPSSEGNKAKSASR
                 V +T + + + S SV          RL + A+E+L   F E + P +A ++ LA+EL L  ++V+KWF+NTR+ + R+  +E  K    S+
Subjt:  PAGGNMNNSVTKTPKDTLKSSSSVRRTTSSSHR-RLSQPALERLLTSFQENQYPERATKESLAQELGLSLKQVSKWFENTRW-STRHPSSEGNKAKSASR

AT4G29940.2 pathogenesis related homeodomain protein A2.7e-4827.67Show/hide
Query:  KDKRIRKSKNKNFMLRSFVSSDRVLRSRTQEKAKAPEPSNDLNNFTAGEEGKRKKKKRNIKGKGTRIDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWK
        K K +   +N     R     +   +SRT++ ++      ++      +  KRK K++  K     +D+   ++   RYL+ ++K +Q+LI+AY++EGWK
Subjt:  KDKRIRKSKNKNFMLRSFVSSDRVLRSRTQEKAKAPEPSNDLNNFTAGEEGKRKKKKRNIKGKGTRIDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWK

Query:  GFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGVCDRGFHLFCLEPPLLKSDI
        G S +K++P+KEL+RA  EI+  KL +RD  +++D L + G + E +  S+G I  + IFCA+C S+E   +NDIILCDG C+R FH  CL+PPL    I
Subjt:  GFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGVCDRGFHLFCLEPPLLKSDI

Query:  PPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDVPDTIDHGDESSSDGSSSDQSHSDKSGYA
        PP D+GW C  CDCK + +D +N   G++  +   W+ ++ E A+   G                                                  A
Subjt:  PPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDVPDTIDHGDESSSDGSSSDQSHSDKSGYA

Query:  SASEELEVLANDDQYLGLPSDDSEDDDFDPSAPERDEGVGQENSSSDFTSDSEDLAALDDNGSSKDDNLMSSSLNNTMPARNSYGKSSGCGPSKSALHNE
        + + E +           PSDDS+DDD+DP    R+ G G  ++ S             D G   D+  +S+SL     + +S G +   G  +    + 
Subjt:  SASEELEVLANDDQYLGLPSDDSEDDDFDPSAPERDEGVGQENSSSDFTSDSEDLAALDDNGSSKDDNLMSSSLNNTMPARNSYGKSSGCGPSKSALHNE

Query:  LSSLIESGLDKDGLEPVSGRRHVERLDYKKLHDETYGNVPTDSSDDTYGTVSMDSSDDRGHGRSTRKRSPKNLVPALSKNGTDDLNNIKTKRSSKRTRQK
        LS+++E   +    E V G R    +DY +L+ E +G             +    S+D   G + R++          K  +D  + + T   S +  Q 
Subjt:  LSSLIESGLDKDGLEPVSGRRHVERLDYKKLHDETYGNVPTDSSDDTYGTVSMDSSDDRGHGRSTRKRSPKNLVPALSKNGTDDLNNIKTKRSSKRTRQK

Query:  PAGGNMNNSVTKTPKDTLKSSSSVRRTTSSSHR-RLSQPALERLLTSFQENQYPERATKESLAQELGLSLKQVSKWFENTRW-STRHPSSEGNKAKSASR
                 V +T + + + S SV          RL + A+E+L   F E + P +A ++ LA+EL L  ++V+KWF+NTR+ + R+  +E  K    S+
Subjt:  PAGGNMNNSVTKTPKDTLKSSSSVRRTTSSSHR-RLSQPALERLLTSFQENQYPERATKESLAQELGLSLKQVSKWFENTRW-STRHPSSEGNKAKSASR

AT5G09790.1 ARABIDOPSIS TRITHORAX-RELATED PROTEIN 56.2e-0538.24Show/hide
Query:  DSEGQIDSEDIFCAKCGSKELSLENDIILCDGVCDRGFHLFCLEPPLLKSDIPPDDEGWLCPGCDCKD
        + E +    ++ C KCGS E   +++++LCD  CDRGFH+ CL P +++  I      WLC   DC D
Subjt:  DSEGQIDSEDIFCAKCGSKELSLENDIILCDGVCDRGFHLFCLEPPLLKSDIPPDDEGWLCPGCDCKD

AT5G09790.2 ARABIDOPSIS TRITHORAX-RELATED PROTEIN 56.2e-0538.24Show/hide
Query:  DSEGQIDSEDIFCAKCGSKELSLENDIILCDGVCDRGFHLFCLEPPLLKSDIPPDDEGWLCPGCDCKD
        + E +    ++ C KCGS E   +++++LCD  CDRGFH+ CL P +++  I      WLC   DC D
Subjt:  DSEGQIDSEDIFCAKCGSKELSLENDIILCDGVCDRGFHLFCLEPPLLKSDIPPDDEGWLCPGCDCKD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCGGCGGTAGCGGAAGGCAAACGGTGCGCGTGAAGGCTGGGCAAAAATCCGCTTTATTCTCTCTTCGCGCTGAGAGAAGAAGAAGAAGAAGAAGAAGAAGAACGAC
GACGACAACGACAACGACAACAACCACCATCGCCAAAAATTTTACTGTCTCACAAAACTCCATAAACGACCACCAACAAGATCCAAAACCGCCTTCTTTCACGGACAATA
CTATGCTCTTTCGCCAACACCAGGATTCAATGGATGTGACGATTAGTACCTTAGGCATTCCTTCTTTAGATGTTGAGGTGCGTGCAAGCTGGCTCCAGACACTCACGAAT
ATCAAAAAAAATAAGAATCGGAATAATATGGAAGAAAGAGATGAATATACAGAATCAAGATCTATTAATAACTCTGAAGGCGTACAAGAAGCCAAGGCCAGTGTTGAAGT
TGAAGTGCTAACTTGTCTTTCAAATGAGCAAAGGCATTCAGTTTCCGACTATCAGGAATTGGGAACAACTCCAGATCATACCAGCAAAACTGGAAGTCCAGATGAAGAAA
AATCAGGGGTCCAGCATAATATTGAGGAAGAGAATAAGGAACTTGGTTTGGGAGATGTGCTCAGTGAGTTATCAGAAAAAAGTAATCAGACTATCTCAAACCTTGCTGAT
AATGATCAAGTTGAAGCTGGCAATTTATCATCCAGTGATAAAGATACTGAAAATTTAATATTACCTATTGAAGTTGAGACGACGACGACTCTTAATGAGTGCTTGGAACT
TCCAACTGAAGATGACACCAAAAATTATATCGAACAGACAAACCCTTCCATCGAAGATTTAACCCAAAATGCTTCTATCCAAAATTTAGAAATGGTCCCCAATAATTCCC
AACAATTGGGTCACAAGGATAAGAGAATTCGGAAATCAAAAAACAAGAATTTTATGTTAAGGTCCTTTGTAAGTAGTGACAGAGTTTTGCGTTCAAGGACCCAAGAGAAA
GCTAAAGCTCCTGAACCAAGTAATGACTTGAATAATTTTACTGCTGGAGAGGAGGGAAAAAGGAAGAAGAAGAAGAGAAATATAAAGGGAAAGGGAACAAGAATTGATGA
GTTTTCATCAATCAGGAATCATTTAAGATATTTAGTGAACCGCATCAAATATGAACAGAGCTTGATTGAAGCTTATTCTAGTGAAGGCTGGAAAGGGTTCAGCTCAGATA
AATTGAAGCCTGAAAAGGAACTTCAACGGGCATCAAATGAAATAATGCGACGCAAATTGAAAATAAGAGATCTATTTCAACGTATTGATGCACTTTGTGCTGAAGGAAGG
CTTTCTGAATCTTTATTCGATTCTGAAGGGCAGATAGACAGTGAGGATATATTCTGTGCAAAATGTGGATCCAAGGAACTGTCCCTTGAGAATGACATCATACTATGTGA
TGGTGTTTGTGATCGTGGGTTCCACCTGTTCTGTTTAGAACCACCTTTGCTAAAATCAGACATTCCACCGGATGATGAGGGATGGCTATGCCCCGGATGTGATTGCAAGG
ATGACTGCATGGATCTCCTTAATGAATTTCAAGGATCAAATCTTTCTATTACTGATGGTTGGGAGAAAGTCTACCCAGAGGCTGCAGCAGCAGCTGCTGGACGAAATTCT
GATCACGCCTTGGGTCTTCCTTCAGATGATTCTGAAGATGGTGATTATGATCCTGATGTTCCAGATACTATTGACCATGGTGATGAATCAAGTTCTGATGGATCGAGTTC
TGATCAATCACATTCTGATAAATCTGGGTATGCTTCTGCTTCTGAGGAATTGGAGGTTCTAGCCAATGATGACCAGTACTTAGGTCTCCCTTCAGATGACTCGGAGGATG
ATGATTTTGATCCCAGTGCTCCAGAACGTGATGAAGGTGTTGGACAGGAAAATTCAAGTTCTGACTTTACATCTGATTCTGAGGATCTAGCTGCACTTGACGATAACGGG
TCTTCTAAAGATGATAACCTCATGTCTTCTTCTTTAAATAATACTATGCCTGCGAGAAACTCTTATGGGAAAAGTTCTGGATGCGGCCCTAGCAAGAGTGCACTACATAA
CGAGTTATCAAGTCTAATAGAGTCAGGTCTCGATAAGGATGGTCTTGAACCAGTTTCAGGAAGACGACATGTGGAACGGTTGGATTACAAGAAGCTCCATGATGAGACAT
ACGGGAATGTTCCTACCGACTCAAGCGATGACACATATGGTACTGTTTCTATGGATTCAAGTGATGACAGAGGTCATGGTAGGAGTACAAGGAAGAGAAGTCCTAAAAAC
CTGGTTCCTGCATTGTCAAAAAATGGAACTGATGATTTGAATAATATAAAAACTAAACGCAGTTCTAAGAGAACTCGTCAAAAGCCAGCTGGTGGAAATATGAATAATTC
TGTGACTAAAACTCCTAAAGACACTTTGAAATCTAGTTCTTCTGTTAGACGAACCACTTCATCATCACATAGAAGACTCAGTCAACCTGCATTGGAGAGACTTTTGACAT
CATTCCAAGAAAATCAGTATCCTGAACGAGCTACGAAGGAGAGTTTGGCACAAGAACTAGGGCTCAGTCTGAAGCAGGTTAGTAAATGGTTTGAGAACACACGTTGGAGC
ACACGCCATCCCTCCAGTGAAGGTAATAAAGCAAAGAGTGCTTCAAGAATGGGCATTCAATCATCTCAGACAAGTGGAAAACTTCCCAAGCCTGAGCAAGAATCCGGTGC
ATGTTTCAGAGACACTGATAGCAATGGTGCTCAACATCAAGAATTACTAAAGGCAATTAGTGTTGTGGCTCCATGTCAGAGTGGAGATACTGGGGATGACAAATTGGCGA
ATCAGAAAACTAAAAGACCAGAATCTACGGCTGCAAAATCCAGAAAACGGAAGGGCAGGTCAGATCACATGGCATCCCGTTCAAAGGACAGAAAGGAATCACAAAAGCGT
CCAGCTAAGTCACCAAAAGTTAATGAAATTCAAACTGCGGATAAGAAGATTAAGACAAGGAGGAGGAAATCCATTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGCGGCGGTAGCGGAAGGCAAACGGTGCGCGTGAAGGCTGGGCAAAAATCCGCTTTATTCTCTCTTCGCGCTGAGAGAAGAAGAAGAAGAAGAAGAAGAAGAACGAC
GACGACAACGACAACGACAACAACCACCATCGCCAAAAATTTTACTGTCTCACAAAACTCCATAAACGACCACCAACAAGATCCAAAACCGCCTTCTTTCACGGACAATA
CTATGCTCTTTCGCCAACACCAGGATTCAATGGATGTGACGATTAGTACCTTAGGCATTCCTTCTTTAGATGTTGAGGTGCGTGCAAGCTGGCTCCAGACACTCACGAAT
ATCAAAAAAAATAAGAATCGGAATAATATGGAAGAAAGAGATGAATATACAGAATCAAGATCTATTAATAACTCTGAAGGCGTACAAGAAGCCAAGGCCAGTGTTGAAGT
TGAAGTGCTAACTTGTCTTTCAAATGAGCAAAGGCATTCAGTTTCCGACTATCAGGAATTGGGAACAACTCCAGATCATACCAGCAAAACTGGAAGTCCAGATGAAGAAA
AATCAGGGGTCCAGCATAATATTGAGGAAGAGAATAAGGAACTTGGTTTGGGAGATGTGCTCAGTGAGTTATCAGAAAAAAGTAATCAGACTATCTCAAACCTTGCTGAT
AATGATCAAGTTGAAGCTGGCAATTTATCATCCAGTGATAAAGATACTGAAAATTTAATATTACCTATTGAAGTTGAGACGACGACGACTCTTAATGAGTGCTTGGAACT
TCCAACTGAAGATGACACCAAAAATTATATCGAACAGACAAACCCTTCCATCGAAGATTTAACCCAAAATGCTTCTATCCAAAATTTAGAAATGGTCCCCAATAATTCCC
AACAATTGGGTCACAAGGATAAGAGAATTCGGAAATCAAAAAACAAGAATTTTATGTTAAGGTCCTTTGTAAGTAGTGACAGAGTTTTGCGTTCAAGGACCCAAGAGAAA
GCTAAAGCTCCTGAACCAAGTAATGACTTGAATAATTTTACTGCTGGAGAGGAGGGAAAAAGGAAGAAGAAGAAGAGAAATATAAAGGGAAAGGGAACAAGAATTGATGA
GTTTTCATCAATCAGGAATCATTTAAGATATTTAGTGAACCGCATCAAATATGAACAGAGCTTGATTGAAGCTTATTCTAGTGAAGGCTGGAAAGGGTTCAGCTCAGATA
AATTGAAGCCTGAAAAGGAACTTCAACGGGCATCAAATGAAATAATGCGACGCAAATTGAAAATAAGAGATCTATTTCAACGTATTGATGCACTTTGTGCTGAAGGAAGG
CTTTCTGAATCTTTATTCGATTCTGAAGGGCAGATAGACAGTGAGGATATATTCTGTGCAAAATGTGGATCCAAGGAACTGTCCCTTGAGAATGACATCATACTATGTGA
TGGTGTTTGTGATCGTGGGTTCCACCTGTTCTGTTTAGAACCACCTTTGCTAAAATCAGACATTCCACCGGATGATGAGGGATGGCTATGCCCCGGATGTGATTGCAAGG
ATGACTGCATGGATCTCCTTAATGAATTTCAAGGATCAAATCTTTCTATTACTGATGGTTGGGAGAAAGTCTACCCAGAGGCTGCAGCAGCAGCTGCTGGACGAAATTCT
GATCACGCCTTGGGTCTTCCTTCAGATGATTCTGAAGATGGTGATTATGATCCTGATGTTCCAGATACTATTGACCATGGTGATGAATCAAGTTCTGATGGATCGAGTTC
TGATCAATCACATTCTGATAAATCTGGGTATGCTTCTGCTTCTGAGGAATTGGAGGTTCTAGCCAATGATGACCAGTACTTAGGTCTCCCTTCAGATGACTCGGAGGATG
ATGATTTTGATCCCAGTGCTCCAGAACGTGATGAAGGTGTTGGACAGGAAAATTCAAGTTCTGACTTTACATCTGATTCTGAGGATCTAGCTGCACTTGACGATAACGGG
TCTTCTAAAGATGATAACCTCATGTCTTCTTCTTTAAATAATACTATGCCTGCGAGAAACTCTTATGGGAAAAGTTCTGGATGCGGCCCTAGCAAGAGTGCACTACATAA
CGAGTTATCAAGTCTAATAGAGTCAGGTCTCGATAAGGATGGTCTTGAACCAGTTTCAGGAAGACGACATGTGGAACGGTTGGATTACAAGAAGCTCCATGATGAGACAT
ACGGGAATGTTCCTACCGACTCAAGCGATGACACATATGGTACTGTTTCTATGGATTCAAGTGATGACAGAGGTCATGGTAGGAGTACAAGGAAGAGAAGTCCTAAAAAC
CTGGTTCCTGCATTGTCAAAAAATGGAACTGATGATTTGAATAATATAAAAACTAAACGCAGTTCTAAGAGAACTCGTCAAAAGCCAGCTGGTGGAAATATGAATAATTC
TGTGACTAAAACTCCTAAAGACACTTTGAAATCTAGTTCTTCTGTTAGACGAACCACTTCATCATCACATAGAAGACTCAGTCAACCTGCATTGGAGAGACTTTTGACAT
CATTCCAAGAAAATCAGTATCCTGAACGAGCTACGAAGGAGAGTTTGGCACAAGAACTAGGGCTCAGTCTGAAGCAGGTTAGTAAATGGTTTGAGAACACACGTTGGAGC
ACACGCCATCCCTCCAGTGAAGGTAATAAAGCAAAGAGTGCTTCAAGAATGGGCATTCAATCATCTCAGACAAGTGGAAAACTTCCCAAGCCTGAGCAAGAATCCGGTGC
ATGTTTCAGAGACACTGATAGCAATGGTGCTCAACATCAAGAATTACTAAAGGCAATTAGTGTTGTGGCTCCATGTCAGAGTGGAGATACTGGGGATGACAAATTGGCGA
ATCAGAAAACTAAAAGACCAGAATCTACGGCTGCAAAATCCAGAAAACGGAAGGGCAGGTCAGATCACATGGCATCCCGTTCAAAGGACAGAAAGGAATCACAAAAGCGT
CCAGCTAAGTCACCAAAAGTTAATGAAATTCAAACTGCGGATAAGAAGATTAAGACAAGGAGGAGGAAATCCATTTAG
Protein sequenceShow/hide protein sequence
MSGGSGRQTVRVKAGQKSALFSLRAERRRRRRRRRTTTTTTTTTTTIAKNFTVSQNSINDHQQDPKPPSFTDNTMLFRQHQDSMDVTISTLGIPSLDVEVRASWLQTLTN
IKKNKNRNNMEERDEYTESRSINNSEGVQEAKASVEVEVLTCLSNEQRHSVSDYQELGTTPDHTSKTGSPDEEKSGVQHNIEEENKELGLGDVLSELSEKSNQTISNLAD
NDQVEAGNLSSSDKDTENLILPIEVETTTTLNECLELPTEDDTKNYIEQTNPSIEDLTQNASIQNLEMVPNNSQQLGHKDKRIRKSKNKNFMLRSFVSSDRVLRSRTQEK
AKAPEPSNDLNNFTAGEEGKRKKKKRNIKGKGTRIDEFSSIRNHLRYLVNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGR
LSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGVCDRGFHLFCLEPPLLKSDIPPDDEGWLCPGCDCKDDCMDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNS
DHALGLPSDDSEDGDYDPDVPDTIDHGDESSSDGSSSDQSHSDKSGYASASEELEVLANDDQYLGLPSDDSEDDDFDPSAPERDEGVGQENSSSDFTSDSEDLAALDDNG
SSKDDNLMSSSLNNTMPARNSYGKSSGCGPSKSALHNELSSLIESGLDKDGLEPVSGRRHVERLDYKKLHDETYGNVPTDSSDDTYGTVSMDSSDDRGHGRSTRKRSPKN
LVPALSKNGTDDLNNIKTKRSSKRTRQKPAGGNMNNSVTKTPKDTLKSSSSVRRTTSSSHRRLSQPALERLLTSFQENQYPERATKESLAQELGLSLKQVSKWFENTRWS
TRHPSSEGNKAKSASRMGIQSSQTSGKLPKPEQESGACFRDTDSNGAQHQELLKAISVVAPCQSGDTGDDKLANQKTKRPESTAAKSRKRKGRSDHMASRSKDRKESQKR
PAKSPKVNEIQTADKKIKTRRRKSI