; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh16G003550 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh16G003550
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Descriptionhomeobox protein HAT3.1-like
Genome locationCmo_Chr16:1640025..1645344
RNA-Seq ExpressionCmoCh16G003550
SyntenyCmoCh16G003550
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0003677 - DNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001356 - Homeobox domain
IPR001965 - Zinc finger, PHD-type
IPR009057 - Homeobox-like domain superfamily
IPR011011 - Zinc finger, FYVE/PHD-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR017970 - Homeobox, conserved site
IPR019786 - Zinc finger, PHD-type, conserved site
IPR019787 - Zinc finger, PHD-finger


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576929.1 Homeobox protein HAZ1, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0098.75Show/hide
Query:  MEERDEYTESRSNNNAEAVQEAKISVEAEMRTCLSNEQKHSVPDYHELEATPGYSNKTGGSDEEKPEVQQNMEEENRELGSGDVLIELSEKHNQTFSNLA
        MEERDEYTESRSNNNAEAVQEAKISVEAEM TCLSNEQKHSVPDYHELEATPGYSNKTGGSDEEKPEVQQNMEEE RELGSGDVL ELSEKHNQTFSNLA
Subjt:  MEERDEYTESRSNNNAEAVQEAKISVEAEMRTCLSNEQKHSVPDYHELEATPGYSNKTGGSDEEKPEVQQNMEEENRELGSGDVLIELSEKHNQTFSNLA

Query:  DNDQVEAGNLLCCDKDTENLIVPIEVETTTLLVDCSELPPEVVNKNYIEQMNPPNEKLTQNTPFQNLETVPSNSEQSDHKDKRILKSIKINSILRSLVSS
        DNDQVEAGNLLCCDKDTENLIVPIEVETTTLLVDCSELPPEVVNKNYIEQMNPPNEKLTQNTPFQNLETVPSNSEQSDHKDKRILKSIKINSILRSLVSS
Subjt:  DNDQVEAGNLLCCDKDTENLIVPIEVETTTLLVDCSELPPEVVNKNYIEQMNPPNEKLTQNTPFQNLETVPSNSEQSDHKDKRILKSIKINSILRSLVSS

Query:  DRNLRSKTQEKDKDPEPSNDLNNFTAEEGKGKKKERNIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMR
        DRN+RSKTQEKDKDPEPSNDLNNFTAEEGKGKK ERNIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMR
Subjt:  DRNLRSKTQEKDKDPEPSNDLNNFTAEEGKGKKKERNIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMR

Query:  RKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLL
        RKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSFENDIILCDG+CDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLL
Subjt:  RKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLL

Query:  NEFQGSRLSITDGWEKVYPEAAASAAGRNFDHASGLPSDDSVDDDDYDPDVPDTIVQDDESSPETSGYASASEELESPPNVDQYLGLPSDDSEDDDYDPS
        NEFQGSRLSITDGWEKVYPEAAASAAGRNFDHA GLPSDDS DDDDYDPDVPDTIVQDDESSPETSGYASASEELESPPNVDQYLGLPSDDSEDDDYDPS
Subjt:  NEFQGSRLSITDGWEKVYPEAAASAAGRNFDHASGLPSDDSVDDDDYDPDVPDTIVQDDESSPETSGYASASEELESPPNVDQYLGLPSDDSEDDDYDPS

Query:  APERDEDVRQESSSSDFTSDSEDLAALDSNPSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSLLESGPDKDGPEPVLGRRQVERLDYKKL
        APERDEDVRQESSSSDFTSDSEDLAALDS+PSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSLLESGPDKDGPEPVLGRRQVERLDYKKL
Subjt:  APERDEDVRQESSSSDFTSDSEDLAALDSNPSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSLLESGPDKDGPEPVLGRRQVERLDYKKL

Query:  HDETYGNVPTDSSDDTYASVSMDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDLTNIKTKHSSKRGTRQKAVAANMNKSVSKTPEDTGKASSSVRRTTP
        HDETYGNVPTDSSDDTYASVSMDSSDDQGWDSNTRKRSPK LVLALPNYRTNDDLTNIKTKHSSKRGTRQKA AANMNKSVSKTPEDTGKASSSVRRTTP
Subjt:  HDETYGNVPTDSSDDTYASVSMDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDLTNIKTKHSSKRGTRQKAVAANMNKSVSKTPEDTGKASSSVRRTTP

Query:  SSYRRLSRLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKSSSRMGIHSSQASGELHQPEQEFGAQHQELPTAD
        SSYRRLSRLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKSSSRMGIHSSQASGELHQPEQEFGAQHQELPTAD
Subjt:  SSYRRLSRLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKSSSRMGIHSSQASGELHQPEQEFGAQHQELPTAD

Query:  SVVAPCQSGDTGDVKLATQETKRSEFSATKSRKRKGRSDHAASCSKDSKESQRPPAKSPKVNEIQTAHSIKTRRRNSL
        SVVAPCQSGDTGDVKLATQETKRSEFSATKSRKRKGRSDHAASCSKDSKESQRPPAKSPKVNEIQTAHSIKTRRRNSL
Subjt:  SVVAPCQSGDTGDVKLATQETKRSEFSATKSRKRKGRSDHAASCSKDSKESQRPPAKSPKVNEIQTAHSIKTRRRNSL

XP_022922818.1 homeobox protein HAT3.1-like [Cucurbita moschata]0.0e+00100Show/hide
Query:  MEERDEYTESRSNNNAEAVQEAKISVEAEMRTCLSNEQKHSVPDYHELEATPGYSNKTGGSDEEKPEVQQNMEEENRELGSGDVLIELSEKHNQTFSNLA
        MEERDEYTESRSNNNAEAVQEAKISVEAEMRTCLSNEQKHSVPDYHELEATPGYSNKTGGSDEEKPEVQQNMEEENRELGSGDVLIELSEKHNQTFSNLA
Subjt:  MEERDEYTESRSNNNAEAVQEAKISVEAEMRTCLSNEQKHSVPDYHELEATPGYSNKTGGSDEEKPEVQQNMEEENRELGSGDVLIELSEKHNQTFSNLA

Query:  DNDQVEAGNLLCCDKDTENLIVPIEVETTTLLVDCSELPPEVVNKNYIEQMNPPNEKLTQNTPFQNLETVPSNSEQSDHKDKRILKSIKINSILRSLVSS
        DNDQVEAGNLLCCDKDTENLIVPIEVETTTLLVDCSELPPEVVNKNYIEQMNPPNEKLTQNTPFQNLETVPSNSEQSDHKDKRILKSIKINSILRSLVSS
Subjt:  DNDQVEAGNLLCCDKDTENLIVPIEVETTTLLVDCSELPPEVVNKNYIEQMNPPNEKLTQNTPFQNLETVPSNSEQSDHKDKRILKSIKINSILRSLVSS

Query:  DRNLRSKTQEKDKDPEPSNDLNNFTAEEGKGKKKERNIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMR
        DRNLRSKTQEKDKDPEPSNDLNNFTAEEGKGKKKERNIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMR
Subjt:  DRNLRSKTQEKDKDPEPSNDLNNFTAEEGKGKKKERNIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMR

Query:  RKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLL
        RKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLL
Subjt:  RKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLL

Query:  NEFQGSRLSITDGWEKVYPEAAASAAGRNFDHASGLPSDDSVDDDDYDPDVPDTIVQDDESSPETSGYASASEELESPPNVDQYLGLPSDDSEDDDYDPS
        NEFQGSRLSITDGWEKVYPEAAASAAGRNFDHASGLPSDDSVDDDDYDPDVPDTIVQDDESSPETSGYASASEELESPPNVDQYLGLPSDDSEDDDYDPS
Subjt:  NEFQGSRLSITDGWEKVYPEAAASAAGRNFDHASGLPSDDSVDDDDYDPDVPDTIVQDDESSPETSGYASASEELESPPNVDQYLGLPSDDSEDDDYDPS

Query:  APERDEDVRQESSSSDFTSDSEDLAALDSNPSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSLLESGPDKDGPEPVLGRRQVERLDYKKL
        APERDEDVRQESSSSDFTSDSEDLAALDSNPSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSLLESGPDKDGPEPVLGRRQVERLDYKKL
Subjt:  APERDEDVRQESSSSDFTSDSEDLAALDSNPSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSLLESGPDKDGPEPVLGRRQVERLDYKKL

Query:  HDETYGNVPTDSSDDTYASVSMDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDLTNIKTKHSSKRGTRQKAVAANMNKSVSKTPEDTGKASSSVRRTTP
        HDETYGNVPTDSSDDTYASVSMDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDLTNIKTKHSSKRGTRQKAVAANMNKSVSKTPEDTGKASSSVRRTTP
Subjt:  HDETYGNVPTDSSDDTYASVSMDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDLTNIKTKHSSKRGTRQKAVAANMNKSVSKTPEDTGKASSSVRRTTP

Query:  SSYRRLSRLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKSSSRMGIHSSQASGELHQPEQEFGAQHQELPTAD
        SSYRRLSRLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKSSSRMGIHSSQASGELHQPEQEFGAQHQELPTAD
Subjt:  SSYRRLSRLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKSSSRMGIHSSQASGELHQPEQEFGAQHQELPTAD

Query:  SVVAPCQSGDTGDVKLATQETKRSEFSATKSRKRKGRSDHAASCSKDSKESQRPPAKSPKVNEIQTAHSIKTRRRNSL
        SVVAPCQSGDTGDVKLATQETKRSEFSATKSRKRKGRSDHAASCSKDSKESQRPPAKSPKVNEIQTAHSIKTRRRNSL
Subjt:  SVVAPCQSGDTGDVKLATQETKRSEFSATKSRKRKGRSDHAASCSKDSKESQRPPAKSPKVNEIQTAHSIKTRRRNSL

XP_022984249.1 homeobox protein HAT3.1-like [Cucurbita maxima]0.0e+0094.99Show/hide
Query:  MEERDEYTESRSNNNAEAVQEAKISVEAEMRTCLSNEQKHSVPDYHELEATPGYSNKTGGSDEEKPEVQQNMEEENRELGSGDVLIELSEKHNQTFSNLA
        MEERDE TESRSNNNAEAVQEAK  VEAEM TCLSNEQK      HELEATPGY+NKTGG DEEKPEVQQNMEEEN+ELGSGDVL ELSEKHNQTFSNLA
Subjt:  MEERDEYTESRSNNNAEAVQEAKISVEAEMRTCLSNEQKHSVPDYHELEATPGYSNKTGGSDEEKPEVQQNMEEENRELGSGDVLIELSEKHNQTFSNLA

Query:  DNDQVEAGNLLCCDKDTENLIVPIEVETTTLLVDCSELPPEVVNKNYIEQMNPPNEKLTQNTPFQNLETVPSNSEQSDHKDKRILKSIKINSILRSLVSS
        DNDQVEAGNLLCCDKDTENLIVPIEVETTTLLVDCSELPPEVVNKNYIEQMNPP E+LTQNTPFQ LETVPSNSEQSDHKDKRILKS+KINSILRSLVSS
Subjt:  DNDQVEAGNLLCCDKDTENLIVPIEVETTTLLVDCSELPPEVVNKNYIEQMNPPNEKLTQNTPFQNLETVPSNSEQSDHKDKRILKSIKINSILRSLVSS

Query:  DRNLRSKTQEKDKDPEPSNDLNNFTAEEGKGKKKERNIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMR
        DRN+RSKTQEKDKDPEPSNDLNNFTAEEGKGKKKERNIQGKGARVDEFSSIRNHLRYLLNRI YEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMR
Subjt:  DRNLRSKTQEKDKDPEPSNDLNNFTAEEGKGKKKERNIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMR

Query:  RKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLL
        RKLKIRDVFQRIDALCGEGGLSKSLFDSQGQI SEDIFCAKCGSKELS ENDIILCDGICDRGFHQFCLEPPLLNTDIP DDEGWLCPGCDCKDDCLNLL
Subjt:  RKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLL

Query:  NEFQGSRLSITDGWEKVYPEAAASAAGRNFDHASGLPSDDSVDDDDYDPDVPDTIVQDDESSPETSGYASASEELESPPNVDQYLGLPSDDSEDDDYDPS
        NEFQGSRLSITDGWEKVYPEAAASAAGRNFDHA GLPSDDS +DDDYDPDVPDTIVQDD+SS ETSGYASASEELES PNVDQYLGLPSDDSEDDDYDPS
Subjt:  NEFQGSRLSITDGWEKVYPEAAASAAGRNFDHASGLPSDDSVDDDDYDPDVPDTIVQDDESSPETSGYASASEELESPPNVDQYLGLPSDDSEDDDYDPS

Query:  APERDEDVRQESSSSDFTSDSEDLAALDSNPSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSLLESGPDKDGPEPVLGRRQVERLDYKKL
        APERDEDVRQESSSSDFTSDSEDLAALDSNPSSKADNLVS SLNNTTSMKNPDGRSSGGGPRKS+LYNELSSLLESGPDKDGPEPVLGRRQVERLDYKKL
Subjt:  APERDEDVRQESSSSDFTSDSEDLAALDSNPSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSLLESGPDKDGPEPVLGRRQVERLDYKKL

Query:  HDETYGNVPTDSSDDTYASVSMDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDLTNIKTKHSSKRGTRQKAVAANMNKSVSKTPEDTGKASSSVRRTTP
        HDETYGNVPTDSSDDTYAS+SMDSSDDQGWDSNTRKRSPKTLVLALPNYR NDDLTN+KTKHSSKRGTRQKA A NMNKSV+KTPEDTGKASSSVRRTT 
Subjt:  HDETYGNVPTDSSDDTYASVSMDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDLTNIKTKHSSKRGTRQKAVAANMNKSVSKTPEDTGKASSSVRRTTP

Query:  SSYRRLSRLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKSSSRMGIHSSQASGELHQPEQEFGAQHQELPTAD
        SSYRRLS+LALERLLASFQENQYPERATKESLAQELGLSVKQV+KWFTNTRWSTRHPSSVEGNKAKSSSRMGIHSSQASGELHQPE+EFGAQHQELPTAD
Subjt:  SSYRRLSRLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKSSSRMGIHSSQASGELHQPEQEFGAQHQELPTAD

Query:  SVVAPCQSGDTGDVKLATQETKRSEFSATKSRKRKGRSDHAASCSKDSKESQRPPAKSPKVNEIQTAHSIKTRRRNSL
        SVVAPCQSGDTGDVKLATQ+TKRSEFSA KSRKRKGRSDHAAS SKDSKESQRPPAKSPKVNEIQTAHSIKTRRRNSL
Subjt:  SVVAPCQSGDTGDVKLATQETKRSEFSATKSRKRKGRSDHAASCSKDSKESQRPPAKSPKVNEIQTAHSIKTRRRNSL

XP_023551733.1 homeobox protein HAT3.1-like [Cucurbita pepo subsp. pepo]0.0e+0095.56Show/hide
Query:  MEERDEYTESRSNNNAEAVQEAKISVEAEMRTCLSNEQKHSVPDYHELEATPGYSNKTGGSDEEKPEVQQNMEEENRELGSGDVLIELSEKHNQTFSNLA
        MEERDEYTESRSNNNAEAVQEAKISVEAEM TCLSNEQKHSVPDYHELEATP Y+NKTGG DEEKPEVQQNMEEEN+ELGSGDVL ELSEKHN+TFSNLA
Subjt:  MEERDEYTESRSNNNAEAVQEAKISVEAEMRTCLSNEQKHSVPDYHELEATPGYSNKTGGSDEEKPEVQQNMEEENRELGSGDVLIELSEKHNQTFSNLA

Query:  DNDQVEAGNLLCCDKDTENLIVPIEVETTTLLVDCSELPPEVVNKNYIEQMNPPNEKLTQNTPFQNLETVPSNSEQSDHKDKRILKSIKINSILRSLVSS
        DNDQVEAGNLLCCDKDTENLIVPIEVETTTLLVDCSELPPEVVNKNYIEQMNPP E+LTQNTPFQNLETVPSNSEQSDHKDKRILKS+KI SILRSLVSS
Subjt:  DNDQVEAGNLLCCDKDTENLIVPIEVETTTLLVDCSELPPEVVNKNYIEQMNPPNEKLTQNTPFQNLETVPSNSEQSDHKDKRILKSIKINSILRSLVSS

Query:  DRNLRSKTQEKDKDPEPSNDLNNFTAEEGKGKKKERNIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMR
        DRN+RSKTQEKDKDPEPSNDLNNFTAEEGKGKKKERNIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMR
Subjt:  DRNLRSKTQEKDKDPEPSNDLNNFTAEEGKGKKKERNIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMR

Query:  RKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLL
        RKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELS ENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLL
Subjt:  RKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLL

Query:  NEFQGSRLSITDGWEKVYPEAAASAAGRNFDHASGLPSDDSVDDDDYDPDVPDTIVQDDESSPETSGYASASEELESPPNVDQYLGLPSDDSEDDDYDPS
        NEFQGSRLSITDGWEKVYPEAAASAAGRNFDHA GLPSDDS +DDDYDPDVPDTIVQDDESS ETSGYASASEELESP NVDQYLGLPSDDS+DDDYDPS
Subjt:  NEFQGSRLSITDGWEKVYPEAAASAAGRNFDHASGLPSDDSVDDDDYDPDVPDTIVQDDESSPETSGYASASEELESPPNVDQYLGLPSDDSEDDDYDPS

Query:  APERDEDVRQESSSSDFTSDSEDLAALDSNPSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSLLESGPDKDGPEPVLGRRQVERLDYKKL
        APERDEDVRQESSSSDFTSDSEDLAALDS+PSSKADNLVS SLNNTTS KNPDGRSSGGGPRKSALYNELSSLLES      PEPVLGRRQVERLDYKKL
Subjt:  APERDEDVRQESSSSDFTSDSEDLAALDSNPSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSLLESGPDKDGPEPVLGRRQVERLDYKKL

Query:  HDETYGNVPTDSSDDTYASVSMDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDLTNIKTKHSSKRGTRQKAVAANMNKSVSKTPEDTGKASSSVRRTTP
        HDETYGNVPTDSSDDTYAS+S DSSDDQGWDSNTRKRSPKTLVLALPNYRTNDD+TN+KTKHSSKRGTRQKA A NMNKSV+KTPEDTGKASSSVRRTTP
Subjt:  HDETYGNVPTDSSDDTYASVSMDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDLTNIKTKHSSKRGTRQKAVAANMNKSVSKTPEDTGKASSSVRRTTP

Query:  SSYRRLSRLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKSSSRMGIHSSQASGELHQPEQEFGAQHQELPTAD
        SSYRRLS+LALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKSSSRMGI SSQASGELHQPEQEFGAQHQELPT D
Subjt:  SSYRRLSRLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKSSSRMGIHSSQASGELHQPEQEFGAQHQELPTAD

Query:  SVVAPCQSGDTGDVKLATQETKRSEFSATKSRKRKGRSDHAASCSKDSKESQRPPAKSPKVNEIQTAHSIKTRRRNSL
        SVVAPCQSGDTGDVKLATQETKRSEFSA KSRKRKGRSDHAASCSKDSKESQRPPAKSPKVNEIQTAHSIKTRRRNSL
Subjt:  SVVAPCQSGDTGDVKLATQETKRSEFSATKSRKRKGRSDHAASCSKDSKESQRPPAKSPKVNEIQTAHSIKTRRRNSL

XP_038876083.1 homeobox protein HAT3.1 isoform X1 [Benincasa hispida]0.0e+0075.66Show/hide
Query:  MEERDEY--TESRSNNNAEAVQEAKISVEAEMRTCLSNEQKHSVPDYHELEATPGYSNKTGGSDEEKPEVQQNMEEENRELGSGDVLIELSEKHNQTFSN
        MEERDE   TESR NN+AE VQEAK SVE E+ TCLSNE  HS   Y EL  TP YS+KT G DEEKP VQQNM     ELGSG +L EL EK NQT SN
Subjt:  MEERDEY--TESRSNNNAEAVQEAKISVEAEMRTCLSNEQKHSVPDYHELEATPGYSNKTGGSDEEKPEVQQNMEEENRELGSGDVLIELSEKHNQTFSN

Query:  LADNDQVEAGNLLCCDKDTENLIVPIEVETTTLLVDCSELPPEVVNKNYIEQMNPPNEKLTQNTPFQNLETVPSNSEQSDHKDKRILKSIKINSILRSLV
         ADNDQVEAGNLL  DKDTENL +PIEVETTTLL +CSELP E VNKN+IEQMNPP E LTQN   QNLE +PSNS+Q   KDK ILKS K N  LRSLV
Subjt:  LADNDQVEAGNLLCCDKDTENLIVPIEVETTTLLVDCSELPPEVVNKNYIEQMNPPNEKLTQNTPFQNLETVPSNSEQSDHKDKRILKSIKINSILRSLV

Query:  SSDRNLRSKTQEKDKDPEPSNDLNNFTAEEG--KGKKKERNIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASN
        SSDR LRS+TQEK K PEPSN LNNFTAEEG  K KKK+RNIQGK ARVDE+SSIR  LRYLLNRI YEQ+LIEAYSSEGWKGFSSDKLKPEKELQRASN
Subjt:  SSDRNLRSKTQEKDKDPEPSNDLNNFTAEEG--KGKKKERNIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASN

Query:  EIMRRKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDC
        EIM+RKLKIRD+FQRIDALC EG LS+SLFDS+GQIDSEDIFCAKCGSKELS ENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDC
Subjt:  EIMRRKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDC

Query:  LNLLNEFQGSRLSITDGWEKVYPEAAASAAGRNFDHASGLPSDDSVDDDDYDPDVPDTIVQDDESS------------PETSGYASASEELESPPNVDQY
        L+LLNEFQGS LSITD WEKVYPEAAA+AAG+N DH  GLPSDDS +D DYDPDVPDTI QD+ESS             +TSGYASASE LE PPN DQY
Subjt:  LNLLNEFQGSRLSITDGWEKVYPEAAASAAGRNFDHASGLPSDDSVDDDDYDPDVPDTIVQDDESS------------PETSGYASASEELESPPNVDQY

Query:  LGLPSDDSEDDDYDPSAPERDEDVRQESSSSDFTSDSEDLAALDSNPSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSLLESGPDKDGPE
        LGLPSDDSEDDDYDPS PE DE VR+ESSSSDFTSDSEDLAALD+N  SK D+ VS SLNNT S+KN +G+SSG GP KSAL+NELSSL      KDG E
Subjt:  LGLPSDDSEDDDYDPSAPERDEDVRQESSSSDFTSDSEDLAALDSNPSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSLLESGPDKDGPE

Query:  PVLGRRQVERLDYKKLHDETYGNVPTDSSDDTYASVSMDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDLTNIKTKHSSKRGTRQKAVAANMNKSVSKT
        PV GRRQVERLDYKKLHDETYGNVPTDSSDDTY S SMDSS D+GWDS+TRKR P+ LVLAL N  TNDDLTN+KTK S KR TRQKA A N+N SV++T
Subjt:  PVLGRRQVERLDYKKLHDETYGNVPTDSSDDTYASVSMDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDLTNIKTKHSSKRGTRQKAVAANMNKSVSKT

Query:  PEDTGKASSSVRRTTPSSYRRLSRLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKSSSRMGIHSSQASGELHQ
        P DT K+SSS R+TT SS RRLS+ ALERL ASFQEN+YP+RATKESLAQELGLS+KQVS+WF NTRWSTRHPSS  GN+AKSSSRM   SS+ASGEL +
Subjt:  PEDTGKASSSVRRTTPSSYRRLSRLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKSSSRMGIHSSQASGELHQ

Query:  PEQEF----------GAQHQELPTADSVVAPCQSGDTGDVKLATQETKRSEFSATKSRKRKGRSDHAASCSKDSKESQRPPAKSPKVNEIQTAHSIKTRR
         EQE           GAQHQ+LPTA+S   PCQSGDTGD KL T++TKR+E SATKSRKRK  SDH AS +KD + SQRPPAKSPKVNEIQTA   KTRR
Subjt:  PEQEF----------GAQHQELPTADSVVAPCQSGDTGDVKLATQETKRSEFSATKSRKRKGRSDHAASCSKDSKESQRPPAKSPKVNEIQTAHSIKTRR

Query:  RNSL
        R S+
Subjt:  RNSL

TrEMBL top hitse value%identityAlignment
A0A1S3C283 pathogenesis-related homeodomain protein0.0e+0075.28Show/hide
Query:  MEERDEY--TESRSNNNAEAVQEAKISVEAEMRTCLSNEQKHSVPDYHELEATPGYSNKTGGSDEEKPEVQQNMEEENRELGSGDVLIELSEKHNQTFSN
        MEERDE   TESR N  AEAVQEAK SVE E+RTCLSNE  +S   Y EL  TP +S KT G DEEK  VQQNM     ELGSG +L ELSEK NQT SN
Subjt:  MEERDEY--TESRSNNNAEAVQEAKISVEAEMRTCLSNEQKHSVPDYHELEATPGYSNKTGGSDEEKPEVQQNMEEENRELGSGDVLIELSEKHNQTFSN

Query:  LADNDQVEAGNLLCCDKDTENLIVPIEVETTTLLVDCSELPPEVVNKNYIEQMNPPNEKLTQNTPFQNLETVPSNSEQSDHKDKRILKSIKINSILRSLV
         ADNDQVEAGN L  DKDT+NL + IE ETTTLL +CSELP E V KNYIE+MNPP E LTQ T  Q+LET+PSNS+Q DHKD+R  KS K N  LRSLV
Subjt:  LADNDQVEAGNLLCCDKDTENLIVPIEVETTTLLVDCSELPPEVVNKNYIEQMNPPNEKLTQNTPFQNLETVPSNSEQSDHKDKRILKSIKINSILRSLV

Query:  SSDRNLRSKTQEKDKDPEPSNDLNNFTAEE--GKGKKKERNIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASN
        SSDR LRS+TQEK K PEPSNDLNNFTAEE   + KKK+RNIQGKGARVDE+SSIRNHLRYLLNRI+YEQ+LIEAYSSEGWKGFSSDKLKPEKELQRASN
Subjt:  SSDRNLRSKTQEKDKDPEPSNDLNNFTAEE--GKGKKKERNIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASN

Query:  EIMRRKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDC
        EIMRRKLKIRD+FQRID LC EG LS+SLFDS+GQIDSEDIFCAKCGSKELS ENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDC
Subjt:  EIMRRKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDC

Query:  LNLLNEFQGSRLSITDGWEKVYPEAAASAAGRNFDHASGLPSDDSVDDDDYDPDVPDTIVQD----------DESSPETSGYASASEELESPPNVDQYLG
        L+LLNEFQGS LSITDGWEKVYPEAAA AAGRN D   GLPSDDS +D DYDPD+PDTI QD          D+S+ +TSGYASASE LE PPN DQYLG
Subjt:  LNLLNEFQGSRLSITDGWEKVYPEAAASAAGRNFDHASGLPSDDSVDDDDYDPDVPDTIVQD----------DESSPETSGYASASEELESPPNVDQYLG

Query:  LPSDDSEDDDYDPSAPERDEDVRQESSSSDFTSDSEDLAALDSNPSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSLLESGPDKDGPEPV
        LPSDDSED+DYDPS PE DE  RQESSSSDFTSDSEDLAAL++N SSK D+LVS SLNNT  +KN +GRSS  GP KS L+NELSSLL+SG DKDG EP+
Subjt:  LPSDDSEDDDYDPSAPERDEDVRQESSSSDFTSDSEDLAALDSNPSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSLLESGPDKDGPEPV

Query:  LGRRQVERLDYKKLHDETYGNVPTDSSDDTYASVSMDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDLTNIKTKHSSKRGTRQKAVAANMNKSVSKTPE
         GRRQVERLDYKKLHDETYGNVPT+SSDDTY S ++DSSDD+G DS TRKR PKTLVLAL N  +NDDLTN+KTK S KR TRQK  A N+N SV++TP 
Subjt:  LGRRQVERLDYKKLHDETYGNVPTDSSDDTYASVSMDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDLTNIKTKHSSKRGTRQKAVAANMNKSVSKTPE

Query:  DTGKASSSVRRTTPSSYRRLSRLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKSSSRMGIHSSQASGELHQPE
        DT K+SSSVR+ T SS RRLS+ ALERL ASFQEN+YP+RATKESLAQELGL++KQVSKWF NTRWSTRHPSS  G KAKSSSRM IH SQASGEL + E
Subjt:  DTGKASSSVRRTTPSSYRRLSRLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKSSSRMGIHSSQASGELHQPE

Query:  QEF----------GAQHQELPTADSVVAPCQSGDTGDVKLATQETKRSEFSATKSRKRKGRSDHAASCSKDSKESQRPPAKSPKVNEIQTAHSIKTRRRN
        QE           GA+HQ+LP A+SVVA CQSGDTGD KL T++TKR E SATKSRKRKGRSD+ AS SKD + S RPPAKSPKVNE QTA   KTRRR 
Subjt:  QEF----------GAQHQELPTADSVVAPCQSGDTGDVKLATQETKRSEFSATKSRKRKGRSDHAASCSKDSKESQRPPAKSPKVNEIQTAHSIKTRRRN

Query:  SL
        S+
Subjt:  SL

A0A6J1D6Q5 homeobox protein HAT3.1 isoform X10.0e+0073.47Show/hide
Query:  MEERDEYTESRSNNNAEAVQEAKISVEAEMRTCLSNEQKHSVPDYHELEATPGYSNKTGGSDEEKPEVQQNMEEENRELGSGDVLIELSEKHNQTFSNLA
        MEER EYTE R NNN EAVQEAK SV  E+ TC SNEQ HS+PD  EL  TP  ++KT G D+EK  VQQNMEEE +ELGSGDVL EL EK+NQT S LA
Subjt:  MEERDEYTESRSNNNAEAVQEAKISVEAEMRTCLSNEQKHSVPDYHELEATPGYSNKTGGSDEEKPEVQQNMEEENRELGSGDVLIELSEKHNQTFSNLA

Query:  DNDQVEAGNLLCCDKDTENLIVPIEVETTTLLVDCSELPPEVVNKNYIEQMNPPNEKLTQNTPFQNLETVP----SNSEQSDHKDKRILKSIKINSILRS
        + DQVEAGNLL  D +TENLI+PIE+ETTT L +CSELPPE  NKN I+Q+NPP E LTQNT  Q LETVP    S S+Q  HKDK+ILKS K N +LRS
Subjt:  DNDQVEAGNLLCCDKDTENLIVPIEVETTTLLVDCSELPPEVVNKNYIEQMNPPNEKLTQNTPFQNLETVP----SNSEQSDHKDKRILKSIKINSILRS

Query:  LVSSDRNLRSKTQEKDKDPEPSNDLNNFTAEEGKGKKKERNIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASN
        LVSSDR LRS+TQEK K PEPSN+LN  TA EGK KKK+RNI+GKGA  DEFSSIRN LRYL+NRIKYEQ+LI+AYSSEGWKGFSSDKLKPEKELQRAS+
Subjt:  LVSSDRNLRSKTQEKDKDPEPSNDLNNFTAEEGKGKKKERNIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASN

Query:  EIMRRKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDC
        EIMR KLKIRD+FQ +D+LC EG LS+SLFDS+GQIDSEDIFCAKCGSKELS ENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDC
Subjt:  EIMRRKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDC

Query:  LNLLNEFQGSRLSITDGWEKVYPEAAASAAGRNFDHASGLPSDDSVDDDDYDPDVPDTIVQDDE-----SSPETSGYASASEELESPPNVDQYLGLPSDD
        L+LLNEFQGS LSITDGWEKVYPEAAA+AAG+N DHA GLPSDDS +D DYDPD PDTI Q+DE     SS + SGYASASEELE+ PN DQYLGLPSDD
Subjt:  LNLLNEFQGSRLSITDGWEKVYPEAAASAAGRNFDHASGLPSDDSVDDDDYDPDVPDTIVQDDE-----SSPETSGYASASEELESPPNVDQYLGLPSDD

Query:  SEDDDYDPSAPERDEDVRQESSSSDFTSDSEDLAALDSNPSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSLLESGPDKDGPEPVLGRRQ
        SEDDDY+P APE DE V+QESS SDFTSDSEDLAALD               + TT ++N +G+ SG GPR S L+NEL SLLESGPDKDG EPV GRRQ
Subjt:  SEDDDYDPSAPERDEDVRQESSSSDFTSDSEDLAALDSNPSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSLLESGPDKDGPEPVLGRRQ

Query:  VERLDYKKLHDETYGNVPTDSSDDTYASVSMDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDLTNIKTKHSSKRGTRQKAVAANMNKSVSKTPEDTGKA
        VERLDYKKLHDETYGNVP+DSSDDT+ S+S+DSSDD+G  S TRKRSPK LV AL    TNDDL N KTK S KR T QK  A NM  SV++TPED+ K+
Subjt:  VERLDYKKLHDETYGNVPTDSSDDTYASVSMDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDLTNIKTKHSSKRGTRQKAVAANMNKSVSKTPEDTGKA

Query:  SSSVRRTTPSSYRRLSRLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKSSSRMGIHSSQASGELHQPEQEF--
        SSSVRRT  SS RRLS+ ALERLLASFQENQYP+RATKESLAQELGLS+KQVSKWF NTRWSTRHPSS+E NKAKS+ RMGI SS+ SG+L +PEQE   
Subjt:  SSSVRRTTPSSYRRLSRLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKSSSRMGIHSSQASGELHQPEQEF--

Query:  --------GAQHQELPTADSVVAPCQSGDTGDVKLATQETKRSEFSATKSRKRKGRSDHAASCSKDSKESQRPPAKSPKVNEIQTAHSIKTRRRNSL
                GAQHQ  P  D  VAPCQSGDT D KLATQ+T R E +ATKSRKRKGRSDH AS SKD KESQ+PPAKSPKVN+IQTA  ++TRRR S+
Subjt:  --------GAQHQELPTADSVVAPCQSGDTGDVKLATQETKRSEFSATKSRKRKGRSDHAASCSKDSKESQRPPAKSPKVNEIQTAHSIKTRRRNSL

A0A6J1E4I6 homeobox protein HAT3.1-like0.0e+00100Show/hide
Query:  MEERDEYTESRSNNNAEAVQEAKISVEAEMRTCLSNEQKHSVPDYHELEATPGYSNKTGGSDEEKPEVQQNMEEENRELGSGDVLIELSEKHNQTFSNLA
        MEERDEYTESRSNNNAEAVQEAKISVEAEMRTCLSNEQKHSVPDYHELEATPGYSNKTGGSDEEKPEVQQNMEEENRELGSGDVLIELSEKHNQTFSNLA
Subjt:  MEERDEYTESRSNNNAEAVQEAKISVEAEMRTCLSNEQKHSVPDYHELEATPGYSNKTGGSDEEKPEVQQNMEEENRELGSGDVLIELSEKHNQTFSNLA

Query:  DNDQVEAGNLLCCDKDTENLIVPIEVETTTLLVDCSELPPEVVNKNYIEQMNPPNEKLTQNTPFQNLETVPSNSEQSDHKDKRILKSIKINSILRSLVSS
        DNDQVEAGNLLCCDKDTENLIVPIEVETTTLLVDCSELPPEVVNKNYIEQMNPPNEKLTQNTPFQNLETVPSNSEQSDHKDKRILKSIKINSILRSLVSS
Subjt:  DNDQVEAGNLLCCDKDTENLIVPIEVETTTLLVDCSELPPEVVNKNYIEQMNPPNEKLTQNTPFQNLETVPSNSEQSDHKDKRILKSIKINSILRSLVSS

Query:  DRNLRSKTQEKDKDPEPSNDLNNFTAEEGKGKKKERNIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMR
        DRNLRSKTQEKDKDPEPSNDLNNFTAEEGKGKKKERNIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMR
Subjt:  DRNLRSKTQEKDKDPEPSNDLNNFTAEEGKGKKKERNIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMR

Query:  RKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLL
        RKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLL
Subjt:  RKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLL

Query:  NEFQGSRLSITDGWEKVYPEAAASAAGRNFDHASGLPSDDSVDDDDYDPDVPDTIVQDDESSPETSGYASASEELESPPNVDQYLGLPSDDSEDDDYDPS
        NEFQGSRLSITDGWEKVYPEAAASAAGRNFDHASGLPSDDSVDDDDYDPDVPDTIVQDDESSPETSGYASASEELESPPNVDQYLGLPSDDSEDDDYDPS
Subjt:  NEFQGSRLSITDGWEKVYPEAAASAAGRNFDHASGLPSDDSVDDDDYDPDVPDTIVQDDESSPETSGYASASEELESPPNVDQYLGLPSDDSEDDDYDPS

Query:  APERDEDVRQESSSSDFTSDSEDLAALDSNPSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSLLESGPDKDGPEPVLGRRQVERLDYKKL
        APERDEDVRQESSSSDFTSDSEDLAALDSNPSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSLLESGPDKDGPEPVLGRRQVERLDYKKL
Subjt:  APERDEDVRQESSSSDFTSDSEDLAALDSNPSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSLLESGPDKDGPEPVLGRRQVERLDYKKL

Query:  HDETYGNVPTDSSDDTYASVSMDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDLTNIKTKHSSKRGTRQKAVAANMNKSVSKTPEDTGKASSSVRRTTP
        HDETYGNVPTDSSDDTYASVSMDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDLTNIKTKHSSKRGTRQKAVAANMNKSVSKTPEDTGKASSSVRRTTP
Subjt:  HDETYGNVPTDSSDDTYASVSMDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDLTNIKTKHSSKRGTRQKAVAANMNKSVSKTPEDTGKASSSVRRTTP

Query:  SSYRRLSRLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKSSSRMGIHSSQASGELHQPEQEFGAQHQELPTAD
        SSYRRLSRLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKSSSRMGIHSSQASGELHQPEQEFGAQHQELPTAD
Subjt:  SSYRRLSRLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKSSSRMGIHSSQASGELHQPEQEFGAQHQELPTAD

Query:  SVVAPCQSGDTGDVKLATQETKRSEFSATKSRKRKGRSDHAASCSKDSKESQRPPAKSPKVNEIQTAHSIKTRRRNSL
        SVVAPCQSGDTGDVKLATQETKRSEFSATKSRKRKGRSDHAASCSKDSKESQRPPAKSPKVNEIQTAHSIKTRRRNSL
Subjt:  SVVAPCQSGDTGDVKLATQETKRSEFSATKSRKRKGRSDHAASCSKDSKESQRPPAKSPKVNEIQTAHSIKTRRRNSL

A0A6J1FNP3 homeobox protein HAT3.10.0e+0072.08Show/hide
Query:  MEERDEYTESRSNNNAEAVQEAKISVEAEMRTCLSNEQKHSVPDYHELEATPGYSNKTGGSDEEKPEVQQNMEEENRELGSGDVLIELSEKHNQTFSNLA
        MEERDEYTESR+ N + AVQEAK SVE E+ T L+NEQ HS P+Y EL     +++KTG  DEEKP V+QNMEE+ +ELG G+    L EK +QT S LA
Subjt:  MEERDEYTESRSNNNAEAVQEAKISVEAEMRTCLSNEQKHSVPDYHELEATPGYSNKTGGSDEEKPEVQQNMEEENRELGSGDVLIELSEKHNQTFSNLA

Query:  DNDQVEAGNLLCCDKDTENLIVPIEVETTTLLVDCSELPPEVVNKNYIEQMNPPNEKLTQNTPFQNLETVPSNSEQSDHKDKRILKSIKINSILRSLVSS
        DNDQ EAGNLL  DKDTENLI+PIEVETT LL +CSE P E  NKNYIEQ NPP E   QNT   NL  VP NS +   KDKR+LKS K N ILRSL+SS
Subjt:  DNDQVEAGNLLCCDKDTENLIVPIEVETTTLLVDCSELPPEVVNKNYIEQMNPPNEKLTQNTPFQNLETVPSNSEQSDHKDKRILKSIKINSILRSLVSS

Query:  DRNLRSKTQEKDKDPEPSNDLNNFTA-EEGKGKKKERNIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIM
        DR LRS+TQ+K K PEPSNDL+N TA EEGKGKKK R I+GKGARVDEFSSIRNHLRYL+NRIKYEQ+LIEAYSSEGWKGFSSDKLKPEKELQRASNEIM
Subjt:  DRNLRSKTQEKDKDPEPSNDLNNFTA-EEGKGKKKERNIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIM

Query:  RRKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNL
        RRKLKIRD+FQRIDALC EG  S++LFDS+GQIDSEDIFC KCGSKELS ENDIILCDG+CDRGFHQFCLEPPLLN+DIPPDDEGWLCPGCDCKDDC++L
Subjt:  RRKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNL

Query:  LNEFQGSRLSITDGWEKVYPEAAASAAGRNFDHASGLPSDDSVDDDDYDPDVPDTIVQDDE---------------SSPETSGY--ASASEELESPPNVD
        LNEFQGS LSITDGWEKV+PEAAA+AAG++ DH   LPSDDS DD DYDPDVPD I QD E               SS + SGY  ASASEELE+PPN D
Subjt:  LNEFQGSRLSITDGWEKVYPEAAASAAGRNFDHASGLPSDDSVDDDDYDPDVPDTIVQDDE---------------SSPETSGY--ASASEELESPPNVD

Query:  QYLGLPSDDSEDDDYDPSAPERDEDVRQESSSSDFTSDSEDLAALDSNPSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSLLESGPDKDG
        QYLGLPSDDSEDDDYDP AP RDE V QESSSSDFTSDSEDLAAL  N SSK DN+ S  LNNT  ++N DG+SSG GP K+A +N+LSSL+ SGPD+ G
Subjt:  QYLGLPSDDSEDDDYDPSAPERDEDVRQESSSSDFTSDSEDLAALDSNPSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSLLESGPDKDG

Query:  PEPVLGRRQVERLDYKKLHDETYGNVPTDSSDDTYASVSMDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDLTNIKTKHSSKRGTRQKAVAANMNKSVS
         E V GRR VERLDYKKLHDET+GNVPTDSSDDTY S S+DSSDD+G   +TRK SPK  V AL    T DDL NIKTK SSKR TRQK  A NM+ SV+
Subjt:  PEPVLGRRQVERLDYKKLHDETYGNVPTDSSDDTYASVSMDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDLTNIKTKHSSKRGTRQKAVAANMNKSVS

Query:  KTPEDTGKASSSVRRTTPSSYRRLSRLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKSSSRMGIHSSQASGEL
        KTPE T K+SSSVRRTT SS+RRLS+  LERLLASFQENQYPERATKESLA+ELGLS+KQVSKWF NTRWSTRHPSS E NKAKS+SRMG  SSQ S + 
Subjt:  KTPEDTGKASSSVRRTTPSSYRRLSRLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKSSSRMGIHSSQASGEL

Query:  HQPEQEF----------GAQHQELPTADSVVAPCQSGDTGDVKLATQETKRSEFSATKSRKRKGRSDHAASCSKDSKESQRPPAKSPKVNEIQTAHSIKT
         +PEQE           GAQHQE P A SVVAPCQSG TGD KLA Q+ KR E +ATKSRKRKGRSD  AS SKD K+S++PPAKS KV+EIQTA  +K 
Subjt:  HQPEQEF----------GAQHQELPTADSVVAPCQSGDTGDVKLATQETKRSEFSATKSRKRKGRSDHAASCSKDSKESQRPPAKSPKVNEIQTAHSIKT

Query:  RRRNSL
        RRR S+
Subjt:  RRRNSL

A0A6J1J9X9 homeobox protein HAT3.1-like0.0e+0094.99Show/hide
Query:  MEERDEYTESRSNNNAEAVQEAKISVEAEMRTCLSNEQKHSVPDYHELEATPGYSNKTGGSDEEKPEVQQNMEEENRELGSGDVLIELSEKHNQTFSNLA
        MEERDE TESRSNNNAEAVQEAK  VEAEM TCLSNEQK      HELEATPGY+NKTGG DEEKPEVQQNMEEEN+ELGSGDVL ELSEKHNQTFSNLA
Subjt:  MEERDEYTESRSNNNAEAVQEAKISVEAEMRTCLSNEQKHSVPDYHELEATPGYSNKTGGSDEEKPEVQQNMEEENRELGSGDVLIELSEKHNQTFSNLA

Query:  DNDQVEAGNLLCCDKDTENLIVPIEVETTTLLVDCSELPPEVVNKNYIEQMNPPNEKLTQNTPFQNLETVPSNSEQSDHKDKRILKSIKINSILRSLVSS
        DNDQVEAGNLLCCDKDTENLIVPIEVETTTLLVDCSELPPEVVNKNYIEQMNPP E+LTQNTPFQ LETVPSNSEQSDHKDKRILKS+KINSILRSLVSS
Subjt:  DNDQVEAGNLLCCDKDTENLIVPIEVETTTLLVDCSELPPEVVNKNYIEQMNPPNEKLTQNTPFQNLETVPSNSEQSDHKDKRILKSIKINSILRSLVSS

Query:  DRNLRSKTQEKDKDPEPSNDLNNFTAEEGKGKKKERNIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMR
        DRN+RSKTQEKDKDPEPSNDLNNFTAEEGKGKKKERNIQGKGARVDEFSSIRNHLRYLLNRI YEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMR
Subjt:  DRNLRSKTQEKDKDPEPSNDLNNFTAEEGKGKKKERNIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMR

Query:  RKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLL
        RKLKIRDVFQRIDALCGEGGLSKSLFDSQGQI SEDIFCAKCGSKELS ENDIILCDGICDRGFHQFCLEPPLLNTDIP DDEGWLCPGCDCKDDCLNLL
Subjt:  RKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLL

Query:  NEFQGSRLSITDGWEKVYPEAAASAAGRNFDHASGLPSDDSVDDDDYDPDVPDTIVQDDESSPETSGYASASEELESPPNVDQYLGLPSDDSEDDDYDPS
        NEFQGSRLSITDGWEKVYPEAAASAAGRNFDHA GLPSDDS +DDDYDPDVPDTIVQDD+SS ETSGYASASEELES PNVDQYLGLPSDDSEDDDYDPS
Subjt:  NEFQGSRLSITDGWEKVYPEAAASAAGRNFDHASGLPSDDSVDDDDYDPDVPDTIVQDDESSPETSGYASASEELESPPNVDQYLGLPSDDSEDDDYDPS

Query:  APERDEDVRQESSSSDFTSDSEDLAALDSNPSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSLLESGPDKDGPEPVLGRRQVERLDYKKL
        APERDEDVRQESSSSDFTSDSEDLAALDSNPSSKADNLVS SLNNTTSMKNPDGRSSGGGPRKS+LYNELSSLLESGPDKDGPEPVLGRRQVERLDYKKL
Subjt:  APERDEDVRQESSSSDFTSDSEDLAALDSNPSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSLLESGPDKDGPEPVLGRRQVERLDYKKL

Query:  HDETYGNVPTDSSDDTYASVSMDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDLTNIKTKHSSKRGTRQKAVAANMNKSVSKTPEDTGKASSSVRRTTP
        HDETYGNVPTDSSDDTYAS+SMDSSDDQGWDSNTRKRSPKTLVLALPNYR NDDLTN+KTKHSSKRGTRQKA A NMNKSV+KTPEDTGKASSSVRRTT 
Subjt:  HDETYGNVPTDSSDDTYASVSMDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDLTNIKTKHSSKRGTRQKAVAANMNKSVSKTPEDTGKASSSVRRTTP

Query:  SSYRRLSRLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKSSSRMGIHSSQASGELHQPEQEFGAQHQELPTAD
        SSYRRLS+LALERLLASFQENQYPERATKESLAQELGLSVKQV+KWFTNTRWSTRHPSSVEGNKAKSSSRMGIHSSQASGELHQPE+EFGAQHQELPTAD
Subjt:  SSYRRLSRLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKSSSRMGIHSSQASGELHQPEQEFGAQHQELPTAD

Query:  SVVAPCQSGDTGDVKLATQETKRSEFSATKSRKRKGRSDHAASCSKDSKESQRPPAKSPKVNEIQTAHSIKTRRRNSL
        SVVAPCQSGDTGDVKLATQ+TKRSEFSA KSRKRKGRSDHAAS SKDSKESQRPPAKSPKVNEIQTAHSIKTRRRNSL
Subjt:  SVVAPCQSGDTGDVKLATQETKRSEFSATKSRKRKGRSDHAASCSKDSKESQRPPAKSPKVNEIQTAHSIKTRRRNSL

SwissProt top hitse value%identityAlignment
P46605 Homeobox protein HOX1A1.9e-10041.75Show/hide
Query:  NSILRSLVSSDRNLRSKTQEKDKDPEPSNDLNNFTAEEGKGKKKERNIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKE
        NS +R L S+  +  + T+      +P+             K+++ +     +  DEFS IR  +RY+LNR+ YEQ+LIEAY+SEGWK  S DK++PEKE
Subjt:  NSILRSLVSSDRNLRSKTQEKDKDPEPSNDLNNFTAEEGKGKKKERNIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKE

Query:  LQRASNEIMRRKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGC
        L+RA +EI+R KL+IR+VF+ ID+L  +G + ++LFDS+G+I  EDIFC+ CGS + +  NDIILCDG CDRGFHQ CL PPL   DIP  DEGWLCP C
Subjt:  LQRASNEIMRRKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGC

Query:  DCKDDCLNLLNEFQGSRLSITDGWEKVYPEAAASAAGRNFDHASGLPSDDSVDDDDYDPDVPD--TIVQDDESSPETSGYASASEEL-------ESPPNV
        DCK DC++L+NE  GS +SI D WEKV+P+AAA A     D A  LPSDDS DD+D+DP++P+   + +D+ESS E     S S++        +S P +
Subjt:  DCKDDCLNLLNEFQGSRLSITDGWEKVYPEAAASAAGRNFDHASGLPSDDSVDDDDYDPDVPD--TIVQDDESSPETSGYASASEEL-------ESPPNV

Query:  DQY---LGLPSDDSEDDDYDPSAPERDEDVRQESSS--SDFTSDSEDLAALDSNPSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSL---
        D+    L LPS+DSEDDDYDP+ P+ D+DV ++SSS  SDFTSDS+D     S   S  D + SP L        PD +  G   + +A     SS    
Subjt:  DQY---LGLPSDDSEDDDYDPSAPERDEDVRQESSS--SDFTSDSEDLAALDSNPSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSL---

Query:  LESGPDKDGPEPVLGRRQVERLDYKKLHDETYGNVPTDSSDD---TYASVSMDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDLTNIKTKHSSKRGTRQ
        +E+  D+    P   RRQ ERLDYKKL+DE YG   +DSSDD   +  +  +  S+++G  ++   +  + +         ND+LT   TK S       
Subjt:  LESGPDKDGPEPVLGRRQVERLDYKKLHDETYGNVPTDSSDD---TYASVSMDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDLTNIKTKHSSKRGTRQ

Query:  KAVAANMNKSVSKTPED-TGKASSSVRRTTPSSYRRLSRLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKSSS
             +++ SV + P D T   S+S  R           +  ++L   F+   YP R+ KESLA+ELGL+ +QV+KWF   R S R  SS +G      S
Subjt:  KAVAANMNKSVSKTPED-TGKASSSVRRTTPSSYRRLSRLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKSSS

Query:  RMGIHSS-QASGELHQPE
            +S   AS E  +PE
Subjt:  RMGIHSS-QASGELHQPE

P48785 Pathogenesis-related homeodomain protein2.4e-5028.3Show/hide
Query:  RSKTQEKDKDPEPSNDLNNFTAEEGKGKKKERNIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLK
        +S+T++  +      ++     ++ + +K +R  +     VD+   ++   RYLL ++K +QNLI+AY++EGWKG S +K++P+KEL+RA  EI+  KL 
Subjt:  RSKTQEKDKDPEPSNDLNNFTAEEGKGKKKERNIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLK

Query:  IRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLLNEFQ
        +RD  +++D L   G + + +  S G I  + IFCA+C S+E   +NDIILCDG C+R FHQ CL+PPL    IPP D+GW C  CDCK + ++ +N   
Subjt:  IRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLLNEFQ

Query:  GSRLSITDGWEKVYPEAAASAAGRNFDHASGLPSDDSVDDDDYDPDVPDTIVQDDESSPETSGYASASEELESPPNVDQYLGLPSDDSEDDDYDPSAPER
        G+   +   W+ ++ E A+   G                                                 S   V+     PSDDS+DDDYDP   E 
Subjt:  GSRLSITDGWEKVYPEAAASAAGRNFDHASGLPSDDSVDDDDYDPDVPDTIVQDDESSPETSGYASASEELESPPNVDQYLGLPSDDSEDDDYDPSAPER

Query:  DEDVRQESSSSDFTSDSEDLAALDSNPSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSLLESGPDKDGPEPVLGRRQVERLDYKKLHDET
               + S D   D+++                  S++ + S+ + DG +   G  +    + LS+++E   +    E V G RQ   +DY +L+ E 
Subjt:  DEDVRQESSSSDFTSDSEDLAALDSNPSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSLLESGPDKDGPEPVLGRRQVERLDYKKLHDET

Query:  YGNVPTDSSDDTYASVSMDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDLTNIKTKHSSKRGTRQKAVAANMNKSVSKTPEDTGKASSSVRRTTPSSYR
        +G           A +    S+D+ W  N R++  +          ++   T +    SSK+           ++ V +T E + + S SV        R
Subjt:  YGNVPTDSSDDTYASVSMDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDLTNIKTKHSSKRGTRQKAVAANMNKSVSKTPEDTGKASSSVRRTTPSSYR

Query:  RLSRL---ALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKSSSR
        R+ RL   A+E+L   F E + P +A ++ LA+EL L  ++V+KWF NTR+        E  K    S+
Subjt:  RLSRL---ALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKSSSR

P48786 Pathogenesis-related homeodomain protein7.8e-11845.39Show/hide
Query:  VSSDRNLRSKTQEKDKDPEPSNDLNNFTAEEGKGKKKERNIQGKGA---RVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRA
        V+S R+LRS++QEK  +P    D+NN  A+EG  ++K R  + K     RVDEF  IR HLRYLL+RIKYE+N ++AYS EGWKG S DK+KPEKEL+RA
Subjt:  VSSDRNLRSKTQEKDKDPEPSNDLNNFTAEEGKGKKKERNIQGKGA---RVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRA

Query:  SNEIMRRKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKD
          EI  RKLKIRD+FQR+D    EG L + LFDS+G+IDSEDIFCAKCGSK+++  NDIILCDG CDRGFHQFCL+PPLL   IPPDDEGWLCPGC+CK 
Subjt:  SNEIMRRKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKD

Query:  DCLNLLNEFQGSRLSITDGWEKVY-PEAAASAAGRNFDHASGLPSDDSVDDDDYDPDVP--DTIVQDDESSPETSGYASASEELESPPNVDQYLGLPSDD
        DC+ LLN+ Q + + + D WEKV+  EAAA+A+G+N D  SGLPSDDS +DDDYDP  P  D  VQ D+SS + S Y S S++++     +   GLPSDD
Subjt:  DCLNLLNEFQGSRLSITDGWEKVY-PEAAASAAGRNFDHASGLPSDDSVDDDDYDPDVP--DTIVQDDESSPETSGYASASEELESPPNVDQYLGLPSDD

Query:  SEDDDYDPSAPERDEDVRQESSSSDFTSDSEDLAAL--DSNPSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSLLESGPDKDGPEPVLGR
        SEDD+YDPS    D+ + ++SS SDFTSDSED   +  D   + KA   ++ + ++  + +   G                       P++    P+  R
Subjt:  SEDDDYDPSAPERDEDVRQESSSSDFTSDSEDLAAL--DSNPSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSLLESGPDKDGPEPVLGR

Query:  RQVERLDYKKLHD--------------------------ETYGNVPTDSSDDTYASVSMDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDLTNIKTKHS
        RQVE LDYKKL+D                          E YGN  +DSSD+ Y   S    ++   ++   +R            R + DL   +    
Subjt:  RQVERLDYKKLHD--------------------------ETYGNVPTDSSDDTYASVSMDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDLTNIKTKHS

Query:  SKRGTR--QKAVAANMNKSVSKTPEDTGKASSSVRRTTPSSYRRLSRLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVE
        S    R  +K      +  +S++ ED+    +  + T+ + +      A +RLL SF+ENQYP+RA KESLA EL LSV+QVS WF N RWS RH S + 
Subjt:  SKRGTR--QKAVAANMNKSVSKTPEDTGKASSSVRRTTPSSYRRLSRLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVE

Query:  GNKAKSSS
         + AK  S
Subjt:  GNKAKSSS

Q04996 Homeobox protein HAT3.14.0e-11446.37Show/hide
Query:  KTQEKDKDPEPSNDLNNFTAEEGKGKKKERNI-QGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKI
        + Q   +D  PS+ + N T   G+ KKK + + +G+    DE++ I+  LRY LNRI YEQ+LI+AYS EGWKG S +K++PEKEL+RA+ EI+RRKLKI
Subjt:  KTQEKDKDPEPSNDLNNFTAEEGKGKKKERNI-QGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKI

Query:  RDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLLNEFQG
        RD+FQ +D LC EG L +SLFD+ G+I SEDIFCAKCGSK+LS +NDIILCDG CDRGFHQ+CLEPPL   DIPPDDEGWLCPGCDCKDD L+LLN+  G
Subjt:  RDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLLNEFQG

Query:  SRLSITDGWEKVYPEAAASAAGRNFDHASGLPSDDSVDDDDYDPDVPDTIVQDDESSPET------------SGYASASEEL-----ESPPNVDQYLGLP
        ++ S++D WEK++PEAAA+  G   +    LPSDDS DD++YDPD  +    D++ S +             + + SAS+E+     E    +   + LP
Subjt:  SRLSITDGWEKVYPEAAASAAGRNFDHASGLPSDDSVDDDDYDPDVPDTIVQDDESSPET------------SGYASASEEL-----ESPPNVDQYLGLP

Query:  SDDSEDDDYDPSAPERDEDVRQESSSSDFTSDSEDLAALDSNPSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSLLESGPDKDGPEPVLG
        SDDSEDDDYDP AP  D+D  +ESS+SD TSD+EDL       S K D          T+ +  D      G + S L  +     + G D DGP  V  
Subjt:  SDDSEDDDYDPSAPERDEDVRQESSSSDFTSDSEDLAALDSNPSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSLLESGPDKDGPEPVLG

Query:  RRQVERLDYKKLHDETYGNVPTDSSDDTYASVSMDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDLTNIKTKHSSKRGTRQKAVAANMNKSVSKTPEDT
        RR VERLDYKKL+DE Y NVPT SSDD       D +   G + +  +    T  + L      +D T+ K    SKR  ++  +     +   + P + 
Subjt:  RRQVERLDYKKLHDETYGNVPTDSSDDTYASVSMDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDLTNIKTKHSSKRGTRQKAVAANMNKSVSKTPEDT

Query:  GKASSSVRRTTPSSYRRLSRLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWS
        G  S  + +++ S+ ++      +RL  SFQENQYP++ATKESLA+EL ++VKQV+ WF + RWS
Subjt:  GKASSSVRRTTPSSYRRLSRLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWS

Q8H991 Homeobox protein HAZ11.0e-10139.97Show/hide
Query:  NKNYIEQMNPPNEKLTQNTPFQNLET--VPSNSEQSDHKDKRILKSIKINSILRSLVSSDRNLRSKTQEKDKDPEPSNDLNNFTA--EEGKGKKKERNIQ
        +K Y  + +  N ++ ++   +  ET  VP+N   +    +R+ K  K +  LR   S  R LRS +++K+K     N+L N  A  +  + K+K     
Subjt:  NKNYIEQMNPPNEKLTQNTPFQNLET--VPSNSEQSDHKDKRILKSIKINSILRSLVSSDRNLRSKTQEKDKDPEPSNDLNNFTA--EEGKGKKKERNIQ

Query:  GKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFC
          G   D++  IR  +RY+LNR+ YEQ+LI+AY+SEGWKG S +K++PEKEL+RA  EI+R K +IR+ F+ +D+L  EG L +S+FDS G+I SEDIFC
Subjt:  GKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFC

Query:  AKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLLNEFQGSRLSITDGWEKVYPEAAASAAGRNFDHASGLPSD
        A CGSK+++ +NDIILCDGICDRGFHQ+CL PPLL  DIP  DEGWLCP CDCK DC+++LNE QG +LSI D WEKV+PEAA+   G     AS LPSD
Subjt:  AKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLLNEFQGSRLSITDGWEKVYPEAAASAAGRNFDHASGLPSD

Query:  DSVDDDDYDPDVPD-TIVQDDESSPETSGYA-----SASEELESPP-----------NVDQYLGLPSDDSEDDDYDPSAPERDEDVRQESSS-----SDF
        DS  D+DYDP +     V +++SS E  G       S+SE+ ES              VD  LGLPS+DSED D+DP+ P+ D++   ES+S     SDF
Subjt:  DSVDDDDYDPDVPD-TIVQDDESSPETSGYA-----SASEELESPP-----------NVDQYLGLPSDDSEDDDYDPSAPERDEDVRQESSS-----SDF

Query:  TSDSEDLAALDSNPSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSLLESGPDKDGPEPVLGRRQVERLDYKKLHDETYGNVPTDSSDDT-
        TSDS+D  A +   S   D +  PS +   ++   DG    G P      N   + +E+  ++D   P+  +RQVERLDYKKL++E YG   +DSSDD  
Subjt:  TSDSEDLAALDSNPSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSLLESGPDKDGPEPVLGRRQVERLDYKKLHDETYGNVPTDSSDDT-

Query:  -YASVSMDSSDDQGWDSNTRKRSP---KTLVLALP-NYRTNDDLTNIKTKHSSKRGTRQKAVAANMNKSVSKTPEDTGKASSSVRRTTPSSYRRLSRLAL
         Y + + +  + +  ++++   SP   K      P  Y  N+          S    + + + +N N S +K                    R       
Subjt:  -YASVSMDSSDDQGWDSNTRKRSP---KTLVLALP-NYRTNDDLTNIKTKHSSKRGTRQKAVAANMNKSVSKTPEDTGKASSSVRRTTPSSYRRLSRLAL

Query:  ERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRW-----STRHPSSVEGNKAKSSS
        ++L A F+E+ YP RATKE+LAQELGL+  QV+KWF++TR      +T+  +++E + A++++
Subjt:  ERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRW-----STRHPSSVEGNKAKSSS

Arabidopsis top hitse value%identityAlignment
AT3G19510.1 Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain2.9e-11546.37Show/hide
Query:  KTQEKDKDPEPSNDLNNFTAEEGKGKKKERNI-QGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKI
        + Q   +D  PS+ + N T   G+ KKK + + +G+    DE++ I+  LRY LNRI YEQ+LI+AYS EGWKG S +K++PEKEL+RA+ EI+RRKLKI
Subjt:  KTQEKDKDPEPSNDLNNFTAEEGKGKKKERNI-QGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKI

Query:  RDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLLNEFQG
        RD+FQ +D LC EG L +SLFD+ G+I SEDIFCAKCGSK+LS +NDIILCDG CDRGFHQ+CLEPPL   DIPPDDEGWLCPGCDCKDD L+LLN+  G
Subjt:  RDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLLNEFQG

Query:  SRLSITDGWEKVYPEAAASAAGRNFDHASGLPSDDSVDDDDYDPDVPDTIVQDDESSPET------------SGYASASEEL-----ESPPNVDQYLGLP
        ++ S++D WEK++PEAAA+  G   +    LPSDDS DD++YDPD  +    D++ S +             + + SAS+E+     E    +   + LP
Subjt:  SRLSITDGWEKVYPEAAASAAGRNFDHASGLPSDDSVDDDDYDPDVPDTIVQDDESSPET------------SGYASASEEL-----ESPPNVDQYLGLP

Query:  SDDSEDDDYDPSAPERDEDVRQESSSSDFTSDSEDLAALDSNPSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSLLESGPDKDGPEPVLG
        SDDSEDDDYDP AP  D+D  +ESS+SD TSD+EDL       S K D          T+ +  D      G + S L  +     + G D DGP  V  
Subjt:  SDDSEDDDYDPSAPERDEDVRQESSSSDFTSDSEDLAALDSNPSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSLLESGPDKDGPEPVLG

Query:  RRQVERLDYKKLHDETYGNVPTDSSDDTYASVSMDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDLTNIKTKHSSKRGTRQKAVAANMNKSVSKTPEDT
        RR VERLDYKKL+DE Y NVPT SSDD       D +   G + +  +    T  + L      +D T+ K    SKR  ++  +     +   + P + 
Subjt:  RRQVERLDYKKLHDETYGNVPTDSSDDTYASVSMDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDLTNIKTKHSSKRGTRQKAVAANMNKSVSKTPEDT

Query:  GKASSSVRRTTPSSYRRLSRLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWS
        G  S  + +++ S+ ++      +RL  SFQENQYP++ATKESLA+EL ++VKQV+ WF + RWS
Subjt:  GKASSSVRRTTPSSYRRLSRLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWS

AT4G29940.1 pathogenesis related homeodomain protein A1.7e-5128.3Show/hide
Query:  RSKTQEKDKDPEPSNDLNNFTAEEGKGKKKERNIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLK
        +S+T++  +      ++     ++ + +K +R  +     VD+   ++   RYLL ++K +QNLI+AY++EGWKG S +K++P+KEL+RA  EI+  KL 
Subjt:  RSKTQEKDKDPEPSNDLNNFTAEEGKGKKKERNIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLK

Query:  IRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLLNEFQ
        +RD  +++D L   G + + +  S G I  + IFCA+C S+E   +NDIILCDG C+R FHQ CL+PPL    IPP D+GW C  CDCK + ++ +N   
Subjt:  IRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLLNEFQ

Query:  GSRLSITDGWEKVYPEAAASAAGRNFDHASGLPSDDSVDDDDYDPDVPDTIVQDDESSPETSGYASASEELESPPNVDQYLGLPSDDSEDDDYDPSAPER
        G+   +   W+ ++ E A+   G                                                 S   V+     PSDDS+DDDYDP   E 
Subjt:  GSRLSITDGWEKVYPEAAASAAGRNFDHASGLPSDDSVDDDDYDPDVPDTIVQDDESSPETSGYASASEELESPPNVDQYLGLPSDDSEDDDYDPSAPER

Query:  DEDVRQESSSSDFTSDSEDLAALDSNPSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSLLESGPDKDGPEPVLGRRQVERLDYKKLHDET
               + S D   D+++                  S++ + S+ + DG +   G  +    + LS+++E   +    E V G RQ   +DY +L+ E 
Subjt:  DEDVRQESSSSDFTSDSEDLAALDSNPSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSLLESGPDKDGPEPVLGRRQVERLDYKKLHDET

Query:  YGNVPTDSSDDTYASVSMDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDLTNIKTKHSSKRGTRQKAVAANMNKSVSKTPEDTGKASSSVRRTTPSSYR
        +G           A +    S+D+ W  N R++  +          ++   T +    SSK+           ++ V +T E + + S SV        R
Subjt:  YGNVPTDSSDDTYASVSMDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDLTNIKTKHSSKRGTRQKAVAANMNKSVSKTPEDTGKASSSVRRTTPSSYR

Query:  RLSRL---ALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKSSSR
        R+ RL   A+E+L   F E + P +A ++ LA+EL L  ++V+KWF NTR+        E  K    S+
Subjt:  RLSRL---ALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKSSSR

AT4G29940.2 pathogenesis related homeodomain protein A1.7e-5128.3Show/hide
Query:  RSKTQEKDKDPEPSNDLNNFTAEEGKGKKKERNIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLK
        +S+T++  +      ++     ++ + +K +R  +     VD+   ++   RYLL ++K +QNLI+AY++EGWKG S +K++P+KEL+RA  EI+  KL 
Subjt:  RSKTQEKDKDPEPSNDLNNFTAEEGKGKKKERNIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLK

Query:  IRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLLNEFQ
        +RD  +++D L   G + + +  S G I  + IFCA+C S+E   +NDIILCDG C+R FHQ CL+PPL    IPP D+GW C  CDCK + ++ +N   
Subjt:  IRDVFQRIDALCGEGGLSKSLFDSQGQIDSEDIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLLNEFQ

Query:  GSRLSITDGWEKVYPEAAASAAGRNFDHASGLPSDDSVDDDDYDPDVPDTIVQDDESSPETSGYASASEELESPPNVDQYLGLPSDDSEDDDYDPSAPER
        G+   +   W+ ++ E A+   G                                                 S   V+     PSDDS+DDDYDP   E 
Subjt:  GSRLSITDGWEKVYPEAAASAAGRNFDHASGLPSDDSVDDDDYDPDVPDTIVQDDESSPETSGYASASEELESPPNVDQYLGLPSDDSEDDDYDPSAPER

Query:  DEDVRQESSSSDFTSDSEDLAALDSNPSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSLLESGPDKDGPEPVLGRRQVERLDYKKLHDET
               + S D   D+++                  S++ + S+ + DG +   G  +    + LS+++E   +    E V G RQ   +DY +L+ E 
Subjt:  DEDVRQESSSSDFTSDSEDLAALDSNPSSKADNLVSPSLNNTTSMKNPDGRSSGGGPRKSALYNELSSLLESGPDKDGPEPVLGRRQVERLDYKKLHDET

Query:  YGNVPTDSSDDTYASVSMDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDLTNIKTKHSSKRGTRQKAVAANMNKSVSKTPEDTGKASSSVRRTTPSSYR
        +G           A +    S+D+ W  N R++  +          ++   T +    SSK+           ++ V +T E + + S SV        R
Subjt:  YGNVPTDSSDDTYASVSMDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDLTNIKTKHSSKRGTRQKAVAANMNKSVSKTPEDTGKASSSVRRTTPSSYR

Query:  RLSRL---ALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKSSSR
        R+ RL   A+E+L   F E + P +A ++ LA+EL L  ++V+KWF NTR+        E  K    S+
Subjt:  RLSRL---ALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKSSSR

AT5G09790.1 ARABIDOPSIS TRITHORAX-RELATED PROTEIN 57.0e-0542.37Show/hide
Query:  DIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKD
        ++ C KCGS E   +++++LCD  CDRGFH  CL P ++   I      WLC   DC D
Subjt:  DIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKD

AT5G09790.2 ARABIDOPSIS TRITHORAX-RELATED PROTEIN 57.0e-0542.37Show/hide
Query:  DIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKD
        ++ C KCGS E   +++++LCD  CDRGFH  CL P ++   I      WLC   DC D
Subjt:  DIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAAAGAGATGAATATACAGAATCGAGAAGTAATAATAATGCTGAAGCCGTACAAGAAGCCAAGATCAGTGTTGAAGCTGAAATGCGAACTTGTCTTTCAAATGA
GCAAAAGCATTCAGTTCCTGATTATCATGAATTGGAAGCAACTCCAGGATATTCCAACAAAACTGGCGGTTCAGATGAAGAAAAGCCAGAGGTCCAGCAGAATATGGAGG
AAGAGAATAGGGAACTTGGTTCAGGAGATGTGCTTATTGAATTATCAGAAAAACACAATCAGACTTTCTCTAACCTTGCTGATAATGATCAAGTTGAAGCTGGTAATTTA
TTATGCTGTGATAAAGATACCGAAAATTTGATAGTACCTATTGAAGTTGAGACAACGACTCTTCTTGTTGACTGCTCTGAACTTCCACCTGAAGTTGTCAACAAAAACTA
TATTGAACAGATGAACCCTCCTAATGAAAAATTAACCCAAAATACTCCTTTCCAAAATTTAGAAACAGTCCCCAGTAATTCAGAACAATCGGATCACAAGGATAAGAGAA
TTTTGAAATCAATTAAGATAAATTCTATTTTAAGGTCCCTTGTAAGTAGTGACAGAAATCTGCGTTCAAAGACCCAAGAGAAAGATAAAGATCCTGAACCAAGTAATGAC
TTAAATAATTTTACTGCTGAAGAGGGAAAAGGGAAGAAGAAGGAGAGAAATATACAAGGAAAGGGAGCAAGAGTCGATGAATTCTCATCAATCAGGAATCATTTGAGATA
TTTACTGAACCGCATCAAATATGAACAGAACTTGATTGAAGCTTATTCTAGTGAAGGCTGGAAAGGGTTCAGCTCAGATAAATTGAAGCCTGAAAAGGAACTTCAACGGG
CATCAAATGAAATAATGCGGCGCAAATTAAAAATAAGAGATGTATTTCAACGTATTGATGCACTTTGTGGCGAAGGAGGCCTTTCTAAATCTTTATTCGATTCTCAAGGA
CAGATAGACAGTGAAGATATATTCTGTGCAAAATGTGGATCCAAAGAATTGTCCTTTGAGAATGACATCATACTATGTGATGGTATTTGTGATCGTGGGTTCCATCAGTT
CTGTTTAGAACCACCTTTGCTAAATACAGACATTCCGCCAGATGATGAAGGATGGCTATGCCCCGGATGTGATTGCAAAGATGACTGCTTGAATCTGCTTAATGAATTTC
AAGGATCAAGACTTTCCATCACTGATGGTTGGGAGAAAGTCTATCCCGAGGCCGCAGCATCAGCTGCTGGACGAAATTTTGATCATGCCTCAGGTCTTCCTTCAGATGAT
TCTGTAGATGATGATGATTATGATCCTGATGTTCCAGATACTATTGTCCAGGACGATGAATCAAGTCCTGAAACATCTGGGTATGCTTCTGCTTCTGAGGAATTAGAGTC
TCCACCCAATGTTGACCAGTACTTAGGTCTCCCTTCCGATGATTCGGAGGATGATGACTATGATCCCAGTGCTCCAGAACGTGATGAAGATGTTAGACAGGAAAGTTCAA
GTTCTGACTTTACATCTGATTCTGAGGATTTAGCTGCACTTGACAGTAACCCCTCTTCCAAAGCTGATAACCTTGTGTCTCCATCACTGAATAATACTACGTCTATGAAA
AACCCTGATGGGCGAAGTTCTGGAGGTGGTCCTAGAAAGAGTGCACTGTATAATGAGCTATCAAGTCTACTAGAGTCCGGTCCTGATAAGGACGGTCCTGAACCTGTTTT
GGGAAGAAGACAGGTGGAACGGTTGGATTACAAGAAGCTCCATGATGAGACATATGGGAATGTTCCTACCGACTCAAGCGATGACACGTACGCAAGTGTTTCTATGGATT
CAAGTGATGACCAAGGCTGGGATAGTAATACAAGGAAGAGAAGTCCTAAAACCCTGGTTCTTGCTCTGCCAAATTATCGAACTAATGATGATTTGACTAACATAAAAACT
AAACACAGTTCTAAGAGGGGTACTCGTCAAAAGGCAGTTGCTGCAAATATGAATAAATCTGTGAGTAAAACTCCTGAAGACACTGGAAAAGCTAGTTCTTCTGTTAGGAG
AACCACACCATCATCGTATAGAAGACTCAGTCGACTTGCATTGGAGAGACTTTTAGCATCATTCCAAGAAAACCAGTATCCTGAACGAGCTACAAAGGAGAGTCTGGCAC
AAGAACTAGGGCTCAGTGTGAAGCAGGTTAGCAAATGGTTTACGAACACACGTTGGAGCACACGCCATCCCTCAAGCGTTGAGGGTAATAAAGCGAAGAGTTCCTCAAGA
ATGGGCATTCATTCATCTCAGGCAAGTGGAGAGCTGCACCAGCCCGAGCAAGAATTTGGTGCCCAACATCAAGAATTACCAACAGCAGATAGTGTTGTGGCCCCATGTCA
GAGTGGGGATACAGGGGATGTCAAATTGGCAACTCAGGAAACTAAAAGATCAGAATTTTCTGCCACAAAATCCAGAAAACGGAAGGGCAGGTCAGATCACGCTGCATCAT
GTTCAAAGGACAGTAAGGAATCACAAAGGCCTCCTGCCAAGTCACCAAAAGTAAATGAAATCCAAACAGCACATAGCATTAAGACGAGGAGGAGAAATTCCTTATAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAGAAAGAGATGAATATACAGAATCGAGAAGTAATAATAATGCTGAAGCCGTACAAGAAGCCAAGATCAGTGTTGAAGCTGAAATGCGAACTTGTCTTTCAAATGA
GCAAAAGCATTCAGTTCCTGATTATCATGAATTGGAAGCAACTCCAGGATATTCCAACAAAACTGGCGGTTCAGATGAAGAAAAGCCAGAGGTCCAGCAGAATATGGAGG
AAGAGAATAGGGAACTTGGTTCAGGAGATGTGCTTATTGAATTATCAGAAAAACACAATCAGACTTTCTCTAACCTTGCTGATAATGATCAAGTTGAAGCTGGTAATTTA
TTATGCTGTGATAAAGATACCGAAAATTTGATAGTACCTATTGAAGTTGAGACAACGACTCTTCTTGTTGACTGCTCTGAACTTCCACCTGAAGTTGTCAACAAAAACTA
TATTGAACAGATGAACCCTCCTAATGAAAAATTAACCCAAAATACTCCTTTCCAAAATTTAGAAACAGTCCCCAGTAATTCAGAACAATCGGATCACAAGGATAAGAGAA
TTTTGAAATCAATTAAGATAAATTCTATTTTAAGGTCCCTTGTAAGTAGTGACAGAAATCTGCGTTCAAAGACCCAAGAGAAAGATAAAGATCCTGAACCAAGTAATGAC
TTAAATAATTTTACTGCTGAAGAGGGAAAAGGGAAGAAGAAGGAGAGAAATATACAAGGAAAGGGAGCAAGAGTCGATGAATTCTCATCAATCAGGAATCATTTGAGATA
TTTACTGAACCGCATCAAATATGAACAGAACTTGATTGAAGCTTATTCTAGTGAAGGCTGGAAAGGGTTCAGCTCAGATAAATTGAAGCCTGAAAAGGAACTTCAACGGG
CATCAAATGAAATAATGCGGCGCAAATTAAAAATAAGAGATGTATTTCAACGTATTGATGCACTTTGTGGCGAAGGAGGCCTTTCTAAATCTTTATTCGATTCTCAAGGA
CAGATAGACAGTGAAGATATATTCTGTGCAAAATGTGGATCCAAAGAATTGTCCTTTGAGAATGACATCATACTATGTGATGGTATTTGTGATCGTGGGTTCCATCAGTT
CTGTTTAGAACCACCTTTGCTAAATACAGACATTCCGCCAGATGATGAAGGATGGCTATGCCCCGGATGTGATTGCAAAGATGACTGCTTGAATCTGCTTAATGAATTTC
AAGGATCAAGACTTTCCATCACTGATGGTTGGGAGAAAGTCTATCCCGAGGCCGCAGCATCAGCTGCTGGACGAAATTTTGATCATGCCTCAGGTCTTCCTTCAGATGAT
TCTGTAGATGATGATGATTATGATCCTGATGTTCCAGATACTATTGTCCAGGACGATGAATCAAGTCCTGAAACATCTGGGTATGCTTCTGCTTCTGAGGAATTAGAGTC
TCCACCCAATGTTGACCAGTACTTAGGTCTCCCTTCCGATGATTCGGAGGATGATGACTATGATCCCAGTGCTCCAGAACGTGATGAAGATGTTAGACAGGAAAGTTCAA
GTTCTGACTTTACATCTGATTCTGAGGATTTAGCTGCACTTGACAGTAACCCCTCTTCCAAAGCTGATAACCTTGTGTCTCCATCACTGAATAATACTACGTCTATGAAA
AACCCTGATGGGCGAAGTTCTGGAGGTGGTCCTAGAAAGAGTGCACTGTATAATGAGCTATCAAGTCTACTAGAGTCCGGTCCTGATAAGGACGGTCCTGAACCTGTTTT
GGGAAGAAGACAGGTGGAACGGTTGGATTACAAGAAGCTCCATGATGAGACATATGGGAATGTTCCTACCGACTCAAGCGATGACACGTACGCAAGTGTTTCTATGGATT
CAAGTGATGACCAAGGCTGGGATAGTAATACAAGGAAGAGAAGTCCTAAAACCCTGGTTCTTGCTCTGCCAAATTATCGAACTAATGATGATTTGACTAACATAAAAACT
AAACACAGTTCTAAGAGGGGTACTCGTCAAAAGGCAGTTGCTGCAAATATGAATAAATCTGTGAGTAAAACTCCTGAAGACACTGGAAAAGCTAGTTCTTCTGTTAGGAG
AACCACACCATCATCGTATAGAAGACTCAGTCGACTTGCATTGGAGAGACTTTTAGCATCATTCCAAGAAAACCAGTATCCTGAACGAGCTACAAAGGAGAGTCTGGCAC
AAGAACTAGGGCTCAGTGTGAAGCAGGTTAGCAAATGGTTTACGAACACACGTTGGAGCACACGCCATCCCTCAAGCGTTGAGGGTAATAAAGCGAAGAGTTCCTCAAGA
ATGGGCATTCATTCATCTCAGGCAAGTGGAGAGCTGCACCAGCCCGAGCAAGAATTTGGTGCCCAACATCAAGAATTACCAACAGCAGATAGTGTTGTGGCCCCATGTCA
GAGTGGGGATACAGGGGATGTCAAATTGGCAACTCAGGAAACTAAAAGATCAGAATTTTCTGCCACAAAATCCAGAAAACGGAAGGGCAGGTCAGATCACGCTGCATCAT
GTTCAAAGGACAGTAAGGAATCACAAAGGCCTCCTGCCAAGTCACCAAAAGTAAATGAAATCCAAACAGCACATAGCATTAAGACGAGGAGGAGAAATTCCTTATAG
Protein sequenceShow/hide protein sequence
MEERDEYTESRSNNNAEAVQEAKISVEAEMRTCLSNEQKHSVPDYHELEATPGYSNKTGGSDEEKPEVQQNMEEENRELGSGDVLIELSEKHNQTFSNLADNDQVEAGNL
LCCDKDTENLIVPIEVETTTLLVDCSELPPEVVNKNYIEQMNPPNEKLTQNTPFQNLETVPSNSEQSDHKDKRILKSIKINSILRSLVSSDRNLRSKTQEKDKDPEPSND
LNNFTAEEGKGKKKERNIQGKGARVDEFSSIRNHLRYLLNRIKYEQNLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDVFQRIDALCGEGGLSKSLFDSQG
QIDSEDIFCAKCGSKELSFENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLNLLNEFQGSRLSITDGWEKVYPEAAASAAGRNFDHASGLPSDD
SVDDDDYDPDVPDTIVQDDESSPETSGYASASEELESPPNVDQYLGLPSDDSEDDDYDPSAPERDEDVRQESSSSDFTSDSEDLAALDSNPSSKADNLVSPSLNNTTSMK
NPDGRSSGGGPRKSALYNELSSLLESGPDKDGPEPVLGRRQVERLDYKKLHDETYGNVPTDSSDDTYASVSMDSSDDQGWDSNTRKRSPKTLVLALPNYRTNDDLTNIKT
KHSSKRGTRQKAVAANMNKSVSKTPEDTGKASSSVRRTTPSSYRRLSRLALERLLASFQENQYPERATKESLAQELGLSVKQVSKWFTNTRWSTRHPSSVEGNKAKSSSR
MGIHSSQASGELHQPEQEFGAQHQELPTADSVVAPCQSGDTGDVKLATQETKRSEFSATKSRKRKGRSDHAASCSKDSKESQRPPAKSPKVNEIQTAHSIKTRRRNSL