CuGenDBv2

CmoCh04G009070 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh04G009070
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: homeobox-leucine zipper protein HAT22-like
Genome location: Cmo_Chr04:4562273..4563990
RNA-Seq Expression: CmoCh04G009070
Synteny: CmoCh04G009070
Gene Ontology terms:
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016020 - membrane (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR001356 - Homeobox domain
IPR003106 - Leucine zipper, homeobox-associated
IPR009057 - Homeobox-like domain superfamily
IPR017970 - Homeobox, conserved site
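
The genome location above uses a `sequence:start..end` layout. As a minimal sketch, the string can be split into its parts and the span length computed. The helper names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("Cmo_Chr04:4562273..4563990")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 1718 bp on Cmo_Chr04.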


Homology
GenBank top hits (description; e value; % identity; alignment)
KAG6600675.1 Homeobox-leucine zipper protein HAT22, partial [Cucurbita argyrosperma subsp. sororia]; e value 8.2e-148; identity 98.17%
Query:  MGLDDVSQTSLVLGLGISEAASPIINNLKNNKKKNNNKNPTSLEFEPCALTLGFSGDVDPPPHHP-HHHHLYRQASPHSSAVCSSFSGGGGGGIKRERDL
        MGLDDVSQTSLVLGLGISEAASPIINNLKN KK NNNKNPTSLEFEPCALTLGFSGDVDPP HHP HHHHLYRQASPHSSAVCSSFSGGGG GIKRERDL
Subjt:  MGLDDVSQTSLVLGLGISEAASPIINNLKNNKKKNNNKNPTSLEFEPCALTLGFSGDVDPPPHHP-HHHHLYRQASPHSSAVCSSFSGGGGGGIKRERDL

Query:  SSEEVELERVCCRVSDEDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRRARTKMKQTEVDCEFLKKCCETLTNE
        SSEEVELERVCCRVSDEDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRRARTKMKQTEVDCEFLKKCCETLTNE
Subjt:  SSEEVELERVCCRVSDEDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRRARTKMKQTEVDCEFLKKCCETLTNE

Query:  NRRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKPKFSMAPKPHFYNPFSNPSAAC
        NRRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKPKFSMAPKPHFYNPFSNPSAAC
Subjt:  NRRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKPKFSMAPKPHFYNPFSNPSAAC

KAG7031313.1 Homeobox-leucine zipper protein HAT22 [Cucurbita argyrosperma subsp. argyrosperma]; e value 4.4e-149; identity 98.9%
Query:  MGLDDVSQTSLVLGLGISEAASPIINNLKNNKKKNNNKNPTSLEFEPCALTLGFSGDVDPPPHHP-HHHHLYRQASPHSSAVCSSFSGGGGGGIKRERDL
        MGLDDVSQTSLVLGLGISEAASPIINNLKNNKKK NNKNPTSLEFEPCALTLGFSGDVDPP HHP HHHHLYRQASPHSSAVCSSFSGGGGGGIKRERDL
Subjt:  MGLDDVSQTSLVLGLGISEAASPIINNLKNNKKKNNNKNPTSLEFEPCALTLGFSGDVDPPPHHP-HHHHLYRQASPHSSAVCSSFSGGGGGGIKRERDL

Query:  SSEEVELERVCCRVSDEDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRRARTKMKQTEVDCEFLKKCCETLTNE
        SSEEVELERVCCRVSDEDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRRARTKMKQTEVDCEFLKKCCETLTNE
Subjt:  SSEEVELERVCCRVSDEDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRRARTKMKQTEVDCEFLKKCCETLTNE

Query:  NRRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKPKFSMAPKPHFYNPFSNPSAAC
        NRRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKPKFSMAPKPHFYNPFSNPSAAC
Subjt:  NRRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKPKFSMAPKPHFYNPFSNPSAAC

XP_022943239.1 homeobox-leucine zipper protein HAT22-like [Cucurbita moschata]; e value 3.2e-152; identity 100%
Query:  MGLDDVSQTSLVLGLGISEAASPIINNLKNNKKKNNNKNPTSLEFEPCALTLGFSGDVDPPPHHPHHHHLYRQASPHSSAVCSSFSGGGGGGIKRERDLS
        MGLDDVSQTSLVLGLGISEAASPIINNLKNNKKKNNNKNPTSLEFEPCALTLGFSGDVDPPPHHPHHHHLYRQASPHSSAVCSSFSGGGGGGIKRERDLS
Subjt:  MGLDDVSQTSLVLGLGISEAASPIINNLKNNKKKNNNKNPTSLEFEPCALTLGFSGDVDPPPHHPHHHHLYRQASPHSSAVCSSFSGGGGGGIKRERDLS

Query:  SEEVELERVCCRVSDEDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRRARTKMKQTEVDCEFLKKCCETLTNEN
        SEEVELERVCCRVSDEDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRRARTKMKQTEVDCEFLKKCCETLTNEN
Subjt:  SEEVELERVCCRVSDEDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRRARTKMKQTEVDCEFLKKCCETLTNEN

Query:  RRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKPKFSMAPKPHFYNPFSNPSAAC
        RRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKPKFSMAPKPHFYNPFSNPSAAC
Subjt:  RRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKPKFSMAPKPHFYNPFSNPSAAC

XP_023000993.1 homeobox-leucine zipper protein HAT22-like [Cucurbita maxima]; e value 2.8e-140; identity 94.22%
Query:  MGLDDVSQTSLVLGLGISEAASPIINNLKNNKKKNNNKNPTSLEFEPCALTLGFSGDVDPPP-----HHPHHHHLYRQASPHSSAVCSSFSGGGGGGIKR
        MGLDDVSQTSLVLGLGISEAASPIINN KNN   NNN +PTSL+FEPCALTLGFSGDVDPPP     HH  HHHLYRQASPHSSAVCSSFSGGGGGGIKR
Subjt:  MGLDDVSQTSLVLGLGISEAASPIINNLKNNKKKNNNKNPTSLEFEPCALTLGFSGDVDPPP-----HHPHHHHLYRQASPHSSAVCSSFSGGGGGGIKR

Query:  ERDLSSEEVELERVCCRVSDEDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRRARTKMKQTEVDCEFLKKCCET
        ERDLSSEEVELERVCCRVSDEDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRRARTKMKQTEVDCEFLKKCCET
Subjt:  ERDLSSEEVELERVCCRVSDEDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRRARTKMKQTEVDCEFLKKCCET

Query:  LTNENRRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKPKFSMAPKPHFYNPFSNPSAAC
        LTNENRRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERVG  GVGVGDGASKPKFSMAPKPHFYNPFSNPSAAC
Subjt:  LTNENRRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKPKFSMAPKPHFYNPFSNPSAAC

XP_023547039.1 homeobox-leucine zipper protein HAT22-like [Cucurbita pepo subsp. pepo]; e value 7.0e-147; identity 97.45%
Query:  MGLDDVSQTSLVLGLGISEAASPIINNLKNNKKKNNNK-NPTSLEFEPCALTLGFSGDVDPPPHH--PHHHHLYRQASPHSSAVCSSFSGGGGGGIKRER
        MGLDDVSQTSLVLGLGISEAASPI+NN KNNKKK NNK NPTSLEFEPCALTLGFSGDVDPPPHH   HHHHLYRQASPHSSAVCSSFSGGGGGGIKRER
Subjt:  MGLDDVSQTSLVLGLGISEAASPIINNLKNNKKKNNNK-NPTSLEFEPCALTLGFSGDVDPPPHH--PHHHHLYRQASPHSSAVCSSFSGGGGGGIKRER

Query:  DLSSEEVELERVCCRVSDEDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRRARTKMKQTEVDCEFLKKCCETLT
        DLSSEEVELERVCCRVSDEDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRRARTKMKQTEVDCEFLKKCCETLT
Subjt:  DLSSEEVELERVCCRVSDEDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRRARTKMKQTEVDCEFLKKCCETLT

Query:  NENRRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKPKFSMAPKPHFYNPFSNPSAAC
        NENRRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKPKFSMAPKPHFYNPFSNPSAAC
Subjt:  NENRRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKPKFSMAPKPHFYNPFSNPSAAC
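
The % identity figures in these hit lists appear to follow the usual BLAST convention: identical positions divided by the total number of alignment columns, gaps included. In the middle row of each alignment block, a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. For instance, the XP_023000993.1 alignment above seems to cover 277 columns with 261 identical positions, which gives 94.22%. The sketch below recomputes the value from the printed rows; `percent_identity` is a hypothetical helper, and it assumes each middle row has been stripped of the leading label padding so that it lines up with its query row.

```python
def percent_identity(query_rows, midline_rows):
    """Percent identity from the wrapped rows of a BLAST-style alignment.

    query_rows   -- the 'Query:' sequence segments (gap characters included)
    midline_rows -- the matching middle rows; letters mark identical columns
    """
    columns = sum(len(row) for row in query_rows)
    if columns == 0:
        raise ValueError("alignment has no columns")
    # Only letters count as identities; '+' and blanks do not.
    identical = sum(ch.isalpha() for row in midline_rows for ch in row)
    return 100.0 * identical / columns


# Toy example: 9 columns, 7 of them identical.
print(round(percent_identity(["ERVGGVGVG"], ["ERVG  GVG"]), 2))
```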

TrEMBL top hits (description; e value; % identity; alignment)
A0A1S3BNL6 homeobox-leucine zipper protein HAT22-like; e value 5.5e-97; identity 73.21%
Query:  MGLDDVSQTSLVLGLGISEAASPIINNLKNNKKKNNNKNPTSLEFEPCALTLGFSGDVDPPPH----HPHHHHLYRQASPHSSAVCSSFSGGGGGGIKRE
        MG DD S+T LVLGLG+SE A      L   KKK    + +SL+FEPC LTLGFSG    P      H    HLYRQ SPHSSAVCSSFS    G +KRE
Subjt:  MGLDDVSQTSLVLGLGISEAASPIINNLKNNKKKNNNKNPTSLEFEPCALTLGFSGDVDPPPH----HPHHHHLYRQASPHSSAVCSSFSGGGGGGIKRE

Query:  RDLSSEEVELERVCCRVSDEDEDGC-NTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRRARTKMKQTEVDCEFLKKCCET
        RDLSSEEVELERVC RVSDED+DGC NTRKKLRLSKQQSALLEESFKQNSTLNPKQKQ LAR LNL PRQVEVWFQNRRARTK+KQTEVDCE LKKCCET
Subjt:  RDLSSEEVELERVCCRVSDEDEDGC-NTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRRARTKMKQTEVDCEFLKKCCET

Query:  LTNENRRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERV---GGVGVGVGDGASKPKFSMAPKPHFYNPFSNPSAAC
        LT+ENRRLQKE+QELKA+KLA P+YM M  ATLT+CPSCERV   G  GV  G+  SKPKFSM P P FYNPFSNPSAAC
Subjt:  LTNENRRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERV---GGVGVGVGDGASKPKFSMAPKPHFYNPFSNPSAAC

A0A5D3DEZ1 Homeobox-leucine zipper protein HAT22-like; e value 1.2e-96; identity 72.86%
Query:  MGLDDVSQTSLVLGLGISEAASPIINNLKNNKKKNNNKNPTSLEFEPCALTLGFSGDVDPPPH----HPHHHHLYRQASPHSSAVCSSFSGGGGGGIKRE
        MG DD S+T LVLGLG+SE A      L   KKK    + +SL+FEPC LTLGFSG    P      H    HLYRQ SPHSSAVCSSFS    G +KRE
Subjt:  MGLDDVSQTSLVLGLGISEAASPIINNLKNNKKKNNNKNPTSLEFEPCALTLGFSGDVDPPPH----HPHHHHLYRQASPHSSAVCSSFSGGGGGGIKRE

Query:  RDLSSEEVELERVCCRVSDEDEDGC-NTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRRARTKMKQTEVDCEFLKKCCET
        RDLSSEEVELERVC RV+DED+DGC NTRKKLRLSKQQSALLEESFKQNSTLNPKQKQ LAR LNL PRQVEVWFQNRRARTK+KQTEVDCE LKKCCET
Subjt:  RDLSSEEVELERVCCRVSDEDEDGC-NTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRRARTKMKQTEVDCEFLKKCCET

Query:  LTNENRRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERV---GGVGVGVGDGASKPKFSMAPKPHFYNPFSNPSAAC
        LT+ENRRLQKE+QELKA+KLA P+YM M  ATLT+CPSCERV   G  GV  G+  SKPKFSM P P FYNPFSNPSAAC
Subjt:  LTNENRRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERV---GGVGVGVGDGASKPKFSMAPKPHFYNPFSNPSAAC

A0A6J1C5L0 homeobox-leucine zipper protein HAT22-like; e value 3.3e-102; identity 78.39%
Query:  MGLDDVSQTSLVLGLGISEAASPIINNLKNNKKKNNNKNPTSLEFEPCALTLGFSGDVDPPPHHPHHHHLYRQASPHSSAVCSSFSGGGGG-GIKRERDL
        MG DD+S T LVLGLG+SE ASP     ++N  K      +SL+F+PC LTLGFSG  +        HHLYRQASPHSSAV SSFSGGGGG  +KRERDL
Subjt:  MGLDDVSQTSLVLGLGISEAASPIINNLKNNKKKNNNKNPTSLEFEPCALTLGFSGDVDPPPHHPHHHHLYRQASPHSSAVCSSFSGGGGG-GIKRERDL

Query:  SSEEVELERVCCRVSDEDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRRARTKMKQTEVDCEFLKKCCETLTNE
        SSEEV+LERV  R+SDEDEDG NTRKKLRLS++QSALLEESFKQNSTLNPKQKQ LAR LNLRPRQVEVWFQNRRARTK+KQTEVDCEFLKKCCETLT+E
Subjt:  SSEEVELERVCCRVSDEDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRRARTKMKQTEVDCEFLKKCCETLTNE

Query:  NRRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKPKFSMAPKPHFYNPFSNPSAAC
        NRRLQKELQELKALKLA PLYMHMPAATLTMCPSCERVG    G  DGASK KFSMAPKPHFYNPFSNPSAAC
Subjt:  NRRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKPKFSMAPKPHFYNPFSNPSAAC

A0A6J1FR71 homeobox-leucine zipper protein HAT22-like; e value 1.6e-152; identity 100%
Query:  MGLDDVSQTSLVLGLGISEAASPIINNLKNNKKKNNNKNPTSLEFEPCALTLGFSGDVDPPPHHPHHHHLYRQASPHSSAVCSSFSGGGGGGIKRERDLS
        MGLDDVSQTSLVLGLGISEAASPIINNLKNNKKKNNNKNPTSLEFEPCALTLGFSGDVDPPPHHPHHHHLYRQASPHSSAVCSSFSGGGGGGIKRERDLS
Subjt:  MGLDDVSQTSLVLGLGISEAASPIINNLKNNKKKNNNKNPTSLEFEPCALTLGFSGDVDPPPHHPHHHHLYRQASPHSSAVCSSFSGGGGGGIKRERDLS

Query:  SEEVELERVCCRVSDEDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRRARTKMKQTEVDCEFLKKCCETLTNEN
        SEEVELERVCCRVSDEDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRRARTKMKQTEVDCEFLKKCCETLTNEN
Subjt:  SEEVELERVCCRVSDEDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRRARTKMKQTEVDCEFLKKCCETLTNEN

Query:  RRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKPKFSMAPKPHFYNPFSNPSAAC
        RRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKPKFSMAPKPHFYNPFSNPSAAC
Subjt:  RRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKPKFSMAPKPHFYNPFSNPSAAC

A0A6J1KP81 homeobox-leucine zipper protein HAT22-like; e value 1.4e-140; identity 94.22%
Query:  MGLDDVSQTSLVLGLGISEAASPIINNLKNNKKKNNNKNPTSLEFEPCALTLGFSGDVDPPP-----HHPHHHHLYRQASPHSSAVCSSFSGGGGGGIKR
        MGLDDVSQTSLVLGLGISEAASPIINN KNN   NNN +PTSL+FEPCALTLGFSGDVDPPP     HH  HHHLYRQASPHSSAVCSSFSGGGGGGIKR
Subjt:  MGLDDVSQTSLVLGLGISEAASPIINNLKNNKKKNNNKNPTSLEFEPCALTLGFSGDVDPPP-----HHPHHHHLYRQASPHSSAVCSSFSGGGGGGIKR

Query:  ERDLSSEEVELERVCCRVSDEDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRRARTKMKQTEVDCEFLKKCCET
        ERDLSSEEVELERVCCRVSDEDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRRARTKMKQTEVDCEFLKKCCET
Subjt:  ERDLSSEEVELERVCCRVSDEDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRRARTKMKQTEVDCEFLKKCCET

Query:  LTNENRRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKPKFSMAPKPHFYNPFSNPSAAC
        LTNENRRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERVG  GVGVGDGASKPKFSMAPKPHFYNPFSNPSAAC
Subjt:  LTNENRRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKPKFSMAPKPHFYNPFSNPSAAC

SwissProt top hits (description; e value; % identity; alignment)
A2XE76 Homeobox-leucine zipper protein HOX19; e value 1.1e-51; identity 56.62%
Query:  PHSSAVCSSFSGGGGGGIKRERDLSSEEVELERVCCRVS--DEDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNR
        P  S    S        +KRER   +EE + ERV    +  D+D+DG +TRKKLRL+K+QSALLE+ F+++STLNPKQK  LA+ LNLRPRQVEVWFQNR
Subjt:  PHSSAVCSSFSGGGGGGIKRERDLSSEEVELERVCCRVS--DEDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNR

Query:  RARTKMKQTEVDCEFLKKCCETLTNENRRLQKELQELKALKLA----------------HPLYMHMPAATLTMCPSCERVGG----VGVGVGDGASKPKF
        RARTK+KQTEVDCEFLK+CCETLT ENRRLQ+ELQEL+ALK A                 P YM +PAATLT+CPSCERVGG      V   DG +K   
Subjt:  RARTKMKQTEVDCEFLKKCCETLTNENRRLQKELQELKALKLA----------------HPLYMHMPAATLTMCPSCERVGG----VGVGVGDGASKPKF

Query:  SMAPKPHFYNPFSNPSAAC
              HF+NPF++ SAAC
Subjt:  SMAPKPHFYNPFSNPSAAC

A2Z1U1 Homeobox-leucine zipper protein HOX11; e value 1.8e-52; identity 68.26%
Query:  ASPHSSA---VCSSFSGGGGGGIKRERDLSSEEVELERVCCRVSDEDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWF
        +SP++SA       FSG G GG     D +      +R C R SDED DG + RKKLRLSK+QSA LEESFK++STLNPKQK  LA+ LNLRPRQVEVWF
Subjt:  ASPHSSA---VCSSFSGGGGGGIKRERDLSSEEVELERVCCRVSDEDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWF

Query:  QNRRARTKMKQTEVDCEFLKKCCETLTNENRRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERV
        QNRRARTK+KQTEVDCE+LK+CCETLT ENRRLQKEL EL+ALK  HP YMH+PA TL+MCPSCERV
Subjt:  QNRRARTKMKQTEVDCEFLKKCCETLTNENRRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERV

P46603 Homeobox-leucine zipper protein HAT9; e value 2.2e-71; identity 58.82%
Query:  MGLDDVSQTSLVLGLGISEAASPIINNLKNNKKKNNNKNPTSLEFEPCALTLGFSGDVDPPPHHPHHHHLYRQASPHSSAVCSSFSGGGGGGIKRERDLS
        MG DD   T LVLGLG     SPI NN  +  +++     +  + EP +LTL  SGD            L RQ S HS    SSFS   G  +KRERD  
Subjt:  MGLDDVSQTSLVLGLGISEAASPIINNLKNNKKKNNNKNPTSLEFEPCALTLGFSGDVDPPPHHPHHHHLYRQASPHSSAVCSSFSGGGGGGIKRERDLS

Query:  SEEVELERVCCRV-SD--EDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRRARTKMKQTEVDCEFLKKCCETLT
         E  E E +  RV SD  EDE+G + RKKLRL+KQQSALLEESFK +STLNPKQKQ LAR LNLRPRQVEVWFQNRRARTK+KQTEVDCEFLKKCCETL 
Subjt:  SEEVELERVCCRV-SD--EDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRRARTKMKQTEVDCEFLKKCCETLT

Query:  NENRRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERVGGVGVGVGDG--------------ASKPKFSMAPKPHFYNPFSNPSAAC
        +EN RLQKE+QELK LKL  P YMHMPA+TLT CPSCER+GG G G G G               +K  FS++ KPHF+NPF+NPSAAC
Subjt:  NENRRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERVGGVGVGVGDG--------------ASKPKFSMAPKPHFYNPFSNPSAAC

P46604 Homeobox-leucine zipper protein HAT22; e value 9.6e-75; identity 59.72%
Query:  MGLDDVSQTSLVLGLGISEAASPIINNLKNN-KKKNNNKNPTSLEFEPCALTLGFSGD-VDPPPHHPHHHHLYRQASPHSSAVCSSFSGGGGGGIKRERD
        MGLDD   T LVLGLG+    SP  NN  +  KK ++  +   +  +P +LTL  SG+             + RQ S HS    SSFS    G +KRER+
Subjt:  MGLDDVSQTSLVLGLGISEAASPIINNLKNN-KKKNNNKNPTSLEFEPCALTLGFSGD-VDPPPHHPHHHHLYRQASPHSSAVCSSFSGGGGGGIKRERD

Query:  LS-------SEEVELERVCCRVSD--EDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRRARTKMKQTEVDCEFL
        +S       +EE     VC RVSD  +DE+G + RKKLRL+KQQSALLE++FK +STLNPKQKQ LAR LNLRPRQVEVWFQNRRARTK+KQTEVDCEFL
Subjt:  LS-------SEEVELERVCCRVSD--EDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRRARTKMKQTEVDCEFL

Query:  KKCCETLTNENRRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERVGGVGVG-----VGDGASKPKFSMAPKPHFYNPFSNPSAAC
        KKCCETLT+ENRRLQKELQ+LKALKL+ P YMHMPAATLTMCPSCER+GG GVG     V +  +K  FS+  KP FYNPF+NPSAAC
Subjt:  KKCCETLTNENRRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERVGGVGVG-----VGDGASKPKFSMAPKPHFYNPFSNPSAAC

Q67UE2 Homeobox-leucine zipper protein HOX11; e value 1.8e-52; identity 68.26%
Query:  ASPHSSA---VCSSFSGGGGGGIKRERDLSSEEVELERVCCRVSDEDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWF
        +SP++SA       FSG G GG     D +      +R C R SDED DG + RKKLRLSK+QSA LEESFK++STLNPKQK  LA+ LNLRPRQVEVWF
Subjt:  ASPHSSA---VCSSFSGGGGGGIKRERDLSSEEVELERVCCRVSDEDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWF

Query:  QNRRARTKMKQTEVDCEFLKKCCETLTNENRRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERV
        QNRRARTK+KQTEVDCE+LK+CCETLT ENRRLQKEL EL+ALK  HP YMH+PA TL+MCPSCERV
Subjt:  QNRRARTKMKQTEVDCEFLKKCCETLTNENRRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERV

Arabidopsis top hits (description; e value; % identity; alignment)
AT2G22800.1 Homeobox-leucine zipper protein family; e value 1.6e-72; identity 58.82%
Query:  MGLDDVSQTSLVLGLGISEAASPIINNLKNNKKKNNNKNPTSLEFEPCALTLGFSGDVDPPPHHPHHHHLYRQASPHSSAVCSSFSGGGGGGIKRERDLS
        MG DD   T LVLGLG     SPI NN  +  +++     +  + EP +LTL  SGD            L RQ S HS    SSFS   G  +KRERD  
Subjt:  MGLDDVSQTSLVLGLGISEAASPIINNLKNNKKKNNNKNPTSLEFEPCALTLGFSGDVDPPPHHPHHHHLYRQASPHSSAVCSSFSGGGGGGIKRERDLS

Query:  SEEVELERVCCRV-SD--EDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRRARTKMKQTEVDCEFLKKCCETLT
         E  E E +  RV SD  EDE+G + RKKLRL+KQQSALLEESFK +STLNPKQKQ LAR LNLRPRQVEVWFQNRRARTK+KQTEVDCEFLKKCCETL 
Subjt:  SEEVELERVCCRV-SD--EDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRRARTKMKQTEVDCEFLKKCCETLT

Query:  NENRRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERVGGVGVGVGDG--------------ASKPKFSMAPKPHFYNPFSNPSAAC
        +EN RLQKE+QELK LKL  P YMHMPA+TLT CPSCER+GG G G G G               +K  FS++ KPHF+NPF+NPSAAC
Subjt:  NENRRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERVGGVGVGVGDG--------------ASKPKFSMAPKPHFYNPFSNPSAAC

AT3G60390.1 homeobox-leucine zipper protein 3; e value 1.5e-46; identity 63.75%
Query:  SGGGGGGIKRERDLSSEEVELERVCCRV--SDEDEDGC-----NTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRRARTK
        +G  GGG         E+ E+ER  C +    +DEDG      ++RKKLRLSK+Q+ +LEE+FK++STLNPKQK  LA+ LNLR RQVEVWFQNRRARTK
Subjt:  SGGGGGGIKRERDLSSEEVELERVCCRV--SDEDEDGC-----NTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRRARTK

Query:  MKQTEVDCEFLKKCCETLTNENRRLQKELQELKALKLAHPLYMHM-PAATLTMCPSCERV
        +KQTEVDCE+LK+CCE LT+ENRRLQKE+ EL+ALKL+  LYMHM P  TLTMCPSCERV
Subjt:  MKQTEVDCEFLKKCCETLTNENRRLQKELQELKALKLAHPLYMHM-PAATLTMCPSCERV

AT4G16780.1 homeobox protein 2; e value 4.3e-46; identity 58.79%
Query:  DVDPPPHHPHHHHLYRQASPHSSAVCSSFSGGGGGGIKRERDLSSEEVELERVCCRVSDEDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLA
        DV+ PP        Y       S+  S+ S   G   +RE D   +         R   +DEDG N+RKKLRLSK QSA+LEE+FK +STLNPKQKQ LA
Subjt:  DVDPPPHHPHHHHLYRQASPHSSAVCSSFSGGGGGGIKRERDLSSEEVELERVCCRVSDEDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLA

Query:  RHLNLRPRQVEVWFQNRRARTKMKQTEVDCEFLKKCCETLTNENRRLQKELQELKALKLAHPLYMHM-PAATLTMCPSCERV
        + L LR RQVEVWFQNRRARTK+KQTEVDCEFL++CCE LT ENRRLQKE+ EL+ALKL+   YMHM P  TLTMCPSCE V
Subjt:  RHLNLRPRQVEVWFQNRRARTKMKQTEVDCEFLKKCCETLTNENRRLQKELQELKALKLAHPLYMHM-PAATLTMCPSCERV

AT4G37790.1 Homeobox-leucine zipper protein family; e value 6.8e-76; identity 59.72%
Query:  MGLDDVSQTSLVLGLGISEAASPIINNLKNN-KKKNNNKNPTSLEFEPCALTLGFSGD-VDPPPHHPHHHHLYRQASPHSSAVCSSFSGGGGGGIKRERD
        MGLDD   T LVLGLG+    SP  NN  +  KK ++  +   +  +P +LTL  SG+             + RQ S HS    SSFS    G +KRER+
Subjt:  MGLDDVSQTSLVLGLGISEAASPIINNLKNN-KKKNNNKNPTSLEFEPCALTLGFSGD-VDPPPHHPHHHHLYRQASPHSSAVCSSFSGGGGGGIKRERD

Query:  LS-------SEEVELERVCCRVSD--EDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRRARTKMKQTEVDCEFL
        +S       +EE     VC RVSD  +DE+G + RKKLRL+KQQSALLE++FK +STLNPKQKQ LAR LNLRPRQVEVWFQNRRARTK+KQTEVDCEFL
Subjt:  LS-------SEEVELERVCCRVSD--EDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRRARTKMKQTEVDCEFL

Query:  KKCCETLTNENRRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERVGGVGVG-----VGDGASKPKFSMAPKPHFYNPFSNPSAAC
        KKCCETLT+ENRRLQKELQ+LKALKL+ P YMHMPAATLTMCPSCER+GG GVG     V +  +K  FS+  KP FYNPF+NPSAAC
Subjt:  KKCCETLTNENRRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERVGGVGVG-----VGDGASKPKFSMAPKPHFYNPFSNPSAAC

AT5G06710.1 homeobox from Arabidopsis thaliana; e value 1.4e-49; identity 65.03%
Query:  AVCSSFS---GGGGGGIKRERDLSSEEVELERVCCRVSDEDEDGCN--TRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRR
        +V SSF    G    G +R  +    + E+ER   R S+ED D  N  TRKKLRLSK QSA LE+SFK++STLNPKQK  LA+ LNLRPRQVEVWFQNRR
Subjt:  AVCSSFS---GGGGGGIKRERDLSSEEVELERVCCRVSDEDEDGCN--TRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRR

Query:  ARTKMKQTEVDCEFLKKCCETLTNENRRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERV
        ARTK+KQTEVDCE+LK+CCE+LT ENRRLQKE++EL+ LK + P YM +PA TLTMCPSCERV
Subjt:  ARTKMKQTEVDCEFLKKCCETLTNENRRLQKELQELKALKLAHPLYMHMPAATLTMCPSCERV


Sequences
CDS sequence
ATGGGTTTGGATGATGTTTCTCAAACAAGCTTGGTTTTGGGGTTGGGGATCTCAGAAGCTGCTTCTCCAATTATTAACAACTTGAAGAACAACAAGAAGAAGAACAACAA
CAAGAATCCTACTTCACTTGAGTTTGAGCCTTGTGCTTTGACTTTGGGATTTTCAGGCGATGTTGATCCTCCTCCTCATCATCCTCATCATCATCATTTGTATCGCCAAG
CTTCCCCTCATAGCAGCGCTGTTTGTTCTTCCTTCTCCGGGGGCGGTGGTGGTGGGATTAAAAGGGAGAGAGACCTCAGCAGTGAAGAAGTTGAATTGGAGAGAGTTTGT
TGTAGAGTTAGTGATGAAGATGAAGATGGTTGTAATACTAGGAAGAAGCTTAGGCTTTCTAAACAACAATCCGCGCTATTGGAAGAGAGTTTCAAACAAAATAGCACTCT
CAACCCCAAACAAAAGCAAACCTTAGCCAGACACCTAAACCTACGCCCAAGACAAGTGGAAGTATGGTTTCAAAATAGGAGAGCTCGAACAAAAATGAAACAAACAGAAG
TAGACTGTGAGTTCTTGAAGAAATGTTGTGAGACGTTGACAAACGAAAATAGAAGACTACAGAAGGAGCTTCAAGAATTGAAAGCGCTAAAGCTCGCACACCCGCTGTAC
ATGCACATGCCAGCAGCAACGTTAACCATGTGCCCATCCTGCGAAAGGGTGGGTGGCGTTGGCGTTGGCGTTGGCGATGGCGCTTCCAAACCCAAATTTTCAATGGCTCC
CAAGCCCCACTTTTACAACCCTTTCTCCAATCCTTCAGCCGCATGTTAG
mRNA sequence
ATACAAACACAAACACAAACACAAACACAAACACACCCATCTCTCTCCTTTTTCCTCTTCTTAAACCCTTCATTTTGGTTTGTTGTTCTTCACAACTTCCATCTTTTTCC
ACACAAATCCCTCAACTCAACTCAACTCCAGATGGGTTTGGATGATGTTTCTCAAACAAGCTTGGTTTTGGGGTTGGGGATCTCAGAAGCTGCTTCTCCAATTATTAACA
ACTTGAAGAACAACAAGAAGAAGAACAACAACAAGAATCCTACTTCACTTGAGTTTGAGCCTTGTGCTTTGACTTTGGGATTTTCAGGCGATGTTGATCCTCCTCCTCAT
CATCCTCATCATCATCATTTGTATCGCCAAGCTTCCCCTCATAGCAGCGCTGTTTGTTCTTCCTTCTCCGGGGGCGGTGGTGGTGGGATTAAAAGGGAGAGAGACCTCAG
CAGTGAAGAAGTTGAATTGGAGAGAGTTTGTTGTAGAGTTAGTGATGAAGATGAAGATGGTTGTAATACTAGGAAGAAGCTTAGGCTTTCTAAACAACAATCCGCGCTAT
TGGAAGAGAGTTTCAAACAAAATAGCACTCTCAACCCCAAACAAAAGCAAACCTTAGCCAGACACCTAAACCTACGCCCAAGACAAGTGGAAGTATGGTTTCAAAATAGG
AGAGCTCGAACAAAAATGAAACAAACAGAAGTAGACTGTGAGTTCTTGAAGAAATGTTGTGAGACGTTGACAAACGAAAATAGAAGACTACAGAAGGAGCTTCAAGAATT
GAAAGCGCTAAAGCTCGCACACCCGCTGTACATGCACATGCCAGCAGCAACGTTAACCATGTGCCCATCCTGCGAAAGGGTGGGTGGCGTTGGCGTTGGCGTTGGCGATG
GCGCTTCCAAACCCAAATTTTCAATGGCTCCCAAGCCCCACTTTTACAACCCTTTCTCCAATCCTTCAGCCGCATGTTAGAAAATTAATGTACACCATTTACAAGCTACA
CAAAATAACACTATACTCATATATTTTTCTCCTTCTTCTTCAAAACGCCATAGCTAGCTTAGAAATTGGAGAAATTTAGTGTAACTATTAGCTACCCTTGTACAAGCTTA
CCCTTAAGATCAAATTATCATCAGTTGTAAAGATTTTAAACTCTATCTAGAAATTAATCAAGGATTGGTGGGTATATAATGTAATTTTTTTTTATCGATTTTTAGAATGA
TAATTGATGTTTTATTTACAGAATTATGTATGATCGACCGACCATCCGACCGACGATCCAATGACGCAAGATGGATGGTGTGTTGGATTGGGAAAAATTATTGGGTGGTG
TTGGTGTAAGGGGCTATCTATCTATCTCATTGGCTGCCTACGTGTTACGCAAGGGATGACATTGATATGACTTTTGCTATTTTATTAGTGCGTTTTGGCTTCATTGGTCA
ATATGATACAATGATAATAATAATACATTTCTTTTTTCTTTTTTTCTTGTTTTATATCATGGAATCTACTTTCTTTTTCTCATTTTCTTCCTTTTTGTTTCTAC
Protein sequence
MGLDDVSQTSLVLGLGISEAASPIINNLKNNKKKNNNKNPTSLEFEPCALTLGFSGDVDPPPHHPHHHHLYRQASPHSSAVCSSFSGGGGGGIKRERDLSSEEVELERVC
CRVSDEDEDGCNTRKKLRLSKQQSALLEESFKQNSTLNPKQKQTLARHLNLRPRQVEVWFQNRRARTKMKQTEVDCEFLKKCCETLTNENRRLQKELQELKALKLAHPLY
MHMPAATLTMCPSCERVGGVGVGVGDGASKPKFSMAPKPHFYNPFSNPSAAC
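
The CDS and protein sequences above can be cross-checked against each other by translating the CDS with the standard genetic code. The sketch below is illustrative only: `translate` is a hypothetical helper, and it assumes the standard nuclear code table, an uppercase or lowercase A/C/G/T sequence with no ambiguity codes, and a reading frame that starts at the first base.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from the wrapped listing
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the CDS listed above.
print(translate("ATGGGTTTGGATGATGTTTCTCAAACAAGC"))
```

Passing the full CDS listing, joined across its lines, should reproduce the protein sequence; the listed CDS has 819 bases, which corresponds to 272 residues plus the stop codon.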