; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh07G006240 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh07G006240
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionHomeobox-leucine zipper protein
Genome locationCmo_Chr07:2812539..2813907
RNA-Seq ExpressionCmoCh07G006240
SyntenyCmoCh07G006240
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR001356 - Homeobox domain
IPR003106 - Leucine zipper, homeobox-associated
IPR009057 - Homeobox-like domain superfamily
IPR017970 - Homeobox, conserved site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6594945.1 Homeobox-leucine zipper protein HAT22, partial [Cucurbita argyrosperma subsp. sororia]1.5e-13599.22Show/hide
Query:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVNLFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEE
        MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPV+LFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEE
Subjt:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVNLFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEE

Query:  EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLHKELQELKALKLTK
        EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRL KELQELKALKLTK
Subjt:  EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLHKELQELKALKLTK

Query:  PLFMQMPAATLTMCPSCERIGGATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
        PLFMQMPAATLTMCPSCERIGGATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
Subjt:  PLFMQMPAATLTMCPSCERIGGATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC

KAG7026906.1 Homeobox-leucine zipper protein HAT22 [Cucurbita argyrosperma subsp. argyrosperma]7.5e-13598.83Show/hide
Query:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVNLFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEE
        MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPV+LFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEE
Subjt:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVNLFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEE

Query:  EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLHKELQELKALKLTK
        EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRL KELQELKALKLTK
Subjt:  EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLHKELQELKALKLTK

Query:  PLFMQMPAATLTMCPSCERIGGATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
        PLFMQMPAATLTMCPSCERIG ATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
Subjt:  PLFMQMPAATLTMCPSCERIGGATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC

XP_022963334.1 homeobox-leucine zipper protein HAT22-like [Cucurbita moschata]4.7e-137100Show/hide
Query:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVNLFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEE
        MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVNLFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEE
Subjt:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVNLFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEE

Query:  EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLHKELQELKALKLTK
        EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLHKELQELKALKLTK
Subjt:  EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLHKELQELKALKLTK

Query:  PLFMQMPAATLTMCPSCERIGGATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
        PLFMQMPAATLTMCPSCERIGGATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
Subjt:  PLFMQMPAATLTMCPSCERIGGATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC

XP_023003674.1 homeobox-leucine zipper protein HAT22-like [Cucurbita maxima]1.8e-13397.66Show/hide
Query:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVNLFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEE
        MGFDDLPNTGLLLGLGLTLQSKPACLSQKPN PV+LFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGG+VKRERDVSGEDIEEEKASSRVSDEE
Subjt:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVNLFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEE

Query:  EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLHKELQELKALKLTK
        EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRL KELQELKALKLTK
Subjt:  EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLHKELQELKALKLTK

Query:  PLFMQMPAATLTMCPSCERIGGATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
        PLFMQMPAATLTMCPSCERIGGATATVNGNGN K AFSMAPKPQFYKPFTNPSAAC
Subjt:  PLFMQMPAATLTMCPSCERIGGATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC

XP_023517048.1 homeobox-leucine zipper protein HAT22-like [Cucurbita pepo subsp. pepo]4.4e-13598.83Show/hide
Query:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVNLFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEE
        MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPV+LFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEE
Subjt:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVNLFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEE

Query:  EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLHKELQELKALKLTK
        EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRL KELQELKALKLTK
Subjt:  EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLHKELQELKALKLTK

Query:  PLFMQMPAATLTMCPSCERIGGATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
        PLFMQMPAATLTMCPSCERIGGATATVNGNGNSKA FSMAPKPQFYKPFTNPSAAC
Subjt:  PLFMQMPAATLTMCPSCERIGGATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC

TrEMBL top hitse value%identityAlignment
A0A1S3B144 homeobox-leucine zipper protein HAT22-like1.2e-10982.58Show/hide
Query:  MGFDDLPNTGLLLGLGLTLQSKPA-CLSQKPNNPVNLFSFPAVESEPSLTLGLST-------TADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKA
        MGFDDL NT LLLGLGLTL S P   +SQKP   ++L  FP  ESEPSLTLGLST       T DL RQPSPHS +SS SG RVKRERDVSGE+IEEEKA
Subjt:  MGFDDLPNTGLLLGLGLTLQSKPA-CLSQKPNNPVNLFSFPAVESEPSLTLGLST-------TADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKA

Query:  SSRVSDEEEDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLHKELQE
        SSRVSDE+EDGSNARKKLRLTKEQSALLEESFKLH TLNPKQKQALA+ELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRL KELQE
Subjt:  SSRVSDEEEDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLHKELQE

Query:  LKALKLTKPLFMQMPAATLTMCPSCERIGGATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
        LKALKL +PLFMQMPAATLTMCPSCERIGG  ATVNG+GNSK  FSMA KP+FYK FT PSAAC
Subjt:  LKALKLTKPLFMQMPAATLTMCPSCERIGGATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC

A0A5A7T2R8 Homeobox-leucine zipper protein HAT22-like1.2e-10982.58Show/hide
Query:  MGFDDLPNTGLLLGLGLTLQSKPA-CLSQKPNNPVNLFSFPAVESEPSLTLGLST-------TADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKA
        MGFDDL NT LLLGLGLTL S P   +SQKP   ++L  FP  ESEPSLTLGLST       T DL RQPSPHS +SS SG RVKRERDVSGE+IEEEKA
Subjt:  MGFDDLPNTGLLLGLGLTLQSKPA-CLSQKPNNPVNLFSFPAVESEPSLTLGLST-------TADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKA

Query:  SSRVSDEEEDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLHKELQE
        SSRVSDE+EDGSNARKKLRLTKEQSALLEESFKLH TLNPKQKQALA+ELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRL KELQE
Subjt:  SSRVSDEEEDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLHKELQE

Query:  LKALKLTKPLFMQMPAATLTMCPSCERIGGATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
        LKALKL +PLFMQMPAATLTMCPSCERIGG  ATVNG+GNSK  FSMA KP+FYK FT PSAAC
Subjt:  LKALKLTKPLFMQMPAATLTMCPSCERIGGATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC

A0A6J1GGZ5 homeobox-leucine zipper protein HAT225.3e-11081.95Show/hide
Query:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVNLFSFPAVESEPSLTLGLST----------TADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEE
        MGFDDL NTGLLLGLGL L S PA LS KP  PV+LFSFPA ESEPSLTLGLST          TADLCRQPSPHS +SS SGGRVKRERDVSGEDIEEE
Subjt:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVNLFSFPAVESEPSLTLGLST----------TADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEE

Query:  KASSRVSDEEEDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLHKEL
        KA SRVSDE+EDGS ARKKLRLTKEQSALLE+SFKLH TLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENR+L KEL
Subjt:  KASSRVSDEEEDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLHKEL

Query:  QELKALKLTKPLFMQMPAATLTMCPSCERIGGATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
        QELKALKL +PL MQMPAATLTMCPSCER GG    VN +GNSK+ FSMA  P+F K FT PSAAC
Subjt:  QELKALKLTKPLFMQMPAATLTMCPSCERIGGATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC

A0A6J1HJS8 homeobox-leucine zipper protein HAT22-like2.3e-137100Show/hide
Query:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVNLFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEE
        MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVNLFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEE
Subjt:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVNLFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEE

Query:  EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLHKELQELKALKLTK
        EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLHKELQELKALKLTK
Subjt:  EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLHKELQELKALKLTK

Query:  PLFMQMPAATLTMCPSCERIGGATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
        PLFMQMPAATLTMCPSCERIGGATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
Subjt:  PLFMQMPAATLTMCPSCERIGGATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC

A0A6J1KU00 homeobox-leucine zipper protein HAT22-like9.0e-13497.66Show/hide
Query:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVNLFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEE
        MGFDDLPNTGLLLGLGLTLQSKPACLSQKPN PV+LFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGG+VKRERDVSGEDIEEEKASSRVSDEE
Subjt:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVNLFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEE

Query:  EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLHKELQELKALKLTK
        EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRL KELQELKALKLTK
Subjt:  EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLHKELQELKALKLTK

Query:  PLFMQMPAATLTMCPSCERIGGATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
        PLFMQMPAATLTMCPSCERIGGATATVNGNGN K AFSMAPKPQFYKPFTNPSAAC
Subjt:  PLFMQMPAATLTMCPSCERIGGATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC

SwissProt top hitse value%identityAlignment
A2XE76 Homeobox-leucine zipper protein HOX199.8e-5355.47Show/hide
Query:  EPSLTLGL----------STTADLCRQPSPHSTVSSLSGG-----RVKRERDVSGEDIEEEKASSRVS--DEEEDGSNARKKLRLTKEQSALLEESFKLH
        EPSLTL L          + TA       P  +VSSLS G      VKRER    E+ + E+ SS  +  D+++DGS  RKKLRLTKEQSALLE+ F+ H
Subjt:  EPSLTLGL----------STTADLCRQPSPHSTVSSLSGG-----RVKRERDVSGEDIEEEKASSRVS--DEEEDGSNARKKLRLTKEQSALLEESFKLH

Query:  CTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLHKELQELKALKLT----------------KPLFMQMPAATL
         TLNPKQK ALA++LNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLT+ENRRL +ELQEL+ALK                   P +MQ+PAATL
Subjt:  CTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLHKELQELKALKLT----------------KPLFMQMPAATL

Query:  TMCPSCERIGG--ATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
        T+CPSCER+GG  + A V     +KA         F+ PFT+ SAAC
Subjt:  TMCPSCERIGG--ATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC

A2YW03 Homeobox-leucine zipper protein HOX272.4e-5170.78Show/hide
Query:  EKASSRVSDEEEDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLHKE
        E++SSR SD++E G++ARKKLRL+KEQSA LEESFK H TLNPKQK ALA++LNLRPRQVEVWFQNRRARTKLKQTEVDCE+LKRCCETLT+ENRRLHKE
Subjt:  EKASSRVSDEEEDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLHKE

Query:  LQELKALKLTKPLFMQMPAATLTMCPSCERIGGATATVNGNGNSKAAFSMAPKP
        L EL+ALK  +P +M +PA TL+MCPSCER+    AT + +  + AA S A  P
Subjt:  LQELKALKLTKPLFMQMPAATLTMCPSCERIGGATATVNGNGNSKAAFSMAPKP

P46603 Homeobox-leucine zipper protein HAT93.7e-7662.41Show/hide
Query:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVNLFSFPAVESEPSLTLGLS--------TTAD-LCRQPSPHSTVSSLSGGR-VKRERDVSGEDIEEE
        MGFDD  NTGL+LGLG      P+ +    N+ +   S    + EPSLTL LS        T AD LCRQ S HS VSS S GR VKRERD   E  EEE
Subjt:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVNLFSFPAVESEPSLTLGLS--------TTAD-LCRQPSPHSTVSSLSGGR-VKRERDVSGEDIEEE

Query:  KASSRV-SD--EEEDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLH
        + + RV SD  E+E+G +ARKKLRLTK+QSALLEESFK H TLNPKQKQ LAR+LNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK+CCETL DEN RL 
Subjt:  KASSRV-SD--EEEDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLH

Query:  KELQELKALKLTKPLFMQMPAATLTMCPSCERIG-------------GATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
        KE+QELK LKLT+P +M MPA+TLT CPSCERIG             GATA +     +K AFS++ KP F+ PFTNPSAAC
Subjt:  KELQELKALKLTKPLFMQMPAATLTMCPSCERIG-------------GATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC

P46604 Homeobox-leucine zipper protein HAT224.6e-7961.07Show/hide
Query:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVNLFSFPAVESEPSLTLGLSTTA-----------DLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEE
        MG DD  NTGL+LGLGL+    P   +       +      +  +PSLTL LS  +            +CRQ S HS +SS S GRVKRER++SG D EE
Subjt:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVNLFSFPAVESEPSLTLGLSTTA-----------DLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEE

Query:  EK-------ASSRVSD--EEEDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLT
        E          SRVSD  ++E+G +ARKKLRLTK+QSALLE++FKLH TLNPKQKQALAR+LNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK+CCETLT
Subjt:  EK-------ASSRVSD--EEEDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLT

Query:  DENRRLHKELQELKALKLTKPLFMQMPAATLTMCPSCERIG----GATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
        DENRRL KELQ+LKALKL++P +M MPAATLTMCPSCER+G    G   T      +K AFS+  KP+FY PFTNPSAAC
Subjt:  DENRRLHKELQELKALKLTKPLFMQMPAATLTMCPSCERIG----GATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC

Q8GRL4 Homeobox-leucine zipper protein HOX199.8e-5355.47Show/hide
Query:  EPSLTLGL----------STTADLCRQPSPHSTVSSLSGG-----RVKRERDVSGEDIEEEKASSRVS--DEEEDGSNARKKLRLTKEQSALLEESFKLH
        EPSLTL L          + TA       P  +VSSLS G      VKRER    E+ + E+ SS  +  D+++DGS  RKKLRLTKEQSALLE+ F+ H
Subjt:  EPSLTLGL----------STTADLCRQPSPHSTVSSLSGG-----RVKRERDVSGEDIEEEKASSRVS--DEEEDGSNARKKLRLTKEQSALLEESFKLH

Query:  CTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLHKELQELKALKLT----------------KPLFMQMPAATL
         TLNPKQK ALA++LNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLT+ENRRL +ELQEL+ALK                   P +MQ+PAATL
Subjt:  CTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLHKELQELKALKLT----------------KPLFMQMPAATL

Query:  TMCPSCERIGG--ATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
        T+CPSCER+GG  + A V     +KA         F+ PFT+ SAAC
Subjt:  TMCPSCERIGG--ATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC

Arabidopsis top hitse value%identityAlignment
AT2G22800.1 Homeobox-leucine zipper protein family2.6e-7762.41Show/hide
Query:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVNLFSFPAVESEPSLTLGLS--------TTAD-LCRQPSPHSTVSSLSGGR-VKRERDVSGEDIEEE
        MGFDD  NTGL+LGLG      P+ +    N+ +   S    + EPSLTL LS        T AD LCRQ S HS VSS S GR VKRERD   E  EEE
Subjt:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVNLFSFPAVESEPSLTLGLS--------TTAD-LCRQPSPHSTVSSLSGGR-VKRERDVSGEDIEEE

Query:  KASSRV-SD--EEEDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLH
        + + RV SD  E+E+G +ARKKLRLTK+QSALLEESFK H TLNPKQKQ LAR+LNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK+CCETL DEN RL 
Subjt:  KASSRV-SD--EEEDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLH

Query:  KELQELKALKLTKPLFMQMPAATLTMCPSCERIG-------------GATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
        KE+QELK LKLT+P +M MPA+TLT CPSCERIG             GATA +     +K AFS++ KP F+ PFTNPSAAC
Subjt:  KELQELKALKLTKPLFMQMPAATLTMCPSCERIG-------------GATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC

AT2G44910.1 homeobox-leucine zipper protein 42.6e-4856.78Show/hide
Query:  SPHSTVSSLSGGRVKRERDVSGEDIEEEKAS------SRVSDEEE--DGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQ
        SP+S VSSLSG +        G++ E E+AS      S  SD+E+  +G  +RKKLRL+K+Q+ +LEE+FK H TLNPKQK ALA++LNLR RQVEVWFQ
Subjt:  SPHSTVSSLSGGRVKRERDVSGEDIEEEKAS------SRVSDEEE--DGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQ

Query:  NRRARTKLKQTEVDCEFLKRCCETLTDENRRLHKELQELKALKLTKPLFMQM-PAATLTMCPSCERIGGATATVNGNGNSKAAFSMA--PKPQFYKPFT
        NRRARTKLKQTEVDCE+LKRCC+ LT+ENRRL KE+ EL+ALKL+  L+M M P  TLTMCPSCER+  + ATV    ++    ++   P PQ   P+T
Subjt:  NRRARTKLKQTEVDCEFLKRCCETLTDENRRLHKELQELKALKLTKPLFMQM-PAATLTMCPSCERIGGATATVNGNGNSKAAFSMA--PKPQFYKPFT

AT4G16780.1 homeobox protein 23.0e-4966.04Show/hide
Query:  SPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEEEDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKL
        SP+STVSS +G R +RE D        +   SR   ++EDG N+RKKLRL+K+QSA+LEE+FK H TLNPKQKQALA++L LR RQVEVWFQNRRARTKL
Subjt:  SPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEEEDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKL

Query:  KQTEVDCEFLKRCCETLTDENRRLHKELQELKALKLTKPLFMQM-PAATLTMCPSCERI
        KQTEVDCEFL+RCCE LT+ENRRL KE+ EL+ALKL+   +M M P  TLTMCPSCE +
Subjt:  KQTEVDCEFLKRCCETLTDENRRLHKELQELKALKLTKPLFMQM-PAATLTMCPSCERI

AT4G37790.1 Homeobox-leucine zipper protein family3.3e-8061.07Show/hide
Query:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVNLFSFPAVESEPSLTLGLSTTA-----------DLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEE
        MG DD  NTGL+LGLGL+    P   +       +      +  +PSLTL LS  +            +CRQ S HS +SS S GRVKRER++SG D EE
Subjt:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVNLFSFPAVESEPSLTLGLSTTA-----------DLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEE

Query:  EK-------ASSRVSD--EEEDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLT
        E          SRVSD  ++E+G +ARKKLRLTK+QSALLE++FKLH TLNPKQKQALAR+LNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK+CCETLT
Subjt:  EK-------ASSRVSD--EEEDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLT

Query:  DENRRLHKELQELKALKLTKPLFMQMPAATLTMCPSCERIG----GATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
        DENRRL KELQ+LKALKL++P +M MPAATLTMCPSCER+G    G   T      +K AFS+  KP+FY PFTNPSAAC
Subjt:  DENRRLHKELQELKALKLTKPLFMQMPAATLTMCPSCERIG----GATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC

AT5G06710.1 homeobox from Arabidopsis thaliana3.9e-4964.38Show/hide
Query:  VSSLSGGRVKRERDVSGEDIEEEKASSRVSDEEEDGSN--ARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQT
        + S    R   +RD+   D E E+++SR S+E+ D  N   RKKLRL+K+QSA LE+SFK H TLNPKQK ALA++LNLRPRQVEVWFQNRRARTKLKQT
Subjt:  VSSLSGGRVKRERDVSGEDIEEEKASSRVSDEEEDGSN--ARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQT

Query:  EVDCEFLKRCCETLTDENRRLHKELQELKALKLTKPLFMQMPAATLTMCPSCERIGGATA
        EVDCE+LKRCCE+LT+ENRRL KE++EL+ LK + P +MQ+PA TLTMCPSCER+  + A
Subjt:  EVDCEFLKRCCETLTDENRRLHKELQELKALKLTKPLFMQMPAATLTMCPSCERIGGATA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTTTGACGATCTTCCTAATACGGGTCTTCTATTGGGATTGGGGTTAACGCTTCAATCCAAGCCCGCCTGCCTTTCTCAAAAACCCAACAACCCTGTCAATTTGTT
CAGTTTCCCCGCCGTCGAATCCGAGCCGTCCTTAACTTTGGGGCTTTCTACTACGGCTGACTTGTGTCGTCAACCGTCCCCTCACAGCACGGTTTCTTCCCTCTCTGGCG
GAAGGGTCAAGCGTGAAAGAGACGTTTCCGGCGAGGATATTGAAGAAGAGAAAGCTTCTTCTCGGGTTAGTGATGAAGAGGAAGATGGGTCTAATGCTAGGAAGAAACTT
AGGCTAACTAAAGAACAATCCGCCCTCTTGGAGGAGAGCTTCAAACTTCACTGCACTCTCAACCCTAAGCAAAAACAAGCCTTAGCCAGAGAGTTAAATCTTCGGCCACG
ACAAGTTGAAGTTTGGTTCCAGAACAGAAGAGCCAGGACAAAGCTAAAGCAAACAGAGGTAGATTGCGAGTTTCTAAAGAGATGCTGCGAAACGCTAACAGACGAAAACA
GAAGACTGCACAAAGAGCTGCAAGAATTGAAAGCCCTGAAACTAACCAAGCCTCTGTTCATGCAAATGCCGGCGGCAACACTCACCATGTGCCCCTCCTGCGAGCGGATC
GGCGGCGCCACCGCCACTGTTAACGGCAACGGGAATTCCAAGGCCGCATTTTCAATGGCTCCAAAGCCCCAGTTTTACAAACCCTTCACCAATCCCTCTGCTGCTTGCTA
A
mRNA sequenceShow/hide mRNA sequence
CTCATTCTTTCGTTTGATTATTAATCGCCCACCACCAACCCATTCCCTCCATTCCCATTTCTCTCTTCTTAAACGCCATCCCTCTCCTCAAAATCCCCACGCCATCGCTT
CAAAAAACGCCTCCAAAATCTTCAACCTCAAATCTTCATTCATATACCCAGATGGGTTTTGACGATCTTCCTAATACGGGTCTTCTATTGGGATTGGGGTTAACGCTTCA
ATCCAAGCCCGCCTGCCTTTCTCAAAAACCCAACAACCCTGTCAATTTGTTCAGTTTCCCCGCCGTCGAATCCGAGCCGTCCTTAACTTTGGGGCTTTCTACTACGGCTG
ACTTGTGTCGTCAACCGTCCCCTCACAGCACGGTTTCTTCCCTCTCTGGCGGAAGGGTCAAGCGTGAAAGAGACGTTTCCGGCGAGGATATTGAAGAAGAGAAAGCTTCT
TCTCGGGTTAGTGATGAAGAGGAAGATGGGTCTAATGCTAGGAAGAAACTTAGGCTAACTAAAGAACAATCCGCCCTCTTGGAGGAGAGCTTCAAACTTCACTGCACTCT
CAACCCTAAGCAAAAACAAGCCTTAGCCAGAGAGTTAAATCTTCGGCCACGACAAGTTGAAGTTTGGTTCCAGAACAGAAGAGCCAGGACAAAGCTAAAGCAAACAGAGG
TAGATTGCGAGTTTCTAAAGAGATGCTGCGAAACGCTAACAGACGAAAACAGAAGACTGCACAAAGAGCTGCAAGAATTGAAAGCCCTGAAACTAACCAAGCCTCTGTTC
ATGCAAATGCCGGCGGCAACACTCACCATGTGCCCCTCCTGCGAGCGGATCGGCGGCGCCACCGCCACTGTTAACGGCAACGGGAATTCCAAGGCCGCATTTTCAATGGC
TCCAAAGCCCCAGTTTTACAAACCCTTCACCAATCCCTCTGCTGCTTGCTAATCGAATTTGCCTAGCTAGAAGCAAAAAAAAAAAAAAGAATTAGGTAATTAATCAAAGA
GGAAGATCCCCAGAAACCCAGGATTTTTTGGTTGGGGCCTCGTTTAGTTCTAGTTGGTTCTATTAATTAAAAAAAAAAAAACATAGTAACTACAAATCCAATTATTATGT
CTCCTTATTTGATTTCATATGTTTGTACATAAACTAAATTAATGTTGTTATTGAAGAAAATATTTGTAATATGACGGCTTCTCTAAACTCCC
Protein sequenceShow/hide protein sequence
MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVNLFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEEEDGSNARKKL
RLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLHKELQELKALKLTKPLFMQMPAATLTMCPSCERI
GGATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC