CuGenDBv2

Carg18519 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg18519
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: Homeobox-leucine zipper protein
Genome location: Carg_Chr07:2754898..2755956
RNA-Seq Expression: Carg18519
Synteny: Carg18519
Gene Ontology terms:
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR001356 - Homeobox domain
IPR003106 - Leucine zipper, homeobox-associated
IPR009057 - Homeobox-like domain superfamily
IPR017970 - Homeobox, conserved site
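
The genome location field above uses a `seqid:start..end` pattern. As a minimal sketch, the following parses that string and computes the length of the gene span. It assumes the coordinates are 1-based and inclusive, which this page does not state; the function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re

def parse_location(loc: str):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end = parse_location("Carg_Chr07:2754898..2755956")
print(seqid, start, end, span_length(start, end))
# prints: Carg_Chr07 2754898 2755956 1059
```

Under that assumption the gene spans 1059 bp on Carg_Chr07, which is longer than the transcript listed further down, as would be expected if the gene contains intronic sequence.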


Homology
GenBank top hits | e value | % identity | Alignment
KAG6594945.1 Homeobox-leucine zipper protein HAT22, partial [Cucurbita argyrosperma subsp. sororia] | e value: 8.9e-136 | % identity: 99.61
Query:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVDLFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEE
        MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVDLFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEE
Subjt:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVDLFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEE

Query:  EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQELKALKLTK
        EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQELKALKLTK
Subjt:  EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQELKALKLTK

Query:  PLFMQMPAATLTMCPSCERIGSATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
        PLFMQMPAATLTMCPSCERIG ATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
Subjt:  PLFMQMPAATLTMCPSCERIGSATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC

KAG7026906.1 Homeobox-leucine zipper protein HAT22 [Cucurbita argyrosperma subsp. argyrosperma] | e value: 3.0e-136 | % identity: 100
Query:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVDLFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEE
        MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVDLFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEE
Subjt:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVDLFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEE

Query:  EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQELKALKLTK
        EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQELKALKLTK
Subjt:  EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQELKALKLTK

Query:  PLFMQMPAATLTMCPSCERIGSATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
        PLFMQMPAATLTMCPSCERIGSATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
Subjt:  PLFMQMPAATLTMCPSCERIGSATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC

XP_022963334.1 homeobox-leucine zipper protein HAT22-like [Cucurbita moschata] | e value: 1.3e-134 | % identity: 98.83
Query:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVDLFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEE
        MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPV+LFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEE
Subjt:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVDLFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEE

Query:  EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQELKALKLTK
        EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRL KELQELKALKLTK
Subjt:  EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQELKALKLTK

Query:  PLFMQMPAATLTMCPSCERIGSATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
        PLFMQMPAATLTMCPSCERIG ATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
Subjt:  PLFMQMPAATLTMCPSCERIGSATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC

XP_023003674.1 homeobox-leucine zipper protein HAT22-like [Cucurbita maxima] | e value: 1.1e-133 | % identity: 98.05
Query:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVDLFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEE
        MGFDDLPNTGLLLGLGLTLQSKPACLSQKPN PVDLFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGG+VKRERDVSGEDIEEEKASSRVSDEE
Subjt:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVDLFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEE

Query:  EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQELKALKLTK
        EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQELKALKLTK
Subjt:  EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQELKALKLTK

Query:  PLFMQMPAATLTMCPSCERIGSATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
        PLFMQMPAATLTMCPSCERIG ATATVNGNGN K AFSMAPKPQFYKPFTNPSAAC
Subjt:  PLFMQMPAATLTMCPSCERIGSATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC

XP_023517048.1 homeobox-leucine zipper protein HAT22-like [Cucurbita pepo subsp. pepo] | e value: 2.6e-135 | % identity: 99.22
Query:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVDLFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEE
        MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVDLFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEE
Subjt:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVDLFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEE

Query:  EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQELKALKLTK
        EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQELKALKLTK
Subjt:  EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQELKALKLTK

Query:  PLFMQMPAATLTMCPSCERIGSATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
        PLFMQMPAATLTMCPSCERIG ATATVNGNGNSKA FSMAPKPQFYKPFTNPSAAC
Subjt:  PLFMQMPAATLTMCPSCERIGSATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
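
The % identity column can be related to the alignment midlines shown above. In the usual BLAST-style layout, the line between `Query:` and `Subjt:` repeats the residue where both sequences agree, shows `+` for a conservative substitution, and is blank for other mismatches. A minimal sketch, assuming that convention holds for this page and that identity is computed as identical columns over aligned columns (the helper names are hypothetical):

```python
def count_identities(midline: str) -> int:
    """Count identical columns: positions where the midline shows a residue letter."""
    return sum(1 for ch in midline if ch.isalpha())

def percent_identity(blocks) -> float:
    """blocks is a list of (query_segment, midline) pairs, one per alignment block.

    Gap characters in the query segment count as aligned columns.
    """
    identical = sum(count_identities(mid) for _, mid in blocks)
    aligned = sum(len(query) for query, _ in blocks)
    return 100.0 * identical / aligned

# Last block of the first GenBank hit (KAG6594945.1); the midline has one blank.
query_3 = "PLFMQMPAATLTMCPSCERIGSATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC"
mid_3 = "PLFMQMPAATLTMCPSCERIG ATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC"

print(count_identities(mid_3), len(query_3))          # prints: 55 56
print(round(percent_identity([(query_3, mid_3)]), 2)) # prints: 98.21
```

If the two preceding 100-residue blocks of that hit are fully identical, as their midlines suggest, the whole alignment has 255 identical columns out of 256, and 255/256 rounds to the 99.61 reported for KAG6594945.1.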

TrEMBL top hits | e value | % identity | Alignment
A0A1S3B144 homeobox-leucine zipper protein HAT22-like | e value: 6.9e-110 | % identity: 82.95
Query:  MGFDDLPNTGLLLGLGLTLQSKPA-CLSQKPNNPVDLFSFPAVESEPSLTLGLST-------TADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKA
        MGFDDL NT LLLGLGLTL S P   +SQKP   +DL  FP  ESEPSLTLGLST       T DL RQPSPHS +SS SG RVKRERDVSGE+IEEEKA
Subjt:  MGFDDLPNTGLLLGLGLTLQSKPA-CLSQKPNNPVDLFSFPAVESEPSLTLGLST-------TADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKA

Query:  SSRVSDEEEDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQE
        SSRVSDE+EDGSNARKKLRLTKEQSALLEESFKLH TLNPKQKQALA+ELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQE
Subjt:  SSRVSDEEEDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQE

Query:  LKALKLTKPLFMQMPAATLTMCPSCERIGSATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
        LKALKL +PLFMQMPAATLTMCPSCERIG   ATVNG+GNSK  FSMA KP+FYK FT PSAAC
Subjt:  LKALKLTKPLFMQMPAATLTMCPSCERIGSATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC

A0A5A7T2R8 Homeobox-leucine zipper protein HAT22-like | e value: 6.9e-110 | % identity: 82.95
Query:  MGFDDLPNTGLLLGLGLTLQSKPA-CLSQKPNNPVDLFSFPAVESEPSLTLGLST-------TADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKA
        MGFDDL NT LLLGLGLTL S P   +SQKP   +DL  FP  ESEPSLTLGLST       T DL RQPSPHS +SS SG RVKRERDVSGE+IEEEKA
Subjt:  MGFDDLPNTGLLLGLGLTLQSKPA-CLSQKPNNPVDLFSFPAVESEPSLTLGLST-------TADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKA

Query:  SSRVSDEEEDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQE
        SSRVSDE+EDGSNARKKLRLTKEQSALLEESFKLH TLNPKQKQALA+ELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQE
Subjt:  SSRVSDEEEDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQE

Query:  LKALKLTKPLFMQMPAATLTMCPSCERIGSATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
        LKALKL +PLFMQMPAATLTMCPSCERIG   ATVNG+GNSK  FSMA KP+FYK FT PSAAC
Subjt:  LKALKLTKPLFMQMPAATLTMCPSCERIGSATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC

A0A6J1GGZ5 homeobox-leucine zipper protein HAT22 | e value: 3.1e-110 | % identity: 82.33
Query:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVDLFSFPAVESEPSLTLGLST----------TADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEE
        MGFDDL NTGLLLGLGL L S PA LS KP  PVDLFSFPA ESEPSLTLGLST          TADLCRQPSPHS +SS SGGRVKRERDVSGEDIEEE
Subjt:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVDLFSFPAVESEPSLTLGLST----------TADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEE

Query:  KASSRVSDEEEDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKEL
        KA SRVSDE+EDGS ARKKLRLTKEQSALLE+SFKLH TLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENR+LQKEL
Subjt:  KASSRVSDEEEDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKEL

Query:  QELKALKLTKPLFMQMPAATLTMCPSCERIGSATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
        QELKALKL +PL MQMPAATLTMCPSCER G     VN +GNSK+ FSMA  P+F K FT PSAAC
Subjt:  QELKALKLTKPLFMQMPAATLTMCPSCERIGSATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC

A0A6J1HJS8 homeobox-leucine zipper protein HAT22-like | e value: 6.2e-135 | % identity: 98.83
Query:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVDLFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEE
        MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPV+LFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEE
Subjt:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVDLFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEE

Query:  EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQELKALKLTK
        EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRL KELQELKALKLTK
Subjt:  EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQELKALKLTK

Query:  PLFMQMPAATLTMCPSCERIGSATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
        PLFMQMPAATLTMCPSCERIG ATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
Subjt:  PLFMQMPAATLTMCPSCERIGSATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC

A0A6J1KU00 homeobox-leucine zipper protein HAT22-like | e value: 5.2e-134 | % identity: 98.05
Query:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVDLFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEE
        MGFDDLPNTGLLLGLGLTLQSKPACLSQKPN PVDLFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGG+VKRERDVSGEDIEEEKASSRVSDEE
Subjt:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVDLFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEE

Query:  EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQELKALKLTK
        EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQELKALKLTK
Subjt:  EDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQELKALKLTK

Query:  PLFMQMPAATLTMCPSCERIGSATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
        PLFMQMPAATLTMCPSCERIG ATATVNGNGN K AFSMAPKPQFYKPFTNPSAAC
Subjt:  PLFMQMPAATLTMCPSCERIGSATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC

SwissProt top hits | e value | % identity | Alignment
A2XE76 Homeobox-leucine zipper protein HOX19 | e value: 2.2e-52 | % identity: 55.47
Query:  EPSLTLGL----------STTADLCRQPSPHSTVSSLSGG-----RVKRERDVSGEDIEEEKASSRVS--DEEEDGSNARKKLRLTKEQSALLEESFKLH
        EPSLTL L          + TA       P  +VSSLS G      VKRER    E+ + E+ SS  +  D+++DGS  RKKLRLTKEQSALLE+ F+ H
Subjt:  EPSLTLGL----------STTADLCRQPSPHSTVSSLSGG-----RVKRERDVSGEDIEEEKASSRVS--DEEEDGSNARKKLRLTKEQSALLEESFKLH

Query:  CTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQELKALKLT----------------KPLFMQMPAATL
         TLNPKQK ALA++LNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLT+ENRRLQ+ELQEL+ALK                   P +MQ+PAATL
Subjt:  CTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQELKALKLT----------------KPLFMQMPAATL

Query:  TMCPSCERIG--SATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
        T+CPSCER+G  ++ A V     +KA         F+ PFT+ SAAC
Subjt:  TMCPSCERIG--SATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC

A2YW03 Homeobox-leucine zipper protein HOX27 | e value: 1.6e-50 | % identity: 70.78
Query:  EKASSRVSDEEEDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKE
        E++SSR SD++E G++ARKKLRL+KEQSA LEESFK H TLNPKQK ALA++LNLRPRQVEVWFQNRRARTKLKQTEVDCE+LKRCCETLT+ENRRL KE
Subjt:  EKASSRVSDEEEDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKE

Query:  LQELKALKLTKPLFMQMPAATLTMCPSCERIGSATATVNGNGNSKAAFSMAPKP
        L EL+ALK  +P +M +PA TL+MCPSCER+ S  AT + +  + AA S A  P
Subjt:  LQELKALKLTKPLFMQMPAATLTMCPSCERIGSATATVNGNGNSKAAFSMAPKP

P46603 Homeobox-leucine zipper protein HAT9 | e value: 1.4e-75 | % identity: 62.41
Query:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVDLFSFPAVESEPSLTLGLS--------TTAD-LCRQPSPHSTVSSLSGGR-VKRERDVSGEDIEEE
        MGFDD  NTGL+LGLG      P+ +    N+ +   S    + EPSLTL LS        T AD LCRQ S HS VSS S GR VKRERD   E  EEE
Subjt:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVDLFSFPAVESEPSLTLGLS--------TTAD-LCRQPSPHSTVSSLSGGR-VKRERDVSGEDIEEE

Query:  KASSRV-SD--EEEDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQ
        + + RV SD  E+E+G +ARKKLRLTK+QSALLEESFK H TLNPKQKQ LAR+LNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK+CCETL DEN RLQ
Subjt:  KASSRV-SD--EEEDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQ

Query:  KELQELKALKLTKPLFMQMPAATLTMCPSCERIG-------------SATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
        KE+QELK LKLT+P +M MPA+TLT CPSCERIG              ATA +     +K AFS++ KP F+ PFTNPSAAC
Subjt:  KELQELKALKLTKPLFMQMPAATLTMCPSCERIG-------------SATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC

P46604 Homeobox-leucine zipper protein HAT22 | e value: 1.8e-78 | % identity: 61.07
Query:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVDLFSFPAVESEPSLTLGLSTTA-----------DLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEE
        MG DD  NTGL+LGLGL+    P   +              +  +PSLTL LS  +            +CRQ S HS +SS S GRVKRER++SG D EE
Subjt:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVDLFSFPAVESEPSLTLGLSTTA-----------DLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEE

Query:  EK-------ASSRVSD--EEEDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLT
        E          SRVSD  ++E+G +ARKKLRLTK+QSALLE++FKLH TLNPKQKQALAR+LNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK+CCETLT
Subjt:  EK-------ASSRVSD--EEEDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLT

Query:  DENRRLQKELQELKALKLTKPLFMQMPAATLTMCPSCERIG----SATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
        DENRRLQKELQ+LKALKL++P +M MPAATLTMCPSCER+G        T      +K AFS+  KP+FY PFTNPSAAC
Subjt:  DENRRLQKELQELKALKLTKPLFMQMPAATLTMCPSCERIG----SATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC

Q8GRL4 Homeobox-leucine zipper protein HOX19 | e value: 2.2e-52 | % identity: 55.47
Query:  EPSLTLGL----------STTADLCRQPSPHSTVSSLSGG-----RVKRERDVSGEDIEEEKASSRVS--DEEEDGSNARKKLRLTKEQSALLEESFKLH
        EPSLTL L          + TA       P  +VSSLS G      VKRER    E+ + E+ SS  +  D+++DGS  RKKLRLTKEQSALLE+ F+ H
Subjt:  EPSLTLGL----------STTADLCRQPSPHSTVSSLSGG-----RVKRERDVSGEDIEEEKASSRVS--DEEEDGSNARKKLRLTKEQSALLEESFKLH

Query:  CTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQELKALKLT----------------KPLFMQMPAATL
         TLNPKQK ALA++LNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLT+ENRRLQ+ELQEL+ALK                   P +MQ+PAATL
Subjt:  CTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQELKALKLT----------------KPLFMQMPAATL

Query:  TMCPSCERIG--SATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
        T+CPSCER+G  ++ A V     +KA         F+ PFT+ SAAC
Subjt:  TMCPSCERIG--SATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC

Arabidopsis top hits | e value | % identity | Alignment
AT2G22800.1 Homeobox-leucine zipper protein family | e value: 9.9e-77 | % identity: 62.41
Query:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVDLFSFPAVESEPSLTLGLS--------TTAD-LCRQPSPHSTVSSLSGGR-VKRERDVSGEDIEEE
        MGFDD  NTGL+LGLG      P+ +    N+ +   S    + EPSLTL LS        T AD LCRQ S HS VSS S GR VKRERD   E  EEE
Subjt:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVDLFSFPAVESEPSLTLGLS--------TTAD-LCRQPSPHSTVSSLSGGR-VKRERDVSGEDIEEE

Query:  KASSRV-SD--EEEDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQ
        + + RV SD  E+E+G +ARKKLRLTK+QSALLEESFK H TLNPKQKQ LAR+LNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK+CCETL DEN RLQ
Subjt:  KASSRV-SD--EEEDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQ

Query:  KELQELKALKLTKPLFMQMPAATLTMCPSCERIG-------------SATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
        KE+QELK LKLT+P +M MPA+TLT CPSCERIG              ATA +     +K AFS++ KP F+ PFTNPSAAC
Subjt:  KELQELKALKLTKPLFMQMPAATLTMCPSCERIG-------------SATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC

AT2G44910.1 homeobox-leucine zipper protein 4 | e value: 5.1e-49 | % identity: 57.79
Query:  SPHSTVSSLSGGRVKRERDVSGEDIEEEKAS------SRVSDEEE--DGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQ
        SP+S VSSLSG +        G++ E E+AS      S  SD+E+  +G  +RKKLRL+K+Q+ +LEE+FK H TLNPKQK ALA++LNLR RQVEVWFQ
Subjt:  SPHSTVSSLSGGRVKRERDVSGEDIEEEKAS------SRVSDEEE--DGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQ

Query:  NRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQELKALKLTKPLFMQM-PAATLTMCPSCERIGSATATVNGNGNSKAAFSMA--PKPQFYKPFT
        NRRARTKLKQTEVDCE+LKRCC+ LT+ENRRLQKE+ EL+ALKL+  L+M M P  TLTMCPSCER+ S+ ATV    ++    ++   P PQ   P+T
Subjt:  NRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQELKALKLTKPLFMQM-PAATLTMCPSCERIGSATATVNGNGNSKAAFSMA--PKPQFYKPFT

AT4G16780.1 homeobox protein 2 | e value: 1.8e-49 | % identity: 66.67
Query:  SPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEEEDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKL
        SP+STVSS +G R +RE D        +   SR   ++EDG N+RKKLRL+K+QSA+LEE+FK H TLNPKQKQALA++L LR RQVEVWFQNRRARTKL
Subjt:  SPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEEEDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKL

Query:  KQTEVDCEFLKRCCETLTDENRRLQKELQELKALKLTKPLFMQM-PAATLTMCPSCERI
        KQTEVDCEFL+RCCE LT+ENRRLQKE+ EL+ALKL+   +M M P  TLTMCPSCE +
Subjt:  KQTEVDCEFLKRCCETLTDENRRLQKELQELKALKLTKPLFMQM-PAATLTMCPSCERI

AT4G37790.1 Homeobox-leucine zipper protein family | e value: 1.3e-79 | % identity: 61.07
Query:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVDLFSFPAVESEPSLTLGLSTTA-----------DLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEE
        MG DD  NTGL+LGLGL+    P   +              +  +PSLTL LS  +            +CRQ S HS +SS S GRVKRER++SG D EE
Subjt:  MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVDLFSFPAVESEPSLTLGLSTTA-----------DLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEE

Query:  EK-------ASSRVSD--EEEDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLT
        E          SRVSD  ++E+G +ARKKLRLTK+QSALLE++FKLH TLNPKQKQALAR+LNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK+CCETLT
Subjt:  EK-------ASSRVSD--EEEDGSNARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLT

Query:  DENRRLQKELQELKALKLTKPLFMQMPAATLTMCPSCERIG----SATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
        DENRRLQKELQ+LKALKL++P +M MPAATLTMCPSCER+G        T      +K AFS+  KP+FY PFTNPSAAC
Subjt:  DENRRLQKELQELKALKLTKPLFMQMPAATLTMCPSCERIG----SATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC

AT5G06710.1 homeobox from Arabidopsis thaliana | e value: 1.0e-49 | % identity: 65
Query:  VSSLSGGRVKRERDVSGEDIEEEKASSRVSDEEEDGSN--ARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQT
        + S    R   +RD+   D E E+++SR S+E+ D  N   RKKLRL+K+QSA LE+SFK H TLNPKQK ALA++LNLRPRQVEVWFQNRRARTKLKQT
Subjt:  VSSLSGGRVKRERDVSGEDIEEEKASSRVSDEEEDGSN--ARKKLRLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQT

Query:  EVDCEFLKRCCETLTDENRRLQKELQELKALKLTKPLFMQMPAATLTMCPSCERIGSATA
        EVDCE+LKRCCE+LT+ENRRLQKE++EL+ LK + P +MQ+PA TLTMCPSCER+ ++ A
Subjt:  EVDCEFLKRCCETLTDENRRLQKELQELKALKLTKPLFMQMPAATLTMCPSCERIGSATA


Sequences
CDS sequence
ATGGGTTTTGACGATCTTCCTAATACGGGTCTTCTATTGGGATTGGGGTTAACGCTTCAATCCAAGCCCGCCTGCCTTTCTCAAAAACCCAACAACCCTGTCGATTTGTT
CAGTTTCCCCGCCGTCGAATCCGAGCCGTCCTTAACTTTGGGGCTTTCTACTACGGCTGACTTGTGTCGTCAACCGTCCCCTCACAGCACGGTTTCTTCCCTCTCTGGCG
GAAGGGTCAAGCGTGAAAGAGACGTTTCCGGCGAGGATATTGAAGAAGAGAAAGCTTCTTCTCGGGTTAGTGATGAGGAGGAAGATGGGTCTAATGCTAGGAAGAAACTT
AGGCTAACTAAAGAACAATCCGCCCTCTTGGAGGAGAGCTTCAAACTTCACTGCACTCTCAACCCTAAGCAAAAACAAGCCTTGGCCAGAGAGTTGAATCTTCGGCCTCG
ACAAGTTGAAGTTTGGTTCCAGAACAGAAGAGCCAGGACAAAGCTAAAGCAAACAGAGGTAGATTGCGAGTTTCTAAAGAGATGCTGCGAAACGCTAACAGACGAAAACA
GAAGACTGCAAAAAGAGCTGCAAGAATTGAAAGCCCTGAAACTAACCAAGCCTCTGTTCATGCAAATGCCGGCGGCAACACTCACCATGTGCCCCTCCTGCGAGCGGATC
GGCAGCGCCACCGCCACTGTTAACGGCAACGGGAATTCCAAGGCCGCATTTTCAATGGCTCCAAAGCCCCAGTTTTACAAACCCTTCACCAATCCCTCTGCTGCTTGCTA
A
mRNA sequence
CATTTCTCTCTTCTTAAACGCCATCCCTCGCCTCAAAATCCCCACGCCATCGCTTCAAAAAACGCCTCCAAAATCTTCAACCTCAAATCTTCATTCATATACCCAGATGG
GTTTTGACGATCTTCCTAATACGGGTCTTCTATTGGGATTGGGGTTAACGCTTCAATCCAAGCCCGCCTGCCTTTCTCAAAAACCCAACAACCCTGTCGATTTGTTCAGT
TTCCCCGCCGTCGAATCCGAGCCGTCCTTAACTTTGGGGCTTTCTACTACGGCTGACTTGTGTCGTCAACCGTCCCCTCACAGCACGGTTTCTTCCCTCTCTGGCGGAAG
GGTCAAGCGTGAAAGAGACGTTTCCGGCGAGGATATTGAAGAAGAGAAAGCTTCTTCTCGGGTTAGTGATGAGGAGGAAGATGGGTCTAATGCTAGGAAGAAACTTAGGC
TAACTAAAGAACAATCCGCCCTCTTGGAGGAGAGCTTCAAACTTCACTGCACTCTCAACCCTAAGCAAAAACAAGCCTTGGCCAGAGAGTTGAATCTTCGGCCTCGACAA
GTTGAAGTTTGGTTCCAGAACAGAAGAGCCAGGACAAAGCTAAAGCAAACAGAGGTAGATTGCGAGTTTCTAAAGAGATGCTGCGAAACGCTAACAGACGAAAACAGAAG
ACTGCAAAAAGAGCTGCAAGAATTGAAAGCCCTGAAACTAACCAAGCCTCTGTTCATGCAAATGCCGGCGGCAACACTCACCATGTGCCCCTCCTGCGAGCGGATCGGCA
GCGCCACCGCCACTGTTAACGGCAACGGGAATTCCAAGGCCGCATTTTCAATGGCTCCAAAGCCCCAGTTTTACAAACCCTTCACCAATCCCTCTGCTGCTTGCTAA
Protein sequence
MGFDDLPNTGLLLGLGLTLQSKPACLSQKPNNPVDLFSFPAVESEPSLTLGLSTTADLCRQPSPHSTVSSLSGGRVKRERDVSGEDIEEEKASSRVSDEEEDGSNARKKL
RLTKEQSALLEESFKLHCTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQELKALKLTKPLFMQMPAATLTMCPSCERI
GSATATVNGNGNSKAAFSMAPKPQFYKPFTNPSAAC
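
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `translate` is a hypothetical helper written for illustration, and only short fragments copied from the start and end of the CDS are used here.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace from line wrapping is removed first; codons containing
    unexpected characters are translated as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt and last 21 nt of the CDS shown above.
print(translate("ATGGGTTTTGACGATCTTCCTAATACGGGT"))  # prints: MGFDDLPNTG
print(translate("AATCCCTCTGCTGCTTGCTAA"))           # prints: NPSAAC
```

Both fragments agree with the start (MGFDDLPNTG) and end (NPSAAC) of the protein sequence. Passing the full wrapped CDS through the same function should reproduce the 256-residue protein, since 256 codons plus one stop codon correspond to 771 nt.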