; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc07g0182881 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc07g0182881
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionHomeobox-leucine zipper protein family
Genome locationCMiso1.1chr07:1616756..1618758
RNA-Seq ExpressionCmc07g0182881
SyntenyCmc07g0182881
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR001356 - Homeobox domain
IPR003106 - Leucine zipper, homeobox-associated
IPR009057 - Homeobox-like domain superfamily
IPR017970 - Homeobox, conserved site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8649900.1 hypothetical protein Csa_011929 [Cucumis sativus]6.9e-12893.13Show/hide
Query:  MDSDCNTGLLLGLGRVSGDDINDSVRSGVAGVNKKKL-QVLKFDDDILPSLTLGLSFVVETATEEGCSGSPVSSFSNSSGFKRERAAGEDVAETEECMKV
        MD+DCNTGLLLGLGRVSG +IN SVRS +  +NKKKL QVLKFDDDILPSLTLGLSFVV+TATE+GCSGSPVSSFSNSSGFKRER AGE+VAETEECMKV
Subjt:  MDSDCNTGLLLGLGRVSGDDINDSVRSGVAGVNKKKL-QVLKFDDDILPSLTLGLSFVVETATEEGCSGSPVSSFSNSSGFKRERAAGEDVAETEECMKV

Query:  GEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQEIKSLK
        GEEDE+GSPRKKLRLTK QSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQE+KSLK
Subjt:  GEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQEIKSLK

Query:  LTPPPFCMQLQAATLTVCPSCESSICGGGSSGGDASPANNFSIGSKPQFLKFPFNHPSAACN
        LTPPPFCMQLQAATLTVCPSCESSICGG SSGGDASPANNFSIGSKPQFLKFPFNHPSAACN
Subjt:  LTPPPFCMQLQAATLTVCPSCESSICGGGSSGGDASPANNFSIGSKPQFLKFPFNHPSAACN

KAG6591449.1 Homeobox-leucine zipper protein HAT9, partial [Cucurbita argyrosperma subsp. sororia]3.1e-10479.65Show/hide
Query:  MDSDCNTGLLLGLGRVSGDDINDSVR---SGVAGVNKKKLQVLKFDDDILPSLTLGLSFVVE--------TATEE----GCSGSPVSSFSNSSGFKRER-
        MD+DCNTGLLLGLGR  G D ++S+R     VAGV KKKLQVLKF DDILPSLTLGLS VVE        TA EE    G SGSPVSSFSNSSGFKRER 
Subjt:  MDSDCNTGLLLGLGRVSGDDINDSVR---SGVAGVNKKKLQVLKFDDDILPSLTLGLSFVVE--------TATEE----GCSGSPVSSFSNSSGFKRER-

Query:  -AAGEDVAETEE-----CMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCC
          AGE+  E E       MKV EE+EDGSPRKKLRLTK+QSA+LEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCC
Subjt:  -AAGEDVAETEE-----CMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCC

Query:  EKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGGSSGG---DASPANNFSIGSKPQFLKFPFNHPSAAC
        EKLKEENT+LQKELQE+KSLKLT PPFCMQLQAATLTVCPSCESSICGGG  GG   DASPAN FSIGSKP FLKFPFNHPSAAC
Subjt:  EKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGGSSGG---DASPANNFSIGSKPQFLKFPFNHPSAAC

XP_004141416.1 homeobox-leucine zipper protein HAT9 [Cucumis sativus]6.9e-12893.13Show/hide
Query:  MDSDCNTGLLLGLGRVSGDDINDSVRSGVAGVNKKKL-QVLKFDDDILPSLTLGLSFVVETATEEGCSGSPVSSFSNSSGFKRERAAGEDVAETEECMKV
        MD+DCNTGLLLGLGRVSG +IN SVRS +  +NKKKL QVLKFDDDILPSLTLGLSFVV+TATE+GCSGSPVSSFSNSSGFKRER AGE+VAETEECMKV
Subjt:  MDSDCNTGLLLGLGRVSGDDINDSVRSGVAGVNKKKL-QVLKFDDDILPSLTLGLSFVVETATEEGCSGSPVSSFSNSSGFKRERAAGEDVAETEECMKV

Query:  GEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQEIKSLK
        GEEDE+GSPRKKLRLTK QSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQE+KSLK
Subjt:  GEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQEIKSLK

Query:  LTPPPFCMQLQAATLTVCPSCESSICGGGSSGGDASPANNFSIGSKPQFLKFPFNHPSAACN
        LTPPPFCMQLQAATLTVCPSCESSICGG SSGGDASPANNFSIGSKPQFLKFPFNHPSAACN
Subjt:  LTPPPFCMQLQAATLTVCPSCESSICGGGSSGGDASPANNFSIGSKPQFLKFPFNHPSAACN

XP_023536403.1 homeobox-leucine zipper protein HAT9-like [Cucurbita pepo subsp. pepo]1.8e-10479.37Show/hide
Query:  MDSDCNTGLLLGLGRVSGDDINDSVR---SGVAGVNKKKLQVLKFDDDILPSLTLGLSFVVE--------TATEE----GCSGSPVSSFSNSSGFKRER-
        MD+DCNTGLLLGLGR  G D ++S+R     VAGV KKKLQVLKF DDILPSLTLGLS VV+        TA +E    G SGSPVSSFSNSSGFKRER 
Subjt:  MDSDCNTGLLLGLGRVSGDDINDSVR---SGVAGVNKKKLQVLKFDDDILPSLTLGLSFVVE--------TATEE----GCSGSPVSSFSNSSGFKRER-

Query:  -AAGEDVAETEE-----CMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCC
          AGE+ AETE       MKV EE+EDGSPRKKLRLTK+QSA+LEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCC
Subjt:  -AAGEDVAETEE-----CMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCC

Query:  EKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGGSSGG----DASPANNFSIGSKPQFLKFPFNHPSAAC
        EKLKEENT+LQKELQE+KSLKLT PPFCMQLQAATLTVCPSCESSICGGG  GG    DASPAN FSIGSKP FLKFPFNHPSAAC
Subjt:  EKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGGSSGG----DASPANNFSIGSKPQFLKFPFNHPSAAC

XP_038898886.1 homeobox-leucine zipper protein HAT22-like [Benincasa hispida]8.5e-11083.81Show/hide
Query:  MDSDCNTGLLLGLGRVSGDD-INDSVRSGVAGVNKKKLQVLKFDDDILPSLTLGLSFVVE----TATEE----GCSGSPVSSFSNSSGFKRER----AAG
        M+ DCNTGLLLGLGRVS DD +   V   VAGV KKKLQVLKF DDILPSLTLGLS VVE     ATEE    GCSGSPVSSFSNSSGFKRER      G
Subjt:  MDSDCNTGLLLGLGRVSGDD-INDSVRSGVAGVNKKKLQVLKFDDDILPSLTLGLSFVVE----TATEE----GCSGSPVSSFSNSSGFKRER----AAG

Query:  EDVAETEE-----CMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLK
         + AE EE     CMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLA+QLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLK
Subjt:  EDVAETEE-----CMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLK

Query:  EENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGGSSGGDASPANNFSIGSKPQFLKFPFNHPSAAC
        EENTRLQKELQE+KSLKLT PPFCMQLQAATLTVCPSCESSICGGG  GGDASPAN FSI SKPQFLKFPFNHPSAAC
Subjt:  EENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGGSSGGDASPANNFSIGSKPQFLKFPFNHPSAAC

TrEMBL top hitse value%identityAlignment
A0A0A0L3X3 Homeobox domain-containing protein3.4e-12893.13Show/hide
Query:  MDSDCNTGLLLGLGRVSGDDINDSVRSGVAGVNKKKL-QVLKFDDDILPSLTLGLSFVVETATEEGCSGSPVSSFSNSSGFKRERAAGEDVAETEECMKV
        MD+DCNTGLLLGLGRVSG +IN SVRS +  +NKKKL QVLKFDDDILPSLTLGLSFVV+TATE+GCSGSPVSSFSNSSGFKRER AGE+VAETEECMKV
Subjt:  MDSDCNTGLLLGLGRVSGDDINDSVRSGVAGVNKKKL-QVLKFDDDILPSLTLGLSFVVETATEEGCSGSPVSSFSNSSGFKRERAAGEDVAETEECMKV

Query:  GEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQEIKSLK
        GEEDE+GSPRKKLRLTK QSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQE+KSLK
Subjt:  GEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQEIKSLK

Query:  LTPPPFCMQLQAATLTVCPSCESSICGGGSSGGDASPANNFSIGSKPQFLKFPFNHPSAACN
        LTPPPFCMQLQAATLTVCPSCESSICGG SSGGDASPANNFSIGSKPQFLKFPFNHPSAACN
Subjt:  LTPPPFCMQLQAATLTVCPSCESSICGGGSSGGDASPANNFSIGSKPQFLKFPFNHPSAACN

A0A6A1W3F2 Homeobox-leucine zipper protein HAT225.3e-6556.79Show/hide
Query:  MDSDCNTGLLLGLGRVSGDDINDSVRSGVAGVNKKKLQV-LKFDDDILPSLTLGLS---FVVETAT---------------EEGCSGSPVSSFSNSSGFK
        +D  CNTGL L LG  S D   + +R    G NK+K +  LK+D  + PSL+LG S   F  + AT               ++  S S VSSFSNSS  K
Subjt:  MDSDCNTGLLLGLGRVSGDDINDSVRSGVAGVNKKKLQV-LKFDDDILPSLTLGLS---FVVETAT---------------EEGCSGSPVSSFSNSSGFK

Query:  RERAAGEDVAETEE-CMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEK
        RE+ +  +  E E+ C +V +EDE+  PRKKLRLTK+QSA+LED FKEH++L PKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTE+DCELLKKCCE 
Subjt:  RERAAGEDVAETEE-CMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEK

Query:  LKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGGSSGGDASPANNFSIGSKPQFLKFPFNHPSAAC
        L EEN RL+KELQE+KSLKL   PF MQL AATLT+CPSCE       S+ GD S  + FS+ +    L  PF HPSAAC
Subjt:  LKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGGSSGGDASPANNFSIGSKPQFLKFPFNHPSAAC

A0A6J1C954 homeobox-leucine zipper protein HAT9-like2.4e-8969.86Show/hide
Query:  MDSDCN--TGLLLGLGRVSGDDINDSVRSGVAGVNKKKLQVLKFDDDILPSLTLGLS-----------------FVVETATEEGCSGSPVSSFSNSSGFK
        MD DC+  TGLLLGLGR       + +RS V  V+ KK  VLKF DDILP LTLGLS                    E   ++G S SPVSSFSNSSG K
Subjt:  MDSDCN--TGLLLGLGRVSGDDINDSVRSGVAGVNKKKLQVLKFDDDILPSLTLGLS-----------------FVVETATEEGCSGSPVSSFSNSSGFK

Query:  RERAAG------EDVAETEE------CMKVG-EEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTE
        R+R+ G      E  AE  E        KVG +EDEDGSPRKKLRLTK+QSAILED+FKEHSSL+PKQK DLARQL+LRPRQVEVWFQNRRARTKLKQTE
Subjt:  RERAAG------EDVAETEE------CMKVG-EEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTE

Query:  MDCELLKKCCEKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGGSSGGDASPANNFSIGSKPQFLKFPFNHPSAAC
        MDCELLKKCCEKLKEENTRLQKELQE+KSLKLT PPFCMQLQAATLTVCPSCE SICGGG  GGDASP   FSIGSKP FLKFPFNHPSAAC
Subjt:  MDCELLKKCCEKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGGSSGGDASPANNFSIGSKPQFLKFPFNHPSAAC

A0A6J1F6L5 homeobox-leucine zipper protein HAT9-like1.7e-10378.4Show/hide
Query:  MDSDCNTGLLLGLGRVSGDDINDSVR---SGVAGVNKKKLQVLKFDDDILPSLTLGLSFVVE--------TATEE----GCSGSPVSSFSNSSGFKRER-
        MD+DCNTGLLLGLGR  G D ++S+R     VAGV KKKLQVLKF DDILPSLTLGLS VV+        TA +E    G SGSPVSSFS+SSGFKRER 
Subjt:  MDSDCNTGLLLGLGRVSGDDINDSVR---SGVAGVNKKKLQVLKFDDDILPSLTLGLSFVVE--------TATEE----GCSGSPVSSFSNSSGFKRER-

Query:  -AAGEDVAETEE-----CMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCC
          AGE+ AE E       MKV EE+EDGSPRKKLRLTK+QSA+LEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCC
Subjt:  -AAGEDVAETEE-----CMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCC

Query:  EKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGGSSGG-----DASPANNFSIGSKPQFLKFPFNHPSAAC
        EKLKEENT+LQKELQE+KSLKLT PPFCMQLQAATLTVCPSCESSICGGG  GG     DASPAN FSIGSKP FLKFPFNHPSAAC
Subjt:  EKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGGSSGG-----DASPANNFSIGSKPQFLKFPFNHPSAAC

A0A6J1IHG6 homeobox-leucine zipper protein HAT9-like1.3e-10379Show/hide
Query:  MDSDCNTGLLLGLGRVSGDDINDSVR---SGVAGVNKKKLQVLKFDDDILPSLTLGLSFVVETA---------TEEGCSGSPVSSFSNSSGFKRER--AA
        MD+DCN GLLLGLGR  G D ++S+R     VAGV KKKLQVLKF DDILPSLTLGLS VVE +           +G SGSP SSFSNSSGFKRER   A
Subjt:  MDSDCNTGLLLGLGRVSGDDINDSVR---SGVAGVNKKKLQVLKFDDDILPSLTLGLSFVVETA---------TEEGCSGSPVSSFSNSSGFKRER--AA

Query:  GEDVAETEE-----CMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKL
        GE+ AE E       MKV EE+EDGSPRKKLRLTK+QSA+LEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKL
Subjt:  GEDVAETEE-----CMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKL

Query:  KEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGGSSGG--DASPANNFSIGSKPQFLKFPFNHPSAAC
        KEENT+LQKELQE+KSLKLT PPFCMQLQAATLTVCPSCESSICGGG  GG  DASPAN FSIGSKP FLKFPFNHPSAAC
Subjt:  KEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGGSSGG--DASPANNFSIGSKPQFLKFPFNHPSAAC

SwissProt top hitse value%identityAlignment
A2XE76 Homeobox-leucine zipper protein HOX196.2e-4751.02Show/hide
Query:  PSLTLGL-------SFVVETATEEGCSGSPVSSFSN-------SSGFKRERAAGEDVAETEECMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLS
        PSLTL L       +    TAT  G  G P  S S+       ++  KRERA   D           ++D+DGS RKKLRLTK+QSA+LED F+EHS+L+
Subjt:  PSLTLGL-------SFVVETATEEGCSGSPVSSFSN-------SSGFKRERAAGEDVAETEECMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLS

Query:  PKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQEIKSLKLTPP---------------PFCMQLQAATLTVCP
        PKQK  LA+QLNLRPRQVEVWFQNRRARTKLKQTE+DCE LK+CCE L EEN RLQ+ELQE+++LK  PP               PF MQL AATLT+CP
Subjt:  PKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQEIKSLKLTPP---------------PFCMQLQAATLTVCP

Query:  SCESSICGGGSSGGDASPANNFSIG---SKPQFLKFPFNHPSAAC
        SCE    GG +S      A+    G   +       PF H SAAC
Subjt:  SCESSICGGGSSGGDASPANNFSIG---SKPQFLKFPFNHPSAAC

P46603 Homeobox-leucine zipper protein HAT91.6e-5854.74Show/hide
Query:  DSDCNTGLLLGLGRVSGDDINDSVRSGVAGVNKKKLQVLKFDDDILPSLTLGL----SFVVETATEEGC----SGSPVSSFSNSSGFKRERAAGEDVAE-
        D  CNTGL+LGLG      I ++  S +   +  KL+         PSLTL L    S  V T  ++ C    S S VSSFS+    KRER  GE+  E 
Subjt:  DSDCNTGLLLGLGRVSGDDINDSVRSGVAGVNKKKLQVLKFDDDILPSLTLGL----SFVVETATEEGC----SGSPVSSFSNSSGFKRERAAGEDVAE-

Query:  ---TEECMKVGEEDEDG-SPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRL
           TE  +    EDE+G S RKKLRLTKQQSA+LE++FK+HS+L+PKQKQ LARQLNLRPRQVEVWFQNRRARTKLKQTE+DCE LKKCCE L +EN RL
Subjt:  ---TEECMKVGEEDEDG-SPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRL

Query:  QKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGGSSGG------------DASPANN-FSIGSKPQFLKFPFNHPSAAC
        QKE+QE+K+LKLT  PF M + A+TLT CPSCE    GGG +GG            D S A   FSI SKP F   PF +PSAAC
Subjt:  QKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGGSSGG------------DASPANN-FSIGSKPQFLKFPFNHPSAAC

P46604 Homeobox-leucine zipper protein HAT221.8e-5752.78Show/hide
Query:  MDSDCNTGLLLGLG-RVSGDDINDSVRSGVAGVNKKKLQVLKFDDDILPSLTLGL---SFVVETATEEG-------CSGSPVSSFSNSSGFKRER--AAG
        +D  CNTGL+LGLG   + ++ N +++   + V+ + ++       + PSLTL L   S+ ++T    G        S S +SSFS S   KRER  + G
Subjt:  MDSDCNTGLLLGLG-RVSGDDINDSVRSGVAGVNKKKLQVLKFDDDILPSLTLGL---SFVVETATEEG-------CSGSPVSSFSNSSGFKRER--AAG

Query:  EDVAETEE------CMKVGE--EDEDG-SPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCC
        +   E EE      C +V +  +DE+G S RKKLRLTKQQSA+LEDNFK HS+L+PKQKQ LARQLNLRPRQVEVWFQNRRARTKLKQTE+DCE LKKCC
Subjt:  EDVAETEE------CMKVGE--EDEDG-SPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCC

Query:  EKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGGSSGGDASPANN------FSIGSKPQFLKFPFNHPSAAC
        E L +EN RLQKELQ++K+LKL+  PF M + AATLT+CPSCE    GGG  GGD +  +       FSI +KP+F   PF +PSAAC
Subjt:  EKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGGSSGGDASPANN------FSIGSKPQFLKFPFNHPSAAC

P46665 Homeobox-leucine zipper protein HAT141.8e-4156.84Show/hide
Query:  LTLGLSFVVETATEE-----GCSGSP----VSSFSNSSGFK----RERAAGEDVAETEE-----CMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSS
        + LG + VVE   EE       S SP     SSF    G K      R+   D+ +  E           +DE+GS RKKLRL+K QSA LED+FKEHS+
Subjt:  LTLGLSFVVETATEE-----GCSGSP----VSSFSNSSGFK----RERAAGEDVAETEE-----CMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSS

Query:  LSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCE
        L+PKQK  LA+QLNLRPRQVEVWFQNRRARTKLKQTE+DCE LK+CCE L EEN RLQKE++E+++LK T  PF MQL A TLT+CPSCE
Subjt:  LSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCE

Q8GRL4 Homeobox-leucine zipper protein HOX196.2e-4751.02Show/hide
Query:  PSLTLGL-------SFVVETATEEGCSGSPVSSFSN-------SSGFKRERAAGEDVAETEECMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLS
        PSLTL L       +    TAT  G  G P  S S+       ++  KRERA   D           ++D+DGS RKKLRLTK+QSA+LED F+EHS+L+
Subjt:  PSLTLGL-------SFVVETATEEGCSGSPVSSFSN-------SSGFKRERAAGEDVAETEECMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLS

Query:  PKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQEIKSLKLTPP---------------PFCMQLQAATLTVCP
        PKQK  LA+QLNLRPRQVEVWFQNRRARTKLKQTE+DCE LK+CCE L EEN RLQ+ELQE+++LK  PP               PF MQL AATLT+CP
Subjt:  PKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQEIKSLKLTPP---------------PFCMQLQAATLTVCP

Query:  SCESSICGGGSSGGDASPANNFSIG---SKPQFLKFPFNHPSAAC
        SCE    GG +S      A+    G   +       PF H SAAC
Subjt:  SCESSICGGGSSGGDASPANNFSIG---SKPQFLKFPFNHPSAAC

Arabidopsis top hitse value%identityAlignment
AT2G22800.1 Homeobox-leucine zipper protein family1.1e-5954.74Show/hide
Query:  DSDCNTGLLLGLGRVSGDDINDSVRSGVAGVNKKKLQVLKFDDDILPSLTLGL----SFVVETATEEGC----SGSPVSSFSNSSGFKRERAAGEDVAE-
        D  CNTGL+LGLG      I ++  S +   +  KL+         PSLTL L    S  V T  ++ C    S S VSSFS+    KRER  GE+  E 
Subjt:  DSDCNTGLLLGLGRVSGDDINDSVRSGVAGVNKKKLQVLKFDDDILPSLTLGL----SFVVETATEEGC----SGSPVSSFSNSSGFKRERAAGEDVAE-

Query:  ---TEECMKVGEEDEDG-SPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRL
           TE  +    EDE+G S RKKLRLTKQQSA+LE++FK+HS+L+PKQKQ LARQLNLRPRQVEVWFQNRRARTKLKQTE+DCE LKKCCE L +EN RL
Subjt:  ---TEECMKVGEEDEDG-SPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRL

Query:  QKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGGSSGG------------DASPANN-FSIGSKPQFLKFPFNHPSAAC
        QKE+QE+K+LKLT  PF M + A+TLT CPSCE    GGG +GG            D S A   FSI SKP F   PF +PSAAC
Subjt:  QKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGGSSGG------------DASPANN-FSIGSKPQFLKFPFNHPSAAC

AT2G44910.1 homeobox-leucine zipper protein 41.2e-4054.8Show/hide
Query:  VVETATEEGCSGSPVSSFSNSSGFKRE----RAAGEDVAETEECMK----VGEEDEDG----SPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQL
        VV+   E     SP S+ S+ SG KR+    R   E+ AE   C +     G +DEDG      RKKLRL+K Q+ +LE+ FKEHS+L+PKQK  LA+QL
Subjt:  VVETATEEGCSGSPVSSFSNSSGFKRE----RAAGEDVAETEECMK----VGEEDEDG----SPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQL

Query:  NLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCE
        NLR RQVEVWFQNRRARTKLKQTE+DCE LK+CC+ L EEN RLQKE+ E+++LKL+P  +       TLT+CPSCE
Subjt:  NLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCE

AT4G16780.1 homeobox protein 25.8e-4054.6Show/hide
Query:  ETATEEGCSGSPVSSFSNSSGFKRERAAGEDVAETEECMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNR
        E   E+    SP S+ S+S+G + ER   E+  + +    + ++++  + RKKLRL+K QSAILE+ FK+HS+L+PKQKQ LA+QL LR RQVEVWFQNR
Subjt:  ETATEEGCSGSPVSSFSNSSGFKRERAAGEDVAETEECMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNR

Query:  RARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCE
        RARTKLKQTE+DCE L++CCE L EEN RLQKE+ E+++LKL+P  +       TLT+CPSCE
Subjt:  RARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCE

AT4G37790.1 Homeobox-leucine zipper protein family1.2e-5852.78Show/hide
Query:  MDSDCNTGLLLGLG-RVSGDDINDSVRSGVAGVNKKKLQVLKFDDDILPSLTLGL---SFVVETATEEG-------CSGSPVSSFSNSSGFKRER--AAG
        +D  CNTGL+LGLG   + ++ N +++   + V+ + ++       + PSLTL L   S+ ++T    G        S S +SSFS S   KRER  + G
Subjt:  MDSDCNTGLLLGLG-RVSGDDINDSVRSGVAGVNKKKLQVLKFDDDILPSLTLGL---SFVVETATEEG-------CSGSPVSSFSNSSGFKRER--AAG

Query:  EDVAETEE------CMKVGE--EDEDG-SPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCC
        +   E EE      C +V +  +DE+G S RKKLRLTKQQSA+LEDNFK HS+L+PKQKQ LARQLNLRPRQVEVWFQNRRARTKLKQTE+DCE LKKCC
Subjt:  EDVAETEE------CMKVGE--EDEDG-SPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCC

Query:  EKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGGSSGGDASPANN------FSIGSKPQFLKFPFNHPSAAC
        E L +EN RLQKELQ++K+LKL+  PF M + AATLT+CPSCE    GGG  GGD +  +       FSI +KP+F   PF +PSAAC
Subjt:  EKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGGSSGGDASPANN------FSIGSKPQFLKFPFNHPSAAC

AT5G06710.1 homeobox from Arabidopsis thaliana1.3e-4256.84Show/hide
Query:  LTLGLSFVVETATEE-----GCSGSP----VSSFSNSSGFK----RERAAGEDVAETEE-----CMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSS
        + LG + VVE   EE       S SP     SSF    G K      R+   D+ +  E           +DE+GS RKKLRL+K QSA LED+FKEHS+
Subjt:  LTLGLSFVVETATEE-----GCSGSP----VSSFSNSSGFK----RERAAGEDVAETEE-----CMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSS

Query:  LSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCE
        L+PKQK  LA+QLNLRPRQVEVWFQNRRARTKLKQTE+DCE LK+CCE L EEN RLQKE++E+++LK T  PF MQL A TLT+CPSCE
Subjt:  LSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCAGATTGCAACACGGGGCTTCTTCTTGGCCTAGGGAGGGTTTCCGGAGACGATATTAATGATTCTGTGCGGTCCGGTGTAGCCGGTGTTAATAAGAAGAAGCT
GCAGGTTTTGAAGTTCGATGATGATATTTTGCCTTCTTTGACGCTTGGATTGTCGTTTGTTGTCGAGACTGCGACGGAGGAGGGATGTTCGGGTAGTCCGGTTTCGTCGT
TTTCCAACTCGTCGGGGTTTAAGAGGGAACGCGCTGCTGGCGAGGATGTGGCGGAGACGGAGGAGTGTATGAAAGTGGGTGAGGAAGATGAAGATGGAAGTCCGAGGAAG
AAACTTAGATTAACAAAACAACAGTCCGCCATTTTAGAGGACAATTTCAAAGAACACTCGAGTCTTAGTCCTAAGCAAAAGCAGGATTTGGCTAGGCAATTAAACCTAAG
GCCAAGACAAGTGGAAGTATGGTTTCAAAACAGAAGAGCCAGAACCAAGCTGAAGCAAACAGAAATGGATTGTGAATTACTGAAGAAATGCTGTGAAAAGCTAAAAGAAG
AAAACACAAGGCTTCAAAAGGAACTTCAAGAAATTAAATCACTCAAACTAACGCCTCCACCGTTCTGCATGCAACTACAAGCCGCCACTCTCACCGTTTGCCCTTCATGT
GAGAGCTCCATTTGTGGCGGCGGCAGCAGCGGCGGTGATGCATCTCCGGCCAATAACTTCTCAATTGGGTCAAAGCCTCAATTTCTCAAGTTCCCATTTAACCATCCATC
GGCGGCTTGTAATTAG
mRNA sequenceShow/hide mRNA sequence
TCCCCACAATCCCATATTCCCAACCCTAAGAAACTTCCCCTCTATTTCTCCCCTTTATATTCCCCCCCTCTCAAGCTTACTTTCTTTTCATTTTTCATTTTTCATTTTGT
CAAACAATGGATTCAGATTGCAACACGGGGCTTCTTCTTGGCCTAGGGAGGGTTTCCGGAGACGATATTAATGATTCTGTGCGGTCCGGTGTAGCCGGTGTTAATAAGAA
GAAGCTGCAGGTTTTGAAGTTCGATGATGATATTTTGCCTTCTTTGACGCTTGGATTGTCGTTTGTTGTCGAGACTGCGACGGAGGAGGGATGTTCGGGTAGTCCGGTTT
CGTCGTTTTCCAACTCGTCGGGGTTTAAGAGGGAACGCGCTGCTGGCGAGGATGTGGCGGAGACGGAGGAGTGTATGAAAGTGGGTGAGGAAGATGAAGATGGAAGTCCG
AGGAAGAAACTTAGATTAACAAAACAACAGTCCGCCATTTTAGAGGACAATTTCAAAGAACACTCGAGTCTTAGTCCTAAGCAAAAGCAGGATTTGGCTAGGCAATTAAA
CCTAAGGCCAAGACAAGTGGAAGTATGGTTTCAAAACAGAAGAGCCAGAACCAAGCTGAAGCAAACAGAAATGGATTGTGAATTACTGAAGAAATGCTGTGAAAAGCTAA
AAGAAGAAAACACAAGGCTTCAAAAGGAACTTCAAGAAATTAAATCACTCAAACTAACGCCTCCACCGTTCTGCATGCAACTACAAGCCGCCACTCTCACCGTTTGCCCT
TCATGTGAGAGCTCCATTTGTGGCGGCGGCAGCAGCGGCGGTGATGCATCTCCGGCCAATAACTTCTCAATTGGGTCAAAGCCTCAATTTCTCAAGTTCCCATTTAACCA
TCCATCGGCGGCTTGTAATTAGGTGGCAGCCTAATTTTAATTATAATATCATTAATTAAAATTGACCTGAAGACTGCTATTTTTCCTTGGGGCTTCTTGGTCAATATATA
TATATATAACACTGATATTTATGTATTTAATATGTTATTATATGTACAAATATATTGTTGTGTTTTGAACACATTGGAAAAGATTAGGCTTTGTAGAAATCAAAACTATG
AGTTATGACGACATACTTTGTATTTAAGTTTTCATTTTGGTGATATTAATAGTAAATTTTGTTATTGGTGGTGAAGTCCTTGTCCTTCATTAATCAATTTAGCTCGTCTA
TATAG
Protein sequenceShow/hide protein sequence
MDSDCNTGLLLGLGRVSGDDINDSVRSGVAGVNKKKLQVLKFDDDILPSLTLGLSFVVETATEEGCSGSPVSSFSNSSGFKRERAAGEDVAETEECMKVGEEDEDGSPRK
KLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSC
ESSICGGGSSGGDASPANNFSIGSKPQFLKFPFNHPSAACN