CuGenDBv2

IVF0002429 (gene) of Melon (IVF77) v1 genome

Gene ID: IVF0002429
Organism: Cucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: Homeobox-leucine zipper protein family
Genome location: chr07:1594452..1596402
RNA-Seq Expression: IVF0002429
Synteny: IVF0002429
Gene Ontology terms:
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR001356 - Homeobox domain
IPR003106 - Leucine zipper, homeobox-associated
IPR009057 - Homeobox-like domain superfamily
IPR017970 - Homeobox, conserved site
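
The genome location above is written as chromosome:start..end. A minimal sketch of splitting that string into its parts follows. The helper name parse_location is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span is
    end - start + 1. The source page does not confirm this convention.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


if __name__ == "__main__":
    print(parse_location("chr07:1594452..1596402"))
```

Under the inclusive-coordinate assumption, the gene region for IVF0002429 spans 1,951 bases.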


Homology
GenBank top hits (e value, % identity, alignment)
KAE8649900.1 hypothetical protein Csa_011929 [Cucumis sativus] (e value: 4.02e-162; % identity: 93.13)
Query:  MDSDCNTGLLLGLGSVSGDDINDSVRSGVAGVNKKKLQ-VLKFDDDILPSLTLGLSFVVETATEEGCSGSPVSSFSNSSGFKRERAAGEDVAETEECMKV
        MD+DCNTGLLLGLG VSG +IN SVRS +  +NKKKLQ VLKFDDDILPSLTLGLSFVV+TATE+GCSGSPVSSFSNSSGFKRERA GE+VAETEECMKV
Subjt:  MDSDCNTGLLLGLGSVSGDDINDSVRSGVAGVNKKKLQ-VLKFDDDILPSLTLGLSFVVETATEEGCSGSPVSSFSNSSGFKRERAAGEDVAETEECMKV

Query:  GEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQEIKSLK
        GEEDE+GSPRKKLRLTK QSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQE+KSLK
Subjt:  GEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQEIKSLK

Query:  LTPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANNFSIGSKPQFLKFPFNHPSAACN
        LTPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANNFSIGSKPQFLKFPFNHPSAACN
Subjt:  LTPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANNFSIGSKPQFLKFPFNHPSAACN

KAG6591449.1 Homeobox-leucine zipper protein HAT9, partial [Cucurbita argyrosperma subsp. sororia] (e value: 8.96e-130; % identity: 78.95)
Query:  MDSDCNTGLLLGLGSVSGDDINDSVRS---GVAGVNKKKLQVLKFDDDILPSLTLGLSFVVE--------TATEE----GCSGSPVSSFSNSSGFKRER-
        MD+DCNTGLLLGLG   G D ++S+R     VAGV KKKLQVLKFDD ILPSLTLGLS VVE        TA EE    G SGSPVSSFSNSSGFKRER 
Subjt:  MDSDCNTGLLLGLGSVSGDDINDSVRS---GVAGVNKKKLQVLKFDDDILPSLTLGLSFVVE--------TATEE----GCSGSPVSSFSNSSGFKRER-

Query:  -AAGEDVAETEE-----CMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCC
          AGE+  E E       MKV EE+EDGSPRKKLRLTK+QSA+LEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCC
Subjt:  -AAGEDVAETEE-----CMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCC

Query:  EKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGG---DASPANNFSIGSKPQFLKFPFNHPSAAC
        EKLKEENT+LQKELQE+KSLKLT PPFCMQLQAATLTVCPSCESSICGG   GG   DASPAN FSIGSKP FLKFPFNHPSAAC
Subjt:  EKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGG---DASPANNFSIGSKPQFLKFPFNHPSAAC

XP_004141416.1 homeobox-leucine zipper protein HAT9 [Cucumis sativus] (e value: 1.13e-162; % identity: 93.13)
Query:  MDSDCNTGLLLGLGSVSGDDINDSVRSGVAGVNKKKLQ-VLKFDDDILPSLTLGLSFVVETATEEGCSGSPVSSFSNSSGFKRERAAGEDVAETEECMKV
        MD+DCNTGLLLGLG VSG +IN SVRS +  +NKKKLQ VLKFDDDILPSLTLGLSFVV+TATE+GCSGSPVSSFSNSSGFKRERA GE+VAETEECMKV
Subjt:  MDSDCNTGLLLGLGSVSGDDINDSVRSGVAGVNKKKLQ-VLKFDDDILPSLTLGLSFVVETATEEGCSGSPVSSFSNSSGFKRERAAGEDVAETEECMKV

Query:  GEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQEIKSLK
        GEEDE+GSPRKKLRLTK QSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQE+KSLK
Subjt:  GEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQEIKSLK

Query:  LTPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANNFSIGSKPQFLKFPFNHPSAACN
        LTPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANNFSIGSKPQFLKFPFNHPSAACN
Subjt:  LTPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANNFSIGSKPQFLKFPFNHPSAACN

XP_023536403.1 homeobox-leucine zipper protein HAT9-like [Cucurbita pepo subsp. pepo] (e value: 4.61e-130; % identity: 78.67)
Query:  MDSDCNTGLLLGLGSVSGDDINDSVRS---GVAGVNKKKLQVLKFDDDILPSLTLGLSFVVE--------TATEE----GCSGSPVSSFSNSSGFKRER-
        MD+DCNTGLLLGLG   G D ++S+R     VAGV KKKLQVLKFDD ILPSLTLGLS VV+        TA +E    G SGSPVSSFSNSSGFKRER 
Subjt:  MDSDCNTGLLLGLGSVSGDDINDSVRS---GVAGVNKKKLQVLKFDDDILPSLTLGLSFVVE--------TATEE----GCSGSPVSSFSNSSGFKRER-

Query:  -AAGEDVAETEE-----CMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCC
          AGE+ AETE       MKV EE+EDGSPRKKLRLTK+QSA+LEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCC
Subjt:  -AAGEDVAETEE-----CMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCC

Query:  EKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGG----DASPANNFSIGSKPQFLKFPFNHPSAAC
        EKLKEENT+LQKELQE+KSLKLT PPFCMQLQAATLTVCPSCESSICGG   GG    DASPAN FSIGSKP FLKFPFNHPSAAC
Subjt:  EKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGG----DASPANNFSIGSKPQFLKFPFNHPSAAC

XP_038898886.1 homeobox-leucine zipper protein HAT22-like [Benincasa hispida] (e value: 3.59e-137; % identity: 83.09)
Query:  MDSDCNTGLLLGLGSVSGDD-INDSVRSGVAGVNKKKLQVLKFDDDILPSLTLGLSFVVE----TATEE----GCSGSPVSSFSNSSGFKRER----AAG
        M+ DCNTGLLLGLG VS DD +   V   VAGV KKKLQVLKFDD ILPSLTLGLS VVE     ATEE    GCSGSPVSSFSNSSGFKRER      G
Subjt:  MDSDCNTGLLLGLGSVSGDD-INDSVRSGVAGVNKKKLQVLKFDDDILPSLTLGLSFVVE----TATEE----GCSGSPVSSFSNSSGFKRER----AAG

Query:  EDVAETEE-----CMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLK
         + AE EE     CMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLA+QLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLK
Subjt:  EDVAETEE-----CMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLK

Query:  EENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANNFSIGSKPQFLKFPFNHPSAAC
        EENTRLQKELQE+KSLKLT PPFCMQLQAATLTVCPSCESSICGG   GGDASPAN FSI SKPQFLKFPFNHPSAAC
Subjt:  EENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANNFSIGSKPQFLKFPFNHPSAAC

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L3X3 Homeobox domain-containing protein (e value: 1.3e-127; % identity: 93.13)
Query:  MDSDCNTGLLLGLGSVSGDDINDSVRSGVAGVNKKKL-QVLKFDDDILPSLTLGLSFVVETATEEGCSGSPVSSFSNSSGFKRERAAGEDVAETEECMKV
        MD+DCNTGLLLGLG VSG +IN SVRS +  +NKKKL QVLKFDDDILPSLTLGLSFVV+TATE+GCSGSPVSSFSNSSGFKRER AGE+VAETEECMKV
Subjt:  MDSDCNTGLLLGLGSVSGDDINDSVRSGVAGVNKKKL-QVLKFDDDILPSLTLGLSFVVETATEEGCSGSPVSSFSNSSGFKRERAAGEDVAETEECMKV

Query:  GEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQEIKSLK
        GEEDE+GSPRKKLRLTK QSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQE+KSLK
Subjt:  GEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQEIKSLK

Query:  LTPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANNFSIGSKPQFLKFPFNHPSAACN
        LTPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANNFSIGSKPQFLKFPFNHPSAACN
Subjt:  LTPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANNFSIGSKPQFLKFPFNHPSAACN

A0A5A7VA64 Homeobox-leucine zipper protein HAT22-like (e value: 6.9e-65; % identity: 100)
Query:  MDSDCNTGLLLGLGSVSGDDINDSVRSGVAGVNKKKLQVLKFDDDILPSLTLGLSFVVETATEEGCSGSPVSSFSNSSGFKRERAAGEDVAETEECMKVG
        MDSDCNTGLLLGLGSVSGDDINDSVRSGVAGVNKKKLQVLKFDDDILPSLTLGLSFVVETATEEGCSGSPVSSFSNSSGFKRERAAGEDVAETEECMKVG
Subjt:  MDSDCNTGLLLGLGSVSGDDINDSVRSGVAGVNKKKLQVLKFDDDILPSLTLGLSFVVETATEEGCSGSPVSSFSNSSGFKRERAAGEDVAETEECMKVG

Query:  EEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSP
        EEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSP
Subjt:  EEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSP

A0A6J1C954 homeobox-leucine zipper protein HAT9-like (e value: 1.3e-87; % identity: 69.18)
Query:  MDSDCN--TGLLLGLGSVSGDDINDSVRSGVAGVNKKKLQVLKFDDDILPSLTLGLS-----------------FVVETATEEGCSGSPVSSFSNSSGFK
        MD DC+  TGLLLGLG        + +RS V  V+ KK  VLKF DDILP LTLGLS                    E   ++G S SPVSSFSNSSG K
Subjt:  MDSDCN--TGLLLGLGSVSGDDINDSVRSGVAGVNKKKLQVLKFDDDILPSLTLGLS-----------------FVVETATEEGCSGSPVSSFSNSSGFK

Query:  RERAAG------EDVAETEE------CMKVG-EEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTE
        R+R+ G      E  AE  E        KVG +EDEDGSPRKKLRLTK+QSAILED+FKEHSSL+PKQK DLARQL+LRPRQVEVWFQNRRARTKLKQTE
Subjt:  RERAAG------EDVAETEE------CMKVG-EEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTE

Query:  MDCELLKKCCEKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANNFSIGSKPQFLKFPFNHPSAAC
        MDCELLKKCCEKLKEENTRLQKELQE+KSLKLT PPFCMQLQAATLTVCPSCE SICGG   GGDASP   FSIGSKP FLKFPFNHPSAAC
Subjt:  MDCELLKKCCEKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANNFSIGSKPQFLKFPFNHPSAAC

A0A6J1F6L5 homeobox-leucine zipper protein HAT9-like (e value: 9.2e-102; % identity: 77.7)
Query:  MDSDCNTGLLLGLGSVSGDDINDSVR---SGVAGVNKKKLQVLKFDDDILPSLTLGLSFVVE--------TATEE----GCSGSPVSSFSNSSGFKRER-
        MD+DCNTGLLLGLG   G D ++S+R     VAGV KKKLQVLKF DDILPSLTLGLS VV+        TA +E    G SGSPVSSFS+SSGFKRER 
Subjt:  MDSDCNTGLLLGLGSVSGDDINDSVR---SGVAGVNKKKLQVLKFDDDILPSLTLGLSFVVE--------TATEE----GCSGSPVSSFSNSSGFKRER-

Query:  -AAGEDVAETEE-----CMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCC
          AGE+ AE E       MKV EE+EDGSPRKKLRLTK+QSA+LEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCC
Subjt:  -AAGEDVAETEE-----CMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCC

Query:  EKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGG-----DASPANNFSIGSKPQFLKFPFNHPSAAC
        EKLKEENT+LQKELQE+KSLKLT PPFCMQLQAATLTVCPSCESSICGG   GG     DASPAN FSIGSKP FLKFPFNHPSAAC
Subjt:  EKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGG-----DASPANNFSIGSKPQFLKFPFNHPSAAC

A0A6J1IHG6 homeobox-leucine zipper protein HAT9-like (e value: 7.1e-102; % identity: 78.29)
Query:  MDSDCNTGLLLGLGSVSGDDINDSVR---SGVAGVNKKKLQVLKFDDDILPSLTLGLSFVVETA---------TEEGCSGSPVSSFSNSSGFKRER--AA
        MD+DCN GLLLGLG   G D ++S+R     VAGV KKKLQVLKF DDILPSLTLGLS VVE +           +G SGSP SSFSNSSGFKRER   A
Subjt:  MDSDCNTGLLLGLGSVSGDDINDSVR---SGVAGVNKKKLQVLKFDDDILPSLTLGLSFVVETA---------TEEGCSGSPVSSFSNSSGFKRER--AA

Query:  GEDVAETEE-----CMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKL
        GE+ AE E       MKV EE+EDGSPRKKLRLTK+QSA+LEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKL
Subjt:  GEDVAETEE-----CMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKL

Query:  KEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGG--DASPANNFSIGSKPQFLKFPFNHPSAAC
        KEENT+LQKELQE+KSLKLT PPFCMQLQAATLTVCPSCESSICGG   GG  DASPAN FSIGSKP FLKFPFNHPSAAC
Subjt:  KEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGG--DASPANNFSIGSKPQFLKFPFNHPSAAC

SwissProt top hits (e value, % identity, alignment)
A2XE76 Homeobox-leucine zipper protein HOX19 (e value: 6.2e-47; % identity: 51.02)
Query:  PSLTLGL-------SFVVETATEEGCSGSPVSSFSN-------SSGFKRERAAGEDVAETEECMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLS
        PSLTL L       +    TAT  G  G P  S S+       ++  KRERA   D           ++D+DGS RKKLRLTK+QSA+LED F+EHS+L+
Subjt:  PSLTLGL-------SFVVETATEEGCSGSPVSSFSN-------SSGFKRERAAGEDVAETEECMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLS

Query:  PKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQEIKSLKLTPP---------------PFCMQLQAATLTVCP
        PKQK  LA+QLNLRPRQVEVWFQNRRARTKLKQTE+DCE LK+CCE L EEN RLQ+ELQE+++LK  PP               PF MQL AATLT+CP
Subjt:  PKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQEIKSLKLTPP---------------PFCMQLQAATLTVCP

Query:  SCESSICGGSSSGGDASPANNFSIG---SKPQFLKFPFNHPSAAC
        SCE    GG +S      A+    G   +       PF H SAAC
Subjt:  SCESSICGGSSSGGDASPANNFSIG---SKPQFLKFPFNHPSAAC

A2Z1U1 Homeobox-leucine zipper protein HOX11 (e value: 8.7e-41; % identity: 53.06)
Query:  EEGCSGSPVSSFSNSSG-------FKRERAAGEDVAE-----TEECMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQ
        ++  +G+ +SS  N+S        F      G D A         C +  +ED+ GS RKKLRL+K+QSA LE++FKEHS+L+PKQK  LA+QLNLRPRQ
Subjt:  EEGCSGSPVSSFSNSSG-------FKRERAAGEDVAE-----TEECMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQ

Query:  VEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANNFSIGSKP
        VEVWFQNRRARTKLKQTE+DCE LK+CCE L EEN RLQKEL E+++LK T  PF M L A TL++CPSCE      +S+   AS A   S  + P
Subjt:  VEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANNFSIGSKP

P46603 Homeobox-leucine zipper protein HAT9 (e value: 7.9e-58; % identity: 54.55)
Query:  DSDCNTGLLLGLG-SVSGDDINDSVRSGVAGVNKKKLQVLKFDDDILPSLTLGL----SFVVETATEEGC----SGSPVSSFSNSSGFKRERAAGEDVAE
        D  CNTGL+LGLG S   ++ N ++R            V K +    PSLTL L    S  V T  ++ C    S S VSSFS+    KRER  GE+  E
Subjt:  DSDCNTGLLLGLG-SVSGDDINDSVRSGVAGVNKKKLQVLKFDDDILPSLTLGL----SFVVETATEEGC----SGSPVSSFSNSSGFKRERAAGEDVAE

Query:  ----TEECMKVGEEDEDG-SPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTR
            TE  +    EDE+G S RKKLRLTKQQSA+LE++FK+HS+L+PKQKQ LARQLNLRPRQVEVWFQNRRARTKLKQTE+DCE LKKCCE L +EN R
Subjt:  ----TEECMKVGEEDEDG-SPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTR

Query:  LQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGG------------DASPANN-FSIGSKPQFLKFPFNHPSAAC
        LQKE+QE+K+LKLT  PF M + A+TLT CPSCE    GG  +GG            D S A   FSI SKP F   PF +PSAAC
Subjt:  LQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGG------------DASPANN-FSIGSKPQFLKFPFNHPSAAC

P46604 Homeobox-leucine zipper protein HAT22 (e value: 5.1e-57; % identity: 52.78)
Query:  MDSDCNTGLLLGLG-SVSGDDINDSVRSGVAGVNKKKLQVLKFDDDILPSLTLGL---SFVVETATEEG-------CSGSPVSSFSNSSGFKRER--AAG
        +D  CNTGL+LGLG S + ++ N +++   + V+ + ++       + PSLTL L   S+ ++T    G        S S +SSFS S   KRER  + G
Subjt:  MDSDCNTGLLLGLG-SVSGDDINDSVRSGVAGVNKKKLQVLKFDDDILPSLTLGL---SFVVETATEEG-------CSGSPVSSFSNSSGFKRER--AAG

Query:  EDVAETEE------CMKVGE--EDEDG-SPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCC
        +   E EE      C +V +  +DE+G S RKKLRLTKQQSA+LEDNFK HS+L+PKQKQ LARQLNLRPRQVEVWFQNRRARTKLKQTE+DCE LKKCC
Subjt:  EDVAETEE------CMKVGE--EDEDG-SPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCC

Query:  EKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANN------FSIGSKPQFLKFPFNHPSAAC
        E L +EN RLQKELQ++K+LKL+  PF M + AATLT+CPSCE    GG   GGD +  +       FSI +KP+F   PF +PSAAC
Subjt:  EKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANN------FSIGSKPQFLKFPFNHPSAAC

Q8GRL4 Homeobox-leucine zipper protein HOX19 (e value: 6.2e-47; % identity: 51.02)
Query:  PSLTLGL-------SFVVETATEEGCSGSPVSSFSN-------SSGFKRERAAGEDVAETEECMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLS
        PSLTL L       +    TAT  G  G P  S S+       ++  KRERA   D           ++D+DGS RKKLRLTK+QSA+LED F+EHS+L+
Subjt:  PSLTLGL-------SFVVETATEEGCSGSPVSSFSN-------SSGFKRERAAGEDVAETEECMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLS

Query:  PKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQEIKSLKLTPP---------------PFCMQLQAATLTVCP
        PKQK  LA+QLNLRPRQVEVWFQNRRARTKLKQTE+DCE LK+CCE L EEN RLQ+ELQE+++LK  PP               PF MQL AATLT+CP
Subjt:  PKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQEIKSLKLTPP---------------PFCMQLQAATLTVCP

Query:  SCESSICGGSSSGGDASPANNFSIG---SKPQFLKFPFNHPSAAC
        SCE    GG +S      A+    G   +       PF H SAAC
Subjt:  SCESSICGGSSSGGDASPANNFSIG---SKPQFLKFPFNHPSAAC

Arabidopsis top hits (e value, % identity, alignment)
AT2G22800.1 Homeobox-leucine zipper protein family (e value: 5.6e-59; % identity: 54.55)
Query:  DSDCNTGLLLGLG-SVSGDDINDSVRSGVAGVNKKKLQVLKFDDDILPSLTLGL----SFVVETATEEGC----SGSPVSSFSNSSGFKRERAAGEDVAE
        D  CNTGL+LGLG S   ++ N ++R            V K +    PSLTL L    S  V T  ++ C    S S VSSFS+    KRER  GE+  E
Subjt:  DSDCNTGLLLGLG-SVSGDDINDSVRSGVAGVNKKKLQVLKFDDDILPSLTLGL----SFVVETATEEGC----SGSPVSSFSNSSGFKRERAAGEDVAE

Query:  ----TEECMKVGEEDEDG-SPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTR
            TE  +    EDE+G S RKKLRLTKQQSA+LE++FK+HS+L+PKQKQ LARQLNLRPRQVEVWFQNRRARTKLKQTE+DCE LKKCCE L +EN R
Subjt:  ----TEECMKVGEEDEDG-SPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTR

Query:  LQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGG------------DASPANN-FSIGSKPQFLKFPFNHPSAAC
        LQKE+QE+K+LKLT  PF M + A+TLT CPSCE    GG  +GG            D S A   FSI SKP F   PF +PSAAC
Subjt:  LQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGG------------DASPANN-FSIGSKPQFLKFPFNHPSAAC

AT2G44910.1 homeobox-leucine zipper protein 4 (e value: 1.2e-40; % identity: 54.8)
Query:  VVETATEEGCSGSPVSSFSNSSGFKRE----RAAGEDVAETEECMK----VGEEDEDG----SPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQL
        VV+   E     SP S+ S+ SG KR+    R   E+ AE   C +     G +DEDG      RKKLRL+K Q+ +LE+ FKEHS+L+PKQK  LA+QL
Subjt:  VVETATEEGCSGSPVSSFSNSSGFKRE----RAAGEDVAETEECMK----VGEEDEDG----SPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQL

Query:  NLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCE
        NLR RQVEVWFQNRRARTKLKQTE+DCE LK+CC+ L EEN RLQKE+ E+++LKL+P  +       TLT+CPSCE
Subjt:  NLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCE

AT4G16780.1 homeobox protein 2 (e value: 5.8e-40; % identity: 54.6)
Query:  ETATEEGCSGSPVSSFSNSSGFKRERAAGEDVAETEECMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNR
        E   E+    SP S+ S+S+G + ER   E+  + +    + ++++  + RKKLRL+K QSAILE+ FK+HS+L+PKQKQ LA+QL LR RQVEVWFQNR
Subjt:  ETATEEGCSGSPVSSFSNSSGFKRERAAGEDVAETEECMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNR

Query:  RARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCE
        RARTKLKQTE+DCE L++CCE L EEN RLQKE+ E+++LKL+P  +       TLT+CPSCE
Subjt:  RARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCE

AT4G37790.1 Homeobox-leucine zipper protein family (e value: 3.6e-58; % identity: 52.78)
Query:  MDSDCNTGLLLGLG-SVSGDDINDSVRSGVAGVNKKKLQVLKFDDDILPSLTLGL---SFVVETATEEG-------CSGSPVSSFSNSSGFKRER--AAG
        +D  CNTGL+LGLG S + ++ N +++   + V+ + ++       + PSLTL L   S+ ++T    G        S S +SSFS S   KRER  + G
Subjt:  MDSDCNTGLLLGLG-SVSGDDINDSVRSGVAGVNKKKLQVLKFDDDILPSLTLGL---SFVVETATEEG-------CSGSPVSSFSNSSGFKRER--AAG

Query:  EDVAETEE------CMKVGE--EDEDG-SPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCC
        +   E EE      C +V +  +DE+G S RKKLRLTKQQSA+LEDNFK HS+L+PKQKQ LARQLNLRPRQVEVWFQNRRARTKLKQTE+DCE LKKCC
Subjt:  EDVAETEE------CMKVGE--EDEDG-SPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCC

Query:  EKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANN------FSIGSKPQFLKFPFNHPSAAC
        E L +EN RLQKELQ++K+LKL+  PF M + AATLT+CPSCE    GG   GGD +  +       FSI +KP+F   PF +PSAAC
Subjt:  EKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANN------FSIGSKPQFLKFPFNHPSAAC

AT5G06710.1 homeobox from Arabidopsis thaliana (e value: 1.4e-41; % identity: 52.58)
Query:  LTLGLSFVVETATEE-----GCSGSP----VSSFSNSSGFK----RERAAGEDVAETEE-----CMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSS
        + LG + VVE   EE       S SP     SSF    G K      R+   D+ +  E           +DE+GS RKKLRL+K QSA LED+FKEHS+
Subjt:  LTLGLSFVVETATEE-----GCSGSP----VSSFSNSSGFK----RERAAGEDVAETEE-----CMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSS

Query:  LSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSG
        L+PKQK  LA+QLNLRPRQVEVWFQNRRARTKLKQTE+DCE LK+CCE L EEN RLQKE++E+++LK T  PF MQL A TLT+CPSCE      S++ 
Subjt:  LSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSG

Query:  GDASPANNFSIGS
           S A+N  + +
Subjt:  GDASPANNFSIGS
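
In the alignments above, the unlabeled middle line of each block follows the usual BLAST convention: a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. A rough sketch of recomputing percent identity from those lines follows. The function name percent_identity and its input format are hypothetical, and the result may differ slightly from the reported values depending on how the original tool counted gap columns.

```python
def percent_identity(blocks):
    """Estimate % identity from (query_line, midline) pairs.

    Each query line contributes one alignment column per character
    (gaps '-' included). A column counts as identical when the midline
    holds a letter at that position. Midlines shorter than the query
    (for example after trailing-space stripping) are treated as
    mismatches over the missing columns.
    """
    columns = 0
    identical = 0
    for query, midline in blocks:
        columns += len(query)
        identical += sum(1 for ch in midline[:len(query)] if ch.isalpha())
    if columns == 0:
        return 0.0
    return 100.0 * identical / columns


if __name__ == "__main__":
    # Toy block: four identical columns and one conservative change.
    print(percent_identity([("MDSDC", "MD+DC")]))
```

The leading "Query:  " label and its padding must be removed from each line before it is passed in, so that query and midline positions line up.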


Sequences
CDS sequence
ATGGATTCAGATTGCAACACGGGGCTTCTTCTTGGCCTAGGGAGCGTTTCCGGAGACGATATTAATGATTCTGTGCGGTCCGGTGTAGCCGGTGTTAATAAGAAGAAGCT
GCAGGTTTTGAAGTTCGATGATGATATTTTGCCTTCTTTGACGCTTGGATTGTCGTTTGTTGTCGAGACTGCGACGGAGGAGGGATGTTCGGGTAGTCCGGTTTCGTCGT
TTTCCAACTCGTCGGGGTTTAAGAGGGAACGCGCTGCTGGCGAGGATGTGGCGGAGACGGAGGAGTGTATGAAAGTGGGTGAGGAAGATGAAGATGGAAGTCCGAGGAAG
AAACTTAGATTAACAAAACAACAATCCGCCATTTTAGAGGACAATTTCAAAGAACACTCGAGTCTTAGTCCTAAGCAAAAGCAGGATTTGGCTAGGCAATTAAACCTAAG
GCCAAGACAAGTGGAAGTATGGTTTCAAAACAGAAGAGCCAGAACCAAGCTGAAGCAAACAGAAATGGATTGTGAATTACTGAAGAAATGCTGTGAAAAGCTAAAAGAAG
AAAACACAAGGCTTCAAAAGGAACTTCAAGAAATTAAATCACTCAAACTAACGCCTCCACCGTTCTGCATGCAACTACAAGCCGCCACTCTCACCGTTTGCCCTTCATGT
GAGAGCTCCATTTGTGGCGGCAGCAGCAGCGGCGGTGATGCATCTCCGGCCAATAACTTCTCAATTGGGTCAAAGCCTCAATTTCTCAAGTTCCCATTTAACCATCCATC
GGCGGCTTGTAATTAG
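
The CDS above can be checked against the protein sequence listed on this page by translating it codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1); the names CODON_TABLE and translate are hypothetical helpers, not part of CuGenDB.

```python
BASES = "TCAG"
# Amino acids for the standard genetic code, ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}


def translate(cds, to_stop=True):
    """Translate a DNA coding sequence into one-letter amino acids.

    Whitespace and line breaks are ignored, so the wrapped sequence can
    be pasted as-is. Unknown codons become 'X'. With to_stop=True the
    translation ends at the first stop codon, which is not emitted.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if residue == "*" and to_stop:
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First eight codons of the CDS shown above.
    print(translate("ATGGATTCAGATTGCAACACGGGG"))
```

The first eight codons translate to MDSDCNTG, which matches the start of the protein sequence given for this gene.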
mRNA sequence
GAAACTTCCCCTCTATTTCTCCCCTTTATATTCCCCCCCTCTCAAGCTTACTTTCTTTTCATTTTTCATTTTGTCAAACAATGGATTCAGATTGCAACACGGGGCTTCTT
CTTGGCCTAGGGAGCGTTTCCGGAGACGATATTAATGATTCTGTGCGGTCCGGTGTAGCCGGTGTTAATAAGAAGAAGCTGCAGGTTTTGAAGTTCGATGATGATATTTT
GCCTTCTTTGACGCTTGGATTGTCGTTTGTTGTCGAGACTGCGACGGAGGAGGGATGTTCGGGTAGTCCGGTTTCGTCGTTTTCCAACTCGTCGGGGTTTAAGAGGGAAC
GCGCTGCTGGCGAGGATGTGGCGGAGACGGAGGAGTGTATGAAAGTGGGTGAGGAAGATGAAGATGGAAGTCCGAGGAAGAAACTTAGATTAACAAAACAACAATCCGCC
ATTTTAGAGGACAATTTCAAAGAACACTCGAGTCTTAGTCCTAAGCAAAAGCAGGATTTGGCTAGGCAATTAAACCTAAGGCCAAGACAAGTGGAAGTATGGTTTCAAAA
CAGAAGAGCCAGAACCAAGCTGAAGCAAACAGAAATGGATTGTGAATTACTGAAGAAATGCTGTGAAAAGCTAAAAGAAGAAAACACAAGGCTTCAAAAGGAACTTCAAG
AAATTAAATCACTCAAACTAACGCCTCCACCGTTCTGCATGCAACTACAAGCCGCCACTCTCACCGTTTGCCCTTCATGTGAGAGCTCCATTTGTGGCGGCAGCAGCAGC
GGCGGTGATGCATCTCCGGCCAATAACTTCTCAATTGGGTCAAAGCCTCAATTTCTCAAGTTCCCATTTAACCATCCATCGGCGGCTTGTAATTAGGCGGCAGCCTAATT
TTAATTATAATATCATTAATTAAAATTGACCTGAAGACTGCTATTTTTCCTTGGGGCTTCTTGGTCTATATATATATATATATATATATATATATATATATAACACTGAT
ATTTATGTATTTAATATGTTATTATATGTACAAATATATTGTTGTGTTTTGAACACATTGGAAAAGATTAGGCTTTGTAGAAATCAAAACTATGAGTTATGACGACATAC
TTTGTATTTAAGTTTTCATTTTGGTGATATTAATAGTAAATTTTGTTATTGGTGGTGAA
Protein sequence
MDSDCNTGLLLGLGSVSGDDINDSVRSGVAGVNKKKLQVLKFDDDILPSLTLGLSFVVETATEEGCSGSPVSSFSNSSGFKRERAAGEDVAETEECMKVGEEDEDGSPRK
KLRLTKQQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQEIKSLKLTPPPFCMQLQAATLTVCPSC
ESSICGGSSSGGDASPANNFSIGSKPQFLKFPFNHPSAACN