CuGenDBv2

CSPI04G25100 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI04G25100
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Homeobox-leucine zipper protein family
Genome location: Chr4:22640114..22641861
RNA-Seq Expression: CSPI04G25100
Synteny: CSPI04G25100
Gene Ontology terms:
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR001356 - Homeobox domain
IPR003106 - Leucine zipper, homeobox-associated
IPR009057 - Homeobox-like domain superfamily
IPR017970 - Homeobox, conserved site


Homology
GenBank top hits (e value, %identity, alignment)
KAE8649900.1 hypothetical protein Csa_011929 [Cucumis sativus] | e value: 1.0e-139 | %identity: 99.62
Query:  MDTDCNTGLLLGLGRVSGHNINASVRSELPGLNKKKLQQVLKFDDDILPSLTLGLSFVVDTATEDGCSGSPVSSFSNSSGFKRERAGEEVAETEECMKVG
        MDTDCNTGLLLGLGRVSGHNINASVRSELP LNKKKLQQVLKFDDDILPSLTLGLSFVVDTATEDGCSGSPVSSFSNSSGFKRERAGEEVAETEECMKVG
Subjt:  MDTDCNTGLLLGLGRVSGHNINASVRSELPGLNKKKLQQVLKFDDDILPSLTLGLSFVVDTATEDGCSGSPVSSFSNSSGFKRERAGEEVAETEECMKVG

Query:  EEDEEGSPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQELKSLKL
        EEDEEGSPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQELKSLKL
Subjt:  EEDEEGSPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQELKSLKL

Query:  TPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANNFSIGSKPQFLKFPFNHPSAACN
        TPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANNFSIGSKPQFLKFPFNHPSAACN
Subjt:  TPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANNFSIGSKPQFLKFPFNHPSAACN

KAG6591449.1 Homeobox-leucine zipper protein HAT9, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.2e-100 | %identity: 77.11
Query:  MDTDCNTGLLLGLGRVSGHNINASVRSELPGL-NKKKLQQVLKFDDDILPSLTLGLSFVVD--------TATED----GCSGSPVSSFSNSSGFKRER--
        MD DCNTGLLLGLGR   H  + S+R  +P +   KK  QVLKF DDILPSLTLGLS VV+        TA E+    G SGSPVSSFSNSSGFKRER  
Subjt:  MDTDCNTGLLLGLGRVSGHNINASVRSELPGL-NKKKLQQVLKFDDDILPSLTLGLSFVVD--------TATED----GCSGSPVSSFSNSSGFKRER--

Query:  -AGEEVAETEE-----CMKVGEEDEEGSPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCE
         AGEE  E E       MKV EE+E+GSPRKKLRLTK QSA+LEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCE
Subjt:  -AGEEVAETEE-----CMKVGEEDEEGSPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCE

Query:  KLKEENTRLQKELQELKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGG---DASPANNFSIGSKPQFLKFPFNHPSAAC
        KLKEENT+LQKELQELKSLKLT PPFCMQLQAATLTVCPSCESSICGG   GG   DASPAN FSIGSKP FLKFPFNHPSAAC
Subjt:  KLKEENTRLQKELQELKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGG---DASPANNFSIGSKPQFLKFPFNHPSAAC

XP_004141416.1 homeobox-leucine zipper protein HAT9 [Cucumis sativus] | e value: 1.0e-139 | %identity: 99.62
Query:  MDTDCNTGLLLGLGRVSGHNINASVRSELPGLNKKKLQQVLKFDDDILPSLTLGLSFVVDTATEDGCSGSPVSSFSNSSGFKRERAGEEVAETEECMKVG
        MDTDCNTGLLLGLGRVSGHNINASVRSELP LNKKKLQQVLKFDDDILPSLTLGLSFVVDTATEDGCSGSPVSSFSNSSGFKRERAGEEVAETEECMKVG
Subjt:  MDTDCNTGLLLGLGRVSGHNINASVRSELPGLNKKKLQQVLKFDDDILPSLTLGLSFVVDTATEDGCSGSPVSSFSNSSGFKRERAGEEVAETEECMKVG

Query:  EEDEEGSPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQELKSLKL
        EEDEEGSPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQELKSLKL
Subjt:  EEDEEGSPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQELKSLKL

Query:  TPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANNFSIGSKPQFLKFPFNHPSAACN
        TPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANNFSIGSKPQFLKFPFNHPSAACN
Subjt:  TPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANNFSIGSKPQFLKFPFNHPSAACN

XP_023536403.1 homeobox-leucine zipper protein HAT9-like [Cucurbita pepo subsp. pepo] | e value: 1.1e-101 | %identity: 77.54
Query:  MDTDCNTGLLLGLGRVSGHNINASVRSELPGL-NKKKLQQVLKFDDDILPSLTLGLSFVVD--------TATED----GCSGSPVSSFSNSSGFKRER--
        MD DCNTGLLLGLGR   H  + S+R  +P +   KK  QVLKF DDILPSLTLGLS VVD        TA ++    G SGSPVSSFSNSSGFKRER  
Subjt:  MDTDCNTGLLLGLGRVSGHNINASVRSELPGL-NKKKLQQVLKFDDDILPSLTLGLSFVVD--------TATED----GCSGSPVSSFSNSSGFKRER--

Query:  -AGEEVAETEE-----CMKVGEEDEEGSPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCE
         AGEE AETE       MKV EE+E+GSPRKKLRLTK QSA+LEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCE
Subjt:  -AGEEVAETEE-----CMKVGEEDEEGSPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCE

Query:  KLKEENTRLQKELQELKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGG----DASPANNFSIGSKPQFLKFPFNHPSAAC
        KLKEENT+LQKELQELKSLKLT PPFCMQLQAATLTVCPSCESSICGG   GG    DASPAN FSIGSKP FLKFPFNHPSAAC
Subjt:  KLKEENTRLQKELQELKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGG----DASPANNFSIGSKPQFLKFPFNHPSAAC

XP_038898886.1 homeobox-leucine zipper protein HAT22-like [Benincasa hispida] | e value: 1.1e-104 | %identity: 80.29
Query:  MDTDCNTGLLLGLGRVSG-HNINASVRSELPGLNKKKLQQVLKFDDDILPSLTLGLSFVVD----TATED----GCSGSPVSSFSNSSGFKRER-----A
        M+ DCNTGLLLGLGRVS   ++ + V  E+ G+ KK   QVLKF DDILPSLTLGLS VV+     ATE+    GCSGSPVSSFSNSSGFKRER      
Subjt:  MDTDCNTGLLLGLGRVSG-HNINASVRSELPGLNKKKLQQVLKFDDDILPSLTLGLSFVVD----TATED----GCSGSPVSSFSNSSGFKRER-----A

Query:  GEEVAETEE-----CMKVGEEDEEGSPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKL
        G E AE EE     CMKVGEEDE+GSPRKKLRLTK QSAILEDNFKEHSSLSPKQKQDLA+QLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKL
Subjt:  GEEVAETEE-----CMKVGEEDEEGSPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKL

Query:  KEENTRLQKELQELKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANNFSIGSKPQFLKFPFNHPSAAC
        KEENTRLQKELQELKSLKLT PPFCMQLQAATLTVCPSCESSICGG   GGDASPAN FSI SKPQFLKFPFNHPSAAC
Subjt:  KEENTRLQKELQELKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANNFSIGSKPQFLKFPFNHPSAAC

TrEMBL top hits (e value, %identity, alignment)
A0A0A0L3X3 Homeobox domain-containing protein | e value: 5.0e-140 | %identity: 99.62
Query:  MDTDCNTGLLLGLGRVSGHNINASVRSELPGLNKKKLQQVLKFDDDILPSLTLGLSFVVDTATEDGCSGSPVSSFSNSSGFKRERAGEEVAETEECMKVG
        MDTDCNTGLLLGLGRVSGHNINASVRSELP LNKKKLQQVLKFDDDILPSLTLGLSFVVDTATEDGCSGSPVSSFSNSSGFKRERAGEEVAETEECMKVG
Subjt:  MDTDCNTGLLLGLGRVSGHNINASVRSELPGLNKKKLQQVLKFDDDILPSLTLGLSFVVDTATEDGCSGSPVSSFSNSSGFKRERAGEEVAETEECMKVG

Query:  EEDEEGSPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQELKSLKL
        EEDEEGSPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQELKSLKL
Subjt:  EEDEEGSPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQELKSLKL

Query:  TPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANNFSIGSKPQFLKFPFNHPSAACN
        TPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANNFSIGSKPQFLKFPFNHPSAACN
Subjt:  TPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANNFSIGSKPQFLKFPFNHPSAACN

A0A6A1W3F2 Homeobox-leucine zipper protein HAT22 | e value: 1.5e-64 | %identity: 57.5
Query:  MDTDCNTGLLLGLGRVSGHNINASVRSELPGLNKKKLQQVLKFDDDILPSLTLGLS---FVVDTAT---------------EDGCSGSPVSSFSNSSGFK
        +D  CNTGL L LG  S       +R +  G NK+K +  LK+D  + PSL+LG S   F    AT               +   S S VSSFSNSS  K
Subjt:  MDTDCNTGLLLGLGRVSGHNINASVRSELPGLNKKKLQQVLKFDDDILPSLTLGLS---FVVDTAT---------------EDGCSGSPVSSFSNSSGFK

Query:  RER--AGEEVAETEECMKVGEEDEEGSPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEK
        RE+   GEE    + C +V +EDEE  PRKKLRLTK QSA+LED FKEH++L PKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTE+DCELLKKCCE 
Subjt:  RER--AGEEVAETEECMKVGEEDEEGSPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEK

Query:  LKEENTRLQKELQELKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANNFSIGSKPQFLKFPFNHPSAAC
        L EEN RL+KELQELKSLKL   PF MQL AATLT+CPSCE       S+ GD S  + FS+ +    L  PF HPSAAC
Subjt:  LKEENTRLQKELQELKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANNFSIGSKPQFLKFPFNHPSAAC

A0A6J1C954 homeobox-leucine zipper protein HAT9-like | e value: 4.2e-86 | %identity: 70.31
Query:  MDTDCN--TGLLLGLGRVSGHNINASVRSELPGLNKKKLQQVLKFDDDILPSLTLGLS-----------FVVDTAT------EDGCSGSPVSSFSNSSGF
        MD DC+  TGLLLGLGR S  N    +RS +P ++ KK + VLKF DDILP LTLGLS            +V   T      + G S SPVSSFSNSSG 
Subjt:  MDTDCN--TGLLLGLGRVSGHNINASVRSELPGLNKKKLQQVLKFDDDILPSLTLGLS-----------FVVDTAT------EDGCSGSPVSSFSNSSGF

Query:  KRERA------GEE-VAETEE------CMKVG-EEDEEGSPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQT
        KR+R+      GEE  AE  E        KVG +EDE+GSPRKKLRLTK QSAILED+FKEHSSL+PKQK DLARQL+LRPRQVEVWFQNRRARTKLKQT
Subjt:  KRERA------GEE-VAETEE------CMKVG-EEDEEGSPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQT

Query:  EMDCELLKKCCEKLKEENTRLQKELQELKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANNFSIGSKPQFLKFPFNHPSAAC
        EMDCELLKKCCEKLKEENTRLQKELQELKSLKLT PPFCMQLQAATLTVCPSCE SICGG   GGDASP   FSIGSKP FLKFPFNHPSAAC
Subjt:  EMDCELLKKCCEKLKEENTRLQKELQELKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANNFSIGSKPQFLKFPFNHPSAAC

A0A6J1F6L5 homeobox-leucine zipper protein HAT9-like | e value: 1.0e-100 | %identity: 76.57
Query:  MDTDCNTGLLLGLGRVSGHNINASVRSELPGL-NKKKLQQVLKFDDDILPSLTLGLSFVVD--------TATED----GCSGSPVSSFSNSSGFKRER--
        MD DCNTGLLLGLGR   H  + S+R  +P +   KK  QVLKF DDILPSLTLGLS VVD        TA ++    G SGSPVSSFS+SSGFKRER  
Subjt:  MDTDCNTGLLLGLGRVSGHNINASVRSELPGL-NKKKLQQVLKFDDDILPSLTLGLSFVVD--------TATED----GCSGSPVSSFSNSSGFKRER--

Query:  -AGEEVAETEE-----CMKVGEEDEEGSPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCE
         AGEE AE E       MKV EE+E+GSPRKKLRLTK QSA+LEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCE
Subjt:  -AGEEVAETEE-----CMKVGEEDEEGSPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCE

Query:  KLKEENTRLQKELQELKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGG-----DASPANNFSIGSKPQFLKFPFNHPSAAC
        KLKEENT+LQKELQELKSLKLT PPFCMQLQAATLTVCPSCESSICGG   GG     DASPAN FSIGSKP FLKFPFNHPSAAC
Subjt:  KLKEENTRLQKELQELKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGG-----DASPANNFSIGSKPQFLKFPFNHPSAAC

A0A6J1IHG6 homeobox-leucine zipper protein HAT9-like | e value: 2.3e-100 | %identity: 77.14
Query:  MDTDCNTGLLLGLGR-VSGHNINASVRSELPGLNKKKLQQVLKFDDDILPSLTLGLSFVV-----DTATED----GCSGSPVSSFSNSSGFKRER---AG
        MD DCN GLLLGLGR +  HN    +  ++ G+ KK   QVLKF DDILPSLTLGLS VV     ++A +D    G SGSP SSFSNSSGFKRER   AG
Subjt:  MDTDCNTGLLLGLGR-VSGHNINASVRSELPGLNKKKLQQVLKFDDDILPSLTLGLSFVV-----DTATED----GCSGSPVSSFSNSSGFKRER---AG

Query:  EEVAETEE-----CMKVGEEDEEGSPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLK
        EE AE E       MKV EE+E+GSPRKKLRLTK QSA+LEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLK
Subjt:  EEVAETEE-----CMKVGEEDEEGSPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLK

Query:  EENTRLQKELQELKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGG--DASPANNFSIGSKPQFLKFPFNHPSAAC
        EENT+LQKELQELKSLKLT PPFCMQLQAATLTVCPSCESSICGG   GG  DASPAN FSIGSKP FLKFPFNHPSAAC
Subjt:  EENTRLQKELQELKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGG--DASPANNFSIGSKPQFLKFPFNHPSAAC

SwissProt top hits (e value, %identity, alignment)
A2XE76 Homeobox-leucine zipper protein HOX19 | e value: 1.8e-46 | %identity: 51.84
Query:  PSLTLGL-------SFVVDTATEDGCSGSPVSSFSN-------SSGFKRERAGEEVAETEECMKVG-EEDEEGSPRKKLRLTKHQSAILEDNFKEHSSLS
        PSLTL L       +    TAT  G  G P  S S+       ++  KRERA E   E       G ++D++GS RKKLRLTK QSA+LED F+EHS+L+
Subjt:  PSLTLGL-------SFVVDTATEDGCSGSPVSSFSN-------SSGFKRERAGEEVAETEECMKVG-EEDEEGSPRKKLRLTKHQSAILEDNFKEHSSLS

Query:  PKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQELKSLKLTPP---------------PFCMQLQAATLTVCP
        PKQK  LA+QLNLRPRQVEVWFQNRRARTKLKQTE+DCE LK+CCE L EEN RLQ+ELQEL++LK  PP               PF MQL AATLT+CP
Subjt:  PKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQELKSLKLTPP---------------PFCMQLQAATLTVCP

Query:  SCESSICGGSSSGGDASPANNFSIG---SKPQFLKFPFNHPSAAC
        SCE    GG +S      A+    G   +       PF H SAAC
Subjt:  SCESSICGGSSSGGDASPANNFSIG---SKPQFLKFPFNHPSAAC

P46603 Homeobox-leucine zipper protein HAT9 | e value: 3.0e-57 | %identity: 54.7
Query:  DTDCNTGLLLGLG-RVSGHNINASVRSELPGLNKKKLQQVLKFDDDILPSLTLGL----SFVVDTATEDGC----SGSPVSSFSNSSGFKRERAG-----
        D  CNTGL+LGLG     +N N+++R             V K +    PSLTL L    S  V T  +  C    S S VSSFS+    KRER G     
Subjt:  DTDCNTGLLLGLG-RVSGHNINASVRSELPGLNKKKLQQVLKFDDDILPSLTLGL----SFVVDTATEDGC----SGSPVSSFSNSSGFKRERAG-----

Query:  EEVAETEECMKVGEEDEEG-SPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENT
        EE   TE  +    EDEEG S RKKLRLTK QSA+LE++FK+HS+L+PKQKQ LARQLNLRPRQVEVWFQNRRARTKLKQTE+DCE LKKCCE L +EN 
Subjt:  EEVAETEECMKVGEEDEEG-SPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENT

Query:  RLQKELQELKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGG------------DASPANN-FSIGSKPQFLKFPFNHPSAAC
        RLQKE+QELK+LKLT  PF M + A+TLT CPSCE    GG  +GG            D S A   FSI SKP F   PF +PSAAC
Subjt:  RLQKELQELKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGG------------DASPANN-FSIGSKPQFLKFPFNHPSAAC

P46604 Homeobox-leucine zipper protein HAT22 | e value: 3.3e-56 | %identity: 51.74
Query:  MDTDCNTGLLLGLG-RVSGHNINASVRSELPGLNKKKLQQVLKFDDDILPSLTLGL---SFVVDTATEDG-------CSGSPVSSFSNS--------SGF
        +D  CNTGL+LGLG   + +N N +++     ++ + ++        + PSLTL L   S+ + T    G        S S +SSFS+         SG 
Subjt:  MDTDCNTGLLLGLG-RVSGHNINASVRSELPGLNKKKLQQVLKFDDDILPSLTLGL---SFVVDTATEDG-------CSGSPVSSFSNS--------SGF

Query:  KRERAGEEVAETEECMKVGE--EDEEG-SPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCC
          E   EE  E   C +V +  +DEEG S RKKLRLTK QSA+LEDNFK HS+L+PKQKQ LARQLNLRPRQVEVWFQNRRARTKLKQTE+DCE LKKCC
Subjt:  KRERAGEEVAETEECMKVGE--EDEEG-SPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCC

Query:  EKLKEENTRLQKELQELKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANN------FSIGSKPQFLKFPFNHPSAAC
        E L +EN RLQKELQ+LK+LKL+  PF M + AATLT+CPSCE    GG   GGD +  +       FSI +KP+F   PF +PSAAC
Subjt:  EKLKEENTRLQKELQELKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANN------FSIGSKPQFLKFPFNHPSAAC

P46665 Homeobox-leucine zipper protein HAT14 | e value: 2.6e-40 | %identity: 60.12
Query:  KRERAGEEVAETEECMKVGEEDEEGSPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKL
        KR+   E             +DE GS RKKLRL+K QSA LED+FKEHS+L+PKQK  LA+QLNLRPRQVEVWFQNRRARTKLKQTE+DCE LK+CCE L
Subjt:  KRERAGEEVAETEECMKVGEEDEEGSPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKL

Query:  KEENTRLQKELQELKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANNFSIGS
         EEN RLQKE++EL++LK T  PF MQL A TLT+CPSCE      S++    S A+N  + +
Subjt:  KEENTRLQKELQELKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANNFSIGS

Q8GRL4 Homeobox-leucine zipper protein HOX19 | e value: 1.8e-46 | %identity: 51.84
Query:  PSLTLGL-------SFVVDTATEDGCSGSPVSSFSN-------SSGFKRERAGEEVAETEECMKVG-EEDEEGSPRKKLRLTKHQSAILEDNFKEHSSLS
        PSLTL L       +    TAT  G  G P  S S+       ++  KRERA E   E       G ++D++GS RKKLRLTK QSA+LED F+EHS+L+
Subjt:  PSLTLGL-------SFVVDTATEDGCSGSPVSSFSN-------SSGFKRERAGEEVAETEECMKVG-EEDEEGSPRKKLRLTKHQSAILEDNFKEHSSLS

Query:  PKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQELKSLKLTPP---------------PFCMQLQAATLTVCP
        PKQK  LA+QLNLRPRQVEVWFQNRRARTKLKQTE+DCE LK+CCE L EEN RLQ+ELQEL++LK  PP               PF MQL AATLT+CP
Subjt:  PKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQELKSLKLTPP---------------PFCMQLQAATLTVCP

Query:  SCESSICGGSSSGGDASPANNFSIG---SKPQFLKFPFNHPSAAC
        SCE    GG +S      A+    G   +       PF H SAAC
Subjt:  SCESSICGGSSSGGDASPANNFSIG---SKPQFLKFPFNHPSAAC

Arabidopsis top hits (e value, %identity, alignment)
AT2G22800.1 Homeobox-leucine zipper protein family | e value: 2.1e-58 | %identity: 54.7
Query:  DTDCNTGLLLGLG-RVSGHNINASVRSELPGLNKKKLQQVLKFDDDILPSLTLGL----SFVVDTATEDGC----SGSPVSSFSNSSGFKRERAG-----
        D  CNTGL+LGLG     +N N+++R             V K +    PSLTL L    S  V T  +  C    S S VSSFS+    KRER G     
Subjt:  DTDCNTGLLLGLG-RVSGHNINASVRSELPGLNKKKLQQVLKFDDDILPSLTLGL----SFVVDTATEDGC----SGSPVSSFSNSSGFKRERAG-----

Query:  EEVAETEECMKVGEEDEEG-SPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENT
        EE   TE  +    EDEEG S RKKLRLTK QSA+LE++FK+HS+L+PKQKQ LARQLNLRPRQVEVWFQNRRARTKLKQTE+DCE LKKCCE L +EN 
Subjt:  EEVAETEECMKVGEEDEEG-SPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENT

Query:  RLQKELQELKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGG------------DASPANN-FSIGSKPQFLKFPFNHPSAAC
        RLQKE+QELK+LKLT  PF M + A+TLT CPSCE    GG  +GG            D S A   FSI SKP F   PF +PSAAC
Subjt:  RLQKELQELKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGG------------DASPANN-FSIGSKPQFLKFPFNHPSAAC

AT2G44910.1 homeobox-leucine zipper protein 4 | e value: 5.8e-40 | %identity: 55.93
Query:  VVDTATEDGCSGSPVSSFSNSSGFKRE----RAGEE-VAETEECMK----VGEEDEEG----SPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQL
        VVD   E     SP S+ S+ SG KR+    R G+E  AE   C +     G +DE+G      RKKLRL+K Q+ +LE+ FKEHS+L+PKQK  LA+QL
Subjt:  VVDTATEDGCSGSPVSSFSNSSGFKRE----RAGEE-VAETEECMK----VGEEDEEG----SPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQL

Query:  NLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQELKSLKLTPPPFCMQLQAATLTVCPSCE
        NLR RQVEVWFQNRRARTKLKQTE+DCE LK+CC+ L EEN RLQKE+ EL++LKL+P  +       TLT+CPSCE
Subjt:  NLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQELKSLKLTPPPFCMQLQAATLTVCPSCE

AT4G16780.1 homeobox protein 2 | e value: 2.0e-40 | %identity: 57.59
Query:  EDGCSGSPVSSFSNSSGFKRERAGEEVAETEECMKVGEEDEEGSPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTK
        ED    SP S+ S+S+G + ER  EE  + +    + ++++  + RKKLRL+K QSAILE+ FK+HS+L+PKQKQ LA+QL LR RQVEVWFQNRRARTK
Subjt:  EDGCSGSPVSSFSNSSGFKRERAGEEVAETEECMKVGEEDEEGSPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTK

Query:  LKQTEMDCELLKKCCEKLKEENTRLQKELQELKSLKLTPPPFCMQLQAATLTVCPSCE
        LKQTE+DCE L++CCE L EEN RLQKE+ EL++LKL+P  +       TLT+CPSCE
Subjt:  LKQTEMDCELLKKCCEKLKEENTRLQKELQELKSLKLTPPPFCMQLQAATLTVCPSCE

AT4G37790.1 Homeobox-leucine zipper protein family | e value: 2.4e-57 | %identity: 51.74
Query:  MDTDCNTGLLLGLG-RVSGHNINASVRSELPGLNKKKLQQVLKFDDDILPSLTLGL---SFVVDTATEDG-------CSGSPVSSFSNS--------SGF
        +D  CNTGL+LGLG   + +N N +++     ++ + ++        + PSLTL L   S+ + T    G        S S +SSFS+         SG 
Subjt:  MDTDCNTGLLLGLG-RVSGHNINASVRSELPGLNKKKLQQVLKFDDDILPSLTLGL---SFVVDTATEDG-------CSGSPVSSFSNS--------SGF

Query:  KRERAGEEVAETEECMKVGE--EDEEG-SPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCC
          E   EE  E   C +V +  +DEEG S RKKLRLTK QSA+LEDNFK HS+L+PKQKQ LARQLNLRPRQVEVWFQNRRARTKLKQTE+DCE LKKCC
Subjt:  KRERAGEEVAETEECMKVGE--EDEEG-SPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCC

Query:  EKLKEENTRLQKELQELKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANN------FSIGSKPQFLKFPFNHPSAAC
        E L +EN RLQKELQ+LK+LKL+  PF M + AATLT+CPSCE    GG   GGD +  +       FSI +KP+F   PF +PSAAC
Subjt:  EKLKEENTRLQKELQELKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANN------FSIGSKPQFLKFPFNHPSAAC

AT5G06710.1 homeobox from Arabidopsis thaliana | e value: 1.8e-41 | %identity: 60.12
Query:  KRERAGEEVAETEECMKVGEEDEEGSPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKL
        KR+   E             +DE GS RKKLRL+K QSA LED+FKEHS+L+PKQK  LA+QLNLRPRQVEVWFQNRRARTKLKQTE+DCE LK+CCE L
Subjt:  KRERAGEEVAETEECMKVGEEDEEGSPRKKLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKL

Query:  KEENTRLQKELQELKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANNFSIGS
         EEN RLQKE++EL++LK T  PF MQL A TLT+CPSCE      S++    S A+N  + +
Subjt:  KEENTRLQKELQELKSLKLTPPPFCMQLQAATLTVCPSCESSICGGSSSGGDASPANNFSIGS
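
The %identity values in the hit tables above can be approximated from the unlabeled middle line of each alignment. The following is a minimal sketch, assuming the usual BLAST-style convention: a letter in the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap column. The function name `percent_identity` is hypothetical and not part of CuGenDB. Under that assumption, the first GenBank hit appears to have a single blank column across the 261-residue query, and 260/261 rounds to the listed 99.62.

```python
def percent_identity(midline: str) -> float:
    """Percent identity from a BLAST-style alignment middle line.

    Assumption: letters mark identical residues, '+' marks a
    conservative substitution, and a space marks a mismatch or gap.
    """
    if not midline:
        raise ValueError("empty alignment")
    # Count only the columns where the residue letter is repeated.
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(midline), 2)


# Short excerpt from the first GenBank hit: one blank column out of nine.
excerpt = "SELP LNKK"
print(percent_identity(excerpt))
```

When an alignment is split over several blocks, the middle lines need to be concatenated first, with the display indent removed but any blank columns inside the alignment kept, since those blanks are what lowers the score.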


Sequences
CDS sequence
ATGGATACAGATTGCAATACGGGGCTTCTTCTTGGCCTAGGCAGGGTTTCCGGACACAATATTAATGCTTCTGTGCGCTCCGAACTACCCGGTCTTAATAAGAAGAAGCT
GCAGCAGGTTTTGAAGTTCGATGATGATATTCTGCCTTCTTTGACTCTTGGGTTGTCGTTTGTTGTCGACACCGCGACGGAGGACGGCTGTTCGGGTAGTCCGGTTTCGT
CGTTTTCCAACTCGTCGGGGTTTAAGAGGGAACGTGCTGGGGAAGAGGTGGCGGAGACGGAGGAGTGTATGAAAGTGGGTGAGGAAGATGAAGAGGGAAGTCCGAGGAAG
AAACTTAGATTAACAAAACACCAATCCGCCATTTTGGAGGACAATTTCAAAGAACACTCGAGTCTTAGTCCTAAGCAAAAGCAGGATTTGGCTAGGCAATTAAACCTAAG
GCCAAGACAAGTGGAAGTATGGTTTCAAAACAGAAGAGCAAGAACCAAGCTGAAGCAAACAGAAATGGATTGTGAATTACTGAAGAAATGCTGTGAAAAGCTAAAAGAAG
AAAACACAAGGCTTCAAAAGGAACTTCAAGAACTTAAATCACTCAAACTAACGCCTCCACCGTTCTGCATGCAACTACAAGCCGCCACTCTCACCGTTTGCCCTTCATGT
GAGAGCTCCATCTGCGGCGGCAGCAGCAGCGGCGGTGATGCATCTCCGGCCAATAACTTCTCAATTGGGTCGAAGCCTCAATTTCTCAAATTCCCATTTAACCATCCATC
GGCGGCTTGTAATTAG
mRNA sequence
CCATTTTGTGTTTTTTTTCACTTTGTCAAACCATGGATACAGATTGCAATACGGGGCTTCTTCTTGGCCTAGGCAGGGTTTCCGGACACAATATTAATGCTTCTGTGCGC
TCCGAACTACCCGGTCTTAATAAGAAGAAGCTGCAGCAGGTTTTGAAGTTCGATGATGATATTCTGCCTTCTTTGACTCTTGGGTTGTCGTTTGTTGTCGACACCGCGAC
GGAGGACGGCTGTTCGGGTAGTCCGGTTTCGTCGTTTTCCAACTCGTCGGGGTTTAAGAGGGAACGTGCTGGGGAAGAGGTGGCGGAGACGGAGGAGTGTATGAAAGTGG
GTGAGGAAGATGAAGAGGGAAGTCCGAGGAAGAAACTTAGATTAACAAAACACCAATCCGCCATTTTGGAGGACAATTTCAAAGAACACTCGAGTCTTAGTCCTAAGCAA
AAGCAGGATTTGGCTAGGCAATTAAACCTAAGGCCAAGACAAGTGGAAGTATGGTTTCAAAACAGAAGAGCAAGAACCAAGCTGAAGCAAACAGAAATGGATTGTGAATT
ACTGAAGAAATGCTGTGAAAAGCTAAAAGAAGAAAACACAAGGCTTCAAAAGGAACTTCAAGAACTTAAATCACTCAAACTAACGCCTCCACCGTTCTGCATGCAACTAC
AAGCCGCCACTCTCACCGTTTGCCCTTCATGTGAGAGCTCCATCTGCGGCGGCAGCAGCAGCGGCGGTGATGCATCTCCGGCCAATAACTTCTCAATTGGGTCGAAGCCT
CAATTTCTCAAATTCCCATTTAACCATCCATCGGCGGCTTGTAATTAGGCGGCAGCCTAATTTTAATTATAATATCATTAATTAAAATTGACCTGAAGACTGCTATTTTT
CCTTGGGGCTTCT
Protein sequence
MDTDCNTGLLLGLGRVSGHNINASVRSELPGLNKKKLQQVLKFDDDILPSLTLGLSFVVDTATEDGCSGSPVSSFSNSSGFKRERAGEEVAETEECMKVGEEDEEGSPRK
KLRLTKHQSAILEDNFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQELKSLKLTPPPFCMQLQAATLTVCPSC
ESSICGGSSSGGDASPANNFSIGSKPQFLKFPFNHPSAACN
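
The protein sequence above should correspond to a translation of the CDS sequence. As a rough consistency check, the sketch below translates a CDS with the standard genetic code and stops at the first stop codon. It assumes the standard nuclear code applies to this gene; `CODON_TABLE` and `translate` are hypothetical names chosen for illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS (A/C/G/T only) up to the first stop codon."""
    # Remove line breaks so a wrapped sequence can be pasted directly.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons and last four codons of the CDS listed above.
print(translate("ATGGATACAGATTGCAATACGGGG"))
print(translate("GCTTGTAATTAG"))
```

The two excerpts should give the start (`MDTDCNTG`) and the end (`ACN`) of the protein sequence listed above. Passing the full CDS, with its line breaks, allows the same comparison over the whole sequence.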