CuGenDBv2

Cla97C10G191810 (gene) of Watermelon (97103) v2.5 genome

Gene ID: Cla97C10G191810
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Homeobox-leucine zipper protein family
Genome location: Cla97Chr10:15937459..15938526
RNA-Seq Expression: Cla97C10G191810
Synteny: Cla97C10G191810
Gene Ontology terms:
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR001356 - Homeobox domain
IPR003106 - Leucine zipper, homeobox-associated
IPR009057 - Homeobox-like domain superfamily
IPR017970 - Homeobox, conserved site


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0060358.1 homeobox-leucine zipper protein HAT22-like [Cucumis melo var. makuwa] | e-value: 5.5e-99 | identity: 72.95%
Query:  MGLDDFCQTGLVLGLGLS---NDQR--LKKKPTPCSSSSLDFEPCALTLGFSGTGGGSETHQKVIDVNNKMMMINNNNSDVGVHHLHRQEASTHSSSAVC
        MG DDF +TGLVLGLGLS   +DQR  LKKKP PCSSSSLDFEPC LTLGFS  GGG + H+KVID               GV HL+RQ  S H SSAVC
Subjt:  MGLDDFCQTGLVLGLGLS---NDQR--LKKKPTPCSSSSLDFEPCALTLGFSGTGGGSETHQKVIDVNNKMMMINNNNSDVGVHHLHRQEASTHSSSAVC

Query:  SSFSGGGRVKRERDLSSEEVQLDRLSSRVSDEDEDG-SNARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTE
        SSFS  G+VKRERDLSSEEV+L+R+  RV+DED+DG +N RKKLRLSKQQ+ALLE+SFKQNSTLNPKQKQ LARQLNL PRQVEVWFQNRRARTK+KQTE
Subjt:  SSFSGGGRVKRERDLSSEEVQLDRLSSRVSDEDEDG-SNARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTE

Query:  VDCELLKKCCETLTDENRRLQKELQELKSLKLAQPLYMQMPAATLTICPSCERLGSGG--------GASKPKFSMAPTPHFYNPFSSPSAAC
        VDCELLKKCCETLTDENRRLQKE+QELK++KLA+P+YMQM  ATLTICPSCER+G+GG          SKPKFSM P P FYNPFS+PSAAC
Subjt:  VDCELLKKCCETLTDENRRLQKELQELKSLKLAQPLYMQMPAATLTICPSCERLGSGG--------GASKPKFSMAPTPHFYNPFSSPSAAC

XP_004149840.2 homeobox-leucine zipper protein HAT22 [Cucumis sativus] | e-value: 6.5e-100 | identity: 73.63%
Query:  MGLDDFCQTGLVLGLGLS---NDQR--LKKKPTPCSSSSLDFEPCALTLGFSGTGGGSETHQKVIDVNNKMMMINNNNSDVGVHHLHRQEASTHSSSAVC
        MG DDF +TGLVLGLGLS   +DQR  LKKKP PCSSSSLDFEPC LTLGFS  GGG +TH+KVID              VG HHL+RQ AS H SSAVC
Subjt:  MGLDDFCQTGLVLGLGLS---NDQR--LKKKPTPCSSSSLDFEPCALTLGFSGTGGGSETHQKVIDVNNKMMMINNNNSDVGVHHLHRQEASTHSSSAVC

Query:  SSFSGGGRVKRERDLSSEEVQLDRLSSRVSDEDED-GSNARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTE
        SSFS  G+VKRERDLSSEEV+L+R   RVSDED+D  +N RKKLRLSKQQ+ALLE+SFKQNSTLNPKQKQ LARQLNL PRQVEVWFQNRRARTK+KQTE
Subjt:  SSFSGGGRVKRERDLSSEEVQLDRLSSRVSDEDED-GSNARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTE

Query:  VDCELLKKCCETLTDENRRLQKELQELKSLKLAQPLYMQMPAATLTICPSCERLGSGGGAS--------KPKFSMAPTPHFYNPFSSPSAAC
        VDCELLKKCCETLTDENRRLQKE+QELK++KLA+P+YMQM  ATLTICPSCER+G+GG           KPKFSM P P FYNPFS+PSAAC
Subjt:  VDCELLKKCCETLTDENRRLQKELQELKSLKLAQPLYMQMPAATLTICPSCERLGSGGGAS--------KPKFSMAPTPHFYNPFSSPSAAC

XP_008450165.1 PREDICTED: homeobox-leucine zipper protein HAT22-like [Cucumis melo] | e-value: 2.5e-99 | identity: 73.29%
Query:  MGLDDFCQTGLVLGLGLS---NDQR--LKKKPTPCSSSSLDFEPCALTLGFSGTGGGSETHQKVIDVNNKMMMINNNNSDVGVHHLHRQEASTHSSSAVC
        MG DDF +TGLVLGLGLS   +DQR  LKKKP PCSSSSLDFEPC LTLGFS  GGG + H+KVID               GV HL+RQ  S H SSAVC
Subjt:  MGLDDFCQTGLVLGLGLS---NDQR--LKKKPTPCSSSSLDFEPCALTLGFSGTGGGSETHQKVIDVNNKMMMINNNNSDVGVHHLHRQEASTHSSSAVC

Query:  SSFSGGGRVKRERDLSSEEVQLDRLSSRVSDEDEDG-SNARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTE
        SSFS  G+VKRERDLSSEEV+L+R+  RVSDED+DG +N RKKLRLSKQQ+ALLE+SFKQNSTLNPKQKQ LARQLNL PRQVEVWFQNRRARTK+KQTE
Subjt:  SSFSGGGRVKRERDLSSEEVQLDRLSSRVSDEDEDG-SNARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTE

Query:  VDCELLKKCCETLTDENRRLQKELQELKSLKLAQPLYMQMPAATLTICPSCERLGSGG--------GASKPKFSMAPTPHFYNPFSSPSAAC
        VDCELLKKCCETLTDENRRLQKE+QELK++KLA+P+YMQM  ATLTICPSCER+G+GG          SKPKFSM P P FYNPFS+PSAAC
Subjt:  VDCELLKKCCETLTDENRRLQKELQELKSLKLAQPLYMQMPAATLTICPSCERLGSGG--------GASKPKFSMAPTPHFYNPFSSPSAAC

XP_022136472.1 homeobox-leucine zipper protein HAT22-like [Momordica charantia] | e-value: 4.4e-96 | identity: 72.41%
Query:  MGLDDFCQTGLVLGLGLS--------NDQRLKKKPTPCSSSSLDFEPCALTLGFSGTGGGSETHQKVIDVNNKMMMINNNNSDVGVHHLHRQEASTHSSS
        MG DD   TGLVLGLGLS        +   L KKP PC SSSLDF+PC LTLGFS    G   ++K+ D                 HHL+RQ AS HSS+
Subjt:  MGLDDFCQTGLVLGLGLS--------NDQRLKKKPTPCSSSSLDFEPCALTLGFSGTGGGSETHQKVIDVNNKMMMINNNNSDVGVHHLHRQEASTHSSS

Query:  AVCSSFSGGG---RVKRERDLSSEEVQLDRLSSRVSDEDEDGSNARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTK
           SSFSGGG   RVKRERDLSSEEV L+R+SSR+SDEDEDGSN RKKLRLS++Q+ALLE+SFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTK
Subjt:  AVCSSFSGGG---RVKRERDLSSEEVQLDRLSSRVSDEDEDGSNARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTK

Query:  LKQTEVDCELLKKCCETLTDENRRLQKELQELKSLKLAQPLYMQMPAATLTICPSCERLGSG-GGASKPKFSMAPTPHFYNPFSSPSAAC
        LKQTEVDCE LKKCCETLTDENRRLQKELQELK+LKLAQPLYM MPAATLT+CPSCER+G    GASK KFSMAP PHFYNPFS+PSAAC
Subjt:  LKQTEVDCELLKKCCETLTDENRRLQKELQELKSLKLAQPLYMQMPAATLTICPSCERLGSG-GGASKPKFSMAPTPHFYNPFSSPSAAC

XP_038903319.1 homeobox-leucine zipper protein HAT22-like [Benincasa hispida] | e-value: 3.3e-136 | identity: 92.5%
Query:  MGLDDFCQTGLVLGLGLSNDQRLKKKPTPCSSSSLDFEPCALTLGFSGTGGGSETHQKVIDVNNKMMMINNNNSDVGVHHLHRQEASTHSSSAVCSSFSG
        MGLDDF QTGLVLGLGLSNDQRLKKKP PCSSSSLDFEPCALTLGFSG+GG   THQKVIDVNNKMMMI NNNSDVGVHHLHRQEASTHSSSAVCSSFSG
Subjt:  MGLDDFCQTGLVLGLGLSNDQRLKKKPTPCSSSSLDFEPCALTLGFSGTGGGSETHQKVIDVNNKMMMINNNNSDVGVHHLHRQEASTHSSSAVCSSFSG

Query:  GGRVKRERDLSSEEVQLDRLSSRVSDEDEDGSNARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCELL
        GGRVKRERDLSS+EV+LDR+SSRVSDEDEDGSN RKKLRLSKQQ+ALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCELL
Subjt:  GGRVKRERDLSSEEVQLDRLSSRVSDEDEDGSNARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCELL

Query:  KKCCETLTDENRRLQKELQELKSLKLAQPLYMQMPAATLTICPSCERLGSG--GGASKPKFSMAPTPHFYNPFSSPSAAC
        KKCCETLTDENRRLQKELQELK+LKLAQP+YMQ+PAATLTICPSCER+G G  GGASKPKFSMAP PHFYNPFS+PSAAC
Subjt:  KKCCETLTDENRRLQKELQELKSLKLAQPLYMQMPAATLTICPSCERLGSG--GGASKPKFSMAPTPHFYNPFSSPSAAC

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LBR7 Homeobox domain-containing protein | e-value: 3.2e-100 | identity: 73.63%
Query:  MGLDDFCQTGLVLGLGLS---NDQR--LKKKPTPCSSSSLDFEPCALTLGFSGTGGGSETHQKVIDVNNKMMMINNNNSDVGVHHLHRQEASTHSSSAVC
        MG DDF +TGLVLGLGLS   +DQR  LKKKP PCSSSSLDFEPC LTLGFS  GGG +TH+KVID              VG HHL+RQ AS H SSAVC
Subjt:  MGLDDFCQTGLVLGLGLS---NDQR--LKKKPTPCSSSSLDFEPCALTLGFSGTGGGSETHQKVIDVNNKMMMINNNNSDVGVHHLHRQEASTHSSSAVC

Query:  SSFSGGGRVKRERDLSSEEVQLDRLSSRVSDEDED-GSNARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTE
        SSFS  G+VKRERDLSSEEV+L+R   RVSDED+D  +N RKKLRLSKQQ+ALLE+SFKQNSTLNPKQKQ LARQLNL PRQVEVWFQNRRARTK+KQTE
Subjt:  SSFSGGGRVKRERDLSSEEVQLDRLSSRVSDEDED-GSNARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTE

Query:  VDCELLKKCCETLTDENRRLQKELQELKSLKLAQPLYMQMPAATLTICPSCERLGSGGGAS--------KPKFSMAPTPHFYNPFSSPSAAC
        VDCELLKKCCETLTDENRRLQKE+QELK++KLA+P+YMQM  ATLTICPSCER+G+GG           KPKFSM P P FYNPFS+PSAAC
Subjt:  VDCELLKKCCETLTDENRRLQKELQELKSLKLAQPLYMQMPAATLTICPSCERLGSGGGAS--------KPKFSMAPTPHFYNPFSSPSAAC

A0A1S3BNL6 homeobox-leucine zipper protein HAT22-like | e-value: 1.2e-99 | identity: 73.29%
Query:  MGLDDFCQTGLVLGLGLS---NDQR--LKKKPTPCSSSSLDFEPCALTLGFSGTGGGSETHQKVIDVNNKMMMINNNNSDVGVHHLHRQEASTHSSSAVC
        MG DDF +TGLVLGLGLS   +DQR  LKKKP PCSSSSLDFEPC LTLGFS  GGG + H+KVID               GV HL+RQ  S H SSAVC
Subjt:  MGLDDFCQTGLVLGLGLS---NDQR--LKKKPTPCSSSSLDFEPCALTLGFSGTGGGSETHQKVIDVNNKMMMINNNNSDVGVHHLHRQEASTHSSSAVC

Query:  SSFSGGGRVKRERDLSSEEVQLDRLSSRVSDEDEDG-SNARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTE
        SSFS  G+VKRERDLSSEEV+L+R+  RVSDED+DG +N RKKLRLSKQQ+ALLE+SFKQNSTLNPKQKQ LARQLNL PRQVEVWFQNRRARTK+KQTE
Subjt:  SSFSGGGRVKRERDLSSEEVQLDRLSSRVSDEDEDG-SNARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTE

Query:  VDCELLKKCCETLTDENRRLQKELQELKSLKLAQPLYMQMPAATLTICPSCERLGSGG--------GASKPKFSMAPTPHFYNPFSSPSAAC
        VDCELLKKCCETLTDENRRLQKE+QELK++KLA+P+YMQM  ATLTICPSCER+G+GG          SKPKFSM P P FYNPFS+PSAAC
Subjt:  VDCELLKKCCETLTDENRRLQKELQELKSLKLAQPLYMQMPAATLTICPSCERLGSGG--------GASKPKFSMAPTPHFYNPFSSPSAAC

A0A5D3DEZ1 Homeobox-leucine zipper protein HAT22-like | e-value: 2.7e-99 | identity: 72.95%
Query:  MGLDDFCQTGLVLGLGLS---NDQR--LKKKPTPCSSSSLDFEPCALTLGFSGTGGGSETHQKVIDVNNKMMMINNNNSDVGVHHLHRQEASTHSSSAVC
        MG DDF +TGLVLGLGLS   +DQR  LKKKP PCSSSSLDFEPC LTLGFS  GGG + H+KVID               GV HL+RQ  S H SSAVC
Subjt:  MGLDDFCQTGLVLGLGLS---NDQR--LKKKPTPCSSSSLDFEPCALTLGFSGTGGGSETHQKVIDVNNKMMMINNNNSDVGVHHLHRQEASTHSSSAVC

Query:  SSFSGGGRVKRERDLSSEEVQLDRLSSRVSDEDEDG-SNARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTE
        SSFS  G+VKRERDLSSEEV+L+R+  RV+DED+DG +N RKKLRLSKQQ+ALLE+SFKQNSTLNPKQKQ LARQLNL PRQVEVWFQNRRARTK+KQTE
Subjt:  SSFSGGGRVKRERDLSSEEVQLDRLSSRVSDEDEDG-SNARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTE

Query:  VDCELLKKCCETLTDENRRLQKELQELKSLKLAQPLYMQMPAATLTICPSCERLGSGG--------GASKPKFSMAPTPHFYNPFSSPSAAC
        VDCELLKKCCETLTDENRRLQKE+QELK++KLA+P+YMQM  ATLTICPSCER+G+GG          SKPKFSM P P FYNPFS+PSAAC
Subjt:  VDCELLKKCCETLTDENRRLQKELQELKSLKLAQPLYMQMPAATLTICPSCERLGSGG--------GASKPKFSMAPTPHFYNPFSSPSAAC

A0A6J1C5L0 homeobox-leucine zipper protein HAT22-like | e-value: 2.1e-96 | identity: 72.41%
Query:  MGLDDFCQTGLVLGLGLS--------NDQRLKKKPTPCSSSSLDFEPCALTLGFSGTGGGSETHQKVIDVNNKMMMINNNNSDVGVHHLHRQEASTHSSS
        MG DD   TGLVLGLGLS        +   L KKP PC SSSLDF+PC LTLGFS    G   ++K+ D                 HHL+RQ AS HSS+
Subjt:  MGLDDFCQTGLVLGLGLS--------NDQRLKKKPTPCSSSSLDFEPCALTLGFSGTGGGSETHQKVIDVNNKMMMINNNNSDVGVHHLHRQEASTHSSS

Query:  AVCSSFSGGG---RVKRERDLSSEEVQLDRLSSRVSDEDEDGSNARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTK
           SSFSGGG   RVKRERDLSSEEV L+R+SSR+SDEDEDGSN RKKLRLS++Q+ALLE+SFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTK
Subjt:  AVCSSFSGGG---RVKRERDLSSEEVQLDRLSSRVSDEDEDGSNARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTK

Query:  LKQTEVDCELLKKCCETLTDENRRLQKELQELKSLKLAQPLYMQMPAATLTICPSCERLGSG-GGASKPKFSMAPTPHFYNPFSSPSAAC
        LKQTEVDCE LKKCCETLTDENRRLQKELQELK+LKLAQPLYM MPAATLT+CPSCER+G    GASK KFSMAP PHFYNPFS+PSAAC
Subjt:  LKQTEVDCELLKKCCETLTDENRRLQKELQELKSLKLAQPLYMQMPAATLTICPSCERLGSG-GGASKPKFSMAPTPHFYNPFSSPSAAC

A0A6J1FR71 homeobox-leucine zipper protein HAT22-like | e-value: 7.5e-94 | identity: 69.62%
Query:  MGLDDFCQTGLVLGLGLS--------NDQRLKKKPTPCSSSSLDFEPCALTLGFSGTGGGSETHQKVIDVNNKMMMINNNNSDVGVHHLHRQEASTHSSS
        MGLDD  QT LVLGLG+S        N +  KKK    + +SL+FEPCALTLGFSG       H                      HHL+RQ AS H SS
Subjt:  MGLDDFCQTGLVLGLGLS--------NDQRLKKKPTPCSSSSLDFEPCALTLGFSGTGGGSETHQKVIDVNNKMMMINNNNSDVGVHHLHRQEASTHSSS

Query:  AVCSSFS--GGGRVKRERDLSSEEVQLDRLSSRVSDEDEDGSNARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKL
        AVCSSFS  GGG +KRERDLSSEEV+L+R+  RVSDEDEDG N RKKLRLSKQQ+ALLE+SFKQNSTLNPKQKQ LAR LNLRPRQVEVWFQNRRARTK+
Subjt:  AVCSSFS--GGGRVKRERDLSSEEVQLDRLSSRVSDEDEDGSNARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKL

Query:  KQTEVDCELLKKCCETLTDENRRLQKELQELKSLKLAQPLYMQMPAATLTICPSCERLGS-----GGGASKPKFSMAPTPHFYNPFSSPSAAC
        KQTEVDCE LKKCCETLT+ENRRLQKELQELK+LKLA PLYM MPAATLT+CPSCER+G      G GASKPKFSMAP PHFYNPFS+PSAAC
Subjt:  KQTEVDCELLKKCCETLTDENRRLQKELQELKSLKLAQPLYMQMPAATLTICPSCERLGS-----GGGASKPKFSMAPTPHFYNPFSSPSAAC

SwissProt top hits (e-value, % identity, alignment)
A2XE76 Homeobox-leucine zipper protein HOX19 | e-value: 4.5e-51 | identity: 57.41%
Query:  HSSSAVCSSFSGGGRVKRERDLSSEEVQLDRLSSRVS--DEDEDGSNARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRA
        HS S++    +    VKRER   +EE   +R+SS  +  D+D+DGS  RKKLRL+K+Q+ALLE  F+++STLNPKQK ALA+QLNLRPRQVEVWFQNRRA
Subjt:  HSSSAVCSSFSGGGRVKRERDLSSEEVQLDRLSSRVS--DEDEDGSNARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRA

Query:  RTKLKQTEVDCELLKKCCETLTDENRRLQKELQELKSLKLA----------------QPLYMQMPAATLTICPSCERLGSGGGASKPKFS--------MA
        RTKLKQTEVDCE LK+CCETLT+ENRRLQ+ELQEL++LK A                 P YMQ+PAATLTICPSCER+G    A+K   +          
Subjt:  RTKLKQTEVDCELLKKCCETLTDENRRLQKELQELKSLKLA----------------QPLYMQMPAATLTICPSCERLGSGGGASKPKFS--------MA

Query:  PTPHFYNPFSSPSAAC
         T HF+NPF+  SAAC
Subjt:  PTPHFYNPFSSPSAAC

P46603 Homeobox-leucine zipper protein HAT9 | e-value: 5.2e-68 | identity: 56%
Query:  MGLDDFCQTGLVLGLGLSNDQRLKKKPTPCSSSSLDFEPCALTLGFSGTGGGSETHQKVIDVNNKMMMINNNNSDVGVHHLHRQEASTHSSSAVCSSFSG
        MG DD C TGLVLGLG S         T   SS    EP +LTL  SG     +    V+                G   L RQ  S+HS     SSFS 
Subjt:  MGLDDFCQTGLVLGLGLSNDQRLKKKPTPCSSSSLDFEPCALTLGFSGTGGGSETHQKVIDVNNKMMMINNNNSDVGVHHLHRQEASTHSSSAVCSSFSG

Query:  GGRVKRERDLSSEEVQLDRLSSRV-SD--EDEDGSNARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDC
        G  VKRERD   E  + + ++ RV SD  EDE+G +ARKKLRL+KQQ+ALLE+SFK +STLNPKQKQ LARQLNLRPRQVEVWFQNRRARTKLKQTEVDC
Subjt:  GGRVKRERDLSSEEVQLDRLSSRV-SD--EDEDGSNARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDC

Query:  ELLKKCCETLTDENRRLQKELQELKSLKLAQPLYMQMPAATLTICPSCERLGSGGG-------------------ASKPKFSMAPTPHFYNPFSSPSAAC
        E LKKCCETL DEN RLQKE+QELK+LKL QP YM MPA+TLT CPSCER+G GGG                    +K  FS++  PHF+NPF++PSAAC
Subjt:  ELLKKCCETLTDENRRLQKELQELKSLKLAQPLYMQMPAATLTICPSCERLGSGGG-------------------ASKPKFSMAPTPHFYNPFSSPSAAC

P46604 Homeobox-leucine zipper protein HAT22 | e-value: 1.5e-70 | identity: 56.29%
Query:  MGLDDFCQTGLVLGLGLS-----NDQRLKKKPTPCSSSSLDFEPCALTLGFSGTGGGSETHQKVIDVNNKMMMINNNNSDVGVHHLHRQEASTHSSSAVC
        MGLDD C TGLVLGLGLS      +  +KK  +      +  +P +LTL  SG     +T                     G      ++ S+HS     
Subjt:  MGLDDFCQTGLVLGLGLS-----NDQRLKKKPTPCSSSSLDFEPCALTLGFSGTGGGSETHQKVIDVNNKMMMINNNNSDVGVHHLHRQEASTHSSSAVC

Query:  SSFSGGGRVKRERDLS-------SEEVQLDRLSSRVSD--EDEDGSNARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRA
        SSFS  GRVKRER++S       +EE     + SRVSD  +DE+G +ARKKLRL+KQQ+ALLE +FK +STLNPKQKQALARQLNLRPRQVEVWFQNRRA
Subjt:  SSFSGGGRVKRERDLS-------SEEVQLDRLSSRVSD--EDEDGSNARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRA

Query:  RTKLKQTEVDCELLKKCCETLTDENRRLQKELQELKSLKLAQPLYMQMPAATLTICPSCERLGSGG----------GASKPKFSMAPTPHFYNPFSSPSA
        RTKLKQTEVDCE LKKCCETLTDENRRLQKELQ+LK+LKL+QP YM MPAATLT+CPSCERLG GG            +K  FS+   P FYNPF++PSA
Subjt:  RTKLKQTEVDCELLKKCCETLTDENRRLQKELQELKSLKLAQPLYMQMPAATLTICPSCERLGSGG----------GASKPKFSMAPTPHFYNPFSSPSA

Query:  AC
        AC
Subjt:  AC

Q67UE2 Homeobox-leucine zipper protein HOX11 | e-value: 4.9e-50 | identity: 50%
Query:  LVLGLGLSNDQRLKKKPTPCSSSSLDFEPCALTLGF--------SGTGGGSETHQKVIDVNNKMMMINNNNSDVGVHHLHRQEASTHSSSAVCSSFSGGG
        +V GLGL         P P SS S   E  A T GF         G GGG    ++  DV    +  + NNS          + S H      ++  GGG
Subjt:  LVLGLGLSNDQRLKKKPTPCSSSSLDFEPCALTLGF--------SGTGGGSETHQKVIDVNNKMMMINNNNSDVGVHHLHRQEASTHSSSAVCSSFSGGG

Query:  RVKRERDLSSEEVQLDRLSSRVSDEDEDGSNARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCELLKK
                       DR  SR SDED DG +ARKKLRLSK+Q+A LE+SFK++STLNPKQK ALA+QLNLRPRQVEVWFQNRRARTKLKQTEVDCE LK+
Subjt:  RVKRERDLSSEEVQLDRLSSRVSDEDEDGSNARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCELLKK

Query:  CCETLTDENRRLQKELQELKSLKLAQPLYMQMPAATLTICPSCERLGSGGGASKPKFSMAPTPHFYNPFSSPSA
        CCETLT+ENRRLQKEL EL++LK   P YM +PA TL++CPSCER+ S    +    S A T     P ++PS+
Subjt:  CCETLTDENRRLQKELQELKSLKLAQPLYMQMPAATLTICPSCERLGSGGGASKPKFSMAPTPHFYNPFSSPSA

Q8GRL4 Homeobox-leucine zipper protein HOX19 | e-value: 4.5e-51 | identity: 57.41%
Query:  HSSSAVCSSFSGGGRVKRERDLSSEEVQLDRLSSRVS--DEDEDGSNARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRA
        HS S++    +    VKRER   +EE   +R+SS  +  D+D+DGS  RKKLRL+K+Q+ALLE  F+++STLNPKQK ALA+QLNLRPRQVEVWFQNRRA
Subjt:  HSSSAVCSSFSGGGRVKRERDLSSEEVQLDRLSSRVS--DEDEDGSNARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRA

Query:  RTKLKQTEVDCELLKKCCETLTDENRRLQKELQELKSLKLA----------------QPLYMQMPAATLTICPSCERLGSGGGASKPKFS--------MA
        RTKLKQTEVDCE LK+CCETLT+ENRRLQ+ELQEL++LK A                 P YMQ+PAATLTICPSCER+G    A+K   +          
Subjt:  RTKLKQTEVDCELLKKCCETLTDENRRLQKELQELKSLKLA----------------QPLYMQMPAATLTICPSCERLGSGGGASKPKFS--------MA

Query:  PTPHFYNPFSSPSAAC
         T HF+NPF+  SAAC
Subjt:  PTPHFYNPFSSPSAAC

Arabidopsis top hits (e-value, % identity, alignment)
AT2G22800.1 Homeobox-leucine zipper protein family | e-value: 3.7e-69 | identity: 56%
Query:  MGLDDFCQTGLVLGLGLSNDQRLKKKPTPCSSSSLDFEPCALTLGFSGTGGGSETHQKVIDVNNKMMMINNNNSDVGVHHLHRQEASTHSSSAVCSSFSG
        MG DD C TGLVLGLG S         T   SS    EP +LTL  SG     +    V+                G   L RQ  S+HS     SSFS 
Subjt:  MGLDDFCQTGLVLGLGLSNDQRLKKKPTPCSSSSLDFEPCALTLGFSGTGGGSETHQKVIDVNNKMMMINNNNSDVGVHHLHRQEASTHSSSAVCSSFSG

Query:  GGRVKRERDLSSEEVQLDRLSSRV-SD--EDEDGSNARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDC
        G  VKRERD   E  + + ++ RV SD  EDE+G +ARKKLRL+KQQ+ALLE+SFK +STLNPKQKQ LARQLNLRPRQVEVWFQNRRARTKLKQTEVDC
Subjt:  GGRVKRERDLSSEEVQLDRLSSRV-SD--EDEDGSNARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDC

Query:  ELLKKCCETLTDENRRLQKELQELKSLKLAQPLYMQMPAATLTICPSCERLGSGGG-------------------ASKPKFSMAPTPHFYNPFSSPSAAC
        E LKKCCETL DEN RLQKE+QELK+LKL QP YM MPA+TLT CPSCER+G GGG                    +K  FS++  PHF+NPF++PSAAC
Subjt:  ELLKKCCETLTDENRRLQKELQELKSLKLAQPLYMQMPAATLTICPSCERLGSGGG-------------------ASKPKFSMAPTPHFYNPFSSPSAAC

AT2G44910.1 homeobox-leucine zipper protein 4 | e-value: 5.2e-47 | identity: 46.34%
Query:  GLVLGLGLSNDQ------RLKKKPTPCSSSSLDFE---------------PCALTLGFSGTGGGSETHQKVIDVNN--KMMMINNNNSDVGVHHLHRQEA
        GL L L L N Q      RL   P   SSSS  F+                 + T  F  +G    T ++  D  +  +   +N   S V V  L  + A
Subjt:  GLVLGLGLSNDQ------RLKKKPTPCSSSSLDFE---------------PCALTLGFSGTGGGSETHQKVIDVNN--KMMMINNNNSDVGVHHLHRQEA

Query:  STHSSSAVCSSFSGGGR---VKRERDLS-SEEVQLDRLSSRVSDEDEDGSN---ARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEV
           S ++  SS SG  R   V R  D + +E     R       +DEDG N   +RKKLRLSK QA +LE++FK++STLNPKQK ALA+QLNLR RQVEV
Subjt:  STHSSSAVCSSFSGGGR---VKRERDLS-SEEVQLDRLSSRVSDEDEDGSN---ARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEV

Query:  WFQNRRARTKLKQTEVDCELLKKCCETLTDENRRLQKELQELKSLKLAQPLYMQM-PAATLTICPSCERLGSGGGASKPKFSMAPTP
        WFQNRRARTKLKQTEVDCE LK+CC+ LT+ENRRLQKE+ EL++LKL+  LYM M P  TLT+CPSCER+ S         S   TP
Subjt:  WFQNRRARTKLKQTEVDCELLKKCCETLTDENRRLQKELQELKSLKLAQPLYMQM-PAATLTICPSCERLGSGGGASKPKFSMAPTP

AT4G16780.1 homeobox protein 2 | e-value: 2.1e-48 | identity: 50.79%
Query:  GLVLGLGLSNDQ-RLKKKP----TPCSSSSLDFEPCALTLGFSGTGGGSETHQKV-------IDVNNKMMMINNNNSDVGVHHLHRQEASTHSSSAVCSS
        GL LGL     Q  LK  P    TP SSS   F   +    F+ +   S++ QK        IDVN         + D GV           SS     S
Subjt:  GLVLGLGLSNDQ-RLKKKP----TPCSSSSLDFEPCALTLGFSGTGGGSETHQKV-------IDVNNKMMMINNNNSDVGVHHLHRQEASTHSSSAVCSS

Query:  FSGGGRVKRERDLSSEEVQLDRLSSRVSDEDEDGSNARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDC
         S G R +RE D        D   SR   +DEDG N+RKKLRLSK Q+A+LE++FK +STLNPKQKQALA+QL LR RQVEVWFQNRRARTKLKQTEVDC
Subjt:  FSGGGRVKRERDLSSEEVQLDRLSSRVSDEDEDGSNARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDC

Query:  ELLKKCCETLTDENRRLQKELQELKSLKLAQPLYMQM-PAATLTICPSCERL
        E L++CCE LT+ENRRLQKE+ EL++LKL+   YM M P  TLT+CPSCE +
Subjt:  ELLKKCCETLTDENRRLQKELQELKSLKLAQPLYMQM-PAATLTICPSCERL

AT4G37790.1 Homeobox-leucine zipper protein family | e-value: 1.0e-71 | identity: 56.29%
Query:  MGLDDFCQTGLVLGLGLS-----NDQRLKKKPTPCSSSSLDFEPCALTLGFSGTGGGSETHQKVIDVNNKMMMINNNNSDVGVHHLHRQEASTHSSSAVC
        MGLDD C TGLVLGLGLS      +  +KK  +      +  +P +LTL  SG     +T                     G      ++ S+HS     
Subjt:  MGLDDFCQTGLVLGLGLS-----NDQRLKKKPTPCSSSSLDFEPCALTLGFSGTGGGSETHQKVIDVNNKMMMINNNNSDVGVHHLHRQEASTHSSSAVC

Query:  SSFSGGGRVKRERDLS-------SEEVQLDRLSSRVSD--EDEDGSNARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRA
        SSFS  GRVKRER++S       +EE     + SRVSD  +DE+G +ARKKLRL+KQQ+ALLE +FK +STLNPKQKQALARQLNLRPRQVEVWFQNRRA
Subjt:  SSFSGGGRVKRERDLS-------SEEVQLDRLSSRVSD--EDEDGSNARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRA

Query:  RTKLKQTEVDCELLKKCCETLTDENRRLQKELQELKSLKLAQPLYMQMPAATLTICPSCERLGSGG----------GASKPKFSMAPTPHFYNPFSSPSA
        RTKLKQTEVDCE LKKCCETLTDENRRLQKELQ+LK+LKL+QP YM MPAATLT+CPSCERLG GG            +K  FS+   P FYNPF++PSA
Subjt:  RTKLKQTEVDCELLKKCCETLTDENRRLQKELQELKSLKLAQPLYMQMPAATLTICPSCERLGSGG----------GASKPKFSMAPTPHFYNPFSSPSA

Query:  AC
        AC
Subjt:  AC

AT5G06710.1 homeobox from Arabidopsis thaliana | e-value: 3.3e-49 | identity: 64.85%
Query:  GGGRVKRERDLSSEEVQLDRLSSRVSDEDEDGSN--ARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDC
        G  R   +RD+  E   ++R +SR S+ED D  N   RKKLRLSK Q+A LE SFK++STLNPKQK ALA+QLNLRPRQVEVWFQNRRARTKLKQTEVDC
Subjt:  GGGRVKRERDLSSEEVQLDRLSSRVSDEDEDGSN--ARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDC

Query:  ELLKKCCETLTDENRRLQKELQELKSLKLAQPLYMQMPAATLTICPSCERLGSGGGASKPKFSMA
        E LK+CCE+LT+ENRRLQKE++EL++LK + P YMQ+PA TLT+CPSCER+ +   A++P  S A
Subjt:  ELLKKCCETLTDENRRLQKELQELKSLKLAQPLYMQMPAATLTICPSCERLGSGGGASKPKFSMA


Sequences
CDS sequence
ATGGGTTTGGATGATTTTTGTCAAACAGGCTTGGTGTTGGGCTTAGGGCTCTCTAATGATCAAAGGTTGAAGAAGAAGCCTACACCTTGCTCCTCTAGTTCCCTTGATTT
TGAGCCTTGTGCTTTGACTTTGGGATTTTCCGGCACTGGAGGCGGCAGCGAAACTCACCAGAAAGTTATTGATGTCAACAACAAGATGATGATGATTAATAACAACAACT
CTGATGTGGGTGTTCATCATTTGCATCGACAAGAAGCTTCCACTCATAGCAGCAGCGCTGTTTGTTCTTCCTTCTCCGGCGGCGGTAGGGTTAAAAGGGAGAGAGATCTT
AGCAGCGAAGAAGTTCAATTGGACAGACTTTCTTCTAGAGTTAGTGATGAAGATGAAGATGGTTCTAATGCTAGAAAGAAACTTAGGCTCTCTAAACAACAAGCCGCTCT
CTTGGAACAAAGTTTCAAGCAAAATAGCACCCTCAATCCTAAGCAAAAACAAGCCTTGGCGAGACAGCTAAATCTACGGCCACGACAAGTTGAAGTATGGTTTCAAAATA
GGAGAGCTCGAACAAAACTGAAACAAACAGAAGTAGACTGTGAGTTATTGAAGAAGTGTTGTGAGACGTTGACAGATGAAAATAGAAGGCTACAAAAGGAGCTTCAAGAA
TTGAAGTCGTTAAAGCTGGCTCAACCACTATACATGCAGATGCCGGCGGCGACATTAACTATATGCCCCTCCTGCGAAAGGCTCGGCAGCGGCGGCGGCGCTTCCAAACC
CAAATTTTCAATGGCTCCTACGCCTCACTTTTACAACCCCTTCTCCAGTCCTTCCGCCGCATGTTAG
mRNA sequence
ATGGGTTTGGATGATTTTTGTCAAACAGGCTTGGTGTTGGGCTTAGGGCTCTCTAATGATCAAAGGTTGAAGAAGAAGCCTACACCTTGCTCCTCTAGTTCCCTTGATTT
TGAGCCTTGTGCTTTGACTTTGGGATTTTCCGGCACTGGAGGCGGCAGCGAAACTCACCAGAAAGTTATTGATGTCAACAACAAGATGATGATGATTAATAACAACAACT
CTGATGTGGGTGTTCATCATTTGCATCGACAAGAAGCTTCCACTCATAGCAGCAGCGCTGTTTGTTCTTCCTTCTCCGGCGGCGGTAGGGTTAAAAGGGAGAGAGATCTT
AGCAGCGAAGAAGTTCAATTGGACAGACTTTCTTCTAGAGTTAGTGATGAAGATGAAGATGGTTCTAATGCTAGAAAGAAACTTAGGCTCTCTAAACAACAAGCCGCTCT
CTTGGAACAAAGTTTCAAGCAAAATAGCACCCTCAATCCTAAGCAAAAACAAGCCTTGGCGAGACAGCTAAATCTACGGCCACGACAAGTTGAAGTATGGTTTCAAAATA
GGAGAGCTCGAACAAAACTGAAACAAACAGAAGTAGACTGTGAGTTATTGAAGAAGTGTTGTGAGACGTTGACAGATGAAAATAGAAGGCTACAAAAGGAGCTTCAAGAA
TTGAAGTCGTTAAAGCTGGCTCAACCACTATACATGCAGATGCCGGCGGCGACATTAACTATATGCCCCTCCTGCGAAAGGCTCGGCAGCGGCGGCGGCGCTTCCAAACC
CAAATTTTCAATGGCTCCTACGCCTCACTTTTACAACCCCTTCTCCAGTCCTTCCGCCGCATGTTAG
Protein sequence
MGLDDFCQTGLVLGLGLSNDQRLKKKPTPCSSSSLDFEPCALTLGFSGTGGGSETHQKVIDVNNKMMMINNNNSDVGVHHLHRQEASTHSSSAVCSSFSGGGRVKRERDL
SSEEVQLDRLSSRVSDEDEDGSNARKKLRLSKQQAALLEQSFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCELLKKCCETLTDENRRLQKELQE
LKSLKLAQPLYMQMPAATLTICPSCERLGSGGGASKPKFSMAPTPHFYNPFSSPSAAC
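
Consistency check (sketch)
The CDS and protein sequences listed in this record can be cross-checked by translating the CDS with the standard genetic code. The Python sketch below is illustrative only and is not part of CuGenDB: the function name `translate` and the two short test fragments (the first 30 and last 15 nucleotides of the CDS above) are choices made for this example, and it assumes the standard nuclear genetic code applies.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0 and stop at the first stop codon.

    Hypothetical helper for this example. Whitespace and line breaks
    are removed first, so a sequence pasted over several lines works.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 15 nt of the CDS listed in this record.
cds_start = "ATGGGTTTGGATGATTTTTGTCAAACAGGC"
cds_end = "TCCGCCGCATGTTAG"

print(translate(cds_start))  # MGLDDFCQTG
print(translate(cds_end))    # SAAC
```

Passing the full CDS to `translate` and comparing the result with the protein sequence above is a quick way to confirm that the two entries belong to the same gene model.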