; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS008668 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS008668
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionhomeobox-leucine zipper protein HAT9-like
Genome locationscaffold4:2951627..2952998
RNA-Seq ExpressionMS008668
SyntenyMS008668
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR001356 - Homeobox domain
IPR003106 - Leucine zipper, homeobox-associated
IPR009057 - Homeobox-like domain superfamily
IPR017970 - Homeobox, conserved site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591449.1 Homeobox-leucine zipper protein HAT9, partial [Cucurbita argyrosperma subsp. sororia]1.6e-10174.15Show/hide
Query:  MDGDCSTGTGLLLGLGRSERNPLRSLVPEV-DVKKKL-VLKFDDILPCLTLGLSGPTPAGFSVAEIIVGKT--TAAEELLQQGSSASPVSSFSNSSGSKR
        MD DC+TG  L LG G    N +R  VP+V  VKKKL VLKFDDILP LTLGLS        V E   G++  TAAEEL+ QGSS SPVSSFSNSSG KR
Subjt:  MDGDCSTGTGLLLGLGRSERNPLRSLVPEV-DVKKKL-VLKFDDILPCLTLGLSGPTPAGFSVAEIIVGKT--TAAEELLQQGSSASPVSSFSNSSGSKR

Query:  DRSDGWIGGGEEREAEAAEIYLERVSSKVGDQEDEDGSPRKKLRLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLKQTEM
        +R      GG   E   AE+++ER+S KV  +E+EDGSPRKKLRLTKEQSA+LED+FKEHSSL+PKQK DLARQL+LRPRQVEVWFQNRRARTKLKQTEM
Subjt:  DRSDGWIGGGEEREAEAAEIYLERVSSKVGDQEDEDGSPRKKLRLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLKQTEM

Query:  DCELLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCERSICGGGGGG----DASPTTAFSIGSKPTFLKFPFNHPSAAC
        DCELLKKCCEKLKEENT+LQKELQELKSLKLTAPPFCMQLQAATLTVCPSCE SICGGGGGG    DASP   FSIGSKP FLKFPFNHPSAAC
Subjt:  DCELLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCERSICGGGGGG----DASPTTAFSIGSKPTFLKFPFNHPSAAC

XP_022137752.1 homeobox-leucine zipper protein HAT9-like [Momordica charantia]1.1e-153100Show/hide
Query:  MDGDCSTGTGLLLGLGRSERNPLRSLVPEVDVKKKLVLKFDDILPCLTLGLSGPTPAGFSVAEIIVGKTTAAEELLQQGSSASPVSSFSNSSGSKRDRSD
        MDGDCSTGTGLLLGLGRSERNPLRSLVPEVDVKKKLVLKFDDILPCLTLGLSGPTPAGFSVAEIIVGKTTAAEELLQQGSSASPVSSFSNSSGSKRDRSD
Subjt:  MDGDCSTGTGLLLGLGRSERNPLRSLVPEVDVKKKLVLKFDDILPCLTLGLSGPTPAGFSVAEIIVGKTTAAEELLQQGSSASPVSSFSNSSGSKRDRSD

Query:  GWIGGGEEREAEAAEIYLERVSSKVGDQEDEDGSPRKKLRLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLKQTEMDCEL
        GWIGGGEEREAEAAEIYLERVSSKVGDQEDEDGSPRKKLRLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLKQTEMDCEL
Subjt:  GWIGGGEEREAEAAEIYLERVSSKVGDQEDEDGSPRKKLRLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLKQTEMDCEL

Query:  LKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCERSICGGGGGGDASPTTAFSIGSKPTFLKFPFNHPSAAC
        LKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCERSICGGGGGGDASPTTAFSIGSKPTFLKFPFNHPSAAC
Subjt:  LKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCERSICGGGGGGDASPTTAFSIGSKPTFLKFPFNHPSAAC

XP_022935802.1 homeobox-leucine zipper protein HAT9-like [Cucurbita moschata]4.6e-10172.67Show/hide
Query:  MDGDCSTGTGLLLGLGRSERNPLRSLVPEV-DVKKKL-VLKFDDILPCLTLGLSGPTPAGFSVAEIIVGKT------TAAEELLQQGSSASPVSSFSNSS
        MD DC+TG  L LG G    N +R  VP+V  VKKKL VLKFDDILP LTLGLS           ++V K+      TAA+EL+QQGSS SPVSSFS+SS
Subjt:  MDGDCSTGTGLLLGLGRSERNPLRSLVPEV-DVKKKL-VLKFDDILPCLTLGLSGPTPAGFSVAEIIVGKT------TAAEELLQQGSSASPVSSFSNSS

Query:  GSKRDRSDGWIGGGEEREAEAAEIYLERVSSKVGDQEDEDGSPRKKLRLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLK
        G KR+R     GG  E  AE AE+++ER+S KV  +E+EDGSPRKKLRLTKEQSA+LED+FKEHSSL+PKQK DLARQL+LRPRQVEVWFQNRRARTKLK
Subjt:  GSKRDRSDGWIGGGEEREAEAAEIYLERVSSKVGDQEDEDGSPRKKLRLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLK

Query:  QTEMDCELLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCERSICGGGGGG------DASPTTAFSIGSKPTFLKFPFNHPSAAC
        QTEMDCELLKKCCEKLKEENT+LQKELQELKSLKLTAPPFCMQLQAATLTVCPSCE SICGGGGGG      DASP   FSIGSKP FLKFPFNHPSAAC
Subjt:  QTEMDCELLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCERSICGGGGGG------DASPTTAFSIGSKPTFLKFPFNHPSAAC

XP_023536403.1 homeobox-leucine zipper protein HAT9-like [Cucurbita pepo subsp. pepo]2.7e-10172.91Show/hide
Query:  MDGDCSTGTGLLLGLGRSERNPLRSLVPEV-DVKKKL-VLKFDDILPCLTLGLSGPTPAGFSVAEIIVGKT------TAAEELLQQGSSASPVSSFSNSS
        MD DC+TG  L LG G    N +R  VP+V  VKKKL VLKFDDILP LTLGLS           ++V K+      TAA+EL+QQGSS SPVSSFSNSS
Subjt:  MDGDCSTGTGLLLGLGRSERNPLRSLVPEV-DVKKKL-VLKFDDILPCLTLGLSGPTPAGFSVAEIIVGKT------TAAEELLQQGSSASPVSSFSNSS

Query:  GSKRDRSDGWIGGGEEREAEAAEIYLERVSSKVGDQEDEDGSPRKKLRLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLK
        G KR+R     GG  E  AE  E+++ER+S KV  +E+EDGSPRKKLRLTKEQSA+LED+FKEHSSL+PKQK DLARQL+LRPRQVEVWFQNRRARTKLK
Subjt:  GSKRDRSDGWIGGGEEREAEAAEIYLERVSSKVGDQEDEDGSPRKKLRLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLK

Query:  QTEMDCELLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCERSICGGGGGG-----DASPTTAFSIGSKPTFLKFPFNHPSAAC
        QTEMDCELLKKCCEKLKEENT+LQKELQELKSLKLTAPPFCMQLQAATLTVCPSCE SICGGGGGG     DASP   FSIGSKP FLKFPFNHPSAAC
Subjt:  QTEMDCELLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCERSICGGGGGG-----DASPTTAFSIGSKPTFLKFPFNHPSAAC

XP_038898886.1 homeobox-leucine zipper protein HAT22-like [Benincasa hispida]5.0e-10376.37Show/hide
Query:  MDGDCSTGTGLLLGLGR-SERNPLRSLVPE--VDVKKKL-VLKFDDILPCLTLGLSGPTPAGFSVAEIIVGKT--TAAEELLQQGSSASPVSSFSNSSGS
        M+ DC+  TGLLLGLGR S  + +RS+VP     VKKKL VLKFDDILP LTLGLS           I+V K+   A EEL Q+G S SPVSSFSNSSG 
Subjt:  MDGDCSTGTGLLLGLGR-SERNPLRSLVPE--VDVKKKL-VLKFDDILPCLTLGLSGPTPAGFSVAEIIVGKT--TAAEELLQQGSSASPVSSFSNSSGS

Query:  KRDRSDGWIGGGEEREAEAAEIYLERVSSKVGDQEDEDGSPRKKLRLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLKQT
        KR+R  G  GGGEE EAE    Y+ERV  KVG +EDEDGSPRKKLRLTK+QSAILED+FKEHSSL+PKQK DLA+QL+LRPRQVEVWFQNRRARTKLKQT
Subjt:  KRDRSDGWIGGGEEREAEAAEIYLERVSSKVGDQEDEDGSPRKKLRLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLKQT

Query:  EMDCELLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCERSICGGGGGGDASPTTAFSIGSKPTFLKFPFNHPSAAC
        EMDCELLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCE SICGGGGGGDASP   FSI SKP FLKFPFNHPSAAC
Subjt:  EMDCELLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCERSICGGGGGGDASPTTAFSIGSKPTFLKFPFNHPSAAC

TrEMBL top hitse value%identityAlignment
A0A0A0L3X3 Homeobox domain-containing protein1.6e-8670.31Show/hide
Query:  MDGDCSTGTGLLLGLGR-SERN---PLRSLVPEVDVKK-KLVLKF-DDILPCLTLGLSGPTPAGFSVAEIIVGKTTAAEELLQQGSSASPVSSFSNSSGS
        MD DC+  TGLLLGLGR S  N    +RS +P ++ KK + VLKF DDILP LTLGLS            +V   T      + G S SPVSSFSNSSG 
Subjt:  MDGDCSTGTGLLLGLGR-SERN---PLRSLVPEVDVKK-KLVLKF-DDILPCLTLGLSGPTPAGFSVAEIIVGKTTAAEELLQQGSSASPVSSFSNSSGS

Query:  KRDRSDGWIGGGEEREAEAAEIYLERVSSKVGDQEDEDGSPRKKLRLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLKQT
        KR+R+      GEE  AE  E        KVG +EDE+GSPRKKLRLTK QSAILED+FKEHSSL+PKQK DLARQL+LRPRQVEVWFQNRRARTKLKQT
Subjt:  KRDRSDGWIGGGEEREAEAAEIYLERVSSKVGDQEDEDGSPRKKLRLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLKQT

Query:  EMDCELLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCERSICGG-GGGGDASPTTAFSIGSKPTFLKFPFNHPSAAC
        EMDCELLKKCCEKLKEENTRLQKELQELKSLKLT PPFCMQLQAATLTVCPSCE SICGG   GGDASP   FSIGSKP FLKFPFNHPSAAC
Subjt:  EMDCELLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCERSICGG-GGGGDASPTTAFSIGSKPTFLKFPFNHPSAAC

A0A6J1C954 homeobox-leucine zipper protein HAT9-like5.1e-154100Show/hide
Query:  MDGDCSTGTGLLLGLGRSERNPLRSLVPEVDVKKKLVLKFDDILPCLTLGLSGPTPAGFSVAEIIVGKTTAAEELLQQGSSASPVSSFSNSSGSKRDRSD
        MDGDCSTGTGLLLGLGRSERNPLRSLVPEVDVKKKLVLKFDDILPCLTLGLSGPTPAGFSVAEIIVGKTTAAEELLQQGSSASPVSSFSNSSGSKRDRSD
Subjt:  MDGDCSTGTGLLLGLGRSERNPLRSLVPEVDVKKKLVLKFDDILPCLTLGLSGPTPAGFSVAEIIVGKTTAAEELLQQGSSASPVSSFSNSSGSKRDRSD

Query:  GWIGGGEEREAEAAEIYLERVSSKVGDQEDEDGSPRKKLRLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLKQTEMDCEL
        GWIGGGEEREAEAAEIYLERVSSKVGDQEDEDGSPRKKLRLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLKQTEMDCEL
Subjt:  GWIGGGEEREAEAAEIYLERVSSKVGDQEDEDGSPRKKLRLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLKQTEMDCEL

Query:  LKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCERSICGGGGGGDASPTTAFSIGSKPTFLKFPFNHPSAAC
        LKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCERSICGGGGGGDASPTTAFSIGSKPTFLKFPFNHPSAAC
Subjt:  LKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCERSICGGGGGGDASPTTAFSIGSKPTFLKFPFNHPSAAC

A0A6J1F6L5 homeobox-leucine zipper protein HAT9-like2.2e-10172.67Show/hide
Query:  MDGDCSTGTGLLLGLGRSERNPLRSLVPEV-DVKKKL-VLKFDDILPCLTLGLSGPTPAGFSVAEIIVGKT------TAAEELLQQGSSASPVSSFSNSS
        MD DC+TG  L LG G    N +R  VP+V  VKKKL VLKFDDILP LTLGLS           ++V K+      TAA+EL+QQGSS SPVSSFS+SS
Subjt:  MDGDCSTGTGLLLGLGRSERNPLRSLVPEV-DVKKKL-VLKFDDILPCLTLGLSGPTPAGFSVAEIIVGKT------TAAEELLQQGSSASPVSSFSNSS

Query:  GSKRDRSDGWIGGGEEREAEAAEIYLERVSSKVGDQEDEDGSPRKKLRLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLK
        G KR+R     GG  E  AE AE+++ER+S KV  +E+EDGSPRKKLRLTKEQSA+LED+FKEHSSL+PKQK DLARQL+LRPRQVEVWFQNRRARTKLK
Subjt:  GSKRDRSDGWIGGGEEREAEAAEIYLERVSSKVGDQEDEDGSPRKKLRLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLK

Query:  QTEMDCELLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCERSICGGGGGG------DASPTTAFSIGSKPTFLKFPFNHPSAAC
        QTEMDCELLKKCCEKLKEENT+LQKELQELKSLKLTAPPFCMQLQAATLTVCPSCE SICGGGGGG      DASP   FSIGSKP FLKFPFNHPSAAC
Subjt:  QTEMDCELLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCERSICGGGGGG------DASPTTAFSIGSKPTFLKFPFNHPSAAC

A0A6J1IHG6 homeobox-leucine zipper protein HAT9-like6.5e-10173.88Show/hide
Query:  MDGDCSTGTGLLLGLGRSERNPLRSLVPEV-DVKKKL-VLKFDDILPCLTLGLSGPTPAGFSVAEIIVGKTTAAEELLQQGSSASPVSSFSNSSGSKRDR
        MD DC+ G  L LG G    N +R LVP+V  VKKKL VLKFDDILP LTLGL        SV     G  +AA++L+ QGSS SP SSFSNSSG KR+R
Subjt:  MDGDCSTGTGLLLGLGRSERNPLRSLVPEV-DVKKKL-VLKFDDILPCLTLGLSGPTPAGFSVAEIIVGKTTAAEELLQQGSSASPVSSFSNSSGSKRDR

Query:  SDGWIGGGEEREAEAAEIYLERVSSKVGDQEDEDGSPRKKLRLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLKQTEMDC
             GG  E  AE AE+++ER+S KV  +E+EDGSPRKKLRLTKEQSA+LED+FKEHSSL+PKQK DLARQL+LRPRQVEVWFQNRRARTKLKQTEMDC
Subjt:  SDGWIGGGEEREAEAAEIYLERVSSKVGDQEDEDGSPRKKLRLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLKQTEMDC

Query:  ELLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCERSICGGGGGG---DASPTTAFSIGSKPTFLKFPFNHPSAAC
        ELLKKCCEKLKEENT+LQKELQELKSLKLTAPPFCMQLQAATLTVCPSCE SICGGGGGG   DASP   FSIGSKP FLKFPFNHPSAAC
Subjt:  ELLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCERSICGGGGGG---DASPTTAFSIGSKPTFLKFPFNHPSAAC

B9R6T5 Homeobox protein, putative1.6e-7061.54Show/hide
Query:  GDCSTGTGLLLGLGRSERNPL--RSLVPEVDVKKKLVLKFDDILPCLTLGLSGPTPAGFSVAEIIVGKTTAAEELLQQGSSASPVSSFSNSSGSKRDRSD
        G C+TG GL L     E N    +S       KKKL LK+D + P LTLGL  P  A  SV ++         +L  Q SS S VSSFSNSS  K++R  
Subjt:  GDCSTGTGLLLGLGRSERNPL--RSLVPEVDVKKKLVLKFDDILPCLTLGLSGPTPAGFSVAEIIVGKTTAAEELLQQGSSASPVSSFSNSSGSKRDRSD

Query:  GWIGGGEEREAEAAEIYLERVSSKVGDQEDEDGSPRKKLRLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLKQTEMDCEL
        G  GGGEE +        ERVSS+V D EDE+GSPRKKLRLTK+QSAILED+FKEHS+L PKQK  LA QL+LRPRQVEVWFQNRRARTKLKQTE+DCE+
Subjt:  GWIGGGEEREAEAAEIYLERVSSKVGDQEDEDGSPRKKLRLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLKQTEMDCEL

Query:  LKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCERSICGGGGGGDASPTTAFSIGSKPTFLKFPFNHPSAAC
        LKKCCE L EEN RLQKELQELKSLKL A PF MQL AATLT+CPSCER I GGG G  ++ T    +GSKP F   PF HPSAAC
Subjt:  LKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCERSICGGGGGGDASPTTAFSIGSKPTFLKFPFNHPSAAC

SwissProt top hitse value%identityAlignment
A2XE76 Homeobox-leucine zipper protein HOX198.4e-4550.38Show/hide
Query:  PCLTLGLSGPTPAGFSVAEIIVGKTTAAEELLQQGSSASPVSSFSNSSGS----KRDRSDGWIGGGEEREAEAAEIYLERVSS-KVGDQEDEDGSPRKKL
        P LTL L     AG           TA       G  A  VSS S  + +    KR+R++         EA+      ERVSS   G  +D+DGS RKKL
Subjt:  PCLTLGLSGPTPAGFSVAEIIVGKTTAAEELLQQGSSASPVSSFSNSSGS----KRDRSDGWIGGGEEREAEAAEIYLERVSS-KVGDQEDEDGSPRKKL

Query:  RLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQELKSLKLTAP---------
        RLTKEQSA+LED F+EHS+L PKQK+ LA+QL+LRPRQVEVWFQNRRARTKLKQTE+DCE LK+CCE L EEN RLQ+ELQEL++LK   P         
Subjt:  RLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQELKSLKLTAP---------

Query:  ------PFCMQLQAATLTVCPSCERSICGGGGGGD---ASPTTAFSIGSKPTFLKF-PFNHPSAAC
              PF MQL AATLT+CPSCER   GG        A+  T    G   T   F PF H SAAC
Subjt:  ------PFCMQLQAATLTVCPSCERSICGGGGGGD---ASPTTAFSIGSKPTFLKF-PFNHPSAAC

P46603 Homeobox-leucine zipper protein HAT93.9e-5854.3Show/hide
Query:  MDGDCSTGTGLLLGLGRSE-RNPLRSLVPEVDVKKKLVLKFDDILPCLTLGLSGPTPAGFSVAEIIVGKTTAAEELLQQGSSASPVSSFSNSSGSKRDRS
        M  D +  TGL+LGLG S   N   S + +  V K        + P LTL LSG         +  V   T A++L +Q SS S VSSFS+    KR+R 
Subjt:  MDGDCSTGTGLLLGLGRSE-RNPLRSLVPEVDVKKKLVLKFDDILPCLTLGLSGPTPAGFSVAEIIVGKTTAAEELLQQGSSASPVSSFSNSSGSKRDRS

Query:  DGWIGGGEEREAEAAEIYLERVSSKVGDQEDEDG-SPRKKLRLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLKQTEMDC
             GGEE   E  E   ERV S     EDE+G S RKKLRLTK+QSA+LE+SFK+HS+L PKQK  LARQL+LRPRQVEVWFQNRRARTKLKQTE+DC
Subjt:  DGWIGGGEEREAEAAEIYLERVSSKVGDQEDEDG-SPRKKLRLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLKQTEMDC

Query:  ELLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCERSICGG---GGGGDASPTT-----------AFSIGSKPTFLKFPFNHPSA
        E LKKCCE L +EN RLQKE+QELK+LKLT  PF M + A+TLT CPSCER   GG   GGGG  S  T           AFSI SKP F   PF +PSA
Subjt:  ELLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCERSICGG---GGGGDASPTT-----------AFSIGSKPTFLKFPFNHPSA

Query:  AC
        AC
Subjt:  AC

P46604 Homeobox-leucine zipper protein HAT228.6e-5852.22Show/hide
Query:  MDGDCSTGTGLLLGLGRSERNPLRSLVPEVDVKKKLVLKFDDILPCLTLGLSGPTPAGFSVAEIIVGKTTAAEELLQQGSSASPVSSFSNSSGSKRDRSD
        +D  C+TG  L LGL  +  N   ++           ++ D   P LTL LSG +       +I  G   A +++ +Q SS S +SSFS S   KR+R  
Subjt:  MDGDCSTGTGLLLGLGRSERNPLRSLVPEVDVKKKLVLKFDDILPCLTLGLSGPTPAGFSVAEIIVGKTTAAEELLQQGSSASPVSSFSNSSGSKRDRSD

Query:  GWIGGGEEREAEAAEIYLERVSSKVGDQEDEDG-SPRKKLRLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLKQTEMDCE
            G EE E     +   RVS    D +DE+G S RKKLRLTK+QSA+LED+FK HS+L PKQK  LARQL+LRPRQVEVWFQNRRARTKLKQTE+DCE
Subjt:  GWIGGGEEREAEAAEIYLERVSSKVGDQEDEDG-SPRKKLRLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLKQTEMDCE

Query:  LLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCERSICGGGGGGDASPTT------AFSIGSKPTFLKFPFNHPSAAC
         LKKCCE L +EN RLQKELQ+LK+LKL + PF M + AATLT+CPSCER + GGG GGD +         AFSI +KP F   PF +PSAAC
Subjt:  LLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCERSICGGGGGGDASPTT------AFSIGSKPTFLKFPFNHPSAAC

P46665 Homeobox-leucine zipper protein HAT146.4e-4553.51Show/hide
Query:  VDVKKKLVLKFDDILPCLTLG-LSGPTPAGFSVAEIIVGKTTAAEELLQQGSSASP---VSSFSNSSGSKRDRSDGWIGGGEEREAEAAEI--YLERVSS
        VD   +L L F + LP  + G   G  P G   A  +V +    EE +   S + P    SSF    G K          G ER +   +I   +ER +S
Subjt:  VDVKKKLVLKFDDILPCLTLG-LSGPTPAGFSVAEIIVGKTTAAEELLQQGSSASP---VSSFSNSSGSKRDRSDGWIGGGEEREAEAAEI--YLERVSS

Query:  KVG--DQEDEDGSPRKKLRLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQE
        +    D +DE+GS RKKLRL+K+QSA LEDSFKEHS+L PKQK+ LA+QL+LRPRQVEVWFQNRRARTKLKQTE+DCE LK+CCE L EEN RLQKE++E
Subjt:  KVG--DQEDEDGSPRKKLRLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQE

Query:  LKSLKLTAPPFCMQLQAATLTVCPSCER
        L++LK T+ PF MQL A TLT+CPSCER
Subjt:  LKSLKLTAPPFCMQLQAATLTVCPSCER

Q8GRL4 Homeobox-leucine zipper protein HOX198.4e-4550.38Show/hide
Query:  PCLTLGLSGPTPAGFSVAEIIVGKTTAAEELLQQGSSASPVSSFSNSSGS----KRDRSDGWIGGGEEREAEAAEIYLERVSS-KVGDQEDEDGSPRKKL
        P LTL L     AG           TA       G  A  VSS S  + +    KR+R++         EA+      ERVSS   G  +D+DGS RKKL
Subjt:  PCLTLGLSGPTPAGFSVAEIIVGKTTAAEELLQQGSSASPVSSFSNSSGS----KRDRSDGWIGGGEEREAEAAEIYLERVSS-KVGDQEDEDGSPRKKL

Query:  RLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQELKSLKLTAP---------
        RLTKEQSA+LED F+EHS+L PKQK+ LA+QL+LRPRQVEVWFQNRRARTKLKQTE+DCE LK+CCE L EEN RLQ+ELQEL++LK   P         
Subjt:  RLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQELKSLKLTAP---------

Query:  ------PFCMQLQAATLTVCPSCERSICGGGGGGD---ASPTTAFSIGSKPTFLKF-PFNHPSAAC
              PF MQL AATLT+CPSCER   GG        A+  T    G   T   F PF H SAAC
Subjt:  ------PFCMQLQAATLTVCPSCERSICGGGGGGD---ASPTTAFSIGSKPTFLKF-PFNHPSAAC

Arabidopsis top hitse value%identityAlignment
AT2G22800.1 Homeobox-leucine zipper protein family2.7e-5954.3Show/hide
Query:  MDGDCSTGTGLLLGLGRSE-RNPLRSLVPEVDVKKKLVLKFDDILPCLTLGLSGPTPAGFSVAEIIVGKTTAAEELLQQGSSASPVSSFSNSSGSKRDRS
        M  D +  TGL+LGLG S   N   S + +  V K        + P LTL LSG         +  V   T A++L +Q SS S VSSFS+    KR+R 
Subjt:  MDGDCSTGTGLLLGLGRSE-RNPLRSLVPEVDVKKKLVLKFDDILPCLTLGLSGPTPAGFSVAEIIVGKTTAAEELLQQGSSASPVSSFSNSSGSKRDRS

Query:  DGWIGGGEEREAEAAEIYLERVSSKVGDQEDEDG-SPRKKLRLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLKQTEMDC
             GGEE   E  E   ERV S     EDE+G S RKKLRLTK+QSA+LE+SFK+HS+L PKQK  LARQL+LRPRQVEVWFQNRRARTKLKQTE+DC
Subjt:  DGWIGGGEEREAEAAEIYLERVSSKVGDQEDEDG-SPRKKLRLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLKQTEMDC

Query:  ELLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCERSICGG---GGGGDASPTT-----------AFSIGSKPTFLKFPFNHPSA
        E LKKCCE L +EN RLQKE+QELK+LKLT  PF M + A+TLT CPSCER   GG   GGGG  S  T           AFSI SKP F   PF +PSA
Subjt:  ELLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCERSICGG---GGGGDASPTT-----------AFSIGSKPTFLKFPFNHPSA

Query:  AC
        AC
Subjt:  AC

AT2G44910.1 homeobox-leucine zipper protein 43.4e-4152.74Show/hide
Query:  ASPVSSFSNSSGSKRDRSDGWIGGGEEREAEAAEIYLERVSSKVGDQEDEDG----SPRKKLRLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQ
        +SP S+ S+ SG+KRD +     GG+E EAE A       S   G  +DEDG      RKKLRL+K+Q+ +LE++FKEHS+L PKQKL LA+QL+LR RQ
Subjt:  ASPVSSFSNSSGSKRDRSDGWIGGGEEREAEAAEIYLERVSSKVGDQEDEDG----SPRKKLRLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQ

Query:  VEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCERSICGGGGGGDASPTTAFSIGSKPTFLKF
        VEVWFQNRRARTKLKQTE+DCE LK+CC+ L EEN RLQKE+ EL++LKL+   +       TLT+CPSCER          A+ T A S  + PT +  
Subjt:  VEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCERSICGGGGGGDASPTTAFSIGSKPTFLKF

Query:  P
        P
Subjt:  P

AT3G60390.1 homeobox-leucine zipper protein 33.2e-3949.28Show/hide
Query:  TTAAEELLQQGSSASPVSSFSNS--SGSKRDR----SDGWIGGGEEREAEAAEIYLERVSSKV-GDQEDEDG------SPRKKLRLTKEQSAILEDSFKE
        +T   ++  +G+  S  +S  +S  SG K +R    + G +GGG   + E     +ER S  + G  +DEDG      S RKKLRL+KEQ+ +LE++FKE
Subjt:  TTAAEELLQQGSSASPVSSFSNS--SGSKRDR----SDGWIGGGEEREAEAAEIYLERVSSKV-GDQEDEDG------SPRKKLRLTKEQSAILEDSFKE

Query:  HSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCERSICGGG
        HS+L PKQK+ LA+QL+LR RQVEVWFQNRRARTKLKQTE+DCE LK+CCE L +EN RLQKE+ EL++LKL+   +       TLT+CPSCER      
Subjt:  HSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCERSICGGG

Query:  GGGDASP
            A P
Subjt:  GGGDASP

AT4G37790.1 Homeobox-leucine zipper protein family6.1e-5952.22Show/hide
Query:  MDGDCSTGTGLLLGLGRSERNPLRSLVPEVDVKKKLVLKFDDILPCLTLGLSGPTPAGFSVAEIIVGKTTAAEELLQQGSSASPVSSFSNSSGSKRDRSD
        +D  C+TG  L LGL  +  N   ++           ++ D   P LTL LSG +       +I  G   A +++ +Q SS S +SSFS S   KR+R  
Subjt:  MDGDCSTGTGLLLGLGRSERNPLRSLVPEVDVKKKLVLKFDDILPCLTLGLSGPTPAGFSVAEIIVGKTTAAEELLQQGSSASPVSSFSNSSGSKRDRSD

Query:  GWIGGGEEREAEAAEIYLERVSSKVGDQEDEDG-SPRKKLRLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLKQTEMDCE
            G EE E     +   RVS    D +DE+G S RKKLRLTK+QSA+LED+FK HS+L PKQK  LARQL+LRPRQVEVWFQNRRARTKLKQTE+DCE
Subjt:  GWIGGGEEREAEAAEIYLERVSSKVGDQEDEDG-SPRKKLRLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLKQTEMDCE

Query:  LLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCERSICGGGGGGDASPTT------AFSIGSKPTFLKFPFNHPSAAC
         LKKCCE L +EN RLQKELQ+LK+LKL + PF M + AATLT+CPSCER + GGG GGD +         AFSI +KP F   PF +PSAAC
Subjt:  LLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCERSICGGGGGGDASPTT------AFSIGSKPTFLKFPFNHPSAAC

AT5G06710.1 homeobox from Arabidopsis thaliana4.6e-4653.51Show/hide
Query:  VDVKKKLVLKFDDILPCLTLG-LSGPTPAGFSVAEIIVGKTTAAEELLQQGSSASP---VSSFSNSSGSKRDRSDGWIGGGEEREAEAAEI--YLERVSS
        VD   +L L F + LP  + G   G  P G   A  +V +    EE +   S + P    SSF    G K          G ER +   +I   +ER +S
Subjt:  VDVKKKLVLKFDDILPCLTLG-LSGPTPAGFSVAEIIVGKTTAAEELLQQGSSASP---VSSFSNSSGSKRDRSDGWIGGGEEREAEAAEI--YLERVSS

Query:  KVG--DQEDEDGSPRKKLRLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQE
        +    D +DE+GS RKKLRL+K+QSA LEDSFKEHS+L PKQK+ LA+QL+LRPRQVEVWFQNRRARTKLKQTE+DCE LK+CCE L EEN RLQKE++E
Subjt:  KVG--DQEDEDGSPRKKLRLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQE

Query:  LKSLKLTAPPFCMQLQAATLTVCPSCER
        L++LK T+ PF MQL A TLT+CPSCER
Subjt:  LKSLKLTAPPFCMQLQAATLTVCPSCER


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGGCGATTGTAGTACCGGCACCGGCCTTCTTCTTGGCCTTGGCCGGAGTGAGCGTAATCCTCTGCGGTCGCTCGTACCCGAAGTGGACGTGAAGAAGAAATTGGT
GTTGAAGTTCGATGACATTTTGCCTTGTTTGACGCTTGGGTTGTCGGGCCCCACCCCGGCTGGATTCTCGGTGGCGGAGATCATCGTCGGGAAGACTACCGCCGCCGAGG
AGTTGCTGCAGCAGGGCTCTTCAGCAAGTCCGGTGTCGTCGTTTTCCAACTCCTCCGGGTCTAAGAGGGACAGATCAGACGGATGGATCGGCGGCGGCGAAGAGAGGGAG
GCGGAGGCGGCGGAGATTTATTTGGAGAGGGTGTCTTCGAAAGTTGGTGATCAGGAAGATGAAGATGGAAGTCCGAGGAAGAAACTGAGACTCACTAAAGAACAATCCGC
CATTTTAGAAGATAGCTTCAAAGAACACTCAAGTCTTACTCCGAAACAAAAGCTGGATTTGGCTAGGCAATTAAGCCTAAGGCCACGACAAGTGGAAGTATGGTTCCAAA
ACAGACGAGCCAGAACCAAGCTGAAGCAAACGGAAATGGACTGCGAACTACTGAAAAAATGCTGCGAAAAGCTCAAAGAAGAGAACACGAGGCTTCAGAAAGAGCTTCAA
GAGCTTAAATCGTTAAAGTTAACGGCTCCGCCATTTTGTATGCAGCTACAAGCGGCCACCCTCACCGTCTGCCCCTCCTGCGAGAGATCCATTTGCGGCGGCGGAGGCGG
CGGCGATGCGTCTCCGACCACCGCCTTCTCCATTGGGTCGAAGCCTACTTTTCTCAAATTCCCCTTTAACCACCCATCCGCGGCTTGT
mRNA sequenceShow/hide mRNA sequence
ATGGACGGCGATTGTAGTACCGGCACCGGCCTTCTTCTTGGCCTTGGCCGGAGTGAGCGTAATCCTCTGCGGTCGCTCGTACCCGAAGTGGACGTGAAGAAGAAATTGGT
GTTGAAGTTCGATGACATTTTGCCTTGTTTGACGCTTGGGTTGTCGGGCCCCACCCCGGCTGGATTCTCGGTGGCGGAGATCATCGTCGGGAAGACTACCGCCGCCGAGG
AGTTGCTGCAGCAGGGCTCTTCAGCAAGTCCGGTGTCGTCGTTTTCCAACTCCTCCGGGTCTAAGAGGGACAGATCAGACGGATGGATCGGCGGCGGCGAAGAGAGGGAG
GCGGAGGCGGCGGAGATTTATTTGGAGAGGGTGTCTTCGAAAGTTGGTGATCAGGAAGATGAAGATGGAAGTCCGAGGAAGAAACTGAGACTCACTAAAGAACAATCCGC
CATTTTAGAAGATAGCTTCAAAGAACACTCAAGTCTTACTCCGAAACAAAAGCTGGATTTGGCTAGGCAATTAAGCCTAAGGCCACGACAAGTGGAAGTATGGTTCCAAA
ACAGACGAGCCAGAACCAAGCTGAAGCAAACGGAAATGGACTGCGAACTACTGAAAAAATGCTGCGAAAAGCTCAAAGAAGAGAACACGAGGCTTCAGAAAGAGCTTCAA
GAGCTTAAATCGTTAAAGTTAACGGCTCCGCCATTTTGTATGCAGCTACAAGCGGCCACCCTCACCGTCTGCCCCTCCTGCGAGAGATCCATTTGCGGCGGCGGAGGCGG
CGGCGATGCGTCTCCGACCACCGCCTTCTCCATTGGGTCGAAGCCTACTTTTCTCAAATTCCCCTTTAACCACCCATCCGCGGCTTGT
Protein sequenceShow/hide protein sequence
MDGDCSTGTGLLLGLGRSERNPLRSLVPEVDVKKKLVLKFDDILPCLTLGLSGPTPAGFSVAEIIVGKTTAAEELLQQGSSASPVSSFSNSSGSKRDRSDGWIGGGEERE
AEAAEIYLERVSSKVGDQEDEDGSPRKKLRLTKEQSAILEDSFKEHSSLTPKQKLDLARQLSLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQ
ELKSLKLTAPPFCMQLQAATLTVCPSCERSICGGGGGGDASPTTAFSIGSKPTFLKFPFNHPSAAC