; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC10g0525 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC10g0525
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionhomeobox-leucine zipper protein HAT22-like
Genome locationMC10:4355527..4357170
RNA-Seq ExpressionMC10g0525
SyntenyMC10g0525
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR001356 - Homeobox domain
IPR003106 - Leucine zipper, homeobox-associated
IPR009057 - Homeobox-like domain superfamily
IPR017970 - Homeobox, conserved site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600675.1 Homeobox-leucine zipper protein HAT22, partial [Cucurbita argyrosperma subsp. sororia]3.78e-13076.09Show/hide
Query:  MGFDDLSPTGLVLGLGLSEASPKPAPDQHNLIKKPAPCSS----SLDFDPC-LTLGFSGE---SAYRKIDDHHLYRQASPHSSAVSSFSGGGGGRRVKRE
        MG DD+S T LVLGLG+SEA+   +P  +NL  K    ++    SL+F+PC LTLGFSG+     +     HHLYRQASPHSSAV S   GGGG  +KRE
Subjt:  MGFDDLSPTGLVLGLGLSEASPKPAPDQHNLIKKPAPCSS----SLDFDPC-LTLGFSGE---SAYRKIDDHHLYRQASPHSSAVSSFSGGGGGRRVKRE

Query:  RDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETL
        RDLSSEEV+LERV  R+SDEDEDG NTRKKLRLS++QSALLEESFKQNSTLNPKQKQ LAR LNLRPRQVEVWFQNRRARTK+KQTEVDCEFLKKCCETL
Subjt:  RDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETL

Query:  TDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGAR----DGASKAKFSMAPKPHFYNPFSNPSAAC
        T+ENRRLQKELQELKALKLA PLYMHMPAATLTMCPSCERVGG      DGASK KFSMAPKPHFYNPFSNPSAAC
Subjt:  TDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGAR----DGASKAKFSMAPKPHFYNPFSNPSAAC

KAG7031313.1 Homeobox-leucine zipper protein HAT22 [Cucurbita argyrosperma subsp. argyrosperma]1.61e-13178.47Show/hide
Query:  MGFDDLSPTGLVLGLGLSEA-SPKPAPDQHNLIKKPAPCSSSLDFDPC-LTLGFSGE---SAYRKIDDHHLYRQASPHSSAV-SSFSGGGGGRRVKRERD
        MG DD+S T LVLGLG+SEA SP     ++N  KK     +SL+F+PC LTLGFSG+     +     HHLYRQASPHSSAV SSFSGGGGG  +KRERD
Subjt:  MGFDDLSPTGLVLGLGLSEA-SPKPAPDQHNLIKKPAPCSSSLDFDPC-LTLGFSGE---SAYRKIDDHHLYRQASPHSSAV-SSFSGGGGGRRVKRERD

Query:  LSSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTD
        LSSEEV+LERV  R+SDEDEDG NTRKKLRLS++QSALLEESFKQNSTLNPKQKQ LAR LNLRPRQVEVWFQNRRARTK+KQTEVDCEFLKKCCETLT+
Subjt:  LSSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTD

Query:  ENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGAR----DGASKAKFSMAPKPHFYNPFSNPSAAC
        ENRRLQKELQELKALKLA PLYMHMPAATLTMCPSCERVGG      DGASK KFSMAPKPHFYNPFSNPSAAC
Subjt:  ENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGAR----DGASKAKFSMAPKPHFYNPFSNPSAAC

XP_022136472.1 homeobox-leucine zipper protein HAT22-like [Momordica charantia]3.24e-184100Show/hide
Query:  MGFDDLSPTGLVLGLGLSEASPKPAPDQHNLIKKPAPCSSSLDFDPCLTLGFSGESAYRKIDDHHLYRQASPHSSAVSSFSGGGGGRRVKRERDLSSEEV
        MGFDDLSPTGLVLGLGLSEASPKPAPDQHNLIKKPAPCSSSLDFDPCLTLGFSGESAYRKIDDHHLYRQASPHSSAVSSFSGGGGGRRVKRERDLSSEEV
Subjt:  MGFDDLSPTGLVLGLGLSEASPKPAPDQHNLIKKPAPCSSSLDFDPCLTLGFSGESAYRKIDDHHLYRQASPHSSAVSSFSGGGGGRRVKRERDLSSEEV

Query:  DLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQ
        DLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQ
Subjt:  DLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQ

Query:  KELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDGASKAKFSMAPKPHFYNPFSNPSAAC
        KELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDGASKAKFSMAPKPHFYNPFSNPSAAC
Subjt:  KELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDGASKAKFSMAPKPHFYNPFSNPSAAC

XP_022943239.1 homeobox-leucine zipper protein HAT22-like [Cucurbita moschata]3.65e-13078.39Show/hide
Query:  MGFDDLSPTGLVLGLGLSEA-SPKPAPDQHNLIKKPAPCSSSLDFDPC-LTLGFSGE--SAYRKIDDHHLYRQASPHSSAV-SSFSGGGGGRRVKRERDL
        MG DD+S T LVLGLG+SEA SP     ++N  K      +SL+F+PC LTLGFSG+          HHLYRQASPHSSAV SSFSGGGGG  +KRERDL
Subjt:  MGFDDLSPTGLVLGLGLSEA-SPKPAPDQHNLIKKPAPCSSSLDFDPC-LTLGFSGE--SAYRKIDDHHLYRQASPHSSAV-SSFSGGGGGRRVKRERDL

Query:  SSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDE
        SSEEV+LERV  R+SDEDEDG NTRKKLRLS++QSALLEESFKQNSTLNPKQKQ LAR LNLRPRQVEVWFQNRRARTK+KQTEVDCEFLKKCCETLT+E
Subjt:  SSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDE

Query:  NRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGAR----DGASKAKFSMAPKPHFYNPFSNPSAAC
        NRRLQKELQELKALKLA PLYMHMPAATLTMCPSCERVGG      DGASK KFSMAPKPHFYNPFSNPSAAC
Subjt:  NRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGAR----DGASKAKFSMAPKPHFYNPFSNPSAAC

XP_023000993.1 homeobox-leucine zipper protein HAT22-like [Cucurbita maxima]1.81e-13077.45Show/hide
Query:  MGFDDLSPTGLVLGLGLSEASPKPAPDQHNLIKKPAPCSSSLDFDPC-LTLGFSGE-------SAYRKIDDHHLYRQASPHSSAV-SSFSGGGGGRRVKR
        MG DD+S T LVLGLG+SEA+     +  N      P  +SLDF+PC LTLGFSG+         +     HHLYRQASPHSSAV SSFSGGGGG  +KR
Subjt:  MGFDDLSPTGLVLGLGLSEASPKPAPDQHNLIKKPAPCSSSLDFDPC-LTLGFSGE-------SAYRKIDDHHLYRQASPHSSAV-SSFSGGGGGRRVKR

Query:  ERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCET
        ERDLSSEEV+LERV  R+SDEDEDG NTRKKLRLS++QSALLEESFKQNSTLNPKQKQ LAR LNLRPRQVEVWFQNRRARTK+KQTEVDCEFLKKCCET
Subjt:  ERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCET

Query:  LTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGAR--DGASKAKFSMAPKPHFYNPFSNPSAAC
        LT+ENRRLQKELQELKALKLA PLYMHMPAATLTMCPSCERVGG    DGASK KFSMAPKPHFYNPFSNPSAAC
Subjt:  LTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGAR--DGASKAKFSMAPKPHFYNPFSNPSAAC

TrEMBL top hitse value%identityAlignment
A0A2N9J7A3 Homeobox domain-containing protein8.06e-12573.33Show/hide
Query:  MGFDDLSPTGLVLGLGLSEA------SPKPAPDQHNLIKKPAPCS------SSLDFDPCLTLGFSGESAY--RKIDDHH-------LYRQASPHSSAVSS
        M F+D+  TGLVLGLGL+ A      +P  AP +        PCS      SS  F+P LTLG  GES    +KID +        LYRQASP +SAVSS
Subjt:  MGFDDLSPTGLVLGLGLSEA------SPKPAPDQHNLIKKPAPCS------SSLDFDPCLTLGFSGESAY--RKIDDHH-------LYRQASPHSSAVSS

Query:  FSGGGGGRRVKRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTE
        FS G    RVKRERDLSSEEV+ ERVSSR+SDEDEDG N RKKLRL++EQSALLEESFKQ+STLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTE
Subjt:  FSGGGGGRRVKRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTE

Query:  VDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDGASKAKFSMAPKPHFYNPFSNPSAAC
        VDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCER+GG  + ASK+ FSMAPKPHFYNPF+NPSAAC
Subjt:  VDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDGASKAKFSMAPKPHFYNPFSNPSAAC

A0A6A1UND8 Homeobox-leucine zipper protein HAT224.00e-12575.27Show/hide
Query:  MGFDDLSPTGLVLGLGLSEA--SPKPAPDQHNLIKKPAPCSSSLDFDPCLTLGFSGESAY--RKIDDHH-------LYRQASPHSSAVSSFSGGGGGRRV
        M FDD+  TGLVLGLG++ A  +P  AP      KK     ++ +F+P LTLG SGE+    +KID +        LYRQ SPHS AVSSFS G    RV
Subjt:  MGFDDLSPTGLVLGLGLSEA--SPKPAPDQHNLIKKPAPCSSSLDFDPCLTLGFSGESAY--RKIDDHH-------LYRQASPHSSAVSSFSGGGGGRRV

Query:  KRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCC
        KRERDLSSEEV+ ERVSSRISDEDEDG N RKKLRL++EQSALLEESFKQ+STLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCC
Subjt:  KRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCC

Query:  ETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDGASKAKFSMAPKPHFYNPFSNPSAAC
        ETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCER+GG  + ASK+ FSMAPKPHFYNPF+NPSAAC
Subjt:  ETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDGASKAKFSMAPKPHFYNPFSNPSAAC

A0A6J1C5L0 homeobox-leucine zipper protein HAT22-like1.57e-184100Show/hide
Query:  MGFDDLSPTGLVLGLGLSEASPKPAPDQHNLIKKPAPCSSSLDFDPCLTLGFSGESAYRKIDDHHLYRQASPHSSAVSSFSGGGGGRRVKRERDLSSEEV
        MGFDDLSPTGLVLGLGLSEASPKPAPDQHNLIKKPAPCSSSLDFDPCLTLGFSGESAYRKIDDHHLYRQASPHSSAVSSFSGGGGGRRVKRERDLSSEEV
Subjt:  MGFDDLSPTGLVLGLGLSEASPKPAPDQHNLIKKPAPCSSSLDFDPCLTLGFSGESAYRKIDDHHLYRQASPHSSAVSSFSGGGGGRRVKRERDLSSEEV

Query:  DLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQ
        DLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQ
Subjt:  DLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQ

Query:  KELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDGASKAKFSMAPKPHFYNPFSNPSAAC
        KELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDGASKAKFSMAPKPHFYNPFSNPSAAC
Subjt:  KELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDGASKAKFSMAPKPHFYNPFSNPSAAC

A0A6J1FR71 homeobox-leucine zipper protein HAT22-like1.77e-13078.39Show/hide
Query:  MGFDDLSPTGLVLGLGLSEA-SPKPAPDQHNLIKKPAPCSSSLDFDPC-LTLGFSGE--SAYRKIDDHHLYRQASPHSSAV-SSFSGGGGGRRVKRERDL
        MG DD+S T LVLGLG+SEA SP     ++N  K      +SL+F+PC LTLGFSG+          HHLYRQASPHSSAV SSFSGGGGG  +KRERDL
Subjt:  MGFDDLSPTGLVLGLGLSEA-SPKPAPDQHNLIKKPAPCSSSLDFDPC-LTLGFSGE--SAYRKIDDHHLYRQASPHSSAV-SSFSGGGGGRRVKRERDL

Query:  SSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDE
        SSEEV+LERV  R+SDEDEDG NTRKKLRLS++QSALLEESFKQNSTLNPKQKQ LAR LNLRPRQVEVWFQNRRARTK+KQTEVDCEFLKKCCETLT+E
Subjt:  SSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDE

Query:  NRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGAR----DGASKAKFSMAPKPHFYNPFSNPSAAC
        NRRLQKELQELKALKLA PLYMHMPAATLTMCPSCERVGG      DGASK KFSMAPKPHFYNPFSNPSAAC
Subjt:  NRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGAR----DGASKAKFSMAPKPHFYNPFSNPSAAC

A0A6J1KP81 homeobox-leucine zipper protein HAT22-like8.76e-13177.45Show/hide
Query:  MGFDDLSPTGLVLGLGLSEASPKPAPDQHNLIKKPAPCSSSLDFDPC-LTLGFSGE-------SAYRKIDDHHLYRQASPHSSAV-SSFSGGGGGRRVKR
        MG DD+S T LVLGLG+SEA+     +  N      P  +SLDF+PC LTLGFSG+         +     HHLYRQASPHSSAV SSFSGGGGG  +KR
Subjt:  MGFDDLSPTGLVLGLGLSEASPKPAPDQHNLIKKPAPCSSSLDFDPC-LTLGFSGE-------SAYRKIDDHHLYRQASPHSSAV-SSFSGGGGGRRVKR

Query:  ERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCET
        ERDLSSEEV+LERV  R+SDEDEDG NTRKKLRLS++QSALLEESFKQNSTLNPKQKQ LAR LNLRPRQVEVWFQNRRARTK+KQTEVDCEFLKKCCET
Subjt:  ERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCET

Query:  LTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGAR--DGASKAKFSMAPKPHFYNPFSNPSAAC
        LT+ENRRLQKELQELKALKLA PLYMHMPAATLTMCPSCERVGG    DGASK KFSMAPKPHFYNPFSNPSAAC
Subjt:  LTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGAR--DGASKAKFSMAPKPHFYNPFSNPSAAC

SwissProt top hitse value%identityAlignment
A2XE76 Homeobox-leucine zipper protein HOX191.6e-5850.99Show/hide
Query:  LSPTGLVLGLGL---SEASPKPAPDQHNLIKKPAPCSSSLDFDPCLTLGFSGESAYRKIDDHHLYRQASPHSSAVSSFSGGGG--------------GRR
        LS  GL LGL L      +   A       ++P+P S     +P LTL    ++A            A   ++A ++ SGGGG                 
Subjt:  LSPTGLVLGLGL---SEASPKPAPDQHNLIKKPAPCSSSLDFDPCLTLGFSGESAYRKIDDHHLYRQASPHSSAVSSFSGGGG--------------GRR

Query:  VKRERDLSSEEVDLERVSSRIS--DEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK
        VKRER   +EE D ERVSS  +  D+D+DGS TRKKLRL++EQSALLE+ F+++STLNPKQK ALA+QLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK
Subjt:  VKRERDLSSEEVDLERVSSRIS--DEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK

Query:  KCCETLTDENRRLQKELQELKALKLA----------------QPLYMHMPAATLTMCPSCERVGG--------ARDGASKAKFSMAPKPHFYNPFSNPSA
        +CCETLT+ENRRLQ+ELQEL+ALK A                 P YM +PAATLT+CPSCERVGG        A DG +KA        HF+NPF++ SA
Subjt:  KCCETLTDENRRLQKELQELKALKLA----------------QPLYMHMPAATLTMCPSCERVGG--------ARDGASKAKFSMAPKPHFYNPFSNPSA

Query:  AC
        AC
Subjt:  AC

A2YW03 Homeobox-leucine zipper protein HOX274.5e-5365.36Show/hide
Query:  HSSAVSSFSGGGGGRRVKRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRAR
        H+ A +   GGGGG                ER SSR SD+DE G++ RKKLRLS+EQSA LEESFK++STLNPKQK ALA+QLNLRPRQVEVWFQNRRAR
Subjt:  HSSAVSSFSGGGGGRRVKRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRAR

Query:  TKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDGASKAKFSMAPKP
        TKLKQTEVDCE+LK+CCETLT+ENRRL KEL EL+ALK A+P YMH+PA TL+MCPSCERV      AS +  + A  P
Subjt:  TKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDGASKAKFSMAPKP

P46603 Homeobox-leucine zipper protein HAT93.8e-7660.14Show/hide
Query:  MGFDDLSPTGLVLGLGLSEASPKPAPDQHNLIKKPAPCSSSLDFDPCLTLGFSGESAYRKIDD-HHLYRQASPHSSAVSSFSGGGGGRRVKRERDLSSEE
        MGFDD   TGLVLGLG     P P P+ +N   +    SS    +P LTL  SG+ +   +     L RQ S H S VSSFS    GR VKRERD   E 
Subjt:  MGFDDLSPTGLVLGLGLSEASPKPAPDQHNLIKKPAPCSSSLDFDPCLTLGFSGESAYRKIDD-HHLYRQASPHSSAVSSFSGGGGGRRVKRERDLSSEE

Query:  VDLERVSSR-ISD--EDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDEN
         + E ++ R ISD  EDE+G + RKKLRL+++QSALLEESFK +STLNPKQKQ LARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETL DEN
Subjt:  VDLERVSSR-ISD--EDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDEN

Query:  RRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDG------------------ASKAKFSMAPKPHFYNPFSNPSAAC
         RLQKE+QELK LKL QP YMHMPA+TLT CPSCER+GG   G                   +K  FS++ KPHF+NPF+NPSAAC
Subjt:  RRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDG------------------ASKAKFSMAPKPHFYNPFSNPSAAC

P46604 Homeobox-leucine zipper protein HAT223.7e-7962.54Show/hide
Query:  MGFDDLSPTGLVLGLGLSEASPKPAPDQHN-LIKKPAPCSSSLD-----FDPCLTLGFSGESAYRKID---DHHLYRQASPHSSAVSSFSGGGGGRRVKR
        MG DD   TGLVLGLGLS     P P+ +N  IKK    SS++D      DP LTL  SGES   K        + RQ S H S +SSFS G    RVKR
Subjt:  MGFDDLSPTGLVLGLGLSEASPKPAPDQHN-LIKKPAPCSSSLD-----FDPCLTLGFSGESAYRKID---DHHLYRQASPHSSAVSSFSGGGGGRRVKR

Query:  ERDLS-------SEEVDLERVSSRISD--EDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDC
        ER++S       +EE     V SR+SD  +DE+G + RKKLRL+++QSALLE++FK +STLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDC
Subjt:  ERDLS-------SEEVDLERVSSRISD--EDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDC

Query:  EFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDG---------ASKAKFSMAPKPHFYNPFSNPSAAC
        EFLKKCCETLTDENRRLQKELQ+LKALKL+QP YMHMPAATLTMCPSCER+GG   G          +K  FS+  KP FYNPF+NPSAAC
Subjt:  EFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDG---------ASKAKFSMAPKPHFYNPFSNPSAAC

Q8GRL4 Homeobox-leucine zipper protein HOX191.6e-5850.99Show/hide
Query:  LSPTGLVLGLGL---SEASPKPAPDQHNLIKKPAPCSSSLDFDPCLTLGFSGESAYRKIDDHHLYRQASPHSSAVSSFSGGGG--------------GRR
        LS  GL LGL L      +   A       ++P+P S     +P LTL    ++A            A   ++A ++ SGGGG                 
Subjt:  LSPTGLVLGLGL---SEASPKPAPDQHNLIKKPAPCSSSLDFDPCLTLGFSGESAYRKIDDHHLYRQASPHSSAVSSFSGGGG--------------GRR

Query:  VKRERDLSSEEVDLERVSSRIS--DEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK
        VKRER   +EE D ERVSS  +  D+D+DGS TRKKLRL++EQSALLE+ F+++STLNPKQK ALA+QLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK
Subjt:  VKRERDLSSEEVDLERVSSRIS--DEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK

Query:  KCCETLTDENRRLQKELQELKALKLA----------------QPLYMHMPAATLTMCPSCERVGG--------ARDGASKAKFSMAPKPHFYNPFSNPSA
        +CCETLT+ENRRLQ+ELQEL+ALK A                 P YM +PAATLT+CPSCERVGG        A DG +KA        HF+NPF++ SA
Subjt:  KCCETLTDENRRLQKELQELKALKLA----------------QPLYMHMPAATLTMCPSCERVGG--------ARDGASKAKFSMAPKPHFYNPFSNPSA

Query:  AC
        AC
Subjt:  AC

Arabidopsis top hitse value%identityAlignment
AT2G22800.1 Homeobox-leucine zipper protein family2.7e-7760.14Show/hide
Query:  MGFDDLSPTGLVLGLGLSEASPKPAPDQHNLIKKPAPCSSSLDFDPCLTLGFSGESAYRKIDD-HHLYRQASPHSSAVSSFSGGGGGRRVKRERDLSSEE
        MGFDD   TGLVLGLG     P P P+ +N   +    SS    +P LTL  SG+ +   +     L RQ S H S VSSFS    GR VKRERD   E 
Subjt:  MGFDDLSPTGLVLGLGLSEASPKPAPDQHNLIKKPAPCSSSLDFDPCLTLGFSGESAYRKIDD-HHLYRQASPHSSAVSSFSGGGGGRRVKRERDLSSEE

Query:  VDLERVSSR-ISD--EDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDEN
         + E ++ R ISD  EDE+G + RKKLRL+++QSALLEESFK +STLNPKQKQ LARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETL DEN
Subjt:  VDLERVSSR-ISD--EDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDEN

Query:  RRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDG------------------ASKAKFSMAPKPHFYNPFSNPSAAC
         RLQKE+QELK LKL QP YMHMPA+TLT CPSCER+GG   G                   +K  FS++ KPHF+NPF+NPSAAC
Subjt:  RRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDG------------------ASKAKFSMAPKPHFYNPFSNPSAAC

AT3G60390.1 homeobox-leucine zipper protein 32.6e-4859.38Show/hide
Query:  ASPHSSAVSSFSGGGGGRRVKRERDLSS----------EEVDLERVSSRI--SDEDEDGS-----NTRKKLRLSREQSALLEESFKQNSTLNPKQKQALA
        +SP+S+  S  SG       K ER+L +          E+ ++ER S  +    +DEDGS     ++RKKLRLS+EQ+ +LEE+FK++STLNPKQK ALA
Subjt:  ASPHSSAVSSFSGGGGGRRVKRERDLSS----------EEVDLERVSSRI--SDEDEDGS-----NTRKKLRLSREQSALLEESFKQNSTLNPKQKQALA

Query:  RQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHM-PAATLTMCPSCERVGGARDGASKA
        +QLNLR RQVEVWFQNRRARTKLKQTEVDCE+LK+CCE LTDENRRLQKE+ EL+ALKL+  LYMHM P  TLTMCPSCERV      +S A
Subjt:  RQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHM-PAATLTMCPSCERVGGARDGASKA

AT4G16780.1 homeobox protein 25.3e-4965.45Show/hide
Query:  ASPHSSAVSSFSGGGGGRRVKRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNR
        +SP+S+  SS      G+R +RE D   +        SR   +DEDG N+RKKLRLS++QSA+LEE+FK +STLNPKQKQALA+QL LR RQVEVWFQNR
Subjt:  ASPHSSAVSSFSGGGGGRRVKRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNR

Query:  RARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHM-PAATLTMCPSCERV
        RARTKLKQTEVDCEFL++CCE LT+ENRRLQKE+ EL+ALKL+   YMHM P  TLTMCPSCE V
Subjt:  RARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHM-PAATLTMCPSCERV

AT4G37790.1 Homeobox-leucine zipper protein family2.6e-8062.54Show/hide
Query:  MGFDDLSPTGLVLGLGLSEASPKPAPDQHN-LIKKPAPCSSSLD-----FDPCLTLGFSGESAYRKID---DHHLYRQASPHSSAVSSFSGGGGGRRVKR
        MG DD   TGLVLGLGLS     P P+ +N  IKK    SS++D      DP LTL  SGES   K        + RQ S H S +SSFS G    RVKR
Subjt:  MGFDDLSPTGLVLGLGLSEASPKPAPDQHN-LIKKPAPCSSSLD-----FDPCLTLGFSGESAYRKID---DHHLYRQASPHSSAVSSFSGGGGGRRVKR

Query:  ERDLS-------SEEVDLERVSSRISD--EDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDC
        ER++S       +EE     V SR+SD  +DE+G + RKKLRL+++QSALLE++FK +STLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDC
Subjt:  ERDLS-------SEEVDLERVSSRISD--EDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDC

Query:  EFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDG---------ASKAKFSMAPKPHFYNPFSNPSAAC
        EFLKKCCETLTDENRRLQKELQ+LKALKL+QP YMHMPAATLTMCPSCER+GG   G          +K  FS+  KP FYNPF+NPSAAC
Subjt:  EFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDG---------ASKAKFSMAPKPHFYNPFSNPSAAC

AT5G06710.1 homeobox from Arabidopsis thaliana6.7e-5266.47Show/hide
Query:  SPHSSAVSSFSGGGGGRRVKRERDLSSEEVD--LERVSSRISDEDEDGSN--TRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWF
        SP  S  SSF    G +    ER  +  ++D  +ER +SR S+ED D  N  TRKKLRLS++QSA LE+SFK++STLNPKQK ALA+QLNLRPRQVEVWF
Subjt:  SPHSSAVSSFSGGGGGRRVKRERDLSSEEVD--LERVSSRISDEDEDGSN--TRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWF

Query:  QNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERV
        QNRRARTKLKQTEVDCE+LK+CCE+LT+ENRRLQKE++EL+ LK + P YM +PA TLTMCPSCERV
Subjt:  QNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTTTGATGATCTTTCTCCCACCGGCTTGGTCTTGGGGTTGGGGCTCTCCGAGGCCTCTCCCAAGCCAGCGCCTGATCAACACAACCTCATCAAAAAGCCAGCACC
TTGCTCCAGCTCCCTTGATTTTGACCCATGTTTGACTTTGGGGTTCTCCGGCGAGAGTGCTTACCGGAAGATTGACGATCATCATTTGTATCGCCAAGCTTCCCCTCATA
GCAGCGCTGTTTCTTCCTTCTCCGGCGGCGGCGGCGGCCGTAGGGTCAAGAGGGAGAGGGATCTTAGCAGTGAAGAAGTTGACTTGGAGAGAGTTTCATCGAGAATCAGC
GATGAAGATGAAGATGGTTCCAATACTAGGAAGAAACTTAGGCTCTCTAGAGAACAATCCGCTCTCTTGGAAGAAAGCTTCAAACAAAACAGCACTCTCAACCCTAAACA
AAAACAAGCCTTGGCCAGACAGCTAAATCTGCGGCCACGACAAGTCGAAGTATGGTTTCAAAACAGGAGAGCCCGAACGAAATTGAAACAAACGGAAGTAGATTGTGAAT
TCTTGAAGAAGTGCTGCGAGACACTGACGGATGAAAATAGAAGACTGCAAAAGGAGCTTCAAGAACTCAAGGCGCTAAAGCTCGCCCAGCCTCTTTACATGCACATGCCC
GCGGCGACGTTGACGATGTGCCCGTCGTGCGAAAGGGTCGGTGGCGCTCGCGACGGGGCTTCCAAAGCCAAATTTTCTATGGCTCCCAAGCCTCACTTTTACAACCCCTT
CTCTAATCCTTCCGCGGCTTGTTAG
mRNA sequenceShow/hide mRNA sequence
CCCACCCACCCACACCCACACCCACACCCATATCTCTCTCTCTTCTTCTCTTCTTAAACCCCTCATTTTGGTTTTCATATTCTTCACAAGTTCCTCTTTTTCCACTCCCC
ATCCACAATTTTCCTCTCGCCCCATATGGGTTTTGATGATCTTTCTCCCACCGGCTTGGTCTTGGGGTTGGGGCTCTCCGAGGCCTCTCCCAAGCCAGCGCCTGATCAAC
ACAACCTCATCAAAAAGCCAGCACCTTGCTCCAGCTCCCTTGATTTTGACCCATGTTTGACTTTGGGGTTCTCCGGCGAGAGTGCTTACCGGAAGATTGACGATCATCAT
TTGTATCGCCAAGCTTCCCCTCATAGCAGCGCTGTTTCTTCCTTCTCCGGCGGCGGCGGCGGCCGTAGGGTCAAGAGGGAGAGGGATCTTAGCAGTGAAGAAGTTGACTT
GGAGAGAGTTTCATCGAGAATCAGCGATGAAGATGAAGATGGTTCCAATACTAGGAAGAAACTTAGGCTCTCTAGAGAACAATCCGCTCTCTTGGAAGAAAGCTTCAAAC
AAAACAGCACTCTCAACCCTAAACAAAAACAAGCCTTGGCCAGACAGCTAAATCTGCGGCCACGACAAGTCGAAGTATGGTTTCAAAACAGGAGAGCCCGAACGAAATTG
AAACAAACGGAAGTAGATTGTGAATTCTTGAAGAAGTGCTGCGAGACACTGACGGATGAAAATAGAAGACTGCAAAAGGAGCTTCAAGAACTCAAGGCGCTAAAGCTCGC
CCAGCCTCTTTACATGCACATGCCCGCGGCGACGTTGACGATGTGCCCGTCGTGCGAAAGGGTCGGTGGCGCTCGCGACGGGGCTTCCAAAGCCAAATTTTCTATGGCTC
CCAAGCCTCACTTTTACAACCCCTTCTCTAATCCTTCCGCGGCTTGTTAGATTAATAGTCTAGGCCAATACAAGCTAGACAAATACACCCACATAAAAACTACAAGAATA
TATAGTTTTTTTTTTCTTCTTCTTCAAAAGGCCGTAGGTAGAGTAGAAATTAGAGAAATTTAGTGTAATTAACATGTAGAAAATTTCTATTGATCACTGAAGATCAAATT
ATCAGTTCTTTTTTTTTGGGGATGATGAGATATGTGTAAAGGTTTAAAATTCTATCTAGAAATGAAGAAATTGATGGGTACGTATATTATTTTAAGAGAGTATGATGATG
ATGAGATTTGTAGAGAAGAAATTTGGTTTTATGGAATTTTTGGGGTGTAAATTTTTTGATGTGACGCAATGTGGATGGTGTGTTGGATTGGAAAATTATTGGGTGGTGTT
GGTGTAAGGCTTTCTATCAAGCCCATTGGCTGCCTACGTGTTACGCAAGGGATGACATTGATATGACTTTTGCTATTTTGTAGTGCGTTTTGCCCTCATTGGTCAATACA
ATTTTTAATTTATCTATCTCCCTTTAGTTTTTAATATCATG
Protein sequenceShow/hide protein sequence
MGFDDLSPTGLVLGLGLSEASPKPAPDQHNLIKKPAPCSSSLDFDPCLTLGFSGESAYRKIDDHHLYRQASPHSSAVSSFSGGGGGRRVKRERDLSSEEVDLERVSSRIS
DEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMP
AATLTMCPSCERVGGARDGASKAKFSMAPKPHFYNPFSNPSAAC