; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS012599 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS012599
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionhomeobox-leucine zipper protein HAT22-like
Genome locationscaffold63:1938337..1939378
RNA-Seq ExpressionMS012599
SyntenyMS012599
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016020 - membrane (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR001356 - Homeobox domain
IPR003106 - Leucine zipper, homeobox-associated
IPR009057 - Homeobox-like domain superfamily
IPR017970 - Homeobox, conserved site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7031313.1 Homeobox-leucine zipper protein HAT22 [Cucurbita argyrosperma subsp. argyrosperma]3.9e-10378.18Show/hide
Query:  MGFDDLSPTGLVLGLGLSEASPKPAPDN--HNLIKKPAPCSSSLDFDPC-LTLGFSGE---SAYRKIDDHHLYRQASPHSSAV-SSFSGGGGGRRVKRER
        MG DD+S T LVLGLG+SEA+  P  +N  +N  KK     +SL+F+PC LTLGFSG+     +     HHLYRQASPHSSAV SSFSGGGGG  +KRER
Subjt:  MGFDDLSPTGLVLGLGLSEASPKPAPDN--HNLIKKPAPCSSSLDFDPC-LTLGFSGE---SAYRKIDDHHLYRQASPHSSAV-SSFSGGGGGRRVKRER

Query:  DLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLT
        DLSSEEV+LERV  R+SDEDEDG NTRKKLRLS++QSALLEESFKQNSTLNPKQKQ LAR LNLRPRQVEVWFQNRRARTK+KQTEVDCEFLKKCCETLT
Subjt:  DLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLT

Query:  DENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVG----GARDGASKAKFSMAPKPHFYNPFSNPSAAC
        +ENRRLQKELQELKALKLA PLYMHMPAATLTMCPSCERVG    G  DGASK KFSMAPKPHFYNPFSNPSAAC
Subjt:  DENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVG----GARDGASKAKFSMAPKPHFYNPFSNPSAAC

XP_022136472.1 homeobox-leucine zipper protein HAT22-like [Momordica charantia]1.9e-14299.62Show/hide
Query:  MGFDDLSPTGLVLGLGLSEASPKPAPDNHNLIKKPAPCSSSLDFDPCLTLGFSGESAYRKIDDHHLYRQASPHSSAVSSFSGGGGGRRVKRERDLSSEEV
        MGFDDLSPTGLVLGLGLSEASPKPAPD HNLIKKPAPCSSSLDFDPCLTLGFSGESAYRKIDDHHLYRQASPHSSAVSSFSGGGGGRRVKRERDLSSEEV
Subjt:  MGFDDLSPTGLVLGLGLSEASPKPAPDNHNLIKKPAPCSSSLDFDPCLTLGFSGESAYRKIDDHHLYRQASPHSSAVSSFSGGGGGRRVKRERDLSSEEV

Query:  DLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQ
        DLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQ
Subjt:  DLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQ

Query:  KELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDGASKAKFSMAPKPHFYNPFSNPSAAC
        KELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDGASKAKFSMAPKPHFYNPFSNPSAAC
Subjt:  KELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDGASKAKFSMAPKPHFYNPFSNPSAAC

XP_022943239.1 homeobox-leucine zipper protein HAT22-like [Cucurbita moschata]4.3e-10278.1Show/hide
Query:  MGFDDLSPTGLVLGLGLSEASPKPAPDN--HNLIKKPAPCSSSLDFDPC-LTLGFSG--ESAYRKIDDHHLYRQASPHSSAV-SSFSGGGGGRRVKRERD
        MG DD+S T LVLGLG+SEA+  P  +N  +N  K      +SL+F+PC LTLGFSG  +        HHLYRQASPHSSAV SSFSGGGGG  +KRERD
Subjt:  MGFDDLSPTGLVLGLGLSEASPKPAPDN--HNLIKKPAPCSSSLDFDPC-LTLGFSG--ESAYRKIDDHHLYRQASPHSSAV-SSFSGGGGGRRVKRERD

Query:  LSSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTD
        LSSEEV+LERV  R+SDEDEDG NTRKKLRLS++QSALLEESFKQNSTLNPKQKQ LAR LNLRPRQVEVWFQNRRARTK+KQTEVDCEFLKKCCETLT+
Subjt:  LSSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTD

Query:  ENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVG----GARDGASKAKFSMAPKPHFYNPFSNPSAAC
        ENRRLQKELQELKALKLA PLYMHMPAATLTMCPSCERVG    G  DGASK KFSMAPKPHFYNPFSNPSAAC
Subjt:  ENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVG----GARDGASKAKFSMAPKPHFYNPFSNPSAAC

XP_023000993.1 homeobox-leucine zipper protein HAT22-like [Cucurbita maxima]3.3e-10277.45Show/hide
Query:  MGFDDLSPTGLVLGLGLSEASPKPAPDNHNLIKKPAPCSSSLDFDPC-LTLGFSGE-------SAYRKIDDHHLYRQASPHSSAV-SSFSGGGGGRRVKR
        MG DD+S T LVLGLG+SEA+     +  N      P  +SLDF+PC LTLGFSG+         +     HHLYRQASPHSSAV SSFSGGGGG  +KR
Subjt:  MGFDDLSPTGLVLGLGLSEASPKPAPDNHNLIKKPAPCSSSLDFDPC-LTLGFSGE-------SAYRKIDDHHLYRQASPHSSAV-SSFSGGGGGRRVKR

Query:  ERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCET
        ERDLSSEEV+LERV  R+SDEDEDG NTRKKLRLS++QSALLEESFKQNSTLNPKQKQ LAR LNLRPRQVEVWFQNRRARTK+KQTEVDCEFLKKCCET
Subjt:  ERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCET

Query:  LTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVG--GARDGASKAKFSMAPKPHFYNPFSNPSAAC
        LT+ENRRLQKELQELKALKLA PLYMHMPAATLTMCPSCERVG  G  DGASK KFSMAPKPHFYNPFSNPSAAC
Subjt:  LTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVG--GARDGASKAKFSMAPKPHFYNPFSNPSAAC

XP_038903319.1 homeobox-leucine zipper protein HAT22-like [Benincasa hispida]5.6e-10275.69Show/hide
Query:  MGFDDLSPTGLVLGLGLSEASPKPAPDNHNLIKKPAPC-SSSLDFDPC-LTLGFSGES--AYRKIDD----------------HHLYRQ-ASPHSSAV--
        MG DD S TGLVLGLGLS        ++  L KKPAPC SSSLDF+PC LTLGFSG     ++K+ D                HHL+RQ AS HSS+   
Subjt:  MGFDDLSPTGLVLGLGLSEASPKPAPDNHNLIKKPAPC-SSSLDFDPC-LTLGFSGES--AYRKIDD----------------HHLYRQ-ASPHSSAV--

Query:  SSFSGGGGGRRVKRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQ
        SSFSGGG   RVKRERDLSS+EV+L+RVSSR+SDEDEDGSNTRKKLRLS++QSALLE+SFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQ
Subjt:  SSFSGGGGGRRVKRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQ

Query:  TEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARD-GASKAKFSMAPKPHFYNPFSNPSAAC
        TEVDCE LKKCCETLTDENRRLQKELQELKALKLAQP+YM +PAATLT+CPSCERVGG  D GASK KFSMAPKPHFYNPFSNPSAAC
Subjt:  TEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARD-GASKAKFSMAPKPHFYNPFSNPSAAC

TrEMBL top hitse value%identityAlignment
A0A0A0LBR7 Homeobox domain-containing protein5.3e-9870.87Show/hide
Query:  TPHFGLHILHKFLFFHSPSTIF--LSPHMGFDDLSPTGLVLGLGLSEASPKPAPDNHNLIKKPAPC-SSSLDFDPC-LTLGFS--GESAYRKIDD----H
        T H    +   F F   PSTI   LSP MGFDD S TGLVLGLGLSE +         L KKPAPC SSSLDF+PC LTLGFS  G   +RK+ D    H
Subjt:  TPHFGLHILHKFLFFHSPSTIF--LSPHMGFDDLSPTGLVLGLGLSEASPKPAPDNHNLIKKPAPC-SSSLDFDPC-LTLGFS--GESAYRKIDD----H

Query:  HLYRQASPHSSAV-SSFSGGGGGRRVKRERDLSSEEVDLERVSSRISDEDED-GSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQV
        HLYRQASPHSSAV SSFSG     +VKRERDLSSEEV+LER   R+SDED+D  +NTRKKLRLS++QSALLEESFKQNSTLNPKQKQ LARQLNL PRQV
Subjt:  HLYRQASPHSSAV-SSFSGGGGGRRVKRERDLSSEEVDLERVSSRISDEDED-GSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQV

Query:  EVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERV-----GGARDGAS--KAKFSMAPKPHFYN
        EVWFQNRRARTK+KQTEVDCE LKKCCETLTDENRRLQKE+QELKA+KLA+P+YM M  ATLT+CPSCERV     GG  DG S  K KFSM P P FYN
Subjt:  EVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERV-----GGARDGAS--KAKFSMAPKPHFYN

Query:  PFSNPSAAC
        PFSNPSAAC
Subjt:  PFSNPSAAC

A0A6A1UND8 Homeobox-leucine zipper protein HAT224.0e-9875.27Show/hide
Query:  MGFDDLSPTGLVLGLGLSEA--SPKPAPDNHNLIKKPAPCSSSLDFDPCLTLGFSGES--AYRKID-------DHHLYRQASPHSSAVSSFSGGGGGRRV
        M FDD+  TGLVLGLG++ A  +P  AP      KK     ++ +F+P LTLG SGE+    +KID          LYRQ SPH SAVSSFS G    RV
Subjt:  MGFDDLSPTGLVLGLGLSEA--SPKPAPDNHNLIKKPAPCSSSLDFDPCLTLGFSGES--AYRKID-------DHHLYRQASPHSSAVSSFSGGGGGRRV

Query:  KRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCC
        KRERDLSSEEV+ ERVSSRISDEDEDG N RKKLRL++EQSALLEESFKQ+STLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCC
Subjt:  KRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCC

Query:  ETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDGASKAKFSMAPKPHFYNPFSNPSAAC
        ETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCER+GG  + ASK+ FSMAPKPHFYNPF+NPSAAC
Subjt:  ETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDGASKAKFSMAPKPHFYNPFSNPSAAC

A0A6J1C5L0 homeobox-leucine zipper protein HAT22-like9.1e-14399.62Show/hide
Query:  MGFDDLSPTGLVLGLGLSEASPKPAPDNHNLIKKPAPCSSSLDFDPCLTLGFSGESAYRKIDDHHLYRQASPHSSAVSSFSGGGGGRRVKRERDLSSEEV
        MGFDDLSPTGLVLGLGLSEASPKPAPD HNLIKKPAPCSSSLDFDPCLTLGFSGESAYRKIDDHHLYRQASPHSSAVSSFSGGGGGRRVKRERDLSSEEV
Subjt:  MGFDDLSPTGLVLGLGLSEASPKPAPDNHNLIKKPAPCSSSLDFDPCLTLGFSGESAYRKIDDHHLYRQASPHSSAVSSFSGGGGGRRVKRERDLSSEEV

Query:  DLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQ
        DLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQ
Subjt:  DLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQ

Query:  KELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDGASKAKFSMAPKPHFYNPFSNPSAAC
        KELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDGASKAKFSMAPKPHFYNPFSNPSAAC
Subjt:  KELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDGASKAKFSMAPKPHFYNPFSNPSAAC

A0A6J1FR71 homeobox-leucine zipper protein HAT22-like2.1e-10278.1Show/hide
Query:  MGFDDLSPTGLVLGLGLSEASPKPAPDN--HNLIKKPAPCSSSLDFDPC-LTLGFSG--ESAYRKIDDHHLYRQASPHSSAV-SSFSGGGGGRRVKRERD
        MG DD+S T LVLGLG+SEA+  P  +N  +N  K      +SL+F+PC LTLGFSG  +        HHLYRQASPHSSAV SSFSGGGGG  +KRERD
Subjt:  MGFDDLSPTGLVLGLGLSEASPKPAPDN--HNLIKKPAPCSSSLDFDPC-LTLGFSG--ESAYRKIDDHHLYRQASPHSSAV-SSFSGGGGGRRVKRERD

Query:  LSSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTD
        LSSEEV+LERV  R+SDEDEDG NTRKKLRLS++QSALLEESFKQNSTLNPKQKQ LAR LNLRPRQVEVWFQNRRARTK+KQTEVDCEFLKKCCETLT+
Subjt:  LSSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTD

Query:  ENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVG----GARDGASKAKFSMAPKPHFYNPFSNPSAAC
        ENRRLQKELQELKALKLA PLYMHMPAATLTMCPSCERVG    G  DGASK KFSMAPKPHFYNPFSNPSAAC
Subjt:  ENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVG----GARDGASKAKFSMAPKPHFYNPFSNPSAAC

A0A6J1KP81 homeobox-leucine zipper protein HAT22-like1.6e-10277.45Show/hide
Query:  MGFDDLSPTGLVLGLGLSEASPKPAPDNHNLIKKPAPCSSSLDFDPC-LTLGFSGE-------SAYRKIDDHHLYRQASPHSSAV-SSFSGGGGGRRVKR
        MG DD+S T LVLGLG+SEA+     +  N      P  +SLDF+PC LTLGFSG+         +     HHLYRQASPHSSAV SSFSGGGGG  +KR
Subjt:  MGFDDLSPTGLVLGLGLSEASPKPAPDNHNLIKKPAPCSSSLDFDPC-LTLGFSGE-------SAYRKIDDHHLYRQASPHSSAV-SSFSGGGGGRRVKR

Query:  ERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCET
        ERDLSSEEV+LERV  R+SDEDEDG NTRKKLRLS++QSALLEESFKQNSTLNPKQKQ LAR LNLRPRQVEVWFQNRRARTK+KQTEVDCEFLKKCCET
Subjt:  ERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCET

Query:  LTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVG--GARDGASKAKFSMAPKPHFYNPFSNPSAAC
        LT+ENRRLQKELQELKALKLA PLYMHMPAATLTMCPSCERVG  G  DGASK KFSMAPKPHFYNPFSNPSAAC
Subjt:  LTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVG--GARDGASKAKFSMAPKPHFYNPFSNPSAAC

SwissProt top hitse value%identityAlignment
A2XE76 Homeobox-leucine zipper protein HOX191.3e-5850.99Show/hide
Query:  LSPTGLVLGLGL---SEASPKPAPDNHNLIKKPAPCSSSLDFDPCLTLGFSGESAYRKIDDHHLYRQASPHSSAVSSFSGGGG--------------GRR
        LS  GL LGL L      +   A  +    ++P+P S     +P LTL    ++A            A   ++A ++ SGGGG                 
Subjt:  LSPTGLVLGLGL---SEASPKPAPDNHNLIKKPAPCSSSLDFDPCLTLGFSGESAYRKIDDHHLYRQASPHSSAVSSFSGGGG--------------GRR

Query:  VKRERDLSSEEVDLERVSSRIS--DEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK
        VKRER   +EE D ERVSS  +  D+D+DGS TRKKLRL++EQSALLE+ F+++STLNPKQK ALA+QLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK
Subjt:  VKRERDLSSEEVDLERVSSRIS--DEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK

Query:  KCCETLTDENRRLQKELQELKALKLA----------------QPLYMHMPAATLTMCPSCERVGG--------ARDGASKAKFSMAPKPHFYNPFSNPSA
        +CCETLT+ENRRLQ+ELQEL+ALK A                 P YM +PAATLT+CPSCERVGG        A DG +KA        HF+NPF++ SA
Subjt:  KCCETLTDENRRLQKELQELKALKLA----------------QPLYMHMPAATLTMCPSCERVGG--------ARDGASKAKFSMAPKPHFYNPFSNPSA

Query:  AC
        AC
Subjt:  AC

A2YW03 Homeobox-leucine zipper protein HOX275.0e-5365.36Show/hide
Query:  HSSAVSSFSGGGGGRRVKRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRAR
        H+ A +   GGGGG                ER SSR SD+DE G++ RKKLRLS+EQSA LEESFK++STLNPKQK ALA+QLNLRPRQVEVWFQNRRAR
Subjt:  HSSAVSSFSGGGGGRRVKRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRAR

Query:  TKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDGASKAKFSMAPKP
        TKLKQTEVDCE+LK+CCETLT+ENRRL KEL EL+ALK A+P YMH+PA TL+MCPSCERV      AS +  + A  P
Subjt:  TKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDGASKAKFSMAPKP

P46603 Homeobox-leucine zipper protein HAT91.1e-7660.49Show/hide
Query:  MGFDDLSPTGLVLGLGLSEASPKPAPDNHNLIKKPAPCSSSLDFDPCLTLGFSGESAYRKIDD-HHLYRQASPHSSAVSSFSGGGGGRRVKRERDLSSEE
        MGFDD   TGLVLGLG     P P P+N+N   +    SS    +P LTL  SG+ +   +     L RQ S H S VSSFS    GR VKRERD   E 
Subjt:  MGFDDLSPTGLVLGLGLSEASPKPAPDNHNLIKKPAPCSSSLDFDPCLTLGFSGESAYRKIDD-HHLYRQASPHSSAVSSFSGGGGGRRVKRERDLSSEE

Query:  VDLERVSSR-ISD--EDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDEN
         + E ++ R ISD  EDE+G + RKKLRL+++QSALLEESFK +STLNPKQKQ LARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETL DEN
Subjt:  VDLERVSSR-ISD--EDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDEN

Query:  RRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDG------------------ASKAKFSMAPKPHFYNPFSNPSAAC
         RLQKE+QELK LKL QP YMHMPA+TLT CPSCER+GG   G                   +K  FS++ KPHF+NPF+NPSAAC
Subjt:  RRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDG------------------ASKAKFSMAPKPHFYNPFSNPSAAC

P46604 Homeobox-leucine zipper protein HAT228.1e-8062.89Show/hide
Query:  MGFDDLSPTGLVLGLGLSEASPKPAPDNHN-LIKKPAPCSSSLD-----FDPCLTLGFSGESAYRKID---DHHLYRQASPHSSAVSSFSGGGGGRRVKR
        MG DD   TGLVLGLGLS     P P+N+N  IKK    SS++D      DP LTL  SGES   K        + RQ S H S +SSFS G    RVKR
Subjt:  MGFDDLSPTGLVLGLGLSEASPKPAPDNHN-LIKKPAPCSSSLD-----FDPCLTLGFSGESAYRKID---DHHLYRQASPHSSAVSSFSGGGGGRRVKR

Query:  ERDLS-------SEEVDLERVSSRISD--EDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDC
        ER++S       +EE     V SR+SD  +DE+G + RKKLRL+++QSALLE++FK +STLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDC
Subjt:  ERDLS-------SEEVDLERVSSRISD--EDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDC

Query:  EFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDG---------ASKAKFSMAPKPHFYNPFSNPSAAC
        EFLKKCCETLTDENRRLQKELQ+LKALKL+QP YMHMPAATLTMCPSCER+GG   G          +K  FS+  KP FYNPF+NPSAAC
Subjt:  EFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDG---------ASKAKFSMAPKPHFYNPFSNPSAAC

Q8GRL4 Homeobox-leucine zipper protein HOX191.3e-5850.99Show/hide
Query:  LSPTGLVLGLGL---SEASPKPAPDNHNLIKKPAPCSSSLDFDPCLTLGFSGESAYRKIDDHHLYRQASPHSSAVSSFSGGGG--------------GRR
        LS  GL LGL L      +   A  +    ++P+P S     +P LTL    ++A            A   ++A ++ SGGGG                 
Subjt:  LSPTGLVLGLGL---SEASPKPAPDNHNLIKKPAPCSSSLDFDPCLTLGFSGESAYRKIDDHHLYRQASPHSSAVSSFSGGGG--------------GRR

Query:  VKRERDLSSEEVDLERVSSRIS--DEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK
        VKRER   +EE D ERVSS  +  D+D+DGS TRKKLRL++EQSALLE+ F+++STLNPKQK ALA+QLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK
Subjt:  VKRERDLSSEEVDLERVSSRIS--DEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK

Query:  KCCETLTDENRRLQKELQELKALKLA----------------QPLYMHMPAATLTMCPSCERVGG--------ARDGASKAKFSMAPKPHFYNPFSNPSA
        +CCETLT+ENRRLQ+ELQEL+ALK A                 P YM +PAATLT+CPSCERVGG        A DG +KA        HF+NPF++ SA
Subjt:  KCCETLTDENRRLQKELQELKALKLA----------------QPLYMHMPAATLTMCPSCERVGG--------ARDGASKAKFSMAPKPHFYNPFSNPSA

Query:  AC
        AC
Subjt:  AC

Arabidopsis top hitse value%identityAlignment
AT2G22800.1 Homeobox-leucine zipper protein family7.8e-7860.49Show/hide
Query:  MGFDDLSPTGLVLGLGLSEASPKPAPDNHNLIKKPAPCSSSLDFDPCLTLGFSGESAYRKIDD-HHLYRQASPHSSAVSSFSGGGGGRRVKRERDLSSEE
        MGFDD   TGLVLGLG     P P P+N+N   +    SS    +P LTL  SG+ +   +     L RQ S H S VSSFS    GR VKRERD   E 
Subjt:  MGFDDLSPTGLVLGLGLSEASPKPAPDNHNLIKKPAPCSSSLDFDPCLTLGFSGESAYRKIDD-HHLYRQASPHSSAVSSFSGGGGGRRVKRERDLSSEE

Query:  VDLERVSSR-ISD--EDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDEN
         + E ++ R ISD  EDE+G + RKKLRL+++QSALLEESFK +STLNPKQKQ LARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETL DEN
Subjt:  VDLERVSSR-ISD--EDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDEN

Query:  RRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDG------------------ASKAKFSMAPKPHFYNPFSNPSAAC
         RLQKE+QELK LKL QP YMHMPA+TLT CPSCER+GG   G                   +K  FS++ KPHF+NPF+NPSAAC
Subjt:  RRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDG------------------ASKAKFSMAPKPHFYNPFSNPSAAC

AT3G60390.1 homeobox-leucine zipper protein 32.9e-4859.38Show/hide
Query:  ASPHSSAVSSFSGGGGGRRVKRERDLSS----------EEVDLERVSSRI--SDEDEDGS-----NTRKKLRLSREQSALLEESFKQNSTLNPKQKQALA
        +SP+S+  S  SG       K ER+L +          E+ ++ER S  +    +DEDGS     ++RKKLRLS+EQ+ +LEE+FK++STLNPKQK ALA
Subjt:  ASPHSSAVSSFSGGGGGRRVKRERDLSS----------EEVDLERVSSRI--SDEDEDGS-----NTRKKLRLSREQSALLEESFKQNSTLNPKQKQALA

Query:  RQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHM-PAATLTMCPSCERVGGARDGASKA
        +QLNLR RQVEVWFQNRRARTKLKQTEVDCE+LK+CCE LTDENRRLQKE+ EL+ALKL+  LYMHM P  TLTMCPSCERV      +S A
Subjt:  RQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHM-PAATLTMCPSCERVGGARDGASKA

AT4G16780.1 homeobox protein 25.8e-4965.45Show/hide
Query:  ASPHSSAVSSFSGGGGGRRVKRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNR
        +SP+S+  SS      G+R +RE D   +        SR   +DEDG N+RKKLRLS++QSA+LEE+FK +STLNPKQKQALA+QL LR RQVEVWFQNR
Subjt:  ASPHSSAVSSFSGGGGGRRVKRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNR

Query:  RARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHM-PAATLTMCPSCERV
        RARTKLKQTEVDCEFL++CCE LT+ENRRLQKE+ EL+ALKL+   YMHM P  TLTMCPSCE V
Subjt:  RARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHM-PAATLTMCPSCERV

AT4G37790.1 Homeobox-leucine zipper protein family5.8e-8162.89Show/hide
Query:  MGFDDLSPTGLVLGLGLSEASPKPAPDNHN-LIKKPAPCSSSLD-----FDPCLTLGFSGESAYRKID---DHHLYRQASPHSSAVSSFSGGGGGRRVKR
        MG DD   TGLVLGLGLS     P P+N+N  IKK    SS++D      DP LTL  SGES   K        + RQ S H S +SSFS G    RVKR
Subjt:  MGFDDLSPTGLVLGLGLSEASPKPAPDNHN-LIKKPAPCSSSLD-----FDPCLTLGFSGESAYRKID---DHHLYRQASPHSSAVSSFSGGGGGRRVKR

Query:  ERDLS-------SEEVDLERVSSRISD--EDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDC
        ER++S       +EE     V SR+SD  +DE+G + RKKLRL+++QSALLE++FK +STLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDC
Subjt:  ERDLS-------SEEVDLERVSSRISD--EDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDC

Query:  EFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDG---------ASKAKFSMAPKPHFYNPFSNPSAAC
        EFLKKCCETLTDENRRLQKELQ+LKALKL+QP YMHMPAATLTMCPSCER+GG   G          +K  FS+  KP FYNPF+NPSAAC
Subjt:  EFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDG---------ASKAKFSMAPKPHFYNPFSNPSAAC

AT5G06710.1 homeobox from Arabidopsis thaliana7.3e-5266.47Show/hide
Query:  SPHSSAVSSFSGGGGGRRVKRERDLSSEEVD--LERVSSRISDEDEDGSN--TRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWF
        SP  S  SSF    G +    ER  +  ++D  +ER +SR S+ED D  N  TRKKLRLS++QSA LE+SFK++STLNPKQK ALA+QLNLRPRQVEVWF
Subjt:  SPHSSAVSSFSGGGGGRRVKRERDLSSEEVD--LERVSSRISDEDEDGSN--TRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWF

Query:  QNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERV
        QNRRARTKLKQTEVDCE+LK+CCE+LT+ENRRLQKE++EL+ LK + P YM +PA TLTMCPSCERV
Subjt:  QNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ACCCCTCATTTTGGTTTACATATTCTTCACAAGTTCCTCTTTTTCCACTCCCCATCCACAATTTTCCTCTCGCCCCATATGGGTTTTGATGATCTTTCTCCCACCGGCTT
GGTCTTGGGGTTGGGGCTCTCCGAGGCCTCTCCCAAGCCAGCGCCTGATAATCACAACCTCATCAAAAAGCCAGCACCTTGCTCCAGCTCCCTTGATTTTGACCCATGTT
TGACTTTGGGGTTCTCCGGCGAGAGTGCTTACCGGAAGATTGACGATCATCATTTGTATCGCCAAGCTTCCCCTCATAGCAGCGCTGTTTCTTCCTTCTCCGGCGGCGGC
GGCGGCCGTAGGGTCAAGAGGGAGAGGGATCTTAGCAGTGAAGAAGTTGACTTGGAGAGAGTTTCATCGAGAATCAGCGATGAAGATGAAGATGGTTCCAATACTAGGAA
GAAACTTAGGCTCTCTAGAGAACAATCCGCTCTCTTGGAAGAAAGCTTCAAACAAAACAGCACTCTCAACCCTAAACAAAAACAAGCCTTGGCCAGACAGCTAAATCTGC
GGCCACGACAAGTCGAAGTATGGTTTCAAAACAGGAGAGCCCGAACGAAATTGAAACAAACGGAAGTAGATTGTGAATTCTTGAAGAAGTGCTGCGAGACACTGACGGAT
GAAAATAGAAGACTGCAAAAGGAGCTTCAAGAACTCAAGGCGCTAAAGCTCGCCCAGCCTCTTTACATGCACATGCCCGCGGCGACGTTGACGATGTGCCCGTCGTGCGA
AAGGGTCGGTGGCGCTCGCGACGGGGCTTCCAAAGCCAAATTTTCTATGGCTCCCAAGCCTCACTTTTACAACCCCTTCTCTAATCCTTCCGCGGCTTGT
mRNA sequenceShow/hide mRNA sequence
ACCCCTCATTTTGGTTTACATATTCTTCACAAGTTCCTCTTTTTCCACTCCCCATCCACAATTTTCCTCTCGCCCCATATGGGTTTTGATGATCTTTCTCCCACCGGCTT
GGTCTTGGGGTTGGGGCTCTCCGAGGCCTCTCCCAAGCCAGCGCCTGATAATCACAACCTCATCAAAAAGCCAGCACCTTGCTCCAGCTCCCTTGATTTTGACCCATGTT
TGACTTTGGGGTTCTCCGGCGAGAGTGCTTACCGGAAGATTGACGATCATCATTTGTATCGCCAAGCTTCCCCTCATAGCAGCGCTGTTTCTTCCTTCTCCGGCGGCGGC
GGCGGCCGTAGGGTCAAGAGGGAGAGGGATCTTAGCAGTGAAGAAGTTGACTTGGAGAGAGTTTCATCGAGAATCAGCGATGAAGATGAAGATGGTTCCAATACTAGGAA
GAAACTTAGGCTCTCTAGAGAACAATCCGCTCTCTTGGAAGAAAGCTTCAAACAAAACAGCACTCTCAACCCTAAACAAAAACAAGCCTTGGCCAGACAGCTAAATCTGC
GGCCACGACAAGTCGAAGTATGGTTTCAAAACAGGAGAGCCCGAACGAAATTGAAACAAACGGAAGTAGATTGTGAATTCTTGAAGAAGTGCTGCGAGACACTGACGGAT
GAAAATAGAAGACTGCAAAAGGAGCTTCAAGAACTCAAGGCGCTAAAGCTCGCCCAGCCTCTTTACATGCACATGCCCGCGGCGACGTTGACGATGTGCCCGTCGTGCGA
AAGGGTCGGTGGCGCTCGCGACGGGGCTTCCAAAGCCAAATTTTCTATGGCTCCCAAGCCTCACTTTTACAACCCCTTCTCTAATCCTTCCGCGGCTTGT
Protein sequenceShow/hide protein sequence
TPHFGLHILHKFLFFHSPSTIFLSPHMGFDDLSPTGLVLGLGLSEASPKPAPDNHNLIKKPAPCSSSLDFDPCLTLGFSGESAYRKIDDHHLYRQASPHSSAVSSFSGGG
GGRRVKRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSREQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTD
ENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGARDGASKAKFSMAPKPHFYNPFSNPSAAC