CuGenDBv2

Tan0000956 (gene) of Snake gourd v1 genome

Gene ID: Tan0000956
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Homeobox-leucine zipper protein family
Genome location: LG01:14509664..14511700
RNA-Seq Expression: Tan0000956
Synteny: Tan0000956
Gene Ontology terms:
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR001356 - Homeobox domain
IPR003106 - Leucine zipper, homeobox-associated
IPR009057 - Homeobox-like domain superfamily
IPR017970 - Homeobox, conserved site
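
The genome location string in this record can be split into its parts programmatically. The following is a minimal sketch, not part of the database itself: the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(location):
    """Split a string like 'LG01:14509664..14511700' into (sequence_id, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the genomic span, assuming 1-based inclusive coordinates."""
    return end - start + 1


seq_id, start, end = parse_location("LG01:14509664..14511700")
print(seq_id, start, end, span_length(start, end))
```

Under that assumption the gene spans 2037 bp on LG01.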


Homology
GenBank top hits (each hit lists e value, % identity, and alignment)
KAG6600675.1 Homeobox-leucine zipper protein HAT22, partial [Cucurbita argyrosperma subsp. sororia] (e value: 4.8e-108; % identity: 81.14)
Query:  MGFDDLSQTGLVLGLGLSEAANSPIINKLKKKHAPCSS---TSLDHFEPCALTLGFSGDSHRKVVDV---GGVDHHHLYRQASPHSSAVCSSFS-GGGSR
        MG DD+SQT LVLGLG+SEAA SPIIN LK K    ++   TSL+ FEPCALTLGFSGD     VD        HHHLYRQASPHSSAVCSSFS GGGS 
Subjt:  MGFDDLSQTGLVLGLGLSEAANSPIINKLKKKHAPCSS---TSLDHFEPCALTLGFSGDSHRKVVDV---GGVDHHHLYRQASPHSSAVCSSFS-GGGSR

Query:  VKRERDLSSEEVELERVSSRVSDEDEDGSNNRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCELLKKC
        +KRERDLSSEEVELERV  RVSDEDEDG N RKKLRLSK+QSALLEESFKQNSTLNPKQKQ LAR LNLRPRQVEVWFQNRRARTK+KQTEVDCE LKKC
Subjt:  VKRERDLSSEEVELERVSSRVSDEDEDGSNNRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCELLKKC

Query:  CETLTEENRRLQKEVQELKALKLAKPPVYMHMPAATLTMCPSCERVG----GVGDGASKAKFSMAPKPHFYNPFSNPSAAC
        CETLT ENRRLQKE+QELKALKLA  P+YMHMPAATLTMCPSCERVG    GVGDGASK KFSMAPKPHFYNPFSNPSAAC
Subjt:  CETLTEENRRLQKEVQELKALKLAKPPVYMHMPAATLTMCPSCERVG----GVGDGASKAKFSMAPKPHFYNPFSNPSAAC

XP_022136472.1 homeobox-leucine zipper protein HAT22-like [Momordica charantia] (e value: 1.0e-110; % identity: 84.36)
Query:  MGFDDLSQTGLVLGLGLSEAANSPIINK--LKKKHAPCSSTSLDHFEPCALTLGFSGDS-HRKVVDVGGVDHHHLYRQASPHSSAVCSSFSGGGS--RVK
        MGFDDLS TGLVLGLGLSEA+  P  ++  L KK APCSS SLD F+PC LTLGFSG+S +RK      +D HHLYRQASPHSSAV SSFSGGG   RVK
Subjt:  MGFDDLSQTGLVLGLGLSEAANSPIINK--LKKKHAPCSSTSLDHFEPCALTLGFSGDS-HRKVVDVGGVDHHHLYRQASPHSSAVCSSFSGGGS--RVK

Query:  RERDLSSEEVELERVSSRVSDEDEDGSNNRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCELLKKCCE
        RERDLSSEEV+LERVSSR+SDEDEDGSN RKKLRLS+EQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCE LKKCCE
Subjt:  RERDLSSEEVELERVSSRVSDEDEDGSNNRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCELLKKCCE

Query:  TLTEENRRLQKEVQELKALKLAKPPVYMHMPAATLTMCPSCERVGGVGDGASKAKFSMAPKPHFYNPFSNPSAAC
        TLT+ENRRLQKE+QELKALKLA+ P+YMHMPAATLTMCPSCERVGG  DGASKAKFSMAPKPHFYNPFSNPSAAC
Subjt:  TLTEENRRLQKEVQELKALKLAKPPVYMHMPAATLTMCPSCERVGGVGDGASKAKFSMAPKPHFYNPFSNPSAAC

XP_022943239.1 homeobox-leucine zipper protein HAT22-like [Cucurbita moschata] (e value: 2.8e-108; % identity: 81.43)
Query:  MGFDDLSQTGLVLGLGLSEAANSPIINKL---KKKHAPCSSTSLDHFEPCALTLGFSGDSHRKVVD--VGGVDHHHLYRQASPHSSAVCSSFS-GGGSRV
        MG DD+SQT LVLGLG+SEAA SPIIN L   KKK+   + TSL+ FEPCALTLGFSGD     VD       HHHLYRQASPHSSAVCSSFS GGG  +
Subjt:  MGFDDLSQTGLVLGLGLSEAANSPIINKL---KKKHAPCSSTSLDHFEPCALTLGFSGDSHRKVVD--VGGVDHHHLYRQASPHSSAVCSSFS-GGGSRV

Query:  KRERDLSSEEVELERVSSRVSDEDEDGSNNRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCELLKKCC
        KRERDLSSEEVELERV  RVSDEDEDG N RKKLRLSK+QSALLEESFKQNSTLNPKQKQ LAR LNLRPRQVEVWFQNRRARTK+KQTEVDCE LKKCC
Subjt:  KRERDLSSEEVELERVSSRVSDEDEDGSNNRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCELLKKCC

Query:  ETLTEENRRLQKEVQELKALKLAKPPVYMHMPAATLTMCPSCERVG----GVGDGASKAKFSMAPKPHFYNPFSNPSAAC
        ETLT ENRRLQKE+QELKALKLA  P+YMHMPAATLTMCPSCERVG    GVGDGASK KFSMAPKPHFYNPFSNPSAAC
Subjt:  ETLTEENRRLQKEVQELKALKLAKPPVYMHMPAATLTMCPSCERVG----GVGDGASKAKFSMAPKPHFYNPFSNPSAAC

XP_023000993.1 homeobox-leucine zipper protein HAT22-like [Cucurbita maxima] (e value: 4.8e-108; % identity: 79.79)
Query:  MGFDDLSQTGLVLGLGLSEAANSPIINKLKKKHAPCSSTSLDHFEPCALTLGFSGD---------SHRKVVDVGGVDHHHLYRQASPHSSAVCSSFS-GG
        MG DD+SQT LVLGLG+SEAA SPIIN  K  +     TSLD FEPCALTLGFSGD          H          HHHLYRQASPHSSAVCSSFS GG
Subjt:  MGFDDLSQTGLVLGLGLSEAANSPIINKLKKKHAPCSSTSLDHFEPCALTLGFSGD---------SHRKVVDVGGVDHHHLYRQASPHSSAVCSSFS-GG

Query:  GSRVKRERDLSSEEVELERVSSRVSDEDEDGSNNRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCELL
        G  +KRERDLSSEEVELERV  RVSDEDEDG N RKKLRLSK+QSALLEESFKQNSTLNPKQKQ LAR LNLRPRQVEVWFQNRRARTK+KQTEVDCE L
Subjt:  GSRVKRERDLSSEEVELERVSSRVSDEDEDGSNNRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCELL

Query:  KKCCETLTEENRRLQKEVQELKALKLAKPPVYMHMPAATLTMCPSCERVG--GVGDGASKAKFSMAPKPHFYNPFSNPSAAC
        KKCCETLT ENRRLQKE+QELKALKLA  P+YMHMPAATLTMCPSCERVG  GVGDGASK KFSMAPKPHFYNPFSNPSAAC
Subjt:  KKCCETLTEENRRLQKEVQELKALKLAKPPVYMHMPAATLTMCPSCERVG--GVGDGASKAKFSMAPKPHFYNPFSNPSAAC

XP_038903319.1 homeobox-leucine zipper protein HAT22-like [Benincasa hispida] (e value: 4.8e-108; % identity: 80.07)
Query:  MGFDDLSQTGLVLGLGLSEAANSPIINKLKKKHAPCSSTSLDHFEPCALTLGFSGD---SHRKVVDVGG----------VDHHHLYRQ-ASPH-SSAVCS
        MG DD SQTGLVLGLGLS         +LKKK APCSS+SLD FEPCALTLGFSG    +H+KV+DV            V  HHL+RQ AS H SSAVCS
Subjt:  MGFDDLSQTGLVLGLGLSEAANSPIINKLKKKHAPCSSTSLDHFEPCALTLGFSGD---SHRKVVDVGG----------VDHHHLYRQ-ASPH-SSAVCS

Query:  SFSGGGSRVKRERDLSSEEVELERVSSRVSDEDEDGSNNRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEV
        SFSGGG RVKRERDLSS+EVEL+RVSSRVSDEDEDGSN RKKLRLSK+QSALLE+SFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEV
Subjt:  SFSGGGSRVKRERDLSSEEVELERVSSRVSDEDEDGSNNRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEV

Query:  DCELLKKCCETLTEENRRLQKEVQELKALKLAKPPVYMHMPAATLTMCPSCERVGGVGD-GASKAKFSMAPKPHFYNPFSNPSAAC
        DCELLKKCCETLT+ENRRLQKE+QELKALKLA+ P+YM +PAATLT+CPSCERVGG GD GASK KFSMAPKPHFYNPFSNPSAAC
Subjt:  DCELLKKCCETLTEENRRLQKEVQELKALKLAKPPVYMHMPAATLTMCPSCERVGGVGD-GASKAKFSMAPKPHFYNPFSNPSAAC

TrEMBL top hits (each hit lists e value, % identity, and alignment)
A0A1S3BNL6 homeobox-leucine zipper protein HAT22-like (e value: 1.3e-103; % identity: 79.36)
Query:  MGFDDLSQTGLVLGLGLSEAANSPIINKLKKKHAPCSSTSLDHFEPCALTLGFS---GDSHRKVVDVGGVDHHHLYRQASPHSSAVCSSFSGGGSRVKRE
        MGFDD S+TGLVLGLGLSE A+      LKKK APCSS+SLD FEPC LTLGFS   GD HRKV+D  GV   HLYRQ SPHSSAVCSSFSG   +VKRE
Subjt:  MGFDDLSQTGLVLGLGLSEAANSPIINKLKKKHAPCSSTSLDHFEPCALTLGFS---GDSHRKVVDVGGVDHHHLYRQASPHSSAVCSSFSGGGSRVKRE

Query:  RDLSSEEVELERVSSRVSDEDEDGSNN-RKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCELLKKCCET
        RDLSSEEVELERV  RVSDED+DG NN RKKLRLSK+QSALLEESFKQNSTLNPKQKQ LARQLNL PRQVEVWFQNRRARTK+KQTEVDCELLKKCCET
Subjt:  RDLSSEEVELERVSSRVSDEDEDGSNN-RKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCELLKKCCET

Query:  LTEENRRLQKEVQELKALKLAKPPVYMHMPAATLTMCPSCERV-----GGVGDG--ASKAKFSMAPKPHFYNPFSNPSAAC
        LT+ENRRLQKEVQELKA+KLAK PVYM M  ATLT+CPSCERV     GGV DG   SK KFSM P P FYNPFSNPSAAC
Subjt:  LTEENRRLQKEVQELKALKLAKPPVYMHMPAATLTMCPSCERV-----GGVGDG--ASKAKFSMAPKPHFYNPFSNPSAAC

A0A5D3DEZ1 Homeobox-leucine zipper protein HAT22-like (e value: 3.0e-103; % identity: 79)
Query:  MGFDDLSQTGLVLGLGLSEAANSPIINKLKKKHAPCSSTSLDHFEPCALTLGFS---GDSHRKVVDVGGVDHHHLYRQASPHSSAVCSSFSGGGSRVKRE
        MGFDD S+TGLVLGLGLSE A+      LKKK APCSS+SLD FEPC LTLGFS   GD HRKV+D  GV   HLYRQ SPHSSAVCSSFSG   +VKRE
Subjt:  MGFDDLSQTGLVLGLGLSEAANSPIINKLKKKHAPCSSTSLDHFEPCALTLGFS---GDSHRKVVDVGGVDHHHLYRQASPHSSAVCSSFSGGGSRVKRE

Query:  RDLSSEEVELERVSSRVSDEDEDGSNN-RKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCELLKKCCET
        RDLSSEEVELERV  RV+DED+DG NN RKKLRLSK+QSALLEESFKQNSTLNPKQKQ LARQLNL PRQVEVWFQNRRARTK+KQTEVDCELLKKCCET
Subjt:  RDLSSEEVELERVSSRVSDEDEDGSNN-RKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCELLKKCCET

Query:  LTEENRRLQKEVQELKALKLAKPPVYMHMPAATLTMCPSCERV-----GGVGDG--ASKAKFSMAPKPHFYNPFSNPSAAC
        LT+ENRRLQKEVQELKA+KLAK PVYM M  ATLT+CPSCERV     GGV DG   SK KFSM P P FYNPFSNPSAAC
Subjt:  LTEENRRLQKEVQELKALKLAKPPVYMHMPAATLTMCPSCERV-----GGVGDG--ASKAKFSMAPKPHFYNPFSNPSAAC

A0A6J1C5L0 homeobox-leucine zipper protein HAT22-like (e value: 5.0e-111; % identity: 84.36)
Query:  MGFDDLSQTGLVLGLGLSEAANSPIINK--LKKKHAPCSSTSLDHFEPCALTLGFSGDS-HRKVVDVGGVDHHHLYRQASPHSSAVCSSFSGGGS--RVK
        MGFDDLS TGLVLGLGLSEA+  P  ++  L KK APCSS SLD F+PC LTLGFSG+S +RK      +D HHLYRQASPHSSAV SSFSGGG   RVK
Subjt:  MGFDDLSQTGLVLGLGLSEAANSPIINK--LKKKHAPCSSTSLDHFEPCALTLGFSGDS-HRKVVDVGGVDHHHLYRQASPHSSAVCSSFSGGGS--RVK

Query:  RERDLSSEEVELERVSSRVSDEDEDGSNNRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCELLKKCCE
        RERDLSSEEV+LERVSSR+SDEDEDGSN RKKLRLS+EQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCE LKKCCE
Subjt:  RERDLSSEEVELERVSSRVSDEDEDGSNNRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCELLKKCCE

Query:  TLTEENRRLQKEVQELKALKLAKPPVYMHMPAATLTMCPSCERVGGVGDGASKAKFSMAPKPHFYNPFSNPSAAC
        TLT+ENRRLQKE+QELKALKLA+ P+YMHMPAATLTMCPSCERVGG  DGASKAKFSMAPKPHFYNPFSNPSAAC
Subjt:  TLTEENRRLQKEVQELKALKLAKPPVYMHMPAATLTMCPSCERVGGVGDGASKAKFSMAPKPHFYNPFSNPSAAC

A0A6J1FR71 homeobox-leucine zipper protein HAT22-like (e value: 1.4e-108; % identity: 81.43)
Query:  MGFDDLSQTGLVLGLGLSEAANSPIINKL---KKKHAPCSSTSLDHFEPCALTLGFSGDSHRKVVD--VGGVDHHHLYRQASPHSSAVCSSFS-GGGSRV
        MG DD+SQT LVLGLG+SEAA SPIIN L   KKK+   + TSL+ FEPCALTLGFSGD     VD       HHHLYRQASPHSSAVCSSFS GGG  +
Subjt:  MGFDDLSQTGLVLGLGLSEAANSPIINKL---KKKHAPCSSTSLDHFEPCALTLGFSGDSHRKVVD--VGGVDHHHLYRQASPHSSAVCSSFS-GGGSRV

Query:  KRERDLSSEEVELERVSSRVSDEDEDGSNNRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCELLKKCC
        KRERDLSSEEVELERV  RVSDEDEDG N RKKLRLSK+QSALLEESFKQNSTLNPKQKQ LAR LNLRPRQVEVWFQNRRARTK+KQTEVDCE LKKCC
Subjt:  KRERDLSSEEVELERVSSRVSDEDEDGSNNRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCELLKKCC

Query:  ETLTEENRRLQKEVQELKALKLAKPPVYMHMPAATLTMCPSCERVG----GVGDGASKAKFSMAPKPHFYNPFSNPSAAC
        ETLT ENRRLQKE+QELKALKLA  P+YMHMPAATLTMCPSCERVG    GVGDGASK KFSMAPKPHFYNPFSNPSAAC
Subjt:  ETLTEENRRLQKEVQELKALKLAKPPVYMHMPAATLTMCPSCERVG----GVGDGASKAKFSMAPKPHFYNPFSNPSAAC

A0A6J1KP81 homeobox-leucine zipper protein HAT22-like (e value: 2.3e-108; % identity: 79.79)
Query:  MGFDDLSQTGLVLGLGLSEAANSPIINKLKKKHAPCSSTSLDHFEPCALTLGFSGD---------SHRKVVDVGGVDHHHLYRQASPHSSAVCSSFS-GG
        MG DD+SQT LVLGLG+SEAA SPIIN  K  +     TSLD FEPCALTLGFSGD          H          HHHLYRQASPHSSAVCSSFS GG
Subjt:  MGFDDLSQTGLVLGLGLSEAANSPIINKLKKKHAPCSSTSLDHFEPCALTLGFSGD---------SHRKVVDVGGVDHHHLYRQASPHSSAVCSSFS-GG

Query:  GSRVKRERDLSSEEVELERVSSRVSDEDEDGSNNRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCELL
        G  +KRERDLSSEEVELERV  RVSDEDEDG N RKKLRLSK+QSALLEESFKQNSTLNPKQKQ LAR LNLRPRQVEVWFQNRRARTK+KQTEVDCE L
Subjt:  GSRVKRERDLSSEEVELERVSSRVSDEDEDGSNNRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCELL

Query:  KKCCETLTEENRRLQKEVQELKALKLAKPPVYMHMPAATLTMCPSCERVG--GVGDGASKAKFSMAPKPHFYNPFSNPSAAC
        KKCCETLT ENRRLQKE+QELKALKLA  P+YMHMPAATLTMCPSCERVG  GVGDGASK KFSMAPKPHFYNPFSNPSAAC
Subjt:  KKCCETLTEENRRLQKEVQELKALKLAKPPVYMHMPAATLTMCPSCERVG--GVGDGASKAKFSMAPKPHFYNPFSNPSAAC

SwissProt top hits (each hit lists e value, % identity, and alignment)
A2XE76 Homeobox-leucine zipper protein HOX19 (e value: 1.9e-54; % identity: 50)
Query:  LSQTGLVLGLGLSEAANSPIINKLKKK---HAPCSSTSLDHFEPCALTLGFSGDSHRKVVDVGGVDHHHLYRQASPHSSAVCSSFSGGGSRVKRERDLSS
        LS  GL LGL L              +     P  S+     EP +LTL    D+                     HS +  S  +   + VKRER   +
Subjt:  LSQTGLVLGLGLSEAANSPIINKLKKK---HAPCSSTSLDHFEPCALTLGFSGDSHRKVVDVGGVDHHHLYRQASPHSSAVCSSFSGGGSRVKRERDLSS

Query:  EEVELERVSSRVS--DEDEDGSNNRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCELLKKCCETLTEE
        EE + ERVSS  +  D+D+DGS  RKKLRL+KEQSALLE+ F+++STLNPKQK ALA+QLNLRPRQVEVWFQNRRARTKLKQTEVDCE LK+CCETLTEE
Subjt:  EEVELERVSSRVS--DEDEDGSNNRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCELLKKCCETLTEE

Query:  NRRLQKEVQELKALKLAKP---------------PVYMHMPAATLTMCPSCERVGGVGDGA-------SKAKFSMAPKPHFYNPFSNPSAAC
        NRRLQ+E+QEL+ALK A P               P YM +PAATLT+CPSCERVGG    A       +KA        HF+NPF++ SAAC
Subjt:  NRRLQKEVQELKALKLAKP---------------PVYMHMPAATLTMCPSCERVGGVGDGA-------SKAKFSMAPKPHFYNPFSNPSAAC

A2YW03 Homeobox-leucine zipper protein HOX27 (e value: 8.2e-50; % identity: 64.8)
Query:  HSSAVCSSFSGGGSRVKRERDLSSEEVELERVSSRVSDEDEDGSNNRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRART
        H+ A      GGG                ER SSR SD+DE G++ RKKLRLSKEQSA LEESFK++STLNPKQK ALA+QLNLRPRQVEVWFQNRRART
Subjt:  HSSAVCSSFSGGGSRVKRERDLSSEEVELERVSSRVSDEDEDGSNNRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRART

Query:  KLKQTEVDCELLKKCCETLTEENRRLQKEVQELKALKLAKPPVYMHMPAATLTMCPSCERVGGVGDGASKAKFSMAPKP
        KLKQTEVDCE LK+CCETLTEENRRL KE+ EL+ALK A+ P YMH+PA TL+MCPSCERV      AS +  + A  P
Subjt:  KLKQTEVDCELLKKCCETLTEENRRLQKEVQELKALKLAKPPVYMHMPAATLTMCPSCERVGGVGDGASKAKFSMAPKP

P46603 Homeobox-leucine zipper protein HAT9 (e value: 7.6e-72; % identity: 60.14)
Query:  MGFDDLSQTGLVLGLGLSEAANSPIINKLKKKHAPCSSTSLDHFEPCALTLGFSGDSHRKVVDVGGVDHHHLYRQASPHSSAVCSSFSGGGSRVKRERDL
        MGFDD   TGLVLGLG      SPI N     ++    +S+   EP +LTL  SGD    V  V G D   L RQ S HS    SSFS  G  VKRERD 
Subjt:  MGFDDLSQTGLVLGLGLSEAANSPIINKLKKKHAPCSSTSLDHFEPCALTLGFSGDSHRKVVDVGGVDHHHLYRQASPHSSAVCSSFSGGGSRVKRERDL

Query:  SSEEVELERVSSRV-SD--EDEDGSNNRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCELLKKCCETL
          E  E E ++ RV SD  EDE+G + RKKLRL+K+QSALLEESFK +STLNPKQKQ LARQLNLRPRQVEVWFQNRRARTKLKQTEVDCE LKKCCETL
Subjt:  SSEEVELERVSSRV-SD--EDEDGSNNRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCELLKKCCETL

Query:  TEENRRLQKEVQELKALKLAKPPVYMHMPAATLTMCPSCERVGGVGDG------------------ASKAKFSMAPKPHFYNPFSNPSAAC
         +EN RLQKE+QELK LKL + P YMHMPA+TLT CPSCER+GG G G                   +K  FS++ KPHF+NPF+NPSAAC
Subjt:  TEENRRLQKEVQELKALKLAKPPVYMHMPAATLTMCPSCERVGGVGDG------------------ASKAKFSMAPKPHFYNPFSNPSAAC

P46604 Homeobox-leucine zipper protein HAT22 (e value: 3.0e-76; % identity: 61.3)
Query:  MGFDDLSQTGLVLGLGLSEAANSPIINKLKKKHAPCSSTSLDH----FEPCALTLGFSGDSHRKVVDVGGVDHHHLYRQASPHSSAVCSSFSGGGSRVKR
        MG DD   TGLVLGLGLS   N+   N   KK    SS+++DH     +P +LTL  SG+S++     G  D   + RQ S HS    SSFS G  RVKR
Subjt:  MGFDDLSQTGLVLGLGLSEAANSPIINKLKKKHAPCSSTSLDH----FEPCALTLGFSGDSHRKVVDVGGVDHHHLYRQASPHSSAVCSSFSGGGSRVKR

Query:  ERDLS-------SEEVELERVSSRVSD--EDEDGSNNRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDC
        ER++S       +EE     V SRVSD  +DE+G + RKKLRL+K+QSALLE++FK +STLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDC
Subjt:  ERDLS-------SEEVELERVSSRVSD--EDEDGSNNRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDC

Query:  ELLKKCCETLTEENRRLQKEVQELKALKLAKPPVYMHMPAATLTMCPSCERVGGVGDG---------ASKAKFSMAPKPHFYNPFSNPSAAC
        E LKKCCETLT+ENRRLQKE+Q+LKALKL++ P YMHMPAATLTMCPSCER+GG G G          +K  FS+  KP FYNPF+NPSAAC
Subjt:  ELLKKCCETLTEENRRLQKEVQELKALKLAKPPVYMHMPAATLTMCPSCERVGGVGDG---------ASKAKFSMAPKPHFYNPFSNPSAAC

Q8GRL4 Homeobox-leucine zipper protein HOX19 (e value: 1.9e-54; % identity: 50)
Query:  LSQTGLVLGLGLSEAANSPIINKLKKK---HAPCSSTSLDHFEPCALTLGFSGDSHRKVVDVGGVDHHHLYRQASPHSSAVCSSFSGGGSRVKRERDLSS
        LS  GL LGL L              +     P  S+     EP +LTL    D+                     HS +  S  +   + VKRER   +
Subjt:  LSQTGLVLGLGLSEAANSPIINKLKKK---HAPCSSTSLDHFEPCALTLGFSGDSHRKVVDVGGVDHHHLYRQASPHSSAVCSSFSGGGSRVKRERDLSS

Query:  EEVELERVSSRVS--DEDEDGSNNRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCELLKKCCETLTEE
        EE + ERVSS  +  D+D+DGS  RKKLRL+KEQSALLE+ F+++STLNPKQK ALA+QLNLRPRQVEVWFQNRRARTKLKQTEVDCE LK+CCETLTEE
Subjt:  EEVELERVSSRVS--DEDEDGSNNRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCELLKKCCETLTEE

Query:  NRRLQKEVQELKALKLAKP---------------PVYMHMPAATLTMCPSCERVGGVGDGA-------SKAKFSMAPKPHFYNPFSNPSAAC
        NRRLQ+E+QEL+ALK A P               P YM +PAATLT+CPSCERVGG    A       +KA        HF+NPF++ SAAC
Subjt:  NRRLQKEVQELKALKLAKP---------------PVYMHMPAATLTMCPSCERVGGVGDGA-------SKAKFSMAPKPHFYNPFSNPSAAC

Arabidopsis top hits (each hit lists e value, % identity, and alignment)
AT2G22800.1 Homeobox-leucine zipper protein family (e value: 5.4e-73; % identity: 60.14)
Query:  MGFDDLSQTGLVLGLGLSEAANSPIINKLKKKHAPCSSTSLDHFEPCALTLGFSGDSHRKVVDVGGVDHHHLYRQASPHSSAVCSSFSGGGSRVKRERDL
        MGFDD   TGLVLGLG      SPI N     ++    +S+   EP +LTL  SGD    V  V G D   L RQ S HS    SSFS  G  VKRERD 
Subjt:  MGFDDLSQTGLVLGLGLSEAANSPIINKLKKKHAPCSSTSLDHFEPCALTLGFSGDSHRKVVDVGGVDHHHLYRQASPHSSAVCSSFSGGGSRVKRERDL

Query:  SSEEVELERVSSRV-SD--EDEDGSNNRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCELLKKCCETL
          E  E E ++ RV SD  EDE+G + RKKLRL+K+QSALLEESFK +STLNPKQKQ LARQLNLRPRQVEVWFQNRRARTKLKQTEVDCE LKKCCETL
Subjt:  SSEEVELERVSSRV-SD--EDEDGSNNRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCELLKKCCETL

Query:  TEENRRLQKEVQELKALKLAKPPVYMHMPAATLTMCPSCERVGGVGDG------------------ASKAKFSMAPKPHFYNPFSNPSAAC
         +EN RLQKE+QELK LKL + P YMHMPA+TLT CPSCER+GG G G                   +K  FS++ KPHF+NPF+NPSAAC
Subjt:  TEENRRLQKEVQELKALKLAKPPVYMHMPAATLTMCPSCERVGGVGDG------------------ASKAKFSMAPKPHFYNPFSNPSAAC

AT3G60390.1 homeobox-leucine zipper protein 3 (e value: 3.9e-47; % identity: 61.26)
Query:  SPHSSAVCSSFSGGGSRVKRERDLSS----------EEVELERVSSRV--SDEDEDGSNN-----RKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQ
        S  +S V S  SG     K ER+L +          E+ E+ER S  +    +DEDGS N     RKKLRLSKEQ+ +LEE+FK++STLNPKQK ALA+Q
Subjt:  SPHSSAVCSSFSGGGSRVKRERDLSS----------EEVELERVSSRV--SDEDEDGSNN-----RKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQ

Query:  LNLRPRQVEVWFQNRRARTKLKQTEVDCELLKKCCETLTEENRRLQKEVQELKALKLAKPPVYMHM-PAATLTMCPSCERVGGVGDGASKA
        LNLR RQVEVWFQNRRARTKLKQTEVDCE LK+CCE LT+ENRRLQKEV EL+ALKL+ P +YMHM P  TLTMCPSCERV      +S A
Subjt:  LNLRPRQVEVWFQNRRARTKLKQTEVDCELLKKCCETLTEENRRLQKEVQELKALKLAKPPVYMHM-PAATLTMCPSCERVGGVGDGASKA

AT4G16780.1 homeobox protein 2 (e value: 3.5e-48; % identity: 66.67)
Query:  ASPHSSAVCSSFSGGGSRVKRERDLSSEEVELERVSSRVSDEDEDGSNNRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRR
        +SP+S+   S+    G R +RE D   +        SR   +DEDG N+RKKLRLSK+QSA+LEE+FK +STLNPKQKQALA+QL LR RQVEVWFQNRR
Subjt:  ASPHSSAVCSSFSGGGSRVKRERDLSSEEVELERVSSRVSDEDEDGSNNRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRR

Query:  ARTKLKQTEVDCELLKKCCETLTEENRRLQKEVQELKALKLAKPPVYMHM-PAATLTMCPSCERV
        ARTKLKQTEVDCE L++CCE LTEENRRLQKEV EL+ALKL+ P  YMHM P  TLTMCPSCE V
Subjt:  ARTKLKQTEVDCELLKKCCETLTEENRRLQKEVQELKALKLAKPPVYMHM-PAATLTMCPSCERV

AT4G37790.1 Homeobox-leucine zipper protein family (e value: 2.1e-77; % identity: 61.3)
Query:  MGFDDLSQTGLVLGLGLSEAANSPIINKLKKKHAPCSSTSLDH----FEPCALTLGFSGDSHRKVVDVGGVDHHHLYRQASPHSSAVCSSFSGGGSRVKR
        MG DD   TGLVLGLGLS   N+   N   KK    SS+++DH     +P +LTL  SG+S++     G  D   + RQ S HS    SSFS G  RVKR
Subjt:  MGFDDLSQTGLVLGLGLSEAANSPIINKLKKKHAPCSSTSLDH----FEPCALTLGFSGDSHRKVVDVGGVDHHHLYRQASPHSSAVCSSFSGGGSRVKR

Query:  ERDLS-------SEEVELERVSSRVSD--EDEDGSNNRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDC
        ER++S       +EE     V SRVSD  +DE+G + RKKLRL+K+QSALLE++FK +STLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDC
Subjt:  ERDLS-------SEEVELERVSSRVSD--EDEDGSNNRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDC

Query:  ELLKKCCETLTEENRRLQKEVQELKALKLAKPPVYMHMPAATLTMCPSCERVGGVGDG---------ASKAKFSMAPKPHFYNPFSNPSAAC
        E LKKCCETLT+ENRRLQKE+Q+LKALKL++ P YMHMPAATLTMCPSCER+GG G G          +K  FS+  KP FYNPF+NPSAAC
Subjt:  ELLKKCCETLTEENRRLQKEVQELKALKLAKPPVYMHMPAATLTMCPSCERVGGVGDG---------ASKAKFSMAPKPHFYNPFSNPSAAC

AT5G06710.1 homeobox from Arabidopsis thaliana (e value: 4.2e-49; % identity: 70.78)
Query:  SGGGSRVKRERDLSSEEVELERVSSRVSDEDEDGSN--NRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEV
        S G  R   +RD+     E+ER +SR S+ED D  N   RKKLRLSK+QSA LE+SFK++STLNPKQK ALA+QLNLRPRQVEVWFQNRRARTKLKQTEV
Subjt:  SGGGSRVKRERDLSSEEVELERVSSRVSDEDEDGSN--NRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEV

Query:  DCELLKKCCETLTEENRRLQKEVQELKALKLAKPPVYMHMPAATLTMCPSCERV
        DCE LK+CCE+LTEENRRLQKEV+EL+ LK    P YM +PA TLTMCPSCERV
Subjt:  DCELLKKCCETLTEENRRLQKEVQELKALKLAKPPVYMHMPAATLTMCPSCERV
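
The % identity figures reported for each hit can be related to the alignment blocks through the middle (match) line, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The sketch below counts identities for a single block; the function name `identity_from_midline` is hypothetical, and the example strings are the last block of the XP_022136472.1 alignment as shown on this page. A per-block value is only a partial figure: the hit-level % identity would need the counts summed over all blocks of that hit, and the exact denominator used by the database is an assumption here.

```python
def identity_from_midline(query, midline):
    """Return (identical, aligned_length, percent) for one alignment block.

    The match line is padded to the query length because trailing spaces
    are often lost when the page text is extracted.
    """
    length = len(query)
    midline = midline.ljust(length)[:length]
    # A letter in the match line that equals the query residue is an identity.
    identical = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    percent = 100.0 * identical / length if length else 0.0
    return identical, length, percent


query = "TLTEENRRLQKEVQELKALKLAKPPVYMHMPAATLTMCPSCERVGGVGDGASKAKFSMAPKPHFYNPFSNPSAAC"
midline = "TLT+ENRRLQKE+QELKALKLA+ P+YMHMPAATLTMCPSCERVGG  DGASKAKFSMAPKPHFYNPFSNPSAAC"
identical, length, percent = identity_from_midline(query, midline)
print(f"{identical}/{length} identical ({percent:.2f}%)")
```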


Sequences
CDS sequence
ATGGGTTTTGATGATCTTTCTCAAACAGGCTTGGTTTTGGGCTTAGGGCTCTCAGAAGCAGCTAACTCTCCAATTATTAACAAGTTGAAGAAGAAGCATGCCCCTTGCTC
CTCTACTTCACTTGATCATTTTGAGCCATGTGCTTTGACTTTGGGATTTTCCGGTGACTCTCACCGGAAAGTTGTCGATGTCGGAGGTGTTGATCATCATCATTTGTATC
GTCAAGCTTCCCCTCATAGCAGCGCTGTTTGTTCTTCCTTCTCCGGTGGCGGCAGTAGGGTTAAAAGGGAGAGAGATCTCAGCAGTGAAGAGGTTGAATTGGAGAGGGTT
TCTTCCAGAGTTAGTGATGAAGATGAAGATGGTTCTAATAATAGAAAGAAGCTTAGGCTTTCTAAAGAACAATCCGCTCTCTTGGAAGAGAGTTTCAAACAAAATAGCAC
TCTCAACCCCAAGCAAAAACAAGCCTTGGCGAGACAACTAAATCTACGGCCACGACAAGTCGAAGTATGGTTTCAAAATAGGAGAGCCCGAACAAAACTGAAACAAACAG
AAGTAGACTGTGAGTTGTTGAAGAAATGTTGCGAGACGTTGACAGAAGAAAATAGAAGACTGCAAAAGGAAGTTCAAGAATTGAAGGCGCTGAAGCTGGCAAAGCCGCCG
GTGTACATGCACATGCCAGCGGCAACGTTAACGATGTGCCCGTCGTGCGAAAGGGTGGGTGGCGTCGGCGACGGCGCTTCCAAAGCCAAATTTTCAATGGCTCCCAAGCC
CCACTTTTACAACCCCTTCTCCAATCCTTCAGCCGCATGTTAG
mRNA sequence
GATTTTTATTTTATTTTTTTATTTTATTTTTGTCAATAATGTGAAGATAATAAGTAAGTGGGGTTTCAAAGGTCAACCTATGTATAGAGATAGAAGGGACTTTTAATTTG
ATAACTTTATTTTTTTTCCAAATGCTAAGATTCTTTTGTCTTCTCAATTCTTTTCTTCTGGATTAATAATCATTCCCACCCACTCACACACACAAACACAAACACACAAA
CACACCCATATCTCTCTTTTTTTTCCTCTTCTTAAACCCTCCATTTTGGTTTGTTGTTCTTCACAAGTTCTTCTTTTTCCACTCCCTATCCACATAATTCCTTTTAACTC
CAGATGGGTTTTGATGATCTTTCTCAAACAGGCTTGGTTTTGGGCTTAGGGCTCTCAGAAGCAGCTAACTCTCCAATTATTAACAAGTTGAAGAAGAAGCATGCCCCTTG
CTCCTCTACTTCACTTGATCATTTTGAGCCATGTGCTTTGACTTTGGGATTTTCCGGTGACTCTCACCGGAAAGTTGTCGATGTCGGAGGTGTTGATCATCATCATTTGT
ATCGTCAAGCTTCCCCTCATAGCAGCGCTGTTTGTTCTTCCTTCTCCGGTGGCGGCAGTAGGGTTAAAAGGGAGAGAGATCTCAGCAGTGAAGAGGTTGAATTGGAGAGG
GTTTCTTCCAGAGTTAGTGATGAAGATGAAGATGGTTCTAATAATAGAAAGAAGCTTAGGCTTTCTAAAGAACAATCCGCTCTCTTGGAAGAGAGTTTCAAACAAAATAG
CACTCTCAACCCCAAGCAAAAACAAGCCTTGGCGAGACAACTAAATCTACGGCCACGACAAGTCGAAGTATGGTTTCAAAATAGGAGAGCCCGAACAAAACTGAAACAAA
CAGAAGTAGACTGTGAGTTGTTGAAGAAATGTTGCGAGACGTTGACAGAAGAAAATAGAAGACTGCAAAAGGAAGTTCAAGAATTGAAGGCGCTGAAGCTGGCAAAGCCG
CCGGTGTACATGCACATGCCAGCGGCAACGTTAACGATGTGCCCGTCGTGCGAAAGGGTGGGTGGCGTCGGCGACGGCGCTTCCAAAGCCAAATTTTCAATGGCTCCCAA
GCCCCACTTTTACAACCCCTTCTCCAATCCTTCAGCCGCATGTTAGACAAAATTAATATAATACAACCTAAGACAAAATTCAATGAGATTTGTTTTTCTTTTTCTTTTTC
TTCAAACAAAGCCCGTAGCTATTTTAGAAATTCGAGAAATTTAGTGCAAGTATTAGCTACCCTGGTAGAAAATTTCCCTTAATAATCAAATTATCATTTTTTTTTTTTTT
TGGGTTAGATGAGATGTTGTAAAGTTTTAAAGTCTATCTAGAAATGAAGGATTGATGGGTATATATTATTTTAAGTGAGGGTGATCACAATTTTTTTTTAGTTCAACCAT
TATTAGATGAAAGATTGAATCATCGATCTTTAAAATAATTATGTTCGAGAGATGTGGGTGGTGTGTTGGATTGGAAAATTATTGGGTGGTGTTGGTGGAAGGGGCTATCT
ATCTATCTCATTGGCTGCCTACGTGTTACGCAAGGGATGACATTGATATGACTTTTGCTATTTTATTAGTGCGTTTTGCCTTCATTGGTCAATACAATACAATAATACAA
TTATCTTTTCATCA
Protein sequence
MGFDDLSQTGLVLGLGLSEAANSPIINKLKKKHAPCSSTSLDHFEPCALTLGFSGDSHRKVVDVGGVDHHHLYRQASPHSSAVCSSFSGGGSRVKRERDLSSEEVELERV
SSRVSDEDEDGSNNRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCELLKKCCETLTEENRRLQKEVQELKALKLAKPP
VYMHMPAATLTMCPSCERVGGVGDGASKAKFSMAPKPHFYNPFSNPSAAC
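
The relationship between the CDS and protein sequences above can be checked by translating the CDS with the standard genetic code. This is a minimal sketch under that assumption (the page does not name the translation table); `translate` and `CODON_TABLE` are illustrative names, and the two inputs are the first 18 and last 12 nucleotides of the CDS listed on this page.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, to_stop=True):
    """Translate a nucleotide sequence codon by codon in frame 0.

    Unknown codons become 'X'; translation ends at the first stop codon
    when to_stop is True, otherwise stops are written as '*'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if amino_acid == "*" and to_stop:
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGGGTTTTGATGATCTT"))  # start of the CDS
print(translate("GCCGCATGTTAG"))        # end of the CDS, including the stop codon
```

The two fragments translate to `MGFDDL` and `AAC`, which match the first six and last three residues of the protein sequence listed above.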