CuGenDBv2

Tan0012900 (gene) of Snake gourd v1 genome

Gene ID: Tan0012900
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Homeobox-leucine zipper protein
Genome location: LG03:73128639..73130304
RNA-Seq Expression: Tan0012900
Synteny: Tan0012900
Gene Ontology terms:
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR001356 - Homeobox domain
IPR003106 - Leucine zipper, homeobox-associated
IPR009057 - Homeobox-like domain superfamily
IPR017970 - Homeobox, conserved site
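
The genome location field above follows a `sequence:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; this record does not state the convention), the string can be split into its parts and the locus span computed. The names `Locus` and `parse_locus` are hypothetical helpers introduced only for this illustration and are not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates.
        return self.end - self.start + 1


def parse_locus(text: str) -> Locus:
    """Parse a string such as 'LG03:73128639..73130304'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized genome location: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Locus(seqid, start, end)


locus = parse_locus("LG03:73128639..73130304")
print(locus.seqid, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption, the span covers the whole gene model on LG03, so it may be longer than the CDS or mRNA sequences listed further down.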


Homology
GenBank top hits (e value, % identity, alignment)
KAG7034261.1 Homeobox-leucine zipper protein HAT22 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 2.3e-115; % identity: 85.77)
Query:  MGFDDLSNTGLLLGLGWTFPSKPNDDLSQKPKKAVDLFSFAAAQSEPSLTLALSTADTYPLP-----DLCRQPSPHSAVSSFSGGRVKRERDVSAEDIEE
        MGFDDLSNTGLLLGLG   PS P   LS KPKK VDLFSF A +SEPSLTL LST +TYPLP     DLCRQPSPHSA+SSFSGGRVKRERDVS EDIEE
Subjt:  MGFDDLSNTGLLLGLGWTFPSKPNDDLSQKPKKAVDLFSFAAAQSEPSLTLALSTADTYPLP-----DLCRQPSPHSAVSSFSGGRVKRERDVSAEDIEE

Query:  EKASSRVSDEDEDGSNARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKE
        EKA SRVSDEDEDGS ARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENR+LQKE
Subjt:  EKASSRVSDEDEDGSNARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKE

Query:  LQELKALKLAQPLFMQMPAATLTMCPSCERIGGGGATVNGDGNPKGPFSMAPKPRFYKPFTNPSAAC
        LQELKALKLAQPL MQMPAATLTMCPSCER GGG   VN DGN K PFSMA  PRF K FT PSAAC
Subjt:  LQELKALKLAQPLFMQMPAATLTMCPSCERIGGGGATVNGDGNPKGPFSMAPKPRFYKPFTNPSAAC

XP_004143421.1 homeobox-leucine zipper protein HAT22 [Cucumis sativus] (e value: 1.7e-118; % identity: 86.74)
Query:  MGFDDLSNTGLLLGLGWTFPSKPNDDLSQKPKKAVDLFSFAAAQSEPSLTLALSTADTYP--LPDLCRQPSPHSAVSSFSGGRVKRERDVSAEDIEEEKA
        MGFDDLSNT LLLGLG T PS P   +SQKPKK +D   F   +SEPSLTL LST DTYP   PDL RQPSPHSA+SSFSG RVKRERDVS E+IEEEKA
Subjt:  MGFDDLSNTGLLLGLGWTFPSKPNDDLSQKPKKAVDLFSFAAAQSEPSLTLALSTADTYP--LPDLCRQPSPHSAVSSFSGGRVKRERDVSAEDIEEEKA

Query:  SSRVSDEDEDGSNARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQE
        SSRVSDEDEDGSNARKKLRLTKEQSALLE+SFKLHSTLNPKQKQALA ELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQE
Subjt:  SSRVSDEDEDGSNARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQE

Query:  LKALKLAQPLFMQMPAATLTMCPSCERIGGGGATVNGDGNPKGPFSMAPKPRFYKPFTNPSAAC
        LKALKLAQPLFMQMPAATLTMCPSCERIGGG ATVNGDGN KGPFS+A KPRFYK FT PSAAC
Subjt:  LKALKLAQPLFMQMPAATLTMCPSCERIGGGGATVNGDGNPKGPFSMAPKPRFYKPFTNPSAAC

XP_008440442.1 PREDICTED: homeobox-leucine zipper protein HAT22-like [Cucumis melo] (e value: 5.3e-120; % identity: 87.5)
Query:  MGFDDLSNTGLLLGLGWTFPSKPNDDLSQKPKKAVDLFSFAAAQSEPSLTLALSTADTYP--LPDLCRQPSPHSAVSSFSGGRVKRERDVSAEDIEEEKA
        MGFDDLSNT LLLGLG T PS P   +SQKPKK++DL  F   +SEPSLTL LST DTYP   PDL RQPSPHSA+SSFSG RVKRERDVS E+IEEEKA
Subjt:  MGFDDLSNTGLLLGLGWTFPSKPNDDLSQKPKKAVDLFSFAAAQSEPSLTLALSTADTYP--LPDLCRQPSPHSAVSSFSGGRVKRERDVSAEDIEEEKA

Query:  SSRVSDEDEDGSNARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQE
        SSRVSDEDEDGSNARKKLRLTKEQSALLE+SFKLHSTLNPKQKQALA+ELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQE
Subjt:  SSRVSDEDEDGSNARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQE

Query:  LKALKLAQPLFMQMPAATLTMCPSCERIGGGGATVNGDGNPKGPFSMAPKPRFYKPFTNPSAAC
        LKALKLAQPLFMQMPAATLTMCPSCERIGGG ATVNGDGN KGPFSMA KPRFYK FT PSAAC
Subjt:  LKALKLAQPLFMQMPAATLTMCPSCERIGGGGATVNGDGNPKGPFSMAPKPRFYKPFTNPSAAC

XP_023003674.1 homeobox-leucine zipper protein HAT22-like [Cucurbita maxima] (e value: 3.6e-116; % identity: 87.02)
Query:  MGFDDLSNTGLLLGLGWTFPSKPNDDLSQKPKKAVDLFSFAAAQSEPSLTLALSTADTYPLPDLCRQPSPHSAVSSFSGGRVKRERDVSAEDIEEEKASS
        MGFDDL NTGLLLGLG T  SKP   LSQKP K VDLFSF A +SEPSLTL LST       DLCRQPSPHS VSS SGG+VKRERDVS EDIEEEKASS
Subjt:  MGFDDLSNTGLLLGLGWTFPSKPNDDLSQKPKKAVDLFSFAAAQSEPSLTLALSTADTYPLPDLCRQPSPHSAVSSFSGGRVKRERDVSAEDIEEEKASS

Query:  RVSDEDEDGSNARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQELK
        RVSDE+EDGSNARKKLRLTKEQSALLE+SFKLH TLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQELK
Subjt:  RVSDEDEDGSNARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQELK

Query:  ALKLAQPLFMQMPAATLTMCPSCERIGGGGATVNGDGNPKGPFSMAPKPRFYKPFTNPSAAC
        ALKL +PLFMQMPAATLTMCPSCERIGG  ATVNG+GNPKG FSMAPKP+FYKPFTNPSAAC
Subjt:  ALKLAQPLFMQMPAATLTMCPSCERIGGGGATVNGDGNPKGPFSMAPKPRFYKPFTNPSAAC

XP_038883701.1 homeobox-leucine zipper protein HAT22-like [Benincasa hispida] (e value: 1.7e-121; % identity: 89.02)
Query:  MGFDDLSNTGLLLGLGWTFPSKPNDDLSQKPKKAVDLFSFAAAQSEPSLTLALSTADTYP--LPDLCRQPSPHSAVSSFSGGRVKRERDVSAEDIEEEKA
        MGFDDLSNTGLLLGLG T PS P   LSQKPKK VD   F A +SEPSLTL LST DTYP   PDL RQPSPHSA+SSFSGGRVKRERDVS EDIEEEKA
Subjt:  MGFDDLSNTGLLLGLGWTFPSKPNDDLSQKPKKAVDLFSFAAAQSEPSLTLALSTADTYP--LPDLCRQPSPHSAVSSFSGGRVKRERDVSAEDIEEEKA

Query:  SSRVSDEDEDGSNARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQE
        SSRVSDEDEDGSNARKKLRLTK+QSALLE+SFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQE
Subjt:  SSRVSDEDEDGSNARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQE

Query:  LKALKLAQPLFMQMPAATLTMCPSCERIGGGGATVNGDGNPKGPFSMAPKPRFYKPFTNPSAAC
        LKALKLAQPLFMQMPAATLTMCPSCERIGGGGA VNGDGN KGPFSMAP PRF+K FT PSAAC
Subjt:  LKALKLAQPLFMQMPAATLTMCPSCERIGGGGATVNGDGNPKGPFSMAPKPRFYKPFTNPSAAC

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KJS8 Homeobox-leucine zipper protein (e value: 8.3e-119; % identity: 86.74)
Query:  MGFDDLSNTGLLLGLGWTFPSKPNDDLSQKPKKAVDLFSFAAAQSEPSLTLALSTADTYP--LPDLCRQPSPHSAVSSFSGGRVKRERDVSAEDIEEEKA
        MGFDDLSNT LLLGLG T PS P   +SQKPKK +D   F   +SEPSLTL LST DTYP   PDL RQPSPHSA+SSFSG RVKRERDVS E+IEEEKA
Subjt:  MGFDDLSNTGLLLGLGWTFPSKPNDDLSQKPKKAVDLFSFAAAQSEPSLTLALSTADTYP--LPDLCRQPSPHSAVSSFSGGRVKRERDVSAEDIEEEKA

Query:  SSRVSDEDEDGSNARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQE
        SSRVSDEDEDGSNARKKLRLTKEQSALLE+SFKLHSTLNPKQKQALA ELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQE
Subjt:  SSRVSDEDEDGSNARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQE

Query:  LKALKLAQPLFMQMPAATLTMCPSCERIGGGGATVNGDGNPKGPFSMAPKPRFYKPFTNPSAAC
        LKALKLAQPLFMQMPAATLTMCPSCERIGGG ATVNGDGN KGPFS+A KPRFYK FT PSAAC
Subjt:  LKALKLAQPLFMQMPAATLTMCPSCERIGGGGATVNGDGNPKGPFSMAPKPRFYKPFTNPSAAC

A0A1S3B144 homeobox-leucine zipper protein HAT22-like (e value: 2.6e-120; % identity: 87.5)
Query:  MGFDDLSNTGLLLGLGWTFPSKPNDDLSQKPKKAVDLFSFAAAQSEPSLTLALSTADTYP--LPDLCRQPSPHSAVSSFSGGRVKRERDVSAEDIEEEKA
        MGFDDLSNT LLLGLG T PS P   +SQKPKK++DL  F   +SEPSLTL LST DTYP   PDL RQPSPHSA+SSFSG RVKRERDVS E+IEEEKA
Subjt:  MGFDDLSNTGLLLGLGWTFPSKPNDDLSQKPKKAVDLFSFAAAQSEPSLTLALSTADTYP--LPDLCRQPSPHSAVSSFSGGRVKRERDVSAEDIEEEKA

Query:  SSRVSDEDEDGSNARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQE
        SSRVSDEDEDGSNARKKLRLTKEQSALLE+SFKLHSTLNPKQKQALA+ELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQE
Subjt:  SSRVSDEDEDGSNARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQE

Query:  LKALKLAQPLFMQMPAATLTMCPSCERIGGGGATVNGDGNPKGPFSMAPKPRFYKPFTNPSAAC
        LKALKLAQPLFMQMPAATLTMCPSCERIGGG ATVNGDGN KGPFSMA KPRFYK FT PSAAC
Subjt:  LKALKLAQPLFMQMPAATLTMCPSCERIGGGGATVNGDGNPKGPFSMAPKPRFYKPFTNPSAAC

A0A5A7T2R8 Homeobox-leucine zipper protein HAT22-like (e value: 2.6e-120; % identity: 87.5)
Query:  MGFDDLSNTGLLLGLGWTFPSKPNDDLSQKPKKAVDLFSFAAAQSEPSLTLALSTADTYP--LPDLCRQPSPHSAVSSFSGGRVKRERDVSAEDIEEEKA
        MGFDDLSNT LLLGLG T PS P   +SQKPKK++DL  F   +SEPSLTL LST DTYP   PDL RQPSPHSA+SSFSG RVKRERDVS E+IEEEKA
Subjt:  MGFDDLSNTGLLLGLGWTFPSKPNDDLSQKPKKAVDLFSFAAAQSEPSLTLALSTADTYP--LPDLCRQPSPHSAVSSFSGGRVKRERDVSAEDIEEEKA

Query:  SSRVSDEDEDGSNARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQE
        SSRVSDEDEDGSNARKKLRLTKEQSALLE+SFKLHSTLNPKQKQALA+ELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQE
Subjt:  SSRVSDEDEDGSNARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQE

Query:  LKALKLAQPLFMQMPAATLTMCPSCERIGGGGATVNGDGNPKGPFSMAPKPRFYKPFTNPSAAC
        LKALKLAQPLFMQMPAATLTMCPSCERIGGG ATVNGDGN KGPFSMA KPRFYK FT PSAAC
Subjt:  LKALKLAQPLFMQMPAATLTMCPSCERIGGGGATVNGDGNPKGPFSMAPKPRFYKPFTNPSAAC

A0A6J1GGZ5 homeobox-leucine zipper protein HAT22 (e value: 2.5e-115; % identity: 85.39)
Query:  MGFDDLSNTGLLLGLGWTFPSKPNDDLSQKPKKAVDLFSFAAAQSEPSLTLALSTADTYPLP-----DLCRQPSPHSAVSSFSGGRVKRERDVSAEDIEE
        MGFDDLSNTGLLLGLG   PS P   LS KPKK VDLFSF A +SEPSLTL LST +TYP+P     DLCRQPSPHSA+SSFSGGRVKRERDVS EDIEE
Subjt:  MGFDDLSNTGLLLGLGWTFPSKPNDDLSQKPKKAVDLFSFAAAQSEPSLTLALSTADTYPLP-----DLCRQPSPHSAVSSFSGGRVKRERDVSAEDIEE

Query:  EKASSRVSDEDEDGSNARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKE
        EKA SRVSDEDEDGS ARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENR+LQKE
Subjt:  EKASSRVSDEDEDGSNARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKE

Query:  LQELKALKLAQPLFMQMPAATLTMCPSCERIGGGGATVNGDGNPKGPFSMAPKPRFYKPFTNPSAAC
        LQELKALKLAQPL MQMPAATLTMCPSCER GGG   VN DGN K PFSMA  PRF K FT PSAAC
Subjt:  LQELKALKLAQPLFMQMPAATLTMCPSCERIGGGGATVNGDGNPKGPFSMAPKPRFYKPFTNPSAAC

A0A6J1KU00 homeobox-leucine zipper protein HAT22-like (e value: 1.7e-116; % identity: 87.02)
Query:  MGFDDLSNTGLLLGLGWTFPSKPNDDLSQKPKKAVDLFSFAAAQSEPSLTLALSTADTYPLPDLCRQPSPHSAVSSFSGGRVKRERDVSAEDIEEEKASS
        MGFDDL NTGLLLGLG T  SKP   LSQKP K VDLFSF A +SEPSLTL LST       DLCRQPSPHS VSS SGG+VKRERDVS EDIEEEKASS
Subjt:  MGFDDLSNTGLLLGLGWTFPSKPNDDLSQKPKKAVDLFSFAAAQSEPSLTLALSTADTYPLPDLCRQPSPHSAVSSFSGGRVKRERDVSAEDIEEEKASS

Query:  RVSDEDEDGSNARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQELK
        RVSDE+EDGSNARKKLRLTKEQSALLE+SFKLH TLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQELK
Subjt:  RVSDEDEDGSNARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQELK

Query:  ALKLAQPLFMQMPAATLTMCPSCERIGGGGATVNGDGNPKGPFSMAPKPRFYKPFTNPSAAC
        ALKL +PLFMQMPAATLTMCPSCERIGG  ATVNG+GNPKG FSMAPKP+FYKPFTNPSAAC
Subjt:  ALKLAQPLFMQMPAATLTMCPSCERIGGGGATVNGDGNPKGPFSMAPKPRFYKPFTNPSAAC

SwissProt top hits (e value, % identity, alignment)
A2XE76 Homeobox-leucine zipper protein HOX19 (e value: 2.8e-55; % identity: 51.86)
Query:  LSNTGLLLGL-------GWTFPSKPNDDLSQKPKKAVDLFSFAAAQSEPSLTLAL-----STADTYPLPDLCRQPSPHSAVSSFSGG-----RVKRERDV
        LS+ GL LGL       G T  +  +    ++P       S      EPSLTL+L     + A             P  +VSS S G      VKRER  
Subjt:  LSNTGLLLGL-------GWTFPSKPNDDLSQKPKKAVDLFSFAAAQSEPSLTLAL-----STADTYPLPDLCRQPSPHSAVSSFSGG-----RVKRERDV

Query:  SAEDIEEEKASSRVS--DEDEDGSNARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLT
         AE+ + E+ SS  +  D+D+DGS  RKKLRLTKEQSALLED F+ HSTLNPKQK ALA++LNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLT
Subjt:  SAEDIEEEKASSRVS--DEDEDGSNARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLT

Query:  DENRRLQKELQELKALKLA----------------QPLFMQMPAATLTMCPSCERIGG---GGATVNGDGNPKGPFSMAPKPRFYKPFTNPSAAC
        +ENRRLQ+ELQEL+ALK A                 P +MQ+PAATLT+CPSCER+GG       V  DG   GP        F+ PFT+ SAAC
Subjt:  DENRRLQKELQELKALKLA----------------QPLFMQMPAATLTMCPSCERIGG---GGATVNGDGNPKGPFSMAPKPRFYKPFTNPSAAC

A2YW03 Homeobox-leucine zipper protein HOX27 (e value: 1.1e-51; % identity: 75.54)
Query:  EKASSRVSDEDEDGSNARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKE
        E++SSR SD+DE G++ARKKLRL+KEQSA LE+SFK HSTLNPKQK ALA++LNLRPRQVEVWFQNRRARTKLKQTEVDCE+LKRCCETLT+ENRRL KE
Subjt:  EKASSRVSDEDEDGSNARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKE

Query:  LQELKALKLAQPLFMQMPAATLTMCPSCERIGGGGATVN
        L EL+ALK A+P +M +PA TL+MCPSCER+    AT +
Subjt:  LQELKALKLAQPLFMQMPAATLTMCPSCERIGGGGATVN

P46603 Homeobox-leucine zipper protein HAT9 (e value: 2.8e-79; % identity: 62.32)
Query:  MGFDDLSNTGLLLGLGWT-FPSKPNDDLSQKPKKAVDLFSFAAAQSEPSLTLALSTADTYPL----PDLCRQPSPHSAVSSFSGGR-VKRERDVSAEDIE
        MGFDD  NTGL+LGLG +  P+  N  + Q           +  + EPSLTL LS   +  +      LCRQ S HS VSSFS GR VKRERD   E  E
Subjt:  MGFDDLSNTGLLLGLGWT-FPSKPNDDLSQKPKKAVDLFSFAAAQSEPSLTLALSTADTYPL----PDLCRQPSPHSAVSSFSGGR-VKRERDVSAEDIE

Query:  EEKASSRV-SD--EDEDGSNARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRR
        EE+ + RV SD  EDE+G +ARKKLRLTK+QSALLE+SFK HSTLNPKQKQ LAR+LNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK+CCETL DEN R
Subjt:  EEKASSRV-SD--EDEDGSNARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRR

Query:  LQKELQELKALKLAQPLFMQMPAATLTMCPSCERIGGGGATVNGDG-------------NPKGPFSMAPKPRFYKPFTNPSAAC
        LQKE+QELK LKL QP +M MPA+TLT CPSCERIGGGG    G G               KG FS++ KP F+ PFTNPSAAC
Subjt:  LQKELQELKALKLAQPLFMQMPAATLTMCPSCERIGGGGATVNGDG-------------NPKGPFSMAPKPRFYKPFTNPSAAC

P46604 Homeobox-leucine zipper protein HAT22 (e value: 2.6e-85; % identity: 64.18)
Query:  MGFDDLSNTGLLLGLGWTFPSKPNDDLSQKPKKAVDLFSFAAAQSEPSLTLALSTADTYPL-------PDLCRQPSPHSAVSSFSGGRVKRERDVSAEDI
        MG DD  NTGL+LGLG    S   ++ +   KK+         + +PSLTL+LS  ++Y +         +CRQ S HS +SSFS GRVKRER++S  D 
Subjt:  MGFDDLSNTGLLLGLGWTFPSKPNDDLSQKPKKAVDLFSFAAAQSEPSLTLALSTADTYPL-------PDLCRQPSPHSAVSSFSGGRVKRERDVSAEDI

Query:  EEEK-------ASSRVSD--EDEDGSNARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCET
        EEE          SRVSD  +DE+G +ARKKLRLTK+QSALLED+FKLHSTLNPKQKQALAR+LNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK+CCET
Subjt:  EEEK-------ASSRVSD--EDEDGSNARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCET

Query:  LTDENRRLQKELQELKALKLAQPLFMQMPAATLTMCPSCERIGGGG----ATVNGDGNPKGPFSMAPKPRFYKPFTNPSAAC
        LTDENRRLQKELQ+LKALKL+QP +M MPAATLTMCPSCER+GGGG     T   +   KG FS+  KPRFY PFTNPSAAC
Subjt:  LTDENRRLQKELQELKALKLAQPLFMQMPAATLTMCPSCERIGGGG----ATVNGDGNPKGPFSMAPKPRFYKPFTNPSAAC

Q8GRL4 Homeobox-leucine zipper protein HOX19 (e value: 2.8e-55; % identity: 51.86)
Query:  LSNTGLLLGL-------GWTFPSKPNDDLSQKPKKAVDLFSFAAAQSEPSLTLAL-----STADTYPLPDLCRQPSPHSAVSSFSGG-----RVKRERDV
        LS+ GL LGL       G T  +  +    ++P       S      EPSLTL+L     + A             P  +VSS S G      VKRER  
Subjt:  LSNTGLLLGL-------GWTFPSKPNDDLSQKPKKAVDLFSFAAAQSEPSLTLAL-----STADTYPLPDLCRQPSPHSAVSSFSGG-----RVKRERDV

Query:  SAEDIEEEKASSRVS--DEDEDGSNARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLT
         AE+ + E+ SS  +  D+D+DGS  RKKLRLTKEQSALLED F+ HSTLNPKQK ALA++LNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLT
Subjt:  SAEDIEEEKASSRVS--DEDEDGSNARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLT

Query:  DENRRLQKELQELKALKLA----------------QPLFMQMPAATLTMCPSCERIGG---GGATVNGDGNPKGPFSMAPKPRFYKPFTNPSAAC
        +ENRRLQ+ELQEL+ALK A                 P +MQ+PAATLT+CPSCER+GG       V  DG   GP        F+ PFT+ SAAC
Subjt:  DENRRLQKELQELKALKLA----------------QPLFMQMPAATLTMCPSCERIGG---GGATVNGDGNPKGPFSMAPKPRFYKPFTNPSAAC

Arabidopsis top hits (e value, % identity, alignment)
AT2G22800.1 Homeobox-leucine zipper protein family (e value: 2.0e-80; % identity: 62.32)
Query:  MGFDDLSNTGLLLGLGWT-FPSKPNDDLSQKPKKAVDLFSFAAAQSEPSLTLALSTADTYPL----PDLCRQPSPHSAVSSFSGGR-VKRERDVSAEDIE
        MGFDD  NTGL+LGLG +  P+  N  + Q           +  + EPSLTL LS   +  +      LCRQ S HS VSSFS GR VKRERD   E  E
Subjt:  MGFDDLSNTGLLLGLGWT-FPSKPNDDLSQKPKKAVDLFSFAAAQSEPSLTLALSTADTYPL----PDLCRQPSPHSAVSSFSGGR-VKRERDVSAEDIE

Query:  EEKASSRV-SD--EDEDGSNARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRR
        EE+ + RV SD  EDE+G +ARKKLRLTK+QSALLE+SFK HSTLNPKQKQ LAR+LNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK+CCETL DEN R
Subjt:  EEKASSRV-SD--EDEDGSNARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRR

Query:  LQKELQELKALKLAQPLFMQMPAATLTMCPSCERIGGGGATVNGDG-------------NPKGPFSMAPKPRFYKPFTNPSAAC
        LQKE+QELK LKL QP +M MPA+TLT CPSCERIGGGG    G G               KG FS++ KP F+ PFTNPSAAC
Subjt:  LQKELQELKALKLAQPLFMQMPAATLTMCPSCERIGGGGATVNGDG-------------NPKGPFSMAPKPRFYKPFTNPSAAC

AT3G60390.1 homeobox-leucine zipper protein 3 (e value: 3.4e-48; % identity: 58.5)
Query:  SPHSAVSSFSGGRVKRERDVSA----------EDIEEEKASSRV--SDEDEDGS-----NARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLR
        SP+S VSS   G+ K ER++ A          ED E E+AS  +    +DEDGS     ++RKKLRL+KEQ+ +LE++FK HSTLNPKQK ALA++LNLR
Subjt:  SPHSAVSSFSGGRVKRERDVSA----------EDIEEEKASSRV--SDEDEDGS-----NARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLR

Query:  PRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQELKALKLAQPLFMQM-PAATLTMCPSCERIGGGGAT------VNGDGNPKGPFS
         RQVEVWFQNRRARTKLKQTEVDCE+LKRCCE LTDENRRLQKE+ EL+ALKL+  L+M M P  TLTMCPSCER+    ++      V    +P GP S
Subjt:  PRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQELKALKLAQPLFMQM-PAATLTMCPSCERIGGGGAT------VNGDGNPKGPFS

AT4G16780.1 homeobox protein 2 (e value: 9.6e-51; % identity: 54.02)
Query:  SNTGLLLGLGWT---FPSKPNDDLSQKPKKAVDLFSFAAAQSEPSLTLALSTADTYPLPDLCRQPSPHSAVSSFSGGRVKRERDVSAEDIEEEKASSRVS
        S+ GL     W      S PN D SQK  +    F      + P  T      D           SP+S VSS +G R +RE D   +        SR  
Subjt:  SNTGLLLGLGWT---FPSKPNDDLSQKPKKAVDLFSFAAAQSEPSLTLALSTADTYPLPDLCRQPSPHSAVSSFSGGRVKRERDVSAEDIEEEKASSRVS

Query:  DEDEDGSNARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQELKALK
         +DEDG N+RKKLRL+K+QSA+LE++FK HSTLNPKQKQALA++L LR RQVEVWFQNRRARTKLKQTEVDCEFL+RCCE LT+ENRRLQKE+ EL+ALK
Subjt:  DEDEDGSNARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQELKALK

Query:  LAQPLFMQM-PAATLTMCPSCERI
        L+   +M M P  TLTMCPSCE +
Subjt:  LAQPLFMQM-PAATLTMCPSCERI

AT4G37790.1 Homeobox-leucine zipper protein family (e value: 1.8e-86; % identity: 64.18)
Query:  MGFDDLSNTGLLLGLGWTFPSKPNDDLSQKPKKAVDLFSFAAAQSEPSLTLALSTADTYPL-------PDLCRQPSPHSAVSSFSGGRVKRERDVSAEDI
        MG DD  NTGL+LGLG    S   ++ +   KK+         + +PSLTL+LS  ++Y +         +CRQ S HS +SSFS GRVKRER++S  D 
Subjt:  MGFDDLSNTGLLLGLGWTFPSKPNDDLSQKPKKAVDLFSFAAAQSEPSLTLALSTADTYPL-------PDLCRQPSPHSAVSSFSGGRVKRERDVSAEDI

Query:  EEEK-------ASSRVSD--EDEDGSNARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCET
        EEE          SRVSD  +DE+G +ARKKLRLTK+QSALLED+FKLHSTLNPKQKQALAR+LNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK+CCET
Subjt:  EEEK-------ASSRVSD--EDEDGSNARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCET

Query:  LTDENRRLQKELQELKALKLAQPLFMQMPAATLTMCPSCERIGGGG----ATVNGDGNPKGPFSMAPKPRFYKPFTNPSAAC
        LTDENRRLQKELQ+LKALKL+QP +M MPAATLTMCPSCER+GGGG     T   +   KG FS+  KPRFY PFTNPSAAC
Subjt:  LTDENRRLQKELQELKALKLAQPLFMQMPAATLTMCPSCERIGGGG----ATVNGDGNPKGPFSMAPKPRFYKPFTNPSAAC

AT5G06710.1 homeobox from Arabidopsis thaliana (e value: 1.5e-51; % identity: 66.88)
Query:  VSSFSGGRVKRERDVSAEDIEEEKASSRVSDEDEDGSN--ARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQT
        + S+   R   +RD+   D E E+++SR S+ED D  N   RKKLRL+K+QSA LEDSFK HSTLNPKQK ALA++LNLRPRQVEVWFQNRRARTKLKQT
Subjt:  VSSFSGGRVKRERDVSAEDIEEEKASSRVSDEDEDGSN--ARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQT

Query:  EVDCEFLKRCCETLTDENRRLQKELQELKALKLAQPLFMQMPAATLTMCPSCERIGGGGA
        EVDCE+LKRCCE+LT+ENRRLQKE++EL+ LK + P +MQ+PA TLTMCPSCER+    A
Subjt:  EVDCEFLKRCCETLTDENRRLQKELQELKALKLAQPLFMQMPAATLTMCPSCERIGGGGA


Sequences
CDS sequence
ATGGGTTTTGATGATCTTTCTAATACAGGCCTTCTATTGGGTTTGGGATGGACGTTTCCATCGAAACCCAACGATGATCTTTCTCAAAAACCCAAGAAGGCTGTGGATTT
GTTCAGTTTTGCCGCCGCACAATCCGAGCCCTCTTTAACTTTGGCCCTTTCTACTGCCGACACTTACCCGCTGCCTGATTTGTGTCGCCAACCGTCGCCTCACAGCGCGG
TTTCTTCCTTCTCTGGCGGCAGGGTCAAGCGTGAAAGAGATGTTTCTGCTGAGGATATTGAAGAAGAGAAAGCTTCTTCTCGAGTTAGTGATGAAGATGAAGATGGGTCT
AATGCCAGGAAAAAACTTAGGCTAACGAAAGAACAATCTGCCCTTTTGGAGGACAGCTTCAAACTTCACAGCACTCTCAATCCTAAGCAAAAACAAGCCTTAGCCAGGGA
GTTGAATCTTCGGCCTCGACAAGTTGAAGTTTGGTTCCAGAACAGAAGAGCCAGGACGAAGCTCAAGCAAACAGAGGTAGATTGCGAGTTTCTAAAGAGATGCTGCGAAA
CGCTAACAGACGAAAACAGGAGGCTGCAAAAAGAACTGCAAGAACTGAAAGCCCTGAAACTAGCGCAGCCTCTGTTCATGCAAATGCCGGCGGCGACACTCACCATGTGC
CCGTCCTGCGAGAGGATCGGCGGGGGCGGCGCCACCGTTAACGGGGACGGAAATCCCAAGGGCCCATTTTCGATGGCTCCCAAGCCCCGGTTTTACAAACCCTTCACCAA
TCCTTCTGCTGCTTGCTAA
mRNA sequence
ATTTTTCTCTTTTGGGTTTGGGTATAATAATGAGTAATGAGGGTAGGGGATATCAGATAGTGGGGTACATATGGTCAACCTATGTAAGTAGCGAGATAGAAGGAGCATTG
ATAATTTCTTTTCATCCAATGCCAAAGATTCGTCTGTCTCCTCATTCTTTCCTTTGATTATTAATCGCCCACCACCAACCCATTCCCTCCATTCCCATTTCCCTCTTCTT
AAACCCCATCCCTCTCCTCAAAAATCCCCACACCATCCCTTCAAAACTTTCACGCACCGCACCGTCTTTATTCATATAACCCAGATGGGTTTTGATGATCTTTCTAATAC
AGGCCTTCTATTGGGTTTGGGATGGACGTTTCCATCGAAACCCAACGATGATCTTTCTCAAAAACCCAAGAAGGCTGTGGATTTGTTCAGTTTTGCCGCCGCACAATCCG
AGCCCTCTTTAACTTTGGCCCTTTCTACTGCCGACACTTACCCGCTGCCTGATTTGTGTCGCCAACCGTCGCCTCACAGCGCGGTTTCTTCCTTCTCTGGCGGCAGGGTC
AAGCGTGAAAGAGATGTTTCTGCTGAGGATATTGAAGAAGAGAAAGCTTCTTCTCGAGTTAGTGATGAAGATGAAGATGGGTCTAATGCCAGGAAAAAACTTAGGCTAAC
GAAAGAACAATCTGCCCTTTTGGAGGACAGCTTCAAACTTCACAGCACTCTCAATCCTAAGCAAAAACAAGCCTTAGCCAGGGAGTTGAATCTTCGGCCTCGACAAGTTG
AAGTTTGGTTCCAGAACAGAAGAGCCAGGACGAAGCTCAAGCAAACAGAGGTAGATTGCGAGTTTCTAAAGAGATGCTGCGAAACGCTAACAGACGAAAACAGGAGGCTG
CAAAAAGAACTGCAAGAACTGAAAGCCCTGAAACTAGCGCAGCCTCTGTTCATGCAAATGCCGGCGGCGACACTCACCATGTGCCCGTCCTGCGAGAGGATCGGCGGGGG
CGGCGCCACCGTTAACGGGGACGGAAATCCCAAGGGCCCATTTTCGATGGCTCCCAAGCCCCGGTTTTACAAACCCTTCACCAATCCTTCTGCTGCTTGCTAATGCTACT
GCTAGGATATGTATTAGAATTTGCCTAGCAAGAAACAACAACAAAAAAAAAGAACAACAAATTAGGTAATTAATCAAAGAGGAAAATCCCCAGAAACCCAGGATTTTTTG
GTTGGGGCATCGTTTAGTTCTAGTTGGTTGGTTCTATTAAAAAAAAACCCCAAAAAAAGGACTTAGTCCAATTAAATTAGTTTAATTTGTAGAAAAAAATATAGTAATTG
CTAATCCAATTATTATGTCTCCTTATTTGATTTCATATGTTTGTATATAAAGTAAATTAATATTATTATTGAAGAAAATATTTGTGACATTATGACTTTTCGGAGTTCCT
TCTCTCCAATTTCATAATT
Protein sequence
MGFDDLSNTGLLLGLGWTFPSKPNDDLSQKPKKAVDLFSFAAAQSEPSLTLALSTADTYPLPDLCRQPSPHSAVSSFSGGRVKRERDVSAEDIEEEKASSRVSDEDEDGS
NARKKLRLTKEQSALLEDSFKLHSTLNPKQKQALARELNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKRCCETLTDENRRLQKELQELKALKLAQPLFMQMPAATLTMC
PSCERIGGGGATVNGDGNPKGPFSMAPKPRFYKPFTNPSAAC
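
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The function name `translate` and the short test fragments are illustrative choices, with the fragments copied from the first 30 and last 12 nucleotides of the CDS listed above. To check the full record, paste the complete CDS in place of the fragments and compare the result with the protein sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), amino acids listed in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 12 nt of the CDS shown above.
cds_start = "ATGGGTTTTGATGATCTTTCTAATACAGGC"
cds_end = "GCTGCTTGCTAA"

print(translate(cds_start))  # expected to match the protein's first 10 residues
print(translate(cds_end))    # expected to match the protein's last 3 residues
```

The sketch stops at the first in-frame stop codon and raises on ambiguous bases such as N. Both behaviors are simplifications chosen for this example.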