CuGenDBv2

MC05g0042 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC05g0042
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: homeobox-leucine zipper protein HAT3-like
Genome location: MC05:354807..357580
RNA-Seq Expression: MC05g0042
Synteny: MC05g0042
Gene Ontology terms:
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR001356 - Homeobox domain
IPR003106 - Leucine zipper, homeobox-associated
IPR006712 - HD-ZIP protein, N-terminal
IPR009057 - Homeobox-like domain superfamily
IPR017970 - Homeobox, conserved site
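
The genome location listed above has the form chromosome:start..end. As a minimal sketch, assuming the coordinates are 1-based and inclusive (this page does not state the convention), the string can be split into its parts and the length of the gene span computed. The function name parse_location is hypothetical and is not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("MC05:354807..357580")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)  # MC05 354807 357580 2774
```

Under that assumption the gene spans 2774 bp on MC05, which covers introns and untranslated regions as well as the coding sequence.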


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAG6582705.1 Homeobox-leucine zipper protein HAT3, partial [Cucurbita argyrosperma subsp. sororia] | e value: 4.90e-157 | % identity: 85.32
Query:  MGGTARDDDLALTLTLGFGVTT----QPTHLHRPPHSIHNHLRKASWNELFQFSDRNADTRSFLRGIDVNRLPTTVDGEEENGVSSPNSTISSISGKRSE
        MGG  RDD+L LTL+LG GVTT    QPTH+HRP +S+HNH+R  SWNELFQFSDRNAD+RSFLRGIDVNRLPT VDGEEENGVSSPNST+SSISGKRSE
Subjt:  MGGTARDDDLALTLTLGFGVTT----QPTHLHRPPHSIHNHLRKASWNELFQFSDRNADTRSFLRGIDVNRLPTTVDGEEENGVSSPNSTISSISGKRSE

Query:  REAVGDEAEAEAEAEAERASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQTEVD
        REA   EAEAEAEAEAERASCS+GSDDEDGGGGD  ASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQTEVD
Subjt:  REAVGDEAEAEAEAEAERASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQTEVD

Query:  CEYLKRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPTRHPPATS--QRPSVAINPWG-VLPIQR
        CEYLKRCCENLT ENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAV+SSSST    T HP A +  QRPS+ INPW  VLPI+R
Subjt:  CEYLKRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPTRHPPATS--QRPSVAINPWG-VLPIQR

XP_004133780.2 homeobox-leucine zipper protein HAT3 [Cucumis sativus] | e value: 2.61e-164 | % identity: 87.16
Query:  MGGTARDDDLALTLTLGFGVTTQPTHLHRPPHSIHNHLRKASWNELFQFSDRNADTRSFLRGIDVNRLPTTVDGEEENGVSSPNSTISSISGKRSEREAV
        MGG  RDDD+ LTL+LGFGVTTQ TH+ RP    ++HLRK  WNELFQFSDRNAD+RSFLRGIDVNRLPT VDGEEENGVSSPNSTISSISGKRSEREA 
Subjt:  MGGTARDDDLALTLTLGFGVTTQPTHLHRPPHSIHNHLRKASWNELFQFSDRNADTRSFLRGIDVNRLPTTVDGEEENGVSSPNSTISSISGKRSEREAV

Query:  GDEAEAEAEAEAE--------RASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQ
        GDEAEAEAEAEAE        RASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNL PRQVEVWFQNRRARTKLKQ
Subjt:  GDEAEAEAEAEAE--------RASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQ

Query:  TEVDCEYLKRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPT-RHPPATS-QRPSVAINPWGVLPIQR
        TEVDCEYLKRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAV+SSSSTSAA T RH  A   QRPS+AINPW VLPIQR
Subjt:  TEVDCEYLKRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPT-RHPPATS-QRPSVAINPWGVLPIQR

XP_008437820.1 PREDICTED: homeobox-leucine zipper protein HAT3-like [Cucumis melo] | e value: 1.87e-168 | % identity: 89.19
Query:  MGGTARDDDLALTLTLGFGVTTQPTHLHRPPHSIHNHLRKASWNELFQFSDRNADTRSFLRGIDVNRLPTTVDGEEENGVSSPNSTISSISGKRSEREAV
        MGG  RDDDL LTL+LGFGVTTQPTH+ RP  S+HNHLRK SWNELFQFSDRNAD+RSFLRGIDVNRLPT VDGEEENGVSSPNSTISSISGKRSEREA 
Subjt:  MGGTARDDDLALTLTLGFGVTTQPTHLHRPPHSIHNHLRKASWNELFQFSDRNADTRSFLRGIDVNRLPTTVDGEEENGVSSPNSTISSISGKRSEREAV

Query:  GDEAEAEAEAEAE--------RASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQ
        GDEAEAEAEAEAE        RASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNL PRQVEVWFQNRRARTKLKQ
Subjt:  GDEAEAEAEAEAE--------RASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQ

Query:  TEVDCEYLKRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPT-RHPPATS-QRPSVAINPWGVLPIQR
        TEVDCEYLKRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAV+SSSSTSAA T RHP A   QR S+AINPW VLPIQR
Subjt:  TEVDCEYLKRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPT-RHPPATS-QRPSVAINPWGVLPIQR

XP_022145200.1 homeobox-leucine zipper protein HAT3-like [Momordica charantia] | e value: 1.96e-196 | % identity: 100
Query:  MGGTARDDDLALTLTLGFGVTTQPTHLHRPPHSIHNHLRKASWNELFQFSDRNADTRSFLRGIDVNRLPTTVDGEEENGVSSPNSTISSISGKRSEREAV
        MGGTARDDDLALTLTLGFGVTTQPTHLHRPPHSIHNHLRKASWNELFQFSDRNADTRSFLRGIDVNRLPTTVDGEEENGVSSPNSTISSISGKRSEREAV
Subjt:  MGGTARDDDLALTLTLGFGVTTQPTHLHRPPHSIHNHLRKASWNELFQFSDRNADTRSFLRGIDVNRLPTTVDGEEENGVSSPNSTISSISGKRSEREAV

Query:  GDEAEAEAEAEAERASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQTEVDCEYL
        GDEAEAEAEAEAERASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQTEVDCEYL
Subjt:  GDEAEAEAEAEAERASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQTEVDCEYL

Query:  KRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPTRHPPATSQRPSVAINPWGVLPIQR
        KRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPTRHPPATSQRPSVAINPWGVLPIQR
Subjt:  KRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPTRHPPATSQRPSVAINPWGVLPIQR

XP_038880433.1 homeobox-leucine zipper protein HAT3 [Benincasa hispida] | e value: 1.10e-165 | % identity: 89.97
Query:  MGGTARDDDLALTLTLGFGVTTQPTHLHRPPH-SIHNHLRKASWNELFQFSDRNADTRSFLRGIDVNRLPTTVDGEEENGVSSPNSTISSISGKRSEREA
        MGG  RDD+L L+L+LGFGVTT   H+ RP + S+HNHLRK+SWNE FQFSDRNAD+RSFLRGIDVNRLPT +DGEEENGVSSPNSTISSISGKRSEREA
Subjt:  MGGTARDDDLALTLTLGFGVTTQPTHLHRPPH-SIHNHLRKASWNELFQFSDRNADTRSFLRGIDVNRLPTTVDGEEENGVSSPNSTISSISGKRSEREA

Query:  VGDEAEAEAEAEAERASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQTEVDCEY
        VGDEAEAEAEAEAERASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQTEVDCEY
Subjt:  VGDEAEAEAEAEAERASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQTEVDCEY

Query:  LKRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPT-RHPPATS-QRPSVAINPWGVLPIQR
        LKRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAV+SSSSTSAA T RHP A   QR SVAINPW VLPIQR
Subjt:  LKRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPT-RHPPATS-QRPSVAINPWGVLPIQR

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0L3P1 Homeobox domain-containing protein | e value: 1.26e-164 | % identity: 87.16
Query:  MGGTARDDDLALTLTLGFGVTTQPTHLHRPPHSIHNHLRKASWNELFQFSDRNADTRSFLRGIDVNRLPTTVDGEEENGVSSPNSTISSISGKRSEREAV
        MGG  RDDD+ LTL+LGFGVTTQ TH+ RP    ++HLRK  WNELFQFSDRNAD+RSFLRGIDVNRLPT VDGEEENGVSSPNSTISSISGKRSEREA 
Subjt:  MGGTARDDDLALTLTLGFGVTTQPTHLHRPPHSIHNHLRKASWNELFQFSDRNADTRSFLRGIDVNRLPTTVDGEEENGVSSPNSTISSISGKRSEREAV

Query:  GDEAEAEAEAEAE--------RASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQ
        GDEAEAEAEAEAE        RASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNL PRQVEVWFQNRRARTKLKQ
Subjt:  GDEAEAEAEAEAE--------RASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQ

Query:  TEVDCEYLKRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPT-RHPPATS-QRPSVAINPWGVLPIQR
        TEVDCEYLKRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAV+SSSSTSAA T RH  A   QRPS+AINPW VLPIQR
Subjt:  TEVDCEYLKRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPT-RHPPATS-QRPSVAINPWGVLPIQR

A0A1S3AUW7 homeobox-leucine zipper protein HAT3-like | e value: 9.07e-169 | % identity: 89.19
Query:  MGGTARDDDLALTLTLGFGVTTQPTHLHRPPHSIHNHLRKASWNELFQFSDRNADTRSFLRGIDVNRLPTTVDGEEENGVSSPNSTISSISGKRSEREAV
        MGG  RDDDL LTL+LGFGVTTQPTH+ RP  S+HNHLRK SWNELFQFSDRNAD+RSFLRGIDVNRLPT VDGEEENGVSSPNSTISSISGKRSEREA 
Subjt:  MGGTARDDDLALTLTLGFGVTTQPTHLHRPPHSIHNHLRKASWNELFQFSDRNADTRSFLRGIDVNRLPTTVDGEEENGVSSPNSTISSISGKRSEREAV

Query:  GDEAEAEAEAEAE--------RASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQ
        GDEAEAEAEAEAE        RASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNL PRQVEVWFQNRRARTKLKQ
Subjt:  GDEAEAEAEAEAE--------RASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQ

Query:  TEVDCEYLKRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPT-RHPPATS-QRPSVAINPWGVLPIQR
        TEVDCEYLKRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAV+SSSSTSAA T RHP A   QR S+AINPW VLPIQR
Subjt:  TEVDCEYLKRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPT-RHPPATS-QRPSVAINPWGVLPIQR

A0A5A7TZ02 Homeobox-leucine zipper protein HAT3-like | e value: 9.07e-169 | % identity: 89.19
Query:  MGGTARDDDLALTLTLGFGVTTQPTHLHRPPHSIHNHLRKASWNELFQFSDRNADTRSFLRGIDVNRLPTTVDGEEENGVSSPNSTISSISGKRSEREAV
        MGG  RDDDL LTL+LGFGVTTQPTH+ RP  S+HNHLRK SWNELFQFSDRNAD+RSFLRGIDVNRLPT VDGEEENGVSSPNSTISSISGKRSEREA 
Subjt:  MGGTARDDDLALTLTLGFGVTTQPTHLHRPPHSIHNHLRKASWNELFQFSDRNADTRSFLRGIDVNRLPTTVDGEEENGVSSPNSTISSISGKRSEREAV

Query:  GDEAEAEAEAEAE--------RASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQ
        GDEAEAEAEAEAE        RASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNL PRQVEVWFQNRRARTKLKQ
Subjt:  GDEAEAEAEAEAE--------RASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQ

Query:  TEVDCEYLKRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPT-RHPPATS-QRPSVAINPWGVLPIQR
        TEVDCEYLKRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAV+SSSSTSAA T RHP A   QR S+AINPW VLPIQR
Subjt:  TEVDCEYLKRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPT-RHPPATS-QRPSVAINPWGVLPIQR

A0A6J1CTT0 homeobox-leucine zipper protein HAT3-like | e value: 9.47e-197 | % identity: 100
Query:  MGGTARDDDLALTLTLGFGVTTQPTHLHRPPHSIHNHLRKASWNELFQFSDRNADTRSFLRGIDVNRLPTTVDGEEENGVSSPNSTISSISGKRSEREAV
        MGGTARDDDLALTLTLGFGVTTQPTHLHRPPHSIHNHLRKASWNELFQFSDRNADTRSFLRGIDVNRLPTTVDGEEENGVSSPNSTISSISGKRSEREAV
Subjt:  MGGTARDDDLALTLTLGFGVTTQPTHLHRPPHSIHNHLRKASWNELFQFSDRNADTRSFLRGIDVNRLPTTVDGEEENGVSSPNSTISSISGKRSEREAV

Query:  GDEAEAEAEAEAERASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQTEVDCEYL
        GDEAEAEAEAEAERASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQTEVDCEYL
Subjt:  GDEAEAEAEAEAERASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQTEVDCEYL

Query:  KRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPTRHPPATSQRPSVAINPWGVLPIQR
        KRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPTRHPPATSQRPSVAINPWGVLPIQR
Subjt:  KRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPTRHPPATSQRPSVAINPWGVLPIQR

A0A6J1E920 homeobox-leucine zipper protein HAT3-like | e value: 5.15e-157 | % identity: 84.98
Query:  MGGTARDDDLALTLTLGFGVTT----QPTHLHRPPHSIHNHLRKASWNELFQFSDRNADTRSFLRGIDVNRLPTTVDGEEENGVSSPNSTISSISGKRSE
        MGG  RDD+L LTL+LG GVTT    QPTH+HRP +S+HNH+R  SWNELFQFSDRNAD+RSFLRGIDVNRLPT VDGEEENGVSSPNST+SSISGKRSE
Subjt:  MGGTARDDDLALTLTLGFGVTT----QPTHLHRPPHSIHNHLRKASWNELFQFSDRNADTRSFLRGIDVNRLPTTVDGEEENGVSSPNSTISSISGKRSE

Query:  REAVGDEAEAEAEAEAERASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQTEVD
        REA   EAEAEAEAEAERASCS+GSDDEDGGGGD  ASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQTEVD
Subjt:  REAVGDEAEAEAEAEAERASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQTEVD

Query:  CEYLKRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPTRHPPATS--QRPSVAINPWG-VLPIQR
        CEYLKRCCENLT ENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAV+SSSST    T HP A +  QRPS+ +NPW  VLPI+R
Subjt:  CEYLKRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPTRHPPATS--QRPSVAINPWG-VLPIQR

SwissProt top hits (columns: e value, % identity, alignment)
P46600 Homeobox-leucine zipper protein HAT1 | e value: 1.0e-66 | % identity: 59.21
Query:  DDLALTLTLGFGVTTQPTHLH-RPPHSIHNHLRKASWNELFQFSDRNADTRSFLRGIDVNRLPTTVDGEEENGVSSPNSTISS-ISGKR--SEREAV-GD
        +DL L+L+LGF     P  L+ +P  S  ++L+   WN+    S  +   + FLR IDVN LPTTVD EEE GVSSPNSTISS +SGKR  +ERE   G 
Subjt:  DDLALTLTLGFGVTTQPTHLH-RPPHSIHNHLRKASWNELFQFSDRNADTRSFLRGIDVNRLPTTVDGEEENGVSSPNSTISS-ISGKR--SEREAV-GD

Query:  EAEAEAEAEAERASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQTEVDCEYLKR
            + +   +R+S SRG+ DE+   G G+  RKKLRLSK+QS VLE+TFKEHNTLNPKQKLALAK+L L  RQVEVWFQNRRARTKLKQTEVDCEYLKR
Subjt:  EAEAEAEAEAERASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQTEVDCEYLKR

Query:  CCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPTRHPPATSQRPSVAINPW
        C E LTEENRRL+KE  ELRALKLSP+LY  M+PPTTL MCP CERVA  SSS+              + SV+++PW
Subjt:  CCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPTRHPPATSQRPSVAINPW

P46601 Homeobox-leucine zipper protein HAT2 | e value: 9.8e-70 | % identity: 59.93
Query:  DDLALTLTLGFGVTTQPTHLH-RPPHSIHNHLRKASWNELFQFSDRNADTRSFLRGIDVNRLPTTVDGEEENGVSSPNSTISS-ISGKRSEREAVGDEAE
        +DL L+L+LGF     P  ++  P  S+ N+L++  WN+ F       D  S LR IDVN  P+TV+ EE+ GVSSPNSTISS ISGKRSERE +     
Subjt:  DDLALTLTLGFGVTTQPTHLH-RPPHSIHNHLRKASWNELFQFSDRNADTRSFLRGIDVNRLPTTVDGEEENGVSSPNSTISS-ISGKRSEREAVGDEAE

Query:  AEAEAEAE---RASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQTEVDCEYLKR
           +   E       SRG+ DE+  G  G+ SRKKLRLSK+QS  LEETFKEHNTLNPKQKLALAK+LNL  RQVEVWFQNRRARTKLKQTEVDCEYLKR
Subjt:  AEAEAEAE---RASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQTEVDCEYLKR

Query:  CCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPTRHPPATSQRPSVAINPW
        C E LTEENRRLQKE  ELR LKLSPQ Y  M PPTTL MCP CERV   SSS+       H    + RP V+INPW
Subjt:  CCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPTRHPPATSQRPSVAINPW

P46602 Homeobox-leucine zipper protein HAT3 | e value: 1.3e-85 | % identity: 62.14
Query:  RDDDLALTLTLGFGVT----------------TQPTHLHRPPHSIHNHLRKA--SWNELFQFSDRNADTRSFLRGIDVNRLPTT--VDGEEEN-GVSSPN
        RDD L L+L+L  G                     +H+     S +NH +K   +W  +FQ S+RN+D RSFLRGIDVNR P+T  VD E+E  GVSSPN
Subjt:  RDDDLALTLTLGFGVT----------------TQPTHLHRPPHSIHNHLRKA--SWNELFQFSDRNADTRSFLRGIDVNRLPTT--VDGEEEN-GVSSPN

Query:  STISSI-SGKRSERE---AVGDEAEAEAE-AEAERASCS--RGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQV
        ST+SS+ SGK+SERE   A G       E  E ERASCS   GSDDEDG G   D+SRKKLRLSKEQ++VLEETFKEH+TLNPKQK+ALAKQLNLR RQV
Subjt:  STISSI-SGKRSERE---AVGDEAEAEAE-AEAERASCS--RGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQV

Query:  EVWFQNRRARTKLKQTEVDCEYLKRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPTRHPPATSQRPSVAIN
        EVWFQNRRARTKLKQTEVDCEYLKRCCENLT+ENRRLQKEV ELRALKLSP LYMHM PPTTLTMCP CERVAV SSSS+ A P  +    S  P   ++
Subjt:  EVWFQNRRARTKLKQTEVDCEYLKRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPTRHPPATSQRPSVAIN

Query:  PWGVLPIQR
        PW  +P+++
Subjt:  PWGVLPIQR

P92953 Homeobox-leucine zipper protein ATHB-4 | e value: 4.4e-78 | % identity: 57.41
Query:  RDDDLALTLTLGFGVTTQPT------------------HLHRPPHSIH-NHLRKASWNELFQFS-------DRNADTRSFLRGIDVNRLPTT---VDGEE
        RDD L L+L+LG     +P+                  H+H   ++ H   +   SW  LFQ S       +RN+D  SFLRG +VNR  ++   VD EE
Subjt:  RDDDLALTLTLGFGVTTQPT------------------HLHRPPHSIH-NHLRKASWNELFQFS-------DRNADTRSFLRGIDVNRLPTT---VDGEE

Query:  ENG-VSSPNSTISSISGKRSEREAVGDEAEAEAEAEAERASCSR-----GSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQL
        E   VSSPNS +SS+SG + +       A    E EAERASCSR     GSDDEDGG GDG  SRKKLRLSK+Q++VLEETFKEH+TLNPKQKLALAKQL
Subjt:  ENG-VSSPNSTISSISGKRSEREAVGDEAEAEAEAEAERASCSR-----GSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQL

Query:  NLRPRQVEVWFQNRRARTKLKQTEVDCEYLKRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPTRHPPATSQ
        NLR RQVEVWFQNRRARTKLKQTEVDCEYLKRCC+NLTEENRRLQKEV ELRALKLSP LYMHM PPTTLTMCP CERV+ ++++ T+A  T   P    
Subjt:  NLRPRQVEVWFQNRRARTKLKQTEVDCEYLKRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPTRHPPATSQ

Query:  RPS-VAINPWGVLPIQR
        RPS   + PW  + +Q+
Subjt:  RPS-VAINPWGVLPIQR

Q05466 Homeobox-leucine zipper protein HAT4 | e value: 1.6e-72 | % identity: 58.3
Query:  DDLALTLTLGF-----GVTTQPTHLHRPPHSIHNHLRKASWNELFQFSDRNAD-----TRSFLRGIDVNRLPTTVD-GEEENGVSSPNSTISSISGKRSE
        DDL L+L L F      + + P+    P  S     R++SWNE F  S  N+D     TR+F+RGIDVNR P+T + G+E+ GVSSPNST+SS +GKRSE
Subjt:  DDLALTLTLGF-----GVTTQPTHLHRPPHSIHNHLRKASWNELFQFSDRNAD-----TRSFLRGIDVNRLPTTVD-GEEENGVSSPNSTISSISGKRSE

Query:  REAVGDEAEAEAEAEAERASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQTEVD
        RE            E      SRG  D++    DGD SRKKLRLSK+QS +LEETFK+H+TLNPKQK ALAKQL LR RQVEVWFQNRRARTKLKQTEVD
Subjt:  REAVGDEAEAEAEAEAERASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQTEVD

Query:  CEYLKRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPTRHPPATSQRPSVAINPW
        CE+L+RCCENLTEENRRLQKEV ELRALKLSPQ YMHM+PPTTLTMCP CE V+V      +A    H        S+ +N W
Subjt:  CEYLKRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPTRHPPATSQRPSVAINPW

Arabidopsis top hits (columns: e value, % identity, alignment)
AT2G44910.1 homeobox-leucine zipper protein 4 | e value: 3.1e-79 | % identity: 57.41
Query:  RDDDLALTLTLGFGVTTQPT------------------HLHRPPHSIH-NHLRKASWNELFQFS-------DRNADTRSFLRGIDVNRLPTT---VDGEE
        RDD L L+L+LG     +P+                  H+H   ++ H   +   SW  LFQ S       +RN+D  SFLRG +VNR  ++   VD EE
Subjt:  RDDDLALTLTLGFGVTTQPT------------------HLHRPPHSIH-NHLRKASWNELFQFS-------DRNADTRSFLRGIDVNRLPTT---VDGEE

Query:  ENG-VSSPNSTISSISGKRSEREAVGDEAEAEAEAEAERASCSR-----GSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQL
        E   VSSPNS +SS+SG + +       A    E EAERASCSR     GSDDEDGG GDG  SRKKLRLSK+Q++VLEETFKEH+TLNPKQKLALAKQL
Subjt:  ENG-VSSPNSTISSISGKRSEREAVGDEAEAEAEAEAERASCSR-----GSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQL

Query:  NLRPRQVEVWFQNRRARTKLKQTEVDCEYLKRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPTRHPPATSQ
        NLR RQVEVWFQNRRARTKLKQTEVDCEYLKRCC+NLTEENRRLQKEV ELRALKLSP LYMHM PPTTLTMCP CERV+ ++++ T+A  T   P    
Subjt:  NLRPRQVEVWFQNRRARTKLKQTEVDCEYLKRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPTRHPPATSQ

Query:  RPS-VAINPWGVLPIQR
        RPS   + PW  + +Q+
Subjt:  RPS-VAINPWGVLPIQR

AT3G60390.1 homeobox-leucine zipper protein 3 | e value: 9.1e-87 | % identity: 62.14
Query:  RDDDLALTLTLGFGVT----------------TQPTHLHRPPHSIHNHLRKA--SWNELFQFSDRNADTRSFLRGIDVNRLPTT--VDGEEEN-GVSSPN
        RDD L L+L+L  G                     +H+     S +NH +K   +W  +FQ S+RN+D RSFLRGIDVNR P+T  VD E+E  GVSSPN
Subjt:  RDDDLALTLTLGFGVT----------------TQPTHLHRPPHSIHNHLRKA--SWNELFQFSDRNADTRSFLRGIDVNRLPTT--VDGEEEN-GVSSPN

Query:  STISSI-SGKRSERE---AVGDEAEAEAE-AEAERASCS--RGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQV
        ST+SS+ SGK+SERE   A G       E  E ERASCS   GSDDEDG G   D+SRKKLRLSKEQ++VLEETFKEH+TLNPKQK+ALAKQLNLR RQV
Subjt:  STISSI-SGKRSERE---AVGDEAEAEAE-AEAERASCS--RGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQV

Query:  EVWFQNRRARTKLKQTEVDCEYLKRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPTRHPPATSQRPSVAIN
        EVWFQNRRARTKLKQTEVDCEYLKRCCENLT+ENRRLQKEV ELRALKLSP LYMHM PPTTLTMCP CERVAV SSSS+ A P  +    S  P   ++
Subjt:  EVWFQNRRARTKLKQTEVDCEYLKRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPTRHPPATSQRPSVAIN

Query:  PWGVLPIQR
        PW  +P+++
Subjt:  PWGVLPIQR

AT4G16780.1 homeobox protein 2 | e value: 1.1e-73 | % identity: 58.3
Query:  DDLALTLTLGF-----GVTTQPTHLHRPPHSIHNHLRKASWNELFQFSDRNAD-----TRSFLRGIDVNRLPTTVD-GEEENGVSSPNSTISSISGKRSE
        DDL L+L L F      + + P+    P  S     R++SWNE F  S  N+D     TR+F+RGIDVNR P+T + G+E+ GVSSPNST+SS +GKRSE
Subjt:  DDLALTLTLGF-----GVTTQPTHLHRPPHSIHNHLRKASWNELFQFSDRNAD-----TRSFLRGIDVNRLPTTVD-GEEENGVSSPNSTISSISGKRSE

Query:  REAVGDEAEAEAEAEAERASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQTEVD
        RE            E      SRG  D++    DGD SRKKLRLSK+QS +LEETFK+H+TLNPKQK ALAKQL LR RQVEVWFQNRRARTKLKQTEVD
Subjt:  REAVGDEAEAEAEAEAERASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQTEVD

Query:  CEYLKRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPTRHPPATSQRPSVAINPW
        CE+L+RCCENLTEENRRLQKEV ELRALKLSPQ YMHM+PPTTLTMCP CE V+V      +A    H        S+ +N W
Subjt:  CEYLKRCCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPTRHPPATSQRPSVAINPW

AT4G17460.1 Homeobox-leucine zipper protein 4 (HB-4) / HD-ZIP protein | e value: 7.2e-68 | % identity: 59.21
Query:  DDLALTLTLGFGVTTQPTHLH-RPPHSIHNHLRKASWNELFQFSDRNADTRSFLRGIDVNRLPTTVDGEEENGVSSPNSTISS-ISGKR--SEREAV-GD
        +DL L+L+LGF     P  L+ +P  S  ++L+   WN+    S  +   + FLR IDVN LPTTVD EEE GVSSPNSTISS +SGKR  +ERE   G 
Subjt:  DDLALTLTLGFGVTTQPTHLH-RPPHSIHNHLRKASWNELFQFSDRNADTRSFLRGIDVNRLPTTVDGEEENGVSSPNSTISS-ISGKR--SEREAV-GD

Query:  EAEAEAEAEAERASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQTEVDCEYLKR
            + +   +R+S SRG+ DE+   G G+  RKKLRLSK+QS VLE+TFKEHNTLNPKQKLALAK+L L  RQVEVWFQNRRARTKLKQTEVDCEYLKR
Subjt:  EAEAEAEAEAERASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQTEVDCEYLKR

Query:  CCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPTRHPPATSQRPSVAINPW
        C E LTEENRRL+KE  ELRALKLSP+LY  M+PPTTL MCP CERVA  SSS+              + SV+++PW
Subjt:  CCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPTRHPPATSQRPSVAINPW

AT5G47370.1 Homeobox-leucine zipper protein 4 (HB-4) / HD-ZIP protein | e value: 7.0e-71 | % identity: 59.93
Query:  DDLALTLTLGFGVTTQPTHLH-RPPHSIHNHLRKASWNELFQFSDRNADTRSFLRGIDVNRLPTTVDGEEENGVSSPNSTISS-ISGKRSEREAVGDEAE
        +DL L+L+LGF     P  ++  P  S+ N+L++  WN+ F       D  S LR IDVN  P+TV+ EE+ GVSSPNSTISS ISGKRSERE +     
Subjt:  DDLALTLTLGFGVTTQPTHLH-RPPHSIHNHLRKASWNELFQFSDRNADTRSFLRGIDVNRLPTTVDGEEENGVSSPNSTISS-ISGKRSEREAVGDEAE

Query:  AEAEAEAE---RASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQTEVDCEYLKR
           +   E       SRG+ DE+  G  G+ SRKKLRLSK+QS  LEETFKEHNTLNPKQKLALAK+LNL  RQVEVWFQNRRARTKLKQTEVDCEYLKR
Subjt:  AEAEAEAE---RASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQTEVDCEYLKR

Query:  CCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPTRHPPATSQRPSVAINPW
        C E LTEENRRLQKE  ELR LKLSPQ Y  M PPTTL MCP CERV   SSS+       H    + RP V+INPW
Subjt:  CCENLTEENRRLQKEVQELRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPTRHPPATSQRPSVAINPW
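
In the alignments above, the unlabeled middle line of each block follows the usual BLAST-style convention: an identical residue is shown as its letter, a conservative substitution as '+', and a mismatch or gap as a space. A rough sketch of counting identities in a single block follows. The function name midline_identity is hypothetical, and the count covers only the block passed in, so it will not in general equal the whole-alignment % identity reported for each hit.

```python
def midline_identity(query, midline):
    """Return (identical_columns, total_columns) for one alignment block."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # A column is identical when the midline repeats the query residue.
    identical = sum(1 for q, m in zip(query, midline) if m == q and q != "-")
    return identical, len(query)


# Final block of the P46602 (HAT3) alignment, with the 8-character
# line prefix ("Query:  " or spaces) already removed.
query = "PWGVLPIQR"
midline = "PW  +P+++"
print(midline_identity(query, midline))  # (3, 9)
```

Summing the two counts over every block of a hit gives a figure that can be compared with the tabulated % identity, although the tool that produced the tables may define its denominator differently.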


Sequences
CDS sequence
ATGGGTGGGACTGCGAGAGACGACGACTTGGCCCTCACTCTCACCTTAGGGTTTGGAGTTACCACTCAGCCCACTCACCTGCACAGGCCCCCCCACTCCATCCACAATCA
CCTACGCAAGGCGTCTTGGAACGAGCTTTTTCAATTTTCTGATCGAAACGCCGATACGAGGTCGTTTCTTCGGGGAATCGACGTGAATCGGCTGCCGACGACCGTGGATG
GCGAGGAAGAAAACGGCGTTTCTTCTCCGAACAGTACGATTTCTAGCATCAGCGGGAAGAGGAGCGAGAGAGAAGCGGTCGGAGACGAGGCCGAGGCGGAGGCAGAGGCG
GAAGCCGAGAGAGCCTCGTGCTCGCGGGGGAGTGACGATGAAGACGGCGGCGGGGGCGATGGCGACGCCTCGAGGAAGAAGCTGAGGCTATCGAAGGAGCAGTCGATGGT
GCTCGAGGAGACCTTCAAAGAGCACAACACTCTGAATCCAAAGCAAAAGCTGGCACTTGCAAAGCAGCTGAATCTGAGACCTAGACAGGTGGAGGTGTGGTTTCAAAACA
GGAGGGCAAGGACCAAGTTGAAGCAGACAGAAGTGGATTGCGAGTACTTGAAGAGGTGCTGTGAGAATCTAACAGAGGAGAACAGAAGGCTGCAAAAGGAGGTGCAAGAG
CTGAGAGCACTCAAGCTCTCTCCACAGCTCTATATGCACATGAATCCGCCTACCACCCTCACCATGTGCCCTCAGTGTGAGCGTGTGGCTGTCGCCTCGTCCTCTTCGAC
CTCGGCCGCTCCGACCCGCCATCCACCGGCCACCTCACAGCGTCCCTCAGTAGCCATCAATCCGTGGGGGGTGTTGCCGATCCAACGTTGA
mRNA sequence
TGAGCTCAACTGAACTGTGTTGGTCTTGGTTTAACCATGGTTTGGTAACCGAGGGTTAAAACTTAAAACCCTAACAAGGGACAGAGGGCAGAGGGCAGAGGGCAGAGGGT
AGGGTTATATAAAGCCATTTTATCATGATTGAGCCAATAGCCCCTCTGACACAAGTGATGATTAAATCTACACCCCTCGTGTCTGTCTCTCTCTCTCTCTCTCTCTCTCT
CTCTCTCTTTATAATATTAATAATTAATTTCTCTGTGTGGCTGTATATATAGTCCCATCCATGCTTTTACTCTGCTGCAAAAGAAAAGCCTATGACATCTCTGCCGCAGC
GCCTGCCCGTTTTCCCACCATTTTCTTTCCCCTAATTCAAACACAACCACTAACACTACCACTACCACTGCGACTACGTATTTATATTTAACTCTCTATTCCATCATTGC
CTTTCCCTTCCTTTCCTTCTCTACTGCACCAGCACAATGGGTGGGACTGCGAGAGACGACGACTTGGCCCTCACTCTCACCTTAGGGTTTGGAGTTACCACTCAGCCCAC
TCACCTGCACAGGCCCCCCCACTCCATCCACAATCACCTACGCAAGGCGTCTTGGAACGAGCTTTTTCAATTTTCTGATCGAAACGCCGATACGAGGTCGTTTCTTCGGG
GAATCGACGTGAATCGGCTGCCGACGACCGTGGATGGCGAGGAAGAAAACGGCGTTTCTTCTCCGAACAGTACGATTTCTAGCATCAGCGGGAAGAGGAGCGAGAGAGAA
GCGGTCGGAGACGAGGCCGAGGCGGAGGCAGAGGCGGAAGCCGAGAGAGCCTCGTGCTCGCGGGGGAGTGACGATGAAGACGGCGGCGGGGGCGATGGCGACGCCTCGAG
GAAGAAGCTGAGGCTATCGAAGGAGCAGTCGATGGTGCTCGAGGAGACCTTCAAAGAGCACAACACTCTGAATCCAAAGCAAAAGCTGGCACTTGCAAAGCAGCTGAATC
TGAGACCTAGACAGGTGGAGGTGTGGTTTCAAAACAGGAGGGCAAGGACCAAGTTGAAGCAGACAGAAGTGGATTGCGAGTACTTGAAGAGGTGCTGTGAGAATCTAACA
GAGGAGAACAGAAGGCTGCAAAAGGAGGTGCAAGAGCTGAGAGCACTCAAGCTCTCTCCACAGCTCTATATGCACATGAATCCGCCTACCACCCTCACCATGTGCCCTCA
GTGTGAGCGTGTGGCTGTCGCCTCGTCCTCTTCGACCTCGGCCGCTCCGACCCGCCATCCACCGGCCACCTCACAGCGTCCCTCAGTAGCCATCAATCCGTGGGGGGTGT
TGCCGATCCAACGTTGAAGGTTGTATTTTATCATTTTTCCTGCGGGGGATTGAGTTTGATGTTTATGGAATATGGGATGGAAAATTTTGTTTTGGTGGTGTGGTGATTAG
GTCGGTTAGCTGCCAAATTCTCTCCTTAACAAAACCTAGTTAGATATTGCAGAGAGAGAGGAACGGACAATTAATTAAAGATCAATGGAGACCTTTTTATTTTCACAAAC
AATACTAGTGTTTAGAAAGCAATGATTAGAGCTCTTATCACATACTGAATTGTAGTACTCAAGAAAAAAAAAAAAGATTTGGATATACTAGTAAATAGGTGTAATTGGTA
TTTAATTTCTTAATGAATAAGCCAGCATCCTCATATTTATTGTATCTCAAAAAAAAAAAAAAATAATAATATAACTGCACCAAATCGATGAAAGGGGCCAGTTACAAAAT
TTCAGTAAAAATGCTATTCACTATACATTTTCAGAAGGAAAAAAGACAAATAAAAAGGAAAAAGCCTATGGAGATGAGATGAAAAGTCCATATTCCATGGGCCAAACAAG
CATGGGCTTCCGAGAACGAGGTTTATTATGGGCCTCCATGATTAGTTCGCCCATTTGGGGCATTCATGTTGTTTTTT
Protein sequence
MGGTARDDDLALTLTLGFGVTTQPTHLHRPPHSIHNHLRKASWNELFQFSDRNADTRSFLRGIDVNRLPTTVDGEEENGVSSPNSTISSISGKRSEREAVGDEAEAEAEA
EAERASCSRGSDDEDGGGGDGDASRKKLRLSKEQSMVLEETFKEHNTLNPKQKLALAKQLNLRPRQVEVWFQNRRARTKLKQTEVDCEYLKRCCENLTEENRRLQKEVQE
LRALKLSPQLYMHMNPPTTLTMCPQCERVAVASSSSTSAAPTRHPPATSQRPSVAINPWGVLPIQR
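
The protein sequence above corresponds to the translation of the CDS sequence. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1), this can be checked on the first and last codons of the CDS. The function name translate is hypothetical, and the two fragments are copied from the start and end of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


print(translate("ATGGGTGGGACTGCGAGAGACGAC"))  # MGGTARDD (start of the protein)
print(translate("CCGATCCAACGTTGA"))           # PIQR* (end of the protein plus stop)
```

Passing the full CDS, joined into one string with the line breaks removed, should reproduce the protein sequence followed by a single '*' for the terminal TGA codon.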