; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC10g0380 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC10g0380
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionhomeobox protein HAT3.1
Genome locationMC10:3034233..3045993
RNA-Seq ExpressionMC10g0380
SyntenyMC10g0380
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0003677 - DNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001356 - Homeobox domain
IPR001965 - Zinc finger, PHD-type
IPR009057 - Homeobox-like domain superfamily
IPR011011 - Zinc finger, FYVE/PHD-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR017970 - Homeobox, conserved site
IPR019786 - Zinc finger, PHD-type, conserved site
IPR019787 - Zinc finger, PHD-finger


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7030959.1 Homeobox protein HAZ1 [Cucurbita argyrosperma subsp. argyrosperma]0.076.27Show/hide
Query:  MEERHEYTEPRPNNNCEAVQEAKASVEV--LTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLA
        MEER EYTE R  N   AVQEAKA+VEV  LT  +NEQMHS P+  ELGT  + TSKT  PD+EK GV+QNMEE+ KELG G+  S LPE+++QTISKLA
Subjt:  MEERHEYTEPRPNNNCEAVQEAKASVEV--LTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLA

Query:  EIDQVEAGNLLSSDIETENLILPIELETTTL-NECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRS
        + DQ EAGNLLSSD +TENLILPIE+ET  L NECSE P ED NKN I+Q NPPIED  QNTSI+ L  VP  S     +LG KDK++L+SKKKNY+LRS
Subjt:  EIDQVEAGNLLSSDIETENLILPIELETTTL-NECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRS

Query:  LVSSDRVLRSRTQEKAKAPEPSNELNKLTAGE---GKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQR
        LVSSDRVLRSRTQ+KAKAPEPSN+L+ +TAGE   GK+KKK R IKGKGA  DEFSSIRN LRYLVNRIKYEQSLI+AYSSEGWKGFSSDKLKPEKELQR
Subjt:  LVSSDRVLRSRTQEKAKAPEPSNELNKLTAGE---GKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQR

Query:  ASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCK
        AS+EIMR KLKIRDLFQ +D+LC+EGR SE+LFDSEGQIDSEDIFC KCGSKELSLENDIILCDG+CDRGFHQFCLEPPLLN+DIPPDDEGWLCPGCDCK
Subjt:  ASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCK

Query:  DDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQE-----DESSSDQSSSDESGYASAS--EELEAAPNDD
        DDC+DLLNEFQGSNLSITDGWEKV+PEAAAAAAG++SDH + LPSDDS+DGDYDPD PD I+Q+     D SSSDQSSSD+SGYASAS  EELEA PNDD
Subjt:  DDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQE-----DESSSDQSSSDESGYASAS--EELEAAPNDD

Query:  QYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDG---------------TTPVRNSNGQGSGCGPRTSVLHNELQSLLESGPDKDG
        QYLGLPSDDSEDDDY+PGAP  DEGV+QESS SDFTSDSEDLAAL D                T PVRNSNGQ SG GP  +  HN+L SL+ SGPD+ G
Subjt:  QYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDG---------------TTPVRNSNGQGSGCGPRTSVLHNELQSLLESGPDKDG

Query:  LEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPAL--NGTNDDLKNKKTKRSYKRRTHQKPGAENMKNSVT
        LE VSGRR VERLDYKKLHDET+GNVP+DSSDDT+GS SIDSSDDRGRG  TRK SPKN VPAL  NGT DDLKN KTKRS KR T QKP AENM NSVT
Subjt:  LEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPAL--NGTNDDLKNKKTKRSYKRRTHQKPGAENMKNSVT

Query:  RTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKL
        +TPE ++KSSSSVRRT SSS+RRLSQP LERLLASFQENQYP+RATKESLA+ELGLSLKQVSKWFENTRWSTRHPSS E+NKAKS  RMG QSS+TS K 
Subjt:  RTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKL

Query:  PKPEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRT
        PKPEQESGACFRD  +NGAQHQ SP     VAPCQSG T DDKLA QK  RPESTATKSRKRKGRSD VAS SKDRK+S+KPPAKS KV++IQTADKV+ 
Subjt:  PKPEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRT

Query:  RRRRSI
        RRR+S+
Subjt:  RRRRSI

XP_022149322.1 homeobox protein HAT3.1 isoform X1 [Momordica charantia]0.0100Show/hide
Query:  MEERHEYTEPRPNNNCEAVQEAKASVEVLTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLAEI
        MEERHEYTEPRPNNNCEAVQEAKASVEVLTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLAEI
Subjt:  MEERHEYTEPRPNNNCEAVQEAKASVEVLTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLAEI

Query:  DQVEAGNLLSSDIETENLILPIELETTTLNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRSLVS
        DQVEAGNLLSSDIETENLILPIELETTTLNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRSLVS
Subjt:  DQVEAGNLLSSDIETENLILPIELETTTLNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRSLVS

Query:  SDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIM
        SDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIM
Subjt:  SDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIM

Query:  RGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDL
        RGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDL
Subjt:  RGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDL

Query:  LNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSDQSSSDESGYASASEELEAAPNDDQYLGLPSDDSEDD
        LNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSDQSSSDESGYASASEELEAAPNDDQYLGLPSDDSEDD
Subjt:  LNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSDQSSSDESGYASASEELEAAPNDDQYLGLPSDDSEDD

Query:  DYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTPVRNSNGQGSGCGPRTSVLHNELQSLLESGPDKDGLEPVSGRRQVERLDYKKLHDETYGNVPS
        DYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTPVRNSNGQGSGCGPRTSVLHNELQSLLESGPDKDGLEPVSGRRQVERLDYKKLHDETYGNVPS
Subjt:  DYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTPVRNSNGQGSGCGPRTSVLHNELQSLLESGPDKDGLEPVSGRRQVERLDYKKLHDETYGNVPS

Query:  DSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKPGAENMKNSVTRTPEDSVKSSSSVRRTASSSNRRLSQPALE
        DSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKPGAENMKNSVTRTPEDSVKSSSSVRRTASSSNRRLSQPALE
Subjt:  DSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKPGAENMKNSVTRTPEDSVKSSSSVRRTASSSNRRLSQPALE

Query:  RLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPKPEQESGACFRDTDNNGAQHQVSPNTDGA
        RLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPKPEQESGACFRDTDNNGAQHQVSPNTDGA
Subjt:  RLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPKPEQESGACFRDTDNNGAQHQVSPNTDGA

Query:  VAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRTRRRRSI
        VAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRTRRRRSI
Subjt:  VAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRTRRRRSI

XP_022942376.1 homeobox protein HAT3.1 [Cucurbita moschata]0.076.13Show/hide
Query:  MEERHEYTEPRPNNNCEAVQEAKASVEV--LTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLA
        MEER EYTE R  N   AVQEAKASVEV  LT  +NEQMHS P+  ELGT  + TSKT  PD+EK GV+QNMEE+ KELG G+    LPEK++QTISKLA
Subjt:  MEERHEYTEPRPNNNCEAVQEAKASVEV--LTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLA

Query:  EIDQVEAGNLLSSDIETENLILPIELETTTL-NECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRS
        + DQ EAGNLLSSD +TENLILPIE+ETT L NECSE P ED NKN I+Q NPPIED  QNTSI  L  VP  S     ++G KDK++LKSKKKNY+LRS
Subjt:  EIDQVEAGNLLSSDIETENLILPIELETTTL-NECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRS

Query:  LVSSDRVLRSRTQEKAKAPEPSNELNKLTAGE-GKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRAS
        L+SSDRVLRSRTQ+KAKAPEPSN+L+ +TAGE GK KKK R IKGKGA  DEFSSIRN LRYLVNRIKYEQSLI+AYSSEGWKGFSSDKLKPEKELQRAS
Subjt:  LVSSDRVLRSRTQEKAKAPEPSNELNKLTAGE-GKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRAS

Query:  SEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDD
        +EIMR KLKIRDLFQ +D+LC+EGR SE+LFDSEGQIDSEDIFC KCGSKELSLENDIILCDG+CDRGFHQFCLEPPLLN+DIPPDDEGWLCPGCDCKDD
Subjt:  SEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDD

Query:  CLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQE----------DESSSDQSSSDESGYASAS--EELEAAP
        C+DLLNEFQGSNLSITDGWEKV+PEAAAAAAGQ+SDH + LPSDDS+DGDYDPD PD I+Q+          D+SSSD SSSD+SGYASAS  EELEA P
Subjt:  CLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQE----------DESSSDQSSSDESGYASAS--EELEAAP

Query:  NDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDG---------------TTPVRNSNGQGSGCGPRTSVLHNELQSLLESGPD
        NDDQYLGLPSDDSEDDDY+PGAP  DEGV QESS SDFTSDSEDLAAL D                T PVRNS+GQ SG GP  +  HN+L SL+ SGPD
Subjt:  NDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDG---------------TTPVRNSNGQGSGCGPRTSVLHNELQSLLESGPD

Query:  KDGLEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPAL--NGTNDDLKNKKTKRSYKRRTHQKPGAENMKN
        + GLE VSGRR VERLDYKKLHDET+GNVP+DSSDDT+GS SIDSSDDRGRG  TRK SPKN VPAL  NGT DDLKN KTKRS KR T QKP AENM N
Subjt:  KDGLEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPAL--NGTNDDLKNKKTKRSYKRRTHQKPGAENMKN

Query:  SVTRTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETS
        SVT+TPE ++KSSSSVRRT SSS+RRLSQP LERLLASFQENQYP+RATKESLA+ELGLSLKQVSKWFENTRWSTRHPSS E+NKAKSA RMG QSS+TS
Subjt:  SVTRTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETS

Query:  GKLPKPEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADK
         K PKPEQESGACFRDT +NGAQHQ SP     VAPCQSG T DDKLA QK  RPES ATKSRKRKGRSD VAS SKDRK+S+KPPAKS KV++IQTADK
Subjt:  GKLPKPEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADK

Query:  VRTRRRRSI
        V+ RRR+S+
Subjt:  VRTRRRRSI

XP_023531864.1 homeobox protein HAT3.1 isoform X1 [Cucurbita pepo subsp. pepo]0.075.96Show/hide
Query:  MEERHEYTEPRPNNNCEAVQEAKASVEV--LTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLA
        MEER EYTE R  N   AVQEAKASVEV  LT  +NEQMHS P+  ELGT  + TSKT  PD+EK GV+QNMEE+ +ELG G+  S LPEK++QTISKLA
Subjt:  MEERHEYTEPRPNNNCEAVQEAKASVEV--LTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLA

Query:  EIDQVEAGNLLSSDIETENLILPIELETTTL-NECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRS
        + DQ EAGNLLSSD +TENLILPIE+ETT L NEC E P ED NKN I+Q NPPIED  QNT I+ L  VP  S     +LG KDK++LKSKKKNY+LRS
Subjt:  EIDQVEAGNLLSSDIETENLILPIELETTTL-NECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRS

Query:  LVSSDRVLRSRTQEKAKAPEPSNELNKLTAGE---GKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQR
        LVSSDRVLRSRTQ+KAKAPEPSN+L+ +TAGE   GK+KKK R IKGKGA  DEFSSIRN LRYLVNRIKYEQSLI+AYSSEGWKGFSSDKLKPEKELQR
Subjt:  LVSSDRVLRSRTQEKAKAPEPSNELNKLTAGE---GKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQR

Query:  ASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCK
        AS+EIMR KLKIRDLFQ +D+LC+EGR SE+LFDSEGQIDSEDIFC KCGSKELSLENDIILCDG+CDRGFHQFCLEPPLLN+DIPPDDEGWLCPGCDCK
Subjt:  ASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCK

Query:  DDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQE----------DESSSDQSSSDESGYASAS--EELEA
        DDC+DLLNEFQGSNLSITDGWEKV+PEAAAAAAG++SDH + LPSDDS+DGDYDPD PD I+Q+          D+SSSD SSSD+SGYASAS  EELEA
Subjt:  DDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQE----------DESSSDQSSSDESGYASAS--EELEA

Query:  APNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAAL-------DDG--------TTPVRNSNGQGSGCGPRTSVLHNELQSLLESG
         PNDDQYLGLPSDDSEDDDY+PGAP  DEGV QESS SDFTSDSEDLAAL       DD         T PVRNSNGQ SG GP  S  HN+L SL+ SG
Subjt:  APNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAAL-------DDG--------TTPVRNSNGQGSGCGPRTSVLHNELQSLLESG

Query:  PDKDGLEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPAL--NGTNDDLKNKKTKRSYKRRTHQKPGAENM
        PD+ GLE VSGRR VERLDYKKLHDET+GNVP+DSSDDT+GS SIDSSDDRGRG  TRK SPKN VPAL  NGT DDLKN KTK S KR T QKP AENM
Subjt:  PDKDGLEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPAL--NGTNDDLKNKKTKRSYKRRTHQKPGAENM

Query:  KNSVTRTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSE
         NSVT+TPE ++KSSSSVRRT SSS+RRLSQP LERLLASFQENQYP+RATKESLA+ELGLSLKQVSKWFENTRWSTRHPSS E+NKA+SA RMG QSS+
Subjt:  KNSVTRTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSE

Query:  TSGKLPKPEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTA
        TS K PKPEQESGACFRDT +NGAQHQ SP     VAPCQSG T DDK A Q+T RPESTATKSRKRKGRSD VAS SKDRK+S+KPPAKS KV++IQTA
Subjt:  TSGKLPKPEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTA

Query:  DKVRTRRRRSI
        DKV+ RRR+S+
Subjt:  DKVRTRRRRSI

XP_038876083.1 homeobox protein HAT3.1 isoform X1 [Benincasa hispida]0.076.93Show/hide
Query:  MEERHEYT--EPRPNNNCEAVQEAKASVEV--LTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISK
        MEER E T  E RPNN+ E VQEAKASVEV  LTC SNE MHS    QELGTTPE +SKT GPD+EK GVQQNME     LGSG +LSEL EK+NQT+S 
Subjt:  MEERHEYT--EPRPNNNCEAVQEAKASVEV--LTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISK

Query:  LAEIDQVEAGNLLSSDIETENLILPIELETTTL-NECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYML
         A+ DQVEAGNLLSSD +TENL LPIE+ETTTL NECSELP ED NKN I+Q+NPPIEDLTQN SIQ LE +P    S SQQLG KDK ILKSKK NY L
Subjt:  LAEIDQVEAGNLLSSDIETENLILPIELETTTL-NECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYML

Query:  RSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKK--RNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQ
        RSLVSSDRVLRSRTQEKAKAPEPSN LN  TA EGKRKKKK  RNI+GK A  DE+SSIR +LRYL+NRI YEQSLI+AYSSEGWKGFSSDKLKPEKELQ
Subjt:  RSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKK--RNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQ

Query:  RASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDC
        RAS+EIM+ KLKIRDLFQ +D+LCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDC
Subjt:  RASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDC

Query:  KDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSDQSSS-------DESGYASASEELEAAPND
        KDDCLDLLNEFQGSNLSITD WEKVYPEAAAAAAGQNSDH LGLPSDDSEDGDYDPD PDTI+Q++ESSSD+SSS       D SGYASASE LE  PND
Subjt:  KDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSDQSSS-------DESGYASASEELEAAPND

Query:  DQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDG--------------TTPVRNSNGQGSGCGPRTSVLHNELQSLLESGPDKDG
        DQYLGLPSDDSEDDDY+P  PELDEGV++ESS SDFTSDSEDLAALD+               T  V+NSNGQ SGCGP  S LHNEL SL      KDG
Subjt:  DQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDG--------------TTPVRNSNGQGSGCGPRTSVLHNELQSLLESGPDKDG

Query:  LEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPAL--NGTNDDLKNKKTKRSYKRRTHQKPGAENMKNSVT
        LEPVSGRRQVERLDYKKLHDETYGNVP+DSSDDT+GS S+DSS DRG  S TRKR P+NLV AL  NGTNDDL N KTKRS+KR T QK  A N+ NSVT
Subjt:  LEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPAL--NGTNDDLKNKKTKRSYKRRTHQKPGAENMKNSVT

Query:  RTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKL
         TP D+ KSSSS R+T SSSNRRLSQPALERL ASFQEN+YPKRATKESLAQELGLSLKQVS+WFENTRWSTRHPSS   N+AKS+ RM   SS+ SG+L
Subjt:  RTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKL

Query:  PKPEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRT
        PK EQESGACFRDTD+NGAQHQ  P  +    PCQSGDT D KL T+KT R ES+ATKSRKRK  SDH+ASH+KD++ SQ+PPAKSPKVN+IQTAD+ +T
Subjt:  PKPEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRT

Query:  RRRRSI
        RRRRSI
Subjt:  RRRRSI

TrEMBL top hitse value%identityAlignment
A0A1S3C283 pathogenesis-related homeodomain protein0.076Show/hide
Query:  MEERHEYT--EPRPNNNCEAVQEAKASVEV--LTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISK
        MEER E T  E RPN   EAVQEAKASVEV   TC SNE M+S    QELGTTPE + KT GPD+EK+GVQQNME     LGSG +LSEL EK+NQTIS 
Subjt:  MEERHEYT--EPRPNNNCEAVQEAKASVEV--LTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISK

Query:  LAEIDQVEAGNLLSSDIETENLILPIELETTTL-NECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYML
         A+ DQVEAGN LS D +T+NL L IE ETTTL NECSELP ED  KN I+++NPPIEDLTQ TSIQ LET+P    S SQQL HKD++  KSKKKNY L
Subjt:  LAEIDQVEAGNLLSSDIETENLILPIELETTTL-NECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYML

Query:  RSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGE-GKRKKKK-RNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQ
        RSLVSSDRVLRSRTQEKAKAPEPSN+LN  TA E GKRKKKK RNI+GKGA  DE+SSIRN LRYL+NRI+YEQSLI+AYSSEGWKGFSSDKLKPEKELQ
Subjt:  RSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGE-GKRKKKK-RNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQ

Query:  RASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDC
        RAS+EIMR KLKIRDLFQ +D+LCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDC
Subjt:  RASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDC

Query:  KDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQE-----DESSSDQSSSDESGYASASEELEAAPNDDQ
        KDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAA G+NSD  LGLPSDDSEDGDYDPD PDTI+Q+     DESSSDQS+SD SGYASASE LE  PNDDQ
Subjt:  KDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQE-----DESSSDQSSSDESGYASASEELEAAPNDDQ

Query:  YLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDG--------------TTPVRNSNGQGSGCGPRTSVLHNELQSLLESGPDKDGLE
        YLGLPSDDSED+DY+P  PELDEG +QESS SDFTSDSEDLAAL++               T PV+N+NG+ SG  P  S LHNEL SLL+SG DKDGLE
Subjt:  YLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDG--------------TTPVRNSNGQGSGCGPRTSVLHNELQSLLESGPDKDGLE

Query:  PVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPAL--NGTNDDLKNKKTKRSYKRRTHQKPGAENMKNSVTRT
        P+SGRRQVERLDYKKLHDETYGNVP++SSDDT+GS  +DSSDDRG  S TRKR PK LV AL  NG+NDDL N KTKRSYKRRT QKPGA N+ NSVT T
Subjt:  PVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPAL--NGTNDDLKNKKTKRSYKRRTHQKPGAENMKNSVTRT

Query:  PEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPK
        P D+ KSSSSVR+  SSSNRRLSQPALERL ASFQEN+YPKRATKESLAQELGL+LKQVSKWFENTRWSTRHPSS    KAKS+ RM I  S+ SG+L K
Subjt:  PEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPK

Query:  PEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRTRR
         EQES  CFRDTD+NGA+HQ  P  +  VA CQSGDT D KL T+KT R ES+ATKSRKRKGRSD+ AS+SKDR+ S +PPAKSPKVN+ QTAD+ +TRR
Subjt:  PEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRTRR

Query:  RRSI
        RRSI
Subjt:  RRSI

A0A6J1D6Q5 homeobox protein HAT3.1 isoform X10.0100Show/hide
Query:  MEERHEYTEPRPNNNCEAVQEAKASVEVLTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLAEI
        MEERHEYTEPRPNNNCEAVQEAKASVEVLTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLAEI
Subjt:  MEERHEYTEPRPNNNCEAVQEAKASVEVLTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLAEI

Query:  DQVEAGNLLSSDIETENLILPIELETTTLNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRSLVS
        DQVEAGNLLSSDIETENLILPIELETTTLNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRSLVS
Subjt:  DQVEAGNLLSSDIETENLILPIELETTTLNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRSLVS

Query:  SDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIM
        SDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIM
Subjt:  SDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIM

Query:  RGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDL
        RGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDL
Subjt:  RGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDL

Query:  LNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSDQSSSDESGYASASEELEAAPNDDQYLGLPSDDSEDD
        LNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSDQSSSDESGYASASEELEAAPNDDQYLGLPSDDSEDD
Subjt:  LNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSDQSSSDESGYASASEELEAAPNDDQYLGLPSDDSEDD

Query:  DYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTPVRNSNGQGSGCGPRTSVLHNELQSLLESGPDKDGLEPVSGRRQVERLDYKKLHDETYGNVPS
        DYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTPVRNSNGQGSGCGPRTSVLHNELQSLLESGPDKDGLEPVSGRRQVERLDYKKLHDETYGNVPS
Subjt:  DYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTPVRNSNGQGSGCGPRTSVLHNELQSLLESGPDKDGLEPVSGRRQVERLDYKKLHDETYGNVPS

Query:  DSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKPGAENMKNSVTRTPEDSVKSSSSVRRTASSSNRRLSQPALE
        DSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKPGAENMKNSVTRTPEDSVKSSSSVRRTASSSNRRLSQPALE
Subjt:  DSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKPGAENMKNSVTRTPEDSVKSSSSVRRTASSSNRRLSQPALE

Query:  RLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPKPEQESGACFRDTDNNGAQHQVSPNTDGA
        RLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPKPEQESGACFRDTDNNGAQHQVSPNTDGA
Subjt:  RLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPKPEQESGACFRDTDNNGAQHQVSPNTDGA

Query:  VAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRTRRRRSI
        VAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRTRRRRSI
Subjt:  VAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRTRRRRSI

A0A6J1E4I6 homeobox protein HAT3.1-like0.073.47Show/hide
Query:  MEERHEYTEPRPNNNCEAVQEAKASVE--VLTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLA
        MEER EYTE R NNN EAVQEAK SVE  + TC SNEQ HS+PD  EL  TP  ++KT G D+EK  VQQNMEEE +ELGSGDVL EL EK+NQT S LA
Subjt:  MEERHEYTEPRPNNNCEAVQEAKASVE--VLTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLA

Query:  EIDQVEAGNLLSSDIETENLILPIELETTTLN-ECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRS
        + DQVEAGNLL  D +TENLI+PIE+ETTTL  +CSELPPE  NKN I+Q+NPP E LTQNT  Q LETVP    S S+Q  HKDK+ILKS K N +LRS
Subjt:  EIDQVEAGNLLSSDIETENLILPIELETTTLN-ECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRS

Query:  LVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASS
        LVSSDR LRS+TQEK K PEPSN+LN  TA EGK KKK+RNI+GKGA  DEFSSIRN LRYL+NRIKYEQ+LI+AYSSEGWKGFSSDKLKPEKELQRAS+
Subjt:  LVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASS

Query:  EIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDC
        EIMR KLKIRD+FQ +D+LC EG LS+SLFDS+GQIDSEDIFCAKCGSKELS ENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDC
Subjt:  EIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDC

Query:  LDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDS-EDGDYDPDAPDTINQEDESSSDQSSSDESGYASASEELEAAPNDDQYLGLPSDD
        L+LLNEFQGS LSITDGWEKVYPEAAA+AAG+N DHA GLPSDDS +D DYDPD PDTI Q+DESS + S     GYASASEELE+ PN DQYLGLPSDD
Subjt:  LDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDS-EDGDYDPDAPDTINQEDESSSDQSSSDESGYASASEELEAAPNDDQYLGLPSDD

Query:  SEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALD---------------DGTTPVRNSNGQGSGCGPRTSVLHNELQSLLESGPDKDGLEPVSGRRQ
        SEDDDY+P APE DE V+QESS SDFTSDSEDLAALD               + TT ++N +G+ SG GPR S L+NEL SLLESGPDKDG EPV GRRQ
Subjt:  SEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALD---------------DGTTPVRNSNGQGSGCGPRTSVLHNELQSLLESGPDKDGLEPVSGRRQ

Query:  VERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNG--TNDDLKNKKTKRSYKRRTHQKPGAENMKNSVTRTPEDSVKS
        VERLDYKKLHDETYGNVP+DSSDDT+ S+S+DSSDD+G  S TRKRSPK LV AL    TNDDL N KTK S KR T QK  A NM  SV++TPED+ K+
Subjt:  VERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNG--TNDDLKNKKTKRSYKRRTHQKPGAENMKNSVTRTPEDSVKS

Query:  SSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPKPEQESGA
        SSSVRRT  SS RRLS+ ALERLLASFQENQYP+RATKESLAQELGLS+KQVSKWF NTRWSTRHPSS+E NKAKS+ RMGI SS+ SG+L +PEQE GA
Subjt:  SSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPKPEQESGA

Query:  CFRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRTRRRRSI
                  QHQ  P  D  VAPCQSGDT D KLATQ+T R E +ATKSRKRKGRSDH AS SKD KESQ+PPAKSPKVN+IQTA  ++TRRR S+
Subjt:  CFRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRTRRRRSI

A0A6J1FNP3 homeobox protein HAT3.10.076.13Show/hide
Query:  MEERHEYTEPRPNNNCEAVQEAKASVEV--LTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLA
        MEER EYTE R  N   AVQEAKASVEV  LT  +NEQMHS P+  ELGT  + TSKT  PD+EK GV+QNMEE+ KELG G+    LPEK++QTISKLA
Subjt:  MEERHEYTEPRPNNNCEAVQEAKASVEV--LTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLA

Query:  EIDQVEAGNLLSSDIETENLILPIELETTTL-NECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRS
        + DQ EAGNLLSSD +TENLILPIE+ETT L NECSE P ED NKN I+Q NPPIED  QNTSI  L  VP  S     ++G KDK++LKSKKKNY+LRS
Subjt:  EIDQVEAGNLLSSDIETENLILPIELETTTL-NECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRS

Query:  LVSSDRVLRSRTQEKAKAPEPSNELNKLTAGE-GKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRAS
        L+SSDRVLRSRTQ+KAKAPEPSN+L+ +TAGE GK KKK R IKGKGA  DEFSSIRN LRYLVNRIKYEQSLI+AYSSEGWKGFSSDKLKPEKELQRAS
Subjt:  LVSSDRVLRSRTQEKAKAPEPSNELNKLTAGE-GKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRAS

Query:  SEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDD
        +EIMR KLKIRDLFQ +D+LC+EGR SE+LFDSEGQIDSEDIFC KCGSKELSLENDIILCDG+CDRGFHQFCLEPPLLN+DIPPDDEGWLCPGCDCKDD
Subjt:  SEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDD

Query:  CLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQE----------DESSSDQSSSDESGYASAS--EELEAAP
        C+DLLNEFQGSNLSITDGWEKV+PEAAAAAAGQ+SDH + LPSDDS+DGDYDPD PD I+Q+          D+SSSD SSSD+SGYASAS  EELEA P
Subjt:  CLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQE----------DESSSDQSSSDESGYASAS--EELEAAP

Query:  NDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDG---------------TTPVRNSNGQGSGCGPRTSVLHNELQSLLESGPD
        NDDQYLGLPSDDSEDDDY+PGAP  DEGV QESS SDFTSDSEDLAAL D                T PVRNS+GQ SG GP  +  HN+L SL+ SGPD
Subjt:  NDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDG---------------TTPVRNSNGQGSGCGPRTSVLHNELQSLLESGPD

Query:  KDGLEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPAL--NGTNDDLKNKKTKRSYKRRTHQKPGAENMKN
        + GLE VSGRR VERLDYKKLHDET+GNVP+DSSDDT+GS SIDSSDDRGRG  TRK SPKN VPAL  NGT DDLKN KTKRS KR T QKP AENM N
Subjt:  KDGLEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPAL--NGTNDDLKNKKTKRSYKRRTHQKPGAENMKN

Query:  SVTRTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETS
        SVT+TPE ++KSSSSVRRT SSS+RRLSQP LERLLASFQENQYP+RATKESLA+ELGLSLKQVSKWFENTRWSTRHPSS E+NKAKSA RMG QSS+TS
Subjt:  SVTRTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETS

Query:  GKLPKPEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADK
         K PKPEQESGACFRDT +NGAQHQ SP     VAPCQSG T DDKLA QK  RPES ATKSRKRKGRSD VAS SKDRK+S+KPPAKS KV++IQTADK
Subjt:  GKLPKPEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADK

Query:  VRTRRRRSI
        V+ RRR+S+
Subjt:  VRTRRRRSI

A0A6J1IPM8 homeobox protein HAT3.1-like0.076.27Show/hide
Query:  MEERHEYTEPRPNNNCEAVQEAKASVEV--LTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLA
        MEER EYTE R  +   AVQEAKASVEV  LT  +NEQ+ S P+  ELGT  + TSKT  PD+EK GV+QNMEE++KEL  G+  SELPEK++QTISKLA
Subjt:  MEERHEYTEPRPNNNCEAVQEAKASVEV--LTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLA

Query:  EIDQVEAGNLLSSDIETENLILPIELETTTL-NECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRS
        E DQ EAGNLLSSD +TENLILPIE+ETT L NECSE P ED NKN I+Q NPPIE   QNTSI+ L  VP  S     +LG KDK++LKSKKKNY+LRS
Subjt:  EIDQVEAGNLLSSDIETENLILPIELETTTL-NECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRS

Query:  LVSSDRVLRSRTQEKAKAPEPSNELNKLTAGE---GKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQR
        LVSSDRVLRSRTQ+KAKAPEPSN+L+ +TAGE   GK++KK R IKGKGA  DEFSSIRN LRYLVNRIKYEQSLI+AYSSEGWKGFSSDKLKPEKELQR
Subjt:  LVSSDRVLRSRTQEKAKAPEPSNELNKLTAGE---GKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQR

Query:  ASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCK
        AS+EIMR KLKIRDLFQ +D+LC+EGR SE+LFDSEGQIDSEDIFC KCGSKELSLENDIILCDG+CDRGFHQFCLEPPLLN+DIPPDDEGWLCPGCDCK
Subjt:  ASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCK

Query:  DDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQE-----DESSSDQSSSDESGYASAS--EELEAAPNDD
        DDC+DLLNEFQGSNLSITDGWEKV+PEAAAAAAG++SDH + LPSDDS+DGDYDPD PD I+Q+     D SSSDQSSSD+SGYASAS  EELEA PNDD
Subjt:  DDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQE-----DESSSDQSSSDESGYASAS--EELEAAPNDD

Query:  QYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDG---------------TTPVRNSNGQGSGCGPRTSVLHNELQSLLESGPDKDG
        QYLGLPSDDSEDDDY+PGAP  DEGV QESS SDFTSDSEDLAAL D                T PVRNSNGQ SG GP  +  HN+L SL+ SGPD+ G
Subjt:  QYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDG---------------TTPVRNSNGQGSGCGPRTSVLHNELQSLLESGPDKDG

Query:  LEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPAL--NGTNDDLKNKKTKRSYKRRTHQKPGAENMKNSVT
        LE VSGRR VERLDYKKLHDET+GNVPS+SSDDT+GS SIDSSDDRGRG  TRK SPKNLVPAL  NGT DD KN KTK S  RRT QKP AENM NSVT
Subjt:  LEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPAL--NGTNDDLKNKKTKRSYKRRTHQKPGAENMKNSVT

Query:  RTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKL
        +TPE ++KSSSSVRRT SSS+RRLSQP LERLLASFQENQYP+RATKESLA+ELGLSLKQVSKWFENTRWSTRHPSS E+NKAKSA RMG QSS+TS K 
Subjt:  RTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKL

Query:  PKPEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRT
        PKPEQESGACFRDT +NGAQHQ SP     VAPCQSG T DDKLA  KT RPESTATKSRKRKGRSD VAS SK+RK+S+KPPAKS KV++IQTADKV+ 
Subjt:  PKPEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRT

Query:  RRRRSI
        RRR+S+
Subjt:  RRRRSI

SwissProt top hitse value%identityAlignment
P46605 Homeobox protein HOX1A8.1e-10741.37Show/hide
Query:  YMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKEL
        Y L S  S  RVLRS +  K  + E    +        KR+K  R      +S DEFS IR R+RY++NR+ YEQSLI+AY+SEGWK  S DK++PEKEL
Subjt:  YMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKEL

Query:  QRASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCD
        +RA SEI+R KL+IR++F+++DSL ++G++ E+LFDSEG+I  EDIFC+ CGS + +L NDIILCDG CDRGFHQ CL PPL   DIP  DEGWLCP CD
Subjt:  QRASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCD

Query:  CKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPD--TINQEDESSSDQ----SSSDESGYASASEELEAAPN-
        CK DC+DL+NE  GSN+SI D WEKV+P+AAA A     D A  LPSDDS+D D+DP+ P+   + +++ESS +     S SD+S + + S++ E   + 
Subjt:  CKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPD--TINQEDESSSDQ----SSSDESGYASASEELEAAPN-

Query:  --DDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSG--SDFTSDSEDL------AALDDGTTPVRNSNGQG-----SGCGPRTSVLHNELQSLLESGPDK
          DD  L LPS+DSEDDDY+P  P+ D+ V+++SS   SDFTSDS+D       +  D+ ++P+      G     +     TS   + +++ ++ G   
Subjt:  --DDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSG--SDFTSDSEDL------AALDDGTTPVRNSNGQG-----SGCGPRTSVLHNELQSLLESGPDK

Query:  DGLEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGS---ISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKPGAENMKN
          + P S RRQ ERLDYKKL+DE YG   SDSSDD   S     I  S++ G  +     SP      +   ND+L  + TK+S            ++  
Subjt:  DGLEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGS---ISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKPGAENMKN

Query:  SVTRTPEDSVKSSSSVRRTASSSNRRLSQPAL-ERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSET
        SV   P D   + S+     S++ +    P + ++L   F+   YP R+ KESLA+ELGL+ +QV+KWFE  R S R  SS +             S  T
Subjt:  SVTRTPEDSVKSSSSVRRTASSSNRRLSQPAL-ERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSET

Query:  SGKLPK-PE----QESGACFR---DTDNNGAQHQVSPNTDGAVAPCQSGDTRDDK
        +   PK PE    +ES  C              +V   T G+       D+ +D+
Subjt:  SGKLPK-PE----QESGACFR---DTDNNGAQHQVSPNTDGAVAPCQSGDTRDDK

P48785 Pathogenesis-related homeodomain protein1.3e-5129.23Show/hide
Query:  KDKKILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKG
        K  K + +K+ +   +     +   +SRT++ ++      E+ +    + +++K KR  K      D+   ++ R RYL+ ++K +Q+LIDAY++EGWKG
Subjt:  KDKKILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKG

Query:  FSSDKLKPEKELQRASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIP
         S +K++P+KEL+RA  EI+  KL +RD  + LD L + G + E +  S+G I  + IFCA+C S+E   +NDIILCDG C+R FHQ CL+PPL    IP
Subjt:  FSSDKLKPEKELQRASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIP

Query:  PDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNS--DHALGLPSDDSEDGDYDPDAPDTINQEDESSSDQSSSDESGYASASE
        P D+GW C  CDCK + +D +N   G++  +   W+ ++ E A+   G  +  ++    PSDDS+D DYDP        E   +   +SS+ SG      
Subjt:  PDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNS--DHALGLPSDDSEDGDYDPDAPDTINQEDESSSDQSSSDESGYASASE

Query:  ELEAAPNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTPVRNSNGQGSGCGPRTSVLHNELQSLLESGPDKDGLEPVSG
                        D   D+D E                    S S  L+   DG      +   GS  G R S +  + ++  E        E V G
Subjt:  ELEAAPNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTPVRNSNGQGSGCGPRTSVLHNELQSLLESGPDKDGLEPVSG

Query:  RRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKPGAENMKNSVTRTPEDSVK
         RQ   +DY +L+ E +G    D+     GS   D   +  R  +    +   LV     +  D                          V  T E S +
Subjt:  RRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKPGAENMKNSVTRTPEDSVK

Query:  SSSSVRRTASSSNR-RLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPKPEQES
         S SV          RL + A+E+L   F E + P +A ++ LA+EL L  ++V+KWF+NTR+      ++ + K +S  + G  S   SG    PE   
Subjt:  SSSSVRRTASSSNR-RLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPKPEQES

Query:  GACFRDTDNNGAQHQVSPNTDGAVAP
               +NN   ++V    D  V P
Subjt:  GACFRDTDNNGAQHQVSPNTDGAVAP

P48786 Pathogenesis-related homeodomain protein4.9e-12042.64Show/hide
Query:  ENLILPIELETTTLN-ECSELPPEDANKN-SIKQVNPPIEDLTQ-----------------------------------NTSIQRLETVPITSVSISQQL
        E+L +P + ++ T N + SELPPE+A KN +  Q     +D T+                                    T +++L  V  T+   S QL
Subjt:  ENLILPIELETTTLN-ECSELPPEDANKN-SIKQVNPPIEDLTQ-----------------------------------NTSIQRLETVPITSVSISQQL

Query:  GHKDK--------------------KILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEG---KRKKKKRNIKGKGASGDEFSSIRNR
        G   K                    K++  K K+    S V+S R LRSR+QEK+  P    ++N + A EG   ++ +KKR  + +    DEF  IR  
Subjt:  GHKDK--------------------KILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEG---KRKKKKRNIKGKGASGDEFSSIRNR

Query:  LRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDII
        LRYL++RIKYE++ +DAYS EGWKG S DK+KPEKEL+RA +EI   KLKIRDLFQ LD   +EGRL E LFDS G+IDSEDIFCAKCGSK+++L NDII
Subjt:  LRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDII

Query:  LCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVY-PEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDT
        LCDG CDRGFHQFCL+PPLL   IPPDDEGWLCPGC+CK DC+ LLN+ Q +N+ + D WEKV+  EAAAAA+G+N D   GLPSDDSED DYDP  PD 
Subjt:  LCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVY-PEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDT

Query:  INQEDESSSDQSSSDESGYASASEELEAAPNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTPVRNSNGQGSGCGPRTS
           +++   D SS+DES Y S S++++     +   GLPSDDSEDD+Y+P     D+  K +SS SDFTSDSED   + D      +    G   GP  S
Subjt:  INQEDESSSDQSSSDESGYASASEELEAAPNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTPVRNSNGQGSGCGPRTS

Query:  VLHNELQSLLESG-PDKDGLEPVSGRRQVERLDYKKLHD--------------------------ETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKR
           +   +    G P++    P+  RRQVE LDYKKL+D                          E YGN  SDSSD+ +    + SS D+    +    
Subjt:  VLHNELQSLLESG-PDKDGLEPVSGRRQVERLDYKKLHD--------------------------ETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKR

Query:  SPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKPGAENMKNSVTRTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLS
          +         + +L  K  + ++ RR  +K   E   + ++R+ ED   S++ V  + S+S     + A +RLL SF+ENQYP+RA KESLA EL LS
Subjt:  SPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKPGAENMKNSVTRTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLS

Query:  LKQVSKWFENTRWSTRHPSSIESNKAK
        ++QVS WF N RWS RH S I S+ AK
Subjt:  LKQVSKWFENTRWSTRHPSSIESNKAK

Q04996 Homeobox protein HAT3.13.4e-12150.27Show/hide
Query:  RTQEKAKAPEPSNEL-NKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIMRGKLKI
        R Q   +   PS+ + N    G  K+K K  N KG+    DE++ I+ +LRY +NRI YEQSLIDAYS EGWKG S +K++PEKEL+RA+ EI+R KLKI
Subjt:  RTQEKAKAPEPSNEL-NKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIMRGKLKI

Query:  RDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQG
        RDLFQHLD+LCAEG L ESLFD++G+I SEDIFCAKCGSK+LS++NDIILCDG CDRGFHQ+CLEPPL   DIPPDDEGWLCPGCDCKDD LDLLN+  G
Subjt:  RDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQG

Query:  SNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDA-PDTINQED------ESSSDQSSSDESGYASASEEL-----EAAPNDDQYLGLPS
        +  S++D WEK++PEAAAA  G   +    LPSDDS+D +YDPD   D  N ED      ES ++  SSDE+ + SAS+E+     E        + LPS
Subjt:  SNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDA-PDTINQED------ESSSDQSSSDESGYASASEEL-----EAAPNDDQYLGLPS

Query:  DDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDG-TTPVRNSNGQGSGCGPRTSVLHNELQSLLES--GPDKDGLEPVSGRRQVERLDYKKLH
        DDSEDDDY+P AP  D+   +ESS SD TSD+EDL     G  T  +  +      G +TS L  +  ++LES  G D DG   VS RR VERLDYKKL+
Subjt:  DDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDG-TTPVRNSNGQGSGCGPRTSVLHNELQSLLES--GPDKDGLEPVSGRRQVERLDYKKLH

Query:  DETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPK--NLVPALNGTN-DDLKNK----KTKRSYKRRTHQKPGAENMKNSVTRTPEDSVKSSSSVR
        DE Y NVP+ SSDD       D  D   R  +    S    + VP    +N +D  +K    K+KR+ K+ T + P     +N            S  + 
Subjt:  DETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPK--NLVPALNGTN-DDLKNK----KTKRSYKRRTHQKPGAENMKNSVTRTPEDSVKSSSSVR

Query:  RTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWS
        +++SS+ ++ + P  +RL  SFQENQYP +ATKESLA+EL +++KQV+ WF++ RWS
Subjt:  RTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWS

Q8H991 Homeobox protein HAZ13.1e-10643.06Show/hide
Query:  KKILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAG---EGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWK
        +++ K +K++  LR   S  RVLRS +++K KA    NEL    AG     K++K  R  KG G   D++  IR R+RY++NR+ YEQSLI AY+SEGWK
Subjt:  KKILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAG---EGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWK

Query:  GFSSDKLKPEKELQRASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDI
        G S +K++PEKEL+RA  EI+R K +IR+ F++LDSL +EG+L ES+FDS G+I SEDIFCA CGSK+++L+NDIILCDGICDRGFHQ+CL PPLL  DI
Subjt:  GFSSDKLKPEKELQRASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDI

Query:  PPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSDQ---------SSSDE
        P  DEGWLCP CDCK DC+D+LNE QG  LSI D WEKV+PEAA+   G     A  LPSDDS D DYDP        ++E SS +          SS E
Subjt:  PPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSDQ---------SSSDE

Query:  SGYASASEELEAAPN----DDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSG-----SDFTSDSEDLAALDDGTTPVRNSNGQGSGCGPRTSVL-----
           +S  E+ + + N    DD  LGLPS+DSED D++P  P+ D+    ES+      SDFTSDS+D  A       +  S GQ    GP +S +     
Subjt:  SGYASASEELEAAPN----DDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSG-----SDFTSDSEDLAALDDGTTPVRNSNGQGSGCGPRTSVL-----

Query:  ------------HNELQSLLESGPDKDGLEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDT--FGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDD
                     N   + +E+  ++D + P+S +RQVERLDYKKL++E YG   SDSSDD   +G    +S+ ++G           +L  +  G    
Subjt:  ------------HNELQSLLESGPDKDGLEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDT--FGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDD

Query:  LKNKKTKRSYKRRTHQ--KPGAENMKNSVTRTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRW
         +    +      T Q  +PG      SV+    + + S+S+    +++ NR       ++L A F+E+ YP RATKE+LAQELGL+  QV+KWF +TR 
Subjt:  LKNKKTKRSYKRRTHQ--KPGAENMKNSVTRTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRW

Query:  -----STRHPSSIESNKAKS
             +T+  ++IE++ A++
Subjt:  -----STRHPSSIESNKAKS

Arabidopsis top hitse value%identityAlignment
AT3G19510.1 Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain2.4e-12250.27Show/hide
Query:  RTQEKAKAPEPSNEL-NKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIMRGKLKI
        R Q   +   PS+ + N    G  K+K K  N KG+    DE++ I+ +LRY +NRI YEQSLIDAYS EGWKG S +K++PEKEL+RA+ EI+R KLKI
Subjt:  RTQEKAKAPEPSNEL-NKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIMRGKLKI

Query:  RDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQG
        RDLFQHLD+LCAEG L ESLFD++G+I SEDIFCAKCGSK+LS++NDIILCDG CDRGFHQ+CLEPPL   DIPPDDEGWLCPGCDCKDD LDLLN+  G
Subjt:  RDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQG

Query:  SNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDA-PDTINQED------ESSSDQSSSDESGYASASEEL-----EAAPNDDQYLGLPS
        +  S++D WEK++PEAAAA  G   +    LPSDDS+D +YDPD   D  N ED      ES ++  SSDE+ + SAS+E+     E        + LPS
Subjt:  SNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDA-PDTINQED------ESSSDQSSSDESGYASASEEL-----EAAPNDDQYLGLPS

Query:  DDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDG-TTPVRNSNGQGSGCGPRTSVLHNELQSLLES--GPDKDGLEPVSGRRQVERLDYKKLH
        DDSEDDDY+P AP  D+   +ESS SD TSD+EDL     G  T  +  +      G +TS L  +  ++LES  G D DG   VS RR VERLDYKKL+
Subjt:  DDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDG-TTPVRNSNGQGSGCGPRTSVLHNELQSLLES--GPDKDGLEPVSGRRQVERLDYKKLH

Query:  DETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPK--NLVPALNGTN-DDLKNK----KTKRSYKRRTHQKPGAENMKNSVTRTPEDSVKSSSSVR
        DE Y NVP+ SSDD       D  D   R  +    S    + VP    +N +D  +K    K+KR+ K+ T + P     +N            S  + 
Subjt:  DETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPK--NLVPALNGTN-DDLKNK----KTKRSYKRRTHQKPGAENMKNSVTRTPEDSVKSSSSVR

Query:  RTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWS
        +++SS+ ++ + P  +RL  SFQENQYP +ATKESLA+EL +++KQV+ WF++ RWS
Subjt:  RTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWS

AT4G29940.1 pathogenesis related homeodomain protein A9.0e-5329.23Show/hide
Query:  KDKKILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKG
        K  K + +K+ +   +     +   +SRT++ ++      E+ +    + +++K KR  K      D+   ++ R RYL+ ++K +Q+LIDAY++EGWKG
Subjt:  KDKKILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKG

Query:  FSSDKLKPEKELQRASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIP
         S +K++P+KEL+RA  EI+  KL +RD  + LD L + G + E +  S+G I  + IFCA+C S+E   +NDIILCDG C+R FHQ CL+PPL    IP
Subjt:  FSSDKLKPEKELQRASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIP

Query:  PDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNS--DHALGLPSDDSEDGDYDPDAPDTINQEDESSSDQSSSDESGYASASE
        P D+GW C  CDCK + +D +N   G++  +   W+ ++ E A+   G  +  ++    PSDDS+D DYDP        E   +   +SS+ SG      
Subjt:  PDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNS--DHALGLPSDDSEDGDYDPDAPDTINQEDESSSDQSSSDESGYASASE

Query:  ELEAAPNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTPVRNSNGQGSGCGPRTSVLHNELQSLLESGPDKDGLEPVSG
                        D   D+D E                    S S  L+   DG      +   GS  G R S +  + ++  E        E V G
Subjt:  ELEAAPNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTPVRNSNGQGSGCGPRTSVLHNELQSLLESGPDKDGLEPVSG

Query:  RRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKPGAENMKNSVTRTPEDSVK
         RQ   +DY +L+ E +G    D+     GS   D   +  R  +    +   LV     +  D                          V  T E S +
Subjt:  RRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKPGAENMKNSVTRTPEDSVK

Query:  SSSSVRRTASSSNR-RLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPKPEQES
         S SV          RL + A+E+L   F E + P +A ++ LA+EL L  ++V+KWF+NTR+      ++ + K +S  + G  S   SG    PE   
Subjt:  SSSSVRRTASSSNR-RLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPKPEQES

Query:  GACFRDTDNNGAQHQVSPNTDGAVAP
               +NN   ++V    D  V P
Subjt:  GACFRDTDNNGAQHQVSPNTDGAVAP

AT4G29940.2 pathogenesis related homeodomain protein A9.0e-5329.23Show/hide
Query:  KDKKILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKG
        K  K + +K+ +   +     +   +SRT++ ++      E+ +    + +++K KR  K      D+   ++ R RYL+ ++K +Q+LIDAY++EGWKG
Subjt:  KDKKILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKG

Query:  FSSDKLKPEKELQRASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIP
         S +K++P+KEL+RA  EI+  KL +RD  + LD L + G + E +  S+G I  + IFCA+C S+E   +NDIILCDG C+R FHQ CL+PPL    IP
Subjt:  FSSDKLKPEKELQRASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIP

Query:  PDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNS--DHALGLPSDDSEDGDYDPDAPDTINQEDESSSDQSSSDESGYASASE
        P D+GW C  CDCK + +D +N   G++  +   W+ ++ E A+   G  +  ++    PSDDS+D DYDP        E   +   +SS+ SG      
Subjt:  PDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNS--DHALGLPSDDSEDGDYDPDAPDTINQEDESSSDQSSSDESGYASASE

Query:  ELEAAPNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTPVRNSNGQGSGCGPRTSVLHNELQSLLESGPDKDGLEPVSG
                        D   D+D E                    S S  L+   DG      +   GS  G R S +  + ++  E        E V G
Subjt:  ELEAAPNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTPVRNSNGQGSGCGPRTSVLHNELQSLLESGPDKDGLEPVSG

Query:  RRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKPGAENMKNSVTRTPEDSVK
         RQ   +DY +L+ E +G    D+     GS   D   +  R  +    +   LV     +  D                          V  T E S +
Subjt:  RRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKPGAENMKNSVTRTPEDSVK

Query:  SSSSVRRTASSSNR-RLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPKPEQES
         S SV          RL + A+E+L   F E + P +A ++ LA+EL L  ++V+KWF+NTR+      ++ + K +S  + G  S   SG    PE   
Subjt:  SSSSVRRTASSSNR-RLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPKPEQES

Query:  GACFRDTDNNGAQHQVSPNTDGAVAP
               +NN   ++V    D  V P
Subjt:  GACFRDTDNNGAQHQVSPNTDGAVAP

AT5G09790.1 ARABIDOPSIS TRITHORAX-RELATED PROTEIN 57.0e-0538.24Show/hide
Query:  DSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKD
        + E +    ++ C KCGS E   +++++LCD  CDRGFH  CL P ++   I      WLC   DC D
Subjt:  DSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKD

AT5G09790.2 ARABIDOPSIS TRITHORAX-RELATED PROTEIN 57.0e-0538.24Show/hide
Query:  DSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKD
        + E +    ++ C KCGS E   +++++LCD  CDRGFH  CL P ++   I      WLC   DC D
Subjt:  DSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAGAGACATGAATATACAGAACCAAGACCTAATAATAACTGTGAAGCCGTACAAGAAGCCAAGGCCAGTGTTGAAGTGCTAACTTGTTTTTCAAATGAACAAAT
GCATTCAATACCTGACAATCAGGAACTGGGAACAACTCCAGAATGTACCAGCAAAACTGCTGGTCCAGATGATGAAAAATCGGGGGTCCAGCAGAATATGGAGGAAGAGA
CTAAGGAACTTGGTTCTGGAGATGTGCTTAGTGAGTTACCGGAAAAGAATAATCAGACTATCTCTAAGCTTGCTGAAATTGATCAAGTTGAAGCTGGCAATTTATTATCC
AGCGATATAGAAACTGAAAATTTAATATTACCTATTGAATTAGAGACAACAACTCTTAATGAGTGCTCTGAACTTCCACCAGAAGATGCCAACAAAAACTCTATCAAACA
GGTGAACCCTCCCATTGAAGATTTAACCCAGAATACTTCTATCCAAAGGTTAGAAACAGTCCCCATAACTAGTGTCAGTATTTCCCAACAGTTGGGCCACAAGGATAAGA
AAATTTTGAAATCAAAAAAGAAAAACTATATGTTAAGGTCCCTTGTAAGTAGTGACAGAGTTTTGCGTTCAAGGACCCAAGAGAAAGCTAAAGCTCCTGAACCAAGTAAT
GAGTTGAATAAGTTGACTGCCGGAGAGGGAAAAAGGAAGAAGAAGAAGAGAAATATAAAAGGAAAGGGAGCTAGTGGTGATGAATTTTCATCAATCAGGAATCGTTTGAG
ATATTTAGTGAACCGCATCAAATACGAACAGAGCTTGATTGATGCTTATTCTAGTGAAGGCTGGAAAGGGTTCAGCTCAGATAAATTGAAGCCTGAAAAGGAACTTCAAC
GAGCATCAAGTGAAATAATGCGGGGTAAATTGAAAATAAGAGATCTATTTCAACATCTTGATTCACTTTGTGCTGAAGGAAGGCTTTCTGAGTCTCTATTCGATTCTGAA
GGACAGATAGACAGTGAGGATATATTCTGTGCAAAATGTGGATCCAAAGAACTGTCCCTTGAAAATGACATCATACTATGTGATGGTATTTGTGATCGTGGGTTCCACCA
ATTCTGTTTAGAACCACCTTTGCTAAATACAGACATTCCGCCGGATGATGAGGGCTGGCTATGTCCTGGATGTGATTGCAAAGATGACTGCTTGGATCTGCTTAATGAAT
TTCAAGGATCAAATCTTTCTATTACTGATGGTTGGGAGAAAGTCTATCCTGAGGCTGCAGCAGCAGCTGCTGGGCAAAATTCTGATCACGCCTTGGGTCTTCCTTCAGAT
GATTCTGAAGATGGTGATTATGATCCTGATGCTCCAGACACTATTAACCAGGAAGATGAATCGAGTTCTGATCAATCAAGTTCTGATGAATCTGGGTATGCTTCTGCTTC
TGAGGAATTGGAGGCTGCACCCAATGATGACCAATACTTAGGTCTTCCTTCTGATGACTCGGAGGATGATGACTATGAACCTGGTGCTCCAGAACTTGATGAAGGTGTTA
AACAGGAAAGTTCAGGTTCTGACTTTACATCTGATTCTGAGGATCTAGCTGCACTTGATGATGGCACTACGCCTGTGAGAAACTCTAATGGGCAAGGTTCTGGATGCGGT
CCTCGCACGAGTGTACTACATAATGAGTTACAAAGTCTTCTAGAGTCAGGTCCTGATAAGGATGGTCTTGAACCTGTTTCAGGAAGAAGACAGGTGGAACGGTTGGATTA
CAAGAAGCTGCATGATGAGACATACGGGAATGTTCCATCCGACTCAAGCGATGACACATTCGGGAGTATTTCTATTGATTCAAGCGATGACAGAGGTCGGGGTAGTAGAA
CAAGGAAGAGAAGCCCTAAAAACCTGGTTCCTGCATTAAATGGAACTAATGATGATTTGAAAAATAAAAAAACTAAACGTAGTTATAAGAGGAGAACTCATCAAAAGCCA
GGTGCCGAAAATATGAAGAATTCTGTGACTAGGACTCCTGAAGACTCTGTGAAATCTAGTTCTTCTGTTAGACGAACTGCGTCATCATCAAATAGAAGACTCAGTCAACC
AGCGTTGGAGAGACTTCTTGCATCGTTCCAAGAAAATCAGTATCCTAAACGAGCTACAAAAGAGAGTTTGGCACAAGAACTAGGACTCAGTCTGAAGCAGGTTAGCAAAT
GGTTTGAGAACACACGATGGAGCACACGCCATCCTTCAAGCATTGAATCCAATAAAGCAAAGAGTGCTTTAAGAATGGGGATTCAGTCATCTGAGACGAGTGGAAAGCTG
CCCAAGCCTGAGCAAGAATCTGGTGCATGTTTCAGAGACACCGATAACAATGGTGCTCAACACCAAGTATCACCAAACACAGATGGTGCTGTGGCCCCATGTCAGAGTGG
AGATACAAGGGATGACAAATTGGCGACTCAGAAAACTACTAGACCAGAATCTACTGCTACAAAATCCAGAAAACGGAAGGGCAGGTCAGATCATGTGGCATCACATTCAA
AGGACAGAAAGGAATCACAAAAGCCTCCTGCTAAGTCACCAAAAGTTAATCAAATACAAACAGCGGATAAGGTTAGGACAAGGAGGAGGAGATCCATTTAG
mRNA sequenceShow/hide mRNA sequence
AATTAAGTGATGGTCGATGGGCCGGGCAGAATTGGAGGCGCGCACCCGCACTCGCACCCGCACGCAATATGTGCGGCGGTAGCGGAAGGCAGACGGTGCGCGTGAAGGCT
GGCCGAAATGCAGCTGCTTGTAACGGTGGCCCAGATTTGGAGGACTCTGAGACAAAGCCGCTTTATTCTCTCTTCGCGCTGAGGACGAGAACAGCAACAACAACAACAAG
AAGAACCATCGACAAAATTTTACTCACGCCACCAACAAGATCCAAAACCAGGTTCCTCTTCACCAACACCAGGTAGCTCTTCCATTGCCAGAGAGTCAAGACTGATTGTT
CATGGAGGAGTTCTGTGAGGAAAGCTGAGGTGGTCCATTTATATATGCAGAGTAAATAGTCAGCCCAGAAAGAAACTGCCATCCGATGGCAGTTGTAAAAGTTGAAAAAA
TGAACTCCATGACAGCGTCCTGCTTAGCTAATGAGTAGTTGAAAACAGTATCTGAAAAGTAAGGAAAAGTTCGTGCAGGGGACAATTAAAGCAACCTTCTGAAGTTGTCT
CAGGAAAATTCAGCACTTTGTACACTAGAACAGGTTATAATTATTTAATCAACACACCTGGGGTCAAGCACTTCCAAAATGATTGAGTAGGAAGTGGCTTTTTAGTCTCC
AGAAGTGATGAGCAAGAATCTACATTATCTTACAAAAATGTGGTTGATTATTTGGCCTTAAAGTCTGAGCAACCTCTAGTTGATGTAAGCAGGAATAGGAATAGTCAAAC
TGGAGATTAGTAGACATTCTTAGGGGACAATAATGGAAGAGAGACATGAATATACAGAACCAAGACCTAATAATAACTGTGAAGCCGTACAAGAAGCCAAGGCCAGTGTT
GAAGTGCTAACTTGTTTTTCAAATGAACAAATGCATTCAATACCTGACAATCAGGAACTGGGAACAACTCCAGAATGTACCAGCAAAACTGCTGGTCCAGATGATGAAAA
ATCGGGGGTCCAGCAGAATATGGAGGAAGAGACTAAGGAACTTGGTTCTGGAGATGTGCTTAGTGAGTTACCGGAAAAGAATAATCAGACTATCTCTAAGCTTGCTGAAA
TTGATCAAGTTGAAGCTGGCAATTTATTATCCAGCGATATAGAAACTGAAAATTTAATATTACCTATTGAATTAGAGACAACAACTCTTAATGAGTGCTCTGAACTTCCA
CCAGAAGATGCCAACAAAAACTCTATCAAACAGGTGAACCCTCCCATTGAAGATTTAACCCAGAATACTTCTATCCAAAGGTTAGAAACAGTCCCCATAACTAGTGTCAG
TATTTCCCAACAGTTGGGCCACAAGGATAAGAAAATTTTGAAATCAAAAAAGAAAAACTATATGTTAAGGTCCCTTGTAAGTAGTGACAGAGTTTTGCGTTCAAGGACCC
AAGAGAAAGCTAAAGCTCCTGAACCAAGTAATGAGTTGAATAAGTTGACTGCCGGAGAGGGAAAAAGGAAGAAGAAGAAGAGAAATATAAAAGGAAAGGGAGCTAGTGGT
GATGAATTTTCATCAATCAGGAATCGTTTGAGATATTTAGTGAACCGCATCAAATACGAACAGAGCTTGATTGATGCTTATTCTAGTGAAGGCTGGAAAGGGTTCAGCTC
AGATAAATTGAAGCCTGAAAAGGAACTTCAACGAGCATCAAGTGAAATAATGCGGGGTAAATTGAAAATAAGAGATCTATTTCAACATCTTGATTCACTTTGTGCTGAAG
GAAGGCTTTCTGAGTCTCTATTCGATTCTGAAGGACAGATAGACAGTGAGGATATATTCTGTGCAAAATGTGGATCCAAAGAACTGTCCCTTGAAAATGACATCATACTA
TGTGATGGTATTTGTGATCGTGGGTTCCACCAATTCTGTTTAGAACCACCTTTGCTAAATACAGACATTCCGCCGGATGATGAGGGCTGGCTATGTCCTGGATGTGATTG
CAAAGATGACTGCTTGGATCTGCTTAATGAATTTCAAGGATCAAATCTTTCTATTACTGATGGTTGGGAGAAAGTCTATCCTGAGGCTGCAGCAGCAGCTGCTGGGCAAA
ATTCTGATCACGCCTTGGGTCTTCCTTCAGATGATTCTGAAGATGGTGATTATGATCCTGATGCTCCAGACACTATTAACCAGGAAGATGAATCGAGTTCTGATCAATCA
AGTTCTGATGAATCTGGGTATGCTTCTGCTTCTGAGGAATTGGAGGCTGCACCCAATGATGACCAATACTTAGGTCTTCCTTCTGATGACTCGGAGGATGATGACTATGA
ACCTGGTGCTCCAGAACTTGATGAAGGTGTTAAACAGGAAAGTTCAGGTTCTGACTTTACATCTGATTCTGAGGATCTAGCTGCACTTGATGATGGCACTACGCCTGTGA
GAAACTCTAATGGGCAAGGTTCTGGATGCGGTCCTCGCACGAGTGTACTACATAATGAGTTACAAAGTCTTCTAGAGTCAGGTCCTGATAAGGATGGTCTTGAACCTGTT
TCAGGAAGAAGACAGGTGGAACGGTTGGATTACAAGAAGCTGCATGATGAGACATACGGGAATGTTCCATCCGACTCAAGCGATGACACATTCGGGAGTATTTCTATTGA
TTCAAGCGATGACAGAGGTCGGGGTAGTAGAACAAGGAAGAGAAGCCCTAAAAACCTGGTTCCTGCATTAAATGGAACTAATGATGATTTGAAAAATAAAAAAACTAAAC
GTAGTTATAAGAGGAGAACTCATCAAAAGCCAGGTGCCGAAAATATGAAGAATTCTGTGACTAGGACTCCTGAAGACTCTGTGAAATCTAGTTCTTCTGTTAGACGAACT
GCGTCATCATCAAATAGAAGACTCAGTCAACCAGCGTTGGAGAGACTTCTTGCATCGTTCCAAGAAAATCAGTATCCTAAACGAGCTACAAAAGAGAGTTTGGCACAAGA
ACTAGGACTCAGTCTGAAGCAGGTTAGCAAATGGTTTGAGAACACACGATGGAGCACACGCCATCCTTCAAGCATTGAATCCAATAAAGCAAAGAGTGCTTTAAGAATGG
GGATTCAGTCATCTGAGACGAGTGGAAAGCTGCCCAAGCCTGAGCAAGAATCTGGTGCATGTTTCAGAGACACCGATAACAATGGTGCTCAACACCAAGTATCACCAAAC
ACAGATGGTGCTGTGGCCCCATGTCAGAGTGGAGATACAAGGGATGACAAATTGGCGACTCAGAAAACTACTAGACCAGAATCTACTGCTACAAAATCCAGAAAACGGAA
GGGCAGGTCAGATCATGTGGCATCACATTCAAAGGACAGAAAGGAATCACAAAAGCCTCCTGCTAAGTCACCAAAAGTTAATCAAATACAAACAGCGGATAAGGTTAGGA
CAAGGAGGAGGAGATCCATTTAGGTTTTGAAGTTTTGATCCTTCCAGGAAAATTATCCGGCTATTTACCCTCTTCCAGGAAACACATTAGATCAGATACTGGTATTGTTT
GAAGGGAGGTAATTCCAAGAACAATCCCAATTGTTTGTAGTTCTTTGTTATCACATTAACATTCCTCCTAATCCCTTACACACAAATGCTTAGTATGAGTTACCTAAATC
TAGAGCTTGTTTGTTAGATATTTATTTGAATTTGCAAATTAATGCCTGTCTCCATCCGACCCTTCTCATGATATGTGTAACAATCAGTAGCTGGCTATACATTAATTGCG
GAGATAGTTCTGTTAAACAACTTTGCTTTTTTCAGTTTAAGGTTACCTGTGAATTCATTTCAGCTGGAAGTAGTCCATTGCTTCAATGCGCTGCATGGATCTAAGCTTGA
ACACCCCCCGAAGAAGGAAAGAAAAAAAGAACAAGAAAGAAGCAAAAAGACAATAGAGCTACAAAGCACCCTTCAGCTTCAAAGTTTCCTTCTCGAGGGGCAAGCTTTGC
TGCAGATACGATCACTTATGATGCACATTGTTCTCAAGTCTAATTTTGAGATCTCTTTGGCTCTTTGATCCAGCACTTGTCGAGCAAATTATTCTAGTGCCATCATGCTC
AGTGAAACTTTTTGCAATTAATTGTGTTATGAAATGTAAACTTAGGTTCTGCGGAATCGAATTGTTGACTCTAATGTCGAATGTAGTTCATTTACTAGCTGAAAGCAGTC
ATATCTCCTGCAGTTGGGCCCTATTTTTATCAATTCGGATGAGTAAAAGTGATGTAAAATGTTAAACGACTCAAATTAGTATCAATTCCTCTTCCATGTGGGTGGAACTT
GTACTGGAGGCTGGAGCTCTGGGTGGCCTGTGATTGAGCACTGAAATGTATATGGGGCATTTCTCCCCAATGGGGAACCACATGAAGAGGCGACAGAAGACAGAATTTTA
AAGTTTTGAATAATTATTTATCTTTTCTTCTTCCTCCTTACAGCCTACGTAGTCTGATATATTAGATACTGTACCTACGGTTGAAACAAATCAGGCCTCTTGTGTTTCAA
CAAGCAGAAGAGATGTAATATTAATGGTGCACTTTCTGGTTTGACGTAGGAATCTAGCATTTCCACTAAAGTTAGTCATGTACTTGGTATATGAACAAGCTATTCTAGCA
TTTCCACTAAAGTTAGTCATGTACTTGGTATATGAACAAGCTATTCTAGCATTTCCACTAAAGTTAGTCATGTACTTGGTATATGAACAAGCTATTCTAGCATTTCCACT
AAAGTTAGTCATGTACTTGGTATATGAACAAGCTATTCGGCTTTCGAAATTATAGGAATGAAGAAGACGACTTGAGATTACTATGACAGACAGATACAAGAATTAAAGTC
GGAGATTAAGGTCGAGCATGAATGGTTCAGATGGTGTTTCTGGGGGTGGCTCTGGGAGAATTTGATGTTGAGTGAAGCGGAAGCTTCCTGGCCACATCAAATCCATGGCA
TCCCCCAGCATGAAAGGTGTCCATGAACCAGTCTTGACGTGATTAAATCTTGCTACTATTGCAGTATCTTCCCTGCCTGGTTTGTGCACAAGTGAATGTGGCTGAACACC
AAGTGATCTAATCATAGAATTATGGATTGGCAGACCAAGCAAAGTCATCATTCTCTGGGCCTGGTGCCTTCTGGCTGCACTTCTCTCCCTCTTGTGGGCATTTTGGTGGC
CTCCCAATGCCTGGGAACTGTAGAATACCCTCTTACAGAAGTTGCATGAAAAGGTCTTGGCTGAAGCTGCAGCAGATCTTTGTGATTCAGTTAAAAGCTGTTTGTTGCTA
CCCAGGGACAAGCTCAGCCACTCCAAGGTTCCTACACCACCATCAGGGGTTGCTGATTCTTTATCTATCTCTGAAACTGAAGCAGCGGGGATGGTCAGCAGATTGATCCT
TCCTGGTCTTCCCAGATGAACAAATCCTGAGAAAGGTATGAGAATGAAAGGGTAAGTTTTGGTATATATGTTTTTGCATTTCTTATCTTCATTCTCCTCCCGACCATTAG
TGGTCCAGTGGACAATGCTTATGTCATGTATACTATTTCCCTATCAAACTAAAGGATTCCCCACTTCCATCAATCTTTCCCTTCCCTAATTTCTTAATACCCTCCCCATA
TTATACCACTTTTTTACGTTGGAGATGCTTTTAGAAAATCACATATTCAAGGTCTTCAATTATTTGCCTCCACTGAATCATTGCAGGGCTAGTGATCAAAGTGATCAAGT
AAGTTTTAGATTTAGATATCTTTAAAGCTAAGATAGTGAATATTCCATCAATTTTTCGATGCTAAAAATTTTAAATTAGATAGTATGTTTTGTGTGATTGGGTGAGAAG
Protein sequenceShow/hide protein sequence
MEERHEYTEPRPNNNCEAVQEAKASVEVLTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLAEIDQVEAGNLLS
SDIETENLILPIELETTTLNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSN
ELNKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSE
GQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSD
DSEDGDYDPDAPDTINQEDESSSDQSSSDESGYASASEELEAAPNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTPVRNSNGQGSGCG
PRTSVLHNELQSLLESGPDKDGLEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKP
GAENMKNSVTRTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKL
PKPEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRTRRRRSI