; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS012433 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS012433
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionhomeobox protein HAT3.1
Genome locationscaffold63:654980..661319
RNA-Seq ExpressionMS012433
SyntenyMS012433
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0003677 - DNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001356 - Homeobox domain
IPR001965 - Zinc finger, PHD-type
IPR009057 - Homeobox-like domain superfamily
IPR011011 - Zinc finger, FYVE/PHD-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR017970 - Homeobox, conserved site
IPR019786 - Zinc finger, PHD-type, conserved site
IPR019787 - Zinc finger, PHD-finger


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600300.1 Homeobox protein HAZ1, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0076.97Show/hide
Query:  MEERHEYTEPRPNNNCEAVQEAKAS--VEVLTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLA
        MEER EYTE R  N   AVQEAKA+  VEVLT  +NEQMHS P+  ELGT  + TSKT  PD+EK GV+QNMEE+ KELG G+  S LPE+++QTISKLA
Subjt:  MEERHEYTEPRPNNNCEAVQEAKAS--VEVLTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLA

Query:  EIDQVEAGNLLSSDIETENLILPIELET-TTLNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRS
        + DQ EAGNLLSSD +TENLILPIE+ET   LNECSE P ED NKN I+Q NPPIED  QNTSI+ L  VP      S +LG KDK++LKSKKKNY+LRS
Subjt:  EIDQVEAGNLLSSDIETENLILPIELET-TTLNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRS

Query:  LVSSDRVLRSRTQEKAKAPEPSNELNKLTAGE---GKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQR
        LVSSDRVLRSRTQ+KAKAPEPSN+L+ +TAGE   GK+KKK R IKGKGA  DEFSSIRN LRYLVNRIKYEQSLI+AYSSEGWKGFSSDKLKPEKELQR
Subjt:  LVSSDRVLRSRTQEKAKAPEPSNELNKLTAGE---GKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQR

Query:  ASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCK
        AS+EIMR KLKIRDLFQ +D+LC+EGR SE+LFDSEGQIDSEDIFC KCGSKELSLENDIILCDG+CDRGFHQFCLEPPLLN+DIPPDDEGWLCPGCDCK
Subjt:  ASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCK

Query:  DDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSDESSSDQSSSDESGY--ASASEELEAAPNDD
        DDC+DLLNEFQGSNLSITDGWEKV+PEAAAAAAG++SDH + LPSDDS+DGDYDPD PD I+Q+ ESSSD SSSDQSSSD+SGY  ASASEELEA PNDD
Subjt:  DDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSDESSSDQSSSDESGY--ASASEELEAAPNDD

Query:  QYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDD---------------GTTPVRNSNGQGSGCGPSTSVLHNELQSLLESGPDKDG
        QYLGLPSDDSEDDDY+PGAP  DEGV+QESS SDFTSDSEDLAAL D                T PVRNSNGQ SG GP+ +  HN+L SL+ SGPD+ G
Subjt:  QYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDD---------------GTTPVRNSNGQGSGCGPSTSVLHNELQSLLESGPDKDG

Query:  LEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPAL--NGTNDDLKNKKTKRSYKRRTHQKPGAENMNNSVT
        LE VSGRR VERLDYKKLHDET+GNVP+DSSDDT+GS SIDSSDDRGRG  TRK SPKN VPAL  NGT DDLKN KTKRS K RT QKP AENM+NSVT
Subjt:  LEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPAL--NGTNDDLKNKKTKRSYKRRTHQKPGAENMNNSVT

Query:  RTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKL
        +TPE ++KSSSSVRRT SSS+RRLSQP LERLLASFQENQYP+RATKESLA+ELGLSLKQVSKWFENTRWSTRHPSS E+NKAKS  RMG QSS TS K 
Subjt:  RTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKL

Query:  PKPEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVR
        PKPEQESGACFRD  +NGAQHQ SP     VAPCQSG T DDKLA QK  RPESTATKSRKRKGRSD VAS SKDRK+S+KPPAKS KV++IQTADK R
Subjt:  PKPEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVR

KAG7030959.1 Homeobox protein HAZ1 [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0076.71Show/hide
Query:  MEERHEYTEPRPNNNCEAVQEAKAS--VEVLTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLA
        MEER EYTE R  N   AVQEAKA+  VEVLT  +NEQMHS P+  ELGT  + TSKT  PD+EK GV+QNMEE+ KELG G+  S LPE+++QTISKLA
Subjt:  MEERHEYTEPRPNNNCEAVQEAKAS--VEVLTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLA

Query:  EIDQVEAGNLLSSDIETENLILPIELET-TTLNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRS
        + DQ EAGNLLSSD +TENLILPIE+ET   LNECSE P ED NKN I+Q NPPIED  QNTSI+ L  VP      S +LG KDK++L+SKKKNY+LRS
Subjt:  EIDQVEAGNLLSSDIETENLILPIELET-TTLNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRS

Query:  LVSSDRVLRSRTQEKAKAPEPSNELNKLTAGE---GKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQR
        LVSSDRVLRSRTQ+KAKAPEPSN+L+ +TAGE   GK+KKK R IKGKGA  DEFSSIRN LRYLVNRIKYEQSLI+AYSSEGWKGFSSDKLKPEKELQR
Subjt:  LVSSDRVLRSRTQEKAKAPEPSNELNKLTAGE---GKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQR

Query:  ASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCK
        AS+EIMR KLKIRDLFQ +D+LC+EGR SE+LFDSEGQIDSEDIFC KCGSKELSLENDIILCDG+CDRGFHQFCLEPPLLN+DIPPDDEGWLCPGCDCK
Subjt:  ASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCK

Query:  DDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSDESSSDQSSSDESGY--ASASEELEAAPNDD
        DDC+DLLNEFQGSNLSITDGWEKV+PEAAAAAAG++SDH + LPSDDS+DGDYDPD PD I+Q+ ESSSD SSSDQSSSD+SGY  ASASEELEA PNDD
Subjt:  DDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSDESSSDQSSSDESGY--ASASEELEAAPNDD

Query:  QYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDD---------------GTTPVRNSNGQGSGCGPSTSVLHNELQSLLESGPDKDG
        QYLGLPSDDSEDDDY+PGAP  DEGV+QESS SDFTSDSEDLAAL D                T PVRNSNGQ SG GP+ +  HN+L SL+ SGPD+ G
Subjt:  QYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDD---------------GTTPVRNSNGQGSGCGPSTSVLHNELQSLLESGPDKDG

Query:  LEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPAL--NGTNDDLKNKKTKRSYKRRTHQKPGAENMNNSVT
        LE VSGRR VERLDYKKLHDET+GNVP+DSSDDT+GS SIDSSDDRGRG  TRK SPKN VPAL  NGT DDLKN KTKRS K RT QKP AENM+NSVT
Subjt:  LEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPAL--NGTNDDLKNKKTKRSYKRRTHQKPGAENMNNSVT

Query:  RTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKL
        +TPE ++KSSSSVRRT SSS+RRLSQP LERLLASFQENQYP+RATKESLA+ELGLSLKQVSKWFENTRWSTRHPSS E+NKAKS  RMG QSS+TS K 
Subjt:  RTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKL

Query:  PKPEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRT
        PKPEQESGACFRD  +NGAQHQ SP     VAPCQSG T DDKLA QK  RPESTATKSRKRKGRSD VAS SKDRK+S+KPPAKS KV++IQTADKV+ 
Subjt:  PKPEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRT

Query:  RRRRSI
        RRR+S+
Subjt:  RRRRSI

XP_022149322.1 homeobox protein HAT3.1 isoform X1 [Momordica charantia]0.0e+0099.21Show/hide
Query:  MEERHEYTEPRPNNNCEAVQEAKASVEVLTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLAEI
        MEERHEYTEPRPNNNCEAVQEAKASVEVLTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLAEI
Subjt:  MEERHEYTEPRPNNNCEAVQEAKASVEVLTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLAEI

Query:  DQVEAGNLLSSDIETENLILPIELETTTLNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRSLVS
        DQVEAGNLLSSDIETENLILPIELETTTLNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRSLVS
Subjt:  DQVEAGNLLSSDIETENLILPIELETTTLNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRSLVS

Query:  SDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIM
        SDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIM
Subjt:  SDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIM

Query:  RGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDL
        RGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDL
Subjt:  RGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDL

Query:  LNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSDESSSDQSSSDESGYASASEELEAAPNDDQYLGLPSD
        LNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQE     DESSSDQSSSDESGYASASEELEAAPNDDQYLGLPSD
Subjt:  LNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSDESSSDQSSSDESGYASASEELEAAPNDDQYLGLPSD

Query:  DSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTPVRNSNGQGSGCGPSTSVLHNELQSLLESGPDKDGLEPVSGRRQVERLDYKKLHDETY
        DSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTPVRNSNGQGSGCGP TSVLHNELQSLLESGPDKDGLEPVSGRRQVERLDYKKLHDETY
Subjt:  DSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTPVRNSNGQGSGCGPSTSVLHNELQSLLESGPDKDGLEPVSGRRQVERLDYKKLHDETY

Query:  GNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKPGAENMNNSVTRTPEDSVKSSSSVRRTASSSNRRLS
        GNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKPGAENM NSVTRTPEDSVKSSSSVRRTASSSNRRLS
Subjt:  GNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKPGAENMNNSVTRTPEDSVKSSSSVRRTASSSNRRLS

Query:  QPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPKPEQESGACFRDTDNNGAQHQVSP
        QPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPKPEQESGACFRDTDNNGAQHQVSP
Subjt:  QPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPKPEQESGACFRDTDNNGAQHQVSP

Query:  NTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRTRRRRSI
        NTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRTRRRRSI
Subjt:  NTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRTRRRRSI

XP_022942376.1 homeobox protein HAT3.1 [Cucurbita moschata]0.0e+0076.57Show/hide
Query:  MEERHEYTEPRPNNNCEAVQEAKAS--VEVLTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLA
        MEER EYTE R  N   AVQEAKAS  VEVLT  +NEQMHS P+  ELGT  + TSKT  PD+EK GV+QNMEE+ KELG G+    LPEK++QTISKLA
Subjt:  MEERHEYTEPRPNNNCEAVQEAKAS--VEVLTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLA

Query:  EIDQVEAGNLLSSDIETENLILPIELETTT-LNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRS
        + DQ EAGNLLSSD +TENLILPIE+ETT  LNECSE P ED NKN I+Q NPPIED  QNTSI  L  VP      S ++G KDK++LKSKKKNY+LRS
Subjt:  EIDQVEAGNLLSSDIETENLILPIELETTT-LNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRS

Query:  LVSSDRVLRSRTQEKAKAPEPSNELNKLTAG-EGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRAS
        L+SSDRVLRSRTQ+KAKAPEPSN+L+ +TAG EGK KKK R IKGKGA  DEFSSIRN LRYLVNRIKYEQSLI+AYSSEGWKGFSSDKLKPEKELQRAS
Subjt:  LVSSDRVLRSRTQEKAKAPEPSNELNKLTAG-EGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRAS

Query:  SEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDD
        +EIMR KLKIRDLFQ +D+LC+EGR SE+LFDSEGQIDSEDIFC KCGSKELSLENDIILCDG+CDRGFHQFCLEPPLLN+DIPPDDEGWLCPGCDCKDD
Subjt:  SEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDD

Query:  CLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQE-----DESSSDESSSDQSSSDESGY--ASASEELEAAP
        C+DLLNEFQGSNLSITDGWEKV+PEAAAAAAGQ+SDH + LPSDDS+DGDYDPD PD I+Q+     D SSSD+SSSD SSSD+SGY  ASASEELEA P
Subjt:  CLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQE-----DESSSDESSSDQSSSDESGY--ASASEELEAAP

Query:  NDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDD---------------GTTPVRNSNGQGSGCGPSTSVLHNELQSLLESGPD
        NDDQYLGLPSDDSEDDDY+PGAP  DEGV QESS SDFTSDSEDLAAL D                T PVRNS+GQ SG GP+ +  HN+L SL+ SGPD
Subjt:  NDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDD---------------GTTPVRNSNGQGSGCGPSTSVLHNELQSLLESGPD

Query:  KDGLEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPAL--NGTNDDLKNKKTKRSYKRRTHQKPGAENMNN
        + GLE VSGRR VERLDYKKLHDET+GNVP+DSSDDT+GS SIDSSDDRGRG  TRK SPKN VPAL  NGT DDLKN KTKRS K RT QKP AENM+N
Subjt:  KDGLEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPAL--NGTNDDLKNKKTKRSYKRRTHQKPGAENMNN

Query:  SVTRTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETS
        SVT+TPE ++KSSSSVRRT SSS+RRLSQP LERLLASFQENQYP+RATKESLA+ELGLSLKQVSKWFENTRWSTRHPSS E+NKAKSA RMG QSS+TS
Subjt:  SVTRTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETS

Query:  GKLPKPEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADK
         K PKPEQESGACFRDT +NGAQHQ SP     VAPCQSG T DDKLA QK  RPES ATKSRKRKGRSD VAS SKDRK+S+KPPAKS KV++IQTADK
Subjt:  GKLPKPEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADK

Query:  VRTRRRRSI
        V+ RRR+S+
Subjt:  VRTRRRRSI

XP_038876083.1 homeobox protein HAT3.1 isoform X1 [Benincasa hispida]0.0e+0077.7Show/hide
Query:  MEERHEY--TEPRPNNNCEAVQEAKAS--VEVLTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISK
        MEER E   TE RPNN+ E VQEAKAS  VEVLTC SNE MHS    QELGTTPE +SKT GPD+EK GVQQNM     ELGSG +LSEL EK+NQT+S 
Subjt:  MEERHEY--TEPRPNNNCEAVQEAKAS--VEVLTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISK

Query:  LAEIDQVEAGNLLSSDIETENLILPIELETTT-LNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYML
         A+ DQVEAGNLLSSD +TENL LPIE+ETTT LNECSELP ED NKN I+Q+NPPIEDLTQN SIQ LE +P    S SQQLG KDK ILKSKK NY L
Subjt:  LAEIDQVEAGNLLSSDIETENLILPIELETTT-LNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYML

Query:  RSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEGKR--KKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQ
        RSLVSSDRVLRSRTQEKAKAPEPSN LN  TA EGKR  KKKKRNI+GK A  DE+SSIR +LRYL+NRI YEQSLI+AYSSEGWKGFSSDKLKPEKELQ
Subjt:  RSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEGKR--KKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQ

Query:  RASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDC
        RAS+EIM+ KLKIRDLFQ +D+LCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDC
Subjt:  RASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDC

Query:  KDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSDE--SSSDQSSSDESGYASASEELEAAPND
        KDDCLDLLNEFQGSNLSITD WEKVYPEAAAAAAGQNSDH LGLPSDDSEDGDYDPD PDTI+Q++ESSSDE  SSSDQS+SD SGYASASE LE  PND
Subjt:  KDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSDE--SSSDQSSSDESGYASASEELEAAPND

Query:  DQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALD--------------DGTTPVRNSNGQGSGCGPSTSVLHNELQSLLESGPDKDG
        DQYLGLPSDDSEDDDY+P  PELDEGV++ESS SDFTSDSEDLAALD              + T  V+NSNGQ SGCGPS S LHNEL SL      KDG
Subjt:  DQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALD--------------DGTTPVRNSNGQGSGCGPSTSVLHNELQSLLESGPDKDG

Query:  LEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPAL--NGTNDDLKNKKTKRSYKRRTHQKPGAENMNNSVT
        LEPVSGRRQVERLDYKKLHDETYGNVP+DSSDDT+GS S+DSS DRG  S TRKR P+NLV AL  NGTNDDL N KTKRS+K RT QK  A N+NNSVT
Subjt:  LEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPAL--NGTNDDLKNKKTKRSYKRRTHQKPGAENMNNSVT

Query:  RTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKL
         TP D+ KSSSS R+T SSSNRRLSQPALERL ASFQEN+YPKRATKESLAQELGLSLKQVS+WFENTRWSTRHPSS   N+AKS+ RM   SS+ SG+L
Subjt:  RTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKL

Query:  PKPEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRT
        PK EQESGACFRDTD+NGAQHQ  P  +    PCQSGDT D KL T+KT R ES+ATKSRKRK  SDH+ASH+KD++ SQ+PPAKSPKVN+IQTAD+ +T
Subjt:  PKPEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRT

Query:  RRRRSI
        RRRRSI
Subjt:  RRRRSI

TrEMBL top hitse value%identityAlignment
A0A1S3C283 pathogenesis-related homeodomain protein0.0e+0076.55Show/hide
Query:  MEERHEY--TEPRPNNNCEAVQEAKAS--VEVLTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISK
        MEER E   TE RPN   EAVQEAKAS  VEV TC SNE M+S    QELGTTPE + KT GPD+EK+GVQQNM     ELGSG +LSEL EK+NQTIS 
Subjt:  MEERHEY--TEPRPNNNCEAVQEAKAS--VEVLTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISK

Query:  LAEIDQVEAGNLLSSDIETENLILPIELETTT-LNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYML
         A+ DQVEAGN LS D +T+NL L IE ETTT LNECSELP ED  KN I+++NPPIEDLTQ TSIQ LET+P    S SQQL HKD++  KSKKKNY L
Subjt:  LAEIDQVEAGNLLSSDIETENLILPIELETTT-LNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYML

Query:  RSLVSSDRVLRSRTQEKAKAPEPSNELNKLTA-GEGKR-KKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQ
        RSLVSSDRVLRSRTQEKAKAPEPSN+LN  TA  EGKR KKKKRNI+GKGA  DE+SSIRN LRYL+NRI+YEQSLI+AYSSEGWKGFSSDKLKPEKELQ
Subjt:  RSLVSSDRVLRSRTQEKAKAPEPSNELNKLTA-GEGKR-KKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQ

Query:  RASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDC
        RAS+EIMR KLKIRDLFQ +D+LCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDC
Subjt:  RASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDC

Query:  KDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSDESSSDQSSSDESGYASASEELEAAPNDDQ
        KDDCLDLLNEFQGSNLSITDGWEKVYPE AAAAAG+NSD  LGLPSDDSEDGDYDPD PDTI+Q++E SSDESSSDQS+SD SGYASASE LE  PNDDQ
Subjt:  KDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSDESSSDQSSSDESGYASASEELEAAPNDDQ

Query:  YLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALD--------------DGTTPVRNSNGQGSGCGPSTSVLHNELQSLLESGPDKDGLE
        YLGLPSDDSED+DY+P  PELDEG +QESS SDFTSDSEDLAAL+              + T PV+N+NG+ S  GPS S LHNEL SLL+SG DKDGLE
Subjt:  YLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALD--------------DGTTPVRNSNGQGSGCGPSTSVLHNELQSLLESGPDKDGLE

Query:  PVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPAL--NGTNDDLKNKKTKRSYKRRTHQKPGAENMNNSVTRT
        P+SGRRQVERLDYKKLHDETYGNVP++SSDDT+GS ++DSSDDRG  S TRKR PK LV AL  NG+NDDL N KTKRSYKRRT QKPGA N+NNSVT T
Subjt:  PVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPAL--NGTNDDLKNKKTKRSYKRRTHQKPGAENMNNSVTRT

Query:  PEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPK
        P D+ KSSSSVR+  SSSNRRLSQPALERL ASFQEN+YPKRATKESLAQELGL+LKQVSKWFENTRWSTRHPSS    KAKS+ RM I  S+ SG+L K
Subjt:  PEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPK

Query:  PEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRTRR
         EQES  CFRDTD+NGA+HQ  P  +  VA CQSGDT D KL T+KT R ES+ATKSRKRKGRSD+ AS+SKDR+ S +PPAKSPKVN+ QTAD+ +TRR
Subjt:  PEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRTRR

Query:  RRSI
        RRSI
Subjt:  RRSI

A0A6J1D6Q5 homeobox protein HAT3.1 isoform X10.0e+0099.21Show/hide
Query:  MEERHEYTEPRPNNNCEAVQEAKASVEVLTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLAEI
        MEERHEYTEPRPNNNCEAVQEAKASVEVLTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLAEI
Subjt:  MEERHEYTEPRPNNNCEAVQEAKASVEVLTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLAEI

Query:  DQVEAGNLLSSDIETENLILPIELETTTLNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRSLVS
        DQVEAGNLLSSDIETENLILPIELETTTLNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRSLVS
Subjt:  DQVEAGNLLSSDIETENLILPIELETTTLNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRSLVS

Query:  SDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIM
        SDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIM
Subjt:  SDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIM

Query:  RGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDL
        RGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDL
Subjt:  RGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDL

Query:  LNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSDESSSDQSSSDESGYASASEELEAAPNDDQYLGLPSD
        LNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQE     DESSSDQSSSDESGYASASEELEAAPNDDQYLGLPSD
Subjt:  LNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSDESSSDQSSSDESGYASASEELEAAPNDDQYLGLPSD

Query:  DSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTPVRNSNGQGSGCGPSTSVLHNELQSLLESGPDKDGLEPVSGRRQVERLDYKKLHDETY
        DSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTPVRNSNGQGSGCGP TSVLHNELQSLLESGPDKDGLEPVSGRRQVERLDYKKLHDETY
Subjt:  DSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTPVRNSNGQGSGCGPSTSVLHNELQSLLESGPDKDGLEPVSGRRQVERLDYKKLHDETY

Query:  GNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKPGAENMNNSVTRTPEDSVKSSSSVRRTASSSNRRLS
        GNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKPGAENM NSVTRTPEDSVKSSSSVRRTASSSNRRLS
Subjt:  GNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKPGAENMNNSVTRTPEDSVKSSSSVRRTASSSNRRLS

Query:  QPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPKPEQESGACFRDTDNNGAQHQVSP
        QPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPKPEQESGACFRDTDNNGAQHQVSP
Subjt:  QPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPKPEQESGACFRDTDNNGAQHQVSP

Query:  NTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRTRRRRSI
        NTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRTRRRRSI
Subjt:  NTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRTRRRRSI

A0A6J1E4I6 homeobox protein HAT3.1-like0.0e+0073.06Show/hide
Query:  MEERHEYTEPRPNNNCEAVQEAKASV--EVLTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLA
        MEER EYTE R NNN EAVQEAK SV  E+ TC SNEQ HS+PD  EL  TP  ++KT G D+EK  VQQNMEEE +ELGSGDVL EL EK+NQT S LA
Subjt:  MEERHEYTEPRPNNNCEAVQEAKASV--EVLTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLA

Query:  EIDQVEAGNLLSSDIETENLILPIELETTT-LNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRS
        + DQVEAGNLL  D +TENLI+PIE+ETTT L +CSELPPE  NKN I+Q+NPP E LTQNT  Q LETVP    S S+Q  HKDK+ILKS K N +LRS
Subjt:  EIDQVEAGNLLSSDIETENLILPIELETTT-LNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRS

Query:  LVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASS
        LVSSDR LRS+TQEK K PEPSN+LN  TA EGK KKK+RNI+GKGA  DEFSSIRN LRYL+NRIKYEQ+LI+AYSSEGWKGFSSDKLKPEKELQRAS+
Subjt:  LVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASS

Query:  EIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDC
        EIMR KLKIRD+FQ +D+LC EG LS+SLFDS+GQIDSEDIFCAKCGSKELS ENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDC
Subjt:  EIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDC

Query:  LDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDS-EDGDYDPDAPDTINQEDESSSDESSSDQSSSDESGYASASEELEAAPNDDQYLG
        L+LLNEFQGS LSITDGWEKVYPEAAA+AAG+N DHA GLPSDDS +D DYDPD PDTI Q+DE          SS + SGYASASEELE+ PN DQYLG
Subjt:  LDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDS-EDGDYDPDAPDTINQEDESSSDESSSDQSSSDESGYASASEELEAAPNDDQYLG

Query:  LPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALD---------------DGTTPVRNSNGQGSGCGPSTSVLHNELQSLLESGPDKDGLEPV
        LPSDDSEDDDY+P APE DE V+QESS SDFTSDSEDLAALD               + TT ++N +G+ SG GP  S L+NEL SLLESGPDKDG EPV
Subjt:  LPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALD---------------DGTTPVRNSNGQGSGCGPSTSVLHNELQSLLESGPDKDGLEPV

Query:  SGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNG--TNDDLKNKKTKRSYKRRTHQKPGAENMNNSVTRTPE
         GRRQVERLDYKKLHDETYGNVP+DSSDDT+ S+S+DSSDD+G  S TRKRSPK LV AL    TNDDL N KTK S KR T QK  A NMN SV++TPE
Subjt:  SGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNG--TNDDLKNKKTKRSYKRRTHQKPGAENMNNSVTRTPE

Query:  DSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPKPE
        D+ K+SSSVRRT  SS RRLS+ ALERLLASFQENQYP+RATKESLAQELGLS+KQVSKWF NTRWSTRHPSS+E NKAKS+ RMGI SS+ SG+L +PE
Subjt:  DSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPKPE

Query:  QESGACFRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRTRRRR
        QE           GAQHQ  P  D  VAPCQSGDT D KLATQ+T R E +ATKSRKRKGRSDH AS SKD KESQ+PPAKSPKVN+IQTA  ++TRRR 
Subjt:  QESGACFRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRTRRRR

Query:  SI
        S+
Subjt:  SI

A0A6J1FNP3 homeobox protein HAT3.10.0e+0076.57Show/hide
Query:  MEERHEYTEPRPNNNCEAVQEAKAS--VEVLTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLA
        MEER EYTE R  N   AVQEAKAS  VEVLT  +NEQMHS P+  ELGT  + TSKT  PD+EK GV+QNMEE+ KELG G+    LPEK++QTISKLA
Subjt:  MEERHEYTEPRPNNNCEAVQEAKAS--VEVLTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLA

Query:  EIDQVEAGNLLSSDIETENLILPIELETTT-LNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRS
        + DQ EAGNLLSSD +TENLILPIE+ETT  LNECSE P ED NKN I+Q NPPIED  QNTSI  L  VP      S ++G KDK++LKSKKKNY+LRS
Subjt:  EIDQVEAGNLLSSDIETENLILPIELETTT-LNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRS

Query:  LVSSDRVLRSRTQEKAKAPEPSNELNKLTAG-EGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRAS
        L+SSDRVLRSRTQ+KAKAPEPSN+L+ +TAG EGK KKK R IKGKGA  DEFSSIRN LRYLVNRIKYEQSLI+AYSSEGWKGFSSDKLKPEKELQRAS
Subjt:  LVSSDRVLRSRTQEKAKAPEPSNELNKLTAG-EGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRAS

Query:  SEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDD
        +EIMR KLKIRDLFQ +D+LC+EGR SE+LFDSEGQIDSEDIFC KCGSKELSLENDIILCDG+CDRGFHQFCLEPPLLN+DIPPDDEGWLCPGCDCKDD
Subjt:  SEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDD

Query:  CLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQE-----DESSSDESSSDQSSSDESGY--ASASEELEAAP
        C+DLLNEFQGSNLSITDGWEKV+PEAAAAAAGQ+SDH + LPSDDS+DGDYDPD PD I+Q+     D SSSD+SSSD SSSD+SGY  ASASEELEA P
Subjt:  CLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQE-----DESSSDESSSDQSSSDESGY--ASASEELEAAP

Query:  NDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDD---------------GTTPVRNSNGQGSGCGPSTSVLHNELQSLLESGPD
        NDDQYLGLPSDDSEDDDY+PGAP  DEGV QESS SDFTSDSEDLAAL D                T PVRNS+GQ SG GP+ +  HN+L SL+ SGPD
Subjt:  NDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDD---------------GTTPVRNSNGQGSGCGPSTSVLHNELQSLLESGPD

Query:  KDGLEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPAL--NGTNDDLKNKKTKRSYKRRTHQKPGAENMNN
        + GLE VSGRR VERLDYKKLHDET+GNVP+DSSDDT+GS SIDSSDDRGRG  TRK SPKN VPAL  NGT DDLKN KTKRS K RT QKP AENM+N
Subjt:  KDGLEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPAL--NGTNDDLKNKKTKRSYKRRTHQKPGAENMNN

Query:  SVTRTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETS
        SVT+TPE ++KSSSSVRRT SSS+RRLSQP LERLLASFQENQYP+RATKESLA+ELGLSLKQVSKWFENTRWSTRHPSS E+NKAKSA RMG QSS+TS
Subjt:  SVTRTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETS

Query:  GKLPKPEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADK
         K PKPEQESGACFRDT +NGAQHQ SP     VAPCQSG T DDKLA QK  RPES ATKSRKRKGRSD VAS SKDRK+S+KPPAKS KV++IQTADK
Subjt:  GKLPKPEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADK

Query:  VRTRRRRSI
        V+ RRR+S+
Subjt:  VRTRRRRSI

A0A6J1IPM8 homeobox protein HAT3.1-like0.0e+0076.71Show/hide
Query:  MEERHEYTEPRPNNNCEAVQEAKAS--VEVLTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLA
        MEER EYTE R  +   AVQEAKAS  VEVLT  +NEQ+ S P+  ELGT  + TSKT  PD+EK GV+QNMEE++KEL  G+  SELPEK++QTISKLA
Subjt:  MEERHEYTEPRPNNNCEAVQEAKAS--VEVLTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLA

Query:  EIDQVEAGNLLSSDIETENLILPIELETTT-LNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRS
        E DQ EAGNLLSSD +TENLILPIE+ETT  LNECSE P ED NKN I+Q NPPIE   QNTSI+ L  VP      S +LG KDK++LKSKKKNY+LRS
Subjt:  EIDQVEAGNLLSSDIETENLILPIELETTT-LNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRS

Query:  LVSSDRVLRSRTQEKAKAPEPSNELNKLTAGE---GKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQR
        LVSSDRVLRSRTQ+KAKAPEPSN+L+ +TAGE   GK++KK R IKGKGA  DEFSSIRN LRYLVNRIKYEQSLI+AYSSEGWKGFSSDKLKPEKELQR
Subjt:  LVSSDRVLRSRTQEKAKAPEPSNELNKLTAGE---GKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQR

Query:  ASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCK
        AS+EIMR KLKIRDLFQ +D+LC+EGR SE+LFDSEGQIDSEDIFC KCGSKELSLENDIILCDG+CDRGFHQFCLEPPLLN+DIPPDDEGWLCPGCDCK
Subjt:  ASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCK

Query:  DDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSDESSSDQSSSDESGY--ASASEELEAAPNDD
        DDC+DLLNEFQGSNLSITDGWEKV+PEAAAAAAG++SDH + LPSDDS+DGDYDPD PD I+Q+ ESSSD SSSDQSSSD+SGY  ASASEELEA PNDD
Subjt:  DDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSDESSSDQSSSDESGY--ASASEELEAAPNDD

Query:  QYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDD---------------GTTPVRNSNGQGSGCGPSTSVLHNELQSLLESGPDKDG
        QYLGLPSDDSEDDDY+PGAP  DEGV QESS SDFTSDSEDLAAL D                T PVRNSNGQ SG GP+ +  HN+L SL+ SGPD+ G
Subjt:  QYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDD---------------GTTPVRNSNGQGSGCGPSTSVLHNELQSLLESGPDKDG

Query:  LEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPAL--NGTNDDLKNKKTKRSYKRRTHQKPGAENMNNSVT
        LE VSGRR VERLDYKKLHDET+GNVPS+SSDDT+GS SIDSSDDRGRG  TRK SPKNLVPAL  NGT DD KN KTK S  RRT QKP AENM+NSVT
Subjt:  LEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPAL--NGTNDDLKNKKTKRSYKRRTHQKPGAENMNNSVT

Query:  RTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKL
        +TPE ++KSSSSVRRT SSS+RRLSQP LERLLASFQENQYP+RATKESLA+ELGLSLKQVSKWFENTRWSTRHPSS E+NKAKSA RMG QSS+TS K 
Subjt:  RTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKL

Query:  PKPEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRT
        PKPEQESGACFRDT +NGAQHQ SP     VAPCQSG T DDKLA  KT RPESTATKSRKRKGRSD VAS SK+RK+S+KPPAKS KV++IQTADKV+ 
Subjt:  PKPEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRT

Query:  RRRRSI
        RRR+S+
Subjt:  RRRRSI

SwissProt top hitse value%identityAlignment
P46605 Homeobox protein HOX1A2.1e-10741.68Show/hide
Query:  YMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKEL
        Y L S  S  RVLRS +  K  + E    +        KR+K  R      +S DEFS IR R+RY++NR+ YEQSLI+AY+SEGWK  S DK++PEKEL
Subjt:  YMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKEL

Query:  QRASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCD
        +RA SEI+R KL+IR++F+++DSL ++G++ E+LFDSEG+I  EDIFC+ CGS + +L NDIILCDG CDRGFHQ CL PPL   DIP  DEGWLCP CD
Subjt:  QRASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCD

Query:  CKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDT-INQEDESSSDESSSDQSSSDESGYASASEELEAAPN-
        CK DC+DL+NE  GSN+SI D WEKV+P+AAA A     D A  LPSDDS+D D+DP+ P+  +  +DE SS+E     S SD+S + + S++ E   + 
Subjt:  CKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDT-INQEDESSSDESSSDQSSSDESGYASASEELEAAPN-

Query:  --DDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSG--SDFTSDSEDL------AALDDGTTPVRNSNGQG-----SGCGPSTSVLHNELQSLLESGPDK
          DD  L LPS+DSEDDDY+P  P+ D+ V+++SS   SDFTSDS+D       +  D+ ++P+      G     +    +TS   + +++ ++ G   
Subjt:  --DDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSG--SDFTSDSEDL------AALDDGTTPVRNSNGQG-----SGCGPSTSVLHNELQSLLESGPDK

Query:  DGLEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGS---ISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKPGAENMNN
          + P S RRQ ERLDYKKL+DE YG   SDSSDD   S     I  S++ G  +     SP      +   ND+L  + TK+S            +++ 
Subjt:  DGLEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGS---ISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKPGAENMNN

Query:  SVTRTPEDSVKSSSSVRRTASSSNRRLSQPAL-ERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSET
        SV   P D   + S+     S++ +    P + ++L   F+   YP R+ KESLA+ELGL+ +QV+KWFE  R S R  SS +             S  T
Subjt:  SVTRTPEDSVKSSSSVRRTASSSNRRLSQPAL-ERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSET

Query:  SGKLPK-PE----QESGACFR---DTDNNGAQHQVSPNTDGAVAPCQSGDTRDDK
        +   PK PE    +ES  C              +V   T G+       D+ +D+
Subjt:  SGKLPK-PE----QESGACFR---DTDNNGAQHQVSPNTDGAVAPCQSGDTRDDK

P48785 Pathogenesis-related homeodomain protein5.4e-5029Show/hide
Query:  KDKKILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKG
        K  K + +K+ +   +     +   +SRT++ ++      E+ +    + +++K KR  K      D+   ++ R RYL+ ++K +Q+LIDAY++EGWKG
Subjt:  KDKKILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKG

Query:  FSSDKLKPEKELQRASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIP
         S +K++P+KEL+RA  EI+  KL +RD  + LD L + G + E +  S+G I  + IFCA+C S+E   +NDIILCDG C+R FHQ CL+PPL    IP
Subjt:  FSSDKLKPEKELQRASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIP

Query:  PDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNS--DHALGLPSDDSEDGDYDPDAPDTINQEDESSSDESSSDQSSSDESGY
        P D+GW C  CDCK + +D +N   G++  +   W+ ++ E A+   G  +  ++    PSDDS+D DYDP+                +   +SS+ SG 
Subjt:  PDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNS--DHALGLPSDDSEDGDYDPDAPDTINQEDESSSDESSSDQSSSDESGY

Query:  ASASEELEAAPNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTPVRNSNGQGSGCGPSTSVLHNELQSLLESGPDKDGL
                             D   D+D              E S S   S S D  AL  G+      +     C  S     NE              
Subjt:  ASASEELEAAPNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTPVRNSNGQGSGCGPSTSVLHNELQSLLESGPDKDGL

Query:  EPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKPGAENMNNSVTRTP
        E V G RQ   +DY +L+ E +G    D+     GS   D   +  R  +    +   LV     +  D                          V  T 
Subjt:  EPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKPGAENMNNSVTRTP

Query:  EDSVKSSSSVRRTASSSNR-RLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPK
        E S + S SV          RL + A+E+L   F E + P +A ++ LA+EL L  ++V+KWF+NTR+      ++ + K +S  + G  S   SG    
Subjt:  EDSVKSSSSVRRTASSSNR-RLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPK

Query:  PEQESGACFRDTDNNGAQHQVSPNTDGAVAP
        PE          +NN   ++V    D  V P
Subjt:  PEQESGACFRDTDNNGAQHQVSPNTDGAVAP

P48786 Pathogenesis-related homeodomain protein5.4e-11942.35Show/hide
Query:  ENLILPIELETTTLN-ECSELPPEDANKN-SIKQVNPPIEDLTQ-----------------------------------NTSIQRLETVPITSVSISQQL
        E+L +P + ++ T N + SELPPE+A KN +  Q     +D T+                                    T +++L  V  T+   S QL
Subjt:  ENLILPIELETTTLN-ECSELPPEDANKN-SIKQVNPPIEDLTQ-----------------------------------NTSIQRLETVPITSVSISQQL

Query:  GHKDK--------------------KILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEG---KRKKKKRNIKGKGASGDEFSSIRNR
        G   K                    K++  K K+    S V+S R LRSR+QEK+  P    ++N + A EG   ++ +KKR  + +    DEF  IR  
Subjt:  GHKDK--------------------KILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEG---KRKKKKRNIKGKGASGDEFSSIRNR

Query:  LRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDII
        LRYL++RIKYE++ +DAYS EGWKG S DK+KPEKEL+RA +EI   KLKIRDLFQ LD   +EGRL E LFDS G+IDSEDIFCAKCGSK+++L NDII
Subjt:  LRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDII

Query:  LCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVY-PEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDT
        LCDG CDRGFHQFCL+PPLL   IPPDDEGWLCPGC+CK DC+ LLN+ Q +N+ + D WEKV+  EAAAAA+G+N D   GLPSDDSED DYDP  PD 
Subjt:  LCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVY-PEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDT

Query:  INQEDESSSDESSSDQSSSDESGYASASEELEAAPNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTPVRNSNGQGSGC
                 ++   D SS+DES Y S S++++     +   GLPSDDSEDD+Y+P     D+  K +SS SDFTSDSED   + D      +    G   
Subjt:  INQEDESSSDESSSDQSSSDESGYASASEELEAAPNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTPVRNSNGQGSGC

Query:  GPSTSVLHNELQSLLESG-PDKDGLEPVSGRRQVERLDYKKLHD--------------------------ETYGNVPSDSSDDTFGSISIDSSDDRGRGS
        GP  S   +   +    G P++    P+  RRQVE LDYKKL+D                          E YGN  SDSSD+ +    + SS D+    
Subjt:  GPSTSVLHNELQSLLESG-PDKDGLEPVSGRRQVERLDYKKLHD--------------------------ETYGNVPSDSSDDTFGSISIDSSDDRGRGS

Query:  RTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKPGAENMNNSVTRTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQ
        +      +         + +L  K  + ++ RR  +K   E  ++ ++R+ ED   S++ V  + S+S     + A +RLL SF+ENQYP+RA KESLA 
Subjt:  RTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKPGAENMNNSVTRTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQ

Query:  ELGLSLKQVSKWFENTRWSTRHPSSIESNKAK
        EL LS++QVS WF N RWS RH S I S+ AK
Subjt:  ELGLSLKQVSKWFENTRWSTRHPSSIESNKAK

Q04996 Homeobox protein HAT3.11.3e-12050.36Show/hide
Query:  RTQEKAKAPEPSNEL-NKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIMRGKLKI
        R Q   +   PS+ + N    G  K+K K  N KG+    DE++ I+ +LRY +NRI YEQSLIDAYS EGWKG S +K++PEKEL+RA+ EI+R KLKI
Subjt:  RTQEKAKAPEPSNEL-NKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIMRGKLKI

Query:  RDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQG
        RDLFQHLD+LCAEG L ESLFD++G+I SEDIFCAKCGSK+LS++NDIILCDG CDRGFHQ+CLEPPL   DIPPDDEGWLCPGCDCKDD LDLLN+  G
Subjt:  RDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQG

Query:  SNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSD---ESSSDQSSSDESGYASASEEL-----EAAPNDDQYLGLP
        +  S++D WEK++PEAAAA  G   +    LPSDDS+D +YDPD  +  N+ DE  SD   ES ++  SSDE+ + SAS+E+     E        + LP
Subjt:  SNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSD---ESSSDQSSSDESGYASASEEL-----EAAPNDDQYLGLP

Query:  SDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDG-TTPVRNSNGQGSGCGPSTSVLHNELQSLLES--GPDKDGLEPVSGRRQVERLDYKKL
        SDDSEDDDY+P AP  D+   +ESS SD TSD+EDL     G  T  +  +      G  TS L  +  ++LES  G D DG   VS RR VERLDYKKL
Subjt:  SDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDG-TTPVRNSNGQGSGCGPSTSVLHNELQSLLES--GPDKDGLEPVSGRRQVERLDYKKL

Query:  HDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPK--NLVPALNGTN-DDLKNK----KTKRSYKRRTHQKPGAENMNNSVTRTPEDSVKSSSSV
        +DE Y NVP+ SSDD       D  D   R  +    S    + VP    +N +D  +K    K+KR+ K+ T + P      N            S  +
Subjt:  HDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPK--NLVPALNGTN-DDLKNK----KTKRSYKRRTHQKPGAENMNNSVTRTPEDSVKSSSSV

Query:  RRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWS
         +++SS+ ++ + P  +RL  SFQENQYP +ATKESLA+EL +++KQV+ WF++ RWS
Subjt:  RRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWS

Q8H991 Homeobox protein HAZ12.4e-10643.57Show/hide
Query:  KKILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAG---EGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWK
        +++ K +K++  LR   S  RVLRS +++K KA    NEL    AG     K++K  R  KG G   D++  IR R+RY++NR+ YEQSLI AY+SEGWK
Subjt:  KKILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAG---EGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWK

Query:  GFSSDKLKPEKELQRASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDI
        G S +K++PEKEL+RA  EI+R K +IR+ F++LDSL +EG+L ES+FDS G+I SEDIFCA CGSK+++L+NDIILCDGICDRGFHQ+CL PPLL  DI
Subjt:  GFSSDKLKPEKELQRASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDI

Query:  PPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSDES-----SSDQSSSD
        P  DEGWLCP CDCK DC+D+LNE QG  LSI D WEKV+PEAA+   G     A  LPSDDS D DYDP        ++E SS E       SD SSS+
Subjt:  PPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSDES-----SSDQSSSD

Query:  ESGYASASEELEAAPN----DDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSG-----SDFTSDSEDLAALDDGTTPVRNSNGQGSGCGPSTSVL----
        +S  +S  E+ + + N    DD  LGLPS+DSED D++P  P+ D+    ES+      SDFTSDS+D  A       +  S GQ    GPS+S +    
Subjt:  ESGYASASEELEAAPN----DDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSG-----SDFTSDSEDLAALDDGTTPVRNSNGQGSGCGPSTSVL----

Query:  -------------HNELQSLLESGPDKDGLEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDT--FGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTND
                      N   + +E+  ++D + P+S +RQVERLDYKKL++E YG   SDSSDD   +G+ + +  +     + +   SP+           
Subjt:  -------------HNELQSLLESGPDKDGLEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDT--FGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTND

Query:  DLKNKKTKRSYKRRTHQK---PGAENMNNSVTRTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENT
            K   R    R H     P       SV+    + + S+S+    +++ NR       ++L A F+E+ YP RATKE+LAQELGL+  QV+KWF +T
Subjt:  DLKNKKTKRSYKRRTHQK---PGAENMNNSVTRTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENT

Query:  RW-----STRHPSSIESNKAKS
        R      +T+  ++IE++ A++
Subjt:  RW-----STRHPSSIESNKAKS

Arabidopsis top hitse value%identityAlignment
AT3G19510.1 Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain9.2e-12250.36Show/hide
Query:  RTQEKAKAPEPSNEL-NKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIMRGKLKI
        R Q   +   PS+ + N    G  K+K K  N KG+    DE++ I+ +LRY +NRI YEQSLIDAYS EGWKG S +K++PEKEL+RA+ EI+R KLKI
Subjt:  RTQEKAKAPEPSNEL-NKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIMRGKLKI

Query:  RDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQG
        RDLFQHLD+LCAEG L ESLFD++G+I SEDIFCAKCGSK+LS++NDIILCDG CDRGFHQ+CLEPPL   DIPPDDEGWLCPGCDCKDD LDLLN+  G
Subjt:  RDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQG

Query:  SNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSD---ESSSDQSSSDESGYASASEEL-----EAAPNDDQYLGLP
        +  S++D WEK++PEAAAA  G   +    LPSDDS+D +YDPD  +  N+ DE  SD   ES ++  SSDE+ + SAS+E+     E        + LP
Subjt:  SNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSDDSEDGDYDPDAPDTINQEDESSSD---ESSSDQSSSDESGYASASEEL-----EAAPNDDQYLGLP

Query:  SDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDG-TTPVRNSNGQGSGCGPSTSVLHNELQSLLES--GPDKDGLEPVSGRRQVERLDYKKL
        SDDSEDDDY+P AP  D+   +ESS SD TSD+EDL     G  T  +  +      G  TS L  +  ++LES  G D DG   VS RR VERLDYKKL
Subjt:  SDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDG-TTPVRNSNGQGSGCGPSTSVLHNELQSLLES--GPDKDGLEPVSGRRQVERLDYKKL

Query:  HDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPK--NLVPALNGTN-DDLKNK----KTKRSYKRRTHQKPGAENMNNSVTRTPEDSVKSSSSV
        +DE Y NVP+ SSDD       D  D   R  +    S    + VP    +N +D  +K    K+KR+ K+ T + P      N            S  +
Subjt:  HDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPK--NLVPALNGTN-DDLKNK----KTKRSYKRRTHQKPGAENMNNSVTRTPEDSVKSSSSV

Query:  RRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWS
         +++SS+ ++ + P  +RL  SFQENQYP +ATKESLA+EL +++KQV+ WF++ RWS
Subjt:  RRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWS

AT4G29940.1 pathogenesis related homeodomain protein A3.8e-5129Show/hide
Query:  KDKKILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKG
        K  K + +K+ +   +     +   +SRT++ ++      E+ +    + +++K KR  K      D+   ++ R RYL+ ++K +Q+LIDAY++EGWKG
Subjt:  KDKKILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKG

Query:  FSSDKLKPEKELQRASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIP
         S +K++P+KEL+RA  EI+  KL +RD  + LD L + G + E +  S+G I  + IFCA+C S+E   +NDIILCDG C+R FHQ CL+PPL    IP
Subjt:  FSSDKLKPEKELQRASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIP

Query:  PDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNS--DHALGLPSDDSEDGDYDPDAPDTINQEDESSSDESSSDQSSSDESGY
        P D+GW C  CDCK + +D +N   G++  +   W+ ++ E A+   G  +  ++    PSDDS+D DYDP+                +   +SS+ SG 
Subjt:  PDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNS--DHALGLPSDDSEDGDYDPDAPDTINQEDESSSDESSSDQSSSDESGY

Query:  ASASEELEAAPNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTPVRNSNGQGSGCGPSTSVLHNELQSLLESGPDKDGL
                             D   D+D              E S S   S S D  AL  G+      +     C  S     NE              
Subjt:  ASASEELEAAPNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTPVRNSNGQGSGCGPSTSVLHNELQSLLESGPDKDGL

Query:  EPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKPGAENMNNSVTRTP
        E V G RQ   +DY +L+ E +G    D+     GS   D   +  R  +    +   LV     +  D                          V  T 
Subjt:  EPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKPGAENMNNSVTRTP

Query:  EDSVKSSSSVRRTASSSNR-RLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPK
        E S + S SV          RL + A+E+L   F E + P +A ++ LA+EL L  ++V+KWF+NTR+      ++ + K +S  + G  S   SG    
Subjt:  EDSVKSSSSVRRTASSSNR-RLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPK

Query:  PEQESGACFRDTDNNGAQHQVSPNTDGAVAP
        PE          +NN   ++V    D  V P
Subjt:  PEQESGACFRDTDNNGAQHQVSPNTDGAVAP

AT4G29940.2 pathogenesis related homeodomain protein A3.8e-5129Show/hide
Query:  KDKKILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKG
        K  K + +K+ +   +     +   +SRT++ ++      E+ +    + +++K KR  K      D+   ++ R RYL+ ++K +Q+LIDAY++EGWKG
Subjt:  KDKKILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSNELNKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKG

Query:  FSSDKLKPEKELQRASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIP
         S +K++P+KEL+RA  EI+  KL +RD  + LD L + G + E +  S+G I  + IFCA+C S+E   +NDIILCDG C+R FHQ CL+PPL    IP
Subjt:  FSSDKLKPEKELQRASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIP

Query:  PDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNS--DHALGLPSDDSEDGDYDPDAPDTINQEDESSSDESSSDQSSSDESGY
        P D+GW C  CDCK + +D +N   G++  +   W+ ++ E A+   G  +  ++    PSDDS+D DYDP+                +   +SS+ SG 
Subjt:  PDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNS--DHALGLPSDDSEDGDYDPDAPDTINQEDESSSDESSSDQSSSDESGY

Query:  ASASEELEAAPNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTPVRNSNGQGSGCGPSTSVLHNELQSLLESGPDKDGL
                             D   D+D              E S S   S S D  AL  G+      +     C  S     NE              
Subjt:  ASASEELEAAPNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTPVRNSNGQGSGCGPSTSVLHNELQSLLESGPDKDGL

Query:  EPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKPGAENMNNSVTRTP
        E V G RQ   +DY +L+ E +G    D+     GS   D   +  R  +    +   LV     +  D                          V  T 
Subjt:  EPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRRTHQKPGAENMNNSVTRTP

Query:  EDSVKSSSSVRRTASSSNR-RLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPK
        E S + S SV          RL + A+E+L   F E + P +A ++ LA+EL L  ++V+KWF+NTR+      ++ + K +S  + G  S   SG    
Subjt:  EDSVKSSSSVRRTASSSNR-RLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSETSGKLPK

Query:  PEQESGACFRDTDNNGAQHQVSPNTDGAVAP
        PE          +NN   ++V    D  V P
Subjt:  PEQESGACFRDTDNNGAQHQVSPNTDGAVAP

AT5G09790.1 ARABIDOPSIS TRITHORAX-RELATED PROTEIN 57.0e-0538.24Show/hide
Query:  DSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKD
        + E +    ++ C KCGS E   +++++LCD  CDRGFH  CL P ++   I      WLC   DC D
Subjt:  DSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKD

AT5G09790.2 ARABIDOPSIS TRITHORAX-RELATED PROTEIN 57.0e-0538.24Show/hide
Query:  DSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKD
        + E +    ++ C KCGS E   +++++LCD  CDRGFH  CL P ++   I      WLC   DC D
Subjt:  DSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAGAGACATGAATATACAGAACCAAGACCTAATAATAACTGTGAAGCCGTACAAGAAGCCAAGGCCAGTGTTGAAGTGCTAACTTGTTTTTCAAATGAACAAAT
GCATTCAATACCTGACAATCAGGAACTGGGAACAACTCCAGAATGTACCAGCAAAACTGCTGGTCCAGATGATGAAAAATCGGGGGTCCAGCAGAATATGGAGGAAGAGA
CTAAGGAACTTGGTTCTGGAGATGTGCTTAGTGAGTTACCGGAAAAGAATAATCAGACTATCTCTAAGCTTGCTGAAATTGATCAAGTTGAAGCTGGCAATTTATTATCC
AGCGATATAGAAACTGAAAATTTAATATTACCTATTGAATTAGAGACAACAACTCTTAATGAGTGCTCTGAACTTCCACCAGAAGATGCCAACAAAAACTCTATCAAACA
GGTGAACCCTCCCATTGAAGATTTAACCCAGAATACTTCTATCCAAAGGTTAGAAACAGTCCCCATAACTAGTGTCAGTATTTCCCAACAGTTGGGCCACAAGGATAAGA
AAATTTTGAAATCAAAAAAGAAAAACTATATGTTAAGGTCCCTTGTAAGTAGTGACAGAGTTTTGCGTTCAAGGACCCAAGAGAAAGCTAAAGCTCCTGAACCAAGTAAT
GAGTTGAATAAGTTGACTGCCGGAGAGGGAAAAAGGAAGAAGAAGAAGAGAAATATAAAAGGAAAGGGAGCTAGTGGTGATGAATTTTCATCAATCAGGAATCGTTTGAG
ATATTTAGTGAACCGCATCAAATACGAACAGAGCTTGATTGATGCTTATTCTAGTGAAGGCTGGAAAGGGTTCAGCTCAGATAAATTGAAGCCTGAAAAGGAACTTCAAC
GAGCATCAAGTGAAATAATGCGGGGTAAATTGAAAATAAGAGATCTATTTCAACATCTTGATTCACTTTGTGCTGAAGGAAGGCTTTCTGAGTCTCTATTCGATTCTGAA
GGACAGATAGACAGTGAGGATATATTCTGTGCAAAATGTGGATCCAAAGAACTGTCCCTTGAAAATGACATCATACTATGTGATGGTATTTGTGATCGTGGGTTCCACCA
ATTCTGTTTAGAACCACCTTTGCTAAATACAGACATTCCGCCGGATGATGAGGGCTGGCTATGTCCTGGATGTGATTGCAAAGATGACTGCTTGGATCTGCTTAATGAAT
TTCAAGGATCAAATCTTTCTATTACTGATGGTTGGGAGAAAGTCTATCCTGAGGCTGCAGCAGCAGCTGCTGGGCAAAATTCTGATCACGCCTTGGGTCTTCCTTCAGAT
GATTCTGAAGATGGTGATTATGATCCTGATGCTCCAGACACTATTAACCAGGAAGATGAATCAAGTTCTGATGAATCGAGTTCTGATCAATCAAGTTCTGATGAATCTGG
GTATGCTTCTGCTTCTGAGGAATTGGAGGCTGCACCCAATGATGACCAATACTTAGGTCTTCCTTCTGATGACTCGGAGGATGATGACTATGAACCTGGTGCTCCAGAAC
TTGATGAAGGTGTTAAACAGGAAAGTTCAGGTTCTGACTTTACATCTGATTCTGAGGATCTAGCTGCACTTGATGATGGCACTACGCCTGTGAGAAACTCTAATGGGCAA
GGTTCTGGATGCGGTCCTAGCACGAGTGTACTACATAATGAGTTACAAAGTCTTCTAGAGTCAGGTCCTGATAAGGATGGTCTTGAACCTGTTTCAGGAAGAAGACAGGT
GGAACGGTTGGATTACAAGAAGCTGCATGATGAGACATACGGGAATGTTCCATCCGACTCAAGCGATGACACATTCGGGAGTATTTCTATTGATTCAAGCGATGACAGAG
GTCGGGGTAGTAGAACAAGGAAGAGAAGCCCTAAAAACCTGGTTCCTGCATTAAATGGAACTAATGATGATTTGAAAAATAAAAAAACTAAACGTAGTTATAAGAGGAGA
ACTCATCAAAAGCCAGGTGCCGAAAATATGAATAATTCTGTGACTAGGACTCCTGAAGACTCTGTGAAATCTAGTTCTTCTGTTAGACGAACTGCGTCATCATCAAATAG
AAGACTCAGTCAACCAGCGTTGGAGAGACTTCTTGCATCGTTCCAAGAAAATCAGTATCCTAAACGAGCTACAAAAGAGAGTTTGGCACAAGAACTAGGACTCAGTCTGA
AGCAGGTTAGCAAATGGTTTGAGAACACACGGTGGAGCACACGCCATCCTTCAAGCATTGAATCCAATAAAGCAAAGAGTGCTTTAAGAATGGGGATTCAGTCATCTGAG
ACGAGTGGAAAGCTGCCCAAGCCTGAGCAAGAATCTGGTGCATGTTTCAGAGACACCGATAACAATGGTGCTCAACACCAAGTATCACCAAACACAGATGGTGCTGTGGC
CCCATGTCAGAGTGGAGATACAAGGGATGACAAATTGGCGACTCAGAAAACTACTAGACCAGAATCTACTGCTACAAAATCCAGAAAACGGAAGGGCAGGTCAGATCATG
TGGCATCACATTCAAAGGACAGAAAGGAATCACAAAAGCCTCCTGCTAAGTCACCAAAAGTTAATCAAATACAAACAGCGGATAAGGTTAGGACAAGGAGGAGGAGATCC
ATT
mRNA sequenceShow/hide mRNA sequence
ATGGAAGAGAGACATGAATATACAGAACCAAGACCTAATAATAACTGTGAAGCCGTACAAGAAGCCAAGGCCAGTGTTGAAGTGCTAACTTGTTTTTCAAATGAACAAAT
GCATTCAATACCTGACAATCAGGAACTGGGAACAACTCCAGAATGTACCAGCAAAACTGCTGGTCCAGATGATGAAAAATCGGGGGTCCAGCAGAATATGGAGGAAGAGA
CTAAGGAACTTGGTTCTGGAGATGTGCTTAGTGAGTTACCGGAAAAGAATAATCAGACTATCTCTAAGCTTGCTGAAATTGATCAAGTTGAAGCTGGCAATTTATTATCC
AGCGATATAGAAACTGAAAATTTAATATTACCTATTGAATTAGAGACAACAACTCTTAATGAGTGCTCTGAACTTCCACCAGAAGATGCCAACAAAAACTCTATCAAACA
GGTGAACCCTCCCATTGAAGATTTAACCCAGAATACTTCTATCCAAAGGTTAGAAACAGTCCCCATAACTAGTGTCAGTATTTCCCAACAGTTGGGCCACAAGGATAAGA
AAATTTTGAAATCAAAAAAGAAAAACTATATGTTAAGGTCCCTTGTAAGTAGTGACAGAGTTTTGCGTTCAAGGACCCAAGAGAAAGCTAAAGCTCCTGAACCAAGTAAT
GAGTTGAATAAGTTGACTGCCGGAGAGGGAAAAAGGAAGAAGAAGAAGAGAAATATAAAAGGAAAGGGAGCTAGTGGTGATGAATTTTCATCAATCAGGAATCGTTTGAG
ATATTTAGTGAACCGCATCAAATACGAACAGAGCTTGATTGATGCTTATTCTAGTGAAGGCTGGAAAGGGTTCAGCTCAGATAAATTGAAGCCTGAAAAGGAACTTCAAC
GAGCATCAAGTGAAATAATGCGGGGTAAATTGAAAATAAGAGATCTATTTCAACATCTTGATTCACTTTGTGCTGAAGGAAGGCTTTCTGAGTCTCTATTCGATTCTGAA
GGACAGATAGACAGTGAGGATATATTCTGTGCAAAATGTGGATCCAAAGAACTGTCCCTTGAAAATGACATCATACTATGTGATGGTATTTGTGATCGTGGGTTCCACCA
ATTCTGTTTAGAACCACCTTTGCTAAATACAGACATTCCGCCGGATGATGAGGGCTGGCTATGTCCTGGATGTGATTGCAAAGATGACTGCTTGGATCTGCTTAATGAAT
TTCAAGGATCAAATCTTTCTATTACTGATGGTTGGGAGAAAGTCTATCCTGAGGCTGCAGCAGCAGCTGCTGGGCAAAATTCTGATCACGCCTTGGGTCTTCCTTCAGAT
GATTCTGAAGATGGTGATTATGATCCTGATGCTCCAGACACTATTAACCAGGAAGATGAATCAAGTTCTGATGAATCGAGTTCTGATCAATCAAGTTCTGATGAATCTGG
GTATGCTTCTGCTTCTGAGGAATTGGAGGCTGCACCCAATGATGACCAATACTTAGGTCTTCCTTCTGATGACTCGGAGGATGATGACTATGAACCTGGTGCTCCAGAAC
TTGATGAAGGTGTTAAACAGGAAAGTTCAGGTTCTGACTTTACATCTGATTCTGAGGATCTAGCTGCACTTGATGATGGCACTACGCCTGTGAGAAACTCTAATGGGCAA
GGTTCTGGATGCGGTCCTAGCACGAGTGTACTACATAATGAGTTACAAAGTCTTCTAGAGTCAGGTCCTGATAAGGATGGTCTTGAACCTGTTTCAGGAAGAAGACAGGT
GGAACGGTTGGATTACAAGAAGCTGCATGATGAGACATACGGGAATGTTCCATCCGACTCAAGCGATGACACATTCGGGAGTATTTCTATTGATTCAAGCGATGACAGAG
GTCGGGGTAGTAGAACAAGGAAGAGAAGCCCTAAAAACCTGGTTCCTGCATTAAATGGAACTAATGATGATTTGAAAAATAAAAAAACTAAACGTAGTTATAAGAGGAGA
ACTCATCAAAAGCCAGGTGCCGAAAATATGAATAATTCTGTGACTAGGACTCCTGAAGACTCTGTGAAATCTAGTTCTTCTGTTAGACGAACTGCGTCATCATCAAATAG
AAGACTCAGTCAACCAGCGTTGGAGAGACTTCTTGCATCGTTCCAAGAAAATCAGTATCCTAAACGAGCTACAAAAGAGAGTTTGGCACAAGAACTAGGACTCAGTCTGA
AGCAGGTTAGCAAATGGTTTGAGAACACACGGTGGAGCACACGCCATCCTTCAAGCATTGAATCCAATAAAGCAAAGAGTGCTTTAAGAATGGGGATTCAGTCATCTGAG
ACGAGTGGAAAGCTGCCCAAGCCTGAGCAAGAATCTGGTGCATGTTTCAGAGACACCGATAACAATGGTGCTCAACACCAAGTATCACCAAACACAGATGGTGCTGTGGC
CCCATGTCAGAGTGGAGATACAAGGGATGACAAATTGGCGACTCAGAAAACTACTAGACCAGAATCTACTGCTACAAAATCCAGAAAACGGAAGGGCAGGTCAGATCATG
TGGCATCACATTCAAAGGACAGAAAGGAATCACAAAAGCCTCCTGCTAAGTCACCAAAAGTTAATCAAATACAAACAGCGGATAAGGTTAGGACAAGGAGGAGGAGATCC
ATT
Protein sequenceShow/hide protein sequence
MEERHEYTEPRPNNNCEAVQEAKASVEVLTCFSNEQMHSIPDNQELGTTPECTSKTAGPDDEKSGVQQNMEEETKELGSGDVLSELPEKNNQTISKLAEIDQVEAGNLLS
SDIETENLILPIELETTTLNECSELPPEDANKNSIKQVNPPIEDLTQNTSIQRLETVPITSVSISQQLGHKDKKILKSKKKNYMLRSLVSSDRVLRSRTQEKAKAPEPSN
ELNKLTAGEGKRKKKKRNIKGKGASGDEFSSIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIMRGKLKIRDLFQHLDSLCAEGRLSESLFDSE
GQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGQNSDHALGLPSD
DSEDGDYDPDAPDTINQEDESSSDESSSDQSSSDESGYASASEELEAAPNDDQYLGLPSDDSEDDDYEPGAPELDEGVKQESSGSDFTSDSEDLAALDDGTTPVRNSNGQ
GSGCGPSTSVLHNELQSLLESGPDKDGLEPVSGRRQVERLDYKKLHDETYGNVPSDSSDDTFGSISIDSSDDRGRGSRTRKRSPKNLVPALNGTNDDLKNKKTKRSYKRR
THQKPGAENMNNSVTRTPEDSVKSSSSVRRTASSSNRRLSQPALERLLASFQENQYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSIESNKAKSALRMGIQSSE
TSGKLPKPEQESGACFRDTDNNGAQHQVSPNTDGAVAPCQSGDTRDDKLATQKTTRPESTATKSRKRKGRSDHVASHSKDRKESQKPPAKSPKVNQIQTADKVRTRRRRS
I