CuGenDBv2

CcUC05G093230 (gene) of Watermelon (PI 537277) v1 genome

Gene ID: CcUC05G093230
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: homeobox protein HAT3.1-like
Genome location: CicolChr05:11242271..11250200
RNA-Seq Expression: CcUC05G093230
Synteny: CcUC05G093230
Gene Ontology terms:
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0003677 - DNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domains:
IPR001356 - Homeobox domain
IPR001965 - Zinc finger, PHD-type
IPR009057 - Homeobox-like domain superfamily
IPR011011 - Zinc finger, FYVE/PHD-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR017970 - Homeobox, conserved site
IPR019786 - Zinc finger, PHD-type, conserved site
IPR019787 - Zinc finger, PHD-finger


Homology
GenBank top hits (e value, % identity, alignment)
XP_008456177.1 PREDICTED: pathogenesis-related homeodomain protein [Cucumis melo] (e value: 0.0e+00; % identity: 88.02)
Query:  MEERDENTDTESRHNNNVEAVQEAKASVEVEVLTCLSNEPMHSGYQELGTTPEYSSKTDGPDEEKPGVQQNMEFGSGYLLSELSENNNETISNHADNDQV
        MEERDENTDTESR N   EAVQEAKASVEVEV TCLSNEPM+SGYQELGTTPE+S KTDGPDEEK GVQQNME GSGYLLSELSE +N+TISNHADNDQV
Subjt:  MEERDENTDTESRHNNNVEAVQEAKASVEVEVLTCLSNEPMHSGYQELGTTPEYSSKTDGPDEEKPGVQQNMEFGSGYLLSELSENNNETISNHADNDQV

Query:  EAGNLLSSDKDTENLKLPIEVGATTLLNECAELPVEDVNKNYIELMNPPIEDLTQNTSIQKLEIVPSNSQQLGHKDKRILKSKKKNYKLRSLVNSDRVLR
        EAGN LS DKDT+NLKL IE   TTLLNEC+ELP+EDV KNYIE MNPPIEDLTQ TSIQ LE +PSNSQQL HKD+R  KSKKKNYKLRSLV+SDRVLR
Subjt:  EAGNLLSSDKDTENLKLPIEVGATTLLNECAELPVEDVNKNYIELMNPPIEDLTQNTSIQKLEIVPSNSQQLGHKDKRILKSKKKNYKLRSLVNSDRVLR

Query:  SRTQEKAKAPEPSNDLNNFTAEEGKRKKRKKKRNIQGKGARVDEYSSIKNHLRYLLNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKL
        SRTQEKAKAPEPSNDLNNFTAEE  ++K+KKKRNIQGKGARVDEYSSI+NHLRYLLNRI+YEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKL
Subjt:  SRTQEKAKAPEPSNDLNNFTAEEGKRKKRKKKRNIQGKGARVDEYSSIKNHLRYLLNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKL

Query:  KIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTEIPPDDEGWLCPGCDCKDDCLDLLNEF
        KIRDLFQRID LCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNT+IPPDDEGWLCPGCDCKDDCLDLLNEF
Subjt:  KIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTEIPPDDEGWLCPGCDCKDDCLDLLNEF

Query:  QGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNESSSDESSCGQSNSDTSGYASASEGLEVPPPDDQYLGLPSDDSED
        QGSNLSITDGWEKVYPE AAAAAGRNSD TLGLPSDDSEDGDYDPD+PDTIDQDNE SSDESS  QSNSDTSGYASASEGLEVPP DDQYLGLPSDDSED
Subjt:  QGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNESSSDESSCGQSNSDTSGYASASEGLEVPPPDDQYLGLPSDDSED

Query:  DDYDPSVPELDEGVRRESSSSDFTSDSEDLAALDNNRSSKDDDLVSSSLNNTMSLKNSNGQSSGCRPSKSALHNELSSLLESGPDKDGLEPVLGRRQVER
        +DYDPSVPELDEG R+ESSSSDFTSDSEDLAAL+NN SSKDDDLV SSLNNT+ +KN+NG+SSG  PSKS LHNELSSLL+SG DKDGLEP+ GRRQVER
Subjt:  DDYDPSVPELDEGVRRESSSSDFTSDSEDLAALDNNRSSKDDDLVSSSLNNTMSLKNSNGQSSGCRPSKSALHNELSSLLESGPDKDGLEPVLGRRQVER

Query:  LDYKKLHDETYGSVPTDSSDDTYGSTSMDSSDDRGWDSSTRHRGPKKLVLALSNNGTNDDLTNVKTKRSYKRRTRQKSAAINVNDSVTDTPVDNAKSSSS
        LDYKKLHDETYG+VPT+SSDDTYGST +DSSDDRG DS TR RGPK LVLALSNNG+NDDLTNVKTKRSYKRRTRQK  AINVN+SVT+TPVD AKSSSS
Subjt:  LDYKKLHDETYGSVPTDSSDDTYGSTSMDSSDDRGWDSSTRHRGPKKLVLALSNNGTNDDLTNVKTKRSYKRRTRQKSAAINVNDSVTDTPVDNAKSSSS

Query:  VRQTTSSSNRRLSQPALERLFASFQENEYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSGGNRAKSTSRMSIHSSQAS-ELPKNEQESGACFRD
        VRQ TSSSNRRLSQPALERLFASFQENEYP+RATKESLAQELGL+LKQVSKWFENTRWSTRHPSSGG +AKS+SRMSIH SQAS EL KNEQES  CFRD
Subjt:  VRQTTSSSNRRLSQPALERLFASFQENEYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSGGNRAKSTSRMSIHSSQAS-ELPKNEQESGACFRD

Query:  TDNNGAQHQDLPTVNNVVASCQSGDTGDKKSVTQKTKRAESSATKSRKRKSKSDHTASHSKDREESPRPPAKSPKVNEIQTADRFKTRRRRSI
        TD+NGA+HQDLP  N+VVASCQSGDTGDKK  T+KTKR ESSATKSRKRK +SD+TAS+SKDRE SPRPPAKSPKVNE QTADRFKTRRRRSI
Subjt:  TDNNGAQHQDLPTVNNVVASCQSGDTGDKKSVTQKTKRAESSATKSRKRKSKSDHTASHSKDREESPRPPAKSPKVNEIQTADRFKTRRRRSI

XP_011651230.2 homeobox protein HOX1A [Cucumis sativus] (e value: 0.0e+00; % identity: 87.31)
Query:  MEERDENTDTESRHNNNVEAVQEAKASVEVEVLTCLSNEPMHSGYQELGTTPEYSSKTDGPDEEKPGVQQNMEFGSGYLLSELSENNNETISNHADNDQV
        MEERDENTDTESR N   EAVQEAKASVEVEVLTCLSNE  +SGYQELGTTPE+SSK DGPDEEK GVQQNME GSGYLLSELSE +N+TISNHADND+V
Subjt:  MEERDENTDTESRHNNNVEAVQEAKASVEVEVLTCLSNEPMHSGYQELGTTPEYSSKTDGPDEEKPGVQQNMEFGSGYLLSELSENNNETISNHADNDQV

Query:  EAGNLLSSDKDTENLKLPIEVGATTLLNECAELPVEDVNKNYIELMNPPIEDLTQNTSIQKLEIVPSNSQQLGHKDKRILKSKKKNYKLRSLVNSDRVLR
        EAGNLLS+DKDT+NLKL IE  ATTLLNEC+ELP+EDV KNYIE MNPPI DLTQ TSIQ LE +PSNSQQ   KDK  LKSKKKNYKLRS V+SDRVLR
Subjt:  EAGNLLSSDKDTENLKLPIEVGATTLLNECAELPVEDVNKNYIELMNPPIEDLTQNTSIQKLEIVPSNSQQLGHKDKRILKSKKKNYKLRSLVNSDRVLR

Query:  SRTQEKAKAPEPSNDLNNFTAEEGKRKKRKKKRNIQGKGARVDEYSSIKNHLRYLLNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKL
        SRTQEKAKAPE SNDLNNFTAEE  ++K+KKKRNIQGKGARVDEYSSI+NHLRYLLNRI+YEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKL
Subjt:  SRTQEKAKAPEPSNDLNNFTAEEGKRKKRKKKRNIQGKGARVDEYSSIKNHLRYLLNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKL

Query:  KIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTEIPPDDEGWLCPGCDCKDDCLDLLNEF
        KIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNT+IPPDDEGWLCPGCDCKDDCLDLLNEF
Subjt:  KIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTEIPPDDEGWLCPGCDCKDDCLDLLNEF

Query:  QGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNESSSDESSCGQ-----SNSDTSGYASASEGLEVPPPDDQYLGLPS
        QGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNE SSDESS  Q     SNSDTSGYASASEGLEV   DDQYLGLPS
Subjt:  QGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNESSSDESSCGQ-----SNSDTSGYASASEGLEVPPPDDQYLGLPS

Query:  DDSEDDDYDPSVPELDEGVRRESSSSDFTSDSEDLAALDNNRSSKDDDLVSSSLNNTMSLKNSNGQSSGCRPSKSALHNELSSLLESGPDKDGLEPVLGR
        DDSED+DYDPSVPELDEGVR+ESSSSDFTSDSEDLAALDNN SSKD DLV SSLNNT+ +KNSNGQSSG  P+KSALHNELSSLL+SGPDKDGLEPV GR
Subjt:  DDSEDDDYDPSVPELDEGVRRESSSSDFTSDSEDLAALDNNRSSKDDDLVSSSLNNTMSLKNSNGQSSGCRPSKSALHNELSSLLESGPDKDGLEPVLGR

Query:  RQVERLDYKKLHDETYGSVPTDSSDDTYGSTSMDSSDDRGWDSSTRHRGPKKLVLALSNNGTNDDLTNVKTKRSYKRRTRQKSAAINVNDSVTDTPVDNA
        RQVERLDYKKLHDETYG+VPTDSSDDTYGST +DSSDDRGWDS TR RGPK LVLALSNNG+NDDLTNVKTKRSYKRRTRQK  AINVN+SVT+TPVD A
Subjt:  RQVERLDYKKLHDETYGSVPTDSSDDTYGSTSMDSSDDRGWDSSTRHRGPKKLVLALSNNGTNDDLTNVKTKRSYKRRTRQKSAAINVNDSVTDTPVDNA

Query:  KSSSSVRQTTSSSNRRLSQPALERLFASFQENEYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSGGNRAKSTSRMSIHSSQAS-ELPKNEQESG
        KSSSSV+++TSSSNRRLSQPALERL ASFQENEYP+RATK+SLAQELGL LKQVSKWFENTRWSTRHPSS G +AKS+SRMSI+ SQAS EL KNE ES 
Subjt:  KSSSSVRQTTSSSNRRLSQPALERLFASFQENEYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSGGNRAKSTSRMSIHSSQAS-ELPKNEQESG

Query:  ACFRDTDNNGAQHQDLPTVNNVVASCQSGDTGDKKSVTQKTKRAESSATKSRKRKSKSDHTASHSKDREESPRPPAKSPKVNEIQTADRFKTRRRRSI
         CFRDTD+NGA+HQDLP  N+VVASCQSGDTGDKK  ++KTKRA+SSATKSRKRK +SD+TASHSKDRE SPRPPAKSPKVNE+QTADRFKTRRRRSI
Subjt:  ACFRDTDNNGAQHQDLPTVNNVVASCQSGDTGDKKSVTQKTKRAESSATKSRKRKSKSDHTASHSKDREESPRPPAKSPKVNEIQTADRFKTRRRRSI

XP_022149322.1 homeobox protein HAT3.1 isoform X1 [Momordica charantia] (e value: 0.0e+00; % identity: 77.9)
Query:  MEERDENTDTESRHNNNVEAVQEAKASVEVEVLTCLSNEPMHS--GYQELGTTPEYSSKTDGPDEEKPGVQQNM-----EFGSGYLLSELSENNNETISN
        MEER E   TE R NNN EAVQEAKAS  VEVLTC SNE MHS    QELGTTPE +SKT GPD+EK GVQQNM     E GSG +LSEL E NN+TIS 
Subjt:  MEERDENTDTESRHNNNVEAVQEAKASVEVEVLTCLSNEPMHS--GYQELGTTPEYSSKTDGPDEEKPGVQQNM-----EFGSGYLLSELSENNNETISN

Query:  HADNDQVEAGNLLSSDKDTENLKLPIEVGATTLLNECAELPVEDVNKNYIELMNPPIEDLTQNTSIQKLEIVP----SNSQQLGHKDKRILKSKKKNYKL
         A+ DQVEAGNLLSSD +TENL LPIE+  TT LNEC+ELP ED NKN I+ +NPPIEDLTQNTSIQ+LE VP    S SQQLGHKDK+ILKSKKKNY L
Subjt:  HADNDQVEAGNLLSSDKDTENLKLPIEVGATTLLNECAELPVEDVNKNYIELMNPPIEDLTQNTSIQKLEIVP----SNSQQLGHKDKRILKSKKKNYKL

Query:  RSLVNSDRVLRSRTQEKAKAPEPSNDLNNFTAEEGKRKKRKKKRNIQGKGARVDEYSSIKNHLRYLLNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQ
        RSLV+SDRVLRSRTQEKAKAPEPSN+LN  TA EGKRK  KKKRNI+GKGA  DE+SSI+N LRYL+NRIKYEQSLI+AYSSEGWKGFSSDKLKPEKELQ
Subjt:  RSLVNSDRVLRSRTQEKAKAPEPSNDLNNFTAEEGKRKKRKKKRNIQGKGARVDEYSSIKNHLRYLLNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQ

Query:  RASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTEIPPDDEGWLCPGCDC
        RAS+EIMR KLKIRDLFQ +D+LCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNT+IPPDDEGWLCPGCDC
Subjt:  RASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTEIPPDDEGWLCPGCDC

Query:  KDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNESSSDESSCGQSNSDTSGYASASEGLEVPPPDDQ
        KDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAG+NSDH LGLPSDDSEDGDYDPD PDTI+Q++ESSSD     QS+SD SGYASASE LE  P DDQ
Subjt:  KDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNESSSDESSCGQSNSDTSGYASASEGLEVPPPDDQ

Query:  YLGLPSDDSEDDDYDPSVPELDEGVRRESSSSDFTSDSEDLAALDNNRSSKDDDLVSSSLNNTMSLKNSNGQSSGCRPSKSALHNELSSLLESGPDKDGL
        YLGLPSDDSEDDDY+P  PELDEGV++ESS SDFTSDSEDLAALD               + T  ++NSNGQ SGC P  S LHNEL SLLESGPDKDGL
Subjt:  YLGLPSDDSEDDDYDPSVPELDEGVRRESSSSDFTSDSEDLAALDNNRSSKDDDLVSSSLNNTMSLKNSNGQSSGCRPSKSALHNELSSLLESGPDKDGL

Query:  EPVLGRRQVERLDYKKLHDETYGSVPTDSSDDTYGSTSMDSSDDRGWDSSTRHRGPKKLVLALSNNGTNDDLTNVKTKRSYKRRTRQKSAAINVNDSVTD
        EPV GRRQVERLDYKKLHDETYG+VP+DSSDDT+GS S+DSSDDRG  S TR R PK LV AL  NGTNDDL N KTKRSYKRRT QK  A N+ +SVT 
Subjt:  EPVLGRRQVERLDYKKLHDETYGSVPTDSSDDTYGSTSMDSSDDRGWDSSTRHRGPKKLVLALSNNGTNDDLTNVKTKRSYKRRTRQKSAAINVNDSVTD

Query:  TPVDNAKSSSSVRQTTSSSNRRLSQPALERLFASFQENEYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSS-GGNRAKSTSRMSIHSSQAS-ELP
        TP D+ KSSSSVR+T SSSNRRLSQPALERL ASFQEN+YP+RATKESLAQELGLSLKQVSKWFENTRWSTRHPSS   N+AKS  RM I SS+ S +LP
Subjt:  TPVDNAKSSSSVRQTTSSSNRRLSQPALERLFASFQENEYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSS-GGNRAKSTSRMSIHSSQAS-ELP

Query:  KNEQESGACFRDTDNNGAQHQDLPTVNNVVASCQSGDTGDKKSVTQKTKRAESSATKSRKRKSKSDHTASHSKDREESPRPPAKSPKVNEIQTADRFKTR
        K EQESGACFRDTDNNGAQHQ  P  +  VA CQSGDT D K  TQKT R ES+ATKSRKRK +SDH ASHSKDR+ES +PPAKSPKVN+IQTAD+ +TR
Subjt:  KNEQESGACFRDTDNNGAQHQDLPTVNNVVASCQSGDTGDKKSVTQKTKRAESSATKSRKRKSKSDHTASHSKDREESPRPPAKSPKVNEIQTADRFKTR

Query:  RRRSI
        RRRSI
Subjt:  RRRSI

XP_038876083.1 homeobox protein HAT3.1 isoform X1 [Benincasa hispida] (e value: 0.0e+00; % identity: 90.17)
Query:  MEERDENTDTESRHNNNVEAVQEAKASVEVEVLTCLSNEPMHSGYQELGTTPEYSSKTDGPDEEKPGVQQNMEFGSGYLLSELSENNNETISNHADNDQV
        MEERDENTDTESR NN+ E VQEAKASVEVEVLTCLSNEPMHSGYQELGTTPEYSSKTDGPDEEKPGVQQNME GSGYLLSEL E +N+T+SNHADNDQV
Subjt:  MEERDENTDTESRHNNNVEAVQEAKASVEVEVLTCLSNEPMHSGYQELGTTPEYSSKTDGPDEEKPGVQQNMEFGSGYLLSELSENNNETISNHADNDQV

Query:  EAGNLLSSDKDTENLKLPIEVGATTLLNECAELPVEDVNKNYIELMNPPIEDLTQNTSIQKLEIVPSNSQQLGHKDKRILKSKKKNYKLRSLVNSDRVLR
        EAGNLLSSDKDTENLKLPIEV  TTLLNEC+ELPVEDVNKN+IE MNPPIEDLTQN SIQ LE +PSNSQQLG KDK ILKSKK NY+LRSLV+SDRVLR
Subjt:  EAGNLLSSDKDTENLKLPIEVGATTLLNECAELPVEDVNKNYIELMNPPIEDLTQNTSIQKLEIVPSNSQQLGHKDKRILKSKKKNYKLRSLVNSDRVLR

Query:  SRTQEKAKAPEPSNDLNNFTAEEGKRKKRKKKRNIQGKGARVDEYSSIKNHLRYLLNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKL
        SRTQEKAKAPEPSN LNNFTAEEGKRKK+KKKRNIQGK ARVDEYSSI+  LRYLLNRI YEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIM+RKL
Subjt:  SRTQEKAKAPEPSNDLNNFTAEEGKRKKRKKKRNIQGKGARVDEYSSIKNHLRYLLNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKL

Query:  KIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTEIPPDDEGWLCPGCDCKDDCLDLLNEF
        KIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNT+IPPDDEGWLCPGCDCKDDCLDLLNEF
Subjt:  KIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTEIPPDDEGWLCPGCDCKDDCLDLLNEF

Query:  QGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNESSSDESSCG--QSNSDTSGYASASEGLEVPPPDDQYLGLPSDDS
        QGSNLSITD WEKVYPEAAAAAAG+NSDHTLGLPSDDSEDGDYDPDVPDTIDQDNESSSDESS    QSNSDTSGYASASEGLEVPP DDQYLGLPSDDS
Subjt:  QGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNESSSDESSCG--QSNSDTSGYASASEGLEVPPPDDQYLGLPSDDS

Query:  EDDDYDPSVPELDEGVRRESSSSDFTSDSEDLAALDNNRSSKDDDLVSSSLNNTMSLKNSNGQSSGCRPSKSALHNELSSLLESGPDKDGLEPVLGRRQV
        EDDDYDPSVPELDEGVRRESSSSDFTSDSEDLAALDNNR SKDDD V SSLNNT+S+KNSNGQSSGC PSKSALHNELSSL      KDGLEPV GRRQV
Subjt:  EDDDYDPSVPELDEGVRRESSSSDFTSDSEDLAALDNNRSSKDDDLVSSSLNNTMSLKNSNGQSSGCRPSKSALHNELSSLLESGPDKDGLEPVLGRRQV

Query:  ERLDYKKLHDETYGSVPTDSSDDTYGSTSMDSSDDRGWDSSTRHRGPKKLVLALSNNGTNDDLTNVKTKRSYKRRTRQKSAAINVNDSVTDTPVDNAKSS
        ERLDYKKLHDETYG+VPTDSSDDTYGSTSMDSS DRGWDSSTR RGP+ LVLALSNNGTNDDLTNVKTKRS+K RTRQK+AAINVN+SVT+TPVD AKSS
Subjt:  ERLDYKKLHDETYGSVPTDSSDDTYGSTSMDSSDDRGWDSSTRHRGPKKLVLALSNNGTNDDLTNVKTKRSYKRRTRQKSAAINVNDSVTDTPVDNAKSS

Query:  SSVRQTTSSSNRRLSQPALERLFASFQENEYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSGGNRAKSTSRMSIHSSQAS-ELPKNEQESGACF
        SS RQTTSSSNRRLSQPALERLFASFQENEYP+RATKESLAQELGLSLKQVS+WFENTRWSTRHPSSGGNRAKS+SRMS  SS+AS ELPKNEQESGACF
Subjt:  SSVRQTTSSSNRRLSQPALERLFASFQENEYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSGGNRAKSTSRMSIHSSQAS-ELPKNEQESGACF

Query:  RDTDNNGAQHQDLPTVNNVVASCQSGDTGDKKSVTQKTKRAESSATKSRKRKSKSDHTASHSKDREESPRPPAKSPKVNEIQTADRFKTRRRRSI
        RDTD+NGAQHQDLPT N+    CQSGDTGDKK VT+KTKRAESSATKSRKRK  SDH ASH+KD+E S RPPAKSPKVNEIQTADRFKTRRRRSI
Subjt:  RDTDNNGAQHQDLPTVNNVVASCQSGDTGDKKSVTQKTKRAESSATKSRKRKSKSDHTASHSKDREESPRPPAKSPKVNEIQTADRFKTRRRRSI

XP_038876114.1 homeobox protein HAT3.1 isoform X2 [Benincasa hispida] (e value: 0.0e+00; % identity: 91.08)
Query:  MEERDENTDTESRHNNNVEAVQEAKASVEVEVLTCLSNEPMHSGYQELGTTPEYSSKTDGPDEEKPGVQQNMEFGSGYLLSELSENNNETISNHADNDQV
        MEERDENTDTESR NN+ E VQEAKASVEVEVLTCLSNEPMHSGYQELGTTPEYSSKTDGPDEEKPGVQQNME GSGYLLSEL E +N+T+SNHADNDQV
Subjt:  MEERDENTDTESRHNNNVEAVQEAKASVEVEVLTCLSNEPMHSGYQELGTTPEYSSKTDGPDEEKPGVQQNMEFGSGYLLSELSENNNETISNHADNDQV

Query:  EAGNLLSSDKDTENLKLPIEVGATTLLNECAELPVEDVNKNYIELMNPPIEDLTQNTSIQKLEIVPSNSQQLGHKDKRILKSKKKNYKLRSLVNSDRVLR
        EAGNLLSSDKDTENLKLPIEV  TTLLNEC+ELPVEDVNKN+IE MNPPIEDLTQN SIQ LE +PSNSQQLG KDK ILKSKK NY+LRSLV+SDRVLR
Subjt:  EAGNLLSSDKDTENLKLPIEVGATTLLNECAELPVEDVNKNYIELMNPPIEDLTQNTSIQKLEIVPSNSQQLGHKDKRILKSKKKNYKLRSLVNSDRVLR

Query:  SRTQEKAKAPEPSNDLNNFTAEEGKRKKRKKKRNIQGKGARVDEYSSIKNHLRYLLNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKL
        SRTQEKAKAPEPSN LNNFTAEEGKRKK+KKKRNIQGK ARVDEYSSI+  LRYLLNRI YEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIM+RKL
Subjt:  SRTQEKAKAPEPSNDLNNFTAEEGKRKKRKKKRNIQGKGARVDEYSSIKNHLRYLLNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKL

Query:  KIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTEIPPDDEGWLCPGCDCKDDCLDLLNEF
        KIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNT+IPPDDEGWLCPGCDCKDDCLDLLNEF
Subjt:  KIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTEIPPDDEGWLCPGCDCKDDCLDLLNEF

Query:  QGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNESSSDESSCG--QSNSDTSGYASASEGLEVPPPDDQYLGLPSDDS
        QGSNLSITD WEKVYPEAAAAAAG+NSDHTLGLPSDDSEDGDYDPDVPDTIDQDNESSSDESS    QSNSDTSGYASASEGLEVPP DDQYLGLPSDDS
Subjt:  QGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNESSSDESSCG--QSNSDTSGYASASEGLEVPPPDDQYLGLPSDDS

Query:  EDDDYDPSVPELDEGVRRESSSSDFTSDSEDLAALDNNRSSKDDDLVSSSLNNTMSLKNSNGQSSGCRPSKSALHNELSSLLESGPDKDGLEPVLGRRQV
        EDDDYDPSVPELDEGVRRESSSSDFTSDSEDLAALDNNR SKDDD V SSLNNT+S+KNSNGQSSGC PSKSALHNELSSL      KDGLEPV GRRQV
Subjt:  EDDDYDPSVPELDEGVRRESSSSDFTSDSEDLAALDNNRSSKDDDLVSSSLNNTMSLKNSNGQSSGCRPSKSALHNELSSLLESGPDKDGLEPVLGRRQV

Query:  ERLDYKKLHDETYGSVPTDSSDDTYGSTSMDSSDDRGWDSSTRHRGPKKLVLALSNNGTNDDLTNVKTKRSYKRRTRQKSAAINVNDSVTDTPVDNAKSS
        ERLDYKKLHDETYG+VPTDSSDDTYGSTSMDSS DRGWDSSTR RGP+ LVLALSNNGTNDDLTNVKTKRS+K RTRQK+AAINVN+SVT+TPVD AKSS
Subjt:  ERLDYKKLHDETYGSVPTDSSDDTYGSTSMDSSDDRGWDSSTRHRGPKKLVLALSNNGTNDDLTNVKTKRSYKRRTRQKSAAINVNDSVTDTPVDNAKSS

Query:  SSVRQTTSSSNRRLSQPALERLFASFQENEYPERATKESLAQELGLSLKQV
        SS RQTTSSSNRRLSQPALERLFASFQENEYP+RATKESLAQELGLSLKQ+
Subjt:  SSVRQTTSSSNRRLSQPALERLFASFQENEYPERATKESLAQELGLSLKQV

TrEMBL top hits (e value, % identity, alignment)
A0A1S3C283 pathogenesis-related homeodomain protein (e value: 0.0e+00; % identity: 88.02)
Query:  MEERDENTDTESRHNNNVEAVQEAKASVEVEVLTCLSNEPMHSGYQELGTTPEYSSKTDGPDEEKPGVQQNMEFGSGYLLSELSENNNETISNHADNDQV
        MEERDENTDTESR N   EAVQEAKASVEVEV TCLSNEPM+SGYQELGTTPE+S KTDGPDEEK GVQQNME GSGYLLSELSE +N+TISNHADNDQV
Subjt:  MEERDENTDTESRHNNNVEAVQEAKASVEVEVLTCLSNEPMHSGYQELGTTPEYSSKTDGPDEEKPGVQQNMEFGSGYLLSELSENNNETISNHADNDQV

Query:  EAGNLLSSDKDTENLKLPIEVGATTLLNECAELPVEDVNKNYIELMNPPIEDLTQNTSIQKLEIVPSNSQQLGHKDKRILKSKKKNYKLRSLVNSDRVLR
        EAGN LS DKDT+NLKL IE   TTLLNEC+ELP+EDV KNYIE MNPPIEDLTQ TSIQ LE +PSNSQQL HKD+R  KSKKKNYKLRSLV+SDRVLR
Subjt:  EAGNLLSSDKDTENLKLPIEVGATTLLNECAELPVEDVNKNYIELMNPPIEDLTQNTSIQKLEIVPSNSQQLGHKDKRILKSKKKNYKLRSLVNSDRVLR

Query:  SRTQEKAKAPEPSNDLNNFTAEEGKRKKRKKKRNIQGKGARVDEYSSIKNHLRYLLNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKL
        SRTQEKAKAPEPSNDLNNFTAEE  ++K+KKKRNIQGKGARVDEYSSI+NHLRYLLNRI+YEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKL
Subjt:  SRTQEKAKAPEPSNDLNNFTAEEGKRKKRKKKRNIQGKGARVDEYSSIKNHLRYLLNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKL

Query:  KIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTEIPPDDEGWLCPGCDCKDDCLDLLNEF
        KIRDLFQRID LCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNT+IPPDDEGWLCPGCDCKDDCLDLLNEF
Subjt:  KIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTEIPPDDEGWLCPGCDCKDDCLDLLNEF

Query:  QGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNESSSDESSCGQSNSDTSGYASASEGLEVPPPDDQYLGLPSDDSED
        QGSNLSITDGWEKVYPE AAAAAGRNSD TLGLPSDDSEDGDYDPD+PDTIDQDNE SSDESS  QSNSDTSGYASASEGLEVPP DDQYLGLPSDDSED
Subjt:  QGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNESSSDESSCGQSNSDTSGYASASEGLEVPPPDDQYLGLPSDDSED

Query:  DDYDPSVPELDEGVRRESSSSDFTSDSEDLAALDNNRSSKDDDLVSSSLNNTMSLKNSNGQSSGCRPSKSALHNELSSLLESGPDKDGLEPVLGRRQVER
        +DYDPSVPELDEG R+ESSSSDFTSDSEDLAAL+NN SSKDDDLV SSLNNT+ +KN+NG+SSG  PSKS LHNELSSLL+SG DKDGLEP+ GRRQVER
Subjt:  DDYDPSVPELDEGVRRESSSSDFTSDSEDLAALDNNRSSKDDDLVSSSLNNTMSLKNSNGQSSGCRPSKSALHNELSSLLESGPDKDGLEPVLGRRQVER

Query:  LDYKKLHDETYGSVPTDSSDDTYGSTSMDSSDDRGWDSSTRHRGPKKLVLALSNNGTNDDLTNVKTKRSYKRRTRQKSAAINVNDSVTDTPVDNAKSSSS
        LDYKKLHDETYG+VPT+SSDDTYGST +DSSDDRG DS TR RGPK LVLALSNNG+NDDLTNVKTKRSYKRRTRQK  AINVN+SVT+TPVD AKSSSS
Subjt:  LDYKKLHDETYGSVPTDSSDDTYGSTSMDSSDDRGWDSSTRHRGPKKLVLALSNNGTNDDLTNVKTKRSYKRRTRQKSAAINVNDSVTDTPVDNAKSSSS

Query:  VRQTTSSSNRRLSQPALERLFASFQENEYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSGGNRAKSTSRMSIHSSQAS-ELPKNEQESGACFRD
        VRQ TSSSNRRLSQPALERLFASFQENEYP+RATKESLAQELGL+LKQVSKWFENTRWSTRHPSSGG +AKS+SRMSIH SQAS EL KNEQES  CFRD
Subjt:  VRQTTSSSNRRLSQPALERLFASFQENEYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSGGNRAKSTSRMSIHSSQAS-ELPKNEQESGACFRD

Query:  TDNNGAQHQDLPTVNNVVASCQSGDTGDKKSVTQKTKRAESSATKSRKRKSKSDHTASHSKDREESPRPPAKSPKVNEIQTADRFKTRRRRSI
        TD+NGA+HQDLP  N+VVASCQSGDTGDKK  T+KTKR ESSATKSRKRK +SD+TAS+SKDRE SPRPPAKSPKVNE QTADRFKTRRRRSI
Subjt:  TDNNGAQHQDLPTVNNVVASCQSGDTGDKKSVTQKTKRAESSATKSRKRKSKSDHTASHSKDREESPRPPAKSPKVNEIQTADRFKTRRRRSI

A0A6J1D6Q5 homeobox protein HAT3.1 isoform X1 (e value: 0.0e+00; % identity: 77.9)
Query:  MEERDENTDTESRHNNNVEAVQEAKASVEVEVLTCLSNEPMHS--GYQELGTTPEYSSKTDGPDEEKPGVQQNM-----EFGSGYLLSELSENNNETISN
        MEER E   TE R NNN EAVQEAKAS  VEVLTC SNE MHS    QELGTTPE +SKT GPD+EK GVQQNM     E GSG +LSEL E NN+TIS 
Subjt:  MEERDENTDTESRHNNNVEAVQEAKASVEVEVLTCLSNEPMHS--GYQELGTTPEYSSKTDGPDEEKPGVQQNM-----EFGSGYLLSELSENNNETISN

Query:  HADNDQVEAGNLLSSDKDTENLKLPIEVGATTLLNECAELPVEDVNKNYIELMNPPIEDLTQNTSIQKLEIVP----SNSQQLGHKDKRILKSKKKNYKL
         A+ DQVEAGNLLSSD +TENL LPIE+  TT LNEC+ELP ED NKN I+ +NPPIEDLTQNTSIQ+LE VP    S SQQLGHKDK+ILKSKKKNY L
Subjt:  HADNDQVEAGNLLSSDKDTENLKLPIEVGATTLLNECAELPVEDVNKNYIELMNPPIEDLTQNTSIQKLEIVP----SNSQQLGHKDKRILKSKKKNYKL

Query:  RSLVNSDRVLRSRTQEKAKAPEPSNDLNNFTAEEGKRKKRKKKRNIQGKGARVDEYSSIKNHLRYLLNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQ
        RSLV+SDRVLRSRTQEKAKAPEPSN+LN  TA EGKRK  KKKRNI+GKGA  DE+SSI+N LRYL+NRIKYEQSLI+AYSSEGWKGFSSDKLKPEKELQ
Subjt:  RSLVNSDRVLRSRTQEKAKAPEPSNDLNNFTAEEGKRKKRKKKRNIQGKGARVDEYSSIKNHLRYLLNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQ

Query:  RASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTEIPPDDEGWLCPGCDC
        RAS+EIMR KLKIRDLFQ +D+LCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNT+IPPDDEGWLCPGCDC
Subjt:  RASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTEIPPDDEGWLCPGCDC

Query:  KDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNESSSDESSCGQSNSDTSGYASASEGLEVPPPDDQ
        KDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAG+NSDH LGLPSDDSEDGDYDPD PDTI+Q++ESSSD     QS+SD SGYASASE LE  P DDQ
Subjt:  KDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNESSSDESSCGQSNSDTSGYASASEGLEVPPPDDQ

Query:  YLGLPSDDSEDDDYDPSVPELDEGVRRESSSSDFTSDSEDLAALDNNRSSKDDDLVSSSLNNTMSLKNSNGQSSGCRPSKSALHNELSSLLESGPDKDGL
        YLGLPSDDSEDDDY+P  PELDEGV++ESS SDFTSDSEDLAALD               + T  ++NSNGQ SGC P  S LHNEL SLLESGPDKDGL
Subjt:  YLGLPSDDSEDDDYDPSVPELDEGVRRESSSSDFTSDSEDLAALDNNRSSKDDDLVSSSLNNTMSLKNSNGQSSGCRPSKSALHNELSSLLESGPDKDGL

Query:  EPVLGRRQVERLDYKKLHDETYGSVPTDSSDDTYGSTSMDSSDDRGWDSSTRHRGPKKLVLALSNNGTNDDLTNVKTKRSYKRRTRQKSAAINVNDSVTD
        EPV GRRQVERLDYKKLHDETYG+VP+DSSDDT+GS S+DSSDDRG  S TR R PK LV AL  NGTNDDL N KTKRSYKRRT QK  A N+ +SVT 
Subjt:  EPVLGRRQVERLDYKKLHDETYGSVPTDSSDDTYGSTSMDSSDDRGWDSSTRHRGPKKLVLALSNNGTNDDLTNVKTKRSYKRRTRQKSAAINVNDSVTD

Query:  TPVDNAKSSSSVRQTTSSSNRRLSQPALERLFASFQENEYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSS-GGNRAKSTSRMSIHSSQAS-ELP
        TP D+ KSSSSVR+T SSSNRRLSQPALERL ASFQEN+YP+RATKESLAQELGLSLKQVSKWFENTRWSTRHPSS   N+AKS  RM I SS+ S +LP
Subjt:  TPVDNAKSSSSVRQTTSSSNRRLSQPALERLFASFQENEYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSS-GGNRAKSTSRMSIHSSQAS-ELP

Query:  KNEQESGACFRDTDNNGAQHQDLPTVNNVVASCQSGDTGDKKSVTQKTKRAESSATKSRKRKSKSDHTASHSKDREESPRPPAKSPKVNEIQTADRFKTR
        K EQESGACFRDTDNNGAQHQ  P  +  VA CQSGDT D K  TQKT R ES+ATKSRKRK +SDH ASHSKDR+ES +PPAKSPKVN+IQTAD+ +TR
Subjt:  KNEQESGACFRDTDNNGAQHQDLPTVNNVVASCQSGDTGDKKSVTQKTKRAESSATKSRKRKSKSDHTASHSKDREESPRPPAKSPKVNEIQTADRFKTR

Query:  RRRSI
        RRRSI
Subjt:  RRRSI

A0A6J1E4I6 homeobox protein HAT3.1-like (e value: 0.0e+00; % identity: 77.38)
Query:  MEERDENTDTESRHNNNVEAVQEAKASVEVEVLTCLSNEPMHS--GYQELGTTPEYSSKTDGPDEEKPGVQQNM-----EFGSGYLLSELSENNNETISN
        MEERDE   TESR NNN EAVQEAK SVE E+ TCLSNE  HS   Y EL  TP YS+KT G DEEKP VQQNM     E GSG +L ELSE +N+T SN
Subjt:  MEERDENTDTESRHNNNVEAVQEAKASVEVEVLTCLSNEPMHS--GYQELGTTPEYSSKTDGPDEEKPGVQQNM-----EFGSGYLLSELSENNNETISN

Query:  HADNDQVEAGNLLSSDKDTENLKLPIEVGATTLLNECAELPVEDVNKNYIELMNPPIEDLTQNTSIQKLEIVPSNSQQLGHKDKRILKSKKKNYKLRSLV
         ADNDQVEAGNLL  DKDTENL +PIEV  TTLL +C+ELP E VNKNYIE MNPP E LTQNT  Q LE VPSNS+Q  HKDKRILKS K N  LRSLV
Subjt:  HADNDQVEAGNLLSSDKDTENLKLPIEVGATTLLNECAELPVEDVNKNYIELMNPPIEDLTQNTSIQKLEIVPSNSQQLGHKDKRILKSKKKNYKLRSLV

Query:  NSDRVLRSRTQEKAKAPEPSNDLNNFTAEEGKRKKRKKKRNIQGKGARVDEYSSIKNHLRYLLNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASN
        +SDR LRS+TQEK K PEPSNDLNNFTAEEGK K  KK+RNIQGKGARVDE+SSI+NHLRYLLNRIKYEQ+LIEAYSSEGWKGFSSDKLKPEKELQRASN
Subjt:  NSDRVLRSRTQEKAKAPEPSNDLNNFTAEEGKRKKRKKKRNIQGKGARVDEYSSIKNHLRYLLNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASN

Query:  EIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTEIPPDDEGWLCPGCDCKDDC
        EIMRRKLKIRD+FQRIDALC EG LS+SLFDS+GQIDSEDIFCAKCGSKELS ENDIILCDGICDRGFHQFCLEPPLLNT+IPPDDEGWLCPGCDCKDDC
Subjt:  EIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTEIPPDDEGWLCPGCDCKDDC

Query:  LDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDS-EDGDYDPDVPDTIDQDNESSSDESSCGQSNSDTSGYASASEGLEVPPPDDQYLG
        L+LLNEFQGS LSITDGWEKVYPEAAA+AAGRN DH  GLPSDDS +D DYDPDVPDTI QD+ESS           +TSGYASASE LE PP  DQYLG
Subjt:  LDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDS-EDGDYDPDVPDTIDQDNESSSDESSCGQSNSDTSGYASASEGLEVPPPDDQYLG

Query:  LPSDDSEDDDYDPSVPELDEGVRRESSSSDFTSDSEDLAALDNNRSSKDDDLVSSSLNNTMSLKNSNGQSSGCRPSKSALHNELSSLLESGPDKDGLEPV
        LPSDDSEDDDYDPS PE DE VR+ESSSSDFTSDSEDLAALD+N SSK D+LVS SLNNT S+KN +G+SSG  P KSAL+NELSSLLESGPDKDG EPV
Subjt:  LPSDDSEDDDYDPSVPELDEGVRRESSSSDFTSDSEDLAALDNNRSSKDDDLVSSSLNNTMSLKNSNGQSSGCRPSKSALHNELSSLLESGPDKDGLEPV

Query:  LGRRQVERLDYKKLHDETYGSVPTDSSDDTYGSTSMDSSDDRGWDSSTRHRGPKKLVLALSNNGTNDDLTNVKTKRSYKRRTRQKSAAINVNDSVTDTPV
        LGRRQVERLDYKKLHDETYG+VPTDSSDDTY S SMDSSDD+GWDS+TR R PK LVLAL N  TNDDLTN+KTK S KR TRQK+ A N+N SV+ TP 
Subjt:  LGRRQVERLDYKKLHDETYGSVPTDSSDDTYGSTSMDSSDDRGWDSSTRHRGPKKLVLALSNNGTNDDLTNVKTKRSYKRRTRQKSAAINVNDSVTDTPV

Query:  DNAKSSSSVRQTTSSSNRRLSQPALERLFASFQENEYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSS-GGNRAKSTSRMSIHSSQAS-ELPKNE
        D  K+SSSVR+TT SS RRLS+ ALERL ASFQEN+YPERATKESLAQELGLS+KQVSKWF NTRWSTRHPSS  GN+AKS+SRM IHSSQAS EL + E
Subjt:  DNAKSSSSVRQTTSSSNRRLSQPALERLFASFQENEYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSS-GGNRAKSTSRMSIHSSQAS-ELPKNE

Query:  QESGACFRDTDNNGAQHQDLPTVNNVVASCQSGDTGDKKSVTQKTKRAESSATKSRKRKSKSDHTASHSKDREESPRPPAKSPKVNEIQTADRFKTRRRR
        QE           GAQHQ+LPT ++VVA CQSGDTGD K  TQ+TKR+E SATKSRKRK +SDH AS SKD +ES RPPAKSPKVNEIQTA   KTRRR 
Subjt:  QESGACFRDTDNNGAQHQDLPTVNNVVASCQSGDTGDKKSVTQKTKRAESSATKSRKRKSKSDHTASHSKDREESPRPPAKSPKVNEIQTADRFKTRRRR

Query:  SI
        S+
Subjt:  SI

A0A6J1IPM8 homeobox protein HAT3.1-like (e value: 0.0e+00; % identity: 75.75)
Query:  MEERDENTDTESRHNNNVEAVQEAKASVEVEVLTCLSNEPMHS--GYQELGTTPEYSSKTDGPDEEKPGVQQNMEFGS-----GYLLSELSENNNETISN
        MEERDE   TESR  +   AVQEAKASVEVEVLT L+NE + S   Y ELGT  +++SKT  PDEEKPGV+QNME  S     G   SEL E +++TIS 
Subjt:  MEERDENTDTESRHNNNVEAVQEAKASVEVEVLTCLSNEPMHS--GYQELGTTPEYSSKTDGPDEEKPGVQQNMEFGS-----GYLLSELSENNNETISN

Query:  HADNDQVEAGNLLSSDKDTENLKLPIEVGATTLLNECAELPVEDVNKNYIELMNPPIEDLTQNTSIQKLEIVPSNSQQLGHKDKRILKSKKKNYKLRSLV
         A+NDQ EAGNLLSSDKDTENL LPIEV  T LLNEC+E P ED NKNYIE  NPPIE   QNTSI+ L +VP NS +LG KDKR+LKSKKKNY LRSLV
Subjt:  HADNDQVEAGNLLSSDKDTENLKLPIEVGATTLLNECAELPVEDVNKNYIELMNPPIEDLTQNTSIQKLEIVPSNSQQLGHKDKRILKSKKKNYKLRSLV

Query:  NSDRVLRSRTQEKAKAPEPSNDLNNFTA-EEGKRKKRKKKRNIQGKGARVDEYSSIKNHLRYLLNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRAS
        +SDRVLRSRTQ+KAKAPEPSNDL+N TA EEGK KKRKK R I+GKGARVDE+SSI+NHLRYL+NRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRAS
Subjt:  NSDRVLRSRTQEKAKAPEPSNDLNNFTA-EEGKRKKRKKKRNIQGKGARVDEYSSIKNHLRYLLNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRAS

Query:  NEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTEIPPDDEGWLCPGCDCKDD
        NEIMRRKLKIRDLFQRIDALC+EGR SE+LFDSEGQIDSEDIFC KCGSKELSLENDIILCDG+CDRGFHQFCLEPPLLN++IPPDDEGWLCPGCDCKDD
Subjt:  NEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTEIPPDDEGWLCPGCDCKDD

Query:  CLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNESSSDESSCGQSNSDTSGY--ASASEGLEVPPPDDQY
        C+DLLNEFQGSNLSITDGWEKV+PEAAAAAAGR+SDHT+ LPSDDS+DGDYDPDVPD IDQD ESSSD SS  QS+SD SGY  ASASE LE PP DDQY
Subjt:  CLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNESSSDESSCGQSNSDTSGY--ASASEGLEVPPPDDQY

Query:  LGLPSDDSEDDDYDPSVPELDEGVRRESSSSDFTSDSEDLAALDNNRSSKDDDLVSSSLNNTMSLKNSNGQSSGCRPSKSALHNELSSLLESGPDKDGLE
        LGLPSDDSEDDDYDP  P  DEGV +ESSSSDFTSDSEDLAAL +N SSKDD++ SS LNNT  ++NSNGQSSG  P+K+A HN+LSSL+ SGPD+ GLE
Subjt:  LGLPSDDSEDDDYDPSVPELDEGVRRESSSSDFTSDSEDLAALDNNRSSKDDDLVSSSLNNTMSLKNSNGQSSGCRPSKSALHNELSSLLESGPDKDGLE

Query:  PVLGRRQVERLDYKKLHDETYGSVPTDSSDDTYGSTSMDSSDDRGWDSSTRHRGPKKLVLALSNNGTNDDLTNVKTKRSYKRRTRQKSAAINVNDSVTDT
         V GRR VERLDYKKLHDET+G+VP++SSDDTYGS S+DSSDDRG   STR   PK LV ALS NGT DD  N+KTK S  RRTRQK AA N+++SVT T
Subjt:  PVLGRRQVERLDYKKLHDETYGSVPTDSSDDTYGSTSMDSSDDRGWDSSTRHRGPKKLVLALSNNGTNDDLTNVKTKRSYKRRTRQKSAAINVNDSVTDT

Query:  PVDNAKSSSSVRQTTSSSNRRLSQPALERLFASFQENEYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSGGNRAKSTSRMSIHSSQAS-ELPKN
        P    KSSSSVR+TTSSS+RRLSQP LERL ASFQEN+YPERATKESLA+ELGLSLKQVSKWFENTRWSTRHPSS  N+AKS SRM   SSQ S + PK 
Subjt:  PVDNAKSSSSVRQTTSSSNRRLSQPALERLFASFQENEYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSGGNRAKSTSRMSIHSSQAS-ELPKN

Query:  EQESGACFRDTDNNGAQHQDLPTVNNVVASCQSGDTGDKKSVTQKTKRAESSATKSRKRKSKSDHTASHSKDREESPRPPAKSPKVNEIQTADRFKTRRR
        EQESGACFRDT +NGAQHQ+ P    VVA CQSG TGD K    KTKR ES+ATKSRKRK +SD  AS SK+R++S +PPAKS KV+EIQTAD+ K RRR
Subjt:  EQESGACFRDTDNNGAQHQDLPTVNNVVASCQSGDTGDKKSVTQKTKRAESSATKSRKRKSKSDHTASHSKDREESPRPPAKSPKVNEIQTADRFKTRRR

Query:  RSI
        +S+
Subjt:  RSI

A0A6J1J9X9 homeobox protein HAT3.1-like (e value: 0.0e+00; % identity: 77.53)
Query:  MEERDENTDTESRHNNNVEAVQEAKASVEVEVLTCLSNEPMHSGYQELGTTPEYSSKTDGPDEEKPGVQQNM-----EFGSGYLLSELSENNNETISNHA
        MEERDE   TESR NNN EAVQEAK  VE E+ TCLSNE  H    EL  TP Y++KT GPDEEKP VQQNM     E GSG +LSELSE +N+T SN A
Subjt:  MEERDENTDTESRHNNNVEAVQEAKASVEVEVLTCLSNEPMHSGYQELGTTPEYSSKTDGPDEEKPGVQQNM-----EFGSGYLLSELSENNNETISNHA

Query:  DNDQVEAGNLLSSDKDTENLKLPIEVGATTLLNECAELPVEDVNKNYIELMNPPIEDLTQNTSIQKLEIVPSNSQQLGHKDKRILKSKKKNYKLRSLVNS
        DNDQVEAGNLL  DKDTENL +PIEV  TTLL +C+ELP E VNKNYIE MNPPIE+LTQNT  QKLE VPSNS+Q  HKDKRILKS K N  LRSLV+S
Subjt:  DNDQVEAGNLLSSDKDTENLKLPIEVGATTLLNECAELPVEDVNKNYIELMNPPIEDLTQNTSIQKLEIVPSNSQQLGHKDKRILKSKKKNYKLRSLVNS

Query:  DRVLRSRTQEKAKAPEPSNDLNNFTAEEGKRKKRKKKRNIQGKGARVDEYSSIKNHLRYLLNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEI
        DR +RS+TQEK K PEPSNDLNNFTAEEGK K  KK+RNIQGKGARVDE+SSI+NHLRYLLNRI YEQ+LIEAYSSEGWKGFSSDKLKPEKELQRASNEI
Subjt:  DRVLRSRTQEKAKAPEPSNDLNNFTAEEGKRKKRKKKRNIQGKGARVDEYSSIKNHLRYLLNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEI

Query:  MRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTEIPPDDEGWLCPGCDCKDDCLD
        MRRKLKIRD+FQRIDALC EG LS+SLFDS+GQI SEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNT+IP DDEGWLCPGCDCKDDCL+
Subjt:  MRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTEIPPDDEGWLCPGCDCKDDCLD

Query:  LLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNESSSDESSCGQSNSDTSGYASASEGLEVPPPDDQYLGLPS
        LLNEFQGS LSITDGWEKVYPEAAA+AAGRN DH LGLPSDDSED DYDPDVPDTI QD++SS          S+TSGYASASE LE  P  DQYLGLPS
Subjt:  LLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNESSSDESSCGQSNSDTSGYASASEGLEVPPPDDQYLGLPS

Query:  DDSEDDDYDPSVPELDEGVRRESSSSDFTSDSEDLAALDNNRSSKDDDLVSSSLNNTMSLKNSNGQSSGCRPSKSALHNELSSLLESGPDKDGLEPVLGR
        DDSEDDDYDPS PE DE VR+ESSSSDFTSDSEDLAALD+N SSK D+LVSSSLNNT S+KN +G+SSG  P KS+L+NELSSLLESGPDKDG EPVLGR
Subjt:  DDSEDDDYDPSVPELDEGVRRESSSSDFTSDSEDLAALDNNRSSKDDDLVSSSLNNTMSLKNSNGQSSGCRPSKSALHNELSSLLESGPDKDGLEPVLGR

Query:  RQVERLDYKKLHDETYGSVPTDSSDDTYGSTSMDSSDDRGWDSSTRHRGPKKLVLALSNNGTNDDLTNVKTKRSYKRRTRQKSAAINVNDSVTDTPVDNA
        RQVERLDYKKLHDETYG+VPTDSSDDTY S SMDSSDD+GWDS+TR R PK LVLAL N   NDDLTNVKTK S KR TRQK+AA+N+N SVT TP D  
Subjt:  RQVERLDYKKLHDETYGSVPTDSSDDTYGSTSMDSSDDRGWDSSTRHRGPKKLVLALSNNGTNDDLTNVKTKRSYKRRTRQKSAAINVNDSVTDTPVDNA

Query:  KSSSSVRQTTSSSNRRLSQPALERLFASFQENEYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSS-GGNRAKSTSRMSIHSSQAS-ELPKNEQES
        K+SSSVR+TTSSS RRLSQ ALERL ASFQEN+YPERATKESLAQELGLS+KQV+KWF NTRWSTRHPSS  GN+AKS+SRM IHSSQAS EL + E+E 
Subjt:  KSSSSVRQTTSSSNRRLSQPALERLFASFQENEYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSS-GGNRAKSTSRMSIHSSQAS-ELPKNEQES

Query:  GACFRDTDNNGAQHQDLPTVNNVVASCQSGDTGDKKSVTQKTKRAESSATKSRKRKSKSDHTASHSKDREESPRPPAKSPKVNEIQTADRFKTRRRRSI
                  GAQHQ+LPT ++VVA CQSGDTGD K  TQ TKR+E SA KSRKRK +SDH AS SKD +ES RPPAKSPKVNEIQTA   KTRRR S+
Subjt:  GACFRDTDNNGAQHQDLPTVNNVVASCQSGDTGDKKSVTQKTKRAESSATKSRKRKSKSDHTASHSKDREESPRPPAKSPKVNEIQTADRFKTRRRRSI

SwissProt top hits (e value, % identity, alignment)
P46605 Homeobox protein HOX1A (e value: 1.6e-102; % identity: 39.01)
Query:  AELPVE---DVNKNYIELMNPPIEDLTQNTSIQKLEIVPSNSQQLGHKDKRILKSKK-------KNYKLRSLVNSDRVLRSRTQEKAKAPEPSNDLNNFT
        A  PVE   ++        NP  E L  +  +   + +P+N     +  +   + KK       + Y L S  +  RVLRS +  K  + E      +  
Subjt:  AELPVE---DVNKNYIELMNPPIEDLTQNTSIQKLEIVPSNSQQLGHKDKRILKSKK-------KNYKLRSLVNSDRVLRSRTQEKAKAPEPSNDLNNFT

Query:  AEEGKRKKRKKKRNIQGKGARVDEYSSIKNHLRYLLNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSE
        A      KR+K      K +  DE+S I+  +RY+LNR+ YEQSLIEAY+SEGWK  S DK++PEKEL+RA +EI+R KL+IR++F+ ID+L ++G++ E
Subjt:  AEEGKRKKRKKKRNIQGKGARVDEYSSIKNHLRYLLNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSE

Query:  SLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTEIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAA
        +LFDSEG+I  EDIFC+ CGS + +L NDIILCDG CDRGFHQ CL PPL   +IP  DEGWLCP CDCK DC+DL+NE  GSN+SI D WEKV+P+AAA
Subjt:  SLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTEIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAA

Query:  AAAGRNSDHTLGLPSDDSEDGDYDPDVPDT-IDQDNESSSDESSCGQSNSDTSGYASASEGLEVPPPDDQY--LGLPSDDSEDDDYDPSVPELDEGVRRE
         A     D    LPSDDS+D D+DP++P+  +   +E SS+E   G S+SD S + + S+  E P  D +   L LPS+DSEDDDYDP+ P+ D+ V ++
Subjt:  AAAGRNSDHTLGLPSDDSEDGDYDPDVPDT-IDQDNESSSDESSCGQSNSDTSGYASASEGLEVPPPDDQY--LGLPSDDSEDDDYDPSVPELDEGVRRE

Query:  SSS--SDFTSDSEDLAALDNNRSSKDDDLVSSSLNNTMSLKNSNGQSSGCRPSKSALHNELSSLLESGPDKDGLEPVLGRRQVERLDYKKLHDETYGSVP
        SSS  SDFTSDS+D        S    D VSS L     + +    ++  + + SA        +E+  D+  + P   RRQ ERLDYKKL+DE YG   
Subjt:  SSS--SDFTSDSEDLAALDNNRSSKDDDLVSSSLNNTMSLKNSNGQSSGCRPSKSALHNELSSLLESGPDKDGLEPVLGRRQVERLDYKKLHDETYGSVP

Query:  TDSSDDTYGS---TSMDSSDDRGWDSSTRHRGPKKLVLALSNNGTNDDLTNVKTKRSYKRRTRQKSAAINVNDSVTDTPVDNAKSSSSVRQTTSSSNRRL
        +DSSDD   S   T +  S++ G  +S   +G + +         ND+LT   TK+S            +++ SV + P D   + S+     S++ +  
Subjt:  TDSSDDTYGS---TSMDSSDDRGWDSSTRHRGPKKLVLALSNNGTNDDLTNVKTKRSYKRRTRQKSAAINVNDSVTDTPVDNAKSSSSVRQTTSSSNRRL

Query:  SQPAL-ERLFASFQENEYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSS--GGNRAKSTSRMSIHSSQASELPKNE-----QESGACFRDTDNNG
          P + ++L   F+   YP R+ KESLA+ELGL+ +QV+KWFE  R S R  SS  G +  K + + +     AS  PK       +ES  C      NG
Subjt:  SQPAL-ERLFASFQENEYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSS--GGNRAKSTSRMSIHSSQASELPKNE-----QESGACFRDTDNNG

Query:  AQHQDLPTVNNVVASCQSGDT--GDKKSVTQKTKRAESSATKSRKR
                V++ V S   G    G K    +        A K+R++
Subjt:  AQHQDLPTVNNVVASCQSGDT--GDKKSVTQKTKRAESSATKSRKR

P48785 Pathogenesis-related homeodomain protein; e value: 9.3e-50; % identity: 29.18
Query:  KDKRILKSKKKNYKLRSLVNSDRVLRSRTQEKAKAPEPSNDLNNFTAEEGKRKKRKKKRNIQGKGARVDEYSSIKNHLRYLLNRIKYEQSLIEAYSSEGW
        K  + + +K+ + + +     +   +SRT++ ++      ++     +  K +KRK KR  +     VD+   ++   RYLL ++K +Q+LI+AY++EGW
Subjt:  KDKRILKSKKKNYKLRSLVNSDRVLRSRTQEKAKAPEPSNDLNNFTAEEGKRKKRKKKRNIQGKGARVDEYSSIKNHLRYLLNRIKYEQSLIEAYSSEGW

Query:  KGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTE
        KG S +K++P+KEL+RA  EI+  KL +RD  +++D L + G + E +  S+G I  + IFCA+C S+E   +NDIILCDG C+R FHQ CL+PPL    
Subjt:  KGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTE

Query:  IPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNESSSDESSCGQSNSDTSGY
        IPP D+GW C  CDCK + +D +N   G++  +   W+ ++ E A+   G  S+ T+   +D                                      
Subjt:  IPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNESSSDESSCGQSNSDTSGY

Query:  ASASEGLEVPPPDDQYLGLPSDDSEDDDYDPSVPELDEGVRRESSSSDFTSDSEDLAALDNNRSSKDDDLVSSSLN-NTMSLKNSNGQSSGCRPSKSALH
                           PSDDS+DDDYDP + E   G     +SS+ + D              D++ +S+SL+ ++  +  S G   G R       
Subjt:  ASASEGLEVPPPDDQYLGLPSDDSEDDDYDPSVPELDEGVRRESSSSDFTSDSEDLAALDNNRSSKDDDLVSSSLN-NTMSLKNSNGQSSGCRPSKSALH

Query:  NELSSLLESGPDKDGLEPVLGRRQVERLDYKKLHDETYGSVPTDSSDDTYGSTSMDSSDDRGWDSSTRHRGPKKLVLALSNNGTNDDLTNVKTKRSYKRR
          LS+++E   +    E V G RQ   +DY +L+ E +G    D+     G      S+D  W  + R +  ++     S+ G+    T V    S K+ 
Subjt:  NELSSLLESGPDKDGLEPVLGRRQVERLDYKKLHDETYGSVPTDSSDDTYGSTSMDSSDDRGWDSSTRHRGPKKLVLALSNNGTNDDLTNVKTKRSYKRR

Query:  TRQKSAAINVNDSVTDTPVDNAKSSSSVRQTTSSSNR-RLSQPALERLFASFQENEYPERATKESLAQELGLSLKQVSKWFENTRW
                  +  V +T   + + S SV          RL + A+E+L   F E E P +A ++ LA+EL L  ++V+KWF+NTR+
Subjt:  TRQKSAAINVNDSVTDTPVDNAKSSSSVRQTTSSSNR-RLSQPALERLFASFQENEYPERATKESLAQELGLSLKQVSKWFENTRW

P48786 Pathogenesis-related homeodomain protein; e value: 5.5e-119; % identity: 45.05
Query:  VPSNSQQLGHKDKRILKSKKKNYKLRSLVNSDRVLRSRTQEKAKAPEPSNDLNNFTAEEG-KRKKRKKKRNIQGKGARVDEYSSIKNHLRYLLNRIKYEQ
        V  +   LG   K + +  K + +L   VNS R LRSR+QEK+  P    D+NN  A+EG  R+K +KKR  + +  RVDE+  I+ HLRYLL+RIKYE+
Subjt:  VPSNSQQLGHKDKRILKSKKKNYKLRSLVNSDRVLRSRTQEKAKAPEPSNDLNNFTAEEG-KRKKRKKKRNIQGKGARVDEYSSIKNHLRYLLNRIKYEQ

Query:  SLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQ
        + ++AYS EGWKG S DK+KPEKEL+RA  EI  RKLKIRDLFQR+D   +EGRL E LFDS G+IDSEDIFCAKCGSK+++L NDIILCDG CDRGFHQ
Subjt:  SLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQ

Query:  FCLEPPLLNTEIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVY-PEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNESSSDES
        FCL+PPLL   IPPDDEGWLCPGC+CK DC+ LLN+ Q +N+ + D WEKV+  EAAAAA+G+N D   GLPSDDSED DYDP  PD    D +   D+S
Subjt:  FCLEPPLLNTEIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVY-PEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNESSSDES

Query:  SCGQSNSDTSGYASASEGLEVPPPDDQYLGLPSDDSEDDDYDPSVPELDEGVRRESSSSDFTSDSEDLAALDNNRSSKDDDLVSSSLNNTMSLKNSNGQS
        S     +D S Y S S+ ++V    +   GLPSDDSEDD+YDPS    D+ + ++SS SDFTSDSED   + ++   KD       L +T     +N + 
Subjt:  SCGQSNSDTSGYASASEGLEVPPPDDQYLGLPSDDSEDDDYDPSVPELDEGVRRESSSSDFTSDSEDLAALDNNRSSKDDDLVSSSLNNTMSLKNSNGQS

Query:  SGCRPSKSALHNELSSLLESGPDKDGLEPVLGRRQVERLDYKKLHDETYGS--------------VPTDSSDDTYGSTSMDSSD-DRGWDSSTRHRGPKK
         G                   P++    P+  RRQVE LDYKKL+D  +                +    + + YG+TS DSSD D    SS       K
Subjt:  SGCRPSKSALHNELSSLLESGPDKDGLEPVLGRRQVERLDYKKLHDETYGS--------------VPTDSSDDTYGSTSMDSSD-DRGWDSSTRHRGPKK

Query:  LVLALSNNGTNDDL-TNVKTKRSYKRRTRQKSAAINVNDSVTDTPVDNAKSSSSVRQTTSSSNRRLSQPALERLFASFQENEYPERATKESLAQELGLSL
           A+     + DL  + K + S   R   K  A+   DS      ++  S++ V  + S+S     + A +RL  SF+EN+YP+RA KESLA EL LS+
Subjt:  LVLALSNNGTNDDL-TNVKTKRSYKRRTRQKSAAINVNDSVTDTPVDNAKSSSSVRQTTSSSNRRLSQPALERLFASFQENEYPERATKESLAQELGLSL

Query:  KQVSKWFENTRWSTRHPSSGGN-----------RAKS------TSRMSIHSSQASELPKNEQESGA
        +QVS WF N RWS RH S  G+           R KS      + +  + S+  SE+ K EQ++ +
Subjt:  KQVSKWFENTRWSTRHPSSGGN-----------RAKS------TSRMSIHSSQASELPKNEQESGA

Q04996 Homeobox protein HAT3.1; e value: 1.4e-117; % identity: 47.8
Query:  RTQEKAKAPEPSNDLNNFTAEEGKRKKRKKKRNIQGKGARVDEYSSIKNHLRYLLNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLK
        R Q   +   PS+ + N T   G+ KK+ K  N +G+    DEY+ IK  LRY LNRI YEQSLI+AYS EGWKG S +K++PEKEL+RA+ EI+RRKLK
Subjt:  RTQEKAKAPEPSNDLNNFTAEEGKRKKRKKKRNIQGKGARVDEYSSIKNHLRYLLNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLK

Query:  IRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTEIPPDDEGWLCPGCDCKDDCLDLLNEFQ
        IRDLFQ +D LCAEG L ESLFD++G+I SEDIFCAKCGSK+LS++NDIILCDG CDRGFHQ+CLEPPL   +IPPDDEGWLCPGCDCKDD LDLLN+  
Subjt:  IRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTEIPPDDEGWLCPGCDCKDDCLDLLNEFQ

Query:  GSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNESSSD--ESSCGQSNSDTSGYASASEGL-----EVPPPDDQYLGLP
        G+  S++D WEK++PEAAAA  G   +    LPSDDS+D +YDPD  +  + D + S D  ES     +SD + + SAS+ +     E        + LP
Subjt:  GSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNESSSD--ESSCGQSNSDTSGYASASEGL-----EVPPPDDQYLGLP

Query:  SDDSEDDDYDPSVPELDEGVRRESSSSDFTSDSEDLAALDNNRSSKDDDLVSSSLNNTMSLKNSNGQSSGCRPSKSALHNELSSLLES--GPDKDGLEPV
        SDDSEDDDYDP  P  D+   +ESS+SD TSD+EDL       S K D+    + +  +             P +     +  ++LES  G D DG   V
Subjt:  SDDSEDDDYDPSVPELDEGVRRESSSSDFTSDSEDLAALDNNRSSKDDDLVSSSLNNTMSLKNSNGQSSGCRPSKSALHNELSSLLES--GPDKDGLEPV

Query:  LGRRQVERLDYKKLHDETYGSVPTDSSDDTYGSTSMDSSDDRGWDSSTRHRGPKKLVLALSNNGTNDDLTNVKTKRSYKRRTRQKSAAINVNDSVTDTPV
          RR VERLDYKKL+DE Y +VPT SSDD       D +   G + S        + L  S+N   +D T+ K  R  KR  ++ +      +   + P 
Subjt:  LGRRQVERLDYKKLHDETYGSVPTDSSDDTYGSTSMDSSDDRGWDSSTRHRGPKKLVLALSNNGTNDDLTNVKTKRSYKRRTRQKSAAINVNDSVTDTPV

Query:  DNAKSSSSVRQTTSSSNRRLSQPALERLFASFQENEYPERATKESLAQELGLSLKQVSKWFENTRWS
        +N  S     + +SSS  + + P  +RL+ SFQEN+YP++ATKESLA+EL +++KQV+ WF++ RWS
Subjt:  DNAKSSSSVRQTTSSSNRRLSQPALERLFASFQENEYPERATKESLAQELGLSLKQVSKWFENTRWS

Q8H991 Homeobox protein HAZ1; e value: 1.0e-104; % identity: 43.21
Query:  IVPSNSQQLGHKDKRILKSKKKNYKLRSLVNSDRVLRSRTQEKAKAPEPSNDLNNFTAEEGKRKKRKKKRNIQGKGARVDEYSSIKNHLRYLLNRIKYEQ
        IVP+N        +R+ K +K++  LR      RVLRS +++K KA   +  LN+    +   KKRK  R  +G G   D+Y  I+  +RY+LNR+ YEQ
Subjt:  IVPSNSQQLGHKDKRILKSKKKNYKLRSLVNSDRVLRSRTQEKAKAPEPSNDLNNFTAEEGKRKKRKKKRNIQGKGARVDEYSSIKNHLRYLLNRIKYEQ

Query:  SLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQ
        SLI+AY+SEGWKG S +K++PEKEL+RA  EI+R K +IR+ F+ +D+L +EG+L ES+FDS G+I SEDIFCA CGSK+++L+NDIILCDGICDRGFHQ
Subjt:  SLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQ

Query:  FCLEPPLLNTEIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNESSSDESS
        +CL PPLL  +IP  DEGWLCP CDCK DC+D+LNE QG  LSI D WEKV+PEAA+   G        LPSDDS D DYDP +      D E SS E  
Subjt:  FCLEPPLLNTEIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNESSSDESS

Query:  C-GQSNSDTSGYASASEGLEVPPPDD-----QYLGLPSDDSEDDDYDPSVPELDEGVRRESSS-----SDFTSDSEDLAALDNNRSSKDDDLVSSSLNNT
          G  + D+S   S S   E             LGLPS+DSED D+DP+ P+ D+    ES+S     SDFTSDS+D  A +  +S   D++   S +  
Subjt:  C-GQSNSDTSGYASASEGLEVPPPDD-----QYLGLPSDDSEDDDYDPSVPELDEGVRRESSS-----SDFTSDSEDLAALDNNRSSKDDDLVSSSLNNT

Query:  MSLKNSNGQSSGCRPSKSALHNELSSLLESGPDKDGLEPVLGRRQVERLDYKKLHDETYGSVPTDSSDDT--YGSTSMDSSDDRGWDSSTRHRGPKKLVL
         ++  ++G      P+     N   + +E+  ++D + P+  +RQVERLDYKKL++E YG   +DSSDD   YG+++ +  +    DS T         L
Subjt:  MSLKNSNGQSSGCRPSKSALHNELSSLLESGPDKDGLEPVLGRRQVERLDYKKLHDETYGSVPTDSSDDT--YGSTSMDSSDDRGWDSSTRHRGPKKLVL

Query:  ALSNNGTNDDLTNVKTKRSYKRRTRQKSAAINVNDSVTDTPVDNAKSSSSVRQTTSSSNRRLSQPALERLFASFQENEYPERATKESLAQELGLSLKQVS
        A S  G          +      T Q    +    SV+D   +   S+S+    +++ NR       ++L A F+E+ YP RATKE+LAQELGL+  QV+
Subjt:  ALSNNGTNDDLTNVKTKRSYKRRTRQKSAAINVNDSVTDTPVDNAKSSSSVRQTTSSSNRRLSQPALERLFASFQENEYPERATKESLAQELGLSLKQVS

Query:  KWFENTRWSTR
        KWF +TR   R
Subjt:  KWFENTRWSTR

Arabidopsis top hits (e value; % identity; alignment)
AT3G19510.1 Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain; e value: 9.6e-119; % identity: 47.8
Query:  RTQEKAKAPEPSNDLNNFTAEEGKRKKRKKKRNIQGKGARVDEYSSIKNHLRYLLNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLK
        R Q   +   PS+ + N T   G+ KK+ K  N +G+    DEY+ IK  LRY LNRI YEQSLI+AYS EGWKG S +K++PEKEL+RA+ EI+RRKLK
Subjt:  RTQEKAKAPEPSNDLNNFTAEEGKRKKRKKKRNIQGKGARVDEYSSIKNHLRYLLNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLK

Query:  IRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTEIPPDDEGWLCPGCDCKDDCLDLLNEFQ
        IRDLFQ +D LCAEG L ESLFD++G+I SEDIFCAKCGSK+LS++NDIILCDG CDRGFHQ+CLEPPL   +IPPDDEGWLCPGCDCKDD LDLLN+  
Subjt:  IRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTEIPPDDEGWLCPGCDCKDDCLDLLNEFQ

Query:  GSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNESSSD--ESSCGQSNSDTSGYASASEGL-----EVPPPDDQYLGLP
        G+  S++D WEK++PEAAAA  G   +    LPSDDS+D +YDPD  +  + D + S D  ES     +SD + + SAS+ +     E        + LP
Subjt:  GSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNESSSD--ESSCGQSNSDTSGYASASEGL-----EVPPPDDQYLGLP

Query:  SDDSEDDDYDPSVPELDEGVRRESSSSDFTSDSEDLAALDNNRSSKDDDLVSSSLNNTMSLKNSNGQSSGCRPSKSALHNELSSLLES--GPDKDGLEPV
        SDDSEDDDYDP  P  D+   +ESS+SD TSD+EDL       S K D+    + +  +             P +     +  ++LES  G D DG   V
Subjt:  SDDSEDDDYDPSVPELDEGVRRESSSSDFTSDSEDLAALDNNRSSKDDDLVSSSLNNTMSLKNSNGQSSGCRPSKSALHNELSSLLES--GPDKDGLEPV

Query:  LGRRQVERLDYKKLHDETYGSVPTDSSDDTYGSTSMDSSDDRGWDSSTRHRGPKKLVLALSNNGTNDDLTNVKTKRSYKRRTRQKSAAINVNDSVTDTPV
          RR VERLDYKKL+DE Y +VPT SSDD       D +   G + S        + L  S+N   +D T+ K  R  KR  ++ +      +   + P 
Subjt:  LGRRQVERLDYKKLHDETYGSVPTDSSDDTYGSTSMDSSDDRGWDSSTRHRGPKKLVLALSNNGTNDDLTNVKTKRSYKRRTRQKSAAINVNDSVTDTPV

Query:  DNAKSSSSVRQTTSSSNRRLSQPALERLFASFQENEYPERATKESLAQELGLSLKQVSKWFENTRWS
        +N  S     + +SSS  + + P  +RL+ SFQEN+YP++ATKESLA+EL +++KQV+ WF++ RWS
Subjt:  DNAKSSSSVRQTTSSSNRRLSQPALERLFASFQENEYPERATKESLAQELGLSLKQVSKWFENTRWS

AT4G29940.1 pathogenesis related homeodomain protein A; e value: 6.6e-51; % identity: 29.18
Query:  KDKRILKSKKKNYKLRSLVNSDRVLRSRTQEKAKAPEPSNDLNNFTAEEGKRKKRKKKRNIQGKGARVDEYSSIKNHLRYLLNRIKYEQSLIEAYSSEGW
        K  + + +K+ + + +     +   +SRT++ ++      ++     +  K +KRK KR  +     VD+   ++   RYLL ++K +Q+LI+AY++EGW
Subjt:  KDKRILKSKKKNYKLRSLVNSDRVLRSRTQEKAKAPEPSNDLNNFTAEEGKRKKRKKKRNIQGKGARVDEYSSIKNHLRYLLNRIKYEQSLIEAYSSEGW

Query:  KGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTE
        KG S +K++P+KEL+RA  EI+  KL +RD  +++D L + G + E +  S+G I  + IFCA+C S+E   +NDIILCDG C+R FHQ CL+PPL    
Subjt:  KGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTE

Query:  IPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNESSSDESSCGQSNSDTSGY
        IPP D+GW C  CDCK + +D +N   G++  +   W+ ++ E A+   G  S+ T+   +D                                      
Subjt:  IPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNESSSDESSCGQSNSDTSGY

Query:  ASASEGLEVPPPDDQYLGLPSDDSEDDDYDPSVPELDEGVRRESSSSDFTSDSEDLAALDNNRSSKDDDLVSSSLN-NTMSLKNSNGQSSGCRPSKSALH
                           PSDDS+DDDYDP + E   G     +SS+ + D              D++ +S+SL+ ++  +  S G   G R       
Subjt:  ASASEGLEVPPPDDQYLGLPSDDSEDDDYDPSVPELDEGVRRESSSSDFTSDSEDLAALDNNRSSKDDDLVSSSLN-NTMSLKNSNGQSSGCRPSKSALH

Query:  NELSSLLESGPDKDGLEPVLGRRQVERLDYKKLHDETYGSVPTDSSDDTYGSTSMDSSDDRGWDSSTRHRGPKKLVLALSNNGTNDDLTNVKTKRSYKRR
          LS+++E   +    E V G RQ   +DY +L+ E +G    D+     G      S+D  W  + R +  ++     S+ G+    T V    S K+ 
Subjt:  NELSSLLESGPDKDGLEPVLGRRQVERLDYKKLHDETYGSVPTDSSDDTYGSTSMDSSDDRGWDSSTRHRGPKKLVLALSNNGTNDDLTNVKTKRSYKRR

Query:  TRQKSAAINVNDSVTDTPVDNAKSSSSVRQTTSSSNR-RLSQPALERLFASFQENEYPERATKESLAQELGLSLKQVSKWFENTRW
                  +  V +T   + + S SV          RL + A+E+L   F E E P +A ++ LA+EL L  ++V+KWF+NTR+
Subjt:  TRQKSAAINVNDSVTDTPVDNAKSSSSVRQTTSSSNR-RLSQPALERLFASFQENEYPERATKESLAQELGLSLKQVSKWFENTRW

AT4G29940.2 pathogenesis related homeodomain protein A; e value: 6.6e-51; % identity: 29.18
Query:  KDKRILKSKKKNYKLRSLVNSDRVLRSRTQEKAKAPEPSNDLNNFTAEEGKRKKRKKKRNIQGKGARVDEYSSIKNHLRYLLNRIKYEQSLIEAYSSEGW
        K  + + +K+ + + +     +   +SRT++ ++      ++     +  K +KRK KR  +     VD+   ++   RYLL ++K +Q+LI+AY++EGW
Subjt:  KDKRILKSKKKNYKLRSLVNSDRVLRSRTQEKAKAPEPSNDLNNFTAEEGKRKKRKKKRNIQGKGARVDEYSSIKNHLRYLLNRIKYEQSLIEAYSSEGW

Query:  KGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTE
        KG S +K++P+KEL+RA  EI+  KL +RD  +++D L + G + E +  S+G I  + IFCA+C S+E   +NDIILCDG C+R FHQ CL+PPL    
Subjt:  KGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTE

Query:  IPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNESSSDESSCGQSNSDTSGY
        IPP D+GW C  CDCK + +D +N   G++  +   W+ ++ E A+   G  S+ T+   +D                                      
Subjt:  IPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNESSSDESSCGQSNSDTSGY

Query:  ASASEGLEVPPPDDQYLGLPSDDSEDDDYDPSVPELDEGVRRESSSSDFTSDSEDLAALDNNRSSKDDDLVSSSLN-NTMSLKNSNGQSSGCRPSKSALH
                           PSDDS+DDDYDP + E   G     +SS+ + D              D++ +S+SL+ ++  +  S G   G R       
Subjt:  ASASEGLEVPPPDDQYLGLPSDDSEDDDYDPSVPELDEGVRRESSSSDFTSDSEDLAALDNNRSSKDDDLVSSSLN-NTMSLKNSNGQSSGCRPSKSALH

Query:  NELSSLLESGPDKDGLEPVLGRRQVERLDYKKLHDETYGSVPTDSSDDTYGSTSMDSSDDRGWDSSTRHRGPKKLVLALSNNGTNDDLTNVKTKRSYKRR
          LS+++E   +    E V G RQ   +DY +L+ E +G    D+     G      S+D  W  + R +  ++     S+ G+    T V    S K+ 
Subjt:  NELSSLLESGPDKDGLEPVLGRRQVERLDYKKLHDETYGSVPTDSSDDTYGSTSMDSSDDRGWDSSTRHRGPKKLVLALSNNGTNDDLTNVKTKRSYKRR

Query:  TRQKSAAINVNDSVTDTPVDNAKSSSSVRQTTSSSNR-RLSQPALERLFASFQENEYPERATKESLAQELGLSLKQVSKWFENTRW
                  +  V +T   + + S SV          RL + A+E+L   F E E P +A ++ LA+EL L  ++V+KWF+NTR+
Subjt:  TRQKSAAINVNDSVTDTPVDNAKSSSSVRQTTSSSNR-RLSQPALERLFASFQENEYPERATKESLAQELGLSLKQVSKWFENTRW

AT5G09790.1 ARABIDOPSIS TRITHORAX-RELATED PROTEIN 5; e value: 5.5e-05; % identity: 38.24
Query:  DSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTEIPPDDEGWLCPGCDCKD
        + E +    ++ C KCGS E   +++++LCD  CDRGFH  CL P ++   I      WLC   DC D
Subjt:  DSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTEIPPDDEGWLCPGCDCKD

AT5G09790.2 ARABIDOPSIS TRITHORAX-RELATED PROTEIN 5; e value: 5.5e-05; % identity: 38.24
Query:  DSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTEIPPDDEGWLCPGCDCKD
        + E +    ++ C KCGS E   +++++LCD  CDRGFH  CL P ++   I      WLC   DC D
Subjt:  DSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTEIPPDDEGWLCPGCDCKD
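The % identity values listed for each hit can be checked against an alignment block. The sketch below is a minimal illustration, assuming the unlabelled middle line follows the usual BLAST midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and that identity is taken over all alignment columns. The function name `percent_identity` is hypothetical and not part of any CuGenDB tool.

```python
def percent_identity(query: str, midline: str) -> float:
    """Return percent identity over all alignment columns.

    Assumes BLAST-style midline: letters are identities; '+' and ' ' are not.
    """
    if not query or len(query) != len(midline):
        raise ValueError("query and midline must be non-empty and equally long")
    # Count only alphabetic midline characters, i.e. identical residues.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# Query line and midline of the AT5G09790.1 alignment, without the
# leading "Query:  " label and its matching indentation.
query = "DSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTEIPPDDEGWLCPGCDCKD"
midline = "+ E +    ++ C KCGS E   +++++LCD  CDRGFH  CL P ++   I      WLC   DC D"

print(round(percent_identity(query, midline), 2))  # 38.24 (26 of 68 columns)
```

For alignments that span several blocks, the query and midline segments would need to be concatenated in order before calling the function.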


Sequences
CDS sequence
ATGGAAGAAAGAGATGAAAATACTGATACAGAATCAAGACATAATAATAATGTTGAAGCCGTACAAGAAGCCAAGGCCAGTGTTGAAGTTGAAGTGCTAACTTGTCTTTC
CAATGAGCCAATGCATTCAGGTTATCAGGAATTGGGAACAACTCCAGAATATTCCAGCAAAACTGACGGTCCAGATGAAGAAAAACCAGGGGTCCAGCAGAATATGGAAT
TTGGTTCGGGATATTTGCTTAGTGAGTTATCAGAAAATAATAATGAGACCATCTCTAACCATGCTGATAATGATCAAGTTGAAGCTGGCAATTTATTATCTAGTGATAAA
GATACTGAGAATTTAAAATTACCTATTGAAGTTGGGGCAACGACTCTTCTTAATGAGTGCGCGGAACTTCCAGTTGAAGATGTCAACAAAAATTATATTGAACTGATGAA
CCCTCCTATTGAAGATTTAACTCAAAATACTTCTATCCAAAAGTTAGAAATAGTCCCCAGTAATTCCCAACAATTAGGACACAAGGATAAGAGAATTTTGAAATCAAAGA
AGAAAAATTATAAGTTAAGGTCCCTGGTTAATAGTGACAGAGTTTTGCGTTCAAGGACCCAAGAGAAAGCTAAAGCTCCTGAACCAAGTAACGACTTGAATAATTTTACC
GCTGAAGAGGGAAAAAGGAAGAAGAGGAAGAAGAAGAGAAATATACAAGGAAAGGGAGCAAGAGTTGATGAGTACTCATCAATTAAGAATCATTTGAGATATTTACTGAA
CCGCATCAAATATGAACAGAGCTTGATTGAAGCTTATTCTAGTGAAGGCTGGAAAGGGTTCAGCTCAGATAAATTGAAGCCCGAAAAGGAACTTCAGCGGGCATCAAATG
AAATAATGCGACGCAAACTGAAAATAAGAGATCTATTTCAACGTATCGATGCACTTTGTGCTGAAGGGAGGCTTTCTGAATCTTTATTCGATTCTGAAGGACAGATAGAC
AGTGAGGATATATTCTGTGCAAAATGTGGATCCAAAGAACTGTCCCTTGAGAATGACATCATATTATGTGATGGTATTTGTGATCGTGGGTTCCATCAGTTCTGTTTAGA
ACCACCTTTGCTAAATACAGAAATTCCACCGGATGATGAGGGATGGCTATGCCCTGGATGTGATTGCAAAGATGACTGCTTGGATCTGCTCAATGAATTTCAAGGATCAA
ATCTTTCTATTACTGATGGTTGGGAGAAAGTCTATCCTGAGGCTGCAGCAGCAGCTGCTGGACGAAATTCTGATCACACCTTAGGTCTTCCTTCAGATGATTCTGAAGAT
GGCGATTATGATCCTGATGTTCCAGATACTATTGACCAGGACAATGAATCGAGTTCTGATGAATCAAGTTGTGGTCAATCAAATTCTGATACATCTGGGTATGCTTCTGC
TTCTGAGGGATTGGAGGTTCCACCTCCTGATGACCAGTACTTAGGTCTCCCTTCTGATGACTCGGAGGATGATGACTATGATCCCAGTGTTCCAGAACTTGATGAAGGTG
TTAGACGGGAAAGTTCAAGTTCTGACTTTACATCTGATTCTGAGGATCTAGCTGCACTTGACAATAACCGCTCTTCCAAAGATGATGACCTTGTGTCTTCTTCACTAAAT
AATACAATGTCTTTGAAAAACTCTAATGGGCAAAGTTCTGGATGCCGTCCTAGCAAGAGTGCACTACATAATGAGTTATCAAGTCTACTAGAGTCCGGTCCTGATAAGGA
TGGTCTTGAACCTGTTTTGGGAAGAAGACAGGTTGAACGGTTGGATTACAAGAAGCTCCATGATGAGACATACGGGAGTGTTCCTACCGACTCAAGCGATGACACGTACG
GGAGTACTTCTATGGACTCAAGTGATGATAGAGGCTGGGATAGTAGTACAAGGCATAGAGGTCCTAAAAAACTGGTTCTTGCATTGTCAAACAATGGAACTAATGATGAT
TTGACTAATGTAAAAACTAAACGCAGCTATAAGAGGAGAACTCGTCAAAAATCAGCTGCTATAAATGTGAATGATTCTGTGACTGATACTCCTGTAGACAATGCAAAATC
TAGTTCTTCTGTTAGGCAAACCACATCATCATCAAACAGAAGACTCAGTCAACCTGCATTGGAGAGACTTTTTGCATCATTCCAAGAAAATGAGTATCCTGAACGAGCTA
CAAAGGAGAGTTTGGCACAAGAACTAGGGCTCAGTCTAAAGCAGGTTAGCAAATGGTTTGAGAACACACGATGGAGCACACGCCATCCCTCAAGCGGTGGTAATAGAGCA
AAGAGTACCTCAAGGATGAGCATTCATTCATCTCAGGCAAGTGAACTACCCAAAAATGAGCAAGAATCTGGTGCATGTTTCAGAGATACCGATAACAATGGTGCTCAACA
TCAAGACTTACCAACAGTAAATAATGTAGTGGCCTCATGTCAGAGTGGGGATACAGGGGATAAGAAATCGGTGACTCAGAAAACTAAAAGAGCGGAATCTTCTGCCACAA
AATCCAGAAAACGGAAGAGCAAGTCAGATCACACGGCATCACATTCAAAAGACAGGGAGGAATCACCAAGGCCTCCTGCCAAGTCACCTAAAGTTAATGAAATCCAAACA
GCAGATAGGTTTAAGACAAGGAGGAGGAGATCCATTTAG
mRNA sequence
ATGGAAGAAAGAGATGAAAATACTGATACAGAATCAAGACATAATAATAATGTTGAAGCCGTACAAGAAGCCAAGGCCAGTGTTGAAGTTGAAGTGCTAACTTGTCTTTC
CAATGAGCCAATGCATTCAGGTTATCAGGAATTGGGAACAACTCCAGAATATTCCAGCAAAACTGACGGTCCAGATGAAGAAAAACCAGGGGTCCAGCAGAATATGGAAT
TTGGTTCGGGATATTTGCTTAGTGAGTTATCAGAAAATAATAATGAGACCATCTCTAACCATGCTGATAATGATCAAGTTGAAGCTGGCAATTTATTATCTAGTGATAAA
GATACTGAGAATTTAAAATTACCTATTGAAGTTGGGGCAACGACTCTTCTTAATGAGTGCGCGGAACTTCCAGTTGAAGATGTCAACAAAAATTATATTGAACTGATGAA
CCCTCCTATTGAAGATTTAACTCAAAATACTTCTATCCAAAAGTTAGAAATAGTCCCCAGTAATTCCCAACAATTAGGACACAAGGATAAGAGAATTTTGAAATCAAAGA
AGAAAAATTATAAGTTAAGGTCCCTGGTTAATAGTGACAGAGTTTTGCGTTCAAGGACCCAAGAGAAAGCTAAAGCTCCTGAACCAAGTAACGACTTGAATAATTTTACC
GCTGAAGAGGGAAAAAGGAAGAAGAGGAAGAAGAAGAGAAATATACAAGGAAAGGGAGCAAGAGTTGATGAGTACTCATCAATTAAGAATCATTTGAGATATTTACTGAA
CCGCATCAAATATGAACAGAGCTTGATTGAAGCTTATTCTAGTGAAGGCTGGAAAGGGTTCAGCTCAGATAAATTGAAGCCCGAAAAGGAACTTCAGCGGGCATCAAATG
AAATAATGCGACGCAAACTGAAAATAAGAGATCTATTTCAACGTATCGATGCACTTTGTGCTGAAGGGAGGCTTTCTGAATCTTTATTCGATTCTGAAGGACAGATAGAC
AGTGAGGATATATTCTGTGCAAAATGTGGATCCAAAGAACTGTCCCTTGAGAATGACATCATATTATGTGATGGTATTTGTGATCGTGGGTTCCATCAGTTCTGTTTAGA
ACCACCTTTGCTAAATACAGAAATTCCACCGGATGATGAGGGATGGCTATGCCCTGGATGTGATTGCAAAGATGACTGCTTGGATCTGCTCAATGAATTTCAAGGATCAA
ATCTTTCTATTACTGATGGTTGGGAGAAAGTCTATCCTGAGGCTGCAGCAGCAGCTGCTGGACGAAATTCTGATCACACCTTAGGTCTTCCTTCAGATGATTCTGAAGAT
GGCGATTATGATCCTGATGTTCCAGATACTATTGACCAGGACAATGAATCGAGTTCTGATGAATCAAGTTGTGGTCAATCAAATTCTGATACATCTGGGTATGCTTCTGC
TTCTGAGGGATTGGAGGTTCCACCTCCTGATGACCAGTACTTAGGTCTCCCTTCTGATGACTCGGAGGATGATGACTATGATCCCAGTGTTCCAGAACTTGATGAAGGTG
TTAGACGGGAAAGTTCAAGTTCTGACTTTACATCTGATTCTGAGGATCTAGCTGCACTTGACAATAACCGCTCTTCCAAAGATGATGACCTTGTGTCTTCTTCACTAAAT
AATACAATGTCTTTGAAAAACTCTAATGGGCAAAGTTCTGGATGCCGTCCTAGCAAGAGTGCACTACATAATGAGTTATCAAGTCTACTAGAGTCCGGTCCTGATAAGGA
TGGTCTTGAACCTGTTTTGGGAAGAAGACAGGTTGAACGGTTGGATTACAAGAAGCTCCATGATGAGACATACGGGAGTGTTCCTACCGACTCAAGCGATGACACGTACG
GGAGTACTTCTATGGACTCAAGTGATGATAGAGGCTGGGATAGTAGTACAAGGCATAGAGGTCCTAAAAAACTGGTTCTTGCATTGTCAAACAATGGAACTAATGATGAT
TTGACTAATGTAAAAACTAAACGCAGCTATAAGAGGAGAACTCGTCAAAAATCAGCTGCTATAAATGTGAATGATTCTGTGACTGATACTCCTGTAGACAATGCAAAATC
TAGTTCTTCTGTTAGGCAAACCACATCATCATCAAACAGAAGACTCAGTCAACCTGCATTGGAGAGACTTTTTGCATCATTCCAAGAAAATGAGTATCCTGAACGAGCTA
CAAAGGAGAGTTTGGCACAAGAACTAGGGCTCAGTCTAAAGCAGGTTAGCAAATGGTTTGAGAACACACGATGGAGCACACGCCATCCCTCAAGCGGTGGTAATAGAGCA
AAGAGTACCTCAAGGATGAGCATTCATTCATCTCAGGCAAGTGAACTACCCAAAAATGAGCAAGAATCTGGTGCATGTTTCAGAGATACCGATAACAATGGTGCTCAACA
TCAAGACTTACCAACAGTAAATAATGTAGTGGCCTCATGTCAGAGTGGGGATACAGGGGATAAGAAATCGGTGACTCAGAAAACTAAAAGAGCGGAATCTTCTGCCACAA
AATCCAGAAAACGGAAGAGCAAGTCAGATCACACGGCATCACATTCAAAAGACAGGGAGGAATCACCAAGGCCTCCTGCCAAGTCACCTAAAGTTAATGAAATCCAAACA
GCAGATAGGTTTAAGACAAGGAGGAGGAGATCCATTTAG
Protein sequence
MEERDENTDTESRHNNNVEAVQEAKASVEVEVLTCLSNEPMHSGYQELGTTPEYSSKTDGPDEEKPGVQQNMEFGSGYLLSELSENNNETISNHADNDQVEAGNLLSSDK
DTENLKLPIEVGATTLLNECAELPVEDVNKNYIELMNPPIEDLTQNTSIQKLEIVPSNSQQLGHKDKRILKSKKKNYKLRSLVNSDRVLRSRTQEKAKAPEPSNDLNNFT
AEEGKRKKRKKKRNIQGKGARVDEYSSIKNHLRYLLNRIKYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQID
SEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTEIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSED
GDYDPDVPDTIDQDNESSSDESSCGQSNSDTSGYASASEGLEVPPPDDQYLGLPSDDSEDDDYDPSVPELDEGVRRESSSSDFTSDSEDLAALDNNRSSKDDDLVSSSLN
NTMSLKNSNGQSSGCRPSKSALHNELSSLLESGPDKDGLEPVLGRRQVERLDYKKLHDETYGSVPTDSSDDTYGSTSMDSSDDRGWDSSTRHRGPKKLVLALSNNGTNDD
LTNVKTKRSYKRRTRQKSAAINVNDSVTDTPVDNAKSSSSVRQTTSSSNRRLSQPALERLFASFQENEYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSGGNRA
KSTSRMSIHSSQASELPKNEQESGACFRDTDNNGAQHQDLPTVNNVVASCQSGDTGDKKSVTQKTKRAESSATKSRKRKSKSDHTASHSKDREESPRPPAKSPKVNEIQT
ADRFKTRRRRSI
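The CDS and protein sequences listed above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, assuming the standard (NCBI table 1) code applies to this nuclear gene and that the CDS starts in frame; the names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering:
# first base varies slowest, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are ignored; ambiguous bases such as N
    raise KeyError in this simple sketch.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 39 nt and last 39 nt of the CDS shown above.
print(translate("ATGGAAGAAAGAGATGAAAATACTGATACAGAATCAAGA"))  # MEERDENTDTESR
print(translate("GCAGATAGGTTTAAGACAAGGAGGAGGAGATCCATTTAG"))  # ADRFKTRRRRSI
```

Both fragments agree with the start and end of the protein sequence. Passing the full CDS, with line breaks left in, should reproduce the complete protein.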