; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0012633 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0012633
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Descriptionhomeobox protein HAT3.1
Genome locationchr06:10696073..10702936
RNA-Seq ExpressionPI0012633
SyntenyPI0012633
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0003677 - DNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001356 - Homeobox domain
IPR001965 - Zinc finger, PHD-type
IPR009057 - Homeobox-like domain superfamily
IPR011011 - Zinc finger, FYVE/PHD-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR017970 - Homeobox, conserved site
IPR019786 - Zinc finger, PHD-type, conserved site
IPR019787 - Zinc finger, PHD-finger


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008456177.1 PREDICTED: pathogenesis-related homeodomain protein [Cucumis melo]0.0e+0090.8Show/hide
Query:  MEERDENTGTESRPNNNAEAVQEAKASDNMEERDENTDTESRPNNNIEAVQEANASDDMEGRDENTVTHSIPNNNAEAVEEAKASDNMEEREENTGTESR
        MEERDENTG ESR NNNAE+VQ+AKASDN+EERDE T  ESRPN N EA+QEANASDDME RDENT T SIPNNNAEAV+EA ASDNMEER+ENTGTESR
Subjt:  MEERDENTGTESRPNNNAEAVQEAKASDNMEERDENTDTESRPNNNIEAVQEANASDDMEGRDENTVTHSIPNNNAEAVEEAKASDNMEEREENTGTESR

Query:  PNNNAEAVEEAKASDNMKERDENTDTES-SNNNAEAAQEGRVGDNMEERDENTDTESRPNKIAEAVQEAKASVEVEVLTCLSNEPMYSGYQELGRTPEFS
        PNNNAEAV+EAKASDNMKERD NT TES  NNNAEAAQEG+  DNMEERDENTDTESRPNKIAEAVQEAKASVEVEV TCLSNEPMYSGYQELG TPEFS
Subjt:  PNNNAEAVEEAKASDNMKERDENTDTES-SNNNAEAAQEGRVGDNMEERDENTDTESRPNKIAEAVQEAKASVEVEVLTCLSNEPMYSGYQELGRTPEFS

Query:  SKTDGPDEEKAGVQQNMELGSGYLLSELSEKDNQTISNHADNDQVEAGNLLSSDKDTKNFKLSIEDETMTLLNECSELPLEDVTKNYIEKMNPPIEDLTQ
         KTDGPDEEKAGVQQNMELGSGYLLSELSEKDNQTISNHADNDQVEAGN LS DKDTKN KLSIEDET TLLNECSELPLEDVTKNYIEKMNPPIEDLTQ
Subjt:  SKTDGPDEEKAGVQQNMELGSGYLLSELSEKDNQTISNHADNDQVEAGNLLSSDKDTKNFKLSIEDETMTLLNECSELPLEDVTKNYIEKMNPPIEDLTQ

Query:  ITCIQNLETIPKN-----------FEIKKKNYKLRTLVSSDRVLRSRTQEKAKAPEPSNDLNNFIAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYL
        IT IQ+LETIP N           F+ KKKNYKLR+LVSSDRVLRSRTQEKAKAPEPSNDLNNF AEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYL
Subjt:  ITCIQNLETIPKN-----------FEIKKKNYKLRTLVSSDRVLRSRTQEKAKAPEPSNDLNNFIAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYL

Query:  LNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDG
        LNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRID LCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDG
Subjt:  LNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDG

Query:  ICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDAIDQDN
        ICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPE AAAAAGRNSD TLGLPSDDSEDGDYDPD+PD IDQDN
Subjt:  ICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDAIDQDN

Query:  ELSSDESSSDQLNSDQSSSDQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSEDDDYDPSVPELDEGVRQESSSSDFTSDSEDLAALDNNCSSKDDNVV
        ELSSDE          SSSDQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSED+DYDPSVPELDEG RQESSSSDFTSDSEDLAAL+NNCSSKDD++V
Subjt:  ELSSDESSSDQLNSDQSSSDQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSEDDDYDPSVPELDEGVRQESSSSDFTSDSEDLAALDNNCSSKDDNVV

Query:  SSLNDTLPVENSNGKSSGPSKSALHNELSSLLDSVPDKDDLEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDTYGSTLDSSDDRGWDSSTRKRRPKTLV
        SSLN+TLPV+N+NG+SSGPSKS LHNELSSLLDS  DKD LEP+SGRRQVERLDYKKLHDETYGNVPT+SSDDTYGSTLDSSDDRG DS TRKR PKTLV
Subjt:  SSLNDTLPVENSNGKSSGPSKSALHNELSSLLDSVPDKDDLEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDTYGSTLDSSDDRGWDSSTRKRRPKTLV

Query:  LALSNNGTNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQSTSSSNRRLSQPALERLFASFQINEYPKRATKESLAQELGLSLKQV
        LALSNNG+NDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQ TSSSNRRLSQPALERLFASFQ NEYPKRATKESLAQELGL+LKQV
Subjt:  LALSNNGTNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQSTSSSNRRLSQPALERLFASFQINEYPKRATKESLAQELGLSLKQV

Query:  SKWFENTRWSTRHPSSGGNKAKSSSRMSIHSSQASGELSKNEQESAKCFRDTDSNGAQHQDLPMVNSVVASCQSGDTGDKKSTTRKTKRAESSATKSRKR
        SKWFENTRWSTRHPSSGG KAKSSSRMSIH SQASGELSKNEQESA CFRDTDSNGA+HQDLPM NSVVASCQSGDTGDKK TTRKTKR ESSATKSRKR
Subjt:  SKWFENTRWSTRHPSSGGNKAKSSSRMSIHSSQASGELSKNEQESAKCFRDTDSNGAQHQDLPMVNSVVASCQSGDTGDKKSTTRKTKRAESSATKSRKR

Query:  KGRSDNTASHSKDREGSPRPPAKSPKVNEMQTADRFKTRRRRSI
        KGRSDNTAS+SKDREGSPRPPAKSPKVNE QTADRFKTRRRRSI
Subjt:  KGRSDNTASHSKDREGSPRPPAKSPKVNEMQTADRFKTRRRRSI

XP_011651230.2 homeobox protein HOX1A [Cucumis sativus]0.0e+0090.33Show/hide
Query:  MEERDENTGTESRPNNNAEAVQEAKASDNMEERDENTDTESRPNNNIEAVQEANASDDMEGRDENTVTHSIPNNNAEAVEEAKASDNMEEREENTGTESR
        MEERDE+T TESRPNNNAEAVQEAKAS +MEERDENT TE RP NN E+VQ+A ASD+ME RDENT T S PNNN EAV+EA ASDNMEER+E+TGTESR
Subjt:  MEERDENTGTESRPNNNAEAVQEAKASDNMEERDENTDTESRPNNNIEAVQEANASDDMEGRDENTVTHSIPNNNAEAVEEAKASDNMEEREENTGTESR

Query:  PNNNAEAVEEAKASDNMKERDENTDTES-SNNNAEAAQEGRVGDNMEERDENTDTESRPNKIAEAVQEAKASVEVEVLTCLSNEPMYSGYQELGRTPEFS
        PNNNAEAV+EAKASDNMKERDENT TES  NNNAEAAQEG+  DNMEERDENTDTESRPNKIAEAVQEAKASVEVEVLTCLSNE  YSGYQELG TPEFS
Subjt:  PNNNAEAVEEAKASDNMKERDENTDTES-SNNNAEAAQEGRVGDNMEERDENTDTESRPNKIAEAVQEAKASVEVEVLTCLSNEPMYSGYQELGRTPEFS

Query:  SKTDGPDEEKAGVQQNMELGSGYLLSELSEKDNQTISNHADNDQVEAGNLLSSDKDTKNFKLSIEDETMTLLNECSELPLEDVTKNYIEKMNPPIEDLTQ
        SK DGPDEEKAGVQQNMELGSGYLLSELSEKDNQTISNHADND+VEAGNLLS+DKDTKN KLSIEDE  TLLNECSELPLEDVTKNYIEKMNPPI DLTQ
Subjt:  SKTDGPDEEKAGVQQNMELGSGYLLSELSEKDNQTISNHADNDQVEAGNLLSSDKDTKNFKLSIEDETMTLLNECSELPLEDVTKNYIEKMNPPIEDLTQ

Query:  ITCIQNLETIPKN-----------FEIKKKNYKLRTLVSSDRVLRSRTQEKAKAPEPSNDLNNFIAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYL
        IT IQ+LETIP N            + KKKNYKLR+ VSSDRVLRSRTQEKAKAPE SNDLNNF AEE+GKRKKKKKRNIQGKGARVDEYSSIRNHLRYL
Subjt:  ITCIQNLETIPKN-----------FEIKKKNYKLRTLVSSDRVLRSRTQEKAKAPEPSNDLNNFIAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYL

Query:  LNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDG
        LNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDG
Subjt:  LNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDG

Query:  ICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDAIDQDN
        ICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPD IDQDN
Subjt:  ICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDAIDQDN

Query:  ELSSDESSSDQLNSDQSSSDQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSEDDDYDPSVPELDEGVRQESSSSDFTSDSEDLAALDNNCSSKDDNVV
        ELSSDESS     SDQS+SD SNSDTSGYASASEGLEV  NDDQYLGLPSDDSED+DYDPSVPELDEGVRQESSSSDFTSDSEDLAALDNNCSSKD ++V
Subjt:  ELSSDESSSDQLNSDQSSSDQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSEDDDYDPSVPELDEGVRQESSSSDFTSDSEDLAALDNNCSSKDDNVV

Query:  SSLNDTLPVENSNGKSSGPSKSALHNELSSLLDSVPDKDDLEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDTYGSTLDSSDDRGWDSSTRKRRPKTLV
        SSLN+TLPV+NSNG+SSGP+KSALHNELSSLLDS PDKD LEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDTYGSTLDSSDDRGWDS TRKR PKTLV
Subjt:  SSLNDTLPVENSNGKSSGPSKSALHNELSSLLDSVPDKDDLEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDTYGSTLDSSDDRGWDSSTRKRRPKTLV

Query:  LALSNNGTNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQSTSSSNRRLSQPALERLFASFQINEYPKRATKESLAQELGLSLKQV
        LALSNNG+NDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSV++STSSSNRRLSQPALERL ASFQ NEYPKRATK+SLAQELGL LKQV
Subjt:  LALSNNGTNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQSTSSSNRRLSQPALERLFASFQINEYPKRATKESLAQELGLSLKQV

Query:  SKWFENTRWSTRHPSSGGNKAKSSSRMSIHSSQASGELSKNEQESAKCFRDTDSNGAQHQDLPMVNSVVASCQSGDTGDKKSTTRKTKRAESSATKSRKR
        SKWFENTRWSTRHPSS G KAKSSSRMSI+ SQASGELSKNE ESA CFRDTDSNGA+HQDLPM NSVVASCQSGDTGDKK ++RKTKRA+SSATKSRKR
Subjt:  SKWFENTRWSTRHPSSGGNKAKSSSRMSIHSSQASGELSKNEQESAKCFRDTDSNGAQHQDLPMVNSVVASCQSGDTGDKKSTTRKTKRAESSATKSRKR

Query:  KGRSDNTASHSKDREGSPRPPAKSPKVNEMQTADRFKTRRRRSI
        KGRSDNTASHSKDREGSPRPPAKSPKVNEMQTADRFKTRRRRSI
Subjt:  KGRSDNTASHSKDREGSPRPPAKSPKVNEMQTADRFKTRRRRSI

XP_022149322.1 homeobox protein HAT3.1 isoform X1 [Momordica charantia]0.0e+0074.95Show/hide
Query:  MEERDENTDTESRPNKIAEAVQEAKASVEVEVLTCLSNEPMYS--GYQELGRTPEFSSKTDGPDEEKAGVQQNM-----ELGSGYLLSELSEKDNQTISN
        MEER E   TE RPN   EAVQEAKAS  VEVLTC SNE M+S    QELG TPE +SKT GPD+EK+GVQQNM     ELGSG +LSEL EK+NQTIS 
Subjt:  MEERDENTDTESRPNKIAEAVQEAKASVEVEVLTCLSNEPMYS--GYQELGRTPEFSSKTDGPDEEKAGVQQNM-----ELGSGYLLSELSEKDNQTISN

Query:  HADNDQVEAGNLLSSDKDTKNFKLSIEDETMTLLNECSELPLEDVTKNYIEKMNPPIEDLTQITCIQNLETIP---------------KNFEIKKKNYKL
         A+ DQVEAGNLLSSD +T+N  L IE ET T LNECSELP ED  KN I+++NPPIEDLTQ T IQ LET+P               K  + KKKNY L
Subjt:  HADNDQVEAGNLLSSDKDTKNFKLSIEDETMTLLNECSELPLEDVTKNYIEKMNPPIEDLTQITCIQNLETIP---------------KNFEIKKKNYKL

Query:  RTLVSSDRVLRSRTQEKAKAPEPSNDLNNFIAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQ
        R+LVSSDRVLRSRTQEKAKAPEPSN+LN   A  EGKR KKKKRNI+GKGA  DE+SSIRN LRYL+NRI+YEQSLI+AYSSEGWKGFSSDKLKPEKELQ
Subjt:  RTLVSSDRVLRSRTQEKAKAPEPSNDLNNFIAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQ

Query:  RASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDC
        RAS+EIMR KLKIRDLFQ +D+LCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDC
Subjt:  RASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDC

Query:  KDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDAIDQDNELSSDESSSDQLNSDQSSSDQSNSDTSGYASASE
        KDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAG+NSDH LGLPSDDSEDGDYDPD PD I+Q+               D+SSSDQS+SD SGYASASE
Subjt:  KDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDAIDQDNELSSDESSSDQLNSDQSSSDQSNSDTSGYASASE

Query:  GLEVPPNDDQYLGLPSDDSEDDDYDPSVPELDEGVRQESSSSDFTSDSEDLAALDNNCSSKDDNVVSSLNDTLPVENSNGKSS--GPSKSALHNELSSLL
         LE  PNDDQYLGLPSDDSEDDDY+P  PELDEGV+QESS SDFTSDSEDLAALD+               T PV NSNG+ S  GP  S LHNEL SLL
Subjt:  GLEVPPNDDQYLGLPSDDSEDDDYDPSVPELDEGVRQESSSSDFTSDSEDLAALDNNCSSKDDNVVSSLNDTLPVENSNGKSS--GPSKSALHNELSSLL

Query:  DSVPDKDDLEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDTYGS-TLDSSDDRGWDSSTRKRRPKTLVLALSNNGTNDDLTNVKTKRSYKRRTRQKPGA
        +S PDKD LEPVSGRRQVERLDYKKLHDETYGNVP+DSSDDT+GS ++DSSDDRG  S TRKR PK LV AL  NGTNDDL N KTKRSYKRRT QKPGA
Subjt:  DSVPDKDDLEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDTYGS-TLDSSDDRGWDSSTRKRRPKTLVLALSNNGTNDDLTNVKTKRSYKRRTRQKPGA

Query:  INVNNSVTETPVDTAKSSSSVRQSTSSSNRRLSQPALERLFASFQINEYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSS-GGNKAKSSSRMSIH
         N+ NSVT TP D+ KSSSSVR++ SSSNRRLSQPALERL ASFQ N+YPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSS   NKAKS+ RM I 
Subjt:  INVNNSVTETPVDTAKSSSSVRQSTSSSNRRLSQPALERLFASFQINEYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSS-GGNKAKSSSRMSIH

Query:  SSQASGELSKNEQESAKCFRDTDSNGAQHQDLPMVNSVVASCQSGDTGDKKSTTRKTKRAESSATKSRKRKGRSDNTASHSKDREGSPRPPAKSPKVNEM
        SS+ SG+L K EQES  CFRDTD+NGAQHQ  P  +  VA CQSGDT D K  T+KT R ES+ATKSRKRKGRSD+ ASHSKDR+ S +PPAKSPKVN++
Subjt:  SSQASGELSKNEQESAKCFRDTDSNGAQHQDLPMVNSVVASCQSGDTGDKKSTTRKTKRAESSATKSRKRKGRSDNTASHSKDREGSPRPPAKSPKVNEM

Query:  QTADRFKTRRRRSI
        QTAD+ +TRRRRSI
Subjt:  QTADRFKTRRRRSI

XP_038876083.1 homeobox protein HAT3.1 isoform X1 [Benincasa hispida]0.0e+0085.48Show/hide
Query:  MEERDENTGTESRPNNNAEAVQEAKASDNMEERDENTDTESRPNNNIEAVQEANASDDMEGRDENTVTHSIPNNNAEAVEEAKASDNMEEREENTGTESR
        MEERDENT TE RPN+NAEAVQEAKASDNME RDEN+DTESRPN++ EAVQEA ASD+MEGRDENT T S PNN+AEAV+EAKASDNME R+ENT TESR
Subjt:  MEERDENTGTESRPNNNAEAVQEAKASDNMEERDENTDTESRPNNNIEAVQEANASDDMEGRDENTVTHSIPNNNAEAVEEAKASDNMEEREENTGTESR

Query:  PNNNAEAVEEAKASDNMKERDENTDTES-SNNNAEAAQEGRVGDNMEERDENTDTESRPNKIAEAVQEAKASVEVEVLTCLSNEPMYSGYQELGRTPEFS
        PNN+AEAV+EAKASDNM+  DENTDTES  N +A A QE +  DNMEERDENTDTESRPN  AE VQEAKASVEVEVLTCLSNEPM+SGYQELG TPE+S
Subjt:  PNNNAEAVEEAKASDNMKERDENTDTES-SNNNAEAAQEGRVGDNMEERDENTDTESRPNKIAEAVQEAKASVEVEVLTCLSNEPMYSGYQELGRTPEFS

Query:  SKTDGPDEEKAGVQQNMELGSGYLLSELSEKDNQTISNHADNDQVEAGNLLSSDKDTKNFKLSIEDETMTLLNECSELPLEDVTKNYIEKMNPPIEDLTQ
        SKTDGPDEEK GVQQNMELGSGYLLSEL EKDNQT+SNHADNDQVEAGNLLSSDKDT+N KL IE ET TLLNECSELP+EDV KN+IE+MNPPIEDLTQ
Subjt:  SKTDGPDEEKAGVQQNMELGSGYLLSELSEKDNQTISNHADNDQVEAGNLLSSDKDTKNFKLSIEDETMTLLNECSELPLEDVTKNYIEKMNPPIEDLTQ

Query:  ITCIQNLETIPKN-----------FEIKKKNYKLRTLVSSDRVLRSRTQEKAKAPEPSNDLNNFIAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYL
           IQNLE IP N            + KK NY+LR+LVSSDRVLRSRTQEKAKAPEPSN LNNF AEE  ++KKKKKRNIQGK ARVDEYSSIR  LRYL
Subjt:  ITCIQNLETIPKN-----------FEIKKKNYKLRTLVSSDRVLRSRTQEKAKAPEPSNDLNNFIAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYL

Query:  LNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDG
        LNRI YEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIM+RKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDG
Subjt:  LNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDG

Query:  ICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDAIDQDN
        ICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITD WEKVYPEAAAAAAG+NSDHTLGLPSDDSEDGDYDPDVPD IDQDN
Subjt:  ICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDAIDQDN

Query:  ELSSDESSSDQLNSDQSSSDQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSEDDDYDPSVPELDEGVRQESSSSDFTSDSEDLAALDNNCSSKDDNVV
        E SSDESS        SSSDQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSEDDDYDPSVPELDEGVR+ESSSSDFTSDSEDLAALDNN  SKDD+ V
Subjt:  ELSSDESSSDQLNSDQSSSDQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSEDDDYDPSVPELDEGVRQESSSSDFTSDSEDLAALDNNCSSKDDNVV

Query:  SSLNDTLPVENSNGKSS--GPSKSALHNELSSLLDSVPDKDDLEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDTYGST-LDSSDDRGWDSSTRKRRPK
        SSLN+TL V+NSNG+SS  GPSKSALHNELSSL      KD LEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDTYGST +DSS DRGWDSSTRKR P+
Subjt:  SSLNDTLPVENSNGKSS--GPSKSALHNELSSLLDSVPDKDDLEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDTYGST-LDSSDDRGWDSSTRKRRPK

Query:  TLVLALSNNGTNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQSTSSSNRRLSQPALERLFASFQINEYPKRATKESLAQELGLSL
         LVLALSNNGTNDDLTNVKTKRS+K RTRQK  AINVNNSVTETPVDTAKSSSS RQ+TSSSNRRLSQPALERLFASFQ NEYPKRATKESLAQELGLSL
Subjt:  TLVLALSNNGTNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQSTSSSNRRLSQPALERLFASFQINEYPKRATKESLAQELGLSL

Query:  KQVSKWFENTRWSTRHPSSGGNKAKSSSRMSIHSSQASGELSKNEQESAKCFRDTDSNGAQHQDLPMVNSVVASCQSGDTGDKKSTTRKTKRAESSATKS
        KQVS+WFENTRWSTRHPSSGGN+AKSSSRMS  SS+ASGEL KNEQES  CFRDTDSNGAQHQDLP  NS    CQSGDTGDKK  TRKTKRAESSATKS
Subjt:  KQVSKWFENTRWSTRHPSSGGNKAKSSSRMSIHSSQASGELSKNEQESAKCFRDTDSNGAQHQDLPMVNSVVASCQSGDTGDKKSTTRKTKRAESSATKS

Query:  RKRKGRSDNTASHSKDREGSPRPPAKSPKVNEMQTADRFKTRRRRSI
        RKRK  SD+ ASH+KD+E S RPPAKSPKVNE+QTADRFKTRRRRSI
Subjt:  RKRKGRSDNTASHSKDREGSPRPPAKSPKVNEMQTADRFKTRRRRSI

XP_038876114.1 homeobox protein HAT3.1 isoform X2 [Benincasa hispida]0.0e+0085.83Show/hide
Query:  MEERDENTGTESRPNNNAEAVQEAKASDNMEERDENTDTESRPNNNIEAVQEANASDDMEGRDENTVTHSIPNNNAEAVEEAKASDNMEEREENTGTESR
        MEERDENT TE RPN+NAEAVQEAKASDNME RDEN+DTESRPN++ EAVQEA ASD+MEGRDENT T S PNN+AEAV+EAKASDNME R+ENT TESR
Subjt:  MEERDENTGTESRPNNNAEAVQEAKASDNMEERDENTDTESRPNNNIEAVQEANASDDMEGRDENTVTHSIPNNNAEAVEEAKASDNMEEREENTGTESR

Query:  PNNNAEAVEEAKASDNMKERDENTDTES-SNNNAEAAQEGRVGDNMEERDENTDTESRPNKIAEAVQEAKASVEVEVLTCLSNEPMYSGYQELGRTPEFS
        PNN+AEAV+EAKASDNM+  DENTDTES  N +A A QE +  DNMEERDENTDTESRPN  AE VQEAKASVEVEVLTCLSNEPM+SGYQELG TPE+S
Subjt:  PNNNAEAVEEAKASDNMKERDENTDTES-SNNNAEAAQEGRVGDNMEERDENTDTESRPNKIAEAVQEAKASVEVEVLTCLSNEPMYSGYQELGRTPEFS

Query:  SKTDGPDEEKAGVQQNMELGSGYLLSELSEKDNQTISNHADNDQVEAGNLLSSDKDTKNFKLSIEDETMTLLNECSELPLEDVTKNYIEKMNPPIEDLTQ
        SKTDGPDEEK GVQQNMELGSGYLLSEL EKDNQT+SNHADNDQVEAGNLLSSDKDT+N KL IE ET TLLNECSELP+EDV KN+IE+MNPPIEDLTQ
Subjt:  SKTDGPDEEKAGVQQNMELGSGYLLSELSEKDNQTISNHADNDQVEAGNLLSSDKDTKNFKLSIEDETMTLLNECSELPLEDVTKNYIEKMNPPIEDLTQ

Query:  ITCIQNLETIPKN-----------FEIKKKNYKLRTLVSSDRVLRSRTQEKAKAPEPSNDLNNFIAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYL
           IQNLE IP N            + KK NY+LR+LVSSDRVLRSRTQEKAKAPEPSN LNNF AEE  ++KKKKKRNIQGK ARVDEYSSIR  LRYL
Subjt:  ITCIQNLETIPKN-----------FEIKKKNYKLRTLVSSDRVLRSRTQEKAKAPEPSNDLNNFIAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYL

Query:  LNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDG
        LNRI YEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIM+RKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDG
Subjt:  LNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDG

Query:  ICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDAIDQDN
        ICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITD WEKVYPEAAAAAAG+NSDHTLGLPSDDSEDGDYDPDVPD IDQDN
Subjt:  ICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDAIDQDN

Query:  ELSSDESSSDQLNSDQSSSDQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSEDDDYDPSVPELDEGVRQESSSSDFTSDSEDLAALDNNCSSKDDNVV
        E SSDESS        SSSDQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSEDDDYDPSVPELDEGVR+ESSSSDFTSDSEDLAALDNN  SKDD+ V
Subjt:  ELSSDESSSDQLNSDQSSSDQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSEDDDYDPSVPELDEGVRQESSSSDFTSDSEDLAALDNNCSSKDDNVV

Query:  SSLNDTLPVENSNGKSS--GPSKSALHNELSSLLDSVPDKDDLEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDTYGST-LDSSDDRGWDSSTRKRRPK
        SSLN+TL V+NSNG+SS  GPSKSALHNELSSL      KD LEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDTYGST +DSS DRGWDSSTRKR P+
Subjt:  SSLNDTLPVENSNGKSS--GPSKSALHNELSSLLDSVPDKDDLEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDTYGST-LDSSDDRGWDSSTRKRRPK

Query:  TLVLALSNNGTNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQSTSSSNRRLSQPALERLFASFQINEYPKRATKESLAQELGLSL
         LVLALSNNGTNDDLTNVKTKRS+K RTRQK  AINVNNSVTETPVDTAKSSSS RQ+TSSSNRRLSQPALERLFASFQ NEYPKRATKESLAQELGLSL
Subjt:  TLVLALSNNGTNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQSTSSSNRRLSQPALERLFASFQINEYPKRATKESLAQELGLSL

Query:  KQV
        KQ+
Subjt:  KQV

TrEMBL top hitse value%identityAlignment
A0A0A0LA53 PHD-type domain-containing protein0.0e+0089.87Show/hide
Query:  MEERDENTGTESRPNNNAEAVQEAKASDNMEERDENTDTESRPNNNIEAVQEANASDDMEGRDENTVTHSIPNNNAEAVEEAKASDNMEEREENTGTESR
        MEERDE+TGTESRPNNNAEAVQEAKAS +MEERDE+T TESRPNNN EAVQEA AS DME RDE+T T S PNNN EAV+EA ASDNMEER+E+TGTESR
Subjt:  MEERDENTGTESRPNNNAEAVQEAKASDNMEERDENTDTESRPNNNIEAVQEANASDDMEGRDENTVTHSIPNNNAEAVEEAKASDNMEEREENTGTESR

Query:  PNNNAEAVEEAKASDNMKERDENTDTES-SNNNAEAAQEGRVGDNMEERDENTDTESRPNKIAEAVQEAKASVEVEVLTCLSNEPMYSGYQELGRTPEFS
        PNNNAEAV+EAKASDNMKERDENT TES  NNNAEAAQEG+  DNMEERDENTDTESRPNKIAEAVQEAKASVEVEVLTCLSNE  YSGYQELG TPEFS
Subjt:  PNNNAEAVEEAKASDNMKERDENTDTES-SNNNAEAAQEGRVGDNMEERDENTDTESRPNKIAEAVQEAKASVEVEVLTCLSNEPMYSGYQELGRTPEFS

Query:  SKTDGPDEEKAGVQQNMELGSGYLLSELSEKDNQTISNHADNDQVEAGNLLSSDKDTKNFKLSIEDETMTLLNECSELPLEDVTKNYIEKMNPPIEDLTQ
        SK DGPDEEKAGVQQNMELGSGYLLSELSEKDNQTISNHADND+VEAGNLLS+DKDTKN KLSIEDE  TLLNECSELPLEDVTKNYIEKMNPPI DLTQ
Subjt:  SKTDGPDEEKAGVQQNMELGSGYLLSELSEKDNQTISNHADNDQVEAGNLLSSDKDTKNFKLSIEDETMTLLNECSELPLEDVTKNYIEKMNPPIEDLTQ

Query:  ITCIQNLETIPKN-----------FEIKKKNYKLRTLVSSDRVLRSRTQEKAKAPEPSNDLNNFIAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYL
        IT IQ+LETIP N            + KKKNYKLR+ VSSDRVLRSRTQEKAKAPE SNDLNNF AEE+GKRKKKKKRNIQGKGARVDEYSSIRNHLRYL
Subjt:  ITCIQNLETIPKN-----------FEIKKKNYKLRTLVSSDRVLRSRTQEKAKAPEPSNDLNNFIAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYL

Query:  LNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDG
        LNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDG
Subjt:  LNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDG

Query:  ICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDAIDQDN
        ICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPD IDQDN
Subjt:  ICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDAIDQDN

Query:  ELSSDESSSDQLNSDQSSSDQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSEDDDYDPSVPELDEGVRQESSSSDFTSDSEDLAALDNNCSSKDDNVV
        ELSSDESS     SDQS+SD SNSDTSGYASASEGLEV  NDDQYLGLPSDDSED+DYDPSVPELDEGVRQESSSSDFTSDSEDLAALDNNCSSKD ++V
Subjt:  ELSSDESSSDQLNSDQSSSDQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSEDDDYDPSVPELDEGVRQESSSSDFTSDSEDLAALDNNCSSKDDNVV

Query:  SSLNDTLPVENSNGKSSGPSKSALHNELSSLLDSVPDKDDLEPVSGRRQVERLDYKKLHD
        SSLN+TLPV+NSNG+SSGP+KSALHNELSSLLDS PDKD LEPVSGRRQVERLDYKKLHD
Subjt:  SSLNDTLPVENSNGKSSGPSKSALHNELSSLLDSVPDKDDLEPVSGRRQVERLDYKKLHD

A0A1S3C283 pathogenesis-related homeodomain protein0.0e+0090.8Show/hide
Query:  MEERDENTGTESRPNNNAEAVQEAKASDNMEERDENTDTESRPNNNIEAVQEANASDDMEGRDENTVTHSIPNNNAEAVEEAKASDNMEEREENTGTESR
        MEERDENTG ESR NNNAE+VQ+AKASDN+EERDE T  ESRPN N EA+QEANASDDME RDENT T SIPNNNAEAV+EA ASDNMEER+ENTGTESR
Subjt:  MEERDENTGTESRPNNNAEAVQEAKASDNMEERDENTDTESRPNNNIEAVQEANASDDMEGRDENTVTHSIPNNNAEAVEEAKASDNMEEREENTGTESR

Query:  PNNNAEAVEEAKASDNMKERDENTDTES-SNNNAEAAQEGRVGDNMEERDENTDTESRPNKIAEAVQEAKASVEVEVLTCLSNEPMYSGYQELGRTPEFS
        PNNNAEAV+EAKASDNMKERD NT TES  NNNAEAAQEG+  DNMEERDENTDTESRPNKIAEAVQEAKASVEVEV TCLSNEPMYSGYQELG TPEFS
Subjt:  PNNNAEAVEEAKASDNMKERDENTDTES-SNNNAEAAQEGRVGDNMEERDENTDTESRPNKIAEAVQEAKASVEVEVLTCLSNEPMYSGYQELGRTPEFS

Query:  SKTDGPDEEKAGVQQNMELGSGYLLSELSEKDNQTISNHADNDQVEAGNLLSSDKDTKNFKLSIEDETMTLLNECSELPLEDVTKNYIEKMNPPIEDLTQ
         KTDGPDEEKAGVQQNMELGSGYLLSELSEKDNQTISNHADNDQVEAGN LS DKDTKN KLSIEDET TLLNECSELPLEDVTKNYIEKMNPPIEDLTQ
Subjt:  SKTDGPDEEKAGVQQNMELGSGYLLSELSEKDNQTISNHADNDQVEAGNLLSSDKDTKNFKLSIEDETMTLLNECSELPLEDVTKNYIEKMNPPIEDLTQ

Query:  ITCIQNLETIPKN-----------FEIKKKNYKLRTLVSSDRVLRSRTQEKAKAPEPSNDLNNFIAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYL
        IT IQ+LETIP N           F+ KKKNYKLR+LVSSDRVLRSRTQEKAKAPEPSNDLNNF AEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYL
Subjt:  ITCIQNLETIPKN-----------FEIKKKNYKLRTLVSSDRVLRSRTQEKAKAPEPSNDLNNFIAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYL

Query:  LNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDG
        LNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRID LCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDG
Subjt:  LNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDG

Query:  ICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDAIDQDN
        ICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPE AAAAAGRNSD TLGLPSDDSEDGDYDPD+PD IDQDN
Subjt:  ICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDAIDQDN

Query:  ELSSDESSSDQLNSDQSSSDQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSEDDDYDPSVPELDEGVRQESSSSDFTSDSEDLAALDNNCSSKDDNVV
        ELSSDE          SSSDQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSED+DYDPSVPELDEG RQESSSSDFTSDSEDLAAL+NNCSSKDD++V
Subjt:  ELSSDESSSDQLNSDQSSSDQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSEDDDYDPSVPELDEGVRQESSSSDFTSDSEDLAALDNNCSSKDDNVV

Query:  SSLNDTLPVENSNGKSSGPSKSALHNELSSLLDSVPDKDDLEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDTYGSTLDSSDDRGWDSSTRKRRPKTLV
        SSLN+TLPV+N+NG+SSGPSKS LHNELSSLLDS  DKD LEP+SGRRQVERLDYKKLHDETYGNVPT+SSDDTYGSTLDSSDDRG DS TRKR PKTLV
Subjt:  SSLNDTLPVENSNGKSSGPSKSALHNELSSLLDSVPDKDDLEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDTYGSTLDSSDDRGWDSSTRKRRPKTLV

Query:  LALSNNGTNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQSTSSSNRRLSQPALERLFASFQINEYPKRATKESLAQELGLSLKQV
        LALSNNG+NDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQ TSSSNRRLSQPALERLFASFQ NEYPKRATKESLAQELGL+LKQV
Subjt:  LALSNNGTNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQSTSSSNRRLSQPALERLFASFQINEYPKRATKESLAQELGLSLKQV

Query:  SKWFENTRWSTRHPSSGGNKAKSSSRMSIHSSQASGELSKNEQESAKCFRDTDSNGAQHQDLPMVNSVVASCQSGDTGDKKSTTRKTKRAESSATKSRKR
        SKWFENTRWSTRHPSSGG KAKSSSRMSIH SQASGELSKNEQESA CFRDTDSNGA+HQDLPM NSVVASCQSGDTGDKK TTRKTKR ESSATKSRKR
Subjt:  SKWFENTRWSTRHPSSGGNKAKSSSRMSIHSSQASGELSKNEQESAKCFRDTDSNGAQHQDLPMVNSVVASCQSGDTGDKKSTTRKTKRAESSATKSRKR

Query:  KGRSDNTASHSKDREGSPRPPAKSPKVNEMQTADRFKTRRRRSI
        KGRSDNTAS+SKDREGSPRPPAKSPKVNE QTADRFKTRRRRSI
Subjt:  KGRSDNTASHSKDREGSPRPPAKSPKVNEMQTADRFKTRRRRSI

A0A5D3CQ03 Pathogenesis-related homeodomain protein0.0e+0085.79Show/hide
Query:  MEERDENTGTESRPNNNAEAVQEAKASDNMEERDENTDTESRPNNNIEAVQEANASDDMEGRDENTVTHSIPNNNAEAVEEAKASDNMEEREENTGTESR
        MEERDENTG ESR NNNAE+VQ+AKASDN+EERDE T  ESRPN N EA+QEANASDDME RDENT T SIPNNNAEAV+EA ASDNMEER+ENTGTESR
Subjt:  MEERDENTGTESRPNNNAEAVQEAKASDNMEERDENTDTESRPNNNIEAVQEANASDDMEGRDENTVTHSIPNNNAEAVEEAKASDNMEEREENTGTESR

Query:  PNNNAEAVEEAKASDNMKERDENTDTES-SNNNAEAAQEGRVGDNMEERDENTDTESRPNKIAEAVQEAKASVEVEVLTCLSNEPMYSGYQELGRTPEFS
        PNNNAEAV+EAKASDNMKERD NT TES  NNNAEAAQEG+  DNMEERDENTDTESRPNKIAEAVQEAKASVEVEV TCLSNEPMYSGYQELG TPEFS
Subjt:  PNNNAEAVEEAKASDNMKERDENTDTES-SNNNAEAAQEGRVGDNMEERDENTDTESRPNKIAEAVQEAKASVEVEVLTCLSNEPMYSGYQELGRTPEFS

Query:  SKTDGPDEEKAGVQQNMELGSGYLLSELSEKDNQTISNHADNDQVEAGNLLSSDKDTKNFKLSIEDETMTLLNECSELPLEDVTKNYIEKMNPPIEDLTQ
         KTDGPDEEKAGVQQNMELGSGYLLSELSEKDNQTISNHADNDQVEAGN LS DKDTKN KLSIEDET TLLNECSELPLEDVTKNYIEKMNPPIEDLTQ
Subjt:  SKTDGPDEEKAGVQQNMELGSGYLLSELSEKDNQTISNHADNDQVEAGNLLSSDKDTKNFKLSIEDETMTLLNECSELPLEDVTKNYIEKMNPPIEDLTQ

Query:  ITCIQNLETIPKN-----------FEIKKKNYKLRTLVSSDRVLRSRTQEKAKAPEPSNDLNNFIAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYL
        IT IQ+LETIP N           F+ KKKNYKLR+LVSSDRVLRSRTQEKAKAPEP                             +DEYSSIRNHLRYL
Subjt:  ITCIQNLETIPKN-----------FEIKKKNYKLRTLVSSDRVLRSRTQEKAKAPEPSNDLNNFIAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYL

Query:  LNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDG
        LNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRID LCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDG
Subjt:  LNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDG

Query:  ICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDAIDQDN
        ICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPE AAAAAGRNSD TLGLPSDDSEDGDYDPD+PD IDQDN
Subjt:  ICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDAIDQDN

Query:  ELSSDESSSDQLNSDQSSSDQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSEDDDYDPSVPELDEGVRQESSSSDFTSDSEDLAALDNNCSSKDDNVV
        ELSSDE          SSSDQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSED+DYDPSVPELDEG RQESSSSDFTSDSEDLAAL+NNCSSKDD++V
Subjt:  ELSSDESSSDQLNSDQSSSDQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSEDDDYDPSVPELDEGVRQESSSSDFTSDSEDLAALDNNCSSKDDNVV

Query:  SSLNDTLPVENSNGKSSGPSKSALHNELSSLLDSVPDKDDLEPVSGRRQVERLDYKKLHD
        SSLN+TLPV+N+NG+SSGPSKS LHNELSSLLDS  DKD LEP+SGRRQVERLDYKKLHD
Subjt:  SSLNDTLPVENSNGKSSGPSKSALHNELSSLLDSVPDKDDLEPVSGRRQVERLDYKKLHD

A0A6J1D6Q5 homeobox protein HAT3.1 isoform X10.0e+0074.95Show/hide
Query:  MEERDENTDTESRPNKIAEAVQEAKASVEVEVLTCLSNEPMYS--GYQELGRTPEFSSKTDGPDEEKAGVQQNM-----ELGSGYLLSELSEKDNQTISN
        MEER E   TE RPN   EAVQEAKAS  VEVLTC SNE M+S    QELG TPE +SKT GPD+EK+GVQQNM     ELGSG +LSEL EK+NQTIS 
Subjt:  MEERDENTDTESRPNKIAEAVQEAKASVEVEVLTCLSNEPMYS--GYQELGRTPEFSSKTDGPDEEKAGVQQNM-----ELGSGYLLSELSEKDNQTISN

Query:  HADNDQVEAGNLLSSDKDTKNFKLSIEDETMTLLNECSELPLEDVTKNYIEKMNPPIEDLTQITCIQNLETIP---------------KNFEIKKKNYKL
         A+ DQVEAGNLLSSD +T+N  L IE ET T LNECSELP ED  KN I+++NPPIEDLTQ T IQ LET+P               K  + KKKNY L
Subjt:  HADNDQVEAGNLLSSDKDTKNFKLSIEDETMTLLNECSELPLEDVTKNYIEKMNPPIEDLTQITCIQNLETIP---------------KNFEIKKKNYKL

Query:  RTLVSSDRVLRSRTQEKAKAPEPSNDLNNFIAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQ
        R+LVSSDRVLRSRTQEKAKAPEPSN+LN   A  EGKR KKKKRNI+GKGA  DE+SSIRN LRYL+NRI+YEQSLI+AYSSEGWKGFSSDKLKPEKELQ
Subjt:  RTLVSSDRVLRSRTQEKAKAPEPSNDLNNFIAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQ

Query:  RASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDC
        RAS+EIMR KLKIRDLFQ +D+LCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDC
Subjt:  RASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDC

Query:  KDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDAIDQDNELSSDESSSDQLNSDQSSSDQSNSDTSGYASASE
        KDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAG+NSDH LGLPSDDSEDGDYDPD PD I+Q+               D+SSSDQS+SD SGYASASE
Subjt:  KDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDAIDQDNELSSDESSSDQLNSDQSSSDQSNSDTSGYASASE

Query:  GLEVPPNDDQYLGLPSDDSEDDDYDPSVPELDEGVRQESSSSDFTSDSEDLAALDNNCSSKDDNVVSSLNDTLPVENSNGKSS--GPSKSALHNELSSLL
         LE  PNDDQYLGLPSDDSEDDDY+P  PELDEGV+QESS SDFTSDSEDLAALD+               T PV NSNG+ S  GP  S LHNEL SLL
Subjt:  GLEVPPNDDQYLGLPSDDSEDDDYDPSVPELDEGVRQESSSSDFTSDSEDLAALDNNCSSKDDNVVSSLNDTLPVENSNGKSS--GPSKSALHNELSSLL

Query:  DSVPDKDDLEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDTYGS-TLDSSDDRGWDSSTRKRRPKTLVLALSNNGTNDDLTNVKTKRSYKRRTRQKPGA
        +S PDKD LEPVSGRRQVERLDYKKLHDETYGNVP+DSSDDT+GS ++DSSDDRG  S TRKR PK LV AL  NGTNDDL N KTKRSYKRRT QKPGA
Subjt:  DSVPDKDDLEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDTYGS-TLDSSDDRGWDSSTRKRRPKTLVLALSNNGTNDDLTNVKTKRSYKRRTRQKPGA

Query:  INVNNSVTETPVDTAKSSSSVRQSTSSSNRRLSQPALERLFASFQINEYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSS-GGNKAKSSSRMSIH
         N+ NSVT TP D+ KSSSSVR++ SSSNRRLSQPALERL ASFQ N+YPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSS   NKAKS+ RM I 
Subjt:  INVNNSVTETPVDTAKSSSSVRQSTSSSNRRLSQPALERLFASFQINEYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSS-GGNKAKSSSRMSIH

Query:  SSQASGELSKNEQESAKCFRDTDSNGAQHQDLPMVNSVVASCQSGDTGDKKSTTRKTKRAESSATKSRKRKGRSDNTASHSKDREGSPRPPAKSPKVNEM
        SS+ SG+L K EQES  CFRDTD+NGAQHQ  P  +  VA CQSGDT D K  T+KT R ES+ATKSRKRKGRSD+ ASHSKDR+ S +PPAKSPKVN++
Subjt:  SSQASGELSKNEQESAKCFRDTDSNGAQHQDLPMVNSVVASCQSGDTGDKKSTTRKTKRAESSATKSRKRKGRSDNTASHSKDREGSPRPPAKSPKVNEM

Query:  QTADRFKTRRRRSI
        QTAD+ +TRRRRSI
Subjt:  QTADRFKTRRRRSI

A0A6J1FNP3 homeobox protein HAT3.10.0e+0074.23Show/hide
Query:  MEERDENTDTESRPNKIAEAVQEAKASVEVEVLTCLSNEPMYS--GYQELGRTPEFSSKTDGPDEEKAGVQQNM-----ELGSGYLLSELSEKDNQTISN
        MEERDE   TESR    + AVQEAKASVEVEVLT L+NE M+S   Y ELG   +++SKT  PDEEK GV+QNM     ELG G     L EK +QTIS 
Subjt:  MEERDENTDTESRPNKIAEAVQEAKASVEVEVLTCLSNEPMYS--GYQELGRTPEFSSKTDGPDEEKAGVQQNM-----ELGSGYLLSELSEKDNQTISN

Query:  HADNDQVEAGNLLSSDKDTKNFKLSIEDETMTLLNECSELPLEDVTKNYIEKMNPPIEDLTQITCIQNLETIPKN-----------FEIKKKNYKLRTLV
         ADNDQ EAGNLLSSDKDT+N  L IE ET  LLNECSE P ED  KNYIE+ NPPIED  Q T I NL  +P N            + KKKNY LR+L+
Subjt:  HADNDQVEAGNLLSSDKDTKNFKLSIEDETMTLLNECSELPLEDVTKNYIEKMNPPIEDLTQITCIQNLETIPKN-----------FEIKKKNYKLRTLV

Query:  SSDRVLRSRTQEKAKAPEPSNDLNNFIAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASN
        SSDRVLRSRTQ+KAKAPEPSNDL+N  A EEGK  KKK R I+GKGARVDE+SSIRNHLRYL+NRI+YEQSLIEAYSSEGWKGFSSDKLKPEKELQRASN
Subjt:  SSDRVLRSRTQEKAKAPEPSNDLNNFIAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASN

Query:  EIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDC
        EIMRRKLKIRDLFQRIDALC+EGR SE+LFDSEGQIDSEDIFC KCGSKELSLENDIILCDG+CDRGFHQFCLEPPLLN+DIPPDDEGWLCPGCDCKDDC
Subjt:  EIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDC

Query:  LDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDAIDQDNELSSDESSSDQLNSDQSSSDQSNSDTSGY--ASASEGL
        +DLLNEFQGSNLSITDGWEKV+PEAAAAAAG++SDHT+ LPSDDS+DGDYDPDVPDAIDQD E  SD SS     SDQSSSD S+SD SGY  ASASE L
Subjt:  LDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDAIDQDNELSSDESSSDQLNSDQSSSDQSNSDTSGY--ASASEGL

Query:  EVPPNDDQYLGLPSDDSEDDDYDPSVPELDEGVRQESSSSDFTSDSEDLAALDNNCSSKDDNVVSS-LNDTLPVENSNGKSS--GPSKSALHNELSSLLD
        E PPNDDQYLGLPSDDSEDDDYDP  P  DEGV QESSSSDFTSDSEDLAAL +N SSKDDN+ SS LN+T+PV NS+G+SS  GP+K+A HN+LSSL+ 
Subjt:  EVPPNDDQYLGLPSDDSEDDDYDPSVPELDEGVRQESSSSDFTSDSEDLAALDNNCSSKDDNVVSS-LNDTLPVENSNGKSS--GPSKSALHNELSSLLD

Query:  SVPDKDDLEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDTYGS-TLDSSDDRGWDSSTRKRRPKTLVLALSNNGTNDDLTNVKTKRSYKRRTRQKPGAI
        S PD+  LE VSGRR VERLDYKKLHDET+GNVPTDSSDDTYGS ++DSSDDRG   STRK  PK  V ALS NGT DDL N+KTKRS K RTRQKP A 
Subjt:  SVPDKDDLEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDTYGS-TLDSSDDRGWDSSTRKRRPKTLVLALSNNGTNDDLTNVKTKRSYKRRTRQKPGAI

Query:  NVNNSVTETPVDTAKSSSSVRQSTSSSNRRLSQPALERLFASFQINEYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSGGNKAKSSSRMSIHSS
        N++NSVT+TP  T KSSSSVR++TSSS+RRLSQP LERL ASFQ N+YP+RATKESLA+ELGLSLKQVSKWFENTRWSTRHPSS  NKAKS+SRM   SS
Subjt:  NVNNSVTETPVDTAKSSSSVRQSTSSSNRRLSQPALERLFASFQINEYPKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSGGNKAKSSSRMSIHSS

Query:  QASGELSKNEQESAKCFRDTDSNGAQHQDLPMVNSVVASCQSGDTGDKKSTTRKTKRAESSATKSRKRKGRSDNTASHSKDREGSPRPPAKSPKVNEMQT
        Q S +  K EQES  CFRDT SNGAQHQ+ P   SVVA CQSG TGD K   +K KR ES+ATKSRKRKGRSD  AS SKDR+ S +PPAKS KV+E+QT
Subjt:  QASGELSKNEQESAKCFRDTDSNGAQHQDLPMVNSVVASCQSGDTGDKKSTTRKTKRAESSATKSRKRKGRSDNTASHSKDREGSPRPPAKSPKVNEMQT

Query:  ADRFKTRRRRSI
        AD+ K RRR+S+
Subjt:  ADRFKTRRRRSI

SwissProt top hitse value%identityAlignment
P46605 Homeobox protein HOX1A6.4e-10339.42Show/hide
Query:  EDLTQITCIQNLETIPKNFEIKKKNYK----------------LRTLVSSD---RVLRSRTQEKAKAPEPSNDLNNFIAEEEGKRKKKKKRNIQGKGARV
        E L     +   +T+P N  I +KNYK                  TL+SS+   RVLRS +  K  + E      +  A  +   K++K      K +  
Subjt:  EDLTQITCIQNLETIPKNFEIKKKNYK----------------LRTLVSSD---RVLRSRTQEKAKAPEPSNDLNNFIAEEEGKRKKKKKRNIQGKGARV

Query:  DEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSK
        DE+S IR  +RY+LNR+ YEQSLIEAY+SEGWK  S DK++PEKEL+RA +EI+R KL+IR++F+ ID+L ++G++ E+LFDSEG+I  EDIFC+ CGS 
Subjt:  DEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSK

Query:  ELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGD
        + +L NDIILCDG CDRGFHQ CL PPL   DIP  DEGWLCP CDCK DC+DL+NE  GSN+SI D WEKV+P+AAA A     D    LPSDDS+D D
Subjt:  ELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGD

Query:  YDPDVPDAIDQDNELSSDESSSDQLNSDQSSSDQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSEDDDYDPSVPELDEGVRQESSS--SDFTSDSEDL
        +DP++P    +++ +  DE SS++   +   SD  +SD    +  SE L     DD  L LPS+DSEDDDYDP+ P+ D+ V ++SSS  SDFTSDS+D 
Subjt:  YDPDVPDAIDQDNELSSDESSSDQLNSDQSSSDQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSEDDDYDPSVPELDEGVRQESSS--SDFTSDSEDL

Query:  AALDNNCSSKDDNVVSSLNDTLPVENSNGKSSGPSKSALHNELSSLLDSVPDKDDLEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDTYGS----TLDS
                S  D V S L   LP           +++   +     +++  D+  + P S RRQ ERLDYKKL+DE YG   +DSSDD   S     +  
Subjt:  AALDNNCSSKDDNVVSSLNDTLPVENSNGKSSGPSKSALHNELSSLLDSVPDKDDLEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDTYGS----TLDS

Query:  SDDRGWDSSTRKRRPKTLVLALSNNGTNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQSTSSSNRRLSQPAL-ERLFASFQINEY
        S++ G  +S   +  + +         ND+LT   TK+S            +++ SV E P D   + S+     S++ +    P + ++L   F+   Y
Subjt:  SDDRGWDSSTRKRRPKTLVLALSNNGTNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQSTSSSNRRLSQPAL-ERLFASFQINEY

Query:  PKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSGGNKAKSSSRMSIHSSQASGELSKNE------QESAKCFRDTDSNGAQHQDLPMVNSVVASCQS
        P R+ KESLA+ELGL+ +QV+KWFE  R S R  SS    +         +SQ +  +   E      +ES  C      NG        V+S V S   
Subjt:  PKRATKESLAQELGLSLKQVSKWFENTRWSTRHPSSGGNKAKSSSRMSIHSSQASGELSKNE------QESAKCFRDTDSNGAQHQDLPMVNSVVASCQS

Query:  GDT--GDKKSTTRKTKRAESSATKSRKR
        G    G K  +          A K+R++
Subjt:  GDT--GDKKSTTRKTKRAESSATKSRKR

P48785 Pathogenesis-related homeodomain protein5.1e-4433.16Show/hide
Query:  RKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSE
        RK+K KR  +     VD+   ++   RYLL +++ +Q+LI+AY++EGWKG S +K++P+KEL+RA  EI+  KL +RD  +++D L + G + E +  S+
Subjt:  RKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSE

Query:  GQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRN
        G I  + IFCA+C S+E   +NDIILCDG C+R FHQ CL+PPL    IPP D+GW C  CDCK + +D +N   G++  +   W+ ++ E A+   G  
Subjt:  GQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRN

Query:  S--DHTLGLPSDDSEDGDYDPDVPDAIDQDNELSSDESSSDQLNSDQSSSDQSNSDTSGYASASEGLE-----------VPPNDDQYLG-----------
        +  ++    PSDDS+D DYDP++ +    ++   S +   D  N ++S S   +  + G A ++   E              N++   G           
Subjt:  S--DHTLGLPSDDSEDGDYDPDVPDAIDQDNELSSDESSSDQLNSDQSSSDQSNSDTSGYASASEGLE-----------VPPNDDQYLG-----------

Query:  -----------LPSDDSEDDDYDPSVPELDEGVRQESSSSDFTSDSEDLAALDNNCSSKDDNVVSSL----NDTLPVENSNGK
                   L    SED+D+ P+    D   R+  S +  T     L  +  + S KD +VV +L     D++ VEN  G+
Subjt:  -----------LPSDDSEDDDYDPSVPELDEGVRQESSSSDFTSDSEDLAALDNNCSSKDDNVVSSL----NDTLPVENSNGK

P48786 Pathogenesis-related homeodomain protein3.2e-11845.34Show/hide
Query:  VSSDRVLRSRTQEKAKAPEPSNDLNNFIAEEEGKRKK-KKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRA
        V+S R LRSR+QEK+  P    D+NN +A+E   R+K +KKR  + +  RVDE+  IR HLRYLL+RI+YE++ ++AYS EGWKG S DK+KPEKEL+RA
Subjt:  VSSDRVLRSRTQEKAKAPEPSNDLNNFIAEEEGKRKK-KKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRA

Query:  SNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKD
          EI  RKLKIRDLFQR+D   +EGRL E LFDS G+IDSEDIFCAKCGSK+++L NDIILCDG CDRGFHQFCL+PPLL   IPPDDEGWLCPGC+CK 
Subjt:  SNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKD

Query:  DCLDLLNEFQGSNLSITDGWEKVY-PEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDAIDQDNELSSDESSSDQLNSDQSSSDQSNSDTSGYASASEG
        DC+ LLN+ Q +N+ + D WEKV+  EAAAAA+G+N D   GLPSDDSED DYDP  PD    D ++  D+SS+D+               S Y S S+ 
Subjt:  DCLDLLNEFQGSNLSITDGWEKVY-PEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDAIDQDNELSSDESSSDQLNSDQSSSDQSNSDTSGYASASEG

Query:  LEVPPNDDQYLGLPSDDSEDDDYDPSVPELDEGVRQESSSSDFTSDSEDLAALDNNCSSKDDNVVSSLNDTLPVENSNGKSSGPSKSAL-HNELSSLLDS
        ++V    +   GLPSDDSEDD+YDPS    D+ + ++SS SDFTSDSED              V     DT       GK+ GP  S   H   +     
Subjt:  LEVPPNDDQYLGLPSDDSEDDDYDPSVPELDEGVRQESSSSDFTSDSEDLAALDNNCSSKDDNVVSSLNDTLPVENSNGKSSGPSKSAL-HNELSSLLDS

Query:  VPDKDDLEPVSGRRQVERLDYKKLHD--------------------------ETYGNVPTDSSDDTYGSTLDSSDDRGWDSSTRKRRPKTLVLALSNNGT
         P++ D  P+  RRQVE LDYKKL+D                          E YGN  +DSSD+ Y  T          SS  K        A+     
Subjt:  VPDKDDLEPVSGRRQVERLDYKKLHD--------------------------ETYGNVPTDSSDDTYGSTLDSSDDRGWDSSTRKRRPKTLVLALSNNGT

Query:  NDDL-TNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQSTSSSNRRLSQPALERLFASFQINEYPKRATKESLAQELGLSLKQVSKWFENT
        + DL  + K + S   R   K  A+   +S      +   S++ V  S S+S     + A +RL  SF+ N+YP+RA KESLA EL LS++QVS WF N 
Subjt:  NDDL-TNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQSTSSSNRRLSQPALERLFASFQINEYPKRATKESLAQELGLSLKQVSKWFENT

Query:  RWSTRHPSS-GGNKAKSSSR---------------MSIHSSQASGELSKNEQESA
        RWS RH S  G + AK  S                 S+  S    E+ K EQ++A
Subjt:  RWSTRHPSS-GGNKAKSSSR---------------MSIHSSQASGELSKNEQESA

Q04996 Homeobox protein HAT3.11.1e-11848.25Show/hide
Query:  RTQEKAKAPEPSNDLNNFIAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLK
        R Q   +   PS+ + N  +   G+ KKK K   +G+    DEY+ I+  LRY LNRI YEQSLI+AYS EGWKG S +K++PEKEL+RA+ EI+RRKLK
Subjt:  RTQEKAKAPEPSNDLNNFIAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLK

Query:  IRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQ
        IRDLFQ +D LCAEG L ESLFD++G+I SEDIFCAKCGSK+LS++NDIILCDG CDRGFHQ+CLEPPL   DIPPDDEGWLCPGCDCKDD LDLLN+  
Subjt:  IRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQ

Query:  GSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDAIDQDNELSSDESSSDQLNSDQSSSDQSNSDTSGYASASEGL-----EVPPN
        G+  S++D WEK++PEAAAA  G   +    LPSDDS+D +YDPD  +    DNE  +DE  SD  ++++S ++  +SD + + SAS+ +     E    
Subjt:  GSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDAIDQDNELSSDESSSDQLNSDQSSSDQSNSDTSGYASASEGL-----EVPPN

Query:  DDQYLGLPSDDSEDDDYDPSVPELDEGVRQESSSSDFTSDSEDLAALDNNCSSKDDNVVSSLNDTLPVENSNGKSSGPSKSALHNELSSLLDSVPDKDD-
            + LPSDDSEDDDYDP  P  D+   +ESS+SD TSD+EDL       S K D       DT P+E+       P +     +  ++L+S    DD 
Subjt:  DDQYLGLPSDDSEDDDYDPSVPELDEGVRQESSSSDFTSDSEDLAALDNNCSSKDDNVVSSLNDTLPVENSNGKSSGPSKSALHNELSSLLDSVPDKDD-

Query:  LEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDTYGSTLDSSDDRGWDSSTRKRRPKTLVLALSNNGTNDDLTNV--KTKRSYKRRTRQKPGAINVNNSV
           VS RR VERLDYKKL+DE Y NVPT SSDD      D +   G + S  +    T+ L  S+N  +     +  K+KR+ K+ T + P         
Subjt:  LEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDTYGSTLDSSDDRGWDSSTRKRRPKTLVLALSNNGTNDDLTNV--KTKRSYKRRTRQKPGAINVNNSV

Query:  TETPVDTAKSSSSVRQSTSSSNRRLSQPALERLFASFQINEYPKRATKESLAQELGLSLKQVSKWFENTRWS
         E P +    S  + +S+SS+ ++ + P  +RL+ SFQ N+YP +ATKESLA+EL +++KQV+ WF++ RWS
Subjt:  TETPVDTAKSSSSVRQSTSSSNRRLSQPALERLFASFQINEYPKRATKESLAQELGLSLKQVSKWFENTRWS

Q8H991 Homeobox protein HAZ19.5e-10741.05Show/hide
Query:  EKMNPPIEDLTQITCIQNLETIPKNFEIKKKNYKLRTLVSSDRVLRSRTQEKAKAPEPSNDLNNFIAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRY
        +K   PI      T +Q +       + +K++  LR   S  RVLRS +++K KA    N+L N  A  +   KK+K       G   D+Y  IR  +RY
Subjt:  EKMNPPIEDLTQITCIQNLETIPKNFEIKKKNYKLRTLVSSDRVLRSRTQEKAKAPEPSNDLNNFIAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRY

Query:  LLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCD
        +LNR+ YEQSLI+AY+SEGWKG S +K++PEKEL+RA  EI+R K +IR+ F+ +D+L +EG+L ES+FDS G+I SEDIFCA CGSK+++L+NDIILCD
Subjt:  LLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCD

Query:  GICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDAIDQD
        GICDRGFHQ+CL PPLL  DIP  DEGWLCP CDCK DC+D+LNE QG  LSI D WEKV+PEAA+   G        LPSDDS D DYDP +      D
Subjt:  GICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDAIDQD

Query:  NELSSDESSSDQLNSDQSSSDQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSEDDDYDPSVPELDEGVRQESSS-----SDFTSDSEDLAALDNNCSS
         E SS E   + L+SD SSS+ S S     +  S+       DD  LGLPS+DSED D+DP+ P+ D+    ES+S     SDFTSDS+D  A       
Subjt:  NELSSDESSSDQLNSDQSSSDQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSEDDDYDPSVPELDEGVRQESSS-----SDFTSDSEDLAALDNNCSS

Query:  KDDNVVSSLNDTLPVENSNGKSSGPSKSALHNELSSLLDSVPDKDDLEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDT--YGSTLDSSDDRGWDSSTR
        +D+    S +    V+ ++G       +A ++ L + +++  ++D + P+S +RQVERLDYKKL++E YG   +DSSDD   YG   +S+ ++G      
Subjt:  KDDNVVSSLNDTLPVENSNGKSSGPSKSALHNELSSLLDSVPDKDDLEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDT--YGSTLDSSDDRGWDSSTR

Query:  KRRPKTLVLALSNNGTNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQSTSSSNRRLSQPALERLFASFQINEYPKRATKESLAQE
            +T  LA S  G          +      T   P  +    SV++   +   S+S+    +++ NR       ++L A F+ + YP RATKE+LAQE
Subjt:  KRRPKTLVLALSNNGTNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQSTSSSNRRLSQPALERLFASFQINEYPKRATKESLAQE

Query:  LGLSLKQVSKWFENTRWSTR------------HPSSGGNKAKSSSRMSIHSSQASGELSKNEQES
        LGL+  QV+KWF +TR   R            H +   N   +   + +  S     + +N+  S
Subjt:  LGLSLKQVSKWFENTRWSTR------------HPSSGGNKAKSSSRMSIHSSQASGELSKNEQES

Arabidopsis top hitse value%identityAlignment
AT3G19510.1 Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain7.7e-12048.25Show/hide
Query:  RTQEKAKAPEPSNDLNNFIAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLK
        R Q   +   PS+ + N  +   G+ KKK K   +G+    DEY+ I+  LRY LNRI YEQSLI+AYS EGWKG S +K++PEKEL+RA+ EI+RRKLK
Subjt:  RTQEKAKAPEPSNDLNNFIAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLK

Query:  IRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQ
        IRDLFQ +D LCAEG L ESLFD++G+I SEDIFCAKCGSK+LS++NDIILCDG CDRGFHQ+CLEPPL   DIPPDDEGWLCPGCDCKDD LDLLN+  
Subjt:  IRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQ

Query:  GSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDAIDQDNELSSDESSSDQLNSDQSSSDQSNSDTSGYASASEGL-----EVPPN
        G+  S++D WEK++PEAAAA  G   +    LPSDDS+D +YDPD  +    DNE  +DE  SD  ++++S ++  +SD + + SAS+ +     E    
Subjt:  GSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDAIDQDNELSSDESSSDQLNSDQSSSDQSNSDTSGYASASEGL-----EVPPN

Query:  DDQYLGLPSDDSEDDDYDPSVPELDEGVRQESSSSDFTSDSEDLAALDNNCSSKDDNVVSSLNDTLPVENSNGKSSGPSKSALHNELSSLLDSVPDKDD-
            + LPSDDSEDDDYDP  P  D+   +ESS+SD TSD+EDL       S K D       DT P+E+       P +     +  ++L+S    DD 
Subjt:  DDQYLGLPSDDSEDDDYDPSVPELDEGVRQESSSSDFTSDSEDLAALDNNCSSKDDNVVSSLNDTLPVENSNGKSSGPSKSALHNELSSLLDSVPDKDD-

Query:  LEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDTYGSTLDSSDDRGWDSSTRKRRPKTLVLALSNNGTNDDLTNV--KTKRSYKRRTRQKPGAINVNNSV
           VS RR VERLDYKKL+DE Y NVPT SSDD      D +   G + S  +    T+ L  S+N  +     +  K+KR+ K+ T + P         
Subjt:  LEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDTYGSTLDSSDDRGWDSSTRKRRPKTLVLALSNNGTNDDLTNV--KTKRSYKRRTRQKPGAINVNNSV

Query:  TETPVDTAKSSSSVRQSTSSSNRRLSQPALERLFASFQINEYPKRATKESLAQELGLSLKQVSKWFENTRWS
         E P +    S  + +S+SS+ ++ + P  +RL+ SFQ N+YP +ATKESLA+EL +++KQV+ WF++ RWS
Subjt:  TETPVDTAKSSSSVRQSTSSSNRRLSQPALERLFASFQINEYPKRATKESLAQELGLSLKQVSKWFENTRWS

AT4G29940.1 pathogenesis related homeodomain protein A3.7e-4533.16Show/hide
Query:  RKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSE
        RK+K KR  +     VD+   ++   RYLL +++ +Q+LI+AY++EGWKG S +K++P+KEL+RA  EI+  KL +RD  +++D L + G + E +  S+
Subjt:  RKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSE

Query:  GQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRN
        G I  + IFCA+C S+E   +NDIILCDG C+R FHQ CL+PPL    IPP D+GW C  CDCK + +D +N   G++  +   W+ ++ E A+   G  
Subjt:  GQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRN

Query:  S--DHTLGLPSDDSEDGDYDPDVPDAIDQDNELSSDESSSDQLNSDQSSSDQSNSDTSGYASASEGLE-----------VPPNDDQYLG-----------
        +  ++    PSDDS+D DYDP++ +    ++   S +   D  N ++S S   +  + G A ++   E              N++   G           
Subjt:  S--DHTLGLPSDDSEDGDYDPDVPDAIDQDNELSSDESSSDQLNSDQSSSDQSNSDTSGYASASEGLE-----------VPPNDDQYLG-----------

Query:  -----------LPSDDSEDDDYDPSVPELDEGVRQESSSSDFTSDSEDLAALDNNCSSKDDNVVSSL----NDTLPVENSNGK
                   L    SED+D+ P+    D   R+  S +  T     L  +  + S KD +VV +L     D++ VEN  G+
Subjt:  -----------LPSDDSEDDDYDPSVPELDEGVRQESSSSDFTSDSEDLAALDNNCSSKDDNVVSSL----NDTLPVENSNGK

AT4G29940.2 pathogenesis related homeodomain protein A3.7e-4533.16Show/hide
Query:  RKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSE
        RK+K KR  +     VD+   ++   RYLL +++ +Q+LI+AY++EGWKG S +K++P+KEL+RA  EI+  KL +RD  +++D L + G + E +  S+
Subjt:  RKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSE

Query:  GQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRN
        G I  + IFCA+C S+E   +NDIILCDG C+R FHQ CL+PPL    IPP D+GW C  CDCK + +D +N   G++  +   W+ ++ E A+   G  
Subjt:  GQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRN

Query:  S--DHTLGLPSDDSEDGDYDPDVPDAIDQDNELSSDESSSDQLNSDQSSSDQSNSDTSGYASASEGLE-----------VPPNDDQYLG-----------
        +  ++    PSDDS+D DYDP++ +    ++   S +   D  N ++S S   +  + G A ++   E              N++   G           
Subjt:  S--DHTLGLPSDDSEDGDYDPDVPDAIDQDNELSSDESSSDQLNSDQSSSDQSNSDTSGYASASEGLE-----------VPPNDDQYLG-----------

Query:  -----------LPSDDSEDDDYDPSVPELDEGVRQESSSSDFTSDSEDLAALDNNCSSKDDNVVSSL----NDTLPVENSNGK
                   L    SED+D+ P+    D   R+  S +  T     L  +  + S KD +VV +L     D++ VEN  G+
Subjt:  -----------LPSDDSEDDDYDPSVPELDEGVRQESSSSDFTSDSEDLAALDNNCSSKDDNVVSSL----NDTLPVENSNGK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAAAGAGATGAAAATACTGGTACAGAATCAAGACCTAACAATAATGCTGAAGCTGTACAAGAAGCCAAGGCCAGCGACAATATGGAAGAAAGAGATGAAAATAC
TGATACAGAATCAAGACCTAATAATAATATTGAAGCCGTACAAGAAGCCAATGCCAGTGACGATATGGAAGGAAGAGATGAAAATACTGTTACACATTCAATACCTAACA
ATAATGCTGAAGCTGTAGAAGAAGCCAAGGCCAGTGACAATATGGAAGAAAGAGAGGAAAATACTGGTACAGAATCAAGACCTAATAATAATGCTGAAGCTGTAGAAGAA
GCCAAGGCCAGTGACAATATGAAAGAAAGAGATGAAAATACTGATACAGAATCAAGTAATAATAATGCTGAAGCCGCACAAGAAGGCAGGGTCGGTGACAATATGGAAGA
AAGAGATGAAAATACAGATACAGAATCAAGACCTAATAAAATTGCTGAGGCCGTACAAGAAGCCAAGGCCAGTGTTGAAGTTGAAGTGCTAACTTGTCTTTCAAATGAGC
CAATGTATTCTGGTTATCAGGAGTTGGGAAGAACTCCAGAGTTTTCCAGCAAAACTGACGGTCCAGATGAAGAAAAAGCAGGGGTCCAGCAGAATATGGAACTTGGTTCG
GGATATTTGCTTAGTGAGTTGTCAGAAAAAGATAATCAGACCATCTCTAATCATGCTGATAATGATCAAGTTGAAGCTGGCAATTTATTATCTAGTGATAAAGATACTAA
AAATTTTAAATTATCTATTGAAGATGAGACAATGACTCTTCTTAATGAGTGCTCGGAACTTCCTCTCGAAGATGTCACAAAAAATTATATCGAAAAGATGAACCCTCCTA
TTGAAGACTTAACTCAAATTACTTGTATCCAAAATTTAGAAACAATCCCCAAGAATTTTGAAATCAAAAAGAAAAATTATAAGTTAAGGACCCTTGTAAGTAGTGACAGA
GTTTTGCGTTCAAGAACCCAAGAGAAAGCTAAAGCTCCTGAACCAAGTAATGACTTGAATAATTTTATTGCTGAAGAGGAGGGAAAAAGGAAGAAGAAGAAGAAGAGAAA
TATACAAGGAAAGGGAGCAAGAGTGGATGAGTATTCATCAATCAGGAATCATTTGAGATATTTACTGAACCGCATCAGATATGAACAGAGCTTGATTGAAGCTTATTCTA
GTGAAGGCTGGAAAGGGTTCAGCTCAGATAAATTGAAGCCTGAAAAGGAACTTCAGCGGGCATCAAACGAAATAATGCGACGCAAATTGAAAATAAGAGATCTATTTCAA
CGTATTGATGCACTTTGTGCTGAAGGGAGGCTTTCTGAATCTTTATTCGATTCTGAAGGACAGATAGACAGCGAGGATATATTCTGTGCAAAATGTGGATCCAAAGAACT
GTCCCTTGAGAATGACATCATATTATGTGATGGCATTTGTGATCGTGGGTTCCATCAGTTCTGTTTAGAACCACCTTTGCTAAACACAGACATTCCACCGGATGATGAGG
GATGGCTGTGCCCCGGATGTGATTGCAAAGATGACTGCTTGGATCTTCTCAATGAATTTCAAGGATCAAATCTTTCAATCACTGATGGTTGGGAGAAAGTCTATCCCGAG
GCGGCAGCAGCAGCAGCTGGACGAAATTCTGATCACACCTTAGGTCTTCCTTCAGATGATTCTGAAGATGGTGATTATGATCCTGATGTTCCAGATGCTATTGACCAGGA
CAATGAATTGAGTTCTGATGAATCAAGTTCTGATCAATTGAACTCTGATCAATCGAGCTCTGATCAATCAAACTCTGATACATCTGGGTATGCTTCTGCTTCTGAGGGAT
TAGAGGTTCCACCCAATGATGACCAGTACTTAGGTCTCCCTTCTGATGACTCGGAGGATGATGACTATGATCCCAGTGTTCCAGAACTTGATGAGGGTGTTAGACAGGAA
AGTTCAAGTTCTGACTTTACATCTGATTCTGAGGATCTAGCTGCCCTTGACAATAACTGTTCTTCCAAAGATGATAACGTTGTGTCTTCATTAAATGATACTTTACCTGT
GGAAAACTCTAATGGGAAAAGTTCCGGTCCTAGCAAGAGTGCACTACATAATGAGCTATCAAGTCTACTAGACTCTGTTCCTGATAAGGATGATCTTGAACCTGTTTCGG
GAAGAAGGCAGGTTGAACGGTTGGATTATAAGAAGCTCCATGATGAGACATACGGGAACGTTCCTACTGACTCAAGCGATGACACGTATGGGAGTACTTTGGACTCAAGT
GATGACAGAGGCTGGGATAGTAGTACAAGGAAGAGACGTCCTAAAACTCTGGTTCTTGCATTGTCAAACAATGGAACTAATGATGATTTGACTAATGTAAAAACTAAACG
CAGTTATAAGAGGAGAACTCGTCAAAAGCCAGGTGCTATAAATGTGAATAATTCTGTGACTGAAACTCCTGTAGACACTGCAAAATCTAGTTCTTCTGTTAGGCAAAGCA
CATCATCATCAAATAGAAGACTCAGTCAACCTGCATTGGAGAGACTTTTTGCATCATTCCAAATAAATGAGTATCCTAAACGAGCTACAAAGGAGAGTTTAGCACAAGAA
CTAGGCCTCAGTCTGAAGCAGGTTAGCAAATGGTTTGAGAACACGAGATGGAGCACACGCCATCCCTCAAGCGGTGGTAATAAGGCAAAAAGTTCCTCAAGAATGAGCAT
TCATTCATCTCAGGCAAGTGGAGAACTATCCAAGAACGAGCAAGAATCTGCAAAATGTTTCAGAGATACCGATAGCAATGGTGCTCAACATCAAGACTTACCAATGGTAA
ATAGTGTTGTGGCCTCATGTCAGAGTGGGGATACAGGGGATAAGAAATCGACAACTCGGAAAACTAAAAGAGCAGAATCTTCTGCCACAAAATCCAGAAAACGGAAGGGC
AGGTCAGATAACACGGCATCGCATTCAAAAGACAGGGAGGGATCACCAAGGCCTCCTGCCAAGTCACCTAAAGTTAATGAAATGCAAACAGCAGATAGGTTTAAGACAAG
GAGGAGGAGATCCATTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAGAAAGAGATGAAAATACTGGTACAGAATCAAGACCTAACAATAATGCTGAAGCTGTACAAGAAGCCAAGGCCAGCGACAATATGGAAGAAAGAGATGAAAATAC
TGATACAGAATCAAGACCTAATAATAATATTGAAGCCGTACAAGAAGCCAATGCCAGTGACGATATGGAAGGAAGAGATGAAAATACTGTTACACATTCAATACCTAACA
ATAATGCTGAAGCTGTAGAAGAAGCCAAGGCCAGTGACAATATGGAAGAAAGAGAGGAAAATACTGGTACAGAATCAAGACCTAATAATAATGCTGAAGCTGTAGAAGAA
GCCAAGGCCAGTGACAATATGAAAGAAAGAGATGAAAATACTGATACAGAATCAAGTAATAATAATGCTGAAGCCGCACAAGAAGGCAGGGTCGGTGACAATATGGAAGA
AAGAGATGAAAATACAGATACAGAATCAAGACCTAATAAAATTGCTGAGGCCGTACAAGAAGCCAAGGCCAGTGTTGAAGTTGAAGTGCTAACTTGTCTTTCAAATGAGC
CAATGTATTCTGGTTATCAGGAGTTGGGAAGAACTCCAGAGTTTTCCAGCAAAACTGACGGTCCAGATGAAGAAAAAGCAGGGGTCCAGCAGAATATGGAACTTGGTTCG
GGATATTTGCTTAGTGAGTTGTCAGAAAAAGATAATCAGACCATCTCTAATCATGCTGATAATGATCAAGTTGAAGCTGGCAATTTATTATCTAGTGATAAAGATACTAA
AAATTTTAAATTATCTATTGAAGATGAGACAATGACTCTTCTTAATGAGTGCTCGGAACTTCCTCTCGAAGATGTCACAAAAAATTATATCGAAAAGATGAACCCTCCTA
TTGAAGACTTAACTCAAATTACTTGTATCCAAAATTTAGAAACAATCCCCAAGAATTTTGAAATCAAAAAGAAAAATTATAAGTTAAGGACCCTTGTAAGTAGTGACAGA
GTTTTGCGTTCAAGAACCCAAGAGAAAGCTAAAGCTCCTGAACCAAGTAATGACTTGAATAATTTTATTGCTGAAGAGGAGGGAAAAAGGAAGAAGAAGAAGAAGAGAAA
TATACAAGGAAAGGGAGCAAGAGTGGATGAGTATTCATCAATCAGGAATCATTTGAGATATTTACTGAACCGCATCAGATATGAACAGAGCTTGATTGAAGCTTATTCTA
GTGAAGGCTGGAAAGGGTTCAGCTCAGATAAATTGAAGCCTGAAAAGGAACTTCAGCGGGCATCAAACGAAATAATGCGACGCAAATTGAAAATAAGAGATCTATTTCAA
CGTATTGATGCACTTTGTGCTGAAGGGAGGCTTTCTGAATCTTTATTCGATTCTGAAGGACAGATAGACAGCGAGGATATATTCTGTGCAAAATGTGGATCCAAAGAACT
GTCCCTTGAGAATGACATCATATTATGTGATGGCATTTGTGATCGTGGGTTCCATCAGTTCTGTTTAGAACCACCTTTGCTAAACACAGACATTCCACCGGATGATGAGG
GATGGCTGTGCCCCGGATGTGATTGCAAAGATGACTGCTTGGATCTTCTCAATGAATTTCAAGGATCAAATCTTTCAATCACTGATGGTTGGGAGAAAGTCTATCCCGAG
GCGGCAGCAGCAGCAGCTGGACGAAATTCTGATCACACCTTAGGTCTTCCTTCAGATGATTCTGAAGATGGTGATTATGATCCTGATGTTCCAGATGCTATTGACCAGGA
CAATGAATTGAGTTCTGATGAATCAAGTTCTGATCAATTGAACTCTGATCAATCGAGCTCTGATCAATCAAACTCTGATACATCTGGGTATGCTTCTGCTTCTGAGGGAT
TAGAGGTTCCACCCAATGATGACCAGTACTTAGGTCTCCCTTCTGATGACTCGGAGGATGATGACTATGATCCCAGTGTTCCAGAACTTGATGAGGGTGTTAGACAGGAA
AGTTCAAGTTCTGACTTTACATCTGATTCTGAGGATCTAGCTGCCCTTGACAATAACTGTTCTTCCAAAGATGATAACGTTGTGTCTTCATTAAATGATACTTTACCTGT
GGAAAACTCTAATGGGAAAAGTTCCGGTCCTAGCAAGAGTGCACTACATAATGAGCTATCAAGTCTACTAGACTCTGTTCCTGATAAGGATGATCTTGAACCTGTTTCGG
GAAGAAGGCAGGTTGAACGGTTGGATTATAAGAAGCTCCATGATGAGACATACGGGAACGTTCCTACTGACTCAAGCGATGACACGTATGGGAGTACTTTGGACTCAAGT
GATGACAGAGGCTGGGATAGTAGTACAAGGAAGAGACGTCCTAAAACTCTGGTTCTTGCATTGTCAAACAATGGAACTAATGATGATTTGACTAATGTAAAAACTAAACG
CAGTTATAAGAGGAGAACTCGTCAAAAGCCAGGTGCTATAAATGTGAATAATTCTGTGACTGAAACTCCTGTAGACACTGCAAAATCTAGTTCTTCTGTTAGGCAAAGCA
CATCATCATCAAATAGAAGACTCAGTCAACCTGCATTGGAGAGACTTTTTGCATCATTCCAAATAAATGAGTATCCTAAACGAGCTACAAAGGAGAGTTTAGCACAAGAA
CTAGGCCTCAGTCTGAAGCAGGTTAGCAAATGGTTTGAGAACACGAGATGGAGCACACGCCATCCCTCAAGCGGTGGTAATAAGGCAAAAAGTTCCTCAAGAATGAGCAT
TCATTCATCTCAGGCAAGTGGAGAACTATCCAAGAACGAGCAAGAATCTGCAAAATGTTTCAGAGATACCGATAGCAATGGTGCTCAACATCAAGACTTACCAATGGTAA
ATAGTGTTGTGGCCTCATGTCAGAGTGGGGATACAGGGGATAAGAAATCGACAACTCGGAAAACTAAAAGAGCAGAATCTTCTGCCACAAAATCCAGAAAACGGAAGGGC
AGGTCAGATAACACGGCATCGCATTCAAAAGACAGGGAGGGATCACCAAGGCCTCCTGCCAAGTCACCTAAAGTTAATGAAATGCAAACAGCAGATAGGTTTAAGACAAG
GAGGAGGAGATCCATTTAG
Protein sequenceShow/hide protein sequence
MEERDENTGTESRPNNNAEAVQEAKASDNMEERDENTDTESRPNNNIEAVQEANASDDMEGRDENTVTHSIPNNNAEAVEEAKASDNMEEREENTGTESRPNNNAEAVEE
AKASDNMKERDENTDTESSNNNAEAAQEGRVGDNMEERDENTDTESRPNKIAEAVQEAKASVEVEVLTCLSNEPMYSGYQELGRTPEFSSKTDGPDEEKAGVQQNMELGS
GYLLSELSEKDNQTISNHADNDQVEAGNLLSSDKDTKNFKLSIEDETMTLLNECSELPLEDVTKNYIEKMNPPIEDLTQITCIQNLETIPKNFEIKKKNYKLRTLVSSDR
VLRSRTQEKAKAPEPSNDLNNFIAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQ
RIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPE
AAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDAIDQDNELSSDESSSDQLNSDQSSSDQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSEDDDYDPSVPELDEGVRQE
SSSSDFTSDSEDLAALDNNCSSKDDNVVSSLNDTLPVENSNGKSSGPSKSALHNELSSLLDSVPDKDDLEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDTYGSTLDSS
DDRGWDSSTRKRRPKTLVLALSNNGTNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQSTSSSNRRLSQPALERLFASFQINEYPKRATKESLAQE
LGLSLKQVSKWFENTRWSTRHPSSGGNKAKSSSRMSIHSSQASGELSKNEQESAKCFRDTDSNGAQHQDLPMVNSVVASCQSGDTGDKKSTTRKTKRAESSATKSRKRKG
RSDNTASHSKDREGSPRPPAKSPKVNEMQTADRFKTRRRRSI