; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0025228 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0025228
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Descriptionhomeobox protein HAT3.1-like
Genome locationchr06:5112335..5124246
RNA-Seq ExpressionIVF0025228
SyntenyIVF0025228
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0003677 - DNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001356 - Homeobox domain
IPR001965 - Zinc finger, PHD-type
IPR009057 - Homeobox-like domain superfamily
IPR011011 - Zinc finger, FYVE/PHD-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR017970 - Homeobox, conserved site
IPR019786 - Zinc finger, PHD-type, conserved site
IPR019787 - Zinc finger, PHD-finger


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008456177.1 PREDICTED: pathogenesis-related homeodomain protein [Cucumis melo]0.096.62Show/hide
Query:  MEERDENTGTESRPNNNAEAVQEAKASDNMKERDGNTYTESRPNNNAEAAQEGKASDNMEERDENTDTESRPNKIAEAVQEAKASVEVEVRTCLSNEPMY
        MEERDENTGTESRPNNNAEAVQEAKASDNMKERDGNTYTESRPNNNAEAAQEGKASDNMEERDENTDTESRPNKIAEAVQEAKASVEVEVRTCLSNEPMY
Subjt:  MEERDENTGTESRPNNNAEAVQEAKASDNMKERDGNTYTESRPNNNAEAAQEGKASDNMEERDENTDTESRPNKIAEAVQEAKASVEVEVRTCLSNEPMY

Query:  SGYQELGTTPEFSRKTDGPDEEKAGVQQNMELGSGYLLSELSEKDNQTISNHADNDQVEAGNSLSIDKDTKNLKLSIEDETTTLLNECSELPLEDVTKNY
        SGYQELGTTPEFSRKTDGPDEEKAGVQQNMELGSGYLLSELSEKDNQTISNHADNDQVEAGNSLSIDKDTKNLKLSIEDETTTLLNECSELPLEDVTKNY
Subjt:  SGYQELGTTPEFSRKTDGPDEEKAGVQQNMELGSGYLLSELSEKDNQTISNHADNDQVEAGNSLSIDKDTKNLKLSIEDETTTLLNECSELPLEDVTKNY

Query:  IEKMNPPIEDLTQITSIQSLETIP-----------RFFKSKKKNYKLRSLVSSDRVLRSRTQEKAKAPEPSNDLNNFTAEEEGKRKKKKKRNIQGKGARV
        IEKMNPPIEDLTQITSIQSLETIP           RFFKSKKKNYKLRSLVSSDRVLRSRTQEKAKAPEPSNDLNNFTAEEEGKRKKKKKRNIQGKGARV
Subjt:  IEKMNPPIEDLTQITSIQSLETIP-----------RFFKSKKKNYKLRSLVSSDRVLRSRTQEKAKAPEPSNDLNNFTAEEEGKRKKKKKRNIQGKGARV

Query:  DEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSK
        DEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSK
Subjt:  DEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSK

Query:  ELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAGRNSDDTLGLPSDDSEDGDY
        ELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAGRNSDDTLGLPSDDSEDGDY
Subjt:  ELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAGRNSDDTLGLPSDDSEDGDY

Query:  DPDIPDTIDQDNELSSDESSSDQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSEDNDYDPSVPELDEGDRQESSSSDFTSDSEDLAALENNCSSKMMT
        DPDIPDTIDQDNELSSDESSSDQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSEDNDYDPSVPELDEGDRQESSSSDFTSDSEDLAALENNCSSK   
Subjt:  DPDIPDTIDQDNELSSDESSSDQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSEDNDYDPSVPELDEGDRQESSSSDFTSDSEDLAALENNCSSKMMT

Query:  F-----------------SGPSKSTLHNELSSLLDSGLDKDGLEPISGRRQVERLDYKKLHDETYGNVPTESSDDTYGSTLDSSDDRGCDSGTRKRGPKT
                          SGPSKSTLHNELSSLLDSGLDKDGLEPISGRRQVERLDYKKLHDETYGNVPTESSDDTYGSTLDSSDDRGCDSGTRKRGPKT
Subjt:  F-----------------SGPSKSTLHNELSSLLDSGLDKDGLEPISGRRQVERLDYKKLHDETYGNVPTESSDDTYGSTLDSSDDRGCDSGTRKRGPKT

Query:  LVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQCTSSSNRRLSQPALERLFASFQENEYPKRATKESLAQELGLNLK
        LVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQCTSSSNRRLSQPALERLFASFQENEYPKRATKESLAQELGLNLK
Subjt:  LVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQCTSSSNRRLSQPALERLFASFQENEYPKRATKESLAQELGLNLK

Query:  QVSKWFENTRWSTRHPSSGGKKAKSSSRMSIHLSQASGELSKNEQESATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDKKLTTRKTKRGESSATKSR
        QVSKWFENTRWSTRHPSSGGKKAKSSSRMSIHLSQASGELSKNEQESATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDKKLTTRKTKRGESSATKSR
Subjt:  QVSKWFENTRWSTRHPSSGGKKAKSSSRMSIHLSQASGELSKNEQESATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDKKLTTRKTKRGESSATKSR

Query:  KRKGRSDNTASNSKDREGSPRPPAKSPKVNETQTADRFKTRRRRSI
        KRKGRSDNTASNSKDREGSPRPPAKSPKVNETQTADRFKTRRRRSI
Subjt:  KRKGRSDNTASNSKDREGSPRPPAKSPKVNETQTADRFKTRRRRSI

XP_011651230.2 homeobox protein HOX1A [Cucumis sativus]0.091.18Show/hide
Query:  MEERDENTGTESRPNNNAEAVQEAKASDNMKERDGNTYTESRPNNNAEAAQEGKASDNMEERDENTDTESRPNKIAEAVQEAKASVEVEVRTCLSNEPMY
        MEERDE+TGTESRPNNNAEAVQEAKASDNMKERD NT TESRPNNNAEAAQEGKASDNMEERDENTDTESRPNKIAEAVQEAKASVEVEV TCLSNE  Y
Subjt:  MEERDENTGTESRPNNNAEAVQEAKASDNMKERDGNTYTESRPNNNAEAAQEGKASDNMEERDENTDTESRPNKIAEAVQEAKASVEVEVRTCLSNEPMY

Query:  SGYQELGTTPEFSRKTDGPDEEKAGVQQNMELGSGYLLSELSEKDNQTISNHADNDQVEAGNSLSIDKDTKNLKLSIEDETTTLLNECSELPLEDVTKNY
        SGYQELGTTPEFS K DGPDEEKAGVQQNMELGSGYLLSELSEKDNQTISNHADND+VEAGN LS DKDTKNLKLSIEDE TTLLNECSELPLEDVTKNY
Subjt:  SGYQELGTTPEFSRKTDGPDEEKAGVQQNMELGSGYLLSELSEKDNQTISNHADNDQVEAGNSLSIDKDTKNLKLSIEDETTTLLNECSELPLEDVTKNY

Query:  IEKMNPPIEDLTQITSIQSLETIPR-----------FFKSKKKNYKLRSLVSSDRVLRSRTQEKAKAPEPSNDLNNFTAEEEGKRKKKKKRNIQGKGARV
        IEKMNPPI DLTQITSIQSLETIP            F KSKKKNYKLRS VSSDRVLRSRTQEKAKAPE SNDLNNFTAEE+GKRKKKKKRNIQGKGARV
Subjt:  IEKMNPPIEDLTQITSIQSLETIPR-----------FFKSKKKNYKLRSLVSSDRVLRSRTQEKAKAPEPSNDLNNFTAEEEGKRKKKKKRNIQGKGARV

Query:  DEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSK
        DEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRID LCAEGRLSESLFDSEGQIDSEDIFCAKCGSK
Subjt:  DEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSK

Query:  ELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAA-GRNSDDTLGLPSDDSEDGD
        ELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAA GRNSD TLGLPSDDSEDGD
Subjt:  ELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAA-GRNSDDTLGLPSDDSEDGD

Query:  YDPDIPDTIDQDNELSSDESSSDQSNSD-----TSGYASASEGLEVPPNDDQYLGLPSDDSEDNDYDPSVPELDEGDRQESSSSDFTSDSEDLAALENNC
        YDPD+PDTIDQDNELSSDESSSDQSNSD     TSGYASASEGLEV  NDDQYLGLPSDDSEDNDYDPSVPELDEG RQESSSSDFTSDSEDLAAL+NNC
Subjt:  YDPDIPDTIDQDNELSSDESSSDQSNSD-----TSGYASASEGLEVPPNDDQYLGLPSDDSEDNDYDPSVPELDEGDRQESSSSDFTSDSEDLAALENNC

Query:  SSKMMTF-----------------SGPSKSTLHNELSSLLDSGLDKDGLEPISGRRQVERLDYKKLHDETYGNVPTESSDDTYGSTLDSSDDRGCDSGTR
        SSK                     SGP+KS LHNELSSLLDSG DKDGLEP+SGRRQVERLDYKKLHDETYGNVPT+SSDDTYGSTLDSSDDRG DSGTR
Subjt:  SSKMMTF-----------------SGPSKSTLHNELSSLLDSGLDKDGLEPISGRRQVERLDYKKLHDETYGNVPTESSDDTYGSTLDSSDDRGCDSGTR

Query:  KRGPKTLVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQCTSSSNRRLSQPALERLFASFQENEYPKRATKESLAQE
        KRGPKTLVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSV++ TSSSNRRLSQPALERL ASFQENEYPKRATK+SLAQE
Subjt:  KRGPKTLVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQCTSSSNRRLSQPALERLFASFQENEYPKRATKESLAQE

Query:  LGLNLKQVSKWFENTRWSTRHPSSGGKKAKSSSRMSIHLSQASGELSKNEQESATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDKKLTTRKTKRGES
        LGL LKQVSKWFENTRWSTRHPSS GKKAKSSSRMSI+LSQASGELSKNE ESATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDKKL++RKTKR +S
Subjt:  LGLNLKQVSKWFENTRWSTRHPSSGGKKAKSSSRMSIHLSQASGELSKNEQESATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDKKLTTRKTKRGES

Query:  SATKSRKRKGRSDNTASNSKDREGSPRPPAKSPKVNETQTADRFKTRRRRSI
        SATKSRKRKGRSDNTAS+SKDREGSPRPPAKSPKVNE QTADRFKTRRRRSI
Subjt:  SATKSRKRKGRSDNTASNSKDREGSPRPPAKSPKVNETQTADRFKTRRRRSI

XP_022149322.1 homeobox protein HAT3.1 isoform X1 [Momordica charantia]0.075.62Show/hide
Query:  MEERDENTDTESRPNKIAEAVQEAKASVEVEVRTCLSNEPMYS--GYQELGTTPEFSRKTDGPDEEKAGVQQNME-----LGSGYLLSELSEKDNQTISN
        MEER E T  E RPN   EAVQEAKASVEV   TC SNE M+S    QELGTTPE + KT GPD+EK+GVQQNME     LGSG +LSEL EK+NQTIS 
Subjt:  MEERDENTDTESRPNKIAEAVQEAKASVEVEVRTCLSNEPMYS--GYQELGTTPEFSRKTDGPDEEKAGVQQNME-----LGSGYLLSELSEKDNQTISN

Query:  HADNDQVEAGNSLSIDKDTKNLKLSIEDETTTLLNECSELPLEDVTKNYIEKMNPPIEDLTQITSIQSLETIP---------------RFFKSKKKNYKL
         A+ DQVEAGN LS D +T+NL L IE ETTTL NECSELP ED  KN I+++NPPIEDLTQ TSIQ LET+P               +  KSKKKNY L
Subjt:  HADNDQVEAGNSLSIDKDTKNLKLSIEDETTTLLNECSELPLEDVTKNYIEKMNPPIEDLTQITSIQSLETIP---------------RFFKSKKKNYKL

Query:  RSLVSSDRVLRSRTQEKAKAPEPSNDLNNFTAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQ
        RSLVSSDRVLRSRTQEKAKAPEPSN+LN  TA E GKRKKKK RNI+GKGA  DE+SSIRN LRYL+NRI+YEQSLI+AYSSEGWKGFSSDKLKPEKELQ
Subjt:  RSLVSSDRVLRSRTQEKAKAPEPSNDLNNFTAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQ

Query:  RASNEIMRRKLKIRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDC
        RAS+EIMR KLKIRDLFQ +D+LCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDC
Subjt:  RASNEIMRRKLKIRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDC

Query:  KDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAA-GRNSDDTLGLPSDDSEDGDYDPDIPDTIDQDNELSSDESSSDQSNSDTSGYASASEGLEVPPNDDQ
        KDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAA G+NSD  LGLPSDDSEDGDYDPD PDTI+Q+     DESSSDQS+SD SGYASASE LE  PNDDQ
Subjt:  KDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAA-GRNSDDTLGLPSDDSEDGDYDPDIPDTIDQDNELSSDESSSDQSNSDTSGYASASEGLEVPPNDDQ

Query:  YLGLPSDDSEDNDYDPSVPELDEGDRQESSSSDFTSDSEDLAALENNC-----SSKMMTFSGPSKSTLHNELSSLLDSGLDKDGLEPISGRRQVERLDYK
        YLGLPSDDSED+DY+P  PELDEG +QESS SDFTSDSEDLAAL++       S+   +  GP  S LHNEL SLL+SG DKDGLEP+SGRRQVERLDYK
Subjt:  YLGLPSDDSEDNDYDPSVPELDEGDRQESSSSDFTSDSEDLAALENNC-----SSKMMTFSGPSKSTLHNELSSLLDSGLDKDGLEPISGRRQVERLDYK

Query:  KLHDETYGNVPTESSDDTYGS-TLDSSDDRGCDSGTRKRGPKTLVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQC
        KLHDETYGNVP++SSDDT+GS ++DSSDDRG  S TRKR PK LV AL  NG+NDDL N KTKRSYKRRT QKPGA N+ NSVT TP D+ KSSSSVR+ 
Subjt:  KLHDETYGNVPTESSDDTYGS-TLDSSDDRGCDSGTRKRGPKTLVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQC

Query:  TSSSNRRLSQPALERLFASFQENEYPKRATKESLAQELGLNLKQVSKWFENTRWSTRHPSS-GGKKAKSSSRMSIHLSQASGELSKNEQESATCFRDTDS
         SSSNRRLSQPALERL ASFQEN+YPKRATKESLAQELGL+LKQVSKWFENTRWSTRHPSS    KAKS+ RM I  S+ SG+L K EQES  CFRDTD+
Subjt:  TSSSNRRLSQPALERLFASFQENEYPKRATKESLAQELGLNLKQVSKWFENTRWSTRHPSS-GGKKAKSSSRMSIHLSQASGELSKNEQESATCFRDTDS

Query:  NGARHQDLPMANSVVASCQSGDTGDKKLTTRKTKRGESSATKSRKRKGRSDNTASNSKDREGSPRPPAKSPKVNETQTADRFKTRRRRSI
        NGA+HQ  P  +  VA CQSGDT D KL T+KT R ES+ATKSRKRKGRSD+ AS+SKDR+ S +PPAKSPKVN+ QTAD+ +TRRRRSI
Subjt:  NGARHQDLPMANSVVASCQSGDTGDKKLTTRKTKRGESSATKSRKRKGRSDNTASNSKDREGSPRPPAKSPKVNETQTADRFKTRRRRSI

XP_038876083.1 homeobox protein HAT3.1 isoform X1 [Benincasa hispida]0.083.93Show/hide
Query:  MEERDENTGTESRPNNNAEAVQEAKASDNMKERDGNTYTESRPNNNAEAAQEGKASDNMEERDENTDTESRPNKIAEAVQEAKASVEVEVRTCLSNEPMY
        ME RDENT TESRPNN+AEAVQEAKASDNM+  D NT TESR N +A A QE KASDNMEERDENTDTESRPN  AE VQEAKASVEVEV TCLSNEPM+
Subjt:  MEERDENTGTESRPNNNAEAVQEAKASDNMKERDGNTYTESRPNNNAEAAQEGKASDNMEERDENTDTESRPNKIAEAVQEAKASVEVEVRTCLSNEPMY

Query:  SGYQELGTTPEFSRKTDGPDEEKAGVQQNMELGSGYLLSELSEKDNQTISNHADNDQVEAGNSLSIDKDTKNLKLSIEDETTTLLNECSELPLEDVTKNY
        SGYQELGTTPE+S KTDGPDEEK GVQQNMELGSGYLLSEL EKDNQT+SNHADNDQVEAGN LS DKDT+NLKL IE ETTTLLNECSELP+EDV KN+
Subjt:  SGYQELGTTPEFSRKTDGPDEEKAGVQQNMELGSGYLLSELSEKDNQTISNHADNDQVEAGNSLSIDKDTKNLKLSIEDETTTLLNECSELPLEDVTKNY

Query:  IEKMNPPIEDLTQITSIQSLETIPR-----------FFKSKKKNYKLRSLVSSDRVLRSRTQEKAKAPEPSNDLNNFTAEEEGKRKKKKKRNIQGKGARV
        IE+MNPPIEDLTQ  SIQ+LE IP              KSKK NY+LRSLVSSDRVLRSRTQEKAKAPEPSN LNNFTAEE  ++KKKKKRNIQGK ARV
Subjt:  IEKMNPPIEDLTQITSIQSLETIPR-----------FFKSKKKNYKLRSLVSSDRVLRSRTQEKAKAPEPSNDLNNFTAEEEGKRKKKKKRNIQGKGARV

Query:  DEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSK
        DEYSSIR  LRYLLNRI YEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIM+RKLKIRDLFQRID LCAEGRLSESLFDSEGQIDSEDIFCAKCGSK
Subjt:  DEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSK

Query:  ELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAA-GRNSDDTLGLPSDDSEDGD
        ELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITD WEKVYPEAAAAA G+NSD TLGLPSDDSEDGD
Subjt:  ELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAA-GRNSDDTLGLPSDDSEDGD

Query:  YDPDIPDTIDQDNELSSDESSS--DQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSEDNDYDPSVPELDEGDRQESSSSDFTSDSEDLAALENNCSSK
        YDPD+PDTIDQDNE SSDESSS  DQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSED+DYDPSVPELDEG R+ESSSSDFTSDSEDLAAL+NN  SK
Subjt:  YDPDIPDTIDQDNELSSDESSS--DQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSEDNDYDPSVPELDEGDRQESSSSDFTSDSEDLAALENNCSSK

Query:  MMTF-------------------SGPSKSTLHNELSSLLDSGLDKDGLEPISGRRQVERLDYKKLHDETYGNVPTESSDDTYGST-LDSSDDRGCDSGTR
           F                    GPSKS LHNELSSL      KDGLEP+SGRRQVERLDYKKLHDETYGNVPT+SSDDTYGST +DSS DRG DS TR
Subjt:  MMTF-------------------SGPSKSTLHNELSSLLDSGLDKDGLEPISGRRQVERLDYKKLHDETYGNVPTESSDDTYGST-LDSSDDRGCDSGTR

Query:  KRGPKTLVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQCTSSSNRRLSQPALERLFASFQENEYPKRATKESLAQE
        KRGP+ LVLALSNNG+NDDLTNVKTKRS+KR TRQK  AINVNNSVTETPVDTAKSSSS RQ TSSSNRRLSQPALERLFASFQENEYPKRATKESLAQE
Subjt:  KRGPKTLVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQCTSSSNRRLSQPALERLFASFQENEYPKRATKESLAQE

Query:  LGLNLKQVSKWFENTRWSTRHPSSGGKKAKSSSRMSIHLSQASGELSKNEQESATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDKKLTTRKTKRGES
        LGL+LKQVS+WFENTRWSTRHPSSGG +AKSSSRMS   S+ASGEL KNEQES  CFRDTDSNGA+HQDLP ANS    CQSGDTGDKKL TRKTKR ES
Subjt:  LGLNLKQVSKWFENTRWSTRHPSSGGKKAKSSSRMSIHLSQASGELSKNEQESATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDKKLTTRKTKRGES

Query:  SATKSRKRKGRSDNTASNSKDREGSPRPPAKSPKVNETQTADRFKTRRRRSI
        SATKSRKRK  SD+ AS++KD+E S RPPAKSPKVNE QTADRFKTRRRRSI
Subjt:  SATKSRKRKGRSDNTASNSKDREGSPRPPAKSPKVNETQTADRFKTRRRRSI

XP_038876114.1 homeobox protein HAT3.1 isoform X2 [Benincasa hispida]0.084.41Show/hide
Query:  MEERDENTGTESRPNNNAEAVQEAKASDNMKERDGNTYTESRPNNNAEAAQEGKASDNMEERDENTDTESRPNKIAEAVQEAKASVEVEVRTCLSNEPMY
        ME RDENT TESRPNN+AEAVQEAKASDNM+  D NT TESR N +A A QE KASDNMEERDENTDTESRPN  AE VQEAKASVEVEV TCLSNEPM+
Subjt:  MEERDENTGTESRPNNNAEAVQEAKASDNMKERDGNTYTESRPNNNAEAAQEGKASDNMEERDENTDTESRPNKIAEAVQEAKASVEVEVRTCLSNEPMY

Query:  SGYQELGTTPEFSRKTDGPDEEKAGVQQNMELGSGYLLSELSEKDNQTISNHADNDQVEAGNSLSIDKDTKNLKLSIEDETTTLLNECSELPLEDVTKNY
        SGYQELGTTPE+S KTDGPDEEK GVQQNMELGSGYLLSEL EKDNQT+SNHADNDQVEAGN LS DKDT+NLKL IE ETTTLLNECSELP+EDV KN+
Subjt:  SGYQELGTTPEFSRKTDGPDEEKAGVQQNMELGSGYLLSELSEKDNQTISNHADNDQVEAGNSLSIDKDTKNLKLSIEDETTTLLNECSELPLEDVTKNY

Query:  IEKMNPPIEDLTQITSIQSLETIPR-----------FFKSKKKNYKLRSLVSSDRVLRSRTQEKAKAPEPSNDLNNFTAEEEGKRKKKKKRNIQGKGARV
        IE+MNPPIEDLTQ  SIQ+LE IP              KSKK NY+LRSLVSSDRVLRSRTQEKAKAPEPSN LNNFTAEE  ++KKKKKRNIQGK ARV
Subjt:  IEKMNPPIEDLTQITSIQSLETIPR-----------FFKSKKKNYKLRSLVSSDRVLRSRTQEKAKAPEPSNDLNNFTAEEEGKRKKKKKRNIQGKGARV

Query:  DEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSK
        DEYSSIR  LRYLLNRI YEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIM+RKLKIRDLFQRID LCAEGRLSESLFDSEGQIDSEDIFCAKCGSK
Subjt:  DEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSK

Query:  ELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAA-GRNSDDTLGLPSDDSEDGD
        ELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITD WEKVYPEAAAAA G+NSD TLGLPSDDSEDGD
Subjt:  ELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAA-GRNSDDTLGLPSDDSEDGD

Query:  YDPDIPDTIDQDNELSSDESSS--DQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSEDNDYDPSVPELDEGDRQESSSSDFTSDSEDLAALENNCSSK
        YDPD+PDTIDQDNE SSDESSS  DQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSED+DYDPSVPELDEG R+ESSSSDFTSDSEDLAAL+NN  SK
Subjt:  YDPDIPDTIDQDNELSSDESSS--DQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSEDNDYDPSVPELDEGDRQESSSSDFTSDSEDLAALENNCSSK

Query:  MMTF-------------------SGPSKSTLHNELSSLLDSGLDKDGLEPISGRRQVERLDYKKLHDETYGNVPTESSDDTYGST-LDSSDDRGCDSGTR
           F                    GPSKS LHNELSSL      KDGLEP+SGRRQVERLDYKKLHDETYGNVPT+SSDDTYGST +DSS DRG DS TR
Subjt:  MMTF-------------------SGPSKSTLHNELSSLLDSGLDKDGLEPISGRRQVERLDYKKLHDETYGNVPTESSDDTYGST-LDSSDDRGCDSGTR

Query:  KRGPKTLVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQCTSSSNRRLSQPALERLFASFQENEYPKRATKESLAQE
        KRGP+ LVLALSNNG+NDDLTNVKTKRS+KR TRQK  AINVNNSVTETPVDTAKSSSS RQ TSSSNRRLSQPALERLFASFQENEYPKRATKESLAQE
Subjt:  KRGPKTLVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQCTSSSNRRLSQPALERLFASFQENEYPKRATKESLAQE

Query:  LGLNLKQV
        LGL+LKQ+
Subjt:  LGLNLKQV

TrEMBL top hitse value%identityAlignment
A0A1S3C283 pathogenesis-related homeodomain protein0.0e+0096.62Show/hide
Query:  MEERDENTGTESRPNNNAEAVQEAKASDNMKERDGNTYTESRPNNNAEAAQEGKASDNMEERDENTDTESRPNKIAEAVQEAKASVEVEVRTCLSNEPMY
        MEERDENTGTESRPNNNAEAVQEAKASDNMKERDGNTYTESRPNNNAEAAQEGKASDNMEERDENTDTESRPNKIAEAVQEAKASVEVEVRTCLSNEPMY
Subjt:  MEERDENTGTESRPNNNAEAVQEAKASDNMKERDGNTYTESRPNNNAEAAQEGKASDNMEERDENTDTESRPNKIAEAVQEAKASVEVEVRTCLSNEPMY

Query:  SGYQELGTTPEFSRKTDGPDEEKAGVQQNMELGSGYLLSELSEKDNQTISNHADNDQVEAGNSLSIDKDTKNLKLSIEDETTTLLNECSELPLEDVTKNY
        SGYQELGTTPEFSRKTDGPDEEKAGVQQNMELGSGYLLSELSEKDNQTISNHADNDQVEAGNSLSIDKDTKNLKLSIEDETTTLLNECSELPLEDVTKNY
Subjt:  SGYQELGTTPEFSRKTDGPDEEKAGVQQNMELGSGYLLSELSEKDNQTISNHADNDQVEAGNSLSIDKDTKNLKLSIEDETTTLLNECSELPLEDVTKNY

Query:  IEKMNPPIEDLTQITSIQSLETIP-----------RFFKSKKKNYKLRSLVSSDRVLRSRTQEKAKAPEPSNDLNNFTAEEEGKRKKKKKRNIQGKGARV
        IEKMNPPIEDLTQITSIQSLETIP           RFFKSKKKNYKLRSLVSSDRVLRSRTQEKAKAPEPSNDLNNFTAEEEGKRKKKKKRNIQGKGARV
Subjt:  IEKMNPPIEDLTQITSIQSLETIP-----------RFFKSKKKNYKLRSLVSSDRVLRSRTQEKAKAPEPSNDLNNFTAEEEGKRKKKKKRNIQGKGARV

Query:  DEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSK
        DEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSK
Subjt:  DEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSK

Query:  ELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAGRNSDDTLGLPSDDSEDGDY
        ELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAGRNSDDTLGLPSDDSEDGDY
Subjt:  ELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAGRNSDDTLGLPSDDSEDGDY

Query:  DPDIPDTIDQDNELSSDESSSDQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSEDNDYDPSVPELDEGDRQESSSSDFTSDSEDLAALENNCSSKMMT
        DPDIPDTIDQDNELSSDESSSDQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSEDNDYDPSVPELDEGDRQESSSSDFTSDSEDLAALENNCSSK   
Subjt:  DPDIPDTIDQDNELSSDESSSDQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSEDNDYDPSVPELDEGDRQESSSSDFTSDSEDLAALENNCSSKMMT

Query:  F-----------------SGPSKSTLHNELSSLLDSGLDKDGLEPISGRRQVERLDYKKLHDETYGNVPTESSDDTYGSTLDSSDDRGCDSGTRKRGPKT
                          SGPSKSTLHNELSSLLDSGLDKDGLEPISGRRQVERLDYKKLHDETYGNVPTESSDDTYGSTLDSSDDRGCDSGTRKRGPKT
Subjt:  F-----------------SGPSKSTLHNELSSLLDSGLDKDGLEPISGRRQVERLDYKKLHDETYGNVPTESSDDTYGSTLDSSDDRGCDSGTRKRGPKT

Query:  LVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQCTSSSNRRLSQPALERLFASFQENEYPKRATKESLAQELGLNLK
        LVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQCTSSSNRRLSQPALERLFASFQENEYPKRATKESLAQELGLNLK
Subjt:  LVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQCTSSSNRRLSQPALERLFASFQENEYPKRATKESLAQELGLNLK

Query:  QVSKWFENTRWSTRHPSSGGKKAKSSSRMSIHLSQASGELSKNEQESATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDKKLTTRKTKRGESSATKSR
        QVSKWFENTRWSTRHPSSGGKKAKSSSRMSIHLSQASGELSKNEQESATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDKKLTTRKTKRGESSATKSR
Subjt:  QVSKWFENTRWSTRHPSSGGKKAKSSSRMSIHLSQASGELSKNEQESATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDKKLTTRKTKRGESSATKSR

Query:  KRKGRSDNTASNSKDREGSPRPPAKSPKVNETQTADRFKTRRRRSI
        KRKGRSDNTASNSKDREGSPRPPAKSPKVNETQTADRFKTRRRRSI
Subjt:  KRKGRSDNTASNSKDREGSPRPPAKSPKVNETQTADRFKTRRRRSI

A0A6J1D6Q5 homeobox protein HAT3.1 isoform X10.0e+0075.62Show/hide
Query:  MEERDENTDTESRPNKIAEAVQEAKASVEVEVRTCLSNEPMYS--GYQELGTTPEFSRKTDGPDEEKAGVQQNM-----ELGSGYLLSELSEKDNQTISN
        MEER E   TE RPN   EAVQEAKAS  VEV TC SNE M+S    QELGTTPE + KT GPD+EK+GVQQNM     ELGSG +LSEL EK+NQTIS 
Subjt:  MEERDENTDTESRPNKIAEAVQEAKASVEVEVRTCLSNEPMYS--GYQELGTTPEFSRKTDGPDEEKAGVQQNM-----ELGSGYLLSELSEKDNQTISN

Query:  HADNDQVEAGNSLSIDKDTKNLKLSIEDETTTLLNECSELPLEDVTKNYIEKMNPPIEDLTQITSIQSLETIP---------------RFFKSKKKNYKL
         A+ DQVEAGN LS D +T+NL L IE ETTT LNECSELP ED  KN I+++NPPIEDLTQ TSIQ LET+P               +  KSKKKNY L
Subjt:  HADNDQVEAGNSLSIDKDTKNLKLSIEDETTTLLNECSELPLEDVTKNYIEKMNPPIEDLTQITSIQSLETIP---------------RFFKSKKKNYKL

Query:  RSLVSSDRVLRSRTQEKAKAPEPSNDLNNFTAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQ
        RSLVSSDRVLRSRTQEKAKAPEPSN+LN  TA  EGKR KKKKRNI+GKGA  DE+SSIRN LRYL+NRI+YEQSLI+AYSSEGWKGFSSDKLKPEKELQ
Subjt:  RSLVSSDRVLRSRTQEKAKAPEPSNDLNNFTAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQ

Query:  RASNEIMRRKLKIRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDC
        RAS+EIMR KLKIRDLFQ +D+LCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDC
Subjt:  RASNEIMRRKLKIRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDC

Query:  KDDCLDLLNEFQGSNLSITDGWEKVYPE-AAAAAGRNSDDTLGLPSDDSEDGDYDPDIPDTIDQDNELSSDESSSDQSNSDTSGYASASEGLEVPPNDDQ
        KDDCLDLLNEFQGSNLSITDGWEKVYPE AAAAAG+NSD  LGLPSDDSEDGDYDPD PDTI+Q+     DESSSDQS+SD SGYASASE LE  PNDDQ
Subjt:  KDDCLDLLNEFQGSNLSITDGWEKVYPE-AAAAAGRNSDDTLGLPSDDSEDGDYDPDIPDTIDQDNELSSDESSSDQSNSDTSGYASASEGLEVPPNDDQ

Query:  YLGLPSDDSEDNDYDPSVPELDEGDRQESSSSDFTSDSEDLAALENNC-----SSKMMTFSGPSKSTLHNELSSLLDSGLDKDGLEPISGRRQVERLDYK
        YLGLPSDDSED+DY+P  PELDEG +QESS SDFTSDSEDLAAL++       S+   +  GP  S LHNEL SLL+SG DKDGLEP+SGRRQVERLDYK
Subjt:  YLGLPSDDSEDNDYDPSVPELDEGDRQESSSSDFTSDSEDLAALENNC-----SSKMMTFSGPSKSTLHNELSSLLDSGLDKDGLEPISGRRQVERLDYK

Query:  KLHDETYGNVPTESSDDTYGS-TLDSSDDRGCDSGTRKRGPKTLVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQC
        KLHDETYGNVP++SSDDT+GS ++DSSDDRG  S TRKR PK LV AL  NG+NDDL N KTKRSYKRRT QKPGA N+ NSVT TP D+ KSSSSVR+ 
Subjt:  KLHDETYGNVPTESSDDTYGS-TLDSSDDRGCDSGTRKRGPKTLVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQC

Query:  TSSSNRRLSQPALERLFASFQENEYPKRATKESLAQELGLNLKQVSKWFENTRWSTRHPSS-GGKKAKSSSRMSIHLSQASGELSKNEQESATCFRDTDS
         SSSNRRLSQPALERL ASFQEN+YPKRATKESLAQELGL+LKQVSKWFENTRWSTRHPSS    KAKS+ RM I  S+ SG+L K EQES  CFRDTD+
Subjt:  TSSSNRRLSQPALERLFASFQENEYPKRATKESLAQELGLNLKQVSKWFENTRWSTRHPSS-GGKKAKSSSRMSIHLSQASGELSKNEQESATCFRDTDS

Query:  NGARHQDLPMANSVVASCQSGDTGDKKLTTRKTKRGESSATKSRKRKGRSDNTASNSKDREGSPRPPAKSPKVNETQTADRFKTRRRRSI
        NGA+HQ  P  +  VA CQSGDT D KL T+KT R ES+ATKSRKRKGRSD+ AS+SKDR+ S +PPAKSPKVN+ QTAD+ +TRRRRSI
Subjt:  NGARHQDLPMANSVVASCQSGDTGDKKLTTRKTKRGESSATKSRKRKGRSDNTASNSKDREGSPRPPAKSPKVNETQTADRFKTRRRRSI

A0A6J1E4I6 homeobox protein HAT3.1-like0.0e+0072.73Show/hide
Query:  MEERDENTDTESRPNKIAEAVQEAKASVEVEVRTCLSNEPMYS--GYQELGTTPEFSRKTDGPDEEKAGVQQNM-----ELGSGYLLSELSEKDNQTISN
        MEERDE   TESR N  AEAVQEAK SVE E+RTCLSNE  +S   Y EL  TP +S KT G DEEK  VQQNM     ELGSG +L ELSEK NQT SN
Subjt:  MEERDENTDTESRPNKIAEAVQEAKASVEVEVRTCLSNEPMYS--GYQELGTTPEFSRKTDGPDEEKAGVQQNM-----ELGSGYLLSELSEKDNQTISN

Query:  HADNDQVEAGNSLSIDKDTKNLKLSIEDETTTLLNECSELPLEDVTKNYIEKMNPPIEDLTQITSIQSLETIP-----------RFFKSKKKNYKLRSLV
         ADNDQVEAGN L  DKDT+NL + IE ETTTLL +CSELP E V KNYIE+MNPP E LTQ T  Q+LET+P           R  KS K N  LRSLV
Subjt:  HADNDQVEAGNSLSIDKDTKNLKLSIEDETTTLLNECSELPLEDVTKNYIEKMNPPIEDLTQITSIQSLETIP-----------RFFKSKKKNYKLRSLV

Query:  SSDRVLRSRTQEKAKAPEPSNDLNNFTAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASN
        SSDR LRS+TQEK K PEPSNDLNNFTAEE   + KKK+RNIQGKGARVDE+SSIRNHLRYLLNRI+YEQ+LIEAYSSEGWKGFSSDKLKPEKELQRASN
Subjt:  SSDRVLRSRTQEKAKAPEPSNDLNNFTAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASN

Query:  EIMRRKLKIRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDC
        EIMRRKLKIRD+FQRID LC EG LS+SLFDS+GQIDSEDIFCAKCGSKELS ENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDC
Subjt:  EIMRRKLKIRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDC

Query:  LDLLNEFQGSNLSITDGWEKVYPEAAA-AAGRNSDDTLGLPSDDS-EDGDYDPDIPDTIDQDNELSSDESSSDQSNSDTSGYASASEGLEVPPNDDQYLG
        L+LLNEFQGS LSITDGWEKVYPEAAA AAGRN D   GLPSDDS +D DYDPD+PDTI QD          D+S+ +TSGYASASE LE PPN DQYLG
Subjt:  LDLLNEFQGSNLSITDGWEKVYPEAAA-AAGRNSDDTLGLPSDDS-EDGDYDPDIPDTIDQDNELSSDESSSDQSNSDTSGYASASEGLEVPPNDDQYLG

Query:  LPSDDSEDNDYDPSVPELDEGDRQESSSSDFTSDSEDLAALENNCSSK--------------------MMTFSGPSKSTLHNELSSLLDSGLDKDGLEPI
        LPSDDSED+DYDPS PE DE  RQESSSSDFTSDSEDLAAL++N SSK                      +  GP KS L+NELSSLL+SG DKDG EP+
Subjt:  LPSDDSEDNDYDPSVPELDEGDRQESSSSDFTSDSEDLAALENNCSSK--------------------MMTFSGPSKSTLHNELSSLLDSGLDKDGLEPI

Query:  SGRRQVERLDYKKLHDETYGNVPTESSDDTYGS-TLDSSDDRGCDSGTRKRGPKTLVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPV
         GRRQVERLDYKKLHDETYGNVPT+SSDDTY S ++DSSDD+G DS TRKR PKTLVLAL N  +NDDLTN+KTK S KR TRQK  A N+N SV++TP 
Subjt:  SGRRQVERLDYKKLHDETYGNVPTESSDDTYGS-TLDSSDDRGCDSGTRKRGPKTLVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPV

Query:  DTAKSSSSVRQCTSSSNRRLSQPALERLFASFQENEYPKRATKESLAQELGLNLKQVSKWFENTRWSTRHPSS-GGKKAKSSSRMSIHLSQASGELSKNE
        DT K+SSSVR+ T SS RRLS+ ALERL ASFQEN+YP+RATKESLAQELGL++KQVSKWF NTRWSTRHPSS  G KAKSSSRM IH SQASGEL + E
Subjt:  DTAKSSSSVRQCTSSSNRRLSQPALERLFASFQENEYPKRATKESLAQELGLNLKQVSKWFENTRWSTRHPSS-GGKKAKSSSRMSIHLSQASGELSKNE

Query:  QESATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDKKLTTRKTKRGESSATKSRKRKGRSDNTASNSKDREGSPRPPAKSPKVNETQTADRFKTRRRR
        QE           GA+HQ+LP A+SVVA CQSGDTGD KL T++TKR E SATKSRKRKGRSD+ AS SKD + S RPPAKSPKVNE QTA   KTRRR 
Subjt:  QESATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDKKLTTRKTKRGESSATKSRKRKGRSDNTASNSKDREGSPRPPAKSPKVNETQTADRFKTRRRR

Query:  SI
        S+
Subjt:  SI

A0A6J1IPM8 homeobox protein HAT3.1-like0.0e+0071.76Show/hide
Query:  MEERDENTDTESRPNKIAEAVQEAKASVEVEVRTCLSNEPMYS--GYQELGTTPEFSRKTDGPDEEKAGVQQNMELGS-----GYLLSELSEKDNQTISN
        MEERDE   TESR    + AVQEAKASVEVEV T L+NE + S   Y ELGT  +++ KT  PDEEK GV+QNME  S     G   SEL EK +QTIS 
Subjt:  MEERDENTDTESRPNKIAEAVQEAKASVEVEVRTCLSNEPMYS--GYQELGTTPEFSRKTDGPDEEKAGVQQNMELGS-----GYLLSELSEKDNQTISN

Query:  HADNDQVEAGNSLSIDKDTKNLKLSIEDETTTLLNECSELPLEDVTKNYIEKMNPPIEDLTQITSIQSLETIP-----------RFFKSKKKNYKLRSLV
         A+NDQ EAGN LS DKDT+NL L IE ETT LLNECSE P ED  KNYIE+ NPPIE   Q TSI++L  +P           R  KSKKKNY LRSLV
Subjt:  HADNDQVEAGNSLSIDKDTKNLKLSIEDETTTLLNECSELPLEDVTKNYIEKMNPPIEDLTQITSIQSLETIP-----------RFFKSKKKNYKLRSLV

Query:  SSDRVLRSRTQEKAKAPEPSNDLNNFTAEEEGKRKKKKK-RNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRAS
        SSDRVLRSRTQ+KAKAPEPSNDL+N TA EEGK KK+KK R I+GKGARVDE+SSIRNHLRYL+NRI+YEQSLIEAYSSEGWKGFSSDKLKPEKELQRAS
Subjt:  SSDRVLRSRTQEKAKAPEPSNDLNNFTAEEEGKRKKKKK-RNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRAS

Query:  NEIMRRKLKIRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDD
        NEIMRRKLKIRDLFQRID LC+EGR SE+LFDSEGQIDSEDIFC KCGSKELSLENDIILCDG+CDRGFHQFCLEPPLLN+DIPPDDEGWLCPGCDCKDD
Subjt:  NEIMRRKLKIRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDD

Query:  CLDLLNEFQGSNLSITDGWEKVYPE-AAAAAGRNSDDTLGLPSDDSEDGDYDPDIPDTIDQDNELSSDESSSDQSNSDTSGY--ASASEGLEVPPNDDQY
        C+DLLNEFQGSNLSITDGWEKV+PE AAAAAGR+SD T+ LPSDDS+DGDYDPD+PD IDQD E SSD SSSDQS+SD SGY  ASASE LE PPNDDQY
Subjt:  CLDLLNEFQGSNLSITDGWEKVYPE-AAAAAGRNSDDTLGLPSDDSEDGDYDPDIPDTIDQDNELSSDESSSDQSNSDTSGY--ASASEGLEVPPNDDQY

Query:  LGLPSDDSEDNDYDPSVPELDEGDRQESSSSDFTSDSEDLAALENNCSSK--------------------MMTFSGPSKSTLHNELSSLLDSGLDKDGLE
        LGLPSDDSED+DYDP  P  DEG  QESSSSDFTSDSEDLAAL +N SSK                      +  GP+K+  HN+LSSL+ SG D+ GLE
Subjt:  LGLPSDDSEDNDYDPSVPELDEGDRQESSSSDFTSDSEDLAALENNCSSK--------------------MMTFSGPSKSTLHNELSSLLDSGLDKDGLE

Query:  PISGRRQVERLDYKKLHDETYGNVPTESSDDTYGS-TLDSSDDRGCDSGTRKRGPKTLVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTET
         +SGRR VERLDYKKLHDET+GNVP+ SSDDTYGS ++DSSDDRG    TRK  PK LV ALS NG+ DD  N+KTK S  RRTRQKP A N++NSVT+T
Subjt:  PISGRRQVERLDYKKLHDETYGNVPTESSDDTYGS-TLDSSDDRGCDSGTRKRGPKTLVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTET

Query:  PVDTAKSSSSVRQCTSSSNRRLSQPALERLFASFQENEYPKRATKESLAQELGLNLKQVSKWFENTRWSTRHPSSGGKKAKSSSRMSIHLSQASGELSKN
        P  T KSSSSVR+ TSSS+RRLSQP LERL ASFQEN+YP+RATKESLA+ELGL+LKQVSKWFENTRWSTRHPSS   KAKS+SRM    SQ S +  K 
Subjt:  PVDTAKSSSSVRQCTSSSNRRLSQPALERLFASFQENEYPKRATKESLAQELGLNLKQVSKWFENTRWSTRHPSSGGKKAKSSSRMSIHLSQASGELSKN

Query:  EQESATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDKKLTTRKTKRGESSATKSRKRKGRSDNTASNSKDREGSPRPPAKSPKVNETQTADRFKTRRR
        EQES  CFRDT SNGA+HQ+ P A +VVA CQSG TGD KL   KTKR ES+ATKSRKRKGRSD  AS SK+R+ S +PPAKS KV+E QTAD+ K RRR
Subjt:  EQESATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDKKLTTRKTKRGESSATKSRKRKGRSDNTASNSKDREGSPRPPAKSPKVNETQTADRFKTRRR

Query:  RSI
        +S+
Subjt:  RSI

A0A6J1J9X9 homeobox protein HAT3.1-like0.0e+0072.86Show/hide
Query:  MEERDENTDTESRPNKIAEAVQEAKASVEVEVRTCLSNEPMYSGYQELGTTPEFSRKTDGPDEEKAGVQQNM-----ELGSGYLLSELSEKDNQTISNHA
        MEERDE   TESR N  AEAVQEAK  VE E+ TCLSNE  +    EL  TP ++ KT GPDEEK  VQQNM     ELGSG +LSELSEK NQT SN A
Subjt:  MEERDENTDTESRPNKIAEAVQEAKASVEVEVRTCLSNEPMYSGYQELGTTPEFSRKTDGPDEEKAGVQQNM-----ELGSGYLLSELSEKDNQTISNHA

Query:  DNDQVEAGNSLSIDKDTKNLKLSIEDETTTLLNECSELPLEDVTKNYIEKMNPPIEDLTQITSIQSLETIP-----------RFFKSKKKNYKLRSLVSS
        DNDQVEAGN L  DKDT+NL + IE ETTTLL +CSELP E V KNYIE+MNPPIE+LTQ T  Q LET+P           R  KS K N  LRSLVSS
Subjt:  DNDQVEAGNSLSIDKDTKNLKLSIEDETTTLLNECSELPLEDVTKNYIEKMNPPIEDLTQITSIQSLETIP-----------RFFKSKKKNYKLRSLVSS

Query:  DRVLRSRTQEKAKAPEPSNDLNNFTAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEI
        DR +RS+TQEK K PEPSNDLNNFTAEE   + KKK+RNIQGKGARVDE+SSIRNHLRYLLNRI YEQ+LIEAYSSEGWKGFSSDKLKPEKELQRASNEI
Subjt:  DRVLRSRTQEKAKAPEPSNDLNNFTAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEI

Query:  MRRKLKIRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLD
        MRRKLKIRD+FQRID LC EG LS+SLFDS+GQI SEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIP DDEGWLCPGCDCKDDCL+
Subjt:  MRRKLKIRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLD

Query:  LLNEFQGSNLSITDGWEKVYPEAAA-AAGRNSDDTLGLPSDDSEDGDYDPDIPDTIDQDNELSSDESSSDQSNSDTSGYASASEGLEVPPNDDQYLGLPS
        LLNEFQGS LSITDGWEKVYPEAAA AAGRN D  LGLPSDDSED DYDPD+PDTI QD          D+S+S+TSGYASASE LE  PN DQYLGLPS
Subjt:  LLNEFQGSNLSITDGWEKVYPEAAA-AAGRNSDDTLGLPSDDSEDGDYDPDIPDTIDQDNELSSDESSSDQSNSDTSGYASASEGLEVPPNDDQYLGLPS

Query:  DDSEDNDYDPSVPELDEGDRQESSSSDFTSDSEDLAALENNCSSK--------------------MMTFSGPSKSTLHNELSSLLDSGLDKDGLEPISGR
        DDSED+DYDPS PE DE  RQESSSSDFTSDSEDLAAL++N SSK                      +  GP KS+L+NELSSLL+SG DKDG EP+ GR
Subjt:  DDSEDNDYDPSVPELDEGDRQESSSSDFTSDSEDLAALENNCSSK--------------------MMTFSGPSKSTLHNELSSLLDSGLDKDGLEPISGR

Query:  RQVERLDYKKLHDETYGNVPTESSDDTYGS-TLDSSDDRGCDSGTRKRGPKTLVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTA
        RQVERLDYKKLHDETYGNVPT+SSDDTY S ++DSSDD+G DS TRKR PKTLVLAL N   NDDLTNVKTK S KR TRQK  A+N+N SVT+TP DT 
Subjt:  RQVERLDYKKLHDETYGNVPTESSDDTYGS-TLDSSDDRGCDSGTRKRGPKTLVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTA

Query:  KSSSSVRQCTSSSNRRLSQPALERLFASFQENEYPKRATKESLAQELGLNLKQVSKWFENTRWSTRHPSS-GGKKAKSSSRMSIHLSQASGELSKNEQES
        K+SSSVR+ TSSS RRLSQ ALERL ASFQEN+YP+RATKESLAQELGL++KQV+KWF NTRWSTRHPSS  G KAKSSSRM IH SQASGEL + E+E 
Subjt:  KSSSSVRQCTSSSNRRLSQPALERLFASFQENEYPKRATKESLAQELGLNLKQVSKWFENTRWSTRHPSS-GGKKAKSSSRMSIHLSQASGELSKNEQES

Query:  ATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDKKLTTRKTKRGESSATKSRKRKGRSDNTASNSKDREGSPRPPAKSPKVNETQTADRFKTRRRRSI
                  GA+HQ+LP A+SVVA CQSGDTGD KL T+ TKR E SA KSRKRKGRSD+ AS SKD + S RPPAKSPKVNE QTA   KTRRR S+
Subjt:  ATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDKKLTTRKTKRGESSATKSRKRKGRSDNTASNSKDREGSPRPPAKSPKVNETQTADRFKTRRRRSI

SwissProt top hitse value%identityAlignment
P46605 Homeobox protein HOX1A9.7e-10341.04Show/hide
Query:  YKLRSLVSSDRVLRSRTQEKAKAPEPSNDLNNFTAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEK
        Y L S  S  RVLRS +  K  + E      +  A  +   K++K      K +  DE+S IR  +RY+LNR+ YEQSLIEAY+SEGWK  S DK++PEK
Subjt:  YKLRSLVSSDRVLRSRTQEKAKAPEPSNDLNNFTAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEK

Query:  ELQRASNEIMRRKLKIRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPG
        EL+RA +EI+R KL+IR++F+ ID+L ++G++ E+LFDSEG+I  EDIFC+ CGS + +L NDIILCDG CDRGFHQ CL PPL   DIP  DEGWLCP 
Subjt:  ELQRASNEIMRRKLKIRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPG

Query:  CDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAG-RNSDDTLGLPSDDSEDGDYDPDIPD--TIDQDNELSSDESSSDQSNSDTSGYASASEGLEVP
        CDCK DC+DL+NE  GSN+SI D WEKV+P+AAA A     DD   LPSDDS+D D+DP++P+   + +D E SS+E     S+SD S + + S+  E P
Subjt:  CDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAG-RNSDDTLGLPSDDSEDGDYDPDIPD--TIDQDNELSSDESSSDQSNSDTSGYASASEGLEVP

Query:  PNDDQY--LGLPSDDSEDNDYDPSVPELDEGDRQESSS--SDFTSDSEDLA--------------ALENNCSSKMMTFSGPSKSTLHNELSSLLDSGLDK
          D +   L LPS+DSED+DYDP+ P+ D+   ++SSS  SDFTSDS+D                 L +     M   +  +K+T  +     +++ +D+
Subjt:  PNDDQY--LGLPSDDSEDNDYDPSVPELDEGDRQESSS--SDFTSDSEDLA--------------ALENNCSSKMMTFSGPSKSTLHNELSSLLDSGLDK

Query:  DGLEPISGRRQVERLDYKKLHDETYGNVPTESSDDTYGS----TLDSSDDRGCDSGTRKRGPKTLVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINV
          + P S RRQ ERLDYKKL+DE YG   ++SSDD   S     +  S++ G  +    +G + +         ND+LT   TK+S            ++
Subjt:  DGLEPISGRRQVERLDYKKLHDETYGNVPTESSDDTYGS----TLDSSDDRGCDSGTRKRGPKTLVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINV

Query:  NNSVTETPVDTAKSSSSVRQCTSSSNRRLSQPAL-ERLFASFQENEYPKRATKESLAQELGLNLKQVSKWFENTRWSTRHPSS--GGKKAKSSSRMSIHL
        + SV E P D   + S+     S++ +    P + ++L   F+   YP R+ KESLA+ELGL  +QV+KWFE  R S R  SS  G    K S + +   
Subjt:  NNSVTETPVDTAKSSSSVRQCTSSSNRRLSQPAL-ERLFASFQENEYPKRATKESLAQELGLNLKQVSKWFENTRWSTRHPSS--GGKKAKSSSRMSIHL

Query:  SQASGELSKNE----QESATCFRDTDSNGARHQDLPMANSVVASCQSG-DTGDKKLTTRKTKR-GESSATKSRKR
          AS E  + E    +ES  C      NG         +S V S   G D G  K+ + + +  G   A K+R++
Subjt:  SQASGELSKNE----QESATCFRDTDSNGARHQDLPMANSVVASCQSG-DTGDKKLTTRKTKR-GESSATKSRKR

P48785 Pathogenesis-related homeodomain protein3.3e-5029.86Show/hide
Query:  SKKKNYKLRSLVSSDRVL-RSRTQEKAKAPEPSNDLNNFTAEEEGK--RKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFS
        S K+N K     + + +  +SRT++ ++      ++     EE+ K  RK+K KR  +     VD+   ++   RYLL +++ +Q+LI+AY++EGWKG S
Subjt:  SKKKNYKLRSLVSSDRVL-RSRTQEKAKAPEPSNDLNNFTAEEEGK--RKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFS

Query:  SDKLKPEKELQRASNEIMRRKLKIRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPD
         +K++P+KEL+RA  EI+  KL +RD  +++D L + G + E +  S+G I  + IFCA+C S+E   +NDIILCDG C+R FHQ CL+PPL    IPP 
Subjt:  SDKLKPEKELQRASNEIMRRKLKIRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPD

Query:  DEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAA---AAGRNSDDTLGLPSDDSEDGDYDPDIPDTIDQDNELSSDESSSDQSNSDTSGYAS
        D+GW C  CDCK + +D +N   G++  +   W+ ++ E A+    +    ++    PSDDS+D DYDP++               +   ++S+ SG   
Subjt:  DEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAA---AAGRNSDDTLGLPSDDSEDGDYDPDIPDTIDQDNELSSDESSSDQSNSDTSGYAS

Query:  ASEGLEVPPNDDQYLGLPSDDSEDNDYDPSVPELDEGDRQESSSSDFTSDSEDLAALENNCSSKMMTFSGPSKSTLHNELSSLLDSGLDKDGLEPISGRR
                           D   DND              E S S   S S D  AL                S   + LS++++   +    E + G R
Subjt:  ASEGLEVPPNDDQYLGLPSDDSEDNDYDPSVPELDEGDRQESSSSDFTSDSEDLAALENNCSSKMMTFSGPSKSTLHNELSSLLDSGLDKDGLEPISGRR

Query:  QVERLDYKKLHDETYGN----VPTESSDDTYGSTLDSSDDRGCDSGTRKRGPKTLVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVD
        Q   +DY +L+ E +G         S D+ +G        R  D+G+      TLV    ++  + D                          V ET   
Subjt:  QVERLDYKKLHDETYGN----VPTESSDDTYGSTLDSSDDRGCDSGTRKRGPKTLVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVD

Query:  TAKSSSSVRQCTSSSNR-RLSQPALERLFASFQENEYPKRATKESLAQELGLNLKQVSKWFENTRW
        + + S SV          RL + A+E+L   F E E P +A ++ LA+EL L+ ++V+KWF+NTR+
Subjt:  TAKSSSSVRQCTSSSNR-RLSQPALERLFASFQENEYPKRATKESLAQELGLNLKQVSKWFENTRW

P48786 Pathogenesis-related homeodomain protein2.4e-11746.25Show/hide
Query:  VSSDRVLRSRTQEKAKAPEPSNDLNNFTAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRAS
        V+S R LRSR+QEK+  P+ +N + +  A+ E  RKK+KKR  +    RVDE+  IR HLRYLL+RI+YE++ ++AYS EGWKG S DK+KPEKEL+RA 
Subjt:  VSSDRVLRSRTQEKAKAPEPSNDLNNFTAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRAS

Query:  NEIMRRKLKIRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDD
         EI  RKLKIRDLFQR+D   +EGRL E LFDS G+IDSEDIFCAKCGSK+++L NDIILCDG CDRGFHQFCL+PPLL   IPPDDEGWLCPGC+CK D
Subjt:  NEIMRRKLKIRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDD

Query:  CLDLLNEFQGSNLSITDGWEKVYPE--AAAAAGRNSDDTLGLPSDDSEDGDYDPDIPDTIDQDNELSSDESSSDQSNSDTSGYASASEGLEVPPNDDQYL
        C+ LLN+ Q +N+ + D WEKV+ E  AAAA+G+N DD  GLPSDDSED DYDP  PD    D ++  D+SS+D+S+     Y S S+ ++V    +   
Subjt:  CLDLLNEFQGSNLSITDGWEKVYPE--AAAAAGRNSDDTLGLPSDDSEDGDYDPDIPDTIDQDNELSSDESSSDQSNSDTSGYASASEGLEVPPNDDQYL

Query:  GLPSDDSEDNDYDPSVPELDEGDRQESSSSDFTSDSEDLAALENNCSSKMMTFSGPSKSTLHNELSSLLDSGLDKDG-LEPISGRRQVERLDYKKLHD--
        GLPSDDSED++YDPS    D+   ++SS SDFTSDSED   + ++         GP  ST  +  ++    G  + G   P+  RRQVE LDYKKL+D  
Subjt:  GLPSDDSEDNDYDPSVPELDEGDRQESSSSDFTSDSEDLAALENNCSSKMMTFSGPSKSTLHNELSSLLDSGLDKDG-LEPISGRRQVERLDYKKLHD--

Query:  ------------------------ETYGNVPTESSDDTYGSTLDSSDDRGCDSGTRKRGPKTLVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNN
                                E YGN  ++SSD+ Y  T  SS D       +    K          S D   + K + S   R   K  A+   +
Subjt:  ------------------------ETYGNVPTESSDDTYGSTLDSSDDRGCDSGTRKRGPKTLVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNN

Query:  SVTETPVDTAKSSSSVRQCTSSSNRRLSQPALERLFASFQENEYPKRATKESLAQELGLNLKQVSKWFENTRWSTRHPSS-GGKKAKSSS-----RMSIH
        S      +   S++ V    S+S     + A +RL  SF+EN+YP+RA KESLA EL L+++QVS WF N RWS RH S  G   AK  S     + SI 
Subjt:  SVTETPVDTAKSSSSVRQCTSSSNRRLSQPALERLFASFQENEYPKRATKESLAQELGLNLKQVSKWFENTRWSTRHPSS-GGKKAKSSS-----RMSIH

Query:  LSQAS----------GELSKNEQESAT
        +S  S           E+ K EQ++A+
Subjt:  LSQAS----------GELSKNEQESAT

Q04996 Homeobox protein HAT3.11.9e-12249.64Show/hide
Query:  RTQEKAKAPEPSNDLNNFTAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLK
        R Q   +   PS+ + N T    G+ KKK K   +G+    DEY+ I+  LRY LNRI YEQSLI+AYS EGWKG S +K++PEKEL+RA+ EI+RRKLK
Subjt:  RTQEKAKAPEPSNDLNNFTAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLK

Query:  IRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQ
        IRDLFQ +DTLCAEG L ESLFD++G+I SEDIFCAKCGSK+LS++NDIILCDG CDRGFHQ+CLEPPL   DIPPDDEGWLCPGCDCKDD LDLLN+  
Subjt:  IRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQ

Query:  GSNLSITDGWEKVYPEAAAA---AGRNSDDTLGLPSDDSEDGDYDPDIPDTIDQDNELSSD--ESSSDQSNSDTSGYASASEGL-----EVPPNDDQYLG
        G+  S++D WEK++PEAAAA    G+N D    LPSDDS+D +YDPD  +  + D + S D  ES ++  +SD + + SAS+ +     E        + 
Subjt:  GSNLSITDGWEKVYPEAAAA---AGRNSDDTLGLPSDDSEDGDYDPDIPDTIDQDNELSSD--ESSSDQSNSDTSGYASASEGL-----EVPPNDDQYLG

Query:  LPSDDSEDNDYDPSVPELDEGDRQESSSSDFTSDSEDLAAL----ENNCSSKMMTFSGPSKSTLHNELSSLLDS--GLDKDGLEPISGRRQVERLDYKKL
        LPSDDSED+DYDP  P  D  D +ESS+SD TSD+EDL       E N  ++      P + T   +  ++L+S  GLD DG   +S RR VERLDYKKL
Subjt:  LPSDDSEDNDYDPSVPELDEGDRQESSSSDFTSDSEDLAAL----ENNCSSKMMTFSGPSKSTLHNELSSLLDS--GLDKDGLEPISGRRQVERLDYKKL

Query:  HDETYGNVPTESSDDTYGSTLDSSDDRGCDSGTRKRGPKTLVLALSNNGSNDDLTNV--KTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQCT
        +DE Y NVPT SSDD      D +   G +    +    T+ L  S+N  +     +  K+KR+ K+ T + P          E P +   S    +  +
Subjt:  HDETYGNVPTESSDDTYGSTLDSSDDRGCDSGTRKRGPKTLVLALSNNGSNDDLTNV--KTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQCT

Query:  SSSNRRLSQPALERLFASFQENEYPKRATKESLAQELGLNLKQVSKWFENTRWS
        SSS  + + P  +RL+ SFQEN+YP +ATKESLA+EL + +KQV+ WF++ RWS
Subjt:  SSSNRRLSQPALERLFASFQENEYPKRATKESLAQELGLNLKQVSKWFENTRWS

Q8H991 Homeobox protein HAZ12.1e-10539.83Show/hide
Query:  TQITSIQSLETIPRFFKSKKKNYKLRSLVSSDRVLRSRTQEKAKAPEPSNDLNNFTAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQS
        T I        + R  K +K++  LR   S  RVLRS +++K KA    N+L N  A  +   KK+K       G   D+Y  IR  +RY+LNR+ YEQS
Subjt:  TQITSIQSLETIPRFFKSKKKNYKLRSLVSSDRVLRSRTQEKAKAPEPSNDLNNFTAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQS

Query:  LIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQF
        LI+AY+SEGWKG S +K++PEKEL+RA  EI+R K +IR+ F+ +D+L +EG+L ES+FDS G+I SEDIFCA CGSK+++L+NDIILCDGICDRGFHQ+
Subjt:  LIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQF

Query:  CLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAA-AAGRNSDDTLGLPSDDSEDGDYDPDIPDTIDQDNE--------
        CL PPLL  DIP  DEGWLCP CDCK DC+D+LNE QG  LSI D WEKV+PEAA+   G    D   LPSDDS D DYDP +      D E        
Subjt:  CLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAA-AAGRNSDDTLGLPSDDSEDGDYDPDIPDTIDQDNE--------

Query:  --LSSDESSSDQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSEDNDYDPSVPELDEGDRQESSS-----SDFTSDSEDLAA-LENNCSSKMMTFSGPS
          L SD+SSS+ S S     +  S+       DD  LGLPS+DSED D+DP+ P+ D+    ES+S     SDFTSDS+D  A +  +C       SGPS
Subjt:  --LSSDESSSDQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSEDNDYDPSVPELDEGDRQESSS-----SDFTSDSEDLAA-LENNCSSKMMTFSGPS

Query:  KSTL-----------------HNELSSLLDSGLDKDGLEPISGRRQVERLDYKKLHDETYGNVPTESSDDT--YGSTLDSSDDRGCDSGTRKRGPKTLVL
         S +                  N   + +++ L++D + PIS +RQVERLDYKKL++E YG   ++SSDD   YG   +S+ ++G          +T  L
Subjt:  KSTL-----------------HNELSSLLDSGLDKDGLEPISGRRQVERLDYKKLHDETYGNVPTESSDDT--YGSTLDSSDDRGCDSGTRKRGPKTLVL

Query:  ALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQCTSSSNRRLSQPALERLFASFQENEYPKRATKESLAQELGLNLKQVS
        A S  G          +      T   P  +    SV++   +   S+S+    +++ NR       ++L A F+E+ YP RATKE+LAQELGL   QV+
Subjt:  ALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQCTSSSNRRLSQPALERLFASFQENEYPKRATKESLAQELGLNLKQVS

Query:  KWFENTRWSTRHPSSGGKKAKSSSRMSIHLSQASGELSKNEQESATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDKKLTTRKTKRGESSATKSRKRK
        KWF +TR                     H ++ +    +N  E+ T   + ++N      L  +N +V+  ++    +++  T ++   E +  +S    
Subjt:  KWFENTRWSTRHPSSGGKKAKSSSRMSIHLSQASGELSKNEQESATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDKKLTTRKTKRGESSATKSRKRK

Query:  GRS
        G+S
Subjt:  GRS

Arabidopsis top hitse value%identityAlignment
AT3G19510.1 Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain1.3e-12349.64Show/hide
Query:  RTQEKAKAPEPSNDLNNFTAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLK
        R Q   +   PS+ + N T    G+ KKK K   +G+    DEY+ I+  LRY LNRI YEQSLI+AYS EGWKG S +K++PEKEL+RA+ EI+RRKLK
Subjt:  RTQEKAKAPEPSNDLNNFTAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLK

Query:  IRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQ
        IRDLFQ +DTLCAEG L ESLFD++G+I SEDIFCAKCGSK+LS++NDIILCDG CDRGFHQ+CLEPPL   DIPPDDEGWLCPGCDCKDD LDLLN+  
Subjt:  IRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQ

Query:  GSNLSITDGWEKVYPEAAAA---AGRNSDDTLGLPSDDSEDGDYDPDIPDTIDQDNELSSD--ESSSDQSNSDTSGYASASEGL-----EVPPNDDQYLG
        G+  S++D WEK++PEAAAA    G+N D    LPSDDS+D +YDPD  +  + D + S D  ES ++  +SD + + SAS+ +     E        + 
Subjt:  GSNLSITDGWEKVYPEAAAA---AGRNSDDTLGLPSDDSEDGDYDPDIPDTIDQDNELSSD--ESSSDQSNSDTSGYASASEGL-----EVPPNDDQYLG

Query:  LPSDDSEDNDYDPSVPELDEGDRQESSSSDFTSDSEDLAAL----ENNCSSKMMTFSGPSKSTLHNELSSLLDS--GLDKDGLEPISGRRQVERLDYKKL
        LPSDDSED+DYDP  P  D  D +ESS+SD TSD+EDL       E N  ++      P + T   +  ++L+S  GLD DG   +S RR VERLDYKKL
Subjt:  LPSDDSEDNDYDPSVPELDEGDRQESSSSDFTSDSEDLAAL----ENNCSSKMMTFSGPSKSTLHNELSSLLDS--GLDKDGLEPISGRRQVERLDYKKL

Query:  HDETYGNVPTESSDDTYGSTLDSSDDRGCDSGTRKRGPKTLVLALSNNGSNDDLTNV--KTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQCT
        +DE Y NVPT SSDD      D +   G +    +    T+ L  S+N  +     +  K+KR+ K+ T + P          E P +   S    +  +
Subjt:  HDETYGNVPTESSDDTYGSTLDSSDDRGCDSGTRKRGPKTLVLALSNNGSNDDLTNV--KTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQCT

Query:  SSSNRRLSQPALERLFASFQENEYPKRATKESLAQELGLNLKQVSKWFENTRWS
        SSS  + + P  +RL+ SFQEN+YP +ATKESLA+EL + +KQV+ WF++ RWS
Subjt:  SSSNRRLSQPALERLFASFQENEYPKRATKESLAQELGLNLKQVSKWFENTRWS

AT4G29940.1 pathogenesis related homeodomain protein A2.3e-5129.86Show/hide
Query:  SKKKNYKLRSLVSSDRVL-RSRTQEKAKAPEPSNDLNNFTAEEEGK--RKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFS
        S K+N K     + + +  +SRT++ ++      ++     EE+ K  RK+K KR  +     VD+   ++   RYLL +++ +Q+LI+AY++EGWKG S
Subjt:  SKKKNYKLRSLVSSDRVL-RSRTQEKAKAPEPSNDLNNFTAEEEGK--RKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFS

Query:  SDKLKPEKELQRASNEIMRRKLKIRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPD
         +K++P+KEL+RA  EI+  KL +RD  +++D L + G + E +  S+G I  + IFCA+C S+E   +NDIILCDG C+R FHQ CL+PPL    IPP 
Subjt:  SDKLKPEKELQRASNEIMRRKLKIRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPD

Query:  DEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAA---AAGRNSDDTLGLPSDDSEDGDYDPDIPDTIDQDNELSSDESSSDQSNSDTSGYAS
        D+GW C  CDCK + +D +N   G++  +   W+ ++ E A+    +    ++    PSDDS+D DYDP++               +   ++S+ SG   
Subjt:  DEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAA---AAGRNSDDTLGLPSDDSEDGDYDPDIPDTIDQDNELSSDESSSDQSNSDTSGYAS

Query:  ASEGLEVPPNDDQYLGLPSDDSEDNDYDPSVPELDEGDRQESSSSDFTSDSEDLAALENNCSSKMMTFSGPSKSTLHNELSSLLDSGLDKDGLEPISGRR
                           D   DND              E S S   S S D  AL                S   + LS++++   +    E + G R
Subjt:  ASEGLEVPPNDDQYLGLPSDDSEDNDYDPSVPELDEGDRQESSSSDFTSDSEDLAALENNCSSKMMTFSGPSKSTLHNELSSLLDSGLDKDGLEPISGRR

Query:  QVERLDYKKLHDETYGN----VPTESSDDTYGSTLDSSDDRGCDSGTRKRGPKTLVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVD
        Q   +DY +L+ E +G         S D+ +G        R  D+G+      TLV    ++  + D                          V ET   
Subjt:  QVERLDYKKLHDETYGN----VPTESSDDTYGSTLDSSDDRGCDSGTRKRGPKTLVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVD

Query:  TAKSSSSVRQCTSSSNR-RLSQPALERLFASFQENEYPKRATKESLAQELGLNLKQVSKWFENTRW
        + + S SV          RL + A+E+L   F E E P +A ++ LA+EL L+ ++V+KWF+NTR+
Subjt:  TAKSSSSVRQCTSSSNR-RLSQPALERLFASFQENEYPKRATKESLAQELGLNLKQVSKWFENTRW

AT4G29940.2 pathogenesis related homeodomain protein A2.3e-5129.86Show/hide
Query:  SKKKNYKLRSLVSSDRVL-RSRTQEKAKAPEPSNDLNNFTAEEEGK--RKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFS
        S K+N K     + + +  +SRT++ ++      ++     EE+ K  RK+K KR  +     VD+   ++   RYLL +++ +Q+LI+AY++EGWKG S
Subjt:  SKKKNYKLRSLVSSDRVL-RSRTQEKAKAPEPSNDLNNFTAEEEGK--RKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFS

Query:  SDKLKPEKELQRASNEIMRRKLKIRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPD
         +K++P+KEL+RA  EI+  KL +RD  +++D L + G + E +  S+G I  + IFCA+C S+E   +NDIILCDG C+R FHQ CL+PPL    IPP 
Subjt:  SDKLKPEKELQRASNEIMRRKLKIRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPD

Query:  DEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAA---AAGRNSDDTLGLPSDDSEDGDYDPDIPDTIDQDNELSSDESSSDQSNSDTSGYAS
        D+GW C  CDCK + +D +N   G++  +   W+ ++ E A+    +    ++    PSDDS+D DYDP++               +   ++S+ SG   
Subjt:  DEGWLCPGCDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAA---AAGRNSDDTLGLPSDDSEDGDYDPDIPDTIDQDNELSSDESSSDQSNSDTSGYAS

Query:  ASEGLEVPPNDDQYLGLPSDDSEDNDYDPSVPELDEGDRQESSSSDFTSDSEDLAALENNCSSKMMTFSGPSKSTLHNELSSLLDSGLDKDGLEPISGRR
                           D   DND              E S S   S S D  AL                S   + LS++++   +    E + G R
Subjt:  ASEGLEVPPNDDQYLGLPSDDSEDNDYDPSVPELDEGDRQESSSSDFTSDSEDLAALENNCSSKMMTFSGPSKSTLHNELSSLLDSGLDKDGLEPISGRR

Query:  QVERLDYKKLHDETYGN----VPTESSDDTYGSTLDSSDDRGCDSGTRKRGPKTLVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVD
        Q   +DY +L+ E +G         S D+ +G        R  D+G+      TLV    ++  + D                          V ET   
Subjt:  QVERLDYKKLHDETYGN----VPTESSDDTYGSTLDSSDDRGCDSGTRKRGPKTLVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVD

Query:  TAKSSSSVRQCTSSSNR-RLSQPALERLFASFQENEYPKRATKESLAQELGLNLKQVSKWFENTRW
        + + S SV          RL + A+E+L   F E E P +A ++ LA+EL L+ ++V+KWF+NTR+
Subjt:  TAKSSSSVRQCTSSSNR-RLSQPALERLFASFQENEYPKRATKESLAQELGLNLKQVSKWFENTRW

AT5G09790.1 ARABIDOPSIS TRITHORAX-RELATED PROTEIN 57.3e-0538.24Show/hide
Query:  DSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKD
        + E +    ++ C KCGS E   +++++LCD  CDRGFH  CL P ++   I      WLC   DC D
Subjt:  DSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKD

AT5G09790.2 ARABIDOPSIS TRITHORAX-RELATED PROTEIN 57.3e-0538.24Show/hide
Query:  DSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKD
        + E +    ++ C KCGS E   +++++LCD  CDRGFH  CL P ++   I      WLC   DC D
Subjt:  DSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAAAGAGATGAAAATACTGGTACAGAATCAAGACCTAATAATAATGCTGAAGCTGTACAAGAAGCCAAGGCCAGTGACAATATGAAAGAAAGAGATGGAAATAC
TTATACAGAATCAAGACCGAATAATAATGCAGAAGCCGCACAAGAAGGCAAGGCCAGTGACAATATGGAAGAAAGAGATGAAAATACAGATACAGAATCAAGACCTAATA
AAATTGCTGAAGCCGTACAAGAAGCCAAGGCCAGTGTTGAAGTTGAAGTGCGAACTTGTCTTTCAAATGAGCCAATGTATTCAGGTTATCAGGAGTTGGGAACAACTCCA
GAGTTTTCCAGAAAAACTGACGGTCCAGATGAAGAAAAAGCAGGGGTCCAGCAGAATATGGAACTTGGTTCGGGATATTTGCTTAGTGAGTTGTCAGAAAAAGATAATCA
GACCATCTCTAATCATGCTGATAATGATCAAGTTGAAGCTGGCAATTCATTATCTATTGATAAAGATACTAAAAATTTAAAATTGTCTATTGAGGATGAGACAACGACTC
TTCTTAATGAGTGCTCAGAACTTCCTCTTGAAGATGTCACCAAAAATTATATCGAAAAGATGAACCCTCCTATTGAAGATTTAACTCAAATTACTTCTATCCAAAGTTTA
GAAACAATCCCAAGATTTTTTAAATCAAAAAAGAAAAATTATAAGTTAAGGTCCCTTGTAAGCAGTGACAGAGTTTTGCGTTCAAGGACCCAAGAGAAAGCTAAAGCTCC
TGAACCAAGTAATGACTTGAATAATTTTACTGCTGAAGAGGAGGGAAAAAGGAAGAAGAAGAAGAAGAGAAATATACAAGGAAAGGGAGCAAGAGTGGATGAGTATTCAT
CAATCAGGAATCATTTGAGATATTTACTGAACCGCATCAGATATGAACAGAGTTTGATTGAAGCTTATTCTAGTGAAGGCTGGAAAGGGTTCAGCTCAGATAAATTGAAG
CCCGAAAAGGAACTTCAGCGGGCATCAAATGAAATCATGCGACGCAAATTGAAAATAAGAGATCTATTTCAACGTATTGATACACTTTGTGCTGAAGGGAGGCTTTCTGA
ATCTTTGTTCGATTCTGAAGGACAGATAGACAGCGAGGATATATTCTGTGCAAAATGTGGATCCAAAGAACTGTCCCTTGAAAATGACATCATATTATGTGATGGCATTT
GTGATCGTGGGTTCCATCAGTTCTGTTTAGAACCACCTTTGCTAAACACAGACATTCCGCCGGATGATGAGGGATGGCTGTGCCCTGGATGTGATTGCAAAGATGACTGC
TTGGATCTTCTCAATGAATTTCAAGGATCAAATCTTTCAATCACCGATGGTTGGGAGAAAGTCTATCCTGAGGCGGCGGCGGCGGCTGGACGAAATTCTGATGACACCTT
AGGTCTTCCTTCAGATGATTCTGAAGATGGTGATTATGATCCTGATATTCCAGATACTATTGACCAGGACAATGAATTGAGTTCTGATGAATCAAGCTCTGATCAGTCAA
ACTCTGATACATCTGGGTATGCTTCTGCTTCTGAGGGATTAGAGGTTCCACCCAATGATGACCAGTACTTAGGTCTCCCTTCTGATGACTCTGAGGATAATGACTATGAT
CCGAGTGTTCCAGAACTAGATGAGGGTGATAGACAGGAAAGTTCAAGTTCTGACTTTACATCTGATTCTGAGGATCTAGCTGCCCTTGAGAATAACTGTTCTTCCAAGAT
GATGACCTTTTCTGGACCCAGCAAGAGTACACTACATAATGAGTTATCAAGTCTACTAGACTCTGGTCTTGATAAGGATGGTCTTGAACCTATTTCGGGAAGAAGGCAGG
TTGAACGGTTGGATTATAAGAAGCTCCATGACGAGACATATGGGAACGTTCCTACTGAGTCGAGCGATGACACCTACGGGAGTACTTTGGACTCAAGTGATGACAGAGGC
TGTGATAGTGGTACAAGGAAGAGAGGTCCTAAAACTCTGGTTCTTGCATTGTCAAACAATGGATCCAATGATGATTTGACTAATGTAAAAACTAAACGCAGTTATAAGAG
GAGAACTCGTCAAAAGCCAGGTGCTATAAATGTGAATAACTCTGTGACTGAAACTCCTGTAGACACTGCAAAATCTAGTTCTTCTGTTAGGCAATGCACATCATCATCAA
ATAGAAGACTCAGTCAACCTGCATTGGAGAGACTTTTTGCATCATTCCAAGAAAATGAGTATCCCAAACGAGCTACAAAGGAGAGTTTAGCACAAGAACTAGGCCTCAAT
CTGAAGCAGGTTAGCAAATGGTTTGAGAACACGCGATGGAGCACACGCCATCCCTCAAGCGGTGGTAAGAAAGCAAAAAGTTCCTCAAGAATGAGCATTCATTTATCTCA
GGCAAGTGGAGAACTATCCAAGAATGAGCAAGAATCTGCAACATGTTTCAGAGATACTGATAGCAATGGTGCTCGACATCAAGACTTACCAATGGCAAATAGTGTTGTGG
CCTCATGTCAGAGTGGGGATACAGGGGATAAGAAATTGACAACTCGGAAAACTAAAAGAGGAGAATCTTCTGCCACAAAATCCAGAAAACGGAAGGGCAGGTCAGATAAC
ACGGCATCAAATTCAAAAGACAGGGAGGGATCACCAAGGCCTCCTGCCAAGTCACCTAAAGTTAATGAAACGCAAACAGCAGATAGGTTTAAGACAAGGAGGAGGAGGTC
CATTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAGAAAGAGATGAAAATACTGGTACAGAATCAAGACCTAATAATAATGCTGAAGCTGTACAAGAAGCCAAGGCCAGTGACAATATGAAAGAAAGAGATGGAAATAC
TTATACAGAATCAAGACCGAATAATAATGCAGAAGCCGCACAAGAAGGCAAGGCCAGTGACAATATGGAAGAAAGAGATGAAAATACAGATACAGAATCAAGACCTAATA
AAATTGCTGAAGCCGTACAAGAAGCCAAGGCCAGTGTTGAAGTTGAAGTGCGAACTTGTCTTTCAAATGAGCCAATGTATTCAGGTTATCAGGAGTTGGGAACAACTCCA
GAGTTTTCCAGAAAAACTGACGGTCCAGATGAAGAAAAAGCAGGGGTCCAGCAGAATATGGAACTTGGTTCGGGATATTTGCTTAGTGAGTTGTCAGAAAAAGATAATCA
GACCATCTCTAATCATGCTGATAATGATCAAGTTGAAGCTGGCAATTCATTATCTATTGATAAAGATACTAAAAATTTAAAATTGTCTATTGAGGATGAGACAACGACTC
TTCTTAATGAGTGCTCAGAACTTCCTCTTGAAGATGTCACCAAAAATTATATCGAAAAGATGAACCCTCCTATTGAAGATTTAACTCAAATTACTTCTATCCAAAGTTTA
GAAACAATCCCAAGATTTTTTAAATCAAAAAAGAAAAATTATAAGTTAAGGTCCCTTGTAAGCAGTGACAGAGTTTTGCGTTCAAGGACCCAAGAGAAAGCTAAAGCTCC
TGAACCAAGTAATGACTTGAATAATTTTACTGCTGAAGAGGAGGGAAAAAGGAAGAAGAAGAAGAAGAGAAATATACAAGGAAAGGGAGCAAGAGTGGATGAGTATTCAT
CAATCAGGAATCATTTGAGATATTTACTGAACCGCATCAGATATGAACAGAGTTTGATTGAAGCTTATTCTAGTGAAGGCTGGAAAGGGTTCAGCTCAGATAAATTGAAG
CCCGAAAAGGAACTTCAGCGGGCATCAAATGAAATCATGCGACGCAAATTGAAAATAAGAGATCTATTTCAACGTATTGATACACTTTGTGCTGAAGGGAGGCTTTCTGA
ATCTTTGTTCGATTCTGAAGGACAGATAGACAGCGAGGATATATTCTGTGCAAAATGTGGATCCAAAGAACTGTCCCTTGAAAATGACATCATATTATGTGATGGCATTT
GTGATCGTGGGTTCCATCAGTTCTGTTTAGAACCACCTTTGCTAAACACAGACATTCCGCCGGATGATGAGGGATGGCTGTGCCCTGGATGTGATTGCAAAGATGACTGC
TTGGATCTTCTCAATGAATTTCAAGGATCAAATCTTTCAATCACCGATGGTTGGGAGAAAGTCTATCCTGAGGCGGCGGCGGCGGCTGGACGAAATTCTGATGACACCTT
AGGTCTTCCTTCAGATGATTCTGAAGATGGTGATTATGATCCTGATATTCCAGATACTATTGACCAGGACAATGAATTGAGTTCTGATGAATCAAGCTCTGATCAGTCAA
ACTCTGATACATCTGGGTATGCTTCTGCTTCTGAGGGATTAGAGGTTCCACCCAATGATGACCAGTACTTAGGTCTCCCTTCTGATGACTCTGAGGATAATGACTATGAT
CCGAGTGTTCCAGAACTAGATGAGGGTGATAGACAGGAAAGTTCAAGTTCTGACTTTACATCTGATTCTGAGGATCTAGCTGCCCTTGAGAATAACTGTTCTTCCAAGAT
GATGACCTTTTCTGGACCCAGCAAGAGTACACTACATAATGAGTTATCAAGTCTACTAGACTCTGGTCTTGATAAGGATGGTCTTGAACCTATTTCGGGAAGAAGGCAGG
TTGAACGGTTGGATTATAAGAAGCTCCATGACGAGACATATGGGAACGTTCCTACTGAGTCGAGCGATGACACCTACGGGAGTACTTTGGACTCAAGTGATGACAGAGGC
TGTGATAGTGGTACAAGGAAGAGAGGTCCTAAAACTCTGGTTCTTGCATTGTCAAACAATGGATCCAATGATGATTTGACTAATGTAAAAACTAAACGCAGTTATAAGAG
GAGAACTCGTCAAAAGCCAGGTGCTATAAATGTGAATAACTCTGTGACTGAAACTCCTGTAGACACTGCAAAATCTAGTTCTTCTGTTAGGCAATGCACATCATCATCAA
ATAGAAGACTCAGTCAACCTGCATTGGAGAGACTTTTTGCATCATTCCAAGAAAATGAGTATCCCAAACGAGCTACAAAGGAGAGTTTAGCACAAGAACTAGGCCTCAAT
CTGAAGCAGGTTAGCAAATGGTTTGAGAACACGCGATGGAGCACACGCCATCCCTCAAGCGGTGGTAAGAAAGCAAAAAGTTCCTCAAGAATGAGCATTCATTTATCTCA
GGCAAGTGGAGAACTATCCAAGAATGAGCAAGAATCTGCAACATGTTTCAGAGATACTGATAGCAATGGTGCTCGACATCAAGACTTACCAATGGCAAATAGTGTTGTGG
CCTCATGTCAGAGTGGGGATACAGGGGATAAGAAATTGACAACTCGGAAAACTAAAAGAGGAGAATCTTCTGCCACAAAATCCAGAAAACGGAAGGGCAGGTCAGATAAC
ACGGCATCAAATTCAAAAGACAGGGAGGGATCACCAAGGCCTCCTGCCAAGTCACCTAAAGTTAATGAAACGCAAACAGCAGATAGGTTTAAGACAAGGAGGAGGAGGTC
CATTTAG
Protein sequenceShow/hide protein sequence
MEERDENTGTESRPNNNAEAVQEAKASDNMKERDGNTYTESRPNNNAEAAQEGKASDNMEERDENTDTESRPNKIAEAVQEAKASVEVEVRTCLSNEPMYSGYQELGTTP
EFSRKTDGPDEEKAGVQQNMELGSGYLLSELSEKDNQTISNHADNDQVEAGNSLSIDKDTKNLKLSIEDETTTLLNECSELPLEDVTKNYIEKMNPPIEDLTQITSIQSL
ETIPRFFKSKKKNYKLRSLVSSDRVLRSRTQEKAKAPEPSNDLNNFTAEEEGKRKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLK
PEKELQRASNEIMRRKLKIRDLFQRIDTLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDC
LDLLNEFQGSNLSITDGWEKVYPEAAAAAGRNSDDTLGLPSDDSEDGDYDPDIPDTIDQDNELSSDESSSDQSNSDTSGYASASEGLEVPPNDDQYLGLPSDDSEDNDYD
PSVPELDEGDRQESSSSDFTSDSEDLAALENNCSSKMMTFSGPSKSTLHNELSSLLDSGLDKDGLEPISGRRQVERLDYKKLHDETYGNVPTESSDDTYGSTLDSSDDRG
CDSGTRKRGPKTLVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQCTSSSNRRLSQPALERLFASFQENEYPKRATKESLAQELGLN
LKQVSKWFENTRWSTRHPSSGGKKAKSSSRMSIHLSQASGELSKNEQESATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDKKLTTRKTKRGESSATKSRKRKGRSDN
TASNSKDREGSPRPPAKSPKVNETQTADRFKTRRRRSI