; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr020746 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr020746
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionhomeobox protein HAT3.1
Genome locationtig00153554:748418..756698
RNA-Seq ExpressionSgr020746
SyntenySgr020746
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0003677 - DNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001356 - Homeobox domain
IPR001965 - Zinc finger, PHD-type
IPR009057 - Homeobox-like domain superfamily
IPR011011 - Zinc finger, FYVE/PHD-type
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR017970 - Homeobox, conserved site
IPR019786 - Zinc finger, PHD-type, conserved site
IPR019787 - Zinc finger, PHD-finger


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600300.1 Homeobox protein HAZ1, partial [Cucurbita argyrosperma subsp. sororia]1.9e-30869.57Show/hide
Query:  MEERHEYTEPRPNNNSEAVQE--VNAGVEVLTCFSNEQMHSIPDNQELGTTPECTSKNGGQDDEKSGVQQNMTEENKELGLGDV----------------
        MEER EYTE R  N S AVQE   N  VEVLT  +NEQMHS P+  ELGT  + TSK G  D+EK GV+QNM E+ KELGLG+                 
Subjt:  MEERHEYTEPRPNNNSEAVQE--VNAGVEVLTCFSNEQMHSIPDNQELGTTPECTSKNGGQDDEKSGVQQNMTEENKELGLGDV----------------

Query:  --------------------------------------------------------------------------------------------LTESLVSS
                                                                                                    +  SLVSS
Subjt:  --------------------------------------------------------------------------------------------LTESLVSS

Query:  DRVLRSRTQEKAKAPEPSNDLNNFTGGE---GKRKKKNRNIKGKGARADEFSTIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSE
        DRVLRSRTQ+KAKAPEPSNDL+N T GE   GK+KKKNR IKGKGAR DEFS+IRN LRYLVNRIKYEQSLI+AYSSEGWKGFSSDKLKPEKELQRAS+E
Subjt:  DRVLRSRTQEKAKAPEPSNDLNNFTGGE---GKRKKKNRNIKGKGARADEFSTIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSE

Query:  IMRRKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCI
        IMRRKLKIRDLFQ +D+LC+EGR SE+LFDSEGQIDSEDIFC KCGSKELS+ENDIILCDGVCDRGFHQFCLEPPLLN+DIPPDDEGWLCPGCDCKDDCI
Subjt:  IMRRKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCI

Query:  DLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDAPDTIDQE-----DESSSDESSSDESGY--ASASEELEAPPNDDQYLG
        DLLNEFQGSNLSITDGWEKV+PEAAAAAAGR+SDH + LPSDDS+DGDYDPD PD IDQ+     D SSSD+SSSD+SGY  ASASEELEAPPNDDQYLG
Subjt:  DLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDAPDTIDQE-----DESSSDESSSDESGY--ASASEELEAPPNDDQYLG

Query:  LPSDDSEDDDYDPSAPELDERVNQESSTSDFTSDSEDLAALDDNWSSKSDYLMSSSLNSSMSVRNSNGQSSGGGPSKSALHNELLTLLESGSDKDGLEPV
        LPSDDSEDDDYDP AP  DE V QESS+SDFTSDSEDLAAL DN SSK D + SS LN+++ VRNSNGQSSG GP+K+A HN+L +L+ SG D+ GLE V
Subjt:  LPSDDSEDDDYDPSAPELDERVNQESSTSDFTSDSEDLAALDDNWSSKSDYLMSSSLNSSMSVRNSNGQSSGGGPSKSALHNELLTLLESGSDKDGLEPV

Query:  SGRRQVERLDYKKLHDETYGNVSTDSSDDTYGSVSIDSSDDRGWGGRTRKRSPKNLVPALSKNGTNDDLNNINAKRSSKRRTHQKPAAENINNSVTRTLE
        SGRR VERLDYKKLHDET+GNV TDSSDDTYGS SIDSSDDRG G  TRK SPKN VPALS+NGT DDL NI  KRSSK RT QKPAAEN++NSVT+T E
Subjt:  SGRRQVERLDYKKLHDETYGNVSTDSSDDTYGSVSIDSSDDRGWGGRTRKRSPKNLVPALSKNGTNDDLNNINAKRSSKRRTHQKPAAENINNSVTRTLE

Query:  DSVKSSSSVRRPTSSSHRRLSQPALERLFASFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSTEPNIAKIASRMGIQSSQTSGKLPKPE
         ++KSSSSVRR TSSSHRRLSQP LERL ASFQENQYPERATKESLA+ELGLSLKQVSKWFENTRWSTRHPSS E N AK  SRMG QSS+TS K PKPE
Subjt:  DSVKSSSSVRRPTSSSHRRLSQPALERLFASFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSTEPNIAKIASRMGIQSSQTSGKLPKPE

Query:  QESGACFRDIDNNGAQHQELPKANSVVAPCQSGDTRDDKLATQKTIRPESTATKSRKRKGRSDPMASRTKDRKEPQKPSAKSPK
        QESGACFRDI +NGAQHQE PKA SVVAPCQSG T DDKLA QK  RPESTATKSRKRKGRSD +ASR+KDRK+ +KP AKS K
Subjt:  QESGACFRDIDNNGAQHQELPKANSVVAPCQSGDTRDDKLATQKTIRPESTATKSRKRKGRSDPMASRTKDRKEPQKPSAKSPK

KAG7030959.1 Homeobox protein HAZ1 [Cucurbita argyrosperma subsp. argyrosperma]6.7e-30969.68Show/hide
Query:  MEERHEYTEPRPNNNSEAVQE--VNAGVEVLTCFSNEQMHSIPDNQELGTTPECTSKNGGQDDEKSGVQQNMTEENKELGLGDV----------------
        MEER EYTE R  N S AVQE   N  VEVLT  +NEQMHS P+  ELGT  + TSK G  D+EK GV+QNM E+ KELGLG+                 
Subjt:  MEERHEYTEPRPNNNSEAVQE--VNAGVEVLTCFSNEQMHSIPDNQELGTTPECTSKNGGQDDEKSGVQQNMTEENKELGLGDV----------------

Query:  --------------------------------------------------------------------------------------------LTESLVSS
                                                                                                    +  SLVSS
Subjt:  --------------------------------------------------------------------------------------------LTESLVSS

Query:  DRVLRSRTQEKAKAPEPSNDLNNFTGGE---GKRKKKNRNIKGKGARADEFSTIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSE
        DRVLRSRTQ+KAKAPEPSNDL+N T GE   GK+KKKNR IKGKGAR DEFS+IRN LRYLVNRIKYEQSLI+AYSSEGWKGFSSDKLKPEKELQRAS+E
Subjt:  DRVLRSRTQEKAKAPEPSNDLNNFTGGE---GKRKKKNRNIKGKGARADEFSTIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSE

Query:  IMRRKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCI
        IMRRKLKIRDLFQ +D+LC+EGR SE+LFDSEGQIDSEDIFC KCGSKELS+ENDIILCDGVCDRGFHQFCLEPPLLN+DIPPDDEGWLCPGCDCKDDCI
Subjt:  IMRRKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCI

Query:  DLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDAPDTIDQE-----DESSSDESSSDESGY--ASASEELEAPPNDDQYLG
        DLLNEFQGSNLSITDGWEKV+PEAAAAAAGR+SDH + LPSDDS+DGDYDPD PD IDQ+     D SSSD+SSSD+SGY  ASASEELEAPPNDDQYLG
Subjt:  DLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDAPDTIDQE-----DESSSDESSSDESGY--ASASEELEAPPNDDQYLG

Query:  LPSDDSEDDDYDPSAPELDERVNQESSTSDFTSDSEDLAALDDNWSSKSDYLMSSSLNSSMSVRNSNGQSSGGGPSKSALHNELLTLLESGSDKDGLEPV
        LPSDDSEDDDYDP AP  DE V QESS+SDFTSDSEDLAAL DN SSK D + SS LN+++ VRNSNGQSSG GP+K+A HN+L +L+ SG D+ GLE V
Subjt:  LPSDDSEDDDYDPSAPELDERVNQESSTSDFTSDSEDLAALDDNWSSKSDYLMSSSLNSSMSVRNSNGQSSGGGPSKSALHNELLTLLESGSDKDGLEPV

Query:  SGRRQVERLDYKKLHDETYGNVSTDSSDDTYGSVSIDSSDDRGWGGRTRKRSPKNLVPALSKNGTNDDLNNINAKRSSKRRTHQKPAAENINNSVTRTLE
        SGRR VERLDYKKLHDET+GNV TDSSDDTYGS SIDSSDDRG G  TRK SPKN VPALS+NGT DDL NI  KRSSK RT QKPAAEN++NSVT+T E
Subjt:  SGRRQVERLDYKKLHDETYGNVSTDSSDDTYGSVSIDSSDDRGWGGRTRKRSPKNLVPALSKNGTNDDLNNINAKRSSKRRTHQKPAAENINNSVTRTLE

Query:  DSVKSSSSVRRPTSSSHRRLSQPALERLFASFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSTEPNIAKIASRMGIQSSQTSGKLPKPE
         ++KSSSSVRR TSSSHRRLSQP LERL ASFQENQYPERATKESLA+ELGLSLKQVSKWFENTRWSTRHPSS E N AK  SRMG QSSQTS K PKPE
Subjt:  DSVKSSSSVRRPTSSSHRRLSQPALERLFASFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSTEPNIAKIASRMGIQSSQTSGKLPKPE

Query:  QESGACFRDIDNNGAQHQELPKANSVVAPCQSGDTRDDKLATQKTIRPESTATKSRKRKGRSDPMASRTKDRKEPQKPSAKSPK
        QESGACFRDI +NGAQHQE PKA SVVAPCQSG T DDKLA QK  RPESTATKSRKRKGRSD +ASR+KDRK+ +KP AKS K
Subjt:  QESGACFRDIDNNGAQHQELPKANSVVAPCQSGDTRDDKLATQKTIRPESTATKSRKRKGRSDPMASRTKDRKEPQKPSAKSPK

XP_022149322.1 homeobox protein HAT3.1 isoform X1 [Momordica charantia]0.0e+0075.66Show/hide
Query:  MEERHEYTEPRPNNNSEAVQEVNAGVEVLTCFSNEQMHSIPDNQELGTTPECTSKNGGQDDEKSGVQQNMTEENKELGLGDVLTE---------------
        MEERHEYTEPRPNNN EAVQE  A VEVLTCFSNEQMHSIPDNQELGTTPECTSK  G DDEKSGVQQNM EE KELG GDVL+E               
Subjt:  MEERHEYTEPRPNNNSEAVQEVNAGVEVLTCFSNEQMHSIPDNQELGTTPECTSKNGGQDDEKSGVQQNMTEENKELGLGDVLTE---------------

Query:  ------------------------------------------------------------------------------------------------SLVS
                                                                                                        SLVS
Subjt:  ------------------------------------------------------------------------------------------------SLVS

Query:  SDRVLRSRTQEKAKAPEPSNDLNNFTGGEGKRKKKNRNIKGKGARADEFSTIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIM
        SDRVLRSRTQEKAKAPEPSN+LN  T GEGKRKKK RNIKGKGA  DEFS+IRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIM
Subjt:  SDRVLRSRTQEKAKAPEPSNDLNNFTGGEGKRKKKNRNIKGKGARADEFSTIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIM

Query:  RRKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCIDL
        R KLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELS+ENDIILCDG+CDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDC+DL
Subjt:  RRKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCIDL

Query:  LNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDAPDTIDQEDESSSDESSSDESGYASASEELEAPPNDDQYLGLPSDDSEDD
        LNEFQGSNLSITDGWEKVYPEAAAAAAG+NSDHALGLPSDDSEDGDYDPDAPDTI+QEDESSSD+SSSDESGYASASEELEA PNDDQYLGLPSDDSEDD
Subjt:  LNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDAPDTIDQEDESSSDESSSDESGYASASEELEAPPNDDQYLGLPSDDSEDD

Query:  DYDPSAPELDERVNQESSTSDFTSDSEDLAALDDNWSSKSDYLMSSSLNSSMSVRNSNGQSSGGGPSKSALHNELLTLLESGSDKDGLEPVSGRRQVERL
        DY+P APELDE V QESS SDFTSDSEDLAALDD                +  VRNSNGQ SG GP  S LHNEL +LLESG DKDGLEPVSGRRQVERL
Subjt:  DYDPSAPELDERVNQESSTSDFTSDSEDLAALDDNWSSKSDYLMSSSLNSSMSVRNSNGQSSGGGPSKSALHNELLTLLESGSDKDGLEPVSGRRQVERL

Query:  DYKKLHDETYGNVSTDSSDDTYGSVSIDSSDDRGWGGRTRKRSPKNLVPALSKNGTNDDLNNINAKRSSKRRTHQKPAAENINNSVTRTLEDSVKSSSSV
        DYKKLHDETYGNV +DSSDDT+GS+SIDSSDDRG G RTRKRSPKNLVPAL  NGTNDDL N   KRS KRRTHQKP AEN+ NSVTRT EDSVKSSSSV
Subjt:  DYKKLHDETYGNVSTDSSDDTYGSVSIDSSDDRGWGGRTRKRSPKNLVPALSKNGTNDDLNNINAKRSSKRRTHQKPAAENINNSVTRTLEDSVKSSSSV

Query:  RRPTSSSHRRLSQPALERLFASFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSTEPNIAKIASRMGIQSSQTSGKLPKPEQESGACFRD
        RR  SSS+RRLSQPALERL ASFQENQYP+RATKESLAQELGLSLKQVSKWFENTRWSTRHPSS E N AK A RMGIQSS+TSGKLPKPEQESGACFRD
Subjt:  RRPTSSSHRRLSQPALERLFASFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSTEPNIAKIASRMGIQSSQTSGKLPKPEQESGACFRD

Query:  IDNNGAQHQELPKANSVVAPCQSGDTRDDKLATQKTIRPESTATKSRKRKGRSDPMASRTKDRKEPQKPSAKSPK
         DNNGAQHQ  P  +  VAPCQSGDTRDDKLATQKT RPESTATKSRKRKGRSD +AS +KDRKE QKP AKSPK
Subjt:  IDNNGAQHQELPKANSVVAPCQSGDTRDDKLATQKTIRPESTATKSRKRKGRSDPMASRTKDRKEPQKPSAKSPK

XP_022942376.1 homeobox protein HAT3.1 [Cucurbita moschata]6.8e-30669Show/hide
Query:  MEERHEYTEPRPNNNSEAVQEVNAG--VEVLTCFSNEQMHSIPDNQELGTTPECTSKNGGQDDEKSGVQQNMTEENKELGLGDV----------------
        MEER EYTE R  N S AVQE  A   VEVLT  +NEQMHS P+  ELGT  + TSK G  D+EK GV+QNM E+ KELGLG+                 
Subjt:  MEERHEYTEPRPNNNSEAVQEVNAG--VEVLTCFSNEQMHSIPDNQELGTTPECTSKNGGQDDEKSGVQQNMTEENKELGLGDV----------------

Query:  --------------------------------------------------------------------------------------------LTESLVSS
                                                                                                    +  SL+SS
Subjt:  --------------------------------------------------------------------------------------------LTESLVSS

Query:  DRVLRSRTQEKAKAPEPSNDLNNFTGG-EGKRKKKNRNIKGKGARADEFSTIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIM
        DRVLRSRTQ+KAKAPEPSNDL+N T G EGK KKKNR IKGKGAR DEFS+IRN LRYLVNRIKYEQSLI+AYSSEGWKGFSSDKLKPEKELQRAS+EIM
Subjt:  DRVLRSRTQEKAKAPEPSNDLNNFTGG-EGKRKKKNRNIKGKGARADEFSTIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIM

Query:  RRKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCIDL
        RRKLKIRDLFQ +D+LC+EGR SE+LFDSEGQIDSEDIFC KCGSKELS+ENDIILCDGVCDRGFHQFCLEPPLLN+DIPPDDEGWLCPGCDCKDDCIDL
Subjt:  RRKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCIDL

Query:  LNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDAPDTIDQE----------DESSSDESSSDESGY--ASASEELEAPPNDDQ
        LNEFQGSNLSITDGWEKV+PEAAAAAAG++SDH + LPSDDS+DGDYDPD PD IDQ+          D+SSSD SSSD+SGY  ASASEELEAPPNDDQ
Subjt:  LNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDAPDTIDQE----------DESSSDESSSDESGY--ASASEELEAPPNDDQ

Query:  YLGLPSDDSEDDDYDPSAPELDERVNQESSTSDFTSDSEDLAALDDNWSSKSDYLMSSSLNSSMSVRNSNGQSSGGGPSKSALHNELLTLLESGSDKDGL
        YLGLPSDDSEDDDYDP AP  DE V QESS+SDFTSDSEDLAAL DN SSK D + SS LN+++ VRNS+GQSSG GP+K+A HN+L +L+ SG D+ GL
Subjt:  YLGLPSDDSEDDDYDPSAPELDERVNQESSTSDFTSDSEDLAALDDNWSSKSDYLMSSSLNSSMSVRNSNGQSSGGGPSKSALHNELLTLLESGSDKDGL

Query:  EPVSGRRQVERLDYKKLHDETYGNVSTDSSDDTYGSVSIDSSDDRGWGGRTRKRSPKNLVPALSKNGTNDDLNNINAKRSSKRRTHQKPAAENINNSVTR
        E VSGRR VERLDYKKLHDET+GNV TDSSDDTYGS SIDSSDDRG G  TRK SPKN VPALS+NGT DDL NI  KRSSK RT QKPAAEN++NSVT+
Subjt:  EPVSGRRQVERLDYKKLHDETYGNVSTDSSDDTYGSVSIDSSDDRGWGGRTRKRSPKNLVPALSKNGTNDDLNNINAKRSSKRRTHQKPAAENINNSVTR

Query:  TLEDSVKSSSSVRRPTSSSHRRLSQPALERLFASFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSTEPNIAKIASRMGIQSSQTSGKLP
        T E ++KSSSSVRR TSSSHRRLSQP LERL ASFQENQYPERATKESLA+ELGLSLKQVSKWFENTRWSTRHPSS E N AK ASRMG QSSQTS K P
Subjt:  TLEDSVKSSSSVRRPTSSSHRRLSQPALERLFASFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSTEPNIAKIASRMGIQSSQTSGKLP

Query:  KPEQESGACFRDIDNNGAQHQELPKANSVVAPCQSGDTRDDKLATQKTIRPESTATKSRKRKGRSDPMASRTKDRKEPQKPSAKSPK
        KPEQESGACFRD  +NGAQHQE PKA SVVAPCQSG T DDKLA QK  RPES ATKSRKRKGRSD +ASR+KDRK+ +KP AKS K
Subjt:  KPEQESGACFRDIDNNGAQHQELPKANSVVAPCQSGDTRDDKLATQKTIRPESTATKSRKRKGRSDPMASRTKDRKEPQKPSAKSPK

XP_023531864.1 homeobox protein HAT3.1 isoform X1 [Cucurbita pepo subsp. pepo]1.2e-30568.84Show/hide
Query:  MEERHEYTEPRPNNNSEAVQEVNAG--VEVLTCFSNEQMHSIPDNQELGTTPECTSKNGGQDDEKSGVQQNMTEENKELGLGDV----------------
        MEER EYTE R  N S AVQE  A   VEVLT  +NEQMHS P+  ELGT  + TSK G  D+EK GV+QNM E+ +ELGLG+                 
Subjt:  MEERHEYTEPRPNNNSEAVQEVNAG--VEVLTCFSNEQMHSIPDNQELGTTPECTSKNGGQDDEKSGVQQNMTEENKELGLGDV----------------

Query:  --------------------------------------------------------------------------------------------LTESLVSS
                                                                                                    +  SLVSS
Subjt:  --------------------------------------------------------------------------------------------LTESLVSS

Query:  DRVLRSRTQEKAKAPEPSNDLNNFTGGE---GKRKKKNRNIKGKGARADEFSTIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSE
        DRVLRSRTQ+KAKAPEPSNDL+N T GE   GK+KKKNR IKGKGAR DEFS+IRN LRYLVNRIKYEQSLI+AYSSEGWKGFSSDKLKPEKELQRAS+E
Subjt:  DRVLRSRTQEKAKAPEPSNDLNNFTGGE---GKRKKKNRNIKGKGARADEFSTIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSE

Query:  IMRRKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCI
        IMRRKLKIRDLFQ +D+LC+EGR SE+LFDSEGQIDSEDIFC KCGSKELS+ENDIILCDGVCDRGFHQFCLEPPLLN+DIPPDDEGWLCPGCDCKDDCI
Subjt:  IMRRKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCI

Query:  DLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDAPDTIDQE----------DESSSDESSSDESGY--ASASEELEAPPND
        DLLNEFQGSNLSITDGWEKV+PEAAAAAAGR+SDH + LPSDDS+DGDYDPD PD IDQ+          D+SSSD SSSD+SGY  ASASEELEAPPND
Subjt:  DLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDAPDTIDQE----------DESSSDESSSDESGY--ASASEELEAPPND

Query:  DQYLGLPSDDSEDDDYDPSAPELDERVNQESSTSDFTSDSEDLAALDDNWSSKSDYLMSSSLNSSMSVRNSNGQSSGGGPSKSALHNELLTLLESGSDKD
        DQYLGLPSDDSEDDDYDP AP  DE V QESS+SDFTSDSEDLAAL +N SSK D + SS LN+++ VRNSNGQSSG GP+KSA HN+L +L+ SG D+ 
Subjt:  DQYLGLPSDDSEDDDYDPSAPELDERVNQESSTSDFTSDSEDLAALDDNWSSKSDYLMSSSLNSSMSVRNSNGQSSGGGPSKSALHNELLTLLESGSDKD

Query:  GLEPVSGRRQVERLDYKKLHDETYGNVSTDSSDDTYGSVSIDSSDDRGWGGRTRKRSPKNLVPALSKNGTNDDLNNINAKRSSKRRTHQKPAAENINNSV
        GLE VSGRR VERLDYKKLHDET+GNV TDSSDDTYGS SIDSSDDRG G  TRK SPKN VPALS+NGT DDL NI  K SSK RT QKPAAEN++NSV
Subjt:  GLEPVSGRRQVERLDYKKLHDETYGNVSTDSSDDTYGSVSIDSSDDRGWGGRTRKRSPKNLVPALSKNGTNDDLNNINAKRSSKRRTHQKPAAENINNSV

Query:  TRTLEDSVKSSSSVRRPTSSSHRRLSQPALERLFASFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSTEPNIAKIASRMGIQSSQTSGK
        T+T E ++KSSSSVRR TSSSHRRLSQP LERL ASFQENQYPERATKESLA+ELGLSLKQVSKWFENTRWSTRHPSS E N A+ ASRMG QSSQTS K
Subjt:  TRTLEDSVKSSSSVRRPTSSSHRRLSQPALERLFASFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSTEPNIAKIASRMGIQSSQTSGK

Query:  LPKPEQESGACFRDIDNNGAQHQELPKANSVVAPCQSGDTRDDKLATQKTIRPESTATKSRKRKGRSDPMASRTKDRKEPQKPSAKSPK
         PKPEQESGACFRD  +NGAQHQE PKA SVVAPCQSG T DDK A Q+T RPESTATKSRKRKGRSD +ASR+KDRK+ +KP AKS K
Subjt:  LPKPEQESGACFRDIDNNGAQHQELPKANSVVAPCQSGDTRDDKLATQKTIRPESTATKSRKRKGRSDPMASRTKDRKEPQKPSAKSPK

TrEMBL top hitse value%identityAlignment
A0A1S3C283 pathogenesis-related homeodomain protein9.0e-29668.68Show/hide
Query:  MEERHEY--TEPRPNNNSEAVQEVNAG--VEVLTCFSNEQMHSIPDNQELGTTPECTSKNGGQDDEKSGVQQNM--------------------------
        MEER E   TE RPN  +EAVQE  A   VEV TC SNE M+S    QELGTTPE + K  G D+EK+GVQQNM                          
Subjt:  MEERHEY--TEPRPNNNSEAVQEVNAG--VEVLTCFSNEQMHSIPDNQELGTTPECTSKNGGQDDEKSGVQQNM--------------------------

Query:  ----------------------------TEENKELGLGDV--------------LTE-----------------------------------SLVSSDRV
                                      E  EL L DV              LT+                                   SLVSSDRV
Subjt:  ----------------------------TEENKELGLGDV--------------LTE-----------------------------------SLVSSDRV

Query:  LRSRTQEKAKAPEPSNDLNNFTG-GEGKR-KKKNRNIKGKGARADEFSTIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIMRR
        LRSRTQEKAKAPEPSNDLNNFT   EGKR KKK RNI+GKGAR DE+S+IRN LRYL+NRI+YEQSLI+AYSSEGWKGFSSDKLKPEKELQRAS+EIMRR
Subjt:  LRSRTQEKAKAPEPSNDLNNFTG-GEGKR-KKKNRNIKGKGARADEFSTIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIMRR

Query:  KLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCIDLLN
        KLKIRDLFQ +D+LCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELS+ENDIILCDG+CDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDC+DLLN
Subjt:  KLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCIDLLN

Query:  EFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDAPDTIDQE-----DESSSDESSSDESGYASASEELEAPPNDDQYLGLPSDDS
        EFQGSNLSITDGWEKVYPE AAAAAGRNSD  LGLPSDDSEDGDYDPD PDTIDQ+     DESSSD+S+SD SGYASASE LE PPNDDQYLGLPSDDS
Subjt:  EFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDAPDTIDQE-----DESSSDESSSDESGYASASEELEAPPNDDQYLGLPSDDS

Query:  EDDDYDPSAPELDERVNQESSTSDFTSDSEDLAALDDNWSSKSDYLMSSSLNSSMSVRNSNGQSSGGGPSKSALHNELLTLLESGSDKDGLEPVSGRRQV
        ED+DYDPS PELDE   QESS+SDFTSDSEDLAAL++N SSK D L+ SSLN+++ V+N+NG+SS  GPSKS LHNEL +LL+SG DKDGLEP+SGRRQV
Subjt:  EDDDYDPSAPELDERVNQESSTSDFTSDSEDLAALDDNWSSKSDYLMSSSLNSSMSVRNSNGQSSGGGPSKSALHNELLTLLESGSDKDGLEPVSGRRQV

Query:  ERLDYKKLHDETYGNVSTDSSDDTYGSVSIDSSDDRGWGGRTRKRSPKNLVPALSKNGTNDDLNNINAKRSSKRRTHQKPAAENINNSVTRTLEDSVKSS
        ERLDYKKLHDETYGNV T+SSDDTYGS ++DSSDDRG    TRKR PK LV ALS NG+NDDL N+  KRS KRRT QKP A N+NNSVT T  D+ KSS
Subjt:  ERLDYKKLHDETYGNVSTDSSDDTYGSVSIDSSDDRGWGGRTRKRSPKNLVPALSKNGTNDDLNNINAKRSSKRRTHQKPAAENINNSVTRTLEDSVKSS

Query:  SSVRRPTSSSHRRLSQPALERLFASFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSTEPNIAKIASRMGIQSSQTSGKLPKPEQESGAC
        SSVR+ TSSS+RRLSQPALERLFASFQEN+YP+RATKESLAQELGL+LKQVSKWFENTRWSTRHPSS     AK +SRM I  SQ SG+L K EQES  C
Subjt:  SSVRRPTSSSHRRLSQPALERLFASFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSTEPNIAKIASRMGIQSSQTSGKLPKPEQESGAC

Query:  FRDIDNNGAQHQELPKANSVVAPCQSGDTRDDKLATQKTIRPESTATKSRKRKGRSDPMASRTKDRKEPQKPSAKSPK
        FRD D+NGA+HQ+LP ANSVVA CQSGDT D KL T+KT R ES+ATKSRKRKGRSD  AS +KDR+   +P AKSPK
Subjt:  FRDIDNNGAQHQELPKANSVVAPCQSGDTRDDKLATQKTIRPESTATKSRKRKGRSDPMASRTKDRKEPQKPSAKSPK

A0A6J1D6Q5 homeobox protein HAT3.1 isoform X10.0e+0075.66Show/hide
Query:  MEERHEYTEPRPNNNSEAVQEVNAGVEVLTCFSNEQMHSIPDNQELGTTPECTSKNGGQDDEKSGVQQNMTEENKELGLGDVLTE---------------
        MEERHEYTEPRPNNN EAVQE  A VEVLTCFSNEQMHSIPDNQELGTTPECTSK  G DDEKSGVQQNM EE KELG GDVL+E               
Subjt:  MEERHEYTEPRPNNNSEAVQEVNAGVEVLTCFSNEQMHSIPDNQELGTTPECTSKNGGQDDEKSGVQQNMTEENKELGLGDVLTE---------------

Query:  ------------------------------------------------------------------------------------------------SLVS
                                                                                                        SLVS
Subjt:  ------------------------------------------------------------------------------------------------SLVS

Query:  SDRVLRSRTQEKAKAPEPSNDLNNFTGGEGKRKKKNRNIKGKGARADEFSTIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIM
        SDRVLRSRTQEKAKAPEPSN+LN  T GEGKRKKK RNIKGKGA  DEFS+IRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIM
Subjt:  SDRVLRSRTQEKAKAPEPSNDLNNFTGGEGKRKKKNRNIKGKGARADEFSTIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIM

Query:  RRKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCIDL
        R KLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELS+ENDIILCDG+CDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDC+DL
Subjt:  RRKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCIDL

Query:  LNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDAPDTIDQEDESSSDESSSDESGYASASEELEAPPNDDQYLGLPSDDSEDD
        LNEFQGSNLSITDGWEKVYPEAAAAAAG+NSDHALGLPSDDSEDGDYDPDAPDTI+QEDESSSD+SSSDESGYASASEELEA PNDDQYLGLPSDDSEDD
Subjt:  LNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDAPDTIDQEDESSSDESSSDESGYASASEELEAPPNDDQYLGLPSDDSEDD

Query:  DYDPSAPELDERVNQESSTSDFTSDSEDLAALDDNWSSKSDYLMSSSLNSSMSVRNSNGQSSGGGPSKSALHNELLTLLESGSDKDGLEPVSGRRQVERL
        DY+P APELDE V QESS SDFTSDSEDLAALDD                +  VRNSNGQ SG GP  S LHNEL +LLESG DKDGLEPVSGRRQVERL
Subjt:  DYDPSAPELDERVNQESSTSDFTSDSEDLAALDDNWSSKSDYLMSSSLNSSMSVRNSNGQSSGGGPSKSALHNELLTLLESGSDKDGLEPVSGRRQVERL

Query:  DYKKLHDETYGNVSTDSSDDTYGSVSIDSSDDRGWGGRTRKRSPKNLVPALSKNGTNDDLNNINAKRSSKRRTHQKPAAENINNSVTRTLEDSVKSSSSV
        DYKKLHDETYGNV +DSSDDT+GS+SIDSSDDRG G RTRKRSPKNLVPAL  NGTNDDL N   KRS KRRTHQKP AEN+ NSVTRT EDSVKSSSSV
Subjt:  DYKKLHDETYGNVSTDSSDDTYGSVSIDSSDDRGWGGRTRKRSPKNLVPALSKNGTNDDLNNINAKRSSKRRTHQKPAAENINNSVTRTLEDSVKSSSSV

Query:  RRPTSSSHRRLSQPALERLFASFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSTEPNIAKIASRMGIQSSQTSGKLPKPEQESGACFRD
        RR  SSS+RRLSQPALERL ASFQENQYP+RATKESLAQELGLSLKQVSKWFENTRWSTRHPSS E N AK A RMGIQSS+TSGKLPKPEQESGACFRD
Subjt:  RRPTSSSHRRLSQPALERLFASFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSTEPNIAKIASRMGIQSSQTSGKLPKPEQESGACFRD

Query:  IDNNGAQHQELPKANSVVAPCQSGDTRDDKLATQKTIRPESTATKSRKRKGRSDPMASRTKDRKEPQKPSAKSPK
         DNNGAQHQ  P  +  VAPCQSGDTRDDKLATQKT RPESTATKSRKRKGRSD +AS +KDRKE QKP AKSPK
Subjt:  IDNNGAQHQELPKANSVVAPCQSGDTRDDKLATQKTIRPESTATKSRKRKGRSDPMASRTKDRKEPQKPSAKSPK

A0A6J1E4I6 homeobox protein HAT3.1-like1.0e-29466.97Show/hide
Query:  MEERHEYTEPRPNNNSEAVQEVNAGV--EVLTCFSNEQMHSIPDNQELGTTPECTSKNGGQDDEKSGVQQNMTEENKELGLGDVLTE-------------
        MEER EYTE R NNN+EAVQE    V  E+ TC SNEQ HS+PD  EL  TP  ++K GG D+EK  VQQNM EEN+ELG GDVL E             
Subjt:  MEERHEYTEPRPNNNSEAVQEVNAGV--EVLTCFSNEQMHSIPDNQELGTTPECTSKNGGQDDEKSGVQQNMTEENKELGLGDVLTE-------------

Query:  -----------------------------------------------------------------------------------------------SLVSS
                                                                                                       SLVSS
Subjt:  -----------------------------------------------------------------------------------------------SLVSS

Query:  DRVLRSRTQEKAKAPEPSNDLNNFTGGEGKRKKKNRNIKGKGARADEFSTIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIMR
        DR LRS+TQEK K PEPSNDLNNFT  EGK KKK RNI+GKGAR DEFS+IRN LRYL+NRIKYEQ+LI+AYSSEGWKGFSSDKLKPEKELQRAS+EIMR
Subjt:  DRVLRSRTQEKAKAPEPSNDLNNFTGGEGKRKKKNRNIKGKGARADEFSTIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIMR

Query:  RKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCIDLL
        RKLKIRD+FQ +D+LC EG LS+SLFDS+GQIDSEDIFCAKCGSKELS ENDIILCDG+CDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDC++LL
Subjt:  RKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCIDLL

Query:  NEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDS-EDGDYDPDAPDTIDQEDESSSDESSSDESGYASASEELEAPPNDDQYLGLPSDDSEDD
        NEFQGS LSITDGWEKVYPEAAA+AAGRN DHA GLPSDDS +D DYDPD PDTI Q+     DESS + SGYASASEELE+PPN DQYLGLPSDDSEDD
Subjt:  NEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDS-EDGDYDPDAPDTIDQEDESSSDESSSDESGYASASEELEAPPNDDQYLGLPSDDSEDD

Query:  DYDPSAPELDERVNQESSTSDFTSDSEDLAALDDNWSSKSDYLMSSSLNSSMSVRNSNGQSSGGGPSKSALHNELLTLLESGSDKDGLEPVSGRRQVERL
        DYDPSAPE DE V QESS+SDFTSDSEDLAALD N SSK+D L+S SLN++ S++N +G+SSGGGP KSAL+NEL +LLESG DKDG EPV GRRQVERL
Subjt:  DYDPSAPELDERVNQESSTSDFTSDSEDLAALDDNWSSKSDYLMSSSLNSSMSVRNSNGQSSGGGPSKSALHNELLTLLESGSDKDGLEPVSGRRQVERL

Query:  DYKKLHDETYGNVSTDSSDDTYGSVSIDSSDDRGWGGRTRKRSPKNLVPALSKNGTNDDLNNINAKRSSKRRTHQKPAAENINNSVTRTLEDSVKSSSSV
        DYKKLHDETYGNV TDSSDDTY SVS+DSSDD+GW   TRKRSPK LV AL    TNDDL NI  K SSKR T QK  A N+N SV++T ED+ K+SSSV
Subjt:  DYKKLHDETYGNVSTDSSDDTYGSVSIDSSDDRGWGGRTRKRSPKNLVPALSKNGTNDDLNNINAKRSSKRRTHQKPAAENINNSVTRTLEDSVKSSSSV

Query:  RRPTSSSHRRLSQPALERLFASFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSTEPNIAKIASRMGIQSSQTSGKLPKPEQESGACFRD
        RR T SS+RRLS+ ALERL ASFQENQYPERATKESLAQELGLS+KQVSKWF NTRWSTRHPSS E N AK +SRMGI SSQ SG+L +PEQE       
Subjt:  RRPTSSSHRRLSQPALERLFASFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSTEPNIAKIASRMGIQSSQTSGKLPKPEQESGACFRD

Query:  IDNNGAQHQELPKANSVVAPCQSGDTRDDKLATQKTIRPESTATKSRKRKGRSDPMASRTKDRKEPQKPSAKSPK
            GAQHQELP A+SVVAPCQSGDT D KLATQ+T R E +ATKSRKRKGRSD  AS +KD KE Q+P AKSPK
Subjt:  IDNNGAQHQELPKANSVVAPCQSGDTRDDKLATQKTIRPESTATKSRKRKGRSDPMASRTKDRKEPQKPSAKSPK

A0A6J1FNP3 homeobox protein HAT3.13.3e-30669Show/hide
Query:  MEERHEYTEPRPNNNSEAVQEVNAG--VEVLTCFSNEQMHSIPDNQELGTTPECTSKNGGQDDEKSGVQQNMTEENKELGLGDV----------------
        MEER EYTE R  N S AVQE  A   VEVLT  +NEQMHS P+  ELGT  + TSK G  D+EK GV+QNM E+ KELGLG+                 
Subjt:  MEERHEYTEPRPNNNSEAVQEVNAG--VEVLTCFSNEQMHSIPDNQELGTTPECTSKNGGQDDEKSGVQQNMTEENKELGLGDV----------------

Query:  --------------------------------------------------------------------------------------------LTESLVSS
                                                                                                    +  SL+SS
Subjt:  --------------------------------------------------------------------------------------------LTESLVSS

Query:  DRVLRSRTQEKAKAPEPSNDLNNFTGG-EGKRKKKNRNIKGKGARADEFSTIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIM
        DRVLRSRTQ+KAKAPEPSNDL+N T G EGK KKKNR IKGKGAR DEFS+IRN LRYLVNRIKYEQSLI+AYSSEGWKGFSSDKLKPEKELQRAS+EIM
Subjt:  DRVLRSRTQEKAKAPEPSNDLNNFTGG-EGKRKKKNRNIKGKGARADEFSTIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIM

Query:  RRKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCIDL
        RRKLKIRDLFQ +D+LC+EGR SE+LFDSEGQIDSEDIFC KCGSKELS+ENDIILCDGVCDRGFHQFCLEPPLLN+DIPPDDEGWLCPGCDCKDDCIDL
Subjt:  RRKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCIDL

Query:  LNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDAPDTIDQE----------DESSSDESSSDESGY--ASASEELEAPPNDDQ
        LNEFQGSNLSITDGWEKV+PEAAAAAAG++SDH + LPSDDS+DGDYDPD PD IDQ+          D+SSSD SSSD+SGY  ASASEELEAPPNDDQ
Subjt:  LNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDAPDTIDQE----------DESSSDESSSDESGY--ASASEELEAPPNDDQ

Query:  YLGLPSDDSEDDDYDPSAPELDERVNQESSTSDFTSDSEDLAALDDNWSSKSDYLMSSSLNSSMSVRNSNGQSSGGGPSKSALHNELLTLLESGSDKDGL
        YLGLPSDDSEDDDYDP AP  DE V QESS+SDFTSDSEDLAAL DN SSK D + SS LN+++ VRNS+GQSSG GP+K+A HN+L +L+ SG D+ GL
Subjt:  YLGLPSDDSEDDDYDPSAPELDERVNQESSTSDFTSDSEDLAALDDNWSSKSDYLMSSSLNSSMSVRNSNGQSSGGGPSKSALHNELLTLLESGSDKDGL

Query:  EPVSGRRQVERLDYKKLHDETYGNVSTDSSDDTYGSVSIDSSDDRGWGGRTRKRSPKNLVPALSKNGTNDDLNNINAKRSSKRRTHQKPAAENINNSVTR
        E VSGRR VERLDYKKLHDET+GNV TDSSDDTYGS SIDSSDDRG G  TRK SPKN VPALS+NGT DDL NI  KRSSK RT QKPAAEN++NSVT+
Subjt:  EPVSGRRQVERLDYKKLHDETYGNVSTDSSDDTYGSVSIDSSDDRGWGGRTRKRSPKNLVPALSKNGTNDDLNNINAKRSSKRRTHQKPAAENINNSVTR

Query:  TLEDSVKSSSSVRRPTSSSHRRLSQPALERLFASFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSTEPNIAKIASRMGIQSSQTSGKLP
        T E ++KSSSSVRR TSSSHRRLSQP LERL ASFQENQYPERATKESLA+ELGLSLKQVSKWFENTRWSTRHPSS E N AK ASRMG QSSQTS K P
Subjt:  TLEDSVKSSSSVRRPTSSSHRRLSQPALERLFASFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSTEPNIAKIASRMGIQSSQTSGKLP

Query:  KPEQESGACFRDIDNNGAQHQELPKANSVVAPCQSGDTRDDKLATQKTIRPESTATKSRKRKGRSDPMASRTKDRKEPQKPSAKSPK
        KPEQESGACFRD  +NGAQHQE PKA SVVAPCQSG T DDKLA QK  RPES ATKSRKRKGRSD +ASR+KDRK+ +KP AKS K
Subjt:  KPEQESGACFRDIDNNGAQHQELPKANSVVAPCQSGDTRDDKLATQKTIRPESTATKSRKRKGRSDPMASRTKDRKEPQKPSAKSPK

A0A6J1IPM8 homeobox protein HAT3.1-like5.9e-30368.67Show/hide
Query:  MEERHEYTEPRPNNNSEAVQEVNAG--VEVLTCFSNEQMHSIPDNQELGTTPECTSKNGGQDDEKSGVQQNMTEENKELGLGDVLTE-------------
        MEER EYTE R  + S AVQE  A   VEVLT  +NEQ+ S P+  ELGT  + TSK G  D+EK GV+QNM E++KEL LG+  +E             
Subjt:  MEERHEYTEPRPNNNSEAVQEVNAG--VEVLTCFSNEQMHSIPDNQELGTTPECTSKNGGQDDEKSGVQQNMTEENKELGLGDVLTE-------------

Query:  -----------------------------------------------------------------------------------------------SLVSS
                                                                                                       SLVSS
Subjt:  -----------------------------------------------------------------------------------------------SLVSS

Query:  DRVLRSRTQEKAKAPEPSNDLNNFTGGE---GKRKKKNRNIKGKGARADEFSTIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSE
        DRVLRSRTQ+KAKAPEPSNDL+N T GE   GK++KKNR IKGKGAR DEFS+IRN LRYLVNRIKYEQSLI+AYSSEGWKGFSSDKLKPEKELQRAS+E
Subjt:  DRVLRSRTQEKAKAPEPSNDLNNFTGGE---GKRKKKNRNIKGKGARADEFSTIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSE

Query:  IMRRKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCI
        IMRRKLKIRDLFQ +D+LC+EGR SE+LFDSEGQIDSEDIFC KCGSKELS+ENDIILCDGVCDRGFHQFCLEPPLLN+DIPPDDEGWLCPGCDCKDDCI
Subjt:  IMRRKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCI

Query:  DLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDAPDTIDQE-----DESSSDESSSDESGY--ASASEELEAPPNDDQYLG
        DLLNEFQGSNLSITDGWEKV+PEAAAAAAGR+SDH + LPSDDS+DGDYDPD PD IDQ+     D SSSD+SSSD+SGY  ASASEELEAPPNDDQYLG
Subjt:  DLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDAPDTIDQE-----DESSSDESSSDESGY--ASASEELEAPPNDDQYLG

Query:  LPSDDSEDDDYDPSAPELDERVNQESSTSDFTSDSEDLAALDDNWSSKSDYLMSSSLNSSMSVRNSNGQSSGGGPSKSALHNELLTLLESGSDKDGLEPV
        LPSDDSEDDDYDP AP  DE V QESS+SDFTSDSEDLAAL DN SSK D + SS LN++  VRNSNGQSSG GP+K+A HN+L +L+ SG D+ GLE V
Subjt:  LPSDDSEDDDYDPSAPELDERVNQESSTSDFTSDSEDLAALDDNWSSKSDYLMSSSLNSSMSVRNSNGQSSGGGPSKSALHNELLTLLESGSDKDGLEPV

Query:  SGRRQVERLDYKKLHDETYGNVSTDSSDDTYGSVSIDSSDDRGWGGRTRKRSPKNLVPALSKNGTNDDLNNINAKRSSKRRTHQKPAAENINNSVTRTLE
        SGRR VERLDYKKLHDET+GNV ++SSDDTYGS SIDSSDDRG G  TRK SPKNLVPALS+NGT DD  NI  K SS RRT QKPAAEN++NSVT+T E
Subjt:  SGRRQVERLDYKKLHDETYGNVSTDSSDDTYGSVSIDSSDDRGWGGRTRKRSPKNLVPALSKNGTNDDLNNINAKRSSKRRTHQKPAAENINNSVTRTLE

Query:  DSVKSSSSVRRPTSSSHRRLSQPALERLFASFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSTEPNIAKIASRMGIQSSQTSGKLPKPE
         ++KSSSSVRR TSSSHRRLSQP LERL ASFQENQYPERATKESLA+ELGLSLKQVSKWFENTRWSTRHPSS E N AK ASRMG QSSQTS K PKPE
Subjt:  DSVKSSSSVRRPTSSSHRRLSQPALERLFASFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSTEPNIAKIASRMGIQSSQTSGKLPKPE

Query:  QESGACFRDIDNNGAQHQELPKANSVVAPCQSGDTRDDKLATQKTIRPESTATKSRKRKGRSDPMASRTKDRKEPQKPSAKSPK
        QESGACFRD  +NGAQHQE PKA +VVAPCQSG T DDKLA  KT RPESTATKSRKRKGRSD +ASR+K+RK+ +KP AKS K
Subjt:  QESGACFRDIDNNGAQHQELPKANSVVAPCQSGDTRDDKLATQKTIRPESTATKSRKRKGRSDPMASRTKDRKEPQKPSAKSPK

SwissProt top hitse value%identityAlignment
P46605 Homeobox protein HOX1A5.6e-10140.12Show/hide
Query:  SSDRVLRSRTQEKAKAPEPSNDLNNFTGGEGKRKKKNRNIKGKGARADEFSTIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEI
        S  RVLRS +  K  + E    +        KR+K +R      +  DEFS IR R+RY++NR+ YEQSLI+AY+SEGWK  S DK++PEKEL+RA SEI
Subjt:  SSDRVLRSRTQEKAKAPEPSNDLNNFTGGEGKRKKKNRNIKGKGARADEFSTIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEI

Query:  MRRKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCID
        +R KL+IR++F+++DSL ++G++ E+LFDSEG+I  EDIFC+ CGS + ++ NDIILCDG CDRGFHQ CL PPL   DIP  DEGWLCP CDCK DCID
Subjt:  MRRKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCID

Query:  LLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDAPD--TIDQEDESSSDE----SSSDESGYASASEELEAPPNDDQY--LG
        L+NE  GSN+SI D WEKV+P+AAA A     D A  LPSDDS+D D+DP+ P+   + +++ESS ++    S SD+S + + S++ E P  D +   L 
Subjt:  LLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDAPD--TIDQEDESSSDE----SSSDESGYASASEELEAPPNDDQY--LG

Query:  LPSDDSEDDDYDPSAPELDERVNQESST--SDFTSDSEDLAALDDNWSSKSDYLMSSSLNSSMSVRNSNGQSSGGGPSKSALHNELLTLLESGSDKDGLE
        LPS+DSEDDDYDP+ P+ D+ V ++SS+  SDFTSDS+D        S      +SS L     V +    ++    + SA        +E+  D+  + 
Subjt:  LPSDDSEDDDYDPSAPELDERVNQESST--SDFTSDSEDLAALDDNWSSKSDYLMSSSLNSSMSVRNSNGQSSGGGPSKSALHNELLTLLESGSDKDGLE

Query:  PVSGRRQVERLDYKKLHDETYGNVSTDSSDDTYGS---VSIDSSDDRGWGGRTRKRSPKNLVPALSKNGTNDDLNNINAKRSSKRRTHQKPAAENINNSV
        P S RRQ ERLDYKKL+DE YG  S+DSSDD   S     I  S++ G        SP      +  +  ND+L   + K+S            +I+ SV
Subjt:  PVSGRRQVERLDYKKLHDETYGNVSTDSSDDTYGS---VSIDSSDDRGWGGRTRKRSPKNLVPALSKNGTNDDLNNINAKRSSKRRTHQKPAAENINNSV

Query:  TRTLEDSVKSSSSVRRPTSSSHRRLSQPAL-ERLFASFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSTEPNIAKIASRMGIQSSQTSG
             D   + S+     S++ +    P + ++L   F+   YP R+ KESLA+ELGL+ +QV+KWFE  R S R  SS +       S     S  T+ 
Subjt:  TRTLEDSVKSSSSVRRPTSSSHRRLSQPAL-ERLFASFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSTEPNIAKIASRMGIQSSQTSG

Query:  KLPK-PE----QESGACFRDIDNNGAQHQELPKANSVVAPCQSGDTRDDKLATQKTIRPESTATKSRKRKGRSDPMASRTKDR
          PK PE    +ES  C     N G    +   ++ V +     D    K+ + +   P     +  ++K     +  +   R
Subjt:  KLPK-PE----QESGACFRDIDNNGAQHQELPKANSVVAPCQSGDTRDDKLATQKTIRPESTATKSRKRKGRSDPMASRTKDR

P48785 Pathogenesis-related homeodomain protein1.1e-5128.24Show/hide
Query:  KNGGQDDEKSGVQQNMTEENKELGLGDVLTESLVSSDRVLRSRTQEKAKAPEPSNDLNNFTGGEGKRKKKNRNIKGKGARADEFSTIRNRLRYLVNRIKY
        K G +   K   +QN  +  +EL                 +SRT++ ++      ++      + +++K  R  K      D+   ++ R RYL+ ++K 
Subjt:  KNGGQDDEKSGVQQNMTEENKELGLGDVLTESLVSSDRVLRSRTQEKAKAPEPSNDLNNFTGGEGKRKKKNRNIKGKGARADEFSTIRNRLRYLVNRIKY

Query:  EQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIMRRKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGF
        +Q+LIDAY++EGWKG S +K++P+KEL+RA  EI+  KL +RD  + LD L + G + E +  S+G I  + IFCA+C S+E   +NDIILCDG C+R F
Subjt:  EQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIMRRKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGF

Query:  HQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCIDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDAPDTIDQEDESSSDE
        HQ CL+PPL    IPP D+GW C  CDCK + ID +N   G++  +   W+ ++ E A+   G                                  S+ 
Subjt:  HQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCIDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDAPDTIDQEDESSSDE

Query:  SSSDESGYASASEELEAPPNDDQYLGLPSDDSEDDDYDPSAPELDERVNQESSTSDFTSDSEDLAALDDNWSSKSDYLMSSSLNSSMSVRNSNGQSSGGG
        + ++E+ +                   PSDDS+DDDYDP   E +   N  + + D   D+++                  S+++S+S+      S G  
Subjt:  SSSDESGYASASEELEAPPNDDQYLGLPSDDSEDDDYDPSAPELDERVNQESSTSDFTSDSEDLAALDDNWSSKSDYLMSSSLNSSMSVRNSNGQSSGGG

Query:  PSKSALHNELLTLLESGSDKDGLEPVSGRRQVERLDYKKLHDETYGNVSTDSSDDTYGSVSIDSSDDRGWGGRTRKRSPKNLVPALSKNGTNDDLNNINA
         S  +     L+ +    +    E V G RQ   +DY +L+ E +G  +          +    S+D  WG   R++          K  ++     +  
Subjt:  PSKSALHNELLTLLESGSDKDGLEPVSGRRQVERLDYKKLHDETYGNVSTDSSDDTYGSVSIDSSDDRGWGGRTRKRSPKNLVPALSKNGTNDDLNNINA

Query:  KRSSKRRTHQKPAAENINNSVTRTLEDSVKSSSSVRRPTSSSHR-RLSQPALERLFASFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSS
          SSK+           +  V  TLE S + S SV          RL + A+E+L   F E + P +A ++ LA+EL L  ++V+KWF+NTR+       
Subjt:  KRSSKRRTHQKPAAENINNSVTRTLEDSVKSSSSVRRPTSSSHR-RLSQPALERLFASFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSS

Query:  TE
        TE
Subjt:  TE

P48786 Pathogenesis-related homeodomain protein9.2e-12043.27Show/hide
Query:  ESLVSSDRVLRSRTQEKAKAPEPSNDLNNFTGGEG---KRKKKNRNIKGKGARADEFSTIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKEL
        E  V+S R LRSR+QEK+  P    D+NN    EG   ++ +K R  + +  R DEF  IR  LRYL++RIKYE++ +DAYS EGWKG S DK+KPEKEL
Subjt:  ESLVSSDRVLRSRTQEKAKAPEPSNDLNNFTGGEG---KRKKKNRNIKGKGARADEFSTIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKEL

Query:  QRASSEIMRRKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCD
        +RA +EI  RKLKIRDLFQ LD   +EGRL E LFDS G+IDSEDIFCAKCGSK++++ NDIILCDG CDRGFHQFCL+PPLL   IPPDDEGWLCPGC+
Subjt:  QRASSEIMRRKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCD

Query:  CKDDCIDLLNEFQGSNLSITDGWEKVY-PEAAAAAAGRNSDHALGLPSDDSEDGDYDPDAPDTIDQEDESSSDESSSDESGYASASEELEAPPNDDQYLG
        CK DCI LLN+ Q +N+ + D WEKV+  EAAAAA+G+N D   GLPSDDSED DYDP  PD    +++   D+SS+DES Y S S++++     +   G
Subjt:  CKDDCIDLLNEFQGSNLSITDGWEKVY-PEAAAAAAGRNSDHALGLPSDDSEDGDYDPDAPDTIDQEDESSSDESSSDESGYASASEELEAPPNDDQYLG

Query:  LPSDDSEDDDYDPSAPELDERVNQESSTSDFTSDSEDLAALDDNWSSKSDYLMSSSLNSSMSVRNSNGQSSGGGPSKSALHNELLTLLESGSDKDGLEPV
        LPSDDSEDD+YDPS    D+ + ++SS SDFTSDSED   + D++    D   +    +S      N +   G P                 ++    P+
Subjt:  LPSDDSEDDDYDPSAPELDERVNQESSTSDFTSDSEDLAALDDNWSSKSDYLMSSSLNSSMSVRNSNGQSSGGGPSKSALHNELLTLLESGSDKDGLEPV

Query:  SGRRQVERLDYKKLHD--------------------------ETYGNVSTDSSDDTYGSVSIDSSDDRGWGGRTRKRSPKNLVPALSKNGTNDDLNNINA
          RRQVE LDYKKL+D                          E YGN S+DSSD+ Y    + SS D        K +      A+ +   + DL     
Subjt:  SGRRQVERLDYKKLHD--------------------------ETYGNVSTDSSDDTYGSVSIDSSDDRGWGGRTRKRSPKNLVPALSKNGTNDDLNNINA

Query:  KRSS--KRRTHQKPAAENINNSVTRTLEDSVKSSSSVRRPTSSSHRRLSQPALERLFASFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPS
         R S   RR  +K A E  ++ ++R+ EDS    +  +  + + H    + A +RL  SF+ENQYP+RA KESLA EL LS++QVS WF N RWS RH S
Subjt:  KRSS--KRRTHQKPAAENINNSVTRTLEDSVKSSSSVRRPTSSSHRRLSQPALERLFASFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPS

Query:  STEPNIAKIASR---------------MGIQSSQTSGKLPKPEQESGA------CFRDIDNNGAQHQELPKANSVVAPCQSGDTRDDKLATQKTIRPE
            ++AK  S                  +  S T  ++ K EQ++ +      C R +  N    +      +V  PC   +TR++K  T+  I+P+
Subjt:  STEPNIAKIASR---------------MGIQSSQTSGKLPKPEQESGA------CFRDIDNNGAQHQELPKANSVVAPCQSGDTRDDKLATQKTIRPE

Q04996 Homeobox protein HAT3.11.5e-12244.81Show/hide
Query:  HSIPD----NQELGTTPECTSKNGGQDDEKSGVQQNM------------------TEENKELGLGDVLTESLVSSDRVLRSRTQEKAKAPEPSNDLNNFT
        H I D    N+E   TP+   K+   + + SG  + +                  +  +K LG G        S  R  RS+     +   PS+ + N T
Subjt:  HSIPD----NQELGTTPECTSKNGGQDDEKSGVQQNM------------------TEENKELGLGDVLTESLVSSDRVLRSRTQEKAKAPEPSNDLNNFT

Query:  GGEGKRKKKNRNI-KGKGARADEFSTIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIMRRKLKIRDLFQHLDSLCAEGRLSES
           G+ KKKN+ + KG+    DE++ I+ +LRY +NRI YEQSLIDAYS EGWKG S +K++PEKEL+RA+ EI+RRKLKIRDLFQHLD+LCAEG L ES
Subjt:  GGEGKRKKKNRNI-KGKGARADEFSTIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIMRRKLKIRDLFQHLDSLCAEGRLSES

Query:  LFDSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCIDLLNEFQGSNLSITDGWEKVYPEAAAA
        LFD++G+I SEDIFCAKCGSK+LS++NDIILCDG CDRGFHQ+CLEPPL   DIPPDDEGWLCPGCDCKDD +DLLN+  G+  S++D WEK++PEAAAA
Subjt:  LFDSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCIDLLNEFQGSNLSITDGWEKVYPEAAAA

Query:  AAGRNSDHALGLPSDDSEDGDYDPDA-------PDTIDQEDESSSDESSSDESGYASASEEL-----EAPPNDDQYLGLPSDDSEDDDYDPSAPELDERV
          G   +    LPSDDS+D +YDPD         D  D  +ES +++ SSDE+ + SAS+E+     E        + LPSDDSEDDDYDP AP  D+  
Subjt:  AAGRNSDHALGLPSDDSEDGDYDPDA-------PDTIDQEDESSSDESSSDESGYASASEEL-----EAPPNDDQYLGLPSDDSEDDDYDPSAPELDERV

Query:  NQESSTSDFTSDSEDLAALDDNWSSKSDYLMSSSLNSSMSVRNSNGQSSGGGPSKSALHNELLTLLESGSDKDGLEPVSGRRQVERLDYKKLHDETYGNV
        ++ESS SD TSD+EDL       S K D     + ++ +            G   S L  + +   + G D DG   VS RR VERLDYKKL+DE Y NV
Subjt:  NQESSTSDFTSDSEDLAALDDNWSSKSDYLMSSSLNSSMSVRNSNGQSSGGGPSKSALHNELLTLLESGSDKDGLEPVSGRRQVERLDYKKLHDETYGNV

Query:  STDSSDDTYGSVSIDSSDDRGWGGRTRKRSPK--NLVPALSKNGTNDDLNN---INAKRSSKRRTHQKPAAENINNSVTRTLEDSVKSSSSVRRPTSSSH
         T SSDD       D  D     G+    S    + VP    +   D  +      +KR+ K+ T + P      N  +  +E   KSSSS  + T    
Subjt:  STDSSDDTYGSVSIDSSDDRGWGGRTRKRSPK--NLVPALSKNGTNDDLNN---INAKRSSKRRTHQKPAAENINNSVTRTLEDSVKSSSSVRRPTSSSH

Query:  RRLSQPALERLFASFQENQYPERATKESLAQELGLSLKQVSKWFENTRWS-TRHPSSTEPNIAKI
             P  +RL+ SFQENQYP++ATKESLA+EL +++KQV+ WF++ RWS    P  +E N+ K+
Subjt:  RRLSQPALERLFASFQENQYPERATKESLAQELGLSLKQVSKWFENTRWS-TRHPSSTEPNIAKI

Q8H991 Homeobox protein HAZ14.0e-10744.24Show/hide
Query:  RVLRSRTQEKAKAPEPSNDLNNFTGG---EGKRKKKNRNIKGKGARADEFSTIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEI
        RVLRS +++K KA    N+L N   G     K++K  R  KG G   D++  IR R+RY++NR+ YEQSLI AY+SEGWKG S +K++PEKEL+RA  EI
Subjt:  RVLRSRTQEKAKAPEPSNDLNNFTGG---EGKRKKKNRNIKGKGARADEFSTIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEI

Query:  MRRKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCID
        +R K +IR+ F++LDSL +EG+L ES+FDS G+I SEDIFCA CGSK+++++NDIILCDG+CDRGFHQ+CL PPLL  DIP  DEGWLCP CDCK DCID
Subjt:  MRRKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCID

Query:  LLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDAPD--TIDQEDESSSD-------ESSSDESGYASASEELEAPPN----D
        +LNE QG  LSI D WEKV+PEAA+   G     A  LPSDDS D DYDP       +D+E  S  D       + SS E   +S  E+ +   N    D
Subjt:  LLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDAPD--TIDQEDESSSD-------ESSSDESGYASASEELEAPPN----D

Query:  DQYLGLPSDDSEDDDYDPSAPELDERVNQESST-----SDFTSDSEDLAALDDNWSSKSDYLMSSSLNSSMSVRNSNGQSSGGGPSKSALHNELLTLLES
        D  LGLPS+DSED D+DP+ P+ D+  N ES++     SDFTSDS+D  A +   S   D +   S +   +V  ++G    G P+     N  L  +E+
Subjt:  DQYLGLPSDDSEDDDYDPSAPELDERVNQESST-----SDFTSDSEDLAALDDNWSSKSDYLMSSSLNSSMSVRNSNGQSSGGGPSKSALHNELLTLLES

Query:  GSDKDGLEPVSGRRQVERLDYKKLHDETYGNVSTDSSDDTYGSVSIDSSDDRGWGGRTRKRSPKNLVPALSKNGTNDDLNNINAKRSSKRRTHQK---PA
          ++D + P+S +RQVERLDYKKL++E YG  S+DSSD           D+  +G  T ++   NL  + + +            R +  R H     P 
Subjt:  GSDKDGLEPVSGRRQVERLDYKKLHDETYGNVSTDSSDDTYGSVSIDSSDDRGWGGRTRKRSPKNLVPALSKNGTNDDLNNINAKRSSKRRTHQK---PA

Query:  AENINNSVTRTLEDSVKSSSSVRRPTSSSHRRLSQPALERLFASFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSTEPN
              SV+    + + S+S+    +++ +R       ++L A F+E+ YP RATKE+LAQELGL+  QV+KWF +TR   R  ++ + N
Subjt:  AENINNSVTRTLEDSVKSSSSVRRPTSSSHRRLSQPALERLFASFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSTEPN

Arabidopsis top hitse value%identityAlignment
AT3G19510.1 Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain1.1e-12344.81Show/hide
Query:  HSIPD----NQELGTTPECTSKNGGQDDEKSGVQQNM------------------TEENKELGLGDVLTESLVSSDRVLRSRTQEKAKAPEPSNDLNNFT
        H I D    N+E   TP+   K+   + + SG  + +                  +  +K LG G        S  R  RS+     +   PS+ + N T
Subjt:  HSIPD----NQELGTTPECTSKNGGQDDEKSGVQQNM------------------TEENKELGLGDVLTESLVSSDRVLRSRTQEKAKAPEPSNDLNNFT

Query:  GGEGKRKKKNRNI-KGKGARADEFSTIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIMRRKLKIRDLFQHLDSLCAEGRLSES
           G+ KKKN+ + KG+    DE++ I+ +LRY +NRI YEQSLIDAYS EGWKG S +K++PEKEL+RA+ EI+RRKLKIRDLFQHLD+LCAEG L ES
Subjt:  GGEGKRKKKNRNI-KGKGARADEFSTIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIMRRKLKIRDLFQHLDSLCAEGRLSES

Query:  LFDSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCIDLLNEFQGSNLSITDGWEKVYPEAAAA
        LFD++G+I SEDIFCAKCGSK+LS++NDIILCDG CDRGFHQ+CLEPPL   DIPPDDEGWLCPGCDCKDD +DLLN+  G+  S++D WEK++PEAAAA
Subjt:  LFDSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCIDLLNEFQGSNLSITDGWEKVYPEAAAA

Query:  AAGRNSDHALGLPSDDSEDGDYDPDA-------PDTIDQEDESSSDESSSDESGYASASEEL-----EAPPNDDQYLGLPSDDSEDDDYDPSAPELDERV
          G   +    LPSDDS+D +YDPD         D  D  +ES +++ SSDE+ + SAS+E+     E        + LPSDDSEDDDYDP AP  D+  
Subjt:  AAGRNSDHALGLPSDDSEDGDYDPDA-------PDTIDQEDESSSDESSSDESGYASASEEL-----EAPPNDDQYLGLPSDDSEDDDYDPSAPELDERV

Query:  NQESSTSDFTSDSEDLAALDDNWSSKSDYLMSSSLNSSMSVRNSNGQSSGGGPSKSALHNELLTLLESGSDKDGLEPVSGRRQVERLDYKKLHDETYGNV
        ++ESS SD TSD+EDL       S K D     + ++ +            G   S L  + +   + G D DG   VS RR VERLDYKKL+DE Y NV
Subjt:  NQESSTSDFTSDSEDLAALDDNWSSKSDYLMSSSLNSSMSVRNSNGQSSGGGPSKSALHNELLTLLESGSDKDGLEPVSGRRQVERLDYKKLHDETYGNV

Query:  STDSSDDTYGSVSIDSSDDRGWGGRTRKRSPK--NLVPALSKNGTNDDLNN---INAKRSSKRRTHQKPAAENINNSVTRTLEDSVKSSSSVRRPTSSSH
         T SSDD       D  D     G+    S    + VP    +   D  +      +KR+ K+ T + P      N  +  +E   KSSSS  + T    
Subjt:  STDSSDDTYGSVSIDSSDDRGWGGRTRKRSPK--NLVPALSKNGTNDDLNN---INAKRSSKRRTHQKPAAENINNSVTRTLEDSVKSSSSVRRPTSSSH

Query:  RRLSQPALERLFASFQENQYPERATKESLAQELGLSLKQVSKWFENTRWS-TRHPSSTEPNIAKI
             P  +RL+ SFQENQYP++ATKESLA+EL +++KQV+ WF++ RWS    P  +E N+ K+
Subjt:  RRLSQPALERLFASFQENQYPERATKESLAQELGLSLKQVSKWFENTRWS-TRHPSSTEPNIAKI

AT4G29940.1 pathogenesis related homeodomain protein A7.6e-5328.24Show/hide
Query:  KNGGQDDEKSGVQQNMTEENKELGLGDVLTESLVSSDRVLRSRTQEKAKAPEPSNDLNNFTGGEGKRKKKNRNIKGKGARADEFSTIRNRLRYLVNRIKY
        K G +   K   +QN  +  +EL                 +SRT++ ++      ++      + +++K  R  K      D+   ++ R RYL+ ++K 
Subjt:  KNGGQDDEKSGVQQNMTEENKELGLGDVLTESLVSSDRVLRSRTQEKAKAPEPSNDLNNFTGGEGKRKKKNRNIKGKGARADEFSTIRNRLRYLVNRIKY

Query:  EQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIMRRKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGF
        +Q+LIDAY++EGWKG S +K++P+KEL+RA  EI+  KL +RD  + LD L + G + E +  S+G I  + IFCA+C S+E   +NDIILCDG C+R F
Subjt:  EQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIMRRKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGF

Query:  HQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCIDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDAPDTIDQEDESSSDE
        HQ CL+PPL    IPP D+GW C  CDCK + ID +N   G++  +   W+ ++ E A+   G                                  S+ 
Subjt:  HQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCIDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDAPDTIDQEDESSSDE

Query:  SSSDESGYASASEELEAPPNDDQYLGLPSDDSEDDDYDPSAPELDERVNQESSTSDFTSDSEDLAALDDNWSSKSDYLMSSSLNSSMSVRNSNGQSSGGG
        + ++E+ +                   PSDDS+DDDYDP   E +   N  + + D   D+++                  S+++S+S+      S G  
Subjt:  SSSDESGYASASEELEAPPNDDQYLGLPSDDSEDDDYDPSAPELDERVNQESSTSDFTSDSEDLAALDDNWSSKSDYLMSSSLNSSMSVRNSNGQSSGGG

Query:  PSKSALHNELLTLLESGSDKDGLEPVSGRRQVERLDYKKLHDETYGNVSTDSSDDTYGSVSIDSSDDRGWGGRTRKRSPKNLVPALSKNGTNDDLNNINA
         S  +     L+ +    +    E V G RQ   +DY +L+ E +G  +          +    S+D  WG   R++          K  ++     +  
Subjt:  PSKSALHNELLTLLESGSDKDGLEPVSGRRQVERLDYKKLHDETYGNVSTDSSDDTYGSVSIDSSDDRGWGGRTRKRSPKNLVPALSKNGTNDDLNNINA

Query:  KRSSKRRTHQKPAAENINNSVTRTLEDSVKSSSSVRRPTSSSHR-RLSQPALERLFASFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSS
          SSK+           +  V  TLE S + S SV          RL + A+E+L   F E + P +A ++ LA+EL L  ++V+KWF+NTR+       
Subjt:  KRSSKRRTHQKPAAENINNSVTRTLEDSVKSSSSVRRPTSSSHR-RLSQPALERLFASFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSS

Query:  TE
        TE
Subjt:  TE

AT4G29940.2 pathogenesis related homeodomain protein A7.6e-5328.24Show/hide
Query:  KNGGQDDEKSGVQQNMTEENKELGLGDVLTESLVSSDRVLRSRTQEKAKAPEPSNDLNNFTGGEGKRKKKNRNIKGKGARADEFSTIRNRLRYLVNRIKY
        K G +   K   +QN  +  +EL                 +SRT++ ++      ++      + +++K  R  K      D+   ++ R RYL+ ++K 
Subjt:  KNGGQDDEKSGVQQNMTEENKELGLGDVLTESLVSSDRVLRSRTQEKAKAPEPSNDLNNFTGGEGKRKKKNRNIKGKGARADEFSTIRNRLRYLVNRIKY

Query:  EQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIMRRKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGF
        +Q+LIDAY++EGWKG S +K++P+KEL+RA  EI+  KL +RD  + LD L + G + E +  S+G I  + IFCA+C S+E   +NDIILCDG C+R F
Subjt:  EQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIMRRKLKIRDLFQHLDSLCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGF

Query:  HQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCIDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDAPDTIDQEDESSSDE
        HQ CL+PPL    IPP D+GW C  CDCK + ID +N   G++  +   W+ ++ E A+   G                                  S+ 
Subjt:  HQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCIDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDDSEDGDYDPDAPDTIDQEDESSSDE

Query:  SSSDESGYASASEELEAPPNDDQYLGLPSDDSEDDDYDPSAPELDERVNQESSTSDFTSDSEDLAALDDNWSSKSDYLMSSSLNSSMSVRNSNGQSSGGG
        + ++E+ +                   PSDDS+DDDYDP   E +   N  + + D   D+++                  S+++S+S+      S G  
Subjt:  SSSDESGYASASEELEAPPNDDQYLGLPSDDSEDDDYDPSAPELDERVNQESSTSDFTSDSEDLAALDDNWSSKSDYLMSSSLNSSMSVRNSNGQSSGGG

Query:  PSKSALHNELLTLLESGSDKDGLEPVSGRRQVERLDYKKLHDETYGNVSTDSSDDTYGSVSIDSSDDRGWGGRTRKRSPKNLVPALSKNGTNDDLNNINA
         S  +     L+ +    +    E V G RQ   +DY +L+ E +G  +          +    S+D  WG   R++          K  ++     +  
Subjt:  PSKSALHNELLTLLESGSDKDGLEPVSGRRQVERLDYKKLHDETYGNVSTDSSDDTYGSVSIDSSDDRGWGGRTRKRSPKNLVPALSKNGTNDDLNNINA

Query:  KRSSKRRTHQKPAAENINNSVTRTLEDSVKSSSSVRRPTSSSHR-RLSQPALERLFASFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSS
          SSK+           +  V  TLE S + S SV          RL + A+E+L   F E + P +A ++ LA+EL L  ++V+KWF+NTR+       
Subjt:  KRSSKRRTHQKPAAENINNSVTRTLEDSVKSSSSVRRPTSSSHR-RLSQPALERLFASFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSS

Query:  TE
        TE
Subjt:  TE

AT5G09790.1 ARABIDOPSIS TRITHORAX-RELATED PROTEIN 51.0e-0438.24Show/hide
Query:  DSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKD
        + E +    ++ C KCGS E   +++++LCD  CDRGFH  CL P ++   I      WLC   DC D
Subjt:  DSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKD

AT5G09790.2 ARABIDOPSIS TRITHORAX-RELATED PROTEIN 51.0e-0438.24Show/hide
Query:  DSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKD
        + E +    ++ C KCGS E   +++++LCD  CDRGFH  CL P ++   I      WLC   DC D
Subjt:  DSEGQIDSEDIFCAKCGSKELSIENDIILCDGVCDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAGAGGCATGAATATACAGAACCAAGACCTAATAATAACTCTGAAGCCGTACAAGAAGTCAATGCAGGTGTTGAAGTGCTAACTTGTTTTTCAAATGAGCAAAT
GCATTCAATACCTGACAATCAGGAATTGGGAACAACTCCAGAATGTACCAGCAAAAATGGTGGTCAAGATGATGAAAAATCAGGGGTCCAGCAGAATATGACGGAAGAGA
ATAAGGAACTTGGTCTTGGAGACGTGCTTACTGAGTCTCTTGTAAGCAGTGACAGAGTTTTGCGTTCAAGGACACAAGAGAAAGCTAAAGCACCTGAACCAAGTAATGAC
TTGAATAATTTTACTGGGGGAGAGGGAAAAAGGAAGAAGAAGAATAGAAATATAAAAGGAAAGGGAGCAAGAGCTGATGAATTTTCAACAATCAGGAATCGTTTGAGATA
TTTAGTGAACCGTATCAAATACGAACAGAGCTTGATTGATGCTTATTCTAGTGAAGGCTGGAAAGGGTTCAGCTCAGATAAATTGAAGCCTGAAAAGGAACTTCAACGGG
CATCAAGTGAAATAATGCGACGCAAATTGAAAATAAGAGATCTATTTCAGCATCTTGATTCACTTTGTGCTGAAGGAAGGCTTTCTGAATCTCTATTTGATTCTGAAGGA
CAGATAGACAGTGAGGATATATTCTGTGCAAAATGTGGATCGAAAGAACTGTCCATTGAAAATGACATCATACTATGTGATGGTGTTTGTGATCGTGGGTTCCACCAGTT
CTGCTTAGAACCACCTTTGCTAAATACAGACATTCCGCCGGATGATGAGGGCTGGCTATGCCCTGGATGTGATTGCAAAGATGACTGCATCGATCTGCTTAATGAATTTC
AAGGATCAAATCTTTCTATTACTGATGGTTGGGAGAAAGTCTATCCTGAGGCTGCAGCAGCAGCTGCTGGACGAAATTCTGATCACGCCTTGGGTCTTCCTTCAGATGAT
TCTGAAGATGGTGATTATGATCCTGATGCTCCGGATACTATTGACCAGGAAGATGAATCAAGTTCTGATGAATCGAGTTCTGATGAATCTGGGTATGCTTCTGCTTCTGA
GGAATTGGAGGCTCCACCCAATGATGACCAGTACTTAGGTCTCCCATCTGATGACTCGGAGGATGATGACTATGATCCCAGTGCTCCAGAACTTGATGAACGTGTCAACC
AGGAAAGTTCAACTTCTGACTTTACATCTGATTCTGAGGATCTAGCTGCACTTGATGATAACTGGTCTTCTAAAAGTGATTACCTCATGTCTTCTTCATTAAATAGTTCT
ATGTCTGTGAGAAACTCTAATGGGCAAAGTTCTGGAGGCGGTCCTAGCAAGAGTGCACTACATAATGAGTTATTAACTCTACTAGAGTCAGGTTCTGATAAGGATGGTCT
TGAACCTGTTTCTGGAAGAAGACAGGTGGAACGGTTGGATTACAAGAAGCTGCATGATGAGACATACGGGAATGTTTCTACTGACTCAAGTGATGACACATACGGGAGTG
TTTCTATTGATTCAAGTGATGACAGAGGTTGGGGTGGTAGAACAAGGAAGAGAAGCCCAAAAAACCTGGTTCCTGCATTGTCGAAAAATGGAACTAATGATGATTTGAAT
AATATAAACGCTAAACGCAGTTCTAAGAGGAGAACTCATCAAAAGCCAGCTGCTGAAAATATAAATAATTCTGTGACTAGAACTCTTGAAGACTCTGTGAAATCCAGTTC
TTCTGTTAGACGGCCCACATCATCATCACATAGAAGACTCAGTCAACCTGCATTGGAGAGACTTTTTGCATCATTCCAAGAAAATCAGTATCCTGAACGAGCTACAAAAG
AGAGTTTGGCACAAGAACTAGGACTCAGTTTGAAGCAGGTTAGCAAATGGTTTGAGAACACACGTTGGAGCACACGCCATCCCTCAAGCACTGAACCTAATATAGCAAAG
ATTGCTTCAAGAATGGGCATTCAGTCATCTCAGACAAGTGGAAAACTTCCCAAGCCTGAGCAAGAATCTGGTGCATGTTTCAGAGACATCGATAACAATGGTGCTCAACA
TCAAGAATTACCAAAGGCAAATAGTGTCGTGGCCCCATGTCAGAGTGGAGATACAAGGGATGACAAATTGGCGACTCAAAAAACTATTAGACCAGAATCTACTGCTACAA
AATCCAGAAAACGGAAGGGCAGGTCAGATCCCATGGCATCACGTACAAAGGACAGAAAGGAACCACAAAAGCCTTCTGCCAAGTCACCAAAAGAAATTTATTCGGCTATT
TACCCCCTTCCAAGAAACACATTAGATCAGATGCTGGCATTGTTTGAAGGGAGTTCAAGGTCACCTGTGAATTCAGTTCAGCTGGCAGTAGTCCATTGCTTCAATGCTCT
GCATGGCTCGAACACCCCTGAAAAAGGGAAAAGAAAAAGAAAAAGAAAGAAGCCAAACAAAGAGAACAAGATAAGAGAGCTACAAAGTATCCTTCTCGAGTCCTCTTCCA
TGTCGGTGGAGCTTATACTTCCAAGCTTTTTGGCTGTTTTTCGTCTTCTAAATCACCGGAGGCTGGAGCACTGGGTGGCCTGTGATCGAGCACTGAAATACCAAGCAGAG
TCATCATTCTCTGGGCTTGGTGTCTTCTGGCTGCACTTCTCTCCCTCTTGTGGGCATTTTGGTGGCCTCCCAAAGCCTGTGAACTGTAGAATACCCTCTTACAGAATTAA
AAGCTGTTTGTTCCTACCCAGGCTCAAGTTCAGCCACTCCAAAGCTCCTACACCATCAGGTTGGATTGCTGATTCTCTATCAAGCCCTGAAACTGAAGCTGTGGGAACGG
TAGCAGATTTTGATCCTTCCAAGTTTCCCAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGAGAGGCATGAATATACAGAACCAAGACCTAATAATAACTCTGAAGCCGTACAAGAAGTCAATGCAGGTGTTGAAGTGCTAACTTGTTTTTCAAATGAGCAAAT
GCATTCAATACCTGACAATCAGGAATTGGGAACAACTCCAGAATGTACCAGCAAAAATGGTGGTCAAGATGATGAAAAATCAGGGGTCCAGCAGAATATGACGGAAGAGA
ATAAGGAACTTGGTCTTGGAGACGTGCTTACTGAGTCTCTTGTAAGCAGTGACAGAGTTTTGCGTTCAAGGACACAAGAGAAAGCTAAAGCACCTGAACCAAGTAATGAC
TTGAATAATTTTACTGGGGGAGAGGGAAAAAGGAAGAAGAAGAATAGAAATATAAAAGGAAAGGGAGCAAGAGCTGATGAATTTTCAACAATCAGGAATCGTTTGAGATA
TTTAGTGAACCGTATCAAATACGAACAGAGCTTGATTGATGCTTATTCTAGTGAAGGCTGGAAAGGGTTCAGCTCAGATAAATTGAAGCCTGAAAAGGAACTTCAACGGG
CATCAAGTGAAATAATGCGACGCAAATTGAAAATAAGAGATCTATTTCAGCATCTTGATTCACTTTGTGCTGAAGGAAGGCTTTCTGAATCTCTATTTGATTCTGAAGGA
CAGATAGACAGTGAGGATATATTCTGTGCAAAATGTGGATCGAAAGAACTGTCCATTGAAAATGACATCATACTATGTGATGGTGTTTGTGATCGTGGGTTCCACCAGTT
CTGCTTAGAACCACCTTTGCTAAATACAGACATTCCGCCGGATGATGAGGGCTGGCTATGCCCTGGATGTGATTGCAAAGATGACTGCATCGATCTGCTTAATGAATTTC
AAGGATCAAATCTTTCTATTACTGATGGTTGGGAGAAAGTCTATCCTGAGGCTGCAGCAGCAGCTGCTGGACGAAATTCTGATCACGCCTTGGGTCTTCCTTCAGATGAT
TCTGAAGATGGTGATTATGATCCTGATGCTCCGGATACTATTGACCAGGAAGATGAATCAAGTTCTGATGAATCGAGTTCTGATGAATCTGGGTATGCTTCTGCTTCTGA
GGAATTGGAGGCTCCACCCAATGATGACCAGTACTTAGGTCTCCCATCTGATGACTCGGAGGATGATGACTATGATCCCAGTGCTCCAGAACTTGATGAACGTGTCAACC
AGGAAAGTTCAACTTCTGACTTTACATCTGATTCTGAGGATCTAGCTGCACTTGATGATAACTGGTCTTCTAAAAGTGATTACCTCATGTCTTCTTCATTAAATAGTTCT
ATGTCTGTGAGAAACTCTAATGGGCAAAGTTCTGGAGGCGGTCCTAGCAAGAGTGCACTACATAATGAGTTATTAACTCTACTAGAGTCAGGTTCTGATAAGGATGGTCT
TGAACCTGTTTCTGGAAGAAGACAGGTGGAACGGTTGGATTACAAGAAGCTGCATGATGAGACATACGGGAATGTTTCTACTGACTCAAGTGATGACACATACGGGAGTG
TTTCTATTGATTCAAGTGATGACAGAGGTTGGGGTGGTAGAACAAGGAAGAGAAGCCCAAAAAACCTGGTTCCTGCATTGTCGAAAAATGGAACTAATGATGATTTGAAT
AATATAAACGCTAAACGCAGTTCTAAGAGGAGAACTCATCAAAAGCCAGCTGCTGAAAATATAAATAATTCTGTGACTAGAACTCTTGAAGACTCTGTGAAATCCAGTTC
TTCTGTTAGACGGCCCACATCATCATCACATAGAAGACTCAGTCAACCTGCATTGGAGAGACTTTTTGCATCATTCCAAGAAAATCAGTATCCTGAACGAGCTACAAAAG
AGAGTTTGGCACAAGAACTAGGACTCAGTTTGAAGCAGGTTAGCAAATGGTTTGAGAACACACGTTGGAGCACACGCCATCCCTCAAGCACTGAACCTAATATAGCAAAG
ATTGCTTCAAGAATGGGCATTCAGTCATCTCAGACAAGTGGAAAACTTCCCAAGCCTGAGCAAGAATCTGGTGCATGTTTCAGAGACATCGATAACAATGGTGCTCAACA
TCAAGAATTACCAAAGGCAAATAGTGTCGTGGCCCCATGTCAGAGTGGAGATACAAGGGATGACAAATTGGCGACTCAAAAAACTATTAGACCAGAATCTACTGCTACAA
AATCCAGAAAACGGAAGGGCAGGTCAGATCCCATGGCATCACGTACAAAGGACAGAAAGGAACCACAAAAGCCTTCTGCCAAGTCACCAAAAGAAATTTATTCGGCTATT
TACCCCCTTCCAAGAAACACATTAGATCAGATGCTGGCATTGTTTGAAGGGAGTTCAAGGTCACCTGTGAATTCAGTTCAGCTGGCAGTAGTCCATTGCTTCAATGCTCT
GCATGGCTCGAACACCCCTGAAAAAGGGAAAAGAAAAAGAAAAAGAAAGAAGCCAAACAAAGAGAACAAGATAAGAGAGCTACAAAGTATCCTTCTCGAGTCCTCTTCCA
TGTCGGTGGAGCTTATACTTCCAAGCTTTTTGGCTGTTTTTCGTCTTCTAAATCACCGGAGGCTGGAGCACTGGGTGGCCTGTGATCGAGCACTGAAATACCAAGCAGAG
TCATCATTCTCTGGGCTTGGTGTCTTCTGGCTGCACTTCTCTCCCTCTTGTGGGCATTTTGGTGGCCTCCCAAAGCCTGTGAACTGTAGAATACCCTCTTACAGAATTAA
AAGCTGTTTGTTCCTACCCAGGCTCAAGTTCAGCCACTCCAAAGCTCCTACACCATCAGGTTGGATTGCTGATTCTCTATCAAGCCCTGAAACTGAAGCTGTGGGAACGG
TAGCAGATTTTGATCCTTCCAAGTTTCCCAGATGA
Protein sequenceShow/hide protein sequence
MEERHEYTEPRPNNNSEAVQEVNAGVEVLTCFSNEQMHSIPDNQELGTTPECTSKNGGQDDEKSGVQQNMTEENKELGLGDVLTESLVSSDRVLRSRTQEKAKAPEPSND
LNNFTGGEGKRKKKNRNIKGKGARADEFSTIRNRLRYLVNRIKYEQSLIDAYSSEGWKGFSSDKLKPEKELQRASSEIMRRKLKIRDLFQHLDSLCAEGRLSESLFDSEG
QIDSEDIFCAKCGSKELSIENDIILCDGVCDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCIDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHALGLPSDD
SEDGDYDPDAPDTIDQEDESSSDESSSDESGYASASEELEAPPNDDQYLGLPSDDSEDDDYDPSAPELDERVNQESSTSDFTSDSEDLAALDDNWSSKSDYLMSSSLNSS
MSVRNSNGQSSGGGPSKSALHNELLTLLESGSDKDGLEPVSGRRQVERLDYKKLHDETYGNVSTDSSDDTYGSVSIDSSDDRGWGGRTRKRSPKNLVPALSKNGTNDDLN
NINAKRSSKRRTHQKPAAENINNSVTRTLEDSVKSSSSVRRPTSSSHRRLSQPALERLFASFQENQYPERATKESLAQELGLSLKQVSKWFENTRWSTRHPSSTEPNIAK
IASRMGIQSSQTSGKLPKPEQESGACFRDIDNNGAQHQELPKANSVVAPCQSGDTRDDKLATQKTIRPESTATKSRKRKGRSDPMASRTKDRKEPQKPSAKSPKEIYSAI
YPLPRNTLDQMLALFEGSSRSPVNSVQLAVVHCFNALHGSNTPEKGKRKRKRKKPNKENKIRELQSILLESSSMSVELILPSFLAVFRLLNHRRLEHWVACDRALKYQAE
SSFSGLGVFWLHFSPSCGHFGGLPKPVNCRIPSYRIKSCLFLPRLKFSHSKAPTPSGWIADSLSSPETEAVGTVADFDPSKFPR