; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh04G024480 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh04G024480
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionSequence-specific DNA binding transcription factors
Genome locationCma_Chr04:16837923..16840526
RNA-Seq ExpressionCmaCh04G024480
SyntenyCmaCh04G024480
Gene Ontology termsNA
InterPro domainsIPR044822 - Myb/SANT-like DNA-binding domain 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6602183.1 hypothetical protein SDJN03_07416, partial [Cucurbita argyrosperma subsp. sororia]9.1e-23295.12Show/hide
Query:  MEGSLSQEGLIPGGASYGGPDLQGSFKVHNQAQHHHLQTHQGSSVNPSIQEGFLLLQNCDHTMSLVDYSKGDRCKNSQSDDEPSFNEDGIE-------GK
        MEGSLSQEGLIPGGASYGGPD QGSFKVHNQAQHHHL THQGSS+NPSIQEGFLLLQNCDHT+SLVDYSKG+RCKNSQSDDEPSFNEDGI+       GK
Subjt:  MEGSLSQEGLIPGGASYGGPDLQGSFKVHNQAQHHHLQTHQGSSVNPSIQEGFLLLQNCDHTMSLVDYSKGDRCKNSQSDDEPSFNEDGIE-------GK

Query:  KGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPA
        KGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGG RKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPA
Subjt:  KGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPA

Query:  LLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNLNRLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQNDDFDENEHGETDEHDDFEENFAPYGDDTL
        LLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHN NRLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQNDDFDENEHGETDEHDDFEENFAP+GDDTL
Subjt:  LLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNLNRLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQNDDFDENEHGETDEHDDFEENFAPYGDDTL

Query:  SLGGSVKRLRRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIYHLETESMKGSSSQKQWMELRLLQLEDQKLQTRVEMLELEKQKFKWERVNKKK
        SLGGSVKRL+RGQDFDDAHACGNS NSLDCNRGSHAYSQT+FAQGDI HLETESMKGS SQKQWMELRLLQLEDQKLQTRVEMLELEKQKFKWERVNKKK
Subjt:  SLGGSVKRLRRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIYHLETESMKGSSSQKQWMELRLLQLEDQKLQTRVEMLELEKQKFKWERVNKKK

Query:  DRELEKMRMINERMKLENERVALDIKQKEI
        DRELEKMRMINERMKLENERVALDIKQKEI
Subjt:  DRELEKMRMINERMKLENERVALDIKQKEI

XP_022134251.1 uncharacterized protein LOC111006553 [Momordica charantia]1.5e-19782.38Show/hide
Query:  MEGSLSQEGLIPGGASYGGPDLQGSFKVHNQAQHHHLQTH--QGSSVNPSIQEGFLL----LQNCDHTMSLVDYSKGDRCKNSQSDDEPSFNEDGIE---
        MEG+LSQ GLIPGG+SYGG DLQGSFKVHNQA HHH  +H  QGSS NPSIQEGF L    LQNCDHTMS+VDY+KG+R KNS SDDEPSF EDG++   
Subjt:  MEGSLSQEGLIPGGASYGGPDLQGSFKVHNQAQHHHLQTH--QGSSVNPSIQEGFLL----LQNCDHTMSLVDYSKGDRCKNSQSDDEPSFNEDGIE---

Query:  ----GKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQ
            GKKGS+WHRVKWTDKMVKLLITAVSYIGDDIASD++GGGRRKFQIIQKKGKWKL+SKVMAERGYQVSPQQCEDKFNDLNKRYKRLND+IGRGT+CQ
Subjt:  ----GKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQ

Query:  VVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNLNRLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQNDDFDENEHGETDEHDDFEENFAP
        VVENPALLDV+DYLTDK+KDDVRKILNSKQLFYEEMCSYHN NRLHLPHDPALQRSLQLA R RDDHD DEPRRHQNDDFDENEHGETDEHDDFEENFAP
Subjt:  VVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNLNRLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQNDDFDENEHGETDEHDDFEENFAP

Query:  YGDDTLSLG-GSVKRLRRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIYHLETESMKGSSSQKQWMELRLLQLEDQKLQTRVEMLELEKQKFKW
        +GD+    G GS KRLRRGQD D+AHACGNSL S DCN+ SH YS  QF   D   LETESMK S+SQKQWMELRLLQ+EDQKLQ +VEMLELEKQ+FKW
Subjt:  YGDDTLSLG-GSVKRLRRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIYHLETESMKGSSSQKQWMELRLLQLEDQKLQTRVEMLELEKQKFKW

Query:  ERVNKKKDRELEKMRMINERMKLENERVALDIKQKEI
        ER NKKKD ELEKMRM+NERMKLENER+ALD+KQKEI
Subjt:  ERVNKKKDRELEKMRMINERMKLENERVALDIKQKEI

XP_022964375.1 uncharacterized protein LOC111464406 [Cucurbita moschata]2.0e-23195.58Show/hide
Query:  MEGSLSQEGLIPGGASYGGPDLQGSFKVHNQAQHHHLQTHQGSSVNPSIQEGFLLLQNCDHTMSLVDYSKGDRCKNSQSDDEPSFNEDGIE-------GK
        MEGSLSQ GLIPGGASYGGPDLQGSFKVHNQAQHHHL THQGSSVNPS QEGFLLLQNCDHTMSLVDYSKG+RCKNSQSDDEPSFNEDGI+       GK
Subjt:  MEGSLSQEGLIPGGASYGGPDLQGSFKVHNQAQHHHLQTHQGSSVNPSIQEGFLLLQNCDHTMSLVDYSKGDRCKNSQSDDEPSFNEDGIE-------GK

Query:  KGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPA
        KGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPA
Subjt:  KGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPA

Query:  LLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNLNRLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQNDDFDENEHGETDEHDDFEENFAPYGDDTL
        LLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHN N+LHLPHDPALQRSLQLACRVRDDHDYDEPRRHQ DDFDENEHGETDEHDDFEENFAP+GDDTL
Subjt:  LLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNLNRLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQNDDFDENEHGETDEHDDFEENFAPYGDDTL

Query:  SLGGSVKRLRRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIYHLETESMKGSSSQKQWMELRLLQLEDQKLQTRVEMLELEKQKFKWERVNKKK
        SLGGSVKRL+RGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDI HLETESMKGSSSQKQWMELRLLQLED KLQTRVEMLELEKQKFKWERVNKKK
Subjt:  SLGGSVKRLRRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIYHLETESMKGSSSQKQWMELRLLQLEDQKLQTRVEMLELEKQKFKWERVNKKK

Query:  DRELEKMRMINERMKLENERVALDIKQKEI
        DRELEKMRMINERMKLENERVALDIKQKEI
Subjt:  DRELEKMRMINERMKLENERVALDIKQKEI

XP_022990368.1 uncharacterized protein LOC111487246 [Cucurbita maxima]7.4e-242100Show/hide
Query:  MEGSLSQEGLIPGGASYGGPDLQGSFKVHNQAQHHHLQTHQGSSVNPSIQEGFLLLQNCDHTMSLVDYSKGDRCKNSQSDDEPSFNEDGIEGKKGSMWHR
        MEGSLSQEGLIPGGASYGGPDLQGSFKVHNQAQHHHLQTHQGSSVNPSIQEGFLLLQNCDHTMSLVDYSKGDRCKNSQSDDEPSFNEDGIEGKKGSMWHR
Subjt:  MEGSLSQEGLIPGGASYGGPDLQGSFKVHNQAQHHHLQTHQGSSVNPSIQEGFLLLQNCDHTMSLVDYSKGDRCKNSQSDDEPSFNEDGIEGKKGSMWHR

Query:  VKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPALLDVIDY
        VKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPALLDVIDY
Subjt:  VKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPALLDVIDY

Query:  LTDKEKDDVRKILNSKQLFYEEMCSYHNLNRLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQNDDFDENEHGETDEHDDFEENFAPYGDDTLSLGGSVK
        LTDKEKDDVRKILNSKQLFYEEMCSYHNLNRLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQNDDFDENEHGETDEHDDFEENFAPYGDDTLSLGGSVK
Subjt:  LTDKEKDDVRKILNSKQLFYEEMCSYHNLNRLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQNDDFDENEHGETDEHDDFEENFAPYGDDTLSLGGSVK

Query:  RLRRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIYHLETESMKGSSSQKQWMELRLLQLEDQKLQTRVEMLELEKQKFKWERVNKKKDRELEKM
        RLRRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIYHLETESMKGSSSQKQWMELRLLQLEDQKLQTRVEMLELEKQKFKWERVNKKKDRELEKM
Subjt:  RLRRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIYHLETESMKGSSSQKQWMELRLLQLEDQKLQTRVEMLELEKQKFKWERVNKKKDRELEKM

Query:  RMINERMKLENERVALDIKQKEI
        RMINERMKLENERVALDIKQKEI
Subjt:  RMINERMKLENERVALDIKQKEI

XP_023548026.1 uncharacterized protein LOC111806794 [Cucurbita pepo subsp. pepo]2.2e-23395.81Show/hide
Query:  MEGSLSQEGLIPGGASYGGPDLQGSFKVHNQAQHHHLQTHQGSSVNPSIQEGFLLLQNCDHTMSLVDYSKGDRCKNSQSDDEPSFNEDGIE-------GK
        MEGSLSQEGLIPGGASYGGPDLQGSFKVHNQAQHHHL THQGSSVNPS QEGFLLLQNCDHTMSLVDYSKG+RCKNSQSDDEPSFNEDGI+       GK
Subjt:  MEGSLSQEGLIPGGASYGGPDLQGSFKVHNQAQHHHLQTHQGSSVNPSIQEGFLLLQNCDHTMSLVDYSKGDRCKNSQSDDEPSFNEDGIE-------GK

Query:  KGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPA
        KG MWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTC+VVENPA
Subjt:  KGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPA

Query:  LLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNLNRLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQNDDFDENEHGETDEHDDFEENFAPYGDDTL
        LLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHN NRLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQNDDFDENEHGETDEHDDFEENFAP+GDDTL
Subjt:  LLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNLNRLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQNDDFDENEHGETDEHDDFEENFAPYGDDTL

Query:  SLGGSVKRLRRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIYHLETESMKGSSSQKQWMELRLLQLEDQKLQTRVEMLELEKQKFKWERVNKKK
        SLGGSVKRLRRGQDFDDAH CGNSLNSLDCNRGSHAYSQTQFAQGDI HLETESMKGSSSQKQWMELRLL+LEDQKLQTRVEMLELEKQKFKWERVNKKK
Subjt:  SLGGSVKRLRRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIYHLETESMKGSSSQKQWMELRLLQLEDQKLQTRVEMLELEKQKFKWERVNKKK

Query:  DRELEKMRMINERMKLENERVALDIKQKEI
        DRELEKMRMINERMKLENERVALDIKQKEI
Subjt:  DRELEKMRMINERMKLENERVALDIKQKEI

TrEMBL top hitse value%identityAlignment
A0A1S3B4A7 LOW QUALITY PROTEIN: uncharacterized protein LOC1034856206.6e-19680.41Show/hide
Query:  MEGSLSQEGLIPGGASYGGPDLQGSFKVHNQAQH-------HHLQTHQGSSVNPSIQEGFLL----LQNCDHTMSLVDYSKGDRCKNSQSDDEPSFNEDG
        MEG+LSQ GLIPGG+SYGG DLQG FKVHNQ QH       HH  T QGSS NPSIQEGF L    +QNCDHTMSLV+Y+KG+RCKNS SD++PSFNED 
Subjt:  MEGSLSQEGLIPGGASYGGPDLQGSFKVHNQAQH-------HHLQTHQGSSVNPSIQEGFLL----LQNCDHTMSLVDYSKGDRCKNSQSDDEPSFNEDG

Query:  IE-------GKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGR
        I+       GKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASD +G GRRK QIIQKKGKWKL+SKV+AERGYQVSPQQCEDKFNDLNKRYKRLND+IGR
Subjt:  IE-------GKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGR

Query:  GTTCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNLNRLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQNDDFDENEHGETDEHDDFE
        GT+CQVVENPALLDVIDYLT+K+KDDVRKILNSKQLFYEEMCSYHN NRLHLPHDPALQRSLQLA R RDDHD DEPRRHQNDDFDE+E GETDEHDD+E
Subjt:  GTTCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNLNRLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQNDDFDENEHGETDEHDDFE

Query:  ENFAPYGDDTLS---LGGSVKRLRRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIYHLETESMKGSSSQKQWMELRLLQLEDQKLQTRVEMLEL
        ENF P+ D+  S   LGGSVKRL+RGQD DDAHACGNSL+ LDCN+ SH +SQ QFAQ D  HLETESMK S+SQKQWMELRLLQLEDQKLQ +VEMLEL
Subjt:  ENFAPYGDDTLS---LGGSVKRLRRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIYHLETESMKGSSSQKQWMELRLLQLEDQKLQTRVEMLEL

Query:  EKQKFKWERVNKKKDRELEKMRMINERMKLENERVALDIKQKEI
        EKQKFKWER NK KDRELEKMRM+NE+MKLENER+ALD+KQK+I
Subjt:  EKQKFKWERVNKKKDRELEKMRMINERMKLENERVALDIKQKEI

A0A5D3DGK7 Putative transcription factor1.3e-19680.63Show/hide
Query:  MEGSLSQEGLIPGGASYGGPDLQGSFKVHNQAQH-------HHLQTHQGSSVNPSIQEGFLL----LQNCDHTMSLVDYSKGDRCKNSQSDDEPSFNEDG
        MEG+LSQ GLIPGG+SYGG DLQG FKVHNQ QH       HH  T QGSS NPSIQEGF L    +QNCDHTMSLV+Y+KG+RCKNS SD++PSFNED 
Subjt:  MEGSLSQEGLIPGGASYGGPDLQGSFKVHNQAQH-------HHLQTHQGSSVNPSIQEGFLL----LQNCDHTMSLVDYSKGDRCKNSQSDDEPSFNEDG

Query:  IE-------GKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGR
        I+       GKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASD +G GRRK QIIQKKGKWKL+SKV+AERGYQVSPQQCEDKFNDLNKRYKRLND+IGR
Subjt:  IE-------GKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGR

Query:  GTTCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNLNRLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQNDDFDENEHGETDEHDDFE
        GT+CQVVENPALLDVIDYLT+K+KDDVRKILNSKQLFYEEMCSYHN NRLHLPHDPALQRSLQLA R RDDHD DEPRRHQNDDFDE+E GETDEHDD+E
Subjt:  GTTCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNLNRLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQNDDFDENEHGETDEHDDFE

Query:  ENFAPYGDDTLS---LGGSVKRLRRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIYHLETESMKGSSSQKQWMELRLLQLEDQKLQTRVEMLEL
        ENF P+ D+  S   LGGSVKRL+RGQD DDAHACGNSL+ LDCN+ SH +SQ QFAQ D  HLETESMK S+SQKQWMELRLLQLEDQKLQ +VEMLEL
Subjt:  ENFAPYGDDTLS---LGGSVKRLRRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIYHLETESMKGSSSQKQWMELRLLQLEDQKLQTRVEMLEL

Query:  EKQKFKWERVNKKKDRELEKMRMINERMKLENERVALDIKQKEI
        EKQKFKWER NKKKDRELEKMRM+NE+MKLENER+ALD+KQK+I
Subjt:  EKQKFKWERVNKKKDRELEKMRMINERMKLENERVALDIKQKEI

A0A6J1BXD6 uncharacterized protein LOC1110065537.1e-19882.38Show/hide
Query:  MEGSLSQEGLIPGGASYGGPDLQGSFKVHNQAQHHHLQTH--QGSSVNPSIQEGFLL----LQNCDHTMSLVDYSKGDRCKNSQSDDEPSFNEDGIE---
        MEG+LSQ GLIPGG+SYGG DLQGSFKVHNQA HHH  +H  QGSS NPSIQEGF L    LQNCDHTMS+VDY+KG+R KNS SDDEPSF EDG++   
Subjt:  MEGSLSQEGLIPGGASYGGPDLQGSFKVHNQAQHHHLQTH--QGSSVNPSIQEGFLL----LQNCDHTMSLVDYSKGDRCKNSQSDDEPSFNEDGIE---

Query:  ----GKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQ
            GKKGS+WHRVKWTDKMVKLLITAVSYIGDDIASD++GGGRRKFQIIQKKGKWKL+SKVMAERGYQVSPQQCEDKFNDLNKRYKRLND+IGRGT+CQ
Subjt:  ----GKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQ

Query:  VVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNLNRLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQNDDFDENEHGETDEHDDFEENFAP
        VVENPALLDV+DYLTDK+KDDVRKILNSKQLFYEEMCSYHN NRLHLPHDPALQRSLQLA R RDDHD DEPRRHQNDDFDENEHGETDEHDDFEENFAP
Subjt:  VVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNLNRLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQNDDFDENEHGETDEHDDFEENFAP

Query:  YGDDTLSLG-GSVKRLRRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIYHLETESMKGSSSQKQWMELRLLQLEDQKLQTRVEMLELEKQKFKW
        +GD+    G GS KRLRRGQD D+AHACGNSL S DCN+ SH YS  QF   D   LETESMK S+SQKQWMELRLLQ+EDQKLQ +VEMLELEKQ+FKW
Subjt:  YGDDTLSLG-GSVKRLRRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIYHLETESMKGSSSQKQWMELRLLQLEDQKLQTRVEMLELEKQKFKW

Query:  ERVNKKKDRELEKMRMINERMKLENERVALDIKQKEI
        ER NKKKD ELEKMRM+NERMKLENER+ALD+KQKEI
Subjt:  ERVNKKKDRELEKMRMINERMKLENERVALDIKQKEI

A0A6J1HKM3 uncharacterized protein LOC1114644069.8e-23295.58Show/hide
Query:  MEGSLSQEGLIPGGASYGGPDLQGSFKVHNQAQHHHLQTHQGSSVNPSIQEGFLLLQNCDHTMSLVDYSKGDRCKNSQSDDEPSFNEDGIE-------GK
        MEGSLSQ GLIPGGASYGGPDLQGSFKVHNQAQHHHL THQGSSVNPS QEGFLLLQNCDHTMSLVDYSKG+RCKNSQSDDEPSFNEDGI+       GK
Subjt:  MEGSLSQEGLIPGGASYGGPDLQGSFKVHNQAQHHHLQTHQGSSVNPSIQEGFLLLQNCDHTMSLVDYSKGDRCKNSQSDDEPSFNEDGIE-------GK

Query:  KGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPA
        KGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPA
Subjt:  KGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPA

Query:  LLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNLNRLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQNDDFDENEHGETDEHDDFEENFAPYGDDTL
        LLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHN N+LHLPHDPALQRSLQLACRVRDDHDYDEPRRHQ DDFDENEHGETDEHDDFEENFAP+GDDTL
Subjt:  LLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNLNRLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQNDDFDENEHGETDEHDDFEENFAPYGDDTL

Query:  SLGGSVKRLRRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIYHLETESMKGSSSQKQWMELRLLQLEDQKLQTRVEMLELEKQKFKWERVNKKK
        SLGGSVKRL+RGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDI HLETESMKGSSSQKQWMELRLLQLED KLQTRVEMLELEKQKFKWERVNKKK
Subjt:  SLGGSVKRLRRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIYHLETESMKGSSSQKQWMELRLLQLEDQKLQTRVEMLELEKQKFKWERVNKKK

Query:  DRELEKMRMINERMKLENERVALDIKQKEI
        DRELEKMRMINERMKLENERVALDIKQKEI
Subjt:  DRELEKMRMINERMKLENERVALDIKQKEI

A0A6J1JT31 uncharacterized protein LOC1114872463.6e-242100Show/hide
Query:  MEGSLSQEGLIPGGASYGGPDLQGSFKVHNQAQHHHLQTHQGSSVNPSIQEGFLLLQNCDHTMSLVDYSKGDRCKNSQSDDEPSFNEDGIEGKKGSMWHR
        MEGSLSQEGLIPGGASYGGPDLQGSFKVHNQAQHHHLQTHQGSSVNPSIQEGFLLLQNCDHTMSLVDYSKGDRCKNSQSDDEPSFNEDGIEGKKGSMWHR
Subjt:  MEGSLSQEGLIPGGASYGGPDLQGSFKVHNQAQHHHLQTHQGSSVNPSIQEGFLLLQNCDHTMSLVDYSKGDRCKNSQSDDEPSFNEDGIEGKKGSMWHR

Query:  VKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPALLDVIDY
        VKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPALLDVIDY
Subjt:  VKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPALLDVIDY

Query:  LTDKEKDDVRKILNSKQLFYEEMCSYHNLNRLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQNDDFDENEHGETDEHDDFEENFAPYGDDTLSLGGSVK
        LTDKEKDDVRKILNSKQLFYEEMCSYHNLNRLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQNDDFDENEHGETDEHDDFEENFAPYGDDTLSLGGSVK
Subjt:  LTDKEKDDVRKILNSKQLFYEEMCSYHNLNRLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQNDDFDENEHGETDEHDDFEENFAPYGDDTLSLGGSVK

Query:  RLRRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIYHLETESMKGSSSQKQWMELRLLQLEDQKLQTRVEMLELEKQKFKWERVNKKKDRELEKM
        RLRRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIYHLETESMKGSSSQKQWMELRLLQLEDQKLQTRVEMLELEKQKFKWERVNKKKDRELEKM
Subjt:  RLRRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIYHLETESMKGSSSQKQWMELRLLQLEDQKLQTRVEMLELEKQKFKWERVNKKKDRELEKM

Query:  RMINERMKLENERVALDIKQKEI
        RMINERMKLENERVALDIKQKEI
Subjt:  RMINERMKLENERVALDIKQKEI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21200.1 sequence-specific DNA binding transcription factors1.3e-12757.88Show/hide
Query:  MEGSLSQEGLIPGGA-SYGGPDLQGSFKVHNQAQHHHLQTHQGSS--VNPSIQEGFLLLQNCDH----TMSLVDYSKGDRCKNSQS-DDEPSFNEDGIEG
        M+G+  Q G++  GA SYGG DLQGS +VH+Q   +    H  +S  ++  +    +  Q CDH     MS+ +  K +R KNS S DDEPSF E+G +G
Subjt:  MEGSLSQEGLIPGGA-SYGGPDLQGSFKVHNQAQHHHLQTHQGSS--VNPSIQEGFLLLQNCDH----TMSLVDYSKGDRCKNSQS-DDEPSFNEDGIEG

Query:  --------KKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGT
                 KGS W RVKWTDKMVKLLITAVSYIGDD  S  +   RRKF ++QKKGKWK +SKVMAERGY VSPQQCEDKFNDLNKRYK+LNDM+GRGT
Subjt:  --------KKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGT

Query:  TCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNLNRLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQNDDFDENEH-GETDEHDDFEE
        +CQVVENPALLD I YL DKEKDDVRKI++SK LFYEEMCSYHN NRLHLPHD ALQRSLQLA R RDDHD D+ R+HQ +D D+ +H G+ DEHD++EE
Subjt:  TCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNLNRLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQNDDFDENEH-GETDEHDDFEE

Query:  NFAPYGDDTLSL----GGSVKRLRRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIYHLETESMKGSSSQKQWMELRLLQLEDQKLQTRVEMLEL
            YGD  ++     GG +K++R     +D     + +NSL+CN+ S    Q  F+Q D+     ES +  S QKQWME R LQLE+QKLQ +VE+LEL
Subjt:  NFAPYGDDTLSL----GGSVKRLRRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIYHLETESMKGSSSQKQWMELRLLQLEDQKLQTRVEMLEL

Query:  EKQKFKWERVNKKKDRELEKMRMINERMKLENERVALDIKQKEI
        EKQ+F+W+R +KK+D+ELE+MRM NERMKLEN+R+ L++KQ+E+
Subjt:  EKQKFKWERVNKKKDRELEKMRMINERMKLENERVALDIKQKEI

AT1G76870.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors (TAIR:AT1G21200.1)2.0e-8848.72Show/hide
Query:  MEGSLSQEGLIPGGASYGGPDLQGSFKVHNQAQHHHLQTHQGSSVNPSIQEGFLLLQNCDHTMSLVDYSKGDRCKNSQSDDEP--SFNEDG-IEGKKGSM
        MEG+ SQ G      S    DL+ +    NQ QHH          N     GF      ++TM    ++  DR K S S+D+     + DG  + K+ S 
Subjt:  MEGSLSQEGLIPGGASYGGPDLQGSFKVHNQAQHHHLQTHQGSSVNPSIQEGFLLLQNCDHTMSLVDYSKGDRCKNSQSDDEP--SFNEDG-IEGKKGSM

Query:  WHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPALLDV
        W RVKW DKMVKL+ITA+SYIG+D  SD      +KF ++QKKGKW+ +SKVM ERGY VSPQQCEDKFNDLNKRYK+LN+M+GRGT+C+VVENP+LLD 
Subjt:  WHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPALLDV

Query:  IDYLTDKEKDDVRKILNSKQLFYEEMCSYHNLNRLHLPHDPALQRSLQL-ACRVRDDHDYDEPRRHQNDDFDENEHGETDEHDDFEENFAPYGDDTLSLG
        IDYL +KEKD+VR+I++SK LFYEEMCSYHN NRLHLPHDPA+QRSL L     RDDHD DE  +HQN+D D++        DD+EE+     D  LS  
Subjt:  IDYLTDKEKDDVRKILNSKQLFYEEMCSYHNLNRLHLPHDPALQRSLQL-ACRVRDDHDYDEPRRHQNDDFDENEHGETDEHDDFEENFAPYGDDTLSLG

Query:  GSVKRLRRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIYH-LETESMKGSSSQKQWMELRLLQLEDQKLQTRVEMLELEKQKFKWERVNKKKDR
          +KRLR+ Q  +D    G+       N+G       + +Q D+   +  +S K +  Q+Q +E + L+LE +KLQ + EM+ELE+Q+FKWE  +K++++
Subjt:  GSVKRLRRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIYH-LETESMKGSSSQKQWMELRLLQLEDQKLQTRVEMLELEKQKFKWERVNKKKDR

Query:  ELEKMRMINERMKLENERVALDIKQKEIDRK
        +L KMRM NERMKLENER++L++K+ E+  K
Subjt:  ELEKMRMINERMKLENERVALDIKQKEIDRK

AT3G10040.1 sequence-specific DNA binding transcription factors8.3e-5033.85Show/hide
Query:  MEGSLSQEGLIPGGASYGGPDLQGSFKVHNQAQHHHLQTHQG-SSVNPSIQEGFLLLQNCDHTMSLVDYSKG-----DRCKNSQSDDEPSFNEDGIEGK-
        ME ++   G  P   S   P    + +   Q QH H  T  G     P I+    L         +   S G     DR   S S   P  +  G +GK 
Subjt:  MEGSLSQEGLIPGGASYGGPDLQGSFKVHNQAQHHHLQTHQG-SSVNPSIQEGFLLLQNCDHTMSLVDYSKG-----DRCKNSQSDDEPSFNEDGIEGK-

Query:  KGSMWHRVKWTDKMVKLLITAVSYIGDDIA----------SDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRG
        K S WHR+KWTD MV+LLI AV YIGD+            +   GGG     ++QKKGKWK +S+ M E+G+ VSPQQCEDKFNDLNKRYKR+ND++G+G
Subjt:  KGSMWHRVKWTDKMVKLLITAVSYIGDDIA----------SDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRG

Query:  TTCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNLNRLHLPHD--PALQRSLQLACRVRDDHDYDE---------PRRHQNDDFDENEH
          C+VVEN  LL+ +D+LT K KD+V+K+LNSK LF+ EMC+YHN       HD  P  Q  + +    +  + +             R + ++  E++ 
Subjt:  TTCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNLNRLHLPHD--PALQRSLQLACRVRDDHDYDE---------PRRHQNDDFDENEH

Query:  GETDEHDDFEENFAPYGDDTLSLGGSVKRLRRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIYHLETESMKGSSSQKQWMELRLLQLEDQKLQT
         E D   + EE+          +  +VKRLR                                   +   +  +  K    +K+W+  ++L++E++K+  
Subjt:  GETDEHDDFEENFAPYGDDTLSLGGSVKRLRRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIYHLETESMKGSSSQKQWMELRLLQLEDQKLQT

Query:  RVEMLELEKQKFKWERVNKKKDRELEKMRMINERMKLENERVALDIKQKEID
          E +E+EKQ+ KW R   KK+RE+EK ++ N+R +LE ER+ L +++ EI+
Subjt:  RVEMLELEKQKFKWERVNKKKDRELEKMRMINERMKLENERVALDIKQKEID

AT5G47660.1 Homeodomain-like superfamily protein4.8e-0525.44Show/hide
Query:  LVDYSKGDRCKNSQSDDEPSFNEDGIEGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCED
        L +  K ++C+++Q + E  F      G  G      +W  + V+ LI++ S + +                I K   W  +S  M ERGY+ S ++C++
Subjt:  LVDYSKGDRCKNSQSDDEPSFNEDGIEGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCED

Query:  KFNDLNKRYKRLND
        K+ ++NK Y+R+ +
Subjt:  KFNDLNKRYKRLND


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGGGAGTTTATCACAAGAAGGGTTGATTCCAGGAGGGGCCTCTTATGGAGGTCCTGATTTGCAAGGCTCGTTTAAGGTTCATAATCAGGCACAACATCATCATCT
TCAAACTCATCAGGGTTCTTCAGTAAATCCTTCCATTCAGGAGGGTTTTTTACTTCTACAAAATTGTGACCATACCATGTCTTTGGTAGATTATAGCAAGGGAGACAGGT
GTAAAAACTCACAGAGTGACGATGAGCCAAGCTTTAACGAGGACGGTATCGAGGGGAAGAAGGGATCGATGTGGCATCGTGTGAAATGGACGGATAAAATGGTGAAGCTT
CTGATTACGGCGGTGTCTTATATCGGAGATGATATTGCTTCTGATTATGAAGGAGGTGGAAGAAGGAAATTTCAAATTATACAGAAGAAAGGTAAATGGAAATTGATGTC
AAAGGTTATGGCTGAAAGGGGCTATCAAGTTTCACCACAGCAGTGTGAGGATAAATTTAATGACCTCAATAAGAGGTATAAGAGGCTGAATGATATGATTGGGAGAGGCA
CTACGTGCCAAGTTGTTGAGAATCCTGCACTTCTTGATGTCATTGATTATTTAACAGACAAAGAGAAGGATGATGTGAGAAAAATTTTAAACTCGAAGCAATTGTTCTAT
GAGGAAATGTGTTCGTATCATAATTTGAATCGACTCCATCTGCCCCATGATCCTGCTTTGCAGCGTTCTTTGCAGTTGGCTTGTAGAGTTAGGGATGATCATGATTATGA
TGAGCCAAGGAGACACCAAAACGATGATTTTGATGAAAACGAACATGGTGAAACTGATGAACACGATGATTTTGAGGAGAATTTTGCACCTTATGGGGACGACACGCTAT
CTCTAGGAGGCTCAGTAAAGAGGCTAAGGCGAGGTCAGGACTTCGACGATGCTCATGCCTGTGGAAATTCCTTGAATTCTCTTGATTGCAATAGAGGTTCTCATGCTTAC
TCACAAACACAATTCGCTCAAGGCGATATATATCATTTAGAAACCGAAAGTATGAAAGGTTCTTCGTCGCAAAAGCAGTGGATGGAGCTTCGTTTACTTCAGTTGGAAGA
TCAGAAGCTTCAAACTCGAGTTGAAATGTTGGAATTGGAGAAACAGAAGTTCAAATGGGAGAGAGTTAACAAGAAAAAGGACCGTGAGCTTGAAAAAATGAGGATGATAA
ACGAGAGGATGAAGCTTGAAAACGAGCGCGTTGCGCTTGACATAAAGCAGAAGGAAATCGATAGGAAGCATGAACATTTTCCTTTCTTGGCACTGTTTTCTGCATCCATT
AACACTCACGAAATGTTATTCTGTCCATGGAGTGTGATGAACTTGAATCTCGAACTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGGGAGTTTATCACAAGAAGGGTTGATTCCAGGAGGGGCCTCTTATGGAGGTCCTGATTTGCAAGGCTCGTTTAAGGTTCATAATCAGGCACAACATCATCATCT
TCAAACTCATCAGGGTTCTTCAGTAAATCCTTCCATTCAGGAGGGTTTTTTACTTCTACAAAATTGTGACCATACCATGTCTTTGGTAGATTATAGCAAGGGAGACAGGT
GTAAAAACTCACAGAGTGACGATGAGCCAAGCTTTAACGAGGACGGTATCGAGGGGAAGAAGGGATCGATGTGGCATCGTGTGAAATGGACGGATAAAATGGTGAAGCTT
CTGATTACGGCGGTGTCTTATATCGGAGATGATATTGCTTCTGATTATGAAGGAGGTGGAAGAAGGAAATTTCAAATTATACAGAAGAAAGGTAAATGGAAATTGATGTC
AAAGGTTATGGCTGAAAGGGGCTATCAAGTTTCACCACAGCAGTGTGAGGATAAATTTAATGACCTCAATAAGAGGTATAAGAGGCTGAATGATATGATTGGGAGAGGCA
CTACGTGCCAAGTTGTTGAGAATCCTGCACTTCTTGATGTCATTGATTATTTAACAGACAAAGAGAAGGATGATGTGAGAAAAATTTTAAACTCGAAGCAATTGTTCTAT
GAGGAAATGTGTTCGTATCATAATTTGAATCGACTCCATCTGCCCCATGATCCTGCTTTGCAGCGTTCTTTGCAGTTGGCTTGTAGAGTTAGGGATGATCATGATTATGA
TGAGCCAAGGAGACACCAAAACGATGATTTTGATGAAAACGAACATGGTGAAACTGATGAACACGATGATTTTGAGGAGAATTTTGCACCTTATGGGGACGACACGCTAT
CTCTAGGAGGCTCAGTAAAGAGGCTAAGGCGAGGTCAGGACTTCGACGATGCTCATGCCTGTGGAAATTCCTTGAATTCTCTTGATTGCAATAGAGGTTCTCATGCTTAC
TCACAAACACAATTCGCTCAAGGCGATATATATCATTTAGAAACCGAAAGTATGAAAGGTTCTTCGTCGCAAAAGCAGTGGATGGAGCTTCGTTTACTTCAGTTGGAAGA
TCAGAAGCTTCAAACTCGAGTTGAAATGTTGGAATTGGAGAAACAGAAGTTCAAATGGGAGAGAGTTAACAAGAAAAAGGACCGTGAGCTTGAAAAAATGAGGATGATAA
ACGAGAGGATGAAGCTTGAAAACGAGCGCGTTGCGCTTGACATAAAGCAGAAGGAAATCGATAGGAAGCATGAACATTTTCCTTTCTTGGCACTGTTTTCTGCATCCATT
AACACTCACGAAATGTTATTCTGTCCATGGAGTGTGATGAACTTGAATCTCGAACTGTAA
Protein sequenceShow/hide protein sequence
MEGSLSQEGLIPGGASYGGPDLQGSFKVHNQAQHHHLQTHQGSSVNPSIQEGFLLLQNCDHTMSLVDYSKGDRCKNSQSDDEPSFNEDGIEGKKGSMWHRVKWTDKMVKL
LITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFY
EEMCSYHNLNRLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQNDDFDENEHGETDEHDDFEENFAPYGDDTLSLGGSVKRLRRGQDFDDAHACGNSLNSLDCNRGSHAY
SQTQFAQGDIYHLETESMKGSSSQKQWMELRLLQLEDQKLQTRVEMLELEKQKFKWERVNKKKDRELEKMRMINERMKLENERVALDIKQKEIDRKHEHFPFLALFSASI
NTHEMLFCPWSVMNLNLEL