CuGenDBv2

CmoCh04G025630 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh04G025630
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Sequence-specific DNA binding transcription factors
Genome location: Cmo_Chr04:18770153..18771457
RNA-Seq Expression: CmoCh04G025630
Synteny: CmoCh04G025630
Gene Ontology terms: NA
InterPro domains: IPR044822 - Myb/SANT-like DNA-binding domain 4
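
The genome location string can be unpacked programmatically. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (a common genome-browser convention that this page does not state explicitly). The helper name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re

def parse_location(loc: str):
    """Split 'chrom:start..end' into its parts and compute the span length.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom = m.group(1)
    start, end = int(m.group(2)), int(m.group(3))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    return chrom, start, end, end - start + 1

chrom, start, end, length = parse_location("Cmo_Chr04:18770153..18771457")
print(chrom, start, end, length)  # Cmo_Chr04 18770153 18771457 1305
```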


Homology
GenBank top hits (e value, % identity, alignment)
KAG6602183.1 hypothetical protein SDJN03_07416, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.7e-240; identity: 97.01%)
Query:  MEGSLSQ-GLIPGGASYGGPDLQGSFKVHNQAQHHHLHTHQGSSVNPSTQEGFLLLQNCDHTMSLVDYSKGERCKNSQSDDEPSFNEDGIDAHNETIKGK
        MEGSLSQ GLIPGGASYGGPD QGSFKVHNQAQHHHLHTHQGSS+NPS QEGFLLLQNCDHT+SLVDYSKGERCKNSQSDDEPSFNEDGIDAHNETIKGK
Subjt:  MEGSLSQ-GLIPGGASYGGPDLQGSFKVHNQAQHHHLHTHQGSSVNPSTQEGFLLLQNCDHTMSLVDYSKGERCKNSQSDDEPSFNEDGIDAHNETIKGK

Query:  KGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPA
        KGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGG RKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPA
Subjt:  KGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPA

Query:  LLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNQLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQKDDFDENEHGETDEHDDFEENFAPHGDDTL
        LLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSN+LHLPHDPALQRSLQLACRVRDDHDYDEPRRHQ DDFDENEHGETDEHDDFEENFAPHGDDTL
Subjt:  LLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNQLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQKDDFDENEHGETDEHDDFEENFAPHGDDTL

Query:  SLGGSVKRLKRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIAHLETESMKGSSSQKQWMELRLLQLEDHKLQTRVEMLELEKQKFKWERVNKKK
        SLGGSVKRLKRGQDFDDAHACGNS NSLDCNRGSHAYSQT+FAQGDI+HLETESMKGS SQKQWMELRLLQLED KLQTRVEMLELEKQKFKWERVNKKK
Subjt:  SLGGSVKRLKRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIAHLETESMKGSSSQKQWMELRLLQLEDHKLQTRVEMLELEKQKFKWERVNKKK

Query:  DRELEKMRMINERMKLENERVALDIKQKEIGSRIH
        DRELEKMRMINERMKLENERVALDIKQKEIGSRIH
Subjt:  DRELEKMRMINERMKLENERVALDIKQKEIGSRIH

XP_022134251.1 uncharacterized protein LOC111006553 [Momordica charantia] (e value: 3.4e-204; identity: 83.48%)
Query:  MEGSLSQ-GLIPGGASYGGPDLQGSFKVHNQAQHHHLHTH--QGSSVNPSTQEGFLL----LQNCDHTMSLVDYSKGERCKNSQSDDEPSFNEDGIDAHN
        MEG+LSQ GLIPGG+SYGG DLQGSFKVHNQA HHH H+H  QGSS NPS QEGF L    LQNCDHTMS+VDY+KGER KNS SDDEPSF EDG+D HN
Subjt:  MEGSLSQ-GLIPGGASYGGPDLQGSFKVHNQAQHHHLHTH--QGSSVNPSTQEGFLL----LQNCDHTMSLVDYSKGERCKNSQSDDEPSFNEDGIDAHN

Query:  ETIKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQ
        ET KGKKGS+WHRVKWTDKMVKLLITAVSYIGDDIASD++GGGRRKFQIIQKKGKWKL+SKVMAERGYQVSPQQCEDKFNDLNKRYKRLND+IGRGT+CQ
Subjt:  ETIKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQ

Query:  VVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNQLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQKDDFDENEHGETDEHDDFEENFAP
        VVENPALLDV+DYLTDK+KDDVRKILNSKQLFYEEMCSYHNSN+LHLPHDPALQRSLQLA R RDDHD DEPRRHQ DDFDENEHGETDEHDDFEENFAP
Subjt:  VVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNQLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQKDDFDENEHGETDEHDDFEENFAP

Query:  HGDDTLSLG-GSVKRLKRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIAHLETESMKGSSSQKQWMELRLLQLEDHKLQTRVEMLELEKQKFKW
        HGD+    G GS KRL+RGQD D+AHACGNSL S DCN+ SH YS  QF   D A LETESMK S+SQKQWMELRLLQ+ED KLQ +VEMLELEKQ+FKW
Subjt:  HGDDTLSLG-GSVKRLKRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIAHLETESMKGSSSQKQWMELRLLQLEDHKLQTRVEMLELEKQKFKW

Query:  ERVNKKKDRELEKMRMINERMKLENERVALDIKQKEIGSRIH
        ER NKKKD ELEKMRM+NERMKLENER+ALD+KQKEIGS  H
Subjt:  ERVNKKKDRELEKMRMINERMKLENERVALDIKQKEIGSRIH

XP_022964375.1 uncharacterized protein LOC111464406 [Cucurbita moschata] (e value: 7.7e-249; identity: 100%)
Query:  MEGSLSQGLIPGGASYGGPDLQGSFKVHNQAQHHHLHTHQGSSVNPSTQEGFLLLQNCDHTMSLVDYSKGERCKNSQSDDEPSFNEDGIDAHNETIKGKK
        MEGSLSQGLIPGGASYGGPDLQGSFKVHNQAQHHHLHTHQGSSVNPSTQEGFLLLQNCDHTMSLVDYSKGERCKNSQSDDEPSFNEDGIDAHNETIKGKK
Subjt:  MEGSLSQGLIPGGASYGGPDLQGSFKVHNQAQHHHLHTHQGSSVNPSTQEGFLLLQNCDHTMSLVDYSKGERCKNSQSDDEPSFNEDGIDAHNETIKGKK

Query:  GSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPAL
        GSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPAL
Subjt:  GSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPAL

Query:  LDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNQLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQKDDFDENEHGETDEHDDFEENFAPHGDDTLS
        LDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNQLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQKDDFDENEHGETDEHDDFEENFAPHGDDTLS
Subjt:  LDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNQLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQKDDFDENEHGETDEHDDFEENFAPHGDDTLS

Query:  LGGSVKRLKRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIAHLETESMKGSSSQKQWMELRLLQLEDHKLQTRVEMLELEKQKFKWERVNKKKD
        LGGSVKRLKRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIAHLETESMKGSSSQKQWMELRLLQLEDHKLQTRVEMLELEKQKFKWERVNKKKD
Subjt:  LGGSVKRLKRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIAHLETESMKGSSSQKQWMELRLLQLEDHKLQTRVEMLELEKQKFKWERVNKKKD

Query:  RELEKMRMINERMKLENERVALDIKQKEIGSRIH
        RELEKMRMINERMKLENERVALDIKQKEIGSRIH
Subjt:  RELEKMRMINERMKLENERVALDIKQKEIGSRIH

XP_022990368.1 uncharacterized protein LOC111487246 [Cucurbita maxima] (e value: 3.1e-234; identity: 95.63%)
Query:  MEGSLSQ-GLIPGGASYGGPDLQGSFKVHNQAQHHHLHTHQGSSVNPSTQEGFLLLQNCDHTMSLVDYSKGERCKNSQSDDEPSFNEDGIDAHNETIKGK
        MEGSLSQ GLIPGGASYGGPDLQGSFKVHNQAQHHHL THQGSSVNPS QEGFLLLQNCDHTMSLVDYSKG+RCKNSQSDDEPSFNEDGI+       GK
Subjt:  MEGSLSQ-GLIPGGASYGGPDLQGSFKVHNQAQHHHLHTHQGSSVNPSTQEGFLLLQNCDHTMSLVDYSKGERCKNSQSDDEPSFNEDGIDAHNETIKGK

Query:  KGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPA
        KGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPA
Subjt:  KGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPA

Query:  LLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNQLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQKDDFDENEHGETDEHDDFEENFAPHGDDTL
        LLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHN N+LHLPHDPALQRSLQLACRVRDDHDYDEPRRHQ DDFDENEHGETDEHDDFEENFAP+GDDTL
Subjt:  LLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNQLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQKDDFDENEHGETDEHDDFEENFAPHGDDTL

Query:  SLGGSVKRLKRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIAHLETESMKGSSSQKQWMELRLLQLEDHKLQTRVEMLELEKQKFKWERVNKKK
        SLGGSVKRL+RGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDI HLETESMKGSSSQKQWMELRLLQLED KLQTRVEMLELEKQKFKWERVNKKK
Subjt:  SLGGSVKRLKRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIAHLETESMKGSSSQKQWMELRLLQLEDHKLQTRVEMLELEKQKFKWERVNKKK

Query:  DRELEKMRMINERMKLENERVALDIKQKEIGSRIH
        DRELEKMRMINERMKLENERVALDIKQKEIGSRIH
Subjt:  DRELEKMRMINERMKLENERVALDIKQKEIGSRIH

XP_023548026.1 uncharacterized protein LOC111806794 [Cucurbita pepo subsp. pepo] (e value: 1.8e-242; identity: 97.7%)
Query:  MEGSLSQ-GLIPGGASYGGPDLQGSFKVHNQAQHHHLHTHQGSSVNPSTQEGFLLLQNCDHTMSLVDYSKGERCKNSQSDDEPSFNEDGIDAHNETIKGK
        MEGSLSQ GLIPGGASYGGPDLQGSFKVHNQAQHHHLHTHQGSSVNPSTQEGFLLLQNCDHTMSLVDYSKGERCKNSQSDDEPSFNEDGID HNETIKGK
Subjt:  MEGSLSQ-GLIPGGASYGGPDLQGSFKVHNQAQHHHLHTHQGSSVNPSTQEGFLLLQNCDHTMSLVDYSKGERCKNSQSDDEPSFNEDGIDAHNETIKGK

Query:  KGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPA
        KG MWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTC+VVENPA
Subjt:  KGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPA

Query:  LLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNQLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQKDDFDENEHGETDEHDDFEENFAPHGDDTL
        LLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSN+LHLPHDPALQRSLQLACRVRDDHDYDEPRRHQ DDFDENEHGETDEHDDFEENFAPHGDDTL
Subjt:  LLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNQLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQKDDFDENEHGETDEHDDFEENFAPHGDDTL

Query:  SLGGSVKRLKRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIAHLETESMKGSSSQKQWMELRLLQLEDHKLQTRVEMLELEKQKFKWERVNKKK
        SLGGSVKRL+RGQDFDDAH CGNSLNSLDCNRGSHAYSQTQFAQGDIAHLETESMKGSSSQKQWMELRLL+LED KLQTRVEMLELEKQKFKWERVNKKK
Subjt:  SLGGSVKRLKRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIAHLETESMKGSSSQKQWMELRLLQLEDHKLQTRVEMLELEKQKFKWERVNKKK

Query:  DRELEKMRMINERMKLENERVALDIKQKEIGSRIH
        DRELEKMRMINERMKLENERVALDIKQKEIGSRIH
Subjt:  DRELEKMRMINERMKLENERVALDIKQKEIGSRIH
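
The % identity values listed for each hit can be read as the share of alignment columns in which query and subject carry the same residue. The following is a minimal Python sketch of that calculation. The helper `percent_identity` is hypothetical, the two toy strings are made up for illustration, and BLAST's own reporting may differ in details such as gap handling.

```python
def percent_identity(query: str, subject: str) -> float:
    """Percent of alignment columns where both rows hold the same residue.

    Both strings must be rows of one alignment (equal length, '-' for gaps).
    Gap columns count toward the alignment length but never as matches.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    if not query:
        raise ValueError("alignment is empty")
    matches = sum(1 for q, s in zip(query, subject) if q == s and q != "-")
    return 100.0 * matches / len(query)

# Toy aligned pair (illustrative only, not taken from a real subject sequence).
print(round(percent_identity("MEGSLSQ-GLIP", "MEGTLSQAGLIP"), 2))  # 83.33
```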

TrEMBL top hits (e value, % identity, alignment)
A0A1S3B4A7 LOW QUALITY PROTEIN: uncharacterized protein LOC103485620 (e value: 9.0e-203; identity: 81.74%)
Query:  MEGSLSQ-GLIPGGASYGGPDLQGSFKVHNQAQH-------HHLHTHQGSSVNPSTQEGFLL----LQNCDHTMSLVDYSKGERCKNSQSDDEPSFNEDG
        MEG+LSQ GLIPGG+SYGG DLQG FKVHNQ QH       HH HT QGSS NPS QEGF L    +QNCDHTMSLV+Y+KGERCKNS SD++PSFNED 
Subjt:  MEGSLSQ-GLIPGGASYGGPDLQGSFKVHNQAQH-------HHLHTHQGSSVNPSTQEGFLL----LQNCDHTMSLVDYSKGERCKNSQSDDEPSFNEDG

Query:  IDAHNETIKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGR
        ID HNE  KGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASD +G GRRK QIIQKKGKWKL+SKV+AERGYQVSPQQCEDKFNDLNKRYKRLND+IGR
Subjt:  IDAHNETIKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGR

Query:  GTTCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNQLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQKDDFDENEHGETDEHDDFE
        GT+CQVVENPALLDVIDYLT+K+KDDVRKILNSKQLFYEEMCSYHNSN+LHLPHDPALQRSLQLA R RDDHD DEPRRHQ DDFDE+E GETDEHDD+E
Subjt:  GTTCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNQLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQKDDFDENEHGETDEHDDFE

Query:  ENFAPHGDDTLS---LGGSVKRLKRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIAHLETESMKGSSSQKQWMELRLLQLEDHKLQTRVEMLEL
        ENF PH D+  S   LGGSVKRLKRGQD DDAHACGNSL+ LDCN+ SH +SQ QFAQ D AHLETESMK S+SQKQWMELRLLQLED KLQ +VEMLEL
Subjt:  ENFAPHGDDTLS---LGGSVKRLKRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIAHLETESMKGSSSQKQWMELRLLQLEDHKLQTRVEMLEL

Query:  EKQKFKWERVNKKKDRELEKMRMINERMKLENERVALDIKQKEIGSRIH
        EKQKFKWER NK KDRELEKMRM+NE+MKLENER+ALD+KQK+IGS  H
Subjt:  EKQKFKWERVNKKKDRELEKMRMINERMKLENERVALDIKQKEIGSRIH

A0A5D3DGK7 Putative transcription factor (e value: 1.8e-203; identity: 81.96%)
Query:  MEGSLSQ-GLIPGGASYGGPDLQGSFKVHNQAQH-------HHLHTHQGSSVNPSTQEGFLL----LQNCDHTMSLVDYSKGERCKNSQSDDEPSFNEDG
        MEG+LSQ GLIPGG+SYGG DLQG FKVHNQ QH       HH HT QGSS NPS QEGF L    +QNCDHTMSLV+Y+KGERCKNS SD++PSFNED 
Subjt:  MEGSLSQ-GLIPGGASYGGPDLQGSFKVHNQAQH-------HHLHTHQGSSVNPSTQEGFLL----LQNCDHTMSLVDYSKGERCKNSQSDDEPSFNEDG

Query:  IDAHNETIKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGR
        ID HNE  KGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASD +G GRRK QIIQKKGKWKL+SKV+AERGYQVSPQQCEDKFNDLNKRYKRLND+IGR
Subjt:  IDAHNETIKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGR

Query:  GTTCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNQLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQKDDFDENEHGETDEHDDFE
        GT+CQVVENPALLDVIDYLT+K+KDDVRKILNSKQLFYEEMCSYHNSN+LHLPHDPALQRSLQLA R RDDHD DEPRRHQ DDFDE+E GETDEHDD+E
Subjt:  GTTCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNQLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQKDDFDENEHGETDEHDDFE

Query:  ENFAPHGDDTLS---LGGSVKRLKRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIAHLETESMKGSSSQKQWMELRLLQLEDHKLQTRVEMLEL
        ENF PH D+  S   LGGSVKRLKRGQD DDAHACGNSL+ LDCN+ SH +SQ QFAQ D AHLETESMK S+SQKQWMELRLLQLED KLQ +VEMLEL
Subjt:  ENFAPHGDDTLS---LGGSVKRLKRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIAHLETESMKGSSSQKQWMELRLLQLEDHKLQTRVEMLEL

Query:  EKQKFKWERVNKKKDRELEKMRMINERMKLENERVALDIKQKEIGSRIH
        EKQKFKWER NKKKDRELEKMRM+NE+MKLENER+ALD+KQK+IGS  H
Subjt:  EKQKFKWERVNKKKDRELEKMRMINERMKLENERVALDIKQKEIGSRIH

A0A6J1BXD6 uncharacterized protein LOC111006553 (e value: 1.6e-204; identity: 83.48%)
Query:  MEGSLSQ-GLIPGGASYGGPDLQGSFKVHNQAQHHHLHTH--QGSSVNPSTQEGFLL----LQNCDHTMSLVDYSKGERCKNSQSDDEPSFNEDGIDAHN
        MEG+LSQ GLIPGG+SYGG DLQGSFKVHNQA HHH H+H  QGSS NPS QEGF L    LQNCDHTMS+VDY+KGER KNS SDDEPSF EDG+D HN
Subjt:  MEGSLSQ-GLIPGGASYGGPDLQGSFKVHNQAQHHHLHTH--QGSSVNPSTQEGFLL----LQNCDHTMSLVDYSKGERCKNSQSDDEPSFNEDGIDAHN

Query:  ETIKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQ
        ET KGKKGS+WHRVKWTDKMVKLLITAVSYIGDDIASD++GGGRRKFQIIQKKGKWKL+SKVMAERGYQVSPQQCEDKFNDLNKRYKRLND+IGRGT+CQ
Subjt:  ETIKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQ

Query:  VVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNQLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQKDDFDENEHGETDEHDDFEENFAP
        VVENPALLDV+DYLTDK+KDDVRKILNSKQLFYEEMCSYHNSN+LHLPHDPALQRSLQLA R RDDHD DEPRRHQ DDFDENEHGETDEHDDFEENFAP
Subjt:  VVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNQLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQKDDFDENEHGETDEHDDFEENFAP

Query:  HGDDTLSLG-GSVKRLKRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIAHLETESMKGSSSQKQWMELRLLQLEDHKLQTRVEMLELEKQKFKW
        HGD+    G GS KRL+RGQD D+AHACGNSL S DCN+ SH YS  QF   D A LETESMK S+SQKQWMELRLLQ+ED KLQ +VEMLELEKQ+FKW
Subjt:  HGDDTLSLG-GSVKRLKRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIAHLETESMKGSSSQKQWMELRLLQLEDHKLQTRVEMLELEKQKFKW

Query:  ERVNKKKDRELEKMRMINERMKLENERVALDIKQKEIGSRIH
        ER NKKKD ELEKMRM+NERMKLENER+ALD+KQKEIGS  H
Subjt:  ERVNKKKDRELEKMRMINERMKLENERVALDIKQKEIGSRIH

A0A6J1HKM3 uncharacterized protein LOC111464406 (e value: 3.7e-249; identity: 100%)
Query:  MEGSLSQGLIPGGASYGGPDLQGSFKVHNQAQHHHLHTHQGSSVNPSTQEGFLLLQNCDHTMSLVDYSKGERCKNSQSDDEPSFNEDGIDAHNETIKGKK
        MEGSLSQGLIPGGASYGGPDLQGSFKVHNQAQHHHLHTHQGSSVNPSTQEGFLLLQNCDHTMSLVDYSKGERCKNSQSDDEPSFNEDGIDAHNETIKGKK
Subjt:  MEGSLSQGLIPGGASYGGPDLQGSFKVHNQAQHHHLHTHQGSSVNPSTQEGFLLLQNCDHTMSLVDYSKGERCKNSQSDDEPSFNEDGIDAHNETIKGKK

Query:  GSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPAL
        GSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPAL
Subjt:  GSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPAL

Query:  LDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNQLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQKDDFDENEHGETDEHDDFEENFAPHGDDTLS
        LDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNQLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQKDDFDENEHGETDEHDDFEENFAPHGDDTLS
Subjt:  LDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNQLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQKDDFDENEHGETDEHDDFEENFAPHGDDTLS

Query:  LGGSVKRLKRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIAHLETESMKGSSSQKQWMELRLLQLEDHKLQTRVEMLELEKQKFKWERVNKKKD
        LGGSVKRLKRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIAHLETESMKGSSSQKQWMELRLLQLEDHKLQTRVEMLELEKQKFKWERVNKKKD
Subjt:  LGGSVKRLKRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIAHLETESMKGSSSQKQWMELRLLQLEDHKLQTRVEMLELEKQKFKWERVNKKKD

Query:  RELEKMRMINERMKLENERVALDIKQKEIGSRIH
        RELEKMRMINERMKLENERVALDIKQKEIGSRIH
Subjt:  RELEKMRMINERMKLENERVALDIKQKEIGSRIH

A0A6J1JT31 uncharacterized protein LOC111487246 (e value: 1.5e-234; identity: 95.63%)
Query:  MEGSLSQ-GLIPGGASYGGPDLQGSFKVHNQAQHHHLHTHQGSSVNPSTQEGFLLLQNCDHTMSLVDYSKGERCKNSQSDDEPSFNEDGIDAHNETIKGK
        MEGSLSQ GLIPGGASYGGPDLQGSFKVHNQAQHHHL THQGSSVNPS QEGFLLLQNCDHTMSLVDYSKG+RCKNSQSDDEPSFNEDGI+       GK
Subjt:  MEGSLSQ-GLIPGGASYGGPDLQGSFKVHNQAQHHHLHTHQGSSVNPSTQEGFLLLQNCDHTMSLVDYSKGERCKNSQSDDEPSFNEDGIDAHNETIKGK

Query:  KGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPA
        KGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPA
Subjt:  KGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPA

Query:  LLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNQLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQKDDFDENEHGETDEHDDFEENFAPHGDDTL
        LLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHN N+LHLPHDPALQRSLQLACRVRDDHDYDEPRRHQ DDFDENEHGETDEHDDFEENFAP+GDDTL
Subjt:  LLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNQLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQKDDFDENEHGETDEHDDFEENFAPHGDDTL

Query:  SLGGSVKRLKRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIAHLETESMKGSSSQKQWMELRLLQLEDHKLQTRVEMLELEKQKFKWERVNKKK
        SLGGSVKRL+RGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDI HLETESMKGSSSQKQWMELRLLQLED KLQTRVEMLELEKQKFKWERVNKKK
Subjt:  SLGGSVKRLKRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIAHLETESMKGSSSQKQWMELRLLQLEDHKLQTRVEMLELEKQKFKWERVNKKK

Query:  DRELEKMRMINERMKLENERVALDIKQKEIGSRIH
        DRELEKMRMINERMKLENERVALDIKQKEIGSRIH
Subjt:  DRELEKMRMINERMKLENERVALDIKQKEIGSRIH

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G21200.1 sequence-specific DNA binding transcription factors (e value: 1.3e-129; identity: 58.2%)
Query:  MEGSLSQGLI--PGGASYGGPDLQGSFKVHNQAQHHHLHTHQGSSVNPSTQEGFLLL--QNCDH----TMSLVDYSKGERCKNSQS-DDEPSFNEDGID-
        M+G+  QG +   G +SYGG DLQGS +VH+Q   +  H H  +S        F ++  Q CDH     MS+ +  K ER KNS S DDEPSF E+G D 
Subjt:  MEGSLSQGLI--PGGASYGGPDLQGSFKVHNQAQHHHLHTHQGSSVNPSTQEGFLLL--QNCDH----TMSLVDYSKGERCKNSQS-DDEPSFNEDGID-

Query:  AHNETIKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGT
         HNE  +  KGS W RVKWTDKMVKLLITAVSYIGDD  S  +   RRKF ++QKKGKWK +SKVMAERGY VSPQQCEDKFNDLNKRYK+LNDM+GRGT
Subjt:  AHNETIKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGT

Query:  TCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNQLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQKDDFDENEH-GETDEHDDFEE
        +CQVVENPALLD I YL DKEKDDVRKI++SK LFYEEMCSYHN N+LHLPHD ALQRSLQLA R RDDHD D+ R+HQ +D D+ +H G+ DEHD++EE
Subjt:  TCQVVENPALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNQLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQKDDFDENEH-GETDEHDDFEE

Query:  NFAPHGDDTLSL----GGSVKRLKRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIAHLETESMKGSSSQKQWMELRLLQLEDHKLQTRVEMLEL
            +GD  ++     GG +K+++     +D     + +NSL+CN+ S    Q  F+Q D+     ES +  S QKQWME R LQLE+ KLQ +VE+LEL
Subjt:  NFAPHGDDTLSL----GGSVKRLKRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIAHLETESMKGSSSQKQWMELRLLQLEDHKLQTRVEMLEL

Query:  EKQKFKWERVNKKKDRELEKMRMINERMKLENERVALDIKQKEIG
        EKQ+F+W+R +KK+D+ELE+MRM NERMKLEN+R+ L++KQ+E+G
Subjt:  EKQKFKWERVNKKKDRELEKMRMINERMKLENERVALDIKQKEIG

AT1G76870.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors (TAIR:AT1G21200.1) (e value: 2.3e-89; identity: 47.37%)
Query:  MEGSLSQGLIPGGASYGGPDLQGSFKVHNQAQHHHLHTHQGSSVNPSTQEGFLLLQNCDHTMSLVDYSKGERCKNSQSDDEP--SFNEDGIDAHNETIKG
        MEG+ SQG      S    DL+ +    NQ QHH          N     GF      ++TM    ++  +R K S S+D+     + DG +      K 
Subjt:  MEGSLSQGLIPGGASYGGPDLQGSFKVHNQAQHHHLHTHQGSSVNPSTQEGFLLLQNCDHTMSLVDYSKGERCKNSQSDDEP--SFNEDGIDAHNETIKG

Query:  KKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENP
        K+ S W RVKW DKMVKL+ITA+SYIG+D  SD      +KF ++QKKGKW+ +SKVM ERGY VSPQQCEDKFNDLNKRYK+LN+M+GRGT+C+VVENP
Subjt:  KKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENP

Query:  ALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNQLHLPHDPALQRSLQL-ACRVRDDHDYDEPRRHQKDDFDENEHGETDEHDDFEENFAPHGDD
        +LLD IDYL +KEKD+VR+I++SK LFYEEMCSYHN N+LHLPHDPA+QRSL L     RDDHD DE  +HQ +D D++        DD+EE+     D 
Subjt:  ALLDVIDYLTDKEKDDVRKILNSKQLFYEEMCSYHNSNQLHLPHDPALQRSLQL-ACRVRDDHDYDEPRRHQKDDFDENEHGETDEHDDFEENFAPHGDD

Query:  TLSLGGSVKRLKRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIAH-LETESMKGSSSQKQWMELRLLQLEDHKLQTRVEMLELEKQKFKWERVN
         LS    +KRL++ Q  +D    G+       N+G       + +Q D+   +  +S K +  Q+Q +E + L+LE  KLQ + EM+ELE+Q+FKWE  +
Subjt:  TLSLGGSVKRLKRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIAH-LETESMKGSSSQKQWMELRLLQLEDHKLQTRVEMLELEKQKFKWERVN

Query:  KKKDRELEKMRMINERMKLENERVALDIKQKEIGSRI
        K+++++L KMRM NERMKLENER++L++K+ E+G+++
Subjt:  KKKDRELEKMRMINERMKLENERVALDIKQKEIGSRI

AT3G10040.1 sequence-specific DNA binding transcription factors (e value: 7.1e-51; identity: 34.12%)
Query:  QAQHHHLHTHQGSSVNPSTQEGFLLLQNCDHTMSLVDYSKGERCKNSQSDDEPSFNEDGIDAHNETIKGKKG----SMWHRVKWTDKMVKLLITAVSYIG
        Q QH H +T  G        +      +    MS +    G  C     DDE   +  G   + E   G  G    S WHR+KWTD MV+LLI AV YIG
Subjt:  QAQHHHLHTHQGSSVNPSTQEGFLLLQNCDHTMSLVDYSKGERCKNSQSDDEPSFNEDGIDAHNETIKGKKG----SMWHRVKWTDKMVKLLITAVSYIG

Query:  DDIA----------SDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPALLDVIDYLTDKEKDDV
        D+            +   GGG     ++QKKGKWK +S+ M E+G+ VSPQQCEDKFNDLNKRYKR+ND++G+G  C+VVEN  LL+ +D+LT K KD+V
Subjt:  DDIA----------SDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPALLDVIDYLTDKEKDDV

Query:  RKILNSKQLFYEEMCSYHNSNQLHLPHD--PALQRSLQLACRVRDDHDYDE---------PRRHQKDDFDENEHGETDEHDDFEENFAPHGDDTLSLGGS
        +K+LNSK LF+ EMC+YHNS      HD  P  Q  + +    +  + +             R + ++  E++  E D   + EE+          +  +
Subjt:  RKILNSKQLFYEEMCSYHNSNQLHLPHD--PALQRSLQLACRVRDDHDYDE---------PRRHQKDDFDENEHGETDEHDDFEENFAPHGDDTLSLGGS

Query:  VKRLKRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIAHLETESMKGSSSQKQWMELRLLQLEDHKLQTRVEMLELEKQKFKWERVNKKKDRELE
        VKRL+                                   + A +  +  K    +K+W+  ++L++E+ K+    E +E+EKQ+ KW R   KK+RE+E
Subjt:  VKRLKRGQDFDDAHACGNSLNSLDCNRGSHAYSQTQFAQGDIAHLETESMKGSSSQKQWMELRLLQLEDHKLQTRVEMLELEKQKFKWERVNKKKDRELE

Query:  KMRMINERMKLENERVALDIKQKEI
        K ++ N+R +LE ER+ L +++ EI
Subjt:  KMRMINERMKLENERVALDIKQKEI

AT5G47660.1 Homeodomain-like superfamily protein (e value: 8.5e-04; identity: 24.79%)
Query:  LVDYSKGERCKNSQSDDEPSFNEDGIDAHNETIKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQV
        L +  K E+C+++Q + E  F            +   GS     +W  + V+ LI++ S + +                I K   W  +S  M ERGY+ 
Subjt:  LVDYSKGERCKNSQSDDEPSFNEDGIDAHNETIKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQV

Query:  SPQQCEDKFNDLNKRYKRLND
        S ++C++K+ ++NK Y+R+ +
Subjt:  SPQQCEDKFNDLNKRYKRLND


Sequences
CDS sequence
ATGGAAGGGAGTTTATCACAAGGGTTGATTCCAGGAGGGGCCTCTTATGGAGGTCCTGATTTGCAAGGCTCGTTTAAGGTTCATAATCAGGCACAACATCATCATCTTCA
TACTCATCAGGGTTCTTCAGTAAATCCTTCCACTCAGGAGGGTTTTTTACTTCTACAAAATTGTGACCATACCATGTCTTTGGTAGATTATAGCAAGGGAGAGAGGTGTA
AAAACTCACAGAGTGACGATGAGCCAAGCTTTAACGAGGACGGTATCGATGCTCATAATGAGACGATTAAGGGGAAGAAGGGATCGATGTGGCATCGTGTGAAATGGACG
GATAAAATGGTGAAGCTTCTGATTACGGCGGTGTCTTATATCGGAGATGATATTGCTTCGGATTATGAAGGAGGTGGAAGAAGGAAATTTCAAATTATACAGAAGAAAGG
TAAATGGAAATTGATGTCAAAGGTTATGGCTGAAAGGGGCTATCAAGTTTCACCACAGCAGTGTGAGGATAAATTTAATGACCTCAATAAGAGGTATAAGAGGCTGAATG
ATATGATTGGGAGAGGCACTACGTGCCAAGTTGTTGAGAATCCTGCACTTCTTGATGTCATTGATTATTTAACAGACAAAGAGAAGGATGATGTGAGAAAAATTTTGAAC
TCGAAGCAATTGTTCTATGAGGAAATGTGTTCGTATCATAATTCGAATCAACTCCATCTACCCCATGATCCTGCTTTGCAGCGTTCTTTGCAGTTGGCTTGTAGAGTTAG
GGATGATCATGATTATGATGAGCCAAGGAGACACCAAAAAGATGATTTTGATGAAAACGAACATGGTGAAACTGATGAACACGATGATTTTGAGGAGAATTTTGCACCTC
ATGGGGACGACACACTATCTCTAGGAGGCTCAGTAAAGAGGCTAAAGCGAGGTCAGGACTTCGACGATGCTCATGCTTGTGGAAATTCCTTGAATTCTCTTGATTGCAAT
AGAGGTTCTCATGCTTACTCACAAACACAATTCGCTCAAGGCGATATAGCTCATTTAGAAACTGAAAGTATGAAAGGTTCTTCGTCGCAAAAGCAGTGGATGGAGCTTCG
TTTACTTCAGTTGGAAGATCATAAGCTTCAAACTCGAGTTGAAATGTTGGAATTGGAGAAACAGAAGTTCAAATGGGAGAGAGTTAACAAGAAAAAGGACCGTGAGCTTG
AAAAAATGAGGATGATAAACGAGAGGATGAAGCTTGAAAACGAGCGCGTTGCGCTTGACATAAAGCAGAAGGAAATCGGTTCGAGAATTCATTGA
mRNA sequence
ATGGAAGGGAGTTTATCACAAGGGTTGATTCCAGGAGGGGCCTCTTATGGAGGTCCTGATTTGCAAGGCTCGTTTAAGGTTCATAATCAGGCACAACATCATCATCTTCA
TACTCATCAGGGTTCTTCAGTAAATCCTTCCACTCAGGAGGGTTTTTTACTTCTACAAAATTGTGACCATACCATGTCTTTGGTAGATTATAGCAAGGGAGAGAGGTGTA
AAAACTCACAGAGTGACGATGAGCCAAGCTTTAACGAGGACGGTATCGATGCTCATAATGAGACGATTAAGGGGAAGAAGGGATCGATGTGGCATCGTGTGAAATGGACG
GATAAAATGGTGAAGCTTCTGATTACGGCGGTGTCTTATATCGGAGATGATATTGCTTCGGATTATGAAGGAGGTGGAAGAAGGAAATTTCAAATTATACAGAAGAAAGG
TAAATGGAAATTGATGTCAAAGGTTATGGCTGAAAGGGGCTATCAAGTTTCACCACAGCAGTGTGAGGATAAATTTAATGACCTCAATAAGAGGTATAAGAGGCTGAATG
ATATGATTGGGAGAGGCACTACGTGCCAAGTTGTTGAGAATCCTGCACTTCTTGATGTCATTGATTATTTAACAGACAAAGAGAAGGATGATGTGAGAAAAATTTTGAAC
TCGAAGCAATTGTTCTATGAGGAAATGTGTTCGTATCATAATTCGAATCAACTCCATCTACCCCATGATCCTGCTTTGCAGCGTTCTTTGCAGTTGGCTTGTAGAGTTAG
GGATGATCATGATTATGATGAGCCAAGGAGACACCAAAAAGATGATTTTGATGAAAACGAACATGGTGAAACTGATGAACACGATGATTTTGAGGAGAATTTTGCACCTC
ATGGGGACGACACACTATCTCTAGGAGGCTCAGTAAAGAGGCTAAAGCGAGGTCAGGACTTCGACGATGCTCATGCTTGTGGAAATTCCTTGAATTCTCTTGATTGCAAT
AGAGGTTCTCATGCTTACTCACAAACACAATTCGCTCAAGGCGATATAGCTCATTTAGAAACTGAAAGTATGAAAGGTTCTTCGTCGCAAAAGCAGTGGATGGAGCTTCG
TTTACTTCAGTTGGAAGATCATAAGCTTCAAACTCGAGTTGAAATGTTGGAATTGGAGAAACAGAAGTTCAAATGGGAGAGAGTTAACAAGAAAAAGGACCGTGAGCTTG
AAAAAATGAGGATGATAAACGAGAGGATGAAGCTTGAAAACGAGCGCGTTGCGCTTGACATAAAGCAGAAGGAAATCGGTTCGAGAATTCATTGA
Protein sequence
MEGSLSQGLIPGGASYGGPDLQGSFKVHNQAQHHHLHTHQGSSVNPSTQEGFLLLQNCDHTMSLVDYSKGERCKNSQSDDEPSFNEDGIDAHNETIKGKKGSMWHRVKWT
DKMVKLLITAVSYIGDDIASDYEGGGRRKFQIIQKKGKWKLMSKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDMIGRGTTCQVVENPALLDVIDYLTDKEKDDVRKILN
SKQLFYEEMCSYHNSNQLHLPHDPALQRSLQLACRVRDDHDYDEPRRHQKDDFDENEHGETDEHDDFEENFAPHGDDTLSLGGSVKRLKRGQDFDDAHACGNSLNSLDCN
RGSHAYSQTQFAQGDIAHLETESMKGSSSQKQWMELRLLQLEDHKLQTRVEMLELEKQKFKWERVNKKKDRELEKMRMINERMKLENERVALDIKQKEIGSRIH
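
The relationship between the CDS and protein sequences above can be checked by translating the CDS. The following is a minimal Python sketch, assuming the standard nuclear genetic code (NCBI translation table 1) and a CDS that starts in frame. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this illustration, not part of any CuGenDB tool.

```python
from itertools import product

# Standard genetic code (NCBI table 1), listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the sequence can be
    pasted as wrapped lines. Codons with characters outside A/C/G/T
    raise KeyError; trailing bases that do not fill a codon are ignored.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the CDS listed above.
print(translate("ATGGAAGGGAGTTTATCACAAGGGTTGATT"))  # MEGSLSQGLI
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence is one way to confirm that the two listings agree.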