; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G6043 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G6043
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionSequence-specific DNA binding transcription factors
Genome locationctg1425:1341140..1350566
RNA-Seq ExpressionCucsat.G6043
SyntenyCucsat.G6043
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0054953.1 putative transcription factor [Cucumis melo var. makuwa]0.099.33Show/hide
Query:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS
        MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS
Subjt:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS

Query:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDG GRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE
        GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE
Subjt:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE

Query:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL
        ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQF QADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL
Subjt:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH
        EKQKFKWERFNKKKDRELEKMRMVNE+MKLENERLALDLKQKQIGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH

XP_008441519.2 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103485620 [Cucumis melo]0.099.11Show/hide
Query:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS
        MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS
Subjt:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS

Query:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDG GRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE
        GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE
Subjt:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE

Query:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL
        ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQF QADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL
Subjt:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH
        EKQKFKWERFNK KDRELEKMRMVNE+MKLENERLALDLKQKQIGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH

XP_023551421.1 uncharacterized protein LOC111809238 [Cucurbita pepo subsp. pepo]2.46e-29189.31Show/hide
Query:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS
        MEGNLSQGGLIPGG+SYGGLDLQGPFKVH+Q QHSHALHQQHHPHTRQGS+ANPSIQEGFSLSMGVVQNCDH MSLV+YNKGERCKNSASDE+PSF ED 
Subjt:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS

Query:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNE SKGKKGS+WHRVKWTDKMVKLLITAVSYIGDDI SD DG GRRKC IIQKKGKWKLISKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE
        GTSC+VVENPALLDV++YLT+K+KDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDE+E  ETDE DD+E
Subjt:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE

Query:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL
        ENF PH D+RRS GVLGGSVKRL+R QDHDD HACG SLS       SH H+QAQF QADTAHLETE MK STSQKQWMELRLLQLEDQKLQIQVEMLEL
Subjt:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH
        EKQKFKWERFNKKKDRELEKMRMVNERMKLENER+ALDLKQK+IGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH

XP_031743106.1 uncharacterized protein LOC105435760 [Cucumis sativus]0.099.55Show/hide
Query:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS
        MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQ SHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS
Subjt:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS

Query:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE
        GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEP ETDEHDDYE
Subjt:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE

Query:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL
        ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL
Subjt:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH
        EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH

XP_038885368.1 uncharacterized protein LOC120075776 [Benincasa hispida]3.51e-31194.43Show/hide
Query:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS
        MEGNLSQGGLIPGG+SYGGLDLQGPFKVHNQGQHSHALHQ HHPHTRQGSSANPSIQEGFSLSMGVV NCDHTM LVEYNKGERCKNSASDE+PSF ED 
Subjt:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS

Query:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        +DGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDI SD+DGGGR+KCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE
        GTSCQVVENPALLDVIDYLT+K+KDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDE E GETDEHDD+E
Subjt:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE

Query:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL
        ENF+PHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLS LDCNKSSH HSQA F QADTAHLETESMKASTSQKQWMELRLLQLE+QKLQIQVEMLEL
Subjt:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH
        EKQKFKW+RFNKKKDRELE MRMVNERMKLEN+RLALDLKQKQIGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH

TrEMBL top hitse value%identityAlignment
A0A0A0KBC2 Uncharacterized protein0.099.55Show/hide
Query:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS
        MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQ SHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS
Subjt:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS

Query:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE
        GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEP ETDEHDDYE
Subjt:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE

Query:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL
        ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL
Subjt:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH
        EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH

A0A1S3B4A7 LOW QUALITY PROTEIN: uncharacterized protein LOC1034856200.099.11Show/hide
Query:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS
        MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS
Subjt:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS

Query:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDG GRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE
        GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE
Subjt:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE

Query:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL
        ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQF QADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL
Subjt:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH
        EKQKFKWERFNK KDRELEKMRMVNE+MKLENERLALDLKQKQIGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH

A0A5D3DGK7 Putative transcription factor0.099.33Show/hide
Query:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS
        MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS
Subjt:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS

Query:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDG GRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE
        GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE
Subjt:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE

Query:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL
        ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQF QADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL
Subjt:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH
        EKQKFKWERFNKKKDRELEKMRMVNE+MKLENERLALDLKQKQIGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH

A0A6J1FFN2 uncharacterized protein LOC1114452949.43e-29189.53Show/hide
Query:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS
        MEGNLSQGGLIPGG+SYGGLDLQGPFKVH+Q QHSHALHQQHHPHTRQGS+ANPSIQEGFSLSMGVVQNCDH MSLV+YNKGERCKNSASDE+PSF ED 
Subjt:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS

Query:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNE SKGKKGS+WHRVKWTDKMVKLLITAVSYIGDDI SD DG GRRKCQI QKKGKWKLISKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE
        GTSC+VVENPALLD+++YLT+K+KDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDE+E  ETDE DD+E
Subjt:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE

Query:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL
        ENF PH DNRRS GVLGGSVKRL+R QDHDD HACG SLS       SH HSQAQF QADTAHLETE MK STSQKQWMELRLLQLEDQKLQIQVEMLEL
Subjt:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH
        EKQKFKWERFNKKKDRELEKMRMVNERMKLENER+ALDLKQK+IGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH

A0A6J1JU49 uncharacterized protein LOC1114897532.81e-29089.09Show/hide
Query:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS
        MEGNLSQGGLIPGG+SYGGLDLQ PFKVH+Q QHSHALHQQHHPHTRQGS+ANPSIQEGFSLSMGVVQNCDH MSLV++NKGERCKNSASDE+PSF ED 
Subjt:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS

Query:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNE SKGKKGS+WHRVKWTDKMVKLLITAVSYIGDDI SD DG GRRKC IIQKKGKWKLISKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE
        GTSC+VVENPALLDV++YLT+K+KDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDE+E  ETDE DD+E
Subjt:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE

Query:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL
        ENF  H DNRRS GVLGGSVKRL+RGQDHDD HACG SLS       SH H+QAQF QADTAHLETE MK STSQKQWMELRLLQLEDQKLQIQVEMLEL
Subjt:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH
        EKQKFKWERFNKKKDRELEKMRMVNERMKLENER+ALDLKQK+IGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21200.1 sequence-specific DNA binding transcription factors3.3e-13660.13Show/hide
Query:  MEGNLSQGGLI-PGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDH----TMSLVEYNKGERCKNSASDED-P
        M+GN  QGG++  G SSYGG DLQG  +VH    H  +++QQH    R   ++ P + EG   +M   Q CDH     MS+ E  K ER KNS SD+D P
Subjt:  MEGNLSQGGLI-PGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDH----TMSLVEYNKGERCKNSASDED-P

Query:  SFNEDSIDG-HNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKR
        SF E+  DG HNE ++  KGS W RVKWTDKMVKLLITAVSYIGDD  S ID   RRK  ++QKKGKWK +SKV+AERGY VSPQQCEDKFNDLNKRYK+
Subjt:  SFNEDSIDG-HNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKR

Query:  LNDIIGRGTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDF-DEHEPGE
        LND++GRGTSCQVVENPALLD I YL +K+KDDVRKI++SK LFYEEMCSYHN NRLHLPHD ALQRSLQLA R+RDDHDND+ R+HQ +D  DE   G+
Subjt:  LNDIIGRGTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDF-DEHEPGE

Query:  TDEHDDYEENFVPHTDNR-RSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKL
         DEHD+YEE    + D R    G  GG +K+++    H+D     + ++ L+CNK S P  Q  F+QAD      ES +A + QKQWME R LQLE+QKL
Subjt:  TDEHDDYEENFVPHTDNR-RSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKL

Query:  QIQVEMLELEKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIG
        QIQVE+LELEKQ+F+W+RF+KK+D+ELE+MRM NERMKLEN+R+ L+LKQ+++G
Subjt:  QIQVEMLELEKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIG

AT1G76870.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors (TAIR:AT1G21200.1)1.4e-9449.44Show/hide
Query:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS
        MEGN SQG      SS   L    P  + NQ Q      +QHHP++RQ S  N                  +TM    +N  +R K S S++D      S
Subjt:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS

Query:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
         DG N   K K+ S W RVKW DKMVKL+ITA+SYIG+D  SD      +K  ++QKKGKW+ +SKV+ ERGY VSPQQCEDKFNDLNKRYK+LN+++GR
Subjt:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQL-AFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDY
        GTSC+VVENP+LLD IDYL EK+KD+VR+I++SK LFYEEMCSYHN NRLHLPHDPA+QRSL L    +RDDHDNDE  +HQN+D         D+ DDY
Subjt:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQL-AFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDY

Query:  EENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLE
        EE+      +R         +KRL++ Q H+D    G+     D      P SQA   +     +  +S KA+  Q+Q +E + L+LE +KLQIQ EM+E
Subjt:  EENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLE

Query:  LEKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGS
        LE+Q+FKWE F+K+++++L KMRM NERMKLENER++L+LK+ ++G+
Subjt:  LEKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGS

AT3G10040.1 sequence-specific DNA binding transcription factors3.7e-5034.11Show/hide
Query:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQH-HPHTRQG-SSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNE
        ME N+   G  P       L L+ P    N     +++  QH HP+T  G     P I+  +  +    Q     MS +    G  C     DED     
Subjt:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQH-HPHTRQG-SSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNE

Query:  DSIDGHN-ENSKGKKG----SMWHRVKWTDKMVKLLITAVSYIGDD--IASDID--------GGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDK
         S  G N E+S G  G    S WHR+KWTD MV+LLI AV YIGD+  +   +D        GGG     ++QKKGKWK +S+ + E+G+ VSPQQCEDK
Subjt:  DSIDGHN-ENSKGKKG----SMWHRVKWTDKMVKLLITAVSYIGDD--IASDID--------GGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDK

Query:  FNDLNKRYKRLNDIIGRGTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNS--------------NRLHLPHDPALQRSLQLAFRAR
        FNDLNKRYKR+NDI+G+G +C+VVEN  LL+ +D+LT K KD+V+K+LNSK LF+ EMC+YHNS              N + +P     Q     A   +
Subjt:  FNDLNKRYKRLNDIIGRGTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNS--------------NRLHLPHDPALQRSLQLAFRAR

Query:  DDHDNDEPRRHQNDDFDEHEPGETDEHDDYEENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESM
             +     +  + D  E  E++  +  EE     T  +R +     +VKRL+                                   + A +  +  
Subjt:  DDHDNDEPRRHQNDDFDEHEPGETDEHDDYEENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESM

Query:  KASTSQKQWMELRLLQLEDQKLQIQVEMLELEKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQI
        K+   +K+W+  ++L++E++K+  + E +E+EKQ+ KW R+  KK+RE+EK ++ N+R +LE ER+ L L++ +I
Subjt:  KASTSQKQWMELRLLQLEDQKLQIQVEMLELEKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGGGAATTTATCACAAGGAGGGTTGATTCCAGGAGGGAGTTCTTATGGAGGTCTTGATTTGCAAGGACCGTTTAAGGTTCATAATCAGGGACAGCACTCTCATGC
CTTACACCAACAGCATCATCCTCATACTCGTCAGGGATCATCAGCTAATCCCTCCATTCAGGAGGGATTTTCACTTTCCATGGGAGTTGTACAAAATTGTGACCATACCA
TGTCTTTGGTAGAGTATAACAAGGGAGAAAGGTGTAAAAACTCAGCTAGCGACGAAGATCCGAGTTTTAATGAGGATAGTATTGATGGTCATAATGAGAATAGTAAGGGG
AAGAAGGGATCGATGTGGCACCGCGTGAAATGGACAGATAAAATGGTGAAGCTTCTGATTACAGCAGTGTCTTATATAGGAGATGATATTGCTTCAGATATTGATGGGGG
TGGAAGAAGGAAATGCCAAATTATACAGAAGAAAGGTAAATGGAAACTGATATCAAAAGTCATTGCTGAAAGGGGTTATCAAGTTTCACCTCAGCAGTGTGAGGATAAAT
TTAATGACCTCAATAAGAGGTATAAGAGGCTCAATGATATAATTGGAAGAGGCACTTCTTGTCAGGTTGTTGAGAATCCAGCACTTCTTGATGTCATTGATTATTTAACT
GAAAAAGATAAGGATGATGTGAGAAAAATTTTAAACTCAAAGCAACTGTTCTATGAGGAGATGTGTTCTTATCATAACTCAAATCGACTTCATCTGCCCCATGATCCTGC
TTTGCAGCGTTCGTTGCAGCTGGCTTTTAGAGCAAGGGATGATCACGATAATGACGAGCCAAGAAGACATCAAAATGATGATTTTGATGAACACGAACCTGGTGAAACTG
ATGAACATGATGATTATGAGGAGAATTTTGTACCCCATACGGACAATAGGCGATCGCTTGGGGTATTAGGAGGTTCGGTGAAGAGGCTAAAACGAGGCCAAGACCATGAT
GATGCACATGCTTGTGGTAATTCCTTGAGTCCTCTTGATTGCAACAAAAGTTCTCATCCTCATTCACAAGCACAATTTACTCAAGCTGATACAGCTCATTTAGAAACTGA
AAGTATGAAAGCTTCTACATCGCAAAAGCAGTGGATGGAGCTTCGCTTACTTCAATTGGAAGATCAGAAACTTCAAATTCAAGTTGAAATGTTGGAACTGGAGAAGCAAA
AGTTTAAGTGGGAGAGATTTAACAAGAAAAAGGACCGCGAGTTGGAAAAAATGAGGATGGTGAACGAGAGGATGAAGCTTGAAAATGAGCGCCTTGCACTTGACTTAAAG
CAAAAGCAAATTGGATCAGGATTTCATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGGGAATTTATCACAAGGAGGGTTGATTCCAGGAGGGAGTTCTTATGGAGGTCTTGATTTGCAAGGACCGTTTAAGGTTCATAATCAGGGACAGCACTCTCATGC
CTTACACCAACAGCATCATCCTCATACTCGTCAGGGATCATCAGCTAATCCCTCCATTCAGGAGGGATTTTCACTTTCCATGGGAGTTGTACAAAATTGTGACCATACCA
TGTCTTTGGTAGAGTATAACAAGGGAGAAAGGTGTAAAAACTCAGCTAGCGACGAAGATCCGAGTTTTAATGAGGATAGTATTGATGGTCATAATGAGAATAGTAAGGGG
AAGAAGGGATCGATGTGGCACCGCGTGAAATGGACAGATAAAATGGTGAAGCTTCTGATTACAGCAGTGTCTTATATAGGAGATGATATTGCTTCAGATATTGATGGGGG
TGGAAGAAGGAAATGCCAAATTATACAGAAGAAAGGTAAATGGAAACTGATATCAAAAGTCATTGCTGAAAGGGGTTATCAAGTTTCACCTCAGCAGTGTGAGGATAAAT
TTAATGACCTCAATAAGAGGTATAAGAGGCTCAATGATATAATTGGAAGAGGCACTTCTTGTCAGGTTGTTGAGAATCCAGCACTTCTTGATGTCATTGATTATTTAACT
GAAAAAGATAAGGATGATGTGAGAAAAATTTTAAACTCAAAGCAACTGTTCTATGAGGAGATGTGTTCTTATCATAACTCAAATCGACTTCATCTGCCCCATGATCCTGC
TTTGCAGCGTTCGTTGCAGCTGGCTTTTAGAGCAAGGGATGATCACGATAATGACGAGCCAAGAAGACATCAAAATGATGATTTTGATGAACACGAACCTGGTGAAACTG
ATGAACATGATGATTATGAGGAGAATTTTGTACCCCATACGGACAATAGGCGATCGCTTGGGGTATTAGGAGGTTCGGTGAAGAGGCTAAAACGAGGCCAAGACCATGAT
GATGCACATGCTTGTGGTAATTCCTTGAGTCCTCTTGATTGCAACAAAAGTTCTCATCCTCATTCACAAGCACAATTTACTCAAGCTGATACAGCTCATTTAGAAACTGA
AAGTATGAAAGCTTCTACATCGCAAAAGCAGTGGATGGAGCTTCGCTTACTTCAATTGGAAGATCAGAAACTTCAAATTCAAGTTGAAATGTTGGAACTGGAGAAGCAAA
AGTTTAAGTGGGAGAGATTTAACAAGAAAAAGGACCGCGAGTTGGAAAAAATGAGGATGGTGAACGAGAGGATGAAGCTTGAAAATGAGCGCCTTGCACTTGACTTAAAG
CAAAAGCAAATTGGATCAGGATTTCATTAA
Protein sequenceShow/hide protein sequence
MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDSIDGHNENSKG
KKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGRGTSCQVVENPALLDVIDYLT
EKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYEENFVPHTDNRRSLGVLGGSVKRLKRGQDHD
DAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLELEKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLK
QKQIGSGFH