; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI06G04930 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI06G04930
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionSequence-specific DNA binding transcription factors
Genome locationChr6:4490142..4492642
RNA-Seq ExpressionCSPI06G04930
SyntenyCSPI06G04930
Gene Ontology termsNA
InterPro domainsIPR044822 - Myb/SANT-like DNA-binding domain 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0054953.1 putative transcription factor [Cucumis melo var. makuwa]1.3e-25499.33Show/hide
Query:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS
        MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS
Subjt:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS

Query:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDG GRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE
        GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE
Subjt:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE

Query:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL
        ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQF QADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL
Subjt:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH
        EKQKFKWERFNKKKDRELEKMRMVNE+MKLENERLALDLKQKQIGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH

XP_008441519.2 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103485620 [Cucumis melo]6.3e-25499.11Show/hide
Query:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS
        MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS
Subjt:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS

Query:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDG GRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE
        GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE
Subjt:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE

Query:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL
        ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQF QADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL
Subjt:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH
        EKQKFKWERFNK KDRELEKMRMVNE+MKLENERLALDLKQKQIGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH

XP_023551421.1 uncharacterized protein LOC111809238 [Cucurbita pepo subsp. pepo]6.6e-22789.31Show/hide
Query:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS
        MEGNLSQGGLIPGG+SYGGLDLQGPFKVH+Q QHSHALHQQHHPHTRQGS+ANPSIQEGFSLSMGVVQNCDH MSLV+YNKGERCKNSASDE+PSF ED 
Subjt:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS

Query:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNE SKGKKGS+WHRVKWTDKMVKLLITAVSYIGDDI SD DG GRRKC IIQKKGKWKLISKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE
        GTSC+VVENPALLDV++YLT+K+KDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDE+E  ETDE DD+E
Subjt:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE

Query:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL
        ENF PH D+RRS GVLGGSVKRL+R QDHDD HACG SL       SSH H+QAQF QADTAHLETE MK STSQKQWMELRLLQLEDQKLQIQVEMLEL
Subjt:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH
        EKQKFKWERFNKKKDRELEKMRMVNERMKLENER+ALDLKQK+IGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH

XP_031743106.1 uncharacterized protein LOC105435760 [Cucumis sativus]3.7e-25499.55Show/hide
Query:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS
        MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQ SHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS
Subjt:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS

Query:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE
        GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEP ETDEHDDYE
Subjt:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE

Query:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL
        ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL
Subjt:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH
        EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH

XP_038885368.1 uncharacterized protein LOC120075776 [Benincasa hispida]2.5e-24294.43Show/hide
Query:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS
        MEGNLSQGGLIPGG+SYGGLDLQGPFKVHNQGQHSHALHQ HHPHTRQGSSANPSIQEGFSLSMGVV NCDHTM LVEYNKGERCKNSASDE+PSF ED 
Subjt:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS

Query:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        +DGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDI SD+DGGGR+KCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE
        GTSCQVVENPALLDVIDYLT+K+KDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDE E GETDEHDD+E
Subjt:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE

Query:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL
        ENF+PHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLS LDCNKSSH HSQA F QADTAHLETESMKASTSQKQWMELRLLQLE+QKLQIQVEMLEL
Subjt:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH
        EKQKFKW+RFNKKKDRELE MRMVNERMKLEN+RLALDLKQKQIGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH

TrEMBL top hitse value%identityAlignment
A0A0A0KBC2 Uncharacterized protein1.8e-25499.55Show/hide
Query:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS
        MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQ SHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS
Subjt:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS

Query:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE
        GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEP ETDEHDDYE
Subjt:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE

Query:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL
        ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL
Subjt:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH
        EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH

A0A1S3B4A7 LOW QUALITY PROTEIN: uncharacterized protein LOC1034856203.1e-25499.11Show/hide
Query:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS
        MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS
Subjt:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS

Query:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDG GRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE
        GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE
Subjt:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE

Query:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL
        ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQF QADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL
Subjt:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH
        EKQKFKWERFNK KDRELEKMRMVNE+MKLENERLALDLKQKQIGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH

A0A5D3DGK7 Putative transcription factor6.2e-25599.33Show/hide
Query:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS
        MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS
Subjt:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS

Query:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDG GRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE
        GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE
Subjt:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE

Query:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL
        ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQF QADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL
Subjt:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH
        EKQKFKWERFNKKKDRELEKMRMVNE+MKLENERLALDLKQKQIGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH

A0A6J1FFN2 uncharacterized protein LOC1114452941.6e-22689.53Show/hide
Query:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS
        MEGNLSQGGLIPGG+SYGGLDLQGPFKVH+Q QHSHALHQQHHPHTRQGS+ANPSIQEGFSLSMGVVQNCDH MSLV+YNKGERCKNSASDE+PSF ED 
Subjt:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS

Query:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNE SKGKKGS+WHRVKWTDKMVKLLITAVSYIGDDI SD DG GRRKCQ IQKKGKWKLISKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE
        GTSC+VVENPALLD+++YLT+K+KDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDE+E  ETDE DD+E
Subjt:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE

Query:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL
        ENF PH DNRRS GVLGGSVKRL+R QDHDD HACG SL       SSH HSQAQF QADTAHLETE MK STSQKQWMELRLLQLEDQKLQIQVEMLEL
Subjt:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH
        EKQKFKWERFNKKKDRELEKMRMVNERMKLENER+ALDLKQK+IGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH

A0A6J1JU49 uncharacterized protein LOC1114897533.5e-22689.09Show/hide
Query:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS
        MEGNLSQGGLIPGG+SYGGLDLQ PFKVH+Q QHSHALHQQHHPHTRQGS+ANPSIQEGFSLSMGVVQNCDH MSLV++NKGERCKNSASDE+PSF ED 
Subjt:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS

Query:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        IDGHNE SKGKKGS+WHRVKWTDKMVKLLITAVSYIGDDI SD DG GRRKC IIQKKGKWKLISKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE
        GTSC+VVENPALLDV++YLT+K+KDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDE+E  ETDE DD+E
Subjt:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYE

Query:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL
        ENF  H DNRRS GVLGGSVKRL+RGQDHDD HACG SL       SSH H+QAQF QADTAHLETE MK STSQKQWMELRLLQLEDQKLQIQVEMLEL
Subjt:  ENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLEL

Query:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH
        EKQKFKWERFNKKKDRELEKMRMVNERMKLENER+ALDLKQK+IGSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGSGFH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21200.1 sequence-specific DNA binding transcription factors2.0e-13660.13Show/hide
Query:  MEGNLSQGGLI-PGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDH----TMSLVEYNKGERCKNSASDED-P
        M+GN  QGG++  G SSYGG DLQG  +VH    H  +++QQH    R   ++ P + EG   +M   Q CDH     MS+ E  K ER KNS SD+D P
Subjt:  MEGNLSQGGLI-PGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDH----TMSLVEYNKGERCKNSASDED-P

Query:  SFNEDSIDG-HNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKR
        SF E+  DG HNE ++  KGS W RVKWTDKMVKLLITAVSYIGDD  S ID   RRK  ++QKKGKWK +SKV+AERGY VSPQQCEDKFNDLNKRYK+
Subjt:  SFNEDSIDG-HNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKR

Query:  LNDIIGRGTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDF-DEHEPGE
        LND++GRGTSCQVVENPALLD I YL +K+KDDVRKI++SK LFYEEMCSYHN NRLHLPHD ALQRSLQLA R+RDDHDND+ R+HQ +D  DE   G+
Subjt:  LNDIIGRGTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDF-DEHEPGE

Query:  TDEHDDYEENFVPHTDNR-RSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKL
         DEHD+YEE    + D R    G  GG +K+++    H+D     + ++ L+CNK S P  Q  F+QAD      ES +A + QKQWME R LQLE+QKL
Subjt:  TDEHDDYEENFVPHTDNR-RSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKL

Query:  QIQVEMLELEKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIG
        QIQVE+LELEKQ+F+W+RF+KK+D+ELE+MRM NERMKLEN+R+ L+LKQ+++G
Subjt:  QIQVEMLELEKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIG

AT1G76870.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors (TAIR:AT1G21200.1)8.3e-9549.44Show/hide
Query:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS
        MEGN SQG      SS   L    P  + NQ Q      +QHHP++RQ S  N                  +TM    +N  +R K S S++D      S
Subjt:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDS

Query:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
         DG N   K K+ S W RVKW DKMVKL+ITA+SYIG+D  SD      +K  ++QKKGKW+ +SKV+ ERGY VSPQQCEDKFNDLNKRYK+LN+++GR
Subjt:  IDGHNENSKGKKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQL-AFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDY
        GTSC+VVENP+LLD IDYL EK+KD+VR+I++SK LFYEEMCSYHN NRLHLPHDPA+QRSL L    +RDDHDNDE  +HQN+D         D+ DDY
Subjt:  GTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQL-AFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDY

Query:  EENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLE
        EE+      +R         +KRL++ Q H+D    G+     D      P SQA   +     +  +S KA+  Q+Q +E + L+LE +KLQIQ EM+E
Subjt:  EENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLE

Query:  LEKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGS
        LE+Q+FKWE F+K+++++L KMRM NERMKLENER++L+LK+ ++G+
Subjt:  LEKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQIGS

AT3G10040.1 sequence-specific DNA binding transcription factors2.2e-5034.11Show/hide
Query:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQH-HPHTRQG-SSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNE
        ME N+   G  P       L L+ P    N     +++  QH HP+T  G     P I+  +  +    Q     MS +    G  C     DED     
Subjt:  MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQH-HPHTRQG-SSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNE

Query:  DSIDGHN-ENSKGKKG----SMWHRVKWTDKMVKLLITAVSYIGDD--IASDID--------GGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDK
         S  G N E+S G  G    S WHR+KWTD MV+LLI AV YIGD+  +   +D        GGG     ++QKKGKWK +S+ + E+G+ VSPQQCEDK
Subjt:  DSIDGHN-ENSKGKKG----SMWHRVKWTDKMVKLLITAVSYIGDD--IASDID--------GGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDK

Query:  FNDLNKRYKRLNDIIGRGTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNS--------------NRLHLPHDPALQRSLQLAFRAR
        FNDLNKRYKR+NDI+G+G +C+VVEN  LL+ +D+LT K KD+V+K+LNSK LF+ EMC+YHNS              N + +P     Q     A   +
Subjt:  FNDLNKRYKRLNDIIGRGTSCQVVENPALLDVIDYLTEKDKDDVRKILNSKQLFYEEMCSYHNS--------------NRLHLPHDPALQRSLQLAFRAR

Query:  DDHDNDEPRRHQNDDFDEHEPGETDEHDDYEENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESM
             +     +  + D  E  E++  +  EE     T  +R +     +VKRL+                                   + A +  +  
Subjt:  DDHDNDEPRRHQNDDFDEHEPGETDEHDDYEENFVPHTDNRRSLGVLGGSVKRLKRGQDHDDAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESM

Query:  KASTSQKQWMELRLLQLEDQKLQIQVEMLELEKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQI
        K+   +K+W+  ++L++E++K+  + E +E+EKQ+ KW R+  KK+RE+EK ++ N+R +LE ER+ L L++ +I
Subjt:  KASTSQKQWMELRLLQLEDQKLQIQVEMLELEKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLKQKQI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGGGAATTTATCACAAGGAGGGTTGATTCCAGGAGGGAGTTCTTATGGAGGTCTTGATTTGCAAGGACCGTTTAAGGTTCATAATCAGGGACAGCACTCTCATGC
CTTACACCAACAGCATCATCCTCATACTCGTCAGGGATCATCAGCTAATCCCTCCATTCAGGAGGGATTTTCACTTTCCATGGGAGTTGTACAAAATTGTGACCATACCA
TGTCTTTGGTAGAGTATAACAAGGGAGAAAGGTGTAAAAACTCAGCTAGCGACGAAGATCCGAGTTTTAATGAGGATAGTATTGATGGTCATAATGAGAATAGTAAGGGG
AAGAAGGGATCGATGTGGCACCGCGTGAAATGGACAGATAAAATGGTGAAGCTTCTGATTACAGCAGTGTCTTATATAGGAGATGATATTGCTTCAGATATTGATGGGGG
TGGAAGAAGGAAATGCCAAATTATACAGAAGAAAGGTAAATGGAAACTGATATCAAAAGTCATTGCTGAAAGGGGTTATCAAGTTTCACCTCAGCAGTGTGAGGATAAAT
TTAATGACCTCAATAAGAGGTATAAGAGGCTCAATGATATAATTGGGAGAGGCACTTCTTGTCAGGTTGTTGAGAATCCAGCACTTCTTGATGTCATTGATTATTTAACT
GAAAAAGATAAGGATGATGTGAGAAAAATTTTAAACTCAAAGCAACTGTTCTATGAGGAGATGTGTTCTTATCATAACTCAAATCGACTTCATCTGCCCCATGATCCTGC
TTTGCAGCGTTCGTTGCAGCTGGCTTTTAGAGCAAGGGATGATCACGATAATGACGAGCCAAGAAGACATCAAAATGATGATTTTGATGAACACGAACCTGGTGAAACTG
ATGAACATGATGATTATGAGGAGAATTTTGTACCCCATACGGACAATAGGCGATCGCTTGGGGTATTAGGAGGTTCGGTGAAGAGGCTAAAACGAGGCCAAGACCATGAT
GATGCACATGCTTGTGGTAATTCCTTGAGTCCTCTTGATTGCAACAAAAGTTCTCATCCTCATTCACAAGCACAATTTACTCAAGCTGATACAGCTCATTTAGAAACTGA
AAGTATGAAAGCTTCTACATCGCAAAAGCAGTGGATGGAGCTTCGCTTACTTCAATTGGAAGATCAGAAACTTCAAATTCAAGTTGAAATGTTGGAACTGGAGAAACAAA
AGTTTAAGTGGGAGAGATTTAACAAGAAAAAGGACCGTGAGTTGGAAAAAATGAGGATGGTGAACGAGAGGATGAAGCTTGAAAATGAGCGCCTTGCACTTGACTTAAAG
CAAAAGCAAATTGGATCAGGATTTCATTAA
mRNA sequenceShow/hide mRNA sequence
AGAAAGAAAAAAAAAGGGATGAAAAGTTAAAGTATGGGAATCAAAGTCTTGTAATTGAAGCAGCCATAATCAAAGAAACGGACGTTAATACTTATTGAGGAGATGCTGCT
CAAACATGGCGTATTGATTGATATGATTAAACACAACCCAATAGCACGCAACAAGTTTTTGTCAGCATTTGTGACGTCGAATTTAAAAATGACCCTCCCATTCGATTCAT
TTAACCTACGCATCCGTATTGGCTTCTCTGATCGGTTCTTTCGCCCACCCATTTGACATTTTTCCATTTTTCCTTCTTAAAATACAATAAAACCCATATCCCTCTCTTCC
CCTTTTCCCTTCTCGATCGCCACACTCACCCAATAAGCGCCTGGATCTGTTTCCTTCTATCTGGGTTTTCAGCTCTTGGCGGGTTTTGCTCAGATAAGTGATTTTGCTTC
TTTGTTGATTGTTGATTGTGTTGCTCTTTCAAGGGTTCTTTCTCTTATCTTACCTCTTTTTTCAGTTTAATCGTTCTTGTCCAATATTTGAGTTTAGTTGTGTTTGAAGG
GGCTTCTCTTGGTTCTGGGAAAATGGGGTTTGTTTGAAAGAAATTTCCTATTATCTAGTGGGGAAATTTCGGCTTGGGTGTGCTTGATTTGTCCATTCTGTTATCTTAGG
GATGATTAATTGGTGATTTTAATAGTTTAAGGATCCTGTGGAAGATGGGATTCTGGGGTATGTGTTTTTGAGTATGAGGTGAAGTGTTCCTTTGAAATATATGGAAGGGA
ATTTATCACAAGGAGGGTTGATTCCAGGAGGGAGTTCTTATGGAGGTCTTGATTTGCAAGGACCGTTTAAGGTTCATAATCAGGGACAGCACTCTCATGCCTTACACCAA
CAGCATCATCCTCATACTCGTCAGGGATCATCAGCTAATCCCTCCATTCAGGAGGGATTTTCACTTTCCATGGGAGTTGTACAAAATTGTGACCATACCATGTCTTTGGT
AGAGTATAACAAGGGAGAAAGGTGTAAAAACTCAGCTAGCGACGAAGATCCGAGTTTTAATGAGGATAGTATTGATGGTCATAATGAGAATAGTAAGGGGAAGAAGGGAT
CGATGTGGCACCGCGTGAAATGGACAGATAAAATGGTGAAGCTTCTGATTACAGCAGTGTCTTATATAGGAGATGATATTGCTTCAGATATTGATGGGGGTGGAAGAAGG
AAATGCCAAATTATACAGAAGAAAGGTAAATGGAAACTGATATCAAAAGTCATTGCTGAAAGGGGTTATCAAGTTTCACCTCAGCAGTGTGAGGATAAATTTAATGACCT
CAATAAGAGGTATAAGAGGCTCAATGATATAATTGGGAGAGGCACTTCTTGTCAGGTTGTTGAGAATCCAGCACTTCTTGATGTCATTGATTATTTAACTGAAAAAGATA
AGGATGATGTGAGAAAAATTTTAAACTCAAAGCAACTGTTCTATGAGGAGATGTGTTCTTATCATAACTCAAATCGACTTCATCTGCCCCATGATCCTGCTTTGCAGCGT
TCGTTGCAGCTGGCTTTTAGAGCAAGGGATGATCACGATAATGACGAGCCAAGAAGACATCAAAATGATGATTTTGATGAACACGAACCTGGTGAAACTGATGAACATGA
TGATTATGAGGAGAATTTTGTACCCCATACGGACAATAGGCGATCGCTTGGGGTATTAGGAGGTTCGGTGAAGAGGCTAAAACGAGGCCAAGACCATGATGATGCACATG
CTTGTGGTAATTCCTTGAGTCCTCTTGATTGCAACAAAAGTTCTCATCCTCATTCACAAGCACAATTTACTCAAGCTGATACAGCTCATTTAGAAACTGAAAGTATGAAA
GCTTCTACATCGCAAAAGCAGTGGATGGAGCTTCGCTTACTTCAATTGGAAGATCAGAAACTTCAAATTCAAGTTGAAATGTTGGAACTGGAGAAACAAAAGTTTAAGTG
GGAGAGATTTAACAAGAAAAAGGACCGTGAGTTGGAAAAAATGAGGATGGTGAACGAGAGGATGAAGCTTGAAAATGAGCGCCTTGCACTTGACTTAAAGCAAAAGCAAA
TTGGATCAGGATTTCATTAATTGGCTGACATGATCTGAATTAATTCTATCAGGTTGGAAGATTTGCTTTTAATTAAGCCACTACTTTGAATCGACTAGCTGCTGCTGTTT
ATATATTTCTCGTTATGCTGGTAATAGGTTAGCGTTCTATTTGAGTATTTTTCATTGCTTACATCTAGCATGTAAAGTTAGAAAGGTGACAATACAAATGACTGTTTTTC
ACTGTTTTGCCTTTACTTATGTGGAATCTATCTTGGTTGAGGAATTTGAAAAATGGTTGGATATTTTGTGTCAAATTGATGCCATTTCTCTGAACCAATTCTACAGTTTT
GGTCACGCATTCAAATTAATGAACAATCTAGCTTACAGATGAGTATATTTACTGCTGGATTATTGAGCTCATTCAATGGAG
Protein sequenceShow/hide protein sequence
MEGNLSQGGLIPGGSSYGGLDLQGPFKVHNQGQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTMSLVEYNKGERCKNSASDEDPSFNEDSIDGHNENSKG
KKGSMWHRVKWTDKMVKLLITAVSYIGDDIASDIDGGGRRKCQIIQKKGKWKLISKVIAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGRGTSCQVVENPALLDVIDYLT
EKDKDDVRKILNSKQLFYEEMCSYHNSNRLHLPHDPALQRSLQLAFRARDDHDNDEPRRHQNDDFDEHEPGETDEHDDYEENFVPHTDNRRSLGVLGGSVKRLKRGQDHD
DAHACGNSLSPLDCNKSSHPHSQAQFTQADTAHLETESMKASTSQKQWMELRLLQLEDQKLQIQVEMLELEKQKFKWERFNKKKDRELEKMRMVNERMKLENERLALDLK
QKQIGSGFH