; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0008676 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0008676
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionSequence-specific DNA binding transcription factors
Genome locationchr05:22122836..22126722
RNA-Seq ExpressionPI0008676
SyntenyPI0008676
Gene Ontology termsNA
InterPro domainsIPR044822 - Myb/SANT-like DNA-binding domain 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYJ96146.1 stress response protein nst1 isoform X1 [Cucumis melo var. makuwa]3.7e-24696.85Show/hide
Query:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASGCITNNYTSEEDEPSYTED
        MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIAS CITNNYTSEEDEPSYTED
Subjt:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASGCITNNYTSEEDEPSYTED

Query:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGIGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDIIGRGTS
        GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAG+GSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDI+G+GTS
Subjt:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGIGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDIIGRGTS

Query:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQKIPGCQDVDFQGKILPVANFSKGNNESEDVEDSDSDSDSGESDNEDDHSPVENRL
        CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQ IPGCQDVDFQGKILP ANFSKGNNESE+ EDSDSDSDSGESDNEDDHSP ENRL
Subjt:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQKIPGCQDVDFQGKILPVANFSKGNNESEDVEDSDSDSDSGESDNEDDHSPVENRL

Query:  WSSESRGRDKVSADEGPLWSNSAGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
        WSSESRGRDKVSAD+GPLWSNS GKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
Subjt:  WSSESRGRDKVSADEGPLWSNSAGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK

Query:  LDNEQRVLQLKRKEMELELKRSGSAVGPILAIDRIQGREQLDLGM
        LDNEQRVLQLKRKEMELE KRS SAVGPILA DRIQGREQLDLGM
Subjt:  LDNEQRVLQLKRKEMELELKRSGSAVGPILAIDRIQGREQLDLGM

XP_004142119.1 uncharacterized protein LOC101205501 [Cucumis sativus]7.0e-24596.85Show/hide
Query:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASGCITNNYTSEEDEPSYTED
        MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMM+NFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASGCITNNYTSEEDEPSYTED
Subjt:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASGCITNNYTSEEDEPSYTED

Query:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGIGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDIIGRGTS
        GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAG+GSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDI+G+GTS
Subjt:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGIGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDIIGRGTS

Query:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQKIPGCQDVDFQGKILPVANFSKGNNESEDVEDSDSDSDSGESDNEDDHSPVENRL
        CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQ IPGCQDVDFQGKILPVANFSKGNNES   EDSDSDSDSGESDNEDDHSPVENRL
Subjt:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQKIPGCQDVDFQGKILPVANFSKGNNESEDVEDSDSDSDSGESDNEDDHSPVENRL

Query:  WSSESRGRDKVSADEGPLWSNSAGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
        WSSESRGRDKVSAD+GPLWSNS GKNEFEGQIDVFLSDPTKS WERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
Subjt:  WSSESRGRDKVSADEGPLWSNSAGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK

Query:  LDNEQRVLQLKRKEMELELKRSGSAVGPILAIDRIQGREQLDLGM
        LDNEQRVLQLKRKEMELELKRS  AVGP+LAIDRIQGREQLDLGM
Subjt:  LDNEQRVLQLKRKEMELELKRSGSAVGPILAIDRIQGREQLDLGM

XP_008449727.1 PREDICTED: uncharacterized protein LOC103491522 [Cucumis melo]8.2e-24696.63Show/hide
Query:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASGCITNNYTSEEDEPSYTED
        MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIAS CITNNYTSEEDEPSYTED
Subjt:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASGCITNNYTSEEDEPSYTED

Query:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGIGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDIIGRGTS
        GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAG+GSKRKSGILHKKGKW+TVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDI+G+GTS
Subjt:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGIGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDIIGRGTS

Query:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQKIPGCQDVDFQGKILPVANFSKGNNESEDVEDSDSDSDSGESDNEDDHSPVENRL
        CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQ IPGCQDVDFQGKILP ANFSKGNNESE+ EDSDSDSDSGESDNEDDHSP ENRL
Subjt:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQKIPGCQDVDFQGKILPVANFSKGNNESEDVEDSDSDSDSGESDNEDDHSPVENRL

Query:  WSSESRGRDKVSADEGPLWSNSAGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
        WSSESRGRDKVSAD+GPLWSNS GKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
Subjt:  WSSESRGRDKVSADEGPLWSNSAGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK

Query:  LDNEQRVLQLKRKEMELELKRSGSAVGPILAIDRIQGREQLDLGM
        LDNEQRVLQLKRKEMELE KRS SAVGPILA DRIQGREQLDLGM
Subjt:  LDNEQRVLQLKRKEMELELKRSGSAVGPILAIDRIQGREQLDLGM

XP_022957960.1 uncharacterized protein LOC111459338 [Cucurbita moschata]1.2e-20984.93Show/hide
Query:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASGCITNN-YTSEEDEPSYTE
        MDSSGLGGGFLSGNGGLLDLESPIRR Q+TQL+N SLT RH L MM+  EGDHQS+GI+D+K LG KDL M F +GKAIASG +TNN  TSEEDEPS+TE
Subjt:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASGCITNN-YTSEEDEPSYTE

Query:  DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGIGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDIIGRGT
        DGEC+EFLKGKKGSPWQRMKWTD+IVRLLIAVVACVGDDGEAG+GSKRKSGIL KKGKWK VSKIM SKGCHVSPQQCEDKFNDLNKRYKRLNDI+GRGT
Subjt:  DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGIGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDIIGRGT

Query:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQKIPGCQDVDFQGKILPVANFSKGNNESEDVEDSDSDSDSGESDNEDDHSPVENR
        SCRVVENPALMDSMPHLSSK KDDVRKILSSKHLFYKEMCAYHNGQ IPGCQDVDFQGKILPV NFSKGNNESE+ +DSDSDSD  ESDNEDDH P ENR
Subjt:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQKIPGCQDVDFQGKILPVANFSKGNNESEDVEDSDSDSDSGESDNEDDHSPVENR

Query:  LWSSESRGRDKVSADEGPLWSNSAGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERM
        LW +ESRGRDK SAD+GPLWS ++ +NEFEGQIDVFLSDPTK QWER+ WIKKQMLQLQEQC SFQAQS ELEKQRFKWLRYCSKK+RDLER RLENERM
Subjt:  LWSSESRGRDKVSADEGPLWSNSAGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERM

Query:  KLDNEQRVLQLKRKEMELELKRSGSAVGPILAIDRIQG
        K+DNE+RVLQLK+KEMELE KRS S+ GP L IDRIQG
Subjt:  KLDNEQRVLQLKRKEMELELKRSGSAVGPILAIDRIQG

XP_038901508.1 uncharacterized protein LOC120088355 [Benincasa hispida]1.5e-23190.83Show/hide
Query:  MKMDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASGCIT-NNYTSEEDEPSY
        MKMDSSGLGGGFLSGNGGL+DLESPIRRPQKTQLVNPSLT RH LNMMS FEGDH S+G +D+KSLGQKDLLMAFN+GKAIASG IT NNYTSEEDEPS+
Subjt:  MKMDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASGCIT-NNYTSEEDEPSY

Query:  TEDGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGIGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDIIGR
        TEDGEC EFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAG GSKRKSGIL KKGKWKT+SKIM SKGCHVSPQQCEDKFNDLNKRYKRLNDIIG+
Subjt:  TEDGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGIGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQKIPGCQDVDFQGKILPVANFSKGNNESEDVEDSDSDSDSGESDNEDDHSPVE
        GTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQ IPGCQDVDFQGKILPVANFSKGNNES++ EDSDSDSDSGESDNEDDHSPVE
Subjt:  GTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQKIPGCQDVDFQGKILPVANFSKGNNESEDVEDSDSDSDSGESDNEDDHSPVE

Query:  NRLWSSESRGRDKVSADEGPLWSNSAGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENE
        NRLW SESRGRDKVSAD+GPLWSNS  KNEFEG+IDVFLSDPTKSQWER+ W++KQMLQLQEQCN+FQAQSVELEKQRFKWLRYCSKKNRDLERARLENE
Subjt:  NRLWSSESRGRDKVSADEGPLWSNSAGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENE

Query:  RMKLDNEQRVLQLKRKEMELELKRSGSAVGPILAIDRIQGREQLDLG
        RMKLDNE+RVLQLK+KEMELELKR  S+ GP L IDRIQGREQLDLG
Subjt:  RMKLDNEQRVLQLKRKEMELELKRSGSAVGPILAIDRIQGREQLDLG

TrEMBL top hitse value%identityAlignment
A0A0A0KX12 Uncharacterized protein3.4e-24596.85Show/hide
Query:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASGCITNNYTSEEDEPSYTED
        MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMM+NFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASGCITNNYTSEEDEPSYTED
Subjt:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASGCITNNYTSEEDEPSYTED

Query:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGIGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDIIGRGTS
        GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAG+GSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDI+G+GTS
Subjt:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGIGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDIIGRGTS

Query:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQKIPGCQDVDFQGKILPVANFSKGNNESEDVEDSDSDSDSGESDNEDDHSPVENRL
        CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQ IPGCQDVDFQGKILPVANFSKGNNES   EDSDSDSDSGESDNEDDHSPVENRL
Subjt:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQKIPGCQDVDFQGKILPVANFSKGNNESEDVEDSDSDSDSGESDNEDDHSPVENRL

Query:  WSSESRGRDKVSADEGPLWSNSAGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
        WSSESRGRDKVSAD+GPLWSNS GKNEFEGQIDVFLSDPTKS WERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
Subjt:  WSSESRGRDKVSADEGPLWSNSAGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK

Query:  LDNEQRVLQLKRKEMELELKRSGSAVGPILAIDRIQGREQLDLGM
        LDNEQRVLQLKRKEMELELKRS  AVGP+LAIDRIQGREQLDLGM
Subjt:  LDNEQRVLQLKRKEMELELKRSGSAVGPILAIDRIQGREQLDLGM

A0A1S3BM36 uncharacterized protein LOC1034915224.0e-24696.63Show/hide
Query:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASGCITNNYTSEEDEPSYTED
        MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIAS CITNNYTSEEDEPSYTED
Subjt:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASGCITNNYTSEEDEPSYTED

Query:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGIGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDIIGRGTS
        GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAG+GSKRKSGILHKKGKW+TVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDI+G+GTS
Subjt:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGIGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDIIGRGTS

Query:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQKIPGCQDVDFQGKILPVANFSKGNNESEDVEDSDSDSDSGESDNEDDHSPVENRL
        CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQ IPGCQDVDFQGKILP ANFSKGNNESE+ EDSDSDSDSGESDNEDDHSP ENRL
Subjt:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQKIPGCQDVDFQGKILPVANFSKGNNESEDVEDSDSDSDSGESDNEDDHSPVENRL

Query:  WSSESRGRDKVSADEGPLWSNSAGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
        WSSESRGRDKVSAD+GPLWSNS GKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
Subjt:  WSSESRGRDKVSADEGPLWSNSAGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK

Query:  LDNEQRVLQLKRKEMELELKRSGSAVGPILAIDRIQGREQLDLGM
        LDNEQRVLQLKRKEMELE KRS SAVGPILA DRIQGREQLDLGM
Subjt:  LDNEQRVLQLKRKEMELELKRSGSAVGPILAIDRIQGREQLDLGM

A0A5A7TE21 Stress response protein nst1 isoform X14.0e-24696.63Show/hide
Query:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASGCITNNYTSEEDEPSYTED
        MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIAS CITNNYTSEEDEPSYTED
Subjt:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASGCITNNYTSEEDEPSYTED

Query:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGIGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDIIGRGTS
        GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAG+GSKRKSGILHKKGKW+TVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDI+G+GTS
Subjt:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGIGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDIIGRGTS

Query:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQKIPGCQDVDFQGKILPVANFSKGNNESEDVEDSDSDSDSGESDNEDDHSPVENRL
        CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQ IPGCQDVDFQGKILP ANFSKGNNESE+ EDSDSDSDSGESDNEDDHSP ENRL
Subjt:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQKIPGCQDVDFQGKILPVANFSKGNNESEDVEDSDSDSDSGESDNEDDHSPVENRL

Query:  WSSESRGRDKVSADEGPLWSNSAGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
        WSSESRGRDKVSAD+GPLWSNS GKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
Subjt:  WSSESRGRDKVSADEGPLWSNSAGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK

Query:  LDNEQRVLQLKRKEMELELKRSGSAVGPILAIDRIQGREQLDLGM
        LDNEQRVLQLKRKEMELE KRS SAVGPILA DRIQGREQLDLGM
Subjt:  LDNEQRVLQLKRKEMELELKRSGSAVGPILAIDRIQGREQLDLGM

A0A5D3BB81 Stress response protein nst1 isoform X11.8e-24696.85Show/hide
Query:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASGCITNNYTSEEDEPSYTED
        MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIAS CITNNYTSEEDEPSYTED
Subjt:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASGCITNNYTSEEDEPSYTED

Query:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGIGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDIIGRGTS
        GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAG+GSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDI+G+GTS
Subjt:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGIGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDIIGRGTS

Query:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQKIPGCQDVDFQGKILPVANFSKGNNESEDVEDSDSDSDSGESDNEDDHSPVENRL
        CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQ IPGCQDVDFQGKILP ANFSKGNNESE+ EDSDSDSDSGESDNEDDHSP ENRL
Subjt:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQKIPGCQDVDFQGKILPVANFSKGNNESEDVEDSDSDSDSGESDNEDDHSPVENRL

Query:  WSSESRGRDKVSADEGPLWSNSAGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
        WSSESRGRDKVSAD+GPLWSNS GKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
Subjt:  WSSESRGRDKVSADEGPLWSNSAGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK

Query:  LDNEQRVLQLKRKEMELELKRSGSAVGPILAIDRIQGREQLDLGM
        LDNEQRVLQLKRKEMELE KRS SAVGPILA DRIQGREQLDLGM
Subjt:  LDNEQRVLQLKRKEMELELKRSGSAVGPILAIDRIQGREQLDLGM

A0A6J1H0P0 uncharacterized protein LOC1114593386.0e-21084.93Show/hide
Query:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASGCITNN-YTSEEDEPSYTE
        MDSSGLGGGFLSGNGGLLDLESPIRR Q+TQL+N SLT RH L MM+  EGDHQS+GI+D+K LG KDL M F +GKAIASG +TNN  TSEEDEPS+TE
Subjt:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASGCITNN-YTSEEDEPSYTE

Query:  DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGIGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDIIGRGT
        DGEC+EFLKGKKGSPWQRMKWTD+IVRLLIAVVACVGDDGEAG+GSKRKSGIL KKGKWK VSKIM SKGCHVSPQQCEDKFNDLNKRYKRLNDI+GRGT
Subjt:  DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGIGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDIIGRGT

Query:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQKIPGCQDVDFQGKILPVANFSKGNNESEDVEDSDSDSDSGESDNEDDHSPVENR
        SCRVVENPALMDSMPHLSSK KDDVRKILSSKHLFYKEMCAYHNGQ IPGCQDVDFQGKILPV NFSKGNNESE+ +DSDSDSD  ESDNEDDH P ENR
Subjt:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQKIPGCQDVDFQGKILPVANFSKGNNESEDVEDSDSDSDSGESDNEDDHSPVENR

Query:  LWSSESRGRDKVSADEGPLWSNSAGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERM
        LW +ESRGRDK SAD+GPLWS ++ +NEFEGQIDVFLSDPTK QWER+ WIKKQMLQLQEQC SFQAQS ELEKQRFKWLRYCSKK+RDLER RLENERM
Subjt:  LWSSESRGRDKVSADEGPLWSNSAGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERM

Query:  KLDNEQRVLQLKRKEMELELKRSGSAVGPILAIDRIQG
        K+DNE+RVLQLK+KEMELE KRS S+ GP L IDRIQG
Subjt:  KLDNEQRVLQLKRKEMELELKRSGSAVGPILAIDRIQG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21200.1 sequence-specific DNA binding transcription factors1.7e-7646.41Show/hide
Query:  NYTSEEDEPSYTE---DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGIGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFND
        N  S++DEPS+TE   DG  +E  +  KGSPWQR+KWTD++V+LLI  V+ +GDD      S+RK  +L KKGKWK+VSK+M  +G HVSPQQCEDKFND
Subjt:  NYTSEEDEPSYTE---DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGIGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFND

Query:  LNKRYKRLNDIIGRGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQKIPGCQDVDFQGKILPVANFSKGNNES--------EDV
        LNKRYK+LND++GRGTSC+VVENPAL+DS+ +L+ K KDDVRKI+SSKHLFY+EMC+YHNG ++    D+  Q + L +A  S+ ++++        ED+
Subjt:  LNKRYKRLNDIIGRGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQKIPGCQDVDFQGKILPVANFSKGNNES--------EDV

Query:  EDSDSDSDSGESDN-EDDHSPVENRLWSSESRG-------RDKVSADEG--PLWSNSAGKNEFE-GQIDVFLSDPTKSQWE-------RKVWIKKQMLQL
        +D D D D  E D  E+ H    +   +    G       R  +S ++G  P   NS   N+    QI    +D  +   E       +K W++ + LQL
Subjt:  EDSDSDSDSGESDN-EDDHSPVENRLWSSESRG-------RDKVSADEG--PLWSNSAGKNEFE-GQIDVFLSDPTKSQWE-------RKVWIKKQMLQL

Query:  QEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMKLDNEQRVLQLKRKEMELEL
        +EQ    Q + +ELEKQRF+W R+  K++++LER R+ENERMKL+N++  L+LK++E+ +EL
Subjt:  QEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMKLDNEQRVLQLKRKEMELEL

AT1G76870.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors (TAIR:AT1G21200.1)1.1e-5940.41Show/hide
Query:  SEEDEPS-YTEDGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGIGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRY
        SE+DE    + DG+     K K+ SPWQR+KW D++V+L+I  ++ +G+D     GS +K  +L KKGKW++VSK+M  +G HVSPQQCEDKFNDLNKRY
Subjt:  SEEDEPS-YTEDGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGIGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRY

Query:  KRLNDIIGRGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQKIPGCQDVDFQGKILPVA--------NFSKGNNESEDVEDSDS
        K+LN+++GRGTSC VVENP+L+D + +L+ K KD+VR+I+SSKHLFY+EMC+YHNG ++    D   Q  +  +         N   G +++ED++D D 
Subjt:  KRLNDIIGRGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQKIPGCQDVDFQGKILPVA--------NFSKGNNESEDVEDSDS

Query:  DSDSGESDNEDDHSPVENRLWSSESRGRDKVSADEGPLWSNSAGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLR
          +  +    D       +  S E  G      D   L  + A  N           D  K+   ++  I+ + L+L+ +    QA+ +ELE+Q+FKW  
Subjt:  DSDSGESDNEDDHSPVENRLWSSESRGRDKVSADEGPLWSNSAGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLR

Query:  YCSKKNRDLERARLENERMKLDNEQRVLQLKRKEMELEL
        +  ++ + L + R+ENERMKL+NE+  L+LKR E+  +L
Subjt:  YCSKKNRDLERARLENERMKLDNEQRVLQLKRKEMELEL

AT3G10040.1 sequence-specific DNA binding transcription factors1.5e-6440.31Show/hide
Query:  NRGKAIASGCITNNYTSEEDEPSYTEDGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAG----IGSKRKS----------GILHKKGKWK
        +RG    SGC          E S   DG+       +K S W RMKWTD +VRLLI  V  +GD  EAG    + +K+K+          G+L KKGKWK
Subjt:  NRGKAIASGCITNNYTSEEDEPSYTEDGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAG----IGSKRKS----------GILHKKGKWK

Query:  TVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDIIGRGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNG------------QKI
        +VS+ M  KG  VSPQQCEDKFNDLNKRYKR+NDI+G+G +CRVVEN  L++SM HL+ K KD+V+K+L+SKHLF++EMCAYHN             Q+ 
Subjt:  TVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDIIGRGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNG------------QKI

Query:  P----------GCQDVDFQGKILPVANFSKGNNESEDVEDSDSDSDSGESDNEDDHSPVENRLWSSESRGRDKVSADEGPLWSNSAGKNEFEGQIDVFLS
        P           C      GK+  +A   +   E E     DS+S+  ES+ E+     + R+ ++  R R++ ++                      + 
Subjt:  P----------GCQDVDFQGKILPVANFSKGNNESEDVEDSDSDSDSGESDNEDDHSPVENRLWSSESRGRDKVSADEGPLWSNSAGKNEFEGQIDVFLS

Query:  DPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMKLDNEQRVLQLKRKEMEL-ELKRSGSAVGP
        D  KS WE+K WI+++ML+++E+   ++ + VE+EKQR KW+RY SKK R++E+A+L+N+R +L+ E+ +L L+R E+EL EL+ SG+ V P
Subjt:  DPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMKLDNEQRVLQLKRKEMEL-ELKRSGSAVGP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAATGGATAGTTCAGGTTTGGGAGGTGGATTTCTGTCAGGAAATGGGGGGTTATTAGATTTGGAGTCTCCTATACGACGACCTCAAAAAACCCAATTGGTTAATCC
CTCATTGACACAACGCCATCAGTTGAACATGATGAGTAATTTTGAAGGTGATCACCAGTCCATTGGGATTTTGGACTCGAAAAGCTTGGGACAGAAGGATTTATTGATGG
CGTTCAATAGAGGGAAAGCTATTGCCTCTGGTTGCATTACAAACAACTACACGAGTGAAGAAGATGAGCCAAGTTATACTGAGGATGGTGAGTGCTCTGAGTTTTTAAAG
GGCAAAAAGGGCTCTCCATGGCAGAGAATGAAGTGGACAGATGAGATTGTGAGGCTTCTCATAGCTGTGGTTGCTTGTGTAGGTGACGATGGCGAGGCTGGAATAGGATC
AAAGAGAAAATCTGGGATTTTGCATAAGAAGGGAAAATGGAAAACAGTGTCAAAGATTATGCAAAGTAAGGGGTGTCACGTTTCTCCACAGCAGTGTGAGGACAAGTTTA
ATGACCTAAACAAAAGATACAAGAGATTGAACGATATCATTGGGAGGGGAACCAGTTGTAGGGTTGTGGAGAACCCTGCCCTCATGGACTCAATGCCTCACCTCTCAAGT
AAAGCCAAGGATGATGTAAGAAAAATATTGAGCTCAAAACACTTGTTTTATAAGGAAATGTGTGCTTACCATAATGGACAAAAAATTCCTGGTTGCCAGGATGTTGATTT
CCAAGGTAAAATTTTGCCTGTTGCAAATTTCTCCAAAGGAAATAATGAATCAGAAGATGTTGAGGATAGTGATAGTGACAGTGACAGTGGTGAATCAGATAATGAAGATG
ATCACTCTCCTGTGGAAAATAGATTATGGTCGTCTGAATCTCGTGGCAGGGATAAAGTGAGTGCAGATGAGGGTCCTCTTTGGTCAAACTCTGCTGGAAAAAATGAATTT
GAAGGTCAAATTGATGTTTTTCTTTCAGATCCAACAAAATCCCAATGGGAGCGCAAAGTGTGGATTAAAAAACAGATGCTACAACTTCAGGAGCAATGTAACAGCTTCCA
GGCTCAATCTGTTGAACTCGAGAAACAACGTTTCAAATGGTTAAGATATTGCAGTAAGAAAAATAGGGATTTGGAAAGAGCGAGGCTTGAAAATGAGAGAATGAAACTAG
ATAATGAGCAGAGAGTACTGCAACTGAAGCGGAAGGAAATGGAACTAGAATTAAAAAGGTCTGGTTCAGCCGTTGGCCCAATCCTTGCCATTGATAGAATTCAAGGGAGA
GAGCAACTTGATTTGGGTATGCAATGA
mRNA sequenceShow/hide mRNA sequence
CAAAAAGTGTGGGAAAAAAAAACTTCATCGCATACGCCGACCCCTTCCACTAGTTAGTCAATTGGTCTTCTGTGCACCTCTCTTAAGAACAAAAGCGGGGAATTTTTCCT
GTACCGATTTTCATTGAAGGTCATTGAGGTAGGCCATGGAGGAGTTAATTTTCGTATGCATAGGTTTTTTTCTTGAATTTTTTGTTCATTAGATTGGTCCAAGTTGCAAG
TATAATTTATTCCAGCGATTTTTAATGGTCAACTTAGTTACAAGGATATTTTTTCCTATATTCGACGCAAAGGCTGTTTAGTTTTTGTTTTTTCTATTTGGTTCCGGTCA
GTTAAATCAGCATTTTCGAGATTCTGATGATGCACTTTATGTGTGGAATAAGAAAGTGAAGTACAGCAAAGGAAAATGGAAATTGGGTTTTCGTTTAAAATGGTTCAATG
GTGGTTGTTGATGCCAAATCTGTAACTATTACTATTTAGAACGATTTGATCAAGCTTAATGAAGTAGATTTCTTATTCATTCTCGTGAGTCCGAGTTGAATCTCGGTTTT
CTTTTAGCTGCTACAGCAAAATTGAGGGTTTCTAGGGTTTAGAGGCATGAAAATGGATAGTTCAGGTTTGGGAGGTGGATTTCTGTCAGGAAATGGGGGGTTATTAGATT
TGGAGTCTCCTATACGACGACCTCAAAAAACCCAATTGGTTAATCCCTCATTGACACAACGCCATCAGTTGAACATGATGAGTAATTTTGAAGGTGATCACCAGTCCATT
GGGATTTTGGACTCGAAAAGCTTGGGACAGAAGGATTTATTGATGGCGTTCAATAGAGGGAAAGCTATTGCCTCTGGTTGCATTACAAACAACTACACGAGTGAAGAAGA
TGAGCCAAGTTATACTGAGGATGGTGAGTGCTCTGAGTTTTTAAAGGGCAAAAAGGGCTCTCCATGGCAGAGAATGAAGTGGACAGATGAGATTGTGAGGCTTCTCATAG
CTGTGGTTGCTTGTGTAGGTGACGATGGCGAGGCTGGAATAGGATCAAAGAGAAAATCTGGGATTTTGCATAAGAAGGGAAAATGGAAAACAGTGTCAAAGATTATGCAA
AGTAAGGGGTGTCACGTTTCTCCACAGCAGTGTGAGGACAAGTTTAATGACCTAAACAAAAGATACAAGAGATTGAACGATATCATTGGGAGGGGAACCAGTTGTAGGGT
TGTGGAGAACCCTGCCCTCATGGACTCAATGCCTCACCTCTCAAGTAAAGCCAAGGATGATGTAAGAAAAATATTGAGCTCAAAACACTTGTTTTATAAGGAAATGTGTG
CTTACCATAATGGACAAAAAATTCCTGGTTGCCAGGATGTTGATTTCCAAGGTAAAATTTTGCCTGTTGCAAATTTCTCCAAAGGAAATAATGAATCAGAAGATGTTGAG
GATAGTGATAGTGACAGTGACAGTGGTGAATCAGATAATGAAGATGATCACTCTCCTGTGGAAAATAGATTATGGTCGTCTGAATCTCGTGGCAGGGATAAAGTGAGTGC
AGATGAGGGTCCTCTTTGGTCAAACTCTGCTGGAAAAAATGAATTTGAAGGTCAAATTGATGTTTTTCTTTCAGATCCAACAAAATCCCAATGGGAGCGCAAAGTGTGGA
TTAAAAAACAGATGCTACAACTTCAGGAGCAATGTAACAGCTTCCAGGCTCAATCTGTTGAACTCGAGAAACAACGTTTCAAATGGTTAAGATATTGCAGTAAGAAAAAT
AGGGATTTGGAAAGAGCGAGGCTTGAAAATGAGAGAATGAAACTAGATAATGAGCAGAGAGTACTGCAACTGAAGCGGAAGGAAATGGAACTAGAATTAAAAAGGTCTGG
TTCAGCCGTTGGCCCAATCCTTGCCATTGATAGAATTCAAGGGAGAGAGCAACTTGATTTGGGTATGCAATGAAAGCACACGAAGAACTGAGAAAACCATCTATATTTCA
GAGCTGTTGGTTTGCATGAGATTTTGCTGATCATTGTGGAACAAATCCTTTAAGCCGAATGCATGTTATCATGCTAACCTTATCACGGACCTTATTAAAGCTATCAACAA
GAGAAGACTCCCTTTGAAAAGTGCTTGGACGACAAAATCATGGCCATGGATAAATCTCCACTCTTCTTTCAGTGGTTTCGGTTTTGCTGAGAAAAATATCTAGTTCTTCT
ATAGAACCTACCCTCACCTCTATTATTTGATACTTTATTGAGTGTATTTTATGGATTCCATACAGTTCTTAAATACTACCTTCGTGGCAAATAATGAATGGAGGTAGTAT
TTTCATACTACGCAATGATAAGATTTAGGAAGTTTTTCTGTCCACATCCGCTTTGTTATCGGAAAAAAGCTCTCAAAATTATGAGCTTTTAAGTGTTATTATTAACTTTT
GACCTGAATTTCCCAATATATACCATTCTGTGAAACATATTGATGAGCTGAA
Protein sequenceShow/hide protein sequence
MKMDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASGCITNNYTSEEDEPSYTEDGECSEFLK
GKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGIGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDIIGRGTSCRVVENPALMDSMPHLSS
KAKDDVRKILSSKHLFYKEMCAYHNGQKIPGCQDVDFQGKILPVANFSKGNNESEDVEDSDSDSDSGESDNEDDHSPVENRLWSSESRGRDKVSADEGPLWSNSAGKNEF
EGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMKLDNEQRVLQLKRKEMELELKRSGSAVGPILAIDRIQGR
EQLDLGMQ