; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0025924 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0025924
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionSequence-specific DNA binding transcription factors
Genome locationchr05:981590..985400
RNA-Seq ExpressionIVF0025924
SyntenyIVF0025924
Gene Ontology termsNA
InterPro domainsIPR044822 - Myb/SANT-like DNA-binding domain 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYJ96146.1 stress response protein nst1 isoform X1 [Cucumis melo var. makuwa]0.0100Show/hide
Query:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTED
        MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTED
Subjt:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTED

Query:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS
        GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS
Subjt:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS

Query:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL
        CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL
Subjt:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL

Query:  WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
        WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
Subjt:  WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK

Query:  LDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH
        LDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH
Subjt:  LDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH

XP_004142119.1 uncharacterized protein LOC101205501 [Cucumis sativus]1.05e-31297.31Show/hide
Query:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTED
        MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMM+NFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIAS CITNNYTSEEDEPSYTED
Subjt:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTED

Query:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS
        GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS
Subjt:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS

Query:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL
        CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILP ANFSKGNNESE   DSDSDSDSGESDNEDDHSP ENRL
Subjt:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL

Query:  WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
        WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKS WERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
Subjt:  WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK

Query:  LDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH
        LDNEQRVLQLKRKEMELE KRSD AVGP+LA DRIQGREQLDLGMH
Subjt:  LDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH

XP_008449727.1 PREDICTED: uncharacterized protein LOC103491522 [Cucumis melo]0.099.78Show/hide
Query:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTED
        MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTED
Subjt:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTED

Query:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS
        GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKW+TVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS
Subjt:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS

Query:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL
        CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL
Subjt:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL

Query:  WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
        WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
Subjt:  WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK

Query:  LDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH
        LDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH
Subjt:  LDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH

XP_022957960.1 uncharacterized protein LOC111459338 [Cucurbita moschata]6.80e-26685.62Show/hide
Query:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNY-TSEEDEPSYTE
        MDSSGLGGGFLSGNGGLLDLESPIRR Q+TQL+N SLT RH L MM+  EGDHQS+GI+D+K LG KDL M F +GKAIAS  +TNN  TSEEDEPS+TE
Subjt:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNY-TSEEDEPSYTE

Query:  DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGT
        DGEC+EFLKGKKGSPWQRMKWTD+IVRLLIAVVACVGDDGEAGMGSKRKSGIL KKGKWK VSKIM SKGCHVSPQQCEDKFNDLNKRYKRLNDILG+GT
Subjt:  DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGT

Query:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENR
        SCRVVENPALMDSMPHLSSK KDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILP  NFSKGNNESEEA+DSDSDSD  ESDNEDDH P ENR
Subjt:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENR

Query:  LWSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERM
        LW +ESRGRDK SADDGPLWS +  +NEFEGQIDVFLSDPTK QWER+ WIKKQMLQLQEQC SFQAQS ELEKQRFKWLRYCSKK+RDLER RLENERM
Subjt:  LWSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERM

Query:  KLDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQG
        K+DNE+RVLQLK+KEMELE KRSDS+ GP L  DRIQG
Subjt:  KLDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQG

XP_038901508.1 uncharacterized protein LOC120088355 [Benincasa hispida]4.23e-29390.83Show/hide
Query:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNN-YTSEEDEPSYTE
        MDSSGLGGGFLSGNGGL+DLESPIRRPQKTQLVNPSLT RH LNMMS FEGDH S+G +D+KSLGQKDLLMAFN+GKAIAS  ITNN YTSEEDEPS+TE
Subjt:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNN-YTSEEDEPSYTE

Query:  DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGT
        DGEC EFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAG GSKRKSGIL KKGKWKT+SKIM SKGCHVSPQQCEDKFNDLNKRYKRLNDI+GKGT
Subjt:  DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGT

Query:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENR
        SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILP ANFSKGNNES+EAEDSDSDSDSGESDNEDDHSP ENR
Subjt:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENR

Query:  LWSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERM
        LW SESRGRDKVSADDGPLWSNSV KNEFEG+IDVFLSDPTKSQWER+ W++KQMLQLQEQCN+FQAQSVELEKQRFKWLRYCSKKNRDLERARLENERM
Subjt:  LWSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERM

Query:  KLDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH
        KLDNE+RVLQLK+KEMELE KR DS+ GP L  DRIQGREQLDLG H
Subjt:  KLDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH

TrEMBL top hitse value%identityAlignment
A0A0A0KX12 Uncharacterized protein2.3e-24697.31Show/hide
Query:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTED
        MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMM+NFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIAS CITNNYTSEEDEPSYTED
Subjt:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTED

Query:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS
        GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS
Subjt:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS

Query:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL
        CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILP ANFSKGNNES   EDSDSDSDSGESDNEDDHSP ENRL
Subjt:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL

Query:  WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
        WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKS WERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
Subjt:  WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK

Query:  LDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH
        LDNEQRVLQLKRKEMELE KRSD AVGP+LA DRIQGREQLDLGMH
Subjt:  LDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH

A0A1S3BM36 uncharacterized protein LOC1034915221.2e-25399.78Show/hide
Query:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTED
        MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTED
Subjt:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTED

Query:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS
        GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKW+TVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS
Subjt:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS

Query:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL
        CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL
Subjt:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL

Query:  WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
        WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
Subjt:  WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK

Query:  LDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH
        LDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH
Subjt:  LDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH

A0A5A7TE21 Stress response protein nst1 isoform X11.2e-25399.78Show/hide
Query:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTED
        MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTED
Subjt:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTED

Query:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS
        GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKW+TVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS
Subjt:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS

Query:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL
        CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL
Subjt:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL

Query:  WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
        WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
Subjt:  WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK

Query:  LDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH
        LDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH
Subjt:  LDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH

A0A5D3BB81 Stress response protein nst1 isoform X15.2e-254100Show/hide
Query:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTED
        MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTED
Subjt:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTED

Query:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS
        GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS
Subjt:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS

Query:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL
        CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL
Subjt:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL

Query:  WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
        WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
Subjt:  WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK

Query:  LDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH
        LDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH
Subjt:  LDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH

A0A6J1H0P0 uncharacterized protein LOC1114593382.1e-21085.62Show/hide
Query:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNN-YTSEEDEPSYTE
        MDSSGLGGGFLSGNGGLLDLESPIRR Q+TQL+N SLT RH L MM+  EGDHQS+GI+D+K LG KDL M F +GKAIAS  +TNN  TSEEDEPS+TE
Subjt:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNN-YTSEEDEPSYTE

Query:  DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGT
        DGEC+EFLKGKKGSPWQRMKWTD+IVRLLIAVVACVGDDGEAGMGSKRKSGIL KKGKWK VSKIM SKGCHVSPQQCEDKFNDLNKRYKRLNDILG+GT
Subjt:  DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGT

Query:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENR
        SCRVVENPALMDSMPHLSSK KDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILP  NFSKGNNESEEA+DSDSDSD  ESDNEDDH P ENR
Subjt:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENR

Query:  LWSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERM
        LW +ESRGRDK SADDGPLWS +  +NEFEGQIDVFLSDPTK QWER+ WIKKQMLQLQEQC SFQAQS ELEKQRFKWLRYCSKK+RDLER RLENERM
Subjt:  LWSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERM

Query:  KLDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQG
        K+DNE+RVLQLK+KEMELE KRSDS+ GP L  DRIQG
Subjt:  KLDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21200.1 sequence-specific DNA binding transcription factors1.1e-7545.83Show/hide
Query:  NYTSEEDEPSYTE---DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFND
        N  S++DEPS+TE   DG  +E  +  KGSPWQR+KWTD++V+LLI  V+ +GDD      S+RK  +L KKGKWK+VSK+M  +G HVSPQQCEDKFND
Subjt:  NYTSEEDEPSYTE---DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFND

Query:  LNKRYKRLNDILGKGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAA-------NFSKGNNESEEAE
        LNKRYK+LND+LG+GTSC+VVENPAL+DS+ +L+ K KDDVRKI+SSKHLFY+EMC+YHNG  +    D+  Q  +  A        N     ++ E+ +
Subjt:  LNKRYKRLNDILGKGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAA-------NFSKGNNESEEAE

Query:  DSDSDSDSGESDNEDDHSPAENRLWSSESRG--------RDKVSADDG--PLWSNSVGKNEFE-GQIDVFLSDPTKSQWE-------RKVWIKKQMLQLQ
        D D D D  E D  ++   A      +   G        R  +S +DG  P   NS+  N+    QI    +D  +   E       +K W++ + LQL+
Subjt:  DSDSDSDSGESDNEDDHSPAENRLWSSESRG--------RDKVSADDG--PLWSNSVGKNEFE-GQIDVFLSDPTKSQWE-------RKVWIKKQMLQLQ

Query:  EQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMKLDNEQRVLQLKRKEMELE
        EQ    Q + +ELEKQRF+W R+  K++++LER R+ENERMKL+N++  L+LK++E+ +E
Subjt:  EQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMKLDNEQRVLQLKRKEMELE

AT1G76870.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors (TAIR:AT1G21200.1)9.5e-5940.66Show/hide
Query:  SEEDEPS-YTEDGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRY
        SE+DE    + DG+     K K+ SPWQR+KW D++V+L+I  ++ +G+D     GS +K  +L KKGKW++VSK+M  +G HVSPQQCEDKFNDLNKRY
Subjt:  SEEDEPS-YTEDGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRY

Query:  KRLNDILGKGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESD
        K+LN++LG+GTSC VVENP+L+D + +L+ K KD+VR+I+SSKHLFY+EMC+YHNG  +    D   Q  +      S+ +++++E     ++    + D
Subjt:  KRLNDILGKGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESD

Query:  NEDDHSPAEN-----RLWSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCS
         E+DH  A +     RL  S+S   + V   +       + +++ +    + L D  K+   ++  I+ + L+L+ +    QA+ +ELE+Q+FKW  +  
Subjt:  NEDDHSPAEN-----RLWSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCS

Query:  KKNRDLERARLENERMKLDNEQRVLQLKRKEM
        ++ + L + R+ENERMKL+NE+  L+LKR E+
Subjt:  KKNRDLERARLENERMKLDNEQRVLQLKRKEM

AT3G10040.1 sequence-specific DNA binding transcription factors3.2e-6240.05Show/hide
Query:  NRGKAIASACITNNYTSEEDEPSYTEDGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMG----SKRKS----------GILHKKGKWK
        +RG    S C          E S   DG+       +K S W RMKWTD +VRLLI  V  +GD  EAG+     +K+K+          G+L KKGKWK
Subjt:  NRGKAIASACITNNYTSEEDEPSYTEDGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMG----SKRKS----------GILHKKGKWK

Query:  TVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHN----------------
        +VS+ M  KG  VSPQQCEDKFNDLNKRYKR+NDILGKG +CRVVEN  L++SM HL+ K KD+V+K+L+SKHLF++EMCAYHN                
Subjt:  TVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHN----------------

Query:  --GQTIPGCQDVDFQ----GKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRLWSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLS
             IP  Q   F     GK+   A   +   E E   D   DS+S   ++E++ +  + R+ ++  R R++ ++                      + 
Subjt:  --GQTIPGCQDVDFQ----GKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRLWSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLS

Query:  DPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMKLDNEQRVLQLKRKEMEL-ESKRSDSAVGP
        D  KS WE+K WI+++ML+++E+   ++ + VE+EKQR KW+RY SKK R++E+A+L+N+R +L+ E+ +L L+R E+EL E + S + V P
Subjt:  DPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMKLDNEQRVLQLKRKEMEL-ESKRSDSAVGP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATAGTTCAGGTTTGGGAGGTGGATTTCTGTCAGGAAATGGGGGGTTATTAGATTTAGAGTCTCCTATACGACGACCTCAAAAAACCCAATTGGTTAATCCCTCATT
GACACAACGCCATCAGTTGAACATGATGAGTAATTTTGAAGGTGATCACCAGTCCATTGGGATTTTGGACTCGAAAAGCTTGGGACAGAAGGATTTATTGATGGCGTTTA
ATAGAGGGAAAGCTATTGCCTCTGCTTGCATTACAAACAACTACACGAGTGAAGAAGATGAGCCAAGTTATACCGAGGATGGTGAGTGCTCTGAGTTTTTAAAGGGCAAA
AAGGGCTCTCCATGGCAAAGAATGAAGTGGACAGATGAGATTGTGAGGCTTCTCATAGCAGTGGTTGCTTGTGTAGGTGACGATGGCGAGGCTGGAATGGGATCGAAGAG
AAAATCTGGGATTTTGCATAAGAAGGGAAAATGGAAAACAGTGTCAAAGATTATGCAAAGTAAGGGGTGTCATGTTTCTCCACAGCAGTGTGAGGACAAGTTTAATGACT
TAAACAAAAGATACAAGAGATTGAACGATATCCTTGGGAAGGGAACCAGTTGTAGGGTTGTGGAGAACCCTGCACTCATGGACTCAATGCCTCACCTCTCAAGTAAAGCC
AAGGATGATGTAAGAAAAATATTGAGCTCAAAACACTTGTTTTATAAGGAAATGTGTGCTTACCATAATGGACAAACAATTCCTGGTTGCCAAGATGTTGATTTCCAAGG
TAAAATTTTGCCTGCTGCAAATTTCTCCAAAGGAAATAATGAATCAGAAGAAGCTGAGGATAGTGATAGTGATAGTGACAGTGGTGAATCAGATAATGAAGATGATCACT
CTCCTGCGGAAAATAGATTATGGTCGTCTGAATCTCGTGGCAGGGATAAAGTGAGTGCAGATGATGGTCCTCTTTGGTCAAATTCTGTTGGAAAAAATGAATTTGAAGGT
CAAATTGATGTCTTTCTTTCAGATCCAACAAAGTCCCAATGGGAGCGCAAAGTGTGGATTAAAAAACAGATGCTACAACTTCAGGAGCAATGTAACAGCTTCCAGGCTCA
ATCTGTTGAACTTGAGAAACAACGTTTCAAATGGTTAAGATATTGCAGTAAGAAAAATAGGGATTTGGAAAGAGCGAGGCTTGAAAATGAGAGGATGAAACTAGATAATG
AGCAGAGAGTACTGCAACTGAAGCGGAAGGAAATGGAACTAGAATCGAAAAGGTCTGATTCAGCCGTTGGCCCAATCCTTGCCTTCGATAGAATTCAAGGGAGAGAGCAA
CTTGATTTGGGTATGCATTGA
mRNA sequenceShow/hide mRNA sequence
AATATGATGAAGAAAAAGGAAGGGAGAAAAAAAAAAAAGAATTACTTCATCGCATACGCCGACCCCTTCCACTAGTTAGTCAATTGGTCTTCTGTGTACCTCGCTCAAGA
ACAAAAGTGGGAATTTTTTCCTGTACCGAGGTAGGTCATGGAGGAGCTAATTTTCGTCTGCATAGGTTTTCTTTCTTGAATTTTTTGTTCATTAGGTTGGTCCAAGTTGC
AAATATGATTTATTCCAGCGATTTTTAATGGTCAACTTAGTTAAAAGGATATTTTTTTCCTATATTCGACGCAAAGGCTGTGTAGTTTTTGTTTTTTGTTTTTTTTTTTT
TTTTTTTTTCTATTTGGTTCCGGTCAGTTAAATCAGCATTTCGGAGATTTTGATGATGCACTTGCACTTTATGTGTGGAATAAGAAAGTGAAGTACAATGATGGAAAATG
GAAATTGGGTTTTCGTTTAAACTGGTTCAATGGTGGTTGCTGATGCCAAATCTGTAACTATTACTATTTAGCACGATTTGATCAAGCTTGGTGAAGCAGAGTTCTTATTC
ATTCTCATGAGTCCGAGTTGAATCTCGGTTTTCTTTTAGGTGCTACAGCAAACTTGAGGGTTTCTAGGGTTTAGAGGCACGAAAATGGATAGTTCAGGTTTGGGAGGTGG
ATTTCTGTCAGGAAATGGGGGGTTATTAGATTTAGAGTCTCCTATACGACGACCTCAAAAAACCCAATTGGTTAATCCCTCATTGACACAACGCCATCAGTTGAACATGA
TGAGTAATTTTGAAGGTGATCACCAGTCCATTGGGATTTTGGACTCGAAAAGCTTGGGACAGAAGGATTTATTGATGGCGTTTAATAGAGGGAAAGCTATTGCCTCTGCT
TGCATTACAAACAACTACACGAGTGAAGAAGATGAGCCAAGTTATACCGAGGATGGTGAGTGCTCTGAGTTTTTAAAGGGCAAAAAGGGCTCTCCATGGCAAAGAATGAA
GTGGACAGATGAGATTGTGAGGCTTCTCATAGCAGTGGTTGCTTGTGTAGGTGACGATGGCGAGGCTGGAATGGGATCGAAGAGAAAATCTGGGATTTTGCATAAGAAGG
GAAAATGGAAAACAGTGTCAAAGATTATGCAAAGTAAGGGGTGTCATGTTTCTCCACAGCAGTGTGAGGACAAGTTTAATGACTTAAACAAAAGATACAAGAGATTGAAC
GATATCCTTGGGAAGGGAACCAGTTGTAGGGTTGTGGAGAACCCTGCACTCATGGACTCAATGCCTCACCTCTCAAGTAAAGCCAAGGATGATGTAAGAAAAATATTGAG
CTCAAAACACTTGTTTTATAAGGAAATGTGTGCTTACCATAATGGACAAACAATTCCTGGTTGCCAAGATGTTGATTTCCAAGGTAAAATTTTGCCTGCTGCAAATTTCT
CCAAAGGAAATAATGAATCAGAAGAAGCTGAGGATAGTGATAGTGATAGTGACAGTGGTGAATCAGATAATGAAGATGATCACTCTCCTGCGGAAAATAGATTATGGTCG
TCTGAATCTCGTGGCAGGGATAAAGTGAGTGCAGATGATGGTCCTCTTTGGTCAAATTCTGTTGGAAAAAATGAATTTGAAGGTCAAATTGATGTCTTTCTTTCAGATCC
AACAAAGTCCCAATGGGAGCGCAAAGTGTGGATTAAAAAACAGATGCTACAACTTCAGGAGCAATGTAACAGCTTCCAGGCTCAATCTGTTGAACTTGAGAAACAACGTT
TCAAATGGTTAAGATATTGCAGTAAGAAAAATAGGGATTTGGAAAGAGCGAGGCTTGAAAATGAGAGGATGAAACTAGATAATGAGCAGAGAGTACTGCAACTGAAGCGG
AAGGAAATGGAACTAGAATCGAAAAGGTCTGATTCAGCCGTTGGCCCAATCCTTGCCTTCGATAGAATTCAAGGGAGAGAGCAACTTGATTTGGGTATGCATTGAAAGCA
GACGAAGAACTGAGAAAACCATCTATATTTCAGAGCTGTTGGTTTGCATGAGATTTTGTCTATCATTGTGGAACGAATCCTTTAAGCCAAATGCTTGTTATCATGCTAAC
CTTGTAAGCATGTGTATACTTCTATTTCACTTTAGGCTGACCCTTCTTGTTGGCCCTTGGAACTTTCTGCTGTAAAGCTCAATGATTAATGTTTAAGTGGCTTGGTTTTC
ATTTTCTACAAAATTTCTTCTTTCCTTTTTCAAGGATATAAATTTGTGTGACTATATGGATTTATGTTAACATGCGAGTCTGGTGAACTGCTTGTTCTGAGAATAACGTC
AAATCTCTTGCTCCCTTCATGGACATCCGAGATTCTCAGTTTCGGAGATCACAGACCTTATTAAAGCTATCAACAAGAGAAGACTCCCTTTGAAAAGTGCTTGGACGACA
AAATCATGGCCATGGATAAACTCTCCACTCTTCTTCCACTGGTACCGGTTTTGCTGAGAAAATATCTACTTCTGCCGTAGTACCTACCCTCTCCTTCATTATTTGATACA
TCATTAAGTGTATTTTATGGACCCATACAATTCTTAAAATATTATCTCTGTGATAAATAATGAATGGAGGTAGTATTTTCATACTATGCAATGTTAGAAATTTTTCCTGT
CTTTCTCGGATGCTCATCTGCTATGCTATCAGAATAAAGCTCTCAAATTATGAGCTTTTAAGTGTTATTATTAACTTTTGAC
Protein sequenceShow/hide protein sequence
MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTEDGECSEFLKGK
KGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWKTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTSCRVVENPALMDSMPHLSSKA
KDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRLWSSESRGRDKVSADDGPLWSNSVGKNEFEG
QIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMKLDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQ
LDLGMH