; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0014800 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0014800
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionSequence-specific DNA binding transcription factors
Genome locationchr05:4848521..4852510
RNA-Seq ExpressionPay0014800
SyntenyPay0014800
Gene Ontology termsNA
InterPro domainsIPR044822 - Myb/SANT-like DNA-binding domain 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYJ96146.1 stress response protein nst1 isoform X1 [Cucumis melo var. makuwa]1.8e-25399.78Show/hide
Query:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTED
        MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTED
Subjt:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTED

Query:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWRTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS
        GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKW+TVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS
Subjt:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWRTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS

Query:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL
        CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL
Subjt:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL

Query:  WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
        WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
Subjt:  WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK

Query:  LDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH
        LDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH
Subjt:  LDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH

XP_004142119.1 uncharacterized protein LOC101205501 [Cucumis sativus]8.2e-24697.09Show/hide
Query:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTED
        MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMM+NFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIAS CITNNYTSEEDEPSYTED
Subjt:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTED

Query:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWRTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS
        GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKW+TVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS
Subjt:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWRTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS

Query:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL
        CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILP ANFSKGNNES   EDSDSDSDSGESDNEDDHSP ENRL
Subjt:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL

Query:  WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
        WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKS WERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
Subjt:  WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK

Query:  LDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH
        LDNEQRVLQLKRKEMELE KRSD AVGP+LA DRIQGREQLDLGMH
Subjt:  LDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH

XP_008449727.1 PREDICTED: uncharacterized protein LOC103491522 [Cucumis melo]8.2e-254100Show/hide
Query:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTED
        MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTED
Subjt:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTED

Query:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWRTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS
        GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWRTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS
Subjt:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWRTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS

Query:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL
        CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL
Subjt:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL

Query:  WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
        WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
Subjt:  WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK

Query:  LDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH
        LDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH
Subjt:  LDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH

XP_022957960.1 uncharacterized protein LOC111459338 [Cucurbita moschata]7.3e-21085.39Show/hide
Query:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNN-YTSEEDEPSYTE
        MDSSGLGGGFLSGNGGLLDLESPIRR Q+TQL+N SLT RH L MM+  EGDHQS+GI+D+K LG KDL M F +GKAIAS  +TNN  TSEEDEPS+TE
Subjt:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNN-YTSEEDEPSYTE

Query:  DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWRTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGT
        DGEC+EFLKGKKGSPWQRMKWTD+IVRLLIAVVACVGDDGEAGMGSKRKSGIL KKGKW+ VSKIM SKGCHVSPQQCEDKFNDLNKRYKRLNDILG+GT
Subjt:  DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWRTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGT

Query:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENR
        SCRVVENPALMDSMPHLSSK KDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILP  NFSKGNNESEEA+DSDSDSD  ESDNEDDH P ENR
Subjt:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENR

Query:  LWSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERM
        LW +ESRGRDK SADDGPLWS +  +NEFEGQIDVFLSDPTK QWER+ WIKKQMLQLQEQC SFQAQS ELEKQRFKWLRYCSKK+RDLER RLENERM
Subjt:  LWSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERM

Query:  KLDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQG
        K+DNE+RVLQLK+KEMELE KRSDS+ GP L  DRIQG
Subjt:  KLDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQG

XP_038901508.1 uncharacterized protein LOC120088355 [Benincasa hispida]5.7e-23190.6Show/hide
Query:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACIT-NNYTSEEDEPSYTE
        MDSSGLGGGFLSGNGGL+DLESPIRRPQKTQLVNPSLT RH LNMMS FEGDH S+G +D+KSLGQKDLLMAFN+GKAIAS  IT NNYTSEEDEPS+TE
Subjt:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACIT-NNYTSEEDEPSYTE

Query:  DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWRTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGT
        DGEC EFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAG GSKRKSGIL KKGKW+T+SKIM SKGCHVSPQQCEDKFNDLNKRYKRLNDI+GKGT
Subjt:  DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWRTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGT

Query:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENR
        SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILP ANFSKGNNES+EAEDSDSDSDSGESDNEDDHSP ENR
Subjt:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENR

Query:  LWSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERM
        LW SESRGRDKVSADDGPLWSNSV KNEFEG+IDVFLSDPTKSQWER+ W++KQMLQLQEQCN+FQAQSVELEKQRFKWLRYCSKKNRDLERARLENERM
Subjt:  LWSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERM

Query:  KLDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH
        KLDNE+RVLQLK+KEMELE KR DS+ GP L  DRIQGREQLDLG H
Subjt:  KLDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH

TrEMBL top hitse value%identityAlignment
A0A0A0KX12 Uncharacterized protein4.0e-24697.09Show/hide
Query:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTED
        MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMM+NFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIAS CITNNYTSEEDEPSYTED
Subjt:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTED

Query:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWRTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS
        GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKW+TVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS
Subjt:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWRTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS

Query:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL
        CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILP ANFSKGNNES   EDSDSDSDSGESDNEDDHSP ENRL
Subjt:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL

Query:  WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
        WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKS WERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
Subjt:  WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK

Query:  LDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH
        LDNEQRVLQLKRKEMELE KRSD AVGP+LA DRIQGREQLDLGMH
Subjt:  LDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH

A0A1S3BM36 uncharacterized protein LOC1034915224.0e-254100Show/hide
Query:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTED
        MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTED
Subjt:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTED

Query:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWRTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS
        GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWRTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS
Subjt:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWRTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS

Query:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL
        CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL
Subjt:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL

Query:  WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
        WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
Subjt:  WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK

Query:  LDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH
        LDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH
Subjt:  LDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH

A0A5A7TE21 Stress response protein nst1 isoform X14.0e-254100Show/hide
Query:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTED
        MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTED
Subjt:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTED

Query:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWRTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS
        GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWRTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS
Subjt:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWRTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS

Query:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL
        CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL
Subjt:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL

Query:  WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
        WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
Subjt:  WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK

Query:  LDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH
        LDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH
Subjt:  LDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH

A0A5D3BB81 Stress response protein nst1 isoform X18.8e-25499.78Show/hide
Query:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTED
        MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTED
Subjt:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTED

Query:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWRTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS
        GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKW+TVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS
Subjt:  GECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWRTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTS

Query:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL
        CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL
Subjt:  CRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRL

Query:  WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
        WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK
Subjt:  WSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMK

Query:  LDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH
        LDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH
Subjt:  LDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQLDLGMH

A0A6J1H0P0 uncharacterized protein LOC1114593383.5e-21085.39Show/hide
Query:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNN-YTSEEDEPSYTE
        MDSSGLGGGFLSGNGGLLDLESPIRR Q+TQL+N SLT RH L MM+  EGDHQS+GI+D+K LG KDL M F +GKAIAS  +TNN  TSEEDEPS+TE
Subjt:  MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNN-YTSEEDEPSYTE

Query:  DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWRTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGT
        DGEC+EFLKGKKGSPWQRMKWTD+IVRLLIAVVACVGDDGEAGMGSKRKSGIL KKGKW+ VSKIM SKGCHVSPQQCEDKFNDLNKRYKRLNDILG+GT
Subjt:  DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWRTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGT

Query:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENR
        SCRVVENPALMDSMPHLSSK KDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILP  NFSKGNNESEEA+DSDSDSD  ESDNEDDH P ENR
Subjt:  SCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENR

Query:  LWSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERM
        LW +ESRGRDK SADDGPLWS +  +NEFEGQIDVFLSDPTK QWER+ WIKKQMLQLQEQC SFQAQS ELEKQRFKWLRYCSKK+RDLER RLENERM
Subjt:  LWSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERM

Query:  KLDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQG
        K+DNE+RVLQLK+KEMELE KRSDS+ GP L  DRIQG
Subjt:  KLDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21200.1 sequence-specific DNA binding transcription factors1.9e-7545.56Show/hide
Query:  NYTSEEDEPSYTE---DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWRTVSKIMQSKGCHVSPQQCEDKFND
        N  S++DEPS+TE   DG  +E  +  KGSPWQR+KWTD++V+LLI  V+ +GDD      S+RK  +L KKGKW++VSK+M  +G HVSPQQCEDKFND
Subjt:  NYTSEEDEPSYTE---DGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWRTVSKIMQSKGCHVSPQQCEDKFND

Query:  LNKRYKRLNDILGKGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAA-------NFSKGNNESEEAE
        LNKRYK+LND+LG+GTSC+VVENPAL+DS+ +L+ K KDDVRKI+SSKHLFY+EMC+YHNG  +    D+  Q  +  A        N     ++ E+ +
Subjt:  LNKRYKRLNDILGKGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAA-------NFSKGNNESEEAE

Query:  DSDSDSDSGESDNEDDHSPAENRLWSSESRG--------RDKVSADDG--PLWSNSVGKNEFE-GQIDVFLSDPTKSQWE-------RKVWIKKQMLQLQ
        D D D D  E D  ++   A      +   G        R  +S +DG  P   NS+  N+    QI    +D  +   E       +K W++ + LQL+
Subjt:  DSDSDSDSGESDNEDDHSPAENRLWSSESRG--------RDKVSADDG--PLWSNSVGKNEFE-GQIDVFLSDPTKSQWE-------RKVWIKKQMLQLQ

Query:  EQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMKLDNEQRVLQLKRKEMELE
        EQ    Q + +ELEKQRF+W R+  K++++LER R+ENERMKL+N++  L+LK++E+ +E
Subjt:  EQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMKLDNEQRVLQLKRKEMELE

AT1G76870.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors (TAIR:AT1G21200.1)3.3e-5940.96Show/hide
Query:  SEEDEPS-YTEDGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWRTVSKIMQSKGCHVSPQQCEDKFNDLNKRY
        SE+DE    + DG+     K K+ SPWQR+KW D++V+L+I  ++ +G+D     GS +K  +L KKGKWR+VSK+M  +G HVSPQQCEDKFNDLNKRY
Subjt:  SEEDEPS-YTEDGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWRTVSKIMQSKGCHVSPQQCEDKFNDLNKRY

Query:  KRLNDILGKGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESD
        K+LN++LG+GTSC VVENP+L+D + +L+ K KD+VR+I+SSKHLFY+EMC+YHNG  +    D   Q  +      S+ +++++E     ++    + D
Subjt:  KRLNDILGKGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESD

Query:  NEDDHSPAEN-----RLWSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCS
         E+DH  A +     RL  S+S   + V   +       + +++ +    + L D  K+   ++  I+ + L+L+ +    QA+ +ELE+Q+FKW  +  
Subjt:  NEDDHSPAEN-----RLWSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCS

Query:  KKNRDLERARLENERMKLDNEQRVLQLKRKEM
        ++ + L + R+ENERMKL+NE+  L+LKR E+
Subjt:  KKNRDLERARLENERMKLDNEQRVLQLKRKEM

AT3G10040.1 sequence-specific DNA binding transcription factors5.4e-6239.8Show/hide
Query:  NRGKAIASACITNNYTSEEDEPSYTEDGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMG----SKRKS----------GILHKKGKWR
        +RG    S C          E S   DG+       +K S W RMKWTD +VRLLI  V  +GD  EAG+     +K+K+          G+L KKGKW+
Subjt:  NRGKAIASACITNNYTSEEDEPSYTEDGECSEFLKGKKGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMG----SKRKS----------GILHKKGKWR

Query:  TVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHN----------------
        +VS+ M  KG  VSPQQCEDKFNDLNKRYKR+NDILGKG +CRVVEN  L++SM HL+ K KD+V+K+L+SKHLF++EMCAYHN                
Subjt:  TVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTSCRVVENPALMDSMPHLSSKAKDDVRKILSSKHLFYKEMCAYHN----------------

Query:  --GQTIPGCQDVDFQ----GKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRLWSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLS
             IP  Q   F     GK+   A   +   E E   D   DS+S   ++E++ +  + R+ ++  R R++ ++                      + 
Subjt:  --GQTIPGCQDVDFQ----GKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRLWSSESRGRDKVSADDGPLWSNSVGKNEFEGQIDVFLS

Query:  DPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMKLDNEQRVLQLKRKEMEL-ESKRSDSAVGP
        D  KS WE+K WI+++ML+++E+   ++ + VE+EKQR KW+RY SKK R++E+A+L+N+R +L+ E+ +L L+R E+EL E + S + V P
Subjt:  DPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMKLDNEQRVLQLKRKEMEL-ESKRSDSAVGP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATAGTTCAGGTTTGGGAGGTGGATTTCTGTCAGGAAATGGGGGGTTATTAGATTTAGAGTCTCCTATACGACGACCTCAAAAAACCCAATTGGTTAATCCCTCATT
GACACAACGCCATCAGTTGAACATGATGAGTAATTTTGAAGGTGATCACCAGTCCATTGGGATTTTGGACTCGAAAAGCTTGGGACAGAAGGATTTATTGATGGCGTTTA
ATAGAGGGAAAGCTATTGCCTCTGCTTGCATTACAAACAACTACACGAGTGAAGAAGATGAGCCAAGTTATACCGAGGATGGTGAGTGCTCTGAGTTTTTAAAGGGCAAA
AAGGGCTCTCCATGGCAAAGAATGAAGTGGACAGATGAGATTGTGAGGCTTCTCATAGCAGTGGTTGCTTGTGTAGGTGACGATGGCGAGGCTGGAATGGGATCGAAGAG
AAAATCTGGGATTTTGCATAAGAAGGGAAAATGGAGAACAGTGTCAAAGATTATGCAAAGTAAGGGGTGTCATGTTTCTCCACAGCAGTGTGAGGACAAGTTTAATGACT
TAAACAAAAGATACAAGAGATTAAACGATATCCTTGGGAAGGGAACCAGTTGTAGGGTTGTGGAGAACCCTGCACTCATGGACTCAATGCCTCACCTCTCAAGTAAAGCC
AAGGATGATGTAAGAAAAATATTGAGCTCAAAACACTTGTTTTATAAGGAAATGTGTGCTTACCATAATGGACAAACAATTCCTGGTTGCCAAGATGTTGATTTCCAAGG
TAAAATTTTGCCTGCTGCAAATTTCTCCAAAGGAAATAATGAATCAGAAGAAGCTGAGGATAGTGATAGTGATAGTGACAGTGGTGAATCAGATAATGAAGATGATCACT
CTCCTGCGGAAAATAGATTATGGTCGTCTGAATCTCGTGGCAGGGATAAAGTGAGTGCAGATGATGGTCCTCTTTGGTCAAATTCTGTTGGAAAAAATGAATTTGAAGGT
CAAATTGATGTCTTTCTTTCAGATCCAACAAAGTCCCAATGGGAGCGCAAAGTGTGGATTAAAAAACAGATGCTACAACTTCAGGAGCAATGTAACAGCTTCCAGGCTCA
ATCTGTTGAACTTGAGAAACAACGTTTCAAATGGTTAAGATATTGCAGTAAGAAAAATAGGGATTTGGAAAGAGCGAGGCTTGAAAATGAGAGGATGAAACTAGATAATG
AGCAGAGAGTACTGCAACTGAAGCGGAAGGAAATGGAACTAGAATCGAAAAGGTCTGATTCAGCCGTTGGCCCAATCCTTGCCTTCGATAGAATTCAAGGGAGAGAGCAA
CTTGATTTGGGTATGCATTGA
mRNA sequenceShow/hide mRNA sequence
TCGCACTTTTTGCGCCTAACATCCTCAATTACTCATAATTGTAACAATCACTAACTCAACATTTTTCCTAAAGGCTAAATCTGCAAATATGATGAAGAAAAAGGAAGGGA
GAAAAAAAAAAAAAAGAATTACTTCATCGCATACGCCGACCCCTTCCACTAGTTAGTCAATTGGTCTTCTGTGTACCTCGCTCAAGAACAAAAGTGGGAATTTTTTCCTG
TACCGAGGTAGGTCATGGAGGAGCTAATTTTCGTCTGCATAGGTTTTCTTTCTTGAATTTTTTGTTCATTAGGTTGGTCCAAGTTGCAAATATGATTTATTCCAGCGATT
TTTAATGGTCAACTTAGTTAAAAGGATATTTTTTCCTATATTCGACGCAAAGGCTGTGTAGTTTTTGTTTTTTTTTTTTTTTTTTTTTGTTTTTCTATTTGGTTCCGGTC
AGTTAAATCAGCATTTCGGAGATTTTGATGATGCACTTGCACTTTATGTGTGGAATAAGAAAGTGAAGTACAATGATGGAAAATGGAAATTGGGTTTTCGTTTAAACTGG
TTCAATGGTGGTTGCTGATGCCAAATCTGTAGCATTACTATTTAGCACGATTTGATCAAGCTTGGTGAAGCAGAGTTCTTATTCATTCTCATGAGTCCGAGTTGAATCTC
GGTTTTCTTTTAGGTGCTACAGCAAACTTGAGGGTTTCTAGGGTTTAGAGGCACGAAAATGGATAGTTCAGGTTTGGGAGGTGGATTTCTGTCAGGAAATGGGGGGTTAT
TAGATTTAGAGTCTCCTATACGACGACCTCAAAAAACCCAATTGGTTAATCCCTCATTGACACAACGCCATCAGTTGAACATGATGAGTAATTTTGAAGGTGATCACCAG
TCCATTGGGATTTTGGACTCGAAAAGCTTGGGACAGAAGGATTTATTGATGGCGTTTAATAGAGGGAAAGCTATTGCCTCTGCTTGCATTACAAACAACTACACGAGTGA
AGAAGATGAGCCAAGTTATACCGAGGATGGTGAGTGCTCTGAGTTTTTAAAGGGCAAAAAGGGCTCTCCATGGCAAAGAATGAAGTGGACAGATGAGATTGTGAGGCTTC
TCATAGCAGTGGTTGCTTGTGTAGGTGACGATGGCGAGGCTGGAATGGGATCGAAGAGAAAATCTGGGATTTTGCATAAGAAGGGAAAATGGAGAACAGTGTCAAAGATT
ATGCAAAGTAAGGGGTGTCATGTTTCTCCACAGCAGTGTGAGGACAAGTTTAATGACTTAAACAAAAGATACAAGAGATTAAACGATATCCTTGGGAAGGGAACCAGTTG
TAGGGTTGTGGAGAACCCTGCACTCATGGACTCAATGCCTCACCTCTCAAGTAAAGCCAAGGATGATGTAAGAAAAATATTGAGCTCAAAACACTTGTTTTATAAGGAAA
TGTGTGCTTACCATAATGGACAAACAATTCCTGGTTGCCAAGATGTTGATTTCCAAGGTAAAATTTTGCCTGCTGCAAATTTCTCCAAAGGAAATAATGAATCAGAAGAA
GCTGAGGATAGTGATAGTGATAGTGACAGTGGTGAATCAGATAATGAAGATGATCACTCTCCTGCGGAAAATAGATTATGGTCGTCTGAATCTCGTGGCAGGGATAAAGT
GAGTGCAGATGATGGTCCTCTTTGGTCAAATTCTGTTGGAAAAAATGAATTTGAAGGTCAAATTGATGTCTTTCTTTCAGATCCAACAAAGTCCCAATGGGAGCGCAAAG
TGTGGATTAAAAAACAGATGCTACAACTTCAGGAGCAATGTAACAGCTTCCAGGCTCAATCTGTTGAACTTGAGAAACAACGTTTCAAATGGTTAAGATATTGCAGTAAG
AAAAATAGGGATTTGGAAAGAGCGAGGCTTGAAAATGAGAGGATGAAACTAGATAATGAGCAGAGAGTACTGCAACTGAAGCGGAAGGAAATGGAACTAGAATCGAAAAG
GTCTGATTCAGCCGTTGGCCCAATCCTTGCCTTCGATAGAATTCAAGGGAGAGAGCAACTTGATTTGGGTATGCATTGAAAGCAGACGAAGAACTGAGAAAACCATCTAT
ATTTCAGAGCTGTTGGTTTGCATGAGATTTTGTCTATCATTGTGGAACGAATCCTTTAAGCCAAATGCTTGTTATCATGCTAACCTTGTAAGCATGTGTATACTTCTATT
TCACTTTAGGCTGACCCTTCTTGTTGGCCCTTGGAACTTTCTGCTGTAAAGCTCAATGATTAATGTTTAAGTGGCTTGGTTTTCATTTTCTACAAAATTTCTTCTTTCAT
TTTTCAAGGATATAAATTTGTGTGACTATATGGATTTATGTTAACATGCGAGTCTGGTGAACTGCTTGTTCTGAGAATAACGTCAAATCTCTTGCTCCCTTCATGGACAT
CCGAGATTCTCAGTTTCGGAGATCACAGACCTTATTAAAGCTATCAACAAGAGAAGACTCCCTTTGAAAAGTGCTTGGACGACAAAATCATGGCCATGGATAAACTCTCC
ACTCTTCTTTCACTGGTACCGGTTTTGCTGAGAAAAATATCTACTTCTGCCGTAGTACCTACCCTCTCCTTCATTATTTGATACATCATTAAGTGTATTTTATGGACCCA
TACAATTCTTAAAATATTATCTCTGTGATAAATAATGAATGGAGGTAGTATTTTCATACTATGCAATGTTAGAAATTTTTCCTGTCTTTCTCGGATGCTCATCTGCTATG
CTATCAGAATAAAGCTCTCAAATTATGAGCTTTTAAGTGTTATTATTAACTTTTGACCTGAATTTCCCAATATATGCCATTCTGATAAACATACTGATGAGCTGAATTTT
GGTTTCTTTCTTCAATTTTTGAAGTGAATTATGATTTATAAG
Protein sequenceShow/hide protein sequence
MDSSGLGGGFLSGNGGLLDLESPIRRPQKTQLVNPSLTQRHQLNMMSNFEGDHQSIGILDSKSLGQKDLLMAFNRGKAIASACITNNYTSEEDEPSYTEDGECSEFLKGK
KGSPWQRMKWTDEIVRLLIAVVACVGDDGEAGMGSKRKSGILHKKGKWRTVSKIMQSKGCHVSPQQCEDKFNDLNKRYKRLNDILGKGTSCRVVENPALMDSMPHLSSKA
KDDVRKILSSKHLFYKEMCAYHNGQTIPGCQDVDFQGKILPAANFSKGNNESEEAEDSDSDSDSGESDNEDDHSPAENRLWSSESRGRDKVSADDGPLWSNSVGKNEFEG
QIDVFLSDPTKSQWERKVWIKKQMLQLQEQCNSFQAQSVELEKQRFKWLRYCSKKNRDLERARLENERMKLDNEQRVLQLKRKEMELESKRSDSAVGPILAFDRIQGREQ
LDLGMH