CuGenDBv2

Sgr017432 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr017432
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: MYB domain class transcription factor
Genome location: tig00153047:845297..856582
RNA-Seq Expression: Sgr017432
Synteny: Sgr017432
Gene Ontology terms: GO:0010119 - regulation of stomatal movement (biological process)
InterPro domains: IPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain
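
The genome location in this record follows a `contig:start..end` pattern. The following is a minimal Python sketch for splitting such a string. It assumes the coordinates are 1-based and inclusive, which the page does not state, and `parse_location` is a hypothetical helper name rather than part of any CuGenDB interface.

```python
import re


def parse_location(location):
    """Split 'contig:start..end' into (contig, start, end).

    parse_location is a hypothetical helper, not part of any CuGenDB API.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)


contig, start, end = parse_location("tig00153047:845297..856582")
# Assumption: coordinates are 1-based and inclusive, so the locus
# covers end - start + 1 bases.
span = end - start + 1
print(contig, start, end, span)
```

Under that assumption the locus spans 11286 bases on contig tig00153047.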


Homology
GenBank top hits (hit; e-value; % identity; alignment)
KAG6587447.1 Transcription factor MYB86, partial [Cucurbita argyrosperma subsp. sororia]; e-value 2.0e-133; identity 79.13%
Query:  MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
        MGRNSCCLK KLRKGLWSPEEDEKLFNYI TFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
Subjt:  MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT

Query:  DNEIKNFWNSCLKKKLMKQGIDPATHKPLED-MEAMKLEKKNYAEKPRVEVA-HHLHA-ISN-TISSEASFLNDSNRYYGKQTEDSAQHYNVNKIEFDSL
        DNEIKNFWNSCLKKKLMKQGIDPATHKPLE+ ME M+       +KPR+E A HHLH  ISN TISS     NDSN +Y KQ EDS+ H+ VNKIEFDSL
Subjt:  DNEIKNFWNSCLKKKLMKQGIDPATHKPLED-MEAMKLEKKNYAEKPRVEVA-HHLHA-ISN-TISSEASFLNDSNRYYGKQTEDSAQHYNVNKIEFDSL

Query:  -SYLGGEYNSQFAAAQFQSSVRSCDQNFLYSNSSFGLPSYSSSEHGNISRTDFSDNSASGLSSFFMNEVKESSSNSSVVSNYSGYHVNNNPGENGGFSWE
         SY G       A  Q QSS+RSCDQN+LYSNSSFG+PSYSSSEHGN SRTDFS+NSASGLSSFFMNEVKESSSNSSVVSNYSG+H   N   NGGFSWE
Subjt:  -SYLGGEYNSQFAAAQFQSSVRSCDQNFLYSNSSFGLPSYSSSEHGNISRTDFSDNSASGLSSFFMNEVKESSSNSSVVSNYSGYHVNNNPGENGGFSWE

Query:  AENKLDSLFQFQTNEVKALE-FKGSSG-QETK-IQTQNSARNISS
        AENKL+SLFQFQTNEVK +E  KGSS  +ETK IQTQ ++ N  S
Subjt:  AENKLDSLFQFQTNEVKALE-FKGSSG-QETK-IQTQNSARNISS

XP_008464521.1 PREDICTED: transcription factor MYB32-like isoform X1 [Cucumis melo]; e-value 1.1e-134; identity 74.16%
Query:  MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
        MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
Subjt:  MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT

Query:  DNEIKNFWNSCLKKKLMKQGIDPATHKPLED-MEAMKLEKKNYA---EKPRVEVAHHLHAISNTISSEASFLNDSNRYYGKQTEDSAQHYNVNKIEFDSL
        DNEIKNFWNSCLKKKLMKQGIDPATHKPLE+ MEA+K EKKN     EKP++E  HH H   + +    S +N          E+S+QH+ VNKIEFDSL
Subjt:  DNEIKNFWNSCLKKKLMKQGIDPATHKPLED-MEAMKLEKKNYA---EKPRVEVAHHLHAISNTISSEASFLNDSNRYYGKQTEDSAQHYNVNKIEFDSL

Query:  -SYLGGEY--NSQFAAAQFQSSVRSCDQNFLYSNSSFGLPSYSSSEHGNISRTDFSDNSASGLSSFFMNEVKESSS-NSSVVSNYSGYHVNNNPGEN---
         SYL GEY  NSQFA+AQ+ S++RS DQNF YSNSS G+ SYSS EHGN+SRTDFS+NSASGL+SFFMNEVKESSS NSSVVSNYSGYH+NNNPGEN   
Subjt:  -SYLGGEY--NSQFAAAQFQSSVRSCDQNFLYSNSSFGLPSYSSSEHGNISRTDFSDNSASGLSSFFMNEVKESSS-NSSVVSNYSGYHVNNNPGEN---

Query:  -----GGFSWEAENKLDSLFQFQTNEVKALEFKGSSGQETKIQTQNSARNISSREY
             GGFSWE+ENKL++L +FQTN++K LEFKGSS  E     QN   + +   Y
Subjt:  -----GGFSWEAENKLDSLFQFQTNEVKALEFKGSSGQETKIQTQNSARNISSREY

XP_022134589.1 protein ODORANT1-like [Momordica charantia]; e-value 1.6e-154; identity 85.13%
Query:  MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
        MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLH+VLGNRWAQIAAQLPGRT
Subjt:  MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT

Query:  DNEIKNFWNSCLKKKLMKQGIDPATHKPLED-MEAMKLEKKNYAEKPRVEVAHHLHA-ISN-TISSEASFLNDSNRYYGKQTEDSAQHYNVNKIEFDSLS
        DNEIKNFWNS LKKKLMKQGIDPATHKPLE+ MEAM   KK+Y EK RV+VAHHL   ISN TISSEASFL DSN YY KQTEDSAQH+ VNKIEFDS+S
Subjt:  DNEIKNFWNSCLKKKLMKQGIDPATHKPLED-MEAMKLEKKNYAEKPRVEVAHHLHA-ISN-TISSEASFLNDSNRYYGKQTEDSAQHYNVNKIEFDSLS

Query:  YLGGEY-NSQFAAAQFQ-SSVRSCDQNFLYSNSSFGLPSYSSSEHGNISRTDFSDNSASGLSSFFMNEVKESSSNSSVVSNYSGYHVNNNPGENGGFSWE
        Y+GGEY NSQFAA Q+Q ++ RSCD N+ YSNSSFGLPSYSSSEHGN+SRTDFS+NSASGLSSFF+NEVKESSSNSSVVSNYSGYHVNN    NGGFSWE
Subjt:  YLGGEY-NSQFAAAQFQ-SSVRSCDQNFLYSNSSFGLPSYSSSEHGNISRTDFSDNSASGLSSFFMNEVKESSSNSSVVSNYSGYHVNNNPGENGGFSWE

Query:  AENKLDSLFQFQTNEVKALEFKGSSGQETK-IQTQNSARNISS
         ENKL++LFQFQTNEVK LEFKGSSG+ETK IQTQN++ +  S
Subjt:  AENKLDSLFQFQTNEVKALEFKGSSGQETK-IQTQNSARNISS

XP_022933373.1 transcription factor MYB61-like isoform X1 [Cucurbita moschata]; e-value 1.2e-133; identity 79.07%
Query:  MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
        MGRNSCC K KLRKGLWSPEEDEKLFNYI TFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
Subjt:  MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT

Query:  DNEIKNFWNSCLKKKLMKQGIDPATHKPLED-MEAMKLEKKNYAEKPRVEVA-HHLHA-ISN-TISSEASFLNDSNRYYGKQTEDSAQHYNVNKIEFDSL
        DNEIKNFWNSCLKKKLMKQGIDPATHKPLE+ ME M+       +KPR+E A HHLH  ISN TISS     NDSN +Y KQ EDS+ H+ VNKIEFDSL
Subjt:  DNEIKNFWNSCLKKKLMKQGIDPATHKPLED-MEAMKLEKKNYAEKPRVEVA-HHLHA-ISN-TISSEASFLNDSNRYYGKQTEDSAQHYNVNKIEFDSL

Query:  SYLGGEYNSQFAAAQFQSSVRSCDQNFLYSNSSFGLPSYSSSEHGNISRTDFSDNSASGLSSFFMNEVKESSSNSSVVSNYSGYHVNNNPGENGGFSWEA
        S   G      A  Q QS++RSCDQNFLYSNSSFG+PSYSSSEHGN SRTDFS+NSASGLSSFFMNEVKESSSNSSVVSNYSG+H   N   NGGFSWEA
Subjt:  SYLGGEYNSQFAAAQFQSSVRSCDQNFLYSNSSFGLPSYSSSEHGNISRTDFSDNSASGLSSFFMNEVKESSSNSSVVSNYSGYHVNNNPGENGGFSWEA

Query:  ENKLDSLFQFQTNEVKALE-FKGSSG-QETK-IQTQNSARNISS
        ENKL+SLFQFQTNEVK +E  KGSS  +ETK IQTQN++ N  S
Subjt:  ENKLDSLFQFQTNEVKALE-FKGSSG-QETK-IQTQNSARNISS

XP_038879668.1 transcription repressor MYB5-like [Benincasa hispida]; e-value 2.1e-138; identity 77.84%
Query:  MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
        MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLII+LHEVLGNRWAQIAAQLPGRT
Subjt:  MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT

Query:  DNEIKNFWNSCLKKKLMKQGIDPATHKPLED---MEAMKLEKKN--YAEKPRVEVAHHLHAISNTISSEASFLNDSNRYYGKQTEDSAQHYNVNKIEFDS
        DNEIKNFWNSCLKKKLMKQGIDP THKPLED   MEAM+  KKN    EKPR+EVAHH H +   IS   +   DSN YY K+TEDS+QH+ V KIEFD 
Subjt:  DNEIKNFWNSCLKKKLMKQGIDPATHKPLED---MEAMKLEKKN--YAEKPRVEVAHHLHAISNTISSEASFLNDSNRYYGKQTEDSAQHYNVNKIEFDS

Query:  L-SYLGGEY--NSQFAAAQFQSSVRSC-DQNFLYSNSSFGLPSYSSSEHGNISRTDFSDNSASGLSSFFMNEVKESSSNSSVVSNYSGYHVNNNPGE--N
        L S+L G+Y  NSQFAAAQ+QS++RS  DQNF YSNSS G+PSYSSSEHGN+SRTDFS+NSASGL+SFF+NEVKESSSNSSVVSNYSGYH+ NNPGE  N
Subjt:  L-SYLGGEY--NSQFAAAQFQSSVRSC-DQNFLYSNSSFGLPSYSSSEHGNISRTDFSDNSASGLSSFFMNEVKESSSNSSVVSNYSGYHVNNNPGE--N

Query:  G-GFSWEAENKLDSLFQFQTNEVKALEFKGSSGQETKIQTQNSARNISSREY
        G GFSWE+ENKL++L QFQTN++K LEFKGSS +ET I  QN   + +   Y
Subjt:  G-GFSWEAENKLDSLFQFQTNEVKALEFKGSSGQETKIQTQNSARNISSREY

TrEMBL top hits (hit; e-value; % identity; alignment)
A0A1S3CLT8 transcription factor MYB32-like isoform X1; e-value 5.3e-135; identity 74.16%
Query:  MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
        MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
Subjt:  MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT

Query:  DNEIKNFWNSCLKKKLMKQGIDPATHKPLED-MEAMKLEKKNYA---EKPRVEVAHHLHAISNTISSEASFLNDSNRYYGKQTEDSAQHYNVNKIEFDSL
        DNEIKNFWNSCLKKKLMKQGIDPATHKPLE+ MEA+K EKKN     EKP++E  HH H   + +    S +N          E+S+QH+ VNKIEFDSL
Subjt:  DNEIKNFWNSCLKKKLMKQGIDPATHKPLED-MEAMKLEKKNYA---EKPRVEVAHHLHAISNTISSEASFLNDSNRYYGKQTEDSAQHYNVNKIEFDSL

Query:  -SYLGGEY--NSQFAAAQFQSSVRSCDQNFLYSNSSFGLPSYSSSEHGNISRTDFSDNSASGLSSFFMNEVKESSS-NSSVVSNYSGYHVNNNPGEN---
         SYL GEY  NSQFA+AQ+ S++RS DQNF YSNSS G+ SYSS EHGN+SRTDFS+NSASGL+SFFMNEVKESSS NSSVVSNYSGYH+NNNPGEN   
Subjt:  -SYLGGEY--NSQFAAAQFQSSVRSCDQNFLYSNSSFGLPSYSSSEHGNISRTDFSDNSASGLSSFFMNEVKESSS-NSSVVSNYSGYHVNNNPGEN---

Query:  -----GGFSWEAENKLDSLFQFQTNEVKALEFKGSSGQETKIQTQNSARNISSREY
             GGFSWE+ENKL++L +FQTN++K LEFKGSS  E     QN   + +   Y
Subjt:  -----GGFSWEAENKLDSLFQFQTNEVKALEFKGSSGQETKIQTQNSARNISSREY

A0A5A7UUJ1 Transcription factor MYB32-like isoform X1; e-value 5.3e-135; identity 74.16%
Query:  MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
        MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
Subjt:  MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT

Query:  DNEIKNFWNSCLKKKLMKQGIDPATHKPLED-MEAMKLEKKNYA---EKPRVEVAHHLHAISNTISSEASFLNDSNRYYGKQTEDSAQHYNVNKIEFDSL
        DNEIKNFWNSCLKKKLMKQGIDPATHKPLE+ MEA+K EKKN     EKP++E  HH H   + +    S +N          E+S+QH+ VNKIEFDSL
Subjt:  DNEIKNFWNSCLKKKLMKQGIDPATHKPLED-MEAMKLEKKNYA---EKPRVEVAHHLHAISNTISSEASFLNDSNRYYGKQTEDSAQHYNVNKIEFDSL

Query:  -SYLGGEY--NSQFAAAQFQSSVRSCDQNFLYSNSSFGLPSYSSSEHGNISRTDFSDNSASGLSSFFMNEVKESSS-NSSVVSNYSGYHVNNNPGEN---
         SYL GEY  NSQFA+AQ+ S++RS DQNF YSNSS G+ SYSS EHGN+SRTDFS+NSASGL+SFFMNEVKESSS NSSVVSNYSGYH+NNNPGEN   
Subjt:  -SYLGGEY--NSQFAAAQFQSSVRSCDQNFLYSNSSFGLPSYSSSEHGNISRTDFSDNSASGLSSFFMNEVKESSS-NSSVVSNYSGYHVNNNPGEN---

Query:  -----GGFSWEAENKLDSLFQFQTNEVKALEFKGSSGQETKIQTQNSARNISSREY
             GGFSWE+ENKL++L +FQTN++K LEFKGSS  E     QN   + +   Y
Subjt:  -----GGFSWEAENKLDSLFQFQTNEVKALEFKGSSGQETKIQTQNSARNISSREY

A0A6J1BZ59 protein ODORANT1-like; e-value 7.8e-155; identity 85.13%
Query:  MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
        MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLH+VLGNRWAQIAAQLPGRT
Subjt:  MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT

Query:  DNEIKNFWNSCLKKKLMKQGIDPATHKPLED-MEAMKLEKKNYAEKPRVEVAHHLHA-ISN-TISSEASFLNDSNRYYGKQTEDSAQHYNVNKIEFDSLS
        DNEIKNFWNS LKKKLMKQGIDPATHKPLE+ MEAM   KK+Y EK RV+VAHHL   ISN TISSEASFL DSN YY KQTEDSAQH+ VNKIEFDS+S
Subjt:  DNEIKNFWNSCLKKKLMKQGIDPATHKPLED-MEAMKLEKKNYAEKPRVEVAHHLHA-ISN-TISSEASFLNDSNRYYGKQTEDSAQHYNVNKIEFDSLS

Query:  YLGGEY-NSQFAAAQFQ-SSVRSCDQNFLYSNSSFGLPSYSSSEHGNISRTDFSDNSASGLSSFFMNEVKESSSNSSVVSNYSGYHVNNNPGENGGFSWE
        Y+GGEY NSQFAA Q+Q ++ RSCD N+ YSNSSFGLPSYSSSEHGN+SRTDFS+NSASGLSSFF+NEVKESSSNSSVVSNYSGYHVNN    NGGFSWE
Subjt:  YLGGEY-NSQFAAAQFQ-SSVRSCDQNFLYSNSSFGLPSYSSSEHGNISRTDFSDNSASGLSSFFMNEVKESSSNSSVVSNYSGYHVNNNPGENGGFSWE

Query:  AENKLDSLFQFQTNEVKALEFKGSSGQETK-IQTQNSARNISS
         ENKL++LFQFQTNEVK LEFKGSSG+ETK IQTQN++ +  S
Subjt:  AENKLDSLFQFQTNEVKALEFKGSSGQETK-IQTQNSARNISS

A0A6J1F4K3 transcription factor MYB61-like isoform X1; e-value 5.8e-134; identity 79.07%
Query:  MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
        MGRNSCC K KLRKGLWSPEEDEKLFNYI TFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
Subjt:  MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT

Query:  DNEIKNFWNSCLKKKLMKQGIDPATHKPLED-MEAMKLEKKNYAEKPRVEVA-HHLHA-ISN-TISSEASFLNDSNRYYGKQTEDSAQHYNVNKIEFDSL
        DNEIKNFWNSCLKKKLMKQGIDPATHKPLE+ ME M+       +KPR+E A HHLH  ISN TISS     NDSN +Y KQ EDS+ H+ VNKIEFDSL
Subjt:  DNEIKNFWNSCLKKKLMKQGIDPATHKPLED-MEAMKLEKKNYAEKPRVEVA-HHLHA-ISN-TISSEASFLNDSNRYYGKQTEDSAQHYNVNKIEFDSL

Query:  SYLGGEYNSQFAAAQFQSSVRSCDQNFLYSNSSFGLPSYSSSEHGNISRTDFSDNSASGLSSFFMNEVKESSSNSSVVSNYSGYHVNNNPGENGGFSWEA
        S   G      A  Q QS++RSCDQNFLYSNSSFG+PSYSSSEHGN SRTDFS+NSASGLSSFFMNEVKESSSNSSVVSNYSG+H   N   NGGFSWEA
Subjt:  SYLGGEYNSQFAAAQFQSSVRSCDQNFLYSNSSFGLPSYSSSEHGNISRTDFSDNSASGLSSFFMNEVKESSSNSSVVSNYSGYHVNNNPGENGGFSWEA

Query:  ENKLDSLFQFQTNEVKALE-FKGSSG-QETK-IQTQNSARNISS
        ENKL+SLFQFQTNEVK +E  KGSS  +ETK IQTQN++ N  S
Subjt:  ENKLDSLFQFQTNEVKALE-FKGSSG-QETK-IQTQNSARNISS

A0A6J1JM49 transcription factor MYB86-like; e-value 4.6e-131; identity 75.87%
Query:  MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
        MGRNSCCLKPKLRKGLWSPEEDEKLFNYIT+FGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEE+LIISLHE LGNRWAQIA +LPGRT
Subjt:  MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT

Query:  DNEIKNFWNSCLKKKLMKQGIDPATHKPLED-MEAMKLEKKNYAEKPRVEVAHHLHA-ISN-TISSEASFLNDSNRYYGKQTEDSAQHYNVNKIEFDSL-
        DNEIKNFWNSCLKKKLMKQGIDP THKPLED MEAMK       +KPR EV H L   +SN TISSEAS LNDSN+YY KQTEDS+QH+ VNKIEFDSL 
Subjt:  DNEIKNFWNSCLKKKLMKQGIDPATHKPLED-MEAMKLEKKNYAEKPRVEVAHHLHA-ISN-TISSEASFLNDSNRYYGKQTEDSAQHYNVNKIEFDSL-

Query:  SYLGGEY--NSQFAAAQFQSSVRSCDQNFLYSNSSFGLPSYSSSEHGNISRTDFSDNSASGLSSFFMNEVKESSSNSSVVSNYSGYHVNNNPGENGGFSW
        SYLGGEY  NSQFAAAQ+Q S      N  YSNSS G+PSY+SSEHGN+SRTDFS+NS SG S FFMNEVKESSSNSSV+SNYSGY        +GGFSW
Subjt:  SYLGGEY--NSQFAAAQFQSSVRSCDQNFLYSNSSFGLPSYSSSEHGNISRTDFSDNSASGLSSFFMNEVKESSSNSSVVSNYSGYHVNNNPGENGGFSW

Query:  EAENKLDSLFQFQTNEVKALEFKGSSGQETK-IQTQNSARNISS
        E+E KL+ L QF+TN++K +  KGSS +ETK I+TQ ++ +  S
Subjt:  EAENKLDSLFQFQTNEVKALEFKGSSGQETK-IQTQNSARNISS

SwissProt top hits (hit; e-value; % identity; alignment)
P20027 Myb-related protein Hv33; e-value 1.7e-58; identity 80.99%
Query:  KPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRTDNEIKNFW
        +PK+RKGLWSPEEDEKL+N+I   GVGCWSSVP+LA L RCGKSCRLRWINYLRPDLKRG FSQQEED I++LH++LGNRW+QIA+ LPGRTDNEIKNFW
Subjt:  KPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRTDNEIKNFW

Query:  NSCLKKKLMKQGIDPATHKPL
        NSC+KKKL +QGIDPATHKP+
Subjt:  NSCLKKKLMKQGIDPATHKPL

P80073 Myb-related protein Pp2; e-value 3.1e-52; identity 68.7%
Query:  MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
        MGR  CC K  LR+G W+ EED+KL ++IT  G+ CW ++PKLAGL RCGKSCRLRW NYLRPDLKRG+FS+ EE+LI+ LH  LGNRW++IAAQLPGRT
Subjt:  MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT

Query:  DNEIKNFWNSCLKKKLMKQGIDPATHKPLED
        DNEIKN+WN+ LKK+L  QG+DP TH PLED
Subjt:  DNEIKNFWNSCLKKKLMKQGIDPATHKPLED

Q8LPH6 Transcription factor MYB86; e-value 5.2e-63; identity 80.29%
Query:  MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
        MGR+SCC K KLRKGLWSPEEDEKL NYIT  G GCWSSVPKLAGL+RCGKSCRLRWINYLRPDLKRG FSQ EE LII LH  LGNRW+QIA +LPGRT
Subjt:  MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT

Query:  DNEIKNFWNSCLKKKLMKQGIDPATHKPLEDMEAMKL
        DNEIKNFWNSCLKKKL ++GIDP THKPL   E   L
Subjt:  DNEIKNFWNSCLKKKLMKQGIDPATHKPLEDMEAMKL

Q8VZQ2 Transcription factor MYB61; e-value 3.1e-60; identity 44.21%
Query:  MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
        MGR+SCC K KLRKGLWSPEEDEKL  +IT  G GCWSSVPKLAGL+RCGKSCRLRWINYLRPDLKRG FS +EE+LI+ LH VLGNRW+QIA++LPGRT
Subjt:  MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT

Query:  DNEIKNFWNSCLKKKLMKQGIDPATHKPLEDMEAMKLEKKNYAEKPRVEVAHHLHAISNTISSEASFLNDSNRYYGKQTEDSAQHYNVNKIEFDSLSYLG
        DNEIKN WNS +KKKL ++GIDP THKP+ ++E+   + K      +     H    S++ +++  FL        ++  D + ++   K+ F+S   L 
Subjt:  DNEIKNFWNSCLKKKLMKQGIDPATHKPLEDMEAMKLEKKNYAEKPRVEVAHHLHAISNTISSEASFLNDSNRYYGKQTEDSAQHYNVNKIEFDSLSYLG

Query:  GEYNSQFAA---AQFQ--SSVRSCDQNFLYSNSSFGLPSYSSSEHGNISRTDFSDNSASGLSSFFMNEVKESSSNSSVVSNYSGYHVNNNPG---ENGGF
           +S   +    QF   + V S  Q  +    S  LP                +NS+S +S    + VK ++ N    +N      NNN     +NGGF
Subjt:  GEYNSQFAA---AQFQ--SSVRSCDQNFLYSNSSFGLPSYSSSEHGNISRTDFSDNSASGLSSFFMNEVKESSSNSSVVSNYSGYHVNNNPG---ENGGF

Query:  SWEAENKLDSLFQFQTN----EVKALEF
        SW   N   S  Q + N    E+K  E+
Subjt:  SWEAENKLDSLFQFQTN----EVKALEF

Q9SPG3 Transcription factor MYB26; e-value 1.3e-50; identity 65.28%
Query:  MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAG---------LERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQ
        MG +SCC K K+++GLWSPEEDEKL NYI ++G GCWSSVPK AG         L+RCGKSCRLRWINYLRPDLKRG FS QE  LII LH +LGNRWAQ
Subjt:  MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAG---------LERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQ

Query:  IAAQLPGRTDNEIKNFWNSCLKKKLMKQGIDPATHKPLEDMEAM
        IA  LPGRTDNE+KNFWNS +KKKLM        H  L  M ++
Subjt:  IAAQLPGRTDNEIKNFWNSCLKKKLMKQGIDPATHKPLEDMEAM

Arabidopsis top hits (hit; e-value; % identity; alignment)
AT1G09540.1 myb domain protein 61; e-value 2.2e-61; identity 44.21%
Query:  MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
        MGR+SCC K KLRKGLWSPEEDEKL  +IT  G GCWSSVPKLAGL+RCGKSCRLRWINYLRPDLKRG FS +EE+LI+ LH VLGNRW+QIA++LPGRT
Subjt:  MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT

Query:  DNEIKNFWNSCLKKKLMKQGIDPATHKPLEDMEAMKLEKKNYAEKPRVEVAHHLHAISNTISSEASFLNDSNRYYGKQTEDSAQHYNVNKIEFDSLSYLG
        DNEIKN WNS +KKKL ++GIDP THKP+ ++E+   + K      +     H    S++ +++  FL        ++  D + ++   K+ F+S   L 
Subjt:  DNEIKNFWNSCLKKKLMKQGIDPATHKPLEDMEAMKLEKKNYAEKPRVEVAHHLHAISNTISSEASFLNDSNRYYGKQTEDSAQHYNVNKIEFDSLSYLG

Query:  GEYNSQFAA---AQFQ--SSVRSCDQNFLYSNSSFGLPSYSSSEHGNISRTDFSDNSASGLSSFFMNEVKESSSNSSVVSNYSGYHVNNNPG---ENGGF
           +S   +    QF   + V S  Q  +    S  LP                +NS+S +S    + VK ++ N    +N      NNN     +NGGF
Subjt:  GEYNSQFAA---AQFQ--SSVRSCDQNFLYSNSSFGLPSYSSSEHGNISRTDFSDNSASGLSSFFMNEVKESSSNSSVVSNYSGYHVNNNPG---ENGGF

Query:  SWEAENKLDSLFQFQTN----EVKALEF
        SW   N   S  Q + N    E+K  E+
Subjt:  SWEAENKLDSLFQFQTN----EVKALEF

AT1G57560.1 myb domain protein 50; e-value 1.5e-62; identity 46.84%
Query:  MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
        M R+SCC K KLRKGLWSPEEDEKL NYIT  G GCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRG FS +E++LI+ LH VLGNRW+QIAA+LPGRT
Subjt:  MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT

Query:  DNEIKNFWNSCLKKKLMKQGIDPATHKPLEDMEAMKLEKKNYAEKPRVEVAHHLHAISNTISSEASFLNDSNR-YYGKQTEDSAQHYNVNKIEFDSLSY-
        DNEIKN WNSC+KKKLMK+GIDP THKPL                   EV    +   N  ++  SF +++N+  + K+T D A++    K E +S+S  
Subjt:  DNEIKNFWNSCLKKKLMKQGIDPATHKPLEDMEAMKLEKKNYAEKPRVEVAHHLHAISNTISSEASFLNDSNR-YYGKQTEDSAQHYNVNKIEFDSLSY-

Query:  --LGGEYNSQFAAAQFQSSVRSCDQNFLYSNSSFGLP-------SYSSSEHGNISRTDFSDNS--ASGLSSFFMNEVK--ESSSNSSVVSNYSGYHVNNN
          L     +QF       S    D       S   LP       + S  +H N+S  ++  NS   S L++  M E+K  E   N S+ S         +
Subjt:  --LGGEYNSQFAAAQFQSSVRSCDQNFLYSNSSFGLP-------SYSSSEHGNISRTDFSDNS--ASGLSSFFMNEVK--ESSSNSSVVSNYSGYHVNNN

Query:  PGENGGFSWEAENKLD
           N  F W      D
Subjt:  PGENGGFSWEAENKLD

AT4G01680.1 myb domain protein 55; e-value 1.2e-59; identity 42.9%
Query:  MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
        MGR+SCC K KLRKGLWSPEEDEKL  YIT +G GCWSSVPK AGL+RCGKSCRLRWINYLRPDLKRG FSQ EE+LII LH VLGNRW+QIAAQLPGRT
Subjt:  MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT

Query:  DNEIKNFWNSCLKKKLMKQGIDPATHKPLEDMEAMKLEKKNYAEKPRVEVAHHLHAISNTISSEASFLNDSNRYYGKQTEDSAQHYNVNKIEFDSLSYLG
        DNEIKN WNSCLKKKL  +GIDP THK L ++E    +K    EK +          S+T +   +  N+++  Y         ++   ++  ++ S + 
Subjt:  DNEIKNFWNSCLKKKLMKQGIDPATHKPLEDMEAMKLEKKNYAEKPRVEVAHHLHAISNTISSEASFLNDSNRYYGKQTEDSAQHYNVNKIEFDSLSYLG

Query:  ------------GEYNSQFAAAQFQSSVRSCDQNFL-----YSNSSFGLPSYSSS--------EHGNISRTDFSDNSASGLSSFFMNEVKESSSNSSVVS
                    G  +         S+V      F      Y +S+ GL    +S        EH  I  +++++++  G  +       E + N   +S
Subjt:  ------------GEYNSQFAAAQFQSSVRSCDQNFL-----YSNSSFGLPSYSSS--------EHGNISRTDFSDNSASGLSSFFMNEVKESSSNSSVVS

Query:  NYSGYHVNNN-PGENGGFSWEAEN
        N+S   + ++   E   F  EA N
Subjt:  NYSGYHVNNN-PGENGGFSWEAEN

AT4G01680.3 myb domain protein 55; e-value 2.4e-63; identity 65.71%
Query:  MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
        MGR+SCC K KLRKGLWSPEEDEKL  YIT +G GCWSSVPK AGL+RCGKSCRLRWINYLRPDLKRG FSQ EE+LII LH VLGNRW+QIAAQLPGRT
Subjt:  MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT

Query:  DNEIKNFWNSCLKKKLMKQGIDPATHKPLEDMEAMKLEKKNYAEKPRVEVAHHLHAISNTISSEASFLNDSNRYY
        DNEIKN WNSCLKKKL  +GIDP THK L ++E    +K    EK +          S+T +   +  N+++  Y
Subjt:  DNEIKNFWNSCLKKKLMKQGIDPATHKPLEDMEAMKLEKKNYAEKPRVEVAHHLHAISNTISSEASFLNDSNRYY

AT5G26660.1 myb domain protein 86; e-value 3.7e-64; identity 80.29%
Query:  MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT
        MGR+SCC K KLRKGLWSPEEDEKL NYIT  G GCWSSVPKLAGL+RCGKSCRLRWINYLRPDLKRG FSQ EE LII LH  LGNRW+QIA +LPGRT
Subjt:  MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRT

Query:  DNEIKNFWNSCLKKKLMKQGIDPATHKPLEDMEAMKL
        DNEIKNFWNSCLKKKL ++GIDP THKPL   E   L
Subjt:  DNEIKNFWNSCLKKKLMKQGIDPATHKPLEDMEAMKL
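
In the alignments above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The following is a minimal Python sketch for counting these columns in one block. The helper name `midline_stats` is hypothetical, and the midline must be sliced at the same column offset as the Query sequence so that the leading indentation is not counted.

```python
def midline_stats(midline):
    """Return (identical, similar, total) column counts for a BLAST-style midline."""
    identical = sum(1 for ch in midline if ch.isalpha())  # letters = identities
    similar = midline.count("+")                          # '+' = conservative substitutions
    return identical, similar, len(midline)


# Midline of the final block of the P20027 (Hv33) alignment, copied from this page.
midline = "NSC+KKKL +QGIDPATHKP+"
identical, similar, length = midline_stats(midline)
percent_identity = 100.0 * identical / length
print(identical, similar, length, round(percent_identity, 2))
```

The percentage for a single block will generally differ from the % identity reported for the hit, which is computed over the whole alignment.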


Sequences
CDS sequence
ATGGGACGCAATTCTTGTTGCTTGAAGCCTAAACTCCGCAAAGGCCTCTGGTCCCCTGAGGAAGATGAGAAACTCTTCAATTACATTACCACTTTTGGCGTTGGCTGCTG
GAGCTCCGTCCCCAAGCTCGCTGGCTTGGAGAGGTGTGGCAAGAGTTGCAGGCTGAGGTGGATAAACTACTTGAGGCCTGATTTGAAGAGAGGGATGTTCTCGCAGCAGG
AGGAGGATCTCATTATCAGTCTTCATGAAGTTCTGGGAAACAGGTGGGCTCAGATTGCTGCACAGTTACCTGGAAGAACAGACAATGAGATTAAGAACTTTTGGAATTCG
TGTTTGAAGAAGAAGCTGATGAAGCAAGGAATCGACCCAGCAACCCACAAGCCATTGGAGGACATGGAAGCCATGAAATTAGAGAAGAAGAATTATGCAGAGAAGCCACG
CGTGGAAGTGGCTCATCATCTTCATGCCATTTCAAATACAATTTCTTCAGAAGCTTCGTTTCTTAATGATTCGAATCGTTACTACGGCAAGCAAACAGAAGATTCAGCGC
AGCATTATAACGTCAATAAGATTGAATTCGACTCTCTCTCCTACTTGGGTGGTGAGTATAATTCACAGTTTGCAGCAGCACAATTCCAGTCGAGTGTCAGATCATGCGAC
CAGAACTTTTTGTATTCAAATTCAAGTTTTGGGCTGCCGAGTTATTCAAGTTCTGAGCATGGGAACATATCAAGAACAGATTTTTCTGATAATTCAGCTTCTGGTTTGAG
CTCCTTCTTCATGAACGAGGTGAAGGAGAGCTCGAGCAATAGCTCGGTGGTGAGCAATTATTCTGGGTACCATGTGAACAACAATCCAGGGGAAAATGGAGGTTTCTCAT
GGGAAGCTGAGAACAAGCTCGACAGTTTGTTTCAGTTTCAGACAAACGAAGTGAAAGCCTTGGAATTCAAGGGAAGTTCAGGGCAAGAAACCAAGATTCAAACTCAAAAC
TCAGCAAGGAACATCTCATCACGTGAATATTGGGTCACCCAAAGATTATGCTTCAAGAAAGAAGCTCTTCTAAGAAACTTGGCCTCTGAACCAGCTAAAGGCAAACAATT
TGAACCAGGAGGAAATTCAGTTCCCTTCGATCCTCCCAATTCTGATACCTTATCACCTGGACTTTTTATTTTTATTTTTTTTCTTTTACTATTGTTCGGGGAAGGGGGGG
GGGGGTGGTTGTTAATCACCATTCCAAAAGAGAAAGGAATGCCTGCGACTTGCCTGATAGAAGTGCCAGAAGAATCCATACTCATAATTAGCCACAGTACAAATAAAGGA
CACAGTCAGGCGCCTAGATCTTCTAACTTCTGCCAACCCAGTCCTCCAGTCTTGATGCTTCCACAGAATTCCGTGATCCTCTTCATGCATGCACACACAATTTTCAATTG
TTTCAACACCACCAGTGAAATTTGTAAACAGGAGGAAGTGGAACAAGTTTGCGGTCTTCAAATTCTATCACCACCATGTTTTGGATATCCACAAGGACATGTATGCCCTC
AACTGGACGAGCATAACCATTTTCCATTGGGCAGTCACTTTCAGTTCTACAAAATACTAGAGGTTTTGCAAGTCTTCGACCAGGAGCATCAACTTCGCTGTGATAACCAA
CACACCTGCAGCAAAGAGAGAATCGCATCGACAGGACGAATTAACGACGGCATGGAAACTTTCTTGGTACGGCGATCGTCGGCGCGATCGTTCGAGGACAGGTTCCAATC
TTGGACCGCATTGGCTGAAGCAAGAGGCGCAGCCTCTCGAGGAACGGAAATAGAACCGGCACCACGCGCATGGCAGCAAAGATCACCTGAAAACGTCGCTTTTTTCGAAG
CTGGGGCCATTGCAAAGGATTCTTAA
mRNA sequence
ATGGGACGCAATTCTTGTTGCTTGAAGCCTAAACTCCGCAAAGGCCTCTGGTCCCCTGAGGAAGATGAGAAACTCTTCAATTACATTACCACTTTTGGCGTTGGCTGCTG
GAGCTCCGTCCCCAAGCTCGCTGGCTTGGAGAGGTGTGGCAAGAGTTGCAGGCTGAGGTGGATAAACTACTTGAGGCCTGATTTGAAGAGAGGGATGTTCTCGCAGCAGG
AGGAGGATCTCATTATCAGTCTTCATGAAGTTCTGGGAAACAGGTGGGCTCAGATTGCTGCACAGTTACCTGGAAGAACAGACAATGAGATTAAGAACTTTTGGAATTCG
TGTTTGAAGAAGAAGCTGATGAAGCAAGGAATCGACCCAGCAACCCACAAGCCATTGGAGGACATGGAAGCCATGAAATTAGAGAAGAAGAATTATGCAGAGAAGCCACG
CGTGGAAGTGGCTCATCATCTTCATGCCATTTCAAATACAATTTCTTCAGAAGCTTCGTTTCTTAATGATTCGAATCGTTACTACGGCAAGCAAACAGAAGATTCAGCGC
AGCATTATAACGTCAATAAGATTGAATTCGACTCTCTCTCCTACTTGGGTGGTGAGTATAATTCACAGTTTGCAGCAGCACAATTCCAGTCGAGTGTCAGATCATGCGAC
CAGAACTTTTTGTATTCAAATTCAAGTTTTGGGCTGCCGAGTTATTCAAGTTCTGAGCATGGGAACATATCAAGAACAGATTTTTCTGATAATTCAGCTTCTGGTTTGAG
CTCCTTCTTCATGAACGAGGTGAAGGAGAGCTCGAGCAATAGCTCGGTGGTGAGCAATTATTCTGGGTACCATGTGAACAACAATCCAGGGGAAAATGGAGGTTTCTCAT
GGGAAGCTGAGAACAAGCTCGACAGTTTGTTTCAGTTTCAGACAAACGAAGTGAAAGCCTTGGAATTCAAGGGAAGTTCAGGGCAAGAAACCAAGATTCAAACTCAAAAC
TCAGCAAGGAACATCTCATCACGTGAATATTGGGTCACCCAAAGATTATGCTTCAAGAAAGAAGCTCTTCTAAGAAACTTGGCCTCTGAACCAGCTAAAGGCAAACAATT
TGAACCAGGAGGAAATTCAGTTCCCTTCGATCCTCCCAATTCTGATACCTTATCACCTGGACTTTTTATTTTTATTTTTTTTCTTTTACTATTGTTCGGGGAAGGGGGGG
GGGGGTGGTTGTTAATCACCATTCCAAAAGAGAAAGGAATGCCTGCGACTTGCCTGATAGAAGTGCCAGAAGAATCCATACTCATAATTAGCCACAGTACAAATAAAGGA
CACAGTCAGGCGCCTAGATCTTCTAACTTCTGCCAACCCAGTCCTCCAGTCTTGATGCTTCCACAGAATTCCGTGATCCTCTTCATGCATGCACACACAATTTTCAATTG
TTTCAACACCACCAGTGAAATTTGTAAACAGGAGGAAGTGGAACAAGTTTGCGGTCTTCAAATTCTATCACCACCATGTTTTGGATATCCACAAGGACATGTATGCCCTC
AACTGGACGAGCATAACCATTTTCCATTGGGCAGTCACTTTCAGTTCTACAAAATACTAGAGGTTTTGCAAGTCTTCGACCAGGAGCATCAACTTCGCTGTGATAACCAA
CACACCTGCAGCAAAGAGAGAATCGCATCGACAGGACGAATTAACGACGGCATGGAAACTTTCTTGGTACGGCGATCGTCGGCGCGATCGTTCGAGGACAGGTTCCAATC
TTGGACCGCATTGGCTGAAGCAAGAGGCGCAGCCTCTCGAGGAACGGAAATAGAACCGGCACCACGCGCATGGCAGCAAAGATCACCTGAAAACGTCGCTTTTTTCGAAG
CTGGGGCCATTGCAAAGGATTCTTAA
Protein sequence
MGRNSCCLKPKLRKGLWSPEEDEKLFNYITTFGVGCWSSVPKLAGLERCGKSCRLRWINYLRPDLKRGMFSQQEEDLIISLHEVLGNRWAQIAAQLPGRTDNEIKNFWNS
CLKKKLMKQGIDPATHKPLEDMEAMKLEKKNYAEKPRVEVAHHLHAISNTISSEASFLNDSNRYYGKQTEDSAQHYNVNKIEFDSLSYLGGEYNSQFAAAQFQSSVRSCD
QNFLYSNSSFGLPSYSSSEHGNISRTDFSDNSASGLSSFFMNEVKESSSNSSVVSNYSGYHVNNNPGENGGFSWEAENKLDSLFQFQTNEVKALEFKGSSGQETKIQTQN
SARNISSREYWVTQRLCFKKEALLRNLASEPAKGKQFEPGGNSVPFDPPNSDTLSPGLFIFIFFLLLLFGEGGGGWLLITIPKEKGMPATCLIEVPEESILIISHSTNKG
HSQAPRSSNFCQPSPPVLMLPQNSVILFMHAHTIFNCFNTTSEICKQEEVEQVCGLQILSPPCFGYPQGHVCPQLDEHNHFPLGSHFQFYKILEVLQVFDQEHQLRCDNQ
HTCSKERIASTGRINDGMETFLVRRSSARSFEDRFQSWTALAEARGAASRGTEIEPAPRAWQQRSPENVAFFEAGAIAKDS
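
The protein sequence can be checked against the CDS by translating codons in frame. The following is a minimal Python sketch, assuming the standard genetic code (NCBI translation table 1), which the page does not name explicitly. `translate` is a hypothetical helper, and the two inputs are the first 30 and last 9 bases of the CDS listed above.

```python
from itertools import product

# Standard genetic code (assumed), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a DNA coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the CDS sequence shown in this record.
cds_start = "ATGGGACGCAATTCTTGTTGCTTGAAGCCT"
# Last 9 bases of the CDS sequence shown in this record.
cds_end = "GATTCTTAA"
print(translate(cds_start), translate(cds_end))
```

The first ten residues come out as MGRNSCCLKP, matching the start of the protein sequence, and the final codon TAA translates to a stop.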