; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr029081 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr029081
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptiontranscription factor MYB124 isoform X2
Genome locationtig00153210:3085161..3106712
RNA-Seq ExpressionSgr029081
SyntenySgr029081
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR002372 - Pyrrolo-quinoline quinone repeat
IPR009057 - Homeobox-like domain superfamily
IPR011047 - Quinoprotein alcohol dehydrogenase-like superfamily
IPR015943 - WD40/YVTN repeat-like-containing domain superfamily
IPR017930 - Myb domain
IPR018391 - Pyrrolo-quinoline quinone beta-propeller repeat


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151136.1 transcription factor MYB124 isoform X1 [Momordica charantia]1.0e-14070.69Show/hide
Query:  MQDTKKKLVTEDSKKKERHIVSWTQQVSISFPSLFHFLILEDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEA
        MQDTKKKL T+DSKKKERHIVSWTQQ              EDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEA
Subjt:  MQDTKKKLVTEDSKKKERHIVSWTQQVSISFPSLFHFLILEDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEA

Query:  QKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKENTSSFINHNNKRVLFQNGSNDVSSET-QPVKRMRRAHISDTTEVCSLEDGSKKPCRT
        QKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKEN+SSFIN++ KRVLFQNGSN VSSET Q VKRMRRAHISDTTE CSLEDGSKKPC  
Subjt:  QKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKENTSSFINHNNKRVLFQNGSNDVSSET-QPVKRMRRAHISDTTEVCSLEDGSKKPCRT

Query:  AMDQQLRAPFAVL---------------------------------------------------------------------------ILQDFLSRSKEN
         MD QLRAPFAVL                                                                           ILQDFLSRSK N
Subjt:  AMDQQLRAPFAVL---------------------------------------------------------------------------ILQDFLSRSKEN

Query:  DIPKSATPDIELHVEDIKDLVEDLRSSNDGSQWRQPNLHESPGSSGYSTGSTLVSHTAEEKIDQSQPEIGTHHQEAKFESGSTCTGEQNDFGESEKEMLP
        DIPK  TPDIE  VE+IK+ VEDLRSSNDGSQWRQP+LHESPGSS YSTGSTLVS TAEEKI QSQ EIGTHHQEAKFESGSTCTGEQNDFGESEKE +P
Subjt:  DIPKSATPDIELHVEDIKDLVEDLRSSNDGSQWRQPNLHESPGSSGYSTGSTLVSHTAEEKIDQSQPEIGTHHQEAKFESGSTCTGEQNDFGESEKEMLP

Query:  KTTLER
        K TLER
Subjt:  KTTLER

XP_022151137.1 transcription factor MYB124 isoform X2 [Momordica charantia]5.9e-14170.17Show/hide
Query:  MQDTKKKLVTEDSKKKERHIVSWTQQVSISFPSLFHFLILEDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEA
        MQDTKKKL T+DSKKKERHIVSWTQQ              EDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEA
Subjt:  MQDTKKKLVTEDSKKKERHIVSWTQQVSISFPSLFHFLILEDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEA

Query:  QKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKENTSSFINHNNKRVLFQNGSNDVSSET-QPVKRMRRAHISDTTEVCSLEDGSKKPCRT
        QKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKEN+SSFIN++ KRVLFQNGSN VSSET Q VKRMRRAHISDTTE CSLEDGSKKPC  
Subjt:  QKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKENTSSFINHNNKRVLFQNGSNDVSSET-QPVKRMRRAHISDTTEVCSLEDGSKKPCRT

Query:  AMDQQLRAPFAVL---------------------------------------------------------------------------ILQDFLSRSKEN
         MD QLRAPFAVL                                                                           ILQDFLSRSK N
Subjt:  AMDQQLRAPFAVL---------------------------------------------------------------------------ILQDFLSRSKEN

Query:  DIPKSATPDIELHVEDIKDLVEDLRSSNDGSQWRQPNLHESPGSSGYSTGSTLVSHTAEEKIDQSQPEIGTHHQEAKFESGSTCTGEQNDFGESEKEMLP
        DIPK  TPDIE  VE+IK+ VEDLRSSNDGSQWRQP+LHESPGSS YSTGSTLVS TAEEKI QSQ EIGTHHQEAKFESGSTCTGEQNDFGESEKE +P
Subjt:  DIPKSATPDIELHVEDIKDLVEDLRSSNDGSQWRQPNLHESPGSSGYSTGSTLVSHTAEEKIDQSQPEIGTHHQEAKFESGSTCTGEQNDFGESEKEMLP

Query:  KTTLERDHY
        K TLER+ +
Subjt:  KTTLERDHY

XP_023537464.1 transcription factor MYB124-like isoform X2 [Cucurbita pepo subsp. pepo]5.0e-14069.53Show/hide
Query:  MQDTKKKLVTEDSKKKERHIVSWTQQVSISFPSLFHFLILEDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEA
        MQDTKKKL TED KKKERHIVSWTQQ              EDDILREQI+ HGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEA
Subjt:  MQDTKKKLVTEDSKKKERHIVSWTQQVSISFPSLFHFLILEDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEA

Query:  QKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKENTSSFINHNNKRVLFQNGSNDVSSET-QPVKRMRRAHISDTTEVCSLEDGSKKPCRT
        QKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKEN+SSFINHNNKRVLFQ GSNDVSSET Q  KR+RRAHIS TTE CSL+DG KKPC +
Subjt:  QKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKENTSSFINHNNKRVLFQNGSNDVSSET-QPVKRMRRAHISDTTEVCSLEDGSKKPCRT

Query:  AMDQQLRAPFAVL-------------------------------------------------------------------------ILQDFLSRSKENDI
        AMDQQLRAPFAVL                                                                         ILQ+FLSR+KENDI
Subjt:  AMDQQLRAPFAVL-------------------------------------------------------------------------ILQDFLSRSKENDI

Query:  PKSATPDIELHVEDIKDLVEDLRSSNDGSQWRQPNLHESPGSSGYSTGSTLVSHTAEEKIDQSQPEIGTHHQEAKFESGSTCTGEQNDFGESEKEMLPKT
        PK  TPDI+L VED K  VEDLRSSNDGSQWR+ NLHESPGSS YSTGSTLVS TAEEKIDQSQPEIGTHHQE KFESGSTCTGEQN+ GESEKE+LPKT
Subjt:  PKSATPDIELHVEDIKDLVEDLRSSNDGSQWRQPNLHESPGSSGYSTGSTLVSHTAEEKIDQSQPEIGTHHQEAKFESGSTCTGEQNDFGESEKEMLPKT

Query:  TLERDHY
         LER+ +
Subjt:  TLERDHY

XP_038892945.1 transcription factor MYB124 isoform X1 [Benincasa hispida]3.8e-14070.48Show/hide
Query:  MQDTKKKLVTEDSKKKERHIVSWTQQVSISFPSLFHFLILEDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEA
        MQDTKKKL TEDSKKKERHIVSWTQQ              EDDILREQI+ HGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEA
Subjt:  MQDTKKKLVTEDSKKKERHIVSWTQQVSISFPSLFHFLILEDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEA

Query:  QKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKENTSSFINHNNKRVLFQNGSNDVSSET-QPVKRMRRAHISDTTEVCSLEDGSKKPCRT
        QKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEAL+KEN+SSFINHNNKRVLF  GSNDVSSET QPVKR+RRAHIS  TE C LEDGSK+PC  
Subjt:  QKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKENTSSFINHNNKRVLFQNGSNDVSSET-QPVKRMRRAHISDTTEVCSLEDGSKKPCRT

Query:  AMDQQLRAPFAVL--------------------------------------------------------------ILQDFLSRSKENDIPKSATPDIELH
        AMDQQLRAPF VL                                                              ILQDFLSRSKEND+PK+ TPD++L 
Subjt:  AMDQQLRAPFAVL--------------------------------------------------------------ILQDFLSRSKENDIPKSATPDIELH

Query:  VEDIKDLVEDLRSSNDGSQWRQPNLHESPGSSGYSTGSTLVSHTAEEKIDQSQPEIGTHHQEAKFESGSTCTGEQNDFGESEKEMLPKTTLER
        +ED K  VEDLRSSNDGSQWR+PNLHESPGSS YSTGSTLVS T EEK+DQSQPEIGT HQE KFESGSTCTGEQN+ GES KE+L KT LER
Subjt:  VEDIKDLVEDLRSSNDGSQWRQPNLHESPGSSGYSTGSTLVSHTAEEKIDQSQPEIGTHHQEAKFESGSTCTGEQNDFGESEKEMLPKTTLER

XP_038892946.1 transcription factor MYB124 isoform X2 [Benincasa hispida]2.2e-14069.95Show/hide
Query:  MQDTKKKLVTEDSKKKERHIVSWTQQVSISFPSLFHFLILEDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEA
        MQDTKKKL TEDSKKKERHIVSWTQQ              EDDILREQI+ HGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEA
Subjt:  MQDTKKKLVTEDSKKKERHIVSWTQQVSISFPSLFHFLILEDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEA

Query:  QKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKENTSSFINHNNKRVLFQNGSNDVSSET-QPVKRMRRAHISDTTEVCSLEDGSKKPCRT
        QKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEAL+KEN+SSFINHNNKRVLF  GSNDVSSET QPVKR+RRAHIS  TE C LEDGSK+PC  
Subjt:  QKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKENTSSFINHNNKRVLFQNGSNDVSSET-QPVKRMRRAHISDTTEVCSLEDGSKKPCRT

Query:  AMDQQLRAPFAVL--------------------------------------------------------------ILQDFLSRSKENDIPKSATPDIELH
        AMDQQLRAPF VL                                                              ILQDFLSRSKEND+PK+ TPD++L 
Subjt:  AMDQQLRAPFAVL--------------------------------------------------------------ILQDFLSRSKENDIPKSATPDIELH

Query:  VEDIKDLVEDLRSSNDGSQWRQPNLHESPGSSGYSTGSTLVSHTAEEKIDQSQPEIGTHHQEAKFESGSTCTGEQNDFGESEKEMLPKTTLERDHY
        +ED K  VEDLRSSNDGSQWR+PNLHESPGSS YSTGSTLVS T EEK+DQSQPEIGT HQE KFESGSTCTGEQN+ GES KE+L KT LER+ +
Subjt:  VEDIKDLVEDLRSSNDGSQWRQPNLHESPGSSGYSTGSTLVSHTAEEKIDQSQPEIGTHHQEAKFESGSTCTGEQNDFGESEKEMLPKTTLERDHY

TrEMBL top hitse value%identityAlignment
A0A0A0KVW3 Uncharacterized protein1.6e-13970.35Show/hide
Query:  MQDTKKKLVTEDSKKKERHIVSWTQQVSISFPSLFHFLILEDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEA
        MQDTKKKL TEDSKKKERHIVSWTQQ              EDDILREQI+ HGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEA
Subjt:  MQDTKKKLVTEDSKKKERHIVSWTQQVSISFPSLFHFLILEDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEA

Query:  QKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKENTSSFINHNNKRVLFQNGSNDVSSET-QPVKRMRRAHISDTTEVCSLEDGSKKPCRT
        QKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKEN+SSFINHNNKRV+FQ GSNDVSSET QPVKR+RRAHIS TTE CSLEDGSKKP   
Subjt:  QKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKENTSSFINHNNKRVLFQNGSNDVSSET-QPVKRMRRAHISDTTEVCSLEDGSKKPCRT

Query:  AMDQQLRAPFAVL--------------------------------------------------------------ILQDFLSRSKEND--IPKSATPDIE
        A+DQQLRAP AVL                                                              ILQDFLSRSKEN+  IPK  TPDI+
Subjt:  AMDQQLRAPFAVL--------------------------------------------------------------ILQDFLSRSKEND--IPKSATPDIE

Query:  LHVEDIKDLVEDLRSSNDGSQWRQPNLHESPGSSGYSTGSTLVSHTAEEKIDQSQPEIGTHHQEAKFESGSTCTGEQNDFGESEKEMLPKTTLERDHY
        L +ED K  VEDLRSSNDGSQWR+PNLHESPGSS YSTGSTLVS TAEEKIDQSQPEIGTHHQE +FESGS+CT EQ + GES KE+LPKT LER+ +
Subjt:  LHVEDIKDLVEDLRSSNDGSQWRQPNLHESPGSSGYSTGSTLVSHTAEEKIDQSQPEIGTHHQEAKFESGSTCTGEQNDFGESEKEMLPKTTLERDHY

A0A1S3BFL7 myb-like protein Q isoform X21.6e-13970.85Show/hide
Query:  MQDTKKKLVTEDSKKKERHIVSWTQQVSISFPSLFHFLILEDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEA
        MQDTKKKL TEDSKKKERHIVSWTQQ              EDDILREQI+ HGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEA
Subjt:  MQDTKKKLVTEDSKKKERHIVSWTQQVSISFPSLFHFLILEDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEA

Query:  QKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKENTSSFINHNNKRVLFQNGSNDVSSET-QPVKRMRRAHISDTTEVCSLEDGSKKPCRT
        QKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKEN+SSFINHNNKRVLFQ GSNDVSSET QPVKR+RRAHIS TTE C LEDGSKKP   
Subjt:  QKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKENTSSFINHNNKRVLFQNGSNDVSSET-QPVKRMRRAHISDTTEVCSLEDGSKKPCRT

Query:  AMDQQLRAPFAVL--------------------------------------------------------------ILQDFLSRSKEND--IPKSATPDIE
        AMDQQ RAPFAVL                                                              ILQDFLSRSKEND  I K  TPDI+
Subjt:  AMDQQLRAPFAVL--------------------------------------------------------------ILQDFLSRSKEND--IPKSATPDIE

Query:  LHVEDIKDLVEDLRSSNDGSQWRQPNLHESPGSSGYSTGSTLVSHTAEEKIDQSQPEIGTHHQEAKFESGSTCTGEQNDFGESEKEMLPKTTLERDHY
        L +ED K  VEDLRSSNDGSQWR+PNLHESPGSS YSTGSTLVS TAEEKIDQSQPEIGTHHQE +FESGS+CT EQ + GESEKE+LPKT LER+ +
Subjt:  LHVEDIKDLVEDLRSSNDGSQWRQPNLHESPGSSGYSTGSTLVSHTAEEKIDQSQPEIGTHHQEAKFESGSTCTGEQNDFGESEKEMLPKTTLERDHY

A0A5A7SUS6 Myb-like protein Q isoform X21.6e-13970.85Show/hide
Query:  MQDTKKKLVTEDSKKKERHIVSWTQQVSISFPSLFHFLILEDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEA
        MQDTKKKL TEDSKKKERHIVSWTQQ              EDDILREQI+ HGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEA
Subjt:  MQDTKKKLVTEDSKKKERHIVSWTQQVSISFPSLFHFLILEDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEA

Query:  QKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKENTSSFINHNNKRVLFQNGSNDVSSET-QPVKRMRRAHISDTTEVCSLEDGSKKPCRT
        QKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKEN+SSFINHNNKRVLFQ GSNDVSSET QPVKR+RRAHIS TTE C LEDGSKKP   
Subjt:  QKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKENTSSFINHNNKRVLFQNGSNDVSSET-QPVKRMRRAHISDTTEVCSLEDGSKKPCRT

Query:  AMDQQLRAPFAVL--------------------------------------------------------------ILQDFLSRSKEND--IPKSATPDIE
        AMDQQ RAPFAVL                                                              ILQDFLSRSKEND  I K  TPDI+
Subjt:  AMDQQLRAPFAVL--------------------------------------------------------------ILQDFLSRSKEND--IPKSATPDIE

Query:  LHVEDIKDLVEDLRSSNDGSQWRQPNLHESPGSSGYSTGSTLVSHTAEEKIDQSQPEIGTHHQEAKFESGSTCTGEQNDFGESEKEMLPKTTLERDHY
        L +ED K  VEDLRSSNDGSQWR+PNLHESPGSS YSTGSTLVS TAEEKIDQSQPEIGTHHQE +FESGS+CT EQ + GESEKE+LPKT LER+ +
Subjt:  LHVEDIKDLVEDLRSSNDGSQWRQPNLHESPGSSGYSTGSTLVSHTAEEKIDQSQPEIGTHHQEAKFESGSTCTGEQNDFGESEKEMLPKTTLERDHY

A0A6J1DCP7 transcription factor MYB124 isoform X22.9e-14170.17Show/hide
Query:  MQDTKKKLVTEDSKKKERHIVSWTQQVSISFPSLFHFLILEDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEA
        MQDTKKKL T+DSKKKERHIVSWTQQ              EDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEA
Subjt:  MQDTKKKLVTEDSKKKERHIVSWTQQVSISFPSLFHFLILEDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEA

Query:  QKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKENTSSFINHNNKRVLFQNGSNDVSSET-QPVKRMRRAHISDTTEVCSLEDGSKKPCRT
        QKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKEN+SSFIN++ KRVLFQNGSN VSSET Q VKRMRRAHISDTTE CSLEDGSKKPC  
Subjt:  QKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKENTSSFINHNNKRVLFQNGSNDVSSET-QPVKRMRRAHISDTTEVCSLEDGSKKPCRT

Query:  AMDQQLRAPFAVL---------------------------------------------------------------------------ILQDFLSRSKEN
         MD QLRAPFAVL                                                                           ILQDFLSRSK N
Subjt:  AMDQQLRAPFAVL---------------------------------------------------------------------------ILQDFLSRSKEN

Query:  DIPKSATPDIELHVEDIKDLVEDLRSSNDGSQWRQPNLHESPGSSGYSTGSTLVSHTAEEKIDQSQPEIGTHHQEAKFESGSTCTGEQNDFGESEKEMLP
        DIPK  TPDIE  VE+IK+ VEDLRSSNDGSQWRQP+LHESPGSS YSTGSTLVS TAEEKI QSQ EIGTHHQEAKFESGSTCTGEQNDFGESEKE +P
Subjt:  DIPKSATPDIELHVEDIKDLVEDLRSSNDGSQWRQPNLHESPGSSGYSTGSTLVSHTAEEKIDQSQPEIGTHHQEAKFESGSTCTGEQNDFGESEKEMLP

Query:  KTTLERDHY
        K TLER+ +
Subjt:  KTTLERDHY

A0A6J1DDN8 transcription factor MYB124 isoform X14.9e-14170.69Show/hide
Query:  MQDTKKKLVTEDSKKKERHIVSWTQQVSISFPSLFHFLILEDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEA
        MQDTKKKL T+DSKKKERHIVSWTQQ              EDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEA
Subjt:  MQDTKKKLVTEDSKKKERHIVSWTQQVSISFPSLFHFLILEDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEA

Query:  QKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKENTSSFINHNNKRVLFQNGSNDVSSET-QPVKRMRRAHISDTTEVCSLEDGSKKPCRT
        QKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKEN+SSFIN++ KRVLFQNGSN VSSET Q VKRMRRAHISDTTE CSLEDGSKKPC  
Subjt:  QKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKENTSSFINHNNKRVLFQNGSNDVSSET-QPVKRMRRAHISDTTEVCSLEDGSKKPCRT

Query:  AMDQQLRAPFAVL---------------------------------------------------------------------------ILQDFLSRSKEN
         MD QLRAPFAVL                                                                           ILQDFLSRSK N
Subjt:  AMDQQLRAPFAVL---------------------------------------------------------------------------ILQDFLSRSKEN

Query:  DIPKSATPDIELHVEDIKDLVEDLRSSNDGSQWRQPNLHESPGSSGYSTGSTLVSHTAEEKIDQSQPEIGTHHQEAKFESGSTCTGEQNDFGESEKEMLP
        DIPK  TPDIE  VE+IK+ VEDLRSSNDGSQWRQP+LHESPGSS YSTGSTLVS TAEEKI QSQ EIGTHHQEAKFESGSTCTGEQNDFGESEKE +P
Subjt:  DIPKSATPDIELHVEDIKDLVEDLRSSNDGSQWRQPNLHESPGSSGYSTGSTLVSHTAEEKIDQSQPEIGTHHQEAKFESGSTCTGEQNDFGESEKEMLP

Query:  KTTLER
        K TLER
Subjt:  KTTLER

SwissProt top hitse value%identityAlignment
F4IRB4 Transcription factor MYB884.9e-8247.89Show/hide
Query:  KKKLV--TEDSKKKERHIVSWTQQVSISFPSLFHFLILEDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEAQK
        KKK++  ++DSKKKERHIV+W+ +              EDDILR+QISL GTENWAIIASKF DK+TRQCRRRWYTYLNSDFK+GGWSPEED LLCEAQ+
Subjt:  KKKLV--TEDSKKKERHIVSWTQQVSISFPSLFHFLILEDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEAQK

Query:  IFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKEN-TSSFINHNNKRVLFQNGSN---DVSSETQPVKRMRRAHISDTTEVCSLEDGSKKPCR
        +FGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAK+EA+AKEN  +  +N +NKR+LF +G +      SE+   K+MRR+HI + TE+ S  D S     
Subjt:  IFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKEN-TSSFINHNNKRVLFQNGSN---DVSSETQPVKRMRRAHISDTTEVCSLEDGSKKPCR

Query:  TAMDQQLRAPFAVL------------------------------------------------------------------------ILQDFLSRSKENDI
        + M+QQ R PF+V+                                                                        +LQDFL++SKEND+
Subjt:  TAMDQQLRAPFAVL------------------------------------------------------------------------ILQDFLSRSKENDI

Query:  PKSATPDIELHVEDIKDLVEDLRSSNDGSQ--WRQPNLHESPGSSGYS----TGSTLVSHTAEEKIDQ--SQPEIGTHHQ
         +   PDI+  +++ KDLVEDLRSSN+ SQ  WRQP+LH+SP SS YS    +GST+++H + +K  Q  S  +  +H Q
Subjt:  PKSATPDIELHVEDIKDLVEDLRSSNDGSQ--WRQPNLHESPGSSGYS----TGSTLVSHTAEEKIDQ--SQPEIGTHHQ

P10244 Myb-related protein B1.7e-2146.81Show/hide
Query:  EDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEAQKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKR
        ED  + E +  +GT+ W +IA   K +  +QCR RW+ +LN + KK  W+ EED ++CEA K+ GNRW EIAK++ GRTDNAVKN + +  K++
Subjt:  EDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEAQKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKR

P48972 Myb-related protein B1.7e-2146.81Show/hide
Query:  EDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEAQKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKR
        ED  + E +  +GT+ W +IA   K +  +QCR RW+ +LN + KK  W+ EED ++CEA K+ GNRW EIAK++ GRTDNAVKN + +  K++
Subjt:  EDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEAQKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKR

P52551 Myb-related protein B7.5e-2243.75Show/hide
Query:  EDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEAQKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAK
        ED+ + E +  +GT++W +IA + + +  +QCR RW+ +LN + KK  W+ EED ++C+A K+ GNRW EIAK++ GRTDNAVKN + +  K++ +
Subjt:  EDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEAQKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAK

Q94FL6 Transcription factor MYB1243.5e-8046.54Show/hide
Query:  MQDTKKKLV-----TEDSKKKERHIVSWTQQVSISFPSLFHFLILEDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDL
        M+DTKKK        +DSKKKERHIV+W+Q+              ED ILREQI+LHGTENWAIIASKFKDK+TRQCRRRWYTYLNSDFK+GGWSPEED+
Subjt:  MQDTKKKLV-----TEDSKKKERHIVSWTQQVSISFPSLFHFLILEDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDL

Query:  LLCEAQKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKENTSSFINHNNKRVLFQNG---SNDVSSETQPVKRMRRAHISDTTEVCSLEDG
        LLCEAQ++FGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAK+EA+ K++     N N KR+LF +G        +ET   K+++R+HI D TE+ +   G
Subjt:  LLCEAQKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKENTSSFINHNNKRVLFQNG---SNDVSSETQPVKRMRRAHISDTTEVCSLEDG

Query:  SKKPCRTAMDQQLRAPFAVL-------------------------------------------------------------------ILQDFLSRSKEND
          + C   ++QQ+R+PF+VL                                                                   +LQDFL++ KEND
Subjt:  SKKPCRTAMDQQLRAPFAVL-------------------------------------------------------------------ILQDFLSRSKEND

Query:  IPKSATPDIELHVEDIKDLVEDLRSSNDGSQ--WRQPNLHESPGSSGYSTGSTLVSHTAEEKIDQSQPEIGTHHQE
        + +   PDI+  +E+ KDL+EDLRS  + +Q  WRQP+LH+SP SS YS+GST++   + +K      +  T H++
Subjt:  IPKSATPDIELHVEDIKDLVEDLRSSNDGSQ--WRQPNLHESPGSSGYSTGSTLVSHTAEEKIDQSQPEIGTHHQE

Arabidopsis top hitse value%identityAlignment
AT1G14350.1 Duplicated homeodomain-like superfamily protein2.5e-8146.54Show/hide
Query:  MQDTKKKLV-----TEDSKKKERHIVSWTQQVSISFPSLFHFLILEDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDL
        M+DTKKK        +DSKKKERHIV+W+Q+              ED ILREQI+LHGTENWAIIASKFKDK+TRQCRRRWYTYLNSDFK+GGWSPEED+
Subjt:  MQDTKKKLV-----TEDSKKKERHIVSWTQQVSISFPSLFHFLILEDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDL

Query:  LLCEAQKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKENTSSFINHNNKRVLFQNG---SNDVSSETQPVKRMRRAHISDTTEVCSLEDG
        LLCEAQ++FGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAK+EA+ K++     N N KR+LF +G        +ET   K+++R+HI D TE+ +   G
Subjt:  LLCEAQKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKENTSSFINHNNKRVLFQNG---SNDVSSETQPVKRMRRAHISDTTEVCSLEDG

Query:  SKKPCRTAMDQQLRAPFAVL-------------------------------------------------------------------ILQDFLSRSKEND
          + C   ++QQ+R+PF+VL                                                                   +LQDFL++ KEND
Subjt:  SKKPCRTAMDQQLRAPFAVL-------------------------------------------------------------------ILQDFLSRSKEND

Query:  IPKSATPDIELHVEDIKDLVEDLRSSNDGSQ--WRQPNLHESPGSSGYSTGSTLVSHTAEEKIDQSQPEIGTHHQE
        + +   PDI+  +E+ KDL+EDLRS  + +Q  WRQP+LH+SP SS YS+GST++   + +K      +  T H++
Subjt:  IPKSATPDIELHVEDIKDLVEDLRSSNDGSQ--WRQPNLHESPGSSGYSTGSTLVSHTAEEKIDQSQPEIGTHHQE

AT1G14350.2 Duplicated homeodomain-like superfamily protein2.5e-8146.54Show/hide
Query:  MQDTKKKLV-----TEDSKKKERHIVSWTQQVSISFPSLFHFLILEDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDL
        M+DTKKK        +DSKKKERHIV+W+Q+              ED ILREQI+LHGTENWAIIASKFKDK+TRQCRRRWYTYLNSDFK+GGWSPEED+
Subjt:  MQDTKKKLV-----TEDSKKKERHIVSWTQQVSISFPSLFHFLILEDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDL

Query:  LLCEAQKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKENTSSFINHNNKRVLFQNG---SNDVSSETQPVKRMRRAHISDTTEVCSLEDG
        LLCEAQ++FGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAK+EA+ K++     N N KR+LF +G        +ET   K+++R+HI D TE+ +   G
Subjt:  LLCEAQKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKENTSSFINHNNKRVLFQNG---SNDVSSETQPVKRMRRAHISDTTEVCSLEDG

Query:  SKKPCRTAMDQQLRAPFAVL-------------------------------------------------------------------ILQDFLSRSKEND
          + C   ++QQ+R+PF+VL                                                                   +LQDFL++ KEND
Subjt:  SKKPCRTAMDQQLRAPFAVL-------------------------------------------------------------------ILQDFLSRSKEND

Query:  IPKSATPDIELHVEDIKDLVEDLRSSNDGSQ--WRQPNLHESPGSSGYSTGSTLVSHTAEEKIDQSQPEIGTHHQE
        + +   PDI+  +E+ KDL+EDLRS  + +Q  WRQP+LH+SP SS YS+GST++   + +K      +  T H++
Subjt:  IPKSATPDIELHVEDIKDLVEDLRSSNDGSQ--WRQPNLHESPGSSGYSTGSTLVSHTAEEKIDQSQPEIGTHHQE

AT2G02820.1 myb domain protein 883.5e-8347.89Show/hide
Query:  KKKLV--TEDSKKKERHIVSWTQQVSISFPSLFHFLILEDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEAQK
        KKK++  ++DSKKKERHIV+W+ +              EDDILR+QISL GTENWAIIASKF DK+TRQCRRRWYTYLNSDFK+GGWSPEED LLCEAQ+
Subjt:  KKKLV--TEDSKKKERHIVSWTQQVSISFPSLFHFLILEDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEAQK

Query:  IFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKEN-TSSFINHNNKRVLFQNGSN---DVSSETQPVKRMRRAHISDTTEVCSLEDGSKKPCR
        +FGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAK+EA+AKEN  +  +N +NKR+LF +G +      SE+   K+MRR+HI + TE+ S  D S     
Subjt:  IFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKEN-TSSFINHNNKRVLFQNGSN---DVSSETQPVKRMRRAHISDTTEVCSLEDGSKKPCR

Query:  TAMDQQLRAPFAVL------------------------------------------------------------------------ILQDFLSRSKENDI
        + M+QQ R PF+V+                                                                        +LQDFL++SKEND+
Subjt:  TAMDQQLRAPFAVL------------------------------------------------------------------------ILQDFLSRSKENDI

Query:  PKSATPDIELHVEDIKDLVEDLRSSNDGSQ--WRQPNLHESPGSSGYS----TGSTLVSHTAEEKIDQ--SQPEIGTHHQ
         +   PDI+  +++ KDLVEDLRSSN+ SQ  WRQP+LH+SP SS YS    +GST+++H + +K  Q  S  +  +H Q
Subjt:  PKSATPDIELHVEDIKDLVEDLRSSNDGSQ--WRQPNLHESPGSSGYS----TGSTLVSHTAEEKIDQ--SQPEIGTHHQ

AT2G02820.2 myb domain protein 883.5e-8347.89Show/hide
Query:  KKKLV--TEDSKKKERHIVSWTQQVSISFPSLFHFLILEDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEAQK
        KKK++  ++DSKKKERHIV+W+ +              EDDILR+QISL GTENWAIIASKF DK+TRQCRRRWYTYLNSDFK+GGWSPEED LLCEAQ+
Subjt:  KKKLV--TEDSKKKERHIVSWTQQVSISFPSLFHFLILEDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEAQK

Query:  IFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKEN-TSSFINHNNKRVLFQNGSN---DVSSETQPVKRMRRAHISDTTEVCSLEDGSKKPCR
        +FGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAK+EA+AKEN  +  +N +NKR+LF +G +      SE+   K+MRR+HI + TE+ S  D S     
Subjt:  IFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKEN-TSSFINHNNKRVLFQNGSN---DVSSETQPVKRMRRAHISDTTEVCSLEDGSKKPCR

Query:  TAMDQQLRAPFAVL------------------------------------------------------------------------ILQDFLSRSKENDI
        + M+QQ R PF+V+                                                                        +LQDFL++SKEND+
Subjt:  TAMDQQLRAPFAVL------------------------------------------------------------------------ILQDFLSRSKENDI

Query:  PKSATPDIELHVEDIKDLVEDLRSSNDGSQ--WRQPNLHESPGSSGYS----TGSTLVSHTAEEKIDQ--SQPEIGTHHQ
         +   PDI+  +++ KDLVEDLRSSN+ SQ  WRQP+LH+SP SS YS    +GST+++H + +K  Q  S  +  +H Q
Subjt:  PKSATPDIELHVEDIKDLVEDLRSSNDGSQ--WRQPNLHESPGSSGYS----TGSTLVSHTAEEKIDQ--SQPEIGTHHQ

AT3G27785.1 myb domain protein 1183.2e-2034.62Show/hide
Query:  QDTKKKLVTEDSKKKE-----RHIVSWTQQVSISFPSLFHFLILEDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLL
        Q+ +  + TE  KKK      R I   T++ SI       +   ED +L + + LHGT+ W+ IA   + +  +QCR RW+ +L  D KK GW+ EED++
Subjt:  QDTKKKLVTEDSKKKE-----RHIVSWTQQVSISFPSLFHFLILEDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLL

Query:  LCEAQKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKENTSSFINHN
        L +A K  GNRW EIA+ + GRT+N +KN +    +++       K+  S  +  N
Subjt:  LCEAQKIFGNRWTEIAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKENTSSFINHN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGGACACAAAGAAGAAGCTCGTCACCGAAGATTCCAAGAAGAAGGAGCGACACATCGTTTCGTGGACACAACAGGTTTCCATTTCTTTTCCTTCTCTGTTTCACTT
TTTGATTCTTGAGGATGATATTCTCCGGGAGCAGATTAGCCTACATGGAACAGAAAACTGGGCAATCATTGCGTCGAAATTCAAGGATAAAACAACAAGACAGTGCAGAA
GAAGATGGTACACATACTTGAATTCTGATTTTAAGAAAGGGGGATGGTCTCCGGAAGAGGATTTGCTTCTATGCGAGGCGCAGAAAATATTTGGAAACAGATGGACGGAA
ATAGCAAAGGTGGTGTCAGGCAGAACGGACAATGCAGTGAAAAACAGGTTTACCACCCTGTGTAAAAAGAGAGCCAAATATGAAGCATTAGCGAAAGAGAACACAAGTTC
ATTCATCAACCACAACAACAAGAGGGTCCTATTTCAAAATGGGAGTAATGATGTCTCGTCAGAAACTCAACCGGTTAAGAGGATGAGGAGAGCACACATATCTGATACTA
CAGAAGTTTGCTCGCTTGAGGATGGATCAAAAAAGCCATGCAGAACAGCAATGGATCAGCAGCTGAGAGCCCCATTTGCAGTACTGATTCTCCAAGATTTCCTCAGTCGA
AGCAAAGAAAATGACATACCCAAATCTGCAACTCCGGATATTGAACTTCATGTGGAAGACATTAAAGATTTAGTGGAGGATTTAAGGAGTAGCAATGATGGTAGCCAATG
GAGACAACCCAATCTCCACGAGTCTCCTGGCAGTTCAGGATACAGTACTGGATCAACACTAGTATCCCATACCGCTGAGGAAAAAATAGATCAAAGTCAACCTGAGATAG
GGACTCATCATCAGGAAGCTAAATTTGAATCTGGGTCAACTTGCACTGGAGAGCAAAATGATTTTGGTGAATCTGAGAAAGAAATGCTTCCCAAGACAACCCTGGAGCGA
GACCATTACAACGCAGCCCCACTGCTTCGAAGACAGGGGATTTCCACTGAGCAGAGAAGACACCACAATGCTGACAAACTTGCGAGTTGTGGTAATGGTGGTATTAGCAA
GGGAACCAAATCGGCTTATTGTAAGGAAGATAAAATTTTGGCCTACTGCTCCACAGAGACAATAGAGAAGAATGTCCCACGCTGCCTCAGGATGCTGTCTGCAGAATTCT
ATGGCATGATATCCAGTACCATTAAGGAAGGAAAGGTGTTCAAATCTCTTTCCGTCGGATCCAAAGCGCTTAGTCGACCTAAGAGAAACGAAAAATTACTTAAAATCATC
AAAATGGACTTGAATCCGTTGGCGAGCAAAGCAAGAATATCGAGGATTTGCGCTCCTCAGGAACGTGATTCTTCCGAAGAGAAAAAGGAGTTCCTGAAGAACGCCTTGGT
AAATATAAGCGGACCAAATCCCGGCGACGCAAAAGGCGAGCACGAGCACTCGGCGGAATCCGCTGCCATGAGCTTCCATTCTCAGCCCTTGATGTCGCCGGGGAAACTAG
ATCCAGCAAAGATATTTAGAATTGCAAATGTACGTAAGAGAAAGAAGAGATTGAGGATCGAAGATTATAGAAGAATGGCGAAGAACCGAATTCGAGAAGAGGGAGAACCC
GGAGAAGTATCTTTTAAACGGAAGGCCAATGGCGCCGTTAGCGTGGACTTCCAATGGCGCGTAATGGGCTACGGAAGAAGGCCAGTACGACGTATCATCTATGATAAGAC
GATTCAATTTTCAAACCAGGTTCGTCTTTGTGCTGCCATATTTCTTGATGCAAATGTTCGAATGGAAGTGGGGTTTGTTTGCTGTATAAATGTAGGCCTGCAATGTGTTT
GGTTACCCAAGAAAATTTTTGTTGATGAAAGAGAAAGTCAGCTGCCCACTAGAGTTCCCTTGACCACTTCATTGGGGCAATTTGTCAAACTCTTCGTCCTTCTTATGGTG
GTAACCTTGTTTGCATTTCCTGATGTTACGAGTGGAGAATGGCTTAATCATGGGCATGATATAACAAACAGAAGAGACGCAGTGGGAGAGTTTCGGATCAACAGAAAAAC
TGTGTCGAAATTACGACTAAAATGGAAATTCTTAGCCGGAAGGGACATTTCCGCCACTCCGGCAGTGGCCAACGGTGTCGTCTACTTTCCGTCGTGGAATGATTTCTGTA
CGCTTATCGAGATCGACGCCGACGGTCGCCGGAAACCTCCTGATCGTCGAATTTATGGTCCTGCGGTTGTGATTGCCGTGGCGAGATCGAATGGGAGGCTGGTTTGGTTG
ACGGAGCTTGATCCGAATCCTACATTTACGATTCCAGACAACGGCGGTCGATTGGGAGGCTACGCCGGAGCCGCCATATGGGGAAGCAGCCCCGCTATCGATGAACGAAG
GAGACTTGTTTATGTCGGAACTGGGAACCTCTACACAGCGCCACCCGAGGTGTTGAAATGTCAAGAGTTGCAGAATAATCAAACCACAAGACCCACCCACCCACCCCGAC
CAATGCATCGGCCCAGACATTCACTTCAATTCAATCTTGGTCTGGATATCGACACCGGCAACATCGTCTGGTTCACTCAGCTGGGTGGATACGACGTCTTCTTCTTCGTC
TGTTTGGATCCCAACAACCCGGATTGTCCACCCGGTCCGAATTTGGATGCAGATTTTGGAGAAGCCCCCATGTGCTTACGATCAAGCCTAACGCCTCCCGGCCGGCCAGA
CTCGATGTTGTGGTGGCTGTGCAGAAGAGCGGGTTCGCTTGTGCTTTGGATCGTGATACCGGCGACATTGTTTGGTCCAGGTTGGCTGGACCTGGAGGCAAAGAAGGAGG
AGGCACATGGGGCGCAGCGACGGATAGAAGAAGAGTTGGCGCTGGACGTGGACACCGGTCGAATCCTCTGGTCGACGGCGAACCCCAGCAACGAAACAGCCAATGCGCCT
GTGTCGGTGGCCAACGGCGTCCTTTTCGCCGGCTCCGTCGCTCCAAATGGTCCGATATATGCCATGGACGCTAAAAGTGGAGAGATTATCTGGTCGTATAACACCGGAGC
CACTGTTTATGCAGGCATTGCAGTGAGTTATGGATGTATCTATTTAGGGAATGGATACAAGATTGGGCTGTCGATATTCCACCCCACTTGGACCGCTGGAACTTCCCTCT
TTGCTTTCTGCGTGTCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCAGGACACAAAGAAGAAGCTCGTCACCGAAGATTCCAAGAAGAAGGAGCGACACATCGTTTCGTGGACACAACAGGTTTCCATTTCTTTTCCTTCTCTGTTTCACTT
TTTGATTCTTGAGGATGATATTCTCCGGGAGCAGATTAGCCTACATGGAACAGAAAACTGGGCAATCATTGCGTCGAAATTCAAGGATAAAACAACAAGACAGTGCAGAA
GAAGATGGTACACATACTTGAATTCTGATTTTAAGAAAGGGGGATGGTCTCCGGAAGAGGATTTGCTTCTATGCGAGGCGCAGAAAATATTTGGAAACAGATGGACGGAA
ATAGCAAAGGTGGTGTCAGGCAGAACGGACAATGCAGTGAAAAACAGGTTTACCACCCTGTGTAAAAAGAGAGCCAAATATGAAGCATTAGCGAAAGAGAACACAAGTTC
ATTCATCAACCACAACAACAAGAGGGTCCTATTTCAAAATGGGAGTAATGATGTCTCGTCAGAAACTCAACCGGTTAAGAGGATGAGGAGAGCACACATATCTGATACTA
CAGAAGTTTGCTCGCTTGAGGATGGATCAAAAAAGCCATGCAGAACAGCAATGGATCAGCAGCTGAGAGCCCCATTTGCAGTACTGATTCTCCAAGATTTCCTCAGTCGA
AGCAAAGAAAATGACATACCCAAATCTGCAACTCCGGATATTGAACTTCATGTGGAAGACATTAAAGATTTAGTGGAGGATTTAAGGAGTAGCAATGATGGTAGCCAATG
GAGACAACCCAATCTCCACGAGTCTCCTGGCAGTTCAGGATACAGTACTGGATCAACACTAGTATCCCATACCGCTGAGGAAAAAATAGATCAAAGTCAACCTGAGATAG
GGACTCATCATCAGGAAGCTAAATTTGAATCTGGGTCAACTTGCACTGGAGAGCAAAATGATTTTGGTGAATCTGAGAAAGAAATGCTTCCCAAGACAACCCTGGAGCGA
GACCATTACAACGCAGCCCCACTGCTTCGAAGACAGGGGATTTCCACTGAGCAGAGAAGACACCACAATGCTGACAAACTTGCGAGTTGTGGTAATGGTGGTATTAGCAA
GGGAACCAAATCGGCTTATTGTAAGGAAGATAAAATTTTGGCCTACTGCTCCACAGAGACAATAGAGAAGAATGTCCCACGCTGCCTCAGGATGCTGTCTGCAGAATTCT
ATGGCATGATATCCAGTACCATTAAGGAAGGAAAGGTGTTCAAATCTCTTTCCGTCGGATCCAAAGCGCTTAGTCGACCTAAGAGAAACGAAAAATTACTTAAAATCATC
AAAATGGACTTGAATCCGTTGGCGAGCAAAGCAAGAATATCGAGGATTTGCGCTCCTCAGGAACGTGATTCTTCCGAAGAGAAAAAGGAGTTCCTGAAGAACGCCTTGGT
AAATATAAGCGGACCAAATCCCGGCGACGCAAAAGGCGAGCACGAGCACTCGGCGGAATCCGCTGCCATGAGCTTCCATTCTCAGCCCTTGATGTCGCCGGGGAAACTAG
ATCCAGCAAAGATATTTAGAATTGCAAATGTACGTAAGAGAAAGAAGAGATTGAGGATCGAAGATTATAGAAGAATGGCGAAGAACCGAATTCGAGAAGAGGGAGAACCC
GGAGAAGTATCTTTTAAACGGAAGGCCAATGGCGCCGTTAGCGTGGACTTCCAATGGCGCGTAATGGGCTACGGAAGAAGGCCAGTACGACGTATCATCTATGATAAGAC
GATTCAATTTTCAAACCAGGTTCGTCTTTGTGCTGCCATATTTCTTGATGCAAATGTTCGAATGGAAGTGGGGTTTGTTTGCTGTATAAATGTAGGCCTGCAATGTGTTT
GGTTACCCAAGAAAATTTTTGTTGATGAAAGAGAAAGTCAGCTGCCCACTAGAGTTCCCTTGACCACTTCATTGGGGCAATTTGTCAAACTCTTCGTCCTTCTTATGGTG
GTAACCTTGTTTGCATTTCCTGATGTTACGAGTGGAGAATGGCTTAATCATGGGCATGATATAACAAACAGAAGAGACGCAGTGGGAGAGTTTCGGATCAACAGAAAAAC
TGTGTCGAAATTACGACTAAAATGGAAATTCTTAGCCGGAAGGGACATTTCCGCCACTCCGGCAGTGGCCAACGGTGTCGTCTACTTTCCGTCGTGGAATGATTTCTGTA
CGCTTATCGAGATCGACGCCGACGGTCGCCGGAAACCTCCTGATCGTCGAATTTATGGTCCTGCGGTTGTGATTGCCGTGGCGAGATCGAATGGGAGGCTGGTTTGGTTG
ACGGAGCTTGATCCGAATCCTACATTTACGATTCCAGACAACGGCGGTCGATTGGGAGGCTACGCCGGAGCCGCCATATGGGGAAGCAGCCCCGCTATCGATGAACGAAG
GAGACTTGTTTATGTCGGAACTGGGAACCTCTACACAGCGCCACCCGAGGTGTTGAAATGTCAAGAGTTGCAGAATAATCAAACCACAAGACCCACCCACCCACCCCGAC
CAATGCATCGGCCCAGACATTCACTTCAATTCAATCTTGGTCTGGATATCGACACCGGCAACATCGTCTGGTTCACTCAGCTGGGTGGATACGACGTCTTCTTCTTCGTC
TGTTTGGATCCCAACAACCCGGATTGTCCACCCGGTCCGAATTTGGATGCAGATTTTGGAGAAGCCCCCATGTGCTTACGATCAAGCCTAACGCCTCCCGGCCGGCCAGA
CTCGATGTTGTGGTGGCTGTGCAGAAGAGCGGGTTCGCTTGTGCTTTGGATCGTGATACCGGCGACATTGTTTGGTCCAGGTTGGCTGGACCTGGAGGCAAAGAAGGAGG
AGGCACATGGGGCGCAGCGACGGATAGAAGAAGAGTTGGCGCTGGACGTGGACACCGGTCGAATCCTCTGGTCGACGGCGAACCCCAGCAACGAAACAGCCAATGCGCCT
GTGTCGGTGGCCAACGGCGTCCTTTTCGCCGGCTCCGTCGCTCCAAATGGTCCGATATATGCCATGGACGCTAAAAGTGGAGAGATTATCTGGTCGTATAACACCGGAGC
CACTGTTTATGCAGGCATTGCAGTGAGTTATGGATGTATCTATTTAGGGAATGGATACAAGATTGGGCTGTCGATATTCCACCCCACTTGGACCGCTGGAACTTCCCTCT
TTGCTTTCTGCGTGTCGTGA
Protein sequenceShow/hide protein sequence
MQDTKKKLVTEDSKKKERHIVSWTQQVSISFPSLFHFLILEDDILREQISLHGTENWAIIASKFKDKTTRQCRRRWYTYLNSDFKKGGWSPEEDLLLCEAQKIFGNRWTE
IAKVVSGRTDNAVKNRFTTLCKKRAKYEALAKENTSSFINHNNKRVLFQNGSNDVSSETQPVKRMRRAHISDTTEVCSLEDGSKKPCRTAMDQQLRAPFAVLILQDFLSR
SKENDIPKSATPDIELHVEDIKDLVEDLRSSNDGSQWRQPNLHESPGSSGYSTGSTLVSHTAEEKIDQSQPEIGTHHQEAKFESGSTCTGEQNDFGESEKEMLPKTTLER
DHYNAAPLLRRQGISTEQRRHHNADKLASCGNGGISKGTKSAYCKEDKILAYCSTETIEKNVPRCLRMLSAEFYGMISSTIKEGKVFKSLSVGSKALSRPKRNEKLLKII
KMDLNPLASKARISRICAPQERDSSEEKKEFLKNALVNISGPNPGDAKGEHEHSAESAAMSFHSQPLMSPGKLDPAKIFRIANVRKRKKRLRIEDYRRMAKNRIREEGEP
GEVSFKRKANGAVSVDFQWRVMGYGRRPVRRIIYDKTIQFSNQVRLCAAIFLDANVRMEVGFVCCINVGLQCVWLPKKIFVDERESQLPTRVPLTTSLGQFVKLFVLLMV
VTLFAFPDVTSGEWLNHGHDITNRRDAVGEFRINRKTVSKLRLKWKFLAGRDISATPAVANGVVYFPSWNDFCTLIEIDADGRRKPPDRRIYGPAVVIAVARSNGRLVWL
TELDPNPTFTIPDNGGRLGGYAGAAIWGSSPAIDERRRLVYVGTGNLYTAPPEVLKCQELQNNQTTRPTHPPRPMHRPRHSLQFNLGLDIDTGNIVWFTQLGGYDVFFFV
CLDPNNPDCPPGPNLDADFGEAPMCLRSSLTPPGRPDSMLWWLCRRAGSLVLWIVIPATLFGPGWLDLEAKKEEAHGAQRRIEEELALDVDTGRILWSTANPSNETANAP
VSVANGVLFAGSVAPNGPIYAMDAKSGEIIWSYNTGATVYAGIAVSYGCIYLGNGYKIGLSIFHPTWTAGTSLFAFCVS