; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr020157 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr020157
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionMyb transcription factor
Genome locationtig00153449:416304..427670
RNA-Seq ExpressionSgr020157
SyntenySgr020157
Gene Ontology termsGO:0009723 - response to ethylene (biological process)
GO:0009733 - response to auxin (biological process)
GO:0009739 - response to gibberellin (biological process)
GO:0003677 - DNA binding (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR006447 - Myb domain, plants
IPR009057 - Homeobox-like domain superfamily
IPR017884 - SANT domain
IPR017930 - Myb domain
IPR026992 - Non-haem dioxygenase N-terminal domain
IPR027443 - Isopenicillin N synthase-like superfamily
IPR044861 - Isopenicillin N synthase-like, Fe(2+) 2OG dioxygenase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0041072.1 transcription factor MYB1R1 [Cucumis melo var. makuwa]7.7e-17694.61Show/hide
Query:  MTRRCSHCSHNGHNSRTCPNRGVKLFGVRLTDGSIRKSASMGNLSHYAGSGSGALQGGSNNPASPGETPEHGVAADGYASEDFVPGSSSSCRERKKGVPW
        MTRRCSHCSHNGHNSRTCPNR VKLFGVRLTDGSIRKSASMGNL+HYAGSGSGALQ GSNNPASPGETPEHGVAADGYASEDFVPGSSSSCRERKKGVPW
Subjt:  MTRRCSHCSHNGHNSRTCPNRGVKLFGVRLTDGSIRKSASMGNLSHYAGSGSGALQGGSNNPASPGETPEHGVAADGYASEDFVPGSSSSCRERKKGVPW

Query:  TEEEHRMFLLGLQKLGKGDWRGIARNYVISRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVADEPVETSIVQQDFLSVNSSHAETQSNNPLPTPPTVD
        TEEEHRMFLLGLQKLGKGDWRGIARNYV+SRTPTQVASHAQKYFIRQTNVSRR+RRSSLFDIVADE V+ SIVQQDFLSVNSSHAE+QSNNPLPTPPTVD
Subjt:  TEEEHRMFLLGLQKLGKGDWRGIARNYVISRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVADEPVETSIVQQDFLSVNSSHAETQSNNPLPTPPTVD

Query:  EECESMDSTNSNDGEPPPPKADGSQCCYPVVYPAYVAPFFPFSIPFYSGYSAETSNKEGHEVLKPTAVHSKSPLNVDELIGMSKLSLGESIGHAGPSSLS
        EECESMDSTNSNDGE  P + DG QCCYPVVYPAYVAPFFPFSIPFYSGYSAET+NKE HEVLKPTAVHSKSPLNVDELIGMSKLSLGESIGHAGPSSLS
Subjt:  EECESMDSTNSNDGEPPPPKADGSQCCYPVVYPAYVAPFFPFSIPFYSGYSAETSNKEGHEVLKPTAVHSKSPLNVDELIGMSKLSLGESIGHAGPSSLS

Query:  LKLLEGSSRRSAFHANPASGSENMNSGGSPIHAV
        LKLLEGSSRRSAFHANPASGSENM+SGGSPIHAV
Subjt:  LKLLEGSSRRSAFHANPASGSENMNSGGSPIHAV

XP_008448819.1 PREDICTED: transcription factor MYB1R1 [Cucumis melo]3.5e-17694.91Show/hide
Query:  MTRRCSHCSHNGHNSRTCPNRGVKLFGVRLTDGSIRKSASMGNLSHYAGSGSGALQGGSNNPASPGETPEHGVAADGYASEDFVPGSSSSCRERKKGVPW
        MTRRCSHCSHNGHNSRTCPNR VKLFGVRLTDGSIRKSASMGNL+HYAGSGSGALQ GSNNPASPGETPEHGVAADGYASEDFVPGSSSSCRERKKGVPW
Subjt:  MTRRCSHCSHNGHNSRTCPNRGVKLFGVRLTDGSIRKSASMGNLSHYAGSGSGALQGGSNNPASPGETPEHGVAADGYASEDFVPGSSSSCRERKKGVPW

Query:  TEEEHRMFLLGLQKLGKGDWRGIARNYVISRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVADEPVETSIVQQDFLSVNSSHAETQSNNPLPTPPTVD
        TEEEHRMFLLGLQKLGKGDWRGIARNYV+SRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVADE V+ SIVQQDFLSVNSSHAE+QSNNPLPTPPTVD
Subjt:  TEEEHRMFLLGLQKLGKGDWRGIARNYVISRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVADEPVETSIVQQDFLSVNSSHAETQSNNPLPTPPTVD

Query:  EECESMDSTNSNDGEPPPPKADGSQCCYPVVYPAYVAPFFPFSIPFYSGYSAETSNKEGHEVLKPTAVHSKSPLNVDELIGMSKLSLGESIGHAGPSSLS
        EECESMDSTNSNDGE  P + DG QCCYPVVYPAYVAPFFPFSIPFYSGYSAET+NKE HEVLKPTAVHSKSPLNVDELIGMSKLSLGESIGHAGPSSLS
Subjt:  EECESMDSTNSNDGEPPPPKADGSQCCYPVVYPAYVAPFFPFSIPFYSGYSAETSNKEGHEVLKPTAVHSKSPLNVDELIGMSKLSLGESIGHAGPSSLS

Query:  LKLLEGSSRRSAFHANPASGSENMNSGGSPIHAV
        LKLLEGSSRRSAFHANPASGSENM+SGGSPIHAV
Subjt:  LKLLEGSSRRSAFHANPASGSENMNSGGSPIHAV

XP_011650388.1 transcription factor KUA1 [Cucumis sativus]2.5e-17494.33Show/hide
Query:  MTRRCSHCSHNGHNSRTCPNRGVKLFGVRLTDGSIRKSASMGNLSHYAGSGSGALQGGSNNPASPGETPEHGVAADGYASEDFVPGSSSSCRERKKGVPW
        MTRRCSHCSHNGHNSRTCPNR VKLFGVRLTDGSIRKSASMGNL+HYAGSGSGALQ GSNNPASPGETPEHGVAADGYASEDFVPGSSSSCRERKKGVPW
Subjt:  MTRRCSHCSHNGHNSRTCPNRGVKLFGVRLTDGSIRKSASMGNLSHYAGSGSGALQGGSNNPASPGETPEHGVAADGYASEDFVPGSSSSCRERKKGVPW

Query:  TEEEHRMFLLGLQKLGKGDWRGIARNYVISRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVADEPVETSIVQQDFLSVNSSHAETQSNNPLPTPP-TV
        TEEEHRMFLLGLQKLGKGDWRGIARNYV+SRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVADE VE SIVQQDFLS NSSHAE+QSNNPLPTPP TV
Subjt:  TEEEHRMFLLGLQKLGKGDWRGIARNYVISRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVADEPVETSIVQQDFLSVNSSHAETQSNNPLPTPP-TV

Query:  DEECESMDSTNSNDGEPPPPKADGSQCCYPVVYPAYVAPFFPFSIPFYSGYSAETSNKEGHEVLKPTAVHSKSPLNVDELIGMSKLSLGESIGHAGPSSL
        DEECESMDSTNSNDGE  P + DG QCCYPVVYPAYVAPFFPFSIPFYSGYSAET+NKE HEVLKPTAVHSKSPLNVDELIGMSKLSLGESIGH+GPSSL
Subjt:  DEECESMDSTNSNDGEPPPPKADGSQCCYPVVYPAYVAPFFPFSIPFYSGYSAETSNKEGHEVLKPTAVHSKSPLNVDELIGMSKLSLGESIGHAGPSSL

Query:  SLKLLEGSSRRSAFHANPASGSENMNSGGSPIHAV
        SLKLLEGSSRRSAFHANPASGSENM+SGGSPIHAV
Subjt:  SLKLLEGSSRRSAFHANPASGSENMNSGGSPIHAV

XP_022923376.1 transcription factor KUA1 [Cucurbita moschata]5.5e-17493.41Show/hide
Query:  MTRRCSHCSHNGHNSRTCPNRGVKLFGVRLTDGSIRKSASMGNLSHYAGSGSGALQGGSNNPASPGETPEHGVAADGYASEDFVPGSSSSCRERKKGVPW
        MTRRCSHCSHNGHNSRTCPNRGVKLFGVRLTDGSIRKSASMGNLSHYAGSGSG LQGGSNNPASPGETPEHGVAADGYASEDFVPGSSSSCRERKKGVPW
Subjt:  MTRRCSHCSHNGHNSRTCPNRGVKLFGVRLTDGSIRKSASMGNLSHYAGSGSGALQGGSNNPASPGETPEHGVAADGYASEDFVPGSSSSCRERKKGVPW

Query:  TEEEHRMFLLGLQKLGKGDWRGIARNYVISRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVADEPVETSIVQQDFLSVNSSHAETQSNNPLPTPPTVD
        TEEEHRMFL+GLQKLGKGDWRGIARNYV+SRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVA E VE SIVQQDFLSVN+SHAE+QSNNPLPTPPTVD
Subjt:  TEEEHRMFLLGLQKLGKGDWRGIARNYVISRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVADEPVETSIVQQDFLSVNSSHAETQSNNPLPTPPTVD

Query:  EECESMDSTNSNDGEPPPPKADGSQCCYPVVYPAYVAPFFPFSIPFYSGYSAETSNKEGHEVLKPTAVHSKSPLNVDELIGMSKLSLGESIGHAGPSSLS
        EECESMDSTNSNDGEP P + +  QCCYPVVYPAYV PFFPFSIPFYSGYSAE +NKE HEVLKPTAVHSKSPLNVDEL+GMSKLSLGESIGHAGPSSLS
Subjt:  EECESMDSTNSNDGEPPPPKADGSQCCYPVVYPAYVAPFFPFSIPFYSGYSAETSNKEGHEVLKPTAVHSKSPLNVDELIGMSKLSLGESIGHAGPSSLS

Query:  LKLLEGSSRRSAFHANPASGSENMNSGGSPIHAV
        LKLLEGSSRRSAFHANPASGSENM+SGGSPI AV
Subjt:  LKLLEGSSRRSAFHANPASGSENMNSGGSPIHAV

XP_038904873.1 transcription factor KUA1 [Benincasa hispida]4.5e-17694.91Show/hide
Query:  MTRRCSHCSHNGHNSRTCPNRGVKLFGVRLTDGSIRKSASMGNLSHYAGSGSGALQGGSNNPASPGETPEHGVAADGYASEDFVPGSSSSCRERKKGVPW
        MTRRCSHCSHNGHNSRTCPNRGVKLFGVRLTDGSIRKSASMGNL+HYAGSGSGALQ GSNNPASPGETPEHG AADGYASEDFVPGSSSSCRERKKGVPW
Subjt:  MTRRCSHCSHNGHNSRTCPNRGVKLFGVRLTDGSIRKSASMGNLSHYAGSGSGALQGGSNNPASPGETPEHGVAADGYASEDFVPGSSSSCRERKKGVPW

Query:  TEEEHRMFLLGLQKLGKGDWRGIARNYVISRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVADEPVETSIVQQDFLSVNSSHAETQSNNPLPTPPTVD
        TEEEHRMFLLGLQKLGKGDWRGIARNYV+SRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVADE VE SIVQQDFLSVNSSHAE+QSNNPLPTPPTVD
Subjt:  TEEEHRMFLLGLQKLGKGDWRGIARNYVISRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVADEPVETSIVQQDFLSVNSSHAETQSNNPLPTPPTVD

Query:  EECESMDSTNSNDGEPPPPKADGSQCCYPVVYPAYVAPFFPFSIPFYSGYSAETSNKEGHEVLKPTAVHSKSPLNVDELIGMSKLSLGESIGHAGPSSLS
        EECESMDSTNSNDGE  P + D  QCCYPVVYPAYVAPFFPFSIPFYSGYSAET+NKE HEVLKPTAVHSKSPLNVDELIGMSKLSLGESIGHAGPSSLS
Subjt:  EECESMDSTNSNDGEPPPPKADGSQCCYPVVYPAYVAPFFPFSIPFYSGYSAETSNKEGHEVLKPTAVHSKSPLNVDELIGMSKLSLGESIGHAGPSSLS

Query:  LKLLEGSSRRSAFHANPASGSENMNSGGSPIHAV
        LKLLEGSSRRSAFHANPASGSENM+SGGSPIHAV
Subjt:  LKLLEGSSRRSAFHANPASGSENMNSGGSPIHAV

TrEMBL top hitse value%identityAlignment
A0A0A0L263 HTH myb-type domain-containing protein1.2e-17494.33Show/hide
Query:  MTRRCSHCSHNGHNSRTCPNRGVKLFGVRLTDGSIRKSASMGNLSHYAGSGSGALQGGSNNPASPGETPEHGVAADGYASEDFVPGSSSSCRERKKGVPW
        MTRRCSHCSHNGHNSRTCPNR VKLFGVRLTDGSIRKSASMGNL+HYAGSGSGALQ GSNNPASPGETPEHGVAADGYASEDFVPGSSSSCRERKKGVPW
Subjt:  MTRRCSHCSHNGHNSRTCPNRGVKLFGVRLTDGSIRKSASMGNLSHYAGSGSGALQGGSNNPASPGETPEHGVAADGYASEDFVPGSSSSCRERKKGVPW

Query:  TEEEHRMFLLGLQKLGKGDWRGIARNYVISRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVADEPVETSIVQQDFLSVNSSHAETQSNNPLPTPP-TV
        TEEEHRMFLLGLQKLGKGDWRGIARNYV+SRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVADE VE SIVQQDFLS NSSHAE+QSNNPLPTPP TV
Subjt:  TEEEHRMFLLGLQKLGKGDWRGIARNYVISRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVADEPVETSIVQQDFLSVNSSHAETQSNNPLPTPP-TV

Query:  DEECESMDSTNSNDGEPPPPKADGSQCCYPVVYPAYVAPFFPFSIPFYSGYSAETSNKEGHEVLKPTAVHSKSPLNVDELIGMSKLSLGESIGHAGPSSL
        DEECESMDSTNSNDGE  P + DG QCCYPVVYPAYVAPFFPFSIPFYSGYSAET+NKE HEVLKPTAVHSKSPLNVDELIGMSKLSLGESIGH+GPSSL
Subjt:  DEECESMDSTNSNDGEPPPPKADGSQCCYPVVYPAYVAPFFPFSIPFYSGYSAETSNKEGHEVLKPTAVHSKSPLNVDELIGMSKLSLGESIGHAGPSSL

Query:  SLKLLEGSSRRSAFHANPASGSENMNSGGSPIHAV
        SLKLLEGSSRRSAFHANPASGSENM+SGGSPIHAV
Subjt:  SLKLLEGSSRRSAFHANPASGSENMNSGGSPIHAV

A0A1S3BLH5 transcription factor MYB1R11.7e-17694.91Show/hide
Query:  MTRRCSHCSHNGHNSRTCPNRGVKLFGVRLTDGSIRKSASMGNLSHYAGSGSGALQGGSNNPASPGETPEHGVAADGYASEDFVPGSSSSCRERKKGVPW
        MTRRCSHCSHNGHNSRTCPNR VKLFGVRLTDGSIRKSASMGNL+HYAGSGSGALQ GSNNPASPGETPEHGVAADGYASEDFVPGSSSSCRERKKGVPW
Subjt:  MTRRCSHCSHNGHNSRTCPNRGVKLFGVRLTDGSIRKSASMGNLSHYAGSGSGALQGGSNNPASPGETPEHGVAADGYASEDFVPGSSSSCRERKKGVPW

Query:  TEEEHRMFLLGLQKLGKGDWRGIARNYVISRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVADEPVETSIVQQDFLSVNSSHAETQSNNPLPTPPTVD
        TEEEHRMFLLGLQKLGKGDWRGIARNYV+SRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVADE V+ SIVQQDFLSVNSSHAE+QSNNPLPTPPTVD
Subjt:  TEEEHRMFLLGLQKLGKGDWRGIARNYVISRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVADEPVETSIVQQDFLSVNSSHAETQSNNPLPTPPTVD

Query:  EECESMDSTNSNDGEPPPPKADGSQCCYPVVYPAYVAPFFPFSIPFYSGYSAETSNKEGHEVLKPTAVHSKSPLNVDELIGMSKLSLGESIGHAGPSSLS
        EECESMDSTNSNDGE  P + DG QCCYPVVYPAYVAPFFPFSIPFYSGYSAET+NKE HEVLKPTAVHSKSPLNVDELIGMSKLSLGESIGHAGPSSLS
Subjt:  EECESMDSTNSNDGEPPPPKADGSQCCYPVVYPAYVAPFFPFSIPFYSGYSAETSNKEGHEVLKPTAVHSKSPLNVDELIGMSKLSLGESIGHAGPSSLS

Query:  LKLLEGSSRRSAFHANPASGSENMNSGGSPIHAV
        LKLLEGSSRRSAFHANPASGSENM+SGGSPIHAV
Subjt:  LKLLEGSSRRSAFHANPASGSENMNSGGSPIHAV

A0A5A7TC81 Transcription factor MYB1R13.7e-17694.61Show/hide
Query:  MTRRCSHCSHNGHNSRTCPNRGVKLFGVRLTDGSIRKSASMGNLSHYAGSGSGALQGGSNNPASPGETPEHGVAADGYASEDFVPGSSSSCRERKKGVPW
        MTRRCSHCSHNGHNSRTCPNR VKLFGVRLTDGSIRKSASMGNL+HYAGSGSGALQ GSNNPASPGETPEHGVAADGYASEDFVPGSSSSCRERKKGVPW
Subjt:  MTRRCSHCSHNGHNSRTCPNRGVKLFGVRLTDGSIRKSASMGNLSHYAGSGSGALQGGSNNPASPGETPEHGVAADGYASEDFVPGSSSSCRERKKGVPW

Query:  TEEEHRMFLLGLQKLGKGDWRGIARNYVISRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVADEPVETSIVQQDFLSVNSSHAETQSNNPLPTPPTVD
        TEEEHRMFLLGLQKLGKGDWRGIARNYV+SRTPTQVASHAQKYFIRQTNVSRR+RRSSLFDIVADE V+ SIVQQDFLSVNSSHAE+QSNNPLPTPPTVD
Subjt:  TEEEHRMFLLGLQKLGKGDWRGIARNYVISRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVADEPVETSIVQQDFLSVNSSHAETQSNNPLPTPPTVD

Query:  EECESMDSTNSNDGEPPPPKADGSQCCYPVVYPAYVAPFFPFSIPFYSGYSAETSNKEGHEVLKPTAVHSKSPLNVDELIGMSKLSLGESIGHAGPSSLS
        EECESMDSTNSNDGE  P + DG QCCYPVVYPAYVAPFFPFSIPFYSGYSAET+NKE HEVLKPTAVHSKSPLNVDELIGMSKLSLGESIGHAGPSSLS
Subjt:  EECESMDSTNSNDGEPPPPKADGSQCCYPVVYPAYVAPFFPFSIPFYSGYSAETSNKEGHEVLKPTAVHSKSPLNVDELIGMSKLSLGESIGHAGPSSLS

Query:  LKLLEGSSRRSAFHANPASGSENMNSGGSPIHAV
        LKLLEGSSRRSAFHANPASGSENM+SGGSPIHAV
Subjt:  LKLLEGSSRRSAFHANPASGSENMNSGGSPIHAV

A0A6J1E6M7 transcription factor KUA12.7e-17493.41Show/hide
Query:  MTRRCSHCSHNGHNSRTCPNRGVKLFGVRLTDGSIRKSASMGNLSHYAGSGSGALQGGSNNPASPGETPEHGVAADGYASEDFVPGSSSSCRERKKGVPW
        MTRRCSHCSHNGHNSRTCPNRGVKLFGVRLTDGSIRKSASMGNLSHYAGSGSG LQGGSNNPASPGETPEHGVAADGYASEDFVPGSSSSCRERKKGVPW
Subjt:  MTRRCSHCSHNGHNSRTCPNRGVKLFGVRLTDGSIRKSASMGNLSHYAGSGSGALQGGSNNPASPGETPEHGVAADGYASEDFVPGSSSSCRERKKGVPW

Query:  TEEEHRMFLLGLQKLGKGDWRGIARNYVISRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVADEPVETSIVQQDFLSVNSSHAETQSNNPLPTPPTVD
        TEEEHRMFL+GLQKLGKGDWRGIARNYV+SRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVA E VE SIVQQDFLSVN+SHAE+QSNNPLPTPPTVD
Subjt:  TEEEHRMFLLGLQKLGKGDWRGIARNYVISRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVADEPVETSIVQQDFLSVNSSHAETQSNNPLPTPPTVD

Query:  EECESMDSTNSNDGEPPPPKADGSQCCYPVVYPAYVAPFFPFSIPFYSGYSAETSNKEGHEVLKPTAVHSKSPLNVDELIGMSKLSLGESIGHAGPSSLS
        EECESMDSTNSNDGEP P + +  QCCYPVVYPAYV PFFPFSIPFYSGYSAE +NKE HEVLKPTAVHSKSPLNVDEL+GMSKLSLGESIGHAGPSSLS
Subjt:  EECESMDSTNSNDGEPPPPKADGSQCCYPVVYPAYVAPFFPFSIPFYSGYSAETSNKEGHEVLKPTAVHSKSPLNVDELIGMSKLSLGESIGHAGPSSLS

Query:  LKLLEGSSRRSAFHANPASGSENMNSGGSPIHAV
        LKLLEGSSRRSAFHANPASGSENM+SGGSPI AV
Subjt:  LKLLEGSSRRSAFHANPASGSENMNSGGSPIHAV

A0A6J1HMV4 transcription factor MYBS36.0e-17493.11Show/hide
Query:  MTRRCSHCSHNGHNSRTCPNRGVKLFGVRLTDGSIRKSASMGNLSHYAGSGSGALQGGSNNPASPGETPEHGVAADGYASEDFVPGSSSSCRERKKGVPW
        MTRRCSHCSHNGHNSRTCPNRGVKLFGVRLTDGSIRKSASMGNLSHYAGSGSGALQGGSNNPASPGETPEHGVAADGYASEDFVPGSSSSCRERKKGVPW
Subjt:  MTRRCSHCSHNGHNSRTCPNRGVKLFGVRLTDGSIRKSASMGNLSHYAGSGSGALQGGSNNPASPGETPEHGVAADGYASEDFVPGSSSSCRERKKGVPW

Query:  TEEEHRMFLLGLQKLGKGDWRGIARNYVISRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVADEPVETSIVQQDFLSVNSSHAETQSNNPLPTPPTVD
        TEEEHRMFL+GLQKLGKGDWRGIARNYV+SRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVA E VE SIVQQDFLSVN+SHAE+QSNNPLPTPPTVD
Subjt:  TEEEHRMFLLGLQKLGKGDWRGIARNYVISRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVADEPVETSIVQQDFLSVNSSHAETQSNNPLPTPPTVD

Query:  EECESMDSTNSNDGEPPPPKADGSQCCYPVVYPAYVAPFFPFSIPFYSGYSAETSNKEGHEVLKPTAVHSKSPLNVDELIGMSKLSLGESIGHAGPSSLS
        EECESMDSTNSNDGEP P + +  QCCYPVVYPAYV PFFPFSIPFYSGYSAE++NKE HEVLKPTAVH KSPLNVDEL+GMSKLSLGESIGHAGPSSLS
Subjt:  EECESMDSTNSNDGEPPPPKADGSQCCYPVVYPAYVAPFFPFSIPFYSGYSAETSNKEGHEVLKPTAVHSKSPLNVDELIGMSKLSLGESIGHAGPSSLS

Query:  LKLLEGSSRRSAFHANPASGSENMNSGGSPIHAV
        LKLL+GSSRRSAFHANPASGSENM+SGGSPI AV
Subjt:  LKLLEGSSRRSAFHANPASGSENMNSGGSPIHAV

SwissProt top hitse value%identityAlignment
Q2V9B0 Transcription factor MYB1R12.7e-3852.17Show/hide
Query:  VKLFGVRLTDGSIRKSASMGNLSHYA-GSGSGALQGGSNNPASPGETPEHGVAAD-GYAS-EDFVPGSSSSCRERKKGVPWTEEEHRMFLLGLQKLGKGD
        + LFGVR+    +RKS S+ +LS Y   + +    GG NN +S        VA D GYAS +D V   S+S RERK+GVPWTEEEH++FLLGLQK+GKGD
Subjt:  VKLFGVRLTDGSIRKSASMGNLSHYA-GSGSGALQGGSNNPASPGETPEHGVAAD-GYAS-EDFVPGSSSSCRERKKGVPWTEEEHRMFLLGLQKLGKGD

Query:  WRGIARNYVISRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVADE----PVETSIVQQDFLSVNSSHAETQSNNPLPTPPTV
        WRGI+RN+V +RTPTQVASHAQKYF+R++N++RR+RRSSLFDI  D     P+E    +Q+   V  +   T   N  P  PTV
Subjt:  WRGIARNYVISRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVADE----PVETSIVQQDFLSVNSSHAETQSNNPLPTPPTV

Q4W946 2-oxoglutarate-Fe(II) type oxidoreductase2.1e-3834.05Show/hide
Query:  PIIDLS------SQDRISMALSIREACLDYGFFYLVNHGVEDELLERVFDESRKFFSLPIEEKTKLSRKE---HRGYTPLYVEKLDPTALSSRGDAKEVF
        PI+D S      SQ +  +   +RE CL+ GFF +  H V  EL  R FD +++FF LP+ EK K+ R     +RGY         P    S  D KE  
Subjt:  PIIDLS------SQDRISMALSIREACLDYGFFYLVNHGVEDELLERVFDESRKFFSLPIEEKTKLSRKE---HRGYTPLYVEKLDPTALSSRGDAKEVF

Query:  DIGA-LAGN---ATENDLNQWPSRELLPSWRSTMESFSKQAM-------LAGRQIASLIAMALNLDETFFEKIGALDNPMAYLRLLHYPGDLRSPNEELC
         +G  LA +     +  LN  P+R   P     +E F   +M          + + +++A+ ++ +ETFF+ +   +  +A LR LHYP       E   
Subjt:  DIGA-LAGN---ATENDLNQWPSRELLPSWRSTMESFSKQAM-------LAGRQIASLIAMALNLDETFFEKIGALDNPMAYLRLLHYPGDLRSPNEELC

Query:  GASAHSDYGMITLLVTNGVPGLQVCKEKFNQPRVWEDVLHIDKAFIVNAGDMLERWTNCLFRSTLHRVI-PVGEERYSVVFFLDPNEDCIVECLQSCCSE
        G  AH DY  ITLL+ +G  GLQV  E   Q   W DV  +  A+IVN  ++  R TN  ++S LHRV+   G ERYS+ FF   N D + ECL     E
Subjt:  GASAHSDYGMITLLVTNGVPGLQVCKEKFNQPRVWEDVLHIDKAFIVNAGDMLERWTNCLFRSTLHRVI-PVGEERYSVVFFLDPNEDCIVECLQSCCSE

Query:  SSPPRYPPIRSGDYLKEQLSSFSEAA
          P R+PP    + + E +    E A
Subjt:  SSPPRYPPIRSGDYLKEQLSSFSEAA

Q7XC57 Transcription factor MYBS34.4e-9760.98Show/hide
Query:  MTRRCSHCSHNGHNSRTCPNRGVKLFGVRLTDGSIRKSASMGNLSHYAGSGSGALQGGSNNPASPGETPEHG-VAADGYASEDFVPGSSSSCRERKKGVP
        MTRRCSHCSHNGHNSRTCPNRGVK+FGVRLTDGSIRKSASMGNLS    S +G+  GG    ASP + P+    AADGYAS+DFV GSSS+ R+RKKGVP
Subjt:  MTRRCSHCSHNGHNSRTCPNRGVKLFGVRLTDGSIRKSASMGNLSHYAGSGSGALQGGSNNPASPGETPEHG-VAADGYASEDFVPGSSSSCRERKKGVP

Query:  WTEEEHRMFLLGLQKLGKGDWRGIARNYVISRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVADEPVETSIVQQDFLSVNSSHAETQ-SNNPLPTPPT
        WTEEEHR FLLGLQKLGKGDWRGI+RN+V+SRTPTQVASHAQKYFIRQ+N++RRKRRSSLFD+V DE ++   +            ETQ  N P   PP 
Subjt:  WTEEEHRMFLLGLQKLGKGDWRGIARNYVISRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVADEPVETSIVQQDFLSVNSSHAETQ-SNNPLPTPPT

Query:  VDEECESMDSTNSNDGEPPPPKA---DGSQCCYPVVYPAYVAPFFPFSIPFYSGYSAETSN-KEGHEVLKPTAVHSKSPLNVDELIGMSKLSLGESIGHA
         +EE +SM+S  S   E     A   D  Q  YPV+ PAY +PF  FS+PF+     E    +E HE++KP  VHSKSP+NVDEL+GMSKLS+GES    
Subjt:  VDEECESMDSTNSNDGEPPPPKA---DGSQCCYPVVYPAYVAPFFPFSIPFYSGYSAETSN-KEGHEVLKPTAVHSKSPLNVDELIGMSKLSLGESIGHA

Query:  GPSSLSLKLLEGSSRRSAFHANPASGSE
          +SLSL L+ G +R+SAFHANP + ++
Subjt:  GPSSLSLKLLEGSSRRSAFHANPASGSE

Q9FKF9 Probable transcription factor At5g616207.6e-3335.56Show/hide
Query:  MTRRCSHCSHNGHNSRTCPN----RGVKLFGVRLTDG--------SIRKSASMGNLSHYAGSGSGALQGGSNNPASPGETPEHGVAADGYASEDFVPG-S
        + + CSHC HNGHN+RTC N      VKLFGV ++          ++RKS S+GNL     +       GS +P +        V   GY S+  +    
Subjt:  MTRRCSHCSHNGHNSRTCPN----RGVKLFGVRLTDG--------SIRKSASMGNLSHYAGSGSGALQGGSNNPASPGETPEHGVAADGYASEDFVPG-S

Query:  SSSCRERKKGVPWTEEEHRMFLLGLQKLGKGDWRGIARNYVISRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVADEPVETSIVQQDFLSVNSSHAET
          +  E+KKG PWTEEEHR FL+GL KLGKGDWRGIA+++V +RTPTQVASHAQKYFIR     +RKRR+SLFDI  ++  E     QD      +  +T
Subjt:  SSSCRERKKGVPWTEEEHRMFLLGLQKLGKGDWRGIARNYVISRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVADEPVETSIVQQDFLSVNSSHAET

Query:  QSNNPLP----------TPPTVDEECESMDSTNSNDGEPPPPKADGSQCCYPVVYPAYVA-PFFPFSIPFYSGYSAETSNKEGHEVLKPT-AVHSKSPLN
            P+           T   +    +++        +P PP  +     Y   YP Y A P  P      SG         G  + +P+ A +  +   
Subjt:  QSNNPLP----------TPPTVDEECESMDSTNSNDGEPPPPKADGSQCCYPVVYPAYVA-PFFPFSIPFYSGYSAETSNKEGHEVLKPT-AVHSKSPLN

Query:  VDELIGMSKLSLGES
        +D  IG+   + G S
Subjt:  VDELIGMSKLSLGES

Q9LVS0 Transcription factor KUA13.6e-10761.76Show/hide
Query:  MTRRCSHCSHNGHNSRTCPNRGVKLFGVRLTDGSIRKSASMGNLSHYAGSGSGALQGGSNNPASPGETPEHGVAADGYASEDFVPGSSSSCRERKKGVPW
        MTRRCSHC+HNGHNSRTCPNRGVKLFGVRLT+GSIRKSASMGNLSHY GSGSG    GSN P SPG+ P+H VA DGYASEDFV GSSSS RERKKG PW
Subjt:  MTRRCSHCSHNGHNSRTCPNRGVKLFGVRLTDGSIRKSASMGNLSHYAGSGSGALQGGSNNPASPGETPEHGVAADGYASEDFVPGSSSSCRERKKGVPW

Query:  TEEEHRMFLLGLQKLGKGDWRGIARNYVISRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVAD----------EPVETSI-VQQDFLSVNSSHAETQS
        TEEEHRMFLLGLQKLGKGDWRGI+RNYV +RTPTQVASHAQKYFIRQ+NVSRRKRRSSLFD+V D          EP E +I V+ +    +S H +T +
Subjt:  TEEEHRMFLLGLQKLGKGDWRGIARNYVISRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVAD----------EPVETSI-VQQDFLSVNSSHAETQS

Query:  NNPLPTPPTVD-EECESMDSTNSNDGEP------------------------PPPKADGSQCCYPVVYPAYVAPFFPFSIPFY-SGYSAETSNK-EGHEV
         + L  P  ++ EECESMDSTNS  GEP                        P P+  GS   +P++YP Y +P++PF  P + +GY  E   K E HE+
Subjt:  NNPLPTPPTVD-EECESMDSTNSNDGEP------------------------PPPKADGSQCCYPVVYPAYVAPFFPFSIPFY-SGYSAETSNK-EGHEV

Query:  LKPTAVHSKSPLNVDELIGMSKLSLGESIGHA-GPSSLSLKLLEG-SSRRSAFHANPASGSENMNSGGSPIHAV
        L+PTAVHSK+P+NVDEL+GMSKLSL ES  H     SLSLKL  G SSR+SAFH NP+S S ++    S IHA+
Subjt:  LKPTAVHSKSPLNVDELIGMSKLSLGESIGHA-GPSSLSLKLLEG-SSRRSAFHANPASGSENMNSGGSPIHAV

Arabidopsis top hitse value%identityAlignment
AT1G35190.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein5.0e-8047.54Show/hide
Query:  LPIIDLSSQDRISMALSIREACLDYGFFYLVNHGVEDELLERVFDESRKFFSLPIEEKTKLSRKE-HRGYTPLYVEKLDPTALSSRGDAKEVFDIGALAG
        L  IDL++ D     +S+++ACLD GFFY++NHG+ +E ++ VF++S+K F+LP+EEK K+ R E HRGYTP+  E LDP      GD KE + IG    
Subjt:  LPIIDLSSQDRISMALSIREACLDYGFFYLVNHGVEDELLERVFDESRKFFSLPIEEKTKLSRKE-HRGYTPLYVEKLDPTALSSRGDAKEVFDIGALAG

Query:  NATEN------DLNQWPSRELLPSWRSTMESFSKQAMLAGRQIASLIAMALNLDETFFEKIGALDNPMAYLRLLHYPGDLRSPNEELCGASAHSDYGMIT
            +        N WP  ++LP WR TME + ++A+     IA L+A+AL+LD  +F++   L  P+A +RLL Y G +  P++ +    AHSD+GM+T
Subjt:  NATEN------DLNQWPSRELLPSWRSTMESFSKQAMLAGRQIASLIAMALNLDETFFEKIGALDNPMAYLRLLHYPGDLRSPNEELCGASAHSDYGMIT

Query:  LLVTNGVPGLQVCKEKFNQPRVWEDVLHIDKAFIVNAGDMLERWTNCLFRSTLHRVIPVGEERYSVVFFLDPNEDCIVECLQSCCSESSPPRYPPIRSGD
        LL T+GV GLQ+CK+K   P+ WE V  I  AFIVN GDMLERW+N  F+STLHRV+  G+ERYS+ FF++PN DC+VECL +C SES  P+YPPI+   
Subjt:  LLVTNGVPGLQVCKEKFNQPRVWEDVLHIDKAFIVNAGDMLERWTNCLFRSTLHRVIPVGEERYSVVFFLDPNEDCIVECLQSCCSESSPPRYPPIRSGD

Query:  YLKEQ
        YL ++
Subjt:  YLKEQ

AT3G46490.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein8.2e-7544.59Show/hide
Query:  LPIIDLSSQDRISMALSIREACLDYGFFYLVNHGVEDELLERVFDESRKFFSLPIEEKTKLSRKE-HRGYTPLYVEKLDPTALSSRGDAKEVFDIGALAG
        L  IDL + D    A+ +++ACLD GFFY++NHG+ +EL +  F+ S+KFF+LP+EEK K+ R E +RGY P +   LDP     RGD KE F IG    
Subjt:  LPIIDLSSQDRISMALSIREACLDYGFFYLVNHGVEDELLERVFDESRKFFSLPIEEKTKLSRKE-HRGYTPLYVEKLDPTALSSRGDAKEVFDIGALAG

Query:  ------NATENDLNQWPSRELLPSWRSTMESFSKQAMLAGRQIASLIAMALNLDETFFEKIGALDNPMAYLRLLHYPGDLRSPNEELCGASAHSDYGMIT
              +   +  N WP+ ++LP WR TME + ++A+   + IA ++A+AL+LD  +F     L NP+A + L HY G    P++ +    AHSD+GM++
Subjt:  ------NATENDLNQWPSRELLPSWRSTMESFSKQAMLAGRQIASLIAMALNLDETFFEKIGALDNPMAYLRLLHYPGDLRSPNEELCGASAHSDYGMIT

Query:  LLVTNGVPGLQVCKEKFNQPRVWEDVLHIDKAFIVNAGDMLERWTNCLFRSTLHRVIPVGEERYSVVFFLDPNEDCIVECLQSCCSESSPPRYPPIRSGD
        LL T+GV GLQ+CK+K  +P+ WE    I  A+IVN GD+LERW+N  F+STLHRV+  G++RYS+ FFL P+ DCI+ECL +C SE++ P+YP I+   
Subjt:  LLVTNGVPGLQVCKEKFNQPRVWEDVLHIDKAFIVNAGDMLERWTNCLFRSTLHRVIPVGEERYSVVFFLDPNEDCIVECLQSCCSESSPPRYPPIRSGD

Query:  YLKEQ
        Y+ ++
Subjt:  YLKEQ

AT4G16765.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.5e-8964.29Show/hide
Query:  LSRKEHRGYTPLYVEKLDPTALSSRGDAKEVFDIGALAGNATENDLNQWPSRELLPSWRSTMESFSKQAMLAGRQIASLIAMALNLDETFFEKIGALDNP
        L R++  GYTPLY EKLDP +LSS GD+KE F  G+L G   +   NQWPS  +LPSWR TME++ K  +  GR++  LIA+AL+LDE FFEK+GAL++P
Subjt:  LSRKEHRGYTPLYVEKLDPTALSSRGDAKEVFDIGALAGNATENDLNQWPSRELLPSWRSTMESFSKQAMLAGRQIASLIAMALNLDETFFEKIGALDNP

Query:  MAYLRLLHYPGDLRSPNEELCGASAHSDYGMITLLVTNGVPGLQVCKEKFNQPRVWEDVLHIDKAFIVNAGDMLERWTNCLFRSTLHRVIPVGEERYSVV
         A +RLL YPG++ S + E  GASAHSDYGM+TLL+T+GVPGLQVC++K  QP +WEDV  I  AFIVN GDM+ERWTN LFRSTLHRV+PVG+ERYSVV
Subjt:  MAYLRLLHYPGDLRSPNEELCGASAHSDYGMITLLVTNGVPGLQVCKEKFNQPRVWEDVLHIDKAFIVNAGDMLERWTNCLFRSTLHRVIPVGEERYSVV

Query:  FFLDPNEDCIVECLQSCCSESSPPRYPPIRSGDYLKEQ
        FFLDPN DC V+CL+SCCSE+ PPR+PPI +GDY+KE+
Subjt:  FFLDPNEDCIVECLQSCCSESSPPRYPPIRSGDYLKEQ

AT4G16770.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein3.5e-10255.24Show/hide
Query:  MKAALQLPIIDLSSQDRISMALSIREACLDYGFFYLVNHGVEDELLERVFDESRKFFSLPIEEKTKLSRKEHRGYTPLYVEKLDPTALSSRGDAKEVFDI
        M  AL+LPIIDLSS +++S +  IR+ACLD+GFFYL NHGV +EL+E V  ES+K FSLP++EK  ++R   RGY+PLY EKL+ ++ +S GD+KE+F  
Subjt:  MKAALQLPIIDLSSQDRISMALSIREACLDYGFFYLVNHGVEDELLERVFDESRKFFSLPIEEKTKLSRKEHRGYTPLYVEKLDPTALSSRGDAKEVFDI

Query:  GALAGNATENDLNQWPSRELLPSWRSTMESFSKQAMLAGRQIASLIAMALNLDETFFEKIGALDNPMAYLRLLHYPGDLRSPNEELCGASAHSDYGMITL
        G+  G   +   N+WP  ELLP WR TME + K  M  G+++  L+A+ALNL+E +FE++GA ++  A +RLL Y G+  S  EE CGASAHSD+GMITL
Subjt:  GALAGNATENDLNQWPSRELLPSWRSTMESFSKQAMLAGRQIASLIAMALNLDETFFEKIGALDNPMAYLRLLHYPGDLRSPNEELCGASAHSDYGMITL

Query:  LVTNGVPGLQVCKEKFNQPRVWEDVLHIDKAFIVNAGDMLERWTNCLFRSTLHRVIPVGEERYSVVFFLDPNEDCIVECLQSCCSESSPPRYPPIRSGDY
        L T+GV GLQVC++K  +P+VWEDV  I   F+VN GD++ERWTN LFRSTLHRV+ VG+ER+SV  F+DP+ +C+VECL+SCCSE+SPP++PP+R+ DY
Subjt:  LVTNGVPGLQVCKEKFNQPRVWEDVLHIDKAFIVNAGDMLERWTNCLFRSTLHRVIPVGEERYSVVFFLDPNEDCIVECLQSCCSESSPPRYPPIRSGDY

Query:  LKEQLSSFSEAAAYS
          E+ S     A+YS
Subjt:  LKEQLSSFSEAAAYS

AT5G47390.1 myb-like transcription factor family protein2.5e-10861.76Show/hide
Query:  MTRRCSHCSHNGHNSRTCPNRGVKLFGVRLTDGSIRKSASMGNLSHYAGSGSGALQGGSNNPASPGETPEHGVAADGYASEDFVPGSSSSCRERKKGVPW
        MTRRCSHC+HNGHNSRTCPNRGVKLFGVRLT+GSIRKSASMGNLSHY GSGSG    GSN P SPG+ P+H VA DGYASEDFV GSSSS RERKKG PW
Subjt:  MTRRCSHCSHNGHNSRTCPNRGVKLFGVRLTDGSIRKSASMGNLSHYAGSGSGALQGGSNNPASPGETPEHGVAADGYASEDFVPGSSSSCRERKKGVPW

Query:  TEEEHRMFLLGLQKLGKGDWRGIARNYVISRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVAD----------EPVETSI-VQQDFLSVNSSHAETQS
        TEEEHRMFLLGLQKLGKGDWRGI+RNYV +RTPTQVASHAQKYFIRQ+NVSRRKRRSSLFD+V D          EP E +I V+ +    +S H +T +
Subjt:  TEEEHRMFLLGLQKLGKGDWRGIARNYVISRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVAD----------EPVETSI-VQQDFLSVNSSHAETQS

Query:  NNPLPTPPTVD-EECESMDSTNSNDGEP------------------------PPPKADGSQCCYPVVYPAYVAPFFPFSIPFY-SGYSAETSNK-EGHEV
         + L  P  ++ EECESMDSTNS  GEP                        P P+  GS   +P++YP Y +P++PF  P + +GY  E   K E HE+
Subjt:  NNPLPTPPTVD-EECESMDSTNSNDGEP------------------------PPPKADGSQCCYPVVYPAYVAPFFPFSIPFY-SGYSAETSNK-EGHEV

Query:  LKPTAVHSKSPLNVDELIGMSKLSLGESIGHA-GPSSLSLKLLEG-SSRRSAFHANPASGSENMNSGGSPIHAV
        L+PTAVHSK+P+NVDEL+GMSKLSL ES  H     SLSLKL  G SSR+SAFH NP+S S ++    S IHA+
Subjt:  LKPTAVHSKSPLNVDELIGMSKLSLGESIGHA-GPSSLSLKLLEG-SSRRSAFHANPASGSENMNSGGSPIHAV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGCTGCTCTGCAACTCCCGATCATCGATCTCTCCTCTCAAGACCGCATCTCCATGGCTCTCTCAATCCGCGAGGCGTGCTTGGATTACGGATTCTTTTACCTGGT
AAATCATGGCGTAGAAGATGAGTTGCTGGAGAGAGTGTTTGATGAAAGCAGGAAGTTCTTCTCTCTGCCTATCGAGGAGAAGACGAAACTTTCTCGGAAGGAGCATCGGG
GTTACACCCCTCTTTATGTTGAGAAGCTCGATCCGACCGCCTTGAGTTCTAGAGGTGATGCAAAAGAGGTGTTTGATATTGGCGCTTTGGCAGGAAATGCAACCGAGAAT
GACTTGAATCAGTGGCCTTCAAGAGAGCTGCTGCCTTCTTGGAGATCTACAATGGAATCCTTCTCTAAGCAAGCAATGCTTGCTGGAAGACAAATAGCATCTTTGATTGC
CATGGCTTTGAATTTGGATGAAACCTTCTTTGAGAAAATTGGTGCCTTGGATAATCCAATGGCATATCTTCGGCTTCTACATTATCCAGGTGACTTGAGATCTCCTAATG
AAGAACTATGTGGTGCTTCTGCACATTCAGATTATGGGATGATCACTCTCCTTGTTACCAATGGGGTTCCCGGACTTCAGGTATGTAAGGAAAAGTTCAATCAACCACGA
GTTTGGGAAGATGTTCTCCATATAGACAAGGCATTCATTGTCAATGCTGGGGACATGTTGGAAAGATGGACTAACTGTTTGTTCCGGTCAACTCTACATCGAGTGATTCC
AGTTGGAGAAGAACGTTATTCGGTGGTTTTCTTCTTAGATCCCAATGAAGATTGTATTGTGGAATGTCTTCAAAGTTGTTGTAGTGAGTCATCTCCTCCGAGATATCCTC
CAATTCGCAGTGGAGATTACTTGAAAGAACAGCTGAGTTCGTTTTCTGAAGCGGCGGCGTATTCCGTTCGATCCGGCGCCGGAAGAATGACTCGGAGGTGCTCGCATTGC
AGCCACAATGGGCACAACTCTCGGACGTGTCCCAATCGTGGCGTGAAGCTCTTCGGAGTGCGATTGACCGACGGCTCCATCCGGAAGAGTGCTAGTATGGGTAATCTGAG
TCACTATGCAGGATCCGGGTCGGGTGCGCTGCAAGGCGGGTCGAACAATCCGGCTTCTCCCGGCGAGACTCCGGAGCATGGTGTTGCTGCCGATGGCTACGCCTCGGAGG
ATTTCGTTCCTGGCTCGTCTTCGAGTTGCCGTGAGAGGAAGAAAGGTGTTCCATGGACTGAGGAGGAGCATAGGATGTTTTTACTTGGACTACAGAAACTTGGAAAAGGA
GACTGGCGCGGGATAGCACGTAATTATGTCATATCTAGGACACCTACTCAAGTGGCCAGCCATGCCCAAAAATATTTCATAAGGCAGACCAATGTATCAAGAAGAAAGAG
ACGCTCCAGTTTGTTTGATATTGTCGCAGACGAACCTGTTGAAACTTCAATTGTGCAGCAAGACTTCCTATCTGTTAACAGTTCACATGCTGAAACACAAAGCAATAACC
CATTGCCTACACCTCCTACTGTGGACGAAGAGTGTGAATCGATGGATTCCACCAACTCGAATGATGGAGAACCACCACCTCCAAAGGCAGATGGTTCCCAATGCTGTTAT
CCAGTGGTATATCCTGCATATGTTGCACCATTCTTTCCGTTTTCTATTCCATTTTACTCGGGATACAGTGCAGAGACGTCTAATAAGGAGGGACATGAGGTTCTTAAGCC
AACAGCTGTGCATTCAAAGAGTCCACTCAATGTTGATGAGCTGATTGGCATGTCAAAACTCAGTCTGGGAGAATCTATTGGTCATGCTGGCCCCTCTTCTCTTTCACTGA
AACTGCTCGAGGGGTCGTCTAGACGGTCTGCTTTCCATGCAAATCCAGCTTCTGGCAGTGAAAACATGAATTCTGGTGGTAGTCCAATCCATGCTGTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGGCTGCTCTGCAACTCCCGATCATCGATCTCTCCTCTCAAGACCGCATCTCCATGGCTCTCTCAATCCGCGAGGCGTGCTTGGATTACGGATTCTTTTACCTGGT
AAATCATGGCGTAGAAGATGAGTTGCTGGAGAGAGTGTTTGATGAAAGCAGGAAGTTCTTCTCTCTGCCTATCGAGGAGAAGACGAAACTTTCTCGGAAGGAGCATCGGG
GTTACACCCCTCTTTATGTTGAGAAGCTCGATCCGACCGCCTTGAGTTCTAGAGGTGATGCAAAAGAGGTGTTTGATATTGGCGCTTTGGCAGGAAATGCAACCGAGAAT
GACTTGAATCAGTGGCCTTCAAGAGAGCTGCTGCCTTCTTGGAGATCTACAATGGAATCCTTCTCTAAGCAAGCAATGCTTGCTGGAAGACAAATAGCATCTTTGATTGC
CATGGCTTTGAATTTGGATGAAACCTTCTTTGAGAAAATTGGTGCCTTGGATAATCCAATGGCATATCTTCGGCTTCTACATTATCCAGGTGACTTGAGATCTCCTAATG
AAGAACTATGTGGTGCTTCTGCACATTCAGATTATGGGATGATCACTCTCCTTGTTACCAATGGGGTTCCCGGACTTCAGGTATGTAAGGAAAAGTTCAATCAACCACGA
GTTTGGGAAGATGTTCTCCATATAGACAAGGCATTCATTGTCAATGCTGGGGACATGTTGGAAAGATGGACTAACTGTTTGTTCCGGTCAACTCTACATCGAGTGATTCC
AGTTGGAGAAGAACGTTATTCGGTGGTTTTCTTCTTAGATCCCAATGAAGATTGTATTGTGGAATGTCTTCAAAGTTGTTGTAGTGAGTCATCTCCTCCGAGATATCCTC
CAATTCGCAGTGGAGATTACTTGAAAGAACAGCTGAGTTCGTTTTCTGAAGCGGCGGCGTATTCCGTTCGATCCGGCGCCGGAAGAATGACTCGGAGGTGCTCGCATTGC
AGCCACAATGGGCACAACTCTCGGACGTGTCCCAATCGTGGCGTGAAGCTCTTCGGAGTGCGATTGACCGACGGCTCCATCCGGAAGAGTGCTAGTATGGGTAATCTGAG
TCACTATGCAGGATCCGGGTCGGGTGCGCTGCAAGGCGGGTCGAACAATCCGGCTTCTCCCGGCGAGACTCCGGAGCATGGTGTTGCTGCCGATGGCTACGCCTCGGAGG
ATTTCGTTCCTGGCTCGTCTTCGAGTTGCCGTGAGAGGAAGAAAGGTGTTCCATGGACTGAGGAGGAGCATAGGATGTTTTTACTTGGACTACAGAAACTTGGAAAAGGA
GACTGGCGCGGGATAGCACGTAATTATGTCATATCTAGGACACCTACTCAAGTGGCCAGCCATGCCCAAAAATATTTCATAAGGCAGACCAATGTATCAAGAAGAAAGAG
ACGCTCCAGTTTGTTTGATATTGTCGCAGACGAACCTGTTGAAACTTCAATTGTGCAGCAAGACTTCCTATCTGTTAACAGTTCACATGCTGAAACACAAAGCAATAACC
CATTGCCTACACCTCCTACTGTGGACGAAGAGTGTGAATCGATGGATTCCACCAACTCGAATGATGGAGAACCACCACCTCCAAAGGCAGATGGTTCCCAATGCTGTTAT
CCAGTGGTATATCCTGCATATGTTGCACCATTCTTTCCGTTTTCTATTCCATTTTACTCGGGATACAGTGCAGAGACGTCTAATAAGGAGGGACATGAGGTTCTTAAGCC
AACAGCTGTGCATTCAAAGAGTCCACTCAATGTTGATGAGCTGATTGGCATGTCAAAACTCAGTCTGGGAGAATCTATTGGTCATGCTGGCCCCTCTTCTCTTTCACTGA
AACTGCTCGAGGGGTCGTCTAGACGGTCTGCTTTCCATGCAAATCCAGCTTCTGGCAGTGAAAACATGAATTCTGGTGGTAGTCCAATCCATGCTGTTTAA
Protein sequenceShow/hide protein sequence
MKAALQLPIIDLSSQDRISMALSIREACLDYGFFYLVNHGVEDELLERVFDESRKFFSLPIEEKTKLSRKEHRGYTPLYVEKLDPTALSSRGDAKEVFDIGALAGNATEN
DLNQWPSRELLPSWRSTMESFSKQAMLAGRQIASLIAMALNLDETFFEKIGALDNPMAYLRLLHYPGDLRSPNEELCGASAHSDYGMITLLVTNGVPGLQVCKEKFNQPR
VWEDVLHIDKAFIVNAGDMLERWTNCLFRSTLHRVIPVGEERYSVVFFLDPNEDCIVECLQSCCSESSPPRYPPIRSGDYLKEQLSSFSEAAAYSVRSGAGRMTRRCSHC
SHNGHNSRTCPNRGVKLFGVRLTDGSIRKSASMGNLSHYAGSGSGALQGGSNNPASPGETPEHGVAADGYASEDFVPGSSSSCRERKKGVPWTEEEHRMFLLGLQKLGKG
DWRGIARNYVISRTPTQVASHAQKYFIRQTNVSRRKRRSSLFDIVADEPVETSIVQQDFLSVNSSHAETQSNNPLPTPPTVDEECESMDSTNSNDGEPPPPKADGSQCCY
PVVYPAYVAPFFPFSIPFYSGYSAETSNKEGHEVLKPTAVHSKSPLNVDELIGMSKLSLGESIGHAGPSSLSLKLLEGSSRRSAFHANPASGSENMNSGGSPIHAV