; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0003537 (gene) of Snake gourd v1 genome

Gene IDTan0003537
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptiontranscription factor MYB44-like
Genome locationContig00035_ERROPOS14500000+:14507..16349
RNA-Seq ExpressionTan0003537
SyntenyTan0003537
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6595010.1 Transcription factor MYB44, partial [Cucurbita argyrosperma subsp. sororia]6.7e-14686.36Show/hide
Query:  MAVTRKEMDRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNA
        MAV R EMDRIKGPWSPEEDDALQRLVQKHGPRNWSLIS+SI GRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNA
Subjt:  MAVTRKEMDRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNA

Query:  VKNHWNSTLKRKCSSMMNEGFEGDLNGQPLKKSVSAGAAVNVSNGFYMNPGSPSGSDVSDSSVPVMSPTVYRPVARTGGVIPPGESAPCSAADPPTSLSL
        VKNHWNSTLKRKCSSMMNEGFEG+ N QPLKKS SAGAAVN+SNG Y+NPGSPSGSD+SDSSVPV+SP VYRPVARTG VIPPGES   SA DPPTSLSL
Subjt:  VKNHWNSTLKRKCSSMMNEGFEGDLNGQPLKKSVSAGAAVNVSNGFYMNPGSPSGSDVSDSSVPVMSPTVYRPVARTGGVIPPGESAPCSAADPPTSLSL

Query:  SLPGVGATVDVSNRGGSGSTAQVPLMATFAQIQSMAASEQARAAQPDNRGGGAAEKINGFGVFNADLMAVMQEMIKSEVKRYMEGLSEQRGRACFPQAKA
        SLPGVGA  + SNR GSGSTAQVPL+A FAQIQSM ASE ARA+Q DNR   +AEKINGFG+F+ADLMAVMQEMI+SEVK YMEGLS+ RGR CF QAKA
Subjt:  SLPGVGATVDVSNRGGSGSTAQVPLMATFAQIQSMAASEQARAAQPDNRGGGAAEKINGFGVFNADLMAVMQEMIKSEVKRYMEGLSEQRGRACFPQAKA

Query:  GGIRNVGF
        GGIRNVGF
Subjt:  GGIRNVGF

KAG7027038.1 Transcription factor MYB44, partial [Cucurbita argyrosperma subsp. argyrosperma]2.3e-14686.69Show/hide
Query:  MAVTRKEMDRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNA
        MAV R EMDRIKGPWSPEEDDALQRLVQKHGPRNWSLIS+SI GRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNA
Subjt:  MAVTRKEMDRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNA

Query:  VKNHWNSTLKRKCSSMMNEGFEGDLNGQPLKKSVSAGAAVNVSNGFYMNPGSPSGSDVSDSSVPVMSPTVYRPVARTGGVIPPGESAPCSAADPPTSLSL
        VKNHWNSTLKRKCSSMMNEGFEG+ N QPLKKS SAGAAVN+SNG Y+NPGSPSGSD+SDSSVPV+SP VYRPVARTG VIPPGES   SA DPPTSLSL
Subjt:  VKNHWNSTLKRKCSSMMNEGFEGDLNGQPLKKSVSAGAAVNVSNGFYMNPGSPSGSDVSDSSVPVMSPTVYRPVARTGGVIPPGESAPCSAADPPTSLSL

Query:  SLPGVGATVDVSNRGGSGSTAQVPLMATFAQIQSMAASEQARAAQPDNRGGGAAEKINGFGVFNADLMAVMQEMIKSEVKRYMEGLSEQRGRACFPQAKA
        SLPGVGA  + SNR GSGSTAQVPL+A FAQIQSMAASE ARA+Q DNR   +AEKINGFG+F+ADLMAVMQEMI+SEVK YMEGLS+ RGR CF QAKA
Subjt:  SLPGVGATVDVSNRGGSGSTAQVPLMATFAQIQSMAASEQARAAQPDNRGGGAAEKINGFGVFNADLMAVMQEMIKSEVKRYMEGLSEQRGRACFPQAKA

Query:  GGIRNVGF
        GGIRNVGF
Subjt:  GGIRNVGF

XP_022963361.1 transcription factor MYB44-like [Cucurbita moschata]6.7e-14686.36Show/hide
Query:  MAVTRKEMDRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNA
        MAV R EMDRIKGPWSPEEDDALQRLVQKHGPRNWSLIS+SI GRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNA
Subjt:  MAVTRKEMDRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNA

Query:  VKNHWNSTLKRKCSSMMNEGFEGDLNGQPLKKSVSAGAAVNVSNGFYMNPGSPSGSDVSDSSVPVMSPTVYRPVARTGGVIPPGESAPCSAADPPTSLSL
        VKNHWNSTLKRKCSSMMNEGFEG+ N QPLKKS SAGAAVN+SNG YMNPGSPSGSD+SDSSVPV+SP VYRPVARTG V+PPGES   SA DPPTSLSL
Subjt:  VKNHWNSTLKRKCSSMMNEGFEGDLNGQPLKKSVSAGAAVNVSNGFYMNPGSPSGSDVSDSSVPVMSPTVYRPVARTGGVIPPGESAPCSAADPPTSLSL

Query:  SLPGVGATVDVSNRGGSGSTAQVPLMATFAQIQSMAASEQARAAQPDNRGGGAAEKINGFGVFNADLMAVMQEMIKSEVKRYMEGLSEQRGRACFPQAKA
        SLPGV A  D SNR GSGS AQVPL+A FAQIQSMAASE ARA+Q DNR   +AEKINGFG+F+ADLMAVMQEMI+SEVK YMEGLS+ RGR CF QAKA
Subjt:  SLPGVGATVDVSNRGGSGSTAQVPLMATFAQIQSMAASEQARAAQPDNRGGGAAEKINGFGVFNADLMAVMQEMIKSEVKRYMEGLSEQRGRACFPQAKA

Query:  GGIRNVGF
        GGIRNVGF
Subjt:  GGIRNVGF

XP_023003268.1 transcription factor MYB44-like [Cucurbita maxima]9.6e-14586.36Show/hide
Query:  MAVTRKEMDRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNA
        MAV R EMDRIKGPWSPEEDDALQRLVQKHGPRNWSLIS+SI GRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHAN GNRWATIARLLSGRTDNA
Subjt:  MAVTRKEMDRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNA

Query:  VKNHWNSTLKRKCSSMMNEGFEGDLNGQPLKKSVSAGAAVNVSNGFYMNPGSPSGSDVSDSSVPVMSPTVYRPVARTGGVIPPGESAPCSAADPPTSLSL
        VKNHWNSTLKRKCSSMMNEGFEG+ N QPLKKS SAGAAVN+SNG YM+PGSPSGSD SDSSVPV+SP VYRPVARTG VIPPGES   SA DPPTSLSL
Subjt:  VKNHWNSTLKRKCSSMMNEGFEGDLNGQPLKKSVSAGAAVNVSNGFYMNPGSPSGSDVSDSSVPVMSPTVYRPVARTGGVIPPGESAPCSAADPPTSLSL

Query:  SLPGVGATVDVSNRGGSGSTAQVPLMATFAQIQSMAASEQARAAQPDNRGGGAAEKINGFGVFNADLMAVMQEMIKSEVKRYMEGLSEQRGRACFPQAKA
        SLPGVGA  + SNR GSGSTAQVPL+A FAQIQSMAASE ARA+Q DNR   +AEKINGFG+F+ADLMAVMQEMI+SEVK YMEGLS+ RGR CF QAKA
Subjt:  SLPGVGATVDVSNRGGSGSTAQVPLMATFAQIQSMAASEQARAAQPDNRGGGAAEKINGFGVFNADLMAVMQEMIKSEVKRYMEGLSEQRGRACFPQAKA

Query:  GGIRNVGF
        GGIRNVGF
Subjt:  GGIRNVGF

XP_023517499.1 transcription factor MYB44-like [Cucurbita pepo subsp. pepo]3.0e-14686.69Show/hide
Query:  MAVTRKEMDRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNA
        MAV R EMDRIKGPWSPEEDDALQRLVQKHGPRNWSLIS+SI GRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNA
Subjt:  MAVTRKEMDRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNA

Query:  VKNHWNSTLKRKCSSMMNEGFEGDLNGQPLKKSVSAGAAVNVSNGFYMNPGSPSGSDVSDSSVPVMSPTVYRPVARTGGVIPPGESAPCSAADPPTSLSL
        VKNHWNSTLKRKCSSMMNEGFEG+ N QPLKKS SAGAAVN+SNG YMNPGSPSGSD+SDSSVPV+SP VYRPVARTG VIPPGES   SA DPPTSLSL
Subjt:  VKNHWNSTLKRKCSSMMNEGFEGDLNGQPLKKSVSAGAAVNVSNGFYMNPGSPSGSDVSDSSVPVMSPTVYRPVARTGGVIPPGESAPCSAADPPTSLSL

Query:  SLPGVGATVDVSNRGGSGSTAQVPLMATFAQIQSMAASEQARAAQPDNRGGGAAEKINGFGVFNADLMAVMQEMIKSEVKRYMEGLSEQRGRACFPQAKA
        SLPGVGA  + SNR GSGSTAQVPL+  FAQIQSMAASE ARA+Q DNR   +AEKINGFG+F+ADLMAVMQEMI+SEVK YMEGLS+ RGR CF QAKA
Subjt:  SLPGVGATVDVSNRGGSGSTAQVPLMATFAQIQSMAASEQARAAQPDNRGGGAAEKINGFGVFNADLMAVMQEMIKSEVKRYMEGLSEQRGRACFPQAKA

Query:  GGIRNVGF
        GGIRNVGF
Subjt:  GGIRNVGF

TrEMBL top hitse value%identityAlignment
A0A0A0KJL5 Sucrose responsive element binding protein7.9e-14586.36Show/hide
Query:  MAVTRKEMDRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNA
        MA+TRK+MDRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLL+GRTDNA
Subjt:  MAVTRKEMDRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNA

Query:  VKNHWNSTLKRKCSSMMNEGFEGDLNGQPLKKSVSAGAAVNVSNGFYMNPGSPSGSDVSDSSVPVMSPTVYRPVARTGGVIPPGESAPCSAADPPTSLSL
        VKNHWNSTLKRKCS MMNEG+E D N QP+KKSVSAGAAVN SNG YM+PGSPSGSD+SDSSVPV+SPTVYRPVARTGGVIPPGESAP SA DPPTSLSL
Subjt:  VKNHWNSTLKRKCSSMMNEGFEGDLNGQPLKKSVSAGAAVNVSNGFYMNPGSPSGSDVSDSSVPVMSPTVYRPVARTGGVIPPGESAPCSAADPPTSLSL

Query:  SLPGVGATVDVSNRGGSGSTAQVPLMATFAQIQSMAASEQARAAQPDNRGGGAAEKINGFGVFNADLMAVMQEMIKSEVKRYMEGLSEQRGRACFPQAKA
        SLPG    VD S   GSGSTAQVPLMA FAQIQSM  +EQ R AQP    GGA EKINGFGVF+ADLMAVMQEMIKSEVK YMEGLSEQRGR CF +AKA
Subjt:  SLPGVGATVDVSNRGGSGSTAQVPLMATFAQIQSMAASEQARAAQPDNRGGGAAEKINGFGVFNADLMAVMQEMIKSEVKRYMEGLSEQRGRACFPQAKA

Query:  GGIRNVGF
        GGI+NV F
Subjt:  GGIRNVGF

A0A1S3B0F2 transcription factor MYB441.7e-14285.16Show/hide
Query:  MAVTRKEMDRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNA
        MA+TRK+MDRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLL+GRTDNA
Subjt:  MAVTRKEMDRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNA

Query:  VKNHWNSTLKRKCSSMMNEGFEGDLNGQPLKKSVSAGAAVNVSNGFYMNPGSPSGSDVSDSSVPVMSPTVYRPVARTGGVIPPGESAPCSAADPPTSLSL
        VKNHWNSTLKRKCS MMNEG+E D N QP+KKS+SAGAAVN SNG YM+PGSPSGSD+SDSSVPV++PTVYRPVARTGGVIPPGESAP SA+DPPTSL L
Subjt:  VKNHWNSTLKRKCSSMMNEGFEGDLNGQPLKKSVSAGAAVNVSNGFYMNPGSPSGSDVSDSSVPVMSPTVYRPVARTGGVIPPGESAPCSAADPPTSLSL

Query:  SLPGVGATVDVSNRGGSGSTAQVPLMATFAQIQSMAASEQARAAQPDNRGGGAAEKINGFGVFNADLMAVMQEMIKSEVKRYME--GLSEQRGRACFPQA
        SLPG    VD S   GSGSTAQVPLMA FAQIQSM  +EQ R AQP    GGAAEKINGFGVF+ADLMAVMQEMIKSEVK YME  GLSEQRGR CF +A
Subjt:  SLPGVGATVDVSNRGGSGSTAQVPLMATFAQIQSMAASEQARAAQPDNRGGGAAEKINGFGVFNADLMAVMQEMIKSEVKRYME--GLSEQRGRACFPQA

Query:  KAGGIRNVGF
        KAGGI+NV F
Subjt:  KAGGIRNVGF

A0A5D3CR91 Transcription factor MYB441.5e-14385.81Show/hide
Query:  MAVTRKEMDRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNA
        MA+TRK+MDRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLL+GRTDNA
Subjt:  MAVTRKEMDRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNA

Query:  VKNHWNSTLKRKCSSMMNEGFEGDLNGQPLKKSVSAGAAVNVSNGFYMNPGSPSGSDVSDSSVPVMSPTVYRPVARTGGVIPPGESAPCSAADPPTSLSL
        VKNHWNSTLKRKCS MMNEG+E D N QP+KKS+SAGAAVN SNG YM+PGSPSGSD+SDSSVPV+SPTVYRPVARTGGVIPPGESAP SA+DPPTSLSL
Subjt:  VKNHWNSTLKRKCSSMMNEGFEGDLNGQPLKKSVSAGAAVNVSNGFYMNPGSPSGSDVSDSSVPVMSPTVYRPVARTGGVIPPGESAPCSAADPPTSLSL

Query:  SLPGVGATVDVSNRGGSGSTAQVPLMATFAQIQSMAASEQARAAQPDNRGGGAAEKINGFGVFNADLMAVMQEMIKSEVKRYME--GLSEQRGRACFPQA
        SLPG    VD S   GSGSTAQVPLMA FAQIQSM  +EQ R AQP    GGAAEKINGFGVF+ADLMAVMQEMIKSEVK YME  GLSEQRGR CF +A
Subjt:  SLPGVGATVDVSNRGGSGSTAQVPLMATFAQIQSMAASEQARAAQPDNRGGGAAEKINGFGVFNADLMAVMQEMIKSEVKRYME--GLSEQRGRACFPQA

Query:  KAGGIRNVGF
        KAGGI+NV F
Subjt:  KAGGIRNVGF

A0A6J1HFY1 transcription factor MYB44-like3.2e-14686.36Show/hide
Query:  MAVTRKEMDRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNA
        MAV R EMDRIKGPWSPEEDDALQRLVQKHGPRNWSLIS+SI GRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNA
Subjt:  MAVTRKEMDRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNA

Query:  VKNHWNSTLKRKCSSMMNEGFEGDLNGQPLKKSVSAGAAVNVSNGFYMNPGSPSGSDVSDSSVPVMSPTVYRPVARTGGVIPPGESAPCSAADPPTSLSL
        VKNHWNSTLKRKCSSMMNEGFEG+ N QPLKKS SAGAAVN+SNG YMNPGSPSGSD+SDSSVPV+SP VYRPVARTG V+PPGES   SA DPPTSLSL
Subjt:  VKNHWNSTLKRKCSSMMNEGFEGDLNGQPLKKSVSAGAAVNVSNGFYMNPGSPSGSDVSDSSVPVMSPTVYRPVARTGGVIPPGESAPCSAADPPTSLSL

Query:  SLPGVGATVDVSNRGGSGSTAQVPLMATFAQIQSMAASEQARAAQPDNRGGGAAEKINGFGVFNADLMAVMQEMIKSEVKRYMEGLSEQRGRACFPQAKA
        SLPGV A  D SNR GSGS AQVPL+A FAQIQSMAASE ARA+Q DNR   +AEKINGFG+F+ADLMAVMQEMI+SEVK YMEGLS+ RGR CF QAKA
Subjt:  SLPGVGATVDVSNRGGSGSTAQVPLMATFAQIQSMAASEQARAAQPDNRGGGAAEKINGFGVFNADLMAVMQEMIKSEVKRYMEGLSEQRGRACFPQAKA

Query:  GGIRNVGF
        GGIRNVGF
Subjt:  GGIRNVGF

A0A6J1KW03 transcription factor MYB44-like4.7e-14586.36Show/hide
Query:  MAVTRKEMDRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNA
        MAV R EMDRIKGPWSPEEDDALQRLVQKHGPRNWSLIS+SI GRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHAN GNRWATIARLLSGRTDNA
Subjt:  MAVTRKEMDRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNA

Query:  VKNHWNSTLKRKCSSMMNEGFEGDLNGQPLKKSVSAGAAVNVSNGFYMNPGSPSGSDVSDSSVPVMSPTVYRPVARTGGVIPPGESAPCSAADPPTSLSL
        VKNHWNSTLKRKCSSMMNEGFEG+ N QPLKKS SAGAAVN+SNG YM+PGSPSGSD SDSSVPV+SP VYRPVARTG VIPPGES   SA DPPTSLSL
Subjt:  VKNHWNSTLKRKCSSMMNEGFEGDLNGQPLKKSVSAGAAVNVSNGFYMNPGSPSGSDVSDSSVPVMSPTVYRPVARTGGVIPPGESAPCSAADPPTSLSL

Query:  SLPGVGATVDVSNRGGSGSTAQVPLMATFAQIQSMAASEQARAAQPDNRGGGAAEKINGFGVFNADLMAVMQEMIKSEVKRYMEGLSEQRGRACFPQAKA
        SLPGVGA  + SNR GSGSTAQVPL+A FAQIQSMAASE ARA+Q DNR   +AEKINGFG+F+ADLMAVMQEMI+SEVK YMEGLS+ RGR CF QAKA
Subjt:  SLPGVGATVDVSNRGGSGSTAQVPLMATFAQIQSMAASEQARAAQPDNRGGGAAEKINGFGVFNADLMAVMQEMIKSEVKRYMEGLSEQRGRACFPQAKA

Query:  GGIRNVGF
        GGIRNVGF
Subjt:  GGIRNVGF

SwissProt top hitse value%identityAlignment
O04192 Transcription factor MYB251.1e-3745.15Show/hide
Query:  RIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNAVKNHWNSTL
        ++KGPW PE+D+AL RLV+  GPRNW+LIS+ IPGRSGKSCRLRWCNQL P ++ +PFS EE+  I+ A A  GN+W+ IA+LL GRTDNA+KNHWNS L
Subjt:  RIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNAVKNHWNSTL

Query:  KRKCSS-------MMNEGFEGDLNGQPLKKSVSAGAAVNVSNGFYMNPGSPSGSDVSDSSVPVMSPT---VYRPVARTG--GVIPPGESAPCSA----AD
        +RK +        M N      L    +++  +A    ++         S    D      P    +   VYRPVAR G   V  PG  APC      A 
Subjt:  KRKCSS-------MMNEGFEGDLNGQPLKKSVSAGAAVNVSNGFYMNPGSPSGSDVSDSSVPVMSPT---VYRPVARTG--GVIPPGESAPCSA----AD

Query:  PPTSLS
         P SL+
Subjt:  PPTSLS

O23160 Transcription factor MYB731.1e-7452.76Show/hide
Query:  TRKEMDRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNAVKN
        TRK M+RIKGPWSPEEDD LQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSP+VEHR FS EEDETIIRAHA FGN+WATI+RLL+GRTDNA+KN
Subjt:  TRKEMDRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNAVKN

Query:  HWNSTLKRKCSSM-------MNEGFEGDL-NGQPLKKSVSAGAAVNVSNGFYMNPGSPSGSDVSDSS---VPVMSPTVYRPVARTGGVIPPGESAPCSAA
        HWNSTLKRKCS          N G++G+L   QPLK++ S G    VS G YM+PGSPSGSDVS+ S     V  PTV   V           +A  S  
Subjt:  HWNSTLKRKCSSM-------MNEGFEGDL-NGQPLKKSVSAGAAVNVSNGFYMNPGSPSGSDVSDSS---VPVMSPTVYRPVARTGGVIPPGESAPCSAA

Query:  DPPTSLSLSLPGVGATVDVSNRGGSGSTAQVPLMATFAQIQSMAASEQARAAQPDNRGGGAAEKINGFGVFNADLMAVMQEMIKSEVKRYMEGLSEQR--
        DPPT LSLSLP    TV V+          +    T A++  +   EQ    + + +G          G F  + M V+QEMI++EV+ YM  L      
Subjt:  DPPTSLSLSLPGVGATVDVSNRGGSGSTAQVPLMATFAQIQSMAASEQARAAQPDNRGGGAAEKINGFGVFNADLMAVMQEMIKSEVKRYMEGLSEQR--

Query:  --------GRACFPQAKAGGIRNVGF
                G +C PQ+     R VGF
Subjt:  --------GRACFPQAKAGGIRNVGF

Q42575 Transcription factor MYB11.5e-3642.71Show/hide
Query:  RKEMDRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNAVKNH
        R   DR+KGPWS EEDD L  LV++ G RNWS I++SIPGRSGKSCRLRWCNQL+P +    F+  ED+ II AHA  GN+WA IA+LL GRTDNA+KNH
Subjt:  RKEMDRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNAVKNH

Query:  WNSTLKRK-----------CSSMM--NEGFEGDLNGQPLKKSVSAGAAVNVSNGFYMNPGSPSGSDVSDSSVPVMSPTVYRPVARTGGVIPP
        WNS L+R+             S++  + GF+        ++++S+G   +V+       G  + + +  S    +  T    ++R     PP
Subjt:  WNSTLKRK-----------CSSMM--NEGFEGDLNGQPLKKSVSAGAAVNVSNGFYMNPGSPSGSDVSDSSVPVMSPTVYRPVARTGGVIPP

Q9FDW1 Transcription factor MYB442.2e-7555.9Show/hide
Query:  DRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNAVKNHWNST
        DRIKGPWSPEED+ L+RLV K+GPRNW++ISKSIPGRSGKSCRLRWCNQLSPQVEHRPFS EEDETI RAHA FGN+WATIARLL+GRTDNAVKNHWNST
Subjt:  DRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNAVKNHWNST

Query:  LKRKCSSMMNEGFEGDLNGQPLKKSVSAGAAVNVSNGFYMNPGSPSGSDVSDSS-VPVM-SPTVYRPVARTGGVI-PPGESAPCSAADPPTSLSLSLPGV
        LKRKC    + G++G  + +P+K+SVSAG+   V  G YM+PGSP+GSDVSDSS +P++ S  +++PV R G V+ P       S+ DPPTSLSLSLPG 
Subjt:  LKRKCSSMMNEGFEGDLNGQPLKKSVSAGAAVNVSNGFYMNPGSPSGSDVSDSS-VPVM-SPTVYRPVARTGGVI-PPGESAPCSAADPPTSLSLSLPGV

Query:  GATVDVSNRGGSGSTAQVPLMATFAQIQSMAASEQARAAQPDNRG--GGAAEKINGFGVFNADLMAVMQEMIKSEVKRYMEGLSEQRG
          + + SNR    +              S        +  P + G  G   E    F     + MAV+QEMIK+EV+ YM  +    G
Subjt:  GATVDVSNRGGSGSTAQVPLMATFAQIQSMAASEQARAAQPDNRG--GGAAEKINGFGVFNADLMAVMQEMIKSEVKRYMEGLSEQRG

Q9SN12 Transcription factor MYB774.6e-6548.74Show/hide
Query:  DRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNAVKNHWNST
        DR+KGPWS EED+ L+R+V+K+GPRNWS ISKSIPGRSGKSCRLRWCNQLSP+VEHRPFSPEEDETI+ A A FGN+WATIARLL+GRTDNAVKNHWNST
Subjt:  DRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNAVKNHWNST

Query:  LKRKCSSMMN----EGFEGDLNGQPLKKSVSAGAA-VNVSNGFYMNPGSPSGSDVSDSSVPVMSPT-----VYRPVARTGG--VIP---PGESAPCSAAD
        LKRKCS  +        E D +    ++SVS  +A   V  G YM+P SP+G DVSDSS  + SP+     +++P+  +GG  V+P   P E +  S+ D
Subjt:  LKRKCSSMMN----EGFEGDLNGQPLKKSVSAGAA-VNVSNGFYMNPGSPSGSDVSDSSVPVMSPT-----VYRPVARTGG--VIP---PGESAPCSAAD

Query:  PPTSLSLSLPGVGATVDVSNRGGSGSTAQVPLMATFAQIQSMAASEQARAAQPDNRGGGAAEKINGFGVFNADLMAVMQEMIKSEVKRYMEGLSEQRG--
        PPTSLSLSLPG   T   S+   + +    P              E       + RG G             + M V+QEMIK+EV+ YM  + +  G  
Subjt:  PPTSLSLSLPGVGATVDVSNRGGSGSTAQVPLMATFAQIQSMAASEQARAAQPDNRGGGAAEKINGFGVFNADLMAVMQEMIKSEVKRYMEGLSEQRG--

Query:  --RACFPQAKAGGIRNVG
             +     GG R+ G
Subjt:  --RACFPQAKAGGIRNVG

Arabidopsis top hitse value%identityAlignment
AT2G23290.1 myb domain protein 701.1e-6950.45Show/hide
Query:  TRKEMDRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNAVKN
        TRKEMDRIKGPWSPEEDD LQ LVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSP+VEHR F+ EED+TII AHA FGN+WATIARLL+GRTDNA+KN
Subjt:  TRKEMDRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNAVKN

Query:  HWNSTLKRKCSS-------------MMNEGFEGDLNGQ-PLKKSVSAGAAVNVSNGFYMNPGSPSGSDVSDSS------VPVMSPT-VYRPVARTGGVIP
        HWNSTLKRKCS                N G++G+L  + PLK+  S G  V V         SP+GSDVS+ S      +PV S   V++P AR GGV+ 
Subjt:  HWNSTLKRKCSS-------------MMNEGFEGDLNGQ-PLKKSVSAGAAVNVSNGFYMNPGSPSGSDVSDSS------VPVMSPT-VYRPVARTGGVIP

Query:  PGESAPCSAADPPTSLSLSLPGVGATVDVSNRGGSGSTAQVPLMATFAQIQSMAASEQARAAQPDNRGGGAAEKINGFGVFNADLMAVMQEMIKSEVKRY
           S      DP T L LSLP V           + ST    L      ++     E+ R             +I+G G    D M V+QEMIK+EV+ Y
Subjt:  PGESAPCSAADPPTSLSLSLPGVGATVDVSNRGGSGSTAQVPLMATFAQIQSMAASEQARAAQPDNRGGGAAEKINGFGVFNADLMAVMQEMIKSEVKRY

Query:  MEGLSEQRG-------RACFPQAKAGGIRNVGF
        M  L    G        +C  Q   G  RNVGF
Subjt:  MEGLSEQRG-------RACFPQAKAGGIRNVGF

AT3G50060.1 myb domain protein 773.3e-6648.74Show/hide
Query:  DRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNAVKNHWNST
        DR+KGPWS EED+ L+R+V+K+GPRNWS ISKSIPGRSGKSCRLRWCNQLSP+VEHRPFSPEEDETI+ A A FGN+WATIARLL+GRTDNAVKNHWNST
Subjt:  DRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNAVKNHWNST

Query:  LKRKCSSMMN----EGFEGDLNGQPLKKSVSAGAA-VNVSNGFYMNPGSPSGSDVSDSSVPVMSPT-----VYRPVARTGG--VIP---PGESAPCSAAD
        LKRKCS  +        E D +    ++SVS  +A   V  G YM+P SP+G DVSDSS  + SP+     +++P+  +GG  V+P   P E +  S+ D
Subjt:  LKRKCSSMMN----EGFEGDLNGQPLKKSVSAGAA-VNVSNGFYMNPGSPSGSDVSDSSVPVMSPT-----VYRPVARTGG--VIP---PGESAPCSAAD

Query:  PPTSLSLSLPGVGATVDVSNRGGSGSTAQVPLMATFAQIQSMAASEQARAAQPDNRGGGAAEKINGFGVFNADLMAVMQEMIKSEVKRYMEGLSEQRG--
        PPTSLSLSLPG   T   S+   + +    P              E       + RG G             + M V+QEMIK+EV+ YM  + +  G  
Subjt:  PPTSLSLSLPGVGATVDVSNRGGSGSTAQVPLMATFAQIQSMAASEQARAAQPDNRGGGAAEKINGFGVFNADLMAVMQEMIKSEVKRYMEGLSEQRG--

Query:  --RACFPQAKAGGIRNVG
             +     GG R+ G
Subjt:  --RACFPQAKAGGIRNVG

AT3G55730.1 myb domain protein 1097.3e-4248.04Show/hide
Query:  RIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNAVKNHWNSTL
        ++KGPWS EED  L +LV+K GPRNWSLI++ IPGRSGKSCRLRWCNQL P ++ +PFS EED  II AHA  GN+WA IA+LL+GRTDNA+KNHWNSTL
Subjt:  RIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNAVKNHWNSTL

Query:  KRKCSSMMNEGFEGDLNGQPLKKSVSAGAAVNVSNGFYMNPGS-----PSGSDVSDSSVPVMSPTVYRPVARTGGVIP-----PGESAPCSAADPPTSLS
        +RK + + N       NGQ +  SV+  +  N +     NP S     P G D++ S  P   P V   V       P       E AP   ++ PT  +
Subjt:  KRKCSSMMNEGFEGDLNGQPLKKSVSAGAAVNVSNGFYMNPGS-----PSGSDVSDSSVPVMSPTVYRPVARTGGVIP-----PGESAPCSAADPPTSLS

Query:  LSLP
        +  P
Subjt:  LSLP

AT4G37260.1 myb domain protein 737.8e-7652.76Show/hide
Query:  TRKEMDRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNAVKN
        TRK M+RIKGPWSPEEDD LQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSP+VEHR FS EEDETIIRAHA FGN+WATI+RLL+GRTDNA+KN
Subjt:  TRKEMDRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNAVKN

Query:  HWNSTLKRKCSSM-------MNEGFEGDL-NGQPLKKSVSAGAAVNVSNGFYMNPGSPSGSDVSDSS---VPVMSPTVYRPVARTGGVIPPGESAPCSAA
        HWNSTLKRKCS          N G++G+L   QPLK++ S G    VS G YM+PGSPSGSDVS+ S     V  PTV   V           +A  S  
Subjt:  HWNSTLKRKCSSM-------MNEGFEGDL-NGQPLKKSVSAGAAVNVSNGFYMNPGSPSGSDVSDSS---VPVMSPTVYRPVARTGGVIPPGESAPCSAA

Query:  DPPTSLSLSLPGVGATVDVSNRGGSGSTAQVPLMATFAQIQSMAASEQARAAQPDNRGGGAAEKINGFGVFNADLMAVMQEMIKSEVKRYMEGLSEQR--
        DPPT LSLSLP    TV V+          +    T A++  +   EQ    + + +G          G F  + M V+QEMI++EV+ YM  L      
Subjt:  DPPTSLSLSLPGVGATVDVSNRGGSGSTAQVPLMATFAQIQSMAASEQARAAQPDNRGGGAAEKINGFGVFNADLMAVMQEMIKSEVKRYMEGLSEQR--

Query:  --------GRACFPQAKAGGIRNVGF
                G +C PQ+     R VGF
Subjt:  --------GRACFPQAKAGGIRNVGF

AT5G67300.1 myb domain protein r11.6e-7655.9Show/hide
Query:  DRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNAVKNHWNST
        DRIKGPWSPEED+ L+RLV K+GPRNW++ISKSIPGRSGKSCRLRWCNQLSPQVEHRPFS EEDETI RAHA FGN+WATIARLL+GRTDNAVKNHWNST
Subjt:  DRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNAVKNHWNST

Query:  LKRKCSSMMNEGFEGDLNGQPLKKSVSAGAAVNVSNGFYMNPGSPSGSDVSDSS-VPVM-SPTVYRPVARTGGVI-PPGESAPCSAADPPTSLSLSLPGV
        LKRKC    + G++G  + +P+K+SVSAG+   V  G YM+PGSP+GSDVSDSS +P++ S  +++PV R G V+ P       S+ DPPTSLSLSLPG 
Subjt:  LKRKCSSMMNEGFEGDLNGQPLKKSVSAGAAVNVSNGFYMNPGSPSGSDVSDSS-VPVM-SPTVYRPVARTGGVI-PPGESAPCSAADPPTSLSLSLPGV

Query:  GATVDVSNRGGSGSTAQVPLMATFAQIQSMAASEQARAAQPDNRG--GGAAEKINGFGVFNADLMAVMQEMIKSEVKRYMEGLSEQRG
          + + SNR    +              S        +  P + G  G   E    F     + MAV+QEMIK+EV+ YM  +    G
Subjt:  GATVDVSNRGGSGSTAQVPLMATFAQIQSMAASEQARAAQPDNRG--GGAAEKINGFGVFNADLMAVMQEMIKSEVKRYMEGLSEQRG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGTTACCCGTAAAGAGATGGATCGGATCAAGGGTCCGTGGAGCCCTGAGGAAGACGACGCTCTACAGAGACTGGTCCAGAAGCATGGCCCACGTAACTGGTCTCT
CATCAGCAAATCGATTCCCGGCCGCTCCGGTAAGTCCTGCCGGCTCCGGTGGTGCAATCAGCTCTCCCCTCAGGTGGAGCACCGCCCCTTCTCGCCGGAGGAGGACGAGA
CCATTATCAGAGCTCATGCTAACTTCGGCAACAGATGGGCTACTATTGCTCGGCTTCTTTCCGGCCGGACGGATAACGCCGTCAAGAATCACTGGAATTCGACGCTGAAG
CGAAAGTGTTCATCGATGATGAATGAGGGGTTTGAAGGGGATCTCAACGGTCAGCCTCTGAAGAAATCGGTCAGCGCCGGCGCCGCCGTCAACGTCTCCAACGGGTTCTA
TATGAACCCCGGTAGTCCATCGGGATCTGACGTAAGCGATTCTAGCGTTCCTGTTATGTCTCCGACCGTCTACCGCCCCGTTGCCAGAACAGGCGGCGTTATACCCCCGG
GAGAATCGGCGCCGTGCTCGGCCGCCGACCCACCAACGTCGCTGTCGTTGTCTCTCCCTGGAGTGGGTGCCACCGTCGACGTTTCGAACCGCGGTGGAAGTGGATCAACG
GCGCAGGTGCCTCTAATGGCGACATTTGCTCAAATACAGAGCATGGCAGCATCGGAGCAGGCGAGGGCGGCGCAGCCAGATAACAGAGGCGGTGGTGCGGCCGAGAAAAT
TAATGGGTTTGGAGTATTCAATGCAGATTTAATGGCGGTGATGCAAGAAATGATAAAATCGGAGGTGAAAAGGTATATGGAAGGGTTATCGGAGCAGAGAGGAAGAGCTT
GTTTTCCGCAGGCTAAAGCCGGTGGGATTAGAAACGTTGGTTTTTAA
mRNA sequenceShow/hide mRNA sequence
TTAAAACAAGCCAACCGGCTCAGTCGGTTGTCCGTCTCATCCCTTTTATAAGAACCCCACCCTCCATTAACCCTACAGAACTACAGAGAGCGAGCCAAACAAACAGGCCT
GAGCTCCACGACGCCATTAATACCCATTTTCAAACGAAACATTTAACCTTTATTCCCCTTTTCCATTTCCCCAATTTTCATGATCTTTTAAAGCTTCTATTAATAATTCG
TTCATGGCGGTTACCCGTAAAGAGATGGATCGGATCAAGGGTCCGTGGAGCCCTGAGGAAGACGACGCTCTACAGAGACTGGTCCAGAAGCATGGCCCACGTAACTGGTC
TCTCATCAGCAAATCGATTCCCGGCCGCTCCGGTAAGTCCTGCCGGCTCCGGTGGTGCAATCAGCTCTCCCCTCAGGTGGAGCACCGCCCCTTCTCGCCGGAGGAGGACG
AGACCATTATCAGAGCTCATGCTAACTTCGGCAACAGATGGGCTACTATTGCTCGGCTTCTTTCCGGCCGGACGGATAACGCCGTCAAGAATCACTGGAATTCGACGCTG
AAGCGAAAGTGTTCATCGATGATGAATGAGGGGTTTGAAGGGGATCTCAACGGTCAGCCTCTGAAGAAATCGGTCAGCGCCGGCGCCGCCGTCAACGTCTCCAACGGGTT
CTATATGAACCCCGGTAGTCCATCGGGATCTGACGTAAGCGATTCTAGCGTTCCTGTTATGTCTCCGACCGTCTACCGCCCCGTTGCCAGAACAGGCGGCGTTATACCCC
CGGGAGAATCGGCGCCGTGCTCGGCCGCCGACCCACCAACGTCGCTGTCGTTGTCTCTCCCTGGAGTGGGTGCCACCGTCGACGTTTCGAACCGCGGTGGAAGTGGATCA
ACGGCGCAGGTGCCTCTAATGGCGACATTTGCTCAAATACAGAGCATGGCAGCATCGGAGCAGGCGAGGGCGGCGCAGCCAGATAACAGAGGCGGTGGTGCGGCCGAGAA
AATTAATGGGTTTGGAGTATTCAATGCAGATTTAATGGCGGTGATGCAAGAAATGATAAAATCGGAGGTGAAAAGGTATATGGAAGGGTTATCGGAGCAGAGAGGAAGAG
CTTGTTTTCCGCAGGCTAAAGCCGGTGGGATTAGAAACGTTGGTTTTTAAGCAAACTGCGATTAATCATCAAGATATAAACTCTTTAAAGAGAAGAATTGTTGATTAGGA
TTTGGATTTGGAGGAGATTAGAATGATGATTGCAGTTAGTTACCATCTCGGACTTTGAGGAGAAGGAATTTTAGGGGAGAATGAGTACCACAGTAAGACAATATGTTTTG
TGGGGGAAGAAGAAAATCTGGGAGTTTGGTTTCTTTCCATTTTTCAACTCTTTTTTTTTTTAATTGTTGTTTTTTAAGGAAATAATATTTAGTTAATGTACAGAGAAGGG
AATGAAAGAAAAAAGAAAAGGGGGAAGTTCCCAATTCCATTACAGACAGGAAATTGAACAGAGGAAAAGTTAGGATGAAGTTGATGATGGACCCTAAAACTGAAAAAAGT
TGGGAGTTTTAAATTTAGAACTTAGCTTTCATTCTTGACGTTATTTGTCTCTCATTTTAAAGACCACATGGTCAAAAAAAAAAAAAAAAACCCAAAATCAAAACCAAAAC
CAAAAATCAGAAGTTGAGAACATAACTCCAATTGATTTCGTTAAGAATGTTGTGAGTTTCTTCTTGTGTGTTTGAAAATGGCAGCCCCGTTTAAGTTTTGAATAGCGGTG
AAATTTTGTTCTAATTTGTCTGGAAATGGAGAATAGCCAGGGTGAATTTTTTGAGTTATGGACTGGAAGCAATTGCATTGAGG
Protein sequenceShow/hide protein sequence
MAVTRKEMDRIKGPWSPEEDDALQRLVQKHGPRNWSLISKSIPGRSGKSCRLRWCNQLSPQVEHRPFSPEEDETIIRAHANFGNRWATIARLLSGRTDNAVKNHWNSTLK
RKCSSMMNEGFEGDLNGQPLKKSVSAGAAVNVSNGFYMNPGSPSGSDVSDSSVPVMSPTVYRPVARTGGVIPPGESAPCSAADPPTSLSLSLPGVGATVDVSNRGGSGST
AQVPLMATFAQIQSMAASEQARAAQPDNRGGGAAEKINGFGVFNADLMAVMQEMIKSEVKRYMEGLSEQRGRACFPQAKAGGIRNVGF