; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS002147 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS002147
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptiontranscription factor MYB3R-2 isoform X3
Genome locationscaffold30:2728268..2736043
RNA-Seq ExpressionMS002147
SyntenyMS002147
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589626.1 Myb-like protein L, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0073.12Show/hide
Query:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLS-PAAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSL
        MSRRSH +GGD+EL   EEDDEDD+ D+DMEALRRACRL G N E+Y NP LS PAAG  +      GSDSDDVDDLEL+RNI+NRFSIAAD++      
Subjt:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLS-PAAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSL

Query:  HPLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNKPDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIR
         PLS LPPV+ DEEEDDFETLR+IQRRF+AYESD LSNKPDQSCD  GPL+MDS+ TDV R TSS RSSM+A EKG+LPKAALAFIDAIKKNRSQQKFIR
Subjt:  HPLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNKPDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIR

Query:  SKMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSN
        SKMIHLEARIEENKKLR+RFK+LK FQ SCRR+T   L+QM+DPRVQLISA KPQ KDSSKKDK+LSAM YGPAENSHVACYR ALTKF   V+RK+WSN
Subjt:  SKMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSN

Query:  VERENLGKGIRQQFQEMVLQISMDQISGLQGLSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWST
         ERENLGKGIRQQFQEMVLQIS+DQIS +QG SA+SDDLDNI ASIKDLDITPE+IREFLPKVNWDKLAS YL GRSGAECEARWLNFEDPLINRN W+T
Subjt:  VERENLGKGIRQQFQEMVLQISMDQISGLQGLSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWST

Query:  SEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTK
        SEDKNLL TIQQKGLNNWI +AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDEDDKLRSAVA+ G GDWQA+ASTLEGR G QCSNRWKKSLDPARTK
Subjt:  SEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTK

Query:  RGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPN
        RG FTPDED+RLKIAVLLFGPKNWNKKAEF+PGRNQVQCRERWFNCLDPSLR+CEWTEEEDLRLEIAIQEHGYSW KVAACVPSRTDNECRRRWKKLFPN
Subjt:  RGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPN

Query:  EVPLLQEARRIQKAALISNFVDRESERPSLGPTDFRPIPNTNLLCNADDPNPAPKRNVKPSLLNFILFFYS---DAPKRRKSNNHSNQA-----------
        +VPLLQEAR+IQK ALISNFVDRESERP+LGPTDFRP+PN++LLCN DDP  AP RNV+   +       S   DAPK+RKSNN  N+A           
Subjt:  EVPLLQEARRIQKAALISNFVDRESERPSLGPTDFRPIPNTNLLCNADDPNPAPKRNVKPSLLNFILFFYS---DAPKRRKSNNHSNQA-----------

Query:  -NGATEQVSKKPQRRQNGNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLN-SSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEEYQ
         +   E  S KPQR++  +GAYT +RKG  +   N+E+CAEQN +T+SLEVQLN    + R NS+CPE V ENGME  ENK AE  S+  + FSEQEE Q
Subjt:  -NGATEQVSKKPQRRQNGNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLN-SSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEEYQ

Query:  NSTGSSGVSVLSEMMNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRR
        NSTGSSGVSVLSEM NDMDEYNPS  PDT  LA  T DD I E K   VAD DL DSNSFSLP  CL LRT DSEGVDS SV E TDKS  V K QGRR+
Subjt:  NSTGSSGVSVLSEMMNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRR

Query:  RTNQ------------PKKELERSANKELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKK-TTTHKRKFDGGSSEKLEVESDGNDTTI
        + ++             + E E S   ELH  NQ KKRKH+ T TS  GT+E VEE DDCTLQGFLQK+LK+ TTTH +K DG SS   EV++D ND T+
Subjt:  RTNQ------------PKKELERSANKELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKK-TTTHKRKFDGGSSEKLEVESDGNDTTI

Query:  ASFLKNISKKKK
        A  LK+  K+KK
Subjt:  ASFLKNISKKKK

XP_022135114.1 uncharacterized protein LOC111007172 isoform X1 [Momordica charantia]0.0e+0097.47Show/hide
Query:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPAAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSLH
        MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSP AGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSLH
Subjt:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPAAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSLH

Query:  PLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNKPDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRS
        PLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSN  DQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRS
Subjt:  PLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNKPDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRS

Query:  KMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNV
        KMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNV
Subjt:  KMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNV

Query:  ERENLGKGIRQQFQEMVLQISMDQI-SGLQGLSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWST
        ERENLGKGIRQQFQEMVLQISMDQI SGLQG SADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWST
Subjt:  ERENLGKGIRQQFQEMVLQISMDQI-SGLQGLSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWST

Query:  SEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTK
        SEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTK
Subjt:  SEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTK

Query:  RGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPN
        RGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPN
Subjt:  RGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPN

Query:  EVPLLQEARRIQKAALISNFVDRESERPSLGPTDFRPIPNTNLLCNADDPNPAPKRNVKP--SLLNFILFFYSDAPKRRKSNNHSNQANGATEQVSKKPQ
        EVPLLQEARRIQKAALISNFVDRESERP+LGPTDFRPIPNTNLLCNADDPNPAPKRNVKP   +         DAPKRRKSNNHSNQANGATEQVSKKPQ
Subjt:  EVPLLQEARRIQKAALISNFVDRESERPSLGPTDFRPIPNTNLLCNADDPNPAPKRNVKP--SLLNFILFFYSDAPKRRKSNNHSNQANGATEQVSKKPQ

Query:  RRQNGNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLNSSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEEYQNSTGSSGVSVLSEM
        RRQN NGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLNSSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFF EQEE QNSTGSSGVSVLSEM
Subjt:  RRQNGNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLNSSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEEYQNSTGSSGVSVLSEM

Query:  MNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRRRTNQPKKELERSAN
        MNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRRRTNQPKKELERSAN
Subjt:  MNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRRRTNQPKKELERSAN

Query:  KELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTTHKRKFDGGSSEKLEVESDGNDTTIASFLKNISKKKKHKA
        KELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTTHKRKFDGGSS+KLEVES+GNDTTIASFLKNISKKKKHK+
Subjt:  KELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTTHKRKFDGGSSEKLEVESDGNDTTIASFLKNISKKKKHKA

XP_022135115.1 uncharacterized protein LOC111007172 isoform X2 [Momordica charantia]0.0e+0097.57Show/hide
Query:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPAAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSLH
        MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSP AGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSLH
Subjt:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPAAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSLH

Query:  PLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNKPDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRS
        PLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSN  DQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRS
Subjt:  PLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNKPDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRS

Query:  KMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNV
        KMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNV
Subjt:  KMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNV

Query:  ERENLGKGIRQQFQEMVLQISMDQISGLQGLSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWSTS
        ERENLGKGIRQQFQEMVLQISMDQISGLQG SADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWSTS
Subjt:  ERENLGKGIRQQFQEMVLQISMDQISGLQGLSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWSTS

Query:  EDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTKR
        EDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTKR
Subjt:  EDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTKR

Query:  GAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPNE
        GAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPNE
Subjt:  GAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPNE

Query:  VPLLQEARRIQKAALISNFVDRESERPSLGPTDFRPIPNTNLLCNADDPNPAPKRNVKP--SLLNFILFFYSDAPKRRKSNNHSNQANGATEQVSKKPQR
        VPLLQEARRIQKAALISNFVDRESERP+LGPTDFRPIPNTNLLCNADDPNPAPKRNVKP   +         DAPKRRKSNNHSNQANGATEQVSKKPQR
Subjt:  VPLLQEARRIQKAALISNFVDRESERPSLGPTDFRPIPNTNLLCNADDPNPAPKRNVKP--SLLNFILFFYSDAPKRRKSNNHSNQANGATEQVSKKPQR

Query:  RQNGNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLNSSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEEYQNSTGSSGVSVLSEMM
        RQN NGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLNSSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFF EQEE QNSTGSSGVSVLSEMM
Subjt:  RQNGNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLNSSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEEYQNSTGSSGVSVLSEMM

Query:  NDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRRRTNQPKKELERSANK
        NDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRRRTNQPKKELERSANK
Subjt:  NDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRRRTNQPKKELERSANK

Query:  ELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTTHKRKFDGGSSEKLEVESDGNDTTIASFLKNISKKKKHKA
        ELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTTHKRKFDGGSS+KLEVES+GNDTTIASFLKNISKKKKHK+
Subjt:  ELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTTHKRKFDGGSSEKLEVESDGNDTTIASFLKNISKKKKHKA

XP_022135116.1 transcription factor MYB3R-2 isoform X3 [Momordica charantia]0.0e+0093.12Show/hide
Query:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPAAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSLH
        MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSP AGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSLH
Subjt:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPAAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSLH

Query:  PLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNKPDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRS
        PLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSN  DQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRS
Subjt:  PLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNKPDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRS

Query:  KMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNV
        KMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNV
Subjt:  KMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNV

Query:  ERENLGKGIRQQFQEMVLQISMDQI-SGLQGLSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWST
        ERENLGKGIRQQFQEMVLQISMDQI SGLQG SADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWST
Subjt:  ERENLGKGIRQQFQEMVLQISMDQI-SGLQGLSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWST

Query:  SEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTK
        SEDKNLLLTIQQKGLNNWINIAVSL                                           GDWQAIASTLEGRAGTQCSNRWKKSLDPARTK
Subjt:  SEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTK

Query:  RGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPN
        RGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPN
Subjt:  RGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPN

Query:  EVPLLQEARRIQKAALISNFVDRESERPSLGPTDFRPIPNTNLLCNADDPNPAPKRNVKP--SLLNFILFFYSDAPKRRKSNNHSNQANGATEQVSKKPQ
        EVPLLQEARRIQKAALISNFVDRESERP+LGPTDFRPIPNTNLLCNADDPNPAPKRNVKP   +         DAPKRRKSNNHSNQANGATEQVSKKPQ
Subjt:  EVPLLQEARRIQKAALISNFVDRESERPSLGPTDFRPIPNTNLLCNADDPNPAPKRNVKP--SLLNFILFFYSDAPKRRKSNNHSNQANGATEQVSKKPQ

Query:  RRQNGNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLNSSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEEYQNSTGSSGVSVLSEM
        RRQN NGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLNSSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFF EQEE QNSTGSSGVSVLSEM
Subjt:  RRQNGNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLNSSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEEYQNSTGSSGVSVLSEM

Query:  MNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRRRTNQPKKELERSAN
        MNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRRRTNQPKKELERSAN
Subjt:  MNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRRRTNQPKKELERSAN

Query:  KELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTTHKRKFDGGSSEKLEVESDGNDTTIASFLKNISKKKKHKA
        KELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTTHKRKFDGGSS+KLEVES+GNDTTIASFLKNISKKKKHK+
Subjt:  KELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTTHKRKFDGGSSEKLEVESDGNDTTIASFLKNISKKKKHKA

XP_023515735.1 uncharacterized protein LOC111779809 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0073.05Show/hide
Query:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLS-PAAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSL
        MSRRSH +GGD+EL   EEDDEDD+ D+DME LRRACRL G N E+Y NP LS PAAG  +      GSDSDDVDDLEL+RNI+NRFSIAAD++      
Subjt:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLS-PAAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSL

Query:  HPLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNKPDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIR
         PLS LPPV+ DEEEDDFE LR+IQRRF+AYESD LSNKPDQSCD  GPL+MDS  TDV R TSS RSSM+A EKG+LPKAALAFIDAIKKNRSQQKFIR
Subjt:  HPLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNKPDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIR

Query:  SKMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSN
        SKMIHLEARIEENKKLR+RFK+LK FQ SCRR+T   L+QM+DPRVQLISA KPQ KDSSKKDK+LS+M YGPAENSHVACYR A TKF   V+RK+WSN
Subjt:  SKMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSN

Query:  VERENLGKGIRQQFQEMVLQISMDQISGLQGLSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWST
         ERENLGKGIRQQFQEMVLQIS+DQIS +QG SA+SDDLDNI ASIK LDITPE+IREFLPKVNWDKLAS YL GRSGAECEARWLNFEDPLINRNPW+T
Subjt:  VERENLGKGIRQQFQEMVLQISMDQISGLQGLSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWST

Query:  SEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTK
        SEDKNLL TIQQKGLNNWI +AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDEDDKLRSAVAV G GDWQA+ASTLEGR G QCSNRWKKSLDPARTK
Subjt:  SEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTK

Query:  RGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPN
        RG FTPDED+RLKIAVLLFGPKNWNKKAEF+PGRNQVQCRERWFNCLDPSLR+CEWTEEEDLRLEIAIQEHGYSW KVAACVPSRTDNECRRRWKKLFPN
Subjt:  RGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPN

Query:  EVPLLQEARRIQKAALISNFVDRESERPSLGPTDFRPIPNTNLLCNADDPNPAPKRNVKPSLLNFILFFYS---DAPKRRKSNNHSNQA-----------
        +VPLLQEAR+IQK ALISNFVDRESERP+LGPTDFRP+PN++LLCN DDP  APKRNV+   +       S   DAPKRRKSNN  N+A           
Subjt:  EVPLLQEARRIQKAALISNFVDRESERPSLGPTDFRPIPNTNLLCNADDPNPAPKRNVKPSLLNFILFFYS---DAPKRRKSNNHSNQA-----------

Query:  -NGATEQVSKKPQRRQNGNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLN-SSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEEYQ
         +   E  S KPQR++  +GAYT +RKG  +   N+E+CAEQN +T S+EVQLN    + R NS+CPE V ENGME  ENK AE  S+  + FSEQEE Q
Subjt:  -NGATEQVSKKPQRRQNGNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLN-SSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEEYQ

Query:  NSTGSSGVSVLSEMMNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRR
        NSTGSSGVSVLSEM NDMDEYNPS LPDT  LA  T  DDI E K   VAD DL DSNSFSLP  CL LRT DSEGVDS SV E TDKS  V K QGRR+
Subjt:  NSTGSSGVSVLSEMMNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRR

Query:  RTNQ------------PKKELERSANKELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKK-TTTHKRKFDGGSSEKLEVESDGNDTTI
        + ++             + ELE S   ELH  NQ KKRKH+ T TS  GT+E VEE DDCTLQGFLQK+LK+ TTTH +K DG SS   EV++D ND T+
Subjt:  RTNQ------------PKKELERSANKELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKK-TTTHKRKFDGGSSEKLEVESDGNDTTI

Query:  ASFLKNISKKKKH
        A  L +  K+KKH
Subjt:  ASFLKNISKKKKH

TrEMBL top hitse value%identityAlignment
A0A6J1BZQ5 transcription factor MYB3R-2 isoform X30.0e+0093.12Show/hide
Query:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPAAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSLH
        MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSP AGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSLH
Subjt:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPAAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSLH

Query:  PLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNKPDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRS
        PLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSN  DQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRS
Subjt:  PLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNKPDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRS

Query:  KMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNV
        KMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNV
Subjt:  KMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNV

Query:  ERENLGKGIRQQFQEMVLQISMDQI-SGLQGLSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWST
        ERENLGKGIRQQFQEMVLQISMDQI SGLQG SADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWST
Subjt:  ERENLGKGIRQQFQEMVLQISMDQI-SGLQGLSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWST

Query:  SEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTK
        SEDKNLLLTIQQKGLNNWINIAVSL                                           GDWQAIASTLEGRAGTQCSNRWKKSLDPARTK
Subjt:  SEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTK

Query:  RGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPN
        RGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPN
Subjt:  RGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPN

Query:  EVPLLQEARRIQKAALISNFVDRESERPSLGPTDFRPIPNTNLLCNADDPNPAPKRNVKP--SLLNFILFFYSDAPKRRKSNNHSNQANGATEQVSKKPQ
        EVPLLQEARRIQKAALISNFVDRESERP+LGPTDFRPIPNTNLLCNADDPNPAPKRNVKP   +         DAPKRRKSNNHSNQANGATEQVSKKPQ
Subjt:  EVPLLQEARRIQKAALISNFVDRESERPSLGPTDFRPIPNTNLLCNADDPNPAPKRNVKP--SLLNFILFFYSDAPKRRKSNNHSNQANGATEQVSKKPQ

Query:  RRQNGNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLNSSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEEYQNSTGSSGVSVLSEM
        RRQN NGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLNSSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFF EQEE QNSTGSSGVSVLSEM
Subjt:  RRQNGNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLNSSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEEYQNSTGSSGVSVLSEM

Query:  MNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRRRTNQPKKELERSAN
        MNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRRRTNQPKKELERSAN
Subjt:  MNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRRRTNQPKKELERSAN

Query:  KELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTTHKRKFDGGSSEKLEVESDGNDTTIASFLKNISKKKKHKA
        KELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTTHKRKFDGGSS+KLEVES+GNDTTIASFLKNISKKKKHK+
Subjt:  KELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTTHKRKFDGGSSEKLEVESDGNDTTIASFLKNISKKKKHKA

A0A6J1C075 uncharacterized protein LOC111007172 isoform X20.0e+0097.57Show/hide
Query:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPAAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSLH
        MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSP AGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSLH
Subjt:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPAAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSLH

Query:  PLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNKPDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRS
        PLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSN  DQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRS
Subjt:  PLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNKPDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRS

Query:  KMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNV
        KMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNV
Subjt:  KMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNV

Query:  ERENLGKGIRQQFQEMVLQISMDQISGLQGLSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWSTS
        ERENLGKGIRQQFQEMVLQISMDQISGLQG SADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWSTS
Subjt:  ERENLGKGIRQQFQEMVLQISMDQISGLQGLSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWSTS

Query:  EDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTKR
        EDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTKR
Subjt:  EDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTKR

Query:  GAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPNE
        GAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPNE
Subjt:  GAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPNE

Query:  VPLLQEARRIQKAALISNFVDRESERPSLGPTDFRPIPNTNLLCNADDPNPAPKRNVKP--SLLNFILFFYSDAPKRRKSNNHSNQANGATEQVSKKPQR
        VPLLQEARRIQKAALISNFVDRESERP+LGPTDFRPIPNTNLLCNADDPNPAPKRNVKP   +         DAPKRRKSNNHSNQANGATEQVSKKPQR
Subjt:  VPLLQEARRIQKAALISNFVDRESERPSLGPTDFRPIPNTNLLCNADDPNPAPKRNVKP--SLLNFILFFYSDAPKRRKSNNHSNQANGATEQVSKKPQR

Query:  RQNGNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLNSSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEEYQNSTGSSGVSVLSEMM
        RQN NGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLNSSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFF EQEE QNSTGSSGVSVLSEMM
Subjt:  RQNGNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLNSSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEEYQNSTGSSGVSVLSEMM

Query:  NDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRRRTNQPKKELERSANK
        NDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRRRTNQPKKELERSANK
Subjt:  NDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRRRTNQPKKELERSANK

Query:  ELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTTHKRKFDGGSSEKLEVESDGNDTTIASFLKNISKKKKHKA
        ELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTTHKRKFDGGSS+KLEVES+GNDTTIASFLKNISKKKKHK+
Subjt:  ELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTTHKRKFDGGSSEKLEVESDGNDTTIASFLKNISKKKKHKA

A0A6J1C0J5 uncharacterized protein LOC111007172 isoform X10.0e+0097.47Show/hide
Query:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPAAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSLH
        MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSP AGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSLH
Subjt:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPAAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSLH

Query:  PLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNKPDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRS
        PLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSN  DQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRS
Subjt:  PLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNKPDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRS

Query:  KMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNV
        KMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNV
Subjt:  KMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNV

Query:  ERENLGKGIRQQFQEMVLQISMDQI-SGLQGLSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWST
        ERENLGKGIRQQFQEMVLQISMDQI SGLQG SADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWST
Subjt:  ERENLGKGIRQQFQEMVLQISMDQI-SGLQGLSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWST

Query:  SEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTK
        SEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTK
Subjt:  SEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTK

Query:  RGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPN
        RGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPN
Subjt:  RGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPN

Query:  EVPLLQEARRIQKAALISNFVDRESERPSLGPTDFRPIPNTNLLCNADDPNPAPKRNVKP--SLLNFILFFYSDAPKRRKSNNHSNQANGATEQVSKKPQ
        EVPLLQEARRIQKAALISNFVDRESERP+LGPTDFRPIPNTNLLCNADDPNPAPKRNVKP   +         DAPKRRKSNNHSNQANGATEQVSKKPQ
Subjt:  EVPLLQEARRIQKAALISNFVDRESERPSLGPTDFRPIPNTNLLCNADDPNPAPKRNVKP--SLLNFILFFYSDAPKRRKSNNHSNQANGATEQVSKKPQ

Query:  RRQNGNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLNSSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEEYQNSTGSSGVSVLSEM
        RRQN NGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLNSSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFF EQEE QNSTGSSGVSVLSEM
Subjt:  RRQNGNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLNSSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEEYQNSTGSSGVSVLSEM

Query:  MNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRRRTNQPKKELERSAN
        MNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRRRTNQPKKELERSAN
Subjt:  MNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRRRTNQPKKELERSAN

Query:  KELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTTHKRKFDGGSSEKLEVESDGNDTTIASFLKNISKKKKHKA
        KELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTTHKRKFDGGSS+KLEVES+GNDTTIASFLKNISKKKKHK+
Subjt:  KELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTTHKRKFDGGSSEKLEVESDGNDTTIASFLKNISKKKKHKA

A0A6J1E6Z7 uncharacterized protein LOC111430000 isoform X10.0e+0072.85Show/hide
Query:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLS-PAAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSL
        MSRRSH +GGD+EL   EEDDEDD+ D+DME LRRACRL G N E+  NP LS PAAG  +      GSDSDDVDDLEL+RNI+NRFS AAD++      
Subjt:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLS-PAAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSL

Query:  HPLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNKPDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIR
         PLS LPPV+ DEEEDDFETLR+IQRRF+AYESD LSNKPDQSCD  GPL+MDS+ TDV R TSS RSSM+A EKG+LPKAALAFIDAIKKNRSQQKFIR
Subjt:  HPLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNKPDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIR

Query:  SKMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSN
        SKMIHLEARIEENKKLR+RFK+LK FQ SCRR+T   L+QM+DPRVQLISA KPQ KDSSKKDK+LSAM YGPAENSHVACYR ALTKF   V+RK+WSN
Subjt:  SKMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSN

Query:  VERENLGKGIRQQFQEMVLQISMDQISGLQGLSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWST
         ERENLGKGIRQQFQEMVLQIS+DQIS +QG SA+SDDLDNI ASIK LDITPE+IREFLPKVNWDKLA  YL GRSGAECEARWLNFEDPLINRN W+T
Subjt:  VERENLGKGIRQQFQEMVLQISMDQISGLQGLSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWST

Query:  SEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTK
        SEDKNLL TIQQKGLNNWI +AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDEDDKLRSAVA+ G GDWQA+ASTLEGR G QCSNRWKKSLDPARTK
Subjt:  SEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTK

Query:  RGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPN
        RG FTPDED+RLKIAVLLFGPKNWNKKAEF+PGRNQVQCRERWFNCLDPSLR+CEWTEEEDLRLEIAIQEHGYSW KVAACVPSRTDNECRRRWKKLFPN
Subjt:  RGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPN

Query:  EVPLLQEARRIQKAALISNFVDRESERPSLGPTDFRPIPNTNLLCNADDPNPAPKRNVKPSLLNFILFFYS---DAPKRRKSNNHSNQA-----------
        +VPLLQEAR+IQK ALISNFVDRESERP+LGPTDFRP+PN++LLCN DDP  APKRNV+   +       S   DAPK+ KSNN  NQA           
Subjt:  EVPLLQEARRIQKAALISNFVDRESERPSLGPTDFRPIPNTNLLCNADDPNPAPKRNVKPSLLNFILFFYS---DAPKRRKSNNHSNQA-----------

Query:  -NGATEQVSKKPQRRQNGNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLN-SSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEEYQ
         +   E  S KPQR++  +GAYT +RKG  +   N+E+CAEQN +T SLEVQLN    + R NS+CPE V ENGME  ENK AE  S+  + FSEQEE Q
Subjt:  -NGATEQVSKKPQRRQNGNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLN-SSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEEYQ

Query:  NSTGSSGVSVLSEMMNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRR
        NSTGSSGVSVLSEM NDMDEYNPS  PDT  LA  T  DDI E K   VAD DL DSNSFSLP  CL LRT DSEGVDS SV E TDKS  V K QGRR+
Subjt:  NSTGSSGVSVLSEMMNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRR

Query:  RTNQ------------PKKELERSANKELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKK-TTTHKRKFDGGSSEKLEVESDGNDTTI
        + ++             + ELE S   ELH  NQ KKRKH+ T TS  GT+E VEE DDCTLQGFLQK+LK+ TTTH +K DG SS   EV++D ND T+
Subjt:  RTNQ------------PKKELERSANKELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKK-TTTHKRKFDGGSSEKLEVESDGNDTTI

Query:  ASFLKNISKKKKH
        A  LK+  K+KKH
Subjt:  ASFLKNISKKKKH

A0A6J1JKV7 uncharacterized protein LOC111485355 isoform X10.0e+0072.88Show/hide
Query:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLS-PAAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSL
        MSRRSH +GGD+EL   EEDDEDD+ D+DME LRRACRL G N E+Y NP LS PAAG  +      GSDSDDVDDLEL+RNI+NRFSIAAD++      
Subjt:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLS-PAAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSL

Query:  HPLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNKPDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIR
         PLS LPPV+ DEEEDDFETLR+IQRRF+AYESD LSNKPDQSCD  GPL+MDS+ T+V R TSS RSSM+A EKG+LPKAALAFIDAIKKNRSQQKF+R
Subjt:  HPLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNKPDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIR

Query:  SKMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKP-QTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWS
        SKMIHLEARIEENKKLR+RFK+LK FQ SCRR+T   LSQM+DPRVQLISA KP Q KDSSKKDK+LSAM YGPAENSHVACYR+ALTKF   V+RK+WS
Subjt:  SKMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKP-QTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWS

Query:  NVERENLGKGIRQQFQEMVLQISMDQISGLQGLSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWS
        N ERENLGKGIRQQFQEMVLQIS+DQIS +QG SA+SDDLDNI ASIKDLDITPE+IREFLPKVNWDKLAS YL GRSGAECEARWLNFEDPLINRNPW+
Subjt:  NVERENLGKGIRQQFQEMVLQISMDQISGLQGLSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWS

Query:  TSEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPART
        TSEDKNLL TIQQKGLNNWI++AVSLGTNRTPFQ LSRYQRSLNASILK EWTKDEDDKLRSAVA+ G GDWQA+ASTLEGR G QCSNRWKKSLDPART
Subjt:  TSEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPART

Query:  KRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFP
        KRG FTPDED+RLKIAVLLFGPKNWNKKAEF+PGRNQVQCRERWFNCLDPSLR+CEWTEEEDLRLEIAIQEHGYSW KVAACVPSRTDNECRRRWKKLFP
Subjt:  KRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFP

Query:  NEVPLLQEARRIQKAALISNFVDRESERPSLGPTDFRPIPNTNLLCNADDPNPAPKRNVKPSLLNFILFFYS---DAPKRRKSNNHSNQ-----------
        N+VPLLQEAR+IQK ALISNFVDRESERP+LGPTDFRP+PN++LLCN DDP  APKRNV+   +       S   DAPK+RKSNN  N+           
Subjt:  NEVPLLQEARRIQKAALISNFVDRESERPSLGPTDFRPIPNTNLLCNADDPNPAPKRNVKPSLLNFILFFYS---DAPKRRKSNNHSNQ-----------

Query:  -ANGATEQVSKKPQRRQNGNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLN-SSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEEY
          +   E  S KPQR++  +GAYT +RKG  +   N+E+CAEQN +T +LEVQLN    + R NS+CPE V ENGME  ENK AE  S+  + FSEQEE 
Subjt:  -ANGATEQVSKKPQRRQNGNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLN-SSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEEY

Query:  QNSTGSSGVSVLSEMMNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRR
        QNSTGSSGVSVLSEM NDMDEYNPS LPDT  LA  T  DDI E K   VAD DL  SNSFSLP  CL LRT DSEGVDS SV E TDKS +V K QGRR
Subjt:  QNSTGSSGVSVLSEMMNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRR

Query:  RRTNQ------------PKKELERSANKELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKK-TTTHKRKFDGGSSEKLEVESDGNDTT
        ++ ++             + ELE S   ELH  NQ KKRKH+ST TS  GT+E VEE DDCTL GFLQK+LK+ TTTH +K DG SS   EV++D ND T
Subjt:  RRTNQ------------PKKELERSANKELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKK-TTTHKRKFDGGSSEKLEVESDGNDTT

Query:  IASFLKNISKKKKH
        +A  LK   K+KKH
Subjt:  IASFLKNISKKKKH

SwissProt top hitse value%identityAlignment
P46200 Transcriptional activator Myb8.2e-3138.64Show/hide
Query:  KREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCL
        K  WT++ED+KL+  V   G  DW+ IA+ L  R   QC +RW+K L+P   K G +T +ED R+   V  +GPK W+  A+ + GR   QCRERW N L
Subjt:  KREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCL

Query:  DPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPNEVP---LLQEARRIQKAALISNF
        +P ++K  WTEEED  +  A +  G  W ++A  +P RTDN  +  W      +V     LQE+ +  + A+ ++F
Subjt:  DPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPNEVP---LLQEARRIQKAALISNF

P91868 snRNA-activating protein complex subunit 4 homolog5.7e-3232.71Show/hide
Query:  VNWDKLASF-YLGGRSGAECEARWLNFEDPLINRNPWSTSEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRS
        V W  +A+F + G R+    +++W N  +P  N+  WS  E + L    +     +W  +A++LGTNRT +QC+ +Y+  ++     +EW++DED KL +
Subjt:  VNWDKLASF-YLGGRSGAECEARWLNFEDPLINRNPWSTSEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRS

Query:  AVAVLGVG---DWQAIASTLEGRAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCE-WT
           +  +     W  +A  + GR   Q   R+  +LD A  K G +T  ED  L  AV  +G K+W K A+ V  RN  QCRERW N L+ S    E +T
Subjt:  AVAVLGVG---DWQAIASTLEGRAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCE-WT

Query:  EEEDLRLEIAIQEHGY-SWTKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARRIQKAALISNFVD
          ED +L  A++  G  +W K    +P +T  + RRR+ +L          A +++ AA   N VD
Subjt:  EEEDLRLEIAIQEHGY-SWTKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARRIQKAALISNFVD

Q54NA6 Myb-like protein L2.1e-5836.98Show/hide
Query:  PAENSHVACYRMALTKFPLSVNRKKWSNVERENLGKGIRQQ-FQEMVLQISMDQIS---------GLQGLSADSDDLDNIF------ASIKDLDITPERI
        PA+N      R+     PL    ++W+  E E L KGI+++  Q+ + ++S D++S          +Q  S ++++ +N         SIKD        
Subjt:  PAENSHVACYRMALTKFPLSVNRKKWSNVERENLGKGIRQQ-FQEMVLQISMDQIS---------GLQGLSADSDDLDNIF------ASIKDLDITPERI

Query:  REFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWSTSEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDED
         + + +V  + L       RS  E   RW N +DP IN+ P++  EDK LL   ++   + W  I++ LGTNRTP  C+ RYQRSLN+ ++KREWTK+ED
Subjt:  REFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWSTSEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDED

Query:  DKLRSAVAVLGVG---DWQAIASTLEGRAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRK
        + L   + +   G   DWQ I   + GR G QC +RW K+LDP+  K+G ++P+ED  L  AV  +G  NW      V GR  VQCRER+ N LDP L K
Subjt:  DKLRSAVAVLGVG---DWQAIASTLEGRAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRK

Query:  CEWTEEEDLRLEIAIQEHGY-SWTKVAACVPSRTDNECRRRWKKL--FPNEVPLLQEARRIQKAALISNFVDRESERPSLGPTD
          WT +ED RL     + G   W+ VA  + +RTDN+C RRWK+L    N +   QE    +K   +SNF  R+ ER  L   D
Subjt:  CEWTEEEDLRLEIAIQEHGY-SWTKVAACVPSRTDNECRRRWKKL--FPNEVPLLQEARRIQKAALISNFVDRESERPSLGPTD

Q5SXM2 snRNA-activating protein complex subunit 42.3e-4128.48Show/hide
Query:  IKKNRSQQKFIRSKMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTK
        ++ N   Q+ I+ K+      + +N++  Q+ ++++D   S  + T  +  + + P   +    KP  KD             GP  N           K
Subjt:  IKKNRSQQKFIRSKMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTK

Query:  FPLSVNRKKWSNVERENLGKGIRQQFQEMVLQISMDQISGL-QGLSADSDDLD---------NIFASIKDLDITPER--IREFLPKVNWDKLASF-YLGG
            +   KW N E+  L K +     + +LQ  + ++  L Q  S  S +L+              I+D++  PE   +   L   +W+K+++  + G 
Subjt:  FPLSVNRKKWSNVERENLGKGIRQQFQEMVLQISMDQISGL-QGLSADSDDLD---------NIFASIKDLDITPER--IREFLPKVNWDKLASF-YLGG

Query:  RSGAECEARWLNFEDPLINRNPWSTSEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGD---WQ
        RS  E    W N E P IN+  WS  E++ L       G   W  IA  LGT+R+ FQCL ++Q+  N ++ ++EWT++ED  L   V  + VG    ++
Subjt:  RSGAECEARWLNFEDPLINRNPWSTSEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGD---WQ

Query:  AIASTLEGRAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHG
         I   +EGR   Q   RW KSLDP   K+G + P+ED +L  AV  +G ++W K  E VPGR+  QCR+R+   L  SL+K  W  +E+ +L   I+++G
Subjt:  AIASTLEGRAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHG

Query:  YS-WTKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARRIQKAALISNFVDRESERPSLG
           W K+A+ +P R+ ++C  +WK +   +  L +  RR + +   S+     S   S G
Subjt:  YS-WTKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARRIQKAALISNFVDRESERPSLG

Q8BP86 snRNA-activating protein complex subunit 42.8e-3931.76Show/hide
Query:  KWSNVERENLGKGIRQQFQEMVLQISM----------DQISGLQGLSADSDDLDNIFASIKDLDITPER--IREFLPKVNWDKLASF-YLGGRSGAECEA
        KW + E+  L K +     + +LQ  +           ++S      A    +      I+D++  PE   +   L   +W+K+++  + G RS  E   
Subjt:  KWSNVERENLGKGIRQQFQEMVLQISM----------DQISGLQGLSADSDDLDNIFASIKDLDITPER--IREFLPKVNWDKLASF-YLGGRSGAECEA

Query:  RWLNFEDPLINRNPWSTSEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGD---WQAIASTLEG
         W + E P I++  WST E + L       G   W  +A  LGT+R+ FQCL ++Q+  N ++ ++EWT++ED  L   V  + VG+   ++ I   +EG
Subjt:  RWLNFEDPLINRNPWSTSEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGD---WQAIASTLEG

Query:  RAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYS-WTKVA
        R   Q   RW KSLDP+  KRG + P+ED +L  AV  +G ++W K  E VPGR+  QCR+R+   L  SL+K  W  +E+ +L   I+++G   W ++A
Subjt:  RAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYS-WTKVA

Query:  ACVPSRTDNECRRRWKKL
        + +P R+ ++C  +WK L
Subjt:  ACVPSRTDNECRRRWKKL

Arabidopsis top hitse value%identityAlignment
AT3G09370.1 myb domain protein 3r-31.5e-2740.14Show/hide
Query:  KREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCL
        K  WT +ED+ LR AV       W+ IA +   R   QC +RW+K L+P   K G +T +ED ++   V  +GP  W+  A+ +PGR   QCRERW N L
Subjt:  KREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCL

Query:  DPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRW
        +P + K  WT EE++ L  A + HG  W ++A  +P RTDN  +  W
Subjt:  DPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRW

AT3G09370.2 myb domain protein 3r-31.5e-2740.14Show/hide
Query:  KREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCL
        K  WT +ED+ LR AV       W+ IA +   R   QC +RW+K L+P   K G +T +ED ++   V  +GP  W+  A+ +PGR   QCRERW N L
Subjt:  KREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCL

Query:  DPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRW
        +P + K  WT EE++ L  A + HG  W ++A  +P RTDN  +  W
Subjt:  DPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRW

AT3G18100.1 myb domain protein 4r17.3e-16845.71Show/hide
Query:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPAAGAGSF-----GGSKPGSDSDDVDDLELVRNIRNR----------
        M+R S  E  D      ++DDE+D   ED+E LRRAC +   N +++       A+  GS      GG +  SDS++ DD E++R I+++          
Subjt:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPAAGAGSF-----GGSKPGSDSDDVDDLELVRNIRNR----------

Query:  ------FSIAADDEDQ-----------ALSLHPLSTLPPV--SPDEEEDDFETLRAIQRRFSAYE---------SDALSNKPD-QSCDFGGPLEMDSNET
               S+ +D E +            LSL    +LPP+  S DEE+D FETLRAI+RRFSAY+         +D+   K    + D     E+ S   
Subjt:  ------FSIAADDEDQ-----------ALSLHPLSTLPPV--SPDEEEDDFETLRAIQRRFSAYE---------SDALSNKPD-QSCDFGGPLEMDSNET

Query:  DVGRQTSSGRSSM------------LALEKGNLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPR
                G+S +            +     + P+AA AF+DAI++NR+ QKF+R K+  +EA IE+N+K ++  +I+KDFQ+SC+R T   L Q  DPR
Subjt:  DVGRQTSSGRSSM------------LALEKGNLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPR

Query:  VQLISAQKPQTKDSSK----------KDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNVERENLGKGIRQQFQEMVLQISMDQISGLQGLSAD
        V+LIS +K    DSS+           DK++S +  GPAEN  V  YRMAL K+P+SV R+KWS  E +NL KG++Q+ Q+++L  ++++ S L+G +  
Subjt:  VQLISAQKPQTKDSSK----------KDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNVERENLGKGIRQQFQEMVLQISMDQISGLQGLSAD

Query:  SDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWSTSEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCL
          D+D I  SI +L+ITPE IR+FLPK+NWD L    +  RS AECEARW++ EDPLIN  PW+ +EDKNLL TI+Q  L +W++IAVSLGTNRTPFQCL
Subjt:  SDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWSTSEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCL

Query:  SRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRN
        +RYQRSLN SILK+EWT +EDD+LR+AV + G  DWQ++A+ L+GR GTQCSNRWKKSL P  T++G ++ +ED R+K+AV LFG +NW+K ++FVPGR 
Subjt:  SRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRN

Query:  QVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARRIQKAALISNFVDRESERPSLGPTDF
        Q QCRERW NCLDP + + +WTEEED +L  AI EHGYSW+KVA  +  RTDN+C RRWK+L+P++V LLQEARR+QK A + NFVDRESERP+L  +  
Subjt:  QVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARRIQKAALISNFVDRESERPSLGPTDF

Query:  RPIPNTNLLCNADDPNPAPKRNVKPSLLNFILFFYSDA---PKRRK
          +P+ +L    D      KR  K           SDA   PKRR+
Subjt:  RPIPNTNLLCNADDPNPAPKRNVKPSLLNFILFFYSDA---PKRRK

AT3G18100.2 myb domain protein 4r12.1e-15453.59Show/hide
Query:  NLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSK----------KDKQL
        + P+AA AF+DAI++NR+ QKF+R K+  +EA IE+N+K ++  +I+KDFQ+SC+R T   L Q  DPRV+LIS +K    DSS+           DK++
Subjt:  NLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSK----------KDKQL

Query:  SAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNVERENLGKGIRQQFQEMVLQISMDQISGLQGLSADSDDLDNIFASIKDLDITPERIREFLPKVNWD
        S +  GPAEN  V  YRMAL K+P+SV R+KWS  E +NL KG++Q+ Q+++L  ++++ S L+G +    D+D I  SI +L+ITPE IR+FLPK+NWD
Subjt:  SAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNVERENLGKGIRQQFQEMVLQISMDQISGLQGLSADSDDLDNIFASIKDLDITPERIREFLPKVNWD

Query:  KLASFYLGGRSGAECEARWLNFEDPLINRNPWSTSEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVL
         L    +  RS AECEARW++ EDPLIN  PW+ +EDKNLL TI+Q  L +W++IAVSLGTNRTPFQCL+RYQRSLN SILK+EWT +EDD+LR+AV + 
Subjt:  KLASFYLGGRSGAECEARWLNFEDPLINRNPWSTSEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVL

Query:  GVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEI
        G  DWQ++A+ L+GR GTQCSNRWKKSL P  T++G ++ +ED R+K+AV LFG +NW+K ++FVPGR Q QCRERW NCLDP + + +WTEEED +L  
Subjt:  GVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEI

Query:  AIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARRIQKAALISNFVDRESERPSLGPTDFRPIPNTNLLCNADDPNPAPKRNVKPSLLNFI
        AI EHGYSW+KVA  +  RTDN+C RRWK+L+P++V LLQEARR+QK A + NFVDRESERP+L  +    +P+ +L    D      KR  K       
Subjt:  AIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARRIQKAALISNFVDRESERPSLGPTDFRPIPNTNLLCNADDPNPAPKRNVKPSLLNFI

Query:  LFFYSDA---PKRRK
            SDA   PKRR+
Subjt:  LFFYSDA---PKRRK

AT3G18100.3 myb domain protein 4r15.2e-15044.14Show/hide
Query:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPAAGAGSF-----GGSKPGSDSDDVDDLELVRNIRNR----------
        M+R S  E  D      ++DDE+D   ED+E LRRAC +   N +++       A+  GS      GG +  SDS++ DD E++R I+++          
Subjt:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPAAGAGSF-----GGSKPGSDSDDVDDLELVRNIRNR----------

Query:  ------FSIAADDEDQ-----------ALSLHPLSTLPPV--SPDEEEDDFETLRAIQRRFSAYESDALSNKPDQSCDFGGPLEMDSNETDVGRQTSSGR
               S+ +D E +            LSL    +LPP+  S DEE+D FETLRAI+RRFSAY+            +FG   +  ++     +Q +  +
Subjt:  ------FSIAADDEDQ-----------ALSLHPLSTLPPV--SPDEEEDDFETLRAIQRRFSAYESDALSNKPDQSCDFGGPLEMDSNETDVGRQTSSGR

Query:  SSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEEN-KKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSK-----
         S+   ++          +    K   +      +  H+   +EEN +KL+QR        SS  R T     +M DPRV+LIS +K    DSS+     
Subjt:  SSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEEN-KKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSK-----

Query:  -----KDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNVERENLGKGIRQQFQEMVLQISMDQISGLQGLSADSDDLDNIFASIKDLDITPERI
              DK++S +  GPAEN  V  YRMAL K+P+SV R+KWS  E +NL KG++Q+ Q+++L  ++++ S L+G +    D+D I  SI +L+ITPE I
Subjt:  -----KDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNVERENLGKGIRQQFQEMVLQISMDQISGLQGLSADSDDLDNIFASIKDLDITPERI

Query:  REFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWSTSEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDED
        R+FLPK+NWD L    +  RS AECEARW++ EDPLIN  PW+ +EDKNLL TI+Q  L +W++IAVSLGTNRTPFQCL+RYQRSLN SILK+EWT +ED
Subjt:  REFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWSTSEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDED

Query:  DKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEW
        D+LR+AV + G  DWQ++A+ L+GR GTQCSNRWKKSL P  T++G ++ +ED R+K+AV LFG +NW+K ++FVPGR Q QCRERW NCLDP + + +W
Subjt:  DKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEW

Query:  TEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARRIQKAALISNFVDRESERPSLGPTDFRPIPNTNLLCNADDPNPAPKR
        TEEED +L  AI EHGYSW+KVA  +  RTDN+C RRWK+L+P++V LLQEARR+QK A + NFVDRESERP+L  +    +P+ +L    D      KR
Subjt:  TEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARRIQKAALISNFVDRESERPSLGPTDFRPIPNTNLLCNADDPNPAPKR

Query:  NVKPSLLNFILFFYSDA---PKRRK
          K           SDA   PKRR+
Subjt:  NVKPSLLNFILFFYSDA---PKRRK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCGCCGCAGCCATAACGAAGGCGGTGACCAGGAGCTCTCCACCGGCGAGGAAGACGATGAAGATGATGTGTTTGATGAGGACATGGAAGCCCTCCGGCGAGCCTG
TAGGCTTGTTGGAGCTAATCCCGAAGAGTATAATAATCCTCCCTTGTCTCCTGCCGCCGGAGCTGGCTCGTTCGGCGGTAGCAAACCTGGCTCTGATTCTGATGATGTTG
ATGATCTCGAACTTGTTCGGAATATACGGAACCGGTTCTCGATTGCGGCCGATGATGAGGACCAGGCTTTGTCTCTGCATCCGTTGAGTACTCTCCCACCGGTGTCGCCG
GACGAGGAGGAGGACGATTTCGAGACGCTTCGCGCGATTCAGCGGCGGTTTTCGGCGTATGAAAGTGATGCTTTGAGCAATAAGCCCGATCAGTCTTGTGACTTTGGTGG
GCCTCTAGAGATGGATTCCAACGAAACAGATGTTGGGAGACAGACTTCCTCAGGAAGGTCGTCTATGCTAGCCCTTGAAAAGGGAAATTTGCCTAAGGCTGCATTGGCAT
TTATTGATGCAATCAAGAAGAATAGGTCTCAGCAGAAGTTTATTCGAAGTAAGATGATTCACCTTGAAGCTAGAATTGAGGAGAACAAAAAGCTTAGACAACGTTTCAAA
ATTCTCAAAGATTTCCAGAGTTCATGTAGAAGGAGAACAGGTTATCAACTGTCTCAGATGATTGATCCTCGAGTCCAGTTAATTTCAGCACAAAAACCACAGACTAAGGA
TTCATCAAAGAAGGACAAACAGTTATCTGCAATGTATTATGGCCCAGCTGAGAATTCTCATGTTGCATGCTACAGAATGGCATTGACAAAGTTTCCACTTTCAGTAAATC
GAAAAAAATGGTCCAATGTAGAAAGGGAGAATCTTGGGAAGGGAATAAGACAGCAATTTCAAGAAATGGTTCTTCAAATTTCAATGGATCAAATCAGTGGGCTACAAGGA
CTTTCAGCAGATTCAGATGATCTGGATAACATTTTTGCATCAATAAAAGACCTTGACATTACTCCTGAGAGGATTAGGGAATTTCTGCCAAAAGTAAATTGGGACAAATT
GGCTTCCTTTTATCTTGGGGGTCGCTCAGGTGCAGAATGTGAAGCCAGGTGGTTGAATTTTGAAGATCCTCTAATTAATCGGAATCCATGGTCTACAAGTGAGGATAAGA
ATCTTTTGCTTACTATCCAACAGAAAGGGCTGAATAACTGGATTAACATAGCAGTTTCATTGGGTACGAACAGAACTCCTTTTCAGTGCTTGTCTCGGTATCAAAGGAGT
TTAAATGCTTCCATATTAAAGAGGGAGTGGACCAAAGATGAGGATGATAAACTCCGATCTGCTGTTGCCGTTCTTGGTGTAGGAGATTGGCAGGCAATAGCTTCTACTTT
GGAAGGACGAGCTGGTACACAGTGCTCTAATAGATGGAAAAAATCCCTTGACCCAGCTAGGACAAAAAGAGGCGCTTTCACTCCAGATGAAGACAATCGCTTGAAAATTG
CCGTACTGCTTTTTGGGCCTAAAAATTGGAACAAGAAAGCAGAATTTGTACCTGGTCGAAATCAAGTTCAATGCAGAGAAAGATGGTTCAATTGTTTAGATCCTTCCTTG
AGAAAGTGTGAATGGACAGAAGAGGAGGATTTAAGGCTCGAAATAGCAATTCAGGAGCATGGATATAGCTGGACAAAGGTAGCTGCATGTGTGCCTTCACGTACAGATAA
TGAGTGCCGGAGGAGATGGAAGAAGTTATTCCCCAATGAAGTTCCCTTGCTCCAGGAAGCTAGAAGGATTCAGAAGGCTGCTCTTATTAGCAACTTTGTTGATAGAGAAT
CAGAGCGTCCTTCTTTAGGTCCTACTGACTTTCGACCTATACCCAATACAAATTTATTATGTAATGCGGATGATCCAAATCCTGCCCCAAAAAGAAATGTCAAGCCGAGT
CTATTGAACTTCATTTTATTTTTCTACAGCGATGCTCCAAAGAGGAGGAAATCAAATAACCATAGCAATCAAGCTAATGGAGCAACTGAGCAGGTATCTAAAAAACCTCA
AAGAAGACAAAATGGAAATGGAGCTTATACTGCTAAGAGGAAAGGGGTTCTGGAGCCACGTCCTAACAATGAGAAATGTGCTGAACAAAATTTGGAAACTGAGAGCCTCG
AGGTGCAGCTGAATAGTAGCGTATCAGGGAGGACCAACAGTGAGTGCCCCGAGATTGTTTATGAGAATGGTATGGAGGAATGTGAGAACAAAGTGGCAGAGAAGCTTTCT
AAAAGTGATTTATTCTTTTCAGAACAGGAAGAATATCAGAACTCGACAGGGTCTTCTGGAGTGTCAGTATTGTCAGAAATGATGAACGACATGGACGAGTATAATCCCTC
TATCCTTCCAGACACAGCACCGTTGGCTTGTACTACTGGGGATGACGATATAACGGAAAGGAAGGACACAAGGGTTGCAGACACGGATCTGGTTGACAGTAACAGTTTCT
CGTTACCGCATGGTTGCTTAGGACTCAGGACTAATGACAGTGAAGGCGTCGATAGCTGTTCTGTCGGTGAAACTACAGATAAAAGCGATATGGTTTATAAGCACCAAGGT
AGAAGGAGGAGAACTAATCAACCAAAGAAGGAGCTGGAGAGATCAGCGAACAAGGAGCTTCATCCTCATAACCAACCAAAGAAGCGAAAGCATAACAGCACATATACAAG
TGAGTCGGGAACATTGGAGCCAGTCGAAGAAGCAGATGACTGCACTCTCCAAGGCTTTCTGCAAAAGAAATTGAAGAAGACAACCACTCATAAAAGGAAATTTGATGGCG
GTTCTAGCGAAAAACTAGAAGTTGAAAGCGATGGTAATGATACTACTATCGCCTCGTTTCTCAAGAATATATCGAAGAAAAAGAAGCATAAAGCTCCTAATGGT
mRNA sequenceShow/hide mRNA sequence
ATGTCTCGCCGCAGCCATAACGAAGGCGGTGACCAGGAGCTCTCCACCGGCGAGGAAGACGATGAAGATGATGTGTTTGATGAGGACATGGAAGCCCTCCGGCGAGCCTG
TAGGCTTGTTGGAGCTAATCCCGAAGAGTATAATAATCCTCCCTTGTCTCCTGCCGCCGGAGCTGGCTCGTTCGGCGGTAGCAAACCTGGCTCTGATTCTGATGATGTTG
ATGATCTCGAACTTGTTCGGAATATACGGAACCGGTTCTCGATTGCGGCCGATGATGAGGACCAGGCTTTGTCTCTGCATCCGTTGAGTACTCTCCCACCGGTGTCGCCG
GACGAGGAGGAGGACGATTTCGAGACGCTTCGCGCGATTCAGCGGCGGTTTTCGGCGTATGAAAGTGATGCTTTGAGCAATAAGCCCGATCAGTCTTGTGACTTTGGTGG
GCCTCTAGAGATGGATTCCAACGAAACAGATGTTGGGAGACAGACTTCCTCAGGAAGGTCGTCTATGCTAGCCCTTGAAAAGGGAAATTTGCCTAAGGCTGCATTGGCAT
TTATTGATGCAATCAAGAAGAATAGGTCTCAGCAGAAGTTTATTCGAAGTAAGATGATTCACCTTGAAGCTAGAATTGAGGAGAACAAAAAGCTTAGACAACGTTTCAAA
ATTCTCAAAGATTTCCAGAGTTCATGTAGAAGGAGAACAGGTTATCAACTGTCTCAGATGATTGATCCTCGAGTCCAGTTAATTTCAGCACAAAAACCACAGACTAAGGA
TTCATCAAAGAAGGACAAACAGTTATCTGCAATGTATTATGGCCCAGCTGAGAATTCTCATGTTGCATGCTACAGAATGGCATTGACAAAGTTTCCACTTTCAGTAAATC
GAAAAAAATGGTCCAATGTAGAAAGGGAGAATCTTGGGAAGGGAATAAGACAGCAATTTCAAGAAATGGTTCTTCAAATTTCAATGGATCAAATCAGTGGGCTACAAGGA
CTTTCAGCAGATTCAGATGATCTGGATAACATTTTTGCATCAATAAAAGACCTTGACATTACTCCTGAGAGGATTAGGGAATTTCTGCCAAAAGTAAATTGGGACAAATT
GGCTTCCTTTTATCTTGGGGGTCGCTCAGGTGCAGAATGTGAAGCCAGGTGGTTGAATTTTGAAGATCCTCTAATTAATCGGAATCCATGGTCTACAAGTGAGGATAAGA
ATCTTTTGCTTACTATCCAACAGAAAGGGCTGAATAACTGGATTAACATAGCAGTTTCATTGGGTACGAACAGAACTCCTTTTCAGTGCTTGTCTCGGTATCAAAGGAGT
TTAAATGCTTCCATATTAAAGAGGGAGTGGACCAAAGATGAGGATGATAAACTCCGATCTGCTGTTGCCGTTCTTGGTGTAGGAGATTGGCAGGCAATAGCTTCTACTTT
GGAAGGACGAGCTGGTACACAGTGCTCTAATAGATGGAAAAAATCCCTTGACCCAGCTAGGACAAAAAGAGGCGCTTTCACTCCAGATGAAGACAATCGCTTGAAAATTG
CCGTACTGCTTTTTGGGCCTAAAAATTGGAACAAGAAAGCAGAATTTGTACCTGGTCGAAATCAAGTTCAATGCAGAGAAAGATGGTTCAATTGTTTAGATCCTTCCTTG
AGAAAGTGTGAATGGACAGAAGAGGAGGATTTAAGGCTCGAAATAGCAATTCAGGAGCATGGATATAGCTGGACAAAGGTAGCTGCATGTGTGCCTTCACGTACAGATAA
TGAGTGCCGGAGGAGATGGAAGAAGTTATTCCCCAATGAAGTTCCCTTGCTCCAGGAAGCTAGAAGGATTCAGAAGGCTGCTCTTATTAGCAACTTTGTTGATAGAGAAT
CAGAGCGTCCTTCTTTAGGTCCTACTGACTTTCGACCTATACCCAATACAAATTTATTATGTAATGCGGATGATCCAAATCCTGCCCCAAAAAGAAATGTCAAGCCGAGT
CTATTGAACTTCATTTTATTTTTCTACAGCGATGCTCCAAAGAGGAGGAAATCAAATAACCATAGCAATCAAGCTAATGGAGCAACTGAGCAGGTATCTAAAAAACCTCA
AAGAAGACAAAATGGAAATGGAGCTTATACTGCTAAGAGGAAAGGGGTTCTGGAGCCACGTCCTAACAATGAGAAATGTGCTGAACAAAATTTGGAAACTGAGAGCCTCG
AGGTGCAGCTGAATAGTAGCGTATCAGGGAGGACCAACAGTGAGTGCCCCGAGATTGTTTATGAGAATGGTATGGAGGAATGTGAGAACAAAGTGGCAGAGAAGCTTTCT
AAAAGTGATTTATTCTTTTCAGAACAGGAAGAATATCAGAACTCGACAGGGTCTTCTGGAGTGTCAGTATTGTCAGAAATGATGAACGACATGGACGAGTATAATCCCTC
TATCCTTCCAGACACAGCACCGTTGGCTTGTACTACTGGGGATGACGATATAACGGAAAGGAAGGACACAAGGGTTGCAGACACGGATCTGGTTGACAGTAACAGTTTCT
CGTTACCGCATGGTTGCTTAGGACTCAGGACTAATGACAGTGAAGGCGTCGATAGCTGTTCTGTCGGTGAAACTACAGATAAAAGCGATATGGTTTATAAGCACCAAGGT
AGAAGGAGGAGAACTAATCAACCAAAGAAGGAGCTGGAGAGATCAGCGAACAAGGAGCTTCATCCTCATAACCAACCAAAGAAGCGAAAGCATAACAGCACATATACAAG
TGAGTCGGGAACATTGGAGCCAGTCGAAGAAGCAGATGACTGCACTCTCCAAGGCTTTCTGCAAAAGAAATTGAAGAAGACAACCACTCATAAAAGGAAATTTGATGGCG
GTTCTAGCGAAAAACTAGAAGTTGAAAGCGATGGTAATGATACTACTATCGCCTCGTTTCTCAAGAATATATCGAAGAAAAAGAAGCATAAAGCTCCTAATGGT
Protein sequenceShow/hide protein sequence
MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPAAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSLHPLSTLPPVSP
DEEEDDFETLRAIQRRFSAYESDALSNKPDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRQRFK
ILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNVERENLGKGIRQQFQEMVLQISMDQISGLQG
LSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWSTSEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRS
LNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSL
RKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARRIQKAALISNFVDRESERPSLGPTDFRPIPNTNLLCNADDPNPAPKRNVKPS
LLNFILFFYSDAPKRRKSNNHSNQANGATEQVSKKPQRRQNGNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLNSSVSGRTNSECPEIVYENGMEECENKVAEKLS
KSDLFFSEQEEYQNSTGSSGVSVLSEMMNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQG
RRRRTNQPKKELERSANKELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTTHKRKFDGGSSEKLEVESDGNDTTIASFLKNISKKKKHKAPNG