; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC06g0340 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC06g0340
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptiontranscription factor MYB3R-2 isoform X3
Genome locationMC06:2719219..2727645
RNA-Seq ExpressionMC06g0340
SyntenyMC06g0340
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589626.1 Myb-like protein L, partial [Cucurbita argyrosperma subsp. sororia]0.073.74Show/hide
Query:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLS-PTAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSL
        MSRRSH +GGD+EL   EEDDEDD+ D+DMEALRRACRL G N E+Y NP LS P AG  + G     SDSDDVDDLEL+RNI+NRFSIAAD++      
Subjt:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLS-PTAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSL

Query:  HPLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNNLDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIR
         PLS LPPV+ DEEEDDFETLR+IQRRF+AYESD LSN  DQSCD  GPL+MDS+ TDV R TSS RSSM+A EKG+LPKAALAFIDAIKKNRSQQKFIR
Subjt:  HPLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNNLDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIR

Query:  SKMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSN
        SKMIHLEARIEENKKLR+RFK+LK FQ SCRR+T   L+QM+DPRVQLISA KPQ KDSSKKDK+LSAM YGPAENSHVACYR ALTKF   V+RK+WSN
Subjt:  SKMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSN

Query:  VERENLGKGIRQQFQEMVLQISMDQISGLQGFSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWST
         ERENLGKGIRQQFQEMVLQIS+DQIS +QGFSA+SDDLDNI ASIKDLDITPE+IREFLPKVNWDKLAS YL GRSGAECEARWLNFEDPLINRN W+T
Subjt:  VERENLGKGIRQQFQEMVLQISMDQISGLQGFSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWST

Query:  SEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTK
        SEDKNLL TIQQKGLNNWI +AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDEDDKLRSAVA+ G GDWQA+ASTLEGR G QCSNRWKKSLDPARTK
Subjt:  SEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTK

Query:  RGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPN
        RG FTPDED+RLKIAVLLFGPKNWNKKAEF+PGRNQVQCRERWFNCLDPSLR+CEWTEEEDLRLEIAIQEHGYSW KVAACVPSRTDNECRRRWKKLFPN
Subjt:  RGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPN

Query:  EVPLLQEARRIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNADDPNPAPKRNVKPRTR-VPRKEKNATGDAPKRRKSNNHSNQANGATEQV----
        +VPLLQEAR+IQK ALISNFVDRESERPALGPTDFRP+PN++LLCN DDP  AP RNV+ R   V R EK+A GDAPK+RKSNN  N+A+  T QV    
Subjt:  EVPLLQEARRIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNADDPNPAPKRNVKPRTR-VPRKEKNATGDAPKRRKSNNHSNQANGATEQV----

Query:  ---------SKKPQRRQNRNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLN-SSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEEC
                 S KPQR++ R+GAYT +RKG  +   N+E+CAEQN +T+SLEVQLN    + R NS+CPE V ENGME  ENK AE  S+  + FSEQEE 
Subjt:  ---------SKKPQRRQNRNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLN-SSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEEC

Query:  QNSTGSSGVSVLSEMMNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRR
        QNSTGSSGVSVLSEM NDMDEYNPS  PDT  LA  T DD I E K   VAD DL DSNSFSLP  CL LRT DSEGVDS SV E TDKS  V K QGRR
Subjt:  QNSTGSSGVSVLSEMMNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRR

Query:  RRTNQ------------PKKELERSANKELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTT-HKRKFDGGSSDKLEVESNGNDTT
        ++ ++             + E E S   ELH  NQ KKRKH+ T TS  GT+E VEE DDCTLQGFLQK+LK+TTT H +K DG SS   EV+++ ND T
Subjt:  RRTNQ------------PKKELERSANKELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTT-HKRKFDGGSSDKLEVESNGNDTT

Query:  IASFLKNISKKKK
        +A  LK+  K+KK
Subjt:  IASFLKNISKKKK

XP_022135114.1 uncharacterized protein LOC111007172 isoform X1 [Momordica charantia]0.099.7Show/hide
Query:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPTAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSLH
        MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPTAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSLH
Subjt:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPTAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSLH

Query:  PLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNNLDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRS
        PLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNNLDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRS
Subjt:  PLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNNLDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRS

Query:  KMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNV
        KMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNV
Subjt:  KMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNV

Query:  ERENLGKGIRQQFQEMVLQISMDQIS-GLQGFSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWST
        ERENLGKGIRQQFQEMVLQISMDQIS GLQGFSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWST
Subjt:  ERENLGKGIRQQFQEMVLQISMDQIS-GLQGFSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWST

Query:  SEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTK
        SEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTK
Subjt:  SEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTK

Query:  RGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPN
        RGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPN
Subjt:  RGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPN

Query:  EVPLLQEARRIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNADDPNPAPKRNVKPRTRVPRKEKNATGDAPKRRKSNNHSNQANGATEQVSKKPQ
        EVPLLQEARRIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNADDPNPAPKRNVKPRTRVPRKEKNATGDAPKRRKSNNHSNQANGATEQVSKKPQ
Subjt:  EVPLLQEARRIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNADDPNPAPKRNVKPRTRVPRKEKNATGDAPKRRKSNNHSNQANGATEQVSKKPQ

Query:  RRQNRNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLNSSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEECQNSTGSSGVSVLSEM
        RRQNRNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLNSSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFF EQEE QNSTGSSGVSVLSEM
Subjt:  RRQNRNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLNSSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEECQNSTGSSGVSVLSEM

Query:  MNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRRRTNQPKKELERSAN
        MNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRRRTNQPKKELERSAN
Subjt:  MNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRRRTNQPKKELERSAN

Query:  KELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTTHKRKFDGGSSDKLEVESNGNDTTIASFLKNISKKKKHKSS
        KELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTTHKRKFDGGSSDKLEVESNGNDTTIASFLKNISKKKKHKSS
Subjt:  KELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTTHKRKFDGGSSDKLEVESNGNDTTIASFLKNISKKKKHKSS

XP_022135115.1 uncharacterized protein LOC111007172 isoform X2 [Momordica charantia]0.099.8Show/hide
Query:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPTAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSLH
        MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPTAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSLH
Subjt:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPTAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSLH

Query:  PLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNNLDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRS
        PLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNNLDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRS
Subjt:  PLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNNLDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRS

Query:  KMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNV
        KMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNV
Subjt:  KMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNV

Query:  ERENLGKGIRQQFQEMVLQISMDQISGLQGFSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWSTS
        ERENLGKGIRQQFQEMVLQISMDQISGLQGFSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWSTS
Subjt:  ERENLGKGIRQQFQEMVLQISMDQISGLQGFSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWSTS

Query:  EDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTKR
        EDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTKR
Subjt:  EDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTKR

Query:  GAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPNE
        GAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPNE
Subjt:  GAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPNE

Query:  VPLLQEARRIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNADDPNPAPKRNVKPRTRVPRKEKNATGDAPKRRKSNNHSNQANGATEQVSKKPQR
        VPLLQEARRIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNADDPNPAPKRNVKPRTRVPRKEKNATGDAPKRRKSNNHSNQANGATEQVSKKPQR
Subjt:  VPLLQEARRIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNADDPNPAPKRNVKPRTRVPRKEKNATGDAPKRRKSNNHSNQANGATEQVSKKPQR

Query:  RQNRNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLNSSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEECQNSTGSSGVSVLSEMM
        RQNRNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLNSSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFF EQEE QNSTGSSGVSVLSEMM
Subjt:  RQNRNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLNSSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEECQNSTGSSGVSVLSEMM

Query:  NDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRRRTNQPKKELERSANK
        NDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRRRTNQPKKELERSANK
Subjt:  NDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRRRTNQPKKELERSANK

Query:  ELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTTHKRKFDGGSSDKLEVESNGNDTTIASFLKNISKKKKHKSS
        ELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTTHKRKFDGGSSDKLEVESNGNDTTIASFLKNISKKKKHKSS
Subjt:  ELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTTHKRKFDGGSSDKLEVESNGNDTTIASFLKNISKKKKHKSS

XP_022135116.1 transcription factor MYB3R-2 isoform X3 [Momordica charantia]0.095.35Show/hide
Query:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPTAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSLH
        MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPTAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSLH
Subjt:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPTAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSLH

Query:  PLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNNLDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRS
        PLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNNLDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRS
Subjt:  PLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNNLDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRS

Query:  KMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNV
        KMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNV
Subjt:  KMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNV

Query:  ERENLGKGIRQQFQEMVLQISMDQIS-GLQGFSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWST
        ERENLGKGIRQQFQEMVLQISMDQIS GLQGFSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWST
Subjt:  ERENLGKGIRQQFQEMVLQISMDQIS-GLQGFSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWST

Query:  SEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTK
        SEDKNLLLTIQQKGLNNWINIAVSLG                                           DWQAIASTLEGRAGTQCSNRWKKSLDPARTK
Subjt:  SEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTK

Query:  RGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPN
        RGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPN
Subjt:  RGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPN

Query:  EVPLLQEARRIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNADDPNPAPKRNVKPRTRVPRKEKNATGDAPKRRKSNNHSNQANGATEQVSKKPQ
        EVPLLQEARRIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNADDPNPAPKRNVKPRTRVPRKEKNATGDAPKRRKSNNHSNQANGATEQVSKKPQ
Subjt:  EVPLLQEARRIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNADDPNPAPKRNVKPRTRVPRKEKNATGDAPKRRKSNNHSNQANGATEQVSKKPQ

Query:  RRQNRNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLNSSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEECQNSTGSSGVSVLSEM
        RRQNRNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLNSSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFF EQEE QNSTGSSGVSVLSEM
Subjt:  RRQNRNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLNSSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEECQNSTGSSGVSVLSEM

Query:  MNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRRRTNQPKKELERSAN
        MNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRRRTNQPKKELERSAN
Subjt:  MNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRRRTNQPKKELERSAN

Query:  KELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTTHKRKFDGGSSDKLEVESNGNDTTIASFLKNISKKKKHKSS
        KELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTTHKRKFDGGSSDKLEVESNGNDTTIASFLKNISKKKKHKSS
Subjt:  KELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTTHKRKFDGGSSDKLEVESNGNDTTIASFLKNISKKKKHKSS

XP_023515735.1 uncharacterized protein LOC111779809 isoform X1 [Cucurbita pepo subsp. pepo]0.073.67Show/hide
Query:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLS-PTAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSL
        MSRRSH +GGD+EL   EEDDEDD+ D+DME LRRACRL G N E+Y NP LS P AG  + G     SDSDDVDDLEL+RNI+NRFSIAAD++      
Subjt:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLS-PTAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSL

Query:  HPLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNNLDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIR
         PLS LPPV+ DEEEDDFE LR+IQRRF+AYESD LSN  DQSCD  GPL+MDS  TDV R TSS RSSM+A EKG+LPKAALAFIDAIKKNRSQQKFIR
Subjt:  HPLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNNLDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIR

Query:  SKMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSN
        SKMIHLEARIEENKKLR+RFK+LK FQ SCRR+T   L+QM+DPRVQLISA KPQ KDSSKKDK+LS+M YGPAENSHVACYR A TKF   V+RK+WSN
Subjt:  SKMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSN

Query:  VERENLGKGIRQQFQEMVLQISMDQISGLQGFSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWST
         ERENLGKGIRQQFQEMVLQIS+DQIS +QGFSA+SDDLDNI ASIK LDITPE+IREFLPKVNWDKLAS YL GRSGAECEARWLNFEDPLINRNPW+T
Subjt:  VERENLGKGIRQQFQEMVLQISMDQISGLQGFSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWST

Query:  SEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTK
        SEDKNLL TIQQKGLNNWI +AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDEDDKLRSAVAV G GDWQA+ASTLEGR G QCSNRWKKSLDPARTK
Subjt:  SEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTK

Query:  RGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPN
        RG FTPDED+RLKIAVLLFGPKNWNKKAEF+PGRNQVQCRERWFNCLDPSLR+CEWTEEEDLRLEIAIQEHGYSW KVAACVPSRTDNECRRRWKKLFPN
Subjt:  RGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPN

Query:  EVPLLQEARRIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNADDPNPAPKRNVKPRTR-VPRKEKNATGDAPKRRKSNNHSNQANGATEQV----
        +VPLLQEAR+IQK ALISNFVDRESERPALGPTDFRP+PN++LLCN DDP  APKRNV+ R   V R EK+A GDAPKRRKSNN  N+A+  T QV    
Subjt:  EVPLLQEARRIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNADDPNPAPKRNVKPRTR-VPRKEKNATGDAPKRRKSNNHSNQANGATEQV----

Query:  ---------SKKPQRRQNRNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLN-SSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEEC
                 S KPQR++ R+GAYT +RKG  +   N+E+CAEQN +T S+EVQLN    + R NS+CPE V ENGME  ENK AE  S+  + FSEQEE 
Subjt:  ---------SKKPQRRQNRNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLN-SSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEEC

Query:  QNSTGSSGVSVLSEMMNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRR
        QNSTGSSGVSVLSEM NDMDEYNPS LPDT  LA  T DD I E K   VAD DL DSNSFSLP  CL LRT DSEGVDS SV E TDKS  V K QGRR
Subjt:  QNSTGSSGVSVLSEMMNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRR

Query:  RRTNQ------------PKKELERSANKELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTT-HKRKFDGGSSDKLEVESNGNDTT
        ++ ++             + ELE S   ELH  NQ KKRKH+ T TS  GT+E VEE DDCTLQGFLQK+LK+TTT H +K DG SS   EV+++ ND T
Subjt:  RRTNQ------------PKKELERSANKELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTT-HKRKFDGGSSDKLEVESNGNDTT

Query:  IASFLKNISKKKKH
        +A  L +  K+KKH
Subjt:  IASFLKNISKKKKH

TrEMBL top hitse value%identityAlignment
A0A6J1BZQ5 transcription factor MYB3R-2 isoform X30.095.35Show/hide
Query:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPTAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSLH
        MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPTAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSLH
Subjt:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPTAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSLH

Query:  PLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNNLDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRS
        PLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNNLDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRS
Subjt:  PLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNNLDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRS

Query:  KMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNV
        KMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNV
Subjt:  KMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNV

Query:  ERENLGKGIRQQFQEMVLQISMDQIS-GLQGFSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWST
        ERENLGKGIRQQFQEMVLQISMDQIS GLQGFSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWST
Subjt:  ERENLGKGIRQQFQEMVLQISMDQIS-GLQGFSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWST

Query:  SEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTK
        SEDKNLLLTIQQKGLNNWINIAVSLG                                           DWQAIASTLEGRAGTQCSNRWKKSLDPARTK
Subjt:  SEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTK

Query:  RGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPN
        RGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPN
Subjt:  RGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPN

Query:  EVPLLQEARRIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNADDPNPAPKRNVKPRTRVPRKEKNATGDAPKRRKSNNHSNQANGATEQVSKKPQ
        EVPLLQEARRIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNADDPNPAPKRNVKPRTRVPRKEKNATGDAPKRRKSNNHSNQANGATEQVSKKPQ
Subjt:  EVPLLQEARRIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNADDPNPAPKRNVKPRTRVPRKEKNATGDAPKRRKSNNHSNQANGATEQVSKKPQ

Query:  RRQNRNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLNSSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEECQNSTGSSGVSVLSEM
        RRQNRNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLNSSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFF EQEE QNSTGSSGVSVLSEM
Subjt:  RRQNRNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLNSSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEECQNSTGSSGVSVLSEM

Query:  MNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRRRTNQPKKELERSAN
        MNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRRRTNQPKKELERSAN
Subjt:  MNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRRRTNQPKKELERSAN

Query:  KELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTTHKRKFDGGSSDKLEVESNGNDTTIASFLKNISKKKKHKSS
        KELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTTHKRKFDGGSSDKLEVESNGNDTTIASFLKNISKKKKHKSS
Subjt:  KELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTTHKRKFDGGSSDKLEVESNGNDTTIASFLKNISKKKKHKSS

A0A6J1C075 uncharacterized protein LOC111007172 isoform X20.099.8Show/hide
Query:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPTAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSLH
        MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPTAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSLH
Subjt:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPTAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSLH

Query:  PLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNNLDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRS
        PLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNNLDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRS
Subjt:  PLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNNLDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRS

Query:  KMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNV
        KMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNV
Subjt:  KMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNV

Query:  ERENLGKGIRQQFQEMVLQISMDQISGLQGFSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWSTS
        ERENLGKGIRQQFQEMVLQISMDQISGLQGFSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWSTS
Subjt:  ERENLGKGIRQQFQEMVLQISMDQISGLQGFSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWSTS

Query:  EDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTKR
        EDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTKR
Subjt:  EDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTKR

Query:  GAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPNE
        GAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPNE
Subjt:  GAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPNE

Query:  VPLLQEARRIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNADDPNPAPKRNVKPRTRVPRKEKNATGDAPKRRKSNNHSNQANGATEQVSKKPQR
        VPLLQEARRIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNADDPNPAPKRNVKPRTRVPRKEKNATGDAPKRRKSNNHSNQANGATEQVSKKPQR
Subjt:  VPLLQEARRIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNADDPNPAPKRNVKPRTRVPRKEKNATGDAPKRRKSNNHSNQANGATEQVSKKPQR

Query:  RQNRNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLNSSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEECQNSTGSSGVSVLSEMM
        RQNRNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLNSSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFF EQEE QNSTGSSGVSVLSEMM
Subjt:  RQNRNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLNSSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEECQNSTGSSGVSVLSEMM

Query:  NDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRRRTNQPKKELERSANK
        NDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRRRTNQPKKELERSANK
Subjt:  NDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRRRTNQPKKELERSANK

Query:  ELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTTHKRKFDGGSSDKLEVESNGNDTTIASFLKNISKKKKHKSS
        ELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTTHKRKFDGGSSDKLEVESNGNDTTIASFLKNISKKKKHKSS
Subjt:  ELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTTHKRKFDGGSSDKLEVESNGNDTTIASFLKNISKKKKHKSS

A0A6J1C0J5 uncharacterized protein LOC111007172 isoform X10.099.7Show/hide
Query:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPTAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSLH
        MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPTAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSLH
Subjt:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPTAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSLH

Query:  PLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNNLDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRS
        PLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNNLDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRS
Subjt:  PLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNNLDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRS

Query:  KMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNV
        KMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNV
Subjt:  KMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNV

Query:  ERENLGKGIRQQFQEMVLQISMDQIS-GLQGFSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWST
        ERENLGKGIRQQFQEMVLQISMDQIS GLQGFSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWST
Subjt:  ERENLGKGIRQQFQEMVLQISMDQIS-GLQGFSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWST

Query:  SEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTK
        SEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTK
Subjt:  SEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTK

Query:  RGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPN
        RGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPN
Subjt:  RGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPN

Query:  EVPLLQEARRIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNADDPNPAPKRNVKPRTRVPRKEKNATGDAPKRRKSNNHSNQANGATEQVSKKPQ
        EVPLLQEARRIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNADDPNPAPKRNVKPRTRVPRKEKNATGDAPKRRKSNNHSNQANGATEQVSKKPQ
Subjt:  EVPLLQEARRIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNADDPNPAPKRNVKPRTRVPRKEKNATGDAPKRRKSNNHSNQANGATEQVSKKPQ

Query:  RRQNRNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLNSSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEECQNSTGSSGVSVLSEM
        RRQNRNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLNSSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFF EQEE QNSTGSSGVSVLSEM
Subjt:  RRQNRNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLNSSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEECQNSTGSSGVSVLSEM

Query:  MNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRRRTNQPKKELERSAN
        MNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRRRTNQPKKELERSAN
Subjt:  MNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRRRRTNQPKKELERSAN

Query:  KELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTTHKRKFDGGSSDKLEVESNGNDTTIASFLKNISKKKKHKSS
        KELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTTHKRKFDGGSSDKLEVESNGNDTTIASFLKNISKKKKHKSS
Subjt:  KELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTTHKRKFDGGSSDKLEVESNGNDTTIASFLKNISKKKKHKSS

A0A6J1E6Z7 uncharacterized protein LOC111430000 isoform X10.073.47Show/hide
Query:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLS-PTAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSL
        MSRRSH +GGD+EL   EEDDEDD+ D+DME LRRACRL G N E+  NP LS P AG  + G     SDSDDVDDLEL+RNI+NRFS AAD++      
Subjt:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLS-PTAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSL

Query:  HPLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNNLDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIR
         PLS LPPV+ DEEEDDFETLR+IQRRF+AYESD LSN  DQSCD  GPL+MDS+ TDV R TSS RSSM+A EKG+LPKAALAFIDAIKKNRSQQKFIR
Subjt:  HPLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNNLDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIR

Query:  SKMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSN
        SKMIHLEARIEENKKLR+RFK+LK FQ SCRR+T   L+QM+DPRVQLISA KPQ KDSSKKDK+LSAM YGPAENSHVACYR ALTKF   V+RK+WSN
Subjt:  SKMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSN

Query:  VERENLGKGIRQQFQEMVLQISMDQISGLQGFSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWST
         ERENLGKGIRQQFQEMVLQIS+DQIS +QGFSA+SDDLDNI ASIK LDITPE+IREFLPKVNWDKLA  YL GRSGAECEARWLNFEDPLINRN W+T
Subjt:  VERENLGKGIRQQFQEMVLQISMDQISGLQGFSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWST

Query:  SEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTK
        SEDKNLL TIQQKGLNNWI +AVSLGTNRTPFQCLSRYQRSLNASILK EWTKDEDDKLRSAVA+ G GDWQA+ASTLEGR G QCSNRWKKSLDPARTK
Subjt:  SEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTK

Query:  RGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPN
        RG FTPDED+RLKIAVLLFGPKNWNKKAEF+PGRNQVQCRERWFNCLDPSLR+CEWTEEEDLRLEIAIQEHGYSW KVAACVPSRTDNECRRRWKKLFPN
Subjt:  RGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPN

Query:  EVPLLQEARRIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNADDPNPAPKRNVKPRTR-VPRKEKNATGDAPKRRKSNNHSNQANGATEQV----
        +VPLLQEAR+IQK ALISNFVDRESERPALGPTDFRP+PN++LLCN DDP  APKRNV+ R   V R EK+A GDAPK+ KSNN  NQA+  T QV    
Subjt:  EVPLLQEARRIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNADDPNPAPKRNVKPRTR-VPRKEKNATGDAPKRRKSNNHSNQANGATEQV----

Query:  ---------SKKPQRRQNRNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLN-SSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEEC
                 S KPQR++ R+GAYT +RKG  +   N+E+CAEQN +T SLEVQLN    + R NS+CPE V ENGME  ENK AE  S+  + FSEQEE 
Subjt:  ---------SKKPQRRQNRNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLN-SSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEEC

Query:  QNSTGSSGVSVLSEMMNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRR
        QNSTGSSGVSVLSEM NDMDEYNPS  PDT  LA  T DD I E K   VAD DL DSNSFSLP  CL LRT DSEGVDS SV E TDKS  V K QGRR
Subjt:  QNSTGSSGVSVLSEMMNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGRR

Query:  RRTNQ------------PKKELERSANKELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTT-HKRKFDGGSSDKLEVESNGNDTT
        ++ ++             + ELE S   ELH  NQ KKRKH+ T TS  GT+E VEE DDCTLQGFLQK+LK+TTT H +K DG SS   EV+++ ND T
Subjt:  RRTNQ------------PKKELERSANKELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTT-HKRKFDGGSSDKLEVESNGNDTT

Query:  IASFLKNISKKKKH
        +A  LK+  K+KKH
Subjt:  IASFLKNISKKKKH

A0A6J1JKV7 uncharacterized protein LOC111485355 isoform X10.073.5Show/hide
Query:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLS-PTAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSL
        MSRRSH +GGD+EL   EEDDEDD+ D+DME LRRACRL G N E+Y NP LS P AG  + G     SDSDDVDDLEL+RNI+NRFSIAAD++      
Subjt:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLS-PTAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSL

Query:  HPLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNNLDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIR
         PLS LPPV+ DEEEDDFETLR+IQRRF+AYESD LSN  DQSCD  GPL+MDS+ T+V R TSS RSSM+A EKG+LPKAALAFIDAIKKNRSQQKF+R
Subjt:  HPLSTLPPVSPDEEEDDFETLRAIQRRFSAYESDALSNNLDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIR

Query:  SKMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQ-TKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWS
        SKMIHLEARIEENKKLR+RFK+LK FQ SCRR+T   LSQM+DPRVQLISA KPQ  KDSSKKDK+LSAM YGPAENSHVACYR+ALTKF   V+RK+WS
Subjt:  SKMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQ-TKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWS

Query:  NVERENLGKGIRQQFQEMVLQISMDQISGLQGFSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWS
        N ERENLGKGIRQQFQEMVLQIS+DQIS +QGFSA+SDDLDNI ASIKDLDITPE+IREFLPKVNWDKLAS YL GRSGAECEARWLNFEDPLINRNPW+
Subjt:  NVERENLGKGIRQQFQEMVLQISMDQISGLQGFSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWS

Query:  TSEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPART
        TSEDKNLL TIQQKGLNNWI++AVSLGTNRTPFQ LSRYQRSLNASILK EWTKDEDDKLRSAVA+ G GDWQA+ASTLEGR G QCSNRWKKSLDPART
Subjt:  TSEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPART

Query:  KRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFP
        KRG FTPDED+RLKIAVLLFGPKNWNKKAEF+PGRNQVQCRERWFNCLDPSLR+CEWTEEEDLRLEIAIQEHGYSW KVAACVPSRTDNECRRRWKKLFP
Subjt:  KRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFP

Query:  NEVPLLQEARRIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNADDPNPAPKRNVKPRTR-VPRKEKNATGDAPKRRKSNNHSNQANGATEQV---
        N+VPLLQEAR+IQK ALISNFVDRESERPALGPTDFRP+PN++LLCN DDP  APKRNV+ R   V R EK+A GDAPK+RKSNN  N+ +  T QV   
Subjt:  NEVPLLQEARRIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNADDPNPAPKRNVKPRTR-VPRKEKNATGDAPKRRKSNNHSNQANGATEQV---

Query:  ----------SKKPQRRQNRNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLN-SSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEE
                  S KPQR++ R+GAYT +RKG  +   N+E+CAEQN +T +LEVQLN    + R NS+CPE V ENGME  ENK AE  S+  + FSEQEE
Subjt:  ----------SKKPQRRQNRNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLN-SSVSGRTNSECPEIVYENGMEECENKVAEKLSKSDLFFSEQEE

Query:  CQNSTGSSGVSVLSEMMNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGR
         QNSTGSSGVSVLSEM NDMDEYNPS LPDT  LA  T DD I E K   VAD DL  SNSFSLP  CL LRT DSEGVDS SV E TDKS +V K QGR
Subjt:  CQNSTGSSGVSVLSEMMNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKHQGR

Query:  RRRTNQ------------PKKELERSANKELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTT-HKRKFDGGSSDKLEVESNGNDT
        R++ ++             + ELE S   ELH  NQ KKRKH+ST TS  GT+E VEE DDCTL GFLQK+LK+TTT H +K DG SS   EV+++ ND 
Subjt:  RRRTNQ------------PKKELERSANKELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTT-HKRKFDGGSSDKLEVESNGNDT

Query:  TIASFLKNISKKKKH
        T+A  LK   K+KKH
Subjt:  TIASFLKNISKKKKH

SwissProt top hitse value%identityAlignment
P46200 Transcriptional activator Myb8.2e-3138.64Show/hide
Query:  KREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCL
        K  WT++ED+KL+  V   G  DW+ IA+ L  R   QC +RW+K L+P   K G +T +ED R+   V  +GPK W+  A+ + GR   QCRERW N L
Subjt:  KREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCL

Query:  DPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPNEVP---LLQEARRIQKAALISNF
        +P ++K  WTEEED  +  A +  G  W ++A  +P RTDN  +  W      +V     LQE+ +  + A+ ++F
Subjt:  DPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPNEVP---LLQEARRIQKAALISNF

P91868 snRNA-activating protein complex subunit 4 homolog5.7e-3232.71Show/hide
Query:  VNWDKLASF-YLGGRSGAECEARWLNFEDPLINRNPWSTSEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRS
        V W  +A+F + G R+    +++W N  +P  N+  WS  E + L    +     +W  +A++LGTNRT +QC+ +Y+  ++     +EW++DED KL +
Subjt:  VNWDKLASF-YLGGRSGAECEARWLNFEDPLINRNPWSTSEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRS

Query:  AVAVLGVG---DWQAIASTLEGRAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCE-WT
           +  +     W  +A  + GR   Q   R+  +LD A  K G +T  ED  L  AV  +G K+W K A+ V  RN  QCRERW N L+ S    E +T
Subjt:  AVAVLGVG---DWQAIASTLEGRAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCE-WT

Query:  EEEDLRLEIAIQEHGY-SWTKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARRIQKAALISNFVD
          ED +L  A++  G  +W K    +P +T  + RRR+ +L          A +++ AA   N VD
Subjt:  EEEDLRLEIAIQEHGY-SWTKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARRIQKAALISNFVD

Q54NA6 Myb-like protein L5.1e-5735.01Show/hide
Query:  PAENSHVACYRMALTKFPLSVNRKKWSNVERENLGKGIRQQ-FQEMVLQISMDQIS---------GLQGFSADSDDLDNIF------ASIKDLDITPERI
        PA+N      R+     PL    ++W+  E E L KGI+++  Q+ + ++S D++S          +Q  S ++++ +N         SIKD        
Subjt:  PAENSHVACYRMALTKFPLSVNRKKWSNVERENLGKGIRQQ-FQEMVLQISMDQIS---------GLQGFSADSDDLDNIF------ASIKDLDITPERI

Query:  REFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWSTSEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDED
         + + +V  + L       RS  E   RW N +DP IN+ P++  EDK LL   ++   + W  I++ LGTNRTP  C+ RYQRSLN+ ++KREWTK+ED
Subjt:  REFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWSTSEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDED

Query:  DKLRSAVAVLGVG---DWQAIASTLEGRAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRK
        + L   + +   G   DWQ I   + GR G QC +RW K+LDP+  K+G ++P+ED  L  AV  +G  NW      V GR  VQCRER+ N LDP L K
Subjt:  DKLRSAVAVLGVG---DWQAIASTLEGRAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRK

Query:  CEWTEEEDLRLEIAIQEHGY-SWTKVAACVPSRTDNECRRRWKKL--FPNEVPLLQEARRIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNADDP
          WT +ED RL     + G   W+ VA  + +RTDN+C RRWK+L    N +   QE    +K   +SNF  R+ ER  L   D   I          + 
Subjt:  CEWTEEEDLRLEIAIQEHGY-SWTKVAACVPSRTDNECRRRWKKL--FPNEVPLLQEARRIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNADDP

Query:  NPAPKRNVKPRTRVPRKEKNATGDAPKRRKSNNHSNQ
           PK N K +T +      +T       K++N  NQ
Subjt:  NPAPKRNVKPRTRVPRKEKNATGDAPKRRKSNNHSNQ

Q5SXM2 snRNA-activating protein complex subunit 48.7e-4128.93Show/hide
Query:  IKKNRSQQKFIRSKMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTK
        ++ N   Q+ I+ K+      + +N++  Q+ ++++D   S  + T  +  + + P   +    KP  KD             GP  N           K
Subjt:  IKKNRSQQKFIRSKMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTK

Query:  FPLSVNRKKWSNVERENLGKGIRQQFQEMVLQISMDQISGL-QGFSADSDDLD---------NIFASIKDLDITPER--IREFLPKVNWDKLASF-YLGG
            +   KW N E+  L K +     + +LQ  + ++  L Q  S  S +L+              I+D++  PE   +   L   +W+K+++  + G 
Subjt:  FPLSVNRKKWSNVERENLGKGIRQQFQEMVLQISMDQISGL-QGFSADSDDLD---------NIFASIKDLDITPER--IREFLPKVNWDKLASF-YLGG

Query:  RSGAECEARWLNFEDPLINRNPWSTSEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGD---WQ
        RS  E    W N E P IN+  WS  E++ L       G   W  IA  LGT+R+ FQCL ++Q+  N ++ ++EWT++ED  L   V  + VG    ++
Subjt:  RSGAECEARWLNFEDPLINRNPWSTSEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGD---WQ

Query:  AIASTLEGRAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHG
         I   +EGR   Q   RW KSLDP   K+G + P+ED +L  AV  +G ++W K  E VPGR+  QCR+R+   L  SL+K  W  +E+ +L   I+++G
Subjt:  AIASTLEGRAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHG

Query:  YS-WTKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARR
           W K+A+ +P R+ ++C  +WK +   +  L +  RR
Subjt:  YS-WTKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARR

Q8BP86 snRNA-activating protein complex subunit 43.7e-3931.76Show/hide
Query:  KWSNVERENLGKGIRQQFQEMVLQISM----------DQISGLQGFSADSDDLDNIFASIKDLDITPER--IREFLPKVNWDKLASF-YLGGRSGAECEA
        KW + E+  L K +     + +LQ  +           ++S      A    +      I+D++  PE   +   L   +W+K+++  + G RS  E   
Subjt:  KWSNVERENLGKGIRQQFQEMVLQISM----------DQISGLQGFSADSDDLDNIFASIKDLDITPER--IREFLPKVNWDKLASF-YLGGRSGAECEA

Query:  RWLNFEDPLINRNPWSTSEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGD---WQAIASTLEG
         W + E P I++  WST E + L       G   W  +A  LGT+R+ FQCL ++Q+  N ++ ++EWT++ED  L   V  + VG+   ++ I   +EG
Subjt:  RWLNFEDPLINRNPWSTSEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVLGVGD---WQAIASTLEG

Query:  RAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYS-WTKVA
        R   Q   RW KSLDP+  KRG + P+ED +L  AV  +G ++W K  E VPGR+  QCR+R+   L  SL+K  W  +E+ +L   I+++G   W ++A
Subjt:  RAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYS-WTKVA

Query:  ACVPSRTDNECRRRWKKL
        + +P R+ ++C  +WK L
Subjt:  ACVPSRTDNECRRRWKKL

Arabidopsis top hitse value%identityAlignment
AT3G09370.1 myb domain protein 3r-31.5e-2740.14Show/hide
Query:  KREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCL
        K  WT +ED+ LR AV       W+ IA +   R   QC +RW+K L+P   K G +T +ED ++   V  +GP  W+  A+ +PGR   QCRERW N L
Subjt:  KREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCL

Query:  DPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRW
        +P + K  WT EE++ L  A + HG  W ++A  +P RTDN  +  W
Subjt:  DPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRW

AT3G09370.2 myb domain protein 3r-31.5e-2740.14Show/hide
Query:  KREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCL
        K  WT +ED+ LR AV       W+ IA +   R   QC +RW+K L+P   K G +T +ED ++   V  +GP  W+  A+ +PGR   QCRERW N L
Subjt:  KREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCL

Query:  DPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRW
        +P + K  WT EE++ L  A + HG  W ++A  +P RTDN  +  W
Subjt:  DPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRW

AT3G18100.1 myb domain protein 4r14.0e-16643.59Show/hide
Query:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPTAGAGSFGGSKPGSDSDDVDDLELVRNIRNR---------------
        M+R S  E  D      ++DDE+D   ED+E LRRAC +   N +++ +   + +      GG +  SDS++ DD E++R I+++               
Subjt:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPTAGAGSFGGSKPGSDSDDVDDLELVRNIRNR---------------

Query:  -FSIAADDEDQ-----------ALSLHPLSTLPPV--SPDEEEDDFETLRAIQRRFSAYES-DALSNNLDQSCDFGGPLEMDSNE---------------
          S+ +D E +            LSL    +LPP+  S DEE+D FETLRAI+RRFSAY++ D+    ++ S      +    NE               
Subjt:  -FSIAADDEDQ-----------ALSLHPLSTLPPV--SPDEEEDDFETLRAIQRRFSAYES-DALSNNLDQSCDFGGPLEMDSNE---------------

Query:  TDVGR------QTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLIS
         D G+       +   +   +     + P+AA AF+DAI++NR+ QKF+R K+  +EA IE+N+K ++  +I+KDFQ+SC+R T   L Q  DPRV+LIS
Subjt:  TDVGR------QTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLIS

Query:  AQKPQTKDSSK----------KDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNVERENLGKGIRQQFQEMVLQISMDQISGLQGFSADSDDLD
         +K    DSS+           DK++S +  GPAEN  V  YRMAL K+P+SV R+KWS  E +NL KG++Q+ Q+++L  ++++ S L+G +    D+D
Subjt:  AQKPQTKDSSK----------KDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNVERENLGKGIRQQFQEMVLQISMDQISGLQGFSADSDDLD

Query:  NIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWSTSEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQR
         I  SI +L+ITPE IR+FLPK+NWD L    +  RS AECEARW++ EDPLIN  PW+ +EDKNLL TI+Q  L +W++IAVSLGTNRTPFQCL+RYQR
Subjt:  NIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWSTSEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQR

Query:  SLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCR
        SLN SILK+EWT +EDD+LR+AV + G  DWQ++A+ L+GR GTQCSNRWKKSL P  T++G ++ +ED R+K+AV LFG +NW+K ++FVPGR Q QCR
Subjt:  SLNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCR

Query:  ERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARRIQKAALISNFVDRESERPALGPTDFRPIPN
        ERW NCLDP + + +WTEEED +L  AI EHGYSW+KVA  +  RTDN+C RRWK+L+P++V LLQEARR+QK A + NFVDRESERPAL  +    +P+
Subjt:  ERWFNCLDPSLRKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARRIQKAALISNFVDRESERPALGPTDFRPIPN

Query:  TNLLCNADDPNPAPKRNVKPR----TRVPRKE----KNATGDAPKRRKSNNHSNQANGATEQ----VSKKPQRRQNRNGAYTAKRKGVLEPRPNNE
         +L    D      KR  K +     R P++     KN +GD  ++       N+ N   E+    +    + + N       +RK V E    NE
Subjt:  TNLLCNADDPNPAPKRNVKPR----TRVPRKE----KNATGDAPKRRKSNNHSNQANGATEQ----VSKKPQRRQNRNGAYTAKRKGVLEPRPNNE

AT3G18100.2 myb domain protein 4r18.6e-15350.18Show/hide
Query:  NLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSK----------KDKQL
        + P+AA AF+DAI++NR+ QKF+R K+  +EA IE+N+K ++  +I+KDFQ+SC+R T   L Q  DPRV+LIS +K    DSS+           DK++
Subjt:  NLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSK----------KDKQL

Query:  SAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNVERENLGKGIRQQFQEMVLQISMDQISGLQGFSADSDDLDNIFASIKDLDITPERIREFLPKVNWD
        S +  GPAEN  V  YRMAL K+P+SV R+KWS  E +NL KG++Q+ Q+++L  ++++ S L+G +    D+D I  SI +L+ITPE IR+FLPK+NWD
Subjt:  SAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNVERENLGKGIRQQFQEMVLQISMDQISGLQGFSADSDDLDNIFASIKDLDITPERIREFLPKVNWD

Query:  KLASFYLGGRSGAECEARWLNFEDPLINRNPWSTSEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVL
         L    +  RS AECEARW++ EDPLIN  PW+ +EDKNLL TI+Q  L +W++IAVSLGTNRTPFQCL+RYQRSLN SILK+EWT +EDD+LR+AV + 
Subjt:  KLASFYLGGRSGAECEARWLNFEDPLINRNPWSTSEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRSAVAVL

Query:  GVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEI
        G  DWQ++A+ L+GR GTQCSNRWKKSL P  T++G ++ +ED R+K+AV LFG +NW+K ++FVPGR Q QCRERW NCLDP + + +WTEEED +L  
Subjt:  GVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEEDLRLEI

Query:  AIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARRIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNADDPNPAPKRNVKPR----T
        AI EHGYSW+KVA  +  RTDN+C RRWK+L+P++V LLQEARR+QK A + NFVDRESERPAL  +    +P+ +L    D      KR  K +     
Subjt:  AIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARRIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNADDPNPAPKRNVKPR----T

Query:  RVPRKE----KNATGDAPKRRKSNNHSNQANGATEQ----VSKKPQRRQNRNGAYTAKRKGVLEPRPNNE
        R P++     KN +GD  ++       N+ N   E+    +    + + N       +RK V E    NE
Subjt:  RVPRKE----KNATGDAPKRRKSNNHSNQANGATEQ----VSKKPQRRQNRNGAYTAKRKGVLEPRPNNE

AT3G18100.3 myb domain protein 4r14.9e-14842.19Show/hide
Query:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPTAGAGSFGGSKPGSDSDDVDDLELVRNIRNR---------------
        M+R S  E  D      ++DDE+D   ED+E LRRAC +   N +++ +   + +      GG +  SDS++ DD E++R I+++               
Subjt:  MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPTAGAGSFGGSKPGSDSDDVDDLELVRNIRNR---------------

Query:  -FSIAADDEDQ-----------ALSLHPLSTLPPV--SPDEEEDDFETLRAIQRRFSAYESDALSNNLDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLA
          S+ +D E +            LSL    +LPP+  S DEE+D FETLRAI+RRFSAY+            +FG   +  ++     +Q +  + S+  
Subjt:  -FSIAADDEDQ-----------ALSLHPLSTLPPV--SPDEEEDDFETLRAIQRRFSAYESDALSNNLDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLA

Query:  LEKGNLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEEN-KKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSK----------
         ++          +    K   +      +  H+   +EEN +KL+QR        SS  R T     +M DPRV+LIS +K    DSS+          
Subjt:  LEKGNLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEEN-KKLRQRFKILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSK----------

Query:  KDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNVERENLGKGIRQQFQEMVLQISMDQISGLQGFSADSDDLDNIFASIKDLDITPERIREFLP
         DK++S +  GPAEN  V  YRMAL K+P+SV R+KWS  E +NL KG++Q+ Q+++L  ++++ S L+G +    D+D I  SI +L+ITPE IR+FLP
Subjt:  KDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNVERENLGKGIRQQFQEMVLQISMDQISGLQGFSADSDDLDNIFASIKDLDITPERIREFLP

Query:  KVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWSTSEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRS
        K+NWD L    +  RS AECEARW++ EDPLIN  PW+ +EDKNLL TI+Q  L +W++IAVSLGTNRTPFQCL+RYQRSLN SILK+EWT +EDD+LR+
Subjt:  KVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWSTSEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRSLNASILKREWTKDEDDKLRS

Query:  AVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEED
        AV + G  DWQ++A+ L+GR GTQCSNRWKKSL P  T++G ++ +ED R+K+AV LFG +NW+K ++FVPGR Q QCRERW NCLDP + + +WTEEED
Subjt:  AVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSLRKCEWTEEED

Query:  LRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARRIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNADDPNPAPKRNVKPR
         +L  AI EHGYSW+KVA  +  RTDN+C RRWK+L+P++V LLQEARR+QK A + NFVDRESERPAL  +    +P+ +L    D      KR  K +
Subjt:  LRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARRIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNADDPNPAPKRNVKPR

Query:  ----TRVPRKE----KNATGDAPKRRKSNNHSNQANGATEQ----VSKKPQRRQNRNGAYTAKRKGVLEPRPNNE
             R P++     KN +GD  ++       N+ N   E+    +    + + N       +RK V E    NE
Subjt:  ----TRVPRKE----KNATGDAPKRRKSNNHSNQANGATEQ----VSKKPQRRQNRNGAYTAKRKGVLEPRPNNE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCGCCGCAGCCATAACGAAGGCGGTGACCAGGAGCTCTCCACCGGCGAGGAAGACGATGAAGATGATGTGTTTGATGAGGACATGGAAGCCCTCCGGCGAGCCTG
TAGGCTTGTTGGAGCTAATCCCGAAGAGTATAATAATCCTCCCTTGTCTCCTACCGCCGGAGCTGGCTCGTTCGGCGGTAGCAAACCTGGCTCTGATTCTGATGATGTTG
ATGATCTCGAACTTGTTCGGAATATACGGAACCGGTTCTCGATTGCGGCCGATGATGAGGACCAGGCTTTGTCTCTGCATCCGTTGAGTACTCTCCCACCGGTGTCGCCG
GACGAGGAGGAGGACGATTTCGAGACGCTTCGCGCGATTCAGCGGCGGTTTTCGGCGTATGAAAGTGATGCTTTGAGCAATAATCTCGATCAGTCTTGTGACTTTGGTGG
GCCTCTAGAGATGGATTCCAACGAAACAGATGTTGGGAGACAGACTTCCTCAGGAAGGTCGTCTATGCTAGCCCTTGAAAAGGGAAATTTGCCTAAGGCTGCATTGGCAT
TTATTGATGCAATCAAGAAGAATAGGTCTCAGCAGAAGTTTATTCGAAGTAAGATGATTCACCTTGAAGCTAGAATTGAGGAGAACAAAAAGCTTAGACAACGTTTCAAA
ATTCTCAAAGATTTCCAGAGTTCATGTAGAAGGAGAACAGGTTATCAACTGTCTCAGATGATTGATCCTCGAGTCCAGTTAATTTCAGCACAAAAACCACAGACTAAGGA
TTCATCAAAGAAGGACAAACAGTTATCTGCAATGTATTATGGCCCAGCTGAGAATTCTCATGTTGCATGCTACAGAATGGCATTGACAAAGTTTCCACTTTCAGTAAATC
GAAAAAAATGGTCCAATGTAGAAAGGGAGAATCTTGGGAAGGGAATAAGACAGCAATTTCAAGAAATGGTTCTTCAAATTTCAATGGATCAAATCAGTGGGCTACAAGGA
TTTTCAGCAGATTCAGATGATCTGGATAACATTTTTGCATCAATAAAAGACCTTGACATTACTCCTGAGAGGATTAGGGAATTTCTGCCAAAAGTAAATTGGGACAAATT
GGCTTCCTTTTATCTTGGGGGTCGCTCAGGTGCAGAATGTGAAGCCAGGTGGTTGAATTTTGAAGATCCTCTAATTAATCGGAATCCATGGTCTACAAGTGAGGATAAGA
ATCTTTTGCTTACTATCCAACAGAAAGGGCTGAATAACTGGATTAACATAGCAGTTTCATTGGGTACGAACAGAACTCCTTTTCAGTGCTTGTCTCGGTATCAAAGGAGT
TTAAATGCTTCCATATTAAAGAGGGAGTGGACCAAAGATGAGGATGATAAACTCCGATCTGCTGTTGCCGTTCTTGGTGTAGGAGATTGGCAGGCAATAGCTTCTACTTT
GGAAGGACGAGCTGGTACACAGTGCTCTAATAGATGGAAAAAATCCCTTGACCCAGCTAGGACAAAAAGAGGCGCGTTCACTCCAGATGAAGACAATCGCTTGAAAATTG
CCGTACTGCTTTTTGGGCCTAAAAATTGGAACAAGAAAGCAGAATTTGTACCTGGTCGAAATCAAGTTCAATGCAGAGAAAGATGGTTCAATTGTTTAGATCCTTCCTTG
AGAAAGTGTGAATGGACAGAAGAGGAGGATTTAAGGCTCGAAATAGCAATTCAGGAGCATGGATATAGCTGGACAAAGGTAGCTGCATGTGTGCCTTCACGTACAGATAA
TGAGTGCCGGAGGAGATGGAAGAAGTTATTCCCCAATGAAGTTCCCTTGCTCCAGGAAGCTAGAAGGATTCAGAAGGCTGCTCTTATTAGCAACTTTGTTGATAGAGAAT
CAGAGCGTCCTGCTTTAGGTCCTACTGACTTTCGACCTATACCCAATACAAATTTATTATGTAATGCGGATGATCCAAATCCTGCCCCAAAAAGAAATGTCAAGCCGAGG
ACACGAGTGCCAAGGAAGGAAAAGAATGCTACTGGCGATGCTCCAAAGAGGAGGAAATCAAATAACCATAGCAATCAAGCTAATGGAGCAACTGAGCAGGTATCTAAAAA
ACCTCAAAGAAGACAAAATAGAAATGGAGCTTATACTGCTAAGAGGAAAGGGGTTCTGGAGCCACGTCCTAACAATGAGAAATGTGCTGAACAAAATTTGGAAACTGAGA
GCCTCGAGGTGCAGCTGAATAGTAGCGTATCAGGGAGGACCAACAGTGAGTGCCCCGAGATTGTTTATGAGAATGGTATGGAGGAATGTGAGAACAAAGTGGCAGAGAAG
CTTTCTAAAAGTGATTTATTCTTTTCAGAACAGGAAGAATGTCAGAACTCGACAGGGTCTTCTGGAGTGTCAGTATTGTCAGAAATGATGAACGACATGGACGAGTATAA
TCCCTCTATCCTTCCAGACACAGCACCGTTGGCTTGTACTACTGGGGATGACGATATAACGGAAAGGAAGGACACAAGGGTTGCAGACACGGATCTGGTTGACAGTAACA
GTTTCTCGTTACCGCATGGTTGCTTAGGACTCAGGACTAATGACAGTGAAGGCGTCGATAGCTGTTCTGTCGGTGAAACTACAGATAAAAGCGATATGGTTTATAAGCAC
CAAGGTAGAAGGAGGAGAACTAATCAACCAAAGAAGGAGCTGGAGAGATCAGCGAACAAGGAGCTTCATCCTCATAACCAACCAAAGAAGCGAAAGCATAACAGCACATA
TACAAGTGAGTCGGGAACATTGGAGCCAGTCGAAGAAGCAGATGACTGCACTCTCCAAGGCTTTCTGCAAAAGAAATTGAAGAAGACAACCACTCATAAAAGGAAATTTG
ATGGCGGTTCTAGCGATAAACTAGAAGTTGAAAGCAATGGTAATGATACTACTATCGCCTCGTTTCTCAAGAATATATCGAAGAAAAAGAAGCATAAAAGCTCCTAA
mRNA sequenceShow/hide mRNA sequence
CCCAAATTAGGATCAAGCCCAATTAATAAGCCCAAAATATCGCCGCTTGGAGTTTCGACAAATTGAAAAGAAAGGCTTATTTATTTAAGACCGCCACAATTCCGGGAAGT
CCCCTTCGGAGTTCCGACTGCAACCTTGAGTCAGTGATTCGTACGTCCCGCTCATTCATCAATGTCTCGCCGCAGCCATAACGAAGGCGGTGACCAGGAGCTCTCCACCG
GCGAGGAAGACGATGAAGATGATGTGTTTGATGAGGACATGGAAGCCCTCCGGCGAGCCTGTAGGCTTGTTGGAGCTAATCCCGAAGAGTATAATAATCCTCCCTTGTCT
CCTACCGCCGGAGCTGGCTCGTTCGGCGGTAGCAAACCTGGCTCTGATTCTGATGATGTTGATGATCTCGAACTTGTTCGGAATATACGGAACCGGTTCTCGATTGCGGC
CGATGATGAGGACCAGGCTTTGTCTCTGCATCCGTTGAGTACTCTCCCACCGGTGTCGCCGGACGAGGAGGAGGACGATTTCGAGACGCTTCGCGCGATTCAGCGGCGGT
TTTCGGCGTATGAAAGTGATGCTTTGAGCAATAATCTCGATCAGTCTTGTGACTTTGGTGGGCCTCTAGAGATGGATTCCAACGAAACAGATGTTGGGAGACAGACTTCC
TCAGGAAGGTCGTCTATGCTAGCCCTTGAAAAGGGAAATTTGCCTAAGGCTGCATTGGCATTTATTGATGCAATCAAGAAGAATAGGTCTCAGCAGAAGTTTATTCGAAG
TAAGATGATTCACCTTGAAGCTAGAATTGAGGAGAACAAAAAGCTTAGACAACGTTTCAAAATTCTCAAAGATTTCCAGAGTTCATGTAGAAGGAGAACAGGTTATCAAC
TGTCTCAGATGATTGATCCTCGAGTCCAGTTAATTTCAGCACAAAAACCACAGACTAAGGATTCATCAAAGAAGGACAAACAGTTATCTGCAATGTATTATGGCCCAGCT
GAGAATTCTCATGTTGCATGCTACAGAATGGCATTGACAAAGTTTCCACTTTCAGTAAATCGAAAAAAATGGTCCAATGTAGAAAGGGAGAATCTTGGGAAGGGAATAAG
ACAGCAATTTCAAGAAATGGTTCTTCAAATTTCAATGGATCAAATCAGTGGGCTACAAGGATTTTCAGCAGATTCAGATGATCTGGATAACATTTTTGCATCAATAAAAG
ACCTTGACATTACTCCTGAGAGGATTAGGGAATTTCTGCCAAAAGTAAATTGGGACAAATTGGCTTCCTTTTATCTTGGGGGTCGCTCAGGTGCAGAATGTGAAGCCAGG
TGGTTGAATTTTGAAGATCCTCTAATTAATCGGAATCCATGGTCTACAAGTGAGGATAAGAATCTTTTGCTTACTATCCAACAGAAAGGGCTGAATAACTGGATTAACAT
AGCAGTTTCATTGGGTACGAACAGAACTCCTTTTCAGTGCTTGTCTCGGTATCAAAGGAGTTTAAATGCTTCCATATTAAAGAGGGAGTGGACCAAAGATGAGGATGATA
AACTCCGATCTGCTGTTGCCGTTCTTGGTGTAGGAGATTGGCAGGCAATAGCTTCTACTTTGGAAGGACGAGCTGGTACACAGTGCTCTAATAGATGGAAAAAATCCCTT
GACCCAGCTAGGACAAAAAGAGGCGCGTTCACTCCAGATGAAGACAATCGCTTGAAAATTGCCGTACTGCTTTTTGGGCCTAAAAATTGGAACAAGAAAGCAGAATTTGT
ACCTGGTCGAAATCAAGTTCAATGCAGAGAAAGATGGTTCAATTGTTTAGATCCTTCCTTGAGAAAGTGTGAATGGACAGAAGAGGAGGATTTAAGGCTCGAAATAGCAA
TTCAGGAGCATGGATATAGCTGGACAAAGGTAGCTGCATGTGTGCCTTCACGTACAGATAATGAGTGCCGGAGGAGATGGAAGAAGTTATTCCCCAATGAAGTTCCCTTG
CTCCAGGAAGCTAGAAGGATTCAGAAGGCTGCTCTTATTAGCAACTTTGTTGATAGAGAATCAGAGCGTCCTGCTTTAGGTCCTACTGACTTTCGACCTATACCCAATAC
AAATTTATTATGTAATGCGGATGATCCAAATCCTGCCCCAAAAAGAAATGTCAAGCCGAGGACACGAGTGCCAAGGAAGGAAAAGAATGCTACTGGCGATGCTCCAAAGA
GGAGGAAATCAAATAACCATAGCAATCAAGCTAATGGAGCAACTGAGCAGGTATCTAAAAAACCTCAAAGAAGACAAAATAGAAATGGAGCTTATACTGCTAAGAGGAAA
GGGGTTCTGGAGCCACGTCCTAACAATGAGAAATGTGCTGAACAAAATTTGGAAACTGAGAGCCTCGAGGTGCAGCTGAATAGTAGCGTATCAGGGAGGACCAACAGTGA
GTGCCCCGAGATTGTTTATGAGAATGGTATGGAGGAATGTGAGAACAAAGTGGCAGAGAAGCTTTCTAAAAGTGATTTATTCTTTTCAGAACAGGAAGAATGTCAGAACT
CGACAGGGTCTTCTGGAGTGTCAGTATTGTCAGAAATGATGAACGACATGGACGAGTATAATCCCTCTATCCTTCCAGACACAGCACCGTTGGCTTGTACTACTGGGGAT
GACGATATAACGGAAAGGAAGGACACAAGGGTTGCAGACACGGATCTGGTTGACAGTAACAGTTTCTCGTTACCGCATGGTTGCTTAGGACTCAGGACTAATGACAGTGA
AGGCGTCGATAGCTGTTCTGTCGGTGAAACTACAGATAAAAGCGATATGGTTTATAAGCACCAAGGTAGAAGGAGGAGAACTAATCAACCAAAGAAGGAGCTGGAGAGAT
CAGCGAACAAGGAGCTTCATCCTCATAACCAACCAAAGAAGCGAAAGCATAACAGCACATATACAAGTGAGTCGGGAACATTGGAGCCAGTCGAAGAAGCAGATGACTGC
ACTCTCCAAGGCTTTCTGCAAAAGAAATTGAAGAAGACAACCACTCATAAAAGGAAATTTGATGGCGGTTCTAGCGATAAACTAGAAGTTGAAAGCAATGGTAATGATAC
TACTATCGCCTCGTTTCTCAAGAATATATCGAAGAAAAAGAAGCATAAAAGCTCCTAATGGTGGTGGGTGATGTTATTTAGCCATCCTGTTCGGCGGAGCTCGGAAGCAG
AAGCAGATTGATGATGGAAAGCTGTACAAATATTAGGCCAATGGTGAGAGCGAGTGTACCAAATAAAGCTCAGAGGCCGCAGAATAGTTAATTCCACAGCTAAAGGATTT
GAGTTTTATGGTTGACGTTGACGAATTTTTGGTTTGTGATGTAGCTGAGAGGTTTAGTGATTTTGTCTGTGTCAAAAGTCATTTTTTGCGCTAATTTCCTTTGTCAACAT
TTTCACAAGCGAGCAAATTTTGATGAGTTGCCCGGTAGTGATAATCTTCTCTTTGTATATTTCTCGCCTTAAATGAAATCAGAGGATGTGTATAGGCCTTTTTACCTTTT
TGGTTAGTTCGCCCTATTGTTGGAAATTGTTCTATTTTTTTTCTGTTCTTAATATCAATGGCTTTTTATTTTTGTTATATAAGTACTTCAGAAGTACAAATGCCAAC
Protein sequenceShow/hide protein sequence
MSRRSHNEGGDQELSTGEEDDEDDVFDEDMEALRRACRLVGANPEEYNNPPLSPTAGAGSFGGSKPGSDSDDVDDLELVRNIRNRFSIAADDEDQALSLHPLSTLPPVSP
DEEEDDFETLRAIQRRFSAYESDALSNNLDQSCDFGGPLEMDSNETDVGRQTSSGRSSMLALEKGNLPKAALAFIDAIKKNRSQQKFIRSKMIHLEARIEENKKLRQRFK
ILKDFQSSCRRRTGYQLSQMIDPRVQLISAQKPQTKDSSKKDKQLSAMYYGPAENSHVACYRMALTKFPLSVNRKKWSNVERENLGKGIRQQFQEMVLQISMDQISGLQG
FSADSDDLDNIFASIKDLDITPERIREFLPKVNWDKLASFYLGGRSGAECEARWLNFEDPLINRNPWSTSEDKNLLLTIQQKGLNNWINIAVSLGTNRTPFQCLSRYQRS
LNASILKREWTKDEDDKLRSAVAVLGVGDWQAIASTLEGRAGTQCSNRWKKSLDPARTKRGAFTPDEDNRLKIAVLLFGPKNWNKKAEFVPGRNQVQCRERWFNCLDPSL
RKCEWTEEEDLRLEIAIQEHGYSWTKVAACVPSRTDNECRRRWKKLFPNEVPLLQEARRIQKAALISNFVDRESERPALGPTDFRPIPNTNLLCNADDPNPAPKRNVKPR
TRVPRKEKNATGDAPKRRKSNNHSNQANGATEQVSKKPQRRQNRNGAYTAKRKGVLEPRPNNEKCAEQNLETESLEVQLNSSVSGRTNSECPEIVYENGMEECENKVAEK
LSKSDLFFSEQEECQNSTGSSGVSVLSEMMNDMDEYNPSILPDTAPLACTTGDDDITERKDTRVADTDLVDSNSFSLPHGCLGLRTNDSEGVDSCSVGETTDKSDMVYKH
QGRRRRTNQPKKELERSANKELHPHNQPKKRKHNSTYTSESGTLEPVEEADDCTLQGFLQKKLKKTTTHKRKFDGGSSDKLEVESNGNDTTIASFLKNISKKKKHKSS