; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g34070 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g34070
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionalpha N-terminal protein methyltransferase 1
Genome locationchr1:24025132..24039187
RNA-Seq ExpressionMoc01g34070
SyntenyMoc01g34070
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006480 - N-terminal protein amino acid methylation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0008168 - methyltransferase activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR003657 - WRKY domain
IPR008576 - Alpha-N-methyltransferase NTM1
IPR029063 - S-adenosyl-L-methionine-dependent methyltransferase
IPR036576 - WRKY domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011658332.1 alpha N-terminal protein methyltransferase 1 isoform X1 [Cucumis sativus]7.3e-15786.03Show/hide
Query:  MEVSGADTDGHEFKNAEEMWTEQVGNPTKRTQWYREGVGYWQGVEASVDGVLGGYGHVNDADILGSEVFLKSILSERFHFAGKDRPLVALDCGSGIGRVT
        ME SGADTDGHEFKNAEEMW E VGNPTKRT+WYREGVGYWQGVEASVDGVLGGYGHVNDADILGSEVFLKSIL ERF FAGKDRPLVALDCGSGIGRVT
Subjt:  MEVSGADTDGHEFKNAEEMWTEQVGNPTKRTQWYREGVGYWQGVEASVDGVLGGYGHVNDADILGSEVFLKSILSERFHFAGKDRPLVALDCGSGIGRVT

Query:  KNLLIRYFNEVDLLEPVSHFLEAARESLAPENSGASSDMHKATNFFCVPLQEFTPAAGRYDIIWVQWCIGHLTDEDFISFFKRAKAGLKGGGIFILKENI
        KNLLI+YFNEVDLLEPVSHFLEAAR +LAPEN+G  SD+HKATNFFC+PLQEFTP AGRYD+IWVQWCIGHLTDEDFISFFKRAK GLK GGIFILKENI
Subjt:  KNLLIRYFNEVDLLEPVSHFLEAARESLAPENSGASSDMHKATNFFCVPLQEFTPAAGRYDIIWVQWCIGHLTDEDFISFFKRAKAGLKGGGIFILKENI

Query:  ARSGFILDKEDRSITRSDIYYKNLFNQCGLYPFKSKDQKGFPEELFPVKMYALTTEGPKRNSRTKREQSNRPGIIKPLNIQIRTRLNWKPQASGFSCIRG
        ARSGF+LDKEDRSITRSD YYK+LFNQCGLY FKS+DQKGFP+ELFPVKMYALTTE PKR+SR KREQSNRPG+IKP++IQI +R NWK QAS FS ++G
Subjt:  ARSGFILDKEDRSITRSDIYYKNLFNQCGLYPFKSKDQKGFPEELFPVKMYALTTEGPKRNSRTKREQSNRPGIIKPLNIQIRTRLNWKPQASGFSCIRG

Query:  DAIAIRPNSKPLTLC
        DAIAIRP +    LC
Subjt:  DAIAIRPNSKPLTLC

XP_022146650.1 probable WRKY transcription factor 14 [Momordica charantia]1.5e-242100Show/hide
Query:  MENYQGDLTDIVRGSSSSATYGCKIEDPFGFSSAEWQFHPSMSNLDQLEQQPTSQQHQYSSFGDPFCAAAMRDPLLHELDMCGSSSSPSAASNSSTTCTF
        MENYQGDLTDIVRGSSSSATYGCKIEDPFGFSSAEWQFHPSMSNLDQLEQQPTSQQHQYSSFGDPFCAAAMRDPLLHELDMCGSSSSPSAASNSSTTCTF
Subjt:  MENYQGDLTDIVRGSSSSATYGCKIEDPFGFSSAEWQFHPSMSNLDQLEQQPTSQQHQYSSFGDPFCAAAMRDPLLHELDMCGSSSSPSAASNSSTTCTF

Query:  FNGGHDQDHDHHLVIKTPPSSSNNTNIFSRMLQISPSTNKFPISSPHSSPLFPNSTSSTPLINANSSEFHHFLDNTAVQISSPRNPTGIKRRKSQARKVV
        FNGGHDQDHDHHLVIKTPPSSSNNTNIFSRMLQISPSTNKFPISSPHSSPLFPNSTSSTPLINANSSEFHHFLDNTAVQISSPRNPTGIKRRKSQARKVV
Subjt:  FNGGHDQDHDHHLVIKTPPSSSNNTNIFSRMLQISPSTNKFPISSPHSSPLFPNSTSSTPLINANSSEFHHFLDNTAVQISSPRNPTGIKRRKSQARKVV

Query:  CVPAPVAASSRPNGEVIPSDLWAWRKYGQKPIKGSPYPRGYYRCSSSKGCSARKQVERSRTDPNMLVITYTSEHNHPWPTQRNALAGSSRSQQSKSSAPN
        CVPAPVAASSRPNGEVIPSDLWAWRKYGQKPIKGSPYPRGYYRCSSSKGCSARKQVERSRTDPNMLVITYTSEHNHPWPTQRNALAGSSRSQQSKSSAPN
Subjt:  CVPAPVAASSRPNGEVIPSDLWAWRKYGQKPIKGSPYPRGYYRCSSSKGCSARKQVERSRTDPNMLVITYTSEHNHPWPTQRNALAGSSRSQQSKSSAPN

Query:  SKLNPQNPTPDQPADTNKTQVFGAPTNVKEEAIDHQEIIQAEDNGDFSEGFQYMPTNNNNNNQSDDFFADLEELDTDPLNLLFSTGGDDDGQQKEAKFGL
        SKLNPQNPTPDQPADTNKTQVFGAPTNVKEEAIDHQEIIQAEDNGDFSEGFQYMPTNNNNNNQSDDFFADLEELDTDPLNLLFSTGGDDDGQQKEAKFGL
Subjt:  SKLNPQNPTPDQPADTNKTQVFGAPTNVKEEAIDHQEIIQAEDNGDFSEGFQYMPTNNNNNNQSDDFFADLEELDTDPLNLLFSTGGDDDGQQKEAKFGL

Query:  FDWAAENNNNNANSTSFEEAAAGKR
        FDWAAENNNNNANSTSFEEAAAGKR
Subjt:  FDWAAENNNNNANSTSFEEAAAGKR

XP_022146715.1 alpha N-terminal protein methyltransferase 1 isoform X1 [Momordica charantia]1.9e-189100Show/hide
Query:  MEVSGADTDGHEFKNAEEMWTEQVGNPTKRTQWYREGVGYWQGVEASVDGVLGGYGHVNDADILGSEVFLKSILSERFHFAGKDRPLVALDCGSGIGRVT
        MEVSGADTDGHEFKNAEEMWTEQVGNPTKRTQWYREGVGYWQGVEASVDGVLGGYGHVNDADILGSEVFLKSILSERFHFAGKDRPLVALDCGSGIGRVT
Subjt:  MEVSGADTDGHEFKNAEEMWTEQVGNPTKRTQWYREGVGYWQGVEASVDGVLGGYGHVNDADILGSEVFLKSILSERFHFAGKDRPLVALDCGSGIGRVT

Query:  KNLLIRYFNEVDLLEPVSHFLEAARESLAPENSGASSDMHKATNFFCVPLQEFTPAAGRYDIIWVQWCIGHLTDEDFISFFKRAKAGLKGGGIFILKENI
        KNLLIRYFNEVDLLEPVSHFLEAARESLAPENSGASSDMHKATNFFCVPLQEFTPAAGRYDIIWVQWCIGHLTDEDFISFFKRAKAGLKGGGIFILKENI
Subjt:  KNLLIRYFNEVDLLEPVSHFLEAARESLAPENSGASSDMHKATNFFCVPLQEFTPAAGRYDIIWVQWCIGHLTDEDFISFFKRAKAGLKGGGIFILKENI

Query:  ARSGFILDKEDRSITRSDIYYKNLFNQCGLYPFKSKDQKGFPEELFPVKMYALTTEGPKRNSRTKREQSNRPGIIKPLNIQIRTRLNWKPQASGFSCIRG
        ARSGFILDKEDRSITRSDIYYKNLFNQCGLYPFKSKDQKGFPEELFPVKMYALTTEGPKRNSRTKREQSNRPGIIKPLNIQIRTRLNWKPQASGFSCIRG
Subjt:  ARSGFILDKEDRSITRSDIYYKNLFNQCGLYPFKSKDQKGFPEELFPVKMYALTTEGPKRNSRTKREQSNRPGIIKPLNIQIRTRLNWKPQASGFSCIRG

Query:  DAIAIRPNSKPLTLCHLKALCIT
        DAIAIRPNSKPLTLCHLKALCIT
Subjt:  DAIAIRPNSKPLTLCHLKALCIT

XP_022146716.1 alpha N-terminal protein methyltransferase 1 isoform X2 [Momordica charantia]7.0e-160100Show/hide
Query:  MEVSGADTDGHEFKNAEEMWTEQVGNPTKRTQWYREGVGYWQGVEASVDGVLGGYGHVNDADILGSEVFLKSILSERFHFAGKDRPLVALDCGSGIGRVT
        MEVSGADTDGHEFKNAEEMWTEQVGNPTKRTQWYREGVGYWQGVEASVDGVLGGYGHVNDADILGSEVFLKSILSERFHFAGKDRPLVALDCGSGIGRVT
Subjt:  MEVSGADTDGHEFKNAEEMWTEQVGNPTKRTQWYREGVGYWQGVEASVDGVLGGYGHVNDADILGSEVFLKSILSERFHFAGKDRPLVALDCGSGIGRVT

Query:  KNLLIRYFNEVDLLEPVSHFLEAARESLAPENSGASSDMHKATNFFCVPLQEFTPAAGRYDIIWVQWCIGHLTDEDFISFFKRAKAGLKGGGIFILKENI
        KNLLIRYFNEVDLLEPVSHFLEAARESLAPENSGASSDMHKATNFFCVPLQEFTPAAGRYDIIWVQWCIGHLTDEDFISFFKRAKAGLKGGGIFILKENI
Subjt:  KNLLIRYFNEVDLLEPVSHFLEAARESLAPENSGASSDMHKATNFFCVPLQEFTPAAGRYDIIWVQWCIGHLTDEDFISFFKRAKAGLKGGGIFILKENI

Query:  ARSGFILDKEDRSITRSDIYYKNLFNQCGLYPFKSKDQKGFPEELFPVKMYALTTEGPKRNSRTKREQSNRPGIIK
        ARSGFILDKEDRSITRSDIYYKNLFNQCGLYPFKSKDQKGFPEELFPVKMYALTTEGPKRNSRTKREQSNRPGIIK
Subjt:  ARSGFILDKEDRSITRSDIYYKNLFNQCGLYPFKSKDQKGFPEELFPVKMYALTTEGPKRNSRTKREQSNRPGIIK

XP_038881057.1 alpha N-terminal protein methyltransferase 1 isoform X2 [Benincasa hispida]1.0e-15886.98Show/hide
Query:  MEVSGADTDGHEFKNAEEMWTEQVGNPTKRTQWYREGVGYWQGVEASVDGVLGGYGHVNDADILGSEVFLKSILSERFHFAGKDRPLVALDCGSGIGRVT
        ME SGADTDGHEFKNAEEMW EQVGNPTKRTQWYREGVGYWQGVEASVDGVLGGYGHVNDADILGSEVFLKSIL+ERF F GKDRPLVALDCGSGIGRVT
Subjt:  MEVSGADTDGHEFKNAEEMWTEQVGNPTKRTQWYREGVGYWQGVEASVDGVLGGYGHVNDADILGSEVFLKSILSERFHFAGKDRPLVALDCGSGIGRVT

Query:  KNLLIRYFNEVDLLEPVSHFLEAARESLAPENSGASSDMHKATNFFCVPLQEFTPAAGRYDIIWVQWCIGHLTDEDFISFFKRAKAGLKGGGIFILKENI
        KNLLIRYFNEVDLLEPVSHFLEAAR SLAPEN+GA SD+HKATNFFC+PLQEFTP AGRYD+IWVQWCIGHLTDEDFISFFKRAK GLK GGIFILKENI
Subjt:  KNLLIRYFNEVDLLEPVSHFLEAARESLAPENSGASSDMHKATNFFCVPLQEFTPAAGRYDIIWVQWCIGHLTDEDFISFFKRAKAGLKGGGIFILKENI

Query:  ARSGFILDKEDRSITRSDIYYKNLFNQCGLYPFKSKDQKGFPEELFPVKMYALTTEGPKRNSRTKREQSNRPGIIKPLNIQIRTRLNWKPQASGFSCIRG
        ARSGFILDKEDRSITRSD YYK+LFNQCGLYPFKS+DQKGFP+ELFPVKMYAL+TE PKR+SRTKREQ+NRPG+IKP++IQI +R NWK QAS FS ++G
Subjt:  ARSGFILDKEDRSITRSDIYYKNLFNQCGLYPFKSKDQKGFPEELFPVKMYALTTEGPKRNSRTKREQSNRPGIIKPLNIQIRTRLNWKPQASGFSCIRG

Query:  DAIAIRPNSKPLTLC
        + +AIRP S    LC
Subjt:  DAIAIRPNSKPLTLC

TrEMBL top hitse value%identityAlignment
A0A0A0KIY9 Uncharacterized protein3.5e-15786.03Show/hide
Query:  MEVSGADTDGHEFKNAEEMWTEQVGNPTKRTQWYREGVGYWQGVEASVDGVLGGYGHVNDADILGSEVFLKSILSERFHFAGKDRPLVALDCGSGIGRVT
        ME SGADTDGHEFKNAEEMW E VGNPTKRT+WYREGVGYWQGVEASVDGVLGGYGHVNDADILGSEVFLKSIL ERF FAGKDRPLVALDCGSGIGRVT
Subjt:  MEVSGADTDGHEFKNAEEMWTEQVGNPTKRTQWYREGVGYWQGVEASVDGVLGGYGHVNDADILGSEVFLKSILSERFHFAGKDRPLVALDCGSGIGRVT

Query:  KNLLIRYFNEVDLLEPVSHFLEAARESLAPENSGASSDMHKATNFFCVPLQEFTPAAGRYDIIWVQWCIGHLTDEDFISFFKRAKAGLKGGGIFILKENI
        KNLLI+YFNEVDLLEPVSHFLEAAR +LAPEN+G  SD+HKATNFFC+PLQEFTP AGRYD+IWVQWCIGHLTDEDFISFFKRAK GLK GGIFILKENI
Subjt:  KNLLIRYFNEVDLLEPVSHFLEAARESLAPENSGASSDMHKATNFFCVPLQEFTPAAGRYDIIWVQWCIGHLTDEDFISFFKRAKAGLKGGGIFILKENI

Query:  ARSGFILDKEDRSITRSDIYYKNLFNQCGLYPFKSKDQKGFPEELFPVKMYALTTEGPKRNSRTKREQSNRPGIIKPLNIQIRTRLNWKPQASGFSCIRG
        ARSGF+LDKEDRSITRSD YYK+LFNQCGLY FKS+DQKGFP+ELFPVKMYALTTE PKR+SR KREQSNRPG+IKP++IQI +R NWK QAS FS ++G
Subjt:  ARSGFILDKEDRSITRSDIYYKNLFNQCGLYPFKSKDQKGFPEELFPVKMYALTTEGPKRNSRTKREQSNRPGIIKPLNIQIRTRLNWKPQASGFSCIRG

Query:  DAIAIRPNSKPLTLC
        DAIAIRP +    LC
Subjt:  DAIAIRPNSKPLTLC

A0A1S3AZL9 alpha N-terminal protein methyltransferase 1 isoform X11.7e-15686.03Show/hide
Query:  MEVSGADTDGHEFKNAEEMWTEQVGNPTKRTQWYREGVGYWQGVEASVDGVLGGYGHVNDADILGSEVFLKSILSERFHFAGKDRPLVALDCGSGIGRVT
        ME SGADTDGHEFKNAEEMW E VGNPTKRT+WYREGVGYWQGVEASVDGVLGGYGHVNDADILGSEVFLKSIL ER  FAGKDRPLVALDCGSGIGR+T
Subjt:  MEVSGADTDGHEFKNAEEMWTEQVGNPTKRTQWYREGVGYWQGVEASVDGVLGGYGHVNDADILGSEVFLKSILSERFHFAGKDRPLVALDCGSGIGRVT

Query:  KNLLIRYFNEVDLLEPVSHFLEAARESLAPENSGASSDMHKATNFFCVPLQEFTPAAGRYDIIWVQWCIGHLTDEDFISFFKRAKAGLKGGGIFILKENI
        KNLLIRYFNEVDLLEPVSHFLEAAR SLAPEN+G  SD+HKATNFFC+PLQEFTP AGRYD+IWVQWCIGHLTDEDFISFFKRAK GLK GGIFILKENI
Subjt:  KNLLIRYFNEVDLLEPVSHFLEAARESLAPENSGASSDMHKATNFFCVPLQEFTPAAGRYDIIWVQWCIGHLTDEDFISFFKRAKAGLKGGGIFILKENI

Query:  ARSGFILDKEDRSITRSDIYYKNLFNQCGLYPFKSKDQKGFPEELFPVKMYALTTEGPKRNSRTKREQSNRPGIIKPLNIQIRTRLNWKPQASGFSCIRG
        ARSGFILDKEDRSITRSD YYK+LFNQCGLY FKS+DQKGFP+ELFPVKMYALTTE PKR+SR KREQSNRPG+IKP++IQI +R NWK QAS FS ++G
Subjt:  ARSGFILDKEDRSITRSDIYYKNLFNQCGLYPFKSKDQKGFPEELFPVKMYALTTEGPKRNSRTKREQSNRPGIIKPLNIQIRTRLNWKPQASGFSCIRG

Query:  DAIAIRPNSKPLTLC
        D IAIRP +    LC
Subjt:  DAIAIRPNSKPLTLC

A0A6J1CXU5 probable WRKY transcription factor 147.1e-243100Show/hide
Query:  MENYQGDLTDIVRGSSSSATYGCKIEDPFGFSSAEWQFHPSMSNLDQLEQQPTSQQHQYSSFGDPFCAAAMRDPLLHELDMCGSSSSPSAASNSSTTCTF
        MENYQGDLTDIVRGSSSSATYGCKIEDPFGFSSAEWQFHPSMSNLDQLEQQPTSQQHQYSSFGDPFCAAAMRDPLLHELDMCGSSSSPSAASNSSTTCTF
Subjt:  MENYQGDLTDIVRGSSSSATYGCKIEDPFGFSSAEWQFHPSMSNLDQLEQQPTSQQHQYSSFGDPFCAAAMRDPLLHELDMCGSSSSPSAASNSSTTCTF

Query:  FNGGHDQDHDHHLVIKTPPSSSNNTNIFSRMLQISPSTNKFPISSPHSSPLFPNSTSSTPLINANSSEFHHFLDNTAVQISSPRNPTGIKRRKSQARKVV
        FNGGHDQDHDHHLVIKTPPSSSNNTNIFSRMLQISPSTNKFPISSPHSSPLFPNSTSSTPLINANSSEFHHFLDNTAVQISSPRNPTGIKRRKSQARKVV
Subjt:  FNGGHDQDHDHHLVIKTPPSSSNNTNIFSRMLQISPSTNKFPISSPHSSPLFPNSTSSTPLINANSSEFHHFLDNTAVQISSPRNPTGIKRRKSQARKVV

Query:  CVPAPVAASSRPNGEVIPSDLWAWRKYGQKPIKGSPYPRGYYRCSSSKGCSARKQVERSRTDPNMLVITYTSEHNHPWPTQRNALAGSSRSQQSKSSAPN
        CVPAPVAASSRPNGEVIPSDLWAWRKYGQKPIKGSPYPRGYYRCSSSKGCSARKQVERSRTDPNMLVITYTSEHNHPWPTQRNALAGSSRSQQSKSSAPN
Subjt:  CVPAPVAASSRPNGEVIPSDLWAWRKYGQKPIKGSPYPRGYYRCSSSKGCSARKQVERSRTDPNMLVITYTSEHNHPWPTQRNALAGSSRSQQSKSSAPN

Query:  SKLNPQNPTPDQPADTNKTQVFGAPTNVKEEAIDHQEIIQAEDNGDFSEGFQYMPTNNNNNNQSDDFFADLEELDTDPLNLLFSTGGDDDGQQKEAKFGL
        SKLNPQNPTPDQPADTNKTQVFGAPTNVKEEAIDHQEIIQAEDNGDFSEGFQYMPTNNNNNNQSDDFFADLEELDTDPLNLLFSTGGDDDGQQKEAKFGL
Subjt:  SKLNPQNPTPDQPADTNKTQVFGAPTNVKEEAIDHQEIIQAEDNGDFSEGFQYMPTNNNNNNQSDDFFADLEELDTDPLNLLFSTGGDDDGQQKEAKFGL

Query:  FDWAAENNNNNANSTSFEEAAAGKR
        FDWAAENNNNNANSTSFEEAAAGKR
Subjt:  FDWAAENNNNNANSTSFEEAAAGKR

A0A6J1CY14 alpha N-terminal protein methyltransferase 1 isoform X19.2e-190100Show/hide
Query:  MEVSGADTDGHEFKNAEEMWTEQVGNPTKRTQWYREGVGYWQGVEASVDGVLGGYGHVNDADILGSEVFLKSILSERFHFAGKDRPLVALDCGSGIGRVT
        MEVSGADTDGHEFKNAEEMWTEQVGNPTKRTQWYREGVGYWQGVEASVDGVLGGYGHVNDADILGSEVFLKSILSERFHFAGKDRPLVALDCGSGIGRVT
Subjt:  MEVSGADTDGHEFKNAEEMWTEQVGNPTKRTQWYREGVGYWQGVEASVDGVLGGYGHVNDADILGSEVFLKSILSERFHFAGKDRPLVALDCGSGIGRVT

Query:  KNLLIRYFNEVDLLEPVSHFLEAARESLAPENSGASSDMHKATNFFCVPLQEFTPAAGRYDIIWVQWCIGHLTDEDFISFFKRAKAGLKGGGIFILKENI
        KNLLIRYFNEVDLLEPVSHFLEAARESLAPENSGASSDMHKATNFFCVPLQEFTPAAGRYDIIWVQWCIGHLTDEDFISFFKRAKAGLKGGGIFILKENI
Subjt:  KNLLIRYFNEVDLLEPVSHFLEAARESLAPENSGASSDMHKATNFFCVPLQEFTPAAGRYDIIWVQWCIGHLTDEDFISFFKRAKAGLKGGGIFILKENI

Query:  ARSGFILDKEDRSITRSDIYYKNLFNQCGLYPFKSKDQKGFPEELFPVKMYALTTEGPKRNSRTKREQSNRPGIIKPLNIQIRTRLNWKPQASGFSCIRG
        ARSGFILDKEDRSITRSDIYYKNLFNQCGLYPFKSKDQKGFPEELFPVKMYALTTEGPKRNSRTKREQSNRPGIIKPLNIQIRTRLNWKPQASGFSCIRG
Subjt:  ARSGFILDKEDRSITRSDIYYKNLFNQCGLYPFKSKDQKGFPEELFPVKMYALTTEGPKRNSRTKREQSNRPGIIKPLNIQIRTRLNWKPQASGFSCIRG

Query:  DAIAIRPNSKPLTLCHLKALCIT
        DAIAIRPNSKPLTLCHLKALCIT
Subjt:  DAIAIRPNSKPLTLCHLKALCIT

A0A6J1D070 alpha N-terminal protein methyltransferase 1 isoform X23.4e-160100Show/hide
Query:  MEVSGADTDGHEFKNAEEMWTEQVGNPTKRTQWYREGVGYWQGVEASVDGVLGGYGHVNDADILGSEVFLKSILSERFHFAGKDRPLVALDCGSGIGRVT
        MEVSGADTDGHEFKNAEEMWTEQVGNPTKRTQWYREGVGYWQGVEASVDGVLGGYGHVNDADILGSEVFLKSILSERFHFAGKDRPLVALDCGSGIGRVT
Subjt:  MEVSGADTDGHEFKNAEEMWTEQVGNPTKRTQWYREGVGYWQGVEASVDGVLGGYGHVNDADILGSEVFLKSILSERFHFAGKDRPLVALDCGSGIGRVT

Query:  KNLLIRYFNEVDLLEPVSHFLEAARESLAPENSGASSDMHKATNFFCVPLQEFTPAAGRYDIIWVQWCIGHLTDEDFISFFKRAKAGLKGGGIFILKENI
        KNLLIRYFNEVDLLEPVSHFLEAARESLAPENSGASSDMHKATNFFCVPLQEFTPAAGRYDIIWVQWCIGHLTDEDFISFFKRAKAGLKGGGIFILKENI
Subjt:  KNLLIRYFNEVDLLEPVSHFLEAARESLAPENSGASSDMHKATNFFCVPLQEFTPAAGRYDIIWVQWCIGHLTDEDFISFFKRAKAGLKGGGIFILKENI

Query:  ARSGFILDKEDRSITRSDIYYKNLFNQCGLYPFKSKDQKGFPEELFPVKMYALTTEGPKRNSRTKREQSNRPGIIK
        ARSGFILDKEDRSITRSDIYYKNLFNQCGLYPFKSKDQKGFPEELFPVKMYALTTEGPKRNSRTKREQSNRPGIIK
Subjt:  ARSGFILDKEDRSITRSDIYYKNLFNQCGLYPFKSKDQKGFPEELFPVKMYALTTEGPKRNSRTKREQSNRPGIIK

SwissProt top hitse value%identityAlignment
A2XMJ1 Alpha N-terminal protein methyltransferase 18.5e-9257.48Show/hide
Query:  EGREFGDGGAVGHQAVAVASDMEVSGADTDGHEFKNAEE---MWTEQVGNPTKRTQWYREGVGYWQGVEASVDGVLGGYGHVNDADILGSEVFLKSILSE
        EGREF     +    +  A+D  VS A  +      A        ++ G   KR +WY + + YWQGVEAS +GVLGGYG VND D+ GS+ FL+ +L+E
Subjt:  EGREFGDGGAVGHQAVAVASDMEVSGADTDGHEFKNAEE---MWTEQVGNPTKRTQWYREGVGYWQGVEASVDGVLGGYGHVNDADILGSEVFLKSILSE

Query:  RFHFAGKDRPLVALDCGSGIGRVTKNLLIRYFNEVDLLEPVSHFLEAARESLAPENSGASSDMHKATNFFCVPLQEFTPAAGRYDIIWVQWCIGHLTDED
        RF  A   R LVALDCGSGIGRVTKN L+R+FNEVDL+EPVSHFLEAA+E+L  E      D HKA NF+CVPLQ+FTP  GRYD+IW+QWCIG L D+D
Subjt:  RFHFAGKDRPLVALDCGSGIGRVTKNLLIRYFNEVDLLEPVSHFLEAARESLAPENSGASSDMHKATNFFCVPLQEFTPAAGRYDIIWVQWCIGHLTDED

Query:  FISFFKRAKAGLKGGGIFILKENIARSGFILDKEDRSITRSDIYYKNLFNQCGLYPFKSKDQKGFPEELFPVKMYALTTEGPKRNSRTKREQ-SNRPGII
        FISFF RAK GLK  G F+LKENIAR+GF+LDKED SITRSD Y+K LF +CGLY    KDQ   P+ELF VKMYAL TE PK     KR +  N P +I
Subjt:  FISFFKRAKAGLKGGGIFILKENIARSGFILDKEDRSITRSDIYYKNLFNQCGLYPFKSKDQKGFPEELFPVKMYALTTEGPKRNSRTKREQ-SNRPGII

Query:  K
        +
Subjt:  K

O64747 Probable WRKY transcription factor 357.5e-6442.41Show/hide
Query:  MENYQGDLTDIVRGSSSSATYGCKIEDPFGFSSAEWQFHPSMSNLDQLEQQPTSQQHQYSSFGDPFCAAAMRDPLLHELDMCGSSSSPSAASNSSTTCTF
        M+N+QGDLTD+VRG  S          P G S +     P+     +     TS     + FGDPF   +M+DPL+H   +  S  S +  + S+ +   
Subjt:  MENYQGDLTDIVRGSSSSATYGCKIEDPFGFSSAEWQFHPSMSNLDQLEQQPTSQQHQYSSFGDPFCAAAMRDPLLHELDMCGSSSSPSAASNSSTTCTF

Query:  FNGGHDQDHDHHLVIKTPPSSSNNTNIFSRMLQISPSTNKFPISSPHSSPLFPNS----TSSTPLINANSSEFHH---FLDN-------TAVQISSPRNP
        F    + DH             +  ++F R ++IS S N    S+ +S  +  +S     S   +IN N++        +DN       + VQISS    
Subjt:  FNGGHDQDHDHHLVIKTPPSSSNNTNIFSRMLQISPSTNKFPISSPHSSPLFPNS----TSSTPLINANSSEFHH---FLDN-------TAVQISSPRNP

Query:  TGIKRRKSQARKVVCVPAPVAASSRPNGEVIPSDLWAWRKYGQKPIKGSPYPRGYYRCSSSKGCSARKQVERSRTDPNMLVITYTSEHNHPWPTQRNALA
         GIKRRKSQA+KVVC+PAP A +SR +GEV+PSDLWAWRKYGQKPIKGSPYPRGYYRCSSSKGCSARKQVERSRTDPNMLVITYTSEHNHPWPTQRNALA
Subjt:  TGIKRRKSQARKVVCVPAPVAASSRPNGEVIPSDLWAWRKYGQKPIKGSPYPRGYYRCSSSKGCSARKQVERSRTDPNMLVITYTSEHNHPWPTQRNALA

Query:  GSSRSQQSKS-------------SAPNSKLNPQNPTPDQPADTNKTQVFGAP----TNVKEEAIDHQEIIQAEDNGDFSEGFQYMPTNNNNNNQSDDFFA
        GS+RS  S S             ++P+S++   N + D+P ++N       P      +KEE ++ ++     D  D    ++         +Q +DFFA
Subjt:  GSSRSQQSKS-------------SAPNSKLNPQNPTPDQPADTNKTQVFGAP----TNVKEEAIDHQEIIQAEDNGDFSEGFQYMPTNNNNNNQSDDFFA

Query:  DLEELDTDPLNLLFS
        DL+EL+ D L +L S
Subjt:  DLEELDTDPLNLLFS

Q10CT5 Alpha N-terminal protein methyltransferase 13.8e-9257.81Show/hide
Query:  EGREFGDGGAVGHQAVAVASDMEVSGADTDGHEFKNAEE---MWTEQVGNPTKRTQWYREGVGYWQGVEASVDGVLGGYGHVNDADILGSEVFLKSILSE
        EGREF     +    +  A+D  VS A  +      A        E+ G   KR +WY + + YWQGVEAS +GVLGGYG VND D+ GS+ FL+ +L+E
Subjt:  EGREFGDGGAVGHQAVAVASDMEVSGADTDGHEFKNAEE---MWTEQVGNPTKRTQWYREGVGYWQGVEASVDGVLGGYGHVNDADILGSEVFLKSILSE

Query:  RFHFAGKDRPLVALDCGSGIGRVTKNLLIRYFNEVDLLEPVSHFLEAARESLAPENSGASSDMHKATNFFCVPLQEFTPAAGRYDIIWVQWCIGHLTDED
        RF  A   R LVALDCGSGIGRVTKN L+R+FNEVDL+EPVSHFLEAA+E+L  E      D HKA NF+CVPLQ+FTP  GRYD+IW+QWCIG L D+D
Subjt:  RFHFAGKDRPLVALDCGSGIGRVTKNLLIRYFNEVDLLEPVSHFLEAARESLAPENSGASSDMHKATNFFCVPLQEFTPAAGRYDIIWVQWCIGHLTDED

Query:  FISFFKRAKAGLKGGGIFILKENIARSGFILDKEDRSITRSDIYYKNLFNQCGLYPFKSKDQKGFPEELFPVKMYALTTEGPKRNSRTKREQ-SNRPGII
        FISFF RAK GLK  G F+LKENIAR+GF+LDKED SITRSD Y+K LF +CGLY    KDQ   P+ELF VKMYAL TE PK     KR +  N P +I
Subjt:  FISFFKRAKAGLKGGGIFILKENIARSGFILDKEDRSITRSDIYYKNLFNQCGLYPFKSKDQKGFPEELFPVKMYALTTEGPKRNSRTKREQ-SNRPGII

Query:  K
        +
Subjt:  K

Q5PP70 Alpha N-terminal protein methyltransferase 19.7e-11268.57Show/hide
Query:  MEVSGADTDGHEFKNAEEMWTEQV--GNPTKRTQWYREGVGYWQGVEASVDGVLGGYGHVNDADILGSEVFLKSILSERF-HFAGKDRPLVALDCGSGIG
        M++ G D++G EF + +EMW E++  G+ TK+TQWYR+GV YW+GVEASVDGVLGGYGHVNDADI+GSEVFLK++L ER  +  G ++ LVALDCGSGIG
Subjt:  MEVSGADTDGHEFKNAEEMWTEQV--GNPTKRTQWYREGVGYWQGVEASVDGVLGGYGHVNDADILGSEVFLKSILSERF-HFAGKDRPLVALDCGSGIG

Query:  RVTKNLLIRYFNEVDLLEPVSHFLEAARESLAPENSGASSDMHKATNFFCVPLQEFTPAAGRYDIIWVQWCIGHLTDEDFISFFKRAKAGLKGGGIFILK
        R+TKNLLIRYFNEVDLLEPV+ FL+AARE+LA     A S+ HKATNFFCVPLQEFTPAAGRYD+IWVQWCIGHLTD DF+SFF RAK  LK GG F++K
Subjt:  RVTKNLLIRYFNEVDLLEPVSHFLEAARESLAPENSGASSDMHKATNFFCVPLQEFTPAAGRYDIIWVQWCIGHLTDEDFISFFKRAKAGLKGGGIFILK

Query:  ENIARSGFILDKEDRSITRSDIYYKNLFNQCGLYPFKSKDQKGFPEELFPVKMYALTTEGPKRNSRTK-REQSNRPGIIK
        EN+A++GF+LDKED SITRSD Y+K LF QCGL+ +++KDQKG P+ELF VKMYALT + P +  RT+ + +SNRP IIK
Subjt:  ENIARSGFILDKEDRSITRSDIYYKNLFNQCGLYPFKSKDQKGFPEELFPVKMYALTTEGPKRNSRTK-REQSNRPGIIK

Q9SA80 Probable WRKY transcription factor 146.3e-7144.07Show/hide
Query:  MENYQGDLTDIVRGSSSSATYGCKIEDPFGFSSAEWQF---HPSMSNLDQLEQQPTSQQHQYSSFGDPFCAAAMRDPLLHELDMCGSSSSPSAASNSSTT
        MEN+QGDLTD+VRG       G  +  P    S  W     HP+          P+      + FGDPF   +M DPLL EL+   +S   S   +++  
Subjt:  MENYQGDLTDIVRGSSSSATYGCKIEDPFGFSSAEWQF---HPSMSNLDQLEQQPTSQQHQYSSFGDPFCAAAMRDPLLHELDMCGSSSSPSAASNSSTT

Query:  CTFFNG-----GHDQDHDHHLVIKTPPSSSNNTNIFSRMLQISPSTNKFPISSPHSSP-----LFPNSTSSTPL----INANSSEFHHFLDNTA----VQ
            NG       ++DH             +  +IF R ++IS S N    SSP +SP     +   + +++P     ++ NS      +D T     +Q
Subjt:  CTFFNG-----GHDQDHDHHLVIKTPPSSSNNTNIFSRMLQISPSTNKFPISSPHSSP-----LFPNSTSSTPL----INANSSEFHHFLDNTA----VQ

Query:  ISSPRNPTGIKRRKSQARKVVCVPAPVAASSRPNGEVIPSDLWAWRKYGQKPIKGSPYPRGYYRCSSSKGCSARKQVERSRTDPNMLVITYTSEHNHPWP
        ISSPRN  G+KRRKSQA+KVVC+PAP A +SR +GEV+PSDLWAWRKYGQKPIKGSP+PRGYYRCSSSKGCSARKQVERSRTDPNMLVITYTSEHNHPWP
Subjt:  ISSPRNPTGIKRRKSQARKVVCVPAPVAASSRPNGEVIPSDLWAWRKYGQKPIKGSPYPRGYYRCSSSKGCSARKQVERSRTDPNMLVITYTSEHNHPWP

Query:  TQRNALAGSSRSQQSKSSAPN-----------SKLNPQNPTPDQPADTNKTQVFGAPTNVKEEAIDHQEIIQAEDNGDFSEGFQYMPTNNNNNNQSDDFF
         QRNALAGS+RS  S SS PN           S +  QN T   P+ T       + + +K+E  D  E+   +D+ D ++   Y P  +++ +Q DDFF
Subjt:  TQRNALAGSSRSQQSKSSAPN-----------SKLNPQNPTPDQPADTNKTQVFGAPTNVKEEAIDHQEIIQAEDNGDFSEGFQYMPTNNNNNNQSDDFF

Query:  ADLEELDTDPLNLLFSTGGDDDGQQK----EAKFGLFDWAAENNNNN
        ADLEEL+ D L++L S G   DG+ K    +     F W+ +NN NN
Subjt:  ADLEELDTDPLNLLFSTGGDDDGQQK----EAKFGLFDWAAENNNNN

Arabidopsis top hitse value%identityAlignment
AT1G30650.1 WRKY DNA-binding protein 144.5e-7244.07Show/hide
Query:  MENYQGDLTDIVRGSSSSATYGCKIEDPFGFSSAEWQF---HPSMSNLDQLEQQPTSQQHQYSSFGDPFCAAAMRDPLLHELDMCGSSSSPSAASNSSTT
        MEN+QGDLTD+VRG       G  +  P    S  W     HP+          P+      + FGDPF   +M DPLL EL+   +S   S   +++  
Subjt:  MENYQGDLTDIVRGSSSSATYGCKIEDPFGFSSAEWQF---HPSMSNLDQLEQQPTSQQHQYSSFGDPFCAAAMRDPLLHELDMCGSSSSPSAASNSSTT

Query:  CTFFNG-----GHDQDHDHHLVIKTPPSSSNNTNIFSRMLQISPSTNKFPISSPHSSP-----LFPNSTSSTPL----INANSSEFHHFLDNTA----VQ
            NG       ++DH             +  +IF R ++IS S N    SSP +SP     +   + +++P     ++ NS      +D T     +Q
Subjt:  CTFFNG-----GHDQDHDHHLVIKTPPSSSNNTNIFSRMLQISPSTNKFPISSPHSSP-----LFPNSTSSTPL----INANSSEFHHFLDNTA----VQ

Query:  ISSPRNPTGIKRRKSQARKVVCVPAPVAASSRPNGEVIPSDLWAWRKYGQKPIKGSPYPRGYYRCSSSKGCSARKQVERSRTDPNMLVITYTSEHNHPWP
        ISSPRN  G+KRRKSQA+KVVC+PAP A +SR +GEV+PSDLWAWRKYGQKPIKGSP+PRGYYRCSSSKGCSARKQVERSRTDPNMLVITYTSEHNHPWP
Subjt:  ISSPRNPTGIKRRKSQARKVVCVPAPVAASSRPNGEVIPSDLWAWRKYGQKPIKGSPYPRGYYRCSSSKGCSARKQVERSRTDPNMLVITYTSEHNHPWP

Query:  TQRNALAGSSRSQQSKSSAPN-----------SKLNPQNPTPDQPADTNKTQVFGAPTNVKEEAIDHQEIIQAEDNGDFSEGFQYMPTNNNNNNQSDDFF
         QRNALAGS+RS  S SS PN           S +  QN T   P+ T       + + +K+E  D  E+   +D+ D ++   Y P  +++ +Q DDFF
Subjt:  TQRNALAGSSRSQQSKSSAPN-----------SKLNPQNPTPDQPADTNKTQVFGAPTNVKEEAIDHQEIIQAEDNGDFSEGFQYMPTNNNNNNQSDDFF

Query:  ADLEELDTDPLNLLFSTGGDDDGQQK----EAKFGLFDWAAENNNNN
        ADLEEL+ D L++L S G   DG+ K    +     F W+ +NN NN
Subjt:  ADLEELDTDPLNLLFSTGGDDDGQQK----EAKFGLFDWAAENNNNN

AT2G34830.1 WRKY DNA-binding protein 355.3e-6542.41Show/hide
Query:  MENYQGDLTDIVRGSSSSATYGCKIEDPFGFSSAEWQFHPSMSNLDQLEQQPTSQQHQYSSFGDPFCAAAMRDPLLHELDMCGSSSSPSAASNSSTTCTF
        M+N+QGDLTD+VRG  S          P G S +     P+     +     TS     + FGDPF   +M+DPL+H   +  S  S +  + S+ +   
Subjt:  MENYQGDLTDIVRGSSSSATYGCKIEDPFGFSSAEWQFHPSMSNLDQLEQQPTSQQHQYSSFGDPFCAAAMRDPLLHELDMCGSSSSPSAASNSSTTCTF

Query:  FNGGHDQDHDHHLVIKTPPSSSNNTNIFSRMLQISPSTNKFPISSPHSSPLFPNS----TSSTPLINANSSEFHH---FLDN-------TAVQISSPRNP
        F    + DH             +  ++F R ++IS S N    S+ +S  +  +S     S   +IN N++        +DN       + VQISS    
Subjt:  FNGGHDQDHDHHLVIKTPPSSSNNTNIFSRMLQISPSTNKFPISSPHSSPLFPNS----TSSTPLINANSSEFHH---FLDN-------TAVQISSPRNP

Query:  TGIKRRKSQARKVVCVPAPVAASSRPNGEVIPSDLWAWRKYGQKPIKGSPYPRGYYRCSSSKGCSARKQVERSRTDPNMLVITYTSEHNHPWPTQRNALA
         GIKRRKSQA+KVVC+PAP A +SR +GEV+PSDLWAWRKYGQKPIKGSPYPRGYYRCSSSKGCSARKQVERSRTDPNMLVITYTSEHNHPWPTQRNALA
Subjt:  TGIKRRKSQARKVVCVPAPVAASSRPNGEVIPSDLWAWRKYGQKPIKGSPYPRGYYRCSSSKGCSARKQVERSRTDPNMLVITYTSEHNHPWPTQRNALA

Query:  GSSRSQQSKS-------------SAPNSKLNPQNPTPDQPADTNKTQVFGAP----TNVKEEAIDHQEIIQAEDNGDFSEGFQYMPTNNNNNNQSDDFFA
        GS+RS  S S             ++P+S++   N + D+P ++N       P      +KEE ++ ++     D  D    ++         +Q +DFFA
Subjt:  GSSRSQQSKS-------------SAPNSKLNPQNPTPDQPADTNKTQVFGAP----TNVKEEAIDHQEIIQAEDNGDFSEGFQYMPTNNNNNNQSDDFFA

Query:  DLEELDTDPLNLLFS
        DL+EL+ D L +L S
Subjt:  DLEELDTDPLNLLFS

AT5G44450.1 methyltransferases5.3e-11367.84Show/hide
Query:  ASDMEVSGADTDGHEFKNAEEMWTEQV--GNPTKRTQWYREGVGYWQGVEASVDGVLGGYGHVNDADILGSEVFLKSILSERF-HFAGKDRPLVALDCGS
        ++ M++ G D++G EF + +EMW E++  G+ TK+TQWYR+GV YW+GVEASVDGVLGGYGHVNDADI+GSEVFLK++L ER  +  G ++ LVALDCGS
Subjt:  ASDMEVSGADTDGHEFKNAEEMWTEQV--GNPTKRTQWYREGVGYWQGVEASVDGVLGGYGHVNDADILGSEVFLKSILSERF-HFAGKDRPLVALDCGS

Query:  GIGRVTKNLLIRYFNEVDLLEPVSHFLEAARESLAPENSGASSDMHKATNFFCVPLQEFTPAAGRYDIIWVQWCIGHLTDEDFISFFKRAKAGLKGGGIF
        GIGR+TKNLLIRYFNEVDLLEPV+ FL+AARE+LA     A S+ HKATNFFCVPLQEFTPAAGRYD+IWVQWCIGHLTD DF+SFF RAK  LK GG F
Subjt:  GIGRVTKNLLIRYFNEVDLLEPVSHFLEAARESLAPENSGASSDMHKATNFFCVPLQEFTPAAGRYDIIWVQWCIGHLTDEDFISFFKRAKAGLKGGGIF

Query:  ILKENIARSGFILDKEDRSITRSDIYYKNLFNQCGLYPFKSKDQKGFPEELFPVKMYALTTEGPKRNSRTK-REQSNRPGIIK
        ++KEN+A++GF+LDKED SITRSD Y+K LF QCGL+ +++KDQKG P+ELF VKMYALT + P +  RT+ + +SNRP IIK
Subjt:  ILKENIARSGFILDKEDRSITRSDIYYKNLFNQCGLYPFKSKDQKGFPEELFPVKMYALTTEGPKRNSRTK-REQSNRPGIIK

AT5G45050.1 Disease resistance protein (TIR-NBS-LRR class)2.7e-3243.12Show/hide
Query:  KRRKSQARKVVCVPAPVAASSRPNGEVIPSDLWAWRKYGQKPIKGSPYPRGYYRCSSSKGCSARKQVERSRTDPNMLVITYTSEHNHPWPTQRNALAGSS
        +R+ ++ ++VVCV   V   SR       SDLW WRKYGQKPIK SPYPR YYRC+SSKGC ARKQVERSRTDPN+ VITY SEHNHP+PT RN LAGS+
Subjt:  KRRKSQARKVVCVPAPVAASSRPNGEVIPSDLWAWRKYGQKPIKGSPYPRGYYRCSSSKGCSARKQVERSRTDPNMLVITYTSEHNHPWPTQRNALAGSS

Query:  RSQQSKSS----APNSKLNPQNPTPDQPADTNKTQVFGAPTNVKEEA-IDHQEIIQAEDNGDFS---EGFQYMPTNNNNNNQSDDFFADLEELDTDPLNL
        RS  SK S    + +S ++     PD      K+ +  +P +    A +  +E ++  DN +F    E   ++P         +D FAD+++L+ +   +
Subjt:  RSQQSKSS----APNSKLNPQNPTPDQPADTNKTQVFGAPTNVKEEA-IDHQEIIQAEDNGDFS---EGFQYMPTNNNNNNQSDDFFADLEELDTDPLNL

Query:  LF---STGGDDDGQQKEA
             S+GG+ + Q K +
Subjt:  LF---STGGDDDGQQKEA

AT5G52830.1 WRKY DNA-binding protein 271.8e-3339.78Show/hide
Query:  VIKTPPSSSNNTNIFSRMLQI-SPSTNKFPISSPHSSPLFPNSTSSTPLINANSSEFHHFLDNTAVQISSPRNPTGIKRRKSQARKVVCVPAPVAASSRP
        ++  P +SS + NI  +  Q+   S ++ P   P S  +FP STSS+  +      F    D    Q S P  P   ++RK+Q ++ +C           
Subjt:  VIKTPPSSSNNTNIFSRMLQI-SPSTNKFPISSPHSSPLFPNSTSSTPLINANSSEFHHFLDNTAVQISSPRNPTGIKRRKSQARKVVCVPAPVAASSRP

Query:  NGEVIPSDLWAWRKYGQKPIKGSPYPRGYYRCSSSKGCSARKQVERSRTDPNMLVITYTSEHNHPWPTQRNALAGSSRSQQSKSSAPNSKLNPQNPTPDQ
          E + SDLWAWRKYGQKPIKGSPYPR YYRCSSSKGC ARKQVERS  DPN+ ++TYT EH HP PT RN+LAGS+R          +K  P NP P  
Subjt:  NGEVIPSDLWAWRKYGQKPIKGSPYPRGYYRCSSSKGCSARKQVERSRTDPNMLVITYTSEHNHPWPTQRNALAGSSRSQQSKSSAPNSKLNPQNPTPDQ

Query:  P----ADTNKTQVFGAPTNVKEEAIDHQEIIQAEDNGDFSEGFQYMPTNNNNNNQSDDFFADLEELDTD
             +DT K ++  +PT   +   D Q     E NGD  E       N     + ++   D EE + D
Subjt:  P----ADTNKTQVFGAPTNVKEEAIDHQEIIQAEDNGDFSEGFQYMPTNNNNNNQSDDFFADLEELDTD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAATTACCAAGGAGATTTGACTGATATTGTGAGAGGCAGCTCCAGTAGTGCCACTTATGGCTGCAAAATTGAAGACCCTTTTGGTTTTTCATCAGCCGAG
TGGCAATTTCACCCATCCATGAGTAATTTGGATCAGCTGGAACAACAACCAACATCGCAACAACATCAATACAGCAGCTTTGGAGATCCATTTTGTGCAGCTGCC
ATGAGAGATCCACTCCTTCACGAGCTCGATATGTGTGGCTCTTCTTCTTCTCCTTCAGCGGCTTCTAATTCTTCCACTACTTGTACCTTTTTCAATGGCGGCCAT
GATCAAGATCATGATCATCACCTTGTAATCAAAACCCCACCTTCTTCTTCCAATAATACTAATATTTTCTCACGTATGCTTCAGATCTCTCCTTCCACAAATAAG
TTCCCCATTTCTTCTCCTCATTCTTCCCCTTTGTTTCCCAATTCCACCTCCTCCACACCCTTAATTAATGCTAATTCTTCTGAGTTTCATCATTTCCTCGACAAC
ACCGCCGTTCAGATCTCTTCTCCCCGGAATCCCACCGGTATCAAGCGAAGGAAGAGCCAGGCAAGGAAGGTGGTTTGTGTTCCTGCACCAGTGGCTGCCAGCAGT
AGGCCTAATGGAGAAGTTATTCCTTCTGATCTCTGGGCTTGGAGGAAATATGGTCAGAAACCCATTAAAGGTTCTCCCTATCCTAGGGGATACTACAGATGCAGC
AGCTCAAAGGGTTGCTCAGCCAGAAAACAAGTGGAGAGAAGCCGAACAGATCCCAATATGCTGGTCATTACCTACACATCAGAGCACAACCATCCATGGCCGACC
CAAAGAAACGCCCTGGCCGGCTCTTCCCGCTCCCAACAATCAAAAAGCAGCGCACCAAATTCAAAACTCAACCCCCAAAACCCAACCCCAGATCAGCCTGCTGAC
ACCAACAAAACCCAAGTATTTGGTGCACCAACAAATGTCAAGGAAGAGGCAATTGATCATCAGGAGATCATTCAAGCAGAAGACAATGGAGATTTCAGCGAAGGG
TTTCAATACATGCCCACCAATAACAATAACAACAATCAAAGTGATGATTTTTTTGCGGATTTGGAAGAGTTGGATACCGACCCTTTAAACCTCTTGTTCAGCACT
GGAGGAGATGATGATGGGCAGCAAAAAGAAGCCAAATTTGGTCTATTTGATTGGGCAGCGGAAAACAACAACAACAACGCCAATTCCACATCGTTTGAGGAAGCA
GCTGCTGGAAAGAGAGTCGAGCTGCTACAAAGGGAAGGGCGGGAGTTTGGGGACGGCGGCGCGGTGGGGCATCAGGCGGTGGCAGTGGCGTCTGATATGGAGGTG
AGCGGGGCAGATACGGACGGCCATGAGTTCAAAAATGCGGAGGAGATGTGGACGGAGCAGGTGGGGAACCCCACCAAGCGCACCCAGTGGTATCGCGAAGGCGTA
GGGTATTGGCAGGGAGTCGAGGCATCAGTCGATGGGGTTTTGGGAGGCTATGGCCATGTTAACGACGCCGATATCTTGGGCAGTGAAGTCTTTCTCAAATCCATT
CTGTCCGAGCGTTTTCATTTCGCTGGAAAAGACCGCCCCCTTGTTGCTCTTGATTGTGGTTCTGGCATTGGGAGAGTCACAAAAAATCTTCTTATCAGATATTTT
AACGAGGTTGACCTTCTTGAACCTGTATCACATTTCCTAGAGGCTGCCCGTGAAAGCTTGGCTCCAGAAAATAGTGGTGCCTCCTCTGATATGCACAAAGCCACT
AACTTCTTTTGTGTGCCTCTACAGGAATTCACGCCAGCTGCTGGAAGATATGATATTATATGGGTTCAGTGGTGTATTGGCCATCTTACAGATGAGGACTTCATC
TCCTTCTTTAAAAGAGCCAAGGCAGGGCTAAAAGGTGGTGGAATTTTCATCCTGAAGGAAAATATTGCTAGATCTGGATTCATATTAGATAAAGAAGATCGAAGC
ATCACAAGATCTGATATATACTATAAGAATCTATTCAACCAGTGTGGCCTCTATCCATTTAAATCAAAGGATCAAAAGGGATTTCCCGAGGAATTGTTTCCTGTG
AAGATGTATGCATTAACTACAGAGGGGCCCAAGAGGAATTCTCGCACCAAAAGGGAACAAAGCAATAGACCTGGCATTATCAAGCCACTTAACATACAAATCCGT
ACTAGATTGAACTGGAAACCTCAAGCGTCTGGTTTCTCATGCATTCGAGGAGATGCCATTGCCATCCGTCCCAATTCCAAACCCCTTACTCTTTGTCATCTTAAA
GCACTGTGCATTACATAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGAATTACCAAGGAGATTTGACTGATATTGTGAGAGGCAGCTCCAGTAGTGCCACTTATGGCTGCAAAATTGAAGACCCTTTTGGTTTTTCATCAGCCGAG
TGGCAATTTCACCCATCCATGAGTAATTTGGATCAGCTGGAACAACAACCAACATCGCAACAACATCAATACAGCAGCTTTGGAGATCCATTTTGTGCAGCTGCC
ATGAGAGATCCACTCCTTCACGAGCTCGATATGTGTGGCTCTTCTTCTTCTCCTTCAGCGGCTTCTAATTCTTCCACTACTTGTACCTTTTTCAATGGCGGCCAT
GATCAAGATCATGATCATCACCTTGTAATCAAAACCCCACCTTCTTCTTCCAATAATACTAATATTTTCTCACGTATGCTTCAGATCTCTCCTTCCACAAATAAG
TTCCCCATTTCTTCTCCTCATTCTTCCCCTTTGTTTCCCAATTCCACCTCCTCCACACCCTTAATTAATGCTAATTCTTCTGAGTTTCATCATTTCCTCGACAAC
ACCGCCGTTCAGATCTCTTCTCCCCGGAATCCCACCGGTATCAAGCGAAGGAAGAGCCAGGCAAGGAAGGTGGTTTGTGTTCCTGCACCAGTGGCTGCCAGCAGT
AGGCCTAATGGAGAAGTTATTCCTTCTGATCTCTGGGCTTGGAGGAAATATGGTCAGAAACCCATTAAAGGTTCTCCCTATCCTAGGGGATACTACAGATGCAGC
AGCTCAAAGGGTTGCTCAGCCAGAAAACAAGTGGAGAGAAGCCGAACAGATCCCAATATGCTGGTCATTACCTACACATCAGAGCACAACCATCCATGGCCGACC
CAAAGAAACGCCCTGGCCGGCTCTTCCCGCTCCCAACAATCAAAAAGCAGCGCACCAAATTCAAAACTCAACCCCCAAAACCCAACCCCAGATCAGCCTGCTGAC
ACCAACAAAACCCAAGTATTTGGTGCACCAACAAATGTCAAGGAAGAGGCAATTGATCATCAGGAGATCATTCAAGCAGAAGACAATGGAGATTTCAGCGAAGGG
TTTCAATACATGCCCACCAATAACAATAACAACAATCAAAGTGATGATTTTTTTGCGGATTTGGAAGAGTTGGATACCGACCCTTTAAACCTCTTGTTCAGCACT
GGAGGAGATGATGATGGGCAGCAAAAAGAAGCCAAATTTGGTCTATTTGATTGGGCAGCGGAAAACAACAACAACAACGCCAATTCCACATCGTTTGAGGAAGCA
GCTGCTGGAAAGAGAGTCGAGCTGCTACAAAGGGAAGGGCGGGAGTTTGGGGACGGCGGCGCGGTGGGGCATCAGGCGGTGGCAGTGGCGTCTGATATGGAGGTG
AGCGGGGCAGATACGGACGGCCATGAGTTCAAAAATGCGGAGGAGATGTGGACGGAGCAGGTGGGGAACCCCACCAAGCGCACCCAGTGGTATCGCGAAGGCGTA
GGGTATTGGCAGGGAGTCGAGGCATCAGTCGATGGGGTTTTGGGAGGCTATGGCCATGTTAACGACGCCGATATCTTGGGCAGTGAAGTCTTTCTCAAATCCATT
CTGTCCGAGCGTTTTCATTTCGCTGGAAAAGACCGCCCCCTTGTTGCTCTTGATTGTGGTTCTGGCATTGGGAGAGTCACAAAAAATCTTCTTATCAGATATTTT
AACGAGGTTGACCTTCTTGAACCTGTATCACATTTCCTAGAGGCTGCCCGTGAAAGCTTGGCTCCAGAAAATAGTGGTGCCTCCTCTGATATGCACAAAGCCACT
AACTTCTTTTGTGTGCCTCTACAGGAATTCACGCCAGCTGCTGGAAGATATGATATTATATGGGTTCAGTGGTGTATTGGCCATCTTACAGATGAGGACTTCATC
TCCTTCTTTAAAAGAGCCAAGGCAGGGCTAAAAGGTGGTGGAATTTTCATCCTGAAGGAAAATATTGCTAGATCTGGATTCATATTAGATAAAGAAGATCGAAGC
ATCACAAGATCTGATATATACTATAAGAATCTATTCAACCAGTGTGGCCTCTATCCATTTAAATCAAAGGATCAAAAGGGATTTCCCGAGGAATTGTTTCCTGTG
AAGATGTATGCATTAACTACAGAGGGGCCCAAGAGGAATTCTCGCACCAAAAGGGAACAAAGCAATAGACCTGGCATTATCAAGCCACTTAACATACAAATCCGT
ACTAGATTGAACTGGAAACCTCAAGCGTCTGGTTTCTCATGCATTCGAGGAGATGCCATTGCCATCCGTCCCAATTCCAAACCCCTTACTCTTTGTCATCTTAAA
GCACTGTGCATTACATAA
Protein sequenceShow/hide protein sequence
MENYQGDLTDIVRGSSSSATYGCKIEDPFGFSSAEWQFHPSMSNLDQLEQQPTSQQHQYSSFGDPFCAAAMRDPLLHELDMCGSSSSPSAASNSSTTCTFFNGGH
DQDHDHHLVIKTPPSSSNNTNIFSRMLQISPSTNKFPISSPHSSPLFPNSTSSTPLINANSSEFHHFLDNTAVQISSPRNPTGIKRRKSQARKVVCVPAPVAASS
RPNGEVIPSDLWAWRKYGQKPIKGSPYPRGYYRCSSSKGCSARKQVERSRTDPNMLVITYTSEHNHPWPTQRNALAGSSRSQQSKSSAPNSKLNPQNPTPDQPAD
TNKTQVFGAPTNVKEEAIDHQEIIQAEDNGDFSEGFQYMPTNNNNNNQSDDFFADLEELDTDPLNLLFSTGGDDDGQQKEAKFGLFDWAAENNNNNANSTSFEEA
AAGKRVELLQREGREFGDGGAVGHQAVAVASDMEVSGADTDGHEFKNAEEMWTEQVGNPTKRTQWYREGVGYWQGVEASVDGVLGGYGHVNDADILGSEVFLKSI
LSERFHFAGKDRPLVALDCGSGIGRVTKNLLIRYFNEVDLLEPVSHFLEAARESLAPENSGASSDMHKATNFFCVPLQEFTPAAGRYDIIWVQWCIGHLTDEDFI
SFFKRAKAGLKGGGIFILKENIARSGFILDKEDRSITRSDIYYKNLFNQCGLYPFKSKDQKGFPEELFPVKMYALTTEGPKRNSRTKREQSNRPGIIKPLNIQIR
TRLNWKPQASGFSCIRGDAIAIRPNSKPLTLCHLKALCIT