; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g36210 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g36210
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionprotein SET DOMAIN GROUP 40-like
Genome locationchr8:26785278..26790448
RNA-Seq ExpressionMoc08g36210
SyntenyMoc08g36210
Gene Ontology termsGO:0005509 - calcium ion binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR001214 - SET domain
IPR015353 - Rubisco LSMT, substrate-binding domain
IPR036464 - Rubisco LSMT, substrate-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6581196.1 Protein SET DOMAIN GROUP 40, partial [Cucurbita argyrosperma subsp. sororia]1.1e-24083.51Show/hide
Query:  METEGSFENLLRWAADRGISDSVDKESSNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTF
        M TE SFE+LLRWAAD GISDSVDK+ S+SCLGRSLCV  FPDAGGRGLGAVR+L  GELVL+VPKSVL TTQSL L++EKLSMALKRYPSLSSTQKLTF
Subjt:  METEGSFENLLRWAADRGISDSVDKESSNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTF

Query:  CLLYEIGEGSNSWWFPYFKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSHTEWREVKGLMEDSNVKRQLQTFKAWLWASATISSRALYVPWDEA
        CLLYEIG+GS+SWWFPYFKHLP +Y+ LATFGEFEKQALQVDYA+W  EKAA KSH EWR VKGLME+S +K QLQTFKAWLWASATISSRALYVPWDEA
Subjt:  CLLYEIGEGSNSWWFPYFKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSHTEWREVKGLMEDSNVKRQLQTFKAWLWASATISSRALYVPWDEA

Query:  GCLCPVGDLFNYAAPEGESLDIRDVSSFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFIL
        GCLCPVGDLFNYAAPE ES DI DVSSFSQHAS  G++TTD LH+E+ DTQRALTDGGF+E VSAYCFYARESYK+G+QVLLSYGTYTNLELL+YYGF+L
Subjt:  GCLCPVGDLFNYAAPEGESLDIRDVSSFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFIL

Query:  QENPNDRVFIPLEHDIYTTSSWPKESLYICQNGNPSFALLSALRLWATRPNKRRGVGHLAYAGSQLSVDNEMSVMQWLSKNCHTVLSNLPTSVEEDRRLL
        QENPNDRVFIPLEHDIY++SSWPKESL+I QNGNPSFALLSALRLWAT PNKRRGVGHLAYAGSQLSV NE+ VMQWLSKNCH VL+NLPTSVEED +LL
Subjt:  QENPNDRVFIPLEHDIYTTSSWPKESLYICQNGNPSFALLSALRLWATRPNKRRGVGHLAYAGSQLSVDNEMSVMQWLSKNCHTVLSNLPTSVEEDRRLL

Query:  CNICKVQDLQIPRGLGKMLSTYGGEFCAFLKTNGLMNGDEAELRLTGKVKHSLDRWKLAVQWRISYKKALIDCISYCTRTISSLS
        CNICK+QDLQ+PR LGKM ST GGEFCAFL+TNGL+N +E EL+LTGK+K SL+RWKLAVQWRI YKKAL+DCISYCTRT  SLS
Subjt:  CNICKVQDLQIPRGLGKMLSTYGGEFCAFLKTNGLMNGDEAELRLTGKVKHSLDRWKLAVQWRISYKKALIDCISYCTRTISSLS

KAG7017936.1 Protein SET DOMAIN GROUP 40, partial [Cucurbita argyrosperma subsp. argyrosperma]9.6e-24083.3Show/hide
Query:  METEGSFENLLRWAADRGISDSVDKESSNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTF
        M TE SFE+LLRWAAD GISDSVDK+ S+SCLGRSLCV  FPDAGGRGLGAVR+L  GELVL+VPKSVL TTQSL L++EKLSMALKRYPSLSSTQKLTF
Subjt:  METEGSFENLLRWAADRGISDSVDKESSNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTF

Query:  CLLYEIGEGSNSWWFPYFKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSHTEWREVKGLMEDSNVKRQLQTFKAWLWASATISSRALYVPWDEA
        CLLYEIG+GS+SWWFPYFKHLP +Y+ LATFGEFEKQALQVDYA+W  EKAA KS  EWR VKGLME+S +K QLQTFKAWLWASATISSRALYVPWDEA
Subjt:  CLLYEIGEGSNSWWFPYFKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSHTEWREVKGLMEDSNVKRQLQTFKAWLWASATISSRALYVPWDEA

Query:  GCLCPVGDLFNYAAPEGESLDIRDVSSFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFIL
        GCLCPVGDLFNYAAPE ES DI DVSSFSQHAS  G++TTD LH+E+ DTQRALTDGGF+E VSAYCFYARESYK+G+QVLLSYGTYTNLELL+YYGF+L
Subjt:  GCLCPVGDLFNYAAPEGESLDIRDVSSFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFIL

Query:  QENPNDRVFIPLEHDIYTTSSWPKESLYICQNGNPSFALLSALRLWATRPNKRRGVGHLAYAGSQLSVDNEMSVMQWLSKNCHTVLSNLPTSVEEDRRLL
        QENPNDRVFIPLEHDIY++SSWPKESL+I QNGNPSFALLSALRLWAT PNKRRGVGHLAYAGSQLSV NE+ VMQWLSKNCH VL+NLPTSVEED +LL
Subjt:  QENPNDRVFIPLEHDIYTTSSWPKESLYICQNGNPSFALLSALRLWATRPNKRRGVGHLAYAGSQLSVDNEMSVMQWLSKNCHTVLSNLPTSVEEDRRLL

Query:  CNICKVQDLQIPRGLGKMLSTYGGEFCAFLKTNGLMNGDEAELRLTGKVKHSLDRWKLAVQWRISYKKALIDCISYCTRTISSLS
        CNICK+QDLQ+PR LGKM ST GGEFCAFL+TNGL+N +E EL+LTGK+K SL+RWKLAVQWRI YKKAL+DCISYCTRT  SLS
Subjt:  CNICKVQDLQIPRGLGKMLSTYGGEFCAFLKTNGLMNGDEAELRLTGKVKHSLDRWKLAVQWRISYKKALIDCISYCTRTISSLS

XP_022143354.1 protein SET DOMAIN GROUP 40 isoform X1 [Momordica charantia]1.2e-285100Show/hide
Query:  METEGSFENLLRWAADRGISDSVDKESSNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTF
        METEGSFENLLRWAADRGISDSVDKESSNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTF
Subjt:  METEGSFENLLRWAADRGISDSVDKESSNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTF

Query:  CLLYEIGEGSNSWWFPYFKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSHTEWREVKGLMEDSNVKRQLQTFKAWLWASATISSRALYVPWDEA
        CLLYEIGEGSNSWWFPYFKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSHTEWREVKGLMEDSNVKRQLQTFKAWLWASATISSRALYVPWDEA
Subjt:  CLLYEIGEGSNSWWFPYFKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSHTEWREVKGLMEDSNVKRQLQTFKAWLWASATISSRALYVPWDEA

Query:  GCLCPVGDLFNYAAPEGESLDIRDVSSFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFIL
        GCLCPVGDLFNYAAPEGESLDIRDVSSFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFIL
Subjt:  GCLCPVGDLFNYAAPEGESLDIRDVSSFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFIL

Query:  QENPNDRVFIPLEHDIYTTSSWPKESLYICQNGNPSFALLSALRLWATRPNKRRGVGHLAYAGSQLSVDNEMSVMQWLSKNCHTVLSNLPTSVEEDRRLL
        QENPNDRVFIPLEHDIYTTSSWPKESLYICQNGNPSFALLSALRLWATRPNKRRGVGHLAYAGSQLSVDNEMSVMQWLSKNCHTVLSNLPTSVEEDRRLL
Subjt:  QENPNDRVFIPLEHDIYTTSSWPKESLYICQNGNPSFALLSALRLWATRPNKRRGVGHLAYAGSQLSVDNEMSVMQWLSKNCHTVLSNLPTSVEEDRRLL

Query:  CNICKVQDLQIPRGLGKMLSTYGGEFCAFLKTNGLMNGDEAELRLTGKVKHSLDRWKLAVQWRISYKKALIDCISYCTRTISSLS
        CNICKVQDLQIPRGLGKMLSTYGGEFCAFLKTNGLMNGDEAELRLTGKVKHSLDRWKLAVQWRISYKKALIDCISYCTRTISSLS
Subjt:  CNICKVQDLQIPRGLGKMLSTYGGEFCAFLKTNGLMNGDEAELRLTGKVKHSLDRWKLAVQWRISYKKALIDCISYCTRTISSLS

XP_022143355.1 protein SET DOMAIN GROUP 40 isoform X2 [Momordica charantia]7.5e-26995.46Show/hide
Query:  METEGSFENLLRWAADRGISDSVDKESSNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTF
        METEGSFENLLRWAADRGISDSVDKESSNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTF
Subjt:  METEGSFENLLRWAADRGISDSVDKESSNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTF

Query:  CLLYEIGEGSNSWWFPYFKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSHTEWREVKGLMEDSNVKRQLQTFKAWLWASATISSRALYVPWDEA
        CLLYEIGEGSNSWWFPYFKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSHTEWREVKGLMEDSNVKRQLQTFKAWLWASATISSRALYVPWDEA
Subjt:  CLLYEIGEGSNSWWFPYFKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSHTEWREVKGLMEDSNVKRQLQTFKAWLWASATISSRALYVPWDEA

Query:  GCLCPVGDLFNYAAPEGESLDIRDVSSFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFIL
        GCLCPVGDLFNYAAPEGESLDIRDVSSFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQ                      
Subjt:  GCLCPVGDLFNYAAPEGESLDIRDVSSFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFIL

Query:  QENPNDRVFIPLEHDIYTTSSWPKESLYICQNGNPSFALLSALRLWATRPNKRRGVGHLAYAGSQLSVDNEMSVMQWLSKNCHTVLSNLPTSVEEDRRLL
        QENPNDRVFIPLEHDIYTTSSWPKESLYICQNGNPSFALLSALRLWATRPNKRRGVGHLAYAGSQLSVDNEMSVMQWLSKNCHTVLSNLPTSVEEDRRLL
Subjt:  QENPNDRVFIPLEHDIYTTSSWPKESLYICQNGNPSFALLSALRLWATRPNKRRGVGHLAYAGSQLSVDNEMSVMQWLSKNCHTVLSNLPTSVEEDRRLL

Query:  CNICKVQDLQIPRGLGKMLSTYGGEFCAFLKTNGLMNGDEAELRLTGKVKHSLDRWKLAVQWRISYKKALIDCISYCTRTISSLS
        CNICKVQDLQIPRGLGKMLSTYGGEFCAFLKTNGLMNGDEAELRLTGKVKHSLDRWKLAVQWRISYKKALIDCISYCTRTISSLS
Subjt:  CNICKVQDLQIPRGLGKMLSTYGGEFCAFLKTNGLMNGDEAELRLTGKVKHSLDRWKLAVQWRISYKKALIDCISYCTRTISSLS

XP_022983189.1 protein SET DOMAIN GROUP 40 isoform X1 [Cucurbita maxima]5.1e-24183.71Show/hide
Query:  METEGSFENLLRWAADRGISDSVDKESSNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTF
        M TEGSFE+LLRWAAD GISDSVDK+SS+SCLGRSLCV  FPDAGGRGLGAVR+L  GELVL+VPKSVL TTQSL L++EKLSMALKRYPSLSSTQKLTF
Subjt:  METEGSFENLLRWAADRGISDSVDKESSNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTF

Query:  CLLYEIGEGSNSWWFPYFKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSHTEWREVKGLMEDSNVKRQLQTFKAWLWASATISSRALYVPWDEA
        CLLYEIG+GS+SWWFPYFKHLP +Y+ LATFGEFEKQALQVDYA+W  EKAA KSHTEWR VKGLME+SN+K QLQTFKAWLWASATISSRALYVPWDEA
Subjt:  CLLYEIGEGSNSWWFPYFKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSHTEWREVKGLMEDSNVKRQLQTFKAWLWASATISSRALYVPWDEA

Query:  GCLCPVGDLFNYAAPEGESLDIRDVSSFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFIL
        GCLCPVGDLFNYAAPEGESLDI DVSSFSQHAS  G++TTD LH+E+ DTQRALTDGGF+E VSAYCFYARESYK+G+QVLLSYGTY+NLELL+YYGF+L
Subjt:  GCLCPVGDLFNYAAPEGESLDIRDVSSFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFIL

Query:  QENPNDRVFIPLEHDIYTTSSWPKESLYICQNGNPSFALLSALRLWATRPNKRRGVGHLAYAGSQLSVDNEMSVMQWLSKNCHTVLSNLPTSVEEDRRLL
        QENPNDRVFIPLEH+IY++SSWPKESL+I QNGNPSFALLSALRLWAT PNKRRGVGHLAYAGSQLSV NE+ VMQWLSKNCH VL+NLPTSVEED +LL
Subjt:  QENPNDRVFIPLEHDIYTTSSWPKESLYICQNGNPSFALLSALRLWATRPNKRRGVGHLAYAGSQLSVDNEMSVMQWLSKNCHTVLSNLPTSVEEDRRLL

Query:  CNICKVQDLQIPRGLGKMLSTYGGEFCAFLKTNGLMNGDEAELRLTGKVKHSLDRWKLAVQWRISYKKALIDCISYCTRTISSLS
        CNICK+QDLQ P  LGKML T GGEFCAFL+T GL+N +E EL LTGK+K SL+RWKLAVQWRI YKKAL+DC SYCTRT  SLS
Subjt:  CNICKVQDLQIPRGLGKMLSTYGGEFCAFLKTNGLMNGDEAELRLTGKVKHSLDRWKLAVQWRISYKKALIDCISYCTRTISSLS

TrEMBL top hitse value%identityAlignment
A0A5D3BQD3 Protein SET DOMAIN GROUP 40 isoform X21.2e-22479.38Show/hide
Query:  METEGSFENLLRWAADRGISDSVDKESSNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTF
        METEGSF +LLRWAAD GISDS+D+ +S SCLGRSLCVS FPD+GGRGL AVR LN GEL+LR PKSVL TTQSL LE+EKL+MALK +PSLSSTQKLTF
Subjt:  METEGSFENLLRWAADRGISDSVDKESSNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTF

Query:  CLLYEIGEGSNSWWFPYFKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSHTEWREVKGLMEDSNVKRQLQTFKAWLWASATISSRALYVPWDEA
        CLL EI +G++S WFPY KHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKS  +WR VKGLM++SN+K QLQTFKAWLWASATISSR LYVPWDEA
Subjt:  CLLYEIGEGSNSWWFPYFKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSHTEWREVKGLMEDSNVKRQLQTFKAWLWASATISSRALYVPWDEA

Query:  GCLCPVGDLFNYAAPEGESLDIRDVSSFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFIL
        GCLCPVGDLFNYAAPEGES +  DV SF  HAS   ++   E  EEQ D+Q  LTDGGF+E  SAYCFYARESYKKG+QVLLSYGTYTN+ELLEYYGF+L
Subjt:  GCLCPVGDLFNYAAPEGESLDIRDVSSFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFIL

Query:  QENPNDRVFIPLEHDIYTTSSWPKESLYICQNGNPSFALLSALRLWATRPNKRRGVGHLAYAGSQLSVDNEMSVMQWLSKNCHTVLSNLPTSVEEDRRLL
        QENPND+VFIP+EHDIY +SSWPKESLYI QNGNPSFALLSALRLWAT PNKRRGVGHLAYAGSQLSV NE  VMQWLSKNCHTVL+NLPTS+EED +LL
Subjt:  QENPNDRVFIPLEHDIYTTSSWPKESLYICQNGNPSFALLSALRLWATRPNKRRGVGHLAYAGSQLSVDNEMSVMQWLSKNCHTVLSNLPTSVEEDRRLL

Query:  CNICKVQDLQIPRGLGKMLSTYGGEFCAFLKTNGLMNGDEAELRLTGKVKHSLDRWKLAVQWRISYKKALIDCISYCTRTISSLS
        CNI KVQDLQ+ R L KML TYGGE CAFL+TNG++N DEAE  L+ K+K SL+RWKLAVQWR+ YKKAL+DCI YCTRTI SLS
Subjt:  CNICKVQDLQIPRGLGKMLSTYGGEFCAFLKTNGLMNGDEAELRLTGKVKHSLDRWKLAVQWRISYKKALIDCISYCTRTISSLS

A0A6J1CNL1 protein SET DOMAIN GROUP 40 isoform X23.6e-26995.46Show/hide
Query:  METEGSFENLLRWAADRGISDSVDKESSNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTF
        METEGSFENLLRWAADRGISDSVDKESSNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTF
Subjt:  METEGSFENLLRWAADRGISDSVDKESSNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTF

Query:  CLLYEIGEGSNSWWFPYFKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSHTEWREVKGLMEDSNVKRQLQTFKAWLWASATISSRALYVPWDEA
        CLLYEIGEGSNSWWFPYFKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSHTEWREVKGLMEDSNVKRQLQTFKAWLWASATISSRALYVPWDEA
Subjt:  CLLYEIGEGSNSWWFPYFKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSHTEWREVKGLMEDSNVKRQLQTFKAWLWASATISSRALYVPWDEA

Query:  GCLCPVGDLFNYAAPEGESLDIRDVSSFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFIL
        GCLCPVGDLFNYAAPEGESLDIRDVSSFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQ                      
Subjt:  GCLCPVGDLFNYAAPEGESLDIRDVSSFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFIL

Query:  QENPNDRVFIPLEHDIYTTSSWPKESLYICQNGNPSFALLSALRLWATRPNKRRGVGHLAYAGSQLSVDNEMSVMQWLSKNCHTVLSNLPTSVEEDRRLL
        QENPNDRVFIPLEHDIYTTSSWPKESLYICQNGNPSFALLSALRLWATRPNKRRGVGHLAYAGSQLSVDNEMSVMQWLSKNCHTVLSNLPTSVEEDRRLL
Subjt:  QENPNDRVFIPLEHDIYTTSSWPKESLYICQNGNPSFALLSALRLWATRPNKRRGVGHLAYAGSQLSVDNEMSVMQWLSKNCHTVLSNLPTSVEEDRRLL

Query:  CNICKVQDLQIPRGLGKMLSTYGGEFCAFLKTNGLMNGDEAELRLTGKVKHSLDRWKLAVQWRISYKKALIDCISYCTRTISSLS
        CNICKVQDLQIPRGLGKMLSTYGGEFCAFLKTNGLMNGDEAELRLTGKVKHSLDRWKLAVQWRISYKKALIDCISYCTRTISSLS
Subjt:  CNICKVQDLQIPRGLGKMLSTYGGEFCAFLKTNGLMNGDEAELRLTGKVKHSLDRWKLAVQWRISYKKALIDCISYCTRTISSLS

A0A6J1CP24 protein SET DOMAIN GROUP 40 isoform X15.6e-286100Show/hide
Query:  METEGSFENLLRWAADRGISDSVDKESSNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTF
        METEGSFENLLRWAADRGISDSVDKESSNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTF
Subjt:  METEGSFENLLRWAADRGISDSVDKESSNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTF

Query:  CLLYEIGEGSNSWWFPYFKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSHTEWREVKGLMEDSNVKRQLQTFKAWLWASATISSRALYVPWDEA
        CLLYEIGEGSNSWWFPYFKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSHTEWREVKGLMEDSNVKRQLQTFKAWLWASATISSRALYVPWDEA
Subjt:  CLLYEIGEGSNSWWFPYFKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSHTEWREVKGLMEDSNVKRQLQTFKAWLWASATISSRALYVPWDEA

Query:  GCLCPVGDLFNYAAPEGESLDIRDVSSFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFIL
        GCLCPVGDLFNYAAPEGESLDIRDVSSFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFIL
Subjt:  GCLCPVGDLFNYAAPEGESLDIRDVSSFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFIL

Query:  QENPNDRVFIPLEHDIYTTSSWPKESLYICQNGNPSFALLSALRLWATRPNKRRGVGHLAYAGSQLSVDNEMSVMQWLSKNCHTVLSNLPTSVEEDRRLL
        QENPNDRVFIPLEHDIYTTSSWPKESLYICQNGNPSFALLSALRLWATRPNKRRGVGHLAYAGSQLSVDNEMSVMQWLSKNCHTVLSNLPTSVEEDRRLL
Subjt:  QENPNDRVFIPLEHDIYTTSSWPKESLYICQNGNPSFALLSALRLWATRPNKRRGVGHLAYAGSQLSVDNEMSVMQWLSKNCHTVLSNLPTSVEEDRRLL

Query:  CNICKVQDLQIPRGLGKMLSTYGGEFCAFLKTNGLMNGDEAELRLTGKVKHSLDRWKLAVQWRISYKKALIDCISYCTRTISSLS
        CNICKVQDLQIPRGLGKMLSTYGGEFCAFLKTNGLMNGDEAELRLTGKVKHSLDRWKLAVQWRISYKKALIDCISYCTRTISSLS
Subjt:  CNICKVQDLQIPRGLGKMLSTYGGEFCAFLKTNGLMNGDEAELRLTGKVKHSLDRWKLAVQWRISYKKALIDCISYCTRTISSLS

A0A6J1F4A7 protein SET DOMAIN GROUP 40 isoform X11.3e-23983.09Show/hide
Query:  METEGSFENLLRWAADRGISDSVDKESSNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTF
        M  E SFE+LLRWAAD GISDSVDK+ S+SCLGRSLCV  FPDAGGRGLGAVR+L  GELVL+VPKSVL TTQSL L++EKLSMALKRYPSLSSTQKLTF
Subjt:  METEGSFENLLRWAADRGISDSVDKESSNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTF

Query:  CLLYEIGEGSNSWWFPYFKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSHTEWREVKGLMEDSNVKRQLQTFKAWLWASATISSRALYVPWDEA
        CLLYEIG+GS+SWWFPYFKHLP +Y+ LATFGEFEKQALQVDYA+W  EKAA KS  EWR VKGLME+SN+K QLQTFKAWLWASATISSRALYVPWDEA
Subjt:  CLLYEIGEGSNSWWFPYFKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSHTEWREVKGLMEDSNVKRQLQTFKAWLWASATISSRALYVPWDEA

Query:  GCLCPVGDLFNYAAPEGESLDIRDVSSFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFIL
        GCLCPVGDLFNYAAPE ES DI DVSSFSQHAS  G++TTD LH+E+ DTQRALTDGGF+E VSAYCFYARESYK+G+QVLLSYGTYTNLELL+YYGF+L
Subjt:  GCLCPVGDLFNYAAPEGESLDIRDVSSFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFIL

Query:  QENPNDRVFIPLEHDIYTTSSWPKESLYICQNGNPSFALLSALRLWATRPNKRRGVGHLAYAGSQLSVDNEMSVMQWLSKNCHTVLSNLPTSVEEDRRLL
        QENPNDRVFIPLEHDIY++SSWPKESL+I QNGNPSFALLSALRLWAT PNKRRGVGHLAYAGSQLSV NE+ VMQWLSKNCH VL+NLPTSVEED +LL
Subjt:  QENPNDRVFIPLEHDIYTTSSWPKESLYICQNGNPSFALLSALRLWATRPNKRRGVGHLAYAGSQLSVDNEMSVMQWLSKNCHTVLSNLPTSVEEDRRLL

Query:  CNICKVQDLQIPRGLGKMLSTYGGEFCAFLKTNGLMNGDEAELRLTGKVKHSLDRWKLAVQWRISYKKALIDCISYCTRTISSLS
        CNICK+QDLQ+PR LGKM ST  GEFCAFL+TNGL+N +E EL+LTGK+K SL+RWKLAVQWRI YKKAL+DCISYCTRT  SLS
Subjt:  CNICKVQDLQIPRGLGKMLSTYGGEFCAFLKTNGLMNGDEAELRLTGKVKHSLDRWKLAVQWRISYKKALIDCISYCTRTISSLS

A0A6J1J6L6 protein SET DOMAIN GROUP 40 isoform X12.5e-24183.71Show/hide
Query:  METEGSFENLLRWAADRGISDSVDKESSNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTF
        M TEGSFE+LLRWAAD GISDSVDK+SS+SCLGRSLCV  FPDAGGRGLGAVR+L  GELVL+VPKSVL TTQSL L++EKLSMALKRYPSLSSTQKLTF
Subjt:  METEGSFENLLRWAADRGISDSVDKESSNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTF

Query:  CLLYEIGEGSNSWWFPYFKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSHTEWREVKGLMEDSNVKRQLQTFKAWLWASATISSRALYVPWDEA
        CLLYEIG+GS+SWWFPYFKHLP +Y+ LATFGEFEKQALQVDYA+W  EKAA KSHTEWR VKGLME+SN+K QLQTFKAWLWASATISSRALYVPWDEA
Subjt:  CLLYEIGEGSNSWWFPYFKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSHTEWREVKGLMEDSNVKRQLQTFKAWLWASATISSRALYVPWDEA

Query:  GCLCPVGDLFNYAAPEGESLDIRDVSSFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFIL
        GCLCPVGDLFNYAAPEGESLDI DVSSFSQHAS  G++TTD LH+E+ DTQRALTDGGF+E VSAYCFYARESYK+G+QVLLSYGTY+NLELL+YYGF+L
Subjt:  GCLCPVGDLFNYAAPEGESLDIRDVSSFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFIL

Query:  QENPNDRVFIPLEHDIYTTSSWPKESLYICQNGNPSFALLSALRLWATRPNKRRGVGHLAYAGSQLSVDNEMSVMQWLSKNCHTVLSNLPTSVEEDRRLL
        QENPNDRVFIPLEH+IY++SSWPKESL+I QNGNPSFALLSALRLWAT PNKRRGVGHLAYAGSQLSV NE+ VMQWLSKNCH VL+NLPTSVEED +LL
Subjt:  QENPNDRVFIPLEHDIYTTSSWPKESLYICQNGNPSFALLSALRLWATRPNKRRGVGHLAYAGSQLSVDNEMSVMQWLSKNCHTVLSNLPTSVEEDRRLL

Query:  CNICKVQDLQIPRGLGKMLSTYGGEFCAFLKTNGLMNGDEAELRLTGKVKHSLDRWKLAVQWRISYKKALIDCISYCTRTISSLS
        CNICK+QDLQ P  LGKML T GGEFCAFL+T GL+N +E EL LTGK+K SL+RWKLAVQWRI YKKAL+DC SYCTRT  SLS
Subjt:  CNICKVQDLQIPRGLGKMLSTYGGEFCAFLKTNGLMNGDEAELRLTGKVKHSLDRWKLAVQWRISYKKALIDCISYCTRTISSLS

SwissProt top hitse value%identityAlignment
B2KI88 Actin-histidine N-methyltransferase2.6e-1423.45Show/hide
Query:  EGSFENLLRWAADRGIS-DSVDKESSNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLT--F
        E  F +L++WA++ G S +  +  S                  G GL A R++   EL L VP+ +L T +S   +N  L     +   L +   +T  F
Subjt:  EGSFENLLRWAADRGIS-DSVDKESSNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLT--F

Query:  CLLYEIGEGSNSWWFPYFKHLPQSYDILATFGEFEKQALQVDYAI---WATEKAALKSHTEWREVKGLMEDSNVKRQLQ---TFKAWLWASATISSRALY
         LL E  +  NS+W PY + LP  YD    FGE E + LQ   AI   ++  K   + +  + +V      +N K  L+   T++ + WA +++ +R   
Subjt:  CLLYEIGEGSNSWWFPYFKHLPQSYDILATFGEFEKQALQVDYAI---WATEKAALKSHTEWREVKGLMEDSNVKRQLQ---TFKAWLWASATISSRALY

Query:  VPWDEAG----CLCPVGDLFNYAAPEGESLDIRDVSSFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNL
        +P ++       L P+ D+ N+                                     T   +T G   E     C  A + ++ G+Q+ + YGT +N 
Subjt:  VPWDEAG----CLCPVGDLFNYAAPEGESLDIRDVSSFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNL

Query:  ELLEYYGFILQENPNDRVFIPL-----------EHDIYTTSSWPKESLYICQNGNP--SFALLSALRLWATRPNKRR-------GVGHLAYAGSQ---LS
        E + + GF    N +DRV I L           + ++   +  P  S++      P  S  LL+ LR++     + +        +  +   G+    +S
Subjt:  ELLEYYGFILQENPNDRVFIPL-----------EHDIYTTSSWPKESLYICQNGNP--SFALLSALRLWATRPNKRR-------GVGHLAYAGSQ---LS

Query:  VDNEMSVMQWLSKNCHTVLSNLPTSVEEDRRLLCN
         DNE+ +  +L      +L    T++EED+  L N
Subjt:  VDNEMSVMQWLSKNCHTVLSNLPTSVEEDRRLLCN

B7ZUF3 Actin-histidine N-methyltransferase2.4e-1524.19Show/hide
Query:  EGSFENLLRWAADRGISDSVDKESSNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTFCLL
        E  F  L+ W  + G S            G  L    FP+  G GL A R +   EL L VP+ +L T +S          +  R         L F LL
Subjt:  EGSFENLLRWAADRGISDSVDKESSNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTFCLL

Query:  YEIGEGSNSWWFPYFKHLPQSYDILATFGEFEKQALQVDYAI---WATEKAALKSHTEWREVKGLMEDSNVKRQLQ---TFKAWLWASATISSRALYVPW
         E  +  NS+W PY K LP  YD    F E E Q LQ   AI   ++  K   + +  + +V     ++N K  L+   TF  + WA +++ +R   +P 
Subjt:  YEIGEGSNSWWFPYFKHLPQSYDILATFGEFEKQALQVDYAI---WATEKAALKSHTEWREVKGLMEDSNVKRQLQ---TFKAWLWASATISSRALYVPW

Query:  DEAG----CLCPVGDLFNYAAPEGESLDIRDVSSFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELL
        ++       L P+ D+ N+                                     T   +T G   E     C  A + +K G+Q+ + YGT +N E +
Subjt:  DEAG----CLCPVGDLFNYAAPEGESLDIRDVSSFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELL

Query:  EYYGFILQENPNDRVFIPL-----------EHDIYTTSSWPKESLYICQNGNP--SFALLSALRLWATRPNKRRG----------VGHLAYAGSQLSVDN
         + GF  + N +DRV I L           + ++   +  P  S++      P  S  LL+ LR++    ++ +G          +  L  +   +S +N
Subjt:  EYYGFILQENPNDRVFIPL-----------EHDIYTTSSWPKESLYICQNGNP--SFALLSALRLWATRPNKRRG----------VGHLAYAGSQLSVDN

Query:  EMSVMQWLSKNCHTVLSNLPTSVEEDRRLL
        E+ +  +L      +L    T+VE+D ++L
Subjt:  EMSVMQWLSKNCHTVLSNLPTSVEEDRRLL

E2RBS6 Actin-histidine N-methyltransferase1.7e-1323.56Show/hide
Query:  EGSFENLLRWAADRGISDSVDKESSNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLT--FC
        E  F +L++WA++ G S     E  N           F +  G GL A R++   EL L VP+ +L T +S   +N  L     +   L +   +T  F 
Subjt:  EGSFENLLRWAADRGISDSVDKESSNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLT--FC

Query:  LLYEIGEGSNSWWFPYFKHLPQSYDILATFGEFEKQALQVDYAI---WATEKAALKSHTEWREVKGLMEDSN--VKRQLQTFKAWLWASATISSRALYVP
        LL E  +  NS+W PY + LP  YD    F E E + LQ   AI   ++  K   + +  + +V      +N    +   T++ + WA +++ +R   +P
Subjt:  LLYEIGEGSNSWWFPYFKHLPQSYDILATFGEFEKQALQVDYAI---WATEKAALKSHTEWREVKGLMEDSN--VKRQLQTFKAWLWASATISSRALYVP

Query:  WDEAG----CLCPVGDLFNYAAPEGESLDIRDVSSFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLEL
         ++       L P+ D+ N+                                     T   +T G   E     C   R+ ++ G+Q+ + YGT +N E 
Subjt:  WDEAG----CLCPVGDLFNYAAPEGESLDIRDVSSFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLEL

Query:  LEYYGFILQENPNDRVFIPL-----------EHDIYTTSSWPKESLYICQNGNP--SFALLSALRLWATR----------PNKRRGVGHLAYAGSQLSVD
        + + GF    N +DRV I L           + ++   +  P  S++     +P  S  LL+ LR++              N    +  L  +   +S D
Subjt:  LEYYGFILQENPNDRVFIPL-----------EHDIYTTSSWPKESLYICQNGNP--SFALLSALRLWATR----------PNKRRGVGHLAYAGSQLSVD

Query:  NEMSVMQWLSKNCHTVLSNLPTSVEEDRRLLCN
        NE+ +  +L      +L    T++EED+  L N
Subjt:  NEMSVMQWLSKNCHTVLSNLPTSVEEDRRLLCN

Q5ZML9 Actin-histidine N-methyltransferase1.7e-1323.54Show/hide
Query:  FENLLRWAADRGISDSVDKESSNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLT--FCLLY
        F  L++WA + G S +   E +N             +  G GL A R +   EL L VP+ +L T +S   +N  L     +   L +   +T  F LL 
Subjt:  FENLLRWAADRGISDSVDKESSNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLT--FCLLY

Query:  EIGEGSNSWWFPYFKHLPQSYDILATFGEFEKQAL---QVDYAIWATEKAALKSHTEWREVKGLMEDSNVKRQLQ---TFKAWLWASATISSRALYVPWD
        E     NS+W PY + LP  YD    F E E Q L   Q  + +++  K   + +  + +V     +++ K  L+   T+  + WA +++ +R   +P +
Subjt:  EIGEGSNSWWFPYFKHLPQSYDILATFGEFEKQAL---QVDYAIWATEKAALKSHTEWREVKGLMEDSNVKRQLQ---TFKAWLWASATISSRALYVPWD

Query:  EAG----CLCPVGDLFNYAAPEGESLDIRDVSSFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLE
        +       L P+ D+ N+                                     T   +T G   E     C  A + +K G+Q+ + YGT +N E + 
Subjt:  EAG----CLCPVGDLFNYAAPEGESLDIRDVSSFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLE

Query:  YYGFILQENPNDRVFIPL-----------EHDIYTTSSWPKESLYICQNGNP--SFALLSALRLWATRPN--KRRGVGH--------LAYAGSQLSVDNE
        + GF    N +DRV I L           + ++   +  P  S++   +  P  S  LL+ LR++       K   +G         L  +   +S DNE
Subjt:  YYGFILQENPNDRVFIPL-----------EHDIYTTSSWPKESLYICQNGNP--SFALLSALRLWATRPN--KRRGVGH--------LAYAGSQLSVDNE

Query:  MSVMQWLSKNCHTVLSNLPTSVEEDRRLL
        + +  +L      +L    T+VE+D+  L
Subjt:  MSVMQWLSKNCHTVLSNLPTSVEEDRRLL

Q6NQJ8 Protein SET DOMAIN GROUP 401.7e-13853.59Show/hide
Query:  SFENLLRWAADRGISDSVDKES-SNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTFCLLY
        + E  LRWAA+ GISDS+D     +SCLG SL VS FPDAGGRGLGA R L  GELVL+VP+  L TT+S++ ++ KLS A+  + SLSSTQ L+ CLLY
Subjt:  SFENLLRWAADRGISDSVDKES-SNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTFCLLY

Query:  EIGEGSNSWWFPYFKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSHTEWREVKGLMEDSNVKRQLQTFKAWLWASATISSRALYVPWDEAGCLC
        E+ +   S+W+PY  H+P+ YD+LATFG FEKQALQV+ A+WATEKA  K  +EW+E   LM++  +K + ++F+AWLWASATISSR L+VPWD AGCLC
Subjt:  EIGEGSNSWWFPYFKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSHTEWREVKGLMEDSNVKRQLQTFKAWLWASATISSRALYVPWDEAGCLC

Query:  PVGDLFNYAAPEGESLDIRDVSSFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFILQENP
        PVGDLFNY AP   S   +   S +    +G      E H E+      LTDGGF+E V+AYC YAR +Y+ G+QVLL YGTYTNLELLE+YGF+L+EN 
Subjt:  PVGDLFNYAAPEGESLDIRDVSSFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFILQENP

Query:  NDRVFIPLEHDIYT-TSSWPKESLYICQNGNPSFALLSALRLWATRPNKR-RGVGHLAYAGSQLSVDNEMSVMQWLSKNCHTVLSNLPTSVEEDRRLLCN
        ND+VFIPLE  +++  SSWPK+SLYI Q+G  SFAL+S LRLW    ++R + V  L YAGSQ+SV NE+ VM+W+S+ C +VL +LPTSV ED  LL N
Subjt:  NDRVFIPLEHDIYT-TSSWPKESLYICQNGNPSFALLSALRLWATRPNKR-RGVGHLAYAGSQLSVDNEMSVMQWLSKNCHTVLSNLPTSVEEDRRLLCN

Query:  ICKVQDLQIPRGLGKMLSTYGGEFCAFLKTNGL-----MNGDEAELRLTGKVKHSLDRWKLAVQWRISYKKALIDCISYCTRTISSL
        I K+QD ++ R   K    +G E  AFL  N L     ++G   E   + K    L +W+ +VQWR+SYK+ L DCISYC   +++L
Subjt:  ICKVQDLQIPRGLGKMLSTYGGEFCAFLKTNGL-----MNGDEAELRLTGKVKHSLDRWKLAVQWRISYKKALIDCISYCTRTISSL

Arabidopsis top hitse value%identityAlignment
AT1G24610.1 Rubisco methyltransferase family protein2.7e-0620.59Show/hide
Query:  GRGLGAVRNLNNGELVLRVPKSV---LFTTQSLLLENEKLSMALKRYPSLSSTQKLTFCLLYEIGEGSNSWWFPYFKHLPQSYDILATFGEFEKQALQVD
        G GL +   ++ G  ++ +P  V     +  S    +  LS   +R P      KL   LL E     + WW PY  +LP++Y +   F   + + LQ  
Subjt:  GRGLGAVRNLNNGELVLRVPKSV---LFTTQSLLLENEKLSMALKRYPSLSSTQKLTFCLLYEIGEGSNSWWFPYFKHLPQSYDILATFGEFEKQALQVD

Query:  YAIWATEKAALKSHTEWREVKGLMEDSNVKRQLQTF-------KAWLWASATISSRALYV---------PWDEAGCLCPVGDLFNYAAPEGESLDIRDVS
          +    K         +E++  +ED  VK     F        A  W  + +S+RA  +           D+   + P+ D+ N+              
Subjt:  YAIWATEKAALKSHTEWREVKGLMEDSNVKRQLQTF-------KAWLWASATISSRALYV---------PWDEAGCLCPVGDLFNYAAPEGESLDIRDVS

Query:  SFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFILQENPNDRVFIPLEHDIYTTSSWPK--
        SF  +A          + E+ G     L               A    K+   +LL+YG  +N   L  YGF+++ NP D + +  +  +   +S     
Subjt:  SFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFILQENPNDRVFIPLEHDIYTTSSWPK--

Query:  ESLYICQNGNPSFALLSALRLWATRPNKRRGVG-------------HLAYAGSQLSVD-------------------NEMSVMQWLSKNCHTVLSNLPTS
         S            LLS L L    PN +  +G              +   G  + V+                   NE++V + +   C   LS+ PT 
Subjt:  ESLYICQNGNPSFALLSALRLWATRPNKRRGVG-------------HLAYAGSQLSVD-------------------NEMSVMQWLSKNCHTVLSNLPTS

Query:  VEEDRRLL
        + ED  ++
Subjt:  VEEDRRLL

AT2G18850.1 SET domain-containing protein1.7e-0823.16Show/hide
Query:  DAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTFCLLYEIGEGSN--SWWFPYFKHLPQSYDILATFGEFEKQALQ
        D  GRG  A  +L  G++ L +P S + + +   + N  +   L+ +  ++S    T  LL+ + E  N  S + PYF  L +++    +FG      ++
Subjt:  DAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTFCLLYEIGEGSN--SWWFPYFKHLPQSYDILATFGEFEKQALQ

Query:  VDYAIWATEKAALKSHTEWREVKGLMEDSNVKR----QLQTFKAWLWASATISSRALYVPWDEA---GCLCPVGDLFNYAAPEGESLDIRDVSSFSQHAS
        +D  +   E    K     R  + +   SN +     +L T++ +LWA     S ++ + + +     CL PV    N+              S   H  
Subjt:  VDYAIWATEKAALKSHTEWREVKGLMEDSNVKR----QLQTFKAWLWASATISSRALYVPWDEA---GCLCPVGDLFNYAAPEGESLDIRDVSSFSQHAS

Query:  SGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFILQ-ENPNDRVFIPLEHDI-----------YTT--
          G +                     D + S+  F       KG+Q  LSYG Y++  LL +YGF+ + +NP D   IPL+ D+           +TT  
Subjt:  SGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFILQ-ENPNDRVFIPLEHDI-----------YTT--

Query:  --SSWPKESLYICQNGNPSFALLSALRLWATRPNKRRGVGHLAYAGSQLSVDNEMSVMQWLSKNCHTVLSNLPTSVEEDR
           +W   +  I   G P+  LL+ LR       K  G+ H +      +++ E+ V++ L      ++ NL  +   DR
Subjt:  --SSWPKESLYICQNGNPSFALLSALRLWATRPNKRRGVGHLAYAGSQLSVDNEMSVMQWLSKNCHTVLSNLPTSVEEDR

AT2G18850.2 SET domain-containing protein2.4e-0724.25Show/hide
Query:  DAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTFCLLYEIGEGSN--SWWFPYFKHLPQSYDILATFGEFEKQALQ
        D  GRG  A  +L  G++ L +P S + + +   + N  +   L+ +  ++S    T  LL+ + E  N  S + PYF  L +++    +FG      ++
Subjt:  DAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTFCLLYEIGEGSN--SWWFPYFKHLPQSYDILATFGEFEKQALQ

Query:  VDYAIWATEKAALKSHTEWREVKGLMEDSNVKR----QLQTFKAWLWASATISSRALYVPWDEA---GCLCPVGDLFNYAAPEGESLDIRDVSSFSQHAS
        +D  +   E    K     R  + +   SN +     +L T++ +LWA     S ++ + + +     CL PV    N+              S   H  
Subjt:  VDYAIWATEKAALKSHTEWREVKGLMEDSNVKR----QLQTFKAWLWASATISSRALYVPWDEA---GCLCPVGDLFNYAAPEGESLDIRDVSSFSQHAS

Query:  SGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFILQ-ENPNDRVFIPL------EHDIYTTSSWPKES
          G +                     D + S+  F       KG+Q  LSYG Y++  LL +YGF+ + +NP D   IPL      + DI T  SW    
Subjt:  SGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFILQ-ENPNDRVFIPL------EHDIYTTSSWPKES

Query:  L
        L
Subjt:  L

AT3G07670.1 Rubisco methyltransferase family protein7.5e-0923.08Show/hide
Query:  DAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTFCLLYEIGEGSNSWWFPYFKHLP-QSYDILATFGEFEKQALQV
        D G RGL A +NL  GE +L VP S++ +  S    N +    +KRY  +     L   L+ E     +S WF Y   LP Q Y +L     + +  L +
Subjt:  DAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTFCLLYEIGEGSNSWWFPYFKHLP-QSYDILATFGEFEKQALQV

Query:  DYAIWATEKAALKSHTEWREVKGLMEDSNVK----------RQLQTFKAWLWASATISSRALYVP-WDEAGCLCPVGDLFNYAAPEGESLDIRDVSSFSQ
                + A++  T    V G  ED   +          +++   + + W+   + SR + +P  D    L P  D+ N+       LD         
Subjt:  DYAIWATEKAALKSHTEWREVKGLMEDSNVK----------RQLQTFKAWLWASATISSRALYVP-WDEAGCLCPVGDLFNYAAPEGESLDIRDVSSFSQ

Query:  HASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFILQE--NPNDRVFIPL----------------
                                    +D+      F     Y+ G+QV +SYG  +N ELL  YGF+ +E  NP+D V + L                
Subjt:  HASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFILQE--NPNDRVFIPL----------------

Query:  -EHDIYTTSSWPKESLYICQNGNPSFALLSALRLWATRPNKRRGVGHLAYAGS-QLSVDNEMSVMQWLSKNCHTVLSNLPTSVEEDRRLL
         +H + T   +P     +   G P   L++   L  + P+ R     +A A S + S  N++   +        +L +  TS+ +  R L
Subjt:  -EHDIYTTSSWPKESLYICQNGNPSFALLSALRLWATRPNKRRGVGHLAYAGS-QLSVDNEMSVMQWLSKNCHTVLSNLPTSVEEDRRLL

AT5G17240.1 SET domain group 401.2e-13953.59Show/hide
Query:  SFENLLRWAADRGISDSVDKES-SNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTFCLLY
        + E  LRWAA+ GISDS+D     +SCLG SL VS FPDAGGRGLGA R L  GELVL+VP+  L TT+S++ ++ KLS A+  + SLSSTQ L+ CLLY
Subjt:  SFENLLRWAADRGISDSVDKES-SNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTFCLLY

Query:  EIGEGSNSWWFPYFKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSHTEWREVKGLMEDSNVKRQLQTFKAWLWASATISSRALYVPWDEAGCLC
        E+ +   S+W+PY  H+P+ YD+LATFG FEKQALQV+ A+WATEKA  K  +EW+E   LM++  +K + ++F+AWLWASATISSR L+VPWD AGCLC
Subjt:  EIGEGSNSWWFPYFKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSHTEWREVKGLMEDSNVKRQLQTFKAWLWASATISSRALYVPWDEAGCLC

Query:  PVGDLFNYAAPEGESLDIRDVSSFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFILQENP
        PVGDLFNY AP   S   +   S +    +G      E H E+      LTDGGF+E V+AYC YAR +Y+ G+QVLL YGTYTNLELLE+YGF+L+EN 
Subjt:  PVGDLFNYAAPEGESLDIRDVSSFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFILQENP

Query:  NDRVFIPLEHDIYT-TSSWPKESLYICQNGNPSFALLSALRLWATRPNKR-RGVGHLAYAGSQLSVDNEMSVMQWLSKNCHTVLSNLPTSVEEDRRLLCN
        ND+VFIPLE  +++  SSWPK+SLYI Q+G  SFAL+S LRLW    ++R + V  L YAGSQ+SV NE+ VM+W+S+ C +VL +LPTSV ED  LL N
Subjt:  NDRVFIPLEHDIYT-TSSWPKESLYICQNGNPSFALLSALRLWATRPNKR-RGVGHLAYAGSQLSVDNEMSVMQWLSKNCHTVLSNLPTSVEEDRRLLCN

Query:  ICKVQDLQIPRGLGKMLSTYGGEFCAFLKTNGL-----MNGDEAELRLTGKVKHSLDRWKLAVQWRISYKKALIDCISYCTRTISSL
        I K+QD ++ R   K    +G E  AFL  N L     ++G   E   + K    L +W+ +VQWR+SYK+ L DCISYC   +++L
Subjt:  ICKVQDLQIPRGLGKMLSTYGGEFCAFLKTNGL-----MNGDEAELRLTGKVKHSLDRWKLAVQWRISYKKALIDCISYCTRTISSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAACTGAAGGAAGTTTCGAAAACCTGCTGAGATGGGCGGCGGATCGTGGAATTTCAGATTCTGTCGACAAAGAGAGTTCAAATTCTTGTCTGGGTCGTTCTTTATG
CGTCTCTCTCTTCCCTGATGCTGGCGGGAGAGGTTTAGGGGCTGTGCGTAATCTTAACAATGGAGAATTAGTACTGAGAGTTCCAAAATCTGTCTTGTTTACGACCCAAA
GTTTGCTGTTGGAAAATGAGAAGCTCTCCATGGCTCTGAAGAGATACCCATCTCTTTCTTCTACTCAGAAATTGACCTTCTGTTTACTCTATGAGATTGGTGAAGGGAGC
AATTCTTGGTGGTTTCCTTACTTCAAGCACTTGCCTCAGAGCTATGACATACTGGCAACTTTTGGAGAATTCGAAAAGCAAGCCCTGCAGGTGGATTATGCCATCTGGGC
AACAGAAAAGGCCGCTTTGAAGTCTCATACGGAGTGGAGAGAAGTTAAAGGACTTATGGAAGATTCTAATGTTAAAAGGCAACTTCAAACATTCAAGGCATGGCTTTGGG
CCTCTGCAACTATATCATCTAGGGCATTGTATGTACCATGGGATGAGGCTGGATGTCTATGTCCAGTTGGTGACTTGTTTAATTATGCTGCACCTGAGGGAGAGTCCCTT
GATATTAGGGATGTCTCATCTTTTTCACAACATGCTTCTTCGGGTGGAGACATGACTACTGACGAGTTACACGAAGAGCAGGGGGATACTCAGCGGGCTTTGACCGATGG
TGGATTTGATGAAAAGGTCTCTGCATACTGCTTTTATGCAAGGGAAAGTTATAAGAAGGGACAGCAGGTTCTTTTAAGCTATGGTACATACACAAACTTAGAGCTTCTTG
AATATTATGGGTTTATTCTACAGGAAAATCCAAATGACAGAGTTTTTATACCTTTGGAACATGACATCTACACTACCAGTTCTTGGCCTAAGGAGTCTCTTTATATTTGT
CAAAACGGAAACCCATCTTTTGCTCTACTTTCCGCTCTGAGATTATGGGCAACCCGCCCGAACAAACGTAGAGGTGTTGGACATCTTGCTTATGCTGGGTCACAACTCTC
CGTCGATAACGAAATGTCTGTGATGCAGTGGTTATCAAAGAACTGCCATACTGTTTTAAGCAATCTGCCAACATCGGTTGAAGAAGACAGACGACTTCTGTGCAACATCT
GCAAAGTCCAGGATCTGCAGATACCGAGGGGACTCGGGAAGATGCTATCGACTTATGGTGGTGAGTTTTGTGCTTTCTTGAAGACCAATGGTCTGATGAACGGAGATGAA
GCCGAGTTACGTTTAACCGGGAAAGTAAAGCATTCTCTGGATAGATGGAAATTAGCAGTCCAGTGGAGGATCTCGTACAAGAAGGCTTTGATTGATTGCATAAGTTACTG
CACCAGAACTATTAGTTCTTTATCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAACTGAAGGAAGTTTCGAAAACCTGCTGAGATGGGCGGCGGATCGTGGAATTTCAGATTCTGTCGACAAAGAGAGTTCAAATTCTTGTCTGGGTCGTTCTTTATG
CGTCTCTCTCTTCCCTGATGCTGGCGGGAGAGGTTTAGGGGCTGTGCGTAATCTTAACAATGGAGAATTAGTACTGAGAGTTCCAAAATCTGTCTTGTTTACGACCCAAA
GTTTGCTGTTGGAAAATGAGAAGCTCTCCATGGCTCTGAAGAGATACCCATCTCTTTCTTCTACTCAGAAATTGACCTTCTGTTTACTCTATGAGATTGGTGAAGGGAGC
AATTCTTGGTGGTTTCCTTACTTCAAGCACTTGCCTCAGAGCTATGACATACTGGCAACTTTTGGAGAATTCGAAAAGCAAGCCCTGCAGGTGGATTATGCCATCTGGGC
AACAGAAAAGGCCGCTTTGAAGTCTCATACGGAGTGGAGAGAAGTTAAAGGACTTATGGAAGATTCTAATGTTAAAAGGCAACTTCAAACATTCAAGGCATGGCTTTGGG
CCTCTGCAACTATATCATCTAGGGCATTGTATGTACCATGGGATGAGGCTGGATGTCTATGTCCAGTTGGTGACTTGTTTAATTATGCTGCACCTGAGGGAGAGTCCCTT
GATATTAGGGATGTCTCATCTTTTTCACAACATGCTTCTTCGGGTGGAGACATGACTACTGACGAGTTACACGAAGAGCAGGGGGATACTCAGCGGGCTTTGACCGATGG
TGGATTTGATGAAAAGGTCTCTGCATACTGCTTTTATGCAAGGGAAAGTTATAAGAAGGGACAGCAGGTTCTTTTAAGCTATGGTACATACACAAACTTAGAGCTTCTTG
AATATTATGGGTTTATTCTACAGGAAAATCCAAATGACAGAGTTTTTATACCTTTGGAACATGACATCTACACTACCAGTTCTTGGCCTAAGGAGTCTCTTTATATTTGT
CAAAACGGAAACCCATCTTTTGCTCTACTTTCCGCTCTGAGATTATGGGCAACCCGCCCGAACAAACGTAGAGGTGTTGGACATCTTGCTTATGCTGGGTCACAACTCTC
CGTCGATAACGAAATGTCTGTGATGCAGTGGTTATCAAAGAACTGCCATACTGTTTTAAGCAATCTGCCAACATCGGTTGAAGAAGACAGACGACTTCTGTGCAACATCT
GCAAAGTCCAGGATCTGCAGATACCGAGGGGACTCGGGAAGATGCTATCGACTTATGGTGGTGAGTTTTGTGCTTTCTTGAAGACCAATGGTCTGATGAACGGAGATGAA
GCCGAGTTACGTTTAACCGGGAAAGTAAAGCATTCTCTGGATAGATGGAAATTAGCAGTCCAGTGGAGGATCTCGTACAAGAAGGCTTTGATTGATTGCATAAGTTACTG
CACCAGAACTATTAGTTCTTTATCTTAA
Protein sequenceShow/hide protein sequence
METEGSFENLLRWAADRGISDSVDKESSNSCLGRSLCVSLFPDAGGRGLGAVRNLNNGELVLRVPKSVLFTTQSLLLENEKLSMALKRYPSLSSTQKLTFCLLYEIGEGS
NSWWFPYFKHLPQSYDILATFGEFEKQALQVDYAIWATEKAALKSHTEWREVKGLMEDSNVKRQLQTFKAWLWASATISSRALYVPWDEAGCLCPVGDLFNYAAPEGESL
DIRDVSSFSQHASSGGDMTTDELHEEQGDTQRALTDGGFDEKVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFILQENPNDRVFIPLEHDIYTTSSWPKESLYIC
QNGNPSFALLSALRLWATRPNKRRGVGHLAYAGSQLSVDNEMSVMQWLSKNCHTVLSNLPTSVEEDRRLLCNICKVQDLQIPRGLGKMLSTYGGEFCAFLKTNGLMNGDE
AELRLTGKVKHSLDRWKLAVQWRISYKKALIDCISYCTRTISSLS