; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr029814 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr029814
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionSET domain-containing protein
Genome locationtig00153533:1420045..1433826
RNA-Seq ExpressionSgr029814
SyntenySgr029814
Gene Ontology termsGO:0032259 - methylation (biological process)
GO:0005515 - protein binding (molecular function)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domainsIPR001214 - SET domain
IPR036464 - Rubisco LSMT, substrate-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022153386.1 uncharacterized protein LOC111020893 isoform X1 [Momordica charantia]1.6e-28989.15Show/hide
Query:  QPSSSDKVRDD-DCLMFLALDKNDHLFYKKKKLLERQGFKSENCIYLKCSLCPEEVDTVLKELVQIARIIHLNEPEIYFGEDDACTPADSYSPRNEMEAL
        QPSSSD+VRDD DCL+ LALDKNDHLF KKKKLLERQGFKSENCIYLKCSLCP+EVDTVLKELVQIARIIHLNEPEIYFGEDDACTPADSYSPRNEMEAL
Subjt:  QPSSSDKVRDD-DCLMFLALDKNDHLFYKKKKLLERQGFKSENCIYLKCSLCPEEVDTVLKELVQIARIIHLNEPEIYFGEDDACTPADSYSPRNEMEAL

Query:  NSIISLVDISLSSCMPIQFNVLQKLRKAVIGMIHEFGDVYSMDAKTLGDTSCVKENCLLQWGERNGVRTSLQIAYVEGAGRGTIAKNDLDVGDTVLEIPL
        N+IISLVDISLSSC P+Q NVLQ+LRKAVI MIHE+GDVYSMDAKTLGD  CVKENCLL WGE NGVRTSLQIAYVEG GRGTIAK DLDVGDTVLEIP+
Subjt:  NSIISLVDISLSSCMPIQFNVLQKLRKAVIGMIHEFGDVYSMDAKTLGDTSCVKENCLLQWGERNGVRTSLQIAYVEGAGRGTIAKNDLDVGDTVLEIPL

Query:  AIIISEELVQKTNMYPILSKIEGMSSETMLLIWSMKEKHIANSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLFDELMQAKEHLREQYNELFPALCNN
        AIIISEELV+K+NMYPIL+KIE +SSETMLL+WSMKEKHIA+SKFKVYFDTLPEAFNTGLSFGVGAM+TLDGTLLFDELMQAKE LREQY+EL PALCNN
Subjt:  AIIISEELVQKTNMYPILSKIEGMSSETMLLIWSMKEKHIANSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLFDELMQAKEHLREQYNELFPALCNN

Query:  HPDVFPEEFYSWEQFLWACELWYSNSLKIMFPDGNLRTCLVPIAGFLNHSLHPHILHYGKVDSDTNSLKFRLSRPCRAGEQCYLSYGNYSGSHLVAFYGF
        HPDVFPEEFYSWEQFLWACELWY+NSLKI F DGNLRTCLVPIAGFLNHSLHPHILHYGKVDS+TNSLKFRLSRPCR  EQCYLSYGNY+GSHLV FYGF
Subjt:  HPDVFPEEFYSWEQFLWACELWYSNSLKIMFPDGNLRTCLVPIAGFLNHSLHPHILHYGKVDSDTNSLKFRLSRPCRAGEQCYLSYGNYSGSHLVAFYGF

Query:  LPEGDNLNDIIPLDIDFGDDDDSIITSDCSTHMVRGTWLSKNQSIFHYGLPSPLLECLRKARCPGLYTKCKLQGSLENEMEVLNDLLSIFDGMMENLEDE
        LPEGDNLND+IPLDIDFGDDD   ITSD +THMVRGTWLSKNQSIFHYG+PSPLLECLRKARC GL+TK KLQ SLE EMEVLNDL+SIFDGMMENL+DE
Subjt:  LPEGDNLNDIIPLDIDFGDDDDSIITSDCSTHMVRGTWLSKNQSIFHYGLPSPLLECLRKARCPGLYTKCKLQGSLENEMEVLNDLLSIFDGMMENLEDE

Query:  NEDRSSTEWDIKLALEYKDLQRRIVSSGLTSCFAGRQMVEFALCECMLEDTRG
        NEDRSS EWDIKLAL+YKDLQRRI+SSG TSC+AGR+MVE ALCECMLEDTRG
Subjt:  NEDRSSTEWDIKLALEYKDLQRRIVSSGLTSCFAGRQMVEFALCECMLEDTRG

XP_022153388.1 uncharacterized protein LOC111020893 isoform X2 [Momordica charantia]5.6e-28788.79Show/hide
Query:  QPSSSDKVRDD-DCLMFLALDKNDHLFYKKKKLLERQGFKSENCIYLKCSLCPEEVDTVLKELVQIARIIHLNEPEIYFGEDDACTPADSYSPRNEMEAL
        QPSSSD+VRDD DCL+ LALDKNDHLF KKKKLLERQGFKSENCIYLKCSLCP+EVDTVLKELVQIARIIHLNEPEIYFGEDDACTPADSYSPRNEMEAL
Subjt:  QPSSSDKVRDD-DCLMFLALDKNDHLFYKKKKLLERQGFKSENCIYLKCSLCPEEVDTVLKELVQIARIIHLNEPEIYFGEDDACTPADSYSPRNEMEAL

Query:  NSIISLVDISLSSCMPIQFNVLQKLRKAVIGMIHEFGDVYSMDAKTLGDTSCVKENCLLQWGERNGVRTSLQIAYVEGAGRGTIAKNDLDVGDTVLEIPL
        N+IISLVDISLSSC P+Q NVLQ+LRKAVI MIHE+GDVYSMDAKTLGD  CVKENCLL WGE NGVRTSLQIAYVEG GRGTIAK DLDVGDTVLEIP+
Subjt:  NSIISLVDISLSSCMPIQFNVLQKLRKAVIGMIHEFGDVYSMDAKTLGDTSCVKENCLLQWGERNGVRTSLQIAYVEGAGRGTIAKNDLDVGDTVLEIPL

Query:  AIIISEELVQKTNMYPILSKIEGMSSETMLLIWSMKEKHIANSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLFDELMQAKEHLREQYNELFPALCNN
        AIIISEELV+K+NMYPIL+KIE +SSETMLL+WSMKEKHIA+SKFKVYFDTLPEAFNTGLSFGVGAM+TLDGTLLFDELMQAKE LREQY+EL PALCNN
Subjt:  AIIISEELVQKTNMYPILSKIEGMSSETMLLIWSMKEKHIANSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLFDELMQAKEHLREQYNELFPALCNN

Query:  HPDVFPEEFYSWEQFLWACELWYSNSLKIMFPDGNLRTCLVPIAGFLNHSLHPHILHYGKVDSDTNSLKFRLSRPCRAGEQCYLSYGNYSGSHLVAFYGF
        HPDVFPEEFYSWEQFLWACELWY+NSLKI F DGNLRTCLVPIAGFLNHSLHPHILHYGKVDS+TNSLKFRLSRPCR  EQCYLSYGNY+GSHLV FYGF
Subjt:  HPDVFPEEFYSWEQFLWACELWYSNSLKIMFPDGNLRTCLVPIAGFLNHSLHPHILHYGKVDSDTNSLKFRLSRPCRAGEQCYLSYGNYSGSHLVAFYGF

Query:  LPEGDNLNDIIPLDIDFGDDDDSIITSDCSTHMVRGTWLSKNQSIFHYGLPSPLLECLRKARCPGLYTKCKLQGSLENEMEVLNDLLSIFDGMMENLEDE
        LPEGDNLND+IPLDIDFGDDD   ITSD +THMVRGTWLSKNQSIFHYG+PSPLLECLRKARC GL+TK K   SLE EMEVLNDL+SIFDGMMENL+DE
Subjt:  LPEGDNLNDIIPLDIDFGDDDDSIITSDCSTHMVRGTWLSKNQSIFHYGLPSPLLECLRKARCPGLYTKCKLQGSLENEMEVLNDLLSIFDGMMENLEDE

Query:  NEDRSSTEWDIKLALEYKDLQRRIVSSGLTSCFAGRQMVEFALCECMLEDTRG
        NEDRSS EWDIKLAL+YKDLQRRI+SSG TSC+AGR+MVE ALCECMLEDTRG
Subjt:  NEDRSSTEWDIKLALEYKDLQRRIVSSGLTSCFAGRQMVEFALCECMLEDTRG

XP_022937780.1 uncharacterized protein LOC111444075 isoform X1 [Cucurbita moschata]1.3e-28387.86Show/hide
Query:  PSSSDKVRD-DDCLMFLALDKNDHLFYKKKKLLERQGFKSENCIYLKCSLCPEEVDTVLKELVQIARIIHLNEPEIYFGEDDACTPADSYSPRNEMEALN
        PS+SD+VRD +DC + LAL++NDHLF KKKKLLERQGFKSENCIYLKCSLC EEVDTVLKELVQIARIIHLNEPE+YFGEDDACTPADSYSPRNEMEALN
Subjt:  PSSSDKVRD-DDCLMFLALDKNDHLFYKKKKLLERQGFKSENCIYLKCSLCPEEVDTVLKELVQIARIIHLNEPEIYFGEDDACTPADSYSPRNEMEALN

Query:  SIISLVDISLSSCMPIQFNVLQKLRKAVIGMIHEFGDVYSMDAKTLGDTSCVKENCLLQWGERNGVRTSLQIAYVEGAGRGTIAKNDLDVGDTVLEIPLA
        +IISLVDI LSSC P+Q NVLQ+LRKA I MIH++GDVYSMDAKTLGD SCVKENCLLQWGE NGVRTSL+IAYVEGAGRG IAK DL+VGDTVLEIPL 
Subjt:  SIISLVDISLSSCMPIQFNVLQKLRKAVIGMIHEFGDVYSMDAKTLGDTSCVKENCLLQWGERNGVRTSLQIAYVEGAGRGTIAKNDLDVGDTVLEIPLA

Query:  IIISEELVQKTNMYPILSKIEGMSSETMLLIWSMKEKHIANSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLFDELMQAKEHLREQYNELFPALCNNH
        I+ISEELVQKT MYPILSKIEGMSSETMLLIWSMKEKHIA+SKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLF E+MQAKEHLREQYNELFP LCNNH
Subjt:  IIISEELVQKTNMYPILSKIEGMSSETMLLIWSMKEKHIANSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLFDELMQAKEHLREQYNELFPALCNNH

Query:  PDVFPEEFYSWEQFLWACELWYSNSLKIMFPDGNLRTCLVPIAGFLNHSLHPHILHYGKVDSDTNSLKFRLSRPCRAGEQCYLSYGNYSGSHLVAFYGFL
        PDVFPEE+YSWE+FLWACELWYSNS+KIMF DG+L +CLVPIAGFLNHSLHPHILHY K DSDTNSLKFRLSRPCRAGE+CYLSYGNYS SHLVAFYGFL
Subjt:  PDVFPEEFYSWEQFLWACELWYSNSLKIMFPDGNLRTCLVPIAGFLNHSLHPHILHYGKVDSDTNSLKFRLSRPCRAGEQCYLSYGNYSGSHLVAFYGFL

Query:  PEGDNLNDIIPLDIDFGDDDDSIITSDCSTHMVRGTWLSKNQSIFHYGLPSPLLECLRKARCPGLYTKCKLQGSLENEMEVLNDLLSIFDGMMENLEDEN
        PEGDN+ND+IPLDIDFGDD   IITSD STHMVRGTWLSKNQSIFHYGLPSPLLECL KARCP L TK KLQGSLENEMEVLNDLLSIFDGMMENLED N
Subjt:  PEGDNLNDIIPLDIDFGDDDDSIITSDCSTHMVRGTWLSKNQSIFHYGLPSPLLECLRKARCPGLYTKCKLQGSLENEMEVLNDLLSIFDGMMENLEDEN

Query:  EDRSSTEWDIKLALEYKDLQRRIVSSGLTSCFAGRQMVEFALCECMLEDTRG
        EDRSSTEWDIKLAL YKDLQRRIVSS L SC AG + VE AL ECM EDTRG
Subjt:  EDRSSTEWDIKLALEYKDLQRRIVSSGLTSCFAGRQMVEFALCECMLEDTRG

XP_022969685.1 uncharacterized protein LOC111468639 isoform X1 [Cucurbita maxima]2.9e-28387.68Show/hide
Query:  PSSSDKVRD-DDCLMFLALDKNDHLFYKKKKLLERQGFKSENCIYLKCSLCPEEVDTVLKELVQIARIIHLNEPEIYFGEDDACTPADSYSPRNEMEALN
        PS+SD+VRD +DC + LAL++NDHLF KKKKLLERQGFKSENCIYLKCSLCPEEVDTVLKELVQIARIIHLNEPE+YF EDDACTPADSYSPRNEMEALN
Subjt:  PSSSDKVRD-DDCLMFLALDKNDHLFYKKKKLLERQGFKSENCIYLKCSLCPEEVDTVLKELVQIARIIHLNEPEIYFGEDDACTPADSYSPRNEMEALN

Query:  SIISLVDISLSSCMPIQFNVLQKLRKAVIGMIHEFGDVYSMDAKTLGDTSCVKENCLLQWGERNGVRTSLQIAYVEGAGRGTIAKNDLDVGDTVLEIPLA
        +IISLVDI LSSC P+Q NVLQ+LRKA I MIH++G VYSMDAKTLGD +CVKENCLLQWGE NGVRT L+IAYVEGAGRGTIAK DL+VGDTVLEIPL 
Subjt:  SIISLVDISLSSCMPIQFNVLQKLRKAVIGMIHEFGDVYSMDAKTLGDTSCVKENCLLQWGERNGVRTSLQIAYVEGAGRGTIAKNDLDVGDTVLEIPLA

Query:  IIISEELVQKTNMYPILSKIEGMSSETMLLIWSMKEKHIANSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLFDELMQAKEHLREQYNELFPALCNNH
        I+ISEELVQKT MYPILSKIEGMSSETMLLIWSMKEKHI +SKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLF E+MQAKEHLREQYNELFPALCNNH
Subjt:  IIISEELVQKTNMYPILSKIEGMSSETMLLIWSMKEKHIANSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLFDELMQAKEHLREQYNELFPALCNNH

Query:  PDVFPEEFYSWEQFLWACELWYSNSLKIMFPDGNLRTCLVPIAGFLNHSLHPHILHYGKVDSDTNSLKFRLSRPCRAGEQCYLSYGNYSGSHLVAFYGFL
        PDVFPEE+YSWE+FLWACELWYSNS+KIMF DG+LRTCLVPIAGFLNHSLHPHILHY K +SDTNSLKFRLSRPCRAGE+CYLSYGNYS SHLVAFYGFL
Subjt:  PDVFPEEFYSWEQFLWACELWYSNSLKIMFPDGNLRTCLVPIAGFLNHSLHPHILHYGKVDSDTNSLKFRLSRPCRAGEQCYLSYGNYSGSHLVAFYGFL

Query:  PEGDNLNDIIPLDIDFGDDDDSIITSDCSTHMVRGTWLSKNQSIFHYGLPSPLLECLRKARCPGLYTKCKLQGSLENEMEVLNDLLSIFDGMMENLEDEN
        PEGDN+ND+IPLDIDFGDD  +  TSD STHMVRGTWLSKNQSIFHYGLPSPLLECLRKARCP L TK KLQGSLENEMEVLNDLLSIFDGMMENLED N
Subjt:  PEGDNLNDIIPLDIDFGDDDDSIITSDCSTHMVRGTWLSKNQSIFHYGLPSPLLECLRKARCPGLYTKCKLQGSLENEMEVLNDLLSIFDGMMENLEDEN

Query:  EDRSSTEWDIKLALEYKDLQRRIVSSGLTSCFAGRQMVEFALCECMLEDTRG
        EDRSSTEWDIKLAL YKDLQRRIVSS L SC AG +MVE AL ECM EDTRG
Subjt:  EDRSSTEWDIKLALEYKDLQRRIVSSGLTSCFAGRQMVEFALCECMLEDTRG

XP_023538303.1 uncharacterized protein LOC111799128 isoform X1 [Cucurbita pepo subsp. pepo]9.9e-28488.04Show/hide
Query:  PSSSDKVRD-DDCLMFLALDKNDHLFYKKKKLLERQGFKSENCIYLKCSLCPEEVDTVLKELVQIARIIHLNEPEIYFGEDDACTPADSYSPRNEMEALN
        PS+SD+VRD +DC + LAL++NDHLF KKKKLLE+QGFKSENCIYLKCSLCPEEVDTVLKELVQIARIIHLNEPE+YFGEDDACTPADSYSPRNEMEALN
Subjt:  PSSSDKVRD-DDCLMFLALDKNDHLFYKKKKLLERQGFKSENCIYLKCSLCPEEVDTVLKELVQIARIIHLNEPEIYFGEDDACTPADSYSPRNEMEALN

Query:  SIISLVDISLSSCMPIQFNVLQKLRKAVIGMIHEFGDVYSMDAKTLGDTSCVKENCLLQWGERNGVRTSLQIAYVEGAGRGTIAKNDLDVGDTVLEIPLA
        +IISLVDI LSSC P+Q NVLQ+LRKA I MIH++GDVYSMDAKTLGD SCVKEN LLQWGE NGVRTSL+IAYVEGAGRG IAK DL+VGDTVLEIPL 
Subjt:  SIISLVDISLSSCMPIQFNVLQKLRKAVIGMIHEFGDVYSMDAKTLGDTSCVKENCLLQWGERNGVRTSLQIAYVEGAGRGTIAKNDLDVGDTVLEIPLA

Query:  IIISEELVQKTNMYPILSKIEGMSSETMLLIWSMKEKHIANSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLFDELMQAKEHLREQYNELFPALCNNH
        I+ISEELVQKT MY ILSKIEGMSSETMLLIWSMKEKHIA+SKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLF E+MQAKEHLREQYNELFPALCNNH
Subjt:  IIISEELVQKTNMYPILSKIEGMSSETMLLIWSMKEKHIANSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLFDELMQAKEHLREQYNELFPALCNNH

Query:  PDVFPEEFYSWEQFLWACELWYSNSLKIMFPDGNLRTCLVPIAGFLNHSLHPHILHYGKVDSDTNSLKFRLSRPCRAGEQCYLSYGNYSGSHLVAFYGFL
        PDVFPEE+YSWE+FLWACELWYSNS+KIMF DG+LRTCLVPIAGFLNHSLHPHILHY K DSDTNSLKFRLSRPCRAGE+CYLSYGNYS SHLVAFYGFL
Subjt:  PDVFPEEFYSWEQFLWACELWYSNSLKIMFPDGNLRTCLVPIAGFLNHSLHPHILHYGKVDSDTNSLKFRLSRPCRAGEQCYLSYGNYSGSHLVAFYGFL

Query:  PEGDNLNDIIPLDIDFGDDDDSIITSDCSTHMVRGTWLSKNQSIFHYGLPSPLLECLRKARCPGLYTKCKLQGSLENEMEVLNDLLSIFDGMMENLEDEN
        PEGDN+ND+IPLDIDFGDD + IITSD STHMVRGTWLSKNQSIFHYGLPSPLLECLRKARCP L TK KLQGSLE EMEVLNDLLSIFDGMMENLED N
Subjt:  PEGDNLNDIIPLDIDFGDDDDSIITSDCSTHMVRGTWLSKNQSIFHYGLPSPLLECLRKARCPGLYTKCKLQGSLENEMEVLNDLLSIFDGMMENLEDEN

Query:  EDRSSTEWDIKLALEYKDLQRRIVSSGLTSCFAGRQMVEFALCECMLEDTRG
        EDRSSTEWDIKLAL YKDLQRRIVSS L SC AG + VE AL ECM EDTRG
Subjt:  EDRSSTEWDIKLALEYKDLQRRIVSSGLTSCFAGRQMVEFALCECMLEDTRG

TrEMBL top hitse value%identityAlignment
A0A6J1DHB8 uncharacterized protein LOC111020893 isoform X22.7e-28788.79Show/hide
Query:  QPSSSDKVRDD-DCLMFLALDKNDHLFYKKKKLLERQGFKSENCIYLKCSLCPEEVDTVLKELVQIARIIHLNEPEIYFGEDDACTPADSYSPRNEMEAL
        QPSSSD+VRDD DCL+ LALDKNDHLF KKKKLLERQGFKSENCIYLKCSLCP+EVDTVLKELVQIARIIHLNEPEIYFGEDDACTPADSYSPRNEMEAL
Subjt:  QPSSSDKVRDD-DCLMFLALDKNDHLFYKKKKLLERQGFKSENCIYLKCSLCPEEVDTVLKELVQIARIIHLNEPEIYFGEDDACTPADSYSPRNEMEAL

Query:  NSIISLVDISLSSCMPIQFNVLQKLRKAVIGMIHEFGDVYSMDAKTLGDTSCVKENCLLQWGERNGVRTSLQIAYVEGAGRGTIAKNDLDVGDTVLEIPL
        N+IISLVDISLSSC P+Q NVLQ+LRKAVI MIHE+GDVYSMDAKTLGD  CVKENCLL WGE NGVRTSLQIAYVEG GRGTIAK DLDVGDTVLEIP+
Subjt:  NSIISLVDISLSSCMPIQFNVLQKLRKAVIGMIHEFGDVYSMDAKTLGDTSCVKENCLLQWGERNGVRTSLQIAYVEGAGRGTIAKNDLDVGDTVLEIPL

Query:  AIIISEELVQKTNMYPILSKIEGMSSETMLLIWSMKEKHIANSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLFDELMQAKEHLREQYNELFPALCNN
        AIIISEELV+K+NMYPIL+KIE +SSETMLL+WSMKEKHIA+SKFKVYFDTLPEAFNTGLSFGVGAM+TLDGTLLFDELMQAKE LREQY+EL PALCNN
Subjt:  AIIISEELVQKTNMYPILSKIEGMSSETMLLIWSMKEKHIANSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLFDELMQAKEHLREQYNELFPALCNN

Query:  HPDVFPEEFYSWEQFLWACELWYSNSLKIMFPDGNLRTCLVPIAGFLNHSLHPHILHYGKVDSDTNSLKFRLSRPCRAGEQCYLSYGNYSGSHLVAFYGF
        HPDVFPEEFYSWEQFLWACELWY+NSLKI F DGNLRTCLVPIAGFLNHSLHPHILHYGKVDS+TNSLKFRLSRPCR  EQCYLSYGNY+GSHLV FYGF
Subjt:  HPDVFPEEFYSWEQFLWACELWYSNSLKIMFPDGNLRTCLVPIAGFLNHSLHPHILHYGKVDSDTNSLKFRLSRPCRAGEQCYLSYGNYSGSHLVAFYGF

Query:  LPEGDNLNDIIPLDIDFGDDDDSIITSDCSTHMVRGTWLSKNQSIFHYGLPSPLLECLRKARCPGLYTKCKLQGSLENEMEVLNDLLSIFDGMMENLEDE
        LPEGDNLND+IPLDIDFGDDD   ITSD +THMVRGTWLSKNQSIFHYG+PSPLLECLRKARC GL+TK K   SLE EMEVLNDL+SIFDGMMENL+DE
Subjt:  LPEGDNLNDIIPLDIDFGDDDDSIITSDCSTHMVRGTWLSKNQSIFHYGLPSPLLECLRKARCPGLYTKCKLQGSLENEMEVLNDLLSIFDGMMENLEDE

Query:  NEDRSSTEWDIKLALEYKDLQRRIVSSGLTSCFAGRQMVEFALCECMLEDTRG
        NEDRSS EWDIKLAL+YKDLQRRI+SSG TSC+AGR+MVE ALCECMLEDTRG
Subjt:  NEDRSSTEWDIKLALEYKDLQRRIVSSGLTSCFAGRQMVEFALCECMLEDTRG

A0A6J1DIR4 uncharacterized protein LOC111020893 isoform X17.6e-29089.15Show/hide
Query:  QPSSSDKVRDD-DCLMFLALDKNDHLFYKKKKLLERQGFKSENCIYLKCSLCPEEVDTVLKELVQIARIIHLNEPEIYFGEDDACTPADSYSPRNEMEAL
        QPSSSD+VRDD DCL+ LALDKNDHLF KKKKLLERQGFKSENCIYLKCSLCP+EVDTVLKELVQIARIIHLNEPEIYFGEDDACTPADSYSPRNEMEAL
Subjt:  QPSSSDKVRDD-DCLMFLALDKNDHLFYKKKKLLERQGFKSENCIYLKCSLCPEEVDTVLKELVQIARIIHLNEPEIYFGEDDACTPADSYSPRNEMEAL

Query:  NSIISLVDISLSSCMPIQFNVLQKLRKAVIGMIHEFGDVYSMDAKTLGDTSCVKENCLLQWGERNGVRTSLQIAYVEGAGRGTIAKNDLDVGDTVLEIPL
        N+IISLVDISLSSC P+Q NVLQ+LRKAVI MIHE+GDVYSMDAKTLGD  CVKENCLL WGE NGVRTSLQIAYVEG GRGTIAK DLDVGDTVLEIP+
Subjt:  NSIISLVDISLSSCMPIQFNVLQKLRKAVIGMIHEFGDVYSMDAKTLGDTSCVKENCLLQWGERNGVRTSLQIAYVEGAGRGTIAKNDLDVGDTVLEIPL

Query:  AIIISEELVQKTNMYPILSKIEGMSSETMLLIWSMKEKHIANSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLFDELMQAKEHLREQYNELFPALCNN
        AIIISEELV+K+NMYPIL+KIE +SSETMLL+WSMKEKHIA+SKFKVYFDTLPEAFNTGLSFGVGAM+TLDGTLLFDELMQAKE LREQY+EL PALCNN
Subjt:  AIIISEELVQKTNMYPILSKIEGMSSETMLLIWSMKEKHIANSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLFDELMQAKEHLREQYNELFPALCNN

Query:  HPDVFPEEFYSWEQFLWACELWYSNSLKIMFPDGNLRTCLVPIAGFLNHSLHPHILHYGKVDSDTNSLKFRLSRPCRAGEQCYLSYGNYSGSHLVAFYGF
        HPDVFPEEFYSWEQFLWACELWY+NSLKI F DGNLRTCLVPIAGFLNHSLHPHILHYGKVDS+TNSLKFRLSRPCR  EQCYLSYGNY+GSHLV FYGF
Subjt:  HPDVFPEEFYSWEQFLWACELWYSNSLKIMFPDGNLRTCLVPIAGFLNHSLHPHILHYGKVDSDTNSLKFRLSRPCRAGEQCYLSYGNYSGSHLVAFYGF

Query:  LPEGDNLNDIIPLDIDFGDDDDSIITSDCSTHMVRGTWLSKNQSIFHYGLPSPLLECLRKARCPGLYTKCKLQGSLENEMEVLNDLLSIFDGMMENLEDE
        LPEGDNLND+IPLDIDFGDDD   ITSD +THMVRGTWLSKNQSIFHYG+PSPLLECLRKARC GL+TK KLQ SLE EMEVLNDL+SIFDGMMENL+DE
Subjt:  LPEGDNLNDIIPLDIDFGDDDDSIITSDCSTHMVRGTWLSKNQSIFHYGLPSPLLECLRKARCPGLYTKCKLQGSLENEMEVLNDLLSIFDGMMENLEDE

Query:  NEDRSSTEWDIKLALEYKDLQRRIVSSGLTSCFAGRQMVEFALCECMLEDTRG
        NEDRSS EWDIKLAL+YKDLQRRI+SSG TSC+AGR+MVE ALCECMLEDTRG
Subjt:  NEDRSSTEWDIKLALEYKDLQRRIVSSGLTSCFAGRQMVEFALCECMLEDTRG

A0A6J1FC73 N-lysine methyltransferase setd6 isoform X21.7e-28187.5Show/hide
Query:  PSSSDKVRD-DDCLMFLALDKNDHLFYKKKKLLERQGFKSENCIYLKCSLCPEEVDTVLKELVQIARIIHLNEPEIYFGEDDACTPADSYSPRNEMEALN
        PS+SD+VRD +DC + LAL++NDHLF KKKKLLERQGFKSENCIYLKCSLC EEVDTVLKELVQIARIIHLNEPE+YFGEDDACTPADSYSPRNEMEALN
Subjt:  PSSSDKVRD-DDCLMFLALDKNDHLFYKKKKLLERQGFKSENCIYLKCSLCPEEVDTVLKELVQIARIIHLNEPEIYFGEDDACTPADSYSPRNEMEALN

Query:  SIISLVDISLSSCMPIQFNVLQKLRKAVIGMIHEFGDVYSMDAKTLGDTSCVKENCLLQWGERNGVRTSLQIAYVEGAGRGTIAKNDLDVGDTVLEIPLA
        +IISLVDI LSSC P+Q NVLQ+LRKA I MIH++GDVYSMDAKTLGD SCVKENCLLQWGE NGVRTSL+IAYVEGAGRG IAK DL+VGDTVLEIPL 
Subjt:  SIISLVDISLSSCMPIQFNVLQKLRKAVIGMIHEFGDVYSMDAKTLGDTSCVKENCLLQWGERNGVRTSLQIAYVEGAGRGTIAKNDLDVGDTVLEIPLA

Query:  IIISEELVQKTNMYPILSKIEGMSSETMLLIWSMKEKHIANSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLFDELMQAKEHLREQYNELFPALCNNH
        I+ISEELVQKT MYPILSKIEGMSSETMLLIWSMKEKHIA+SKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLF E+MQAKEHLREQYNELFP LCNNH
Subjt:  IIISEELVQKTNMYPILSKIEGMSSETMLLIWSMKEKHIANSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLFDELMQAKEHLREQYNELFPALCNNH

Query:  PDVFPEEFYSWEQFLWACELWYSNSLKIMFPDGNLRTCLVPIAGFLNHSLHPHILHYGKVDSDTNSLKFRLSRPCRAGEQCYLSYGNYSGSHLVAFYGFL
        PDVFPEE+YSWE+FLWACELWYSNS+KIMF DG+L +CLVPIAGFLNHSLHPHILHY K DSDTNSLKFRLSRPCRAGE+CYLSYGNYS SHLVAFYGFL
Subjt:  PDVFPEEFYSWEQFLWACELWYSNSLKIMFPDGNLRTCLVPIAGFLNHSLHPHILHYGKVDSDTNSLKFRLSRPCRAGEQCYLSYGNYSGSHLVAFYGFL

Query:  PEGDNLNDIIPLDIDFGDDDDSIITSDCSTHMVRGTWLSKNQSIFHYGLPSPLLECLRKARCPGLYTKCKLQGSLENEMEVLNDLLSIFDGMMENLEDEN
        PEGDN+ND+IPLDIDFGDD   IITSD STHMVRGTWLSKNQSIFHYGLPSPLLECL KARCP L T  KL+GSLENEMEVLNDLLSIFDGMMENLED N
Subjt:  PEGDNLNDIIPLDIDFGDDDDSIITSDCSTHMVRGTWLSKNQSIFHYGLPSPLLECLRKARCPGLYTKCKLQGSLENEMEVLNDLLSIFDGMMENLEDEN

Query:  EDRSSTEWDIKLALEYKDLQRRIVSSGLTSCFAGRQMVEFALCECMLEDTRG
        EDRSSTEWDIKLAL YKDLQRRIVSS L SC AG + VE AL ECM EDTRG
Subjt:  EDRSSTEWDIKLALEYKDLQRRIVSSGLTSCFAGRQMVEFALCECMLEDTRG

A0A6J1FHS4 uncharacterized protein LOC111444075 isoform X16.2e-28487.86Show/hide
Query:  PSSSDKVRD-DDCLMFLALDKNDHLFYKKKKLLERQGFKSENCIYLKCSLCPEEVDTVLKELVQIARIIHLNEPEIYFGEDDACTPADSYSPRNEMEALN
        PS+SD+VRD +DC + LAL++NDHLF KKKKLLERQGFKSENCIYLKCSLC EEVDTVLKELVQIARIIHLNEPE+YFGEDDACTPADSYSPRNEMEALN
Subjt:  PSSSDKVRD-DDCLMFLALDKNDHLFYKKKKLLERQGFKSENCIYLKCSLCPEEVDTVLKELVQIARIIHLNEPEIYFGEDDACTPADSYSPRNEMEALN

Query:  SIISLVDISLSSCMPIQFNVLQKLRKAVIGMIHEFGDVYSMDAKTLGDTSCVKENCLLQWGERNGVRTSLQIAYVEGAGRGTIAKNDLDVGDTVLEIPLA
        +IISLVDI LSSC P+Q NVLQ+LRKA I MIH++GDVYSMDAKTLGD SCVKENCLLQWGE NGVRTSL+IAYVEGAGRG IAK DL+VGDTVLEIPL 
Subjt:  SIISLVDISLSSCMPIQFNVLQKLRKAVIGMIHEFGDVYSMDAKTLGDTSCVKENCLLQWGERNGVRTSLQIAYVEGAGRGTIAKNDLDVGDTVLEIPLA

Query:  IIISEELVQKTNMYPILSKIEGMSSETMLLIWSMKEKHIANSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLFDELMQAKEHLREQYNELFPALCNNH
        I+ISEELVQKT MYPILSKIEGMSSETMLLIWSMKEKHIA+SKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLF E+MQAKEHLREQYNELFP LCNNH
Subjt:  IIISEELVQKTNMYPILSKIEGMSSETMLLIWSMKEKHIANSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLFDELMQAKEHLREQYNELFPALCNNH

Query:  PDVFPEEFYSWEQFLWACELWYSNSLKIMFPDGNLRTCLVPIAGFLNHSLHPHILHYGKVDSDTNSLKFRLSRPCRAGEQCYLSYGNYSGSHLVAFYGFL
        PDVFPEE+YSWE+FLWACELWYSNS+KIMF DG+L +CLVPIAGFLNHSLHPHILHY K DSDTNSLKFRLSRPCRAGE+CYLSYGNYS SHLVAFYGFL
Subjt:  PDVFPEEFYSWEQFLWACELWYSNSLKIMFPDGNLRTCLVPIAGFLNHSLHPHILHYGKVDSDTNSLKFRLSRPCRAGEQCYLSYGNYSGSHLVAFYGFL

Query:  PEGDNLNDIIPLDIDFGDDDDSIITSDCSTHMVRGTWLSKNQSIFHYGLPSPLLECLRKARCPGLYTKCKLQGSLENEMEVLNDLLSIFDGMMENLEDEN
        PEGDN+ND+IPLDIDFGDD   IITSD STHMVRGTWLSKNQSIFHYGLPSPLLECL KARCP L TK KLQGSLENEMEVLNDLLSIFDGMMENLED N
Subjt:  PEGDNLNDIIPLDIDFGDDDDSIITSDCSTHMVRGTWLSKNQSIFHYGLPSPLLECLRKARCPGLYTKCKLQGSLENEMEVLNDLLSIFDGMMENLEDEN

Query:  EDRSSTEWDIKLALEYKDLQRRIVSSGLTSCFAGRQMVEFALCECMLEDTRG
        EDRSSTEWDIKLAL YKDLQRRIVSS L SC AG + VE AL ECM EDTRG
Subjt:  EDRSSTEWDIKLALEYKDLQRRIVSSGLTSCFAGRQMVEFALCECMLEDTRG

A0A6J1I0M2 uncharacterized protein LOC111468639 isoform X11.4e-28387.68Show/hide
Query:  PSSSDKVRD-DDCLMFLALDKNDHLFYKKKKLLERQGFKSENCIYLKCSLCPEEVDTVLKELVQIARIIHLNEPEIYFGEDDACTPADSYSPRNEMEALN
        PS+SD+VRD +DC + LAL++NDHLF KKKKLLERQGFKSENCIYLKCSLCPEEVDTVLKELVQIARIIHLNEPE+YF EDDACTPADSYSPRNEMEALN
Subjt:  PSSSDKVRD-DDCLMFLALDKNDHLFYKKKKLLERQGFKSENCIYLKCSLCPEEVDTVLKELVQIARIIHLNEPEIYFGEDDACTPADSYSPRNEMEALN

Query:  SIISLVDISLSSCMPIQFNVLQKLRKAVIGMIHEFGDVYSMDAKTLGDTSCVKENCLLQWGERNGVRTSLQIAYVEGAGRGTIAKNDLDVGDTVLEIPLA
        +IISLVDI LSSC P+Q NVLQ+LRKA I MIH++G VYSMDAKTLGD +CVKENCLLQWGE NGVRT L+IAYVEGAGRGTIAK DL+VGDTVLEIPL 
Subjt:  SIISLVDISLSSCMPIQFNVLQKLRKAVIGMIHEFGDVYSMDAKTLGDTSCVKENCLLQWGERNGVRTSLQIAYVEGAGRGTIAKNDLDVGDTVLEIPLA

Query:  IIISEELVQKTNMYPILSKIEGMSSETMLLIWSMKEKHIANSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLFDELMQAKEHLREQYNELFPALCNNH
        I+ISEELVQKT MYPILSKIEGMSSETMLLIWSMKEKHI +SKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLF E+MQAKEHLREQYNELFPALCNNH
Subjt:  IIISEELVQKTNMYPILSKIEGMSSETMLLIWSMKEKHIANSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLFDELMQAKEHLREQYNELFPALCNNH

Query:  PDVFPEEFYSWEQFLWACELWYSNSLKIMFPDGNLRTCLVPIAGFLNHSLHPHILHYGKVDSDTNSLKFRLSRPCRAGEQCYLSYGNYSGSHLVAFYGFL
        PDVFPEE+YSWE+FLWACELWYSNS+KIMF DG+LRTCLVPIAGFLNHSLHPHILHY K +SDTNSLKFRLSRPCRAGE+CYLSYGNYS SHLVAFYGFL
Subjt:  PDVFPEEFYSWEQFLWACELWYSNSLKIMFPDGNLRTCLVPIAGFLNHSLHPHILHYGKVDSDTNSLKFRLSRPCRAGEQCYLSYGNYSGSHLVAFYGFL

Query:  PEGDNLNDIIPLDIDFGDDDDSIITSDCSTHMVRGTWLSKNQSIFHYGLPSPLLECLRKARCPGLYTKCKLQGSLENEMEVLNDLLSIFDGMMENLEDEN
        PEGDN+ND+IPLDIDFGDD  +  TSD STHMVRGTWLSKNQSIFHYGLPSPLLECLRKARCP L TK KLQGSLENEMEVLNDLLSIFDGMMENLED N
Subjt:  PEGDNLNDIIPLDIDFGDDDDSIITSDCSTHMVRGTWLSKNQSIFHYGLPSPLLECLRKARCPGLYTKCKLQGSLENEMEVLNDLLSIFDGMMENLEDEN

Query:  EDRSSTEWDIKLALEYKDLQRRIVSSGLTSCFAGRQMVEFALCECMLEDTRG
        EDRSSTEWDIKLAL YKDLQRRIVSS L SC AG +MVE AL ECM EDTRG
Subjt:  EDRSSTEWDIKLALEYKDLQRRIVSSGLTSCFAGRQMVEFALCECMLEDTRG

SwissProt top hitse value%identityAlignment
P58467 SET domain-containing protein 41.9e-1123.44Show/hide
Query:  ERNGVRTSLQIAYVEGAGRGTIAKNDLDVGDTVLEIPLAIIISEELVQKTNMYPILSKIEGMSSETM-LLIWSMKEKHI-ANSKFKVYFDTLPEAFNTGL
        ER    T L  A   G GRG ++K  L  G  ++ +P + +++ + V ++++ P + K +   S  + L  + + EKH    S +K Y D LP+++   +
Subjt:  ERNGVRTSLQIAYVEGAGRGTIAKNDLDVGDTVLEIPLAIIISEELVQKTNMYPILSKIEGMSSETM-LLIWSMKEKHI-ANSKFKVYFDTLPEAFNTGL

Query:  SFGVGAMMTLDGTLLFDELMQAKEHLREQYNELFPALCNNHPDVFP------EEFYSWEQFLWACELWYSNSLKIMFPDGNLRTC---------LVPIAG
              +      LL   L    E  R +  +LF +       + P      +  +S+  FLWA   W + + + ++     + C         L P   
Subjt:  SFGVGAMMTLDGTLLFDELMQAKEHLREQYNELFPALCNNHPDVFP------EEFYSWEQFLWACELWYSNSLKIMFPDGNLRTC---------LVPIAG

Query:  FLNHSLHPHILHYGKVDSDTNSLKFRLSRPCRAGEQCYLSYGNYSGSHLVAFYGFL
         LNHS  PH+      +  T   + R +  CR  ++ ++ YG +    L+  YGF+
Subjt:  FLNHSLHPHILHYGKVDSDTNSLKFRLSRPCRAGEQCYLSYGNYSGSHLVAFYGFL

P94026 Ribulose-1,5 bisphosphate carboxylase/oxygenase large subunit N-methyltransferase, chloroplastic1.5e-1324.6Show/hide
Query:  GRGTIAKNDLDVGDTVLEIPLAIIISEELVQKTNMYPILSKIEGMSSETMLLIWSMKEKHIANSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLFDEL
        G G +AK D+  G+TVL++P    I+ + V ++ +  + S ++   S  + L   ++EK   +SK+K Y D LP++ ++ + +    +  + GT L    
Subjt:  GRGTIAKNDLDVGDTVLEIPLAIIISEELVQKTNMYPILSKIEGMSSETMLLIWSMKEKHIANSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLFDEL

Query:  MQAKEHLREQYNELFPALCNNHPDVFPEEFYSWEQFLWACELWYSNSLKIMFPDGNLRTCLVPIAGFLNHSL------HPHILHYGKVDSDTNSLKFRLS
        M  K++++ ++ ++   +   +  +FP    + + F WA  +  S +   +    N    LVP A   NH+       H H +  G     +  L F L 
Subjt:  MQAKEHLREQYNELFPALCNNHPDVFPEEFYSWEQFLWACELWYSNSLKIMFPDGNLRTCLVPIAGFLNHSL------HPHILHYGKVDSDTNSLKFRLS

Query:  RP--CRAGEQCYLSYG-NYSGSHLVAFYGFLPEGDNLNDIIPLDIDFGDDDD
         P   +AG+Q ++ Y  N S + +   YGF+ E  +  D   L ++  + D+
Subjt:  RP--CRAGEQCYLSYG-NYSGSHLVAFYGFLPEGDNLNDIIPLDIDFGDDDD

Arabidopsis top hitse value%identityAlignment
AT2G18850.1 SET domain-containing protein9.6e-16054.21Show/hide
Query:  LDKNDHLFYKKKKLLERQGFKSENCIYLKCSLCPEEVDTVLKELVQIARIIHLNEPEIYFGEDDACTPADSYSPRNEMEALNSIISLVDISLSSCMPIQF
        L K+D  ++ KKK L  +G   +  + L  SL  + ++  L++L+   RI++L++ E+YFGE DACTPA  YS RNE+ AL+ I+SL+ +S    M  Q 
Subjt:  LDKNDHLFYKKKKLLERQGFKSENCIYLKCSLCPEEVDTVLKELVQIARIIHLNEPEIYFGEDDACTPADSYSPRNEMEALNSIISLVDISLSSCMPIQF

Query:  NVLQKLRKAVIGMIHEFGDVYSMDAKTLGDTSCVKENCLLQWGERNGVRTSLQIAYVEGAGRGTIAKNDLDVGDTVLEIPLAIIISEELVQKTNMYPILS
        +  + LR A+ G I+E        A+ +    C KE+ L++WG+ NGV+T LQIA ++G GRG IA  DL  GD  LEIP++ IISEE V  ++MYPIL 
Subjt:  NVLQKLRKAVIGMIHEFGDVYSMDAKTLGDTSCVKENCLLQWGERNGVRTSLQIAYVEGAGRGTIAKNDLDVGDTVLEIPLAIIISEELVQKTNMYPILS

Query:  KIEGMSSETMLLIWSMKEKHIANSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLFDELMQAKEHLREQYNELFPALCNNHPDVFPEEFYSWEQFLWAC
          +G++SETMLL+W+M+EKH  +SKFK YFD+L E F TGLSFGV A+M LDGTLL DE+MQAKE LRE+Y+EL P L +NH +VFP E Y+WE +LWAC
Subjt:  KIEGMSSETMLLIWSMKEKHIANSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLFDELMQAKEHLREQYNELFPALCNNHPDVFPEEFYSWEQFLWAC

Query:  ELWYSNSLKIMFPDGNLRTCLVPIAGFLNHSLHPHILHYGKVDSDTNSLKFRLSRPCRAGEQCYLSYGNYSGSHLVAFYGFLPEGDNLNDIIPLDIDFGD
        EL+YSNS++I FPDG L+TCL+P+AGFLNHS++PHI+ YGKVD +T+SLKF +SRPC  GEQC+LSYGNYS SHL+ FYGFLP+GDN  D+IPLD D  D
Subjt:  ELWYSNSLKIMFPDGNLRTCLVPIAGFLNHSLHPHILHYGKVDSDTNSLKFRLSRPCRAGEQCYLSYGNYSGSHLVAFYGFLPEGDNLNDIIPLDIDFGD

Query:  DDDSIITSDCSTHMVRGTWLSKNQSIFHYGLPSPLLECLRKARCPGLYTKCKLQGSLENEMEVLNDLLSIFDGMMENLED-ENEDRSSTEWDIKLALEYK
        D+D       +THM+RGTWLS N +IFHYGLP+PLL  LRKA     +++  L  +LE E+ VL +L S FD MM+NL D ++ DR + +WD+KLA+E+K
Subjt:  DDDSIITSDCSTHMVRGTWLSKNQSIFHYGLPSPLLECLRKARCPGLYTKCKLQGSLENEMEVLNDLLSIFDGMMENLED-ENEDRSSTEWDIKLALEYK

Query:  DLQRRIVSSGLTSCFAGRQMVE
        + QR+IVSS L SC AG ++V+
Subjt:  DLQRRIVSSGLTSCFAGRQMVE

AT2G18850.2 SET domain-containing protein6.2e-15954.6Show/hide
Query:  LDKNDHLFYKKKKLLERQGFKSENCIYLKCSLCPEEVDTVLKELVQIARIIHLNEPEIYFGEDDACTPADSYSPRNEMEALNSIISLVDISLSSCMPIQF
        L K+D  ++ KKK L  +G   +  + L  SL  + ++  L++L+   RI++L++ E+YFGE DACTPA  YS RNE+ AL+ I+SL+ +S    M  Q 
Subjt:  LDKNDHLFYKKKKLLERQGFKSENCIYLKCSLCPEEVDTVLKELVQIARIIHLNEPEIYFGEDDACTPADSYSPRNEMEALNSIISLVDISLSSCMPIQF

Query:  NVLQKLRKAVIGMIHEFGDVYSMDAKTLGDTSCVKENCLLQWGERNGVRTSLQIAYVEGAGRGTIAKNDLDVGDTVLEIPLAIIISEELVQKTNMYPILS
        +  + LR A+ G I+E        A+ +    C KE+ L++WG+ NGV+T LQIA ++G GRG IA  DL  GD  LEIP++ IISEE V  ++MYPIL 
Subjt:  NVLQKLRKAVIGMIHEFGDVYSMDAKTLGDTSCVKENCLLQWGERNGVRTSLQIAYVEGAGRGTIAKNDLDVGDTVLEIPLAIIISEELVQKTNMYPILS

Query:  KIEGMSSETMLLIWSMKEKHIANSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLFDELMQAKEHLREQYNELFPALCNNHPDVFPEEFYSWEQFLWAC
          +G++SETMLL+W+M+EKH  +SKFK YFD+L E F TGLSFGV A+M LDGTLL DE+MQAKE LRE+Y+EL P L +NH +VFP E Y+WE +LWAC
Subjt:  KIEGMSSETMLLIWSMKEKHIANSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLFDELMQAKEHLREQYNELFPALCNNHPDVFPEEFYSWEQFLWAC

Query:  ELWYSNSLKIMFPDGNLRTCLVPIAGFLNHSLHPHILHYGKVDSDTNSLKFRLSRPCRAGEQCYLSYGNYSGSHLVAFYGFLPEGDNLNDIIPLDIDFGD
        EL+YSNS++I FPDG L+TCL+P+AGFLNHS++PHI+ YGKVD +T+SLKF +SRPC  GEQC+LSYGNYS SHL+ FYGFLP+GDN  D+IPLD D  D
Subjt:  ELWYSNSLKIMFPDGNLRTCLVPIAGFLNHSLHPHILHYGKVDSDTNSLKFRLSRPCRAGEQCYLSYGNYSGSHLVAFYGFLPEGDNLNDIIPLDIDFGD

Query:  DDDSIITSDCSTHMVRGTWLSKNQSIFHYGLPSPLLECLRKARCPGLYTKCKLQGSLENEMEVLNDLLSIFDGMMENLED-ENEDRSSTEWDIKLALEYK
        D+D       +THM+RGTWLS N +IFHYGLP+PLL  LRKA   GL  K     +LE E+ VL +L S FD MM+NL D ++ DR + +WD+KLA+E+K
Subjt:  DDDSIITSDCSTHMVRGTWLSKNQSIFHYGLPSPLLECLRKARCPGLYTKCKLQGSLENEMEVLNDLLSIFDGMMENLED-ENEDRSSTEWDIKLALEYK

Query:  DLQRRIVSSGLTSCFAGRQMVE
        + QR+IVSS L SC AG ++V+
Subjt:  DLQRRIVSSGLTSCFAGRQMVE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCGACGCCGTCCATCAGCCCCTCCATCTCCCACGCGATCGAACTCGACGCCAGAAGGAAGAGACCCGTCAGCACCGACCCGCTCAGCCTCCGATCCGCCATCAGAC
CAGCCGCGGCTCGGATCGCGATGGAGTAGGCCATCAAGAGAGCGTAGATACAAATCGTGGTGACGACAGGTCGGGTCCAGGAATTCTTGAGGGCGGCGAGGGCGGATTTG
AGGGTGGGACGCTTGCCGTTGAAGGAGTGGAGGGTGGAGGAAACGGTGAAGACGGCGGCGAGGAGGGAGAAAGCGTAAATGGGGAGGAAGAAGGCGGCACGGAGGCGGAG
GAGGGAGAAGGCGTCCTCGCGAGACTCGGAGAAGACATGGCGGTACTCGAAACGGGTAGGGGCATGGCGGACGACGGACTCGAGGTGGAAGATGTGGGCCTTGAGAGGGT
GGGAGGAGAGAGAGAGGGTGAAGAGGAGGAGAGAGAGAGGGAGGGCGAGGAGGATGAAGATGGAGAGGAAAATCCGTTTGTTTCTCGAGAAAATTTGAAGGGAATGGAGG
AGGACTTTGAAGGGGAAGAAGCAGAGAGATTCATTTCTAAACTGTTTGTTGCATTTGCTTGTGCGTTGCAGCCTTCTTCATCTGATAAAGTCAGAGATGACGACTGCTTA
ATGTTTCTAGCATTGGACAAAAATGACCATCTTTTCTACAAAAAGAAGAAATTATTAGAAAGGCAGGGTTTCAAGTCTGAGAATTGCATCTACTTGAAGTGCTCTCTGTG
TCCTGAGGAAGTAGATACTGTTCTGAAAGAATTGGTACAAATTGCAAGAATTATTCACTTAAATGAGCCTGAAATTTATTTTGGAGAAGATGATGCATGTACACCAGCAG
ATTCCTACAGCCCCAGGAATGAAATGGAGGCCCTCAATTCAATAATTTCTCTTGTTGACATCTCTCTCTCTAGTTGCATGCCTATCCAATTTAATGTCCTGCAAAAACTA
CGGAAGGCAGTTATTGGTATGATCCATGAGTTTGGAGATGTATACAGTATGGACGCTAAAACTTTGGGAGACACCAGCTGTGTGAAAGAAAATTGTTTGTTACAGTGGGG
TGAGAGAAATGGTGTTAGAACAAGCTTGCAGATAGCTTATGTTGAAGGTGCTGGTAGAGGAACCATAGCCAAAAATGATCTGGACGTTGGTGACACTGTATTGGAGATCC
CGCTGGCTATTATTATTTCTGAGGAACTTGTGCAGAAAACCAACATGTATCCCATATTATCAAAGATTGAAGGCATGTCATCTGAGACAATGTTGTTGATATGGAGCATG
AAGGAGAAGCACATTGCTAATTCCAAATTCAAGGTTTACTTTGACACACTACCAGAAGCCTTTAATACTGGGTTAAGTTTTGGAGTTGGCGCAATGATGACTTTGGACGG
AACCCTACTTTTCGATGAGCTAATGCAAGCAAAAGAGCACTTGCGGGAACAATACAATGAGTTGTTTCCAGCCTTATGTAACAATCATCCTGATGTTTTCCCAGAAGAGT
TCTACTCATGGGAGCAGTTCTTATGGGCTTGTGAACTTTGGTACTCAAATAGCTTGAAAATCATGTTTCCTGATGGAAATCTTAGGACCTGCTTGGTTCCAATTGCAGGT
TTTCTCAACCATTCGTTGCACCCGCACATACTACACTATGGCAAAGTTGATTCAGATACAAATTCCTTGAAATTCCGTCTATCAAGACCCTGCCGTGCAGGGGAACAATG
TTATCTTAGTTATGGGAACTACTCTGGTTCTCATCTAGTTGCCTTCTATGGCTTTTTACCTGAAGGAGACAACCTAAATGATATCATTCCATTAGACATTGACTTTGGTG
ATGATGATGATAGTATCATCACATCTGACTGCAGTACTCATATGGTGAGGGGAACGTGGTTGTCAAAGAACCAAAGCATATTCCATTATGGTCTGCCCTCACCGTTATTA
GAGTGTCTACGAAAAGCTCGGTGCCCTGGATTATACACTAAGTGTAAGCTGCAAGGAAGCTTGGAAAATGAAATGGAAGTCCTCAATGATCTCCTGTCAATCTTTGATGG
AATGATGGAAAATCTTGAGGATGAAAACGAAGACAGGAGCAGTACAGAATGGGATATAAAGCTAGCACTGGAATACAAAGATCTACAAAGGAGGATAGTTTCCTCAGGTC
TGACTTCATGTTTTGCTGGTCGCCAGATGGTGGAATTTGCGTTATGCGAATGCATGCTGGAGGATACTCGAGGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGATCGACGCCGTCCATCAGCCCCTCCATCTCCCACGCGATCGAACTCGACGCCAGAAGGAAGAGACCCGTCAGCACCGACCCGCTCAGCCTCCGATCCGCCATCAGAC
CAGCCGCGGCTCGGATCGCGATGGAGTAGGCCATCAAGAGAGCGTAGATACAAATCGTGGTGACGACAGGTCGGGTCCAGGAATTCTTGAGGGCGGCGAGGGCGGATTTG
AGGGTGGGACGCTTGCCGTTGAAGGAGTGGAGGGTGGAGGAAACGGTGAAGACGGCGGCGAGGAGGGAGAAAGCGTAAATGGGGAGGAAGAAGGCGGCACGGAGGCGGAG
GAGGGAGAAGGCGTCCTCGCGAGACTCGGAGAAGACATGGCGGTACTCGAAACGGGTAGGGGCATGGCGGACGACGGACTCGAGGTGGAAGATGTGGGCCTTGAGAGGGT
GGGAGGAGAGAGAGAGGGTGAAGAGGAGGAGAGAGAGAGGGAGGGCGAGGAGGATGAAGATGGAGAGGAAAATCCGTTTGTTTCTCGAGAAAATTTGAAGGGAATGGAGG
AGGACTTTGAAGGGGAAGAAGCAGAGAGATTCATTTCTAAACTGTTTGTTGCATTTGCTTGTGCGTTGCAGCCTTCTTCATCTGATAAAGTCAGAGATGACGACTGCTTA
ATGTTTCTAGCATTGGACAAAAATGACCATCTTTTCTACAAAAAGAAGAAATTATTAGAAAGGCAGGGTTTCAAGTCTGAGAATTGCATCTACTTGAAGTGCTCTCTGTG
TCCTGAGGAAGTAGATACTGTTCTGAAAGAATTGGTACAAATTGCAAGAATTATTCACTTAAATGAGCCTGAAATTTATTTTGGAGAAGATGATGCATGTACACCAGCAG
ATTCCTACAGCCCCAGGAATGAAATGGAGGCCCTCAATTCAATAATTTCTCTTGTTGACATCTCTCTCTCTAGTTGCATGCCTATCCAATTTAATGTCCTGCAAAAACTA
CGGAAGGCAGTTATTGGTATGATCCATGAGTTTGGAGATGTATACAGTATGGACGCTAAAACTTTGGGAGACACCAGCTGTGTGAAAGAAAATTGTTTGTTACAGTGGGG
TGAGAGAAATGGTGTTAGAACAAGCTTGCAGATAGCTTATGTTGAAGGTGCTGGTAGAGGAACCATAGCCAAAAATGATCTGGACGTTGGTGACACTGTATTGGAGATCC
CGCTGGCTATTATTATTTCTGAGGAACTTGTGCAGAAAACCAACATGTATCCCATATTATCAAAGATTGAAGGCATGTCATCTGAGACAATGTTGTTGATATGGAGCATG
AAGGAGAAGCACATTGCTAATTCCAAATTCAAGGTTTACTTTGACACACTACCAGAAGCCTTTAATACTGGGTTAAGTTTTGGAGTTGGCGCAATGATGACTTTGGACGG
AACCCTACTTTTCGATGAGCTAATGCAAGCAAAAGAGCACTTGCGGGAACAATACAATGAGTTGTTTCCAGCCTTATGTAACAATCATCCTGATGTTTTCCCAGAAGAGT
TCTACTCATGGGAGCAGTTCTTATGGGCTTGTGAACTTTGGTACTCAAATAGCTTGAAAATCATGTTTCCTGATGGAAATCTTAGGACCTGCTTGGTTCCAATTGCAGGT
TTTCTCAACCATTCGTTGCACCCGCACATACTACACTATGGCAAAGTTGATTCAGATACAAATTCCTTGAAATTCCGTCTATCAAGACCCTGCCGTGCAGGGGAACAATG
TTATCTTAGTTATGGGAACTACTCTGGTTCTCATCTAGTTGCCTTCTATGGCTTTTTACCTGAAGGAGACAACCTAAATGATATCATTCCATTAGACATTGACTTTGGTG
ATGATGATGATAGTATCATCACATCTGACTGCAGTACTCATATGGTGAGGGGAACGTGGTTGTCAAAGAACCAAAGCATATTCCATTATGGTCTGCCCTCACCGTTATTA
GAGTGTCTACGAAAAGCTCGGTGCCCTGGATTATACACTAAGTGTAAGCTGCAAGGAAGCTTGGAAAATGAAATGGAAGTCCTCAATGATCTCCTGTCAATCTTTGATGG
AATGATGGAAAATCTTGAGGATGAAAACGAAGACAGGAGCAGTACAGAATGGGATATAAAGCTAGCACTGGAATACAAAGATCTACAAAGGAGGATAGTTTCCTCAGGTC
TGACTTCATGTTTTGCTGGTCGCCAGATGGTGGAATTTGCGTTATGCGAATGCATGCTGGAGGATACTCGAGGCTAA
Protein sequenceShow/hide protein sequence
MIDAVHQPLHLPRDRTRRQKEETRQHRPAQPPIRHQTSRGSDRDGVGHQESVDTNRGDDRSGPGILEGGEGGFEGGTLAVEGVEGGGNGEDGGEEGESVNGEEEGGTEAE
EGEGVLARLGEDMAVLETGRGMADDGLEVEDVGLERVGGEREGEEEEREREGEEDEDGEENPFVSRENLKGMEEDFEGEEAERFISKLFVAFACALQPSSSDKVRDDDCL
MFLALDKNDHLFYKKKKLLERQGFKSENCIYLKCSLCPEEVDTVLKELVQIARIIHLNEPEIYFGEDDACTPADSYSPRNEMEALNSIISLVDISLSSCMPIQFNVLQKL
RKAVIGMIHEFGDVYSMDAKTLGDTSCVKENCLLQWGERNGVRTSLQIAYVEGAGRGTIAKNDLDVGDTVLEIPLAIIISEELVQKTNMYPILSKIEGMSSETMLLIWSM
KEKHIANSKFKVYFDTLPEAFNTGLSFGVGAMMTLDGTLLFDELMQAKEHLREQYNELFPALCNNHPDVFPEEFYSWEQFLWACELWYSNSLKIMFPDGNLRTCLVPIAG
FLNHSLHPHILHYGKVDSDTNSLKFRLSRPCRAGEQCYLSYGNYSGSHLVAFYGFLPEGDNLNDIIPLDIDFGDDDDSIITSDCSTHMVRGTWLSKNQSIFHYGLPSPLL
ECLRKARCPGLYTKCKLQGSLENEMEVLNDLLSIFDGMMENLEDENEDRSSTEWDIKLALEYKDLQRRIVSSGLTSCFAGRQMVEFALCECMLEDTRG