; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr023584 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr023584
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionMethyltransf_11 domain-containing protein
Genome locationtig00000892:4732841..4755326
RNA-Seq ExpressionSgr023584
SyntenySgr023584
Gene Ontology termsGO:0032259 - methylation (biological process)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domainsIPR013216 - Methyltransferase type 11
IPR029063 - S-adenosyl-L-methionine-dependent methyltransferase
IPR041698 - Methyltransferase domain 25


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6570536.1 hypothetical protein SDJN03_29451, partial [Cucurbita argyrosperma subsp. sororia]7.6e-17177.04Show/hide
Query:  MALCGQSHRLALISGNGHERTGRLRRSRRGLIKVQTSTRSEMAVYEEGQLERPNWSGETPLSRLVGALISFKPLYSLLKLGARRVLIRSTLCRLLLVLGL
        MALCG SH+LALISG G ++TGRL R+RRG IK Q ST SE AV+EEGQLERPNW+GETPLSRLVGALISFKP+YS+LKLGAR+V I             
Subjt:  MALCGQSHRLALISGNGHERTGRLRRSRRGLIKVQTSTRSEMAVYEEGQLERPNWSGETPLSRLVGALISFKPLYSLLKLGARRVLIRSTLCRLLLVLGL

Query:  FLVPGCNLFWHVGWSSTAEKNNIPWRKMTSEILESDVYKELESVQNLSIVYPDYYLKPFHAYDEGHLSWLAAAEAEPATMSMIMRAVTDASSLDEAKEVV
                       STAEK NIPWRK++S+ILESDVYKELESVQNLSIVYPDYYLKPFHAYDEG+LSWLAAAEAEPATMSMIMRAV  A+S+DEAK+VV
Subjt:  FLVPGCNLFWHVGWSSTAEKNNIPWRKMTSEILESDVYKELESVQNLSIVYPDYYLKPFHAYDEGHLSWLAAAEAEPATMSMIMRAVTDASSLDEAKEVV

Query:  FGNWLRTVDEHHMQYSENPVQDILDIGCSIGFSTRYLADKFPMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLVSIAYMLH
        FGNWLRT++EHH+QYS+NP+ DILDIGCS+GF TR LADKFP AKVTGLDLSPYFLAVA+YMDKKR PRKNAIRWLHGNGE++ LPS SFDL+SIAY+ H
Subjt:  FGNWLRTVDEHHMQYSENPVQDILDIGCSIGFSTRYLADKFPMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLVSIAYMLH

Query:  ECPEVAIVNLLKESFRLLRPGGTIAITDQAPKSKAIQELSPVIFTLLKSTEPYLDEYHLTDLEGRMKEVGFVNVRSRLTDPRHITMTATVPL
        ECP+VAIVN+LKESFRLLRPGGTI ITDQA KSKA+QELSPV+FTLLKSTEP+LDEYHLTDLE +M +VGFVNV SRLTDPRH+T+TATVPL
Subjt:  ECPEVAIVNLLKESFRLLRPGGTIAITDQAPKSKAIQELSPVIFTLLKSTEPYLDEYHLTDLEGRMKEVGFVNVRSRLTDPRHITMTATVPL

XP_022149773.1 uncharacterized protein LOC111018123 isoform X1 [Momordica charantia]2.1e-17679.34Show/hide
Query:  MALCGQSHRLALISGNGHERTGRLRRSRRGLIKVQTSTRSEMAVYEEGQLERPNWSGETPLSRLVGALISFKPLYSLLKLGARRVLIRSTLCRLLLVLGL
        MALCG S+R+AL SGNGH R GRLR S+RGLI+ Q S RSEMAV+EEG+LERPNWSGET LSRLVGALISFKPL+SLLKLGARRVLI             
Subjt:  MALCGQSHRLALISGNGHERTGRLRRSRRGLIKVQTSTRSEMAVYEEGQLERPNWSGETPLSRLVGALISFKPLYSLLKLGARRVLIRSTLCRLLLVLGL

Query:  FLVPGCNLFWHVGWSSTAEKNNIPWRKMTSEILESDVYKELESVQNLSIVYPDYYLKPFHAYDEGHLSWLAAAEAEPATMSMIMRAVTDASSLDEAKEVV
                       STAEK NIPWRKMTSEIL+SDVYKEL+SVQ+LSIVYPDYYLKPFHAYDEGHLSWLAAAEAEPATMSMIMRAV DASS+DEAKE+V
Subjt:  FLVPGCNLFWHVGWSSTAEKNNIPWRKMTSEILESDVYKELESVQNLSIVYPDYYLKPFHAYDEGHLSWLAAAEAEPATMSMIMRAVTDASSLDEAKEVV

Query:  FGNWLRTVDEHHMQYSENPVQDILDIGCSIGFSTRYLADKFPMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLVSIAYMLH
        FGNWLRT++EHH+QYSENP+ DILDIGCS+G STR LADKFP+AKVTGLDLSPYFLAVA+YMDKK APRKN+IRWLHGN E+SSLPSRSFDLVSIA+M H
Subjt:  FGNWLRTVDEHHMQYSENPVQDILDIGCSIGFSTRYLADKFPMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLVSIAYMLH

Query:  ECPEVAIVNLLKESFRLLRPGGTIAITDQAPKSKAIQELSPVIFTLLKSTEPYLDEYHLTDLEGRMKEVGFVNVRSRLTDPRHITMTATVPL
        ECP+VAIV++LKESFRLLRPGG   +TDQAPKSKA+QELSPV+FTLLKSTEPYLDEYHLTDLEGRM+E+GFVNVRS+LTDPRH+T+TATVPL
Subjt:  ECPEVAIVNLLKESFRLLRPGGTIAITDQAPKSKAIQELSPVIFTLLKSTEPYLDEYHLTDLEGRMKEVGFVNVRSRLTDPRHITMTATVPL

XP_022944658.1 uncharacterized protein LOC111449047 [Cucurbita moschata]5.3e-17277.55Show/hide
Query:  MALCGQSHRLALISGNGHERTGRLRRSRRGLIKVQTSTRSEMAVYEEGQLERPNWSGETPLSRLVGALISFKPLYSLLKLGARRVLIRSTLCRLLLVLGL
        MALCG SH+LALISG G ++TGRL R+RRG IK Q ST SE+AV+EEGQLERPNW+GETPLSRLVGALISFKP+YS+LKLGAR+V I             
Subjt:  MALCGQSHRLALISGNGHERTGRLRRSRRGLIKVQTSTRSEMAVYEEGQLERPNWSGETPLSRLVGALISFKPLYSLLKLGARRVLIRSTLCRLLLVLGL

Query:  FLVPGCNLFWHVGWSSTAEKNNIPWRKMTSEILESDVYKELESVQNLSIVYPDYYLKPFHAYDEGHLSWLAAAEAEPATMSMIMRAVTDASSLDEAKEVV
                       STAEK NIPWRK++S+ILESDVYKELESVQNLSIVYPDYYLKPFHAYDEG+LSWLAAAEAEPATMSMIMRAV  A+S+DEAK+VV
Subjt:  FLVPGCNLFWHVGWSSTAEKNNIPWRKMTSEILESDVYKELESVQNLSIVYPDYYLKPFHAYDEGHLSWLAAAEAEPATMSMIMRAVTDASSLDEAKEVV

Query:  FGNWLRTVDEHHMQYSENPVQDILDIGCSIGFSTRYLADKFPMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLVSIAYMLH
        FGNWLRT++EHH+QYS+NP+ DILDIGCS+GF TR LADKFP AKVTGLDLSPYFLAVA+YMDKKRAPRKNAIRWLHGNGE++ LPSRSFDL+SIAY+ H
Subjt:  FGNWLRTVDEHHMQYSENPVQDILDIGCSIGFSTRYLADKFPMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLVSIAYMLH

Query:  ECPEVAIVNLLKESFRLLRPGGTIAITDQAPKSKAIQELSPVIFTLLKSTEPYLDEYHLTDLEGRMKEVGFVNVRSRLTDPRHITMTATVPL
        ECP+VAIVN+LKESFRLLRPGGTI ITDQA KSKA+QELSPV+FTLLKSTEP+LDEYHLTDLE +M +VGFVNV SRLTDPRH+T+TATVPL
Subjt:  ECPEVAIVNLLKESFRLLRPGGTIAITDQAPKSKAIQELSPVIFTLLKSTEPYLDEYHLTDLEGRMKEVGFVNVRSRLTDPRHITMTATVPL

XP_022985996.1 uncharacterized protein LOC111483862 [Cucurbita maxima]9.9e-17177.04Show/hide
Query:  MALCGQSHRLALISGNGHERTGRLRRSRRGLIKVQTSTRSEMAVYEEGQLERPNWSGETPLSRLVGALISFKPLYSLLKLGARRVLIRSTLCRLLLVLGL
        MALCG SH+LALISGNG ++TGRL R+RRG I VQ ST SE AV+EEG+LERPNW+GETPLSRLVGALISFKP+YS+LKLGAR+V I             
Subjt:  MALCGQSHRLALISGNGHERTGRLRRSRRGLIKVQTSTRSEMAVYEEGQLERPNWSGETPLSRLVGALISFKPLYSLLKLGARRVLIRSTLCRLLLVLGL

Query:  FLVPGCNLFWHVGWSSTAEKNNIPWRKMTSEILESDVYKELESVQNLSIVYPDYYLKPFHAYDEGHLSWLAAAEAEPATMSMIMRAVTDASSLDEAKEVV
                       STAEK NIPWRKM S+ILESDVYKELESVQNLSIVYPDYYLKPFHAYDEG+LSW+AAAEAEPATMSMIMRAV  A+S+DEAK+VV
Subjt:  FLVPGCNLFWHVGWSSTAEKNNIPWRKMTSEILESDVYKELESVQNLSIVYPDYYLKPFHAYDEGHLSWLAAAEAEPATMSMIMRAVTDASSLDEAKEVV

Query:  FGNWLRTVDEHHMQYSENPVQDILDIGCSIGFSTRYLADKFPMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLVSIAYMLH
        FGNWLRT++EHH+QYS+NP+ DILDIGCSIGF TR LADKFP AKVTGLDLSPYFL+VA+YMDKKRAPRKNAIRWLHGNGE++ LPS SFDL+SIAY+ H
Subjt:  FGNWLRTVDEHHMQYSENPVQDILDIGCSIGFSTRYLADKFPMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLVSIAYMLH

Query:  ECPEVAIVNLLKESFRLLRPGGTIAITDQAPKSKAIQELSPVIFTLLKSTEPYLDEYHLTDLEGRMKEVGFVNVRSRLTDPRHITMTATVPL
        ECP+VAIVN+LKESFRLLRPGGTI ITDQA KSKA+QE+SPV+FTLLKSTEP+LDEYHLTDLE +M +VGFVNV SRLTDPRH+T+TATVPL
Subjt:  ECPEVAIVNLLKESFRLLRPGGTIAITDQAPKSKAIQELSPVIFTLLKSTEPYLDEYHLTDLEGRMKEVGFVNVRSRLTDPRHITMTATVPL

XP_038902341.1 uncharacterized protein LOC120088974 [Benincasa hispida]1.6e-16876.53Show/hide
Query:  MALCGQSHRLALISGNGHERTGRLRRSRRGLIKVQTSTRSEMAVYEEGQLERPNWSGETPLSRLVGALISFKPLYSLLKLGARRVLIRSTLCRLLLVLGL
        MALCG SH+LALISGNG +RT    R++RG IK Q ST SE AV+EEGQLERPNWSG+TPLSRLVGALISFKPLYS+LKLGAR+VLI             
Subjt:  MALCGQSHRLALISGNGHERTGRLRRSRRGLIKVQTSTRSEMAVYEEGQLERPNWSGETPLSRLVGALISFKPLYSLLKLGARRVLIRSTLCRLLLVLGL

Query:  FLVPGCNLFWHVGWSSTAEKNNIPWRKMTSEILESDVYKELESVQNLSIVYPDYYLKPFHAYDEGHLSWLAAAEAEPATMSMIMRAVTDASSLDEAKEVV
                       STAEK NIPWR +TS+ILESDVYKELESVQN SIVYPDYYLKPFHAYDEG+LSWLAAAE +PATMSMIMRAV +ASS+DEAKE+V
Subjt:  FLVPGCNLFWHVGWSSTAEKNNIPWRKMTSEILESDVYKELESVQNLSIVYPDYYLKPFHAYDEGHLSWLAAAEAEPATMSMIMRAVTDASSLDEAKEVV

Query:  FGNWLRTVDEHHMQYSENPVQDILDIGCSIGFSTRYLADKFPMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLVSIAYMLH
        FGNWLR ++EHH++YS NP+ DILDIGCSIGF TR LADKFP AKVTGLDLSPYFLAVA+YMDKK+APRKNAIRWLHGNGED+SLPSRSFDL+SI+Y+ H
Subjt:  FGNWLRTVDEHHMQYSENPVQDILDIGCSIGFSTRYLADKFPMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLVSIAYMLH

Query:  ECPEVAIVNLLKESFRLLRPGGTIAITDQAPKSKAIQELSPVIFTLLKSTEPYLDEYHLTDLEGRMKEVGFVNVRSRLTDPRHITMTATVPL
        ECP VAIVN+LKESFR+LRPGGTI ITDQA KSK +QELSPV+FTLLKSTEP+LDEYHLTDLE +M+EVGFVNV SRLTDPRH+T+TATVPL
Subjt:  ECPEVAIVNLLKESFRLLRPGGTIAITDQAPKSKAIQELSPVIFTLLKSTEPYLDEYHLTDLEGRMKEVGFVNVRSRLTDPRHITMTATVPL

TrEMBL top hitse value%identityAlignment
A0A1S3C2T1 uncharacterized protein LOC1034963711.9e-16775.57Show/hide
Query:  MALCGQSHRLALIS-GNGHERTGRLRRSRRGLIKVQTSTRSEMAVYEEGQLERPNWSGETPLSRLVGALISFKPLYSLLKLGARRVLIRSTLCRLLLVLG
        MALCG S +LALIS GNG +R GR+ RS RGLIKVQ ST SE+ VYEEG+LERPNWSG+TPLSRLVGALISFKPLYS+LKLGAR+VLI            
Subjt:  MALCGQSHRLALIS-GNGHERTGRLRRSRRGLIKVQTSTRSEMAVYEEGQLERPNWSGETPLSRLVGALISFKPLYSLLKLGARRVLIRSTLCRLLLVLG

Query:  LFLVPGCNLFWHVGWSSTAEKNNIPWRKMTSEILESDVYKELESVQNLSIVYPDYYLKPFHAYDEGHLSWLAAAEAEPATMSMIMRAVTDASSLDEAKEV
                        STAEK NI WRK+TS++LESDVYKEL+SVQN SIVYPDYYLKPFHAYDEG+LSWLAAAE +PATMSMIMRAV  ASS+DEAKE+
Subjt:  LFLVPGCNLFWHVGWSSTAEKNNIPWRKMTSEILESDVYKELESVQNLSIVYPDYYLKPFHAYDEGHLSWLAAAEAEPATMSMIMRAVTDASSLDEAKEV

Query:  VFGNWLRTVDEHHMQYSENPVQDILDIGCSIGFSTRYLADKFPMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLVSIAYML
        VFGNWLR ++EHH++YS NP+ DILDIGCS+GF TR LADKFP AKVTGLDLSPYFLAVA+YMDKK+ PRKNAIRWLHGNGED+ LPSRSFDL+SI+Y+L
Subjt:  VFGNWLRTVDEHHMQYSENPVQDILDIGCSIGFSTRYLADKFPMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLVSIAYML

Query:  HECPEVAIVNLLKESFRLLRPGGTIAITDQAPKSKAIQELSPVIFTLLKSTEPYLDEYHLTDLEGRMKEVGFVNVRSRLTDPRHITMTATVPL
        HECP VAIVN+++ESFRLLRPGGTI ITDQA KSK +QELSPV+FTLLKSTEPYLDEYHLTDLE +M+E+GFVNV SRLTDPRH+T+TATVPL
Subjt:  HECPEVAIVNLLKESFRLLRPGGTIAITDQAPKSKAIQELSPVIFTLLKSTEPYLDEYHLTDLEGRMKEVGFVNVRSRLTDPRHITMTATVPL

A0A5D3CS29 S-adenosyl-L-methionine-dependent methyltransferases superfamily protein1.9e-16775.57Show/hide
Query:  MALCGQSHRLALIS-GNGHERTGRLRRSRRGLIKVQTSTRSEMAVYEEGQLERPNWSGETPLSRLVGALISFKPLYSLLKLGARRVLIRSTLCRLLLVLG
        MALCG S +LALIS GNG +R GR+ RS RGLIKVQ ST SE+ VYEEG+LERPNWSG+TPLSRLVGALISFKPLYS+LKLGAR+VLI            
Subjt:  MALCGQSHRLALIS-GNGHERTGRLRRSRRGLIKVQTSTRSEMAVYEEGQLERPNWSGETPLSRLVGALISFKPLYSLLKLGARRVLIRSTLCRLLLVLG

Query:  LFLVPGCNLFWHVGWSSTAEKNNIPWRKMTSEILESDVYKELESVQNLSIVYPDYYLKPFHAYDEGHLSWLAAAEAEPATMSMIMRAVTDASSLDEAKEV
                        STAEK NI WRK+TS++LESDVYKEL+SVQN SIVYPDYYLKPFHAYDEG+LSWLAAAE +PATMSMIMRAV  ASS+DEAKE+
Subjt:  LFLVPGCNLFWHVGWSSTAEKNNIPWRKMTSEILESDVYKELESVQNLSIVYPDYYLKPFHAYDEGHLSWLAAAEAEPATMSMIMRAVTDASSLDEAKEV

Query:  VFGNWLRTVDEHHMQYSENPVQDILDIGCSIGFSTRYLADKFPMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLVSIAYML
        VFGNWLR ++EHH++YS NP+ DILDIGCS+GF TR LADKFP AKVTGLDLSPYFLAVA+YMDKK+ PRKNAIRWLHGNGED+ LPSRSFDL+SI+Y+L
Subjt:  VFGNWLRTVDEHHMQYSENPVQDILDIGCSIGFSTRYLADKFPMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLVSIAYML

Query:  HECPEVAIVNLLKESFRLLRPGGTIAITDQAPKSKAIQELSPVIFTLLKSTEPYLDEYHLTDLEGRMKEVGFVNVRSRLTDPRHITMTATVPL
        HECP VAIVN+++ESFRLLRPGGTI ITDQA KSK +QELSPV+FTLLKSTEPYLDEYHLTDLE +M+E+GFVNV SRLTDPRH+T+TATVPL
Subjt:  HECPEVAIVNLLKESFRLLRPGGTIAITDQAPKSKAIQELSPVIFTLLKSTEPYLDEYHLTDLEGRMKEVGFVNVRSRLTDPRHITMTATVPL

A0A6J1D7N3 uncharacterized protein LOC111018123 isoform X11.0e-17679.34Show/hide
Query:  MALCGQSHRLALISGNGHERTGRLRRSRRGLIKVQTSTRSEMAVYEEGQLERPNWSGETPLSRLVGALISFKPLYSLLKLGARRVLIRSTLCRLLLVLGL
        MALCG S+R+AL SGNGH R GRLR S+RGLI+ Q S RSEMAV+EEG+LERPNWSGET LSRLVGALISFKPL+SLLKLGARRVLI             
Subjt:  MALCGQSHRLALISGNGHERTGRLRRSRRGLIKVQTSTRSEMAVYEEGQLERPNWSGETPLSRLVGALISFKPLYSLLKLGARRVLIRSTLCRLLLVLGL

Query:  FLVPGCNLFWHVGWSSTAEKNNIPWRKMTSEILESDVYKELESVQNLSIVYPDYYLKPFHAYDEGHLSWLAAAEAEPATMSMIMRAVTDASSLDEAKEVV
                       STAEK NIPWRKMTSEIL+SDVYKEL+SVQ+LSIVYPDYYLKPFHAYDEGHLSWLAAAEAEPATMSMIMRAV DASS+DEAKE+V
Subjt:  FLVPGCNLFWHVGWSSTAEKNNIPWRKMTSEILESDVYKELESVQNLSIVYPDYYLKPFHAYDEGHLSWLAAAEAEPATMSMIMRAVTDASSLDEAKEVV

Query:  FGNWLRTVDEHHMQYSENPVQDILDIGCSIGFSTRYLADKFPMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLVSIAYMLH
        FGNWLRT++EHH+QYSENP+ DILDIGCS+G STR LADKFP+AKVTGLDLSPYFLAVA+YMDKK APRKN+IRWLHGN E+SSLPSRSFDLVSIA+M H
Subjt:  FGNWLRTVDEHHMQYSENPVQDILDIGCSIGFSTRYLADKFPMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLVSIAYMLH

Query:  ECPEVAIVNLLKESFRLLRPGGTIAITDQAPKSKAIQELSPVIFTLLKSTEPYLDEYHLTDLEGRMKEVGFVNVRSRLTDPRHITMTATVPL
        ECP+VAIV++LKESFRLLRPGG   +TDQAPKSKA+QELSPV+FTLLKSTEPYLDEYHLTDLEGRM+E+GFVNVRS+LTDPRH+T+TATVPL
Subjt:  ECPEVAIVNLLKESFRLLRPGGTIAITDQAPKSKAIQELSPVIFTLLKSTEPYLDEYHLTDLEGRMKEVGFVNVRSRLTDPRHITMTATVPL

A0A6J1FY99 uncharacterized protein LOC1114490472.5e-17277.55Show/hide
Query:  MALCGQSHRLALISGNGHERTGRLRRSRRGLIKVQTSTRSEMAVYEEGQLERPNWSGETPLSRLVGALISFKPLYSLLKLGARRVLIRSTLCRLLLVLGL
        MALCG SH+LALISG G ++TGRL R+RRG IK Q ST SE+AV+EEGQLERPNW+GETPLSRLVGALISFKP+YS+LKLGAR+V I             
Subjt:  MALCGQSHRLALISGNGHERTGRLRRSRRGLIKVQTSTRSEMAVYEEGQLERPNWSGETPLSRLVGALISFKPLYSLLKLGARRVLIRSTLCRLLLVLGL

Query:  FLVPGCNLFWHVGWSSTAEKNNIPWRKMTSEILESDVYKELESVQNLSIVYPDYYLKPFHAYDEGHLSWLAAAEAEPATMSMIMRAVTDASSLDEAKEVV
                       STAEK NIPWRK++S+ILESDVYKELESVQNLSIVYPDYYLKPFHAYDEG+LSWLAAAEAEPATMSMIMRAV  A+S+DEAK+VV
Subjt:  FLVPGCNLFWHVGWSSTAEKNNIPWRKMTSEILESDVYKELESVQNLSIVYPDYYLKPFHAYDEGHLSWLAAAEAEPATMSMIMRAVTDASSLDEAKEVV

Query:  FGNWLRTVDEHHMQYSENPVQDILDIGCSIGFSTRYLADKFPMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLVSIAYMLH
        FGNWLRT++EHH+QYS+NP+ DILDIGCS+GF TR LADKFP AKVTGLDLSPYFLAVA+YMDKKRAPRKNAIRWLHGNGE++ LPSRSFDL+SIAY+ H
Subjt:  FGNWLRTVDEHHMQYSENPVQDILDIGCSIGFSTRYLADKFPMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLVSIAYMLH

Query:  ECPEVAIVNLLKESFRLLRPGGTIAITDQAPKSKAIQELSPVIFTLLKSTEPYLDEYHLTDLEGRMKEVGFVNVRSRLTDPRHITMTATVPL
        ECP+VAIVN+LKESFRLLRPGGTI ITDQA KSKA+QELSPV+FTLLKSTEP+LDEYHLTDLE +M +VGFVNV SRLTDPRH+T+TATVPL
Subjt:  ECPEVAIVNLLKESFRLLRPGGTIAITDQAPKSKAIQELSPVIFTLLKSTEPYLDEYHLTDLEGRMKEVGFVNVRSRLTDPRHITMTATVPL

A0A6J1J9T8 uncharacterized protein LOC1114838624.8e-17177.04Show/hide
Query:  MALCGQSHRLALISGNGHERTGRLRRSRRGLIKVQTSTRSEMAVYEEGQLERPNWSGETPLSRLVGALISFKPLYSLLKLGARRVLIRSTLCRLLLVLGL
        MALCG SH+LALISGNG ++TGRL R+RRG I VQ ST SE AV+EEG+LERPNW+GETPLSRLVGALISFKP+YS+LKLGAR+V I             
Subjt:  MALCGQSHRLALISGNGHERTGRLRRSRRGLIKVQTSTRSEMAVYEEGQLERPNWSGETPLSRLVGALISFKPLYSLLKLGARRVLIRSTLCRLLLVLGL

Query:  FLVPGCNLFWHVGWSSTAEKNNIPWRKMTSEILESDVYKELESVQNLSIVYPDYYLKPFHAYDEGHLSWLAAAEAEPATMSMIMRAVTDASSLDEAKEVV
                       STAEK NIPWRKM S+ILESDVYKELESVQNLSIVYPDYYLKPFHAYDEG+LSW+AAAEAEPATMSMIMRAV  A+S+DEAK+VV
Subjt:  FLVPGCNLFWHVGWSSTAEKNNIPWRKMTSEILESDVYKELESVQNLSIVYPDYYLKPFHAYDEGHLSWLAAAEAEPATMSMIMRAVTDASSLDEAKEVV

Query:  FGNWLRTVDEHHMQYSENPVQDILDIGCSIGFSTRYLADKFPMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLVSIAYMLH
        FGNWLRT++EHH+QYS+NP+ DILDIGCSIGF TR LADKFP AKVTGLDLSPYFL+VA+YMDKKRAPRKNAIRWLHGNGE++ LPS SFDL+SIAY+ H
Subjt:  FGNWLRTVDEHHMQYSENPVQDILDIGCSIGFSTRYLADKFPMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLVSIAYMLH

Query:  ECPEVAIVNLLKESFRLLRPGGTIAITDQAPKSKAIQELSPVIFTLLKSTEPYLDEYHLTDLEGRMKEVGFVNVRSRLTDPRHITMTATVPL
        ECP+VAIVN+LKESFRLLRPGGTI ITDQA KSKA+QE+SPV+FTLLKSTEP+LDEYHLTDLE +M +VGFVNV SRLTDPRH+T+TATVPL
Subjt:  ECPEVAIVNLLKESFRLLRPGGTIAITDQAPKSKAIQELSPVIFTLLKSTEPYLDEYHLTDLEGRMKEVGFVNVRSRLTDPRHITMTATVPL

SwissProt top hitse value%identityAlignment
A0AK43 Demethylmenaquinone methyltransferase2.9e-0832.41Show/hide
Query:  DILDIGCSIGFSTRYLADKF-PMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLVSIAYMLHECPEVAIVNLLKESFRLLRP
        ++LD+ C     +  +A++  P   VTGLD S   L V R  +K +    + +  +HGN  +   P  SFD V+I + L   P+   + +L+E +R+L+P
Subjt:  DILDIGCSIGFSTRYLADKF-PMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLVSIAYMLHECPEVAIVNLLKESFRLLRP

Query:  GGTIAITD
        GG +A  D
Subjt:  GGTIAITD

C1KWN1 Demethylmenaquinone methyltransferase2.9e-0832.41Show/hide
Query:  DILDIGCSIGFSTRYLADKF-PMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLVSIAYMLHECPEVAIVNLLKESFRLLRP
        ++LD+ C     +  +A++  P   VTGLD S   L V R  +K +    + +  +HGN  +   P  SFD V+I + L   P+   + +L+E +R+L+P
Subjt:  DILDIGCSIGFSTRYLADKF-PMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLVSIAYMLHECPEVAIVNLLKESFRLLRP

Query:  GGTIAITD
        GG +A  D
Subjt:  GGTIAITD

P67055 Demethylmenaquinone methyltransferase2.9e-0832.41Show/hide
Query:  DILDIGCSIGFSTRYLADKF-PMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLVSIAYMLHECPEVAIVNLLKESFRLLRP
        ++LD+ C     +  +A++  P   VTGLD S   L V R  +K +    + +  +HGN  +   P  SFD V+I + L   P+   + +L+E +R+L+P
Subjt:  DILDIGCSIGFSTRYLADKF-PMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLVSIAYMLHECPEVAIVNLLKESFRLLRP

Query:  GGTIAITD
        GG +A  D
Subjt:  GGTIAITD

P67056 Demethylmenaquinone methyltransferase2.9e-0832.41Show/hide
Query:  DILDIGCSIGFSTRYLADKF-PMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLVSIAYMLHECPEVAIVNLLKESFRLLRP
        ++LD+ C     +  +A++  P   VTGLD S   L V R  +K +    + +  +HGN  +   P  SFD V+I + L   P+   + +L+E +R+L+P
Subjt:  DILDIGCSIGFSTRYLADKF-PMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLVSIAYMLHECPEVAIVNLLKESFRLLRP

Query:  GGTIAITD
        GG +A  D
Subjt:  GGTIAITD

Q71Y84 Demethylmenaquinone methyltransferase2.9e-0832.41Show/hide
Query:  DILDIGCSIGFSTRYLADKF-PMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLVSIAYMLHECPEVAIVNLLKESFRLLRP
        ++LD+ C     +  +A++  P   VTGLD S   L V R  +K +    + +  +HGN  +   P  SFD V+I + L   P+   + +L+E +R+L+P
Subjt:  DILDIGCSIGFSTRYLADKF-PMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLVSIAYMLHECPEVAIVNLLKESFRLLRP

Query:  GGTIAITD
        GG +A  D
Subjt:  GGTIAITD

Arabidopsis top hitse value%identityAlignment
AT1G48600.1 S-adenosyl-L-methionine-dependent methyltransferases superfamily protein1.3e-0627.33Show/hide
Query:  TVDEHHMQYSENPVQDILDIGCSIGFSTRYLADKFPMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLV-SIAYMLHECPEV
        T  E   +    P Q +LD+GC IG    Y+A+ F +  V G+DLS   ++ A    ++    K ++ +   +    + P  SFD++ S   +LH   + 
Subjt:  TVDEHHMQYSENPVQDILDIGCSIGFSTRYLADKFPMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLV-SIAYMLHECPEV

Query:  AIVNLLKESFRLLRPGGTIAITDQAPKSKAIQELSPVIFTLLKSTEPYLDEYHLTDLEG---RMKEVGFVNV
        A   L +  F+ L+PGG + ITD     ++ +  SP     +K        Y L D++     +K+ GF +V
Subjt:  AIVNLLKESFRLLRPGGTIAITDQAPKSKAIQELSPVIFTLLKSTEPYLDEYHLTDLEG---RMKEVGFVNV

AT1G48600.2 S-adenosyl-L-methionine-dependent methyltransferases superfamily protein1.3e-0627.33Show/hide
Query:  TVDEHHMQYSENPVQDILDIGCSIGFSTRYLADKFPMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLV-SIAYMLHECPEV
        T  E   +    P Q +LD+GC IG    Y+A+ F +  V G+DLS   ++ A    ++    K ++ +   +    + P  SFD++ S   +LH   + 
Subjt:  TVDEHHMQYSENPVQDILDIGCSIGFSTRYLADKFPMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLV-SIAYMLHECPEV

Query:  AIVNLLKESFRLLRPGGTIAITDQAPKSKAIQELSPVIFTLLKSTEPYLDEYHLTDLEG---RMKEVGFVNV
        A   L +  F+ L+PGG + ITD     ++ +  SP     +K        Y L D++     +K+ GF +V
Subjt:  AIVNLLKESFRLLRPGGTIAITDQAPKSKAIQELSPVIFTLLKSTEPYLDEYHLTDLEG---RMKEVGFVNV

AT1G73600.1 S-adenosyl-L-methionine-dependent methyltransferases superfamily protein1.4e-0528.79Show/hide
Query:  LRTVDEHHMQYSENPVQDILDIGCSIGFSTRYLADKFPMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLV-SIAYMLHECP
        L T  E        P Q +LD+GC IG    Y+A+ F +  V G+DLS   ++ A    +     K ++ +   +      P  +FD++ S   +LH   
Subjt:  LRTVDEHHMQYSENPVQDILDIGCSIGFSTRYLADKFPMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLV-SIAYMLHECP

Query:  EVAIVNLLKESFRLLRPGGTIAITD--QAPKS
        + A   L +  ++ L+PGG + ITD  ++PK+
Subjt:  EVAIVNLLKESFRLLRPGGTIAITD--QAPKS

AT1G73600.2 S-adenosyl-L-methionine-dependent methyltransferases superfamily protein1.4e-0528.79Show/hide
Query:  LRTVDEHHMQYSENPVQDILDIGCSIGFSTRYLADKFPMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLV-SIAYMLHECP
        L T  E        P Q +LD+GC IG    Y+A+ F +  V G+DLS   ++ A    +     K ++ +   +      P  +FD++ S   +LH   
Subjt:  LRTVDEHHMQYSENPVQDILDIGCSIGFSTRYLADKFPMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLV-SIAYMLHECP

Query:  EVAIVNLLKESFRLLRPGGTIAITD--QAPKS
        + A   L +  ++ L+PGG + ITD  ++PK+
Subjt:  EVAIVNLLKESFRLLRPGGTIAITD--QAPKS

AT3G18000.1 S-adenosyl-L-methionine-dependent methyltransferases superfamily protein2.2e-0628.68Show/hide
Query:  LRTVDEHHMQYSENPVQDILDIGCSIGFSTRYLADKFPMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLV-SIAYMLHECP
        L T  E   + +  P Q +LD+GC IG    Y+A+KF +  V G+DLS   ++ A    ++      ++ +   +      P  SFD++ S   +LH   
Subjt:  LRTVDEHHMQYSENPVQDILDIGCSIGFSTRYLADKFPMAKVTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLV-SIAYMLHECP

Query:  EVAIVNLLKESFRLLRPGGTIAITD--QAPKSKAIQ
        + A   L +  F+ L+PGG + I+D  ++PK+ + +
Subjt:  EVAIVNLLKESFRLLRPGGTIAITD--QAPKSKAIQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAAACGTGTTAGTGTTAACCAGCTGCAAAGAAGGTAACATTGAAACTACTGAATTGCGGCGGTTCCTCGAGAGCTCCGTGCACCAGGCCATCAAATCTGTACTAAA
CAAGTTTTCCGATCCGCAAGAGTTCTCGTTCATGACATTTTCTAACAAGAGACGGCGAGTTTCCCTATTCAATTCTTCGAGCTCATCTTCCATTGGTGATTTCTGTATCC
CATCTCCTGACACTCCCTTTCCTCTCAAGGGCTGTAAATGGGATCACAGTGACATGAAAATGTATGAATACACTCAAATCTTCTTTCATATCGTGTTCGTAAACGGGACC
GAGTCACCAACTGGCTATAATCTCCGGCGATGGGCACTGCAGAGCAGGAAGAAGCAGAAGGGAGTCGATAAAGGTTCAGATGTCGACGAGGTCGGAAATGACGGTGTACG
ACGAGGGGCAGTTGGAGCGTCCCAAATGGTCTGGGGACACCCCCTCTCTCGGCTTGTGGGTGCTCTCATTTCCTTCAAGCCCCTCTACTCCATTCTTAAGCTCGGCGCCA
GACGTGTCATGATCAGTACGGCGGAGAAGAACAACATTCCATGGAGAAAAATGACGTCCGAGATTTTGGCGTCTGACGTTTACAAGGAGTTCGAGAGCGTTCAAAATCCT
CGATTGTCTACCCCGATTGCTGCAGCAGAAGTAGAGCCTGCAACAATGTCAATGATTATGCGAGCGGTACCCAACACCTCTTCATTAGATGAAGCAAAGGAAGTGGTATT
CGGAAATTGGCTTCGTTTGAGCACAAGATATCTGGCTGACAAGTTTCCACTAGCTAAAACTACTGGACTAGATTTGTCTCCTTACTTCCTTGCTGTGGCTCGATACATGG
ACAAGAAAAGAGCTCCAAGAAACAACGCAATAAGTTGGTTGCATGAAAATGGGGAAGATACTAGCTTGCCTTCAAAATCATTTGATCTAGTTTCCAGTGCTTATCTGTTC
CATGAATGTCCCCAAGTAGCAATAGTCAATCTGCTTAAGGAATCATTTCGACTTCTTCGACCTGGTGGCACAATTGCCATTACTGACAATGCGTTTACACATCCTTCTTC
TTCCACTCCTAAAGCCGGTGAACTGAGTCGCTTAGCTGATCTCTCTAATGGCGTTTGCGGACCGAGTCACCAACTCTCTATAATCTCCGGCAGTGGGCTCGACAGACCAG
CGCGACAAAGAAGAAGCAAAAGGGAATCGACGATGAAGGTTCAGATCTCGACGAGGTCGGAAGAGTCGCCGACGACGGTGTTCGAGGAGGGACAGTTAGAGCACCCCAAC
TGGTCTGGAGAGACCCCTCTCTCTCGGCTTGTGGGAGCTCTCATTTCCTTCAAACCCCTCTACTCCATTCTTAAGCTCGGCGCCAGACGTGTGATGATCAGCACGGCGGA
GAAGAACAACATTCCATGGCGAAAAATGACGTCGGAGATTCTGGAGTCGGACGTTTACGAGGAGTTCGAGAGAGTCCAGAATCTCTCCATCGCTTACCCCGACTATTACC
TGAAGCCTTTCCATGCATATGATGAGGGTCATCTTTCATGGCTTGCTGCAGCAGAAGTAGAGCCTGCAACAATGACAATGCTTATGCGAGCAGTACCCGATGCCTCTTCA
TTAGATGAAGCAAAGGAAATAGTGTTTGGAAATTGGCTTCGTGCAATTGATATGCATCATCTACAGTATTCAAGAAATCCCATTTTAGACATTCTAGACATTGGTTGTTC
TACTGGTTTAAGCACAAGATATTTGGCCGACAAGTTTCCACTTGCTAAAGTGACTGGACTAGATTTGTCTCCTTACTTCCTTGCTGTGGCTCGATACACTGAAAAAAAGA
GAGCCCCAAGAAAGAATGCAATCAGATGGTTGCATGAAAATGCAGAAGATTCCAGCTTGCCTTCAAGATCATTTGACTTAATTTCCATTGCTTATATGGAATCATTCCGG
CTTCTTCGACCTGGTGGCACAATTGCTATTATTGATGAAGCACCCAAATCAAAGGCTAATCAGAAATTGTCTCCAGTCCTATTTACATTGCTCAAAAGCACAGAACCATA
TTTGGATGAGTATCATCTCACTGATTTGGAAGGAAGAATGAGAGAAGTTGGATTTGTAATAGATACGGAGGATGATTTCTTCTGTTTACGGTTCACTCATCCTTTCTTCT
TCCAAACGGTGACTGGCGATCGCTTACCTCCTCCAATGGCATTGTGCGGACAGAGTCACCGGCTCGCTCTGATCTCGGGCAATGGCCACGAGAGAACAGGACGACTAAGA
AGAAGCAGAAGGGGCTTGATAAAAGTTCAGACGTCGACGAGGTCTGAAATGGCGGTGTACGAGGAGGGGCAGTTGGAGCGTCCCAATTGGTCTGGAGAGACCCCCCTCTC
TCGGCTTGTGGGTGCTCTCATTTCCTTCAAACCCCTCTATTCCCTTCTCAAGCTCGGCGCCAGACGAGTCCTCATCAGGTCTACATTGTGTAGACTTCTTTTGGTTTTGG
GTTTGTTTTTGGTTCCGGGTTGTAATCTGTTTTGGCATGTGGGTTGGAGCAGTACAGCGGAGAAGAACAATATTCCATGGCGAAAAATGACATCGGAAATTTTGGAGTCC
GACGTTTACAAGGAGCTGGAGAGCGTTCAAAATCTCTCTATTGTCTACCCCGATTATTACCTGAAGCCTTTCCATGCATATGATGAGGGTCATCTTTCATGGCTTGCTGC
AGCAGAAGCAGAGCCTGCAACAATGTCAATGATTATGCGAGCAGTAACCGATGCCTCTTCATTAGATGAAGCGAAGGAAGTAGTGTTTGGAAATTGGCTTCGTACGGTTG
ATGAGCATCATATGCAGTACTCAGAAAATCCTGTTCAAGACATTCTAGATATTGGTTGTTCTATAGGTTTCAGCACAAGATATTTAGCTGATAAGTTTCCCATGGCTAAA
GTGACTGGACTCGATTTATCTCCTTACTTCCTTGCGGTGGCTCGATACATGGACAAGAAAAGAGCTCCAAGAAAGAATGCAATAAGATGGTTGCATGGAAATGGGGAAGA
TTCTAGCTTGCCTTCAAGATCATTTGACCTAGTTTCCATTGCTTATATGCTCCATGAATGTCCCGAAGTAGCAATAGTCAATTTGCTCAAGGAATCATTTCGACTTCTTC
GACCCGGTGGCACAATTGCAATTACTGACCAAGCTCCCAAATCAAAGGCTATTCAGGAATTATCTCCAGTCATATTCACATTACTCAAAAGTACAGAACCATATCTGGAT
GAGTACCACCTCACTGATTTAGAAGGAAGAATGAAAGAAGTTGGATTTGTGAATGTGAGATCAAGATTGACCGACCCAAGACATATTACAATGACAGCAACCGTTCCACT
TTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAAAACGTGTTAGTGTTAACCAGCTGCAAAGAAGGTAACATTGAAACTACTGAATTGCGGCGGTTCCTCGAGAGCTCCGTGCACCAGGCCATCAAATCTGTACTAAA
CAAGTTTTCCGATCCGCAAGAGTTCTCGTTCATGACATTTTCTAACAAGAGACGGCGAGTTTCCCTATTCAATTCTTCGAGCTCATCTTCCATTGGTGATTTCTGTATCC
CATCTCCTGACACTCCCTTTCCTCTCAAGGGCTGTAAATGGGATCACAGTGACATGAAAATGTATGAATACACTCAAATCTTCTTTCATATCGTGTTCGTAAACGGGACC
GAGTCACCAACTGGCTATAATCTCCGGCGATGGGCACTGCAGAGCAGGAAGAAGCAGAAGGGAGTCGATAAAGGTTCAGATGTCGACGAGGTCGGAAATGACGGTGTACG
ACGAGGGGCAGTTGGAGCGTCCCAAATGGTCTGGGGACACCCCCTCTCTCGGCTTGTGGGTGCTCTCATTTCCTTCAAGCCCCTCTACTCCATTCTTAAGCTCGGCGCCA
GACGTGTCATGATCAGTACGGCGGAGAAGAACAACATTCCATGGAGAAAAATGACGTCCGAGATTTTGGCGTCTGACGTTTACAAGGAGTTCGAGAGCGTTCAAAATCCT
CGATTGTCTACCCCGATTGCTGCAGCAGAAGTAGAGCCTGCAACAATGTCAATGATTATGCGAGCGGTACCCAACACCTCTTCATTAGATGAAGCAAAGGAAGTGGTATT
CGGAAATTGGCTTCGTTTGAGCACAAGATATCTGGCTGACAAGTTTCCACTAGCTAAAACTACTGGACTAGATTTGTCTCCTTACTTCCTTGCTGTGGCTCGATACATGG
ACAAGAAAAGAGCTCCAAGAAACAACGCAATAAGTTGGTTGCATGAAAATGGGGAAGATACTAGCTTGCCTTCAAAATCATTTGATCTAGTTTCCAGTGCTTATCTGTTC
CATGAATGTCCCCAAGTAGCAATAGTCAATCTGCTTAAGGAATCATTTCGACTTCTTCGACCTGGTGGCACAATTGCCATTACTGACAATGCGTTTACACATCCTTCTTC
TTCCACTCCTAAAGCCGGTGAACTGAGTCGCTTAGCTGATCTCTCTAATGGCGTTTGCGGACCGAGTCACCAACTCTCTATAATCTCCGGCAGTGGGCTCGACAGACCAG
CGCGACAAAGAAGAAGCAAAAGGGAATCGACGATGAAGGTTCAGATCTCGACGAGGTCGGAAGAGTCGCCGACGACGGTGTTCGAGGAGGGACAGTTAGAGCACCCCAAC
TGGTCTGGAGAGACCCCTCTCTCTCGGCTTGTGGGAGCTCTCATTTCCTTCAAACCCCTCTACTCCATTCTTAAGCTCGGCGCCAGACGTGTGATGATCAGCACGGCGGA
GAAGAACAACATTCCATGGCGAAAAATGACGTCGGAGATTCTGGAGTCGGACGTTTACGAGGAGTTCGAGAGAGTCCAGAATCTCTCCATCGCTTACCCCGACTATTACC
TGAAGCCTTTCCATGCATATGATGAGGGTCATCTTTCATGGCTTGCTGCAGCAGAAGTAGAGCCTGCAACAATGACAATGCTTATGCGAGCAGTACCCGATGCCTCTTCA
TTAGATGAAGCAAAGGAAATAGTGTTTGGAAATTGGCTTCGTGCAATTGATATGCATCATCTACAGTATTCAAGAAATCCCATTTTAGACATTCTAGACATTGGTTGTTC
TACTGGTTTAAGCACAAGATATTTGGCCGACAAGTTTCCACTTGCTAAAGTGACTGGACTAGATTTGTCTCCTTACTTCCTTGCTGTGGCTCGATACACTGAAAAAAAGA
GAGCCCCAAGAAAGAATGCAATCAGATGGTTGCATGAAAATGCAGAAGATTCCAGCTTGCCTTCAAGATCATTTGACTTAATTTCCATTGCTTATATGGAATCATTCCGG
CTTCTTCGACCTGGTGGCACAATTGCTATTATTGATGAAGCACCCAAATCAAAGGCTAATCAGAAATTGTCTCCAGTCCTATTTACATTGCTCAAAAGCACAGAACCATA
TTTGGATGAGTATCATCTCACTGATTTGGAAGGAAGAATGAGAGAAGTTGGATTTGTAATAGATACGGAGGATGATTTCTTCTGTTTACGGTTCACTCATCCTTTCTTCT
TCCAAACGGTGACTGGCGATCGCTTACCTCCTCCAATGGCATTGTGCGGACAGAGTCACCGGCTCGCTCTGATCTCGGGCAATGGCCACGAGAGAACAGGACGACTAAGA
AGAAGCAGAAGGGGCTTGATAAAAGTTCAGACGTCGACGAGGTCTGAAATGGCGGTGTACGAGGAGGGGCAGTTGGAGCGTCCCAATTGGTCTGGAGAGACCCCCCTCTC
TCGGCTTGTGGGTGCTCTCATTTCCTTCAAACCCCTCTATTCCCTTCTCAAGCTCGGCGCCAGACGAGTCCTCATCAGGTCTACATTGTGTAGACTTCTTTTGGTTTTGG
GTTTGTTTTTGGTTCCGGGTTGTAATCTGTTTTGGCATGTGGGTTGGAGCAGTACAGCGGAGAAGAACAATATTCCATGGCGAAAAATGACATCGGAAATTTTGGAGTCC
GACGTTTACAAGGAGCTGGAGAGCGTTCAAAATCTCTCTATTGTCTACCCCGATTATTACCTGAAGCCTTTCCATGCATATGATGAGGGTCATCTTTCATGGCTTGCTGC
AGCAGAAGCAGAGCCTGCAACAATGTCAATGATTATGCGAGCAGTAACCGATGCCTCTTCATTAGATGAAGCGAAGGAAGTAGTGTTTGGAAATTGGCTTCGTACGGTTG
ATGAGCATCATATGCAGTACTCAGAAAATCCTGTTCAAGACATTCTAGATATTGGTTGTTCTATAGGTTTCAGCACAAGATATTTAGCTGATAAGTTTCCCATGGCTAAA
GTGACTGGACTCGATTTATCTCCTTACTTCCTTGCGGTGGCTCGATACATGGACAAGAAAAGAGCTCCAAGAAAGAATGCAATAAGATGGTTGCATGGAAATGGGGAAGA
TTCTAGCTTGCCTTCAAGATCATTTGACCTAGTTTCCATTGCTTATATGCTCCATGAATGTCCCGAAGTAGCAATAGTCAATTTGCTCAAGGAATCATTTCGACTTCTTC
GACCCGGTGGCACAATTGCAATTACTGACCAAGCTCCCAAATCAAAGGCTATTCAGGAATTATCTCCAGTCATATTCACATTACTCAAAAGTACAGAACCATATCTGGAT
GAGTACCACCTCACTGATTTAGAAGGAAGAATGAAAGAAGTTGGATTTGTGAATGTGAGATCAAGATTGACCGACCCAAGACATATTACAATGACAGCAACCGTTCCACT
TTAA
Protein sequenceShow/hide protein sequence
MKNVLVLTSCKEGNIETTELRRFLESSVHQAIKSVLNKFSDPQEFSFMTFSNKRRRVSLFNSSSSSSIGDFCIPSPDTPFPLKGCKWDHSDMKMYEYTQIFFHIVFVNGT
ESPTGYNLRRWALQSRKKQKGVDKGSDVDEVGNDGVRRGAVGASQMVWGHPLSRLVGALISFKPLYSILKLGARRVMISTAEKNNIPWRKMTSEILASDVYKEFESVQNP
RLSTPIAAAEVEPATMSMIMRAVPNTSSLDEAKEVVFGNWLRLSTRYLADKFPLAKTTGLDLSPYFLAVARYMDKKRAPRNNAISWLHENGEDTSLPSKSFDLVSSAYLF
HECPQVAIVNLLKESFRLLRPGGTIAITDNAFTHPSSSTPKAGELSRLADLSNGVCGPSHQLSIISGSGLDRPARQRRSKRESTMKVQISTRSEESPTTVFEEGQLEHPN
WSGETPLSRLVGALISFKPLYSILKLGARRVMISTAEKNNIPWRKMTSEILESDVYEEFERVQNLSIAYPDYYLKPFHAYDEGHLSWLAAAEVEPATMTMLMRAVPDASS
LDEAKEIVFGNWLRAIDMHHLQYSRNPILDILDIGCSTGLSTRYLADKFPLAKVTGLDLSPYFLAVARYTEKKRAPRKNAIRWLHENAEDSSLPSRSFDLISIAYMESFR
LLRPGGTIAIIDEAPKSKANQKLSPVLFTLLKSTEPYLDEYHLTDLEGRMREVGFVIDTEDDFFCLRFTHPFFFQTVTGDRLPPPMALCGQSHRLALISGNGHERTGRLR
RSRRGLIKVQTSTRSEMAVYEEGQLERPNWSGETPLSRLVGALISFKPLYSLLKLGARRVLIRSTLCRLLLVLGLFLVPGCNLFWHVGWSSTAEKNNIPWRKMTSEILES
DVYKELESVQNLSIVYPDYYLKPFHAYDEGHLSWLAAAEAEPATMSMIMRAVTDASSLDEAKEVVFGNWLRTVDEHHMQYSENPVQDILDIGCSIGFSTRYLADKFPMAK
VTGLDLSPYFLAVARYMDKKRAPRKNAIRWLHGNGEDSSLPSRSFDLVSIAYMLHECPEVAIVNLLKESFRLLRPGGTIAITDQAPKSKAIQELSPVIFTLLKSTEPYLD
EYHLTDLEGRMKEVGFVNVRSRLTDPRHITMTATVPL