; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr021134 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr021134
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionprotein SET DOMAIN GROUP 40 isoform X1
Genome locationtig00153640:992031..1008092
RNA-Seq ExpressionSgr021134
SyntenySgr021134
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6581196.1 Protein SET DOMAIN GROUP 40, partial [Cucurbita argyrosperma subsp. sororia]4.1e-12560.14Show/hide
Query:  METEGSLESLLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTF
        M TE S ESLLRWAADHGISDSV+KQ SHSCLGRSLCV FFPDAGGRGLGAVR+L KG+LVL+VPKSVLLTTQSL L+DEKLSMALKRYPSLSSTQKLTF
Subjt:  METEGSLESLLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTF

Query:  CLLYEIGKGSSSWWFTYFKHLPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKAWLWASAT-------------
        CLLYEIGKGSSSWWF YFKHLP +Y+ LATFGEFEKQALQVDYA+W  EKAA KSH EW GVKGLMEES  K+Q++TFKAWLWASAT             
Subjt:  CLLYEIGKGSSSWWFTYFKHLPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKAWLWASAT-------------

Query:  GCFI-----------------------FFTTCFLSGNMTTDELHEEQRDTQWALTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLL
        GC                         F     L+GN+TTD LH+E++DTQ ALTDGGFEENVSAYCFYARESYK+G+QVLLSYGTYTNLELL+YYGFLL
Subjt:  GCFI-----------------------FFTTCFLSGNMTTDELHEEQRDTQWALTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLL

Query:  QENPNDK-FLFLWNMSFILPVLGPRSLFIF-------------------------------------IKMEIHLLLYFQLCDYG--QPARTSVEALDTLL
        QENPND+ F+ L +  +        SLFI                                      +K E+ ++ +     +       TSVE  + LL
Subjt:  QENPNDK-FLFLWNMSFILPVLGPRSLFIF-------------------------------------IKMEIHLLLYFQLCDYG--QPARTSVEALDTLL

Query:  -----MPDLQIPRELGKILSTYGGEFSAFLETNG
             + DLQ+PRELGK+ ST GGEF AFLETNG
Subjt:  -----MPDLQIPRELGKILSTYGGEFSAFLETNG

KAG7017936.1 Protein SET DOMAIN GROUP 40, partial [Cucurbita argyrosperma subsp. argyrosperma]3.5e-12459.91Show/hide
Query:  METEGSLESLLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTF
        M TE S ESLLRWAADHGISDSV+KQ SHSCLGRSLCV FFPDAGGRGLGAVR+L KG+LVL+VPKSVLLTTQSL L+DEKLSMALKRYPSLSSTQKLTF
Subjt:  METEGSLESLLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTF

Query:  CLLYEIGKGSSSWWFTYFKHLPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKAWLWASAT-------------
        CLLYEIGKGSSSWWF YFKHLP +Y+ LATFGEFEKQALQVDYA+W  EKAA KS  EW GVKGLMEES  K+Q++TFKAWLWASAT             
Subjt:  CLLYEIGKGSSSWWFTYFKHLPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKAWLWASAT-------------

Query:  GCFI-----------------------FFTTCFLSGNMTTDELHEEQRDTQWALTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLL
        GC                         F     L+GN+TTD LH+E++DTQ ALTDGGFEENVSAYCFYARESYK+G+QVLLSYGTYTNLELL+YYGFLL
Subjt:  GCFI-----------------------FFTTCFLSGNMTTDELHEEQRDTQWALTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLL

Query:  QENPNDK-FLFLWNMSFILPVLGPRSLFIF-------------------------------------IKMEIHLLLYFQLCDYG--QPARTSVEALDTLL
        QENPND+ F+ L +  +        SLFI                                      +K E+ ++ +     +       TSVE  + LL
Subjt:  QENPNDK-FLFLWNMSFILPVLGPRSLFIF-------------------------------------IKMEIHLLLYFQLCDYG--QPARTSVEALDTLL

Query:  -----MPDLQIPRELGKILSTYGGEFSAFLETNG
             + DLQ+PRELGK+ ST GGEF AFLETNG
Subjt:  -----MPDLQIPRELGKILSTYGGEFSAFLETNG

XP_022143354.1 protein SET DOMAIN GROUP 40 isoform X1 [Momordica charantia]5.2e-12860.86Show/hide
Query:  METEGSLESLLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTF
        METEGS E+LLRWAAD GISDSV+K+SS+SCLGRSLCVS FPDAGGRGLGAVRNLN G+LVLRVPKSVL TTQSLLLE+EKLSMALKRYPSLSSTQKLTF
Subjt:  METEGSLESLLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTF

Query:  CLLYEIGKGSSSWWFTYFKHLPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKAWLWASAT-------------
        CLLYEIG+GS+SWWF YFKHLP SYD+LATFGEFEKQALQVDYAIWATEKAALKSHTEW  VKGLME+SN K Q++TFKAWLWASAT             
Subjt:  CLLYEIGKGSSSWWFTYFKHLPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKAWLWASAT-------------

Query:  GCFI-----------------------FFTTCFLSGNMTTDELHEEQRDTQWALTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLL
        GC                         F       G+MTTDELHEEQ DTQ ALTDGGF+E VSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGF+L
Subjt:  GCFI-----------------------FFTTCFLSGNMTTDELHEEQRDTQWALTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLL

Query:  QENPNDKF----------------------------------LFLW-----------NMSFI---LPVLGPRSLFIFIKMEIHLLLYFQLCDYGQPARTS
        QENPND+                                   L LW           ++++    L V    S+  ++    H +L            TS
Subjt:  QENPNDKF----------------------------------LFLW-----------NMSFI---LPVLGPRSLFIFIKMEIHLLLYFQLCDYGQPARTS

Query:  VEALDTLL-----MPDLQIPRELGKILSTYGGEFSAFLETNG
        VE    LL     + DLQIPR LGK+LSTYGGEF AFL+TNG
Subjt:  VEALDTLL-----MPDLQIPRELGKILSTYGGEFSAFLETNG

XP_022983189.1 protein SET DOMAIN GROUP 40 isoform X1 [Cucurbita maxima]1.4e-12560.37Show/hide
Query:  METEGSLESLLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTF
        M TEGS ESLLRWAADHGISDSV+KQSSHSCLGRSLCV FFPDAGGRGLGAVR+L KG+LVL+VPKSVLLTTQSL L+DEKLSMALKRYPSLSSTQKLTF
Subjt:  METEGSLESLLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTF

Query:  CLLYEIGKGSSSWWFTYFKHLPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKAWLWASAT-------------
        CLLYEIGKGSSSWWF YFKHLP +Y+ LATFGEFEKQALQVDYA+W  EKAA KSHTEW GVKGLMEESN K+Q++TFKAWLWASAT             
Subjt:  CLLYEIGKGSSSWWFTYFKHLPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKAWLWASAT-------------

Query:  GCFI-----------------------FFTTCFLSGNMTTDELHEEQRDTQWALTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLL
        GC                         F     L+GN+TTD LH+E++DTQ ALTDGGFEENVSAYCFYARESYK+G+QVLLSYGTY+NLELL+YYGFLL
Subjt:  GCFI-----------------------FFTTCFLSGNMTTDELHEEQRDTQWALTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLL

Query:  QENPNDK-FLFLWNMSFILPVLGPRSLFIF-------------------------------------IKMEIHLLLYFQLCDYG--QPARTSVEALDTLL
        QENPND+ F+ L +  +        SLFI                                      +K E+ ++ +     +       TSVE  + LL
Subjt:  QENPNDK-FLFLWNMSFILPVLGPRSLFIF-------------------------------------IKMEIHLLLYFQLCDYG--QPARTSVEALDTLL

Query:  -----MPDLQIPRELGKILSTYGGEFSAFLETNG
             + DLQ P ELGK+L T GGEF AFLET G
Subjt:  -----MPDLQIPRELGKILSTYGGEFSAFLETNG

XP_038896047.1 protein SET DOMAIN GROUP 40 [Benincasa hispida]9.7e-12761.29Show/hide
Query:  METEGSLESLLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTF
        M TE S  SLLRWAADHGISDSV++Q+SHSCLG SLCV FFPDAGGRGLGAVR LNKG+LVLRVPKSVL TTQSL LEDEKL+ ALKRYPSLSSTQKLTF
Subjt:  METEGSLESLLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTF

Query:  CLLYEIGKGSSSWWFTYFKHLPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKAWLWASAT-------------
        CLLYEIGKG+SSWW  Y KHLP SYD+LATFGEFEKQALQVDY IWATEKAALKS  EW GVKGLMEE N K+Q++TFKAWLWASAT             
Subjt:  CLLYEIGKGSSSWWFTYFKHLPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKAWLWASAT-------------

Query:  GCF-----------------------IFFTTCFLSGNMTTDELHEEQRDTQWALTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLL
        GC                         F     L+G++TTDELHEEQRDTQWALTDGGFEE+VSAYCFYARESYKKG+QVLLSYGTY+NLELLEYYGFLL
Subjt:  GCF-----------------------IFFTTCFLSGNMTTDELHEEQRDTQWALTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLL

Query:  QENPNDKFLF-----LWNM------SFILPVLGPRSLFIFIKMEI------------HL------------LLYFQLCDYG-----QPARTSVEALDTLL
        QENPNDK        ++N       S  +   G  S  +   + +            HL            +L  QL             TSVE  + LL
Subjt:  QENPNDKFLF-----LWNM------SFILPVLGPRSLFIFIKMEI------------HL------------LLYFQLCDYG-----QPARTSVEALDTLL

Query:  -----MPDLQIPRELGKILSTYGGEFSAFLETNG
             + DLQ+PREL K+L TYGGEFSAFLETNG
Subjt:  -----MPDLQIPRELGKILSTYGGEFSAFLETNG

TrEMBL top hitse value%identityAlignment
A0A0A0L7L4 SET domain-containing protein1.6e-12260.32Show/hide
Query:  METEGSLESLLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTF
        METEGSL SLLRWAADHGISDSV++ +SHSCLG SLCVSFFPD GGRGL AVR L KG+LVLR PKS+LLTTQSL LEDEKL MALKRYPSLSSTQKLTF
Subjt:  METEGSLESLLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTF

Query:  CLLYEIGKGSSSWWFTYFKHLPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKAWLWASAT-------------
        CLLYEI KG SSWWF Y KHLP SYD+LATFGEFEKQALQVDYAIWATEKAALKS T+W GV+GLM+ESN KSQ++TFKAWLWASAT             
Subjt:  CLLYEIGKGSSSWWFTYFKHLPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKAWLWASAT-------------

Query:  GCF------------------IFFTTCFLSGNMTTDELH--EEQRDTQWALTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLLQEN
        GC                         F S     DEL   EEQRD+QWALTDGGFEEN SAYCFYARESY+KG+QVLLSYGTYTNLELLEYYGFLLQEN
Subjt:  GCF------------------IFFTTCFLSGNMTTDELH--EEQRDTQWALTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLLQEN

Query:  PNDKFLF----------LW-----------NMSFIL-----------------PVLGPRSLFIFIKMEIHLLLYFQLCDYG--QPARTSVEALDTLL---
        PNDK              W           N SF L                   L      + +K EI ++ +     +       TS+E  + LL   
Subjt:  PNDKFLF----------LW-----------NMSFIL-----------------PVLGPRSLFIFIKMEIHLLLYFQLCDYG--QPARTSVEALDTLL---

Query:  --MPDLQIPRELGKILSTYGGEFSAFLETNG
          + DLQ+PREL K L TYGGEF AFLETNG
Subjt:  --MPDLQIPRELGKILSTYGGEFSAFLETNG

A0A5D3BQD3 Protein SET DOMAIN GROUP 40 isoform X29.9e-11757.77Show/hide
Query:  METEGSLESLLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTF
        METEGS  SLLRWAADHGISDS+++ +S SCLGRSLCVSFFPD+GGRGL AVR LNKG+L+LR PKSVLLTTQSL LEDEKL+MALK +PSLSSTQKLTF
Subjt:  METEGSLESLLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTF

Query:  CLLYEIGKGSSSWWFTYFKHLPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKAWLWASAT-------------
        CLL EI KG+SS WF Y KHLP SYD+LATFGEFEKQALQVDYAIWATEKAALKS  +W GVKGLM+ESN K+Q++TFKAWLWASAT             
Subjt:  CLLYEIGKGSSSWWFTYFKHLPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKAWLWASAT-------------

Query:  GCF------------------IFFTTCFLSGNMTTDELH--EEQRDTQWALTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLLQEN
        GC                         F S     DEL   EEQRD+QW LTDGGFEEN SAYCFYARESYKKG+QVLLSYGTYTN+ELLEYYGFLLQEN
Subjt:  GCF------------------IFFTTCFLSGNMTTDELH--EEQRDTQWALTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLLQEN

Query:  PNDK-FLFLWNMSFILPVLGPRSLFIF-------------------------------------IKMEIHLLLYFQLCDYG--QPARTSVEALDTLL---
        PNDK F+ + +  ++       SL+I                                      +K E  ++ +     +       TS+E  D LL   
Subjt:  PNDK-FLFLWNMSFILPVLGPRSLFIF-------------------------------------IKMEIHLLLYFQLCDYG--QPARTSVEALDTLL---

Query:  --MPDLQIPRELGKILSTYGGEFSAFLETNG
          + DLQ+ REL K+L TYGGE  AFLETNG
Subjt:  --MPDLQIPRELGKILSTYGGEFSAFLETNG

A0A6J1CP24 protein SET DOMAIN GROUP 40 isoform X12.5e-12860.86Show/hide
Query:  METEGSLESLLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTF
        METEGS E+LLRWAAD GISDSV+K+SS+SCLGRSLCVS FPDAGGRGLGAVRNLN G+LVLRVPKSVL TTQSLLLE+EKLSMALKRYPSLSSTQKLTF
Subjt:  METEGSLESLLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTF

Query:  CLLYEIGKGSSSWWFTYFKHLPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKAWLWASAT-------------
        CLLYEIG+GS+SWWF YFKHLP SYD+LATFGEFEKQALQVDYAIWATEKAALKSHTEW  VKGLME+SN K Q++TFKAWLWASAT             
Subjt:  CLLYEIGKGSSSWWFTYFKHLPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKAWLWASAT-------------

Query:  GCFI-----------------------FFTTCFLSGNMTTDELHEEQRDTQWALTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLL
        GC                         F       G+MTTDELHEEQ DTQ ALTDGGF+E VSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGF+L
Subjt:  GCFI-----------------------FFTTCFLSGNMTTDELHEEQRDTQWALTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLL

Query:  QENPNDKF----------------------------------LFLW-----------NMSFI---LPVLGPRSLFIFIKMEIHLLLYFQLCDYGQPARTS
        QENPND+                                   L LW           ++++    L V    S+  ++    H +L            TS
Subjt:  QENPNDKF----------------------------------LFLW-----------NMSFI---LPVLGPRSLFIFIKMEIHLLLYFQLCDYGQPARTS

Query:  VEALDTLL-----MPDLQIPRELGKILSTYGGEFSAFLETNG
        VE    LL     + DLQIPR LGK+LSTYGGEF AFL+TNG
Subjt:  VEALDTLL-----MPDLQIPRELGKILSTYGGEFSAFLETNG

A0A6J1F4A7 protein SET DOMAIN GROUP 40 isoform X14.9e-12459.68Show/hide
Query:  METEGSLESLLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTF
        M  E S ESLLRWAADHGISDSV+KQ SHSCLGRSLCV FFPDAGGRGLGAVR+L KG+LVL+VPKSVLLTTQSL L+DEKLSMALKRYPSLSSTQKLTF
Subjt:  METEGSLESLLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTF

Query:  CLLYEIGKGSSSWWFTYFKHLPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKAWLWASAT-------------
        CLLYEIGKGSSSWWF YFKHLP +Y+ LATFGEFEKQALQVDYA+W  EKAA KS  EW GVKGLMEESN K+Q++TFKAWLWASAT             
Subjt:  CLLYEIGKGSSSWWFTYFKHLPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKAWLWASAT-------------

Query:  GCFI-----------------------FFTTCFLSGNMTTDELHEEQRDTQWALTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLL
        GC                         F     L+GN+TTD LH+E++DTQ ALTDGGFEENVSAYCFYARESYK+G+QVLLSYGTYTNLELL+YYGFLL
Subjt:  GCFI-----------------------FFTTCFLSGNMTTDELHEEQRDTQWALTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLL

Query:  QENPNDK-FLFLWNMSFILPVLGPRSLFIF-------------------------------------IKMEIHLLLYFQLCDYG--QPARTSVEALDTLL
        QENPND+ F+ L +  +        SLFI                                      +K E+ ++ +     +       TSVE  + LL
Subjt:  QENPNDK-FLFLWNMSFILPVLGPRSLFIF-------------------------------------IKMEIHLLLYFQLCDYG--QPARTSVEALDTLL

Query:  -----MPDLQIPRELGKILSTYGGEFSAFLETNG
             + DLQ+PRELGK+ ST  GEF AFLETNG
Subjt:  -----MPDLQIPRELGKILSTYGGEFSAFLETNG

A0A6J1J6L6 protein SET DOMAIN GROUP 40 isoform X16.8e-12660.37Show/hide
Query:  METEGSLESLLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTF
        M TEGS ESLLRWAADHGISDSV+KQSSHSCLGRSLCV FFPDAGGRGLGAVR+L KG+LVL+VPKSVLLTTQSL L+DEKLSMALKRYPSLSSTQKLTF
Subjt:  METEGSLESLLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTF

Query:  CLLYEIGKGSSSWWFTYFKHLPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKAWLWASAT-------------
        CLLYEIGKGSSSWWF YFKHLP +Y+ LATFGEFEKQALQVDYA+W  EKAA KSHTEW GVKGLMEESN K+Q++TFKAWLWASAT             
Subjt:  CLLYEIGKGSSSWWFTYFKHLPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKAWLWASAT-------------

Query:  GCFI-----------------------FFTTCFLSGNMTTDELHEEQRDTQWALTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLL
        GC                         F     L+GN+TTD LH+E++DTQ ALTDGGFEENVSAYCFYARESYK+G+QVLLSYGTY+NLELL+YYGFLL
Subjt:  GCFI-----------------------FFTTCFLSGNMTTDELHEEQRDTQWALTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLL

Query:  QENPNDK-FLFLWNMSFILPVLGPRSLFIF-------------------------------------IKMEIHLLLYFQLCDYG--QPARTSVEALDTLL
        QENPND+ F+ L +  +        SLFI                                      +K E+ ++ +     +       TSVE  + LL
Subjt:  QENPNDK-FLFLWNMSFILPVLGPRSLFIF-------------------------------------IKMEIHLLLYFQLCDYG--QPARTSVEALDTLL

Query:  -----MPDLQIPRELGKILSTYGGEFSAFLETNG
             + DLQ P ELGK+L T GGEF AFLET G
Subjt:  -----MPDLQIPRELGKILSTYGGEFSAFLETNG

SwissProt top hitse value%identityAlignment
B5FW36 Actin-histidine N-methyltransferase7.5e-0525.27Show/hide
Query:  EGSLESLLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLL
        E     L++WA+++G   SVE             V+F  +  G GL A R++   +L L VP+ +L+T +S          +  R         L F LL
Subjt:  EGSLESLLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLL

Query:  YEIGKGSSSWWFTYFKHLPHSYDMLATFGEFEKQALQVDYAI---WATEKAALKSHTEWVGVKGLMEESNSKSQIK---TFKAWLWASATGCFIFFTTCF
         E     +S+W  Y + LP  YD    F E E + LQ   AI   ++  K   + +  +  V      +N K  +K   T++ + WA ++          
Subjt:  YEIGKGSSSWWFTYFKHLPHSYDMLATFGEFEKQALQVDYAI---WATEKAALKSHTEWVGVKGLMEESNSKSQIK---TFKAWLWASATGCFIFFTTCF

Query:  LSGNMTT---DELHEEQRDTQWALTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLLQENPNDK
          G+  T     L +    T   +T G   E+    C  A + ++ G+Q+ + YGT +N E + + GF    N +D+
Subjt:  LSGNMTT---DELHEEQRDTQWALTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLLQENPNDK

B7ZUF3 Actin-histidine N-methyltransferase3.1e-0627.5Show/hide
Query:  FPDAGGRGLGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFTYFKHLPHSYDMLATFGEFEKQALQ
        FP+  G GL A R +   +L L VP+ +L+T +S          +  R         L F LL E     +S+W  Y K LP+ YD    F E E Q LQ
Subjt:  FPDAGGRGLGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFTYFKHLPHSYDMLATFGEFEKQALQ

Query:  VDYAI---WATEKAALKSHTEWVGVKGLMEESNSKSQIK---TFKAWLWASATGCFIFFTTCFLSGNMTT---DELHEEQRDTQWALTDGGFEENVSAYC
           AI   ++  K   + +  +  V      +N K  +K   TF  + WA ++            G+  T     L +    T   +T G   E+    C
Subjt:  VDYAI---WATEKAALKSHTEWVGVKGLMEESNSKSQIK---TFKAWLWASATGCFIFFTTCFLSGNMTT---DELHEEQRDTQWALTDGGFEENVSAYC

Query:  FYARESYKKGQQVLLSYGTYTNLELLEYYGFLLQENPNDK
          A + +K G+Q+ + YGT +N E + + GF  + N +D+
Subjt:  FYARESYKKGQQVLLSYGTYTNLELLEYYGFLLQENPNDK

Q5ZML9 Actin-histidine N-methyltransferase9.8e-0522.96Show/hide
Query:  LLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLLYEIGKG
        L++WA ++G S    + ++              +  G GL A R +   +L L VP+ +L+T +S          +  R         L F LL E    
Subjt:  LLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLLYEIGKG

Query:  SSSWWFTYFKHLPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIK-----TFKAWLWASATGCFIFFTTCFLSGNMTT
         +S+W  Y + LP  YD    F E E Q L+   AI         +  ++     +++   + S++      T+  + WA ++            G+  T
Subjt:  SSSWWFTYFKHLPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIK-----TFKAWLWASATGCFIFFTTCFLSGNMTT

Query:  ---DELHEEQRDTQWALTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLLQENPNDK
             L +    T   +T G   E+    C  A + +K G+Q+ + YGT +N E + + GF    N +D+
Subjt:  ---DELHEEQRDTQWALTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLLQENPNDK

Q6NQJ8 Protein SET DOMAIN GROUP 401.7e-7350.16Show/hide
Query:  SLESLLRWAADHGISDSVE-KQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLLY
        ++E+ LRWAA+ GISDS++  +   SCLG SL VS FPDAGGRGLGA R L KG+LVL+VP+  L+TT+S++ +D KLS A+  + SLSSTQ L+ CLLY
Subjt:  SLESLLRWAADHGISDSVE-KQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLLY

Query:  EIGKGSSSWWFTYFKHLPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKAWLWASAT-------------GCFI
        E+ K   S+W+ Y  H+P  YD+LATFG FEKQALQV+ A+WATEKA  K  +EW     LM+E   K + ++F+AWLWASAT             GC  
Subjt:  EIGKGSSSWWFTYFKHLPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKAWLWASAT-------------GCFI

Query:  ----FFTTCFLSGNMTTDELHEEQRDTQWA----------LTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLLQENPNDK-FLFLW
             F          T +  E   + + A          LTDGGFEE+V+AYC YAR +Y+ G+QVLL YGTYTNLELLE+YGF+L+EN NDK F+ L 
Subjt:  ----FFTTCFLSGNMTTDELHEEQRDTQWA----------LTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLLQENPNDK-FLFLW

Query:  NMSFILPVLGPR-SLFI
           F L    P+ SL+I
Subjt:  NMSFILPVLGPR-SLFI

Q7SXS7 Actin-histidine N-methyltransferase1.2e-0524.19Show/hide
Query:  EGSLESLLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLL
        E     L+ WAA          +   SC G    +S F D  G GL A +++   +L L +P+ +L+T +S   ++  L     +   L +   +T  L 
Subjt:  EGSLESLLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLL

Query:  YEIGKGS-SSWWFTYFKHLPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIK-----TFKAWLWASATGCFIFFTTCF
            + + SS W  Y K LP  YD    F E E + L    AI         +  ++     ++    + S++      TF  + WA ++          
Subjt:  YEIGKGS-SSWWFTYFKHLPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIK-----TFKAWLWASATGCFIFFTTCF

Query:  LSGNMTT---DELHEEQRDTQWALTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLLQENPNDK
          G+  T     L +    T   +T G   E+    C  A + YK+G+Q+ + YGT +N E + + GF  ++N +D+
Subjt:  LSGNMTT---DELHEEQRDTQWALTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLLQENPNDK

Arabidopsis top hitse value%identityAlignment
AT3G07670.1 Rubisco methyltransferase family protein1.6e-0525.71Show/hide
Query:  DAGGRGLGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFTYFKHLPHS-YDMLATFGEFEKQALQV
        D G RGL A +NL KG+ +L VP S++++  S     E     +KRY  +     L   L+ E     SS WF Y   LP   Y +L     + +  L +
Subjt:  DAGGRGLGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFTYFKHLPHS-YDMLATFGEFEKQALQV

Query:  DYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKAWLWASATGCFIFFTTCFLSGNMTTDELHEEQRDTQWAL--------------TDGGFEENV
                + A++  T  VG    +         + F   ++   T  + F       G + +  +     D ++AL              T   ++++ 
Subjt:  DYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKAWLWASATGCFIFFTTCFLSGNMTTDELHEEQRDTQWAL--------------TDGGFEENV

Query:  SAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLLQE--NPND
            F     Y+ G+QV +SYG  +N ELL  YGF+ +E  NP+D
Subjt:  SAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLLQE--NPND

AT5G14260.1 Rubisco methyltransferase family protein2.2e-0422.92Show/hide
Query:  LGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFTYFKHLPHSYDMLATFGEFEKQA------LQVD
        + A  +L KGD+   VP S+++T + +L    +    L     LS    L   L+YE  +G  S W+ Y + L    D     G+ + ++       ++D
Subjt:  LGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFTYFKHLPHSYDMLATFGEFEKQA------LQVD

Query:  YAIWATEKAALKSHTEWVGVKGLMEESNSKSQI-----KTFKAWLWASATGCFIF--FTTCFLSGNMTTDELHEEQRDTQWALTDGGFEENVSAYCFYAR
        Y   +  KA +    E     G+  E N    +       F+ + +   T  F F  F   F++       L       ++AL   G    + AYC   +
Subjt:  YAIWATEKAALKSHTEWVGVKGLMEESNSKSQI-----KTFKAWLWASATGCFIF--FTTCFLSGNMTTDELHEEQRDTQWALTDGGFEENVSAYCFYAR

Query:  ---------------ESYKKGQQVLLSYGTYTNLELLEYYGFLLQENPNDKFL
                         YK G  +++  G   N +LL  YGF+ ++NP D+ +
Subjt:  ---------------ESYKKGQQVLLSYGTYTNLELLEYYGFLLQENPNDKFL

AT5G14260.2 Rubisco methyltransferase family protein2.2e-0422.92Show/hide
Query:  LGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFTYFKHLPHSYDMLATFGEFEKQA------LQVD
        + A  +L KGD+   VP S+++T + +L    +    L     LS    L   L+YE  +G  S W+ Y + L    D     G+ + ++       ++D
Subjt:  LGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFTYFKHLPHSYDMLATFGEFEKQA------LQVD

Query:  YAIWATEKAALKSHTEWVGVKGLMEESNSKSQI-----KTFKAWLWASATGCFIF--FTTCFLSGNMTTDELHEEQRDTQWALTDGGFEENVSAYCFYAR
        Y   +  KA +    E     G+  E N    +       F+ + +   T  F F  F   F++       L       ++AL   G    + AYC   +
Subjt:  YAIWATEKAALKSHTEWVGVKGLMEESNSKSQI-----KTFKAWLWASATGCFIF--FTTCFLSGNMTTDELHEEQRDTQWALTDGGFEENVSAYCFYAR

Query:  ---------------ESYKKGQQVLLSYGTYTNLELLEYYGFLLQENPNDKFL
                         YK G  +++  G   N +LL  YGF+ ++NP D+ +
Subjt:  ---------------ESYKKGQQVLLSYGTYTNLELLEYYGFLLQENPNDKFL

AT5G14260.3 Rubisco methyltransferase family protein2.2e-0422.92Show/hide
Query:  LGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFTYFKHLPHSYDMLATFGEFEKQA------LQVD
        + A  +L KGD+   VP S+++T + +L    +    L     LS    L   L+YE  +G  S W+ Y + L    D     G+ + ++       ++D
Subjt:  LGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGSSSWWFTYFKHLPHSYDMLATFGEFEKQA------LQVD

Query:  YAIWATEKAALKSHTEWVGVKGLMEESNSKSQI-----KTFKAWLWASATGCFIF--FTTCFLSGNMTTDELHEEQRDTQWALTDGGFEENVSAYCFYAR
        Y   +  KA +    E     G+  E N    +       F+ + +   T  F F  F   F++       L       ++AL   G    + AYC   +
Subjt:  YAIWATEKAALKSHTEWVGVKGLMEESNSKSQI-----KTFKAWLWASATGCFIF--FTTCFLSGNMTTDELHEEQRDTQWALTDGGFEENVSAYCFYAR

Query:  ---------------ESYKKGQQVLLSYGTYTNLELLEYYGFLLQENPNDKFL
                         YK G  +++  G   N +LL  YGF+ ++NP D+ +
Subjt:  ---------------ESYKKGQQVLLSYGTYTNLELLEYYGFLLQENPNDKFL

AT5G17240.1 SET domain group 401.2e-7450.16Show/hide
Query:  SLESLLRWAADHGISDSVE-KQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLLY
        ++E+ LRWAA+ GISDS++  +   SCLG SL VS FPDAGGRGLGA R L KG+LVL+VP+  L+TT+S++ +D KLS A+  + SLSSTQ L+ CLLY
Subjt:  SLESLLRWAADHGISDSVE-KQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLLY

Query:  EIGKGSSSWWFTYFKHLPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKAWLWASAT-------------GCFI
        E+ K   S+W+ Y  H+P  YD+LATFG FEKQALQV+ A+WATEKA  K  +EW     LM+E   K + ++F+AWLWASAT             GC  
Subjt:  EIGKGSSSWWFTYFKHLPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKAWLWASAT-------------GCFI

Query:  ----FFTTCFLSGNMTTDELHEEQRDTQWA----------LTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLLQENPNDK-FLFLW
             F          T +  E   + + A          LTDGGFEE+V+AYC YAR +Y+ G+QVLL YGTYTNLELLE+YGF+L+EN NDK F+ L 
Subjt:  ----FFTTCFLSGNMTTDELHEEQRDTQWA----------LTDGGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLLQENPNDK-FLFLW

Query:  NMSFILPVLGPR-SLFI
           F L    P+ SL+I
Subjt:  NMSFILPVLGPR-SLFI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAACAGAAGGAAGTCTTGAAAGCCTGCTGAGATGGGCGGCGGATCATGGAATTTCAGATTCTGTCGAGAAACAGAGTTCACATTCTTGTCTAGGTCGTTCATTATG
TGTCTCTTTCTTCCCTGATGCTGGCGGGAGAGGTTTGGGGGCTGTTCGTAATCTTAACAAAGGAGATTTAGTACTAAGAGTTCCAAAATCTGTCTTGTTGACGACCCAGA
GTTTGTTGTTGGAAGATGAGAAGCTCTCCATGGCTCTGAAGAGATACCCATCTCTTTCTTCAACTCAGAAGTTGACCTTCTGTTTACTCTATGAGATTGGTAAAGGAAGC
AGTTCTTGGTGGTTCACTTACTTCAAGCACTTGCCCCACAGTTATGACATGCTGGCAACTTTTGGAGAATTTGAAAAGCAAGCGCTGCAGGTGGATTACGCCATCTGGGC
AACAGAAAAGGCTGCTTTGAAGTCTCATACAGAGTGGGTAGGAGTTAAAGGACTAATGGAAGAATCTAATAGTAAAAGCCAAATCAAAACATTCAAGGCATGGCTTTGGG
CCTCTGCAACTGGATGTTTCATCTTTTTCACTACATGCTTCTTGAGCGGAAACATGACTACTGATGAGTTACACGAAGAGCAACGAGATACTCAGTGGGCTTTGACCGAT
GGCGGATTTGAGGAAAATGTTTCTGCATACTGCTTCTATGCAAGGGAAAGTTATAAGAAGGGACAGCAGGTTCTTTTAAGCTATGGTACATACACAAACTTAGAGCTTCT
TGAATATTATGGGTTTCTTCTACAGGAAAATCCAAATGACAAGTTTTTATTCCTTTGGAACATGTCATTTATACTACCAGTTCTTGGCCCAAGGAGTCTCTTTATATTCA
TCAAAATGGAAATCCATCTTTTGCTCTACTTTCAGCTTTGCGATTATGGGCAACCCGCCCGAACAAGCGTAGAGGCGTTGGACACCTTGCTTATGCCGGATCTGCAGATA
CCAAGGGAGCTTGGGAAGATACTATCGACTTATGGAGGCGAGTTTTCTGCTTTCTTAGAGACCAATGGTGCAGAAAGCAATTTCCTATCTTCTAGAATGGGCTATGATGT
TTTGGCTCTCGGCCATCCATTGCCTCTACTCTTACCAGTACCACCAGGCTTCAACCCTTCAACCCTTTTAGACTTCTCTCTTTCAATGCACCCTTTTGGCAACTCCTTTT
CACCTCCATTCGCTGAAACTTCAGTCACTAATGGTGCAAACACCCTCTCAATCTCAAGTCTTATACAGTCGTACAATTTGAAGAGTTTCAAATTTGAGCTGATCAAGGAA
TCAATGGTGTCCCATGAAATGATCCACAGTAAAAGGGCAGAATTCACCTCTTATCTTGAGAACTTAGTGTTAGAGAATCCTGAAACCAACTCTTCTACAAGCATCAATGG
AACTCCTTCTTCACAGCTTCTGATGAGATTCAATAAGCAAAGCATTAAGCATAATATGAATGGCAGCTTCATATACAACACCATCGTAATGCATCCCATCCACTGTACAG
CGAACCCCACAATTCCAACTCAAACTGCCCTCATTGTATCGGTCATCTTCTTCCTCTTCTCCTCAGTGTTTAACATGGAGTTTATCAGCGTTGGCATTCCGATCCAGAAC
AAGTGCGGGGTTCTAATGGAGACAGAGCCCGTTAGAGGGCCTTCAGAACCCAGCTCTGGAGTCAATGGAAGCAGTGAAACAACCGAAGTTCCGAGAGATTCTAAGGAAAA
TCGAAAGTCTGATGCATTGATTGGAAGCATAAGGAGCCCAAATGAAATCCAATTTCATCCCAGTTTCGCTGATCAGAATTTGATAATTGCTATGCCGCTTGAACAGATCG
CCCCGGATGGCTTCCATTCGATCGGAATCCAACGTCAAATTCAGCATGTTGCTCCGCCGCAATCGAAGAGATCGACGCCCCTGTTCTACCGTCTGATTCGTCTTCCGATG
CTCGACAACTCACATTCATCTATGATCGTCGGTACACTCCGTGAAATCTGCGTCATTCCGGCGTACCTGTCGCCGACGGCACCGAGCAATTGGAGATACTCGACGACGAA
CACCATATTTATAGCTTTGGCGTGGGAAGTCAGTGGGAGAGAAGAAAAGGAGGGAAATCCGGGGAAGATTGGAGCTTTTAAAGAAAAAAAAAATTTTTTTTTTTCTTTTT
CTTTTTTTAAAGCTTTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAACAGAAGGAAGTCTTGAAAGCCTGCTGAGATGGGCGGCGGATCATGGAATTTCAGATTCTGTCGAGAAACAGAGTTCACATTCTTGTCTAGGTCGTTCATTATG
TGTCTCTTTCTTCCCTGATGCTGGCGGGAGAGGTTTGGGGGCTGTTCGTAATCTTAACAAAGGAGATTTAGTACTAAGAGTTCCAAAATCTGTCTTGTTGACGACCCAGA
GTTTGTTGTTGGAAGATGAGAAGCTCTCCATGGCTCTGAAGAGATACCCATCTCTTTCTTCAACTCAGAAGTTGACCTTCTGTTTACTCTATGAGATTGGTAAAGGAAGC
AGTTCTTGGTGGTTCACTTACTTCAAGCACTTGCCCCACAGTTATGACATGCTGGCAACTTTTGGAGAATTTGAAAAGCAAGCGCTGCAGGTGGATTACGCCATCTGGGC
AACAGAAAAGGCTGCTTTGAAGTCTCATACAGAGTGGGTAGGAGTTAAAGGACTAATGGAAGAATCTAATAGTAAAAGCCAAATCAAAACATTCAAGGCATGGCTTTGGG
CCTCTGCAACTGGATGTTTCATCTTTTTCACTACATGCTTCTTGAGCGGAAACATGACTACTGATGAGTTACACGAAGAGCAACGAGATACTCAGTGGGCTTTGACCGAT
GGCGGATTTGAGGAAAATGTTTCTGCATACTGCTTCTATGCAAGGGAAAGTTATAAGAAGGGACAGCAGGTTCTTTTAAGCTATGGTACATACACAAACTTAGAGCTTCT
TGAATATTATGGGTTTCTTCTACAGGAAAATCCAAATGACAAGTTTTTATTCCTTTGGAACATGTCATTTATACTACCAGTTCTTGGCCCAAGGAGTCTCTTTATATTCA
TCAAAATGGAAATCCATCTTTTGCTCTACTTTCAGCTTTGCGATTATGGGCAACCCGCCCGAACAAGCGTAGAGGCGTTGGACACCTTGCTTATGCCGGATCTGCAGATA
CCAAGGGAGCTTGGGAAGATACTATCGACTTATGGAGGCGAGTTTTCTGCTTTCTTAGAGACCAATGGTGCAGAAAGCAATTTCCTATCTTCTAGAATGGGCTATGATGT
TTTGGCTCTCGGCCATCCATTGCCTCTACTCTTACCAGTACCACCAGGCTTCAACCCTTCAACCCTTTTAGACTTCTCTCTTTCAATGCACCCTTTTGGCAACTCCTTTT
CACCTCCATTCGCTGAAACTTCAGTCACTAATGGTGCAAACACCCTCTCAATCTCAAGTCTTATACAGTCGTACAATTTGAAGAGTTTCAAATTTGAGCTGATCAAGGAA
TCAATGGTGTCCCATGAAATGATCCACAGTAAAAGGGCAGAATTCACCTCTTATCTTGAGAACTTAGTGTTAGAGAATCCTGAAACCAACTCTTCTACAAGCATCAATGG
AACTCCTTCTTCACAGCTTCTGATGAGATTCAATAAGCAAAGCATTAAGCATAATATGAATGGCAGCTTCATATACAACACCATCGTAATGCATCCCATCCACTGTACAG
CGAACCCCACAATTCCAACTCAAACTGCCCTCATTGTATCGGTCATCTTCTTCCTCTTCTCCTCAGTGTTTAACATGGAGTTTATCAGCGTTGGCATTCCGATCCAGAAC
AAGTGCGGGGTTCTAATGGAGACAGAGCCCGTTAGAGGGCCTTCAGAACCCAGCTCTGGAGTCAATGGAAGCAGTGAAACAACCGAAGTTCCGAGAGATTCTAAGGAAAA
TCGAAAGTCTGATGCATTGATTGGAAGCATAAGGAGCCCAAATGAAATCCAATTTCATCCCAGTTTCGCTGATCAGAATTTGATAATTGCTATGCCGCTTGAACAGATCG
CCCCGGATGGCTTCCATTCGATCGGAATCCAACGTCAAATTCAGCATGTTGCTCCGCCGCAATCGAAGAGATCGACGCCCCTGTTCTACCGTCTGATTCGTCTTCCGATG
CTCGACAACTCACATTCATCTATGATCGTCGGTACACTCCGTGAAATCTGCGTCATTCCGGCGTACCTGTCGCCGACGGCACCGAGCAATTGGAGATACTCGACGACGAA
CACCATATTTATAGCTTTGGCGTGGGAAGTCAGTGGGAGAGAAGAAAAGGAGGGAAATCCGGGGAAGATTGGAGCTTTTAAAGAAAAAAAAAATTTTTTTTTTTCTTTTT
CTTTTTTTAAAGCTTTTTAA
Protein sequenceShow/hide protein sequence
METEGSLESLLRWAADHGISDSVEKQSSHSCLGRSLCVSFFPDAGGRGLGAVRNLNKGDLVLRVPKSVLLTTQSLLLEDEKLSMALKRYPSLSSTQKLTFCLLYEIGKGS
SSWWFTYFKHLPHSYDMLATFGEFEKQALQVDYAIWATEKAALKSHTEWVGVKGLMEESNSKSQIKTFKAWLWASATGCFIFFTTCFLSGNMTTDELHEEQRDTQWALTD
GGFEENVSAYCFYARESYKKGQQVLLSYGTYTNLELLEYYGFLLQENPNDKFLFLWNMSFILPVLGPRSLFIFIKMEIHLLLYFQLCDYGQPARTSVEALDTLLMPDLQI
PRELGKILSTYGGEFSAFLETNGAESNFLSSRMGYDVLALGHPLPLLLPVPPGFNPSTLLDFSLSMHPFGNSFSPPFAETSVTNGANTLSISSLIQSYNLKSFKFELIKE
SMVSHEMIHSKRAEFTSYLENLVLENPETNSSTSINGTPSSQLLMRFNKQSIKHNMNGSFIYNTIVMHPIHCTANPTIPTQTALIVSVIFFLFSSVFNMEFISVGIPIQN
KCGVLMETEPVRGPSEPSSGVNGSSETTEVPRDSKENRKSDALIGSIRSPNEIQFHPSFADQNLIIAMPLEQIAPDGFHSIGIQRQIQHVAPPQSKRSTPLFYRLIRLPM
LDNSHSSMIVGTLREICVIPAYLSPTAPSNWRYSTTNTIFIALAWEVSGREEKEGNPGKIGAFKEKKNFFFSFSFFKAF