; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi05G003440 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi05G003440
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Descriptionprotein SET DOMAIN GROUP 40-like
Genome locationchr05:4354513..4360738
RNA-Seq ExpressionLsi05G003440
SyntenyLsi05G003440
Gene Ontology termsGO:0005509 - calcium ion binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR001214 - SET domain
IPR015353 - Rubisco LSMT, substrate-binding domain
IPR036464 - Rubisco LSMT, substrate-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7017936.1 Protein SET DOMAIN GROUP 40, partial [Cucurbita argyrosperma subsp. argyrosperma]9.0e-21976.02Show/hide
Query:  METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSST-----
        M TE SF SLLRWAADHGISD VD+Q SHSCLGRSLCVCFFPDAGGRGLGAVR L KGELVL+VPKSVLLTTQSLSL+DEKL+MALKRYPSLSST     
Subjt:  METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSST-----

Query:  ---------------------------------------QVDYAIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKAWLWASATWTVFLLTCEVEVA
                                               QVDYA+W  EKAA KSR EWRGVKGLM+ES IKNQLQTFKAWLWASAT             
Subjt:  ---------------------------------------QVDYAIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKAWLWASATWTVFLLTCEVEVA

Query:  CTKAFGSAQTRILGCSFKALYVPWDEAGCLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGDMTTDELHEEKRDTQWALTDGGFEENVSAYCFYARES
                       S +ALYVPWDEAGCLCPVGDLFNYAAPE ES D+MDVSSFS HASLNG++TTD LH+E++DTQ ALTDGGFEENVSAYCFYARES
Subjt:  CTKAFGSAQTRILGCSFKALYVPWDEAGCLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGDMTTDELHEEKRDTQWALTDGGFEENVSAYCFYARES

Query:  YKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFIPMEHDIYSSSSWPKESLYVHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEIL
        YK+GEQVLLSYGTY+NLELL+YYGFLLQENPND+VFIP+EHDIYSSSSWPKESL++HQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLS+KNE+L
Subjt:  YKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFIPMEHDIYSSSSWPKESLYVHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEIL

Query:  VMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKMLLTYGGEFCAFLETNGLVNRNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDC
        VMQWLSKNCH VLNNLPTSVEEDNQLLCNICKIQDLQVPREL KM  T GGEFCAFLETNGLVNR E EL L+GKIKRSLERWKLAVQWR+LYKKALVDC
Subjt:  VMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKMLLTYGGEFCAFLETNGLVNRNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDC

Query:  ISYCTRTICSLSS
        ISYCTRT CSLSS
Subjt:  ISYCTRTICSLSS

XP_008457031.1 PREDICTED: protein SET DOMAIN GROUP 40 isoform X3 [Cucumis melo]3.7e-22081.65Show/hide
Query:  METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGG-----RGLGAVRQLNKGELVLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSST
        METEGSFGSLLRWAADHGISD +DQ TS SCLGRSLCV FFPD+GG     RGL AVRQLNKGEL+LR PKSVLLTTQSLSLEDEKLAMALK +PSLSST
Subjt:  METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGG-----RGLGAVRQLNKGELVLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSST

Query:  QVDYAIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKAWLWASATWTVFLLTCEVEVACTKAFGSAQTRILGCSFKALYVPWDEAGCLCPVGDLFNY
        QVDYAIWATEKAALKSR +WRGVKGLMQESNIKNQLQTFKAWLWASAT                            S + LYVPWDEAGCLCPVGDLFNY
Subjt:  QVDYAIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKAWLWASATWTVFLLTCEVEVACTKAFGSAQTRILGCSFKALYVPWDEAGCLCPVGDLFNY

Query:  AAPEGESLDVMDVSSFSPHASLNGDMTTDELHEEKRDTQWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFIPM
        AAPEGES + MDV SF  HASLN ++   E  EE+RD+QW LTDGGFEEN SAYCFYARESYKKGEQVLLSYGTY+N+ELLEYYGFLLQENPNDKVFIP+
Subjt:  AAPEGESLDVMDVSSFSPHASLNGDMTTDELHEEKRDTQWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFIPM

Query:  EHDIYSSSSWPKESLYVHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVP
        EHDIY SSSWPKESLY+HQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLS+KNE LVMQWLSKNCHTVLNNLPTS+EED+QLLCNI K+QDLQV 
Subjt:  EHDIYSSSSWPKESLYVHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVP

Query:  RELQKMLLTYGGEFCAFLETNGLVNRNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTICSLSS
        REL+KMLLTYGGE CAFLETNG+VNR+EAE HLS K+KRSLERWKLAVQWRLLYKKALVDCI YCTRTICSLSS
Subjt:  RELQKMLLTYGGEFCAFLETNGLVNRNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTICSLSS

XP_022983189.1 protein SET DOMAIN GROUP 40 isoform X1 [Cucurbita maxima]1.9e-22176.61Show/hide
Query:  METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSST-----
        M TEGSF SLLRWAADHGISD VD+Q+SHSCLGRSLCVCFFPDAGGRGLGAVR L KGELVL+VPKSVLLTTQSLSL+DEKL+MALKRYPSLSST     
Subjt:  METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSST-----

Query:  ---------------------------------------QVDYAIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKAWLWASATWTVFLLTCEVEVA
                                               QVDYA+W  EKAA KS TEWRGVKGLM+ESNIKNQLQTFKAWLWASAT             
Subjt:  ---------------------------------------QVDYAIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKAWLWASATWTVFLLTCEVEVA

Query:  CTKAFGSAQTRILGCSFKALYVPWDEAGCLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGDMTTDELHEEKRDTQWALTDGGFEENVSAYCFYARES
                       S +ALYVPWDEAGCLCPVGDLFNYAAPEGESLD+MDVSSFS HASLNG++TTD LH+E++DTQ ALTDGGFEENVSAYCFYARES
Subjt:  CTKAFGSAQTRILGCSFKALYVPWDEAGCLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGDMTTDELHEEKRDTQWALTDGGFEENVSAYCFYARES

Query:  YKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFIPMEHDIYSSSSWPKESLYVHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEIL
        YK+GEQVLLSYGTYSNLELL+YYGFLLQENPND+VFIP+EH+IYSSSSWPKESL++HQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLS+KNE+L
Subjt:  YKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFIPMEHDIYSSSSWPKESLYVHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEIL

Query:  VMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKMLLTYGGEFCAFLETNGLVNRNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDC
        VMQWLSKNCH VLNNLPTSVEEDNQLLCNICKIQDLQ P EL KMLLT GGEFCAFLET GLVNR E ELHL+GKIKRSLERWKLAVQWR+LYKKALVDC
Subjt:  VMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKMLLTYGGEFCAFLETNGLVNRNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDC

Query:  ISYCTRTICSLSS
         SYCTRT CSLSS
Subjt:  ISYCTRTICSLSS

XP_023528315.1 protein SET DOMAIN GROUP 40 [Cucurbita pepo subsp. pepo]2.6e-21875.63Show/hide
Query:  METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSST-----
        M TE SF SLLRWAADHGISD  D+Q+SHSCLGRSLCVCFFPDAGGRGLGAVR L KGELVL+VPKSVLLT QSLSL+DEKL+ ALKRYPSLSST     
Subjt:  METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSST-----

Query:  ---------------------------------------QVDYAIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKAWLWASATWTVFLLTCEVEVA
                                               QVDYA+W  EKAA KSR EWRGVKGLM+ESNIKNQLQTFKAWLWASAT             
Subjt:  ---------------------------------------QVDYAIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKAWLWASATWTVFLLTCEVEVA

Query:  CTKAFGSAQTRILGCSFKALYVPWDEAGCLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGDMTTDELHEEKRDTQWALTDGGFEENVSAYCFYARES
                       S +ALYVPWDEAGCLCPVGDLFNYAAPE ES D++DVSSFS HASLNG++TTD LH++++DTQ ALTDGGFEENVSAYCFYARES
Subjt:  CTKAFGSAQTRILGCSFKALYVPWDEAGCLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGDMTTDELHEEKRDTQWALTDGGFEENVSAYCFYARES

Query:  YKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFIPMEHDIYSSSSWPKESLYVHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEIL
        YK+GEQVLLSYGTYSNLELL+YYGFLLQENPND+VFIP+EHDIYSSSSWPKESL++HQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLS+KNE+L
Subjt:  YKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFIPMEHDIYSSSSWPKESLYVHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEIL

Query:  VMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKMLLTYGGEFCAFLETNGLVNRNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDC
        VMQWLSKNCH VLNNLPTSVEEDNQLLCNICKIQDLQVPREL KML T GGEFCAFLETNGLVNR E EL L+GKIKRSLERWKLAVQWR+LYKKALVDC
Subjt:  VMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKMLLTYGGEFCAFLETNGLVNRNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDC

Query:  ISYCTRTICSLSS
        ISYCTRT CSLSS
Subjt:  ISYCTRTICSLSS

XP_038896047.1 protein SET DOMAIN GROUP 40 [Benincasa hispida]4.8e-22879.73Show/hide
Query:  METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSST-----
        M TE SFGSLLRWAADHGISD VDQQTSHSCLG SLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVL TTQSLSLEDEKLA ALKRYPSLSST     
Subjt:  METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSST-----

Query:  ---------------------------------------QVDYAIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKAWLWASATWTVFLLTCEVEVA
                                               QVDY IWATEKAALKS  EWRGVKGLM+E NIKNQLQTFKAWLWASAT             
Subjt:  ---------------------------------------QVDYAIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKAWLWASATWTVFLLTCEVEVA

Query:  CTKAFGSAQTRILGCSFKALYVPWDEAGCLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGDMTTDELHEEKRDTQWALTDGGFEENVSAYCFYARES
                       S +ALYVPWDEAGCLCPVGDLFNYAAPEGES+D  DVS FSPHASLNGD+TTDELHEE+RDTQWALTDGGFEE+VSAYCFYARES
Subjt:  CTKAFGSAQTRILGCSFKALYVPWDEAGCLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGDMTTDELHEEKRDTQWALTDGGFEENVSAYCFYARES

Query:  YKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFIPMEHDIYSSSSWPKESLYVHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEIL
        YKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFIP+EHDIY+SSSWPKESLY+HQNGNPSF+LLSALRLWATHPNKRRGVGHLAY+GSQLS+KNEIL
Subjt:  YKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFIPMEHDIYSSSSWPKESLYVHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEIL

Query:  VMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKMLLTYGGEFCAFLETNGLVNRNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDC
        VMQ LSKNC TVLNNLPTSVEEDNQLLCNICKIQDLQVPREL+KMLLTYGGEF AFLETNG+VNR+EAELHLSGKIKRSLERWKLAVQWRLLYKKALVDC
Subjt:  VMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKMLLTYGGEFCAFLETNGLVNRNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDC

Query:  ISYCTRTICSLSS
        ISYCTRTICSLSS
Subjt:  ISYCTRTICSLSS

TrEMBL top hitse value%identityAlignment
A0A1S3C4J5 protein SET DOMAIN GROUP 40 isoform X25.9e-21675.44Show/hide
Query:  METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSST-----
        METEGSFGSLLRWAADHGISD +DQ TS SCLGRSLCV FFPD+GGRGL AVRQLNKGEL+LR PKSVLLTTQSLSLEDEKLAMALK +PSLSST     
Subjt:  METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSST-----

Query:  ---------------------------------------QVDYAIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKAWLWASATWTVFLLTCEVEVA
                                               QVDYAIWATEKAALKSR +WRGVKGLMQESNIKNQLQTFKAWLWASAT             
Subjt:  ---------------------------------------QVDYAIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKAWLWASATWTVFLLTCEVEVA

Query:  CTKAFGSAQTRILGCSFKALYVPWDEAGCLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGDMTTDELHEEKRDTQWALTDGGFEENVSAYCFYARES
                       S + LYVPWDEAGCLCPVGDLFNYAAPEGES + MDV SF  HASLN ++   E  EE+RD+QW LTDGGFEEN SAYCFYARES
Subjt:  CTKAFGSAQTRILGCSFKALYVPWDEAGCLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGDMTTDELHEEKRDTQWALTDGGFEENVSAYCFYARES

Query:  YKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFIPMEHDIYSSSSWPKESLYVHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEIL
        YKKGEQVLLSYGTY+N+ELLEYYGFLLQENPNDKVFIP+EHDIY SSSWPKESLY+HQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLS+KNE L
Subjt:  YKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFIPMEHDIYSSSSWPKESLYVHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEIL

Query:  VMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKMLLTYGGEFCAFLETNGLVNRNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDC
        VMQWLSKNCHTVLNNLPTS+EED+QLLCNI K+QDLQV REL+KMLLTYGGE CAFLETNG+VNR+EAE HLS K+KRSLERWKLAVQWRLLYKKALVDC
Subjt:  VMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKMLLTYGGEFCAFLETNGLVNRNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDC

Query:  ISYCTRTICSLSS
        I YCTRTICSLSS
Subjt:  ISYCTRTICSLSS

A0A1S3C590 protein SET DOMAIN GROUP 40 isoform X31.8e-22081.65Show/hide
Query:  METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGG-----RGLGAVRQLNKGELVLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSST
        METEGSFGSLLRWAADHGISD +DQ TS SCLGRSLCV FFPD+GG     RGL AVRQLNKGEL+LR PKSVLLTTQSLSLEDEKLAMALK +PSLSST
Subjt:  METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGG-----RGLGAVRQLNKGELVLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSST

Query:  QVDYAIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKAWLWASATWTVFLLTCEVEVACTKAFGSAQTRILGCSFKALYVPWDEAGCLCPVGDLFNY
        QVDYAIWATEKAALKSR +WRGVKGLMQESNIKNQLQTFKAWLWASAT                            S + LYVPWDEAGCLCPVGDLFNY
Subjt:  QVDYAIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKAWLWASATWTVFLLTCEVEVACTKAFGSAQTRILGCSFKALYVPWDEAGCLCPVGDLFNY

Query:  AAPEGESLDVMDVSSFSPHASLNGDMTTDELHEEKRDTQWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFIPM
        AAPEGES + MDV SF  HASLN ++   E  EE+RD+QW LTDGGFEEN SAYCFYARESYKKGEQVLLSYGTY+N+ELLEYYGFLLQENPNDKVFIP+
Subjt:  AAPEGESLDVMDVSSFSPHASLNGDMTTDELHEEKRDTQWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFIPM

Query:  EHDIYSSSSWPKESLYVHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVP
        EHDIY SSSWPKESLY+HQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLS+KNE LVMQWLSKNCHTVLNNLPTS+EED+QLLCNI K+QDLQV 
Subjt:  EHDIYSSSSWPKESLYVHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVP

Query:  RELQKMLLTYGGEFCAFLETNGLVNRNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTICSLSS
        REL+KMLLTYGGE CAFLETNG+VNR+EAE HLS K+KRSLERWKLAVQWRLLYKKALVDCI YCTRTICSLSS
Subjt:  RELQKMLLTYGGEFCAFLETNGLVNRNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTICSLSS

A0A5D3BQD3 Protein SET DOMAIN GROUP 40 isoform X25.9e-21675.44Show/hide
Query:  METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSST-----
        METEGSFGSLLRWAADHGISD +DQ TS SCLGRSLCV FFPD+GGRGL AVRQLNKGEL+LR PKSVLLTTQSLSLEDEKLAMALK +PSLSST     
Subjt:  METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSST-----

Query:  ---------------------------------------QVDYAIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKAWLWASATWTVFLLTCEVEVA
                                               QVDYAIWATEKAALKSR +WRGVKGLMQESNIKNQLQTFKAWLWASAT             
Subjt:  ---------------------------------------QVDYAIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKAWLWASATWTVFLLTCEVEVA

Query:  CTKAFGSAQTRILGCSFKALYVPWDEAGCLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGDMTTDELHEEKRDTQWALTDGGFEENVSAYCFYARES
                       S + LYVPWDEAGCLCPVGDLFNYAAPEGES + MDV SF  HASLN ++   E  EE+RD+QW LTDGGFEEN SAYCFYARES
Subjt:  CTKAFGSAQTRILGCSFKALYVPWDEAGCLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGDMTTDELHEEKRDTQWALTDGGFEENVSAYCFYARES

Query:  YKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFIPMEHDIYSSSSWPKESLYVHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEIL
        YKKGEQVLLSYGTY+N+ELLEYYGFLLQENPNDKVFIP+EHDIY SSSWPKESLY+HQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLS+KNE L
Subjt:  YKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFIPMEHDIYSSSSWPKESLYVHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEIL

Query:  VMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKMLLTYGGEFCAFLETNGLVNRNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDC
        VMQWLSKNCHTVLNNLPTS+EED+QLLCNI K+QDLQV REL+KMLLTYGGE CAFLETNG+VNR+EAE HLS K+KRSLERWKLAVQWRLLYKKALVDC
Subjt:  VMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKMLLTYGGEFCAFLETNGLVNRNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDC

Query:  ISYCTRTICSLSS
        I YCTRTICSLSS
Subjt:  ISYCTRTICSLSS

A0A6J1F4A7 protein SET DOMAIN GROUP 40 isoform X11.3e-21875.83Show/hide
Query:  METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSST-----
        M  E SF SLLRWAADHGISD VD+Q SHSCLGRSLCVCFFPDAGGRGLGAVR L KGELVL+VPKSVLLTTQSLSL+DEKL+MALKRYPSLSST     
Subjt:  METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSST-----

Query:  ---------------------------------------QVDYAIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKAWLWASATWTVFLLTCEVEVA
                                               QVDYA+W  EKAA KSR EWRGVKGLM+ESNIKNQLQTFKAWLWASAT             
Subjt:  ---------------------------------------QVDYAIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKAWLWASATWTVFLLTCEVEVA

Query:  CTKAFGSAQTRILGCSFKALYVPWDEAGCLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGDMTTDELHEEKRDTQWALTDGGFEENVSAYCFYARES
                       S +ALYVPWDEAGCLCPVGDLFNYAAPE ES D+MDVSSFS HASLNG++TTD LH+E++DTQ ALTDGGFEENVSAYCFYARES
Subjt:  CTKAFGSAQTRILGCSFKALYVPWDEAGCLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGDMTTDELHEEKRDTQWALTDGGFEENVSAYCFYARES

Query:  YKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFIPMEHDIYSSSSWPKESLYVHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEIL
        YK+GEQVLLSYGTY+NLELL+YYGFLLQENPND+VFIP+EHDIYSSSSWPKESL++HQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLS+KNE+L
Subjt:  YKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFIPMEHDIYSSSSWPKESLYVHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEIL

Query:  VMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKMLLTYGGEFCAFLETNGLVNRNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDC
        VMQWLSKNCH VLNNLPTSVEEDNQLLCNICKIQDLQVPREL KM  T  GEFCAFLETNGLVNR E EL L+GKIKRSLERWKLAVQWR+LYKKALVDC
Subjt:  VMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKMLLTYGGEFCAFLETNGLVNRNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDC

Query:  ISYCTRTICSLSS
        ISYCTRT CSLSS
Subjt:  ISYCTRTICSLSS

A0A6J1J6L6 protein SET DOMAIN GROUP 40 isoform X19.4e-22276.61Show/hide
Query:  METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSST-----
        M TEGSF SLLRWAADHGISD VD+Q+SHSCLGRSLCVCFFPDAGGRGLGAVR L KGELVL+VPKSVLLTTQSLSL+DEKL+MALKRYPSLSST     
Subjt:  METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSST-----

Query:  ---------------------------------------QVDYAIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKAWLWASATWTVFLLTCEVEVA
                                               QVDYA+W  EKAA KS TEWRGVKGLM+ESNIKNQLQTFKAWLWASAT             
Subjt:  ---------------------------------------QVDYAIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKAWLWASATWTVFLLTCEVEVA

Query:  CTKAFGSAQTRILGCSFKALYVPWDEAGCLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGDMTTDELHEEKRDTQWALTDGGFEENVSAYCFYARES
                       S +ALYVPWDEAGCLCPVGDLFNYAAPEGESLD+MDVSSFS HASLNG++TTD LH+E++DTQ ALTDGGFEENVSAYCFYARES
Subjt:  CTKAFGSAQTRILGCSFKALYVPWDEAGCLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGDMTTDELHEEKRDTQWALTDGGFEENVSAYCFYARES

Query:  YKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFIPMEHDIYSSSSWPKESLYVHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEIL
        YK+GEQVLLSYGTYSNLELL+YYGFLLQENPND+VFIP+EH+IYSSSSWPKESL++HQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLS+KNE+L
Subjt:  YKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFIPMEHDIYSSSSWPKESLYVHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEIL

Query:  VMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKMLLTYGGEFCAFLETNGLVNRNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDC
        VMQWLSKNCH VLNNLPTSVEEDNQLLCNICKIQDLQ P EL KMLLT GGEFCAFLET GLVNR E ELHL+GKIKRSLERWKLAVQWR+LYKKALVDC
Subjt:  VMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKMLLTYGGEFCAFLETNGLVNRNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDC

Query:  ISYCTRTICSLSS
         SYCTRT CSLSS
Subjt:  ISYCTRTICSLSS

SwissProt top hitse value%identityAlignment
B0VX69 Actin-histidine N-methyltransferase4.0e-0425.41Show/hide
Query:  TQWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFI-----------PMEHDIYSSSSWPKESLYVHQNGNP--S
        T   +T G   E+    C  A + ++ GEQ+ + YGT SN E + + GF    N +D+V I            M+ ++ + +  P  S++      P  S
Subjt:  TQWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFI-----------PMEHDIYSSSSWPKESLYVHQNGNP--S

Query:  FALLSALRLWA-------THPNKRRGVGHLAYAGSQ---LSIKNEILVMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQV
          LL+ LR++         H      +  +   G+    +S  NE+ +  +L      +L    T++EED  +L N    QDL V
Subjt:  FALLSALRLWA-------THPNKRRGVGHLAYAGSQ---LSIKNEILVMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQV

B7ZUF3 Actin-histidine N-methyltransferase4.3e-0626.44Show/hide
Query:  TQWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFI-----------PMEHDIYSSSSWPKESLY-VHQNGNP-S
        T   +T G   E+    C  A + +K GEQ+ + YGT SN E + + GF  + N +D+V I            M+ ++ + +  P  S++ +H    P S
Subjt:  TQWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFI-----------PMEHDIYSSSSWPKESLY-VHQNGNP-S

Query:  FALLSALRLWATHPNKRRG----------VGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNNLPTSVEEDNQLL
          LL+ LR++  + ++ +G          +  L  +   +S +NEI +  +L      +L    T+VE+DN++L
Subjt:  FALLSALRLWATHPNKRRG----------VGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNNLPTSVEEDNQLL

Q5ZML9 Actin-histidine N-methyltransferase3.1e-0425.29Show/hide
Query:  TQWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFI-----------PMEHDIYSSSSWPKESLYVHQNGNP--S
        T   +T G   E+    C  A + +K GEQ+ + YGT SN E + + GF    N +D+V I            M+ ++ + +  P  S++   +  P  S
Subjt:  TQWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFI-----------PMEHDIYSSSSWPKESLYVHQNGNP--S

Query:  FALLSALRLWATHPN--KRRGVGH--------LAYAGSQLSIKNEILVMQWLSKNCHTVLNNLPTSVEEDNQLL
          LL+ LR++  +    K   +G         L  +   +S  NE+ +  +L      +L    T+VE+D   L
Subjt:  FALLSALRLWATHPN--KRRGVGH--------LAYAGSQLSIKNEILVMQWLSKNCHTVLNNLPTSVEEDNQLL

Q6NQJ8 Protein SET DOMAIN GROUP 401.7e-11146.17Show/hide
Query:  SLLRWAADHGISDPVD-QQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSSTQV----------
        + LRWAA+ GISD +D  +   SCLG SL V  FPDAGGRGLGA R+L KGELVL+VP+  L+TT+S+  +D KL+ A+  + SLSSTQ+          
Subjt:  SLLRWAADHGISDPVD-QQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSSTQV----------

Query:  ----------------DY------------------AIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKAWLWASATWTVFLLTCEVEVACTKAFGS
                        DY                  A+WATEKA  K ++EW+    LM+E  +K + ++F+AWLWASAT                    
Subjt:  ----------------DY------------------AIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKAWLWASATWTVFLLTCEVEVACTKAFGS

Query:  AQTRILGCSFKALYVPWDEAGCLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGDMTTDELHEEKRDTQWALTDGGFEENVSAYCFYARESYKKGEQV
                S + L+VPWD AGCLCPVGDLFNY AP   S       S +   ++       E H E+      LTDGGFEE+V+AYC YAR +Y+ GEQV
Subjt:  AQTRILGCSFKALYVPWDEAGCLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGDMTTDELHEEKRDTQWALTDGGFEENVSAYCFYARESYKKGEQV

Query:  LLSYGTYSNLELLEYYGFLLQENPNDKVFIPMEHDIYS-SSSWPKESLYVHQNGNPSFALLSALRLWATHPNKR-RGVGHLAYAGSQLSIKNEILVMQWL
        LL YGTY+NLELLE+YGF+L+EN NDKVFIP+E  ++S +SSWPK+SLY+HQ+G  SFAL+S LRLW    ++R + V  L YAGSQ+S+KNEILVM+W+
Subjt:  LLSYGTYSNLELLEYYGFLLQENPNDKVFIPMEHDIYS-SSSWPKESLYVHQNGNPSFALLSALRLWATHPNKR-RGVGHLAYAGSQLSIKNEILVMQWL

Query:  SKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKMLLTYGGEFCAFLETNGLVN---RNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDCIS
        S+ C +VL +LPTSV ED  LL NI K+QD ++  E QK    +G E  AFL+ N L +    +   +  S K  R L +W+ +VQWRL YK+ L DCIS
Subjt:  SKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKMLLTYGGEFCAFLETNGLVN---RNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDCIS

Query:  YCTRTICSL
        YC   + +L
Subjt:  YCTRTICSL

Q7SXS7 Actin-histidine N-methyltransferase3.7e-0525.86Show/hide
Query:  TQWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFI-----------PMEHDIYSSSSWPKESLYVHQNGNP--S
        T   +T G   E+    C  A + YK+GEQ+ + YGT SN E + + GF  ++N +D+V I            M+ ++ + +  P  S++      P  S
Subjt:  TQWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFI-----------PMEHDIYSSSSWPKESLYVHQNGNP--S

Query:  FALLSALRLWATHPNKRRG----------VGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNNLPTSVEEDNQLL
          LL+ LR++     + R           +  L      +S +NEI +  +L      +L    T+ EED  +L
Subjt:  FALLSALRLWATHPNKRRG----------VGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNNLPTSVEEDNQLL

Arabidopsis top hitse value%identityAlignment
AT2G18850.1 SET domain-containing protein3.2e-0427.04Show/hide
Query:  GGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLLQ-ENPNDKVFIPMEHDIYSS---------------SSWPKESLYVHQNGNPSFALL
        G  +   S+  F       KGEQ  LSYG YS+  LL +YGFL + +NP D   IP++ D+                   +W   +  +   G P+  LL
Subjt:  GGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLLQ-ENPNDKVFIPMEHDIYSS---------------SSWPKESLYVHQNGNPSFALL

Query:  SALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNNL--PTSVEEDN
        + LR       K  G+ H +      +++ EI V++ L      ++ NL    S++ +N
Subjt:  SALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNNL--PTSVEEDN

AT5G17240.1 SET domain group 401.2e-11246.17Show/hide
Query:  SLLRWAADHGISDPVD-QQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSSTQV----------
        + LRWAA+ GISD +D  +   SCLG SL V  FPDAGGRGLGA R+L KGELVL+VP+  L+TT+S+  +D KL+ A+  + SLSSTQ+          
Subjt:  SLLRWAADHGISDPVD-QQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSSTQV----------

Query:  ----------------DY------------------AIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKAWLWASATWTVFLLTCEVEVACTKAFGS
                        DY                  A+WATEKA  K ++EW+    LM+E  +K + ++F+AWLWASAT                    
Subjt:  ----------------DY------------------AIWATEKAALKSRTEWRGVKGLMQESNIKNQLQTFKAWLWASATWTVFLLTCEVEVACTKAFGS

Query:  AQTRILGCSFKALYVPWDEAGCLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGDMTTDELHEEKRDTQWALTDGGFEENVSAYCFYARESYKKGEQV
                S + L+VPWD AGCLCPVGDLFNY AP   S       S +   ++       E H E+      LTDGGFEE+V+AYC YAR +Y+ GEQV
Subjt:  AQTRILGCSFKALYVPWDEAGCLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGDMTTDELHEEKRDTQWALTDGGFEENVSAYCFYARESYKKGEQV

Query:  LLSYGTYSNLELLEYYGFLLQENPNDKVFIPMEHDIYS-SSSWPKESLYVHQNGNPSFALLSALRLWATHPNKR-RGVGHLAYAGSQLSIKNEILVMQWL
        LL YGTY+NLELLE+YGF+L+EN NDKVFIP+E  ++S +SSWPK+SLY+HQ+G  SFAL+S LRLW    ++R + V  L YAGSQ+S+KNEILVM+W+
Subjt:  LLSYGTYSNLELLEYYGFLLQENPNDKVFIPMEHDIYS-SSSWPKESLYVHQNGNPSFALLSALRLWATHPNKR-RGVGHLAYAGSQLSIKNEILVMQWL

Query:  SKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKMLLTYGGEFCAFLETNGLVN---RNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDCIS
        S+ C +VL +LPTSV ED  LL NI K+QD ++  E QK    +G E  AFL+ N L +    +   +  S K  R L +W+ +VQWRL YK+ L DCIS
Subjt:  SKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKMLLTYGGEFCAFLETNGLVN---RNEAELHLSGKIKRSLERWKLAVQWRLLYKKALVDCIS

Query:  YCTRTICSL
        YC   + +L
Subjt:  YCTRTICSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAACTGAAGGAAGTTTTGGAAGCCTGCTGAGATGGGCGGCGGATCATGGAATTTCAGATCCTGTCGACCAACAGACTTCACATTCTTGTTTGGGTCGTTCTTTGTG
CGTCTGTTTCTTCCCTGATGCCGGCGGGAGAGGTTTAGGGGCTGTTCGTCAGCTTAACAAAGGAGAGTTAGTGCTGAGAGTTCCAAAATCTGTCTTGTTAACGACCCAAA
GTTTGTCGTTGGAAGATGAGAAGCTCGCCATGGCTCTGAAGAGATACCCATCTCTTTCTTCTACTCAGGTTGATTATGCTATCTGGGCAACAGAGAAGGCTGCTTTGAAA
TCTCGTACGGAGTGGAGAGGAGTCAAAGGACTAATGCAAGAGTCCAATATTAAAAACCAACTCCAAACATTCAAGGCATGGCTTTGGGCCTCTGCAACTTGGACTGTTTT
TCTTCTGACTTGTGAAGTGGAAGTTGCTTGCACAAAGGCTTTTGGATCAGCCCAGACAAGGATACTAGGATGTTCGTTTAAGGCATTGTATGTACCATGGGATGAGGCCG
GATGTTTATGTCCAGTTGGTGACTTGTTTAATTATGCTGCACCTGAAGGGGAGTCCCTTGATGTTATGGATGTTTCGTCTTTTTCACCACATGCTTCTTTGAATGGAGAC
ATGACTACTGATGAGTTACATGAAGAGAAAAGAGATACTCAATGGGCTTTGACAGATGGTGGATTTGAGGAAAATGTTTCTGCCTACTGCTTCTATGCTCGGGAAAGTTA
TAAGAAGGGAGAGCAGGTTCTTTTAAGCTATGGTACATACTCAAATTTAGAGCTTCTTGAATATTATGGGTTTCTTCTACAGGAAAATCCAAATGACAAAGTTTTCATTC
CTATGGAACATGACATTTATAGTTCCAGTTCTTGGCCCAAGGAGTCTCTTTATGTTCATCAAAATGGAAACCCATCTTTTGCTCTACTTTCTGCTCTACGATTATGGGCA
ACCCACCCGAACAAGCGCAGAGGTGTCGGGCACCTTGCTTATGCTGGGTCACAACTCTCCATCAAGAATGAAATATTAGTCATGCAGTGGTTATCCAAGAACTGCCATAC
TGTTCTAAACAATCTGCCAACATCAGTTGAAGAAGACAATCAGCTTCTGTGCAACATCTGCAAAATCCAGGATCTACAGGTACCAAGGGAGCTCCAGAAGATGCTGTTGA
CTTATGGAGGTGAATTTTGTGCTTTCTTGGAGACCAATGGTCTGGTGAATAGAAATGAAGCCGAGTTACATTTATCCGGGAAAATAAAACGATCTCTGGAGAGATGGAAG
CTAGCAGTCCAGTGGAGGCTCTTGTACAAGAAGGCTTTGGTTGATTGCATAAGTTACTGCACAAGAACTATTTGTTCTCTATCTTCTTAA
mRNA sequenceShow/hide mRNA sequence
AAAAAGTTATTTGGCATTTGAAAAAAAAGGGCCAATTGATATTTTGGACCCAAATAGCAGGTCATTCGTATAAATTTCCCAAATATTTTGAACTTAAAAGCCAAATCTAA
ATTTGACTCAAAGCTTAAGGGATGGAAACGTATTTTTCCCTCAAGGGGCAGAGGTTTTGATGGAGGGTTTGTGAAATGGAAACTGAAGGAAGTTTTGGAAGCCTGCTGAG
ATGGGCGGCGGATCATGGAATTTCAGATCCTGTCGACCAACAGACTTCACATTCTTGTTTGGGTCGTTCTTTGTGCGTCTGTTTCTTCCCTGATGCCGGCGGGAGAGGTT
TAGGGGCTGTTCGTCAGCTTAACAAAGGAGAGTTAGTGCTGAGAGTTCCAAAATCTGTCTTGTTAACGACCCAAAGTTTGTCGTTGGAAGATGAGAAGCTCGCCATGGCT
CTGAAGAGATACCCATCTCTTTCTTCTACTCAGGTTGATTATGCTATCTGGGCAACAGAGAAGGCTGCTTTGAAATCTCGTACGGAGTGGAGAGGAGTCAAAGGACTAAT
GCAAGAGTCCAATATTAAAAACCAACTCCAAACATTCAAGGCATGGCTTTGGGCCTCTGCAACTTGGACTGTTTTTCTTCTGACTTGTGAAGTGGAAGTTGCTTGCACAA
AGGCTTTTGGATCAGCCCAGACAAGGATACTAGGATGTTCGTTTAAGGCATTGTATGTACCATGGGATGAGGCCGGATGTTTATGTCCAGTTGGTGACTTGTTTAATTAT
GCTGCACCTGAAGGGGAGTCCCTTGATGTTATGGATGTTTCGTCTTTTTCACCACATGCTTCTTTGAATGGAGACATGACTACTGATGAGTTACATGAAGAGAAAAGAGA
TACTCAATGGGCTTTGACAGATGGTGGATTTGAGGAAAATGTTTCTGCCTACTGCTTCTATGCTCGGGAAAGTTATAAGAAGGGAGAGCAGGTTCTTTTAAGCTATGGTA
CATACTCAAATTTAGAGCTTCTTGAATATTATGGGTTTCTTCTACAGGAAAATCCAAATGACAAAGTTTTCATTCCTATGGAACATGACATTTATAGTTCCAGTTCTTGG
CCCAAGGAGTCTCTTTATGTTCATCAAAATGGAAACCCATCTTTTGCTCTACTTTCTGCTCTACGATTATGGGCAACCCACCCGAACAAGCGCAGAGGTGTCGGGCACCT
TGCTTATGCTGGGTCACAACTCTCCATCAAGAATGAAATATTAGTCATGCAGTGGTTATCCAAGAACTGCCATACTGTTCTAAACAATCTGCCAACATCAGTTGAAGAAG
ACAATCAGCTTCTGTGCAACATCTGCAAAATCCAGGATCTACAGGTACCAAGGGAGCTCCAGAAGATGCTGTTGACTTATGGAGGTGAATTTTGTGCTTTCTTGGAGACC
AATGGTCTGGTGAATAGAAATGAAGCCGAGTTACATTTATCCGGGAAAATAAAACGATCTCTGGAGAGATGGAAGCTAGCAGTCCAGTGGAGGCTCTTGTACAAGAAGGC
TTTGGTTGATTGCATAAGTTACTGCACAAGAACTATTTGTTCTCTATCTTCTTAATCTGGTTCAGGTTGTGATTAGCAGGTTCTAACTTAAGTTACCTATTAAATGAACT
TTTTAGGAATCAAAAGATAAGAGAATGGTATGAATATGATAGCATTGAACCATCTAGTAAGGATGGTGTTGGGAGAAGTTGTATATGTAATATCAGTATTTCATAACTCA
AAGAGAATCCCTTATCAAGCTCTTTACTCGATCTCTTTTAC
Protein sequenceShow/hide protein sequence
METEGSFGSLLRWAADHGISDPVDQQTSHSCLGRSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQSLSLEDEKLAMALKRYPSLSSTQVDYAIWATEKAALK
SRTEWRGVKGLMQESNIKNQLQTFKAWLWASATWTVFLLTCEVEVACTKAFGSAQTRILGCSFKALYVPWDEAGCLCPVGDLFNYAAPEGESLDVMDVSSFSPHASLNGD
MTTDELHEEKRDTQWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLLQENPNDKVFIPMEHDIYSSSSWPKESLYVHQNGNPSFALLSALRLWA
THPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNNLPTSVEEDNQLLCNICKIQDLQVPRELQKMLLTYGGEFCAFLETNGLVNRNEAELHLSGKIKRSLERWK
LAVQWRLLYKKALVDCISYCTRTICSLSS