; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G011370 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G011370
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Descriptionprotein SET DOMAIN GROUP 40-like
Genome locationCG_Chr05:13424047..13429131
RNA-Seq ExpressionClCG05G011370
SyntenyClCG05G011370
Gene Ontology termsGO:0005509 - calcium ion binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR001214 - SET domain
IPR015353 - Rubisco LSMT, substrate-binding domain
IPR036464 - Rubisco LSMT, substrate-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004145844.1 protein SET DOMAIN GROUP 40 isoform X4 [Cucumis sativus]1.5e-23470.47Show/hide
Query:  METEGSFGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTF
        METEGS GSLLRWAADHGISDSVDQ TSHSCLGHSLCV FFPD GGRGL AVRQL KGELVLR PKS+LLTTQ LSLEDEKL MAL  YPSLSSTQKLTF
Subjt:  METEGSFGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTF

Query:  CLLYEIGKGTSSWWFPYLKHLPQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQTFKAWLWASATPVCILKTPILSNI
        CLLYEI KG SSWWFPYLKHLPQSYDILATFG+FEKQALQVDYAIWATEKAALKSR +W GV+GLMQESNIK+QLQTFKAWLWASAT             
Subjt:  CLLYEIGKGTSSWWFPYLKHLPQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQTFKAWLWASATPVCILKTPILSNI

Query:  LSLPSPPCSVPKVLTPSLKVTLNLGILCLLEICGLSVLLLMKDYPCATLPPIVHCFSSDLWSGSCLHKGSWISPNVDSSPPHERLPLCHPTQIVHCFSSD
                                                                                                          S 
Subjt:  LSLPSPPCSVPKVLTPSLKVTLNLGILCLLEICGLSVLLLMKDYPCATLPPIVHCFSSDLWSGSCLHKGSWISPNVDSSPPHERLPLCHPTQIVHCFSSD

Query:  LTLHVPWDEAGCLCPVGDLFNYAAPEGEPLDVMDVSSFSPHVSLNGDMTTDELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNL
         TL+VPWDEAGCLCPVGDLFNYAAPEGE  + +DV SF  H SLN ++   EL EE+RD+ WALTDGGFEEN SAYCFYARESY+KGEQVLLSYGTY+NL
Subjt:  LTLHVPWDEAGCLCPVGDLFNYAAPEGEPLDVMDVSSFSPHVSLNGDMTTDELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNL

Query:  ELLEYYGFLLQENPNEKVFIPIEHDIYSSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNDLP
        ELLEYYGFLLQENPN+KVFIPIEHDIY SSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLS+KNEILVMQWLSKNCHTVLN+LP
Subjt:  ELLEYYGFLLQENPNEKVFIPIEHDIYSSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNDLP

Query:  TSVEEDNQLLCNISKVQDLQVTRELRKVLLTYGGEFCAFLETNGLVNSDDTEVHISQKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTISSLSS
        TS+EEDNQLLCNI+KVQDLQV REL+K LLTYGGEFCAFLETNG+VN D+ E H SQK+KRSL+RWKLAVQWRLLYKKALVDCI YCT TI SLSS
Subjt:  TSVEEDNQLLCNISKVQDLQVTRELRKVLLTYGGEFCAFLETNGLVNSDDTEVHISQKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTISSLSS

XP_008457030.1 PREDICTED: protein SET DOMAIN GROUP 40 isoform X2 [Cucumis melo]6.9e-23270.13Show/hide
Query:  METEGSFGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTF
        METEGSFGSLLRWAADHGISDS+DQ TS SCLG SLCV FFPD+GGRGL AVRQLNKGEL+LR PKSVLLTTQ LSLEDEKLAMAL  +PSLSSTQKLTF
Subjt:  METEGSFGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTF

Query:  CLLYEIGKGTSSWWFPYLKHLPQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQTFKAWLWASATPVCILKTPILSNI
        CLL EI KG SS WFPYLKHLPQSYDILATFG+FEKQALQVDYAIWATEKAALKSRM+W GVKGLMQESNIKNQLQTFKAWLWASAT             
Subjt:  CLLYEIGKGTSSWWFPYLKHLPQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQTFKAWLWASATPVCILKTPILSNI

Query:  LSLPSPPCSVPKVLTPSLKVTLNLGILCLLEICGLSVLLLMKDYPCATLPPIVHCFSSDLWSGSCLHKGSWISPNVDSSPPHERLPLCHPTQIVHCFSSD
                                                                                                          S 
Subjt:  LSLPSPPCSVPKVLTPSLKVTLNLGILCLLEICGLSVLLLMKDYPCATLPPIVHCFSSDLWSGSCLHKGSWISPNVDSSPPHERLPLCHPTQIVHCFSSD

Query:  LTLHVPWDEAGCLCPVGDLFNYAAPEGEPLDVMDVSSFSPHVSLNGDMTTDELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNL
         TL+VPWDEAGCLCPVGDLFNYAAPEGE  + MDV SF  H SLN ++   E  EE+RD+ W LTDGGFEEN SAYCFYARESYKKGEQVLLSYGTY+N+
Subjt:  LTLHVPWDEAGCLCPVGDLFNYAAPEGEPLDVMDVSSFSPHVSLNGDMTTDELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNL

Query:  ELLEYYGFLLQENPNEKVFIPIEHDIYSSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNDLP
        ELLEYYGFLLQENPN+KVFIPIEHDIY SSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLS+KNE LVMQWLSKNCHTVLN+LP
Subjt:  ELLEYYGFLLQENPNEKVFIPIEHDIYSSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNDLP

Query:  TSVEEDNQLLCNISKVQDLQVTRELRKVLLTYGGEFCAFLETNGLVNSDDTEVHISQKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTISSLSS
        TS+EED+QLLCNI+KVQDLQV RELRK+LLTYGGE CAFLETNG+VN D+ E H+S+K+KRSLERWKLAVQWRLLYKKALVDCI YCTRTI SLSS
Subjt:  TSVEEDNQLLCNISKVQDLQVTRELRKVLLTYGGEFCAFLETNGLVNSDDTEVHISQKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTISSLSS

XP_022983189.1 protein SET DOMAIN GROUP 40 isoform X1 [Cucurbita maxima]2.6e-23168.79Show/hide
Query:  METEGSFGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTF
        M TEGSF SLLRWAADHGISDSVD+Q+SHSCLG SLCVCFFPDAGGRGLGAVR L KGELVL+VPKSVLLTTQ LSL+DEKL+MAL  YPSLSSTQKLTF
Subjt:  METEGSFGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTF

Query:  CLLYEIGKGTSSWWFPYLKHLPQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQTFKAWLWASATPVCILKTPILSNI
        CLLYEIGKG+SSWWFPY KHLP +Y+ LATFG+FEKQALQVDYA+W  EKAA KS  EW GVKGLM+ESNIKNQLQTFKAWLWASAT             
Subjt:  CLLYEIGKGTSSWWFPYLKHLPQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQTFKAWLWASATPVCILKTPILSNI

Query:  LSLPSPPCSVPKVLTPSLKVTLNLGILCLLEICGLSVLLLMKDYPCATLPPIVHCFSSDLWSGSCLHKGSWISPNVDSSPPHERLPLCHPTQIVHCFSSD
                                                                                                          S 
Subjt:  LSLPSPPCSVPKVLTPSLKVTLNLGILCLLEICGLSVLLLMKDYPCATLPPIVHCFSSDLWSGSCLHKGSWISPNVDSSPPHERLPLCHPTQIVHCFSSD

Query:  LTLHVPWDEAGCLCPVGDLFNYAAPEGEPLDVMDVSSFSPHVSLNGDMTTDELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNL
          L+VPWDEAGCLCPVGDLFNYAAPEGE LD+MDVSSFS H SLNG++TTD LH+E++DT  ALTDGGFEENVSAYCFYARESYK+GEQVLLSYGTYSNL
Subjt:  LTLHVPWDEAGCLCPVGDLFNYAAPEGEPLDVMDVSSFSPHVSLNGDMTTDELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNL

Query:  ELLEYYGFLLQENPNEKVFIPIEHDIYSSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNDLP
        ELL+YYGFLLQENPN++VFIP+EH+IYSSSSWPKESL+IHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLS+KNE+LVMQWLSKNCH VLN+LP
Subjt:  ELLEYYGFLLQENPNEKVFIPIEHDIYSSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNDLP

Query:  TSVEEDNQLLCNISKVQDLQVTRELRKVLLTYGGEFCAFLETNGLVNSDDTEVHISQKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTISSLSS
        TSVEEDNQLLCNI K+QDLQ   EL K+LLT GGEFCAFLET GLVN ++TE+H++ KIKRSLERWKLAVQWR+LYKKALVDC SYCTRT  SLSS
Subjt:  TSVEEDNQLLCNISKVQDLQVTRELRKVLLTYGGEFCAFLETNGLVNSDDTEVHISQKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTISSLSS

XP_031739341.1 protein SET DOMAIN GROUP 40 isoform X3 [Cucumis sativus]3.7e-23370.35Show/hide
Query:  METEGSFGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTF
        METEGS GSLLRWAADHGISDSVDQ TSHSCLGHSLCV FFPD GGRGL AVRQL KGELVLR PKS+LLTTQ LSLEDEKL MAL  YPSLSSTQKLTF
Subjt:  METEGSFGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTF

Query:  CLLYEIGKGTSSWWFPYLKHLPQSYDILATFGDFEKQAL-QVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQTFKAWLWASATPVCILKTPILSN
        CLLYEI KG SSWWFPYLKHLPQSYDILATFG+FEKQAL QVDYAIWATEKAALKSR +W GV+GLMQESNIK+QLQTFKAWLWASAT            
Subjt:  CLLYEIGKGTSSWWFPYLKHLPQSYDILATFGDFEKQAL-QVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQTFKAWLWASATPVCILKTPILSN

Query:  ILSLPSPPCSVPKVLTPSLKVTLNLGILCLLEICGLSVLLLMKDYPCATLPPIVHCFSSDLWSGSCLHKGSWISPNVDSSPPHERLPLCHPTQIVHCFSS
                                                                                                           S
Subjt:  ILSLPSPPCSVPKVLTPSLKVTLNLGILCLLEICGLSVLLLMKDYPCATLPPIVHCFSSDLWSGSCLHKGSWISPNVDSSPPHERLPLCHPTQIVHCFSS

Query:  DLTLHVPWDEAGCLCPVGDLFNYAAPEGEPLDVMDVSSFSPHVSLNGDMTTDELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSN
          TL+VPWDEAGCLCPVGDLFNYAAPEGE  + +DV SF  H SLN ++   EL EE+RD+ WALTDGGFEEN SAYCFYARESY+KGEQVLLSYGTY+N
Subjt:  DLTLHVPWDEAGCLCPVGDLFNYAAPEGEPLDVMDVSSFSPHVSLNGDMTTDELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSN

Query:  LELLEYYGFLLQENPNEKVFIPIEHDIYSSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNDL
        LELLEYYGFLLQENPN+KVFIPIEHDIY SSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLS+KNEILVMQWLSKNCHTVLN+L
Subjt:  LELLEYYGFLLQENPNEKVFIPIEHDIYSSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNDL

Query:  PTSVEEDNQLLCNISKVQDLQVTRELRKVLLTYGGEFCAFLETNGLVNSDDTEVHISQKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTISSLSS
        PTS+EEDNQLLCNI+KVQDLQV REL+K LLTYGGEFCAFLETNG+VN D+ E H SQK+KRSL+RWKLAVQWRLLYKKALVDCI YCT TI SLSS
Subjt:  PTSVEEDNQLLCNISKVQDLQVTRELRKVLLTYGGEFCAFLETNGLVNSDDTEVHISQKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTISSLSS

XP_038896047.1 protein SET DOMAIN GROUP 40 [Benincasa hispida]4.3e-24272.82Show/hide
Query:  METEGSFGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTF
        M TE SFGSLLRWAADHGISDSVDQQTSHSCLG SLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVL TTQ LSLEDEKLA AL  YPSLSSTQKLTF
Subjt:  METEGSFGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTF

Query:  CLLYEIGKGTSSWWFPYLKHLPQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQTFKAWLWASATPVCILKTPILSNI
        CLLYEIGKGTSSWW PYLKHLPQSYDILATFG+FEKQALQVDY IWATEKAALKS MEW GVKGLM+E NIKNQLQTFKAWLWASAT             
Subjt:  CLLYEIGKGTSSWWFPYLKHLPQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQTFKAWLWASATPVCILKTPILSNI

Query:  LSLPSPPCSVPKVLTPSLKVTLNLGILCLLEICGLSVLLLMKDYPCATLPPIVHCFSSDLWSGSCLHKGSWISPNVDSSPPHERLPLCHPTQIVHCFSSD
                                                                                                          S 
Subjt:  LSLPSPPCSVPKVLTPSLKVTLNLGILCLLEICGLSVLLLMKDYPCATLPPIVHCFSSDLWSGSCLHKGSWISPNVDSSPPHERLPLCHPTQIVHCFSSD

Query:  LTLHVPWDEAGCLCPVGDLFNYAAPEGEPLDVMDVSSFSPHVSLNGDMTTDELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNL
          L+VPWDEAGCLCPVGDLFNYAAPEGE +D  DVS FSPH SLNGD+TTDELHEE+RDT WALTDGGFEE+VSAYCFYARESYKKGEQVLLSYGTYSNL
Subjt:  LTLHVPWDEAGCLCPVGDLFNYAAPEGEPLDVMDVSSFSPHVSLNGDMTTDELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNL

Query:  ELLEYYGFLLQENPNEKVFIPIEHDIYSSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNDLP
        ELLEYYGFLLQENPN+KVFIP+EHDIY+SSSWPKESLYIHQNGNPSF+LLSALRLWATHPNKRRGVGHLAY+GSQLS+KNEILVMQ LSKNC TVLN+LP
Subjt:  ELLEYYGFLLQENPNEKVFIPIEHDIYSSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNDLP

Query:  TSVEEDNQLLCNISKVQDLQVTRELRKVLLTYGGEFCAFLETNGLVNSDDTEVHISQKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTISSLSS
        TSVEEDNQLLCNI K+QDLQV RELRK+LLTYGGEF AFLETNG+VN D+ E+H+S KIKRSLERWKLAVQWRLLYKKALVDCISYCTRTI SLSS
Subjt:  TSVEEDNQLLCNISKVQDLQVTRELRKVLLTYGGEFCAFLETNGLVNSDDTEVHISQKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTISSLSS

TrEMBL top hitse value%identityAlignment
A0A0A0L7L4 SET domain-containing protein5.3e-23069.71Show/hide
Query:  METEGSFGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTF
        METEGS GSLLRWAADHGISDSVDQ TSHSCLGHSLCV FFPD GGRGL AVRQL KGELVLR PKS+LLTTQ LSLEDEKL MAL  YPSLSSTQKLTF
Subjt:  METEGSFGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTF

Query:  CLLYEIGKGTSSWWFPYLKHLPQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQTFKAWLWASATPVCILKTPILSNI
        CLLYEI KG SSWWFPYLKHLPQSYDILATFG+FEKQALQVDYAIWATEKAALKSR +W GV+GLMQESNIK+QLQTFKAWLWASAT             
Subjt:  CLLYEIGKGTSSWWFPYLKHLPQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQTFKAWLWASATPVCILKTPILSNI

Query:  LSLPSPPCSVPKVLTPSLKVTLNLGILCLLEICGLSVLLLMKDYPCATLPPIVHCFSSDLWSGSCLHKGSWISPNVDSSPPHERLPLCHPTQIVHCFSSD
                                                                                                          S 
Subjt:  LSLPSPPCSVPKVLTPSLKVTLNLGILCLLEICGLSVLLLMKDYPCATLPPIVHCFSSDLWSGSCLHKGSWISPNVDSSPPHERLPLCHPTQIVHCFSSD

Query:  LTLHVPWDEAGCLCPVGDLFNYAAPEGEPLDVMDVSSFSPHVSLNGDMTTDELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNL
         TL+VPWDEAGCLCPVGDLFNYAAPEGE  + +DV SF  H SLN ++   EL EE+RD+ WALTDGGFEEN SAYCFYARESY+KGEQVLLSYGTY+NL
Subjt:  LTLHVPWDEAGCLCPVGDLFNYAAPEGEPLDVMDVSSFSPHVSLNGDMTTDELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNL

Query:  ELLEYYGFLLQENPNEKVFIPIEHDIYSSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNDLP
        ELLEYYGFLLQENPN+KVFIPIEHDIY SSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLS+KNEILVMQWLSKNCHTVLN+LP
Subjt:  ELLEYYGFLLQENPNEKVFIPIEHDIYSSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNDLP

Query:  TSVEEDNQLLCNISKVQDLQVTRELRKVLLTYGGEFCAFLETNGLVNSDDTEVHISQKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTI
        TS+EEDNQLLCNI+KVQDLQV REL+K LLTYGGEFCAFLETNG+VN D+ E H SQK+KRSL+RWKLAVQWRLLYKKALVDCI    R +
Subjt:  TSVEEDNQLLCNISKVQDLQVTRELRKVLLTYGGEFCAFLETNGLVNSDDTEVHISQKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTI

A0A1S3C4J5 protein SET DOMAIN GROUP 40 isoform X23.3e-23270.13Show/hide
Query:  METEGSFGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTF
        METEGSFGSLLRWAADHGISDS+DQ TS SCLG SLCV FFPD+GGRGL AVRQLNKGEL+LR PKSVLLTTQ LSLEDEKLAMAL  +PSLSSTQKLTF
Subjt:  METEGSFGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTF

Query:  CLLYEIGKGTSSWWFPYLKHLPQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQTFKAWLWASATPVCILKTPILSNI
        CLL EI KG SS WFPYLKHLPQSYDILATFG+FEKQALQVDYAIWATEKAALKSRM+W GVKGLMQESNIKNQLQTFKAWLWASAT             
Subjt:  CLLYEIGKGTSSWWFPYLKHLPQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQTFKAWLWASATPVCILKTPILSNI

Query:  LSLPSPPCSVPKVLTPSLKVTLNLGILCLLEICGLSVLLLMKDYPCATLPPIVHCFSSDLWSGSCLHKGSWISPNVDSSPPHERLPLCHPTQIVHCFSSD
                                                                                                          S 
Subjt:  LSLPSPPCSVPKVLTPSLKVTLNLGILCLLEICGLSVLLLMKDYPCATLPPIVHCFSSDLWSGSCLHKGSWISPNVDSSPPHERLPLCHPTQIVHCFSSD

Query:  LTLHVPWDEAGCLCPVGDLFNYAAPEGEPLDVMDVSSFSPHVSLNGDMTTDELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNL
         TL+VPWDEAGCLCPVGDLFNYAAPEGE  + MDV SF  H SLN ++   E  EE+RD+ W LTDGGFEEN SAYCFYARESYKKGEQVLLSYGTY+N+
Subjt:  LTLHVPWDEAGCLCPVGDLFNYAAPEGEPLDVMDVSSFSPHVSLNGDMTTDELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNL

Query:  ELLEYYGFLLQENPNEKVFIPIEHDIYSSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNDLP
        ELLEYYGFLLQENPN+KVFIPIEHDIY SSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLS+KNE LVMQWLSKNCHTVLN+LP
Subjt:  ELLEYYGFLLQENPNEKVFIPIEHDIYSSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNDLP

Query:  TSVEEDNQLLCNISKVQDLQVTRELRKVLLTYGGEFCAFLETNGLVNSDDTEVHISQKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTISSLSS
        TS+EED+QLLCNI+KVQDLQV RELRK+LLTYGGE CAFLETNG+VN D+ E H+S+K+KRSLERWKLAVQWRLLYKKALVDCI YCTRTI SLSS
Subjt:  TSVEEDNQLLCNISKVQDLQVTRELRKVLLTYGGEFCAFLETNGLVNSDDTEVHISQKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTISSLSS

A0A1S3C4N2 protein SET DOMAIN GROUP 40 isoform X12.4e-23069.55Show/hide
Query:  METEGSFGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGG-----RGLGAVRQLNKGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSST
        METEGSFGSLLRWAADHGISDS+DQ TS SCLG SLCV FFPD+GG     RGL AVRQLNKGEL+LR PKSVLLTTQ LSLEDEKLAMAL  +PSLSST
Subjt:  METEGSFGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGG-----RGLGAVRQLNKGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSST

Query:  QKLTFCLLYEIGKGTSSWWFPYLKHLPQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQTFKAWLWASATPVCILKTP
        QKLTFCLL EI KG SS WFPYLKHLPQSYDILATFG+FEKQALQVDYAIWATEKAALKSRM+W GVKGLMQESNIKNQLQTFKAWLWASAT        
Subjt:  QKLTFCLLYEIGKGTSSWWFPYLKHLPQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQTFKAWLWASATPVCILKTP

Query:  ILSNILSLPSPPCSVPKVLTPSLKVTLNLGILCLLEICGLSVLLLMKDYPCATLPPIVHCFSSDLWSGSCLHKGSWISPNVDSSPPHERLPLCHPTQIVH
                                                                                                            
Subjt:  ILSNILSLPSPPCSVPKVLTPSLKVTLNLGILCLLEICGLSVLLLMKDYPCATLPPIVHCFSSDLWSGSCLHKGSWISPNVDSSPPHERLPLCHPTQIVH

Query:  CFSSDLTLHVPWDEAGCLCPVGDLFNYAAPEGEPLDVMDVSSFSPHVSLNGDMTTDELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYG
           S  TL+VPWDEAGCLCPVGDLFNYAAPEGE  + MDV SF  H SLN ++   E  EE+RD+ W LTDGGFEEN SAYCFYARESYKKGEQVLLSYG
Subjt:  CFSSDLTLHVPWDEAGCLCPVGDLFNYAAPEGEPLDVMDVSSFSPHVSLNGDMTTDELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYG

Query:  TYSNLELLEYYGFLLQENPNEKVFIPIEHDIYSSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTV
        TY+N+ELLEYYGFLLQENPN+KVFIPIEHDIY SSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLS+KNE LVMQWLSKNCHTV
Subjt:  TYSNLELLEYYGFLLQENPNEKVFIPIEHDIYSSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTV

Query:  LNDLPTSVEEDNQLLCNISKVQDLQVTRELRKVLLTYGGEFCAFLETNGLVNSDDTEVHISQKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTISSLS
        LN+LPTS+EED+QLLCNI+KVQDLQV RELRK+LLTYGGE CAFLETNG+VN D+ E H+S+K+KRSLERWKLAVQWRLLYKKALVDCI YCTRTI SLS
Subjt:  LNDLPTSVEEDNQLLCNISKVQDLQVTRELRKVLLTYGGEFCAFLETNGLVNSDDTEVHISQKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTISSLS

Query:  S
        S
Subjt:  S

A0A5D3BQD3 Protein SET DOMAIN GROUP 40 isoform X23.3e-23270.13Show/hide
Query:  METEGSFGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTF
        METEGSFGSLLRWAADHGISDS+DQ TS SCLG SLCV FFPD+GGRGL AVRQLNKGEL+LR PKSVLLTTQ LSLEDEKLAMAL  +PSLSSTQKLTF
Subjt:  METEGSFGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTF

Query:  CLLYEIGKGTSSWWFPYLKHLPQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQTFKAWLWASATPVCILKTPILSNI
        CLL EI KG SS WFPYLKHLPQSYDILATFG+FEKQALQVDYAIWATEKAALKSRM+W GVKGLMQESNIKNQLQTFKAWLWASAT             
Subjt:  CLLYEIGKGTSSWWFPYLKHLPQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQTFKAWLWASATPVCILKTPILSNI

Query:  LSLPSPPCSVPKVLTPSLKVTLNLGILCLLEICGLSVLLLMKDYPCATLPPIVHCFSSDLWSGSCLHKGSWISPNVDSSPPHERLPLCHPTQIVHCFSSD
                                                                                                          S 
Subjt:  LSLPSPPCSVPKVLTPSLKVTLNLGILCLLEICGLSVLLLMKDYPCATLPPIVHCFSSDLWSGSCLHKGSWISPNVDSSPPHERLPLCHPTQIVHCFSSD

Query:  LTLHVPWDEAGCLCPVGDLFNYAAPEGEPLDVMDVSSFSPHVSLNGDMTTDELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNL
         TL+VPWDEAGCLCPVGDLFNYAAPEGE  + MDV SF  H SLN ++   E  EE+RD+ W LTDGGFEEN SAYCFYARESYKKGEQVLLSYGTY+N+
Subjt:  LTLHVPWDEAGCLCPVGDLFNYAAPEGEPLDVMDVSSFSPHVSLNGDMTTDELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNL

Query:  ELLEYYGFLLQENPNEKVFIPIEHDIYSSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNDLP
        ELLEYYGFLLQENPN+KVFIPIEHDIY SSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLS+KNE LVMQWLSKNCHTVLN+LP
Subjt:  ELLEYYGFLLQENPNEKVFIPIEHDIYSSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNDLP

Query:  TSVEEDNQLLCNISKVQDLQVTRELRKVLLTYGGEFCAFLETNGLVNSDDTEVHISQKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTISSLSS
        TS+EED+QLLCNI+KVQDLQV RELRK+LLTYGGE CAFLETNG+VN D+ E H+S+K+KRSLERWKLAVQWRLLYKKALVDCI YCTRTI SLSS
Subjt:  TSVEEDNQLLCNISKVQDLQVTRELRKVLLTYGGEFCAFLETNGLVNSDDTEVHISQKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTISSLSS

A0A6J1J6L6 protein SET DOMAIN GROUP 40 isoform X11.3e-23168.79Show/hide
Query:  METEGSFGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTF
        M TEGSF SLLRWAADHGISDSVD+Q+SHSCLG SLCVCFFPDAGGRGLGAVR L KGELVL+VPKSVLLTTQ LSL+DEKL+MAL  YPSLSSTQKLTF
Subjt:  METEGSFGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTF

Query:  CLLYEIGKGTSSWWFPYLKHLPQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQTFKAWLWASATPVCILKTPILSNI
        CLLYEIGKG+SSWWFPY KHLP +Y+ LATFG+FEKQALQVDYA+W  EKAA KS  EW GVKGLM+ESNIKNQLQTFKAWLWASAT             
Subjt:  CLLYEIGKGTSSWWFPYLKHLPQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQTFKAWLWASATPVCILKTPILSNI

Query:  LSLPSPPCSVPKVLTPSLKVTLNLGILCLLEICGLSVLLLMKDYPCATLPPIVHCFSSDLWSGSCLHKGSWISPNVDSSPPHERLPLCHPTQIVHCFSSD
                                                                                                          S 
Subjt:  LSLPSPPCSVPKVLTPSLKVTLNLGILCLLEICGLSVLLLMKDYPCATLPPIVHCFSSDLWSGSCLHKGSWISPNVDSSPPHERLPLCHPTQIVHCFSSD

Query:  LTLHVPWDEAGCLCPVGDLFNYAAPEGEPLDVMDVSSFSPHVSLNGDMTTDELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNL
          L+VPWDEAGCLCPVGDLFNYAAPEGE LD+MDVSSFS H SLNG++TTD LH+E++DT  ALTDGGFEENVSAYCFYARESYK+GEQVLLSYGTYSNL
Subjt:  LTLHVPWDEAGCLCPVGDLFNYAAPEGEPLDVMDVSSFSPHVSLNGDMTTDELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNL

Query:  ELLEYYGFLLQENPNEKVFIPIEHDIYSSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNDLP
        ELL+YYGFLLQENPN++VFIP+EH+IYSSSSWPKESL+IHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLS+KNE+LVMQWLSKNCH VLN+LP
Subjt:  ELLEYYGFLLQENPNEKVFIPIEHDIYSSSSWPKESLYIHQNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNDLP

Query:  TSVEEDNQLLCNISKVQDLQVTRELRKVLLTYGGEFCAFLETNGLVNSDDTEVHISQKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTISSLSS
        TSVEEDNQLLCNI K+QDLQ   EL K+LLT GGEFCAFLET GLVN ++TE+H++ KIKRSLERWKLAVQWR+LYKKALVDC SYCTRT  SLSS
Subjt:  TSVEEDNQLLCNISKVQDLQVTRELRKVLLTYGGEFCAFLETNGLVNSDDTEVHISQKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTISSLSS

SwissProt top hitse value%identityAlignment
B2KI88 Actin-histidine N-methyltransferase6.1e-0529.86Show/hide
Query:  EGSFGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLT--FC
        E  F  L++WA+++G S    +  S                 G GL A R +   EL L VP+ +L+T +  S ++  L    ++   L +   +T  F 
Subjt:  EGSFGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLT--FC

Query:  LLYEIGKGTSSWWFPYLKHLPQSYDILATFGDFEKQALQVDYAI
        LL E     +S+W PY++ LP  YD    FG+ E + LQ   AI
Subjt:  LLYEIGKGTSSWWFPYLKHLPQSYDILATFGDFEKQALQVDYAI

B7ZUF3 Actin-histidine N-methyltransferase6.1e-0531.94Show/hide
Query:  EGSFGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLT--FC
        E  F  L+ W  ++G S            G  L    FP+  G GL A R++   EL L VP+ +L+T +  S +   L    ++   L +   +T  F 
Subjt:  EGSFGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLT--FC

Query:  LLYEIGKGTSSWWFPYLKHLPQSYDILATFGDFEKQALQVDYAI
        LL E     +S+W PY+K LP  YD    F + E Q LQ   AI
Subjt:  LLYEIGKGTSSWWFPYLKHLPQSYDILATFGDFEKQALQVDYAI

Q5ZML9 Actin-histidine N-methyltransferase2.3e-0428.37Show/hide
Query:  FGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLT--FCLLY
        F  L++WA ++G S    +  +              +  G GL A R++   EL L VP+ +L+T +  S ++  L    ++   L +   +T  F LL 
Subjt:  FGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLT--FCLLY

Query:  EIGKGTSSWWFPYLKHLPQSYDILATFGDFEKQALQVDYAI
        E     +S+W PY++ LP  YD    F + E Q L+   AI
Subjt:  EIGKGTSSWWFPYLKHLPQSYDILATFGDFEKQALQVDYAI

Q6NQJ8 Protein SET DOMAIN GROUP 407.1e-13145.1Show/hide
Query:  SLLRWAADHGISDSVD-QQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTFCLLYEIG
        + LRWAA+ GISDS+D  +   SCLGHSL V  FPDAGGRGLGA R+L KGELVL+VP+  L+TT+ +  +D KL+ A+N + SLSSTQ L+ CLLYE+ 
Subjt:  SLLRWAADHGISDSVD-QQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTFCLLYEIG

Query:  KGTSSWWFPYLKHLPQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQTFKAWLWASATPVCILKTPILSNILSLPSPP
        K   S+W+PYL H+P+ YD+LATFG+FEKQALQV+ A+WATEKA  K + EW     LM+E  +K + ++F+AWLWASAT                    
Subjt:  KGTSSWWFPYLKHLPQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQTFKAWLWASATPVCILKTPILSNILSLPSPP

Query:  CSVPKVLTPSLKVTLNLGILCLLEICGLSVLLLMKDYPCATLPPIVHCFSSDLWSGSCLHKGSWISPNVDSSPPHERLPLCHPTQIVHCFSSDLTLHVPW
                                                                                                   S  TLHVPW
Subjt:  CSVPKVLTPSLKVTLNLGILCLLEICGLSVLLLMKDYPCATLPPIVHCFSSDLWSGSCLHKGSWISPNVDSSPPHERLPLCHPTQIVHCFSSDLTLHVPW

Query:  DEAGCLCPVGDLFNYAAPEGEPLDVMDVSSFSPHVSLNGDMTTDELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYG
        D AGCLCPVGDLFNY AP G+  +       + +V   G +   E H E+      LTDGGFEE+V+AYC YAR +Y+ GEQVLL YGTY+NLELLE+YG
Subjt:  DEAGCLCPVGDLFNYAAPEGEPLDVMDVSSFSPHVSLNGDMTTDELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYG

Query:  FLLQENPNEKVFIPIEHDIYS-SSSWPKESLYIHQNGNPSFALLSALRLWATHPNKR-RGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNDLPTSVEE
        F+L+EN N+KVFIP+E  ++S +SSWPK+SLYIHQ+G  SFAL+S LRLW    ++R + V  L YAGSQ+S+KNEILVM+W+S+ C +VL DLPTSV E
Subjt:  FLLQENPNEKVFIPIEHDIYS-SSSWPKESLYIHQNGNPSFALLSALRLWATHPNKR-RGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNDLPTSVEE

Query:  DNQLLCNISKVQDLQVTRELRKVLLTYGGEFCAFLETNGLVN---SDDTEVHISQKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTISSL
        D  LL NI K+QD ++  E +K    +G E  AFL+ N L +        +  S+K  R L +W+ +VQWRL YK+ L DCISYC   +++L
Subjt:  DNQLLCNISKVQDLQVTRELRKVLLTYGGEFCAFLETNGLVN---SDDTEVHISQKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTISSL

Arabidopsis top hitse value%identityAlignment
AT5G17240.1 SET domain group 405.1e-13245.1Show/hide
Query:  SLLRWAADHGISDSVD-QQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTFCLLYEIG
        + LRWAA+ GISDS+D  +   SCLGHSL V  FPDAGGRGLGA R+L KGELVL+VP+  L+TT+ +  +D KL+ A+N + SLSSTQ L+ CLLYE+ 
Subjt:  SLLRWAADHGISDSVD-QQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTFCLLYEIG

Query:  KGTSSWWFPYLKHLPQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQTFKAWLWASATPVCILKTPILSNILSLPSPP
        K   S+W+PYL H+P+ YD+LATFG+FEKQALQV+ A+WATEKA  K + EW     LM+E  +K + ++F+AWLWASAT                    
Subjt:  KGTSSWWFPYLKHLPQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQTFKAWLWASATPVCILKTPILSNILSLPSPP

Query:  CSVPKVLTPSLKVTLNLGILCLLEICGLSVLLLMKDYPCATLPPIVHCFSSDLWSGSCLHKGSWISPNVDSSPPHERLPLCHPTQIVHCFSSDLTLHVPW
                                                                                                   S  TLHVPW
Subjt:  CSVPKVLTPSLKVTLNLGILCLLEICGLSVLLLMKDYPCATLPPIVHCFSSDLWSGSCLHKGSWISPNVDSSPPHERLPLCHPTQIVHCFSSDLTLHVPW

Query:  DEAGCLCPVGDLFNYAAPEGEPLDVMDVSSFSPHVSLNGDMTTDELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYG
        D AGCLCPVGDLFNY AP G+  +       + +V   G +   E H E+      LTDGGFEE+V+AYC YAR +Y+ GEQVLL YGTY+NLELLE+YG
Subjt:  DEAGCLCPVGDLFNYAAPEGEPLDVMDVSSFSPHVSLNGDMTTDELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYG

Query:  FLLQENPNEKVFIPIEHDIYS-SSSWPKESLYIHQNGNPSFALLSALRLWATHPNKR-RGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNDLPTSVEE
        F+L+EN N+KVFIP+E  ++S +SSWPK+SLYIHQ+G  SFAL+S LRLW    ++R + V  L YAGSQ+S+KNEILVM+W+S+ C +VL DLPTSV E
Subjt:  FLLQENPNEKVFIPIEHDIYS-SSSWPKESLYIHQNGNPSFALLSALRLWATHPNKR-RGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNDLPTSVEE

Query:  DNQLLCNISKVQDLQVTRELRKVLLTYGGEFCAFLETNGLVN---SDDTEVHISQKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTISSL
        D  LL NI K+QD ++  E +K    +G E  AFL+ N L +        +  S+K  R L +W+ +VQWRL YK+ L DCISYC   +++L
Subjt:  DNQLLCNISKVQDLQVTRELRKVLLTYGGEFCAFLETNGLVN---SDDTEVHISQKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTISSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAACTGAAGGTAGTTTTGGAAGCCTGCTGAGATGGGCGGCGGATCATGGAATTTCAGATTCTGTCGACCAACAGACTTCACATTCTTGTTTGGGCCATTCTTTGTG
CGTCTGTTTCTTCCCTGATGCCGGCGGGAGAGGTTTAGGGGCTGTTCGTCAGCTTAACAAAGGAGAGTTAGTGTTGAGAGTTCCAAAATCTGTCTTGTTGACGACCCAAG
GTTTGTCGTTGGAAGATGAGAAGCTCGCCATGGCTTTGAACGAATACCCATCTCTTTCTTCTACTCAGAAGTTGACCTTTTGTTTACTCTATGAGATTGGTAAAGGAACT
AGTTCTTGGTGGTTCCCTTACTTAAAGCATTTGCCCCAGAGTTACGACATACTAGCAACTTTTGGAGATTTTGAAAAGCAAGCCCTGCAGGTTGATTATGCTATCTGGGC
AACAGAGAAGGCTGCATTGAAATCTCGTATGGAGTGGGGAGGAGTCAAAGGACTAATGCAAGAGTCCAATATTAAAAACCAACTCCAAACATTCAAGGCATGGCTTTGGG
CCTCTGCAACTCCAGTATGCATCCTTAAGACTCCTATTCTATCTAATATACTCTCTCTCCCATCGCCACCTTGCTCTGTGCCTAAAGTCCTGACCCCCTCGCTTAAGGTC
ACCCTTAACTTAGGTATTCTTTGTCTCCTTGAAATCTGTGGACTGTCAGTCCTCCTCCTCATGAAAGATTACCCTTGTGCCACCCTGCCTCCAATAGTTCACTGTTTTTC
TTCTGACTTGTGGAGTGGAAGTTGCTTGCACAAAGGCTCTTGGATCAGCCCAAACGTGGATAGCAGTCCTCCTCATGAAAGATTACCCTTGTGCCACCCGACTCAAATAG
TTCACTGTTTTTCTTCTGACTTGACATTGCATGTACCATGGGATGAGGCCGGATGTTTATGTCCAGTTGGTGACTTGTTTAATTATGCTGCACCTGAAGGGGAGCCCCTT
GACGTTATGGATGTTTCGTCTTTTTCACCACATGTTTCTTTGAATGGAGACATGACTACTGATGAGTTACATGAAGAGAAAAGAGATACTCCATGGGCTTTGACAGATGG
TGGATTTGAGGAAAATGTTTCTGCATACTGCTTCTATGCTCGGGAGAGTTATAAGAAGGGAGAGCAGGTTCTTTTAAGCTATGGTACATACTCAAACTTAGAGCTTCTTG
AATATTATGGGTTTCTTCTACAGGAAAATCCAAATGAAAAAGTTTTCATTCCTATAGAACATGACATTTATAGTTCCAGTTCTTGGCCCAAGGAATCTCTTTACATTCAT
CAAAATGGAAACCCATCTTTTGCTCTCCTTTCTGCCCTGCGATTATGGGCAACCCACCCGAACAAGCGTAGAGGTGTCGGGCATCTTGCTTATGCTGGGTCACAACTCTC
GATCAAGAATGAAATATTAGTCATGCAGTGGTTATCCAAGAACTGCCATACTGTTTTAAACGATCTGCCTACATCAGTTGAAGAAGACAATCAGCTTCTATGCAACATCT
CCAAAGTCCAAGATCTGCAGGTAACAAGGGAGCTCCGGAAGGTGCTGTTGACTTACGGAGGTGAGTTTTGTGCCTTCTTGGAGACCAATGGTCTGGTGAATAGTGATGAC
ACCGAGGTACATATATCCCAGAAAATAAAACGCTCTCTGGAGAGATGGAAACTAGCAGTCCAGTGGAGGCTCTTGTACAAGAAGGCGTTGGTTGATTGCATAAGTTACTG
CACCAGAACTATTAGTTCTCTATCTTCTTAA
mRNA sequenceShow/hide mRNA sequence
AGCAGAGGTTTTGATGGAGTTGGAGGGTTAGTGAAATGGAAACTGAAGGTAGTTTTGGAAGCCTGCTGAGATGGGCGGCGGATCATGGAATTTCAGATTCTGTCGACCAA
CAGACTTCACATTCTTGTTTGGGCCATTCTTTGTGCGTCTGTTTCTTCCCTGATGCCGGCGGGAGAGGTTTAGGGGCTGTTCGTCAGCTTAACAAAGGAGAGTTAGTGTT
GAGAGTTCCAAAATCTGTCTTGTTGACGACCCAAGGTTTGTCGTTGGAAGATGAGAAGCTCGCCATGGCTTTGAACGAATACCCATCTCTTTCTTCTACTCAGAAGTTGA
CCTTTTGTTTACTCTATGAGATTGGTAAAGGAACTAGTTCTTGGTGGTTCCCTTACTTAAAGCATTTGCCCCAGAGTTACGACATACTAGCAACTTTTGGAGATTTTGAA
AAGCAAGCCCTGCAGGTTGATTATGCTATCTGGGCAACAGAGAAGGCTGCATTGAAATCTCGTATGGAGTGGGGAGGAGTCAAAGGACTAATGCAAGAGTCCAATATTAA
AAACCAACTCCAAACATTCAAGGCATGGCTTTGGGCCTCTGCAACTCCAGTATGCATCCTTAAGACTCCTATTCTATCTAATATACTCTCTCTCCCATCGCCACCTTGCT
CTGTGCCTAAAGTCCTGACCCCCTCGCTTAAGGTCACCCTTAACTTAGGTATTCTTTGTCTCCTTGAAATCTGTGGACTGTCAGTCCTCCTCCTCATGAAAGATTACCCT
TGTGCCACCCTGCCTCCAATAGTTCACTGTTTTTCTTCTGACTTGTGGAGTGGAAGTTGCTTGCACAAAGGCTCTTGGATCAGCCCAAACGTGGATAGCAGTCCTCCTCA
TGAAAGATTACCCTTGTGCCACCCGACTCAAATAGTTCACTGTTTTTCTTCTGACTTGACATTGCATGTACCATGGGATGAGGCCGGATGTTTATGTCCAGTTGGTGACT
TGTTTAATTATGCTGCACCTGAAGGGGAGCCCCTTGACGTTATGGATGTTTCGTCTTTTTCACCACATGTTTCTTTGAATGGAGACATGACTACTGATGAGTTACATGAA
GAGAAAAGAGATACTCCATGGGCTTTGACAGATGGTGGATTTGAGGAAAATGTTTCTGCATACTGCTTCTATGCTCGGGAGAGTTATAAGAAGGGAGAGCAGGTTCTTTT
AAGCTATGGTACATACTCAAACTTAGAGCTTCTTGAATATTATGGGTTTCTTCTACAGGAAAATCCAAATGAAAAAGTTTTCATTCCTATAGAACATGACATTTATAGTT
CCAGTTCTTGGCCCAAGGAATCTCTTTACATTCATCAAAATGGAAACCCATCTTTTGCTCTCCTTTCTGCCCTGCGATTATGGGCAACCCACCCGAACAAGCGTAGAGGT
GTCGGGCATCTTGCTTATGCTGGGTCACAACTCTCGATCAAGAATGAAATATTAGTCATGCAGTGGTTATCCAAGAACTGCCATACTGTTTTAAACGATCTGCCTACATC
AGTTGAAGAAGACAATCAGCTTCTATGCAACATCTCCAAAGTCCAAGATCTGCAGGTAACAAGGGAGCTCCGGAAGGTGCTGTTGACTTACGGAGGTGAGTTTTGTGCCT
TCTTGGAGACCAATGGTCTGGTGAATAGTGATGACACCGAGGTACATATATCCCAGAAAATAAAACGCTCTCTGGAGAGATGGAAACTAGCAGTCCAGTGGAGGCTCTTG
TACAAGAAGGCGTTGGTTGATTGCATAAGTTACTGCACCAGAACTATTAGTTCTCTATCTTCTTAATCAGGTTCAGGTTGTGATTGCCAGGTTCTAACTTAAGTTACCTA
TTAAATCAATTTTCTAGGTATCATCAAAAGATTAGAATGTTCTAAATATGATAGCATTGAACCGCTTGCTTCTGTTTGCACTTGCTTTACGCCCCTGATTTGCCAAGAAT
TCAGATTTTTATATATGTACTTTATGTGAAATCTTTAGGTGAAAAGGAAAACGATGAAAAGACATTAGTTTTCGTACCTTGGTTTTTGCAGAATTTTTGTTCTATCTGTA
GTGTTAAATATCTAAATTTGGACCTTTCAAAGTACAAACCTATATTTTGATTCTGGTAAGGATGGTGTTGGGAGAAGTTATATTTGTAATATCAGTATGTCTTAAGCTTT
TTGGTTGGTATCATCTTCATATTCATAAAGAGAAACCCTT
Protein sequenceShow/hide protein sequence
METEGSFGSLLRWAADHGISDSVDQQTSHSCLGHSLCVCFFPDAGGRGLGAVRQLNKGELVLRVPKSVLLTTQGLSLEDEKLAMALNEYPSLSSTQKLTFCLLYEIGKGT
SSWWFPYLKHLPQSYDILATFGDFEKQALQVDYAIWATEKAALKSRMEWGGVKGLMQESNIKNQLQTFKAWLWASATPVCILKTPILSNILSLPSPPCSVPKVLTPSLKV
TLNLGILCLLEICGLSVLLLMKDYPCATLPPIVHCFSSDLWSGSCLHKGSWISPNVDSSPPHERLPLCHPTQIVHCFSSDLTLHVPWDEAGCLCPVGDLFNYAAPEGEPL
DVMDVSSFSPHVSLNGDMTTDELHEEKRDTPWALTDGGFEENVSAYCFYARESYKKGEQVLLSYGTYSNLELLEYYGFLLQENPNEKVFIPIEHDIYSSSSWPKESLYIH
QNGNPSFALLSALRLWATHPNKRRGVGHLAYAGSQLSIKNEILVMQWLSKNCHTVLNDLPTSVEEDNQLLCNISKVQDLQVTRELRKVLLTYGGEFCAFLETNGLVNSDD
TEVHISQKIKRSLERWKLAVQWRLLYKKALVDCISYCTRTISSLSS