; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI06G31890 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI06G31890
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionSAM_MT_RSMB_NOP domain-containing protein
Genome locationChr6:27042553..27046987
RNA-Seq ExpressionCSPI06G31890
SyntenyCSPI06G31890
Gene Ontology termsGO:0001510 - RNA methylation (biological process)
GO:0015031 - protein transport (biological process)
GO:0035672 - oligopeptide transmembrane transport (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0008168 - methyltransferase activity (molecular function)
GO:0035673 - oligopeptide transmembrane transporter activity (molecular function)
InterPro domainsIPR001678 - SAM-dependent methyltransferase RsmB/NOP2-type
IPR023267 - RNA (C5-cytosine) methyltransferase
IPR023269 - RNA (C5-cytosine) methyltransferase, subfamily 9
IPR029063 - S-adenosyl-L-methionine-dependent methyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025665.1 putative 28S rRNA (cytosine-C(5))-methyltransferase isoform X1 [Cucumis melo var. makuwa]1.8e-21595.66Show/hide
Query:  MEEPSTSPLPAAFLDFLKENGLDPSIYSATDSTPRYVRLKPGYKADVEEIEDEIKCKLEKVSWLPGFYSLPPDVQIAGSHAYKTGKIYGIDAASGAAVTA
        MEEPSTSPLPAAFLDFLKENGLDPSIYSATDSTPRYVRLKPGY+ADVEEIEDEIKCKLEKVSWLPGFYSLPPDVQIAGSHAYKTGKIYGIDAASGAAVTA
Subjt:  MEEPSTSPLPAAFLDFLKENGLDPSIYSATDSTPRYVRLKPGYKADVEEIEDEIKCKLEKVSWLPGFYSLPPDVQIAGSHAYKTGKIYGIDAASGAAVTA

Query:  LDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLAACRTMLQKYALGDICRLFVADGATFSLTPVNVTSESKFGESTLEDSVDRLREWTSR
        LDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLAACRTMLQKYALGDICRLFVADGATFSL PVNVTSES+ GESTLEDSVDRL+EWTSR
Subjt:  LDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLAACRTMLQKYALGDICRLFVADGATFSLTPVNVTSESKFGESTLEDSVDRLREWTSR

Query:  KSWKERKKTAKAKQNVSVELVQSGQDPELIFYGLKSGVVGFTKSEIYRSPPENELLSYGYDRVLVDAECTHDGSIKHIQKFESWGWTSFQRRVLDAERTD
        KSWKERKKTAKAKQNV+ ELVQ  QDPELIFYGL SGVVGFTKSEIY+SPPENELLSYGYDRVLVDAECTHDGSIKHIQKFESWGWTS QRRVLDAERTD
Subjt:  KSWKERKKTAKAKQNVSVELVQSGQDPELIFYGLKSGVVGFTKSEIYRSPPENELLSYGYDRVLVDAECTHDGSIKHIQKFESWGWTSFQRRVLDAERTD

Query:  NLTVLQLRLLANGFRLLKSGGILVYSTCSLTVAQNEDVVDQFLKDNASAELQEIEVARNWPCKSGGIPKTLRFDPLTSQTSGLFVAKFLKIS
        NL+VLQLRLL NGFRLLKSGGILVYSTCSLTVAQNEDVV+QFLKDNASAELQEIE ARNWPCKSGGIPKTLRFDPLTSQTSGLFVAKFLKI+
Subjt:  NLTVLQLRLLANGFRLLKSGGILVYSTCSLTVAQNEDVVDQFLKDNASAELQEIEVARNWPCKSGGIPKTLRFDPLTSQTSGLFVAKFLKIS

TYK12538.1 oligopeptide transporter 7-like [Cucumis melo var. makuwa]4.2e-20995.79Show/hide
Query:  MEEPSTSPLPAAFLDFLKENGLDPSIYSATDSTPRYVRLKPGYKADVEEIEDEIKCKLEKVSWLPGFYSLPPDVQIAGSHAYKTGKIYGIDAASGAAVTA
        MEEPSTSPLPAAFLDFLKENGLDPSIYSATDSTPRYVRLKPGY+ADVEEIEDEIKCKLEKVSWLPGFYSLPPDVQIAGSHAYKTGKIYGIDAASGAAVTA
Subjt:  MEEPSTSPLPAAFLDFLKENGLDPSIYSATDSTPRYVRLKPGYKADVEEIEDEIKCKLEKVSWLPGFYSLPPDVQIAGSHAYKTGKIYGIDAASGAAVTA

Query:  LDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLAACRTMLQKYALGDICRLFVADGATFSLTPVNVTSESKFGESTLEDSVDRLREWTSR
        LDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLAACRTMLQKYALGDICRLFVADGATFSL PVNVTSES+ GESTLEDSVDRL+EWTSR
Subjt:  LDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLAACRTMLQKYALGDICRLFVADGATFSLTPVNVTSESKFGESTLEDSVDRLREWTSR

Query:  KSWKERKKTAKAKQNVSVELVQSGQDPELIFYGLKSGVVGFTKSEIYRSPPENELLSYGYDRVLVDAECTHDGSIKHIQKFESWGWTSFQRRVLDAERTD
        KSWKERKKTAKAKQNV+ ELVQ  QDPELIFYGL SGVVGFTKSEIY+SPPENELLSYGYDRVLVDAECTHDGSIKHIQKFESWGWTS QRRVLDAERTD
Subjt:  KSWKERKKTAKAKQNVSVELVQSGQDPELIFYGLKSGVVGFTKSEIYRSPPENELLSYGYDRVLVDAECTHDGSIKHIQKFESWGWTSFQRRVLDAERTD

Query:  NLTVLQLRLLANGFRLLKSGGILVYSTCSLTVAQNEDVVDQFLKDNASAELQEIEVARNWPCKSGGIPKTLRFDPLTSQT
        NL+VLQLRLL NGFRLLKSGGILVYSTCSLTVAQNEDVV+QFLKDNASAELQEIE ARNWPCKSGGIPKTLRFDPLTSQT
Subjt:  NLTVLQLRLLANGFRLLKSGGILVYSTCSLTVAQNEDVVDQFLKDNASAELQEIEVARNWPCKSGGIPKTLRFDPLTSQT

XP_004134857.1 uncharacterized protein LOC101221513 [Cucumis sativus]7.0e-22899.25Show/hide
Query:  MEEPSTSPLPAAFLDFLKENGLDPSIYSATDSTPRYVRLKPGYKADVEEIEDEIKCKLEKVSWLPGFYSLPPDVQIAGSHAYKTGKIYGIDAASGAAVTA
        MEEPSTSPLPAAFLDFLKENGLDPSIYSATDSTPRYVRLKPGYKADVEEIEDEIKCKLEKVSWLPGFYSLPPDVQIAGSHAYKTGKIYGIDAASGAAVTA
Subjt:  MEEPSTSPLPAAFLDFLKENGLDPSIYSATDSTPRYVRLKPGYKADVEEIEDEIKCKLEKVSWLPGFYSLPPDVQIAGSHAYKTGKIYGIDAASGAAVTA

Query:  LDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLAACRTMLQKYALGDICRLFVADGATFSLTPVNVTSESKFGESTLEDSVDRLREWTSR
        LDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLAACRTMLQKYALGDICRLFVADGATFSLTPVNVTSESKFGESTLEDSVDRLREWTSR
Subjt:  LDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLAACRTMLQKYALGDICRLFVADGATFSLTPVNVTSESKFGESTLEDSVDRLREWTSR

Query:  KSWKERKKTAKAKQNVSVELVQSGQDPELIFYGLKSGVVGFTKSEIYRSPPENELLSYGYDRVLVDAECTHDGSIKHIQKFESWGWTSFQRRVLDAERTD
        KSWKERKKTAKAKQ VS ELVQSGQDPELIFYGLKSGVVGFTKSEIYRSPPENELLSYGYDRVLVDAECTHDGSIKHIQKFESWGWTSFQRRVLDAERTD
Subjt:  KSWKERKKTAKAKQNVSVELVQSGQDPELIFYGLKSGVVGFTKSEIYRSPPENELLSYGYDRVLVDAECTHDGSIKHIQKFESWGWTSFQRRVLDAERTD

Query:  NLTVLQLRLLANGFRLLKSGGILVYSTCSLTVAQNEDVVDQFLKDNASAELQEIEVARNWPCKSGGIPKTLRFDPLTSQTSGLFVAKFLKISGIKKRESR
        NLTVLQLRLLANGFRLLKSGGILVYSTCSLTVAQNEDVV+QFLKDNASAELQEIEVARNWPCKSGGIPKTLRFDPLTSQTSGLFVAKFLKISGIKKRESR
Subjt:  NLTVLQLRLLANGFRLLKSGGILVYSTCSLTVAQNEDVVDQFLKDNASAELQEIEVARNWPCKSGGIPKTLRFDPLTSQTSGLFVAKFLKISGIKKRESR

Query:  G
        G
Subjt:  G

XP_008440832.1 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103485139 [Cucumis melo]8.8e-21595.41Show/hide
Query:  MEEPSTSPLPAAFLDFLKENGLDPSIYSATDSTPRYVRLKPGYKADVEEIEDEIKCKLEKVSWLPGFYSLPPDVQIAGSHAYKTGKIYGIDAASGAAVTA
        MEEPSTSPLPAAFLDFLKENGLDPSIYSATDSTPRYVRLKPGY+ADVEEIEDEIKCKLEKVSWLPGFYSLPPDVQIAGSHAYKTGKIYGIDAASGAAVTA
Subjt:  MEEPSTSPLPAAFLDFLKENGLDPSIYSATDSTPRYVRLKPGYKADVEEIEDEIKCKLEKVSWLPGFYSLPPDVQIAGSHAYKTGKIYGIDAASGAAVTA

Query:  LDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLAACRTMLQKYALGDICRLFVADGATFSLTPVNVTSESKFGESTLEDSVDRLREWTSR
        LDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLAACRTMLQKYALGDICRLFVADGATFSL PVNVTSES+ GESTLEDSVDRL+EWTSR
Subjt:  LDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLAACRTMLQKYALGDICRLFVADGATFSLTPVNVTSESKFGESTLEDSVDRLREWTSR

Query:  KSWKERKKTAKAKQNVSVELVQSGQDPELIFYGLKSGVVGFTKSEIYRSPPENELLSYGYDRVLVDAECTHDGSIKHIQKFESWGWTSFQRRVLDAERTD
        KSWKERKK AKAKQNV+ ELVQ  QDPELIFYGL SGVVGFTKSEIY+SPPENELLSYGYDRVLVDAECTHDGSIKHIQKFESWGWTS QRRVLDAERTD
Subjt:  KSWKERKKTAKAKQNVSVELVQSGQDPELIFYGLKSGVVGFTKSEIYRSPPENELLSYGYDRVLVDAECTHDGSIKHIQKFESWGWTSFQRRVLDAERTD

Query:  NLTVLQLRLLANGFRLLKSGGILVYSTCSLTVAQNEDVVDQFLKDNASAELQEIEVARNWPCKSGGIPKTLRFDPLTSQTSGLFVAKFLKIS
        NL+VLQLRLL NGFRLLKSGGILVYSTCSLTVAQNEDVV+QFLKDNASAELQEIE ARNWPCKSGGIPKTLRFDPLTSQTSGLFVAKFLKI+
Subjt:  NLTVLQLRLLANGFRLLKSGGILVYSTCSLTVAQNEDVVDQFLKDNASAELQEIEVARNWPCKSGGIPKTLRFDPLTSQTSGLFVAKFLKIS

XP_038880813.1 uncharacterized protein LOC120072508 [Benincasa hispida]8.8e-20792.35Show/hide
Query:  MEEPSTSPLPAAFLDFLKENGLDPSIYSATDSTPRYVRLKPGYKADVEEIEDEIKCKLEKVSWLPGFYSLPPDVQIAGSHAYKTGKIYGIDAASGAAVTA
        MEEPSTSPLPAAFLDFLKENGLDPSIY+ATDSTPRYVRLKPGY+ADVEEIE EIKCKLEKV+WLPGFYSLPP VQIAGSHAYKTGKIYGIDAASGAAVTA
Subjt:  MEEPSTSPLPAAFLDFLKENGLDPSIYSATDSTPRYVRLKPGYKADVEEIEDEIKCKLEKVSWLPGFYSLPPDVQIAGSHAYKTGKIYGIDAASGAAVTA

Query:  LDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLAACRTMLQKYALGDICRLFVADGATFSLTPVNVTSESKFGESTLEDSVDRLREWTSR
        LDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLAACRTMLQKYALGDICRLFVADGATFSL PVNVTSES+ GESTLEDS DRL+EWTSR
Subjt:  LDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLAACRTMLQKYALGDICRLFVADGATFSLTPVNVTSESKFGESTLEDSVDRLREWTSR

Query:  KSWKERKKTAKAKQNVSVELVQSGQDPELIFYGLKSGVVGFTKSEIYRSPPENELLSYGYDRVLVDAECTHDGSIKHIQKFESWGWTSFQRRVLDAERTD
        KSWKERKK AKAKQNV+ ELVQ  QDPELIFYG KSGVVGFTKSEI+RSPPE+  LSYGYDRVLVDAECTHDGSIKHIQKFESWGWTS QRRVLDAERTD
Subjt:  KSWKERKKTAKAKQNVSVELVQSGQDPELIFYGLKSGVVGFTKSEIYRSPPENELLSYGYDRVLVDAECTHDGSIKHIQKFESWGWTSFQRRVLDAERTD

Query:  NLTVLQLRLLANGFRLLKSGGILVYSTCSLTVAQNEDVVDQFLKDNASAELQEIEVARNWPCKSGGIPKTLRFDPLTSQTSGLFVAKFLKIS
        +LTVLQLRLL NGFRLLKSGGILVYSTCSLT+AQNEDVV+QFLK+NASAELQEIE ARNWPCKSG IPKTLRFDPLTSQTSGLFVAKFLK++
Subjt:  NLTVLQLRLLANGFRLLKSGGILVYSTCSLTVAQNEDVVDQFLKDNASAELQEIEVARNWPCKSGGIPKTLRFDPLTSQTSGLFVAKFLKIS

TrEMBL top hitse value%identityAlignment
A0A0A0KHA5 SAM_MT_RSMB_NOP domain-containing protein3.4e-22899.25Show/hide
Query:  MEEPSTSPLPAAFLDFLKENGLDPSIYSATDSTPRYVRLKPGYKADVEEIEDEIKCKLEKVSWLPGFYSLPPDVQIAGSHAYKTGKIYGIDAASGAAVTA
        MEEPSTSPLPAAFLDFLKENGLDPSIYSATDSTPRYVRLKPGYKADVEEIEDEIKCKLEKVSWLPGFYSLPPDVQIAGSHAYKTGKIYGIDAASGAAVTA
Subjt:  MEEPSTSPLPAAFLDFLKENGLDPSIYSATDSTPRYVRLKPGYKADVEEIEDEIKCKLEKVSWLPGFYSLPPDVQIAGSHAYKTGKIYGIDAASGAAVTA

Query:  LDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLAACRTMLQKYALGDICRLFVADGATFSLTPVNVTSESKFGESTLEDSVDRLREWTSR
        LDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLAACRTMLQKYALGDICRLFVADGATFSLTPVNVTSESKFGESTLEDSVDRLREWTSR
Subjt:  LDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLAACRTMLQKYALGDICRLFVADGATFSLTPVNVTSESKFGESTLEDSVDRLREWTSR

Query:  KSWKERKKTAKAKQNVSVELVQSGQDPELIFYGLKSGVVGFTKSEIYRSPPENELLSYGYDRVLVDAECTHDGSIKHIQKFESWGWTSFQRRVLDAERTD
        KSWKERKKTAKAKQ VS ELVQSGQDPELIFYGLKSGVVGFTKSEIYRSPPENELLSYGYDRVLVDAECTHDGSIKHIQKFESWGWTSFQRRVLDAERTD
Subjt:  KSWKERKKTAKAKQNVSVELVQSGQDPELIFYGLKSGVVGFTKSEIYRSPPENELLSYGYDRVLVDAECTHDGSIKHIQKFESWGWTSFQRRVLDAERTD

Query:  NLTVLQLRLLANGFRLLKSGGILVYSTCSLTVAQNEDVVDQFLKDNASAELQEIEVARNWPCKSGGIPKTLRFDPLTSQTSGLFVAKFLKISGIKKRESR
        NLTVLQLRLLANGFRLLKSGGILVYSTCSLTVAQNEDVV+QFLKDNASAELQEIEVARNWPCKSGGIPKTLRFDPLTSQTSGLFVAKFLKISGIKKRESR
Subjt:  NLTVLQLRLLANGFRLLKSGGILVYSTCSLTVAQNEDVVDQFLKDNASAELQEIEVARNWPCKSGGIPKTLRFDPLTSQTSGLFVAKFLKISGIKKRESR

Query:  G
        G
Subjt:  G

A0A1S3B2T3 LOW QUALITY PROTEIN: uncharacterized protein LOC1034851394.3e-21595.41Show/hide
Query:  MEEPSTSPLPAAFLDFLKENGLDPSIYSATDSTPRYVRLKPGYKADVEEIEDEIKCKLEKVSWLPGFYSLPPDVQIAGSHAYKTGKIYGIDAASGAAVTA
        MEEPSTSPLPAAFLDFLKENGLDPSIYSATDSTPRYVRLKPGY+ADVEEIEDEIKCKLEKVSWLPGFYSLPPDVQIAGSHAYKTGKIYGIDAASGAAVTA
Subjt:  MEEPSTSPLPAAFLDFLKENGLDPSIYSATDSTPRYVRLKPGYKADVEEIEDEIKCKLEKVSWLPGFYSLPPDVQIAGSHAYKTGKIYGIDAASGAAVTA

Query:  LDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLAACRTMLQKYALGDICRLFVADGATFSLTPVNVTSESKFGESTLEDSVDRLREWTSR
        LDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLAACRTMLQKYALGDICRLFVADGATFSL PVNVTSES+ GESTLEDSVDRL+EWTSR
Subjt:  LDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLAACRTMLQKYALGDICRLFVADGATFSLTPVNVTSESKFGESTLEDSVDRLREWTSR

Query:  KSWKERKKTAKAKQNVSVELVQSGQDPELIFYGLKSGVVGFTKSEIYRSPPENELLSYGYDRVLVDAECTHDGSIKHIQKFESWGWTSFQRRVLDAERTD
        KSWKERKK AKAKQNV+ ELVQ  QDPELIFYGL SGVVGFTKSEIY+SPPENELLSYGYDRVLVDAECTHDGSIKHIQKFESWGWTS QRRVLDAERTD
Subjt:  KSWKERKKTAKAKQNVSVELVQSGQDPELIFYGLKSGVVGFTKSEIYRSPPENELLSYGYDRVLVDAECTHDGSIKHIQKFESWGWTSFQRRVLDAERTD

Query:  NLTVLQLRLLANGFRLLKSGGILVYSTCSLTVAQNEDVVDQFLKDNASAELQEIEVARNWPCKSGGIPKTLRFDPLTSQTSGLFVAKFLKIS
        NL+VLQLRLL NGFRLLKSGGILVYSTCSLTVAQNEDVV+QFLKDNASAELQEIE ARNWPCKSGGIPKTLRFDPLTSQTSGLFVAKFLKI+
Subjt:  NLTVLQLRLLANGFRLLKSGGILVYSTCSLTVAQNEDVVDQFLKDNASAELQEIEVARNWPCKSGGIPKTLRFDPLTSQTSGLFVAKFLKIS

A0A5A7SMJ4 Putative 28S rRNA (Cytosine-C(5))-methyltransferase isoform X18.6e-21695.66Show/hide
Query:  MEEPSTSPLPAAFLDFLKENGLDPSIYSATDSTPRYVRLKPGYKADVEEIEDEIKCKLEKVSWLPGFYSLPPDVQIAGSHAYKTGKIYGIDAASGAAVTA
        MEEPSTSPLPAAFLDFLKENGLDPSIYSATDSTPRYVRLKPGY+ADVEEIEDEIKCKLEKVSWLPGFYSLPPDVQIAGSHAYKTGKIYGIDAASGAAVTA
Subjt:  MEEPSTSPLPAAFLDFLKENGLDPSIYSATDSTPRYVRLKPGYKADVEEIEDEIKCKLEKVSWLPGFYSLPPDVQIAGSHAYKTGKIYGIDAASGAAVTA

Query:  LDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLAACRTMLQKYALGDICRLFVADGATFSLTPVNVTSESKFGESTLEDSVDRLREWTSR
        LDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLAACRTMLQKYALGDICRLFVADGATFSL PVNVTSES+ GESTLEDSVDRL+EWTSR
Subjt:  LDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLAACRTMLQKYALGDICRLFVADGATFSLTPVNVTSESKFGESTLEDSVDRLREWTSR

Query:  KSWKERKKTAKAKQNVSVELVQSGQDPELIFYGLKSGVVGFTKSEIYRSPPENELLSYGYDRVLVDAECTHDGSIKHIQKFESWGWTSFQRRVLDAERTD
        KSWKERKKTAKAKQNV+ ELVQ  QDPELIFYGL SGVVGFTKSEIY+SPPENELLSYGYDRVLVDAECTHDGSIKHIQKFESWGWTS QRRVLDAERTD
Subjt:  KSWKERKKTAKAKQNVSVELVQSGQDPELIFYGLKSGVVGFTKSEIYRSPPENELLSYGYDRVLVDAECTHDGSIKHIQKFESWGWTSFQRRVLDAERTD

Query:  NLTVLQLRLLANGFRLLKSGGILVYSTCSLTVAQNEDVVDQFLKDNASAELQEIEVARNWPCKSGGIPKTLRFDPLTSQTSGLFVAKFLKIS
        NL+VLQLRLL NGFRLLKSGGILVYSTCSLTVAQNEDVV+QFLKDNASAELQEIE ARNWPCKSGGIPKTLRFDPLTSQTSGLFVAKFLKI+
Subjt:  NLTVLQLRLLANGFRLLKSGGILVYSTCSLTVAQNEDVVDQFLKDNASAELQEIEVARNWPCKSGGIPKTLRFDPLTSQTSGLFVAKFLKIS

A0A5D3CN07 Oligopeptide transporter 7-like2.1e-20995.79Show/hide
Query:  MEEPSTSPLPAAFLDFLKENGLDPSIYSATDSTPRYVRLKPGYKADVEEIEDEIKCKLEKVSWLPGFYSLPPDVQIAGSHAYKTGKIYGIDAASGAAVTA
        MEEPSTSPLPAAFLDFLKENGLDPSIYSATDSTPRYVRLKPGY+ADVEEIEDEIKCKLEKVSWLPGFYSLPPDVQIAGSHAYKTGKIYGIDAASGAAVTA
Subjt:  MEEPSTSPLPAAFLDFLKENGLDPSIYSATDSTPRYVRLKPGYKADVEEIEDEIKCKLEKVSWLPGFYSLPPDVQIAGSHAYKTGKIYGIDAASGAAVTA

Query:  LDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLAACRTMLQKYALGDICRLFVADGATFSLTPVNVTSESKFGESTLEDSVDRLREWTSR
        LDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLAACRTMLQKYALGDICRLFVADGATFSL PVNVTSES+ GESTLEDSVDRL+EWTSR
Subjt:  LDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLAACRTMLQKYALGDICRLFVADGATFSLTPVNVTSESKFGESTLEDSVDRLREWTSR

Query:  KSWKERKKTAKAKQNVSVELVQSGQDPELIFYGLKSGVVGFTKSEIYRSPPENELLSYGYDRVLVDAECTHDGSIKHIQKFESWGWTSFQRRVLDAERTD
        KSWKERKKTAKAKQNV+ ELVQ  QDPELIFYGL SGVVGFTKSEIY+SPPENELLSYGYDRVLVDAECTHDGSIKHIQKFESWGWTS QRRVLDAERTD
Subjt:  KSWKERKKTAKAKQNVSVELVQSGQDPELIFYGLKSGVVGFTKSEIYRSPPENELLSYGYDRVLVDAECTHDGSIKHIQKFESWGWTSFQRRVLDAERTD

Query:  NLTVLQLRLLANGFRLLKSGGILVYSTCSLTVAQNEDVVDQFLKDNASAELQEIEVARNWPCKSGGIPKTLRFDPLTSQT
        NL+VLQLRLL NGFRLLKSGGILVYSTCSLTVAQNEDVV+QFLKDNASAELQEIE ARNWPCKSGGIPKTLRFDPLTSQT
Subjt:  NLTVLQLRLLANGFRLLKSGGILVYSTCSLTVAQNEDVVDQFLKDNASAELQEIEVARNWPCKSGGIPKTLRFDPLTSQT

A0A6J1IRX8 uncharacterized protein LOC111477890 isoform X11.5e-20491.07Show/hide
Query:  MEEPSTSPLPAAFLDFLKENGLDPSIYSATDSTPRYVRLKPGYKADVEEIEDEIKCKLEKVSWLPGFYSLPPDVQIAGSHAYKTGKIYGIDAASGAAVTA
        MEEPSTSPLPAAFLDFLKEN LDPSIY+ATDSTPRYVRLKPGY+ADVEE+E EIKCKLEKVSWLPGFYSLPPDVQIAGS AYKTG+IYGIDAASGAAVTA
Subjt:  MEEPSTSPLPAAFLDFLKENGLDPSIYSATDSTPRYVRLKPGYKADVEEIEDEIKCKLEKVSWLPGFYSLPPDVQIAGSHAYKTGKIYGIDAASGAAVTA

Query:  LDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLAACRTMLQKYALGDICRLFVADGATFSLTPVNVTSESKFGESTLEDSVDRLREWTSR
        LD+ PGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLAACRTMLQKYALGDICRLFVADGATFSL PVNVTSES+  ESTLEDSVDRL+EWTSR
Subjt:  LDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLAACRTMLQKYALGDICRLFVADGATFSLTPVNVTSESKFGESTLEDSVDRLREWTSR

Query:  KSWKERKKTAKAKQNVSVELVQSGQDPELIFYGLKSGVVGFTKSEIYRSPPENELLSYGYDRVLVDAECTHDGSIKHIQKFESWGWTSFQRRVLDAERTD
        KSWKERKK A AKQN + +LVQ  Q+PELIFYGLKSGVVGFTKSEIYRSP E+ELLSYGYDRVLVDAECTHDGSIKHIQKFESWGWTS QRRVLDAERTD
Subjt:  KSWKERKKTAKAKQNVSVELVQSGQDPELIFYGLKSGVVGFTKSEIYRSPPENELLSYGYDRVLVDAECTHDGSIKHIQKFESWGWTSFQRRVLDAERTD

Query:  NLTVLQLRLLANGFRLLKSGGILVYSTCSLTVAQNEDVVDQFLKDNASAELQEIEVARNWPCKSGGIPKTLRFDPLTSQTSGLFVAKFLKIS
        NLTVLQLRLL+NGFRLLKSGGILVYSTCSLTVAQNEDVV+QFLKDNASAELQEIE  RNWPCKSG IPKTLRFDP  SQTSGLFVAKFLK++
Subjt:  NLTVLQLRLLANGFRLLKSGGILVYSTCSLTVAQNEDVVDQFLKDNASAELQEIEVARNWPCKSGGIPKTLRFDPLTSQTSGLFVAKFLKIS

SwissProt top hitse value%identityAlignment
Q5SII2 Ribosomal RNA small subunit methyltransferase F1.1e-1032.3Show/hide
Query:  IYRSPPENELLSYG--YDRVLVDAECTHDGSI-KHIQKFESWGWTSFQRRVLDAERTDNLTVLQLRLLANGFRLLKSGGILVYSTCSLTVAQNEDVVDQF
        + ++PP     ++G  + RVL+DA C+ +G   K  +    WG ++ +R          +  +Q  LLA   RLL  GG+LVYSTC+    +NE VV  F
Subjt:  IYRSPPENELLSYG--YDRVLVDAECTHDGSI-KHIQKFESWGWTSFQRRVLDAERTDNLTVLQLRLLANGFRLLKSGGILVYSTCSLTVAQNEDVVDQF

Query:  LKDNASAELQEIEVARNWPCKSGGIP----------KTLRFDPLTSQTSGLFVAKFLKISG
        LK +    L++   AR  P  + G+P          KT R  P   +  G F+A+F K  G
Subjt:  LKDNASAELQEIEVARNWPCKSGGIP----------KTLRFDPLTSQTSGLFVAKFLKISG

Q84MA1 rRNA (cytosine-C(5))-methyltransferase NOP2C2.4e-1324.43Show/hide
Query:  HAYKTGKIYGIDAASGAAVTALDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLA-----------ACRTMLQKYALGDICRLFVADGAT
        H    G+I+  +  S     ALD   G  +LD+CAAPG K   I  L++  G +   D S +++             C T  +  AL  +C     + +T
Subjt:  HAYKTGKIYGIDAASGAAVTALDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLA-----------ACRTMLQKYALGDICRLFVADGAT

Query:  FSLTPVNVTSESKFGESTLEDSVDRLREWTSRKSWKERKKTAKAKQNVSVELVQSGQDPELIFYGLKSGVV----GFTKSEIYRSPPENELLSYGYDRVL
          +   N +S +   E +   S + +   TSR+S  ++      ++N S E    G +    +     G +    G T+ +  R+          +DRVL
Subjt:  FSLTPVNVTSESKFGESTLEDSVDRLREWTSRKSWKERKKTAKAKQNVSVELVQSGQDPELIFYGLKSGVV----GFTKSEIYRSPPENELLSYGYDRVL

Query:  VDAECTHDG-------SIKHIQKFESWGWTSFQRRVLDAERTDNLTVLQLRLLANGFRLLKSGGILVYSTCSLTVAQNEDVVDQFLKDNASAELQEIEVA
        +DA C+  G        ++ +    + GW  +QR++LD                   +L++ GGILVYSTC++  ++NE VV   L       L      
Subjt:  VDAECTHDG-------SIKHIQKFESWGWTSFQRRVLDAERTDNLTVLQLRLLANGFRLLKSGGILVYSTCSLTVAQNEDVVDQFLKDNASAELQEIEVA

Query:  RNWP----------------CKSGGIPKTLRFDPLTS-QTSGLFVAKF
           P                 K G      +FDP +   T G F+AKF
Subjt:  RNWP----------------CKSGGIPKTLRFDPLTS-QTSGLFVAKF

Q8CCT7 tRNA (cytosine(34)-C(5))-methyltransferase, mitochondrial5.5e-1030.14Show/hide
Query:  YDRVLVDAECTHDGSIKHIQKFESWGWTS-FQRRVLDAERTDNLTVLQLRLLANGFRLLKSGGILVYSTCSLTVAQNEDVVDQFLKDNASAELQEIE-VA
        +D+VLVDA C++D          SW ++S  Q+      +  NL VLQ+ L+ +  + L+ GG+LVYSTC+L+ A+N+DV+ + L  +++    +I  +A
Subjt:  YDRVLVDAECTHDGSIKHIQKFESWGWTS-FQRRVLDAERTDNLTVLQLRLLANGFRLLKSGGILVYSTCSLTVAQNEDVVDQFLKDNASAELQEIE-VA

Query:  RNWPCKSGGIPKTLRFDPLTSQTSG-----LFVAKFLKISGIKKRE
        R +       P   +   L     G     +++AK  K    +KR+
Subjt:  RNWPCKSGGIPKTLRFDPLTSQTSG-----LFVAKFLKISGIKKRE

Q9H649 tRNA (cytosine(34)-C(5))-methyltransferase, mitochondrial9.4e-1033.85Show/hide
Query:  YDRVLVDAECTHDGSIKHIQKFESWGWTS-FQRRVLDAERTDNLTVLQLRLLANGFRLLKSGGILVYSTCSLTVAQNEDVVDQFLKDNASAELQEIEVAR
        +D+VLVDA C++D          SW ++S  Q+      +  NL +LQ+ LL +  + L+ GGILVYSTC+L+ A+N+DV+ + L  + +    +I+   
Subjt:  YDRVLVDAECTHDGSIKHIQKFESWGWTS-FQRRVLDAERTDNLTVLQLRLLANGFRLLKSGGILVYSTCSLTVAQNEDVVDQFLKDNASAELQEIEVAR

Query:  NWPCKSGGIPKTLRFD---PLTSQTSGLFV
               GI +T   D     T Q  GL V
Subjt:  NWPCKSGGIPKTLRFD---PLTSQTSGLFV

Q9V106 tRNA (cytosine(49)-C(5))-methyltransferase5.9e-1224.24Show/hide
Query:  KADVEEIED--EIKCKLEKVSWL--PGFYSLPPDVQIAGSHAYKTGKIYGIDAASGAAVTALDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVS
        K +++EI+   E K +LE + W    GFY    DV       Y  G I   +A+S      LD  P   +LD+ AAPG+K   +   ++  G +   D  
Subjt:  KADVEEIED--EIKCKLEKVSWL--PGFYSLPPDVQIAGSHAYKTGKIYGIDAASGAAVTALDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVS

Query:  QHRLAACRTMLQKYALGDICRLFVADGATFSLTPVNVTSESKFGESTLEDSVDRLREWTSRKSWKERKKTAKAKQNVSVELVQSGQDPELIFYGLKSGVV
        + R       L +  +  I ++ V DGA F+             E+T                                                     
Subjt:  QHRLAACRTMLQKYALGDICRLFVADGATFSLTPVNVTSESKFGESTLEDSVDRLREWTSRKSWKERKKTAKAKQNVSVELVQSGQDPELIFYGLKSGVV

Query:  GFTKSEIYRSPPENELLSYGYDRVLVDAECTHDGSIKHIQKF-ESW--GWTSFQRRVLDAERTDNLTVLQLRLLANGFRLLKSGGILVYSTCSLTVAQNE
                            +DRVL+DA C+  G I+   KF  +W  G   +  R            LQ RL+   ++ LK GG+LVYSTC++   +NE
Subjt:  GFTKSEIYRSPPENELLSYGYDRVLVDAECTHDGSIKHIQKF-ESW--GWTSFQRRVLDAERTDNLTVLQLRLLANGFRLLKSGGILVYSTCSLTVAQNE

Query:  DVVDQFLKDNASAELQEI-------EVARNWPCK--SGGIPKTLRFDPLTSQTSGLFVAKFLK
        +VVD FL     A+L+++       E    W  +  S  + KT+R  P  + T   ++AK  K
Subjt:  DVVDQFLKDNASAELQEI-------EVARNWPCK--SGGIPKTLRFDPLTSQTSGLFVAKFLK

Arabidopsis top hitse value%identityAlignment
AT1G06560.1 NOL1/NOP2/sun family protein1.7e-1424.43Show/hide
Query:  HAYKTGKIYGIDAASGAAVTALDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLA-----------ACRTMLQKYALGDICRLFVADGAT
        H    G+I+  +  S     ALD   G  +LD+CAAPG K   I  L++  G +   D S +++             C T  +  AL  +C     + +T
Subjt:  HAYKTGKIYGIDAASGAAVTALDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLA-----------ACRTMLQKYALGDICRLFVADGAT

Query:  FSLTPVNVTSESKFGESTLEDSVDRLREWTSRKSWKERKKTAKAKQNVSVELVQSGQDPELIFYGLKSGVV----GFTKSEIYRSPPENELLSYGYDRVL
          +   N +S +   E +   S + +   TSR+S  ++      ++N S E    G +    +     G +    G T+ +  R+          +DRVL
Subjt:  FSLTPVNVTSESKFGESTLEDSVDRLREWTSRKSWKERKKTAKAKQNVSVELVQSGQDPELIFYGLKSGVV----GFTKSEIYRSPPENELLSYGYDRVL

Query:  VDAECTHDG-------SIKHIQKFESWGWTSFQRRVLDAERTDNLTVLQLRLLANGFRLLKSGGILVYSTCSLTVAQNEDVVDQFLKDNASAELQEIEVA
        +DA C+  G        ++ +    + GW  +QR++LD                   +L++ GGILVYSTC++  ++NE VV   L       L      
Subjt:  VDAECTHDG-------SIKHIQKFESWGWTSFQRRVLDAERTDNLTVLQLRLLANGFRLLKSGGILVYSTCSLTVAQNEDVVDQFLKDNASAELQEIEVA

Query:  RNWP----------------CKSGGIPKTLRFDPLTS-QTSGLFVAKF
           P                 K G      +FDP +   T G F+AKF
Subjt:  RNWP----------------CKSGGIPKTLRFDPLTS-QTSGLFVAKF

AT5G55920.1 S-adenosyl-L-methionine-dependent methyltransferases superfamily protein1.4e-0830.97Show/hide
Query:  DRVLVDAECTHDGSIKHIQKFESWGWTSFQRRVLDAERTDNLTVLQLRLLANGFRLL----KSGGILVYSTCSLTVAQNEDVVDQFLKDNASAELQEIEV
        DRVL+DA C+  G    I K ES   T         +       LQ +LL     ++    K+GG +VYSTCS+ V +NE V+D  LK     +++ +  
Subjt:  DRVLVDAECTHDGSIKHIQKFESWGWTSFQRRVLDAERTDNLTVLQLRLLANGFRLL----KSGGILVYSTCSLTVAQNEDVVDQFLKDNASAELQEIEV

Query:  ARNWPCK----------SGGIPKTLRFDPLTSQTSGLFVAKFLKISGIKKRESRG
          ++  K             + KT RF P      G FVAK  K+S +K+    G
Subjt:  ARNWPCK----------SGGIPKTLRFDPLTSQTSGLFVAKFLKISGIKKRESRG

AT5G66180.1 S-adenosyl-L-methionine-dependent methyltransferases superfamily protein4.2e-14667.35Show/hide
Query:  MEEPSTSPLPAAFLDFLKENGLDPSIYSATDSTPRYVRLKPGYKADVEEIEDEIKCKLEKVSWLPGFYSLPPDVQIAGSHAYKTGKIYGIDAASGAAVTA
        MEE S   LP +FL FL+ NGLDPSIY+  DS PRYVRLKPG++   EEIE EI CKLEKV+WLPGFYS+PPDV IA + AY+ G +YGIDAASGAAV+A
Subjt:  MEEPSTSPLPAAFLDFLKENGLDPSIYSATDSTPRYVRLKPGYKADVEEIEDEIKCKLEKVSWLPGFYSLPPDVQIAGSHAYKTGKIYGIDAASGAAVTA

Query:  LDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLAACRTMLQKYALGDICRLFVADGATFSLTPVNVTSESKFGESTLEDSVDRLREWTSR
        L I PG+HVLDLCAAPGAKLCM++DLL   G+ TGVDV++HRLAACRTML KY L +  RLF+ADG TFS+ P    + +   ES ++D  D  ++WTSR
Subjt:  LDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLAACRTMLQKYALGDICRLFVADGATFSLTPVNVTSESKFGESTLEDSVDRLREWTSR

Query:  KSWKERKKTAKAKQNVSVELVQSGQDPELIFYGLKSGVVGFTKSEIYRSPPENELLSYGYDRVLVDAECTHDGSIKHIQKFESWGWTSFQRRVLDAERTD
        + +KERK+ AKA++N SV L Q+GQ  E+IFYG  SGV+G  K+E+YRS  +N+  SYGYD+VLVDAECTHDGSIKHIQKFE WGW + +RRVLDAERTD
Subjt:  KSWKERKKTAKAKQNVSVELVQSGQDPELIFYGLKSGVVGFTKSEIYRSPPENELLSYGYDRVLVDAECTHDGSIKHIQKFESWGWTSFQRRVLDAERTD

Query:  -NLTVLQLRLLANGFRLLKSGGILVYSTCSLTVAQNEDVVDQFLKDNASAELQEIEVARNWPCKSGGIPKTLRFDPLTSQTSGLFVAKFLKI
         NL  LQL LL NGFRLLK  G LVYSTCSLT AQNEDVVDQFL +N SAELQEIE+A++WPC+SG  PKTLRFDP TS TSGLFVAK  K+
Subjt:  -NLTVLQLRLLANGFRLLKSGGILVYSTCSLTVAQNEDVVDQFLKDNASAELQEIEVARNWPCKSGGIPKTLRFDPLTSQTSGLFVAKFLKI

AT5G66180.2 S-adenosyl-L-methionine-dependent methyltransferases superfamily protein3.1e-11765.65Show/hide
Query:  MEEPSTSPLPAAFLDFLKENGLDPSIYSATDSTPRYVRLKPGYKADVEEIEDEIKCKLEKVSWLPGFYSLPPDVQIAGSHAYKTGKIYGIDAASGAAVTA
        MEE S   LP +FL FL+ NGLDPSIY+  DS PRYVRLKPG++   EEIE EI CKLEKV+WLPGFYS+PPDV IA + AY+ G +YGIDAASGAAV+A
Subjt:  MEEPSTSPLPAAFLDFLKENGLDPSIYSATDSTPRYVRLKPGYKADVEEIEDEIKCKLEKVSWLPGFYSLPPDVQIAGSHAYKTGKIYGIDAASGAAVTA

Query:  LDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLAACRTMLQKYALGDICRLFVADGATFSLTPVNVTSESKFGESTLEDSVDRLREWTSR
        L I PG+HVLDLCAAPGAKLCM++DLL   G+ TGVDV++HRLAACRTML KY L +  RLF+ADG TFS+ P    + +   ES ++D  D  ++WTSR
Subjt:  LDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLAACRTMLQKYALGDICRLFVADGATFSLTPVNVTSESKFGESTLEDSVDRLREWTSR

Query:  KSWKERKKTAKAKQNVSVELVQSGQDPELIFYGLKSGVVGFTKSEIYRSPPENELLSYGYDRVLVDAECTHDGSIKHIQKFESWGWTSFQRRVLDAERTD
        + +KERK+ AKA++N SV L Q+GQ  E+IFYG  SGV+G  K+E+YRS  +N+  SYGYD+VLVDAECTHDGSIKHIQKFE WGW + +RRVLDAERTD
Subjt:  KSWKERKKTAKAKQNVSVELVQSGQDPELIFYGLKSGVVGFTKSEIYRSPPENELLSYGYDRVLVDAECTHDGSIKHIQKFESWGWTSFQRRVLDAERTD

Query:  -NLTVLQLRLLANGFRLLKSGGILVYSTC
         NL  LQL LL NGFRLLK  G LVYSTC
Subjt:  -NLTVLQLRLLANGFRLLKSGGILVYSTC

AT5G66180.3 S-adenosyl-L-methionine-dependent methyltransferases superfamily protein2.6e-13267.23Show/hide
Query:  YVRLKPGYKADVEEIEDEIKCKLEKVSWLPGFYSLPPDVQIAGSHAYKTGKIYGIDAASGAAVTALDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTG
        +V LKPG++   EEIE EI CKLEKV+WLPGFYS+PPDV IA + AY+ G +YGIDAASGAAV+AL I PG+HVLDLCAAPGAKLCM++DLL   G+ TG
Subjt:  YVRLKPGYKADVEEIEDEIKCKLEKVSWLPGFYSLPPDVQIAGSHAYKTGKIYGIDAASGAAVTALDILPGNHVLDLCAAPGAKLCMIVDLLDGLGSVTG

Query:  VDVSQHRLAACRTMLQKYALGDICRLFVADGATFSLTPVNVTSESKFGESTLEDSVDRLREWTSRKSWKERKKTAKAKQNVSVELVQSGQDPELIFYGLK
        VDV++HRLAACRTML KY L +  RLF+ADG TFS+ P    + +   ES ++D  D  ++WTSR+ +KERK+ AKA++N SV L Q+GQ  E+IFYG  
Subjt:  VDVSQHRLAACRTMLQKYALGDICRLFVADGATFSLTPVNVTSESKFGESTLEDSVDRLREWTSRKSWKERKKTAKAKQNVSVELVQSGQDPELIFYGLK

Query:  SGVVGFTKSEIYRSPPENELLSYGYDRVLVDAECTHDGSIKHIQKFESWGWTSFQRRVLDAERTD-NLTVLQLRLLANGFRLLKSGGILVYSTCSLTVAQ
        SGV+G  K+E+YRS  +N+  SYGYD+VLVDAECTHDGSIKHIQKFE WGW + +RRVLDAERTD NL  LQL LL NGFRLLK  G LVYSTCSLT AQ
Subjt:  SGVVGFTKSEIYRSPPENELLSYGYDRVLVDAECTHDGSIKHIQKFESWGWTSFQRRVLDAERTD-NLTVLQLRLLANGFRLLKSGGILVYSTCSLTVAQ

Query:  NEDVVDQFLKDNASAELQEIEVARNWPCKSGGIPKTLRFDPLTSQTSGLFVAKFLKI
        NEDVVDQFL +N SAELQEIE+A++WPC+SG  PKTLRFDP TS TSGLFVAK  K+
Subjt:  NEDVVDQFLKDNASAELQEIEVARNWPCKSGGIPKTLRFDPLTSQTSGLFVAKFLKI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGAACCTTCTACATCGCCGTTGCCAGCGGCGTTTCTTGATTTTCTTAAGGAGAATGGTCTGGACCCTTCCATTTACTCCGCTACTGATTCCACTCCTCGTTATGT
CAGGTTGAAGCCTGGTTATAAAGCCGATGTCGAGGAGATAGAGGACGAGATCAAGTGCAAGCTTGAGAAAGTCAGTTGGCTGCCTGGATTCTATTCGCTTCCGCCTGATG
TGCAAATTGCTGGTTCTCATGCTTACAAAACAGGAAAGATATATGGGATAGATGCAGCTTCCGGTGCTGCTGTAACTGCCTTGGACATTTTACCGGGAAACCATGTTCTC
GATCTTTGTGCAGCTCCAGGGGCCAAACTTTGCATGATCGTAGACCTTCTTGATGGTTTGGGTTCTGTAACCGGAGTAGATGTTTCACAGCATCGTTTAGCTGCTTGTAG
AACAATGCTGCAAAAATATGCTCTTGGTGATATTTGCCGACTTTTTGTTGCTGATGGAGCAACGTTTTCTCTCACTCCCGTGAATGTTACTTCAGAGTCTAAATTTGGTG
AATCTACCTTAGAAGATAGCGTTGACAGATTGAGGGAGTGGACCTCCAGAAAATCTTGGAAGGAAAGGAAAAAAACAGCCAAAGCGAAACAAAATGTTTCTGTGGAGTTA
GTTCAGAGCGGTCAGGATCCAGAACTCATCTTTTATGGGCTGAAATCTGGTGTGGTTGGGTTCACTAAGAGTGAAATTTACCGAAGCCCTCCAGAAAATGAGTTGTTGAG
CTATGGGTATGATAGGGTCCTAGTGGATGCCGAGTGCACTCATGATGGCTCTATAAAGCATATCCAGAAATTTGAAAGTTGGGGTTGGACATCGTTTCAGCGCAGAGTAT
TGGACGCTGAGAGAACAGATAATTTGACTGTACTTCAGTTAAGACTCCTTGCCAATGGTTTTAGGTTGCTCAAATCTGGTGGGATACTTGTCTACAGCACTTGCAGTCTG
ACCGTAGCACAGAACGAGGATGTGGTGGATCAGTTTCTCAAGGATAATGCTTCCGCAGAGTTGCAGGAGATAGAAGTTGCCAGAAATTGGCCATGTAAGAGCGGAGGAAT
ACCAAAAACCTTACGCTTTGATCCTTTGACATCACAGACAAGTGGGCTTTTTGTAGCGAAATTCTTGAAGATAAGCGGAATAAAAAAGCGTGAATCACGTGGTTAG
mRNA sequenceShow/hide mRNA sequence
CAGTGACACAAAATCGCGTCCCGGAAACGCGGTTTCAAAACCCACATTCCCAGTTCTACGAATCGCCGCGACCGTTGTAAACCCTAGTTCAAGATTCAAATCCTCTCCAC
TTTCTTTCTTCCATTGATTTCGGCTTTGCCTTATAGTCCTAATCTATGGAGGAACCTTCTACATCGCCGTTGCCAGCGGCGTTTCTTGATTTTCTTAAGGAGAATGGTCT
GGACCCTTCCATTTACTCCGCTACTGATTCCACTCCTCGTTATGTCAGGTTGAAGCCTGGTTATAAAGCCGATGTCGAGGAGATAGAGGACGAGATCAAGTGCAAGCTTG
AGAAAGTCAGTTGGCTGCCTGGATTCTATTCGCTTCCGCCTGATGTGCAAATTGCTGGTTCTCATGCTTACAAAACAGGAAAGATATATGGGATAGATGCAGCTTCCGGT
GCTGCTGTAACTGCCTTGGACATTTTACCGGGAAACCATGTTCTCGATCTTTGTGCAGCTCCAGGGGCCAAACTTTGCATGATCGTAGACCTTCTTGATGGTTTGGGTTC
TGTAACCGGAGTAGATGTTTCACAGCATCGTTTAGCTGCTTGTAGAACAATGCTGCAAAAATATGCTCTTGGTGATATTTGCCGACTTTTTGTTGCTGATGGAGCAACGT
TTTCTCTCACTCCCGTGAATGTTACTTCAGAGTCTAAATTTGGTGAATCTACCTTAGAAGATAGCGTTGACAGATTGAGGGAGTGGACCTCCAGAAAATCTTGGAAGGAA
AGGAAAAAAACAGCCAAAGCGAAACAAAATGTTTCTGTGGAGTTAGTTCAGAGCGGTCAGGATCCAGAACTCATCTTTTATGGGCTGAAATCTGGTGTGGTTGGGTTCAC
TAAGAGTGAAATTTACCGAAGCCCTCCAGAAAATGAGTTGTTGAGCTATGGGTATGATAGGGTCCTAGTGGATGCCGAGTGCACTCATGATGGCTCTATAAAGCATATCC
AGAAATTTGAAAGTTGGGGTTGGACATCGTTTCAGCGCAGAGTATTGGACGCTGAGAGAACAGATAATTTGACTGTACTTCAGTTAAGACTCCTTGCCAATGGTTTTAGG
TTGCTCAAATCTGGTGGGATACTTGTCTACAGCACTTGCAGTCTGACCGTAGCACAGAACGAGGATGTGGTGGATCAGTTTCTCAAGGATAATGCTTCCGCAGAGTTGCA
GGAGATAGAAGTTGCCAGAAATTGGCCATGTAAGAGCGGAGGAATACCAAAAACCTTACGCTTTGATCCTTTGACATCACAGACAAGTGGGCTTTTTGTAGCGAAATTCT
TGAAGATAAGCGGAATAAAAAAGCGTGAATCACGTGGTTAGATCTCATTCTGGTGTTAATTGGGGATTTTGATAATTTACTCAACTTTGCTGTACAGTGCTTGAGACGTG
AAAAAGAAAGGTATAATATTTATTACACTCAATCGTTCATCAAATTAAGTCAATTCAAAATTAAGCTAAGCCATACCCCTTTTTAAAACTTCATATTATAATGTCAACCA
ATAGAGTGTGAGACCATATTTGTATTGCTTCATTTTTCTTTGGCTCTATTCTGTCACCTTATTTATTTATTGTTATTTTTAATAGAGATTGATTGCAGG
Protein sequenceShow/hide protein sequence
MEEPSTSPLPAAFLDFLKENGLDPSIYSATDSTPRYVRLKPGYKADVEEIEDEIKCKLEKVSWLPGFYSLPPDVQIAGSHAYKTGKIYGIDAASGAAVTALDILPGNHVL
DLCAAPGAKLCMIVDLLDGLGSVTGVDVSQHRLAACRTMLQKYALGDICRLFVADGATFSLTPVNVTSESKFGESTLEDSVDRLREWTSRKSWKERKKTAKAKQNVSVEL
VQSGQDPELIFYGLKSGVVGFTKSEIYRSPPENELLSYGYDRVLVDAECTHDGSIKHIQKFESWGWTSFQRRVLDAERTDNLTVLQLRLLANGFRLLKSGGILVYSTCSL
TVAQNEDVVDQFLKDNASAELQEIEVARNWPCKSGGIPKTLRFDPLTSQTSGLFVAKFLKISGIKKRESRG