; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr003558 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr003558
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionRibosomal RNA small subunit methyltransferase G
Genome locationtig00002237:9858..11763
RNA-Seq ExpressionSgr003558
SyntenySgr003558
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004144727.2 uncharacterized protein LOC101218708 isoform X1 [Cucumis sativus]3.3e-9465.53Show/hide
Query:  MKKSRKGVSMDSDAYAMFESPKIGIKHQRLLQDYEELHNETEAMKEKLVIAKRKKATLSAEVRFLRHRYEFLKNQSPNTWPKHGLDAPRNLEMRPPNAKK
        MKK+RKGV+ +S A A+FES  +GIKHQ LLQDYEELHNETEAMK+KL+IAKRKK TL  EVRFLRHRYE LK Q  N  PK G   PRNLE++PP  KK
Subjt:  MKKSRKGVSMDSDAYAMFESPKIGIKHQRLLQDYEELHNETEAMKEKLVIAKRKKATLSAEVRFLRHRYEFLKNQSPNTWPKHGLDAPRNLEMRPPNAKK

Query:  EKSSRKREASLKPLAQAHDLNQRGGIYNGMEATSRKSRSVFHINQKTRMCSEKEIAMHGSSPIFDQKERVYRVHEAAANRNMTPVFDLNQISVNMTLVLI
        EKSSRKREASLKPLAQAHD+NQRGGIYNG+EA+SRKS+S F +NQK+  CS+KE+ ++ S P FDQKERVYR HEAAANRNMTPVFDLNQIS        
Subjt:  EKSSRKREASLKPLAQAHDLNQRGGIYNGMEATSRKSRSVFHINQKTRMCSEKEIAMHGSSPIFDQKERVYRVHEAAANRNMTPVFDLNQISVNMTLVLI

Query:  SYRERHLSFSITFKYSSQIFNICFLLQREEEELQAGFEPLRSEEALKNSFLRSENDGKNSDMMISSMCRNVGSGSNRAGKRKISWQDQVALRA
                                   REEEELQAGF+P+R E+  KN F RSE+D KNS++++SSMCRN  +GSNRAGKRKISWQDQVALRA
Subjt:  SYRERHLSFSITFKYSSQIFNICFLLQREEEELQAGFEPLRSEEALKNSFLRSENDGKNSDMMISSMCRNVGSGSNRAGKRKISWQDQVALRA

XP_008452766.1 PREDICTED: uncharacterized protein LOC103493683 [Cucumis melo]8.7e-9566.67Show/hide
Query:  MKKSRKGVSMDSDAYAMFESPKIGIKHQRLLQDYEELHNETEAMKEKLVIAKRKKATLSAEVRFLRHRYEFLKNQSPNTWPKHGLDAPRNLEMRPPNAKK
        MKK+RKGV+ DS A A+FE+  +GIKHQ LLQDY+ELHNETEA+K+KL+IAKRKKATL  EVRFLRHRYE LKNQ  N  PK G    RNLE+RPP  KK
Subjt:  MKKSRKGVSMDSDAYAMFESPKIGIKHQRLLQDYEELHNETEAMKEKLVIAKRKKATLSAEVRFLRHRYEFLKNQSPNTWPKHGLDAPRNLEMRPPNAKK

Query:  EKSSRKREASLKPLAQAHDLNQRGGIYNGMEATSRKSRSVFHINQKTRMCSEKEIAMHGSSPIFDQKERVYRVHEAAANRNMTPVFDLNQISVNMTLVLI
        EKSSRKREASLKPLAQAHDLNQRGGIYNG+EA+SRKS+S F +NQK+  CS+KE+ M+ S P FDQKERVYR HEAAANRNMTPVFDLNQIS        
Subjt:  EKSSRKREASLKPLAQAHDLNQRGGIYNGMEATSRKSRSVFHINQKTRMCSEKEIAMHGSSPIFDQKERVYRVHEAAANRNMTPVFDLNQISVNMTLVLI

Query:  SYRERHLSFSITFKYSSQIFNICFLLQREEEELQAGFEPLRS-EEALKNSFLRSENDGKNSDMMISSMCRNVGSGSNRAGKRKISWQDQVALRA
                                   REEEE+QAGFEPLR  E+  KN F RSE+D KNSD+++SSMCRN  +GSNRAGKRKISWQDQVALRA
Subjt:  SYRERHLSFSITFKYSSQIFNICFLLQREEEELQAGFEPLRS-EEALKNSFLRSENDGKNSDMMISSMCRNVGSGSNRAGKRKISWQDQVALRA

XP_022158137.1 uncharacterized protein LOC111024696 [Momordica charantia]1.9e-10270.51Show/hide
Query:  MKKSRKGVSMDSDAYAMFESPKIGIKHQRLLQDYEELHNETEAMKEKLVIAKRKKATLSAEVRFLRHRYEFLKNQSPNTWPKHGLDAPRNLEMRPPNAKK
        MKK+RK  +MD DAYA+FE+P IG KH RLLQDYE+L N TE MKE+L+IAKRKK+TL AEVRFLRHRYEFLKNQS N+ PKHGL+ P+N E+RPPNAKK
Subjt:  MKKSRKGVSMDSDAYAMFESPKIGIKHQRLLQDYEELHNETEAMKEKLVIAKRKKATLSAEVRFLRHRYEFLKNQSPNTWPKHGLDAPRNLEMRPPNAKK

Query:  EKSSRKREASLKPL--AQAHDLNQRGGIYNGMEATSRKSRSVFHINQKTRMCSEKEIAMHGSSPIFDQKERVYRVHEAAANRNMTPVFDLNQISVNMTLV
        EKSS+KREASLK L  AQA DLNQRGGIY+GMEA SRKSR VFH+NQK RMCS+ E++MH SSPIF+ KE +YRVHEAAA+RNMTPVFDLNQIS      
Subjt:  EKSSRKREASLKPL--AQAHDLNQRGGIYNGMEATSRKSRSVFHINQKTRMCSEKEIAMHGSSPIFDQKERVYRVHEAAANRNMTPVFDLNQISVNMTLV

Query:  LISYRERHLSFSITFKYSSQIFNICFLLQREEEELQAGFEPLRSEEALKNSFLRSENDGKNSDMMISSMCRNVGSGSNRAGKRKISWQDQVALRA
                                     REEEELQAGFEP R EE  KNSFLRSENDGKNSD+MIS MCRNVGSGSNRAGKRKISWQDQVALRA
Subjt:  LISYRERHLSFSITFKYSSQIFNICFLLQREEEELQAGFEPLRSEEALKNSFLRSENDGKNSDMMISSMCRNVGSGSNRAGKRKISWQDQVALRA

XP_031736367.1 uncharacterized protein LOC101218708 isoform X2 [Cucumis sativus]6.9e-7664Show/hide
Query:  MKEKLVIAKRKKATLSAEVRFLRHRYEFLKNQSPNTWPKHGLDAPRNLEMRPPNAKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEATSRKSRSVFHI
        MK+KL+IAKRKK TL  EVRFLRHRYE LK Q  N  PK G   PRNLE++PP  KKEKSSRKREASLKPLAQAHD+NQRGGIYNG+EA+SRKS+S F +
Subjt:  MKEKLVIAKRKKATLSAEVRFLRHRYEFLKNQSPNTWPKHGLDAPRNLEMRPPNAKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEATSRKSRSVFHI

Query:  NQKTRMCSEKEIAMHGSSPIFDQKERVYRVHEAAANRNMTPVFDLNQISVNMTLVLISYRERHLSFSITFKYSSQIFNICFLLQREEEELQAGFEPLRSE
        NQK+  CS+KE+ ++ S P FDQKERVYR HEAAANRNMTPVFDLNQIS                                   REEEELQAGF+P+R E
Subjt:  NQKTRMCSEKEIAMHGSSPIFDQKERVYRVHEAAANRNMTPVFDLNQISVNMTLVLISYRERHLSFSITFKYSSQIFNICFLLQREEEELQAGFEPLRSE

Query:  EALKNSFLRSENDGKNSDMMISSMCRNVGSGSNRAGKRKISWQDQVALRA
        +  KN F RSE+D KNS++++SSMCRN  +GSNRAGKRKISWQDQVALRA
Subjt:  EALKNSFLRSENDGKNSDMMISSMCRNVGSGSNRAGKRKISWQDQVALRA

XP_038889534.1 uncharacterized protein LOC120079432 [Benincasa hispida]1.3e-9365.87Show/hide
Query:  MKKSRKGVSMDSDAYAMFESPKIGIKHQRLLQDYEELHNETEAMKEKLVIAKRKKATLSAEVRFLRHRYEFLKNQSPNTWPKHGLDAPRNLEMRPPNAKK
        MKK+RKG+++DS+A A+FE+  IG+KHQ LLQDYEEL NETEAMKEKL+IAKRKK TL AEVRFLRHRYE LKN+  NT PK   + P +LE+ PP  KK
Subjt:  MKKSRKGVSMDSDAYAMFESPKIGIKHQRLLQDYEELHNETEAMKEKLVIAKRKKATLSAEVRFLRHRYEFLKNQSPNTWPKHGLDAPRNLEMRPPNAKK

Query:  EKSSRKREASLKPLAQAHDLNQRGGIYNGMEATSRKSRSVFHINQKTRMCSEKEIAMHGSSPIFDQKERVYRVHEAAANRNMTPVFDLNQISVNMTLVLI
         KSSRK EASLKPLA+AHDLNQRGGIYNGMEA SRKS+S F+INQK+RMCS+KE+ +  S PIFDQKERVYRVHE   +RNMTPVFDLNQIS        
Subjt:  EKSSRKREASLKPLAQAHDLNQRGGIYNGMEATSRKSRSVFHINQKTRMCSEKEIAMHGSSPIFDQKERVYRVHEAAANRNMTPVFDLNQISVNMTLVLI

Query:  SYRERHLSFSITFKYSSQIFNICFLLQREEEELQAGFEPLRSEEALKNSFLRSENDGKNSDMMISSMCRNVGSGSNRAGKRKISWQDQVALRA
                                   REEEELQAGFEPLR E+  KN F RSE+D KNSD+++SSMCRN G+GSN AGKRKISWQDQVALRA
Subjt:  SYRERHLSFSITFKYSSQIFNICFLLQREEEELQAGFEPLRSEEALKNSFLRSENDGKNSDMMISSMCRNVGSGSNRAGKRKISWQDQVALRA

TrEMBL top hitse value%identityAlignment
A0A0A0LH29 Uncharacterized protein1.6e-9465.53Show/hide
Query:  MKKSRKGVSMDSDAYAMFESPKIGIKHQRLLQDYEELHNETEAMKEKLVIAKRKKATLSAEVRFLRHRYEFLKNQSPNTWPKHGLDAPRNLEMRPPNAKK
        MKK+RKGV+ +S A A+FES  +GIKHQ LLQDYEELHNETEAMK+KL+IAKRKK TL  EVRFLRHRYE LK Q  N  PK G   PRNLE++PP  KK
Subjt:  MKKSRKGVSMDSDAYAMFESPKIGIKHQRLLQDYEELHNETEAMKEKLVIAKRKKATLSAEVRFLRHRYEFLKNQSPNTWPKHGLDAPRNLEMRPPNAKK

Query:  EKSSRKREASLKPLAQAHDLNQRGGIYNGMEATSRKSRSVFHINQKTRMCSEKEIAMHGSSPIFDQKERVYRVHEAAANRNMTPVFDLNQISVNMTLVLI
        EKSSRKREASLKPLAQAHD+NQRGGIYNG+EA+SRKS+S F +NQK+  CS+KE+ ++ S P FDQKERVYR HEAAANRNMTPVFDLNQIS        
Subjt:  EKSSRKREASLKPLAQAHDLNQRGGIYNGMEATSRKSRSVFHINQKTRMCSEKEIAMHGSSPIFDQKERVYRVHEAAANRNMTPVFDLNQISVNMTLVLI

Query:  SYRERHLSFSITFKYSSQIFNICFLLQREEEELQAGFEPLRSEEALKNSFLRSENDGKNSDMMISSMCRNVGSGSNRAGKRKISWQDQVALRA
                                   REEEELQAGF+P+R E+  KN F RSE+D KNS++++SSMCRN  +GSNRAGKRKISWQDQVALRA
Subjt:  SYRERHLSFSITFKYSSQIFNICFLLQREEEELQAGFEPLRSEEALKNSFLRSENDGKNSDMMISSMCRNVGSGSNRAGKRKISWQDQVALRA

A0A1S3BVT4 uncharacterized protein LOC1034936834.2e-9566.67Show/hide
Query:  MKKSRKGVSMDSDAYAMFESPKIGIKHQRLLQDYEELHNETEAMKEKLVIAKRKKATLSAEVRFLRHRYEFLKNQSPNTWPKHGLDAPRNLEMRPPNAKK
        MKK+RKGV+ DS A A+FE+  +GIKHQ LLQDY+ELHNETEA+K+KL+IAKRKKATL  EVRFLRHRYE LKNQ  N  PK G    RNLE+RPP  KK
Subjt:  MKKSRKGVSMDSDAYAMFESPKIGIKHQRLLQDYEELHNETEAMKEKLVIAKRKKATLSAEVRFLRHRYEFLKNQSPNTWPKHGLDAPRNLEMRPPNAKK

Query:  EKSSRKREASLKPLAQAHDLNQRGGIYNGMEATSRKSRSVFHINQKTRMCSEKEIAMHGSSPIFDQKERVYRVHEAAANRNMTPVFDLNQISVNMTLVLI
        EKSSRKREASLKPLAQAHDLNQRGGIYNG+EA+SRKS+S F +NQK+  CS+KE+ M+ S P FDQKERVYR HEAAANRNMTPVFDLNQIS        
Subjt:  EKSSRKREASLKPLAQAHDLNQRGGIYNGMEATSRKSRSVFHINQKTRMCSEKEIAMHGSSPIFDQKERVYRVHEAAANRNMTPVFDLNQISVNMTLVLI

Query:  SYRERHLSFSITFKYSSQIFNICFLLQREEEELQAGFEPLRS-EEALKNSFLRSENDGKNSDMMISSMCRNVGSGSNRAGKRKISWQDQVALRA
                                   REEEE+QAGFEPLR  E+  KN F RSE+D KNSD+++SSMCRN  +GSNRAGKRKISWQDQVALRA
Subjt:  SYRERHLSFSITFKYSSQIFNICFLLQREEEELQAGFEPLRS-EEALKNSFLRSENDGKNSDMMISSMCRNVGSGSNRAGKRKISWQDQVALRA

A0A5A7URJ0 Uncharacterized protein4.2e-9566.67Show/hide
Query:  MKKSRKGVSMDSDAYAMFESPKIGIKHQRLLQDYEELHNETEAMKEKLVIAKRKKATLSAEVRFLRHRYEFLKNQSPNTWPKHGLDAPRNLEMRPPNAKK
        MKK+RKGV+ DS A A+FE+  +GIKHQ LLQDY+ELHNETEA+K+KL+IAKRKKATL  EVRFLRHRYE LKNQ  N  PK G    RNLE+RPP  KK
Subjt:  MKKSRKGVSMDSDAYAMFESPKIGIKHQRLLQDYEELHNETEAMKEKLVIAKRKKATLSAEVRFLRHRYEFLKNQSPNTWPKHGLDAPRNLEMRPPNAKK

Query:  EKSSRKREASLKPLAQAHDLNQRGGIYNGMEATSRKSRSVFHINQKTRMCSEKEIAMHGSSPIFDQKERVYRVHEAAANRNMTPVFDLNQISVNMTLVLI
        EKSSRKREASLKPLAQAHDLNQRGGIYNG+EA+SRKS+S F +NQK+  CS+KE+ M+ S P FDQKERVYR HEAAANRNMTPVFDLNQIS        
Subjt:  EKSSRKREASLKPLAQAHDLNQRGGIYNGMEATSRKSRSVFHINQKTRMCSEKEIAMHGSSPIFDQKERVYRVHEAAANRNMTPVFDLNQISVNMTLVLI

Query:  SYRERHLSFSITFKYSSQIFNICFLLQREEEELQAGFEPLRS-EEALKNSFLRSENDGKNSDMMISSMCRNVGSGSNRAGKRKISWQDQVALRA
                                   REEEE+QAGFEPLR  E+  KN F RSE+D KNSD+++SSMCRN  +GSNRAGKRKISWQDQVALRA
Subjt:  SYRERHLSFSITFKYSSQIFNICFLLQREEEELQAGFEPLRS-EEALKNSFLRSENDGKNSDMMISSMCRNVGSGSNRAGKRKISWQDQVALRA

A0A6J1DWE7 uncharacterized protein LOC1110246969.4e-10370.51Show/hide
Query:  MKKSRKGVSMDSDAYAMFESPKIGIKHQRLLQDYEELHNETEAMKEKLVIAKRKKATLSAEVRFLRHRYEFLKNQSPNTWPKHGLDAPRNLEMRPPNAKK
        MKK+RK  +MD DAYA+FE+P IG KH RLLQDYE+L N TE MKE+L+IAKRKK+TL AEVRFLRHRYEFLKNQS N+ PKHGL+ P+N E+RPPNAKK
Subjt:  MKKSRKGVSMDSDAYAMFESPKIGIKHQRLLQDYEELHNETEAMKEKLVIAKRKKATLSAEVRFLRHRYEFLKNQSPNTWPKHGLDAPRNLEMRPPNAKK

Query:  EKSSRKREASLKPL--AQAHDLNQRGGIYNGMEATSRKSRSVFHINQKTRMCSEKEIAMHGSSPIFDQKERVYRVHEAAANRNMTPVFDLNQISVNMTLV
        EKSS+KREASLK L  AQA DLNQRGGIY+GMEA SRKSR VFH+NQK RMCS+ E++MH SSPIF+ KE +YRVHEAAA+RNMTPVFDLNQIS      
Subjt:  EKSSRKREASLKPL--AQAHDLNQRGGIYNGMEATSRKSRSVFHINQKTRMCSEKEIAMHGSSPIFDQKERVYRVHEAAANRNMTPVFDLNQISVNMTLV

Query:  LISYRERHLSFSITFKYSSQIFNICFLLQREEEELQAGFEPLRSEEALKNSFLRSENDGKNSDMMISSMCRNVGSGSNRAGKRKISWQDQVALRA
                                     REEEELQAGFEP R EE  KNSFLRSENDGKNSD+MIS MCRNVGSGSNRAGKRKISWQDQVALRA
Subjt:  LISYRERHLSFSITFKYSSQIFNICFLLQREEEELQAGFEPLRSEEALKNSFLRSENDGKNSDMMISSMCRNVGSGSNRAGKRKISWQDQVALRA

A0A6J1KKM1 uncharacterized protein LOC1114940123.2e-7159.12Show/hide
Query:  IGIKHQRLLQDYEELHNETEAMKEKLVIAKRKKATLSAEVRFLRHRYEFLKNQSPNTWPKHGLDAPRNLEMRPPNAKKEKSSRKREASLKPLAQAHDLNQ
        I I H  LLQDY EL NETEAMKEKL+I K+KK+TL AEVRFLRH+YE LKN  P T PK G   P+NL++RPP +KKE  SRKRE        A +LNQ
Subjt:  IGIKHQRLLQDYEELHNETEAMKEKLVIAKRKKATLSAEVRFLRHRYEFLKNQSPNTWPKHGLDAPRNLEMRPPNAKKEKSSRKREASLKPLAQAHDLNQ

Query:  RGGIYNGMEATSRKSRSVFHINQKTRMCSEKEIAMHGSSPIFDQKERVYRVHEAAANRNMTPVFDLNQISVNMTLVLISYRERHLSFSITFKYSSQIFNI
        RGGI +GMEAT+RK+RSV ++NQK+RMCS+KE+++    PI  QKERVYR HE A N NMTPVFDLNQIS                              
Subjt:  RGGIYNGMEATSRKSRSVFHINQKTRMCSEKEIAMHGSSPIFDQKERVYRVHEAAANRNMTPVFDLNQISVNMTLVLISYRERHLSFSITFKYSSQIFNI

Query:  CFLLQREEEELQAGFEPLRSE---EALKNSFLRSENDGKNSDMMISSMCRNVGSGSNRAGKRKISWQDQVALRA
             REEEELQ GFEP+R+E   + LKN   RSE D KNSD+M+SSMCRNVG+GSNRAGKRKISWQD+VALRA
Subjt:  CFLLQREEEELQAGFEPLRSE---EALKNSFLRSENDGKNSDMMISSMCRNVGSGSNRAGKRKISWQDQVALRA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G30630.1 unknown protein5.8e-1228.46Show/hide
Query:  ELHNETEAMKEKLVIAKRKKATLSAEVRFLRHRYEFLK-NQSPNTWPK-------HGLDAPRNLEMRPPNAKKEKSSRKREASLKPLAQAHDLNQRGGIY
        EL  E E  +++L + K+K+ TL +EVRFLR RYE LK +Q+  T P+        GL+ PR          K    RK+++ ++      DL  +  I 
Subjt:  ELHNETEAMKEKLVIAKRKKATLSAEVRFLRHRYEFLK-NQSPNTWPK-------HGLDAPRNLEMRPPNAKKEKSSRKREASLKPLAQAHDLNQRGGIY

Query:  NGMEATSRKSRSVFHINQKTRMCSEKEIAMHGSSPIFDQKERVYRVHEAAANRNMTPVFDLNQISVNMTLVLISYRERHLSFSITFKYSSQIFNICFLLQ
        N  EA +    S     ++ R      +    S P  + +          +  +  P FDLNQIS                                   
Subjt:  NGMEATSRKSRSVFHINQKTRMCSEKEIAMHGSSPIFDQKERVYRVHEAAANRNMTPVFDLNQISVNMTLVLISYRERHLSFSITFKYSSQIFNICFLLQ

Query:  REEEELQAGFEPLRSEEALKNSFLRSENDGKNSDMMIS---SMCRNVGSGSNRAGKRKISWQDQVAL
        REEEE +   E + + EA+KN+ L    D + SD+ +     +C +V    NRA KRK++WQD VAL
Subjt:  REEEELQAGFEPLRSEEALKNSFLRSENDGKNSDMMIS---SMCRNVGSGSNRAGKRKISWQDQVAL

AT5G57910.1 unknown protein4.4e-1236.09Show/hide
Query:  FESPKIGIKHQRLLQDYEELHNETEAMKEKLVIAKRKKATLSAEVRFLRHRYEFLKNQSPNTWPKHGLDAPRNLEMRPPNAKKE-----KSSRKREASLK
        FE PK+  +H  L+QDY ELH ETEAM+++L   + +KATL AEVRFLR RY  L+   P    K          +R  N  K+       S K EA  K
Subjt:  FESPKIGIKHQRLLQDYEELHNETEAMKEKLVIAKRKKATLSAEVRFLRHRYEFLKNQSPNTWPKHGLDAPRNLEMRPPNAKKE-----KSSRKREASLK

Query:  PLAQAHDLNQRGGIYNGMEATSRKSRSVFHINQ
         ++   DLN     ++  + + ++   +F +NQ
Subjt:  PLAQAHDLNQRGGIYNGMEATSRKSRSVFHINQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAAATCTCGAAAAGGGGTGTCCATGGATTCAGACGCGTATGCTATGTTCGAGAGCCCGAAGATTGGGATCAAGCATCAGCGGCTCTTGCAGGATTACGAGGAGTT
GCATAACGAAACAGAAGCCATGAAGGAGAAATTAGTGATCGCAAAGCGGAAAAAGGCAACCCTTTCGGCTGAAGTACGATTTTTGAGGCATAGATATGAATTCTTGAAGA
ACCAGTCTCCAAACACCTGGCCAAAGCATGGTCTTGATGCGCCACGAAACCTTGAAATGAGACCTCCCAACGCGAAGAAAGAAAAGAGTTCTCGAAAGAGAGAAGCTTCT
TTGAAACCCCTTGCTCAGGCTCATGATTTAAATCAAAGGGGAGGAATCTACAATGGAATGGAAGCCACCTCTCGAAAATCTCGGTCAGTTTTTCACATAAACCAGAAGAC
AAGGATGTGCAGTGAGAAGGAAATTGCTATGCACGGTTCTTCTCCGATTTTCGACCAGAAGGAGAGAGTGTATAGAGTACATGAAGCTGCTGCCAACAGAAACATGACTC
CAGTTTTCGACCTAAACCAGATCTCGGTAAACATGACTCTGGTTCTCATTTCTTATAGAGAAAGACATTTGAGTTTTTCAATTACATTCAAATACAGCTCTCAGATCTTT
AACATCTGTTTCTTGCTTCAGAGAGAGGAAGAAGAATTGCAGGCTGGTTTCGAACCATTGAGATCGGAGGAGGCGCTGAAGAATAGCTTTTTAAGGAGCGAAAACGACGG
GAAGAACAGTGACATGATGATATCATCAATGTGTAGGAATGTTGGCAGTGGCTCAAACAGAGCAGGAAAAAGGAAGATCTCATGGCAAGATCAAGTGGCTTTAAGAGCAT
GA
mRNA sequenceShow/hide mRNA sequence
ATGAAGAAATCTCGAAAAGGGGTGTCCATGGATTCAGACGCGTATGCTATGTTCGAGAGCCCGAAGATTGGGATCAAGCATCAGCGGCTCTTGCAGGATTACGAGGAGTT
GCATAACGAAACAGAAGCCATGAAGGAGAAATTAGTGATCGCAAAGCGGAAAAAGGCAACCCTTTCGGCTGAAGTACGATTTTTGAGGCATAGATATGAATTCTTGAAGA
ACCAGTCTCCAAACACCTGGCCAAAGCATGGTCTTGATGCGCCACGAAACCTTGAAATGAGACCTCCCAACGCGAAGAAAGAAAAGAGTTCTCGAAAGAGAGAAGCTTCT
TTGAAACCCCTTGCTCAGGCTCATGATTTAAATCAAAGGGGAGGAATCTACAATGGAATGGAAGCCACCTCTCGAAAATCTCGGTCAGTTTTTCACATAAACCAGAAGAC
AAGGATGTGCAGTGAGAAGGAAATTGCTATGCACGGTTCTTCTCCGATTTTCGACCAGAAGGAGAGAGTGTATAGAGTACATGAAGCTGCTGCCAACAGAAACATGACTC
CAGTTTTCGACCTAAACCAGATCTCGGTAAACATGACTCTGGTTCTCATTTCTTATAGAGAAAGACATTTGAGTTTTTCAATTACATTCAAATACAGCTCTCAGATCTTT
AACATCTGTTTCTTGCTTCAGAGAGAGGAAGAAGAATTGCAGGCTGGTTTCGAACCATTGAGATCGGAGGAGGCGCTGAAGAATAGCTTTTTAAGGAGCGAAAACGACGG
GAAGAACAGTGACATGATGATATCATCAATGTGTAGGAATGTTGGCAGTGGCTCAAACAGAGCAGGAAAAAGGAAGATCTCATGGCAAGATCAAGTGGCTTTAAGAGCAT
GA
Protein sequenceShow/hide protein sequence
MKKSRKGVSMDSDAYAMFESPKIGIKHQRLLQDYEELHNETEAMKEKLVIAKRKKATLSAEVRFLRHRYEFLKNQSPNTWPKHGLDAPRNLEMRPPNAKKEKSSRKREAS
LKPLAQAHDLNQRGGIYNGMEATSRKSRSVFHINQKTRMCSEKEIAMHGSSPIFDQKERVYRVHEAAANRNMTPVFDLNQISVNMTLVLISYRERHLSFSITFKYSSQIF
NICFLLQREEEELQAGFEPLRSEEALKNSFLRSENDGKNSDMMISSMCRNVGSGSNRAGKRKISWQDQVALRA