; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Chy5G100340 (gene) of Cucumber (hystrix) v1 genome

Gene IDChy5G100340
OrganismCucumis hystrix (Cucumber (hystrix) v1)
DescriptionRibosomal RNA small subunit methyltransferase G
Genome locationchrH05:14558949..14560868
RNA-Seq ExpressionChy5G100340
SyntenyChy5G100340
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004144727.2 uncharacterized protein LOC101218708 isoform X1 [Cucumis sativus]5.76e-17397.29Show/hide
Query:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKQPRNLELKPPTVKK
        MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKK TLLDEVRFLRHRYELLKKQPANIQPKVGFK+PRNLELKPPTVKK
Subjt:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKQPRNLELKPPTVKK

Query:  EKSSRKREASLKPLAQARDVNQRGGIYNGIEASSRKSQSFFDLNQKSNTCSKKEVIVNNSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA
        EKSSRKREASLKPLAQA DVNQRGGIYNG+EASSRKSQSFFDLNQKSNTCSKKEVIVN+SFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA
Subjt:  EKSSRKREASLKPLAQARDVNQRGGIYNGIEASSRKSQSFFDLNQKSNTCSKKEVIVNNSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA

Query:  GFEPVRLEDEPKNIFPRSEQDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
        GF+PVRLEDEPKNIFPRSE DAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
Subjt:  GFEPVRLEDEPKNIFPRSEQDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA

XP_008452766.1 PREDICTED: uncharacterized protein LOC103493683 [Cucumis melo]5.87e-16493.05Show/hide
Query:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKQPRNLELKPPTVKK
        MKKARKGVAA+SMACALFE+SMVGIKHQSLLQDY+ELHNETEA+KKKLLIAKRKKATLLDEVRFLRHRYELLK QPANIQPKVGFK  RNLEL+PP VKK
Subjt:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKQPRNLELKPPTVKK

Query:  EKSSRKREASLKPLAQARDVNQRGGIYNGIEASSRKSQSFFDLNQKSNTCSKKEVIVNNSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA
        EKSSRKREASLKPLAQA D+NQRGGIYNGIEASSRKSQSFFDLNQKSNTCSKKEVI+NNSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEE+QA
Subjt:  EKSSRKREASLKPLAQARDVNQRGGIYNGIEASSRKSQSFFDLNQKSNTCSKKEVIVNNSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA

Query:  GFEPVR-LEDEPKNIFPRSEQDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
        GFEP+R LEDE KNIFPRSE DAKNS+LVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
Subjt:  GFEPVR-LEDEPKNIFPRSEQDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA

XP_022158137.1 uncharacterized protein LOC111024696 [Momordica charantia]4.33e-11368.85Show/hide
Query:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKQPRNLELKPPTVKK
        MKKARK  A +  A ALFE+ M+G KH  LLQDYE+L N TE MK++LLIAKRKK+TLL EVRFLRHRYE LK Q  N QPK G + P+N E++PP  KK
Subjt:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKQPRNLELKPPTVKK

Query:  EKSSRKREASLKPLA--QARDVNQRGGIYNGIEASSRKSQSFFDLNQKSNTCSKKEVIVNNSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEEL
        EKSS+KREASLK LA  QA+D+NQRGGIY+G+EA+SRKS+  F +NQK   CS  EV ++NS P F+ KE +YR HEAAA+RNMTPVFDLNQISREEEEL
Subjt:  EKSSRKREASLKPLA--QARDVNQRGGIYNGIEASSRKSQSFFDLNQKSNTCSKKEVIVNNSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEEL

Query:  QAGFEPVRLEDEPKNIFPRSEQDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
        QAGFEP R+E+ PKN F RSE D KNS+L++S MCRN  +GSNRAGKRKISWQDQVALRA
Subjt:  QAGFEPVRLEDEPKNIFPRSEQDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA

XP_031736367.1 uncharacterized protein LOC101218708 isoform X2 [Cucumis sativus]4.98e-14196.74Show/hide
Query:  MKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKQPRNLELKPPTVKKEKSSRKREASLKPLAQARDVNQRGGIYNGIEASSRKSQSFFDL
        MKKKLLIAKRKK TLLDEVRFLRHRYELLKKQPANIQPKVGFK+PRNLELKPPTVKKEKSSRKREASLKPLAQA DVNQRGGIYNG+EASSRKSQSFFDL
Subjt:  MKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKQPRNLELKPPTVKKEKSSRKREASLKPLAQARDVNQRGGIYNGIEASSRKSQSFFDL

Query:  NQKSNTCSKKEVIVNNSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQAGFEPVRLEDEPKNIFPRSEQDAKNSELVLSSMCRNDDNGSNRA
        NQKSNTCSKKEVIVN+SFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQAGF+PVRLEDEPKNIFPRSE DAKNSELVLSSMCRNDDNGSNRA
Subjt:  NQKSNTCSKKEVIVNNSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQAGFEPVRLEDEPKNIFPRSEQDAKNSELVLSSMCRNDDNGSNRA

Query:  GKRKISWQDQVALRA
        GKRKISWQDQVALRA
Subjt:  GKRKISWQDQVALRA

XP_038889534.1 uncharacterized protein LOC120079432 [Benincasa hispida]7.39e-13779.07Show/hide
Query:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKQPRNLELKPPTVKK
        MKKARKG+A +S ACALFE+SM+G+KHQSLLQDYEEL NETEAMK+KLLIAKRKK TLL EVRFLRHRYELLK +PAN QPKV F+ P +LE+ PP  KK
Subjt:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKQPRNLELKPPTVKK

Query:  EKSSRKREASLKPLAQARDVNQRGGIYNGIEASSRKSQSFFDLNQKSNTCSKKEVIVNNSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA
         KSSRK EASLKPLA+A D+NQRGGIYNG+EA SRKSQSFF++NQKS  CSKKEV + +S P FDQKERVYR HE   +RNMTPVFDLNQISREEEELQA
Subjt:  EKSSRKREASLKPLAQARDVNQRGGIYNGIEASSRKSQSFFDLNQKSNTCSKKEVIVNNSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA

Query:  GFEPVRLEDEPKNIFPRSEQDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
        GFEP+R+EDE KN+F RSE DAKNS+LVLSSMCRND NGSN AGKRKISWQDQVALRA
Subjt:  GFEPVRLEDEPKNIFPRSEQDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA

TrEMBL top hitse value%identityAlignment
A0A0A0LH29 Uncharacterized protein3.4e-13397.29Show/hide
Query:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKQPRNLELKPPTVKK
        MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKK TLLDEVRFLRHRYELLKKQPANIQPKVGFK+PRNLELKPPTVKK
Subjt:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKQPRNLELKPPTVKK

Query:  EKSSRKREASLKPLAQARDVNQRGGIYNGIEASSRKSQSFFDLNQKSNTCSKKEVIVNNSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA
        EKSSRKREASLKPLAQA DVNQRGGIYNG+EASSRKSQSFFDLNQKSNTCSKKEVIVN+SFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA
Subjt:  EKSSRKREASLKPLAQARDVNQRGGIYNGIEASSRKSQSFFDLNQKSNTCSKKEVIVNNSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA

Query:  GFEPVRLEDEPKNIFPRSEQDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
        GF+PVRLEDEPKNIFPRSE DAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
Subjt:  GFEPVRLEDEPKNIFPRSEQDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA

A0A1S3BVT4 uncharacterized protein LOC1034936832.4e-12693.05Show/hide
Query:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKQPRNLELKPPTVKK
        MKKARKGVAA+SMACALFE+SMVGIKHQSLLQDY+ELHNETEA+KKKLLIAKRKKATLLDEVRFLRHRYELLK QPANIQPKVGFK  RNLEL+PP VKK
Subjt:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKQPRNLELKPPTVKK

Query:  EKSSRKREASLKPLAQARDVNQRGGIYNGIEASSRKSQSFFDLNQKSNTCSKKEVIVNNSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA
        EKSSRKREASLKPLAQA D+NQRGGIYNGIEASSRKSQSFFDLNQKSNTCSKKEVI+NNSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEE+QA
Subjt:  EKSSRKREASLKPLAQARDVNQRGGIYNGIEASSRKSQSFFDLNQKSNTCSKKEVIVNNSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA

Query:  GFEPVR-LEDEPKNIFPRSEQDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
        GFEP+R LEDE KNIFPRSE DAKNS+LVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
Subjt:  GFEPVR-LEDEPKNIFPRSEQDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA

A0A5A7URJ0 Uncharacterized protein2.4e-12693.05Show/hide
Query:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKQPRNLELKPPTVKK
        MKKARKGVAA+SMACALFE+SMVGIKHQSLLQDY+ELHNETEA+KKKLLIAKRKKATLLDEVRFLRHRYELLK QPANIQPKVGFK  RNLEL+PP VKK
Subjt:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKQPRNLELKPPTVKK

Query:  EKSSRKREASLKPLAQARDVNQRGGIYNGIEASSRKSQSFFDLNQKSNTCSKKEVIVNNSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA
        EKSSRKREASLKPLAQA D+NQRGGIYNGIEASSRKSQSFFDLNQKSNTCSKKEVI+NNSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEE+QA
Subjt:  EKSSRKREASLKPLAQARDVNQRGGIYNGIEASSRKSQSFFDLNQKSNTCSKKEVIVNNSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQA

Query:  GFEPVR-LEDEPKNIFPRSEQDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
        GFEP+R LEDE KNIFPRSE DAKNS+LVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
Subjt:  GFEPVR-LEDEPKNIFPRSEQDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA

A0A6J1DWE7 uncharacterized protein LOC1110246961.3e-8768.85Show/hide
Query:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKQPRNLELKPPTVKK
        MKKARK  A +  A ALFE+ M+G KH  LLQDYE+L N TE MK++LLIAKRKK+TLL EVRFLRHRYE LK Q  N QPK G + P+N E++PP  KK
Subjt:  MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKQPRNLELKPPTVKK

Query:  EKSSRKREASLKPL--AQARDVNQRGGIYNGIEASSRKSQSFFDLNQKSNTCSKKEVIVNNSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEEL
        EKSS+KREASLK L  AQA+D+NQRGGIY+G+EA+SRKS+  F +NQK   CS  EV ++NS P F+ KE +YR HEAAA+RNMTPVFDLNQISREEEEL
Subjt:  EKSSRKREASLKPL--AQARDVNQRGGIYNGIEASSRKSQSFFDLNQKSNTCSKKEVIVNNSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEEL

Query:  QAGFEPVRLEDEPKNIFPRSEQDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
        QAGFEP R+E+ PKN F RSE D KNS+L++S MCRN  +GSNRAGKRKISWQDQVALRA
Subjt:  QAGFEPVRLEDEPKNIFPRSEQDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA

A0A6J1KKM1 uncharacterized protein LOC1114940122.3e-7667.5Show/hide
Query:  MVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKQPRNLELKPPTVKKEKSSRKREASLKPLAQARDVN
        M+ I H  LLQDY EL NETEAMK+KLLI K+KK+TLL EVRFLRH+YELLK  P   QPKVGFK P+NL+++PP  KKE  SRKRE        AR++N
Subjt:  MVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKQPRNLELKPPTVKKEKSSRKREASLKPLAQARDVN

Query:  QRGGIYNGIEASSRKSQSFFDLNQKSNTCSKKEVIVNNSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQAGFEPVRLEDEP---KNIFPRS
        QRGGI +G+EA++RK++S  ++NQKS  CSKKE+ + + FP   QKERVYRAHE A N NMTPVFDLNQISREEEELQ GFEP+R EDE    KNI  RS
Subjt:  QRGGIYNGIEASSRKSQSFFDLNQKSNTCSKKEVIVNNSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQAGFEPVRLEDEP---KNIFPRS

Query:  EQDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA
        E DAKNS+L++SSMCRN  NGSNRAGKRKISWQD+VALRA
Subjt:  EQDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G30630.1 unknown protein1.2e-1331.22Show/hide
Query:  ELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKQPRNLELKPPTVKKEKSSRKREASLKPLAQARDVNQRGGIYNGIEASSR
        EL  E E  +K+L + K+K+ TL  EVRFLR RYE L             KQ + LE  P  ++  +S    E   KP  + +            ++  R
Subjt:  ELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKQPRNLELKPPTVKKEKSSRKREASLKPLAQARDVNQRGGIYNGIEASSR

Query:  KSQSFFDLNQKSNTCSKKEVIVNN-SFPTFDQKERVYRAHEA---------------AANRNMTPVFDLNQISREEEELQAGFEPVRLEDEPKNIFPRSE
         S   FDL  K+  C++KE + NN +    D+K +  R  +                 +  +  P FDLNQISREEEE +   E + + +  KN    + 
Subjt:  KSQSFFDLNQKSNTCSKKEVIVNN-SFPTFDQKERVYRAHEA---------------AANRNMTPVFDLNQISREEEELQAGFEPVRLEDEPKNIFPRSE

Query:  QDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVAL
            + E  L  +C + +   NRA KRK++WQD VAL
Subjt:  QDAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVAL

AT5G57910.1 unknown protein5.2e-1732.53Show/hide
Query:  FESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKQPRNLELKPPTVKKEKSSRKREASLKPLAQA
        FE   V  +H SL+QDY ELH ETEAM+K+L   + +KATL+ EVRFLR RY  L++            QP+ ++     V++    +K    + P    
Subjt:  FESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKQPRNLELKPPTVKKEKSSRKREASLKPLAQA

Query:  RDVNQRGGIYNGIEASSRKSQSFFDLNQKSNTCSKKEVIVNNSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQAGFEPVRLEDEPKNIFPR
                        S KS++             K V    S P  +  E+ +   + +  R + P+FDLNQIS EEE+     E   +++  +N   R
Subjt:  RDVNQRGGIYNGIEASSRKSQSFFDLNQKSNTCSKKEVIVNNSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQAGFEPVRLEDEPKNIFPR

Query:  SEQDAKNSELVLSSM-----------CRNDDNGSNRAGKRKISWQDQVA
         E+      L++SS+           CRN  NGSN   KRKISWQD VA
Subjt:  SEQDAKNSELVLSSM-----------CRNDDNGSNRAGKRKISWQDQVA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAAAGCTCGAAAAGGGGTGGCTGCGAATTCAATGGCGTGTGCTCTGTTTGAGAGTTCGATGGTTGGGATCAAACATCAAAGTCTCTTGCAGGATTACGAGGAGTT
GCATAACGAAACAGAAGCCATGAAGAAAAAACTACTGATTGCGAAGCGGAAAAAGGCAACCCTTTTGGATGAAGTTCGATTCCTGAGGCATAGATATGAATTGTTGAAGA
AGCAGCCTGCAAACATCCAGCCAAAGGTAGGTTTCAAGCAGCCACGAAACCTTGAACTCAAACCTCCCACCGTGAAGAAAGAAAAGAGTTCGCGGAAAAGAGAAGCTTCT
TTGAAACCGCTTGCTCAAGCGCGTGACGTAAACCAAAGGGGAGGAATCTACAATGGGATTGAAGCCTCTTCTCGAAAATCTCAGTCGTTTTTCGACCTAAACCAGAAGTC
AAATACGTGCAGCAAGAAGGAAGTCATTGTGAACAATTCTTTTCCTACTTTTGACCAGAAAGAGAGAGTATACAGAGCACATGAAGCTGCTGCCAACAGGAACATGACCC
CGGTTTTCGACCTTAACCAGATTTCGAGGGAGGAAGAAGAACTGCAAGCTGGTTTCGAACCAGTGAGACTGGAGGACGAGCCGAAGAATATCTTCCCAAGAAGCGAACAA
GATGCAAAGAACAGTGAGTTGGTGTTATCATCAATGTGTAGGAATGATGATAATGGATCAAACAGAGCAGGAAAAAGGAAGATCTCATGGCAAGATCAAGTGGCTTTAAG
AGCATGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGAAAGCTCGAAAAGGGGTGGCTGCGAATTCAATGGCGTGTGCTCTGTTTGAGAGTTCGATGGTTGGGATCAAACATCAAAGTCTCTTGCAGGATTACGAGGAGTT
GCATAACGAAACAGAAGCCATGAAGAAAAAACTACTGATTGCGAAGCGGAAAAAGGCAACCCTTTTGGATGAAGTTCGATTCCTGAGGCATAGATATGAATTGTTGAAGA
AGCAGCCTGCAAACATCCAGCCAAAGGTAGGTTTCAAGCAGCCACGAAACCTTGAACTCAAACCTCCCACCGTGAAGAAAGAAAAGAGTTCGCGGAAAAGAGAAGCTTCT
TTGAAACCGCTTGCTCAAGCGCGTGACGTAAACCAAAGGGGAGGAATCTACAATGGGATTGAAGCCTCTTCTCGAAAATCTCAGTCGTTTTTCGACCTAAACCAGAAGTC
AAATACGTGCAGCAAGAAGGAAGTCATTGTGAACAATTCTTTTCCTACTTTTGACCAGAAAGAGAGAGTATACAGAGCACATGAAGCTGCTGCCAACAGGAACATGACCC
CGGTTTTCGACCTTAACCAGATTTCGAGGGAGGAAGAAGAACTGCAAGCTGGTTTCGAACCAGTGAGACTGGAGGACGAGCCGAAGAATATCTTCCCAAGAAGCGAACAA
GATGCAAAGAACAGTGAGTTGGTGTTATCATCAATGTGTAGGAATGATGATAATGGATCAAACAGAGCAGGAAAAAGGAAGATCTCATGGCAAGATCAAGTGGCTTTAAG
AGCATGA
Protein sequenceShow/hide protein sequence
MKKARKGVAANSMACALFESSMVGIKHQSLLQDYEELHNETEAMKKKLLIAKRKKATLLDEVRFLRHRYELLKKQPANIQPKVGFKQPRNLELKPPTVKKEKSSRKREAS
LKPLAQARDVNQRGGIYNGIEASSRKSQSFFDLNQKSNTCSKKEVIVNNSFPTFDQKERVYRAHEAAANRNMTPVFDLNQISREEEELQAGFEPVRLEDEPKNIFPRSEQ
DAKNSELVLSSMCRNDDNGSNRAGKRKISWQDQVALRA