; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC08g2153 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC08g2153
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionRibosomal RNA small subunit methyltransferase G
Genome locationMC08:30463635..30466041
RNA-Seq ExpressionMC08g2153
SyntenyMC08g2153
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004144727.2 uncharacterized protein LOC101218708 isoform X1 [Cucumis sativus]9.82e-11268.08Show/hide
Query:  MKKARKPAAMDFDAYALFETPMIGTKHHRLLQDYEDLENATEVMKEELLIAKRKKSTLLAEVRFLRHRYEFLKNQSLNSQPKHGLEPPQNHEIRPPNAKK
        MKKARK  A +  A ALFE+ M+G KH  LLQDYE+L N TE MK++LLIAKRKK TLL EVRFLRHRYE LK Q  N QPK G + P+N E++PP  KK
Subjt:  MKKARKPAAMDFDAYALFETPMIGTKHHRLLQDYEDLENATEVMKEELLIAKRKKSTLLAEVRFLRHRYEFLKNQSLNSQPKHGLEPPQNHEIRPPNAKK

Query:  EKSSQKREASLKLLAEAQAQDLNQRGGIYDGMEAASRKSRLVFHMNQKPRMCSDNEVSMHNSSPIFNHKEILYRVHEAAADRNMTPVFDLNQISREEEEL
        EKSS+KREASLK LA  QA D+NQRGGIY+G+EA+SRKS+  F +NQK   CS  EV +++S P F+ KE +YR HEAAA+RNMTPVFDLNQISREEEEL
Subjt:  EKSSQKREASLKLLAEAQAQDLNQRGGIYDGMEAASRKSRLVFHMNQKPRMCSDNEVSMHNSSPIFNHKEILYRVHEAAADRNMTPVFDLNQISREEEEL

Query:  QAGFEPPRMEEGPKNSFLRSENDGKNSDLMISPMCRNVGSGSNRAGKRKISWQDQVALRA
        QAGF+P R+E+ PKN F RSE+D KNS+L++S MCRN  +GSNRAGKRKISWQDQVALRA
Subjt:  QAGFEPPRMEEGPKNSFLRSENDGKNSDLMISPMCRNVGSGSNRAGKRKISWQDQVALRA

XP_008452766.1 PREDICTED: uncharacterized protein LOC103493683 [Cucumis melo]1.67e-11068.97Show/hide
Query:  MKKARKPAAMDFDAYALFETPMIGTKHHRLLQDYEDLENATEVMKEELLIAKRKKSTLLAEVRFLRHRYEFLKNQSLNSQPKHGLEPPQNHEIRPPNAKK
        MKKARK  A D  A ALFE  M+G KH  LLQDY++L N TE +K++LLIAKRKK+TLL EVRFLRHRYE LKNQ  N QPK G +  +N E+RPP  KK
Subjt:  MKKARKPAAMDFDAYALFETPMIGTKHHRLLQDYEDLENATEVMKEELLIAKRKKSTLLAEVRFLRHRYEFLKNQSLNSQPKHGLEPPQNHEIRPPNAKK

Query:  EKSSQKREASLKLLAEAQAQDLNQRGGIYDGMEAASRKSRLVFHMNQKPRMCSDNEVSMHNSSPIFNHKEILYRVHEAAADRNMTPVFDLNQISREEEEL
        EKSS+KREASLK LA  QA DLNQRGGIY+G+EA+SRKS+  F +NQK   CS  EV M+NS P F+ KE +YR HEAAA+RNMTPVFDLNQISREEEE+
Subjt:  EKSSQKREASLKLLAEAQAQDLNQRGGIYDGMEAASRKSRLVFHMNQKPRMCSDNEVSMHNSSPIFNHKEILYRVHEAAADRNMTPVFDLNQISREEEEL

Query:  QAGFEPPR-MEEGPKNSFLRSENDGKNSDLMISPMCRNVGSGSNRAGKRKISWQDQVALRA
        QAGFEP R +E+  KN F RSE+D KNSDL++S MCRN  +GSNRAGKRKISWQDQVALRA
Subjt:  QAGFEPPR-MEEGPKNSFLRSENDGKNSDLMISPMCRNVGSGSNRAGKRKISWQDQVALRA

XP_022158137.1 uncharacterized protein LOC111024696 [Momordica charantia]7.96e-183100Show/hide
Query:  MKKARKPAAMDFDAYALFETPMIGTKHHRLLQDYEDLENATEVMKEELLIAKRKKSTLLAEVRFLRHRYEFLKNQSLNSQPKHGLEPPQNHEIRPPNAKK
        MKKARKPAAMDFDAYALFETPMIGTKHHRLLQDYEDLENATEVMKEELLIAKRKKSTLLAEVRFLRHRYEFLKNQSLNSQPKHGLEPPQNHEIRPPNAKK
Subjt:  MKKARKPAAMDFDAYALFETPMIGTKHHRLLQDYEDLENATEVMKEELLIAKRKKSTLLAEVRFLRHRYEFLKNQSLNSQPKHGLEPPQNHEIRPPNAKK

Query:  EKSSQKREASLKLLAEAQAQDLNQRGGIYDGMEAASRKSRLVFHMNQKPRMCSDNEVSMHNSSPIFNHKEILYRVHEAAADRNMTPVFDLNQISREEEEL
        EKSSQKREASLKLLAEAQAQDLNQRGGIYDGMEAASRKSRLVFHMNQKPRMCSDNEVSMHNSSPIFNHKEILYRVHEAAADRNMTPVFDLNQISREEEEL
Subjt:  EKSSQKREASLKLLAEAQAQDLNQRGGIYDGMEAASRKSRLVFHMNQKPRMCSDNEVSMHNSSPIFNHKEILYRVHEAAADRNMTPVFDLNQISREEEEL

Query:  QAGFEPPRMEEGPKNSFLRSENDGKNSDLMISPMCRNVGSGSNRAGKRKISWQDQVALRA
        QAGFEPPRMEEGPKNSFLRSENDGKNSDLMISPMCRNVGSGSNRAGKRKISWQDQVALRA
Subjt:  QAGFEPPRMEEGPKNSFLRSENDGKNSDLMISPMCRNVGSGSNRAGKRKISWQDQVALRA

XP_031736367.1 uncharacterized protein LOC101218708 isoform X2 [Cucumis sativus]3.18e-9369.59Show/hide
Query:  MKEELLIAKRKKSTLLAEVRFLRHRYEFLKNQSLNSQPKHGLEPPQNHEIRPPNAKKEKSSQKREASLKLLAEAQAQDLNQRGGIYDGMEAASRKSRLVF
        MK++LLIAKRKK TLL EVRFLRHRYE LK Q  N QPK G + P+N E++PP  KKEKSS+KREASLK LA  QA D+NQRGGIY+G+EA+SRKS+  F
Subjt:  MKEELLIAKRKKSTLLAEVRFLRHRYEFLKNQSLNSQPKHGLEPPQNHEIRPPNAKKEKSSQKREASLKLLAEAQAQDLNQRGGIYDGMEAASRKSRLVF

Query:  HMNQKPRMCSDNEVSMHNSSPIFNHKEILYRVHEAAADRNMTPVFDLNQISREEEELQAGFEPPRMEEGPKNSFLRSENDGKNSDLMISPMCRNVGSGSN
         +NQK   CS  EV +++S P F+ KE +YR HEAAA+RNMTPVFDLNQISREEEELQAGF+P R+E+ PKN F RSE+D KNS+L++S MCRN  +GSN
Subjt:  HMNQKPRMCSDNEVSMHNSSPIFNHKEILYRVHEAAADRNMTPVFDLNQISREEEELQAGFEPPRMEEGPKNSFLRSENDGKNSDLMISPMCRNVGSGSN

Query:  RAGKRKISWQDQVALRA
        RAGKRKISWQDQVALRA
Subjt:  RAGKRKISWQDQVALRA

XP_038889534.1 uncharacterized protein LOC120079432 [Benincasa hispida]2.09e-11370.77Show/hide
Query:  MKKARKPAAMDFDAYALFETPMIGTKHHRLLQDYEDLENATEVMKEELLIAKRKKSTLLAEVRFLRHRYEFLKNQSLNSQPKHGLEPPQNHEIRPPNAKK
        MKKARK  A+D +A ALFET MIG KH  LLQDYE+L N TE MKE+LLIAKRKK TLLAEVRFLRHRYE LKN+  N+QPK   E P + EI PP  KK
Subjt:  MKKARKPAAMDFDAYALFETPMIGTKHHRLLQDYEDLENATEVMKEELLIAKRKKSTLLAEVRFLRHRYEFLKNQSLNSQPKHGLEPPQNHEIRPPNAKK

Query:  EKSSQKREASLKLLAEAQAQDLNQRGGIYDGMEAASRKSRLVFHMNQKPRMCSDNEVSMHNSSPIFNHKEILYRVHEAAADRNMTPVFDLNQISREEEEL
         KSS+K EASLK LA  +A DLNQRGGIY+GMEA SRKS+  F++NQK RMCS  EV++ +S PIF+ KE +YRVHE    RNMTPVFDLNQISREEEEL
Subjt:  EKSSQKREASLKLLAEAQAQDLNQRGGIYDGMEAASRKSRLVFHMNQKPRMCSDNEVSMHNSSPIFNHKEILYRVHEAAADRNMTPVFDLNQISREEEEL

Query:  QAGFEPPRMEEGPKNSFLRSENDGKNSDLMISPMCRNVGSGSNRAGKRKISWQDQVALRA
        QAGFEP RME+  KN F RSE+D KNSDL++S MCRN G+GSN AGKRKISWQDQVALRA
Subjt:  QAGFEPPRMEEGPKNSFLRSENDGKNSDLMISPMCRNVGSGSNRAGKRKISWQDQVALRA

TrEMBL top hitse value%identityAlignment
A0A0A0LH29 Uncharacterized protein4.75e-11268.08Show/hide
Query:  MKKARKPAAMDFDAYALFETPMIGTKHHRLLQDYEDLENATEVMKEELLIAKRKKSTLLAEVRFLRHRYEFLKNQSLNSQPKHGLEPPQNHEIRPPNAKK
        MKKARK  A +  A ALFE+ M+G KH  LLQDYE+L N TE MK++LLIAKRKK TLL EVRFLRHRYE LK Q  N QPK G + P+N E++PP  KK
Subjt:  MKKARKPAAMDFDAYALFETPMIGTKHHRLLQDYEDLENATEVMKEELLIAKRKKSTLLAEVRFLRHRYEFLKNQSLNSQPKHGLEPPQNHEIRPPNAKK

Query:  EKSSQKREASLKLLAEAQAQDLNQRGGIYDGMEAASRKSRLVFHMNQKPRMCSDNEVSMHNSSPIFNHKEILYRVHEAAADRNMTPVFDLNQISREEEEL
        EKSS+KREASLK LA  QA D+NQRGGIY+G+EA+SRKS+  F +NQK   CS  EV +++S P F+ KE +YR HEAAA+RNMTPVFDLNQISREEEEL
Subjt:  EKSSQKREASLKLLAEAQAQDLNQRGGIYDGMEAASRKSRLVFHMNQKPRMCSDNEVSMHNSSPIFNHKEILYRVHEAAADRNMTPVFDLNQISREEEEL

Query:  QAGFEPPRMEEGPKNSFLRSENDGKNSDLMISPMCRNVGSGSNRAGKRKISWQDQVALRA
        QAGF+P R+E+ PKN F RSE+D KNS+L++S MCRN  +GSNRAGKRKISWQDQVALRA
Subjt:  QAGFEPPRMEEGPKNSFLRSENDGKNSDLMISPMCRNVGSGSNRAGKRKISWQDQVALRA

A0A1S3BVT4 uncharacterized protein LOC1034936838.10e-11168.97Show/hide
Query:  MKKARKPAAMDFDAYALFETPMIGTKHHRLLQDYEDLENATEVMKEELLIAKRKKSTLLAEVRFLRHRYEFLKNQSLNSQPKHGLEPPQNHEIRPPNAKK
        MKKARK  A D  A ALFE  M+G KH  LLQDY++L N TE +K++LLIAKRKK+TLL EVRFLRHRYE LKNQ  N QPK G +  +N E+RPP  KK
Subjt:  MKKARKPAAMDFDAYALFETPMIGTKHHRLLQDYEDLENATEVMKEELLIAKRKKSTLLAEVRFLRHRYEFLKNQSLNSQPKHGLEPPQNHEIRPPNAKK

Query:  EKSSQKREASLKLLAEAQAQDLNQRGGIYDGMEAASRKSRLVFHMNQKPRMCSDNEVSMHNSSPIFNHKEILYRVHEAAADRNMTPVFDLNQISREEEEL
        EKSS+KREASLK LA  QA DLNQRGGIY+G+EA+SRKS+  F +NQK   CS  EV M+NS P F+ KE +YR HEAAA+RNMTPVFDLNQISREEEE+
Subjt:  EKSSQKREASLKLLAEAQAQDLNQRGGIYDGMEAASRKSRLVFHMNQKPRMCSDNEVSMHNSSPIFNHKEILYRVHEAAADRNMTPVFDLNQISREEEEL

Query:  QAGFEPPR-MEEGPKNSFLRSENDGKNSDLMISPMCRNVGSGSNRAGKRKISWQDQVALRA
        QAGFEP R +E+  KN F RSE+D KNSDL++S MCRN  +GSNRAGKRKISWQDQVALRA
Subjt:  QAGFEPPR-MEEGPKNSFLRSENDGKNSDLMISPMCRNVGSGSNRAGKRKISWQDQVALRA

A0A5A7URJ0 Uncharacterized protein8.10e-11168.97Show/hide
Query:  MKKARKPAAMDFDAYALFETPMIGTKHHRLLQDYEDLENATEVMKEELLIAKRKKSTLLAEVRFLRHRYEFLKNQSLNSQPKHGLEPPQNHEIRPPNAKK
        MKKARK  A D  A ALFE  M+G KH  LLQDY++L N TE +K++LLIAKRKK+TLL EVRFLRHRYE LKNQ  N QPK G +  +N E+RPP  KK
Subjt:  MKKARKPAAMDFDAYALFETPMIGTKHHRLLQDYEDLENATEVMKEELLIAKRKKSTLLAEVRFLRHRYEFLKNQSLNSQPKHGLEPPQNHEIRPPNAKK

Query:  EKSSQKREASLKLLAEAQAQDLNQRGGIYDGMEAASRKSRLVFHMNQKPRMCSDNEVSMHNSSPIFNHKEILYRVHEAAADRNMTPVFDLNQISREEEEL
        EKSS+KREASLK LA  QA DLNQRGGIY+G+EA+SRKS+  F +NQK   CS  EV M+NS P F+ KE +YR HEAAA+RNMTPVFDLNQISREEEE+
Subjt:  EKSSQKREASLKLLAEAQAQDLNQRGGIYDGMEAASRKSRLVFHMNQKPRMCSDNEVSMHNSSPIFNHKEILYRVHEAAADRNMTPVFDLNQISREEEEL

Query:  QAGFEPPR-MEEGPKNSFLRSENDGKNSDLMISPMCRNVGSGSNRAGKRKISWQDQVALRA
        QAGFEP R +E+  KN F RSE+D KNSDL++S MCRN  +GSNRAGKRKISWQDQVALRA
Subjt:  QAGFEPPR-MEEGPKNSFLRSENDGKNSDLMISPMCRNVGSGSNRAGKRKISWQDQVALRA

A0A6J1DWE7 uncharacterized protein LOC1110246963.85e-183100Show/hide
Query:  MKKARKPAAMDFDAYALFETPMIGTKHHRLLQDYEDLENATEVMKEELLIAKRKKSTLLAEVRFLRHRYEFLKNQSLNSQPKHGLEPPQNHEIRPPNAKK
        MKKARKPAAMDFDAYALFETPMIGTKHHRLLQDYEDLENATEVMKEELLIAKRKKSTLLAEVRFLRHRYEFLKNQSLNSQPKHGLEPPQNHEIRPPNAKK
Subjt:  MKKARKPAAMDFDAYALFETPMIGTKHHRLLQDYEDLENATEVMKEELLIAKRKKSTLLAEVRFLRHRYEFLKNQSLNSQPKHGLEPPQNHEIRPPNAKK

Query:  EKSSQKREASLKLLAEAQAQDLNQRGGIYDGMEAASRKSRLVFHMNQKPRMCSDNEVSMHNSSPIFNHKEILYRVHEAAADRNMTPVFDLNQISREEEEL
        EKSSQKREASLKLLAEAQAQDLNQRGGIYDGMEAASRKSRLVFHMNQKPRMCSDNEVSMHNSSPIFNHKEILYRVHEAAADRNMTPVFDLNQISREEEEL
Subjt:  EKSSQKREASLKLLAEAQAQDLNQRGGIYDGMEAASRKSRLVFHMNQKPRMCSDNEVSMHNSSPIFNHKEILYRVHEAAADRNMTPVFDLNQISREEEEL

Query:  QAGFEPPRMEEGPKNSFLRSENDGKNSDLMISPMCRNVGSGSNRAGKRKISWQDQVALRA
        QAGFEPPRMEEGPKNSFLRSENDGKNSDLMISPMCRNVGSGSNRAGKRKISWQDQVALRA
Subjt:  QAGFEPPRMEEGPKNSFLRSENDGKNSDLMISPMCRNVGSGSNRAGKRKISWQDQVALRA

A0A6J1KKM1 uncharacterized protein LOC1114940121.35e-8863.64Show/hide
Query:  MIGTKHHRLLQDYEDLENATEVMKEELLIAKRKKSTLLAEVRFLRHRYEFLKNQSLNSQPKHGLEPPQNHEIRPPNAKKEKSSQKREASLKLLAEAQAQD
        MI   H  LLQDY +L+N TE MKE+LLI K+KKSTLLAEVRFLRH+YE LKN    +QPK G + PQN +IRPP +KKE  S+KREA          ++
Subjt:  MIGTKHHRLLQDYEDLENATEVMKEELLIAKRKKSTLLAEVRFLRHRYEFLKNQSLNSQPKHGLEPPQNHEIRPPNAKKEKSSQKREASLKLLAEAQAQD

Query:  LNQRGGIYDGMEAASRKSRLVFHMNQKPRMCSDNEVSMHNSSPIFNHKEILYRVHEAAADRNMTPVFDLNQISREEEELQAGFEPPRMEEGP---KNSFL
        LNQRGGI DGMEA +RK+R V ++NQK RMCS  E+S+ +  PI + KE +YR HE A + NMTPVFDLNQISREEEELQ GFEP R E+     KN   
Subjt:  LNQRGGIYDGMEAASRKSRLVFHMNQKPRMCSDNEVSMHNSSPIFNHKEILYRVHEAAADRNMTPVFDLNQISREEEELQAGFEPPRMEEGP---KNSFL

Query:  RSENDGKNSDLMISPMCRNVGSGSNRAGKRKISWQDQVALRA
        RSE D KNSDLM+S MCRNVG+GSNRAGKRKISWQD+VALRA
Subjt:  RSENDGKNSDLMISPMCRNVGSGSNRAGKRKISWQDQVALRA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G30630.1 unknown protein6.4e-1533.76Show/hide
Query:  DLENATEVMKEELLIAKRKKSTLLAEVRFLRHRYEFLK-NQSLNSQPK-------HGLEPPQNHEIRPPNAKKEKSSQKREASLKLLAEAQAQDLNQRGG
        +LE   E+ ++ L + K+K+ TL +EVRFLR RYE LK +Q+L + P+        GLE P     R P+ +++K S  R       A     DL  +  
Subjt:  DLENATEVMKEELLIAKRKKSTLLAEVRFLRHRYEFLK-NQSLNSQPK-------HGLEPPQNHEIRPPNAKKEKSSQKREASLKLLAEAQAQDLNQRGG

Query:  IYDGMEAASRKSRLVFHMNQKPRMCSDNEVSMHNSSPIFNHKEILYRVHEAAADRNMTPVFDLNQISREEEELQAGFEPPRMEEGPKNSFLRSENDGKNS
        I +  EA +          ++ R    + ++   S P  N +      + +  D+   P FDLNQISREEEE +   E   + E  KN+ L    D + S
Subjt:  IYDGMEAASRKSRLVFHMNQKPRMCSDNEVSMHNSSPIFNHKEILYRVHEAAADRNMTPVFDLNQISREEEELQAGFEPPRMEEGPKNSFLRSENDGKNS

Query:  DLMIS---PMCRNVGSGSNRAGKRKISWQDQVAL
        DL +    P+C +V    NRA KRK++WQD VAL
Subjt:  DLMIS---PMCRNVGSGSNRAGKRKISWQDQVAL

AT5G57910.1 unknown protein5.8e-1631.08Show/hide
Query:  FETPMIGTKHHRLLQDYEDLENATEVMKEELLIAKRKKSTLLAEVRFLRHRYEFLKNQSLNSQPKHGLEPPQNHEIRPPNAKKEKSSQKREASLKLLAEA
        FE P +  +HH L+QDY +L   TE M++ L   + +K+TL+AEVRFLR RY  L+            +P +  ++R  N  K+    + E S    +EA
Subjt:  FETPMIGTKHHRLLQDYEDLENATEVMKEELLIAKRKKSTLLAEVRFLRHRYEFLKNQSLNSQPKHGLEPPQNHEIRPPNAKKEKSSQKREASLKLLAEA

Query:  QAQDLNQRGGIYDGMEAASRKSRLVFHMNQKPRMCSDNEVSMHNSSPIFNHKEILYRVHEAAADRNMTPVFDLNQISREEEELQAGFEPPRMEEGPKNSF
        + +                                       H S P  NH E  +   + +  R + P+FDLNQIS EEE+     E   ++   +N+ 
Subjt:  QAQDLNQRGGIYDGMEAASRKSRLVFHMNQKPRMCSDNEVSMHNSSPIFNHKEILYRVHEAAADRNMTPVFDLNQISREEEELQAGFEPPRMEEGPKNSF

Query:  LRSENDGKNSDLMISPM-----------CRNVGSGSNRAGKRKISWQDQVA
        +   N  K   LMIS +           CRN G+GSN   KRKISWQD VA
Subjt:  LRSENDGKNSDLMISPM-----------CRNVGSGSNRAGKRKISWQDQVA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAAAGCTCGGAAACCGGCGGCCATGGATTTCGACGCCTATGCTCTATTCGAGACCCCCATGATCGGGACCAAGCACCACCGTCTCTTGCAGGATTACGAGGACCT
GGAGAATGCAACAGAAGTCATGAAAGAAGAGTTGCTGATTGCAAAGCGGAAAAAGTCGACCCTCTTGGCTGAAGTTCGATTTTTGAGGCATAGATATGAGTTCTTGAAGA
ACCAATCTCTGAACTCCCAGCCAAAGCATGGTCTCGAGCCACCACAAAACCATGAAATCCGACCTCCTAACGCTAAGAAAGAAAAGAGTTCTCAAAAAAGGGAAGCTTCT
TTGAAACTCCTTGCTGAGGCTCAGGCTCAAGATTTAAATCAGAGGGGAGGAATCTATGATGGGATGGAAGCTGCCTCTCGAAAATCTCGGTTGGTTTTTCACATGAACCA
GAAGCCAAGAATGTGTAGCGACAACGAAGTCTCTATGCACAATTCTTCTCCGATTTTCAACCATAAAGAGATACTATACAGAGTACACGAAGCTGCTGCCGACCGAAACA
TGACTCCGGTTTTCGACCTAAACCAGATCTCGAGAGAGGAAGAAGAATTGCAGGCTGGTTTCGAACCACCGAGAATGGAGGAGGGGCCGAAGAATAGCTTTCTAAGAAGC
GAAAACGACGGGAAGAACAGTGACCTGATGATATCACCGATGTGTAGGAATGTTGGCAGTGGCTCAAACAGAGCAGGAAAAAGGAAGATCTCATGGCAAGATCAAGTAGC
TTTAAGAGCATGA
mRNA sequenceShow/hide mRNA sequence
GTTTGAACCTCCAACCTATTGAAGATAATGATTTAGAGAGAAAATGGTACCTCCGATCTTAAAAAGATATACTATTTGAACCTCTTACCTAAAGAAAATATCGAGAGAGA
GAGAGAGATTTTTGCAATTGATTAATAGAGGAAAGAAAGGCTAAAAGCAAGGAAGGCAGTCGACAACATCATCTGCAAAACGGCATAGCGCACACACAGAGCAACAGAAA
CAGCATCATCAATCTCTCTCTCTCTCTCTCTCCAATGGCCCCAGTTGGACCAAATTTTATGCGCAACAAATTCCAAAAACAAAACACTGAAAATGGGTGCAGATTTCTTC
AAATCCCCATTTTTCTTCTCTCTCTCATCAAACCCAATTTGCGAGAATTGCCCACTCATTTTTCCCTTTTGGGCCTCTTCTTAGTTGTCTTCTAGGTCACACCATCATCA
CCATCCAACAATCTCTCCATTCTCTCTTCTGGGTCCTTCTGCGATCGCCGCCCCATTTTCCCCTTCTTCCGTTTCCATTTCCGATGAAGAAAGCTCGGAAACCGGCGGCC
ATGGATTTCGACGCCTATGCTCTATTCGAGACCCCCATGATCGGGACCAAGCACCACCGTCTCTTGCAGGATTACGAGGACCTGGAGAATGCAACAGAAGTCATGAAAGA
AGAGTTGCTGATTGCAAAGCGGAAAAAGTCGACCCTCTTGGCTGAAGTTCGATTTTTGAGGCATAGATATGAGTTCTTGAAGAACCAATCTCTGAACTCCCAGCCAAAGC
ATGGTCTCGAGCCACCACAAAACCATGAAATCCGACCTCCTAACGCTAAGAAAGAAAAGAGTTCTCAAAAAAGGGAAGCTTCTTTGAAACTCCTTGCTGAGGCTCAGGCT
CAAGATTTAAATCAGAGGGGAGGAATCTATGATGGGATGGAAGCTGCCTCTCGAAAATCTCGGTTGGTTTTTCACATGAACCAGAAGCCAAGAATGTGTAGCGACAACGA
AGTCTCTATGCACAATTCTTCTCCGATTTTCAACCATAAAGAGATACTATACAGAGTACACGAAGCTGCTGCCGACCGAAACATGACTCCGGTTTTCGACCTAAACCAGA
TCTCGAGAGAGGAAGAAGAATTGCAGGCTGGTTTCGAACCACCGAGAATGGAGGAGGGGCCGAAGAATAGCTTTCTAAGAAGCGAAAACGACGGGAAGAACAGTGACCTG
ATGATATCACCGATGTGTAGGAATGTTGGCAGTGGCTCAAACAGAGCAGGAAAAAGGAAGATCTCATGGCAAGATCAAGTAGCTTTAAGAGCATGAAATTGTCTCTTATT
GAAAGTGCCATTCAAGATCTTTCATATGCATAGGTTACAAAACTTGGCTTCCTTGAAATTGAAATTCCATTATCATCCATCATTATTATTTTTTAATTGTGAATGTTCGG
GTTAGTTCATCTATGCTTCAATAAAGCATAATCTTAACCTGGAGAGAATTGTCTGATTCTATTATATTTTGGTAATGGAAAGTCGTAGGATATATGATATTTTGAGTAGG
CAACCATGGTTCAAATATGTGATCAAT
Protein sequenceShow/hide protein sequence
MKKARKPAAMDFDAYALFETPMIGTKHHRLLQDYEDLENATEVMKEELLIAKRKKSTLLAEVRFLRHRYEFLKNQSLNSQPKHGLEPPQNHEIRPPNAKKEKSSQKREAS
LKLLAEAQAQDLNQRGGIYDGMEAASRKSRLVFHMNQKPRMCSDNEVSMHNSSPIFNHKEILYRVHEAAADRNMTPVFDLNQISREEEELQAGFEPPRMEEGPKNSFLRS
ENDGKNSDLMISPMCRNVGSGSNRAGKRKISWQDQVALRA