; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C02G040190 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C02G040190
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionUnknown protein
Genome locationCla97Chr02:28137500..28140221
RNA-Seq ExpressionCla97C02G040190
SyntenyCla97C02G040190
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004144727.2 uncharacterized protein LOC101218708 isoform X1 [Cucumis sativus]1.1e-11076.22Show/hide
Query:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV
        MKKARKGVA +  ACALFESSM+G KHQSLLQDYEELHNETEAMK+KLLIAKRKK TLLDEV                 RFLRHRYE LK QPANIQPKV
Subjt:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV

Query:  GFERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMT
        GF+R RNLE+ PP  KKEKSSRKREASLKPLAQAHD+NQRGGIYNG+EA S+KSQ FFD+NQKS  CSKKEV V++SFP F+QKERV RAHE AAN NMT
Subjt:  GFERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMT

Query:  PVFDLNQISVPTTTSILASIREEEELQAGFEPLKMEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA
        PVFDLNQIS           REEEELQAGF+P+++EDEPKN+F RSEHDAKNS+LVLSSMCRND NGSNRAGKRKISWQDQVALRA
Subjt:  PVFDLNQISVPTTTSILASIREEEELQAGFEPLKMEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA

XP_008452766.1 PREDICTED: uncharacterized protein LOC103493683 [Cucumis melo]1.1e-11076.66Show/hide
Query:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV
        MKKARKGVA D  ACALFE+SM+G KHQSLLQDY+ELHNETEA+K+KLLIAKRKKATLLDEV                 RFLRHRYE LKNQPANIQPKV
Subjt:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV

Query:  GFERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMT
        GF+ +RNLE+ PPI KKEKSSRKREASLKPLAQAHDLNQRGGIYNG+EA S+KSQ FFD+NQKS  CSKKEV ++NSFP F+QKERV RAHE AAN NMT
Subjt:  GFERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMT

Query:  PVFDLNQISVPTTTSILASIREEEELQAGFEPLK-MEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA
        PVFDLNQIS           REEEE+QAGFEPL+ +EDE KN+F RSEHDAKNSDLVLSSMCRND NGSNRAGKRKISWQDQVALRA
Subjt:  PVFDLNQISVPTTTSILASIREEEELQAGFEPLK-MEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA

XP_022158137.1 uncharacterized protein LOC111024696 [Momordica charantia]2.0e-8866.32Show/hide
Query:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV
        MKKARK  AMD  A ALFE+ MIGTKH  LLQDYE+L N TE MKE+LLIAKRKK+TLL EV                 RFLRHRYEFLKNQ  N QPK 
Subjt:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV

Query:  GFERTRNLEIGPPITKKEKSSRKREASLKPL--AQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTN
        G E  +N EI PP  KKEKSS+KREASLK L  AQA DLNQRGGIY+GMEA S+KS+L F +NQK R+CS  EV++HNS PIF  KE + R HE AA+ N
Subjt:  GFERTRNLEIGPPITKKEKSSRKREASLKPL--AQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTN

Query:  MTPVFDLNQISVPTTTSILASIREEEELQAGFEPLKMEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA
        MTPVFDLNQIS           REEEELQAGFEP +ME+ PKN F RSE+D KNSDL++S MCRN G+GSNRAGKRKISWQDQVALRA
Subjt:  MTPVFDLNQISVPTTTSILASIREEEELQAGFEPLKMEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA

XP_031736367.1 uncharacterized protein LOC101218708 isoform X2 [Cucumis sativus]2.4e-8974.49Show/hide
Query:  MKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKVGFERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGI
        MK+KLLIAKRKK TLLDEV                 RFLRHRYE LK QPANIQPKVGF+R RNLE+ PP  KKEKSSRKREASLKPLAQAHD+NQRGGI
Subjt:  MKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKVGFERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGI

Query:  YNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMTPVFDLNQISVPTTTSILASIREEEELQAGFEPLKMEDEPKNMF
        YNG+EA S+KSQ FFD+NQKS  CSKKEV V++SFP F+QKERV RAHE AAN NMTPVFDLNQIS           REEEELQAGF+P+++EDEPKN+F
Subjt:  YNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMTPVFDLNQISVPTTTSILASIREEEELQAGFEPLKMEDEPKNMF

Query:  SRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA
         RSEHDAKNS+LVLSSMCRND NGSNRAGKRKISWQDQVALRA
Subjt:  SRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA

XP_038889534.1 uncharacterized protein LOC120079432 [Benincasa hispida]1.0e-10876.57Show/hide
Query:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV
        MKKARKG+A+D +ACALFE+SMIG KHQSLLQDYEEL NETEAMKEKLLIAKRKK TLL EV                 RFLRHRYE LKN+PAN QPKV
Subjt:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV

Query:  GFERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMT
         FE   +LEIGPPITKK KSSRK EASLKPLA+AHDLNQRGGIYNGMEAPS+KSQ FF+INQKSR+CSKKEVT+ +S PIF+QKERV R HE   + NMT
Subjt:  GFERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMT

Query:  PVFDLNQISVPTTTSILASIREEEELQAGFEPLKMEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA
        PVFDLNQIS           REEEELQAGFEPL+MEDE KNMFSRSEHDAKNSDLVLSSMCRNDGNGSN AGKRKISWQDQVALRA
Subjt:  PVFDLNQISVPTTTSILASIREEEELQAGFEPLKMEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA

TrEMBL top hitse value%identityAlignment
A0A0A0LH29 Uncharacterized protein5.3e-11176.22Show/hide
Query:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV
        MKKARKGVA +  ACALFESSM+G KHQSLLQDYEELHNETEAMK+KLLIAKRKK TLLDEV                 RFLRHRYE LK QPANIQPKV
Subjt:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV

Query:  GFERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMT
        GF+R RNLE+ PP  KKEKSSRKREASLKPLAQAHD+NQRGGIYNG+EA S+KSQ FFD+NQKS  CSKKEV V++SFP F+QKERV RAHE AAN NMT
Subjt:  GFERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMT

Query:  PVFDLNQISVPTTTSILASIREEEELQAGFEPLKMEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA
        PVFDLNQIS           REEEELQAGF+P+++EDEPKN+F RSEHDAKNS+LVLSSMCRND NGSNRAGKRKISWQDQVALRA
Subjt:  PVFDLNQISVPTTTSILASIREEEELQAGFEPLKMEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA

A0A1S3BVT4 uncharacterized protein LOC1034936835.3e-11176.66Show/hide
Query:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV
        MKKARKGVA D  ACALFE+SM+G KHQSLLQDY+ELHNETEA+K+KLLIAKRKKATLLDEV                 RFLRHRYE LKNQPANIQPKV
Subjt:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV

Query:  GFERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMT
        GF+ +RNLE+ PPI KKEKSSRKREASLKPLAQAHDLNQRGGIYNG+EA S+KSQ FFD+NQKS  CSKKEV ++NSFP F+QKERV RAHE AAN NMT
Subjt:  GFERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMT

Query:  PVFDLNQISVPTTTSILASIREEEELQAGFEPLK-MEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA
        PVFDLNQIS           REEEE+QAGFEPL+ +EDE KN+F RSEHDAKNSDLVLSSMCRND NGSNRAGKRKISWQDQVALRA
Subjt:  PVFDLNQISVPTTTSILASIREEEELQAGFEPLK-MEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA

A0A5A7URJ0 Uncharacterized protein5.3e-11176.66Show/hide
Query:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV
        MKKARKGVA D  ACALFE+SM+G KHQSLLQDY+ELHNETEA+K+KLLIAKRKKATLLDEV                 RFLRHRYE LKNQPANIQPKV
Subjt:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV

Query:  GFERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMT
        GF+ +RNLE+ PPI KKEKSSRKREASLKPLAQAHDLNQRGGIYNG+EA S+KSQ FFD+NQKS  CSKKEV ++NSFP F+QKERV RAHE AAN NMT
Subjt:  GFERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMT

Query:  PVFDLNQISVPTTTSILASIREEEELQAGFEPLK-MEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA
        PVFDLNQIS           REEEE+QAGFEPL+ +EDE KN+F RSEHDAKNSDLVLSSMCRND NGSNRAGKRKISWQDQVALRA
Subjt:  PVFDLNQISVPTTTSILASIREEEELQAGFEPLK-MEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA

A0A6J1DWE7 uncharacterized protein LOC1110246969.8e-8966.32Show/hide
Query:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV
        MKKARK  AMD  A ALFE+ MIGTKH  LLQDYE+L N TE MKE+LLIAKRKK+TLL EV                 RFLRHRYEFLKNQ  N QPK 
Subjt:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV

Query:  GFERTRNLEIGPPITKKEKSSRKREASLKPL--AQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTN
        G E  +N EI PP  KKEKSS+KREASLK L  AQA DLNQRGGIY+GMEA S+KS+L F +NQK R+CS  EV++HNS PIF  KE + R HE AA+ N
Subjt:  GFERTRNLEIGPPITKKEKSSRKREASLKPL--AQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTN

Query:  MTPVFDLNQISVPTTTSILASIREEEELQAGFEPLKMEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA
        MTPVFDLNQIS           REEEELQAGFEP +ME+ PKN F RSE+D KNSDL++S MCRN G+GSNRAGKRKISWQDQVALRA
Subjt:  MTPVFDLNQISVPTTTSILASIREEEELQAGFEPLKMEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA

A0A6J1G446 uncharacterized protein LOC1114505772.6e-7361.42Show/hide
Query:  MIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKVGFERTRNLEIGPPITKKEKSS
        MIG  H  LLQDY ELHNETEA KEKLLI K+KK TLL EV                 RFLRH+YE LKN P   QPK+GF+  +NL+I PP++KKE  S
Subjt:  MIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKVGFERTRNLEIGPPITKKEKSS

Query:  RKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMTPVFDLNQISVPTTTSILASIR
        +KRE        A +LN+RGGI +G EA ++K++L  D+NQKSR+CSKKE+ V + FP+  QKERV RAHE+A NTNMTPVFDLNQIS           R
Subjt:  RKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMTPVFDLNQISVPTTTSILASIR

Query:  EEEELQAGFEPLKM--EDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA
        EEEELQAGFEPL+   +DE KN+ SRSE DAKNSDL++SSMCRN GNGSNRAGKRKISWQD+VALRA
Subjt:  EEEELQAGFEPLKM--EDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G30630.1 unknown protein2.9e-0829.13Show/hide
Query:  ELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLK-NQPANIQPKVGFERTRNLEIGP-PITKKEKSSRKREASLKPLAQ
        EL  E E  +++L + K+K+ TL  EV                 RFLR RYE LK +Q     P    E  R  E G   + +K    RK+++ ++    
Subjt:  ELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLK-NQPANIQPKVGFERTRNLEIGP-PITKKEKSSRKREASLKPLAQ

Query:  AHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMTPVFDLNQISVPTTTSILASIREEEELQAGFEPL
          DL  +  I N  EA +  +    D+++K +     +V    +FP+              + T+  P FDLNQIS           REEEE +   E +
Subjt:  AHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMTPVFDLNQISVPTTTSILASIREEEELQAGFEPL

Query:  KMEDEPKNMFSRSEHDAKNSDLVLS---SMCRNDGNGSNRAGKRKISWQDQVAL
          E     M      D + SDL +     +C +     NRA KRK++WQD VAL
Subjt:  KMEDEPKNMFSRSEHDAKNSDLVLS---SMCRNDGNGSNRAGKRKISWQDQVAL

AT5G57910.1 unknown protein3.4e-0931.68Show/hide
Query:  FESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLK-NQPANIQPKVGFERTRNLEIGPPITK
        FE   +  +H SL+QDY ELH ETEAM+++L   + +KATL+ EV                 RFLR RY  L+ +QP  I      ++ R    G  I  
Subjt:  FESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLK-NQPANIQPKVGFERTRNLEIGPPITK

Query:  KEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNS
        +   S K EA  K ++   DLN     ++  +   ++    FD+NQ S    ++   V N+
Subjt:  KEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAAAGCTCGAAAAGGGGTGGCTATGGATTTACAAGCGTGTGCTCTGTTCGAGAGCTCGATGATTGGGACCAAACATCAAAGTCTCTTGCAGGATTACGAGGAGCT
GCATAACGAAACAGAAGCTATGAAGGAGAAACTATTGATCGCGAAGCGGAAAAAGGCAACCCTTTTGGATGAAGTTAGGAATTTGTTAATGTTGTTAACTGAGTCATGTT
TATTTCTCAACTCCTTTAGATTTTTGAGGCATAGATATGAATTTTTGAAGAACCAGCCTGCAAACATTCAACCAAAGGTTGGTTTCGAGCGGACACGAAACCTTGAAATC
GGACCTCCTATAACGAAGAAAGAAAAGAGTTCTCGGAAAAGAGAAGCTTCTTTGAAACCCCTTGCTCAGGCTCACGACTTAAACCAAAGGGGAGGAATCTACAATGGGAT
GGAAGCCCCTTCTCAAAAATCTCAGTTGTTTTTCGACATAAACCAGAAGTCAAGGGTGTGCAGCAAGAAGGAAGTCACTGTACACAATTCTTTTCCTATTTTTGAACAAA
AGGAGAGAGTATGCAGAGCACATGAAATTGCTGCCAACACGAATATGACCCCGGTTTTCGACCTTAACCAGATCTCGGTACCTACCACGACTTCAATTCTTGCATCTATA
AGGGAGGAAGAAGAACTGCAAGCTGGTTTTGAACCACTGAAAATGGAGGACGAGCCGAAGAATATGTTTTCAAGAAGCGAACACGATGCGAAGAACAGTGACTTGGTGTT
ATCATCAATGTGTAGGAATGATGGTAATGGATCAAACAGAGCAGGAAAAAGGAAGATCTCATGGCAAGATCAAGTGGCCTTAAGAGCATGA
mRNA sequenceShow/hide mRNA sequence
AAAAAAGAAGAGGCCTTTAAAGCAAAGCAAAGCAAACCACTATCCACAAACATCATCTGAATCACTCCCACAATTCACTGAATATTTGTACTCAACAAAATTTTTCACTC
TCCACCCATCAATGGAATCCCCTTTCTATATCTCATCAATGGCCCAGTTGGACCAAACCAATTTTTATGCCCAACAAATTCCTAAATCAATCAGAGAAAAATGGCTAAAT
TTCTCCTCTCTCTCAACTTGCCTGAATCGCCCTGTGATTTGGGCTTTTTTTTGCCTCCCTTCTTAGTTGTCTTCTAGGTTAAGATTTAACCAAAACCCACCCTCCAATTC
TTTTCTCCGCCATTGTTTTTCCTTCTTTCTCTTCATTCTCTTTCCTGGGTCTCTTCTTTTTTCTGTCATCGTTTCGATTTTTCTTTTTCAATCGGATGAAGAAAGCTCGA
AAAGGGGTGGCTATGGATTTACAAGCGTGTGCTCTGTTCGAGAGCTCGATGATTGGGACCAAACATCAAAGTCTCTTGCAGGATTACGAGGAGCTGCATAACGAAACAGA
AGCTATGAAGGAGAAACTATTGATCGCGAAGCGGAAAAAGGCAACCCTTTTGGATGAAGTTAGGAATTTGTTAATGTTGTTAACTGAGTCATGTTTATTTCTCAACTCCT
TTAGATTTTTGAGGCATAGATATGAATTTTTGAAGAACCAGCCTGCAAACATTCAACCAAAGGTTGGTTTCGAGCGGACACGAAACCTTGAAATCGGACCTCCTATAACG
AAGAAAGAAAAGAGTTCTCGGAAAAGAGAAGCTTCTTTGAAACCCCTTGCTCAGGCTCACGACTTAAACCAAAGGGGAGGAATCTACAATGGGATGGAAGCCCCTTCTCA
AAAATCTCAGTTGTTTTTCGACATAAACCAGAAGTCAAGGGTGTGCAGCAAGAAGGAAGTCACTGTACACAATTCTTTTCCTATTTTTGAACAAAAGGAGAGAGTATGCA
GAGCACATGAAATTGCTGCCAACACGAATATGACCCCGGTTTTCGACCTTAACCAGATCTCGGTACCTACCACGACTTCAATTCTTGCATCTATAAGGGAGGAAGAAGAA
CTGCAAGCTGGTTTTGAACCACTGAAAATGGAGGACGAGCCGAAGAATATGTTTTCAAGAAGCGAACACGATGCGAAGAACAGTGACTTGGTGTTATCATCAATGTGTAG
GAATGATGGTAATGGATCAAACAGAGCAGGAAAAAGGAAGATCTCATGGCAAGATCAAGTGGCCTTAAGAGCATGAAGTTCTCTATTATTGAAAGGACCCATTCAGGTTC
CTTCATTGGCATAGGTGGGAAAACTGAGATTCCTTGAAATTCCATTATTATTATTGTTGTTATTATTCAACTCGAAGTTTAGAAAATTGTGACCCTACAATATTTGGAAG
TTTAGAAAGTCATTACAAACAGATCCTATAAAACTATACTGATACCGTAGACTCATAAATAGCAGGTAATCATGATTGAAACTTTTGAAGAACTTTTGAAATTACGATTT
CTCCGTCTTAAGAACTATCTTTGTGAATTCTTTAGATAGAGATGAAATAAAACCATCTTAGGAACTATCTTTGTGAATTCTTTAGATAGGGATGAAATAAAA
Protein sequenceShow/hide protein sequence
MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKVGFERTRNLEI
GPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMTPVFDLNQISVPTTTSILASI
REEEELQAGFEPLKMEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA