; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc02G15330 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc02G15330
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionUnknown protein
Genome locationClcChr02:27952115..27954839
RNA-Seq ExpressionClc02G15330
SyntenyClc02G15330
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004144727.2 uncharacterized protein LOC101218708 isoform X1 [Cucumis sativus]5.5e-11075.87Show/hide
Query:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKWKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV
        MKKARKGVA +  ACALFESSM+G KHQSLLQDYEELHNETEAMK+KLLIAK KK TLLDEV                 RFLRHRYE LK QPANIQPKV
Subjt:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKWKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV

Query:  GFERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMT
        GF+R RNLE+ PP  KKEKSSRKREASLKPLAQAHD+NQRGGIYNG+EA S+KSQ FFD+NQKS  CSKKEV V++SFP F+QKERV RAHE AAN NMT
Subjt:  GFERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMT

Query:  PVFDLNQISVPTTTSILASIREEEELQAGFEPLKMEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA
        PVFDLNQIS           REEEELQAGF+P+++EDEPKN+F RSEHDAKNS+LVLSSMCRND NGSNRAGKRKISWQDQVALRA
Subjt:  PVFDLNQISVPTTTSILASIREEEELQAGFEPLKMEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA

XP_008452766.1 PREDICTED: uncharacterized protein LOC103493683 [Cucumis melo]5.5e-11076.31Show/hide
Query:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKWKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV
        MKKARKGVA D  ACALFE+SM+G KHQSLLQDY+ELHNETEA+K+KLLIAK KKATLLDEV                 RFLRHRYE LKNQPANIQPKV
Subjt:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKWKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV

Query:  GFERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMT
        GF+ +RNLE+ PPI KKEKSSRKREASLKPLAQAHDLNQRGGIYNG+EA S+KSQ FFD+NQKS  CSKKEV ++NSFP F+QKERV RAHE AAN NMT
Subjt:  GFERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMT

Query:  PVFDLNQISVPTTTSILASIREEEELQAGFEPLK-MEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA
        PVFDLNQIS           REEEE+QAGFEPL+ +EDE KN+F RSEHDAKNSDLVLSSMCRND NGSNRAGKRKISWQDQVALRA
Subjt:  PVFDLNQISVPTTTSILASIREEEELQAGFEPLK-MEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA

XP_022158137.1 uncharacterized protein LOC111024696 [Momordica charantia]1.0e-8765.97Show/hide
Query:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKWKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV
        MKKARK  AMD  A ALFE+ MIGTKH  LLQDYE+L N TE MKE+LLIAK KK+TLL EV                 RFLRHRYEFLKNQ  N QPK 
Subjt:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKWKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV

Query:  GFERTRNLEIGPPITKKEKSSRKREASLKPL--AQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTN
        G E  +N EI PP  KKEKSS+KREASLK L  AQA DLNQRGGIY+GMEA S+KS+L F +NQK R+CS  EV++HNS PIF  KE + R HE AA+ N
Subjt:  GFERTRNLEIGPPITKKEKSSRKREASLKPL--AQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTN

Query:  MTPVFDLNQISVPTTTSILASIREEEELQAGFEPLKMEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA
        MTPVFDLNQIS           REEEELQAGFEP +ME+ PKN F RSE+D KNSDL++S MCRN G+GSNRAGKRKISWQDQVALRA
Subjt:  MTPVFDLNQISVPTTTSILASIREEEELQAGFEPLKMEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA

XP_031736367.1 uncharacterized protein LOC101218708 isoform X2 [Cucumis sativus]1.5e-8874.07Show/hide
Query:  MKEKLLIAKWKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKVGFERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGI
        MK+KLLIAK KK TLLDEV                 RFLRHRYE LK QPANIQPKVGF+R RNLE+ PP  KKEKSSRKREASLKPLAQAHD+NQRGGI
Subjt:  MKEKLLIAKWKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKVGFERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGI

Query:  YNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMTPVFDLNQISVPTTTSILASIREEEELQAGFEPLKMEDEPKNMF
        YNG+EA S+KSQ FFD+NQKS  CSKKEV V++SFP F+QKERV RAHE AAN NMTPVFDLNQIS           REEEELQAGF+P+++EDEPKN+F
Subjt:  YNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMTPVFDLNQISVPTTTSILASIREEEELQAGFEPLKMEDEPKNMF

Query:  SRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA
         RSEHDAKNS+LVLSSMCRND NGSNRAGKRKISWQDQVALRA
Subjt:  SRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA

XP_038889534.1 uncharacterized protein LOC120079432 [Benincasa hispida]5.1e-10876.22Show/hide
Query:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKWKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV
        MKKARKG+A+D +ACALFE+SMIG KHQSLLQDYEEL NETEAMKEKLLIAK KK TLL EV                 RFLRHRYE LKN+PAN QPKV
Subjt:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKWKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV

Query:  GFERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMT
         FE   +LEIGPPITKK KSSRK EASLKPLA+AHDLNQRGGIYNGMEAPS+KSQ FF+INQKSR+CSKKEVT+ +S PIF+QKERV R HE   + NMT
Subjt:  GFERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMT

Query:  PVFDLNQISVPTTTSILASIREEEELQAGFEPLKMEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA
        PVFDLNQIS           REEEELQAGFEPL+MEDE KNMFSRSEHDAKNSDLVLSSMCRNDGNGSN AGKRKISWQDQVALRA
Subjt:  PVFDLNQISVPTTTSILASIREEEELQAGFEPLKMEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA

TrEMBL top hitse value%identityAlignment
A0A0A0LH29 Uncharacterized protein2.6e-11075.87Show/hide
Query:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKWKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV
        MKKARKGVA +  ACALFESSM+G KHQSLLQDYEELHNETEAMK+KLLIAK KK TLLDEV                 RFLRHRYE LK QPANIQPKV
Subjt:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKWKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV

Query:  GFERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMT
        GF+R RNLE+ PP  KKEKSSRKREASLKPLAQAHD+NQRGGIYNG+EA S+KSQ FFD+NQKS  CSKKEV V++SFP F+QKERV RAHE AAN NMT
Subjt:  GFERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMT

Query:  PVFDLNQISVPTTTSILASIREEEELQAGFEPLKMEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA
        PVFDLNQIS           REEEELQAGF+P+++EDEPKN+F RSEHDAKNS+LVLSSMCRND NGSNRAGKRKISWQDQVALRA
Subjt:  PVFDLNQISVPTTTSILASIREEEELQAGFEPLKMEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA

A0A1S3BVT4 uncharacterized protein LOC1034936832.6e-11076.31Show/hide
Query:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKWKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV
        MKKARKGVA D  ACALFE+SM+G KHQSLLQDY+ELHNETEA+K+KLLIAK KKATLLDEV                 RFLRHRYE LKNQPANIQPKV
Subjt:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKWKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV

Query:  GFERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMT
        GF+ +RNLE+ PPI KKEKSSRKREASLKPLAQAHDLNQRGGIYNG+EA S+KSQ FFD+NQKS  CSKKEV ++NSFP F+QKERV RAHE AAN NMT
Subjt:  GFERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMT

Query:  PVFDLNQISVPTTTSILASIREEEELQAGFEPLK-MEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA
        PVFDLNQIS           REEEE+QAGFEPL+ +EDE KN+F RSEHDAKNSDLVLSSMCRND NGSNRAGKRKISWQDQVALRA
Subjt:  PVFDLNQISVPTTTSILASIREEEELQAGFEPLK-MEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA

A0A5A7URJ0 Uncharacterized protein2.6e-11076.31Show/hide
Query:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKWKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV
        MKKARKGVA D  ACALFE+SM+G KHQSLLQDY+ELHNETEA+K+KLLIAK KKATLLDEV                 RFLRHRYE LKNQPANIQPKV
Subjt:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKWKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV

Query:  GFERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMT
        GF+ +RNLE+ PPI KKEKSSRKREASLKPLAQAHDLNQRGGIYNG+EA S+KSQ FFD+NQKS  CSKKEV ++NSFP F+QKERV RAHE AAN NMT
Subjt:  GFERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMT

Query:  PVFDLNQISVPTTTSILASIREEEELQAGFEPLK-MEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA
        PVFDLNQIS           REEEE+QAGFEPL+ +EDE KN+F RSEHDAKNSDLVLSSMCRND NGSNRAGKRKISWQDQVALRA
Subjt:  PVFDLNQISVPTTTSILASIREEEELQAGFEPLK-MEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA

A0A6J1DWE7 uncharacterized protein LOC1110246964.9e-8865.97Show/hide
Query:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKWKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV
        MKKARK  AMD  A ALFE+ MIGTKH  LLQDYE+L N TE MKE+LLIAK KK+TLL EV                 RFLRHRYEFLKNQ  N QPK 
Subjt:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKWKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKV

Query:  GFERTRNLEIGPPITKKEKSSRKREASLKPL--AQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTN
        G E  +N EI PP  KKEKSS+KREASLK L  AQA DLNQRGGIY+GMEA S+KS+L F +NQK R+CS  EV++HNS PIF  KE + R HE AA+ N
Subjt:  GFERTRNLEIGPPITKKEKSSRKREASLKPL--AQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTN

Query:  MTPVFDLNQISVPTTTSILASIREEEELQAGFEPLKMEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA
        MTPVFDLNQIS           REEEELQAGFEP +ME+ PKN F RSE+D KNSDL++S MCRN G+GSNRAGKRKISWQDQVALRA
Subjt:  MTPVFDLNQISVPTTTSILASIREEEELQAGFEPLKMEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA

A0A6J1G446 uncharacterized protein LOC1114505773.4e-7361.42Show/hide
Query:  MIGTKHQSLLQDYEELHNETEAMKEKLLIAKWKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKVGFERTRNLEIGPPITKKEKSS
        MIG  H  LLQDY ELHNETEA KEKLLI K KK TLL EV                 RFLRH+YE LKN P   QPK+GF+  +NL+I PP++KKE  S
Subjt:  MIGTKHQSLLQDYEELHNETEAMKEKLLIAKWKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKVGFERTRNLEIGPPITKKEKSS

Query:  RKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMTPVFDLNQISVPTTTSILASIR
        +KRE        A +LN+RGGI +G EA ++K++L  D+NQKSR+CSKKE+ V + FP+  QKERV RAHE+A NTNMTPVFDLNQIS           R
Subjt:  RKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMTPVFDLNQISVPTTTSILASIR

Query:  EEEELQAGFEPLKM--EDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA
        EEEELQAGFEPL+   +DE KN+ SRSE DAKNSDL++SSMCRN GNGSNRAGKRKISWQD+VALRA
Subjt:  EEEELQAGFEPLKM--EDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G30630.1 unknown protein3.8e-0829.13Show/hide
Query:  ELHNETEAMKEKLLIAKWKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLK-NQPANIQPKVGFERTRNLEIGP-PITKKEKSSRKREASLKPLAQ
        EL  E E  +++L + K K+ TL  EV                 RFLR RYE LK +Q     P    E  R  E G   + +K    RK+++ ++    
Subjt:  ELHNETEAMKEKLLIAKWKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLK-NQPANIQPKVGFERTRNLEIGP-PITKKEKSSRKREASLKPLAQ

Query:  AHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMTPVFDLNQISVPTTTSILASIREEEELQAGFEPL
          DL  +  I N  EA +  +    D+++K +     +V    +FP+              + T+  P FDLNQIS           REEEE +   E +
Subjt:  AHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMTPVFDLNQISVPTTTSILASIREEEELQAGFEPL

Query:  KMEDEPKNMFSRSEHDAKNSDLVLS---SMCRNDGNGSNRAGKRKISWQDQVAL
          E     M      D + SDL +     +C +     NRA KRK++WQD VAL
Subjt:  KMEDEPKNMFSRSEHDAKNSDLVLS---SMCRNDGNGSNRAGKRKISWQDQVAL

AT5G57910.1 unknown protein4.4e-0931.68Show/hide
Query:  FESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKWKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLK-NQPANIQPKVGFERTRNLEIGPPITK
        FE   +  +H SL+QDY ELH ETEAM+++L   + +KATL+ EV                 RFLR RY  L+ +QP  I      ++ R    G  I  
Subjt:  FESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKWKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLK-NQPANIQPKVGFERTRNLEIGPPITK

Query:  KEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNS
        +   S K EA  K ++   DLN     ++  +   ++    FD+NQ S    ++   V N+
Subjt:  KEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAAAGCTCGAAAAGGGGTGGCTATGGATTTACAAGCGTGTGCTCTGTTCGAGAGCTCGATGATTGGGACCAAACATCAAAGTCTCTTGCAGGATTACGAGGAGCT
GCATAACGAAACAGAAGCTATGAAGGAGAAACTATTGATCGCGAAGTGGAAAAAGGCAACCCTTTTGGATGAAGTTAGGAATTTGTTAATGTTGTTAACTGAGTCATGTT
TATTTCTCAACTCCTTTAGATTTTTGAGGCATAGATATGAATTTTTGAAGAACCAGCCTGCAAACATTCAACCAAAGGTTGGTTTCGAGCGGACACGAAACCTTGAAATC
GGACCTCCTATAACGAAGAAAGAAAAGAGTTCTCGGAAAAGAGAAGCTTCTTTGAAACCCCTTGCTCAGGCTCACGACTTAAACCAAAGGGGAGGAATCTACAATGGGAT
GGAAGCCCCTTCTCAAAAATCTCAGTTGTTTTTCGACATAAACCAGAAGTCAAGGGTGTGCAGCAAGAAGGAAGTCACTGTACACAATTCTTTTCCTATTTTTGAACAAA
AGGAGAGAGTATGCAGAGCACATGAAATTGCTGCCAACACGAATATGACCCCGGTTTTCGACCTTAACCAGATCTCGGTACCTACCACGACTTCAATTCTTGCATCTATA
AGGGAGGAAGAAGAACTGCAAGCTGGTTTTGAACCACTGAAAATGGAGGACGAGCCGAAGAATATGTTTTCAAGAAGCGAACACGATGCGAAGAACAGTGACTTGGTGTT
ATCATCAATGTGTAGGAATGATGGTAATGGATCAAACAGAGCAGGAAAAAGGAAGATCTCATGGCAAGATCAAGTGGCCTTAAGAGCATGA
mRNA sequenceShow/hide mRNA sequence
AAAAAAGAAGAGGCCTTTAAAGCAAAGCAAAGCAAACCACTATCCACAAACATCATCTGAATCACTCCCACAATTCACTGAATATTTGTACTCAACAAAATTTTTCACTC
TCCACCCATCAATGGAATCCCCTTTCTATATCTCATCAATGGCCCAGTTGGACCAAACCAATTTTTATGCCCAACAAATTCCTAAATCAATCAGAGAAAAATGGCTAAAT
TTCTCCTCTCTCTCAACTTGCCTGAATCGCCCTGTGATTTGGGCTTTTTTTTGCCTCCCTTCTTAGTTGTCTTCTAGGTTAAGATTTAACCAAAACCCACCCTCCAATTC
TTTTCTCCGCCATTGTTTTTCCTTCTTTCTCTTCATTCTCTTTCCTGGGTCTCTTCTTTTTTCTGTCATCGTTTCGATTTTTCTTTTTCAATCGGATGAAGAAAGCTCGA
AAAGGGGTGGCTATGGATTTACAAGCGTGTGCTCTGTTCGAGAGCTCGATGATTGGGACCAAACATCAAAGTCTCTTGCAGGATTACGAGGAGCTGCATAACGAAACAGA
AGCTATGAAGGAGAAACTATTGATCGCGAAGTGGAAAAAGGCAACCCTTTTGGATGAAGTTAGGAATTTGTTAATGTTGTTAACTGAGTCATGTTTATTTCTCAACTCCT
TTAGATTTTTGAGGCATAGATATGAATTTTTGAAGAACCAGCCTGCAAACATTCAACCAAAGGTTGGTTTCGAGCGGACACGAAACCTTGAAATCGGACCTCCTATAACG
AAGAAAGAAAAGAGTTCTCGGAAAAGAGAAGCTTCTTTGAAACCCCTTGCTCAGGCTCACGACTTAAACCAAAGGGGAGGAATCTACAATGGGATGGAAGCCCCTTCTCA
AAAATCTCAGTTGTTTTTCGACATAAACCAGAAGTCAAGGGTGTGCAGCAAGAAGGAAGTCACTGTACACAATTCTTTTCCTATTTTTGAACAAAAGGAGAGAGTATGCA
GAGCACATGAAATTGCTGCCAACACGAATATGACCCCGGTTTTCGACCTTAACCAGATCTCGGTACCTACCACGACTTCAATTCTTGCATCTATAAGGGAGGAAGAAGAA
CTGCAAGCTGGTTTTGAACCACTGAAAATGGAGGACGAGCCGAAGAATATGTTTTCAAGAAGCGAACACGATGCGAAGAACAGTGACTTGGTGTTATCATCAATGTGTAG
GAATGATGGTAATGGATCAAACAGAGCAGGAAAAAGGAAGATCTCATGGCAAGATCAAGTGGCCTTAAGAGCATGAAGTTCTCTATTATTGAAAGGACCCATTCAGGTTC
CTTCATTTGCATAGGTGGGAAAACTGAGATTCCTTGAAATTCCATTATTATTATTGTTGTTATTATTATTCAACTCGAAGTTTAGAAAATTGTGACCCTACAATATTTGG
AAGTTTAGAAAGTCATTACAAACAGATCCTATAAAACTATACTGATACCGTAGACTCATAAATAGCAGGTAATCATGATTGAAACTTTTGAAGAACTTTTGAAATTACGA
TTTCTCCGTCTTAAGAACTATCTTTGTGAATTCTTTAGATAGAGATGAAATAAAACCATCTTAGGAACTATCTTTGTGAATTCTTTAGATAGGGATGAAATAAAA
Protein sequenceShow/hide protein sequence
MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKWKKATLLDEVRNLLMLLTESCLFLNSFRFLRHRYEFLKNQPANIQPKVGFERTRNLEI
GPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMTPVFDLNQISVPTTTSILASI
REEEELQAGFEPLKMEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA