; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmUC02G040040 (gene) of Watermelon (USVL531) v1 genome

Gene IDCmUC02G040040
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
DescriptionUnknown protein
Genome locationCmU531Chr02:27858739..27861460
RNA-Seq ExpressionCmUC02G040040
SyntenyCmUC02G040040
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004144727.2 uncharacterized protein LOC101218708 isoform X1 [Cucumis sativus]8.4e-11176.49Show/hide
Query:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNFRFLRHRYEFLKNQPANIQPKVG
        MKKARKGVA +  ACALFESSM+G KHQSLLQDYEELHNETEAMK+KLLIAKRKK TLLDEV                RFLRHRYE LK QPANIQPKVG
Subjt:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNFRFLRHRYEFLKNQPANIQPKVG

Query:  FERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMTP
        F+R RNLE+ PP  KKEKSSRKREASLKPLAQAHD+NQRGGIYNG+EA S+KSQ FFD+NQKS  CSKKEV V++SFP F+QKERV RAHE AAN NMTP
Subjt:  FERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMTP

Query:  VFDLNQISVPTTTSILASIREEEELQAGFEPLKMEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA
        VFDLNQIS           REEEELQAGF+P+++EDEPKN+F RSEHDAKNS+LVLSSMCRND NGSNRAGKRKISWQDQVALRA
Subjt:  VFDLNQISVPTTTSILASIREEEELQAGFEPLKMEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA

XP_008452766.1 PREDICTED: uncharacterized protein LOC103493683 [Cucumis melo]8.4e-11176.92Show/hide
Query:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNFRFLRHRYEFLKNQPANIQPKVG
        MKKARKGVA D  ACALFE+SM+G KHQSLLQDY+ELHNETEA+K+KLLIAKRKKATLLDEV                RFLRHRYE LKNQPANIQPKVG
Subjt:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNFRFLRHRYEFLKNQPANIQPKVG

Query:  FERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMTP
        F+ +RNLE+ PPI KKEKSSRKREASLKPLAQAHDLNQRGGIYNG+EA S+KSQ FFD+NQKS  CSKKEV ++NSFP F+QKERV RAHE AAN NMTP
Subjt:  FERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMTP

Query:  VFDLNQISVPTTTSILASIREEEELQAGFEPLK-MEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA
        VFDLNQIS           REEEE+QAGFEPL+ +EDE KN+F RSEHDAKNSDLVLSSMCRND NGSNRAGKRKISWQDQVALRA
Subjt:  VFDLNQISVPTTTSILASIREEEELQAGFEPLK-MEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA

XP_022158137.1 uncharacterized protein LOC111024696 [Momordica charantia]1.5e-8866.55Show/hide
Query:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNFRFLRHRYEFLKNQPANIQPKVG
        MKKARK  AMD  A ALFE+ MIGTKH  LLQDYE+L N TE MKE+LLIAKRKK+TLL EV                RFLRHRYEFLKNQ  N QPK G
Subjt:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNFRFLRHRYEFLKNQPANIQPKVG

Query:  FERTRNLEIGPPITKKEKSSRKREASLKPL--AQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNM
         E  +N EI PP  KKEKSS+KREASLK L  AQA DLNQRGGIY+GMEA S+KS+L F +NQK R+CS  EV++HNS PIF  KE + R HE AA+ NM
Subjt:  FERTRNLEIGPPITKKEKSSRKREASLKPL--AQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNM

Query:  TPVFDLNQISVPTTTSILASIREEEELQAGFEPLKMEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA
        TPVFDLNQIS           REEEELQAGFEP +ME+ PKN F RSE+D KNSDL++S MCRN G+GSNRAGKRKISWQDQVALRA
Subjt:  TPVFDLNQISVPTTTSILASIREEEELQAGFEPLKMEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA

XP_031736367.1 uncharacterized protein LOC101218708 isoform X2 [Cucumis sativus]1.8e-8974.79Show/hide
Query:  MKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNFRFLRHRYEFLKNQPANIQPKVGFERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIY
        MK+KLLIAKRKK TLLDEV                RFLRHRYE LK QPANIQPKVGF+R RNLE+ PP  KKEKSSRKREASLKPLAQAHD+NQRGGIY
Subjt:  MKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNFRFLRHRYEFLKNQPANIQPKVGFERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIY

Query:  NGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMTPVFDLNQISVPTTTSILASIREEEELQAGFEPLKMEDEPKNMFS
        NG+EA S+KSQ FFD+NQKS  CSKKEV V++SFP F+QKERV RAHE AAN NMTPVFDLNQIS           REEEELQAGF+P+++EDEPKN+F 
Subjt:  NGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMTPVFDLNQISVPTTTSILASIREEEELQAGFEPLKMEDEPKNMFS

Query:  RSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA
        RSEHDAKNS+LVLSSMCRND NGSNRAGKRKISWQDQVALRA
Subjt:  RSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA

XP_038889534.1 uncharacterized protein LOC120079432 [Benincasa hispida]7.9e-10976.84Show/hide
Query:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNFRFLRHRYEFLKNQPANIQPKVG
        MKKARKG+A+D +ACALFE+SMIG KHQSLLQDYEEL NETEAMKEKLLIAKRKK TLL EV                RFLRHRYE LKN+PAN QPKV 
Subjt:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNFRFLRHRYEFLKNQPANIQPKVG

Query:  FERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMTP
        FE   +LEIGPPITKK KSSRK EASLKPLA+AHDLNQRGGIYNGMEAPS+KSQ FF+INQKSR+CSKKEVT+ +S PIF+QKERV R HE   + NMTP
Subjt:  FERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMTP

Query:  VFDLNQISVPTTTSILASIREEEELQAGFEPLKMEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA
        VFDLNQIS           REEEELQAGFEPL+MEDE KNMFSRSEHDAKNSDLVLSSMCRNDGNGSN AGKRKISWQDQVALRA
Subjt:  VFDLNQISVPTTTSILASIREEEELQAGFEPLKMEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA

TrEMBL top hitse value%identityAlignment
A0A0A0LH29 Uncharacterized protein4.1e-11176.49Show/hide
Query:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNFRFLRHRYEFLKNQPANIQPKVG
        MKKARKGVA +  ACALFESSM+G KHQSLLQDYEELHNETEAMK+KLLIAKRKK TLLDEV                RFLRHRYE LK QPANIQPKVG
Subjt:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNFRFLRHRYEFLKNQPANIQPKVG

Query:  FERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMTP
        F+R RNLE+ PP  KKEKSSRKREASLKPLAQAHD+NQRGGIYNG+EA S+KSQ FFD+NQKS  CSKKEV V++SFP F+QKERV RAHE AAN NMTP
Subjt:  FERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMTP

Query:  VFDLNQISVPTTTSILASIREEEELQAGFEPLKMEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA
        VFDLNQIS           REEEELQAGF+P+++EDEPKN+F RSEHDAKNS+LVLSSMCRND NGSNRAGKRKISWQDQVALRA
Subjt:  VFDLNQISVPTTTSILASIREEEELQAGFEPLKMEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA

A0A1S3BVT4 uncharacterized protein LOC1034936834.1e-11176.92Show/hide
Query:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNFRFLRHRYEFLKNQPANIQPKVG
        MKKARKGVA D  ACALFE+SM+G KHQSLLQDY+ELHNETEA+K+KLLIAKRKKATLLDEV                RFLRHRYE LKNQPANIQPKVG
Subjt:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNFRFLRHRYEFLKNQPANIQPKVG

Query:  FERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMTP
        F+ +RNLE+ PPI KKEKSSRKREASLKPLAQAHDLNQRGGIYNG+EA S+KSQ FFD+NQKS  CSKKEV ++NSFP F+QKERV RAHE AAN NMTP
Subjt:  FERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMTP

Query:  VFDLNQISVPTTTSILASIREEEELQAGFEPLK-MEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA
        VFDLNQIS           REEEE+QAGFEPL+ +EDE KN+F RSEHDAKNSDLVLSSMCRND NGSNRAGKRKISWQDQVALRA
Subjt:  VFDLNQISVPTTTSILASIREEEELQAGFEPLK-MEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA

A0A5A7URJ0 Uncharacterized protein4.1e-11176.92Show/hide
Query:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNFRFLRHRYEFLKNQPANIQPKVG
        MKKARKGVA D  ACALFE+SM+G KHQSLLQDY+ELHNETEA+K+KLLIAKRKKATLLDEV                RFLRHRYE LKNQPANIQPKVG
Subjt:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNFRFLRHRYEFLKNQPANIQPKVG

Query:  FERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMTP
        F+ +RNLE+ PPI KKEKSSRKREASLKPLAQAHDLNQRGGIYNG+EA S+KSQ FFD+NQKS  CSKKEV ++NSFP F+QKERV RAHE AAN NMTP
Subjt:  FERTRNLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMTP

Query:  VFDLNQISVPTTTSILASIREEEELQAGFEPLK-MEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA
        VFDLNQIS           REEEE+QAGFEPL+ +EDE KN+F RSEHDAKNSDLVLSSMCRND NGSNRAGKRKISWQDQVALRA
Subjt:  VFDLNQISVPTTTSILASIREEEELQAGFEPLK-MEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA

A0A6J1DWE7 uncharacterized protein LOC1110246967.5e-8966.55Show/hide
Query:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNFRFLRHRYEFLKNQPANIQPKVG
        MKKARK  AMD  A ALFE+ MIGTKH  LLQDYE+L N TE MKE+LLIAKRKK+TLL EV                RFLRHRYEFLKNQ  N QPK G
Subjt:  MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNFRFLRHRYEFLKNQPANIQPKVG

Query:  FERTRNLEIGPPITKKEKSSRKREASLKPL--AQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNM
         E  +N EI PP  KKEKSS+KREASLK L  AQA DLNQRGGIY+GMEA S+KS+L F +NQK R+CS  EV++HNS PIF  KE + R HE AA+ NM
Subjt:  FERTRNLEIGPPITKKEKSSRKREASLKPL--AQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNM

Query:  TPVFDLNQISVPTTTSILASIREEEELQAGFEPLKMEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA
        TPVFDLNQIS           REEEELQAGFEP +ME+ PKN F RSE+D KNSDL++S MCRN G+GSNRAGKRKISWQDQVALRA
Subjt:  TPVFDLNQISVPTTTSILASIREEEELQAGFEPLKMEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA

A0A6J1G446 uncharacterized protein LOC1114505772.0e-7361.65Show/hide
Query:  MIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNFRFLRHRYEFLKNQPANIQPKVGFERTRNLEIGPPITKKEKSSR
        MIG  H  LLQDY ELHNETEA KEKLLI K+KK TLL EV                RFLRH+YE LKN P   QPK+GF+  +NL+I PP++KKE  S+
Subjt:  MIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNFRFLRHRYEFLKNQPANIQPKVGFERTRNLEIGPPITKKEKSSR

Query:  KREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMTPVFDLNQISVPTTTSILASIRE
        KRE        A +LN+RGGI +G EA ++K++L  D+NQKSR+CSKKE+ V + FP+  QKERV RAHE+A NTNMTPVFDLNQIS           RE
Subjt:  KREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMTPVFDLNQISVPTTTSILASIRE

Query:  EEELQAGFEPLKM--EDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA
        EEELQAGFEPL+   +DE KN+ SRSE DAKNSDL++SSMCRN GNGSNRAGKRKISWQD+VALRA
Subjt:  EEELQAGFEPLKM--EDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G30630.1 unknown protein1.7e-0829.25Show/hide
Query:  ELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNFRFLRHRYEFLK-NQPANIQPKVGFERTRNLEIGP-PITKKEKSSRKREASLKPLAQA
        EL  E E  +++L + K+K+ TL  EV                RFLR RYE LK +Q     P    E  R  E G   + +K    RK+++ ++     
Subjt:  ELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNFRFLRHRYEFLK-NQPANIQPKVGFERTRNLEIGP-PITKKEKSSRKREASLKPLAQA

Query:  HDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMTPVFDLNQISVPTTTSILASIREEEELQAGFEPLK
         DL  +  I N  EA +  +    D+++K +     +V    +FP+              + T+  P FDLNQIS           REEEE +   E + 
Subjt:  HDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMTPVFDLNQISVPTTTSILASIREEEELQAGFEPLK

Query:  MEDEPKNMFSRSEHDAKNSDLVLS---SMCRNDGNGSNRAGKRKISWQDQVAL
         E     M      D + SDL +     +C +     NRA KRK++WQD VAL
Subjt:  MEDEPKNMFSRSEHDAKNSDLVLS---SMCRNDGNGSNRAGKRKISWQDQVAL

AT5G57910.1 unknown protein2.0e-0931.87Show/hide
Query:  FESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNFRFLRHRYEFLK-NQPANIQPKVGFERTRNLEIGPPITKK
        FE   +  +H SL+QDY ELH ETEAM+++L   + +KATL+ EV                RFLR RY  L+ +QP  I      ++ R    G  I  +
Subjt:  FESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNFRFLRHRYEFLK-NQPANIQPKVGFERTRNLEIGPPITKK

Query:  EKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNS
           S K EA  K ++   DLN     ++  +   ++    FD+NQ S    ++   V N+
Subjt:  EKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAAAGCTCGAAAAGGGGTGGCTATGGATTTACAAGCGTGTGCTCTGTTCGAGAGCTCGATGATTGGGACCAAACATCAAAGTCTCTTGCAGGATTACGAG
GAGCTGCATAACGAAACAGAAGCTATGAAGGAGAAACTATTGATCGCGAAGCGGAAAAAGGCAACCCTTTTGGATGAAGTTAGGAATTTGTTAATGTTGTTAACT
GAGTCATGTTTATTTCTCAACTTTAGATTTTTGAGGCATAGATATGAATTTTTGAAGAACCAGCCTGCAAACATTCAACCAAAGGTTGGTTTCGAGCGGACACGA
AACCTTGAAATCGGACCTCCTATAACGAAGAAAGAAAAGAGTTCTCGGAAAAGAGAAGCTTCTTTGAAACCCCTTGCTCAGGCTCACGACTTAAACCAAAGGGGA
GGAATCTACAATGGGATGGAAGCCCCTTCTCAAAAATCTCAGTTGTTTTTCGACATAAACCAGAAGTCAAGGGTGTGCAGCAAGAAGGAAGTCACTGTACACAAT
TCTTTTCCTATTTTTGAACAAAAGGAGAGAGTATGCAGAGCACATGAAATTGCTGCCAACACGAATATGACCCCGGTTTTCGACCTTAACCAGATCTCGGTACCT
ACCACGACTTCAATTCTTGCATCTATAAGGGAGGAAGAAGAACTGCAAGCTGGTTTTGAACCACTGAAAATGGAGGACGAGCCGAAGAATATGTTTTCAAGAAGC
GAACACGATGCGAAGAACAGTGACTTGGTGTTATCATCAATGTGTAGGAATGATGGTAATGGATCAAACAGAGCAGGAAAAAGGAAGATCTCATGGCAAGATCAA
GTGGCCTTAAGAGCATGA
mRNA sequenceShow/hide mRNA sequence
AAAAAAGAAGAGGCCTTTAAAGCAAAGCAAAGCAAACCACTATCCACAAACATCATCTGAATCACTCCCACAATTCACTGAATATTTGTACTCAACAAAATTTTT
CACTCTCCACCCATCAATGGAATCCCCTTTCTATATCTCATCAATGGCCCAGTTGGACCAAACCAATTTTTATGCCCAACAAATTCCTAAATCAATCAGAGAAAA
ATGGCTAAATTTCTCCTCTCTCTCAACTTGCCTGAATCGCCCTGTGATTTGGGCTTTTTTTTGCCTCCCTTCTTAGTTGTCTTCTAGGTTAAGATTTAACCAAAA
CCCACCCTCCAATTCTTTTCTCCGCCATTGTTTTTCCTTCTTTCTCTTCATTCTCTTTCCTGGGTCTCTTCTTTTTTCTGTCATCGTTTCGATTTTTCTTTTTCA
ATCGGATGAAGAAAGCTCGAAAAGGGGTGGCTATGGATTTACAAGCGTGTGCTCTGTTCGAGAGCTCGATGATTGGGACCAAACATCAAAGTCTCTTGCAGGATT
ACGAGGAGCTGCATAACGAAACAGAAGCTATGAAGGAGAAACTATTGATCGCGAAGCGGAAAAAGGCAACCCTTTTGGATGAAGTTAGGAATTTGTTAATGTTGT
TAACTGAGTCATGTTTATTTCTCAACTTTAGATTTTTGAGGCATAGATATGAATTTTTGAAGAACCAGCCTGCAAACATTCAACCAAAGGTTGGTTTCGAGCGGA
CACGAAACCTTGAAATCGGACCTCCTATAACGAAGAAAGAAAAGAGTTCTCGGAAAAGAGAAGCTTCTTTGAAACCCCTTGCTCAGGCTCACGACTTAAACCAAA
GGGGAGGAATCTACAATGGGATGGAAGCCCCTTCTCAAAAATCTCAGTTGTTTTTCGACATAAACCAGAAGTCAAGGGTGTGCAGCAAGAAGGAAGTCACTGTAC
ACAATTCTTTTCCTATTTTTGAACAAAAGGAGAGAGTATGCAGAGCACATGAAATTGCTGCCAACACGAATATGACCCCGGTTTTCGACCTTAACCAGATCTCGG
TACCTACCACGACTTCAATTCTTGCATCTATAAGGGAGGAAGAAGAACTGCAAGCTGGTTTTGAACCACTGAAAATGGAGGACGAGCCGAAGAATATGTTTTCAA
GAAGCGAACACGATGCGAAGAACAGTGACTTGGTGTTATCATCAATGTGTAGGAATGATGGTAATGGATCAAACAGAGCAGGAAAAAGGAAGATCTCATGGCAAG
ATCAAGTGGCCTTAAGAGCATGAAGTTCTCTATTATTGAAAGGACCCATTCAGGTTCCTTCATTTGCATAGGTGGGAAAACTGAGATTCCTTGAAATTCCATTAT
TATTATTGTTGTTATTATTATTCAACTCGAAGTTTAGAAAATTGTGACCCTACAATATTTGGAAGTTTAGAAAGTCATTACAAACAGATCCTATAAAACTATACT
GATACCGTAGACTCATAAATAGCAGGTAATCATGATTGAAACTTTTGAAGAACTTTTGAAATTACGATTTCTCCGTCTTAAGAACTATCTTTGTGAATTCTTTAG
ATAGAGATGAAATAAAACCATCTTAGGAACTATCTTTGTGAATTCTTTAGATAGGGATGAAATAAAA
Protein sequenceShow/hide protein sequence
MKKARKGVAMDLQACALFESSMIGTKHQSLLQDYEELHNETEAMKEKLLIAKRKKATLLDEVRNLLMLLTESCLFLNFRFLRHRYEFLKNQPANIQPKVGFERTR
NLEIGPPITKKEKSSRKREASLKPLAQAHDLNQRGGIYNGMEAPSQKSQLFFDINQKSRVCSKKEVTVHNSFPIFEQKERVCRAHEIAANTNMTPVFDLNQISVP
TTTSILASIREEEELQAGFEPLKMEDEPKNMFSRSEHDAKNSDLVLSSMCRNDGNGSNRAGKRKISWQDQVALRA