; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc07G02630 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc07G02630
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionProtein of unknown function (DUF1685)
Genome locationClcChr07:2588502..2589665
RNA-Seq ExpressionClc07G02630
SyntenyClc07G02630
Gene Ontology termsNA
InterPro domainsIPR012881 - Protein of unknown function DUF1685


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572385.1 hypothetical protein SDJN03_29113, partial [Cucurbita argyrosperma subsp. sororia]5.3e-8776.72Show/hide
Query:  MAAEEILPLFDLFWFQRAIFAGKPLLQTSSLAPENRFQSPVRQVMKVRSQSEYLLSTKNFPPPETSFYSTGGSMISTNQKLETILSGKVTEFSGNVEWKP
        MAAEEILPLFDLFWFQ A+F GKPLL T   +PENR QSPV QV+KVRSQSEY LS+ NF PPET++Y       S NQKL+ ILSGKVTEFSG  E KP
Subjt:  MAAEEILPLFDLFWFQRAIFAGKPLLQTSSLAPENRFQSPVRQVMKVRSQSEYLLSTKNFPPPETSFYSTGGSMISTNQKLETILSGKVTEFSGNVEWKP

Query:  AKKKLEGGNENKRRKTRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLGQKTGENEEETGIENGISRPYLSEAWESVEEENEKKI
        AKKK E G+E +RR+ RG+GLSKSLSDLEFEELKGFMDLGFVFSEEDK +S+LASIIPGLQRLG+KT   EEE GIE+G+SRPYLSEAW++VEEE EK+ 
Subjt:  AKKKLEGGNENKRRKTRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLGQKTGENEEETGIENGISRPYLSEAWESVEEENEKKI

Query:  LMKWRVPALGATEMDMKDHLKFWAHTVASTVR
        LMKWRVP LGATEMDMKDHLKFWAHTVASTVR
Subjt:  LMKWRVPALGATEMDMKDHLKFWAHTVASTVR

XP_022952129.1 uncharacterized protein LOC111454895 [Cucurbita moschata]4.0e-8776.72Show/hide
Query:  MAAEEILPLFDLFWFQRAIFAGKPLLQTSSLAPENRFQSPVRQVMKVRSQSEYLLSTKNFPPPETSFYSTGGSMISTNQKLETILSGKVTEFSGNVEWKP
        MAAEEILPLFDLFWFQ A+F GKPLL T   +PENR QSPV QV+KVRSQSEY LS+ NF PPET++Y       S NQKL+ ILSGKVTEFSG  E KP
Subjt:  MAAEEILPLFDLFWFQRAIFAGKPLLQTSSLAPENRFQSPVRQVMKVRSQSEYLLSTKNFPPPETSFYSTGGSMISTNQKLETILSGKVTEFSGNVEWKP

Query:  AKKKLEGGNENKRRKTRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLGQKTGENEEETGIENGISRPYLSEAWESVEEENEKKI
        AKKK E G+E +RR+ RG+GLSKSLSDLEFEELKGFMDLGFVFSEEDK +S+LASIIPGLQRLG+KT   EEE GIE+G+SRPYLSEAW++VEEE EK+ 
Subjt:  AKKKLEGGNENKRRKTRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLGQKTGENEEETGIENGISRPYLSEAWESVEEENEKKI

Query:  LMKWRVPALGATEMDMKDHLKFWAHTVASTVR
        LMKWRVP LGATEMDMKDHLKFWAHTVASTVR
Subjt:  LMKWRVPALGATEMDMKDHLKFWAHTVASTVR

XP_022969253.1 uncharacterized protein LOC111468311 [Cucurbita maxima]1.2e-8675.86Show/hide
Query:  MAAEEILPLFDLFWFQRAIFAGKPLLQTSSLAPENRFQSPVRQVMKVRSQSEYLLSTKNFPPPETSFYSTGGSMISTNQKLETILSGKVTEFSGNVEWKP
        MAAEEIL LFDLFWFQ A+F G PLL T   +PENR QSPV QV+KVRSQSEY LS+ NF PPET++Y       STNQKL+ ILSGKVTEFSG    KP
Subjt:  MAAEEILPLFDLFWFQRAIFAGKPLLQTSSLAPENRFQSPVRQVMKVRSQSEYLLSTKNFPPPETSFYSTGGSMISTNQKLETILSGKVTEFSGNVEWKP

Query:  AKKKLEGGNENKRRKTRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLGQKTGENEEETGIENGISRPYLSEAWESVEEENEKKI
        AKKK E G+E +RR+ RG+GLSKSLSDLEFEELKGFMDLGFVFSEEDK +S+LASIIPGLQRLG++T E+EEE GIE+G+SRPYLSEAW++VEEE EK+ 
Subjt:  AKKKLEGGNENKRRKTRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLGQKTGENEEETGIENGISRPYLSEAWESVEEENEKKI

Query:  LMKWRVPALGATEMDMKDHLKFWAHTVASTVR
        LMKWRVP LGATEMDMKDHLKFWAHTVASTVR
Subjt:  LMKWRVPALGATEMDMKDHLKFWAHTVASTVR

XP_023554329.1 uncharacterized protein LOC111811624 [Cucurbita pepo subsp. pepo]2.0e-8676.29Show/hide
Query:  MAAEEILPLFDLFWFQRAIFAGKPLLQTSSLAPENRFQSPVRQVMKVRSQSEYLLSTKNFPPPETSFYSTGGSMISTNQKLETILSGKVTEFSGNVEWKP
        MAAEEILPLFDLFWFQ A+F GKPLL T   +PENR QSPV QV+KVRSQSEY LS+ NF PPET +Y       S NQKL+ ILSGKVTEF+G  E KP
Subjt:  MAAEEILPLFDLFWFQRAIFAGKPLLQTSSLAPENRFQSPVRQVMKVRSQSEYLLSTKNFPPPETSFYSTGGSMISTNQKLETILSGKVTEFSGNVEWKP

Query:  AKKKLEGGNENKRRKTRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLGQKTGENEEETGIENGISRPYLSEAWESVEEENEKKI
        AKKK E G+E +RR+ RG+GLSKSLSDLEFEELKGFMDLGFVFSEEDK +S+LASIIPGLQRLG+KT   EEE GIE+G+SRPYLSEAW++VEEE EK+ 
Subjt:  AKKKLEGGNENKRRKTRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLGQKTGENEEETGIENGISRPYLSEAWESVEEENEKKI

Query:  LMKWRVPALGATEMDMKDHLKFWAHTVASTVR
        LMKWRVP LGATEMDMKDHLKFWAHTVASTVR
Subjt:  LMKWRVPALGATEMDMKDHLKFWAHTVASTVR

XP_038887878.1 uncharacterized protein LOC120077867 [Benincasa hispida]1.3e-10487.29Show/hide
Query:  MAAEEILPLFDLFWFQRAIFAGKPLLQTSSLAPENRFQSPVRQVMKVRSQSEYLLSTKNFPPPETSFYSTGGSMISTNQKLETILSGKVTEFSGNVEWK-
        MAAEEILPLFDLFWFQRAIF GKPLLQT S APE RFQSPV QVMK RSQSEYLLS+K+FPPPET+ YST GSMIST+QKL+TILSGKV EF+GN E K 
Subjt:  MAAEEILPLFDLFWFQRAIFAGKPLLQTSSLAPENRFQSPVRQVMKVRSQSEYLLSTKNFPPPETSFYSTGGSMISTNQKLETILSGKVTEFSGNVEWK-

Query:  ---PAKKKLEGGNENKRRKTRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLGQKTGENEEETGIENGISRPYLSEAWESVEEEN
           PAKKKLE GNENKRRK RGKGLSKSLSDLEFEELKGFMDLGFVF EEDKNDSNLASIIPGLQRLGQKTGENEEE  IENG+SRPYLSEAWE+VEEEN
Subjt:  ---PAKKKLEGGNENKRRKTRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLGQKTGENEEETGIENGISRPYLSEAWESVEEEN

Query:  EKKILMKWRVPALGATEMDMKDHLKFWAHTVASTVR
        EK+ILMKWRVP LGATEMDMKDHLKFWAHTVASTVR
Subjt:  EKKILMKWRVPALGATEMDMKDHLKFWAHTVASTVR

TrEMBL top hitse value%identityAlignment
A0A0A0K3T7 Uncharacterized protein4.0e-8071.19Show/hide
Query:  MAAEEILPLFDLFWFQRAIFAGKPLLQTSSLAPENRFQSPVRQVMKVRSQSEYLLSTKNFPPPETSFYSTGGSMISTNQKLETILSGKVTEFSGNVE---
        MAAEEILPLFDLFWFQRAIF+ K  L+T        FQSPV+QV+K+RSQSEYLL++K+FPPPET+        +++NQKLETILSGKVTEF GN E   
Subjt:  MAAEEILPLFDLFWFQRAIFAGKPLLQTSSLAPENRFQSPVRQVMKVRSQSEYLLSTKNFPPPETSFYSTGGSMISTNQKLETILSGKVTEFSGNVE---

Query:  WKPAKKKLEGGNENKRRKTRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLG-QKTGENEEETGIENGISRPYLSEAWESVEEEN
         K  KKKLEG  +  RRK +GKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNL SIIPGL RLG +KT E   E G+   + RPYLSEAW+++EEEN
Subjt:  WKPAKKKLEGGNENKRRKTRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLG-QKTGENEEETGIENGISRPYLSEAWESVEEEN

Query:  EKKILMKWRVPALGATEMDMKDHLKFWAHTVASTVR
        EK ILMKWRVP+LGATEMD+K HLKFWAHTVASTVR
Subjt:  EKKILMKWRVPALGATEMDMKDHLKFWAHTVASTVR

A0A1S3C1U3 uncharacterized protein LOC1034954953.2e-7468.64Show/hide
Query:  MAAEEILPLFDLFWFQRAIFAGKPLLQTSSLAPENRFQSPVRQVMKVRSQSEYLLSTKNFPPPETSFYSTGGSMISTNQKLETILSGKVTEFSGNVEWKP
        MAAEEILPLFDLFWFQ+AIF  KPLL+T        FQ     VMK+RSQSEYLL++K+FPPP T+        +++NQKLET+LSG+VTEF G+ E K 
Subjt:  MAAEEILPLFDLFWFQRAIFAGKPLLQTSSLAPENRFQSPVRQVMKVRSQSEYLLSTKNFPPPETSFYSTGGSMISTNQKLETILSGKVTEFSGNVEWKP

Query:  AKKKLE--GGNENK-RRKTRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLG-QKTGENEEETGIENGISRPYLSEAWESVEEEN
         KK+++   GNENK RRK + KGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNL SIIPGL RLG Q T E   E G+   + RPYLSEAWE++EEEN
Subjt:  AKKKLE--GGNENK-RRKTRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLG-QKTGENEEETGIENGISRPYLSEAWESVEEEN

Query:  EKKILMKWRVPALGATEMDMKDHLKFWAHTVASTVR
        EK +LMKWRVP+LGATEMD+K HLKFWAHTVASTVR
Subjt:  EKKILMKWRVPALGATEMDMKDHLKFWAHTVASTVR

A0A6J1D2N3 uncharacterized protein LOC1110167787.7e-8473.47Show/hide
Query:  MAAEEILPLFDLFWFQRAIFAGKPLLQTS-----SLAPENRFQSPVRQVMKVRSQSEYLLSTKNFPPPETSFYSTGGSMISTNQKLETILSGKVTEFSGN
        MA+EEIL LFD FWFQ  +FAGKPLL+T      S APEN  +SP+ QV++ RSQSEYLL + +FP PET++YSTG   I TN+KL+TILSG+VTEFSG 
Subjt:  MAAEEILPLFDLFWFQRAIFAGKPLLQTS-----SLAPENRFQSPVRQVMKVRSQSEYLLSTKNFPPPETSFYSTGGSMISTNQKLETILSGKVTEFSGN

Query:  VEWKPAKKKLEGGNENKRRKTRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLGQKTGENE-------EETGIENGISRPYLSEA
           KPAKKKL GGNE K RK RG+GLSKSLSDLEFEELKGFMDLGFVFSEEDKN S+LASIIPGLQRLG+KTGENE       EE G E G+SRPYLSEA
Subjt:  VEWKPAKKKLEGGNENKRRKTRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLGQKTGENE-------EETGIENGISRPYLSEA

Query:  WESVEEENEKKILMKWRVPALG-ATEMDMKDHLKFWAHTVASTVR
        WE+ +EENEK+ILMKWRVP LG ATEMDMKDHLKFWAHTVASTVR
Subjt:  WESVEEENEKKILMKWRVPALG-ATEMDMKDHLKFWAHTVASTVR

A0A6J1GKQ8 uncharacterized protein LOC1114548952.0e-8776.72Show/hide
Query:  MAAEEILPLFDLFWFQRAIFAGKPLLQTSSLAPENRFQSPVRQVMKVRSQSEYLLSTKNFPPPETSFYSTGGSMISTNQKLETILSGKVTEFSGNVEWKP
        MAAEEILPLFDLFWFQ A+F GKPLL T   +PENR QSPV QV+KVRSQSEY LS+ NF PPET++Y       S NQKL+ ILSGKVTEFSG  E KP
Subjt:  MAAEEILPLFDLFWFQRAIFAGKPLLQTSSLAPENRFQSPVRQVMKVRSQSEYLLSTKNFPPPETSFYSTGGSMISTNQKLETILSGKVTEFSGNVEWKP

Query:  AKKKLEGGNENKRRKTRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLGQKTGENEEETGIENGISRPYLSEAWESVEEENEKKI
        AKKK E G+E +RR+ RG+GLSKSLSDLEFEELKGFMDLGFVFSEEDK +S+LASIIPGLQRLG+KT   EEE GIE+G+SRPYLSEAW++VEEE EK+ 
Subjt:  AKKKLEGGNENKRRKTRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLGQKTGENEEETGIENGISRPYLSEAWESVEEENEKKI

Query:  LMKWRVPALGATEMDMKDHLKFWAHTVASTVR
        LMKWRVP LGATEMDMKDHLKFWAHTVASTVR
Subjt:  LMKWRVPALGATEMDMKDHLKFWAHTVASTVR

A0A6J1I0F9 uncharacterized protein LOC1114683115.7e-8775.86Show/hide
Query:  MAAEEILPLFDLFWFQRAIFAGKPLLQTSSLAPENRFQSPVRQVMKVRSQSEYLLSTKNFPPPETSFYSTGGSMISTNQKLETILSGKVTEFSGNVEWKP
        MAAEEIL LFDLFWFQ A+F G PLL T   +PENR QSPV QV+KVRSQSEY LS+ NF PPET++Y       STNQKL+ ILSGKVTEFSG    KP
Subjt:  MAAEEILPLFDLFWFQRAIFAGKPLLQTSSLAPENRFQSPVRQVMKVRSQSEYLLSTKNFPPPETSFYSTGGSMISTNQKLETILSGKVTEFSGNVEWKP

Query:  AKKKLEGGNENKRRKTRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLGQKTGENEEETGIENGISRPYLSEAWESVEEENEKKI
        AKKK E G+E +RR+ RG+GLSKSLSDLEFEELKGFMDLGFVFSEEDK +S+LASIIPGLQRLG++T E+EEE GIE+G+SRPYLSEAW++VEEE EK+ 
Subjt:  AKKKLEGGNENKRRKTRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLGQKTGENEEETGIENGISRPYLSEAWESVEEENEKKI

Query:  LMKWRVPALGATEMDMKDHLKFWAHTVASTVR
        LMKWRVP LGATEMDMKDHLKFWAHTVASTVR
Subjt:  LMKWRVPALGATEMDMKDHLKFWAHTVASTVR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05870.1 Protein of unknown function (DUF1685)1.3e-0629.41Show/hide
Query:  SKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQR--------LGQKTGENEEETGIENGISRPYLSEAWESVEEENEKKILMKWRVPALGATE
        SKSL+D + E+L+G +DLGF FS ++  +  L + +P L+         L  K  ++ E + +E+  S P ++              +  W++ + G   
Subjt:  SKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQR--------LGQKTGENEEETGIENGISRPYLSEAWESVEEENEKKILMKWRVPALGATE

Query:  MDMKDHLKFWAHTVASTVR
         D+K  LK+WA  VA TV+
Subjt:  MDMKDHLKFWAHTVASTVR

AT1G05870.2 Protein of unknown function (DUF1685)1.3e-0629.41Show/hide
Query:  SKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQR--------LGQKTGENEEETGIENGISRPYLSEAWESVEEENEKKILMKWRVPALGATE
        SKSL+D + E+L+G +DLGF FS ++  +  L + +P L+         L  K  ++ E + +E+  S P ++              +  W++ + G   
Subjt:  SKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQR--------LGQKTGENEEETGIENGISRPYLSEAWESVEEENEKKILMKWRVPALGATE

Query:  MDMKDHLKFWAHTVASTVR
         D+K  LK+WA  VA TV+
Subjt:  MDMKDHLKFWAHTVASTVR

AT1G05870.3 Protein of unknown function (DUF1685)1.3e-0629.41Show/hide
Query:  SKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQR--------LGQKTGENEEETGIENGISRPYLSEAWESVEEENEKKILMKWRVPALGATE
        SKSL+D + E+L+G +DLGF FS ++  +  L + +P L+         L  K  ++ E + +E+  S P ++              +  W++ + G   
Subjt:  SKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQR--------LGQKTGENEEETGIENGISRPYLSEAWESVEEENEKKILMKWRVPALGATE

Query:  MDMKDHLKFWAHTVASTVR
         D+K  LK+WA  VA TV+
Subjt:  MDMKDHLKFWAHTVASTVR

AT1G05870.4 Protein of unknown function (DUF1685)1.3e-0629.41Show/hide
Query:  SKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQR--------LGQKTGENEEETGIENGISRPYLSEAWESVEEENEKKILMKWRVPALGATE
        SKSL+D + E+L+G +DLGF FS ++  +  L + +P L+         L  K  ++ E + +E+  S P ++              +  W++ + G   
Subjt:  SKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQR--------LGQKTGENEEETGIENGISRPYLSEAWESVEEENEKKILMKWRVPALGATE

Query:  MDMKDHLKFWAHTVASTVR
         D+K  LK+WA  VA TV+
Subjt:  MDMKDHLKFWAHTVASTVR

AT2G42760.1 unknown protein3.2e-3439.19Show/hide
Query:  MAAEEILPLFDLFWFQRAIFAGKPLLQTSSLAPENRFQSPVRQVMKVRSQSEYLLSTKNFP-----------------PPETSFYSTGGSMI--------
        MA EE+L LF+  W +R IF         +L  ++R +   +++++ R + E L   KNFP                   +TS +S+    +        
Subjt:  MAAEEILPLFDLFWFQRAIFAGKPLLQTSSLAPENRFQSPVRQVMKVRSQSEYLLSTKNFP-----------------PPETSFYSTGGSMI--------

Query:  ---STNQKLETILSGK-VTEFSGNVEWKPAKKKLEGGNENKRRKTRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLGQK----T
            T  KL+TILSGK V  F+     +   +K E   + K++        KS+SDLE+EELKGFMDLGFVFSE+D  DS+L SI+PGLQRL +K    T
Subjt:  ---STNQKLETILSGK-VTEFSGNVEWKPAKKKLEGGNENKRRKTRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLGQK----T

Query:  GENEEETGIE----NGISRPYLSEAWESVEEENEKKIL---MKWRVPA-LGATEMDMKDHLKFWAHTVASTVR
         E EEE   +    N  +RPYLSEAW+       KK +   +KWRVPA   A+E+D+KD+L+ WAH VAST+R
Subjt:  GENEEETGIE----NGISRPYLSEAWESVEEENEKKIL---MKWRVPA-LGATEMDMKDHLKFWAHTVASTVR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGCCGAAGAAATCCTCCCTCTCTTTGATCTCTTCTGGTTTCAACGGGCAATTTTCGCCGGAAAACCGCTTTTACAGACCAGTTCCTTGGCGCCAGAAAACCGGTT
TCAGAGTCCTGTAAGGCAAGTAATGAAGGTGAGATCTCAAAGCGAGTATCTTCTTAGCACGAAGAATTTCCCACCTCCGGAAACCTCTTTTTACTCCACCGGCGGCTCGA
TGATTTCCACCAATCAAAAGCTTGAAACCATCCTTTCCGGCAAGGTAACGGAATTTTCCGGCAACGTAGAGTGGAAACCGGCGAAGAAGAAATTGGAAGGAGGAAATGAA
AATAAAAGAAGAAAGACTAGGGGGAAAGGTTTGAGTAAGAGCTTATCAGACCTTGAATTTGAAGAGTTGAAAGGATTTATGGATTTGGGATTTGTGTTCAGTGAGGAAGA
TAAGAATGATTCAAATTTGGCTTCAATAATCCCAGGGTTACAGAGATTAGGGCAAAAAACAGGGGAAAATGAAGAGGAAACAGGAATTGAAAATGGGATTTCAAGGCCAT
ATTTGTCTGAAGCTTGGGAATCTGTTGAAGAAGAAAATGAGAAAAAGATTTTGATGAAATGGAGAGTTCCAGCTTTGGGAGCAACTGAAATGGATATGAAAGATCATCTC
AAGTTCTGGGCTCATACAGTGGCTTCAACTGTGAGATAA
mRNA sequenceShow/hide mRNA sequence
TTTAATTATCTTTTCTAGTTTCTTCCTTATCTATATAACTGTTTAAACCCTCAATTTTCTCCTCAAAAAACCCTCTCAAAATCTTCCCATTCCCACTAGCAATGGCAGCC
GAAGAAATCCTCCCTCTCTTTGATCTCTTCTGGTTTCAACGGGCAATTTTCGCCGGAAAACCGCTTTTACAGACCAGTTCCTTGGCGCCAGAAAACCGGTTTCAGAGTCC
TGTAAGGCAAGTAATGAAGGTGAGATCTCAAAGCGAGTATCTTCTTAGCACGAAGAATTTCCCACCTCCGGAAACCTCTTTTTACTCCACCGGCGGCTCGATGATTTCCA
CCAATCAAAAGCTTGAAACCATCCTTTCCGGCAAGGTAACGGAATTTTCCGGCAACGTAGAGTGGAAACCGGCGAAGAAGAAATTGGAAGGAGGAAATGAAAATAAAAGA
AGAAAGACTAGGGGGAAAGGTTTGAGTAAGAGCTTATCAGACCTTGAATTTGAAGAGTTGAAAGGATTTATGGATTTGGGATTTGTGTTCAGTGAGGAAGATAAGAATGA
TTCAAATTTGGCTTCAATAATCCCAGGGTTACAGAGATTAGGGCAAAAAACAGGGGAAAATGAAGAGGAAACAGGAATTGAAAATGGGATTTCAAGGCCATATTTGTCTG
AAGCTTGGGAATCTGTTGAAGAAGAAAATGAGAAAAAGATTTTGATGAAATGGAGAGTTCCAGCTTTGGGAGCAACTGAAATGGATATGAAAGATCATCTCAAGTTCTGG
GCTCATACAGTGGCTTCAACTGTGAGATAACACCAATTTTTTACATTTTATGGTAACTTTTCTTTTTTTTTCTTTTTTCTTTTTCTTTTTTTTTTTCCCATTTTGGAGTG
TAATTTTTGTGGGTTTTTATTGTTGTGTTTGGAAATTTATAGCTTGTTATGTAGATACATTGATAGGTTAAGAAATATCTCCCAATCCCTATACTTGATGTCTATTGACA
GAAGAAAAATAATGGTTAAATTATATAAAACGCTCGTGGGTGTTTCGACTTTGGAGTATGTTTCAATTATGTTCAAAATCTTCCGTTAAAATAAAAGATGGAAATTTACT
TTACAAAATTTTCGATTTTCATACATTAGTTGCAGCCACATCAATTTACTTGTGGTAAATGAAG
Protein sequenceShow/hide protein sequence
MAAEEILPLFDLFWFQRAIFAGKPLLQTSSLAPENRFQSPVRQVMKVRSQSEYLLSTKNFPPPETSFYSTGGSMISTNQKLETILSGKVTEFSGNVEWKPAKKKLEGGNE
NKRRKTRGKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLASIIPGLQRLGQKTGENEEETGIENGISRPYLSEAWESVEEENEKKILMKWRVPALGATEMDMKDHL
KFWAHTVASTVR