CuGenDBv2

Clc05G08160 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc05G08160
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Adenine nucleotide alpha hydrolases-like superfamily protein
Genome location: ClcChr05:6215791..6218421
RNA-Seq Expression: Clc05G08160
Synteny: Clc05G08160
Gene Ontology terms: NA
InterPro domains:
  IPR006015 - Universal stress protein A family
  IPR006016 - UspA
  IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold


Homology

GenBank top hits (e value, % identity, alignment)
XP_004134244.1 universal stress protein PHOS32 [Cucumis sativus] (e value: 7.1e-71; % identity: 74.35)
Query:  MDGERRVGVAVDFSPCSKKALKWAIDNVIRKGDYLVLITVRPDGDYEDGEMQLWQTTGSRMISLPIIGDIYRICMHFKLVGVDFVFALIPLAEFSDPHTM
        MDGERRVGVAVDFS CS KALKWAIDNVIRKGD+LVLI VRP+GDYEDGEMQLWQTTGS                            LIPL EFSDP+TM
Subjt:  MDGERRVGVAVDFSPCSKKALKWAIDNVIRKGDYLVLITVRPDGDYEDGEMQLWQTTGSRMISLPIIGDIYRICMHFKLVGVDFVFALIPLAEFSDPHTM

Query:  KKYGIKPDAETLDI---------ITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKRAILGSVSNYVVNNGSCPVTVVKRADNEN
        +KYGIKPDAETLDI         I VLLKIYWGDAREKICEAID+IPITCLIIGNRGLGKLKRAILGSVSNYVVNNGSCPVTVVK+AD+EN
Subjt:  KKYGIKPDAETLDI---------ITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKRAILGSVSNYVVNNGSCPVTVVKRADNEN

XP_008438946.1 PREDICTED: uncharacterized protein C167.05 [Cucumis melo] (e value: 6.4e-72; % identity: 74.87)
Query:  MDGERRVGVAVDFSPCSKKALKWAIDNVIRKGDYLVLITVRPDGDYEDGEMQLWQTTGSRMISLPIIGDIYRICMHFKLVGVDFVFALIPLAEFSDPHTM
        MDG+RRVGVAVDFS CS KALKWAIDNV+RKGDYLVLI VRP+GDYEDGEMQLWQTTGS                            LIPL EFSDP+TM
Subjt:  MDGERRVGVAVDFSPCSKKALKWAIDNVIRKGDYLVLITVRPDGDYEDGEMQLWQTTGSRMISLPIIGDIYRICMHFKLVGVDFVFALIPLAEFSDPHTM

Query:  KKYGIKPDAETLDI---------ITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKRAILGSVSNYVVNNGSCPVTVVKRADNEN
        +KYGIKPDAETLDI         ITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKRAILGSVSNYVVNNGSCPVTVVK+AD+EN
Subjt:  KKYGIKPDAETLDI---------ITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKRAILGSVSNYVVNNGSCPVTVVKRADNEN

XP_022938345.1 universal stress protein PHOS32-like isoform X1 [Cucurbita moschata] (e value: 2.4e-71; % identity: 71.73)
Query:  MDGERRVGVAVDFSPCSKKALKWAIDNVIRKGDYLVLITVRPDGDYEDGEMQLWQTTGSRMISLPIIGDIYRICMHFKLVGVDFVFALIPLAEFSDPHTM
        MDGERRVGVAVDFS CS+KALKWAIDN+IRKGD+LVLITVRPDGDYEDGEMQLW+TTGS                            LIP+ EF+DPHT+
Subjt:  MDGERRVGVAVDFSPCSKKALKWAIDNVIRKGDYLVLITVRPDGDYEDGEMQLWQTTGSRMISLPIIGDIYRICMHFKLVGVDFVFALIPLAEFSDPHTM

Query:  KKYGIKPDAETLDI---------ITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKRAILGSVSNYVVNNGSCPVTVVKRADNEN
        KKYGIKPDAET+DI         ITVLLKIYWGDAREKICEAID+IPI+CL+IGNRGLGK+KRAILGSVSNYVVNNG+CPVTVVK+ DNEN
Subjt:  KKYGIKPDAETLDI---------ITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKRAILGSVSNYVVNNGSCPVTVVKRADNEN

XP_022972234.1 universal stress protein PHOS32 [Cucurbita maxima] (e value: 3.2e-71; % identity: 71.2)
Query:  MDGERRVGVAVDFSPCSKKALKWAIDNVIRKGDYLVLITVRPDGDYEDGEMQLWQTTGSRMISLPIIGDIYRICMHFKLVGVDFVFALIPLAEFSDPHTM
        MDGERRVGVAVDFS CS+KALKWAIDN+IRKGD+L+LITVRPDGDYEDGEMQLW+TTGS                            LIP+ EF+DPHT+
Subjt:  MDGERRVGVAVDFSPCSKKALKWAIDNVIRKGDYLVLITVRPDGDYEDGEMQLWQTTGSRMISLPIIGDIYRICMHFKLVGVDFVFALIPLAEFSDPHTM

Query:  KKYGIKPDAETLDI---------ITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKRAILGSVSNYVVNNGSCPVTVVKRADNEN
        KKYGIKPDAET+DI         ITVLLKIYWGDAREKICEAID+IPI+CL+IGNRGLGK+KRAILGSVSNYVVNNG+CPVTVVK+ DNEN
Subjt:  KKYGIKPDAETLDI---------ITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKRAILGSVSNYVVNNGSCPVTVVKRADNEN

XP_038889373.1 universal stress protein PHOS32 [Benincasa hispida] (e value: 2.1e-75; % identity: 76.96)
Query:  MDGERRVGVAVDFSPCSKKALKWAIDNVIRKGDYLVLITVRPDGDYEDGEMQLWQTTGSRMISLPIIGDIYRICMHFKLVGVDFVFALIPLAEFSDPHTM
        MDGERRVGVAVDFSPCS+KALKWAIDNVIRKGDYLVLITVRP+GDYEDGEMQLWQTTGS                            LIPL EFSDPHTM
Subjt:  MDGERRVGVAVDFSPCSKKALKWAIDNVIRKGDYLVLITVRPDGDYEDGEMQLWQTTGSRMISLPIIGDIYRICMHFKLVGVDFVFALIPLAEFSDPHTM

Query:  KKYGIKPDAETLDI---------ITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKRAILGSVSNYVVNNGSCPVTVVKRADNEN
        KKYG+KPDAET+DI         ITVLLKIYWGD REKICEAIDNIPITCLIIGNRGLGKLKRAILGSVSNYVVNNGSCPVTVVK+ADNEN
Subjt:  KKYGIKPDAETLDI---------ITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKRAILGSVSNYVVNNGSCPVTVVKRADNEN

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L5W7 Universal stress protein (e value: 3.4e-71; % identity: 74.35)
Query:  MDGERRVGVAVDFSPCSKKALKWAIDNVIRKGDYLVLITVRPDGDYEDGEMQLWQTTGSRMISLPIIGDIYRICMHFKLVGVDFVFALIPLAEFSDPHTM
        MDGERRVGVAVDFS CS KALKWAIDNVIRKGD+LVLI VRP+GDYEDGEMQLWQTTGS                            LIPL EFSDP+TM
Subjt:  MDGERRVGVAVDFSPCSKKALKWAIDNVIRKGDYLVLITVRPDGDYEDGEMQLWQTTGSRMISLPIIGDIYRICMHFKLVGVDFVFALIPLAEFSDPHTM

Query:  KKYGIKPDAETLDI---------ITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKRAILGSVSNYVVNNGSCPVTVVKRADNEN
        +KYGIKPDAETLDI         I VLLKIYWGDAREKICEAID+IPITCLIIGNRGLGKLKRAILGSVSNYVVNNGSCPVTVVK+AD+EN
Subjt:  KKYGIKPDAETLDI---------ITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKRAILGSVSNYVVNNGSCPVTVVKRADNEN

A0A1S3AY78 uncharacterized protein C167.05 (e value: 3.1e-72; % identity: 74.87)
Query:  MDGERRVGVAVDFSPCSKKALKWAIDNVIRKGDYLVLITVRPDGDYEDGEMQLWQTTGSRMISLPIIGDIYRICMHFKLVGVDFVFALIPLAEFSDPHTM
        MDG+RRVGVAVDFS CS KALKWAIDNV+RKGDYLVLI VRP+GDYEDGEMQLWQTTGS                            LIPL EFSDP+TM
Subjt:  MDGERRVGVAVDFSPCSKKALKWAIDNVIRKGDYLVLITVRPDGDYEDGEMQLWQTTGSRMISLPIIGDIYRICMHFKLVGVDFVFALIPLAEFSDPHTM

Query:  KKYGIKPDAETLDI---------ITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKRAILGSVSNYVVNNGSCPVTVVKRADNEN
        +KYGIKPDAETLDI         ITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKRAILGSVSNYVVNNGSCPVTVVK+AD+EN
Subjt:  KKYGIKPDAETLDI---------ITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKRAILGSVSNYVVNNGSCPVTVVKRADNEN

A0A5A7U1A3 Usp domain-containing protein (e value: 3.1e-72; % identity: 74.87)
Query:  MDGERRVGVAVDFSPCSKKALKWAIDNVIRKGDYLVLITVRPDGDYEDGEMQLWQTTGSRMISLPIIGDIYRICMHFKLVGVDFVFALIPLAEFSDPHTM
        MDG+RRVGVAVDFS CS KALKWAIDNV+RKGDYLVLI VRP+GDYEDGEMQLWQTTGS                            LIPL EFSDP+TM
Subjt:  MDGERRVGVAVDFSPCSKKALKWAIDNVIRKGDYLVLITVRPDGDYEDGEMQLWQTTGSRMISLPIIGDIYRICMHFKLVGVDFVFALIPLAEFSDPHTM

Query:  KKYGIKPDAETLDI---------ITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKRAILGSVSNYVVNNGSCPVTVVKRADNEN
        +KYGIKPDAETLDI         ITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKRAILGSVSNYVVNNGSCPVTVVK+AD+EN
Subjt:  KKYGIKPDAETLDI---------ITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKRAILGSVSNYVVNNGSCPVTVVKRADNEN

A0A6J1FJI4 universal stress protein PHOS32-like isoform X1 (e value: 1.2e-71; % identity: 71.73)
Query:  MDGERRVGVAVDFSPCSKKALKWAIDNVIRKGDYLVLITVRPDGDYEDGEMQLWQTTGSRMISLPIIGDIYRICMHFKLVGVDFVFALIPLAEFSDPHTM
        MDGERRVGVAVDFS CS+KALKWAIDN+IRKGD+LVLITVRPDGDYEDGEMQLW+TTGS                            LIP+ EF+DPHT+
Subjt:  MDGERRVGVAVDFSPCSKKALKWAIDNVIRKGDYLVLITVRPDGDYEDGEMQLWQTTGSRMISLPIIGDIYRICMHFKLVGVDFVFALIPLAEFSDPHTM

Query:  KKYGIKPDAETLDI---------ITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKRAILGSVSNYVVNNGSCPVTVVKRADNEN
        KKYGIKPDAET+DI         ITVLLKIYWGDAREKICEAID+IPI+CL+IGNRGLGK+KRAILGSVSNYVVNNG+CPVTVVK+ DNEN
Subjt:  KKYGIKPDAETLDI---------ITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKRAILGSVSNYVVNNGSCPVTVVKRADNEN

A0A6J1IAX0 universal stress protein PHOS32 (e value: 1.5e-71; % identity: 71.2)
Query:  MDGERRVGVAVDFSPCSKKALKWAIDNVIRKGDYLVLITVRPDGDYEDGEMQLWQTTGSRMISLPIIGDIYRICMHFKLVGVDFVFALIPLAEFSDPHTM
        MDGERRVGVAVDFS CS+KALKWAIDN+IRKGD+L+LITVRPDGDYEDGEMQLW+TTGS                            LIP+ EF+DPHT+
Subjt:  MDGERRVGVAVDFSPCSKKALKWAIDNVIRKGDYLVLITVRPDGDYEDGEMQLWQTTGSRMISLPIIGDIYRICMHFKLVGVDFVFALIPLAEFSDPHTM

Query:  KKYGIKPDAETLDI---------ITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKRAILGSVSNYVVNNGSCPVTVVKRADNEN
        KKYGIKPDAET+DI         ITVLLKIYWGDAREKICEAID+IPI+CL+IGNRGLGK+KRAILGSVSNYVVNNG+CPVTVVK+ DNEN
Subjt:  KKYGIKPDAETLDI---------ITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKRAILGSVSNYVVNNGSCPVTVVKRADNEN

SwissProt top hits (e value, % identity, alignment)
P87132 Uncharacterized protein C167.05 (e value: 1.4e-05; % identity: 36.59)
Query:  KYGIKPDAET-LDIITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKRAILGSVSNYVVNNGSCPVTVVKRADNEN
        KY +K  ++T L++   +  I+   A+  I E ID I  + +++G+RG   LK  +LGS SNY+VN  S PV V ++   +N
Subjt:  KYGIKPDAET-LDIITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKRAILGSVSNYVVNNGSCPVTVVKRADNEN

Q8L4N1 Universal stress protein PHOS34 (e value: 1.1e-10; % identity: 29.73)
Query:  RRVGVAVDFSPCSKKALKWAIDNVIRKGDYLVLITVRPDG---DYEDGEMQLWQTTGSRMISLPIIGDIYRICMHFKLVGVDF-VFALIPLAEFSDPHTM
        R++GVAVD S  S  A++WA+D+ IR GD +V++ V P       + G + L         + P            K    DF  F    +A+ + P  +
Subjt:  RRVGVAVDFSPCSKKALKWAIDNVIRKGDYLVLITVRPDG---DYEDGEMQLWQTTGSRMISLPIIGDIYRICMHFKLVGVDF-VFALIPLAEFSDPHTM

Query:  KKYGIKPDAETLDIITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKRAI---LGSVSNYVVNNGSCPVTVVKRADNEN
        K+ G             ++K +  D RE++C   + + ++ +I+G+RG G  KR     LGSVS+Y V++  CPV VV+  D+ +
Subjt:  KKYGIKPDAETLDIITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKRAI---LGSVSNYVVNNGSCPVTVVKRADNEN

Q8LGG8 Universal stress protein A-like protein (e value: 3.7e-06; % identity: 34.48)
Query:  GDAREKICEAIDNIPITCLIIGNRGLGKLKRAILGSVSNYVVNNGSCPVTVVKRADNE
        GD ++ IC+ +  +    L++G+RGLG+ ++  +G+VS + V +  CPV  +KR  +E
Subjt:  GDAREKICEAIDNIPITCLIIGNRGLGKLKRAILGSVSNYVVNNGSCPVTVVKRADNE

Q8VYN9 Universal stress protein PHOS32 (e value: 5.9e-12; % identity: 30.6)
Query:  RRVGVAVDFSPCSKKALKWAIDNVIRKGDYLVLITVRPDGDYEDGEMQLWQTTGSRMISLPIIGDIYRICMHFKLVGVDF-VFALIPLAEFSDPHTMKKY
        R++GVAVD S  S  A++WA+D+ IR GD +VL+ V P               G+    LP+   I       +    DF  F    +A+ + P  +K+ 
Subjt:  RRVGVAVDFSPCSKKALKWAIDNVIRKGDYLVLITVRPDGDYEDGEMQLWQTTGSRMISLPIIGDIYRICMHFKLVGVDF-VFALIPLAEFSDPHTMKKY

Query:  GIKPDAETLDIITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKR----AILGSVSNYVVNNGSCPVTVVKRADNEN
        G             ++K +  D RE++C  I+ + ++ +I+G+RG G  K+      LGSVS+Y V++  CPV VV+  D+ +
Subjt:  GIKPDAETLDIITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKR----AILGSVSNYVVNNGSCPVTVVKRADNEN

Arabidopsis top hits (e value, % identity, alignment)
AT1G11360.1 Adenine nucleotide alpha hydrolases-like superfamily protein (e value: 1.7e-14; % identity: 29.84)
Query:  ERRVGVAVDFSPCSKKALKWAIDNVIRKGDYLVLITVRPDG---DYEDGEMQL---WQTTGSRMISLPIIGDIYRICMHFKLVGVDFVFALIPLAEFSDP
        +R++G+AVD S  S  A++WA+ N +R GD +VL+ V+P       + G M L   W        S   + D + I  + K   V       PL E   P
Subjt:  ERRVGVAVDFSPCSKKALKWAIDNVIRKGDYLVLITVRPDG---DYEDGEMQL---WQTTGSRMISLPIIGDIYRICMHFKLVGVDFVFALIPLAEFSDP

Query:  ---HTMKKYGIKPDAETLDIITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKRAI---LGSVSNYVVNNGSCPVTVVKRADNEN
           H +K +                     D +E++C  ++ + ++ LI+G+RG G  KR+    LGSVS+Y V++ +CPV VV+  D+++
Subjt:  ---HTMKKYGIKPDAETLDIITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKRAI---LGSVSNYVVNNGSCPVTVVKRADNEN

AT3G03270.1 Adenine nucleotide alpha hydrolases-like superfamily protein (e value: 3.7e-25; % identity: 35.58)
Query:  MDGERRVGVAVDFSPCSKKALKWAIDNVIRKGDYLVLITVRPDGDYEDGEMQLWQTTGSRMISLPIIGDIYRICMHFKLVGVDFVFALIPLAEFSDPHTM
        M   R VGV +D+SP SK AL+WA +N++  GD ++LI V+P  + +     L++ TGS                            LIPL EF + +  
Subjt:  MDGERRVGVAVDFSPCSKKALKWAIDNVIRKGDYLVLITVRPDGDYEDGEMQLWQTTGSRMISLPIIGDIYRICMHFKLVGVDFVFALIPLAEFSDPHTM

Query:  KKYGIKPDAETLDI---------ITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKR
        K+YG+  D E LD+         + V+ K+YWGD REK+C+A++N+ +  +++G+RGLG LKR
Subjt:  KKYGIKPDAETLDI---------ITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKR

AT3G03270.2 Adenine nucleotide alpha hydrolases-like superfamily protein (e value: 1.1e-34; % identity: 40)
Query:  MDGERRVGVAVDFSPCSKKALKWAIDNVIRKGDYLVLITVRPDGDYEDGEMQLWQTTGSRMISLPIIGDIYRICMHFKLVGVDFVFALIPLAEFSDPHTM
        M   R VGV +D+SP SK AL+WA +N++  GD ++LI V+P  + +     L++ TGS                            LIPL EF + +  
Subjt:  MDGERRVGVAVDFSPCSKKALKWAIDNVIRKGDYLVLITVRPDGDYEDGEMQLWQTTGSRMISLPIIGDIYRICMHFKLVGVDFVFALIPLAEFSDPHTM

Query:  KKYGIKPDAETLDI---------ITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKRAILGSVSNYVVNNGSCPVTVVK
        K+YG+  D E LD+         + V+ K+YWGD REK+C+A++N+ +  +++G+RGLG LKR +LGSVSN+VV N +CPVTVVK
Subjt:  KKYGIKPDAETLDI---------ITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKRAILGSVSNYVVNNGSCPVTVVK

AT3G17020.1 Adenine nucleotide alpha hydrolases-like superfamily protein (e value: 4.9e-54; % identity: 56.83)
Query:  GERRVGVAVDFSPCSKKALKWAIDNVIRKGDYLVLITVRPDGDYEDGEMQLWQTTGSRMISLPIIGDIYRICMHFKLVGVDFVFALIPLAEFSDPHTMKK
        G RR+GVAVDFS CSKKAL WAIDNV+R GD+L+LIT+  D +YE+GEMQLW+T GS                             IP++EFSD   MKK
Subjt:  GERRVGVAVDFSPCSKKALKWAIDNVIRKGDYLVLITVRPDGDYEDGEMQLWQTTGSRMISLPIIGDIYRICMHFKLVGVDFVFALIPLAEFSDPHTMKK

Query:  YGIKPDAETLDI---------ITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKRAILGSVSNYVVNNGSCPVTVVK
        Y +KPDAETLDI         ITV++KIYWGD REKIC A + IP++ L++GNRGLG LKR I+GSVSN+VVNN +CPVTVVK
Subjt:  YGIKPDAETLDI---------ITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKRAILGSVSNYVVNNGSCPVTVVK

AT3G53990.1 Adenine nucleotide alpha hydrolases-like superfamily protein (e value: 3.7e-33; % identity: 40)
Query:  MDGERRVGVAVDFSPCSKKALKWAIDNVIRKGDYLVLITVRPDGDYEDGEMQLWQTTGSRMISLPIIGDIYRICMHFKLVGVDFVFALIPLAEFSDPHTM
        M  +R +G+A+DFS  SK ALKWAI+N+  KGD + +I                        +LP+ GD  R  + FK         LIPLAEF +P  M
Subjt:  MDGERRVGVAVDFSPCSKKALKWAIDNVIRKGDYLVLITVRPDGDYEDGEMQLWQTTGSRMISLPIIGDIYRICMHFKLVGVDFVFALIPLAEFSDPHTM

Query:  KKYGIKPDAETLDI---------ITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKRAILGSVSNYVVNNGSCPVTVVKRADNE
        +KYG+K D   LD+         + V+ K+YWGDAREK+ +A+ ++ +  +++G+RGL  L+R I+GSVS++V+ +  CPVTVVK  DNE
Subjt:  KKYGIKPDAETLDI---------ITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKRAILGSVSNYVVNNGSCPVTVVKRADNE


Sequences

CDS sequence
ATGGACGGTGAACGGAGAGTCGGTGTCGCGGTGGATTTCTCTCCCTGCAGCAAAAAAGCCCTCAAATGGGCTATCGACAATGTTATCCGCAAGGGGGATTACCTAGTTCT
CATCACCGTTCGTCCCGACGGCGATTACGAAGACGGCGAGATGCAGCTCTGGCAAACCACCGGATCTCGTATGATTTCTCTTCCGATCATCGGAGATATATACCGCATCT
GCATGCACTTCAAATTAGTCGGCGTTGATTTCGTTTTCGCGCTTATTCCTCTGGCTGAGTTTTCCGATCCTCACACCATGAAGAAGTACGGGATTAAGCCCGATGCTGAA
ACTCTGGATATTATTACTGTGTTGCTTAAGATTTATTGGGGAGATGCTCGTGAGAAGATTTGTGAAGCAATTGATAACATTCCTATTACTTGTCTCATCATTGGGAACAG
AGGTCTTGGCAAGCTTAAGAGGGCCATATTGGGGAGCGTAAGCAATTATGTGGTGAACAATGGTTCCTGTCCAGTCACTGTGGTAAAGAGAGCAGATAATGAGAACTGA
mRNA sequence
ATGGACGGTGAACGGAGAGTCGGTGTCGCGGTGGATTTCTCTCCCTGCAGCAAAAAAGCCCTCAAATGGGCTATCGACAATGTTATCCGCAAGGGGGATTACCTAGTTCT
CATCACCGTTCGTCCCGACGGCGATTACGAAGACGGCGAGATGCAGCTCTGGCAAACCACCGGATCTCGTATGATTTCTCTTCCGATCATCGGAGATATATACCGCATCT
GCATGCACTTCAAATTAGTCGGCGTTGATTTCGTTTTCGCGCTTATTCCTCTGGCTGAGTTTTCCGATCCTCACACCATGAAGAAGTACGGGATTAAGCCCGATGCTGAA
ACTCTGGATATTATTACTGTGTTGCTTAAGATTTATTGGGGAGATGCTCGTGAGAAGATTTGTGAAGCAATTGATAACATTCCTATTACTTGTCTCATCATTGGGAACAG
AGGTCTTGGCAAGCTTAAGAGGGCCATATTGGGGAGCGTAAGCAATTATGTGGTGAACAATGGTTCCTGTCCAGTCACTGTGGTAAAGAGAGCAGATAATGAGAACTGAT
CATCCCAGCCCACAACACTCTTAAAATCTATATATTCCCATTCAAATTTTCTGATTCTGATTTTGTTTTGTGAAATTGGTAATGTATATGTTGATGGCGTGTCCTTAGTT
GCTGTGTTGTTAAAATGTTTCTCAAATAATGAGAGAGCTTTTATGGTCTGTGATAGGATGATTTCTAGAGGTTGTTTGGGAGTGTTCTTGAAAACAATTTTCTGTTGAAT
AACAATTTAGTAAAAACATC
Protein sequence
MDGERRVGVAVDFSPCSKKALKWAIDNVIRKGDYLVLITVRPDGDYEDGEMQLWQTTGSRMISLPIIGDIYRICMHFKLVGVDFVFALIPLAEFSDPHTMKKYGIKPDAE
TLDIITVLLKIYWGDAREKICEAIDNIPITCLIIGNRGLGKLKRAILGSVSNYVVNNGSCPVTVVKRADNEN
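
The CDS and protein sequences above can be checked against each other by translating the CDS with the standard genetic code. The following is a minimal sketch, not part of the database record: the names `CODON_TABLE` and `translate` are illustrative, and it assumes the CDS is in frame from its first base and uses the standard (NCBI table 1) code. Only the first 30 bases of the CDS are used here for brevity.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    # Walk the sequence codon by codon; ignore a trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the CDS listed above.
cds_start = "ATGGACGGTGAACGGAGAGTCGGTGTCGCG"
print(translate(cds_start))
```

Pasting the full CDS into `translate` (line breaks are stripped) should give a string that can be compared directly with the protein sequence listed above.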