; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G06260 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G06260
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionAdenine nucleotide alpha hydrolases-like superfamily protein
Genome locationClcChr09:4842920..4848322
RNA-Seq ExpressionClc09G06260
SyntenyClc09G06260
Gene Ontology termsGO:0003723 - RNA binding (molecular function)
InterPro domainsIPR006015 - Universal stress protein A family
IPR006016 - UspA
IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0067996.1 APO protein 3 [Cucumis melo var. makuwa]7.0e-8495.15Show/hide
Query:  MATPEKPVMVIGVDDSEYATYALEWTLDHFFSSTPNPPFKLVVVYAKPFPDVFVGVGGPGMIVGSAGSYQFLNEDLKKKAALIIATARGLCESRSVNDVK
        MAT EK V+VIGVDDSEYATYALEWTLDHFFSSTPNPPFKLVVVYAKPFPDVFVGVGGPGMIVGSAGSYQFLNEDLKKKAALIIATARG+CES+SVNDVK
Subjt:  MATPEKPVMVIGVDDSEYATYALEWTLDHFFSSTPNPPFKLVVVYAKPFPDVFVGVGGPGMIVGSAGSYQFLNEDLKKKAALIIATARGLCESRSVNDVK

Query:  YEVDEGDARYVLCQAVDKHRASMLVVGSHGYGALKRAFLGSVSDYCAHQASCTVLIVKKLKTKEG
        YEVDEGDAR VLCQAVDKH ASMLVVGSHGYGALKRAFLGSVSDYCAHQASCTV+IVKKLKTKEG
Subjt:  YEVDEGDARYVLCQAVDKHRASMLVVGSHGYGALKRAFLGSVSDYCAHQASCTVLIVKKLKTKEG

KGN44937.1 hypothetical protein Csa_016142 [Cucumis sativus]2.4e-8494.55Show/hide
Query:  MATPEKPVMVIGVDDSEYATYALEWTLDHFFSSTPNPPFKLVVVYAKPFPDVFVGVGGPGMIVGSAGSYQFLNEDLKKKAALIIATARGLCESRSVNDVK
        MAT EK V+VIGVDDSEYATYALEWTLDHFFSSTPNPPFKLVVVYAKPFPDVFVGVGGPGMIVGSAGSYQFLNEDLKKKAAL+IATARG+CES+SVNDVK
Subjt:  MATPEKPVMVIGVDDSEYATYALEWTLDHFFSSTPNPPFKLVVVYAKPFPDVFVGVGGPGMIVGSAGSYQFLNEDLKKKAALIIATARGLCESRSVNDVK

Query:  YEVDEGDARYVLCQAVDKHRASMLVVGSHGYGALKRAFLGSVSDYCAHQASCTVLIVKKLKTKEG
        YEVDEGDARYVLCQAV+KH ASMLVVGSHGYGALKRAFLGSVSDYCAHQASCTV+IVKKLKTKEG
Subjt:  YEVDEGDARYVLCQAVDKHRASMLVVGSHGYGALKRAFLGSVSDYCAHQASCTVLIVKKLKTKEG

XP_008451579.1 PREDICTED: universal stress protein A-like protein [Cucumis melo]5.9e-8394.55Show/hide
Query:  MATPEKPVMVIGVDDSEYATYALEWTLDHFFSSTPNPPFKLVVVYAKPFPDVFVGVGGPGMIVGSAGSYQFLNEDLKKKAALIIATARGLCESRSVNDVK
        MAT EK V+VIGVDDSEYATYALEWTLDHFFSSTPN PFKLVVVYAKPFPDVFVGVGGPGMIVGSAGSYQFLNEDLKKKAALIIATARG+CES+SVNDVK
Subjt:  MATPEKPVMVIGVDDSEYATYALEWTLDHFFSSTPNPPFKLVVVYAKPFPDVFVGVGGPGMIVGSAGSYQFLNEDLKKKAALIIATARGLCESRSVNDVK

Query:  YEVDEGDARYVLCQAVDKHRASMLVVGSHGYGALKRAFLGSVSDYCAHQASCTVLIVKKLKTKEG
        YEVDEGDAR VLCQAVDKH ASMLVVGSHGYGALKRAFLGSVSDYCAHQASCTV+IVKKLKTKEG
Subjt:  YEVDEGDARYVLCQAVDKHRASMLVVGSHGYGALKRAFLGSVSDYCAHQASCTVLIVKKLKTKEG

XP_031745099.1 universal stress protein PHOS34 [Cucumis sativus]2.4e-8494.55Show/hide
Query:  MATPEKPVMVIGVDDSEYATYALEWTLDHFFSSTPNPPFKLVVVYAKPFPDVFVGVGGPGMIVGSAGSYQFLNEDLKKKAALIIATARGLCESRSVNDVK
        MAT EK V+VIGVDDSEYATYALEWTLDHFFSSTPNPPFKLVVVYAKPFPDVFVGVGGPGMIVGSAGSYQFLNEDLKKKAAL+IATARG+CES+SVNDVK
Subjt:  MATPEKPVMVIGVDDSEYATYALEWTLDHFFSSTPNPPFKLVVVYAKPFPDVFVGVGGPGMIVGSAGSYQFLNEDLKKKAALIIATARGLCESRSVNDVK

Query:  YEVDEGDARYVLCQAVDKHRASMLVVGSHGYGALKRAFLGSVSDYCAHQASCTVLIVKKLKTKEG
        YEVDEGDARYVLCQAV+KH ASMLVVGSHGYGALKRAFLGSVSDYCAHQASCTV+IVKKLKTKEG
Subjt:  YEVDEGDARYVLCQAVDKHRASMLVVGSHGYGALKRAFLGSVSDYCAHQASCTVLIVKKLKTKEG

XP_038899523.1 universal stress protein PHOS32-like [Benincasa hispida]3.8e-8291.57Show/hide
Query:  MATPEKPVMVIGVDDSEYATYALEWTLDHFFSSTPNPPFKLVVVYAKPFPDVFVGVGGPGMIVGSAGSYQFLNEDLKKKAALIIATARGLCESRSVN-DV
        MAT EKPV+VIGVDDSEYATYALEWTLDHFF+STPNPPFKLVVVYAKPFPD+FVGVGGPGMIVGSAGSYQFLNEDLKKKAAL+IATARG+CES+S+N +V
Subjt:  MATPEKPVMVIGVDDSEYATYALEWTLDHFFSSTPNPPFKLVVVYAKPFPDVFVGVGGPGMIVGSAGSYQFLNEDLKKKAALIIATARGLCESRSVN-DV

Query:  KYEVDEGDARYVLCQAVDKHRASMLVVGSHGYGALKRAFLGSVSDYCAHQASCTVLIVKKLKTKEG
        KYEVDEGDARYVLCQAV+KHRASMLVVGSHGYGA+KRAFLGSVSDYCAHQASCTV+IVK LKTKEG
Subjt:  KYEVDEGDARYVLCQAVDKHRASMLVVGSHGYGALKRAFLGSVSDYCAHQASCTVLIVKKLKTKEG

TrEMBL top hitse value%identityAlignment
A0A0A0K987 Usp domain-containing protein1.2e-8494.55Show/hide
Query:  MATPEKPVMVIGVDDSEYATYALEWTLDHFFSSTPNPPFKLVVVYAKPFPDVFVGVGGPGMIVGSAGSYQFLNEDLKKKAALIIATARGLCESRSVNDVK
        MAT EK V+VIGVDDSEYATYALEWTLDHFFSSTPNPPFKLVVVYAKPFPDVFVGVGGPGMIVGSAGSYQFLNEDLKKKAAL+IATARG+CES+SVNDVK
Subjt:  MATPEKPVMVIGVDDSEYATYALEWTLDHFFSSTPNPPFKLVVVYAKPFPDVFVGVGGPGMIVGSAGSYQFLNEDLKKKAALIIATARGLCESRSVNDVK

Query:  YEVDEGDARYVLCQAVDKHRASMLVVGSHGYGALKRAFLGSVSDYCAHQASCTVLIVKKLKTKEG
        YEVDEGDARYVLCQAV+KH ASMLVVGSHGYGALKRAFLGSVSDYCAHQASCTV+IVKKLKTKEG
Subjt:  YEVDEGDARYVLCQAVDKHRASMLVVGSHGYGALKRAFLGSVSDYCAHQASCTVLIVKKLKTKEG

A0A1S3BRV2 universal stress protein A-like protein2.9e-8394.55Show/hide
Query:  MATPEKPVMVIGVDDSEYATYALEWTLDHFFSSTPNPPFKLVVVYAKPFPDVFVGVGGPGMIVGSAGSYQFLNEDLKKKAALIIATARGLCESRSVNDVK
        MAT EK V+VIGVDDSEYATYALEWTLDHFFSSTPN PFKLVVVYAKPFPDVFVGVGGPGMIVGSAGSYQFLNEDLKKKAALIIATARG+CES+SVNDVK
Subjt:  MATPEKPVMVIGVDDSEYATYALEWTLDHFFSSTPNPPFKLVVVYAKPFPDVFVGVGGPGMIVGSAGSYQFLNEDLKKKAALIIATARGLCESRSVNDVK

Query:  YEVDEGDARYVLCQAVDKHRASMLVVGSHGYGALKRAFLGSVSDYCAHQASCTVLIVKKLKTKEG
        YEVDEGDAR VLCQAVDKH ASMLVVGSHGYGALKRAFLGSVSDYCAHQASCTV+IVKKLKTKEG
Subjt:  YEVDEGDARYVLCQAVDKHRASMLVVGSHGYGALKRAFLGSVSDYCAHQASCTVLIVKKLKTKEG

A0A5D3D1K7 APO protein 33.4e-8495.15Show/hide
Query:  MATPEKPVMVIGVDDSEYATYALEWTLDHFFSSTPNPPFKLVVVYAKPFPDVFVGVGGPGMIVGSAGSYQFLNEDLKKKAALIIATARGLCESRSVNDVK
        MAT EK V+VIGVDDSEYATYALEWTLDHFFSSTPNPPFKLVVVYAKPFPDVFVGVGGPGMIVGSAGSYQFLNEDLKKKAALIIATARG+CES+SVNDVK
Subjt:  MATPEKPVMVIGVDDSEYATYALEWTLDHFFSSTPNPPFKLVVVYAKPFPDVFVGVGGPGMIVGSAGSYQFLNEDLKKKAALIIATARGLCESRSVNDVK

Query:  YEVDEGDARYVLCQAVDKHRASMLVVGSHGYGALKRAFLGSVSDYCAHQASCTVLIVKKLKTKEG
        YEVDEGDAR VLCQAVDKH ASMLVVGSHGYGALKRAFLGSVSDYCAHQASCTV+IVKKLKTKEG
Subjt:  YEVDEGDARYVLCQAVDKHRASMLVVGSHGYGALKRAFLGSVSDYCAHQASCTVLIVKKLKTKEG

A0A6J1GP13 universal stress protein A-like protein isoform X14.6e-8192.64Show/hide
Query:  MATPEKPVMVIGVDDSEYATYALEWTLDHFFSSTPNPPFKLVVVYAKPFPDVFVGVGGPGMIVGSAGSYQFLNEDLKKKAALIIATARGLCESRSVNDVK
        MAT EKPVMVIGVDDSEYATYALEWTLDHFFSST NPPFKLVVVYAKPFPDVFVGVGGPGMIVGSAGSYQFLN+DLKKKAALIIATARG+CES+ VND K
Subjt:  MATPEKPVMVIGVDDSEYATYALEWTLDHFFSSTPNPPFKLVVVYAKPFPDVFVGVGGPGMIVGSAGSYQFLNEDLKKKAALIIATARGLCESRSVNDVK

Query:  YEVDEGDARYVLCQAVDKHRASMLVVGSHGYGALKRAFLGSVSDYCAHQASCTVLIVKKLKTK
        +EVDEGDARYVLCQAVDKHRASMLVVGSHGYGA+KRAFLGSVSDYCAHQASCTV+IVKKL  K
Subjt:  YEVDEGDARYVLCQAVDKHRASMLVVGSHGYGALKRAFLGSVSDYCAHQASCTVLIVKKLKTK

A0A6J1JQ78 universal stress protein A-like protein6.0e-8192.02Show/hide
Query:  MATPEKPVMVIGVDDSEYATYALEWTLDHFFSSTPNPPFKLVVVYAKPFPDVFVGVGGPGMIVGSAGSYQFLNEDLKKKAALIIATARGLCESRSVNDVK
        MAT EKPVMVIGVDDSEYATYALEWTLDHFFSST NPPFKLVVVYAKPFPDVF+GVGGPGMIVGSAGSYQFLN+DLKKKAALIIATARG+CES+ VND K
Subjt:  MATPEKPVMVIGVDDSEYATYALEWTLDHFFSSTPNPPFKLVVVYAKPFPDVFVGVGGPGMIVGSAGSYQFLNEDLKKKAALIIATARGLCESRSVNDVK

Query:  YEVDEGDARYVLCQAVDKHRASMLVVGSHGYGALKRAFLGSVSDYCAHQASCTVLIVKKLKTK
        +EVDEGDARYVLCQAVDKHRASMLVVGSHGYGA+KRAFLGSVSDYCAHQASCTV+IVKKL  K
Subjt:  YEVDEGDARYVLCQAVDKHRASMLVVGSHGYGALKRAFLGSVSDYCAHQASCTVLIVKKLKTK

SwissProt top hitse value%identityAlignment
Q57951 Universal stress protein MJ05311.0e-0531.36Show/hide
Query:  VYAKPFPDVFVGVGGPGMIVGSAGSYQFLNEDLKKKAALIIATARGLCESRSVNDVKYEVDEGDARYVLCQAVDKHRASMLVVGSHGYGALKRAFLGSVS
        VYA    DV   VG P     + GS++ ++E LK++    +   + + E   V  +  E+ EG     + +  +K +A ++V+G+ G   L+R  LGSV+
Subjt:  VYAKPFPDVFVGVGGPGMIVGSAGSYQFLNEDLKKKAALIIATARGLCESRSVNDVKYEVDEGDARYVLCQAVDKHRASMLVVGSHGYGALKRAFLGSVS

Query:  DYCAHQASCTVLIVKKLK
        +     A C VL+VKK K
Subjt:  DYCAHQASCTVLIVKKLK

Q8L4N1 Universal stress protein PHOS342.2e-0827.38Show/hide
Query:  IGVDDSEYATYALEWTLDHFFSSTPNPPFKLVVVYAKPFPDVFVGVGG---------PGMIVGSAGSYQFLNEDLKKKAALIIA-TARGLCESRSVNDVK
        + VD SE + +A+ W +DH+      P   +V+++  P   +F    G         P          +   ED     +  +A  A+ L E+   + + 
Subjt:  IGVDDSEYATYALEWTLDHFFSSTPNPPFKLVVVYAKPFPDVFVGVGG---------PGMIVGSAGSYQFLNEDLKKKAALIIA-TARGLCESRSVNDVK

Query:  YEVDEGDARYVLCQAVDKHRASMLVVGSHGYGALKRAF---LGSVSDYCAHQASCTVLIVKKLKTKEG
        + V + D R  LC   ++   S +++GS G+GA KR     LGSVSDYC H   C V++V+    ++G
Subjt:  YEVDEGDARYVLCQAVDKHRASMLVVGSHGYGALKRAF---LGSVSDYCAHQASCTVLIVKKLKTKEG

Q8LGG8 Universal stress protein A-like protein1.0e-0827.54Show/hide
Query:  ALEWTLDHFFSSTPNPPFKLVVVYAKPFPDVFVGVGGPGMIVGSAGSYQFLNEDLKKKAALIIATARGLCESRSVNDVKYEVDEGDARYVLCQAVDKHRA
        A EWTL+    S  +  FK+++++ +   +   G      I  S   ++ + +  K K   ++      C    V    + +  GD + V+CQ V + R 
Subjt:  ALEWTLDHFFSSTPNPPFKLVVVYAKPFPDVFVGVGGPGMIVGSAGSYQFLNEDLKKKAALIIATARGLCESRSVNDVKYEVDEGDARYVLCQAVDKHRA

Query:  SMLVVGSHGYGALKRAFLGSVSDYCAHQASCTVLIVKK
          LVVGS G G  ++ F+G+VS +C   A C V+ +K+
Subjt:  SMLVVGSHGYGALKRAFLGSVSDYCAHQASCTVLIVKK

Q8VYN9 Universal stress protein PHOS321.3e-0826.99Show/hide
Query:  IGVDDSEYATYALEWTLDHFFSSTPNPPFKLVVVYAKPFPDVFVGVGGP----GMIVGSAGSYQFLNEDLKKKAALIIATARGLCESRSVNDVKYEVDEG
        + VD SE +++A+ W +DH+      P   +V+++  P   +F    GP      I       Q   ED     +  +A      +        + V + 
Subjt:  IGVDDSEYATYALEWTLDHFFSSTPNPPFKLVVVYAKPFPDVFVGVGGP----GMIVGSAGSYQFLNEDLKKKAALIIATARGLCESRSVNDVKYEVDEG

Query:  DARYVLCQAVDKHRASMLVVGSHGYGALKR----AFLGSVSDYCAHQASCTVLIVKKLKTKEG
        D R  LC  +++   S +++GS G+GA K+      LGSVSDYC H   C V++V+    ++G
Subjt:  DARYVLCQAVDKHRASMLVVGSHGYGALKR----AFLGSVSDYCAHQASCTVLIVKKLKTKEG

Arabidopsis top hitse value%identityAlignment
AT1G09740.1 Adenine nucleotide alpha hydrolases-like superfamily protein3.6e-2238.22Show/hide
Query:  MVIGVDDSEYATYALEWTLDHFFSSTPNPPFKLVVVYAKPFPDVFVGV-------GGPGMIVGSAGSYQFLNEDLKKKAALIIATARGLCESRSVNDVKY
        +V+ VD SE +  AL W LD+   S+ +     VV++ +P P V  GV       GGP  +   A +   + +  K+    I+  A  +C  +SVN VK 
Subjt:  MVIGVDDSEYATYALEWTLDHFFSSTPNPPFKLVVVYAKPFPDVFVGV-------GGPGMIVGSAGSYQFLNEDLKKKAALIIATARGLCESRSVNDVKY

Query:  EVDEGDARYVLCQAVDKHRASMLVVGSHGYGALKRAFLGSVSDYCAHQASCTVLIVK
        +V  GD +Y +C+AV+   A +LV+GS  YG +KR FLGSVS+YC + A C V+I+K
Subjt:  EVDEGDARYVLCQAVDKHRASMLVVGSHGYGALKRAFLGSVSDYCAHQASCTVLIVK

AT2G47710.1 Adenine nucleotide alpha hydrolases-like superfamily protein5.6e-4758.79Show/hide
Query:  MATPE-KPVMVIGVDDSEYATYALEWTLDHFFSS-TPNPPFKLVVVYAKPFPDVFVGVGGPGMIVGSAGSYQFLNEDLKKKAALIIATARGLCESRSVND
        MAT + K VMV+GVDDSE +TYALEWTLD FF+   PN PFKL +V+AKP     VG+ GP    G+A    +++ DLK  AA ++  A+ +C+SRSV+ 
Subjt:  MATPE-KPVMVIGVDDSEYATYALEWTLDHFFSS-TPNPPFKLVVVYAKPFPDVFVGVGGPGMIVGSAGSYQFLNEDLKKKAALIIATARGLCESRSVND

Query:  VKYEVDEGDARYVLCQAVDKHRASMLVVGSHGYGALKRAFLGSVSDYCAHQASCTVLIVKKLKTK
           EV EGDAR +LC+ VDKH AS+LVVGSHGYGA+KRA LGS SDYCAH A C+V+IVKK K K
Subjt:  VKYEVDEGDARYVLCQAVDKHRASMLVVGSHGYGALKRAFLGSVSDYCAHQASCTVLIVKKLKTK

AT3G11930.1 Adenine nucleotide alpha hydrolases-like superfamily protein3.3e-2335.19Show/hide
Query:  MVIGVDDSEYATYALEWTLDHF-------FSSTPNPPFKLVVVYAKPFPDVFVGVGGPG--MIVGSAGSYQFLNEDLKKKAALIIATARGLCESRSVNDV
        MV+ +D+S+ + YAL+W +DHF        ++        V+    PF        GPG   +  S+   + + +  ++ +A +++ A  +C ++ +   
Subjt:  MVIGVDDSEYATYALEWTLDHF-------FSSTPNPPFKLVVVYAKPFPDVFVGVGGPG--MIVGSAGSYQFLNEDLKKKAALIIATARGLCESRSVNDV

Query:  KYEVDEGDARYVLCQAVDKHRASMLVVGSHGYGALKRAFLGSVSDYCAHQASCTVLIVKKLK
        +  V EG+A+ ++C+AV+K    +LVVGS G G +KRAFLGSVSDYCAH A+C +LIVK  K
Subjt:  KYEVDEGDARYVLCQAVDKHRASMLVVGSHGYGALKRAFLGSVSDYCAHQASCTVLIVKKLK

AT3G11930.2 Adenine nucleotide alpha hydrolases-like superfamily protein4.3e-2334.97Show/hide
Query:  MVIGVDDSEYATYALEWTLDHF------FSSTPNPPFKLVVVYAKP----FPDVFVGVGGPGMIVGSAGSYQFLNEDLKKKAALIIATARGLCESRSVND
        MV+ +D+S+ + YAL+W +DHF       ++       L V++ +     F     G GG   +  S+   + + +  ++ +A +++ A  +C ++ +  
Subjt:  MVIGVDDSEYATYALEWTLDHF------FSSTPNPPFKLVVVYAKP----FPDVFVGVGGPGMIVGSAGSYQFLNEDLKKKAALIIATARGLCESRSVND

Query:  VKYEVDEGDARYVLCQAVDKHRASMLVVGSHGYGALKRAFLGSVSDYCAHQASCTVLIVKKLK
         +  V EG+A+ ++C+AV+K    +LVVGS G G +KRAFLGSVSDYCAH A+C +LIVK  K
Subjt:  VKYEVDEGDARYVLCQAVDKHRASMLVVGSHGYGALKRAFLGSVSDYCAHQASCTVLIVKKLK

AT3G11930.4 Adenine nucleotide alpha hydrolases-like superfamily protein3.3e-2335.98Show/hide
Query:  MVIGVDDSEYATYALEWTLDHF-------FSSTPNPPFKLVVVYAKPFPDVFVGVGGPGMIVGSAGSYQFLNEDLKK----KAALIIATARGLCESRSVN
        MV+ +D+S+ + YAL+W +DHF        ++        V+    PF        GPG    +  +   + E +KK     +A +++ A  +C ++ + 
Subjt:  MVIGVDDSEYATYALEWTLDHF-------FSSTPNPPFKLVVVYAKPFPDVFVGVGGPGMIVGSAGSYQFLNEDLKK----KAALIIATARGLCESRSVN

Query:  DVKYEVDEGDARYVLCQAVDKHRASMLVVGSHGYGALKRAFLGSVSDYCAHQASCTVLIVKKLK
          +  V EG+A+ ++C+AV+K    +LVVGS G G +KRAFLGSVSDYCAH A+C +LIVK  K
Subjt:  DVKYEVDEGDARYVLCQAVDKHRASMLVVGSHGYGALKRAFLGSVSDYCAHQASCTVLIVKKLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGACATTATATGCAAGAGGTCCTTAGCAGAAATTGCTGACACGATCTGCCACGTGGACAAACTCGCCTTGAACTTAAAAAAGCGCCCGCGTTCTCCTTCTTCCTG
TTTTAACACCAGTTTCACCAATCCCCAAACACAGGCGGCGTTCTGGAATCGTTTCGACGGCGGCGATGGCGGCAGCAGCAACGAAGCCTGTGATGGTAATCGGAGTCGAC
GACAGCGAATGCGCGATCTCCGCCCTGGAATGGACATTGGATCCGATTTTTCTCTCAAACGGTTCAAATTATATCCGTTCAAGCTCGTCCTCGTTCATGTCAAACCCTCT
CCCGACGTCTTCGTCGGCGTCTCCGGACCGGGAAGAATTGCAGGATCGGTTGAAACCTACCAAGCTTTGGACGGTGATTTGAAGAGGAAAGCTGCGACAACTCTCAAAAT
TGCTAAAGAAATTTGCGCTGCAAAATCGAAGGAGATGCTAGGTAAGTACTGTGCGAGGCGGTTAACAAGCACCGGGCTTCAGTGCTCGTGggtgttattagggagtgtga
gtgactattgtgcccgtcaggctccttgcacggtcatgattgtgaagTTTTCCGTATACTTGTGGAATGTGCTGGTAGCCAGAAAAGCCAGCGGCGGCGGCAGAATTGTT
CCGATGGCGACACCGGAGAAGCCAGTTATGGTGATCGGAGTGGACGACAGCGAATACGCAACCTACGCTTTAGAGTGGACACTCGACCACTTCTTCTCTTCAACGCCAAA
TCCTCCGTTCAAGCTCGTCGTCGTCTACGCCAAACCGTTTCCCGACGTCTTCGTCGGCGTTGGCGGACCTGGAATGATTGTAGGTTCTGCTGGAAGTTACCAGTTCCTGA
ACGAGGATTTGAAGAAGAAAGCTGCGTTGATTATCGCAACTGCAAGAGGACTCTGCGAATCAAGATCGGTGAATGATGTGAAATATGAGGTGGATGAAGGAGATGCTAGG
TATGTGCTGTGTCAGGCGGTGGATAAGCACAGGGCTTCAATGCTGGTGGTGGGAAGCCATGGCTATGGAGCTCTCAAGAGGGCGTTTTTGGGGAGTGTGAGTGACTATTG
TGCTCATCAAGCTTCATGTACCGTCCTGATTGTCAAGAAGCTTAAAACCAAGGAAGGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGTGACATTATATGCAAGAGGTCCTTAGCAGAAATTGCTGACACGATCTGCCACGTGGACAAACTCGCCTTGAACTTAAAAAAGCGCCCGCGTTCTCCTTCTTCCTG
TTTTAACACCAGTTTCACCAATCCCCAAACACAGGCGGCGTTCTGGAATCGTTTCGACGGCGGCGATGGCGGCAGCAGCAACGAAGCCTGTGATGGTAATCGGAGTCGAC
GACAGCGAATGCGCGATCTCCGCCCTGGAATGGACATTGGATCCGATTTTTCTCTCAAACGGTTCAAATTATATCCGTTCAAGCTCGTCCTCGTTCATGTCAAACCCTCT
CCCGACGTCTTCGTCGGCGTCTCCGGACCGGGAAGAATTGCAGGATCGGTTGAAACCTACCAAGCTTTGGACGGTGATTTGAAGAGGAAAGCTGCGACAACTCTCAAAAT
TGCTAAAGAAATTTGCGCTGCAAAATCGAAGGAGATGCTAGGTAAGTACTGTGCGAGGCGGTTAACAAGCACCGGGCTTCAGTGCTCGTGggtgttattagggagtgtga
gtgactattgtgcccgtcaggctccttgcacggtcatgattgtgaagTTTTCCGTATACTTGTGGAATGTGCTGGTAGCCAGAAAAGCCAGCGGCGGCGGCAGAATTGTT
CCGATGGCGACACCGGAGAAGCCAGTTATGGTGATCGGAGTGGACGACAGCGAATACGCAACCTACGCTTTAGAGTGGACACTCGACCACTTCTTCTCTTCAACGCCAAA
TCCTCCGTTCAAGCTCGTCGTCGTCTACGCCAAACCGTTTCCCGACGTCTTCGTCGGCGTTGGCGGACCTGGAATGATTGTAGGTTCTGCTGGAAGTTACCAGTTCCTGA
ACGAGGATTTGAAGAAGAAAGCTGCGTTGATTATCGCAACTGCAAGAGGACTCTGCGAATCAAGATCGGTGAATGATGTGAAATATGAGGTGGATGAAGGAGATGCTAGG
TATGTGCTGTGTCAGGCGGTGGATAAGCACAGGGCTTCAATGCTGGTGGTGGGAAGCCATGGCTATGGAGCTCTCAAGAGGGCGTTTTTGGGGAGTGTGAGTGACTATTG
TGCTCATCAAGCTTCATGTACCGTCCTGATTGTCAAGAAGCTTAAAACCAAGGAAGGTTAA
Protein sequenceShow/hide protein sequence
MGDIICKRSLAEIADTICHVDKLALNLKKRPRSPSSCFNTSFTNPQTQAAFWNRFDGGDGGSSNEACDGNRSRRQRMRDLRPGMDIGSDFSLKRFKLYPFKLVLVHVKPS
PDVFVGVSGPGRIAGSVETYQALDGDLKRKAATTLKIAKEICAAKSKEMLGKYCARRLTSTGLQCSWVLLGSVSDYCARQAPCTVMIVKFSVYLWNVLVARKASGGGRIV
PMATPEKPVMVIGVDDSEYATYALEWTLDHFFSSTPNPPFKLVVVYAKPFPDVFVGVGGPGMIVGSAGSYQFLNEDLKKKAALIIATARGLCESRSVNDVKYEVDEGDAR
YVLCQAVDKHRASMLVVGSHGYGALKRAFLGSVSDYCAHQASCTVLIVKKLKTKEG