; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS001160 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS001160
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionAdenine nucleotide alpha hydrolases-like superfamily protein
Genome locationscaffold36:1908128..1909575
RNA-Seq ExpressionMS001160
SyntenyMS001160
Gene Ontology termsNA
InterPro domainsIPR006015 - Universal stress protein A family
IPR006016 - UspA
IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6598738.1 Universal stress protein A-like protein, partial [Cucurbita argyrosperma subsp. sororia]1.5e-8693.06Show/hide
Query:  MSENLRCVIVAVDGSEESMGALRWALENLKLRSAAPDSTDGTFVALHVQSPPSIAAGLSPGPIPFGGPSDLEVPAFAAAIESHQRRITAAIFDHASRICS
        MS +LRCVIVAVDGSEESMGALRWA ENLKLRS+APDSTDGTF+ LHVQ PP+IAAGLSPGPIPFGGPSDLEVPAF AAIESHQRRITAAI DHASRICS
Subjt:  MSENLRCVIVAVDGSEESMGALRWALENLKLRSAAPDSTDGTFVALHVQSPPSIAAGLSPGPIPFGGPSDLEVPAFAAAIESHQRRITAAIFDHASRICS

Query:  EYKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKKRGSAS
        EYKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKKRGS++
Subjt:  EYKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKKRGSAS

XP_022132206.1 universal stress protein PHOS34 [Momordica charantia]5.3e-92100Show/hide
Query:  MSENLRCVIVAVDGSEESMGALRWALENLKLRSAAPDSTDGTFVALHVQSPPSIAAGLSPGPIPFGGPSDLEVPAFAAAIESHQRRITAAIFDHASRICS
        MSENLRCVIVAVDGSEESMGALRWALENLKLRSAAPDSTDGTFVALHVQSPPSIAAGLSPGPIPFGGPSDLEVPAFAAAIESHQRRITAAIFDHASRICS
Subjt:  MSENLRCVIVAVDGSEESMGALRWALENLKLRSAAPDSTDGTFVALHVQSPPSIAAGLSPGPIPFGGPSDLEVPAFAAAIESHQRRITAAIFDHASRICS

Query:  EYKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKKRGSAS
        EYKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKKRGSAS
Subjt:  EYKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKKRGSAS

XP_022961627.1 universal stress protein PHOS34-like [Cucurbita moschata]1.7e-8591.91Show/hide
Query:  MSENLRCVIVAVDGSEESMGALRWALENLKLRSAAPDSTDGTFVALHVQSPPSIAAGLSPGPIPFGGPSDLEVPAFAAAIESHQRRITAAIFDHASRICS
        MS +LRCVIVAVDGSEESMGALRWA ENLKLRS+APDS DGTF+ LHVQ PP+IAAGLSPGPIPFGGPSDLEVPAF AAIESHQRRITAAI DHASRICS
Subjt:  MSENLRCVIVAVDGSEESMGALRWALENLKLRSAAPDSTDGTFVALHVQSPPSIAAGLSPGPIPFGGPSDLEVPAFAAAIESHQRRITAAIFDHASRICS

Query:  EYKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKKRGSAS
        EYKVNVETKVVIGDPKEKICE AENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKKRGS++
Subjt:  EYKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKKRGSAS

XP_022997274.1 universal stress protein PHOS34 [Cucurbita maxima]1.1e-8491.91Show/hide
Query:  MSENLRCVIVAVDGSEESMGALRWALENLKLRSAAPDSTDGTFVALHVQSPPSIAAGLSPGPIPFGGPSDLEVPAFAAAIESHQRRITAAIFDHASRICS
        MS +LRCVIVAVDGSEESMGALRWA ENLKLRS+AP STDGTFV LHVQ PP+IAAGLSPGPIPFGGPSDLEVPAF AAIESHQRRITAAI  HASRICS
Subjt:  MSENLRCVIVAVDGSEESMGALRWALENLKLRSAAPDSTDGTFVALHVQSPPSIAAGLSPGPIPFGGPSDLEVPAFAAAIESHQRRITAAIFDHASRICS

Query:  EYKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKKRGSAS
        +YKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKKRGS++
Subjt:  EYKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKKRGSAS

XP_023547229.1 universal stress protein PHOS34 [Cucurbita pepo subsp. pepo]4.4e-8693.06Show/hide
Query:  MSENLRCVIVAVDGSEESMGALRWALENLKLRSAAPDSTDGTFVALHVQSPPSIAAGLSPGPIPFGGPSDLEVPAFAAAIESHQRRITAAIFDHASRICS
        MS +LRCVIVAVDGSEESMGALRWA ENLKLRS+ PDSTDGTFV LHVQ PP+IAAGLSPGPIPFGGPSDLEVPAF AAIESHQRRITAAI DHASRICS
Subjt:  MSENLRCVIVAVDGSEESMGALRWALENLKLRSAAPDSTDGTFVALHVQSPPSIAAGLSPGPIPFGGPSDLEVPAFAAAIESHQRRITAAIFDHASRICS

Query:  EYKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKKRGSAS
        EYKVNVETKVVIGDPKEKICE AENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKKRGS+S
Subjt:  EYKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKKRGSAS

TrEMBL top hitse value%identityAlignment
A0A0A0LLD9 Usp domain-containing protein6.3e-8389.02Show/hide
Query:  MSENLRCVIVAVDGSEESMGALRWALENLKLRSAAPDSTDGTFVALHVQSPPSIAAGLSPGPIPFGGPSDLEVPAFAAAIESHQRRITAAIFDHASRICS
        MS +LRCVIVAVDGS+ESMGALRWAL+NLKL S++PDSTDGTFVALHVQ PPSIAAGLSP PIPFGGPSDLEVPAF AAIESHQRRITAAI +HAS+ICS
Subjt:  MSENLRCVIVAVDGSEESMGALRWALENLKLRSAAPDSTDGTFVALHVQSPPSIAAGLSPGPIPFGGPSDLEVPAFAAAIESHQRRITAAIFDHASRICS

Query:  EYKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKKRGSAS
        EY+V VETKVVIGDPKEKICEVAE+LHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDK++GS+S
Subjt:  EYKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKKRGSAS

A0A1S3BBJ3 universal stress protein Sll13881.8e-8289.6Show/hide
Query:  MSENLRCVIVAVDGSEESMGALRWALENLKLRSAAPDSTDGTFVALHVQSPPSIAAGLSPGPIPFGGPSDLEVPAFAAAIESHQRRITAAIFDHASRICS
        MS +LRCVIVAVDGSEESM ALRWA +NLKL S+ PDSTDGTFV LHVQ PPSIAAGLSP PIPFGGPSDLEVPAF AAIESHQRRITAAI +HASRICS
Subjt:  MSENLRCVIVAVDGSEESMGALRWALENLKLRSAAPDSTDGTFVALHVQSPPSIAAGLSPGPIPFGGPSDLEVPAFAAAIESHQRRITAAIFDHASRICS

Query:  EYKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKKRGSAS
        EY+V VETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKK+GS+S
Subjt:  EYKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKKRGSAS

A0A6J1BRL9 universal stress protein PHOS342.6e-92100Show/hide
Query:  MSENLRCVIVAVDGSEESMGALRWALENLKLRSAAPDSTDGTFVALHVQSPPSIAAGLSPGPIPFGGPSDLEVPAFAAAIESHQRRITAAIFDHASRICS
        MSENLRCVIVAVDGSEESMGALRWALENLKLRSAAPDSTDGTFVALHVQSPPSIAAGLSPGPIPFGGPSDLEVPAFAAAIESHQRRITAAIFDHASRICS
Subjt:  MSENLRCVIVAVDGSEESMGALRWALENLKLRSAAPDSTDGTFVALHVQSPPSIAAGLSPGPIPFGGPSDLEVPAFAAAIESHQRRITAAIFDHASRICS

Query:  EYKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKKRGSAS
        EYKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKKRGSAS
Subjt:  EYKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKKRGSAS

A0A6J1HCC8 universal stress protein PHOS34-like8.0e-8691.91Show/hide
Query:  MSENLRCVIVAVDGSEESMGALRWALENLKLRSAAPDSTDGTFVALHVQSPPSIAAGLSPGPIPFGGPSDLEVPAFAAAIESHQRRITAAIFDHASRICS
        MS +LRCVIVAVDGSEESMGALRWA ENLKLRS+APDS DGTF+ LHVQ PP+IAAGLSPGPIPFGGPSDLEVPAF AAIESHQRRITAAI DHASRICS
Subjt:  MSENLRCVIVAVDGSEESMGALRWALENLKLRSAAPDSTDGTFVALHVQSPPSIAAGLSPGPIPFGGPSDLEVPAFAAAIESHQRRITAAIFDHASRICS

Query:  EYKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKKRGSAS
        EYKVNVETKVVIGDPKEKICE AENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKKRGS++
Subjt:  EYKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKKRGSAS

A0A6J1KAZ9 universal stress protein PHOS345.2e-8591.91Show/hide
Query:  MSENLRCVIVAVDGSEESMGALRWALENLKLRSAAPDSTDGTFVALHVQSPPSIAAGLSPGPIPFGGPSDLEVPAFAAAIESHQRRITAAIFDHASRICS
        MS +LRCVIVAVDGSEESMGALRWA ENLKLRS+AP STDGTFV LHVQ PP+IAAGLSPGPIPFGGPSDLEVPAF AAIESHQRRITAAI  HASRICS
Subjt:  MSENLRCVIVAVDGSEESMGALRWALENLKLRSAAPDSTDGTFVALHVQSPPSIAAGLSPGPIPFGGPSDLEVPAFAAAIESHQRRITAAIFDHASRICS

Query:  EYKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKKRGSAS
        +YKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKKRGS++
Subjt:  EYKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKKRGSAS

SwissProt top hitse value%identityAlignment
O07552 Stress response protein NhaX2.9e-0830.49Show/hide
Query:  VIVAVDGSEESMGALRWALENLKLRSAA------PDSTDGTFVALHVQSPPSIAAGLSPGPIPFGGPSDLEVPAFAAAIESHQRRITAAIFDHASRICSE
        +IVA DGSE S  AL  A++  K  +AA       D  D   V +    P + A+ +S G      P   +V +    I   +   T  +   A  + +E
Subjt:  VIVAVDGSEESMGALRWALENLKLRSAA------PDSTDGTFVALHVQSPPSIAAGLSPGPIPFGGPSDLEVPAFAAAIESHQRRITAAIFDHASRICSE

Query:  YKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVK
         + + +  ++ GDP E I E A  + AD++V GSR    +K++  GSVS   +   + PV+IVK
Subjt:  YKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVK

Q57951 Universal stress protein MJ05312.5e-0735.62Show/hide
Query:  RICSEYKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKKR
        ++  E+ V + T+++ G P  +I E AE   ADL+VMG+     ++R+ LGSV+     +  CPV++VK  K+
Subjt:  RICSEYKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKKR

Q8L4N1 Universal stress protein PHOS343.0e-1332.42Show/hide
Query:  RCVIVAVDGSEESMGALRWALEN-------LKLRSAAPDST--DGTFVALHVQSPPSIAAGLSPGPIPFGGPSDLEVPAFAAA-IESHQRRITAAIFDHA
        R + VAVD SEES  A+RWA+++       + +   +P S      +  L +Q+PP  +A   PG  P   PS  +  AF ++ +    + +  A F H 
Subjt:  RCVIVAVDGSEESMGALRWALEN-------LKLRSAAPDST--DGTFVALHVQSPPSIAAGLSPGPIPFGGPSDLEVPAFAAA-IESHQRRITAAIFDHA

Query:  SRICSEYKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMF---LGSVSNYCTNHVECPVIIVK--DKKRGSA
          I  ++           D +E++C   E L+   ++MGSR FG  KR     LGSVS+YC +H  CPV++V+  D + G A
Subjt:  SRICSEYKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMF---LGSVSNYCTNHVECPVIIVK--DKKRGSA

Q8LGG8 Universal stress protein A-like protein3.7e-1141.79Show/hide
Query:  CSEYKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVK
        C E  V  E  +  GDPK+ IC+  + +  D LV+GSR  G  +++F+G+VS +C  H ECPV+ +K
Subjt:  CSEYKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVK

Q8VYN9 Universal stress protein PHOS328.2e-1132.76Show/hide
Query:  RCVIVAVDGSEESMGALRWALENLKLRSAAPDSTDGTFVALHVQSPPSIAAGLSPGPIPF----------GGPSDLEVPAFAAAIESHQRRITAAIFDHA
        R + VAVD SEES  A+RWA+++      A        V LHV SP S+  G   GP+P             PS  +  AF +          A +    
Subjt:  RCVIVAVDGSEESMGALRWALENLKLRSAAPDSTDGTFVALHVQSPPSIAAGLSPGPIPF----------GGPSDLEVPAFAAAIESHQRRITAAIFDHA

Query:  SRICSEYKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRM----FLGSVSNYCTNHVECPVIIVK
          +   YK+++   V   D +E++C   E L    ++MGSR FG  K+      LGSVS+YC +H  CPV++V+
Subjt:  SRICSEYKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRM----FLGSVSNYCTNHVECPVIIVK

Arabidopsis top hitse value%identityAlignment
AT1G09740.1 Adenine nucleotide alpha hydrolases-like superfamily protein2.4e-6671.51Show/hide
Query:  MSENLRCVIVAVDGSEESMGALRWALENLKLRSAAPDSTDGTFVALHVQSPPSIAAGLSPGPIPFGGPSDLEVPAFAAAIESHQRRITAAIFDHASRICS
        +S +L CV+VAVDGSE SM ALRWAL+NLKL S+   S+D +FV LHVQ  PS+AAG+SPG IPFGGPS LEVPAF AAIE HQ+RIT  I +HAS+IC+
Subjt:  MSENLRCVIVAVDGSEESMGALRWALENLKLRSAAPDSTDGTFVALHVQSPPSIAAGLSPGPIPFGGPSDLEVPAFAAAIESHQRRITAAIFDHASRICS

Query:  EYKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKKRGSA
        E  VNV+T+VVIGDPK KICE  ENLHADLLVMGSRA+G IKRMFLGSVSNYCTNH  CPV+I+K K+  SA
Subjt:  EYKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKKRGSA

AT3G11930.1 Adenine nucleotide alpha hydrolases-like superfamily protein3.6e-3040.24Show/hide
Query:  VIVAVDGSEESMGALRWALE---NLKLRSAAPDSTDGTFVALHVQSPPSIAAGLSPGPIPFGGPSDLEVPAFAAAIESHQRRITAAIFDHASRICSEYKV
        ++VA+D S+ S  AL+W ++   NL L +AA ++  G    +HVQSP +  A    GP   GG +     +   +++  Q+  +AA+   A ++C   ++
Subjt:  VIVAVDGSEESMGALRWALE---NLKLRSAAPDSTDGTFVALHVQSPPSIAAGLSPGPIPFGGPSDLEVPAFAAAIESHQRRITAAIFDHASRICSEYKV

Query:  NVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKK
          ET V+ G+ KE ICE  E +H DLLV+GSR  G IKR FLGSVS+YC +H  CP++IVK  K
Subjt:  NVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKK

AT3G11930.2 Adenine nucleotide alpha hydrolases-like superfamily protein3.1e-2942.51Show/hide
Query:  VIVAVDGSEESMGALRWALE---NLKLRSAAPDSTDGTFVALHVQSPPSIAAGLSPGPIPFGGPSDLEVPAFAAAIES---HQRRITAAIFDHASRICSE
        ++VA+D S+ S  AL+W ++   NL L +AA ++  G    +HVQSP +  A    GP   GG +   V A ++ IES    Q+  +AA+   A ++C  
Subjt:  VIVAVDGSEESMGALRWALE---NLKLRSAAPDSTDGTFVALHVQSPPSIAAGLSPGPIPFGGPSDLEVPAFAAAIES---HQRRITAAIFDHASRICSE

Query:  YKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKK
         ++  ET V+ G+ KE ICE  E +H DLLV+GSR  G IKR FLGSVS+YC +H  CP++IVK  K
Subjt:  YKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKK

AT3G11930.4 Adenine nucleotide alpha hydrolases-like superfamily protein1.1e-2941.92Show/hide
Query:  VIVAVDGSEESMGALRWALE---NLKLRSAAPDSTDGTFVALHVQSPPSIAAGLSPGPIPFGGPSDLEVPAFAAAIES---HQRRITAAIFDHASRICSE
        ++VA+D S+ S  AL+W ++   NL L +AA ++  G    +HVQSP +  A    GP    G +   V A ++ IES    Q+  +AA+   A ++C  
Subjt:  VIVAVDGSEESMGALRWALE---NLKLRSAAPDSTDGTFVALHVQSPPSIAAGLSPGPIPFGGPSDLEVPAFAAAIES---HQRRITAAIFDHASRICSE

Query:  YKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKK
         ++  ET V+ G+ KE ICE  E +H DLLV+GSR  G IKR FLGSVS+YC +H  CP++IVK  K
Subjt:  YKVNVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKK

AT3G58450.2 Adenine nucleotide alpha hydrolases-like superfamily protein3.2e-2640.24Show/hide
Query:  VIVAVDGSEESMGALRWALENLKLRSAAPDST---DGTFVALHVQSPPSIAAGLSPGPIPFGGPSDLEVPAFAAAIESHQRRITAAIFDHASRICSEYKV
        V+VA+D S+ S  AL WA+++L++  +A   T    G    LHV   P+    + P     GG   +  P   A  ES     T  +F  A  IC    V
Subjt:  VIVAVDGSEESMGALRWALENLKLRSAAPDST---DGTFVALHVQSPPSIAAGLSPGPIPFGGPSDLEVPAFAAAIESHQRRITAAIFDHASRICSEYKV

Query:  NVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKKRGSAS
          ET ++ GDPKE IC+  E  H DLLV+GSR  G IKR FLGSVS+YC  H +CP++IV+  +  S S
Subjt:  NVETKVVIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKKRGSAS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGAGAATCTCCGGTGCGTGATCGTGGCCGTCGACGGCAGCGAGGAGAGCATGGGCGCTTTACGGTGGGCGCTTGAAAATCTCAAGCTCCGGTCAGCGGCGCCTGA
TTCCACTGACGGCACTTTTGTAGCGCTACATGTTCAATCTCCTCCTTCAATCGCCGCCGGTCTCAGTCCTGGTCCAATCCCCTTTGGCGGCCCTAGCGATCTGGAGGTTC
CTGCTTTCGCGGCAGCAATCGAGTCGCACCAGAGGCGGATTACAGCTGCGATTTTCGACCATGCTTCACGGATTTGCTCCGAGTACAAGGTGAATGTGGAAACTAAAGTA
GTGATCGGAGATCCAAAGGAAAAGATCTGTGAAGTGGCAGAGAATTTGCATGCTGATTTGCTAGTTATGGGATCCCGTGCTTTTGGTCCCATCAAAAGGATGTTCTTGGG
AAGTGTAAGCAACTATTGTACCAACCATGTTGAATGCCCAGTCATCATAGTCAAGGACAAGAAAAGGGGCAGTGCGTCA
mRNA sequenceShow/hide mRNA sequence
ATGTCTGAGAATCTCCGGTGCGTGATCGTGGCCGTCGACGGCAGCGAGGAGAGCATGGGCGCTTTACGGTGGGCGCTTGAAAATCTCAAGCTCCGGTCAGCGGCGCCTGA
TTCCACTGACGGCACTTTTGTAGCGCTACATGTTCAATCTCCTCCTTCAATCGCCGCCGGTCTCAGTCCTGGTCCAATCCCCTTTGGCGGCCCTAGCGATCTGGAGGTTC
CTGCTTTCGCGGCAGCAATCGAGTCGCACCAGAGGCGGATTACAGCTGCGATTTTCGACCATGCTTCACGGATTTGCTCCGAGTACAAGGTGAATGTGGAAACTAAAGTA
GTGATCGGAGATCCAAAGGAAAAGATCTGTGAAGTGGCAGAGAATTTGCATGCTGATTTGCTAGTTATGGGATCCCGTGCTTTTGGTCCCATCAAAAGGATGTTCTTGGG
AAGTGTAAGCAACTATTGTACCAACCATGTTGAATGCCCAGTCATCATAGTCAAGGACAAGAAAAGGGGCAGTGCGTCA
Protein sequenceShow/hide protein sequence
MSENLRCVIVAVDGSEESMGALRWALENLKLRSAAPDSTDGTFVALHVQSPPSIAAGLSPGPIPFGGPSDLEVPAFAAAIESHQRRITAAIFDHASRICSEYKVNVETKV
VIGDPKEKICEVAENLHADLLVMGSRAFGPIKRMFLGSVSNYCTNHVECPVIIVKDKKRGSAS