CuGenDBv2

ClCG05G006150 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG05G006150
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Adenine nucleotide alpha hydrolases-like superfamily protein
Genome location: CG_Chr05:6111472..6114221
RNA-Seq Expression: ClCG05G006150
Synteny: ClCG05G006150
Gene Ontology terms: NA
InterPro domains:
  IPR006015 - Universal stress protein A family
  IPR006016 - UspA
  IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold


Homology
GenBank top hits: e value, % identity, alignment
XP_022937984.1 universal stress protein PHOS34-like [Cucurbita moschata] (e value: 3.4e-67; identity: 81.41%)
Query:  MAEQVMVIGVDESEHSFYALNWTLQHFFGPNTTPYKLVIVNAKPPPSSLLGIAGPAAVDVLPLLDADLKKIANRTVQKAKDICIEHKVQSVQTEVVEGDA
        MA++V+VIGVDE+EHSFYAL W LQHFFGPN  P+ LVIV+AKPPPSS+L I+GPAA +VLPLLD DLKKI NRTVQ+AK+ICIEHKVQ+V TEVVEGDA
Subjt:  MAEQVMVIGVDESEHSFYALNWTLQHFFGPNTTPYKLVIVNAKPPPSSLLGIAGPAAVDVLPLLDADLKKIANRTVQKAKDICIEHKVQSVQTEVVEGDA

Query:  RNVMCDSVEKFHASILVVGSHNYGVVKRMGLGSVSDFCAHHAHCSVMIVKRPPKPM
        RNVMCD+VEK HAS+LVVGSHNYGVVK M LGSVS +CAHHAHCSVMIVKRPPKPM
Subjt:  RNVMCDSVEKFHASILVVGSHNYGVVKRMGLGSVSDFCAHHAHCSVMIVKRPPKPM

XP_022973665.1 universal stress protein PHOS34-like [Cucurbita maxima] (e value: 9.9e-67; identity: 80.77%)
Query:  MAEQVMVIGVDESEHSFYALNWTLQHFFGPNTTPYKLVIVNAKPPPSSLLGIAGPAAVDVLPLLDADLKKIANRTVQKAKDICIEHKVQSVQTEVVEGDA
        MA++V+VIGVDE+EHSFYAL W LQHFFGPN  P+ LVIV+AKPPPSS+L I+GP A +VLPLLD DLKKI NRTVQ+AK+ICIEHKVQ+V TEVVEGDA
Subjt:  MAEQVMVIGVDESEHSFYALNWTLQHFFGPNTTPYKLVIVNAKPPPSSLLGIAGPAAVDVLPLLDADLKKIANRTVQKAKDICIEHKVQSVQTEVVEGDA

Query:  RNVMCDSVEKFHASILVVGSHNYGVVKRMGLGSVSDFCAHHAHCSVMIVKRPPKPM
        RNVMCD+VEK HAS+LVVGSHNYGVVK M LGSVS +CAHHAHCSVMIVKRPPKPM
Subjt:  RNVMCDSVEKFHASILVVGSHNYGVVKRMGLGSVSDFCAHHAHCSVMIVKRPPKPM

XP_023539948.1 universal stress protein PHOS34-like [Cucurbita pepo subsp. pepo] (e value: 2.9e-66; identity: 80.77%)
Query:  MAEQVMVIGVDESEHSFYALNWTLQHFFGPNTTPYKLVIVNAKPPPSSLLGIAGPAAVDVLPLLDADLKKIANRTVQKAKDICIEHKVQSVQTEVVEGDA
        MA++V+VIGVDE+EHSFYAL W LQHFFGPN   + LVIV+AKPPPSS+L I+GPAA +VLPLLD DLKKI NRTVQ+AK+ICIEHKVQ+V TEVVEGDA
Subjt:  MAEQVMVIGVDESEHSFYALNWTLQHFFGPNTTPYKLVIVNAKPPPSSLLGIAGPAAVDVLPLLDADLKKIANRTVQKAKDICIEHKVQSVQTEVVEGDA

Query:  RNVMCDSVEKFHASILVVGSHNYGVVKRMGLGSVSDFCAHHAHCSVMIVKRPPKPM
        RNVMCD+VEK HAS+LVVGSHNYGVVK M LGSVS +CAHHAHCSVMIVKRPPKPM
Subjt:  RNVMCDSVEKFHASILVVGSHNYGVVKRMGLGSVSDFCAHHAHCSVMIVKRPPKPM

XP_038904675.1 universal stress protein PHOS34-like isoform X1 [Benincasa hispida] (e value: 1.6e-77; identity: 92.99%)
Query:  MAEQVMVIGVDESEHSFYALNWTLQHFFGPNTTPYKLVIVNAKPPPSSLLGIAGPAAVDVLPLLDADLKKIANRTVQKAKDICIEHKVQSVQTEVVEGDA
        MAEQVMVIGVDESEHSFYALNWTLQHFFGPN TPYKLVIVNAKPPPSS LG+AGP A+DVLP+LDADLKKIA+RTVQKAKDICIEHKVQ VQTEV+EGDA
Subjt:  MAEQVMVIGVDESEHSFYALNWTLQHFFGPNTTPYKLVIVNAKPPPSSLLGIAGPAAVDVLPLLDADLKKIANRTVQKAKDICIEHKVQSVQTEVVEGDA

Query:  RNVMCDSVEKFHASILVVGSHNYGVVKRMGLGSVSDFCAHHAHCSVMIVKRPPKPMT
        R VMCDSVEKFHASILVVGSHNYGVVKRMGLGSVSDFCAHHAHCSVMIVKRPPKPM+
Subjt:  RNVMCDSVEKFHASILVVGSHNYGVVKRMGLGSVSDFCAHHAHCSVMIVKRPPKPMT

XP_038904684.1 universal stress protein PHOS34-like isoform X2 [Benincasa hispida] (e value: 2.3e-79; identity: 93.63%)
Query:  MAEQVMVIGVDESEHSFYALNWTLQHFFGPNTTPYKLVIVNAKPPPSSLLGIAGPAAVDVLPLLDADLKKIANRTVQKAKDICIEHKVQSVQTEVVEGDA
        MAEQVMVIGVDESEHSFYALNWTLQHFFGPN TPYKLVIVNAKPPPSS LG+AGPAA+DVLP+LDADLKKIA+RTVQKAKDICIEHKVQ VQTEV+EGDA
Subjt:  MAEQVMVIGVDESEHSFYALNWTLQHFFGPNTTPYKLVIVNAKPPPSSLLGIAGPAAVDVLPLLDADLKKIANRTVQKAKDICIEHKVQSVQTEVVEGDA

Query:  RNVMCDSVEKFHASILVVGSHNYGVVKRMGLGSVSDFCAHHAHCSVMIVKRPPKPMT
        R VMCDSVEKFHASILVVGSHNYGVVKRMGLGSVSDFCAHHAHCSVMIVKRPPKPM+
Subjt:  RNVMCDSVEKFHASILVVGSHNYGVVKRMGLGSVSDFCAHHAHCSVMIVKRPPKPMT

TrEMBL top hits: e value, % identity, alignment
A0A0A0LAL5 Usp domain-containing protein (e value: 5.0e-64; identity: 79.62%)
Query:  MAEQVMVIGVDESEHSFYALNWTLQHFFGPNTTPYKLVIVNAKPPPSSLLGIAGPAAVDVLPLLDADLKKIANRTVQKAKDICIEHKVQSVQTEVVEGDA
        MAEQVMVIGVDESEHSFYAL+WTLQHFF PN TPYKL IVNA   PS   G A   + +++P +DADLKK+ NRTVQ+AKDICIEH VQSV+TEVVEGDA
Subjt:  MAEQVMVIGVDESEHSFYALNWTLQHFFGPNTTPYKLVIVNAKPPPSSLLGIAGPAAVDVLPLLDADLKKIANRTVQKAKDICIEHKVQSVQTEVVEGDA

Query:  RNVMCDSVEKFHASILVVGSHNYGVVKRMGLGSVSDFCAHHAHCSVMIVKRPPKPMT
        RNV+CDSVEKFHASIL+VGSH+YGVVK+MGLGSVSD+CA HAHCSVMIVKRPPKPMT
Subjt:  RNVMCDSVEKFHASILVVGSHNYGVVKRMGLGSVSDFCAHHAHCSVMIVKRPPKPMT

A0A1S3AXJ3 uncharacterized protein LOC103483847 isoform X1 (e value: 2.1e-62; identity: 77.85%)
Query:  TMAEQVMVIGVDESEHSFYALNWTLQHFFGPNTTPYKLVIVNAKPPPSSLLGIAGPAAVDVLPLLDADLKKIANRTVQKAKDICIEHKVQSVQTEVVEGD
        TMAE+VMVIGVDESEHSFYAL+WTLQHFFGPN TPYKL IVNA   PS+  G+A   + +++P +DADLKK  N TVQ+AKDICIEHKVQSV+TEVVEGD
Subjt:  TMAEQVMVIGVDESEHSFYALNWTLQHFFGPNTTPYKLVIVNAKPPPSSLLGIAGPAAVDVLPLLDADLKKIANRTVQKAKDICIEHKVQSVQTEVVEGD

Query:  ARNVMCDSVEKFHASILVVGSHNYGVVKRMGLGSVSDFCAHHAHCSVMIVKRPPKPMT
        AR+V+CDSV+KFHASILVVGSH+YGVVK+MGLGSVSD+CA HAHC VMIVKRPPKPM+
Subjt:  ARNVMCDSVEKFHASILVVGSHNYGVVKRMGLGSVSDFCAHHAHCSVMIVKRPPKPMT

A0A6J1CC29 universal stress protein PHOS34-like (e value: 3.1e-66; identity: 78.98%)
Query:  MAEQVMVIGVDESEHSFYALNWTLQHFFGP-NTTPYKLVIVNAKPPPSSLLGIAGPAAVDVLPLLDADLKKIANRTVQKAKDICIEHKVQSVQTEVVEGD
        MAEQV+VIGVDES+HSFYAL W LQHF  P + +PYKLVIV+AKPPPSS++ + GP A + L LLDADL K+ANRTVQKAKDICIEHK+++VQ E+VEGD
Subjt:  MAEQVMVIGVDESEHSFYALNWTLQHFFGP-NTTPYKLVIVNAKPPPSSLLGIAGPAAVDVLPLLDADLKKIANRTVQKAKDICIEHKVQSVQTEVVEGD

Query:  ARNVMCDSVEKFHASILVVGSHNYGVVKRMGLGSVSDFCAHHAHCSVMIVKRPPKPM
        ARNVMCD+VEKFHASILVVGSHNYGVVKRM LGSVS++CA+HAHCSVMIVKRPPKPM
Subjt:  ARNVMCDSVEKFHASILVVGSHNYGVVKRMGLGSVSDFCAHHAHCSVMIVKRPPKPM

A0A6J1FCR8 universal stress protein PHOS34-like (e value: 1.6e-67; identity: 81.41%)
Query:  MAEQVMVIGVDESEHSFYALNWTLQHFFGPNTTPYKLVIVNAKPPPSSLLGIAGPAAVDVLPLLDADLKKIANRTVQKAKDICIEHKVQSVQTEVVEGDA
        MA++V+VIGVDE+EHSFYAL W LQHFFGPN  P+ LVIV+AKPPPSS+L I+GPAA +VLPLLD DLKKI NRTVQ+AK+ICIEHKVQ+V TEVVEGDA
Subjt:  MAEQVMVIGVDESEHSFYALNWTLQHFFGPNTTPYKLVIVNAKPPPSSLLGIAGPAAVDVLPLLDADLKKIANRTVQKAKDICIEHKVQSVQTEVVEGDA

Query:  RNVMCDSVEKFHASILVVGSHNYGVVKRMGLGSVSDFCAHHAHCSVMIVKRPPKPM
        RNVMCD+VEK HAS+LVVGSHNYGVVK M LGSVS +CAHHAHCSVMIVKRPPKPM
Subjt:  RNVMCDSVEKFHASILVVGSHNYGVVKRMGLGSVSDFCAHHAHCSVMIVKRPPKPM

A0A6J1IBY1 universal stress protein PHOS34-like (e value: 4.8e-67; identity: 80.77%)
Query:  MAEQVMVIGVDESEHSFYALNWTLQHFFGPNTTPYKLVIVNAKPPPSSLLGIAGPAAVDVLPLLDADLKKIANRTVQKAKDICIEHKVQSVQTEVVEGDA
        MA++V+VIGVDE+EHSFYAL W LQHFFGPN  P+ LVIV+AKPPPSS+L I+GP A +VLPLLD DLKKI NRTVQ+AK+ICIEHKVQ+V TEVVEGDA
Subjt:  MAEQVMVIGVDESEHSFYALNWTLQHFFGPNTTPYKLVIVNAKPPPSSLLGIAGPAAVDVLPLLDADLKKIANRTVQKAKDICIEHKVQSVQTEVVEGDA

Query:  RNVMCDSVEKFHASILVVGSHNYGVVKRMGLGSVSDFCAHHAHCSVMIVKRPPKPM
        RNVMCD+VEK HAS+LVVGSHNYGVVK M LGSVS +CAHHAHCSVMIVKRPPKPM
Subjt:  RNVMCDSVEKFHASILVVGSHNYGVVKRMGLGSVSDFCAHHAHCSVMIVKRPPKPM

SwissProt top hits: e value, % identity, alignment
Q57951 Universal stress protein MJ0531 (e value: 3.5e-06; identity: 27.52%)
Query:  MVIGVDESEHSFYALNWTLQHFFGPNTTPYKLVIVNAKPPPSSLLGIAGPAAVDVLPLLDADLKKIANRTVQKAKDICIEHKVQSVQTEVVEGDARNVMC
        +VI  D S+ S  A    +      +   Y + +V+  P         G  A     L+   LK+     ++K K +  E  V+ + TE++EG   N + 
Subjt:  MVIGVDESEHSFYALNWTLQHFFGPNTTPYKLVIVNAKPPPSSLLGIAGPAAVDVLPLLDADLKKIANRTVQKAKDICIEHKVQSVQTEVVEGDARNVMC

Query:  DSVEKFHASILVVGSHNYGVVKRMGLGSVSDFCAHHAHCSVMIVKRPPK
        +  EK  A ++V+G+     ++R+ LGSV++    +AHC V++VK+P K
Subjt:  DSVEKFHASILVVGSHNYGVVKRMGLGSVSDFCAHHAHCSVMIVKRPPK

Q8L4N1 Universal stress protein PHOS34 (e value: 1.5e-09; identity: 27.98%)
Query:  AEQVMVIGVDESEHSFYALNWTLQHFFGPNTTPYKL------VIVNA--------KPPPSSLLGIAGPAAVDVLPLLDADLKKIANRTVQKAKDICIEHK
        A + + + VD SE S +A+ W + H+  P      L      V+  A         PPP S     G          DA          +  K+    HK
Subjt:  AEQVMVIGVDESEHSFYALNWTLQHFFGPNTTPYKL------VIVNA--------KPPPSSLLGIAGPAAVDVLPLLDADLKKIANRTVQKAKDICIEHK

Query:  VQSVQTEVVEGDARNVMCDSVEKFHASILVVGSHNYGVVKRMG---LGSVSDFCAHHAHCSVMIVKRP
        +  V+    + D R  +C   E+ + S +++GS  +G  KR     LGSVSD+C HH  C V++V+ P
Subjt:  VQSVQTEVVEGDARNVMCDSVEKFHASILVVGSHNYGVVKRMG---LGSVSDFCAHHAHCSVMIVKRP

Q8LGG8 Universal stress protein A-like protein (e value: 8.3e-08; identity: 28.87%)
Query:  ALNWTLQHFFGPNTTPYKLVIVNAKPPPSSLLGIAGPAAVDVLPLLDADLKKIANRTVQKAKDI---------CIEHKVQSVQTEVVEGDARNVMCDSVE
        A  WTL+     NT+ +K+++++ +     ++   G   VD +     D + +  R   KAK +         C E  V   +  +  GD ++V+C  V+
Subjt:  ALNWTLQHFFGPNTTPYKLVIVNAKPPPSSLLGIAGPAAVDVLPLLDADLKKIANRTVQKAKDI---------CIEHKVQSVQTEVVEGDARNVMCDSVE

Query:  KFHASILVVGSHNYGVVKRMGLGSVSDFCAHHAHCSVMIVKR
        +     LVVGS   G  +++ +G+VS FC  HA C VM +KR
Subjt:  KFHASILVVGSHNYGVVKRMGLGSVSDFCAHHAHCSVMIVKR

Q8VYN9 Universal stress protein PHOS32 (e value: 2.6e-09; identity: 28.4%)
Query:  AEQVMVIGVDESEHSFYALNWTLQHFFGPNTTPYKLVIVNAKPPPSSLLG-----------IAGPAAVDVLPLLDADL---KKIANRTVQKAKDICIEHK
        A + + + VD SE S +A+ W + H+  P       V++    P S L G           I  P A       D D     K+A+   +  K++   +K
Subjt:  AEQVMVIGVDESEHSFYALNWTLQHFFGPNTTPYKLVIVNAKPPPSSLLG-----------IAGPAAVDVLPLLDADL---KKIANRTVQKAKDICIEHK

Query:  VQSVQTEVVEGDARNVMCDSVEKFHASILVVGSHNYGVVKRMG----LGSVSDFCAHHAHCSVMIVKRP
        +  V+    + D R  +C  +E+   S +++GS  +G  K+ G    LGSVSD+C HH  C V++V+ P
Subjt:  VQSVQTEVVEGDARNVMCDSVEKFHASILVVGSHNYGVVKRMG----LGSVSDFCAHHAHCSVMIVKRP

Arabidopsis top hits: e value, % identity, alignment
AT1G09740.1 Adenine nucleotide alpha hydrolases-like superfamily protein (e value: 5.5e-23; identity: 37.18%)
Query:  MVIGVDESEHSFYALNWTLQHF-FGPNTTPYKLVIVNAKPPPSSLLGIA-------GPAAVDV---LPLLDADLKKIANRTVQKAKDICIEHKVQSVQTE
        +V+ VD SE S  AL W L +     +++    V+++ +P PS   G++       GP+ ++V      ++   K+I +  ++ A  IC E  V +V+T+
Subjt:  MVIGVDESEHSFYALNWTLQHF-FGPNTTPYKLVIVNAKPPPSSLLGIA-------GPAAVDV---LPLLDADLKKIANRTVQKAKDICIEHKVQSVQTE

Query:  VVEGDARNVMCDSVEKFHASILVVGSHNYGVVKRMGLGSVSDFCAHHAHCSVMIVK
        VV GD +  +C++VE  HA +LV+GS  YG +KRM LGSVS++C +HAHC V+I+K
Subjt:  VVEGDARNVMCDSVEKFHASILVVGSHNYGVVKRMGLGSVSDFCAHHAHCSVMIVK

AT2G47710.1 Adenine nucleotide alpha hydrolases-like superfamily protein (e value: 7.1e-47; identity: 59.21%)
Query:  EQVMVIGVDESEHSFYALNWTLQHFFGPNTT--PYKLVIVNAKPPPSSLLGIAGPAAVDVLPLLDADLKKIANRTVQKAKDICIEHKVQSVQTEVVEGDA
        + VMV+GVD+SE S YAL WTL  FF P     P+KL IV+AKP   S +G+AGP   +V+P +DADLK  A + V+KAK IC    V     EV EGDA
Subjt:  EQVMVIGVDESEHSFYALNWTLQHFFGPNTT--PYKLVIVNAKPPPSSLLGIAGPAAVDVLPLLDADLKKIANRTVQKAKDICIEHKVQSVQTEVVEGDA

Query:  RNVMCDSVEKFHASILVVGSHNYGVVKRMGLGSVSDFCAHHAHCSVMIVKRP
        RN++C+ V+K HASILVVGSH YG +KR  LGS SD+CAHHAHCSVMIVK+P
Subjt:  RNVMCDSVEKFHASILVVGSHNYGVVKRMGLGSVSDFCAHHAHCSVMIVKRP

AT3G11930.1 Adenine nucleotide alpha hydrolases-like superfamily protein (e value: 1.1e-23; identity: 38.55%)
Query:  MVIGVDESEHSFYALNWTLQHFFGPNTTPY-------KLVIVNAKPPPSSLLGI-AGP--AAVDVLPLLDADLKKIANRT----VQKAKDICIEHKVQSV
        MV+ +DES+ SFYAL W + HF     T          L +++ + P +      AGP  A V     +   +KK    T    + +A  +C   ++++ 
Subjt:  MVIGVDESEHSFYALNWTLQHFFGPNTTPY-------KLVIVNAKPPPSSLLGI-AGP--AAVDVLPLLDADLKKIANRT----VQKAKDICIEHKVQSV

Query:  QTEVVEGDARNVMCDSVEKFHASILVVGSHNYGVVKRMGLGSVSDFCAHHAHCSVMIVKRPPKPMT
        +T V+EG+A+ ++C++VEK H  +LVVGS   G +KR  LGSVSD+CAHHA+C ++IVK PPK MT
Subjt:  QTEVVEGDARNVMCDSVEKFHASILVVGSHNYGVVKRMGLGSVSDFCAHHAHCSVMIVKRPPKPMT

AT3G11930.2 Adenine nucleotide alpha hydrolases-like superfamily protein (e value: 8.5e-24; identity: 37.72%)
Query:  MVIGVDESEHSFYALNWTLQHFFGPNTTPY-------KLVIVNAKPPPSSL----LGIAGPAAVDVLPLLDADLKKIANRT----VQKAKDICIEHKVQS
        MV+ +DES+ SFYAL W + HF     T          L +++ + P +       G  G  AV     +   +KK    T    + +A  +C   ++++
Subjt:  MVIGVDESEHSFYALNWTLQHFFGPNTTPY-------KLVIVNAKPPPSSL----LGIAGPAAVDVLPLLDADLKKIANRT----VQKAKDICIEHKVQS

Query:  VQTEVVEGDARNVMCDSVEKFHASILVVGSHNYGVVKRMGLGSVSDFCAHHAHCSVMIVKRPPKPMT
         +T V+EG+A+ ++C++VEK H  +LVVGS   G +KR  LGSVSD+CAHHA+C ++IVK PPK MT
Subjt:  VQTEVVEGDARNVMCDSVEKFHASILVVGSHNYGVVKRMGLGSVSDFCAHHAHCSVMIVKRPPKPMT

AT3G11930.4 Adenine nucleotide alpha hydrolases-like superfamily protein (e value: 6.5e-24; identity: 38.69%)
Query:  MVIGVDESEHSFYALNWTLQHFFGPNTTPY-------KLVIVNAKPPPSSLLGI-AGP----AAVDVLPLLDADLKKIANRT----VQKAKDICIEHKVQ
        MV+ +DES+ SFYAL W + HF     T          L +++ + P +      AGP    AAV     +   +KK    T    + +A  +C   +++
Subjt:  MVIGVDESEHSFYALNWTLQHFFGPNTTPY-------KLVIVNAKPPPSSLLGI-AGP----AAVDVLPLLDADLKKIANRT----VQKAKDICIEHKVQ

Query:  SVQTEVVEGDARNVMCDSVEKFHASILVVGSHNYGVVKRMGLGSVSDFCAHHAHCSVMIVKRPPKPMT
        + +T V+EG+A+ ++C++VEK H  +LVVGS   G +KR  LGSVSD+CAHHA+C ++IVK PPK MT
Subjt:  SVQTEVVEGDARNVMCDSVEKFHASILVVGSHNYGVVKRMGLGSVSDFCAHHAHCSVMIVKRPPKPMT


Sequences
CDS sequence
ATGGCGGAGCAGGTAATGGTGATTGGCGTCGATGAAAGTGAGCACAGCTTCTACGCCTTGAATTGGACGCTTCAACATTTCTTTGGCCCTAACACTACTCCTTACAAGCT
CGTTATTGTCAACGCCAAACCACCTCCCTCTAGCTTACTTGGAATTGCCGGCCCAGATTTCTTAAATTTCCGTGTCTGTTCGACAATGGCGGAGCAGGTAATGGTGATTG
GCGTCGATGAAAGTGAGCACAGCTTCTACGCCTTGAATTGGACGCTTCAACATTTCTTTGGCCCTAACACTACTCCTTACAAGCTCGTTATTGTCAACGCCAAACCACCT
CCCTCTAGCTTACTTGGAATTGCCGGCCCAGCAGCCGTGGATGTCTTGCCCCTGCTCGATGCAGATTTGAAGAAAATAGCTAATAGGACTGTACAAAAGGCGAAGGATAT
ATGCATCGAACACAAGGTTCAAAGCGTGCAGACTGAAGTTGTGGAGGGTGATGCCAGAAATGTAATGTGCGATTCCGTTGAAAAATTTCATGCATCCATTTTGGTTGTTG
GAAGTCACAATTATGGAGTAGTAAAAAGGATGGGACTTGGCAGTGTAAGTGACTTCTGTGCTCACCATGCCCATTGCTCTGTCATGATTGTGAAGAGGCCTCCCAAACCC
ATGACCTAA
mRNA sequence
ATGGCGGAGCAGGTAATGGTGATTGGCGTCGATGAAAGTGAGCACAGCTTCTACGCCTTGAATTGGACGCTTCAACATTTCTTTGGCCCTAACACTACTCCTTACAAGCT
CGTTATTGTCAACGCCAAACCACCTCCCTCTAGCTTACTTGGAATTGCCGGCCCAGATTTCTTAAATTTCCGTGTCTGTTCGACAATGGCGGAGCAGGTAATGGTGATTG
GCGTCGATGAAAGTGAGCACAGCTTCTACGCCTTGAATTGGACGCTTCAACATTTCTTTGGCCCTAACACTACTCCTTACAAGCTCGTTATTGTCAACGCCAAACCACCT
CCCTCTAGCTTACTTGGAATTGCCGGCCCAGCAGCCGTGGATGTCTTGCCCCTGCTCGATGCAGATTTGAAGAAAATAGCTAATAGGACTGTACAAAAGGCGAAGGATAT
ATGCATCGAACACAAGGTTCAAAGCGTGCAGACTGAAGTTGTGGAGGGTGATGCCAGAAATGTAATGTGCGATTCCGTTGAAAAATTTCATGCATCCATTTTGGTTGTTG
GAAGTCACAATTATGGAGTAGTAAAAAGGATGGGACTTGGCAGTGTAAGTGACTTCTGTGCTCACCATGCCCATTGCTCTGTCATGATTGTGAAGAGGCCTCCCAAACCC
ATGACCTAA
Protein sequence
MAEQVMVIGVDESEHSFYALNWTLQHFFGPNTTPYKLVIVNAKPPPSSLLGIAGPDFLNFRVCSTMAEQVMVIGVDESEHSFYALNWTLQHFFGPNTTPYKLVIVNAKPP
PSSLLGIAGPAAVDVLPLLDADLKKIANRTVQKAKDICIEHKVQSVQTEVVEGDARNVMCDSVEKFHASILVVGSHNYGVVKRMGLGSVSDFCAHHAHCSVMIVKRPPKP
MT