; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10012557 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10012557
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionAdenine nucleotide alpha hydrolases-like superfamily protein
Genome locationChr01:22348374..22350124
RNA-Seq ExpressionHG10012557
SyntenyHG10012557
Gene Ontology termsNA
InterPro domainsIPR006015 - Universal stress protein A family
IPR006016 - UspA
IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138083.1 universal stress protein PHOS34-like [Momordica charantia]6.7e-6577.22Show/hide
Query:  MAEQVMVIGVDESEHSFYALNWTLQHFFGP-NATPYKLVIVNAKPPPSSFLGVAGPAAVDVLSMLDADLKNIANRTVQKAKDVCIEHKVQSVQTEVVEGD
        MAEQV+VIGVDES+HSFYAL W LQHF  P + +PYKLVIV+AKPPPSS + V GP A + LS+LDADL  +ANRTVQKAKD+CIEHK+++VQ E+VEGD
Subjt:  MAEQVMVIGVDESEHSFYALNWTLQHFFGP-NATPYKLVIVNAKPPPSSFLGVAGPAAVDVLSMLDADLKNIANRTVQKAKDVCIEHKVQSVQTEVVEGD

Query:  ARNVMCDSVEKFHASILVVGSHNYGVVKRTGLGSVSDFCAHHAHCSVMIVKRPPKPMS
        ARNVMCD+VEKFHASILVVGSHNYGVVKR  LGSVS++CA+HAHCSVMIVKRPPKPM+
Subjt:  ARNVMCDSVEKFHASILVVGSHNYGVVKRTGLGSVSDFCAHHAHCSVMIVKRPPKPMS

XP_022937984.1 universal stress protein PHOS34-like [Cucurbita moschata]2.0e-6477.56Show/hide
Query:  MAEQVMVIGVDESEHSFYALNWTLQHFFGPNATPYKLVIVNAKPPPSSFLGVAGPAAVDVLSMLDADLKNIANRTVQKAKDVCIEHKVQSVQTEVVEGDA
        MA++V+VIGVDE+EHSFYAL W LQHFFGPN  P+ LVIV+AKPPPSS L ++GPAA +VL +LD DLK I NRTVQ+AK++CIEHKVQ+V TEVVEGDA
Subjt:  MAEQVMVIGVDESEHSFYALNWTLQHFFGPNATPYKLVIVNAKPPPSSFLGVAGPAAVDVLSMLDADLKNIANRTVQKAKDVCIEHKVQSVQTEVVEGDA

Query:  RNVMCDSVEKFHASILVVGSHNYGVVKRTGLGSVSDFCAHHAHCSVMIVKRPPKPM
        RNVMCD+VEK HAS+LVVGSHNYGVVK   LGSVS +CAHHAHCSVMIVKRPPKPM
Subjt:  RNVMCDSVEKFHASILVVGSHNYGVVKRTGLGSVSDFCAHHAHCSVMIVKRPPKPM

XP_022973665.1 universal stress protein PHOS34-like [Cucurbita maxima]5.7e-6476.92Show/hide
Query:  MAEQVMVIGVDESEHSFYALNWTLQHFFGPNATPYKLVIVNAKPPPSSFLGVAGPAAVDVLSMLDADLKNIANRTVQKAKDVCIEHKVQSVQTEVVEGDA
        MA++V+VIGVDE+EHSFYAL W LQHFFGPN  P+ LVIV+AKPPPSS L ++GP A +VL +LD DLK I NRTVQ+AK++CIEHKVQ+V TEVVEGDA
Subjt:  MAEQVMVIGVDESEHSFYALNWTLQHFFGPNATPYKLVIVNAKPPPSSFLGVAGPAAVDVLSMLDADLKNIANRTVQKAKDVCIEHKVQSVQTEVVEGDA

Query:  RNVMCDSVEKFHASILVVGSHNYGVVKRTGLGSVSDFCAHHAHCSVMIVKRPPKPM
        RNVMCD+VEK HAS+LVVGSHNYGVVK   LGSVS +CAHHAHCSVMIVKRPPKPM
Subjt:  RNVMCDSVEKFHASILVVGSHNYGVVKRTGLGSVSDFCAHHAHCSVMIVKRPPKPM

XP_038904675.1 universal stress protein PHOS34-like isoform X1 [Benincasa hispida]1.2e-7793.04Show/hide
Query:  MAEQVMVIGVDESEHSFYALNWTLQHFFGPNATPYKLVIVNAKPPPSSFLGVAGPAAVDVLSMLDADLKNIANRTVQKAKDVCIEHKVQSVQTEVVEGDA
        MAEQVMVIGVDESEHSFYALNWTLQHFFGPNATPYKLVIVNAKPPPSSFLGVAGP A+DVL MLDADLK IA+RTVQKAKD+CIEHKVQ VQTEV+EGDA
Subjt:  MAEQVMVIGVDESEHSFYALNWTLQHFFGPNATPYKLVIVNAKPPPSSFLGVAGPAAVDVLSMLDADLKNIANRTVQKAKDVCIEHKVQSVQTEVVEGDA

Query:  RNVMCDSVEKFHASILVVGSHNYGVVKRTGLGSVSDFCAHHAHCSVMIVKRPPKPMSL
        R VMCDSVEKFHASILVVGSHNYGVVKR GLGSVSDFCAHHAHCSVMIVKRPPKPMS+
Subjt:  RNVMCDSVEKFHASILVVGSHNYGVVKRTGLGSVSDFCAHHAHCSVMIVKRPPKPMSL

XP_038904684.1 universal stress protein PHOS34-like isoform X2 [Benincasa hispida]2.2e-7993.67Show/hide
Query:  MAEQVMVIGVDESEHSFYALNWTLQHFFGPNATPYKLVIVNAKPPPSSFLGVAGPAAVDVLSMLDADLKNIANRTVQKAKDVCIEHKVQSVQTEVVEGDA
        MAEQVMVIGVDESEHSFYALNWTLQHFFGPNATPYKLVIVNAKPPPSSFLGVAGPAA+DVL MLDADLK IA+RTVQKAKD+CIEHKVQ VQTEV+EGDA
Subjt:  MAEQVMVIGVDESEHSFYALNWTLQHFFGPNATPYKLVIVNAKPPPSSFLGVAGPAAVDVLSMLDADLKNIANRTVQKAKDVCIEHKVQSVQTEVVEGDA

Query:  RNVMCDSVEKFHASILVVGSHNYGVVKRTGLGSVSDFCAHHAHCSVMIVKRPPKPMSL
        R VMCDSVEKFHASILVVGSHNYGVVKR GLGSVSDFCAHHAHCSVMIVKRPPKPMS+
Subjt:  RNVMCDSVEKFHASILVVGSHNYGVVKRTGLGSVSDFCAHHAHCSVMIVKRPPKPMSL

TrEMBL top hitse value%identityAlignment
A0A0A0LAL5 Usp domain-containing protein5.7e-6277.07Show/hide
Query:  MAEQVMVIGVDESEHSFYALNWTLQHFFGPNATPYKLVIVNAKPPPSSFLGVAGPAAVDVLSMLDADLKNIANRTVQKAKDVCIEHKVQSVQTEVVEGDA
        MAEQVMVIGVDESEHSFYAL+WTLQHFF PNATPYKL IVNA   PS   G A   + +++  +DADLK + NRTVQ+AKD+CIEH VQSV+TEVVEGDA
Subjt:  MAEQVMVIGVDESEHSFYALNWTLQHFFGPNATPYKLVIVNAKPPPSSFLGVAGPAAVDVLSMLDADLKNIANRTVQKAKDVCIEHKVQSVQTEVVEGDA

Query:  RNVMCDSVEKFHASILVVGSHNYGVVKRTGLGSVSDFCAHHAHCSVMIVKRPPKPMS
        RNV+CDSVEKFHASIL+VGSH+YGVVK+ GLGSVSD+CA HAHCSVMIVKRPPKPM+
Subjt:  RNVMCDSVEKFHASILVVGSHNYGVVKRTGLGSVSDFCAHHAHCSVMIVKRPPKPMS

A0A5D3CXU5 Universal stress protein A-like protein1.8e-6077.07Show/hide
Query:  MAEQVMVIGVDESEHSFYALNWTLQHFFGPNATPYKLVIVNAKPPPSSFLGVAGPAAVDVLSMLDADLKNIANRTVQKAKDVCIEHKVQSVQTEVVEGDA
        MAE+VMVIGVDESEHSFYAL+WTLQHFFGPNATPYKL IVNA   PS+  GVA   + +++  +DADLK   N TVQ+AKD+CIEHKVQSV+TEVVEGDA
Subjt:  MAEQVMVIGVDESEHSFYALNWTLQHFFGPNATPYKLVIVNAKPPPSSFLGVAGPAAVDVLSMLDADLKNIANRTVQKAKDVCIEHKVQSVQTEVVEGDA

Query:  RNVMCDSVEKFHASILVVGSHNYGVVKRTGLGSVSDFCAHHAHCSVMIVKRPPKPMS
        R+V+CDSV+KFHASILVVGSH+YGVVK+ GLGSVSD+CA HAHC VMIVKRPPKPMS
Subjt:  RNVMCDSVEKFHASILVVGSHNYGVVKRTGLGSVSDFCAHHAHCSVMIVKRPPKPMS

A0A6J1CC29 universal stress protein PHOS34-like3.3e-6577.22Show/hide
Query:  MAEQVMVIGVDESEHSFYALNWTLQHFFGP-NATPYKLVIVNAKPPPSSFLGVAGPAAVDVLSMLDADLKNIANRTVQKAKDVCIEHKVQSVQTEVVEGD
        MAEQV+VIGVDES+HSFYAL W LQHF  P + +PYKLVIV+AKPPPSS + V GP A + LS+LDADL  +ANRTVQKAKD+CIEHK+++VQ E+VEGD
Subjt:  MAEQVMVIGVDESEHSFYALNWTLQHFFGP-NATPYKLVIVNAKPPPSSFLGVAGPAAVDVLSMLDADLKNIANRTVQKAKDVCIEHKVQSVQTEVVEGD

Query:  ARNVMCDSVEKFHASILVVGSHNYGVVKRTGLGSVSDFCAHHAHCSVMIVKRPPKPMS
        ARNVMCD+VEKFHASILVVGSHNYGVVKR  LGSVS++CA+HAHCSVMIVKRPPKPM+
Subjt:  ARNVMCDSVEKFHASILVVGSHNYGVVKRTGLGSVSDFCAHHAHCSVMIVKRPPKPMS

A0A6J1FCR8 universal stress protein PHOS34-like9.5e-6577.56Show/hide
Query:  MAEQVMVIGVDESEHSFYALNWTLQHFFGPNATPYKLVIVNAKPPPSSFLGVAGPAAVDVLSMLDADLKNIANRTVQKAKDVCIEHKVQSVQTEVVEGDA
        MA++V+VIGVDE+EHSFYAL W LQHFFGPN  P+ LVIV+AKPPPSS L ++GPAA +VL +LD DLK I NRTVQ+AK++CIEHKVQ+V TEVVEGDA
Subjt:  MAEQVMVIGVDESEHSFYALNWTLQHFFGPNATPYKLVIVNAKPPPSSFLGVAGPAAVDVLSMLDADLKNIANRTVQKAKDVCIEHKVQSVQTEVVEGDA

Query:  RNVMCDSVEKFHASILVVGSHNYGVVKRTGLGSVSDFCAHHAHCSVMIVKRPPKPM
        RNVMCD+VEK HAS+LVVGSHNYGVVK   LGSVS +CAHHAHCSVMIVKRPPKPM
Subjt:  RNVMCDSVEKFHASILVVGSHNYGVVKRTGLGSVSDFCAHHAHCSVMIVKRPPKPM

A0A6J1IBY1 universal stress protein PHOS34-like2.8e-6476.92Show/hide
Query:  MAEQVMVIGVDESEHSFYALNWTLQHFFGPNATPYKLVIVNAKPPPSSFLGVAGPAAVDVLSMLDADLKNIANRTVQKAKDVCIEHKVQSVQTEVVEGDA
        MA++V+VIGVDE+EHSFYAL W LQHFFGPN  P+ LVIV+AKPPPSS L ++GP A +VL +LD DLK I NRTVQ+AK++CIEHKVQ+V TEVVEGDA
Subjt:  MAEQVMVIGVDESEHSFYALNWTLQHFFGPNATPYKLVIVNAKPPPSSFLGVAGPAAVDVLSMLDADLKNIANRTVQKAKDVCIEHKVQSVQTEVVEGDA

Query:  RNVMCDSVEKFHASILVVGSHNYGVVKRTGLGSVSDFCAHHAHCSVMIVKRPPKPM
        RNVMCD+VEK HAS+LVVGSHNYGVVK   LGSVS +CAHHAHCSVMIVKRPPKPM
Subjt:  RNVMCDSVEKFHASILVVGSHNYGVVKRTGLGSVSDFCAHHAHCSVMIVKRPPKPM

SwissProt top hitse value%identityAlignment
Q57951 Universal stress protein MJ05316.0e-0828.86Show/hide
Query:  MVIGVDESEHSFYALNWTLQHFFGPNATPYKLVIVNAKPPPSSFLGVAGPAAVDVLSMLDADLKNIANRTVQKAKDVCIEHKVQSVQTEVVEGDARNVMC
        +VI  D S+ S  A    +      +A  Y + +V+  P    F+G+    + +++S L   LK      ++K K +  E  V+ + TE++EG   N + 
Subjt:  MVIGVDESEHSFYALNWTLQHFFGPNATPYKLVIVNAKPPPSSFLGVAGPAAVDVLSMLDADLKNIANRTVQKAKDVCIEHKVQSVQTEVVEGDARNVMC

Query:  DSVEKFHASILVVGSHNYGVVKRTGLGSVSDFCAHHAHCSVMIVKRPPK
        +  EK  A ++V+G+     ++R  LGSV++    +AHC V++VK+P K
Subjt:  DSVEKFHASILVVGSHNYGVVKRTGLGSVSDFCAHHAHCSVMIVKRPPK

Q8L4N1 Universal stress protein PHOS342.9e-1027.91Show/hide
Query:  AEQVMVIGVDESEHSFYALNWTLQHFFGP-------NATPYKLVI--------VNAKPPPSSFLGVAGPAAVDVLSMLDADL---KNIANRTVQKAKDVC
        A + + + VD SE S +A+ W + H+  P       + +P  ++         +   PPPS+      P A    S  D D      +A+   +  K+  
Subjt:  AEQVMVIGVDESEHSFYALNWTLQHFFGP-------NATPYKLVI--------VNAKPPPSSFLGVAGPAAVDVLSMLDADL---KNIANRTVQKAKDVC

Query:  IEHKVQSVQTEVVEGDARNVMCDSVEKFHASILVVGSHNYGVVKRTG---LGSVSDFCAHHAHCSVMIVKRP
          HK+  V+    + D R  +C   E+ + S +++GS  +G  KR     LGSVSD+C HH  C V++V+ P
Subjt:  IEHKVQSVQTEVVEGDARNVMCDSVEKFHASILVVGSHNYGVVKRTG---LGSVSDFCAHHAHCSVMIVKRP

Q8LGG8 Universal stress protein A-like protein1.7e-0728.17Show/hide
Query:  ALNWTLQHFFGPNATPYKLVIVNAKPPPSSFLGVAGPAAVDVLSMLDADLKNIANRTVQKAKDV---------CIEHKVQSVQTEVVEGDARNVMCDSVE
        A  WTL+     N + +K+++++ +      +   G   VD +     D +++  R   KAK +         C E  V   +  +  GD ++V+C  V+
Subjt:  ALNWTLQHFFGPNATPYKLVIVNAKPPPSSFLGVAGPAAVDVLSMLDADLKNIANRTVQKAKDV---------CIEHKVQSVQTEVVEGDARNVMCDSVE

Query:  KFHASILVVGSHNYGVVKRTGLGSVSDFCAHHAHCSVMIVKR
        +     LVVGS   G  ++  +G+VS FC  HA C VM +KR
Subjt:  KFHASILVVGSHNYGVVKRTGLGSVSDFCAHHAHCSVMIVKR

Q8VYN9 Universal stress protein PHOS322.4e-0925.75Show/hide
Query:  AEQVMVIGVDESEHSFYALNWTLQHFFGPNATPYKLVIVNAKPPPSSFLGVAGPAAVDV------------LSMLDADLKNIANRTVQKAKDVCIEHKVQ
        A + + + VD SE S +A+ W + H+  P      +V+++  P    F    GP  +                  DA          +  K++   +K+ 
Subjt:  AEQVMVIGVDESEHSFYALNWTLQHFFGPNATPYKLVIVNAKPPPSSFLGVAGPAAVDV------------LSMLDADLKNIANRTVQKAKDVCIEHKVQ

Query:  SVQTEVVEGDARNVMCDSVEKFHASILVVGSHNYGVVKRTG----LGSVSDFCAHHAHCSVMIVKRP
         V+    + D R  +C  +E+   S +++GS  +G  K+ G    LGSVSD+C HH  C V++V+ P
Subjt:  SVQTEVVEGDARNVMCDSVEKFHASILVVGSHNYGVVKRTG----LGSVSDFCAHHAHCSVMIVKRP

Arabidopsis top hitse value%identityAlignment
AT1G09740.1 Adenine nucleotide alpha hydrolases-like superfamily protein2.0e-2236.54Show/hide
Query:  MVIGVDESEHSFYALNWTLQHF-FGPNATPYKLVIVNAKPPPSSFLGVA-------GPAAVDV---LSMLDADLKNIANRTVQKAKDVCIEHKVQSVQTE
        +V+ VD SE S  AL W L +     +++    V+++ +P PS   GV+       GP+ ++V    + ++   K I +  ++ A  +C E  V +V+T+
Subjt:  MVIGVDESEHSFYALNWTLQHF-FGPNATPYKLVIVNAKPPPSSFLGVA-------GPAAVDV---LSMLDADLKNIANRTVQKAKDVCIEHKVQSVQTE

Query:  VVEGDARNVMCDSVEKFHASILVVGSHNYGVVKRTGLGSVSDFCAHHAHCSVMIVK
        VV GD +  +C++VE  HA +LV+GS  YG +KR  LGSVS++C +HAHC V+I+K
Subjt:  VVEGDARNVMCDSVEKFHASILVVGSHNYGVVKRTGLGSVSDFCAHHAHCSVMIVK

AT2G47710.1 Adenine nucleotide alpha hydrolases-like superfamily protein1.5e-4658.55Show/hide
Query:  EQVMVIGVDESEHSFYALNWTLQHFFGPNAT--PYKLVIVNAKPPPSSFLGVAGPAAVDVLSMLDADLKNIANRTVQKAKDVCIEHKVQSVQTEVVEGDA
        + VMV+GVD+SE S YAL WTL  FF P A   P+KL IV+AKP   S +G+AGP   +V+  +DADLK+ A + V+KAK +C    V     EV EGDA
Subjt:  EQVMVIGVDESEHSFYALNWTLQHFFGPNAT--PYKLVIVNAKPPPSSFLGVAGPAAVDVLSMLDADLKNIANRTVQKAKDVCIEHKVQSVQTEVVEGDA

Query:  RNVMCDSVEKFHASILVVGSHNYGVVKRTGLGSVSDFCAHHAHCSVMIVKRP
        RN++C+ V+K HASILVVGSH YG +KR  LGS SD+CAHHAHCSVMIVK+P
Subjt:  RNVMCDSVEKFHASILVVGSHNYGVVKRTGLGSVSDFCAHHAHCSVMIVKRP

AT3G11930.1 Adenine nucleotide alpha hydrolases-like superfamily protein1.2e-2438.55Show/hide
Query:  MVIGVDESEHSFYALNWTLQHF-------FGPNATPYKLVIVNAKPPPSSFLGV-AGP--AAVDVLSMLDADLKNIANRT----VQKAKDVCIEHKVQSV
        MV+ +DES+ SFYAL W + HF           A    L +++ + P + F    AGP  A V   S +   +K     T    + +A  +C   ++++ 
Subjt:  MVIGVDESEHSFYALNWTLQHF-------FGPNATPYKLVIVNAKPPPSSFLGV-AGP--AAVDVLSMLDADLKNIANRT----VQKAKDVCIEHKVQSV

Query:  QTEVVEGDARNVMCDSVEKFHASILVVGSHNYGVVKRTGLGSVSDFCAHHAHCSVMIVKRPPKPMS
        +T V+EG+A+ ++C++VEK H  +LVVGS   G +KR  LGSVSD+CAHHA+C ++IVK PPK M+
Subjt:  QTEVVEGDARNVMCDSVEKFHASILVVGSHNYGVVKRTGLGSVSDFCAHHAHCSVMIVKRPPKPMS

AT3G11930.2 Adenine nucleotide alpha hydrolases-like superfamily protein5.6e-2537.72Show/hide
Query:  MVIGVDESEHSFYALNWTLQHF-------FGPNATPYKLVIVNAKPPPSSF----LGVAGPAAVDVLSMLDADLKNIANRT----VQKAKDVCIEHKVQS
        MV+ +DES+ SFYAL W + HF           A    L +++ + P + F     G  G  AV   S +   +K     T    + +A  +C   ++++
Subjt:  MVIGVDESEHSFYALNWTLQHF-------FGPNATPYKLVIVNAKPPPSSF----LGVAGPAAVDVLSMLDADLKNIANRT----VQKAKDVCIEHKVQS

Query:  VQTEVVEGDARNVMCDSVEKFHASILVVGSHNYGVVKRTGLGSVSDFCAHHAHCSVMIVKRPPKPMS
         +T V+EG+A+ ++C++VEK H  +LVVGS   G +KR  LGSVSD+CAHHA+C ++IVK PPK M+
Subjt:  VQTEVVEGDARNVMCDSVEKFHASILVVGSHNYGVVKRTGLGSVSDFCAHHAHCSVMIVKRPPKPMS

AT3G11930.4 Adenine nucleotide alpha hydrolases-like superfamily protein7.3e-2538.69Show/hide
Query:  MVIGVDESEHSFYALNWTLQHF-------FGPNATPYKLVIVNAKPPPSSFLGV-AGP----AAVDVLSMLDADLKNIANRT----VQKAKDVCIEHKVQ
        MV+ +DES+ SFYAL W + HF           A    L +++ + P + F    AGP    AAV   S +   +K     T    + +A  +C   +++
Subjt:  MVIGVDESEHSFYALNWTLQHF-------FGPNATPYKLVIVNAKPPPSSFLGV-AGP----AAVDVLSMLDADLKNIANRT----VQKAKDVCIEHKVQ

Query:  SVQTEVVEGDARNVMCDSVEKFHASILVVGSHNYGVVKRTGLGSVSDFCAHHAHCSVMIVKRPPKPMS
        + +T V+EG+A+ ++C++VEK H  +LVVGS   G +KR  LGSVSD+CAHHA+C ++IVK PPK M+
Subjt:  SVQTEVVEGDARNVMCDSVEKFHASILVVGSHNYGVVKRTGLGSVSDFCAHHAHCSVMIVKRPPKPMS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGAGCAGGTGATGGTGATTGGCGTCGACGAAAGTGAGCACAGCTTCTACGCGTTGAATTGGACGCTCCAACATTTCTTTGGCCCTAACGCTACTCCTTACAAGCT
CGTTATTGTCAACGCTAAACCACCTCCCTCTAGCTTTCTTGGAGTCGCCGGCCCAGCAGCCGTGGATGTCTTGTCCATGCTCGATGCAGATTTGAAGAACATAGCTAATA
GGACTGTCCAAAAAGCTAAGGATGTATGCATAGAACACAAGGTTCAAAGCGTGCAGACTGAAGTTGTGGAGGGTGATGCCAGAAATGTAATGTGCGATTCTGTAGAAAAA
TTCCATGCATCCATTTTGGTTGTTGGAAGTCACAATTATGGAGTAGTAAAAAGGACGGGACTGGGCAGTGTAAGTGATTTCTGTGCTCACCATGCCCATTGCTCTGTCAT
GATTGTGAAGAGGCCACCCAAGCCCATGAGCTTGGAGGCTCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGAGCAGGTGATGGTGATTGGCGTCGACGAAAGTGAGCACAGCTTCTACGCGTTGAATTGGACGCTCCAACATTTCTTTGGCCCTAACGCTACTCCTTACAAGCT
CGTTATTGTCAACGCTAAACCACCTCCCTCTAGCTTTCTTGGAGTCGCCGGCCCAGCAGCCGTGGATGTCTTGTCCATGCTCGATGCAGATTTGAAGAACATAGCTAATA
GGACTGTCCAAAAAGCTAAGGATGTATGCATAGAACACAAGGTTCAAAGCGTGCAGACTGAAGTTGTGGAGGGTGATGCCAGAAATGTAATGTGCGATTCTGTAGAAAAA
TTCCATGCATCCATTTTGGTTGTTGGAAGTCACAATTATGGAGTAGTAAAAAGGACGGGACTGGGCAGTGTAAGTGATTTCTGTGCTCACCATGCCCATTGCTCTGTCAT
GATTGTGAAGAGGCCACCCAAGCCCATGAGCTTGGAGGCTCAATGA
Protein sequenceShow/hide protein sequence
MAEQVMVIGVDESEHSFYALNWTLQHFFGPNATPYKLVIVNAKPPPSSFLGVAGPAAVDVLSMLDADLKNIANRTVQKAKDVCIEHKVQSVQTEVVEGDARNVMCDSVEK
FHASILVVGSHNYGVVKRTGLGSVSDFCAHHAHCSVMIVKRPPKPMSLEAQ