; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0018741 (gene) of Snake gourd v1 genome

Gene IDTan0018741
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionAdenine nucleotide alpha hydrolases-like superfamily protein
Genome locationLG10:8957094..8959018
RNA-Seq ExpressionTan0018741
SyntenyTan0018741
Gene Ontology termsGO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR006015 - Universal stress protein A family
IPR006016 - UspA
IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138083.1 universal stress protein PHOS34-like [Momordica charantia]2.1e-6378.48Show/hide
Query:  MAEQVMVIGVDESEHSFYALKWMLQHFFC-TNGTPYKLVIVHAKPPPSSVLANAIPGAENILPMLDADLKKIASRTVEKAKEVCTEHKVENVQTEIVEGD
        MAEQV+VIGVDES+HSFYAL+WMLQHF    + +PYKLVIVHAKPPPSSV+A   PGA N L +LDADL K+A+RTV+KAK++C EHK+ENVQ EIVEGD
Subjt:  MAEQVMVIGVDESEHSFYALKWMLQHFFC-TNGTPYKLVIVHAKPPPSSVLANAIPGAENILPMLDADLKKIASRTVEKAKEVCTEHKVENVQTEIVEGD

Query:  ARNVMCDAVEKHQASLLVVGSHNYGVVKRMDLGSVSYYCAHHAHCSVMIVKRPPKPKN
        ARNVMCDAVEK  AS+LVVGSHNYGVVKRM LGSVS YCA+HAHCSVMIVKRPPKP N
Subjt:  ARNVMCDAVEKHQASLLVVGSHNYGVVKRMDLGSVSYYCAHHAHCSVMIVKRPPKPKN

XP_022937984.1 universal stress protein PHOS34-like [Cucurbita moschata]1.0e-6580Show/hide
Query:  MAEQVMVIGVDESEHSFYALKWMLQHFFCTNGTPYKLVIVHAKPPPSSVLANAIPGAENILPMLDADLKKIASRTVEKAKEVCTEHKVENVQTEIVEGDA
        MA++V+VIGVDE+EHSFYAL+WMLQHFF  N  P+ LVIVHAKPPPSSVLA + P AEN+LP+LD DLKKI +RTV++AKE+C EHKV+NV TE+VEGDA
Subjt:  MAEQVMVIGVDESEHSFYALKWMLQHFFCTNGTPYKLVIVHAKPPPSSVLANAIPGAENILPMLDADLKKIASRTVEKAKEVCTEHKVENVQTEIVEGDA

Query:  RNVMCDAVEKHQASLLVVGSHNYGVVKRMDLGSVSYYCAHHAHCSVMIVKRPPKP
        RNVMCDAVEKH ASLLVVGSHNYGVVK M+LGSVS YCAHHAHCSVMIVKRPPKP
Subjt:  RNVMCDAVEKHQASLLVVGSHNYGVVKRMDLGSVSYYCAHHAHCSVMIVKRPPKP

XP_022973665.1 universal stress protein PHOS34-like [Cucurbita maxima]2.0e-6680.65Show/hide
Query:  MAEQVMVIGVDESEHSFYALKWMLQHFFCTNGTPYKLVIVHAKPPPSSVLANAIPGAENILPMLDADLKKIASRTVEKAKEVCTEHKVENVQTEIVEGDA
        MA++V+VIGVDE+EHSFYAL+WMLQHFF  N  P+ LVIVHAKPPPSSVLA + PGAEN+LP+LD DLKKI +RTV++AKE+C EHKV+NV TE+VEGDA
Subjt:  MAEQVMVIGVDESEHSFYALKWMLQHFFCTNGTPYKLVIVHAKPPPSSVLANAIPGAENILPMLDADLKKIASRTVEKAKEVCTEHKVENVQTEIVEGDA

Query:  RNVMCDAVEKHQASLLVVGSHNYGVVKRMDLGSVSYYCAHHAHCSVMIVKRPPKP
        RNVMCDAVEKH ASLLVVGSHNYGVVK M+LGSVS YCAHHAHCSVMIVKRPPKP
Subjt:  RNVMCDAVEKHQASLLVVGSHNYGVVKRMDLGSVSYYCAHHAHCSVMIVKRPPKP

XP_023539948.1 universal stress protein PHOS34-like [Cucurbita pepo subsp. pepo]8.6e-6579.35Show/hide
Query:  MAEQVMVIGVDESEHSFYALKWMLQHFFCTNGTPYKLVIVHAKPPPSSVLANAIPGAENILPMLDADLKKIASRTVEKAKEVCTEHKVENVQTEIVEGDA
        MA++V+VIGVDE+EHSFYAL+WMLQHFF  N   + LVIVHAKPPPSSVLA + P AEN+LP+LD DLKKI +RTV++AKE+C EHKV+NV TE+VEGDA
Subjt:  MAEQVMVIGVDESEHSFYALKWMLQHFFCTNGTPYKLVIVHAKPPPSSVLANAIPGAENILPMLDADLKKIASRTVEKAKEVCTEHKVENVQTEIVEGDA

Query:  RNVMCDAVEKHQASLLVVGSHNYGVVKRMDLGSVSYYCAHHAHCSVMIVKRPPKP
        RNVMCDAVEKH ASLLVVGSHNYGVVK M+LGSVS YCAHHAHCSVMIVKRPPKP
Subjt:  RNVMCDAVEKHQASLLVVGSHNYGVVKRMDLGSVSYYCAHHAHCSVMIVKRPPKP

XP_038904684.1 universal stress protein PHOS34-like isoform X2 [Benincasa hispida]5.6e-6480Show/hide
Query:  MAEQVMVIGVDESEHSFYALKWMLQHFFCTNGTPYKLVIVHAKPPPSSVLANAIPGAENILPMLDADLKKIASRTVEKAKEVCTEHKVENVQTEIVEGDA
        MAEQVMVIGVDESEHSFYAL W LQHFF  N TPYKLVIV+AKPPPSS L  A P A ++LPMLDADLKKIA RTV+KAK++C EHKV++VQTE++EGDA
Subjt:  MAEQVMVIGVDESEHSFYALKWMLQHFFCTNGTPYKLVIVHAKPPPSSVLANAIPGAENILPMLDADLKKIASRTVEKAKEVCTEHKVENVQTEIVEGDA

Query:  RNVMCDAVEKHQASLLVVGSHNYGVVKRMDLGSVSYYCAHHAHCSVMIVKRPPKP
        R VMCD+VEK  AS+LVVGSHNYGVVKRM LGSVS +CAHHAHCSVMIVKRPPKP
Subjt:  RNVMCDAVEKHQASLLVVGSHNYGVVKRMDLGSVSYYCAHHAHCSVMIVKRPPKP

TrEMBL top hitse value%identityAlignment
A0A0A0LAL5 Usp domain-containing protein1.5e-5469.68Show/hide
Query:  MAEQVMVIGVDESEHSFYALKWMLQHFFCTNGTPYKLVIVHAKPPPSSVLANAIPGAENILPMLDADLKKIASRTVEKAKEVCTEHKVENVQTEIVEGDA
        MAEQVMVIGVDESEHSFYAL W LQHFF  N TPYKL IV+A   PS     A  G+ N++P +DADLKK+ +RTV++AK++C EH V++V+TE+VEGDA
Subjt:  MAEQVMVIGVDESEHSFYALKWMLQHFFCTNGTPYKLVIVHAKPPPSSVLANAIPGAENILPMLDADLKKIASRTVEKAKEVCTEHKVENVQTEIVEGDA

Query:  RNVMCDAVEKHQASLLVVGSHNYGVVKRMDLGSVSYYCAHHAHCSVMIVKRPPKP
        RNV+CD+VEK  AS+L+VGSH+YGVVK+M LGSVS YCA HAHCSVMIVKRPPKP
Subjt:  RNVMCDAVEKHQASLLVVGSHNYGVVKRMDLGSVSYYCAHHAHCSVMIVKRPPKP

A0A5D3CXU5 Universal stress protein A-like protein2.6e-5168.39Show/hide
Query:  MAEQVMVIGVDESEHSFYALKWMLQHFFCTNGTPYKLVIVHAKPPPSSVLANAIPGAENILPMLDADLKKIASRTVEKAKEVCTEHKVENVQTEIVEGDA
        MAE+VMVIGVDESEHSFYAL W LQHFF  N TPYKL IV+A   PS+    A  G+ N++P +DADLKK  S TV++AK++C EHKV++V+TE+VEGDA
Subjt:  MAEQVMVIGVDESEHSFYALKWMLQHFFCTNGTPYKLVIVHAKPPPSSVLANAIPGAENILPMLDADLKKIASRTVEKAKEVCTEHKVENVQTEIVEGDA

Query:  RNVMCDAVEKHQASLLVVGSHNYGVVKRMDLGSVSYYCAHHAHCSVMIVKRPPKP
        R+V+CD+V+K  AS+LVVGSH+YGVVK+M LGSVS YCA HAHC VMIVKRPPKP
Subjt:  RNVMCDAVEKHQASLLVVGSHNYGVVKRMDLGSVSYYCAHHAHCSVMIVKRPPKP

A0A6J1CC29 universal stress protein PHOS34-like1.0e-6378.48Show/hide
Query:  MAEQVMVIGVDESEHSFYALKWMLQHFFC-TNGTPYKLVIVHAKPPPSSVLANAIPGAENILPMLDADLKKIASRTVEKAKEVCTEHKVENVQTEIVEGD
        MAEQV+VIGVDES+HSFYAL+WMLQHF    + +PYKLVIVHAKPPPSSV+A   PGA N L +LDADL K+A+RTV+KAK++C EHK+ENVQ EIVEGD
Subjt:  MAEQVMVIGVDESEHSFYALKWMLQHFFC-TNGTPYKLVIVHAKPPPSSVLANAIPGAENILPMLDADLKKIASRTVEKAKEVCTEHKVENVQTEIVEGD

Query:  ARNVMCDAVEKHQASLLVVGSHNYGVVKRMDLGSVSYYCAHHAHCSVMIVKRPPKPKN
        ARNVMCDAVEK  AS+LVVGSHNYGVVKRM LGSVS YCA+HAHCSVMIVKRPPKP N
Subjt:  ARNVMCDAVEKHQASLLVVGSHNYGVVKRMDLGSVSYYCAHHAHCSVMIVKRPPKPKN

A0A6J1FCR8 universal stress protein PHOS34-like4.9e-6680Show/hide
Query:  MAEQVMVIGVDESEHSFYALKWMLQHFFCTNGTPYKLVIVHAKPPPSSVLANAIPGAENILPMLDADLKKIASRTVEKAKEVCTEHKVENVQTEIVEGDA
        MA++V+VIGVDE+EHSFYAL+WMLQHFF  N  P+ LVIVHAKPPPSSVLA + P AEN+LP+LD DLKKI +RTV++AKE+C EHKV+NV TE+VEGDA
Subjt:  MAEQVMVIGVDESEHSFYALKWMLQHFFCTNGTPYKLVIVHAKPPPSSVLANAIPGAENILPMLDADLKKIASRTVEKAKEVCTEHKVENVQTEIVEGDA

Query:  RNVMCDAVEKHQASLLVVGSHNYGVVKRMDLGSVSYYCAHHAHCSVMIVKRPPKP
        RNVMCDAVEKH ASLLVVGSHNYGVVK M+LGSVS YCAHHAHCSVMIVKRPPKP
Subjt:  RNVMCDAVEKHQASLLVVGSHNYGVVKRMDLGSVSYYCAHHAHCSVMIVKRPPKP

A0A6J1IBY1 universal stress protein PHOS34-like9.9e-6780.65Show/hide
Query:  MAEQVMVIGVDESEHSFYALKWMLQHFFCTNGTPYKLVIVHAKPPPSSVLANAIPGAENILPMLDADLKKIASRTVEKAKEVCTEHKVENVQTEIVEGDA
        MA++V+VIGVDE+EHSFYAL+WMLQHFF  N  P+ LVIVHAKPPPSSVLA + PGAEN+LP+LD DLKKI +RTV++AKE+C EHKV+NV TE+VEGDA
Subjt:  MAEQVMVIGVDESEHSFYALKWMLQHFFCTNGTPYKLVIVHAKPPPSSVLANAIPGAENILPMLDADLKKIASRTVEKAKEVCTEHKVENVQTEIVEGDA

Query:  RNVMCDAVEKHQASLLVVGSHNYGVVKRMDLGSVSYYCAHHAHCSVMIVKRPPKP
        RNVMCDAVEKH ASLLVVGSHNYGVVK M+LGSVS YCAHHAHCSVMIVKRPPKP
Subjt:  RNVMCDAVEKHQASLLVVGSHNYGVVKRMDLGSVSYYCAHHAHCSVMIVKRPPKP

SwissProt top hitse value%identityAlignment
Q57951 Universal stress protein MJ05318.5e-0728.86Show/hide
Query:  MVIGVDESEHSFYALKWMLQHFFCTNGTPYKLVIVHAKPPPSSVLANAIPGAENILPMLDADLKKIASRTVEKAKEVCTEHKVENVQTEIVEGDARNVMC
        +VI  D S+ S  A K  +      +   Y + +V   P         +P AE    ++   LK+     ++K K++  E  V+ + TE++EG   N + 
Subjt:  MVIGVDESEHSFYALKWMLQHFFCTNGTPYKLVIVHAKPPPSSVLANAIPGAENILPMLDADLKKIASRTVEKAKEVCTEHKVENVQTEIVEGDARNVMC

Query:  DAVEKHQASLLVVGSHNYGVVKRMDLGSVSYYCAHHAHCSVMIVKRPPK
        +  EK +A L+V+G+     ++R+ LGSV+     +AHC V++VK+P K
Subjt:  DAVEKHQASLLVVGSHNYGVVKRMDLGSVSYYCAHHAHCSVMIVKRPPK

Q8L4N1 Universal stress protein PHOS346.9e-0929.31Show/hide
Query:  AEQVMVIGVDESEHSFYALKWMLQHFFCTNGTPYKLVIVHAK-----------------PPPSSVLANAIPGAENILPMLDADL---KKIASRTVEKAKE
        A + + + VD SE S +A++W + H+         +VI+H                   PPP S   +  PGA+      D D     K+A    +  KE
Subjt:  AEQVMVIGVDESEHSFYALKWMLQHFFCTNGTPYKLVIVHAK-----------------PPPSSVLANAIPGAENILPMLDADL---KKIASRTVEKAKE

Query:  VCTEHKVENVQTEIVEGDARNVMCDAVEKHQASLLVVGSHNYGVVKRMD---LGSVSYYCAHHAHCSVMIVKRP
            HK+  V+    + D R  +C   E+   S +++GS  +G  KR     LGSVS YC HH  C V++V+ P
Subjt:  VCTEHKVENVQTEIVEGDARNVMCDAVEKHQASLLVVGSHNYGVVKRMD---LGSVSYYCAHHAHCSVMIVKRP

Q8LGG8 Universal stress protein A-like protein4.2e-0626.47Show/hide
Query:  ALKWMLQHFFCTNGTPYKLVIVHAKPPPS---SVLANAIPGAENILPMLDADLKKIASRTVEKAKEVCTEHKVENVQTEIVEGDARNVMCDAVEKHQASL
        A +W L+    +N + +K++++H +         + +     E+   M  ++  K     +E     C E  V   +  I  GD ++V+C  V++ +   
Subjt:  ALKWMLQHFFCTNGTPYKLVIVHAKPPPS---SVLANAIPGAENILPMLDADLKKIASRTVEKAKEVCTEHKVENVQTEIVEGDARNVMCDAVEKHQASL

Query:  LVVGSHNYGVVKRMDLGSVSYYCAHHAHCSVMIVKR
        LVVGS   G  +++ +G+VS +C  HA C VM +KR
Subjt:  LVVGSHNYGVVKRMDLGSVSYYCAHHAHCSVMIVKR

Q8VYN9 Universal stress protein PHOS321.7e-0727.81Show/hide
Query:  AEQVMVIGVDESEHSFYALKWMLQHFFCTNGTPYKLVIVHAKPPPSSVLANAIPGA---------ENILPM-----LDADLKKIASRTVEKAKEVCTEHK
        A + + + VD SE S +A++W + H+         +V++H    P+SVL  A  G           N  P       DA      +   +  KE+   +K
Subjt:  AEQVMVIGVDESEHSFYALKWMLQHFFCTNGTPYKLVIVHAKPPPSSVLANAIPGA---------ENILPM-----LDADLKKIASRTVEKAKEVCTEHK

Query:  VENVQTEIVEGDARNVMCDAVEKHQASLLVVGSHNYGVVKRM----DLGSVSYYCAHHAHCSVMIVKRP
        +  V+    + D R  +C  +E+   S +++GS  +G  K+      LGSVS YC HH  C V++V+ P
Subjt:  VENVQTEIVEGDARNVMCDAVEKHQASLLVVGSHNYGVVKRM----DLGSVSYYCAHHAHCSVMIVKRP

Arabidopsis top hitse value%identityAlignment
AT1G09740.1 Adenine nucleotide alpha hydrolases-like superfamily protein1.5e-2238.85Show/hide
Query:  MVIGVDESEHSFYALKWMLQHF-FCTNGTPYKLVIVHAKPPPSSVLANAIPG-----------AENILPMLDADLKKIASRTVEKAKEVCTEHKVENVQT
        +V+ VD SE S  AL+W L +    ++ +    V++H +P P SV A   PG                  ++   K+I    +E A ++C E  V NV+T
Subjt:  MVIGVDESEHSFYALKWMLQHF-FCTNGTPYKLVIVHAKPPPSSVLANAIPG-----------AENILPMLDADLKKIASRTVEKAKEVCTEHKVENVQT

Query:  EIVEGDARNVMCDAVEKHQASLLVVGSHNYGVVKRMDLGSVSYYCAHHAHCSVMIVK
        ++V GD +  +C+AVE   A LLV+GS  YG +KRM LGSVS YC +HAHC V+I+K
Subjt:  EIVEGDARNVMCDAVEKHQASLLVVGSHNYGVVKRMDLGSVSYYCAHHAHCSVMIVK

AT2G47710.1 Adenine nucleotide alpha hydrolases-like superfamily protein3.1e-4455.92Show/hide
Query:  EQVMVIGVDESEHSFYALKWMLQHFFC--TNGTPYKLVIVHAKPPPSSVLANAIPGAENILPMLDADLKKIASRTVEKAKEVCTEHKVENVQTEIVEGDA
        + VMV+GVD+SE S YAL+W L  FF       P+KL IVHAKP   S +  A PG   ++P +DADLK  A++ VEKAK +C    V     E+ EGDA
Subjt:  EQVMVIGVDESEHSFYALKWMLQHFFC--TNGTPYKLVIVHAKPPPSSVLANAIPGAENILPMLDADLKKIASRTVEKAKEVCTEHKVENVQTEIVEGDA

Query:  RNVMCDAVEKHQASLLVVGSHNYGVVKRMDLGSVSYYCAHHAHCSVMIVKRP
        RN++C+ V+KH AS+LVVGSH YG +KR  LGS S YCAHHAHCSVMIVK+P
Subjt:  RNVMCDAVEKHQASLLVVGSHNYGVVKRMDLGSVSYYCAHHAHCSVMIVKRP

AT3G11930.1 Adenine nucleotide alpha hydrolases-like superfamily protein3.3e-2233.74Show/hide
Query:  MVIGVDESEHSFYALKWMLQHF-------FCTNGTPYKLVIVHAKPP-------PSSVLANAIPGAENILPMLDADLKKIASRTVEKAKEVCTEHKVENV
        MV+ +DES+ SFYAL+W++ HF                L ++H + P       P+      +  + +++  +    ++ ++  + +A ++C   ++   
Subjt:  MVIGVDESEHSFYALKWMLQHF-------FCTNGTPYKLVIVHAKPP-------PSSVLANAIPGAENILPMLDADLKKIASRTVEKAKEVCTEHKVENV

Query:  QTEIVEGDARNVMCDAVEKHQASLLVVGSHNYGVVKRMDLGSVSYYCAHHAHCSVMIVKRPPK
        +T ++EG+A+ ++C+AVEK    LLVVGS   G +KR  LGSVS YCAHHA+C ++IVK PPK
Subjt:  QTEIVEGDARNVMCDAVEKHQASLLVVGSHNYGVVKRMDLGSVSYYCAHHAHCSVMIVKRPPK

AT3G11930.2 Adenine nucleotide alpha hydrolases-like superfamily protein2.5e-2234.15Show/hide
Query:  MVIGVDESEHSFYALKWMLQHF-------FCTNGTPYKLVIVHAKPPPSSVL--------ANAIPGAENILPMLDADLKKIASRTVEKAKEVCTEHKVEN
        MV+ +DES+ SFYAL+W++ HF                L ++H + P +           A A+  + +++  +    ++ ++  + +A ++C   ++  
Subjt:  MVIGVDESEHSFYALKWMLQHF-------FCTNGTPYKLVIVHAKPPPSSVL--------ANAIPGAENILPMLDADLKKIASRTVEKAKEVCTEHKVEN

Query:  VQTEIVEGDARNVMCDAVEKHQASLLVVGSHNYGVVKRMDLGSVSYYCAHHAHCSVMIVKRPPK
         +T ++EG+A+ ++C+AVEK    LLVVGS   G +KR  LGSVS YCAHHA+C ++IVK PPK
Subjt:  VQTEIVEGDARNVMCDAVEKHQASLLVVGSHNYGVVKRMDLGSVSYYCAHHAHCSVMIVKRPPK

AT3G11930.4 Adenine nucleotide alpha hydrolases-like superfamily protein5.6e-2234.34Show/hide
Query:  MVIGVDESEHSFYALKWMLQHF-------FCTNGTPYKLVIVHAKPP----------PSSVLANAIPGAENILPMLDADLKKIASRTVEKAKEVCTEHKV
        MV+ +DES+ SFYAL+W++ HF                L ++H + P          P    A A+  + +++  +    ++ ++  + +A ++C   ++
Subjt:  MVIGVDESEHSFYALKWMLQHF-------FCTNGTPYKLVIVHAKPP----------PSSVLANAIPGAENILPMLDADLKKIASRTVEKAKEVCTEHKV

Query:  ENVQTEIVEGDARNVMCDAVEKHQASLLVVGSHNYGVVKRMDLGSVSYYCAHHAHCSVMIVKRPPK
           +T ++EG+A+ ++C+AVEK    LLVVGS   G +KR  LGSVS YCAHHA+C ++IVK PPK
Subjt:  ENVQTEIVEGDARNVMCDAVEKHQASLLVVGSHNYGVVKRMDLGSVSYYCAHHAHCSVMIVKRPPK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGAGCAGGTGATGGTGATTGGCGTCGATGAGAGTGAGCACAGCTTCTACGCCTTGAAGTGGATGCTCCAACATTTCTTTTGCACTAATGGTACTCCTTACAAACT
TGTTATTGTCCACGCTAAACCGCCTCCCTCTAGCGTTCTTGCAAACGCCATCCCAGGAGCAGAGAATATCTTGCCCATGCTAGATGCAGATTTGAAGAAAATAGCTTCCA
GGACTGTCGAAAAAGCGAAGGAAGTATGCACCGAACACAAGGTCGAAAACGTGCAGACTGAAATCGTGGAGGGTGATGCCAGAAATGTAATGTGCGATGCTGTAGAAAAA
CACCAAGCATCCCTTTTGGTTGTTGGAAGTCATAATTATGGAGTAGTAAAAAGGATGGACCTGGGCAGTGTAAGTTACTACTGTGCTCACCATGCACATTGCTCTGTCAT
GATTGTGAAGAGGCCACCCAAGCCCAAGAACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGAGCAGGTGATGGTGATTGGCGTCGATGAGAGTGAGCACAGCTTCTACGCCTTGAAGTGGATGCTCCAACATTTCTTTTGCACTAATGGTACTCCTTACAAACT
TGTTATTGTCCACGCTAAACCGCCTCCCTCTAGCGTTCTTGCAAACGCCATCCCAGGAGCAGAGAATATCTTGCCCATGCTAGATGCAGATTTGAAGAAAATAGCTTCCA
GGACTGTCGAAAAAGCGAAGGAAGTATGCACCGAACACAAGGTCGAAAACGTGCAGACTGAAATCGTGGAGGGTGATGCCAGAAATGTAATGTGCGATGCTGTAGAAAAA
CACCAAGCATCCCTTTTGGTTGTTGGAAGTCATAATTATGGAGTAGTAAAAAGGATGGACCTGGGCAGTGTAAGTTACTACTGTGCTCACCATGCACATTGCTCTGTCAT
GATTGTGAAGAGGCCACCCAAGCCCAAGAACTGAAC
Protein sequenceShow/hide protein sequence
MAEQVMVIGVDESEHSFYALKWMLQHFFCTNGTPYKLVIVHAKPPPSSVLANAIPGAENILPMLDADLKKIASRTVEKAKEVCTEHKVENVQTEIVEGDARNVMCDAVEK
HQASLLVVGSHNYGVVKRMDLGSVSYYCAHHAHCSVMIVKRPPKPKN