CuGenDBv2

Sgr016942 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr016942
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Adenine nucleotide alpha hydrolases-like superfamily protein
Genome location: tig00153016:822415..830538
RNA-Seq Expression: Sgr016942
Synteny: Sgr016942
Gene Ontology terms: GO:0110165 - cellular anatomical structure (cellular component)
InterPro domains:
IPR006015 - Universal stress protein A family
IPR006016 - UspA
IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold
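
The genome location string above can be split into a contig name and integer coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, not stated in the record); the helper name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location: str):
    """Split 'contig:start..end' into (contig, start, end) with integer coordinates."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


contig, start, end = parse_location("tig00153016:822415..830538")
# Assumption: 1-based, inclusive coordinates, so the span includes both endpoints.
span = end - start + 1
print(contig, start, end, span)
```

Under that assumption the gene spans 8124 bp on the contig, which is longer than the 1152 nt CDS listed in this record.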


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAG7016574.1 Universal stress protein A-like protein [Cucurbita argyrosperma subsp. argyrosperma] (e value: 8.5e-58; % identity: 72.39)
Query:  VSDNMEKNSNRTVGVAMDYSATSKSALQWAAHNLISHGDRLVLIHVEPSNSDSPRKLLFQDTA--LIPLEEFREVNFEKQYGLSNDAE------------
        +S+NMEK+ NR VGVAMDYSATSKSAL+WAAHNL++HGD+L+LIHVEP NSD+PRKLLFQDT   LIPL+EF+++  EKQYGL NDAE            
Subjt:  VSDNMEKNSNRTVGVAMDYSATSKSALQWAAHNLISHGDRLVLIHVEPSNSDSPRKLLFQDTA--LIPLEEFREVNFEKQYGLSNDAE------------

Query:  -AKVVAKVYWGDPREKLCEAVEDLKLHSLVVGSRGLGPIKRVLLGSVSNYVVRHASCPVTVNK
          +VVAKVYWGDPREKLCEAVEDLKL SLV+GSRGLG IKRVL+GSVS+YVV++ASCPVTV K
Subjt:  -AKVVAKVYWGDPREKLCEAVEDLKLHSLVVGSRGLGPIKRVLLGSVSNYVVRHASCPVTVNK

XP_022938828.1 universal stress protein A-like protein [Cucurbita moschata] (e value: 7.7e-59; % identity: 74.23)
Query:  VSDNMEKNSNRTVGVAMDYSATSKSALQWAAHNLISHGDRLVLIHVEPSNSDSPRKLLFQDTA--LIPLEEFREVNFEKQYGLSNDAE------------
        +S+NMEK+ NR VGVAMDYSATSKSAL+WAAHNL++HGD+L+LIHVEP NSD+PRKLLFQDT   LIPLEEFRE+  EKQYGL NDAE            
Subjt:  VSDNMEKNSNRTVGVAMDYSATSKSALQWAAHNLISHGDRLVLIHVEPSNSDSPRKLLFQDTA--LIPLEEFREVNFEKQYGLSNDAE------------

Query:  -AKVVAKVYWGDPREKLCEAVEDLKLHSLVVGSRGLGPIKRVLLGSVSNYVVRHASCPVTVNK
          +VVAKVYWGDPREKLCEAVEDLKL SLV+GSRGLG IKRVL+GSVS+YVV++ASCPVTV K
Subjt:  -AKVVAKVYWGDPREKLCEAVEDLKLHSLVVGSRGLGPIKRVLLGSVSNYVVRHASCPVTVNK

XP_022993769.1 universal stress protein A-like protein [Cucurbita maxima] (e value: 7.2e-57; % identity: 73.01)
Query:  VSDNMEKNSNRTVGVAMDYSATSKSALQWAAHNLISHGDRLVLIHVEPSNSDSPRKLLFQDTA--LIPLEEFREVNFEKQYGLSNDAE------------
        +S+NMEK+ NR VGVAMDYSATSKSAL+WAAHNL++HGD+L+LIHVEP NSD+PRKLLFQ T   LIPLEEFRE+  EKQYGL ND E            
Subjt:  VSDNMEKNSNRTVGVAMDYSATSKSALQWAAHNLISHGDRLVLIHVEPSNSDSPRKLLFQDTA--LIPLEEFREVNFEKQYGLSNDAE------------

Query:  -AKVVAKVYWGDPREKLCEAVEDLKLHSLVVGSRGLGPIKRVLLGSVSNYVVRHASCPVTVNK
          +VVAKVYWGDPREKLCEAVEDLKL SLV+GSRGL  IKRVLLGSVS+YVV++ASCPVTV K
Subjt:  -AKVVAKVYWGDPREKLCEAVEDLKLHSLVVGSRGLGPIKRVLLGSVSNYVVRHASCPVTVNK

XP_023551238.1 universal stress protein A-like protein [Cucurbita pepo subsp. pepo] (e value: 5.0e-58; % identity: 73.62)
Query:  VSDNMEKNSNRTVGVAMDYSATSKSALQWAAHNLISHGDRLVLIHVEPSNSDSPRKLLFQDTA--LIPLEEFREVNFEKQYGLSNDAE------------
        +S+NMEK+ NR VGVAMDYSATSKSAL+WAAHNL++HGD+L+LIHVEP NSD+PRKLLFQ T   LIPLEEFRE+  EKQYGL NDAE            
Subjt:  VSDNMEKNSNRTVGVAMDYSATSKSALQWAAHNLISHGDRLVLIHVEPSNSDSPRKLLFQDTA--LIPLEEFREVNFEKQYGLSNDAE------------

Query:  -AKVVAKVYWGDPREKLCEAVEDLKLHSLVVGSRGLGPIKRVLLGSVSNYVVRHASCPVTVNK
          +VVAKVYWGDPREKLCEAVEDLKL SLV+GSRGLG IKRVL+GSVS+YVV++ASCPVTV K
Subjt:  -AKVVAKVYWGDPREKLCEAVEDLKLHSLVVGSRGLGPIKRVLLGSVSNYVVRHASCPVTVNK

XP_038875204.1 universal stress protein PHOS32 [Benincasa hispida] (e value: 7.7e-59; % identity: 73.33)
Query:  MEKNSNRTVGVAMDYSATSKSALQWAAHNLISHGDRLVLIHVEPSNSDSPRKLLFQDTA--LIPLEEFREVNFEKQYGLSNDAE-------------AKV
        ME N NR +GVAMDYS+TSKSAL+WAAHNL+SHGD L+LIHVEP NSD+P KLLFQDT   LIPLEEFRE+  EKQYGLSNDAE              KV
Subjt:  MEKNSNRTVGVAMDYSATSKSALQWAAHNLISHGDRLVLIHVEPSNSDSPRKLLFQDTA--LIPLEEFREVNFEKQYGLSNDAE-------------AKV

Query:  VAKVYWGDPREKLCEAVEDLKLHSLVVGSRGLGPIKRVLLGSVSNYVVRHASCPVTVNKDDKVVL
        VAKVYWGDPREKLCEAV+DL LHSLV+GSRGLG IKRVLLGSVSNYVV++ASCPVTV K   + L
Subjt:  VAKVYWGDPREKLCEAVEDLKLHSLVVGSRGLGPIKRVLLGSVSNYVVRHASCPVTVNKDDKVVL

TrEMBL top hits (columns: e value, % identity, alignment)
A0A1S3CIK6 universal stress protein A-like protein (e value: 3.6e-54; % identity: 69.09)
Query:  MEKNSNRTVGVAMDYSATSKSALQWAAHNLISHGDRLVLIHVEPSNSDSPRKLLFQDTA--LIPLEEFREVNFEKQYGLSNDAE-------------AKV
        ME + NR VGVAMDYS+TSKSAL+W A+NL+  GD LVLIHV+P NSD+P KLLFQDT   LIPLEEF E+  EKQYGLSNDAE              KV
Subjt:  MEKNSNRTVGVAMDYSATSKSALQWAAHNLISHGDRLVLIHVEPSNSDSPRKLLFQDTA--LIPLEEFREVNFEKQYGLSNDAE-------------AKV

Query:  VAKVYWGDPREKLCEAVEDLKLHSLVVGSRGLGPIKRVLLGSVSNYVVRHASCPVTVNKDDKVVL
        +AKVYWGDPREKLCEAV+DL LHSLV+GSRGLG IK+VLLGSVSNYVV++ASCPVTV K   + L
Subjt:  VAKVYWGDPREKLCEAVEDLKLHSLVVGSRGLGPIKRVLLGSVSNYVVRHASCPVTVNKDDKVVL

A0A5D3D5Z6 Universal stress protein A-like protein (e value: 1.2e-52; % identity: 75.71)
Query:  MDYSATSKSALQWAAHNLISHGDRLVLIHVEPSNSDSPRKLLFQDTA--LIPLEEFREVNFEKQYGLSNDAEAKVVAKVYWGDPREKLCEAVEDLKLHSL
        MDYS+TSKSAL+W A+NL+  GD LVLIHV+P NSD+P KLLFQDT   LIPLEEF E+  EKQYGLSNDAE KV+AKVYWGDPREKLCEAV+DL LHSL
Subjt:  MDYSATSKSALQWAAHNLISHGDRLVLIHVEPSNSDSPRKLLFQDTA--LIPLEEFREVNFEKQYGLSNDAEAKVVAKVYWGDPREKLCEAVEDLKLHSL

Query:  VVGSRGLGPIKRVLLGSVSNYVVRHASCPVTVNKDDKVVL
        V+GSRGLG IK+VLLGSVSNYVV++ASCPVTV K   + L
Subjt:  VVGSRGLGPIKRVLLGSVSNYVVRHASCPVTVNKDDKVVL

A0A6A1VFL6 Universal stress protein A-like protein (e value: 4.9e-51; % identity: 71.24)
Query:  RTVGVAMDYSATSKSALQWAAHNLISHGDRLVLIHVEPSNSDSPRKLLFQDTA--LIPLEEFREVNFEKQYGLSNDAE-------------AKVVAKVYW
        RT+GV MDYSATSK AL+WAA NLI  GDRLVLIHV+P  SD  RK LF+DT   L+PLEEFRE+NF KQYGL+ND E             AKVVAK+YW
Subjt:  RTVGVAMDYSATSKSALQWAAHNLISHGDRLVLIHVEPSNSDSPRKLLFQDTA--LIPLEEFREVNFEKQYGLSNDAE-------------AKVVAKVYW

Query:  GDPREKLCEAVEDLKLHSLVVGSRGLGPIKRVLLGSVSNYVVRHASCPVTVNK
        GDPREKLC AV+DLKL SLVVGSRGLGPIKRVLLGSVS +VV +ASCPVTV K
Subjt:  GDPREKLCEAVEDLKLHSLVVGSRGLGPIKRVLLGSVSNYVVRHASCPVTVNK

A0A6J1FK03 universal stress protein A-like protein (e value: 3.7e-59; % identity: 74.23)
Query:  VSDNMEKNSNRTVGVAMDYSATSKSALQWAAHNLISHGDRLVLIHVEPSNSDSPRKLLFQDTA--LIPLEEFREVNFEKQYGLSNDAE------------
        +S+NMEK+ NR VGVAMDYSATSKSAL+WAAHNL++HGD+L+LIHVEP NSD+PRKLLFQDT   LIPLEEFRE+  EKQYGL NDAE            
Subjt:  VSDNMEKNSNRTVGVAMDYSATSKSALQWAAHNLISHGDRLVLIHVEPSNSDSPRKLLFQDTA--LIPLEEFREVNFEKQYGLSNDAE------------

Query:  -AKVVAKVYWGDPREKLCEAVEDLKLHSLVVGSRGLGPIKRVLLGSVSNYVVRHASCPVTVNK
          +VVAKVYWGDPREKLCEAVEDLKL SLV+GSRGLG IKRVL+GSVS+YVV++ASCPVTV K
Subjt:  -AKVVAKVYWGDPREKLCEAVEDLKLHSLVVGSRGLGPIKRVLLGSVSNYVVRHASCPVTVNK

A0A6J1K109 universal stress protein A-like protein (e value: 3.5e-57; % identity: 73.01)
Query:  VSDNMEKNSNRTVGVAMDYSATSKSALQWAAHNLISHGDRLVLIHVEPSNSDSPRKLLFQDTA--LIPLEEFREVNFEKQYGLSNDAE------------
        +S+NMEK+ NR VGVAMDYSATSKSAL+WAAHNL++HGD+L+LIHVEP NSD+PRKLLFQ T   LIPLEEFRE+  EKQYGL ND E            
Subjt:  VSDNMEKNSNRTVGVAMDYSATSKSALQWAAHNLISHGDRLVLIHVEPSNSDSPRKLLFQDTA--LIPLEEFREVNFEKQYGLSNDAE------------

Query:  -AKVVAKVYWGDPREKLCEAVEDLKLHSLVVGSRGLGPIKRVLLGSVSNYVVRHASCPVTVNK
          +VVAKVYWGDPREKLCEAVEDLKL SLV+GSRGL  IKRVLLGSVS+YVV++ASCPVTV K
Subjt:  -AKVVAKVYWGDPREKLCEAVEDLKLHSLVVGSRGLGPIKRVLLGSVSNYVVRHASCPVTVNK

SwissProt top hits (columns: e value, % identity, alignment)
P74897 Universal stress protein in QAH/OAS sulfhydrylase 3'region (e value: 5.4e-07; % identity: 29.85)
Query:  RTVGVAMDYSATSKSALQWAAHNLISHGDRLVLIHVEPSNSDSPRKLLFQDTALIPLEEFREVNFEKQYGLSNDAEAKVVAKVYWGDPREKLCEAVEDLK
        +T+ +A D S  +K A   A     +HG RL+++H      D   +  F++     LE   +V  E    ++     +  A +  G P E + +A    K
Subjt:  RTVGVAMDYSATSKSALQWAAHNLISHGDRLVLIHVEPSNSDSPRKLLFQDTALIPLEEFREVNFEKQYGLSNDAEAKVVAKVYWGDPREKLCEAVEDLK

Query:  LHSLVVGSRGLGPIKRVLLGSVSNYVVRHASCPV
           +V+G+RGLG +  + LGS S  VV  A CPV
Subjt:  LHSLVVGSRGLGPIKRVLLGSVSNYVVRHASCPV

P87132 Uncharacterized protein C167.05 (e value: 1.4e-07; % identity: 28.78)
Query:  NRTVGVAMDYSATSKSALQWAAHNLISHGDRLVLIHVEPSNSDSPR--KLLFQDTALIPLEEFREVNFEKQYGLSNDAEAKVVAKVYWGDPREKLCEAVE
        N T  + +D S+ S  A +WA   L+ +GD L+++ V   +  S R  K   +   L  LE+  +   +       + E  +   ++    +  + E ++
Subjt:  NRTVGVAMDYSATSKSALQWAAHNLISHGDRLVLIHVEPSNSDSPR--KLLFQDTALIPLEEFREVNFEKQYGLSNDAEAKVVAKVYWGDPREKLCEAVE

Query:  DLKLHSLVVGSRGLGPIKRVLLGSVSNYVVRHASCPVTV
         ++   +V+GSRG   +K VLLGS SNY+V  +S PV V
Subjt:  DLKLHSLVVGSRGLGPIKRVLLGSVSNYVVRHASCPVTV

Q8L4N1 Universal stress protein PHOS34 (e value: 2.1e-11; % identity: 33.75)
Query:  SNRTVGVAMDYSATSKSALQWAAHNLISHGDRLVLIHVEPSN------------SDSPRKLLFQDTALIP---LEEFREVNFEKQYGLSND-AEAKVVAK
        + R +GVA+D S  S  A++WA  + I  GD +V++HV P++               P      D    P    E+F      K   L+    EA    K
Subjt:  SNRTVGVAMDYSATSKSALQWAAHNLISHGDRLVLIHVEPSN------------SDSPRKLLFQDTALIP---LEEFREVNFEKQYGLSND-AEAKVVAK

Query:  VYW---GDPREKLCEAVEDLKLHSLVVGSRGLGPIKR---VLLGSVSNYVVRHASCPVTV
        ++     D RE+LC   E L L ++++GSRG G  KR     LGSVS+Y V H  CPV V
Subjt:  VYW---GDPREKLCEAVEDLKLHSLVVGSRGLGPIKR---VLLGSVSNYVVRHASCPVTV

Q8LGG8 Universal stress protein A-like protein (e value: 2.0e-09; % identity: 28.87)
Query:  SATSKSALQWAAHNLISHGD---RLVLIHVEPSNSDSPRKLLFQDTALIPLEEFREV-NFEKQYGL---------SNDAEAKVVAKVYWGDPREKLCEAV
        S + K A +W    ++       +++L+HV+  + D    +   D+     E+FR++    K  GL          ++      A +  GDP++ +C+ V
Subjt:  SATSKSALQWAAHNLISHGD---RLVLIHVEPSNSDSPRKLLFQDTALIPLEEFREV-NFEKQYGL---------SNDAEAKVVAKVYWGDPREKLCEAV

Query:  EDLKLHSLVVGSRGLGPIKRVLLGSVSNYVVRHASCPVTVNK
        + ++   LVVGSRGLG  ++V +G+VS + V+HA CPV   K
Subjt:  EDLKLHSLVVGSRGLGPIKRVLLGSVSNYVVRHASCPVTVNK

Q8VYN9 Universal stress protein PHOS32 (e value: 3.0e-13; % identity: 33.33)
Query:  SNRTVGVAMDYSATSKSALQWAAHNLISHGDRLVLIHVEPSN-------SDSPRKLLFQDTALIP---LEEFREVNFEKQYGLSND-AEAKVVAKVYW--
        + R +GVA+D S  S  A++WA  + I  GD +VL+HV P++          P K   +D    P    E+F      K   L+    E     K++   
Subjt:  SNRTVGVAMDYSATSKSALQWAAHNLISHGDRLVLIHVEPSN-------SDSPRKLLFQDTALIP---LEEFREVNFEKQYGLSND-AEAKVVAKVYW--

Query:  -GDPREKLCEAVEDLKLHSLVVGSRGLGPIKR----VLLGSVSNYVVRHASCPVTVNK--DDK-----VVLEFKGGGVGG---AAETTLNHE
          D RE+LC  +E L L ++++GSRG G  K+      LGSVS+Y V H  CPV V +  DD+     +V    GG   G   AA  + +HE
Subjt:  -GDPREKLCEAVEDLKLHSLVVGSRGLGPIKR----VLLGSVSNYVVRHASCPVTVNK--DDK-----VVLEFKGGGVGG---AAETTLNHE

Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G11360.1 Adenine nucleotide alpha hydrolases-like superfamily protein (e value: 1.2e-17; % identity: 33.84)
Query:  SNRTVGVAMDYSATSKSALQWAAHNLISHGDRLVLIHV--------------------EPSNSDSPRKLLFQDTALIPLEEFREVNFEKQYGLSND-AEA
        + R +G+A+D S  S  A+QWA  N +  GD +VL+HV                    +P+N +S RKL          ++F  V  +K   ++    EA
Subjt:  SNRTVGVAMDYSATSKSALQWAAHNLISHGDRLVLIHV--------------------EPSNSDSPRKLLFQDTALIPLEEFREVNFEKQYGLSND-AEA

Query:  KVVAKVYW---GDPREKLCEAVEDLKLHSLVVGSRGLGPIKRV---LLGSVSNYVVRHASCPVTVNK--DDKVVLEFKGGGVGGAAETTLNHEKYTTL
         +  K++     D +E+LC  VE L L +L++GSRG G  KR     LGSVS+Y V H +CPV V +  DDK   + K G  GG  E  ++ +K  T+
Subjt:  KVVAKVYW---GDPREKLCEAVEDLKLHSLVVGSRGLGPIKRV---LLGSVSNYVVRHASCPVTVNK--DDKVVLEFKGGGVGGAAETTLNHEKYTTL

AT3G03270.1 Adenine nucleotide alpha hydrolases-like superfamily protein (e value: 1.3e-40; % identity: 63.36)
Query:  RTVGVAMDYSATSKSALQWAAHNLISHGDRLVLIHVEPSNSDSPRKLLFQDTA--LIPLEEFREVNFEKQYGLSNDAE-------------AKVVAKVYW
        RTVGV MDYS TSK AL+WAA NL+  GD ++LIHV+P N+D  RK+LF++T   LIPLEEFREVN  KQYGL+ D E              KVVAKVYW
Subjt:  RTVGVAMDYSATSKSALQWAAHNLISHGDRLVLIHVEPSNSDSPRKLLFQDTA--LIPLEEFREVNFEKQYGLSNDAE-------------AKVVAKVYW

Query:  GDPREKLCEAVEDLKLHSLVVGSRGLGPIKR
        GDPREKLC+AVE+LKL S+V+GSRGLG +KR
Subjt:  GDPREKLCEAVEDLKLHSLVVGSRGLGPIKR

AT3G03270.2 Adenine nucleotide alpha hydrolases-like superfamily protein (e value: 4.1e-50; % identity: 64.71)
Query:  RTVGVAMDYSATSKSALQWAAHNLISHGDRLVLIHVEPSNSDSPRKLLFQDTA--LIPLEEFREVNFEKQYGLSNDAE-------------AKVVAKVYW
        RTVGV MDYS TSK AL+WAA NL+  GD ++LIHV+P N+D  RK+LF++T   LIPLEEFREVN  KQYGL+ D E              KVVAKVYW
Subjt:  RTVGVAMDYSATSKSALQWAAHNLISHGDRLVLIHVEPSNSDSPRKLLFQDTA--LIPLEEFREVNFEKQYGLSNDAE-------------AKVVAKVYW

Query:  GDPREKLCEAVEDLKLHSLVVGSRGLGPIKRVLLGSVSNYVVRHASCPVTVNK
        GDPREKLC+AVE+LKL S+V+GSRGLG +KR+LLGSVSN+VV +A+CPVTV K
Subjt:  GDPREKLCEAVEDLKLHSLVVGSRGLGPIKRVLLGSVSNYVVRHASCPVTVNK

AT3G17020.1 Adenine nucleotide alpha hydrolases-like superfamily protein (e value: 9.4e-31; % identity: 42.5)
Query:  MEKNSNRTVGVAMDYSATSKSALQWAAHNLISHGDRLVLIHVEPSNSDSPRKLLFQDTA---LIPLEEFREVNFEKQYGLSNDAEA-------------K
        M ++  R +GVA+D+S  SK AL WA  N++  GD L+LI +    +    ++   +T     IP+ EF +    K+Y L  DAE               
Subjt:  MEKNSNRTVGVAMDYSATSKSALQWAAHNLISHGDRLVLIHVEPSNSDSPRKLLFQDTA---LIPLEEFREVNFEKQYGLSNDAEA-------------K

Query:  VVAKVYWGDPREKLCEAVEDLKLHSLVVGSRGLGPIKRVLLGSVSNYVVRHASCPVTVNK
        VV K+YWGDPREK+C A E + L SLV+G+RGLG +KR+++GSVSN+VV + +CPVTV K
Subjt:  VVAKVYWGDPREKLCEAVEDLKLHSLVVGSRGLGPIKRVLLGSVSNYVVRHASCPVTVNK

AT3G53990.1 Adenine nucleotide alpha hydrolases-like superfamily protein (e value: 9.8e-36; % identity: 46.5)
Query:  NRTVGVAMDYSATSKSALQWAAHNLISHGDRLVLIHVEPSNSDSPRKLLF--QDTALIPLEEFREVNFEKQYGLSND-------------AEAKVVAKVY
        +R +G+AMD+S +SK+AL+WA  NL   GD + +IH  P + D  R  L+    + LIPL EFRE    ++YG+  D              E  VV K+Y
Subjt:  NRTVGVAMDYSATSKSALQWAAHNLISHGDRLVLIHVEPSNSDSPRKLLF--QDTALIPLEEFREVNFEKQYGLSND-------------AEAKVVAKVY

Query:  WGDPREKLCEAVEDLKLHSLVVGSRGLGPIKRVLLGSVSNYVVRHASCPVTVNKDDK
        WGD REKL +AV+DLKL S+V+GSRGL  ++R+++GSVS++V++HA CPVTV KD++
Subjt:  WGDPREKLCEAVEDLKLHSLVVGSRGLGPIKRVLLGSVSNYVVRHASCPVTVNKDDK


Sequences
CDS sequence
ATGGTATCAGACAATATGGAGAAGAATAGCAATCGCACAGTGGGAGTAGCCATGGATTACTCTGCAACAAGCAAATCAGCTTTACAATGGGCTGCACATAATCTCATCAG
CCATGGAGATCGCTTAGTTCTGATCCATGTTGAGCCTTCCAATTCCGACAGTCCCAGAAAGCTTCTCTTCCAGGACACTGCTTTGATTCCTCTTGAGGAATTCAGAGAGG
TTAACTTTGAGAAGCAATATGGACTAAGTAATGATGCAGAGGCAAAAGTGGTGGCTAAGGTTTACTGGGGTGATCCTCGGGAGAAGTTGTGTGAGGCTGTGGAGGATCTC
AAGCTCCATTCGCTTGTTGTTGGTAGCAGGGGTTTAGGCCCTATCAAAAGGGTTTTGCTTGGCAGTGTGAGCAACTACGTGGTGAGACATGCTTCATGTCCAGTTACAGT
GAATAAGGATGACAAAGTTGTACTTGAATTCAAGGGAGGAGGAGTTGGGGGAGCAGCAGAAACAACTTTGAACCATGAAAAATACACAACACTCTCATTCCAAATACATA
TATTTTGGCTCTCATCCGTTTTCAATTATATATATATATATATATATATATATATTATTTGCCTTCCTGTTCCATCTCTAACAAACTCCATTGGTCTTCTTCTTCTCCAT
CACAAACTTCCTCCAAGCGGTGGTGATCTCCCTCTGCGTCTCCAGGGCGCTCTTCTTCGCCACCACGACGTCGTTCCGCTGCGCTTCTTCTCCGGCCACCGTCGTCTTGA
TATCGGTCCTCGACGGCAGACCGTCCAGCTCGTGCAGCACTTTCTGGATGTCCCCCACCAGACGCTCCGCAAGTGTTCGAGAAAAGTCTTCTCTGATCACGACTCTCAGA
ACTGTTATGTGCTGCGCGTCCGGCGGCATGGTGGCTTGCCTTCAGCCTTCCTTCTGTTCTGCCACGTTCTCTTGAAAGCCAAACCAGCCAGCATTATGGCCTCCGACGAC
CCCACAGTTCCCACTCCCACTGCCGTCTCAGACTCACCCAATGGGGCATTGAACAAAACTCCGCCACGACCGACGCTGCAATTTGCAGTTGAGGAGAATTTCCAGAGGAT
AGCGTTCATGGAGAGAAACGATAGAACGGATAGATCTAAGCTTTTTGATTAA
mRNA sequence
ATGGTATCAGACAATATGGAGAAGAATAGCAATCGCACAGTGGGAGTAGCCATGGATTACTCTGCAACAAGCAAATCAGCTTTACAATGGGCTGCACATAATCTCATCAG
CCATGGAGATCGCTTAGTTCTGATCCATGTTGAGCCTTCCAATTCCGACAGTCCCAGAAAGCTTCTCTTCCAGGACACTGCTTTGATTCCTCTTGAGGAATTCAGAGAGG
TTAACTTTGAGAAGCAATATGGACTAAGTAATGATGCAGAGGCAAAAGTGGTGGCTAAGGTTTACTGGGGTGATCCTCGGGAGAAGTTGTGTGAGGCTGTGGAGGATCTC
AAGCTCCATTCGCTTGTTGTTGGTAGCAGGGGTTTAGGCCCTATCAAAAGGGTTTTGCTTGGCAGTGTGAGCAACTACGTGGTGAGACATGCTTCATGTCCAGTTACAGT
GAATAAGGATGACAAAGTTGTACTTGAATTCAAGGGAGGAGGAGTTGGGGGAGCAGCAGAAACAACTTTGAACCATGAAAAATACACAACACTCTCATTCCAAATACATA
TATTTTGGCTCTCATCCGTTTTCAATTATATATATATATATATATATATATATATTATTTGCCTTCCTGTTCCATCTCTAACAAACTCCATTGGTCTTCTTCTTCTCCAT
CACAAACTTCCTCCAAGCGGTGGTGATCTCCCTCTGCGTCTCCAGGGCGCTCTTCTTCGCCACCACGACGTCGTTCCGCTGCGCTTCTTCTCCGGCCACCGTCGTCTTGA
TATCGGTCCTCGACGGCAGACCGTCCAGCTCGTGCAGCACTTTCTGGATGTCCCCCACCAGACGCTCCGCAAGTGTTCGAGAAAAGTCTTCTCTGATCACGACTCTCAGA
ACTGTTATGTGCTGCGCGTCCGGCGGCATGGTGGCTTGCCTTCAGCCTTCCTTCTGTTCTGCCACGTTCTCTTGAAAGCCAAACCAGCCAGCATTATGGCCTCCGACGAC
CCCACAGTTCCCACTCCCACTGCCGTCTCAGACTCACCCAATGGGGCATTGAACAAAACTCCGCCACGACCGACGCTGCAATTTGCAGTTGAGGAGAATTTCCAGAGGAT
AGCGTTCATGGAGAGAAACGATAGAACGGATAGATCTAAGCTTTTTGATTAA
Protein sequence
MVSDNMEKNSNRTVGVAMDYSATSKSALQWAAHNLISHGDRLVLIHVEPSNSDSPRKLLFQDTALIPLEEFREVNFEKQYGLSNDAEAKVVAKVYWGDPREKLCEAVEDL
KLHSLVVGSRGLGPIKRVLLGSVSNYVVRHASCPVTVNKDDKVVLEFKGGGVGGAAETTLNHEKYTTLSFQIHIFWLSSVFNYIYIYIYIYIICLPVPSLTNSIGLLLLH
HKLPPSGGDLPLRLQGALLRHHDVVPLRFFSGHRRLDIGPRRQTVQLVQHFLDVPHQTLRKCSRKVFSDHDSQNCYVLRVRRHGGLPSAFLLFCHVLLKAKPASIMASDD
PTVPTPTAVSDSPNGALNKTPPRPTLQFAVEENFQRIAFMERNDRTDRSKLFD
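
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and a CDS that starts in frame; the function name `translate` and the codon-table construction are illustrative, not taken from the database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and other whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' marks ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 51 nucleotides of the CDS listed above.
print(translate("ATGGTATCAGACAATATGGAGAAGAATAGCAATCGCACAGTGGGAGTAGCC"))
```

Passing the full CDS (all lines joined) should reproduce the protein sequence in this record if the assumptions hold; the 1152 nt CDS corresponds to 383 residues plus a stop codon.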