; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh18G011180 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh18G011180
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionAdenine nucleotide alpha hydrolases-like superfamily protein
Genome locationCmo_Chr18:11556233..11558457
RNA-Seq ExpressionCmoCh18G011180
SyntenyCmoCh18G011180
Gene Ontology termsNA
InterPro domainsIPR006016 - UspA
IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6574035.1 hypothetical protein SDJN03_27922, partial [Cucurbita argyrosperma subsp. sororia]2.1e-10798.05Show/hide
Query:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNTVQNQDMIVLLHLTNPSKKG
        MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNTVQNQDMIVLLHLTNPSKKG
Subjt:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNTVQNQDMIVLLHLTNPSKKG

Query:  EGSKETA-PRTYELVHSMRNLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGDGGGTVEYCIQNASCM
        EGSKETA PR YELVHSMRNLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRW GGGDGGGTVEYCIQNASCM
Subjt:  EGSKETA-PRTYELVHSMRNLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGDGGGTVEYCIQNASCM

Query:  AIAVK
        AIAV+
Subjt:  AIAVK

KAG6601483.1 hypothetical protein SDJN03_06716, partial [Cucurbita argyrosperma subsp. sororia]1.6e-8382.3Show/hide
Query:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNG-GGKPALGIGRKIMIVVDSTIEAEGALQWALSNTVQNQDMIVLLHLTNPSKK
        MGKTG KLPSFCLNRIR HVRVPIQSK DSVSVKTG +K  +  E    G KP LG GRKIMIVVDSTIEAEGALQWALS+TVQNQD IVLLH+  PS+K
Subjt:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNG-GGKPALGIGRKIMIVVDSTIEAEGALQWALSNTVQNQDMIVLLHLTNPSKK

Query:  GEG-SKETAPRTYELVHSMRNLCQLKRPEVEIEVAVVE-GKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRW--GGGGDGGGTVEYCIQN
        GEG SKET+PR YELVHSMRNLCQLKRPEVE+EVAVVE GKEKG VIVEEARKQ ASLLVLGQ KKRSTTWRLLM+WAGHRW  GGGG GGG VEYCIQN
Subjt:  GEG-SKETAPRTYELVHSMRNLCQLKRPEVEIEVAVVE-GKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRW--GGGGDGGGTVEYCIQN

Query:  ASCMAIAVK
        ASCMAIAV+
Subjt:  ASCMAIAVK

XP_022945318.1 uncharacterized protein LOC111449594 [Cucurbita moschata]1.5e-11099.51Show/hide
Query:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNTVQNQDMIVLLHLTNPSKKG
        MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNTVQNQDMIVLLHLTNPSKKG
Subjt:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNTVQNQDMIVLLHLTNPSKKG

Query:  EGSKETAPRTYELVHSMRNLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGDGGGTVEYCIQNASCMA
        EGSKETAPRTYELVHSMRNLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGDGGGTVEYCIQNASCMA
Subjt:  EGSKETAPRTYELVHSMRNLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGDGGGTVEYCIQNASCMA

Query:  IAVK
        IAV+
Subjt:  IAVK

XP_022968363.1 universal stress protein PHOS32-like [Cucurbita maxima]3.1e-10395.15Show/hide
Query:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNTVQNQDMIVLLHLTNPSKKG
        MGKTGGKLPSF LNRIRSHVRVPIQSKPDSVSVKTGER GGEF ESNGG KPALGIGRKIMIVVDSTIEAEGALQWALSNTVQNQDMIVLLHLTNPSKKG
Subjt:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNTVQNQDMIVLLHLTNPSKKG

Query:  EGSKETAPRTYELVHSMRNLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGD--GGGTVEYCIQNASC
        EGSKETAPR YELVHSMRNLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLM+WAGHR GGGGD  GGGTVEYCIQNASC
Subjt:  EGSKETAPRTYELVHSMRNLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGD--GGGTVEYCIQNASC

Query:  MAIAVK
        MAIAV+
Subjt:  MAIAVK

XP_023542429.1 uncharacterized protein LOC111802335 [Cucurbita pepo subsp. pepo]9.3e-10897.56Show/hide
Query:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNTVQNQDMIVLLHLTNPSKKG
        MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNTVQNQDMIVLLHLT PSKKG
Subjt:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNTVQNQDMIVLLHLTNPSKKG

Query:  EGSKETAPRTYELVHSMRNLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGD-GGGTVEYCIQNASCM
        EGSKETAPRTYELVHSMRNLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKR TTWRLLM+WAGHRWGGGGD GGGTVEYCIQNASCM
Subjt:  EGSKETAPRTYELVHSMRNLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGD-GGGTVEYCIQNASCM

Query:  AIAVK
        AIAV+
Subjt:  AIAVK

TrEMBL top hitse value%identityAlignment
A0A0A0KQW0 Usp domain-containing protein1.8e-7776.74Show/hide
Query:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGE--FDESNG------GGKPALGIGRKIMIVVDSTIEAEGALQWALSNTVQNQDMIVLLH
        MGKTG KLPSFCLNRIR HVRVPIQSKPD VSVKTG  K  +   DE N         K  +GIGRKIMIVVDSTIEAEGAL WALS+TVQ QD I+LLH
Subjt:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGE--FDESNG------GGKPALGIGRKIMIVVDSTIEAEGALQWALSNTVQNQDMIVLLH

Query:  LTNPSKKGEG-SKETAPRTYELVHSMRNLCQLKRPEVEIEVAVVE-GKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRW-GGGGDGGGTV
        +T PS KGEG +KETAPR YELVHSMR LCQLKRPEVE EV VVE GKEKG VIVEEARK+ ASLLVLGQ KKRSTTWRLLM+WAG RW GGGG  GG V
Subjt:  LTNPSKKGEG-SKETAPRTYELVHSMRNLCQLKRPEVEIEVAVVE-GKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRW-GGGGDGGGTV

Query:  EYCIQNASCMAIAVK
        EYCIQNASCMAIAV+
Subjt:  EYCIQNASCMAIAVK

A0A6J1G0H0 uncharacterized protein LOC1114495947.4e-11199.51Show/hide
Query:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNTVQNQDMIVLLHLTNPSKKG
        MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNTVQNQDMIVLLHLTNPSKKG
Subjt:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNTVQNQDMIVLLHLTNPSKKG

Query:  EGSKETAPRTYELVHSMRNLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGDGGGTVEYCIQNASCMA
        EGSKETAPRTYELVHSMRNLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGDGGGTVEYCIQNASCMA
Subjt:  EGSKETAPRTYELVHSMRNLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGDGGGTVEYCIQNASCMA

Query:  IAVK
        IAV+
Subjt:  IAVK

A0A6J1GXX9 uncharacterized protein LOC1114582161.0e-8381.9Show/hide
Query:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNG-GGKPALGIGRKIMIVVDSTIEAEGALQWALSNTVQNQDMIVLLHLTNPSKK
        MGKTG KLPSFCLNRIR HVRVPIQSK DSVSVKTG +K  +  E    G KP LG GRKIMIVVDSTIEAEGALQWALS+TVQNQD IVLLH+  PS+K
Subjt:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNG-GGKPALGIGRKIMIVVDSTIEAEGALQWALSNTVQNQDMIVLLHLTNPSKK

Query:  GEG-SKETAPRTYELVHSMRNLCQLKRPEVEIEVAVVE-GKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRW---GGGGDGGGTVEYCIQ
        GEG SKET+PR YELVHSMRNLCQLKRPEVE+EVAVVE GKEKG VIVEEARKQ ASLLVLGQ KKRSTTWRLLM+WAGHRW   GGGG GGG VEYCIQ
Subjt:  GEG-SKETAPRTYELVHSMRNLCQLKRPEVEIEVAVVE-GKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRW---GGGGDGGGTVEYCIQ

Query:  NASCMAIAVK
        NASCMAIAV+
Subjt:  NASCMAIAVK

A0A6J1HXT3 universal stress protein PHOS32-like1.5e-10395.15Show/hide
Query:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNTVQNQDMIVLLHLTNPSKKG
        MGKTGGKLPSF LNRIRSHVRVPIQSKPDSVSVKTGER GGEF ESNGG KPALGIGRKIMIVVDSTIEAEGALQWALSNTVQNQDMIVLLHLTNPSKKG
Subjt:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNTVQNQDMIVLLHLTNPSKKG

Query:  EGSKETAPRTYELVHSMRNLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGD--GGGTVEYCIQNASC
        EGSKETAPR YELVHSMRNLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLM+WAGHR GGGGD  GGGTVEYCIQNASC
Subjt:  EGSKETAPRTYELVHSMRNLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGD--GGGTVEYCIQNASC

Query:  MAIAVK
        MAIAV+
Subjt:  MAIAVK

A0A6J1IT49 uncharacterized protein LOC1114784232.2e-8380.84Show/hide
Query:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERK------GGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNTVQNQDMIVLLHLT
        MGKTG KLPSFCLNRIR HVRVPIQSK DSVSVKTG +K      GGE      G KP LG GRKIMIV+DSTIEAEGALQWALS+TVQNQD IVLLH+ 
Subjt:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERK------GGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNTVQNQDMIVLLHLT

Query:  NPSKKGEG-SKETAPRTYELVHSMRNLCQLKRPEVEIEVAVVE-GKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGG--DGGGTVE
         PS+KGEG SKET+PR YELVHSMRNLCQLKRPEVE+EVAVVE GKEKG VIVEEARKQ ASLLVLGQ KKRSTTWRLLM+WAGHRWGGGG   GGG VE
Subjt:  NPSKKGEG-SKETAPRTYELVHSMRNLCQLKRPEVEIEVAVVE-GKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGG--DGGGTVE

Query:  YCIQNASCMAIAVK
        YCIQNASCMAIAV+
Subjt:  YCIQNASCMAIAVK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G69080.1 Adenine nucleotide alpha hydrolases-like superfamily protein4.4e-3142.2Show/hide
Query:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGI-GRKIMIVVDSTIEAEGALQWALSNTVQNQDMIVLLHLTNP---
        MGK G     F ++R+R++VRV    +P + + + G         S+     ++ I GR+I++VVDS  EA+ AL W LS+  Q QD I+LLH       
Subjt:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGI-GRKIMIVVDSTIEAEGALQWALSNTVQNQDMIVLLHLTNP---

Query:  -----SKKGEG-----SKETAPRTYELVHSMRNLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGDGG
             + K EG      K T  R  + V +++ +C+LKRPEV+ EV  V+G EKGP IV+EAR++ ASLLVLGQKK+ + TWRLLM+WA           
Subjt:  -----SKKGEG-----SKETAPRTYELVHSMRNLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGDGG

Query:  GTVEYCIQNASCMAIAVK
          VEYCI N+ CMAIAV+
Subjt:  GTVEYCIQNASCMAIAVK

AT1G69080.2 Adenine nucleotide alpha hydrolases-like superfamily protein8.8e-2438.97Show/hide
Query:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGI-GRKIMIVVDSTIEAEGALQWALSNTVQNQDMIVLLHLTNP---
        MGK G     F ++R+R++VRV    +P + + + G         S+     ++ I GR+I++VVDS  EA+ AL W LS+  Q QD I+LLH       
Subjt:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGI-GRKIMIVVDSTIEAEGALQWALSNTVQNQDMIVLLHLTNP---

Query:  -----SKKGEGSKETAPRTYELVHSMRNLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGDGGGTVEY
             + K EG  E+  +                 +V+ EV  V+G EKGP IV+EAR++ ASLLVLGQKK+ + TWRLLM+WA             VEY
Subjt:  -----SKKGEGSKETAPRTYELVHSMRNLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGDGGGTVEY

Query:  CIQNASCMAIAVK
        CI N+ CMAIAV+
Subjt:  CIQNASCMAIAVK

AT2G03720.1 Adenine nucleotide alpha hydrolases-like superfamily protein5.5e-3450.98Show/hide
Query:  MIVVDSTIEAEGALQWALSNTVQNQDMIVLLHLTNP---SKKGEGSKETAPRTYELVHSMRNLCQLKRPEVEIEVAVVE-GKEKGPVIVEEARKQGASLL
        M+VVD+T + + ALQWAL++ VQ++D I LLH+T         E  +E   R +ELVH ++N CQLK+P V+ E+ VVE  +EKG  IVEE++KQGA +L
Subjt:  MIVVDSTIEAEGALQWALSNTVQNQDMIVLLHLTNP---SKKGEGSKETAPRTYELVHSMRNLCQLKRPEVEIEVAVVE-GKEKGPVIVEEARKQGASLL

Query:  VLGQKKKRSTTWRLLMIWAGHRWGGGGDGGGTVEYCIQNASCMAIAV--KTNN
        VLGQ +KR++ WR++  W       GG GGG VEYCI N+ CMAIAV  K+NN
Subjt:  VLGQKKKRSTTWRLLMIWAGHRWGGGGDGGGTVEYCIQNASCMAIAV--KTNN

AT3G03290.1 Adenine nucleotide alpha hydrolases-like superfamily protein1.8e-3246.1Show/hide
Query:  GRKIMIVVDSTIEAEGALQWALSNTVQNQDMIVLLHLTNPSKKGE-GSKETAPRTYELVHSMRNLCQLKRPEVEIEVAVVEG--KEKGPVIVEEARKQGA
        G ++M+VVD  I + GAL+WAL +T+Q+QD + LL+ + P +KG+  ++++  +T ELVH+++ LCQ KRP +E+E+  ++G  KEKG  IVEEA++Q  
Subjt:  GRKIMIVVDSTIEAEGALQWALSNTVQNQDMIVLLHLTNPSKKGE-GSKETAPRTYELVHSMRNLCQLKRPEVEIEVAVVEG--KEKGPVIVEEARKQGA

Query:  SLLVLGQKKKRSTTWRLLMIWAGHRWGGGGDGGGTVEYCIQNASCMAIAVKTNN
        SLLV+G K+K+   WRLL  W    W       GT++YC++ ASCM IAVK  N
Subjt:  SLLVLGQKKKRSTTWRLLMIWAGHRWGGGGDGGGTVEYCIQNASCMAIAVKTNN

AT5G17390.1 Adenine nucleotide alpha hydrolases-like superfamily protein1.2e-3146.1Show/hide
Query:  GRKIMIVVDSTIEAEGALQWALSNTVQNQDMIVLLHLTNPSKKGE-GSKETAPRTYELVHSMRNLCQLKRPEVEIEVAVVEG--KEKGPVIVEEARKQGA
        G ++M+VVD  + + GAL+WA+++T+Q QD + LL+   P +K +  +++   +T ELVH+++ LCQ KRP +E+E+  +EG  K+KG  IVEE++KQ  
Subjt:  GRKIMIVVDSTIEAEGALQWALSNTVQNQDMIVLLHLTNPSKKGE-GSKETAPRTYELVHSMRNLCQLKRPEVEIEVAVVEG--KEKGPVIVEEARKQGA

Query:  SLLVLGQKKKRSTTWRLLMIWAGHRWGGGGDGGGTVEYCIQNASCMAIAVKTNN
        SLLV+GQ+KK    WRLL  WA  R  G     G ++YC++NASCM IAVK  N
Subjt:  SLLVLGQKKKRSTTWRLLMIWAGHRWGGGGDGGGTVEYCIQNASCMAIAVKTNN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAAAACCGGTGGAAAGCTGCCGAGTTTTTGCCTGAACCGGATCCGGTCTCATGTTCGTGTGCCAATTCAGTCCAAACCGGACTCTGTTTCTGTGAAAACAGGGGA
GAGGAAGGGCGGGGAGTTTGATGAGAGTAATGGTGGAGGTAAGCCGGCGTTGGGAATTGGAAGGAAGATAATGATTGTGGTTGATTCCACCATTGAAGCTGAAGGAGCTC
TTCAATGGGCGCTCTCAAATACGGTTCAGAATCAAGATATGATTGTTCTTCTCCATCTCACCAACCCTTCAAAAAAAGGGGAAGGATCTAAGGAGACAGCACCAAGAACG
TATGAACTAGTTCATTCGATGAGAAATCTATGTCAATTGAAGCGACCCGAGGTGGAAATTGAAGTAGCAGTGGTGGAAGGGAAGGAGAAAGGGCCAGTGATTGTAGAAGA
AGCAAGAAAGCAAGGGGCATCGTTGCTGGTTTTGGGGCAGAAGAAGAAACGGTCGACGACATGGCGGCTTCTGATGATCTGGGCCGGCCACCGGTGGGGTGGCGGCGGAG
ACGGCGGCGGAACAGTGGAGTATTGTATTCAGAATGCGAGCTGCATGGCGATTGCAGTCAAAACAAACAACATCTACTCGTCATGGGCACGACCCATAACAAGTAAGTCA
AAACGGACAACAAAATTAAACACATCCATAATTATTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGGAAAACCGGTGGAAAGCTGCCGAGTTTTTGCCTGAACCGGATCCGGTCTCATGTTCGTGTGCCAATTCAGTCCAAACCGGACTCTGTTTCTGTGAAAACAGGGGA
GAGGAAGGGCGGGGAGTTTGATGAGAGTAATGGTGGAGGTAAGCCGGCGTTGGGAATTGGAAGGAAGATAATGATTGTGGTTGATTCCACCATTGAAGCTGAAGGAGCTC
TTCAATGGGCGCTCTCAAATACGGTTCAGAATCAAGATATGATTGTTCTTCTCCATCTCACCAACCCTTCAAAAAAAGGGGAAGGATCTAAGGAGACAGCACCAAGAACG
TATGAACTAGTTCATTCGATGAGAAATCTATGTCAATTGAAGCGACCCGAGGTGGAAATTGAAGTAGCAGTGGTGGAAGGGAAGGAGAAAGGGCCAGTGATTGTAGAAGA
AGCAAGAAAGCAAGGGGCATCGTTGCTGGTTTTGGGGCAGAAGAAGAAACGGTCGACGACATGGCGGCTTCTGATGATCTGGGCCGGCCACCGGTGGGGTGGCGGCGGAG
ACGGCGGCGGAACAGTGGAGTATTGTATTCAGAATGCGAGCTGCATGGCGATTGCAGTCAAAACAAACAACATCTACTCGTCATGGGCACGACCCATAACAAGTAAGTCA
AAACGGACAACAAAATTAAACACATCCATAATTATTTAA
Protein sequenceShow/hide protein sequence
MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNTVQNQDMIVLLHLTNPSKKGEGSKETAPRT
YELVHSMRNLCQLKRPEVEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGDGGGTVEYCIQNASCMAIAVKTNNIYSSWARPITSKS
KRTTKLNTSIII