CuGenDBv2

Sgr016119 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr016119
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Proton_antipo_M domain-containing protein
Genome location: tig00007400:119154..121746
RNA-Seq Expression: Sgr016119
Synteny: Sgr016119
Gene Ontology terms:
GO:0042773 - ATP synthesis coupled electron transport (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0008137 - NADH dehydrogenase (ubiquinone) activity (molecular function)
InterPro domains: IPR003918 - NADH:ubiquinone oxidoreductase
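
The genome location listed above follows a `contig:start..end` pattern. A minimal sketch of reading such a string is given here, assuming the coordinates are 1-based and inclusive (the page does not state this). The names `Location` and `parse_location` are hypothetical helpers for illustration, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates ASSUMED 1-based and inclusive."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive span, under the 1-based assumption stated above.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'contig:start..end' string (hypothetical helper)."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start {start} is greater than end {end}")
    return Location(contig, start, end)


loc = parse_location("tig00007400:119154..121746")
print(loc.contig, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the interval for Sgr016119 spans 2593 positions on contig tig00007400.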


Homology
GenBank top hits | e value | % identity | Alignment
CAD1835877.1 unnamed protein product [Ananas comosus var. bracteatus] | 2.0e-63 | 47.85
Query:  MLEHFCECYSDLSGPILCPVLGSITPLFIPNSRIRPIRLI------------------------------------------------------------
        MLEHFCECYSDLSGPI CPVLGSITPLFIPNS IRPIRLI                                                            
Subjt:  MLEHFCECYSDLSGPILCPVLGSITPLFIPNSRIRPIRLI------------------------------------------------------------

Query:  ------VGWSGMRSYGKEYITASLIREFLMIAVFRMLDLILFYVLPESVPIPMLCGAEHLLFAGIKLFLCRGLVHLVCKHSISGRTRPTQLRRTEPDRHI
              VGWSGMRSYGKEYITASLIREFLMIAV  MLD +LFYVLPESVPIPMLCGAEHL+FAGIKLFLCRGLV              T+L  +  D   
Subjt:  ------VGWSGMRSYGKEYITASLIREFLMIAVFRMLDLILFYVLPESVPIPMLCGAEHLLFAGIKLFLCRGLVHLVCKHSISGRTRPTQLRRTEPDRHI

Query:  FSFWEGTPSSGYLEGVLLGVCRGCVCSREWIPRQVSLGGVDTLARIRVTATREKE-KGNCTRPGMDVNSRVGSSVVAGELLLVETFLWPPSKGNAGWVKL
                     EGVLLGVCRGCVCSREWIPRQV  G   T A        EKE K   TR G  +NS        G++ L       P K NAG V+L
Subjt:  FSFWEGTPSSGYLEGVLLGVCRGCVCSREWIPRQVSLGGVDTLARIRVTATREKE-KGNCTRPGMDVNSRVGSSVVAGELLLVETFLWPPSKGNAGWVKL

Query:  GREFENR------------------------------------RASPATQAALFATIREAKDSRVQAENTCI
        GR FENR                                    RASPATQAALFAT REAKDSRV+AE T I
Subjt:  GREFENR------------------------------------RASPATQAALFATIREAKDSRVQAENTCI

KEH16850.1 NADH-quinone oxidoreductase protein [Medicago truncatula] | 1.1e-69 | 59.7
Query:  MLEHFCECYSDLSGPILCPVLGSITPLFIPNSRIRPIRLI------------------------------------------------------------
        MLEHFCECYSDLSGPILCPVLGSI PLFIPNSRIRPIRLI                                                            
Subjt:  MLEHFCECYSDLSGPILCPVLGSITPLFIPNSRIRPIRLI------------------------------------------------------------

Query:  ------VGWSGMRSYGKEYITASLIREFLMIAVFRMLDLILFYVLPESVPIPMLCGAEHLLFAGIKLFLCRGLVHLVCKHSISGRTRPTQLRRTEPDRHI
              VGWSGMRSYGKEYITASLIREFLMIAVFRMLD +LFYVLPESV IPMLCGAEHLLFAGIKLFLCRGLV              T+L  +  D   
Subjt:  ------VGWSGMRSYGKEYITASLIREFLMIAVFRMLDLILFYVLPESVPIPMLCGAEHLLFAGIKLFLCRGLVHLVCKHSISGRTRPTQLRRTEPDRHI

Query:  FSFWEGTPSSGYLEGVLLGVCRGCVCSREWIPRQVSLGGVDTLARIRVTATREKEKGNCTRPGMDVNS
                     EGVLLGVCRGCVCSREWIPRQVSLGGVDTLARIRVTATREK KGNCT+PGMDVNS
Subjt:  FSFWEGTPSSGYLEGVLLGVCRGCVCSREWIPRQVSLGGVDTLARIRVTATREKEKGNCTRPGMDVNS

KJB09790.1 hypothetical protein B456_001G166300 [Gossypium raimondii] | 6.4e-70 | 54.37
Query:  PIRLIVGWSGMRSYGKEYITASLIREFLMIAVFRMLDLILFYVLPESVPIPMLCGAEHLLFAGIKLFLCRGLVHLVCKHSISGRTRPTQLRRTEPDRHIF
        PI ++VGWSGMRSYGKEYITASLIREFLMIAVFRMLDL+LFYV PES                                                     
Subjt:  PIRLIVGWSGMRSYGKEYITASLIREFLMIAVFRMLDLILFYVLPESVPIPMLCGAEHLLFAGIKLFLCRGLVHLVCKHSISGRTRPTQLRRTEPDRHIF

Query:  SFWEGTPSSGYLEGVLLGVCRGCVCSREWIPRQVSLGGVDTLARIRVTATREKEKGNCTRPGMDVNSRVGSSVVAGEL-LLVETFLWPPSKGNAGWVKLG
                    EGVLLGVCRGCVCSREWIPRQVSLGGVDTLARIRVTATREK KGNCTRPGMD      S +   +  L +ETFLWPPSKGNAG VKLG
Subjt:  SFWEGTPSSGYLEGVLLGVCRGCVCSREWIPRQVSLGGVDTLARIRVTATREKEKGNCTRPGMDVNSRVGSSVVAGEL-LLVETFLWPPSKGNAGWVKLG

Query:  REFENR-------------------------------------------------RASPATQAALFATIREAKDSRVQAENTCIDSGLMTRADDGSSGRS
        R FENR                                                 RASPATQAALFAT R+AKDSRVQAENTCIDSGLMTRADDGSSGRS
Subjt:  REFENR-------------------------------------------------RASPATQAALFATIREAKDSRVQAENTCIDSGLMTRADDGSSGRS

Query:  RMMRKSHVR
        RMMRKSH+R
Subjt:  RMMRKSHVR

KVH96778.1 hypothetical protein Ccrd_001132, partial [Cynara cardunculus var. scolymus] | 6.0e-84 | 57.68
Query:  MLEHFCECYSDLSGPILCPVLGSITPLFIPNSRIRPIRLI------------------------------------------------------------
        MLEHFCECYSDLSG ILCPVLGSITPLFIPNSRIRP+RLI                                                            
Subjt:  MLEHFCECYSDLSGPILCPVLGSITPLFIPNSRIRPIRLI------------------------------------------------------------

Query:  ------VGWSGMRSYGKEYITASLIREFLMIAVFRMLDLILFYVLPESVPIPMLCGAEHLLFAGIKLFLCRGLVHLVCKHSISGRTRPTQLRRTEPDRHI
              VGWSG RSYGKEYITASLIREFLMIAVFRMLD +LFYVLPESV IPMLCGAEHL+FAGIKLFLCRGLV              T+L  +  D   
Subjt:  ------VGWSGMRSYGKEYITASLIREFLMIAVFRMLDLILFYVLPESVPIPMLCGAEHLLFAGIKLFLCRGLVHLVCKHSISGRTRPTQLRRTEPDRHI

Query:  FSFWEGTPSSGYLEGVLLGVCRGCVCSREWIPRQVSLGGVDTLARIRVTATREKEKGNCTRPGMDVNSRVGSSVVAGE--LLLVETFLWPPSK-----GN
                     EGVLLGVCRGCVCSREWIPRQVSLGGVDTLARIRVTATREKEKGNCTRPGMDVNS    S +  +     VETFLWP  K     G 
Subjt:  FSFWEGTPSSGYLEGVLLGVCRGCVCSREWIPRQVSLGGVDTLARIRVTATREKEKGNCTRPGMDVNSRVGSSVVAGE--LLLVETFLWPPSK-----GN

Query:  AGWVKLGREFENRRASPATQAALFATIREAKDSRVQAENTCIDSG
         G  +    +E  RASPA QAALFAT REAKD RVQAENT IDSG
Subjt:  AGWVKLGREFENRRASPATQAALFATIREAKDSRVQAENTCIDSG

TYJ49584.1 hypothetical protein E1A91_A01G143900v1 [Gossypium mustelinum] | 2.3e-67 | 54.73
Query:  PIRLIVGWSGMRSYGKEYITASLIREFLMIAVFRMLDLILFYVLPESVPIPMLCGAEHLLFAGIKLFLCRGLVHLVCKHSISGRTRPTQLRRTEPDRHIF
        PI ++VGWSGMRSYGKEYITASLIREFLMIAVFRMLDL+LFYV PESVPIPML                                               
Subjt:  PIRLIVGWSGMRSYGKEYITASLIREFLMIAVFRMLDLILFYVLPESVPIPMLCGAEHLLFAGIKLFLCRGLVHLVCKHSISGRTRPTQLRRTEPDRHIF

Query:  SFWEGTPSSGYLEGVLLGVCRGCVCSREWIPRQVSLGGVDTLARIRVTATREKEKGNCTRPGMDVNSRVGSSVVAGEL-LLVETFLWPPSKGNAGWVKLG
                             GCVCSR+WIPRQVSLGGVDTLA+IRVTATREK KGNCTRPGMD      S +   +  L ++TFLWPPSKGNAG VKLG
Subjt:  SFWEGTPSSGYLEGVLLGVCRGCVCSREWIPRQVSLGGVDTLARIRVTATREKEKGNCTRPGMDVNSRVGSSVVAGEL-LLVETFLWPPSKGNAGWVKLG

Query:  REFENR------------------------------------RASPATQAALFATIREAKDSRVQAENTCIDSGLMTRADDGSSGRSRMMRKSHVR
        R FENR                                    RASPATQAALFAT R+AKDSRVQAENTCIDSGLMTRADDGSSGRSRMMRKSH+R
Subjt:  REFENR------------------------------------RASPATQAALFATIREAKDSRVQAENTCIDSGLMTRADDGSSGRSRMMRKSHVR

TrEMBL top hits | e value | % identity | Alignment
A0A072TTK3 NADH-quinone oxidoreductase protein | 5.3e-70 | 59.7
Query:  MLEHFCECYSDLSGPILCPVLGSITPLFIPNSRIRPIRLI------------------------------------------------------------
        MLEHFCECYSDLSGPILCPVLGSI PLFIPNSRIRPIRLI                                                            
Subjt:  MLEHFCECYSDLSGPILCPVLGSITPLFIPNSRIRPIRLI------------------------------------------------------------

Query:  ------VGWSGMRSYGKEYITASLIREFLMIAVFRMLDLILFYVLPESVPIPMLCGAEHLLFAGIKLFLCRGLVHLVCKHSISGRTRPTQLRRTEPDRHI
              VGWSGMRSYGKEYITASLIREFLMIAVFRMLD +LFYVLPESV IPMLCGAEHLLFAGIKLFLCRGLV              T+L  +  D   
Subjt:  ------VGWSGMRSYGKEYITASLIREFLMIAVFRMLDLILFYVLPESVPIPMLCGAEHLLFAGIKLFLCRGLVHLVCKHSISGRTRPTQLRRTEPDRHI

Query:  FSFWEGTPSSGYLEGVLLGVCRGCVCSREWIPRQVSLGGVDTLARIRVTATREKEKGNCTRPGMDVNS
                     EGVLLGVCRGCVCSREWIPRQVSLGGVDTLARIRVTATREK KGNCT+PGMDVNS
Subjt:  FSFWEGTPSSGYLEGVLLGVCRGCVCSREWIPRQVSLGGVDTLARIRVTATREKEKGNCTRPGMDVNS

A0A0D2QRC7 Proton_antipo_M domain-containing protein | 3.1e-70 | 54.37
Query:  PIRLIVGWSGMRSYGKEYITASLIREFLMIAVFRMLDLILFYVLPESVPIPMLCGAEHLLFAGIKLFLCRGLVHLVCKHSISGRTRPTQLRRTEPDRHIF
        PI ++VGWSGMRSYGKEYITASLIREFLMIAVFRMLDL+LFYV PES                                                     
Subjt:  PIRLIVGWSGMRSYGKEYITASLIREFLMIAVFRMLDLILFYVLPESVPIPMLCGAEHLLFAGIKLFLCRGLVHLVCKHSISGRTRPTQLRRTEPDRHIF

Query:  SFWEGTPSSGYLEGVLLGVCRGCVCSREWIPRQVSLGGVDTLARIRVTATREKEKGNCTRPGMDVNSRVGSSVVAGEL-LLVETFLWPPSKGNAGWVKLG
                    EGVLLGVCRGCVCSREWIPRQVSLGGVDTLARIRVTATREK KGNCTRPGMD      S +   +  L +ETFLWPPSKGNAG VKLG
Subjt:  SFWEGTPSSGYLEGVLLGVCRGCVCSREWIPRQVSLGGVDTLARIRVTATREKEKGNCTRPGMDVNSRVGSSVVAGEL-LLVETFLWPPSKGNAGWVKLG

Query:  REFENR-------------------------------------------------RASPATQAALFATIREAKDSRVQAENTCIDSGLMTRADDGSSGRS
        R FENR                                                 RASPATQAALFAT R+AKDSRVQAENTCIDSGLMTRADDGSSGRS
Subjt:  REFENR-------------------------------------------------RASPATQAALFATIREAKDSRVQAENTCIDSGLMTRADDGSSGRS

Query:  RMMRKSHVR
        RMMRKSH+R
Subjt:  RMMRKSHVR

A0A103XTX3 Proton_antipo_M domain-containing protein (Fragment) | 2.9e-84 | 57.68
Query:  MLEHFCECYSDLSGPILCPVLGSITPLFIPNSRIRPIRLI------------------------------------------------------------
        MLEHFCECYSDLSG ILCPVLGSITPLFIPNSRIRP+RLI                                                            
Subjt:  MLEHFCECYSDLSGPILCPVLGSITPLFIPNSRIRPIRLI------------------------------------------------------------

Query:  ------VGWSGMRSYGKEYITASLIREFLMIAVFRMLDLILFYVLPESVPIPMLCGAEHLLFAGIKLFLCRGLVHLVCKHSISGRTRPTQLRRTEPDRHI
              VGWSG RSYGKEYITASLIREFLMIAVFRMLD +LFYVLPESV IPMLCGAEHL+FAGIKLFLCRGLV              T+L  +  D   
Subjt:  ------VGWSGMRSYGKEYITASLIREFLMIAVFRMLDLILFYVLPESVPIPMLCGAEHLLFAGIKLFLCRGLVHLVCKHSISGRTRPTQLRRTEPDRHI

Query:  FSFWEGTPSSGYLEGVLLGVCRGCVCSREWIPRQVSLGGVDTLARIRVTATREKEKGNCTRPGMDVNSRVGSSVVAGE--LLLVETFLWPPSK-----GN
                     EGVLLGVCRGCVCSREWIPRQVSLGGVDTLARIRVTATREKEKGNCTRPGMDVNS    S +  +     VETFLWP  K     G 
Subjt:  FSFWEGTPSSGYLEGVLLGVCRGCVCSREWIPRQVSLGGVDTLARIRVTATREKEKGNCTRPGMDVNSRVGSSVVAGE--LLLVETFLWPPSK-----GN

Query:  AGWVKLGREFENRRASPATQAALFATIREAKDSRVQAENTCIDSG
         G  +    +E  RASPA QAALFAT REAKD RVQAENT IDSG
Subjt:  AGWVKLGREFENRRASPATQAALFATIREAKDSRVQAENTCIDSG

A0A5D3AF22 Proton_antipo_M domain-containing protein | 1.1e-67 | 54.73
Query:  PIRLIVGWSGMRSYGKEYITASLIREFLMIAVFRMLDLILFYVLPESVPIPMLCGAEHLLFAGIKLFLCRGLVHLVCKHSISGRTRPTQLRRTEPDRHIF
        PI ++VGWSGMRSYGKEYITASLIREFLMIAVFRMLDL+LFYV PESVPIPML                                               
Subjt:  PIRLIVGWSGMRSYGKEYITASLIREFLMIAVFRMLDLILFYVLPESVPIPMLCGAEHLLFAGIKLFLCRGLVHLVCKHSISGRTRPTQLRRTEPDRHIF

Query:  SFWEGTPSSGYLEGVLLGVCRGCVCSREWIPRQVSLGGVDTLARIRVTATREKEKGNCTRPGMDVNSRVGSSVVAGEL-LLVETFLWPPSKGNAGWVKLG
                             GCVCSR+WIPRQVSLGGVDTLA+IRVTATREK KGNCTRPGMD      S +   +  L ++TFLWPPSKGNAG VKLG
Subjt:  SFWEGTPSSGYLEGVLLGVCRGCVCSREWIPRQVSLGGVDTLARIRVTATREKEKGNCTRPGMDVNSRVGSSVVAGEL-LLVETFLWPPSKGNAGWVKLG

Query:  REFENR------------------------------------RASPATQAALFATIREAKDSRVQAENTCIDSGLMTRADDGSSGRSRMMRKSHVR
        R FENR                                    RASPATQAALFAT R+AKDSRVQAENTCIDSGLMTRADDGSSGRSRMMRKSH+R
Subjt:  REFENR------------------------------------RASPATQAALFATIREAKDSRVQAENTCIDSGLMTRADDGSSGRSRMMRKSHVR

A0A6V7PYX7 Uncharacterized protein | 9.7e-64 | 47.85
Query:  MLEHFCECYSDLSGPILCPVLGSITPLFIPNSRIRPIRLI------------------------------------------------------------
        MLEHFCECYSDLSGPI CPVLGSITPLFIPNS IRPIRLI                                                            
Subjt:  MLEHFCECYSDLSGPILCPVLGSITPLFIPNSRIRPIRLI------------------------------------------------------------

Query:  ------VGWSGMRSYGKEYITASLIREFLMIAVFRMLDLILFYVLPESVPIPMLCGAEHLLFAGIKLFLCRGLVHLVCKHSISGRTRPTQLRRTEPDRHI
              VGWSGMRSYGKEYITASLIREFLMIAV  MLD +LFYVLPESVPIPMLCGAEHL+FAGIKLFLCRGLV              T+L  +  D   
Subjt:  ------VGWSGMRSYGKEYITASLIREFLMIAVFRMLDLILFYVLPESVPIPMLCGAEHLLFAGIKLFLCRGLVHLVCKHSISGRTRPTQLRRTEPDRHI

Query:  FSFWEGTPSSGYLEGVLLGVCRGCVCSREWIPRQVSLGGVDTLARIRVTATREKE-KGNCTRPGMDVNSRVGSSVVAGELLLVETFLWPPSKGNAGWVKL
                     EGVLLGVCRGCVCSREWIPRQV  G   T A        EKE K   TR G  +NS        G++ L       P K NAG V+L
Subjt:  FSFWEGTPSSGYLEGVLLGVCRGCVCSREWIPRQVSLGGVDTLARIRVTATREKE-KGNCTRPGMDVNSRVGSSVVAGELLLVETFLWPPSKGNAGWVKL

Query:  GREFENR------------------------------------RASPATQAALFATIREAKDSRVQAENTCI
        GR FENR                                    RASPATQAALFAT REAKDSRV+AE T I
Subjt:  GREFENR------------------------------------RASPATQAALFATIREAKDSRVQAENTCI

SwissProt top hits | e value | % identity | Alignment
P26848 NADH-ubiquinone oxidoreductase chain 4 | 2.8e-07 | 31.37
Query:  MLEHFCECYSDLSGPILCPVLGSITPLFIPNSRIR-----------------------------------------------------------------
        ML+     YS+LSG IL P+LGS+  L IPNSR+R                                                                 
Subjt:  MLEHFCECYSDLSGPILCPVLGSITPLFIPNSRIR-----------------------------------------------------------------

Query:  -PIRLIVGWSGMRSYGKEYITASLIREFLMIAVFRMLDLILFYVLPESVPIPM
         PI ++VG+  ++SY KEY+ A  I E  +IAVF  LDL++FYV  ESV IPM
Subjt:  -PIRLIVGWSGMRSYGKEYITASLIREFLMIAVFRMLDLILFYVLPESVPIPM

P27572 NADH-ubiquinone oxidoreductase chain 4 | 5.2e-22 | 47.06
Query:  MLEHFCECYSDLSGPILCPVLGSITPLFIPNSRIRPIRLI------------------------------------------------------------
        MLEHFCECY DLSG ILCPVLGSI  LFIPNS IR IRLI                                                            
Subjt:  MLEHFCECYSDLSGPILCPVLGSITPLFIPNSRIRPIRLI------------------------------------------------------------

Query:  ------VGWSGMRSYGKEYITASLIREFLMIAVFRMLDLILFYVLPESVPIPM
              VGWSGMRS+GKEYI A LI EFLMIAVF MLDL+LFYV  ESV IPM
Subjt:  ------VGWSGMRSYGKEYITASLIREFLMIAVFRMLDLILFYVLPESVPIPM

P93313 NADH-ubiquinone oxidoreductase chain 4 | 7.2e-24 | 48.37
Query:  MLEHFCECYSDLSGPILCPVLGSITPLFIPNSRIRPIRLI------------------------------------------------------------
        MLEHFCECY +LSG ILCPVLGSI  LFIPNSRIR IRLI                                                            
Subjt:  MLEHFCECYSDLSGPILCPVLGSITPLFIPNSRIRPIRLI------------------------------------------------------------

Query:  ------VGWSGMRSYGKEYITASLIREFLMIAVFRMLDLILFYVLPESVPIPM
              VGWSGMRSYGKEYI A LI EFLMIAVF MLDL+LFYV PESV IPM
Subjt:  ------VGWSGMRSYGKEYITASLIREFLMIAVFRMLDLILFYVLPESVPIPM

Q04050 NADH-ubiquinone oxidoreductase chain 4 | 2.7e-31 | 53.59
Query:  MLEHFCECYSDLSGPILCPVLGSITPLFIPNSRIRPIRLI------------------------------------------------------------
        MLEHFCECYS+LSG ILCPVLGSITPLFIPNSRIRPIRLI                                                            
Subjt:  MLEHFCECYSDLSGPILCPVLGSITPLFIPNSRIRPIRLI------------------------------------------------------------

Query:  ------VGWSGMRSYGKEYITASLIREFLMIAVFRMLDLILFYVLPESVPIPM
              VGWSGMRSYGKEYITA LIREFLMIAVFRMLDL+LFYV PESVPIPM
Subjt:  ------VGWSGMRSYGKEYITASLIREFLMIAVFRMLDLILFYVLPESVPIPM

Q37617 NADH-ubiquinone oxidoreductase chain 4 | 5.2e-06 | 51.92
Query:  PIRLIVGWSGMRSYGKEYITASLIREFLMIAVFRMLDLILFYVLPESVPIPM
        PI ++V W+ +  Y KEY  A L+ E LM+ VF +LDL+LFY+  ESV IPM
Subjt:  PIRLIVGWSGMRSYGKEYITASLIREFLMIAVFRMLDLILFYVLPESVPIPM

Arabidopsis top hits | e value | % identity | Alignment
ATMG00580.1 NADH dehydrogenase subunit 4 | 2.8e-31 | 52.94
Query:  MLEHFCECYSDLSGPILCPVLGSITPLFIPNSRIRPIRLI------------------------------------------------------------
        MLEHFCECYS+LSG ILCPVLGSIT LFIPNSRIRPIRLI                                                            
Subjt:  MLEHFCECYSDLSGPILCPVLGSITPLFIPNSRIRPIRLI------------------------------------------------------------

Query:  ------VGWSGMRSYGKEYITASLIREFLMIAVFRMLDLILFYVLPESVPIPM
              VGWSGMRSYGKEYITA LIREFLMIAVFRMLDL+LFYV PESVPIPM
Subjt:  ------VGWSGMRSYGKEYITASLIREFLMIAVFRMLDLILFYVLPESVPIPM


Sequences
CDS sequence
ATGATGCTAAACCGAAACCGAGCTCGATCGAGAGGGCAGCCCCTTCCTTTGTTGGCTCTTCGAGACAAAGCACTCATAGGAATGGTGCTAATTCCCATGTTCTTTCGAGT
CTCGTTTGGTTTCGAGAGCCTCCCCTCCCCGTCCCTTACTCGTCTTTTGACAGCCGTCATCCTGGATTTTTTTCCGTATGCCGGGGAAAGGGGGGGGGAGACGGGACTCA
TTACTTCCCACTGGGCGAGAGGTGCACCTACCCACTCATCAATCACGGCGCCCATCTGTTTACTGTTGTCAACGAATCTCTTCAATGTTCAATTCGACTCAATGTTAGAA
CATTTCTGTGAATGCTATTCTGATTTAAGTGGTCCTATTCTGTGTCCCGTGCTAGGAAGCATTACTCCTCTTTTCATTCCAAATTCAAGAATACGACCGATACGATTGAT
TGTGGGTTGGTCTGGTATGAGAAGTTATGGGAAAGAGTATATTACAGCATCTCTAATTCGTGAATTTCTAATGATCGCCGTGTTCCGCATGCTGGATCTGATACTATTCT
ATGTTCTTCCCGAAAGCGTGCCAATCCCTATGTTGTGCGGAGCTGAGCATCTTCTATTCGCTGGGATCAAGCTTTTCCTCTGCAGGGGCCTTGTGCATCTTGTGTGTAAG
CATAGCATTTCTGGTCGAACCCGCCCAACCCAACTAAGAAGAACCGAACCTGACAGACACATCTTTTCCTTTTGGGAGGGTACTCCGAGTAGTGGGTACCTCGAAGGGGT
GCTCCTAGGTGTGTGTAGGGGTTGTGTTTGTTCGCGAGAATGGATTCCTCGTCAAGTCAGTTTGGGGGGTGTGGACACACTTGCGCGAATTCGGGTAACGGCTACAAGGG
AGAAAGAGAAAGGAAACTGTACCCGACCAGGGATGGACGTAAACTCGCGGGTTGGTTCCTCTGTCGTCGCCGGGGAGCTCTTGCTGGTGGAGACATTTCTTTGGCCCCCT
TCAAAAGGAAATGCGGGCTGGGTGAAGCTCGGCAGAGAGTTCGAGAATAGGCGGGCATCTCCCGCAACGCAAGCTGCATTGTTCGCCACTATCCGAGAAGCAAAAGATTC
GAGAGTCCAGGCTGAAAATACATGCATAGATAGTGGTCTAATGACAAGGGCCGACGACGGAAGCTCGGGACGGAGCCGTATGATGCGGAAGTCTCACGTACGGTTCCCTG
AGAAGGGAGTGGCTACCTACTGGAGCTTCGACCAACTACCATCGGTCAATTCCGCTTTGGGGCCACCCCTTACTCTACCATTATTATAG
mRNA sequence
ATGATGCTAAACCGAAACCGAGCTCGATCGAGAGGGCAGCCCCTTCCTTTGTTGGCTCTTCGAGACAAAGCACTCATAGGAATGGTGCTAATTCCCATGTTCTTTCGAGT
CTCGTTTGGTTTCGAGAGCCTCCCCTCCCCGTCCCTTACTCGTCTTTTGACAGCCGTCATCCTGGATTTTTTTCCGTATGCCGGGGAAAGGGGGGGGGAGACGGGACTCA
TTACTTCCCACTGGGCGAGAGGTGCACCTACCCACTCATCAATCACGGCGCCCATCTGTTTACTGTTGTCAACGAATCTCTTCAATGTTCAATTCGACTCAATGTTAGAA
CATTTCTGTGAATGCTATTCTGATTTAAGTGGTCCTATTCTGTGTCCCGTGCTAGGAAGCATTACTCCTCTTTTCATTCCAAATTCAAGAATACGACCGATACGATTGAT
TGTGGGTTGGTCTGGTATGAGAAGTTATGGGAAAGAGTATATTACAGCATCTCTAATTCGTGAATTTCTAATGATCGCCGTGTTCCGCATGCTGGATCTGATACTATTCT
ATGTTCTTCCCGAAAGCGTGCCAATCCCTATGTTGTGCGGAGCTGAGCATCTTCTATTCGCTGGGATCAAGCTTTTCCTCTGCAGGGGCCTTGTGCATCTTGTGTGTAAG
CATAGCATTTCTGGTCGAACCCGCCCAACCCAACTAAGAAGAACCGAACCTGACAGACACATCTTTTCCTTTTGGGAGGGTACTCCGAGTAGTGGGTACCTCGAAGGGGT
GCTCCTAGGTGTGTGTAGGGGTTGTGTTTGTTCGCGAGAATGGATTCCTCGTCAAGTCAGTTTGGGGGGTGTGGACACACTTGCGCGAATTCGGGTAACGGCTACAAGGG
AGAAAGAGAAAGGAAACTGTACCCGACCAGGGATGGACGTAAACTCGCGGGTTGGTTCCTCTGTCGTCGCCGGGGAGCTCTTGCTGGTGGAGACATTTCTTTGGCCCCCT
TCAAAAGGAAATGCGGGCTGGGTGAAGCTCGGCAGAGAGTTCGAGAATAGGCGGGCATCTCCCGCAACGCAAGCTGCATTGTTCGCCACTATCCGAGAAGCAAAAGATTC
GAGAGTCCAGGCTGAAAATACATGCATAGATAGTGGTCTAATGACAAGGGCCGACGACGGAAGCTCGGGACGGAGCCGTATGATGCGGAAGTCTCACGTACGGTTCCCTG
AGAAGGGAGTGGCTACCTACTGGAGCTTCGACCAACTACCATCGGTCAATTCCGCTTTGGGGCCACCCCTTACTCTACCATTATTATAG
Protein sequence
MMLNRNRARSRGQPLPLLALRDKALIGMVLIPMFFRVSFGFESLPSPSLTRLLTAVILDFFPYAGERGGETGLITSHWARGAPTHSSITAPICLLLSTNLFNVQFDSMLE
HFCECYSDLSGPILCPVLGSITPLFIPNSRIRPIRLIVGWSGMRSYGKEYITASLIREFLMIAVFRMLDLILFYVLPESVPIPMLCGAEHLLFAGIKLFLCRGLVHLVCK
HSISGRTRPTQLRRTEPDRHIFSFWEGTPSSGYLEGVLLGVCRGCVCSREWIPRQVSLGGVDTLARIRVTATREKEKGNCTRPGMDVNSRVGSSVVAGELLLVETFLWPP
SKGNAGWVKLGREFENRRASPATQAALFATIREAKDSRVQAENTCIDSGLMTRADDGSSGRSRMMRKSHVRFPEKGVATYWSFDQLPSVNSALGPPLTLPLL