; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg034673 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg034673
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionNADH-ubiquinone oxidoreductase chain 6
Genome locationscaffold7:46305745..46308795
RNA-Seq ExpressionSpg034673
SyntenySpg034673
Gene Ontology termsGO:0015986 - ATP synthesis coupled proton transport (biological process)
GO:0022900 - electron transport chain (biological process)
GO:0009535 - chloroplast thylakoid membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0031966 - mitochondrial membrane (cellular component)
GO:0045263 - proton-transporting ATP synthase complex, coupling factor F(o) (cellular component)
GO:0070469 - respirasome (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0008137 - NADH dehydrogenase (ubiquinone) activity (molecular function)
GO:0015078 - proton transmembrane transporter activity (molecular function)
InterPro domainsIPR001457 - NADH:ubiquinone/plastoquinone oxidoreductase, chain 6
IPR002942 - RNA-binding S4 domain
IPR036986 - RNA-binding S4 domain superfamily
IPR042106 - NADH-ubiquinone/plastoquinone oxidoreductase chain 6, subunit NuoJ


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KJB09804.1 hypothetical protein B456_001G168000, partial [Gossypium raimondii]9.2e-5591.91Show/hide
Query:  EGRRTMILFVLSSPALVSGLMVVRAKNLVHSVLFPILVFRNTSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVVMI-HIQIAEIHEEVLRYLPVSGIIRL
        EGRRTMIL VLSSPALVSGLMV RAKN VHSVLFPILVFR+TSGLLLLLGLDF AMIFPVVHIGAIAVSFLFVVM+ HIQIAEIHEEVLRYLPVSGII L
Subjt:  EGRRTMILFVLSSPALVSGLMVVRAKNLVHSVLFPILVFRNTSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVVMI-HIQIAEIHEEVLRYLPVSGIIRL

Query:  IFWWEMLFILDNETIPLLPTQRNTTSLRYTVYAGKM
        IFWWEM FILDNETIPLLPTQRNTTSLRYTVYA K+
Subjt:  IFWWEMLFILDNETIPLLPTQRNTTSLRYTVYAGKM

MBA0757956.1 hypothetical protein [Gossypium trilobum]6.2e-6790.74Show/hide
Query:  GGEASRNSDRPGNITNSRRRERFEIPEGRRTMILFVLSSPALVSGLMVVRAKNLVHSVLFPILVFRNTSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVV
        GGEASR SDRP +I NSRRRERFEIPEGRRTMIL VLSSPALVSGLMV RAKN VHSVLFPILVFR+TSGLLLLLGLDF AMIFPVVHIGAIAVSFLFVV
Subjt:  GGEASRNSDRPGNITNSRRRERFEIPEGRRTMILFVLSSPALVSGLMVVRAKNLVHSVLFPILVFRNTSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVV

Query:  MI-HIQIAEIHEEVLRYLPVSGIIRLIFWWEMLFILDNETIPLLPTQRNTTSLRYTVYAGKM
        M+ HIQIAEIHEEVLRYLPVSGII LIFWWEM FILDNETIPLLPTQRNTTSLRYTVYA K+
Subjt:  MI-HIQIAEIHEEVLRYLPVSGIIRLIFWWEMLFILDNETIPLLPTQRNTTSLRYTVYAGKM

PWA38432.1 NADH:ubiquinone/plastoquinone oxidoreductase, chain 6 [Artemisia annua]3.2e-5583.12Show/hide
Query:  EIPEGRRTMILFVLSSPALVSGLMVVRAKNLVHSVLFPILVFRNTSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVVMI-HIQIAEIHEEVLRYLPVSGI
        ++ EGRRTMIL VLSS ALVSGLMVVRAKN VHSVLFPI VFRNTSGLLLLLGLDFFAMIFPVV+IGAIAVSFLFVVM+ HIQIAEIHEEVLRYLPVSGI
Subjt:  EIPEGRRTMILFVLSSPALVSGLMVVRAKNLVHSVLFPILVFRNTSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVVMI-HIQIAEIHEEVLRYLPVSGI

Query:  IRLIFWWEMLFILDNETIPLLPTQRNTTSLRYTVYAGKMAVTVRKRRRLAWLWL
        I LIFWWEM FILDNE+IPLLPTQRNTTSLRY VYAGK+ +    + RL   WL
Subjt:  IRLIFWWEMLFILDNETIPLLPTQRNTTSLRYTVYAGKMAVTVRKRRRLAWLWL

TXG46253.1 hypothetical protein EZV62_028284 [Acer yangbiense]1.2e-9487.78Show/hide
Query:  MHRGTKRTSYIPFPLNPETRSDVIPVRLHFSETISQARQSINHRRVCVNKGMGPRTLKGGGEASRNSDRPGNITNSRRRERFEIPEGRRTMILFVLSSPA
        MHRGT++TSYIPFPLNPETR DVIPVRLHF ETI QARQ I+HRRVCVN GMGP  LKGGGEASR SDRP +I +SRRRERFEIPEGRRTMIL VLSS A
Subjt:  MHRGTKRTSYIPFPLNPETRSDVIPVRLHFSETISQARQSINHRRVCVNKGMGPRTLKGGGEASRNSDRPGNITNSRRRERFEIPEGRRTMILFVLSSPA

Query:  LVSGLMVVRAKNLVHSVLFPILVFRNTSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVVMI-HIQIAEIHEEVLRYLPVSGIIRLIFWWEMLFILDNETI
        LVSGLMV RAKN VHSVLFPI VFR+TSGLLLLLGLDF AMIFPVVHIGAIAVSFLFVVM+ HIQIAEIHEEVLRYLPVSGII LIFWWEM FILDNETI
Subjt:  LVSGLMVVRAKNLVHSVLFPILVFRNTSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVVMI-HIQIAEIHEEVLRYLPVSGIIRLIFWWEMLFILDNETI

Query:  PLLPTQRNTTSLRYTVYAGKM
        PLLPTQRNTTSLRYTVYAGK+
Subjt:  PLLPTQRNTTSLRYTVYAGKM

XP_039833224.1 NADH-ubiquinone oxidoreductase chain 6 [Panicum virgatum]1.6e-5487.94Show/hide
Query:  RFEIPEGRRTMILFVLSSPALVSGLMVVRAKNLVHSVLFPILVFRNTSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVVMI-HIQIAEIHEEVLRYLPVS
        +F   EGRRTMIL VLSSPALVSGLMVVRAKN VHSVLFPILVF +TSGLL+LLGLDF AMIFPVVHIGAIAVSFLFVVM+ +IQIAEIHEEVLRYLPVS
Subjt:  RFEIPEGRRTMILFVLSSPALVSGLMVVRAKNLVHSVLFPILVFRNTSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVVMI-HIQIAEIHEEVLRYLPVS

Query:  GIIRLIFWWEMLFILDNETIPLLPTQRNTTSLRYTVYAGKM
        GII LIFWWEM FILDNETIPLLPT RNTTSLRYTVYAGK+
Subjt:  GIIRLIFWWEMLFILDNETIPLLPTQRNTTSLRYTVYAGKM

TrEMBL top hitse value%identityAlignment
A0A0D2PSI1 Uncharacterized protein (Fragment)4.5e-5591.91Show/hide
Query:  EGRRTMILFVLSSPALVSGLMVVRAKNLVHSVLFPILVFRNTSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVVMI-HIQIAEIHEEVLRYLPVSGIIRL
        EGRRTMIL VLSSPALVSGLMV RAKN VHSVLFPILVFR+TSGLLLLLGLDF AMIFPVVHIGAIAVSFLFVVM+ HIQIAEIHEEVLRYLPVSGII L
Subjt:  EGRRTMILFVLSSPALVSGLMVVRAKNLVHSVLFPILVFRNTSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVVMI-HIQIAEIHEEVLRYLPVSGIIRL

Query:  IFWWEMLFILDNETIPLLPTQRNTTSLRYTVYAGKM
        IFWWEM FILDNETIPLLPTQRNTTSLRYTVYA K+
Subjt:  IFWWEMLFILDNETIPLLPTQRNTTSLRYTVYAGKM

A0A2U1KNW6 NAD(P)H dehydrogenase subunit 61.5e-5583.12Show/hide
Query:  EIPEGRRTMILFVLSSPALVSGLMVVRAKNLVHSVLFPILVFRNTSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVVMI-HIQIAEIHEEVLRYLPVSGI
        ++ EGRRTMIL VLSS ALVSGLMVVRAKN VHSVLFPI VFRNTSGLLLLLGLDFFAMIFPVV+IGAIAVSFLFVVM+ HIQIAEIHEEVLRYLPVSGI
Subjt:  EIPEGRRTMILFVLSSPALVSGLMVVRAKNLVHSVLFPILVFRNTSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVVMI-HIQIAEIHEEVLRYLPVSGI

Query:  IRLIFWWEMLFILDNETIPLLPTQRNTTSLRYTVYAGKMAVTVRKRRRLAWLWL
        I LIFWWEM FILDNE+IPLLPTQRNTTSLRY VYAGK+ +    + RL   WL
Subjt:  IRLIFWWEMLFILDNETIPLLPTQRNTTSLRYTVYAGKMAVTVRKRRRLAWLWL

A0A5C7GNW2 S4 RNA-binding domain-containing protein5.8e-9587.78Show/hide
Query:  MHRGTKRTSYIPFPLNPETRSDVIPVRLHFSETISQARQSINHRRVCVNKGMGPRTLKGGGEASRNSDRPGNITNSRRRERFEIPEGRRTMILFVLSSPA
        MHRGT++TSYIPFPLNPETR DVIPVRLHF ETI QARQ I+HRRVCVN GMGP  LKGGGEASR SDRP +I +SRRRERFEIPEGRRTMIL VLSS A
Subjt:  MHRGTKRTSYIPFPLNPETRSDVIPVRLHFSETISQARQSINHRRVCVNKGMGPRTLKGGGEASRNSDRPGNITNSRRRERFEIPEGRRTMILFVLSSPA

Query:  LVSGLMVVRAKNLVHSVLFPILVFRNTSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVVMI-HIQIAEIHEEVLRYLPVSGIIRLIFWWEMLFILDNETI
        LVSGLMV RAKN VHSVLFPI VFR+TSGLLLLLGLDF AMIFPVVHIGAIAVSFLFVVM+ HIQIAEIHEEVLRYLPVSGII LIFWWEM FILDNETI
Subjt:  LVSGLMVVRAKNLVHSVLFPILVFRNTSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVVMI-HIQIAEIHEEVLRYLPVSGIIRLIFWWEMLFILDNETI

Query:  PLLPTQRNTTSLRYTVYAGKM
        PLLPTQRNTTSLRYTVYAGK+
Subjt:  PLLPTQRNTTSLRYTVYAGKM

A0A5C7GQ79 S4 RNA-binding domain-containing protein2.6e-7186.59Show/hide
Query:  MHRGTKRTSYIPFPLNPETRSDVIPVRLHFSETISQARQSINHRRVCVNKGMGPRTLKGGGEASRNSDRPGNITNSRRRERFEIPEGRRTMILFVLSSPA
        MHRGT++TSYIPFPLNPETR DVIPVRLHF ETI QARQ I+HRRVCVN GMGP  LKGGGEASR SDRP +I +SRRRERFEIPEGRRTMIL VLSS A
Subjt:  MHRGTKRTSYIPFPLNPETRSDVIPVRLHFSETISQARQSINHRRVCVNKGMGPRTLKGGGEASRNSDRPGNITNSRRRERFEIPEGRRTMILFVLSSPA

Query:  LVSGLMVVRAKNLVHSVLFPILVFRNTSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVVMI-HIQIAEIHEEVLRYLPV
        LVSGLMV RAKN VHSVLFPI VFR+TSGLLLLLGLDF AMIFPVVHIGAIAVSFLFVVM+ HIQIAEIHEEVLRYLPV
Subjt:  LVSGLMVVRAKNLVHSVLFPILVFRNTSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVVMI-HIQIAEIHEEVLRYLPV

A0A7J9DB23 Uncharacterized protein (Fragment)3.0e-6790.74Show/hide
Query:  GGEASRNSDRPGNITNSRRRERFEIPEGRRTMILFVLSSPALVSGLMVVRAKNLVHSVLFPILVFRNTSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVV
        GGEASR SDRP +I NSRRRERFEIPEGRRTMIL VLSSPALVSGLMV RAKN VHSVLFPILVFR+TSGLLLLLGLDF AMIFPVVHIGAIAVSFLFVV
Subjt:  GGEASRNSDRPGNITNSRRRERFEIPEGRRTMILFVLSSPALVSGLMVVRAKNLVHSVLFPILVFRNTSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVV

Query:  MI-HIQIAEIHEEVLRYLPVSGIIRLIFWWEMLFILDNETIPLLPTQRNTTSLRYTVYAGKM
        M+ HIQIAEIHEEVLRYLPVSGII LIFWWEM FILDNETIPLLPTQRNTTSLRYTVYA K+
Subjt:  MI-HIQIAEIHEEVLRYLPVSGIIRLIFWWEMLFILDNETIPLLPTQRNTTSLRYTVYAGKM

SwissProt top hitse value%identityAlignment
P26850 NADH-ubiquinone oxidoreductase chain 62.6e-3670.45Show/hide
Query:  MILF-VLSSPALVSGLMVVRAKNLVHSVLFPILVFRNTSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVV-MIHIQIAEIHEEVLRYLPVSGIIRLIFWW
        MILF V    ALVSG MV+RAKN VHSVLF ILVF NTSGLL+LLGLDFFAMIF VV++GAIAV FLFVV M+HI+I EIHE VLRYLPV GII LIF  
Subjt:  MILF-VLSSPALVSGLMVVRAKNLVHSVLFPILVFRNTSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVV-MIHIQIAEIHEEVLRYLPVSGIIRLIFWW

Query:  EMLFILDNETIPLLPTQRNTTSLRYTVYAGKM
        E+  ++DN+ IP+LPT+ + T L YTVYAGK+
Subjt:  EMLFILDNETIPLLPTQRNTTSLRYTVYAGKM

P60497 NADH-ubiquinone oxidoreductase chain 61.1e-5088.55Show/hide
Query:  MILFVLSSPALVSGLMVVRAKNLVHSVLFPILVFRNTSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVVMI-HIQIAEIHEEVLRYLPVSGIIRLIFWWE
        MIL VLSS ALVSGLMVVRAKN VHSVLF ILVF +TSGLLLLLGLDFFAMIF VV+IGAIAV FLFVVM+ HIQIAEIHEEVLRYLPVSGII LIFWWE
Subjt:  MILFVLSSPALVSGLMVVRAKNLVHSVLFPILVFRNTSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVVMI-HIQIAEIHEEVLRYLPVSGIIRLIFWWE

Query:  MLFILDNETIPLLPTQRNTTSLRYTVYAGKM
        M FILDNE+IPLLPTQRNTTSLRYTVYAGK+
Subjt:  MLFILDNETIPLLPTQRNTTSLRYTVYAGKM

P60498 NADH-ubiquinone oxidoreductase chain 62.5e-5591.6Show/hide
Query:  MILFVLSSPALVSGLMVVRAKNLVHSVLFPILVFRNTSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVVMI-HIQIAEIHEEVLRYLPVSGIIRLIFWWE
        MIL VLSSPALVSGLMV RAKN VHSVLFPI VFR+TSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVVM+ HIQIAEIHEEVLRYLPVSGII LIFWWE
Subjt:  MILFVLSSPALVSGLMVVRAKNLVHSVLFPILVFRNTSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVVMI-HIQIAEIHEEVLRYLPVSGIIRLIFWWE

Query:  MLFILDNETIPLLPTQRNTTSLRYTVYAGKM
        M FILDNE+IPLLPTQRNTTSLRYTVYAGK+
Subjt:  MLFILDNETIPLLPTQRNTTSLRYTVYAGKM

Q02500 NADH-ubiquinone oxidoreductase chain 61.1e-5586.52Show/hide
Query:  RFEIPEGRRTMILFVLSSPALVSGLMVVRAKNLVHSVLFPILVFRNTSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVVMI-HIQIAEIHEEVLRYLPVS
        +F    GRRTMIL VLSSPALVSGLMVVRAKN VHSVLFPILVF +TSGLL+LLGLDF AMI PVVHIGAIAVSFLFVVM+ +IQIAEIHEEVLRYLPVS
Subjt:  RFEIPEGRRTMILFVLSSPALVSGLMVVRAKNLVHSVLFPILVFRNTSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVVMI-HIQIAEIHEEVLRYLPVS

Query:  GIIRLIFWWEMLFILDNETIPLLPTQRNTTSLRYTVYAGKM
        GII LIFWWEM FILDNETIPLLPT RNTTSLRYTVYAGK+
Subjt:  GIIRLIFWWEMLFILDNETIPLLPTQRNTTSLRYTVYAGKM

Q37626 NADH-ubiquinone oxidoreductase chain 62.1e-2554.55Show/hide
Query:  ILFVLSSPALVSGLMVVRAKNLVHSVLFPILVFRNTSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVV-MIHIQIAEIHEEVLRYLPVSGIIRLIFWWEM
        + ++ SS  L+SG +V++A+N VHSVLF +LVF N +GLL+LLGLDFFA+IF VV++GAIAV FLFVV M++I+I EI E+ LRYLPV G++ ++F +E+
Subjt:  ILFVLSSPALVSGLMVVRAKNLVHSVLFPILVFRNTSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVV-MIHIQIAEIHEEVLRYLPVSGIIRLIFWWEM

Query:  LFILDNETIPLLPTQRNTTSL
          ++DN+ IPLL      T+L
Subjt:  LFILDNETIPLLPTQRNTTSL

Arabidopsis top hitse value%identityAlignment
AT2G07718.1 Cytochrome b/b6 protein1.5e-0751.43Show/hide
Query:  TMILFVLSSPALVSGLMVVRAKNLVHSVLFPILVFRNTSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVV
        TMIL VLSSPALVSGLMV RAKNLVHSVLFPI +F + + L       +F  +  + H+       LF++
Subjt:  TMILFVLSSPALVSGLMVVRAKNLVHSVLFPILVFRNTSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVV

AT2G07734.1 Alpha-L RNA-binding motif/Ribosomal protein S4 family protein7.1e-2188.46Show/hide
Query:  MHRGTKRTSYIPFPLNPETRSDVIPVRLHFSETISQARQSINHRRVCVNKGM
        MHRGTKRTSYIPFPLNPETR DVIP+RLHF ETI QARQ I+HRRVCVNKGM
Subjt:  MHRGTKRTSYIPFPLNPETRSDVIPVRLHFSETISQARQSINHRRVCVNKGM

ATMG00270.1 NADH dehydrogenase 61.8e-5691.6Show/hide
Query:  MILFVLSSPALVSGLMVVRAKNLVHSVLFPILVFRNTSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVVMI-HIQIAEIHEEVLRYLPVSGIIRLIFWWE
        MIL VLSSPALVSGLMV RAKN VHSVLFPI VFR+TSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVVM+ HIQIAEIHEEVLRYLPVSGII LIFWWE
Subjt:  MILFVLSSPALVSGLMVVRAKNLVHSVLFPILVFRNTSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVVMI-HIQIAEIHEEVLRYLPVSGIIRLIFWWE

Query:  MLFILDNETIPLLPTQRNTTSLRYTVYAGKM
        M FILDNE+IPLLPTQRNTTSLRYTVYAGK+
Subjt:  MLFILDNETIPLLPTQRNTTSLRYTVYAGKM

ATMG00290.1 mitochondrial ribosomal protein S47.1e-2188.46Show/hide
Query:  MHRGTKRTSYIPFPLNPETRSDVIPVRLHFSETISQARQSINHRRVCVNKGM
        MHRGTKRTSYIPFPLNPETR DVIP+RLHF ETI QARQ I+HRRVCVNKGM
Subjt:  MHRGTKRTSYIPFPLNPETRSDVIPVRLHFSETISQARQSINHRRVCVNKGM

ATMG00590.1 Cytochrome b/b6 protein1.5e-0751.43Show/hide
Query:  TMILFVLSSPALVSGLMVVRAKNLVHSVLFPILVFRNTSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVV
        TMIL VLSSPALVSGLMV RAKNLVHSVLFPI +F + + L       +F  +  + H+       LF++
Subjt:  TMILFVLSSPALVSGLMVVRAKNLVHSVLFPILVFRNTSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACAGAGGAACTAAACGAACTTCTTATATCCCTTTTCCACTCAATCCAGAAACAAGATCGGACGTTATTCCGGTTCGTCTCCATTTTAGTGAAACTATTTCTCAAGC
AAGGCAGTCGATAAATCATCGAAGGGTTTGTGTGAATAAAGGAATGGGGCCCAGAACGCTTAAAGGTGGGGGAGAAGCAAGCCGAAACTCTGATCGACCTGGAAACATCA
CAAATTCTCGGCGTCGAGAGAGATTTGAGATTCCGGAAGGACGACGTACCATGATACTTTTCGTTTTGTCTAGCCCTGCTTTGGTCTCTGGTTTGATGGTTGTACGTGCT
AAAAATTTGGTACATTCCGTTTTGTTTCCCATCCTAGTCTTTCGCAACACTTCAGGTTTACTTCTTTTGTTAGGTCTCGACTTCTTCGCTATGATCTTCCCAGTAGTTCA
TATAGGAGCTATAGCCGTTTCATTCCTATTCGTTGTTATGATTCATATTCAAATAGCGGAGATTCACGAAGAAGTCTTGCGCTATTTACCAGTGAGTGGTATTATTAGAC
TGATCTTTTGGTGGGAAATGCTCTTCATTTTAGATAATGAAACCATTCCATTACTACCAACCCAAAGAAATACGACCTCTCTGAGATATACGGTTTATGCCGGAAAGATG
GCAGTAACGGTGAGGAAAAGGCGTAGGTTGGCTTGGCTTTGGCTTGCTCTGGTCATTATGGCGATTGATGATCCCTGTGCAAACAATAAGTCTGAGTCCCTTAAAAAACT
CCCAGTGTATAAAGGAACGTTCGGGCCCCCCAGTCGTCGCAGGGGTCCCTCACTCGGATCACACGAGGAGAGTGGAGACCATTTTCATGCGGGACTCATAGCAAAAGATG
CCTCAAAGCAAACTTCCACGAAAAAGATGAGGGCAGGATTCCCGGAGCGGGAGGGGAGAATGAGGGATGTGAAATTCCACAAAAGGTGGGGAATTGTGTGTGCTTACGTT
ATGAAGAAAGATGAAGAACCTTGCATATGGAGGCAATATAATATAGATCAATTTAAGGAAATTGCCTTAGCCCGGAATAGGGGAAACCAAAAAAGCACCGTACCCGAAAG
GATACAAAAATATCTGAAGGAGCATGGTGACCCCAAGGAGTACGACATTGAGGAACTAGGTGAGAAGTACCGACGGAAGAGATTGTCGCGGTCTCTTGTCAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGCACAGAGGAACTAAACGAACTTCTTATATCCCTTTTCCACTCAATCCAGAAACAAGATCGGACGTTATTCCGGTTCGTCTCCATTTTAGTGAAACTATTTCTCAAGC
AAGGCAGTCGATAAATCATCGAAGGGTTTGTGTGAATAAAGGAATGGGGCCCAGAACGCTTAAAGGTGGGGGAGAAGCAAGCCGAAACTCTGATCGACCTGGAAACATCA
CAAATTCTCGGCGTCGAGAGAGATTTGAGATTCCGGAAGGACGACGTACCATGATACTTTTCGTTTTGTCTAGCCCTGCTTTGGTCTCTGGTTTGATGGTTGTACGTGCT
AAAAATTTGGTACATTCCGTTTTGTTTCCCATCCTAGTCTTTCGCAACACTTCAGGTTTACTTCTTTTGTTAGGTCTCGACTTCTTCGCTATGATCTTCCCAGTAGTTCA
TATAGGAGCTATAGCCGTTTCATTCCTATTCGTTGTTATGATTCATATTCAAATAGCGGAGATTCACGAAGAAGTCTTGCGCTATTTACCAGTGAGTGGTATTATTAGAC
TGATCTTTTGGTGGGAAATGCTCTTCATTTTAGATAATGAAACCATTCCATTACTACCAACCCAAAGAAATACGACCTCTCTGAGATATACGGTTTATGCCGGAAAGATG
GCAGTAACGGTGAGGAAAAGGCGTAGGTTGGCTTGGCTTTGGCTTGCTCTGGTCATTATGGCGATTGATGATCCCTGTGCAAACAATAAGTCTGAGTCCCTTAAAAAACT
CCCAGTGTATAAAGGAACGTTCGGGCCCCCCAGTCGTCGCAGGGGTCCCTCACTCGGATCACACGAGGAGAGTGGAGACCATTTTCATGCGGGACTCATAGCAAAAGATG
CCTCAAAGCAAACTTCCACGAAAAAGATGAGGGCAGGATTCCCGGAGCGGGAGGGGAGAATGAGGGATGTGAAATTCCACAAAAGGTGGGGAATTGTGTGTGCTTACGTT
ATGAAGAAAGATGAAGAACCTTGCATATGGAGGCAATATAATATAGATCAATTTAAGGAAATTGCCTTAGCCCGGAATAGGGGAAACCAAAAAAGCACCGTACCCGAAAG
GATACAAAAATATCTGAAGGAGCATGGTGACCCCAAGGAGTACGACATTGAGGAACTAGGTGAGAAGTACCGACGGAAGAGATTGTCGCGGTCTCTTGTCAAATAA
Protein sequenceShow/hide protein sequence
MHRGTKRTSYIPFPLNPETRSDVIPVRLHFSETISQARQSINHRRVCVNKGMGPRTLKGGGEASRNSDRPGNITNSRRRERFEIPEGRRTMILFVLSSPALVSGLMVVRA
KNLVHSVLFPILVFRNTSGLLLLLGLDFFAMIFPVVHIGAIAVSFLFVVMIHIQIAEIHEEVLRYLPVSGIIRLIFWWEMLFILDNETIPLLPTQRNTTSLRYTVYAGKM
AVTVRKRRRLAWLWLALVIMAIDDPCANNKSESLKKLPVYKGTFGPPSRRRGPSLGSHEESGDHFHAGLIAKDASKQTSTKKMRAGFPEREGRMRDVKFHKRWGIVCAYV
MKKDEEPCIWRQYNIDQFKEIALARNRGNQKSTVPERIQKYLKEHGDPKEYDIEELGEKYRRKRLSRSLVK