CuGenDBv2

Lag0021676 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0021676
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: NADH-ubiquinone oxidoreductase chain 1
Genome location: chr7:10574330..10585283
RNA-Seq Expression: Lag0021676
Synteny: Lag0021676
Gene Ontology terms:
  GO:0022900 - electron transport chain (biological process)
  GO:0005739 - mitochondrion (cellular component)
  GO:0005886 - plasma membrane (cellular component)
  GO:0016021 - integral component of membrane (cellular component)
  GO:0008137 - NADH dehydrogenase (ubiquinone) activity (molecular function)
InterPro domains:
  IPR001694 - NADH:ubiquinone oxidoreductase, subunit 1/F420H2 oxidoreductase subunit H
  IPR018086 - NADH:ubiquinone oxidoreductase, subunit 1, conserved site
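
The genome location above uses the common `seqid:start..end` notation. As a minimal sketch, the snippet below parses that string and reports the span of the gene on the chromosome. The helper names (`Location`, `parse_location`, `span_length`) are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval as written on the page (hypothetical helper type)."""
    seqid: str
    start: int
    end: int


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string such as 'chr7:10574330..10585283'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is larger than end coordinate")
    return Location(seqid, start, end)


def span_length(loc: Location) -> int:
    """Number of bases covered, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("chr7:10574330..10585283")
print(loc.seqid, span_length(loc))  # chr7 10954
```

The span covers the whole gene model on the chromosome, so it is expected to be longer than the CDS listed further down the page.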


Homology
GenBank top hits (e value, % identity, alignment)
GER38991.1 NADH-ubiquinone oxidoreductase chain 1 [Striga asiatica] (e value: 5.0e-62; % identity: 95.49)
Query:  VAFLVLTERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMMPPSRGADRIRVSLGSHRPTV
        VAFLVL ERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEP+SPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMMPPSRGADRI VSLGSHRPTV
Subjt:  VAFLVLTERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMMPPSRGADRIRVSLGSHRPTV

Query:  ILKLPCFVEKKRTKVRSLPVLFSAANWDRSPAR
        ILKLPC VEKKRTKVRSL VLFS ANWDRSPAR
Subjt:  ILKLPCFVEKKRTKVRSLPVLFSAANWDRSPAR

KAG6383915.1 hypothetical protein SASPL_156340 [Salvia splendens] (e value: 1.2e-55; % identity: 78.98)
Query:  VAFLVLTERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMM--------------------
        VAFLVL ERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEP+SPSSANFSLFRMAPVATFMLSLVARAVVPFDYGM+                    
Subjt:  VAFLVLTERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMM--------------------

Query:  PPSRGADRIRVSLGSHRPTVILKLPCFVEKKRTKVRSLPVLFSAANWDRSPARLLAG
         P+ GADRI VSLGSHRPTVILKLPC VEKKRTKVRSL VLFS ANWDRSPA   AG
Subjt:  PPSRGADRIRVSLGSHRPTVILKLPCFVEKKRTKVRSLPVLFSAANWDRSPARLLAG

KAG8362761.1 hypothetical protein BUALT_BualtUnG0039900 [Buddleja alternifolia] (e value: 2.9e-62; % identity: 94.81)
Query:  QRVAFLVLTERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMMPPSRGADRIRVSLGSHRP
        Q VAFLVL ERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEP+SPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMMPPSRGADRI VSLGSHRP
Subjt:  QRVAFLVLTERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMMPPSRGADRIRVSLGSHRP

Query:  TVILKLPCFVEKKRTKVRSLPVLFSAANWDRSPAR
        TVILKLPC VEKKRTKVRSL VLFS ANWDRSPAR
Subjt:  TVILKLPCFVEKKRTKVRSLPVLFSAANWDRSPAR

QHN95457.1 NADH-ubiquinone oxidoreductase chain [Arachis hypogaea] (e value: 4.6e-52; % identity: 61.54)
Query:  VAFLVLTERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGM---------------------
        VAFLVL ERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEP+SPSSANFSLFRMAPVATFMLSLVA AVVPFDYGM                     
Subjt:  VAFLVLTERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGM---------------------

Query:  -------------------------------------MPPSRGADRIRVSLGSHRPTVILKLPCFVEKKRTKVRSLPVLFSAANWDRSPARLLAGYNDRL
                                             MPPS GADRI VSLGSHRPTVILKLPC VEKKRTKV SL VLFSAANWDRSPAR   G +  L
Subjt:  -------------------------------------MPPSRGADRIRVSLGSHRPTVILKLPCFVEKKRTKVRSLPVLFSAANWDRSPARLLAGYNDRL

Query:  SRIQRLIS
          +++  S
Subjt:  SRIQRLIS

YP_009526578.1 Nad1 [Ammopiptanthus mongolicus] (e value: 3.8e-62; % identity: 93.43)
Query:  RDQRVAFLVLTERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMMPPSRGADRIRVSLGSH
        ++ RVAFLVL ERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEP+SPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMMPPS GADRI VSLGSH
Subjt:  RDQRVAFLVLTERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMMPPSRGADRIRVSLGSH

Query:  RPTVILKLPCFVEKKRTKVRSLPVLFSAANWDRSPAR
        RPTVILKLPC VEKKRTKVRSL VLFSAANWDRSPAR
Subjt:  RPTVILKLPCFVEKKRTKVRSLPVLFSAANWDRSPAR

TrEMBL top hits (e value, % identity, alignment)
A0A1R3KU78 NADH:ubiquinone oxidoreductase (e value: 8.9e-41; % identity: 72.26)
Query:  VAFLVLTERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMMPPSRGADRIRVSLGSHRPTV
        VAFLVL ERKVMAFVQRRKGPDVVGSFGLLQP+ADGLKLILKEP+SPSSANFSLFRMAPVATFMLSLVA AVVPFDYGM                     
Subjt:  VAFLVLTERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMMPPSRGADRIRVSLGSHRPTV

Query:  ILKLPCFVEKKRTKVRSLPVLFSAANWDRSPARLLAG
                 KKRTKVRSL VLFS+ANWDRSPAR L+G
Subjt:  ILKLPCFVEKKRTKVRSLPVLFSAANWDRSPARLLAG

A0A385G2B5 NADH-ubiquinone oxidoreductase chain 1 (e value: 1.8e-62; % identity: 93.43)
Query:  RDQRVAFLVLTERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMMPPSRGADRIRVSLGSH
        ++ RVAFLVL ERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEP+SPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMMPPS GADRI VSLGSH
Subjt:  RDQRVAFLVLTERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMMPPSRGADRIRVSLGSH

Query:  RPTVILKLPCFVEKKRTKVRSLPVLFSAANWDRSPAR
        RPTVILKLPC VEKKRTKVRSL VLFSAANWDRSPAR
Subjt:  RPTVILKLPCFVEKKRTKVRSLPVLFSAANWDRSPAR

A0A4D8Y5W6 Uncharacterized protein (e value: 3.7e-55; % identity: 75.61)
Query:  VAFLVLTERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMM--------------------
        VAFLVL ERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEP+SPSSANFSLFRMAPVATFMLSLVARAVVPFDYGM+                    
Subjt:  VAFLVLTERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMM--------------------

Query:  -------PPSRGADRIRVSLGSHRPTVILKLPCFVEKKRTKVRSLPVLFSAANWDRSPARLLAG
                P+ GADRI VSLGSHRPTVILKLPC VEKKRTKVRSL VLFS ANWDRSPA   AG
Subjt:  -------PPSRGADRIRVSLGSHRPTVILKLPCFVEKKRTKVRSLPVLFSAANWDRSPARLLAG

A0A5A7Q257 NADH-ubiquinone oxidoreductase chain 1 (e value: 2.4e-62; % identity: 95.49)
Query:  VAFLVLTERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMMPPSRGADRIRVSLGSHRPTV
        VAFLVL ERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEP+SPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMMPPSRGADRI VSLGSHRPTV
Subjt:  VAFLVLTERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMMPPSRGADRIRVSLGSHRPTV

Query:  ILKLPCFVEKKRTKVRSLPVLFSAANWDRSPAR
        ILKLPC VEKKRTKVRSL VLFS ANWDRSPAR
Subjt:  ILKLPCFVEKKRTKVRSLPVLFSAANWDRSPAR

A0A6A5LB92 Uncharacterized protein (e value: 8.5e-52; % identity: 76.97)
Query:  MAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGM------------------------------MP
        MAFVQRRKGPDVVGSFGLLQPLADGLKLILKEP+SPSSANFSLFRMAPVATFMLSLVARAVVPFDYGM                              MP
Subjt:  MAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGM------------------------------MP

Query:  PSRGADRIRVSLGSHRPTVILKLPCFVEKKRTKVRSLPVLFSAANWDRSPAR
        PS GADRI VSLGSHRPTVILKLPC VEKKRTKVRSL VLFSAANWDRSPAR
Subjt:  PSRGADRIRVSLGSHRPTVILKLPCFVEKKRTKVRSLPVLFSAANWDRSPAR

SwissProt top hits (e value, % identity, alignment)
P26845 NADH-ubiquinone oxidoreductase chain 1 (e value: 6.6e-25; % identity: 73.75)
Query:  VAFLVLTERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMM
        VAFLVL ERKVMA +QRRKGP+VVG  GLLQPLADGLKL++KEP+ PSSAN  +F MAPV TF L+L A AV+PFDYGM+
Subjt:  VAFLVLTERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMM

P31839 NADH-ubiquinone oxidoreductase chain 1 (e value: 3.5e-34; % identity: 95)
Query:  VAFLVLTERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMM
        VAFLVL ERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEP+SPSSANFSLFRMAPVATFMLSLVA AVVPFDYGM+
Subjt:  VAFLVLTERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMM

P92558 NADH-ubiquinone oxidoreductase chain 1 (e value: 1.7e-33; % identity: 93.75)
Query:  VAFLVLTERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMM
        VAFLVL ERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEP+SPSSANF LFRMAPVATFMLSLVA AVVPFDYGM+
Subjt:  VAFLVLTERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMM

Q01148 NADH-ubiquinone oxidoreductase chain 1 (e value: 1.7e-33; % identity: 93.75)
Query:  VAFLVLTERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMM
        VAFLVL ERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEP+SPSSANF LFRMAPVATFMLSLVA AVVPFDYGM+
Subjt:  VAFLVLTERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMM

Q01300 NADH-ubiquinone oxidoreductase chain 1 (e value: 4.1e-35; % identity: 96.25)
Query:  VAFLVLTERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMM
        VAFLVL ERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEP+SPSSANFSLFRMAPVATFMLSLVARAVVPFDYGM+
Subjt:  VAFLVLTERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMM

Arabidopsis top hits (e value, % identity, alignment)
AT2G07785.1 NADH dehydrogenase family protein (e value: 1.2e-29; % identity: 94.2)
Query:  MAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMM
        MAFVQRRKGPDVVGSFGLLQPLADG KLILKEP+SPSSANF LFRMAPVATFMLSLVARAVVPFDYGM+
Subjt:  MAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMM

ATCG01100.1 NADH dehydrogenase family protein (e value: 8.0e-10; % identity: 42.67)
Query:  VAFLVLTERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPF
        V  +V  ER++ A +Q+R GP+  G  G+LQ LADG KL+ KE L PS  N  LF + P    +  L++ +V+PF
Subjt:  VAFLVLTERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPF

ATMG00516.1 NADH dehydrogenase 1C (e value: 7.2e-35; % identity: 93.75)
Query:  VAFLVLTERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMM
        VAFLVL ERKVMAFVQRRKGPDVVGSFGLLQPLADG KLILKEP+SPSSANF LFRMAPVATFMLSLVARAVVPFDYGM+
Subjt:  VAFLVLTERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMM

ATMG01120.1 NADH dehydrogenase 1B (e value: 7.2e-35; % identity: 93.75)
Query:  VAFLVLTERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMM
        VAFLVL ERKVMAFVQRRKGPDVVGSFGLLQPLADG KLILKEP+SPSSANF LFRMAPVATFMLSLVARAVVPFDYGM+
Subjt:  VAFLVLTERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMM

ATMG01275.1 NADH dehydrogenase 1A (e value: 7.2e-35; % identity: 93.75)
Query:  VAFLVLTERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMM
        VAFLVL ERKVMAFVQRRKGPDVVGSFGLLQPLADG KLILKEP+SPSSANF LFRMAPVATFMLSLVARAVVPFDYGM+
Subjt:  VAFLVLTERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMM
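
The % identity reported for each hit can be checked against the middle line of its alignment, where a letter marks an identical residue, `+` a conservative substitution, and a blank a mismatch or gap. The sketch below is an illustration, not the database's own method: `percent_identity` is a hypothetical helper, and it assumes identity is the number of identical columns divided by the alignment length. It reads the middle line rather than the Subjt line, because the Subjt lines shown on this page repeat the query text. The example uses the SwissProt hit P31839.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter.

    `query` is the aligned query string (gap characters included) and
    `midline` is the match line printed beneath it, both with the
    'Query:' label and its leading padding already removed.
    """
    length = len(query)
    if length == 0:
        raise ValueError("empty alignment")
    # Letters in the midline mark identical residues; '+' and blanks do not.
    identical = sum(1 for ch in midline[:length] if ch.isalpha())
    return round(100.0 * identical / length, 2)


# Alignment of Lag0021676 against P31839, copied from the SwissProt hits.
QUERY = "VAFLVLTERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMM"
MIDLINE = "VAFLVL ERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEP+SPSSANFSLFRMAPVATFMLSLVA AVVPFDYGM+"

print(percent_identity(QUERY, MIDLINE))  # 95.0
```

For hits whose alignment is wrapped over several blocks, the query lines and the middle lines would each need to be concatenated before calling the function.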


Sequences
CDS sequence
ATGGAAGTTGCTTCCTCCAAGTTTGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCTCCCCAAATTC
GAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCACAGTTCGAAGGTTCTCACGCGCTTCGTTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTG
CAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTTCGCGCAACGCTTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACACGCTTCGCTGCAGTTT
CTTCTCCCTAACTTCGAAGGTTCTCATGCGCTTCGCTGCAATTCCTTCCTCCCTAAGTTTGAAGGTTCTCACGTTGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGT
TTTCATGCGTTTTGCTGCAGTTCCTTCTCTCCACGTTCAAAGGTTCTCAGTCGCTTCGCTGCAGTTCCTTCCTCCAAATTTGAAGGTTTCGAAGGTTCTCACGCGCTGTG
ATTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCGCTGCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCGTTGCTACCTTCCTCCAA
GTTCGAAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAAGTTCCCTCACGTGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGC
TCCTTCTCCAAGTTTGAAGACGCTTCTCTCCGTCGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTACAGTTCCTTCCTCCAAGTTCGAAGGTTCCCTCAC
GCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGCTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGCTGCTTCTTCTCCAAGTTCGAAGGCGCTT
CTCTTCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCTACTGCTCCTTCTCCAAGTTCGAAGGTGCTTCTCTCCACCCCTCTTTTTGAAGGTTCGCCACTGAGTTT
CTCCTTCTCCAAGTTCGAAGGTTCACCGTTGCTCCTTTTCAAATGTTTGGCGGCGGTTGACGTCCTCGTTCCGCTTTATCTTCAAATGTTGGTGGTGAAGTCACTGCAAT
TGAATCTGATGACGACCGTTGTAGGCGAGTCGGGTATGGTGACCACCCCTGCAGGTTACTCAGATCACCCAATAAGATGGAGACTGGGTCTAGCAGGAGTGCATGAAGGC
GAATCTGGTGACTACCCCTGCAGGTTACTCAGATCACCCAATAAAATGGGGACTGGTCTAGCAGGAGTGCGTGAAGGCGAATCTGGTGACTACCCCTGCAGGAGTGCATC
ACTGAAGGCGAATCTGGTGACTACCCCTGCACTCAGATCACCCAATAAAATGGGGACTGGTCTAGCAGGAATGCATCACTGCAGGCGAATCTGTGATCCATCAGATAACA
TCAGCCAAGTTCAATCAGATTATGCCACGTGGCAATTGGACCCAAGCCCAAGCCCATATGAGAGTCATCAGAAGTCAGAGAGTTTAGAGAATTCAGAAGATCCTAGATTC
AGAATTCAACCAACTCAAGACTCAGAAGCCAATCGACCGATCAAGAAGATCAACAAGTCAGTAGGCCGATCATCCAAGAGGATCAACAAGCTAACAAGCCGATTCGACAG
ATCATCAAGCCAATCGACCGATCAAGAAGATCAACAAGTCAGCAGACCGATCATCCAAGAAGATCAACAAGCCAACAGGCCGATCATCCAAGAAGATCAACAAGTTAGCA
GACCGATCATCAAACCGATCATCCGAGAAGATCAACAAGTCAGCAGACCGATCATCCAAGAAGATCAACAAGCCAACAGGCCGATCATCCAGGAAGATCAACAAGCCAAC
AAGCCGATCCAAGAGATCATCACGGCAACAGGCCGATCATCCAAGAAGATCAACAAGCCAATAAGCCGATCCAAAAGGCCGATCATCCAAGAAGATCAACAAGTCACAAC
AGGCCGATCCAAGAGATCATTAAGCCGGCAGGCCGATCATCCAAGAAGATCATCAAGCCAACAGGCCGATCCAAGAGATCATCAACCTAGTAAGCCGATCATCCAAGAAG
CTCAACAAGCCAACTGGCCGATCATCCAAGAAGATCATCAAGCCAACAGGCCGATCCAAGAGATCATCAACCTAGCAAGTCGATCATCGAAGAAGCTCAACAAACCAGCC
CAAGAAGATCAAGAAGCTAGAGACCAAAGAGTAGCCTTTTTAGTGCTAACTGAACGTAAAGTAATGGCTTTTGTGCAACGTCGAAAGGGTCCTGATGTAGTGGGATCGTT
CGGATTGTTACAACCTCTAGCAGATGGTTTGAAATTGATTCTAAAAGAACCTCTTTCACCAAGTAGTGCAAATTTCTCCCTTTTTCGAATGGCTCCAGTGGCTACATTTA
TGTTAAGTCTGGTCGCTCGGGCCGTTGTACCTTTTGATTATGGTATGATGCCTCCCAGCCGGGGGGCGGATCGAATCAGAGTTTCCTTAGGTAGCCACAGACCTACAGTT
ATCCTTAAACTTCCGTGCTTTGTGGAGAAGAAGCGAACAAAGGTACGCTCGCTTCCTGTCTTGTTCTCTGCTGCGAACTGGGATCGCTCGCCAGCTAGGTTGCTAGCAGG
CTACAACGATCGTCTTAGCAGGATACAACGACTAATTTCAGTAGAACAGCAACCTCGAATTACTCGGATCTGGCAGAATTCTTGGATACTTGCTCTCTCAACTCAACTCA
AGGTCAAAAGAAACATAGAAAGAAGGAAGAATGCTCCAAATGGAATGAGATGA
mRNA sequence
ATGGAAGTTGCTTCCTCCAAGTTTGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCTCCCCAAATTC
GAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCACAGTTCGAAGGTTCTCACGCGCTTCGTTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGTCGCTTCGCTG
CAGTTCCTTCCTCCAAGTTTGAAGGTTCTCACATCGCTTCGCTTCGCGCAACGCTTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACACGCTTCGCTGCAGTTT
CTTCTCCCTAACTTCGAAGGTTCTCATGCGCTTCGCTGCAATTCCTTCCTCCCTAAGTTTGAAGGTTCTCACGTTGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGT
TTTCATGCGTTTTGCTGCAGTTCCTTCTCTCCACGTTCAAAGGTTCTCAGTCGCTTCGCTGCAGTTCCTTCCTCCAAATTTGAAGGTTTCGAAGGTTCTCACGCGCTGTG
ATTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCGCTGCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCGTTGCTACCTTCCTCCAA
GTTCGAAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAAGTTCCCTCACGTGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCACTGC
TCCTTCTCCAAGTTTGAAGACGCTTCTCTCCGTCGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGCTGCTACAGTTCCTTCCTCCAAGTTCGAAGGTTCCCTCAC
GCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGCTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGCTGCTTCTTCTCCAAGTTCGAAGGCGCTT
CTCTTCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCTACTGCTCCTTCTCCAAGTTCGAAGGTGCTTCTCTCCACCCCTCTTTTTGAAGGTTCGCCACTGAGTTT
CTCCTTCTCCAAGTTCGAAGGTTCACCGTTGCTCCTTTTCAAATGTTTGGCGGCGGTTGACGTCCTCGTTCCGCTTTATCTTCAAATGTTGGTGGTGAAGTCACTGCAAT
TGAATCTGATGACGACCGTTGTAGGCGAGTCGGGTATGGTGACCACCCCTGCAGGTTACTCAGATCACCCAATAAGATGGAGACTGGGTCTAGCAGGAGTGCATGAAGGC
GAATCTGGTGACTACCCCTGCAGGTTACTCAGATCACCCAATAAAATGGGGACTGGTCTAGCAGGAGTGCGTGAAGGCGAATCTGGTGACTACCCCTGCAGGAGTGCATC
ACTGAAGGCGAATCTGGTGACTACCCCTGCACTCAGATCACCCAATAAAATGGGGACTGGTCTAGCAGGAATGCATCACTGCAGGCGAATCTGTGATCCATCAGATAACA
TCAGCCAAGTTCAATCAGATTATGCCACGTGGCAATTGGACCCAAGCCCAAGCCCATATGAGAGTCATCAGAAGTCAGAGAGTTTAGAGAATTCAGAAGATCCTAGATTC
AGAATTCAACCAACTCAAGACTCAGAAGCCAATCGACCGATCAAGAAGATCAACAAGTCAGTAGGCCGATCATCCAAGAGGATCAACAAGCTAACAAGCCGATTCGACAG
ATCATCAAGCCAATCGACCGATCAAGAAGATCAACAAGTCAGCAGACCGATCATCCAAGAAGATCAACAAGCCAACAGGCCGATCATCCAAGAAGATCAACAAGTTAGCA
GACCGATCATCAAACCGATCATCCGAGAAGATCAACAAGTCAGCAGACCGATCATCCAAGAAGATCAACAAGCCAACAGGCCGATCATCCAGGAAGATCAACAAGCCAAC
AAGCCGATCCAAGAGATCATCACGGCAACAGGCCGATCATCCAAGAAGATCAACAAGCCAATAAGCCGATCCAAAAGGCCGATCATCCAAGAAGATCAACAAGTCACAAC
AGGCCGATCCAAGAGATCATTAAGCCGGCAGGCCGATCATCCAAGAAGATCATCAAGCCAACAGGCCGATCCAAGAGATCATCAACCTAGTAAGCCGATCATCCAAGAAG
CTCAACAAGCCAACTGGCCGATCATCCAAGAAGATCATCAAGCCAACAGGCCGATCCAAGAGATCATCAACCTAGCAAGTCGATCATCGAAGAAGCTCAACAAACCAGCC
CAAGAAGATCAAGAAGCTAGAGACCAAAGAGTAGCCTTTTTAGTGCTAACTGAACGTAAAGTAATGGCTTTTGTGCAACGTCGAAAGGGTCCTGATGTAGTGGGATCGTT
CGGATTGTTACAACCTCTAGCAGATGGTTTGAAATTGATTCTAAAAGAACCTCTTTCACCAAGTAGTGCAAATTTCTCCCTTTTTCGAATGGCTCCAGTGGCTACATTTA
TGTTAAGTCTGGTCGCTCGGGCCGTTGTACCTTTTGATTATGGTATGATGCCTCCCAGCCGGGGGGCGGATCGAATCAGAGTTTCCTTAGGTAGCCACAGACCTACAGTT
ATCCTTAAACTTCCGTGCTTTGTGGAGAAGAAGCGAACAAAGGTACGCTCGCTTCCTGTCTTGTTCTCTGCTGCGAACTGGGATCGCTCGCCAGCTAGGTTGCTAGCAGG
CTACAACGATCGTCTTAGCAGGATACAACGACTAATTTCAGTAGAACAGCAACCTCGAATTACTCGGATCTGGCAGAATTCTTGGATACTTGCTCTCTCAACTCAACTCA
AGGTCAAAAGAAACATAGAAAGAAGGAAGAATGCTCCAAATGGAATGAGATGA
Protein sequence
MEVASSKFEGSHALRCSSFPQFEGSHALRCSSFSPNSKVLTRFAAVPSSQFEGSHALRCSSFPQVRRFSRRFAAVPSSKFEGSHIASLRATLRCSSFLQVRRFSHASLQF
LLPNFEGSHALRCNSFLPKFEGSHVASLQFLPPSSKVFMRFAAVPSLHVQRFSVASLQFLPPNLKVSKVLTRCDSLQFLPPSSKVLMRFAAQFLPPSSKVLMRFVATFLQ
VRRFSHALLQLLPPSSKVPSRASLAPSPSSKALLSTAPSPSLKTLLSVATFLQVRRFSHALLQFLPPSSKVPSRASLAPSPSSKALLSAAPSPSSKALLSAASSPSSKAL
LFTAPSPSSKALLSTAPSPSSKVLLSTPLFEGSPLSFSFSKFEGSPLLLFKCLAAVDVLVPLYLQMLVVKSLQLNLMTTVVGESGMVTTPAGYSDHPIRWRLGLAGVHEG
ESGDYPCRLLRSPNKMGTGLAGVREGESGDYPCRSASLKANLVTTPALRSPNKMGTGLAGMHHCRRICDPSDNISQVQSDYATWQLDPSPSPYESHQKSESLENSEDPRF
RIQPTQDSEANRPIKKINKSVGRSSKRINKLTSRFDRSSSQSTDQEDQQVSRPIIQEDQQANRPIIQEDQQVSRPIIKPIIREDQQVSRPIIQEDQQANRPIIQEDQQAN
KPIQEIITATGRSSKKINKPISRSKRPIIQEDQQVTTGRSKRSLSRQADHPRRSSSQQADPRDHQPSKPIIQEAQQANWPIIQEDHQANRPIQEIINLASRSSKKLNKPA
QEDQEARDQRVAFLVLTERKVMAFVQRRKGPDVVGSFGLLQPLADGLKLILKEPLSPSSANFSLFRMAPVATFMLSLVARAVVPFDYGMMPPSRGADRIRVSLGSHRPTV
ILKLPCFVEKKRTKVRSLPVLFSAANWDRSPARLLAGYNDRLSRIQRLISVEQQPRITRIWQNSWILALSTQLKVKRNIERRKNAPNGMR
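
The protein sequence above can be cross-checked against the CDS by translating the CDS codon by codon. The sketch below is a minimal, dependency-free illustration: `translate` is a hypothetical helper, and it assumes the standard genetic code, which the page does not state explicitly. It is applied here to the first and last few codons of the CDS only.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped sequence
    can be pasted in as-is. Codons containing other characters become 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last eight codons of the CDS listed above.
print(translate("ATGGAAGTTGCTTCCTCCAAGTTTGAAGGT"))  # MEVASSKFEG
print(translate("AATGCTCCAAATGGAATGAGATGA"))        # NAPNGMR
```

Both fragments agree with the start (MEVASSKFEG) and end (NAPNGMR) of the protein sequence shown on this page.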