; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi01G010920 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi01G010920
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionNADH dehydrogenase subunit 7
Genome locationchr01:9002883..9005501
RNA-Seq ExpressionLsi01G010920
SyntenyLsi01G010920
Gene Ontology termsGO:0005739 - mitochondrion (cellular component)
GO:0016651 - oxidoreductase activity, acting on NAD(P)H (molecular function)
GO:0048038 - quinone binding (molecular function)
GO:0051287 - NAD binding (molecular function)
InterPro domainsIPR001135 - NADH-quinone oxidoreductase, subunit D
IPR022885 - NAD(P)H-quinone oxidoreductase subunit D/H
IPR029014 - [NiFe]-hydrogenase, large subunit


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GEV11856.1 NADH dehydrogenase [ubiquinone] iron-sulfur protein 2 [Tanacetum cinerariifolium]7.5e-7058.33Show/hide
Query:  GVCWDLRRAAPYDVHDQLDPDVPVGTRGDRYDRYCIRIEEMRQSVRIIVQCPNQMPSGMIKADDRKLCPPSRSRMKLSMESCE----------RAVAFVV
        GVCWD RRAAPYDVHDQLDPD+PV TRGDRYDRYCIRIEEMRQSVRIIVQCPN+MPSGMIKADDRKLCP S+SRMKLSMESC           R VAFVV
Subjt:  GVCWDLRRAAPYDVHDQLDPDVPVGTRGDRYDRYCIRIEEMRQSVRIIVQCPNQMPSGMIKADDRKLCPPSRSRMKLSMESCE----------RAVAFVV

Query:  VLQDSNVLHPRSERACRRTIAVPFLSQLHSSNLDKAFLCPFFRLGCWIRDPRSRLDQHPAERGQPLEA-------------REAFSIQCPTTIHSIVGLK
        VLQDSNVL PRS+RAC RT AVPFLS                  G WIRDPRSRLDQHP ERGQ LE                +      +T   I  L 
Subjt:  VLQDSNVLHPRSERACRRTIAVPFLSQLHSSNLDKAFLCPFFRLGCWIRDPRSRLDQHPAERGQPLEA-------------REAFSIQCPTTIHSIVGLK

Query:  HESARP----FFQVGHFF---RNAARDKRDRVYISISIYIFDAVISKWAKRSAFATEKVNGQREGSRLSRALAKRHSHSEALRVFEAL
        H+   P       VG  F   R   R +      S+S        +   KRSAFATEK NGQREG RLSR LA   SHSEALRV EAL
Subjt:  HESARP----FFQVGHFF---RNAARDKRDRVYISISIYIFDAVISKWAKRSAFATEKVNGQREGSRLSRALAKRHSHSEALRVFEAL

KAF3452247.1 hypothetical protein FNV43_RR08345 [Rhamnella rubrinervis]9.5e-6542.76Show/hide
Query:  ISSPGWSTVRQNRHRLGAIDGTCWWLTGLFPRSKRISSPSCYPKARPPSPYPLCLGAGSAQERPVPWWYFPAVAAGARLRAREPTQQTRKGCLSGHLRMI
        +SSPGWSTVRQNRHRLGAIDGT  W +G+ PR+   +SP    KAR P                                +R P     K   +    ++
Subjt:  ISSPGWSTVRQNRHRLGAIDGTCWWLTGLFPRSKRISSPSCYPKARPPSPYPLCLGAGSAQERPVPWWYFPAVAAGARLRAREPTQQTRKGCLSGHLRMI

Query:  RAWWAYPSQPGVCWDLRRAAPYDVHDQLDPDVPVGTRGDRYDRYCIRIEEMRQSVRIIVQCPNQMPSGMIKADDRKLCPPSRSRMKLSMES---------
        R          VCWD RRAAPYDVHDQLDPDVPVGTRGDRYDRYCIRIEEMRQSVRIIVQCPNQMPSGMIKADDRKLCPPSR RMKLSMES         
Subjt:  RAWWAYPSQPGVCWDLRRAAPYDVHDQLDPDVPVGTRGDRYDRYCIRIEEMRQSVRIIVQCPNQMPSGMIKADDRKLCPPSRSRMKLSMES---------

Query:  ---------------------------------------------------------CERAV------------AFVVVLQDSNVLHPRSERACRRTIAV
                                                                     V            ++++  Q   + H R  R   R  A 
Subjt:  ---------------------------------------------------------CERAV------------AFVVVLQDSNVLHPRSERACRRTIAV

Query:  PF---LSQLHSSNLDKAFLCPFFRLGCWIRDPRSRLDQHPAERGQPLEAREAFSIQCPTTIHSIVGLKHESARPFFQVGHFFRNAARDKRDRVYISISIY
         F    S++ +S L      P   LGCWIRDPRSRLDQHPAERGQPLEA                    E+ R                  RV++S S  
Subjt:  PF---LSQLHSSNLDKAFLCPFFRLGCWIRDPRSRLDQHPAERGQPLEAREAFSIQCPTTIHSIVGLKHESARPFFQVGHFFRNAARDKRDRVYISISIY

Query:  IFDAVISKWAKRSAFATEKVNGQREGSRLSRALAKRHSHSEALRVFEAL
              S+ AKR AFATEK NGQREGSRLSRALAK  SHSEALRV EAL
Subjt:  IFDAVISKWAKRSAFATEKVNGQREGSRLSRALAKRHSHSEALRVFEAL

KAF7117280.1 hypothetical protein RHSIM_RhsimMtG0002200 [Rhododendron simsii]2.3e-7975.12Show/hide
Query:  MVLVLRTVNGFASGLWVRRNRISSPGWSTVRQNRHRLGAIDGTCWWLTGLFPRSKRISSPSCYPKARPPSPYPLCLGAGSAQERPVPWWYFPAVAAGARL
        MVLVLRTVN  ASGLW RRNR+SSPGWSTVRQNRHRLGAIDGT  W                      P P  L   +G  + R      FPAV AGARL
Subjt:  MVLVLRTVNGFASGLWVRRNRISSPGWSTVRQNRHRLGAIDGTCWWLTGLFPRSKRISSPSCYPKARPPSPYPLCLGAGSAQERPVPWWYFPAVAAGARL

Query:  RAREPTQQTRKGCLSGHLRMIRAWWAYPSQPGVCWDLRRAAPYDVHDQLDPDVPVGTRGDRYDRYCIRIEEMRQSVRIIVQCPNQMPSGMIKADDRKLCP
         AREPTQQTRKGCLSGHLRMIRAWWA  SQPGVCWD RRAAPYDVHDQ DPDVPVGTRGDRYDRYCIRIEEMRQSVRIIVQC NQMPSGMIKADDRKLCP
Subjt:  RAREPTQQTRKGCLSGHLRMIRAWWAYPSQPGVCWDLRRAAPYDVHDQLDPDVPVGTRGDRYDRYCIRIEEMRQSVRIIVQCPNQMPSGMIKADDRKLCP

Query:  PSRSRMKLSMESC
        PSR RMKLSMESC
Subjt:  PSRSRMKLSMESC

TXG46223.1 hypothetical protein EZV62_028254 [Acer yangbiense]1.6e-7556.52Show/hide
Query:  LGAGSAQERPVPWWY--FPAVAAGARLRAREPTQQTRKGCLSGHLRMIRAWWAYPSQPGVCWDLRRAAPYDVHDQLDPDVPVGTRGDRYDRYCIRIEEMR
        +G  +AQ+    W +  FPAVAAGARLRAREPTQQTRKGCLS HLRMIRAWWAYPSQPGVCWD RRAAPYDVHDQLDPDVPVGTRGDRYDRYCIRIEEMR
Subjt:  LGAGSAQERPVPWWY--FPAVAAGARLRAREPTQQTRKGCLSGHLRMIRAWWAYPSQPGVCWDLRRAAPYDVHDQLDPDVPVGTRGDRYDRYCIRIEEMR

Query:  QSVRIIVQCPNQMPSGMIKADDRKLCPPSRSRMKLSMESCERAVAFVVVLQDSNVLHPRSERACRRTIAVPFLSQLHSSNL----DKAFLCPFFRLGCWI
        QSVRIIVQCPNQMPSGMIKADDRKLCPPSR RMKLSMES  R     +V +   +  P S       ++ P     HS++L          PFF      
Subjt:  QSVRIIVQCPNQMPSGMIKADDRKLCPPSRSRMKLSMESCERAVAFVVVLQDSNVLHPRSERACRRTIAVPFLSQLHSSNL----DKAFLCPFFRLGCWI

Query:  RDPRSRLDQHPAERGQPLEAREAFSIQCPTTIHSIVGLKH--ESARPFFQVGHFFRNAARDKRDRVYISISIYIFDAVISKWAKRSAFATEKVNGQREGS
              +D    E     EAR           H + GL    +  R  + +          +R R  +      F     +WAKRSAFATEK NG REGS
Subjt:  RDPRSRLDQHPAERGQPLEAREAFSIQCPTTIHSIVGLKH--ESARPFFQVGHFFRNAARDKRDRVYISISIYIFDAVISKWAKRSAFATEKVNGQREGS

Query:  RLSRALAKRHSHSEALRVFEAL
        RLSRALAK  SHSEALRV EAL
Subjt:  RLSRALAKRHSHSEALRVFEAL

VDD65199.1 unnamed protein product [Brassica oleracea]6.2e-6465.44Show/hide
Query:  PVPWWYFPAVAAGARLRAREPTQQTRKGCLSGHLRMIRAWWAYPSQPGVCWDLRRAAPYDVHDQLDPDVPVGTRGDRYDRYCIRIEEMRQSVRIIVQCPN
        P P   FPAVAAGARLRAREPTQQ RKGC   HLRMIRAWWAYPSQPGVCWD RRAAPYDVHDQ DPDVPVGTRGDRYDRYCIRIEEMRQS+RIIVQC N
Subjt:  PVPWWYFPAVAAGARLRAREPTQQTRKGCLSGHLRMIRAWWAYPSQPGVCWDLRRAAPYDVHDQLDPDVPVGTRGDRYDRYCIRIEEMRQSVRIIVQCPN

Query:  QMPSGMIKADDRK---------------------------------LCPPS-----RSRM---KLSME------SCERAVAFVVVLQDSNVLHPRSERAC
        QMPSGMIKADD+K                                 L P S     R+R+   KL ++      + ERAVAFVVVLQDSNVL PRSERAC
Subjt:  QMPSGMIKADDRK---------------------------------LCPPS-----RSRM---KLSME------SCERAVAFVVVLQDSNVLHPRSERAC

Query:  RRTIAVPFLSQLHSSNL
        RRT AVPFLS+L  S+L
Subjt:  RRTIAVPFLSQLHSSNL

TrEMBL top hitse value%identityAlignment
A0A3P6HGY3 Uncharacterized protein3.0e-6465.44Show/hide
Query:  PVPWWYFPAVAAGARLRAREPTQQTRKGCLSGHLRMIRAWWAYPSQPGVCWDLRRAAPYDVHDQLDPDVPVGTRGDRYDRYCIRIEEMRQSVRIIVQCPN
        P P   FPAVAAGARLRAREPTQQ RKGC   HLRMIRAWWAYPSQPGVCWD RRAAPYDVHDQ DPDVPVGTRGDRYDRYCIRIEEMRQS+RIIVQC N
Subjt:  PVPWWYFPAVAAGARLRAREPTQQTRKGCLSGHLRMIRAWWAYPSQPGVCWDLRRAAPYDVHDQLDPDVPVGTRGDRYDRYCIRIEEMRQSVRIIVQCPN

Query:  QMPSGMIKADDRK---------------------------------LCPPS-----RSRM---KLSME------SCERAVAFVVVLQDSNVLHPRSERAC
        QMPSGMIKADD+K                                 L P S     R+R+   KL ++      + ERAVAFVVVLQDSNVL PRSERAC
Subjt:  QMPSGMIKADDRK---------------------------------LCPPS-----RSRM---KLSME------SCERAVAFVVVLQDSNVLHPRSERAC

Query:  RRTIAVPFLSQLHSSNL
        RRT AVPFLS+L  S+L
Subjt:  RRTIAVPFLSQLHSSNL

A0A5C7GNT9 Uncharacterized protein7.6e-7656.52Show/hide
Query:  LGAGSAQERPVPWWY--FPAVAAGARLRAREPTQQTRKGCLSGHLRMIRAWWAYPSQPGVCWDLRRAAPYDVHDQLDPDVPVGTRGDRYDRYCIRIEEMR
        +G  +AQ+    W +  FPAVAAGARLRAREPTQQTRKGCLS HLRMIRAWWAYPSQPGVCWD RRAAPYDVHDQLDPDVPVGTRGDRYDRYCIRIEEMR
Subjt:  LGAGSAQERPVPWWY--FPAVAAGARLRAREPTQQTRKGCLSGHLRMIRAWWAYPSQPGVCWDLRRAAPYDVHDQLDPDVPVGTRGDRYDRYCIRIEEMR

Query:  QSVRIIVQCPNQMPSGMIKADDRKLCPPSRSRMKLSMESCERAVAFVVVLQDSNVLHPRSERACRRTIAVPFLSQLHSSNL----DKAFLCPFFRLGCWI
        QSVRIIVQCPNQMPSGMIKADDRKLCPPSR RMKLSMES  R     +V +   +  P S       ++ P     HS++L          PFF      
Subjt:  QSVRIIVQCPNQMPSGMIKADDRKLCPPSRSRMKLSMESCERAVAFVVVLQDSNVLHPRSERACRRTIAVPFLSQLHSSNL----DKAFLCPFFRLGCWI

Query:  RDPRSRLDQHPAERGQPLEAREAFSIQCPTTIHSIVGLKH--ESARPFFQVGHFFRNAARDKRDRVYISISIYIFDAVISKWAKRSAFATEKVNGQREGS
              +D    E     EAR           H + GL    +  R  + +          +R R  +      F     +WAKRSAFATEK NG REGS
Subjt:  RDPRSRLDQHPAERGQPLEAREAFSIQCPTTIHSIVGLKH--ESARPFFQVGHFFRNAARDKRDRVYISISIYIFDAVISKWAKRSAFATEKVNGQREGS

Query:  RLSRALAKRHSHSEALRVFEAL
        RLSRALAK  SHSEALRV EAL
Subjt:  RLSRALAKRHSHSEALRVFEAL

A0A5C7GQJ3 Uncharacterized protein3.2e-6684.31Show/hide
Query:  CYPKAR-PPSPYPLCLGAGSAQERPVPWWYFPAVAAGARLRAREPTQQTRKGCLSGHLRMIRAWWAYPSQPGVCWDLRRAAPYDVHDQLDPDVPVGTRGD
        CYPK R PP     C G             FPAVAAGARLRAREPTQQTRKGCLS HLRMIRAWWAYPSQPGVCWD RRAAPYDVHDQLDPDVPVGTRGD
Subjt:  CYPKAR-PPSPYPLCLGAGSAQERPVPWWYFPAVAAGARLRAREPTQQTRKGCLSGHLRMIRAWWAYPSQPGVCWDLRRAAPYDVHDQLDPDVPVGTRGD

Query:  RYDRYCIRIEEMRQSVRIIVQCPNQMPSGMIKADDRKLCPPSRSRMKLSMESC
        RYDRYCIRIEEMRQSVRIIVQCPNQMPSGMIKADDRKLCPPSR RMKLSMESC
Subjt:  RYDRYCIRIEEMRQSVRIIVQCPNQMPSGMIKADDRKLCPPSRSRMKLSMESC

A0A5N5J0K6 Uncharacterized protein5.1e-6495.93Show/hide
Query:  FPAVAAGARLRAREPTQQTRKGCLSGHLRMIRAWWAYPSQPGVCWDLRRAAPYDVHDQLDPDVPVGTRGDRYDRYCIRIEEMRQSVRIIVQCPNQMPSGM
        FPAVAAG RLRAREPTQQTR GCLSGHLRMIRAWWAYPSQPGVCWDLR+AAPYDVHDQLDPDVPVGTRGDRYDRYCIRIEEMRQS+RIIVQCPNQMPSGM
Subjt:  FPAVAAGARLRAREPTQQTRKGCLSGHLRMIRAWWAYPSQPGVCWDLRRAAPYDVHDQLDPDVPVGTRGDRYDRYCIRIEEMRQSVRIIVQCPNQMPSGM

Query:  IKADDRKLCPPSRSRMKLSMESC
        IKADDRKLCPPSR RMKLSMESC
Subjt:  IKADDRKLCPPSRSRMKLSMESC

K3YY16 NAD(P)H dehydrogenase subunit H3.8e-6766.98Show/hide
Query:  QQTRKGCLSGHLRMIRAWWAYPSQPGVCWDLRRAAPYDVHDQLDPDVPVGTRGDRYDRYCIRIEEMRQSVRIIVQCPNQMPSGMIKADDRKLCPPSRSRM
        QQ +    SG +  ++ WWAYPSQPGVCWD RRAAPYDVHDQ D DVPVGTRGDRYDRYCIRIEEMRQSVRIIVQCPNQMPSGMIKADDRKLCPPSRSRM
Subjt:  QQTRKGCLSGHLRMIRAWWAYPSQPGVCWDLRRAAPYDVHDQLDPDVPVGTRGDRYDRYCIRIEEMRQSVRIIVQCPNQMPSGMIKADDRKLCPPSRSRM

Query:  KLSMESC-------ERAVAFVVVLQDSNVLHPRSERACRRTIAVPFLSQLHSSNL------------------------DKAFLCPFFRLGCW-IRDPRS
        KLSMES         RAVAFVVVLQDSNVL PRSERACRRT A+PF S+L  S+L                         +    P  R   + IRDPRS
Subjt:  KLSMESC-------ERAVAFVVVLQDSNVLHPRSERACRRTIAVPFLSQLHSSNL------------------------DKAFLCPFFRLGCW-IRDPRS

Query:  RLDQHPAERGQPLEA
        RLDQ+PAERGQPLEA
Subjt:  RLDQHPAERGQPLEA

SwissProt top hitse value%identityAlignment
O21270 NADH-ubiquinone oxidoreductase 49 kDa subunit1.2e-2562.96Show/hide
Query:  GVCWDLRRAAPYDVHDQLDPDVPVGTRGDRYDRYCIRIEEMRQSVRIIVQCPNQMPSGMIKADDRKLCPPSRSRMKLSMES
        G+ WDLR+  PYDV+D+L+ D+PVGT+GD YDRY IR+EEMRQS+++I+QC N+MP+G+IK DD+K+ PPSR  MK SME+
Subjt:  GVCWDLRRAAPYDVHDQLDPDVPVGTRGDRYDRYCIRIEEMRQSVRIIVQCPNQMPSGMIKADDRKLCPPSRSRMKLSMES

P93306 NADH dehydrogenase [ubiquinone] iron-sulfur protein 23.5e-3892.68Show/hide
Query:  PGVCWDLRRAAPYDVHDQLDPDVPVGTRGDRYDRYCIRIEEMRQSVRIIVQCPNQMPSGMIKADDRKLCPPSRSRMKLSMES
        PGVCWDLRRAAPYDV+DQLD DVPVGTRGD YDRYCIRIEEMRQS+RIIVQC NQMPSGMIKADDRKLCPPSR RMKLSMES
Subjt:  PGVCWDLRRAAPYDVHDQLDPDVPVGTRGDRYDRYCIRIEEMRQSVRIIVQCPNQMPSGMIKADDRKLCPPSRSRMKLSMES

Q0MQG4 NADH dehydrogenase [ubiquinone] iron-sulfur protein 2, mitochondrial5.9e-2565.43Show/hide
Query:  GVCWDLRRAAPYDVHDQLDPDVPVGTRGDRYDRYCIRIEEMRQSVRIIVQCPNQMPSGMIKADDRKLCPPSRSRMKLSMES
        G+ WDLR+  PYDV+DQ++ DVPVG+RGD YDRY  R+EEMRQS+RII QC N+MP G IK DD K+ PP R+ MK SMES
Subjt:  GVCWDLRRAAPYDVHDQLDPDVPVGTRGDRYDRYCIRIEEMRQSVRIIVQCPNQMPSGMIKADDRKLCPPSRSRMKLSMES

Q36450 NADH dehydrogenase [ubiquinone] iron-sulfur protein 21.0e-4096.3Show/hide
Query:  GVCWDLRRAAPYDVHDQLDPDVPVGTRGDRYDRYCIRIEEMRQSVRIIVQCPNQMPSGMIKADDRKLCPPSRSRMKLSMES
        GVCWDLR+AAPYDVHDQLDPD+PVGTRGDRYDRYCIRIEEMRQSVRIIVQC NQMPSGMIKADDRKLCPPSRSRMKLSMES
Subjt:  GVCWDLRRAAPYDVHDQLDPDVPVGTRGDRYDRYCIRIEEMRQSVRIIVQCPNQMPSGMIKADDRKLCPPSRSRMKLSMES

Q9TC96 NADH dehydrogenase [ubiquinone] iron-sulfur protein 21.8e-2667.9Show/hide
Query:  GVCWDLRRAAPYDVHDQLDPDVPVGTRGDRYDRYCIRIEEMRQSVRIIVQCPNQMPSGMIKADDRKLCPPSRSRMKLSMES
        GV WDLR+  PYDV++++  DVPVGT+GD YDRY  R+EEMRQS+ II+QC NQ+P GMIKADD+K+ PPSRS+MK SMES
Subjt:  GVCWDLRRAAPYDVHDQLDPDVPVGTRGDRYDRYCIRIEEMRQSVRIIVQCPNQMPSGMIKADDRKLCPPSRSRMKLSMES

Arabidopsis top hitse value%identityAlignment
ATCG01110.1 NAD(P)H dehydrogenase subunit H6.3e-0634.48Show/hide
Query:  GVCWDLRRAAPYDVHDQLDPDVPVGTRGDRYDRYCIRIEEMRQSVRIIVQCPNQMPSG
        G+ WDLR+   Y+ +D+ + ++    +GD   RY +R+ EM +S++II Q    +P G
Subjt:  GVCWDLRRAAPYDVHDQLDPDVPVGTRGDRYDRYCIRIEEMRQSVRIIVQCPNQMPSG

ATMG00510.1 NADH dehydrogenase subunit 71.1e-3992.68Show/hide
Query:  PGVCWDLRRAAPYDVHDQLDPDVPVGTRGDRYDRYCIRIEEMRQSVRIIVQCPNQMPSGMIKADDRKLCPPSRSRMKLSMES
        PGVCWD RRAAPYDVHDQ D DVPVGTRGDRYDRYCIRIEEMRQS+RIIVQC NQMPSGMIKADDRKLCPPSR RMKLSMES
Subjt:  PGVCWDLRRAAPYDVHDQLDPDVPVGTRGDRYDRYCIRIEEMRQSVRIIVQCPNQMPSGMIKADDRKLCPPSRSRMKLSMES


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGTTGGTCCTACGGACCGTGAACGGATTCGCCTCTGGCCTCTGGGTACGTCGGAACCGCATAAGTTCACCGGGGTGGAGCACGGTCCGCCAAAATCGGCATAGGTT
AGGTGCTATTGATGGAACATGTTGGTGGCTGACTGGGCTTTTTCCTAGATCAAAGCGGATCAGCTCGCCTTCTTGTTATCCAAAAGCCCGGCCACCTTCCCCCTATCCCT
TATGTTTAGGAGCGGGATCCGCCCAAGAACGACCAGTCCCGTGGTGGTACTTCCCTGCTGTTGCGGCCGGTGCTCGTTTGCGCGCGCGTGAACCAACCCAACAAACAAGG
AAAGGATGCCTCTCGGGGCATCTGAGAATGATTCGAGCTTGGTGGGCCTACCCATCCCAACCAGGGGTATGCTGGGATTTGCGAAGAGCAGCACCTTACGATGTTCATGA
CCAATTGGATCCTGACGTACCAGTAGGTACCAGAGGAGATCGCTATGATCGTTACTGTATCCGTATCGAAGAGATGCGACAAAGTGTTCGGATCATTGTGCAATGTCCTA
ATCAAATGCCTAGTGGCATGATCAAAGCCGATGATCGTAAGCTATGTCCTCCATCACGATCTCGAATGAAACTATCCATGGAATCGTGTGAACGCGCGGTAGCGTTCGTG
GTGGTGCTTCAGGATTCCAATGTACTGCATCCAAGATCAGAACGAGCTTGCCGGCGGACCATTGCCGTCCCATTCTTGAGTCAGCTTCATAGTTCCAACCTAGATAAGGC
TTTTTTATGCCCTTTTTTTAGGTTGGGTTGCTGGATACGGGATCCTCGTAGTAGGCTGGACCAACATCCAGCCGAGAGAGGGCAGCCTTTAGAAGCAAGAGAGGCTTTCA
GTATACAGTGCCCCACAACTATTCATAGTATAGTGGGGTTGAAACACGAGAGTGCCCGCCCTTTCTTTCAAGTAGGCCACTTTTTCCGGAACGCAGCCCGGGATAAGCGT
GACCGTGTATATATATCTATATCTATATATATCTTCGATGCTGTCATTTCGAAATGGGCAAAGAGAAGCGCTTTTGCTACTGAGAAAGTGAACGGTCAGCGCGAAGGTTC
AAGACTTAGCCGAGCATTAGCGAAGCGTCATTCTCATAGTGAGGCGCTTCGAGTTTTCGAAGCGCTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTGTTGGTCCTACGGACCGTGAACGGATTCGCCTCTGGCCTCTGGGTACGTCGGAACCGCATAAGTTCACCGGGGTGGAGCACGGTCCGCCAAAATCGGCATAGGTT
AGGTGCTATTGATGGAACATGTTGGTGGCTGACTGGGCTTTTTCCTAGATCAAAGCGGATCAGCTCGCCTTCTTGTTATCCAAAAGCCCGGCCACCTTCCCCCTATCCCT
TATGTTTAGGAGCGGGATCCGCCCAAGAACGACCAGTCCCGTGGTGGTACTTCCCTGCTGTTGCGGCCGGTGCTCGTTTGCGCGCGCGTGAACCAACCCAACAAACAAGG
AAAGGATGCCTCTCGGGGCATCTGAGAATGATTCGAGCTTGGTGGGCCTACCCATCCCAACCAGGGGTATGCTGGGATTTGCGAAGAGCAGCACCTTACGATGTTCATGA
CCAATTGGATCCTGACGTACCAGTAGGTACCAGAGGAGATCGCTATGATCGTTACTGTATCCGTATCGAAGAGATGCGACAAAGTGTTCGGATCATTGTGCAATGTCCTA
ATCAAATGCCTAGTGGCATGATCAAAGCCGATGATCGTAAGCTATGTCCTCCATCACGATCTCGAATGAAACTATCCATGGAATCGTGTGAACGCGCGGTAGCGTTCGTG
GTGGTGCTTCAGGATTCCAATGTACTGCATCCAAGATCAGAACGAGCTTGCCGGCGGACCATTGCCGTCCCATTCTTGAGTCAGCTTCATAGTTCCAACCTAGATAAGGC
TTTTTTATGCCCTTTTTTTAGGTTGGGTTGCTGGATACGGGATCCTCGTAGTAGGCTGGACCAACATCCAGCCGAGAGAGGGCAGCCTTTAGAAGCAAGAGAGGCTTTCA
GTATACAGTGCCCCACAACTATTCATAGTATAGTGGGGTTGAAACACGAGAGTGCCCGCCCTTTCTTTCAAGTAGGCCACTTTTTCCGGAACGCAGCCCGGGATAAGCGT
GACCGTGTATATATATCTATATCTATATATATCTTCGATGCTGTCATTTCGAAATGGGCAAAGAGAAGCGCTTTTGCTACTGAGAAAGTGAACGGTCAGCGCGAAGGTTC
AAGACTTAGCCGAGCATTAGCGAAGCGTCATTCTCATAGTGAGGCGCTTCGAGTTTTCGAAGCGCTGTAG
Protein sequenceShow/hide protein sequence
MVLVLRTVNGFASGLWVRRNRISSPGWSTVRQNRHRLGAIDGTCWWLTGLFPRSKRISSPSCYPKARPPSPYPLCLGAGSAQERPVPWWYFPAVAAGARLRAREPTQQTR
KGCLSGHLRMIRAWWAYPSQPGVCWDLRRAAPYDVHDQLDPDVPVGTRGDRYDRYCIRIEEMRQSVRIIVQCPNQMPSGMIKADDRKLCPPSRSRMKLSMESCERAVAFV
VVLQDSNVLHPRSERACRRTIAVPFLSQLHSSNLDKAFLCPFFRLGCWIRDPRSRLDQHPAERGQPLEAREAFSIQCPTTIHSIVGLKHESARPFFQVGHFFRNAARDKR
DRVYISISIYIFDAVISKWAKRSAFATEKVNGQREGSRLSRALAKRHSHSEALRVFEAL