; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0009441 (gene) of Chayote v1 genome

Gene IDSed0009441
OrganismSechium edule (Chayote v1)
Descriptionprotein DCL, chloroplastic
Genome locationLG01:21552754..21556473
RNA-Seq ExpressionSed0009441
SyntenySed0009441
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:1901259 - chloroplast rRNA processing (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0003729 - mRNA binding (molecular function)
InterPro domainsIPR044673 - Protein DCL-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6608449.1 Protein DCL-like, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]3.0e-7574.23Show/hide
Query:  MAASFIRRGHSLLRLGFRPSGLCTGIVQVAGRSLCTATAASTPPDGDLTSAENTTSPLSASEPPKYQTRHDADHRKWKDQEKEILSDIEPIISLTREILR
        M AS + RGH L+RLG R  GLCTGIVQV  RS CTA  ASTPP G L+SAENTTS LSA++PPKYQ   +  +RKWK+QE+EILSDI+P+ISLT+EIL 
Subjt:  MAASFIRRGHSLLRLGFRPSGLCTGIVQVAGRSLCTATAASTPPDGDLTSAENTTSPLSASEPPKYQTRHDADHRKWKDQEKEILSDIEPIISLTREILR

Query:  SERYVNGERLTLEDERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRTDGSWIDFSYLMCLRGYIRNKYPSYAERFIEKHFKR
        S RYV+GERLT EDE+IV DRLL HHP AEDKIGCGLESIMV+RHPQFR S CLFVIRTDG WIDFSY  CLR YIRNKYP YAERFI +HFKR
Subjt:  SERYVNGERLTLEDERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRTDGSWIDFSYLMCLRGYIRNKYPSYAERFIEKHFKR

KAG7037785.1 Protein DCL-like, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]3.0e-7574.23Show/hide
Query:  MAASFIRRGHSLLRLGFRPSGLCTGIVQVAGRSLCTATAASTPPDGDLTSAENTTSPLSASEPPKYQTRHDADHRKWKDQEKEILSDIEPIISLTREILR
        M AS + RGH L+RLG R  GLCTGIVQV  RS CTA  ASTPP G L+SAENTTS LSA++PPKYQ   +  +RKWK+QE+EILSDI+P+ISLT+EIL 
Subjt:  MAASFIRRGHSLLRLGFRPSGLCTGIVQVAGRSLCTATAASTPPDGDLTSAENTTSPLSASEPPKYQTRHDADHRKWKDQEKEILSDIEPIISLTREILR

Query:  SERYVNGERLTLEDERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRTDGSWIDFSYLMCLRGYIRNKYPSYAERFIEKHFKR
        S RYV+GERLT EDE+IV DRLL HHP AEDKIGCGLESIMV+RHPQFR S CLFVIRTDG WIDFSY  CLR YIRNKYP YAERFI +HFKR
Subjt:  SERYVNGERLTLEDERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRTDGSWIDFSYLMCLRGYIRNKYPSYAERFIEKHFKR

XP_008452601.1 PREDICTED: protein DCL, chloroplastic [Cucumis melo]2.3e-7573.71Show/hide
Query:  MAASFIRRGHSLLRLGFRPSGLCTGIVQVAGRSLCTATAASTPPDGDLTSAENTTSPLSASEPPKYQTRHDADHRKWKDQEKEILSDIEPIISLTREILR
        MAASF+ RGH LLRLG +  GLCTG+VQV  RS C+AT ASTP DGDLT+ +N TS +S+SEPPKYQ   + D+RKWK+QE+EIL DIEPII LT+EIL 
Subjt:  MAASFIRRGHSLLRLGFRPSGLCTGIVQVAGRSLCTATAASTPPDGDLTSAENTTSPLSASEPPKYQTRHDADHRKWKDQEKEILSDIEPIISLTREILR

Query:  SERYVNGERLTLEDERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRTDGSWIDFSYLMCLRGYIRNKYPSYAERFIEKHFKR
        S RY +GERLTLEDER V DRLL HHP AEDKIGCGLESIMV+RHPQFR S C FVIRTDG WIDFSY  CLR YIRNKYPS+AERFI +HFKR
Subjt:  SERYVNGERLTLEDERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRTDGSWIDFSYLMCLRGYIRNKYPSYAERFIEKHFKR

XP_022135338.1 protein DCL, chloroplastic isoform X1 [Momordica charantia]1.9e-7776.8Show/hide
Query:  MAASFIRRGHSLLRLGFRPSGLCTGIVQVAGRSLCTATAASTPPDGDLTSAENTTSPLSASEPPKYQTRHDADHRKWKDQEKEILSDIEPIISLTREILR
        MAAS + RGH LLRLG R  GLCTGIVQV  RS CTATAA TPPDG+LTSAEN TS LS+S+PPKY    + D+RKWKDQE+E+L+DIEPIISLT+EIL 
Subjt:  MAASFIRRGHSLLRLGFRPSGLCTGIVQVAGRSLCTATAASTPPDGDLTSAENTTSPLSASEPPKYQTRHDADHRKWKDQEKEILSDIEPIISLTREILR

Query:  SERYVNGERLTLEDERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRTDGSWIDFSYLMCLRGYIRNKYPSYAERFIEKHFKR
        S RYV+GERLT  DERIV +RLL HHP AEDKIGCGLESIMV+RHPQFR S CLFVIRTDG WIDFSY  CLR YIRNKYPSYAERFI +HFKR
Subjt:  SERYVNGERLTLEDERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRTDGSWIDFSYLMCLRGYIRNKYPSYAERFIEKHFKR

XP_022940238.1 protein DCL homolog, chloroplastic-like [Cucurbita moschata]4.0e-7574.23Show/hide
Query:  MAASFIRRGHSLLRLGFRPSGLCTGIVQVAGRSLCTATAASTPPDGDLTSAENTTSPLSASEPPKYQTRHDADHRKWKDQEKEILSDIEPIISLTREILR
        M AS + RGH L+RLG R  GLC GIVQV  RS CTA  ASTPP G L+SAENTTS LSA++PPKYQ  ++  +RKWK+QE+EILSDI+PIISLT+EIL 
Subjt:  MAASFIRRGHSLLRLGFRPSGLCTGIVQVAGRSLCTATAASTPPDGDLTSAENTTSPLSASEPPKYQTRHDADHRKWKDQEKEILSDIEPIISLTREILR

Query:  SERYVNGERLTLEDERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRTDGSWIDFSYLMCLRGYIRNKYPSYAERFIEKHFKR
        S RYV+GERLT EDE+IV DRLL HHP AEDKIGCGLESIMV+RHPQFR S CLFVIRTDG WIDFSY  CLR YIRNKYP YAERFI +HFKR
Subjt:  SERYVNGERLTLEDERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRTDGSWIDFSYLMCLRGYIRNKYPSYAERFIEKHFKR

TrEMBL top hitse value%identityAlignment
A0A0A0L0D6 Uncharacterized protein6.2e-7473.2Show/hide
Query:  MAASFIRRGHSLLRLGFRPSGLCTGIVQVAGRSLCTATAASTPPDGDLTSAENTTSPLSASEPPKYQTRHDADHRKWKDQEKEILSDIEPIISLTREILR
        MAASF+ RGH LLRLG +  G CTGIVQV  R  C+ATAASTP DGDLT+ +N TS +S SEPPKY    + D+RKWK+QE+EIL DIEPII LT+EIL 
Subjt:  MAASFIRRGHSLLRLGFRPSGLCTGIVQVAGRSLCTATAASTPPDGDLTSAENTTSPLSASEPPKYQTRHDADHRKWKDQEKEILSDIEPIISLTREILR

Query:  SERYVNGERLTLEDERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRTDGSWIDFSYLMCLRGYIRNKYPSYAERFIEKHFKR
        S RY +GERLTLEDER V DRLL HHP AEDKIGCGLESIMV+RHPQFR S C FVIRTDG WIDFSY  CLR YIRNKYPS+AERFI +HFKR
Subjt:  SERYVNGERLTLEDERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRTDGSWIDFSYLMCLRGYIRNKYPSYAERFIEKHFKR

A0A1S3BU88 protein DCL, chloroplastic1.1e-7573.71Show/hide
Query:  MAASFIRRGHSLLRLGFRPSGLCTGIVQVAGRSLCTATAASTPPDGDLTSAENTTSPLSASEPPKYQTRHDADHRKWKDQEKEILSDIEPIISLTREILR
        MAASF+ RGH LLRLG +  GLCTG+VQV  RS C+AT ASTP DGDLT+ +N TS +S+SEPPKYQ   + D+RKWK+QE+EIL DIEPII LT+EIL 
Subjt:  MAASFIRRGHSLLRLGFRPSGLCTGIVQVAGRSLCTATAASTPPDGDLTSAENTTSPLSASEPPKYQTRHDADHRKWKDQEKEILSDIEPIISLTREILR

Query:  SERYVNGERLTLEDERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRTDGSWIDFSYLMCLRGYIRNKYPSYAERFIEKHFKR
        S RY +GERLTLEDER V DRLL HHP AEDKIGCGLESIMV+RHPQFR S C FVIRTDG WIDFSY  CLR YIRNKYPS+AERFI +HFKR
Subjt:  SERYVNGERLTLEDERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRTDGSWIDFSYLMCLRGYIRNKYPSYAERFIEKHFKR

A0A6J1C4J4 protein DCL, chloroplastic isoform X19.2e-7876.8Show/hide
Query:  MAASFIRRGHSLLRLGFRPSGLCTGIVQVAGRSLCTATAASTPPDGDLTSAENTTSPLSASEPPKYQTRHDADHRKWKDQEKEILSDIEPIISLTREILR
        MAAS + RGH LLRLG R  GLCTGIVQV  RS CTATAA TPPDG+LTSAEN TS LS+S+PPKY    + D+RKWKDQE+E+L+DIEPIISLT+EIL 
Subjt:  MAASFIRRGHSLLRLGFRPSGLCTGIVQVAGRSLCTATAASTPPDGDLTSAENTTSPLSASEPPKYQTRHDADHRKWKDQEKEILSDIEPIISLTREILR

Query:  SERYVNGERLTLEDERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRTDGSWIDFSYLMCLRGYIRNKYPSYAERFIEKHFKR
        S RYV+GERLT  DERIV +RLL HHP AEDKIGCGLESIMV+RHPQFR S CLFVIRTDG WIDFSY  CLR YIRNKYPSYAERFI +HFKR
Subjt:  SERYVNGERLTLEDERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRTDGSWIDFSYLMCLRGYIRNKYPSYAERFIEKHFKR

A0A6J1FNQ4 protein DCL homolog, chloroplastic-like1.9e-7574.23Show/hide
Query:  MAASFIRRGHSLLRLGFRPSGLCTGIVQVAGRSLCTATAASTPPDGDLTSAENTTSPLSASEPPKYQTRHDADHRKWKDQEKEILSDIEPIISLTREILR
        M AS + RGH L+RLG R  GLC GIVQV  RS CTA  ASTPP G L+SAENTTS LSA++PPKYQ  ++  +RKWK+QE+EILSDI+PIISLT+EIL 
Subjt:  MAASFIRRGHSLLRLGFRPSGLCTGIVQVAGRSLCTATAASTPPDGDLTSAENTTSPLSASEPPKYQTRHDADHRKWKDQEKEILSDIEPIISLTREILR

Query:  SERYVNGERLTLEDERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRTDGSWIDFSYLMCLRGYIRNKYPSYAERFIEKHFKR
        S RYV+GERLT EDE+IV DRLL HHP AEDKIGCGLESIMV+RHPQFR S CLFVIRTDG WIDFSY  CLR YIRNKYP YAERFI +HFKR
Subjt:  SERYVNGERLTLEDERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRTDGSWIDFSYLMCLRGYIRNKYPSYAERFIEKHFKR

A0A6J1J4G4 protein DCL homolog, chloroplastic-like isoform X11.6e-7473.2Show/hide
Query:  MAASFIRRGHSLLRLGFRPSGLCTGIVQVAGRSLCTATAASTPPDGDLTSAENTTSPLSASEPPKYQTRHDADHRKWKDQEKEILSDIEPIISLTREILR
        M AS + RGH L+RLG R  GLCTGI+QV  RS CTA  ASTPP G L+SAENTTS LS ++PPKYQ   +  +RKWK+QE+EILSDI+PIISLT+EIL 
Subjt:  MAASFIRRGHSLLRLGFRPSGLCTGIVQVAGRSLCTATAASTPPDGDLTSAENTTSPLSASEPPKYQTRHDADHRKWKDQEKEILSDIEPIISLTREILR

Query:  SERYVNGERLTLEDERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRTDGSWIDFSYLMCLRGYIRNKYPSYAERFIEKHFKR
        S RYV+GERLT EDE+IV DRLL HHP AEDKIGCGLESIMV+RHPQFR S CLFVIRTDG WIDFSY  CLR YIRNKYP YA+RFI +HFKR
Subjt:  SERYVNGERLTLEDERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRTDGSWIDFSYLMCLRGYIRNKYPSYAERFIEKHFKR

SwissProt top hitse value%identityAlignment
Q42463 Protein DCL, chloroplastic1.0e-2536.81Show/hide
Query:  GRSLCTATAASTPPDGDLTSAEN--------------TTS---PLSASEPPKYQTRHDADHRKWKDQEKEILSDIEPIISLTREILRSERYVNGERLTLE
        G  LC   A  T  +G    ++N              TTS    L   E      +   D   W D E +IL D  P++   R IL S +Y  G+RL+ +
Subjt:  GRSLCTATAASTPPDGDLTSAEN--------------TTS---PLSASEPPKYQTRHDADHRKWKDQEKEILSDIEPIISLTREILRSERYVNGERLTLE

Query:  DERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRTDGSWIDFSYLMCLRGYIRNKYPSYAERFIEKHFKRR
         +R +  RLL +HP+ + KIG G++ I V  HP F  S CLF++R DG  +DFSY  C++G IR  YP YA+ FI +HF++R
Subjt:  DERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRTDGSWIDFSYLMCLRGYIRNKYPSYAERFIEKHFKRR

Q5D869 DNA-directed RNA polymerase V subunit 11.2e-2140.87Show/hide
Query:  QEKEILSDIEPIISLTREILRSERYVNGERLTLEDERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRTDGSWIDFSYLMCLRGYIRNK
        +E+E+LSD+EP++   R+I+    Y +G+ ++ +D+  V +++L  HPQ E K+G G++ I V++H  F  S C FV+ TDG+  DFSY   L  Y+  K
Subjt:  QEKEILSDIEPIISLTREILRSERYVNGERLTLEDERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRTDGSWIDFSYLMCLRGYIRNK

Query:  YPSYAERFIEKHFKR
        YP  AE FI+K+F +
Subjt:  YPSYAERFIEKHFKR

Q9C642 Protein DCL homolog, chloroplastic2.2e-2841.91Show/hide
Query:  ASEPPKYQTRHDADHRKWKDQEKEILSDIEPIISLTREILRSERYVNGERLTLEDERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRT
        +SE  + +   +++  ++ D E +IL    P++   R IL S +Y N +RL+ E ER + + LL +HP+ E KIGCG++ IMV  HP F  S C+F++R 
Subjt:  ASEPPKYQTRHDADHRKWKDQEKEILSDIEPIISLTREILRSERYVNGERLTLEDERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRT

Query:  DGSWIDFSYLMCLRGYIRNKYPSYAERFIEKHFKRR
        DG  +DFSY  C++G I+ KYP YA+ FI +HF++R
Subjt:  DGSWIDFSYLMCLRGYIRNKYPSYAERFIEKHFKRR

Q9LQ02 DNA-directed RNA polymerase IV subunit 16.5e-0432.71Show/hide
Query:  DIEPIISLTREILRSERYVNGERLTLEDERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRTDGSWIDFSYLMCLRGYIRNKYPSYAER
        +IE +    + IL S  Y   E L   DE +V   +L  HP + +KIG G++ I V +  +   S C  V+R DG++ DFSY  C+ G  +   P     
Subjt:  DIEPIISLTREILRSERYVNGERLTLEDERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRTDGSWIDFSYLMCLRGYIRNKYPSYAER

Query:  FIEKHFK
        +  K+ K
Subjt:  FIEKHFK

Arabidopsis top hitse value%identityAlignment
AT1G45230.1 Protein of unknown function (DUF3223)1.6e-2941.91Show/hide
Query:  ASEPPKYQTRHDADHRKWKDQEKEILSDIEPIISLTREILRSERYVNGERLTLEDERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRT
        +SE  + +   +++  ++ D E +IL    P++   R IL S +Y N +RL+ E ER + + LL +HP+ E KIGCG++ IMV  HP F  S C+F++R 
Subjt:  ASEPPKYQTRHDADHRKWKDQEKEILSDIEPIISLTREILRSERYVNGERLTLEDERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRT

Query:  DGSWIDFSYLMCLRGYIRNKYPSYAERFIEKHFKRR
        DG  +DFSY  C++G I+ KYP YA+ FI +HF++R
Subjt:  DGSWIDFSYLMCLRGYIRNKYPSYAERFIEKHFKRR

AT1G45230.2 Protein of unknown function (DUF3223)4.6e-2941.91Show/hide
Query:  ASEPPKYQTRHDADHRKWKDQEKEILSDIEPIISLTREILRSERYVNGERLTLEDERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRT
        +SE  + +   +++  ++ D E +IL    P++   R IL S +Y N +RL+ E ER + + LL +HP+ E KIGCG++ IMV  HP F  S C+F++R 
Subjt:  ASEPPKYQTRHDADHRKWKDQEKEILSDIEPIISLTREILRSERYVNGERLTLEDERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRT

Query:  DGSWIDFSYLMCLRGYIRNKYPSYAERFIEKHFKRR
        DG  +DFSY  C++G I+ KYP YA+ FI +HF++R
Subjt:  DGSWIDFSYLMCLRGYIRNKYPSYAERFIEKHFKRR

AT1G63020.1 nuclear RNA polymerase D1A4.6e-0532.71Show/hide
Query:  DIEPIISLTREILRSERYVNGERLTLEDERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRTDGSWIDFSYLMCLRGYIRNKYPSYAER
        +IE +    + IL S  Y   E L   DE +V   +L  HP + +KIG G++ I V +  +   S C  V+R DG++ DFSY  C+ G  +   P     
Subjt:  DIEPIISLTREILRSERYVNGERLTLEDERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRTDGSWIDFSYLMCLRGYIRNKYPSYAER

Query:  FIEKHFK
        +  K+ K
Subjt:  FIEKHFK

AT2G40030.1 nuclear RNA polymerase D1B8.3e-2340.87Show/hide
Query:  QEKEILSDIEPIISLTREILRSERYVNGERLTLEDERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRTDGSWIDFSYLMCLRGYIRNK
        +E+E+LSD+EP++   R+I+    Y +G+ ++ +D+  V +++L  HPQ E K+G G++ I V++H  F  S C FV+ TDG+  DFSY   L  Y+  K
Subjt:  QEKEILSDIEPIISLTREILRSERYVNGERLTLEDERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRTDGSWIDFSYLMCLRGYIRNK

Query:  YPSYAERFIEKHFKR
        YP  AE FI+K+F +
Subjt:  YPSYAERFIEKHFKR

AT3G46630.1 Protein of unknown function (DUF3223)2.2e-4750Show/hide
Query:  MAASFIRRGHSLLRLGF-----RPSGLCTGIVQVAGRSLCTA---------TAASTPPDGDLTSAENTTSPLSASEPPKYQTRHDADHRKWKDQEKEILS
        M +  + R   LLR GF     +   +  GI+    R LC+            + +P +G   +A N TSP+      +Y+   D D+RKWK+ E EIL 
Subjt:  MAASFIRRGHSLLRLGF-----RPSGLCTGIVQVAGRSLCTA---------TAASTPPDGDLTSAENTTSPLSASEPPKYQTRHDADHRKWKDQEKEILS

Query:  DIEPIISLTREILRSERYVNGERLTLEDERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRTDGSWIDFSYLMCLRGYIRNKYPSYAER
        DIEPI  L +EIL S+RY++GERL  EDE+IV ++LL +HP ++DKIGCGL+ IMV+RHPQFR S CLFV+RTDG WIDFSY  CLR Y+R+KYPS+AER
Subjt:  DIEPIISLTREILRSERYVNGERLTLEDERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRTDGSWIDFSYLMCLRGYIRNKYPSYAER

Query:  FIEKHFKR
        FI +HFKR
Subjt:  FIEKHFKR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGCTTCTTTCATACGCAGGGGGCATTCTCTTCTCCGGTTAGGGTTCCGGCCAAGCGGGCTATGTACTGGAATCGTGCAGGTGGCTGGCCGGTCTTTGTGTACTGC
GACGGCGGCGTCTACTCCACCGGACGGCGACTTAACATCTGCTGAAAATACTACCTCGCCGTTGAGTGCCAGTGAGCCACCGAAGTATCAAACGCGGCACGATGCTGATC
ATCGAAAGTGGAAGGATCAGGAAAAGGAAATTCTCAGCGACATCGAGCCTATCATCTCTCTCACTAGAGAAATCCTTCGTTCCGAAAGGTACGTGAATGGGGAGCGATTG
ACATTGGAAGACGAGAGAATTGTTGCTGACAGGCTTCTAGTTCATCATCCACAAGCTGAAGATAAAATCGGATGTGGACTTGAATCTATTATGGTCAACCGGCACCCCCA
GTTTCGGTGTTCAACCTGCCTCTTTGTTATAAGGACTGATGGCTCATGGATTGACTTCTCGTATCTAATGTGTCTTCGAGGTTATATCCGAAATAAATACCCATCATATG
CAGAGCGGTTTATTGAAAAGCATTTCAAACGCAGGATATTTGTTTGA
mRNA sequenceShow/hide mRNA sequence
CCAACAAAATCAGCCCAAAACGCCCAATAACTGATCATCATCCCTAATGAGCTGATCTGCACAAGGCACATGAACATTTGATCCTCATGGCTGCTTCTTTCATACGCAGG
GGGCATTCTCTTCTCCGGTTAGGGTTCCGGCCAAGCGGGCTATGTACTGGAATCGTGCAGGTGGCTGGCCGGTCTTTGTGTACTGCGACGGCGGCGTCTACTCCACCGGA
CGGCGACTTAACATCTGCTGAAAATACTACCTCGCCGTTGAGTGCCAGTGAGCCACCGAAGTATCAAACGCGGCACGATGCTGATCATCGAAAGTGGAAGGATCAGGAAA
AGGAAATTCTCAGCGACATCGAGCCTATCATCTCTCTCACTAGAGAAATCCTTCGTTCCGAAAGGTACGTGAATGGGGAGCGATTGACATTGGAAGACGAGAGAATTGTT
GCTGACAGGCTTCTAGTTCATCATCCACAAGCTGAAGATAAAATCGGATGTGGACTTGAATCTATTATGGTCAACCGGCACCCCCAGTTTCGGTGTTCAACCTGCCTCTT
TGTTATAAGGACTGATGGCTCATGGATTGACTTCTCGTATCTAATGTGTCTTCGAGGTTATATCCGAAATAAATACCCATCATATGCAGAGCGGTTTATTGAAAAGCATT
TCAAACGCAGGATATTTGTTTGAATGATTAAAAGCGTTCTTCATGGTTCGGTGGCCTACATGAAGGCAAGATGGATGTTGAATGCAGTGGAGAATGAGAACAATCGACAA
CCAGTAAAGTGGACTTTCTGTTTATTAACTTGCAGTTCTGTTTTGTTTGTGTTCTAGATTTCTGGATAATCGGATAAGTGCAGGAATGTAGAACGAGATTATTGATCATG
TTTGAAAAGTAATGTAGTATTGAAAAGCTTAAACTGATTTATTTAATCTATTCTTTTTCACTTTTTACCATCTGCTTATCTAGATCTTGCGCGTGTTCTAATGTTGTTCA
TGATGAACCAACCAAGAGAAGTAATCAAACTTCAGGGGCATGAGCAGGTTGAATGGTCGATGATTGCAGCTGTAGAACTGTGGTGGCAAAATGAAGGCTTAAAGCGCATT
TTGTGGGATTAGCCAATAAAGAAATTTTCAAATGAAAATTGATTGAAGAGTTTGATTTTGTGCAATGGTAATTGATTTGCTTGTTTATGTTGGTTTCAAGGATTAGTTTT
TTTTTCAGTTCAATGTTATTTTGGGGGTGATTAGATTTTGTTGTCGAAAAGAGGAGATGAGAGAAAGTCGAAGAGGAAGAAGTGGAGTGATTTGGAGATTGATGTTTTAG
AGAGAAG
Protein sequenceShow/hide protein sequence
MAASFIRRGHSLLRLGFRPSGLCTGIVQVAGRSLCTATAASTPPDGDLTSAENTTSPLSASEPPKYQTRHDADHRKWKDQEKEILSDIEPIISLTREILRSERYVNGERL
TLEDERIVADRLLVHHPQAEDKIGCGLESIMVNRHPQFRCSTCLFVIRTDGSWIDFSYLMCLRGYIRNKYPSYAERFIEKHFKRRIFV