; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036992 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036992
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionprotein DCL homolog, chloroplastic-like
Genome locationchr2:2668778..2672677
RNA-Seq ExpressionLag0036992
SyntenyLag0036992
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:1901259 - chloroplast rRNA processing (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0003729 - mRNA binding (molecular function)
InterPro domainsIPR044673 - Protein DCL-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6608449.1 Protein DCL-like, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]2.2e-9487.24Show/hide
Query:  MATSLLIRGHPLLRLALRQCGLCTGIVQVARRSCCTATAASTPPDGNLTSAENTTSVLSASDPPKYQRWDEPDYRKWKNQEKEILSDIEPIISLTKEILH
        M  SLL+RGHPL+RL LR  GLCTGIVQV RRSCCTA  ASTPP G L+SAENTTSVLSA+DPPKYQRWDEP YRKWKNQE+EILSDI+P+ISLTKEILH
Subjt:  MATSLLIRGHPLLRLALRQCGLCTGIVQVARRSCCTATAASTPPDGNLTSAENTTSVLSASDPPKYQRWDEPDYRKWKNQEKEILSDIEPIISLTKEILH

Query:  SNRYVDGERLTSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLRGYIRNKYPSYAERFIRKHFKRSS
        SNRYVDGERLT E E+IVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLR YIRNKYP YAERFIR+HFKR S
Subjt:  SNRYVDGERLTSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLRGYIRNKYPSYAERFIRKHFKRSS

KAG7037785.1 Protein DCL-like, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]8.4e-9487.63Show/hide
Query:  MATSLLIRGHPLLRLALRQCGLCTGIVQVARRSCCTATAASTPPDGNLTSAENTTSVLSASDPPKYQRWDEPDYRKWKNQEKEILSDIEPIISLTKEILH
        M  SLL+RGHPL+RL LR  GLCTGIVQV RRSCCTA  ASTPP G L+SAENTTSVLSA+DPPKYQRWDEP YRKWKNQE+EILSDI+P+ISLTKEILH
Subjt:  MATSLLIRGHPLLRLALRQCGLCTGIVQVARRSCCTATAASTPPDGNLTSAENTTSVLSASDPPKYQRWDEPDYRKWKNQEKEILSDIEPIISLTKEILH

Query:  SNRYVDGERLTSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLRGYIRNKYPSYAERFIRKHFKR
        SNRYVDGERLT E E+IVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLR YIRNKYP YAERFIR+HFKR
Subjt:  SNRYVDGERLTSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLRGYIRNKYPSYAERFIRKHFKR

XP_022135338.1 protein DCL, chloroplastic isoform X1 [Momordica charantia]8.1e-9789.8Show/hide
Query:  MATSLLIRGHPLLRLALRQCGLCTGIVQVARRSCCTATAASTPPDGNLTSAENTTSVLSASDPPKYQRWDEPDYRKWKNQEKEILSDIEPIISLTKEILH
        MA SLL+RGHPLLRL LR  GLCTGIVQV RRSCCTATAA TPPDGNLTSAEN TSVLS+SDPPKY RWDEPDYRKWK+QE+E+L+DIEPIISLTKEILH
Subjt:  MATSLLIRGHPLLRLALRQCGLCTGIVQVARRSCCTATAASTPPDGNLTSAENTTSVLSASDPPKYQRWDEPDYRKWKNQEKEILSDIEPIISLTKEILH

Query:  SNRYVDGERLTSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLRGYIRNKYPSYAERFIRKHFKRSS
        SNRYVDGERLTS  ERIVV+RLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLR YIRNKYPSYAERFIR+HFKR S
Subjt:  SNRYVDGERLTSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLRGYIRNKYPSYAERFIRKHFKRSS

XP_022940238.1 protein DCL homolog, chloroplastic-like [Cucurbita moschata]1.9e-9386.73Show/hide
Query:  MATSLLIRGHPLLRLALRQCGLCTGIVQVARRSCCTATAASTPPDGNLTSAENTTSVLSASDPPKYQRWDEPDYRKWKNQEKEILSDIEPIISLTKEILH
        M  SLL+RGHPL+RL LR  GLC GIVQV RRSCCTA  ASTPP G L+SAENTTSVLSA+DPPKYQRW+EP YRKWKNQE+EILSDI+PIISLTKEILH
Subjt:  MATSLLIRGHPLLRLALRQCGLCTGIVQVARRSCCTATAASTPPDGNLTSAENTTSVLSASDPPKYQRWDEPDYRKWKNQEKEILSDIEPIISLTKEILH

Query:  SNRYVDGERLTSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLRGYIRNKYPSYAERFIRKHFKRSS
        SNRYVDGERLT E E+IVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLR YIRNKYP YAERFIR+HFKR S
Subjt:  SNRYVDGERLTSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLRGYIRNKYPSYAERFIRKHFKRSS

XP_022982288.1 protein DCL homolog, chloroplastic-like isoform X1 [Cucurbita maxima]2.4e-9386.22Show/hide
Query:  MATSLLIRGHPLLRLALRQCGLCTGIVQVARRSCCTATAASTPPDGNLTSAENTTSVLSASDPPKYQRWDEPDYRKWKNQEKEILSDIEPIISLTKEILH
        M  SLL+RGHPL+RL LR  GLCTGI+QV RRSCCTA  ASTPP G L+SAENTTSVLS +DPPKYQRWDEP YRKWKNQE+EILSDI+PIISLTKEILH
Subjt:  MATSLLIRGHPLLRLALRQCGLCTGIVQVARRSCCTATAASTPPDGNLTSAENTTSVLSASDPPKYQRWDEPDYRKWKNQEKEILSDIEPIISLTKEILH

Query:  SNRYVDGERLTSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLRGYIRNKYPSYAERFIRKHFKRSS
        SNRYVDGERLT E E+IVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLR YIRNKYP YA+RFIR+HFKR S
Subjt:  SNRYVDGERLTSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLRGYIRNKYPSYAERFIRKHFKRSS

TrEMBL top hitse value%identityAlignment
A0A1S3BU88 protein DCL, chloroplastic7.2e-9182.65Show/hide
Query:  MATSLLIRGHPLLRLALRQCGLCTGIVQVARRSCCTATAASTPPDGNLTSAENTTSVLSASDPPKYQRWDEPDYRKWKNQEKEILSDIEPIISLTKEILH
        MA S L RGHPLLRL L++ GLCTG+VQV RRSCC+AT ASTP DG+LT+ +N TS++S+S+PPKYQRWDEPDYRKWKNQE+EIL DIEPII LTKEILH
Subjt:  MATSLLIRGHPLLRLALRQCGLCTGIVQVARRSCCTATAASTPPDGNLTSAENTTSVLSASDPPKYQRWDEPDYRKWKNQEKEILSDIEPIISLTKEILH

Query:  SNRYVDGERLTSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLRGYIRNKYPSYAERFIRKHFKRSS
        S+RY DGERLT E ER VVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRC FVIRTDGGWIDFSYQKCLR YIRNKYPS+AERFIR+HFKR S
Subjt:  SNRYVDGERLTSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLRGYIRNKYPSYAERFIRKHFKRSS

A0A6J1C4J4 protein DCL, chloroplastic isoform X13.9e-9789.8Show/hide
Query:  MATSLLIRGHPLLRLALRQCGLCTGIVQVARRSCCTATAASTPPDGNLTSAENTTSVLSASDPPKYQRWDEPDYRKWKNQEKEILSDIEPIISLTKEILH
        MA SLL+RGHPLLRL LR  GLCTGIVQV RRSCCTATAA TPPDGNLTSAEN TSVLS+SDPPKY RWDEPDYRKWK+QE+E+L+DIEPIISLTKEILH
Subjt:  MATSLLIRGHPLLRLALRQCGLCTGIVQVARRSCCTATAASTPPDGNLTSAENTTSVLSASDPPKYQRWDEPDYRKWKNQEKEILSDIEPIISLTKEILH

Query:  SNRYVDGERLTSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLRGYIRNKYPSYAERFIRKHFKRSS
        SNRYVDGERLTS  ERIVV+RLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLR YIRNKYPSYAERFIR+HFKR S
Subjt:  SNRYVDGERLTSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLRGYIRNKYPSYAERFIRKHFKRSS

A0A6J1FNQ4 protein DCL homolog, chloroplastic-like9.1e-9486.73Show/hide
Query:  MATSLLIRGHPLLRLALRQCGLCTGIVQVARRSCCTATAASTPPDGNLTSAENTTSVLSASDPPKYQRWDEPDYRKWKNQEKEILSDIEPIISLTKEILH
        M  SLL+RGHPL+RL LR  GLC GIVQV RRSCCTA  ASTPP G L+SAENTTSVLSA+DPPKYQRW+EP YRKWKNQE+EILSDI+PIISLTKEILH
Subjt:  MATSLLIRGHPLLRLALRQCGLCTGIVQVARRSCCTATAASTPPDGNLTSAENTTSVLSASDPPKYQRWDEPDYRKWKNQEKEILSDIEPIISLTKEILH

Query:  SNRYVDGERLTSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLRGYIRNKYPSYAERFIRKHFKRSS
        SNRYVDGERLT E E+IVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLR YIRNKYP YAERFIR+HFKR S
Subjt:  SNRYVDGERLTSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLRGYIRNKYPSYAERFIRKHFKRSS

A0A6J1IHJ1 protein DCL homolog, chloroplastic-like2.1e-9084.18Show/hide
Query:  MATSLLIRGHPLLRLALRQCGLCTGIVQVARRSCCTATAASTPPDGNLTSAENTTSVLSASDPPKYQRWDEPDYRKWKNQEKEILSDIEPIISLTKEILH
        MA SLL+RGHPLLR  L   GLC+G+VQV RRSCCTATAASTPPDG+LTSAENT SV SAS   KYQRW EPDYRKWK+QE EILSD+EP++SLTKEILH
Subjt:  MATSLLIRGHPLLRLALRQCGLCTGIVQVARRSCCTATAASTPPDGNLTSAENTTSVLSASDPPKYQRWDEPDYRKWKNQEKEILSDIEPIISLTKEILH

Query:  SNRYVDGERLTSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLRGYIRNKYPSYAERFIRKHFKRSS
        SNRYVDGERLTSE ERIVVDRLLAHHPHAEDKIGCGLE IMVDRHPQFR+SRCLFV+RTDGGWIDFSYQKCLR YIR+KYPSYAE FIR+HFKR S
Subjt:  SNRYVDGERLTSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLRGYIRNKYPSYAERFIRKHFKRSS

A0A6J1J4G4 protein DCL homolog, chloroplastic-like isoform X11.2e-9386.22Show/hide
Query:  MATSLLIRGHPLLRLALRQCGLCTGIVQVARRSCCTATAASTPPDGNLTSAENTTSVLSASDPPKYQRWDEPDYRKWKNQEKEILSDIEPIISLTKEILH
        M  SLL+RGHPL+RL LR  GLCTGI+QV RRSCCTA  ASTPP G L+SAENTTSVLS +DPPKYQRWDEP YRKWKNQE+EILSDI+PIISLTKEILH
Subjt:  MATSLLIRGHPLLRLALRQCGLCTGIVQVARRSCCTATAASTPPDGNLTSAENTTSVLSASDPPKYQRWDEPDYRKWKNQEKEILSDIEPIISLTKEILH

Query:  SNRYVDGERLTSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLRGYIRNKYPSYAERFIRKHFKRSS
        SNRYVDGERLT E E+IVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLR YIRNKYP YA+RFIR+HFKR S
Subjt:  SNRYVDGERLTSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLRGYIRNKYPSYAERFIRKHFKRSS

SwissProt top hitse value%identityAlignment
Q42463 Protein DCL, chloroplastic2.9e-2841.1Show/hide
Query:  TSAENTTSVLSASDPPKYQRWDEPDYRKWKNQEKEILSDIEPIISLTKEILHSNRYVDGERLTSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQF
        T++E+   V   SD    ++    D   W + E +IL D  P++   + ILHS +Y  G+RL+ + +R ++ RLL +HP  + KIG G++ I V  HP F
Subjt:  TSAENTTSVLSASDPPKYQRWDEPDYRKWKNQEKEILSDIEPIISLTKEILHSNRYVDGERLTSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQF

Query:  RHSRCLFVIRTDGGWIDFSYQKCLRGYIRNKYPSYAERFIRKHFKR
         +SRCLF++R DG  +DFSY KC++G IR  YP YA+ FI +HF++
Subjt:  RHSRCLFVIRTDGGWIDFSYQKCLRGYIRNKYPSYAERFIRKHFKR

Q5D869 DNA-directed RNA polymerase V subunit 11.5e-2442.24Show/hide
Query:  NQEKEILSDIEPIISLTKEILHSNRYVDGERLTSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLRGYIRN
        ++E+E+LSD+EP++   ++I+H + Y DG+ ++ + +  V++++L  HP  E K+G G++ I VD+H  F  SRC FV+ TDG   DFSY+K L  Y+  
Subjt:  NQEKEILSDIEPIISLTKEILHSNRYVDGERLTSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLRGYIRN

Query:  KYPSYAERFIRKHFKR
        KYP  AE FI K+F +
Subjt:  KYPSYAERFIRKHFKR

Q9C642 Protein DCL homolog, chloroplastic4.4e-2947.37Show/hide
Query:  EKEILSDIEPIISLTKEILHSNRYVDGERLTSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLRGYIRNKY
        E +IL    P++   + ILHS +Y + +RL+ E ER +++ LL +HP  E KIGCG++ IMV  HP F  SRC+F++R DG  +DFSY KC++G I+ KY
Subjt:  EKEILSDIEPIISLTKEILHSNRYVDGERLTSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLRGYIRNKY

Query:  PSYAERFIRKHFKR
        P YA+ FI +HF++
Subjt:  PSYAERFIRKHFKR

Q9LQ02 DNA-directed RNA polymerase IV subunit 14.0e-0631.67Show/hide
Query:  WKNQEKEILSDIEPIISLTKEILHSNRYVDGERLTSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLRGYI
        WKN        IE +    K ILHS    +   L +E +  +V  +L  HP++ +KIG G++ I V +  +   S C  V+R DG + DFSY KC+ G  
Subjt:  WKNQEKEILSDIEPIISLTKEILHSNRYVDGERLTSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLRGYI

Query:  RNKYPSYAERFIRKHFKRSS
        +   P     +  K+ K  +
Subjt:  RNKYPSYAERFIRKHFKRSS

Arabidopsis top hitse value%identityAlignment
AT1G45230.1 Protein of unknown function (DUF3223)3.1e-3047.37Show/hide
Query:  EKEILSDIEPIISLTKEILHSNRYVDGERLTSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLRGYIRNKY
        E +IL    P++   + ILHS +Y + +RL+ E ER +++ LL +HP  E KIGCG++ IMV  HP F  SRC+F++R DG  +DFSY KC++G I+ KY
Subjt:  EKEILSDIEPIISLTKEILHSNRYVDGERLTSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLRGYIRNKY

Query:  PSYAERFIRKHFKR
        P YA+ FI +HF++
Subjt:  PSYAERFIRKHFKR

AT1G45230.2 Protein of unknown function (DUF3223)7.0e-3047.37Show/hide
Query:  EKEILSDIEPIISLTKEILHSNRYVDGERLTSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLRGYIRNKY
        E +IL    P++   + ILHS +Y + +RL+ E ER +++ LL +HP  E KIGCG++ IMV  HP F  SRC+F++R DG  +DFSY KC++G I+ KY
Subjt:  EKEILSDIEPIISLTKEILHSNRYVDGERLTSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLRGYIRNKY

Query:  PSYAERFIRKHFKR
        P YA+ FI +HF++
Subjt:  PSYAERFIRKHFKR

AT1G63020.1 nuclear RNA polymerase D1A2.9e-0731.67Show/hide
Query:  WKNQEKEILSDIEPIISLTKEILHSNRYVDGERLTSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLRGYI
        WKN        IE +    K ILHS    +   L +E +  +V  +L  HP++ +KIG G++ I V +  +   S C  V+R DG + DFSY KC+ G  
Subjt:  WKNQEKEILSDIEPIISLTKEILHSNRYVDGERLTSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLRGYI

Query:  RNKYPSYAERFIRKHFKRSS
        +   P     +  K+ K  +
Subjt:  RNKYPSYAERFIRKHFKRSS

AT2G40030.1 nuclear RNA polymerase D1B1.0e-2542.24Show/hide
Query:  NQEKEILSDIEPIISLTKEILHSNRYVDGERLTSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLRGYIRN
        ++E+E+LSD+EP++   ++I+H + Y DG+ ++ + +  V++++L  HP  E K+G G++ I VD+H  F  SRC FV+ TDG   DFSY+K L  Y+  
Subjt:  NQEKEILSDIEPIISLTKEILHSNRYVDGERLTSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLRGYIRN

Query:  KYPSYAERFIRKHFKR
        KYP  AE FI K+F +
Subjt:  KYPSYAERFIRKHFKR

AT3G46630.1 Protein of unknown function (DUF3223)7.9e-5856.19Show/hide
Query:  MATSLLIRGHPLLRLAL----RQCGLCT-GIVQVARRSCCTA---------TAASTPPDGNLTSAENTTSVLSASDPPKYQRWDEPDYRKWKNQEKEILS
        M + LL+R  PLLR        Q G+   GI+   RR  C+            + +P +G+  +A N TS +  +      R+++PDYRKWKN E EIL 
Subjt:  MATSLLIRGHPLLRLAL----RQCGLCT-GIVQVARRSCCTA---------TAASTPPDGNLTSAENTTSVLSASDPPKYQRWDEPDYRKWKNQEKEILS

Query:  DIEPIISLTKEILHSNRYVDGERLTSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLRGYIRNKYPSYAER
        DIEPI  L KEILHS+RY+DGERL  E E+IV+++LL +HP+++DKIGCGL+ IMVDRHPQFRHSRCLFV+RTDGGWIDFSYQKCLR Y+R+KYPS+AER
Subjt:  DIEPIISLTKEILHSNRYVDGERLTSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLRGYIRNKYPSYAER

Query:  FIRKHFKRSS
        FIR+HFKR+S
Subjt:  FIRKHFKRSS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTACTTCTTTGCTAATCAGGGGGCATCCTCTCCTCCGGCTTGCGCTCAGGCAGTGCGGGCTATGTACAGGGATCGTGCAGGTGGCTCGCCGGTCTTGCTGCACTGC
CACGGCGGCGTCTACTCCACCGGACGGCAACTTAACATCTGCTGAAAATACTACCTCAGTGTTGAGTGCCAGTGACCCACCCAAGTACCAAAGGTGGGACGAGCCTGATT
ATCGAAAGTGGAAGAACCAGGAAAAGGAAATTCTCAGCGACATCGAGCCTATCATATCCCTCACAAAAGAGATCCTCCACTCTAATAGGTATGTAGATGGGGAGCGGTTG
ACATCTGAGGTCGAGAGAATTGTGGTTGACAGGCTTCTGGCTCATCATCCACATGCTGAAGATAAAATTGGATGTGGACTTGAATCCATTATGGTTGATCGGCACCCCCA
ATTTCGGCACTCAAGATGCCTCTTTGTTATAAGGACTGATGGTGGATGGATTGACTTCTCCTATCAAAAATGTCTTCGGGGATATATCCGAAATAAGTACCCATCTTATG
CAGAGCGGTTTATTCGAAAACATTTCAAACGCAGTAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTACTTCTTTGCTAATCAGGGGGCATCCTCTCCTCCGGCTTGCGCTCAGGCAGTGCGGGCTATGTACAGGGATCGTGCAGGTGGCTCGCCGGTCTTGCTGCACTGC
CACGGCGGCGTCTACTCCACCGGACGGCAACTTAACATCTGCTGAAAATACTACCTCAGTGTTGAGTGCCAGTGACCCACCCAAGTACCAAAGGTGGGACGAGCCTGATT
ATCGAAAGTGGAAGAACCAGGAAAAGGAAATTCTCAGCGACATCGAGCCTATCATATCCCTCACAAAAGAGATCCTCCACTCTAATAGGTATGTAGATGGGGAGCGGTTG
ACATCTGAGGTCGAGAGAATTGTGGTTGACAGGCTTCTGGCTCATCATCCACATGCTGAAGATAAAATTGGATGTGGACTTGAATCCATTATGGTTGATCGGCACCCCCA
ATTTCGGCACTCAAGATGCCTCTTTGTTATAAGGACTGATGGTGGATGGATTGACTTCTCCTATCAAAAATGTCTTCGGGGATATATCCGAAATAAGTACCCATCTTATG
CAGAGCGGTTTATTCGAAAACATTTCAAACGCAGTAGTTGA
Protein sequenceShow/hide protein sequence
MATSLLIRGHPLLRLALRQCGLCTGIVQVARRSCCTATAASTPPDGNLTSAENTTSVLSASDPPKYQRWDEPDYRKWKNQEKEILSDIEPIISLTKEILHSNRYVDGERL
TSEVERIVVDRLLAHHPHAEDKIGCGLESIMVDRHPQFRHSRCLFVIRTDGGWIDFSYQKCLRGYIRNKYPSYAERFIRKHFKRSS