; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg038537 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg038537
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionLYR motif protein
Genome locationscaffold12:3922746..3926266
RNA-Seq ExpressionSpg038537
SyntenySpg038537
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7036664.1 hypothetical protein SDJN02_00284, partial [Cucurbita argyrosperma subsp. argyrosperma]1.7e-12990.55Show/hide
Query:  MSRRALDSRQSIDSCTLKLHGWRPF-QLHSASKTLDSDAHSSAPTSAKPYYYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAAGCYTRRS
        MSRR LDSRQSIDSCTLKLH WRPF  LHSA KTLDSD H S PT++KP YYSS+ LHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAAG YTRRS
Subjt:  MSRRALDSRQSIDSCTLKLHGWRPF-QLHSASKTLDSDAHSSAPTSAKPYYYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAAGCYTRRS

Query:  FRLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFLNGDANWSSDVSEAKNSRREREEKD-HLGGGFCSNGGFDAQGN
        F LIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFP+AVGTDSSGELF+NGDANWSSDVSEAKNSRR R+EKD HLGGGF SNGG DAQGN
Subjt:  FRLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFLNGDANWSSDVSEAKNSRREREEKD-HLGGGFCSNGGFDAQGN

Query:  ESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
        ESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERL GDSR+EIVGENTF+DQKSHHRCRRKKHECRMVD LR
Subjt:  ESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR

XP_022155253.1 uncharacterized protein LOC111022393 [Momordica charantia]5.7e-13391.94Show/hide
Query:  MSRRALDSRQSIDSCTLKLHGWRPFQLHSASKTLDSDAHSSAPTSAKPYYYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAAGCYTRRSF
        MSRRALDSRQSI+SCTLKLH WRPFQLH+A KTLDSD H+SAPT++KP YYSS+GLHTKRPCLSDR TSF+VDAIDMSRLSLIDDDKPSIAAGCYTRRSF
Subjt:  MSRRALDSRQSIDSCTLKLHGWRPFQLHSASKTLDSDAHSSAPTSAKPYYYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAAGCYTRRSF

Query:  RLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFLNGDANWSSDVSEAKNSRREREEKDHLGGGFCSNGGFDAQGNES
        RL+A KRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELF+NGDANWSSDVSEAKNSRREREEKD  G    SNGGFDAQGNES
Subjt:  RLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFLNGDANWSSDVSEAKNSRREREEKDHLGGGFCSNGGFDAQGNES

Query:  GYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
        GYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERL GDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
Subjt:  GYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR

XP_022949469.1 uncharacterized protein LOC111452804 isoform X2 [Cucurbita moschata]9.2e-13191.27Show/hide
Query:  MSRRALDSRQSIDSCTLKLHGWRPF-QLHSASKTLDSDAHSSAPTSAKPYYYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAAGCYTRRS
        MSRR LDSRQSIDSCTLKLH WRPF  LHSA KTLDSD H S PT++KP YYSS+ LHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAAG YTRRS
Subjt:  MSRRALDSRQSIDSCTLKLHGWRPF-QLHSASKTLDSDAHSSAPTSAKPYYYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAAGCYTRRS

Query:  FRLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFLNGDANWSSDVSEAKNSRREREEKD-HLGGGFCSNGGFDAQGN
        F LIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFP+AVGTDSSGELF+NGDANWSSDVSEAKNSRRER+EKD HLGGGF SNGG DAQGN
Subjt:  FRLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFLNGDANWSSDVSEAKNSRREREEKD-HLGGGFCSNGGFDAQGN

Query:  ESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
        ESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERL GDSRMEIVGENTF+DQKSHHRCRRKKHECRMVD LR
Subjt:  ESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR

XP_023524455.1 uncharacterized protein LOC111788369 [Cucurbita pepo subsp. pepo]5.4e-13191.27Show/hide
Query:  MSRRALDSRQSIDSCTLKLHGWRPF-QLHSASKTLDSDAHSSAPTSAKPYYYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAAGCYTRRS
        MSRR LDSRQSIDSCTLKLH WRPF  LHSA KTLDSD H S PT++KP YYSS+ LHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAAG YTRRS
Subjt:  MSRRALDSRQSIDSCTLKLHGWRPF-QLHSASKTLDSDAHSSAPTSAKPYYYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAAGCYTRRS

Query:  FRLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFLNGDANWSSDVSEAKNSRREREEKD-HLGGGFCSNGGFDAQGN
        F LIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFP+AVGTDSSGELF+NGDANWSSDVSEAKNSRRER+EKD HLGGGF SNGG DAQGN
Subjt:  FRLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFLNGDANWSSDVSEAKNSRREREEKD-HLGGGFCSNGGFDAQGN

Query:  ESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
        ESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERL GDSRMEIVGENTF+DQKSHHRCRRKKHECRMVD LR
Subjt:  ESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR

XP_038906083.1 uncharacterized protein LOC120091971 [Benincasa hispida]2.2e-13292.34Show/hide
Query:  MSRRALDSRQSIDSCTLKLHGWRPFQLHSASKTLDSDAHSSAPTSAKPYYYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAAGCYTRRSF
        MSRRALDSRQSIDSCTLKLHGW PF L    KTLDSD HSSAPT++KP YYSS+ LHTKRPCLSDRTTSFNVDAIDMS LSLIDDDKPSIAAGCYTRRS 
Subjt:  MSRRALDSRQSIDSCTLKLHGWRPFQLHSASKTLDSDAHSSAPTSAKPYYYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAAGCYTRRSF

Query:  RLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFLNGDANWSSDVSEAKNSRREREEKDHLGGGF-CSNGGFDAQGNE
        RLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFP+AVGTDSSGELF+NGDANWSSDVSEAKNSRREREEKDHLG GF  SNGGFDAQGNE
Subjt:  RLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFLNGDANWSSDVSEAKNSRREREEKDHLGGGF-CSNGGFDAQGNE

Query:  SGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
        SGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERL GDS+MEIVGENTFADQKSHHRCRRKKHECRMVDALR
Subjt:  SGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR

TrEMBL top hitse value%identityAlignment
A0A6J1DMG8 uncharacterized protein LOC1110223932.8e-13391.94Show/hide
Query:  MSRRALDSRQSIDSCTLKLHGWRPFQLHSASKTLDSDAHSSAPTSAKPYYYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAAGCYTRRSF
        MSRRALDSRQSI+SCTLKLH WRPFQLH+A KTLDSD H+SAPT++KP YYSS+GLHTKRPCLSDR TSF+VDAIDMSRLSLIDDDKPSIAAGCYTRRSF
Subjt:  MSRRALDSRQSIDSCTLKLHGWRPFQLHSASKTLDSDAHSSAPTSAKPYYYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAAGCYTRRSF

Query:  RLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFLNGDANWSSDVSEAKNSRREREEKDHLGGGFCSNGGFDAQGNES
        RL+A KRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELF+NGDANWSSDVSEAKNSRREREEKD  G    SNGGFDAQGNES
Subjt:  RLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFLNGDANWSSDVSEAKNSRREREEKDHLGGGFCSNGGFDAQGNES

Query:  GYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
        GYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERL GDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
Subjt:  GYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR

A0A6J1GC64 uncharacterized protein LOC111452804 isoform X12.3e-12783.89Show/hide
Query:  MSRRALDSRQSIDSCTLKLHGWRPF-QLHSASKTLDSDAHSSAPTSAKPYYYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAAGCYTRRS
        MSRR LDSRQSIDSCTLKLH WRPF  LHSA KTLDSD H S PT++KP YYSS+ LHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAAG YTRRS
Subjt:  MSRRALDSRQSIDSCTLKLHGWRPF-QLHSASKTLDSDAHSSAPTSAKPYYYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAAGCYTRRS

Query:  FRLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFLNGDANWSSDVSEAKNSRREREEKD-HLGGGFCSNGGFDAQGN
        F LIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFP+AVGTDSSGELF+NGDANWSSDVSEAKNSRRER+EKD HLGGGF SNGG DAQGN
Subjt:  FRLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFLNGDANWSSDVSEAKNSRREREEKD-HLGGGFCSNGGFDAQGN

Query:  ESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGG-----------------------DSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
        ESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLG                         SRMEIVGENTF+DQKSHHRCRRKKHECRMVD LR
Subjt:  ESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGG-----------------------DSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR

A0A6J1GCW8 uncharacterized protein LOC111452804 isoform X24.4e-13191.27Show/hide
Query:  MSRRALDSRQSIDSCTLKLHGWRPF-QLHSASKTLDSDAHSSAPTSAKPYYYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAAGCYTRRS
        MSRR LDSRQSIDSCTLKLH WRPF  LHSA KTLDSD H S PT++KP YYSS+ LHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAAG YTRRS
Subjt:  MSRRALDSRQSIDSCTLKLHGWRPF-QLHSASKTLDSDAHSSAPTSAKPYYYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAAGCYTRRS

Query:  FRLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFLNGDANWSSDVSEAKNSRREREEKD-HLGGGFCSNGGFDAQGN
        F LIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFP+AVGTDSSGELF+NGDANWSSDVSEAKNSRRER+EKD HLGGGF SNGG DAQGN
Subjt:  FRLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFLNGDANWSSDVSEAKNSRREREEKD-HLGGGFCSNGGFDAQGN

Query:  ESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
        ESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERL GDSRMEIVGENTF+DQKSHHRCRRKKHECRMVD LR
Subjt:  ESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR

A0A6J1J7N4 uncharacterized protein LOC1114824878.1e-12589.53Show/hide
Query:  MSRRALDSRQSIDSCTLKLHGWRPFQLHSASKTLDSDAHSSAPTSAKPYYYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSI-AAGCYTRRS
        MSRRALDSR+SI SCTLKLHGWRPFQL    K LDSDAH+SAPTSAKP YYSSSGLHTKRPCLSDRTTSFNVDAIDMS LSLIDDDKPSI A G Y+R S
Subjt:  MSRRALDSRQSIDSCTLKLHGWRPFQLHSASKTLDSDAHSSAPTSAKPYYYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSI-AAGCYTRRS

Query:  FRLIARK-RRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFLNGDANWSSDVSEAKNS-RREREEKDHLGGGF-CSNGGFDAQ
        F+LIARK RRRRGSRSVSGRS+DRSGTRRCCSVGASAAHGTCSDFP+AVGTDSSGELF+NGDANWSSDVSEAKNS RREREEKD LG GF  SNGGFDAQ
Subjt:  FRLIARK-RRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFLNGDANWSSDVSEAKNS-RREREEKDHLGGGF-CSNGGFDAQ

Query:  GNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
        GNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERL GDSR+EIVGENTFADQKSHHRCRRKKHEC MVD+LR
Subjt:  GNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR

A0A6J1K891 uncharacterized protein LOC111493139 isoform X21.6e-12890.55Show/hide
Query:  MSRRALDSRQSIDSCTLKLHGWRPF-QLHSASKTLDSDAHSSAPTSAKPYYYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAAGCYTRRS
        MSRR LDSRQSIDSCTLKLH WRPF  LHSA KTLDSD H S PT++KP YYSS+ LHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAAG YTR S
Subjt:  MSRRALDSRQSIDSCTLKLHGWRPF-QLHSASKTLDSDAHSSAPTSAKPYYYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAAGCYTRRS

Query:  FRLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFLNGDANWSSDVSEAKNSRREREEKD-HLGGGFCSNGGFDAQGN
        F LIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFP+AVGTDSSGELF+NGDANWSSDVSEAKNSRRER+EKD HLGGGF  NG  DAQGN
Subjt:  FRLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFLNGDANWSSDVSEAKNSRREREEKD-HLGGGFCSNGGFDAQGN

Query:  ESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
        ESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERL GDSRMEIVGENTFADQKSHHRCRRKKHECRMVD LR
Subjt:  ESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G02425.1 unknown protein1.1e-6857.45Show/hide
Query:  MSRRALD-SRQSIDSCTLKLHGWRPFQLHSASKTLDSDAHSSAPTSAKPYYYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDK--PSIAAGCYTR
        MS + L+ SR SI+SCT +L  WRPF     SKTLDS   S  P     ++        KRPC SDR+TSF ++A  MSRLSL DDD    +++A  Y+ 
Subjt:  MSRRALD-SRQSIDSCTLKLHGWRPFQLHSASKTLDSDAHSSAPTSAKPYYYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDK--PSIAAGCYTR

Query:  R-SFRLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFLNGDANWSSDVSE-AKNSRREREE---KDHLGGGFCSNGG
        R SFRL+ARKRRRR SRSVSGRSSDRSGTRRCCS+G   AHGTCSD P AVGTDSSGELF  G+ANW+SDVSE A+NSRRER +   +    GGF    G
Subjt:  R-SFRLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFLNGDANWSSDVSE-AKNSRREREE---KDHLGGGFCSNGG

Query:  FDAQGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGGDSRMEIVGENTFADQKSHHRCRRKK-HECRMVDALR
         D  GNESGYGSEPGYRGD EFGYGDE D+E+ED + L WG+    DS M + GE  F+D K   RCRR++ H+ + VD++R
Subjt:  FDAQGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGGDSRMEIVGENTFADQKSHHRCRRKK-HECRMVDALR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCGCAGAGCCCTAGATTCCCGCCAATCGATTGACTCCTGCACTCTCAAGCTCCATGGTTGGAGACCCTTCCAGCTCCATTCTGCCTCCAAAACCCTAGATTCCGA
TGCCCATAGCTCGGCGCCCACAAGCGCTAAACCCTACTACTACTCGTCCAGTGGGCTTCACACCAAGCGCCCTTGTCTATCCGATCGAACTACCTCGTTCAATGTCGACG
CCATCGACATGTCGAGGCTGAGTTTGATCGACGACGACAAGCCTTCCATTGCCGCAGGGTGTTACACTCGGCGGAGCTTCCGATTGATCGCTAGGAAGCGGCGGCGGCGT
GGATCGAGGTCGGTTTCTGGGCGGAGTAGCGATCGGAGTGGGACGAGGCGGTGCTGCTCTGTTGGGGCTTCGGCGGCTCATGGGACTTGCTCGGATTTCCCTGTGGCGGT
TGGGACTGATTCGAGTGGGGAGCTGTTTCTGAATGGGGATGCGAATTGGTCGTCGGATGTGAGTGAAGCGAAGAATTCGAGGAGGGAGAGAGAGGAGAAGGATCATTTGG
GTGGTGGGTTTTGTTCTAATGGAGGTTTTGATGCTCAGGGGAACGAGTCTGGGTATGGAAGTGAACCTGGTTATCGTGGAGATGGTGAATTTGGGTATGGTGATGAGATC
GATGAGGAGGATGAAGATGCCAGATTGTTGCTGTGGGGTGAACGACTGGGAGGAGATTCTAGAATGGAAATTGTAGGAGAGAACACATTTGCAGATCAGAAATCACACCA
TAGATGTCGTCGTAAGAAGCACGAATGTAGAATGGTTGATGCCCTGAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTCGCAGAGCCCTAGATTCCCGCCAATCGATTGACTCCTGCACTCTCAAGCTCCATGGTTGGAGACCCTTCCAGCTCCATTCTGCCTCCAAAACCCTAGATTCCGA
TGCCCATAGCTCGGCGCCCACAAGCGCTAAACCCTACTACTACTCGTCCAGTGGGCTTCACACCAAGCGCCCTTGTCTATCCGATCGAACTACCTCGTTCAATGTCGACG
CCATCGACATGTCGAGGCTGAGTTTGATCGACGACGACAAGCCTTCCATTGCCGCAGGGTGTTACACTCGGCGGAGCTTCCGATTGATCGCTAGGAAGCGGCGGCGGCGT
GGATCGAGGTCGGTTTCTGGGCGGAGTAGCGATCGGAGTGGGACGAGGCGGTGCTGCTCTGTTGGGGCTTCGGCGGCTCATGGGACTTGCTCGGATTTCCCTGTGGCGGT
TGGGACTGATTCGAGTGGGGAGCTGTTTCTGAATGGGGATGCGAATTGGTCGTCGGATGTGAGTGAAGCGAAGAATTCGAGGAGGGAGAGAGAGGAGAAGGATCATTTGG
GTGGTGGGTTTTGTTCTAATGGAGGTTTTGATGCTCAGGGGAACGAGTCTGGGTATGGAAGTGAACCTGGTTATCGTGGAGATGGTGAATTTGGGTATGGTGATGAGATC
GATGAGGAGGATGAAGATGCCAGATTGTTGCTGTGGGGTGAACGACTGGGAGGAGATTCTAGAATGGAAATTGTAGGAGAGAACACATTTGCAGATCAGAAATCACACCA
TAGATGTCGTCGTAAGAAGCACGAATGTAGAATGGTTGATGCCCTGAGATGA
Protein sequenceShow/hide protein sequence
MSRRALDSRQSIDSCTLKLHGWRPFQLHSASKTLDSDAHSSAPTSAKPYYYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAAGCYTRRSFRLIARKRRRR
GSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFLNGDANWSSDVSEAKNSRREREEKDHLGGGFCSNGGFDAQGNESGYGSEPGYRGDGEFGYGDEI
DEEDEDARLLLWGERLGGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR