; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0006860 (gene) of Snake gourd v1 genome

Gene IDTan0006860
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionUnknown protein
Genome locationLG04:79729680..79732997
RNA-Seq ExpressionTan0006860
SyntenyTan0006860
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7036664.1 hypothetical protein SDJN02_00284, partial [Cucurbita argyrosperma subsp. argyrosperma]4.8e-13291.21Show/hide
Query:  MSRRALDSRQSIDSCTLKLHSWRPF-QLHSAPKTLDSDAHTSALATAKPHYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAPGCYTRRSF
        MSRR LDSRQSIDSCTLKLH+WRPF  LHSAPKTLDSD H S   T+KP+YSS+ LHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIA G YTRRSF
Subjt:  MSRRALDSRQSIDSCTLKLHSWRPF-QLHSAPKTLDSDAHTSALATAKPHYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAPGCYTRRSF

Query:  RLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKD-HLGGGFGSNGGFDAQGNE
         LIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFP+AVGTDSSGELFVNGDANWSSDVSEAKNSRR R+EKD HLGGGFGSNGG DAQGNE
Subjt:  RLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKD-HLGGGFGSNGGFDAQGNE

Query:  SGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
        SGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSR+EIVGENTF+DQKSHHRCRRKKHECRMVD LR
Subjt:  SGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR

XP_022155253.1 uncharacterized protein LOC111022393 [Momordica charantia]1.0e-13492.62Show/hide
Query:  MSRRALDSRQSIDSCTLKLHSWRPFQLHSAPKTLDSDAHTSALATAKPHYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAPGCYTRRSFR
        MSRRALDSRQSI+SCTLKLHSWRPFQLH+APKTLDSD H SA   +KP+YSS+GLHTKRPCLSDR TSF+VDAIDMSRLSLIDDDKPSIA GCYTRRSFR
Subjt:  MSRRALDSRQSIDSCTLKLHSWRPFQLHSAPKTLDSDAHTSALATAKPHYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAPGCYTRRSFR

Query:  LIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKDHLGGGFGSNGGFDAQGNESG
        L+A KRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKD     FGSNGGFDAQGNESG
Subjt:  LIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKDHLGGGFGSNGGFDAQGNESG

Query:  YGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
        YGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
Subjt:  YGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR

XP_022949469.1 uncharacterized protein LOC111452804 isoform X2 [Cucurbita moschata]2.6e-13391.94Show/hide
Query:  MSRRALDSRQSIDSCTLKLHSWRPF-QLHSAPKTLDSDAHTSALATAKPHYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAPGCYTRRSF
        MSRR LDSRQSIDSCTLKLH+WRPF  LHSAPKTLDSD H S   T+KP+YSS+ LHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIA G YTRRSF
Subjt:  MSRRALDSRQSIDSCTLKLHSWRPF-QLHSAPKTLDSDAHTSALATAKPHYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAPGCYTRRSF

Query:  RLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKD-HLGGGFGSNGGFDAQGNE
         LIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFP+AVGTDSSGELFVNGDANWSSDVSEAKNSRRER+EKD HLGGGFGSNGG DAQGNE
Subjt:  RLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKD-HLGGGFGSNGGFDAQGNE

Query:  SGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
        SGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIVGENTF+DQKSHHRCRRKKHECRMVD LR
Subjt:  SGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR

XP_023524455.1 uncharacterized protein LOC111788369 [Cucurbita pepo subsp. pepo]1.3e-13291.58Show/hide
Query:  MSRRALDSRQSIDSCTLKLHSWRPF-QLHSAPKTLDSDAHTSALATAKPHYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAPGCYTRRSF
        MSRR LDSRQSIDSCTLKLH+WRPF  LHSAPKTLDSD H S   T+KP+YSS+ LHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIA G YTRRSF
Subjt:  MSRRALDSRQSIDSCTLKLHSWRPF-QLHSAPKTLDSDAHTSALATAKPHYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAPGCYTRRSF

Query:  RLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKD-HLGGGFGSNGGFDAQGNE
         LIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFP+AVGTDSSGELFVNGDANWSSDVSEAKNSRRER+EKD HLGGGF SNGG DAQGNE
Subjt:  RLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKD-HLGGGFGSNGGFDAQGNE

Query:  SGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
        SGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIVGENTF+DQKSHHRCRRKKHECRMVD LR
Subjt:  SGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR

XP_038906083.1 uncharacterized protein LOC120091971 [Benincasa hispida]9.7e-13391.91Show/hide
Query:  MSRRALDSRQSIDSCTLKLHSWRPFQLHSAPKTLDSDAHTSALATAKPHYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAPGCYTRRSFR
        MSRRALDSRQSIDSCTLKLH W PF L   PKTLDSD H+SA   +KP+YSS+ LHTKRPCLSDRTTSFNVDAIDMS LSLIDDDKPSIA GCYTRRS R
Subjt:  MSRRALDSRQSIDSCTLKLHSWRPFQLHSAPKTLDSDAHTSALATAKPHYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAPGCYTRRSFR

Query:  LIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKDHLGGGFG-SNGGFDAQGNES
        LIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFP+AVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKDHLG GFG SNGGFDAQGNES
Subjt:  LIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKDHLGGGFG-SNGGFDAQGNES

Query:  GYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
        GYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDS+MEIVGENTFADQKSHHRCRRKKHECRMVDALR
Subjt:  GYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR

TrEMBL top hitse value%identityAlignment
A0A6J1DMG8 uncharacterized protein LOC1110223935.0e-13592.62Show/hide
Query:  MSRRALDSRQSIDSCTLKLHSWRPFQLHSAPKTLDSDAHTSALATAKPHYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAPGCYTRRSFR
        MSRRALDSRQSI+SCTLKLHSWRPFQLH+APKTLDSD H SA   +KP+YSS+GLHTKRPCLSDR TSF+VDAIDMSRLSLIDDDKPSIA GCYTRRSFR
Subjt:  MSRRALDSRQSIDSCTLKLHSWRPFQLHSAPKTLDSDAHTSALATAKPHYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAPGCYTRRSFR

Query:  LIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKDHLGGGFGSNGGFDAQGNESG
        L+A KRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKD     FGSNGGFDAQGNESG
Subjt:  LIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKDHLGGGFGSNGGFDAQGNESG

Query:  YGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
        YGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
Subjt:  YGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR

A0A6J1GC64 uncharacterized protein LOC111452804 isoform X11.2e-12884.18Show/hide
Query:  MSRRALDSRQSIDSCTLKLHSWRPF-QLHSAPKTLDSDAHTSALATAKPHYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAPGCYTRRSF
        MSRR LDSRQSIDSCTLKLH+WRPF  LHSAPKTLDSD H S   T+KP+YSS+ LHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIA G YTRRSF
Subjt:  MSRRALDSRQSIDSCTLKLHSWRPF-QLHSAPKTLDSDAHTSALATAKPHYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAPGCYTRRSF

Query:  RLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKD-HLGGGFGSNGGFDAQGNE
         LIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFP+AVGTDSSGELFVNGDANWSSDVSEAKNSRRER+EKD HLGGGFGSNGG DAQGNE
Subjt:  RLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKD-HLGGGFGSNGGFDAQGNE

Query:  SGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGD------------------------SRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
        SGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLG                         SRMEIVGENTF+DQKSHHRCRRKKHECRMVD LR
Subjt:  SGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGD------------------------SRMEIVGENTFADQKSHHRCRRKKHECRMVDALR

A0A6J1GCW8 uncharacterized protein LOC111452804 isoform X21.2e-13391.94Show/hide
Query:  MSRRALDSRQSIDSCTLKLHSWRPF-QLHSAPKTLDSDAHTSALATAKPHYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAPGCYTRRSF
        MSRR LDSRQSIDSCTLKLH+WRPF  LHSAPKTLDSD H S   T+KP+YSS+ LHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIA G YTRRSF
Subjt:  MSRRALDSRQSIDSCTLKLHSWRPF-QLHSAPKTLDSDAHTSALATAKPHYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAPGCYTRRSF

Query:  RLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKD-HLGGGFGSNGGFDAQGNE
         LIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFP+AVGTDSSGELFVNGDANWSSDVSEAKNSRRER+EKD HLGGGFGSNGG DAQGNE
Subjt:  RLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKD-HLGGGFGSNGGFDAQGNE

Query:  SGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
        SGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIVGENTF+DQKSHHRCRRKKHECRMVD LR
Subjt:  SGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR

A0A6J1J7N4 uncharacterized protein LOC1114824874.3e-12689.82Show/hide
Query:  MSRRALDSRQSIDSCTLKLHSWRPFQLHSAPKTLDSDAHTSALATAKPHYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSI-APGCYTRRSF
        MSRRALDSR+SI SCTLKLH WRPFQL   PK LDSDAHTSA  +AKP+YSSSGLHTKRPCLSDRTTSFNVDAIDMS LSLIDDDKPSI A G Y+R SF
Subjt:  MSRRALDSRQSIDSCTLKLHSWRPFQLHSAPKTLDSDAHTSALATAKPHYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSI-APGCYTRRSF

Query:  RLIARK-RRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFVNGDANWSSDVSEAKNS-RREREEKDHLGGGFG-SNGGFDAQG
        +LIARK RRRRGSRSVSGRS+DRSGTRRCCSVGASAAHGTCSDFP+AVGTDSSGELFVNGDANWSSDVSEAKNS RREREEKD LG GFG SNGGFDAQG
Subjt:  RLIARK-RRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFVNGDANWSSDVSEAKNS-RREREEKDHLGGGFG-SNGGFDAQG

Query:  NESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
        NESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSR+EIVGENTFADQKSHHRCRRKKHEC MVD+LR
Subjt:  NESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR

A0A6J1K891 uncharacterized protein LOC111493139 isoform X23.7e-13090.84Show/hide
Query:  MSRRALDSRQSIDSCTLKLHSWRPF-QLHSAPKTLDSDAHTSALATAKPHYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAPGCYTRRSF
        MSRR LDSRQSIDSCTLKLH+WRPF  LHSAPKTLDSD H S   T+KP+YSS+ LHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIA G YTR SF
Subjt:  MSRRALDSRQSIDSCTLKLHSWRPF-QLHSAPKTLDSDAHTSALATAKPHYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAPGCYTRRSF

Query:  RLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKD-HLGGGFGSNGGFDAQGNE
         LIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFP+AVGTDSSGELFVNGDANWSSDVSEAKNSRRER+EKD HLGGGF  NG  DAQGNE
Subjt:  RLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKD-HLGGGFGSNGGFDAQGNE

Query:  SGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR
        SGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIVGENTFADQKSHHRCRRKKHECRMVD LR
Subjt:  SGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G02425.1 unknown protein2.3e-6857.6Show/hide
Query:  MSRRALD-SRQSIDSCTLKLHSWRPFQLHSAPKTLDSDAHTSALATAKPHYSSSGLHT---KRPCLSDRTTSFNVDAIDMSRLSLIDDD---KPSIAPGC
        MS + L+ SR SI+SCT +L SWRPF      KTLDS               ++G H+   KRPC SDR+TSF ++A  MSRLSL DDD   K   A   
Subjt:  MSRRALD-SRQSIDSCTLKLHSWRPFQLHSAPKTLDSDAHTSALATAKPHYSSSGLHT---KRPCLSDRTTSFNVDAIDMSRLSLIDDD---KPSIAPGC

Query:  YTRRSFRLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFVNGDANWSSDVSE-AKNSRREREE---KDHLGGGFGSN
          R SFRL+ARKRRRR SRSVSGRSSDRSGTRRCCS+G   AHGTCSD P AVGTDSSGELF  G+ANW+SDVSE A+NSRRER +   +    GGFG  
Subjt:  YTRRSFRLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFVNGDANWSSDVSE-AKNSRREREE---KDHLGGGFGSN

Query:  GGFDAQGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIVGENTFADQKSHHRCRRKK-HECRMVDALR
         G D  GNESGYGSEPGYRGD EFGYGDE D+E+ED + L WG+   DS M + GE  F+D K   RCRR++ H+ + VD++R
Subjt:  GGFDAQGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIVGENTFADQKSHHRCRRKK-HECRMVDALR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCGCAGAGCCCTAGATTCCCGCCAATCGATTGACTCCTGTACTCTCAAACTCCACAGTTGGAGACCCTTTCAGCTCCACTCTGCTCCCAAAACCCTAGATTCCGA
TGCCCATACCTCAGCTCTCGCAACCGCTAAACCCCACTACTCATCCAGCGGCCTTCACACCAAGCGCCCTTGTCTCTCCGATCGAACTACCTCTTTCAATGTCGACGCCA
TCGACATGTCGAGGCTGAGCTTGATTGACGACGACAAGCCTTCCATTGCGCCAGGGTGTTACACGCGACGAAGCTTCCGGTTGATTGCTAGGAAACGGCGGCGGCGTGGA
TCCAGGTCGGTTTCTGGCCGGAGTAGTGATCGGAGTGGGACTAGGCGGTGCTGCTCTGTTGGGGCTTCGGCGGCTCATGGGACTTGCTCGGATTTCCCTGTGGCGGTTGG
GACTGATTCCAGTGGGGAGTTGTTTGTGAATGGGGATGCGAATTGGTCGTCGGATGTGAGTGAAGCGAAGAATTCGAGGAGGGAGAGAGAGGAGAAGGATCATTTGGGAG
GTGGGTTTGGTTCTAATGGGGGTTTTGATGCTCAGGGGAATGAGTCTGGGTATGGTAGTGAACCTGGTTATCGTGGAGATGGTGAATTTGGGTATGGCGATGAGATCGAT
GAGGAGGATGAAGATGCCAGATTGCTGCTGTGGGGTGAACGACTGGGAGATTCTAGAATGGAAATTGTAGGAGAAAACACATTTGCAGATCAGAAATCACACCATAGATG
TCGTCGTAAGAAACACGAATGTAGAATGGTCGACGCCCTGAGGTGA
mRNA sequenceShow/hide mRNA sequence
GTAGATTACAACTATATTGCGGTAGATTCTTCCCTCTGATATGAACTTTACTACGGCCACTCTCACGGATCCTCGAGATTCGAGGGAATTGGTCTCACCGTGCCATCTTT
TTCTCCTCTGCTTCTTGAAAAATCTCAACAATTCTCCTCCTCAAACCCCGTTTCCTTCTTCACATTGATCCTTTCTGCAACTCCAATTCAATCATGGCTTTCAATCTCTG
AACCCTTCTTTTCATCTTTCAATTCTTCCCCATGTCTCGCAGAGCCCTAGATTCCCGCCAATCGATTGACTCCTGTACTCTCAAACTCCACAGTTGGAGACCCTTTCAGC
TCCACTCTGCTCCCAAAACCCTAGATTCCGATGCCCATACCTCAGCTCTCGCAACCGCTAAACCCCACTACTCATCCAGCGGCCTTCACACCAAGCGCCCTTGTCTCTCC
GATCGAACTACCTCTTTCAATGTCGACGCCATCGACATGTCGAGGCTGAGCTTGATTGACGACGACAAGCCTTCCATTGCGCCAGGGTGTTACACGCGACGAAGCTTCCG
GTTGATTGCTAGGAAACGGCGGCGGCGTGGATCCAGGTCGGTTTCTGGCCGGAGTAGTGATCGGAGTGGGACTAGGCGGTGCTGCTCTGTTGGGGCTTCGGCGGCTCATG
GGACTTGCTCGGATTTCCCTGTGGCGGTTGGGACTGATTCCAGTGGGGAGTTGTTTGTGAATGGGGATGCGAATTGGTCGTCGGATGTGAGTGAAGCGAAGAATTCGAGG
AGGGAGAGAGAGGAGAAGGATCATTTGGGAGGTGGGTTTGGTTCTAATGGGGGTTTTGATGCTCAGGGGAATGAGTCTGGGTATGGTAGTGAACCTGGTTATCGTGGAGA
TGGTGAATTTGGGTATGGCGATGAGATCGATGAGGAGGATGAAGATGCCAGATTGCTGCTGTGGGGTGAACGACTGGGAGATTCTAGAATGGAAATTGTAGGAGAAAACA
CATTTGCAGATCAGAAATCACACCATAGATGTCGTCGTAAGAAACACGAATGTAGAATGGTCGACGCCCTGAGGTGAAGCAAACATGGAAACGAGGCGACGGACACTGTG
AAAGCTCCTGATGCTGATGGTCTCTACCTGTTGGGGTATGCCTCTGAGACCTGGAATTTTCAACTGGAGAGGTTCCCCAGTTTTTAGACTGCTTAAATTGCAGCTTAAAG
TAACTAGATTTCCCTTTGACATTTTGTGAATAGATCGCTGCTGGTATGAGCTCTAGAACACCGACTTCTCGACGCACGTCGACGGGTGCATCGATTGGGAAGAAAACATA
ATCCCATTGTCAGATTTCATCTTAAAATTTTCTCACCTTAAGAAAGTGAAGAACTCTTTTGTATACGTTTAGACAGAACTGGGAAAAAGGCTTTAATGCTTCTTAAGTTT
ACTTAGTAAGTACAACTTTTGACCGTTGCTTCTATTTTGATCTTTGTGGTTTGTGCTTTCATGTTTTTTCAACCTTGGC
Protein sequenceShow/hide protein sequence
MSRRALDSRQSIDSCTLKLHSWRPFQLHSAPKTLDSDAHTSALATAKPHYSSSGLHTKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAPGCYTRRSFRLIARKRRRRG
SRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPVAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKDHLGGGFGSNGGFDAQGNESGYGSEPGYRGDGEFGYGDEID
EEDEDARLLLWGERLGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDALR