CuGenDBv2

Sed0008091 (gene) of Chayote v1 genome

Gene ID: Sed0008091
Organism: Sechium edule (Chayote v1)
Description: Conserved peptide upstream open reading frame 46
Genome location: LG14:4361070..4362176
RNA-Seq Expression: Sed0008091
Synteny: Sed0008091
Gene Ontology terms:
GO:0032259 - methylation (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domains: IPR029063 - S-adenosyl-L-methionine-dependent methyltransferase
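The genome location field uses a SEQID:START..END notation. As a minimal sketch, the span it covers could be computed as follows; the helper names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive at both ends, which this record does not state explicitly.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return seqid, start, end


def span_length(location: str) -> int:
    """Return the number of bases covered by the location."""
    # Assumption: coordinates are 1-based and inclusive at both ends.
    _, start, end = parse_location(location)
    return end - start + 1


print(span_length("LG14:4361070..4362176"))  # 1107
```

Under that assumption, the gene spans 1107 bp on LG14.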


Homology
GenBank top hits (columns: e-value, % identity, alignment)
KAA0047269.1 Methyltransferase type 11 [Cucumis melo var. makuwa] | e-value: 5.4e-85 | % identity: 67.46
Query:  MDLKLSKSPILHDHAFARRVFFRVFLFASAVSLIPILHILTSFDFKSFRLPKSPPCYAALHGGATPRRLPRGSYLFQGHFLNPVWDSFDSLHCQQNVNLT
        MDLK  KSPIL D AFA+RVFFRVFLFASA+SLIPILHILTS+DFKSF LPKSPPC+A+ H  +TP  LPRGSYLFQGHFLNPVWDSF+SLHC++ VNLT
Subjt:  MDLKLSKSPILHDHAFARRVFFRVFLFASAVSLIPILHILTSFDFKSFRLPKSPPCYAALHGGATPRRLPRGSYLFQGHFLNPVWDSFDSLHCQQNVNLT

Query:  VAAVQKLADGRHLFNRTARALYVGGSSSSAASALQDLGFSRVVGVDSGRVGS-----HGYELDFPNGSFDFVLFRGK-FKVSVPELVVGEIERVLDRGGV
        ++ ++ L D +HLFN +ARAL+VGGSSSSAAS LQDLGFS  +GVD GR  S      GY+LD+ N SFDFVLF GK  KVSVP+LVVGEIER+LD GG+
Subjt:  VAAVQKLADGRHLFNRTARALYVGGSSSSAASALQDLGFSRVVGVDSGRVGS-----HGYELDFPNGSFDFVLFRGK-FKVSVPELVVGEIERVLDRGGV

Query:  GAVAFGVFRSPVAAEISIGLAGRVGRLVRSSCAVDSGYVNNFYMIVFKKKQF
        GAV  G     +++ ISIG  GRV +L++SSC V SG V   Y+ VFKKK F
Subjt:  GAVAFGVFRSPVAAEISIGLAGRVGRLVRSSCAVDSGYVNNFYMIVFKKKQF

KAE8647389.1 hypothetical protein Csa_003425 [Cucumis sativus] | e-value: 3.9e-91 | % identity: 67.57
Query:  MDLKLSKSPILHDHAFARRVFFRVFLFASAVSLIPILHILTSFDFKSFRLPKSPPCYAALHGGATPRRLPRGSYLFQGHFLNPVWDSFDSLHCQQNVNLT
        MDLK  +SPILHD AFA+RVFFRVFLFASA+SLIPILHILTS+DFKSF LPKSPPC+A+ H  +TP  LPRGSYLFQGHFLNPVWDSFDSLHCQ  VNLT
Subjt:  MDLKLSKSPILHDHAFARRVFFRVFLFASAVSLIPILHILTSFDFKSFRLPKSPPCYAALHGGATPRRLPRGSYLFQGHFLNPVWDSFDSLHCQQNVNLT

Query:  VAAVQKLADGRHLFNRTARALYVGGSSSSAASALQDLGFSRVVGVDSGRVGS-----HGYELDFPNGSFDFVLFRGKFKVSVPELVVGEIERVLDRGGVG
        ++ ++ L   +HLFN +ARAL+VGGSSSSAAS L DLGFSR VGVD GR  S      GY+LD+PN SFDFVLF+GK KVSVP+LVVGE+ER+LD GG+G
Subjt:  VAAVQKLADGRHLFNRTARALYVGGSSSSAASALQDLGFSRVVGVDSGRVGS-----HGYELDFPNGSFDFVLFRGKFKVSVPELVVGEIERVLDRGGVG

Query:  AVAFGVFRSPVAAEISIGLAGRVGRLVRSSCAVDSGYVNNFYMIVFKKKQFDGHLSVKC
        AV  G     +++ ISIGL GRV +L++SSC V SG V   Y+ VFKKK FD  + + C
Subjt:  AVAFGVFRSPVAAEISIGLAGRVGRLVRSSCAVDSGYVNNFYMIVFKKKQFDGHLSVKC

XP_008449921.1 PREDICTED: uncharacterized protein LOC103491650 [Cucumis melo] | e-value: 1.9e-85 | % identity: 65.9
Query:  MDLKLSKSPILHDHAFARRVFFRVFLFASAVSLIPILHILTSFDFKSFRLPKSPPCYAALHGGATPRRLPRGSYLFQGHFLNPVWDSFDSLHCQQNVNLT
        MDLK  KSPIL D AFA+RVFFRVFLFASA+SLIPILHILTS+DFKSF LPKSPPC+A+ H  +TP  LPRGSYLFQGHFLNPVWDSF+SLHC++ VNLT
Subjt:  MDLKLSKSPILHDHAFARRVFFRVFLFASAVSLIPILHILTSFDFKSFRLPKSPPCYAALHGGATPRRLPRGSYLFQGHFLNPVWDSFDSLHCQQNVNLT

Query:  VAAVQKLADGRHLFNRTARALYVGGSSSSAASALQDLGFSRVVGVDSGRVGS-----HGYELDFPNGSFDFVLFRGK-FKVSVPELVVGEIERVLDRGGV
        ++ ++ L D +HLFN +ARAL+VGGSSSSAAS LQDLGFS  +GVD GR  S      GY+LD+ N SFDFVLF GK  KVSVP+LVVGEIER+LD GG+
Subjt:  VAAVQKLADGRHLFNRTARALYVGGSSSSAASALQDLGFSRVVGVDSGRVGS-----HGYELDFPNGSFDFVLFRGK-FKVSVPELVVGEIERVLDRGGV

Query:  GAVAFGVFRSPVAAEISIGLAGRVGRLVRSSCAVDSGYVNNFYMIVFKKKQF-DGHLSVKC
        GAV  G     +++ ISIG  GRV +L++SSC V SG V   Y+ VFKKK F D  + + C
Subjt:  GAVAFGVFRSPVAAEISIGLAGRVGRLVRSSCAVDSGYVNNFYMIVFKKKQF-DGHLSVKC

XP_011657647.1 uncharacterized protein LOC105435880 [Cucumis sativus] | e-value: 3.9e-91 | % identity: 67.57
Query:  MDLKLSKSPILHDHAFARRVFFRVFLFASAVSLIPILHILTSFDFKSFRLPKSPPCYAALHGGATPRRLPRGSYLFQGHFLNPVWDSFDSLHCQQNVNLT
        MDLK  +SPILHD AFA+RVFFRVFLFASA+SLIPILHILTS+DFKSF LPKSPPC+A+ H  +TP  LPRGSYLFQGHFLNPVWDSFDSLHCQ  VNLT
Subjt:  MDLKLSKSPILHDHAFARRVFFRVFLFASAVSLIPILHILTSFDFKSFRLPKSPPCYAALHGGATPRRLPRGSYLFQGHFLNPVWDSFDSLHCQQNVNLT

Query:  VAAVQKLADGRHLFNRTARALYVGGSSSSAASALQDLGFSRVVGVDSGRVGS-----HGYELDFPNGSFDFVLFRGKFKVSVPELVVGEIERVLDRGGVG
        ++ ++ L   +HLFN +ARAL+VGGSSSSAAS L DLGFSR VGVD GR  S      GY+LD+PN SFDFVLF+GK KVSVP+LVVGE+ER+LD GG+G
Subjt:  VAAVQKLADGRHLFNRTARALYVGGSSSSAASALQDLGFSRVVGVDSGRVGS-----HGYELDFPNGSFDFVLFRGKFKVSVPELVVGEIERVLDRGGVG

Query:  AVAFGVFRSPVAAEISIGLAGRVGRLVRSSCAVDSGYVNNFYMIVFKKKQFDGHLSVKC
        AV  G     +++ ISIGL GRV +L++SSC V SG V   Y+ VFKKK FD  + + C
Subjt:  AVAFGVFRSPVAAEISIGLAGRVGRLVRSSCAVDSGYVNNFYMIVFKKKQFDGHLSVKC

XP_038883488.1 uncharacterized protein LOC120074441 [Benincasa hispida] | e-value: 5.2e-88 | % identity: 67.18
Query:  MDLKLSKSPILHDHAFARRVFFRVFLFASAVSLIPILHILTSFDFKSFRLPKSPPCYAALHGGATPRRLPRGSYLFQGHFLNPVWDSFDSLHCQQNVNLT
        MDLK  KSPILHD AFARR+FFR+FLF S +SLIPILHILTS+DFKSF LPKSPPC+A     A   +LPRGSYLFQGHFLNPVWDSFDS+HCQ+ VNLT
Subjt:  MDLKLSKSPILHDHAFARRVFFRVFLFASAVSLIPILHILTSFDFKSFRLPKSPPCYAALHGGATPRRLPRGSYLFQGHFLNPVWDSFDSLHCQQNVNLT

Query:  VAAVQKLADGRHLFNRTARALYVGGSSSSAASALQDLGFSRVVGVDSGRVGS-----HGYELDFPNGSFDFVLFRGKFKVSVPELVVGEIERVLDRGGVG
        V+ ++ L + +HLFN +ARAL+VGGSSSSAAS LQDLGFS  VGVD GR  S      GY+LD+ N SFDFVLF+GK KVSVP+LVVGEIER+L  GG+G
Subjt:  VAAVQKLADGRHLFNRTARALYVGGSSSSAASALQDLGFSRVVGVDSGRVGS-----HGYELDFPNGSFDFVLFRGKFKVSVPELVVGEIERVLDRGGVG

Query:  AVAFGVFRSPVAAEISIGLAGRVGRLVRSSCAVDSGYVNNFYMIVFKKKQFDGHLSVKC
        AV  G     +++ ISIGLAGRVG+L++SSC V SG VN  Y+ VFKKKQF   L + C
Subjt:  AVAFGVFRSPVAAEISIGLAGRVGRLVRSSCAVDSGYVNNFYMIVFKKKQFDGHLSVKC

TrEMBL top hits (columns: e-value, % identity, alignment)
A0A0A0KEM7 Uncharacterized protein | e-value: 1.9e-91 | % identity: 67.57
Query:  MDLKLSKSPILHDHAFARRVFFRVFLFASAVSLIPILHILTSFDFKSFRLPKSPPCYAALHGGATPRRLPRGSYLFQGHFLNPVWDSFDSLHCQQNVNLT
        MDLK  +SPILHD AFA+RVFFRVFLFASA+SLIPILHILTS+DFKSF LPKSPPC+A+ H  +TP  LPRGSYLFQGHFLNPVWDSFDSLHCQ  VNLT
Subjt:  MDLKLSKSPILHDHAFARRVFFRVFLFASAVSLIPILHILTSFDFKSFRLPKSPPCYAALHGGATPRRLPRGSYLFQGHFLNPVWDSFDSLHCQQNVNLT

Query:  VAAVQKLADGRHLFNRTARALYVGGSSSSAASALQDLGFSRVVGVDSGRVGS-----HGYELDFPNGSFDFVLFRGKFKVSVPELVVGEIERVLDRGGVG
        ++ ++ L   +HLFN +ARAL+VGGSSSSAAS L DLGFSR VGVD GR  S      GY+LD+PN SFDFVLF+GK KVSVP+LVVGE+ER+LD GG+G
Subjt:  VAAVQKLADGRHLFNRTARALYVGGSSSSAASALQDLGFSRVVGVDSGRVGS-----HGYELDFPNGSFDFVLFRGKFKVSVPELVVGEIERVLDRGGVG

Query:  AVAFGVFRSPVAAEISIGLAGRVGRLVRSSCAVDSGYVNNFYMIVFKKKQFDGHLSVKC
        AV  G     +++ ISIGL GRV +L++SSC V SG V   Y+ VFKKK FD  + + C
Subjt:  AVAFGVFRSPVAAEISIGLAGRVGRLVRSSCAVDSGYVNNFYMIVFKKKQFDGHLSVKC

A0A1S3BMI8 uncharacterized protein LOC103491650 | e-value: 9.0e-86 | % identity: 65.9
Query:  MDLKLSKSPILHDHAFARRVFFRVFLFASAVSLIPILHILTSFDFKSFRLPKSPPCYAALHGGATPRRLPRGSYLFQGHFLNPVWDSFDSLHCQQNVNLT
        MDLK  KSPIL D AFA+RVFFRVFLFASA+SLIPILHILTS+DFKSF LPKSPPC+A+ H  +TP  LPRGSYLFQGHFLNPVWDSF+SLHC++ VNLT
Subjt:  MDLKLSKSPILHDHAFARRVFFRVFLFASAVSLIPILHILTSFDFKSFRLPKSPPCYAALHGGATPRRLPRGSYLFQGHFLNPVWDSFDSLHCQQNVNLT

Query:  VAAVQKLADGRHLFNRTARALYVGGSSSSAASALQDLGFSRVVGVDSGRVGS-----HGYELDFPNGSFDFVLFRGK-FKVSVPELVVGEIERVLDRGGV
        ++ ++ L D +HLFN +ARAL+VGGSSSSAAS LQDLGFS  +GVD GR  S      GY+LD+ N SFDFVLF GK  KVSVP+LVVGEIER+LD GG+
Subjt:  VAAVQKLADGRHLFNRTARALYVGGSSSSAASALQDLGFSRVVGVDSGRVGS-----HGYELDFPNGSFDFVLFRGK-FKVSVPELVVGEIERVLDRGGV

Query:  GAVAFGVFRSPVAAEISIGLAGRVGRLVRSSCAVDSGYVNNFYMIVFKKKQF-DGHLSVKC
        GAV  G     +++ ISIG  GRV +L++SSC V SG V   Y+ VFKKK F D  + + C
Subjt:  GAVAFGVFRSPVAAEISIGLAGRVGRLVRSSCAVDSGYVNNFYMIVFKKKQF-DGHLSVKC

A0A5A7U1B0 Methyltransferase type 11 | e-value: 2.6e-85 | % identity: 67.46
Query:  MDLKLSKSPILHDHAFARRVFFRVFLFASAVSLIPILHILTSFDFKSFRLPKSPPCYAALHGGATPRRLPRGSYLFQGHFLNPVWDSFDSLHCQQNVNLT
        MDLK  KSPIL D AFA+RVFFRVFLFASA+SLIPILHILTS+DFKSF LPKSPPC+A+ H  +TP  LPRGSYLFQGHFLNPVWDSF+SLHC++ VNLT
Subjt:  MDLKLSKSPILHDHAFARRVFFRVFLFASAVSLIPILHILTSFDFKSFRLPKSPPCYAALHGGATPRRLPRGSYLFQGHFLNPVWDSFDSLHCQQNVNLT

Query:  VAAVQKLADGRHLFNRTARALYVGGSSSSAASALQDLGFSRVVGVDSGRVGS-----HGYELDFPNGSFDFVLFRGK-FKVSVPELVVGEIERVLDRGGV
        ++ ++ L D +HLFN +ARAL+VGGSSSSAAS LQDLGFS  +GVD GR  S      GY+LD+ N SFDFVLF GK  KVSVP+LVVGEIER+LD GG+
Subjt:  VAAVQKLADGRHLFNRTARALYVGGSSSSAASALQDLGFSRVVGVDSGRVGS-----HGYELDFPNGSFDFVLFRGK-FKVSVPELVVGEIERVLDRGGV

Query:  GAVAFGVFRSPVAAEISIGLAGRVGRLVRSSCAVDSGYVNNFYMIVFKKKQF
        GAV  G     +++ ISIG  GRV +L++SSC V SG V   Y+ VFKKK F
Subjt:  GAVAFGVFRSPVAAEISIGLAGRVGRLVRSSCAVDSGYVNNFYMIVFKKKQF

A0A6J1CPZ9 uncharacterized protein LOC111013673 | e-value: 4.9e-84 | % identity: 66.02
Query:  MDLKLSKSPILHDHAFARRVFFRVFLFASAVSLIPILHILTSFDFKSFRLPKSPPCYAALHGGATPR---RLPRGSYLFQGHFLNPVWDSFDSLHCQQNV
        MDLKLSKS IL+D A ARRVFFRVFLFASAVS+IPI+HILT++DF++F LP+S  CYAA  GG T +   + PRGSYLFQGHFLNPVWDSFDS+HCQ+NV
Subjt:  MDLKLSKSPILHDHAFARRVFFRVFLFASAVSLIPILHILTSFDFKSFRLPKSPPCYAALHGGATPR---RLPRGSYLFQGHFLNPVWDSFDSLHCQQNV

Query:  NLTVAAVQKLADGRHLFNRTARALYVGGSSSSAASALQDLGFSRVVGVDSGRVGS-----HGYELDFPNGSFDFVLFRGKFKVSVPELVVGEIERVLDRG
        NLT++A++ L D +HLFN +A+AL+VGGSSSSA SA++DLGFS  VGVD GRV S      GY LD+ NGSFDFV+FRGKFKVSVP+LVVGEIERVL+ G
Subjt:  NLTVAAVQKLADGRHLFNRTARALYVGGSSSSAASALQDLGFSRVVGVDSGRVGS-----HGYELDFPNGSFDFVLFRGKFKVSVPELVVGEIERVLDRG

Query:  GVGAVAFGVFRSPVAAEISIGLAGRVGRLVRSSCAVDSGYVNNFYMIVFKKKQFDGHLS
        G+GAV  G+  +P         A RVG L++SSC V SG VNNFYM VFKK+  +G  S
Subjt:  GVGAVAFGVFRSPVAAEISIGLAGRVGRLVRSSCAVDSGYVNNFYMIVFKKKQFDGHLS

A0A6J1EDK4 uncharacterized protein LOC111433223 | e-value: 4.2e-83 | % identity: 66.27
Query:  MDLKLSKSPILHDHAFARRVFFRVFLFASAVSLIPILHILTSFDFKSFRLPKSPPCYAALHGGATPRRLPRGSYLFQGHFLNPVWDSFDSLHCQQNVNLT
        MDLKL KSPILHD AFARR+ FR+FLFA AVS+IP +HI TS+DFKSF LPKSPPC+AA HGGA   +LPRGSYLFQGHFLNP+WDS +S HCQ+ VNLT
Subjt:  MDLKLSKSPILHDHAFARRVFFRVFLFASAVSLIPILHILTSFDFKSFRLPKSPPCYAALHGGATPRRLPRGSYLFQGHFLNPVWDSFDSLHCQQNVNLT

Query:  VAAVQKLADGRHLFNRTARALYVGGSSSSAASALQDLGFSRVVGVDSGRVGS-----HGYELDFPNGSFDFVLFRGKFKVSVPELVVGEIERVLDRGGVG
        ++ +++L D +HLFN +ARAL+VG SSS+AAS LQDLGF   VG+D GR  S      GY+LD+PN SFDFVLFRGKFK+SVP+LVVGEIERVL  GG G
Subjt:  VAAVQKLADGRHLFNRTARALYVGGSSSSAASALQDLGFSRVVGVDSGRVGS-----HGYELDFPNGSFDFVLFRGKFKVSVPELVVGEIERVLDRGGVG

Query:  AVAFGVFRSPVAAEISIGLAGRVGRLVRSSCAVDSGYVNNFYMIVFKKK
        AV  G+  SPV    +IG AGR+  L++SSC V S  VNN  + VFKKK
Subjt:  AVAFGVFRSPVAAEISIGLAGRVGRLVRSSCAVDSGYVNNFYMIVFKKK

SwissProt top hits (columns: e-value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e-value, % identity, alignment)
AT3G53400.1 BEST Arabidopsis thaliana protein match is: conserved peptide upstream open reading frame 47 (TAIR:AT5G03190.1) | e-value: 3.3e-16 | % identity: 28.8
Query:  LKLSKSPILHDHAFARRVFFRVFLFASAVSLIPILHILTSFDFKSFRLPKSPPC----YAALHGGATPRRLPRGSYLFQGHFLNPVWDSFDSLHCQQNVN
        LKL   P     +F RRV  R  +   A S++ +L  L    ++   +  + PC     A       P        LF   FL PVW+  +S  C+ N+ 
Subjt:  LKLSKSPILHDHAFARRVFFRVFLFASAVSLIPILHILTSFDFKSFRLPKSPPC----YAALHGGATPRRLPRGSYLFQGHFLNPVWDSFDSLHCQQNVN

Query:  LTVAAVQKLADGRHLFNRTARALYVGGSSSSAASALQDLGFSRVVGVDSGRVGSHGY-----ELDFPNGSFDFVLFRGKFKVSVPELVVGEIERVLDRGG
        LT   V++L  G +L +  ++AL +G  S SA  A+   G S V       V +  +     EL + + SF FV       V+VP  +V EIER+L  GG
Subjt:  LTVAAVQKLADGRHLFNRTARALYVGGSSSSAASALQDLGFSRVVGVDSGRVGSHGY-----ELDFPNGSFDFVLFRGKFKVSVPELVVGEIERVLDRGG

Query:  VGAVAFGVFRSPVAAEISIGLAGRVGRLVRSSCAVDSGYVNNFYMIVFKK
         GA+  G      + E+ +     V  L+++S  V    +    ++VFK+
Subjt:  VGAVAFGVFRSPVAAEISIGLAGRVGRLVRSSCAVDSGYVNNFYMIVFKK

AT5G03190.1 conserved peptide upstream open reading frame 47 | e-value: 4.2e-11 | % identity: 28.39
Query:  RRVFFRVFLFASAVSLIPILHILTSFDFKSFRLPKSPPCYAALHGGATPRRLPR----GSYLFQGHFLNPVWDSFDSLHCQQNVNLTVAAVQKLADGRHL
        R   FR  + ASA+S++P+L +                     HG      L R    G  LF    + P W   ++   ++   + +A +     G  L
Subjt:  RRVFFRVFLFASAVSLIPILHILTSFDFKSFRLPKSPPCYAALHGGATPRRLPR----GSYLFQGHFLNPVWDSFDSLHCQQNVNLTVAAVQKLADGRHL

Query:  FNRTARALYVGGSSSSAASALQDLGFSRVVGVDSGRVGS-----HGYELDFP-NGSFDFVLFRGKFKVSVPELVVGEIERVLDRGGVGAVAFGVFRSPVA
         +  A+ L +G  S SA S  +++GFS V GV    + S     H  EL+   + SFDFVL      V+ P L+V E+ERVL  GG GAV      + + 
Subjt:  FNRTARALYVGGSSSSAASALQDLGFSRVVGVDSGRVGS-----HGYELDFP-NGSFDFVLFRGKFKVSVPELVVGEIERVLDRGGVGAVAFGVFRSPVA

Query:  AEISIGLAGRVGRLVRSSCAVDSGYVNNFYMIVFKK
          ++ GL        + S  V    ++ F +IVFK+
Subjt:  AEISIGLAGRVGRLVRSSCAVDSGYVNNFYMIVFKK

AT5G03190.2 conserved peptide upstream open reading frame 47 | e-value: 5.0e-12 | % identity: 28.22
Query:  DHAFARRVFFRVFLFASAVSLIPILHILTSFDFKSFRLPKSPPCYAALHGGATPRRLPR----GSYLFQGHFLNPVWDSFDSLHCQQNVNLTVAAVQKLA
        +H  +R   FR  + ASA+S++P+L +                     HG      L R    G  LF    + P W   ++   ++   + +A +    
Subjt:  DHAFARRVFFRVFLFASAVSLIPILHILTSFDFKSFRLPKSPPCYAALHGGATPRRLPR----GSYLFQGHFLNPVWDSFDSLHCQQNVNLTVAAVQKLA

Query:  DGRHLFNRTARALYVGGSSSSAASALQDLGFSRVVGVDSGRVGS-----HGYELDFP-NGSFDFVLFRGKFKVSVPELVVGEIERVLDRGGVGAVAFGVF
         G  L +  A+ L +G  S SA S  +++GFS V GV    + S     H  EL+   + SFDFVL      V+ P L+V E+ERVL  GG GAV     
Subjt:  DGRHLFNRTARALYVGGSSSSAASALQDLGFSRVVGVDSGRVGS-----HGYELDFP-NGSFDFVLFRGKFKVSVPELVVGEIERVLDRGGVGAVAFGVF

Query:  RSPVAAEISIGLAGRVGRLVRSSCAVDSGYVNNFYMIVFKK
         + +   ++ GL        + S  V    ++ F +IVFK+
Subjt:  RSPVAAEISIGLAGRVGRLVRSSCAVDSGYVNNFYMIVFKK


Sequences
CDS sequence
ATGGATTTGAAGCTCTCCAAATCTCCGATCCTCCACGACCACGCGTTCGCCCGGCGCGTGTTCTTCCGCGTCTTCCTATTCGCCTCCGCCGTCTCCCTCATTCCGATCCT
CCACATCCTCACCTCCTTCGACTTCAAATCCTTCCGCCTCCCCAAGTCGCCGCCCTGCTACGCCGCCCTCCACGGCGGCGCCACCCCCCGCCGCCTCCCCCGCGGCTCCT
ACCTCTTCCAGGGCCACTTCCTCAACCCCGTCTGGGATTCCTTCGACTCCCTCCATTGCCAACAGAACGTTAACCTCACCGTCGCCGCCGTCCAGAAATTAGCCGACGGG
CGGCATCTCTTCAATCGCACCGCCCGCGCTCTCTACGTCGGCGGGAGTTCCTCCTCCGCCGCCTCCGCCTTGCAGGATCTGGGATTTAGCCGTGTCGTCGGTGTCGATTC
GGGTCGGGTTGGATCGCACGGGTACGAGCTTGATTTCCCGAACGGGTCGTTTGATTTTGTCTTGTTTAGAGGGAAGTTTAAGGTTTCTGTTCCTGAATTGGTGGTGGGTG
AGATCGAGCGGGTTCTCGACCGCGGCGGGGTCGGGGCGGTGGCTTTCGGTGTGTTTCGTAGCCCGGTGGCGGCGGAGATTTCGATTGGATTGGCAGGAAGAGTGGGGAGA
TTGGTGAGATCTTCGTGTGCTGTGGATTCTGGGTATGTAAATAACTTTTATATGATTGTGTTTAAGAAGAAACAGTTTGATGGTCATCTTTCAGTTAAGTGTTGA
mRNA sequence
GCCCACTCAGCCAAACCGTCTTTAATCCACGTCAGCATCTTCTTCCCCAATCCCAATCGCAAGCCCTAACCCTAAAACCTACCCTCCATAAATCCAACCAACCAGAGAGC
TTTGAATTCCGCCATAGCCAAAGCTGTCATGGATTTGAAGCTCTCCAAATCTCCGATCCTCCACGACCACGCGTTCGCCCGGCGCGTGTTCTTCCGCGTCTTCCTATTCG
CCTCCGCCGTCTCCCTCATTCCGATCCTCCACATCCTCACCTCCTTCGACTTCAAATCCTTCCGCCTCCCCAAGTCGCCGCCCTGCTACGCCGCCCTCCACGGCGGCGCC
ACCCCCCGCCGCCTCCCCCGCGGCTCCTACCTCTTCCAGGGCCACTTCCTCAACCCCGTCTGGGATTCCTTCGACTCCCTCCATTGCCAACAGAACGTTAACCTCACCGT
CGCCGCCGTCCAGAAATTAGCCGACGGGCGGCATCTCTTCAATCGCACCGCCCGCGCTCTCTACGTCGGCGGGAGTTCCTCCTCCGCCGCCTCCGCCTTGCAGGATCTGG
GATTTAGCCGTGTCGTCGGTGTCGATTCGGGTCGGGTTGGATCGCACGGGTACGAGCTTGATTTCCCGAACGGGTCGTTTGATTTTGTCTTGTTTAGAGGGAAGTTTAAG
GTTTCTGTTCCTGAATTGGTGGTGGGTGAGATCGAGCGGGTTCTCGACCGCGGCGGGGTCGGGGCGGTGGCTTTCGGTGTGTTTCGTAGCCCGGTGGCGGCGGAGATTTC
GATTGGATTGGCAGGAAGAGTGGGGAGATTGGTGAGATCTTCGTGTGCTGTGGATTCTGGGTATGTAAATAACTTTTATATGATTGTGTTTAAGAAGAAACAGTTTGATG
GTCATCTTTCAGTTAAGTGTTGAAGTTACAATGCAAATGGGACTCATTTGAAATCAATCATCAAATATTTTGTTACTTGTTTCAAGTGAAAGATATGTGTGCTCTAATTT
TAGAGTCTTTTTTGTTTAGAACAAGTCGGGAGAGCCGAACTGCAAACCTTACGAATAGGGTATTACTGTTCGTGTTGAGCTAAATTATCCTTACACAAGGAAACTATCAA
ACCACCG
Protein sequence
MDLKLSKSPILHDHAFARRVFFRVFLFASAVSLIPILHILTSFDFKSFRLPKSPPCYAALHGGATPRRLPRGSYLFQGHFLNPVWDSFDSLHCQQNVNLTVAAVQKLADG
RHLFNRTARALYVGGSSSSAASALQDLGFSRVVGVDSGRVGSHGYELDFPNGSFDFVLFRGKFKVSVPELVVGEIERVLDRGGVGAVAFGVFRSPVAAEISIGLAGRVGR
LVRSSCAVDSGYVNNFYMIVFKKKQFDGHLSVKC