CuGenDBv2

Clc04G09230 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc04G09230
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Conserved peptide upstream open reading frame 46
Genome location: ClcChr04:22843502..22844581
RNA-Seq Expression: Clc04G09230
Synteny: Clc04G09230
Gene Ontology terms:
  GO:0032259 - methylation (biological process)
  GO:0016021 - integral component of membrane (cellular component)
  GO:0008168 - methyltransferase activity (molecular function)
InterPro domains: NA
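The genome location in the record above has the form chromosome:start..end. A minimal sketch of how such a string could be split into its parts and turned into a span length is shown here. The helper names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("ClcChr04:22843502..22844581")
length = span_length(start, end)
print(chrom, start, end, length)  # ClcChr04 22843502 22844581 1080
```

If the coordinates were instead 0-based and half-open, the length would be `end - start`, so the convention should be confirmed before the value is reused.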


Homology
GenBank top hits (e value, % identity, alignment)
KAA0047269.1 Methyltransferase type 11 [Cucumis melo var. makuwa]; e value 9.0e-109; identity 83.4%
Query:  MDLKFVKSPILHDPAFARRVFFRIFLFASAISLIPILHILTSYDFKSFHLPKSPPCHA----VADQPPRGSYLFQGHFLNPVWDSFDSVHCHETVNLTIS
        MDLKF+KSPIL DPAFA+RVFFR+FLFASAISLIPILHILTSYDFKSFHLPKSPPCHA      D  PRGSYLFQGHFLNPVWDSF+S+HC ETVNLTIS
Subjt:  MDLKFVKSPILHDPAFARRVFFRIFLFASAISLIPILHILTSYDFKSFHLPKSPPCHA----VADQPPRGSYLFQGHFLNPVWDSFDSVHCHETVNLTIS

Query:  VIKLLVDEKHLFNHTARALFVGASSSSAVSVLHDLGFSSAVGVDKGRFISLKKREVGYKLDYRNDSLDFVLFRGK-FKVSVPDLVVGEIERILAGGGIGA
        +IKLLVD KHLFNH+ARALFVG SSSSA SVL DLGFS A+GVDKGRFISLK+ EVGYKLDY N+S DFVLF GK  KVSVPDLVVGEIERIL GGGIGA
Subjt:  VIKLLVDEKHLFNHTARALFVGASSSSAVSVLHDLGFSSAVGVDKGRFISLKKREVGYKLDYRNDSLDFVLFRGK-FKVSVPDLVVGEIERILAGGGIGA

Query:  VVTGISSPISIGFAGRVGKLLKSSCVVHSGNVDKLYMSVFKKKQFDE
        VVTGISSPISIGF GRV KLLKSSCVV+SGNV+K+Y+SVFKKK F +
Subjt:  VVTGISSPISIGFAGRVGKLLKSSCVVHSGNVDKLYMSVFKKKQFDE

KAE8647389.1 hypothetical protein Csa_003425 [Cucumis sativus]; e value 1.6e-113; identity 83.86%
Query:  MDLKFVKSPILHDPAFARRVFFRIFLFASAISLIPILHILTSYDFKSFHLPKSPPCHA----VADQPPRGSYLFQGHFLNPVWDSFDSVHCHETVNLTIS
        MDLKF +SPILHDPAFA+RVFFR+FLFASAISLIPILHILTSYDFKSFHLPKSPPCHA      D  PRGSYLFQGHFLNPVWDSFDS+HC  TVNLTIS
Subjt:  MDLKFVKSPILHDPAFARRVFFRIFLFASAISLIPILHILTSYDFKSFHLPKSPPCHA----VADQPPRGSYLFQGHFLNPVWDSFDSVHCHETVNLTIS

Query:  VIKLLVDEKHLFNHTARALFVGASSSSAVSVLHDLGFSSAVGVDKGRFISLKKREVGYKLDYRNDSLDFVLFRGKFKVSVPDLVVGEIERILAGGGIGAV
        +IKLLV EKHLFNH+ARALFVG SSSSA SVLHDLGFS AVGVDKGRFISLK+ EVGYKLDY N S DFVLF+GK KVSVPDLVVGE+ERIL GGGIGAV
Subjt:  VIKLLVDEKHLFNHTARALFVGASSSSAVSVLHDLGFSSAVGVDKGRFISLKKREVGYKLDYRNDSLDFVLFRGKFKVSVPDLVVGEIERILAGGGIGAV

Query:  VTGISSPISIGFAGRVGKLLKSSCVVHSGNVDKLYMSVFKKKQF-DELPINCMS
        VTGISSPISIG  GRV KLLKSSCVV+SGNV+KLY+SVFKKK F D++PINC S
Subjt:  VTGISSPISIGFAGRVGKLLKSSCVVHSGNVDKLYMSVFKKKQF-DELPINCMS

XP_008449921.1 PREDICTED: uncharacterized protein LOC103491650 [Cucumis melo]; e value 5.6e-111; identity 82.42%
Query:  MDLKFVKSPILHDPAFARRVFFRIFLFASAISLIPILHILTSYDFKSFHLPKSPPCHA----VADQPPRGSYLFQGHFLNPVWDSFDSVHCHETVNLTIS
        MDLKF+KSPIL DPAFA+RVFFR+FLFASAISLIPILHILTSYDFKSFHLPKSPPCHA      D  PRGSYLFQGHFLNPVWDSF+S+HC ETVNLTIS
Subjt:  MDLKFVKSPILHDPAFARRVFFRIFLFASAISLIPILHILTSYDFKSFHLPKSPPCHA----VADQPPRGSYLFQGHFLNPVWDSFDSVHCHETVNLTIS

Query:  VIKLLVDEKHLFNHTARALFVGASSSSAVSVLHDLGFSSAVGVDKGRFISLKKREVGYKLDYRNDSLDFVLFRGK-FKVSVPDLVVGEIERILAGGGIGA
        +IKLLVD KHLFNH+ARALFVG SSSSA SVL DLGFS A+GVDKGRFISLK+ EVGYKLDY N+S DFVLF GK  KVSVPDLVVGEIERIL GGGIGA
Subjt:  VIKLLVDEKHLFNHTARALFVGASSSSAVSVLHDLGFSSAVGVDKGRFISLKKREVGYKLDYRNDSLDFVLFRGK-FKVSVPDLVVGEIERILAGGGIGA

Query:  VVTGISSPISIGFAGRVGKLLKSSCVVHSGNVDKLYMSVFKKKQFDE--LPINCMS
        VVTGISSPISIGF GRV KLLKSSCVV+SGNV+K+Y+SVFKKK F +  +PINC S
Subjt:  VVTGISSPISIGFAGRVGKLLKSSCVVHSGNVDKLYMSVFKKKQFDE--LPINCMS

XP_011657647.1 uncharacterized protein LOC105435880 [Cucumis sativus]; e value 1.6e-113; identity 83.86%
Query:  MDLKFVKSPILHDPAFARRVFFRIFLFASAISLIPILHILTSYDFKSFHLPKSPPCHA----VADQPPRGSYLFQGHFLNPVWDSFDSVHCHETVNLTIS
        MDLKF +SPILHDPAFA+RVFFR+FLFASAISLIPILHILTSYDFKSFHLPKSPPCHA      D  PRGSYLFQGHFLNPVWDSFDS+HC  TVNLTIS
Subjt:  MDLKFVKSPILHDPAFARRVFFRIFLFASAISLIPILHILTSYDFKSFHLPKSPPCHA----VADQPPRGSYLFQGHFLNPVWDSFDSVHCHETVNLTIS

Query:  VIKLLVDEKHLFNHTARALFVGASSSSAVSVLHDLGFSSAVGVDKGRFISLKKREVGYKLDYRNDSLDFVLFRGKFKVSVPDLVVGEIERILAGGGIGAV
        +IKLLV EKHLFNH+ARALFVG SSSSA SVLHDLGFS AVGVDKGRFISLK+ EVGYKLDY N S DFVLF+GK KVSVPDLVVGE+ERIL GGGIGAV
Subjt:  VIKLLVDEKHLFNHTARALFVGASSSSAVSVLHDLGFSSAVGVDKGRFISLKKREVGYKLDYRNDSLDFVLFRGKFKVSVPDLVVGEIERILAGGGIGAV

Query:  VTGISSPISIGFAGRVGKLLKSSCVVHSGNVDKLYMSVFKKKQF-DELPINCMS
        VTGISSPISIG  GRV KLLKSSCVV+SGNV+KLY+SVFKKK F D++PINC S
Subjt:  VTGISSPISIGFAGRVGKLLKSSCVVHSGNVDKLYMSVFKKKQF-DELPINCMS

XP_038883488.1 uncharacterized protein LOC120074441 [Benincasa hispida]; e value 2.5e-119; identity 87.4%
Query:  MDLKFVKSPILHDPAFARRVFFRIFLFASAISLIPILHILTSYDFKSFHLPKSPPCHAVA-----DQPPRGSYLFQGHFLNPVWDSFDSVHCHETVNLTI
        MDLKFVKSPILHD AFARR+FFRIFLF S ISLIPILHILTSYDFKSFHLPKSPPCHAVA     DQ PRGSYLFQGHFLNPVWDSFDSVHC ETVNLT+
Subjt:  MDLKFVKSPILHDPAFARRVFFRIFLFASAISLIPILHILTSYDFKSFHLPKSPPCHAVA-----DQPPRGSYLFQGHFLNPVWDSFDSVHCHETVNLTI

Query:  SVIKLLVDEKHLFNHTARALFVGASSSSAVSVLHDLGFSSAVGVDKGRFISLKKREVGYKLDYRNDSLDFVLFRGKFKVSVPDLVVGEIERILAGGGIGA
        S+IKLLV+EKHLFNH+ARALFVG SSSSA S+L DLGFS AVGVDKGRFISL+KR VGYKLDY NDS DFVLF+GK KVSVPDLVVGEIERILAGGGIGA
Subjt:  SVIKLLVDEKHLFNHTARALFVGASSSSAVSVLHDLGFSSAVGVDKGRFISLKKREVGYKLDYRNDSLDFVLFRGKFKVSVPDLVVGEIERILAGGGIGA

Query:  VVTGISSPISIGFAGRVGKLLKSSCVVHSGNVDKLYMSVFKKKQFDELPINCMS
        VVTGISSPISIG AGRVGKLLKSSCVV+SGNV+KLY+SVFKKKQF ELPINC S
Subjt:  VVTGISSPISIGFAGRVGKLLKSSCVVHSGNVDKLYMSVFKKKQFDELPINCMS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KEM7 Uncharacterized protein; e value 7.7e-114; identity 83.86%
Query:  MDLKFVKSPILHDPAFARRVFFRIFLFASAISLIPILHILTSYDFKSFHLPKSPPCHA----VADQPPRGSYLFQGHFLNPVWDSFDSVHCHETVNLTIS
        MDLKF +SPILHDPAFA+RVFFR+FLFASAISLIPILHILTSYDFKSFHLPKSPPCHA      D  PRGSYLFQGHFLNPVWDSFDS+HC  TVNLTIS
Subjt:  MDLKFVKSPILHDPAFARRVFFRIFLFASAISLIPILHILTSYDFKSFHLPKSPPCHA----VADQPPRGSYLFQGHFLNPVWDSFDSVHCHETVNLTIS

Query:  VIKLLVDEKHLFNHTARALFVGASSSSAVSVLHDLGFSSAVGVDKGRFISLKKREVGYKLDYRNDSLDFVLFRGKFKVSVPDLVVGEIERILAGGGIGAV
        +IKLLV EKHLFNH+ARALFVG SSSSA SVLHDLGFS AVGVDKGRFISLK+ EVGYKLDY N S DFVLF+GK KVSVPDLVVGE+ERIL GGGIGAV
Subjt:  VIKLLVDEKHLFNHTARALFVGASSSSAVSVLHDLGFSSAVGVDKGRFISLKKREVGYKLDYRNDSLDFVLFRGKFKVSVPDLVVGEIERILAGGGIGAV

Query:  VTGISSPISIGFAGRVGKLLKSSCVVHSGNVDKLYMSVFKKKQF-DELPINCMS
        VTGISSPISIG  GRV KLLKSSCVV+SGNV+KLY+SVFKKK F D++PINC S
Subjt:  VTGISSPISIGFAGRVGKLLKSSCVVHSGNVDKLYMSVFKKKQF-DELPINCMS

A0A1S3BMI8 uncharacterized protein LOC103491650; e value 2.7e-111; identity 82.42%
Query:  MDLKFVKSPILHDPAFARRVFFRIFLFASAISLIPILHILTSYDFKSFHLPKSPPCHA----VADQPPRGSYLFQGHFLNPVWDSFDSVHCHETVNLTIS
        MDLKF+KSPIL DPAFA+RVFFR+FLFASAISLIPILHILTSYDFKSFHLPKSPPCHA      D  PRGSYLFQGHFLNPVWDSF+S+HC ETVNLTIS
Subjt:  MDLKFVKSPILHDPAFARRVFFRIFLFASAISLIPILHILTSYDFKSFHLPKSPPCHA----VADQPPRGSYLFQGHFLNPVWDSFDSVHCHETVNLTIS

Query:  VIKLLVDEKHLFNHTARALFVGASSSSAVSVLHDLGFSSAVGVDKGRFISLKKREVGYKLDYRNDSLDFVLFRGK-FKVSVPDLVVGEIERILAGGGIGA
        +IKLLVD KHLFNH+ARALFVG SSSSA SVL DLGFS A+GVDKGRFISLK+ EVGYKLDY N+S DFVLF GK  KVSVPDLVVGEIERIL GGGIGA
Subjt:  VIKLLVDEKHLFNHTARALFVGASSSSAVSVLHDLGFSSAVGVDKGRFISLKKREVGYKLDYRNDSLDFVLFRGK-FKVSVPDLVVGEIERILAGGGIGA

Query:  VVTGISSPISIGFAGRVGKLLKSSCVVHSGNVDKLYMSVFKKKQFDE--LPINCMS
        VVTGISSPISIGF GRV KLLKSSCVV+SGNV+K+Y+SVFKKK F +  +PINC S
Subjt:  VVTGISSPISIGFAGRVGKLLKSSCVVHSGNVDKLYMSVFKKKQFDE--LPINCMS

A0A5A7U1B0 Methyltransferase type 11; e value 4.4e-109; identity 83.4%
Query:  MDLKFVKSPILHDPAFARRVFFRIFLFASAISLIPILHILTSYDFKSFHLPKSPPCHA----VADQPPRGSYLFQGHFLNPVWDSFDSVHCHETVNLTIS
        MDLKF+KSPIL DPAFA+RVFFR+FLFASAISLIPILHILTSYDFKSFHLPKSPPCHA      D  PRGSYLFQGHFLNPVWDSF+S+HC ETVNLTIS
Subjt:  MDLKFVKSPILHDPAFARRVFFRIFLFASAISLIPILHILTSYDFKSFHLPKSPPCHA----VADQPPRGSYLFQGHFLNPVWDSFDSVHCHETVNLTIS

Query:  VIKLLVDEKHLFNHTARALFVGASSSSAVSVLHDLGFSSAVGVDKGRFISLKKREVGYKLDYRNDSLDFVLFRGK-FKVSVPDLVVGEIERILAGGGIGA
        +IKLLVD KHLFNH+ARALFVG SSSSA SVL DLGFS A+GVDKGRFISLK+ EVGYKLDY N+S DFVLF GK  KVSVPDLVVGEIERIL GGGIGA
Subjt:  VIKLLVDEKHLFNHTARALFVGASSSSAVSVLHDLGFSSAVGVDKGRFISLKKREVGYKLDYRNDSLDFVLFRGK-FKVSVPDLVVGEIERILAGGGIGA

Query:  VVTGISSPISIGFAGRVGKLLKSSCVVHSGNVDKLYMSVFKKKQFDE
        VVTGISSPISIGF GRV KLLKSSCVV+SGNV+K+Y+SVFKKK F +
Subjt:  VVTGISSPISIGFAGRVGKLLKSSCVVHSGNVDKLYMSVFKKKQFDE

A0A6J1EDK4 uncharacterized protein LOC111433223; e value 9.4e-104; identity 76.28%
Query:  MDLKFVKSPILHDPAFARRVFFRIFLFASAISLIPILHILTSYDFKSFHLPKSPPCHAV----ADQPPRGSYLFQGHFLNPVWDSFDSVHCHETVNLTIS
        MDLK  KSPILHD AFARR+ FR+FLFA A+S+IP +HI TSYDFKSFHLPKSPPCHA     ADQ PRGSYLFQGHFLNP+WDS +S HC ETVNLTIS
Subjt:  MDLKFVKSPILHDPAFARRVFFRIFLFASAISLIPILHILTSYDFKSFHLPKSPPCHAV----ADQPPRGSYLFQGHFLNPVWDSFDSVHCHETVNLTIS

Query:  VIKLLVDEKHLFNHTARALFVGASSSSAVSVLHDLGFSSAVGVDKGRFISLKKREVGYKLDYRNDSLDFVLFRGKFKVSVPDLVVGEIERILAGGGIGAV
        VI+ LVDEKHLFNH+ARALFVG SSS+A SVL DLGF  AVG+DKGRFIS+K+REVGYKLDY NDS DFVLFRGKFK+SVPDLVVGEIER+LAGGG GAV
Subjt:  VIKLLVDEKHLFNHTARALFVGASSSSAVSVLHDLGFSSAVGVDKGRFISLKKREVGYKLDYRNDSLDFVLFRGKFKVSVPDLVVGEIERILAGGGIGAV

Query:  VTGISSPISIGFAGRVGKLLKSSCVVHSGNVDKLYMSVFKKKQFDELPINCMS
        V GI+SP++IGFAGR+  LLKSSCVV S  V+ L ++VFKKK   E PINC S
Subjt:  VTGISSPISIGFAGRVGKLLKSSCVVHSGNVDKLYMSVFKKKQFDELPINCMS

A0A6J1IPF9 uncharacterized protein LOC111477627; e value 1.5e-101; identity 75.1%
Query:  MDLKFVKSPILHDPAFARRVFFRIFLFASAISLIPILHILTSYDFKSFHLPKSPPCHAV----ADQPPRGSYLFQGHFLNPVWDSFDSVHCHETVNLTIS
        MDLK  KSPILHD AFARR+ FR+FLFA A+S+IP +HI TSYD KSFHLPKSPPCH+     ADQ PRGSYLFQGHFLNP+WDS +S HC ETVNLTIS
Subjt:  MDLKFVKSPILHDPAFARRVFFRIFLFASAISLIPILHILTSYDFKSFHLPKSPPCHAV----ADQPPRGSYLFQGHFLNPVWDSFDSVHCHETVNLTIS

Query:  VIKLLVDEKHLFNHTARALFVGASSSSAVSVLHDLGFSSAVGVDKGRFISLKKREVGYKLDYRNDSLDFVLFRGKFKVSVPDLVVGEIERILAGGGIGAV
        VI+ L DEKHLFNH+ARALFVG SSS+A SVL DLGF  AVG+ KGRFISLK+REVGYKLDY NDS DFVLFRGKFK+SVPDLVVGEIER+LAGGG GAV
Subjt:  VIKLLVDEKHLFNHTARALFVGASSSSAVSVLHDLGFSSAVGVDKGRFISLKKREVGYKLDYRNDSLDFVLFRGKFKVSVPDLVVGEIERILAGGGIGAV

Query:  VTGISSPISIGFAGRVGKLLKSSCVVHSGNVDKLYMSVFKKKQFDELPINCMS
        V GISSP++IGFAGR+  LLKSSCVV S  V+ L ++VF+KK   E PINC S
Subjt:  VTGISSPISIGFAGRVGKLLKSSCVVHSGNVDKLYMSVFKKKQFDELPINCMS

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT3G53400.1 BEST Arabidopsis thaliana protein match is: conserved peptide upstream open reading frame 47 (TAIR:AT5G03190.1); e value 3.5e-18; identity 29.27%
Query:  AFARRVFFRIFLFASAISLIPILHILTSYDFKSFHLPKSPPC---HAVADQPPRGSYLFQGH-------FLNPVWDSFDSVHCHETVNLTISVIKLLVDE
        +F RRV  R  +   A S++ +L  L    ++   +  + PC       +    G +LF G+       FL PVW+  +S  C + + LT  V++ L   
Subjt:  AFARRVFFRIFLFASAISLIPILHILTSYDFKSFHLPKSPPC---HAVADQPPRGSYLFQGH-------FLNPVWDSFDSVHCHETVNLTISVIKLLVDE

Query:  KHLFNHTARALFVGASSSSAVSVLHDLGFSSAVGVDKGRFISLKKREVGYKLDYRNDSLDFVLFRGKFKVSVPDLVVGEIERILAGGGIGAVVTGISSPI
         +L ++ ++AL +G  S SAV  ++  G S           + K R+   +L Y + S  FV       V+VP  +V EIERIL  GG GA++ G +S  
Subjt:  KHLFNHTARALFVGASSSSAVSVLHDLGFSSAVGVDKGRFISLKKREVGYKLDYRNDSLDFVLFRGKFKVSVPDLVVGEIERILAGGGIGAVVTGISSPI

Query:  S----IGFAGRVGKLLKSSCVVHSGNVDKLYMSVFKKKQFDELPIN
             +     V  LLK+S VVH  ++ K  + VFK+   D   ++
Subjt:  S----IGFAGRVGKLLKSSCVVHSGNVDKLYMSVFKKKQFDELPIN

AT5G03190.1 conserved peptide upstream open reading frame 47; e value 1.8e-14; identity 29.85%
Query:  MDLKFVKSPILHDPAFARRVFFRIFLFASAISLIPILHILT-SYDFKSFHLPKSPPCHAVADQPPRGSYLFQG------HFLNPVWDSFDSVHCHETVNL
        M +K +K  I    +  R   FR  + ASA+S++P+L +   ++ F           H   D    G ++  G        + P W         ET   
Subjt:  MDLKFVKSPILHDPAFARRVFFRIFLFASAISLIPILHILT-SYDFKSFHLPKSPPCHAVADQPPRGSYLFQG------HFLNPVWDSFDSVHCHETVNL

Query:  TISVIKLLVDE---KHLFNHTARALFVGASSSSAVSVLHDLGFSSAVGVDKGRFISLKKREVGYKLDYRND-SLDFVLFRGKFKVSVPDLVVGEIERILA
           VI  LVDE     L ++ A+ L +G  S SAVS   ++GFS   GV K    S   R+   +L+   D S DFVL      V+ P L+V E+ER+L 
Subjt:  TISVIKLLVDE---KHLFNHTARALFVGASSSSAVSVLHDLGFSSAVGVDKGRFISLKKREVGYKLDYRND-SLDFVLFRGKFKVSVPDLVVGEIERILA

Query:  GGGIGAVVTGISSPISIGFAGRVGKLLKSSCVVHSGNVDKLYMSVFKKKQFD--------ELPINCMS
         GG GAV+   ++         V   LK S +V   N+DK  + VFK+   +        +LP +C S
Subjt:  GGGIGAVVTGISSPISIGFAGRVGKLLKSSCVVHSGNVDKLYMSVFKKKQFD--------ELPINCMS

AT5G03190.2 conserved peptide upstream open reading frame 47; e value 3.1e-14; identity 30.16%
Query:  ARRVFFRIFLFASAISLIPILHILT-SYDFKSFHLPKSPPCHAVADQPPRGSYLFQG------HFLNPVWDSFDSVHCHETVNLTISVIKLLVDE---KH
        +R   FR  + ASA+S++P+L +   ++ F           H   D    G ++  G        + P W         ET      VI  LVDE     
Subjt:  ARRVFFRIFLFASAISLIPILHILT-SYDFKSFHLPKSPPCHAVADQPPRGSYLFQG------HFLNPVWDSFDSVHCHETVNLTISVIKLLVDE---KH

Query:  LFNHTARALFVGASSSSAVSVLHDLGFSSAVGVDKGRFISLKKREVGYKLDYRND-SLDFVLFRGKFKVSVPDLVVGEIERILAGGGIGAVVTGISSPIS
        L ++ A+ L +G  S SAVS   ++GFS   GV K    S   R+   +L+   D S DFVL      V+ P L+V E+ER+L  GG GAV+   ++   
Subjt:  LFNHTARALFVGASSSSAVSVLHDLGFSSAVGVDKGRFISLKKREVGYKLDYRND-SLDFVLFRGKFKVSVPDLVVGEIERILAGGGIGAVVTGISSPIS

Query:  IGFAGRVGKLLKSSCVVHSGNVDKLYMSVFKKKQFD--------ELPINCMS
              V   LK S +V   N+DK  + VFK+   +        +LP +C S
Subjt:  IGFAGRVGKLLKSSCVVHSGNVDKLYMSVFKKKQFD--------ELPINCMS


Sequences
CDS sequence
ATGGATTTAAAGTTCGTGAAATCTCCGATCTTACACGACCCTGCATTCGCTAGACGCGTCTTCTTCCGTATCTTCTTATTCGCTTCCGCCATTTCCCTCATTCCCATCCT
CCACATTCTCACTTCTTACGATTTCAAATCCTTCCATTTACCCAAATCCCCACCCTGTCACGCCGTCGCAGATCAACCCCCCCGAGGCTCGTACCTATTCCAAGGCCATT
TCCTAAACCCCGTTTGGGATTCTTTCGATTCAGTCCATTGCCATGAAACCGTGAATCTCACCATCTCCGTCATCAAACTTCTTGTCGATGAAAAGCATCTGTTCAACCAC
ACCGCTAGAGCTCTGTTCGTCGGAGCAAGTTCGTCCTCCGCCGTGTCGGTCCTTCATGATTTAGGATTTTCCAGCGCCGTCGGAGTTGATAAGGGTCGGTTTATATCGCT
GAAAAAGAGGGAAGTTGGGTATAAACTTGATTACCGGAATGATTCGTTGGATTTCGTTTTGTTTAGAGGGAAATTCAAGGTCTCTGTTCCTGATTTGGTGGTGGGTGAAA
TTGAACGGATTCTTGCCGGTGGCGGAATTGGGGCGGTGGTTACCGGAATCAGTAGTCCAATTTCGATTGGATTCGCCGGGAGAGTAGGGAAATTACTGAAATCTTCTTGT
GTTGTGCATTCGGGGAATGTTGATAAGTTATATATGAGTGTATTCAAGAAGAAACAGTTTGATGAGCTTCCAATTAATTGCATGAGTTAA
mRNA sequence
CCACAGTCTCTACTCCACGTCAGCTTCTTCTTCCCCAAACCCTAACCCGAACTCTCCATAAATCAACAGAGCTTCCATTTCCACACTCAGAGCTCGAACGGAGAAATAAT
CCAAAACCATTTTGCTATTTCTCTGAAGAATCAAAGAGCTTCCGATGGATTTAAAGTTCGTGAAATCTCCGATCTTACACGACCCTGCATTCGCTAGACGCGTCTTCTTC
CGTATCTTCTTATTCGCTTCCGCCATTTCCCTCATTCCCATCCTCCACATTCTCACTTCTTACGATTTCAAATCCTTCCATTTACCCAAATCCCCACCCTGTCACGCCGT
CGCAGATCAACCCCCCCGAGGCTCGTACCTATTCCAAGGCCATTTCCTAAACCCCGTTTGGGATTCTTTCGATTCAGTCCATTGCCATGAAACCGTGAATCTCACCATCT
CCGTCATCAAACTTCTTGTCGATGAAAAGCATCTGTTCAACCACACCGCTAGAGCTCTGTTCGTCGGAGCAAGTTCGTCCTCCGCCGTGTCGGTCCTTCATGATTTAGGA
TTTTCCAGCGCCGTCGGAGTTGATAAGGGTCGGTTTATATCGCTGAAAAAGAGGGAAGTTGGGTATAAACTTGATTACCGGAATGATTCGTTGGATTTCGTTTTGTTTAG
AGGGAAATTCAAGGTCTCTGTTCCTGATTTGGTGGTGGGTGAAATTGAACGGATTCTTGCCGGTGGCGGAATTGGGGCGGTGGTTACCGGAATCAGTAGTCCAATTTCGA
TTGGATTCGCCGGGAGAGTAGGGAAATTACTGAAATCTTCTTGTGTTGTGCATTCGGGGAATGTTGATAAGTTATATATGAGTGTATTCAAGAAGAAACAGTTTGATGAG
CTTCCAATTAATTGCATGAGTTAAGATAGGATTAAGGACCTCTAATGAAGGATTAGCACAAGAAATTATAAAAAGCCATGCCTAATGAATGGTTGGTTGTTTGTAGTTTG
TACATACCGTAATTGGGGCTCATTAGAATTTGTTGCATCATTGTTATCTACGTAATATTGTGTTGAAATGGATGAGAAATTATTTGAGAA
Protein sequence
MDLKFVKSPILHDPAFARRVFFRIFLFASAISLIPILHILTSYDFKSFHLPKSPPCHAVADQPPRGSYLFQGHFLNPVWDSFDSVHCHETVNLTISVIKLLVDEKHLFNH
TARALFVGASSSSAVSVLHDLGFSSAVGVDKGRFISLKKREVGYKLDYRNDSLDFVLFRGKFKVSVPDLVVGEIERILAGGGIGAVVTGISSPISIGFAGRVGKLLKSSC
VVHSGNVDKLYMSVFKKKQFDELPINCMS
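
The CDS and protein sequences listed above can be cross-checked by translating the CDS codon by codon. The following is a minimal sketch, not the database's own pipeline: it assumes the standard genetic code (NCBI translation table 1), and the names `CODON_TABLE` and `translate` are hypothetical. The two test strings are the first 30 nucleotides and the final line of the CDS as printed in this record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the third position varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a nucleotide string in frame 1; '*' marks a stop codon.

    Whitespace is removed, trailing bases that do not fill a codon are
    ignored, and codons with non-ACGT characters are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 30 nt of the CDS in this record.
cds_start = "ATGGATTTAAAGTTCGTGAAATCTCCGATC"
# Final line of the CDS in this record (in frame: 660 nt precede it).
cds_end = (
    "GTTGTGCATTCGGGGAATGTTGATAAGTTATATATGAGTGTATTCAAGAAGAAACAGTTT"
    "GATGAGCTTCCAATTAATTGCATGAGTTAA"
)

print(translate(cds_start))  # MDLKFVKSPI
print(translate(cds_end))    # VVHSGNVDKLYMSVFKKKQFDELPINCMS*
```

Both outputs agree with the start and end of the protein sequence above. The trailing `*` corresponds to the TAA stop codon, which the protein listing omits.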