CuGenDBv2

Clc08G04650 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc08G04650
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: DUF2232 domain-containing protein
Genome location: ClcChr08:14118655..14128804
RNA-Seq Expression: Clc08G04650
Synteny: Clc08G04650
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR018710 - Protein of unknown function DUF2232
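
The genome location above follows a `chromosome:start..end` layout. A minimal sketch of how such a string could be split into its parts is given here; the names `Locus` and `parse_location` are hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """Hypothetical container for a parsed genome location."""
    chromosome: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Locus:
    """Parse a 'chromosome:start..end' string into a Locus."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chromosome, start, end = match.groups()
    return Locus(chromosome, int(start), int(end))


locus = parse_location("ClcChr08:14118655..14128804")
print(locus.chromosome, locus.start, locus.end, locus.span)
```

Under the inclusive-coordinate assumption the gene spans 10150 bp on ClcChr08, which is longer than the mRNA listed further down and would be consistent with the presence of introns.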


Homology
GenBank top hits (e value, % identity, alignment)
KAG6579706.1 hypothetical protein SDJN03_24154, partial [Cucurbita argyrosperma subsp. sororia] | e value: 2.2e-138 | identity: 90.67%
Query:  MISGKLYPSYSTSCILPPTRT---TLINLHLPLLKISSTLRLISFQSVSLSFPTFFSSKSTAKSTRFSNSVAKVYSYEGQNPITLSDLEDLSENGVVYKK
        MISG LYPS STSCI PP  T   T  ++HLPLLKISS LRLISFQSVSLSFPTF +SKS+ KSTR  NSVAKVYS+EGQNP +LSDLEDLSENGVVYKK
Subjt:  MISGKLYPSYSTSCILPPTRT---TLINLHLPLLKISSTLRLISFQSVSLSFPTFFSSKSTAKSTRFSNSVAKVYSYEGQNPITLSDLEDLSENGVVYKK

Query:  TLAMVECSMFAALNGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTYLLRHGLVGFTMGSLWRLGANWSTSIFL
        TLAMVECSMFAALNGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALT+LLRHGLVG TMGSLWRLGANWSTSIFL
Subjt:  TLAMVECSMFAALNGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTYLLRHGLVGFTMGSLWRLGANWSTSIFL

Query:  CTIVRALGAVGYVLISSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTLVLLNSGCFMFLLHLLYSLFLTRLGLKTSLTLPRWLEKAM
        CTIVRALGAVGYVLISSFLIRENILALITINIHASLTLIFTA GVNLIPSMNAIYAIFGTLV+LN GCFMFLLHLLYS+FLTRLGLKTSLTLPRWLEKAM
Subjt:  CTIVRALGAVGYVLISSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTLVLLNSGCFMFLLHLLYSLFLTRLGLKTSLTLPRWLEKAM

XP_022929150.1 uncharacterized protein LOC111435817 [Cucurbita moschata] | e value: 1.3e-138 | identity: 90.07%
Query:  MISGKLYPSYSTSCILPP-----TRTTLINLHLPLLKISSTLRLISFQSVSLSFPTFFSSKSTAKSTRFSNSVAKVYSYEGQNPITLSDLEDLSENGVVY
        MISG LYPS STSCI PP     + T  ++L LPLLKISS LRLISF+SVSLSFPTF +SKS+ KSTRFSNSVAKVYS+EGQNP +LSDLEDLSENGVVY
Subjt:  MISGKLYPSYSTSCILPP-----TRTTLINLHLPLLKISSTLRLISFQSVSLSFPTFFSSKSTAKSTRFSNSVAKVYSYEGQNPITLSDLEDLSENGVVY

Query:  KKTLAMVECSMFAALNGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTYLLRHGLVGFTMGSLWRLGANWSTSI
        KKTLAMVECSMFAALNGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALT+LLRHGLVG TMGSLWRLGANWSTSI
Subjt:  KKTLAMVECSMFAALNGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTYLLRHGLVGFTMGSLWRLGANWSTSI

Query:  FLCTIVRALGAVGYVLISSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTLVLLNSGCFMFLLHLLYSLFLTRLGLKTSLTLPRWLEK
        FLCTIVRALGAVGYVLISSFLIRENILALITINIHASLTLIFTA GVNLIPSMNAIYAIFGTLV+LN GCFMFLLHLLYS+FLTRLGLKTSLTLPRWLEK
Subjt:  FLCTIVRALGAVGYVLISSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTLVLLNSGCFMFLLHLLYSLFLTRLGLKTSLTLPRWLEK

Query:  AM
        AM
Subjt:  AM

XP_022969819.1 uncharacterized protein LOC111468904 [Cucurbita maxima] | e value: 1.5e-139 | identity: 91.28%
Query:  MISGKLYPSYSTSCILPPTRT-TLINLHLPLLKISSTLRLISFQSVSLSFPTFFSSKSTAKSTRFSNSVAKVYSYEGQNPITLSDLEDLSENGVVYKKTL
        MISG LYPS STSCI PP RT T  ++HLPLLKIS+ LRLISF+SVSLSFPTF +SKS+ KSTRFSNSVAKVYS+EGQNP +LSDLEDLSENGVVYKKTL
Subjt:  MISGKLYPSYSTSCILPPTRT-TLINLHLPLLKISSTLRLISFQSVSLSFPTFFSSKSTAKSTRFSNSVAKVYSYEGQNPITLSDLEDLSENGVVYKKTL

Query:  AMVECSMFAALNGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTYLLRHGLVGFTMGSLWRLGANWSTSIFLCT
        AMVECSMFAALNGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALT+LLRHGLVG TMGSLWRLGANWSTSIFLCT
Subjt:  AMVECSMFAALNGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTYLLRHGLVGFTMGSLWRLGANWSTSIFLCT

Query:  IVRALGAVGYVLISSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTLVLLNSGCFMFLLHLLYSLFLTRLGLKTSLTLPRWLEKAM
        IVRALGAVGYVLISSFLIRENILALITINIHASLTLIFTA GVNLIPSM+AIYAIFGTLV+LN GCFMFLLHLLYS+FLTRLGLKTSLTLPRWLEKAM
Subjt:  IVRALGAVGYVLISSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTLVLLNSGCFMFLLHLLYSLFLTRLGLKTSLTLPRWLEKAM

XP_023549519.1 uncharacterized protein LOC111807999 [Cucurbita pepo subsp. pepo] | e value: 8.1e-141 | identity: 91.67%
Query:  MISGKLYPSYSTSCILPPTRT---TLINLHLPLLKISSTLRLISFQSVSLSFPTFFSSKSTAKSTRFSNSVAKVYSYEGQNPITLSDLEDLSENGVVYKK
        MISGKLYPS STSCI PP  T   T +++HLPLLKISS LRLISFQSVSLSFPTF +SKS+ KSTRFSNSVAKVYS+EGQNP +LSDLEDLSENGVVYKK
Subjt:  MISGKLYPSYSTSCILPPTRT---TLINLHLPLLKISSTLRLISFQSVSLSFPTFFSSKSTAKSTRFSNSVAKVYSYEGQNPITLSDLEDLSENGVVYKK

Query:  TLAMVECSMFAALNGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTYLLRHGLVGFTMGSLWRLGANWSTSIFL
        TLAMVECSMFAALNGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALT+LLRHGLVG TMGSLWRLGANWSTSIFL
Subjt:  TLAMVECSMFAALNGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTYLLRHGLVGFTMGSLWRLGANWSTSIFL

Query:  CTIVRALGAVGYVLISSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTLVLLNSGCFMFLLHLLYSLFLTRLGLKTSLTLPRWLEKAM
        CTIVRALGAVGYVLISSFLIRENILALITINIHASLTLIFTA GVNLIPSMNAIYAIFGTLV+LN GCFMFLLHLLYS+FLTRLGLKTSLTLPRWLEKAM
Subjt:  CTIVRALGAVGYVLISSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTLVLLNSGCFMFLLHLLYSLFLTRLGLKTSLTLPRWLEKAM

XP_038885169.1 uncharacterized protein LOC120075651 [Benincasa hispida] | e value: 7.9e-144 | identity: 93.27%
Query:  MISGKLYPSYSTSCILPPTRTTLINLHLPLLKISSTLRLISFQSVSLSFPTFFSSKSTAKSTRFSNSVAKVYSYEGQNPITLSDLEDLSENGVVYKKTLA
        MISGKLYPSYS SCI PP +T   NLHLPLL+ISSTLRLISFQSVSLSFP+FF+SKS+AKSTRFSNSV KVYSYEGQNPI LSDLEDLSE+G VYKKTLA
Subjt:  MISGKLYPSYSTSCILPPTRTTLINLHLPLLKISSTLRLISFQSVSLSFPTFFSSKSTAKSTRFSNSVAKVYSYEGQNPITLSDLEDLSENGVVYKKTLA

Query:  MVECSMFAALNGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTYLLRHGLVGFTMGSLWRLGANWSTSIFLCTI
        MVECSMFAALNGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTYLLRHGLVGFTMGSLWRLGANWSTSIFLCTI
Subjt:  MVECSMFAALNGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTYLLRHGLVGFTMGSLWRLGANWSTSIFLCTI

Query:  VRALGAVGYVLISSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTLVLLNSGCFMFLLHLLYSLFLTRLGLKTSLTLPRWLEKAM
        VRALGAVGYVL+SSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYAIFG LV LN GCFMFLLHLLYS+FLTRLGLKTSLTLPRWLEKAM
Subjt:  VRALGAVGYVLISSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTLVLLNSGCFMFLLHLLYSLFLTRLGLKTSLTLPRWLEKAM

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K8H0 Uncharacterized protein | e value: 5.3e-138 | identity: 90.03%
Query:  MISGKLYPSY--STSCILPPTRTTLI--NLHLPLLKISSTLRLISFQSVSLSFPTFFSSKSTAKSTRFSNSVAKVYSYEGQNPITLSDLEDLSENGVVYK
        MISGKLY SY  S+SCI PPT T+    NLHL  LKISSTLRLISFQS SLSFP+ F SKS+AKSTRFS+S+ +VYSYEGQN ITLSDL+DLSENGVVYK
Subjt:  MISGKLYPSY--STSCILPPTRTTLI--NLHLPLLKISSTLRLISFQSVSLSFPTFFSSKSTAKSTRFSNSVAKVYSYEGQNPITLSDLEDLSENGVVYK

Query:  KTLAMVECSMFAALNGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTYLLRHGLVGFTMGSLWRLGANWSTSIF
        KTLAMVECSMFAALNGLVYFLSNSLALENYFGCFFCLPIVISSMRWGI+AGRKTMVATFLLLLVLSGPVKALTYLLRHGLVGFTMGSLWRLGANWSTSIF
Subjt:  KTLAMVECSMFAALNGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTYLLRHGLVGFTMGSLWRLGANWSTSIF

Query:  LCTIVRALGAVGYVLISSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTLVLLNSGCFMFLLHLLYSLFLTRLGLKTSLTLPRWLEKA
        LCTIVRA GAVGYVL+SSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTLV LN GCFMFLLHLLYS+FLTRLGLKTSLTLPRWLEKA
Subjt:  LCTIVRALGAVGYVLISSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTLVLLNSGCFMFLLHLLYSLFLTRLGLKTSLTLPRWLEKA

Query:  M
        M
Subjt:  M

A0A1S4DVX7 uncharacterized protein LOC103488678 isoform X5 | e value: 5.0e-136 | identity: 87.1%
Query:  MISGKLYPSYSTSCILPPTRTTLI----NLHLPLLKISSTLRLISFQSVSLSFPTFFSSKSTAKSTRFSNSVAKVYSYEGQNPITLSDLEDLSENGVVYK
        MISGKLY S S+SCI PPT T       NLHL  LKISSTLRLISFQSVSLS P+ F+SKS+AKSTRFSNS+ +VYSYEGQN ITLSDL+DLSENGVVYK
Subjt:  MISGKLYPSYSTSCILPPTRTTLI----NLHLPLLKISSTLRLISFQSVSLSFPTFFSSKSTAKSTRFSNSVAKVYSYEGQNPITLSDLEDLSENGVVYK

Query:  KTLAMVECSMFAALNGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTM---------VATFLLLLVLSGPVKALTYLLRHGLVGFTMGSLWRL
        KTLAMVECSMFAALNGLVYFLSNSLALENYFGCFFCLPIVISSMRWGI+AGRKTM         VATFLLLLVLSGPVKALTYLLRHGLVGFTMGSLWRL
Subjt:  KTLAMVECSMFAALNGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTM---------VATFLLLLVLSGPVKALTYLLRHGLVGFTMGSLWRL

Query:  GANWSTSIFLCTIVRALGAVGYVLISSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTLVLLNSGCFMFLLHLLYSLFLTRLGLKTSL
        GANWSTSIFLCTIVRA GAVGYVL+SSFLIRENIL+LITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTLV LN GCFMFLLHLLYS+FLTRLGLKTSL
Subjt:  GANWSTSIFLCTIVRALGAVGYVLISSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTLVLLNSGCFMFLLHLLYSLFLTRLGLKTSL

Query:  TLPRWLEKAM
        TLPRWLEKAM
Subjt:  TLPRWLEKAM

A0A1S4DWP5 uncharacterized protein LOC103488678 isoform X6 | e value: 2.4e-138 | identity: 89.7%
Query:  MISGKLYPSYSTSCILPPTRTTLI----NLHLPLLKISSTLRLISFQSVSLSFPTFFSSKSTAKSTRFSNSVAKVYSYEGQNPITLSDLEDLSENGVVYK
        MISGKLY S S+SCI PPT T       NLHL  LKISSTLRLISFQSVSLS P+ F+SKS+AKSTRFSNS+ +VYSYEGQN ITLSDL+DLSENGVVYK
Subjt:  MISGKLYPSYSTSCILPPTRTTLI----NLHLPLLKISSTLRLISFQSVSLSFPTFFSSKSTAKSTRFSNSVAKVYSYEGQNPITLSDLEDLSENGVVYK

Query:  KTLAMVECSMFAALNGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTYLLRHGLVGFTMGSLWRLGANWSTSIF
        KTLAMVECSMFAALNGLVYFLSNSLALENYFGCFFCLPIVISSMRWGI+AGRKTMVATFLLLLVLSGPVKALTYLLRHGLVGFTMGSLWRLGANWSTSIF
Subjt:  KTLAMVECSMFAALNGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTYLLRHGLVGFTMGSLWRLGANWSTSIF

Query:  LCTIVRALGAVGYVLISSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTLVLLNSGCFMFLLHLLYSLFLTRLGLKTSLTLPRWLEKA
        LCTIVRA GAVGYVL+SSFLIRENIL+LITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTLV LN GCFMFLLHLLYS+FLTRLGLKTSLTLPRWLEKA
Subjt:  LCTIVRALGAVGYVLISSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTLVLLNSGCFMFLLHLLYSLFLTRLGLKTSLTLPRWLEKA

Query:  M
        M
Subjt:  M

A0A6J1ETG3 uncharacterized protein LOC111435817 | e value: 6.3e-139 | identity: 90.07%
Query:  MISGKLYPSYSTSCILPP-----TRTTLINLHLPLLKISSTLRLISFQSVSLSFPTFFSSKSTAKSTRFSNSVAKVYSYEGQNPITLSDLEDLSENGVVY
        MISG LYPS STSCI PP     + T  ++L LPLLKISS LRLISF+SVSLSFPTF +SKS+ KSTRFSNSVAKVYS+EGQNP +LSDLEDLSENGVVY
Subjt:  MISGKLYPSYSTSCILPP-----TRTTLINLHLPLLKISSTLRLISFQSVSLSFPTFFSSKSTAKSTRFSNSVAKVYSYEGQNPITLSDLEDLSENGVVY

Query:  KKTLAMVECSMFAALNGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTYLLRHGLVGFTMGSLWRLGANWSTSI
        KKTLAMVECSMFAALNGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALT+LLRHGLVG TMGSLWRLGANWSTSI
Subjt:  KKTLAMVECSMFAALNGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTYLLRHGLVGFTMGSLWRLGANWSTSI

Query:  FLCTIVRALGAVGYVLISSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTLVLLNSGCFMFLLHLLYSLFLTRLGLKTSLTLPRWLEK
        FLCTIVRALGAVGYVLISSFLIRENILALITINIHASLTLIFTA GVNLIPSMNAIYAIFGTLV+LN GCFMFLLHLLYS+FLTRLGLKTSLTLPRWLEK
Subjt:  FLCTIVRALGAVGYVLISSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTLVLLNSGCFMFLLHLLYSLFLTRLGLKTSLTLPRWLEK

Query:  AM
        AM
Subjt:  AM

A0A6J1I3S1 uncharacterized protein LOC111468904 | e value: 7.4e-140 | identity: 91.28%
Query:  MISGKLYPSYSTSCILPPTRT-TLINLHLPLLKISSTLRLISFQSVSLSFPTFFSSKSTAKSTRFSNSVAKVYSYEGQNPITLSDLEDLSENGVVYKKTL
        MISG LYPS STSCI PP RT T  ++HLPLLKIS+ LRLISF+SVSLSFPTF +SKS+ KSTRFSNSVAKVYS+EGQNP +LSDLEDLSENGVVYKKTL
Subjt:  MISGKLYPSYSTSCILPPTRT-TLINLHLPLLKISSTLRLISFQSVSLSFPTFFSSKSTAKSTRFSNSVAKVYSYEGQNPITLSDLEDLSENGVVYKKTL

Query:  AMVECSMFAALNGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTYLLRHGLVGFTMGSLWRLGANWSTSIFLCT
        AMVECSMFAALNGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALT+LLRHGLVG TMGSLWRLGANWSTSIFLCT
Subjt:  AMVECSMFAALNGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTYLLRHGLVGFTMGSLWRLGANWSTSIFLCT

Query:  IVRALGAVGYVLISSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTLVLLNSGCFMFLLHLLYSLFLTRLGLKTSLTLPRWLEKAM
        IVRALGAVGYVLISSFLIRENILALITINIHASLTLIFTA GVNLIPSM+AIYAIFGTLV+LN GCFMFLLHLLYS+FLTRLGLKTSLTLPRWLEKAM
Subjt:  IVRALGAVGYVLISSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTLVLLNSGCFMFLLHLLYSLFLTRLGLKTSLTLPRWLEKAM

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G26180.1 unknown protein | e value: 2.0e-81 | identity: 74.63%
Query:  VVYKKTLAMVECSMFAALNGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTYLLRHGLVGFTMGSLWRLGANWS
        VVY+KTL +VEC+MFAA+ GLVYFLSNSLA+ENYFGCFF LPIVISS+RW IA GRKTMVAT +LL +LSGPVKALTY L HGLVG  +GSLW +GA+W 
Subjt:  VVYKKTLAMVECSMFAALNGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTYLLRHGLVGFTMGSLWRLGANWS

Query:  TSIFLCTIVRALGAVGYVLISSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTLVLLNSGCFMFLLHLLYSLFLTRLGLKTSLTLPRW
         SIFLCT+VRALG +GYVL SSFLIRENILA+ITINIHASL+ +FTA G+N++PSM+ IY IFGT++LLNSG F+ LLHLLYS+FLTRLG+K+SL LP W
Subjt:  TSIFLCTIVRALGAVGYVLISSFLIRENILALITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTLVLLNSGCFMFLLHLLYSLFLTRLGLKTSLTLPRW

Query:  LEKAM
        L+KA+
Subjt:  LEKAM


Sequences
CDS sequence
ATGATTTCTGGGAAGCTTTATCCATCCTACTCCACATCATGCATTTTGCCACCAACACGAACAACACTAATCAATCTTCATCTTCCTCTTCTTAAAATCTCTTCCACACT
CAGATTAATTAGCTTTCAATCCGTCTCCCTCTCTTTTCCAACCTTCTTTTCTTCTAAATCCACTGCCAAGTCCACCAGATTTTCGAATTCTGTGGCAAAAGTTTATAGCT
ATGAGGGCCAAAACCCCATTACTTTGTCGGATTTGGAAGACTTGTCCGAGAATGGAGTTGTTTATAAGAAGACACTGGCCATGGTGGAGTGCTCCATGTTCGCTGCACTT
AATGGCTTGGTTTACTTCTTGAGCAATTCACTTGCTCTTGAGAATTACTTTGGCTGTTTCTTCTGTCTACCAATAGTAATCTCTTCAATGAGATGGGGCATAGCAGCTGG
GAGAAAAACCATGGTGGCAACATTCTTGCTGCTGCTGGTTTTGTCTGGTCCAGTGAAAGCTTTAACATATCTGCTTAGGCATGGTTTAGTGGGGTTTACAATGGGCTCCT
TGTGGAGGCTTGGAGCAAATTGGAGTACCTCAATCTTTCTTTGCACAATCGTTCGGGCACTTGGCGCAGTGGGGTATGTCTTAATATCTTCATTCTTGATAAGAGAGAAC
ATACTAGCTCTGATCACTATAAATATTCACGCTTCCCTCACCCTTATCTTCACTGCCTGGGGTGTGAACTTAATTCCATCAATGAATGCAATATATGCTATATTTGGGAC
ACTGGTATTGCTGAACTCTGGATGTTTCATGTTTTTGCTTCATCTTTTGTACTCCTTATTCCTTACTAGACTTGGTTTGAAGACTTCATTGACATTGCCAAGGTGGCTGG
AAAAGGCGATGTAA
mRNA sequence
CAAAAATTGATTGATTTTAAAAAATACCCATTTTTTCATTTATTACATATTTTTCGCCTCGCAATTGGTCCAGTGCAGAGTATAGCCCAACCTCCGGCCCGGAAAAGCCC
AAAAGGGAACAGGTTGAACTTCAGGCCGTCACAATTCGTATACAATCTTCAACGTCTTTGAGTAGTGTTTAATCCTCCATTTTTCTTCCCCATGATTTCTGGGAAGCTTT
ATCCATCCTACTCCACATCATGCATTTTGCCACCAACACGAACAACACTAATCAATCTTCATCTTCCTCTTCTTAAAATCTCTTCCACACTCAGATTAATTAGCTTTCAA
TCCGTCTCCCTCTCTTTTCCAACCTTCTTTTCTTCTAAATCCACTGCCAAGTCCACCAGATTTTCGAATTCTGTGGCAAAAGTTTATAGCTATGAGGGCCAAAACCCCAT
TACTTTGTCGGATTTGGAAGACTTGTCCGAGAATGGAGTTGTTTATAAGAAGACACTGGCCATGGTGGAGTGCTCCATGTTCGCTGCACTTAATGGCTTGGTTTACTTCT
TGAGCAATTCACTTGCTCTTGAGAATTACTTTGGCTGTTTCTTCTGTCTACCAATAGTAATCTCTTCAATGAGATGGGGCATAGCAGCTGGGAGAAAAACCATGGTGGCA
ACATTCTTGCTGCTGCTGGTTTTGTCTGGTCCAGTGAAAGCTTTAACATATCTGCTTAGGCATGGTTTAGTGGGGTTTACAATGGGCTCCTTGTGGAGGCTTGGAGCAAA
TTGGAGTACCTCAATCTTTCTTTGCACAATCGTTCGGGCACTTGGCGCAGTGGGGTATGTCTTAATATCTTCATTCTTGATAAGAGAGAACATACTAGCTCTGATCACTA
TAAATATTCACGCTTCCCTCACCCTTATCTTCACTGCCTGGGGTGTGAACTTAATTCCATCAATGAATGCAATATATGCTATATTTGGGACACTGGTATTGCTGAACTCT
GGATGTTTCATGTTTTTGCTTCATCTTTTGTACTCCTTATTCCTTACTAGACTTGGTTTGAAGACTTCATTGACATTGCCAAGGTGGCTGGAAAAGGCGATGTAAATGCT
CGACGGGTATTAATTTGTCGACGATGATACACCATTTATTTTCTCCTTTTATAGGAGGAAGAAATTGGTCAGGCTGAATGGAATTTTTCATTCATTCACGATTTGCTCTA
GTGTATGTGAGAGGTTACGAATGTATAAAATGGAAGTAATTATTATAGCTTTAGGGAGATCTCAGATTTCTACTAGTGGTTGTCTTGCTGTAAAGTTTCTGAGGAGTTTG
ATCATCCTTTAGGATGCAATGAGTAATTAGCTTTGGAACTTTGCAGACTGGAATATACAATCTGTGATCCGATTTACGATTTCAAATTTGAAATTATCATATTAAGTTTT
CAAGATAAATTTTGTGAAAGCTTAGAAAGGAGTTTATGAGTTTAGTAGCATCTATACTTCTCAGCCTTCTCTCATCAATTTAGGTTGGAGGTCCTTTCTC
Protein sequence
MISGKLYPSYSTSCILPPTRTTLINLHLPLLKISSTLRLISFQSVSLSFPTFFSSKSTAKSTRFSNSVAKVYSYEGQNPITLSDLEDLSENGVVYKKTLAMVECSMFAAL
NGLVYFLSNSLALENYFGCFFCLPIVISSMRWGIAAGRKTMVATFLLLLVLSGPVKALTYLLRHGLVGFTMGSLWRLGANWSTSIFLCTIVRALGAVGYVLISSFLIREN
ILALITINIHASLTLIFTAWGVNLIPSMNAIYAIFGTLVLLNSGCFMFLLHLLYSLFLTRLGLKTSLTLPRWLEKAM
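
The CDS and protein sequences above are related by translation. The following is a minimal sketch of that step, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The function name `translate` and the constants are illustrative and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: NCBI translation table 1 applies to this gene.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First eight codons of the CDS listed above.
print(translate("ATGATTTCTGGGAAGCTTTATCCA"))  # MISGKLYP
# Last three codons of the CDS, including the TAA stop.
print(translate("GCGATGTAA"))  # AM
```

The two fragments reproduce the start (`MISGKLYP`) and end (`AM`) of the protein sequence shown above. Passing the full CDS, with its line breaks, to `translate` should give the complete protein if the record is internally consistent.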