; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi02G024910 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi02G024910
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionUnknown protein
Genome locationchr02:31374683..31378479
RNA-Seq ExpressionLsi02G024910
SyntenyLsi02G024910
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046709.1 putative transmembrane protein [Cucumis melo var. makuwa]2.5e-10877.55Show/hide
Query:  LGDSWVRFFGCPFDVLEIDF----------------------LKGYEIEYMALRERDLSFDLESGGKIVEEVGSVEPSSIKRDVKNIWSRLTEDSLLKDE
        L DSWVRFFG PFDVLEIDF                       KGYEIEYMALRERDL+FDLE+GGKIVEE GS EPSS KRDVKNIWSRLTEDSLLKDE
Subjt:  LGDSWVRFFGCPFDVLEIDF----------------------LKGYEIEYMALRERDLSFDLESGGKIVEEVGSVEPSSIKRDVKNIWSRLTEDSLLKDE

Query:  RLIASNSNFANSVADIVADENIEFLIDKNLEGEDDHEVFAHMEKNNARGKHKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERMKALKKA
        R IASNSNFAN+VADI+ADE++  LIDKNLEGEDDHEVFAHMEK+NARGKHKNKKKA KPPRPPKGPSLDAADRMMVKE+AVLAMKKRAR ERMKALKKA
Subjt:  RLIASNSNFANSVADIVADENIEFLIDKNLEGEDDHEVFAHMEKNNARGKHKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERMKALKKA

Query:  KAEKTSSFNSCIPALIITFLFFLVIIIQASAISELRFGKGICIVGSFKSGISSRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSNPSS
        KAEKTSSFNSCIPALIITFLFFLVIIIQ                     GIS RSSSILQGSPEPAVGGSSGFISVQYIKSFPP+ES+VSNP S
Subjt:  KAEKTSSFNSCIPALIITFLFFLVIIIQASAISELRFGKGICIVGSFKSGISSRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSNPSS

TYK18245.1 putative transmembrane protein [Cucumis melo var. makuwa]2.7e-10777.66Show/hide
Query:  LGDSWVRFFGCPFDVLEIDF----------------------LKGYEIEYMALRERDLSFDLESGGKIVEEVGSVEPSSIKRDVKNIWSRLTEDSLLKDE
        L DSWVRFFG PFDVLEIDF                       KGYEIEYMALRERDL+FDLE+GGKIVEE GS EPSS KRDVKNIWSRLTEDSLLKDE
Subjt:  LGDSWVRFFGCPFDVLEIDF----------------------LKGYEIEYMALRERDLSFDLESGGKIVEEVGSVEPSSIKRDVKNIWSRLTEDSLLKDE

Query:  RLIASNSNFANSVADIVADENIEFLIDKNLEGEDDHEVFAHMEKNNARGKHKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERMKALKKA
        R IASNSNFAN+VADI+ADE++  LIDKNLEGEDDHEVFAHMEK+NARGKHKNKKKA KPPRPPKGPSLDAADRMMVKE+AVLAMKKRAR ERMKALKKA
Subjt:  RLIASNSNFANSVADIVADENIEFLIDKNLEGEDDHEVFAHMEKNNARGKHKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERMKALKKA

Query:  KAEKTSSFNSCIPALIITFLFFLVIIIQASAISELRFGKGICIVGSFKSGISSRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSN
        KAEKTSSFNSCIPALIITFLFFLVIIIQ                     GIS RSSSILQGSPEPAVGGSSGFISVQYIKSFPP+ES+VSN
Subjt:  KAEKTSSFNSCIPALIITFLFFLVIIIQASAISELRFGKGICIVGSFKSGISSRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSN

XP_008451449.1 PREDICTED: uncharacterized protein LOC103492741 [Cucumis melo]2.8e-9682.79Show/hide
Query:  MALRERDLSFDLESGGKIVEEVGSVEPSSIKRDVKNIWSRLTEDSLLKDERLIASNSNFANSVADIVADENIEFLIDKNLEGEDDHEVFAHMEKNNARGK
        MALRERDL+FDLE+GGKIVEE GS EPSS KRDVKNIWSRLTEDSLLKDER IASNSNFAN+VADI+ADE++  LI+KNLEGEDDHEVFAHMEK+NARGK
Subjt:  MALRERDLSFDLESGGKIVEEVGSVEPSSIKRDVKNIWSRLTEDSLLKDERLIASNSNFANSVADIVADENIEFLIDKNLEGEDDHEVFAHMEKNNARGK

Query:  HKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQASAISELRFGKGICIVGSFKSG
        HKNKKKA KPPRPPKGPSLDAADRMMVKE+AVLAMKKRAR ERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQ                     G
Subjt:  HKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQASAISELRFGKGICIVGSFKSG

Query:  ISSRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSNPSS
        IS RSSSILQGSPEPAVGGSSGFISVQYIKSFPP+ES+VSNP S
Subjt:  ISSRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSNPSS

XP_031744965.1 uncharacterized protein LOC101218752 [Cucumis sativus]2.6e-9782.38Show/hide
Query:  MALRERDLSFDLESGGKIVEEVGSVEPSSIKRDVKNIWSRLTEDSLLKDERLIASNSNFANSVADIVADENIEFLIDKNLEGEDDHEVFAHMEKNNARGK
        MALRERDL FDLE+GGKI+EEVGS EPSS KRDVKN WSRLTEDSLLKDER IASNSNFANS+ D++ADE++  LIDKNLEGED HEVF H+EKNNARGK
Subjt:  MALRERDLSFDLESGGKIVEEVGSVEPSSIKRDVKNIWSRLTEDSLLKDERLIASNSNFANSVADIVADENIEFLIDKNLEGEDDHEVFAHMEKNNARGK

Query:  HKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQASAISELRFGKGICIVGSFKSG
        HKNKKKAPKPPRPPKGPSLDAADRMMVKE+AVLAMKKRAR ERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQ                     G
Subjt:  HKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQASAISELRFGKGICIVGSFKSG

Query:  ISSRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSNPSS
        IS RSSSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSNP S
Subjt:  ISSRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSNPSS

XP_038898210.1 uncharacterized protein LOC120085950 [Benincasa hispida]2.7e-10286.07Show/hide
Query:  MALRERDLSFDLESGGKIVEEVGSVEPSSIKRDVKNIWSRLTEDSLLKDERLIASNSNFANSVADIVADENIEFLIDKNLEGEDDHEVFAHMEKNNARGK
        MALRERDL+ DLESGGKIV+EVGSVEPSSIKRDVKNIWSRLT+DSLLKDERL+ASNSNFANSVA+I+ADENIE LIDKNLEGEDDHEVF HMEKNNARGK
Subjt:  MALRERDLSFDLESGGKIVEEVGSVEPSSIKRDVKNIWSRLTEDSLLKDERLIASNSNFANSVADIVADENIEFLIDKNLEGEDDHEVFAHMEKNNARGK

Query:  HKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQASAISELRFGKGICIVGSFKSG
        HKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRAR ERMKALKKAKAEKTSSFNSCIPA+IITFLFFLVIIIQ                     G
Subjt:  HKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQASAISELRFGKGICIVGSFKSG

Query:  ISSRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSNPSS
        ISSRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNESN+SNP S
Subjt:  ISSRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSNPSS

TrEMBL top hitse value%identityAlignment
A0A0A0K5U6 Uncharacterized protein4.4e-10385.25Show/hide
Query:  MALRERDLSFDLESGGKIVEEVGSVEPSSIKRDVKNIWSRLTEDSLLKDERLIASNSNFANSVADIVADENIEFLIDKNLEGEDDHEVFAHMEKNNARGK
        MALRERDL FDLE+GGKI+EEVGS EPSS KRDVKN WSRLTEDSLLKDER IASNSNFANS+ D++ADE++  LIDKNLEGED HEVF H+EKNNARGK
Subjt:  MALRERDLSFDLESGGKIVEEVGSVEPSSIKRDVKNIWSRLTEDSLLKDERLIASNSNFANSVADIVADENIEFLIDKNLEGEDDHEVFAHMEKNNARGK

Query:  HKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQASAISELRFGKGICIVGSFKSG
        HKNKKKAPKPPRPPKGPSLDAADRMMVKE+AVLAMKKRAR ERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQASA+SEL+  K   ++     G
Subjt:  HKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQASAISELRFGKGICIVGSFKSG

Query:  ISSRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSNPSS
        IS RSSSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSNP S
Subjt:  ISSRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSNPSS

A0A1S4DYQ5 uncharacterized protein LOC1034927411.4e-9682.79Show/hide
Query:  MALRERDLSFDLESGGKIVEEVGSVEPSSIKRDVKNIWSRLTEDSLLKDERLIASNSNFANSVADIVADENIEFLIDKNLEGEDDHEVFAHMEKNNARGK
        MALRERDL+FDLE+GGKIVEE GS EPSS KRDVKNIWSRLTEDSLLKDER IASNSNFAN+VADI+ADE++  LI+KNLEGEDDHEVFAHMEK+NARGK
Subjt:  MALRERDLSFDLESGGKIVEEVGSVEPSSIKRDVKNIWSRLTEDSLLKDERLIASNSNFANSVADIVADENIEFLIDKNLEGEDDHEVFAHMEKNNARGK

Query:  HKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQASAISELRFGKGICIVGSFKSG
        HKNKKKA KPPRPPKGPSLDAADRMMVKE+AVLAMKKRAR ERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQ                     G
Subjt:  HKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQASAISELRFGKGICIVGSFKSG

Query:  ISSRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSNPSS
        IS RSSSILQGSPEPAVGGSSGFISVQYIKSFPP+ES+VSNP S
Subjt:  ISSRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSNPSS

A0A5A7TZQ1 Putative transmembrane protein1.2e-10877.55Show/hide
Query:  LGDSWVRFFGCPFDVLEIDF----------------------LKGYEIEYMALRERDLSFDLESGGKIVEEVGSVEPSSIKRDVKNIWSRLTEDSLLKDE
        L DSWVRFFG PFDVLEIDF                       KGYEIEYMALRERDL+FDLE+GGKIVEE GS EPSS KRDVKNIWSRLTEDSLLKDE
Subjt:  LGDSWVRFFGCPFDVLEIDF----------------------LKGYEIEYMALRERDLSFDLESGGKIVEEVGSVEPSSIKRDVKNIWSRLTEDSLLKDE

Query:  RLIASNSNFANSVADIVADENIEFLIDKNLEGEDDHEVFAHMEKNNARGKHKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERMKALKKA
        R IASNSNFAN+VADI+ADE++  LIDKNLEGEDDHEVFAHMEK+NARGKHKNKKKA KPPRPPKGPSLDAADRMMVKE+AVLAMKKRAR ERMKALKKA
Subjt:  RLIASNSNFANSVADIVADENIEFLIDKNLEGEDDHEVFAHMEKNNARGKHKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERMKALKKA

Query:  KAEKTSSFNSCIPALIITFLFFLVIIIQASAISELRFGKGICIVGSFKSGISSRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSNPSS
        KAEKTSSFNSCIPALIITFLFFLVIIIQ                     GIS RSSSILQGSPEPAVGGSSGFISVQYIKSFPP+ES+VSNP S
Subjt:  KAEKTSSFNSCIPALIITFLFFLVIIIQASAISELRFGKGICIVGSFKSGISSRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSNPSS

A0A5D3D3X4 Putative transmembrane protein1.3e-10777.66Show/hide
Query:  LGDSWVRFFGCPFDVLEIDF----------------------LKGYEIEYMALRERDLSFDLESGGKIVEEVGSVEPSSIKRDVKNIWSRLTEDSLLKDE
        L DSWVRFFG PFDVLEIDF                       KGYEIEYMALRERDL+FDLE+GGKIVEE GS EPSS KRDVKNIWSRLTEDSLLKDE
Subjt:  LGDSWVRFFGCPFDVLEIDF----------------------LKGYEIEYMALRERDLSFDLESGGKIVEEVGSVEPSSIKRDVKNIWSRLTEDSLLKDE

Query:  RLIASNSNFANSVADIVADENIEFLIDKNLEGEDDHEVFAHMEKNNARGKHKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERMKALKKA
        R IASNSNFAN+VADI+ADE++  LIDKNLEGEDDHEVFAHMEK+NARGKHKNKKKA KPPRPPKGPSLDAADRMMVKE+AVLAMKKRAR ERMKALKKA
Subjt:  RLIASNSNFANSVADIVADENIEFLIDKNLEGEDDHEVFAHMEKNNARGKHKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERMKALKKA

Query:  KAEKTSSFNSCIPALIITFLFFLVIIIQASAISELRFGKGICIVGSFKSGISSRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSN
        KAEKTSSFNSCIPALIITFLFFLVIIIQ                     GIS RSSSILQGSPEPAVGGSSGFISVQYIKSFPP+ES+VSN
Subjt:  KAEKTSSFNSCIPALIITFLFFLVIIIQASAISELRFGKGICIVGSFKSGISSRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSN

A0A6J1JVV1 uncharacterized protein LOC1114883453.2e-9379.34Show/hide
Query:  MALRERDLSFDLESGGKIVEEVGSVEPSSIKRDVKNIWSRLTEDSLLKDERLIASNSNFANSVADIVADENIEFLIDKNLEGEDDHEVFAHMEKNNARGK
        MALRERDL+FDLESGG+I EEVGSVEPSSIKRDVKNIW+RLTED+LLKDE ++ASNSNFANSVAD+VAD N+E LIDKNLEGEDD E FAH+EK NARGK
Subjt:  MALRERDLSFDLESGGKIVEEVGSVEPSSIKRDVKNIWSRLTEDSLLKDERLIASNSNFANSVADIVADENIEFLIDKNLEGEDDHEVFAHMEKNNARGK

Query:  HKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQASAISELRFGKGICIVGSFKSG
        HKNKKKA KPPRPPKGPSLDAADR +VKEIAV+AMKKRARVERMKAL+K+KAEKTSSFNSCIPA+IITFLFFLVII+Q                     G
Subjt:  HKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQASAISELRFGKGICIVGSFKSG

Query:  ISSRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSNP
        ISSRSS +LQGSPEPAV GSSGFISVQYIKSFPPNESN++NP
Subjt:  ISSRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSNP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G02380.1 unknown protein2.8e-1737.44Show/hide
Query:  ASNSNFANSVADIVADENIEFLIDKNLEGEDDHEVFAHMEKNNARGKHKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERM-KALKKAKA
        +  +NF   V++ +AD+    LI      E   +     EK    GK K  +KA KPPRPPKGPSL   DR ++++I  LAM+KRAR+ERM K+LK+ KA
Subjt:  ASNSNFANSVADIVADENIEFLIDKNLEGEDDHEVFAHMEKNNARGKHKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERM-KALKKAKA

Query:  EKTSSFNSCIP--ALIITFLFFLVIIIQASAISELRFGKGICIVGSFKSGISSRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSNPSSS
         KTS  + CI   ++IIT +FF  ++ Q                  F +G SS +S     SP P V  ++  ISVQ+   F P E    +P++S
Subjt:  EKTSSFNSCIP--ALIITFLFFLVIIIQASAISELRFGKGICIVGSFKSGISSRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSNPSSS

AT3G17120.1 unknown protein2.6e-1540Show/hide
Query:  KHKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERMKALKKAKAEKTSSFNSC---IPALIITFLFFLVIIIQASAISELRFGKGICIVGS
        K K KK A KPPRPP+GPSLDAAD+ +++EIA LAM KRAR+ERM+ALKK++A K +S  S    + A + T +FF V++ Q               +  
Subjt:  KHKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERMKALKKAKAEKTSSFNSC---IPALIITFLFFLVIIIQASAISELRFGKGICIVGS

Query:  FKSGISSRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSNPSSSK
          +G S +S  ++ G        + GF+SVQY            NPS+S+
Subjt:  FKSGISSRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSNPSSSK

AT3G17120.2 unknown protein2.6e-1540Show/hide
Query:  KHKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERMKALKKAKAEKTSSFNSC---IPALIITFLFFLVIIIQASAISELRFGKGICIVGS
        K K KK A KPPRPP+GPSLDAAD+ +++EIA LAM KRAR+ERM+ALKK++A K +S  S    + A + T +FF V++ Q               +  
Subjt:  KHKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERMKALKKAKAEKTSSFNSC---IPALIITFLFFLVIIIQASAISELRFGKGICIVGS

Query:  FKSGISSRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSNPSSSK
          +G S +S  ++ G        + GF+SVQY            NPS+S+
Subjt:  FKSGISSRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSNPSSSK

AT4G01960.1 unknown protein1.5e-1531.98Show/hide
Query:  ERDLSFDLESG-----GKIVEEVGSVEPSSIKRDVKNIWS-RLTEDSLLKDERLIASNSNFANSVADIVADENIEFLIDKNLEGEDDHEVFAHMEKNNAR
        E DL  D+E G      +I   + S E  S +  + ++WS RL+ D   K                   AD+ ++ L+ +    E   +     +     
Subjt:  ERDLSFDLESG-----GKIVEEVGSVEPSSIKRDVKNIWS-RLTEDSLLKDERLIASNSNFANSVADIVADENIEFLIDKNLEGEDDHEVFAHMEKNNAR

Query:  GKHKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQASAISELRFGKGICIVGSFK
         K K  +K  KPPRPPKGP L A D+ +++EI  LAM+KRAR+ERMK L++ KA K+SS  S I A+I+T +FF+ +I Q                G F 
Subjt:  GKHKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQASAISELRFGKGICIVGSFK

Query:  SGISSRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSNPSSS
        S  S  S +    SP P    ++  +SVQ+   F P E    +P++S
Subjt:  SGISSRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSNPSSS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGGCCGACGATTTCAAAATCAAATTTCATGGGAATTATTGTTTCCATTGATGATCAGAAGAACAGTACCGAAATCGATTTGGAATTTGTCGGTCTGAGCAGGAAAAC
GCTTGGAGATTTGGGAGACTCATGGGTTAGATTCTTTGGATGTCCCTTTGATGTCCTTGAGATTGACTTTCTGAAAGGCTATGAGATTGAGTACATGGCTTTAAGAGAAA
GGGATCTCTCATTTGATCTCGAAAGTGGGGGGAAGATCGTTGAAGAGGTTGGAAGTGTGGAACCAAGTTCAATCAAAAGAGATGTAAAGAATATTTGGAGTAGGTTGACA
GAGGATTCACTGCTAAAAGATGAGCGACTCATAGCCTCAAACAGTAATTTTGCTAATTCTGTTGCCGACATCGTTGCTGATGAGAACATAGAATTCTTGATAGATAAGAA
TTTGGAAGGGGAAGATGATCATGAAGTTTTTGCGCATATGGAGAAAAATAATGCTAGAGGGAAGCATAAGAATAAGAAAAAGGCTCCAAAGCCACCACGGCCGCCCAAAG
GTCCTTCACTTGATGCCGCTGACCGAATGATGGTGAAGGAAATTGCAGTGCTTGCCATGAAAAAACGTGCAAGAGTTGAGCGAATGAAAGCATTGAAGAAGGCAAAAGCA
GAGAAAACATCCTCTTTCAATAGTTGCATACCTGCCTTGATTATCACATTCCTCTTCTTCCTTGTAATCATCATTCAAGCCTCTGCTATATCTGAGCTGCGGTTTGGGAA
GGGCATTTGTATTGTTGGGTCCTTCAAATCTGGAATAAGCTCCAGAAGCAGTTCGATATTGCAGGGGTCGCCTGAACCTGCTGTTGGTGGTAGTAGCGGTTTCATATCCG
TTCAGTACATTAAAAGCTTTCCCCCGAATGAAAGCAATGTATCCAATCCCTCCTCGTCTAAAGTTATGATGATATTATTGGCTGATGCTCATCCCTCTACACTTTTGGAC
ATATCCAATCCCGTTCATCGAGGACAAACAGAGTTGAGAGCTTCAACAAGACAGACTGATCCTGTGGTGAAGCTATGTCTAGCTGCCTGCGATGTGCTGAGTTGTAACCG
CTACGACGATCCAATGCTCAATGGGGCATCTTATGTCGAAGGAGTACAAGATTGCAATTCCGTGGATGTAGCTCGGAAGGTCGAGGCTTGCAAAGGAGCCATTCAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCGGCCGACGATTTCAAAATCAAATTTCATGGGAATTATTGTTTCCATTGATGATCAGAAGAACAGTACCGAAATCGATTTGGAATTTGTCGGTCTGAGCAGGAAAAC
GCTTGGAGATTTGGGAGACTCATGGGTTAGATTCTTTGGATGTCCCTTTGATGTCCTTGAGATTGACTTTCTGAAAGGCTATGAGATTGAGTACATGGCTTTAAGAGAAA
GGGATCTCTCATTTGATCTCGAAAGTGGGGGGAAGATCGTTGAAGAGGTTGGAAGTGTGGAACCAAGTTCAATCAAAAGAGATGTAAAGAATATTTGGAGTAGGTTGACA
GAGGATTCACTGCTAAAAGATGAGCGACTCATAGCCTCAAACAGTAATTTTGCTAATTCTGTTGCCGACATCGTTGCTGATGAGAACATAGAATTCTTGATAGATAAGAA
TTTGGAAGGGGAAGATGATCATGAAGTTTTTGCGCATATGGAGAAAAATAATGCTAGAGGGAAGCATAAGAATAAGAAAAAGGCTCCAAAGCCACCACGGCCGCCCAAAG
GTCCTTCACTTGATGCCGCTGACCGAATGATGGTGAAGGAAATTGCAGTGCTTGCCATGAAAAAACGTGCAAGAGTTGAGCGAATGAAAGCATTGAAGAAGGCAAAAGCA
GAGAAAACATCCTCTTTCAATAGTTGCATACCTGCCTTGATTATCACATTCCTCTTCTTCCTTGTAATCATCATTCAAGCCTCTGCTATATCTGAGCTGCGGTTTGGGAA
GGGCATTTGTATTGTTGGGTCCTTCAAATCTGGAATAAGCTCCAGAAGCAGTTCGATATTGCAGGGGTCGCCTGAACCTGCTGTTGGTGGTAGTAGCGGTTTCATATCCG
TTCAGTACATTAAAAGCTTTCCCCCGAATGAAAGCAATGTATCCAATCCCTCCTCGTCTAAAGTTATGATGATATTATTGGCTGATGCTCATCCCTCTACACTTTTGGAC
ATATCCAATCCCGTTCATCGAGGACAAACAGAGTTGAGAGCTTCAACAAGACAGACTGATCCTGTGGTGAAGCTATGTCTAGCTGCCTGCGATGTGCTGAGTTGTAACCG
CTACGACGATCCAATGCTCAATGGGGCATCTTATGTCGAAGGAGTACAAGATTGCAATTCCGTGGATGTAGCTCGGAAGGTCGAGGCTTGCAAAGGAGCCATTCAGTAA
Protein sequenceShow/hide protein sequence
MRPTISKSNFMGIIVSIDDQKNSTEIDLEFVGLSRKTLGDLGDSWVRFFGCPFDVLEIDFLKGYEIEYMALRERDLSFDLESGGKIVEEVGSVEPSSIKRDVKNIWSRLT
EDSLLKDERLIASNSNFANSVADIVADENIEFLIDKNLEGEDDHEVFAHMEKNNARGKHKNKKKAPKPPRPPKGPSLDAADRMMVKEIAVLAMKKRARVERMKALKKAKA
EKTSSFNSCIPALIITFLFFLVIIIQASAISELRFGKGICIVGSFKSGISSRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSNPSSSKVMMILLADAHPSTLLD
ISNPVHRGQTELRASTRQTDPVVKLCLAACDVLSCNRYDDPMLNGASYVEGVQDCNSVDVARKVEACKGAIQ