CuGenDBv2

Lsi03G012080 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi03G012080
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Group 2, putative
Genome location: chr03:22617624..22621749
RNA-Seq Expression: Lsi03G012080
Synteny: Lsi03G012080
Gene Ontology terms: NA
InterPro domains: NA
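
The genome location field follows a `chromosome:start..end` pattern. A minimal sketch of splitting it into its parts is shown below; the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr03:22617624..22621749")

# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene model spans 4126 bp on chr03.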


Homology
GenBank top hits (e value, % identity, alignment)
XP_004149613.1 uncharacterized protein LOC101209149 [Cucumis sativus]; e value: 2.6e-125; % identity: 72.37
Query:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNV-HHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGRG
        ME AEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAE S CHSPLPSDTFPN  HHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGRG
Subjt:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNV-HHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGRG

Query:  NGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVK-----------------------
        NG DC EE+EE+G+G EEGYYGK+KRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPII+VK                       
Subjt:  NGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVK-----------------------

Query:  ---------------------------------------GPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKIIR
                                               GPRLYAESG T F+LSVG SNK MYGAGRDMEDKL+SG GLELTIR+NFISNYRVVWK I 
Subjt:  ---------------------------------------GPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKIIR

Query:  PHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS
        PHFHR VQCLL+L K YDR  HTRSFNSTC TS
Subjt:  PHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS

XP_008461795.2 PREDICTED: uncharacterized protein LOC103500312 [Cucumis melo]; e value: 1.8e-126; % identity: 72.46
Query:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNV--HHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGR
        ME +EEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAE S CHSPLPSDTFPN   HHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGR
Subjt:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNV--HHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGR

Query:  GNGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVK----------------------
        G+GRDC EE+EEDG+G EEGYYGK+KRGCWKRYFTYR+SDSNAWICLQLSWRAIFSMGIALLVFY+VTNPPSPIISVK                      
Subjt:  GNGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVK----------------------

Query:  ----------------------------------------GPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKII
                                                GPRLYAESG T F LSVG SNK MYGAGR+MEDKL+SG GLELTIR+NFISNYRVVWK I
Subjt:  ----------------------------------------GPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKII

Query:  RPHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS
         PHFHR VQCLL+LGKAYDRKRHT SFNSTC TS
Subjt:  RPHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS

XP_022152674.1 uncharacterized protein LOC111020336 [Momordica charantia]; e value: 1.5e-117; % identity: 70.27
Query:  MEAA-EEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNVHHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGRG
        MEAA E+QEAVLFHSYPCAYYVQSPST+SHANSSDIRN AESSACHSPL SDTFP  HHHH N TQEASR TLS YSSSR SNHGAGTDNGEARLIVGRG
Subjt:  MEAA-EEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNVHHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGRG

Query:  NGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVK-----------------------
        NGR+ DEE+EEDG G EEGYYGKK+RGCWK YFTYRNSDSNAWI LQLSWRAIFSMGIALLVFYIVT PPSP ISVK                       
Subjt:  NGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVK-----------------------

Query:  ---------------------------------------GPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKIIR
                                               G RLYAESGTTTFQLSVG SN+ MYGAGR MED LESG GLEL IR+NFISNYRVVWKIIR
Subjt:  ---------------------------------------GPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKIIR

Query:  PHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS
        PHF  RV+C L+LGK YDRKRHTRSFNSTCLTS
Subjt:  PHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS

XP_022934269.1 uncharacterized protein LOC111441481 [Cucurbita moschata]; e value: 1.1e-115; % identity: 67.37
Query:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPN--VHHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGR
        M+AAE+QE VLFHSYPCAYYVQSPSTLSHANSSD RNPAESSACHSPLPSDTFPN   HHHHRNPTQEASRFTLSHYSSS GSNHG GTDNGEARL+VG 
Subjt:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPN--VHHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGR

Query:  GNGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVK----------------------
        G+G    EE++E  +  EE YYG+K+RGCWK YFTYRNSDSNAWICLQLSWRA+FSMG+ALLVFYIVTNPP P+ISVK                      
Subjt:  GNGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVK----------------------

Query:  ----------------------------------------GPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKII
                                                GPRLYAESGTTTF+L+VGIS KPMYGAGR++EDKLESG GLELTIR+NFISNYRVVWKII
Subjt:  ----------------------------------------GPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKII

Query:  RPHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS
        +P FHRRV CLL++   YDRKRHTR FNSTCLTS
Subjt:  RPHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS

XP_038905771.1 uncharacterized protein LOC120091726 [Benincasa hispida]; e value: 8.8e-134; % identity: 75.82
Query:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPN---VHHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVG
        MEAAEEQEAVLFHSYPC+YYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPN    HHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGE RLIVG
Subjt:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPN---VHHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVG

Query:  RGNGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVK---------------------
        RGNGRDC+EEQE D DG EEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVK                     
Subjt:  RGNGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVK---------------------

Query:  -----------------------------------------GPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKI
                                                 GPRLYAESGTTTF LSVG SNKPMYGAGRDMEDKLESG GLELTIR+NFISNYRVVWK 
Subjt:  -----------------------------------------GPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKI

Query:  IRPHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS
        IRPHFHR V+CLL+LGKAYDRKRHTRSFNSTCL S
Subjt:  IRPHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LD21 Uncharacterized protein; e value: 1.2e-125; % identity: 72.37
Query:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNV-HHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGRG
        ME AEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAE S CHSPLPSDTFPN  HHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGRG
Subjt:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNV-HHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGRG

Query:  NGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVK-----------------------
        NG DC EE+EE+G+G EEGYYGK+KRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPII+VK                       
Subjt:  NGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVK-----------------------

Query:  ---------------------------------------GPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKIIR
                                               GPRLYAESG T F+LSVG SNK MYGAGRDMEDKL+SG GLELTIR+NFISNYRVVWK I 
Subjt:  ---------------------------------------GPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKIIR

Query:  PHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS
        PHFHR VQCLL+L K YDR  HTRSFNSTC TS
Subjt:  PHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS

A0A1S3CFE9 uncharacterized protein LOC103500312; e value: 8.6e-127; % identity: 72.46
Query:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNV--HHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGR
        ME +EEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAE S CHSPLPSDTFPN   HHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGR
Subjt:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNV--HHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGR

Query:  GNGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVK----------------------
        G+GRDC EE+EEDG+G EEGYYGK+KRGCWKRYFTYR+SDSNAWICLQLSWRAIFSMGIALLVFY+VTNPPSPIISVK                      
Subjt:  GNGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVK----------------------

Query:  ----------------------------------------GPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKII
                                                GPRLYAESG T F LSVG SNK MYGAGR+MEDKL+SG GLELTIR+NFISNYRVVWK I
Subjt:  ----------------------------------------GPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKII

Query:  RPHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS
         PHFHR VQCLL+LGKAYDRKRHT SFNSTC TS
Subjt:  RPHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS

A0A6J1DGR2 uncharacterized protein LOC111020336; e value: 7.3e-118; % identity: 70.27
Query:  MEAA-EEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNVHHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGRG
        MEAA E+QEAVLFHSYPCAYYVQSPST+SHANSSDIRN AESSACHSPL SDTFP  HHHH N TQEASR TLS YSSSR SNHGAGTDNGEARLIVGRG
Subjt:  MEAA-EEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNVHHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGRG

Query:  NGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVK-----------------------
        NGR+ DEE+EEDG G EEGYYGKK+RGCWK YFTYRNSDSNAWI LQLSWRAIFSMGIALLVFYIVT PPSP ISVK                       
Subjt:  NGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVK-----------------------

Query:  ---------------------------------------GPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKIIR
                                               G RLYAESGTTTFQLSVG SN+ MYGAGR MED LESG GLEL IR+NFISNYRVVWKIIR
Subjt:  ---------------------------------------GPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKIIR

Query:  PHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS
        PHF  RV+C L+LGK YDRKRHTRSFNSTCLTS
Subjt:  PHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS

A0A6J1F239 uncharacterized protein LOC111441481; e value: 5.2e-116; % identity: 67.37
Query:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPN--VHHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGR
        M+AAE+QE VLFHSYPCAYYVQSPSTLSHANSSD RNPAESSACHSPLPSDTFPN   HHHHRNPTQEASRFTLSHYSSS GSNHG GTDNGEARL+VG 
Subjt:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPN--VHHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGR

Query:  GNGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVK----------------------
        G+G    EE++E  +  EE YYG+K+RGCWK YFTYRNSDSNAWICLQLSWRA+FSMG+ALLVFYIVTNPP P+ISVK                      
Subjt:  GNGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVK----------------------

Query:  ----------------------------------------GPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKII
                                                GPRLYAESGTTTF+L+VGIS KPMYGAGR++EDKLESG GLELTIR+NFISNYRVVWKII
Subjt:  ----------------------------------------GPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKII

Query:  RPHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS
        +P FHRRV CLL++   YDRKRHTR FNSTCLTS
Subjt:  RPHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS

A0A6J1J6W9 uncharacterized protein LOC111481909; e value: 1.1e-113; % identity: 66.77
Query:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNVH--HHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGR
        M+AAE+QE VLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPN    HHHRN TQEASRFTLSHYSSS GSNHG GTDNGEARL+VG 
Subjt:  MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNVH--HHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGR

Query:  GNGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISV-----------------------
        G+G    EE+ E  +  EE YYGKK+RGCWK YFTYRNSD+NAWICLQLSWRA+FSMG+ALLVFYIVTNPP PIISV                       
Subjt:  GNGRDCDEEQEEDGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISV-----------------------

Query:  ---------------------------------------KGPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKII
                                               +GPRLYAESGTTTF+L+VG S KPMYGAGR++EDKLESG GLELTIR+NFISNYRVVWKII
Subjt:  ---------------------------------------KGPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKII

Query:  RPHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS
        +P FHR V CLL++  AYDRKRHTR FNSTCLTS
Subjt:  RPHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTS

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT3G08490.1 BEST Arabidopsis thaliana protein match is: Late embryogenesis abundant protein, group 2 (TAIR:AT3G24600.1); e value: 1.2e-24; % identity: 35.38
Query:  KRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIIS------------------------------------------------------
        KR      S+S+ WI LQ+ WR +FS+G+ALLVFYI T PP P IS                                                      
Subjt:  KRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIIS------------------------------------------------------

Query:  --------VKGPRLYAES-GTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKIIRPHFHRRVQCLLILGKAYDRKRHT
                 +GP+LY  S  +TTFQL +  +N+ MYGAG +M D L S  GL L +R + IS+YRVVW II P +H +V+CLL+L    D++RH+
Subjt:  --------VKGPRLYAES-GTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELTIRVNFISNYRVVWKIIRPHFHRRVQCLLILGKAYDRKRHT


Sequences
CDS sequence
ATGGAGGCTGCCGAGGAGCAAGAAGCCGTTCTCTTCCACTCCTATCCATGTGCTTATTACGTACAAAGCCCCTCTACCCTCTCCCACGCCAACAGCTCCGACATCCGAAA
CCCCGCCGAGTCCTCGGCTTGCCACTCGCCTCTTCCCTCAGACACTTTTCCCAACGTCCACCACCACCACCGCAACCCGACTCAAGAAGCCTCTCGCTTCACTCTCTCCC
ACTATTCATCCTCCCGTGGCTCAAACCATGGGGCCGGGACCGACAATGGCGAGGCTCGCTTGATAGTTGGTCGTGGCAATGGTCGAGATTGTGACGAGGAGCAGGAGGAG
GATGGGGACGGAGGCGAGGAAGGGTATTATGGGAAGAAAAAAAGAGGTTGTTGGAAGAGGTATTTTACGTATAGGAATTCGGATTCTAATGCATGGATTTGCTTGCAGTT
GAGTTGGAGGGCAATTTTCAGTATGGGAATTGCTTTGCTTGTGTTTTACATTGTCACTAACCCTCCTTCACCAATCATTTCTGTTAAGGGTCCAAGATTGTATGCTGAGA
GTGGAACGACGACGTTTCAATTAAGCGTGGGCATTAGCAACAAGCCGATGTACGGTGCGGGAAGGGACATGGAAGACAAGCTTGAATCAGGAACGGGATTGGAGCTTACA
ATTCGAGTCAATTTCATTTCAAATTATAGAGTAGTTTGGAAAATCATAAGGCCCCACTTTCATCGTCGTGTCCAATGCTTATTGATCCTTGGAAAAGCCTACGATAGGAA
GCGTCACACCCGATCCTTCAATAGTACTTGCTTAACTTCTTCATGA
mRNA sequence
AGGCCTCCAATTATTTCTCTTTCCCATGTCAGAGAAATCATCTCTCCGCCGCCGTGATGGAGGCTGCCGAGGAGCAAGAAGCCGTTCTCTTCCACTCCTATCCATGTGCT
TATTACGTACAAAGCCCCTCTACCCTCTCCCACGCCAACAGCTCCGACATCCGAAACCCCGCCGAGTCCTCGGCTTGCCACTCGCCTCTTCCCTCAGACACTTTTCCCAA
CGTCCACCACCACCACCGCAACCCGACTCAAGAAGCCTCTCGCTTCACTCTCTCCCACTATTCATCCTCCCGTGGCTCAAACCATGGGGCCGGGACCGACAATGGCGAGG
CTCGCTTGATAGTTGGTCGTGGCAATGGTCGAGATTGTGACGAGGAGCAGGAGGAGGATGGGGACGGAGGCGAGGAAGGGTATTATGGGAAGAAAAAAAGAGGTTGTTGG
AAGAGGTATTTTACGTATAGGAATTCGGATTCTAATGCATGGATTTGCTTGCAGTTGAGTTGGAGGGCAATTTTCAGTATGGGAATTGCTTTGCTTGTGTTTTACATTGT
CACTAACCCTCCTTCACCAATCATTTCTGTTAAGGGTCCAAGATTGTATGCTGAGAGTGGAACGACGACGTTTCAATTAAGCGTGGGCATTAGCAACAAGCCGATGTACG
GTGCGGGAAGGGACATGGAAGACAAGCTTGAATCAGGAACGGGATTGGAGCTTACAATTCGAGTCAATTTCATTTCAAATTATAGAGTAGTTTGGAAAATCATAAGGCCC
CACTTTCATCGTCGTGTCCAATGCTTATTGATCCTTGGAAAAGCCTACGATAGGAAGCGTCACACCCGATCCTTCAATAGTACTTGCTTAACTTCTTCATGATCATTGTC
AACCAACAAATTTTACTTCTCAATGAT
Protein sequence
MEAAEEQEAVLFHSYPCAYYVQSPSTLSHANSSDIRNPAESSACHSPLPSDTFPNVHHHHRNPTQEASRFTLSHYSSSRGSNHGAGTDNGEARLIVGRGNGRDCDEEQEE
DGDGGEEGYYGKKKRGCWKRYFTYRNSDSNAWICLQLSWRAIFSMGIALLVFYIVTNPPSPIISVKGPRLYAESGTTTFQLSVGISNKPMYGAGRDMEDKLESGTGLELT
IRVNFISNYRVVWKIIRPHFHRRVQCLLILGKAYDRKRHTRSFNSTCLTSS
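
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code; the helper `translate` and the codon-table construction are illustrative, not part of the database. To keep the example short it translates only the first 30 and last 15 nucleotides of the CDS, copied from this record.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 15 nt of the CDS shown in this record.
cds_start = "ATGGAGGCTGCCGAGGAGCAAGAAGCCGTT"
cds_end = "TTAACTTCTTCATGA"

print(translate(cds_start))  # MEAAEEQEAV
print(translate(cds_end))    # LTSS
```

Both fragments agree with the start (`MEAAEEQEAV`) and end (`LTSS`) of the protein sequence above. Passing the full CDS, with its line breaks intact, to the same function would check the whole translation.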