; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10018664 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10018664
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPlant basic secretory protein (BSP) family protein
Genome locationChr04:6570135..6571548
RNA-Seq ExpressionHG10018664
SyntenyHG10018664
Gene Ontology termsNA
InterPro domainsIPR007541 - Uncharacterised protein family, basic secretory protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8645853.1 hypothetical protein Csa_017343 [Cucumis sativus]1.3e-10781.4Show/hide
Query:  MASNNLIFF-LLPSLALLQSVFAVEYTVANNAGGTPGGIRFDNVIGANYSRQTLVAATALIWNIFQQSTAIDRKNVQKISLFIDKNLDGVLAFTTNNEIH
        M SNNLIFF LL  LALL+S+ AVEYTV NNA GTPGG RFD++IGANYSRQTLVAATALIWNIFQQSTA DRK+V+KISLFIDKNLDGV A   NNEIH
Subjt:  MASNNLIFF-LLPSLALLQSVFAVEYTVANNAGGTPGGIRFDNVIGANYSRQTLVAATALIWNIFQQSTAIDRKNVQKISLFIDKNLDGVLAFTTNNEIH

Query:  VSANYISSYNGDLKREITGVLYHEITHIWQWSGNLTAPGGLIEGIADYVRLKSGYIPGHWAEPGDGKRWDEGYDVTARFLDYLE-GVKSGLVAKLNRRLR
        VSANYISSY+GDLKREITG+LYHE+T+IWQW+GNL AP GLIEGIADYVRLKSGYIPG W EPG G  WDEGYDVTARFLDYLE  V+SGLVA+LNR++R
Subjt:  VSANYISSYNGDLKREITGVLYHEITHIWQWSGNLTAPGGLIEGIADYVRLKSGYIPGHWAEPGDGKRWDEGYDVTARFLDYLE-GVKSGLVAKLNRRLR

Query:  NGYSADYFRQLLGKPVDQLWAEYKTKAKYRNIDKKCNSFAIE
        NGYS DYFRQL+GKPVD+LWAEYKTKAK+ N+D KCNSF IE
Subjt:  NGYSADYFRQLLGKPVDQLWAEYKTKAKYRNIDKKCNSFAIE

XP_011658890.1 uncharacterized protein LOC105434420 [Cucumis sativus]3.9e-11281.67Show/hide
Query:  MASNNLIFF-LLPSLALLQSVFAVEYTVANNAGGTPGGIRFDNVIGANYSRQTLVAATALIWNIFQQSTAIDRKNVQKISLFIDKNLDGVLAFTTNNEIH
        M SNNLIFF LL  LALL+S+ AVEYTV NNA GTPGG RFD++IGANYSRQTLVAATALIWNIFQQSTA DRK+V+KISLFIDKNLDGV A   NNEIH
Subjt:  MASNNLIFF-LLPSLALLQSVFAVEYTVANNAGGTPGGIRFDNVIGANYSRQTLVAATALIWNIFQQSTAIDRKNVQKISLFIDKNLDGVLAFTTNNEIH

Query:  VSANYISSYNGDLKREITGVLYHEITHIWQWSGNLTAPGGLIEGIADYVRLKSGYIPGHWAEPGDGKRWDEGYDVTARFLDYLE-GVKSGLVAKLNRRLR
        VSANYISSY+GDLKREITG+LYHE+T+IWQW+GNL AP GLIEGIADYVRLKSGYIPG W EPG G  WDEGYDVTARFLDYLE  V+SGLVA+LNR++R
Subjt:  VSANYISSYNGDLKREITGVLYHEITHIWQWSGNLTAPGGLIEGIADYVRLKSGYIPGHWAEPGDGKRWDEGYDVTARFLDYLE-GVKSGLVAKLNRRLR

Query:  NGYSADYFRQLLGKPVDQLWAEYKTKAKYRNIDKKCNSFAIEVNGVFALQL
        NGYS DYFRQL+GKPVD+LWAEYKTKAK+ N+D KCNSF IEV+GVFALQL
Subjt:  NGYSADYFRQLLGKPVDQLWAEYKTKAKYRNIDKKCNSFAIEVNGVFALQL

XP_038888417.1 uncharacterized protein LOC120078264 [Benincasa hispida]9.6e-11984.62Show/hide
Query:  MASNNLIFFLLPSLALLQSVFAVEYTVANNAGGTPGGIRFDNVIGANYSRQTLVAATALIWNIFQQSTAIDRKNVQKISLFIDKNLDGVLAFTTNNEIHV
        MASNNLIFFLLPSLALLQS+ AVEYTV NNA GTPGG RFDN+IGANYSRQTL AAT LIWNIF+QSTA DRK+VQKISLFIDKN +GV AFTTN+EIH+
Subjt:  MASNNLIFFLLPSLALLQSVFAVEYTVANNAGGTPGGIRFDNVIGANYSRQTLVAATALIWNIFQQSTAIDRKNVQKISLFIDKNLDGVLAFTTNNEIHV

Query:  SANYISSYNGDLKREITGVLYHEITHIWQWSGNLTAPGGLIEGIADYVRLKSGYIPGHWAEPGDGKRWDEGYDVTARFLDYLEGVKSGLVAKLNRRLRNG
         ANYISSY+GDLKREITGVLYHE+T+IWQWSGNL AP GLIEGIADY+RLKSGYIPG W EPG G RWDEGYDVTARFLDYLEGV+SGLVA LNRRLRNG
Subjt:  SANYISSYNGDLKREITGVLYHEITHIWQWSGNLTAPGGLIEGIADYVRLKSGYIPGHWAEPGDGKRWDEGYDVTARFLDYLEGVKSGLVAKLNRRLRNG

Query:  YSADYFRQLLGKPVDQLWAEYKTKAKYRNIDKKCNSFAIEVNGVFAL
        YSADYFRQLLGKPVD+LWA+YKTKAKY  ++ KCN+F IEVNGVFAL
Subjt:  YSADYFRQLLGKPVDQLWAEYKTKAKYRNIDKKCNSFAIEVNGVFAL

XP_038888579.1 uncharacterized protein LOC120078385 [Benincasa hispida]4.4e-11682.73Show/hide
Query:  MASNNLIFFLLPSLALLQSVFAVEYTVANNAGGTPGGIRFDNVIGANYSRQTLVAATALIWNIFQQSTAIDRKNVQKISLFIDKNLDGVLAFTTNNEIHV
        MASNNLIFFLLP LALLQS+ AVEYTV NNA GTPGGIRF+N+IGANYSRQTL AAT LIWNIF+QSTA DRKNVQKISL IDKN +GV AFTTN++IH+
Subjt:  MASNNLIFFLLPSLALLQSVFAVEYTVANNAGGTPGGIRFDNVIGANYSRQTLVAATALIWNIFQQSTAIDRKNVQKISLFIDKNLDGVLAFTTNNEIHV

Query:  SANYISSYNGDLKREITGVLYHEITHIWQWSGNLTAPGGLIEGIADYVRLKSGYIPGHWAEPGDGKRWDEGYDVTARFLDYLEGVKSGLVAKLNRRLRNG
         ANYISSY+GDLKREITGVLYHE+T+IWQW+GN  AP  LIEGIADYVRLKSGYIPG+W EPG G RWDEGYDVTARFLDYLEGV+SGLVA+LNRRLRN 
Subjt:  SANYISSYNGDLKREITGVLYHEITHIWQWSGNLTAPGGLIEGIADYVRLKSGYIPGHWAEPGDGKRWDEGYDVTARFLDYLEGVKSGLVAKLNRRLRNG

Query:  YSADYFRQLLGKPVDQLWAEYKTKAKYRNIDKKCNSFAIEVNGVFALQL
        YS DYFRQLLGKPVD+LWA+YKTKAKY  ++KKCN+F IEVNGVFALQL
Subjt:  YSADYFRQLLGKPVDQLWAEYKTKAKYRNIDKKCNSFAIEVNGVFALQL

XP_038888599.1 uncharacterized protein LOC120078398 [Benincasa hispida]1.5e-11684.74Show/hide
Query:  MASNNLIFFLLPSLALLQSVFAVEYTVANNAGGTPGGIRFDNVIGANYSRQTLVAATALIWNIFQQSTAIDRKNVQKISLFIDKNLDGVLAFTTNNEIHV
        MASNNLIFFLLPSLALLQ + AVEYTV NNA GT GG RFDN+IGANYSRQTLVAAT LIWNIF+QSTA DRK VQKISLFIDKN +GV A TTN+EIH+
Subjt:  MASNNLIFFLLPSLALLQSVFAVEYTVANNAGGTPGGIRFDNVIGANYSRQTLVAATALIWNIFQQSTAIDRKNVQKISLFIDKNLDGVLAFTTNNEIHV

Query:  SANYISSYNGDLKREITGVLYHEITHIWQWSGNLTAPGGLIEGIADYVRLKSGYIPGHWAEPGDGKRWDEGYDVTARFLDYLEGVKSGLVAKLNRRLRNG
         ANYISSY+GDLKREITGVLYHE+T+IWQW GN  APGGLIEGIADYVRLKSGYIPGHW EPG G  WDEGYDVTARFLDYLEGV+SGLVA+LNRRLRNG
Subjt:  SANYISSYNGDLKREITGVLYHEITHIWQWSGNLTAPGGLIEGIADYVRLKSGYIPGHWAEPGDGKRWDEGYDVTARFLDYLEGVKSGLVAKLNRRLRNG

Query:  YSADYFRQLLGKPVDQLWAEYKTKAKYRNIDKKCNSFAIEVNGVFALQL
        YSADYFRQ LGKPVD+LWAEYKTKAKY  +DKKC+SF IEVNGVFALQL
Subjt:  YSADYFRQLLGKPVDQLWAEYKTKAKYRNIDKKCNSFAIEVNGVFALQL

TrEMBL top hitse value%identityAlignment
A0A0A0K322 Uncharacterized protein3.3e-10981.22Show/hide
Query:  MASNNLIFF-LLPSLALLQSVFAVEYTVANNAGGTPGGIRFDNVIGANYSRQTLVAATALIWNIFQQSTAIDRKNVQKISLFIDKNLDGVLAFTTNNEIH
        M SNNLIFF LL  LALL+S+ AVEYTV NNA GTPGG RFD++IGANYSRQTLVAATALIWNIFQQSTA DRK+V+KISLFIDKNLDGV A   NNEIH
Subjt:  MASNNLIFF-LLPSLALLQSVFAVEYTVANNAGGTPGGIRFDNVIGANYSRQTLVAATALIWNIFQQSTAIDRKNVQKISLFIDKNLDGVLAFTTNNEIH

Query:  VSANYISSYNGDLKREITGVLYHEITHIWQWSGNLTAPGGLIEGIADYVRLKSGYIPGHWAEPGDGKRWDEGYDVTARFLDYLE-GVKSGLVAKLNRRLR
        VSANYISSY+GDLKREITG+LYHE+T+IWQW+GNL AP GLIEGIADYVRLKSGYIPG W EPG G  WDEGYDVTARFLDYLE  V+SGLVA+LNR++R
Subjt:  VSANYISSYNGDLKREITGVLYHEITHIWQWSGNLTAPGGLIEGIADYVRLKSGYIPGHWAEPGDGKRWDEGYDVTARFLDYLE-GVKSGLVAKLNRRLR

Query:  NGYSADYFRQLLGKPVDQLWAEYKTKAKYRNIDKKCNSFAIEVNG
        NGYS DYFRQL+GKPVD+LWAEYKTKAK+ N+D KCNSF IEV+G
Subjt:  NGYSADYFRQLLGKPVDQLWAEYKTKAKYRNIDKKCNSFAIEVNG

A0A1S3BZT8 uncharacterized protein LOC1034952683.8e-10577.11Show/hide
Query:  MASNNLIFFLLPSLALLQSVFAVEYTVANNAGGTPGGIRFDNVIGANYSRQTLVAATALIWNIFQQSTAIDRKNVQKISLFIDKNLDGVLAFTTNNEIHV
        M SNN IFFLL  LALL+S+ AVEY V NNA GTPGG RFD++IGANYSRQTLVAATALIWNIF+QSTA DRK+VQKISLFID N+D V AF  NNEIHV
Subjt:  MASNNLIFFLLPSLALLQSVFAVEYTVANNAGGTPGGIRFDNVIGANYSRQTLVAATALIWNIFQQSTAIDRKNVQKISLFIDKNLDGVLAFTTNNEIHV

Query:  SANYISSYNGDLKREITGVLYHEITHIWQWSGNLTAPGGLIEGIADYVRLKSGYIPGHWAEPGDGKRWDEGYDVTARFLDYLEGVKSGLVAKLNRRLRNG
        SA+YISSY+GDLKREITG+LYHE+T+I QWSGN+ AP GLIEGIADYVRLKSGYI   W EPG G RWDEGYDVTARFLDYLE V+SGLVA+LNR++RN 
Subjt:  SANYISSYNGDLKREITGVLYHEITHIWQWSGNLTAPGGLIEGIADYVRLKSGYIPGHWAEPGDGKRWDEGYDVTARFLDYLEGVKSGLVAKLNRRLRNG

Query:  YSADYFRQLLGKPVDQLWAEYKTKAKYRNIDKKCNSFAIEVNGVFALQL
        YS DYFRQLLGKPVD+LW EYKTK K   +D KC +F IE +GVFALQL
Subjt:  YSADYFRQLLGKPVDQLWAEYKTKAKYRNIDKKCNSFAIEVNGVFALQL

A0A5D3C5V2 NtPRp27-like protein3.8e-10577.11Show/hide
Query:  MASNNLIFFLLPSLALLQSVFAVEYTVANNAGGTPGGIRFDNVIGANYSRQTLVAATALIWNIFQQSTAIDRKNVQKISLFIDKNLDGVLAFTTNNEIHV
        M SNN IFFLL  LALL+S+ AVEY V NNA GTPGG RFD++IGANYSRQTLVAATALIWNIF+QSTA DRK+VQKISLFID N+D V AF  NNEIHV
Subjt:  MASNNLIFFLLPSLALLQSVFAVEYTVANNAGGTPGGIRFDNVIGANYSRQTLVAATALIWNIFQQSTAIDRKNVQKISLFIDKNLDGVLAFTTNNEIHV

Query:  SANYISSYNGDLKREITGVLYHEITHIWQWSGNLTAPGGLIEGIADYVRLKSGYIPGHWAEPGDGKRWDEGYDVTARFLDYLEGVKSGLVAKLNRRLRNG
        SA+YISSY+GDLKREITG+LYHE+T+I QWSGN+ AP GLIEGIADYVRLKSGYI   W EPG G RWDEGYDVTARFLDYLE V+SGLVA+LNR++RN 
Subjt:  SANYISSYNGDLKREITGVLYHEITHIWQWSGNLTAPGGLIEGIADYVRLKSGYIPGHWAEPGDGKRWDEGYDVTARFLDYLEGVKSGLVAKLNRRLRNG

Query:  YSADYFRQLLGKPVDQLWAEYKTKAKYRNIDKKCNSFAIEVNGVFALQL
        YS DYFRQLLGKPVD+LW EYKTK K   +D KC +F IE +GVFALQL
Subjt:  YSADYFRQLLGKPVDQLWAEYKTKAKYRNIDKKCNSFAIEVNGVFALQL

A0A6J1D406 uncharacterized protein LOC1110173791.8e-9980.63Show/hide
Query:  MASNNLIFFLLPSLALLQSVFAVEYTVANNAGGTPGGIRFDNVIGANYSRQTLVAATALIWNIFQQSTAIDRKNVQKISLFIDKNLDGVLAFTTNNEIHV
        MASNN IFFLL SLALLQ+V AVEYTV NNA GTPGG+RFDN IGA+YS QTLVAAT  IWNIFQQSTA DRKNV K+SLFID + DGV AF +NNEIHV
Subjt:  MASNNLIFFLLPSLALLQSVFAVEYTVANNAGGTPGGIRFDNVIGANYSRQTLVAATALIWNIFQQSTAIDRKNVQKISLFIDKNLDGVLAFTTNNEIHV

Query:  SANYISSYNGDLKREITGVLYHEITHIWQWSGNLTAPGGLIEGIADYVRLKSGYIPGHWAEPGDGKRWDEGYDVTARFLDYLEGVKSGLVAKLNRRLRNG
         ANYI++Y+GDLKREITGVLYHE+THIWQW+GN +APGGLIEGIADYVRLKSGYIPGHW  PG G RWD+GYDVTARFLDYLEG++SG V++LNRRLRNG
Subjt:  SANYISSYNGDLKREITGVLYHEITHIWQWSGNLTAPGGLIEGIADYVRLKSGYIPGHWAEPGDGKRWDEGYDVTARFLDYLEGVKSGLVAKLNRRLRNG

Query:  YSADYFRQLLGKPVDQLWAEYK
        Y ADYF QLLGK VDQLWA+YK
Subjt:  YSADYFRQLLGKPVDQLWAEYK

A0A6J1I4N3 uncharacterized protein LOC1114709522.3e-9474.67Show/hide
Query:  MASNNLIFFLLPSLALLQSVFAVEYTVANNAGGTPGGIRFDNVIGANYSRQTLVAATALIWNIFQQSTAIDRKNVQKISLFIDKNLDGVLAFTTNNEIHV
        MASN +IFFL  SL LLQ+V AVEY V N+A GTPGGIRFDN IGA+YSRQ L AAT  IW IF+QS+  DRKNVQK+SLFID++ DGV AF +N+EIHV
Subjt:  MASNNLIFFLLPSLALLQSVFAVEYTVANNAGGTPGGIRFDNVIGANYSRQTLVAATALIWNIFQQSTAIDRKNVQKISLFIDKNLDGVLAFTTNNEIHV

Query:  SANYISSYNGDLKREITGVLYHEITHIWQWSGNLTAPGGLIEGIADYVRLKSGYIPGHWAEPGDGKRWDEGYDVTARFLDYLEGVKSGLVAKLNRRLRNG
         A+YI++Y GDLK+EITGVLYHE+THIWQW+GN ++PGGLIEGIADYVRLKSGYIPGHW  PG G RWD+GYDVTARFLDYLEG++SG VA+LNR L+NG
Subjt:  SANYISSYNGDLKREITGVLYHEITHIWQWSGNLTAPGGLIEGIADYVRLKSGYIPGHWAEPGDGKRWDEGYDVTARFLDYLEGVKSGLVAKLNRRLRNG

Query:  YSADYFRQLLGKPVDQLWAEYKTKAKYRN
        YSADYF QLLGKPVDQLWA+Y  KA +RN
Subjt:  YSADYFRQLLGKPVDQLWAEYKTKAKYRN

SwissProt top hitse value%identityAlignment
C0HJG8 Basic secretory protease (Fragments)1.3e-0649.09Show/hide
Query:  WDEGYDVTARFLDYLEGVKSGLVAKLNRRLRNGYSADYFRQLLGKPVDQLWAEYK
        WDEGYDVTARFLDYL  + +G VA+L                  K V+QLW+EYK
Subjt:  WDEGYDVTARFLDYLEGVKSGLVAKLNRRLRNGYSADYFRQLLGKPVDQLWAEYK

Arabidopsis top hitse value%identityAlignment
AT2G15130.1 Plant basic secretory protein (BSP) family protein3.7e-6049.77Show/hide
Query:  IFFLLPSLALLQSVFAVEYTVANNAGGTPGGIRFDNVI-GANYSRQTLVAATALIWNIFQQSTAIDRKNVQKISLFIDKNLDGVLAFTTNNEIHVSANYI
        IF ++  +  +  V AV+++V +N G +PGG RF N I G +Y  Q+L  AT   W +FQQ+   DRK+V KI+LF++ N +G+ A+++ +EIH +A  +
Subjt:  IFFLLPSLALLQSVFAVEYTVANNAGGTPGGIRFDNVI-GANYSRQTLVAATALIWNIFQQSTAIDRKNVQKISLFIDKNLDGVLAFTTNNEIHVSANYI

Query:  SSYNGDLKREITGVLYHEITHIWQWSGNLTAPGGLIEGIADYVRLKSGYIPGHWAEPGDGKRWDEGYDVTARFLDYLEGVKSGLVAKLNRRLRNGYSADY
            G ++R  TGV+YHE+ H WQW+G   APGGLIEGIADYVRLK+GY+  HW  PG G RWD+GYDVTARFL+Y   +++G VA+LN+++R+ Y+  +
Subjt:  SSYNGDLKREITGVLYHEITHIWQWSGNLTAPGGLIEGIADYVRLKSGYIPGHWAEPGDGKRWDEGYDVTARFLDYLEGVKSGLVAKLNRRLRNGYSADY

Query:  FRQLLGKPVDQLWAEYK
        F  LLGK V+QLW EYK
Subjt:  FRQLLGKPVDQLWAEYK

AT2G15130.2 Plant basic secretory protein (BSP) family protein8.3e-4453.96Show/hide
Query:  KNLDGVLAFTTNNEIHVSANYISSYNGDLKREITGVLYHEITHIWQWSGNLTAPGGLIEGIADYVRLKSGYIPGHWAEPGDGKRWDEGYDVTARFLDYLE
        +N +G+ A+++ +EIH +A  +    G ++R  TGV+YHE+ H WQW+G   APGGLIEGIADYVRLK+GY+  HW  PG G RWD+GYDVTARFL+Y  
Subjt:  KNLDGVLAFTTNNEIHVSANYISSYNGDLKREITGVLYHEITHIWQWSGNLTAPGGLIEGIADYVRLKSGYIPGHWAEPGDGKRWDEGYDVTARFLDYLE

Query:  GVKSGLVAKLNRRLRNGYSADYFRQLLGKPVDQLWAEYK
         +++G VA+LN+++R+ Y+  +F  LLGK V+QLW EYK
Subjt:  GVKSGLVAKLNRRLRNGYSADYFRQLLGKPVDQLWAEYK

AT2G15170.1 Plant basic secretory protein (BSP) family protein1.2e-1036.46Show/hide
Query:  IFFLLPSLALLQSVFAVEYTVANNAGGTPGGIRF-DNVIGANYSRQTLVAATALIWNIFQQSTAIDRKNVQKISLFIDKNLDGVLAFTTN--NEIH
        IF ++  +  +  V AV++ V +N G +PGG +F D + G +Y +Q++ +AT   W +FQQ+  +DRK +  I+LFI+ +    +A+ TN   EIH
Subjt:  IFFLLPSLALLQSVFAVEYTVANNAGGTPGGIRF-DNVIGANYSRQTLVAATALIWNIFQQSTAIDRKNVQKISLFIDKNLDGVLAFTTN--NEIH

AT2G15220.1 Plant basic secretory protein (BSP) family protein5.7e-6955.71Show/hide
Query:  IFFLLPSLALLQSVFAVEYTVANNAGGTPGGIRFDNVIGA-NYSRQTLVAATALIWNIFQQSTAIDRKNVQKISLFIDKNLDGVLAFTTNNEIHVSANYI
        IFF++  + ++  V AV+Y+V +N+G + GG RF   IG  +Y  QTL +AT  +W +FQQ+   DRK+V KI+LF++ N DGV A+ + NEIH +  Y+
Subjt:  IFFLLPSLALLQSVFAVEYTVANNAGGTPGGIRFDNVIGA-NYSRQTLVAATALIWNIFQQSTAIDRKNVQKISLFIDKNLDGVLAFTTNNEIHVSANYI

Query:  SSYNGDLKREITGVLYHEITHIWQWSGNLTAPGGLIEGIADYVRLKSGYIPGHWAEPGDGKRWDEGYDVTARFLDYLEGVKSGLVAKLNRRLRNGYSADY
        +  +GD+KRE TGV+YHE+ H WQW+G   APGGLIEGIADYVRLK+GY P HW  PG G RWD+GYDVTARFLDY  G+++G VA+LN+++RNGYS  +
Subjt:  SSYNGDLKREITGVLYHEITHIWQWSGNLTAPGGLIEGIADYVRLKSGYIPGHWAEPGDGKRWDEGYDVTARFLDYLEGVKSGLVAKLNRRLRNGYSADY

Query:  FRQLLGKPVDQLWAEYKTK
        F  LLGK V+QLW EYK K
Subjt:  FRQLLGKPVDQLWAEYKTK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCAACAACTTAATCTTCTTCCTCTTGCCCTCTCTCGCACTCCTACAATCCGTCTTCGCCGTTGAGTACACCGTCGCCAACAATGCCGGTGGCACCCCCGGCGG
CATTCGCTTCGACAACGTAATCGGAGCAAATTACAGCCGGCAGACGCTGGTAGCTGCCACTGCTTTAATATGGAACATTTTCCAGCAAAGCACCGCGATCGATCGGAAGA
ACGTGCAGAAGATCAGCCTGTTTATTGATAAAAACTTGGATGGAGTACTGGCGTTCACTACCAACAACGAGATTCATGTCAGCGCAAACTATATTTCAAGCTATAATGGG
GATTTGAAAAGGGAGATCACAGGGGTGTTGTACCACGAAATAACCCATATATGGCAGTGGAGTGGGAACTTGACGGCGCCCGGCGGGCTGATTGAAGGGATCGCCGATTA
CGTACGGCTTAAGTCGGGTTATATTCCGGGGCACTGGGCAGAGCCGGGCGACGGGAAGAGGTGGGACGAAGGCTACGATGTGACGGCGAGGTTTTTGGATTATTTGGAGG
GGGTGAAAAGTGGGTTGGTGGCGAAGCTAAACCGGAGGCTGAGAAATGGCTACTCCGCCGATTACTTCCGGCAGTTGTTGGGGAAGCCGGTGGATCAGCTGTGGGCTGAG
TATAAGACTAAGGCCAAGTATAGGAATATTGATAAGAAATGCAATAGTTTTGCAATTGAAGTCAATGGAGTTTTTGCCCTACAGCTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCCAACAACTTAATCTTCTTCCTCTTGCCCTCTCTCGCACTCCTACAATCCGTCTTCGCCGTTGAGTACACCGTCGCCAACAATGCCGGTGGCACCCCCGGCGG
CATTCGCTTCGACAACGTAATCGGAGCAAATTACAGCCGGCAGACGCTGGTAGCTGCCACTGCTTTAATATGGAACATTTTCCAGCAAAGCACCGCGATCGATCGGAAGA
ACGTGCAGAAGATCAGCCTGTTTATTGATAAAAACTTGGATGGAGTACTGGCGTTCACTACCAACAACGAGATTCATGTCAGCGCAAACTATATTTCAAGCTATAATGGG
GATTTGAAAAGGGAGATCACAGGGGTGTTGTACCACGAAATAACCCATATATGGCAGTGGAGTGGGAACTTGACGGCGCCCGGCGGGCTGATTGAAGGGATCGCCGATTA
CGTACGGCTTAAGTCGGGTTATATTCCGGGGCACTGGGCAGAGCCGGGCGACGGGAAGAGGTGGGACGAAGGCTACGATGTGACGGCGAGGTTTTTGGATTATTTGGAGG
GGGTGAAAAGTGGGTTGGTGGCGAAGCTAAACCGGAGGCTGAGAAATGGCTACTCCGCCGATTACTTCCGGCAGTTGTTGGGGAAGCCGGTGGATCAGCTGTGGGCTGAG
TATAAGACTAAGGCCAAGTATAGGAATATTGATAAGAAATGCAATAGTTTTGCAATTGAAGTCAATGGAGTTTTTGCCCTACAGCTGTAA
Protein sequenceShow/hide protein sequence
MASNNLIFFLLPSLALLQSVFAVEYTVANNAGGTPGGIRFDNVIGANYSRQTLVAATALIWNIFQQSTAIDRKNVQKISLFIDKNLDGVLAFTTNNEIHVSANYISSYNG
DLKREITGVLYHEITHIWQWSGNLTAPGGLIEGIADYVRLKSGYIPGHWAEPGDGKRWDEGYDVTARFLDYLEGVKSGLVAKLNRRLRNGYSADYFRQLLGKPVDQLWAE
YKTKAKYRNIDKKCNSFAIEVNGVFALQL