; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc02G06910 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc02G06910
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionPlant basic secretory protein (BSP) family protein
Genome locationClcChr02:6975405..6976859
RNA-Seq ExpressionClc02G06910
SyntenyClc02G06910
Gene Ontology termsNA
InterPro domainsIPR007541 - Uncharacterised protein family, basic secretory protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8645853.1 hypothetical protein Csa_017343 [Cucumis sativus]2.6e-10680.91Show/hide
Query:  MASNNLI-FLLLPSLALLQSIFAVQYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHV
        M SNNLI FLLL  LALL+SI AV+YTVTN A GT GG RFD+IIGANYSRQTLVAATALIWNIF+QSTAADRK+V+K+SLFIDKN DGVA   NNEIHV
Subjt:  MASNNLI-FLLLPSLALLQSIFAVQYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHV

Query:  SANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLE-GVRSGLVGELNRRLRD
        SANYISSYSGDLKRE TG+LYHE+T+IWQW+GN   P GLIEGIADYVRLKSGYIPG WVEPGGGN WDEGYDVTARFL+YLE  VRSGLV ELNR++R+
Subjt:  SANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLE-GVRSGLVGELNRRLRD

Query:  GYSADYFRQLVGKPVDELWAEYKTKAKYGNLHKKCQSFKIE
        GYS DYFRQL+GKPVDELWAEYKTKAK+GN+  KC SF+IE
Subjt:  GYSADYFRQLVGKPVDELWAEYKTKAKYGNLHKKCQSFKIE

XP_011658890.1 uncharacterized protein LOC105434420 [Cucumis sativus]1.5e-10980.4Show/hide
Query:  MASNNLI-FLLLPSLALLQSIFAVQYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHV
        M SNNLI FLLL  LALL+SI AV+YTVTN A GT GG RFD+IIGANYSRQTLVAATALIWNIF+QSTAADRK+V+K+SLFIDKN DGVA   NNEIHV
Subjt:  MASNNLI-FLLLPSLALLQSIFAVQYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHV

Query:  SANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLE-GVRSGLVGELNRRLRD
        SANYISSYSGDLKRE TG+LYHE+T+IWQW+GN   P GLIEGIADYVRLKSGYIPG WVEPGGGN WDEGYDVTARFL+YLE  VRSGLV ELNR++R+
Subjt:  SANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLE-GVRSGLVGELNRRLRD

Query:  GYSADYFRQLVGKPVDELWAEYKTKAKYGNLHKKCQSFKIEAYGVFVLQL
        GYS DYFRQL+GKPVDELWAEYKTKAK+GN+  KC SF+IE  GVF LQL
Subjt:  GYSADYFRQLVGKPVDELWAEYKTKAKYGNLHKKCQSFKIEAYGVFVLQL

XP_038888417.1 uncharacterized protein LOC120078264 [Benincasa hispida]2.9e-11884.15Show/hide
Query:  MASNNLIFLLLPSLALLQSIFAVQYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHVS
        MASNNLIF LLPSLALLQSI AV+YTVTN A GT GGTRFDNIIGANYSRQTL AAT LIWNIFRQSTAADRK+VQK+SLFIDKN++GVAFTTN+EIH+ 
Subjt:  MASNNLIFLLLPSLALLQSIFAVQYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHVS

Query:  ANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEGVRSGLVGELNRRLRDGY
        ANYISSYSGDLKRE TGVLYHE+T+IWQWSGN   P GLIEGIADY+RLKSGYIPG WVEPGGGNRWDEGYDVTARFL+YLEGVRSGLV +LNRRLR+GY
Subjt:  ANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEGVRSGLVGELNRRLRDGY

Query:  SADYFRQLVGKPVDELWAEYKTKAKYGNLHKKCQSFKIEAYGVFVL
        SADYFRQL+GKPVDELWA+YKTKAKYG L+ KC +F+IE  GVF L
Subjt:  SADYFRQLVGKPVDELWAEYKTKAKYGNLHKKCQSFKIEAYGVFVL

XP_038888579.1 uncharacterized protein LOC120078385 [Benincasa hispida]1.5e-11482.26Show/hide
Query:  MASNNLIFLLLPSLALLQSIFAVQYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHVS
        MASNNLIF LLP LALLQSI AV+YTVTN A GT GG RF+NIIGANYSRQTL AAT LIWNIFRQSTAADRKNVQK+SL IDKN++GVAFTTN++IH+ 
Subjt:  MASNNLIFLLLPSLALLQSIFAVQYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHVS

Query:  ANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEGVRSGLVGELNRRLRDGY
        ANYISSYSGDLKRE TGVLYHE+T+IWQW+GN   P  LIEGIADYVRLKSGYIPG WVEPGGGNRWDEGYDVTARFL+YLEGVRSGLV ELNRRLR+ Y
Subjt:  ANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEGVRSGLVGELNRRLRDGY

Query:  SADYFRQLVGKPVDELWAEYKTKAKYGNLHKKCQSFKIEAYGVFVLQL
        S DYFRQL+GKPVDELWA+YKTKAKYG L+KKC +F+IE  GVF LQL
Subjt:  SADYFRQLVGKPVDELWAEYKTKAKYGNLHKKCQSFKIEAYGVFVLQL

XP_038888599.1 uncharacterized protein LOC120078398 [Benincasa hispida]5.0e-11885.48Show/hide
Query:  MASNNLIFLLLPSLALLQSIFAVQYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHVS
        MASNNLIF LLPSLALLQ I AV+YTVTN A GTLGGTRFDNIIGANYSRQTLVAAT LIWNIF+QSTAADRK VQK+SLFIDKN++GVA TTN+EIH+ 
Subjt:  MASNNLIFLLLPSLALLQSIFAVQYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHVS

Query:  ANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEGVRSGLVGELNRRLRDGY
        ANYISSYSGDLKRE TGVLYHE+T+IWQW GNS  PGGLIEGIADYVRLKSGYIPG+WVEPGGGN WDEGYDVTARFL+YLEGVRSGLV ELNRRLR+GY
Subjt:  ANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEGVRSGLVGELNRRLRDGY

Query:  SADYFRQLVGKPVDELWAEYKTKAKYGNLHKKCQSFKIEAYGVFVLQL
        SADYFRQ +GKPVDELWAEYKTKAKYG L KKC SF+IE  GVF LQL
Subjt:  SADYFRQLVGKPVDELWAEYKTKAKYGNLHKKCQSFKIEAYGVFVLQL

TrEMBL top hitse value%identityAlignment
A0A0A0K322 Uncharacterized protein2.5e-10779.03Show/hide
Query:  MASNNLI-FLLLPSLALLQSIFAVQYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHV
        M SNNLI FLLL  LALL+SI AV+YTVTN A GT GG RFD+IIGANYSRQTLVAATALIWNIF+QSTAADRK+V+K+SLFIDKN DGVA   NNEIHV
Subjt:  MASNNLI-FLLLPSLALLQSIFAVQYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHV

Query:  SANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLE-GVRSGLVGELNRRLRD
        SANYISSYSGDLKRE TG+LYHE+T+IWQW+GN   P GLIEGIADYVRLKSGYIPG WVEPGGGN WDEGYDVTARFL+YLE  VRSGLV ELNR++R+
Subjt:  SANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLE-GVRSGLVGELNRRLRD

Query:  GYSADYFRQLVGKPVDELWAEYKTKAKYGNLHKKCQSFKIEAYGVFVL
        GYS DYFRQL+GKPVDELWAEYKTKAK+GN+  KC SF+IE  G  ++
Subjt:  GYSADYFRQLVGKPVDELWAEYKTKAKYGNLHKKCQSFKIEAYGVFVL

A0A1S3BZT8 uncharacterized protein LOC1034952686.2e-10676.86Show/hide
Query:  LRSTRTSMASNNLIFLLLPSLALLQSIFAVQYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTT
        L  T  SM SNN IF LL  LALL+SI AV+Y V N A GT GG RFD+IIGANYSRQTLVAATALIWNIFRQSTAADRK+VQK+SLFID N D VAF  
Subjt:  LRSTRTSMASNNLIFLLLPSLALLQSIFAVQYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTT

Query:  NNEIHVSANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEGVRSGLVGELN
        NNEIHVSA+YISSYSGDLKRE TG+LYHE+T+I QWSGN   P GLIEGIADYVRLKSGYI   WVEPGGGNRWDEGYDVTARFL+YLE VRSGLV ELN
Subjt:  NNEIHVSANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEGVRSGLVGELN

Query:  RRLRDGYSADYFRQLVGKPVDELWAEYKTKAKYGNLHKKCQSFKIEAYGVFVLQL
        R++R+ YS DYFRQL+GKPVDELW EYKTK K G L  KC++F+IEA GVF LQL
Subjt:  RRLRDGYSADYFRQLVGKPVDELWAEYKTKAKYGNLHKKCQSFKIEAYGVFVLQL

A0A5D3C5V2 NtPRp27-like protein5.3e-10577.82Show/hide
Query:  MASNNLIFLLLPSLALLQSIFAVQYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHVS
        M SNN IF LL  LALL+SI AV+Y V N A GT GG RFD+IIGANYSRQTLVAATALIWNIFRQSTAADRK+VQK+SLFID N D VAF  NNEIHVS
Subjt:  MASNNLIFLLLPSLALLQSIFAVQYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHVS

Query:  ANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEGVRSGLVGELNRRLRDGY
        A+YISSYSGDLKRE TG+LYHE+T+I QWSGN   P GLIEGIADYVRLKSGYI   WVEPGGGNRWDEGYDVTARFL+YLE VRSGLV ELNR++R+ Y
Subjt:  ANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEGVRSGLVGELNRRLRDGY

Query:  SADYFRQLVGKPVDELWAEYKTKAKYGNLHKKCQSFKIEAYGVFVLQL
        S DYFRQL+GKPVDELW EYKTK K G L  KC++F+IEA GVF LQL
Subjt:  SADYFRQLVGKPVDELWAEYKTKAKYGNLHKKCQSFKIEAYGVFVLQL

A0A6J1D406 uncharacterized protein LOC1110173794.8e-9877.63Show/hide
Query:  MASNNLIFLLLPSLALLQSIFAVQYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHVS
        MASNN IF LL SLALLQ++ AV+YTVTN A GT GG RFDN IGA+YS QTLVAAT  IWNIF+QSTAADRKNV KVSLFID +YDGVAF +NNEIHV 
Subjt:  MASNNLIFLLLPSLALLQSIFAVQYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHVS

Query:  ANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEGVRSGLVGELNRRLRDGY
        ANYI++Y GDLKRE TGVLYHE+THIWQW+GN + PGGLIEGIADYVRLKSGYIPG+WV PGGG+RWD+GYDVTARFL+YLEG+RSG V ELNRRLR+GY
Subjt:  ANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEGVRSGLVGELNRRLRDGY

Query:  SADYFRQLVGKPVDELWAEYKTKAKYGN
         ADYF QL+GK VD+LWA+Y  KA +GN
Subjt:  SADYFRQLVGKPVDELWAEYKTKAKYGN

A0A6J1GMQ9 uncharacterized protein LOC1114553818.4e-9574.78Show/hide
Query:  MASNNLIFLLLPSLALLQSIFAVQYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHVS
        MASN +IF L PSL LLQ++ AV+YTVTN A GTLGG RFDN IG +YS+Q L AAT  IWNIFRQS+ ADRKNVQKVSLFIDKNYDGVAF  N+EIHV 
Subjt:  MASNNLIFLLLPSLALLQSIFAVQYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHVS

Query:  ANYISSYSGDLK--REFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEGVRSGLVGELNRRLRD
        A+YI++Y GDLK     TGVLYHE+THIWQW+GN + PGGLIEGIADYVRLKSGYIPG+WV PGGG+RWD+GYDVTARFL+YLEG+RSG V ELNRRL++
Subjt:  ANYISSYSGDLK--REFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEGVRSGLVGELNRRLRD

Query:  GYSADYFRQLVGKPVDELWAEYKTKAKYGN
        GYSADYF QL+GKPVD LWA+Y  KA +GN
Subjt:  GYSADYFRQLVGKPVDELWAEYKTKAKYGN

SwissProt top hitse value%identityAlignment
C0HJG8 Basic secretory protease (Fragments)4.1e-0645.9Show/hide
Query:  WDEGYDVTARFLEYLEGVRSGLVGELNRRLRDGYSADYFRQLVGKPVDELWAEYKTKAKYG
        WDEGYDVTARFL+YL  + +G V EL                  K V++LW+EY  KA YG
Subjt:  WDEGYDVTARFLEYLEGVRSGLVGELNRRLRDGYSADYFRQLVGKPVDELWAEYKTKAKYG

Arabidopsis top hitse value%identityAlignment
AT2G15130.1 Plant basic secretory protein (BSP) family protein3.5e-6148.65Show/hide
Query:  IFLLLPSLALLQSIFAVQYTVTNKAGGTLGGTRFDNII-GANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHVSANYIS
        IFL++  +  +  + AV ++V +  G + GG RF N I G +Y  Q+L  AT   W +F+Q+  +DRK+V K++LF++ N +G+A+++ +EIH +A  + 
Subjt:  IFLLLPSLALLQSIFAVQYTVTNKAGGTLGGTRFDNII-GANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHVSANYIS

Query:  SYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEGVRSGLVGELNRRLRDGYSADYF
           G ++R FTGV+YHE+ H WQW+G    PGGLIEGIADYVRLK+GY+  +WV PGGG+RWD+GYDVTARFLEY   +R+G V ELN+++R  Y+  +F
Subjt:  SYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEGVRSGLVGELNRRLRDGYSADYF

Query:  RQLVGKPVDELWAEYKTKAKYG
          L+GK V++LW EY  KA YG
Subjt:  RQLVGKPVDELWAEYKTKAKYG

AT2G15130.2 Plant basic secretory protein (BSP) family protein8.4e-4754.86Show/hide
Query:  KNYDGVAFTTNNEIHVSANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEG
        +N +G+A+++ +EIH +A  +    G ++R FTGV+YHE+ H WQW+G    PGGLIEGIADYVRLK+GY+  +WV PGGG+RWD+GYDVTARFLEY   
Subjt:  KNYDGVAFTTNNEIHVSANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEG

Query:  VRSGLVGELNRRLRDGYSADYFRQLVGKPVDELWAEYKTKAKYG
        +R+G V ELN+++R  Y+  +F  L+GK V++LW EY  KA YG
Subjt:  VRSGLVGELNRRLRDGYSADYFRQLVGKPVDELWAEYKTKAKYG

AT2G15170.1 Plant basic secretory protein (BSP) family protein3.1e-0935.79Show/hide
Query:  IFLLLPSLALLQSIFAVQYTVTNKAGGTLGGTRF-DNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTN--NEIH
        IFL++  +  +  + AV + V +  G + GG +F D I G +Y +Q++ +AT   W +F+Q+   DRK +  ++LFI+ N + VA+ TN   EIH
Subjt:  IFLLLPSLALLQSIFAVQYTVTNKAGGTLGGTRF-DNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTN--NEIH

AT2G15220.1 Plant basic secretory protein (BSP) family protein2.3e-6853.15Show/hide
Query:  IFLLLPSLALLQSIFAVQYTVTNKAGGTLGGTRFDNIIGA-NYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHVSANYIS
        IF ++  + ++  + AV Y+V + +G + GG RF   IG  +Y  QTL +AT  +W +F+Q+  +DRK+V K++LF++ N DGVA+ + NEIH +  Y++
Subjt:  IFLLLPSLALLQSIFAVQYTVTNKAGGTLGGTRFDNIIGA-NYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHVSANYIS

Query:  SYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEGVRSGLVGELNRRLRDGYSADYF
          SGD+KREFTGV+YHE+ H WQW+G    PGGLIEGIADYVRLK+GY P +WV PG G+RWD+GYDVTARFL+Y  G+R+G V ELN+++R+GYS  +F
Subjt:  SYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEGVRSGLVGELNRRLRDGYSADYF

Query:  RQLVGKPVDELWAEYKTKAKYG
          L+GK V++LW EY  KAKYG
Subjt:  RQLVGKPVDELWAEYKTKAKYG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCAAAATAAAGCTTCAAAAGCTCAGATCCACACGTACGTCCATGGCTTCCAATAACTTAATCTTCTTACTCTTGCCCTCTCTCGCTCTCCTACAATCCATCTTCGC
CGTACAGTACACAGTCACCAACAAGGCCGGTGGCACCCTCGGCGGCACCCGGTTCGACAACATAATCGGAGCGAATTACAGCCGGCAAACGCTGGTTGCTGCCACTGCTT
TAATATGGAACATTTTTCGGCAAAGTACTGCCGCCGACCGGAAGAACGTGCAGAAGGTCAGCCTGTTTATTGATAAAAACTACGATGGAGTGGCGTTCACTACCAACAAC
GAGATTCATGTCAGCGCGAACTATATCTCAAGCTACAGCGGGGATTTGAAAAGGGAGTTCACAGGAGTGTTGTACCACGAAATAACCCATATCTGGCAGTGGAGTGGGAA
TTCGACGGTGCCCGGCGGGCTGATTGAAGGGATCGCGGATTACGTACGGCTGAAGTCGGGGTATATTCCGGGGAACTGGGTGGAGCCGGGTGGCGGGAACAGGTGGGACG
AAGGCTACGACGTGACGGCGAGATTTTTGGAGTATTTAGAGGGGGTGAGAAGTGGGTTGGTGGGGGAGCTGAACCGGAGGCTGAGAGATGGCTACTCTGCCGATTACTTC
CGGCAGCTGGTGGGGAAGCCGGTGGATGAGCTGTGGGCTGAGTATAAGACAAAGGCCAAGTATGGGAATCTTCACAAGAAATGCCAAAGTTTTAAAATTGAAGCCTATGG
GGTTTTTGTCCTACAGCTGTAA
mRNA sequenceShow/hide mRNA sequence
CAAAAGGTTACTTTTCCAATTTTCAAAAACACATTTCCAAAAACAAATTATAGAGTAGTTTCCAAATTTTGGACGACAAAGTCAAGCCAACACAATTTGACAATTCTTCC
ATTTTCCTAAATTGAAATCAATGACCAAAATAAAGCTTCAAAAGCTCAGATCCACACGTACGTCCATGGCTTCCAATAACTTAATCTTCTTACTCTTGCCCTCTCTCGCT
CTCCTACAATCCATCTTCGCCGTACAGTACACAGTCACCAACAAGGCCGGTGGCACCCTCGGCGGCACCCGGTTCGACAACATAATCGGAGCGAATTACAGCCGGCAAAC
GCTGGTTGCTGCCACTGCTTTAATATGGAACATTTTTCGGCAAAGTACTGCCGCCGACCGGAAGAACGTGCAGAAGGTCAGCCTGTTTATTGATAAAAACTACGATGGAG
TGGCGTTCACTACCAACAACGAGATTCATGTCAGCGCGAACTATATCTCAAGCTACAGCGGGGATTTGAAAAGGGAGTTCACAGGAGTGTTGTACCACGAAATAACCCAT
ATCTGGCAGTGGAGTGGGAATTCGACGGTGCCCGGCGGGCTGATTGAAGGGATCGCGGATTACGTACGGCTGAAGTCGGGGTATATTCCGGGGAACTGGGTGGAGCCGGG
TGGCGGGAACAGGTGGGACGAAGGCTACGACGTGACGGCGAGATTTTTGGAGTATTTAGAGGGGGTGAGAAGTGGGTTGGTGGGGGAGCTGAACCGGAGGCTGAGAGATG
GCTACTCTGCCGATTACTTCCGGCAGCTGGTGGGGAAGCCGGTGGATGAGCTGTGGGCTGAGTATAAGACAAAGGCCAAGTATGGGAATCTTCACAAGAAATGCCAAAGT
TTTAAAATTGAAGCCTATGGGGTTTTTGTCCTACAGCTGTAAGAAGAATTAAAGTGTTTGGGATTCAAAAGTCTAACATGTATGTGTACTTTTAGGCTTCCATTTGGATT
GGTGTCTCATGTAATCAATTCATTATTGATTTATTGCATCAGAGACCAAAGTATTTGATAGTGA
Protein sequenceShow/hide protein sequence
MTKIKLQKLRSTRTSMASNNLIFLLLPSLALLQSIFAVQYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNN
EIHVSANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEGVRSGLVGELNRRLRDGYSADYF
RQLVGKPVDELWAEYKTKAKYGNLHKKCQSFKIEAYGVFVLQL