CuGenDBv2

CaUC02G032630 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene ID: CaUC02G032630
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: Plant basic secretory protein (BSP) family protein
Genome location: Ciama_Chr02:7734647..7737124
RNA-Seq Expression: CaUC02G032630
Synteny: CaUC02G032630
Gene Ontology terms: NA
InterPro domains: IPR007541 - Uncharacterised protein family, basic secretory protein
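
The genome location string can be split into a sequence name and coordinates for downstream use. The following is a minimal sketch, not part of the database itself; the function names `parse_location` and `span_length` are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1


seqid, start, end = parse_location("Ciama_Chr02:7734647..7737124")
print(seqid, start, end, span_length(start, end))
# prints: Ciama_Chr02 7734647 7737124 2478
```

Under that assumption the gene spans 2478 bp on Ciama_Chr02.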


Homology
GenBank top hits (e value, % identity, alignment)
KAE8645853.1 hypothetical protein Csa_017343 [Cucumis sativus] | e value: 1.4e-107 | % identity: 81.74
Query:  MASNNLI-FLLLPSLALLQSIFAVEYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHV
        M SNNLI FLLL  LALL+SI AVEYTVTN A GT GG RFD+IIGANYSRQTLVAATALIWNIF+QSTAADRK+V+K+SLFIDKN DGVA   NNEIHV
Subjt:  MASNNLI-FLLLPSLALLQSIFAVEYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHV

Query:  SANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLE-GVRSGLVGELNRRLRD
        SANYISSYSGDLKRE TG+LYHE+T+IWQW+GN   P GLIEGIADYVRLKSGYIPG WVEPGGGN WDEGYDVTARFL+YLE  VRSGLV ELNR++R+
Subjt:  SANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLE-GVRSGLVGELNRRLRD

Query:  GYSADYFRQLVGKPVDELWAEYKTKAKYGNLDKKCQSFKIE
        GYS DYFRQL+GKPVDELWAEYKTKAK+GN+D KC SF+IE
Subjt:  GYSADYFRQLVGKPVDELWAEYKTKAKYGNLDKKCQSFKIE

XP_011658890.1 uncharacterized protein LOC105434420 [Cucumis sativus] | e value: 1.0e-110 | % identity: 81.2
Query:  MASNNLI-FLLLPSLALLQSIFAVEYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHV
        M SNNLI FLLL  LALL+SI AVEYTVTN A GT GG RFD+IIGANYSRQTLVAATALIWNIF+QSTAADRK+V+K+SLFIDKN DGVA   NNEIHV
Subjt:  MASNNLI-FLLLPSLALLQSIFAVEYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHV

Query:  SANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLE-GVRSGLVGELNRRLRD
        SANYISSYSGDLKRE TG+LYHE+T+IWQW+GN   P GLIEGIADYVRLKSGYIPG WVEPGGGN WDEGYDVTARFL+YLE  VRSGLV ELNR++R+
Subjt:  SANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLE-GVRSGLVGELNRRLRD

Query:  GYSADYFRQLVGKPVDELWAEYKTKAKYGNLDKKCQSFKIEAYGVFVLQL
        GYS DYFRQL+GKPVDELWAEYKTKAK+GN+D KC SF+IE  GVF LQL
Subjt:  GYSADYFRQLVGKPVDELWAEYKTKAKYGNLDKKCQSFKIEAYGVFVLQL

XP_038888417.1 uncharacterized protein LOC120078264 [Benincasa hispida] | e value: 1.3e-118 | % identity: 84.55
Query:  MASNNLIFLLLPSLALLQSIFAVEYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHVS
        MASNNLIF LLPSLALLQSI AVEYTVTN A GT GGTRFDNIIGANYSRQTL AAT LIWNIFRQSTAADRK+VQK+SLFIDKN++GVAFTTN+EIH+ 
Subjt:  MASNNLIFLLLPSLALLQSIFAVEYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHVS

Query:  ANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEGVRSGLVGELNRRLRDGY
        ANYISSYSGDLKRE TGVLYHE+T+IWQWSGN   P GLIEGIADY+RLKSGYIPG WVEPGGGNRWDEGYDVTARFL+YLEGVRSGLV +LNRRLR+GY
Subjt:  ANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEGVRSGLVGELNRRLRDGY

Query:  SADYFRQLVGKPVDELWAEYKTKAKYGNLDKKCQSFKIEAYGVFVL
        SADYFRQL+GKPVDELWA+YKTKAKYG L+ KC +F+IE  GVF L
Subjt:  SADYFRQLVGKPVDELWAEYKTKAKYGNLDKKCQSFKIEAYGVFVL

XP_038888579.1 uncharacterized protein LOC120078385 [Benincasa hispida] | e value: 6.8e-115 | % identity: 82.66
Query:  MASNNLIFLLLPSLALLQSIFAVEYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHVS
        MASNNLIF LLP LALLQSI AVEYTVTN A GT GG RF+NIIGANYSRQTL AAT LIWNIFRQSTAADRKNVQK+SL IDKN++GVAFTTN++IH+ 
Subjt:  MASNNLIFLLLPSLALLQSIFAVEYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHVS

Query:  ANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEGVRSGLVGELNRRLRDGY
        ANYISSYSGDLKRE TGVLYHE+T+IWQW+GN   P  LIEGIADYVRLKSGYIPG WVEPGGGNRWDEGYDVTARFL+YLEGVRSGLV ELNRRLR+ Y
Subjt:  ANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEGVRSGLVGELNRRLRDGY

Query:  SADYFRQLVGKPVDELWAEYKTKAKYGNLDKKCQSFKIEAYGVFVLQL
        S DYFRQL+GKPVDELWA+YKTKAKYG L+KKC +F+IE  GVF LQL
Subjt:  SADYFRQLVGKPVDELWAEYKTKAKYGNLDKKCQSFKIEAYGVFVLQL

XP_038888599.1 uncharacterized protein LOC120078398 [Benincasa hispida] | e value: 2.7e-119 | % identity: 86.29
Query:  MASNNLIFLLLPSLALLQSIFAVEYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHVS
        MASNNLIF LLPSLALLQ I AVEYTVTN A GTLGGTRFDNIIGANYSRQTLVAAT LIWNIF+QSTAADRK VQK+SLFIDKN++GVA TTN+EIH+ 
Subjt:  MASNNLIFLLLPSLALLQSIFAVEYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHVS

Query:  ANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEGVRSGLVGELNRRLRDGY
        ANYISSYSGDLKRE TGVLYHE+T+IWQW GNS  PGGLIEGIADYVRLKSGYIPG+WVEPGGGN WDEGYDVTARFL+YLEGVRSGLV ELNRRLR+GY
Subjt:  ANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEGVRSGLVGELNRRLRDGY

Query:  SADYFRQLVGKPVDELWAEYKTKAKYGNLDKKCQSFKIEAYGVFVLQL
        SADYFRQ +GKPVDELWAEYKTKAKYG LDKKC SF+IE  GVF LQL
Subjt:  SADYFRQLVGKPVDELWAEYKTKAKYGNLDKKCQSFKIEAYGVFVLQL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K322 Uncharacterized protein | e value: 1.7e-108 | % identity: 79.84
Query:  MASNNLI-FLLLPSLALLQSIFAVEYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHV
        M SNNLI FLLL  LALL+SI AVEYTVTN A GT GG RFD+IIGANYSRQTLVAATALIWNIF+QSTAADRK+V+K+SLFIDKN DGVA   NNEIHV
Subjt:  MASNNLI-FLLLPSLALLQSIFAVEYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHV

Query:  SANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLE-GVRSGLVGELNRRLRD
        SANYISSYSGDLKRE TG+LYHE+T+IWQW+GN   P GLIEGIADYVRLKSGYIPG WVEPGGGN WDEGYDVTARFL+YLE  VRSGLV ELNR++R+
Subjt:  SANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLE-GVRSGLVGELNRRLRD

Query:  GYSADYFRQLVGKPVDELWAEYKTKAKYGNLDKKCQSFKIEAYGVFVL
        GYS DYFRQL+GKPVDELWAEYKTKAK+GN+D KC SF+IE  G  ++
Subjt:  GYSADYFRQLVGKPVDELWAEYKTKAKYGNLDKKCQSFKIEAYGVFVL

A0A1S3BZT8 uncharacterized protein LOC103495268 | e value: 3.3e-107 | % identity: 77.65
Query:  LRSTRTSMASNNLIFLLLPSLALLQSIFAVEYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTT
        L  T  SM SNN IF LL  LALL+SI AVEY V N A GT GG RFD+IIGANYSRQTLVAATALIWNIFRQSTAADRK+VQK+SLFID N D VAF  
Subjt:  LRSTRTSMASNNLIFLLLPSLALLQSIFAVEYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTT

Query:  NNEIHVSANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEGVRSGLVGELN
        NNEIHVSA+YISSYSGDLKRE TG+LYHE+T+I QWSGN   P GLIEGIADYVRLKSGYI   WVEPGGGNRWDEGYDVTARFL+YLE VRSGLV ELN
Subjt:  NNEIHVSANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEGVRSGLVGELN

Query:  RRLRDGYSADYFRQLVGKPVDELWAEYKTKAKYGNLDKKCQSFKIEAYGVFVLQL
        R++R+ YS DYFRQL+GKPVDELW EYKTK K G LD KC++F+IEA GVF LQL
Subjt:  RRLRDGYSADYFRQLVGKPVDELWAEYKTKAKYGNLDKKCQSFKIEAYGVFVLQL

A0A5D3C5V2 NtPRp27-like protein | e value: 3.6e-106 | % identity: 78.63
Query:  MASNNLIFLLLPSLALLQSIFAVEYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHVS
        M SNN IF LL  LALL+SI AVEY V N A GT GG RFD+IIGANYSRQTLVAATALIWNIFRQSTAADRK+VQK+SLFID N D VAF  NNEIHVS
Subjt:  MASNNLIFLLLPSLALLQSIFAVEYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHVS

Query:  ANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEGVRSGLVGELNRRLRDGY
        A+YISSYSGDLKRE TG+LYHE+T+I QWSGN   P GLIEGIADYVRLKSGYI   WVEPGGGNRWDEGYDVTARFL+YLE VRSGLV ELNR++R+ Y
Subjt:  ANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEGVRSGLVGELNRRLRDGY

Query:  SADYFRQLVGKPVDELWAEYKTKAKYGNLDKKCQSFKIEAYGVFVLQL
        S DYFRQL+GKPVDELW EYKTK K G LD KC++F+IEA GVF LQL
Subjt:  SADYFRQLVGKPVDELWAEYKTKAKYGNLDKKCQSFKIEAYGVFVLQL

A0A6J1D406 uncharacterized protein LOC111017379 | e value: 1.6e-98 | % identity: 78.07
Query:  MASNNLIFLLLPSLALLQSIFAVEYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHVS
        MASNN IF LL SLALLQ++ AVEYTVTN A GT GG RFDN IGA+YS QTLVAAT  IWNIF+QSTAADRKNV KVSLFID +YDGVAF +NNEIHV 
Subjt:  MASNNLIFLLLPSLALLQSIFAVEYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHVS

Query:  ANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEGVRSGLVGELNRRLRDGY
        ANYI++Y GDLKRE TGVLYHE+THIWQW+GN + PGGLIEGIADYVRLKSGYIPG+WV PGGG+RWD+GYDVTARFL+YLEG+RSG V ELNRRLR+GY
Subjt:  ANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEGVRSGLVGELNRRLRDGY

Query:  SADYFRQLVGKPVDELWAEYKTKAKYGN
         ADYF QL+GK VD+LWA+Y  KA +GN
Subjt:  SADYFRQLVGKPVDELWAEYKTKAKYGN

A0A6J1GMQ9 uncharacterized protein LOC111455381 | e value: 2.9e-95 | % identity: 75.22
Query:  MASNNLIFLLLPSLALLQSIFAVEYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHVS
        MASN +IF L PSL LLQ++ AVEYTVTN A GTLGG RFDN IG +YS+Q L AAT  IWNIFRQS+ ADRKNVQKVSLFIDKNYDGVAF  N+EIHV 
Subjt:  MASNNLIFLLLPSLALLQSIFAVEYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHVS

Query:  ANYISSYSGDLK--REFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEGVRSGLVGELNRRLRD
        A+YI++Y GDLK     TGVLYHE+THIWQW+GN + PGGLIEGIADYVRLKSGYIPG+WV PGGG+RWD+GYDVTARFL+YLEG+RSG V ELNRRL++
Subjt:  ANYISSYSGDLK--REFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEGVRSGLVGELNRRLRD

Query:  GYSADYFRQLVGKPVDELWAEYKTKAKYGN
        GYSADYF QL+GKPVD LWA+Y  KA +GN
Subjt:  GYSADYFRQLVGKPVDELWAEYKTKAKYGN

SwissProt top hits (e value, % identity, alignment)
C0HJG8 Basic secretory protease (Fragments) | e value: 4.1e-06 | % identity: 45.9
Query:  WDEGYDVTARFLEYLEGVRSGLVGELNRRLRDGYSADYFRQLVGKPVDELWAEYKTKAKYG
        WDEGYDVTARFL+YL  + +G V EL                  K V++LW+EY  KA YG
Subjt:  WDEGYDVTARFLEYLEGVRSGLVGELNRRLRDGYSADYFRQLVGKPVDELWAEYKTKAKYG
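
The % identity column can be recomputed from an alignment's middle line, where a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch. The sketch below applies this to the C0HJG8 alignment above; `percent_identity` is a hypothetical helper, and it assumes identity is taken as identical positions divided by alignment length (this alignment has no gaps).

```python
def percent_identity(midline: str, alignment_length: int) -> float:
    """Percentage of alignment columns whose midline character is a letter."""
    # Letters are identities; '+' (similar) and ' ' (mismatch) are not counted.
    identities = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identities / alignment_length


query = "WDEGYDVTARFLEYLEGVRSGLVGELNRRLRDGYSADYFRQLVGKPVDELWAEYKTKAKYG"
# The 18-column unmatched stretch is written explicitly to keep spacing exact.
midline = "WDEGYDVTARFL+YL  + +G V EL" + " " * 18 + "K V++LW+EY  KA YG"

print(round(percent_identity(midline, len(query)), 2))
# prints: 45.9
```

This reproduces the 45.9 reported for the hit (28 identical positions over 61 aligned columns).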

Arabidopsis top hits (e value, % identity, alignment)
AT2G15130.1 Plant basic secretory protein (BSP) family protein | e value: 2.1e-61 | % identity: 48.65
Query:  IFLLLPSLALLQSIFAVEYTVTNKAGGTLGGTRFDNII-GANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHVSANYIS
        IFL++  +  +  + AV+++V +  G + GG RF N I G +Y  Q+L  AT   W +F+Q+  +DRK+V K++LF++ N +G+A+++ +EIH +A  + 
Subjt:  IFLLLPSLALLQSIFAVEYTVTNKAGGTLGGTRFDNII-GANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHVSANYIS

Query:  SYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEGVRSGLVGELNRRLRDGYSADYF
           G ++R FTGV+YHE+ H WQW+G    PGGLIEGIADYVRLK+GY+  +WV PGGG+RWD+GYDVTARFLEY   +R+G V ELN+++R  Y+  +F
Subjt:  SYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEGVRSGLVGELNRRLRDGYSADYF

Query:  RQLVGKPVDELWAEYKTKAKYG
          L+GK V++LW EY  KA YG
Subjt:  RQLVGKPVDELWAEYKTKAKYG

AT2G15130.2 Plant basic secretory protein (BSP) family protein | e value: 8.4e-47 | % identity: 54.86
Query:  KNYDGVAFTTNNEIHVSANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEG
        +N +G+A+++ +EIH +A  +    G ++R FTGV+YHE+ H WQW+G    PGGLIEGIADYVRLK+GY+  +WV PGGG+RWD+GYDVTARFLEY   
Subjt:  KNYDGVAFTTNNEIHVSANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEG

Query:  VRSGLVGELNRRLRDGYSADYFRQLVGKPVDELWAEYKTKAKYG
        +R+G V ELN+++R  Y+  +F  L+GK V++LW EY  KA YG
Subjt:  VRSGLVGELNRRLRDGYSADYFRQLVGKPVDELWAEYKTKAKYG

AT2G15170.1 Plant basic secretory protein (BSP) family protein | e value: 1.8e-09 | % identity: 35.79
Query:  IFLLLPSLALLQSIFAVEYTVTNKAGGTLGGTRF-DNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTN--NEIH
        IFL++  +  +  + AV++ V +  G + GG +F D I G +Y +Q++ +AT   W +F+Q+   DRK +  ++LFI+ N + VA+ TN   EIH
Subjt:  IFLLLPSLALLQSIFAVEYTVTNKAGGTLGGTRF-DNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTN--NEIH

AT2G15220.1 Plant basic secretory protein (BSP) family protein | e value: 1.3e-68 | % identity: 53.15
Query:  IFLLLPSLALLQSIFAVEYTVTNKAGGTLGGTRFDNIIGA-NYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHVSANYIS
        IF ++  + ++  + AV+Y+V + +G + GG RF   IG  +Y  QTL +AT  +W +F+Q+  +DRK+V K++LF++ N DGVA+ + NEIH +  Y++
Subjt:  IFLLLPSLALLQSIFAVEYTVTNKAGGTLGGTRFDNIIGA-NYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVAFTTNNEIHVSANYIS

Query:  SYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEGVRSGLVGELNRRLRDGYSADYF
          SGD+KREFTGV+YHE+ H WQW+G    PGGLIEGIADYVRLK+GY P +WV PG G+RWD+GYDVTARFL+Y  G+R+G V ELN+++R+GYS  +F
Subjt:  SYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEGVRSGLVGELNRRLRDGYSADYF

Query:  RQLVGKPVDELWAEYKTKAKYG
          L+GK V++LW EY  KAKYG
Subjt:  RQLVGKPVDELWAEYKTKAKYG


Sequences
CDS sequence
ATGACCAAATTAAAGCTTCAAAAGCTCAGATCCACACGTACGTCCATGGCTTCCAATAACTTAATCTTCTTACTCTTGCCCTCTCTCGCTCTCCTACAATCCATC
TTCGCCGTAGAGTACACAGTCACCAACAAGGCCGGTGGCACCCTCGGCGGCACCCGGTTCGACAACATAATCGGAGCGAATTACAGCCGGCAAACGCTTGTTGCT
GCCACTGCTTTAATATGGAACATTTTTCGGCAAAGTACTGCCGCCGACCGGAAGAACGTGCAGAAGGTCAGCCTGTTTATTGATAAAAACTACGATGGAGTGGCG
TTCACTACCAACAACGAGATTCATGTCAGCGCGAACTATATCTCAAGCTATAGCGGGGATTTGAAAAGGGAGTTCACAGGAGTGTTGTACCACGAAATAACCCAT
ATCTGGCAGTGGAGTGGGAATTCGACGGTGCCCGGCGGGCTGATTGAAGGGATCGCGGATTACGTACGGCTGAAGTCGGGGTATATTCCGGGGAACTGGGTGGAG
CCGGGTGGCGGGAACAGGTGGGACGAAGGCTACGACGTGACGGCGAGATTTTTGGAGTATTTAGAGGGGGTGAGAAGTGGGTTGGTGGGGGAGCTGAACCGGAGG
CTGAGAGATGGCTACTCTGCCGATTACTTCCGGCAGCTGGTGGGGAAGCCGGTGGATGAGCTGTGGGCTGAGTATAAGACAAAGGCCAAGTATGGGAATCTTGAC
AAGAAATGCCAAAGTTTTAAAATTGAAGCCTATGGGGTTTTTGTCCTACAGCTGTAA
mRNA sequence
CAAAAGGTTACTTTTCCAACTTTCAAAAACACATTTCCAAAAACAAATTATAGAGTAGTTTCCAAATTTTGGACGACAAAGTCAAGACAACACAATTTGACAATT
CTTCCATTTTCCTAAATTGAAATCAATGACCAAATTAAAGCTTCAAAAGCTCAGATCCACACGTACGTCCATGGCTTCCAATAACTTAATCTTCTTACTCTTGCC
CTCTCTCGCTCTCCTACAATCCATCTTCGCCGTAGAGTACACAGTCACCAACAAGGCCGGTGGCACCCTCGGCGGCACCCGGTTCGACAACATAATCGGAGCGAA
TTACAGCCGGCAAACGCTTGTTGCTGCCACTGCTTTAATATGGAACATTTTTCGGCAAAGTACTGCCGCCGACCGGAAGAACGTGCAGAAGGTCAGCCTGTTTAT
TGATAAAAACTACGATGGAGTGGCGTTCACTACCAACAACGAGATTCATGTCAGCGCGAACTATATCTCAAGCTATAGCGGGGATTTGAAAAGGGAGTTCACAGG
AGTGTTGTACCACGAAATAACCCATATCTGGCAGTGGAGTGGGAATTCGACGGTGCCCGGCGGGCTGATTGAAGGGATCGCGGATTACGTACGGCTGAAGTCGGG
GTATATTCCGGGGAACTGGGTGGAGCCGGGTGGCGGGAACAGGTGGGACGAAGGCTACGACGTGACGGCGAGATTTTTGGAGTATTTAGAGGGGGTGAGAAGTGG
GTTGGTGGGGGAGCTGAACCGGAGGCTGAGAGATGGCTACTCTGCCGATTACTTCCGGCAGCTGGTGGGGAAGCCGGTGGATGAGCTGTGGGCTGAGTATAAGAC
AAAGGCCAAGTATGGGAATCTTGACAAGAAATGCCAAAGTTTTAAAATTGAAGCCTATGGGGTTTTTGTCCTACAGCTGTAAGAAGAATTAAAGTGTTTGGGATT
CAAAAGTCTAACATGTATGTGTACTTTTAGGTTTCCATTTGGATTGGTGTCTCATGTAATCAATTCATTATTGATTTATTGCATCAGAGACCAAAGTATTTGATA
GTGA
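
Because the mRNA sequence contains the CDS flanked by untranslated regions, the UTRs can be recovered by locating the CDS inside the transcript. This is a rough sketch with a hypothetical helper, `split_transcript`; it assumes the CDS appears as one contiguous substring of the mRNA. The example uses only a short excerpt around the start codon, not the full sequences.

```python
def split_transcript(mrna: str, cds: str) -> tuple[str, str, str]:
    """Return (5' UTR, CDS, 3' UTR) by locating the CDS inside the mRNA."""
    start = mrna.find(cds)
    if start == -1:
        raise ValueError("CDS not found as a contiguous substring of the mRNA")
    end = start + len(cds)
    return mrna[:start], mrna[start:end], mrna[end:]


# Short excerpts copied from the sequences in this record (not the full sequences).
mrna_excerpt = "TTGAAATCAATGACCAAATTAAAG"
cds_excerpt = "ATGACCAAATTAAAG"

five_prime, coding, three_prime = split_transcript(mrna_excerpt, cds_excerpt)
print(five_prime, coding, repr(three_prime))
# prints: TTGAAATCA ATGACCAAATTAAAG ''
```

Passing the full mRNA and CDS strings, with line breaks removed, would return the complete 5' and 3' UTRs.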
Protein sequence
MTKLKLQKLRSTRTSMASNNLIFLLLPSLALLQSIFAVEYTVTNKAGGTLGGTRFDNIIGANYSRQTLVAATALIWNIFRQSTAADRKNVQKVSLFIDKNYDGVA
FTTNNEIHVSANYISSYSGDLKREFTGVLYHEITHIWQWSGNSTVPGGLIEGIADYVRLKSGYIPGNWVEPGGGNRWDEGYDVTARFLEYLEGVRSGLVGELNRR
LRDGYSADYFRQLVGKPVDELWAEYKTKAKYGNLDKKCQSFKIEAYGVFVLQL
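
The CDS and protein sequences in this record can be cross-checked by translating the CDS. The sketch below is one way to do that with the standard genetic code; the record does not say which translation table was used, so the standard code is an assumption, and `translate` is a hypothetical helper rather than a database function.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First nine codons of the CDS in this record.
print(translate("ATGACCAAATTAAAGCTTCAAAAGCTC"))
# prints: MTKLKLQKL
```

The output matches the first nine residues of the protein sequence; translating the full CDS and comparing it with the full protein string would complete the check.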