; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg038632 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg038632
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionBet_v_1 domain-containing protein
Genome locationscaffold12:2678770..2685262
RNA-Seq ExpressionSpg038632
SyntenySpg038632
Gene Ontology termsGO:0006952 - defense response (biological process)
InterPro domainsIPR000916 - Bet v I/Major latex protein
IPR023393 - START-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8650605.1 hypothetical protein Csa_010824 [Cucumis sativus]9.3e-6376.16Show/hide
Query:  EMAQIAKISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRLLVDEPKKTIIYECLEGDLFKDFEVF
        +MAQIAKI+E+VQLKSSG KF+EF KNKMD+FP +F G+VESYKFVEGNSFTHGS+S WKYD GFG + EVK++LLVDE  KTIIYECLEGDLFKDF++F
Subjt:  EMAQIAKISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRLLVDEPKKTIIYECLEGDLFKDFEVF

Query:  QVRIQVSDGGNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHL
        +V+I+V+DGG++GNSSVNWCLEFVK+NENV PP +YLQFGVK+CK+VDA+L
Subjt:  QVRIQVSDGGNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHL

XP_004145832.1 MLP-like protein 34 [Cucumis sativus]1.2e-6276.67Show/hide
Query:  MAQIAKISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRLLVDEPKKTIIYECLEGDLFKDFEVFQ
        MAQIAKI+E+VQLKSSG KF+EF KNKMD+FP +F G+VESYKFVEGNSFTHGS+S WKYD GFG + EVK++LLVDE  KTIIYECLEGDLFKDF++F+
Subjt:  MAQIAKISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRLLVDEPKKTIIYECLEGDLFKDFEVFQ

Query:  VRIQVSDGGNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHL
        V+I+V+DGG++GNSSVNWCLEFVK+NENV PP +YLQFGVK+CK+VDA+L
Subjt:  VRIQVSDGGNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHL

XP_004151401.1 MLP-like protein 28 [Cucumis sativus]1.0e-6479.33Show/hide
Query:  MAQIAKISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRLLVDEPKKTIIYECLEGDLFKDFEVFQ
        MAQIAKIS+QVQLK  G KFY+F KNKMDH P VFP   ESYK VEGNS THGS+S WKYD+GFGSS EVK+++LVDEP KTIIYECLEGDLFKDF++F 
Subjt:  MAQIAKISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRLLVDEPKKTIIYECLEGDLFKDFEVFQ

Query:  VRIQVSDGGNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHL
        V+I+V+DGGNNGNSSVNWCLE+VKANENVDPP NYLQFG+KLCKNVDA L
Subjt:  VRIQVSDGGNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHL

XP_022143420.1 uncharacterized protein LOC111013301 [Momordica charantia]1.5e-6581.58Show/hide
Query:  MAQIAKISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRLLVDEPKKTIIYECLEGDLFKDFEVFQ
        MAQIAKISEQVQLKS GHKFYEFLKNKMD+FP +FPG++ESYKF EGNSFTHGSISHWKYD G GSS EVK+RL+VDEP KTIIYECLEGDLFKDFE+FQ
Subjt:  MAQIAKISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRLLVDEPKKTIIYECLEGDLFKDFEVFQ

Query:  VRIQVSDG-GNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHLC
        V+I+VSDG GNNG SSV W LEFVKANE V PP++YLQ GVK+CK+VDA LC
Subjt:  VRIQVSDG-GNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHLC

XP_038880356.1 MLP-like protein 34 [Benincasa hispida]1.9e-7182.58Show/hide
Query:  MAQIAKISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRLLVDEPKKTIIYECLEGDLFKDFEVFQ
        MAQIAKISE+VQLKSSG +FY+F KNKMD+FP +FPG+VESYKFVEGNSFTHGS+S WKYD+GFGSS EVKV+LLVDEP KTIIYECLEGDLFKDFE+F 
Subjt:  MAQIAKISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRLLVDEPKKTIIYECLEGDLFKDFEVFQ

Query:  VRIQVSDGGNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHLCKELN
        V+I+V+DGGNNGNSSVNWC+EFVKANENV PP +YLQFGVK+CK+VDAHLCK+LN
Subjt:  VRIQVSDGGNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHLCKELN

TrEMBL top hitse value%identityAlignment
A0A0A0L7N6 Bet_v_1 domain-containing protein4.8e-6579.33Show/hide
Query:  MAQIAKISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRLLVDEPKKTIIYECLEGDLFKDFEVFQ
        MAQIAKIS+QVQLK  G KFY+F KNKMDH P VFP   ESYK VEGNS THGS+S WKYD+GFGSS EVK+++LVDEP KTIIYECLEGDLFKDF++F 
Subjt:  MAQIAKISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRLLVDEPKKTIIYECLEGDLFKDFEVFQ

Query:  VRIQVSDGGNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHL
        V+I+V+DGGNNGNSSVNWCLE+VKANENVDPP NYLQFG+KLCKNVDA L
Subjt:  VRIQVSDGGNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHL

A0A1S3C4Q2 MLP-like protein 282.4e-5672.19Show/hide
Query:  MAQIAKISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRLLVDEPKKTIIYECLEGDLFKDFEVFQ
        MAQIA+ISEQVQLK  G KFY+F +NKMDH P +FP   ESYK VEGNS THG +S WK ++GFG S EVK++LLVDE  KTIIYECLEGDLFKDF++F 
Subjt:  MAQIAKISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRLLVDEPKKTIIYECLEGDLFKDFEVFQ

Query:  VRIQVSDGGNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHLC
        V+I+V+D GNNGNSSVNWCLE+VKAN+NVDPP NYLQFG+KL K +DA LC
Subjt:  VRIQVSDGGNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHLC

A0A1S3CQL5 MLP-like protein 287.7e-6374.17Show/hide
Query:  MAQIAKISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRLLVDEPKKTIIYECLEGDLFKDFEVFQ
        MAQIAKI+E+VQLKSSG KF+EF KNK D+FP +FPG+++SYKFVEGNSF+HGS+S WKYD GFG + EVKV+LL+DE  KTIIYECLEGDLFKDF++F+
Subjt:  MAQIAKISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRLLVDEPKKTIIYECLEGDLFKDFEVFQ

Query:  VRIQVSDGGNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHLC
        V+I+V+DGG++GNSSVNWCLEFVK+NENV PP +YLQFG K+CK+VDA+LC
Subjt:  VRIQVSDGGNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHLC

A0A5A7UCZ9 MLP-like protein 287.7e-6374.17Show/hide
Query:  MAQIAKISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRLLVDEPKKTIIYECLEGDLFKDFEVFQ
        MAQIAKI+E+VQLKSSG KF+EF KNK D+FP +FPG+++SYKFVEGNSF+HGS+S WKYD GFG + EVKV+LL+DE  KTIIYECLEGDLFKDF++F+
Subjt:  MAQIAKISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRLLVDEPKKTIIYECLEGDLFKDFEVFQ

Query:  VRIQVSDGGNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHLC
        V+I+V+DGG++GNSSVNWCLEFVK+NENV PP +YLQFG K+CK+VDA+LC
Subjt:  VRIQVSDGGNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHLC

A0A6J1CNR9 uncharacterized protein LOC1110133017.4e-6681.58Show/hide
Query:  MAQIAKISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRLLVDEPKKTIIYECLEGDLFKDFEVFQ
        MAQIAKISEQVQLKS GHKFYEFLKNKMD+FP +FPG++ESYKF EGNSFTHGSISHWKYD G GSS EVK+RL+VDEP KTIIYECLEGDLFKDFE+FQ
Subjt:  MAQIAKISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRLLVDEPKKTIIYECLEGDLFKDFEVFQ

Query:  VRIQVSDG-GNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHLC
        V+I+VSDG GNNG SSV W LEFVKANE V PP++YLQ GVK+CK+VDA LC
Subjt:  VRIQVSDG-GNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHLC

SwissProt top hitse value%identityAlignment
P19825 Major latex protein 151.2e-1227.03Show/hide
Query:  IAKISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRLLVDEPKKTIIYECLEGDLFKDFEVFQVRI
        + K+  + ++  +  K+Y+  K+  D  P   P    S K VEG+  T G +  W Y    G    VK +   ++  +TI +  +EG +  D++ F   +
Subjt:  IAKISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRLLVDEPKKTIIYECLEGDLFKDFEVFQVRI

Query:  QVSDGGNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHLC
         V    N   S V W +++ K NE+   P++YL F  +  +++++HLC
Subjt:  QVSDGGNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHLC

Q06394 Major latex protein 1466.0e-1226.35Show/hide
Query:  IAKISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRLLVDEPKKTIIYECLEGDLFKDFEVFQVRI
        + K+  + ++  +  K+Y+  K+  D  P V P    S K VEG+  T G +  W Y    G     K +   ++  +TI +  + GDL  D++ F   +
Subjt:  IAKISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRLLVDEPKKTIIYECLEGDLFKDFEVFQVRI

Query:  QVSDGGNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHLC
         V+   N     V W +++ K NE+   P+ YL    ++ +++ +HLC
Subjt:  QVSDGGNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHLC

Q06395 Major latex protein 1491.3e-1127.03Show/hide
Query:  IAKISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRLLVDEPKKTIIYECLEGDLFKDFEVFQVRI
        + K+  + ++  +  K+Y+  K+  D  P   P  V S K VEG+  T G +  W Y    G +   K +   ++  +TI +   EGDL  D++ F   +
Subjt:  IAKISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRLLVDEPKKTIIYECLEGDLFKDFEVFQVRI

Query:  QVSDGGNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHLC
         V    N   S V + L++ K NE+   P +YL    +  ++++ +LC
Subjt:  QVSDGGNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHLC

Q941R6 MLP-like protein 318.9e-1630.87Show/hide
Query:  KISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRLLVDEPKKTII-YECLEGDLFKDFEVFQVRIQ
        K+   +++K+S  KF+     +  H     PG ++  +  EG+    GSI  W Y    G +   K R+   EP+K +I +  +EGDL K+++ F + IQ
Subjt:  KISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRLLVDEPKKTII-YECLEGDLFKDFEVFQVRIQ

Query:  VSDGGNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHLCKE
        V+       S V+W +E+ K ++ V  P  +L F V++ K +D HL  E
Subjt:  VSDGGNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHLCKE

Q9SSK7 MLP-like protein 342.5e-1833.55Show/hide
Query:  EMAQIAKISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRLLVDEPKKTII-YECLEGDLFKDFEV
        E+     +  +V++K+S  KF+     K  H     PG+++S    EG+  T GSI  W Y    G +   K R+   +P+K +I +  +EGDL K+++ 
Subjt:  EMAQIAKISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRLLVDEPKKTII-YECLEGDLFKDFEV

Query:  FQVRIQVSDGGNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHLCKE
        F + IQV+       S V+W  E+ K NE V  P   LQF V++ K +D HL  E
Subjt:  FQVRIQVSDGGNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHLCKE

Arabidopsis top hitse value%identityAlignment
AT1G23130.1 Polyketide cyclase/dehydrase and lipid transport superfamily protein9.8e-1026.67Show/hide
Query:  KISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRL-LVDEPKKTIIYECLEGDLFKDFEVFQVRIQ
        ++   +++K+S  KF+  L  +        P D++     EG     GS+  W Y    G     + R+  VD+ K  ++   ++GDL K+F+ F V IQ
Subjt:  KISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRL-LVDEPKKTIIYECLEGDLFKDFEVFQVRIQ

Query:  VSDGGNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHLCKEL
         +       S V   L++ + +E V PP   L+   KL +++D  L  E+
Subjt:  VSDGGNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHLCKEL

AT1G70840.1 MLP-like protein 316.3e-1730.87Show/hide
Query:  KISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRLLVDEPKKTII-YECLEGDLFKDFEVFQVRIQ
        K+   +++K+S  KF+     +  H     PG ++  +  EG+    GSI  W Y    G +   K R+   EP+K +I +  +EGDL K+++ F + IQ
Subjt:  KISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRLLVDEPKKTII-YECLEGDLFKDFEVFQVRIQ

Query:  VSDGGNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHLCKE
        V+       S V+W +E+ K ++ V  P  +L F V++ K +D HL  E
Subjt:  VSDGGNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHLCKE

AT1G70850.1 MLP-like protein 341.8e-1933.55Show/hide
Query:  EMAQIAKISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRLLVDEPKKTII-YECLEGDLFKDFEV
        E+     +  +V++K+S  KF+     K  H     PG+++S    EG+  T GSI  W Y    G +   K R+   +P+K +I +  +EGDL K+++ 
Subjt:  EMAQIAKISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRLLVDEPKKTII-YECLEGDLFKDFEV

Query:  FQVRIQVSDGGNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHLCKE
        F + IQV+       S V+W  E+ K NE V  P   LQF V++ K +D HL  E
Subjt:  FQVRIQVSDGGNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHLCKE

AT1G70850.3 MLP-like protein 341.8e-1933.55Show/hide
Query:  EMAQIAKISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRLLVDEPKKTII-YECLEGDLFKDFEV
        E+     +  +V++K+S  KF+     K  H     PG+++S    EG+  T GSI  W Y    G +   K R+   +P+K +I +  +EGDL K+++ 
Subjt:  EMAQIAKISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRLLVDEPKKTII-YECLEGDLFKDFEV

Query:  FQVRIQVSDGGNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHLCKE
        F + IQV+       S V+W  E+ K NE V  P   LQF V++ K +D HL  E
Subjt:  FQVRIQVSDGGNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHLCKE

AT5G28010.1 Polyketide cyclase/dehydrase and lipid transport superfamily protein2.4e-1631.79Show/hide
Query:  IAKISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRL-LVDEPKKTIIYECLEGDLFKDFEVFQVR
        + K+  +V++K+    FY     +  H     P +V+S    +G   T GSI +W Y    G +   K R+ LV+  KK I +  +EGD+  +++ F + 
Subjt:  IAKISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRL-LVDEPKKTIIYECLEGDLFKDFEVFQVR

Query:  IQVSDGGNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHLCKE
        IQV+       S V W +E+ K +ENV  P N L F  ++ K +D HL  E
Subjt:  IQVSDGGNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHLCKE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTATGATGCGTTTGGTAGTTTTCACTTGATTGATAAAGATTACATGGAATGGAACAATAAAGATCTTGTGATTTTGGATCACTCTGATGCACAAGGAGCTGATGAGGA
CAACCGGGCAGAGATAGGACCAGGAAATCGATCCAGAGGAAGACCAGACCAAAGGCTTATATATTGCGAGGTCTCCACTAATCTTTTCAATGTGGGATCCTCAACAGTTT
CAACACGACCTCGGCCCGAGGTCGAGCTCGTTCGCCTCCGTTTGGTCCCTGCTGCCTCTGGCCGCCCCGGTTTCGCCTGGTTTGTTCCAAAACGCCTCCGAATTCCTAAA
AACCCTAGGAGCATGAGCAGGTATTTAAACCCCTCTTCGTCACTGAAGAAGGGATCCCGAACTTTATTCTCTACTCTCTCCTCTTGCTCTCTTGCTCTCCTCCTTCCGTT
TTCTGACTTAAGCACCGGAGGCGGTGTGGCAAGCACCACACCGGTGTGCAGGGTTTTTAGGAATTCGGAGGCGTTTCGGGACGAACCAGGCGGAACCGGGGCGGCCAGAG
ACATCAGGGACCGAAAGGAGGCGACCGAGCTCGGCTCGCGCAAGTGGGCCGAATGGTCGGGCCAAGTATGCCCGGCCCTTTGGTCGGTTCTTCCTCTGGATCGGTCTCCT
AGTCCTATTTCTGTCCGGATGTCCTCGTCAGCTCCTTGTGCATCGAGGTGGTCCAAAATTACCTATAACAGAACAGATCTTAAAAGTGCTTTGGAAATGGCTCAGATTGC
TAAGATCTCAGAGCAAGTGCAGCTGAAGTCATCTGGTCACAAATTCTATGAGTTTTTAAAGAACAAAATGGATCATTTTCCTCTAGTGTTCCCTGGAGATGTTGAGAGCT
ATAAGTTTGTGGAAGGAAATAGTTTCACTCATGGTAGCATCTCCCATTGGAAATACGACTGGGGTTTCGGTAGCTCGACAGAGGTAAAGGTGAGGCTACTAGTGGATGAG
CCTAAGAAAACAATAATTTACGAGTGTCTTGAAGGAGATTTGTTCAAAGATTTCGAAGTATTCCAAGTGAGAATTCAAGTCAGTGATGGTGGCAACAATGGCAATAGCTC
AGTCAATTGGTGTTTGGAATTTGTGAAAGCAAATGAAAATGTGGATCCACCATATAACTATCTCCAATTTGGAGTTAAATTGTGCAAAAATGTGGATGCTCACCTTTGCA
AAGAATTGAACTAA
mRNA sequenceShow/hide mRNA sequence
ATGTATGATGCGTTTGGTAGTTTTCACTTGATTGATAAAGATTACATGGAATGGAACAATAAAGATCTTGTGATTTTGGATCACTCTGATGCACAAGGAGCTGATGAGGA
CAACCGGGCAGAGATAGGACCAGGAAATCGATCCAGAGGAAGACCAGACCAAAGGCTTATATATTGCGAGGTCTCCACTAATCTTTTCAATGTGGGATCCTCAACAGTTT
CAACACGACCTCGGCCCGAGGTCGAGCTCGTTCGCCTCCGTTTGGTCCCTGCTGCCTCTGGCCGCCCCGGTTTCGCCTGGTTTGTTCCAAAACGCCTCCGAATTCCTAAA
AACCCTAGGAGCATGAGCAGGTATTTAAACCCCTCTTCGTCACTGAAGAAGGGATCCCGAACTTTATTCTCTACTCTCTCCTCTTGCTCTCTTGCTCTCCTCCTTCCGTT
TTCTGACTTAAGCACCGGAGGCGGTGTGGCAAGCACCACACCGGTGTGCAGGGTTTTTAGGAATTCGGAGGCGTTTCGGGACGAACCAGGCGGAACCGGGGCGGCCAGAG
ACATCAGGGACCGAAAGGAGGCGACCGAGCTCGGCTCGCGCAAGTGGGCCGAATGGTCGGGCCAAGTATGCCCGGCCCTTTGGTCGGTTCTTCCTCTGGATCGGTCTCCT
AGTCCTATTTCTGTCCGGATGTCCTCGTCAGCTCCTTGTGCATCGAGGTGGTCCAAAATTACCTATAACAGAACAGATCTTAAAAGTGCTTTGGAAATGGCTCAGATTGC
TAAGATCTCAGAGCAAGTGCAGCTGAAGTCATCTGGTCACAAATTCTATGAGTTTTTAAAGAACAAAATGGATCATTTTCCTCTAGTGTTCCCTGGAGATGTTGAGAGCT
ATAAGTTTGTGGAAGGAAATAGTTTCACTCATGGTAGCATCTCCCATTGGAAATACGACTGGGGTTTCGGTAGCTCGACAGAGGTAAAGGTGAGGCTACTAGTGGATGAG
CCTAAGAAAACAATAATTTACGAGTGTCTTGAAGGAGATTTGTTCAAAGATTTCGAAGTATTCCAAGTGAGAATTCAAGTCAGTGATGGTGGCAACAATGGCAATAGCTC
AGTCAATTGGTGTTTGGAATTTGTGAAAGCAAATGAAAATGTGGATCCACCATATAACTATCTCCAATTTGGAGTTAAATTGTGCAAAAATGTGGATGCTCACCTTTGCA
AAGAATTGAACTAA
Protein sequenceShow/hide protein sequence
MYDAFGSFHLIDKDYMEWNNKDLVILDHSDAQGADEDNRAEIGPGNRSRGRPDQRLIYCEVSTNLFNVGSSTVSTRPRPEVELVRLRLVPAASGRPGFAWFVPKRLRIPK
NPRSMSRYLNPSSSLKKGSRTLFSTLSSCSLALLLPFSDLSTGGGVASTTPVCRVFRNSEAFRDEPGGTGAARDIRDRKEATELGSRKWAEWSGQVCPALWSVLPLDRSP
SPISVRMSSSAPCASRWSKITYNRTDLKSALEMAQIAKISEQVQLKSSGHKFYEFLKNKMDHFPLVFPGDVESYKFVEGNSFTHGSISHWKYDWGFGSSTEVKVRLLVDE
PKKTIIYECLEGDLFKDFEVFQVRIQVSDGGNNGNSSVNWCLEFVKANENVDPPYNYLQFGVKLCKNVDAHLCKELN