CuGenDBv2

Sgr000140 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr000140
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: CASP-like protein
Genome location: tig00000058:56602..57318
RNA-Seq Expression: Sgr000140
Synteny: Sgr000140
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: NA
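
The genome location field packs a contig name and a coordinate range into one string. A minimal sketch of how it could be unpacked is shown here. The helper names `parse_location` and `span_length` are hypothetical, and the 1-based inclusive coordinate convention is an assumption, since the record does not state it.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


contig, start, end = parse_location("tig00000058:56602..57318")
print(contig, start, end, span_length(start, end))
```

Under the inclusive assumption the span for this gene is 717 bp.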


Homology
GenBank top hits | e value | % identity | Alignment (shown below each hit)
KAG6592153.1 hypothetical protein SDJN03_14499, partial [Cucurbita argyrosperma subsp. sororia] | 1.6e-67 | 64.1
Query:  MDVKQKDLQFLGAFDIFRETHKIINRNRKIFAMAALLFIHPLNFIYSGFMGTLNLLLGNLLHHVSVANQSGDLREYYGNLSDLFSHKWPLFWPFNVFYII
        MD+  KD+QF GAF IF+ET KIIN+NR+IFAMAAL FIHPLN++ SGFM TLN LL NL                YGN+S LFSH W LFWPF VF I 
Subjt:  MDVKQKDLQFLGAFDIFRETHKIINRNRKIFAMAALLFIHPLNFIYSGFMGTLNLLLGNLLHHVSVANQSGDLREYYGNLSDLFSHKWPLFWPFNVFYII

Query:  SLFYFSVVSTAGVAYTVACVYTGRETSPKNTMSVVAKVWKRLLVTFLCVLLAFLAYNIIAGFALFLIIWTTILKHGPFGKVDGSIFAVFLVFYFVGLSYL
         LF  S++STAGV+ TV  +Y+GRE S K+TMSVV KVWKR+LVTFLCV+LAFLAY+IIAGF LFLI+W       PFGKVDG+I  VFL+ Y +GL YL
Subjt:  SLFYFSVVSTAGVAYTVACVYTGRETSPKNTMSVVAKVWKRLLVTFLCVLLAFLAYNIIAGFALFLIIWTTILKHGPFGKVDGSIFAVFLVFYFVGLSYL

Query:  VVVLQLSGVVSALEEGSYGFKAMAKSRSLVKGKM
        +++ QL+GVVSALEE S GFKAMAKSR L+KGKM
Subjt:  VVVLQLSGVVSALEEGSYGFKAMAKSRSLVKGKM

KAG7025017.1 hypothetical protein SDJN02_13838, partial [Cucurbita argyrosperma subsp. argyrosperma] | 1.6e-67 | 64.1
Query:  MDVKQKDLQFLGAFDIFRETHKIINRNRKIFAMAALLFIHPLNFIYSGFMGTLNLLLGNLLHHVSVANQSGDLREYYGNLSDLFSHKWPLFWPFNVFYII
        MD+  KD+QF GAF IF+ET KIIN+NR+IFAMAAL FIHPLN++ SGFM TLN LL NL                YGN+S LFSH W LFWPF VF I 
Subjt:  MDVKQKDLQFLGAFDIFRETHKIINRNRKIFAMAALLFIHPLNFIYSGFMGTLNLLLGNLLHHVSVANQSGDLREYYGNLSDLFSHKWPLFWPFNVFYII

Query:  SLFYFSVVSTAGVAYTVACVYTGRETSPKNTMSVVAKVWKRLLVTFLCVLLAFLAYNIIAGFALFLIIWTTILKHGPFGKVDGSIFAVFLVFYFVGLSYL
         LF  S++STAGV+ TV  +Y+GRE S K+TMSVV KVWKR+LVTFLCV+LAFLAY+IIAGF LFLI+W       PFGKVDG+I  VFL+ Y +GL YL
Subjt:  SLFYFSVVSTAGVAYTVACVYTGRETSPKNTMSVVAKVWKRLLVTFLCVLLAFLAYNIIAGFALFLIIWTTILKHGPFGKVDGSIFAVFLVFYFVGLSYL

Query:  VVVLQLSGVVSALEEGSYGFKAMAKSRSLVKGKM
        +++ QL+GVVSALEE S GFKAMAKSR L+KGKM
Subjt:  VVVLQLSGVVSALEEGSYGFKAMAKSRSLVKGKM

XP_022936737.1 uncharacterized protein LOC111443241 [Cucurbita moschata] | 1.4e-66 | 63.25
Query:  MDVKQKDLQFLGAFDIFRETHKIINRNRKIFAMAALLFIHPLNFIYSGFMGTLNLLLGNLLHHVSVANQSGDLREYYGNLSDLFSHKWPLFWPFNVFYII
        MD+  KD+QF GAF IF+ET KIIN+NR+IFAMAAL FIHPLN++ SGFM TLN LL NL                YGN+S LFSH W LFWPF VFYII
Subjt:  MDVKQKDLQFLGAFDIFRETHKIINRNRKIFAMAALLFIHPLNFIYSGFMGTLNLLLGNLLHHVSVANQSGDLREYYGNLSDLFSHKWPLFWPFNVFYII

Query:  SLFYFSVVSTAGVAYTVACVYTGRETSPKNTMSVVAKVWKRLLVTFLCVLLAFLAYNIIAGFALFLIIWTTILKHGPFGKVDGSIFAVFLVFYFVGLSYL
         +F +S+ STAGV+  VA +Y+GRE S K+ MSVVAKVWKR+LVTFLCV+L FLAY+IIAGF LFLI+W        FGK DG+  A+FL+ Y +GL YL
Subjt:  SLFYFSVVSTAGVAYTVACVYTGRETSPKNTMSVVAKVWKRLLVTFLCVLLAFLAYNIIAGFALFLIIWTTILKHGPFGKVDGSIFAVFLVFYFVGLSYL

Query:  VVVLQLSGVVSALEEGSYGFKAMAKSRSLVKGKM
        +V+LQL+GVVSALEE S GF+AMAKSR L+KGKM
Subjt:  VVVLQLSGVVSALEEGSYGFKAMAKSRSLVKGKM

XP_022975879.1 uncharacterized protein LOC111476451 [Cucurbita maxima] | 5.1e-61 | 61.54
Query:  MDVKQKDLQFLGAFDIFRETHKIINRNRKIFAMAALLFIHPLNFIYSGFMGTLNLLLGNLLHHVSVANQSGDLREYYGNLSDLFSHKWPLFWPFNVFYII
        MD+K KD+QF GAF+IF+ET KIIN+NRKIFAMAAL FIHPLN++ SGFM TL+ LL +L              + YGN+S LFS       PFNVF II
Subjt:  MDVKQKDLQFLGAFDIFRETHKIINRNRKIFAMAALLFIHPLNFIYSGFMGTLNLLLGNLLHHVSVANQSGDLREYYGNLSDLFSHKWPLFWPFNVFYII

Query:  SLFYFSVVSTAGVAYTVACVYTGRETSPKNTMSVVAKVWKRLLVTFLCVLLAFLAYNIIAGFALFLIIWTTILKHGPFGKVDGSIFAVFLVFYFVGLSYL
         +F +S+ STAGV+ TVA +Y+ RE S K+TMSVVAKVWKR+LVTFLCV+L FLAY+IIAGF LFLI+W        FGK DG+  A+FL+ Y +GL YL
Subjt:  SLFYFSVVSTAGVAYTVACVYTGRETSPKNTMSVVAKVWKRLLVTFLCVLLAFLAYNIIAGFALFLIIWTTILKHGPFGKVDGSIFAVFLVFYFVGLSYL

Query:  VVVLQLSGVVSALEEGSYGFKAMAKSRSLVKGKM
        +V+LQL+GVVSALEE S GF+AMAKSR L+KGKM
Subjt:  VVVLQLSGVVSALEEGSYGFKAMAKSRSLVKGKM

XP_023535683.1 uncharacterized protein LOC111797044 [Cucurbita pepo subsp. pepo] | 2.6e-65 | 62.39
Query:  MDVKQKDLQFLGAFDIFRETHKIINRNRKIFAMAALLFIHPLNFIYSGFMGTLNLLLGNLLHHVSVANQSGDLREYYGNLSDLFSHKWPLFWPFNVFYII
        MD+  KD+QF GAF+IF+ET KIIN+N KIF+MAAL FIHPLN++ SGFM TLN LL NL                YGN+S LFSH W LFWPF VF I 
Subjt:  MDVKQKDLQFLGAFDIFRETHKIINRNRKIFAMAALLFIHPLNFIYSGFMGTLNLLLGNLLHHVSVANQSGDLREYYGNLSDLFSHKWPLFWPFNVFYII

Query:  SLFYFSVVSTAGVAYTVACVYTGRETSPKNTMSVVAKVWKRLLVTFLCVLLAFLAYNIIAGFALFLIIWTTILKHGPFGKVDGSIFAVFLVFYFVGLSYL
         LF  S++STAGV+ TV  +Y+GRE S K+TMSVV KVWKR+LVTFLCV L FLAY+IIAGF LFLI+W       PFGKVDG+  A+FL+ Y +GL YL
Subjt:  SLFYFSVVSTAGVAYTVACVYTGRETSPKNTMSVVAKVWKRLLVTFLCVLLAFLAYNIIAGFALFLIIWTTILKHGPFGKVDGSIFAVFLVFYFVGLSYL

Query:  VVVLQLSGVVSALEEGSYGFKAMAKSRSLVKGKM
        +++ QL+GVVSALEE S GF+AMAKSR L+KGKM
Subjt:  VVVLQLSGVVSALEEGSYGFKAMAKSRSLVKGKM

TrEMBL top hits | e value | % identity | Alignment (shown below each hit)
A0A1S3CMT6 uncharacterized protein LOC103502262 | 5.3e-56 | 58.72
Query:  MDVKQKDLQFLGAFDIFRETHKIINRNRKIFAMAALLFIHPLNFIYSGFMGTLNLLLGNLLHHVSVANQSGDLREYYGNLSDLFSHKW-PLFWPFNVFYI
        MD K K++QFLGAF IF ET+KII++N+KIFAM+AL FIHPLNFI SGFM TLN +L NL                YGN S LFS  +  + +P+++ YI
Subjt:  MDVKQKDLQFLGAFDIFRETHKIINRNRKIFAMAALLFIHPLNFIYSGFMGTLNLLLGNLLHHVSVANQSGDLREYYGNLSDLFSHKW-PLFWPFNVFYI

Query:  ISLFYFSVVSTAGVAYTVACVYTGRETSPKNTMSVVAKVWKRLLVTFLCVLLAFLAYNIIAGFALFLIIWTTILKHGPFGKVDGSIFAVFLVFYFVGLSY
          LF +S++STAGV+ TVA +Y G+E S K+ MSVV KVWKRLLVTFLCV+L F  Y++I G ALF+II        P GKVD +   V  VFYFVGL Y
Subjt:  ISLFYFSVVSTAGVAYTVACVYTGRETSPKNTMSVVAKVWKRLLVTFLCVLLAFLAYNIIAGFALFLIIWTTILKHGPFGKVDGSIFAVFLVFYFVGLSY

Query:  LVVVLQLSGVVSALEEGSYGFKAMAKSRSLVKGKM
        LVVVLQLSGVVS LEE S GFKAMAKSR L+K  M
Subjt:  LVVVLQLSGVVSALEEGSYGFKAMAKSRSLVKGKM

A0A5D3DZ60 Putative transmembrane protein | 5.3e-56 | 58.72
Query:  MDVKQKDLQFLGAFDIFRETHKIINRNRKIFAMAALLFIHPLNFIYSGFMGTLNLLLGNLLHHVSVANQSGDLREYYGNLSDLFSHKW-PLFWPFNVFYI
        MD K K++QFLGAF IF ET+KII++N+KIFAM+AL FIHPLNFI SGFM TLN +L NL                YGN S LFS  +  + +P+++ YI
Subjt:  MDVKQKDLQFLGAFDIFRETHKIINRNRKIFAMAALLFIHPLNFIYSGFMGTLNLLLGNLLHHVSVANQSGDLREYYGNLSDLFSHKW-PLFWPFNVFYI

Query:  ISLFYFSVVSTAGVAYTVACVYTGRETSPKNTMSVVAKVWKRLLVTFLCVLLAFLAYNIIAGFALFLIIWTTILKHGPFGKVDGSIFAVFLVFYFVGLSY
          LF +S++STAGV+ TVA +Y G+E S K+ MSVV KVWKRLLVTFLCV+L F  Y++I G ALF+II        P GKVD +   V  VFYFVGL Y
Subjt:  ISLFYFSVVSTAGVAYTVACVYTGRETSPKNTMSVVAKVWKRLLVTFLCVLLAFLAYNIIAGFALFLIIWTTILKHGPFGKVDGSIFAVFLVFYFVGLSY

Query:  LVVVLQLSGVVSALEEGSYGFKAMAKSRSLVKGKM
        LVVVLQLSGVVS LEE S GFKAMAKSR L+K  M
Subjt:  LVVVLQLSGVVSALEEGSYGFKAMAKSRSLVKGKM

A0A6J1DJN8 uncharacterized protein LOC111021154 | 7.2e-61 | 60.78
Query:  MDVKQKDLQFLGAFDIFRETHKIINRNRKIFAMAALLFIHPLNFIYSGFMGTLNLLLGNLLHHVSVANQSGDLREYYGNLSDLFSHKWPLFWPFNVFYII
        MDV+ K +Q+LG  +IF+E+ KIIN+NRKIFAMAAL FIHPLN+I S  MGTLNLLL N+  H              G+ S LFSH WP FWPFNVF II
Subjt:  MDVKQKDLQFLGAFDIFRETHKIINRNRKIFAMAALLFIHPLNFIYSGFMGTLNLLLGNLLHHVSVANQSGDLREYYGNLSDLFSHKWPLFWPFNVFYII

Query:  SLFYFSVVSTAGVAYTVACVYTGRETSPKNTMSVVAKVWKRLLVTFLCVLLAFLAYNIIAGFALFLIIWTTILKHGPFGKVDGSIFAVFLVFYFVGLSYL
         LF +S+VST  V YTVA +YTGRE SPK+    + KVWKR+LVTFLCV++AFL YNI+AG  +FL+I+TTIL   P   V G++ AVF  FYF  L YL
Subjt:  SLFYFSVVSTAGVAYTVACVYTGRETSPKNTMSVVAKVWKRLLVTFLCVLLAFLAYNIIAGFALFLIIWTTILKHGPFGKVDGSIFAVFLVFYFVGLSYL

Query:  VVVLQLSGVVSALEEGSYGFKAMAKSRSLVKG
        V VLQLSGVVS LE+   GFKAMAKSR L+KG
Subjt:  VVVLQLSGVVSALEEGSYGFKAMAKSRSLVKG

A0A6J1F996 uncharacterized protein LOC111443241 | 6.7e-67 | 63.25
Query:  MDVKQKDLQFLGAFDIFRETHKIINRNRKIFAMAALLFIHPLNFIYSGFMGTLNLLLGNLLHHVSVANQSGDLREYYGNLSDLFSHKWPLFWPFNVFYII
        MD+  KD+QF GAF IF+ET KIIN+NR+IFAMAAL FIHPLN++ SGFM TLN LL NL                YGN+S LFSH W LFWPF VFYII
Subjt:  MDVKQKDLQFLGAFDIFRETHKIINRNRKIFAMAALLFIHPLNFIYSGFMGTLNLLLGNLLHHVSVANQSGDLREYYGNLSDLFSHKWPLFWPFNVFYII

Query:  SLFYFSVVSTAGVAYTVACVYTGRETSPKNTMSVVAKVWKRLLVTFLCVLLAFLAYNIIAGFALFLIIWTTILKHGPFGKVDGSIFAVFLVFYFVGLSYL
         +F +S+ STAGV+  VA +Y+GRE S K+ MSVVAKVWKR+LVTFLCV+L FLAY+IIAGF LFLI+W        FGK DG+  A+FL+ Y +GL YL
Subjt:  SLFYFSVVSTAGVAYTVACVYTGRETSPKNTMSVVAKVWKRLLVTFLCVLLAFLAYNIIAGFALFLIIWTTILKHGPFGKVDGSIFAVFLVFYFVGLSYL

Query:  VVVLQLSGVVSALEEGSYGFKAMAKSRSLVKGKM
        +V+LQL+GVVSALEE S GF+AMAKSR L+KGKM
Subjt:  VVVLQLSGVVSALEEGSYGFKAMAKSRSLVKGKM

A0A6J1ILV5 uncharacterized protein LOC111476451 | 2.5e-61 | 61.54
Query:  MDVKQKDLQFLGAFDIFRETHKIINRNRKIFAMAALLFIHPLNFIYSGFMGTLNLLLGNLLHHVSVANQSGDLREYYGNLSDLFSHKWPLFWPFNVFYII
        MD+K KD+QF GAF+IF+ET KIIN+NRKIFAMAAL FIHPLN++ SGFM TL+ LL +L              + YGN+S LFS       PFNVF II
Subjt:  MDVKQKDLQFLGAFDIFRETHKIINRNRKIFAMAALLFIHPLNFIYSGFMGTLNLLLGNLLHHVSVANQSGDLREYYGNLSDLFSHKWPLFWPFNVFYII

Query:  SLFYFSVVSTAGVAYTVACVYTGRETSPKNTMSVVAKVWKRLLVTFLCVLLAFLAYNIIAGFALFLIIWTTILKHGPFGKVDGSIFAVFLVFYFVGLSYL
         +F +S+ STAGV+ TVA +Y+ RE S K+TMSVVAKVWKR+LVTFLCV+L FLAY+IIAGF LFLI+W        FGK DG+  A+FL+ Y +GL YL
Subjt:  SLFYFSVVSTAGVAYTVACVYTGRETSPKNTMSVVAKVWKRLLVTFLCVLLAFLAYNIIAGFALFLIIWTTILKHGPFGKVDGSIFAVFLVFYFVGLSYL

Query:  VVVLQLSGVVSALEEGSYGFKAMAKSRSLVKGKM
        +V+LQL+GVVSALEE S GF+AMAKSR L+KGKM
Subjt:  VVVLQLSGVVSALEEGSYGFKAMAKSRSLVKGKM

SwissProt top hits | e value | % identity | Alignment (shown below each hit)
No hits found

Arabidopsis top hits | e value | % identity | Alignment (shown below each hit)
AT1G31130.1 unknown protein | 4.7e-20 | 34.89
Query:  MDVKQKDLQFLGAFDIFRETHKIINRNRKIFAMAALLFIHPLNFIYSGFMGTLNLLLGNLLHHVSVANQSGDLREYYGNLSDLFSHKWPLFWPFNVFYII
        MD++ ++LQFL    + +E+  I  R+ + F +  L FI PL+F               L H +        L +     SD   H W +   F   Y+I
Subjt:  MDVKQKDLQFLGAFDIFRETHKIINRNRKIFAMAALLFIHPLNFIYSGFMGTLNLLLGNLLHHVSVANQSGDLREYYGNLSDLFSHKWPLFWPFNVFYII

Query:  SLFYFSVVSTAGVAYTVACVYTGRETSPKNTMSVVAKVWKRLLVTFLCVLLAFLAYNIIAGFALFLIIWTTILKHGPFGK--VDGSIFAVFLVFYFVGLS
         LF FS++STA V +TVA +YTG+  S  +T+S + KV+KRL +TFL V L   AYN +  F +FL++    L     G   V G I +   V YF    
Subjt:  SLFYFSVVSTAGVAYTVACVYTGRETSPKNTMSVVAKVWKRLLVTFLCVLLAFLAYNIIAGFALFLIIWTTILKHGPFGK--VDGSIFAVFLVFYFVGLS

Query:  YLVVVLQLSGVVSALEEGSYGFKAMAKSRSLVKGK
        Y   +  L  V+S LE   YG  AM K+  L+KGK
Subjt:  YLVVVLQLSGVVSALEEGSYGFKAMAKSRSLVKGK

AT2G18690.2 unknown protein | 2.1e-04 | 36.63
Query:  KVWKRLLVTFLCVLLAFLAYNIIAGFALFLIIWTTILKHGPFGKVDGSIF-----AVFLVFYFVGLSYLVVVLQLSGVVSALEEGSYGFKAMAKSRSLVK
        K WK  LVT+  + L  L +    GF  F+I+   +L     G V+   F      V L+ + V  SY  +   LS V+S LEE SYGF+A+ K+  +VK
Subjt:  KVWKRLLVTFLCVLLAFLAYNIIAGFALFLIIWTTILKHGPFGKVDGSIF-----AVFLVFYFVGLSYLVVVLQLSGVVSALEEGSYGFKAMAKSRSLVK

Query:  G
        G
Subjt:  G

AT4G19950.1 unknown protein | 7.2e-21 | 34.89
Query:  MDVKQKDLQFLGAFDIFRETHKIINRNRKIFAMAALLFIHPLNFIYSGFMGTLNLLLGNLLHHVSVANQSGDLREYYGNLSDLFSHKWPLFWPFNVFYII
        MD+  ++LQFL    I RE+  I   + K F +  L  I PL+F               L H +        +  Y         H+W +   F   YII
Subjt:  MDVKQKDLQFLGAFDIFRETHKIINRNRKIFAMAALLFIHPLNFIYSGFMGTLNLLLGNLLHHVSVANQSGDLREYYGNLSDLFSHKWPLFWPFNVFYII

Query:  SLFYFSVVSTAGVAYTVACVYTGRETSPKNTMSVVAKVWKRLLVTFLCVLLAFLAYNIIAGFALFLIIWTTILKHGPFGKVDGSIFAVFLVF--YFVGLS
         LF FS++STA V +TVA +YTG+  S  +TMS +  V KRL +TFL V L  LAYN +     FLI   T++       V  ++F++ ++F  + V   
Subjt:  SLFYFSVVSTAGVAYTVACVYTGRETSPKNTMSVVAKVWKRLLVTFLCVLLAFLAYNIIAGFALFLIIWTTILKHGPFGKVDGSIFAVFLVF--YFVGLS

Query:  YLVVVLQLSGVVSALEEGSYGFKAMAKSRSLVKGK
        Y+  +  L+ VVS LE   YG  AM KS  L+KGK
Subjt:  YLVVVLQLSGVVSALEEGSYGFKAMAKSRSLVKGK

AT5G44860.1 unknown protein | 5.7e-18 | 33.19
Query:  MDVKQKDLQFLGAFDIFRETHKIINRNRKIFAMAALLFIHPLNFIYSGFMGTLNLLLGNLLHHVSVANQSGDLREYYGNLSDLFSHKWPLFWPFNVFYII
        MD+  ++LQFL    I RE+  I   + K F +  L  I PL+F         +L    +L  +     S   +          +H+W L   +   Y+I
Subjt:  MDVKQKDLQFLGAFDIFRETHKIINRNRKIFAMAALLFIHPLNFIYSGFMGTLNLLLGNLLHHVSVANQSGDLREYYGNLSDLFSHKWPLFWPFNVFYII

Query:  SLFYFSVVSTAGVAYTVACVYTGRETSPKNTMSVVAKVWKRLLVTFLCVLLAFLAYNIIAGFALFLIIWTTILKHGPFGKVDGSIFAVFLVF-YFVGLS-
         LF FS++STA V +TVA +YTG+  S  +TMS +  V KRL +TFL V L  L YN +  F LFL++    +       V  ++F++ ++F  F+G+  
Subjt:  SLFYFSVVSTAGVAYTVACVYTGRETSPKNTMSVVAKVWKRLLVTFLCVLLAFLAYNIIAGFALFLIIWTTILKHGPFGKVDGSIFAVFLVF-YFVGLS-

Query:  YLVVVLQLSGVVSALEEGSYGFKAMAKSRSLVKGKMGL
        Y+     L+ VVS LE   YG  AM KS  L+ G+  +
Subjt:  YLVVVLQLSGVVSALEEGSYGFKAMAKSRSLVKGKMGL


Sequences
CDS sequence
ATGGACGTAAAGCAGAAAGATCTGCAGTTCCTTGGAGCCTTCGATATCTTCCGAGAAACCCACAAGATCATCAACAGAAACAGGAAGATCTTTGCCATGGCGGCTCTTCT
CTTCATCCACCCTCTAAACTTCATCTACTCGGGTTTCATGGGAACCTTAAATCTCCTCCTCGGCAATCTCCTCCACCACGTATCAGTCGCCAACCAGAGCGGCGATCTTC
GTGAATATTATGGAAACTTATCAGATCTCTTCTCCCATAAATGGCCCTTGTTTTGGCCCTTCAATGTCTTCTACATCATATCCCTCTTCTATTTCTCCGTCGTCTCCACC
GCCGGCGTGGCTTACACCGTCGCCTGCGTATACACTGGCCGAGAAACCTCTCCCAAGAACACCATGAGCGTCGTCGCCAAGGTCTGGAAGCGACTTCTGGTCACATTCCT
CTGTGTTCTTTTAGCTTTCTTGGCGTATAATATAATAGCTGGATTTGCATTGTTCTTGATTATTTGGACGACCATTCTGAAACATGGGCCGTTCGGGAAAGTCGATGGTT
CAATTTTTGCTGTGTTTTTGGTTTTTTACTTTGTTGGGTTGTCGTATTTGGTCGTGGTTTTGCAACTTTCGGGTGTAGTTTCTGCGTTGGAAGAAGGGTCTTATGGGTTT
AAGGCAATGGCGAAGAGCAGGTCGCTGGTGAAGGGGAAGATGGGGCTGCGACGGTAG
mRNA sequence
ATGGACGTAAAGCAGAAAGATCTGCAGTTCCTTGGAGCCTTCGATATCTTCCGAGAAACCCACAAGATCATCAACAGAAACAGGAAGATCTTTGCCATGGCGGCTCTTCT
CTTCATCCACCCTCTAAACTTCATCTACTCGGGTTTCATGGGAACCTTAAATCTCCTCCTCGGCAATCTCCTCCACCACGTATCAGTCGCCAACCAGAGCGGCGATCTTC
GTGAATATTATGGAAACTTATCAGATCTCTTCTCCCATAAATGGCCCTTGTTTTGGCCCTTCAATGTCTTCTACATCATATCCCTCTTCTATTTCTCCGTCGTCTCCACC
GCCGGCGTGGCTTACACCGTCGCCTGCGTATACACTGGCCGAGAAACCTCTCCCAAGAACACCATGAGCGTCGTCGCCAAGGTCTGGAAGCGACTTCTGGTCACATTCCT
CTGTGTTCTTTTAGCTTTCTTGGCGTATAATATAATAGCTGGATTTGCATTGTTCTTGATTATTTGGACGACCATTCTGAAACATGGGCCGTTCGGGAAAGTCGATGGTT
CAATTTTTGCTGTGTTTTTGGTTTTTTACTTTGTTGGGTTGTCGTATTTGGTCGTGGTTTTGCAACTTTCGGGTGTAGTTTCTGCGTTGGAAGAAGGGTCTTATGGGTTT
AAGGCAATGGCGAAGAGCAGGTCGCTGGTGAAGGGGAAGATGGGGCTGCGACGGTAG
Protein sequence
MDVKQKDLQFLGAFDIFRETHKIINRNRKIFAMAALLFIHPLNFIYSGFMGTLNLLLGNLLHHVSVANQSGDLREYYGNLSDLFSHKWPLFWPFNVFYIISLFYFSVVST
AGVAYTVACVYTGRETSPKNTMSVVAKVWKRLLVTFLCVLLAFLAYNIIAGFALFLIIWTTILKHGPFGKVDGSIFAVFLVFYFVGLSYLVVVLQLSGVVSALEEGSYGF
KAMAKSRSLVKGKMGLRR
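
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1) and a CDS that starts in frame; the `translate` helper is hypothetical and is not part of CuGenDB. It joins wrapped sequence lines before translating, so the multi-line CDS can be pasted in as is.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS to protein; '*' marks a stop codon.

    Whitespace and line breaks are removed first. Ambiguous bases such as
    N are not handled and raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First eight codons of the CDS above.
print(translate("ATGGACGTAAAGCAGAAAGATCTG"))
```

Translating the full CDS and dropping the trailing `*` should reproduce the protein sequence if the two records are consistent.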