; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg028584 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg028584
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionPlant protein of unknown function (DUF868)
Genome locationscaffold7:11477456..11480213
RNA-Seq ExpressionSpg028584
SyntenySpg028584
Gene Ontology termsNA
InterPro domainsIPR008586 - Protein of unknown function DUF868, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008452874.1 PREDICTED: uncharacterized protein LOC103493769 [Cucumis melo]1.6e-7576.62Show/hide
Query:  DLRRARF-SSSPEPHAGFFIAVVVDGETTLLVDDMIKEVNAKIKAAK-PQISQTLILKREHVIAHKIYTTKAKFGSQIREIKIDCSFFDDGDDESGLSFS
        DLRRARF SSSPEP +GFFIAVVVDGE TLLV DMIKE + KI+AAK PQ+ QTLILKREHV AHKIYTTKAKF  QIREI+IDC F +  DD+ GLSFS
Subjt:  DLRRARF-SSSPEPHAGFFIAVVVDGETTLLVDDMIKEVNAKIKAAK-PQISQTLILKREHVIAHKIYTTKAKFGSQIREIKIDCSFFDDGDDESGLSFS

Query:  VDGKRVLQIKRLKWKFRGNERIEVAGVPVEVYWDVYNWVFELEKENRGNAVFMFRFEEEDEEQTNSLARQQNWNLGLNELEWRRMKKGKEKKKIQSAAAA
        VDGKRVL+IKRLKWKFRGNERIEVAGV ++VYWDVYNWVFELEKENRGNAVFMFRFEEE EEQ+N   +QQNWNLGLNELEWRRM+       I  + + 
Subjt:  VDGKRVLQIKRLKWKFRGNERIEVAGVPVEVYWDVYNWVFELEKENRGNAVFMFRFEEEDEEQTNSLARQQNWNLGLNELEWRRMKKGKEKKKIQSAAAA

Query:  S
        S
Subjt:  S

XP_022936246.1 uncharacterized protein LOC111442918 [Cucurbita moschata]3.6e-7574.37Show/hide
Query:  DLRRARFSSSPEPHAGFFIAVVVDGETTLLVDDMIKEVNAKIKAAKPQISQTLILKREHVIAHKIYTTKAKFGSQIREIKIDCSFFDDGDDESGLSFSVD
        DLRRARFSSSPEP++GFFIAVVV+GE TLLV DMIKE +AK +AAK +I QTLILKREHVIAHKIYTTKAKFG QIREI+IDC  F+  DDE GLSFSVD
Subjt:  DLRRARFSSSPEPHAGFFIAVVVDGETTLLVDDMIKEVNAKIKAAKPQISQTLILKREHVIAHKIYTTKAKFGSQIREIKIDCSFFDDGDDESGLSFSVD

Query:  GKRVLQIKRLKWKFRGNERIEVAGVPVEVYWDVYNWVFELEKENRGNAVFMFRFEEEDEEQTNSLARQQNWNLGLNELEWRRMKKGKEKKKIQSAAAAS
         K VL+IKRLKWKFRGNERIEV G+PV+VYWDVY+WVFE EKENRGNAVFMFRFE   +EQ+N  ARQQN NLGL ELEWRRM+       +  +A+ S
Subjt:  GKRVLQIKRLKWKFRGNERIEVAGVPVEVYWDVYNWVFELEKENRGNAVFMFRFEEEDEEQTNSLARQQNWNLGLNELEWRRMKKGKEKKKIQSAAAAS

XP_022975758.1 uncharacterized protein LOC111476229 [Cucurbita maxima]7.2e-7675.38Show/hide
Query:  DLRRARFSSSPEPHAGFFIAVVVDGETTLLVDDMIKEVNAKIKAAKPQISQTLILKREHVIAHKIYTTKAKFGSQIREIKIDCSFFDDGDDESGLSFSVD
        DLRRARFSSSPEP++GFFIAVVV+GE  LLV DMIKE +AK +AAK QI QTLILKREHVIAHKIYTTKAKFG QIREI+IDC  F+  DDE GLSFSVD
Subjt:  DLRRARFSSSPEPHAGFFIAVVVDGETTLLVDDMIKEVNAKIKAAKPQISQTLILKREHVIAHKIYTTKAKFGSQIREIKIDCSFFDDGDDESGLSFSVD

Query:  GKRVLQIKRLKWKFRGNERIEVAGVPVEVYWDVYNWVFELEKENRGNAVFMFRFEEEDEEQTNSLARQQNWNLGLNELEWRRMKKGKEKKKIQSAAAAS
         K VL+IKRLKWKFRGNERIEV G+PV+VYWDVY+WVFE EKENRGNAVFMFRFE   EEQ+N  ARQQN NLGLNELEWRRM+       +  +A+ S
Subjt:  GKRVLQIKRLKWKFRGNERIEVAGVPVEVYWDVYNWVFELEKENRGNAVFMFRFEEEDEEQTNSLARQQNWNLGLNELEWRRMKKGKEKKKIQSAAAAS

XP_023536134.1 uncharacterized protein LOC111797382 [Cucurbita pepo subsp. pepo]2.7e-7574.37Show/hide
Query:  DLRRARFSSSPEPHAGFFIAVVVDGETTLLVDDMIKEVNAKIKAAKPQISQTLILKREHVIAHKIYTTKAKFGSQIREIKIDCSFFDDGDDESGLSFSVD
        DLRRARFSSSPEP++GFFIAVVV+GE TLLV DMIKE +AK +AAK +I QTLILKREHVIA+KIYTTKAKFG QIREI+IDC  F+  DDE GLSFSVD
Subjt:  DLRRARFSSSPEPHAGFFIAVVVDGETTLLVDDMIKEVNAKIKAAKPQISQTLILKREHVIAHKIYTTKAKFGSQIREIKIDCSFFDDGDDESGLSFSVD

Query:  GKRVLQIKRLKWKFRGNERIEVAGVPVEVYWDVYNWVFELEKENRGNAVFMFRFEEEDEEQTNSLARQQNWNLGLNELEWRRMKKGKEKKKIQSAAAAS
         K VL+IKRLKWKFRGNERIEV G+PV+VYWDVY+W+FE EKENRGNAVFMFRFE   EEQ+N  ARQQN NLGLNELEWRRM+       +  +A+ S
Subjt:  GKRVLQIKRLKWKFRGNERIEVAGVPVEVYWDVYNWVFELEKENRGNAVFMFRFEEEDEEQTNSLARQQNWNLGLNELEWRRMKKGKEKKKIQSAAAAS

XP_038896842.1 uncharacterized protein LOC120085067 [Benincasa hispida]1.2e-7857.68Show/hide
Query:  PSSLSTREAAVSDLEIELSSRTSLLRRAFYTNTSSSNLQPSPQ-ECHRRLIPNPATPLP-HSPLSVLRSSLETKGKSPSFQTFLFDLLASIQLCSGEGRR
        PSS   R          +S   +L    ++TN +  +L  S     H  L+     P P  SPL+ L SS  +   S SF+  +  L+            
Subjt:  PSSLSTREAAVSDLEIELSSRTSLLRRAFYTNTSSSNLQPSPQ-ECHRRLIPNPATPLP-HSPLSVLRSSLETKGKSPSFQTFLFDLLASIQLCSGEGRR

Query:  QLRRVCGLKKKRD---VREDLRRARFSSSPEPHAGFFIAVVVDGETTLLVDDMIKEVNAKIKAAK-PQISQTLILKREHVIAHKIYTTKAKFGSQIREIK
          R   G +K  D   V  DLRRARFSSSPEP +GFFIAVVVDGE TLLV DMI E + KI+AAK P+I QTLILKREHVIAHKIYTTKAKF  QIREI+
Subjt:  QLRRVCGLKKKRD---VREDLRRARFSSSPEPHAGFFIAVVVDGETTLLVDDMIKEVNAKIKAAK-PQISQTLILKREHVIAHKIYTTKAKFGSQIREIK

Query:  IDCSFFDDGDDESGLSFSVDGKRVLQIKRLKWKFRGNERIEVAGVPVEVYWDVYNWVFELEKENRGNAVFMFRFEEEDEEQTNSLARQQNWNLGLNELEW
        IDC F +  DDE GLSFSVDGK+VL+IKRLKWKFRGNERIEVAGVP++VYWDVYNWVFELEKENRGNAVFMFRFEEED+EQ+N + RQQNWNLGLNELEW
Subjt:  IDCSFFDDGDDESGLSFSVDGKRVLQIKRLKWKFRGNERIEVAGVPVEVYWDVYNWVFELEKENRGNAVFMFRFEEEDEEQTNSLARQQNWNLGLNELEW

Query:  RRMKKGKEKKKIQSAAAAS
        RRM+       +  + + S
Subjt:  RRMKKGKEKKKIQSAAAAS

TrEMBL top hitse value%identityAlignment
A0A0A0L0G2 Uncharacterized protein3.3e-7455.84Show/hide
Query:  PSSLSTREAAVSDLEIELSSRTSLLRRAFYTNTSSSNLQPSPQ-ECHRRLIPNPATPLP-HSPLSVLRSSLETKGKSPSFQTFLFDLLASIQLCSGEGRR
        PSS   R    S   + +S   +L    ++TN +  +L  S     H  L+     P P  SP +   SS  +   S SF+  +  L+   +  S +   
Subjt:  PSSLSTREAAVSDLEIELSSRTSLLRRAFYTNTSSSNLQPSPQ-ECHRRLIPNPATPLP-HSPLSVLRSSLETKGKSPSFQTFLFDLLASIQLCSGEGRR

Query:  QLRRVCGLKKKRDVREDLRRARF-SSSPEPHAGFFIAVVVDGETTLLVDDMIKEVNAKIKAAK-PQISQTLILKREHVIAHKIYTTKAKFGSQIREIKID
               L     V  DLRRARF SSSPEP +GFFIAVVVDGE TLLV DM+KE + KI+AAK PQ+ QTLILKREHV AHK+YTTKAKF  QIREI+ID
Subjt:  QLRRVCGLKKKRDVREDLRRARF-SSSPEPHAGFFIAVVVDGETTLLVDDMIKEVNAKIKAAK-PQISQTLILKREHVIAHKIYTTKAKFGSQIREIKID

Query:  CSFFDDGDDESGLSFSVDGKRVLQIKRLKWKFRGNERIEVAGVPVEVYWDVYNWVFELEKENRGNAVFMFRFEEEDEEQTNSLARQQNWNLGLNELEWRR
        C F +  DD+ GLSFSVDGKRVL+IKRLKWKFRGNERIEVAGVP++VYWDVYNWVFELEKE+RGNAVFMFRFEEE+EEQ+++  +QQNWNLGLNELEWRR
Subjt:  CSFFDDGDDESGLSFSVDGKRVLQIKRLKWKFRGNERIEVAGVPVEVYWDVYNWVFELEKENRGNAVFMFRFEEEDEEQTNSLARQQNWNLGLNELEWRR

Query:  MKKGKEKKKIQSAAAAS
        M+       I  + + S
Subjt:  MKKGKEKKKIQSAAAAS

A0A1S3BW34 uncharacterized protein LOC1034937697.8e-7676.62Show/hide
Query:  DLRRARF-SSSPEPHAGFFIAVVVDGETTLLVDDMIKEVNAKIKAAK-PQISQTLILKREHVIAHKIYTTKAKFGSQIREIKIDCSFFDDGDDESGLSFS
        DLRRARF SSSPEP +GFFIAVVVDGE TLLV DMIKE + KI+AAK PQ+ QTLILKREHV AHKIYTTKAKF  QIREI+IDC F +  DD+ GLSFS
Subjt:  DLRRARF-SSSPEPHAGFFIAVVVDGETTLLVDDMIKEVNAKIKAAK-PQISQTLILKREHVIAHKIYTTKAKFGSQIREIKIDCSFFDDGDDESGLSFS

Query:  VDGKRVLQIKRLKWKFRGNERIEVAGVPVEVYWDVYNWVFELEKENRGNAVFMFRFEEEDEEQTNSLARQQNWNLGLNELEWRRMKKGKEKKKIQSAAAA
        VDGKRVL+IKRLKWKFRGNERIEVAGV ++VYWDVYNWVFELEKENRGNAVFMFRFEEE EEQ+N   +QQNWNLGLNELEWRRM+       I  + + 
Subjt:  VDGKRVLQIKRLKWKFRGNERIEVAGVPVEVYWDVYNWVFELEKENRGNAVFMFRFEEEDEEQTNSLARQQNWNLGLNELEWRRMKKGKEKKKIQSAAAA

Query:  S
        S
Subjt:  S

A0A5D3D939 DUF868 domain-containing protein7.8e-7676.62Show/hide
Query:  DLRRARF-SSSPEPHAGFFIAVVVDGETTLLVDDMIKEVNAKIKAAK-PQISQTLILKREHVIAHKIYTTKAKFGSQIREIKIDCSFFDDGDDESGLSFS
        DLRRARF SSSPEP +GFFIAVVVDGE TLLV DMIKE + KI+AAK PQ+ QTLILKREHV AHKIYTTKAKF  QIREI+IDC F +  DD+ GLSFS
Subjt:  DLRRARF-SSSPEPHAGFFIAVVVDGETTLLVDDMIKEVNAKIKAAK-PQISQTLILKREHVIAHKIYTTKAKFGSQIREIKIDCSFFDDGDDESGLSFS

Query:  VDGKRVLQIKRLKWKFRGNERIEVAGVPVEVYWDVYNWVFELEKENRGNAVFMFRFEEEDEEQTNSLARQQNWNLGLNELEWRRMKKGKEKKKIQSAAAA
        VDGKRVL+IKRLKWKFRGNERIEVAGV ++VYWDVYNWVFELEKENRGNAVFMFRFEEE EEQ+N   +QQNWNLGLNELEWRRM+       I  + + 
Subjt:  VDGKRVLQIKRLKWKFRGNERIEVAGVPVEVYWDVYNWVFELEKENRGNAVFMFRFEEEDEEQTNSLARQQNWNLGLNELEWRRMKKGKEKKKIQSAAAA

Query:  S
        S
Subjt:  S

A0A6J1F6Z5 uncharacterized protein LOC1114429181.7e-7574.37Show/hide
Query:  DLRRARFSSSPEPHAGFFIAVVVDGETTLLVDDMIKEVNAKIKAAKPQISQTLILKREHVIAHKIYTTKAKFGSQIREIKIDCSFFDDGDDESGLSFSVD
        DLRRARFSSSPEP++GFFIAVVV+GE TLLV DMIKE +AK +AAK +I QTLILKREHVIAHKIYTTKAKFG QIREI+IDC  F+  DDE GLSFSVD
Subjt:  DLRRARFSSSPEPHAGFFIAVVVDGETTLLVDDMIKEVNAKIKAAKPQISQTLILKREHVIAHKIYTTKAKFGSQIREIKIDCSFFDDGDDESGLSFSVD

Query:  GKRVLQIKRLKWKFRGNERIEVAGVPVEVYWDVYNWVFELEKENRGNAVFMFRFEEEDEEQTNSLARQQNWNLGLNELEWRRMKKGKEKKKIQSAAAAS
         K VL+IKRLKWKFRGNERIEV G+PV+VYWDVY+WVFE EKENRGNAVFMFRFE   +EQ+N  ARQQN NLGL ELEWRRM+       +  +A+ S
Subjt:  GKRVLQIKRLKWKFRGNERIEVAGVPVEVYWDVYNWVFELEKENRGNAVFMFRFEEEDEEQTNSLARQQNWNLGLNELEWRRMKKGKEKKKIQSAAAAS

A0A6J1IK75 uncharacterized protein LOC1114762293.5e-7675.38Show/hide
Query:  DLRRARFSSSPEPHAGFFIAVVVDGETTLLVDDMIKEVNAKIKAAKPQISQTLILKREHVIAHKIYTTKAKFGSQIREIKIDCSFFDDGDDESGLSFSVD
        DLRRARFSSSPEP++GFFIAVVV+GE  LLV DMIKE +AK +AAK QI QTLILKREHVIAHKIYTTKAKFG QIREI+IDC  F+  DDE GLSFSVD
Subjt:  DLRRARFSSSPEPHAGFFIAVVVDGETTLLVDDMIKEVNAKIKAAKPQISQTLILKREHVIAHKIYTTKAKFGSQIREIKIDCSFFDDGDDESGLSFSVD

Query:  GKRVLQIKRLKWKFRGNERIEVAGVPVEVYWDVYNWVFELEKENRGNAVFMFRFEEEDEEQTNSLARQQNWNLGLNELEWRRMKKGKEKKKIQSAAAAS
         K VL+IKRLKWKFRGNERIEV G+PV+VYWDVY+WVFE EKENRGNAVFMFRFE   EEQ+N  ARQQN NLGLNELEWRRM+       +  +A+ S
Subjt:  GKRVLQIKRLKWKFRGNERIEVAGVPVEVYWDVYNWVFELEKENRGNAVFMFRFEEEDEEQTNSLARQQNWNLGLNELEWRRMKKGKEKKKIQSAAAAS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G04220.1 Plant protein of unknown function (DUF868)1.1e-2637.58Show/hide
Query:  DVREDLRRARFSSSPEPHAGFFIAVVVDGETTLLVDDMIKEVNAKIKAAKPQISQTLILKREHVIAHKIYTTKAKFGSQIREIKIDCSFFDDGDDESGLS
        +V  D R A+F+SSPEP + F++A+V + E  LLV D  K+   + K+    +   L  K+E+V   K +TT+AKF  + +E +I       G  E  + 
Subjt:  DVREDLRRARFSSSPEPHAGFFIAVVVDGETTLLVDDMIKEVNAKIKAAKPQISQTLILKREHVIAHKIYTTKAKFGSQIREIKIDCSFFDDGDDESGLS

Query:  FSVDGKRVLQIKRLKWKFRGNERIEVAGVPVEVYWDVYNWVFELEKENRGNAVFMFR
         S+DG  ++Q+K L+WKFRGN+ + V   PV+V+WDVY+W+F +     G+ +F+F+
Subjt:  FSVDGKRVLQIKRLKWKFRGNERIEVAGVPVEVYWDVYNWVFELEKENRGNAVFMFR

AT2G25200.1 Plant protein of unknown function (DUF868)3.5e-2839.11Show/hide
Query:  LKKKRDVRE--DLRRARFSSSPEPHAGFFIAVVVDGETTLLVDDMIKEVNAKIKAAKPQISQTLILKREHVIAHKIYTTKAKFGSQIREIKIDCSFFDDG
        L +K D+R   DL  A+F S P+P +GF++AV V GE  LLV       N K +  +    Q L+ K+E++  +++Y+TK     ++REI ID       
Subjt:  LKKKRDVRE--DLRRARFSSSPEPHAGFFIAVVVDGETTLLVDDMIKEVNAKIKAAKPQISQTLILKREHVIAHKIYTTKAKFGSQIREIKIDCSFFDDG

Query:  DDESGLSFSVDGKRVLQIKRLKWKFRGNERIEVAGVPVEVYWDVYNWVF---ELEKENRGNAVFMFRFEEEDEEQTNSL
        +D++ L FSVD K VL+I +L+WKFRGN +I + GV +++ WDV+NW+F   +  K ++  AVF+ RFE ++ E  + L
Subjt:  DDESGLSFSVDGKRVLQIKRLKWKFRGNERIEVAGVPVEVYWDVYNWVF---ELEKENRGNAVFMFRFEEEDEEQTNSL

AT3G04860.1 Plant protein of unknown function (DUF868)3.5e-2839.9Show/hide
Query:  DVREDLRRARFSSSPEPHAGFFIAVVVDGETTLLVDDMIKEVNAKIKAA-KPQISQTLILKREHVIAHKIYTTKAKFG--SQIREIKIDCSFFDDGDDES
        DV  DL  A+F SSPEP  GF++ VVVD E  LL+ DM KE   K  AA    +    I K+EHV   + + TKA+F    +  ++ I+C   D    + 
Subjt:  DVREDLRRARFSSSPEPHAGFFIAVVVDGETTLLVDDMIKEVNAKIKAA-KPQISQTLILKREHVIAHKIYTTKAKFG--SQIREIKIDCSFFDDGDDES

Query:  GLSFSVDGKRVLQIKRLKWKFRGNERIEVAGVPVEVYWDVYNWVFELEKENRGNAVFMFRFEEEDEE-----QTNSLARQQNWNLGLNELEWR
         L   VDGK ++Q++RL WKFRGN+ I V  + VEV WDV++W F L   + GNAVFMFR  +  E+     Q  + ++ Q++   L    W+
Subjt:  GLSFSVDGKRVLQIKRLKWKFRGNERIEVAGVPVEVYWDVYNWVFELEKENRGNAVFMFRFEEEDEE-----QTNSLARQQNWNLGLNELEWR

AT5G11000.1 Plant protein of unknown function (DUF868)2.4e-4545.33Show/hide
Query:  DLRRARFSSSPEPHAGFFIAVVVDGETTLLVDDMIKEVNAKIKAAKPQIS-QTLILKREHVIAHKIYTTKAKFGSQIREIKIDCSFFDDGDDESGLSFSV
        DL +A+F S  EP +GF+IAVVVDGE  LLV D +KE  A+ K+AKP  + Q L+L++EHV   +++TTKA+FG + REI IDC      D+++ L FSV
Subjt:  DLRRARFSSSPEPHAGFFIAVVVDGETTLLVDDMIKEVNAKIKAAKPQIS-QTLILKREHVIAHKIYTTKAKFGSQIREIKIDCSFFDDGDDESGLSFSV

Query:  DGKRVLQIKRLKWKFRGNERIEVAGVPVEVYWDVYNWVFELEKE-----NRGNAVFMFRF---------------EEEDEEQTNSLA-----RQQNWNLG
        D K+VLQIKRL+WKFRGNE++E+ GV V++ WDVYNW+F+ +         G+AVFMFRF               EEEDE+  N +      +Q   + G
Subjt:  DGKRVLQIKRLKWKFRGNERIEVAGVPVEVYWDVYNWVFELEKE-----NRGNAVFMFRF---------------EEEDEEQTNSLA-----RQQNWNLG

Query:  LNEL-EWRRMKKGKEKKKIQSAAAA
        +  + EWR+M+K   K K  S++++
Subjt:  LNEL-EWRRMKKGKEKKKIQSAAAA

AT5G28150.1 Plant protein of unknown function (DUF868)3.2e-2943.37Show/hide
Query:  DVREDLRRARFSSSPEPHAGFFIAVVVDGETTLLVDDMIKEVNAKIKAAKPQISQTLILKREHVIAHKIYTTKAKF--GSQIREIKIDCSFFDDGDDESG
        DV  DL  A+F S PE   GF++ VVVD E  LL+ DM KE   K  A+   +    I K+EHV   +++ TKA+     +  ++ I+C   D    +  
Subjt:  DVREDLRRARFSSSPEPHAGFFIAVVVDGETTLLVDDMIKEVNAKIKAAKPQISQTLILKREHVIAHKIYTTKAKF--GSQIREIKIDCSFFDDGDDESG

Query:  LSFSVDGKRVLQIKRLKWKFRGNERIEVAGVPVEVYWDVYNWVFELEKENRGNAVFMFRFEEEDEE
        L   VDGK +LQ+KRLKWKFRGN+ I V  + VEV WDV++W+F L     GNAVFMFR  +  E+
Subjt:  LSFSVDGKRVLQIKRLKWKFRGNERIEVAGVPVEVYWDVYNWVFELEKENRGNAVFMFRFEEEDEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCAACAGGAAGCCGACCAGCTACAAATCTGGGAGCGGTAGTTGTAGCCCAGCCACCGGCGCGGTTCAACTGCAGCTCGGCCGTCGTAGTCGTCCGTTCGTGGTCCT
CGCGTCGGTGATCTCGAGGATCTTGAAGAAATCGCTCCAGATCTCGGCATGGGCGTGCGTGCATCACAAACTTGCTGGAGCTGCCGCCTCGCCGGAGCTGTCGAACGCCT
CGCTGGAGCTGCCATCGAGCTTGTCCACACGAGAAGCCGCTGTTTCAGATCTGGAAATCGAGCTGTCATCTAGAACCTCGTTGCTTCGTCGAGCCTTCTACACAAATACG
TCGTCGTCGAACCTTCAACCTTCTCCACAAGAATGCCACCGCCGCCTTATACCGAACCCTGCTACGCCTCTTCCTCATTCACCTCTCTCTGTCCTTCGATCTAGTCTTGA
AACGAAGGGAAAATCCCCTTCGTTTCAGACCTTCCTCTTTGATCTTTTAGCTTCGATTCAGCTTTGTTCCGGTGAAGGCAGGAGGCAGCTCCGACGAGTTTGTGGTCTGA
AGAAGAAGAGGGATGTGAGGGAGGACCTCCGGCGAGCCCGTTTCTCGTCCTCGCCGGAACCCCATGCCGGATTCTTCATCGCCGTCGTCGTCGACGGCGAAACGACTCTT
CTTGTCGACGATATGATCAAAGAAGTCAACGCGAAGATCAAAGCGGCAAAGCCCCAAATTTCACAAACCCTAATTCTAAAAAGGGAACACGTGATCGCGCACAAAATCTA
CACCACGAAGGCAAAATTCGGCAGCCAAATCCGAGAAATCAAAATCGACTGTAGCTTCTTCGACGACGGCGACGACGAATCGGGGCTTTCGTTCAGCGTCGACGGGAAAA
GGGTTCTGCAAATCAAGCGGCTGAAATGGAAATTCAGAGGAAACGAAAGAATCGAAGTGGCTGGAGTTCCAGTGGAGGTTTATTGGGACGTGTACAACTGGGTCTTCGAA
TTGGAGAAGGAAAATAGAGGAAATGCAGTGTTCATGTTCAGATTCGAAGAGGAAGACGAAGAACAGACCAATTCATTAGCGCGACAGCAGAATTGGAACCTGGGATTGAA
CGAATTGGAATGGAGGAGGATGAAGAAGGGAAAAGAGAAAAAAAAAATTCAGTCGGCGGCGGCGGCAAGCGGTGGCCGCCGGCCGTCGGAGATATGA
mRNA sequenceShow/hide mRNA sequence
ATGTTCAACAGGAAGCCGACCAGCTACAAATCTGGGAGCGGTAGTTGTAGCCCAGCCACCGGCGCGGTTCAACTGCAGCTCGGCCGTCGTAGTCGTCCGTTCGTGGTCCT
CGCGTCGGTGATCTCGAGGATCTTGAAGAAATCGCTCCAGATCTCGGCATGGGCGTGCGTGCATCACAAACTTGCTGGAGCTGCCGCCTCGCCGGAGCTGTCGAACGCCT
CGCTGGAGCTGCCATCGAGCTTGTCCACACGAGAAGCCGCTGTTTCAGATCTGGAAATCGAGCTGTCATCTAGAACCTCGTTGCTTCGTCGAGCCTTCTACACAAATACG
TCGTCGTCGAACCTTCAACCTTCTCCACAAGAATGCCACCGCCGCCTTATACCGAACCCTGCTACGCCTCTTCCTCATTCACCTCTCTCTGTCCTTCGATCTAGTCTTGA
AACGAAGGGAAAATCCCCTTCGTTTCAGACCTTCCTCTTTGATCTTTTAGCTTCGATTCAGCTTTGTTCCGGTGAAGGCAGGAGGCAGCTCCGACGAGTTTGTGGTCTGA
AGAAGAAGAGGGATGTGAGGGAGGACCTCCGGCGAGCCCGTTTCTCGTCCTCGCCGGAACCCCATGCCGGATTCTTCATCGCCGTCGTCGTCGACGGCGAAACGACTCTT
CTTGTCGACGATATGATCAAAGAAGTCAACGCGAAGATCAAAGCGGCAAAGCCCCAAATTTCACAAACCCTAATTCTAAAAAGGGAACACGTGATCGCGCACAAAATCTA
CACCACGAAGGCAAAATTCGGCAGCCAAATCCGAGAAATCAAAATCGACTGTAGCTTCTTCGACGACGGCGACGACGAATCGGGGCTTTCGTTCAGCGTCGACGGGAAAA
GGGTTCTGCAAATCAAGCGGCTGAAATGGAAATTCAGAGGAAACGAAAGAATCGAAGTGGCTGGAGTTCCAGTGGAGGTTTATTGGGACGTGTACAACTGGGTCTTCGAA
TTGGAGAAGGAAAATAGAGGAAATGCAGTGTTCATGTTCAGATTCGAAGAGGAAGACGAAGAACAGACCAATTCATTAGCGCGACAGCAGAATTGGAACCTGGGATTGAA
CGAATTGGAATGGAGGAGGATGAAGAAGGGAAAAGAGAAAAAAAAAATTCAGTCGGCGGCGGCGGCAAGCGGTGGCCGCCGGCCGTCGGAGATATGA
Protein sequenceShow/hide protein sequence
MFNRKPTSYKSGSGSCSPATGAVQLQLGRRSRPFVVLASVISRILKKSLQISAWACVHHKLAGAAASPELSNASLELPSSLSTREAAVSDLEIELSSRTSLLRRAFYTNT
SSSNLQPSPQECHRRLIPNPATPLPHSPLSVLRSSLETKGKSPSFQTFLFDLLASIQLCSGEGRRQLRRVCGLKKKRDVREDLRRARFSSSPEPHAGFFIAVVVDGETTL
LVDDMIKEVNAKIKAAKPQISQTLILKREHVIAHKIYTTKAKFGSQIREIKIDCSFFDDGDDESGLSFSVDGKRVLQIKRLKWKFRGNERIEVAGVPVEVYWDVYNWVFE
LEKENRGNAVFMFRFEEEDEEQTNSLARQQNWNLGLNELEWRRMKKGKEKKKIQSAAAASGGRRPSEI