; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg034518 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg034518
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionULP_PROTEASE domain-containing protein
Genome locationscaffold4:15721516..15723588
RNA-Seq ExpressionSpg034518
SyntenySpg034518
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0067725.1 uncharacterized protein E6C27_scaffold352G00380 [Cucumis melo var. makuwa]2.7e-4038.49Show/hide
Query:  VRVLIDVVHDADAQLPIPIVGNIESVFDAVGSHVPWPKELVILDKEKKEPKKMHRELAAIKEKKSNSQVSKVSVAVKYVLGFIE-NMTRDYLDITI-PDH
        VRVLI++  D+ + LPIP+VG+I SV DA+GSHVPW K L+ ++++KK   K  REL A+K  KS     K+ + +++VL  +E +M  +YL I +    
Subjt:  VRVLIDVVHDADAQLPIPIVGNIESVFDAVGSHVPWPKELVILDKEKKEPKKMHRELAAIKEKKSNSQVSKVSVAVKYVLGFIE-NMTRDYLDITI-PDH

Query:  VFDYHFDFHLMKNSMKEFCSMEELATTVITIYIAYLCEFDPFILEQYAFLHPTLISVGSGPKENRSRALCKRLRQVNDKLLICPFNPEYHWMLLVISIKT
        +  Y F+ ++MK+S+K+ C MEEL  +VI  Y+  L E DP I+E+YAF++P  I                                             
Subjt:  VFDYHFDFHLMKNSMKEFCSMEELATTVITIYIAYLCEFDPFILEQYAFLHPTLISVGSGPKENRSRALCKRLRQVNDKLLICPFNPEYHWMLLVISIKT

Query:  FTIYSIDSLKH-DFRDDVKNMVNTAIRMFYSQTNIQSPPFKWVYVK------SIECGYYTLKFIRDIVSHKSRVITDV
          +Y++DSL+H   RD++K+MVNT +RMFY++TN ++ P  WV  K      S ECGYY +KF++DIV  KS  ITDV
Subjt:  FTIYSIDSLKH-DFRDDVKNMVNTAIRMFYSQTNIQSPPFKWVYVK------SIECGYYTLKFIRDIVSHKSRVITDV

TYK18876.1 uncharacterized protein E5676_scaffold204G00800 [Cucumis melo var. makuwa]1.3e-4537.79Show/hide
Query:  EKPDQGRSCKLAVEDVKNIVVAGIVYKRKDDHEIVYGVPLTANYVRVLIDVVHDADAQLPIPIVGNIESVFDAVGSHVPWPKELVILDKEKKEPKKMHRE
        E   +GR C LAV  V NIV  GIV++R   +EIVY V L    V V I++  D+ + LPIPIVGNI SV DA+GSHV W K L++++++KK   K  RE
Subjt:  EKPDQGRSCKLAVEDVKNIVVAGIVYKRKDDHEIVYGVPLTANYVRVLIDVVHDADAQLPIPIVGNIESVFDAVGSHVPWPKELVILDKEKKEPKKMHRE

Query:  LAAIKEKKSNSQVSKVSVAVKYVLGFIE-NMTRDYLDITI-PDHVFDYHFDFHLMKNSMKEFCSMEELATTVITIYIAYLCEFDPFILEQYAFLHPTLIS
        L A+K  K      K+S  +++VL  +E +M  +YL I +    +F Y F+ ++MK+S+K+ C MEELA +VI  YI  L E DP ILE+Y F++P  IS
Subjt:  LAAIKEKKSNSQVSKVSVAVKYVLGFIE-NMTRDYLDITI-PDHVFDYHFDFHLMKNSMKEFCSMEELATTVITIYIAYLCEFDPFILEQYAFLHPTLIS

Query:  VGSGPKENRSRALCKRL-RQVNDKLLICPFNPEYHWMLLVISIKTFTIYSIDSLKHDFRDDVKNMVNTAIRMFYSQTNIQSPPFKWVYVKSIECGYYTLK
         G G  E+R+R LC RL  Q +D                                                                   S ECG Y +K
Subjt:  VGSGPKENRSRALCKRL-RQVNDKLLICPFNPEYHWMLLVISIKTFTIYSIDSLKHDFRDDVKNMVNTAIRMFYSQTNIQSPPFKWVYVKSIECGYYTLK

Query:  FIRDIVSHKSRVITDVVLTRGTTFSQSEFNEIRVELCEYVAQYM
        F++DIV  KS  IT VVLTR   ++QSE + +RVE C+++  Y+
Subjt:  FIRDIVSHKSRVITDVVLTRGTTFSQSEFNEIRVELCEYVAQYM

XP_022136076.1 uncharacterized protein LOC111007859 isoform X1 [Momordica charantia]1.1e-4431.36Show/hide
Query:  SDEKPDQGRSCKLAVEDVKNIVVAGIVYKRKDDHEIVYGVPLTANYVRVLIDVVHDADAQLPIPIVGNIESVFDAVGSHVPWPKELVILDKEKKEPKKMH
        +DE+  +G+ C LAVE V NIV  G ++        V+GVPL  + VRV++D+V D  A +PIP+ G IE++   +G  V WP+ LVIL +EK       
Subjt:  SDEKPDQGRSCKLAVEDVKNIVVAGIVYKRKDDHEIVYGVPLTANYVRVLIDVVHDADAQLPIPIVGNIESVFDAVGSHVPWPKELVILDKEKKEPKKMH

Query:  RELAAIKEKKSNSQVSK---VSVAVKYVLGFI--ENMTRDYLDITIPDHVFDYHFDFHLMKNSMKEFCSMEELATTVITIYIAYLCE-FDPFILEQYAFL
          +++ +  ++ +Q+SK   V V++K +  ++       D ++I +   +F    + +L +N + ++C+M E+  + I  YIAYL + ++  I +++  +
Subjt:  RELAAIKEKKSNSQVSK---VSVAVKYVLGFI--ENMTRDYLDITIPDHVFDYHFDFHLMKNSMKEFCSMEELATTVITIYIAYLCE-FDPFILEQYAFL

Query:  HPTLISVGSGPKENRSRALCKRLRQVN-DKLLICPFNPEYHWMLLVISIKTFTIYSIDSLKHDFRDDVKNMVNTAIRMFYSQTNIQSPPFK--WVYVK--
         P  IS     +E R R L  RL  VN ++L++ P+    HWML++I+++   +Y +DSL+   ++D + ++NT+++++ ++ +IQ       W  +K  
Subjt:  HPTLISVGSGPKENRSRALCKRLRQVN-DKLLICPFNPEYHWMLLVISIKTFTIYSIDSLKHDFRDDVKNMVNTAIRMFYSQTNIQSPPFK--WVYVK--

Query:  ----SIECGYYTLKFIRDIVSHKSRVITDVVLTRGTTFSQSEFNEIRVELCEYV
            S+ECGYY  K+IR+IV + S  I+++  T+   + Q E +E+R+E  ++V
Subjt:  ----SIECGYYTLKFIRDIVSHKSRVITDVVLTRGTTFSQSEFNEIRVELCEYV

XP_022136077.1 uncharacterized protein LOC111007859 isoform X2 [Momordica charantia]1.1e-4431.36Show/hide
Query:  SDEKPDQGRSCKLAVEDVKNIVVAGIVYKRKDDHEIVYGVPLTANYVRVLIDVVHDADAQLPIPIVGNIESVFDAVGSHVPWPKELVILDKEKKEPKKMH
        +DE+  +G+ C LAVE V NIV  G ++        V+GVPL  + VRV++D+V D  A +PIP+ G IE++   +G  V WP+ LVIL +EK       
Subjt:  SDEKPDQGRSCKLAVEDVKNIVVAGIVYKRKDDHEIVYGVPLTANYVRVLIDVVHDADAQLPIPIVGNIESVFDAVGSHVPWPKELVILDKEKKEPKKMH

Query:  RELAAIKEKKSNSQVSK---VSVAVKYVLGFI--ENMTRDYLDITIPDHVFDYHFDFHLMKNSMKEFCSMEELATTVITIYIAYLCE-FDPFILEQYAFL
          +++ +  ++ +Q+SK   V V++K +  ++       D ++I +   +F    + +L +N + ++C+M E+  + I  YIAYL + ++  I +++  +
Subjt:  RELAAIKEKKSNSQVSK---VSVAVKYVLGFI--ENMTRDYLDITIPDHVFDYHFDFHLMKNSMKEFCSMEELATTVITIYIAYLCE-FDPFILEQYAFL

Query:  HPTLISVGSGPKENRSRALCKRLRQVN-DKLLICPFNPEYHWMLLVISIKTFTIYSIDSLKHDFRDDVKNMVNTAIRMFYSQTNIQSPPFK--WVYVK--
         P  IS     +E R R L  RL  VN ++L++ P+    HWML++I+++   +Y +DSL+   ++D + ++NT+++++ ++ +IQ       W  +K  
Subjt:  HPTLISVGSGPKENRSRALCKRLRQVN-DKLLICPFNPEYHWMLLVISIKTFTIYSIDSLKHDFRDDVKNMVNTAIRMFYSQTNIQSPPFK--WVYVK--

Query:  ----SIECGYYTLKFIRDIVSHKSRVITDVVLTRGTTFSQSEFNEIRVELCEYV
            S+ECGYY  K+IR+IV + S  I+++  T+   + Q E +E+R+E  ++V
Subjt:  ----SIECGYYTLKFIRDIVSHKSRVITDVVLTRGTTFSQSEFNEIRVELCEYV

XP_022136080.1 uncharacterized protein LOC111007859 isoform X4 [Momordica charantia]1.6e-4031.79Show/hide
Query:  SDEKPDQGRSCKLAVEDVKNIVVAGIVYKRKDDHEIVYGVPLTANYVRVLIDVVHDADAQLPIPIVGNIESVFDAVGSHVPWPKELVILDKEKKEPKKMH
        +DE+  +G+ C LAVE V NIV  G ++        V+GVPL  + VRV++D+V D  A +PIP+ G IE++   +G  V WP+ LVIL +EK       
Subjt:  SDEKPDQGRSCKLAVEDVKNIVVAGIVYKRKDDHEIVYGVPLTANYVRVLIDVVHDADAQLPIPIVGNIESVFDAVGSHVPWPKELVILDKEKKEPKKMH

Query:  RELAAIKEKKSNSQVSK---VSVAVKYVLGFI--ENMTRDYLDITIPDHVFDYHFDFHLMKNSMKEFCSMEELATTVITIYIAYLCE-FDPFILEQYAFL
          +++ +  ++ +Q+SK   V V++K +  ++       D ++I +   +F    + +L +N + ++C+M E+  + I  YIAYL + ++  I +++  +
Subjt:  RELAAIKEKKSNSQVSK---VSVAVKYVLGFI--ENMTRDYLDITIPDHVFDYHFDFHLMKNSMKEFCSMEELATTVITIYIAYLCE-FDPFILEQYAFL

Query:  HPTLISVGSGPKENRSRALCKRLRQVN-DKLLICPFNPEYHWMLLVISIKTFTIYSIDSLKHDFRDDVKNMVNTAIRMFYSQTNIQSPPFK--WVYVK--
         P  IS     +E R R L  RL  VN ++L++ P+    HWML++I+++   +Y +DSL+   ++D + ++NT+++++ ++ +IQ       W  +K  
Subjt:  HPTLISVGSGPKENRSRALCKRLRQVN-DKLLICPFNPEYHWMLLVISIKTFTIYSIDSLKHDFRDDVKNMVNTAIRMFYSQTNIQSPPFK--WVYVK--

Query:  ----SIECGYYTLKFIRDIVSHKS
            S+ECGYY  K+IR+IV + S
Subjt:  ----SIECGYYTLKFIRDIVSHKS

TrEMBL top hitse value%identityAlignment
A0A5A7VPW2 DUF4218 domain-containing protein1.3e-4038.49Show/hide
Query:  VRVLIDVVHDADAQLPIPIVGNIESVFDAVGSHVPWPKELVILDKEKKEPKKMHRELAAIKEKKSNSQVSKVSVAVKYVLGFIE-NMTRDYLDITI-PDH
        VRVLI++  D+ + LPIP+VG+I SV DA+GSHVPW K L+ ++++KK   K  REL A+K  KS     K+ + +++VL  +E +M  +YL I +    
Subjt:  VRVLIDVVHDADAQLPIPIVGNIESVFDAVGSHVPWPKELVILDKEKKEPKKMHRELAAIKEKKSNSQVSKVSVAVKYVLGFIE-NMTRDYLDITI-PDH

Query:  VFDYHFDFHLMKNSMKEFCSMEELATTVITIYIAYLCEFDPFILEQYAFLHPTLISVGSGPKENRSRALCKRLRQVNDKLLICPFNPEYHWMLLVISIKT
        +  Y F+ ++MK+S+K+ C MEEL  +VI  Y+  L E DP I+E+YAF++P  I                                             
Subjt:  VFDYHFDFHLMKNSMKEFCSMEELATTVITIYIAYLCEFDPFILEQYAFLHPTLISVGSGPKENRSRALCKRLRQVNDKLLICPFNPEYHWMLLVISIKT

Query:  FTIYSIDSLKH-DFRDDVKNMVNTAIRMFYSQTNIQSPPFKWVYVK------SIECGYYTLKFIRDIVSHKSRVITDV
          +Y++DSL+H   RD++K+MVNT +RMFY++TN ++ P  WV  K      S ECGYY +KF++DIV  KS  ITDV
Subjt:  FTIYSIDSLKH-DFRDDVKNMVNTAIRMFYSQTNIQSPPFKWVYVK------SIECGYYTLKFIRDIVSHKSRVITDV

A0A5D3D5S7 Uncharacterized protein6.1e-4637.79Show/hide
Query:  EKPDQGRSCKLAVEDVKNIVVAGIVYKRKDDHEIVYGVPLTANYVRVLIDVVHDADAQLPIPIVGNIESVFDAVGSHVPWPKELVILDKEKKEPKKMHRE
        E   +GR C LAV  V NIV  GIV++R   +EIVY V L    V V I++  D+ + LPIPIVGNI SV DA+GSHV W K L++++++KK   K  RE
Subjt:  EKPDQGRSCKLAVEDVKNIVVAGIVYKRKDDHEIVYGVPLTANYVRVLIDVVHDADAQLPIPIVGNIESVFDAVGSHVPWPKELVILDKEKKEPKKMHRE

Query:  LAAIKEKKSNSQVSKVSVAVKYVLGFIE-NMTRDYLDITI-PDHVFDYHFDFHLMKNSMKEFCSMEELATTVITIYIAYLCEFDPFILEQYAFLHPTLIS
        L A+K  K      K+S  +++VL  +E +M  +YL I +    +F Y F+ ++MK+S+K+ C MEELA +VI  YI  L E DP ILE+Y F++P  IS
Subjt:  LAAIKEKKSNSQVSKVSVAVKYVLGFIE-NMTRDYLDITI-PDHVFDYHFDFHLMKNSMKEFCSMEELATTVITIYIAYLCEFDPFILEQYAFLHPTLIS

Query:  VGSGPKENRSRALCKRL-RQVNDKLLICPFNPEYHWMLLVISIKTFTIYSIDSLKHDFRDDVKNMVNTAIRMFYSQTNIQSPPFKWVYVKSIECGYYTLK
         G G  E+R+R LC RL  Q +D                                                                   S ECG Y +K
Subjt:  VGSGPKENRSRALCKRL-RQVNDKLLICPFNPEYHWMLLVISIKTFTIYSIDSLKHDFRDDVKNMVNTAIRMFYSQTNIQSPPFKWVYVKSIECGYYTLK

Query:  FIRDIVSHKSRVITDVVLTRGTTFSQSEFNEIRVELCEYVAQYM
        F++DIV  KS  IT VVLTR   ++QSE + +RVE C+++  Y+
Subjt:  FIRDIVSHKSRVITDVVLTRGTTFSQSEFNEIRVELCEYVAQYM

A0A6J1C2H7 uncharacterized protein LOC111007859 isoform X15.1e-4531.36Show/hide
Query:  SDEKPDQGRSCKLAVEDVKNIVVAGIVYKRKDDHEIVYGVPLTANYVRVLIDVVHDADAQLPIPIVGNIESVFDAVGSHVPWPKELVILDKEKKEPKKMH
        +DE+  +G+ C LAVE V NIV  G ++        V+GVPL  + VRV++D+V D  A +PIP+ G IE++   +G  V WP+ LVIL +EK       
Subjt:  SDEKPDQGRSCKLAVEDVKNIVVAGIVYKRKDDHEIVYGVPLTANYVRVLIDVVHDADAQLPIPIVGNIESVFDAVGSHVPWPKELVILDKEKKEPKKMH

Query:  RELAAIKEKKSNSQVSK---VSVAVKYVLGFI--ENMTRDYLDITIPDHVFDYHFDFHLMKNSMKEFCSMEELATTVITIYIAYLCE-FDPFILEQYAFL
          +++ +  ++ +Q+SK   V V++K +  ++       D ++I +   +F    + +L +N + ++C+M E+  + I  YIAYL + ++  I +++  +
Subjt:  RELAAIKEKKSNSQVSK---VSVAVKYVLGFI--ENMTRDYLDITIPDHVFDYHFDFHLMKNSMKEFCSMEELATTVITIYIAYLCE-FDPFILEQYAFL

Query:  HPTLISVGSGPKENRSRALCKRLRQVN-DKLLICPFNPEYHWMLLVISIKTFTIYSIDSLKHDFRDDVKNMVNTAIRMFYSQTNIQSPPFK--WVYVK--
         P  IS     +E R R L  RL  VN ++L++ P+    HWML++I+++   +Y +DSL+   ++D + ++NT+++++ ++ +IQ       W  +K  
Subjt:  HPTLISVGSGPKENRSRALCKRLRQVN-DKLLICPFNPEYHWMLLVISIKTFTIYSIDSLKHDFRDDVKNMVNTAIRMFYSQTNIQSPPFK--WVYVK--

Query:  ----SIECGYYTLKFIRDIVSHKSRVITDVVLTRGTTFSQSEFNEIRVELCEYV
            S+ECGYY  K+IR+IV + S  I+++  T+   + Q E +E+R+E  ++V
Subjt:  ----SIECGYYTLKFIRDIVSHKSRVITDVVLTRGTTFSQSEFNEIRVELCEYV

A0A6J1C2V2 uncharacterized protein LOC111007859 isoform X47.7e-4131.79Show/hide
Query:  SDEKPDQGRSCKLAVEDVKNIVVAGIVYKRKDDHEIVYGVPLTANYVRVLIDVVHDADAQLPIPIVGNIESVFDAVGSHVPWPKELVILDKEKKEPKKMH
        +DE+  +G+ C LAVE V NIV  G ++        V+GVPL  + VRV++D+V D  A +PIP+ G IE++   +G  V WP+ LVIL +EK       
Subjt:  SDEKPDQGRSCKLAVEDVKNIVVAGIVYKRKDDHEIVYGVPLTANYVRVLIDVVHDADAQLPIPIVGNIESVFDAVGSHVPWPKELVILDKEKKEPKKMH

Query:  RELAAIKEKKSNSQVSK---VSVAVKYVLGFI--ENMTRDYLDITIPDHVFDYHFDFHLMKNSMKEFCSMEELATTVITIYIAYLCE-FDPFILEQYAFL
          +++ +  ++ +Q+SK   V V++K +  ++       D ++I +   +F    + +L +N + ++C+M E+  + I  YIAYL + ++  I +++  +
Subjt:  RELAAIKEKKSNSQVSK---VSVAVKYVLGFI--ENMTRDYLDITIPDHVFDYHFDFHLMKNSMKEFCSMEELATTVITIYIAYLCE-FDPFILEQYAFL

Query:  HPTLISVGSGPKENRSRALCKRLRQVN-DKLLICPFNPEYHWMLLVISIKTFTIYSIDSLKHDFRDDVKNMVNTAIRMFYSQTNIQSPPFK--WVYVK--
         P  IS     +E R R L  RL  VN ++L++ P+    HWML++I+++   +Y +DSL+   ++D + ++NT+++++ ++ +IQ       W  +K  
Subjt:  HPTLISVGSGPKENRSRALCKRLRQVN-DKLLICPFNPEYHWMLLVISIKTFTIYSIDSLKHDFRDDVKNMVNTAIRMFYSQTNIQSPPFK--WVYVK--

Query:  ----SIECGYYTLKFIRDIVSHKS
            S+ECGYY  K+IR+IV + S
Subjt:  ----SIECGYYTLKFIRDIVSHKS

A0A6J1C4J7 uncharacterized protein LOC111007859 isoform X25.1e-4531.36Show/hide
Query:  SDEKPDQGRSCKLAVEDVKNIVVAGIVYKRKDDHEIVYGVPLTANYVRVLIDVVHDADAQLPIPIVGNIESVFDAVGSHVPWPKELVILDKEKKEPKKMH
        +DE+  +G+ C LAVE V NIV  G ++        V+GVPL  + VRV++D+V D  A +PIP+ G IE++   +G  V WP+ LVIL +EK       
Subjt:  SDEKPDQGRSCKLAVEDVKNIVVAGIVYKRKDDHEIVYGVPLTANYVRVLIDVVHDADAQLPIPIVGNIESVFDAVGSHVPWPKELVILDKEKKEPKKMH

Query:  RELAAIKEKKSNSQVSK---VSVAVKYVLGFI--ENMTRDYLDITIPDHVFDYHFDFHLMKNSMKEFCSMEELATTVITIYIAYLCE-FDPFILEQYAFL
          +++ +  ++ +Q+SK   V V++K +  ++       D ++I +   +F    + +L +N + ++C+M E+  + I  YIAYL + ++  I +++  +
Subjt:  RELAAIKEKKSNSQVSK---VSVAVKYVLGFI--ENMTRDYLDITIPDHVFDYHFDFHLMKNSMKEFCSMEELATTVITIYIAYLCE-FDPFILEQYAFL

Query:  HPTLISVGSGPKENRSRALCKRLRQVN-DKLLICPFNPEYHWMLLVISIKTFTIYSIDSLKHDFRDDVKNMVNTAIRMFYSQTNIQSPPFK--WVYVK--
         P  IS     +E R R L  RL  VN ++L++ P+    HWML++I+++   +Y +DSL+   ++D + ++NT+++++ ++ +IQ       W  +K  
Subjt:  HPTLISVGSGPKENRSRALCKRLRQVN-DKLLICPFNPEYHWMLLVISIKTFTIYSIDSLKHDFRDDVKNMVNTAIRMFYSQTNIQSPPFK--WVYVK--

Query:  ----SIECGYYTLKFIRDIVSHKSRVITDVVLTRGTTFSQSEFNEIRVELCEYV
            S+ECGYY  K+IR+IV + S  I+++  T+   + Q E +E+R+E  ++V
Subjt:  ----SIECGYYTLKFIRDIVSHKSRVITDVVLTRGTTFSQSEFNEIRVELCEYV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAAATCAAGGTAGTTGCCCACAAGTACCAGACCCTCAACCTTCTGATGAAAAACCTGACCAAGGGAGATCATGTAAACTTGCTGTTGAAGATGTAAAGAATATTGT
GGTGGCAGGAATTGTATATAAGAGAAAAGACGACCATGAAATTGTTTATGGGGTCCCACTCACAGCAAACTATGTGCGAGTATTGATTGATGTTGTACATGACGCTGATG
CTCAATTGCCCATACCTATAGTTGGAAATATTGAGTCTGTATTCGACGCAGTAGGCTCTCATGTTCCATGGCCTAAAGAACTCGTAATTCTGGACAAAGAAAAAAAGGAA
CCTAAGAAAATGCATCGTGAATTAGCTGCAATAAAGGAGAAGAAGTCAAATAGTCAAGTGTCAAAAGTGTCGGTAGCAGTGAAATACGTATTAGGATTTATAGAGAACAT
GACAAGGGATTACTTAGATATTACAATACCCGATCACGTGTTCGACTACCACTTCGACTTCCATCTAATGAAAAATAGCATGAAAGAATTTTGCTCGATGGAGGAATTAG
CAACAACTGTTATAACAATTTATATCGCATACCTATGCGAGTTTGACCCATTCATCTTAGAGCAATATGCATTTCTACATCCAACATTGATCTCAGTTGGCTCGGGACCT
AAAGAGAACCGCTCTCGAGCCCTATGTAAAAGGTTAAGACAGGTCAATGACAAATTATTGATATGTCCTTTTAATCCCGAATATCATTGGATGTTGTTGGTGATATCGAT
AAAAACATTCACAATTTATTCAATTGACTCCCTGAAGCATGACTTTCGTGATGATGTAAAGAACATGGTTAATACGGCTATAAGAATGTTCTATTCTCAAACTAATATAC
AAAGTCCCCCCTTCAAATGGGTATATGTTAAGTCTATAGAGTGTGGATACTACACTCTTAAATTTATACGAGATATAGTATCCCATAAGAGCCGAGTGATTACAGATGTG
GTATTGACGAGAGGAACTACATTTAGCCAATCAGAATTCAATGAGATAAGAGTTGAATTATGTGAATATGTAGCACAATACATGAGATGCTTTGAAGGTTGTGTTCTGGG
TTTAGAAGTGCATATTTCTGTTCTTTCA
mRNA sequenceShow/hide mRNA sequence
ATGTCAAATCAAGGTAGTTGCCCACAAGTACCAGACCCTCAACCTTCTGATGAAAAACCTGACCAAGGGAGATCATGTAAACTTGCTGTTGAAGATGTAAAGAATATTGT
GGTGGCAGGAATTGTATATAAGAGAAAAGACGACCATGAAATTGTTTATGGGGTCCCACTCACAGCAAACTATGTGCGAGTATTGATTGATGTTGTACATGACGCTGATG
CTCAATTGCCCATACCTATAGTTGGAAATATTGAGTCTGTATTCGACGCAGTAGGCTCTCATGTTCCATGGCCTAAAGAACTCGTAATTCTGGACAAAGAAAAAAAGGAA
CCTAAGAAAATGCATCGTGAATTAGCTGCAATAAAGGAGAAGAAGTCAAATAGTCAAGTGTCAAAAGTGTCGGTAGCAGTGAAATACGTATTAGGATTTATAGAGAACAT
GACAAGGGATTACTTAGATATTACAATACCCGATCACGTGTTCGACTACCACTTCGACTTCCATCTAATGAAAAATAGCATGAAAGAATTTTGCTCGATGGAGGAATTAG
CAACAACTGTTATAACAATTTATATCGCATACCTATGCGAGTTTGACCCATTCATCTTAGAGCAATATGCATTTCTACATCCAACATTGATCTCAGTTGGCTCGGGACCT
AAAGAGAACCGCTCTCGAGCCCTATGTAAAAGGTTAAGACAGGTCAATGACAAATTATTGATATGTCCTTTTAATCCCGAATATCATTGGATGTTGTTGGTGATATCGAT
AAAAACATTCACAATTTATTCAATTGACTCCCTGAAGCATGACTTTCGTGATGATGTAAAGAACATGGTTAATACGGCTATAAGAATGTTCTATTCTCAAACTAATATAC
AAAGTCCCCCCTTCAAATGGGTATATGTTAAGTCTATAGAGTGTGGATACTACACTCTTAAATTTATACGAGATATAGTATCCCATAAGAGCCGAGTGATTACAGATGTG
GTATTGACGAGAGGAACTACATTTAGCCAATCAGAATTCAATGAGATAAGAGTTGAATTATGTGAATATGTAGCACAATACATGAGATGCTTTGAAGGTTGTGTTCTGGG
TTTAGAAGTGCATATTTCTGTTCTTTCA
Protein sequenceShow/hide protein sequence
MSNQGSCPQVPDPQPSDEKPDQGRSCKLAVEDVKNIVVAGIVYKRKDDHEIVYGVPLTANYVRVLIDVVHDADAQLPIPIVGNIESVFDAVGSHVPWPKELVILDKEKKE
PKKMHRELAAIKEKKSNSQVSKVSVAVKYVLGFIENMTRDYLDITIPDHVFDYHFDFHLMKNSMKEFCSMEELATTVITIYIAYLCEFDPFILEQYAFLHPTLISVGSGP
KENRSRALCKRLRQVNDKLLICPFNPEYHWMLLVISIKTFTIYSIDSLKHDFRDDVKNMVNTAIRMFYSQTNIQSPPFKWVYVKSIECGYYTLKFIRDIVSHKSRVITDV
VLTRGTTFSQSEFNEIRVELCEYVAQYMRCFEGCVLGLEVHISVLS