; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg015122 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg015122
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionNon-specific serine/threonine protein kinase
Genome locationscaffold3:37472370..37490238
RNA-Seq ExpressionSpg015122
SyntenySpg015122
Gene Ontology termsGO:0016310 - phosphorylation (biological process)
GO:0019222 - regulation of metabolic process (biological process)
GO:0004672 - protein kinase activity (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR006575 - RWD domain
IPR016135 - Ubiquitin-conjugating enzyme/RWD-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6592986.1 eIF-2-alpha kinase GCN2, partial [Cucurbita argyrosperma subsp. sororia]4.0e-4356.19Show/hide
Query:  MGHSSKKKRR--GGGGKKNKGRTPLKDYSLSGEESELLSEEITALNRMKKLKSGLMNTGVAIPYPSPSTWAPSRGWLIALNSNVRDSPSVTSLGFMLLLG
        MGHSSKKKRR  GGGGK++KGRTP KD+S SGEESEL+SEE+TAL                                                       
Subjt:  MGHSSKKKRR--GGGGKKNKGRTPLKDYSLSGEESELLSEEITALNRMKKLKSGLMNTGVAIPYPSPSTWAPSRGWLIALNSNVRDSPSVTSLGFMLLLG

Query:  ANPFPLIVVGLSWVFRCAIFQEDCKVVTGSPPQVTIKLRPYSNDMGFEDLDVSALLSVKYLPGYPYKCPKLLITPEKGLAKGDTEKLLSLLHEQ
                        C IFQEDCKVV+G  PQVTIKLRPYSNDMGFEDLDVSA LSVKYLPGYPYKCPKLLITPEKGLAKGDTEKLLSLLHEQ
Subjt:  ANPFPLIVVGLSWVFRCAIFQEDCKVVTGSPPQVTIKLRPYSNDMGFEDLDVSALLSVKYLPGYPYKCPKLLITPEKGLAKGDTEKLLSLLHEQ

KAG7025395.1 eIF-2-alpha kinase GCN2, partial [Cucurbita argyrosperma subsp. argyrosperma]9.5e-5339.19Show/hide
Query:  MGHSSKKKRR--GGGGKKNKGRTPLKDYSLSGEESELLSEEITALNRMKKLKSGLMNTGVAIPYPSPSTWAPSRGWLIALNSNVRDSPSVTSLGFMLLLG
        MGHSSKKKRR  GGGGK++KGRTP KD+S SGEESEL+SEE+TAL                                                       
Subjt:  MGHSSKKKRR--GGGGKKNKGRTPLKDYSLSGEESELLSEEITALNRMKKLKSGLMNTGVAIPYPSPSTWAPSRGWLIALNSNVRDSPSVTSLGFMLLLG

Query:  ANPFPLIVVGLSWVFRCAIFQEDCKVVTGSPPQVTIKLRPYSNDMGFEDLDVSALLSV----KYLPGYPYKCPKLLITPEKGLAKGDTEKLLSLLHEQGP
                        C IFQEDCKVV+G  PQVTIKLRPYSNDMGFEDLDVSA LSV    KYLPGYPYKCPKLLITPEKGLAKGDTEKLLSLLHEQ  
Subjt:  ANPFPLIVVGLSWVFRCAIFQEDCKVVTGSPPQVTIKLRPYSNDMGFEDLDVSALLSV----KYLPGYPYKCPKLLITPEKGLAKGDTEKLLSLLHEQGP

Query:  CVKETKDVLSLLSLIGEVIFRLERRDVRLWS--PNPIEGFSCPSLFHNWFCPALVARY-RGFRQMLEEFFLHLSFWEKRKFLCKRAVVSCGERTVIVSAS
          K  +D      L+ E +   +   +  W     P++ F    +   W    +   Y   F  + +           ++ LC                 
Subjt:  CVKETKDVLSLLSLIGEVIFRLERRDVRLWS--PNPIEGFSCPSLFHNWFCPALVARY-RGFRQMLEEFFLHLSFWEKRKFLCKRAVVSCGERTVIVSAS

Query:  LPVDVESPPTLVETFLDAIEEASKITRSDTNLWCMQVEVILAAMHDEHFFKEANYNARDGRIMIFNLVEAAQEFLSEIVTIGQSNESACFFASLCGIWLE
                        +A+E+       D  + C Q                A+YNARDGRIMIFNL EAAQEFLSEIVTIGQSNES+           E
Subjt:  LPVDVESPPTLVETFLDAIEEASKITRSDTNLWCMQVEVILAAMHDEHFFKEANYNARDGRIMIFNLVEAAQEFLSEIVTIGQSNESACFFASLCGIWLE

Query:  RNSRIFRGVENSTSELWETIK
          +++    E ST++L +  K
Subjt:  RNSRIFRGVENSTSELWETIK

XP_022959907.1 eIF-2-alpha kinase GCN2 isoform X1 [Cucurbita moschata]4.0e-4356.19Show/hide
Query:  MGHSSKKKRR--GGGGKKNKGRTPLKDYSLSGEESELLSEEITALNRMKKLKSGLMNTGVAIPYPSPSTWAPSRGWLIALNSNVRDSPSVTSLGFMLLLG
        MGHSSKKKRR  GGGGK++KGRTP KD+S SGEESEL+SEE+TAL                                                       
Subjt:  MGHSSKKKRR--GGGGKKNKGRTPLKDYSLSGEESELLSEEITALNRMKKLKSGLMNTGVAIPYPSPSTWAPSRGWLIALNSNVRDSPSVTSLGFMLLLG

Query:  ANPFPLIVVGLSWVFRCAIFQEDCKVVTGSPPQVTIKLRPYSNDMGFEDLDVSALLSVKYLPGYPYKCPKLLITPEKGLAKGDTEKLLSLLHEQ
                        C IFQEDCKVV+G  PQVTIKLRPYSNDMGFEDLDVSA LSVKYLPGYPYKCPKLLITPEKGLAKGDTEKLLSLLHEQ
Subjt:  ANPFPLIVVGLSWVFRCAIFQEDCKVVTGSPPQVTIKLRPYSNDMGFEDLDVSALLSVKYLPGYPYKCPKLLITPEKGLAKGDTEKLLSLLHEQ

XP_023004333.1 eIF-2-alpha kinase GCN2 isoform X1 [Cucurbita maxima]1.2e-4255.67Show/hide
Query:  MGHSSKKKRR--GGGGKKNKGRTPLKDYSLSGEESELLSEEITALNRMKKLKSGLMNTGVAIPYPSPSTWAPSRGWLIALNSNVRDSPSVTSLGFMLLLG
        MGHSSKKKRR  GGGGK++KGRTP KD+S SGEE EL+SEE+TAL                                                       
Subjt:  MGHSSKKKRR--GGGGKKNKGRTPLKDYSLSGEESELLSEEITALNRMKKLKSGLMNTGVAIPYPSPSTWAPSRGWLIALNSNVRDSPSVTSLGFMLLLG

Query:  ANPFPLIVVGLSWVFRCAIFQEDCKVVTGSPPQVTIKLRPYSNDMGFEDLDVSALLSVKYLPGYPYKCPKLLITPEKGLAKGDTEKLLSLLHEQ
                        C IFQEDCKVV+G  PQVTIKLRPYSNDMGFEDLDVSA LSVKYLPGYPYKCPKLLITPEKGLAKGDTEKLLSLLHEQ
Subjt:  ANPFPLIVVGLSWVFRCAIFQEDCKVVTGSPPQVTIKLRPYSNDMGFEDLDVSALLSVKYLPGYPYKCPKLLITPEKGLAKGDTEKLLSLLHEQ

XP_023514310.1 eIF-2-alpha kinase GCN2 isoform X1 [Cucurbita pepo subsp. pepo]4.0e-4356.19Show/hide
Query:  MGHSSKKKRR--GGGGKKNKGRTPLKDYSLSGEESELLSEEITALNRMKKLKSGLMNTGVAIPYPSPSTWAPSRGWLIALNSNVRDSPSVTSLGFMLLLG
        MGHSSKKKRR  GGGGK++KGRTP KD+S SGEESEL+SEE+TAL                                                       
Subjt:  MGHSSKKKRR--GGGGKKNKGRTPLKDYSLSGEESELLSEEITALNRMKKLKSGLMNTGVAIPYPSPSTWAPSRGWLIALNSNVRDSPSVTSLGFMLLLG

Query:  ANPFPLIVVGLSWVFRCAIFQEDCKVVTGSPPQVTIKLRPYSNDMGFEDLDVSALLSVKYLPGYPYKCPKLLITPEKGLAKGDTEKLLSLLHEQ
                        C IFQEDCKVV+G  PQVTIKLRPYSNDMGFEDLDVSA LSVKYLPGYPYKCPKLLITPEKGLAKGDTEKLLSLLHEQ
Subjt:  ANPFPLIVVGLSWVFRCAIFQEDCKVVTGSPPQVTIKLRPYSNDMGFEDLDVSALLSVKYLPGYPYKCPKLLITPEKGLAKGDTEKLLSLLHEQ

TrEMBL top hitse value%identityAlignment
A0A0A0K8G5 RWD domain-containing protein9.6e-4336.75Show/hide
Query:  MGHSSKKKRRGGG--GKKNKGRTPLKDYSLSGEESELLSEEITALNRMKKLKSGLMNTGVAIPYPSPSTWAPSRGWLIALNSNVRDSPSVTSLGFMLLLG
        MG SSKKKRRGGG  GK++KGRTPL DYS SGEES+L++EE+TAL                                                       
Subjt:  MGHSSKKKRRGGG--GKKNKGRTPLKDYSLSGEESELLSEEITALNRMKKLKSGLMNTGVAIPYPSPSTWAPSRGWLIALNSNVRDSPSVTSLGFMLLLG

Query:  ANPFPLIVVGLSWVFRCAIFQEDCKVVTGSPPQVTIKLRPYSNDMGFEDLDVSALLSVKYLPGYPYKCPKLLITPEKGLAKGDTEKLLSLLHEQGPCVKE
                        CAIFQEDCKVVTG  PQVTIKL+PYSNDMGFED DVSAL SVKYLPGYPYKCPKLLITPE+GLAKGDTEKLLSLLHEQ      
Subjt:  ANPFPLIVVGLSWVFRCAIFQEDCKVVTGSPPQVTIKLRPYSNDMGFEDLDVSALLSVKYLPGYPYKCPKLLITPEKGLAKGDTEKLLSLLHEQGPCVKE

Query:  TKDVLSLLSLIGEVIFRLERRDVRLWSPNPIEGFSCPSLFHNWFCPALVARYRGFRQMLEEFFLHLSFWEKRKFLCKRAVVSCGERTVIVSASLPVDVES
                                                                                                            
Subjt:  TKDVLSLLSLIGEVIFRLERRDVRLWSPNPIEGFSCPSLFHNWFCPALVARYRGFRQMLEEFFLHLSFWEKRKFLCKRAVVSCGERTVIVSASLPVDVES

Query:  PPTLVETFLDAIEEASKITRSDTNLWCMQVEVILAAMHDEHFFKEANYNARDGRIMIFNLVEAAQEFLSEIVTIGQSNESA
                                                     ANYNARDGRIMIFNL EAAQEFLSEIVTIG+SNESA
Subjt:  PPTLVETFLDAIEEASKITRSDTNLWCMQVEVILAAMHDEHFFKEANYNARDGRIMIFNLVEAAQEFLSEIVTIGQSNESA

A0A1S3BCY3 Non-specific serine/threonine protein kinase1.6e-4236.84Show/hide
Query:  MGHSSKKKRRGG--GGKKNKGRTPLKDYSLSGEESELLSEEITALNRMKKLKSGLMNTGVAIPYPSPSTWAPSRGWLIALNSNVRDSPSVTSLGFMLLLG
        MG SSKKKRRGG  GGK++KGRTPL DYS SGEES+L++EEITAL                                                       
Subjt:  MGHSSKKKRRGG--GGKKNKGRTPLKDYSLSGEESELLSEEITALNRMKKLKSGLMNTGVAIPYPSPSTWAPSRGWLIALNSNVRDSPSVTSLGFMLLLG

Query:  ANPFPLIVVGLSWVFRCAIFQEDCKVVTGSPPQVTIKLRPYSNDMGFEDLDVSALLSVKYLPGYPYKCPKLLITPEKGLAKGDTEKLLSLLHEQGPCVKE
                        CAIFQEDCKVVTG  PQVTIKL+PYSNDMGFED DVSALLSVKYLPGYPYKCPKLLITPE+GL KGDTEKLLSLLHEQ      
Subjt:  ANPFPLIVVGLSWVFRCAIFQEDCKVVTGSPPQVTIKLRPYSNDMGFEDLDVSALLSVKYLPGYPYKCPKLLITPEKGLAKGDTEKLLSLLHEQGPCVKE

Query:  TKDVLSLLSLIGEVIFRLERRDVRLWSPNPIEGFSCPSLFHNWFCPALVARYRGFRQMLEEFFLHLSFWEKRKFLCKRAVVSCGERTVIVSASLPVDVES
                                                                                                            
Subjt:  TKDVLSLLSLIGEVIFRLERRDVRLWSPNPIEGFSCPSLFHNWFCPALVARYRGFRQMLEEFFLHLSFWEKRKFLCKRAVVSCGERTVIVSASLPVDVES

Query:  PPTLVETFLDAIEEASKITRSDTNLWCMQVEVILAAMHDEHFFKEANYNARDGRIMIFNLVEAAQEFLSEIVTIGQSNES
                                                     ANYNARDGRIMIFNL EAAQEFLSEIVTIG+SNES
Subjt:  PPTLVETFLDAIEEASKITRSDTNLWCMQVEVILAAMHDEHFFKEANYNARDGRIMIFNLVEAAQEFLSEIVTIGQSNES

A0A1S3BE48 eIF-2-alpha kinase GCN2 isoform X41.6e-4236.84Show/hide
Query:  MGHSSKKKRRGG--GGKKNKGRTPLKDYSLSGEESELLSEEITALNRMKKLKSGLMNTGVAIPYPSPSTWAPSRGWLIALNSNVRDSPSVTSLGFMLLLG
        MG SSKKKRRGG  GGK++KGRTPL DYS SGEES+L++EEITAL                                                       
Subjt:  MGHSSKKKRRGG--GGKKNKGRTPLKDYSLSGEESELLSEEITALNRMKKLKSGLMNTGVAIPYPSPSTWAPSRGWLIALNSNVRDSPSVTSLGFMLLLG

Query:  ANPFPLIVVGLSWVFRCAIFQEDCKVVTGSPPQVTIKLRPYSNDMGFEDLDVSALLSVKYLPGYPYKCPKLLITPEKGLAKGDTEKLLSLLHEQGPCVKE
                        CAIFQEDCKVVTG  PQVTIKL+PYSNDMGFED DVSALLSVKYLPGYPYKCPKLLITPE+GL KGDTEKLLSLLHEQ      
Subjt:  ANPFPLIVVGLSWVFRCAIFQEDCKVVTGSPPQVTIKLRPYSNDMGFEDLDVSALLSVKYLPGYPYKCPKLLITPEKGLAKGDTEKLLSLLHEQGPCVKE

Query:  TKDVLSLLSLIGEVIFRLERRDVRLWSPNPIEGFSCPSLFHNWFCPALVARYRGFRQMLEEFFLHLSFWEKRKFLCKRAVVSCGERTVIVSASLPVDVES
                                                                                                            
Subjt:  TKDVLSLLSLIGEVIFRLERRDVRLWSPNPIEGFSCPSLFHNWFCPALVARYRGFRQMLEEFFLHLSFWEKRKFLCKRAVVSCGERTVIVSASLPVDVES

Query:  PPTLVETFLDAIEEASKITRSDTNLWCMQVEVILAAMHDEHFFKEANYNARDGRIMIFNLVEAAQEFLSEIVTIGQSNES
                                                     ANYNARDGRIMIFNL EAAQEFLSEIVTIG+SNES
Subjt:  PPTLVETFLDAIEEASKITRSDTNLWCMQVEVILAAMHDEHFFKEANYNARDGRIMIFNLVEAAQEFLSEIVTIGQSNES

A0A6J1H5V7 Non-specific serine/threonine protein kinase1.9e-4356.19Show/hide
Query:  MGHSSKKKRR--GGGGKKNKGRTPLKDYSLSGEESELLSEEITALNRMKKLKSGLMNTGVAIPYPSPSTWAPSRGWLIALNSNVRDSPSVTSLGFMLLLG
        MGHSSKKKRR  GGGGK++KGRTP KD+S SGEESEL+SEE+TAL                                                       
Subjt:  MGHSSKKKRR--GGGGKKNKGRTPLKDYSLSGEESELLSEEITALNRMKKLKSGLMNTGVAIPYPSPSTWAPSRGWLIALNSNVRDSPSVTSLGFMLLLG

Query:  ANPFPLIVVGLSWVFRCAIFQEDCKVVTGSPPQVTIKLRPYSNDMGFEDLDVSALLSVKYLPGYPYKCPKLLITPEKGLAKGDTEKLLSLLHEQ
                        C IFQEDCKVV+G  PQVTIKLRPYSNDMGFEDLDVSA LSVKYLPGYPYKCPKLLITPEKGLAKGDTEKLLSLLHEQ
Subjt:  ANPFPLIVVGLSWVFRCAIFQEDCKVVTGSPPQVTIKLRPYSNDMGFEDLDVSALLSVKYLPGYPYKCPKLLITPEKGLAKGDTEKLLSLLHEQ

A0A6J1KZ64 Non-specific serine/threonine protein kinase5.6e-4355.67Show/hide
Query:  MGHSSKKKRR--GGGGKKNKGRTPLKDYSLSGEESELLSEEITALNRMKKLKSGLMNTGVAIPYPSPSTWAPSRGWLIALNSNVRDSPSVTSLGFMLLLG
        MGHSSKKKRR  GGGGK++KGRTP KD+S SGEE EL+SEE+TAL                                                       
Subjt:  MGHSSKKKRR--GGGGKKNKGRTPLKDYSLSGEESELLSEEITALNRMKKLKSGLMNTGVAIPYPSPSTWAPSRGWLIALNSNVRDSPSVTSLGFMLLLG

Query:  ANPFPLIVVGLSWVFRCAIFQEDCKVVTGSPPQVTIKLRPYSNDMGFEDLDVSALLSVKYLPGYPYKCPKLLITPEKGLAKGDTEKLLSLLHEQ
                        C IFQEDCKVV+G  PQVTIKLRPYSNDMGFEDLDVSA LSVKYLPGYPYKCPKLLITPEKGLAKGDTEKLLSLLHEQ
Subjt:  ANPFPLIVVGLSWVFRCAIFQEDCKVVTGSPPQVTIKLRPYSNDMGFEDLDVSALLSVKYLPGYPYKCPKLLITPEKGLAKGDTEKLLSLLHEQ

SwissProt top hitse value%identityAlignment
Q9LX30 eIF-2-alpha kinase GCN26.0e-2642.13Show/hide
Query:  MGHSS--KKKRRGGGGKKNKGRTPLKDY-SLSGEESELLSEEITALNRMKKLKSGLMNTGVAIPYPSPSTWAPSRGWLIALNSNVRDSPSVTSLGFMLLL
        MG SS  KKK+RGG G++ +    LKD+ S + E++ELLSEEITAL+                                                     
Subjt:  MGHSS--KKKRRGGGGKKNKGRTPLKDY-SLSGEESELLSEEITALNRMKKLKSGLMNTGVAIPYPSPSTWAPSRGWLIALNSNVRDSPSVTSLGFMLLL

Query:  GANPFPLIVVGLSWVFRCAIFQEDCKVVTG--SPPQVTIKLRPYSNDMGFEDLDVSALLSVKYLPGYPYKCPKLLITPEKGLAKGDTEKLLSLLHEQ
                          AIFQEDCKVV+   SPPQ+ IKLRPYS DMG+ED D+SA+L V+ LPGYPYKCPKL ITPE+GL   D EKLLSLL +Q
Subjt:  GANPFPLIVVGLSWVFRCAIFQEDCKVVTG--SPPQVTIKLRPYSNDMGFEDLDVSALLSVKYLPGYPYKCPKLLITPEKGLAKGDTEKLLSLLHEQ

Q9LX30 eIF-2-alpha kinase GCN24.1e-0649.15Show/hide
Query:  DTNLWCMQVEVILAAMHDEHFFKEANYNARDGRIMIFNLVEAAQEFLSEIVTIGQSNES
        +  L     E +L+ + D     +AN NAR+GR+MIFNLVEAAQEFLSEI+      ES
Subjt:  DTNLWCMQVEVILAAMHDEHFFKEANYNARDGRIMIFNLVEAAQEFLSEIVTIGQSNES

Arabidopsis top hitse value%identityAlignment
AT3G59410.1 protein kinase family protein4.3e-2742.13Show/hide
Query:  MGHSS--KKKRRGGGGKKNKGRTPLKDY-SLSGEESELLSEEITALNRMKKLKSGLMNTGVAIPYPSPSTWAPSRGWLIALNSNVRDSPSVTSLGFMLLL
        MG SS  KKK+RGG G++ +    LKD+ S + E++ELLSEEITAL+                                                     
Subjt:  MGHSS--KKKRRGGGGKKNKGRTPLKDY-SLSGEESELLSEEITALNRMKKLKSGLMNTGVAIPYPSPSTWAPSRGWLIALNSNVRDSPSVTSLGFMLLL

Query:  GANPFPLIVVGLSWVFRCAIFQEDCKVVTG--SPPQVTIKLRPYSNDMGFEDLDVSALLSVKYLPGYPYKCPKLLITPEKGLAKGDTEKLLSLLHEQ
                          AIFQEDCKVV+   SPPQ+ IKLRPYS DMG+ED D+SA+L V+ LPGYPYKCPKL ITPE+GL   D EKLLSLL +Q
Subjt:  GANPFPLIVVGLSWVFRCAIFQEDCKVVTG--SPPQVTIKLRPYSNDMGFEDLDVSALLSVKYLPGYPYKCPKLLITPEKGLAKGDTEKLLSLLHEQ

AT3G59410.1 protein kinase family protein2.9e-0749.15Show/hide
Query:  DTNLWCMQVEVILAAMHDEHFFKEANYNARDGRIMIFNLVEAAQEFLSEIVTIGQSNES
        +  L     E +L+ + D     +AN NAR+GR+MIFNLVEAAQEFLSEI+      ES
Subjt:  DTNLWCMQVEVILAAMHDEHFFKEANYNARDGRIMIFNLVEAAQEFLSEIVTIGQSNES

AT3G59410.2 protein kinase family protein2.4e-3046.7Show/hide
Query:  MGHSS--KKKRRGGGGKKNKGRTPLKDY-SLSGEESELLSEEITALNRMKKLKSGLMNTGVAIPYPSPSTWAPSRGWLIALNSNVRDSPSVTSLGFMLLL
        MG SS  KKK+RGG G++ +    LKD+ S + E++ELLSEEITALN         +N G+              G+         D   VT        
Subjt:  MGHSS--KKKRRGGGGKKNKGRTPLKDY-SLSGEESELLSEEITALNRMKKLKSGLMNTGVAIPYPSPSTWAPSRGWLIALNSNVRDSPSVTSLGFMLLL

Query:  GANPFPLIVVGLSWVFRCAIFQEDCKVVTG--SPPQVTIKLRPYSNDMGFEDLDVSALLSVKYLPGYPYKCPKLLITPEKGLAKGDTEKLLSLLHEQ
         +N F             AIFQEDCKVV+   SPPQ+ IKLRPYS DMG+ED D+SA+L V+ LPGYPYKCPKL ITPE+GL   D EKLLSLL +Q
Subjt:  GANPFPLIVVGLSWVFRCAIFQEDCKVVTG--SPPQVTIKLRPYSNDMGFEDLDVSALLSVKYLPGYPYKCPKLLITPEKGLAKGDTEKLLSLLHEQ

AT3G59410.2 protein kinase family protein2.9e-0749.15Show/hide
Query:  DTNLWCMQVEVILAAMHDEHFFKEANYNARDGRIMIFNLVEAAQEFLSEIVTIGQSNES
        +  L     E +L+ + D     +AN NAR+GR+MIFNLVEAAQEFLSEI+      ES
Subjt:  DTNLWCMQVEVILAAMHDEHFFKEANYNARDGRIMIFNLVEAAQEFLSEIVTIGQSNES


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGCATAGTTCCAAGAAGAAGCGGCGTGGTGGGGGTGGGAAGAAGAACAAGGGAAGGACGCCACTGAAAGATTACTCGTTGAGTGGTGAAGAGAGTGAGCTTCTCTC
AGAAGAGATTACTGCCTTGAATCGAATGAAAAAATTGAAGTCAGGGTTAATGAATACAGGTGTTGCAATTCCTTATCCTTCTCCCTCTACTTGGGCTCCATCGCGTGGTT
GGTTGATTGCTTTGAACTCCAATGTTAGGGATTCACCTTCCGTAACTTCCTTAGGGTTCATGCTCTTGCTAGGAGCAAACCCCTTTCCGTTAATAGTTGTTGGCTTGAGT
TGGGTATTTAGGTGTGCGATATTTCAAGAAGACTGCAAAGTTGTCACTGGCTCACCTCCTCAAGTTACCATCAAACTTAGGCCATATTCAAATGACATGGGGTTTGAAGA
TCTAGATGTATCTGCTCTTCTTTCGGTGAAGTATTTGCCTGGATACCCATACAAATGCCCAAAGTTGCTTATAACCCCAGAGAAAGGTTTGGCAAAAGGTGACACTGAAA
AGTTGCTTTCTCTTCTTCATGAACAGGGTCCTTGTGTCAAGGAAACTAAAGACGTCTTGTCTCTCTTATCTTTGATTGGGGAGGTCATTTTTAGACTTGAGAGGAGGGAT
GTTCGCCTTTGGAGTCCTAATCCCATTGAGGGATTTTCTTGCCCGTCGCTTTTTCATAACTGGTTTTGTCCTGCCCTGGTCGCGAGATATAGAGGCTTTCGACAGATGCT
TGAGGAATTCTTCCTCCATCTTTCGTTTTGGGAGAAGAGGAAGTTCTTATGCAAGCGTGCAGTCGTTTCTTGTGGTGAGAGGACGGTAATAGTCTCCGCCTCTTTGCCCG
TCGATGTTGAGTCTCCACCCACACTGGTTGAGACCTTCTTGGATGCCATCGAGGAAGCGTCCAAAATCACACGCTCTGATACCAACTTATGGTGTATGCAAGTGGAAGTT
ATTTTGGCTGCCATGCATGATGAACACTTTTTTAAAGAGGCAAATTATAACGCTCGAGACGGAAGGATAATGATTTTCAATTTGGTTGAGGCTGCTCAGGAGTTCTTGTC
AGAAATAGTAACTATAGGACAATCAAATGAATCGGCTTGCTTTTTTGCTTCTTTGTGTGGTATTTGGCTTGAGAGGAATAGTAGAATCTTTAGAGGAGTGGAGAATTCCA
CGTCTGAGCTTTGGGAGACGATTAAATTTAATTCTGTGAAACTTCTGAAGCAATCTAGAAAGGGAATTGTCCAAGAAATTTTCCTTTACTTGGACTTCGTGTCATGCGGG
TATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGGCATAGTTCCAAGAAGAAGCGGCGTGGTGGGGGTGGGAAGAAGAACAAGGGAAGGACGCCACTGAAAGATTACTCGTTGAGTGGTGAAGAGAGTGAGCTTCTCTC
AGAAGAGATTACTGCCTTGAATCGAATGAAAAAATTGAAGTCAGGGTTAATGAATACAGGTGTTGCAATTCCTTATCCTTCTCCCTCTACTTGGGCTCCATCGCGTGGTT
GGTTGATTGCTTTGAACTCCAATGTTAGGGATTCACCTTCCGTAACTTCCTTAGGGTTCATGCTCTTGCTAGGAGCAAACCCCTTTCCGTTAATAGTTGTTGGCTTGAGT
TGGGTATTTAGGTGTGCGATATTTCAAGAAGACTGCAAAGTTGTCACTGGCTCACCTCCTCAAGTTACCATCAAACTTAGGCCATATTCAAATGACATGGGGTTTGAAGA
TCTAGATGTATCTGCTCTTCTTTCGGTGAAGTATTTGCCTGGATACCCATACAAATGCCCAAAGTTGCTTATAACCCCAGAGAAAGGTTTGGCAAAAGGTGACACTGAAA
AGTTGCTTTCTCTTCTTCATGAACAGGGTCCTTGTGTCAAGGAAACTAAAGACGTCTTGTCTCTCTTATCTTTGATTGGGGAGGTCATTTTTAGACTTGAGAGGAGGGAT
GTTCGCCTTTGGAGTCCTAATCCCATTGAGGGATTTTCTTGCCCGTCGCTTTTTCATAACTGGTTTTGTCCTGCCCTGGTCGCGAGATATAGAGGCTTTCGACAGATGCT
TGAGGAATTCTTCCTCCATCTTTCGTTTTGGGAGAAGAGGAAGTTCTTATGCAAGCGTGCAGTCGTTTCTTGTGGTGAGAGGACGGTAATAGTCTCCGCCTCTTTGCCCG
TCGATGTTGAGTCTCCACCCACACTGGTTGAGACCTTCTTGGATGCCATCGAGGAAGCGTCCAAAATCACACGCTCTGATACCAACTTATGGTGTATGCAAGTGGAAGTT
ATTTTGGCTGCCATGCATGATGAACACTTTTTTAAAGAGGCAAATTATAACGCTCGAGACGGAAGGATAATGATTTTCAATTTGGTTGAGGCTGCTCAGGAGTTCTTGTC
AGAAATAGTAACTATAGGACAATCAAATGAATCGGCTTGCTTTTTTGCTTCTTTGTGTGGTATTTGGCTTGAGAGGAATAGTAGAATCTTTAGAGGAGTGGAGAATTCCA
CGTCTGAGCTTTGGGAGACGATTAAATTTAATTCTGTGAAACTTCTGAAGCAATCTAGAAAGGGAATTGTCCAAGAAATTTTCCTTTACTTGGACTTCGTGTCATGCGGG
TATTAG
Protein sequenceShow/hide protein sequence
MGHSSKKKRRGGGGKKNKGRTPLKDYSLSGEESELLSEEITALNRMKKLKSGLMNTGVAIPYPSPSTWAPSRGWLIALNSNVRDSPSVTSLGFMLLLGANPFPLIVVGLS
WVFRCAIFQEDCKVVTGSPPQVTIKLRPYSNDMGFEDLDVSALLSVKYLPGYPYKCPKLLITPEKGLAKGDTEKLLSLLHEQGPCVKETKDVLSLLSLIGEVIFRLERRD
VRLWSPNPIEGFSCPSLFHNWFCPALVARYRGFRQMLEEFFLHLSFWEKRKFLCKRAVVSCGERTVIVSASLPVDVESPPTLVETFLDAIEEASKITRSDTNLWCMQVEV
ILAAMHDEHFFKEANYNARDGRIMIFNLVEAAQEFLSEIVTIGQSNESACFFASLCGIWLERNSRIFRGVENSTSELWETIKFNSVKLLKQSRKGIVQEIFLYLDFVSCG
Y