; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G25900 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G25900
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionDual specificity protein kinase splA
Genome locationChr4:23152895..23156309
RNA-Seq ExpressionCSPI04G25900
SyntenyCSPI04G25900
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8649926.1 hypothetical protein Csa_011922 [Cucumis sativus]7.0e-16199.3Show/hide
Query:  MNSTDQLNFEAAAQISKPDEEPKKQVRRRRHSRRRLYKEVPLDMAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFPQFGQCFEAEGRRKS
        MNSTDQLNFEAAAQISKPDEEPKKQVRRRRHSRRRLYKEVPLDMAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFPQFGQCFEAEGRRKS
Subjt:  MNSTDQLNFEAAAQISKPDEEPKKQVRRRRHSRRRLYKEVPLDMAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFPQFGQCFEAEGRRKS

Query:  RRNPRIYPDCSYDCSFYLENGSGLVAPPPENLNTEIPIQTFDDDFKTLDTCSSFCSLSFWPPPSSYICPTLSCPDTHQELPKSVSLREEEGNLMASDVFW
        RRNPRIYPDCSYDCSFYLENGSGLVAPPPENLNTEIPIQTFDDDFKTLDTCSSFCSLSFWPPPSSYICPTLSCPDTHQELPKSVSLREEEGNLMASDVFW
Subjt:  RRNPRIYPDCSYDCSFYLENGSGLVAPPPENLNTEIPIQTFDDDFKTLDTCSSFCSLSFWPPPSSYICPTLSCPDTHQELPKSVSLREEEGNLMASDVFW

Query:  FNNDPTGVSEKDMQQEGVLEEEAMHAMADIKSMSMDVKALEIDGRHSSDNAMEFPDWLSINDDFLLQYSNYHCVEEDYLQDPDLSWY
        FNNDPTGVSEKDMQQEGVLEEEAMHAMADIKSMSMDVKALEIDGRHSSDNAMEFPDWLSINDDFLLQYSNYHCVEEDYLQDPDLS +
Subjt:  FNNDPTGVSEKDMQQEGVLEEEAMHAMADIKSMSMDVKALEIDGRHSSDNAMEFPDWLSINDDFLLQYSNYHCVEEDYLQDPDLSWY

KAG6608324.1 hypothetical protein SDJN03_01666, partial [Cucurbita argyrosperma subsp. sororia]3.8e-7460.63Show/hide
Query:  MNSTDQL-NFEAAAQISKPDEEP----KKQVRRRRHSRRRLYKEVPLDMAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFP-QFGQCFEA
        MNSTDQL NFE A +I +P  +P    KKQVRRRR S RRLYK++PL+MAEARREIVTALKLHRA STKE A+EQQQKQDQ+ K S P++P QF  CFE 
Subjt:  MNSTDQL-NFEAAAQISKPDEEP----KKQVRRRRHSRRRLYKEVPLDMAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFP-QFGQCFEA

Query:  EGRRKSRRNPRIYPDCSYDCSFYLENGSGLVAPPP--ENLNTEIPIQTFDDDFKTLDTCS--------SFCSLSFWPPPSSYICPTLS-CPDTHQELPKS
        E R KSRRNPRIYP    DCSFY ENGS  +APPP  ++L+ +IPIQT   +    DT S        SF SLSF  PPSSYICPT      THQE+PKS
Subjt:  EGRRKSRRNPRIYPDCSYDCSFYLENGSGLVAPPP--ENLNTEIPIQTFDDDFKTLDTCS--------SFCSLSFWPPPSSYICPTLS-CPDTHQELPKS

Query:  VSLREEEGNLMASDVFWFNNDPTGVSEKDMQ---QEGVLEEEAMHAMADIKSMSMDVKALEIDGR----------HSSDNAMEFPDWLSINDDFLLQYSN
        +SL EEEG LMASD+FW NN PTG SEK++    +E   EEEAM  +A+I+  SM+ K LEIDG+            S+ AMEFPDWLSINDDFL   SN
Subjt:  VSLREEEGNLMASDVFWFNNDPTGVSEKDMQ---QEGVLEEEAMHAMADIKSMSMDVKALEIDGR----------HSSDNAMEFPDWLSINDDFLLQYSN

Query:  YHCVEEDYLQDPDLS
        Y    EDYLQDPDLS
Subjt:  YHCVEEDYLQDPDLS

XP_016901295.1 PREDICTED: uncharacterized protein LOC103493717 [Cucumis melo]3.1e-14591.35Show/hide
Query:  MNSTDQLNFEAAAQISKPDEEPKKQVRRRRHSRRRLYKEVPLDMAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFPQFGQCFEAEGRRKS
        MNS DQLNFEAAAQISKPDEEPKKQVRRRRHSRRRLYKEVPLDMAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFP+ GQCFEAEGRRKS
Subjt:  MNSTDQLNFEAAAQISKPDEEPKKQVRRRRHSRRRLYKEVPLDMAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFPQFGQCFEAEGRRKS

Query:  RRNPRIYPDCSYDCSFYLENGSGLVAPPPENLNTEIPIQTFDDDFKTLDTCSSFCSLSFWPPPSSYICPTLSCPDT-HQELPKSVSLREEEGNLMASDVF
        +RNPRIYP CSYDCSFYLENGSG VAPPPENLNTEIPIQTFDDDFKTLDTCSSFCSLSFWPPPSSYICPT+SCPDT HQE PKSVSLREEEGNLMASDVF
Subjt:  RRNPRIYPDCSYDCSFYLENGSGLVAPPPENLNTEIPIQTFDDDFKTLDTCSSFCSLSFWPPPSSYICPTLSCPDT-HQELPKSVSLREEEGNLMASDVF

Query:  WFNNDPTGVSEKDMQQEGVLEEEAM-HAMADIKSMSMDVKALEIDGRHSSDNAMEFPDWLSINDDFLLQYSNYHCVEEDYLQDPDLSWY
        WFNNDPTGV+EKDMQQE VLEEEAM  AM D+KSMSMDVKALEID  HSSDNAM FPDW+SINDD L QYSNYHCVEED LQ+PDLS +
Subjt:  WFNNDPTGVSEKDMQQEGVLEEEAM-HAMADIKSMSMDVKALEIDGRHSSDNAMEFPDWLSINDDFLLQYSNYHCVEEDYLQDPDLSWY

XP_022940715.1 uncharacterized protein LOC111446225 [Cucurbita moschata]2.9e-7460.25Show/hide
Query:  MNSTDQL-NFEAAAQISKPDEEP----KKQVRRRRHSRRRLYKEVPLDMAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFP-QFGQCFEA
        MNSTDQL NFE A +I +P  +P    KKQVRRRR S RRLYK++PL+MAEARREIVTALKLHRA STKE A+EQQQKQDQ+ K S P++P QF  CFE 
Subjt:  MNSTDQL-NFEAAAQISKPDEEP----KKQVRRRRHSRRRLYKEVPLDMAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFP-QFGQCFEA

Query:  EGRRKSRRNPRIYPDCSYDCSFYLENGSGLVAPPP--ENLNTEIPIQTFDDDFKTLDTCS----------SFCSLSFWPPPSSYICPTLS-CPDTHQELP
        E R KSRRNPRIYP    DCSFY ENGS  +APPP  ++L+ +IPIQT   +    DT S          SF SLSF  PPSSYICPT      THQE+P
Subjt:  EGRRKSRRNPRIYPDCSYDCSFYLENGSGLVAPPP--ENLNTEIPIQTFDDDFKTLDTCS----------SFCSLSFWPPPSSYICPTLS-CPDTHQELP

Query:  KSVSLREEEGNLMASDVFWFNNDPTGVSEKDMQ---QEGVLEEEAMHAMADIKSMSMDVKALEIDGR----------HSSDNAMEFPDWLSINDDFLLQY
        KS+SL EEEG LMASD+FW NN PTG SEK++    +E   EEEAM  +A+I+  S+D K LEIDG+            S+ AMEFPDWLSINDDFL   
Subjt:  KSVSLREEEGNLMASDVFWFNNDPTGVSEKDMQ---QEGVLEEEAMHAMADIKSMSMDVKALEIDGR----------HSSDNAMEFPDWLSINDDFLLQY

Query:  SNYHCVEEDYLQDPDLS
        SNY    EDYLQDPDLS
Subjt:  SNYHCVEEDYLQDPDLS

XP_038897806.1 uncharacterized protein LOC120085720 [Benincasa hispida]8.1e-10975.08Show/hide
Query:  MNSTDQL-NFEAAAQIS--KPDEEPKKQVRRRRHSRRRLYKEVPLDMAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFPQFGQCFEAEGR
        MNS DQL NFEAAAQIS  KPD E KKQVRRRRHSRRRLYKE+PLDMAEARREIVTALKLHRA STKE AREQQQKQDQ+  QS P+FPQ G CFE +GR
Subjt:  MNSTDQL-NFEAAAQIS--KPDEEPKKQVRRRRHSRRRLYKEVPLDMAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFPQFGQCFEAEGR

Query:  RKSRRNPRIYPDCSYDCSFYLENGSGLVAPP--PENLNTEIPIQTFDDDFKTLDTCSSFCSLSFWPPPSSYICPTLSCPDTHQELPKSVSLREEEGNLMA
        RKSRRN R YP    DCSFYLENGSG VAPP   +NL TEIP Q+FDDDFKT    SS+C LSFW PPSSYI PT+SC  THQE+PKS+SL EEEGNLMA
Subjt:  RKSRRNPRIYPDCSYDCSFYLENGSGLVAPP--PENLNTEIPIQTFDDDFKTLDTCSSFCSLSFWPPPSSYICPTLSCPDTHQELPKSVSLREEEGNLMA

Query:  SDVFWFNNDPTGVSEKDMQQEGVLEEEAMHAMADIKSMSMDVKALEIDGRHSSDNAMEFPDWLSINDDFLLQYSNYHCVEEDYLQDPDLSWYQFNFS
        SDVFWFNND     +KDM QEG +EE    AMA+++ M+MDVKALE DG HS +N MEF DW SINDDFL Q+SNYHCVEEDYLQDPDLSWYQFN S
Subjt:  SDVFWFNNDPTGVSEKDMQQEGVLEEEAMHAMADIKSMSMDVKALEIDGRHSSDNAMEFPDWLSINDDFLLQYSNYHCVEEDYLQDPDLSWYQFNFS

TrEMBL top hitse value%identityAlignment
A0A0A0L091 Uncharacterized protein2.7e-142100Show/hide
Query:  MAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFPQFGQCFEAEGRRKSRRNPRIYPDCSYDCSFYLENGSGLVAPPPENLNTEIPIQTFDD
        MAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFPQFGQCFEAEGRRKSRRNPRIYPDCSYDCSFYLENGSGLVAPPPENLNTEIPIQTFDD
Subjt:  MAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFPQFGQCFEAEGRRKSRRNPRIYPDCSYDCSFYLENGSGLVAPPPENLNTEIPIQTFDD

Query:  DFKTLDTCSSFCSLSFWPPPSSYICPTLSCPDTHQELPKSVSLREEEGNLMASDVFWFNNDPTGVSEKDMQQEGVLEEEAMHAMADIKSMSMDVKALEID
        DFKTLDTCSSFCSLSFWPPPSSYICPTLSCPDTHQELPKSVSLREEEGNLMASDVFWFNNDPTGVSEKDMQQEGVLEEEAMHAMADIKSMSMDVKALEID
Subjt:  DFKTLDTCSSFCSLSFWPPPSSYICPTLSCPDTHQELPKSVSLREEEGNLMASDVFWFNNDPTGVSEKDMQQEGVLEEEAMHAMADIKSMSMDVKALEID

Query:  GRHSSDNAMEFPDWLSINDDFLLQYSNYHCVEEDYLQDPDLSWYQFNFS
        GRHSSDNAMEFPDWLSINDDFLLQYSNYHCVEEDYLQDPDLSWYQFNFS
Subjt:  GRHSSDNAMEFPDWLSINDDFLLQYSNYHCVEEDYLQDPDLSWYQFNFS

A0A1S4DZY0 uncharacterized protein LOC1034937171.5e-14591.35Show/hide
Query:  MNSTDQLNFEAAAQISKPDEEPKKQVRRRRHSRRRLYKEVPLDMAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFPQFGQCFEAEGRRKS
        MNS DQLNFEAAAQISKPDEEPKKQVRRRRHSRRRLYKEVPLDMAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFP+ GQCFEAEGRRKS
Subjt:  MNSTDQLNFEAAAQISKPDEEPKKQVRRRRHSRRRLYKEVPLDMAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFPQFGQCFEAEGRRKS

Query:  RRNPRIYPDCSYDCSFYLENGSGLVAPPPENLNTEIPIQTFDDDFKTLDTCSSFCSLSFWPPPSSYICPTLSCPDT-HQELPKSVSLREEEGNLMASDVF
        +RNPRIYP CSYDCSFYLENGSG VAPPPENLNTEIPIQTFDDDFKTLDTCSSFCSLSFWPPPSSYICPT+SCPDT HQE PKSVSLREEEGNLMASDVF
Subjt:  RRNPRIYPDCSYDCSFYLENGSGLVAPPPENLNTEIPIQTFDDDFKTLDTCSSFCSLSFWPPPSSYICPTLSCPDT-HQELPKSVSLREEEGNLMASDVF

Query:  WFNNDPTGVSEKDMQQEGVLEEEAM-HAMADIKSMSMDVKALEIDGRHSSDNAMEFPDWLSINDDFLLQYSNYHCVEEDYLQDPDLSWY
        WFNNDPTGV+EKDMQQE VLEEEAM  AM D+KSMSMDVKALEID  HSSDNAM FPDW+SINDD L QYSNYHCVEED LQ+PDLS +
Subjt:  WFNNDPTGVSEKDMQQEGVLEEEAM-HAMADIKSMSMDVKALEIDGRHSSDNAMEFPDWLSINDDFLLQYSNYHCVEEDYLQDPDLSWY

A0A5A7V8V7 Putative WRKY transcription factor protein 1 isoform X21.5e-14591.35Show/hide
Query:  MNSTDQLNFEAAAQISKPDEEPKKQVRRRRHSRRRLYKEVPLDMAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFPQFGQCFEAEGRRKS
        MNS DQLNFEAAAQISKPDEEPKKQVRRRRHSRRRLYKEVPLDMAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFP+ GQCFEAEGRRKS
Subjt:  MNSTDQLNFEAAAQISKPDEEPKKQVRRRRHSRRRLYKEVPLDMAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFPQFGQCFEAEGRRKS

Query:  RRNPRIYPDCSYDCSFYLENGSGLVAPPPENLNTEIPIQTFDDDFKTLDTCSSFCSLSFWPPPSSYICPTLSCPDT-HQELPKSVSLREEEGNLMASDVF
        +RNPRIYP CSYDCSFYLENGSG VAPPPENLNTEIPIQTFDDDFKTLDTCSSFCSLSFWPPPSSYICPT+SCPDT HQE PKSVSLREEEGNLMASDVF
Subjt:  RRNPRIYPDCSYDCSFYLENGSGLVAPPPENLNTEIPIQTFDDDFKTLDTCSSFCSLSFWPPPSSYICPTLSCPDT-HQELPKSVSLREEEGNLMASDVF

Query:  WFNNDPTGVSEKDMQQEGVLEEEAM-HAMADIKSMSMDVKALEIDGRHSSDNAMEFPDWLSINDDFLLQYSNYHCVEEDYLQDPDLSWY
        WFNNDPTGV+EKDMQQE VLEEEAM  AM D+KSMSMDVKALEID  HSSDNAM FPDW+SINDD L QYSNYHCVEED LQ+PDLS +
Subjt:  WFNNDPTGVSEKDMQQEGVLEEEAM-HAMADIKSMSMDVKALEIDGRHSSDNAMEFPDWLSINDDFLLQYSNYHCVEEDYLQDPDLSWY

A0A6J1FRD8 uncharacterized protein LOC1114462251.4e-7460.25Show/hide
Query:  MNSTDQL-NFEAAAQISKPDEEP----KKQVRRRRHSRRRLYKEVPLDMAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFP-QFGQCFEA
        MNSTDQL NFE A +I +P  +P    KKQVRRRR S RRLYK++PL+MAEARREIVTALKLHRA STKE A+EQQQKQDQ+ K S P++P QF  CFE 
Subjt:  MNSTDQL-NFEAAAQISKPDEEP----KKQVRRRRHSRRRLYKEVPLDMAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFP-QFGQCFEA

Query:  EGRRKSRRNPRIYPDCSYDCSFYLENGSGLVAPPP--ENLNTEIPIQTFDDDFKTLDTCS----------SFCSLSFWPPPSSYICPTLS-CPDTHQELP
        E R KSRRNPRIYP    DCSFY ENGS  +APPP  ++L+ +IPIQT   +    DT S          SF SLSF  PPSSYICPT      THQE+P
Subjt:  EGRRKSRRNPRIYPDCSYDCSFYLENGSGLVAPPP--ENLNTEIPIQTFDDDFKTLDTCS----------SFCSLSFWPPPSSYICPTLS-CPDTHQELP

Query:  KSVSLREEEGNLMASDVFWFNNDPTGVSEKDMQ---QEGVLEEEAMHAMADIKSMSMDVKALEIDGR----------HSSDNAMEFPDWLSINDDFLLQY
        KS+SL EEEG LMASD+FW NN PTG SEK++    +E   EEEAM  +A+I+  S+D K LEIDG+            S+ AMEFPDWLSINDDFL   
Subjt:  KSVSLREEEGNLMASDVFWFNNDPTGVSEKDMQ---QEGVLEEEAMHAMADIKSMSMDVKALEIDGR----------HSSDNAMEFPDWLSINDDFLLQY

Query:  SNYHCVEEDYLQDPDLS
        SNY    EDYLQDPDLS
Subjt:  SNYHCVEEDYLQDPDLS

A0A6J1IXC1 uncharacterized protein LOC1114807862.2e-7258.75Show/hide
Query:  MNSTDQL-NFEAAAQISKPDEEP--------KKQVRRRRHSRRRLYKEVPLDMAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFP-QFGQ
        MNSTDQL NFE A +I +P  +P        KKQVRRRR + RRLYK++PL+MAEARREIVTALKLHRA STKE A+EQQQKQDQ+ K S P++P QF  
Subjt:  MNSTDQL-NFEAAAQISKPDEEP--------KKQVRRRRHSRRRLYKEVPLDMAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFP-QFGQ

Query:  CFEAEGRRKSRRNPRIYPDCSYDCSFYLENGSGLVAPPP--ENLNTEIPIQTFDDDFKTLDTCS---------SFCSLSFWPPPSSYICPTLS-CPDTHQ
        CFE E R KSRRNPRIYP    DCSFY +NGS  +APPP  ++L+ +IPIQT   +    DT S         SF SLSF   PSSYICPT      TH+
Subjt:  CFEAEGRRKSRRNPRIYPDCSYDCSFYLENGSGLVAPPP--ENLNTEIPIQTFDDDFKTLDTCS---------SFCSLSFWPPPSSYICPTLS-CPDTHQ

Query:  ELPKSVSLREEEGNLMASDVFWFNNDPTGVSEKDMQ---QEGVLEEEAMHAMADIKSMSMDVKALEIDGR----------HSSDNAMEFPDWLSINDDFL
        E+PKS+SL EEEG LMASD+FW NN PTG SEK++    +E   EEEAM  +A+I+  SMD K LEIDG+            S+ AMEFPDWLSINDDFL
Subjt:  ELPKSVSLREEEGNLMASDVFWFNNDPTGVSEKDMQ---QEGVLEEEAMHAMADIKSMSMDVKALEIDGR----------HSSDNAMEFPDWLSINDDFL

Query:  LQYSNYHCVEEDYLQDPDLS
           SNY    EDYLQDPDLS
Subjt:  LQYSNYHCVEEDYLQDPDLS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G21280.1 hydroxyproline-rich glycoprotein family protein6.2e-0629.84Show/hide
Query:  KKQVRRRRHSRRRLYKEVPLDMAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFPQFGQCFEAEGRRKSRRNPRIYPDCSYDCSFYLENGS
        KKQVRRR H+ R  Y+E  L+MAEARREIVTALK HRAS  +  A      Q     Q   LF              S   P   PD      F   N S
Subjt:  KKQVRRRRHSRRRLYKEVPLDMAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFPQFGQCFEAEGRRKSRRNPRIYPDCSYDCSFYLENGS

Query:  GLVAPPPENLNTEIPIQTFDDDFKTLDTCSSFCSLSFWPPPSSYICPTLSCPDTHQELPKSVSLREEEGNLMASDVFWFNNDPTGVSEKDMQQEGVLEEE
             P + L   +  Q F+D  +T  T SS  S S     SS          +    P   +   +    + S     NN  T     ++  + V  E 
Subjt:  GLVAPPPENLNTEIPIQTFDDDFKTLDTCSSFCSLSFWPPPSSYICPTLSCPDTHQELPKSVSLREEEGNLMASDVFWFNNDPTGVSEKDMQQEGVLEEE

Query:  AMHAMADIKSMSMDVKALEIDGRHSSDNAMEFPDWLSINDDFLLQYSN
               IK  + +V  +E D      + MEFP WL+  ++ L    N
Subjt:  AMHAMADIKSMSMDVKALEIDGRHSSDNAMEFPDWLSINDDFLLQYSN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTCCACAGATCAACTCAACTTTGAAGCTGCAGCACAAATTTCAAAGCCAGATGAAGAACCAAAGAAACAGGTTAGAAGGAGACGCCACAGCCGGCGGCGGCTTTA
CAAGGAAGTGCCTTTGGATATGGCTGAGGCTAGGAGAGAGATTGTAACTGCACTTAAGCTCCATAGAGCATCATCAACCAAAGAAGCTGCAAGAGAGCAGCAACAAAAGC
AGGACCAAGAAAGTAAACAATCATTTCCTCTGTTTCCTCAATTTGGCCAATGTTTTGAAGCTGAAGGAAGAAGGAAATCCAGAAGAAACCCCAGGATATACCCAGATTGT
TCATATGATTGCTCATTTTATTTGGAAAATGGGTCTGGTCTTGTTGCTCCTCCACCTGAGAATCTCAATACAGAAATCCCTATACAAACCTTTGATGATGATTTCAAAAC
TTTGGATACTTGTTCTTCATTTTGCTCACTTTCATTCTGGCCCCCACCATCTTCATATATTTGTCCCACTCTTTCTTGTCCCGATACTCATCAGGAACTTCCCAAATCAG
TTTCGTTACGTGAGGAAGAAGGGAATTTAATGGCTTCTGATGTGTTTTGGTTCAATAATGACCCAACTGGAGTGAGTGAAAAGGACATGCAACAGGAGGGAGTTTTGGAG
GAGGAAGCTATGCATGCTATGGCTGATATCAAGTCCATGTCCATGGATGTGAAAGCTTTGGAGATTGATGGTCGCCATAGTTCTGATAATGCTATGGAATTTCCTGATTG
GTTGAGCATTAATGATGATTTTTTGCTGCAGTATTCGAATTATCATTGCGTAGAGGAAGATTATCTTCAAGATCCTGACCTATCCTGGTATCAATTTAACTTTTCTTAA
mRNA sequenceShow/hide mRNA sequence
CGCAATCGGTGCATTATTTAAACCGATATCAATCCTCATCTCTGCACCTTTTTTCCTACCTGATAGAAAGTAAATAGATGGAACATTAAATAGTAAGCTCAAGGTTTAGC
ATCATTTCTCTATGATTGTTTCTTCCCTGAACTGCTAAAACAGTTCAAATCGTTTTTCCTCGTTGACTCAATGAATTCCACAGATCAACTCAACTTTGAAGCTGCAGCAC
AAATTTCAAAGCCAGATGAAGAACCAAAGAAACAGGTTAGAAGGAGACGCCACAGCCGGCGGCGGCTTTACAAGGAAGTGCCTTTGGATATGGCTGAGGCTAGGAGAGAG
ATTGTAACTGCACTTAAGCTCCATAGAGCATCATCAACCAAAGAAGCTGCAAGAGAGCAGCAACAAAAGCAGGACCAAGAAAGTAAACAATCATTTCCTCTGTTTCCTCA
ATTTGGCCAATGTTTTGAAGCTGAAGGAAGAAGGAAATCCAGAAGAAACCCCAGGATATACCCAGATTGTTCATATGATTGCTCATTTTATTTGGAAAATGGGTCTGGTC
TTGTTGCTCCTCCACCTGAGAATCTCAATACAGAAATCCCTATACAAACCTTTGATGATGATTTCAAAACTTTGGATACTTGTTCTTCATTTTGCTCACTTTCATTCTGG
CCCCCACCATCTTCATATATTTGTCCCACTCTTTCTTGTCCCGATACTCATCAGGAACTTCCCAAATCAGTTTCGTTACGTGAGGAAGAAGGGAATTTAATGGCTTCTGA
TGTGTTTTGGTTCAATAATGACCCAACTGGAGTGAGTGAAAAGGACATGCAACAGGAGGGAGTTTTGGAGGAGGAAGCTATGCATGCTATGGCTGATATCAAGTCCATGT
CCATGGATGTGAAAGCTTTGGAGATTGATGGTCGCCATAGTTCTGATAATGCTATGGAATTTCCTGATTGGTTGAGCATTAATGATGATTTTTTGCTGCAGTATTCGAAT
TATCATTGCGTAGAGGAAGATTATCTTCAAGATCCTGACCTATCCTGGTATCAATTTAACTTTTCTTAATATTAACGAAATCCCCCCTTCCAATTAGAAAAACACATTTA
GGAGCCAGCATGTCTTTCATCTTCTTGTACACTTTTAGTTTCTCCTTTTTTTTCCTTTTCGGAGTGATGAACTGATTCTCTAGTTCAATGGACTTTTTGTGGGGTATGAA
TTTGACGTTTGTTGACAATAAAACTCAACCCTATATAATAAAATGAAATTTTTTTATATAAAATATAGCAAAATCTTT
Protein sequenceShow/hide protein sequence
MNSTDQLNFEAAAQISKPDEEPKKQVRRRRHSRRRLYKEVPLDMAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFPQFGQCFEAEGRRKSRRNPRIYPDC
SYDCSFYLENGSGLVAPPPENLNTEIPIQTFDDDFKTLDTCSSFCSLSFWPPPSSYICPTLSCPDTHQELPKSVSLREEEGNLMASDVFWFNNDPTGVSEKDMQQEGVLE
EEAMHAMADIKSMSMDVKALEIDGRHSSDNAMEFPDWLSINDDFLLQYSNYHCVEEDYLQDPDLSWYQFNFS