; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc07g0181941 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc07g0181941
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionDual specificity protein kinase splA
Genome locationCMiso1.1chr07:1058494..1062610
RNA-Seq ExpressionCmc07g0181941
SyntenyCmc07g0181941
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8649926.1 hypothetical protein Csa_011922 [Cucumis sativus]1.3e-12990Show/hide
Query:  MAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFPELGQCFEAEGRRKSKRNPRIYPSCSYDCSFYLENGSGFVAPPPENLNTEIPIQTFDD
        MAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFP+ GQCFEAEGRRKS+RNPRIYP CSYDCSFYLENGSG VAPPPENLNTEIPIQTFDD
Subjt:  MAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFPELGQCFEAEGRRKSKRNPRIYPSCSYDCSFYLENGSGFVAPPPENLNTEIPIQTFDD

Query:  DFKTLDTCSSFCSLSFWPPPSSYICPTVSCPDTHHQEFPKSVSLREEEGNLMASDVFWFNNDPTGVNEKDMQQEAVLEEEAMAMAMDDLKSMSMDVKALE
        DFKTLDTCSSFCSLSFWPPPSSYICPT+SCPDT HQE PKSVSLREEEGNLMASDVFWFNNDPTGV+EKDMQQE VLEEEAM  AM D+KSMSMDVKALE
Subjt:  DFKTLDTCSSFCSLSFWPPPSSYICPTVSCPDTHHQEFPKSVSLREEEGNLMASDVFWFNNDPTGVNEKDMQQEAVLEEEAMAMAMDDLKSMSMDVKALE

Query:  IDCHHSSDNAMEFPDWMSINDDSLQQYSNYHCVEEDCLQEPDLSCFDIGKIEDVGKEWLA
        ID  HSSDNAMEFPDW+SINDD L QYSNYHCVEED LQ+PDLSCFD  KIED+ +EWLA
Subjt:  IDCHHSSDNAMEFPDWMSINDDSLQQYSNYHCVEEDCLQEPDLSCFDIGKIEDVGKEWLA

KAG6608324.1 hypothetical protein SDJN03_01666, partial [Cucurbita argyrosperma subsp. sororia]5.8e-6657.39Show/hide
Query:  MAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFP-ELGQCFEAEGRRKSKRNPRIYPSCSYDCSFYLENGSGFVAPPP--ENLNTEIPIQT
        MAEARREIVTALKLHRA STKE A+EQQQKQDQ+ K S P++P +   CFE E R KS+RNPRIYP    DCSFY ENGS F+APPP  ++L+ +IPIQT
Subjt:  MAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFP-ELGQCFEAEGRRKSKRNPRIYPSCSYDCSFYLENGSGFVAPPP--ENLNTEIPIQT

Query:  FDDDFKTLDTCS--------SFCSLSFWPPPSSYICPTVSCPDTHHQEFPKSVSLREEEGNLMASDVFWFNNDPTGVNEKDMQ---QEAVLEEEAMAMAM
           +    DT S        SF SLSF  PPSSYICPT     T HQE PKS+SL EEEG LMASD+FW NN PTG +EK++    +E   EEEAM   +
Subjt:  FDDDFKTLDTCS--------SFCSLSFWPPPSSYICPTVSCPDTHHQEFPKSVSLREEEGNLMASDVFWFNNDPTGVNEKDMQ---QEAVLEEEAMAMAM

Query:  DDLKSMSMDVKALEIDCH----------HSSDNAMEFPDWMSINDDSLQQYSNYHCVEEDCLQEPDLSCFDIGKIEDVGKEWLA
              SM+ K LEID              S+ AMEFPDW+SINDD LQ  SNY    ED LQ+PDLSC DIG+IEDV  +WLA
Subjt:  DDLKSMSMDVKALEIDCH----------HSSDNAMEFPDWMSINDDSLQQYSNYHCVEEDCLQEPDLSCFDIGKIEDVGKEWLA

XP_016901295.1 PREDICTED: uncharacterized protein LOC103493717 [Cucumis melo]6.0e-14899.23Show/hide
Query:  MAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFPELGQCFEAEGRRKSKRNPRIYPSCSYDCSFYLENGSGFVAPPPENLNTEIPIQTFDD
        MAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFPELGQCFEAEGRRKSKRNPRIYPSCSYDCSFYLENGSGFVAPPPENLNTEIPIQTFDD
Subjt:  MAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFPELGQCFEAEGRRKSKRNPRIYPSCSYDCSFYLENGSGFVAPPPENLNTEIPIQTFDD

Query:  DFKTLDTCSSFCSLSFWPPPSSYICPTVSCPDTHHQEFPKSVSLREEEGNLMASDVFWFNNDPTGVNEKDMQQEAVLEEEAMAMAMDDLKSMSMDVKALE
        DFKTLDTCSSFCSLSFWPPPSSYICPTVSCPDTHHQEFPKSVSLREEEGNLMASDVFWFNNDPTGVNEKDMQQEAVLEEEAMAMAMDDLKSMSMDVKALE
Subjt:  DFKTLDTCSSFCSLSFWPPPSSYICPTVSCPDTHHQEFPKSVSLREEEGNLMASDVFWFNNDPTGVNEKDMQQEAVLEEEAMAMAMDDLKSMSMDVKALE

Query:  IDCHHSSDNAMEFPDWMSINDDSLQQYSNYHCVEEDCLQEPDLSCFDIGKIEDVGKEWLA
        IDCHHSSDNAM FPDWMSINDDSLQQYSNYHCVEEDCLQEPDLSCFDIGKIED+GKEWLA
Subjt:  IDCHHSSDNAMEFPDWMSINDDSLQQYSNYHCVEEDCLQEPDLSCFDIGKIEDVGKEWLA

XP_022940715.1 uncharacterized protein LOC111446225 [Cucurbita moschata]4.4e-6656.99Show/hide
Query:  MAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFP-ELGQCFEAEGRRKSKRNPRIYPSCSYDCSFYLENGSGFVAPPP--ENLNTEIPIQT
        MAEARREIVTALKLHRA STKE A+EQQQKQDQ+ K S P++P +   CFE E R KS+RNPRIYP    DCSFY ENGS F+APPP  ++L+ +IPIQT
Subjt:  MAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFP-ELGQCFEAEGRRKSKRNPRIYPSCSYDCSFYLENGSGFVAPPP--ENLNTEIPIQT

Query:  FDDDFKTLDTCS----------SFCSLSFWPPPSSYICPTVSCPDTHHQEFPKSVSLREEEGNLMASDVFWFNNDPTGVNEKDMQ---QEAVLEEEAMAM
           +    DT S          SF SLSF  PPSSYICPT     T HQE PKS+SL EEEG LMASD+FW NN PTG +EK++    +E   EEEAM  
Subjt:  FDDDFKTLDTCS----------SFCSLSFWPPPSSYICPTVSCPDTHHQEFPKSVSLREEEGNLMASDVFWFNNDPTGVNEKDMQ---QEAVLEEEAMAM

Query:  AMDDLKSMSMDVKALEIDCH----------HSSDNAMEFPDWMSINDDSLQQYSNYHCVEEDCLQEPDLSCFDIGKIEDVGKEWLA
         +      S+D K LEID              S+ AMEFPDW+SINDD LQ  SNY    ED LQ+PDLSC DIG+IEDV  +WLA
Subjt:  AMDDLKSMSMDVKALEIDCH----------HSSDNAMEFPDWMSINDDSLQQYSNYHCVEEDCLQEPDLSCFDIGKIEDVGKEWLA

XP_038897806.1 uncharacterized protein LOC120085720 [Benincasa hispida]2.0e-8271.77Show/hide
Query:  MAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFPELGQCFEAEGRRKSKRNPRIYPSCSYDCSFYLENGSGFVAPP--PENLNTEIPIQTF
        MAEARREIVTALKLHRA STKE AREQQQKQDQ+  QS P+FP+LG CFE +GRRKS+RN R YP    DCSFYLENGSGFVAPP   +NL TEIP Q+F
Subjt:  MAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFPELGQCFEAEGRRKSKRNPRIYPSCSYDCSFYLENGSGFVAPP--PENLNTEIPIQTF

Query:  DDDFKTLDTCSSFCSLSFWPPPSSYICPTVSCPDTHHQEFPKSVSLREEEGNLMASDVFWFNNDPTGVNEKDMQQEAVLEEEAMAMAMDDLKSMSMDVKA
        DDDFKT    SS+C LSFW PPSSYI PTVSC  T HQE PKS+SL EEEGNLMASDVFWFNND     +KDMQ+ AV  EEA A AM +++ M+MDVKA
Subjt:  DDDFKTLDTCSSFCSLSFWPPPSSYICPTVSCPDTHHQEFPKSVSLREEEGNLMASDVFWFNNDPTGVNEKDMQQEAVLEEEAMAMAMDDLKSMSMDVKA

Query:  LEIDCHHSSDNAMEFPDWMSINDDSLQQYSNYHCVEEDCLQEPDLSCF
        LE D HHS +N MEF DW SINDD LQQ+SNYHCVEED LQ+PDLS +
Subjt:  LEIDCHHSSDNAMEFPDWMSINDDSLQQYSNYHCVEEDCLQEPDLSCF

TrEMBL top hitse value%identityAlignment
A0A0A0L091 Uncharacterized protein2.7e-12290.65Show/hide
Query:  MAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFPELGQCFEAEGRRKSKRNPRIYPSCSYDCSFYLENGSGFVAPPPENLNTEIPIQTFDD
        MAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFP+ GQCFEAEGRRKS+RNPRIYP CSYDCSFYLENGSG VAPPPENLNTEIPIQTFDD
Subjt:  MAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFPELGQCFEAEGRRKSKRNPRIYPSCSYDCSFYLENGSGFVAPPPENLNTEIPIQTFDD

Query:  DFKTLDTCSSFCSLSFWPPPSSYICPTVSCPDTHHQEFPKSVSLREEEGNLMASDVFWFNNDPTGVNEKDMQQEAVLEEEAMAMAMDDLKSMSMDVKALE
        DFKTLDTCSSFCSLSFWPPPSSYICPT+SCPDT HQE PKSVSLREEEGNLMASDVFWFNNDPTGV+EKDMQQE VLEEEAM  AM D+KSMSMDVKALE
Subjt:  DFKTLDTCSSFCSLSFWPPPSSYICPTVSCPDTHHQEFPKSVSLREEEGNLMASDVFWFNNDPTGVNEKDMQQEAVLEEEAMAMAMDDLKSMSMDVKALE

Query:  IDCHHSSDNAMEFPDWMSINDDSLQQYSNYHCVEEDCLQEPDLSCF
        ID  HSSDNAMEFPDW+SINDD L QYSNYHCVEED LQ+PDLS +
Subjt:  IDCHHSSDNAMEFPDWMSINDDSLQQYSNYHCVEEDCLQEPDLSCF

A0A1S4DZY0 uncharacterized protein LOC1034937172.9e-14899.23Show/hide
Query:  MAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFPELGQCFEAEGRRKSKRNPRIYPSCSYDCSFYLENGSGFVAPPPENLNTEIPIQTFDD
        MAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFPELGQCFEAEGRRKSKRNPRIYPSCSYDCSFYLENGSGFVAPPPENLNTEIPIQTFDD
Subjt:  MAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFPELGQCFEAEGRRKSKRNPRIYPSCSYDCSFYLENGSGFVAPPPENLNTEIPIQTFDD

Query:  DFKTLDTCSSFCSLSFWPPPSSYICPTVSCPDTHHQEFPKSVSLREEEGNLMASDVFWFNNDPTGVNEKDMQQEAVLEEEAMAMAMDDLKSMSMDVKALE
        DFKTLDTCSSFCSLSFWPPPSSYICPTVSCPDTHHQEFPKSVSLREEEGNLMASDVFWFNNDPTGVNEKDMQQEAVLEEEAMAMAMDDLKSMSMDVKALE
Subjt:  DFKTLDTCSSFCSLSFWPPPSSYICPTVSCPDTHHQEFPKSVSLREEEGNLMASDVFWFNNDPTGVNEKDMQQEAVLEEEAMAMAMDDLKSMSMDVKALE

Query:  IDCHHSSDNAMEFPDWMSINDDSLQQYSNYHCVEEDCLQEPDLSCFDIGKIEDVGKEWLA
        IDCHHSSDNAM FPDWMSINDDSLQQYSNYHCVEEDCLQEPDLSCFDIGKIED+GKEWLA
Subjt:  IDCHHSSDNAMEFPDWMSINDDSLQQYSNYHCVEEDCLQEPDLSCFDIGKIEDVGKEWLA

A0A5A7V8V7 Putative WRKY transcription factor protein 1 isoform X22.9e-14899.23Show/hide
Query:  MAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFPELGQCFEAEGRRKSKRNPRIYPSCSYDCSFYLENGSGFVAPPPENLNTEIPIQTFDD
        MAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFPELGQCFEAEGRRKSKRNPRIYPSCSYDCSFYLENGSGFVAPPPENLNTEIPIQTFDD
Subjt:  MAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFPELGQCFEAEGRRKSKRNPRIYPSCSYDCSFYLENGSGFVAPPPENLNTEIPIQTFDD

Query:  DFKTLDTCSSFCSLSFWPPPSSYICPTVSCPDTHHQEFPKSVSLREEEGNLMASDVFWFNNDPTGVNEKDMQQEAVLEEEAMAMAMDDLKSMSMDVKALE
        DFKTLDTCSSFCSLSFWPPPSSYICPTVSCPDTHHQEFPKSVSLREEEGNLMASDVFWFNNDPTGVNEKDMQQEAVLEEEAMAMAMDDLKSMSMDVKALE
Subjt:  DFKTLDTCSSFCSLSFWPPPSSYICPTVSCPDTHHQEFPKSVSLREEEGNLMASDVFWFNNDPTGVNEKDMQQEAVLEEEAMAMAMDDLKSMSMDVKALE

Query:  IDCHHSSDNAMEFPDWMSINDDSLQQYSNYHCVEEDCLQEPDLSCFDIGKIEDVGKEWLA
        IDCHHSSDNAM FPDWMSINDDSLQQYSNYHCVEEDCLQEPDLSCFDIGKIED+GKEWLA
Subjt:  IDCHHSSDNAMEFPDWMSINDDSLQQYSNYHCVEEDCLQEPDLSCFDIGKIEDVGKEWLA

A0A6J1FRD8 uncharacterized protein LOC1114462252.1e-6656.99Show/hide
Query:  MAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFP-ELGQCFEAEGRRKSKRNPRIYPSCSYDCSFYLENGSGFVAPPP--ENLNTEIPIQT
        MAEARREIVTALKLHRA STKE A+EQQQKQDQ+ K S P++P +   CFE E R KS+RNPRIYP    DCSFY ENGS F+APPP  ++L+ +IPIQT
Subjt:  MAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFP-ELGQCFEAEGRRKSKRNPRIYPSCSYDCSFYLENGSGFVAPPP--ENLNTEIPIQT

Query:  FDDDFKTLDTCS----------SFCSLSFWPPPSSYICPTVSCPDTHHQEFPKSVSLREEEGNLMASDVFWFNNDPTGVNEKDMQ---QEAVLEEEAMAM
           +    DT S          SF SLSF  PPSSYICPT     T HQE PKS+SL EEEG LMASD+FW NN PTG +EK++    +E   EEEAM  
Subjt:  FDDDFKTLDTCS----------SFCSLSFWPPPSSYICPTVSCPDTHHQEFPKSVSLREEEGNLMASDVFWFNNDPTGVNEKDMQ---QEAVLEEEAMAM

Query:  AMDDLKSMSMDVKALEIDCH----------HSSDNAMEFPDWMSINDDSLQQYSNYHCVEEDCLQEPDLSCFDIGKIEDVGKEWLA
         +      S+D K LEID              S+ AMEFPDW+SINDD LQ  SNY    ED LQ+PDLSC DIG+IEDV  +WLA
Subjt:  AMDDLKSMSMDVKALEIDCH----------HSSDNAMEFPDWMSINDDSLQQYSNYHCVEEDCLQEPDLSCFDIGKIEDVGKEWLA

A0A6J1IXC1 uncharacterized protein LOC1114807865.3e-6556.49Show/hide
Query:  MAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFP-ELGQCFEAEGRRKSKRNPRIYPSCSYDCSFYLENGSGFVAPPP--ENLNTEIPIQT
        MAEARREIVTALKLHRA STKE A+EQQQKQDQ+ K S P++P +   CFE E R KS+RNPRIYP    DCSFY +NGS F+APPP  ++L+ +IPIQT
Subjt:  MAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFP-ELGQCFEAEGRRKSKRNPRIYPSCSYDCSFYLENGSGFVAPPP--ENLNTEIPIQT

Query:  FDDDFKTLDTCS---------SFCSLSFWPPPSSYICPTVSCPDTHHQEFPKSVSLREEEGNLMASDVFWFNNDPTGVNEKDMQ---QEAVLEEEAMAMA
           +    DT S         SF SLSF   PSSYICPT     T H+E PKS+SL EEEG LMASD+FW NN PTG +EK++    +E   EEEAM   
Subjt:  FDDDFKTLDTCS---------SFCSLSFWPPPSSYICPTVSCPDTHHQEFPKSVSLREEEGNLMASDVFWFNNDPTGVNEKDMQ---QEAVLEEEAMAMA

Query:  MDDLKSMSMDVKALEIDCH----------HSSDNAMEFPDWMSINDDSLQQYSNYHCVEEDCLQEPDLSCFDIGKIEDVGKEWLA
        +      SMD K LEID              S+ AMEFPDW+SINDD LQ  SNY    ED LQ+PDLSC DIG+IEDV  +WLA
Subjt:  MDDLKSMSMDVKALEIDCH----------HSSDNAMEFPDWMSINDDSLQQYSNYHCVEEDCLQEPDLSCFDIGKIEDVGKEWLA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G21280.1 hydroxyproline-rich glycoprotein family protein1.4e-0628.35Show/hide
Query:  MAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFPELGQCFEAEGRRKSKRNPRIYPSCSYDCSFYLENGSGFVAPPPENLNTEIPIQTFDD
        MAEARREIVTALK HRAS  +  A      Q     Q   LF              S   P   P       F   N S     P + L   +  Q F+D
Subjt:  MAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFPELGQCFEAEGRRKSKRNPRIYPSCSYDCSFYLENGSGFVAPPPENLNTEIPIQTFDD

Query:  DFKTLDTCSSFCSLSFWPPPSSYICPTVSCPDTHHQEFPKSVSLREEEGNLMASDVFWFNNDPTGVNEKDMQQEAVLEEEAMAMAMDDLKSMSMDVKALE
          +T  T SS  S S     SS I PT     +     P   +   +    + S     NN  T     ++  + V  E         +K  + +V  +E
Subjt:  DFKTLDTCSSFCSLSFWPPPSSYICPTVSCPDTHHQEFPKSVSLREEEGNLMASDVFWFNNDPTGVNEKDMQQEAVLEEEAMAMAMDDLKSMSMDVKALE

Query:  IDCHHSSDNAMEFPDWMSINDDSLQQYSNYHCVEEDCLQEPDLSCFDIGKIEDV-GKEWLA
         D      + MEFP W++  ++ L    N           P LSC +IG+IE + G +WLA
Subjt:  IDCHHSSDNAMEFPDWMSINDDSLQQYSNYHCVEEDCLQEPDLSCFDIGKIEDV-GKEWLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGAGGCTAGAAGAGAGATTGTAACTGCACTTAAGCTCCATAGAGCATCATCAACCAAAGAAGCTGCAAGAGAGCAGCAACAAAAGCAGGACCAAGAAAGTAAACA
ATCATTTCCTCTGTTTCCTGAATTAGGCCAATGTTTTGAAGCTGAAGGAAGAAGGAAATCCAAAAGAAACCCCAGGATATACCCAAGTTGTTCGTATGATTGCTCATTTT
ATTTGGAAAATGGGTCTGGTTTTGTTGCTCCTCCACCTGAGAATCTCAATACAGAAATCCCTATACAAACCTTTGATGATGATTTCAAAACTTTGGATACTTGTTCTTCA
TTTTGTTCACTTTCATTCTGGCCCCCACCATCTTCATATATTTGTCCCACTGTTTCTTGCCCTGATACTCATCATCAGGAATTCCCCAAATCAGTTTCATTACGTGAGGA
AGAAGGCAATTTAATGGCTTCTGATGTGTTTTGGTTCAATAATGACCCAACTGGAGTGAATGAAAAGGACATGCAGCAGGAAGCAGTGTTGGAGGAGGAAGCTATGGCTA
TGGCTATGGATGATTTGAAGTCCATGTCCATGGACGTGAAAGCTTTGGAGATTGATTGTCACCATAGTTCTGATAATGCTATGGAATTTCCTGATTGGATGAGCATTAAT
GACGATTCTTTGCAGCAGTATTCGAATTATCATTGTGTAGAGGAAGATTGCCTTCAAGAGCCTGACCTATCCTGCTTCGACATAGGGAAGATTGAAGATGTGGGTAAAGA
ATGGTTAGCATGA
mRNA sequenceShow/hide mRNA sequence
AAATTTCAAAAAGTAGATTTTAAGATTTAGCAAAACTAAAATCGAAGCAATATTGATAGATCAAAATGATATTTTAAGTTAAATTTAGAAATCAACTAAACACAATATTT
AAAATAATACTAATTATAGTATAAACGAATCGGGGAGGGGAGAGGAGAGTGGAGGGAAGAAAAGGCGACAGAGGGCGGTGAAGAACATGGGAGTTTCTGTTGGTGAGATC
ACAATCTGATGATCAAATCATTTGTTCCCTTTTTTTCTCTTTCATAAATTATTTGTTCTTCCATCATTCTGCCATTATTGTATTCTGATTTCTTCCTGTGGCTGTGCCGA
ACGGCGCCCATTGATTTTTTCAAAATGTGGCTGCGCTTCTTCTCTGTCTCTCTGCCCTTTGCCTCACCACAATCCGAAGAGCCTACTGAGCTCTGAGCTCATTGCGCCCC
ATCAAAATCCTACACACATTTAACAAGATCAACTCAACTTTGAAGCTGCCGCACAAATTTCAAAGCCAGATGAAGAACCAAAGAAACAGGTTAGGAGGAGACGCCATAGC
CGGCGGCGGCTTTACAAGGAAGTGCCTCTGGATATGGCTGAGGCTAGAAGAGAGATTGTAACTGCACTTAAGCTCCATAGAGCATCATCAACCAAAGAAGCTGCAAGAGA
GCAGCAACAAAAGCAGGACCAAGAAAGTAAACAATCATTTCCTCTGTTTCCTGAATTAGGCCAATGTTTTGAAGCTGAAGGAAGAAGGAAATCCAAAAGAAACCCCAGGA
TATACCCAAGTTGTTCGTATGATTGCTCATTTTATTTGGAAAATGGGTCTGGTTTTGTTGCTCCTCCACCTGAGAATCTCAATACAGAAATCCCTATACAAACCTTTGAT
GATGATTTCAAAACTTTGGATACTTGTTCTTCATTTTGTTCACTTTCATTCTGGCCCCCACCATCTTCATATATTTGTCCCACTGTTTCTTGCCCTGATACTCATCATCA
GGAATTCCCCAAATCAGTTTCATTACGTGAGGAAGAAGGCAATTTAATGGCTTCTGATGTGTTTTGGTTCAATAATGACCCAACTGGAGTGAATGAAAAGGACATGCAGC
AGGAAGCAGTGTTGGAGGAGGAAGCTATGGCTATGGCTATGGATGATTTGAAGTCCATGTCCATGGACGTGAAAGCTTTGGAGATTGATTGTCACCATAGTTCTGATAAT
GCTATGGAATTTCCTGATTGGATGAGCATTAATGACGATTCTTTGCAGCAGTATTCGAATTATCATTGTGTAGAGGAAGATTGCCTTCAAGAGCCTGACCTATCCTGCTT
CGACATAGGGAAGATTGAAGATGTGGGTAAAGAATGGTTAGCATGATATAGTATGGTTTCATCTTCTTACCATCCCCTTTATCCAAAATATTATCCCATTCCCTATTTAA
ATCAATACTTTTTTTAATAATTATAATCATTTTATCGTTTCAAAATTTTATTGGAAATATTAAACATGGTGGCTTGGTTAAATATTGATTATTTGATAGATTGAGATTTC
GTATCAAATCAATTGATGGTGAGAATCTATCTTATACAATGGTGGATTCTCAACATCAAATTTGAATATCATAATTTAAACATACCAATATTTAGGGAAAAAAATAGGAC
CGCCAGAATTAATG
Protein sequenceShow/hide protein sequence
MAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFPELGQCFEAEGRRKSKRNPRIYPSCSYDCSFYLENGSGFVAPPPENLNTEIPIQTFDDDFKTLDTCSS
FCSLSFWPPPSSYICPTVSCPDTHHQEFPKSVSLREEEGNLMASDVFWFNNDPTGVNEKDMQQEAVLEEEAMAMAMDDLKSMSMDVKALEIDCHHSSDNAMEFPDWMSIN
DDSLQQYSNYHCVEEDCLQEPDLSCFDIGKIEDVGKEWLA