; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0006224 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0006224
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionSerine/arginine repetitive matrix protein 1
Genome locationchr08:2549095..2550645
RNA-Seq ExpressionPay0006224
SyntenyPay0006224
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7026590.1 hypothetical protein SDJN02_10592, partial [Cucurbita argyrosperma subsp. argyrosperma]2.2e-7661.41Show/hide
Query:  MGCCLSSSQSFNSPNKFH---------PSSVNANRDPPSSMEEETVKEVLSETPALKP----PPNKNCPPEEDEFHKPLGD----ETEKKLSEIPINGIP
        MGCC+SS +S +S +KF          P + N +R+PPSSMEEETVKEVLSET ALKP    PP K+CPPEEDE  KP+GD    E EKKL EIPINGI 
Subjt:  MGCCLSSSQSFNSPNKFH---------PSSVNANRDPPSSMEEETVKEVLSETPALKP----PPNKNCPPEEDEFHKPLGD----ETEKKLSEIPINGIP

Query:  EQPSEFYEISHMNKCISVSAATFTDQADGGGEVHQTGLKSSPVKLTKNQSVSSDVELKREIPQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARR
        +Q SEF EIS+ +K    + A FTD  DGG EVHQ  LK+ P     NQS+  +V LKR++  ++TL RRSDQSPVRRN  VGS R+V  RD SPAM  R
Subjt:  EQPSEFYEISHMNKCISVSAATFTDQADGGGEVHQTGLKSSPVKLTKNQSVSSDVELKREIPQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARR

Query:  GLRAEPPRRDPDENSSRRSQSPSTAPSDSAGYRSALSRTPSTRKSGKSSPIRAMT------ATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
        GLR EP ++DPDEN  RRS+SP+TA  DS G RSAL RTPS RKSGKSSP+R  T      ATS+KVVEENNI +G   TQIESLENPLVSLECFIFL
Subjt:  GLRAEPPRRDPDENSSRRSQSPSTAPSDSAGYRSALSRTPSTRKSGKSSPIRAMT------ATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL

XP_008441084.1 PREDICTED: uncharacterized protein LOC103485312 [Cucumis melo]2.3e-145100Show/hide
Query:  MGCCLSSSQSFNSPNKFHPSSVNANRDPPSSMEEETVKEVLSETPALKPPPNKNCPPEEDEFHKPLGDETEKKLSEIPINGIPEQPSEFYEISHMNKCIS
        MGCCLSSSQSFNSPNKFHPSSVNANRDPPSSMEEETVKEVLSETPALKPPPNKNCPPEEDEFHKPLGDETEKKLSEIPINGIPEQPSEFYEISHMNKCIS
Subjt:  MGCCLSSSQSFNSPNKFHPSSVNANRDPPSSMEEETVKEVLSETPALKPPPNKNCPPEEDEFHKPLGDETEKKLSEIPINGIPEQPSEFYEISHMNKCIS

Query:  VSAATFTDQADGGGEVHQTGLKSSPVKLTKNQSVSSDVELKREIPQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDENSSR
        VSAATFTDQADGGGEVHQTGLKSSPVKLTKNQSVSSDVELKREIPQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDENSSR
Subjt:  VSAATFTDQADGGGEVHQTGLKSSPVKLTKNQSVSSDVELKREIPQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDENSSR

Query:  RSQSPSTAPSDSAGYRSALSRTPSTRKSGKSSPIRAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
        RSQSPSTAPSDSAGYRSALSRTPSTRKSGKSSPIRAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
Subjt:  RSQSPSTAPSDSAGYRSALSRTPSTRKSGKSSPIRAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL

XP_011658203.1 uncharacterized protein LOC105435961 [Cucumis sativus]4.6e-13092Show/hide
Query:  MGCCLSSSQSFNSPNKFHPSSVNANRDPPSSMEEETVKEVLSETPALKPPPNKNCPPEEDEFHKPLGDETEKKLSEIPINGIPEQPSEFYEISHMNKCIS
        MGCCLSS+QSF+SPNKFH +SVN +RDPPSSMEEETVKEVLSETPALKPP   N  PE+DEF KPLGDE EKKLSEIPINGIPEQPSEFYEISHMNKCIS
Subjt:  MGCCLSSSQSFNSPNKFHPSSVNANRDPPSSMEEETVKEVLSETPALKPPPNKNCPPEEDEFHKPLGDETEKKLSEIPINGIPEQPSEFYEISHMNKCIS

Query:  VSAATFTDQADGGGEVHQTGLKSSPVKLTKNQSVSSDVELKREIPQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDENSSR
        VSAATFTDQ DGGGEVHQT LKSSPVKLTKNQSVSSDVELKREI QSRTLTRRSDQSPVRRNGAVGS+RMVHNRDMSPAMARRGLRAEPPRRDPDENSSR
Subjt:  VSAATFTDQADGGGEVHQTGLKSSPVKLTKNQSVSSDVELKREIPQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDENSSR

Query:  RSQSPSTAPSDSAGYRSALSRTPSTRKSGKSSPIRAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
        RS SPSTA SDSAGYRSALSRTPS RKSGKSSPI AMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
Subjt:  RSQSPSTAPSDSAGYRSALSRTPSTRKSGKSSPIRAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL

XP_022926404.1 uncharacterized protein LOC111433567 [Cucurbita moschata]2.4e-7862.42Show/hide
Query:  MGCCLSSSQSFNSPNKFH---------PSSVNANRDPPSSMEEETVKEVLSETPALKP----PPNKNCPPEEDEFHKPLGD----ETEKKLSEIPINGIP
        MGCC+SS +S +S +KF          P + N +R+PPSSMEEETVKEVLSET ALKP    PP KNCPPEEDE  KP+GD    E EKKL EIPINGI 
Subjt:  MGCCLSSSQSFNSPNKFH---------PSSVNANRDPPSSMEEETVKEVLSETPALKP----PPNKNCPPEEDEFHKPLGD----ETEKKLSEIPINGIP

Query:  EQPSEFYEISHMNKCISVSAATFTDQADGGGEVHQTGLKSSPVKLTKNQSVSSDVELKREIPQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARR
        +QPSEF EIS+ +K    + A FTD  DGG EVHQ  LK+ P     NQS+  +V LKR++  ++TL RRSDQSPVRRN  VGS R+V  RD SPAM  R
Subjt:  EQPSEFYEISHMNKCISVSAATFTDQADGGGEVHQTGLKSSPVKLTKNQSVSSDVELKREIPQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARR

Query:  GLRAEPPRRDPDENSSRRSQSPSTAPSDSAGYRSALSRTPSTRKSGKSSPIRAMT------ATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
        GLR EP ++DPDEN  RRS+SP+TA  DS G RSAL RTPS RKSGKSSP+R  T      ATS+KVVEENNI DG   TQIESLENPLVSLECFIFL
Subjt:  GLRAEPPRRDPDENSSRRSQSPSTAPSDSAGYRSALSRTPSTRKSGKSSPIRAMT------ATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL

XP_038882208.1 uncharacterized protein LOC120073430 [Benincasa hispida]3.6e-11179.86Show/hide
Query:  MGCCLSSSQSFNSPNKFHPSSVNANRDPPSSMEEETVKEVLSETPALKPPPN---KNCPPEEDEFHKPLGDETEKKLSEIPINGIPEQPSEFYEISHMNK
        MGCCLSS +SFNSPNKFH +S N +RDPPSSMEEETVKEVLSETP+LKPPP+   KN PPEED+  KP+G+E EKKL EI INGI E PSEFYEISH N+
Subjt:  MGCCLSSSQSFNSPNKFHPSSVNANRDPPSSMEEETVKEVLSETPALKPPPN---KNCPPEEDEFHKPLGDETEKKLSEIPINGIPEQPSEFYEISHMNK

Query:  CISVSAATFTDQADGGGEVHQTGLKSSPVKLTKNQSVSSDVELKREIPQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDEN
        CISVS A  T+Q DGGGE+HQ  LKSSPVKL K+QS+S D E+KREI Q+RTLTRRSDQSPVRRNGA+GSMRMVHNRDM+PAMARR LRAEPPRRDPDEN
Subjt:  CISVSAATFTDQADGGGEVHQTGLKSSPVKLTKNQSVSSDVELKREIPQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDEN

Query:  SSRRSQSPSTAPSDSAGYRSALSRTPSTRKSGKSSPIRAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
        S RRS+SP+TA SD  G RSALSRTPS RKSGKSSP RA TATSQKVVEENNI+DGKFN+QIESLENPLVSLECFIFL
Subjt:  SSRRSQSPSTAPSDSAGYRSALSRTPSTRKSGKSSPIRAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL

TrEMBL top hitse value%identityAlignment
A0A0A0KLE9 Uncharacterized protein2.2e-13092Show/hide
Query:  MGCCLSSSQSFNSPNKFHPSSVNANRDPPSSMEEETVKEVLSETPALKPPPNKNCPPEEDEFHKPLGDETEKKLSEIPINGIPEQPSEFYEISHMNKCIS
        MGCCLSS+QSF+SPNKFH +SVN +RDPPSSMEEETVKEVLSETPALKPP   N  PE+DEF KPLGDE EKKLSEIPINGIPEQPSEFYEISHMNKCIS
Subjt:  MGCCLSSSQSFNSPNKFHPSSVNANRDPPSSMEEETVKEVLSETPALKPPPNKNCPPEEDEFHKPLGDETEKKLSEIPINGIPEQPSEFYEISHMNKCIS

Query:  VSAATFTDQADGGGEVHQTGLKSSPVKLTKNQSVSSDVELKREIPQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDENSSR
        VSAATFTDQ DGGGEVHQT LKSSPVKLTKNQSVSSDVELKREI QSRTLTRRSDQSPVRRNGAVGS+RMVHNRDMSPAMARRGLRAEPPRRDPDENSSR
Subjt:  VSAATFTDQADGGGEVHQTGLKSSPVKLTKNQSVSSDVELKREIPQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDENSSR

Query:  RSQSPSTAPSDSAGYRSALSRTPSTRKSGKSSPIRAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
        RS SPSTA SDSAGYRSALSRTPS RKSGKSSPI AMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
Subjt:  RSQSPSTAPSDSAGYRSALSRTPSTRKSGKSSPIRAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL

A0A1S3B2L5 uncharacterized protein LOC1034853121.1e-145100Show/hide
Query:  MGCCLSSSQSFNSPNKFHPSSVNANRDPPSSMEEETVKEVLSETPALKPPPNKNCPPEEDEFHKPLGDETEKKLSEIPINGIPEQPSEFYEISHMNKCIS
        MGCCLSSSQSFNSPNKFHPSSVNANRDPPSSMEEETVKEVLSETPALKPPPNKNCPPEEDEFHKPLGDETEKKLSEIPINGIPEQPSEFYEISHMNKCIS
Subjt:  MGCCLSSSQSFNSPNKFHPSSVNANRDPPSSMEEETVKEVLSETPALKPPPNKNCPPEEDEFHKPLGDETEKKLSEIPINGIPEQPSEFYEISHMNKCIS

Query:  VSAATFTDQADGGGEVHQTGLKSSPVKLTKNQSVSSDVELKREIPQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDENSSR
        VSAATFTDQADGGGEVHQTGLKSSPVKLTKNQSVSSDVELKREIPQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDENSSR
Subjt:  VSAATFTDQADGGGEVHQTGLKSSPVKLTKNQSVSSDVELKREIPQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDENSSR

Query:  RSQSPSTAPSDSAGYRSALSRTPSTRKSGKSSPIRAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
        RSQSPSTAPSDSAGYRSALSRTPSTRKSGKSSPIRAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
Subjt:  RSQSPSTAPSDSAGYRSALSRTPSTRKSGKSSPIRAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL

A0A5D3CNI1 Putative BEST plant protein match is: (TAIR:plant.1) protein1.1e-145100Show/hide
Query:  MGCCLSSSQSFNSPNKFHPSSVNANRDPPSSMEEETVKEVLSETPALKPPPNKNCPPEEDEFHKPLGDETEKKLSEIPINGIPEQPSEFYEISHMNKCIS
        MGCCLSSSQSFNSPNKFHPSSVNANRDPPSSMEEETVKEVLSETPALKPPPNKNCPPEEDEFHKPLGDETEKKLSEIPINGIPEQPSEFYEISHMNKCIS
Subjt:  MGCCLSSSQSFNSPNKFHPSSVNANRDPPSSMEEETVKEVLSETPALKPPPNKNCPPEEDEFHKPLGDETEKKLSEIPINGIPEQPSEFYEISHMNKCIS

Query:  VSAATFTDQADGGGEVHQTGLKSSPVKLTKNQSVSSDVELKREIPQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDENSSR
        VSAATFTDQADGGGEVHQTGLKSSPVKLTKNQSVSSDVELKREIPQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDENSSR
Subjt:  VSAATFTDQADGGGEVHQTGLKSSPVKLTKNQSVSSDVELKREIPQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDENSSR

Query:  RSQSPSTAPSDSAGYRSALSRTPSTRKSGKSSPIRAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
        RSQSPSTAPSDSAGYRSALSRTPSTRKSGKSSPIRAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
Subjt:  RSQSPSTAPSDSAGYRSALSRTPSTRKSGKSSPIRAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL

A0A6J1CMA7 uncharacterized protein LOC1110124334.7e-4057.59Show/hide
Query:  MGCCLSSSQSFNSPNKFHPSSVNA------NRDPPSSMEEETVKEVLSETPALK-PPPNKNCPPEEDEFHKPLGD-------ETEKKLSEIPINGIPEQP
        MGCC+SS    NS +KF  +S  A      +R+PPSSMEEETVKEVL+ETPALK PPP KN PP+EDE  KP+ D       E EKK+  IP N + E  
Subjt:  MGCCLSSSQSFNSPNKFHPSSVNA------NRDPPSSMEEETVKEVLSETPALK-PPPNKNCPPEEDEFHKPLGD-------ETEKKLSEIPINGIPEQP

Query:  SEFYEISHMNKCISVSAATFTDQADGGGEVHQTGLKSSPVKLTKNQSVSSDVELKREIPQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMS
         EF EIS  ++C+  SAATFTD+ D G EVHQ   ++SPVKL KNQS S DV  KRE+  +R L RRSDQSPVRRNG VGS R+  NRDM+
Subjt:  SEFYEISHMNKCISVSAATFTDQADGGGEVHQTGLKSSPVKLTKNQSVSSDVELKREIPQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMS

A0A6J1EF08 uncharacterized protein LOC1114335671.2e-7862.42Show/hide
Query:  MGCCLSSSQSFNSPNKFH---------PSSVNANRDPPSSMEEETVKEVLSETPALKP----PPNKNCPPEEDEFHKPLGD----ETEKKLSEIPINGIP
        MGCC+SS +S +S +KF          P + N +R+PPSSMEEETVKEVLSET ALKP    PP KNCPPEEDE  KP+GD    E EKKL EIPINGI 
Subjt:  MGCCLSSSQSFNSPNKFH---------PSSVNANRDPPSSMEEETVKEVLSETPALKP----PPNKNCPPEEDEFHKPLGD----ETEKKLSEIPINGIP

Query:  EQPSEFYEISHMNKCISVSAATFTDQADGGGEVHQTGLKSSPVKLTKNQSVSSDVELKREIPQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARR
        +QPSEF EIS+ +K    + A FTD  DGG EVHQ  LK+ P     NQS+  +V LKR++  ++TL RRSDQSPVRRN  VGS R+V  RD SPAM  R
Subjt:  EQPSEFYEISHMNKCISVSAATFTDQADGGGEVHQTGLKSSPVKLTKNQSVSSDVELKREIPQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARR

Query:  GLRAEPPRRDPDENSSRRSQSPSTAPSDSAGYRSALSRTPSTRKSGKSSPIRAMT------ATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
        GLR EP ++DPDEN  RRS+SP+TA  DS G RSAL RTPS RKSGKSSP+R  T      ATS+KVVEENNI DG   TQIESLENPLVSLECFIFL
Subjt:  GLRAEPPRRDPDENSSRRSQSPSTAPSDSAGYRSALSRTPSTRKSGKSSPIRAMT------ATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G11125.1 unknown protein4.4e-0630.85Show/hide
Query:  MGCCLSSSQSFNSPNKFHPSSVNANRDPPSSMEEET-VKEVLSETPALKPPPNKNCPPEEDEFHKPLGDETEKK------LSEIPINGIPE--QPSEFYE
        MGCCLSS+    +  K  P S      PPS ++EET VKEVLSET  L    N N   E+    K + +E EKK      +++ P+   P   +P +  E
Subjt:  MGCCLSSSQSFNSPNKFHPSSVNANRDPPSSMEEET-VKEVLSETPALKPPPNKNCPPEEDEFHKPLGDETEKK------LSEIPINGIPE--QPSEFYE

Query:  ISHMNKCISVSAATFTDQADGGGEVHQTGLKSSPVKLTKNQSVSSDVELKREIPQSRTLTRRSDQSPVRRNG--AVGSMRMVHNRDMSPAMARRGLRAEP
        +S  N  +S S  +  ++ D   E     +KS  V+  +     S   +    P      RR+D SP +RN     GS+R+V +   +            
Subjt:  ISHMNKCISVSAATFTDQADGGGEVHQTGLKSSPVKLTKNQSVSSDVELKREIPQSRTLTRRSDQSPVRRNG--AVGSMRMVHNRDMSPAMARRGLRAEP

Query:  PRRDPDENSSRRSQSPSTAPSDSAGYRSALSRTPS----TRKSGKSSPIRA---MTATSQKVVEEN--NIVDGKFNTQIESLENPLVSLECFIFL
          RD  E S RRS+SP+   S   G   +   T S     R+S     +R    M  + Q+         +    +   +S ENPLVSLECFIFL
Subjt:  PRRDPDENSSRRSQSPSTAPSDSAGYRSALSRTPS----TRKSGKSSPIRA---MTATSQKVVEEN--NIVDGKFNTQIESLENPLVSLECFIFL

AT1G61170.1 unknown protein6.6e-1029.72Show/hide
Query:  MGCCLSSSQSFNSPNKFHPSSVNANRDPPSSMEEET-VKEVLSETPALKPPPNKNCPPEEDEFHKPLGDETEKKLSEI----------PINGIPEQPSEF
        MG C  SS + +  N+      N +    + +EEET VKEVLSET    P  +      +D     + ++ EKK   +          P +  PE+ SE 
Subjt:  MGCCLSSSQSFNSPNKFHPSSVNANRDPPSSMEEET-VKEVLSETPALKPPPNKNCPPEEDEFHKPLGDETEKKLSEI----------PINGIPEQPSEF

Query:  YEISHMNKCISVSAATFTDQADGGGEVHQTGLKSSPVKLTKNQSVSSDVELKREIPQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARRGLRAEP
         EI  ++   SVS+            V   G     V + + +S  S  + + ++  +   TRR+DQSP +RN    +                G R   
Subjt:  YEISHMNKCISVSAATFTDQADGGGEVHQTGLKSSPVKLTKNQSVSSDVELKREIPQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARRGLRAEP

Query:  PRRDPDENSSRRSQSPSTAPSDSAGYRSALSRTPSTRKSGKSSPIRAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
          RDP E S RRS+SP+T  S     +S+      TRK+ + SP R     ++  +++    +  + T+ E LENPLVSLECFIFL
Subjt:  PRRDPDENSSRRSQSPSTAPSDSAGYRSALSRTPSTRKSGKSSPIRAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTGCTGTCTTAGTTCCTCCCAATCCTTCAATTCACCTAACAAATTCCATCCCAGTTCCGTCAATGCGAACAGAGACCCCCCGTCTTCCATGGAGGAAGAG
ACCGTCAAAGAAGTGCTCTCTGAAACCCCTGCTCTCAAACCGCCGCCGAACAAGAATTGTCCACCCGAAGAAGACGAATTTCACAAACCACTCGGTGATGAGACC
GAGAAGAAGCTTTCTGAAATTCCCATTAACGGAATTCCAGAGCAGCCTTCTGAATTCTATGAAATTTCCCATATGAACAAGTGTATCTCAGTCTCCGCCGCTACT
TTCACCGATCAAGCGGACGGGGGAGGTGAAGTTCATCAAACGGGTTTGAAATCATCGCCAGTGAAGTTGACGAAAAATCAATCTGTTTCCAGTGACGTTGAGTTA
AAAAGAGAAATTCCGCAGAGCAGGACACTGACCCGGAGATCCGACCAGTCACCAGTTCGACGAAATGGCGCAGTGGGTTCGATGAGAATGGTTCATAACAGAGAC
ATGAGTCCGGCAATGGCGCGGCGAGGATTGAGAGCGGAGCCTCCCCGGAGAGACCCAGATGAGAATTCCAGCCGGAGATCCCAATCGCCGTCTACCGCTCCTTCC
GACAGCGCAGGGTATAGATCTGCCCTCAGTCGGACACCGTCAACGAGAAAGTCCGGTAAATCATCCCCCATTAGGGCGATGACAGCGACAAGTCAAAAAGTAGTA
GAAGAAAACAATATCGTAGATGGAAAATTCAACACTCAGATTGAATCACTTGAGAACCCTCTGGTTTCATTAGAGTGCTTCATCTTCCTCTGA
mRNA sequenceShow/hide mRNA sequence
TGTGTGTGTGTGTGTGTATTATTAGAGCTAAGATCAAATTCATGTGTTGCTCACAAATCCGCCGCCCATTACATATAAATCAAATCAAAACGATGGACACACATC
ACGAGCCCCTAATTCCTTATTTATTAATTTATGCATCTCATCAATTGGTATAAAAAGGTTAAGTTTTTTATTTTATTTTTTTCTTTTTCCCTAATTAATTAATTC
GTACAATCCGCCGTAAGTGGTTGTGTCTTTAAAATTCAATTCATTGATATTTCACACTCTGAATTCCCAAATCTTCTACCTCTCCTCAAGAGTCAAAATTACAAT
GTTCTGAAGGCCGCCCTCGCAAATGGGTTGCTGTCTTAGTTCCTCCCAATCCTTCAATTCACCTAACAAATTCCATCCCAGTTCCGTCAATGCGAACAGAGACCC
CCCGTCTTCCATGGAGGAAGAGACCGTCAAAGAAGTGCTCTCTGAAACCCCTGCTCTCAAACCGCCGCCGAACAAGAATTGTCCACCCGAAGAAGACGAATTTCA
CAAACCACTCGGTGATGAGACCGAGAAGAAGCTTTCTGAAATTCCCATTAACGGAATTCCAGAGCAGCCTTCTGAATTCTATGAAATTTCCCATATGAACAAGTG
TATCTCAGTCTCCGCCGCTACTTTCACCGATCAAGCGGACGGGGGAGGTGAAGTTCATCAAACGGGTTTGAAATCATCGCCAGTGAAGTTGACGAAAAATCAATC
TGTTTCCAGTGACGTTGAGTTAAAAAGAGAAATTCCGCAGAGCAGGACACTGACCCGGAGATCCGACCAGTCACCAGTTCGACGAAATGGCGCAGTGGGTTCGAT
GAGAATGGTTCATAACAGAGACATGAGTCCGGCAATGGCGCGGCGAGGATTGAGAGCGGAGCCTCCCCGGAGAGACCCAGATGAGAATTCCAGCCGGAGATCCCA
ATCGCCGTCTACCGCTCCTTCCGACAGCGCAGGGTATAGATCTGCCCTCAGTCGGACACCGTCAACGAGAAAGTCCGGTAAATCATCCCCCATTAGGGCGATGAC
AGCGACAAGTCAAAAAGTAGTAGAAGAAAACAATATCGTAGATGGAAAATTCAACACTCAGATTGAATCACTTGAGAACCCTCTGGTTTCATTAGAGTGCTTCAT
CTTCCTCTGATTTTGTGTGTGGGTTTCACTTGATCTTTCATTTTTCTTGTTTGATGTTATTAAAGTTTTTTGAGTGAACATTGAGGTAAATTTAACAACAACGGC
GGGGGATCGCCGGAGAATTCATTGAAGGTCACCGGAAAGGTTGGTATGTTGGAAATTTAATTTTCAAATTAGAAGTAAATTAATGGAATTAGATTTATTTGTCAA
TGGTTGTGATTTGAATAAGGAATTGAGGTAACCGACGTAAAAGAGAAGACAAAATCCGCCGCATAAATCACTGTTGATCACATGGTTTAAGAGGGAATGGGCTGT
AAAATGTTAATCACATTGATGATGGGTTTTAGTTTTAATTTTGAAGTATAATCCAAAACCGCAACCAAAAATGGTAGAGCC
Protein sequenceShow/hide protein sequence
MGCCLSSSQSFNSPNKFHPSSVNANRDPPSSMEEETVKEVLSETPALKPPPNKNCPPEEDEFHKPLGDETEKKLSEIPINGIPEQPSEFYEISHMNKCISVSAAT
FTDQADGGGEVHQTGLKSSPVKLTKNQSVSSDVELKREIPQSRTLTRRSDQSPVRRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDENSSRRSQSPSTAPS
DSAGYRSALSRTPSTRKSGKSSPIRAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL