CuGenDBv2

PI0015158 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0015158
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Serine/arginine repetitive matrix protein 1
Genome location: chr04:2294383..2296903
RNA-Seq Expression: PI0015158
Synteny: PI0015158
Gene Ontology terms: NA
InterPro domains: NA
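
The genome location field above can be split into a sequence name and a coordinate pair when the record is processed by a script. The following is a minimal sketch, not part of CuGenDB itself: the names `Locus` and `parse_location` are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumes 1-based, inclusive coordinates (not confirmed by the source page).
        return self.end - self.start + 1


def parse_location(text: str) -> Locus:
    """Parse a string of the form 'chr04:2294383..2296903'."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Locus(match.group(1), start, end)


locus = parse_location("chr04:2294383..2296903")
print(locus.seqid, locus.start, locus.end, locus.span)
# prints: chr04 2294383 2296903 2521
```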


Homology
GenBank top hits (e-value, % identity, alignment)
KAG7026590.1 hypothetical protein SDJN02_10592, partial [Cucurbita argyrosperma subsp. argyrosperma]; e-value: 5.6e-81; % identity: 63.76
Query:  MGCCLSSPQSFNSSNKFH---------PNSVNGSRDPPSSMEEETVKEVLSETPALKP-PPSPPKKNCPPEEDEFHKPVGD----EIEKKLCEIPINGIP
        MGCC+SS +S +S++KF          P + NGSR+PPSSMEEETVKEVLSET ALKP P SPP K+CPPEEDE  KPVGD    EIEKKL EIPINGI 
Subjt:  MGCCLSSPQSFNSSNKFH---------PNSVNGSRDPPSSMEEETVKEVLSETPALKP-PPSPPKKNCPPEEDEFHKPVGD----EIEKKLCEIPINGIP

Query:  EQPSEFYEISHPNKCISVSAATFTDQTDGGGQFHQTVFKSSPVKLTKNQSIFGDVELKREMPQSRTLTRRSDQSPVRRNGAVGSMRIVHNRDMSPAMARR
        +Q SEF EIS+P+K    + A FTD  DGG + HQ V K+ P     NQSI G+V LKR++  ++TL RRSDQSPVRRN  VGS R+V  RD SPAM  R
Subjt:  EQPSEFYEISHPNKCISVSAATFTDQTDGGGQFHQTVFKSSPVKLTKNQSIFGDVELKREMPQSRTLTRRSDQSPVRRNGAVGSMRIVHNRDMSPAMARR

Query:  GLRAEPPRRDPDENSSRRSQSPSTARSDSAGYRSALSRTPSARKSGKSSPIGAMT------ATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
        GLR EP ++DPDEN  RRS+SP+TAR DS G RSAL RTPS RKSGKSSP+   T      ATS+KVVEENNI +G   TQIESLENPLVSLECFIFL
Subjt:  GLRAEPPRRDPDENSSRRSQSPSTARSDSAGYRSALSRTPSARKSGKSSPIGAMT------ATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL

XP_008441084.1 PREDICTED: uncharacterized protein LOC103485312 [Cucumis melo]; e-value: 1.6e-131; % identity: 90.65
Query:  MGCCLSSPQSFNSSNKFHPNSVNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNCPPEEDEFHKPVGDEIEKKLCEIPINGIPEQPSEFYEISHPNK
        MGCCLSS QSFNS NKFHP+SVN +RDPPSSMEEETVKEVLSETPALKPPP+   KNCPPEEDEFHKP+GDE EKKL EIPINGIPEQPSEFYEISH NK
Subjt:  MGCCLSSPQSFNSSNKFHPNSVNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNCPPEEDEFHKPVGDEIEKKLCEIPINGIPEQPSEFYEISHPNK

Query:  CISVSAATFTDQTDGGGQFHQTVFKSSPVKLTKNQSIFGDVELKREMPQSRTLTRRSDQSPVRRNGAVGSMRIVHNRDMSPAMARRGLRAEPPRRDPDEN
        CISVSAATFTDQ DGGG+ HQT  KSSPVKLTKNQS+  DVELKRE+PQSRTLTRRSDQSPVRRNGAVGSMR+VHNRDMSPAMARRGLRAEPPRRDPDEN
Subjt:  CISVSAATFTDQTDGGGQFHQTVFKSSPVKLTKNQSIFGDVELKREMPQSRTLTRRSDQSPVRRNGAVGSMRIVHNRDMSPAMARRGLRAEPPRRDPDEN

Query:  SSRRSQSPSTARSDSAGYRSALSRTPSARKSGKSSPIGAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
        SSRRSQSPSTA SDSAGYRSALSRTPS RKSGKSSPI AMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
Subjt:  SSRRSQSPSTARSDSAGYRSALSRTPSARKSGKSSPIGAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL

XP_011658203.1 uncharacterized protein LOC105435961 [Cucumis sativus]; e-value: 2.3e-127; % identity: 89.57
Query:  MGCCLSSPQSFNSSNKFHPNSVNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNCPPEEDEFHKPVGDEIEKKLCEIPINGIPEQPSEFYEISHPNK
        MGCCLSS QSF+S NKFH NSVN SRDPPSSMEEETVKEVLSETPALKP   P K N  PE+DEF KP+GDEIEKKL EIPINGIPEQPSEFYEISH NK
Subjt:  MGCCLSSPQSFNSSNKFHPNSVNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNCPPEEDEFHKPVGDEIEKKLCEIPINGIPEQPSEFYEISHPNK

Query:  CISVSAATFTDQTDGGGQFHQTVFKSSPVKLTKNQSIFGDVELKREMPQSRTLTRRSDQSPVRRNGAVGSMRIVHNRDMSPAMARRGLRAEPPRRDPDEN
        CISVSAATFTDQTDGGG+ HQTV KSSPVKLTKNQS+  DVELKRE+ QSRTLTRRSDQSPVRRNGAVGS+R+VHNRDMSPAMARRGLRAEPPRRDPDEN
Subjt:  CISVSAATFTDQTDGGGQFHQTVFKSSPVKLTKNQSIFGDVELKREMPQSRTLTRRSDQSPVRRNGAVGSMRIVHNRDMSPAMARRGLRAEPPRRDPDEN

Query:  SSRRSQSPSTARSDSAGYRSALSRTPSARKSGKSSPIGAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
        SSRRS SPSTARSDSAGYRSALSRTPSARKSGKSSPI AMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
Subjt:  SSRRSQSPSTARSDSAGYRSALSRTPSARKSGKSSPIGAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL

XP_022926404.1 uncharacterized protein LOC111433567 [Cucurbita moschata]; e-value: 5.1e-82; % identity: 64.43
Query:  MGCCLSSPQSFNSSNKFH---------PNSVNGSRDPPSSMEEETVKEVLSETPALKP-PPSPPKKNCPPEEDEFHKPVGD----EIEKKLCEIPINGIP
        MGCC+SS +S +S++KF          P + NGSR+PPSSMEEETVKEVLSET ALKP   SPP KNCPPEEDE  KPVGD    EIEKKL EIPINGI 
Subjt:  MGCCLSSPQSFNSSNKFH---------PNSVNGSRDPPSSMEEETVKEVLSETPALKP-PPSPPKKNCPPEEDEFHKPVGD----EIEKKLCEIPINGIP

Query:  EQPSEFYEISHPNKCISVSAATFTDQTDGGGQFHQTVFKSSPVKLTKNQSIFGDVELKREMPQSRTLTRRSDQSPVRRNGAVGSMRIVHNRDMSPAMARR
        +QPSEF EIS+P+K    + A FTD  DGG + HQ V K+ P     NQSI G+V LKR++  ++TL RRSDQSPVRRN  VGS R+V  RD SPAM  R
Subjt:  EQPSEFYEISHPNKCISVSAATFTDQTDGGGQFHQTVFKSSPVKLTKNQSIFGDVELKREMPQSRTLTRRSDQSPVRRNGAVGSMRIVHNRDMSPAMARR

Query:  GLRAEPPRRDPDENSSRRSQSPSTARSDSAGYRSALSRTPSARKSGKSSPIGAMT------ATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
        GLR EP ++DPDEN  RRS+SP+TAR DS G RSAL RTPS RKSGKSSP+   T      ATS+KVVEENNI DG   TQIESLENPLVSLECFIFL
Subjt:  GLRAEPPRRDPDENSSRRSQSPSTARSDSAGYRSALSRTPSARKSGKSSPIGAMT------ATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL

XP_038882208.1 uncharacterized protein LOC120073430 [Benincasa hispida]; e-value: 1.5e-118; % identity: 82.73
Query:  MGCCLSSPQSFNSSNKFHPNSVNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNCPPEEDEFHKPVGDEIEKKLCEIPINGIPEQPSEFYEISHPNK
        MGCCLSS +SFNS NKFH NS NGSRDPPSSMEEETVKEVLSETP+LKPPPSPPKKN PPEED+  KPVG+EIEKKLCEI INGI E PSEFYEISHPN+
Subjt:  MGCCLSSPQSFNSSNKFHPNSVNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNCPPEEDEFHKPVGDEIEKKLCEIPINGIPEQPSEFYEISHPNK

Query:  CISVSAATFTDQTDGGGQFHQTVFKSSPVKLTKNQSIFGDVELKREMPQSRTLTRRSDQSPVRRNGAVGSMRIVHNRDMSPAMARRGLRAEPPRRDPDEN
        CISVS A  T+Q DGGG+ HQ V KSSPVKL K+QSI GD E+KRE+ Q+RTLTRRSDQSPVRRNGA+GSMR+VHNRDM+PAMARR LRAEPPRRDPDEN
Subjt:  CISVSAATFTDQTDGGGQFHQTVFKSSPVKLTKNQSIFGDVELKREMPQSRTLTRRSDQSPVRRNGAVGSMRIVHNRDMSPAMARRGLRAEPPRRDPDEN

Query:  SSRRSQSPSTARSDSAGYRSALSRTPSARKSGKSSPIGAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
        S RRS+SP+TARSD  G RSALSRTPS RKSGKSSP  A TATSQKVVEENNI+DGKFN+QIESLENPLVSLECFIFL
Subjt:  SSRRSQSPSTARSDSAGYRSALSRTPSARKSGKSSPIGAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KLE9 Uncharacterized protein; e-value: 1.1e-127; % identity: 89.57
Query:  MGCCLSSPQSFNSSNKFHPNSVNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNCPPEEDEFHKPVGDEIEKKLCEIPINGIPEQPSEFYEISHPNK
        MGCCLSS QSF+S NKFH NSVN SRDPPSSMEEETVKEVLSETPALKP   P K N  PE+DEF KP+GDEIEKKL EIPINGIPEQPSEFYEISH NK
Subjt:  MGCCLSSPQSFNSSNKFHPNSVNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNCPPEEDEFHKPVGDEIEKKLCEIPINGIPEQPSEFYEISHPNK

Query:  CISVSAATFTDQTDGGGQFHQTVFKSSPVKLTKNQSIFGDVELKREMPQSRTLTRRSDQSPVRRNGAVGSMRIVHNRDMSPAMARRGLRAEPPRRDPDEN
        CISVSAATFTDQTDGGG+ HQTV KSSPVKLTKNQS+  DVELKRE+ QSRTLTRRSDQSPVRRNGAVGS+R+VHNRDMSPAMARRGLRAEPPRRDPDEN
Subjt:  CISVSAATFTDQTDGGGQFHQTVFKSSPVKLTKNQSIFGDVELKREMPQSRTLTRRSDQSPVRRNGAVGSMRIVHNRDMSPAMARRGLRAEPPRRDPDEN

Query:  SSRRSQSPSTARSDSAGYRSALSRTPSARKSGKSSPIGAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
        SSRRS SPSTARSDSAGYRSALSRTPSARKSGKSSPI AMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
Subjt:  SSRRSQSPSTARSDSAGYRSALSRTPSARKSGKSSPIGAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL

A0A1S3B2L5 uncharacterized protein LOC103485312; e-value: 7.6e-132; % identity: 90.65
Query:  MGCCLSSPQSFNSSNKFHPNSVNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNCPPEEDEFHKPVGDEIEKKLCEIPINGIPEQPSEFYEISHPNK
        MGCCLSS QSFNS NKFHP+SVN +RDPPSSMEEETVKEVLSETPALKPPP+   KNCPPEEDEFHKP+GDE EKKL EIPINGIPEQPSEFYEISH NK
Subjt:  MGCCLSSPQSFNSSNKFHPNSVNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNCPPEEDEFHKPVGDEIEKKLCEIPINGIPEQPSEFYEISHPNK

Query:  CISVSAATFTDQTDGGGQFHQTVFKSSPVKLTKNQSIFGDVELKREMPQSRTLTRRSDQSPVRRNGAVGSMRIVHNRDMSPAMARRGLRAEPPRRDPDEN
        CISVSAATFTDQ DGGG+ HQT  KSSPVKLTKNQS+  DVELKRE+PQSRTLTRRSDQSPVRRNGAVGSMR+VHNRDMSPAMARRGLRAEPPRRDPDEN
Subjt:  CISVSAATFTDQTDGGGQFHQTVFKSSPVKLTKNQSIFGDVELKREMPQSRTLTRRSDQSPVRRNGAVGSMRIVHNRDMSPAMARRGLRAEPPRRDPDEN

Query:  SSRRSQSPSTARSDSAGYRSALSRTPSARKSGKSSPIGAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
        SSRRSQSPSTA SDSAGYRSALSRTPS RKSGKSSPI AMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
Subjt:  SSRRSQSPSTARSDSAGYRSALSRTPSARKSGKSSPIGAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL

A0A5D3CNI1 Putative BEST plant protein match is: (TAIR:plant.1) protein; e-value: 7.6e-132; % identity: 90.65
Query:  MGCCLSSPQSFNSSNKFHPNSVNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNCPPEEDEFHKPVGDEIEKKLCEIPINGIPEQPSEFYEISHPNK
        MGCCLSS QSFNS NKFHP+SVN +RDPPSSMEEETVKEVLSETPALKPPP+   KNCPPEEDEFHKP+GDE EKKL EIPINGIPEQPSEFYEISH NK
Subjt:  MGCCLSSPQSFNSSNKFHPNSVNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNCPPEEDEFHKPVGDEIEKKLCEIPINGIPEQPSEFYEISHPNK

Query:  CISVSAATFTDQTDGGGQFHQTVFKSSPVKLTKNQSIFGDVELKREMPQSRTLTRRSDQSPVRRNGAVGSMRIVHNRDMSPAMARRGLRAEPPRRDPDEN
        CISVSAATFTDQ DGGG+ HQT  KSSPVKLTKNQS+  DVELKRE+PQSRTLTRRSDQSPVRRNGAVGSMR+VHNRDMSPAMARRGLRAEPPRRDPDEN
Subjt:  CISVSAATFTDQTDGGGQFHQTVFKSSPVKLTKNQSIFGDVELKREMPQSRTLTRRSDQSPVRRNGAVGSMRIVHNRDMSPAMARRGLRAEPPRRDPDEN

Query:  SSRRSQSPSTARSDSAGYRSALSRTPSARKSGKSSPIGAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
        SSRRSQSPSTA SDSAGYRSALSRTPS RKSGKSSPI AMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
Subjt:  SSRRSQSPSTARSDSAGYRSALSRTPSARKSGKSSPIGAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL

A0A6J1CMA7 uncharacterized protein LOC111012433; e-value: 7.0e-45; % identity: 60.1
Query:  MGCCLSSPQSFNSSNKFHPNS------VNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNCPPEEDEFHKPVGD-------EIEKKLCEIPINGIPE
        MGCC+SS    NS++KF  NS      +  SR+PPSSMEEETVKEVL+ETPALKPP  PP KN PP+EDE  KPV D       EIEKK+  IP N + E
Subjt:  MGCCLSSPQSFNSSNKFHPNS------VNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNCPPEEDEFHKPVGD-------EIEKKLCEIPINGIPE

Query:  QPSEFYEISHPNKCISVSAATFTDQTDGGGQFHQTVFKSSPVKLTKNQSIFGDVELKREMPQSRTLTRRSDQSPVRRNGAVGSMRIVHNRDMS
           EF EIS P++C+  SAATFTD+ D G + HQ VF++SPVKL KNQS  GDV  KREM  +R L RRSDQSPVRRNG VGS R+  NRDM+
Subjt:  QPSEFYEISHPNKCISVSAATFTDQTDGGGQFHQTVFKSSPVKLTKNQSIFGDVELKREMPQSRTLTRRSDQSPVRRNGAVGSMRIVHNRDMS

A0A6J1EF08 uncharacterized protein LOC111433567; e-value: 2.5e-82; % identity: 64.43
Query:  MGCCLSSPQSFNSSNKFH---------PNSVNGSRDPPSSMEEETVKEVLSETPALKP-PPSPPKKNCPPEEDEFHKPVGD----EIEKKLCEIPINGIP
        MGCC+SS +S +S++KF          P + NGSR+PPSSMEEETVKEVLSET ALKP   SPP KNCPPEEDE  KPVGD    EIEKKL EIPINGI 
Subjt:  MGCCLSSPQSFNSSNKFH---------PNSVNGSRDPPSSMEEETVKEVLSETPALKP-PPSPPKKNCPPEEDEFHKPVGD----EIEKKLCEIPINGIP

Query:  EQPSEFYEISHPNKCISVSAATFTDQTDGGGQFHQTVFKSSPVKLTKNQSIFGDVELKREMPQSRTLTRRSDQSPVRRNGAVGSMRIVHNRDMSPAMARR
        +QPSEF EIS+P+K    + A FTD  DGG + HQ V K+ P     NQSI G+V LKR++  ++TL RRSDQSPVRRN  VGS R+V  RD SPAM  R
Subjt:  EQPSEFYEISHPNKCISVSAATFTDQTDGGGQFHQTVFKSSPVKLTKNQSIFGDVELKREMPQSRTLTRRSDQSPVRRNGAVGSMRIVHNRDMSPAMARR

Query:  GLRAEPPRRDPDENSSRRSQSPSTARSDSAGYRSALSRTPSARKSGKSSPIGAMT------ATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
        GLR EP ++DPDEN  RRS+SP+TAR DS G RSAL RTPS RKSGKSSP+   T      ATS+KVVEENNI DG   TQIESLENPLVSLECFIFL
Subjt:  GLRAEPPRRDPDENSSRRSQSPSTARSDSAGYRSALSRTPSARKSGKSSPIGAMT------ATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT1G11125.1 unknown protein; e-value: 2.4e-05; % identity: 30.74
Query:  MGCCLSSPQSFNSSNKFHPNSVNGSRDPPSSMEEET-VKEVLSETPAL--KPPPSPPKKN--CPPEEDEFHKP--VGDEIEKKLCEIPINGIPEQPSEFY
        MGCCLSS     +  K  P S   +  PPS ++EET VKEVLSET  L      S  +K      +E+E  KP  V D  ++ +   P    PE+ SE  
Subjt:  MGCCLSSPQSFNSSNKFHPNSVNGSRDPPSSMEEET-VKEVLSETPAL--KPPPSPPKKN--CPPEEDEFHKP--VGDEIEKKLCEIPINGIPEQPSEFY

Query:  EISHPNKCISVSAATFTDQTDGGGQFHQTVFKSSPVKLTKNQSIFGDVELKREMPQSRTLTRRSDQSPVRRNG--AVGSMRIVHNRDMSPAMARRGLRAE
        E    ++ + +S     D+ +        V + SP K              R         RR+D SP +RN     GS+R+V +   +           
Subjt:  EISHPNKCISVSAATFTDQTDGGGQFHQTVFKSSPVKLTKNQSIFGDVELKREMPQSRTLTRRSDQSPVRRNG--AVGSMRIVHNRDMSPAMARRGLRAE

Query:  PPRRDPDENSSRRSQSPSTARSDSAGYRSALSRTPSA---RKSGKSSPIGAMTATSQKVVEENNIVD-----GKFNT-QIESLENPLVSLECFIFL
           RD  E S RRS+SP+  RS   G   +   T S    R+  +S     +        +E N        G +N+   +S ENPLVSLECFIFL
Subjt:  PPRRDPDENSSRRSQSPSTARSDSAGYRSALSRTPSA---RKSGKSSPIGAMTATSQKVVEENNIVD-----GKFNT-QIESLENPLVSLECFIFL

AT1G61170.1 unknown protein; e-value: 2.2e-06; % identity: 29.17
Query:  CCLSSPQSFNSSNKFHPNSVNGSRDPPSSMEEET-VKEVLSETPALKPPPS--------PPKKNCPPEEDE---FHKPVGDEIEKKLCEIPINGIPEQPS
        CC+SS  +   +        N S    + +EEET VKEVLSET    P  S        P K     +E++   F K   D +  +    P +  PE+ S
Subjt:  CCLSSPQSFNSSNKFHPNSVNGSRDPPSSMEEET-VKEVLSETPALKPPPS--------PPKKNCPPEEDE---FHKPVGDEIEKKLCEIPINGIPEQPS

Query:  EFYEISHPNKCISVSAATFTDQTDGGGQFHQTVFKSSPVKLTKNQSIFGDVELKREMPQSRTLTRRSDQSPVRRNGAVGSMRIVHNRDMSPAMARRGLRA
        E  EI   +   SVS+    +  D              V + + +S     + + ++  +   TRR+DQSP +RN    +                G R 
Subjt:  EFYEISHPNKCISVSAATFTDQTDGGGQFHQTVFKSSPVKLTKNQSIFGDVELKREMPQSRTLTRRSDQSPVRRNGAVGSMRIVHNRDMSPAMARRGLRA

Query:  EPPRRDPDENSSRRSQSPSTARSDSAGYRSALSRTPSARKSGKSSPIGAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
            RDP E S RRS+SP+T RS     +S+       RK+ + SP       ++  +++    +  + T+ E LENPLVSLECFIFL
Subjt:  EPPRRDPDENSSRRSQSPSTARSDSAGYRSALSRTPSARKSGKSSPIGAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
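
In BLAST-style alignments such as those above, the middle line repeats the residue where query and subject are identical, shows `+` for a conservative substitution, and is blank otherwise. As a rough sketch of how a % identity figure relates to that middle line, the hypothetical function `percent_identity` below counts identical columns over the alignment length. It assumes the query and middle line have already been trimmed to the same length (leading labels and padding removed), and it is illustrated on a short fragment, not used to recompute the values reported on this page.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percentage of alignment columns whose midline letter equals the query residue.

    Assumes `query` and `midline` are the same length and cover the same columns.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    if not query:
        raise ValueError("alignment is empty")
    # A column is an identity when the midline repeats the query residue;
    # '+' (conservative substitution) and ' ' (mismatch or gap) do not count.
    identities = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    return 100.0 * identities / len(query)


# First ten columns of the first GenBank alignment, used only as an illustration.
print(percent_identity("MGCCLSSPQS", "MGCC+SS +S"))
# prints: 70.0
```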


Sequences
CDS sequence
ATGTCATATGGATCTTATACTCGTAGATTACAATTAACATTGAATTACAATGTTCTGACCCGGCCGGCACAAATGGGTTGCTGTCTTAGTTCCCCCCAATCCTTCAATTC
ATCTAACAAATTCCATCCCAATTCTGTCAATGGAAGCAGAGACCCGCCGTCTTCCATGGAGGAAGAGACCGTCAAAGAAGTGCTATCTGAAACCCCTGCTCTCAAACCGC
CGCCCTCGCCACCGAAGAAGAATTGTCCTCCGGAAGAAGATGAATTCCACAAACCAGTCGGTGATGAGATCGAGAAGAAGCTTTGTGAAATTCCCATTAACGGAATTCCA
GAACAGCCTTCTGAATTCTATGAAATTTCCCATCCGAACAAGTGTATCTCAGTCTCCGCCGCTACTTTCACCGATCAAACGGACGGGGGCGGTCAATTTCATCAAACGGT
TTTCAAATCATCGCCAGTGAAGTTGACGAAAAATCAATCAATTTTCGGTGACGTTGAGTTAAAAAGAGAAATGCCGCAGAGCAGGACACTGACCCGGAGATCCGACCAGT
CACCAGTTCGACGAAATGGCGCCGTAGGGTCGATGAGAATTGTTCATAACAGAGACATGAGTCCGGCAATGGCGCGGCGAGGATTGAGAGCGGAGCCTCCCCGGAGAGAC
CCAGATGAGAATTCCAGCCGGAGATCCCAATCCCCGTCTACCGCACGTTCCGACAGCGCAGGGTATAGATCTGCCCTGAGTCGGACACCGTCAGCGAGAAAGTCCGGTAA
ATCATCGCCCATTGGGGCGATGACGGCGACAAGTCAAAAAGTAGTAGAAGAGAACAATATCGTAGATGGAAAATTCAACACTCAGATTGAATCACTTGAGAACCCTCTGG
TTTCATTAGAGTGCTTCATCTTCCTCTGA
mRNA sequence
AATACATTCAATTTTCTAAATTATGTTGTTTCGATTCCGGCTCGACTATGTCATATGGATCTTATACTCGTAGATTACAATTAACATTGAATTACAATGTTCTGACCCGG
CCGGCACAAATGGGTTGCTGTCTTAGTTCCCCCCAATCCTTCAATTCATCTAACAAATTCCATCCCAATTCTGTCAATGGAAGCAGAGACCCGCCGTCTTCCATGGAGGA
AGAGACCGTCAAAGAAGTGCTATCTGAAACCCCTGCTCTCAAACCGCCGCCCTCGCCACCGAAGAAGAATTGTCCTCCGGAAGAAGATGAATTCCACAAACCAGTCGGTG
ATGAGATCGAGAAGAAGCTTTGTGAAATTCCCATTAACGGAATTCCAGAACAGCCTTCTGAATTCTATGAAATTTCCCATCCGAACAAGTGTATCTCAGTCTCCGCCGCT
ACTTTCACCGATCAAACGGACGGGGGCGGTCAATTTCATCAAACGGTTTTCAAATCATCGCCAGTGAAGTTGACGAAAAATCAATCAATTTTCGGTGACGTTGAGTTAAA
AAGAGAAATGCCGCAGAGCAGGACACTGACCCGGAGATCCGACCAGTCACCAGTTCGACGAAATGGCGCCGTAGGGTCGATGAGAATTGTTCATAACAGAGACATGAGTC
CGGCAATGGCGCGGCGAGGATTGAGAGCGGAGCCTCCCCGGAGAGACCCAGATGAGAATTCCAGCCGGAGATCCCAATCCCCGTCTACCGCACGTTCCGACAGCGCAGGG
TATAGATCTGCCCTGAGTCGGACACCGTCAGCGAGAAAGTCCGGTAAATCATCGCCCATTGGGGCGATGACGGCGACAAGTCAAAAAGTAGTAGAAGAGAACAATATCGT
AGATGGAAAATTCAACACTCAGATTGAATCACTTGAGAACCCTCTGGTTTCATTAGAGTGCTTCATCTTCCTCTGATTTTGTGTGTGGGTTTCACTGATTTTTCATTTTT
CTTGTTTGATTTTATTAAAGTTTTTTGAGTGAACATTGAGGTAAATTTAACAACAACGGCGGTGGATCGCCGGAGAATTCATTGAAGATCACCGGAAAGGTAGGTATGTT
GGAAATTAATTTTCAAATTAGAAGTAAATTAATGGAATTAGATTATTTGTTAATGGTGTGATTTGAATAAGTTATTGAGGTAACCGACGTAAAAGAGAAGACAAAATCCG
CCGCATAAATCACTGTTGATCACATGATTTAAGAGGGAATGGGATGTAAAATGTTCATCACATTCATGGGTTTTAGTTTTAATTTTAAGTATAATCCAAAACCGCAACCA
AAAATGGTAGAGTAGCTCAAATTTGTGGCACAAGTTTCAAGTCTAAATCACATTACTTTAAAGATTATTATCTTGAGAAACTTCTAATTCTAAATAATACTCCTCTAGAT
TTGGGTCTCCC
Protein sequence
MSYGSYTRRLQLTLNYNVLTRPAQMGCCLSSPQSFNSSNKFHPNSVNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNCPPEEDEFHKPVGDEIEKKLCEIPINGIP
EQPSEFYEISHPNKCISVSAATFTDQTDGGGQFHQTVFKSSPVKLTKNQSIFGDVELKREMPQSRTLTRRSDQSPVRRNGAVGSMRIVHNRDMSPAMARRGLRAEPPRRD
PDENSSRRSQSPSTARSDSAGYRSALSRTPSARKSGKSSPIGAMTATSQKVVEENNIVDGKFNTQIESLENPLVSLECFIFL
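
The protein sequence above is the conceptual translation of the CDS sequence. The sketch below shows one way to check that relationship, assuming the standard nuclear genetic code applies; `translate` and the table names are hypothetical helpers written for this illustration, and only the first and last few codons of the CDS are used as input.

```python
# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence text
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for pos in range(0, len(seq), 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X for ambiguous codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGTCATATGGATCTTATACTCGTAGA"))  # first nine codons of the CDS
# prints: MSYGSYTRR
print(translate("TTCATCTTCCTCTGA"))              # last five codons of the CDS
# prints: FIFL
```

Both fragments agree with the start (`MSYGSYTRR...`) and end (`...FIFL`) of the protein sequence listed above.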