; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC01G022400 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC01G022400
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionSerine/arginine repetitive matrix protein 1
Genome locationCiama_Chr01:34732582..34733412
RNA-Seq ExpressionCaUC01G022400
SyntenyCaUC01G022400
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7026590.1 hypothetical protein SDJN02_10592, partial [Cucurbita argyrosperma subsp. argyrosperma]1.1e-8065.2Show/hide
Query:  MGCCISSGKSFHSANKFHRN---------SDNGSRDPPSSMEEETVKEVLSETPALKP-PPSPPKKNFPPEEDKAQKPVGN----EIEKKLCEIPINGIA
        MGCC+SSGKS  SA+KF            +DNGSR+PPSSMEEETVKEVLSET ALKP P SPP K+ PPEED+AQKPVG+    EIEKKL EIPINGI 
Subjt:  MGCCISSGKSFHSANKFHRN---------SDNGSRDPPSSMEEETVKEVLSETPALKP-PPSPPKKNFPPEEDKAQKPVGN----EIEKKLCEIPINGIA

Query:  QPPSEFYEISHPNDCLSVS--ADQMNGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRDTSPAMARRGL
        Q  SEF EIS+P+   + +   D M+GG EV+Q VLK+ P     NQSI  +V LKR++S N+TL RRSDQSPVRRN  VGS R+V  RD SPAM  RGL
Subjt:  QPPSEFYEISHPNDCLSVS--ADQMNGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRDTSPAMARRGL

Query:  RAEPPRRDPDENSSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAAT----GTATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL
        R EP ++DPDEN  RR+RSPATAR D  GSRSAL RTPSVRKSGKSSP+R  T      ATS+KVV ENNI +G   TQIESLENPLVSLECFIFL
Subjt:  RAEPPRRDPDENSSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAAT----GTATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL

XP_008441084.1 PREDICTED: uncharacterized protein LOC103485312 [Cucumis melo]2.0e-10980.36Show/hide
Query:  MGCCISSGKSFHSANKFHRNSDNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNFPPEEDKAQKPVGNEIEKKLCEIPINGIAQPPSEFYEISHPND
        MGCC+SS +SF+S NKFH +S N +RDPPSSMEEETVKEVLSETPALKPPP+   KN PPEED+  KP+G+E EKKL EIPINGI + PSEFYEISH N 
Subjt:  MGCCISSGKSFHSANKFHRNSDNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNFPPEEDKAQKPVGNEIEKKLCEIPINGIAQPPSEFYEISHPND

Query:  CLSVSA----DQMNGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRDTSPAMARRGLRAEPPRRDPDEN
        C+SVSA    DQ +GGGEV+QT LKSSPVKLTKNQS+S DVELKREI Q+RTLTRRSDQSPVRRNG VGSMRMVHNRD SPAMARRGLRAEPPRRDPDEN
Subjt:  CLSVSA----DQMNGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRDTSPAMARRGLRAEPPRRDPDEN

Query:  SSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAATGTATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL
        SSRR++SP+TA  D AG RSALSRTPS RKSGKSSPIRA   TATSQKVV ENNI+DGKFNTQIESLENPLVSLECFIFL
Subjt:  SSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAATGTATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL

XP_011658203.1 uncharacterized protein LOC105435961 [Cucumis sativus]2.2e-10880.36Show/hide
Query:  MGCCISSGKSFHSANKFHRNSDNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNFPPEEDKAQKPVGNEIEKKLCEIPINGIAQPPSEFYEISHPND
        MGCC+SS +SF S NKFH NS N SRDPPSSMEEETVKEVLSETPALKP   P K N  PE+D+ +KP+G+EIEKKL EIPINGI + PSEFYEISH N 
Subjt:  MGCCISSGKSFHSANKFHRNSDNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNFPPEEDKAQKPVGNEIEKKLCEIPINGIAQPPSEFYEISHPND

Query:  CLSVSA----DQMNGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRDTSPAMARRGLRAEPPRRDPDEN
        C+SVSA    DQ +GGGEV+QTVLKSSPVKLTKNQS+S DVELKREI Q+RTLTRRSDQSPVRRNG VGS+RMVHNRD SPAMARRGLRAEPPRRDPDEN
Subjt:  CLSVSA----DQMNGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRDTSPAMARRGLRAEPPRRDPDEN

Query:  SSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAATGTATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL
        SSRR+ SP+TAR D AG RSALSRTPS RKSGKSSPI A   TATSQKVV ENNI+DGKFNTQIESLENPLVSLECFIFL
Subjt:  SSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAATGTATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL

XP_022926404.1 uncharacterized protein LOC111433567 [Cucurbita moschata]1.0e-8165.88Show/hide
Query:  MGCCISSGKSFHSANKFHRN---------SDNGSRDPPSSMEEETVKEVLSETPALKP-PPSPPKKNFPPEEDKAQKPVGN----EIEKKLCEIPINGIA
        MGCC+SSGKS  SA+KF            +DNGSR+PPSSMEEETVKEVLSET ALKP   SPP KN PPEED+AQKPVG+    EIEKKL EIPINGI 
Subjt:  MGCCISSGKSFHSANKFHRN---------SDNGSRDPPSSMEEETVKEVLSETPALKP-PPSPPKKNFPPEEDKAQKPVGN----EIEKKLCEIPINGIA

Query:  QPPSEFYEISHPNDCLSVS--ADQMNGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRDTSPAMARRGL
        Q PSEF EIS+P+   + +   D M+GG EV+Q VLK+ P     NQSI  +V LKR++S N+TL RRSDQSPVRRN  VGS R+V  RD SPAM  RGL
Subjt:  QPPSEFYEISHPNDCLSVS--ADQMNGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRDTSPAMARRGL

Query:  RAEPPRRDPDENSSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAAT----GTATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL
        R EP ++DPDEN  RR+RSPATAR D  GSRSAL RTPSVRKSGKSSP+R  T      ATS+KVV ENNI DG   TQIESLENPLVSLECFIFL
Subjt:  RAEPPRRDPDENSSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAAT----GTATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL

XP_038882208.1 uncharacterized protein LOC120073430 [Benincasa hispida]7.4e-12085Show/hide
Query:  MGCCISSGKSFHSANKFHRNSDNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNFPPEEDKAQKPVGNEIEKKLCEIPINGIAQPPSEFYEISHPND
        MGCC+SSGKSF+S NKFHRNSDNGSRDPPSSMEEETVKEVLSETP+LKPPPSPPKKN PPEED+  KPVGNEIEKKLCEI INGIA+ PSEFYEISHPN+
Subjt:  MGCCISSGKSFHSANKFHRNSDNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNFPPEEDKAQKPVGNEIEKKLCEIPINGIAQPPSEFYEISHPND

Query:  CLSVS----ADQMNGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRDTSPAMARRGLRAEPPRRDPDEN
        C+SVS     +QM+GGGE++Q VLKSSPVKL K+QSIS D E+KREISQNRTLTRRSDQSPVRRNG +GSMRMVHNRD +PAMARR LRAEPPRRDPDEN
Subjt:  CLSVS----ADQMNGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRDTSPAMARRGLRAEPPRRDPDEN

Query:  SSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAATGTATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL
        S RR+RSPATAR DG GSRSALSRTPSVRKSGKSSP RAA  TATSQKVV ENNIIDGKFN+QIESLENPLVSLECFIFL
Subjt:  SSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAATGTATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL

TrEMBL top hitse value%identityAlignment
A0A0A0KLE9 Uncharacterized protein1.1e-10880.36Show/hide
Query:  MGCCISSGKSFHSANKFHRNSDNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNFPPEEDKAQKPVGNEIEKKLCEIPINGIAQPPSEFYEISHPND
        MGCC+SS +SF S NKFH NS N SRDPPSSMEEETVKEVLSETPALKP   P K N  PE+D+ +KP+G+EIEKKL EIPINGI + PSEFYEISH N 
Subjt:  MGCCISSGKSFHSANKFHRNSDNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNFPPEEDKAQKPVGNEIEKKLCEIPINGIAQPPSEFYEISHPND

Query:  CLSVSA----DQMNGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRDTSPAMARRGLRAEPPRRDPDEN
        C+SVSA    DQ +GGGEV+QTVLKSSPVKLTKNQS+S DVELKREI Q+RTLTRRSDQSPVRRNG VGS+RMVHNRD SPAMARRGLRAEPPRRDPDEN
Subjt:  CLSVSA----DQMNGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRDTSPAMARRGLRAEPPRRDPDEN

Query:  SSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAATGTATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL
        SSRR+ SP+TAR D AG RSALSRTPS RKSGKSSPI A   TATSQKVV ENNI+DGKFNTQIESLENPLVSLECFIFL
Subjt:  SSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAATGTATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL

A0A1S3B2L5 uncharacterized protein LOC1034853129.7e-11080.36Show/hide
Query:  MGCCISSGKSFHSANKFHRNSDNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNFPPEEDKAQKPVGNEIEKKLCEIPINGIAQPPSEFYEISHPND
        MGCC+SS +SF+S NKFH +S N +RDPPSSMEEETVKEVLSETPALKPPP+   KN PPEED+  KP+G+E EKKL EIPINGI + PSEFYEISH N 
Subjt:  MGCCISSGKSFHSANKFHRNSDNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNFPPEEDKAQKPVGNEIEKKLCEIPINGIAQPPSEFYEISHPND

Query:  CLSVSA----DQMNGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRDTSPAMARRGLRAEPPRRDPDEN
        C+SVSA    DQ +GGGEV+QT LKSSPVKLTKNQS+S DVELKREI Q+RTLTRRSDQSPVRRNG VGSMRMVHNRD SPAMARRGLRAEPPRRDPDEN
Subjt:  CLSVSA----DQMNGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRDTSPAMARRGLRAEPPRRDPDEN

Query:  SSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAATGTATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL
        SSRR++SP+TA  D AG RSALSRTPS RKSGKSSPIRA   TATSQKVV ENNI+DGKFNTQIESLENPLVSLECFIFL
Subjt:  SSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAATGTATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL

A0A5D3CNI1 Putative BEST plant protein match is: (TAIR:plant.1) protein9.7e-11080.36Show/hide
Query:  MGCCISSGKSFHSANKFHRNSDNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNFPPEEDKAQKPVGNEIEKKLCEIPINGIAQPPSEFYEISHPND
        MGCC+SS +SF+S NKFH +S N +RDPPSSMEEETVKEVLSETPALKPPP+   KN PPEED+  KP+G+E EKKL EIPINGI + PSEFYEISH N 
Subjt:  MGCCISSGKSFHSANKFHRNSDNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNFPPEEDKAQKPVGNEIEKKLCEIPINGIAQPPSEFYEISHPND

Query:  CLSVSA----DQMNGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRDTSPAMARRGLRAEPPRRDPDEN
        C+SVSA    DQ +GGGEV+QT LKSSPVKLTKNQS+S DVELKREI Q+RTLTRRSDQSPVRRNG VGSMRMVHNRD SPAMARRGLRAEPPRRDPDEN
Subjt:  CLSVSA----DQMNGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRDTSPAMARRGLRAEPPRRDPDEN

Query:  SSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAATGTATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL
        SSRR++SP+TA  D AG RSALSRTPS RKSGKSSPIRA   TATSQKVV ENNI+DGKFNTQIESLENPLVSLECFIFL
Subjt:  SSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAATGTATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL

A0A6J1CMA7 uncharacterized protein LOC1110124335.1e-4259.79Show/hide
Query:  MGCCISSGKSFHSANKFHRNSDNG------SRDPPSSMEEETVKEVLSETPALKPPPSPPKKNFPPEEDKAQKPVG-------NEIEKKLCEIPINGIAQ
        MGCC+SSG   +SA+KF RNS         SR+PPSSMEEETVKEVL+ETPALKPP  PP KN PP+ED+A KPV        NEIEKK+  IP N +A+
Subjt:  MGCCISSGKSFHSANKFHRNSDNG------SRDPPSSMEEETVKEVLSETPALKPPPSPPKKNFPPEEDKAQKPVG-------NEIEKKLCEIPINGIAQ

Query:  PPSEFYEISHPNDCLSVS--ADQMNGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRD
           EF EIS P++CLS +   D+M+ G EV+Q V ++SPVKL KNQS S DV  KRE+  NR L RRSDQSPVRRNG VGS R+  NRD
Subjt:  PPSEFYEISHPNDCLSVS--ADQMNGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRD

A0A6J1EF08 uncharacterized protein LOC1114335675.0e-8265.88Show/hide
Query:  MGCCISSGKSFHSANKFHRN---------SDNGSRDPPSSMEEETVKEVLSETPALKP-PPSPPKKNFPPEEDKAQKPVGN----EIEKKLCEIPINGIA
        MGCC+SSGKS  SA+KF            +DNGSR+PPSSMEEETVKEVLSET ALKP   SPP KN PPEED+AQKPVG+    EIEKKL EIPINGI 
Subjt:  MGCCISSGKSFHSANKFHRN---------SDNGSRDPPSSMEEETVKEVLSETPALKP-PPSPPKKNFPPEEDKAQKPVGN----EIEKKLCEIPINGIA

Query:  QPPSEFYEISHPNDCLSVS--ADQMNGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRDTSPAMARRGL
        Q PSEF EIS+P+   + +   D M+GG EV+Q VLK+ P     NQSI  +V LKR++S N+TL RRSDQSPVRRN  VGS R+V  RD SPAM  RGL
Subjt:  QPPSEFYEISHPNDCLSVS--ADQMNGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRDTSPAMARRGL

Query:  RAEPPRRDPDENSSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAAT----GTATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL
        R EP ++DPDEN  RR+RSPATAR D  GSRSAL RTPSVRKSGKSSP+R  T      ATS+KVV ENNI DG   TQIESLENPLVSLECFIFL
Subjt:  RAEPPRRDPDENSSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAAT----GTATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G11125.1 unknown protein4.9e-0529.15Show/hide
Query:  MGCCISSGKSFHSANKFHRNSDNGSRDPPSSMEEET-VKEVLSETPALKPPPSPPKKNFPPEEDKAQKPVGNEIEKK---LCEIPINGIAQPPS--EFYE
        MGCC+SS     +  K    S   +  PPS ++EET VKEVLSET  L    +    N   E+    K +  E EKK   + ++    +   PS  E  +
Subjt:  MGCCISSGKSFHSANKFHRNSDNGSRDPPSSMEEET-VKEVLSETPALKPPPSPPKKNFPPEEDKAQKPVGNEIEKK---LCEIPINGIAQPPS--EFYE

Query:  ISHPNDCLSVSADQMNGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTL----TRRSDQSPVRRNG--TVGSMRMVHNRDTSPAMARRGLRAEP
         S  ++  S+S   ++   + ++  +K     + + +S ++        S+NR +     RR+D SP +RN     GS+R+V +   +            
Subjt:  ISHPNDCLSVSADQMNGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTL----TRRSDQSPVRRNG--TVGSMRMVHNRDTSPAMARRGLRAEP

Query:  PRRDPDENSSRRTRSPATARPDGAGSRSALSRTPSV----RKSGKSSPIRAATGTATSQKVVGENNIID--GKFNT-QIESLENPLVSLECFIFL
          RD  E S RR+RSPA  R    G   +   T S     R+S     +R       S +    +      G +N+   +S ENPLVSLECFIFL
Subjt:  PRRDPDENSSRRTRSPATARPDGAGSRSALSRTPSV----RKSGKSSPIRAATGTATSQKVVGENNIID--GKFNT-QIESLENPLVSLECFIFL

AT1G61170.1 unknown protein2.8e-0831.47Show/hide
Query:  CCISSGKSFHSANKFHRNSDNGSRDPPSSMEEET-VKEVLSETPALKPPPSPPKKNFPPEEDKAQKPVGNEIEK-----KLCEIPI---NGIAQPP--SE
        CC+SSG +        R ++N S    + +EEET VKEVLSET    P  S     F   +D  +  +  + EK     K+   P+    G   P   SE
Subjt:  CCISSGKSFHSANKFHRNSDNGSRDPPSSMEEET-VKEVLSETPALKPPPSPPKKNFPPEEDKAQKPVGNEIEK-----KLCEIPI---NGIAQPP--SE

Query:  FYEISHPNDCLSVSADQMNGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRR-NGTVGSMRMVHNRDTSPAMARRGLRAEPPR
          EI   +   SVS+  +  G +    ++K       + +S     + + +++ N   TRR+DQSP +R NGT                   G R     
Subjt:  FYEISHPNDCLSVSADQMNGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRR-NGTVGSMRMVHNRDTSPAMARRGLRAEPPR

Query:  RDPDENSSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAATGTATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL
        RDP E S RR+RSPAT R     ++S+       RK+ + SP R     A  +  + +    +  + T+ E LENPLVSLECFIFL
Subjt:  RDPDENSSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAATGTATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTGCTGTATTAGTTCCGGCAAATCCTTCCATTCAGCTAACAAATTTCATCGGAATTCTGACAATGGAAGCCGAGACCCGCCGTCTTCCATGGAGGAAGAGACCGT
CAAAGAAGTGCTCTCTGAAACGCCTGCTCTGAAACCGCCGCCGTCGCCGCCGAAAAAGAATTTTCCACCGGAAGAAGACAAAGCCCAAAAACCAGTCGGTAATGAGATCG
AGAAGAAGCTTTGTGAAATTCCCATTAACGGAATTGCACAACCACCTTCTGAATTCTATGAAATTTCCCATCCGAACGACTGTCTCTCAGTCTCCGCCGATCAAATGAAC
GGCGGCGGAGAGGTTAATCAGACGGTTCTGAAATCATCGCCAGTGAAATTGACGAAGAATCAATCAATTTCCCGAGACGTTGAGTTAAAAAGAGAAATATCGCAGAACAG
GACACTGACCCGGAGATCTGACCAGTCGCCAGTCCGACGAAACGGCACCGTTGGGTCGATGAGAATGGTTCATAACAGAGACACGAGTCCGGCAATGGCGCGTCGAGGAT
TGAGAGCGGAGCCTCCCCGGAGAGACCCAGATGAGAATTCCAGCCGGAGAACCCGATCGCCGGCTACCGCTCGTCCCGACGGCGCAGGGTCTAGATCTGCCTTGAGTCGG
ACCCCGTCAGTGAGAAAGTCCGGTAAATCATCGCCCATTAGGGCGGCGACGGGGACGGCGACAAGTCAAAAAGTAGTAGGAGAAAACAATATCATAGATGGAAAATTCAA
CACTCAGATTGAGTCACTTGAGAACCCTCTGGTTTCATTAGAGTGCTTCATCTTCCTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGTTGCTGTATTAGTTCCGGCAAATCCTTCCATTCAGCTAACAAATTTCATCGGAATTCTGACAATGGAAGCCGAGACCCGCCGTCTTCCATGGAGGAAGAGACCGT
CAAAGAAGTGCTCTCTGAAACGCCTGCTCTGAAACCGCCGCCGTCGCCGCCGAAAAAGAATTTTCCACCGGAAGAAGACAAAGCCCAAAAACCAGTCGGTAATGAGATCG
AGAAGAAGCTTTGTGAAATTCCCATTAACGGAATTGCACAACCACCTTCTGAATTCTATGAAATTTCCCATCCGAACGACTGTCTCTCAGTCTCCGCCGATCAAATGAAC
GGCGGCGGAGAGGTTAATCAGACGGTTCTGAAATCATCGCCAGTGAAATTGACGAAGAATCAATCAATTTCCCGAGACGTTGAGTTAAAAAGAGAAATATCGCAGAACAG
GACACTGACCCGGAGATCTGACCAGTCGCCAGTCCGACGAAACGGCACCGTTGGGTCGATGAGAATGGTTCATAACAGAGACACGAGTCCGGCAATGGCGCGTCGAGGAT
TGAGAGCGGAGCCTCCCCGGAGAGACCCAGATGAGAATTCCAGCCGGAGAACCCGATCGCCGGCTACCGCTCGTCCCGACGGCGCAGGGTCTAGATCTGCCTTGAGTCGG
ACCCCGTCAGTGAGAAAGTCCGGTAAATCATCGCCCATTAGGGCGGCGACGGGGACGGCGACAAGTCAAAAAGTAGTAGGAGAAAACAATATCATAGATGGAAAATTCAA
CACTCAGATTGAGTCACTTGAGAACCCTCTGGTTTCATTAGAGTGCTTCATCTTCCTCTGA
Protein sequenceShow/hide protein sequence
MGCCISSGKSFHSANKFHRNSDNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNFPPEEDKAQKPVGNEIEKKLCEIPINGIAQPPSEFYEISHPNDCLSVSADQMN
GGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRDTSPAMARRGLRAEPPRRDPDENSSRRTRSPATARPDGAGSRSALSR
TPSVRKSGKSSPIRAATGTATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL