; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G23360 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G23360
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionSerine/arginine repetitive matrix protein 1
Genome locationClcChr01:34163021..34164310
RNA-Seq ExpressionClc01G23360
SyntenyClc01G23360
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7026590.1 hypothetical protein SDJN02_10592, partial [Cucurbita argyrosperma subsp. argyrosperma]2.3e-8165.88Show/hide
Query:  MGCCISSGKSFHSANKFHRN---------SDNGSRDPPSSMEEETVKEVLSETPALKP-PPSPPKKNFPPEEDKAQKPVGN----EIEKKLCEIPINGIA
        MGCC+SSGKS  SA+KF            +DNGSR+PPSSMEEETVKEVLSET ALKP P SPP K+ PPEED+AQKPVG+    EIEKKL EIPINGI 
Subjt:  MGCCISSGKSFHSANKFHRN---------SDNGSRDPPSSMEEETVKEVLSETPALKP-PPSPPKKNFPPEEDKAQKPVGN----EIEKKLCEIPINGIA

Query:  QPPSEFYEISQPNDCLSVS--ADQMDGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRDTSPAMARRGL
        Q  SEF EIS P+   + +   D MDGG EV+Q VLK+ P     NQSI  +V LKR++S N+TL RRSDQSPVRRN  VGS R+V  RD SPAM  RGL
Subjt:  QPPSEFYEISQPNDCLSVS--ADQMDGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRDTSPAMARRGL

Query:  RAEPPRRDPDENSSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAAT----ATATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL
        R EP ++DPDEN  RR+RSPATAR D  GSRSAL RTPSVRKSGKSSP+R  T    A ATS+KVV ENNI +G   TQIESLENPLVSLECFIFL
Subjt:  RAEPPRRDPDENSSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAAT----ATATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL

XP_008441084.1 PREDICTED: uncharacterized protein LOC103485312 [Cucumis melo]5.8e-10980.36Show/hide
Query:  MGCCISSGKSFHSANKFHRNSDNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNFPPEEDKAQKPVGNEIEKKLCEIPINGIAQPPSEFYEISQPND
        MGCC+SS +SF+S NKFH +S N +RDPPSSMEEETVKEVLSETPALKPPP+   KN PPEED+  KP+G+E EKKL EIPINGI + PSEFYEIS  N 
Subjt:  MGCCISSGKSFHSANKFHRNSDNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNFPPEEDKAQKPVGNEIEKKLCEIPINGIAQPPSEFYEISQPND

Query:  CLSVSA----DQMDGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRDTSPAMARRGLRAEPPRRDPDEN
        C+SVSA    DQ DGGGEV+QT LKSSPVKLTKNQS+S DVELKREI Q+RTLTRRSDQSPVRRNG VGSMRMVHNRD SPAMARRGLRAEPPRRDPDEN
Subjt:  CLSVSA----DQMDGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRDTSPAMARRGLRAEPPRRDPDEN

Query:  SSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAATATATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL
        SSRR++SP+TA  D AG RSALSRTPS RKSGKSSPIRA   TATSQKVV ENNI+DGKFNTQIESLENPLVSLECFIFL
Subjt:  SSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAATATATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL

XP_011658203.1 uncharacterized protein LOC105435961 [Cucumis sativus]6.5e-10880.36Show/hide
Query:  MGCCISSGKSFHSANKFHRNSDNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNFPPEEDKAQKPVGNEIEKKLCEIPINGIAQPPSEFYEISQPND
        MGCC+SS +SF S NKFH NS N SRDPPSSMEEETVKEVLSETPALKP   P K N  PE+D+ +KP+G+EIEKKL EIPINGI + PSEFYEIS  N 
Subjt:  MGCCISSGKSFHSANKFHRNSDNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNFPPEEDKAQKPVGNEIEKKLCEIPINGIAQPPSEFYEISQPND

Query:  CLSVSA----DQMDGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRDTSPAMARRGLRAEPPRRDPDEN
        C+SVSA    DQ DGGGEV+QTVLKSSPVKLTKNQS+S DVELKREI Q+RTLTRRSDQSPVRRNG VGS+RMVHNRD SPAMARRGLRAEPPRRDPDEN
Subjt:  CLSVSA----DQMDGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRDTSPAMARRGLRAEPPRRDPDEN

Query:  SSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAATATATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL
        SSRR+ SP+TAR D AG RSALSRTPS RKSGKSSPI A   TATSQKVV ENNI+DGKFNTQIESLENPLVSLECFIFL
Subjt:  SSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAATATATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL

XP_022926404.1 uncharacterized protein LOC111433567 [Cucurbita moschata]2.1e-8266.55Show/hide
Query:  MGCCISSGKSFHSANKFHRN---------SDNGSRDPPSSMEEETVKEVLSETPALKP-PPSPPKKNFPPEEDKAQKPVGN----EIEKKLCEIPINGIA
        MGCC+SSGKS  SA+KF            +DNGSR+PPSSMEEETVKEVLSET ALKP   SPP KN PPEED+AQKPVG+    EIEKKL EIPINGI 
Subjt:  MGCCISSGKSFHSANKFHRN---------SDNGSRDPPSSMEEETVKEVLSETPALKP-PPSPPKKNFPPEEDKAQKPVGN----EIEKKLCEIPINGIA

Query:  QPPSEFYEISQPNDCLSVS--ADQMDGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRDTSPAMARRGL
        Q PSEF EIS P+   + +   D MDGG EV+Q VLK+ P     NQSI  +V LKR++S N+TL RRSDQSPVRRN  VGS R+V  RD SPAM  RGL
Subjt:  QPPSEFYEISQPNDCLSVS--ADQMDGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRDTSPAMARRGL

Query:  RAEPPRRDPDENSSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAAT----ATATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL
        R EP ++DPDEN  RR+RSPATAR D  GSRSAL RTPSVRKSGKSSP+R  T    A ATS+KVV ENNI DG   TQIESLENPLVSLECFIFL
Subjt:  RAEPPRRDPDENSSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAAT----ATATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL

XP_038882208.1 uncharacterized protein LOC120073430 [Benincasa hispida]2.8e-11985Show/hide
Query:  MGCCISSGKSFHSANKFHRNSDNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNFPPEEDKAQKPVGNEIEKKLCEIPINGIAQPPSEFYEISQPND
        MGCC+SSGKSF+S NKFHRNSDNGSRDPPSSMEEETVKEVLSETP+LKPPPSPPKKN PPEED+  KPVGNEIEKKLCEI INGIA+ PSEFYEIS PN+
Subjt:  MGCCISSGKSFHSANKFHRNSDNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNFPPEEDKAQKPVGNEIEKKLCEIPINGIAQPPSEFYEISQPND

Query:  CLSVS----ADQMDGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRDTSPAMARRGLRAEPPRRDPDEN
        C+SVS     +QMDGGGE++Q VLKSSPVKL K+QSIS D E+KREISQNRTLTRRSDQSPVRRNG +GSMRMVHNRD +PAMARR LRAEPPRRDPDEN
Subjt:  CLSVS----ADQMDGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRDTSPAMARRGLRAEPPRRDPDEN

Query:  SSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAATATATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL
        S RR+RSPATAR DG GSRSALSRTPSVRKSGKSSP RA  ATATSQKVV ENNIIDGKFN+QIESLENPLVSLECFIFL
Subjt:  SSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAATATATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL

TrEMBL top hitse value%identityAlignment
A0A0A0KLE9 Uncharacterized protein3.1e-10880.36Show/hide
Query:  MGCCISSGKSFHSANKFHRNSDNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNFPPEEDKAQKPVGNEIEKKLCEIPINGIAQPPSEFYEISQPND
        MGCC+SS +SF S NKFH NS N SRDPPSSMEEETVKEVLSETPALKP   P K N  PE+D+ +KP+G+EIEKKL EIPINGI + PSEFYEIS  N 
Subjt:  MGCCISSGKSFHSANKFHRNSDNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNFPPEEDKAQKPVGNEIEKKLCEIPINGIAQPPSEFYEISQPND

Query:  CLSVSA----DQMDGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRDTSPAMARRGLRAEPPRRDPDEN
        C+SVSA    DQ DGGGEV+QTVLKSSPVKLTKNQS+S DVELKREI Q+RTLTRRSDQSPVRRNG VGS+RMVHNRD SPAMARRGLRAEPPRRDPDEN
Subjt:  CLSVSA----DQMDGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRDTSPAMARRGLRAEPPRRDPDEN

Query:  SSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAATATATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL
        SSRR+ SP+TAR D AG RSALSRTPS RKSGKSSPI A   TATSQKVV ENNI+DGKFNTQIESLENPLVSLECFIFL
Subjt:  SSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAATATATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL

A0A1S3B2L5 uncharacterized protein LOC1034853122.8e-10980.36Show/hide
Query:  MGCCISSGKSFHSANKFHRNSDNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNFPPEEDKAQKPVGNEIEKKLCEIPINGIAQPPSEFYEISQPND
        MGCC+SS +SF+S NKFH +S N +RDPPSSMEEETVKEVLSETPALKPPP+   KN PPEED+  KP+G+E EKKL EIPINGI + PSEFYEIS  N 
Subjt:  MGCCISSGKSFHSANKFHRNSDNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNFPPEEDKAQKPVGNEIEKKLCEIPINGIAQPPSEFYEISQPND

Query:  CLSVSA----DQMDGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRDTSPAMARRGLRAEPPRRDPDEN
        C+SVSA    DQ DGGGEV+QT LKSSPVKLTKNQS+S DVELKREI Q+RTLTRRSDQSPVRRNG VGSMRMVHNRD SPAMARRGLRAEPPRRDPDEN
Subjt:  CLSVSA----DQMDGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRDTSPAMARRGLRAEPPRRDPDEN

Query:  SSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAATATATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL
        SSRR++SP+TA  D AG RSALSRTPS RKSGKSSPIRA   TATSQKVV ENNI+DGKFNTQIESLENPLVSLECFIFL
Subjt:  SSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAATATATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL

A0A5D3CNI1 Putative BEST plant protein match is: (TAIR:plant.1) protein2.8e-10980.36Show/hide
Query:  MGCCISSGKSFHSANKFHRNSDNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNFPPEEDKAQKPVGNEIEKKLCEIPINGIAQPPSEFYEISQPND
        MGCC+SS +SF+S NKFH +S N +RDPPSSMEEETVKEVLSETPALKPPP+   KN PPEED+  KP+G+E EKKL EIPINGI + PSEFYEIS  N 
Subjt:  MGCCISSGKSFHSANKFHRNSDNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNFPPEEDKAQKPVGNEIEKKLCEIPINGIAQPPSEFYEISQPND

Query:  CLSVSA----DQMDGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRDTSPAMARRGLRAEPPRRDPDEN
        C+SVSA    DQ DGGGEV+QT LKSSPVKLTKNQS+S DVELKREI Q+RTLTRRSDQSPVRRNG VGSMRMVHNRD SPAMARRGLRAEPPRRDPDEN
Subjt:  CLSVSA----DQMDGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRDTSPAMARRGLRAEPPRRDPDEN

Query:  SSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAATATATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL
        SSRR++SP+TA  D AG RSALSRTPS RKSGKSSPIRA   TATSQKVV ENNI+DGKFNTQIESLENPLVSLECFIFL
Subjt:  SSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAATATATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL

A0A6J1CMA7 uncharacterized protein LOC1110124331.0e-4260.32Show/hide
Query:  MGCCISSGKSFHSANKFHRNSDNG------SRDPPSSMEEETVKEVLSETPALKPPPSPPKKNFPPEEDKAQKPVG-------NEIEKKLCEIPINGIAQ
        MGCC+SSG   +SA+KF RNS         SR+PPSSMEEETVKEVL+ETPALKPP  PP KN PP+ED+A KPV        NEIEKK+  IP N +A+
Subjt:  MGCCISSGKSFHSANKFHRNSDNG------SRDPPSSMEEETVKEVLSETPALKPPPSPPKKNFPPEEDKAQKPVG-------NEIEKKLCEIPINGIAQ

Query:  PPSEFYEISQPNDCLSVS--ADQMDGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRD
           EF EIS P++CLS +   D+MD G EV+Q V ++SPVKL KNQS S DV  KRE+  NR L RRSDQSPVRRNG VGS R+  NRD
Subjt:  PPSEFYEISQPNDCLSVS--ADQMDGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRD

A0A6J1EF08 uncharacterized protein LOC1114335671.0e-8266.55Show/hide
Query:  MGCCISSGKSFHSANKFHRN---------SDNGSRDPPSSMEEETVKEVLSETPALKP-PPSPPKKNFPPEEDKAQKPVGN----EIEKKLCEIPINGIA
        MGCC+SSGKS  SA+KF            +DNGSR+PPSSMEEETVKEVLSET ALKP   SPP KN PPEED+AQKPVG+    EIEKKL EIPINGI 
Subjt:  MGCCISSGKSFHSANKFHRN---------SDNGSRDPPSSMEEETVKEVLSETPALKP-PPSPPKKNFPPEEDKAQKPVGN----EIEKKLCEIPINGIA

Query:  QPPSEFYEISQPNDCLSVS--ADQMDGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRDTSPAMARRGL
        Q PSEF EIS P+   + +   D MDGG EV+Q VLK+ P     NQSI  +V LKR++S N+TL RRSDQSPVRRN  VGS R+V  RD SPAM  RGL
Subjt:  QPPSEFYEISQPNDCLSVS--ADQMDGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRDTSPAMARRGL

Query:  RAEPPRRDPDENSSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAAT----ATATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL
        R EP ++DPDEN  RR+RSPATAR D  GSRSAL RTPSVRKSGKSSP+R  T    A ATS+KVV ENNI DG   TQIESLENPLVSLECFIFL
Subjt:  RAEPPRRDPDENSSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAAT----ATATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G11125.1 unknown protein4.9e-0529.43Show/hide
Query:  MGCCISSGKSFHSANKFHRNSDNGSRDPPSSMEEET-VKEVLSETPALKPPPSPPKKNFPPEEDKAQKPVGNEIEKKLCEIPINGIAQPPSEFYEISQPN
        MGCC+SS     +  K    S   +  PPS ++EET VKEVLSET  L    +    N   E+    K +  E EKK            P    +++Q  
Subjt:  MGCCISSGKSFHSANKFHRNSDNGSRDPPSSMEEET-VKEVLSETPALKPPPSPPKKNFPPEEDKAQKPVGNEIEKKLCEIPINGIAQPPSEFYEISQPN

Query:  DCLSVSADQMDGGGEVNQTVLKSSPVKLTKNQSISRDV-ELKREI--------SQNRTL----TRRSDQSPVRRNG--TVGSMRMVHNRDTSPAMARRGL
             S  + + G EV++    S  +    N+    +V ++K  +        S+NR +     RR+D SP +RN     GS+R+V +   +        
Subjt:  DCLSVSADQMDGGGEVNQTVLKSSPVKLTKNQSISRDV-ELKREI--------SQNRTL----TRRSDQSPVRRNG--TVGSMRMVHNRDTSPAMARRGL

Query:  RAEPPRRDPDENSSRRTRSPATARPDGAGSRSALSRTPSV----RKSGKSSPIRAATATATSQKVVGENNIID--GKFNT-QIESLENPLVSLECFIFL
              RD  E S RR+RSPA  R    G   +   T S     R+S     +R       S +    +      G +N+   +S ENPLVSLECFIFL
Subjt:  RAEPPRRDPDENSSRRTRSPATARPDGAGSRSALSRTPSV----RKSGKSSPIRAATATATSQKVVGENNIID--GKFNT-QIESLENPLVSLECFIFL

AT1G61170.1 unknown protein6.2e-0831.47Show/hide
Query:  CCISSGKSFHSANKFHRNSDNGSRDPPSSMEEET-VKEVLSETPALKPPPSPPKKNFPPEEDKAQKPVGNEIEK-----KLCEIPI---NGIAQPP--SE
        CC+SSG +        R ++N S    + +EEET VKEVLSET    P  S     F   +D  +  +  + EK     K+   P+    G   P   SE
Subjt:  CCISSGKSFHSANKFHRNSDNGSRDPPSSMEEET-VKEVLSETPALKPPPSPPKKNFPPEEDKAQKPVGNEIEK-----KLCEIPI---NGIAQPP--SE

Query:  FYEISQPNDCLSVSADQMDGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRR-NGTVGSMRMVHNRDTSPAMARRGLRAEPPR
          EI   +   SVS+  +  G +    ++K       + +S     + + +++ N   TRR+DQSP +R NGT                   G R     
Subjt:  FYEISQPNDCLSVSADQMDGGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRR-NGTVGSMRMVHNRDTSPAMARRGLRAEPPR

Query:  RDPDENSSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAATATATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL
        RDP E S RR+RSPAT R     ++S+       RK+ + SP R     A  +  + +    +  + T+ E LENPLVSLECFIFL
Subjt:  RDPDENSSRRTRSPATARPDGAGSRSALSRTPSVRKSGKSSPIRAATATATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTGCTGTATTAGTTCCGGCAAATCCTTCCATTCAGCTAACAAATTTCATCGGAATTCTGACAATGGAAGCCGAGACCCGCCGTCTTCCATGGAGGAAGAGACCGT
CAAAGAAGTGCTCTCTGAAACGCCTGCTCTGAAACCGCCGCCGTCGCCGCCGAAAAAGAATTTTCCACCGGAAGAAGACAAAGCCCAAAAACCAGTCGGTAATGAGATCG
AGAAGAAGCTTTGTGAAATTCCCATTAACGGAATTGCACAACCACCTTCTGAATTCTATGAAATTTCCCAACCGAACGACTGTCTCTCAGTCTCCGCCGATCAAATGGAC
GGCGGCGGAGAGGTTAATCAGACGGTTCTGAAATCATCGCCAGTGAAATTGACGAAGAATCAATCAATTTCCCGAGACGTTGAGTTAAAAAGAGAAATATCGCAGAACAG
GACACTGACCCGGAGATCTGACCAGTCGCCAGTCCGACGAAACGGCACCGTTGGGTCGATGAGAATGGTTCATAACAGAGACACGAGTCCGGCAATGGCGCGTCGAGGAT
TGAGAGCGGAGCCTCCCCGGAGAGACCCAGATGAGAATTCCAGCCGGAGAACCCGATCGCCGGCTACCGCTCGTCCCGACGGCGCAGGGTCTAGATCTGCCTTGAGTCGG
ACCCCGTCAGTGAGAAAGTCCGGTAAATCATCGCCCATTAGGGCGGCGACGGCGACGGCGACAAGTCAAAAAGTAGTAGGAGAAAACAATATCATAGATGGAAAATTCAA
CACTCAGATTGAGTCACTTGAGAACCCTCTGGTTTCATTAGAGTGCTTCATCTTCCTCTGA
mRNA sequenceShow/hide mRNA sequence
GTACAATCCGCCGTAACTGATTCTGCCTTTAAAATTAAATTCCATTGACCTCTTACTCTCAATTCCCAAATCTTCCACCTCCCCTGAAAATTACAATGTTCCCCGCCGCC
CACATCTTCTGAAGGCCGCCGGCACACATGGGTTGCTGTATTAGTTCCGGCAAATCCTTCCATTCAGCTAACAAATTTCATCGGAATTCTGACAATGGAAGCCGAGACCC
GCCGTCTTCCATGGAGGAAGAGACCGTCAAAGAAGTGCTCTCTGAAACGCCTGCTCTGAAACCGCCGCCGTCGCCGCCGAAAAAGAATTTTCCACCGGAAGAAGACAAAG
CCCAAAAACCAGTCGGTAATGAGATCGAGAAGAAGCTTTGTGAAATTCCCATTAACGGAATTGCACAACCACCTTCTGAATTCTATGAAATTTCCCAACCGAACGACTGT
CTCTCAGTCTCCGCCGATCAAATGGACGGCGGCGGAGAGGTTAATCAGACGGTTCTGAAATCATCGCCAGTGAAATTGACGAAGAATCAATCAATTTCCCGAGACGTTGA
GTTAAAAAGAGAAATATCGCAGAACAGGACACTGACCCGGAGATCTGACCAGTCGCCAGTCCGACGAAACGGCACCGTTGGGTCGATGAGAATGGTTCATAACAGAGACA
CGAGTCCGGCAATGGCGCGTCGAGGATTGAGAGCGGAGCCTCCCCGGAGAGACCCAGATGAGAATTCCAGCCGGAGAACCCGATCGCCGGCTACCGCTCGTCCCGACGGC
GCAGGGTCTAGATCTGCCTTGAGTCGGACCCCGTCAGTGAGAAAGTCCGGTAAATCATCGCCCATTAGGGCGGCGACGGCGACGGCGACAAGTCAAAAAGTAGTAGGAGA
AAACAATATCATAGATGGAAAATTCAACACTCAGATTGAGTCACTTGAGAACCCTCTGGTTTCATTAGAGTGCTTCATCTTCCTCTGATTTGTGTCGGTTTCACTGATTT
TCATTTTTCTTGTTTGATTTTGTTTTAAGTTTTGAGTGAACAGTGATGTAAATTTAACCATTACGGCGGTGGGTCGCCGGAGAATTCATTGAAGGTCACGAGTGAGGTTT
GTTGGAAATTGAAGCACACAATTAATTTTTAAATTAGAAGTAAATTAACGTAATTAGATTATTGTTAATGATGTGATTTGAATGAGGTAAATGAGGAAGGCGACGTAAAA
GAGAAGACAAAATCCGCCGCATAAATCACTGTTCATCACGTGATTTAAGAGGGGGAATAGGCTGTAAAATATGCTAATCA
Protein sequenceShow/hide protein sequence
MGCCISSGKSFHSANKFHRNSDNGSRDPPSSMEEETVKEVLSETPALKPPPSPPKKNFPPEEDKAQKPVGNEIEKKLCEIPINGIAQPPSEFYEISQPNDCLSVSADQMD
GGGEVNQTVLKSSPVKLTKNQSISRDVELKREISQNRTLTRRSDQSPVRRNGTVGSMRMVHNRDTSPAMARRGLRAEPPRRDPDENSSRRTRSPATARPDGAGSRSALSR
TPSVRKSGKSSPIRAATATATSQKVVGENNIIDGKFNTQIESLENPLVSLECFIFL