; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi01G003400 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi01G003400
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionSerine/arginine repetitive matrix protein 1
Genome locationchr01:2979718..2980554
RNA-Seq ExpressionLsi01G003400
SyntenyLsi01G003400
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7026590.1 hypothetical protein SDJN02_10592, partial [Cucurbita argyrosperma subsp. argyrosperma]2.2e-8766.78Show/hide
Query:  MGCCISSGKSFNSANKFDRN---------SNNGCRDPPSSMEEETVKEVLSETPALKP-PPSPPKKSFPLEEDEAQKPVGN----EIEKKLREIPVNGIA
        MGCC+SSGKS +SA+KFD           ++NG R+PPSSMEEETVKEVLSET ALKP P SPP KS P EEDEAQKPVG+    EIEKKL EIP+NGI 
Subjt:  MGCCISSGKSFNSANKFDRN---------SNNGCRDPPSSMEEETVKEVLSETPALKP-PPSPPKKSFPLEEDEAQKPVGN----EIEKKLREIPVNGIA

Query:  EQPSEFNEISHPNDCLPVSAAIFTDQMDGGGEVHQTVLKSSPVKLPKNQAISGDVELKREISQNRTLTRRSDQSPARRNGAVGSMRMVHNRDMSPAMARR
        +Q SEF+EIS+P+     + A FTD MDGG EVHQ VLK+    LP NQ+I G+V LKR++S N+TL RRSDQSP RRN  VGS R+V  RD SPAM  R
Subjt:  EQPSEFNEISHPNDCLPVSAAIFTDQMDGGGEVHQTVLKSSPVKLPKNQAISGDVELKREISQNRTLTRRSDQSPARRNGAVGSMRMVHNRDMSPAMARR

Query:  GLRAEPPRRDPDENFSRRSRSPATARSDGAGSRSALSRTPSVRKTGKSSPIRA------AMATSQKVVEENNIIDGKFNTQIESLENPLVSLECFIFL
        GLR EP ++DPDENF RRSRSPATAR D  GSRSAL RTPSVRK+GKSSP+R       A ATS+KVVEENNI +G   TQIESLENPLVSLECFIFL
Subjt:  GLRAEPPRRDPDENFSRRSRSPATARSDGAGSRSALSRTPSVRKTGKSSPIRA------AMATSQKVVEENNIIDGKFNTQIESLENPLVSLECFIFL

XP_008441084.1 PREDICTED: uncharacterized protein LOC103485312 [Cucumis melo]4.6e-11481.65Show/hide
Query:  MGCCISSGKSFNSANKFDRNSNNGCRDPPSSMEEETVKEVLSETPALKPPPSPPKKSFPLEEDEAQKPVGNEIEKKLREIPVNGIAEQPSEFNEISHPND
        MGCC+SS +SFNS NKF  +S N  RDPPSSMEEETVKEVLSETPALKPPP+   K+ P EEDE  KP+G+E EKKL EIP+NGI EQPSEF EISH N 
Subjt:  MGCCISSGKSFNSANKFDRNSNNGCRDPPSSMEEETVKEVLSETPALKPPPSPPKKSFPLEEDEAQKPVGNEIEKKLREIPVNGIAEQPSEFNEISHPND

Query:  CLPVSAAIFTDQMDGGGEVHQTVLKSSPVKLPKNQAISGDVELKREISQNRTLTRRSDQSPARRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDEN
        C+ VSAA FTDQ DGGGEVHQT LKSSPVKL KNQ++S DVELKREI Q+RTLTRRSDQSP RRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDEN
Subjt:  CLPVSAAIFTDQMDGGGEVHQTVLKSSPVKLPKNQAISGDVELKREISQNRTLTRRSDQSPARRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDEN

Query:  FSRRSRSPATARSDGAGSRSALSRTPSVRKTGKSSPIRAAMATSQKVVEENNIIDGKFNTQIESLENPLVSLECFIFL
         SRRS+SP+TA SD AG RSALSRTPS RK+GKSSPIRA  ATSQKVVEENNI+DGKFNTQIESLENPLVSLECFIFL
Subjt:  FSRRSRSPATARSDGAGSRSALSRTPSVRKTGKSSPIRAAMATSQKVVEENNIIDGKFNTQIESLENPLVSLECFIFL

XP_011658203.1 uncharacterized protein LOC105435961 [Cucumis sativus]1.5e-11280.94Show/hide
Query:  MGCCISSGKSFNSANKFDRNSNNGCRDPPSSMEEETVKEVLSETPALKPPPSPPKKSFPLEEDEAQKPVGNEIEKKLREIPVNGIAEQPSEFNEISHPND
        MGCC+SS +SF+S NKF  NS N  RDPPSSMEEETVKEVLSETPALKP   P K +   E+DE +KP+G+EIEKKL EIP+NGI EQPSEF EISH N 
Subjt:  MGCCISSGKSFNSANKFDRNSNNGCRDPPSSMEEETVKEVLSETPALKPPPSPPKKSFPLEEDEAQKPVGNEIEKKLREIPVNGIAEQPSEFNEISHPND

Query:  CLPVSAAIFTDQMDGGGEVHQTVLKSSPVKLPKNQAISGDVELKREISQNRTLTRRSDQSPARRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDEN
        C+ VSAA FTDQ DGGGEVHQTVLKSSPVKL KNQ++S DVELKREI Q+RTLTRRSDQSP RRNGAVGS+RMVHNRDMSPAMARRGLRAEPPRRDPDEN
Subjt:  CLPVSAAIFTDQMDGGGEVHQTVLKSSPVKLPKNQAISGDVELKREISQNRTLTRRSDQSPARRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDEN

Query:  FSRRSRSPATARSDGAGSRSALSRTPSVRKTGKSSPIRAAMATSQKVVEENNIIDGKFNTQIESLENPLVSLECFIFL
         SRRS SP+TARSD AG RSALSRTPS RK+GKSSPI A  ATSQKVVEENNI+DGKFNTQIESLENPLVSLECFIFL
Subjt:  FSRRSRSPATARSDGAGSRSALSRTPSVRKTGKSSPIRAAMATSQKVVEENNIIDGKFNTQIESLENPLVSLECFIFL

XP_022926404.1 uncharacterized protein LOC111433567 [Cucurbita moschata]1.7e-8766.78Show/hide
Query:  MGCCISSGKSFNSANKFDRN---------SNNGCRDPPSSMEEETVKEVLSETPALKP-PPSPPKKSFPLEEDEAQKPVGN----EIEKKLREIPVNGIA
        MGCC+SSGKS +SA+KFD           ++NG R+PPSSMEEETVKEVLSET ALKP   SPP K+ P EEDEAQKPVG+    EIEKKL EIP+NGI 
Subjt:  MGCCISSGKSFNSANKFDRN---------SNNGCRDPPSSMEEETVKEVLSETPALKP-PPSPPKKSFPLEEDEAQKPVGN----EIEKKLREIPVNGIA

Query:  EQPSEFNEISHPNDCLPVSAAIFTDQMDGGGEVHQTVLKSSPVKLPKNQAISGDVELKREISQNRTLTRRSDQSPARRNGAVGSMRMVHNRDMSPAMARR
        +QPSEF+EIS+P+     + A FTD MDGG EVHQ VLK+    LP NQ+I G+V LKR++S N+TL RRSDQSP RRN  VGS R+V  RD SPAM  R
Subjt:  EQPSEFNEISHPNDCLPVSAAIFTDQMDGGGEVHQTVLKSSPVKLPKNQAISGDVELKREISQNRTLTRRSDQSPARRNGAVGSMRMVHNRDMSPAMARR

Query:  GLRAEPPRRDPDENFSRRSRSPATARSDGAGSRSALSRTPSVRKTGKSSPIRA------AMATSQKVVEENNIIDGKFNTQIESLENPLVSLECFIFL
        GLR EP ++DPDENF RRSRSPATAR D  GSRSAL RTPSVRK+GKSSP+R       A ATS+KVVEENNI DG   TQIESLENPLVSLECFIFL
Subjt:  GLRAEPPRRDPDENFSRRSRSPATARSDGAGSRSALSRTPSVRKTGKSSPIRA------AMATSQKVVEENNIIDGKFNTQIESLENPLVSLECFIFL

XP_038882208.1 uncharacterized protein LOC120073430 [Benincasa hispida]1.4e-12385.61Show/hide
Query:  MGCCISSGKSFNSANKFDRNSNNGCRDPPSSMEEETVKEVLSETPALKPPPSPPKKSFPLEEDEAQKPVGNEIEKKLREIPVNGIAEQPSEFNEISHPND
        MGCC+SSGKSFNS NKF RNS+NG RDPPSSMEEETVKEVLSETP+LKPPPSPPKK+ P EED+  KPVGNEIEKKL EI +NGIAE PSEF EISHPN+
Subjt:  MGCCISSGKSFNSANKFDRNSNNGCRDPPSSMEEETVKEVLSETPALKPPPSPPKKSFPLEEDEAQKPVGNEIEKKLREIPVNGIAEQPSEFNEISHPND

Query:  CLPVSAAIFTDQMDGGGEVHQTVLKSSPVKLPKNQAISGDVELKREISQNRTLTRRSDQSPARRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDEN
        C+ VS AI T+QMDGGGE+HQ VLKSSPVKLPK+Q+ISGD E+KREISQNRTLTRRSDQSP RRNGA+GSMRMVHNRDM+PAMARR LRAEPPRRDPDEN
Subjt:  CLPVSAAIFTDQMDGGGEVHQTVLKSSPVKLPKNQAISGDVELKREISQNRTLTRRSDQSPARRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDEN

Query:  FSRRSRSPATARSDGAGSRSALSRTPSVRKTGKSSPIRAAMATSQKVVEENNIIDGKFNTQIESLENPLVSLECFIFL
          RRSRSPATARSDG GSRSALSRTPSVRK+GKSSP RAA ATSQKVVEENNIIDGKFN+QIESLENPLVSLECFIFL
Subjt:  FSRRSRSPATARSDGAGSRSALSRTPSVRKTGKSSPIRAAMATSQKVVEENNIIDGKFNTQIESLENPLVSLECFIFL

TrEMBL top hitse value%identityAlignment
A0A0A0KLE9 Uncharacterized protein7.2e-11380.94Show/hide
Query:  MGCCISSGKSFNSANKFDRNSNNGCRDPPSSMEEETVKEVLSETPALKPPPSPPKKSFPLEEDEAQKPVGNEIEKKLREIPVNGIAEQPSEFNEISHPND
        MGCC+SS +SF+S NKF  NS N  RDPPSSMEEETVKEVLSETPALKP   P K +   E+DE +KP+G+EIEKKL EIP+NGI EQPSEF EISH N 
Subjt:  MGCCISSGKSFNSANKFDRNSNNGCRDPPSSMEEETVKEVLSETPALKPPPSPPKKSFPLEEDEAQKPVGNEIEKKLREIPVNGIAEQPSEFNEISHPND

Query:  CLPVSAAIFTDQMDGGGEVHQTVLKSSPVKLPKNQAISGDVELKREISQNRTLTRRSDQSPARRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDEN
        C+ VSAA FTDQ DGGGEVHQTVLKSSPVKL KNQ++S DVELKREI Q+RTLTRRSDQSP RRNGAVGS+RMVHNRDMSPAMARRGLRAEPPRRDPDEN
Subjt:  CLPVSAAIFTDQMDGGGEVHQTVLKSSPVKLPKNQAISGDVELKREISQNRTLTRRSDQSPARRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDEN

Query:  FSRRSRSPATARSDGAGSRSALSRTPSVRKTGKSSPIRAAMATSQKVVEENNIIDGKFNTQIESLENPLVSLECFIFL
         SRRS SP+TARSD AG RSALSRTPS RK+GKSSPI A  ATSQKVVEENNI+DGKFNTQIESLENPLVSLECFIFL
Subjt:  FSRRSRSPATARSDGAGSRSALSRTPSVRKTGKSSPIRAAMATSQKVVEENNIIDGKFNTQIESLENPLVSLECFIFL

A0A1S3B2L5 uncharacterized protein LOC1034853122.3e-11481.65Show/hide
Query:  MGCCISSGKSFNSANKFDRNSNNGCRDPPSSMEEETVKEVLSETPALKPPPSPPKKSFPLEEDEAQKPVGNEIEKKLREIPVNGIAEQPSEFNEISHPND
        MGCC+SS +SFNS NKF  +S N  RDPPSSMEEETVKEVLSETPALKPPP+   K+ P EEDE  KP+G+E EKKL EIP+NGI EQPSEF EISH N 
Subjt:  MGCCISSGKSFNSANKFDRNSNNGCRDPPSSMEEETVKEVLSETPALKPPPSPPKKSFPLEEDEAQKPVGNEIEKKLREIPVNGIAEQPSEFNEISHPND

Query:  CLPVSAAIFTDQMDGGGEVHQTVLKSSPVKLPKNQAISGDVELKREISQNRTLTRRSDQSPARRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDEN
        C+ VSAA FTDQ DGGGEVHQT LKSSPVKL KNQ++S DVELKREI Q+RTLTRRSDQSP RRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDEN
Subjt:  CLPVSAAIFTDQMDGGGEVHQTVLKSSPVKLPKNQAISGDVELKREISQNRTLTRRSDQSPARRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDEN

Query:  FSRRSRSPATARSDGAGSRSALSRTPSVRKTGKSSPIRAAMATSQKVVEENNIIDGKFNTQIESLENPLVSLECFIFL
         SRRS+SP+TA SD AG RSALSRTPS RK+GKSSPIRA  ATSQKVVEENNI+DGKFNTQIESLENPLVSLECFIFL
Subjt:  FSRRSRSPATARSDGAGSRSALSRTPSVRKTGKSSPIRAAMATSQKVVEENNIIDGKFNTQIESLENPLVSLECFIFL

A0A5D3CNI1 Putative BEST plant protein match is: (TAIR:plant.1) protein2.3e-11481.65Show/hide
Query:  MGCCISSGKSFNSANKFDRNSNNGCRDPPSSMEEETVKEVLSETPALKPPPSPPKKSFPLEEDEAQKPVGNEIEKKLREIPVNGIAEQPSEFNEISHPND
        MGCC+SS +SFNS NKF  +S N  RDPPSSMEEETVKEVLSETPALKPPP+   K+ P EEDE  KP+G+E EKKL EIP+NGI EQPSEF EISH N 
Subjt:  MGCCISSGKSFNSANKFDRNSNNGCRDPPSSMEEETVKEVLSETPALKPPPSPPKKSFPLEEDEAQKPVGNEIEKKLREIPVNGIAEQPSEFNEISHPND

Query:  CLPVSAAIFTDQMDGGGEVHQTVLKSSPVKLPKNQAISGDVELKREISQNRTLTRRSDQSPARRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDEN
        C+ VSAA FTDQ DGGGEVHQT LKSSPVKL KNQ++S DVELKREI Q+RTLTRRSDQSP RRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDEN
Subjt:  CLPVSAAIFTDQMDGGGEVHQTVLKSSPVKLPKNQAISGDVELKREISQNRTLTRRSDQSPARRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDEN

Query:  FSRRSRSPATARSDGAGSRSALSRTPSVRKTGKSSPIRAAMATSQKVVEENNIIDGKFNTQIESLENPLVSLECFIFL
         SRRS+SP+TA SD AG RSALSRTPS RK+GKSSPIRA  ATSQKVVEENNI+DGKFNTQIESLENPLVSLECFIFL
Subjt:  FSRRSRSPATARSDGAGSRSALSRTPSVRKTGKSSPIRAAMATSQKVVEENNIIDGKFNTQIESLENPLVSLECFIFL

A0A6J1CMA7 uncharacterized protein LOC1110124338.7e-5063.21Show/hide
Query:  MGCCISSGKSFNSANKFDRNSNNG------CRDPPSSMEEETVKEVLSETPALKPPPSPPKKSFPLEEDEAQKPVG-------NEIEKKLREIPVNGIAE
        MGCC+SSG   NSA+KFDRNS          R+PPSSMEEETVKEVL+ETPALKPP  PP K+ P +EDEA KPV        NEIEKK+R IP N +AE
Subjt:  MGCCISSGKSFNSANKFDRNSNNG------CRDPPSSMEEETVKEVLSETPALKPPPSPPKKSFPLEEDEAQKPVG-------NEIEKKLREIPVNGIAE

Query:  QPSEFNEISHPNDCLPVSAAIFTDQMDGGGEVHQTVLKSSPVKLPKNQAISGDVELKREISQNRTLTRRSDQSPARRNGAVGSMRMVHNRDMS
           EF+EIS P++CL  SAA FTD+MD G EVHQ V ++SPVKLPKNQ+ SGDV  KRE+  NR L RRSDQSP RRNG VGS R+  NRDM+
Subjt:  QPSEFNEISHPNDCLPVSAAIFTDQMDGGGEVHQTVLKSSPVKLPKNQAISGDVELKREISQNRTLTRRSDQSPARRNGAVGSMRMVHNRDMS

A0A6J1EF08 uncharacterized protein LOC1114335678.1e-8866.78Show/hide
Query:  MGCCISSGKSFNSANKFDRN---------SNNGCRDPPSSMEEETVKEVLSETPALKP-PPSPPKKSFPLEEDEAQKPVGN----EIEKKLREIPVNGIA
        MGCC+SSGKS +SA+KFD           ++NG R+PPSSMEEETVKEVLSET ALKP   SPP K+ P EEDEAQKPVG+    EIEKKL EIP+NGI 
Subjt:  MGCCISSGKSFNSANKFDRN---------SNNGCRDPPSSMEEETVKEVLSETPALKP-PPSPPKKSFPLEEDEAQKPVGN----EIEKKLREIPVNGIA

Query:  EQPSEFNEISHPNDCLPVSAAIFTDQMDGGGEVHQTVLKSSPVKLPKNQAISGDVELKREISQNRTLTRRSDQSPARRNGAVGSMRMVHNRDMSPAMARR
        +QPSEF+EIS+P+     + A FTD MDGG EVHQ VLK+    LP NQ+I G+V LKR++S N+TL RRSDQSP RRN  VGS R+V  RD SPAM  R
Subjt:  EQPSEFNEISHPNDCLPVSAAIFTDQMDGGGEVHQTVLKSSPVKLPKNQAISGDVELKREISQNRTLTRRSDQSPARRNGAVGSMRMVHNRDMSPAMARR

Query:  GLRAEPPRRDPDENFSRRSRSPATARSDGAGSRSALSRTPSVRKTGKSSPIRA------AMATSQKVVEENNIIDGKFNTQIESLENPLVSLECFIFL
        GLR EP ++DPDENF RRSRSPATAR D  GSRSAL RTPSVRK+GKSSP+R       A ATS+KVVEENNI DG   TQIESLENPLVSLECFIFL
Subjt:  GLRAEPPRRDPDENFSRRSRSPATARSDGAGSRSALSRTPSVRKTGKSSPIRA------AMATSQKVVEENNIIDGKFNTQIESLENPLVSLECFIFL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G11125.1 unknown protein3.1e-0728.96Show/hide
Query:  MGCCISSGKSFNSANKFDRNSNNGCRDPPSSMEEET-VKEVLSETPAL---KPPPSPPKKSFPLEEDEAQKPVGNEIEKKLREIPVNGIAEQPSEFNEIS
        MGCC+SS     +  K D  S      PPS ++EET VKEVLSET  L       +  K +    ++E +K  G  ++     +       +P + +E+S
Subjt:  MGCCISSGKSFNSANKFDRNSNNGCRDPPSSMEEET-VKEVLSETPAL---KPPPSPPKKSFPLEEDEAQKPVGNEIEKKLREIPVNGIAEQPSEFNEIS

Query:  HPNDCLPVSAAIFTDQMDGGGEVHQTVLKSSPVKLPKNQAISGDVELKREISQNRTL----TRRSDQSPARRNG--AVGSMRMVHNRDMSPAMARRGLRA
          N  L  S     ++ D              VK  K+  +      K   S+NR +     RR+D SP +RN     GS+R+V +   +          
Subjt:  HPNDCLPVSAAIFTDQMDGGGEVHQTVLKSSPVKLPKNQAISGDVELKREISQNRTL----TRRSDQSPARRNG--AVGSMRMVHNRDMSPAMARRGLRA

Query:  EPPRRDPDENFSRRSRSPATARSDGAGSRSALSRTPS--VRKTGKSSPIRAAMATSQKVVEEN-------NIIDGKFNTQIESLENPLVSLECFIFL
            RD  E   RRSRSPA  RS   G   +   T S    +    SP R  +  +    ++          I    +   +S ENPLVSLECFIFL
Subjt:  EPPRRDPDENFSRRSRSPATARSDGAGSRSALSRTPS--VRKTGKSSPIRAAMATSQKVVEEN-------NIIDGKFNTQIESLENPLVSLECFIFL

AT1G61170.1 unknown protein8.4e-1330.39Show/hide
Query:  CCISSGKSFNSANKFDRNSNNGCRDPPSSMEEET-VKEVLSETPALKPPPSPPKKSF------PLEEDEAQKPVGNEIEKKLREIPVNGIAEQPSEFNEI
        CC+SSG +       DR + N      + +EEET VKEVLSET    P  S    +F       + EDE +KP    ++     + +   +  P E +E+
Subjt:  CCISSGKSFNSANKFDRNSNNGCRDPPSSMEEET-VKEVLSETPALKPPPSPPKKSF------PLEEDEAQKPVGNEIEKKLREIPVNGIAEQPSEFNEI

Query:  SHPNDCLPVSAAIFTDQMDGGGEVHQTVLKSSPVKLPKNQAISGDVELKREISQNRTLTRRSDQSPARRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRR
        S         +   T  M+G  E H        V + + ++     + + +++ N   TRR+DQSP +RN    +                G R     R
Subjt:  SHPNDCLPVSAAIFTDQMDGGGEVHQTVLKSSPVKLPKNQAISGDVELKREISQNRTLTRRSDQSPARRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRR

Query:  DPDENFSRRSRSPATARSDGAGSRSALSRTPSVRKTGKSSPIRAAMATSQKVVEENNIIDGKFNTQIESLENPLVSLECFIFL
        DP E   RRSRSPAT RS    ++S+       RK  + SP R  +  ++  +++    +  + T+ E LENPLVSLECFIFL
Subjt:  DPDENFSRRSRSPATARSDGAGSRSALSRTPSVRKTGKSSPIRAAMATSQKVVEENNIIDGKFNTQIESLENPLVSLECFIFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTGCTGTATTAGCTCCGGCAAATCCTTCAATTCAGCTAACAAATTCGATCGGAATTCTAACAATGGATGCAGAGACCCGCCGTCTTCCATGGAGGAAGAGACCGT
CAAAGAAGTGCTTTCTGAAACCCCTGCTCTGAAACCGCCGCCGTCGCCGCCGAAGAAGAGTTTTCCACTGGAAGAAGACGAAGCCCAAAAACCAGTCGGTAATGAGATCG
AGAAGAAGCTTCGTGAAATTCCCGTTAACGGAATTGCAGAGCAACCTTCTGAATTCAATGAAATTTCCCATCCCAACGACTGTCTCCCAGTCTCCGCCGCTATTTTCACC
GATCAAATGGATGGCGGCGGAGAGGTTCATCAAACGGTTTTGAAATCATCGCCAGTGAAATTGCCGAAGAATCAAGCAATTTCCGGAGACGTTGAATTAAAAAGAGAAAT
ATCGCAGAACAGGACATTGACCCGGAGATCCGACCAGTCGCCAGCCCGACGAAACGGCGCCGTGGGGTCGATGAGAATGGTTCATAACAGAGACATGAGTCCGGCAATGG
CGCGGCGAGGATTAAGAGCGGAGCCTCCCCGGAGAGACCCAGATGAGAATTTCAGCCGGAGATCCCGGTCGCCGGCTACCGCACGTTCCGACGGCGCAGGGTCTAGATCT
GCCCTGAGTCGGACCCCGTCAGTGAGAAAGACCGGTAAATCATCGCCCATTAGGGCAGCGATGGCGACAAGTCAAAAAGTAGTAGAAGAAAACAATATCATAGATGGAAA
ATTCAACACTCAGATCGAGTCACTTGAGAACCCTCTGGTTTCATTAGAGTGCTTCATCTTCCTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGTTGCTGTATTAGCTCCGGCAAATCCTTCAATTCAGCTAACAAATTCGATCGGAATTCTAACAATGGATGCAGAGACCCGCCGTCTTCCATGGAGGAAGAGACCGT
CAAAGAAGTGCTTTCTGAAACCCCTGCTCTGAAACCGCCGCCGTCGCCGCCGAAGAAGAGTTTTCCACTGGAAGAAGACGAAGCCCAAAAACCAGTCGGTAATGAGATCG
AGAAGAAGCTTCGTGAAATTCCCGTTAACGGAATTGCAGAGCAACCTTCTGAATTCAATGAAATTTCCCATCCCAACGACTGTCTCCCAGTCTCCGCCGCTATTTTCACC
GATCAAATGGATGGCGGCGGAGAGGTTCATCAAACGGTTTTGAAATCATCGCCAGTGAAATTGCCGAAGAATCAAGCAATTTCCGGAGACGTTGAATTAAAAAGAGAAAT
ATCGCAGAACAGGACATTGACCCGGAGATCCGACCAGTCGCCAGCCCGACGAAACGGCGCCGTGGGGTCGATGAGAATGGTTCATAACAGAGACATGAGTCCGGCAATGG
CGCGGCGAGGATTAAGAGCGGAGCCTCCCCGGAGAGACCCAGATGAGAATTTCAGCCGGAGATCCCGGTCGCCGGCTACCGCACGTTCCGACGGCGCAGGGTCTAGATCT
GCCCTGAGTCGGACCCCGTCAGTGAGAAAGACCGGTAAATCATCGCCCATTAGGGCAGCGATGGCGACAAGTCAAAAAGTAGTAGAAGAAAACAATATCATAGATGGAAA
ATTCAACACTCAGATCGAGTCACTTGAGAACCCTCTGGTTTCATTAGAGTGCTTCATCTTCCTCTGA
Protein sequenceShow/hide protein sequence
MGCCISSGKSFNSANKFDRNSNNGCRDPPSSMEEETVKEVLSETPALKPPPSPPKKSFPLEEDEAQKPVGNEIEKKLREIPVNGIAEQPSEFNEISHPNDCLPVSAAIFT
DQMDGGGEVHQTVLKSSPVKLPKNQAISGDVELKREISQNRTLTRRSDQSPARRNGAVGSMRMVHNRDMSPAMARRGLRAEPPRRDPDENFSRRSRSPATARSDGAGSRS
ALSRTPSVRKTGKSSPIRAAMATSQKVVEENNIIDGKFNTQIESLENPLVSLECFIFL