CuGenDBv2

Spg030481 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg030481
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Unknown protein
Genome location: scaffold6:34119095..34125717
RNA-Seq Expression: Spg030481
Synteny: Spg030481
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits | e value | % identity | Alignment
KAE8718449.1 hypothetical protein F3Y22_tig00110013pilonHSYRG00240 [Hibiscus syriacus] | e value: 2.3e-31 | % identity: 28.71
Query:  RFVNELARAKYQEVLKRDFLFERGF----GTD--FPRFLESGIASLGWRQFCVKPDPVNANIVREFYANLDVKDDFEVIVRGVPAQWSPEAINNLFDLQD
        +F N+ A+A++Q    R+  FE GF     TD  F   +   +  L W +F   P  VNA++V+EFYAN+   +   + VRG   +++  AIN  F LQ+
Subjt:  RFVNELARAKYQEVLKRDFLFERGF----GTD--FPRFLESGIASLGWRQFCVKPDPVNANIVREFYANLDVKDDFEVIVRGVPAQWSPEAINNLFDLQD

Query:  F--PHAVFNEMVVAPSSDQLSAAVQEVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI
            HA+F E      S++    ++++  E  +W   QT +++     L+  A  W  F++ +L+PT+H++TVS  R+LL  +++ S  IDVG+II  ++
Subjt:  F--PHAVFNEMVVAPSSDQLSAAVQEVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI

Query:  VDCWKKKVGKLFFPNTITMLCSRAEVPTVPEDMIMTDKGIIDTPNLARLQRMQEVRQGGMVY----GVNQILEQLTVLTSRLEFAERQAQ---------T
         DC  KK   L FPN IT LC + +V     D I+     I    L  L  ++  +    V+    G  +   ++ +L       + QAQ          
Subjt:  VDCWKKKVGKLFFPNTITMLCSRAEVPTVPEDMIMTDKGIIDTPNLARLQRMQEVRQGGMVY----GVNQILEQLTVLTSRLEFAERQAQ---------T

Query:  YWTYAKRRDDALRGALQSNFSRPYQAFPEFPDDL---FNLWIPPPPVEREED-------VDEEQAELGF------AECSAVFAPTPLSSRRRRSSRR---
        ++ Y K RD  +    Q       + FP FPD++   FN    P P     D        D  ++E          E +    P P  S  +RS RR   
Subjt:  YWTYAKRRDDALRGALQSNFSRPYQAFPEFPDDL---FNLWIPPPPVEREED-------VDEEQAELGF------AECSAVFAPTPLSSRRRRSSRR---

Query:  RPGSAAAAVDTPSRGSLSPRRFSSR
          G  A    T S  S +P R  +R
Subjt:  RPGSAAAAVDTPSRGSLSPRRFSSR

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii] | e value: 6.4e-34 | % identity: 36.95
Query:  IRFVNELARAKYQ-EVLKRDFLFERGFGTDFPR------FLESGIASLGWRQFCVKPDPVNANIVREFYANLDVKDDFEVIVRGVPAQWSPEAINNLFDL
        ++F  E A  +Y+  +  R    E+GF  D         F+   I    W+QFC  P+     +VREFYANL    +  V VRGV   WS EAIN +F L
Subjt:  IRFVNELARAKYQ-EVLKRDFLFERGFGTDFPR------FLESGIASLGWRQFCVKPDPVNANIVREFYANLDVKDDFEVIVRGVPAQWSPEAINNLFDL

Query:  QDFPHAVFNEMVVAPSSDQLSAAVQEVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI
         D P    +E +   +   L   ++ V + GA+W VS    +T   + L   A  W  F++  LLPTTH  TVS+DR+LL  ++L   SI+VG++I SEI
Subjt:  QDFPHAVFNEMVVAPSSDQLSAAVQEVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI

Query:  VDCWKKKVGKLFFPNTITMLCSRAEVPTVPEDMIMTDKGIIDTPNLARL
          C  +K G LFFP+ IT LC  A  P +  +  + + G ID   +AR+
Subjt:  VDCWKKKVGKLFFPNTITMLCSRAEVPTVPEDMIMTDKGIIDTPNLARL

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii] | e value: 6.2e-45 | % identity: 34.51
Query:  IRFVNELARAKYQ-EVLKRDFLFERGFGTDFPR------FLESGIASLGWRQFCVKPDPVNANIVREFYANLDVKDDFEVIVRGVPAQWSPEAINNLFDL
        ++F  E A  +Y+  +  R    E+GF  D         F+   I    W+QFC  P+     +VREFYANL   ++  V VRGV   WS EAIN +F L
Subjt:  IRFVNELARAKYQ-EVLKRDFLFERGFGTDFPR------FLESGIASLGWRQFCVKPDPVNANIVREFYANLDVKDDFEVIVRGVPAQWSPEAINNLFDL

Query:  QDFPHAVFNEMVVAPSSDQLSAAVQEVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI
         D P    +E +   +   L   ++ V   GA+W VS    +T   + L   A  W  F++ RLLPTTH  TVS+DR+LL  ++L   SI+VG++I SEI
Subjt:  QDFPHAVFNEMVVAPSSDQLSAAVQEVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI

Query:  VDCWKKKVGKLFFPNTITMLCSRAEVPTVPEDMIMTDKGIIDTPNLARLQR-------MQEVRQGGMVYGVNQ----ILEQLTVLTSR------------
          C  +K G LFFP+ IT LC  A  P +  +  + + G ID   +AR+ +        Q           N+    IL+QL  L  R            
Subjt:  VDCWKKKVGKLFFPNTITMLCSRAEVPTVPEDMIMTDKGIIDTPNLARLQR-------MQEVRQGGMVYGVNQ----ILEQLTVLTSR------------

Query:  --LEFAERQAQTYWTYAKRRDDALRGALQSNFSRPYQAFPEFPDDLFNLWIPPPPVEREEDVDEEQAE
          L+   +Q Q +W Y+K RD AL+ ALQ+NF+RP   FP FP ++          E ++D   E AE
Subjt:  --LEFAERQAQTYWTYAKRRDDALRGALQSNFSRPYQAFPEFPDDLFNLWIPPPPVEREEDVDEEQAE

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii] | e value: 1.9e-30 | % identity: 34.8
Query:  IRFVNELARAKYQE-------VLKRDFLFERGFGTDFPRFLESGIASLGWRQFCVKPDPVNANIVREFYANLDVKDDFEVIVRGVPAQWSPEAINNLFDL
        ++F ++ A  +Y+E        ++++F+++     + P F+   I    W+ FC  P+     +VREFY N+   DD  V +RGV    S EAIN +F L
Subjt:  IRFVNELARAKYQE-------VLKRDFLFERGFGTDFPRFLESGIASLGWRQFCVKPDPVNANIVREFYANLDVKDDFEVIVRGVPAQWSPEAINNLFDL

Query:  QDFPHAVFNEMVVAPSSDQLSAAVQEVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI
         D P    +E V   +  +L   ++ V I GA+W VS    +T   + L   A  W  F++ RLLPTTH  TVS++ V L +++L   SI+VG++I  EI
Subjt:  QDFPHAVFNEMVVAPSSDQLSAAVQEVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI

Query:  VDCWKKKVGKLFFPNTITMLCSRAEVP
          C  +K G LFFP+ IT +C     P
Subjt:  VDCWKKKVGKLFFPNTITMLCSRAEVP

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii] | e value: 9.5e-38 | % identity: 37.5
Query:  IVREFYANLDVKDDFEVIVRGVPAQWSPEAINNLFDLQD--FPHAVFNEMVVAPSSDQLSAAVQEVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIR
        +VREFYANL   ++  + VRGV   WS EAIN +F L D    H+ F E +  P   +L   ++ V   GA+W VS    +T   + L   A  W  F++
Subjt:  IVREFYANLDVKDDFEVIVRGVPAQWSPEAINNLFDLQD--FPHAVFNEMVVAPSSDQLSAAVQEVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIR

Query:  LRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVDCWKKKVGKLFFPNTITMLCSRAEVPTVPEDMIMTDKGIIDTPNLARL---------QRM
         RLLPTTH   VS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LC  A  P +  +  + + G ID   +AR+         Q+ 
Subjt:  LRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVDCWKKKVGKLFFPNTITMLCSRAEVPTVPEDMIMTDKGIIDTPNLARL---------QRM

Query:  QEVRQGGMVYG--VNQILEQLTVLTSRL---EFAERQAQTYWTYAKRRDDALRGALQSNFSRPYQAFPEFPDDLFNLWIPPPPVEREEDVDEEQAE
           R            +L+QL  L  RL   E   +Q Q +W Y+K RD AL+ ALQ+NF+RP   FP FP ++          E ++D   E AE
Subjt:  QEVRQGGMVYG--VNQILEQLTVLTSRL---EFAERQAQTYWTYAKRRDDALRGALQSNFSRPYQAFPEFPDDLFNLWIPPPPVEREEDVDEEQAE

TrEMBL top hits | e value | % identity | Alignment
A0A2P5AGA5 Uncharacterized protein (Fragment) | e value: 3.1e-34 | % identity: 36.95
Query:  IRFVNELARAKYQ-EVLKRDFLFERGFGTDFPR------FLESGIASLGWRQFCVKPDPVNANIVREFYANLDVKDDFEVIVRGVPAQWSPEAINNLFDL
        ++F  E A  +Y+  +  R    E+GF  D         F+   I    W+QFC  P+     +VREFYANL    +  V VRGV   WS EAIN +F L
Subjt:  IRFVNELARAKYQ-EVLKRDFLFERGFGTDFPR------FLESGIASLGWRQFCVKPDPVNANIVREFYANLDVKDDFEVIVRGVPAQWSPEAINNLFDL

Query:  QDFPHAVFNEMVVAPSSDQLSAAVQEVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI
         D P    +E +   +   L   ++ V + GA+W VS    +T   + L   A  W  F++  LLPTTH  TVS+DR+LL  ++L   SI+VG++I SEI
Subjt:  QDFPHAVFNEMVVAPSSDQLSAAVQEVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI

Query:  VDCWKKKVGKLFFPNTITMLCSRAEVPTVPEDMIMTDKGIIDTPNLARL
          C  +K G LFFP+ IT LC  A  P +  +  + + G ID   +AR+
Subjt:  VDCWKKKVGKLFFPNTITMLCSRAEVPTVPEDMIMTDKGIIDTPNLARL

A0A2P5BCG4 Uncharacterized protein (Fragment) | e value: 3.0e-45 | % identity: 34.51
Query:  IRFVNELARAKYQ-EVLKRDFLFERGFGTDFPR------FLESGIASLGWRQFCVKPDPVNANIVREFYANLDVKDDFEVIVRGVPAQWSPEAINNLFDL
        ++F  E A  +Y+  +  R    E+GF  D         F+   I    W+QFC  P+     +VREFYANL   ++  V VRGV   WS EAIN +F L
Subjt:  IRFVNELARAKYQ-EVLKRDFLFERGFGTDFPR------FLESGIASLGWRQFCVKPDPVNANIVREFYANLDVKDDFEVIVRGVPAQWSPEAINNLFDL

Query:  QDFPHAVFNEMVVAPSSDQLSAAVQEVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI
         D P    +E +   +   L   ++ V   GA+W VS    +T   + L   A  W  F++ RLLPTTH  TVS+DR+LL  ++L   SI+VG++I SEI
Subjt:  QDFPHAVFNEMVVAPSSDQLSAAVQEVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI

Query:  VDCWKKKVGKLFFPNTITMLCSRAEVPTVPEDMIMTDKGIIDTPNLARLQR-------MQEVRQGGMVYGVNQ----ILEQLTVLTSR------------
          C  +K G LFFP+ IT LC  A  P +  +  + + G ID   +AR+ +        Q           N+    IL+QL  L  R            
Subjt:  VDCWKKKVGKLFFPNTITMLCSRAEVPTVPEDMIMTDKGIIDTPNLARLQR-------MQEVRQGGMVYGVNQ----ILEQLTVLTSR------------

Query:  --LEFAERQAQTYWTYAKRRDDALRGALQSNFSRPYQAFPEFPDDLFNLWIPPPPVEREEDVDEEQAE
          L+   +Q Q +W Y+K RD AL+ ALQ+NF+RP   FP FP ++          E ++D   E AE
Subjt:  --LEFAERQAQTYWTYAKRRDDALRGALQSNFSRPYQAFPEFPDDLFNLWIPPPPVEREEDVDEEQAE

A0A2P5DAQ2 Uncharacterized protein | e value: 9.3e-31 | % identity: 34.8
Query:  IRFVNELARAKYQE-------VLKRDFLFERGFGTDFPRFLESGIASLGWRQFCVKPDPVNANIVREFYANLDVKDDFEVIVRGVPAQWSPEAINNLFDL
        ++F ++ A  +Y+E        ++++F+++     + P F+   I    W+ FC  P+     +VREFY N+   DD  V +RGV    S EAIN +F L
Subjt:  IRFVNELARAKYQE-------VLKRDFLFERGFGTDFPRFLESGIASLGWRQFCVKPDPVNANIVREFYANLDVKDDFEVIVRGVPAQWSPEAINNLFDL

Query:  QDFPHAVFNEMVVAPSSDQLSAAVQEVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI
         D P    +E V   +  +L   ++ V I GA+W VS    +T   + L   A  W  F++ RLLPTTH  TVS++ V L +++L   SI+VG++I  EI
Subjt:  QDFPHAVFNEMVVAPSSDQLSAAVQEVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI

Query:  VDCWKKKVGKLFFPNTITMLCSRAEVP
          C  +K G LFFP+ IT +C     P
Subjt:  VDCWKKKVGKLFFPNTITMLCSRAEVP

A0A2P5DXM3 Uncharacterized protein | e value: 4.6e-38 | % identity: 37.5
Query:  IVREFYANLDVKDDFEVIVRGVPAQWSPEAINNLFDLQD--FPHAVFNEMVVAPSSDQLSAAVQEVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIR
        +VREFYANL   ++  + VRGV   WS EAIN +F L D    H+ F E +  P   +L   ++ V   GA+W VS    +T   + L   A  W  F++
Subjt:  IVREFYANLDVKDDFEVIVRGVPAQWSPEAINNLFDLQD--FPHAVFNEMVVAPSSDQLSAAVQEVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIR

Query:  LRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVDCWKKKVGKLFFPNTITMLCSRAEVPTVPEDMIMTDKGIIDTPNLARL---------QRM
         RLLPTTH   VS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LC  A  P +  +  + + G ID   +AR+         Q+ 
Subjt:  LRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVDCWKKKVGKLFFPNTITMLCSRAEVPTVPEDMIMTDKGIIDTPNLARL---------QRM

Query:  QEVRQGGMVYG--VNQILEQLTVLTSRL---EFAERQAQTYWTYAKRRDDALRGALQSNFSRPYQAFPEFPDDLFNLWIPPPPVEREEDVDEEQAE
           R            +L+QL  L  RL   E   +Q Q +W Y+K RD AL+ ALQ+NF+RP   FP FP ++          E ++D   E AE
Subjt:  QEVRQGGMVYG--VNQILEQLTVLTSRL---EFAERQAQTYWTYAKRRDDALRGALQSNFSRPYQAFPEFPDDLFNLWIPPPPVEREEDVDEEQAE

A0A6A3BU96 Uncharacterized protein | e value: 1.1e-31 | % identity: 28.71
Query:  RFVNELARAKYQEVLKRDFLFERGF----GTD--FPRFLESGIASLGWRQFCVKPDPVNANIVREFYANLDVKDDFEVIVRGVPAQWSPEAINNLFDLQD
        +F N+ A+A++Q    R+  FE GF     TD  F   +   +  L W +F   P  VNA++V+EFYAN+   +   + VRG   +++  AIN  F LQ+
Subjt:  RFVNELARAKYQEVLKRDFLFERGF----GTD--FPRFLESGIASLGWRQFCVKPDPVNANIVREFYANLDVKDDFEVIVRGVPAQWSPEAINNLFDLQD

Query:  F--PHAVFNEMVVAPSSDQLSAAVQEVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI
            HA+F E      S++    ++++  E  +W   QT +++     L+  A  W  F++ +L+PT+H++TVS  R+LL  +++ S  IDVG+II  ++
Subjt:  F--PHAVFNEMVVAPSSDQLSAAVQEVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI

Query:  VDCWKKKVGKLFFPNTITMLCSRAEVPTVPEDMIMTDKGIIDTPNLARLQRMQEVRQGGMVY----GVNQILEQLTVLTSRLEFAERQAQ---------T
         DC  KK   L FPN IT LC + +V     D I+     I    L  L  ++  +    V+    G  +   ++ +L       + QAQ          
Subjt:  VDCWKKKVGKLFFPNTITMLCSRAEVPTVPEDMIMTDKGIIDTPNLARLQRMQEVRQGGMVY----GVNQILEQLTVLTSRLEFAERQAQ---------T

Query:  YWTYAKRRDDALRGALQSNFSRPYQAFPEFPDDL---FNLWIPPPPVEREED-------VDEEQAELGF------AECSAVFAPTPLSSRRRRSSRR---
        ++ Y K RD  +    Q       + FP FPD++   FN    P P     D        D  ++E          E +    P P  S  +RS RR   
Subjt:  YWTYAKRRDDALRGALQSNFSRPYQAFPEFPDDL---FNLWIPPPPVEREED-------VDEEQAELGF------AECSAVFAPTPLSSRRRRSSRR---

Query:  RPGSAAAAVDTPSRGSLSPRRFSSR
          G  A    T S  S +P R  +R
Subjt:  RPGSAAAAVDTPSRGSLSPRRFSSR

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGGCTAAAACAAGAGCTCGAAAAGAGAGAGAAAGTGAAGAGGAGGAGGTGCCGGTTACACCGGAAGTGCAGAAAGGAAAAACGAAGAAAAAGAGAACTCCAGAAGAAAA
AGAGGCTAAACGAAGAAGGAGGCAGCAGAGGGCTGCGGAGCAAGAAACGATTCAAGAAGAGACGGTGAATGTCGCGGATACAGAGGGAACTACACATCCTGAGACAGGAC
CAATAATTTCTGTTGCGGTTCAAGAAGGGAATGTCGAGAAAAATCAAGAAACAGAGGGTGAAGAGCAGGTCGAAGGTGAGCCTGGCGAGGAGAAAACACCGGAGCAGGAG
GCTCATGTTGAAATCATTATGCCTGAACCACCAAAGCGCCACCGCATCAAGAGGAAGGCGGGTCGCGTGAAGGTGGTTCGGAACACGCCATCGCCTCCGGCATCGGACTC
TGAGGAAGAAAAAAGGGAAGCTGAGAATAAGGAAAAAGAAGAAGAGGCAAGAAAGGCGGAAGAAGAACGTTTGCGCGAACAGAGAGAAAGCAAGGGCAAAGGAATTGCCG
AAGCATCGGGAGAAATTGAGGAGCCGAGGGCCCCATTCATTCGCTTCGTCAACGAACTTGCCAGAGCAAAATATCAAGAAGTACTGAAGCGTGATTTCTTATTCGAGCGA
GGATTTGGCACTGATTTTCCCAGGTTCTTGGAGTCCGGAATAGCGAGCCTTGGGTGGAGACAGTTTTGTGTGAAGCCTGATCCTGTCAATGCCAATATCGTTCGGGAATT
CTATGCTAATCTTGATGTGAAGGATGATTTTGAAGTCATAGTGCGAGGAGTGCCTGCCCAATGGAGCCCAGAGGCCATTAATAATTTGTTTGATCTTCAGGACTTTCCAC
ACGCAGTTTTCAATGAAATGGTGGTTGCCCCATCTAGTGACCAACTGAGTGCGGCTGTCCAGGAGGTAGGCATTGAGGGGGCCCAATGGAGGGTGTCGCAGACGCGGAAG
CATACGTTTCAAGCTGCTTATTTGAAGAGTGAAGCCAACACTTGGATGGGCTTCATTAGGCTCCGTTTGCTGCCAACAACGCATGACTCCACAGTGTCTCGGGACAGGGT
ATTGCTTGCCTTTGCTATTCTTCGCTCGATGAGTATTGATGTAGGAAAAATTATTTCTTCTGAGATTGTTGATTGCTGGAAAAAGAAGGTGGGGAAGCTGTTCTTTCCGA
ACACGATTACGATGTTATGCAGCAGGGCAGAAGTGCCCACGGTTCCAGAAGATATGATCATGACTGATAAGGGAATCATTGACACACCTAATCTGGCGCGGCTTCAGCGT
ATGCAAGAGGTTCGCCAGGGAGGGATGGTGTATGGCGTTAATCAAATCCTAGAGCAACTGACAGTGTTGACTAGTAGGTTAGAATTTGCTGAAAGGCAAGCTCAGACCTA
TTGGACTTATGCTAAAAGGAGAGATGATGCGCTAAGGGGGGCCTTGCAAAGCAATTTCTCAAGACCGTATCAGGCCTTCCCAGAGTTTCCCGATGATTTGTTTAATCTTT
GGATACCACCCCCGCCTGTTGAACGAGAAGAGGATGTTGATGAGGAGCAGGCAGAGCTTGGTTTTGCAGAGTGCTCAGCCGTTTTCGCGCCGACGCCGCTCAGCTCGCGT
CGCCGCCGGTCATCGCGCCGCCGCCCAGGTTCTGCAGCCGCCGCCGTCGACACGCCTAGCCGTGGGTCTCTCTCTCCGCGCCGTTTCTCCTCTCGTGGTCGTCGCGCAGG
CCGCCGTCCCTCCCTTCTCTTCGCGCGTGGAAGTCCAGCCGCCGCCGTTTCTCCTTGCAGCCGAGCGCCGCCGTTCGTGAGTCGTAGTGCCGCCGCTACCCAGATCGAAG
CTGTCGCATCTCTTCCTTCTCTCGTTCCCTTGCGTTTTCGACCAAGAAGAACTCGTGGATCTCGCGTGTCAAGCGATTCGGAGCCCTGTCGTTCCCTTTTCAGTCGATTC
CGCCTCTGTCCAGCAACGTCTCGGCTGTTGTTGGTGTCGTTTGGCGTTTCCGCCGGAAGCTCGTTTTGGGAGCGTGTCGTGCCGTTCCAGCTAGCGTTGCACATGGCGCA
GAATTCATGTGTAGCGGAGCATGACGCGGAAATCATCCTGTTGGATATCTGTGGTTGCATAAGCATGATGCTTGCTTGTGGTTGTGAAAGCATGTTGGATGCGTGTGTAA
GGCATGTTGATTGCGTTGTTGATGTTTGTGGTTTGTTAATGTATGATGTGATGAGCCTTGATTTGTCTAGAGGAGATAGGTTAAGGGTTGGAAAACCTGGGGCGTTACAA
GAAGAATGGAAGCTTTGGAAATCACTTTGTGCAGATTATGCTGCTGAGCGACTGGAAGGAGCAAATTCTATGTTGCAGCAAAACTGGGAGCAGAAACTGCCACATCACAG
CTCGTTAGCCAACTTCATGAACCGACTTCTATTGAGATATTTTCGTGATAAAGGATCAAGGAGAGCCTTACACGTGTCCTAG
mRNA sequence
ATGGCTAAAACAAGAGCTCGAAAAGAGAGAGAAAGTGAAGAGGAGGAGGTGCCGGTTACACCGGAAGTGCAGAAAGGAAAAACGAAGAAAAAGAGAACTCCAGAAGAAAA
AGAGGCTAAACGAAGAAGGAGGCAGCAGAGGGCTGCGGAGCAAGAAACGATTCAAGAAGAGACGGTGAATGTCGCGGATACAGAGGGAACTACACATCCTGAGACAGGAC
CAATAATTTCTGTTGCGGTTCAAGAAGGGAATGTCGAGAAAAATCAAGAAACAGAGGGTGAAGAGCAGGTCGAAGGTGAGCCTGGCGAGGAGAAAACACCGGAGCAGGAG
GCTCATGTTGAAATCATTATGCCTGAACCACCAAAGCGCCACCGCATCAAGAGGAAGGCGGGTCGCGTGAAGGTGGTTCGGAACACGCCATCGCCTCCGGCATCGGACTC
TGAGGAAGAAAAAAGGGAAGCTGAGAATAAGGAAAAAGAAGAAGAGGCAAGAAAGGCGGAAGAAGAACGTTTGCGCGAACAGAGAGAAAGCAAGGGCAAAGGAATTGCCG
AAGCATCGGGAGAAATTGAGGAGCCGAGGGCCCCATTCATTCGCTTCGTCAACGAACTTGCCAGAGCAAAATATCAAGAAGTACTGAAGCGTGATTTCTTATTCGAGCGA
GGATTTGGCACTGATTTTCCCAGGTTCTTGGAGTCCGGAATAGCGAGCCTTGGGTGGAGACAGTTTTGTGTGAAGCCTGATCCTGTCAATGCCAATATCGTTCGGGAATT
CTATGCTAATCTTGATGTGAAGGATGATTTTGAAGTCATAGTGCGAGGAGTGCCTGCCCAATGGAGCCCAGAGGCCATTAATAATTTGTTTGATCTTCAGGACTTTCCAC
ACGCAGTTTTCAATGAAATGGTGGTTGCCCCATCTAGTGACCAACTGAGTGCGGCTGTCCAGGAGGTAGGCATTGAGGGGGCCCAATGGAGGGTGTCGCAGACGCGGAAG
CATACGTTTCAAGCTGCTTATTTGAAGAGTGAAGCCAACACTTGGATGGGCTTCATTAGGCTCCGTTTGCTGCCAACAACGCATGACTCCACAGTGTCTCGGGACAGGGT
ATTGCTTGCCTTTGCTATTCTTCGCTCGATGAGTATTGATGTAGGAAAAATTATTTCTTCTGAGATTGTTGATTGCTGGAAAAAGAAGGTGGGGAAGCTGTTCTTTCCGA
ACACGATTACGATGTTATGCAGCAGGGCAGAAGTGCCCACGGTTCCAGAAGATATGATCATGACTGATAAGGGAATCATTGACACACCTAATCTGGCGCGGCTTCAGCGT
ATGCAAGAGGTTCGCCAGGGAGGGATGGTGTATGGCGTTAATCAAATCCTAGAGCAACTGACAGTGTTGACTAGTAGGTTAGAATTTGCTGAAAGGCAAGCTCAGACCTA
TTGGACTTATGCTAAAAGGAGAGATGATGCGCTAAGGGGGGCCTTGCAAAGCAATTTCTCAAGACCGTATCAGGCCTTCCCAGAGTTTCCCGATGATTTGTTTAATCTTT
GGATACCACCCCCGCCTGTTGAACGAGAAGAGGATGTTGATGAGGAGCAGGCAGAGCTTGGTTTTGCAGAGTGCTCAGCCGTTTTCGCGCCGACGCCGCTCAGCTCGCGT
CGCCGCCGGTCATCGCGCCGCCGCCCAGGTTCTGCAGCCGCCGCCGTCGACACGCCTAGCCGTGGGTCTCTCTCTCCGCGCCGTTTCTCCTCTCGTGGTCGTCGCGCAGG
CCGCCGTCCCTCCCTTCTCTTCGCGCGTGGAAGTCCAGCCGCCGCCGTTTCTCCTTGCAGCCGAGCGCCGCCGTTCGTGAGTCGTAGTGCCGCCGCTACCCAGATCGAAG
CTGTCGCATCTCTTCCTTCTCTCGTTCCCTTGCGTTTTCGACCAAGAAGAACTCGTGGATCTCGCGTGTCAAGCGATTCGGAGCCCTGTCGTTCCCTTTTCAGTCGATTC
CGCCTCTGTCCAGCAACGTCTCGGCTGTTGTTGGTGTCGTTTGGCGTTTCCGCCGGAAGCTCGTTTTGGGAGCGTGTCGTGCCGTTCCAGCTAGCGTTGCACATGGCGCA
GAATTCATGTGTAGCGGAGCATGACGCGGAAATCATCCTGTTGGATATCTGTGGTTGCATAAGCATGATGCTTGCTTGTGGTTGTGAAAGCATGTTGGATGCGTGTGTAA
GGCATGTTGATTGCGTTGTTGATGTTTGTGGTTTGTTAATGTATGATGTGATGAGCCTTGATTTGTCTAGAGGAGATAGGTTAAGGGTTGGAAAACCTGGGGCGTTACAA
GAAGAATGGAAGCTTTGGAAATCACTTTGTGCAGATTATGCTGCTGAGCGACTGGAAGGAGCAAATTCTATGTTGCAGCAAAACTGGGAGCAGAAACTGCCACATCACAG
CTCGTTAGCCAACTTCATGAACCGACTTCTATTGAGATATTTTCGTGATAAAGGATCAAGGAGAGCCTTACACGTGTCCTAG
Protein sequence
MAKTRARKERESEEEEVPVTPEVQKGKTKKKRTPEEKEAKRRRRQQRAAEQETIQEETVNVADTEGTTHPETGPIISVAVQEGNVEKNQETEGEEQVEGEPGEEKTPEQE
AHVEIIMPEPPKRHRIKRKAGRVKVVRNTPSPPASDSEEEKREAENKEKEEEARKAEEERLREQRESKGKGIAEASGEIEEPRAPFIRFVNELARAKYQEVLKRDFLFER
GFGTDFPRFLESGIASLGWRQFCVKPDPVNANIVREFYANLDVKDDFEVIVRGVPAQWSPEAINNLFDLQDFPHAVFNEMVVAPSSDQLSAAVQEVGIEGAQWRVSQTRK
HTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVDCWKKKVGKLFFPNTITMLCSRAEVPTVPEDMIMTDKGIIDTPNLARLQR
MQEVRQGGMVYGVNQILEQLTVLTSRLEFAERQAQTYWTYAKRRDDALRGALQSNFSRPYQAFPEFPDDLFNLWIPPPPVEREEDVDEEQAELGFAECSAVFAPTPLSSR
RRRSSRRRPGSAAAAVDTPSRGSLSPRRFSSRGRRAGRRPSLLFARGSPAAAVSPCSRAPPFVSRSAAATQIEAVASLPSLVPLRFRPRRTRGSRVSSDSEPCRSLFSRF
RLCPATSRLLLVSFGVSAGSSFWERVVPFQLALHMAQNSCVAEHDAEIILLDICGCISMMLACGCESMLDACVRHVDCVVDVCGLLMYDVMSLDLSRGDRLRVGKPGALQ
EEWKLWKSLCADYAAERLEGANSMLQQNWEQKLPHHSSLANFMNRLLLRYFRDKGSRRALHVS