; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg005417 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg005417
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRT_RNaseH_2 domain-containing protein
Genome locationscaffold6:34626695..34647042
RNA-Seq ExpressionSpg005417
SyntenySpg005417
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8661093.1 hypothetical protein F3Y22_tig00116939pilonHSYRG00213 [Hibiscus syriacus]4.3e-2331.54Show/hide
Query:  RFVNDLARAKHQEVLKRDFLFERGF-------GSDLPRFFESGIVNLGWRQFCAKPEHVNANIVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFDLQ
        +F ND A+A+ Q    R   FE GF       G   P   +  +  L W++F   P  VNA++V+EFYAN+   N + + VRG  ++++  AIN  F LQ
Subjt:  RFVNDLARAKHQEVLKRDFLFERGF-------GSDLPRFFESGIVNLGWRQFCAKPEHVNANIVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFDLQ

Query:  DF--PHAVFNEMVAAPSSD---------------------QLSAAVRESEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVD
        D    HA F E   +   D                      ++    +  A  W  F++ +L+PT+H++TVS  R+LL  +I+ S  IDVG II  ++ D
Subjt:  DF--PHAVFNEMVAAPSSD---------------------QLSAAVRESEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVD

Query:  CWKKKVGKLFFPNTITMLCSRAGVSTVPEDMIMTDKGIIDTPNLARLQRMQEVRQGGLVY
        C  KK   L FPN IT LC +  V     D I+     I+   L  L  ++  +    V+
Subjt:  CWKKKVGKLFFPNTITMLCSRAGVSTVPEDMIMTDKGIIDTPNLARLQRMQEVRQGGLVY

KAE8718449.1 hypothetical protein F3Y22_tig00110013pilonHSYRG00240 [Hibiscus syriacus]1.6e-2528.78Show/hide
Query:  RFVNDLARAKHQEVLKRDFLFERGF-------GSDLPRFFESGIVNLGWRQFCAKPEHVNANIVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFDLQ
        +F ND A+A+ Q    R+  FE GF       G   P   +  ++ L W +F   P  VNA++V+EFYAN+   N   + VRG  ++++  AIN  F LQ
Subjt:  RFVNDLARAKHQEVLKRDFLFERGF-------GSDLPRFFESGIVNLGWRQFCAKPEHVNANIVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFDLQ

Query:  DF--PHAVFNEMVAAPSSD---------------------QLSAAVRESEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVD
        +    HA+F E   +   D                      ++    +  A  W  F++ +L+PT+H++TVS  R+LL  +++ S  IDVG+II  ++ D
Subjt:  DF--PHAVFNEMVAAPSSD---------------------QLSAAVRESEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVD

Query:  CWKKKVGKLFFPNTITMLCSRAGVSTVPEDMIMTDKGIIDTPNLARLQRMQEVRQGGLVY----GVNQILEQLAVLTSRLEFAERQAQ---------TYW
        C  KK   L FPN IT LC +  V     D I+     I    L  L  ++  +    V+    G  +   ++ +L       + QAQ          ++
Subjt:  CWKKKVGKLFFPNTITMLCSRAGVSTVPEDMIMTDKGIIDTPNLARLQRMQEVRQGGLVY----GVNQILEQLAVLTSRLEFAERQAQ---------TYW

Query:  TYAKRRDDALRGALQTNFSTPYPAFPVFPDDL---FNLWIPPPP
         Y K RD  +    Q         FP FPD++   FN    P P
Subjt:  TYAKRRDDALRGALQTNFSTPYPAFPVFPDDL---FNLWIPPPP

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]2.1e-2535.34Show/hide
Query:  IRFVNDLARAKHQ-EVLKRDFLFERGFGSD-------LPRFFESGIVNLGWRQFCAKPEHVNANIVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFD
        ++F  + A  +++  +  R    E+GF  D       LP F    I    W+QFCA PE     +VREFYANL    +  V VRGV V WS EAIN +F 
Subjt:  IRFVNDLARAKHQ-EVLKRDFLFERGFGSD-------LPRFFESGIVNLGWRQFCAKPEHVNANIVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFD

Query:  LQD--FPHAVFNE---------------------MVAAPSSDQLSAAVRESEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI
        L D    H+ F E                      V+A  +     +     A  W  F++  LLPTTH  TVS+DR+LL  ++L   SI+VG++I SEI
Subjt:  LQD--FPHAVFNE---------------------MVAAPSSDQLSAAVRESEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI

Query:  VDCWKKKVGKLFFPNTITMLCSRAGVSTVPEDMIMTDKGIIDTPNLARL
          C  +K G LFFP+ IT LC  A    +  +  + + G ID   +AR+
Subjt:  VDCWKKKVGKLFFPNTITMLCSRAGVSTVPEDMIMTDKGIIDTPNLARL

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]2.6e-3634.14Show/hide
Query:  IRFVNDLARAKHQ-EVLKRDFLFERGFGSD-------LPRFFESGIVNLGWRQFCAKPEHVNANIVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFD
        ++F  + A  +++  +  R    E+GF  D       LP F    I    W+QFCA PE     +VREFYANL    +  V VRGV V WS EAIN +F 
Subjt:  IRFVNDLARAKHQ-EVLKRDFLFERGFGSD-------LPRFFESGIVNLGWRQFCAKPEHVNANIVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFD

Query:  LQD--FPHAVF------------NEMVAAPSSD-QLSA-----AVRES---EANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI
        L D    H+ F             E VAA  ++  +SA      +R +    A  W  F++ RLLPTTH  TVS+DR+LL  ++L   SI+VG++I SEI
Subjt:  LQD--FPHAVF------------NEMVAAPSSD-QLSA-----AVRES---EANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI

Query:  VDCWKKKVGKLFFPNTITMLCSRAGVSTVPEDMIMTDKGIIDTPNLARLQR-------MQEVRQGGLVYGVNQ----ILEQLAVLTSR------------
          C  +K G LFFP+ IT LC  A    +  +  + + G ID   +AR+ +        Q           N+    IL+QL  L  R            
Subjt:  VDCWKKKVGKLFFPNTITMLCSRAGVSTVPEDMIMTDKGIIDTPNLARLQR-------MQEVRQGGLVYGVNQ----ILEQLAVLTSR------------

Query:  --LEFAERQAQTYWTYAKRRDDALRGALQTNFSTPYPAFPVFPDDLFNLWIPPPPVEREENDDEDQVGDELE
          L+   +Q Q +W Y+K RD AL+ ALQ NF+ P P FP FP ++    +     E E   D+D   +  E
Subjt:  --LEFAERQAQTYWTYAKRRDDALRGALQTNFSTPYPAFPVFPDDLFNLWIPPPPVEREENDDEDQVGDELE

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]1.6e-3036.03Show/hide
Query:  IVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFDLQD--FPHAVFNEMVAAPS----SDQLSAAVRE-----------------SEANTWMGFIRLRL
        +VREFYANL    +  + VRGV V WS EAIN +F L D    H+ F E +  P      + ++AA  E                   A  W  F++ RL
Subjt:  IVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFDLQD--FPHAVFNEMVAAPS----SDQLSAAVRE-----------------SEANTWMGFIRLRL

Query:  LPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVDCWKKKVGKLFFPNTITMLCSRAGVSTVPEDMIMTDKGIIDTPNLARL---------QRMQEV
        LPTTH   VS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LC  A      E +  T  G ID   +AR+         Q+    
Subjt:  LPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVDCWKKKVGKLFFPNTITMLCSRAGVSTVPEDMIMTDKGIIDTPNLARL---------QRMQEV

Query:  RQGGLVYG--VNQILEQLAVLTSRL---EFAERQAQTYWTYAKRRDDALRGALQTNFSTPYPAFPVFPDDLFNLWIPPPPVEREENDDEDQVGDELE
        R            +L+QL  L  RL   E   +Q Q +W Y+K RD AL+ ALQ NF+ P P FP FP ++    +     E E   D+D   +  E
Subjt:  RQGGLVYG--VNQILEQLAVLTSRL---EFAERQAQTYWTYAKRRDDALRGALQTNFSTPYPAFPVFPDDLFNLWIPPPPVEREENDDEDQVGDELE

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)1.0e-2535.34Show/hide
Query:  IRFVNDLARAKHQ-EVLKRDFLFERGFGSD-------LPRFFESGIVNLGWRQFCAKPEHVNANIVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFD
        ++F  + A  +++  +  R    E+GF  D       LP F    I    W+QFCA PE     +VREFYANL    +  V VRGV V WS EAIN +F 
Subjt:  IRFVNDLARAKHQ-EVLKRDFLFERGFGSD-------LPRFFESGIVNLGWRQFCAKPEHVNANIVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFD

Query:  LQD--FPHAVFNE---------------------MVAAPSSDQLSAAVRESEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI
        L D    H+ F E                      V+A  +     +     A  W  F++  LLPTTH  TVS+DR+LL  ++L   SI+VG++I SEI
Subjt:  LQD--FPHAVFNE---------------------MVAAPSSDQLSAAVRESEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI

Query:  VDCWKKKVGKLFFPNTITMLCSRAGVSTVPEDMIMTDKGIIDTPNLARL
          C  +K G LFFP+ IT LC  A    +  +  + + G ID   +AR+
Subjt:  VDCWKKKVGKLFFPNTITMLCSRAGVSTVPEDMIMTDKGIIDTPNLARL

A0A2P5BCG4 Uncharacterized protein (Fragment)1.3e-3634.14Show/hide
Query:  IRFVNDLARAKHQ-EVLKRDFLFERGFGSD-------LPRFFESGIVNLGWRQFCAKPEHVNANIVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFD
        ++F  + A  +++  +  R    E+GF  D       LP F    I    W+QFCA PE     +VREFYANL    +  V VRGV V WS EAIN +F 
Subjt:  IRFVNDLARAKHQ-EVLKRDFLFERGFGSD-------LPRFFESGIVNLGWRQFCAKPEHVNANIVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFD

Query:  LQD--FPHAVF------------NEMVAAPSSD-QLSA-----AVRES---EANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI
        L D    H+ F             E VAA  ++  +SA      +R +    A  W  F++ RLLPTTH  TVS+DR+LL  ++L   SI+VG++I SEI
Subjt:  LQD--FPHAVF------------NEMVAAPSSD-QLSA-----AVRES---EANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI

Query:  VDCWKKKVGKLFFPNTITMLCSRAGVSTVPEDMIMTDKGIIDTPNLARLQR-------MQEVRQGGLVYGVNQ----ILEQLAVLTSR------------
          C  +K G LFFP+ IT LC  A    +  +  + + G ID   +AR+ +        Q           N+    IL+QL  L  R            
Subjt:  VDCWKKKVGKLFFPNTITMLCSRAGVSTVPEDMIMTDKGIIDTPNLARLQR-------MQEVRQGGLVYGVNQ----ILEQLAVLTSR------------

Query:  --LEFAERQAQTYWTYAKRRDDALRGALQTNFSTPYPAFPVFPDDLFNLWIPPPPVEREENDDEDQVGDELE
          L+   +Q Q +W Y+K RD AL+ ALQ NF+ P P FP FP ++    +     E E   D+D   +  E
Subjt:  --LEFAERQAQTYWTYAKRRDDALRGALQTNFSTPYPAFPVFPDDLFNLWIPPPPVEREENDDEDQVGDELE

A0A2P5DXM3 Uncharacterized protein7.9e-3136.03Show/hide
Query:  IVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFDLQD--FPHAVFNEMVAAPS----SDQLSAAVRE-----------------SEANTWMGFIRLRL
        +VREFYANL    +  + VRGV V WS EAIN +F L D    H+ F E +  P      + ++AA  E                   A  W  F++ RL
Subjt:  IVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFDLQD--FPHAVFNEMVAAPS----SDQLSAAVRE-----------------SEANTWMGFIRLRL

Query:  LPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVDCWKKKVGKLFFPNTITMLCSRAGVSTVPEDMIMTDKGIIDTPNLARL---------QRMQEV
        LPTTH   VS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LC  A      E +  T  G ID   +AR+         Q+    
Subjt:  LPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVDCWKKKVGKLFFPNTITMLCSRAGVSTVPEDMIMTDKGIIDTPNLARL---------QRMQEV

Query:  RQGGLVYG--VNQILEQLAVLTSRL---EFAERQAQTYWTYAKRRDDALRGALQTNFSTPYPAFPVFPDDLFNLWIPPPPVEREENDDEDQVGDELE
        R            +L+QL  L  RL   E   +Q Q +W Y+K RD AL+ ALQ NF+ P P FP FP ++    +     E E   D+D   +  E
Subjt:  RQGGLVYG--VNQILEQLAVLTSRL---EFAERQAQTYWTYAKRRDDALRGALQTNFSTPYPAFPVFPDDLFNLWIPPPPVEREENDDEDQVGDELE

A0A6A2WM54 Uncharacterized protein2.1e-2331.54Show/hide
Query:  RFVNDLARAKHQEVLKRDFLFERGF-------GSDLPRFFESGIVNLGWRQFCAKPEHVNANIVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFDLQ
        +F ND A+A+ Q    R   FE GF       G   P   +  +  L W++F   P  VNA++V+EFYAN+   N + + VRG  ++++  AIN  F LQ
Subjt:  RFVNDLARAKHQEVLKRDFLFERGF-------GSDLPRFFESGIVNLGWRQFCAKPEHVNANIVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFDLQ

Query:  DF--PHAVFNEMVAAPSSD---------------------QLSAAVRESEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVD
        D    HA F E   +   D                      ++    +  A  W  F++ +L+PT+H++TVS  R+LL  +I+ S  IDVG II  ++ D
Subjt:  DF--PHAVFNEMVAAPSSD---------------------QLSAAVRESEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVD

Query:  CWKKKVGKLFFPNTITMLCSRAGVSTVPEDMIMTDKGIIDTPNLARLQRMQEVRQGGLVY
        C  KK   L FPN IT LC +  V     D I+     I+   L  L  ++  +    V+
Subjt:  CWKKKVGKLFFPNTITMLCSRAGVSTVPEDMIMTDKGIIDTPNLARLQRMQEVRQGGLVY

A0A6A3BU96 Uncharacterized protein7.7e-2628.78Show/hide
Query:  RFVNDLARAKHQEVLKRDFLFERGF-------GSDLPRFFESGIVNLGWRQFCAKPEHVNANIVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFDLQ
        +F ND A+A+ Q    R+  FE GF       G   P   +  ++ L W +F   P  VNA++V+EFYAN+   N   + VRG  ++++  AIN  F LQ
Subjt:  RFVNDLARAKHQEVLKRDFLFERGF-------GSDLPRFFESGIVNLGWRQFCAKPEHVNANIVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFDLQ

Query:  DF--PHAVFNEMVAAPSSD---------------------QLSAAVRESEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVD
        +    HA+F E   +   D                      ++    +  A  W  F++ +L+PT+H++TVS  R+LL  +++ S  IDVG+II  ++ D
Subjt:  DF--PHAVFNEMVAAPSSD---------------------QLSAAVRESEANTWMGFIRLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIVD

Query:  CWKKKVGKLFFPNTITMLCSRAGVSTVPEDMIMTDKGIIDTPNLARLQRMQEVRQGGLVY----GVNQILEQLAVLTSRLEFAERQAQ---------TYW
        C  KK   L FPN IT LC +  V     D I+     I    L  L  ++  +    V+    G  +   ++ +L       + QAQ          ++
Subjt:  CWKKKVGKLFFPNTITMLCSRAGVSTVPEDMIMTDKGIIDTPNLARLQRMQEVRQGGLVY----GVNQILEQLAVLTSRLEFAERQAQ---------TYW

Query:  TYAKRRDDALRGALQTNFSTPYPAFPVFPDDL---FNLWIPPPP
         Y K RD  +    Q         FP FPD++   FN    P P
Subjt:  TYAKRRDDALRGALQTNFSTPYPAFPVFPDDL---FNLWIPPPP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCACGTAAGCGCCACGTCACTGCCACGAGCAATGTTGGTGATTTTGGGAGAAAAAGTGCAAAAAAAGAAGCAAATGAAGCTCAAATCCAAGCTTGGAGCAAAAGGGA
ATCCAATGAGGCCAATCATGATGGAAAAAGCCCAAAAAGAAGTGAAGAGAAGGCCCAAAAATTAGCCCAAACCGAAGCTGGAAATTCTGGAAATTGGGTCAGCGTCGAAA
CTGAAGGATTGGAATTCCATAATTCTGTAAAGCGGAAGCGTGATTGGGACCGATTCTGTGAACACCACCACTATGACTACACCGGTATGACTCTGAGACTTCTAGAGGCA
AGAGACTTGTGGGAGCCTTTGGGAGAATTCTCTGAGAACTTGCCTGCTTGTGAGAATGAAAGATCTGCAGTTGATGATTTACCTTCCTTTGAGAATGAATTAGATTTGCC
TGAAATCAATAATTTTGATGATGATATTGATTTTCCTGACATTGAGGATGAGCATGAAATGCATAAGAAAGATTGCTTGATAGATAATTTTGAGTCTGATCATGATTACA
CTGAATCTATTGAGTCTGATCTTGACATTCCTGAATGCATGAACCCTGATAATGTTAATACCTTTGATTCGTGTCCTGATGATGTATATAGCATAGAATCTGACCCAGAG
GAACTCGAATCTGAATCCCCTGAGCATGTAGATATTTTTTCTGATGAATGGACTGGCATGATTGATAGGCCATCTTTAGATCCTAGACCAGTTGATATCATAACGCTTGA
TGACTTTGTTAACAACTGTTTTGTGAATAGAGATTTAGAAAAGAGATTTGATGCATCTGATTATTTTCTGAAATTTCAAACTAGTTTAAGATTTCTTCAAGAGTATGCGG
TGCTTGCTTATTCCACTTGCAGGAGCAAAGTTGGTGATTTTGGGAGAAAAAGTGCCAAAAATGAAGCAAATGAAGCTCAAATCCAAGCTTGGAGCAAAAGGGAATCCAAT
GAGGCCAATCATGATGGAAAAAGCCCAAAAAGAAGTGAAGAGAAGGCCCAAAAATTAGCCCAAACCGAAGCTGGAAATTATGGAAATTGGGTCAGCGACGAGACGCTGTG
TGATAACTGCCCAAAAGATTATGCTGCTGAGCGACTGGAGGGAGCAGATTCTATGCTGCAGCAAAACTGGGAACAAAAACGGCCACATCACAGCTCGTTAGCCAACTTCA
TGAACCGACTTCTCGATCCGCCTGGGGTACGGTTTGAGCTTGATCCAGAAATTGGAAGGACGTTTAGGAACAGAAGAAGGGAGCAGCGCAGAAACCAAATGGAGAACGCC
CCGCAACTTCCGCAGGTTTCTGAAGGTCCAGCAGAAGCAAACCCCCAGCAGAATCCGTTGCTGCAGCAAAACCCACTAATAACAGGTTCAGCAGATTGTTGCGGCAAAGA
TATTGCTGGAGCAAATTATTCCGAGATAGAAGGGTTTCTGTTGGAATTTATTATTGTTACCGTTAGACTTAGCTTTAAATTCTCGCAGCCCTATTTATTCGCAATCTCTG
GTAAGTACATTTCACTTCGTTTTCTTGATCAAATTCTTATGCGTGTGTTATTCTTTCTTTCTTTGCAAAACCCTATTATACCTTCCATGGCTAAAACAAGAGCGCGAAAA
GAGAGGGAGAGTGAGGAGGAAGAGATACCCGTCACACTGGAAGTTCAAAAAGGAAAATCTAAGAAGAAAAGGACGCCAGAAGAGAAGGAGGCGAAGCGAAGACGGAGGCA
GCAGAGGGCTGCGGAACAAGAAGTCATCCAAGAGACAGCGGTTGTGCAGGATCTTGAGACAGAACCGATAGTTTCTGCTACGGTTCAGGAAGGGAATGCTGAGAAGGAAC
ATGAGACAGAGGTCGGAGGACAAGTCGCAGGTGTGCCTGAAAAAGAGAAAACACCGGAGCCGGTGCAGGAGGCCCATGTTGAAATTATTATGCCTGAACCACCCAAGTGC
CGCCGCATCAAGAGGAAGGCGGGTCGCGTGAGGGTGGTTCGGAACACTCCATCACCTCCGACGTCGGACTCTGAGGAAGAAAGAAGGGAAGCTGAGAATAAGGAAAAAGA
AGAAGAAGCAAGAAAGGAAGAGGAAGAGCGTTTGCGTGTCCGGAGAGAAAGCAGAGGCAAAGGAATTGCCGAAGCATCAGGAGAAATTGAGGAGCCGAGGGCGCCATTCA
TTCGCTTCGTCAACGATCTTGCTCGAGCAAAACACCAGGAGGTACTGAAACGGGACTTCTTGTTCGAACGTGGATTTGGCAGTGATTTGCCTAGGTTCTTTGAGTCTGGA
ATAGTGAACCTTGGATGGAGGCAATTTTGTGCGAAACCAGAACATGTCAATGCCAACATTGTTCGAGAATTTTATGCCAATCTTGACGTTAAGAATGATTTTGAGGTTAT
CGTTCGAGGAGTGCCTGTACAGTGGAGTCCTGAGGCCATTAATGAACTGTTCGATCTCCAGGATTTTCCGCATGCCGTTTTTAATGAGATGGTGGCTGCACCATCTAGTG
ATCAACTGAGTGCGGCTGTCCGGGAGAGTGAAGCCAACACTTGGATGGGTTTCATCAGGCTACGCTTGCTGCCGACAACACACGACTCCACGGTATCTCGGGACAGGGTA
TTACTTGCCTTTGCAATTCTTCGCTCGATGAGTATTGATGTAGGAAAAATTATTTCTTCTGAGATTGTTGATTGCTGGAAAAAGAAGGTGGGGAAGCTGTTCTTTCCGAA
CACGATTACGATGTTATGCAGCAGGGCAGGAGTGTCCACGGTTCCAGAAGATATGATCATGACTGATAAGGGAATCATTGACACACCTAATCTGGCGCGACTTCAGCGTA
TGCAAGAGGTTCGCCAAGGTGGGCTTGTGTATGGCGTTAATCAGATCCTAGAGCAACTGGCTGTGTTGACCAGTAGGTTAGAGTTTGCTGAAAGGCAAGCTCAGACCTAC
TGGACTTATGCTAAAAGGAGAGATGATGCGCTCAGGGGGGCCTTGCAAACCAATTTCTCAACACCGTATCCGGCCTTTCCAGTGTTTCCCGATGATTTGTTTAATCTTTG
GATACCACCCCCGCCTGTTGAACGAGAAGAGAATGATGATGAAGACCAGGTTGGTGATGAGCTTGAGGCAAGGGTATACTGCACCATAAAGTGGGTCATCCCATGCTTAA
GAGCTTATGACTCAGAGCTTGGTTTTGCAGAGTGCTCAGATTATGCTGCTGAGCGACTGGAGGGAGCAGATTCTATGCTGCAGCAAAACTGGGAACAAAAACTGCCACAT
CACAGCTCGTTAGCCAACTTCATGAACCGACTTCTTAATAAATTGACTATTGCATTTTGGTTAATTGTTGTGCTTATTCCTATTTTATGCCCGCCTGGCCAGCTGGAGCC
CAACACTGTGCAATGTAGCGTTGCGACGCTACCCATTTCTTGGGCAACAAGACACGTGCATAGCAGCGTCGCGACACTGTGCAATGTAGCGTCGCGACGCTACCTCCAAT
CCGGGCCTATAAAAGGCACCTCTTGGTGCCTCATTTTAGCATCAATCACTCCATTCTTTCCTTCCTTTCCTCCCTCCTTTGGCTCCTTTGGAGCCTCTCTCAAGGCTTTC
TAG
mRNA sequenceShow/hide mRNA sequence
ATGCCACGTAAGCGCCACGTCACTGCCACGAGCAATGTTGGTGATTTTGGGAGAAAAAGTGCAAAAAAAGAAGCAAATGAAGCTCAAATCCAAGCTTGGAGCAAAAGGGA
ATCCAATGAGGCCAATCATGATGGAAAAAGCCCAAAAAGAAGTGAAGAGAAGGCCCAAAAATTAGCCCAAACCGAAGCTGGAAATTCTGGAAATTGGGTCAGCGTCGAAA
CTGAAGGATTGGAATTCCATAATTCTGTAAAGCGGAAGCGTGATTGGGACCGATTCTGTGAACACCACCACTATGACTACACCGGTATGACTCTGAGACTTCTAGAGGCA
AGAGACTTGTGGGAGCCTTTGGGAGAATTCTCTGAGAACTTGCCTGCTTGTGAGAATGAAAGATCTGCAGTTGATGATTTACCTTCCTTTGAGAATGAATTAGATTTGCC
TGAAATCAATAATTTTGATGATGATATTGATTTTCCTGACATTGAGGATGAGCATGAAATGCATAAGAAAGATTGCTTGATAGATAATTTTGAGTCTGATCATGATTACA
CTGAATCTATTGAGTCTGATCTTGACATTCCTGAATGCATGAACCCTGATAATGTTAATACCTTTGATTCGTGTCCTGATGATGTATATAGCATAGAATCTGACCCAGAG
GAACTCGAATCTGAATCCCCTGAGCATGTAGATATTTTTTCTGATGAATGGACTGGCATGATTGATAGGCCATCTTTAGATCCTAGACCAGTTGATATCATAACGCTTGA
TGACTTTGTTAACAACTGTTTTGTGAATAGAGATTTAGAAAAGAGATTTGATGCATCTGATTATTTTCTGAAATTTCAAACTAGTTTAAGATTTCTTCAAGAGTATGCGG
TGCTTGCTTATTCCACTTGCAGGAGCAAAGTTGGTGATTTTGGGAGAAAAAGTGCCAAAAATGAAGCAAATGAAGCTCAAATCCAAGCTTGGAGCAAAAGGGAATCCAAT
GAGGCCAATCATGATGGAAAAAGCCCAAAAAGAAGTGAAGAGAAGGCCCAAAAATTAGCCCAAACCGAAGCTGGAAATTATGGAAATTGGGTCAGCGACGAGACGCTGTG
TGATAACTGCCCAAAAGATTATGCTGCTGAGCGACTGGAGGGAGCAGATTCTATGCTGCAGCAAAACTGGGAACAAAAACGGCCACATCACAGCTCGTTAGCCAACTTCA
TGAACCGACTTCTCGATCCGCCTGGGGTACGGTTTGAGCTTGATCCAGAAATTGGAAGGACGTTTAGGAACAGAAGAAGGGAGCAGCGCAGAAACCAAATGGAGAACGCC
CCGCAACTTCCGCAGGTTTCTGAAGGTCCAGCAGAAGCAAACCCCCAGCAGAATCCGTTGCTGCAGCAAAACCCACTAATAACAGGTTCAGCAGATTGTTGCGGCAAAGA
TATTGCTGGAGCAAATTATTCCGAGATAGAAGGGTTTCTGTTGGAATTTATTATTGTTACCGTTAGACTTAGCTTTAAATTCTCGCAGCCCTATTTATTCGCAATCTCTG
GTAAGTACATTTCACTTCGTTTTCTTGATCAAATTCTTATGCGTGTGTTATTCTTTCTTTCTTTGCAAAACCCTATTATACCTTCCATGGCTAAAACAAGAGCGCGAAAA
GAGAGGGAGAGTGAGGAGGAAGAGATACCCGTCACACTGGAAGTTCAAAAAGGAAAATCTAAGAAGAAAAGGACGCCAGAAGAGAAGGAGGCGAAGCGAAGACGGAGGCA
GCAGAGGGCTGCGGAACAAGAAGTCATCCAAGAGACAGCGGTTGTGCAGGATCTTGAGACAGAACCGATAGTTTCTGCTACGGTTCAGGAAGGGAATGCTGAGAAGGAAC
ATGAGACAGAGGTCGGAGGACAAGTCGCAGGTGTGCCTGAAAAAGAGAAAACACCGGAGCCGGTGCAGGAGGCCCATGTTGAAATTATTATGCCTGAACCACCCAAGTGC
CGCCGCATCAAGAGGAAGGCGGGTCGCGTGAGGGTGGTTCGGAACACTCCATCACCTCCGACGTCGGACTCTGAGGAAGAAAGAAGGGAAGCTGAGAATAAGGAAAAAGA
AGAAGAAGCAAGAAAGGAAGAGGAAGAGCGTTTGCGTGTCCGGAGAGAAAGCAGAGGCAAAGGAATTGCCGAAGCATCAGGAGAAATTGAGGAGCCGAGGGCGCCATTCA
TTCGCTTCGTCAACGATCTTGCTCGAGCAAAACACCAGGAGGTACTGAAACGGGACTTCTTGTTCGAACGTGGATTTGGCAGTGATTTGCCTAGGTTCTTTGAGTCTGGA
ATAGTGAACCTTGGATGGAGGCAATTTTGTGCGAAACCAGAACATGTCAATGCCAACATTGTTCGAGAATTTTATGCCAATCTTGACGTTAAGAATGATTTTGAGGTTAT
CGTTCGAGGAGTGCCTGTACAGTGGAGTCCTGAGGCCATTAATGAACTGTTCGATCTCCAGGATTTTCCGCATGCCGTTTTTAATGAGATGGTGGCTGCACCATCTAGTG
ATCAACTGAGTGCGGCTGTCCGGGAGAGTGAAGCCAACACTTGGATGGGTTTCATCAGGCTACGCTTGCTGCCGACAACACACGACTCCACGGTATCTCGGGACAGGGTA
TTACTTGCCTTTGCAATTCTTCGCTCGATGAGTATTGATGTAGGAAAAATTATTTCTTCTGAGATTGTTGATTGCTGGAAAAAGAAGGTGGGGAAGCTGTTCTTTCCGAA
CACGATTACGATGTTATGCAGCAGGGCAGGAGTGTCCACGGTTCCAGAAGATATGATCATGACTGATAAGGGAATCATTGACACACCTAATCTGGCGCGACTTCAGCGTA
TGCAAGAGGTTCGCCAAGGTGGGCTTGTGTATGGCGTTAATCAGATCCTAGAGCAACTGGCTGTGTTGACCAGTAGGTTAGAGTTTGCTGAAAGGCAAGCTCAGACCTAC
TGGACTTATGCTAAAAGGAGAGATGATGCGCTCAGGGGGGCCTTGCAAACCAATTTCTCAACACCGTATCCGGCCTTTCCAGTGTTTCCCGATGATTTGTTTAATCTTTG
GATACCACCCCCGCCTGTTGAACGAGAAGAGAATGATGATGAAGACCAGGTTGGTGATGAGCTTGAGGCAAGGGTATACTGCACCATAAAGTGGGTCATCCCATGCTTAA
GAGCTTATGACTCAGAGCTTGGTTTTGCAGAGTGCTCAGATTATGCTGCTGAGCGACTGGAGGGAGCAGATTCTATGCTGCAGCAAAACTGGGAACAAAAACTGCCACAT
CACAGCTCGTTAGCCAACTTCATGAACCGACTTCTTAATAAATTGACTATTGCATTTTGGTTAATTGTTGTGCTTATTCCTATTTTATGCCCGCCTGGCCAGCTGGAGCC
CAACACTGTGCAATGTAGCGTTGCGACGCTACCCATTTCTTGGGCAACAAGACACGTGCATAGCAGCGTCGCGACACTGTGCAATGTAGCGTCGCGACGCTACCTCCAAT
CCGGGCCTATAAAAGGCACCTCTTGGTGCCTCATTTTAGCATCAATCACTCCATTCTTTCCTTCCTTTCCTCCCTCCTTTGGCTCCTTTGGAGCCTCTCTCAAGGCTTTC
TAG
Protein sequenceShow/hide protein sequence
MPRKRHVTATSNVGDFGRKSAKKEANEAQIQAWSKRESNEANHDGKSPKRSEEKAQKLAQTEAGNSGNWVSVETEGLEFHNSVKRKRDWDRFCEHHHYDYTGMTLRLLEA
RDLWEPLGEFSENLPACENERSAVDDLPSFENELDLPEINNFDDDIDFPDIEDEHEMHKKDCLIDNFESDHDYTESIESDLDIPECMNPDNVNTFDSCPDDVYSIESDPE
ELESESPEHVDIFSDEWTGMIDRPSLDPRPVDIITLDDFVNNCFVNRDLEKRFDASDYFLKFQTSLRFLQEYAVLAYSTCRSKVGDFGRKSAKNEANEAQIQAWSKRESN
EANHDGKSPKRSEEKAQKLAQTEAGNYGNWVSDETLCDNCPKDYAAERLEGADSMLQQNWEQKRPHHSSLANFMNRLLDPPGVRFELDPEIGRTFRNRRREQRRNQMENA
PQLPQVSEGPAEANPQQNPLLQQNPLITGSADCCGKDIAGANYSEIEGFLLEFIIVTVRLSFKFSQPYLFAISGKYISLRFLDQILMRVLFFLSLQNPIIPSMAKTRARK
ERESEEEEIPVTLEVQKGKSKKKRTPEEKEAKRRRRQQRAAEQEVIQETAVVQDLETEPIVSATVQEGNAEKEHETEVGGQVAGVPEKEKTPEPVQEAHVEIIMPEPPKC
RRIKRKAGRVRVVRNTPSPPTSDSEEERREAENKEKEEEARKEEEERLRVRRESRGKGIAEASGEIEEPRAPFIRFVNDLARAKHQEVLKRDFLFERGFGSDLPRFFESG
IVNLGWRQFCAKPEHVNANIVREFYANLDVKNDFEVIVRGVPVQWSPEAINELFDLQDFPHAVFNEMVAAPSSDQLSAAVRESEANTWMGFIRLRLLPTTHDSTVSRDRV
LLAFAILRSMSIDVGKIISSEIVDCWKKKVGKLFFPNTITMLCSRAGVSTVPEDMIMTDKGIIDTPNLARLQRMQEVRQGGLVYGVNQILEQLAVLTSRLEFAERQAQTY
WTYAKRRDDALRGALQTNFSTPYPAFPVFPDDLFNLWIPPPPVEREENDDEDQVGDELEARVYCTIKWVIPCLRAYDSELGFAECSDYAAERLEGADSMLQQNWEQKLPH
HSSLANFMNRLLNKLTIAFWLIVVLIPILCPPGQLEPNTVQCSVATLPISWATRHVHSSVATLCNVASRRYLQSGPIKGTSWCLILASITPFFPSFPPSFGSFGASLKAF