; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg020733 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg020733
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUnknown protein
Genome locationscaffold10:25726025..25729528
RNA-Seq ExpressionSpg020733
SyntenySpg020733
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8661093.1 hypothetical protein F3Y22_tig00116939pilonHSYRG00213 [Hibiscus syriacus]6.3e-3133.46Show/hide
Query:  RFVNNLALAKYQEMLKRDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFDLQ
        +F N+ A A++Q    R   FE GF       G   P  +   +  L W +F   P  VN++ V+EFYAN+    ++ + VRG  + ++  A+N  F LQ
Subjt:  RFVNNLALAKYQEMLKRDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFDLQ

Query:  DF--PHAVFNEMVVAPSNDQLSTAVREVGIEGAQWRLSQTRKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE
        D    HA F E      +++    + ++  E  +W   QT + +     L+  A  W  F+K +L+PT+H++TVS  R+LL  +I+ S  IDVG II  +
Subjt:  DF--PHAVFNEMVVAPSNDQLSTAVREVGIEGAQWRLSQTRKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE

Query:  ILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQRMQEVRQGGLVH
        + DC  KK   L FPN IT LCR+  V E+  D ILP    I+   L  L  ++  +    VH
Subjt:  ILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQRMQEVRQGGLVH

KAE8718449.1 hypothetical protein F3Y22_tig00110013pilonHSYRG00240 [Hibiscus syriacus]3.1e-3032.7Show/hide
Query:  RFVNNLALAKYQEMLKRDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFDLQ
        +F N+ A A++Q    R+  FE GF       G   P  +   +  L W +F   P  VN++ V+EFYAN+    +  + VRG  + ++  A+N  F LQ
Subjt:  RFVNNLALAKYQEMLKRDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFDLQ

Query:  DF--PHAVFNEMVVAPSNDQLSTAVREVGIEGAQWRLSQTRKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE
        +    HA+F E      +++    + ++  E  +W   QT + +     L+  A  W  F+K +L+PT+H++TVS  R+LL  +++ S  IDVG+II  +
Subjt:  DF--PHAVFNEMVVAPSNDQLSTAVREVGIEGAQWRLSQTRKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE

Query:  ILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQRMQEVRQGGLVH
        + DC  KK   L FPN IT LCR+  V E+  D ILP    I    L  L  ++  +    VH
Subjt:  ILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQRMQEVRQGGLVH

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]1.1e-3539.6Show/hide
Query:  IRFVNNLALAKYQEMLK-RDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFD
        ++F    A  +Y+  ++ R    E+GF         +LP F+   I    W QFCA PE      VREFYANL D  E  V VRGV V WS EA+N +F 
Subjt:  IRFVNNLALAKYQEMLK-RDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFD

Query:  LQDFPHAVFNEMVVAPSNDQLSTAVREVGIEGAQWRLSQTRKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE
        L D P    +E +   +   L T +  V + GA+W +S     T   + L   A  W  F+K  LLPTTH  TVS+DR+LL  ++L   SI+VG++I SE
Subjt:  LQDFPHAVFNEMVVAPSNDQLSTAVREVGIEGAQWRLSQTRKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE

Query:  ILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL
        I  C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+
Subjt:  ILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]7.6e-3740.4Show/hide
Query:  IRFVNNLALAKYQEMLK-RDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFD
        ++F    A  +Y+  ++ R    E+GF         +LP F+   I    W QFCA PE      VREFYANL D EE  V VRGV V WS EA+N +F 
Subjt:  IRFVNNLALAKYQEMLK-RDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFD

Query:  LQDFPHAVFNEMVVAPSNDQLSTAVREVGIEGAQWRLSQTRKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE
        L D P    +E +   +   L T +  V   GA+W +S     T   + L   A  W  F+K RLLPTTH  TVS+DR+LL  ++L   SI+VG++I SE
Subjt:  LQDFPHAVFNEMVVAPSNDQLSTAVREVGIEGAQWRLSQTRKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE

Query:  ILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL
        I  C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+
Subjt:  ILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]4.1e-3035.68Show/hide
Query:  IRFVNNLALAKYQE-------MLKRDFLFERGFGNELPRFLRTGIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFDL
        ++F +  A  +Y+E        ++++F+++     E P F+   I    W  FCA PE      VREFY N+ + ++  V +RGV V  S EA+N +F L
Subjt:  IRFVNNLALAKYQE-------MLKRDFLFERGFGNELPRFLRTGIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFDL

Query:  QDFPHAVFNEMVVAPSNDQLSTAVREVGIEGAQWRLSQTRKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI
         D P    +E V   +  +L   +  V I GA+W +S     T   + L   A  W  F+K RLLPTTH  TVS++ V L +++L   SI+VG++I  EI
Subjt:  QDFPHAVFNEMVVAPSNDQLSTAVREVGIEGAQWRLSQTRKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI

Query:  LDCWRKKVGKLFFPNTITMLCRRAGVP
          C  +K G LFFP+ IT +CR    P
Subjt:  LDCWRKKVGKLFFPNTITMLCRRAGVP

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)5.3e-3639.6Show/hide
Query:  IRFVNNLALAKYQEMLK-RDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFD
        ++F    A  +Y+  ++ R    E+GF         +LP F+   I    W QFCA PE      VREFYANL D  E  V VRGV V WS EA+N +F 
Subjt:  IRFVNNLALAKYQEMLK-RDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFD

Query:  LQDFPHAVFNEMVVAPSNDQLSTAVREVGIEGAQWRLSQTRKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE
        L D P    +E +   +   L T +  V + GA+W +S     T   + L   A  W  F+K  LLPTTH  TVS+DR+LL  ++L   SI+VG++I SE
Subjt:  LQDFPHAVFNEMVVAPSNDQLSTAVREVGIEGAQWRLSQTRKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE

Query:  ILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL
        I  C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+
Subjt:  ILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL

A0A2P5BCG4 Uncharacterized protein (Fragment)3.7e-3740.4Show/hide
Query:  IRFVNNLALAKYQEMLK-RDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFD
        ++F    A  +Y+  ++ R    E+GF         +LP F+   I    W QFCA PE      VREFYANL D EE  V VRGV V WS EA+N +F 
Subjt:  IRFVNNLALAKYQEMLK-RDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFD

Query:  LQDFPHAVFNEMVVAPSNDQLSTAVREVGIEGAQWRLSQTRKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE
        L D P    +E +   +   L T +  V   GA+W +S     T   + L   A  W  F+K RLLPTTH  TVS+DR+LL  ++L   SI+VG++I SE
Subjt:  LQDFPHAVFNEMVVAPSNDQLSTAVREVGIEGAQWRLSQTRKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE

Query:  ILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL
        I  C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+
Subjt:  ILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL

A0A2P5DAQ2 Uncharacterized protein2.0e-3035.68Show/hide
Query:  IRFVNNLALAKYQE-------MLKRDFLFERGFGNELPRFLRTGIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFDL
        ++F +  A  +Y+E        ++++F+++     E P F+   I    W  FCA PE      VREFY N+ + ++  V +RGV V  S EA+N +F L
Subjt:  IRFVNNLALAKYQE-------MLKRDFLFERGFGNELPRFLRTGIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFDL

Query:  QDFPHAVFNEMVVAPSNDQLSTAVREVGIEGAQWRLSQTRKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI
         D P    +E V   +  +L   +  V I GA+W +S     T   + L   A  W  F+K RLLPTTH  TVS++ V L +++L   SI+VG++I  EI
Subjt:  QDFPHAVFNEMVVAPSNDQLSTAVREVGIEGAQWRLSQTRKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI

Query:  LDCWRKKVGKLFFPNTITMLCRRAGVP
          C  +K G LFFP+ IT +CR    P
Subjt:  LDCWRKKVGKLFFPNTITMLCRRAGVP

A0A6A2WM54 Uncharacterized protein3.0e-3133.46Show/hide
Query:  RFVNNLALAKYQEMLKRDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFDLQ
        +F N+ A A++Q    R   FE GF       G   P  +   +  L W +F   P  VN++ V+EFYAN+    ++ + VRG  + ++  A+N  F LQ
Subjt:  RFVNNLALAKYQEMLKRDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFDLQ

Query:  DF--PHAVFNEMVVAPSNDQLSTAVREVGIEGAQWRLSQTRKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE
        D    HA F E      +++    + ++  E  +W   QT + +     L+  A  W  F+K +L+PT+H++TVS  R+LL  +I+ S  IDVG II  +
Subjt:  DF--PHAVFNEMVVAPSNDQLSTAVREVGIEGAQWRLSQTRKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE

Query:  ILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQRMQEVRQGGLVH
        + DC  KK   L FPN IT LCR+  V E+  D ILP    I+   L  L  ++  +    VH
Subjt:  ILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQRMQEVRQGGLVH

A0A6A3BU96 Uncharacterized protein1.5e-3032.7Show/hide
Query:  RFVNNLALAKYQEMLKRDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFDLQ
        +F N+ A A++Q    R+  FE GF       G   P  +   +  L W +F   P  VN++ V+EFYAN+    +  + VRG  + ++  A+N  F LQ
Subjt:  RFVNNLALAKYQEMLKRDFLFERGF-------GNELPRFLRTGIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFDLQ

Query:  DF--PHAVFNEMVVAPSNDQLSTAVREVGIEGAQWRLSQTRKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE
        +    HA+F E      +++    + ++  E  +W   QT + +     L+  A  W  F+K +L+PT+H++TVS  R+LL  +++ S  IDVG+II  +
Subjt:  DF--PHAVFNEMVVAPSNDQLSTAVREVGIEGAQWRLSQTRKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSE

Query:  ILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQRMQEVRQGGLVH
        + DC  KK   L FPN IT LCR+  V E+  D ILP    I    L  L  ++  +    VH
Subjt:  ILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQRMQEVRQGGLVH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGGAGCAGAATCATCCGAATTAGGAAGGGTAAATTTGCAAGCGTCATCTGATGAAGCCACGTGTCACGCAAAAGCTAAAACCAAGAAGAAGAAAACGCCAGAAGA
GAAAGAAGCTAAACGGAGAAGAAGGCAGCAGAGGGTTGCGGAGCAAGAGGCTATCCAAGAAGAACCAGTGAATGACCCAGATACGGAAGGAATTCAGAATCCTGAGGTAG
AACCGATAGTTCAAGATTCGGTGCAAGAGGAGAATGTTGAGAAGAATCAAGAAACACAAGCTGAAGAAGTTCGAGACGAACAGGCCGCGGTTGTGCCTGAGGAAGGGGAT
GAACAGGAAACGGTGCAGGAGGCTCATGTTGAGGTCATAATGCCTGAACCACCAAAGAGCCGCCGCATCAAGCGGAAGGCTGGGCGCGTTCAGGTGATTCGGACTGATAC
CCCATCACCACCATCGTCGGATTCTGAGAAAGAGAAGGCAGAGCGAGAGGAAAGAGAGAAAAAAGAAGCTGAGGAAAAAGTGCGAGAAGAAGCAAAGAAGGCTGAGGAAG
AGATTTTGCGCAAGCGAAGAGAAGACAAGGGCAAAGGTATTGCCGAGGCATTAGGTGCGGCTGACGAGGTTGAGGCACGAGGGTTACCTTTTATTCGCTTCGTCAACAAT
CTTGCTCTAGCAAAATACCAGGAGATGCTGAAACGGGACTTTCTGTTCGAGCGAGGATTTGGCAATGAGTTGCCACGGTTCTTGAGGACTGGAATAGAAAACCTCGGCTG
GAGCCAATTTTGTGCGAAACCAGAGCCTGTGAATTCCAACTTTGTTCGGGAATTTTACGCAAATCTTGACGATAAGGAAGAATTTCAGGTTATAGTTCGAGGAGTCCCAG
TGGATTGGAGCCCAGAAGCTGTTAATGAATTGTTTGATCTCCAGGATTTTCCGCATGCAGTCTTCAATGAGATGGTGGTTGCCCCATCTAACGATCAGTTAAGTACGGCT
GTCCGAGAGGTTGGCATTGAGGGGGCCCAATGGAGGTTGTCGCAGACGCGGAAGCGCACATTTCAGGCAGCTTATTTGAAAAGCGAGGCCAACACCTGGATGGGTTTTAT
TAAGTTGCGCTTACTACCAACTACGCATGACTCCACAGTATCTCGGGACAGGGTATTGCTTGCCTTTGCTATTCTTCGCTCAATGAGTATTGATGTAGGAAAAATAATTT
CGTCTGAGATTCTTGACTGCTGGCGGAAAAAGGTGGGGAAGTTGTTTTTCCCCAACACTATCACGATGCTATGCCGAAGGGCAGGGGTGCCAGAGAGTGAGGATGATATG
ATATTACCAGATAAGGGAATAATTGATACGCCTAACTTGGCACGGCTCCAGCGTATGCAGGAGGTACGTCAGGGTGGGCTGGTCCACGGCATCAACACGATTTTAGAACA
ACTCGCACTTTCGGCCAGCAGGCAGGACTTGGACTGGTTAAGCTTAATTAGATCAAGCCGAATTGGTGATGAGTTTGAGGCATGGGTATACTGCACCATAAAGTGGGTCA
TCCCGTGCTTAAGAGCTTATGACTGTAGGGCTGCTTTAAGTCTGAAAAACAAAAATTTAAACCCCTTGAAAATGTGTTTTGATATGTCTGATAATAGAGCTAAGCTGTGG
CAAGTTTTTAGAATTGAGTTAAAAGTGGTGATTATTTGTCCATGCCGGAGGAATTATTTTGCTGCAGCAGAGTGCTCAGAATCTGTTGCTGGGCGACTTGAGGGAGCGAA
TTTTATGCTGGAGCAAACCCGAAAGCAGAACTGCCACGTCACAGCTCGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTGGAGCAGAATCATCCGAATTAGGAAGGGTAAATTTGCAAGCGTCATCTGATGAAGCCACGTGTCACGCAAAAGCTAAAACCAAGAAGAAGAAAACGCCAGAAGA
GAAAGAAGCTAAACGGAGAAGAAGGCAGCAGAGGGTTGCGGAGCAAGAGGCTATCCAAGAAGAACCAGTGAATGACCCAGATACGGAAGGAATTCAGAATCCTGAGGTAG
AACCGATAGTTCAAGATTCGGTGCAAGAGGAGAATGTTGAGAAGAATCAAGAAACACAAGCTGAAGAAGTTCGAGACGAACAGGCCGCGGTTGTGCCTGAGGAAGGGGAT
GAACAGGAAACGGTGCAGGAGGCTCATGTTGAGGTCATAATGCCTGAACCACCAAAGAGCCGCCGCATCAAGCGGAAGGCTGGGCGCGTTCAGGTGATTCGGACTGATAC
CCCATCACCACCATCGTCGGATTCTGAGAAAGAGAAGGCAGAGCGAGAGGAAAGAGAGAAAAAAGAAGCTGAGGAAAAAGTGCGAGAAGAAGCAAAGAAGGCTGAGGAAG
AGATTTTGCGCAAGCGAAGAGAAGACAAGGGCAAAGGTATTGCCGAGGCATTAGGTGCGGCTGACGAGGTTGAGGCACGAGGGTTACCTTTTATTCGCTTCGTCAACAAT
CTTGCTCTAGCAAAATACCAGGAGATGCTGAAACGGGACTTTCTGTTCGAGCGAGGATTTGGCAATGAGTTGCCACGGTTCTTGAGGACTGGAATAGAAAACCTCGGCTG
GAGCCAATTTTGTGCGAAACCAGAGCCTGTGAATTCCAACTTTGTTCGGGAATTTTACGCAAATCTTGACGATAAGGAAGAATTTCAGGTTATAGTTCGAGGAGTCCCAG
TGGATTGGAGCCCAGAAGCTGTTAATGAATTGTTTGATCTCCAGGATTTTCCGCATGCAGTCTTCAATGAGATGGTGGTTGCCCCATCTAACGATCAGTTAAGTACGGCT
GTCCGAGAGGTTGGCATTGAGGGGGCCCAATGGAGGTTGTCGCAGACGCGGAAGCGCACATTTCAGGCAGCTTATTTGAAAAGCGAGGCCAACACCTGGATGGGTTTTAT
TAAGTTGCGCTTACTACCAACTACGCATGACTCCACAGTATCTCGGGACAGGGTATTGCTTGCCTTTGCTATTCTTCGCTCAATGAGTATTGATGTAGGAAAAATAATTT
CGTCTGAGATTCTTGACTGCTGGCGGAAAAAGGTGGGGAAGTTGTTTTTCCCCAACACTATCACGATGCTATGCCGAAGGGCAGGGGTGCCAGAGAGTGAGGATGATATG
ATATTACCAGATAAGGGAATAATTGATACGCCTAACTTGGCACGGCTCCAGCGTATGCAGGAGGTACGTCAGGGTGGGCTGGTCCACGGCATCAACACGATTTTAGAACA
ACTCGCACTTTCGGCCAGCAGGCAGGACTTGGACTGGTTAAGCTTAATTAGATCAAGCCGAATTGGTGATGAGTTTGAGGCATGGGTATACTGCACCATAAAGTGGGTCA
TCCCGTGCTTAAGAGCTTATGACTGTAGGGCTGCTTTAAGTCTGAAAAACAAAAATTTAAACCCCTTGAAAATGTGTTTTGATATGTCTGATAATAGAGCTAAGCTGTGG
CAAGTTTTTAGAATTGAGTTAAAAGTGGTGATTATTTGTCCATGCCGGAGGAATTATTTTGCTGCAGCAGAGTGCTCAGAATCTGTTGCTGGGCGACTTGAGGGAGCGAA
TTTTATGCTGGAGCAAACCCGAAAGCAGAACTGCCACGTCACAGCTCGTTAG
Protein sequenceShow/hide protein sequence
MAGAESSELGRVNLQASSDEATCHAKAKTKKKKTPEEKEAKRRRRQQRVAEQEAIQEEPVNDPDTEGIQNPEVEPIVQDSVQEENVEKNQETQAEEVRDEQAAVVPEEGD
EQETVQEAHVEVIMPEPPKSRRIKRKAGRVQVIRTDTPSPPSSDSEKEKAEREEREKKEAEEKVREEAKKAEEEILRKRREDKGKGIAEALGAADEVEARGLPFIRFVNN
LALAKYQEMLKRDFLFERGFGNELPRFLRTGIENLGWSQFCAKPEPVNSNFVREFYANLDDKEEFQVIVRGVPVDWSPEAVNELFDLQDFPHAVFNEMVVAPSNDQLSTA
VREVGIEGAQWRLSQTRKRTFQAAYLKSEANTWMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDM
ILPDKGIIDTPNLARLQRMQEVRQGGLVHGINTILEQLALSASRQDLDWLSLIRSSRIGDEFEAWVYCTIKWVIPCLRAYDCRAALSLKNKNLNPLKMCFDMSDNRAKLW
QVFRIELKVVIICPCRRNYFAAAECSESVAGRLEGANFMLEQTRKQNCHVTAR