; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg015795 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg015795
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUnknown protein
Genome locationscaffold6:40715290..40717020
RNA-Seq ExpressionSpg015795
SyntenySpg015795
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB53755.1 hypothetical protein L484_022412 [Morus notabilis]4.0e-2134.38Show/hide
Query:  FLRIGIADHGWELFCAKPESVNAQVVREFYANI---DKEDGF-QN--FPHAA---------------YNEMVVAPSNEQLSDAVREVGIEGAQWQLSKTG
        F+   I  HGW  FC  P +    +VREFYAN+   ++E  F QN   P  A               Y +     ++EQL   + EV IEGA WQ+S  G
Subjt:  FLRIGIADHGWELFCAKPESVNAQVVREFYANI---DKEDGF-QN--FPHAA---------------YNEMVVAPSNEQLSDAVREVGIEGAQWQLSKTG

Query:  KRTFQSAYLKREANTWMGFIRQRMLPTTHDLMVSRERVLLVFAILRSLSIDVGKMIVNEISGC-WKKKVGKLFFPNTITMLCKRAGVP--EDEGDEVRQG
          T     LKR A  W  F+  R +P+TH   V+++RVLL+++IL  +S+++ ++ + EI  C   +K G L+FP+ IT L  +A VP  +DE      G
Subjt:  KRTFQSAYLKREANTWMGFIRQRMLPTTHDLMVSRERVLLVFAILRSLSIDVGKMIVNEISGC-WKKKVGKLFFPNTITMLCKRAGVP--EDEGDEVRQG

Query:  GL-VYGINTILEQLALSASRQEFA
         +    I+ I +  A++A++ E A
Subjt:  GL-VYGINTILEQLALSASRQEFA

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]1.8e-2131.74Show/hide
Query:  EIEESQLPYDRFVNNFARAKYAELLKRDFLFERGFSGDLLHFLRIGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQNF-----------------
        E E ++  Y+  + N  R   AE   + F+ +   +   L F+   I  H W+ FCA PE     +VREFYAN+   D  +N                  
Subjt:  EIEESQLPYDRFVNNFARAKYAELLKRDFLFERGFSGDLLHFLRIGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQNF-----------------

Query:  ------PHAAYNEMVVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDLMVSRERVLLVFAILRSLSIDVGKMIV
              P   ++E +   +   L   +  V + GA+W +S  G  T   + L   A  W  F++  +LPTTH   VS++R+LL+ ++L   SI+VG+MI 
Subjt:  ------PHAAYNEMVVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDLMVSRERVLLVFAILRSLSIDVGKMIV

Query:  NEISGCWKKKVGKLFFPNTITMLCKRAGVP
        +EI  C  +K G LFFP+ IT LC+ A  P
Subjt:  NEISGCWKKKVGKLFFPNTITMLCKRAGVP

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]1.3e-2729.08Show/hide
Query:  ESQLPYDRFVNNFA-RAKYAELLKRDFLFERGFSGDLLHFLRIGIADHGWELFCAKPESVNAQVVREFYANI-DKED------GFQ--------------
        E++    R+ NN   R   AE   + F+ +   +   L F+   I  H W+ FCA PE     +VREFYAN+ D E+      G Q              
Subjt:  ESQLPYDRFVNNFA-RAKYAELLKRDFLFERGFSGDLLHFLRIGIADHGWELFCAKPESVNAQVVREFYANI-DKED------GFQ--------------

Query:  NFPHAAYNEMVVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDLMVSRERVLLVFAILRSLSIDVGKMIVNEIS
          P   ++E +   + + L   +  V   GA+W +S  G  T   + L   A  W  F++ R+LPTTH   VS++R+LL+ ++L   SI+VG+MI +EI 
Subjt:  NFPHAAYNEMVVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDLMVSRERVLLVFAILRSLSIDVGKMIVNEIS

Query:  GCWKKKVGKLFFPNTITMLCKRAGVP------------------------EDEGDEVRQ---------------GGLVYGINTILEQLALSASRQ-----
         C  +K G LFFP+ IT LC+ A  P                        E   +  +Q               G ++  +  + ++L+    +Q     
Subjt:  GCWKKKVGKLFFPNTITMLCKRAGVP------------------------EDEGDEVRQ---------------GGLVYGINTILEQLALSASRQ-----

Query:  --EFAERQALTFLNYVKSRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEEQGQE
          +   +Q   F  Y K RD  LKKALQ NF++P P  PAFP+++L         E E E D++   E
Subjt:  --EFAERQALTFLNYVKSRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEEQGQE

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]6.1e-2231.56Show/hide
Query:  RFVNNFARAKYAE-------LLKRDFLFERGFSGDLLHFLRIGIADHGWELFCAKPESVNAQVVREFYANIDKED-------GFQ--------------N
        +F +  A  +Y E        ++++F+++     +   F+   I  H W+LFCA PE     +VREFY N+   D       G Q               
Subjt:  RFVNNFARAKYAE-------LLKRDFLFERGFSGDLLHFLRIGIADHGWELFCAKPESVNAQVVREFYANIDKED-------GFQ--------------N

Query:  FPHAAYNEMVVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDLMVSRERVLLVFAILRSLSIDVGKMIVNEISG
         P   ++E V   +  +L   +  V I GA+W +S  G  T   + L   A  W  F++ R+LPTTH   VS+E V L++++L   SI+VG+MI  EI  
Subjt:  FPHAAYNEMVVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDLMVSRERVLLVFAILRSLSIDVGKMIVNEISG

Query:  CWKKKVGKLFFPNTITMLCKRAGVP
        C  +K G LFFP+ IT +C+    P
Subjt:  CWKKKVGKLFFPNTITMLCKRAGVP

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]7.2e-2331.4Show/hide
Query:  VVREFYANI-DKED------GFQ--------------NFPHAAYNEMVVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRM
        +VREFYAN+ D E+      G Q                P   ++E +   +  +L   +  V   GA+W +S  G  T   + L   A  W  F++ R+
Subjt:  VVREFYANI-DKED------GFQ--------------NFPHAAYNEMVVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRM

Query:  LPTTHDLMVSRERVLLVFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCKRAGVPEDEGDEVRQGGLVYGI---------------------
        LPTTH  +VS++R+LL+ ++L   SI+VG+MI +EI  C  +K G LFFP+ IT LC+ A    +E +++   G +  I                     
Subjt:  LPTTHDLMVSRERVLLVFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCKRAGVPEDEGDEVRQGGLVYGI---------------------

Query:  ----------NTILEQLAL---SASRQEFAERQALTFLNYVKSRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEEQGQE
                    +L+QL       S+QE   +Q   F  Y K RD  LKKALQ NF++P P  PAFP+++L         E E E D++   E
Subjt:  ----------NTILEQLAL---SASRQEFAERQALTFLNYVKSRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEEQGQE

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)8.6e-2231.74Show/hide
Query:  EIEESQLPYDRFVNNFARAKYAELLKRDFLFERGFSGDLLHFLRIGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQNF-----------------
        E E ++  Y+  + N  R   AE   + F+ +   +   L F+   I  H W+ FCA PE     +VREFYAN+   D  +N                  
Subjt:  EIEESQLPYDRFVNNFARAKYAELLKRDFLFERGFSGDLLHFLRIGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQNF-----------------

Query:  ------PHAAYNEMVVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDLMVSRERVLLVFAILRSLSIDVGKMIV
              P   ++E +   +   L   +  V + GA+W +S  G  T   + L   A  W  F++  +LPTTH   VS++R+LL+ ++L   SI+VG+MI 
Subjt:  ------PHAAYNEMVVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDLMVSRERVLLVFAILRSLSIDVGKMIV

Query:  NEISGCWKKKVGKLFFPNTITMLCKRAGVP
        +EI  C  +K G LFFP+ IT LC+ A  P
Subjt:  NEISGCWKKKVGKLFFPNTITMLCKRAGVP

A0A2P5BCG4 Uncharacterized protein (Fragment)6.2e-2829.08Show/hide
Query:  ESQLPYDRFVNNFA-RAKYAELLKRDFLFERGFSGDLLHFLRIGIADHGWELFCAKPESVNAQVVREFYANI-DKED------GFQ--------------
        E++    R+ NN   R   AE   + F+ +   +   L F+   I  H W+ FCA PE     +VREFYAN+ D E+      G Q              
Subjt:  ESQLPYDRFVNNFA-RAKYAELLKRDFLFERGFSGDLLHFLRIGIADHGWELFCAKPESVNAQVVREFYANI-DKED------GFQ--------------

Query:  NFPHAAYNEMVVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDLMVSRERVLLVFAILRSLSIDVGKMIVNEIS
          P   ++E +   + + L   +  V   GA+W +S  G  T   + L   A  W  F++ R+LPTTH   VS++R+LL+ ++L   SI+VG+MI +EI 
Subjt:  NFPHAAYNEMVVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDLMVSRERVLLVFAILRSLSIDVGKMIVNEIS

Query:  GCWKKKVGKLFFPNTITMLCKRAGVP------------------------EDEGDEVRQ---------------GGLVYGINTILEQLALSASRQ-----
         C  +K G LFFP+ IT LC+ A  P                        E   +  +Q               G ++  +  + ++L+    +Q     
Subjt:  GCWKKKVGKLFFPNTITMLCKRAGVP------------------------EDEGDEVRQ---------------GGLVYGINTILEQLALSASRQ-----

Query:  --EFAERQALTFLNYVKSRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEEQGQE
          +   +Q   F  Y K RD  LKKALQ NF++P P  PAFP+++L         E E E D++   E
Subjt:  --EFAERQALTFLNYVKSRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEEQGQE

A0A2P5DAQ2 Uncharacterized protein3.0e-2231.56Show/hide
Query:  RFVNNFARAKYAE-------LLKRDFLFERGFSGDLLHFLRIGIADHGWELFCAKPESVNAQVVREFYANIDKED-------GFQ--------------N
        +F +  A  +Y E        ++++F+++     +   F+   I  H W+LFCA PE     +VREFY N+   D       G Q               
Subjt:  RFVNNFARAKYAE-------LLKRDFLFERGFSGDLLHFLRIGIADHGWELFCAKPESVNAQVVREFYANIDKED-------GFQ--------------N

Query:  FPHAAYNEMVVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDLMVSRERVLLVFAILRSLSIDVGKMIVNEISG
         P   ++E V   +  +L   +  V I GA+W +S  G  T   + L   A  W  F++ R+LPTTH   VS+E V L++++L   SI+VG+MI  EI  
Subjt:  FPHAAYNEMVVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDLMVSRERVLLVFAILRSLSIDVGKMIVNEISG

Query:  CWKKKVGKLFFPNTITMLCKRAGVP
        C  +K G LFFP+ IT +C+    P
Subjt:  CWKKKVGKLFFPNTITMLCKRAGVP

A0A2P5DXM3 Uncharacterized protein3.5e-2331.4Show/hide
Query:  VVREFYANI-DKED------GFQ--------------NFPHAAYNEMVVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRM
        +VREFYAN+ D E+      G Q                P   ++E +   +  +L   +  V   GA+W +S  G  T   + L   A  W  F++ R+
Subjt:  VVREFYANI-DKED------GFQ--------------NFPHAAYNEMVVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRM

Query:  LPTTHDLMVSRERVLLVFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCKRAGVPEDEGDEVRQGGLVYGI---------------------
        LPTTH  +VS++R+LL+ ++L   SI+VG+MI +EI  C  +K G LFFP+ IT LC+ A    +E +++   G +  I                     
Subjt:  LPTTHDLMVSRERVLLVFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCKRAGVPEDEGDEVRQGGLVYGI---------------------

Query:  ----------NTILEQLAL---SASRQEFAERQALTFLNYVKSRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEEQGQE
                    +L+QL       S+QE   +Q   F  Y K RD  LKKALQ NF++P P  PAFP+++L         E E E D++   E
Subjt:  ----------NTILEQLAL---SASRQEFAERQALTFLNYVKSRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEEQGQE

W9QTD9 Uncharacterized protein1.9e-2134.38Show/hide
Query:  FLRIGIADHGWELFCAKPESVNAQVVREFYANI---DKEDGF-QN--FPHAA---------------YNEMVVAPSNEQLSDAVREVGIEGAQWQLSKTG
        F+   I  HGW  FC  P +    +VREFYAN+   ++E  F QN   P  A               Y +     ++EQL   + EV IEGA WQ+S  G
Subjt:  FLRIGIADHGWELFCAKPESVNAQVVREFYANI---DKEDGF-QN--FPHAA---------------YNEMVVAPSNEQLSDAVREVGIEGAQWQLSKTG

Query:  KRTFQSAYLKREANTWMGFIRQRMLPTTHDLMVSRERVLLVFAILRSLSIDVGKMIVNEISGC-WKKKVGKLFFPNTITMLCKRAGVP--EDEGDEVRQG
          T     LKR A  W  F+  R +P+TH   V+++RVLL+++IL  +S+++ ++ + EI  C   +K G L+FP+ IT L  +A VP  +DE      G
Subjt:  KRTFQSAYLKREANTWMGFIRQRMLPTTHDLMVSRERVLLVFAILRSLSIDVGKMIVNEISGC-WKKKVGKLFFPNTITMLCKRAGVP--EDEGDEVRQG

Query:  GL-VYGINTILEQLALSASRQEFA
         +    I+ I +  A++A++ E A
Subjt:  GL-VYGINTILEQLALSASRQEFA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAAAACGAGAGCAAGAAAAGAGAGAGATCATGAGGAAGAAGAGGTACCCGTGACCCCCGAAGCACCGAAAGTTAAGACGAAAAAGAAGAAGACGCCAGAAGAAAA
AGAAGCTAAAAGAAGGAGAAGGCAGCAGAGGGCTGAGGATCAAGAAGTTATAGAGAAGGTGGTGGAAGATGTCGCTGCCACGATGGTTGAAGAGGATCCGAAAGAACAAG
AAGAACAAAACCCAAAGCAGACTGAGCCAGGTGTTGCGGATACAGAGGAAGTTCGAGAGGAAAATACAGAGGAAGTTCGAGAGGAAAATACCGAGGAAGTTCGAGAAATT
ACAGAGGAAGTTCAAGAAAAGCAGGTCGAGGATGTGCAAGAGCAACAGGCAGAAGATGTTCAGGTAACGAATAATGAGCCAGTGCAGGAGGCTCGAGTGGAGGTGATCAT
GCCAGAAGTACCGAAACGTCGCCGCATTAAGAGGAAAGCAGGCGGCGTTAGGGTTGTCCGAACTGATATTCCTTCGCCTCCAACCACGGATTCTGAAAGAGAAAATGCAG
AGAAAGAAGAGCGTGAGAAGAAGGAAGCCGAAGAAAAAGCAAGAGAAGAGGAAGAGAAAAAGGCTGAGGAAGAGCGATTGCTCAAGCGAAGGGCGGAAAAGGGCAAAAGT
GTTGCTGCAGCATCGGAGGAACCTGATGAAATAGAAGAGTCACAATTGCCGTATGATCGCTTCGTCAACAATTTTGCCAGAGCAAAATACGCTGAGCTGCTGAAAAGAGA
CTTCCTGTTTGAAAGAGGATTTAGCGGTGATCTTCTGCATTTTCTGAGGATCGGCATTGCAGACCATGGCTGGGAGTTGTTTTGTGCGAAGCCTGAGTCTGTAAACGCAC
AGGTGGTGCGTGAATTTTATGCTAATATTGACAAAGAAGATGGTTTCCAGAATTTCCCCCATGCAGCTTATAATGAGATGGTTGTAGCGCCATCTAATGAGCAGTTAAGT
GATGCTGTGCGGGAGGTGGGTATTGAAGGGGCACAGTGGCAGCTGTCCAAGACAGGGAAAAGGACGTTTCAGTCAGCTTATCTGAAGAGGGAAGCGAACACGTGGATGGG
ATTTATCAGACAGAGGATGCTTCCAACGACTCATGATTTGATGGTCTCGAGGGAACGGGTTCTTCTGGTTTTTGCTATTTTGCGGTCTCTCAGCATTGATGTAGGGAAGA
TGATTGTTAATGAGATTTCTGGTTGTTGGAAGAAGAAGGTGGGGAAGCTGTTTTTCCCGAATACCATTACCATGCTTTGCAAGAGGGCAGGGGTTCCAGAGGATGAAGGA
GATGAGGTACGTCAAGGAGGGCTTGTTTACGGCATCAACACGATTTTAGAACAACTCGCACTTTCGGCCAGCAGGCAAGAGTTTGCCGAGAGGCAAGCTTTAACCTTCTT
GAACTATGTTAAGAGTCGTGATGCCAATCTGAAGAAGGCACTGCAGGAGAATTTTTCCAAACCATATCCAGCCCTTCCAGCATTCCCTGAAGATTTATTGAACCCCTGGA
TCCCACCCCCACCTGTTGAAAGAGAAGAGGAGGATGATGAAGAGCAGGGTCAGGAAGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGAAAACGAGAGCAAGAAAAGAGAGAGATCATGAGGAAGAAGAGGTACCCGTGACCCCCGAAGCACCGAAAGTTAAGACGAAAAAGAAGAAGACGCCAGAAGAAAA
AGAAGCTAAAAGAAGGAGAAGGCAGCAGAGGGCTGAGGATCAAGAAGTTATAGAGAAGGTGGTGGAAGATGTCGCTGCCACGATGGTTGAAGAGGATCCGAAAGAACAAG
AAGAACAAAACCCAAAGCAGACTGAGCCAGGTGTTGCGGATACAGAGGAAGTTCGAGAGGAAAATACAGAGGAAGTTCGAGAGGAAAATACCGAGGAAGTTCGAGAAATT
ACAGAGGAAGTTCAAGAAAAGCAGGTCGAGGATGTGCAAGAGCAACAGGCAGAAGATGTTCAGGTAACGAATAATGAGCCAGTGCAGGAGGCTCGAGTGGAGGTGATCAT
GCCAGAAGTACCGAAACGTCGCCGCATTAAGAGGAAAGCAGGCGGCGTTAGGGTTGTCCGAACTGATATTCCTTCGCCTCCAACCACGGATTCTGAAAGAGAAAATGCAG
AGAAAGAAGAGCGTGAGAAGAAGGAAGCCGAAGAAAAAGCAAGAGAAGAGGAAGAGAAAAAGGCTGAGGAAGAGCGATTGCTCAAGCGAAGGGCGGAAAAGGGCAAAAGT
GTTGCTGCAGCATCGGAGGAACCTGATGAAATAGAAGAGTCACAATTGCCGTATGATCGCTTCGTCAACAATTTTGCCAGAGCAAAATACGCTGAGCTGCTGAAAAGAGA
CTTCCTGTTTGAAAGAGGATTTAGCGGTGATCTTCTGCATTTTCTGAGGATCGGCATTGCAGACCATGGCTGGGAGTTGTTTTGTGCGAAGCCTGAGTCTGTAAACGCAC
AGGTGGTGCGTGAATTTTATGCTAATATTGACAAAGAAGATGGTTTCCAGAATTTCCCCCATGCAGCTTATAATGAGATGGTTGTAGCGCCATCTAATGAGCAGTTAAGT
GATGCTGTGCGGGAGGTGGGTATTGAAGGGGCACAGTGGCAGCTGTCCAAGACAGGGAAAAGGACGTTTCAGTCAGCTTATCTGAAGAGGGAAGCGAACACGTGGATGGG
ATTTATCAGACAGAGGATGCTTCCAACGACTCATGATTTGATGGTCTCGAGGGAACGGGTTCTTCTGGTTTTTGCTATTTTGCGGTCTCTCAGCATTGATGTAGGGAAGA
TGATTGTTAATGAGATTTCTGGTTGTTGGAAGAAGAAGGTGGGGAAGCTGTTTTTCCCGAATACCATTACCATGCTTTGCAAGAGGGCAGGGGTTCCAGAGGATGAAGGA
GATGAGGTACGTCAAGGAGGGCTTGTTTACGGCATCAACACGATTTTAGAACAACTCGCACTTTCGGCCAGCAGGCAAGAGTTTGCCGAGAGGCAAGCTTTAACCTTCTT
GAACTATGTTAAGAGTCGTGATGCCAATCTGAAGAAGGCACTGCAGGAGAATTTTTCCAAACCATATCCAGCCCTTCCAGCATTCCCTGAAGATTTATTGAACCCCTGGA
TCCCACCCCCACCTGTTGAAAGAGAAGAGGAGGATGATGAAGAGCAGGGTCAGGAAGACTGA
Protein sequenceShow/hide protein sequence
MAKTRARKERDHEEEEVPVTPEAPKVKTKKKKTPEEKEAKRRRRQQRAEDQEVIEKVVEDVAATMVEEDPKEQEEQNPKQTEPGVADTEEVREENTEEVREENTEEVREI
TEEVQEKQVEDVQEQQAEDVQVTNNEPVQEARVEVIMPEVPKRRRIKRKAGGVRVVRTDIPSPPTTDSERENAEKEEREKKEAEEKAREEEEKKAEEERLLKRRAEKGKS
VAAASEEPDEIEESQLPYDRFVNNFARAKYAELLKRDFLFERGFSGDLLHFLRIGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQNFPHAAYNEMVVAPSNEQLS
DAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDLMVSRERVLLVFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCKRAGVPEDEG
DEVRQGGLVYGINTILEQLALSASRQEFAERQALTFLNYVKSRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEEQGQED