; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg010687 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg010687
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionAA_kinase domain-containing protein
Genome locationscaffold5:13508058..13509353
RNA-Seq ExpressionSpg010687
SyntenySpg010687
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8695166.1 hypothetical protein F3Y22_tig00110733pilonHSYRG00282 [Hibiscus syriacus]4.7e-2427.9Show/hide
Query:  FVNNSARAKYAELLKRDFLFERGF------SGDLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL---
        FV+ +A+  Y  +  R   FE GF      + +L   +   +  H W+ F   P  VNA +V+EFY+NI + +   V+VRG+ + ++P+AIN  + L   
Subjt:  FVNNSARAKYAELLKRDFLFERGF------SGDLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL---

Query:  --------QNFPHAAYNVM---VVAPSNEQLSDAVREVGIEGAR-------------QRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIVVNEISGCW
                Q   H  Y  +   +  P        ++   ++  R              +L+PT+H++TVS +R+LL  +IL   +ID+GKI+V     C 
Subjt:  --------QNFPHAAYNVM---VVAPSNEQLSDAVREVGIEGAR-------------QRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIVVNEISGCW

Query:  KKKVGKLFFPNTITMLCKQAGVPENEGDVILFDNGIIDLPNLARLQRMQEAR------------QGGLVYGVNTILEQLALSASRQEF--VERQALTFWN
        K++   L FPN IT LC++  V E   D IL     ++   +  L   +EA+                V   +T LEQ A+  + Q    +  + + ++ 
Subjt:  KKKVGKLFFPNTITMLCKQAGVPENEGDVILFDNGIIDLPNLARLQRMQEAR------------QGGLVYGVNTILEQLALSASRQEF--VERQALTFWN

Query:  YVKNRDANLKKALQENFSK
        Y K RDA L  AL E+  +
Subjt:  YVKNRDANLKKALQENFSK

KAE8718449.1 hypothetical protein F3Y22_tig00110013pilonHSYRG00240 [Hibiscus syriacus]1.6e-2428.27Show/hide
Query:  LPYDRFVNNSARAKYAELLKRDFLFERGF------SGDLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALY
        + + +F N+ A+A++     R+  FE GF       G     +   +    W  F   P SVNA +V+EFYANI K +   + VRG ++ ++  AIN  +
Subjt:  LPYDRFVNNSARAKYAELLKRDFLFERGF------SGDLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALY

Query:  NLQNFPHAAYNVMVVAPSNEQ---LSDAVREVGIEGARQ------------------------RLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIVVNE
        +LQ            A SN+    L D   E      RQ                        +L+PT+H++TVS  R+LL  +++ S  IDVG+I+V +
Subjt:  NLQNFPHAAYNVMVVAPSNEQ---LSDAVREVGIEGARQ------------------------RLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIVVNE

Query:  ISGCWKKKVGKLFFPNTITMLCKQAGVPENEGDVILFDNGIIDLPNLARLQRMQEARQGGLVY----GVNTILEQLALSASRQEFVERQA---------L
        +  C  KK   L FPN IT LC++  V EN  D IL     I    L  L  ++  +    V+    G      ++ L A  +   + QA          
Subjt:  ISGCWKKKVGKLFFPNTITMLCKQAGVPENEGDVILFDNGIIDLPNLARLQRMQEARQGGLVY----GVNTILEQLALSASRQEFVERQA---------L

Query:  TFWNYVKNRDANLKKALQENFSKPYPALLAFPEDLL
         F+ YVK+RD  ++   QE           FP+++L
Subjt:  TFWNYVKNRDANLKKALQENFSKPYPALLAFPEDLL

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]5.5e-2532.93Show/hide
Query:  RFVNNSARAKYA-ELLKRDFLFERGF-------SGDLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL
        +F   +A  +Y   +  R    E+GF        G LP F+   I  H W+ FCA PE     +VREFYAN+       V VRGV+V WS  AINA++ L
Subjt:  RFVNNSARAKYA-ELLKRDFLFERGF-------SGDLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL

Query:  QNFPHAAYNVMVVAPSNEQLSDAVREVGIEGA----------------------------RQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIVVNEI
         + P   ++  +   +   L   +  V + GA                            +  LLPTTH  TVS++R+LL  ++L   SI+VG+++ +EI
Subjt:  QNFPHAAYNVMVVAPSNEQLSDAVREVGIEGA----------------------------RQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIVVNEI

Query:  SGCWKKKVGKLFFPNTITMLCKQAGVPENEGDVILFDNGIIDLPNLARL
          C  +K G LFFP+ IT LC+ A  P    +  L + G ID   +AR+
Subjt:  SGCWKKKVGKLFFPNTITMLCKQAGVPENEGDVILFDNGIIDLPNLARL

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]1.6e-3531.41Show/hide
Query:  RFVNNSARAKYA-ELLKRDFLFERGF-------SGDLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL
        +F   +A  +Y   +  R    E+GF        G LP F+   I  H W+ FCA PE     +VREFYAN+   +   V VRGV+V WS  AINA++ L
Subjt:  RFVNNSARAKYA-ELLKRDFLFERGF-------SGDLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL

Query:  QNFPHAAYNVMVVAPSNEQLSDAVREVGIEGA----------------------------RQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIVVNEI
         + P   ++  +   + + L   +  V   GA                            + RLLPTTH  TVS++R+LL  ++L   SI+VG+++ +EI
Subjt:  QNFPHAAYNVMVVAPSNEQLSDAVREVGIEGA----------------------------RQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIVVNEI

Query:  SGCWKKKVGKLFFPNTITMLCKQAGVPENEGDVILFDNGIIDLPNLARL---------QRMQEAR---------QGGLVYGVNTILEQLALSASRQ----
          C  +K G LFFP+ IT LC+ A  P    +  L + G ID   +AR+         Q+   +R          G ++  +  + ++L+    +Q    
Subjt:  SGCWKKKVGKLFFPNTITMLCKQAGVPENEGDVILFDNGIIDLPNLARL---------QRMQEAR---------QGGLVYGVNTILEQLALSASRQ----

Query:  ---EFVERQALTFWNYVKNRDANLKKALQENFSKPYPALLAFPEDLL
           +   +Q   FW Y K RD  LKKALQ NF++P P   AFP+++L
Subjt:  ---EFVERQALTFWNYVKNRDANLKKALQENFSKPYPALLAFPEDLL

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]1.4e-2834.55Show/hide
Query:  VVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQN--FPHAAYNVMVVAPSNEQLSDAVREVGIE------GA-------------------RQRL
        +VREFYAN+   +   + VRGV+V WS  AINA++ L +    H+ +   +  P    + + V   G E      GA                   + RL
Subjt:  VVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQN--FPHAAYNVMVVAPSNEQLSDAVREVGIE------GA-------------------RQRL

Query:  LPTTHDSTVSRERVLLAFAILRSLSIDVGKIVVNEISGCWKKKVGKLFFPNTITMLCKQAGVPENEGDVILFDNGIIDL-----------------PNLA
        LPTTH   VS++R+LL  ++L   SI+VG+++ +EI  C  +K G LFFP+ IT LC+ A    NE    L + G ID                  P+ +
Subjt:  LPTTHDSTVSRERVLLAFAILRSLSIDVGKIVVNEISGCWKKKVGKLFFPNTITMLCKQAGVPENEGDVILFDNGIIDL-----------------PNLA

Query:  RLQRMQEARQGGLVYGVNTILEQLALSASRQEFVERQALTFWNYVKNRDANLKKALQENFSKPYPALLAFPEDLL
        R      +R  G V      LEQ     S+QE   +Q   FW Y K RD  LKKALQ NF++P P   AFP+++L
Subjt:  RLQRMQEARQGGLVYGVNTILEQLALSASRQEFVERQALTFWNYVKNRDANLKKALQENFSKPYPALLAFPEDLL

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)2.7e-2532.93Show/hide
Query:  RFVNNSARAKYA-ELLKRDFLFERGF-------SGDLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL
        +F   +A  +Y   +  R    E+GF        G LP F+   I  H W+ FCA PE     +VREFYAN+       V VRGV+V WS  AINA++ L
Subjt:  RFVNNSARAKYA-ELLKRDFLFERGF-------SGDLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL

Query:  QNFPHAAYNVMVVAPSNEQLSDAVREVGIEGA----------------------------RQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIVVNEI
         + P   ++  +   +   L   +  V + GA                            +  LLPTTH  TVS++R+LL  ++L   SI+VG+++ +EI
Subjt:  QNFPHAAYNVMVVAPSNEQLSDAVREVGIEGA----------------------------RQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIVVNEI

Query:  SGCWKKKVGKLFFPNTITMLCKQAGVPENEGDVILFDNGIIDLPNLARL
          C  +K G LFFP+ IT LC+ A  P    +  L + G ID   +AR+
Subjt:  SGCWKKKVGKLFFPNTITMLCKQAGVPENEGDVILFDNGIIDLPNLARL

A0A2P5BCG4 Uncharacterized protein (Fragment)7.5e-3631.41Show/hide
Query:  RFVNNSARAKYA-ELLKRDFLFERGF-------SGDLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL
        +F   +A  +Y   +  R    E+GF        G LP F+   I  H W+ FCA PE     +VREFYAN+   +   V VRGV+V WS  AINA++ L
Subjt:  RFVNNSARAKYA-ELLKRDFLFERGF-------SGDLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL

Query:  QNFPHAAYNVMVVAPSNEQLSDAVREVGIEGA----------------------------RQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIVVNEI
         + P   ++  +   + + L   +  V   GA                            + RLLPTTH  TVS++R+LL  ++L   SI+VG+++ +EI
Subjt:  QNFPHAAYNVMVVAPSNEQLSDAVREVGIEGA----------------------------RQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIVVNEI

Query:  SGCWKKKVGKLFFPNTITMLCKQAGVPENEGDVILFDNGIIDLPNLARL---------QRMQEAR---------QGGLVYGVNTILEQLALSASRQ----
          C  +K G LFFP+ IT LC+ A  P    +  L + G ID   +AR+         Q+   +R          G ++  +  + ++L+    +Q    
Subjt:  SGCWKKKVGKLFFPNTITMLCKQAGVPENEGDVILFDNGIIDLPNLARL---------QRMQEAR---------QGGLVYGVNTILEQLALSASRQ----

Query:  ---EFVERQALTFWNYVKNRDANLKKALQENFSKPYPALLAFPEDLL
           +   +Q   FW Y K RD  LKKALQ NF++P P   AFP+++L
Subjt:  ---EFVERQALTFWNYVKNRDANLKKALQENFSKPYPALLAFPEDLL

A0A2P5DXM3 Uncharacterized protein6.8e-2934.55Show/hide
Query:  VVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQN--FPHAAYNVMVVAPSNEQLSDAVREVGIE------GA-------------------RQRL
        +VREFYAN+   +   + VRGV+V WS  AINA++ L +    H+ +   +  P    + + V   G E      GA                   + RL
Subjt:  VVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQN--FPHAAYNVMVVAPSNEQLSDAVREVGIE------GA-------------------RQRL

Query:  LPTTHDSTVSRERVLLAFAILRSLSIDVGKIVVNEISGCWKKKVGKLFFPNTITMLCKQAGVPENEGDVILFDNGIIDL-----------------PNLA
        LPTTH   VS++R+LL  ++L   SI+VG+++ +EI  C  +K G LFFP+ IT LC+ A    NE    L + G ID                  P+ +
Subjt:  LPTTHDSTVSRERVLLAFAILRSLSIDVGKIVVNEISGCWKKKVGKLFFPNTITMLCKQAGVPENEGDVILFDNGIIDL-----------------PNLA

Query:  RLQRMQEARQGGLVYGVNTILEQLALSASRQEFVERQALTFWNYVKNRDANLKKALQENFSKPYPALLAFPEDLL
        R      +R  G V      LEQ     S+QE   +Q   FW Y K RD  LKKALQ NF++P P   AFP+++L
Subjt:  RLQRMQEARQGGLVYGVNTILEQLALSASRQEFVERQALTFWNYVKNRDANLKKALQENFSKPYPALLAFPEDLL

A0A6A2ZUE4 Uncharacterized protein2.3e-2427.9Show/hide
Query:  FVNNSARAKYAELLKRDFLFERGF------SGDLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL---
        FV+ +A+  Y  +  R   FE GF      + +L   +   +  H W+ F   P  VNA +V+EFY+NI + +   V+VRG+ + ++P+AIN  + L   
Subjt:  FVNNSARAKYAELLKRDFLFERGF------SGDLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL---

Query:  --------QNFPHAAYNVM---VVAPSNEQLSDAVREVGIEGAR-------------QRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIVVNEISGCW
                Q   H  Y  +   +  P        ++   ++  R              +L+PT+H++TVS +R+LL  +IL   +ID+GKI+V     C 
Subjt:  --------QNFPHAAYNVM---VVAPSNEQLSDAVREVGIEGAR-------------QRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIVVNEISGCW

Query:  KKKVGKLFFPNTITMLCKQAGVPENEGDVILFDNGIIDLPNLARLQRMQEAR------------QGGLVYGVNTILEQLALSASRQEF--VERQALTFWN
        K++   L FPN IT LC++  V E   D IL     ++   +  L   +EA+                V   +T LEQ A+  + Q    +  + + ++ 
Subjt:  KKKVGKLFFPNTITMLCKQAGVPENEGDVILFDNGIIDLPNLARLQRMQEAR------------QGGLVYGVNTILEQLALSASRQEF--VERQALTFWN

Query:  YVKNRDANLKKALQENFSK
        Y K RDA L  AL E+  +
Subjt:  YVKNRDANLKKALQENFSK

A0A6A3BU96 Uncharacterized protein7.8e-2528.27Show/hide
Query:  LPYDRFVNNSARAKYAELLKRDFLFERGF------SGDLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALY
        + + +F N+ A+A++     R+  FE GF       G     +   +    W  F   P SVNA +V+EFYANI K +   + VRG ++ ++  AIN  +
Subjt:  LPYDRFVNNSARAKYAELLKRDFLFERGF------SGDLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALY

Query:  NLQNFPHAAYNVMVVAPSNEQ---LSDAVREVGIEGARQ------------------------RLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIVVNE
        +LQ            A SN+    L D   E      RQ                        +L+PT+H++TVS  R+LL  +++ S  IDVG+I+V +
Subjt:  NLQNFPHAAYNVMVVAPSNEQ---LSDAVREVGIEGARQ------------------------RLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIVVNE

Query:  ISGCWKKKVGKLFFPNTITMLCKQAGVPENEGDVILFDNGIIDLPNLARLQRMQEARQGGLVY----GVNTILEQLALSASRQEFVERQA---------L
        +  C  KK   L FPN IT LC++  V EN  D IL     I    L  L  ++  +    V+    G      ++ L A  +   + QA          
Subjt:  ISGCWKKKVGKLFFPNTITMLCKQAGVPENEGDVILFDNGIIDLPNLARLQRMQEARQGGLVY----GVNTILEQLALSASRQEFVERQA---------L

Query:  TFWNYVKNRDANLKKALQENFSKPYPALLAFPEDLL
         F+ YVK+RD  ++   QE           FP+++L
Subjt:  TFWNYVKNRDANLKKALQENFSKPYPALLAFPEDLL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGGAGGTACCCAAACGTCGCCGCATTAAGCGAAAAGCGGGCCGCGTCAGGGTAGTCCGAACTGATACCCCCTCGCCTCCAACCACTGATTCTGAAAGAGAAAATGC
AGAGAAAGAAGAGCGTGAGAAGAAAGAGGCCGAGGAAAAAGGCAAAAGTGTTGCTGAAGCATCGGAAGAACCTGATGAGATAGAAGAGCCACAGTTGCCGTATGATCGCT
TCGTCAACAATTCTGCCAGAGCAAAATATGCTGAGCTGCTGAAAAGAGATTTCTTGTTTGAGAGAGGATTTAGCGGTGATCTTCCGCATTTTCTGAGGACCGGCATTGCA
GACCACGGCTGGGAATTGTTTTGTGCAAAGCCTGAGTCTGTAAACGCACAGGTGGTGCGCGAATTTTATGCAAATATTGACAAAGAAGACGGTTTCCAAGTGATTGTTAG
AGGAGTCGAAGTAGACTGGAGTCCTAGTGCTATTAACGCACTGTATAACCTTCAAAATTTCCCCCACGCAGCGTATAATGTGATGGTTGTGGCGCCATCTAATGAGCAGC
TGAGCGATGCTGTGCGGGAAGTGGGTATTGAAGGGGCACGACAGAGGTTGCTTCCAACGACTCATGACTCGACGGTCTCTAGGGAACGGGTTCTTCTGGCTTTCGCGATT
TTGCGGTCTCTCAGTATCGATGTAGGGAAGATTGTTGTGAATGAGATATCTGGATGTTGGAAGAAGAAGGTGGGGAAGCTGTTTTTCCCGAATACCATTACCATGCTTTG
TAAGCAAGCAGGGGTTCCAGAGAATGAAGGAGATGTCATATTATTTGACAACGGAATTATCGACTTGCCTAACTTGGCACGGCTTCAGCGTATGCAAGAGGCGCGTCAGG
GTGGACTTGTCTACGGTGTCAACACGATTTTAGAACAACTGGCACTTTCGGCTAGTAGGCAAGAGTTTGTCGAGAGGCAAGCTTTGACCTTCTGGAACTATGTTAAAAAT
CGTGATGCCAATCTGAAGAAGGCGCTACAGGAGAATTTTTCCAAGCCATATCCAGCCCTTCTAGCATTCCCTGAAGATTTATTGAACCCCTGGATTCCGCCCCCACCGAT
TGAAAGAGGAGAAGAGGATGATGAAAATGAGCAGGGCCAAGAGGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCGGAGGTACCCAAACGTCGCCGCATTAAGCGAAAAGCGGGCCGCGTCAGGGTAGTCCGAACTGATACCCCCTCGCCTCCAACCACTGATTCTGAAAGAGAAAATGC
AGAGAAAGAAGAGCGTGAGAAGAAAGAGGCCGAGGAAAAAGGCAAAAGTGTTGCTGAAGCATCGGAAGAACCTGATGAGATAGAAGAGCCACAGTTGCCGTATGATCGCT
TCGTCAACAATTCTGCCAGAGCAAAATATGCTGAGCTGCTGAAAAGAGATTTCTTGTTTGAGAGAGGATTTAGCGGTGATCTTCCGCATTTTCTGAGGACCGGCATTGCA
GACCACGGCTGGGAATTGTTTTGTGCAAAGCCTGAGTCTGTAAACGCACAGGTGGTGCGCGAATTTTATGCAAATATTGACAAAGAAGACGGTTTCCAAGTGATTGTTAG
AGGAGTCGAAGTAGACTGGAGTCCTAGTGCTATTAACGCACTGTATAACCTTCAAAATTTCCCCCACGCAGCGTATAATGTGATGGTTGTGGCGCCATCTAATGAGCAGC
TGAGCGATGCTGTGCGGGAAGTGGGTATTGAAGGGGCACGACAGAGGTTGCTTCCAACGACTCATGACTCGACGGTCTCTAGGGAACGGGTTCTTCTGGCTTTCGCGATT
TTGCGGTCTCTCAGTATCGATGTAGGGAAGATTGTTGTGAATGAGATATCTGGATGTTGGAAGAAGAAGGTGGGGAAGCTGTTTTTCCCGAATACCATTACCATGCTTTG
TAAGCAAGCAGGGGTTCCAGAGAATGAAGGAGATGTCATATTATTTGACAACGGAATTATCGACTTGCCTAACTTGGCACGGCTTCAGCGTATGCAAGAGGCGCGTCAGG
GTGGACTTGTCTACGGTGTCAACACGATTTTAGAACAACTGGCACTTTCGGCTAGTAGGCAAGAGTTTGTCGAGAGGCAAGCTTTGACCTTCTGGAACTATGTTAAAAAT
CGTGATGCCAATCTGAAGAAGGCGCTACAGGAGAATTTTTCCAAGCCATATCCAGCCCTTCTAGCATTCCCTGAAGATTTATTGAACCCCTGGATTCCGCCCCCACCGAT
TGAAAGAGGAGAAGAGGATGATGAAAATGAGCAGGGCCAAGAGGACTGA
Protein sequenceShow/hide protein sequence
MPEVPKRRRIKRKAGRVRVVRTDTPSPPTTDSERENAEKEEREKKEAEEKGKSVAEASEEPDEIEEPQLPYDRFVNNSARAKYAELLKRDFLFERGFSGDLPHFLRTGIA
DHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNVMVVAPSNEQLSDAVREVGIEGARQRLLPTTHDSTVSRERVLLAFAI
LRSLSIDVGKIVVNEISGCWKKKVGKLFFPNTITMLCKQAGVPENEGDVILFDNGIIDLPNLARLQRMQEARQGGLVYGVNTILEQLALSASRQEFVERQALTFWNYVKN
RDANLKKALQENFSKPYPALLAFPEDLLNPWIPPPPIERGEEDDENEQGQED