; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg034441 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg034441
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRT_RNaseH_2 domain-containing protein
Genome locationscaffold4:16021869..16023884
RNA-Seq ExpressionSpg034441
SyntenySpg034441
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8695166.1 hypothetical protein F3Y22_tig00110733pilonHSYRG00282 [Hibiscus syriacus]1.1e-2128.08Show/hide
Query:  EDKGKAKYAELLKRDFLFERGF------SGDLPHFLQAGITNHGWELFCSKPESVNAHVVREFYANIDKEEGFQVIVRGVAVDWSPGANNALGAM-----
        ++  K  Y  +  R   FE GF      + +L   +   +T H W+ F   P  VNA +V+EFY+NI +     V+VRG+++ ++P A N    +     
Subjt:  EDKGKAKYAELLKRDFLFERGF------SGDLPHFLQAGITNHGWELFCSKPESVNAHVVREFYANIDKEEGFQVIVRGVAVDWSPGANNALGAM-----

Query:  -----------ETFEN-----------------RERTFQSAYLKKEANTWMGFIKQMLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWRK
                   ET++                  + +T     L      W  F+K  L+PT+H++TVS +R+LL  +IL   +ID+GKII      C ++
Subjt:  -----------ETFEN-----------------RERTFQSAYLKKEANTWMGFIKQMLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWRK

Query:  KVGKLFFPNTITMLCKKAGVPENE-DVILFDKGIIDTPNLARLQRTQEAR------------QGGLVYGIHTILEQLALSASRQEFAERQSQ--TFWSYV
        +   L FPN IT LC+K  V E   D IL     ++   +  L   +EA+                V    T LEQ A+  + Q   +   +   +++Y 
Subjt:  KVGKLFFPNTITMLCKKAGVPENE-DVILFDKGIIDTPNLARLQRTQEAR------------QGGLVYGIHTILEQLALSASRQEFAERQSQ--TFWSYV

Query:  KRRDANLKKALQENFSK
        KRRDA L  AL E+  +
Subjt:  KRRDANLKKALQENFSK

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]4.1e-2439.09Show/hide
Query:  GDLPHFLQAGITNHGWELFCSKPESVNAHVVREFYANIDKEEGFQVIVRGVAVDWSPGANNALGAM--------ETFENRER------------------
        G LP F+   IT H W+ FC+ PE     +VREFYAN+       V VRGV V WS  A NA+  +        E  EN                     
Subjt:  GDLPHFLQAGITNHGWELFCSKPESVNAHVVREFYANIDKEEGFQVIVRGVAVDWSPGANNALGAM--------ETFENRER------------------

Query:  -TFQSAY------LKKEANTWMGFIKQMLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWRKKVGKLFFPNTITMLCKKAGVP--ENEDVI
         + Q AY      L   A  W  F+K  LLPTTH  TVS++R+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LC+ A  P   NE+  
Subjt:  -TFQSAY------LKKEANTWMGFIKQMLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWRKKVGKLFFPNTITMLCKKAGVP--ENEDVI

Query:  LFDKGIIDTPNLARLQRTQE
        L + G ID   +AR+  TQE
Subjt:  LFDKGIIDTPNLARLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]9.8e-3433.73Show/hide
Query:  GDLPHFLQAGITNHGWELFCSKPESVNAHVVREFYANIDKEEGFQVIVRGVAVDWSPGANNALGAM--------ETFENRER------------------
        G LP F+   IT H W+ FC+ PE     +VREFYAN+   E   V VRGV V WS  A NA+  +        E  +N  +                  
Subjt:  GDLPHFLQAGITNHGWELFCSKPESVNAHVVREFYANIDKEEGFQVIVRGVAVDWSPGANNALGAM--------ETFENRER------------------

Query:  -TFQSAY------LKKEANTWMGFIKQMLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWRKKVGKLFFPNTITMLCKKAGVP--ENEDVI
         + Q AY      L   A  W  F+K  LLPTTH  TVS++R+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LC+ A  P   NE+  
Subjt:  -TFQSAY------LKKEANTWMGFIKQMLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWRKKVGKLFFPNTITMLCKKAGVP--ENEDVI

Query:  LFDKGIIDTPNLARLQR---TQEARQ---------------GGLVYGIHTILEQLALSASRQ-------EFAERQSQTFWSYVKRRDANLKKALQENFSK
        L + G ID   +AR+ +   T+  +Q               G ++  +  + ++L+    +Q       +   +Q Q FW+Y K RD  LKKALQ NF++
Subjt:  LFDKGIIDTPNLARLQR---TQEARQ---------------GGLVYGIHTILEQLALSASRQ-------EFAERQSQTFWSYVKRRDANLKKALQENFSK

Query:  PYPALLVFPDDLLNPWILPPPMQREEEDDEEHGQE
        P P    FP ++L         + E E D++   E
Subjt:  PYPALLVFPDDLLNPWILPPPMQREEEDDEEHGQE

PON59596.1 hypothetical protein PanWU01x14_158080 [Parasponia andersonii]1.2e-2334.45Show/hide
Query:  ETFENRERTFQSAYLKKEANTWMGFIKQMLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWRKKVGKLFFPNTITMLCKKAGVP--ENEDV
        E  +NR    +   L   A  W  F+K  LLPTTH  TVS++R+LL +++L   SI+VG++I SEI  C  +K G LFFP+ IT LC+ A  P   NE+ 
Subjt:  ETFENRERTFQSAYLKKEANTWMGFIKQMLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWRKKVGKLFFPNTITMLCKKAGVP--ENEDV

Query:  ILFDKGIIDTPNLARLQRTQEAR--------------------QGGLVYGIHTILEQLALSASRQ-------EFAERQSQTFWSYVKRRDANLKKALQEN
         L   G ID   +AR+  TQE +                     G ++  +  + ++L+    +Q       +   +Q Q FW+Y K RD  LKKALQ N
Subjt:  ILFDKGIIDTPNLARLQRTQEAR--------------------QGGLVYGIHTILEQLALSASRQ-------EFAERQSQTFWSYVKRRDANLKKALQEN

Query:  FSKPYPALLVFPDDLLNPWILPPPMQREEEDDEEHGQE
        F++P P    FP +LL         + E E D++   E
Subjt:  FSKPYPALLVFPDDLLNPWILPPPMQREEEDDEEHGQE

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]1.2e-2835.47Show/hide
Query:  VVREFYANIDKEEGFQVIVRGVAVDWSPGANNALGAM--------ETFENRER-------------------TFQSAY------LKKEANTWMGFIKQML
        +VREFYAN+   E   + VRGV V WS  A NA+  +        E  EN                      + Q AY      L   A  W  F+K  L
Subjt:  VVREFYANIDKEEGFQVIVRGVAVDWSPGANNALGAM--------ETFENRER-------------------TFQSAY------LKKEANTWMGFIKQML

Query:  LPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWRKKVGKLFFPNTITMLCKKAGVPENEDVILFDKGIIDTPNLARL------QRTQE-----
        LPTTH   VS++R+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LC+ A    NE+  L + G ID   +AR+      + TQ+     
Subjt:  LPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWRKKVGKLFFPNTITMLCKKAGVPENEDVILFDKGIIDTPNLARL------QRTQE-----

Query:  ------ARQGGLVYGIHTILEQLALSASRQEFAERQSQTFWSYVKRRDANLKKALQENFSKPYPALLVFPDDLLNPWILPPPMQREEEDDEEHGQE
              +R  G V      LEQ     S+QE   +Q Q FW+Y K RD  LKKALQ NF++P P    FP ++L         + E E D++   E
Subjt:  ------ARQGGLVYGIHTILEQLALSASRQEFAERQSQTFWSYVKRRDANLKKALQENFSKPYPALLVFPDDLLNPWILPPPMQREEEDDEEHGQE

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)2.0e-2439.09Show/hide
Query:  GDLPHFLQAGITNHGWELFCSKPESVNAHVVREFYANIDKEEGFQVIVRGVAVDWSPGANNALGAM--------ETFENRER------------------
        G LP F+   IT H W+ FC+ PE     +VREFYAN+       V VRGV V WS  A NA+  +        E  EN                     
Subjt:  GDLPHFLQAGITNHGWELFCSKPESVNAHVVREFYANIDKEEGFQVIVRGVAVDWSPGANNALGAM--------ETFENRER------------------

Query:  -TFQSAY------LKKEANTWMGFIKQMLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWRKKVGKLFFPNTITMLCKKAGVP--ENEDVI
         + Q AY      L   A  W  F+K  LLPTTH  TVS++R+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LC+ A  P   NE+  
Subjt:  -TFQSAY------LKKEANTWMGFIKQMLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWRKKVGKLFFPNTITMLCKKAGVP--ENEDVI

Query:  LFDKGIIDTPNLARLQRTQE
        L + G ID   +AR+  TQE
Subjt:  LFDKGIIDTPNLARLQRTQE

A0A2P5BCG4 Uncharacterized protein (Fragment)4.7e-3433.73Show/hide
Query:  GDLPHFLQAGITNHGWELFCSKPESVNAHVVREFYANIDKEEGFQVIVRGVAVDWSPGANNALGAM--------ETFENRER------------------
        G LP F+   IT H W+ FC+ PE     +VREFYAN+   E   V VRGV V WS  A NA+  +        E  +N  +                  
Subjt:  GDLPHFLQAGITNHGWELFCSKPESVNAHVVREFYANIDKEEGFQVIVRGVAVDWSPGANNALGAM--------ETFENRER------------------

Query:  -TFQSAY------LKKEANTWMGFIKQMLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWRKKVGKLFFPNTITMLCKKAGVP--ENEDVI
         + Q AY      L   A  W  F+K  LLPTTH  TVS++R+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LC+ A  P   NE+  
Subjt:  -TFQSAY------LKKEANTWMGFIKQMLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWRKKVGKLFFPNTITMLCKKAGVP--ENEDVI

Query:  LFDKGIIDTPNLARLQR---TQEARQ---------------GGLVYGIHTILEQLALSASRQ-------EFAERQSQTFWSYVKRRDANLKKALQENFSK
        L + G ID   +AR+ +   T+  +Q               G ++  +  + ++L+    +Q       +   +Q Q FW+Y K RD  LKKALQ NF++
Subjt:  LFDKGIIDTPNLARLQR---TQEARQ---------------GGLVYGIHTILEQLALSASRQ-------EFAERQSQTFWSYVKRRDANLKKALQENFSK

Query:  PYPALLVFPDDLLNPWILPPPMQREEEDDEEHGQE
        P P    FP ++L         + E E D++   E
Subjt:  PYPALLVFPDDLLNPWILPPPMQREEEDDEEHGQE

A0A2P5CEY2 Uncharacterized protein5.8e-2434.45Show/hide
Query:  ETFENRERTFQSAYLKKEANTWMGFIKQMLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWRKKVGKLFFPNTITMLCKKAGVP--ENEDV
        E  +NR    +   L   A  W  F+K  LLPTTH  TVS++R+LL +++L   SI+VG++I SEI  C  +K G LFFP+ IT LC+ A  P   NE+ 
Subjt:  ETFENRERTFQSAYLKKEANTWMGFIKQMLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWRKKVGKLFFPNTITMLCKKAGVP--ENEDV

Query:  ILFDKGIIDTPNLARLQRTQEAR--------------------QGGLVYGIHTILEQLALSASRQ-------EFAERQSQTFWSYVKRRDANLKKALQEN
         L   G ID   +AR+  TQE +                     G ++  +  + ++L+    +Q       +   +Q Q FW+Y K RD  LKKALQ N
Subjt:  ILFDKGIIDTPNLARLQRTQEAR--------------------QGGLVYGIHTILEQLALSASRQ-------EFAERQSQTFWSYVKRRDANLKKALQEN

Query:  FSKPYPALLVFPDDLLNPWILPPPMQREEEDDEEHGQE
        F++P P    FP +LL         + E E D++   E
Subjt:  FSKPYPALLVFPDDLLNPWILPPPMQREEEDDEEHGQE

A0A2P5DXM3 Uncharacterized protein6.0e-2935.47Show/hide
Query:  VVREFYANIDKEEGFQVIVRGVAVDWSPGANNALGAM--------ETFENRER-------------------TFQSAY------LKKEANTWMGFIKQML
        +VREFYAN+   E   + VRGV V WS  A NA+  +        E  EN                      + Q AY      L   A  W  F+K  L
Subjt:  VVREFYANIDKEEGFQVIVRGVAVDWSPGANNALGAM--------ETFENRER-------------------TFQSAY------LKKEANTWMGFIKQML

Query:  LPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWRKKVGKLFFPNTITMLCKKAGVPENEDVILFDKGIIDTPNLARL------QRTQE-----
        LPTTH   VS++R+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LC+ A    NE+  L + G ID   +AR+      + TQ+     
Subjt:  LPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWRKKVGKLFFPNTITMLCKKAGVPENEDVILFDKGIIDTPNLARL------QRTQE-----

Query:  ------ARQGGLVYGIHTILEQLALSASRQEFAERQSQTFWSYVKRRDANLKKALQENFSKPYPALLVFPDDLLNPWILPPPMQREEEDDEEHGQE
              +R  G V      LEQ     S+QE   +Q Q FW+Y K RD  LKKALQ NF++P P    FP ++L         + E E D++   E
Subjt:  ------ARQGGLVYGIHTILEQLALSASRQEFAERQSQTFWSYVKRRDANLKKALQENFSKPYPALLVFPDDLLNPWILPPPMQREEEDDEEHGQE

A0A6A3BU96 Uncharacterized protein1.2e-2128.77Show/hide
Query:  DKGKAKYAELLKRDFLFERGF------SGDLPHFLQAGITNHGWELFCSKPESVNAHVVREFYANIDKEEGFQVIVRGVAVDWSPGANNAL---------
        D+ KA++     R+  FE GF       G     +   +    W  F   P SVNA +V+EFYANI K     + VRG  + ++  A N           
Subjt:  DKGKAKYAELLKRDFLFERGF------SGDLPHFLQAGITNHGWELFCSKPESVNAHVVREFYANIDKEEGFQVIVRGVAVDWSPGANNAL---------

Query:  --------------GAME--TFENRE--------RTFQSAYLKKEANTWMGFIKQMLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWRKK
                      G +E   FEN E         +     L+  A  W  F+K  L+PT+H++TVS  R+LL  +++ S  IDVG+II  ++  C  KK
Subjt:  --------------GAME--TFENRE--------RTFQSAYLKKEANTWMGFIKQMLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWRKK

Query:  VGKLFFPNTITMLCKKAGVPENE-DVILFDKGIIDTPNLARLQRTQEARQGGLVY----GIHTILEQLALSASRQEFAERQSQ---------TFWSYVKR
           L FPN IT LC+K  V EN  D IL     I    L  L   +  +    V+    G      ++ L A  +   + Q+Q          F+ YVK 
Subjt:  VGKLFFPNTITMLCKKAGVPENE-DVILFDKGIIDTPNLARLQRTQEARQGGLVY----GIHTILEQLALSASRQEFAERQSQ---------TFWSYVKR

Query:  RDANLKKALQENFSKPYPALLVFPDDLLNPWILPPPMQREEEDDEEHGQED
        RD  ++   QE           FPD++L      P    E   + EH   D
Subjt:  RDANLKKALQENFSKPYPALLVFPDDLLNPWILPPPMQREEEDDEEHGQED

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAATTCTGATGCTGCCACGTGTCGCCCAGGGGCAATCCAAGGGTTAGAGGATATTGCTGTCGAGGCAGTTGAAGAAGGCAATTCGAAAGAACCTGAAGGACAAAC
CCCAGGGGAGACTGACCCGATAGTTGCAGATACGGAGGGAGTTCAAGAAGAACACACAGAGGAAGTTCAAGAAAAACAGACAGAGGATGCGCGAGAAGGACAGACAGAGG
ATGTGCCGGAAAAAGGCAATGAGCAGGTGGAACAAGTGCAGGAGGCTCGAGTGGAGGTTATCATGCCGAAAGTGCCAAGACGACGCCGCCGGAAGCAAAAAGCCGGCCGT
GTTAAGGAGAAAAAAGAGGCAGAAGACAAAGCAAGAGAGGAAGCAGAGAAAAAGGCTGAGGAAGAAGTGTTGCTTAAACAAAGGGAAGACAAGGGCAAGGCAAAGTATGC
TGAGCTGCTGAAGAGAGACTTCCTGTTTGAGAGAGGATTTAGTGGTGATCTTCCACATTTTCTACAGGCTGGCATTACGAACCACGGTTGGGAGTTATTCTGTTCCAAGC
CTGAATCTGTGAACGCGCATGTAGTGCGCGAGTTTTATGCAAACATTGACAAGGAAGAGGGTTTCCAAGTTATCGTTCGAGGAGTAGCAGTTGACTGGAGTCCTGGTGCC
AATAACGCCCTGGGCGCAATGGAGACTTTCGAAAACAGAGAAAGGACATTTCAGTCAGCCTATCTTAAGAAGGAAGCAAATACATGGATGGGATTCATCAAACAAATGTT
GCTTCCAACGACTCATGACTCGACGGTTTCTAGGGAACGTGTTCTTCTGGCTTTCGCGATTTTAAGGTCTCTCAGTATTGATGTGGGTAAGATTATTGCGAGTGAGATAT
CTGGATGCTGGCGGAAGAAAGTTGGGAAGTTGTTTTTCCCGAATACAATTACGATGCTTTGCAAGAAAGCAGGGGTTCCGGAGAATGAAGATGTCATATTATTTGACAAG
GGAATCATTGATACGCCTAATTTGGCACGGCTTCAACGTACGCAAGAGGCACGTCAGGGTGGGCTTGTTTATGGCATTCACACGATTTTAGAACAACTTGCACTGTCAGC
CAGCAGGCAAGAGTTTGCCGAGAGGCAATCTCAAACTTTCTGGAGCTATGTTAAACGTCGTGATGCCAACCTGAAAAAGGCGCTACAGGAAAATTTTTCCAAACCATATC
CAGCCCTTCTAGTATTCCCTGATGATTTATTGAACCCCTGGATTCTGCCCCCACCAATGCAAAGAGAAGAAGAGGATGATGAAGAGCATGGTCAGGAAGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGAATTCTGATGCTGCCACGTGTCGCCCAGGGGCAATCCAAGGGTTAGAGGATATTGCTGTCGAGGCAGTTGAAGAAGGCAATTCGAAAGAACCTGAAGGACAAAC
CCCAGGGGAGACTGACCCGATAGTTGCAGATACGGAGGGAGTTCAAGAAGAACACACAGAGGAAGTTCAAGAAAAACAGACAGAGGATGCGCGAGAAGGACAGACAGAGG
ATGTGCCGGAAAAAGGCAATGAGCAGGTGGAACAAGTGCAGGAGGCTCGAGTGGAGGTTATCATGCCGAAAGTGCCAAGACGACGCCGCCGGAAGCAAAAAGCCGGCCGT
GTTAAGGAGAAAAAAGAGGCAGAAGACAAAGCAAGAGAGGAAGCAGAGAAAAAGGCTGAGGAAGAAGTGTTGCTTAAACAAAGGGAAGACAAGGGCAAGGCAAAGTATGC
TGAGCTGCTGAAGAGAGACTTCCTGTTTGAGAGAGGATTTAGTGGTGATCTTCCACATTTTCTACAGGCTGGCATTACGAACCACGGTTGGGAGTTATTCTGTTCCAAGC
CTGAATCTGTGAACGCGCATGTAGTGCGCGAGTTTTATGCAAACATTGACAAGGAAGAGGGTTTCCAAGTTATCGTTCGAGGAGTAGCAGTTGACTGGAGTCCTGGTGCC
AATAACGCCCTGGGCGCAATGGAGACTTTCGAAAACAGAGAAAGGACATTTCAGTCAGCCTATCTTAAGAAGGAAGCAAATACATGGATGGGATTCATCAAACAAATGTT
GCTTCCAACGACTCATGACTCGACGGTTTCTAGGGAACGTGTTCTTCTGGCTTTCGCGATTTTAAGGTCTCTCAGTATTGATGTGGGTAAGATTATTGCGAGTGAGATAT
CTGGATGCTGGCGGAAGAAAGTTGGGAAGTTGTTTTTCCCGAATACAATTACGATGCTTTGCAAGAAAGCAGGGGTTCCGGAGAATGAAGATGTCATATTATTTGACAAG
GGAATCATTGATACGCCTAATTTGGCACGGCTTCAACGTACGCAAGAGGCACGTCAGGGTGGGCTTGTTTATGGCATTCACACGATTTTAGAACAACTTGCACTGTCAGC
CAGCAGGCAAGAGTTTGCCGAGAGGCAATCTCAAACTTTCTGGAGCTATGTTAAACGTCGTGATGCCAACCTGAAAAAGGCGCTACAGGAAAATTTTTCCAAACCATATC
CAGCCCTTCTAGTATTCCCTGATGATTTATTGAACCCCTGGATTCTGCCCCCACCAATGCAAAGAGAAGAAGAGGATGATGAAGAGCATGGTCAGGAAGATTGA
Protein sequenceShow/hide protein sequence
MENSDAATCRPGAIQGLEDIAVEAVEEGNSKEPEGQTPGETDPIVADTEGVQEEHTEEVQEKQTEDAREGQTEDVPEKGNEQVEQVQEARVEVIMPKVPRRRRRKQKAGR
VKEKKEAEDKAREEAEKKAEEEVLLKQREDKGKAKYAELLKRDFLFERGFSGDLPHFLQAGITNHGWELFCSKPESVNAHVVREFYANIDKEEGFQVIVRGVAVDWSPGA
NNALGAMETFENRERTFQSAYLKKEANTWMGFIKQMLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWRKKVGKLFFPNTITMLCKKAGVPENEDVILFDK
GIIDTPNLARLQRTQEARQGGLVYGIHTILEQLALSASRQEFAERQSQTFWSYVKRRDANLKKALQENFSKPYPALLVFPDDLLNPWILPPPMQREEEDDEEHGQED