; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg005315 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg005315
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRT_RNaseH_2 domain-containing protein
Genome locationscaffold6:36117149..36120253
RNA-Seq ExpressionSpg005315
SyntenySpg005315
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8695166.1 hypothetical protein F3Y22_tig00110733pilonHSYRG00282 [Hibiscus syriacus]1.5e-2025.71Show/hide
Query:  FANNFARAKYAKLLKRDFLFECSF------SGDLPQFLRTGIADHDWELFCAKPESVNAQVVREFYANIVKEDGFQVIVRGVEVDWSPSAINALYNL---
        F +  A+  Y  +  R   FE  F      + +L   +   +  H W+ F   P  VNA +V+EFY+NI + +   V+VRG+ + ++P+AIN  + L   
Subjt:  FANNFARAKYAKLLKRDFLFECSF------SGDLPQFLRTGIADHDWELFCAKPESVNAQVVREFYANIVKEDGFQVIVRGVEVDWSPSAINALYNL---

Query:  --------QNFPHAAYNEMV------------------------VAPSDEQLSDAVRERMLPTTHDSTISRERVLLAFAILRSLSINVGRIIASEISGCW
                Q   H  Y  ++                        + P  +  +  ++ +++PT+H++T+S +R+LL  +IL   +I++G+II      C 
Subjt:  --------QNFPHAAYNEMV------------------------VAPSDEQLSDAVRERMLPTTHDSTISRERVLLAFAILRSLSINVGRIIASEISGCW

Query:  KKKVGKLFFPNTITMLCSKTGVPVDEGDVILFDKGIIDKPNLARLQRMKE------------VRQGGLIYGINTILEKLALSASRQEFAE--RQTLTFWN
        K++   L FPN IT LC K  V  +  D IL     ++K  +  L   KE            V     +   +T LE+ A+  + Q   +   + + ++ 
Subjt:  KKKVGKLFFPNTITMLCSKTGVPVDEGDVILFDKGIIDKPNLARLQRMKE------------VRQGGLIYGINTILEKLALSASRQEFAE--RQTLTFWN

Query:  YVKNRDASLRRALQENFSK
        Y K RDA L  AL E+  +
Subjt:  YVKNRDASLRRALQENFSK

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]2.1e-2232.17Show/hide
Query:  ASEEHDAIE-EQQLPFDRFANNFARAKYAKLLKRDFLFECSFS-GDLPQFLRTGIADHDWELFCAKPESVNAQVVREFYANIVKEDGFQVIVRGVEVDWS
        A + H A++ E +    R+ NN          ++ F+ + S + G LP F+   I  H+W+ FCA PE     +VREFYAN+       V VRGV+V WS
Subjt:  ASEEHDAIE-EQQLPFDRFANNFARAKYAKLLKRDFLFECSFS-GDLPQFLRTGIADHDWELFCAKPESVNAQVVREFYANIVKEDGFQVIVRGVEVDWS

Query:  PSAINALYNLQN--FPHAAYNEMV---------------------------------VAPSDEQLSDAVRERMLPTTHDSTISRERVLLAFAILRSLSIN
          AINA++ L +    H+ + E +                                 + P+ +     ++  +LPTTH  T+S++R+LL  ++L   SIN
Subjt:  PSAINALYNLQN--FPHAAYNEMV---------------------------------VAPSDEQLSDAVRERMLPTTHDSTISRERVLLAFAILRSLSIN

Query:  VGRIIASEISGCWKKKVGKLFFPNTITMLCSKTGVPVDEGDVILFDKGIIDKPNLARL
        VGR+I SEI  C  +K G LFFP+ IT LC     P    +  L + G ID   +AR+
Subjt:  VGRIIASEISGCWKKKVGKLFFPNTITMLCSKTGVPVDEGDVILFDKGIIDKPNLARL

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]4.6e-3030.43Show/hide
Query:  ASEEHDAIE-EQQLPFDRFANNFARAKYAKLLKRDFLFECSFS-GDLPQFLRTGIADHDWELFCAKPESVNAQVVREFYANIVKEDGFQVIVRGVEVDWS
        A + H A++ E +    R+ NN          ++ F+ + S + G LP F+   I  H+W+ FCA PE     +VREFYAN+   +   V VRGV+V WS
Subjt:  ASEEHDAIE-EQQLPFDRFANNFARAKYAKLLKRDFLFECSFS-GDLPQFLRTGIADHDWELFCAKPESVNAQVVREFYANIVKEDGFQVIVRGVEVDWS

Query:  PSAINALYNL-----------QNF----------------------PHAAYN--EMVVAPSDEQLSDAVRERMLPTTHDSTISRERVLLAFAILRSLSIN
          AINA++ L           QN                          AY      + P+ +     ++ R+LPTTH  T+S++R+LL  ++L   SIN
Subjt:  PSAINALYNL-----------QNF----------------------PHAAYN--EMVVAPSDEQLSDAVRERMLPTTHDSTISRERVLLAFAILRSLSIN

Query:  VGRIIASEISGCWKKKVGKLFFPNTITMLCSKTGVPVDEGDVILFDKGIIDKPNLARL---------QRMKEVR---------QGGLIYGINTILEKLAL
        VGR+I SEI  C  +K G LFFP+ IT LC     P    +  L + G ID   +AR+         Q+    R          G ++  +  + ++L+ 
Subjt:  VGRIIASEISGCWKKKVGKLFFPNTITMLCSKTGVPVDEGDVILFDKGIIDKPNLARL---------QRMKEVR---------QGGLIYGINTILEKLAL

Query:  SASRQ-------EFAERQTLTFWNYVKNRDASLRRALQENFSKPL
           +Q       +   +Q   FW Y K RD +L++ALQ NF++P+
Subjt:  SASRQ-------EFAERQTLTFWNYVKNRDASLRRALQENFSKPL

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]1.3e-2132.84Show/hide
Query:  LKRDFLFECSFSGDLPQFLRTGIADHDWELFCAKPESVNAQVVREFYANIVKEDGFQVIVRGVEVDWSPSAINALYNLQN--FPHAAYNEMVVAPSDEQL
        ++++F+++ S   + P F+   I  H+W+LFCA PE     +VREFY N+   D   V +RGV+V  S  AIN +++L +    H+ + E +  P    +
Subjt:  LKRDFLFECSFSGDLPQFLRTGIADHDWELFCAKPESVNAQVVREFYANIVKEDGFQVIVRGVEVDWSPSAINALYNLQN--FPHAAYNEMVVAPSDEQL

Query:  SDAV---------------------------------RERMLPTTHDSTISRERVLLAFAILRSLSINVGRIIASEISGCWKKKVGKLFFPNTITMLCSK
         + V                                 + R+LPTTH  T+S+E V L +++L   SINVGR+I  EI  C  +K G LFFP+ IT +C  
Subjt:  SDAV---------------------------------RERMLPTTHDSTISRERVLLAFAILRSLSINVGRIIASEISGCWKKKVGKLFFPNTITMLCSK

Query:  TGVP
        T  P
Subjt:  TGVP

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]2.4e-2331.7Show/hide
Query:  VVREFYANIVKEDGFQVIVRGVEVDWSPSAINALYNLQN--FPHAAYNEMVVAPSDEQLSDAV---------------------------------RERM
        +VREFYAN+   +   + VRGV+V WS  AINA++ L +    H+ + E +  P    + + V                                 + R+
Subjt:  VVREFYANIVKEDGFQVIVRGVEVDWSPSAINALYNLQN--FPHAAYNEMVVAPSDEQLSDAV---------------------------------RERM

Query:  LPTTHDSTISRERVLLAFAILRSLSINVGRIIASEISGCWKKKVGKLFFPNTITMLCSKTGVPVDEGDVILFDKGIIDKPNLARL---------QRMKEV
        LPTTH   +S++R+LL  ++L   SINVGR+I SEI  C  +K G LFFP+ IT LC      V+E    L + G ID   +AR+         Q+    
Subjt:  LPTTHDSTISRERVLLAFAILRSLSINVGRIIASEISGCWKKKVGKLFFPNTITMLCSKTGVPVDEGDVILFDKGIIDKPNLARL---------QRMKEV

Query:  R---------QGGLIYGINTILEKLALSASRQEFAERQTLTFWNYVKNRDASLRRALQENFSKPL
        R          G ++  +  + ++L    S+QE   +Q   FW Y K RD +L++ALQ NF++P+
Subjt:  R---------QGGLIYGINTILEKLALSASRQEFAERQTLTFWNYVKNRDASLRRALQENFSKPL

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)1.0e-2232.17Show/hide
Query:  ASEEHDAIE-EQQLPFDRFANNFARAKYAKLLKRDFLFECSFS-GDLPQFLRTGIADHDWELFCAKPESVNAQVVREFYANIVKEDGFQVIVRGVEVDWS
        A + H A++ E +    R+ NN          ++ F+ + S + G LP F+   I  H+W+ FCA PE     +VREFYAN+       V VRGV+V WS
Subjt:  ASEEHDAIE-EQQLPFDRFANNFARAKYAKLLKRDFLFECSFS-GDLPQFLRTGIADHDWELFCAKPESVNAQVVREFYANIVKEDGFQVIVRGVEVDWS

Query:  PSAINALYNLQN--FPHAAYNEMV---------------------------------VAPSDEQLSDAVRERMLPTTHDSTISRERVLLAFAILRSLSIN
          AINA++ L +    H+ + E +                                 + P+ +     ++  +LPTTH  T+S++R+LL  ++L   SIN
Subjt:  PSAINALYNLQN--FPHAAYNEMV---------------------------------VAPSDEQLSDAVRERMLPTTHDSTISRERVLLAFAILRSLSIN

Query:  VGRIIASEISGCWKKKVGKLFFPNTITMLCSKTGVPVDEGDVILFDKGIIDKPNLARL
        VGR+I SEI  C  +K G LFFP+ IT LC     P    +  L + G ID   +AR+
Subjt:  VGRIIASEISGCWKKKVGKLFFPNTITMLCSKTGVPVDEGDVILFDKGIIDKPNLARL

A0A2P5BCG4 Uncharacterized protein (Fragment)2.2e-3030.43Show/hide
Query:  ASEEHDAIE-EQQLPFDRFANNFARAKYAKLLKRDFLFECSFS-GDLPQFLRTGIADHDWELFCAKPESVNAQVVREFYANIVKEDGFQVIVRGVEVDWS
        A + H A++ E +    R+ NN          ++ F+ + S + G LP F+   I  H+W+ FCA PE     +VREFYAN+   +   V VRGV+V WS
Subjt:  ASEEHDAIE-EQQLPFDRFANNFARAKYAKLLKRDFLFECSFS-GDLPQFLRTGIADHDWELFCAKPESVNAQVVREFYANIVKEDGFQVIVRGVEVDWS

Query:  PSAINALYNL-----------QNF----------------------PHAAYN--EMVVAPSDEQLSDAVRERMLPTTHDSTISRERVLLAFAILRSLSIN
          AINA++ L           QN                          AY      + P+ +     ++ R+LPTTH  T+S++R+LL  ++L   SIN
Subjt:  PSAINALYNL-----------QNF----------------------PHAAYN--EMVVAPSDEQLSDAVRERMLPTTHDSTISRERVLLAFAILRSLSIN

Query:  VGRIIASEISGCWKKKVGKLFFPNTITMLCSKTGVPVDEGDVILFDKGIIDKPNLARL---------QRMKEVR---------QGGLIYGINTILEKLAL
        VGR+I SEI  C  +K G LFFP+ IT LC     P    +  L + G ID   +AR+         Q+    R          G ++  +  + ++L+ 
Subjt:  VGRIIASEISGCWKKKVGKLFFPNTITMLCSKTGVPVDEGDVILFDKGIIDKPNLARL---------QRMKEVR---------QGGLIYGINTILEKLAL

Query:  SASRQ-------EFAERQTLTFWNYVKNRDASLRRALQENFSKPL
           +Q       +   +Q   FW Y K RD +L++ALQ NF++P+
Subjt:  SASRQ-------EFAERQTLTFWNYVKNRDASLRRALQENFSKPL

A0A2P5DAQ2 Uncharacterized protein6.5e-2232.84Show/hide
Query:  LKRDFLFECSFSGDLPQFLRTGIADHDWELFCAKPESVNAQVVREFYANIVKEDGFQVIVRGVEVDWSPSAINALYNLQN--FPHAAYNEMVVAPSDEQL
        ++++F+++ S   + P F+   I  H+W+LFCA PE     +VREFY N+   D   V +RGV+V  S  AIN +++L +    H+ + E +  P    +
Subjt:  LKRDFLFECSFSGDLPQFLRTGIADHDWELFCAKPESVNAQVVREFYANIVKEDGFQVIVRGVEVDWSPSAINALYNLQN--FPHAAYNEMVVAPSDEQL

Query:  SDAV---------------------------------RERMLPTTHDSTISRERVLLAFAILRSLSINVGRIIASEISGCWKKKVGKLFFPNTITMLCSK
         + V                                 + R+LPTTH  T+S+E V L +++L   SINVGR+I  EI  C  +K G LFFP+ IT +C  
Subjt:  SDAV---------------------------------RERMLPTTHDSTISRERVLLAFAILRSLSINVGRIIASEISGCWKKKVGKLFFPNTITMLCSK

Query:  TGVP
        T  P
Subjt:  TGVP

A0A2P5DXM3 Uncharacterized protein1.2e-2331.7Show/hide
Query:  VVREFYANIVKEDGFQVIVRGVEVDWSPSAINALYNLQN--FPHAAYNEMVVAPSDEQLSDAV---------------------------------RERM
        +VREFYAN+   +   + VRGV+V WS  AINA++ L +    H+ + E +  P    + + V                                 + R+
Subjt:  VVREFYANIVKEDGFQVIVRGVEVDWSPSAINALYNLQN--FPHAAYNEMVVAPSDEQLSDAV---------------------------------RERM

Query:  LPTTHDSTISRERVLLAFAILRSLSINVGRIIASEISGCWKKKVGKLFFPNTITMLCSKTGVPVDEGDVILFDKGIIDKPNLARL---------QRMKEV
        LPTTH   +S++R+LL  ++L   SINVGR+I SEI  C  +K G LFFP+ IT LC      V+E    L + G ID   +AR+         Q+    
Subjt:  LPTTHDSTISRERVLLAFAILRSLSINVGRIIASEISGCWKKKVGKLFFPNTITMLCSKTGVPVDEGDVILFDKGIIDKPNLARL---------QRMKEV

Query:  R---------QGGLIYGINTILEKLALSASRQEFAERQTLTFWNYVKNRDASLRRALQENFSKPL
        R          G ++  +  + ++L    S+QE   +Q   FW Y K RD +L++ALQ NF++P+
Subjt:  R---------QGGLIYGINTILEKLALSASRQEFAERQTLTFWNYVKNRDASLRRALQENFSKPL

A0A6A2ZUE4 Uncharacterized protein7.2e-2125.71Show/hide
Query:  FANNFARAKYAKLLKRDFLFECSF------SGDLPQFLRTGIADHDWELFCAKPESVNAQVVREFYANIVKEDGFQVIVRGVEVDWSPSAINALYNL---
        F +  A+  Y  +  R   FE  F      + +L   +   +  H W+ F   P  VNA +V+EFY+NI + +   V+VRG+ + ++P+AIN  + L   
Subjt:  FANNFARAKYAKLLKRDFLFECSF------SGDLPQFLRTGIADHDWELFCAKPESVNAQVVREFYANIVKEDGFQVIVRGVEVDWSPSAINALYNL---

Query:  --------QNFPHAAYNEMV------------------------VAPSDEQLSDAVRERMLPTTHDSTISRERVLLAFAILRSLSINVGRIIASEISGCW
                Q   H  Y  ++                        + P  +  +  ++ +++PT+H++T+S +R+LL  +IL   +I++G+II      C 
Subjt:  --------QNFPHAAYNEMV------------------------VAPSDEQLSDAVRERMLPTTHDSTISRERVLLAFAILRSLSINVGRIIASEISGCW

Query:  KKKVGKLFFPNTITMLCSKTGVPVDEGDVILFDKGIIDKPNLARLQRMKE------------VRQGGLIYGINTILEKLALSASRQEFAE--RQTLTFWN
        K++   L FPN IT LC K  V  +  D IL     ++K  +  L   KE            V     +   +T LE+ A+  + Q   +   + + ++ 
Subjt:  KKKVGKLFFPNTITMLCSKTGVPVDEGDVILFDKGIIDKPNLARLQRMKE------------VRQGGLIYGINTILEKLALSASRQEFAE--RQTLTFWN

Query:  YVKNRDASLRRALQENFSK
        Y K RDA L  AL E+  +
Subjt:  YVKNRDASLRRALQENFSK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCAAAACAAGAGCTAGGAAAGAGAGGGAGAGTGAAGAGGAGGAGGTACCGGTCACGCCGGAAGTGCAAAAAGGGAAAACCAAGAAAAAAAGAACGTCAGAGGAAAA
GGAAGCAAAGAAAAGGAGAAGGCAGCAAAGGGCTGCAGAGCAGGAGGAAGTTCAGGAGGTGGCAGAGGTTGTTGCCACTGCAGCGGATGAAGGAAATACTCAAGAACCTA
AAGTGAAAAACCCGGTTACGATTCAAGAAGAGAATGTTAGGGAAAATCAAGAAACAGAGACTGACGAAGAGGCTCGGGTTGAAGTCATCATGCCTGAACCACCAAAGAAG
CGCCGCATTAAGCGGAAGGCAGGCCGCATCAGGCGCAGGGCGGAAAAGGGCAAAAACATTGCTGAAGCATCGGAGGAACACGATGCAATAGAAGAACAACAGTTACCATT
TGATCGCTTCGCCAATAATTTTGCCAGAGCAAAGTACGCTAAGCTTCTGAAAAGAGATTTCTTGTTTGAGTGCAGTTTTAGTGGCGATCTTCCACAGTTTCTGAGGACCG
GTATTGCAGACCACGACTGGGAGCTGTTTTGTGCGAAGCCTGAGTCTGTAAACGCACAGGTGGTGCGTGAATTTTATGCTAACATTGTCAAGGAGGATGGATTCCAGGTA
ATTGTTCGAGGAGTCGAGGTGGATTGGAGTCCTAGTGCTATCAATGCACTGTACAATCTTCAAAACTTTCCCCATGCGGCATATAATGAGATGGTTGTGGCGCCATCTGA
TGAGCAACTAAGTGATGCGGTGCGGGAGAGGATGCTTCCAACAACACATGACTCGACAATCTCCAGGGAACGGGTTCTCCTAGCTTTTGCCATCTTGCGGTCTCTCAGTA
TTAACGTAGGGAGGATCATTGCGAGTGAAATTTCTGGTTGCTGGAAAAAGAAGGTGGGGAAGTTGTTCTTCCCAAATACAATTACAATGCTTTGCAGTAAAACAGGGGTT
CCAGTGGATGAGGGAGATGTGATCCTGTTTGACAAAGGGATCATCGACAAGCCCAATTTGGCACGGCTCCAGCGCATGAAGGAGGTCCGTCAAGGTGGGCTTATTTACGG
CATCAACACGATTCTAGAAAAACTGGCACTTTCGGCCAGTAGGCAGGAGTTTGCTGAAAGGCAAACTTTAACCTTCTGGAACTATGTTAAGAATCGGGATGCCAGCTTAA
GAAGGGCACTGCAAGAGAATTTTTCCAAACCCCTAGTTGGTGATGAGCTTGAGGCATGGGTATACTGCACCATAAAGTGGGTCATCCCATGCTTAAGAGCTTATGACTGT
AAGGCTGCTTTAAGTCTGATAAACAAGGATATAAACCCTTTGAAAATATGTTTTGAAATGTCTGATAATAGAGCTAGGCCGTGGAATCATTTTGCTGCAGCAGAGTTCGG
TTTTGCGGAGTGCTCAGTTCAAGTTTGGGCGACTGGGGGGAGCAAATTCTGTGCTGCAGCAAAACTGGGAGAAGAAACTGCCACATCACAGCTCGTTAGCCAACTTCATG
AACCGACTTTTGTTGAGTTATTCTCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGACCAAAACAAGAGCTAGGAAAGAGAGGGAGAGTGAAGAGGAGGAGGTACCGGTCACGCCGGAAGTGCAAAAAGGGAAAACCAAGAAAAAAAGAACGTCAGAGGAAAA
GGAAGCAAAGAAAAGGAGAAGGCAGCAAAGGGCTGCAGAGCAGGAGGAAGTTCAGGAGGTGGCAGAGGTTGTTGCCACTGCAGCGGATGAAGGAAATACTCAAGAACCTA
AAGTGAAAAACCCGGTTACGATTCAAGAAGAGAATGTTAGGGAAAATCAAGAAACAGAGACTGACGAAGAGGCTCGGGTTGAAGTCATCATGCCTGAACCACCAAAGAAG
CGCCGCATTAAGCGGAAGGCAGGCCGCATCAGGCGCAGGGCGGAAAAGGGCAAAAACATTGCTGAAGCATCGGAGGAACACGATGCAATAGAAGAACAACAGTTACCATT
TGATCGCTTCGCCAATAATTTTGCCAGAGCAAAGTACGCTAAGCTTCTGAAAAGAGATTTCTTGTTTGAGTGCAGTTTTAGTGGCGATCTTCCACAGTTTCTGAGGACCG
GTATTGCAGACCACGACTGGGAGCTGTTTTGTGCGAAGCCTGAGTCTGTAAACGCACAGGTGGTGCGTGAATTTTATGCTAACATTGTCAAGGAGGATGGATTCCAGGTA
ATTGTTCGAGGAGTCGAGGTGGATTGGAGTCCTAGTGCTATCAATGCACTGTACAATCTTCAAAACTTTCCCCATGCGGCATATAATGAGATGGTTGTGGCGCCATCTGA
TGAGCAACTAAGTGATGCGGTGCGGGAGAGGATGCTTCCAACAACACATGACTCGACAATCTCCAGGGAACGGGTTCTCCTAGCTTTTGCCATCTTGCGGTCTCTCAGTA
TTAACGTAGGGAGGATCATTGCGAGTGAAATTTCTGGTTGCTGGAAAAAGAAGGTGGGGAAGTTGTTCTTCCCAAATACAATTACAATGCTTTGCAGTAAAACAGGGGTT
CCAGTGGATGAGGGAGATGTGATCCTGTTTGACAAAGGGATCATCGACAAGCCCAATTTGGCACGGCTCCAGCGCATGAAGGAGGTCCGTCAAGGTGGGCTTATTTACGG
CATCAACACGATTCTAGAAAAACTGGCACTTTCGGCCAGTAGGCAGGAGTTTGCTGAAAGGCAAACTTTAACCTTCTGGAACTATGTTAAGAATCGGGATGCCAGCTTAA
GAAGGGCACTGCAAGAGAATTTTTCCAAACCCCTAGTTGGTGATGAGCTTGAGGCATGGGTATACTGCACCATAAAGTGGGTCATCCCATGCTTAAGAGCTTATGACTGT
AAGGCTGCTTTAAGTCTGATAAACAAGGATATAAACCCTTTGAAAATATGTTTTGAAATGTCTGATAATAGAGCTAGGCCGTGGAATCATTTTGCTGCAGCAGAGTTCGG
TTTTGCGGAGTGCTCAGTTCAAGTTTGGGCGACTGGGGGGAGCAAATTCTGTGCTGCAGCAAAACTGGGAGAAGAAACTGCCACATCACAGCTCGTTAGCCAACTTCATG
AACCGACTTTTGTTGAGTTATTCTCGTGA
Protein sequenceShow/hide protein sequence
MTKTRARKERESEEEEVPVTPEVQKGKTKKKRTSEEKEAKKRRRQQRAAEQEEVQEVAEVVATAADEGNTQEPKVKNPVTIQEENVRENQETETDEEARVEVIMPEPPKK
RRIKRKAGRIRRRAEKGKNIAEASEEHDAIEEQQLPFDRFANNFARAKYAKLLKRDFLFECSFSGDLPQFLRTGIADHDWELFCAKPESVNAQVVREFYANIVKEDGFQV
IVRGVEVDWSPSAINALYNLQNFPHAAYNEMVVAPSDEQLSDAVRERMLPTTHDSTISRERVLLAFAILRSLSINVGRIIASEISGCWKKKVGKLFFPNTITMLCSKTGV
PVDEGDVILFDKGIIDKPNLARLQRMKEVRQGGLIYGINTILEKLALSASRQEFAERQTLTFWNYVKNRDASLRRALQENFSKPLVGDELEAWVYCTIKWVIPCLRAYDC
KAALSLINKDINPLKICFEMSDNRARPWNHFAAAEFGFAECSVQVWATGGSKFCAAAKLGEETATSQLVSQLHEPTFVELFS