; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg019710 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg019710
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRT_RNaseH_2 domain-containing protein
Genome locationscaffold5:32235539..32237209
RNA-Seq ExpressionSpg019710
SyntenySpg019710
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8680640.1 hypothetical protein F3Y22_tig00111372pilonHSYRG00020 [Hibiscus syriacus]2.8e-1328.94Show/hide
Query:  SYVRFVNNLARAKYAELLKRDFLFERGF------SGDLLHFLRAGITDHGWELFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPGAINALFN
        S+ +F ++ A+A++    K+   FE GF       G     +   +T   W+ F   P SVNA VV+EFYANI K       VRG ++ ++P AI   F+
Subjt:  SYVRFVNNLARAKYAELLKRDFLFERGF------SGDLLHFLRAGITDHGWELFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPGAINALFN

Query:  LRD-------------------------FPHTAYNEMAVAP-------------------SNEQLSDAVREVVSRERVLLAFAILRSLSIDVGKIIASEI
        L+D                         F +T +N    +                     ++ +  +    VS  R+LL  +I  S  IDVG+II  ++
Subjt:  LRD-------------------------FPHTAYNEMAVAP-------------------SNEQLSDAVREVVSRERVLLAFAILRSLSIDVGKIIASEI

Query:  SGCWKKKVGKLFFPNTITTHC--TGVPENEGDVIL
        + C  KK   L FPN IT  C    V EN  D IL
Subjt:  SGCWKKKVGKLFFPNTITTHC--TGVPENEGDVIL

KAE8695166.1 hypothetical protein F3Y22_tig00110733pilonHSYRG00282 [Hibiscus syriacus]2.1e-1325.71Show/hide
Query:  FVNNLARAKYAELLKRDFLFERGF------SGDLLHFLRAGITDHGWELFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPGAINALFNLR--
        FV+  A+  Y  +  R   FE GF      + +L   +   +T H W+ F   P  VNA +V+EFY+NI +      +VRGI + ++P AIN  F L+  
Subjt:  FVNNLARAKYAELLKRDFLFERGF------SGDLLHFLRAGITDHGWELFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPGAINALFNLR--

Query:  -----------------------DFPHTAYNEMAVA-------------------PSNEQLSDAVREVVSRERVLLAFAILRSLSIDVGKIIASEISGCW
                                 P T +N   +                      ++ +  +    VS +R+LL  +IL   +ID+GKII      C 
Subjt:  -----------------------DFPHTAYNEMAVA-------------------PSNEQLSDAVREVVSRERVLLAFAILRSLSIDVGKIIASEISGCW

Query:  KKKVGKLFFPNTITTHC--TGVPENEGDVILFDKGIIDTPTLARLQRTQEA------------CQGGLVYGINTILEQLALSASRQEFAE--RQALTFWN
        K++   L FPN IT  C    V E   D IL     ++   +  L   +EA                 V   +T LEQ A+  + Q   +   + + ++ 
Subjt:  KKKVGKLFFPNTITTHC--TGVPENEGDVILFDKGIIDTPTLARLQRTQEA------------CQGGLVYGINTILEQLALSASRQEFAE--RQALTFWN

Query:  YVRSRDANLKKALEENFSK
        Y + RDA L  AL E+  +
Subjt:  YVRSRDANLKKALEENFSK

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]3.5e-1631.62Show/hide
Query:  VRFVNNLARAKYA-ELLKRDFLFERGFSGD------LLHFLRAGITDHGWELFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPGAINALFNL
        V+F    A  +Y   +  R    E+GF  D       L F+   IT H W+ FC+ PE     +VREFYAN+         VRG++V WS  AINA+F L
Subjt:  VRFVNNLARAKYA-ELLKRDFLFERGFSGD------LLHFLRAGITDHGWELFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPGAINALFNL

Query:  RD--FPHTAYNE-------------MAVAPS-------------NEQLSDAVR----------------EVVSRERVLLAFAILRSLSIDVGKIIASEIS
         D    H+ + E             +AVA +                L+ A +                + VS++R+LL  ++L   SI+VG++I SEI 
Subjt:  RD--FPHTAYNE-------------MAVAPS-------------NEQLSDAVR----------------EVVSRERVLLAFAILRSLSIDVGKIIASEIS

Query:  GCWKKKVGKLFFPNTITTHCTG--VPENEGDVILFDKGIIDTPTLARLQRTQE
         C  +K G LFFP+ IT  C     P    +  L + G ID   +AR+  TQE
Subjt:  GCWKKKVGKLFFPNTITTHCTG--VPENEGDVILFDKGIIDTPTLARLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]4.1e-2527.91Show/hide
Query:  VRFVNNLARAKYA-ELLKRDFLFERGFSGD------LLHFLRAGITDHGWELFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPGAINALFNL
        V+F    A  +Y   +  R    E+GF  D       L F+   IT H W+ FC+ PE     +VREFYAN+   E     VRG++V WS  AINA+F L
Subjt:  VRFVNNLARAKYA-ELLKRDFLFERGFSGD------LLHFLRAGITDHGWELFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPGAINALFNL

Query:  RDFPHTAYNEMAVAPSNEQLSDAVREV---------------------------------------------VSRERVLLAFAILRSLSIDVGKIIASEI
         D P   ++E     + + L   +  V                                             VS++R+LL  ++L   SI+VG++I SEI
Subjt:  RDFPHTAYNEMAVAPSNEQLSDAVREV---------------------------------------------VSRERVLLAFAILRSLSIDVGKIIASEI

Query:  SGCWKKKVGKLFFPNTITTHCTG--VPENEGDVILFDKGIIDTPTLARL------QRTQE------------ACQGGLVYGINTILEQLALSASRQ----
          C  +K G LFFP+ IT  C     P    +  L + G ID   +AR+      + TQ+               G ++  +  + ++L+    +Q    
Subjt:  SGCWKKKVGKLFFPNTITTHCTG--VPENEGDVILFDKGIIDTPTLARL------QRTQE------------ACQGGLVYGINTILEQLALSASRQ----

Query:  ---EFAERQALTFWNYVRSRDANLKKALEENFSKPYPALPIFPDDLLNPWIPPPPVEREGDEEEDPGQE
           +   +Q   FW Y + RD  LKKAL+ NF++P P  P FP ++L         E E + ++D   E
Subjt:  ---EFAERQALTFWNYVRSRDANLKKALEENFSKPYPALPIFPDDLLNPWIPPPPVEREGDEEEDPGQE

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]1.2e-2130.48Show/hide
Query:  VVREFYANIDKEEGFLAIVRGIEVDWSPGAINALFNLRD--FPHTAYNEMAVAPS--------------------------NEQLSDAVR----------
        +VREFYAN+   E     VRG++V WS  AINA+F L D    H+ + E    P                              L+ A +          
Subjt:  VVREFYANIDKEEGFLAIVRGIEVDWSPGAINALFNLRD--FPHTAYNEMAVAPS--------------------------NEQLSDAVR----------

Query:  ------EVVSRERVLLAFAILRSLSIDVGKIIASEISGCWKKKVGKLFFPNTITTHCTGVPENEGDVILFDKGIIDTPTLARL------QRTQE------
              ++VS++R+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT  C   P    +  L + G ID   +AR+      + TQ+      
Subjt:  ------EVVSRERVLLAFAILRSLSIDVGKIIASEISGCWKKKVGKLFFPNTITTHCTGVPENEGDVILFDKGIIDTPTLARL------QRTQE------

Query:  -ACQGGLVYG-INTILEQLALSASRQEFAERQALTFWNYVRSRDANLKKALEENFSKPYPALPIFPDDLLNPWIPPPPVEREGDEEEDPGQE
         A       G +   L+ L    S+QE   +Q   FW Y + RD  LKKAL+ NF++P P  P FP ++L         E E + ++D   E
Subjt:  -ACQGGLVYG-INTILEQLALSASRQEFAERQALTFWNYVRSRDANLKKALEENFSKPYPALPIFPDDLLNPWIPPPPVEREGDEEEDPGQE

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)1.7e-1631.62Show/hide
Query:  VRFVNNLARAKYA-ELLKRDFLFERGFSGD------LLHFLRAGITDHGWELFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPGAINALFNL
        V+F    A  +Y   +  R    E+GF  D       L F+   IT H W+ FC+ PE     +VREFYAN+         VRG++V WS  AINA+F L
Subjt:  VRFVNNLARAKYA-ELLKRDFLFERGFSGD------LLHFLRAGITDHGWELFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPGAINALFNL

Query:  RD--FPHTAYNE-------------MAVAPS-------------NEQLSDAVR----------------EVVSRERVLLAFAILRSLSIDVGKIIASEIS
         D    H+ + E             +AVA +                L+ A +                + VS++R+LL  ++L   SI+VG++I SEI 
Subjt:  RD--FPHTAYNE-------------MAVAPS-------------NEQLSDAVR----------------EVVSRERVLLAFAILRSLSIDVGKIIASEIS

Query:  GCWKKKVGKLFFPNTITTHCTG--VPENEGDVILFDKGIIDTPTLARLQRTQE
         C  +K G LFFP+ IT  C     P    +  L + G ID   +AR+  TQE
Subjt:  GCWKKKVGKLFFPNTITTHCTG--VPENEGDVILFDKGIIDTPTLARLQRTQE

A0A2P5BCG4 Uncharacterized protein (Fragment)2.0e-2527.91Show/hide
Query:  VRFVNNLARAKYA-ELLKRDFLFERGFSGD------LLHFLRAGITDHGWELFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPGAINALFNL
        V+F    A  +Y   +  R    E+GF  D       L F+   IT H W+ FC+ PE     +VREFYAN+   E     VRG++V WS  AINA+F L
Subjt:  VRFVNNLARAKYA-ELLKRDFLFERGFSGD------LLHFLRAGITDHGWELFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPGAINALFNL

Query:  RDFPHTAYNEMAVAPSNEQLSDAVREV---------------------------------------------VSRERVLLAFAILRSLSIDVGKIIASEI
         D P   ++E     + + L   +  V                                             VS++R+LL  ++L   SI+VG++I SEI
Subjt:  RDFPHTAYNEMAVAPSNEQLSDAVREV---------------------------------------------VSRERVLLAFAILRSLSIDVGKIIASEI

Query:  SGCWKKKVGKLFFPNTITTHCTG--VPENEGDVILFDKGIIDTPTLARL------QRTQE------------ACQGGLVYGINTILEQLALSASRQ----
          C  +K G LFFP+ IT  C     P    +  L + G ID   +AR+      + TQ+               G ++  +  + ++L+    +Q    
Subjt:  SGCWKKKVGKLFFPNTITTHCTG--VPENEGDVILFDKGIIDTPTLARL------QRTQE------------ACQGGLVYGINTILEQLALSASRQ----

Query:  ---EFAERQALTFWNYVRSRDANLKKALEENFSKPYPALPIFPDDLLNPWIPPPPVEREGDEEEDPGQE
           +   +Q   FW Y + RD  LKKAL+ NF++P P  P FP ++L         E E + ++D   E
Subjt:  ---EFAERQALTFWNYVRSRDANLKKALEENFSKPYPALPIFPDDLLNPWIPPPPVEREGDEEEDPGQE

A0A2P5DXM3 Uncharacterized protein6.0e-2230.48Show/hide
Query:  VVREFYANIDKEEGFLAIVRGIEVDWSPGAINALFNLRD--FPHTAYNEMAVAPS--------------------------NEQLSDAVR----------
        +VREFYAN+   E     VRG++V WS  AINA+F L D    H+ + E    P                              L+ A +          
Subjt:  VVREFYANIDKEEGFLAIVRGIEVDWSPGAINALFNLRD--FPHTAYNEMAVAPS--------------------------NEQLSDAVR----------

Query:  ------EVVSRERVLLAFAILRSLSIDVGKIIASEISGCWKKKVGKLFFPNTITTHCTGVPENEGDVILFDKGIIDTPTLARL------QRTQE------
              ++VS++R+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT  C   P    +  L + G ID   +AR+      + TQ+      
Subjt:  ------EVVSRERVLLAFAILRSLSIDVGKIIASEISGCWKKKVGKLFFPNTITTHCTGVPENEGDVILFDKGIIDTPTLARL------QRTQE------

Query:  -ACQGGLVYG-INTILEQLALSASRQEFAERQALTFWNYVRSRDANLKKALEENFSKPYPALPIFPDDLLNPWIPPPPVEREGDEEEDPGQE
         A       G +   L+ L    S+QE   +Q   FW Y + RD  LKKAL+ NF++P P  P FP ++L         E E + ++D   E
Subjt:  -ACQGGLVYG-INTILEQLALSASRQEFAERQALTFWNYVRSRDANLKKALEENFSKPYPALPIFPDDLLNPWIPPPPVEREGDEEEDPGQE

A0A6A2YMQ9 Uncharacterized protein1.3e-1328.94Show/hide
Query:  SYVRFVNNLARAKYAELLKRDFLFERGF------SGDLLHFLRAGITDHGWELFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPGAINALFN
        S+ +F ++ A+A++    K+   FE GF       G     +   +T   W+ F   P SVNA VV+EFYANI K       VRG ++ ++P AI   F+
Subjt:  SYVRFVNNLARAKYAELLKRDFLFERGF------SGDLLHFLRAGITDHGWELFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPGAINALFN

Query:  LRD-------------------------FPHTAYNEMAVAP-------------------SNEQLSDAVREVVSRERVLLAFAILRSLSIDVGKIIASEI
        L+D                         F +T +N    +                     ++ +  +    VS  R+LL  +I  S  IDVG+II  ++
Subjt:  LRD-------------------------FPHTAYNEMAVAP-------------------SNEQLSDAVREVVSRERVLLAFAILRSLSIDVGKIIASEI

Query:  SGCWKKKVGKLFFPNTITTHC--TGVPENEGDVIL
        + C  KK   L FPN IT  C    V EN  D IL
Subjt:  SGCWKKKVGKLFFPNTITTHC--TGVPENEGDVIL

A0A6A3BU96 Uncharacterized protein2.3e-1326.26Show/hide
Query:  LSYVRFVNNLARAKYAELLKRDFLFERGF------SGDLLHFLRAGITDHGWELFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPGAINALF
        +++ +F N+ A+A++     R+  FE GF       G     +   +    W  F   P SVNA +V+EFYANI K       VRG ++ ++  AIN  F
Subjt:  LSYVRFVNNLARAKYAELLKRDFLFERGF------SGDLLHFLRAGITDHGWELFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPGAINALF

Query:  NLRD-------------------------FPHTAYN---EMAVAPSNEQLSDAVR----------------EVVSRERVLLAFAILRSLSIDVGKIIASE
        +L++                         F +T +N       + + E+L    +                  VS  R+LL  +++ S  IDVG+II  +
Subjt:  NLRD-------------------------FPHTAYN---EMAVAPSNEQLSDAVR----------------EVVSRERVLLAFAILRSLSIDVGKIIASE

Query:  ISGCWKKKVGKLFFPNTITTHC--TGVPENEGDVILFDKGIIDTPTLARLQRTQEACQGGLVY-------GINTILEQLAL-SASRQEFAERQAL-----
        +  C  KK   L FPN IT  C    V EN  D IL     I    L  L   +       V+         N  +  LAL  A  Q  A+  AL     
Subjt:  ISGCWKKKVGKLFFPNTITTHC--TGVPENEGDVILFDKGIIDTPTLARLQRTQEACQGGLVY-------GINTILEQLAL-SASRQEFAERQAL-----

Query:  TFWNYVRSRDANLKKALEENFSKPYPALPIFPDDLLNPWIPPPPVEREGDEEEDPGQE
         F+ YV+ RD  ++   +E         P FPD++L  +      E E D  + P  +
Subjt:  TFWNYVRSRDANLKKALEENFSKPYPALPIFPDDLLNPWIPPPPVEREGDEEEDPGQE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTAAAACGAGAGCAAGAAAAGAGAGAGATAATGAGGAAGAGGAGGTCCCTGTTACCCCCGAAGCACTGAAAGTTAAAACGAAGAAGAAGAAAACACCAGAAGAAAC
AGAGGCTAAAAGGAGAAGAAGACAGCAGAGGGCTGAAGATCAAGAAGCTGTTCAGAAAGCGGCGGAGGATGTTGTTGTGGAAGAAGATCCGAAAGAACCAGAAGGACAGA
ATCCAGAGCAGACTGACCCGATAGTTGCGGATACAGAGGCAGTTCGAGAAGAAAATGCAGAAGAAGTTCAAGAAAAGCAGACTGAGAATGTGCAAGAAGAACAGACAGAG
GTTGCGCCTGAAGAAGTTAATGAGCAAGAAAAGGAGGCTCGTGTGGAGGTGATCATGCCAAAAGTGCCCAAACGCCGCCGTATAAAGCGAAAAGCGGGCCGTGTTAAGAA
GAAAGAGGCCGAGGAAAAAGGAAGAGAAGAAGCAAGGAGAAAGGCTGAAGAAGAAAGGTTGCTAAAGCGAAGGGCAGACAAGGGCAAAAGTGTTGCTGCGACATCGGAGG
AACCTGACGAAATAGAAGATCCGCAATTGTCATATGTCCGCTTCGTCAACAACCTTGCTAGAGCAAAGTATGCTGAGTTGCTGAAGAGAGACTTCCTGTTTGAGAGAGGA
TTCAGTGGTGATCTTCTGCATTTTCTGAGGGCCGGCATTACGGACCACGGCTGGGAGTTGTTTTGTTCAAAGCCTGAATCTGTGAATGCGCAGGTGGTGCGCGAGTTTTA
TGCGAATATTGACAAAGAAGAAGGTTTCCTAGCGATCGTTCGAGGTATTGAGGTCGACTGGAGTCCTGGTGCTATTAACGCCCTGTTCAACCTTCGCGATTTCCCCCACA
CAGCATATAATGAGATGGCTGTGGCGCCATCTAATGAGCAGTTGAGTGACGCTGTGAGGGAAGTTGTTTCTAGGGAAAGGGTTCTTCTGGCTTTCGCGATTTTGAGGTCT
CTCAGCATTGATGTAGGGAAGATTATTGCTAGTGAAATTTCTGGATGTTGGAAGAAGAAAGTGGGGAAACTGTTTTTTCCGAATACGATTACCACGCATTGCACAGGGGT
TCCAGAGAATGAAGGTGATGTTATTTTATTTGACAAGGGGATCATTGACACGCCTACCTTGGCGCGGCTTCAGCGTACGCAAGAGGCATGCCAGGGAGGGCTTGTCTATG
GCATCAACACGATTTTAGAACAACTTGCACTGTCGGCCAGCAGGCAAGAGTTTGCCGAGAGGCAAGCTTTAACCTTCTGGAACTATGTTAGAAGTCGTGATGCCAACTTG
AAGAAGGCCCTAGAGGAAAATTTTTCCAAACCTTATCCGGCCTTACCCATATTCCCTGATGATCTACTGAACCCCTGGATTCCGCCACCGCCTGTCGAGAGAGAAGGAGA
TGAAGAAGAAGATCCTGGTCAGGAGGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTAAAACGAGAGCAAGAAAAGAGAGAGATAATGAGGAAGAGGAGGTCCCTGTTACCCCCGAAGCACTGAAAGTTAAAACGAAGAAGAAGAAAACACCAGAAGAAAC
AGAGGCTAAAAGGAGAAGAAGACAGCAGAGGGCTGAAGATCAAGAAGCTGTTCAGAAAGCGGCGGAGGATGTTGTTGTGGAAGAAGATCCGAAAGAACCAGAAGGACAGA
ATCCAGAGCAGACTGACCCGATAGTTGCGGATACAGAGGCAGTTCGAGAAGAAAATGCAGAAGAAGTTCAAGAAAAGCAGACTGAGAATGTGCAAGAAGAACAGACAGAG
GTTGCGCCTGAAGAAGTTAATGAGCAAGAAAAGGAGGCTCGTGTGGAGGTGATCATGCCAAAAGTGCCCAAACGCCGCCGTATAAAGCGAAAAGCGGGCCGTGTTAAGAA
GAAAGAGGCCGAGGAAAAAGGAAGAGAAGAAGCAAGGAGAAAGGCTGAAGAAGAAAGGTTGCTAAAGCGAAGGGCAGACAAGGGCAAAAGTGTTGCTGCGACATCGGAGG
AACCTGACGAAATAGAAGATCCGCAATTGTCATATGTCCGCTTCGTCAACAACCTTGCTAGAGCAAAGTATGCTGAGTTGCTGAAGAGAGACTTCCTGTTTGAGAGAGGA
TTCAGTGGTGATCTTCTGCATTTTCTGAGGGCCGGCATTACGGACCACGGCTGGGAGTTGTTTTGTTCAAAGCCTGAATCTGTGAATGCGCAGGTGGTGCGCGAGTTTTA
TGCGAATATTGACAAAGAAGAAGGTTTCCTAGCGATCGTTCGAGGTATTGAGGTCGACTGGAGTCCTGGTGCTATTAACGCCCTGTTCAACCTTCGCGATTTCCCCCACA
CAGCATATAATGAGATGGCTGTGGCGCCATCTAATGAGCAGTTGAGTGACGCTGTGAGGGAAGTTGTTTCTAGGGAAAGGGTTCTTCTGGCTTTCGCGATTTTGAGGTCT
CTCAGCATTGATGTAGGGAAGATTATTGCTAGTGAAATTTCTGGATGTTGGAAGAAGAAAGTGGGGAAACTGTTTTTTCCGAATACGATTACCACGCATTGCACAGGGGT
TCCAGAGAATGAAGGTGATGTTATTTTATTTGACAAGGGGATCATTGACACGCCTACCTTGGCGCGGCTTCAGCGTACGCAAGAGGCATGCCAGGGAGGGCTTGTCTATG
GCATCAACACGATTTTAGAACAACTTGCACTGTCGGCCAGCAGGCAAGAGTTTGCCGAGAGGCAAGCTTTAACCTTCTGGAACTATGTTAGAAGTCGTGATGCCAACTTG
AAGAAGGCCCTAGAGGAAAATTTTTCCAAACCTTATCCGGCCTTACCCATATTCCCTGATGATCTACTGAACCCCTGGATTCCGCCACCGCCTGTCGAGAGAGAAGGAGA
TGAAGAAGAAGATCCTGGTCAGGAGGATTGA
Protein sequenceShow/hide protein sequence
MAKTRARKERDNEEEEVPVTPEALKVKTKKKKTPEETEAKRRRRQQRAEDQEAVQKAAEDVVVEEDPKEPEGQNPEQTDPIVADTEAVREENAEEVQEKQTENVQEEQTE
VAPEEVNEQEKEARVEVIMPKVPKRRRIKRKAGRVKKKEAEEKGREEARRKAEEERLLKRRADKGKSVAATSEEPDEIEDPQLSYVRFVNNLARAKYAELLKRDFLFERG
FSGDLLHFLRAGITDHGWELFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPGAINALFNLRDFPHTAYNEMAVAPSNEQLSDAVREVVSRERVLLAFAILRS
LSIDVGKIIASEISGCWKKKVGKLFFPNTITTHCTGVPENEGDVILFDKGIIDTPTLARLQRTQEACQGGLVYGINTILEQLALSASRQEFAERQALTFWNYVRSRDANL
KKALEENFSKPYPALPIFPDDLLNPWIPPPPVEREGDEEEDPGQED