; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg021570 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg021570
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold9:7442433..7446600
RNA-Seq ExpressionSpg021570
SyntenySpg021570
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB53755.1 hypothetical protein L484_022412 [Morus notabilis]9.4e-1631.87Show/hide
Query:  PHFLRTSIANHGWESFCSKPE---------------------------QVDWSPGAINALYHLQDFPHAAYNEMAVAPSNEQLSDAVREVGVEGAQWQLS
        P F+   I  HGW  FC  P                            +V ++  AIN+++ L++     Y + A   ++EQL   + EV +EGA WQ+S
Subjt:  PHFLRTSIANHGWESFCSKPE---------------------------QVDWSPGAINALYHLQDFPHAAYNEMAVAPSNEQLSDAVREVGVEGAQWQLS

Query:  KAEKRTFQSAYLKREANTWMGFIRQRLLPTTHDSTVSRERVLLVFAILRSLSIDVGKIISSEIFGC-WRKKVGKLFFPNTIT
             T     LKR A  W  F+  R +P+TH  TV+++RVLL+++IL  +S+++ +I   EI  C   +K G L+FP+ IT
Subjt:  KAEKRTFQSAYLKREANTWMGFIRQRLLPTTHDSTVSRERVLLVFAILRSLSIDVGKIISSEIFGC-WRKKVGKLFFPNTIT

EXB93492.1 hypothetical protein L484_006967 [Morus notabilis]9.1e-1932.76Show/hide
Query:  KYAEFLKRDFLFERGFSGDLPHFLRTSIANHGWESFCSKPEQVDWSPGAINALYHLQDFPHAAYNEMAVAPSNEQLSDAVREVGVEGAQWQLSKAEKRTF
        K A  ++R F++ RG     P F+ + IA H W SFC  P        AIN+LY L D     +N  A + + +QL + + E+ VEG +W  +     TF
Subjt:  KYAEFLKRDFLFERGFSGDLPHFLRTSIANHGWESFCSKPEQVDWSPGAINALYHLQDFPHAAYNEMAVAPSNEQLSDAVREVGVEGAQWQLSKAEKRTF

Query:  QSAYLKREANTWMGFIRQRLLPTTHDSTVSRERVLLVFAILRSLSIDVGKIISSEIFGCWRKKVGKLFFPNTIT
            L+     W  F+R RL+P++H   V +ER +L++ +++   ++VG++I  ++  C  +K G L+FP+ IT
Subjt:  QSAYLKREANTWMGFIRQRLLPTTHDSTVSRERVLLVFAILRSLSIDVGKIISSEIFGCWRKKVGKLFFPNTIT

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]9.1e-1932.74Show/hide
Query:  KEEPQLPYVRFVNNLPGAKYAEFLKRDFLFERGF-------SGDLPHFLRTSIANHGWESFCSKPE---------------------------QVDWSPG
        K E +    R+ NN+          R    E+GF        G LP F+   I  H W+ FC+ PE                           QV WS  
Subjt:  KEEPQLPYVRFVNNLPGAKYAEFLKRDFLFERGF-------SGDLPHFLRTSIANHGWESFCSKPE---------------------------QVDWSPG

Query:  AINALYHLQDFPHAAYNEMAVAPSNEQLSDAVREVGVEGAQWQLSKAEKRTFQSAYLKREANTWMGFIRQRLLPTTHDSTVSRERVLLVFAILRSLSIDV
        AINA++ L D P   ++E     +   L   +  V V GA+W +S     T   + L   A  W  F++  LLPTTH  TVS++R+LL+ ++L   SI+V
Subjt:  AINALYHLQDFPHAAYNEMAVAPSNEQLSDAVREVGVEGAQWQLSKAEKRTFQSAYLKREANTWMGFIRQRLLPTTHDSTVSRERVLLVFAILRSLSIDV

Query:  GKIISSEIFGCWRKKVGKLFFPNTIT
        G++I SEI  C  +K G LFFP+ IT
Subjt:  GKIISSEIFGCWRKKVGKLFFPNTIT

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]2.7e-1831.73Show/hide
Query:  ERLLKRRADKGKSVAATSEEPDEKEEPQLPYVRFVNNLPGAKYAEFLKRDFLFERGF-------SGDLPHFLRTSIANHGWESFCSKPE-----------
        ER    R   G    A       K E +    R+ NN+          R    E+GF        G LP F+   I  H W+ FC+ PE           
Subjt:  ERLLKRRADKGKSVAATSEEPDEKEEPQLPYVRFVNNLPGAKYAEFLKRDFLFERGF-------SGDLPHFLRTSIANHGWESFCSKPE-----------

Query:  ----------------QVDWSPGAINALYHLQDFPHAAYNEMAVAPSNEQLSDAVREVGVEGAQWQLSKAEKRTFQSAYLKREANTWMGFIRQRLLPTTH
                        QV WS  AINA++ L D P   ++E     + + L   +  V   GA+W +S     T   + L   A  W  F++ RLLPTTH
Subjt:  ----------------QVDWSPGAINALYHLQDFPHAAYNEMAVAPSNEQLSDAVREVGVEGAQWQLSKAEKRTFQSAYLKREANTWMGFIRQRLLPTTH

Query:  DSTVSRERVLLVFAILRSLSIDVGKIISSEIFGCWRKKVGKLFFPNTIT
          TVS++R+LL+ ++L   SI+VG++I SEI  C  +K G LFFP+ IT
Subjt:  DSTVSRERVLLVFAILRSLSIDVGKIISSEIFGCWRKKVGKLFFPNTIT

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]1.0e-1729.68Show/hide
Query:  KEEPQLPYVRFVNNLPGAKYAEFLKRDFLFERGFSGDLPHFLRTSIANHGWESFCSKPE---------------------------QVDWSPGAINALYH
        K E +   +R+  N+     +  ++++F+++     + P F+   I  H W+ FC+ PE                           QV  S  AIN ++ 
Subjt:  KEEPQLPYVRFVNNLPGAKYAEFLKRDFLFERGFSGDLPHFLRTSIANHGWESFCSKPE---------------------------QVDWSPGAINALYH

Query:  LQDFPHAAYNEMAVAPSNEQLSDAVREVGVEGAQWQLSKAEKRTFQSAYLKREANTWMGFIRQRLLPTTHDSTVSRERVLLVFAILRSLSIDVGKIISSE
        L D P   ++E     +  +L   +  V + GA+W +S     T   + L   A  W  F++ RLLPTTH  TVS+E V L++++L   SI+VG++I  E
Subjt:  LQDFPHAAYNEMAVAPSNEQLSDAVREVGVEGAQWQLSKAEKRTFQSAYLKREANTWMGFIRQRLLPTTHDSTVSRERVLLVFAILRSLSIDVGKIISSE

Query:  IFGCWRKKVGKLFFPNTIT
        I  C  +K G LFFP+ IT
Subjt:  IFGCWRKKVGKLFFPNTIT

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)4.4e-1932.74Show/hide
Query:  KEEPQLPYVRFVNNLPGAKYAEFLKRDFLFERGF-------SGDLPHFLRTSIANHGWESFCSKPE---------------------------QVDWSPG
        K E +    R+ NN+          R    E+GF        G LP F+   I  H W+ FC+ PE                           QV WS  
Subjt:  KEEPQLPYVRFVNNLPGAKYAEFLKRDFLFERGF-------SGDLPHFLRTSIANHGWESFCSKPE---------------------------QVDWSPG

Query:  AINALYHLQDFPHAAYNEMAVAPSNEQLSDAVREVGVEGAQWQLSKAEKRTFQSAYLKREANTWMGFIRQRLLPTTHDSTVSRERVLLVFAILRSLSIDV
        AINA++ L D P   ++E     +   L   +  V V GA+W +S     T   + L   A  W  F++  LLPTTH  TVS++R+LL+ ++L   SI+V
Subjt:  AINALYHLQDFPHAAYNEMAVAPSNEQLSDAVREVGVEGAQWQLSKAEKRTFQSAYLKREANTWMGFIRQRLLPTTHDSTVSRERVLLVFAILRSLSIDV

Query:  GKIISSEIFGCWRKKVGKLFFPNTIT
        G++I SEI  C  +K G LFFP+ IT
Subjt:  GKIISSEIFGCWRKKVGKLFFPNTIT

A0A2P5BCG4 Uncharacterized protein (Fragment)1.3e-1831.73Show/hide
Query:  ERLLKRRADKGKSVAATSEEPDEKEEPQLPYVRFVNNLPGAKYAEFLKRDFLFERGF-------SGDLPHFLRTSIANHGWESFCSKPE-----------
        ER    R   G    A       K E +    R+ NN+          R    E+GF        G LP F+   I  H W+ FC+ PE           
Subjt:  ERLLKRRADKGKSVAATSEEPDEKEEPQLPYVRFVNNLPGAKYAEFLKRDFLFERGF-------SGDLPHFLRTSIANHGWESFCSKPE-----------

Query:  ----------------QVDWSPGAINALYHLQDFPHAAYNEMAVAPSNEQLSDAVREVGVEGAQWQLSKAEKRTFQSAYLKREANTWMGFIRQRLLPTTH
                        QV WS  AINA++ L D P   ++E     + + L   +  V   GA+W +S     T   + L   A  W  F++ RLLPTTH
Subjt:  ----------------QVDWSPGAINALYHLQDFPHAAYNEMAVAPSNEQLSDAVREVGVEGAQWQLSKAEKRTFQSAYLKREANTWMGFIRQRLLPTTH

Query:  DSTVSRERVLLVFAILRSLSIDVGKIISSEIFGCWRKKVGKLFFPNTIT
          TVS++R+LL+ ++L   SI+VG++I SEI  C  +K G LFFP+ IT
Subjt:  DSTVSRERVLLVFAILRSLSIDVGKIISSEIFGCWRKKVGKLFFPNTIT

A0A2P5CPE8 Uncharacterized protein4.6e-1640.88Show/hide
Query:  QVDWSPGAINALYHLQDFPHAAYNEMAVAPSNEQLSDAVREVGVEGAQWQLSKAEKRTFQSAYLKREANTWMGFIRQRLLPTTHDSTVSRERVLLVFAIL
        QV  S  AIN +Y L D P   ++E   A +  +L+  +  V + GA+W +S     T   + L   A  W  F++ RLLPTTH  TVS+ERVLL++++L
Subjt:  QVDWSPGAINALYHLQDFPHAAYNEMAVAPSNEQLSDAVREVGVEGAQWQLSKAEKRTFQSAYLKREANTWMGFIRQRLLPTTHDSTVSRERVLLVFAIL

Query:  RSLSIDVGKIISSEIFGCWRKKVGKLFFPNTITVTPC
           SI+VG+II  EI     +K G LFFP+ IT   C
Subjt:  RSLSIDVGKIISSEIFGCWRKKVGKLFFPNTITVTPC

A0A2P5DAQ2 Uncharacterized protein4.9e-1829.68Show/hide
Query:  KEEPQLPYVRFVNNLPGAKYAEFLKRDFLFERGFSGDLPHFLRTSIANHGWESFCSKPE---------------------------QVDWSPGAINALYH
        K E +   +R+  N+     +  ++++F+++     + P F+   I  H W+ FC+ PE                           QV  S  AIN ++ 
Subjt:  KEEPQLPYVRFVNNLPGAKYAEFLKRDFLFERGFSGDLPHFLRTSIANHGWESFCSKPE---------------------------QVDWSPGAINALYH

Query:  LQDFPHAAYNEMAVAPSNEQLSDAVREVGVEGAQWQLSKAEKRTFQSAYLKREANTWMGFIRQRLLPTTHDSTVSRERVLLVFAILRSLSIDVGKIISSE
        L D P   ++E     +  +L   +  V + GA+W +S     T   + L   A  W  F++ RLLPTTH  TVS+E V L++++L   SI+VG++I  E
Subjt:  LQDFPHAAYNEMAVAPSNEQLSDAVREVGVEGAQWQLSKAEKRTFQSAYLKREANTWMGFIRQRLLPTTHDSTVSRERVLLVFAILRSLSIDVGKIISSE

Query:  IFGCWRKKVGKLFFPNTIT
        I  C  +K G LFFP+ IT
Subjt:  IFGCWRKKVGKLFFPNTIT

W9S7D3 Uncharacterized protein4.4e-1932.76Show/hide
Query:  KYAEFLKRDFLFERGFSGDLPHFLRTSIANHGWESFCSKPEQVDWSPGAINALYHLQDFPHAAYNEMAVAPSNEQLSDAVREVGVEGAQWQLSKAEKRTF
        K A  ++R F++ RG     P F+ + IA H W SFC  P        AIN+LY L D     +N  A + + +QL + + E+ VEG +W  +     TF
Subjt:  KYAEFLKRDFLFERGFSGDLPHFLRTSIANHGWESFCSKPEQVDWSPGAINALYHLQDFPHAAYNEMAVAPSNEQLSDAVREVGVEGAQWQLSKAEKRTF

Query:  QSAYLKREANTWMGFIRQRLLPTTHDSTVSRERVLLVFAILRSLSIDVGKIISSEIFGCWRKKVGKLFFPNTIT
            L+     W  F+R RL+P++H   V +ER +L++ +++   ++VG++I  ++  C  +K G L+FP+ IT
Subjt:  QSAYLKREANTWMGFIRQRLLPTTHDSTVSRERVLLVFAILRSLSIDVGKIISSEIFGCWRKKVGKLFFPNTIT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAAAACGAGAGCAAGAAAAGAGAGAGATACTGAGGAAGAGGAGGTCCCTGTTACCCCTGAAGCGTCGAAAGTTAAAACGAAGAAGAAGAAAACACCAGAAGAGAA
AGAAGCTAAGAGGAGAAGAAGACAGCAGAGGGCTGAAGATCAAGAAGCTGCTCAGAAAGCGGTGGAGGATGTCGTTGTGGAAGAAGATCCGAAAGAACCAGAAGAACAGA
ATCCAGAGCAGATTGGCCCAATAGTTGCAGATACAGAGGCAGTTCGAGAAGAAAATGCAGAAGACGTTCAAGAAGAGCAGACTGAGAATGTGCAAGAAGAACAGGCAGAG
GTTGCGCCTGAAGAAGTTAATGAGCAAGAACAGGAGGCTCGTGTGGAGGTGATCATGCCGGAAGTGCCCAAACGCCGCCGTATAAAGCGAAAAGCGGGCCGTGTTAAGGT
AGTCCGAACTGATACCCCCTCGCCTCCAACTACTGATTCGGAAAGAGAGAATGCTGAGAGAGAAGAGCAGGAGAAGAAAGAGGCCGAGGAAAAAGCACGAGAAGAAGTAA
GGAGAAAGGCTGAAGAAGAAAGGTTGCTCAAGCGAAGGGCAGACAAAGGCAAAAGTGTTGCTGCGACATCGGAGGAACCTGACGAAAAAGAAGAGCCGCAATTGCCGTAT
GTCCGCTTCGTCAACAACCTTCCTGGAGCAAAGTATGCTGAGTTTCTGAAGAGAGACTTCCTGTTTGAGAGAGGATTTAGTGGTGATCTCCCACATTTCCTGAGGACAAG
CATTGCGAACCATGGATGGGAATCATTCTGTTCCAAGCCTGAACAGGTCGATTGGAGTCCTGGTGCTATCAACGCGTTGTACCACCTTCAAGATTTTCCCCACGCAGCAT
ATAATGAGATGGCTGTGGCGCCATCGAATGAGCAGTTGAGTGATGCAGTGAGGGAAGTCGGTGTTGAAGGGGCGCAGTGGCAGCTGTCGAAGGCAGAGAAAAGGACATTC
CAGTCAGCCTACTTAAAGAGGGAAGCAAATACGTGGATGGGGTTTATCAGACAAAGGTTGCTTCCGACAACTCATGACTCGACGGTTTCCAGGGAACGAGTGCTTCTGGT
TTTTGCTATTTTGCGGTCTCTCAGTATTGATGTGGGAAAAATTATTTCAAGCGAAATATTTGGATGTTGGAGGAAGAAGGTTGGGAAATTGTTCTTTCCGAACACAATCA
CTGTAACGCCCTGCGTTTCGGAATCGAGCCCAGCCGCCCCTCCCCTTCAGTTCCTTCTTCTCGCGCAGCCAAGCCTGAAGCCGCATCTCTCCCCCTCTCTGCCGTTGTTG
CGCCGCCGCCAGCAGCCAAGCCGCAAGTCTCCTCTCCCCCTCGCGTGTTCGTTCGGCTCGAGCAGCAAGACCCACGCAGATCTCCATCTCCGGCGTCTTTCTCCCTCTTC
GCGTGAGCAGTCGTGGTTCCCGTCGTCGATCTCTCTCTCTCTCGTTCTTCCTCGCGTTGAAGCTGCCAGCGTCAAGACCCACGGACAGCAGCAGCCCGCGCCGCCGTTCG
TCTCCCTAAAGCTTGAGCCGTTCTCCTCGCATGTAAGCTGGGTTAGTGAGCAGAAGGCAAGACCTTCGTCGTTCTGGCATTTTGGGCACGTTTGGGCTGATTTAAAGTCC
GTTTCAGCGTGTTTGGACTATTTCGTTGGACCCAATTCGAACCCTGGGCAGCGCGTGTTCAAGGAGCTTTGGTTCTGGTTGTCAAATTCCAGCAAGCTGTTGGACTCGTT
AGTGTGGTTCGCTTGGGATCAATTAAAGCTTACTTGGGTTATTTCCAACAAGGAACTTTGTTCTGTTTGGTGGAGCTTAGAAGGTTCAAATAGGGTGGTTAAGCTCAATT
CTGATCTTTTTATGTTATCTTTGCTGATTAGGAGCTTGCCTAGATTGGGTCGGTGTGATGAGTCTCGAAAGCAAGTGTTGGGCGCCTGGGGAAGGGCCAATACTTTGCTA
AGTCATCGGTGCGCCTTGGGTACAAATGGTCAAGGGGCGATGCACAGTTCGAGGCCTTGGGTACAAATGGTCAAGGGTCGAACGCCGAGCTCCGTAGAGAGCATTGTGGC
CCTGGGTACAAATGGTCAAGGGACAGTGCGACTCGAAGGATCAGTCTTGGAGAGCCATGTGAGGGCTAAATATGAGCGGGCTGCTTACCAGTACCTTAGTGTGCTGACCC
CCTCCCCTCTCTCTCCCCCCCAACTACCAGATTTTGCAGGTTATGAGGACTGCGTGGACTATGGTGATGCGGAGGAGACGTGTGAGGAAGGACCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCAAAAACGAGAGCAAGAAAAGAGAGAGATACTGAGGAAGAGGAGGTCCCTGTTACCCCTGAAGCGTCGAAAGTTAAAACGAAGAAGAAGAAAACACCAGAAGAGAA
AGAAGCTAAGAGGAGAAGAAGACAGCAGAGGGCTGAAGATCAAGAAGCTGCTCAGAAAGCGGTGGAGGATGTCGTTGTGGAAGAAGATCCGAAAGAACCAGAAGAACAGA
ATCCAGAGCAGATTGGCCCAATAGTTGCAGATACAGAGGCAGTTCGAGAAGAAAATGCAGAAGACGTTCAAGAAGAGCAGACTGAGAATGTGCAAGAAGAACAGGCAGAG
GTTGCGCCTGAAGAAGTTAATGAGCAAGAACAGGAGGCTCGTGTGGAGGTGATCATGCCGGAAGTGCCCAAACGCCGCCGTATAAAGCGAAAAGCGGGCCGTGTTAAGGT
AGTCCGAACTGATACCCCCTCGCCTCCAACTACTGATTCGGAAAGAGAGAATGCTGAGAGAGAAGAGCAGGAGAAGAAAGAGGCCGAGGAAAAAGCACGAGAAGAAGTAA
GGAGAAAGGCTGAAGAAGAAAGGTTGCTCAAGCGAAGGGCAGACAAAGGCAAAAGTGTTGCTGCGACATCGGAGGAACCTGACGAAAAAGAAGAGCCGCAATTGCCGTAT
GTCCGCTTCGTCAACAACCTTCCTGGAGCAAAGTATGCTGAGTTTCTGAAGAGAGACTTCCTGTTTGAGAGAGGATTTAGTGGTGATCTCCCACATTTCCTGAGGACAAG
CATTGCGAACCATGGATGGGAATCATTCTGTTCCAAGCCTGAACAGGTCGATTGGAGTCCTGGTGCTATCAACGCGTTGTACCACCTTCAAGATTTTCCCCACGCAGCAT
ATAATGAGATGGCTGTGGCGCCATCGAATGAGCAGTTGAGTGATGCAGTGAGGGAAGTCGGTGTTGAAGGGGCGCAGTGGCAGCTGTCGAAGGCAGAGAAAAGGACATTC
CAGTCAGCCTACTTAAAGAGGGAAGCAAATACGTGGATGGGGTTTATCAGACAAAGGTTGCTTCCGACAACTCATGACTCGACGGTTTCCAGGGAACGAGTGCTTCTGGT
TTTTGCTATTTTGCGGTCTCTCAGTATTGATGTGGGAAAAATTATTTCAAGCGAAATATTTGGATGTTGGAGGAAGAAGGTTGGGAAATTGTTCTTTCCGAACACAATCA
CTGTAACGCCCTGCGTTTCGGAATCGAGCCCAGCCGCCCCTCCCCTTCAGTTCCTTCTTCTCGCGCAGCCAAGCCTGAAGCCGCATCTCTCCCCCTCTCTGCCGTTGTTG
CGCCGCCGCCAGCAGCCAAGCCGCAAGTCTCCTCTCCCCCTCGCGTGTTCGTTCGGCTCGAGCAGCAAGACCCACGCAGATCTCCATCTCCGGCGTCTTTCTCCCTCTTC
GCGTGAGCAGTCGTGGTTCCCGTCGTCGATCTCTCTCTCTCTCGTTCTTCCTCGCGTTGAAGCTGCCAGCGTCAAGACCCACGGACAGCAGCAGCCCGCGCCGCCGTTCG
TCTCCCTAAAGCTTGAGCCGTTCTCCTCGCATGTAAGCTGGGTTAGTGAGCAGAAGGCAAGACCTTCGTCGTTCTGGCATTTTGGGCACGTTTGGGCTGATTTAAAGTCC
GTTTCAGCGTGTTTGGACTATTTCGTTGGACCCAATTCGAACCCTGGGCAGCGCGTGTTCAAGGAGCTTTGGTTCTGGTTGTCAAATTCCAGCAAGCTGTTGGACTCGTT
AGTGTGGTTCGCTTGGGATCAATTAAAGCTTACTTGGGTTATTTCCAACAAGGAACTTTGTTCTGTTTGGTGGAGCTTAGAAGGTTCAAATAGGGTGGTTAAGCTCAATT
CTGATCTTTTTATGTTATCTTTGCTGATTAGGAGCTTGCCTAGATTGGGTCGGTGTGATGAGTCTCGAAAGCAAGTGTTGGGCGCCTGGGGAAGGGCCAATACTTTGCTA
AGTCATCGGTGCGCCTTGGGTACAAATGGTCAAGGGGCGATGCACAGTTCGAGGCCTTGGGTACAAATGGTCAAGGGTCGAACGCCGAGCTCCGTAGAGAGCATTGTGGC
CCTGGGTACAAATGGTCAAGGGACAGTGCGACTCGAAGGATCAGTCTTGGAGAGCCATGTGAGGGCTAAATATGAGCGGGCTGCTTACCAGTACCTTAGTGTGCTGACCC
CCTCCCCTCTCTCTCCCCCCCAACTACCAGATTTTGCAGGTTATGAGGACTGCGTGGACTATGGTGATGCGGAGGAGACGTGTGAGGAAGGACCTTAA
Protein sequenceShow/hide protein sequence
MAKTRARKERDTEEEEVPVTPEASKVKTKKKKTPEEKEAKRRRRQQRAEDQEAAQKAVEDVVVEEDPKEPEEQNPEQIGPIVADTEAVREENAEDVQEEQTENVQEEQAE
VAPEEVNEQEQEARVEVIMPEVPKRRRIKRKAGRVKVVRTDTPSPPTTDSERENAEREEQEKKEAEEKAREEVRRKAEEERLLKRRADKGKSVAATSEEPDEKEEPQLPY
VRFVNNLPGAKYAEFLKRDFLFERGFSGDLPHFLRTSIANHGWESFCSKPEQVDWSPGAINALYHLQDFPHAAYNEMAVAPSNEQLSDAVREVGVEGAQWQLSKAEKRTF
QSAYLKREANTWMGFIRQRLLPTTHDSTVSRERVLLVFAILRSLSIDVGKIISSEIFGCWRKKVGKLFFPNTITVTPCVSESSPAAPPLQFLLLAQPSLKPHLSPSLPLL
RRRQQPSRKSPLPLACSFGSSSKTHADLHLRRLSPSSREQSWFPSSISLSLVLPRVEAASVKTHGQQQPAPPFVSLKLEPFSSHVSWVSEQKARPSSFWHFGHVWADLKS
VSACLDYFVGPNSNPGQRVFKELWFWLSNSSKLLDSLVWFAWDQLKLTWVISNKELCSVWWSLEGSNRVVKLNSDLFMLSLLIRSLPRLGRCDESRKQVLGAWGRANTLL
SHRCALGTNGQGAMHSSRPWVQMVKGRTPSSVESIVALGTNGQGTVRLEGSVLESHVRAKYERAAYQYLSVLTPSPLSPPQLPDFAGYEDCVDYGDAEETCEEGP