; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg018825 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg018825
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold3:4621836..4623549
RNA-Seq ExpressionSpg018825
SyntenySpg018825
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8695166.1 hypothetical protein F3Y22_tig00110733pilonHSYRG00282 [Hibiscus syriacus]6.1e-2127.93Show/hide
Query:  FVNRLARAKYAELLKRDFLFERGFSGDLPHFLRVG------ITNHGWELFCSKPEYVNAQIVSEFYANINE----------------------------E
        FV+  A+  Y  +  R   FE GF         +G      +T H W+ F   P  VNA IV EFY+NI E                             
Subjt:  FVNRLARAKYAELLKRDFLFERGFSGDLPHFLRVG------ITNHGWELFCSKPEYVNAQIVSEFYANINE----------------------------E

Query:  DGFLAAFA----------ILRDVGIEGAHWRLSKTEKRTFQSAYLKREANTWMGFIKQRLLPTTHESTVSRERVLLAFAILRSLSIDVRKIIASEISGCW
        D    +FA          IL D+ + G  W   + +++T     L      W  F+K +L+PT+H +TVS +R+LL  +IL   +ID+ KII      C 
Subjt:  DGFLAAFA----------ILRDVGIEGAHWRLSKTEKRTFQSAYLKREANTWMGFIKQRLLPTTHESTVSRERVLLAFAILRSLSIDVRKIIASEISGCW

Query:  KKKVGKLFFPNTITMLCKRVGVPENEGDVIVFDKGIMDTPNLARLQRTQEAR------------QGGLVYGINTILEQLALSASRQEFAE--RQALTFWN
        K++   L FPN IT LC++  V E   D I+     ++   +  L   +EA+                V   +T LEQ A+  + Q   +   + + ++ 
Subjt:  KKKVGKLFFPNTITMLCKRVGVPENEGDVIVFDKGIMDTPNLARLQRTQEAR------------QGGLVYGINTILEQLALSASRQEFAE--RQALTFWN

Query:  YVKSRDANLKKALQENFSK-------PYPALPT
        Y K RDA L  AL E+  +         P LPT
Subjt:  YVKSRDANLKKALQENFSK-------PYPALPT

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]3.6e-2131.5Show/hide
Query:  VRFVNRLARAKYA-ELLKRDFLFERGF-------SGDLPHFLRVGITNHGWELFCSKPEYVNAQIVSEFYANINE----------------EDGFLAAF-
        V+F    A  +Y   +  R    E+GF        G LP   +V IT H W+ FC+ PE     +V EFYAN+ +                E+   A F 
Subjt:  VRFVNRLARAKYA-ELLKRDFLFERGF-------SGDLPHFLRVGITNHGWELFCSKPEYVNAQIVSEFYANINE----------------EDGFLAAF-

Query:  ---------------------AILRDVGIEGAHWRLSKTEKRTFQSAYLKREANTWMGFIKQRLLPTTHESTVSRERVLLAFAILRSLSIDVRKIIASEI
                              +L  V + GA W +S     T   + L   A  W  F+K  LLPTTH  TVS++R+LL  ++L   SI+V ++I SEI
Subjt:  ---------------------AILRDVGIEGAHWRLSKTEKRTFQSAYLKREANTWMGFIKQRLLPTTHESTVSRERVLLAFAILRSLSIDVRKIIASEI

Query:  SGCWKKKVGKLFFPNTITMLCKRVGVPENEGDVIVFDKGIMDTPNLARLQRTQE
          C  +K G LFFP+ IT LC+    P    +  + + G +D   +AR+  TQE
Subjt:  SGCWKKKVGKLFFPNTITMLCKRVGVPENEGDVIVFDKGIMDTPNLARLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]9.4e-3028.84Show/hide
Query:  VRFVNRLARAKYA-ELLKRDFLFERGF-------SGDLPHFLRVGITNHGWELFCSKPEYVNAQIVSEFYANINE----------------EDGFLAAF-
        V+F    A  +Y   +  R    E+GF        G LP   +V IT H W+ FC+ PE     +V EFYAN+ +                E+   A F 
Subjt:  VRFVNRLARAKYA-ELLKRDFLFERGF-------SGDLPHFLRVGITNHGWELFCSKPEYVNAQIVSEFYANINE----------------EDGFLAAF-

Query:  ---------------------AILRDVGIEGAHWRLSKTEKRTFQSAYLKREANTWMGFIKQRLLPTTHESTVSRERVLLAFAILRSLSIDVRKIIASEI
                              +L  V   GA W +S     T   + L   A  W  F+K RLLPTTH  TVS++R+LL  ++L   SI+V ++I SEI
Subjt:  ---------------------AILRDVGIEGAHWRLSKTEKRTFQSAYLKREANTWMGFIKQRLLPTTHESTVSRERVLLAFAILRSLSIDVRKIIASEI

Query:  SGCWKKKVGKLFFPNTITMLCKRVGVPENEGDVIVFDKGIMDTPNLARLQR---TQEARQ---------------GGLVYGINTILEQLALSASRQ----
          C  +K G LFFP+ IT LC+    P    +  + + G +D   +AR+ +   T+  +Q               G ++  +  + ++L+    +Q    
Subjt:  SGCWKKKVGKLFFPNTITMLCKRVGVPENEGDVIVFDKGIMDTPNLARLQR---TQEARQ---------------GGLVYGINTILEQLALSASRQ----

Query:  ---EFAERQALTFWNYVKSRDANLKKALQENFSKPYPALPTFPDDLLNPWIPPPPIEREEEDDENEQGQED
           +   +Q   FW Y K RD  LKKALQ NF++P P  P FP ++L        ++ E E + ++ G  +
Subjt:  ---EFAERQALTFWNYVKSRDANLKKALQENFSKPYPALPTFPDDLLNPWIPPPPIEREEEDDENEQGQED

PON59596.1 hypothetical protein PanWU01x14_158080 [Parasponia andersonii]6.1e-2133.03Show/hide
Query:  ANTWMGFIKQRLLPTTHESTVSRERVLLAFAILRSLSIDVRKIIASEISGCWKKKVGKLFFPNTITMLCKRVGVPENEGDVIVFDKGIMDTPNLARLQRT
        A  W  F+K RLLPTTH  TVS++R+LL +++L   SI+V ++I SEI  C  +K G LFFP+ IT LC+    P    +  +   G +D   +AR+  T
Subjt:  ANTWMGFIKQRLLPTTHESTVSRERVLLAFAILRSLSIDVRKIIASEISGCWKKKVGKLFFPNTITMLCKRVGVPENEGDVIVFDKGIMDTPNLARLQRT

Query:  QEAR--------------------QGGLVYGINTILEQLALSASRQ-------EFAERQALTFWNYVKSRDANLKKALQENFSKPYPALPTFPDDLLNPW
        QE +                     G ++  +  + ++L+    +Q       +   +Q   FW Y K RD  LKKALQ NF++P P  PTFP +LL   
Subjt:  QEAR--------------------QGGLVYGINTILEQLALSASRQ-------EFAERQALTFWNYVKSRDANLKKALQENFSKPYPALPTFPDDLLNPW

Query:  IPPPPIEREEEDDENEQGQED
             ++ E E + ++ G  +
Subjt:  IPPPPIEREEEDDENEQGQED

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]4.1e-2534.75Show/hide
Query:  SEFYANINEEDGFLAAFAILRDVGIEGAHWRLSKTEKRTFQSAYLKREANTWMGFIKQRLLPTTHESTVSRERVLLAFAILRSLSIDVRKIIASEISGCW
        SEF  NI E +       +L  V   GA W +S     T   + L   A  W  F+K RLLPTTH   VS++R+LL  ++L   SI+V ++I SEI  C 
Subjt:  SEFYANINEEDGFLAAFAILRDVGIEGAHWRLSKTEKRTFQSAYLKREANTWMGFIKQRLLPTTHESTVSRERVLLAFAILRSLSIDVRKIIASEISGCW

Query:  KKKVGKLFFPNTITMLCKRVGVPENEGDVIVFDKGIMDTPNLARL------QRTQE-----------ARQGGLVYGINTILEQLALSASRQEFAERQALT
         +K G LFFP+ IT LC+      NE    + + G +D   +AR+      + TQ+           +R  G V      LEQ     S+QE   +Q   
Subjt:  KKKVGKLFFPNTITMLCKRVGVPENEGDVIVFDKGIMDTPNLARL------QRTQE-----------ARQGGLVYGINTILEQLALSASRQEFAERQALT

Query:  FWNYVKSRDANLKKALQENFSKPYPALPTFPDDLLNPWIPPPPIEREEEDDENEQGQED
        FW Y K RD  LKKALQ NF++P P  P FP ++L        ++ E E + ++ G  +
Subjt:  FWNYVKSRDANLKKALQENFSKPYPALPTFPDDLLNPWIPPPPIEREEEDDENEQGQED

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)1.7e-2131.5Show/hide
Query:  VRFVNRLARAKYA-ELLKRDFLFERGF-------SGDLPHFLRVGITNHGWELFCSKPEYVNAQIVSEFYANINE----------------EDGFLAAF-
        V+F    A  +Y   +  R    E+GF        G LP   +V IT H W+ FC+ PE     +V EFYAN+ +                E+   A F 
Subjt:  VRFVNRLARAKYA-ELLKRDFLFERGF-------SGDLPHFLRVGITNHGWELFCSKPEYVNAQIVSEFYANINE----------------EDGFLAAF-

Query:  ---------------------AILRDVGIEGAHWRLSKTEKRTFQSAYLKREANTWMGFIKQRLLPTTHESTVSRERVLLAFAILRSLSIDVRKIIASEI
                              +L  V + GA W +S     T   + L   A  W  F+K  LLPTTH  TVS++R+LL  ++L   SI+V ++I SEI
Subjt:  ---------------------AILRDVGIEGAHWRLSKTEKRTFQSAYLKREANTWMGFIKQRLLPTTHESTVSRERVLLAFAILRSLSIDVRKIIASEI

Query:  SGCWKKKVGKLFFPNTITMLCKRVGVPENEGDVIVFDKGIMDTPNLARLQRTQE
          C  +K G LFFP+ IT LC+    P    +  + + G +D   +AR+  TQE
Subjt:  SGCWKKKVGKLFFPNTITMLCKRVGVPENEGDVIVFDKGIMDTPNLARLQRTQE

A0A2P5BCG4 Uncharacterized protein (Fragment)4.5e-3028.84Show/hide
Query:  VRFVNRLARAKYA-ELLKRDFLFERGF-------SGDLPHFLRVGITNHGWELFCSKPEYVNAQIVSEFYANINE----------------EDGFLAAF-
        V+F    A  +Y   +  R    E+GF        G LP   +V IT H W+ FC+ PE     +V EFYAN+ +                E+   A F 
Subjt:  VRFVNRLARAKYA-ELLKRDFLFERGF-------SGDLPHFLRVGITNHGWELFCSKPEYVNAQIVSEFYANINE----------------EDGFLAAF-

Query:  ---------------------AILRDVGIEGAHWRLSKTEKRTFQSAYLKREANTWMGFIKQRLLPTTHESTVSRERVLLAFAILRSLSIDVRKIIASEI
                              +L  V   GA W +S     T   + L   A  W  F+K RLLPTTH  TVS++R+LL  ++L   SI+V ++I SEI
Subjt:  ---------------------AILRDVGIEGAHWRLSKTEKRTFQSAYLKREANTWMGFIKQRLLPTTHESTVSRERVLLAFAILRSLSIDVRKIIASEI

Query:  SGCWKKKVGKLFFPNTITMLCKRVGVPENEGDVIVFDKGIMDTPNLARLQR---TQEARQ---------------GGLVYGINTILEQLALSASRQ----
          C  +K G LFFP+ IT LC+    P    +  + + G +D   +AR+ +   T+  +Q               G ++  +  + ++L+    +Q    
Subjt:  SGCWKKKVGKLFFPNTITMLCKRVGVPENEGDVIVFDKGIMDTPNLARLQR---TQEARQ---------------GGLVYGINTILEQLALSASRQ----

Query:  ---EFAERQALTFWNYVKSRDANLKKALQENFSKPYPALPTFPDDLLNPWIPPPPIEREEEDDENEQGQED
           +   +Q   FW Y K RD  LKKALQ NF++P P  P FP ++L        ++ E E + ++ G  +
Subjt:  ---EFAERQALTFWNYVKSRDANLKKALQENFSKPYPALPTFPDDLLNPWIPPPPIEREEEDDENEQGQED

A0A2P5CEY2 Uncharacterized protein3.0e-2133.03Show/hide
Query:  ANTWMGFIKQRLLPTTHESTVSRERVLLAFAILRSLSIDVRKIIASEISGCWKKKVGKLFFPNTITMLCKRVGVPENEGDVIVFDKGIMDTPNLARLQRT
        A  W  F+K RLLPTTH  TVS++R+LL +++L   SI+V ++I SEI  C  +K G LFFP+ IT LC+    P    +  +   G +D   +AR+  T
Subjt:  ANTWMGFIKQRLLPTTHESTVSRERVLLAFAILRSLSIDVRKIIASEISGCWKKKVGKLFFPNTITMLCKRVGVPENEGDVIVFDKGIMDTPNLARLQRT

Query:  QEAR--------------------QGGLVYGINTILEQLALSASRQ-------EFAERQALTFWNYVKSRDANLKKALQENFSKPYPALPTFPDDLLNPW
        QE +                     G ++  +  + ++L+    +Q       +   +Q   FW Y K RD  LKKALQ NF++P P  PTFP +LL   
Subjt:  QEAR--------------------QGGLVYGINTILEQLALSASRQ-------EFAERQALTFWNYVKSRDANLKKALQENFSKPYPALPTFPDDLLNPW

Query:  IPPPPIEREEEDDENEQGQED
             ++ E E + ++ G  +
Subjt:  IPPPPIEREEEDDENEQGQED

A0A2P5DXM3 Uncharacterized protein2.0e-2534.75Show/hide
Query:  SEFYANINEEDGFLAAFAILRDVGIEGAHWRLSKTEKRTFQSAYLKREANTWMGFIKQRLLPTTHESTVSRERVLLAFAILRSLSIDVRKIIASEISGCW
        SEF  NI E +       +L  V   GA W +S     T   + L   A  W  F+K RLLPTTH   VS++R+LL  ++L   SI+V ++I SEI  C 
Subjt:  SEFYANINEEDGFLAAFAILRDVGIEGAHWRLSKTEKRTFQSAYLKREANTWMGFIKQRLLPTTHESTVSRERVLLAFAILRSLSIDVRKIIASEISGCW

Query:  KKKVGKLFFPNTITMLCKRVGVPENEGDVIVFDKGIMDTPNLARL------QRTQE-----------ARQGGLVYGINTILEQLALSASRQEFAERQALT
         +K G LFFP+ IT LC+      NE    + + G +D   +AR+      + TQ+           +R  G V      LEQ     S+QE   +Q   
Subjt:  KKKVGKLFFPNTITMLCKRVGVPENEGDVIVFDKGIMDTPNLARL------QRTQE-----------ARQGGLVYGINTILEQLALSASRQEFAERQALT

Query:  FWNYVKSRDANLKKALQENFSKPYPALPTFPDDLLNPWIPPPPIEREEEDDENEQGQED
        FW Y K RD  LKKALQ NF++P P  P FP ++L        ++ E E + ++ G  +
Subjt:  FWNYVKSRDANLKKALQENFSKPYPALPTFPDDLLNPWIPPPPIEREEEDDENEQGQED

A0A6A2ZUE4 Uncharacterized protein3.0e-2127.93Show/hide
Query:  FVNRLARAKYAELLKRDFLFERGFSGDLPHFLRVG------ITNHGWELFCSKPEYVNAQIVSEFYANINE----------------------------E
        FV+  A+  Y  +  R   FE GF         +G      +T H W+ F   P  VNA IV EFY+NI E                             
Subjt:  FVNRLARAKYAELLKRDFLFERGFSGDLPHFLRVG------ITNHGWELFCSKPEYVNAQIVSEFYANINE----------------------------E

Query:  DGFLAAFA----------ILRDVGIEGAHWRLSKTEKRTFQSAYLKREANTWMGFIKQRLLPTTHESTVSRERVLLAFAILRSLSIDVRKIIASEISGCW
        D    +FA          IL D+ + G  W   + +++T     L      W  F+K +L+PT+H +TVS +R+LL  +IL   +ID+ KII      C 
Subjt:  DGFLAAFA----------ILRDVGIEGAHWRLSKTEKRTFQSAYLKREANTWMGFIKQRLLPTTHESTVSRERVLLAFAILRSLSIDVRKIIASEISGCW

Query:  KKKVGKLFFPNTITMLCKRVGVPENEGDVIVFDKGIMDTPNLARLQRTQEAR------------QGGLVYGINTILEQLALSASRQEFAE--RQALTFWN
        K++   L FPN IT LC++  V E   D I+     ++   +  L   +EA+                V   +T LEQ A+  + Q   +   + + ++ 
Subjt:  KKKVGKLFFPNTITMLCKRVGVPENEGDVIVFDKGIMDTPNLARLQRTQEAR------------QGGLVYGINTILEQLALSASRQEFAE--RQALTFWN

Query:  YVKSRDANLKKALQENFSK-------PYPALPT
        Y K RDA L  AL E+  +         P LPT
Subjt:  YVKSRDANLKKALQENFSK-------PYPALPT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAAAACGAGAGCAAGAAAAGAAAGAGATAATGAGGAAGAGGAGGTTCCTGTTACCACCGAAGTACAGAAAGCTAAGGCAAAGAAAAAGAAAACACTAGAAGAGAA
AGAGGCTAAGAGGAGAAGAAGACAGCAGAGGGTTGAGGATCAAGAAATAGTGCAAAAGATAGTCGAGGAGGTAGCTGCCGAGGCAGTTGAAGAAGGCAATCCGAAGGAAC
CTAAAGGACAAAACCTAGAGCAGATTGAGCCGATAGTTGCGGATACAGAGGAAGCAGAGGTTGCGCCTGAAAAAGGTAATGAGCCAGTACAAGAGGCTCGAGTGGAGGTG
ATCATGTCGGAAGTGCCAAAACGTCGCCGCGTGAAGCGAAAAGCTGGACGCGTCAAGGAGAAAAATGAGGCCGAGGATAAAGCAAGAGATGAAGCAAAGAAAAAGACTGA
AGAAGAAAGGTTGCTCAAGCAAAGGGCAGACAAAGGCAAGAGTGTTGCTGCGGCATCGGAGGAACCTGACGAGATAGAAGAGCCACAGTTGCCGTATGTCCGCTTCGTCA
ATCGTCTTGCCAGAGCAAAATATGCTGAGCTGCTGAAGAGAGACTTCCTATTCGAGAGAGGATTTAGTGGTGATCTTCCACATTTTCTGAGGGTCGGCATTACGAACCAC
GGCTGGGAATTATTTTGTTCCAAGCCTGAATATGTGAACGCGCAGATAGTGAGCGAATTTTATGCAAATATTAACGAGGAAGATGGTTTCCTAGCGGCGTTCGCGATTTT
GAGGGACGTTGGTATTGAAGGGGCACATTGGAGGCTTTCAAAAACAGAGAAAAGGACGTTTCAGTCAGCCTATCTGAAGAGGGAAGCAAATACATGGATGGGATTTATCA
AACAAAGGTTGCTTCCAACGACTCATGAATCGACGGTTTCTAGGGAACGGGTTCTTCTGGCGTTCGCGATTTTGAGGTCTCTCAGCATTGATGTAAGGAAGATTATTGCT
AGTGAAATATCTGGATGTTGGAAGAAGAAAGTGGGGAAACTGTTTTTTCCGAATACGATTACGATGCTTTGCAAGCGTGTAGGGGTTCCAGAGAATGAAGGTGACGTTAT
TGTATTTGACAAGGGAATTATGGATACACCTAACTTGGCACGGCTTCAGCGTACGCAAGAGGCACGTCAGGGTGGGCTTGTCTATGGCATCAACACGATCTTAGAACAAC
TGGCACTGTCGGCCAGCAGGCAGGAGTTTGCCGAGAGGCAAGCTTTAACCTTCTGGAACTATGTTAAAAGTCGTGATGCCAATCTGAAGAAGGCGCTGCAGGAGAATTTT
TCCAAACCGTATCCAGCCCTTCCAACATTCCCTGACGATTTATTGAACCCCTGGATTCCGCCCCCACCGATTGAAAGAGAAGAAGAGGATGATGAAAATGAGCAGGGCCA
AGAGGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGAAAACGAGAGCAAGAAAAGAAAGAGATAATGAGGAAGAGGAGGTTCCTGTTACCACCGAAGTACAGAAAGCTAAGGCAAAGAAAAAGAAAACACTAGAAGAGAA
AGAGGCTAAGAGGAGAAGAAGACAGCAGAGGGTTGAGGATCAAGAAATAGTGCAAAAGATAGTCGAGGAGGTAGCTGCCGAGGCAGTTGAAGAAGGCAATCCGAAGGAAC
CTAAAGGACAAAACCTAGAGCAGATTGAGCCGATAGTTGCGGATACAGAGGAAGCAGAGGTTGCGCCTGAAAAAGGTAATGAGCCAGTACAAGAGGCTCGAGTGGAGGTG
ATCATGTCGGAAGTGCCAAAACGTCGCCGCGTGAAGCGAAAAGCTGGACGCGTCAAGGAGAAAAATGAGGCCGAGGATAAAGCAAGAGATGAAGCAAAGAAAAAGACTGA
AGAAGAAAGGTTGCTCAAGCAAAGGGCAGACAAAGGCAAGAGTGTTGCTGCGGCATCGGAGGAACCTGACGAGATAGAAGAGCCACAGTTGCCGTATGTCCGCTTCGTCA
ATCGTCTTGCCAGAGCAAAATATGCTGAGCTGCTGAAGAGAGACTTCCTATTCGAGAGAGGATTTAGTGGTGATCTTCCACATTTTCTGAGGGTCGGCATTACGAACCAC
GGCTGGGAATTATTTTGTTCCAAGCCTGAATATGTGAACGCGCAGATAGTGAGCGAATTTTATGCAAATATTAACGAGGAAGATGGTTTCCTAGCGGCGTTCGCGATTTT
GAGGGACGTTGGTATTGAAGGGGCACATTGGAGGCTTTCAAAAACAGAGAAAAGGACGTTTCAGTCAGCCTATCTGAAGAGGGAAGCAAATACATGGATGGGATTTATCA
AACAAAGGTTGCTTCCAACGACTCATGAATCGACGGTTTCTAGGGAACGGGTTCTTCTGGCGTTCGCGATTTTGAGGTCTCTCAGCATTGATGTAAGGAAGATTATTGCT
AGTGAAATATCTGGATGTTGGAAGAAGAAAGTGGGGAAACTGTTTTTTCCGAATACGATTACGATGCTTTGCAAGCGTGTAGGGGTTCCAGAGAATGAAGGTGACGTTAT
TGTATTTGACAAGGGAATTATGGATACACCTAACTTGGCACGGCTTCAGCGTACGCAAGAGGCACGTCAGGGTGGGCTTGTCTATGGCATCAACACGATCTTAGAACAAC
TGGCACTGTCGGCCAGCAGGCAGGAGTTTGCCGAGAGGCAAGCTTTAACCTTCTGGAACTATGTTAAAAGTCGTGATGCCAATCTGAAGAAGGCGCTGCAGGAGAATTTT
TCCAAACCGTATCCAGCCCTTCCAACATTCCCTGACGATTTATTGAACCCCTGGATTCCGCCCCCACCGATTGAAAGAGAAGAAGAGGATGATGAAAATGAGCAGGGCCA
AGAGGACTGA
Protein sequenceShow/hide protein sequence
MAKTRARKERDNEEEEVPVTTEVQKAKAKKKKTLEEKEAKRRRRQQRVEDQEIVQKIVEEVAAEAVEEGNPKEPKGQNLEQIEPIVADTEEAEVAPEKGNEPVQEARVEV
IMSEVPKRRRVKRKAGRVKEKNEAEDKARDEAKKKTEEERLLKQRADKGKSVAAASEEPDEIEEPQLPYVRFVNRLARAKYAELLKRDFLFERGFSGDLPHFLRVGITNH
GWELFCSKPEYVNAQIVSEFYANINEEDGFLAAFAILRDVGIEGAHWRLSKTEKRTFQSAYLKREANTWMGFIKQRLLPTTHESTVSRERVLLAFAILRSLSIDVRKIIA
SEISGCWKKKVGKLFFPNTITMLCKRVGVPENEGDVIVFDKGIMDTPNLARLQRTQEARQGGLVYGINTILEQLALSASRQEFAERQALTFWNYVKSRDANLKKALQENF
SKPYPALPTFPDDLLNPWIPPPPIEREEEDDENEQGQED