CuGenDBv2

Spg024206 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg024206
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: scaffold4:17222970..17224694
RNA-Seq Expression: Spg024206
Synteny: Spg024206
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, %identity, alignment)
EOY08849.1 Uncharacterized protein TCM_024087 [Theobroma cacao] | e value: 1.6e-22 | %identity: 38.99
Query:  IADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGAHWRLSKTEKRTFQSAYLKREANTWMRFIKQRLLPTTHDSTVSRERVLLAFAILRSLSID
        I D  W +FC +P++    VVREFYAN+ +    +A VRGA W+ S  E  +F+ + +K+E   W+ F+  RLL +TH S V+++R +L +AI+   SID
Subjt:  IADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGAHWRLSKTEKRTFQSAYLKREANTWMRFIKQRLLPTTHDSTVSRERVLLAFAILRSLSID

Query:  VGKIIADEISGCWKKKVGKLFFPNTITMLCKRAGVPKNEGDVILFDKGIIDTPNLARLQ
        VGK+I+  I    + K   + FP+ IT LC RAGV  ++ + +   K  I    L RL+
Subjt:  VGKIIADEISGCWKKKVGKLFFPNTITMLCKRAGVPKNEGDVILFDKGIIDTPNLARLQ

KAE8718449.1 hypothetical protein F3Y22_tig00110013pilonHSYRG00240 [Hibiscus syriacus] | e value: 6.1e-22 | %identity: 26.82
Query:  LPYDRFVNNLARAKYAEFLKRDFLFERGF------SGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRG--------------
        + + +F N+ A+A++  F  R+  FE GF       G     +   +    W +F   P SVNA +V+EFYANI K       VRG              
Subjt:  LPYDRFVNNLARAKYAEFLKRDFLFERGF------SGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRG--------------

Query:  --------------------------------AHWRLSKTEKRTFQSAYLKREANTWMRFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADE
                                          W   +T + +     L+  A  W  F+K +L+PT+H++TVS  R+LL  +++ S  IDVG+II  +
Subjt:  --------------------------------AHWRLSKTEKRTFQSAYLKREANTWMRFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADE

Query:  ISGCWKKKVGKLFFPNTITMLCKRAGVPKNEGDVILFDKGIIDTPNLARLQRTQEARQGGLVY-------GINTILEQLAL-SASRQEFAERQAL-----
        +  C  KK   L FPN IT LC++  V +N  D IL     I    L  L   +  +    V+         N  +  LAL  A  Q  A+  AL     
Subjt:  ISGCWKKKVGKLFFPNTITMLCKRAGVPKNEGDVILFDKGIIDTPNLARLQRTQEARQGGLVY-------GINTILEQLAL-SASRQEFAERQAL-----

Query:  TFWNYVRTLDANLKKALQENFSKPFPALPAFPEDLLNPWIPPPPVEREGDGEEDPGQE
         F+ YV+  D  ++   QE         P FP+++L  +      E E D  + P  +
Subjt:  TFWNYVRTLDANLKKALQENFSKPFPALPAFPEDLLNPWIPPPPVEREGDGEEDPGQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii] | e value: 1.6e-30 | %identity: 28.88
Query:  RFVNNLARAKYAEFLK-RDFLFERGF-------SGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVR-----------------
        +F    A  +Y   ++ R    E+GF        G LP F+   I  H W++FC+ PE     +VREFYAN+   E     VR                 
Subjt:  RFVNNLARAKYAEFLK-RDFLFERGF-------SGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVR-----------------

Query:  -----------------------------GAHWRLSKTEKRTFQSAYLKREANTWMRFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEIS
                                     GA W +S     T   + L   A  W  F+K RLLPTTH  TVS++R+LL  ++L   SI+VG++I  EI 
Subjt:  -----------------------------GAHWRLSKTEKRTFQSAYLKREANTWMRFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEIS

Query:  GCWKKKVGKLFFPNTITMLCKRAGVPKNEGDVILFDKGIIDTPNLARLQR---TQEARQ---------------GGLVYGINTILEQLALSASRQ-----
         C  +K G LFFP+ IT LC+ A  P    +  L + G ID   +AR+ +   T+  +Q               G ++  +  + ++L+    +Q     
Subjt:  GCWKKKVGKLFFPNTITMLCKRAGVPKNEGDVILFDKGIIDTPNLARLQR---TQEARQ---------------GGLVYGINTILEQLALSASRQ-----

Query:  --EFAERQALTFWNYVRTLDANLKKALQENFSKPFPALPAFPEDLLNPWIPPPPVEREGDGEEDPGQ
          +   +Q   FW Y +  D  LKKALQ NF++P P  PAFP+++L         E + DG  +  +
Subjt:  --EFAERQALTFWNYVRTLDANLKKALQENFSKPFPALPAFPEDLLNPWIPPPPVEREGDGEEDPGQ

PON59596.1 hypothetical protein PanWU01x14_158080 [Parasponia andersonii] | e value: 1.2e-22 | %identity: 33.49
Query:  ANTWMRFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEISGCWKKKVGKLFFPNTITMLCKRAGVPKNEGDVILFDKGIIDTPNLARLQRT
        A  W  F+K RLLPTTH  TVS++R+LL +++L   SI+VG++I  EI  C  +K G LFFP+ IT LC+ A  P    +  L   G ID   +AR+  T
Subjt:  ANTWMRFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEISGCWKKKVGKLFFPNTITMLCKRAGVPKNEGDVILFDKGIIDTPNLARLQRT

Query:  QEAR--------------------QGGLVYGINTILEQLALSASRQ-------EFAERQALTFWNYVRTLDANLKKALQENFSKPFPALPAFPEDLLNPW
        QE +                     G ++  +  + ++L+    +Q       +   +Q   FW Y +  D  LKKALQ NF++P P  P FP++LL   
Subjt:  QEAR--------------------QGGLVYGINTILEQLALSASRQ-------EFAERQALTFWNYVRTLDANLKKALQENFSKPFPALPAFPEDLLNPW

Query:  IPPPPVEREGDGEEDPGQ
              E + DG  +  +
Subjt:  IPPPPVEREGDGEEDPGQ

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii] | e value: 1.2e-25 | %identity: 35.46
Query:  EFYANIDKEEGFLAI----VRGAHWRLSKTEKRTFQSAYLKREANTWMRFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEISGCWKKKVG
        EF  NI + E    +      GA W +S     T   + L   A  W  F+K RLLPTTH   VS++R+LL  ++L   SI+VG++I  EI  C  +K G
Subjt:  EFYANIDKEEGFLAI----VRGAHWRLSKTEKRTFQSAYLKREANTWMRFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEISGCWKKKVG

Query:  KLFFPNTITMLCKRAGVPKNEGDVILFDKGIIDTPNLARL------QRTQE-----------ARQGGLVYGINTILEQLALSASRQEFAERQALTFWNYV
         LFFP+ IT LC+ A    NE    L + G ID   +AR+      + TQ+           +R  G V      LEQ     S+QE   +Q   FW Y 
Subjt:  KLFFPNTITMLCKRAGVPKNEGDVILFDKGIIDTPNLARL------QRTQE-----------ARQGGLVYGINTILEQLALSASRQEFAERQALTFWNYV

Query:  RTLDANLKKALQENFSKPFPALPAFPEDLLNPWIPPPPVEREGDGEEDPGQ
        +  D  LKKALQ NF++P P  PAFP+++L         E + DG  +  +
Subjt:  RTLDANLKKALQENFSKPFPALPAFPEDLLNPWIPPPPVEREGDGEEDPGQ

TrEMBL top hits (e value, %identity, alignment)
A0A061F2U9 Uncharacterized protein | e value: 7.7e-23 | %identity: 38.99
Query:  IADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGAHWRLSKTEKRTFQSAYLKREANTWMRFIKQRLLPTTHDSTVSRERVLLAFAILRSLSID
        I D  W +FC +P++    VVREFYAN+ +    +A VRGA W+ S  E  +F+ + +K+E   W+ F+  RLL +TH S V+++R +L +AI+   SID
Subjt:  IADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGAHWRLSKTEKRTFQSAYLKREANTWMRFIKQRLLPTTHDSTVSRERVLLAFAILRSLSID

Query:  VGKIIADEISGCWKKKVGKLFFPNTITMLCKRAGVPKNEGDVILFDKGIIDTPNLARLQ
        VGK+I+  I    + K   + FP+ IT LC RAGV  ++ + +   K  I    L RL+
Subjt:  VGKIIADEISGCWKKKVGKLFFPNTITMLCKRAGVPKNEGDVILFDKGIIDTPNLARLQ

A0A2P5BCG4 Uncharacterized protein (Fragment) | e value: 7.7e-31 | %identity: 28.88
Query:  RFVNNLARAKYAEFLK-RDFLFERGF-------SGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVR-----------------
        +F    A  +Y   ++ R    E+GF        G LP F+   I  H W++FC+ PE     +VREFYAN+   E     VR                 
Subjt:  RFVNNLARAKYAEFLK-RDFLFERGF-------SGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVR-----------------

Query:  -----------------------------GAHWRLSKTEKRTFQSAYLKREANTWMRFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEIS
                                     GA W +S     T   + L   A  W  F+K RLLPTTH  TVS++R+LL  ++L   SI+VG++I  EI 
Subjt:  -----------------------------GAHWRLSKTEKRTFQSAYLKREANTWMRFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEIS

Query:  GCWKKKVGKLFFPNTITMLCKRAGVPKNEGDVILFDKGIIDTPNLARLQR---TQEARQ---------------GGLVYGINTILEQLALSASRQ-----
         C  +K G LFFP+ IT LC+ A  P    +  L + G ID   +AR+ +   T+  +Q               G ++  +  + ++L+    +Q     
Subjt:  GCWKKKVGKLFFPNTITMLCKRAGVPKNEGDVILFDKGIIDTPNLARLQR---TQEARQ---------------GGLVYGINTILEQLALSASRQ-----

Query:  --EFAERQALTFWNYVRTLDANLKKALQENFSKPFPALPAFPEDLLNPWIPPPPVEREGDGEEDPGQ
          +   +Q   FW Y +  D  LKKALQ NF++P P  PAFP+++L         E + DG  +  +
Subjt:  --EFAERQALTFWNYVRTLDANLKKALQENFSKPFPALPAFPEDLLNPWIPPPPVEREGDGEEDPGQ

A0A2P5CEY2 Uncharacterized protein | e value: 5.9e-23 | %identity: 33.49
Query:  ANTWMRFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEISGCWKKKVGKLFFPNTITMLCKRAGVPKNEGDVILFDKGIIDTPNLARLQRT
        A  W  F+K RLLPTTH  TVS++R+LL +++L   SI+VG++I  EI  C  +K G LFFP+ IT LC+ A  P    +  L   G ID   +AR+  T
Subjt:  ANTWMRFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEISGCWKKKVGKLFFPNTITMLCKRAGVPKNEGDVILFDKGIIDTPNLARLQRT

Query:  QEAR--------------------QGGLVYGINTILEQLALSASRQ-------EFAERQALTFWNYVRTLDANLKKALQENFSKPFPALPAFPEDLLNPW
        QE +                     G ++  +  + ++L+    +Q       +   +Q   FW Y +  D  LKKALQ NF++P P  P FP++LL   
Subjt:  QEAR--------------------QGGLVYGINTILEQLALSASRQ-------EFAERQALTFWNYVRTLDANLKKALQENFSKPFPALPAFPEDLLNPW

Query:  IPPPPVEREGDGEEDPGQ
              E + DG  +  +
Subjt:  IPPPPVEREGDGEEDPGQ

A0A2P5DXM3 Uncharacterized protein | e value: 5.7e-26 | %identity: 35.46
Query:  EFYANIDKEEGFLAI----VRGAHWRLSKTEKRTFQSAYLKREANTWMRFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEISGCWKKKVG
        EF  NI + E    +      GA W +S     T   + L   A  W  F+K RLLPTTH   VS++R+LL  ++L   SI+VG++I  EI  C  +K G
Subjt:  EFYANIDKEEGFLAI----VRGAHWRLSKTEKRTFQSAYLKREANTWMRFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEISGCWKKKVG

Query:  KLFFPNTITMLCKRAGVPKNEGDVILFDKGIIDTPNLARL------QRTQE-----------ARQGGLVYGINTILEQLALSASRQEFAERQALTFWNYV
         LFFP+ IT LC+ A    NE    L + G ID   +AR+      + TQ+           +R  G V      LEQ     S+QE   +Q   FW Y 
Subjt:  KLFFPNTITMLCKRAGVPKNEGDVILFDKGIIDTPNLARL------QRTQE-----------ARQGGLVYGINTILEQLALSASRQEFAERQALTFWNYV

Query:  RTLDANLKKALQENFSKPFPALPAFPEDLLNPWIPPPPVEREGDGEEDPGQ
        +  D  LKKALQ NF++P P  PAFP+++L         E + DG  +  +
Subjt:  RTLDANLKKALQENFSKPFPALPAFPEDLLNPWIPPPPVEREGDGEEDPGQ

A0A6A3BU96 Uncharacterized protein | e value: 2.9e-22 | %identity: 26.82
Query:  LPYDRFVNNLARAKYAEFLKRDFLFERGF------SGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRG--------------
        + + +F N+ A+A++  F  R+  FE GF       G     +   +    W +F   P SVNA +V+EFYANI K       VRG              
Subjt:  LPYDRFVNNLARAKYAEFLKRDFLFERGF------SGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRG--------------

Query:  --------------------------------AHWRLSKTEKRTFQSAYLKREANTWMRFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADE
                                          W   +T + +     L+  A  W  F+K +L+PT+H++TVS  R+LL  +++ S  IDVG+II  +
Subjt:  --------------------------------AHWRLSKTEKRTFQSAYLKREANTWMRFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADE

Query:  ISGCWKKKVGKLFFPNTITMLCKRAGVPKNEGDVILFDKGIIDTPNLARLQRTQEARQGGLVY-------GINTILEQLAL-SASRQEFAERQAL-----
        +  C  KK   L FPN IT LC++  V +N  D IL     I    L  L   +  +    V+         N  +  LAL  A  Q  A+  AL     
Subjt:  ISGCWKKKVGKLFFPNTITMLCKRAGVPKNEGDVILFDKGIIDTPNLARLQRTQEARQGGLVY-------GINTILEQLAL-SASRQEFAERQAL-----

Query:  TFWNYVRTLDANLKKALQENFSKPFPALPAFPEDLLNPWIPPPPVEREGDGEEDPGQE
         F+ YV+  D  ++   QE         P FP+++L  +      E E D  + P  +
Subjt:  TFWNYVRTLDANLKKALQENFSKPFPALPAFPEDLLNPWIPPPPVEREGDGEEDPGQE

SwissProt top hits (e value, %identity, alignment)
No hits found

Arabidopsis top hits (e value, %identity, alignment)
No hits found

Sequences
CDS sequence
ATGGCCAAAACGAGAGCAAGAAAAGAGAGAGATAATGAGGAAGAGGAGGTACCCGTGACCCCCGAAGCACCGAAAGTAAAGGCAAAGAAGAAGAAGACACCAGAAGAAAA
AGAAGCTAAAAGAAGAAGAAGACGGCAGAGGGCTGAAGATCAAGAAGCTGTTCAGAAAGCGGCGGAGGATGTTATTGTGGAAGAAGATCCGAAAGAACCAGAAGGACGGA
ACCAAGAGCAGTCTGAGCCAGGAGTTGCGGATACAGAGGAAGTTCGAGAAGAAAATACAGAAGAAGTTCGAGAAGAAAATACAGAGGAGGTTCGAGAAGAAAATACAGAG
GAAGTTCAAGAAAAGCAGGTTGAGGATGTGCAAGAAGAACAGGCAGAGGTTGCACCTGAAGAAGTTAATGAGCAAGAACAGGAGGCTCGTGTGGAGGTGATTATGCCGGA
AGTGCCCAAACGCCGTCGTATAAAGCAAAAAGCGGGCCGTGTTAAGGTAGTCCGAGCTGATACCCCCTCACCTCCAGCTACTGATTCTGAAAGAGAGAATGCTGAGGAAG
AAGAGCGTGGGAAGAAGGAGGCTGAGGATAAAGCAAGAGAGGAAGCAGAGAAAAAGGCTGAAGAAGAAAGATTGCTCAAGCAAAGGGCAGACAGGGGCAAGAGTGTTGCT
GCGGCATCAGAGGAACCGGATGAAATAGAAGAGTCACAATTGCCGTATGATCGTTTTGTCAACAATCTTGCCAGAGCAAAATATGCAGAGTTTCTGAAAAGAGACTTCCT
GTTTGAAAGGGGATTTAGTGGTGATCTTCCACATTTTCTGAGGACCGGTATTGCAGACCACGGGTGGGAACGGTTTTGTTCAAAGCCTGAATCTGTGAATGCGCAGGTGG
TGCGCGAGTTTTATGCAAATATTGACAAAGAAGAAGGTTTCCTAGCAATTGTTCGAGGGGCGCACTGGCGGCTTTCGAAAACAGAGAAGAGGACGTTCCAATCAGCCTAT
TTGAAGAGGGAAGCAAATACTTGGATGAGATTTATCAAACAAAGGCTGCTTCCAACGACTCATGACTCGACGGTTTCTAGGGAACGAGTGCTTCTGGCTTTCGCTATTTT
GAGGTCTCTCAGTATTGATGTGGGAAAAATTATTGCTGATGAAATATCTGGTTGTTGGAAGAAGAAAGTGGGGAAGCTGTTTTTCCCGAATACCATTACCATGCTCTGCA
AGCGAGCAGGGGTTCCAAAGAATGAAGGAGATGTGATATTATTTGACAAGGGAATCATTGACACGCCTAACTTGGCGCGGCTTCAGCGTACGCAAGAGGCACGTCAGGGT
GGGCTGGTCTACGGCATCAACACAATTTTAGAACAACTCGCACTTTCGGCCAGCAGGCAGGAGTTTGCCGAGAGGCAAGCTTTAACCTTTTGGAACTATGTTAGAACTCT
TGATGCCAATTTGAAGAAGGCGCTGCAGGAGAATTTTTCCAAACCATTTCCAGCCCTTCCAGCATTCCCTGAAGATTTATTGAACCCCTGGATTCCGCCACCGCCTGTTG
AGAGAGAAGGAGATGGAGAAGAAGATCCTGGTCAGGAGGATTGA
mRNA sequence
ATGGCCAAAACGAGAGCAAGAAAAGAGAGAGATAATGAGGAAGAGGAGGTACCCGTGACCCCCGAAGCACCGAAAGTAAAGGCAAAGAAGAAGAAGACACCAGAAGAAAA
AGAAGCTAAAAGAAGAAGAAGACGGCAGAGGGCTGAAGATCAAGAAGCTGTTCAGAAAGCGGCGGAGGATGTTATTGTGGAAGAAGATCCGAAAGAACCAGAAGGACGGA
ACCAAGAGCAGTCTGAGCCAGGAGTTGCGGATACAGAGGAAGTTCGAGAAGAAAATACAGAAGAAGTTCGAGAAGAAAATACAGAGGAGGTTCGAGAAGAAAATACAGAG
GAAGTTCAAGAAAAGCAGGTTGAGGATGTGCAAGAAGAACAGGCAGAGGTTGCACCTGAAGAAGTTAATGAGCAAGAACAGGAGGCTCGTGTGGAGGTGATTATGCCGGA
AGTGCCCAAACGCCGTCGTATAAAGCAAAAAGCGGGCCGTGTTAAGGTAGTCCGAGCTGATACCCCCTCACCTCCAGCTACTGATTCTGAAAGAGAGAATGCTGAGGAAG
AAGAGCGTGGGAAGAAGGAGGCTGAGGATAAAGCAAGAGAGGAAGCAGAGAAAAAGGCTGAAGAAGAAAGATTGCTCAAGCAAAGGGCAGACAGGGGCAAGAGTGTTGCT
GCGGCATCAGAGGAACCGGATGAAATAGAAGAGTCACAATTGCCGTATGATCGTTTTGTCAACAATCTTGCCAGAGCAAAATATGCAGAGTTTCTGAAAAGAGACTTCCT
GTTTGAAAGGGGATTTAGTGGTGATCTTCCACATTTTCTGAGGACCGGTATTGCAGACCACGGGTGGGAACGGTTTTGTTCAAAGCCTGAATCTGTGAATGCGCAGGTGG
TGCGCGAGTTTTATGCAAATATTGACAAAGAAGAAGGTTTCCTAGCAATTGTTCGAGGGGCGCACTGGCGGCTTTCGAAAACAGAGAAGAGGACGTTCCAATCAGCCTAT
TTGAAGAGGGAAGCAAATACTTGGATGAGATTTATCAAACAAAGGCTGCTTCCAACGACTCATGACTCGACGGTTTCTAGGGAACGAGTGCTTCTGGCTTTCGCTATTTT
GAGGTCTCTCAGTATTGATGTGGGAAAAATTATTGCTGATGAAATATCTGGTTGTTGGAAGAAGAAAGTGGGGAAGCTGTTTTTCCCGAATACCATTACCATGCTCTGCA
AGCGAGCAGGGGTTCCAAAGAATGAAGGAGATGTGATATTATTTGACAAGGGAATCATTGACACGCCTAACTTGGCGCGGCTTCAGCGTACGCAAGAGGCACGTCAGGGT
GGGCTGGTCTACGGCATCAACACAATTTTAGAACAACTCGCACTTTCGGCCAGCAGGCAGGAGTTTGCCGAGAGGCAAGCTTTAACCTTTTGGAACTATGTTAGAACTCT
TGATGCCAATTTGAAGAAGGCGCTGCAGGAGAATTTTTCCAAACCATTTCCAGCCCTTCCAGCATTCCCTGAAGATTTATTGAACCCCTGGATTCCGCCACCGCCTGTTG
AGAGAGAAGGAGATGGAGAAGAAGATCCTGGTCAGGAGGATTGA
Protein sequence
MAKTRARKERDNEEEEVPVTPEAPKVKAKKKKTPEEKEAKRRRRRQRAEDQEAVQKAAEDVIVEEDPKEPEGRNQEQSEPGVADTEEVREENTEEVREENTEEVREENTE
EVQEKQVEDVQEEQAEVAPEEVNEQEQEARVEVIMPEVPKRRRIKQKAGRVKVVRADTPSPPATDSERENAEEEERGKKEAEDKAREEAEKKAEEERLLKQRADRGKSVA
AASEEPDEIEESQLPYDRFVNNLARAKYAEFLKRDFLFERGFSGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGAHWRLSKTEKRTFQSAY
LKREANTWMRFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIADEISGCWKKKVGKLFFPNTITMLCKRAGVPKNEGDVILFDKGIIDTPNLARLQRTQEARQG
GLVYGINTILEQLALSASRQEFAERQALTFWNYVRTLDANLKKALQENFSKPFPALPAFPEDLLNPWIPPPPVEREGDGEEDPGQED