; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg026733 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg026733
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold6:18945259..18946912
RNA-Seq ExpressionSpg026733
SyntenySpg026733
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8718449.1 hypothetical protein F3Y22_tig00110013pilonHSYRG00240 [Hibiscus syriacus]1.2e-1625.86Show/hide
Query:  RFVNNLARAKYQEMLKRDFLFERGF-------GNELPRFLRTRIGNLGWSQFCAKPEPVNSNFVREFYANLD-------------------------HKE
        +F N+ A+A++Q    R+  FE GF       G   P  +   +  L W +F   P  VN++ V+EFYAN+                          H +
Subjt:  RFVNNLARAKYQEMLKRDFLFERGF-------GNELPRFLRTRIGNLGWSQFCAKPEPVNSNFVREFYANLD-------------------------HKE

Query:  EFQDFPHAVFNEMVVAPSNDQL------------------STAVREM---------GFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIL
        E  D  HA+F E   +   D +                   +  RE           F+K +L+PT+H++TVS  R+LL  +++ S  IDVG+II  ++ 
Subjt:  EFQDFPHAVFNEMVVAPSNDQL------------------STAVREM---------GFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIL

Query:  DCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL-------------QRTQEARQGGLVCDIQQIQELLQLHSSRMEFAERQFQTF
        DC  KK   L FPN IT LCR+  V E+  D ILP    I    L  L             +++    +      +  ++E +    +++       + F
Subjt:  DCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL-------------QRTQEARQGGLVCDIQQIQELLQLHSSRMEFAERQFQTF

Query:  WDYVKRRDAALRVALQSNFSEPYPALPVFPEDLLNPWIPPPPVEREED
        + YVK RD  +    Q          P FP+++L  +      E E D
Subjt:  WDYVKRRDAALRVALQSNFSEPYPALPVFPEDLLNPWIPPPPVEREED

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]1.2e-1631.89Show/hide
Query:  IRFVNNLARAKYQEMLK-RDFLFERGF-------GNELPRFLRTRIGNLGWSQFCAKPEPVNSNFVREFYANL---------------------------
        ++F    A  +Y+  ++ R    E+GF         +LP F+   I    W QFCA PE      VREFYANL                           
Subjt:  IRFVNNLARAKYQEMLK-RDFLFERGF-------GNELPRFLRTRIGNLGWSQFCAKPEPVNSNFVREFYANL---------------------------

Query:  ------DHKEEFQDFPH----AVFNEMVVAPSNDQLS-----TAVRE---------MGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI
              +H E  ++        V   + VA +   +S     T +R            F+K  LLPTTH  TVS+DR+LL  ++L   SI+VG++I SEI
Subjt:  ------DHKEEFQDFPH----AVFNEMVVAPSNDQLS-----TAVRE---------MGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI

Query:  LDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQRTQE
          C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+  TQE
Subjt:  LDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]3.3e-2730.16Show/hide
Query:  IRFVNNLARAKYQEMLK-RDFLFERGF-------GNELPRFLRTRIGNLGWSQFCAKPEPVNSNFVREFYANLDHKEE------------FQDFPHAVF-
        ++F    A  +Y+  ++ R    E+GF         +LP F+   I    W QFCA PE      VREFYANL   EE             ++  +AVF 
Subjt:  IRFVNNLARAKYQEMLK-RDFLFERGF-------GNELPRFLRTRIGNLGWSQFCAKPEPVNSNFVREFYANLDHKEE------------FQDFPHAVF-

Query:  --------NEMVVAPSNDQLSTAVREMG------------------------------FIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI
                +E +   +   L T +  +                               F+K RLLPTTH  TVS+DR+LL  ++L   SI+VG++I SEI
Subjt:  --------NEMVVAPSNDQLSTAVREMG------------------------------FIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI

Query:  LDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQR---TQEARQ---------------GGLVCDIQQI------QELLQLH-S
          C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+ +   T+  +Q               G ++  ++ +      QE+ Q H  
Subjt:  LDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQR---TQEARQ---------------GGLVCDIQQI------QELLQLH-S

Query:  SRMEFAERQFQTFWDYVKRRDAALRVALQSNFSEPYPALPVFPEDLLNPWIPPPPVEREEDDEEQGQE
        S ++   +Q Q FW Y K RD AL+ ALQ+NF+ P P  P FP+++L         E ++D   +  E
Subjt:  SRMEFAERQFQTFWDYVKRRDAALRVALQSNFSEPYPALPVFPEDLLNPWIPPPPVEREEDDEEQGQE

PON59596.1 hypothetical protein PanWU01x14_158080 [Parasponia andersonii]8.5e-2337.26Show/hide
Query:  FIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQRTQEAR--
        F+K RLLPTTH  TVS+DR+LL +++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR A  P   ++  L   G ID   +AR+  TQE +  
Subjt:  FIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQRTQEAR--

Query:  ------------------QGGLVCDIQQI------QELLQLH-SSRMEFAERQFQTFWDYVKRRDAALRVALQSNFSEPYPALPVFPEDLLNPWIPPPPV
                           G ++  ++ +      QE+ Q H  S ++   +Q Q FW Y K RD AL+ ALQ+NF+ P P  P FP++LL         
Subjt:  ------------------QGGLVCDIQQI------QELLQLH-SSRMEFAERQFQTFWDYVKRRDAALRVALQSNFSEPYPALPVFPEDLLNPWIPPPPV

Query:  EREEDDEEQGQE
        E ++D   +  E
Subjt:  EREEDDEEQGQE

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]4.2e-2237.69Show/hide
Query:  FIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL---------
        F+K RLLPTTH   VS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR A    +E+   L + G ID   +AR+         
Subjt:  FIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL---------

Query:  -----QRTQEARQGGLVCDIQQIQELLQLHSSRMEFAERQFQTFWDYVKRRDAALRVALQSNFSEPYPALPVFPEDLLNPWIPPPPVEREEDDEEQGQE
              R   A       D+ Q  + L+   S+ E   +Q Q FW Y K RD AL+ ALQ+NF+ P P  P FP+++L         E ++D   +  E
Subjt:  -----QRTQEARQGGLVCDIQQIQELLQLHSSRMEFAERQFQTFWDYVKRRDAALRVALQSNFSEPYPALPVFPEDLLNPWIPPPPVEREEDDEEQGQE

TrEMBL top hitse value%identityAlignment
A0A2P5BCG4 Uncharacterized protein (Fragment)1.6e-2730.16Show/hide
Query:  IRFVNNLARAKYQEMLK-RDFLFERGF-------GNELPRFLRTRIGNLGWSQFCAKPEPVNSNFVREFYANLDHKEE------------FQDFPHAVF-
        ++F    A  +Y+  ++ R    E+GF         +LP F+   I    W QFCA PE      VREFYANL   EE             ++  +AVF 
Subjt:  IRFVNNLARAKYQEMLK-RDFLFERGF-------GNELPRFLRTRIGNLGWSQFCAKPEPVNSNFVREFYANLDHKEE------------FQDFPHAVF-

Query:  --------NEMVVAPSNDQLSTAVREMG------------------------------FIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI
                +E +   +   L T +  +                               F+K RLLPTTH  TVS+DR+LL  ++L   SI+VG++I SEI
Subjt:  --------NEMVVAPSNDQLSTAVREMG------------------------------FIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEI

Query:  LDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQR---TQEARQ---------------GGLVCDIQQI------QELLQLH-S
          C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+ +   T+  +Q               G ++  ++ +      QE+ Q H  
Subjt:  LDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQR---TQEARQ---------------GGLVCDIQQI------QELLQLH-S

Query:  SRMEFAERQFQTFWDYVKRRDAALRVALQSNFSEPYPALPVFPEDLLNPWIPPPPVEREEDDEEQGQE
        S ++   +Q Q FW Y K RD AL+ ALQ+NF+ P P  P FP+++L         E ++D   +  E
Subjt:  SRMEFAERQFQTFWDYVKRRDAALRVALQSNFSEPYPALPVFPEDLLNPWIPPPPVEREEDDEEQGQE

A0A2P5CEY2 Uncharacterized protein4.1e-2337.26Show/hide
Query:  FIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQRTQEAR--
        F+K RLLPTTH  TVS+DR+LL +++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR A  P   ++  L   G ID   +AR+  TQE +  
Subjt:  FIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQRTQEAR--

Query:  ------------------QGGLVCDIQQI------QELLQLH-SSRMEFAERQFQTFWDYVKRRDAALRVALQSNFSEPYPALPVFPEDLLNPWIPPPPV
                           G ++  ++ +      QE+ Q H  S ++   +Q Q FW Y K RD AL+ ALQ+NF+ P P  P FP++LL         
Subjt:  ------------------QGGLVCDIQQI------QELLQLH-SSRMEFAERQFQTFWDYVKRRDAALRVALQSNFSEPYPALPVFPEDLLNPWIPPPPV

Query:  EREEDDEEQGQE
        E ++D   +  E
Subjt:  EREEDDEEQGQE

A0A2P5DXM3 Uncharacterized protein2.0e-2237.69Show/hide
Query:  FIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL---------
        F+K RLLPTTH   VS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR A    +E+   L + G ID   +AR+         
Subjt:  FIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL---------

Query:  -----QRTQEARQGGLVCDIQQIQELLQLHSSRMEFAERQFQTFWDYVKRRDAALRVALQSNFSEPYPALPVFPEDLLNPWIPPPPVEREEDDEEQGQE
              R   A       D+ Q  + L+   S+ E   +Q Q FW Y K RD AL+ ALQ+NF+ P P  P FP+++L         E ++D   +  E
Subjt:  -----QRTQEARQGGLVCDIQQIQELLQLHSSRMEFAERQFQTFWDYVKRRDAALRVALQSNFSEPYPALPVFPEDLLNPWIPPPPVEREEDDEEQGQE

A0A6A2Y697 Reverse transcriptase domain-containing protein2.2e-1626.41Show/hide
Query:  VEAQGLPFIRFVNNLARAKYQEMLKRDFLFERGFGNELPRFLRTRIGNLG-----------WSQFCAKPEPVNSNFVREFYANLDHKEEFQD--------
        V  +G  F +F N  A+A++Q    R   FE  F      F +   G LG           W +F   P  VN++        +D   +F+D        
Subjt:  VEAQGLPFIRFVNNLARAKYQEMLKRDFLFERGFGNELPRFLRTRIGNLG-----------WSQFCAKPEPVNSNFVREFYANLDHKEEFQD--------

Query:  -------FPHAVFN-----EMVVAPSNDQLSTAVREMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITML
               F +  +N        V   N QL   +    F+K +L+PT+H++TVS  R+LL  +I+ S  IDVG+II  ++ DC  KK   L FPN IT L
Subjt:  -------FPHAVFN-----EMVVAPSNDQLSTAVREMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITML

Query:  CRRAGVPESEDDMILPDKGIIDTPNLARL-------------QRTQEARQGGLVCDIQQIQELLQLHSSRMEFAERQFQTFWDYVKRRDAALRVALQSNF
        CR+  V E+  D ILP    I    L  L             +++    Q      +  ++E +    + +         F+ YVK RDA +    Q   
Subjt:  CRRAGVPESEDDMILPDKGIIDTPNLARL-------------QRTQEARQGGLVCDIQQIQELLQLHSSRMEFAERQFQTFWDYVKRRDAALRVALQSNF

Query:  SEPYPALPVFPEDLLNPWIPPPPVEREEDDEEQGQED
               P F +++L+ +     +E ++D+E+   +D
Subjt:  SEPYPALPVFPEDLLNPWIPPPPVEREEDDEEQGQED

A0A6A3BU96 Uncharacterized protein5.8e-1725.86Show/hide
Query:  RFVNNLARAKYQEMLKRDFLFERGF-------GNELPRFLRTRIGNLGWSQFCAKPEPVNSNFVREFYANLD-------------------------HKE
        +F N+ A+A++Q    R+  FE GF       G   P  +   +  L W +F   P  VN++ V+EFYAN+                          H +
Subjt:  RFVNNLARAKYQEMLKRDFLFERGF-------GNELPRFLRTRIGNLGWSQFCAKPEPVNSNFVREFYANLD-------------------------HKE

Query:  EFQDFPHAVFNEMVVAPSNDQL------------------STAVREM---------GFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIL
        E  D  HA+F E   +   D +                   +  RE           F+K +L+PT+H++TVS  R+LL  +++ S  IDVG+II  ++ 
Subjt:  EFQDFPHAVFNEMVVAPSNDQL------------------STAVREM---------GFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKIISSEIL

Query:  DCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL-------------QRTQEARQGGLVCDIQQIQELLQLHSSRMEFAERQFQTF
        DC  KK   L FPN IT LCR+  V E+  D ILP    I    L  L             +++    +      +  ++E +    +++       + F
Subjt:  DCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARL-------------QRTQEARQGGLVCDIQQIQELLQLHSSRMEFAERQFQTF

Query:  WDYVKRRDAALRVALQSNFSEPYPALPVFPEDLLNPWIPPPPVEREED
        + YVK RD  +    Q          P FP+++L  +      E E D
Subjt:  WDYVKRRDAALRVALQSNFSEPYPALPVFPEDLLNPWIPPPPVEREED

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAAAACACGAGCAAGGAAAGAAAGAGAAAGTGAGGAGGAGGAGGTGCCCGTTACCCCTGAAGTTCAGAAAGCTAAAACCAAGAAGAAGAAAACGCCAGAAGAGAA
AGAAGCTAAACGGAGAAGAAGGCAGCAGAGGGCTGCGGAGCAAGAGGCTATCCAAGAAGGACCAGTGAATGACCCAGATACGGAAAGAATTCAGAATCCTGAGGTAGAAA
CGATAGTCCAAGATTCGGTGCAAAAGGAGAATGTTGAGAAGAATCAAGAAACACAAGCTGAAGAAGTTCGAGACGAACAGGCTGCGGTTGTGCCTGAGGAAGGGGATGAA
CAGGAAACGGTGCAGGAGGCTCATGTTGAGGTCATAATGCCTGAACCACCAAAGCATCGCCGCATCAAGCGGAAGGCTGGGCGCGTTCAGAGCGAGAGGAAAGAGAGAAA
AAAAAAAGCTGAGGAAAAAGTGCGAGAAGAAGCAAAGAGGGCTGAGGAAGAGATTTTGCGCAAGCGAAGAGAAGACAAGGGCAAAGGTATTGCCAAGGCATCAGGTGCGG
CTGACGAGGTTGAGGCACAAGGGTTACCTTTTATTCGCTTCGTCAACAACCTTGCTCGAGCAAAATACCAGGAGATGCTGAAACGGGACTTTCTGTTCGAGCGAGGATTT
GGAAATGAGTTGCCACGGTTCTTGAGGACTAGAATAGGAAACCTCGGTTGGAGCCAATTTTGTGCGAAACCAGAGCCTGTGAATTCCAACTTTGTTCGGGAATTTTACGC
GAATCTTGACCATAAGGAAGAATTTCAGGATTTTCCGCATGCAGTCTTCAATGAGATGGTGGTTGCCCCATCTAACGATCAGTTAAGTACGGCTGTCCGAGAGATGGGTT
TTATTAAGTTGCGCTTACTACCAACTACGCATGACTCCACAGTATCTCGGGACAGGGTATTGCTTGCCTTTGCTATTCTTCGTTCAATGAGTATTGATGTAGGAAAAATA
ATTTCGTCTGAGATTCTTGACTGCTGGCGGAAAAAGGTGGGGAAGCTGTTTTTCCCCAACACTATCACGATGCTATGCCGAAGGGCAGGGGTGCCAGAGAGTGAGGATGA
TATGATATTACCAGATAAGGGAATAATTGATACGCCAAATTTGGCTAGGCTTCAGAGAACGCAGGAAGCACGCCAAGGGGGTTTGGTGTGCGACATCCAACAAATTCAGG
AACTGTTGCAACTGCATTCCAGCAGAATGGAATTCGCTGAAAGGCAGTTTCAGACTTTCTGGGACTATGTAAAGAGAAGGGATGCCGCCTTAAGGGTGGCCTTGCAATCA
AATTTTTCCGAACCATACCCGGCCTTACCCGTATTCCCTGAGGACCTACTGAACCCCTGGATCCCACCCCCACCTGTTGAACGAGAGGAAGATGATGAAGAGCAGGGTCA
GGAAGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAAAAACACGAGCAAGGAAAGAAAGAGAAAGTGAGGAGGAGGAGGTGCCCGTTACCCCTGAAGTTCAGAAAGCTAAAACCAAGAAGAAGAAAACGCCAGAAGAGAA
AGAAGCTAAACGGAGAAGAAGGCAGCAGAGGGCTGCGGAGCAAGAGGCTATCCAAGAAGGACCAGTGAATGACCCAGATACGGAAAGAATTCAGAATCCTGAGGTAGAAA
CGATAGTCCAAGATTCGGTGCAAAAGGAGAATGTTGAGAAGAATCAAGAAACACAAGCTGAAGAAGTTCGAGACGAACAGGCTGCGGTTGTGCCTGAGGAAGGGGATGAA
CAGGAAACGGTGCAGGAGGCTCATGTTGAGGTCATAATGCCTGAACCACCAAAGCATCGCCGCATCAAGCGGAAGGCTGGGCGCGTTCAGAGCGAGAGGAAAGAGAGAAA
AAAAAAAGCTGAGGAAAAAGTGCGAGAAGAAGCAAAGAGGGCTGAGGAAGAGATTTTGCGCAAGCGAAGAGAAGACAAGGGCAAAGGTATTGCCAAGGCATCAGGTGCGG
CTGACGAGGTTGAGGCACAAGGGTTACCTTTTATTCGCTTCGTCAACAACCTTGCTCGAGCAAAATACCAGGAGATGCTGAAACGGGACTTTCTGTTCGAGCGAGGATTT
GGAAATGAGTTGCCACGGTTCTTGAGGACTAGAATAGGAAACCTCGGTTGGAGCCAATTTTGTGCGAAACCAGAGCCTGTGAATTCCAACTTTGTTCGGGAATTTTACGC
GAATCTTGACCATAAGGAAGAATTTCAGGATTTTCCGCATGCAGTCTTCAATGAGATGGTGGTTGCCCCATCTAACGATCAGTTAAGTACGGCTGTCCGAGAGATGGGTT
TTATTAAGTTGCGCTTACTACCAACTACGCATGACTCCACAGTATCTCGGGACAGGGTATTGCTTGCCTTTGCTATTCTTCGTTCAATGAGTATTGATGTAGGAAAAATA
ATTTCGTCTGAGATTCTTGACTGCTGGCGGAAAAAGGTGGGGAAGCTGTTTTTCCCCAACACTATCACGATGCTATGCCGAAGGGCAGGGGTGCCAGAGAGTGAGGATGA
TATGATATTACCAGATAAGGGAATAATTGATACGCCAAATTTGGCTAGGCTTCAGAGAACGCAGGAAGCACGCCAAGGGGGTTTGGTGTGCGACATCCAACAAATTCAGG
AACTGTTGCAACTGCATTCCAGCAGAATGGAATTCGCTGAAAGGCAGTTTCAGACTTTCTGGGACTATGTAAAGAGAAGGGATGCCGCCTTAAGGGTGGCCTTGCAATCA
AATTTTTCCGAACCATACCCGGCCTTACCCGTATTCCCTGAGGACCTACTGAACCCCTGGATCCCACCCCCACCTGTTGAACGAGAGGAAGATGATGAAGAGCAGGGTCA
GGAAGATTGA
Protein sequenceShow/hide protein sequence
MEKTRARKERESEEEEVPVTPEVQKAKTKKKKTPEEKEAKRRRRQQRAAEQEAIQEGPVNDPDTERIQNPEVETIVQDSVQKENVEKNQETQAEEVRDEQAAVVPEEGDE
QETVQEAHVEVIMPEPPKHRRIKRKAGRVQSERKERKKKAEEKVREEAKRAEEEILRKRREDKGKGIAKASGAADEVEAQGLPFIRFVNNLARAKYQEMLKRDFLFERGF
GNELPRFLRTRIGNLGWSQFCAKPEPVNSNFVREFYANLDHKEEFQDFPHAVFNEMVVAPSNDQLSTAVREMGFIKLRLLPTTHDSTVSRDRVLLAFAILRSMSIDVGKI
ISSEILDCWRKKVGKLFFPNTITMLCRRAGVPESEDDMILPDKGIIDTPNLARLQRTQEARQGGLVCDIQQIQELLQLHSSRMEFAERQFQTFWDYVKRRDAALRVALQS
NFSEPYPALPVFPEDLLNPWIPPPPVEREEDDEEQGQED