; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg026294 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg026294
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold13:15380541..15382186
RNA-Seq ExpressionSpg026294
SyntenySpg026294
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8718449.1 hypothetical protein F3Y22_tig00110013pilonHSYRG00240 [Hibiscus syriacus]9.7e-1425.14Show/hide
Query:  VPYNRFINNLARAKYVEMLRRDFLFE----------RGFGDDLPRFLRTGIVNLGWSQFCAKPEPVNSNIVREFYANIDDHKEFQIVKDGKA--------
        + + +F N+ A+A++     R+  FE           GFG D+       ++ L W +F   P  VN+++V+EFYANI    +  I   GK         
Subjt:  VPYNRFINNLARAKYVEMLRRDFLFE----------RGFGDDLPRFLRTGIVNLGWSQFCAKPEPVNSNIVREFYANIDDHKEFQIVKDGKA--------

Query:  ----HI-----SGCLFEE--------------------------------------RGQYMNGFHQAALTVDNSRLNVSRDRVLLAFAILRSMSIDVGKI
            H+        +FEE                                      R +  N F +  L   +    VS  R+LL  +++ S  IDVG+I
Subjt:  ----HI-----SGCLFEE--------------------------------------RGQYMNGFHQAALTVDNSRLNVSRDRVLLAFAILRSMSIDVGKI

Query:  ISSEILDCWRKKVGKLFFPNTITMLCRRAGVPED-------------EDDVPLM-------DKGIIDTPNLARLQRTQEARQGGLVCSIHQMQEQLQ-LH
        I  ++ DC  KK   L FPN IT LCR+  V E+              D +PL+        K  +   ++   +   E R   L  ++ Q Q QL  LH
Subjt:  ISSEILDCWRKKVGKLFFPNTITMLCRRAGVPED-------------EDDVPLM-------DKGIIDTPNLARLQRTQEARQGGLVCSIHQMQEQLQ-LH

Query:  SSRMEFAERQFQTFWDYVKRRDVALRVALQSNFSKPYPALPVFPDDLLNPWILPPPVEKEGD
                   + F+ YVK RDV +    Q          P FPD++L  +      E E D
Subjt:  SSRMEFAERQFQTFWDYVKRRDVALRVALQSNFSKPYPALPVFPDDLLNPWILPPPVEKEGD

KAF4375842.1 hypothetical protein G4B88_026421 [Cannabis sativa]2.7e-1627.72Show/hide
Query:  IEEPRVPYNRFIN---NLARAKYVEMLR-RDFLFERGF--GDD----LPRFLRTGIVNLGWSQFCAKPEPVNSNIVREFYANIDDHK-------------
        I  PR    R  N    L + KY+  +R ++F  +RG   GD+    +P +L   I    W Q C  P      +V+EFYAN   H+             
Subjt:  IEEPRVPYNRFIN---NLARAKYVEMLR-RDFLFERGF--GDD----LPRFLRTGIVNLGWSQFCAKPEPVNSNIVREFYANIDDHK-------------

Query:  -------EFQIVKDGKAHIS---GCLFEE----RGQY-------------MNGFHQAALTVDNSRLNVSRDRVLLAFAILRSMSIDVGKIISSEILDCWR
                + ++K+     S   G L +E    RG +             +  F Q+ L   +    VSR+R+ + + I++   I+VGK+I+ EI +C  
Subjt:  -------EFQIVKDGKAHIS---GCLFEE----RGQY-------------MNGFHQAALTVDNSRLNVSRDRVLLAFAILRSMSIDVGKIISSEILDCWR

Query:  KKVGKLFFPNTITMLCRRAGVPEDEDDVPLMDKGIID-TPNLARLQRTQEARQGGLVCSIHQMQEQLQLHSSRMEFAERQFQTFWDYVKRRDVALRVALQ
        +  GKLFF   IT  CR A VP   D+ P+  KG++   P+ A  + T          S   M E+L  H +  +    + QT W+Y + RDV +   L+
Subjt:  KKVGKLFFPNTITMLCRRAGVPEDEDDVPLMDKGIID-TPNLARLQRTQEARQGGLVCSIHQMQEQLQLHSSRMEFAERQFQTFWDYVKRRDVALRVALQ

Query:  SNF
         N+
Subjt:  SNF

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]6.3e-2128.8Show/hide
Query:  RFINNLARAKYVEMLR-RDFLFERGFGDD-------LPRFLRTGIVNLGWSQFCAKPEPVNSNIVREFYANIDDHKEFQIVKDGKAHIS-----------
        +F    A  +Y   ++ R    E+GF  D       LP F+   I    W QFCA PE     +VREFYAN+ D +E  +   G   +S           
Subjt:  RFINNLARAKYVEMLR-RDFLFERGFGDD-------LPRFLRTGIVNLGWSQFCAKPEPVNSNIVREFYANIDDHKEFQIVKDGKAHIS-----------

Query:  -GCLFEERGQYMNGFHQ-----------------------------AALTVD--------NSRL-------NVSRDRVLLAFAILRSMSIDVGKIISSEI
         G   +E  +++    Q                             +ALT           SRL        VS+DR+LL  ++L   SI+VG++I SEI
Subjt:  -GCLFEERGQYMNGFHQ-----------------------------AALTVD--------NSRL-------NVSRDRVLLAFAILRSMSIDVGKIISSEI

Query:  LDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLMDKGIIDTPNLARLQR---TQEARQ---------------GGLVCSIHQMQEQL------QLH-S
          C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+ +   T+  +Q               G ++  +  ++++L      Q H  
Subjt:  LDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLMDKGIIDTPNLARLQR---TQEARQ---------------GGLVCSIHQMQEQL------QLH-S

Query:  SRMEFAERQFQTFWDYVKRRDVALRVALQSNFSKPYPALPVFPDDLLNPWILPPPVEKEGDEEEEAGQ
        S ++   +Q Q FW Y K RD AL+ ALQ+NF++P P  P FP ++L         E + D   EA +
Subjt:  SRMEFAERQFQTFWDYVKRRDVALRVALQSNFSKPYPALPVFPDDLLNPWILPPPVEKEGDEEEEAGQ

PON59596.1 hypothetical protein PanWU01x14_158080 [Parasponia andersonii]2.5e-1734.85Show/hide
Query:  VSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLMDKGIIDTPNLARLQRTQEAR----------------
        VS+DR+LL +++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR A  P   ++  L   G ID   +AR+  TQE +                
Subjt:  VSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLMDKGIIDTPNLARLQRTQEAR----------------

Query:  ----QGGLVCSIHQMQEQL------QLH-SSRMEFAERQFQTFWDYVKRRDVALRVALQSNFSKPYPALPVFPDDLLNPWILPPPVEKEGDEEEEAGQ
             G ++  +  ++++L      Q H  S ++   +Q Q FW Y K RD AL+ ALQ+NF++P P  P FP +LL         E + D   EA +
Subjt:  ----QGGLVCSIHQMQEQL------QLH-SSRMEFAERQFQTFWDYVKRRDVALRVALQSNFSKPYPALPVFPDDLLNPWILPPPVEKEGDEEEEAGQ

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]1.9e-1736.22Show/hide
Query:  VSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLMDKGIIDTPNLARL---------QRTQEARQGGLVCS
        VS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR A    +E+   L + G ID   +AR+         Q+   +R      S
Subjt:  VSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLMDKGIIDTPNLARL---------QRTQEARQGGLVCS

Query:  -----IHQMQEQLQLHSSRMEFAERQFQTFWDYVKRRDVALRVALQSNFSKPYPALPVFPDDLLNPWILPPPVEKEGDEEEEAGQ
             + Q  + L+   S+ E   +Q Q FW Y K RD AL+ ALQ+NF++P P  P FP ++L         E + D   EA +
Subjt:  -----IHQMQEQLQLHSSRMEFAERQFQTFWDYVKRRDVALRVALQSNFSKPYPALPVFPDDLLNPWILPPPVEKEGDEEEEAGQ

TrEMBL top hitse value%identityAlignment
A0A2P5BCG4 Uncharacterized protein (Fragment)3.0e-2128.8Show/hide
Query:  RFINNLARAKYVEMLR-RDFLFERGFGDD-------LPRFLRTGIVNLGWSQFCAKPEPVNSNIVREFYANIDDHKEFQIVKDGKAHIS-----------
        +F    A  +Y   ++ R    E+GF  D       LP F+   I    W QFCA PE     +VREFYAN+ D +E  +   G   +S           
Subjt:  RFINNLARAKYVEMLR-RDFLFERGFGDD-------LPRFLRTGIVNLGWSQFCAKPEPVNSNIVREFYANIDDHKEFQIVKDGKAHIS-----------

Query:  -GCLFEERGQYMNGFHQ-----------------------------AALTVD--------NSRL-------NVSRDRVLLAFAILRSMSIDVGKIISSEI
         G   +E  +++    Q                             +ALT           SRL        VS+DR+LL  ++L   SI+VG++I SEI
Subjt:  -GCLFEERGQYMNGFHQ-----------------------------AALTVD--------NSRL-------NVSRDRVLLAFAILRSMSIDVGKIISSEI

Query:  LDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLMDKGIIDTPNLARLQR---TQEARQ---------------GGLVCSIHQMQEQL------QLH-S
          C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+ +   T+  +Q               G ++  +  ++++L      Q H  
Subjt:  LDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLMDKGIIDTPNLARLQR---TQEARQ---------------GGLVCSIHQMQEQL------QLH-S

Query:  SRMEFAERQFQTFWDYVKRRDVALRVALQSNFSKPYPALPVFPDDLLNPWILPPPVEKEGDEEEEAGQ
        S ++   +Q Q FW Y K RD AL+ ALQ+NF++P P  P FP ++L         E + D   EA +
Subjt:  SRMEFAERQFQTFWDYVKRRDVALRVALQSNFSKPYPALPVFPDDLLNPWILPPPVEKEGDEEEEAGQ

A0A2P5CEY2 Uncharacterized protein1.2e-1734.85Show/hide
Query:  VSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLMDKGIIDTPNLARLQRTQEAR----------------
        VS+DR+LL +++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR A  P   ++  L   G ID   +AR+  TQE +                
Subjt:  VSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLMDKGIIDTPNLARLQRTQEAR----------------

Query:  ----QGGLVCSIHQMQEQL------QLH-SSRMEFAERQFQTFWDYVKRRDVALRVALQSNFSKPYPALPVFPDDLLNPWILPPPVEKEGDEEEEAGQ
             G ++  +  ++++L      Q H  S ++   +Q Q FW Y K RD AL+ ALQ+NF++P P  P FP +LL         E + D   EA +
Subjt:  ----QGGLVCSIHQMQEQL------QLH-SSRMEFAERQFQTFWDYVKRRDVALRVALQSNFSKPYPALPVFPDDLLNPWILPPPVEKEGDEEEEAGQ

A0A2P5DXM3 Uncharacterized protein9.1e-1836.22Show/hide
Query:  VSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLMDKGIIDTPNLARL---------QRTQEARQGGLVCS
        VS+DR+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT LCR A    +E+   L + G ID   +AR+         Q+   +R      S
Subjt:  VSRDRVLLAFAILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLMDKGIIDTPNLARL---------QRTQEARQGGLVCS

Query:  -----IHQMQEQLQLHSSRMEFAERQFQTFWDYVKRRDVALRVALQSNFSKPYPALPVFPDDLLNPWILPPPVEKEGDEEEEAGQ
             + Q  + L+   S+ E   +Q Q FW Y K RD AL+ ALQ+NF++P P  P FP ++L         E + D   EA +
Subjt:  -----IHQMQEQLQLHSSRMEFAERQFQTFWDYVKRRDVALRVALQSNFSKPYPALPVFPDDLLNPWILPPPVEKEGDEEEEAGQ

A0A6A3BU96 Uncharacterized protein4.7e-1425.14Show/hide
Query:  VPYNRFINNLARAKYVEMLRRDFLFE----------RGFGDDLPRFLRTGIVNLGWSQFCAKPEPVNSNIVREFYANIDDHKEFQIVKDGKA--------
        + + +F N+ A+A++     R+  FE           GFG D+       ++ L W +F   P  VN+++V+EFYANI    +  I   GK         
Subjt:  VPYNRFINNLARAKYVEMLRRDFLFE----------RGFGDDLPRFLRTGIVNLGWSQFCAKPEPVNSNIVREFYANIDDHKEFQIVKDGKA--------

Query:  ----HI-----SGCLFEE--------------------------------------RGQYMNGFHQAALTVDNSRLNVSRDRVLLAFAILRSMSIDVGKI
            H+        +FEE                                      R +  N F +  L   +    VS  R+LL  +++ S  IDVG+I
Subjt:  ----HI-----SGCLFEE--------------------------------------RGQYMNGFHQAALTVDNSRLNVSRDRVLLAFAILRSMSIDVGKI

Query:  ISSEILDCWRKKVGKLFFPNTITMLCRRAGVPED-------------EDDVPLM-------DKGIIDTPNLARLQRTQEARQGGLVCSIHQMQEQLQ-LH
        I  ++ DC  KK   L FPN IT LCR+  V E+              D +PL+        K  +   ++   +   E R   L  ++ Q Q QL  LH
Subjt:  ISSEILDCWRKKVGKLFFPNTITMLCRRAGVPED-------------EDDVPLM-------DKGIIDTPNLARLQRTQEARQGGLVCSIHQMQEQLQ-LH

Query:  SSRMEFAERQFQTFWDYVKRRDVALRVALQSNFSKPYPALPVFPDDLLNPWILPPPVEKEGD
                   + F+ YVK RDV +    Q          P FPD++L  +      E E D
Subjt:  SSRMEFAERQFQTFWDYVKRRDVALRVALQSNFSKPYPALPVFPDDLLNPWILPPPVEKEGD

A0A7J6FZ22 Uncharacterized protein1.3e-1627.72Show/hide
Query:  IEEPRVPYNRFIN---NLARAKYVEMLR-RDFLFERGF--GDD----LPRFLRTGIVNLGWSQFCAKPEPVNSNIVREFYANIDDHK-------------
        I  PR    R  N    L + KY+  +R ++F  +RG   GD+    +P +L   I    W Q C  P      +V+EFYAN   H+             
Subjt:  IEEPRVPYNRFIN---NLARAKYVEMLR-RDFLFERGF--GDD----LPRFLRTGIVNLGWSQFCAKPEPVNSNIVREFYANIDDHK-------------

Query:  -------EFQIVKDGKAHIS---GCLFEE----RGQY-------------MNGFHQAALTVDNSRLNVSRDRVLLAFAILRSMSIDVGKIISSEILDCWR
                + ++K+     S   G L +E    RG +             +  F Q+ L   +    VSR+R+ + + I++   I+VGK+I+ EI +C  
Subjt:  -------EFQIVKDGKAHIS---GCLFEE----RGQY-------------MNGFHQAALTVDNSRLNVSRDRVLLAFAILRSMSIDVGKIISSEILDCWR

Query:  KKVGKLFFPNTITMLCRRAGVPEDEDDVPLMDKGIID-TPNLARLQRTQEARQGGLVCSIHQMQEQLQLHSSRMEFAERQFQTFWDYVKRRDVALRVALQ
        +  GKLFF   IT  CR A VP   D+ P+  KG++   P+ A  + T          S   M E+L  H +  +    + QT W+Y + RDV +   L+
Subjt:  KKVGKLFFPNTITMLCRRAGVPEDEDDVPLMDKGIID-TPNLARLQRTQEARQGGLVCSIHQMQEQLQLHSSRMEFAERQFQTFWDYVKRRDVALRVALQ

Query:  SNF
         N+
Subjt:  SNF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAAGACGAGAGAAAACAAAGAAAGAGGCAATGAGGAGGAAGCAGTACCAGTTAGTCCTGAGGTACCGAAAGCGAAAACAAAGAAGAAGAGAACACCTGAAGAGAG
GGAAGCCAAGAGGCGAAAGAGACAACAGAGGATTGAAGATCAAGAAACTATCCAAGAAGAACCAGTGAATGCCCCAGATACAGAAGGAGTTCAAAATCCTGAGATAGAAC
CGATAGTCCCAGATACGGTTCAAGACGAGAATATTGAGACGAATCAAGAAACACGGGCTGACGAAGTTCGAGACGAACAGGGCGCAGTTGTGCCTGAAAAAGGGAATGAA
AAGGAACCGGTGCAGGAGGCTCGGGTAGAGGTAGTCATGCCGGAAGTGCCAAAGCGTCGCCGCATTAAGCGTAAGGCGGGCCGCGTCAGGGTGATTCGAACTACCCCATC
ACCTCCGTCGTCAGATTCAGAGGAAGAGAGAAGAGAACAGGAAAAGAAGGAGGCGGAAGACAAAGCGAGGGAGGAAGAAGAAAAGAAGGCTGAGGAAGAAATTTTGGCCA
AGCAGAGGGAAGACAAGGGCAAAGGTATTGCTGAGGCATCGGGTGAGGCTGACGAGATTGAGGAACCGAGAGTACCGTATAATCGCTTCATCAATAACCTTGCTCGGGCA
AAGTATGTTGAGATGCTGAGGCGGGACTTTCTGTTTGAACGAGGATTTGGCGATGATCTGCCACGGTTCTTGAGGACTGGAATAGTGAACCTCGGCTGGAGTCAATTTTG
TGCAAAGCCGGAACCTGTTAATTCAAACATTGTTCGGGAATTTTATGCAAATATTGACGATCACAAAGAATTTCAGATTGTCAAAGACGGAAAAGCGCACATTTCAGGCT
GCTTATTTGAAGAGCGAGGCCAATACATGAATGGGTTTCATCAAGCTGCGCTTACTGTCGACAACTCACGACTCAACGTGTCTCGAGACCGGGTTCTGCTTGCCTTTGCT
ATTCTTCGTTCGATGAGTATTGATGTGGGTAAAATAATTTCTTCTGAGATTCTTGATTGCTGGCGGAAAAAGGTGGGGAAGCTGTTTTTCCCCAACACTATCACGATGCT
ATGCCGAAGGGCAGGGGTGCCAGAGGATGAGGATGATGTGCCACTAATGGACAAGGGAATAATTGACACACCAAATCTGGCTAGGCTTCAGAGGACGCAGGAAGCACGCC
AAGGAGGTTTGGTGTGCAGCATCCACCAAATGCAGGAGCAATTACAGCTGCATTCCAGTAGGATGGAGTTTGCTGAAAGGCAATTTCAAACTTTCTGGGACTATGTGAAA
AGGAGGGATGTCGCGTTAAGGGTAGCTTTGCAGTCGAATTTTTCCAAGCCATATCCGGCTTTACCCGTATTCCCTGATGACCTACTGAACCCCTGGATTCTGCCGCCGCC
TGTTGAAAAAGAGGGAGATGAAGAAGAAGAGGCTGGTCAGGAAGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAAAGACGAGAGAAAACAAAGAAAGAGGCAATGAGGAGGAAGCAGTACCAGTTAGTCCTGAGGTACCGAAAGCGAAAACAAAGAAGAAGAGAACACCTGAAGAGAG
GGAAGCCAAGAGGCGAAAGAGACAACAGAGGATTGAAGATCAAGAAACTATCCAAGAAGAACCAGTGAATGCCCCAGATACAGAAGGAGTTCAAAATCCTGAGATAGAAC
CGATAGTCCCAGATACGGTTCAAGACGAGAATATTGAGACGAATCAAGAAACACGGGCTGACGAAGTTCGAGACGAACAGGGCGCAGTTGTGCCTGAAAAAGGGAATGAA
AAGGAACCGGTGCAGGAGGCTCGGGTAGAGGTAGTCATGCCGGAAGTGCCAAAGCGTCGCCGCATTAAGCGTAAGGCGGGCCGCGTCAGGGTGATTCGAACTACCCCATC
ACCTCCGTCGTCAGATTCAGAGGAAGAGAGAAGAGAACAGGAAAAGAAGGAGGCGGAAGACAAAGCGAGGGAGGAAGAAGAAAAGAAGGCTGAGGAAGAAATTTTGGCCA
AGCAGAGGGAAGACAAGGGCAAAGGTATTGCTGAGGCATCGGGTGAGGCTGACGAGATTGAGGAACCGAGAGTACCGTATAATCGCTTCATCAATAACCTTGCTCGGGCA
AAGTATGTTGAGATGCTGAGGCGGGACTTTCTGTTTGAACGAGGATTTGGCGATGATCTGCCACGGTTCTTGAGGACTGGAATAGTGAACCTCGGCTGGAGTCAATTTTG
TGCAAAGCCGGAACCTGTTAATTCAAACATTGTTCGGGAATTTTATGCAAATATTGACGATCACAAAGAATTTCAGATTGTCAAAGACGGAAAAGCGCACATTTCAGGCT
GCTTATTTGAAGAGCGAGGCCAATACATGAATGGGTTTCATCAAGCTGCGCTTACTGTCGACAACTCACGACTCAACGTGTCTCGAGACCGGGTTCTGCTTGCCTTTGCT
ATTCTTCGTTCGATGAGTATTGATGTGGGTAAAATAATTTCTTCTGAGATTCTTGATTGCTGGCGGAAAAAGGTGGGGAAGCTGTTTTTCCCCAACACTATCACGATGCT
ATGCCGAAGGGCAGGGGTGCCAGAGGATGAGGATGATGTGCCACTAATGGACAAGGGAATAATTGACACACCAAATCTGGCTAGGCTTCAGAGGACGCAGGAAGCACGCC
AAGGAGGTTTGGTGTGCAGCATCCACCAAATGCAGGAGCAATTACAGCTGCATTCCAGTAGGATGGAGTTTGCTGAAAGGCAATTTCAAACTTTCTGGGACTATGTGAAA
AGGAGGGATGTCGCGTTAAGGGTAGCTTTGCAGTCGAATTTTTCCAAGCCATATCCGGCTTTACCCGTATTCCCTGATGACCTACTGAACCCCTGGATTCTGCCGCCGCC
TGTTGAAAAAGAGGGAGATGAAGAAGAAGAGGCTGGTCAGGAAGACTGA
Protein sequenceShow/hide protein sequence
MAKTRENKERGNEEEAVPVSPEVPKAKTKKKRTPEEREAKRRKRQQRIEDQETIQEEPVNAPDTEGVQNPEIEPIVPDTVQDENIETNQETRADEVRDEQGAVVPEKGNE
KEPVQEARVEVVMPEVPKRRRIKRKAGRVRVIRTTPSPPSSDSEEERREQEKKEAEDKAREEEEKKAEEEILAKQREDKGKGIAEASGEADEIEEPRVPYNRFINNLARA
KYVEMLRRDFLFERGFGDDLPRFLRTGIVNLGWSQFCAKPEPVNSNIVREFYANIDDHKEFQIVKDGKAHISGCLFEERGQYMNGFHQAALTVDNSRLNVSRDRVLLAFA
ILRSMSIDVGKIISSEILDCWRKKVGKLFFPNTITMLCRRAGVPEDEDDVPLMDKGIIDTPNLARLQRTQEARQGGLVCSIHQMQEQLQLHSSRMEFAERQFQTFWDYVK
RRDVALRVALQSNFSKPYPALPVFPDDLLNPWILPPPVEKEGDEEEEAGQED