; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg028724 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg028724
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold7:18152740..18155804
RNA-Seq ExpressionSpg028724
SyntenySpg028724
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB53755.1 hypothetical protein L484_022412 [Morus notabilis]7.1e-1736.55Show/hide
Query:  YNEMAVAPSNEQLSDAMREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTRDSTVLREWVLLAFAILRSLSIDVGMIIVNEISGC-WKK
        Y + A   ++EQL   + EV IEGA WQ+S  G  T     LKR A  W  F+  R +P+T   TV ++ VLL ++IL  +S+++  I + EI  C   +
Subjt:  YNEMAVAPSNEQLSDAMREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTRDSTVLREWVLLAFAILRSLSIDVGMIIVNEISGC-WKK

Query:  KVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNLARLQR
        K G L+FP+ IT L  +A VP ++ + I+ + G I T +++R+ +
Subjt:  KVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNLARLQR

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]2.3e-2327.69Show/hide
Query:  FLRTGIADHGWDLFCAKP----------------------------------ESVNA---QNFPHATYNEMAVAPSNEQLSDAMREVGIEGAQWQLSKTG
        F+   I  H W  FCA P                                  E++NA      P   ++E     + + L   +  V   GA+W +S  G
Subjt:  FLRTGIADHGWDLFCAKP----------------------------------ESVNA---QNFPHATYNEMAVAPSNEQLSDAMREVGIEGAQWQLSKTG

Query:  KRTFQSAYLKREANTWMGFIRQRMLPTTRDSTVLREWVLLAFAILRSLSIDVGMIIVNEISGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGI
          T   + L   A  W  F++ R+LPTT   TV ++ +LL  ++L   SI+VG +I +EI  C  +K G LFFP+ IT LC+ A  P    +  L + G 
Subjt:  KRTFQSAYLKREANTWMGFIRQRMLPTTRDSTVLREWVLLAFAILRSLSIDVGMIIVNEISGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGI

Query:  IDTPNLARLQR---TQEARQ---------------GGLVYVINTILEQLALSASRQ-------EFAERQALTFWNYVKSRDANLKKAVQENFSKLYPALP
        ID   +AR+ +   T+  +Q               G ++  +  + ++L+    +Q       +   +Q   FW Y K RD  LKKA+Q NF++  P  P
Subjt:  IDTPNLARLQR---TQEARQ---------------GGLVYVINTILEQLALSASRQ-------EFAERQALTFWNYVKSRDANLKKAVQENFSKLYPALP

Query:  AFPEDLLNPWISPPPVEREEEDDEE
        AFP+++L         E E E D++
Subjt:  AFPEDLLNPWISPPPVEREEEDDEE

PON59596.1 hypothetical protein PanWU01x14_158080 [Parasponia andersonii]1.3e-1832.09Show/hide
Query:  ANTWMGFIRQRMLPTTRDSTVLREWVLLAFAILRSLSIDVGMIIVNEISGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNLARLQRT
        A  W  F++ R+LPTT   TV ++ +LL +++L   SI+VG +I +EI  C  +K G LFFP+ IT LC+ A  P    +  L   G ID   +AR+  T
Subjt:  ANTWMGFIRQRMLPTTRDSTVLREWVLLAFAILRSLSIDVGMIIVNEISGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNLARLQRT

Query:  QEAR--------------------QGGLVYVINTILEQLALSASRQ-------EFAERQALTFWNYVKSRDANLKKAVQENFSKLYPALPAFPEDLLNPW
        QE +                     G ++  +  + ++L+    +Q       +   +Q   FW Y K RD  LKKA+Q NF++  P  P FP++LL   
Subjt:  QEAR--------------------QGGLVYVINTILEQLALSASRQ-------EFAERQALTFWNYVKSRDANLKKAVQENFSKLYPALPAFPEDLLNPW

Query:  ISPPPVEREEEDDEE
              E E E D++
Subjt:  ISPPPVEREEEDDEE

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]3.9e-2333.99Show/hide
Query:  HATYNEMAVAPSNEQLSDAMREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTRDSTVLREWVLLAFAILRSLSIDVGMIIVNEISGCW
        H+ + E    P   +L   +  V   GA+W +S  G  T   + L   A  W  F++ R+LPTT    V ++ +LL  ++L   SI+VG +I +EI  C 
Subjt:  HATYNEMAVAPSNEQLSDAMREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTRDSTVLREWVLLAFAILRSLSIDVGMIIVNEISGCW

Query:  KKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNLARL------QRTQE-----------ARQGGLVYVINTILEQLALSASRQEFAERQALT
         +K G LFFP+ IT LC+ A    NE    L + G ID   +AR+      + TQ+           +R  G V      LEQ     S+QE   +Q   
Subjt:  KKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNLARL------QRTQE-----------ARQGGLVYVINTILEQLALSASRQEFAERQALT

Query:  FWNYVKSRDANLKKAVQENFSKLYPALPAFPEDLLNPWISPPPVEREEEDDEE
        FW Y K RD  LKKA+Q NF++  P  PAFP+++L         E E E D++
Subjt:  FWNYVKSRDANLKKAVQENFSKLYPALPAFPEDLLNPWISPPPVEREEEDDEE

XP_024971944.1 uncharacterized protein LOC112510826 [Cynara cardunculus var. scolymus]8.4e-1830.18Show/hide
Query:  YNEMAVAPSNEQLSDAMREVGIEGAQWQL-SKTGKRTFQSAYLKREANTWMGFIRQRMLPTTRDSTVLREWVLLAFAILRSLSIDVGMIIVNEISGCWKK
        +  ++ + S  +L +  RE+G  G +W   S    RT++++ LK  AN W+ FIR  + PTT DS++  E ++L + ++   +I+VG ++   I  C K+
Subjt:  YNEMAVAPSNEQLSDAMREVGIEGAQWQL-SKTGKRTFQSAYLKREANTWMGFIRQRMLPTTRDSTVLREWVLLAFAILRSLSIDVGMIIVNEISGCWKK

Query:  KVGKLFFPNTITMLCKRAGVPENEGDVIL---FDKGIIDTPNLARL-QRTQEARQ-GGLVYVINTILEQLALSASRQEFAERQALTFWNYVKSRDANLKK
        + GKLFFP+ I  L  +AGVPE   D+++    +K  ID   +++L +R++  R+  G+   +  +LEQ   S S  +F   Q       +K+  A+L  
Subjt:  KVGKLFFPNTITMLCKRAGVPENEGDVIL---FDKGIIDTPNLARL-QRTQEARQ-GGLVYVINTILEQLALSASRQEFAERQALTFWNYVKSRDANLKK

Query:  AVQENFSKLYPALPAFPEDLLN
         ++ +  K+        EDL N
Subjt:  AVQENFSKLYPALPAFPEDLLN

TrEMBL top hitse value%identityAlignment
A0A2P5BCG4 Uncharacterized protein (Fragment)1.1e-2327.69Show/hide
Query:  FLRTGIADHGWDLFCAKP----------------------------------ESVNA---QNFPHATYNEMAVAPSNEQLSDAMREVGIEGAQWQLSKTG
        F+   I  H W  FCA P                                  E++NA      P   ++E     + + L   +  V   GA+W +S  G
Subjt:  FLRTGIADHGWDLFCAKP----------------------------------ESVNA---QNFPHATYNEMAVAPSNEQLSDAMREVGIEGAQWQLSKTG

Query:  KRTFQSAYLKREANTWMGFIRQRMLPTTRDSTVLREWVLLAFAILRSLSIDVGMIIVNEISGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGI
          T   + L   A  W  F++ R+LPTT   TV ++ +LL  ++L   SI+VG +I +EI  C  +K G LFFP+ IT LC+ A  P    +  L + G 
Subjt:  KRTFQSAYLKREANTWMGFIRQRMLPTTRDSTVLREWVLLAFAILRSLSIDVGMIIVNEISGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGI

Query:  IDTPNLARLQR---TQEARQ---------------GGLVYVINTILEQLALSASRQ-------EFAERQALTFWNYVKSRDANLKKAVQENFSKLYPALP
        ID   +AR+ +   T+  +Q               G ++  +  + ++L+    +Q       +   +Q   FW Y K RD  LKKA+Q NF++  P  P
Subjt:  IDTPNLARLQR---TQEARQ---------------GGLVYVINTILEQLALSASRQ-------EFAERQALTFWNYVKSRDANLKKAVQENFSKLYPALP

Query:  AFPEDLLNPWISPPPVEREEEDDEE
        AFP+++L         E E E D++
Subjt:  AFPEDLLNPWISPPPVEREEEDDEE

A0A2P5CEY2 Uncharacterized protein6.3e-1932.09Show/hide
Query:  ANTWMGFIRQRMLPTTRDSTVLREWVLLAFAILRSLSIDVGMIIVNEISGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNLARLQRT
        A  W  F++ R+LPTT   TV ++ +LL +++L   SI+VG +I +EI  C  +K G LFFP+ IT LC+ A  P    +  L   G ID   +AR+  T
Subjt:  ANTWMGFIRQRMLPTTRDSTVLREWVLLAFAILRSLSIDVGMIIVNEISGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNLARLQRT

Query:  QEAR--------------------QGGLVYVINTILEQLALSASRQ-------EFAERQALTFWNYVKSRDANLKKAVQENFSKLYPALPAFPEDLLNPW
        QE +                     G ++  +  + ++L+    +Q       +   +Q   FW Y K RD  LKKA+Q NF++  P  P FP++LL   
Subjt:  QEAR--------------------QGGLVYVINTILEQLALSASRQ-------EFAERQALTFWNYVKSRDANLKKAVQENFSKLYPALPAFPEDLLNPW

Query:  ISPPPVEREEEDDEE
              E E E D++
Subjt:  ISPPPVEREEEDDEE

A0A2P5DXM3 Uncharacterized protein1.9e-2333.99Show/hide
Query:  HATYNEMAVAPSNEQLSDAMREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTRDSTVLREWVLLAFAILRSLSIDVGMIIVNEISGCW
        H+ + E    P   +L   +  V   GA+W +S  G  T   + L   A  W  F++ R+LPTT    V ++ +LL  ++L   SI+VG +I +EI  C 
Subjt:  HATYNEMAVAPSNEQLSDAMREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTRDSTVLREWVLLAFAILRSLSIDVGMIIVNEISGCW

Query:  KKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNLARL------QRTQE-----------ARQGGLVYVINTILEQLALSASRQEFAERQALT
         +K G LFFP+ IT LC+ A    NE    L + G ID   +AR+      + TQ+           +R  G V      LEQ     S+QE   +Q   
Subjt:  KKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNLARL------QRTQE-----------ARQGGLVYVINTILEQLALSASRQEFAERQALT

Query:  FWNYVKSRDANLKKAVQENFSKLYPALPAFPEDLLNPWISPPPVEREEEDDEE
        FW Y K RD  LKKA+Q NF++  P  PAFP+++L         E E E D++
Subjt:  FWNYVKSRDANLKKAVQENFSKLYPALPAFPEDLLNPWISPPPVEREEEDDEE

A0A5D2B8V0 Uncharacterized protein1.6e-1432.13Show/hide
Query:  KRRAEKGKSVAEASEEHDEIEEHGRFVNNFARAKHAELL-KRDFCLK-EDLAVIFHFLRTGIADHGWDLFC---AKPESVNAQNFPHATYNEMAV-----
        ++R     +  E S   DE E   RF + F   KH  ++ ++ F LK  DL V+   +R  I    W+ FC   + P+    + F  +   + A      
Subjt:  KRRAEKGKSVAEASEEHDEIEEHGRFVNNFARAKHAELL-KRDFCLK-EDLAVIFHFLRTGIADHGWDLFC---AKPESVNAQNFPHATYNEMAV-----

Query:  APSN---EQLSDAMREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTRDSTVLREWVLLAFAILRSLSIDVGMIIVNEISGCWKKKVGK
          SN   + L   +  V   G+QW +   G  + Q  YLK  A  W  F+R   +P +  ST+  EW+LL +AIL   SI+VG II+ EI  C KKK   
Subjt:  APSN---EQLSDAMREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTRDSTVLREWVLLAFAILRSLSIDVGMIIVNEISGCWKKKVGK

Query:  LFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNLARL-QRTQEARQG
         +FP+ IT LC +A V         + +G I   +L RL +R  E  QG
Subjt:  LFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNLARL-QRTQEARQG

W9QTD9 Uncharacterized protein3.4e-1736.55Show/hide
Query:  YNEMAVAPSNEQLSDAMREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTRDSTVLREWVLLAFAILRSLSIDVGMIIVNEISGC-WKK
        Y + A   ++EQL   + EV IEGA WQ+S  G  T     LKR A  W  F+  R +P+T   TV ++ VLL ++IL  +S+++  I + EI  C   +
Subjt:  YNEMAVAPSNEQLSDAMREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTRDSTVLREWVLLAFAILRSLSIDVGMIIVNEISGC-WKK

Query:  KVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNLARLQR
        K G L+FP+ IT L  +A VP ++ + I+ + G I T +++R+ +
Subjt:  KVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNLARLQR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAAAACGAGAGCGAGAAAAGAAAGGGAGAATGAGGAGGAAGAGGTACCTGTTACCCCTGAAGTACAGAAAGTTAAGGCGAAGAAAAAGAAGACCCCGGAGGAGAA
AGAAGCCGAGAGAAGAAAACGACAACAGAGGGCTGAGGAACAAGAAAAGGCAACAGAGGTTGCCACTGTTGCTGCCACAGTAGAAGAAGGAGACCCGCAAGAATCTGATG
TACAGGATACAGAGGAAGTTGAGCCGAGAATGGCGGATACAGAGGGAGTTCAAGAAGAGGGACATATCGAGGAAATTCAAGAGAAACAGAATGAGGATGTACAGGCAGAG
GTTGCGACTGAAAGAGAGCCAGTTCAGGAGGCTCGTGTTGAGGTCATCATGCCGGAACCTCCGAAACGTCGCCGCATTAAGTGGAAGGCTGGGCGCGTTCAGGTGATTCG
GACTGATACCCCATCACCGCCGTCGTCGGATTCTGAGAAAGAGAAGGCGGAGCGAGAGGAACAAGAGAAAAAAGAAGCTGAGGAGAAGGCAAGAGAAGAGGTAGAGAAAA
AGGCTGAGGAAGAGCGGTTGCTCAAACGAAGGGCGGAAAAGGGCAAAAGTGTTGCTGAAGCATCAGAAGAACACGATGAAATAGAAGAACATGGTCGCTTCGTCAACAAT
TTTGCCAGAGCAAAACACGCTGAGCTGCTGAAAAGAGACTTCTGTTTGAAAGAGGATTTAGCGGTGATCTTCCATTTTCTGAGGACCGGCATTGCAGACCATGGCTGGGA
CTTGTTTTGTGCGAAGCCTGAGTCTGTAAACGCACAGAATTTCCCCCATGCAACATATAATGAGATGGCTGTAGCGCCATCTAATGAGCAGTTAAGTGATGCTATGCGGG
AGGTGGGTATTGAAGGGGCACAGTGGCAGCTGTCCAAGACAGGGAAAAGGACGTTTCAGTCAGCTTATCTGAAGAGGGAAGCAAACACGTGGATGGGATTTATCAGGCAG
AGGATGCTTCCAACGACTCGTGATTCGACGGTCTTGAGGGAATGGGTTCTTCTGGCTTTCGCGATTTTGCGGTCTCTCAGCATTGATGTAGGGATGATAATTGTGAATGA
GATTTCTGGTTGTTGGAAGAAGAAGGTGGGGAAGCTGTTTTTCCCGAATACCATTACCATGCTTTGCAAGCGAGCAGGGGTTCCAGAGAATGAAGGAGATGTGATATTAT
TTGACAAGGGGATCATTGACACGCCCAACTTGGCGCGGCTTCAGCGTACGCAAGAGGCACGTCAGGGTGGGCTGGTCTACGTCATCAACACGATTTTAGAACAACTCGCA
CTTTCGGCCAGCAGGCAGGAGTTTGCCGAGAGGCAAGCCTTAACTTTCTGGAACTATGTTAAGAGTCGTGATGCCAATCTGAAGAAGGCGGTGCAGGAGAATTTTTCCAA
GTTGTATCCAGCCCTTCCAGCATTCCCTGAAGATTTATTGAACCCCTGGATCTCACCCCCGCCTGTTGAAAGAGAAGAGGAGGATGATGAAGAGCAGGAAACCTTTTGCT
TGAGCATTTTCTCTGGCCTGGTCGTTGCTGCGGCAAAGAAAATTCTGGAGGTAGTGTTGACTTATGTGATCCGCTTTAAGCTTAGGTCTAGTCCCACGCTTACTTATGAC
TGTAGGGCTGCTTTAAGTCTGAAAAACAAAAATATAAACCCCTTGAAAATGTGTTTTGATATGTCTGATAATAGAGCTAAGCTGTGGCAAGTTCTTAGAATTGAGTTAAA
AGTGGTGATTATTTGTCCATGCCGGAAGAATTATTTTGCTGCAGCAGAGCTTGGTTTTGCAGAGTGCTCAGAATCTGTTGCTGGGCGACTTGAGGGAGCAAATTTTGTGC
TGCAGCAAAGTTGGGAGCAAAACTGCCACGTCACAGCTCGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGAAAACGAGAGCGAGAAAAGAAAGGGAGAATGAGGAGGAAGAGGTACCTGTTACCCCTGAAGTACAGAAAGTTAAGGCGAAGAAAAAGAAGACCCCGGAGGAGAA
AGAAGCCGAGAGAAGAAAACGACAACAGAGGGCTGAGGAACAAGAAAAGGCAACAGAGGTTGCCACTGTTGCTGCCACAGTAGAAGAAGGAGACCCGCAAGAATCTGATG
TACAGGATACAGAGGAAGTTGAGCCGAGAATGGCGGATACAGAGGGAGTTCAAGAAGAGGGACATATCGAGGAAATTCAAGAGAAACAGAATGAGGATGTACAGGCAGAG
GTTGCGACTGAAAGAGAGCCAGTTCAGGAGGCTCGTGTTGAGGTCATCATGCCGGAACCTCCGAAACGTCGCCGCATTAAGTGGAAGGCTGGGCGCGTTCAGGTGATTCG
GACTGATACCCCATCACCGCCGTCGTCGGATTCTGAGAAAGAGAAGGCGGAGCGAGAGGAACAAGAGAAAAAAGAAGCTGAGGAGAAGGCAAGAGAAGAGGTAGAGAAAA
AGGCTGAGGAAGAGCGGTTGCTCAAACGAAGGGCGGAAAAGGGCAAAAGTGTTGCTGAAGCATCAGAAGAACACGATGAAATAGAAGAACATGGTCGCTTCGTCAACAAT
TTTGCCAGAGCAAAACACGCTGAGCTGCTGAAAAGAGACTTCTGTTTGAAAGAGGATTTAGCGGTGATCTTCCATTTTCTGAGGACCGGCATTGCAGACCATGGCTGGGA
CTTGTTTTGTGCGAAGCCTGAGTCTGTAAACGCACAGAATTTCCCCCATGCAACATATAATGAGATGGCTGTAGCGCCATCTAATGAGCAGTTAAGTGATGCTATGCGGG
AGGTGGGTATTGAAGGGGCACAGTGGCAGCTGTCCAAGACAGGGAAAAGGACGTTTCAGTCAGCTTATCTGAAGAGGGAAGCAAACACGTGGATGGGATTTATCAGGCAG
AGGATGCTTCCAACGACTCGTGATTCGACGGTCTTGAGGGAATGGGTTCTTCTGGCTTTCGCGATTTTGCGGTCTCTCAGCATTGATGTAGGGATGATAATTGTGAATGA
GATTTCTGGTTGTTGGAAGAAGAAGGTGGGGAAGCTGTTTTTCCCGAATACCATTACCATGCTTTGCAAGCGAGCAGGGGTTCCAGAGAATGAAGGAGATGTGATATTAT
TTGACAAGGGGATCATTGACACGCCCAACTTGGCGCGGCTTCAGCGTACGCAAGAGGCACGTCAGGGTGGGCTGGTCTACGTCATCAACACGATTTTAGAACAACTCGCA
CTTTCGGCCAGCAGGCAGGAGTTTGCCGAGAGGCAAGCCTTAACTTTCTGGAACTATGTTAAGAGTCGTGATGCCAATCTGAAGAAGGCGGTGCAGGAGAATTTTTCCAA
GTTGTATCCAGCCCTTCCAGCATTCCCTGAAGATTTATTGAACCCCTGGATCTCACCCCCGCCTGTTGAAAGAGAAGAGGAGGATGATGAAGAGCAGGAAACCTTTTGCT
TGAGCATTTTCTCTGGCCTGGTCGTTGCTGCGGCAAAGAAAATTCTGGAGGTAGTGTTGACTTATGTGATCCGCTTTAAGCTTAGGTCTAGTCCCACGCTTACTTATGAC
TGTAGGGCTGCTTTAAGTCTGAAAAACAAAAATATAAACCCCTTGAAAATGTGTTTTGATATGTCTGATAATAGAGCTAAGCTGTGGCAAGTTCTTAGAATTGAGTTAAA
AGTGGTGATTATTTGTCCATGCCGGAAGAATTATTTTGCTGCAGCAGAGCTTGGTTTTGCAGAGTGCTCAGAATCTGTTGCTGGGCGACTTGAGGGAGCAAATTTTGTGC
TGCAGCAAAGTTGGGAGCAAAACTGCCACGTCACAGCTCGTTAG
Protein sequenceShow/hide protein sequence
MAKTRARKERENEEEEVPVTPEVQKVKAKKKKTPEEKEAERRKRQQRAEEQEKATEVATVAATVEEGDPQESDVQDTEEVEPRMADTEGVQEEGHIEEIQEKQNEDVQAE
VATEREPVQEARVEVIMPEPPKRRRIKWKAGRVQVIRTDTPSPPSSDSEKEKAEREEQEKKEAEEKAREEVEKKAEEERLLKRRAEKGKSVAEASEEHDEIEEHGRFVNN
FARAKHAELLKRDFCLKEDLAVIFHFLRTGIADHGWDLFCAKPESVNAQNFPHATYNEMAVAPSNEQLSDAMREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQ
RMLPTTRDSTVLREWVLLAFAILRSLSIDVGMIIVNEISGCWKKKVGKLFFPNTITMLCKRAGVPENEGDVILFDKGIIDTPNLARLQRTQEARQGGLVYVINTILEQLA
LSASRQEFAERQALTFWNYVKSRDANLKKAVQENFSKLYPALPAFPEDLLNPWISPPPVEREEEDDEEQETFCLSIFSGLVVAAAKKILEVVLTYVIRFKLRSSPTLTYD
CRAALSLKNKNINPLKMCFDMSDNRAKLWQVLRIELKVVIICPCRKNYFAAAELGFAECSESVAGRLEGANFVLQQSWEQNCHVTAR