; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg034689 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg034689
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold7:44469130..44475783
RNA-Seq ExpressionSpg034689
SyntenySpg034689
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB93492.1 hypothetical protein L484_006967 [Morus notabilis]1.4e-1928.51Show/hide
Query:  KQREDKGKGI--------AEASPEPEDSETEEPRLPYHRFLNNLARLKYAKLLKRDFLFERGFNGDLPHFLRSGITNHDWELFCSKHESVKSPGAINALY
        K +  KGKG+        +E+    E++E ++   P      +    K A  ++R F++ RG     P F+ S I  H W  FC    +     AIN+LY
Subjt:  KQREDKGKGI--------AEASPEPEDSETEEPRLPYHRFLNNLARLKYAKLLKRDFLFERGFNGDLPHFLRSGITNHDWELFCSKHESVKSPGAINALY

Query:  NLQDFPHTGYNEMVVAPSNEQLSDVVREVGVEGAQWRLSKTEKKTFQSAYLKKEANTRMGFIKQRLLPTTHDSTVSRERVVLAFDILRSLSIDVGKIISS
        +L D     +N    + + +QL +V+ E+ VEG +W  +     TF    L+        F++ RL+P++H   V +ER +L + +++   ++VG++I  
Subjt:  NLQDFPHTGYNEMVVAPSNEQLSDVVREVGVEGAQWRLSKTEKKTFQSAYLKKEANTRMGFIKQRLLPTTHDSTVSRERVVLAFDILRSLSIDVGKIISS

Query:  EISGCWRKKVGKLFFPNTITMLCNKAGVPENEGDV
        ++  C  +K G L+FP+ IT LC   GV   E ++
Subjt:  EISGCWRKKVGKLFFPNTITMLCNKAGVPENEGDV

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]4.1e-1931.89Show/hide
Query:  RFLNNLARLKYA-KLLKRDFLFERGF-------NGDLPHFLRSGITNHDWELFCSKHESVKSP----------------------------GAINALYNL
        +F    A  +Y   +  R    E+GF        G LP F+   IT H+W+ FC+  E    P                             AINA++ L
Subjt:  RFLNNLARLKYA-KLLKRDFLFERGF-------NGDLPHFLRSGITNHDWELFCSKHESVKSP----------------------------GAINALYNL

Query:  QDFPHTGYNEMVVAPSNEQLSDVVREVGVEGAQWRLSKTEKKTFQSAYLKKEANTRMGFIKQRLLPTTHDSTVSRERVVLAFDILRSLSIDVGKIISSEI
         D P   ++E +   +   L  V+  V V GA+W +S     T   + L   A     F+K  LLPTTH  TVS++R++L   +L   SI+VG++I SEI
Subjt:  QDFPHTGYNEMVVAPSNEQLSDVVREVGVEGAQWRLSKTEKKTFQSAYLKKEANTRMGFIKQRLLPTTHDSTVSRERVVLAFDILRSLSIDVGKIISSEI

Query:  SGCWRKKVGKLFFPNTITMLCNKAGVPENEGDVTLFHKGIIVTPNLARLQRTQE
          C  +K G LFFP+ IT LC  A  P    +  L + G I    +AR+  TQE
Subjt:  SGCWRKKVGKLFFPNTITMLCNKAGVPENEGDVTLFHKGIIVTPNLARLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]3.5e-2330.51Show/hide
Query:  GDLPHFLRSGITNHDWELFCSKHESVKSP----------------------------GAINALYNLQDFPHTGYNEMVVAPSNEQLSDVVREVGVEGAQW
        G LP F+   IT H+W+ FC+  E    P                             AINA++ L D P   ++E +   + + L  V+  V   GA+W
Subjt:  GDLPHFLRSGITNHDWELFCSKHESVKSP----------------------------GAINALYNLQDFPHTGYNEMVVAPSNEQLSDVVREVGVEGAQW

Query:  RLSKTEKKTFQSAYLKKEANTRMGFIKQRLLPTTHDSTVSRERVVLAFDILRSLSIDVGKIISSEISGCWRKKVGKLFFPNTITMLCNKAGVPENEGDVT
         +S     T   + L   A     F+K RLLPTTH  TVS++R++L   +L   SI+VG++I SEI  C  +K G LFFP+ IT LC  A  P    +  
Subjt:  RLSKTEKKTFQSAYLKKEANTRMGFIKQRLLPTTHDSTVSRERVVLAFDILRSLSIDVGKIISSEISGCWRKKVGKLFFPNTITMLCNKAGVPENEGDVT

Query:  LFHKGIIVTPNLARLQR---TQEARQ---------------GGLAYGIHKIVEQLALSTNRQ-------EFAERQSQTFWNYVKRRDANLKKALQ
        L + G I    +AR+ +   T+  +Q               G +   +  + ++L+    +Q       +   +Q Q FW Y K RD  LKKALQ
Subjt:  LFHKGIIVTPNLARLQR---TQEARQ---------------GGLAYGIHKIVEQLALSTNRQ-------EFAERQSQTFWNYVKRRDANLKKALQ

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]3.4e-1829.95Show/hide
Query:  LKRDFLFERGFNGDLPHFLRSGITNHDWELFCSKHESVKSP----------------------------GAINALYNLQDFPHTGYNEMVVAPSNEQLSD
        ++++F+++     + P F+   I  H+W+LFC+  E    P                             AIN +++L D P   ++E V   +  +L  
Subjt:  LKRDFLFERGFNGDLPHFLRSGITNHDWELFCSKHESVKSP----------------------------GAINALYNLQDFPHTGYNEMVVAPSNEQLSD

Query:  VVREVGVEGAQWRLSKTEKKTFQSAYLKKEANTRMGFIKQRLLPTTHDSTVSRERVVLAFDILRSLSIDVGKIISSEISGCWRKKVGKLFFPNTITMLCN
        V+  V + GA+W +S     T   + L   A     F+K RLLPTTH  TVS+E V L + +L   SI+VG++I  EI  C  +K G LFFP+ IT +C 
Subjt:  VVREVGVEGAQWRLSKTEKKTFQSAYLKKEANTRMGFIKQRLLPTTHDSTVSRERVVLAFDILRSLSIDVGKIISSEISGCWRKKVGKLFFPNTITMLCN

Query:  KAGVPENEGDVTLFHKG
            P    +  L + G
Subjt:  KAGVPENEGDVTLFHKG

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]1.5e-2134.78Show/hide
Query:  SPGAINALYNLQDFPHTGYNEMVVAPSNEQLSDVVREVGVEGAQWRLSKTEKKTFQSAYLKKEANTRMGFIKQRLLPTTHDSTVSRERVVLAFDILRSLS
        S  AINA++ L D P   ++E +   +  +L  V+  V   GA+W +S     T   + L   A     F+K RLLPTTH   VS++R++L   +L   S
Subjt:  SPGAINALYNLQDFPHTGYNEMVVAPSNEQLSDVVREVGVEGAQWRLSKTEKKTFQSAYLKKEANTRMGFIKQRLLPTTHDSTVSRERVVLAFDILRSLS

Query:  IDVGKIISSEISGCWRKKVGKLFFPNTITMLCNKAGVPENE------GDVTLFHKGIIV---------TPNLARLQRTQEARQGGLAYGIHKIVEQLALS
        I+VG++I SEI  C  +K G LFFP+ IT LC  A    NE      G++       I           P+ +R      +R  G      K +EQ    
Subjt:  IDVGKIISSEISGCWRKKVGKLFFPNTITMLCNKAGVPENE------GDVTLFHKGIIV---------TPNLARLQRTQEARQGGLAYGIHKIVEQLALS

Query:  TNRQEFAERQSQTFWNYVKRRDANLKKALQ
         ++QE   +Q Q FW Y K RD  LKKALQ
Subjt:  TNRQEFAERQSQTFWNYVKRRDANLKKALQ

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)2.0e-1931.89Show/hide
Query:  RFLNNLARLKYA-KLLKRDFLFERGF-------NGDLPHFLRSGITNHDWELFCSKHESVKSP----------------------------GAINALYNL
        +F    A  +Y   +  R    E+GF        G LP F+   IT H+W+ FC+  E    P                             AINA++ L
Subjt:  RFLNNLARLKYA-KLLKRDFLFERGF-------NGDLPHFLRSGITNHDWELFCSKHESVKSP----------------------------GAINALYNL

Query:  QDFPHTGYNEMVVAPSNEQLSDVVREVGVEGAQWRLSKTEKKTFQSAYLKKEANTRMGFIKQRLLPTTHDSTVSRERVVLAFDILRSLSIDVGKIISSEI
         D P   ++E +   +   L  V+  V V GA+W +S     T   + L   A     F+K  LLPTTH  TVS++R++L   +L   SI+VG++I SEI
Subjt:  QDFPHTGYNEMVVAPSNEQLSDVVREVGVEGAQWRLSKTEKKTFQSAYLKKEANTRMGFIKQRLLPTTHDSTVSRERVVLAFDILRSLSIDVGKIISSEI

Query:  SGCWRKKVGKLFFPNTITMLCNKAGVPENEGDVTLFHKGIIVTPNLARLQRTQE
          C  +K G LFFP+ IT LC  A  P    +  L + G I    +AR+  TQE
Subjt:  SGCWRKKVGKLFFPNTITMLCNKAGVPENEGDVTLFHKGIIVTPNLARLQRTQE

A0A2P5BCG4 Uncharacterized protein (Fragment)1.7e-2330.51Show/hide
Query:  GDLPHFLRSGITNHDWELFCSKHESVKSP----------------------------GAINALYNLQDFPHTGYNEMVVAPSNEQLSDVVREVGVEGAQW
        G LP F+   IT H+W+ FC+  E    P                             AINA++ L D P   ++E +   + + L  V+  V   GA+W
Subjt:  GDLPHFLRSGITNHDWELFCSKHESVKSP----------------------------GAINALYNLQDFPHTGYNEMVVAPSNEQLSDVVREVGVEGAQW

Query:  RLSKTEKKTFQSAYLKKEANTRMGFIKQRLLPTTHDSTVSRERVVLAFDILRSLSIDVGKIISSEISGCWRKKVGKLFFPNTITMLCNKAGVPENEGDVT
         +S     T   + L   A     F+K RLLPTTH  TVS++R++L   +L   SI+VG++I SEI  C  +K G LFFP+ IT LC  A  P    +  
Subjt:  RLSKTEKKTFQSAYLKKEANTRMGFIKQRLLPTTHDSTVSRERVVLAFDILRSLSIDVGKIISSEISGCWRKKVGKLFFPNTITMLCNKAGVPENEGDVT

Query:  LFHKGIIVTPNLARLQR---TQEARQ---------------GGLAYGIHKIVEQLALSTNRQ-------EFAERQSQTFWNYVKRRDANLKKALQ
        L + G I    +AR+ +   T+  +Q               G +   +  + ++L+    +Q       +   +Q Q FW Y K RD  LKKALQ
Subjt:  LFHKGIIVTPNLARLQR---TQEARQ---------------GGLAYGIHKIVEQLALSTNRQ-------EFAERQSQTFWNYVKRRDANLKKALQ

A0A2P5DAQ2 Uncharacterized protein1.7e-1829.95Show/hide
Query:  LKRDFLFERGFNGDLPHFLRSGITNHDWELFCSKHESVKSP----------------------------GAINALYNLQDFPHTGYNEMVVAPSNEQLSD
        ++++F+++     + P F+   I  H+W+LFC+  E    P                             AIN +++L D P   ++E V   +  +L  
Subjt:  LKRDFLFERGFNGDLPHFLRSGITNHDWELFCSKHESVKSP----------------------------GAINALYNLQDFPHTGYNEMVVAPSNEQLSD

Query:  VVREVGVEGAQWRLSKTEKKTFQSAYLKKEANTRMGFIKQRLLPTTHDSTVSRERVVLAFDILRSLSIDVGKIISSEISGCWRKKVGKLFFPNTITMLCN
        V+  V + GA+W +S     T   + L   A     F+K RLLPTTH  TVS+E V L + +L   SI+VG++I  EI  C  +K G LFFP+ IT +C 
Subjt:  VVREVGVEGAQWRLSKTEKKTFQSAYLKKEANTRMGFIKQRLLPTTHDSTVSRERVVLAFDILRSLSIDVGKIISSEISGCWRKKVGKLFFPNTITMLCN

Query:  KAGVPENEGDVTLFHKG
            P    +  L + G
Subjt:  KAGVPENEGDVTLFHKG

A0A2P5DXM3 Uncharacterized protein7.2e-2234.78Show/hide
Query:  SPGAINALYNLQDFPHTGYNEMVVAPSNEQLSDVVREVGVEGAQWRLSKTEKKTFQSAYLKKEANTRMGFIKQRLLPTTHDSTVSRERVVLAFDILRSLS
        S  AINA++ L D P   ++E +   +  +L  V+  V   GA+W +S     T   + L   A     F+K RLLPTTH   VS++R++L   +L   S
Subjt:  SPGAINALYNLQDFPHTGYNEMVVAPSNEQLSDVVREVGVEGAQWRLSKTEKKTFQSAYLKKEANTRMGFIKQRLLPTTHDSTVSRERVVLAFDILRSLS

Query:  IDVGKIISSEISGCWRKKVGKLFFPNTITMLCNKAGVPENE------GDVTLFHKGIIV---------TPNLARLQRTQEARQGGLAYGIHKIVEQLALS
        I+VG++I SEI  C  +K G LFFP+ IT LC  A    NE      G++       I           P+ +R      +R  G      K +EQ    
Subjt:  IDVGKIISSEISGCWRKKVGKLFFPNTITMLCNKAGVPENE------GDVTLFHKGIIV---------TPNLARLQRTQEARQGGLAYGIHKIVEQLALS

Query:  TNRQEFAERQSQTFWNYVKRRDANLKKALQ
         ++QE   +Q Q FW Y K RD  LKKALQ
Subjt:  TNRQEFAERQSQTFWNYVKRRDANLKKALQ

W9S7D3 Uncharacterized protein6.8e-2028.51Show/hide
Query:  KQREDKGKGI--------AEASPEPEDSETEEPRLPYHRFLNNLARLKYAKLLKRDFLFERGFNGDLPHFLRSGITNHDWELFCSKHESVKSPGAINALY
        K +  KGKG+        +E+    E++E ++   P      +    K A  ++R F++ RG     P F+ S I  H W  FC    +     AIN+LY
Subjt:  KQREDKGKGI--------AEASPEPEDSETEEPRLPYHRFLNNLARLKYAKLLKRDFLFERGFNGDLPHFLRSGITNHDWELFCSKHESVKSPGAINALY

Query:  NLQDFPHTGYNEMVVAPSNEQLSDVVREVGVEGAQWRLSKTEKKTFQSAYLKKEANTRMGFIKQRLLPTTHDSTVSRERVVLAFDILRSLSIDVGKIISS
        +L D     +N    + + +QL +V+ E+ VEG +W  +     TF    L+        F++ RL+P++H   V +ER +L + +++   ++VG++I  
Subjt:  NLQDFPHTGYNEMVVAPSNEQLSDVVREVGVEGAQWRLSKTEKKTFQSAYLKKEANTRMGFIKQRLLPTTHDSTVSRERVVLAFDILRSLSIDVGKIISS

Query:  EISGCWRKKVGKLFFPNTITMLCNKAGVPENEGDV
        ++  C  +K G L+FP+ IT LC   GV   E ++
Subjt:  EISGCWRKKVGKLFFPNTITMLCNKAGVPENEGDV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGAATATATGGCGCGTACAGATGCCACAATTCAAAGTAGTCAAGCTTCAATGAGAGCACTGGAATTGCAAATGGGCCAGCTAGCTAATGAGTTGAAGGCAAGGCC
TCAAGGGAAACTTCCTTCAGATACTGAGCACCCTAGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTGACTCTGAGGAGTGGTAAGCCACTAGAAGAAAGAAAAGAGCCTA
GTAAACCCCAGGATATAGAAAATAATTGTGATAAAAATGTTGTTGTTGAGAAAGAGTTGGAGTCTGGTCAAGGTGTTGGAGGCAGCAATAATGATGCTGGAGCATCTGAT
TGCATGAGATTTGAACGTGCAAGAAAAGAGAGAGATAATGAGGAAGAGGAGGTACCTGTTACCCCCGAAGTACCGAAAACGAAGGCAATGAAAAAGAAAACGCCAGAAGA
GAAAGAGGCTAAGAGGAGAAGAAGACAGCAGAGGGCTGAGGATCAAGAAATTGTGCAGAAGGTAGTAGAGGATGTTGCTGCTGAAGTAGTGGAAGAAGGCAATCTGAAGG
AACCTGAAGGACAAAACCCAGAGCAGGCTGACCCGATAGTTGCGGATACGGATGAAGTTCAAGAAGAAAACACAGAGGAAGTTCAAGAAAAGCAGACTGAGGTTACGCAA
GAAGGACGGACAGAGGTTGCGCCTGAAAAAGGTAATGAACAAGAGCAGGAGGCTCGAGTGGAGGTTATCATGCCGGAAGTGCCACGACGTCGCCGCCGGAAGCAAAAAGC
CGGCAGTGTTAAGAAAAAAGAGGCCGAAGACAAAGCATCAGAGGAAACAGAGAAAAAGGCTGAGGAAGAAATTTTGCTCAAACAAAGGGAAGACAAGGGCAAGGGCATTG
CTGAAGCATCGCCGGAACCAGAAGATAGTGAAACAGAGGAACCAAGGTTGCCGTATCATCGCTTCCTCAACAATCTTGCAAGATTAAAGTATGCTAAGCTGCTGAAGAGA
GACTTCCTGTTTGAGAGAGGATTTAATGGTGATCTTCCACATTTTCTGCGGTCCGGCATTACGAACCACGATTGGGAGTTATTTTGTTCAAAGCATGAATCTGTGAAGAG
TCCTGGTGCCATTAACGCCCTGTATAATCTTCAAGATTTCCCCCACACAGGATACAATGAGATGGTTGTGGCGCCATCTAATGAGCAATTAAGCGATGTTGTTCGGGAAG
TTGGTGTTGAAGGGGCACAGTGGAGGCTTTCAAAAACAGAAAAAAAGACATTTCAGTCAGCCTATCTTAAGAAGGAAGCAAATACACGGATGGGATTTATCAAACAGAGG
TTGCTTCCAACGACTCATGACTCGACGGTTTCTAGGGAACGTGTTGTTCTGGCTTTCGATATTTTAAGGTCTCTCAGTATTGATGTGGGTAAGATTATTTCGAGTGAAAT
ATCTGGATGCTGGCGGAAGAAAGTTGGGAAGTTGTTTTTCCCGAATACAATTACGATGCTTTGCAACAAAGCAGGGGTTCCGGAGAATGAAGGAGATGTCACATTATTTC
ACAAGGGAATCATTGTTACGCCTAACTTGGCACGGCTTCAGCGTACGCAAGAAGCACGTCAGGGTGGGCTTGCTTATGGCATTCACAAGATTGTAGAACAACTTGCACTG
TCGACCAACAGGCAAGAGTTTGCCGAGAGGCAATCTCAAACTTTCTGGAACTATGTTAAACGTCGTGATGCTAATCTAAAGAAGGCGCTACAATAA
mRNA sequenceShow/hide mRNA sequence
ATGAAAGAATATATGGCGCGTACAGATGCCACAATTCAAAGTAGTCAAGCTTCAATGAGAGCACTGGAATTGCAAATGGGCCAGCTAGCTAATGAGTTGAAGGCAAGGCC
TCAAGGGAAACTTCCTTCAGATACTGAGCACCCTAGAAGGGAAGGTAAGGAGCAGGTAAAGGCAGTGACTCTGAGGAGTGGTAAGCCACTAGAAGAAAGAAAAGAGCCTA
GTAAACCCCAGGATATAGAAAATAATTGTGATAAAAATGTTGTTGTTGAGAAAGAGTTGGAGTCTGGTCAAGGTGTTGGAGGCAGCAATAATGATGCTGGAGCATCTGAT
TGCATGAGATTTGAACGTGCAAGAAAAGAGAGAGATAATGAGGAAGAGGAGGTACCTGTTACCCCCGAAGTACCGAAAACGAAGGCAATGAAAAAGAAAACGCCAGAAGA
GAAAGAGGCTAAGAGGAGAAGAAGACAGCAGAGGGCTGAGGATCAAGAAATTGTGCAGAAGGTAGTAGAGGATGTTGCTGCTGAAGTAGTGGAAGAAGGCAATCTGAAGG
AACCTGAAGGACAAAACCCAGAGCAGGCTGACCCGATAGTTGCGGATACGGATGAAGTTCAAGAAGAAAACACAGAGGAAGTTCAAGAAAAGCAGACTGAGGTTACGCAA
GAAGGACGGACAGAGGTTGCGCCTGAAAAAGGTAATGAACAAGAGCAGGAGGCTCGAGTGGAGGTTATCATGCCGGAAGTGCCACGACGTCGCCGCCGGAAGCAAAAAGC
CGGCAGTGTTAAGAAAAAAGAGGCCGAAGACAAAGCATCAGAGGAAACAGAGAAAAAGGCTGAGGAAGAAATTTTGCTCAAACAAAGGGAAGACAAGGGCAAGGGCATTG
CTGAAGCATCGCCGGAACCAGAAGATAGTGAAACAGAGGAACCAAGGTTGCCGTATCATCGCTTCCTCAACAATCTTGCAAGATTAAAGTATGCTAAGCTGCTGAAGAGA
GACTTCCTGTTTGAGAGAGGATTTAATGGTGATCTTCCACATTTTCTGCGGTCCGGCATTACGAACCACGATTGGGAGTTATTTTGTTCAAAGCATGAATCTGTGAAGAG
TCCTGGTGCCATTAACGCCCTGTATAATCTTCAAGATTTCCCCCACACAGGATACAATGAGATGGTTGTGGCGCCATCTAATGAGCAATTAAGCGATGTTGTTCGGGAAG
TTGGTGTTGAAGGGGCACAGTGGAGGCTTTCAAAAACAGAAAAAAAGACATTTCAGTCAGCCTATCTTAAGAAGGAAGCAAATACACGGATGGGATTTATCAAACAGAGG
TTGCTTCCAACGACTCATGACTCGACGGTTTCTAGGGAACGTGTTGTTCTGGCTTTCGATATTTTAAGGTCTCTCAGTATTGATGTGGGTAAGATTATTTCGAGTGAAAT
ATCTGGATGCTGGCGGAAGAAAGTTGGGAAGTTGTTTTTCCCGAATACAATTACGATGCTTTGCAACAAAGCAGGGGTTCCGGAGAATGAAGGAGATGTCACATTATTTC
ACAAGGGAATCATTGTTACGCCTAACTTGGCACGGCTTCAGCGTACGCAAGAAGCACGTCAGGGTGGGCTTGCTTATGGCATTCACAAGATTGTAGAACAACTTGCACTG
TCGACCAACAGGCAAGAGTTTGCCGAGAGGCAATCTCAAACTTTCTGGAACTATGTTAAACGTCGTGATGCTAATCTAAAGAAGGCGCTACAATAA
Protein sequenceShow/hide protein sequence
MKEYMARTDATIQSSQASMRALELQMGQLANELKARPQGKLPSDTEHPRREGKEQVKAVTLRSGKPLEERKEPSKPQDIENNCDKNVVVEKELESGQGVGGSNNDAGASD
CMRFERARKERDNEEEEVPVTPEVPKTKAMKKKTPEEKEAKRRRRQQRAEDQEIVQKVVEDVAAEVVEEGNLKEPEGQNPEQADPIVADTDEVQEENTEEVQEKQTEVTQ
EGRTEVAPEKGNEQEQEARVEVIMPEVPRRRRRKQKAGSVKKKEAEDKASEETEKKAEEEILLKQREDKGKGIAEASPEPEDSETEEPRLPYHRFLNNLARLKYAKLLKR
DFLFERGFNGDLPHFLRSGITNHDWELFCSKHESVKSPGAINALYNLQDFPHTGYNEMVVAPSNEQLSDVVREVGVEGAQWRLSKTEKKTFQSAYLKKEANTRMGFIKQR
LLPTTHDSTVSRERVVLAFDILRSLSIDVGKIISSEISGCWRKKVGKLFFPNTITMLCNKAGVPENEGDVTLFHKGIIVTPNLARLQRTQEARQGGLAYGIHKIVEQLAL
STNRQEFAERQSQTFWNYVKRRDANLKKALQ