; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg005502 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg005502
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRT_RNaseH_2 domain-containing protein
Genome locationscaffold7:27864595..27867307
RNA-Seq ExpressionSpg005502
SyntenySpg005502
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]5.9e-2434.77Show/hide
Query:  LKRRAEKGKSVAEASEEPDEIEEHGRFINNFARAKYAELLKRDFLFERGF-------SGNLPHFLRTSIADHGWERFCSKPEAVNAQVVREFYANIDKED
        +KR A K     +   E  E     R+ NN        +  R    E+GF        G LP F+   I  H W++FC+ PE     +VREFYAN+    
Subjt:  LKRRAEKGKSVAEASEEPDEIEEHGRFINNFARAKYAELLKRDFLFERGF-------SGNLPHFLRTSIADHGWERFCSKPEAVNAQVVREFYANIDKED

Query:  GFQVIVRGVELS-------------DAVRE-------------------VGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRER
           V VRGV++S             D V E                   V + GA+W +S  G  T   + L   A  W  F++  +LPTTH  TVS++R
Subjt:  GFQVIVRGVELS-------------DAVRE-------------------VGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRER

Query:  VLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCR--RAGVLVDEGDVILFDKGIIDTSNLARLQRTQE
        +LL  ++L   SI+VG+MI +EI  C  +K G LFFP+ IT LCR  RA  LV+E    L + G ID   +AR+  TQE
Subjt:  VLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCR--RAGVLVDEGDVILFDKGIIDTSNLARLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]2.0e-3532.51Show/hide
Query:  RFINNFARAKYA-ELLKRDFLFERGF-------SGNLPHFLRTSIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVELS------------
        +F    A  +Y   +  R    E+GF        G LP F+   I  H W++FC+ PE     +VREFYAN+   +   V VRGV++S            
Subjt:  RFINNFARAKYA-ELLKRDFLFERGF-------SGNLPHFLRTSIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVELS------------

Query:  -DAVRE-------------------VGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEIS
         D V E                   V   GA+W +S  G  T   + L   A  W  F++ R+LPTTH  TVS++R+LL  ++L   SI+VG+MI +EI 
Subjt:  -DAVRE-------------------VGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEIS

Query:  GCWKKKVGKLFFPNTITMLCR--RAGVLVDEGDVILFDKGIIDTSNLARLQR---TQEARQ---------------GGLIYGINTVLEQLALSASRQ---
         C  +K G LFFP+ IT LCR  RA  LV+E    L + G ID   +AR+ +   T+  +Q               G ++  +  + ++L+    +Q   
Subjt:  GCWKKKVGKLFFPNTITMLCR--RAGVLVDEGDVILFDKGIIDTSNLARLQR---TQEARQ---------------GGLIYGINTVLEQLALSASRQ---

Query:  ----EFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEE
            +   +Q   FW Y + RD  LKKALQ NF++P P  PAFP+++L         E E E D++
Subjt:  ----EFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEE

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]3.5e-2434.93Show/hide
Query:  LKRDFLFERGFSGNLPHFLRTSIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVE-------------LSDAVRE----------------
        ++++F+++       P F+   I  H W+ FC+ PE     +VREFY N+   D   V +RGV+             L D + E                
Subjt:  LKRDFLFERGFSGNLPHFLRTSIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVE-------------LSDAVRE----------------

Query:  ---VGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCR-
           V I GA+W +S  G  T   + L   A  W  F++ R+LPTTH  TVS+E V L +++L   SI+VG+MI  EI  C  +K G LFFP+ IT +CR 
Subjt:  ---VGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCR-

Query:  -RAGVLVDE
         RA  LV+E
Subjt:  -RAGVLVDE

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]1.1e-3035.15Show/hide
Query:  VVREFYANIDKEDGFQVIVRGVELS-------------DAVRE-------------------VGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRM
        +VREFYAN+   +   + VRGV++S             D V E                   V   GA+W +S  G  T   + L   A  W  F++ R+
Subjt:  VVREFYANIDKEDGFQVIVRGVELS-------------DAVRE-------------------VGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRM

Query:  LPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVLVDEGDVILFDKGIIDTSNLARL------QRTQE----
        LPTTH   VS++R+LL  ++L   SI+VG+MI +EI  C  +K G LFFP+ IT LCR A  LV+E    L + G ID   +AR+      + TQ+    
Subjt:  LPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVLVDEGDVILFDKGIIDTSNLARL------QRTQE----

Query:  -------ARQGGLIYGINTVLEQLALSASRQEFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEE
               +R  G +      LEQ     S+QE   +Q   FW Y + RD  LKKALQ NF++P P  PAFP+++L         E E E D++
Subjt:  -------ARQGGLIYGINTVLEQLALSASRQEFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEE

TYG52543.1 hypothetical protein ES288_D09G036700v1 [Gossypium darwinii]1.2e-2434.62Show/hide
Query:  EHGRFINNFARAKYAELLK-RDFLFERGF---SGNL---PHFLRTSIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVE------LSDAVR
        E+   I+   + ++  + K +  + E+GF   S +L   P  +R  I    WERFC      + ++VREFYA++  +D  +VIVR         L   + 
Subjt:  EHGRFINNFARAKYAELLK-RDFLFERGF---SGNL---PHFLRTSIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVE------LSDAVR

Query:  EVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAG
         V   G+QW +   G  + Q  YLK  A  W  F+R   +P +H ST+S E +LL +AIL   SI+VGK+I+ EI  C KKK    +FP+ IT LC +A 
Subjt:  EVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAG

Query:  VLVDEGDVILFDKGIIDTSNLARL-QRTQEARQG
        V +       + +G I   +L RL +R  E  QG
Subjt:  VLVDEGDVILFDKGIIDTSNLARL-QRTQEARQG

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)2.9e-2434.77Show/hide
Query:  LKRRAEKGKSVAEASEEPDEIEEHGRFINNFARAKYAELLKRDFLFERGF-------SGNLPHFLRTSIADHGWERFCSKPEAVNAQVVREFYANIDKED
        +KR A K     +   E  E     R+ NN        +  R    E+GF        G LP F+   I  H W++FC+ PE     +VREFYAN+    
Subjt:  LKRRAEKGKSVAEASEEPDEIEEHGRFINNFARAKYAELLKRDFLFERGF-------SGNLPHFLRTSIADHGWERFCSKPEAVNAQVVREFYANIDKED

Query:  GFQVIVRGVELS-------------DAVRE-------------------VGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRER
           V VRGV++S             D V E                   V + GA+W +S  G  T   + L   A  W  F++  +LPTTH  TVS++R
Subjt:  GFQVIVRGVELS-------------DAVRE-------------------VGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRER

Query:  VLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCR--RAGVLVDEGDVILFDKGIIDTSNLARLQRTQE
        +LL  ++L   SI+VG+MI +EI  C  +K G LFFP+ IT LCR  RA  LV+E    L + G ID   +AR+  TQE
Subjt:  VLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCR--RAGVLVDEGDVILFDKGIIDTSNLARLQRTQE

A0A2P5BCG4 Uncharacterized protein (Fragment)9.5e-3632.51Show/hide
Query:  RFINNFARAKYA-ELLKRDFLFERGF-------SGNLPHFLRTSIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVELS------------
        +F    A  +Y   +  R    E+GF        G LP F+   I  H W++FC+ PE     +VREFYAN+   +   V VRGV++S            
Subjt:  RFINNFARAKYA-ELLKRDFLFERGF-------SGNLPHFLRTSIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVELS------------

Query:  -DAVRE-------------------VGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEIS
         D V E                   V   GA+W +S  G  T   + L   A  W  F++ R+LPTTH  TVS++R+LL  ++L   SI+VG+MI +EI 
Subjt:  -DAVRE-------------------VGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEIS

Query:  GCWKKKVGKLFFPNTITMLCR--RAGVLVDEGDVILFDKGIIDTSNLARLQR---TQEARQ---------------GGLIYGINTVLEQLALSASRQ---
         C  +K G LFFP+ IT LCR  RA  LV+E    L + G ID   +AR+ +   T+  +Q               G ++  +  + ++L+    +Q   
Subjt:  GCWKKKVGKLFFPNTITMLCR--RAGVLVDEGDVILFDKGIIDTSNLARLQR---TQEARQ---------------GGLIYGINTVLEQLALSASRQ---

Query:  ----EFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEE
            +   +Q   FW Y + RD  LKKALQ NF++P P  PAFP+++L         E E E D++
Subjt:  ----EFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEE

A0A2P5DAQ2 Uncharacterized protein1.7e-2434.93Show/hide
Query:  LKRDFLFERGFSGNLPHFLRTSIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVE-------------LSDAVRE----------------
        ++++F+++       P F+   I  H W+ FC+ PE     +VREFY N+   D   V +RGV+             L D + E                
Subjt:  LKRDFLFERGFSGNLPHFLRTSIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVE-------------LSDAVRE----------------

Query:  ---VGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCR-
           V I GA+W +S  G  T   + L   A  W  F++ R+LPTTH  TVS+E V L +++L   SI+VG+MI  EI  C  +K G LFFP+ IT +CR 
Subjt:  ---VGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCR-

Query:  -RAGVLVDE
         RA  LV+E
Subjt:  -RAGVLVDE

A0A2P5DXM3 Uncharacterized protein5.4e-3135.15Show/hide
Query:  VVREFYANIDKEDGFQVIVRGVELS-------------DAVRE-------------------VGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRM
        +VREFYAN+   +   + VRGV++S             D V E                   V   GA+W +S  G  T   + L   A  W  F++ R+
Subjt:  VVREFYANIDKEDGFQVIVRGVELS-------------DAVRE-------------------VGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRM

Query:  LPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVLVDEGDVILFDKGIIDTSNLARL------QRTQE----
        LPTTH   VS++R+LL  ++L   SI+VG+MI +EI  C  +K G LFFP+ IT LCR A  LV+E    L + G ID   +AR+      + TQ+    
Subjt:  LPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVLVDEGDVILFDKGIIDTSNLARL------QRTQE----

Query:  -------ARQGGLIYGINTVLEQLALSASRQEFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEE
               +R  G +      LEQ     S+QE   +Q   FW Y + RD  LKKALQ NF++P P  PAFP+++L         E E E D++
Subjt:  -------ARQGGLIYGINTVLEQLALSASRQEFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEE

A0A5D2B8V0 Uncharacterized protein5.8e-2534.62Show/hide
Query:  EHGRFINNFARAKYAELLK-RDFLFERGF---SGNL---PHFLRTSIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVE------LSDAVR
        E+   I+   + ++  + K +  + E+GF   S +L   P  +R  I    WERFC      + ++VREFYA++  +D  +VIVR         L   + 
Subjt:  EHGRFINNFARAKYAELLK-RDFLFERGF---SGNL---PHFLRTSIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVE------LSDAVR

Query:  EVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAG
         V   G+QW +   G  + Q  YLK  A  W  F+R   +P +H ST+S E +LL +AIL   SI+VGK+I+ EI  C KKK    +FP+ IT LC +A 
Subjt:  EVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAG

Query:  VLVDEGDVILFDKGIIDTSNLARL-QRTQEARQG
        V +       + +G I   +L RL +R  E  QG
Subjt:  VLVDEGDVILFDKGIIDTSNLARL-QRTQEARQG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAGAAGCACCGAAACGTCGCCGCGTTAAGAGGAAAGCAGGCCGCGTTAGGGTTGTCCGAACTGATACTCCTTCGCCTCCAACCACGGATTCTGAAAGAGAGAATGC
GGAGAGAGTAGAGCATGAGAAGAAGGAAGCCGAGGAAAGAGCAAGAGAAGAGGCAGAGAAAAAGGCTGAGGAAGAGCGGTTGCTCAAGCGAAGGGCGGAAAAGGGCAAAA
GTGTTGCTGAAGCATCGGAGGAACCTGATGAAATAGAAGAACATGGTCGCTTCATCAACAATTTTGCCAGAGCAAAATACGCTGAGCTGCTGAAAAGAGACTTCCTGTTT
GAAAGAGGATTTAGCGGTAATCTTCCACATTTTCTGAGGACCAGCATTGCAGACCACGGCTGGGAGCGGTTTTGTTCAAAGCCTGAGGCTGTAAACGCACAGGTGGTGCG
TGAATTTTATGCTAATATTGATAAGGAAGATGGTTTCCAGGTGATTGTTCGAGGAGTCGAGTTAAGTGATGCTGTGCGGGAGGTGGGTATTGAAGGGGCACAGTGGCAGC
TGTCCAAGACAGGGAAAAGGACATTTCAGTCCGCTTATCTGAAGAGGGAAGCGAACACGTGGATGGGATTTATCAGACAGCGGATGCTTCCAACGACTCATGACTCGACG
GTCTCGAGGGAACGGGTTCTTCTAGCTTTCGCGATTTTGCGGTCTCTTAGCATTGATGTAGGGAAGATGATTGTTAATGAGATTTCTGGTTGTTGGAAAAAGAAAGTGGG
GAAACTGTTCTTTCCGAACACAATCACGATGCTTTGCAGAAGAGCAGGGGTTCTAGTGGATGAGGGAGATGTTATCCTGTTTGACAAGGGAATTATAGATACGTCTAACT
TGGCGCGGCTTCAGCGTACGCAGGAGGCACGTCAAGGTGGGCTTATCTACGGCATCAACACGGTTTTAGAACAACTGGCACTTTCGGCCAGCAGGCAAGAGTTTGCCGAG
AGGCAAGCTTTAACCTTCTGGAACTATGTTAGAAATCGTGATGCCAATCTGAAGAAGGCGCTGCAAGAGAATTTTTCCAAGCCGTATCCAGCCCTTCCAGCATTCCCTGA
GGATCTGTTGAACCCTTGGATTCCACCCCCACCTGTTGAAAGAGAAGAGGAGGATGATGAAGAGCAGGAAACCTTTTGCTTGAGCATTTTCTCTGACCTGGTCGTTGCTG
CGGCAAAGAAAATTCTGGAGGTAGTGTTGACTTATGTGATCCGCTTTAAGCTTAGGTCTAGTCCCACGCTTACTAAGCTGTGGCAAGTTCTTAGAATTGAGTTAAAAGTG
GTGATTATTTGTCCATGCCGGAAGAATTATTTTGCTGCAGCAGAGCTTGGTTTTGCAGAGTGCTCAGAATCTGTTGCTGGGCGACTTGAGGGAGCAAATTCTGTGCTGGA
GCAAAGCTGGGAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCAGAAGCACCGAAACGTCGCCGCGTTAAGAGGAAAGCAGGCCGCGTTAGGGTTGTCCGAACTGATACTCCTTCGCCTCCAACCACGGATTCTGAAAGAGAGAATGC
GGAGAGAGTAGAGCATGAGAAGAAGGAAGCCGAGGAAAGAGCAAGAGAAGAGGCAGAGAAAAAGGCTGAGGAAGAGCGGTTGCTCAAGCGAAGGGCGGAAAAGGGCAAAA
GTGTTGCTGAAGCATCGGAGGAACCTGATGAAATAGAAGAACATGGTCGCTTCATCAACAATTTTGCCAGAGCAAAATACGCTGAGCTGCTGAAAAGAGACTTCCTGTTT
GAAAGAGGATTTAGCGGTAATCTTCCACATTTTCTGAGGACCAGCATTGCAGACCACGGCTGGGAGCGGTTTTGTTCAAAGCCTGAGGCTGTAAACGCACAGGTGGTGCG
TGAATTTTATGCTAATATTGATAAGGAAGATGGTTTCCAGGTGATTGTTCGAGGAGTCGAGTTAAGTGATGCTGTGCGGGAGGTGGGTATTGAAGGGGCACAGTGGCAGC
TGTCCAAGACAGGGAAAAGGACATTTCAGTCCGCTTATCTGAAGAGGGAAGCGAACACGTGGATGGGATTTATCAGACAGCGGATGCTTCCAACGACTCATGACTCGACG
GTCTCGAGGGAACGGGTTCTTCTAGCTTTCGCGATTTTGCGGTCTCTTAGCATTGATGTAGGGAAGATGATTGTTAATGAGATTTCTGGTTGTTGGAAAAAGAAAGTGGG
GAAACTGTTCTTTCCGAACACAATCACGATGCTTTGCAGAAGAGCAGGGGTTCTAGTGGATGAGGGAGATGTTATCCTGTTTGACAAGGGAATTATAGATACGTCTAACT
TGGCGCGGCTTCAGCGTACGCAGGAGGCACGTCAAGGTGGGCTTATCTACGGCATCAACACGGTTTTAGAACAACTGGCACTTTCGGCCAGCAGGCAAGAGTTTGCCGAG
AGGCAAGCTTTAACCTTCTGGAACTATGTTAGAAATCGTGATGCCAATCTGAAGAAGGCGCTGCAAGAGAATTTTTCCAAGCCGTATCCAGCCCTTCCAGCATTCCCTGA
GGATCTGTTGAACCCTTGGATTCCACCCCCACCTGTTGAAAGAGAAGAGGAGGATGATGAAGAGCAGGAAACCTTTTGCTTGAGCATTTTCTCTGACCTGGTCGTTGCTG
CGGCAAAGAAAATTCTGGAGGTAGTGTTGACTTATGTGATCCGCTTTAAGCTTAGGTCTAGTCCCACGCTTACTAAGCTGTGGCAAGTTCTTAGAATTGAGTTAAAAGTG
GTGATTATTTGTCCATGCCGGAAGAATTATTTTGCTGCAGCAGAGCTTGGTTTTGCAGAGTGCTCAGAATCTGTTGCTGGGCGACTTGAGGGAGCAAATTCTGTGCTGGA
GCAAAGCTGGGAGTGA
Protein sequenceShow/hide protein sequence
MPEAPKRRRVKRKAGRVRVVRTDTPSPPTTDSERENAERVEHEKKEAEERAREEAEKKAEEERLLKRRAEKGKSVAEASEEPDEIEEHGRFINNFARAKYAELLKRDFLF
ERGFSGNLPHFLRTSIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVELSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDST
VSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVLVDEGDVILFDKGIIDTSNLARLQRTQEARQGGLIYGINTVLEQLALSASRQEFAE
RQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEEQETFCLSIFSDLVVAAAKKILEVVLTYVIRFKLRSSPTLTKLWQVLRIELKV
VIICPCRKNYFAAAELGFAECSESVAGRLEGANSVLEQSWE