; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg014825 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg014825
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionCCHC-type domain-containing protein
Genome locationscaffold3:36534687..36540839
RNA-Seq ExpressionSpg014825
SyntenySpg014825
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU48804.1 hypothetical protein TSUD_406370 [Trifolium subterraneum]7.6e-2734.5Show/hide
Query:  SWRKDLIRGSFDVLDAEDIINTPLGDRRAKDEIIWGLEKKGTFSVRSAYHLASNISTIEVASASDYVNLKTLWKRLWKASTIPRAKACSWKIINDIIPCG
        SW ++LI   F   +A+ I+N PL  R   D++IW  EK G FSVRSA+H+      + +  AS   N + +WK +WK    P  K   W++  DI+P  
Subjt:  SWRKDLIRGSFDVLDAEDIINTPLGDRRAKDEIIWGLEKKGTFSVRSAYHLASNISTIEVASASDYVNLKTLWKRLWKASTIPRAKACSWKIINDIIPCG

Query:  ANIKKKGISLIPLRVFCGKKEQTTTHLIWNCKLIREMWTRFIPSSSKLFALCRTSWNLIDYWTWLIENLSQEDL---ELAIIMMWKIWNARNVLAANNSP
          +K+KG+SL  + + C + E+   HL   C+L+++ W       S L     +  +L D   WL+E LS +++   +L  I +WKIW  RN+L   N+P
Subjt:  ANIKKKGISLIPLRVFCGKKEQTTTHLIWNCKLIREMWTRFIPSSSKLFALCRTSWNLIDYWTWLIENLSQEDL---ELAIIMMWKIWNARNVLAANNSP

GAU50334.1 hypothetical protein TSUD_243120 [Trifolium subterraneum]2.2e-2634.5Show/hide
Query:  SWRKDLIRGSFDVLDAEDIINTPLGDRRAKDEIIWGLEKKGTFSVRSAYHLASNISTIEVASASDYVNLKTLWKRLWKASTIPRAKACSWKIINDIIPCG
        SW ++LI   F   +A+ I+N PL  R   D++IW  EK G FSVRS +H+      + +  AS   N + +WK +WK    P  K   W++  DI+P  
Subjt:  SWRKDLIRGSFDVLDAEDIINTPLGDRRAKDEIIWGLEKKGTFSVRSAYHLASNISTIEVASASDYVNLKTLWKRLWKASTIPRAKACSWKIINDIIPCG

Query:  ANIKKKGISLIPLRVFCGKKEQTTTHLIWNCKLIREMWTRFIPSSSKLFALCRTSWNLIDYWTWLIENLSQEDL---ELAIIMMWKIWNARNVLAANNSP
          IK+KG+SL  +   C + E+   HL   C+L+++ W     SS  L    + S        WL+E LS +++   +L  I +WKIW  RN+L   N+P
Subjt:  ANIKKKGISLIPLRVFCGKKEQTTTHLIWNCKLIREMWTRFIPSSSKLFALCRTSWNLIDYWTWLIENLSQEDL---ELAIIMMWKIWNARNVLAANNSP

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]4.4e-2725.3Show/hide
Query:  EDHSWRKDLIRGSFDVLDAEDIINTPLGDRRAKDEIIWGLEKKGTFSVRSAYHLASN-ISTIEVASASDYVNLKTLWKRLWKASTIPRAKACSWKIINDI
        E   W+ D++R  F   +A  I++ PL     +D+IIW   +KG FSV+SAY++A   I  +EV  +S   +   LW++LW  +  P+ +  +WK+  + 
Subjt:  EDHSWRKDLIRGSFDVLDAEDIINTPLGDRRAKDEIIWGLEKKGTFSVRSAYHLASN-ISTIEVASASDYVNLKTLWKRLWKASTIPRAKACSWKIINDI

Query:  IPCGANIKKKGISLIPLRVFCGKKEQTTTHLIWNCKLIREMWTRFIPSSSKLFALCRTSWNLIDYWTWLIENLSQEDLELAIIMMWKIWNARN-------
        +P   N+ +KG+++  +   CG + ++  H+   C++ + +W  ++ + +    L   + +++D    +++  +  DLE+  ++ W IW  RN       
Subjt:  IPCGANIKKKGISLIPLRVFCGKKEQTTTHLIWNCKLIREMWTRFIPSSSKLFALCRTSWNLIDYWTWLIENLSQEDLELAIIMMWKIWNARN-------

Query:  --------------VLAANNS--------PQS------------RYHFQRAWFEESRSRGVAWVVRDSVGSPICFGMKHNKRKWDINSLEALAIWEGLKC
                      +L   N+        PQS            + +   A  E  R+  V  ++RD+ G        + + ++ +  +EALA+  GL  
Subjt:  --------------VLAANNS--------PQS------------RYHFQRAWFEESRSRGVAWVVRDSVGSPICFGMKHNKRKWDINSLEALAIWEGLKC

Query:  LSNFSKEERLP-LIIESDALEVVVGINEPNFS
            +KE++LP +I+ESDAL VV  +N    S
Subjt:  LSNFSKEERLP-LIIESDALEVVVGINEPNFS

XP_030494780.1 uncharacterized protein LOC115710559 [Cannabis sativa]2.0e-2733Show/hide
Query:  VGCFINEDHSWRKDLIRGSFDVLDAEDIINTPLGDRRAKDEIIWGLEKKGTFSVRSAYHLASNISTIEVASASDYVNLKTLWKRLWKASTIPRAKACSWK
        V  FIN D +W    ++  F     + I+  PL D    D +IWG    G  +V+SAYHLAS++++I+V S S+    +  WKRLW  S  P+ K  +WK
Subjt:  VGCFINEDHSWRKDLIRGSFDVLDAEDIINTPLGDRRAKDEIIWGLEKKGTFSVRSAYHLASNISTIEVASASDYVNLKTLWKRLWKASTIPRAKACSWK

Query:  IINDIIPCGANIKKKGISLIPLRVFCGKKEQTTTHLIWNCKLIREMWTRFIPSSSKLFALCRTSWNLIDYWTWLIENLSQEDLELAIIMMWKIWNARNVL
          N I+PC  N+ +K I   P    CG   ++ TH + +C   + +W     S  K F       ++ +++   ++N+S++D  L I ++W IWN RN++
Subjt:  IINDIIPCGANIKKKGISLIPLRVFCGKKEQTTTHLIWNCKLIREMWTRFIPSSSKLFALCRTSWNLIDYWTWLIENLSQEDLELAIIMMWKIWNARNVL

Query:  AAN
          N
Subjt:  AAN

XP_030943489.1 uncharacterized protein LOC115968280 [Quercus lobata]2.9e-2626.47Show/hide
Query:  MVGCFINED-HSWRKDLIRGSFDVLDAEDIINTPLGDRRAKDEIIWGLEKKGTFSVRSAYHLASNIST---IEVASASDYVNLKTLWKRLWKASTIPRAK
        MV   I++D   W+ DL+R  F   +A  I+N PL     +D+IIW   KKG F+V+SAY++A N++    IE  S+ D      LWKR+W      + +
Subjt:  MVGCFINED-HSWRKDLIRGSFDVLDAEDIINTPLGDRRAKDEIIWGLEKKGTFSVRSAYHLASNIST---IEVASASDYVNLKTLWKRLWKASTIPRAK

Query:  ACSWKIINDIIPCGANIKKKGISLIPLRVFCGKKEQTTTHLIWNCKLIREMWTRFIPSSSKLFALCRTSWNLIDYWTWLIENLSQEDLELAIIMMWKIWN
           W+   + +P   N+KK+GI+L  L   CGK  ++ TH +  C+  +++W  +      + +   + W+  D    ++E  +  DLE+  +M W IW 
Subjt:  ACSWKIINDIIPCGANIKKKGISLIPLRVFCGKKEQTTTHLIWNCKLIREMWTRFIPSSSKLFALCRTSWNLIDYWTWLIENLSQEDLELAIIMMWKIWN

Query:  ARNVL-------------------------AANNSPQSRYHFQRAWF----------------EESRSRGVAWVVRDSVGSPICFGMKHNKRKWDINSLE
         RN +                         A ++S Q +     +W                 E  R+  V  ++RDS G  +     +   ++     E
Subjt:  ARNVL-------------------------AANNSPQSRYHFQRAWF----------------EESRSRGVAWVVRDSVGSPICFGMKHNKRKWDINSLE

Query:  ALAIWEGLKCLSNFSKEERLPLIIESDALEVVVGINEPNF
         LA+  G+         +   +IIESDAL V+  I    F
Subjt:  ALAIWEGLKCLSNFSKEERLPLIIESDALEVVVGINEPNF

TrEMBL top hitse value%identityAlignment
A0A2I4FDV1 uncharacterized protein LOC1089978932.4e-2632.45Show/hide
Query:  WRKDLIRGSFDVLDAEDIINTPLGDRRAKDEIIWGLEKKGTFSVRSAYHLASNISTIEVASASDYVNLKTLWKRLWKASTIPRAKACSWKIINDIIPCGA
        W ++LI+ +F+  +A+ I+ TP+     +D IIW   K G+FSV+SAYHL  +        AS    LK +WK++WK    P  K   W+   + +P  +
Subjt:  WRKDLIRGSFDVLDAEDIINTPLGDRRAKDEIIWGLEKKGTFSVRSAYHLASNISTIEVASASDYVNLKTLWKRLWKASTIPRAKACSWKIINDIIPCGA

Query:  NIKKKGISLIPLRVFCGKKEQTTTHLIWNCKLIREMWTRFIPSSSKLFALCRTSWNLIDYWTWLIENLSQEDLELAIIMMWKIWNARN
        N+ KK +   PL   C ++E++  H++W+C+  +++W++    S KL     +S+++I ++  L E L+QE+L+  +I   +IW  RN
Subjt:  NIKKKGISLIPLRVFCGKKEQTTTHLIWNCKLIREMWTRFIPSSSKLFALCRTSWNLIDYWTWLIENLSQEDLELAIIMMWKIWNARN

A0A2Z6MMG1 Uncharacterized protein1.4e-2634Show/hide
Query:  SWRKDLIRGSFDVLDAEDIINTPLGDRRAKDEIIWGLEKKGTFSVRSAYHLASNISTIEVASASDYVNLKTLWKRLWKASTIPRAKACSWKIINDIIPCG
        SW ++LI  +F   +A+ I+N PL  R   D++IW  EK G FSVRSA+H+      + +  AS   N + +W+ +WK    P  K   W++  DI+P  
Subjt:  SWRKDLIRGSFDVLDAEDIINTPLGDRRAKDEIIWGLEKKGTFSVRSAYHLASNISTIEVASASDYVNLKTLWKRLWKASTIPRAKACSWKIINDIIPCG

Query:  ANIKKKGISLIPLRVFCGKKEQTTTHLIWNCKLIREMWTRFIPSSSKLFALCRTSWNLIDYWTWLIENLSQEDL---ELAIIMMWKIWNARNVLAANNSP
          +K+KG+SL  +   C + E+   HL   C+L+++ W       S L     +  +L D   WL+E LS +++   +L  I +WKIW  RN+L   N+P
Subjt:  ANIKKKGISLIPLRVFCGKKEQTTTHLIWNCKLIREMWTRFIPSSSKLFALCRTSWNLIDYWTWLIENLSQEDL---ELAIIMMWKIWNARNVLAANNSP

A0A2Z6NX89 zf-RVT domain-containing protein3.7e-2734.5Show/hide
Query:  SWRKDLIRGSFDVLDAEDIINTPLGDRRAKDEIIWGLEKKGTFSVRSAYHLASNISTIEVASASDYVNLKTLWKRLWKASTIPRAKACSWKIINDIIPCG
        SW ++LI   F   +A+ I+N PL  R   D++IW  EK G FSVRSA+H+      + +  AS   N + +WK +WK    P  K   W++  DI+P  
Subjt:  SWRKDLIRGSFDVLDAEDIINTPLGDRRAKDEIIWGLEKKGTFSVRSAYHLASNISTIEVASASDYVNLKTLWKRLWKASTIPRAKACSWKIINDIIPCG

Query:  ANIKKKGISLIPLRVFCGKKEQTTTHLIWNCKLIREMWTRFIPSSSKLFALCRTSWNLIDYWTWLIENLSQEDL---ELAIIMMWKIWNARNVLAANNSP
          +K+KG+SL  + + C + E+   HL   C+L+++ W       S L     +  +L D   WL+E LS +++   +L  I +WKIW  RN+L   N+P
Subjt:  ANIKKKGISLIPLRVFCGKKEQTTTHLIWNCKLIREMWTRFIPSSSKLFALCRTSWNLIDYWTWLIENLSQEDL---ELAIIMMWKIWNARNVLAANNSP

A0A2Z6P1Q4 zf-RVT domain-containing protein1.1e-2634.5Show/hide
Query:  SWRKDLIRGSFDVLDAEDIINTPLGDRRAKDEIIWGLEKKGTFSVRSAYHLASNISTIEVASASDYVNLKTLWKRLWKASTIPRAKACSWKIINDIIPCG
        SW ++LI   F   +A+ I+N PL  R   D++IW  EK G FSVRS +H+      + +  AS   N + +WK +WK    P  K   W++  DI+P  
Subjt:  SWRKDLIRGSFDVLDAEDIINTPLGDRRAKDEIIWGLEKKGTFSVRSAYHLASNISTIEVASASDYVNLKTLWKRLWKASTIPRAKACSWKIINDIIPCG

Query:  ANIKKKGISLIPLRVFCGKKEQTTTHLIWNCKLIREMWTRFIPSSSKLFALCRTSWNLIDYWTWLIENLSQEDL---ELAIIMMWKIWNARNVLAANNSP
          IK+KG+SL  +   C + E+   HL   C+L+++ W     SS  L    + S        WL+E LS +++   +L  I +WKIW  RN+L   N+P
Subjt:  ANIKKKGISLIPLRVFCGKKEQTTTHLIWNCKLIREMWTRFIPSSSKLFALCRTSWNLIDYWTWLIENLSQEDL---ELAIIMMWKIWNARNVLAANNSP

A0A803P9U8 Uncharacterized protein9.7e-2833Show/hide
Query:  VGCFINEDHSWRKDLIRGSFDVLDAEDIINTPLGDRRAKDEIIWGLEKKGTFSVRSAYHLASNISTIEVASASDYVNLKTLWKRLWKASTIPRAKACSWK
        V  FIN D +W    ++  F     + I+  PL D    D +IWG    G  +V+SAYHLAS++++I+V S S+    +  WKRLW  S  P+ K  +WK
Subjt:  VGCFINEDHSWRKDLIRGSFDVLDAEDIINTPLGDRRAKDEIIWGLEKKGTFSVRSAYHLASNISTIEVASASDYVNLKTLWKRLWKASTIPRAKACSWK

Query:  IINDIIPCGANIKKKGISLIPLRVFCGKKEQTTTHLIWNCKLIREMWTRFIPSSSKLFALCRTSWNLIDYWTWLIENLSQEDLELAIIMMWKIWNARNVL
          N I+PC  N+ +K I   P    CG   ++ TH + +C   + +W     S  K F       ++ +++   ++N+S++D  L I ++W IWN RN++
Subjt:  IINDIIPCGANIKKKGISLIPLRVFCGKKEQTTTHLIWNCKLIREMWTRFIPSSSKLFALCRTSWNLIDYWTWLIENLSQEDLELAIIMMWKIWNARNVL

Query:  AAN
          N
Subjt:  AAN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein1.1e-1023.33Show/hide
Query:  WRKDLIRGSFDVLDAEDIINTPLGDRRAKDEIIWGLEKKGTFSVRSAYHL-----ASNISTIEVASASDYVNLKTLWKRLWKASTIPRAKACSWKIINDI
        W    I    D  D   I    L   +  D+IIW     G ++VRS Y L     ++NI  I     S  ++LKT   R+W    +P+ K   W+ ++  
Subjt:  WRKDLIRGSFDVLDAEDIINTPLGDRRAKDEIIWGLEKKGTFSVRSAYHL-----ASNISTIEVASASDYVNLKTLWKRLWKASTIPRAKACSWKIINDI

Query:  IPCGANIKKKGISLIPLRVFCGKKEQTTTHLIWNCKLIREMWTRFIPSSSKLFALCRTSW-----NLIDYWTWLIENLSQEDLE--LAIIMMWKIWNARN
        +     +  +G+ + P    C ++ ++  H ++ C      W R   SS     L    +     N++++    +++ +  D    L + ++W+IW ARN
Subjt:  IPCGANIKKKGISLIPLRVFCGKKEQTTTHLIWNCKLIREMWTRFIPSSSKLFALCRTSW-----NLIDYWTWLIENLSQEDLE--LAIIMMWKIWNARN

Query:  VLAANNSPQS
         +  N   +S
Subjt:  VLAANNSPQS

AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.9e-0432.14Show/hide
Query:  LWKASTIPRAKACSWKIINDIIPCGANIKKKGISLIPLRVFCGKKEQTTTHLIWNC
        +W     P+ K   WK +N+ +P GA +  + IS+ P    C +  +T TH+++NC
Subjt:  LWKASTIPRAKACSWKIINDIIPCGANIKKKGISLIPLRVFCGKKEQTTTHLIWNC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGGATGCTTCATCAATGAAGATCACTCCTGGCGCAAAGACCTTATAAGGGGAAGTTTTGATGTCCTTGATGCCGAAGACATCATTAACACCCCCCTTGGGGACAG
AAGAGCTAAGGATGAGATAATTTGGGGCCTTGAAAAAAAGGGTACTTTCTCGGTTAGAAGTGCCTACCATTTGGCCTCAAATATATCCACCATCGAAGTAGCCTCTGCCT
CAGATTACGTTAATTTGAAAACTCTTTGGAAAAGGCTTTGGAAGGCGTCTACAATCCCTAGAGCTAAGGCGTGTTCGTGGAAGATTATCAACGATATTATCCCGTGTGGA
GCCAACATAAAAAAGAAAGGGATATCCCTAATTCCCCTCCGTGTTTTTTGTGGAAAAAAAGAACAAACCACCACTCATCTAATCTGGAATTGTAAATTGATTAGAGAGAT
GTGGACTCGTTTTATCCCTTCCTCATCCAAGCTGTTTGCTCTGTGCAGGACATCGTGGAACCTTATCGACTATTGGACTTGGCTAATTGAGAATTTGTCTCAGGAAGACT
TGGAGCTCGCAATAATCATGATGTGGAAAATATGGAATGCAAGAAATGTTTTAGCAGCTAACAATTCCCCCCAGTCCAGATATCATTTTCAGAGGGCTTGGTTTGAGGAA
TCGAGAAGCAGAGGTGTTGCTTGGGTCGTTCGTGACTCTGTTGGATCCCCGATCTGTTTCGGGATGAAGCATAACAAAAGGAAATGGGATATCAATTCGCTGGAAGCATT
AGCAATTTGGGAAGGTTTAAAATGCTTGTCAAATTTTTCAAAGGAGGAGCGTTTACCTCTGATAATCGAGTCGGACGCCTTAGAAGTCGTTGTCGGCATCAACGAGCCTA
ATTTCTCTCGGAGACAAAAAACATCTTTGACGCGATTCGCCCCCTCTGCTCCGACTGGGGCTCCGTCGAGATCATGCACTGTCCTCGGGCAGAAAATCGAGTCGCCCACA
CCATCACTCGCCTTGTTGTTGAGCTTCGTTTTGATCCGCCTGTGGATTTTTCCCAATCTCGGGAAGATGGATTTGCGTCGAGCAATTGAAGTAGCCCAAACCAAGAATTG
GGCCAGGTCTGAACCGACAAGGTTTTGGGCCAAAAATGCGAAGCCCATTTTGCGAGTTGCAATGAACCATAGCAATATGAGCCATTTTGGTCATTGGGGATCAGAAACTC
AACTCAACGAATTGCAGAGAGAAGAAAGAGGAAACGAAGAAGGAGCAACAGCGAAGATGATCCACCGGAAATGGAGTCTTCTCACCGGCCCCACTGCAATCTTGGGCGGC
ATAGTTCTCTCTCTGGCCGTCGCCAACTTCATCTTCGTTAAAGAAGGTAGGACCGGATCCGTTTCTGAAACCAAAGGAAAGGAAGCCAGAACCAGGTCGGGAATGTATAA
GGGAACAGTCGGGTGTGTGGTCGACCCCGAAGCTGTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTGGGATGCTTCATCAATGAAGATCACTCCTGGCGCAAAGACCTTATAAGGGGAAGTTTTGATGTCCTTGATGCCGAAGACATCATTAACACCCCCCTTGGGGACAG
AAGAGCTAAGGATGAGATAATTTGGGGCCTTGAAAAAAAGGGTACTTTCTCGGTTAGAAGTGCCTACCATTTGGCCTCAAATATATCCACCATCGAAGTAGCCTCTGCCT
CAGATTACGTTAATTTGAAAACTCTTTGGAAAAGGCTTTGGAAGGCGTCTACAATCCCTAGAGCTAAGGCGTGTTCGTGGAAGATTATCAACGATATTATCCCGTGTGGA
GCCAACATAAAAAAGAAAGGGATATCCCTAATTCCCCTCCGTGTTTTTTGTGGAAAAAAAGAACAAACCACCACTCATCTAATCTGGAATTGTAAATTGATTAGAGAGAT
GTGGACTCGTTTTATCCCTTCCTCATCCAAGCTGTTTGCTCTGTGCAGGACATCGTGGAACCTTATCGACTATTGGACTTGGCTAATTGAGAATTTGTCTCAGGAAGACT
TGGAGCTCGCAATAATCATGATGTGGAAAATATGGAATGCAAGAAATGTTTTAGCAGCTAACAATTCCCCCCAGTCCAGATATCATTTTCAGAGGGCTTGGTTTGAGGAA
TCGAGAAGCAGAGGTGTTGCTTGGGTCGTTCGTGACTCTGTTGGATCCCCGATCTGTTTCGGGATGAAGCATAACAAAAGGAAATGGGATATCAATTCGCTGGAAGCATT
AGCAATTTGGGAAGGTTTAAAATGCTTGTCAAATTTTTCAAAGGAGGAGCGTTTACCTCTGATAATCGAGTCGGACGCCTTAGAAGTCGTTGTCGGCATCAACGAGCCTA
ATTTCTCTCGGAGACAAAAAACATCTTTGACGCGATTCGCCCCCTCTGCTCCGACTGGGGCTCCGTCGAGATCATGCACTGTCCTCGGGCAGAAAATCGAGTCGCCCACA
CCATCACTCGCCTTGTTGTTGAGCTTCGTTTTGATCCGCCTGTGGATTTTTCCCAATCTCGGGAAGATGGATTTGCGTCGAGCAATTGAAGTAGCCCAAACCAAGAATTG
GGCCAGGTCTGAACCGACAAGGTTTTGGGCCAAAAATGCGAAGCCCATTTTGCGAGTTGCAATGAACCATAGCAATATGAGCCATTTTGGTCATTGGGGATCAGAAACTC
AACTCAACGAATTGCAGAGAGAAGAAAGAGGAAACGAAGAAGGAGCAACAGCGAAGATGATCCACCGGAAATGGAGTCTTCTCACCGGCCCCACTGCAATCTTGGGCGGC
ATAGTTCTCTCTCTGGCCGTCGCCAACTTCATCTTCGTTAAAGAAGGTAGGACCGGATCCGTTTCTGAAACCAAAGGAAAGGAAGCCAGAACCAGGTCGGGAATGTATAA
GGGAACAGTCGGGTGTGTGGTCGACCCCGAAGCTGTCTAG
Protein sequenceShow/hide protein sequence
MVGCFINEDHSWRKDLIRGSFDVLDAEDIINTPLGDRRAKDEIIWGLEKKGTFSVRSAYHLASNISTIEVASASDYVNLKTLWKRLWKASTIPRAKACSWKIINDIIPCG
ANIKKKGISLIPLRVFCGKKEQTTTHLIWNCKLIREMWTRFIPSSSKLFALCRTSWNLIDYWTWLIENLSQEDLELAIIMMWKIWNARNVLAANNSPQSRYHFQRAWFEE
SRSRGVAWVVRDSVGSPICFGMKHNKRKWDINSLEALAIWEGLKCLSNFSKEERLPLIIESDALEVVVGINEPNFSRRQKTSLTRFAPSAPTGAPSRSCTVLGQKIESPT
PSLALLLSFVLIRLWIFPNLGKMDLRRAIEVAQTKNWARSEPTRFWAKNAKPILRVAMNHSNMSHFGHWGSETQLNELQREERGNEEGATAKMIHRKWSLLTGPTAILGG
IVLSLAVANFIFVKEGRTGSVSETKGKEARTRSGMYKGTVGCVVDPEAV