CuGenDBv2

Lag0035017 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0035017
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Ribonuclease H-like superfamily protein
Genome location: chr3:13658733..13659750
RNA-Seq Expression: Lag0035017
Synteny: Lag0035017
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR002156 - Ribonuclease H domain
  IPR012337 - Ribonuclease H-like superfamily
  IPR036397 - Ribonuclease H superfamily
  IPR044730 - Ribonuclease H-like domain, plant type


Homology
GenBank top hits (e value, % identity, alignment)
XP_022143317.1 uncharacterized protein LOC111013216 [Momordica charantia] (e value: 1.1e-17; % identity: 31.02)
Query:  WQFKQLNKEDINAAAILIWNIWSSRNKQVLEGA---PKATIEGILRSINKSIGEQVKSKDTNLSRST---------SANQPSQAWSPPLNQDWKLNSDAS
        W    L+ E++  + ++ W IW SRN+ +  G     +     I+  IN +I +      T  S+           + N     WS P    WKLN+DAS
Subjt:  WQFKQLNKEDINAAAILIWNIWSSRNKQVLEGA---PKATIEGILRSINKSIGEQVKSKDTNLSRST---------SANQPSQAWSPPLNQDWKLNSDAS

Query:  WNKNTNEGGLGWIIRDSAGSLICAGHRKISKEWPIKALEALAIVEGLNVYLSMKETTEIRKLIPPIWVESDAAEVIKCINQESEDLS
        W++    GG+GWI+ D  G ++ AG+ KI ++  I ALE + I+ GL  +++M+  +       PI++ESD+ EVI+ + +E  DL+
Subjt:  WNKNTNEGGLGWIIRDSAGSLICAGHRKISKEWPIKALEALAIVEGLNVYLSMKETTEIRKLIPPIWVESDAAEVIKCINQESEDLS

XP_022143535.1 uncharacterized protein LOC111013412 [Momordica charantia] (e value: 2.0e-19; % identity: 33.33)
Query:  KEDINAAAILIWNIWSSRNKQVLEGAPKATIE---GILRSINKSIGEQVK------SKDTNLSRSTSANQPSQAWSPPLNQDWKLNSDASWNKNTNEGGL
        +E+   + I+ W IW  RNK + +G    T +    I R I  S G          +KD +L R    N  +Q W PP +  WKLN++A+W  +TN GG+
Subjt:  KEDINAAAILIWNIWSSRNKQVLEGAPKATIE---GILRSINKSIGEQVK------SKDTNLSRSTSANQPSQAWSPPLNQDWKLNSDASWNKNTNEGGL

Query:  GWIIRDSAGSLICAGHRKISKEWPIKALEALAIVEGLNVYLSMKETTEIRKLIPPIWVESDAAEVIKCINQESEDLSEANHILIEAADLGRRAG-AAFSF
        GWI+RD  G +I A  R I  E  I  LE +AI EGL            ++   PI +ESD+ E I  ++++ +D +E   +L E   + +     +   
Subjt:  GWIIRDSAGSLICAGHRKISKEWPIKALEALAIVEGLNVYLSMKETTEIRKLIPPIWVESDAAEVIKCINQESEDLSEANHILIEAADLGRRAG-AAFSF

Query:  CPRDCNRVAHSLARAAVSS
          R+ N+VAH LAR A+ +
Subjt:  CPRDCNRVAHSLARAAVSS

XP_022154990.1 uncharacterized protein LOC111022134 isoform X1 [Momordica charantia] (e value: 4.6e-16; % identity: 31.98)
Query:  RKKEETTNHILWQFK-----------------------------------QLNKEDINAAAILIWNIWSSRNKQVLEGAPKATIE---GILRSINKSIGE
        RKKEETT HILW+ K                                   +  +E+   + I+   IW  RNK + +G    T +    I R I  S G+
Subjt:  RKKEETTNHILWQFK-----------------------------------QLNKEDINAAAILIWNIWSSRNKQVLEGAPKATIE---GILRSINKSIGE

Query:  QV----KSKDTNLSRSTSANQPSQAWSPPLNQDWKLNSDASWNKNTNEGGLGWIIRDSAGSLICAGHRKISKEWPIKALEALAIVEGLNVYLSMKETTEI
              KSKD +  R    N  ++ W PP +  WKLN+DA+W  +TN  G+GWI+RD  G +I  G R I  E  I  LE +AI EGL            
Subjt:  QV----KSKDTNLSRSTSANQPSQAWSPPLNQDWKLNSDASWNKNTNEGGLGWIIRDSAGSLICAGHRKISKEWPIKALEALAIVEGLNVYLSMKETTEI

Query:  RKLIPPIWVESDAAEVIKCINQ
        ++   PI +ESD+ E I  +++
Subjt:  RKLIPPIWVESDAAEVIKCINQ

XP_022155262.1 uncharacterized protein LOC111022403 [Momordica charantia] (e value: 6.9e-20; % identity: 32.52)
Query:  EDINAAAILIWNIWSSRNKQVLEGAPKATIEGILRSINKSIGEQVKSKDTNLSRSTSANQPSQAWSPPLNQDWKLNSDASWNKNTNEGGLGWIIRDSAGS
        ED++   I  W IW+ RN  +  G   ++   +++ + K + E     +T+LS           W PP    W LN+DASW+ +T+ GG+GWIIR   G 
Subjt:  EDINAAAILIWNIWSSRNKQVLEGAPKATIEGILRSINKSIGEQVKSKDTNLSRSTSANQPSQAWSPPLNQDWKLNSDASWNKNTNEGGLGWIIRDSAGS

Query:  LICAGHRKISKEWPIKALEALAIVEGLNVYLSMKETTEIRKLIPPIWVESDAAEVIKCINQESEDLSEANHILIEAADLGRRAG-AAFSFCPRDCNRVAH
        ++ AG+R +     +K LEA AI+EGL    ++        ++ P+ +E+D+AEV   +N++ EDL++   ++ E  +L       AF+   R+ N  AH
Subjt:  LICAGHRKISKEWPIKALEALAIVEGLNVYLSMKETTEIRKLIPPIWVESDAAEVIKCINQESEDLSEANHILIEAADLGRRAG-AAFSFCPRDCNRVAH

Query:  SLARAA
        SLA+ A
Subjt:  SLARAA

XP_022156777.1 uncharacterized protein LOC111023608 [Momordica charantia] (e value: 2.0e-19; % identity: 33.33)
Query:  KEDINAAAILIWNIWSSRNKQVLEGAPKATIE---GILRSINKSIGEQVK------SKDTNLSRSTSANQPSQAWSPPLNQDWKLNSDASWNKNTNEGGL
        +E+   + I+ W IW  RNK + +G    T +    I R I  S G          +KD +L R    N  ++ W PP +  WKLN+DA+W  +TN GG+
Subjt:  KEDINAAAILIWNIWSSRNKQVLEGAPKATIE---GILRSINKSIGEQVK------SKDTNLSRSTSANQPSQAWSPPLNQDWKLNSDASWNKNTNEGGL

Query:  GWIIRDSAGSLICAGHRKISKEWPIKALEALAIVEGLNVYLSMKETTEIRKLIPPIWVESDAAEVIKCINQESEDLSEANHILIEAADLGRRAG-AAFSF
        GWI+RD  G +I A  R I  E  I  LE +AI EGL            ++   PI +ESD+ E I  ++++ +D +E   +L E   +       +   
Subjt:  GWIIRDSAGSLICAGHRKISKEWPIKALEALAIVEGLNVYLSMKETTEIRKLIPPIWVESDAAEVIKCINQESEDLSEANHILIEAADLGRRAG-AAFSF

Query:  CPRDCNRVAHSLARAAVSS
          R+ N+VAH LAR A+ +
Subjt:  CPRDCNRVAHSLARAAVSS

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CP26 uncharacterized protein LOC111013412 (e value: 9.7e-20; % identity: 33.33)
Query:  KEDINAAAILIWNIWSSRNKQVLEGAPKATIE---GILRSINKSIGEQVK------SKDTNLSRSTSANQPSQAWSPPLNQDWKLNSDASWNKNTNEGGL
        +E+   + I+ W IW  RNK + +G    T +    I R I  S G          +KD +L R    N  +Q W PP +  WKLN++A+W  +TN GG+
Subjt:  KEDINAAAILIWNIWSSRNKQVLEGAPKATIE---GILRSINKSIGEQVK------SKDTNLSRSTSANQPSQAWSPPLNQDWKLNSDASWNKNTNEGGL

Query:  GWIIRDSAGSLICAGHRKISKEWPIKALEALAIVEGLNVYLSMKETTEIRKLIPPIWVESDAAEVIKCINQESEDLSEANHILIEAADLGRRAG-AAFSF
        GWI+RD  G +I A  R I  E  I  LE +AI EGL            ++   PI +ESD+ E I  ++++ +D +E   +L E   + +     +   
Subjt:  GWIIRDSAGSLICAGHRKISKEWPIKALEALAIVEGLNVYLSMKETTEIRKLIPPIWVESDAAEVIKCINQESEDLSEANHILIEAADLGRRAG-AAFSF

Query:  CPRDCNRVAHSLARAAVSS
          R+ N+VAH LAR A+ +
Subjt:  CPRDCNRVAHSLARAAVSS

A0A6J1CQG0 uncharacterized protein LOC111013216 (e value: 5.3e-18; % identity: 31.02)
Query:  WQFKQLNKEDINAAAILIWNIWSSRNKQVLEGA---PKATIEGILRSINKSIGEQVKSKDTNLSRST---------SANQPSQAWSPPLNQDWKLNSDAS
        W    L+ E++  + ++ W IW SRN+ +  G     +     I+  IN +I +      T  S+           + N     WS P    WKLN+DAS
Subjt:  WQFKQLNKEDINAAAILIWNIWSSRNKQVLEGA---PKATIEGILRSINKSIGEQVKSKDTNLSRST---------SANQPSQAWSPPLNQDWKLNSDAS

Query:  WNKNTNEGGLGWIIRDSAGSLICAGHRKISKEWPIKALEALAIVEGLNVYLSMKETTEIRKLIPPIWVESDAAEVIKCINQESEDLS
        W++    GG+GWI+ D  G ++ AG+ KI ++  I ALE + I+ GL  +++M+  +       PI++ESD+ EVI+ + +E  DL+
Subjt:  WNKNTNEGGLGWIIRDSAGSLICAGHRKISKEWPIKALEALAIVEGLNVYLSMKETTEIRKLIPPIWVESDAAEVIKCINQESEDLS

A0A6J1DL64 uncharacterized protein LOC111022134 isoform X1 (e value: 2.2e-16; % identity: 31.98)
Query:  RKKEETTNHILWQFK-----------------------------------QLNKEDINAAAILIWNIWSSRNKQVLEGAPKATIE---GILRSINKSIGE
        RKKEETT HILW+ K                                   +  +E+   + I+   IW  RNK + +G    T +    I R I  S G+
Subjt:  RKKEETTNHILWQFK-----------------------------------QLNKEDINAAAILIWNIWSSRNKQVLEGAPKATIE---GILRSINKSIGE

Query:  QV----KSKDTNLSRSTSANQPSQAWSPPLNQDWKLNSDASWNKNTNEGGLGWIIRDSAGSLICAGHRKISKEWPIKALEALAIVEGLNVYLSMKETTEI
              KSKD +  R    N  ++ W PP +  WKLN+DA+W  +TN  G+GWI+RD  G +I  G R I  E  I  LE +AI EGL            
Subjt:  QV----KSKDTNLSRSTSANQPSQAWSPPLNQDWKLNSDASWNKNTNEGGLGWIIRDSAGSLICAGHRKISKEWPIKALEALAIVEGLNVYLSMKETTEI

Query:  RKLIPPIWVESDAAEVIKCINQ
        ++   PI +ESD+ E I  +++
Subjt:  RKLIPPIWVESDAAEVIKCINQ

A0A6J1DNV9 uncharacterized protein LOC111022403 (e value: 3.3e-20; % identity: 32.52)
Query:  EDINAAAILIWNIWSSRNKQVLEGAPKATIEGILRSINKSIGEQVKSKDTNLSRSTSANQPSQAWSPPLNQDWKLNSDASWNKNTNEGGLGWIIRDSAGS
        ED++   I  W IW+ RN  +  G   ++   +++ + K + E     +T+LS           W PP    W LN+DASW+ +T+ GG+GWIIR   G 
Subjt:  EDINAAAILIWNIWSSRNKQVLEGAPKATIEGILRSINKSIGEQVKSKDTNLSRSTSANQPSQAWSPPLNQDWKLNSDASWNKNTNEGGLGWIIRDSAGS

Query:  LICAGHRKISKEWPIKALEALAIVEGLNVYLSMKETTEIRKLIPPIWVESDAAEVIKCINQESEDLSEANHILIEAADLGRRAG-AAFSFCPRDCNRVAH
        ++ AG+R +     +K LEA AI+EGL    ++        ++ P+ +E+D+AEV   +N++ EDL++   ++ E  +L       AF+   R+ N  AH
Subjt:  LICAGHRKISKEWPIKALEALAIVEGLNVYLSMKETTEIRKLIPPIWVESDAAEVIKCINQESEDLSEANHILIEAADLGRRAG-AAFSFCPRDCNRVAH

Query:  SLARAA
        SLA+ A
Subjt:  SLARAA

A0A6J1DSV1 uncharacterized protein LOC111023608 (e value: 9.7e-20; % identity: 33.33)
Query:  KEDINAAAILIWNIWSSRNKQVLEGAPKATIE---GILRSINKSIGEQVK------SKDTNLSRSTSANQPSQAWSPPLNQDWKLNSDASWNKNTNEGGL
        +E+   + I+ W IW  RNK + +G    T +    I R I  S G          +KD +L R    N  ++ W PP +  WKLN+DA+W  +TN GG+
Subjt:  KEDINAAAILIWNIWSSRNKQVLEGAPKATIE---GILRSINKSIGEQVK------SKDTNLSRSTSANQPSQAWSPPLNQDWKLNSDASWNKNTNEGGL

Query:  GWIIRDSAGSLICAGHRKISKEWPIKALEALAIVEGLNVYLSMKETTEIRKLIPPIWVESDAAEVIKCINQESEDLSEANHILIEAADLGRRAG-AAFSF
        GWI+RD  G +I A  R I  E  I  LE +AI EGL            ++   PI +ESD+ E I  ++++ +D +E   +L E   +       +   
Subjt:  GWIIRDSAGSLICAGHRKISKEWPIKALEALAIVEGLNVYLSMKETTEIRKLIPPIWVESDAAEVIKCINQESEDLSEANHILIEAADLGRRAG-AAFSF

Query:  CPRDCNRVAHSLARAAVSS
          R+ N+VAH LAR A+ +
Subjt:  CPRDCNRVAHSLARAAVSS

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 1.5e-09; % identity: 24.66)
Query:  NAAAILIWNIWSSRNKQVLEGAPKATIEGILRSINKSIGEQVKSKDTNLSRSTSANQP------SQAWSPPLNQDWKLNSDASWNKNTNEGGLGWIIRDS
        N    L+W +W SRN+ + +G      E + R++     E  +   T       A+ P      S  W  P  Q  K N+DA+W       G+GWI+R+ 
Subjt:  NAAAILIWNIWSSRNKQVLEGAPKATIEGILRSINKSIGEQVKSKDTNLSRSTSANQP------SQAWSPPLNQDWKLNSDASWNKNTNEGGLGWIIRDS

Query:  AGSLICAGHRKISKEWPIKALEALAIVEGLNVYLSMKETTEIRKLIPPIWVESDAAEVIKCINQES---------EDLSEANHILIEAADLGRRAGAAFS
        +G ++  G R + +   +   E    +E L   +        +++I     ESDA  ++  +N +          ED+ +  H   E           F 
Subjt:  AGSLICAGHRKISKEWPIKALEALAIVEGLNVYLSMKETTEIRKLIPPIWVESDAAEVIKCINQES---------EDLSEANHILIEAADLGRRAGAAFS

Query:  FCPRDCNRVAHSLARAAVS
        F PR  N+VA  +AR ++S
Subjt:  FCPRDCNRVAHSLARAAVS

AT3G25270.1 Ribonuclease H-like superfamily protein (e value: 4.3e-04; % identity: 23.93)
Query:  NAAAILIWNIWSSRNKQVLEGAPKATIEGILRSIN------------KSIGEQVKSKDTNLSRSTSANQPSQAWSPPLNQDWKLNSDASWNKNTNEGGLG
        N A  ++W +W SRN+ V +    +    + R+ N            +S+ +QV S     SR          W  P +   K N D ++N  T     G
Subjt:  NAAAILIWNIWSSRNKQVLEGAPKATIEGILRSIN------------KSIGEQVKSKDTNLSRSTSANQPSQAWSPPLNQDWKLNSDASWNKNTNEGGLG

Query:  WIIRDSAGSLICAGHRKISKEWPIKALEALAIVEGLNVYLSMKETTEIRKLIPPIWVESDAAEVIKCINQESEDLSEANHILIEAADLGRRAGAAFSFCP
        W++RD  G  + +G    S        E  A++    + +    +   RK+I     E D+ +V + +N E  +    N I        R   A F + P
Subjt:  WIIRDSAGSLICAGHRKISKEWPIKALEALAIVEGLNVYLSMKETTEIRKLIPPIWVESDAAEVIKCINQESEDLSEANHILIEAADLGRRAGAAFSFCP

Query:  RDCNRVAHSLARAAVSSPSPSVIFANFSSSFRAS
        R  N+ A  LA+  +  P+ S  F  +  +F  S
Subjt:  RDCNRVAHSLARAAVSSPSPSVIFANFSSSFRAS

AT4G29090.1 Ribonuclease H-like superfamily protein (e value: 2.0e-09; % identity: 25.87)
Query:  LIWNIWSSRNKQVLEGAPKATIEGILRSINKSIGE-QVKSKDTNLSRSTSANQPS-QAWSPPLNQDWKLNSDASWNKNTNEGGLGWIIRDSAGSLICAGH
        L+W +W +RN+ V  G  +   + +LR     + E +++++  +       N+ S   W PP +Q  K N+DA+WN++    G+GW++R+  G +   G 
Subjt:  LIWNIWSSRNKQVLEGAPKATIEGILRSINKSIGE-QVKSKDTNLSRSTSANQPS-QAWSPPLNQDWKLNSDASWNKNTNEGGLGWIIRDSAGSLICAGH

Query:  RKISKEWPIKALEALAIVEGLNVYLSMKETTEIRKLIPPIWVESDAAEVIKCINQESEDLSEANHILIEAADLGRRAGAAFSFCPRDCNRVAHSLARAAV
        R + K   +   E  A+   +   LS+      R     +  ESD+  +I+ +N +    S    I      L +     F F PR+ N +A  +AR ++
Subjt:  RKISKEWPIKALEALAIVEGLNVYLSMKETTEIRKLIPPIWVESDAAEVIKCINQESEDLSEANHILIEAADLGRRAGAAFSFCPRDCNRVAHSLARAAV

Query:  S
        S
Subjt:  S

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 1.1e-07; % identity: 23.81)
Query:  LIWNIWSSRNKQVLEGAP---KATIEGILRSINKSIGEQVKSKDTNLSRSTSANQPSQAWSPPLNQDWKLNSDASWNKNTNEGGLGWIIRDSAGSLICAG
        L+W IW S N  V        + T+E  L    + +   + ++  N +R+   ++ ++ WSPP     K N DAS ++     GLGWI+R+S G++I  G
Subjt:  LIWNIWSSRNKQVLEGAP---KATIEGILRSINKSIGEQVKSKDTNLSRSTSANQPSQAWSPPLNQDWKLNSDASWNKNTNEGGLGWIIRDSAGSLICAG

Query:  HRKISKEWPIKALEALAIVEGLNVYLSMKETTEIRKLIPPIWVESDAAEVIKCINQESEDLSEANHILIEAADLGRRAGAAFSFCPRDCNRVAHSLARAA
          K       +  E   ++  +            +K+I     E D   + + IN +S +    + +    + +       FSF  R+ N  A  LA+ A
Subjt:  HRKISKEWPIKALEALAIVEGLNVYLSMKETTEIRKLIPPIWVESDAAEVIKCINQESEDLSEANHILIEAADLGRRAGAAFSFCPRDCNRVAHSLARAA

Query:  VSSPSPSVIF
        +   +   +F
Subjt:  VSSPSPSVIF


Sequences
CDS sequence
ATGATATTATTCCTACTAAGGAAGAAAGAGGAAACTACAAACCACATTCTTTGGCAGTTCAAGCAGCTCAATAAAGAAGATATTAACGCAGCAGCCATTTTAATATGGAA
TATTTGGAGCAGTCGGAACAAGCAAGTCTTAGAAGGAGCTCCCAAAGCTACCATAGAAGGCATTCTCAGAAGTATCAACAAATCAATTGGAGAGCAAGTAAAGTCCAAGG
ATACAAACCTTAGTAGAAGCACATCAGCGAACCAGCCAAGTCAGGCTTGGAGTCCGCCGCTAAATCAAGATTGGAAGCTGAATTCCGACGCCTCGTGGAACAAAAATACC
AACGAAGGTGGATTAGGGTGGATAATTCGTGACTCTGCAGGGTCCTTGATCTGTGCAGGCCATCGAAAAATCAGCAAAGAATGGCCAATTAAAGCCCTAGAAGCTTTGGC
AATTGTTGAAGGCCTGAATGTTTACCTGTCGATGAAGGAGACAACTGAAATCCGAAAGCTGATTCCACCAATTTGGGTTGAATCTGACGCCGCAGAAGTGATCAAATGCA
TAAATCAGGAGAGCGAAGATCTTTCGGAGGCAAATCACATCTTGATTGAAGCGGCCGATCTTGGTCGCAGGGCAGGGGCGGCTTTCTCTTTCTGTCCTAGAGATTGCAAT
CGCGTGGCTCACTCTCTTGCTCGAGCCGCTGTCTCGTCCCCTTCCCCTAGCGTTATTTTTGCTAATTTTAGTTCCTCTTTTCGAGCCAGCTTCGGACATTCCCCATCTTC
TTCTCCCGAAGATGCTATTTTTTGGAGGGATACGTGTGTTCCCAACTGGCTTTCCTCTGTTATTAATGAGGATTTTGTATACCTACCTTAG
mRNA sequence
ATGATATTATTCCTACTAAGGAAGAAAGAGGAAACTACAAACCACATTCTTTGGCAGTTCAAGCAGCTCAATAAAGAAGATATTAACGCAGCAGCCATTTTAATATGGAA
TATTTGGAGCAGTCGGAACAAGCAAGTCTTAGAAGGAGCTCCCAAAGCTACCATAGAAGGCATTCTCAGAAGTATCAACAAATCAATTGGAGAGCAAGTAAAGTCCAAGG
ATACAAACCTTAGTAGAAGCACATCAGCGAACCAGCCAAGTCAGGCTTGGAGTCCGCCGCTAAATCAAGATTGGAAGCTGAATTCCGACGCCTCGTGGAACAAAAATACC
AACGAAGGTGGATTAGGGTGGATAATTCGTGACTCTGCAGGGTCCTTGATCTGTGCAGGCCATCGAAAAATCAGCAAAGAATGGCCAATTAAAGCCCTAGAAGCTTTGGC
AATTGTTGAAGGCCTGAATGTTTACCTGTCGATGAAGGAGACAACTGAAATCCGAAAGCTGATTCCACCAATTTGGGTTGAATCTGACGCCGCAGAAGTGATCAAATGCA
TAAATCAGGAGAGCGAAGATCTTTCGGAGGCAAATCACATCTTGATTGAAGCGGCCGATCTTGGTCGCAGGGCAGGGGCGGCTTTCTCTTTCTGTCCTAGAGATTGCAAT
CGCGTGGCTCACTCTCTTGCTCGAGCCGCTGTCTCGTCCCCTTCCCCTAGCGTTATTTTTGCTAATTTTAGTTCCTCTTTTCGAGCCAGCTTCGGACATTCCCCATCTTC
TTCTCCCGAAGATGCTATTTTTTGGAGGGATACGTGTGTTCCCAACTGGCTTTCCTCTGTTATTAATGAGGATTTTGTATACCTACCTTAG
Protein sequence
MILFLLRKKEETTNHILWQFKQLNKEDINAAAILIWNIWSSRNKQVLEGAPKATIEGILRSINKSIGEQVKSKDTNLSRSTSANQPSQAWSPPLNQDWKLNSDASWNKNT
NEGGLGWIIRDSAGSLICAGHRKISKEWPIKALEALAIVEGLNVYLSMKETTEIRKLIPPIWVESDAAEVIKCINQESEDLSEANHILIEAADLGRRAGAAFSFCPRDCN
RVAHSLARAAVSSPSPSVIFANFSSSFRASFGHSPSSSPEDAIFWRDTCVPNWLSSVINEDFVYLP