; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg018227 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg018227
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRNase H domain-containing protein
Genome locationscaffold3:17053710..17061655
RNA-Seq ExpressionSpg018227
SyntenySpg018227
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_019102973.1 PREDICTED: uncharacterized protein LOC109133719 [Beta vulgaris subsp. vulgaris]1.3e-0936.96Show/hide
Query:  MHTDATFLDATTKCGIGIVLRTKGGVLKAAQNHSISGCHSPLGAEALAILTGLQLAKGLKVRRLTVLSDCSNLIKSINGQLQGQSSISTTLWDIKAIQAS
        +++DAT  +   KCG+G+  R   G +  A    I G  SP  AEA AIL GLQ A  + ++ L V SDC N+IK +NG  + ++     + DI A+  S
Subjt:  MHTDATFLDATTKCGIGIVLRTKGGVLKAAQNHSISGCHSPLGAEALAILTGLQLAKGLKVRRLTVLSDCSNLIKSINGQLQGQSSISTTLWDIKAIQAS

Query:  FEIVNFNFVSRRYNMFAHNLASEGYSAPPC-LWLGAFP
        F   +F+F  R YN  AH++A    S     +WL   P
Subjt:  FEIVNFNFVSRRYNMFAHNLASEGYSAPPC-LWLGAFP

XP_027067423.1 uncharacterized protein LOC113693035 [Coffea arabica]1.3e-0929.94Show/hide
Query:  MHTDATFLDATTKCGIGIVLRTKGGVLKAAQNHSISGCHSPLGAEALAILTGLQLAKGLKVRRLTVLSDCSNLIKSINGQLQGQSSISTTLWDIKAIQAS
        ++TDA   + + + G GI+ R   G L  A+        + +  EALA+   L +AK +   ++ V SDC +++  IN       SI+T L DI+ ++  
Subjt:  MHTDATFLDATTKCGIGIVLRTKGGVLKAAQNHSISGCHSPLGAEALAILTGLQLAKGLKVRRLTVLSDCSNLIKSINGQLQGQSSISTTLWDIKAIQAS

Query:  FEIVNFNFVSRRYNMFAHNLASEGYSAPPCL-WLGAFPEWMERMTVKESHLYVPFAS
        F+  NF+FV R  N  +H LA         + W   FP W+  M  K+     PF +
Subjt:  FEIVNFNFVSRRYNMFAHNLASEGYSAPPCL-WLGAFPEWMERMTVKESHLYVPFAS

XP_027096164.1 uncharacterized protein LOC113716063 [Coffea arabica]5.7e-1027.43Show/hide
Query:  ARSGLRTSYLGHLRNGNWREGFDDNPSLHRMLEQCA-IALKGWGDYFTEYWEANPARGSLVQSEDEVIQILSEGEEVIMHTDATFLDATTKCGIGIVLRT
        AR  L  +   H+     +  FD   S H+     A  A + W ++  E      +R +  +S  +        + V ++TDA       + G GI+ R 
Subjt:  ARSGLRTSYLGHLRNGNWREGFDDNPSLHRMLEQCA-IALKGWGDYFTEYWEANPARGSLVQSEDEVIQILSEGEEVIMHTDATFLDATTKCGIGIVLRT

Query:  -KGGVLKAAQN-HSISGCHSPLGAEALAILTGLQLAKGLKVRRLTVLSDCSNLIKSINGQLQGQSSISTTLWDIKAIQASFEIVNFNFVSRRYNMFAHNL
         KG +L+A  N H   G  S    EALAI   L +AK    +++ V SDC ++++ IN   + + +I+T L D++ ++  F+  +F F+SR  N+ +H L
Subjt:  -KGGVLKAAQN-HSISGCHSPLGAEALAILTGLQLAKGLKVRRLTVLSDCSNLIKSINGQLQGQSSISTTLWDIKAIQASFEIVNFNFVSRRYNMFAHNL

Query:  ASEGYSAPPCL-WLGAFPEWMERMTVKESHLYVPFAS
        A         + W   FP W+  + +KE  +  PF +
Subjt:  ASEGYSAPPCL-WLGAFPEWMERMTVKESHLYVPFAS

XP_027152612.1 uncharacterized protein LOC113752740 [Coffea eugenioides]1.9e-1032.91Show/hide
Query:  LSEGEEVIMHTDATFLDATTKCGIGIVLRTKGGVLKAAQNHSISGCHSPLGAEALAILTGLQLAKGLKVRRLTVLSDCSNLIKSINGQLQGQSSISTTLW
        L E    IM+TDA    A TK G+GI+ +   G + A  +   SG       EA+AI T L  A    +  L +LSDC  ++  IN   QG +S+   + 
Subjt:  LSEGEEVIMHTDATFLDATTKCGIGIVLRTKGGVLKAAQNHSISGCHSPLGAEALAILTGLQLAKGLKVRRLTVLSDCSNLIKSINGQLQGQSSISTTLW

Query:  DIKAIQASFEIVNFNFVSRRYNMFAHNLASEGYS-APPCLWLGAFPEWMERMTVKESH
        DI+ +  SF   +F  + R  N+ +H LA    S      W  +FP W+     KE+H
Subjt:  DIKAIQASFEIVNFNFVSRRYNMFAHNLASEGYS-APPCLWLGAFPEWMERMTVKESH

XP_040953658.1 uncharacterized protein LOC121219476 [Gossypium hirsutum]7.4e-1030.54Show/hide
Query:  DYFTEYWEANPARGSLVQSEDEVIQILSEGEEVIMHTDATFLDATTKCGIGIVLRTKGGVLKAAQNHSISGCHSPLGAEALAILTGLQLAKGLKVRRLTV
        DY TE   A    G +     E  Q +S     I+H D TF     +   G++   + GVL A +  + S   +P  AEA A L  ++L   L V R+ V
Subjt:  DYFTEYWEANPARGSLVQSEDEVIQILSEGEEVIMHTDATFLDATTKCGIGIVLRTKGGVLKAAQNHSISGCHSPLGAEALAILTGLQLAKGLKVRRLTV

Query:  LSDCSNLIKSINGQLQGQSSISTTLWDIKAIQASFEIVNFNFVSRRYNMFAHNLASEGYSAPPCLWL
        + D   +IK    +   +S I   + DI+    SF+ + F+F+ +  N++AH LA E      CL+L
Subjt:  LSDCSNLIKSINGQLQGQSSISTTLWDIKAIQASFEIVNFNFVSRRYNMFAHNLASEGYSAPPCLWL

TrEMBL top hitse value%identityAlignment
A0A2Z6P0X3 Uncharacterized protein1.4e-0933.07Show/hide
Query:  LDATTKCGIGIVLRTKGGVLKAAQNHSISGCHSPLGAEALAILTGLQLAKGLKVRRLTVLSDCSNLIKSING-QLQGQSSISTTLWDIKAIQASFEIVNF
        L    + G+G+V R + G + A     + G + P  AEA AI   ++LA     +++ + SDC+N+I+ IN  Q++ +S +   +W I   + +F I  F
Subjt:  LDATTKCGIGIVLRTKGGVLKAAQNHSISGCHSPLGAEALAILTGLQLAKGLKVRRLTVLSDCSNLIKSING-QLQGQSSISTTLWDIKAIQASFEIVNF

Query:  NFVSRRYNMFAHNLASEGYSAPPCLWL
        N +SR+ N  AH LA   +S P  +WL
Subjt:  NFVSRRYNMFAHNLASEGYSAPPCLWL

A0A6P6SNG7 uncharacterized protein LOC1136930341.0e-0929.56Show/hide
Query:  MHTDATFLDATTKCGIGIVLRTKGGVLKAAQNHSISGCHSPLGAEALAILTGLQLAKGLKVRRLTVLSDCSNLIKSINGQLQGQSSISTTLWDIKAIQAS
        ++TDA   + + + G GI+ R   G L  A+        + +  EALA+   L +AK +   ++ V SDC +++  IN       SI+T L DI+ ++  
Subjt:  MHTDATFLDATTKCGIGIVLRTKGGVLKAAQNHSISGCHSPLGAEALAILTGLQLAKGLKVRRLTVLSDCSNLIKSINGQLQGQSSISTTLWDIKAIQAS

Query:  FEIVNFNFVSRRYNMFAHNLASEGYSAPPCL---WLGAFPEWMERMTVKESHLYVPFAS
        F+  NF+FV R  N  +H LA         +   W   FP W+  M  K+     PF +
Subjt:  FEIVNFNFVSRRYNMFAHNLASEGYSAPPCL---WLGAFPEWMERMTVKESHLYVPFAS

A0A6P6SNH4 uncharacterized protein LOC1136930356.1e-1029.94Show/hide
Query:  MHTDATFLDATTKCGIGIVLRTKGGVLKAAQNHSISGCHSPLGAEALAILTGLQLAKGLKVRRLTVLSDCSNLIKSINGQLQGQSSISTTLWDIKAIQAS
        ++TDA   + + + G GI+ R   G L  A+        + +  EALA+   L +AK +   ++ V SDC +++  IN       SI+T L DI+ ++  
Subjt:  MHTDATFLDATTKCGIGIVLRTKGGVLKAAQNHSISGCHSPLGAEALAILTGLQLAKGLKVRRLTVLSDCSNLIKSINGQLQGQSSISTTLWDIKAIQAS

Query:  FEIVNFNFVSRRYNMFAHNLASEGYSAPPCL-WLGAFPEWMERMTVKESHLYVPFAS
        F+  NF+FV R  N  +H LA         + W   FP W+  M  K+     PF +
Subjt:  FEIVNFNFVSRRYNMFAHNLASEGYSAPPCL-WLGAFPEWMERMTVKESHLYVPFAS

A0A6P6TGA3 uncharacterized protein LOC1137008274.0e-0930.19Show/hide
Query:  VIMHTDATFLDATTKCGIGIVLRTKGGVLKAAQNHSISGCHSPLGAEALAILTGLQLAKGLKVRRLTVLSDCSNLIKSINGQLQGQSSISTTLWDIKAIQ
        V ++TDA       + G+GIV R   G L  A+  S          E+LAI + L++A+     ++ V SDC N++ SIN        + T L DI+A++
Subjt:  VIMHTDATFLDATTKCGIGIVLRTKGGVLKAAQNHSISGCHSPLGAEALAILTGLQLAKGLKVRRLTVLSDCSNLIKSINGQLQGQSSISTTLWDIKAIQ

Query:  ASFEIVNFNFVSRRYNMFAHNLASEGYSAPPCL-WLGAFPEWMERMTVKESHLYVPFAS
         SF+   F+FV R  N  +H +A         + W  +FP W+  +  K+  +  PF +
Subjt:  ASFEIVNFNFVSRRYNMFAHNLASEGYSAPPCL-WLGAFPEWMERMTVKESHLYVPFAS

A0A6P6UZA6 uncharacterized protein LOC1137160632.7e-1027.43Show/hide
Query:  ARSGLRTSYLGHLRNGNWREGFDDNPSLHRMLEQCA-IALKGWGDYFTEYWEANPARGSLVQSEDEVIQILSEGEEVIMHTDATFLDATTKCGIGIVLRT
        AR  L  +   H+     +  FD   S H+     A  A + W ++  E      +R +  +S  +        + V ++TDA       + G GI+ R 
Subjt:  ARSGLRTSYLGHLRNGNWREGFDDNPSLHRMLEQCA-IALKGWGDYFTEYWEANPARGSLVQSEDEVIQILSEGEEVIMHTDATFLDATTKCGIGIVLRT

Query:  -KGGVLKAAQN-HSISGCHSPLGAEALAILTGLQLAKGLKVRRLTVLSDCSNLIKSINGQLQGQSSISTTLWDIKAIQASFEIVNFNFVSRRYNMFAHNL
         KG +L+A  N H   G  S    EALAI   L +AK    +++ V SDC ++++ IN   + + +I+T L D++ ++  F+  +F F+SR  N+ +H L
Subjt:  -KGGVLKAAQN-HSISGCHSPLGAEALAILTGLQLAKGLKVRRLTVLSDCSNLIKSINGQLQGQSSISTTLWDIKAIQASFEIVNFNFVSRRYNMFAHNL

Query:  ASEGYSAPPCL-WLGAFPEWMERMTVKESHLYVPFAS
        A         + W   FP W+  + +KE  +  PF +
Subjt:  ASEGYSAPPCL-WLGAFPEWMERMTVKESHLYVPFAS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G09490.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein3.1e-0628.8Show/hide
Query:  EEVIMHTDATFLDATTKCGIGIVLRTKGGVLKAAQNHSISGCHSPLGAEALAILTGLQLAKGLKVRRLTVLSDCSNLIKSINGQLQGQSSISTTLWDIKA
        + V + TDA +   T   G G V+R    +       +      PL AEA+A+   LQ A+ + + +L++ SD   LI +I  +    +     ++DI  
Subjt:  EEVIMHTDATFLDATTKCGIGIVLRTKGGVLKAAQNHSISGCHSPLGAEALAILTGLQLAKGLKVRRLTVLSDCSNLIKSINGQLQGQSSISTTLWDIKA

Query:  IQASFEIVNFNFVSRRYNMFAHNLA
        +   F  V+F+FV R  N  A  LA
Subjt:  IQASFEIVNFNFVSRRYNMFAHNLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATACCGGCGGGACTTCAGCTTTCATGCTGCTATATCAGTGGACACAACGGAGCAAGGGTTCTTCATCTTCTACTGTGGGGAGAGAATCCAAATCAGGAACAAGAAGC
ATCTCGCAGCCAACGATTAGCAATTGCAGTTTTGCCTTCGTTGGAGCCAATTTCCCCTGTCGAATTCCCTTACCGTAGACCGTCTGGTATTCGAATCAACGAACCTGCTA
AGGGAGGATTGTTGCAGAGCCAAAACGCCAGGTCGTCGGCTCGTTATTCTCTCTTCGGTGATGATTTGACAAAAGAAAAAGGGAAGGCGAAAGTGGATGATTCAATTTCG
AAGGTCAATTGCGGTCTTCCTACAGCGGTGACGGAAGGGTGGCCGGAAACTGCAAATTGGAGGCCAGAAAGTTCCAGTCGGCGGGCGTTACAAAAGGCTGAGCCGTTGGG
CCTTTTTTCATTAGGCCCTAGAGACGTTACAGAGAAATTAAAAGACGTTGGGGAGACTTCTAGAAAAGATTCTGGGCCATTAAAGCAGCCGTTTCACAACTTTGTGGGTT
TTGGGCCAGAGAAGTACAAAGAGGAATTGCGGGCCAGCAATAGACGGTTATCCAGAACTTTACTATCTATTTTTAATGCTGCCGAAAATCATGCAGTGGGCCACGAGAAT
CCGAAAGCTGAAGGAACCAACGAAGAGGGAGACATGGAGAGTGATCCAGAAGAGGTGGAGGAGGTGGGCTTTCAGTCTCATGAAGGAAGTAATTGGGCTGATCCAAAGGA
GCAGATGGGCTTGCAAGTGGTTGCTGAAAATTGCCAACTAAATTCAATTCCACATGAGAAGGCCAAGAATGGAGGTGAAAATAGTGATGCAGCTAGTTTTATTTTTTCTT
CCAAGAAAATTCAGTCAATAAAGAAAGGAGGAATGTGGAAAAAGCGAGCGCGAGCGGAATTTGTTCCTATTGGTGTTAATGTGGATGCTGGGAAAAAGCTCAAAAGCGGA
AGGATGGGTCGTTGTTGTTCTCTTCGGGGAATATTAAGCGTCCCAAAGTTGACGATGGTGGACAGGCGGGGACTTCTGAGCAGCCTTGCCAATCATTATGAAAACATTAT
GCTGGAATGTGAAGGTGTACATATTACACATGCGGAAGCAATTGAAGATCGAAAGCACTTAAGACATTCAATTTTGCTCCTAAAGCATGCATGTTCTATAAAAGACAGAA
AAGAGAGTTTAGAGAAAACCAACCTTTGTAGCTCACAAATGAGAGAGAATCCATGCAAGAAGAAGCCTCAAGATGCAAGACACATGGTGGTCTTAAGGGTGAAAGTGGAA
GCTAGGTGTTCAAACAACATTACATGCAAGAGTGAATTCCGTCTTGCGAAACTATGTCCCCAGCTATCTATTCGATCTTATCCCCAAAATGGTAGGCTTGTTGAGTCGGC
GATGCTGGCCACTCTCACCCATGCAGATCAAAGGATAATCTCGTATAAACAGGAGTTCATAGCTCGCTCAGGATTAAGAACGAGTTACCTAGGTCATCTCCGAAATGGAA
ATTGGAGAGAGGGTTTTGATGACAATCCGTCGTTGCATCGGATGCTGGAACAATGTGCCATTGCCCTCAAAGGATGGGGGGACTACTTCACGGAATACTGGGAGGCAAAC
CCCGCGAGAGGATCTTTGGTTCAGTCAGAGGATGAGGTGATACAGATTCTTTCTGAAGGGGAGGAAGTCATCATGCATACTGATGCTACCTTTTTAGATGCTACGACCAA
ATGTGGAATTGGCATAGTACTACGTACTAAGGGAGGCGTTTTGAAGGCAGCGCAAAATCACTCTATTTCTGGTTGTCATTCCCCATTGGGGGCTGAAGCACTTGCCATTC
TGACAGGACTTCAATTAGCAAAGGGCTTGAAGGTGCGGCGGTTAACAGTTCTTTCAGATTGTTCGAATCTCATAAAGTCTATCAACGGTCAACTTCAAGGACAGTCAAGT
ATATCTACAACACTATGGGACATTAAAGCGATTCAGGCTTCATTTGAGATTGTAAATTTTAATTTTGTTAGTCGTCGTTATAATATGTTTGCTCATAATCTGGCCAGTGA
AGGCTATTCGGCACCACCATGCTTATGGTTAGGTGCTTTTCCTGAGTGGATGGAAAGGATGACAGTCAAGGAAAGCCATTTGTATGTTCCTTTTGCTTCTTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGATACCGGCGGGACTTCAGCTTTCATGCTGCTATATCAGTGGACACAACGGAGCAAGGGTTCTTCATCTTCTACTGTGGGGAGAGAATCCAAATCAGGAACAAGAAGC
ATCTCGCAGCCAACGATTAGCAATTGCAGTTTTGCCTTCGTTGGAGCCAATTTCCCCTGTCGAATTCCCTTACCGTAGACCGTCTGGTATTCGAATCAACGAACCTGCTA
AGGGAGGATTGTTGCAGAGCCAAAACGCCAGGTCGTCGGCTCGTTATTCTCTCTTCGGTGATGATTTGACAAAAGAAAAAGGGAAGGCGAAAGTGGATGATTCAATTTCG
AAGGTCAATTGCGGTCTTCCTACAGCGGTGACGGAAGGGTGGCCGGAAACTGCAAATTGGAGGCCAGAAAGTTCCAGTCGGCGGGCGTTACAAAAGGCTGAGCCGTTGGG
CCTTTTTTCATTAGGCCCTAGAGACGTTACAGAGAAATTAAAAGACGTTGGGGAGACTTCTAGAAAAGATTCTGGGCCATTAAAGCAGCCGTTTCACAACTTTGTGGGTT
TTGGGCCAGAGAAGTACAAAGAGGAATTGCGGGCCAGCAATAGACGGTTATCCAGAACTTTACTATCTATTTTTAATGCTGCCGAAAATCATGCAGTGGGCCACGAGAAT
CCGAAAGCTGAAGGAACCAACGAAGAGGGAGACATGGAGAGTGATCCAGAAGAGGTGGAGGAGGTGGGCTTTCAGTCTCATGAAGGAAGTAATTGGGCTGATCCAAAGGA
GCAGATGGGCTTGCAAGTGGTTGCTGAAAATTGCCAACTAAATTCAATTCCACATGAGAAGGCCAAGAATGGAGGTGAAAATAGTGATGCAGCTAGTTTTATTTTTTCTT
CCAAGAAAATTCAGTCAATAAAGAAAGGAGGAATGTGGAAAAAGCGAGCGCGAGCGGAATTTGTTCCTATTGGTGTTAATGTGGATGCTGGGAAAAAGCTCAAAAGCGGA
AGGATGGGTCGTTGTTGTTCTCTTCGGGGAATATTAAGCGTCCCAAAGTTGACGATGGTGGACAGGCGGGGACTTCTGAGCAGCCTTGCCAATCATTATGAAAACATTAT
GCTGGAATGTGAAGGTGTACATATTACACATGCGGAAGCAATTGAAGATCGAAAGCACTTAAGACATTCAATTTTGCTCCTAAAGCATGCATGTTCTATAAAAGACAGAA
AAGAGAGTTTAGAGAAAACCAACCTTTGTAGCTCACAAATGAGAGAGAATCCATGCAAGAAGAAGCCTCAAGATGCAAGACACATGGTGGTCTTAAGGGTGAAAGTGGAA
GCTAGGTGTTCAAACAACATTACATGCAAGAGTGAATTCCGTCTTGCGAAACTATGTCCCCAGCTATCTATTCGATCTTATCCCCAAAATGGTAGGCTTGTTGAGTCGGC
GATGCTGGCCACTCTCACCCATGCAGATCAAAGGATAATCTCGTATAAACAGGAGTTCATAGCTCGCTCAGGATTAAGAACGAGTTACCTAGGTCATCTCCGAAATGGAA
ATTGGAGAGAGGGTTTTGATGACAATCCGTCGTTGCATCGGATGCTGGAACAATGTGCCATTGCCCTCAAAGGATGGGGGGACTACTTCACGGAATACTGGGAGGCAAAC
CCCGCGAGAGGATCTTTGGTTCAGTCAGAGGATGAGGTGATACAGATTCTTTCTGAAGGGGAGGAAGTCATCATGCATACTGATGCTACCTTTTTAGATGCTACGACCAA
ATGTGGAATTGGCATAGTACTACGTACTAAGGGAGGCGTTTTGAAGGCAGCGCAAAATCACTCTATTTCTGGTTGTCATTCCCCATTGGGGGCTGAAGCACTTGCCATTC
TGACAGGACTTCAATTAGCAAAGGGCTTGAAGGTGCGGCGGTTAACAGTTCTTTCAGATTGTTCGAATCTCATAAAGTCTATCAACGGTCAACTTCAAGGACAGTCAAGT
ATATCTACAACACTATGGGACATTAAAGCGATTCAGGCTTCATTTGAGATTGTAAATTTTAATTTTGTTAGTCGTCGTTATAATATGTTTGCTCATAATCTGGCCAGTGA
AGGCTATTCGGCACCACCATGCTTATGGTTAGGTGCTTTTCCTGAGTGGATGGAAAGGATGACAGTCAAGGAAAGCCATTTGTATGTTCCTTTTGCTTCTTCTTAG
Protein sequenceShow/hide protein sequence
MIPAGLQLSCCYISGHNGARVLHLLLWGENPNQEQEASRSQRLAIAVLPSLEPISPVEFPYRRPSGIRINEPAKGGLLQSQNARSSARYSLFGDDLTKEKGKAKVDDSIS
KVNCGLPTAVTEGWPETANWRPESSSRRALQKAEPLGLFSLGPRDVTEKLKDVGETSRKDSGPLKQPFHNFVGFGPEKYKEELRASNRRLSRTLLSIFNAAENHAVGHEN
PKAEGTNEEGDMESDPEEVEEVGFQSHEGSNWADPKEQMGLQVVAENCQLNSIPHEKAKNGGENSDAASFIFSSKKIQSIKKGGMWKKRARAEFVPIGVNVDAGKKLKSG
RMGRCCSLRGILSVPKLTMVDRRGLLSSLANHYENIMLECEGVHITHAEAIEDRKHLRHSILLLKHACSIKDRKESLEKTNLCSSQMRENPCKKKPQDARHMVVLRVKVE
ARCSNNITCKSEFRLAKLCPQLSIRSYPQNGRLVESAMLATLTHADQRIISYKQEFIARSGLRTSYLGHLRNGNWREGFDDNPSLHRMLEQCAIALKGWGDYFTEYWEAN
PARGSLVQSEDEVIQILSEGEEVIMHTDATFLDATTKCGIGIVLRTKGGVLKAAQNHSISGCHSPLGAEALAILTGLQLAKGLKVRRLTVLSDCSNLIKSINGQLQGQSS
ISTTLWDIKAIQASFEIVNFNFVSRRYNMFAHNLASEGYSAPPCLWLGAFPEWMERMTVKESHLYVPFASS