CuGenDBv2

Lag0026509 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0026509
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: CCHC-type domain-containing protein
Genome location: chr10:38141640..38143158
RNA-Seq Expression: Lag0026509
Synteny: Lag0026509
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001878 - Zinc finger, CCHC-type
  IPR025558 - Domain of unknown function DUF4283
  IPR025836 - Zinc knuckle CX2CX4HX4C
  IPR040256 - Uncharacterized protein At4g02000-like
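
The zinc knuckle annotation (IPR025836, CX2CX4HX4C) can be illustrated with a plain pattern search. The sketch below is only an approximation: it assumes the motif can be read literally as C, two residues, C, four residues, H, four residues, C, whereas InterPro uses its own signature models. The function name and the regular expression are hypothetical helpers, and the excerpt is copied from the protein sequence of Lag0026509 on this page.

```python
import re

# Literal reading of the motif name CX2CX4HX4C, where X is any residue.
# This is an illustrative simplification, not InterPro's signature model.
ZINC_KNUCKLE = re.compile(r"C.{2}C.{4}H.{4}C")


def find_zinc_knuckles(protein):
    """Return (1-based start, matched text) for each non-overlapping match."""
    return [(m.start() + 1, m.group()) for m in ZINC_KNUCKLE.finditer(protein)]


# Excerpt copied from the protein sequence of Lag0026509.
excerpt = "KLPDFCYGCGHLGHIINDCDEFKG"
hits = find_zinc_knuckles(excerpt)
print(hits)  # [(6, 'CYGCGHLGHIINDC')]
```

The reported start position is relative to the excerpt, not to the full protein. Running the same function on the complete protein sequence would give full-length coordinates.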


Homology
GenBank top hits | e value | %identity | Alignment
TXG54013.1 hypothetical protein EZV62_019269 [Acer yangbiense] | 1.4e-33 | 32.57
Query:  KKLVNALIWNQEHTSIVNVGFNIFLCKFKNARVKGFILEAGPWLYDKAMLLLEEPKGDSYGEDMDFRYVPFWIHFHKLPYACFSRSEAMDIGSLLGVVEQ
        K  ++ +   +   ++  +G NIF  +F+N   +  ILE GPWL+DK +L+L E  G     D+ FRYVPFWI  H LP AC +R   + +G L+G V++
Subjt:  KKLVNALIWNQEHTSIVNVGFNIFLCKFKNARVKGFILEAGPWLYDKAMLLLEEPKGDSYGEDMDFRYVPFWIHFHKLPYACFSRSEAMDIGSLLGVVEQ

Query:  VDLDDDTDPSWGSSLRMKIKIDVNYPLKRGIFLKSGKDGGEKWIAVTYEKLPDFCYGCGHLGHIINDC--DEFKGSSKDELPYGPWMREPVKLKSKENSF
        +D  +  +   G  +R+++ IDV  PLKRG+ +  G D     + + YE+LP+FCY CG +GH++ DC  +  + +S     +GPWMR   + +SK    
Subjt:  VDLDDDTDPSWGSSLRMKIKIDVNYPLKRGIFLKSGKDGGEKWIAVTYEKLPDFCYGCGHLGHIINDC--DEFKGSSKDELPYGPWMREPVKLKSKENSF

Query:  SNRPPISFAGRGRGRMGEG----GRGSWRSAEPEANVSEDSE-----AEKQKLGCLEKKTA
            P      G     E     G   W   +  + +  D E      E +    +E KTA
Subjt:  SNRPPISFAGRGRGRMGEG----GRGSWRSAEPEANVSEDSE-----AEKQKLGCLEKKTA

XP_006485824.1 uncharacterized protein LOC102613298 [Citrus sinensis] | 5.4e-33 | 38.86
Query:  EHTSIVNVGFNIFLCKFKNARVKGFILEAGPWLYDKAMLLLEEPKGDSYGEDMDFRYVPFWIHFHKLPYACFSRSEAMDIGSLLGVVEQVDLDDDTDPSW
        + T + ++G NIFL KF +   K  IL  GPW +D++++++ EP G    +  DF +  FW+  H +P  C  +    +IG  +G V++V+  DD     
Subjt:  EHTSIVNVGFNIFLCKFKNARVKGFILEAGPWLYDKAMLLLEEPKGDSYGEDMDFRYVPFWIHFHKLPYACFSRSEAMDIGSLLGVVEQVDLDDDTDPSW

Query:  GSSLRMKIKIDVNYPLKRGIFLKSGKDGGEKWIAVTYEKLPDFCYGCGHLGHIINDCDEFKGSSKDELPYGPWMR
        GS +R++I ++V  PL + + LK  +DGG+  + + YEKLPDFC+ CG +GH   +C ++KG SKD+L YG WMR
Subjt:  GSSLRMKIKIDVNYPLKRGIFLKSGKDGGEKWIAVTYEKLPDFCYGCGHLGHIINDCDEFKGSSKDELPYGPWMR

XP_015388532.1 uncharacterized protein LOC107178176 [Citrus sinensis] | 1.6e-32 | 28.92
Query:  EHTSIVNVGFNIFLCKFKNARVKGFILEAGPWLYDKAMLLLEEPKGDSYGEDMDFRYVPFWIHFHKLPYACFSRSEAMDIGSLLGVVEQVDLDDDTDPSW
        + T + ++G NIF+ KF +   K  IL  GPW +D++++++ EP G    +  DF + PFW+  H +P  C  +    +IG  +G V++V    D     
Subjt:  EHTSIVNVGFNIFLCKFKNARVKGFILEAGPWLYDKAMLLLEEPKGDSYGEDMDFRYVPFWIHFHKLPYACFSRSEAMDIGSLLGVVEQVDLDDDTDPSW

Query:  GSSLRMKIKIDVNYPLKRGIFLKSGKDGGEKWIAVTYEKLPDFCYGCGHLGHIINDCDEFKGSSKDELPYGPWMREPVK------LKSKENSFSNRPPIS
        GS +R++I ++V  PL + + LK  +DGG+  + + YEKLPDFC+ CG +GH   +C ++KG SKD+L YG WMR   +       + K+    ++ P +
Subjt:  GSSLRMKIKIDVNYPLKRGIFLKSGKDGGEKWIAVTYEKLPDFCYGCGHLGHIINDCDEFKGSSKDELPYGPWMREPVK------LKSKENSFSNRPPIS

Query:  FA--------------GRGR-GRMGEGGRGSWRSAEPEANVSEDSEAEKQKLGCLEKKTAKEVPPSENRPQAPVKKVVPAAAEQSVSSTVQNPNKEKEKI
         A              GR    + G G + +  +    +N   DS+ E+   G L  +  K++  ++   +  VK++          STV+   KE ++ 
Subjt:  FA--------------GRGR-GRMGEGGRGSWRSAEPEANVSEDSEAEKQKLGCLEKKTAKEVPPSENRPQAPVKKVVPAAAEQSVSSTVQNPNKEKEKI

Query:  DVDLINEGIKGVESGKEGKEKSESV
        +  L+ +GI     G+E   K  ++
Subjt:  DVDLINEGIKGVESGKEGKEKSESV

XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia] | 2.7e-32 | 41.04
Query:  SIVNVGFNIFLCKFKNARVKGFILEAGPWLYDKAMLLLEEPKGDSYGEDMDFRYVPFWIHFHKLPYACFSRSEAMDIGSLLGVVEQVDLDDDTDPSWGSS
        S+  +GFNIFL  F  +  +  IL  GPW +D+A+++++ P   +   DMDFR V  W+HF  L  AC +++ A  +G+ +G+ E V+  +  +  WGS 
Subjt:  SIVNVGFNIFLCKFKNARVKGFILEAGPWLYDKAMLLLEEPKGDSYGEDMDFRYVPFWIHFHKLPYACFSRSEAMDIGSLLGVVEQVDLDDDTDPSWGSS

Query:  LRMKIKIDVNYPLKRGIFLKSGKDGGEKWIAVTYEKLPDFCYGCGHLGHIINDC-DEFKGSSKDELPYGPWMR
        LR++++ DV  PL RGI L      G  WI + YE+LPDF Y CG L HI+ DC D    S    L YGPW+R
Subjt:  LRMKIKIDVNYPLKRGIFLKSGKDGGEKWIAVTYEKLPDFCYGCGHLGHIINDC-DEFKGSSKDELPYGPWMR

XP_024035600.1 uncharacterized protein LOC112096408 [Citrus clementina] | 3.5e-32 | 38.29
Query:  EHTSIVNVGFNIFLCKFKNARVKGFILEAGPWLYDKAMLLLEEPKGDSYGEDMDFRYVPFWIHFHKLPYACFSRSEAMDIGSLLGVVEQVDLDDDTDPSW
        + T + ++G NIFL KF +   K  IL  GPW +D++++++ EP G    +  DF +  FW+  H +P  C  +    +IG  +G V++V+  DD     
Subjt:  EHTSIVNVGFNIFLCKFKNARVKGFILEAGPWLYDKAMLLLEEPKGDSYGEDMDFRYVPFWIHFHKLPYACFSRSEAMDIGSLLGVVEQVDLDDDTDPSW

Query:  GSSLRMKIKIDVNYPLKRGIFLKSGKDGGEKWIAVTYEKLPDFCYGCGHLGHIINDCDEFKGSSKDELPYGPWMR
        GS +R++I ++V  PL + + LK  +D G+  + + YEKLPDFC+ CG +GH   +C ++KG SKD+L YG WMR
Subjt:  GSSLRMKIKIDVNYPLKRGIFLKSGKDGGEKWIAVTYEKLPDFCYGCGHLGHIINDCDEFKGSSKDELPYGPWMR

TrEMBL top hits | e value | %identity | Alignment
A0A1S8AC25 CCHC-type domain-containing protein (Fragment) | 6.4e-32 | 36.9
Query:  KLVNALIW-NQEHTSIVNVGFNIFLCKFKNARVKGFILEAGPWLYDKAMLLLEEPKGDSYGEDMDFRYVPFWIHFHKLPYACFSRSEAMDIGSLLGVVEQ
        K+    +W       I  +G N+F+ KF +   K  I+  GPW +D+A++ L EP G    +  DF +V FW+  H +P  C S+  A ++G ++G VE+
Subjt:  KLVNALIW-NQEHTSIVNVGFNIFLCKFKNARVKGFILEAGPWLYDKAMLLLEEPKGDSYGEDMDFRYVPFWIHFHKLPYACFSRSEAMDIGSLLGVVEQ

Query:  VDLDDDTDPSWGSSLRMKIKIDVNYPLKRGIFL-KSGKDGGEKWIAVTYEKLPDFCYGCGHLGHIINDCDEFKGSSKDELPYGPWMR
        V+  D     +G  LR++I +D+  PLK+ I L +  +D  +  + V YE+LPDFC+ CG +GH   +C  +K  SKDEL YGPW++
Subjt:  VDLDDDTDPSWGSSLRMKIKIDVNYPLKRGIFL-KSGKDGGEKWIAVTYEKLPDFCYGCGHLGHIINDCDEFKGSSKDELPYGPWMR

A0A5C7H9Y2 CCHC-type domain-containing protein | 6.8e-34 | 32.57
Query:  KKLVNALIWNQEHTSIVNVGFNIFLCKFKNARVKGFILEAGPWLYDKAMLLLEEPKGDSYGEDMDFRYVPFWIHFHKLPYACFSRSEAMDIGSLLGVVEQ
        K  ++ +   +   ++  +G NIF  +F+N   +  ILE GPWL+DK +L+L E  G     D+ FRYVPFWI  H LP AC +R   + +G L+G V++
Subjt:  KKLVNALIWNQEHTSIVNVGFNIFLCKFKNARVKGFILEAGPWLYDKAMLLLEEPKGDSYGEDMDFRYVPFWIHFHKLPYACFSRSEAMDIGSLLGVVEQ

Query:  VDLDDDTDPSWGSSLRMKIKIDVNYPLKRGIFLKSGKDGGEKWIAVTYEKLPDFCYGCGHLGHIINDC--DEFKGSSKDELPYGPWMREPVKLKSKENSF
        +D  +  +   G  +R+++ IDV  PLKRG+ +  G D     + + YE+LP+FCY CG +GH++ DC  +  + +S     +GPWMR   + +SK    
Subjt:  VDLDDDTDPSWGSSLRMKIKIDVNYPLKRGIFLKSGKDGGEKWIAVTYEKLPDFCYGCGHLGHIINDC--DEFKGSSKDELPYGPWMREPVKLKSKENSF

Query:  SNRPPISFAGRGRGRMGEG----GRGSWRSAEPEANVSEDSE-----AEKQKLGCLEKKTA
            P      G     E     G   W   +  + +  D E      E +    +E KTA
Subjt:  SNRPPISFAGRGRGRMGEG----GRGSWRSAEPEANVSEDSE-----AEKQKLGCLEKKTA

A0A6J1BSZ1 uncharacterized protein LOC111005481 | 1.3e-32 | 41.04
Query:  SIVNVGFNIFLCKFKNARVKGFILEAGPWLYDKAMLLLEEPKGDSYGEDMDFRYVPFWIHFHKLPYACFSRSEAMDIGSLLGVVEQVDLDDDTDPSWGSS
        S+  +GFNIFL  F  +  +  IL  GPW +D+A+++++ P   +   DMDFR V  W+HF  L  AC +++ A  +G+ +G+ E V+  +  +  WGS 
Subjt:  SIVNVGFNIFLCKFKNARVKGFILEAGPWLYDKAMLLLEEPKGDSYGEDMDFRYVPFWIHFHKLPYACFSRSEAMDIGSLLGVVEQVDLDDDTDPSWGSS

Query:  LRMKIKIDVNYPLKRGIFLKSGKDGGEKWIAVTYEKLPDFCYGCGHLGHIINDC-DEFKGSSKDELPYGPWMR
        LR++++ DV  PL RGI L      G  WI + YE+LPDF Y CG L HI+ DC D    S    L YGPW+R
Subjt:  LRMKIKIDVNYPLKRGIFLKSGKDGGEKWIAVTYEKLPDFCYGCGHLGHIINDC-DEFKGSSKDELPYGPWMR

A0A6J1D765 uncharacterized protein LOC111017902 | 6.4e-32 | 32.9
Query:  NDETEEITQQLADLKVTAAEKTSIFQLK---EDAIDQSEKKLVNALIWN-QEHTSIVNVGFNIFLCKFKNARVKGFILEAGPWLYDKAMLLLEEPKGDSY
        NDE E +T       +TA         K      I     + V   +W     T    +G NI++  FK+   K  +L +GPW ++K++L+L  P   + 
Subjt:  NDETEEITQQLADLKVTAAEKTSIFQLK---EDAIDQSEKKLVNALIWN-QEHTSIVNVGFNIFLCKFKNARVKGFILEAGPWLYDKAMLLLEEPKGDSY

Query:  GEDMDFRYVPFWIHFHKLPYACFSRSEAMDIGSLLGVVEQVDLDDDTDPSWGSSLRMKIKIDVNYPLKRGIFLKSGKDGGEKWIAVTYEKLPDFCYGCGH
          DM+F +  FWI  H +P+ C S   A  +G+ LG VE+++  D  D   G  +R+++KIDV+ PL+RGI LK+  DG + W  + YEKLPDFCY CG 
Subjt:  GEDMDFRYVPFWIHFHKLPYACFSRSEAMDIGSLLGVVEQVDLDDDTDPSWGSSLRMKIKIDVNYPLKRGIFLKSGKDGGEKWIAVTYEKLPDFCYGCGH

Query:  LGHIINDCDEFKGSSKDELP--YGPWMREPVKLKSKENSFSNRPPISFAGR-GRGRM---GEGGRGSWRSAE--------PEANVSEDSEAEKQKLGCLE
        +GH   +C++         P  YG W+R    L  K  S          GR GRG     G GGRG WR  +        PE++     E    ++   E
Subjt:  LGHIINDCDEFKGSSKDELP--YGPWMREPVKLKSKENSFSNRPPISFAGR-GRGRM---GEGGRGSWRSAE--------PEANVSEDSEAEKQKLGCLE

Query:  KKTAKEV
          TA E+
Subjt:  KKTAKEV

A0A6J1DX30 uncharacterized protein LOC111024874 | 7.1e-31 | 33.97
Query:  WNQEHTS--IVNVGFNIFLCKFKNARVKGFILEAGPWLYDKAMLLLEEPKGDSYGEDMDFRYVPFWIHFHKLPYACFSRSEAMDIGSLLGVVEQVDLDDD
        W  E+ +  + ++G+N+FL  F  A  +  I ++GPW +D+ ++L+ +P       ++DF  +P W+ F  LP  C +R  A+ +G+ LG  E+ D  DD
Subjt:  WNQEHTS--IVNVGFNIFLCKFKNARVKGFILEAGPWLYDKAMLLLEEPKGDSYGEDMDFRYVPFWIHFHKLPYACFSRSEAMDIGSLLGVVEQVDLDDD

Query:  TDPSWGSSLRMKIKIDVNYPLKRGIFLKSGKDGGEKWIAVTYEKLPDFCYGCG----HLGHIINDCDEFKGSSKDELPYGPWMREPVKLKSKENSF-SNR
         +P WGS+LR+++ +D++ PL+RGI L      G  WI + YE+LPDFCY CG       H       ++G+ K  +P     +E +  KS  NSF S+ 
Subjt:  TDPSWGSSLRMKIKIDVNYPLKRGIFLKSGKDGGEKWIAVTYEKLPDFCYGCG----HLGHIINDCDEFKGSSKDELPYGPWMREPVKLKSKENSF-SNR

Query:  PPISFAGRG
         P+    +G
Subjt:  PPISFAGRG

SwissProt top hits | e value | %identity | Alignment
No hits found

Arabidopsis top hits | e value | %identity | Alignment
AT3G42140.1 zinc ion binding;nucleic acid binding | 1.7e-05 | 23.48
Query:  ILEAGPWLYDKAMLLLEEPKGDSYGEDMDFRYVPFWIHFHKLPYACFSRSEAMDIGSLLGVVEQVDLDDDTDPSWGSSLRMKIKIDVNYPLKRGIFLKSG
        IL  GPW ++  M +++  +      D +F+ +PFWI    +P    +      IG  +G+  + +L  D      S L+ +                  
Subjt:  ILEAGPWLYDKAMLLLEEPKGDSYGEDMDFRYVPFWIHFHKLPYACFSRSEAMDIGSLLGVVEQVDLDDDTDPSWGSSLRMKIKIDVNYPLKRGIFLKSG

Query:  KDGGEKWIAVTYEKLPDFCYGCGHLGHIINDC
                   YEKL +FC  CG L H  ++C
Subjt:  KDGGEKWIAVTYEKLPDFCYGCGHLGHIINDC
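
The middle line of each alignment block can be summarised numerically. The sketch below assumes the usual BLAST pairwise convention (a letter marks an identical residue, "+" a conservative substitution, a space a mismatch or gap) and that the midline string covers exactly the aligned columns. The function name is a hypothetical helper. The example uses only the final columns of the AT3G42140.1 alignment, so its percentage is for that segment alone and is not expected to equal the %identity listed in the table, which covers the whole alignment.

```python
def midline_stats(midline):
    """Count identities and positives in a BLAST-style midline string.

    Assumed convention: letter = identical residue, '+' = conservative
    substitution, space = mismatch or gap. The string must span exactly
    the aligned columns (no label prefix, no trimmed trailing spaces).
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


# Midline columns opposite the query segment YEKLPDFCYGCGHLGHIINDC
# (last block of the AT3G42140.1 alignment on this page).
midline = "YEKL +FC  CG L H  ++C"
identities, positives, length = midline_stats(midline)
percent_identity = 100.0 * identities / length
print(identities, positives, length, round(percent_identity, 2))  # 11 14 21 52.38
```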


Sequences
CDS sequence
ATGGAGCGTAACGACGAGACAGAAGAAATAACTCAACAATTGGCAGATCTAAAAGTCACAGCTGCAGAGAAAACGAGTATTTTCCAGTTAAAAGAGGATGCAATAGATCA
ATCAGAGAAGAAGTTGGTGAATGCTCTGATTTGGAACCAAGAACATACTTCTATTGTGAATGTGGGTTTCAATATCTTTCTATGCAAATTCAAAAATGCTCGAGTTAAAG
GATTCATTTTGGAAGCTGGTCCGTGGCTTTATGATAAAGCCATGTTACTTCTTGAGGAGCCTAAAGGAGACAGCTATGGGGAGGACATGGATTTCAGGTATGTGCCCTTT
TGGATACATTTTCATAAATTGCCATATGCTTGTTTCTCTAGATCAGAGGCGATGGACATTGGGAGCTTACTAGGTGTGGTGGAACAAGTGGATCTTGATGATGATACAGA
TCCAAGTTGGGGCAGTTCACTAAGAATGAAGATCAAGATTGATGTCAATTACCCCCTAAAGCGTGGAATATTTCTAAAGTCTGGGAAGGATGGAGGAGAGAAGTGGATCG
CTGTAACATACGAAAAATTGCCTGATTTTTGCTACGGATGTGGTCATTTAGGCCATATTATTAATGATTGCGACGAATTCAAAGGTTCTTCTAAGGATGAATTACCCTAT
GGTCCATGGATGAGGGAGCCAGTTAAGCTTAAGAGTAAAGAAAATAGTTTTTCAAATAGACCTCCAATCTCCTTTGCAGGAAGAGGAAGGGGTCGAATGGGGGAAGGGGG
AAGAGGTAGCTGGCGATCGGCAGAACCAGAAGCTAATGTGAGTGAAGATTCAGAAGCTGAAAAGCAGAAACTTGGATGCTTGGAAAAGAAGACGGCAAAGGAGGTCCCAC
CGTCGGAAAACAGACCACAAGCACCGGTGAAAAAGGTGGTTCCGGCGGCTGCAGAACAATCGGTGAGTTCAACGGTACAAAATCCAAATAAGGAGAAAGAGAAAATTGAC
GTGGATCTCATTAATGAGGGAATTAAAGGCGTAGAGTCTGGTAAGGAAGGAAAGGAAAAATCTGAATCTGTGAGCTTTATTTGCCCTAACGGCTTAATTATAAAAAAGGA
AGGAAAAAGGATTTTAAATGCTCCTACTCTCATGGAATTGGATGGAGATGTTACAAGGAGTGAAATCTCAGTAAATACTCCAACCTTGTTAGATGTTACTCCAGACTCTA
AAAATGAGGGTTCAACAAATATTAAGGGTGAAGGACAAAATAAGTGGTGGAAGAGATTGGCTCGCAGTAAATTGCAGGATGACCATGAAAATAAAATGCAAGAGTGCACC
GAACCCACAGTGGGACATAAACATAAGCTTGAGATGGAGGAAAATGGTCATAATAACAAGAGACACGTGATGGAGGGGGATGAAAAAATTGCAAGATCTGTGGAGGCTGC
TGGACAACCCCGCCGGGCACAATGA
mRNA sequence
ATGGAGCGTAACGACGAGACAGAAGAAATAACTCAACAATTGGCAGATCTAAAAGTCACAGCTGCAGAGAAAACGAGTATTTTCCAGTTAAAAGAGGATGCAATAGATCA
ATCAGAGAAGAAGTTGGTGAATGCTCTGATTTGGAACCAAGAACATACTTCTATTGTGAATGTGGGTTTCAATATCTTTCTATGCAAATTCAAAAATGCTCGAGTTAAAG
GATTCATTTTGGAAGCTGGTCCGTGGCTTTATGATAAAGCCATGTTACTTCTTGAGGAGCCTAAAGGAGACAGCTATGGGGAGGACATGGATTTCAGGTATGTGCCCTTT
TGGATACATTTTCATAAATTGCCATATGCTTGTTTCTCTAGATCAGAGGCGATGGACATTGGGAGCTTACTAGGTGTGGTGGAACAAGTGGATCTTGATGATGATACAGA
TCCAAGTTGGGGCAGTTCACTAAGAATGAAGATCAAGATTGATGTCAATTACCCCCTAAAGCGTGGAATATTTCTAAAGTCTGGGAAGGATGGAGGAGAGAAGTGGATCG
CTGTAACATACGAAAAATTGCCTGATTTTTGCTACGGATGTGGTCATTTAGGCCATATTATTAATGATTGCGACGAATTCAAAGGTTCTTCTAAGGATGAATTACCCTAT
GGTCCATGGATGAGGGAGCCAGTTAAGCTTAAGAGTAAAGAAAATAGTTTTTCAAATAGACCTCCAATCTCCTTTGCAGGAAGAGGAAGGGGTCGAATGGGGGAAGGGGG
AAGAGGTAGCTGGCGATCGGCAGAACCAGAAGCTAATGTGAGTGAAGATTCAGAAGCTGAAAAGCAGAAACTTGGATGCTTGGAAAAGAAGACGGCAAAGGAGGTCCCAC
CGTCGGAAAACAGACCACAAGCACCGGTGAAAAAGGTGGTTCCGGCGGCTGCAGAACAATCGGTGAGTTCAACGGTACAAAATCCAAATAAGGAGAAAGAGAAAATTGAC
GTGGATCTCATTAATGAGGGAATTAAAGGCGTAGAGTCTGGTAAGGAAGGAAAGGAAAAATCTGAATCTGTGAGCTTTATTTGCCCTAACGGCTTAATTATAAAAAAGGA
AGGAAAAAGGATTTTAAATGCTCCTACTCTCATGGAATTGGATGGAGATGTTACAAGGAGTGAAATCTCAGTAAATACTCCAACCTTGTTAGATGTTACTCCAGACTCTA
AAAATGAGGGTTCAACAAATATTAAGGGTGAAGGACAAAATAAGTGGTGGAAGAGATTGGCTCGCAGTAAATTGCAGGATGACCATGAAAATAAAATGCAAGAGTGCACC
GAACCCACAGTGGGACATAAACATAAGCTTGAGATGGAGGAAAATGGTCATAATAACAAGAGACACGTGATGGAGGGGGATGAAAAAATTGCAAGATCTGTGGAGGCTGC
TGGACAACCCCGCCGGGCACAATGA
Protein sequence
MERNDETEEITQQLADLKVTAAEKTSIFQLKEDAIDQSEKKLVNALIWNQEHTSIVNVGFNIFLCKFKNARVKGFILEAGPWLYDKAMLLLEEPKGDSYGEDMDFRYVPF
WIHFHKLPYACFSRSEAMDIGSLLGVVEQVDLDDDTDPSWGSSLRMKIKIDVNYPLKRGIFLKSGKDGGEKWIAVTYEKLPDFCYGCGHLGHIINDCDEFKGSSKDELPY
GPWMREPVKLKSKENSFSNRPPISFAGRGRGRMGEGGRGSWRSAEPEANVSEDSEAEKQKLGCLEKKTAKEVPPSENRPQAPVKKVVPAAAEQSVSSTVQNPNKEKEKID
VDLINEGIKGVESGKEGKEKSESVSFICPNGLIIKKEGKRILNAPTLMELDGDVTRSEISVNTPTLLDVTPDSKNEGSTNIKGEGQNKWWKRLARSKLQDDHENKMQECT
EPTVGHKHKLEMEENGHNNKRHVMEGDEKIARSVEAAGQPRRAQ
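
The CDS and protein sequences above can be checked against each other by translation. The sketch below assumes the standard genetic code (built here from the conventional TCAG-ordered amino acid string) and uses hypothetical helper names. Only the first 30 and the last 24 nucleotides of the CDS are copied in, to keep the example short.

```python
# Standard genetic code, codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate an in-frame DNA string; '*' marks a stop codon.

    Trailing bases that do not fill a codon are ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 24 nucleotides of the CDS sequence of Lag0026509.
cds_start = "ATGGAGCGTAACGACGAGACAGAAGAAATA"
cds_end = "GGACAACCCCGCCGGGCACAATGA"
print(translate(cds_start))  # MERNDETEEI
print(translate(cds_end))    # GQPRRAQ*
```

Both fragments agree with the listed protein, which begins MERNDETEEI and ends GQPRRAQ. Passing the full CDS to the same function would allow a complete comparison.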