CuGenDBv2

MELO3C006492 (gene) of Melon (DHL92) v4 genome

Gene ID: MELO3C006492
Organism: Cucumis melo DHL92 (Melon (DHL92) v4)
Description: serine-aspartate repeat-containing protein F isoform X1
Genome location: chr06:3566514..3569730
RNA-Seq Expression: MELO3C006492
Synteny: MELO3C006492
Gene Ontology terms: NA
InterPro domains: NA
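The genome location above uses a `chromosome:start..end` notation. As a minimal sketch, the Python below splits such a string into its parts and derives the span length. The names `Locus` and `parse_location` are hypothetical helpers introduced only for illustration, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval (hypothetical helper type, not part of CuGenDB)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Locus:
    """Parse a 'chrom:start..end' string such as 'chr06:3566514..3569730'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Locus(match.group(1), int(match.group(2)), int(match.group(3)))


locus = parse_location("chr06:3566514..3569730")
print(locus.chrom, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption, the gene spans 3217 bp on chr06.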


Homology
GenBank top hits | e value | % identity | Alignment
KAA0049322.1 serine-aspartate repeat-containing protein F isoform X1 [Cucumis melo var. makuwa] | 0.0e+00 | 99.38
Query:  MGNEMGNNNTSEFREEEKEKAEGPELFLLEGGKNEGEADEVAGFSQKEARPSSGEADKVNGDHHVSEKEEEKNKVFDIEEFQVVSENLHDQTIEDNVVEV
        MGNEMGNNNTSEFREEEKEKAEGPELFLLEGGKNEGEADEVAGFSQKEARPSSGEADKVNGDHHVSEKEEEKNKVFDIEEFQVVSENLHDQTIEDNVVEV
Subjt:  MGNEMGNNNTSEFREEEKEKAEGPELFLLEGGKNEGEADEVAGFSQKEARPSSGEADKVNGDHHVSEKEEEKNKVFDIEEFQVVSENLHDQTIEDNVVEV

Query:  KSKENKEMECDSEENRSDKQASSQKEEEEKRGSNLNTTVLSLNEPKLEKTEEKCKDALESSSKNTCHSADCAVPDSEENTNMNQVIDTDRDTNKENDGEK
        KSKENKEMECDSEENRSDKQASSQKEEEEKRGSNLNTTVLSLNEPKLEKTEEKCKDALESSSKNTCHSADCAVPDSEENTNMNQVIDTDRDTNKENDGEK
Subjt:  KSKENKEMECDSEENRSDKQASSQKEEEEKRGSNLNTTVLSLNEPKLEKTEEKCKDALESSSKNTCHSADCAVPDSEENTNMNQVIDTDRDTNKENDGEK

Query:  GKGSVINMITHASEDEKSETTSDVNVDQVIDNDGDTDKENDGERGRGSSFNTTTHAPKDPKSEINSDFDSDQVIDTDVDAHKENDEERRKGSNFEMTNSS
        GKGSVINMITHASEDEKSETTSDVNVDQVIDNDGDTDKENDGERGRGSSFNTTTHAPKDPKSEINSDFDSDQVIDTDVDAHKENDEERRKGSNFEMTNSS
Subjt:  GKGSVINMITHASEDEKSETTSDVNVDQVIDNDGDTDKENDGERGRGSSFNTTTHAPKDPKSEINSDFDSDQVIDTDVDAHKENDEERRKGSNFEMTNSS

Query:  ENPKSEKTSNLDSNQVVRTDWDADKENDEGRGNSSNFDMLIDASKDPNSENSSNLRCSQHESPETNAESLTGSSDDGGTDMEKKKGDLVEPRQCHGYTAP
        ENPKSEKTSNLDSNQVVRTDWDADKENDEGRGNSSNFDMLIDASKDPNSENSSNLRCSQHESPETNAESLTGSSDDGGTDMEKKKGDLVEPRQCHGYTAP
Subjt:  ENPKSEKTSNLDSNQVVRTDWDADKENDEGRGNSSNFDMLIDASKDPNSENSSNLRCSQHESPETNAESLTGSSDDGGTDMEKKKGDLVEPRQCHGYTAP

Query:  AAKNVDTKDKGTVTDVICHNTSGSLAEECLVIESPNSSVQVPEVEDKVEFQLREERLGTETVDEDNIPTQSKISNEVQEEFNTTESHSEKNAEEAEVSPE
        AAKNVDTKDKGTVTDVICHNTSGSLAEECLVIESPNSSVQVPEVEDKVEFQLREERLGTETVDEDNIPTQSKISNEVQEEFNTTESHSEKNAEEAEVSPE
Subjt:  AAKNVDTKDKGTVTDVICHNTSGSLAEECLVIESPNSSVQVPEVEDKVEFQLREERLGTETVDEDNIPTQSKISNEVQEEFNTTESHSEKNAEEAEVSPE

Query:  FVAENKNEAPVEDCEDSDGEYLEISEQGMDILNLSIGDCKHKNEEMGETTELSTSNEHEAERREPDESPFEPILGFQPQTQQKETTIVFQTAESTDESIS
        FVAENKNEAPVEDCEDSDGEYLEISEQGMDILNLSIGDCKHKNEEMGETTE+ST+NEHEA+RREPDESPFEPILGFQPQTQQKETTIVFQTAESTDESIS
Subjt:  FVAENKNEAPVEDCEDSDGEYLEISEQGMDILNLSIGDCKHKNEEMGETTELSTSNEHEAERREPDESPFEPILGFQPQTQQKETTIVFQTAESTDESIS

Query:  APRQETDTETEKSKSNPSDSLSYTQTASSTPTETKPSTNPIDEQSLATLPFSTFGGEDQDSPGRTSNESISENSIGQIEMRKSPSFNIDIQIEGKTGETE
        APRQETDTETEKSKSNPSDSLSYTQTASSTPTETKP+TNPIDEQSLATLPFSTFGGEDQDSPGRTSNESISENSIGQIEMRKSPSFNIDIQIEGKTGETE
Subjt:  APRQETDTETEKSKSNPSDSLSYTQTASSTPTETKPSTNPIDEQSLATLPFSTFGGEDQDSPGRTSNESISENSIGQIEMRKSPSFNIDIQIEGKTGETE

Query:  KTPLLYQIKTIEDLSNLQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFVKEKEESGMEFKAIDQNNFVNEKKVAKNLPPPSPIRKGKRRTKSLIFGTCIC
        KTPLLYQIKTIEDLSNLQEISFPNPMEKRV+KLGRSDSEKSRPSFPGFVKEKEESGMEFKAIDQNNFVNEKKVAKNLPPPSPIRKGKRRTKSLIFGTCIC
Subjt:  KTPLLYQIKTIEDLSNLQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFVKEKEESGMEFKAIDQNNFVNEKKVAKNLPPPSPIRKGKRRTKSLIFGTCIC

Query:  CATAIN
        CATAIN
Subjt:  CATAIN

XP_008438604.1 PREDICTED: serine-aspartate repeat-containing protein F isoform X1 [Cucumis melo] | 0.0e+00 | 100
Query:  MGNEMGNNNTSEFREEEKEKAEGPELFLLEGGKNEGEADEVAGFSQKEARPSSGEADKVNGDHHVSEKEEEKNKVFDIEEFQVVSENLHDQTIEDNVVEV
        MGNEMGNNNTSEFREEEKEKAEGPELFLLEGGKNEGEADEVAGFSQKEARPSSGEADKVNGDHHVSEKEEEKNKVFDIEEFQVVSENLHDQTIEDNVVEV
Subjt:  MGNEMGNNNTSEFREEEKEKAEGPELFLLEGGKNEGEADEVAGFSQKEARPSSGEADKVNGDHHVSEKEEEKNKVFDIEEFQVVSENLHDQTIEDNVVEV

Query:  KSKENKEMECDSEENRSDKQASSQKEEEEKRGSNLNTTVLSLNEPKLEKTEEKCKDALESSSKNTCHSADCAVPDSEENTNMNQVIDTDRDTNKENDGEK
        KSKENKEMECDSEENRSDKQASSQKEEEEKRGSNLNTTVLSLNEPKLEKTEEKCKDALESSSKNTCHSADCAVPDSEENTNMNQVIDTDRDTNKENDGEK
Subjt:  KSKENKEMECDSEENRSDKQASSQKEEEEKRGSNLNTTVLSLNEPKLEKTEEKCKDALESSSKNTCHSADCAVPDSEENTNMNQVIDTDRDTNKENDGEK

Query:  GKGSVINMITHASEDEKSETTSDVNVDQVIDNDGDTDKENDGERGRGSSFNTTTHAPKDPKSEINSDFDSDQVIDTDVDAHKENDEERRKGSNFEMTNSS
        GKGSVINMITHASEDEKSETTSDVNVDQVIDNDGDTDKENDGERGRGSSFNTTTHAPKDPKSEINSDFDSDQVIDTDVDAHKENDEERRKGSNFEMTNSS
Subjt:  GKGSVINMITHASEDEKSETTSDVNVDQVIDNDGDTDKENDGERGRGSSFNTTTHAPKDPKSEINSDFDSDQVIDTDVDAHKENDEERRKGSNFEMTNSS

Query:  ENPKSEKTSNLDSNQVVRTDWDADKENDEGRGNSSNFDMLIDASKDPNSENSSNLRCSQHESPETNAESLTGSSDDGGTDMEKKKGDLVEPRQCHGYTAP
        ENPKSEKTSNLDSNQVVRTDWDADKENDEGRGNSSNFDMLIDASKDPNSENSSNLRCSQHESPETNAESLTGSSDDGGTDMEKKKGDLVEPRQCHGYTAP
Subjt:  ENPKSEKTSNLDSNQVVRTDWDADKENDEGRGNSSNFDMLIDASKDPNSENSSNLRCSQHESPETNAESLTGSSDDGGTDMEKKKGDLVEPRQCHGYTAP

Query:  AAKNVDTKDKGTVTDVICHNTSGSLAEECLVIESPNSSVQVPEVEDKVEFQLREERLGTETVDEDNIPTQSKISNEVQEEFNTTESHSEKNAEEAEVSPE
        AAKNVDTKDKGTVTDVICHNTSGSLAEECLVIESPNSSVQVPEVEDKVEFQLREERLGTETVDEDNIPTQSKISNEVQEEFNTTESHSEKNAEEAEVSPE
Subjt:  AAKNVDTKDKGTVTDVICHNTSGSLAEECLVIESPNSSVQVPEVEDKVEFQLREERLGTETVDEDNIPTQSKISNEVQEEFNTTESHSEKNAEEAEVSPE

Query:  FVAENKNEAPVEDCEDSDGEYLEISEQGMDILNLSIGDCKHKNEEMGETTELSTSNEHEAERREPDESPFEPILGFQPQTQQKETTIVFQTAESTDESIS
        FVAENKNEAPVEDCEDSDGEYLEISEQGMDILNLSIGDCKHKNEEMGETTELSTSNEHEAERREPDESPFEPILGFQPQTQQKETTIVFQTAESTDESIS
Subjt:  FVAENKNEAPVEDCEDSDGEYLEISEQGMDILNLSIGDCKHKNEEMGETTELSTSNEHEAERREPDESPFEPILGFQPQTQQKETTIVFQTAESTDESIS

Query:  APRQETDTETEKSKSNPSDSLSYTQTASSTPTETKPSTNPIDEQSLATLPFSTFGGEDQDSPGRTSNESISENSIGQIEMRKSPSFNIDIQIEGKTGETE
        APRQETDTETEKSKSNPSDSLSYTQTASSTPTETKPSTNPIDEQSLATLPFSTFGGEDQDSPGRTSNESISENSIGQIEMRKSPSFNIDIQIEGKTGETE
Subjt:  APRQETDTETEKSKSNPSDSLSYTQTASSTPTETKPSTNPIDEQSLATLPFSTFGGEDQDSPGRTSNESISENSIGQIEMRKSPSFNIDIQIEGKTGETE

Query:  KTPLLYQIKTIEDLSNLQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFVKEKEESGMEFKAIDQNNFVNEKKVAKNLPPPSPIRKGKRRTKSLIFGTCIC
        KTPLLYQIKTIEDLSNLQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFVKEKEESGMEFKAIDQNNFVNEKKVAKNLPPPSPIRKGKRRTKSLIFGTCIC
Subjt:  KTPLLYQIKTIEDLSNLQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFVKEKEESGMEFKAIDQNNFVNEKKVAKNLPPPSPIRKGKRRTKSLIFGTCIC

Query:  CATAIN
        CATAIN
Subjt:  CATAIN

XP_008438605.1 PREDICTED: serine-aspartate repeat-containing protein F isoform X2 [Cucumis melo] | 0.0e+00 | 96.9
Query:  MGNEMGNNNTSEFREEEKEKAEGPELFLLEGGKNEGEADEVAGFSQKEARPSSGEADKVNGDHHVSEKEEEKNKVFDIEEFQVVSENLHDQTIEDNVVEV
        MGNEMGNNNTSEFREEEKEKAEGPELFLLEGGKNEGEADEVAGFSQKEARPSSGEADKVNGDHHVSEK                         EDNVVEV
Subjt:  MGNEMGNNNTSEFREEEKEKAEGPELFLLEGGKNEGEADEVAGFSQKEARPSSGEADKVNGDHHVSEKEEEKNKVFDIEEFQVVSENLHDQTIEDNVVEV

Query:  KSKENKEMECDSEENRSDKQASSQKEEEEKRGSNLNTTVLSLNEPKLEKTEEKCKDALESSSKNTCHSADCAVPDSEENTNMNQVIDTDRDTNKENDGEK
        KSKENKEMECDSEENRSDKQASSQKEEEEKRGSNLNTTVLSLNEPKLEKTEEKCKDALESSSKNTCHSADCAVPDSEENTNMNQVIDTDRDTNKENDGEK
Subjt:  KSKENKEMECDSEENRSDKQASSQKEEEEKRGSNLNTTVLSLNEPKLEKTEEKCKDALESSSKNTCHSADCAVPDSEENTNMNQVIDTDRDTNKENDGEK

Query:  GKGSVINMITHASEDEKSETTSDVNVDQVIDNDGDTDKENDGERGRGSSFNTTTHAPKDPKSEINSDFDSDQVIDTDVDAHKENDEERRKGSNFEMTNSS
        GKGSVINMITHASEDEKSETTSDVNVDQVIDNDGDTDKENDGERGRGSSFNTTTHAPKDPKSEINSDFDSDQVIDTDVDAHKENDEERRKGSNFEMTNSS
Subjt:  GKGSVINMITHASEDEKSETTSDVNVDQVIDNDGDTDKENDGERGRGSSFNTTTHAPKDPKSEINSDFDSDQVIDTDVDAHKENDEERRKGSNFEMTNSS

Query:  ENPKSEKTSNLDSNQVVRTDWDADKENDEGRGNSSNFDMLIDASKDPNSENSSNLRCSQHESPETNAESLTGSSDDGGTDMEKKKGDLVEPRQCHGYTAP
        ENPKSEKTSNLDSNQVVRTDWDADKENDEGRGNSSNFDMLIDASKDPNSENSSNLRCSQHESPETNAESLTGSSDDGGTDMEKKKGDLVEPRQCHGYTAP
Subjt:  ENPKSEKTSNLDSNQVVRTDWDADKENDEGRGNSSNFDMLIDASKDPNSENSSNLRCSQHESPETNAESLTGSSDDGGTDMEKKKGDLVEPRQCHGYTAP

Query:  AAKNVDTKDKGTVTDVICHNTSGSLAEECLVIESPNSSVQVPEVEDKVEFQLREERLGTETVDEDNIPTQSKISNEVQEEFNTTESHSEKNAEEAEVSPE
        AAKNVDTKDKGTVTDVICHNTSGSLAEECLVIESPNSSVQVPEVEDKVEFQLREERLGTETVDEDNIPTQSKISNEVQEEFNTTESHSEKNAEEAEVSPE
Subjt:  AAKNVDTKDKGTVTDVICHNTSGSLAEECLVIESPNSSVQVPEVEDKVEFQLREERLGTETVDEDNIPTQSKISNEVQEEFNTTESHSEKNAEEAEVSPE

Query:  FVAENKNEAPVEDCEDSDGEYLEISEQGMDILNLSIGDCKHKNEEMGETTELSTSNEHEAERREPDESPFEPILGFQPQTQQKETTIVFQTAESTDESIS
        FVAENKNEAPVEDCEDSDGEYLEISEQGMDILNLSIGDCKHKNEEMGETTELSTSNEHEAERREPDESPFEPILGFQPQTQQKETTIVFQTAESTDESIS
Subjt:  FVAENKNEAPVEDCEDSDGEYLEISEQGMDILNLSIGDCKHKNEEMGETTELSTSNEHEAERREPDESPFEPILGFQPQTQQKETTIVFQTAESTDESIS

Query:  APRQETDTETEKSKSNPSDSLSYTQTASSTPTETKPSTNPIDEQSLATLPFSTFGGEDQDSPGRTSNESISENSIGQIEMRKSPSFNIDIQIEGKTGETE
        APRQETDTETEKSKSNPSDSLSYTQTASSTPTETKPSTNPIDEQSLATLPFSTFGGEDQDSPGRTSNESISENSIGQIEMRKSPSFNIDIQIEGKTGETE
Subjt:  APRQETDTETEKSKSNPSDSLSYTQTASSTPTETKPSTNPIDEQSLATLPFSTFGGEDQDSPGRTSNESISENSIGQIEMRKSPSFNIDIQIEGKTGETE

Query:  KTPLLYQIKTIEDLSNLQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFVKEKEESGMEFKAIDQNNFVNEKKVAKNLPPPSPIRKGKRRTKSLIFGTCIC
        KTPLLYQIKTIEDLSNLQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFVKEKEESGMEFKAIDQNNFVNEKKVAKNLPPPSPIRKGKRRTKSLIFGTCIC
Subjt:  KTPLLYQIKTIEDLSNLQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFVKEKEESGMEFKAIDQNNFVNEKKVAKNLPPPSPIRKGKRRTKSLIFGTCIC

Query:  CATAIN
        CATAIN
Subjt:  CATAIN

XP_008438606.1 PREDICTED: serine-aspartate repeat-containing protein F isoform X3 [Cucumis melo] | 0.0e+00 | 100
Query:  MECDSEENRSDKQASSQKEEEEKRGSNLNTTVLSLNEPKLEKTEEKCKDALESSSKNTCHSADCAVPDSEENTNMNQVIDTDRDTNKENDGEKGKGSVIN
        MECDSEENRSDKQASSQKEEEEKRGSNLNTTVLSLNEPKLEKTEEKCKDALESSSKNTCHSADCAVPDSEENTNMNQVIDTDRDTNKENDGEKGKGSVIN
Subjt:  MECDSEENRSDKQASSQKEEEEKRGSNLNTTVLSLNEPKLEKTEEKCKDALESSSKNTCHSADCAVPDSEENTNMNQVIDTDRDTNKENDGEKGKGSVIN

Query:  MITHASEDEKSETTSDVNVDQVIDNDGDTDKENDGERGRGSSFNTTTHAPKDPKSEINSDFDSDQVIDTDVDAHKENDEERRKGSNFEMTNSSENPKSEK
        MITHASEDEKSETTSDVNVDQVIDNDGDTDKENDGERGRGSSFNTTTHAPKDPKSEINSDFDSDQVIDTDVDAHKENDEERRKGSNFEMTNSSENPKSEK
Subjt:  MITHASEDEKSETTSDVNVDQVIDNDGDTDKENDGERGRGSSFNTTTHAPKDPKSEINSDFDSDQVIDTDVDAHKENDEERRKGSNFEMTNSSENPKSEK

Query:  TSNLDSNQVVRTDWDADKENDEGRGNSSNFDMLIDASKDPNSENSSNLRCSQHESPETNAESLTGSSDDGGTDMEKKKGDLVEPRQCHGYTAPAAKNVDT
        TSNLDSNQVVRTDWDADKENDEGRGNSSNFDMLIDASKDPNSENSSNLRCSQHESPETNAESLTGSSDDGGTDMEKKKGDLVEPRQCHGYTAPAAKNVDT
Subjt:  TSNLDSNQVVRTDWDADKENDEGRGNSSNFDMLIDASKDPNSENSSNLRCSQHESPETNAESLTGSSDDGGTDMEKKKGDLVEPRQCHGYTAPAAKNVDT

Query:  KDKGTVTDVICHNTSGSLAEECLVIESPNSSVQVPEVEDKVEFQLREERLGTETVDEDNIPTQSKISNEVQEEFNTTESHSEKNAEEAEVSPEFVAENKN
        KDKGTVTDVICHNTSGSLAEECLVIESPNSSVQVPEVEDKVEFQLREERLGTETVDEDNIPTQSKISNEVQEEFNTTESHSEKNAEEAEVSPEFVAENKN
Subjt:  KDKGTVTDVICHNTSGSLAEECLVIESPNSSVQVPEVEDKVEFQLREERLGTETVDEDNIPTQSKISNEVQEEFNTTESHSEKNAEEAEVSPEFVAENKN

Query:  EAPVEDCEDSDGEYLEISEQGMDILNLSIGDCKHKNEEMGETTELSTSNEHEAERREPDESPFEPILGFQPQTQQKETTIVFQTAESTDESISAPRQETD
        EAPVEDCEDSDGEYLEISEQGMDILNLSIGDCKHKNEEMGETTELSTSNEHEAERREPDESPFEPILGFQPQTQQKETTIVFQTAESTDESISAPRQETD
Subjt:  EAPVEDCEDSDGEYLEISEQGMDILNLSIGDCKHKNEEMGETTELSTSNEHEAERREPDESPFEPILGFQPQTQQKETTIVFQTAESTDESISAPRQETD

Query:  TETEKSKSNPSDSLSYTQTASSTPTETKPSTNPIDEQSLATLPFSTFGGEDQDSPGRTSNESISENSIGQIEMRKSPSFNIDIQIEGKTGETEKTPLLYQ
        TETEKSKSNPSDSLSYTQTASSTPTETKPSTNPIDEQSLATLPFSTFGGEDQDSPGRTSNESISENSIGQIEMRKSPSFNIDIQIEGKTGETEKTPLLYQ
Subjt:  TETEKSKSNPSDSLSYTQTASSTPTETKPSTNPIDEQSLATLPFSTFGGEDQDSPGRTSNESISENSIGQIEMRKSPSFNIDIQIEGKTGETEKTPLLYQ

Query:  IKTIEDLSNLQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFVKEKEESGMEFKAIDQNNFVNEKKVAKNLPPPSPIRKGKRRTKSLIFGTCICCATAIN
        IKTIEDLSNLQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFVKEKEESGMEFKAIDQNNFVNEKKVAKNLPPPSPIRKGKRRTKSLIFGTCICCATAIN
Subjt:  IKTIEDLSNLQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFVKEKEESGMEFKAIDQNNFVNEKKVAKNLPPPSPIRKGKRRTKSLIFGTCICCATAIN

XP_011650954.1 myb-like protein X isoform X1 [Cucumis sativus] | 0.0e+00 | 90.38
Query:  MGNEMGNNNTSEFREEEKEKAEGPELFLLEGGKNEGEADEVAGFSQKEARPSSGEADKVNGDHHVSEKEEEKNKVFDIEEFQVVSENLHDQTIEDNVVEV
        MGNEMGNNNTSEFREEEKEKAEGPE FLLEG KNE E DEVAGFSQKEAR SSG ADKVNG+HHV+EKEEEKNKV +IEEFQVVSENLHD+TIEDNVVEV
Subjt:  MGNEMGNNNTSEFREEEKEKAEGPELFLLEGGKNEGEADEVAGFSQKEARPSSGEADKVNGDHHVSEKEEEKNKVFDIEEFQVVSENLHDQTIEDNVVEV

Query:  KSKENKEMECDSEENRSD-----KQASSQKEEEEKRGSNLNTTVLSLNEPKLEKTEEKCKDALESSSKNTCHSADCAVPDSEENTNMNQVIDTDRDTNKE
        KSKENK++E +SEENRSD     KQAS+QKEEEEKRGSNLN  VLSLNEP LEKTEEKCKDALESSSKNTCHSADCAVPDSEENTNM+ VIDTDRDTNKE
Subjt:  KSKENKEMECDSEENRSD-----KQASSQKEEEEKRGSNLNTTVLSLNEPKLEKTEEKCKDALESSSKNTCHSADCAVPDSEENTNMNQVIDTDRDTNKE

Query:  NDGEKGKGSVINMITHASEDEKSETTSDVNVDQVIDNDGDTDKENDGERGRGSSFNTTTHAPKDPKSEINSDFDSDQVIDTDVDAHKENDEERRKGSNFE
        NDGEKG  S+I+MITHASEDEKSE TSDVN+DQV+DND DTDKENDGERGRGSS N TTHAPKDPKSE NS+FDSDQVIDTD    KENDEERRKGSNFE
Subjt:  NDGEKGKGSVINMITHASEDEKSETTSDVNVDQVIDNDGDTDKENDGERGRGSSFNTTTHAPKDPKSEINSDFDSDQVIDTDVDAHKENDEERRKGSNFE

Query:  MTNSSENPKSEKTSNLDSNQVVRTDWDADKENDEGRGNSSNFDMLIDASKDPNSENSSNLRCSQHESPETNAESLTGSSDDGGTDMEKKKGDLVEPRQCH
        M NSSENPKSEKTSNLD+NQVVRTDWDADK NDE RGNSSNFDMLIDASK+PNSENSS+LR  QHE PETNAESLTGSSDDG TDMEKKKGDLVEPRQCH
Subjt:  MTNSSENPKSEKTSNLDSNQVVRTDWDADKENDEGRGNSSNFDMLIDASKDPNSENSSNLRCSQHESPETNAESLTGSSDDGGTDMEKKKGDLVEPRQCH

Query:  GYTAPAAKNVDTKDKGTVTDVICHNTSGSLAEECLVIESPNSSVQVPEVEDKVEFQLREERLGTETVDEDNIPTQSKISNEVQEEFNTTESHSEKNAEEA
        GYT PAAKNVDTKDKGTVTD+ CHNTS SLAEECLVIESPNSSVQ+PEVE+K EFQLR E LGTETVDEDNIPTQSKISNEV+EEFNTTESHSE NAEEA
Subjt:  GYTAPAAKNVDTKDKGTVTDVICHNTSGSLAEECLVIESPNSSVQVPEVEDKVEFQLREERLGTETVDEDNIPTQSKISNEVQEEFNTTESHSEKNAEEA

Query:  EVSPEFVAENKNEAPVEDCEDSDGEYLEISEQGMDILNLSIGDCKHKNEEMGETTELSTSNEHEAERREPDESPFEPILGFQPQTQQKETTIVFQTAEST
        EVSPEFV EN+NEAPVEDCEDSDGEYLEISEQGMDILNLSIGDCKHKNEEMGETTELST+NEHE ERREPDES FEPILGFQPQTQQKETTI FQTAEST
Subjt:  EVSPEFVAENKNEAPVEDCEDSDGEYLEISEQGMDILNLSIGDCKHKNEEMGETTELSTSNEHEAERREPDESPFEPILGFQPQTQQKETTIVFQTAEST

Query:  DESISAPRQETDTETEKSKSNPSDSLSYTQTASSTPTETKPSTNPIDEQSLATLPFSTFGGEDQDSPGRTSNESISENSIGQIEMRKSPSFNIDIQIEGK
        DESISAPRQETDTETEKSKSNPSDS SYTQTASSTPTET+PSTNPIDEQSLATLPFSTFGGEDQDSPGRTSNESISENSIG IEMRKSPSFNIDIQIEGK
Subjt:  DESISAPRQETDTETEKSKSNPSDSLSYTQTASSTPTETKPSTNPIDEQSLATLPFSTFGGEDQDSPGRTSNESISENSIGQIEMRKSPSFNIDIQIEGK

Query:  TGETEKTPLLYQIKTIEDLSNLQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFVKEKEESGMEFKAIDQNNFVNEKKVAKNLPPPSPIRKGKRRTKSLIF
        TGETEK PLLYQIKTIEDL NLQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFVKEKEES ME KAIDQNNFV+EKK AKNLPPPSPIRKGKRRTKSLIF
Subjt:  TGETEKTPLLYQIKTIEDLSNLQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFVKEKEESGMEFKAIDQNNFVNEKKVAKNLPPPSPIRKGKRRTKSLIF

Query:  GTCICCATAIN
        GTCICCATAIN
Subjt:  GTCICCATAIN

TrEMBL top hits | e value | % identity | Alignment
A0A0A0L8I8 Uncharacterized protein | 0.0e+00 | 90.38
Query:  MGNEMGNNNTSEFREEEKEKAEGPELFLLEGGKNEGEADEVAGFSQKEARPSSGEADKVNGDHHVSEKEEEKNKVFDIEEFQVVSENLHDQTIEDNVVEV
        MGNEMGNNNTSEFREEEKEKAEGPE FLLEG KNE E DEVAGFSQKEAR SSG ADKVNG+HHV+EKEEEKNKV +IEEFQVVSENLHD+TIEDNVVEV
Subjt:  MGNEMGNNNTSEFREEEKEKAEGPELFLLEGGKNEGEADEVAGFSQKEARPSSGEADKVNGDHHVSEKEEEKNKVFDIEEFQVVSENLHDQTIEDNVVEV

Query:  KSKENKEMECDSEENRSD-----KQASSQKEEEEKRGSNLNTTVLSLNEPKLEKTEEKCKDALESSSKNTCHSADCAVPDSEENTNMNQVIDTDRDTNKE
        KSKENK++E +SEENRSD     KQAS+QKEEEEKRGSNLN  VLSLNEP LEKTEEKCKDALESSSKNTCHSADCAVPDSEENTNM+ VIDTDRDTNKE
Subjt:  KSKENKEMECDSEENRSD-----KQASSQKEEEEKRGSNLNTTVLSLNEPKLEKTEEKCKDALESSSKNTCHSADCAVPDSEENTNMNQVIDTDRDTNKE

Query:  NDGEKGKGSVINMITHASEDEKSETTSDVNVDQVIDNDGDTDKENDGERGRGSSFNTTTHAPKDPKSEINSDFDSDQVIDTDVDAHKENDEERRKGSNFE
        NDGEKG  S+I+MITHASEDEKSE TSDVN+DQV+DND DTDKENDGERGRGSS N TTHAPKDPKSE NS+FDSDQVIDTD    KENDEERRKGSNFE
Subjt:  NDGEKGKGSVINMITHASEDEKSETTSDVNVDQVIDNDGDTDKENDGERGRGSSFNTTTHAPKDPKSEINSDFDSDQVIDTDVDAHKENDEERRKGSNFE

Query:  MTNSSENPKSEKTSNLDSNQVVRTDWDADKENDEGRGNSSNFDMLIDASKDPNSENSSNLRCSQHESPETNAESLTGSSDDGGTDMEKKKGDLVEPRQCH
        M NSSENPKSEKTSNLD+NQVVRTDWDADK NDE RGNSSNFDMLIDASK+PNSENSS+LR  QHE PETNAESLTGSSDDG TDMEKKKGDLVEPRQCH
Subjt:  MTNSSENPKSEKTSNLDSNQVVRTDWDADKENDEGRGNSSNFDMLIDASKDPNSENSSNLRCSQHESPETNAESLTGSSDDGGTDMEKKKGDLVEPRQCH

Query:  GYTAPAAKNVDTKDKGTVTDVICHNTSGSLAEECLVIESPNSSVQVPEVEDKVEFQLREERLGTETVDEDNIPTQSKISNEVQEEFNTTESHSEKNAEEA
        GYT PAAKNVDTKDKGTVTD+ CHNTS SLAEECLVIESPNSSVQ+PEVE+K EFQLR E LGTETVDEDNIPTQSKISNEV+EEFNTTESHSE NAEEA
Subjt:  GYTAPAAKNVDTKDKGTVTDVICHNTSGSLAEECLVIESPNSSVQVPEVEDKVEFQLREERLGTETVDEDNIPTQSKISNEVQEEFNTTESHSEKNAEEA

Query:  EVSPEFVAENKNEAPVEDCEDSDGEYLEISEQGMDILNLSIGDCKHKNEEMGETTELSTSNEHEAERREPDESPFEPILGFQPQTQQKETTIVFQTAEST
        EVSPEFV EN+NEAPVEDCEDSDGEYLEISEQGMDILNLSIGDCKHKNEEMGETTELST+NEHE ERREPDES FEPILGFQPQTQQKETTI FQTAEST
Subjt:  EVSPEFVAENKNEAPVEDCEDSDGEYLEISEQGMDILNLSIGDCKHKNEEMGETTELSTSNEHEAERREPDESPFEPILGFQPQTQQKETTIVFQTAEST

Query:  DESISAPRQETDTETEKSKSNPSDSLSYTQTASSTPTETKPSTNPIDEQSLATLPFSTFGGEDQDSPGRTSNESISENSIGQIEMRKSPSFNIDIQIEGK
        DESISAPRQETDTETEKSKSNPSDS SYTQTASSTPTET+PSTNPIDEQSLATLPFSTFGGEDQDSPGRTSNESISENSIG IEMRKSPSFNIDIQIEGK
Subjt:  DESISAPRQETDTETEKSKSNPSDSLSYTQTASSTPTETKPSTNPIDEQSLATLPFSTFGGEDQDSPGRTSNESISENSIGQIEMRKSPSFNIDIQIEGK

Query:  TGETEKTPLLYQIKTIEDLSNLQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFVKEKEESGMEFKAIDQNNFVNEKKVAKNLPPPSPIRKGKRRTKSLIF
        TGETEK PLLYQIKTIEDL NLQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFVKEKEES ME KAIDQNNFV+EKK AKNLPPPSPIRKGKRRTKSLIF
Subjt:  TGETEKTPLLYQIKTIEDLSNLQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFVKEKEESGMEFKAIDQNNFVNEKKVAKNLPPPSPIRKGKRRTKSLIF

Query:  GTCICCATAIN
        GTCICCATAIN
Subjt:  GTCICCATAIN

A0A1S3AXG5 serine-aspartate repeat-containing protein F isoform X2 | 0.0e+00 | 96.9
Query:  MGNEMGNNNTSEFREEEKEKAEGPELFLLEGGKNEGEADEVAGFSQKEARPSSGEADKVNGDHHVSEKEEEKNKVFDIEEFQVVSENLHDQTIEDNVVEV
        MGNEMGNNNTSEFREEEKEKAEGPELFLLEGGKNEGEADEVAGFSQKEARPSSGEADKVNGDHHVSEK                         EDNVVEV
Subjt:  MGNEMGNNNTSEFREEEKEKAEGPELFLLEGGKNEGEADEVAGFSQKEARPSSGEADKVNGDHHVSEKEEEKNKVFDIEEFQVVSENLHDQTIEDNVVEV

Query:  KSKENKEMECDSEENRSDKQASSQKEEEEKRGSNLNTTVLSLNEPKLEKTEEKCKDALESSSKNTCHSADCAVPDSEENTNMNQVIDTDRDTNKENDGEK
        KSKENKEMECDSEENRSDKQASSQKEEEEKRGSNLNTTVLSLNEPKLEKTEEKCKDALESSSKNTCHSADCAVPDSEENTNMNQVIDTDRDTNKENDGEK
Subjt:  KSKENKEMECDSEENRSDKQASSQKEEEEKRGSNLNTTVLSLNEPKLEKTEEKCKDALESSSKNTCHSADCAVPDSEENTNMNQVIDTDRDTNKENDGEK

Query:  GKGSVINMITHASEDEKSETTSDVNVDQVIDNDGDTDKENDGERGRGSSFNTTTHAPKDPKSEINSDFDSDQVIDTDVDAHKENDEERRKGSNFEMTNSS
        GKGSVINMITHASEDEKSETTSDVNVDQVIDNDGDTDKENDGERGRGSSFNTTTHAPKDPKSEINSDFDSDQVIDTDVDAHKENDEERRKGSNFEMTNSS
Subjt:  GKGSVINMITHASEDEKSETTSDVNVDQVIDNDGDTDKENDGERGRGSSFNTTTHAPKDPKSEINSDFDSDQVIDTDVDAHKENDEERRKGSNFEMTNSS

Query:  ENPKSEKTSNLDSNQVVRTDWDADKENDEGRGNSSNFDMLIDASKDPNSENSSNLRCSQHESPETNAESLTGSSDDGGTDMEKKKGDLVEPRQCHGYTAP
        ENPKSEKTSNLDSNQVVRTDWDADKENDEGRGNSSNFDMLIDASKDPNSENSSNLRCSQHESPETNAESLTGSSDDGGTDMEKKKGDLVEPRQCHGYTAP
Subjt:  ENPKSEKTSNLDSNQVVRTDWDADKENDEGRGNSSNFDMLIDASKDPNSENSSNLRCSQHESPETNAESLTGSSDDGGTDMEKKKGDLVEPRQCHGYTAP

Query:  AAKNVDTKDKGTVTDVICHNTSGSLAEECLVIESPNSSVQVPEVEDKVEFQLREERLGTETVDEDNIPTQSKISNEVQEEFNTTESHSEKNAEEAEVSPE
        AAKNVDTKDKGTVTDVICHNTSGSLAEECLVIESPNSSVQVPEVEDKVEFQLREERLGTETVDEDNIPTQSKISNEVQEEFNTTESHSEKNAEEAEVSPE
Subjt:  AAKNVDTKDKGTVTDVICHNTSGSLAEECLVIESPNSSVQVPEVEDKVEFQLREERLGTETVDEDNIPTQSKISNEVQEEFNTTESHSEKNAEEAEVSPE

Query:  FVAENKNEAPVEDCEDSDGEYLEISEQGMDILNLSIGDCKHKNEEMGETTELSTSNEHEAERREPDESPFEPILGFQPQTQQKETTIVFQTAESTDESIS
        FVAENKNEAPVEDCEDSDGEYLEISEQGMDILNLSIGDCKHKNEEMGETTELSTSNEHEAERREPDESPFEPILGFQPQTQQKETTIVFQTAESTDESIS
Subjt:  FVAENKNEAPVEDCEDSDGEYLEISEQGMDILNLSIGDCKHKNEEMGETTELSTSNEHEAERREPDESPFEPILGFQPQTQQKETTIVFQTAESTDESIS

Query:  APRQETDTETEKSKSNPSDSLSYTQTASSTPTETKPSTNPIDEQSLATLPFSTFGGEDQDSPGRTSNESISENSIGQIEMRKSPSFNIDIQIEGKTGETE
        APRQETDTETEKSKSNPSDSLSYTQTASSTPTETKPSTNPIDEQSLATLPFSTFGGEDQDSPGRTSNESISENSIGQIEMRKSPSFNIDIQIEGKTGETE
Subjt:  APRQETDTETEKSKSNPSDSLSYTQTASSTPTETKPSTNPIDEQSLATLPFSTFGGEDQDSPGRTSNESISENSIGQIEMRKSPSFNIDIQIEGKTGETE

Query:  KTPLLYQIKTIEDLSNLQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFVKEKEESGMEFKAIDQNNFVNEKKVAKNLPPPSPIRKGKRRTKSLIFGTCIC
        KTPLLYQIKTIEDLSNLQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFVKEKEESGMEFKAIDQNNFVNEKKVAKNLPPPSPIRKGKRRTKSLIFGTCIC
Subjt:  KTPLLYQIKTIEDLSNLQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFVKEKEESGMEFKAIDQNNFVNEKKVAKNLPPPSPIRKGKRRTKSLIFGTCIC

Query:  CATAIN
        CATAIN
Subjt:  CATAIN

A0A1S3AXG7 serine-aspartate repeat-containing protein F isoform X3 | 0.0e+00 | 100
Query:  MECDSEENRSDKQASSQKEEEEKRGSNLNTTVLSLNEPKLEKTEEKCKDALESSSKNTCHSADCAVPDSEENTNMNQVIDTDRDTNKENDGEKGKGSVIN
        MECDSEENRSDKQASSQKEEEEKRGSNLNTTVLSLNEPKLEKTEEKCKDALESSSKNTCHSADCAVPDSEENTNMNQVIDTDRDTNKENDGEKGKGSVIN
Subjt:  MECDSEENRSDKQASSQKEEEEKRGSNLNTTVLSLNEPKLEKTEEKCKDALESSSKNTCHSADCAVPDSEENTNMNQVIDTDRDTNKENDGEKGKGSVIN

Query:  MITHASEDEKSETTSDVNVDQVIDNDGDTDKENDGERGRGSSFNTTTHAPKDPKSEINSDFDSDQVIDTDVDAHKENDEERRKGSNFEMTNSSENPKSEK
        MITHASEDEKSETTSDVNVDQVIDNDGDTDKENDGERGRGSSFNTTTHAPKDPKSEINSDFDSDQVIDTDVDAHKENDEERRKGSNFEMTNSSENPKSEK
Subjt:  MITHASEDEKSETTSDVNVDQVIDNDGDTDKENDGERGRGSSFNTTTHAPKDPKSEINSDFDSDQVIDTDVDAHKENDEERRKGSNFEMTNSSENPKSEK

Query:  TSNLDSNQVVRTDWDADKENDEGRGNSSNFDMLIDASKDPNSENSSNLRCSQHESPETNAESLTGSSDDGGTDMEKKKGDLVEPRQCHGYTAPAAKNVDT
        TSNLDSNQVVRTDWDADKENDEGRGNSSNFDMLIDASKDPNSENSSNLRCSQHESPETNAESLTGSSDDGGTDMEKKKGDLVEPRQCHGYTAPAAKNVDT
Subjt:  TSNLDSNQVVRTDWDADKENDEGRGNSSNFDMLIDASKDPNSENSSNLRCSQHESPETNAESLTGSSDDGGTDMEKKKGDLVEPRQCHGYTAPAAKNVDT

Query:  KDKGTVTDVICHNTSGSLAEECLVIESPNSSVQVPEVEDKVEFQLREERLGTETVDEDNIPTQSKISNEVQEEFNTTESHSEKNAEEAEVSPEFVAENKN
        KDKGTVTDVICHNTSGSLAEECLVIESPNSSVQVPEVEDKVEFQLREERLGTETVDEDNIPTQSKISNEVQEEFNTTESHSEKNAEEAEVSPEFVAENKN
Subjt:  KDKGTVTDVICHNTSGSLAEECLVIESPNSSVQVPEVEDKVEFQLREERLGTETVDEDNIPTQSKISNEVQEEFNTTESHSEKNAEEAEVSPEFVAENKN

Query:  EAPVEDCEDSDGEYLEISEQGMDILNLSIGDCKHKNEEMGETTELSTSNEHEAERREPDESPFEPILGFQPQTQQKETTIVFQTAESTDESISAPRQETD
        EAPVEDCEDSDGEYLEISEQGMDILNLSIGDCKHKNEEMGETTELSTSNEHEAERREPDESPFEPILGFQPQTQQKETTIVFQTAESTDESISAPRQETD
Subjt:  EAPVEDCEDSDGEYLEISEQGMDILNLSIGDCKHKNEEMGETTELSTSNEHEAERREPDESPFEPILGFQPQTQQKETTIVFQTAESTDESISAPRQETD

Query:  TETEKSKSNPSDSLSYTQTASSTPTETKPSTNPIDEQSLATLPFSTFGGEDQDSPGRTSNESISENSIGQIEMRKSPSFNIDIQIEGKTGETEKTPLLYQ
        TETEKSKSNPSDSLSYTQTASSTPTETKPSTNPIDEQSLATLPFSTFGGEDQDSPGRTSNESISENSIGQIEMRKSPSFNIDIQIEGKTGETEKTPLLYQ
Subjt:  TETEKSKSNPSDSLSYTQTASSTPTETKPSTNPIDEQSLATLPFSTFGGEDQDSPGRTSNESISENSIGQIEMRKSPSFNIDIQIEGKTGETEKTPLLYQ

Query:  IKTIEDLSNLQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFVKEKEESGMEFKAIDQNNFVNEKKVAKNLPPPSPIRKGKRRTKSLIFGTCICCATAIN
        IKTIEDLSNLQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFVKEKEESGMEFKAIDQNNFVNEKKVAKNLPPPSPIRKGKRRTKSLIFGTCICCATAIN
Subjt:  IKTIEDLSNLQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFVKEKEESGMEFKAIDQNNFVNEKKVAKNLPPPSPIRKGKRRTKSLIFGTCICCATAIN

A0A1S4DSH5 serine-aspartate repeat-containing protein F isoform X1 | 0.0e+00 | 100
Query:  MGNEMGNNNTSEFREEEKEKAEGPELFLLEGGKNEGEADEVAGFSQKEARPSSGEADKVNGDHHVSEKEEEKNKVFDIEEFQVVSENLHDQTIEDNVVEV
        MGNEMGNNNTSEFREEEKEKAEGPELFLLEGGKNEGEADEVAGFSQKEARPSSGEADKVNGDHHVSEKEEEKNKVFDIEEFQVVSENLHDQTIEDNVVEV
Subjt:  MGNEMGNNNTSEFREEEKEKAEGPELFLLEGGKNEGEADEVAGFSQKEARPSSGEADKVNGDHHVSEKEEEKNKVFDIEEFQVVSENLHDQTIEDNVVEV

Query:  KSKENKEMECDSEENRSDKQASSQKEEEEKRGSNLNTTVLSLNEPKLEKTEEKCKDALESSSKNTCHSADCAVPDSEENTNMNQVIDTDRDTNKENDGEK
        KSKENKEMECDSEENRSDKQASSQKEEEEKRGSNLNTTVLSLNEPKLEKTEEKCKDALESSSKNTCHSADCAVPDSEENTNMNQVIDTDRDTNKENDGEK
Subjt:  KSKENKEMECDSEENRSDKQASSQKEEEEKRGSNLNTTVLSLNEPKLEKTEEKCKDALESSSKNTCHSADCAVPDSEENTNMNQVIDTDRDTNKENDGEK

Query:  GKGSVINMITHASEDEKSETTSDVNVDQVIDNDGDTDKENDGERGRGSSFNTTTHAPKDPKSEINSDFDSDQVIDTDVDAHKENDEERRKGSNFEMTNSS
        GKGSVINMITHASEDEKSETTSDVNVDQVIDNDGDTDKENDGERGRGSSFNTTTHAPKDPKSEINSDFDSDQVIDTDVDAHKENDEERRKGSNFEMTNSS
Subjt:  GKGSVINMITHASEDEKSETTSDVNVDQVIDNDGDTDKENDGERGRGSSFNTTTHAPKDPKSEINSDFDSDQVIDTDVDAHKENDEERRKGSNFEMTNSS

Query:  ENPKSEKTSNLDSNQVVRTDWDADKENDEGRGNSSNFDMLIDASKDPNSENSSNLRCSQHESPETNAESLTGSSDDGGTDMEKKKGDLVEPRQCHGYTAP
        ENPKSEKTSNLDSNQVVRTDWDADKENDEGRGNSSNFDMLIDASKDPNSENSSNLRCSQHESPETNAESLTGSSDDGGTDMEKKKGDLVEPRQCHGYTAP
Subjt:  ENPKSEKTSNLDSNQVVRTDWDADKENDEGRGNSSNFDMLIDASKDPNSENSSNLRCSQHESPETNAESLTGSSDDGGTDMEKKKGDLVEPRQCHGYTAP

Query:  AAKNVDTKDKGTVTDVICHNTSGSLAEECLVIESPNSSVQVPEVEDKVEFQLREERLGTETVDEDNIPTQSKISNEVQEEFNTTESHSEKNAEEAEVSPE
        AAKNVDTKDKGTVTDVICHNTSGSLAEECLVIESPNSSVQVPEVEDKVEFQLREERLGTETVDEDNIPTQSKISNEVQEEFNTTESHSEKNAEEAEVSPE
Subjt:  AAKNVDTKDKGTVTDVICHNTSGSLAEECLVIESPNSSVQVPEVEDKVEFQLREERLGTETVDEDNIPTQSKISNEVQEEFNTTESHSEKNAEEAEVSPE

Query:  FVAENKNEAPVEDCEDSDGEYLEISEQGMDILNLSIGDCKHKNEEMGETTELSTSNEHEAERREPDESPFEPILGFQPQTQQKETTIVFQTAESTDESIS
        FVAENKNEAPVEDCEDSDGEYLEISEQGMDILNLSIGDCKHKNEEMGETTELSTSNEHEAERREPDESPFEPILGFQPQTQQKETTIVFQTAESTDESIS
Subjt:  FVAENKNEAPVEDCEDSDGEYLEISEQGMDILNLSIGDCKHKNEEMGETTELSTSNEHEAERREPDESPFEPILGFQPQTQQKETTIVFQTAESTDESIS

Query:  APRQETDTETEKSKSNPSDSLSYTQTASSTPTETKPSTNPIDEQSLATLPFSTFGGEDQDSPGRTSNESISENSIGQIEMRKSPSFNIDIQIEGKTGETE
        APRQETDTETEKSKSNPSDSLSYTQTASSTPTETKPSTNPIDEQSLATLPFSTFGGEDQDSPGRTSNESISENSIGQIEMRKSPSFNIDIQIEGKTGETE
Subjt:  APRQETDTETEKSKSNPSDSLSYTQTASSTPTETKPSTNPIDEQSLATLPFSTFGGEDQDSPGRTSNESISENSIGQIEMRKSPSFNIDIQIEGKTGETE

Query:  KTPLLYQIKTIEDLSNLQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFVKEKEESGMEFKAIDQNNFVNEKKVAKNLPPPSPIRKGKRRTKSLIFGTCIC
        KTPLLYQIKTIEDLSNLQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFVKEKEESGMEFKAIDQNNFVNEKKVAKNLPPPSPIRKGKRRTKSLIFGTCIC
Subjt:  KTPLLYQIKTIEDLSNLQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFVKEKEESGMEFKAIDQNNFVNEKKVAKNLPPPSPIRKGKRRTKSLIFGTCIC

Query:  CATAIN
        CATAIN
Subjt:  CATAIN

A0A5D3D317 Serine-aspartate repeat-containing protein F isoform X1 | 0.0e+00 | 99.38
Query:  MGNEMGNNNTSEFREEEKEKAEGPELFLLEGGKNEGEADEVAGFSQKEARPSSGEADKVNGDHHVSEKEEEKNKVFDIEEFQVVSENLHDQTIEDNVVEV
        MGNEMGNNNTSEFREEEKEKAEGPELFLLEGGKNEGEADEVAGFSQKEARPSSGEADKVNGDHHVSEKEEEKNKVFDIEEFQVVSENLHDQTIEDNVVEV
Subjt:  MGNEMGNNNTSEFREEEKEKAEGPELFLLEGGKNEGEADEVAGFSQKEARPSSGEADKVNGDHHVSEKEEEKNKVFDIEEFQVVSENLHDQTIEDNVVEV

Query:  KSKENKEMECDSEENRSDKQASSQKEEEEKRGSNLNTTVLSLNEPKLEKTEEKCKDALESSSKNTCHSADCAVPDSEENTNMNQVIDTDRDTNKENDGEK
        KSKENKEMECDSEENRSDKQASSQKEEEEKRGSNLNTTVLSLNEPKLEKTEEKCKDALESSSKNTCHSADCAVPDSEENTNMNQVIDTDRDTNKENDGEK
Subjt:  KSKENKEMECDSEENRSDKQASSQKEEEEKRGSNLNTTVLSLNEPKLEKTEEKCKDALESSSKNTCHSADCAVPDSEENTNMNQVIDTDRDTNKENDGEK

Query:  GKGSVINMITHASEDEKSETTSDVNVDQVIDNDGDTDKENDGERGRGSSFNTTTHAPKDPKSEINSDFDSDQVIDTDVDAHKENDEERRKGSNFEMTNSS
        GKGSVINMITHASEDEKSETTSDVNVDQVIDNDGDTDKENDGERGRGSSFNTTTHAPKDPKSEINSDFDSDQVIDTDVDAHKENDEERRKGSNFEMTNSS
Subjt:  GKGSVINMITHASEDEKSETTSDVNVDQVIDNDGDTDKENDGERGRGSSFNTTTHAPKDPKSEINSDFDSDQVIDTDVDAHKENDEERRKGSNFEMTNSS

Query:  ENPKSEKTSNLDSNQVVRTDWDADKENDEGRGNSSNFDMLIDASKDPNSENSSNLRCSQHESPETNAESLTGSSDDGGTDMEKKKGDLVEPRQCHGYTAP
        ENPKSEKTSNLDSNQVVRTDWDADKENDEGRGNSSNFDMLIDASKDPNSENSSNLRCSQHESPETNAESLTGSSDDGGTDMEKKKGDLVEPRQCHGYTAP
Subjt:  ENPKSEKTSNLDSNQVVRTDWDADKENDEGRGNSSNFDMLIDASKDPNSENSSNLRCSQHESPETNAESLTGSSDDGGTDMEKKKGDLVEPRQCHGYTAP

Query:  AAKNVDTKDKGTVTDVICHNTSGSLAEECLVIESPNSSVQVPEVEDKVEFQLREERLGTETVDEDNIPTQSKISNEVQEEFNTTESHSEKNAEEAEVSPE
        AAKNVDTKDKGTVTDVICHNTSGSLAEECLVIESPNSSVQVPEVEDKVEFQLREERLGTETVDEDNIPTQSKISNEVQEEFNTTESHSEKNAEEAEVSPE
Subjt:  AAKNVDTKDKGTVTDVICHNTSGSLAEECLVIESPNSSVQVPEVEDKVEFQLREERLGTETVDEDNIPTQSKISNEVQEEFNTTESHSEKNAEEAEVSPE

Query:  FVAENKNEAPVEDCEDSDGEYLEISEQGMDILNLSIGDCKHKNEEMGETTELSTSNEHEAERREPDESPFEPILGFQPQTQQKETTIVFQTAESTDESIS
        FVAENKNEAPVEDCEDSDGEYLEISEQGMDILNLSIGDCKHKNEEMGETTE+ST+NEHEA+RREPDESPFEPILGFQPQTQQKETTIVFQTAESTDESIS
Subjt:  FVAENKNEAPVEDCEDSDGEYLEISEQGMDILNLSIGDCKHKNEEMGETTELSTSNEHEAERREPDESPFEPILGFQPQTQQKETTIVFQTAESTDESIS

Query:  APRQETDTETEKSKSNPSDSLSYTQTASSTPTETKPSTNPIDEQSLATLPFSTFGGEDQDSPGRTSNESISENSIGQIEMRKSPSFNIDIQIEGKTGETE
        APRQETDTETEKSKSNPSDSLSYTQTASSTPTETKP+TNPIDEQSLATLPFSTFGGEDQDSPGRTSNESISENSIGQIEMRKSPSFNIDIQIEGKTGETE
Subjt:  APRQETDTETEKSKSNPSDSLSYTQTASSTPTETKPSTNPIDEQSLATLPFSTFGGEDQDSPGRTSNESISENSIGQIEMRKSPSFNIDIQIEGKTGETE

Query:  KTPLLYQIKTIEDLSNLQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFVKEKEESGMEFKAIDQNNFVNEKKVAKNLPPPSPIRKGKRRTKSLIFGTCIC
        KTPLLYQIKTIEDLSNLQEISFPNPMEKRV+KLGRSDSEKSRPSFPGFVKEKEESGMEFKAIDQNNFVNEKKVAKNLPPPSPIRKGKRRTKSLIFGTCIC
Subjt:  KTPLLYQIKTIEDLSNLQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFVKEKEESGMEFKAIDQNNFVNEKKVAKNLPPPSPIRKGKRRTKSLIFGTCIC

Query:  CATAIN
        CATAIN
Subjt:  CATAIN

SwissProt top hits | e value | % identity | Alignment
Q53653 Clumping factor A | 4.1e-04 | 22.79
Query:  IEDNVVEVKSKENKEMECDSEENRSDKQASSQKEEEEKRGSNLNTTVLSLNEPKLEKTEEKCKDALESSSKNTCHSADCAV-PDSEENTNMNQVIDTDRD
        I+  VV  +  E  E+E   E++ SD  + S  +     GS+  +   S +        +   D+  +S  ++   +D A   DS+ +++ +   D+D D
Subjt:  IEDNVVEVKSKENKEMECDSEENRSDKQASSQKEEEEKRGSNLNTTVLSLNEPKLEKTEEKCKDALESSSKNTCHSADCAV-PDSEENTNMNQVIDTDRD

Query:  TNKENDGEKGKGSVINMITHASEDEKSETTSDVNVDQVIDNDGDTDKENDGERGRGSSFNTTTHAPKDPKSEINSDFDSDQVIDTDVDAHKENDEERRKG
        ++ ++D +    S  +  + +  D  S++ SD + D   D+D D+D ++D +    S  ++ + +  D  S+ +SD DSD   D+D D+  ++D +    
Subjt:  TNKENDGEKGKGSVINMITHASEDEKSETTSDVNVDQVIDNDGDTDKENDGERGRGSSFNTTTHAPKDPKSEINSDFDSDQVIDTDVDAHKENDEERRKG

Query:  SNFEMTNSSENPK---SEKTSNLDSNQVVRTDWDADKENDEGRGNSSNFDMLIDASKDPNSENSSNLRCSQHESPETNAESLTGSSDDGGTDME
        S+ +  + S++     S+  S+ DS+    +D D+D ++D    + S+ D   D+  D +S++ S+         E++++S + S  D  +D +
Subjt:  SNFEMTNSSENPK---SEKTSNLDSNQVVRTDWDADKENDEGRGNSSNFDMLIDASKDPNSENSSNLRCSQHESPETNAESLTGSSDDGGTDME

Q5HHM8 Clumping factor A | 4.1e-04 | 22.79
Query:  IEDNVVEVKSKENKEMECDSEENRSDKQASSQKEEEEKRGSNLNTTVLSLNEPKLEKTEEKCKDALESSSKNTCHSADCAV-PDSEENTNMNQVIDTDRD
        I+  VV  +  E  E+E   E++ SD  + S  +     GS+  +   S +        +   D+  +S  ++   +D A   DS+ +++ +   D+D D
Subjt:  IEDNVVEVKSKENKEMECDSEENRSDKQASSQKEEEEKRGSNLNTTVLSLNEPKLEKTEEKCKDALESSSKNTCHSADCAV-PDSEENTNMNQVIDTDRD

Query:  TNKENDGEKGKGSVINMITHASEDEKSETTSDVNVDQVIDNDGDTDKENDGERGRGSSFNTTTHAPKDPKSEINSDFDSDQVIDTDVDAHKENDEERRKG
        ++ ++D +    S  +  + +  D  S++ SD + D   D+D D+D ++D +    S  ++ + +  D  S+ +SD DSD   D+D D+  ++D +    
Subjt:  TNKENDGEKGKGSVINMITHASEDEKSETTSDVNVDQVIDNDGDTDKENDGERGRGSSFNTTTHAPKDPKSEINSDFDSDQVIDTDVDAHKENDEERRKG

Query:  SNFEMTNSSENPK---SEKTSNLDSNQVVRTDWDADKENDEGRGNSSNFDMLIDASKDPNSENSSNLRCSQHESPETNAESLTGSSDDGGTDME
        S+ +  + S++     S+  S+ DS+    +D D+D ++D    + S+ D   D+  D +S++ S+         E++++S + S  D  +D +
Subjt:  SNFEMTNSSENPK---SEKTSNLDSNQVVRTDWDADKENDEGRGNSSNFDMLIDASKDPNSENSSNLRCSQHESPETNAESLTGSSDDGGTDME

Arabidopsis top hits | e value | % identity | Alignment
AT4G14650.1 unknown protein | 9.0e-07 | 33.8
Query:  ESISENSIGQIEMRKSPSFNIDIQIEGKTGET-EKTPLLYQIKTIEDLSNLQEISFPNPMEKRVVKLGRSDSEKSRPS--FPGFVKEKEESGMEFKAIDQ
        E+I E+S        +PSF+  ++IE +  E+ E TP+L      ED + + E +    +E++ V L RS+S KSR S    G +K+  +S  E K  + 
Subjt:  ESISENSIGQIEMRKSPSFNIDIQIEGKTGET-EKTPLLYQIKTIEDLSNLQEISFPNPMEKRVVKLGRSDSEKSRPS--FPGFVKEKEESGMEFKAIDQ

Query:  NNFVNEKKVAKNLPPPSPIRKGKRRTKSLIFGTCICCATAIN
        N       V K   P S     ++R+KS + GTC+CC TA+N
Subjt:  NNFVNEKKVAKNLPPPSPIRKGKRRTKSLIFGTCICCATAIN


Sequences
CDS sequence
ATGGGAAATGAGATGGGAAACAATAACACATCTGAGTTTAGAGAGGAAGAAAAGGAAAAAGCCGAAGGTCCAGAACTTTTTTTACTAGAAGGCGGGAAAAATGAAGGAGA
AGCCGATGAAGTTGCAGGCTTCAGCCAAAAAGAGGCACGGCCGAGTTCTGGTGAGGCAGATAAAGTAAATGGTGATCATCATGTGAGCGAGAAAGAGGAAGAGAAAAATA
AAGTATTTGATATAGAAGAGTTTCAAGTGGTTTCTGAAAACTTACATGATCAAACTATAGAGGACAATGTGGTCGAGGTCAAGTCTAAAGAAAATAAAGAAATGGAGTGC
GACTCAGAAGAGAATAGATCCGATAAGCAGGCTTCAAGTCAGAAAGAGGAAGAAGAGAAGAGAGGTTCCAATTTGAACACGACAGTACTTTCATTGAATGAGCCGAAACT
TGAGAAGACTGAAGAAAAATGCAAGGATGCTTTGGAATCGAGCTCGAAAAACACTTGCCATTCAGCAGATTGTGCCGTGCCAGATTCAGAAGAAAACACCAATATGAATC
AAGTTATTGATACTGACCGAGATACTAATAAGGAAAATGATGGAGAGAAAGGAAAGGGTTCCGTCATCAACATGATAACACATGCATCAGAAGATGAAAAATCTGAAACG
ACATCAGATGTTAATGTCGATCAAGTTATTGATAATGATGGAGATACAGATAAGGAAAATGATGGAGAAAGAGGAAGAGGTTCAAGCTTCAACACGACAACGCATGCACC
AAAGGATCCAAAATCTGAAATAAATTCAGACTTCGATTCTGATCAAGTTATTGATACTGATGTAGATGCACATAAGGAAAACGATGAAGAGAGGAGAAAGGGTTCCAACT
TCGAGATGACAAATTCATCAGAAAATCCAAAATCTGAAAAAACTTCAAACTTGGATAGTAATCAAGTTGTTCGTACTGATTGGGATGCAGATAAGGAAAACGATGAAGGG
AGGGGCAACAGCTCCAACTTCGACATGTTAATAGATGCATCAAAAGATCCAAATTCTGAAAATAGTTCAAATCTTAGGTGTAGTCAGCACGAGTCACCTGAAACCAACGC
TGAATCATTGACAGGATCAAGTGATGATGGAGGCACGGATATGGAGAAGAAAAAGGGTGATCTAGTGGAGCCAAGACAATGCCATGGGTATACAGCGCCAGCTGCCAAAA
ATGTTGATACTAAAGATAAAGGAACTGTGACAGATGTGATATGTCATAATACTTCAGGCTCGCTGGCAGAAGAGTGTCTGGTGATAGAATCACCCAATTCATCTGTGCAA
GTTCCTGAGGTTGAAGACAAGGTAGAATTCCAATTAAGAGAAGAACGTTTAGGAACTGAAACAGTCGATGAAGACAATATTCCAACTCAATCTAAGATTTCAAATGAGGT
CCAAGAGGAATTTAACACAACTGAGTCACACTCTGAGAAGAATGCAGAAGAAGCTGAAGTTTCTCCTGAATTTGTTGCCGAGAACAAGAACGAGGCTCCTGTAGAGGATT
GCGAGGATTCAGATGGAGAATATCTGGAAATTTCCGAACAAGGTATGGATATTTTAAATTTATCTATTGGAGATTGCAAGCATAAGAATGAGGAGATGGGAGAGACAACT
GAACTTTCAACAAGCAATGAACACGAAGCGGAAAGACGAGAACCAGATGAAAGCCCATTTGAGCCTATTTTAGGGTTTCAACCACAAACTCAGCAAAAGGAAACTACAAT
AGTTTTTCAAACTGCAGAAAGTACAGATGAATCGATTTCAGCACCGAGGCAAGAAACCGATACAGAAACTGAAAAATCCAAGAGCAATCCCTCGGATTCCCTAAGTTATA
CCCAGACAGCTTCATCAACACCCACAGAAACTAAACCATCGACAAATCCAATCGACGAACAGAGTTTAGCAACACTTCCATTCTCCACATTTGGAGGAGAAGATCAAGAT
TCTCCAGGAAGAACAAGCAATGAATCAATTTCAGAAAATTCAATTGGTCAAATCGAGATGCGTAAATCACCTAGCTTCAATATAGATATCCAAATCGAAGGAAAAACAGG
AGAAACAGAGAAAACTCCATTGCTATACCAAATTAAGACAATCGAAGACTTATCAAATCTGCAGGAGATTAGCTTCCCAAATCCAATGGAGAAACGAGTAGTGAAGTTGG
GAAGAAGCGACTCAGAGAAATCGAGGCCTTCTTTCCCAGGGTTCGTGAAAGAAAAAGAAGAATCAGGGATGGAATTCAAAGCAATCGATCAAAATAACTTCGTCAACGAG
AAGAAGGTAGCGAAAAACTTACCACCGCCATCGCCGATCCGTAAAGGGAAGCGCAGAACGAAATCCCTCATTTTTGGGACCTGCATCTGCTGTGCTACAGCGATCAATTG
A
mRNA sequence
AGAGACGAACAGAGAGGCTGAAGAGTTTGGCTGACAGATTACTCATTTAAGGCTTATGATCTAAGCAATATTTCTGAATTTGTTTAATCTTGTGCCAACAATCTAGCTGT
CTACAACATATCAAGCTGTCAAATATCATCACCATCATCACAATGGATTTGTTGAGTTGAATATGGGAAATGAGATGGGAAACAATAACACATCTGAGTTTAGAGAGGAA
GAAAAGGAAAAAGCCGAAGGTCCAGAACTTTTTTTACTAGAAGGCGGGAAAAATGAAGGAGAAGCCGATGAAGTTGCAGGCTTCAGCCAAAAAGAGGCACGGCCGAGTTC
TGGTGAGGCAGATAAAGTAAATGGTGATCATCATGTGAGCGAGAAAGAGGAAGAGAAAAATAAAGTATTTGATATAGAAGAGTTTCAAGTGGTTTCTGAAAACTTACATG
ATCAAACTATAGAGGACAATGTGGTCGAGGTCAAGTCTAAAGAAAATAAAGAAATGGAGTGCGACTCAGAAGAGAATAGATCCGATAAGCAGGCTTCAAGTCAGAAAGAG
GAAGAAGAGAAGAGAGGTTCCAATTTGAACACGACAGTACTTTCATTGAATGAGCCGAAACTTGAGAAGACTGAAGAAAAATGCAAGGATGCTTTGGAATCGAGCTCGAA
AAACACTTGCCATTCAGCAGATTGTGCCGTGCCAGATTCAGAAGAAAACACCAATATGAATCAAGTTATTGATACTGACCGAGATACTAATAAGGAAAATGATGGAGAGA
AAGGAAAGGGTTCCGTCATCAACATGATAACACATGCATCAGAAGATGAAAAATCTGAAACGACATCAGATGTTAATGTCGATCAAGTTATTGATAATGATGGAGATACA
GATAAGGAAAATGATGGAGAAAGAGGAAGAGGTTCAAGCTTCAACACGACAACGCATGCACCAAAGGATCCAAAATCTGAAATAAATTCAGACTTCGATTCTGATCAAGT
TATTGATACTGATGTAGATGCACATAAGGAAAACGATGAAGAGAGGAGAAAGGGTTCCAACTTCGAGATGACAAATTCATCAGAAAATCCAAAATCTGAAAAAACTTCAA
ACTTGGATAGTAATCAAGTTGTTCGTACTGATTGGGATGCAGATAAGGAAAACGATGAAGGGAGGGGCAACAGCTCCAACTTCGACATGTTAATAGATGCATCAAAAGAT
CCAAATTCTGAAAATAGTTCAAATCTTAGGTGTAGTCAGCACGAGTCACCTGAAACCAACGCTGAATCATTGACAGGATCAAGTGATGATGGAGGCACGGATATGGAGAA
GAAAAAGGGTGATCTAGTGGAGCCAAGACAATGCCATGGGTATACAGCGCCAGCTGCCAAAAATGTTGATACTAAAGATAAAGGAACTGTGACAGATGTGATATGTCATA
ATACTTCAGGCTCGCTGGCAGAAGAGTGTCTGGTGATAGAATCACCCAATTCATCTGTGCAAGTTCCTGAGGTTGAAGACAAGGTAGAATTCCAATTAAGAGAAGAACGT
TTAGGAACTGAAACAGTCGATGAAGACAATATTCCAACTCAATCTAAGATTTCAAATGAGGTCCAAGAGGAATTTAACACAACTGAGTCACACTCTGAGAAGAATGCAGA
AGAAGCTGAAGTTTCTCCTGAATTTGTTGCCGAGAACAAGAACGAGGCTCCTGTAGAGGATTGCGAGGATTCAGATGGAGAATATCTGGAAATTTCCGAACAAGGTATGG
ATATTTTAAATTTATCTATTGGAGATTGCAAGCATAAGAATGAGGAGATGGGAGAGACAACTGAACTTTCAACAAGCAATGAACACGAAGCGGAAAGACGAGAACCAGAT
GAAAGCCCATTTGAGCCTATTTTAGGGTTTCAACCACAAACTCAGCAAAAGGAAACTACAATAGTTTTTCAAACTGCAGAAAGTACAGATGAATCGATTTCAGCACCGAG
GCAAGAAACCGATACAGAAACTGAAAAATCCAAGAGCAATCCCTCGGATTCCCTAAGTTATACCCAGACAGCTTCATCAACACCCACAGAAACTAAACCATCGACAAATC
CAATCGACGAACAGAGTTTAGCAACACTTCCATTCTCCACATTTGGAGGAGAAGATCAAGATTCTCCAGGAAGAACAAGCAATGAATCAATTTCAGAAAATTCAATTGGT
CAAATCGAGATGCGTAAATCACCTAGCTTCAATATAGATATCCAAATCGAAGGAAAAACAGGAGAAACAGAGAAAACTCCATTGCTATACCAAATTAAGACAATCGAAGA
CTTATCAAATCTGCAGGAGATTAGCTTCCCAAATCCAATGGAGAAACGAGTAGTGAAGTTGGGAAGAAGCGACTCAGAGAAATCGAGGCCTTCTTTCCCAGGGTTCGTGA
AAGAAAAAGAAGAATCAGGGATGGAATTCAAAGCAATCGATCAAAATAACTTCGTCAACGAGAAGAAGGTAGCGAAAAACTTACCACCGCCATCGCCGATCCGTAAAGGG
AAGCGCAGAACGAAATCCCTCATTTTTGGGACCTGCATCTGCTGTGCTACAGCGATCAATTGAATGGATTTTGTGGAGATCATTCACATCCAGCATTGGGGTTTCGGAGT
GGTTTTCTTTCATTGATTTTTCTTTCATTTTTCTATTTTACTATTTCAGCAACGTTTCAAGAGCGAAGAACGCCATGATCGGCTTGAAGTTGAGCTCGTTGTTGGCCTTT
GAAACGAAAATATGGTTGGACTTGCTGTTCTTGTGAATTTCTTTTCTTCCCGCCCAAAATACATAATGTGATATTGTGA
Protein sequence
MGNEMGNNNTSEFREEEKEKAEGPELFLLEGGKNEGEADEVAGFSQKEARPSSGEADKVNGDHHVSEKEEEKNKVFDIEEFQVVSENLHDQTIEDNVVEVKSKENKEMEC
DSEENRSDKQASSQKEEEEKRGSNLNTTVLSLNEPKLEKTEEKCKDALESSSKNTCHSADCAVPDSEENTNMNQVIDTDRDTNKENDGEKGKGSVINMITHASEDEKSET
TSDVNVDQVIDNDGDTDKENDGERGRGSSFNTTTHAPKDPKSEINSDFDSDQVIDTDVDAHKENDEERRKGSNFEMTNSSENPKSEKTSNLDSNQVVRTDWDADKENDEG
RGNSSNFDMLIDASKDPNSENSSNLRCSQHESPETNAESLTGSSDDGGTDMEKKKGDLVEPRQCHGYTAPAAKNVDTKDKGTVTDVICHNTSGSLAEECLVIESPNSSVQ
VPEVEDKVEFQLREERLGTETVDEDNIPTQSKISNEVQEEFNTTESHSEKNAEEAEVSPEFVAENKNEAPVEDCEDSDGEYLEISEQGMDILNLSIGDCKHKNEEMGETT
ELSTSNEHEAERREPDESPFEPILGFQPQTQQKETTIVFQTAESTDESISAPRQETDTETEKSKSNPSDSLSYTQTASSTPTETKPSTNPIDEQSLATLPFSTFGGEDQD
SPGRTSNESISENSIGQIEMRKSPSFNIDIQIEGKTGETEKTPLLYQIKTIEDLSNLQEISFPNPMEKRVVKLGRSDSEKSRPSFPGFVKEKEESGMEFKAIDQNNFVNE
KKVAKNLPPPSPIRKGKRRTKSLIFGTCICCATAIN