; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi09G004930 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi09G004930
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Descriptionprotein KOKOPELLI-like isoform X1
Genome locationchr09:5158948..5163292
RNA-Seq ExpressionLsi09G004930
SyntenyLsi09G004930
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0055084.1 serine-rich adhesin for platelets [Cucumis melo var. makuwa]2.4e-15965.26Show/hide
Query:  FLALLSKQLQLDQRAQILLKHLLDDATARVLEFLSKFKQKNNLDTIYSENLMSIQDLATNSNSFYNFLHKDDEQTKPLDDNKVFEWIKHNQTARKMGNQE
        +L LLSK  QLD+RAQILLK LLDDAT+ VLE +SK                   +LA NSN  YNFLHKDD+QTKP+ D KV EW+KHNQTARKM N +
Subjt:  FLALLSKQLQLDQRAQILLKHLLDDATARVLEFLSKFKQKNNLDTIYSENLMSIQDLATNSNSFYNFLHKDDEQTKPLDDNKVFEWIKHNQTARKMGNQE

Query:  -----LRDRDSASNVAINDLSHGISSSLRRIELHILSLQR------RKTRSHWQSVLQGNESINQQKV-----HSTLRTKFTKPIKGHLVGEQKVKPMTT
             +RDR SASN A NDL HGI S+LRRIELHILSLQ       +KTR + Q VLQ NE++NQQKV     HSTLRT  TKP          V P+ T
Subjt:  -----LRDRDSASNVAINDLSHGISSSLRRIELHILSLQR------RKTRSHWQSVLQGNESINQQKV-----HSTLRTKFTKPIKGHLVGEQKVKPMTT

Query:  HRCSEFVHGFRIPLNQSNDEA-----IIETH-LPKQHKVVNPMTLIDKSGYTSVESKATFRPAMKLNQTSKQQAKRNQNSYGQMIMGPTLLDHHPSKEAR
        HR SEFVHG RIP +Q NDEA     IIE H +PKQHKVVNPMT   KSG TSV SKATFRPAMKL++TSKQ  K+NQ+ YG M+MGPTLLDHHPS+E R
Subjt:  HRCSEFVHGFRIPLNQSNDEA-----IIETH-LPKQHKVVNPMTLIDKSGYTSVESKATFRPAMKLNQTSKQQAKRNQNSYGQMIMGPTLLDHHPSKEAR

Query:  KEQTHNKTHLMTTQQESEFTNSEFQSVSSSSSSSSWTTQQTDESETVDND--NDNNDSSSPS-----HQDDPSTTDSKSSSRYSTKTFNIKQGKTEPKRA
        +E+THN  HL  +Q+ESE +NSEF+  S+ SSSSSWTTQ+T ESET +++  ++++DSSSPS     HQDD STTDSKSSS YS KTFNIK G+ E K+A
Subjt:  KEQTHNKTHLMTTQQESEFTNSEFQSVSSSSSSSSWTTQQTDESETVDND--NDNNDSSSPS-----HQDDPSTTDSKSSSRYSTKTFNIKQGKTEPKRA

Query:  IGRFKRLKNKLGVIFHHHHHHHHHHHNSHNFMWKQLRKMFHHRDNRRLLVSKEDRNEKVKKRAIRKVCYKNQVGKFQALAEGLRSHVWRSKAMKRKELKG
        +G FK+LKNKLGVIFHHHHHHHHHHHN+HNFMWKQL K+F+H++NR  +VSKED+ EKVK RA+R VC KNQV KF+ALAEGLRSHV RSKAMKRKE KG
Subjt:  IGRFKRLKNKLGVIFHHHHHHHHHHHNSHNFMWKQLRKMFHHRDNRRLLVSKEDRNEKVKKRAIRKVCYKNQVGKFQALAEGLRSHVWRSKAMKRKELKG

Query:  MKCGKKKKKKKKGVKKLHWWKMFHNRRGVKLPNKGRMKIGYVNR
        MK G   KKKK GVKKL+WW+MF NRRGVKLPNKG MKIGYVNR
Subjt:  MKCGKKKKKKKKGVKKLHWWKMFHNRRGVKLPNKGRMKIGYVNR

XP_038877121.1 protein KOKOPELLI-like isoform X1 [Benincasa hispida]7.5e-19876.47Show/hide
Query:  LDQRAQILLKHLLDDATARVLEFLSKFKQKNNLDTIYSENLMSIQDLATNSNSFYNFLHKDDEQTKPLDDNKVFEWIKHNQTARKMGNQELRDRDSASNV
        LD+RAQILLKHLLDDATA VLEFLS                    DLATNSN F NFLHKDD+Q KPL D KV EW+KHNQT RKMGN E+RDR SASNV
Subjt:  LDQRAQILLKHLLDDATARVLEFLSKFKQKNNLDTIYSENLMSIQDLATNSNSFYNFLHKDDEQTKPLDDNKVFEWIKHNQTARKMGNQELRDRDSASNV

Query:  AINDLSHGISSSLRRIELHILSL-----QRRKTRSHWQSVLQGNESINQQKVH-----STLRTKFTKPIK--GHLVGEQ-KVKPMTTHRCSEFVHGFRIP
        AIN+LSH ISS+LRRIELHILSL     QRRKTR HWQSVLQ NES+NQQ VH     STLR++FTKPIK  GH VGEQ KVKP T + CSE+VHGFRIP
Subjt:  AINDLSHGISSSLRRIELHILSL-----QRRKTRSHWQSVLQGNESINQQKVH-----STLRTKFTKPIK--GHLVGEQ-KVKPMTTHRCSEFVHGFRIP

Query:  LNQSNDEAI----IETHLPKQHKVVNPMTLIDKSGYTSVESKATFRPAMKLNQTSKQQAKRNQNSYGQMIMGPTLLDHHPSKEARKEQTHNKTHLMTTQQ
        L+Q+NDEA+    IETH+ KQHKVVNPMTLIDKSGYTSV SKATFRPAMKLNQTSKQQAKRNQNSYGQM+MGPTLLDHHPSKE R E+ ++KTHL  TQQ
Subjt:  LNQSNDEAI----IETHLPKQHKVVNPMTLIDKSGYTSVESKATFRPAMKLNQTSKQQAKRNQNSYGQMIMGPTLLDHHPSKEARKEQTHNKTHLMTTQQ

Query:  ESEFTNSEFQSVSSSSSSSSWTTQQTDESETVDNDNDNNDSSSPSHQDDPSTTDSKSSSRYSTKTFNIKQGKTEPKRAIGRFKRLKNKLGVIFHHHHHHH
        ESEFT+SEFQ  S+SSSSSSWTTQ+T  SETV ND D+N  SSPSHQDDP +TDSKSSS   TKTF IKQGKTE K+ +GRFKRLKNKLGV+FHHHHHHH
Subjt:  ESEFTNSEFQSVSSSSSSSSWTTQQTDESETVDNDNDNNDSSSPSHQDDPSTTDSKSSSRYSTKTFNIKQGKTEPKRAIGRFKRLKNKLGVIFHHHHHHH

Query:  HHHHNSHNFMWK-QLRKMFHHRDNRRLLVSKEDRNEKVKKRAIRKVCYKNQVGKFQALAEGLRSHVWRSKAMKRKELKGMKCGKKKKKKKKGVKKLHWWK
        HHHHNS+NFMWK QLRK+FH RDN+RLLVSKED NEKVKKRAIR VCYKNQVGKFQALAEGLRSHVWRSKAMKRK +KGMKCG      KKGVKKLHWWK
Subjt:  HHHHNSHNFMWK-QLRKMFHHRDNRRLLVSKEDRNEKVKKRAIRKVCYKNQVGKFQALAEGLRSHVWRSKAMKRKELKGMKCGKKKKKKKKGVKKLHWWK

Query:  MFHNRRGVKLPNKGRMKIGYVNRKAQL
        MF NRRGV+LPNKG MKIGYVN+KA+L
Subjt:  MFHNRRGVKLPNKGRMKIGYVNRKAQL

XP_038877122.1 uncharacterized protein LOC120069439 isoform X2 [Benincasa hispida]2.9e-18978.84Show/hide
Query:  DLATNSNSFYNFLHKDDEQTKPLDDNKVFEWIKHNQTARKMGNQELRDRDSASNVAINDLSHGISSSLRRIELHILSL-----QRRKTRSHWQSVLQGNE
        DLATNSN F NFLHKDD+Q KPL D KV EW+KHNQT RKMGN E+RDR SASNVAIN+LSH ISS+LRRIELHILSL     QRRKTR HWQSVLQ NE
Subjt:  DLATNSNSFYNFLHKDDEQTKPLDDNKVFEWIKHNQTARKMGNQELRDRDSASNVAINDLSHGISSSLRRIELHILSL-----QRRKTRSHWQSVLQGNE

Query:  SINQQKVH-----STLRTKFTKPIK--GHLVGEQ-KVKPMTTHRCSEFVHGFRIPLNQSNDEAI----IETHLPKQHKVVNPMTLIDKSGYTSVESKATF
        S+NQQ VH     STLR++FTKPIK  GH VGEQ KVKP T + CSE+VHGFRIPL+Q+NDEA+    IETH+ KQHKVVNPMTLIDKSGYTSV SKATF
Subjt:  SINQQKVH-----STLRTKFTKPIK--GHLVGEQ-KVKPMTTHRCSEFVHGFRIPLNQSNDEAI----IETHLPKQHKVVNPMTLIDKSGYTSVESKATF

Query:  RPAMKLNQTSKQQAKRNQNSYGQMIMGPTLLDHHPSKEARKEQTHNKTHLMTTQQESEFTNSEFQSVSSSSSSSSWTTQQTDESETVDNDNDNNDSSSPS
        RPAMKLNQTSKQQAKRNQNSYGQM+MGPTLLDHHPSKE R E+ ++KTHL  TQQESEFT+SEFQ  S+SSSSSSWTTQ+T  SETV ND D+N  SSPS
Subjt:  RPAMKLNQTSKQQAKRNQNSYGQMIMGPTLLDHHPSKEARKEQTHNKTHLMTTQQESEFTNSEFQSVSSSSSSSSWTTQQTDESETVDNDNDNNDSSSPS

Query:  HQDDPSTTDSKSSSRYSTKTFNIKQGKTEPKRAIGRFKRLKNKLGVIFHHHHHHHHHHHNSHNFMWK-QLRKMFHHRDNRRLLVSKEDRNEKVKKRAIRK
        HQDDP +TDSKSSS   TKTF IKQGKTE K+ +GRFKRLKNKLGV+FHHHHHHHHHHHNS+NFMWK QLRK+FH RDN+RLLVSKED NEKVKKRAIR 
Subjt:  HQDDPSTTDSKSSSRYSTKTFNIKQGKTEPKRAIGRFKRLKNKLGVIFHHHHHHHHHHHNSHNFMWK-QLRKMFHHRDNRRLLVSKEDRNEKVKKRAIRK

Query:  VCYKNQVGKFQALAEGLRSHVWRSKAMKRKELKGMKCGKKKKKKKKGVKKLHWWKMFHNRRGVKLPNKGRMKIGYVNRKAQL
        VCYKNQVGKFQALAEGLRSHVWRSKAMKRK +KGMKCG      KKGVKKLHWWKMF NRRGV+LPNKG MKIGYVN+KA+L
Subjt:  VCYKNQVGKFQALAEGLRSHVWRSKAMKRKELKGMKCGKKKKKKKKGVKKLHWWKMFHNRRGVKLPNKGRMKIGYVNRKAQL

XP_038877123.1 protein KOKOPELLI-like isoform X3 [Benincasa hispida]7.5e-19876.47Show/hide
Query:  LDQRAQILLKHLLDDATARVLEFLSKFKQKNNLDTIYSENLMSIQDLATNSNSFYNFLHKDDEQTKPLDDNKVFEWIKHNQTARKMGNQELRDRDSASNV
        LD+RAQILLKHLLDDATA VLEFLS                    DLATNSN F NFLHKDD+Q KPL D KV EW+KHNQT RKMGN E+RDR SASNV
Subjt:  LDQRAQILLKHLLDDATARVLEFLSKFKQKNNLDTIYSENLMSIQDLATNSNSFYNFLHKDDEQTKPLDDNKVFEWIKHNQTARKMGNQELRDRDSASNV

Query:  AINDLSHGISSSLRRIELHILSL-----QRRKTRSHWQSVLQGNESINQQKVH-----STLRTKFTKPIK--GHLVGEQ-KVKPMTTHRCSEFVHGFRIP
        AIN+LSH ISS+LRRIELHILSL     QRRKTR HWQSVLQ NES+NQQ VH     STLR++FTKPIK  GH VGEQ KVKP T + CSE+VHGFRIP
Subjt:  AINDLSHGISSSLRRIELHILSL-----QRRKTRSHWQSVLQGNESINQQKVH-----STLRTKFTKPIK--GHLVGEQ-KVKPMTTHRCSEFVHGFRIP

Query:  LNQSNDEAI----IETHLPKQHKVVNPMTLIDKSGYTSVESKATFRPAMKLNQTSKQQAKRNQNSYGQMIMGPTLLDHHPSKEARKEQTHNKTHLMTTQQ
        L+Q+NDEA+    IETH+ KQHKVVNPMTLIDKSGYTSV SKATFRPAMKLNQTSKQQAKRNQNSYGQM+MGPTLLDHHPSKE R E+ ++KTHL  TQQ
Subjt:  LNQSNDEAI----IETHLPKQHKVVNPMTLIDKSGYTSVESKATFRPAMKLNQTSKQQAKRNQNSYGQMIMGPTLLDHHPSKEARKEQTHNKTHLMTTQQ

Query:  ESEFTNSEFQSVSSSSSSSSWTTQQTDESETVDNDNDNNDSSSPSHQDDPSTTDSKSSSRYSTKTFNIKQGKTEPKRAIGRFKRLKNKLGVIFHHHHHHH
        ESEFT+SEFQ  S+SSSSSSWTTQ+T  SETV ND D+N  SSPSHQDDP +TDSKSSS   TKTF IKQGKTE K+ +GRFKRLKNKLGV+FHHHHHHH
Subjt:  ESEFTNSEFQSVSSSSSSSSWTTQQTDESETVDNDNDNNDSSSPSHQDDPSTTDSKSSSRYSTKTFNIKQGKTEPKRAIGRFKRLKNKLGVIFHHHHHHH

Query:  HHHHNSHNFMWK-QLRKMFHHRDNRRLLVSKEDRNEKVKKRAIRKVCYKNQVGKFQALAEGLRSHVWRSKAMKRKELKGMKCGKKKKKKKKGVKKLHWWK
        HHHHNS+NFMWK QLRK+FH RDN+RLLVSKED NEKVKKRAIR VCYKNQVGKFQALAEGLRSHVWRSKAMKRK +KGMKCG      KKGVKKLHWWK
Subjt:  HHHHNSHNFMWK-QLRKMFHHRDNRRLLVSKEDRNEKVKKRAIRKVCYKNQVGKFQALAEGLRSHVWRSKAMKRKELKGMKCGKKKKKKKKGVKKLHWWK

Query:  MFHNRRGVKLPNKGRMKIGYVNRKAQL
        MF NRRGV+LPNKG MKIGYVN+KA+L
Subjt:  MFHNRRGVKLPNKGRMKIGYVNRKAQL

XP_038877125.1 uncharacterized protein LOC120069439 isoform X4 [Benincasa hispida]2.0e-20376.75Show/hide
Query:  MQIPNFLALLSKQLQLDQRAQILLKHLLDDATARVLEFLSKFKQKNNLDTIYSENLMSIQDLATNSNSFYNFLHKDDEQTKPLDDNKVFEWIKHNQTARK
        MQIPNFLALLSK  QLD+RAQILLKHLLDDATA VLEFLS                    DLATNSN F NFLHKDD+Q KPL D KV EW+KHNQT RK
Subjt:  MQIPNFLALLSKQLQLDQRAQILLKHLLDDATARVLEFLSKFKQKNNLDTIYSENLMSIQDLATNSNSFYNFLHKDDEQTKPLDDNKVFEWIKHNQTARK

Query:  MGNQELRDRDSASNVAINDLSHGISSSLRRIELHILSL-----QRRKTRSHWQSVLQGNESINQQKVH-----STLRTKFTKPIK--GHLVGEQ-KVKPM
        MGN E+RDR SASNVAIN+LSH ISS+LRRIELHILSL     QRRKTR HWQSVLQ NES+NQQ VH     STLR++FTKPIK  GH VGEQ KVKP 
Subjt:  MGNQELRDRDSASNVAINDLSHGISSSLRRIELHILSL-----QRRKTRSHWQSVLQGNESINQQKVH-----STLRTKFTKPIK--GHLVGEQ-KVKPM

Query:  TTHRCSEFVHGFRIPLNQSNDEAI----IETHLPKQHKVVNPMTLIDKSGYTSVESKATFRPAMKLNQTSKQQAKRNQNSYGQMIMGPTLLDHHPSKEAR
        T + CSE+VHGFRIPL+Q+NDEA+    IETH+ KQHKVVNPMTLIDKSGYTSV SKATFRPAMKLNQTSKQQAKRNQNSYGQM+MGPTLLDHHPSKE R
Subjt:  TTHRCSEFVHGFRIPLNQSNDEAI----IETHLPKQHKVVNPMTLIDKSGYTSVESKATFRPAMKLNQTSKQQAKRNQNSYGQMIMGPTLLDHHPSKEAR

Query:  KEQTHNKTHLMTTQQESEFTNSEFQSVSSSSSSSSWTTQQTDESETVDNDNDNNDSSSPSHQDDPSTTDSKSSSRYSTKTFNIKQGKTEPKRAIGRFKRL
         E+ ++KTHL  TQQESEFT+SEFQ  S+SSSSSSWTTQ+T  SETV ND D+N  SSPSHQDDP +TDSKSSS   TKTF IKQGKTE K+ +GRFKRL
Subjt:  KEQTHNKTHLMTTQQESEFTNSEFQSVSSSSSSSSWTTQQTDESETVDNDNDNNDSSSPSHQDDPSTTDSKSSSRYSTKTFNIKQGKTEPKRAIGRFKRL

Query:  KNKLGVIFHHHHHHHHHHHNSHNFMWK-QLRKMFHHRDNRRLLVSKEDRNEKVKKRAIRKVCYKNQVGKFQALAEGLRSHVWRSKAMKRKELKGMKCGKK
        KNKLGV+FHHHHHHHHHHHNS+NFMWK QLRK+FH RDN+RLLVSKED NEKVKKRAIR VCYKNQVGKFQALAEGLRSHVWRSKAMKRK +KGMKCG  
Subjt:  KNKLGVIFHHHHHHHHHHHNSHNFMWK-QLRKMFHHRDNRRLLVSKEDRNEKVKKRAIRKVCYKNQVGKFQALAEGLRSHVWRSKAMKRKELKGMKCGKK

Query:  KKKKKKGVKKLHWWKMFHNRRGVKLPNKGRMKIGYVNRKAQL
            KKGVKKLHWWKMF NRRGV+LPNKG MKIGYVN+KA+L
Subjt:  KKKKKKGVKKLHWWKMFHNRRGVKLPNKGRMKIGYVNRKAQL

TrEMBL top hitse value%identityAlignment
A0A0A0KN38 Uncharacterized protein3.3e-15168.8Show/hide
Query:  IQDLATNSNSFYNFLHKDDEQTKPLDDNKVFEWIKHNQTARKMGNQE-----LRDRDSASNVAINDLSHGISSSLRRIELHILSLQR-----RKTRSHWQ
        +Q LA +SN  YNFLHKDD+QTKP+ D KV EW+KHNQTARKM N +     +RDR SASN A NDL HGI S+LRRIELHILSLQ      RKTR + Q
Subjt:  IQDLATNSNSFYNFLHKDDEQTKPLDDNKVFEWIKHNQTARKMGNQE-----LRDRDSASNVAINDLSHGISSSLRRIELHILSLQR-----RKTRSHWQ

Query:  SVLQGNESINQQKV-----HSTLRTKFTKPIKGHLVGEQKVKPMTTHRCSEFVHGFRIPLNQSNDEA-----IIETH-LPKQHKVVNPMTLIDKSGYTSV
         VL  NE++NQQKV     HSTLRT FTKP          V P+ T   SEFVHGFRIP +Q NDE      IIETH +P QHKVVNPMT   KSG TSV
Subjt:  SVLQGNESINQQKV-----HSTLRTKFTKPIKGHLVGEQKVKPMTTHRCSEFVHGFRIPLNQSNDEA-----IIETH-LPKQHKVVNPMTLIDKSGYTSV

Query:  ESK-ATFRPAMKLNQTSKQQAKRNQNSYGQMIMGPTLLDHHPSKEARKEQTHNKTHLMTTQQESEFTNSEFQSVSSSSSSSSWTTQQTDESETVDNDNDN
         SK ATFRPAMKL+QTSK Q K+NQ+ YG M+MGPTLLDHHPS+E RKE T+N THL   QQESE +NSEF+  S+ SSSSSWTTQQ  ESETV  DND+
Subjt:  ESK-ATFRPAMKLNQTSKQQAKRNQNSYGQMIMGPTLLDHHPSKEARKEQTHNKTHLMTTQQESEFTNSEFQSVSSSSSSSSWTTQQTDESETVDNDNDN

Query:  NDSSSPSHQDDPSTTDSKSSSRYSTKTFNIKQGKTEPKRAIGRFKRLKNKLGVIFHHHHHHHHHHHNSHNFMWKQLRKMFHHRDNRRLLVSKEDRNEKVK
         DSSSPSHQDD STTDSKSSS YS KTFN K GK E K+ +GRFK+LKNKLGVIFHHHHHHHHHHHNSHNFMWKQL K+F+H++ R  +VSKED+ EKVK
Subjt:  NDSSSPSHQDDPSTTDSKSSSRYSTKTFNIKQGKTEPKRAIGRFKRLKNKLGVIFHHHHHHHHHHHNSHNFMWKQLRKMFHHRDNRRLLVSKEDRNEKVK

Query:  KRAIRKVCYKNQVGKFQALAEGLRSHVWRSKAMKRKELKGMKCGKKKKKKKKGVKKLHWWKMFHNRRGVKLPNKGRMKIGYVNR
         RA+R VC K QV KF+ALAEGLRSHV RSKAMKRKE KGM+ G     KK GVKKL+WWKMF NRRGVKLPNKGRMKIGYVNR
Subjt:  KRAIRKVCYKNQVGKFQALAEGLRSHVWRSKAMKRKELKGMKCGKKKKKKKKGVKKLHWWKMFHNRRGVKLPNKGRMKIGYVNR

A0A1S3ASW8 uncharacterized protein LOC103482539 isoform X17.4e-15965.67Show/hide
Query:  LDQRAQILLKHLLDDATARVLEFLSKFKQKNNLDTIYSENLMSIQDLATNSNSFYNFLHKDDEQTKPLDDNKVFEWIKHNQTARKMGNQE-----LRDRD
        LD+RAQIL+K LLDDAT+ VLE +SK                   +LA NSN  YNFLHKDD+QTKP+ D KV EW+KHNQTARKM N +     +RDR 
Subjt:  LDQRAQILLKHLLDDATARVLEFLSKFKQKNNLDTIYSENLMSIQDLATNSNSFYNFLHKDDEQTKPLDDNKVFEWIKHNQTARKMGNQE-----LRDRD

Query:  SASNVAINDLSHGISSSLRRIELHILSLQR-----RKTRSHWQSVLQGNESINQQKV-----HSTLRTKFTKPIKGHLVGEQKVKPMTTHRCSEFVHGFR
        SASN A NDL HGI S+LRRIELHILSLQ      +KTR + Q VLQ NE++NQQKV     HSTLRT FTKP          V P+ THR SEFVHG R
Subjt:  SASNVAINDLSHGISSSLRRIELHILSLQR-----RKTRSHWQSVLQGNESINQQKV-----HSTLRTKFTKPIKGHLVGEQKVKPMTTHRCSEFVHGFR

Query:  IPLNQSNDEA-----IIETH-LPKQHKVVNPMTLIDKSGYTSVESKATFRPAMKLNQTSKQQAKRNQNSYGQMIMGPTLLDHHPSKEARKEQTHNKTHLM
        IP +Q NDEA     IIE H +PKQHKVVNPMT   KSG TSV SKATFRPA+KL++TSKQ  K+NQ+ YG M+MGPTLLDHHPS+E R+E+THN  HL 
Subjt:  IPLNQSNDEA-----IIETH-LPKQHKVVNPMTLIDKSGYTSVESKATFRPAMKLNQTSKQQAKRNQNSYGQMIMGPTLLDHHPSKEARKEQTHNKTHLM

Query:  TTQQESEFTNSEFQSVSSSSSSSSWTTQQ-----TDESETVDNDNDNN--DSSSPSHQDDPSTTDSKSSSRYSTKTFNIKQGKTEPKRAIGRFKRLKNKL
          Q+ESE +NSEF+  S+ SSSSSWTTQ+     T ESETVDND+D++    S+ +HQDD STTDSKSSS YS KTFNIK G+ E K+A+G FK+LKNKL
Subjt:  TTQQESEFTNSEFQSVSSSSSSSSWTTQQ-----TDESETVDNDNDNN--DSSSPSHQDDPSTTDSKSSSRYSTKTFNIKQGKTEPKRAIGRFKRLKNKL

Query:  GVIFHHHHHHHHHHHNSHNFMWKQLRKMFHHRDNRRLLVSKEDRNEKVKKRAIRKVCYKNQVGKFQALAEGLRSHVWRSKAMKRKELKGMKCGKKKKKKK
        GVIFHHHHHHHHHHHN+HNFMWKQL K+F+H++NR  +VSKED+ EKVK RA+R VC KNQV KF+ALAEGLRSHV RSKAMKRKE KGMK G KKKKKK
Subjt:  GVIFHHHHHHHHHHHNSHNFMWKQLRKMFHHRDNRRLLVSKEDRNEKVKKRAIRKVCYKNQVGKFQALAEGLRSHVWRSKAMKRKELKGMKCGKKKKKKK

Query:  KGVKKLHWWKMFHNRRGVKLPNKGRMKIGYVNR
         GVKKL+WW+MF NRRGVKLPNKG MKIGYVNR
Subjt:  KGVKKLHWWKMFHNRRGVKLPNKGRMKIGYVNR

A0A1S3ATF3 uncharacterized protein LOC103482539 isoform X33.8e-15565.33Show/hide
Query:  LKHLLDDATARVLEFLSKFKQKNNLDTIYSENLMSIQDLATNSNSFYNFLHKDDEQTKPLDDNKVFEWIKHNQTARKMGNQE-----LRDRDSASNVAIN
        +K LLDDAT+ VLE +SK                   +LA NSN  YNFLHKDD+QTKP+ D KV EW+KHNQTARKM N +     +RDR SASN A N
Subjt:  LKHLLDDATARVLEFLSKFKQKNNLDTIYSENLMSIQDLATNSNSFYNFLHKDDEQTKPLDDNKVFEWIKHNQTARKMGNQE-----LRDRDSASNVAIN

Query:  DLSHGISSSLRRIELHILSLQR-----RKTRSHWQSVLQGNESINQQKV-----HSTLRTKFTKPIKGHLVGEQKVKPMTTHRCSEFVHGFRIPLNQSND
        DL HGI S+LRRIELHILSLQ      +KTR + Q VLQ NE++NQQKV     HSTLRT FTKP          V P+ THR SEFVHG RIP +Q ND
Subjt:  DLSHGISSSLRRIELHILSLQR-----RKTRSHWQSVLQGNESINQQKV-----HSTLRTKFTKPIKGHLVGEQKVKPMTTHRCSEFVHGFRIPLNQSND

Query:  EA-----IIETH-LPKQHKVVNPMTLIDKSGYTSVESKATFRPAMKLNQTSKQQAKRNQNSYGQMIMGPTLLDHHPSKEARKEQTHNKTHLMTTQQESEF
        EA     IIE H +PKQHKVVNPMT   KSG TSV SKATFRPA+KL++TSKQ  K+NQ+ YG M+MGPTLLDHHPS+E R+E+THN  HL   Q+ESE 
Subjt:  EA-----IIETH-LPKQHKVVNPMTLIDKSGYTSVESKATFRPAMKLNQTSKQQAKRNQNSYGQMIMGPTLLDHHPSKEARKEQTHNKTHLMTTQQESEF

Query:  TNSEFQSVSSSSSSSSWTTQQ-----TDESETVDNDNDNN--DSSSPSHQDDPSTTDSKSSSRYSTKTFNIKQGKTEPKRAIGRFKRLKNKLGVIFHHHH
        +NSEF+  S+ SSSSSWTTQ+     T ESETVDND+D++    S+ +HQDD STTDSKSSS YS KTFNIK G+ E K+A+G FK+LKNKLGVIFHHHH
Subjt:  TNSEFQSVSSSSSSSSWTTQQ-----TDESETVDNDNDNN--DSSSPSHQDDPSTTDSKSSSRYSTKTFNIKQGKTEPKRAIGRFKRLKNKLGVIFHHHH

Query:  HHHHHHHNSHNFMWKQLRKMFHHRDNRRLLVSKEDRNEKVKKRAIRKVCYKNQVGKFQALAEGLRSHVWRSKAMKRKELKGMKCGKKKKKKKKGVKKLHW
        HHHHHHHN+HNFMWKQL K+F+H++NR  +VSKED+ EKVK RA+R VC KNQV KF+ALAEGLRSHV RSKAMKRKE KGMK G KKKKKK GVKKL+W
Subjt:  HHHHHHHNSHNFMWKQLRKMFHHRDNRRLLVSKEDRNEKVKKRAIRKVCYKNQVGKFQALAEGLRSHVWRSKAMKRKELKGMKCGKKKKKKKKGVKKLHW

Query:  WKMFHNRRGVKLPNKGRMKIGYVNR
        W+MF NRRGVKLPNKG MKIGYVNR
Subjt:  WKMFHNRRGVKLPNKGRMKIGYVNR

A0A1S3ATG7 uncharacterized protein LOC103482539 isoform X27.4e-15965.67Show/hide
Query:  LDQRAQILLKHLLDDATARVLEFLSKFKQKNNLDTIYSENLMSIQDLATNSNSFYNFLHKDDEQTKPLDDNKVFEWIKHNQTARKMGNQE-----LRDRD
        LD+RAQIL+K LLDDAT+ VLE +SK                   +LA NSN  YNFLHKDD+QTKP+ D KV EW+KHNQTARKM N +     +RDR 
Subjt:  LDQRAQILLKHLLDDATARVLEFLSKFKQKNNLDTIYSENLMSIQDLATNSNSFYNFLHKDDEQTKPLDDNKVFEWIKHNQTARKMGNQE-----LRDRD

Query:  SASNVAINDLSHGISSSLRRIELHILSLQR-----RKTRSHWQSVLQGNESINQQKV-----HSTLRTKFTKPIKGHLVGEQKVKPMTTHRCSEFVHGFR
        SASN A NDL HGI S+LRRIELHILSLQ      +KTR + Q VLQ NE++NQQKV     HSTLRT FTKP          V P+ THR SEFVHG R
Subjt:  SASNVAINDLSHGISSSLRRIELHILSLQR-----RKTRSHWQSVLQGNESINQQKV-----HSTLRTKFTKPIKGHLVGEQKVKPMTTHRCSEFVHGFR

Query:  IPLNQSNDEA-----IIETH-LPKQHKVVNPMTLIDKSGYTSVESKATFRPAMKLNQTSKQQAKRNQNSYGQMIMGPTLLDHHPSKEARKEQTHNKTHLM
        IP +Q NDEA     IIE H +PKQHKVVNPMT   KSG TSV SKATFRPA+KL++TSKQ  K+NQ+ YG M+MGPTLLDHHPS+E R+E+THN  HL 
Subjt:  IPLNQSNDEA-----IIETH-LPKQHKVVNPMTLIDKSGYTSVESKATFRPAMKLNQTSKQQAKRNQNSYGQMIMGPTLLDHHPSKEARKEQTHNKTHLM

Query:  TTQQESEFTNSEFQSVSSSSSSSSWTTQQ-----TDESETVDNDNDNN--DSSSPSHQDDPSTTDSKSSSRYSTKTFNIKQGKTEPKRAIGRFKRLKNKL
          Q+ESE +NSEF+  S+ SSSSSWTTQ+     T ESETVDND+D++    S+ +HQDD STTDSKSSS YS KTFNIK G+ E K+A+G FK+LKNKL
Subjt:  TTQQESEFTNSEFQSVSSSSSSSSWTTQQ-----TDESETVDNDNDNN--DSSSPSHQDDPSTTDSKSSSRYSTKTFNIKQGKTEPKRAIGRFKRLKNKL

Query:  GVIFHHHHHHHHHHHNSHNFMWKQLRKMFHHRDNRRLLVSKEDRNEKVKKRAIRKVCYKNQVGKFQALAEGLRSHVWRSKAMKRKELKGMKCGKKKKKKK
        GVIFHHHHHHHHHHHN+HNFMWKQL K+F+H++NR  +VSKED+ EKVK RA+R VC KNQV KF+ALAEGLRSHV RSKAMKRKE KGMK G KKKKKK
Subjt:  GVIFHHHHHHHHHHHNSHNFMWKQLRKMFHHRDNRRLLVSKEDRNEKVKKRAIRKVCYKNQVGKFQALAEGLRSHVWRSKAMKRKELKGMKCGKKKKKKK

Query:  KGVKKLHWWKMFHNRRGVKLPNKGRMKIGYVNR
         GVKKL+WW+MF NRRGVKLPNKG MKIGYVNR
Subjt:  KGVKKLHWWKMFHNRRGVKLPNKGRMKIGYVNR

A0A5A7UIV0 Serine-rich adhesin for platelets1.1e-15965.26Show/hide
Query:  FLALLSKQLQLDQRAQILLKHLLDDATARVLEFLSKFKQKNNLDTIYSENLMSIQDLATNSNSFYNFLHKDDEQTKPLDDNKVFEWIKHNQTARKMGNQE
        +L LLSK  QLD+RAQILLK LLDDAT+ VLE +SK                   +LA NSN  YNFLHKDD+QTKP+ D KV EW+KHNQTARKM N +
Subjt:  FLALLSKQLQLDQRAQILLKHLLDDATARVLEFLSKFKQKNNLDTIYSENLMSIQDLATNSNSFYNFLHKDDEQTKPLDDNKVFEWIKHNQTARKMGNQE

Query:  -----LRDRDSASNVAINDLSHGISSSLRRIELHILSLQR------RKTRSHWQSVLQGNESINQQKV-----HSTLRTKFTKPIKGHLVGEQKVKPMTT
             +RDR SASN A NDL HGI S+LRRIELHILSLQ       +KTR + Q VLQ NE++NQQKV     HSTLRT  TKP          V P+ T
Subjt:  -----LRDRDSASNVAINDLSHGISSSLRRIELHILSLQR------RKTRSHWQSVLQGNESINQQKV-----HSTLRTKFTKPIKGHLVGEQKVKPMTT

Query:  HRCSEFVHGFRIPLNQSNDEA-----IIETH-LPKQHKVVNPMTLIDKSGYTSVESKATFRPAMKLNQTSKQQAKRNQNSYGQMIMGPTLLDHHPSKEAR
        HR SEFVHG RIP +Q NDEA     IIE H +PKQHKVVNPMT   KSG TSV SKATFRPAMKL++TSKQ  K+NQ+ YG M+MGPTLLDHHPS+E R
Subjt:  HRCSEFVHGFRIPLNQSNDEA-----IIETH-LPKQHKVVNPMTLIDKSGYTSVESKATFRPAMKLNQTSKQQAKRNQNSYGQMIMGPTLLDHHPSKEAR

Query:  KEQTHNKTHLMTTQQESEFTNSEFQSVSSSSSSSSWTTQQTDESETVDND--NDNNDSSSPS-----HQDDPSTTDSKSSSRYSTKTFNIKQGKTEPKRA
        +E+THN  HL  +Q+ESE +NSEF+  S+ SSSSSWTTQ+T ESET +++  ++++DSSSPS     HQDD STTDSKSSS YS KTFNIK G+ E K+A
Subjt:  KEQTHNKTHLMTTQQESEFTNSEFQSVSSSSSSSSWTTQQTDESETVDND--NDNNDSSSPS-----HQDDPSTTDSKSSSRYSTKTFNIKQGKTEPKRA

Query:  IGRFKRLKNKLGVIFHHHHHHHHHHHNSHNFMWKQLRKMFHHRDNRRLLVSKEDRNEKVKKRAIRKVCYKNQVGKFQALAEGLRSHVWRSKAMKRKELKG
        +G FK+LKNKLGVIFHHHHHHHHHHHN+HNFMWKQL K+F+H++NR  +VSKED+ EKVK RA+R VC KNQV KF+ALAEGLRSHV RSKAMKRKE KG
Subjt:  IGRFKRLKNKLGVIFHHHHHHHHHHHNSHNFMWKQLRKMFHHRDNRRLLVSKEDRNEKVKKRAIRKVCYKNQVGKFQALAEGLRSHVWRSKAMKRKELKG

Query:  MKCGKKKKKKKKGVKKLHWWKMFHNRRGVKLPNKGRMKIGYVNR
        MK G   KKKK GVKKL+WW+MF NRRGVKLPNKG MKIGYVNR
Subjt:  MKCGKKKKKKKKGVKKLHWWKMFHNRRGVKLPNKGRMKIGYVNR

SwissProt top hitse value%identityAlignment
Q9FFP2 Protein KOKOPELLI1.1e-1833.58Show/hide
Query:  KQQAKRNQNSYGQMIMGPTLLDHH------PSKEARKEQTHNKTHLMTTQQESEFTNSEFQSVSSSSSSSSWTTQQTDESETVDNDNDNNDSSSPSHQDD
        K+  + N+ S    IM PTL+D         S E   +QT + T    ++ E   T+ E+   + SSS S W TQ        +ND ++   SS   Q+D
Subjt:  KQQAKRNQNSYGQMIMGPTLLDHH------PSKEARKEQTHNKTHLMTTQQESEFTNSEFQSVSSSSSSSSWTTQQTDESETVDNDNDNNDSSSPSHQDD

Query:  PSTTDSKSSSRYSTKTFNIKQGKTEPKRAIGRFKRLKNKLGVIFHHHHHHHHHHHNSHN---FMWKQLRKMFHHRDNRRLLVSKEDRNEKVKKRAIRKVC
         S ++  +S  ++ +  + + GK + +  +GRFKR+KNK+G IFHHHHHHHHHHH+        W +L+  FHH+   +   SKE +    + + +    
Subjt:  PSTTDSKSSSRYSTKTFNIKQGKTEPKRAIGRFKRLKNKLGVIFHHHHHHHHHHHNSHN---FMWKQLRKMFHHRDNRRLLVSKEDRNEKVKKRAIRKVC

Query:  YKNQVGKFQALAEGLRSHVWRSKAMKRKELKGMKCGKKKKKKKKGVKKLHWWKMFHNRR--GVKLPNKGRMKIG
         ++Q G F AL EGL  H   SK             K+K + K   KK  WWK+   R+  GVK+P +GR+K+G
Subjt:  YKNQVGKFQALAEGLRSHVWRSKAMKRKELKGMKCGKKKKKKKKGVKKLHWWKMFHNRR--GVKLPNKGRMKIG

Arabidopsis top hitse value%identityAlignment
AT5G63720.1 kokopelli7.9e-2033.58Show/hide
Query:  KQQAKRNQNSYGQMIMGPTLLDHH------PSKEARKEQTHNKTHLMTTQQESEFTNSEFQSVSSSSSSSSWTTQQTDESETVDNDNDNNDSSSPSHQDD
        K+  + N+ S    IM PTL+D         S E   +QT + T    ++ E   T+ E+   + SSS S W TQ        +ND ++   SS   Q+D
Subjt:  KQQAKRNQNSYGQMIMGPTLLDHH------PSKEARKEQTHNKTHLMTTQQESEFTNSEFQSVSSSSSSSSWTTQQTDESETVDNDNDNNDSSSPSHQDD

Query:  PSTTDSKSSSRYSTKTFNIKQGKTEPKRAIGRFKRLKNKLGVIFHHHHHHHHHHHNSHN---FMWKQLRKMFHHRDNRRLLVSKEDRNEKVKKRAIRKVC
         S ++  +S  ++ +  + + GK + +  +GRFKR+KNK+G IFHHHHHHHHHHH+        W +L+  FHH+   +   SKE +    + + +    
Subjt:  PSTTDSKSSSRYSTKTFNIKQGKTEPKRAIGRFKRLKNKLGVIFHHHHHHHHHHHNSHN---FMWKQLRKMFHHRDNRRLLVSKEDRNEKVKKRAIRKVC

Query:  YKNQVGKFQALAEGLRSHVWRSKAMKRKELKGMKCGKKKKKKKKGVKKLHWWKMFHNRR--GVKLPNKGRMKIG
         ++Q G F AL EGL  H   SK             K+K + K   KK  WWK+   R+  GVK+P +GR+K+G
Subjt:  YKNQVGKFQALAEGLRSHVWRSKAMKRKELKGMKCGKKKKKKKKGVKKLHWWKMFHNRR--GVKLPNKGRMKIG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAATTCCCAACTTTTTGGCTTTATTGTCAAAGCAGCTGCAGCTGGATCAAAGAGCACAAATTCTATTGAAGCATTTGCTTGATGATGCTACTGCAAGAGTTCTGGA
GTTTCTCTCAAAGTTTAAACAAAAGAACAATTTGGACACAATTTATTCAGAAAATTTGATGTCTATACAGGACTTAGCAACAAACTCCAACAGTTTCTACAATTTTCTAC
ACAAAGATGATGAACAGACAAAGCCACTGGATGATAATAAAGTTTTTGAATGGATTAAACATAATCAAACTGCAAGAAAGATGGGAAATCAAGAGCTTAGGGATAGAGAT
TCAGCTTCAAATGTTGCTATTAATGACTTATCACATGGCATCAGTTCATCACTCAGAAGAATTGAACTCCACATTTTGTCCCTGCAAAGGAGGAAAACAAGAAGCCATTG
GCAGTCTGTTCTTCAAGGGAATGAGTCAATAAACCAGCAGAAAGTTCACTCAACTTTAAGGACCAAATTTACAAAGCCAATTAAAGGTCATCTTGTTGGTGAACAGAAAG
TTAAGCCAATGACGACGCACCGTTGTTCCGAGTTTGTTCATGGATTCAGAATACCTCTGAATCAAAGCAATGATGAGGCCATTATTGAAACCCATTTACCTAAACAACAC
AAAGTTGTAAATCCAATGACTCTGATAGATAAATCTGGATATACTTCAGTAGAATCCAAGGCAACCTTCAGGCCTGCTATGAAACTGAATCAAACTAGTAAACAACAAGC
AAAGAGGAACCAAAATTCATATGGTCAAATGATAATGGGGCCAACTTTGTTAGATCATCATCCCTCCAAAGAAGCAAGGAAGGAACAAACTCATAACAAGACCCATTTGA
TGACTACACAGCAAGAATCAGAATTCACAAACTCAGAATTTCAATCAGTTTCTTCTTCTTCTTCTTCTTCAAGTTGGACAACTCAACAAACTGACGAAAGTGAAACCGTC
GACAACGACAACGACAACAACGACTCTTCTTCCCCAAGTCACCAAGACGATCCATCGACAACGGATTCAAAATCAAGTAGCAGATATTCGACGAAAACATTCAATATCAA
GCAGGGGAAAACAGAGCCGAAGAGAGCAATCGGACGGTTCAAGCGACTCAAGAACAAACTCGGCGTTATCTTTCACCACCATCACCATCATCACCATCATCACCATAACA
GCCATAACTTCATGTGGAAGCAGCTAAGGAAGATGTTTCATCACAGAGATAACAGAAGATTATTAGTGAGTAAAGAAGATAGAAATGAGAAGGTAAAGAAGAGAGCCATT
AGAAAAGTGTGTTACAAGAATCAAGTGGGGAAGTTTCAGGCACTTGCTGAAGGGCTGAGAAGCCATGTATGGAGATCAAAAGCCATGAAAAGGAAAGAGCTTAAGGGGAT
GAAATGTGGGAAGAAGAAGAAGAAGAAGAAGAAGGGTGTGAAGAAATTGCATTGGTGGAAAATGTTTCATAATAGGCGTGGAGTGAAGTTGCCCAATAAAGGGCGTATGA
AAATAGGATATGTAAATAGAAAAGCACAGCTT
mRNA sequenceShow/hide mRNA sequence
CACGGACTGACCACGTGACCAGAACAAAAATTAGGGCAAAATCATGTGCTCCCTATAAATAAATCAAGCGGGCAAATATGAACCTTCATCTTCCTCCATTTTTTTTCAAT
GGTTGCGGTTCCTTTGCCGCAACTTCCATCTTCCCCATCGTATCTAATACCATCTATATGCGGCCTGAACGTTCTTTGTTCTTACAATATGGATGTTGATGAGTTATATC
TTGATCTCCTAGCACTGAGGGAATTATACATCCTTCTCTTGAAGAGTTCTTTGCAAGATGCAAATTCCCAACTTTTTGGCTTTATTGTCAAAGCAGCTGCAGCTGGATCA
AAGAGCACAAATTCTATTGAAGCATTTGCTTGATGATGCTACTGCAAGAGTTCTGGAGTTTCTCTCAAAGTTTAAACAAAAGAACAATTTGGACACAATTTATTCAGAAA
ATTTGATGTCTATACAGGACTTAGCAACAAACTCCAACAGTTTCTACAATTTTCTACACAAAGATGATGAACAGACAAAGCCACTGGATGATAATAAAGTTTTTGAATGG
ATTAAACATAATCAAACTGCAAGAAAGATGGGAAATCAAGAGCTTAGGGATAGAGATTCAGCTTCAAATGTTGCTATTAATGACTTATCACATGGCATCAGTTCATCACT
CAGAAGAATTGAACTCCACATTTTGTCCCTGCAAAGGAGGAAAACAAGAAGCCATTGGCAGTCTGTTCTTCAAGGGAATGAGTCAATAAACCAGCAGAAAGTTCACTCAA
CTTTAAGGACCAAATTTACAAAGCCAATTAAAGGTCATCTTGTTGGTGAACAGAAAGTTAAGCCAATGACGACGCACCGTTGTTCCGAGTTTGTTCATGGATTCAGAATA
CCTCTGAATCAAAGCAATGATGAGGCCATTATTGAAACCCATTTACCTAAACAACACAAAGTTGTAAATCCAATGACTCTGATAGATAAATCTGGATATACTTCAGTAGA
ATCCAAGGCAACCTTCAGGCCTGCTATGAAACTGAATCAAACTAGTAAACAACAAGCAAAGAGGAACCAAAATTCATATGGTCAAATGATAATGGGGCCAACTTTGTTAG
ATCATCATCCCTCCAAAGAAGCAAGGAAGGAACAAACTCATAACAAGACCCATTTGATGACTACACAGCAAGAATCAGAATTCACAAACTCAGAATTTCAATCAGTTTCT
TCTTCTTCTTCTTCTTCAAGTTGGACAACTCAACAAACTGACGAAAGTGAAACCGTCGACAACGACAACGACAACAACGACTCTTCTTCCCCAAGTCACCAAGACGATCC
ATCGACAACGGATTCAAAATCAAGTAGCAGATATTCGACGAAAACATTCAATATCAAGCAGGGGAAAACAGAGCCGAAGAGAGCAATCGGACGGTTCAAGCGACTCAAGA
ACAAACTCGGCGTTATCTTTCACCACCATCACCATCATCACCATCATCACCATAACAGCCATAACTTCATGTGGAAGCAGCTAAGGAAGATGTTTCATCACAGAGATAAC
AGAAGATTATTAGTGAGTAAAGAAGATAGAAATGAGAAGGTAAAGAAGAGAGCCATTAGAAAAGTGTGTTACAAGAATCAAGTGGGGAAGTTTCAGGCACTTGCTGAAGG
GCTGAGAAGCCATGTATGGAGATCAAAAGCCATGAAAAGGAAAGAGCTTAAGGGGATGAAATGTGGGAAGAAGAAGAAGAAGAAGAAGAAGGGTGTGAAGAAATTGCATT
GGTGGAAAATGTTTCATAATAGGCGTGGAGTGAAGTTGCCCAATAAAGGGCGTATGAAAATAGGATATGTAAATAGAAAAGCACAGCTT
Protein sequenceShow/hide protein sequence
MQIPNFLALLSKQLQLDQRAQILLKHLLDDATARVLEFLSKFKQKNNLDTIYSENLMSIQDLATNSNSFYNFLHKDDEQTKPLDDNKVFEWIKHNQTARKMGNQELRDRD
SASNVAINDLSHGISSSLRRIELHILSLQRRKTRSHWQSVLQGNESINQQKVHSTLRTKFTKPIKGHLVGEQKVKPMTTHRCSEFVHGFRIPLNQSNDEAIIETHLPKQH
KVVNPMTLIDKSGYTSVESKATFRPAMKLNQTSKQQAKRNQNSYGQMIMGPTLLDHHPSKEARKEQTHNKTHLMTTQQESEFTNSEFQSVSSSSSSSSWTTQQTDESETV
DNDNDNNDSSSPSHQDDPSTTDSKSSSRYSTKTFNIKQGKTEPKRAIGRFKRLKNKLGVIFHHHHHHHHHHHNSHNFMWKQLRKMFHHRDNRRLLVSKEDRNEKVKKRAI
RKVCYKNQVGKFQALAEGLRSHVWRSKAMKRKELKGMKCGKKKKKKKKGVKKLHWWKMFHNRRGVKLPNKGRMKIGYVNRKAQL