; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC01G005000 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC01G005000
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
Descriptionprotein KOKOPELLI-like isoform X1
Genome locationCicolChr01:4948120..4950259
RNA-Seq ExpressionCcUC01G005000
SyntenyCcUC01G005000
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031740841.1 protein KOKOPELLI isoform X1 [Cucumis sativus]6.7e-18672.04Show/hide
Query:  MDVDELYLDLLALRELYILLLKSCLQDANSELQLDERAQILLKHLLDDATAGVLEFLSKDVATNSSIFYNFNWTWHKDDIQTKQLDDKAVEWMKHNQAAR
        M+VDELYLDLLALRELYILLLK CL+D NSELQLDERAQILLK LLDDAT+GVLE LSK +A +S+I YNF    HKDD QTK +D K VEWMKHNQ AR
Subjt:  MDVDELYLDLLALRELYILLLKSCLQDANSELQLDERAQILLKHLLDDATAGVLEFLSKDVATNSSIFYNFNWTWHKDDIQTKQLDDKAVEWMKHNQAAR

Query:  KMETSKIEHNPKRNRASASNVAKNDLSHGISSALRRIELHILSLQHCTSESRKTRSHWQSILQGNESLNQQKVLPRTDPPTLRSRFTKPIKGRDYFVGGQ
        KME  K E NP R+RASASN A NDL HGI SALRRIELHILSLQHCTS+SRKTR + Q +L  NE+LNQQKV  +TD  TLR+ FTKP           
Subjt:  KMETSKIEHNPKRNRASASNVAKNDLSHGISSALRRIELHILSLQHCTSESRKTRSHWQSILQGNESLNQQKVLPRTDPPTLRSRFTKPIKGRDYFVGGQ

Query:  KIVKPITMHHCSEFVHGFRIPLNQSNDEA-MKPPTIETH-LRKQHKVVNPMTLIDKSGYTSVESK-ATFRPTMKLNQTSKQAKRNQNSYGQMVMGPTLLD
          V PI   H SEFVHGFRIP +Q NDE  MKPP IETH +  QHKVVNPMT   KSG TSV SK ATFRP MKL+QTSKQ K+NQ+ YG MVMGPTLLD
Subjt:  KIVKPITMHHCSEFVHGFRIPLNQSNDEA-MKPPTIETH-LRKQHKVVNPMTLIDKSGYTSVESK-ATFRPTMKLNQTSKQAKRNQNSYGQMVMGPTLLD

Query:  HHPSKEARKEQTHNNKTHLTTQQESEFTNSEFQSV--SSSSWTTQQTTESETVVDDDDNDSSSPSHQDNLSTTDSKSSSRYSKKTFNIKQGKKESKRAIG
        HHPS+E RKE T+NN THL  QQESE +NSEF+S   SSSSWTTQQ TESET VD+DD DSSSPSHQD+ STTDSKSSS YS KTFN K GKKESK+ +G
Subjt:  HHPSKEARKEQTHNNKTHLTTQQESEFTNSEFQSV--SSSSWTTQQTTESETVVDDDDNDSSSPSHQDNLSTTDSKSSSRYSKKTFNIKQGKKESKRAIG

Query:  GLRRLKNKLGVIFHHHHHHHHHHDHDRHNFMWKQLRKIFNHRDNK-RLVSIEDSNEKVKKRATRNVCCKNQVGKFEALAEGLRSHVWRSKAMKRKELKGM
          ++LKNKLGVIFHHHHHHHHHH H+ HNFMWKQL KIFNH++ +  +VS ED  EKVK RA R+VC K QV KFEALAEGLRSHV RSKAMKRKE KGM
Subjt:  GLRRLKNKLGVIFHHHHHHHHHHDHDRHNFMWKQLRKIFNHRDNK-RLVSIEDSNEKVKKRATRNVCCKNQVGKFEALAEGLRSHVWRSKAMKRKELKGM

Query:  KCGKKKKEGVRKLHWWKMFRNRRGVKLANRGRMKIGYVNR
        + G  KK GV+KL+WWKMFRNRRGVKL N+GRMKIGYVNR
Subjt:  KCGKKKKEGVRKLHWWKMFRNRRGVKLANRGRMKIGYVNR

XP_038877121.1 protein KOKOPELLI-like isoform X1 [Benincasa hispida]1.3e-21078.04Show/hide
Query:  MDVDELYLDLLALRELYILLLKSCLQDANSELQLDERAQILLKHLLDDATAGVLEFLSKDVATNSSIFYNFNWTWHKDDIQTKQLDDKAVEWMKHNQAAR
        MDVD+LYLDLLALRELYILLLKSCL DANSEL LDERAQILLKHLLDDATAGVLEFLS D+ATNS+IF NF    HKDD Q K L DK  EWMKHNQ  R
Subjt:  MDVDELYLDLLALRELYILLLKSCLQDANSELQLDERAQILLKHLLDDATAGVLEFLSKDVATNSSIFYNFNWTWHKDDIQTKQLDDKAVEWMKHNQAAR

Query:  KMETSKIEHNPKRNRASASNVAKNDLSHGISSALRRIELHILSLQHCTSESRKTRSHWQSILQGNESLNQQKVLPRTDPPTLRSRFTKPIKGRDYFVGGQ
        KM   +I     R+RASASNVA N+LSH ISSALRRIELHILSLQHCTS+ RKTR HWQS+LQ NESLNQQ V PRT P TLRSRFTKPIKGR +FVG Q
Subjt:  KMETSKIEHNPKRNRASASNVAKNDLSHGISSALRRIELHILSLQHCTSESRKTRSHWQSILQGNESLNQQKVLPRTDPPTLRSRFTKPIKGRDYFVGGQ

Query:  KIVKPITMHHCSEFVHGFRIPLNQSNDEAMKPPTIETHLRKQHKVVNPMTLIDKSGYTSVESKATFRPTMKLNQTSK-QAKRNQNSYGQMVMGPTLLDHH
        K VKP T +HCSE+VHGFRIPL+Q+NDEAMKP TIETH+ KQHKVVNPMTLIDKSGYTSV SKATFRP MKLNQTSK QAKRNQNSYGQMVMGPTLLDHH
Subjt:  KIVKPITMHHCSEFVHGFRIPLNQSNDEAMKPPTIETHLRKQHKVVNPMTLIDKSGYTSVESKATFRPTMKLNQTSK-QAKRNQNSYGQMVMGPTLLDHH

Query:  PSKEARKEQTHNNKTHL-TTQQESEFTNSEFQSV--SSSSWTTQQTTESETVVDDDDNDSSSPSHQDNLSTTDSKSSSRYSKKTFNIKQGKKESKRAIGG
        PSKE R E+  N+KTHL  TQQESEFT+SEFQS   SSSSWTTQ+T+ SETV +D D++ SSPSHQD+  +TDSKSSS    KTF IKQGK ESK+ +G 
Subjt:  PSKEARKEQTHNNKTHL-TTQQESEFTNSEFQSV--SSSSWTTQQTTESETVVDDDDNDSSSPSHQDNLSTTDSKSSSRYSKKTFNIKQGKKESKRAIGG

Query:  LRRLKNKLGVIFHHHHHHHHHHDHDRHNFMWK-QLRKIFNHRDNKR-LVSIEDSNEKVKKRATRNVCCKNQVGKFEALAEGLRSHVWRSKAMKRKELKGM
         +RLKNKLGV+FHHHHHHHHHH H+ +NFMWK QLRKIF+ RDNKR LVS ED NEKVKKRA RNVC KNQVGKF+ALAEGLRSHVWRSKAMKRK +KGM
Subjt:  LRRLKNKLGVIFHHHHHHHHHHDHDRHNFMWK-QLRKIFNHRDNKR-LVSIEDSNEKVKKRATRNVCCKNQVGKFEALAEGLRSHVWRSKAMKRKELKGM

Query:  KCGKKKKEGVRKLHWWKMFRNRRGVKLANRGRMKIGYVNRKA
        KCGKK   GV+KLHWWKMFRNRRGV+L N+G MKIGYVN+KA
Subjt:  KCGKKKKEGVRKLHWWKMFRNRRGVKLANRGRMKIGYVNRKA

XP_038877122.1 uncharacterized protein LOC120069439 isoform X2 [Benincasa hispida]8.8e-19473.43Show/hide
Query:  MDVDELYLDLLALRELYILLLKSCLQDANSELQLDERAQILLKHLLDDATAGVLEFLSKDVATNSSIFYNFNWTWHKDDIQTKQLDDKAVEWMKHNQAAR
        MDVD+LYLDLLALRELYILLLKSCL DANSEL                           D+ATNS+IF NF    HKDD Q K L DK  EWMKHNQ  R
Subjt:  MDVDELYLDLLALRELYILLLKSCLQDANSELQLDERAQILLKHLLDDATAGVLEFLSKDVATNSSIFYNFNWTWHKDDIQTKQLDDKAVEWMKHNQAAR

Query:  KMETSKIEHNPKRNRASASNVAKNDLSHGISSALRRIELHILSLQHCTSESRKTRSHWQSILQGNESLNQQKVLPRTDPPTLRSRFTKPIKGRDYFVGGQ
        KM   +I     R+RASASNVA N+LSH ISSALRRIELHILSLQHCTS+ RKTR HWQS+LQ NESLNQQ V PRT P TLRSRFTKPIKGR +FVG Q
Subjt:  KMETSKIEHNPKRNRASASNVAKNDLSHGISSALRRIELHILSLQHCTSESRKTRSHWQSILQGNESLNQQKVLPRTDPPTLRSRFTKPIKGRDYFVGGQ

Query:  KIVKPITMHHCSEFVHGFRIPLNQSNDEAMKPPTIETHLRKQHKVVNPMTLIDKSGYTSVESKATFRPTMKLNQTSK-QAKRNQNSYGQMVMGPTLLDHH
        K VKP T +HCSE+VHGFRIPL+Q+NDEAMKP TIETH+ KQHKVVNPMTLIDKSGYTSV SKATFRP MKLNQTSK QAKRNQNSYGQMVMGPTLLDHH
Subjt:  KIVKPITMHHCSEFVHGFRIPLNQSNDEAMKPPTIETHLRKQHKVVNPMTLIDKSGYTSVESKATFRPTMKLNQTSK-QAKRNQNSYGQMVMGPTLLDHH

Query:  PSKEARKEQTHNNKTHL-TTQQESEFTNSEFQSV--SSSSWTTQQTTESETVVDDDDNDSSSPSHQDNLSTTDSKSSSRYSKKTFNIKQGKKESKRAIGG
        PSKE R E+  N+KTHL  TQQESEFT+SEFQS   SSSSWTTQ+T+ SETV +D D++ SSPSHQD+  +TDSKSSS    KTF IKQGK ESK+ +G 
Subjt:  PSKEARKEQTHNNKTHL-TTQQESEFTNSEFQSV--SSSSWTTQQTTESETVVDDDDNDSSSPSHQDNLSTTDSKSSSRYSKKTFNIKQGKKESKRAIGG

Query:  LRRLKNKLGVIFHHHHHHHHHHDHDRHNFMWK-QLRKIFNHRDNKR-LVSIEDSNEKVKKRATRNVCCKNQVGKFEALAEGLRSHVWRSKAMKRKELKGM
         +RLKNKLGV+FHHHHHHHHHH H+ +NFMWK QLRKIF+ RDNKR LVS ED NEKVKKRA RNVC KNQVGKF+ALAEGLRSHVWRSKAMKRK +KGM
Subjt:  LRRLKNKLGVIFHHHHHHHHHHDHDRHNFMWK-QLRKIFNHRDNKR-LVSIEDSNEKVKKRATRNVCCKNQVGKFEALAEGLRSHVWRSKAMKRKELKGM

Query:  KCGKKKKEGVRKLHWWKMFRNRRGVKLANRGRMKIGYVNRKA
        KCGKK   GV+KLHWWKMFRNRRGV+L N+G MKIGYVN+KA
Subjt:  KCGKKKKEGVRKLHWWKMFRNRRGVKLANRGRMKIGYVNRKA

XP_038877123.1 protein KOKOPELLI-like isoform X3 [Benincasa hispida]1.3e-21078.04Show/hide
Query:  MDVDELYLDLLALRELYILLLKSCLQDANSELQLDERAQILLKHLLDDATAGVLEFLSKDVATNSSIFYNFNWTWHKDDIQTKQLDDKAVEWMKHNQAAR
        MDVD+LYLDLLALRELYILLLKSCL DANSEL LDERAQILLKHLLDDATAGVLEFLS D+ATNS+IF NF    HKDD Q K L DK  EWMKHNQ  R
Subjt:  MDVDELYLDLLALRELYILLLKSCLQDANSELQLDERAQILLKHLLDDATAGVLEFLSKDVATNSSIFYNFNWTWHKDDIQTKQLDDKAVEWMKHNQAAR

Query:  KMETSKIEHNPKRNRASASNVAKNDLSHGISSALRRIELHILSLQHCTSESRKTRSHWQSILQGNESLNQQKVLPRTDPPTLRSRFTKPIKGRDYFVGGQ
        KM   +I     R+RASASNVA N+LSH ISSALRRIELHILSLQHCTS+ RKTR HWQS+LQ NESLNQQ V PRT P TLRSRFTKPIKGR +FVG Q
Subjt:  KMETSKIEHNPKRNRASASNVAKNDLSHGISSALRRIELHILSLQHCTSESRKTRSHWQSILQGNESLNQQKVLPRTDPPTLRSRFTKPIKGRDYFVGGQ

Query:  KIVKPITMHHCSEFVHGFRIPLNQSNDEAMKPPTIETHLRKQHKVVNPMTLIDKSGYTSVESKATFRPTMKLNQTSK-QAKRNQNSYGQMVMGPTLLDHH
        K VKP T +HCSE+VHGFRIPL+Q+NDEAMKP TIETH+ KQHKVVNPMTLIDKSGYTSV SKATFRP MKLNQTSK QAKRNQNSYGQMVMGPTLLDHH
Subjt:  KIVKPITMHHCSEFVHGFRIPLNQSNDEAMKPPTIETHLRKQHKVVNPMTLIDKSGYTSVESKATFRPTMKLNQTSK-QAKRNQNSYGQMVMGPTLLDHH

Query:  PSKEARKEQTHNNKTHL-TTQQESEFTNSEFQSV--SSSSWTTQQTTESETVVDDDDNDSSSPSHQDNLSTTDSKSSSRYSKKTFNIKQGKKESKRAIGG
        PSKE R E+  N+KTHL  TQQESEFT+SEFQS   SSSSWTTQ+T+ SETV +D D++ SSPSHQD+  +TDSKSSS    KTF IKQGK ESK+ +G 
Subjt:  PSKEARKEQTHNNKTHL-TTQQESEFTNSEFQSV--SSSSWTTQQTTESETVVDDDDNDSSSPSHQDNLSTTDSKSSSRYSKKTFNIKQGKKESKRAIGG

Query:  LRRLKNKLGVIFHHHHHHHHHHDHDRHNFMWK-QLRKIFNHRDNKR-LVSIEDSNEKVKKRATRNVCCKNQVGKFEALAEGLRSHVWRSKAMKRKELKGM
         +RLKNKLGV+FHHHHHHHHHH H+ +NFMWK QLRKIF+ RDNKR LVS ED NEKVKKRA RNVC KNQVGKF+ALAEGLRSHVWRSKAMKRK +KGM
Subjt:  LRRLKNKLGVIFHHHHHHHHHHDHDRHNFMWK-QLRKIFNHRDNKR-LVSIEDSNEKVKKRATRNVCCKNQVGKFEALAEGLRSHVWRSKAMKRKELKGM

Query:  KCGKKKKEGVRKLHWWKMFRNRRGVKLANRGRMKIGYVNRKA
        KCGKK   GV+KLHWWKMFRNRRGV+L N+G MKIGYVN+KA
Subjt:  KCGKKKKEGVRKLHWWKMFRNRRGVKLANRGRMKIGYVNRKA

XP_038877125.1 uncharacterized protein LOC120069439 isoform X4 [Benincasa hispida]1.4e-19677.25Show/hide
Query:  QLDERAQILLKHLLDDATAGVLEFLSKDVATNSSIFYNFNWTWHKDDIQTKQLDDKAVEWMKHNQAARKMETSKIEHNPKRNRASASNVAKNDLSHGISS
        QLDERAQILLKHLLDDATAGVLEFLS D+ATNS+IF NF    HKDD Q K L DK  EWMKHNQ  RKM   +I     R+RASASNVA N+LSH ISS
Subjt:  QLDERAQILLKHLLDDATAGVLEFLSKDVATNSSIFYNFNWTWHKDDIQTKQLDDKAVEWMKHNQAARKMETSKIEHNPKRNRASASNVAKNDLSHGISS

Query:  ALRRIELHILSLQHCTSESRKTRSHWQSILQGNESLNQQKVLPRTDPPTLRSRFTKPIKGRDYFVGGQKIVKPITMHHCSEFVHGFRIPLNQSNDEAMKP
        ALRRIELHILSLQHCTS+ RKTR HWQS+LQ NESLNQQ V PRT P TLRSRFTKPIKGR +FVG QK VKP T +HCSE+VHGFRIPL+Q+NDEAMKP
Subjt:  ALRRIELHILSLQHCTSESRKTRSHWQSILQGNESLNQQKVLPRTDPPTLRSRFTKPIKGRDYFVGGQKIVKPITMHHCSEFVHGFRIPLNQSNDEAMKP

Query:  PTIETHLRKQHKVVNPMTLIDKSGYTSVESKATFRPTMKLNQTSK-QAKRNQNSYGQMVMGPTLLDHHPSKEARKEQTHNNKTHL-TTQQESEFTNSEFQ
         TIETH+ KQHKVVNPMTLIDKSGYTSV SKATFRP MKLNQTSK QAKRNQNSYGQMVMGPTLLDHHPSKE R E+  N+KTHL  TQQESEFT+SEFQ
Subjt:  PTIETHLRKQHKVVNPMTLIDKSGYTSVESKATFRPTMKLNQTSK-QAKRNQNSYGQMVMGPTLLDHHPSKEARKEQTHNNKTHL-TTQQESEFTNSEFQ

Query:  SV--SSSSWTTQQTTESETVVDDDDNDSSSPSHQDNLSTTDSKSSSRYSKKTFNIKQGKKESKRAIGGLRRLKNKLGVIFHHHHHHHHHHDHDRHNFMWK
        S   SSSSWTTQ+T+ SETV +D D++ SSPSHQD+  +TDSKSSS    KTF IKQGK ESK+ +G  +RLKNKLGV+FHHHHHHHHHH H+ +NFMWK
Subjt:  SV--SSSSWTTQQTTESETVVDDDDNDSSSPSHQDNLSTTDSKSSSRYSKKTFNIKQGKKESKRAIGGLRRLKNKLGVIFHHHHHHHHHHDHDRHNFMWK

Query:  -QLRKIFNHRDNKR-LVSIEDSNEKVKKRATRNVCCKNQVGKFEALAEGLRSHVWRSKAMKRKELKGMKCGKKKKEGVRKLHWWKMFRNRRGVKLANRGR
         QLRKIF+ RDNKR LVS ED NEKVKKRA RNVC KNQVGKF+ALAEGLRSHVWRSKAMKRK +KGMKCGKK   GV+KLHWWKMFRNRRGV+L N+G 
Subjt:  -QLRKIFNHRDNKR-LVSIEDSNEKVKKRATRNVCCKNQVGKFEALAEGLRSHVWRSKAMKRKELKGMKCGKKKKEGVRKLHWWKMFRNRRGVKLANRGR

Query:  MKIGYVNRKA
        MKIGYVN+KA
Subjt:  MKIGYVNRKA

TrEMBL top hitse value%identityAlignment
A0A0A0KN38 Uncharacterized protein1.7e-15870.21Show/hide
Query:  VATNSSIFYNFNWTWHKDDIQTKQLDDKAVEWMKHNQAARKMETSKIEHNPKRNRASASNVAKNDLSHGISSALRRIELHILSLQHCTSESRKTRSHWQS
        +A +S+I YNF    HKDD QTK +D K VEWMKHNQ ARKME  K E NP R+RASASN A NDL HGI SALRRIELHILSLQHCTS+SRKTR + Q 
Subjt:  VATNSSIFYNFNWTWHKDDIQTKQLDDKAVEWMKHNQAARKMETSKIEHNPKRNRASASNVAKNDLSHGISSALRRIELHILSLQHCTSESRKTRSHWQS

Query:  ILQGNESLNQQKVLPRTDPPTLRSRFTKPIKGRDYFVGGQKIVKPITMHHCSEFVHGFRIPLNQSNDEA-MKPPTIETH-LRKQHKVVNPMTLIDKSGYT
        +L  NE+LNQQKV  +TD  TLR+ FTKP             V PI   H SEFVHGFRIP +Q NDE  MKPP IETH +  QHKVVNPMT   KSG T
Subjt:  ILQGNESLNQQKVLPRTDPPTLRSRFTKPIKGRDYFVGGQKIVKPITMHHCSEFVHGFRIPLNQSNDEA-MKPPTIETH-LRKQHKVVNPMTLIDKSGYT

Query:  SVESK-ATFRPTMKLNQTSKQAKRNQNSYGQMVMGPTLLDHHPSKEARKEQTHNNKTHLTTQQESEFTNSEFQSV--SSSSWTTQQTTESETVVDDDDND
        SV SK ATFRP MKL+QTSKQ K+NQ+ YG MVMGPTLLDHHPS+E RKE T+NN THL  QQESE +NSEF+S   SSSSWTTQQ TESET VD+DD D
Subjt:  SVESK-ATFRPTMKLNQTSKQAKRNQNSYGQMVMGPTLLDHHPSKEARKEQTHNNKTHLTTQQESEFTNSEFQSV--SSSSWTTQQTTESETVVDDDDND

Query:  SSSPSHQDNLSTTDSKSSSRYSKKTFNIKQGKKESKRAIGGLRRLKNKLGVIFHHHHHHHHHHDHDRHNFMWKQLRKIFNHRDNK-RLVSIEDSNEKVKK
        SSSPSHQD+ STTDSKSSS YS KTFN K GKKESK+ +G  ++LKNKLGVIFHHHHHHHHHH H+ HNFMWKQL KIFNH++ +  +VS ED  EKVK 
Subjt:  SSSPSHQDNLSTTDSKSSSRYSKKTFNIKQGKKESKRAIGGLRRLKNKLGVIFHHHHHHHHHHDHDRHNFMWKQLRKIFNHRDNK-RLVSIEDSNEKVKK

Query:  RATRNVCCKNQVGKFEALAEGLRSHVWRSKAMKRKELKGMKCGKKKKEGVRKLHWWKMFRNRRGVKLANRGRMKIGYVNR
        RA R+VC K QV KFEALAEGLRSHV RSKAMKRKE KGM+ G  KK GV+KL+WWKMFRNRRGVKL N+GRMKIGYVNR
Subjt:  RATRNVCCKNQVGKFEALAEGLRSHVWRSKAMKRKELKGMKCGKKKKEGVRKLHWWKMFRNRRGVKLANRGRMKIGYVNR

A0A1S3ASW8 uncharacterized protein LOC103482539 isoform X11.7e-18269.64Show/hide
Query:  MDVDELYLDLLALRELYILLLKSCLQDANSELQLDERAQILLKHLLDDATAGVLEFLSKDVATNSSIFYNFNWTWHKDDIQTKQLDDKAVEWMKHNQAAR
        M+VD+LYLDLLALRELYILLLK CL+D NSEL LDERAQIL+K LLDDAT+GVLE +SK++A NS+I YNF    HKDD QTK +D K VEW+KHNQ AR
Subjt:  MDVDELYLDLLALRELYILLLKSCLQDANSELQLDERAQILLKHLLDDATAGVLEFLSKDVATNSSIFYNFNWTWHKDDIQTKQLDDKAVEWMKHNQAAR

Query:  KMETSKIEHNPKRNRASASNVAKNDLSHGISSALRRIELHILSLQHCTSESRKTRSHWQSILQGNESLNQQKVLPRTDPPTLRSRFTKPIKGRDYFVGGQ
        KME SK E NP R+RASASN A NDL HGI SALRRIELHILSLQHCTS+S+KTR + Q +LQ NE+LNQQKV  +TD  TLR+ FTKP           
Subjt:  KMETSKIEHNPKRNRASASNVAKNDLSHGISSALRRIELHILSLQHCTSESRKTRSHWQSILQGNESLNQQKVLPRTDPPTLRSRFTKPIKGRDYFVGGQ

Query:  KIVKPITMHHCSEFVHGFRIPLNQSNDEA-MKPPTIETHLR-KQHKVVNPMTLIDKSGYTSVESKATFRPTMKLNQTSKQA-KRNQNSYGQMVMGPTLLD
          V PI  H  SEFVHG RIP +Q NDEA MKPP IE H+  KQHKVVNPMT   KSG TSV SKATFRP +KL++TSKQ  K+NQ+ YG MVMGPTLLD
Subjt:  KIVKPITMHHCSEFVHGFRIPLNQSNDEA-MKPPTIETHLR-KQHKVVNPMTLIDKSGYTSVESKATFRPTMKLNQTSKQA-KRNQNSYGQMVMGPTLLD

Query:  HHPSKEARKEQTHNNKTHLTTQQESEFTNSEFQSV--SSSSWTTQQTTESETVVD---DDDNDSSSPS-----HQDNLSTTDSKSSSRYSKKTFNIKQGK
        HHPS+E R+E+THNN  HL  Q+ESE +NSEF+S   SSSSWTTQ+TTES+T      D+D+DSSSPS     HQD+ STTDSKSSS YS KTFNIK G+
Subjt:  HHPSKEARKEQTHNNKTHLTTQQESEFTNSEFQSV--SSSSWTTQQTTESETVVD---DDDNDSSSPS-----HQDNLSTTDSKSSSRYSKKTFNIKQGK

Query:  KESKRAIGGLRRLKNKLGVIFHHHHHHHHHHDHDRHNFMWKQLRKIFNHRDNK-RLVSIEDSNEKVKKRATRNVCCKNQVGKFEALAEGLRSHVWRSKAM
        KESK+A+G  ++LKNKLGVIFHHHHHHHHHH H+ HNFMWKQL KIFNH++N+  +VS ED  EKVK RA R+VC KNQV KFEALAEGLRSHV RSKAM
Subjt:  KESKRAIGGLRRLKNKLGVIFHHHHHHHHHHDHDRHNFMWKQLRKIFNHRDNK-RLVSIEDSNEKVKKRATRNVCCKNQVGKFEALAEGLRSHVWRSKAM

Query:  KRKELKGMKCGKKKKE--GVRKLHWWKMFRNRRGVKLANRGRMKIGYVNR
        KRKE KGMK GKKKK+  GV+KL+WW+MFRNRRGVKL N+G MKIGYVNR
Subjt:  KRKELKGMKCGKKKKE--GVRKLHWWKMFRNRRGVKLANRGRMKIGYVNR

A0A1S3ATF3 uncharacterized protein LOC103482539 isoform X37.1e-16568.37Show/hide
Query:  LKHLLDDATAGVLEFLSKDVATNSSIFYNFNWTWHKDDIQTKQLDDKAVEWMKHNQAARKMETSKIEHNPKRNRASASNVAKNDLSHGISSALRRIELHI
        +K LLDDAT+GVLE +SK++A NS+I YNF    HKDD QTK +D K VEW+KHNQ ARKME SK E NP R+RASASN A NDL HGI SALRRIELHI
Subjt:  LKHLLDDATAGVLEFLSKDVATNSSIFYNFNWTWHKDDIQTKQLDDKAVEWMKHNQAARKMETSKIEHNPKRNRASASNVAKNDLSHGISSALRRIELHI

Query:  LSLQHCTSESRKTRSHWQSILQGNESLNQQKVLPRTDPPTLRSRFTKPIKGRDYFVGGQKIVKPITMHHCSEFVHGFRIPLNQSNDEA-MKPPTIETHLR
        LSLQHCTS+S+KTR + Q +LQ NE+LNQQKV  +TD  TLR+ FTKP             V PI  H  SEFVHG RIP +Q NDEA MKPP IE H+ 
Subjt:  LSLQHCTSESRKTRSHWQSILQGNESLNQQKVLPRTDPPTLRSRFTKPIKGRDYFVGGQKIVKPITMHHCSEFVHGFRIPLNQSNDEA-MKPPTIETHLR

Query:  -KQHKVVNPMTLIDKSGYTSVESKATFRPTMKLNQTSKQA-KRNQNSYGQMVMGPTLLDHHPSKEARKEQTHNNKTHLTTQQESEFTNSEFQSV--SSSS
         KQHKVVNPMT   KSG TSV SKATFRP +KL++TSKQ  K+NQ+ YG MVMGPTLLDHHPS+E R+E+THNN  HL  Q+ESE +NSEF+S   SSSS
Subjt:  -KQHKVVNPMTLIDKSGYTSVESKATFRPTMKLNQTSKQA-KRNQNSYGQMVMGPTLLDHHPSKEARKEQTHNNKTHLTTQQESEFTNSEFQSV--SSSS

Query:  WTTQQTTESETVVD---DDDNDSSSPS-----HQDNLSTTDSKSSSRYSKKTFNIKQGKKESKRAIGGLRRLKNKLGVIFHHHHHHHHHHDHDRHNFMWK
        WTTQ+TTES+T      D+D+DSSSPS     HQD+ STTDSKSSS YS KTFNIK G+KESK+A+G  ++LKNKLGVIFHHHHHHHHHH H+ HNFMWK
Subjt:  WTTQQTTESETVVD---DDDNDSSSPS-----HQDNLSTTDSKSSSRYSKKTFNIKQGKKESKRAIGGLRRLKNKLGVIFHHHHHHHHHHDHDRHNFMWK

Query:  QLRKIFNHRDNK-RLVSIEDSNEKVKKRATRNVCCKNQVGKFEALAEGLRSHVWRSKAMKRKELKGMKCGKKKKE--GVRKLHWWKMFRNRRGVKLANRG
        QL KIFNH++N+  +VS ED  EKVK RA R+VC KNQV KFEALAEGLRSHV RSKAMKRKE KGMK GKKKK+  GV+KL+WW+MFRNRRGVKL N+G
Subjt:  QLRKIFNHRDNK-RLVSIEDSNEKVKKRATRNVCCKNQVGKFEALAEGLRSHVWRSKAMKRKELKGMKCGKKKKE--GVRKLHWWKMFRNRRGVKLANRG

Query:  RMKIGYVNR
         MKIGYVNR
Subjt:  RMKIGYVNR

A0A1S3ATG7 uncharacterized protein LOC103482539 isoform X21.7e-18269.64Show/hide
Query:  MDVDELYLDLLALRELYILLLKSCLQDANSELQLDERAQILLKHLLDDATAGVLEFLSKDVATNSSIFYNFNWTWHKDDIQTKQLDDKAVEWMKHNQAAR
        M+VD+LYLDLLALRELYILLLK CL+D NSEL LDERAQIL+K LLDDAT+GVLE +SK++A NS+I YNF    HKDD QTK +D K VEW+KHNQ AR
Subjt:  MDVDELYLDLLALRELYILLLKSCLQDANSELQLDERAQILLKHLLDDATAGVLEFLSKDVATNSSIFYNFNWTWHKDDIQTKQLDDKAVEWMKHNQAAR

Query:  KMETSKIEHNPKRNRASASNVAKNDLSHGISSALRRIELHILSLQHCTSESRKTRSHWQSILQGNESLNQQKVLPRTDPPTLRSRFTKPIKGRDYFVGGQ
        KME SK E NP R+RASASN A NDL HGI SALRRIELHILSLQHCTS+S+KTR + Q +LQ NE+LNQQKV  +TD  TLR+ FTKP           
Subjt:  KMETSKIEHNPKRNRASASNVAKNDLSHGISSALRRIELHILSLQHCTSESRKTRSHWQSILQGNESLNQQKVLPRTDPPTLRSRFTKPIKGRDYFVGGQ

Query:  KIVKPITMHHCSEFVHGFRIPLNQSNDEA-MKPPTIETHLR-KQHKVVNPMTLIDKSGYTSVESKATFRPTMKLNQTSKQA-KRNQNSYGQMVMGPTLLD
          V PI  H  SEFVHG RIP +Q NDEA MKPP IE H+  KQHKVVNPMT   KSG TSV SKATFRP +KL++TSKQ  K+NQ+ YG MVMGPTLLD
Subjt:  KIVKPITMHHCSEFVHGFRIPLNQSNDEA-MKPPTIETHLR-KQHKVVNPMTLIDKSGYTSVESKATFRPTMKLNQTSKQA-KRNQNSYGQMVMGPTLLD

Query:  HHPSKEARKEQTHNNKTHLTTQQESEFTNSEFQSV--SSSSWTTQQTTESETVVD---DDDNDSSSPS-----HQDNLSTTDSKSSSRYSKKTFNIKQGK
        HHPS+E R+E+THNN  HL  Q+ESE +NSEF+S   SSSSWTTQ+TTES+T      D+D+DSSSPS     HQD+ STTDSKSSS YS KTFNIK G+
Subjt:  HHPSKEARKEQTHNNKTHLTTQQESEFTNSEFQSV--SSSSWTTQQTTESETVVD---DDDNDSSSPS-----HQDNLSTTDSKSSSRYSKKTFNIKQGK

Query:  KESKRAIGGLRRLKNKLGVIFHHHHHHHHHHDHDRHNFMWKQLRKIFNHRDNK-RLVSIEDSNEKVKKRATRNVCCKNQVGKFEALAEGLRSHVWRSKAM
        KESK+A+G  ++LKNKLGVIFHHHHHHHHHH H+ HNFMWKQL KIFNH++N+  +VS ED  EKVK RA R+VC KNQV KFEALAEGLRSHV RSKAM
Subjt:  KESKRAIGGLRRLKNKLGVIFHHHHHHHHHHDHDRHNFMWKQLRKIFNHRDNK-RLVSIEDSNEKVKKRATRNVCCKNQVGKFEALAEGLRSHVWRSKAM

Query:  KRKELKGMKCGKKKKE--GVRKLHWWKMFRNRRGVKLANRGRMKIGYVNR
        KRKE KGMK GKKKK+  GV+KL+WW+MFRNRRGVKL N+G MKIGYVNR
Subjt:  KRKELKGMKCGKKKKE--GVRKLHWWKMFRNRRGVKLANRGRMKIGYVNR

A0A5A7UIV0 Serine-rich adhesin for platelets1.5e-17069.63Show/hide
Query:  QLDERAQILLKHLLDDATAGVLEFLSKDVATNSSIFYNFNWTWHKDDIQTKQLDDKAVEWMKHNQAARKMETSKIEHNPKRNRASASNVAKNDLSHGISS
        QLDERAQILLK LLDDAT+GVLE +SK++A NS+I YNF    HKDD QTK +D K VEWMKHNQ ARKME SK E NP R+RASASN A NDL HGI S
Subjt:  QLDERAQILLKHLLDDATAGVLEFLSKDVATNSSIFYNFNWTWHKDDIQTKQLDDKAVEWMKHNQAARKMETSKIEHNPKRNRASASNVAKNDLSHGISS

Query:  ALRRIELHILSLQH-CTSESRKTRSHWQSILQGNESLNQQKVLPRTDPPTLRSRFTKPIKGRDYFVGGQKIVKPITMHHCSEFVHGFRIPLNQSNDEA-M
        ALRRIELHILSLQH CTS+S+KTR + Q +LQ NE+LNQQKV  +TD  TLR+  TKP             V PI  H  SEFVHG RIP +Q NDEA M
Subjt:  ALRRIELHILSLQH-CTSESRKTRSHWQSILQGNESLNQQKVLPRTDPPTLRSRFTKPIKGRDYFVGGQKIVKPITMHHCSEFVHGFRIPLNQSNDEA-M

Query:  KPPTIETHLR-KQHKVVNPMTLIDKSGYTSVESKATFRPTMKLNQTSKQA-KRNQNSYGQMVMGPTLLDHHPSKEARKEQTHNNKTHLTTQQESEFTNSE
        KPP IE H+  KQHKVVNPMT   KSG TSV SKATFRP MKL++TSKQ  K+NQ+ YG MVMGPTLLDHHPS+E R+E+THNN  HL +Q+ESE +NSE
Subjt:  KPPTIETHLR-KQHKVVNPMTLIDKSGYTSVESKATFRPTMKLNQTSKQA-KRNQNSYGQMVMGPTLLDHHPSKEARKEQTHNNKTHLTTQQESEFTNSE

Query:  FQSV--SSSSWTTQQTTESETVVD---DDDNDSSSPS-----HQDNLSTTDSKSSSRYSKKTFNIKQGKKESKRAIGGLRRLKNKLGVIFHHHHHHHHHH
        F+S   SSSSWTTQ+TTESET      D+D+DSSSPS     HQD+ STTDSKSSS YS KTFNIK G+KESK+A+G  ++LKNKLGVIFHHHHHHHHHH
Subjt:  FQSV--SSSSWTTQQTTESETVVD---DDDNDSSSPS-----HQDNLSTTDSKSSSRYSKKTFNIKQGKKESKRAIGGLRRLKNKLGVIFHHHHHHHHHH

Query:  DHDRHNFMWKQLRKIFNHRDNK-RLVSIEDSNEKVKKRATRNVCCKNQVGKFEALAEGLRSHVWRSKAMKRKELKGMKCGKKKKEGVRKLHWWKMFRNRR
         H+ HNFMWKQL KIFNH++N+  +VS ED  EKVK RA R+VC KNQV KFEALAEGLRSHV RSKAMKRKE KGMK GKKKK GV+KL+WW+MFRNRR
Subjt:  DHDRHNFMWKQLRKIFNHRDNK-RLVSIEDSNEKVKKRATRNVCCKNQVGKFEALAEGLRSHVWRSKAMKRKELKGMKCGKKKKEGVRKLHWWKMFRNRR

Query:  GVKLANRGRMKIGYVNR
        GVKL N+G MKIGYVNR
Subjt:  GVKLANRGRMKIGYVNR

SwissProt top hitse value%identityAlignment
Q9FFP2 Protein KOKOPELLI1.7e-1123.91Show/hide
Query:  MDVDELYLDLLALRELYILLLKSCLQDANSE-LQLDERAQILLKHLLDDATAGVLEFLSKDVATNSSIFYNFNWTWHKDDIQTKQLDDKAVEWMKH----
        M+ + +  +L +LR LY LL+ +   +   E  +LD+  Q LLK LLD A+       S  +AT   +F        K        +      +      
Subjt:  MDVDELYLDLLALRELYILLLKSCLQDANSE-LQLDERAQILLKHLLDDATAGVLEFLSKDVATNSSIFYNFNWTWHKDDIQTKQLDDKAVEWMKH----

Query:  --NQAARKMETSKIEHNPKRNRASASNVAKNDLSHGISSALRRIELHILSLQHCTS---ESRKTRSHWQSILQGNESLNQQKVLPRTDPPTLRSRFTKPI
           + ++ +E         R     SN +K+ L+  +  A+ +I+  + +L   +S      +TRS             +  V P T+     S+ + P 
Subjt:  --NQAARKMETSKIEHNPKRNRASASNVAKNDLSHGISSALRRIELHILSLQHCTS---ESRKTRSHWQSILQGNESLNQQKVLPRTDPPTLRSRFTKPI

Query:  KGRDYFVGGQKIVKPITMHHCSEFVHGFRI--PLNQSNDEAMKPPTIETHLRKQHKVVNPMTLIDKSGYTSVESKATFRPTMKLNQTSKQAKRNQNSYGQ
           +Y    QK V           + G  +  P N        PP              P+                  P   ++   K  + N+ S   
Subjt:  KGRDYFVGGQKIVKPITMHHCSEFVHGFRI--PLNQSNDEAMKPPTIETHLRKQHKVVNPMTLIDKSGYTSVESKATFRPTMKLNQTSKQAKRNQNSYGQ

Query:  MVMGPTLLDHH------PSKEARKEQTHNNKTHLTTQQESEFTNSEFQSVSSSSWTTQQTTESETVVDDDDNDSSSPSHQDNLSTTDSKSSSRYSKKTFN
         +M PTL+D         S E   +QT  + T   ++ E   T+ E+   + SS  ++  T++E   +     S  P + D++S  +  +S  ++ +  +
Subjt:  MVMGPTLLDHH------PSKEARKEQTHNNKTHLTTQQESEFTNSEFQSVSSSSWTTQQTTESETVVDDDDNDSSSPSHQDNLSTTDSKSSSRYSKKTFN

Query:  IKQGKKESKRAIGGLRRLKNKLGVIF-HHHHHHHHHHDHDRHN-FMWKQLRKIFNHRDNKRLVSIEDSNEKVKKRATRNVCCKNQVGKFEALAEGLRSHV
         + G K+ +  +G  +R+KNK+G IF HHHHHHHHHH HD+     W +L+  F+H+  ++  S E      + +       ++Q G F AL EGL  H 
Subjt:  IKQGKKESKRAIGGLRRLKNKLGVIF-HHHHHHHHHHDHDRHN-FMWKQLRKIFNHRDNKRLVSIEDSNEKVKKRATRNVCCKNQVGKFEALAEGLRSHV

Query:  WRSKAMKRKELKGMKCGKKKKEGVRKLHWWKMFRNRR--GVKLANRGRMKIG
          SK  K           + K   +K  WWK+ + R+  GVK+  RGR+K+G
Subjt:  WRSKAMKRKELKGMKCGKKKKEGVRKLHWWKMFRNRR--GVKLANRGRMKIG

Arabidopsis top hitse value%identityAlignment
AT5G63720.1 kokopelli1.2e-1223.91Show/hide
Query:  MDVDELYLDLLALRELYILLLKSCLQDANSE-LQLDERAQILLKHLLDDATAGVLEFLSKDVATNSSIFYNFNWTWHKDDIQTKQLDDKAVEWMKH----
        M+ + +  +L +LR LY LL+ +   +   E  +LD+  Q LLK LLD A+       S  +AT   +F        K        +      +      
Subjt:  MDVDELYLDLLALRELYILLLKSCLQDANSE-LQLDERAQILLKHLLDDATAGVLEFLSKDVATNSSIFYNFNWTWHKDDIQTKQLDDKAVEWMKH----

Query:  --NQAARKMETSKIEHNPKRNRASASNVAKNDLSHGISSALRRIELHILSLQHCTS---ESRKTRSHWQSILQGNESLNQQKVLPRTDPPTLRSRFTKPI
           + ++ +E         R     SN +K+ L+  +  A+ +I+  + +L   +S      +TRS             +  V P T+     S+ + P 
Subjt:  --NQAARKMETSKIEHNPKRNRASASNVAKNDLSHGISSALRRIELHILSLQHCTS---ESRKTRSHWQSILQGNESLNQQKVLPRTDPPTLRSRFTKPI

Query:  KGRDYFVGGQKIVKPITMHHCSEFVHGFRI--PLNQSNDEAMKPPTIETHLRKQHKVVNPMTLIDKSGYTSVESKATFRPTMKLNQTSKQAKRNQNSYGQ
           +Y    QK V           + G  +  P N        PP              P+                  P   ++   K  + N+ S   
Subjt:  KGRDYFVGGQKIVKPITMHHCSEFVHGFRI--PLNQSNDEAMKPPTIETHLRKQHKVVNPMTLIDKSGYTSVESKATFRPTMKLNQTSKQAKRNQNSYGQ

Query:  MVMGPTLLDHH------PSKEARKEQTHNNKTHLTTQQESEFTNSEFQSVSSSSWTTQQTTESETVVDDDDNDSSSPSHQDNLSTTDSKSSSRYSKKTFN
         +M PTL+D         S E   +QT  + T   ++ E   T+ E+   + SS  ++  T++E   +     S  P + D++S  +  +S  ++ +  +
Subjt:  MVMGPTLLDHH------PSKEARKEQTHNNKTHLTTQQESEFTNSEFQSVSSSSWTTQQTTESETVVDDDDNDSSSPSHQDNLSTTDSKSSSRYSKKTFN

Query:  IKQGKKESKRAIGGLRRLKNKLGVIF-HHHHHHHHHHDHDRHN-FMWKQLRKIFNHRDNKRLVSIEDSNEKVKKRATRNVCCKNQVGKFEALAEGLRSHV
         + G K+ +  +G  +R+KNK+G IF HHHHHHHHHH HD+     W +L+  F+H+  ++  S E      + +       ++Q G F AL EGL  H 
Subjt:  IKQGKKESKRAIGGLRRLKNKLGVIF-HHHHHHHHHHDHDRHN-FMWKQLRKIFNHRDNKRLVSIEDSNEKVKKRATRNVCCKNQVGKFEALAEGLRSHV

Query:  WRSKAMKRKELKGMKCGKKKKEGVRKLHWWKMFRNRR--GVKLANRGRMKIG
          SK  K           + K   +K  WWK+ + R+  GVK+  RGR+K+G
Subjt:  WRSKAMKRKELKGMKCGKKKKEGVRKLHWWKMFRNRR--GVKLANRGRMKIG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGTCGACGAGTTATATCTTGATCTCTTAGCACTGAGGGAATTATACATCCTTCTCTTGAAGAGTTGTTTACAAGATGCGAATTCCGAACTTCAGCTGGATGAAAG
AGCACAAATTCTATTGAAGCATTTGCTTGATGATGCTACTGCAGGAGTTTTGGAGTTTCTCTCAAAGGACGTGGCAACAAACTCCAGCATTTTCTACAATTTCAACTGGA
CGTGGCACAAAGATGATATACAGACGAAGCAACTGGACGATAAAGCTGTTGAATGGATGAAACATAATCAAGCTGCAAGAAAAATGGAAACTTCAAAGATTGAACACAAC
CCCAAAAGGAATAGAGCTTCAGCTTCAAATGTTGCAAAGAATGATTTATCACATGGCATCAGTTCAGCACTAAGGAGAATTGAACTCCACATTTTGTCATTGCAACATTG
CACAAGTGAAAGTAGGAAAACAAGAAGCCATTGGCAGTCTATTCTTCAAGGGAATGAGTCATTAAACCAACAGAAAGTTCTGCCAAGGACAGATCCCCCAACTTTAAGGA
GCCGATTTACTAAGCCGATTAAAGGAAGAGACTATTTTGTTGGTGGACAGAAGATAGTTAAGCCGATAACGATGCACCATTGTTCGGAGTTCGTTCATGGATTCAGAATA
CCTCTGAATCAAAGCAATGATGAGGCCATGAAACCTCCAACCATTGAAACCCATTTACGTAAACAACACAAAGTTGTAAATCCAATGACTCTGATAGATAAATCTGGATA
CACTTCAGTAGAATCCAAGGCAACCTTCAGGCCTACTATGAAACTGAATCAAACTAGTAAACAAGCAAAGAGGAACCAAAATTCATATGGTCAAATGGTAATGGGGCCAA
CTTTGCTAGATCATCATCCCTCCAAGGAAGCAAGGAAGGAACAAACTCATAACAACAAGACCCATTTGACTACACAGCAAGAATCAGAATTCACAAACTCAGAATTTCAA
TCAGTTTCTTCTTCAAGTTGGACAACTCAACAAACTACTGAAAGTGAAACCGTTGTTGATGACGACGACAACGACTCTTCTTCCCCAAGTCACCAAGACAATCTGTCGAC
AACCGATTCAAAATCAAGTAGCAGATATTCGAAGAAAACATTCAATATCAAGCAGGGGAAAAAAGAGTCGAAGAGAGCAATAGGAGGGTTGAGGAGACTCAAAAACAAAC
TTGGTGTTATCTTTCACCACCACCACCATCATCACCACCATCATGACCATGACAGACATAACTTCATGTGGAAGCAGCTGAGGAAGATCTTCAATCACAGAGACAACAAA
AGATTAGTAAGTATAGAAGATAGTAATGAGAAGGTAAAGAAGAGAGCCACTAGAAATGTGTGTTGCAAGAACCAAGTAGGGAAGTTTGAGGCACTTGCTGAAGGGCTAAG
AAGCCATGTATGGAGATCAAAAGCCATGAAAAGGAAAGAGCTTAAAGGGATGAAATGTGGGAAGAAGAAGAAGGAGGGTGTGAGGAAATTGCATTGGTGGAAAATGTTTC
GTAATAGGCGTGGAGTGAAGTTGGCCAATAGAGGGCGTATGAAAATAGGGTATGTAAATAGAAAAGCATAG
mRNA sequenceShow/hide mRNA sequence
ATGGATGTCGACGAGTTATATCTTGATCTCTTAGCACTGAGGGAATTATACATCCTTCTCTTGAAGAGTTGTTTACAAGATGCGAATTCCGAACTTCAGCTGGATGAAAG
AGCACAAATTCTATTGAAGCATTTGCTTGATGATGCTACTGCAGGAGTTTTGGAGTTTCTCTCAAAGGACGTGGCAACAAACTCCAGCATTTTCTACAATTTCAACTGGA
CGTGGCACAAAGATGATATACAGACGAAGCAACTGGACGATAAAGCTGTTGAATGGATGAAACATAATCAAGCTGCAAGAAAAATGGAAACTTCAAAGATTGAACACAAC
CCCAAAAGGAATAGAGCTTCAGCTTCAAATGTTGCAAAGAATGATTTATCACATGGCATCAGTTCAGCACTAAGGAGAATTGAACTCCACATTTTGTCATTGCAACATTG
CACAAGTGAAAGTAGGAAAACAAGAAGCCATTGGCAGTCTATTCTTCAAGGGAATGAGTCATTAAACCAACAGAAAGTTCTGCCAAGGACAGATCCCCCAACTTTAAGGA
GCCGATTTACTAAGCCGATTAAAGGAAGAGACTATTTTGTTGGTGGACAGAAGATAGTTAAGCCGATAACGATGCACCATTGTTCGGAGTTCGTTCATGGATTCAGAATA
CCTCTGAATCAAAGCAATGATGAGGCCATGAAACCTCCAACCATTGAAACCCATTTACGTAAACAACACAAAGTTGTAAATCCAATGACTCTGATAGATAAATCTGGATA
CACTTCAGTAGAATCCAAGGCAACCTTCAGGCCTACTATGAAACTGAATCAAACTAGTAAACAAGCAAAGAGGAACCAAAATTCATATGGTCAAATGGTAATGGGGCCAA
CTTTGCTAGATCATCATCCCTCCAAGGAAGCAAGGAAGGAACAAACTCATAACAACAAGACCCATTTGACTACACAGCAAGAATCAGAATTCACAAACTCAGAATTTCAA
TCAGTTTCTTCTTCAAGTTGGACAACTCAACAAACTACTGAAAGTGAAACCGTTGTTGATGACGACGACAACGACTCTTCTTCCCCAAGTCACCAAGACAATCTGTCGAC
AACCGATTCAAAATCAAGTAGCAGATATTCGAAGAAAACATTCAATATCAAGCAGGGGAAAAAAGAGTCGAAGAGAGCAATAGGAGGGTTGAGGAGACTCAAAAACAAAC
TTGGTGTTATCTTTCACCACCACCACCATCATCACCACCATCATGACCATGACAGACATAACTTCATGTGGAAGCAGCTGAGGAAGATCTTCAATCACAGAGACAACAAA
AGATTAGTAAGTATAGAAGATAGTAATGAGAAGGTAAAGAAGAGAGCCACTAGAAATGTGTGTTGCAAGAACCAAGTAGGGAAGTTTGAGGCACTTGCTGAAGGGCTAAG
AAGCCATGTATGGAGATCAAAAGCCATGAAAAGGAAAGAGCTTAAAGGGATGAAATGTGGGAAGAAGAAGAAGGAGGGTGTGAGGAAATTGCATTGGTGGAAAATGTTTC
GTAATAGGCGTGGAGTGAAGTTGGCCAATAGAGGGCGTATGAAAATAGGGTATGTAAATAGAAAAGCATAG
Protein sequenceShow/hide protein sequence
MDVDELYLDLLALRELYILLLKSCLQDANSELQLDERAQILLKHLLDDATAGVLEFLSKDVATNSSIFYNFNWTWHKDDIQTKQLDDKAVEWMKHNQAARKMETSKIEHN
PKRNRASASNVAKNDLSHGISSALRRIELHILSLQHCTSESRKTRSHWQSILQGNESLNQQKVLPRTDPPTLRSRFTKPIKGRDYFVGGQKIVKPITMHHCSEFVHGFRI
PLNQSNDEAMKPPTIETHLRKQHKVVNPMTLIDKSGYTSVESKATFRPTMKLNQTSKQAKRNQNSYGQMVMGPTLLDHHPSKEARKEQTHNNKTHLTTQQESEFTNSEFQ
SVSSSSWTTQQTTESETVVDDDDNDSSSPSHQDNLSTTDSKSSSRYSKKTFNIKQGKKESKRAIGGLRRLKNKLGVIFHHHHHHHHHHDHDRHNFMWKQLRKIFNHRDNK
RLVSIEDSNEKVKKRATRNVCCKNQVGKFEALAEGLRSHVWRSKAMKRKELKGMKCGKKKKEGVRKLHWWKMFRNRRGVKLANRGRMKIGYVNRKA