CuGenDBv2

Spg022931 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg022931
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: protein KOKOPELLI-like isoform X1
Genome location: scaffold5:13911624..13915445
RNA-Seq Expression: Spg022931
Synteny: Spg022931
Gene Ontology terms: NA
InterPro domains: NA
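
The genome location above is written as seqid:start..end. The following is a minimal Python sketch for splitting such a string into its parts and computing the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state, and `parse_location` is a hypothetical helper name, not part of any CuGenDB tooling.

```python
import re

def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end, length).

    Assumption: start and end are 1-based inclusive coordinates,
    so the span length is end - start + 1.
    """
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if end < start:
        raise ValueError("end coordinate is smaller than start coordinate")
    return seqid, start, end, end - start + 1

seqid, start, end, length = parse_location("scaffold5:13911624..13915445")
print(seqid, start, end, length)  # scaffold5 13911624 13915445 3822
```

Under that assumption the gene spans 3822 bp on scaffold5, which is longer than the 1701-nt CDS listed under Sequences.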


Homology
GenBank top hits (e value, % identity, alignment)
XP_022958321.1 uncharacterized protein LOC111459571 isoform X1 [Cucurbita moschata]  e value: 1.4e-162  % identity: 62.32
Query:  NKMDVDELYLDLLALRELYILLLKSCSRVANSEL-LDERAQTLLKHLLDDATAGILEFHSKNLATNSGIFYNFLHKDDKQTKPLDEKVAEWMEHNQTARK
        +KM+ DELYLDLLALR+LY+ LLK C R ANSEL +  RA+ L KHLLDDAT G+LEFHSK L      FYNFL KDDKQTKPLDEKVAEWMEHNQTAR 
Subjt:  NKMDVDELYLDLLALRELYILLLKSCSRVANSEL-LDERAQTLLKHLLDDATAGILEFHSKNLATNSGIFYNFLHKDDKQTKPLDEKVAEWMEHNQTARK

Query:  MVNPE-IEYNAKRARPSASNVATNDLSNDISSALRRIELHILSLQRCTSQSRNTRNHIRETNLAYCGLSVLQGNETLKQQKVQSRTDHSTLRTSFAESIK
        M NPE IE+   R R SASNVA NDLS+ ISSALRRIELHILSLQ      R TR+HI ET LAY G SV QGNE+L                       
Subjt:  MVNPE-IEYNAKRARPSASNVATNDLSNDISSALRRIELHILSLQRCTSQSRNTRNHIRETNLAYCGLSVLQGNETLKQQKVQSRTDHSTLRTSFAESIK

Query:  GHNLSSQLRSHLVGGQKVKLIVTNHCSEFVHGFRIPLRQDNDEAMKPPTVETRIFKQHKLVNPMTLIDKSGCSVGSKATVRFGLKLNQTRIQERRIQHSY
                       QKVK +V NHCS+FVHGFRIPL QD +EAM          KQH+L  P TL+DKSGC  GSKAT R  +KLN+T IQE+R ++S 
Subjt:  GHNLSSQLRSHLVGGQKVKLIVTNHCSEFVHGFRIPLRQDNDEAMKPPTVETRIFKQHKLVNPMTLIDKSGCSVGSKATVRFGLKLNQTRIQERRIQHSY

Query:  GRTIMRPTLLDHPSREVRKEQTPNKTHLATQQESEFTNSESNFTNSESESTSSSSWTTQQTSESETTYDASSPSHQDGSPATGSEASSRYRSSSSSISTK
        GR +MRPTL              NKTHLA QQESE       +TNSESES  SSS  T+QTSESETT D+SSP  Q   PATGSEASS+  +SSS+IS +
Subjt:  GRTIMRPTLLDHPSREVRKEQTPNKTHLATQQESEFTNSESNFTNSESESTSSSSWTTQQTSESETTYDASSPSHQDGSPATGSEASSRYRSSSSSISTK

Query:  AFKFNHGKKESERAIGRFKRLKNKLGLIFHHHHHHHHHHHHNSNNFMWKQLRKIFHRTDNRKLTSKVERYGMLKKTAIRSVSRKNQVGKFQALAEGLRSH
        AFKF+HGKKES++A+GRFK L+NKLGLIF    HHHHHH+HN +N MWKQ+R++FHRT  ++LTSK E+ GML+KT IRSVSR NQVGKFQALAEGLRSH
Subjt:  AFKFNHGKKESERAIGRFKRLKNKLGLIFHHHHHHHHHHHHNSNNFMWKQLRKIFHRTDNRKLTSKVERYGMLKKTAIRSVSRKNQVGKFQALAEGLRSH

Query:  VWRSKAMKKKELRGLNCGKKKGVKKLHWWKMFRRHRGVKLPNKERVKIGYVNRKTQPQVV
        VW+SKAMKKKE RGLNCGK  G KKLHWWKM RR RGVKLPNK RVKIGYVN+K   +++
Subjt:  VWRSKAMKKKELRGLNCGKKKGVKKLHWWKMFRRHRGVKLPNKERVKIGYVNRKTQPQVV

XP_022996025.1 uncharacterized protein LOC111491355 isoform X1 [Cucurbita maxima]  e value: 2.5e-167  % identity: 63.59
Query:  NKMDVDELYLDLLALRELYILLLKSCSRVANSEL-LDERAQTLLKHLLDDATAGILEFHSKNLATNSGIFYNFLHKDDKQTKPLDEKVAEWMEHNQTARK
        +KM+ DELYLDLLALR+LY  LLK C R ANSEL +  RA+ LLKHLLDDAT G+LEFHSK LA     FYNFL KDDKQTKPLDEKVAEWMEHNQTAR+
Subjt:  NKMDVDELYLDLLALRELYILLLKSCSRVANSEL-LDERAQTLLKHLLDDATAGILEFHSKNLATNSGIFYNFLHKDDKQTKPLDEKVAEWMEHNQTARK

Query:  MVNPE-IEYNAKRARPSASNVATNDLSNDISSALRRIELHILSLQRCTSQSRNTRNHIRETNLAYCGLSVLQGNETLKQQKVQSRTDHSTLRTSFAESIK
        M NPE IE+  +R R SASNVA NDLS+ I+SALRRIELHILSLQ      R TR+HI ET LAY G SV QGNE+  Q                     
Subjt:  MVNPE-IEYNAKRARPSASNVATNDLSNDISSALRRIELHILSLQRCTSQSRNTRNHIRETNLAYCGLSVLQGNETLKQQKVQSRTDHSTLRTSFAESIK

Query:  GHNLSSQLRSHLVGGQKVKLIVTNHCSEFVHGFRIPLRQDNDEAMKPPTVETRIFKQHKLVNPMTLIDKSGCSVGSKATVRFGLKLNQTRIQERRIQHSY
                       QKVK +V NHCS+FV+GFRIPL QD DEAM          KQH+LV P TL+DKSGC  GSKAT R  +KLN+T IQE+R ++S 
Subjt:  GHNLSSQLRSHLVGGQKVKLIVTNHCSEFVHGFRIPLRQDNDEAMKPPTVETRIFKQHKLVNPMTLIDKSGCSVGSKATVRFGLKLNQTRIQERRIQHSY

Query:  GRTIMRPTLLDHPSREVRKEQT-PNKTHLATQQESEFTNSESNFTNSESESTSSSSWTTQQTSESETTYDASSPSHQDGSPATGSEASSRYRSSSSSIST
        GR +M+PTL  HPSREVRKEQT  N+ HLA QQESEFTN         SES S SS  T QTSESETT D+SSP +Q    ATGSEASS+Y +SSS+I+ 
Subjt:  GRTIMRPTLLDHPSREVRKEQT-PNKTHLATQQESEFTNSESNFTNSESESTSSSSWTTQQTSESETTYDASSPSHQDGSPATGSEASSRYRSSSSSIST

Query:  KAFKFNHGKKESERAIGRFKRLKNKLGLIFHH--HHHHHHHHHHNSNNFMWKQLRKIFHRTDNRKLTSKVERYGMLKKTAIRSVSRKNQVGKFQALAEGL
        KAFKF+HGKKES  A+GRFK L+NKLGLIFHH  HH HHHHHHH+ +N MWKQ+R +FHRTD ++LTSK E+ G L+KT IRSVSR NQVGKFQAL EGL
Subjt:  KAFKFNHGKKESERAIGRFKRLKNKLGLIFHH--HHHHHHHHHHNSNNFMWKQLRKIFHRTDNRKLTSKVERYGMLKKTAIRSVSRKNQVGKFQALAEGL

Query:  RSHVWRSKAMKKKELRGLNCGKKKGVKKLHWWKMFRRHRGVKLPNKERVKIGYVNRKTQPQVV
        RSHVW+SKAMKKKE RGLNCG     KKLHWWKM RR RGVK PNK RVKIGYVNRK   +++
Subjt:  RSHVWRSKAMKKKELRGLNCGKKKGVKKLHWWKMFRRHRGVKLPNKERVKIGYVNRKTQPQVV

XP_022996027.1 uncharacterized protein LOC111491355 isoform X2 [Cucurbita maxima]  e value: 1.6e-166  % identity: 63.64
Query:  MDVDELYLDLLALRELYILLLKSCSRVANSEL-LDERAQTLLKHLLDDATAGILEFHSKNLATNSGIFYNFLHKDDKQTKPLDEKVAEWMEHNQTARKMV
        M+ DELYLDLLALR+LY  LLK C R ANSEL +  RA+ LLKHLLDDAT G+LEFHSK LA     FYNFL KDDKQTKPLDEKVAEWMEHNQTAR+M 
Subjt:  MDVDELYLDLLALRELYILLLKSCSRVANSEL-LDERAQTLLKHLLDDATAGILEFHSKNLATNSGIFYNFLHKDDKQTKPLDEKVAEWMEHNQTARKMV

Query:  NPE-IEYNAKRARPSASNVATNDLSNDISSALRRIELHILSLQRCTSQSRNTRNHIRETNLAYCGLSVLQGNETLKQQKVQSRTDHSTLRTSFAESIKGH
        NPE IE+  +R R SASNVA NDLS+ I+SALRRIELHILSLQ      R TR+HI ET LAY G SV QGNE+  Q                       
Subjt:  NPE-IEYNAKRARPSASNVATNDLSNDISSALRRIELHILSLQRCTSQSRNTRNHIRETNLAYCGLSVLQGNETLKQQKVQSRTDHSTLRTSFAESIKGH

Query:  NLSSQLRSHLVGGQKVKLIVTNHCSEFVHGFRIPLRQDNDEAMKPPTVETRIFKQHKLVNPMTLIDKSGCSVGSKATVRFGLKLNQTRIQERRIQHSYGR
                     QKVK +V NHCS+FV+GFRIPL QD DEAM          KQH+LV P TL+DKSGC  GSKAT R  +KLN+T IQE+R ++S GR
Subjt:  NLSSQLRSHLVGGQKVKLIVTNHCSEFVHGFRIPLRQDNDEAMKPPTVETRIFKQHKLVNPMTLIDKSGCSVGSKATVRFGLKLNQTRIQERRIQHSYGR

Query:  TIMRPTLLDHPSREVRKEQT-PNKTHLATQQESEFTNSESNFTNSESESTSSSSWTTQQTSESETTYDASSPSHQDGSPATGSEASSRYRSSSSSISTKA
         +M+PTL  HPSREVRKEQT  N+ HLA QQESEFTN         SES S SS  T QTSESETT D+SSP +Q    ATGSEASS+Y +SSS+I+ KA
Subjt:  TIMRPTLLDHPSREVRKEQT-PNKTHLATQQESEFTNSESNFTNSESESTSSSSWTTQQTSESETTYDASSPSHQDGSPATGSEASSRYRSSSSSISTKA

Query:  FKFNHGKKESERAIGRFKRLKNKLGLIFHH--HHHHHHHHHHNSNNFMWKQLRKIFHRTDNRKLTSKVERYGMLKKTAIRSVSRKNQVGKFQALAEGLRS
        FKF+HGKKES  A+GRFK L+NKLGLIFHH  HH HHHHHHH+ +N MWKQ+R +FHRTD ++LTSK E+ G L+KT IRSVSR NQVGKFQAL EGLRS
Subjt:  FKFNHGKKESERAIGRFKRLKNKLGLIFHH--HHHHHHHHHHNSNNFMWKQLRKIFHRTDNRKLTSKVERYGMLKKTAIRSVSRKNQVGKFQALAEGLRS

Query:  HVWRSKAMKKKELRGLNCGKKKGVKKLHWWKMFRRHRGVKLPNKERVKIGYVNRKTQPQVV
        HVW+SKAMKKKE RGLNCG     KKLHWWKM RR RGVK PNK RVKIGYVNRK   +++
Subjt:  HVWRSKAMKKKELRGLNCGKKKGVKKLHWWKMFRRHRGVKLPNKERVKIGYVNRKTQPQVV

XP_038877121.1 protein KOKOPELLI-like isoform X1 [Benincasa hispida]  e value: 6.5e-171  % identity: 66.13
Query:  MDVDELYLDLLALRELYILLLKSCSRVANSELLDERAQTLLKHLLDDATAGILEFHSKNLATNSGIFYNFLHKDDKQTKPLDEKVAEWMEHNQTARKMVN
        MDVD+LYLDLLALRELYILLLKSC   ANSELLDERAQ LLKHLLDDATAG+LEF S +LATNS IF NFLHKDDKQ KPL +KV EWM+HNQT RKM N
Subjt:  MDVDELYLDLLALRELYILLLKSCSRVANSELLDERAQTLLKHLLDDATAGILEFHSKNLATNSGIFYNFLHKDDKQTKPLDEKVAEWMEHNQTARKMVN

Query:  PEIEYNAKRARPSASNVATNDLSNDISSALRRIELHILSLQRCTSQSRNTRNHIRETNLAYCGLSVLQGNETLKQQKVQSRTDHSTLRTSFAESIKGHNL
        PEI     R R SASNVA N+LS+ ISSALRRIELHILSLQ CTSQ R TR H +         SVLQ NE+L QQ V  RT  STLR+ F + IKG   
Subjt:  PEIEYNAKRARPSASNVATNDLSNDISSALRRIELHILSLQRCTSQSRNTRNHIRETNLAYCGLSVLQGNETLKQQKVQSRTDHSTLRTSFAESIKGHNL

Query:  SSQLRSHLVGGQ-KVKLIVTNHCSEFVHGFRIPLRQDNDEAMKPPTVETRIFKQHKLVNPMTLIDKSG-CSVGSKATVRFGLKLNQT-RIQERRIQHSYG
            R H VG Q KVK    NHCSE+VHGFRIPL Q NDEAMKP T+ET I KQHK+VNPMTLIDKSG  SVGSKAT R  +KLNQT + Q +R Q+SYG
Subjt:  SSQLRSHLVGGQ-KVKLIVTNHCSEFVHGFRIPLRQDNDEAMKPPTVETRIFKQHKLVNPMTLIDKSG-CSVGSKATVRFGLKLNQT-RIQERRIQHSYG

Query:  RTIMRPTLLD-HPSREVRKEQTPNKTHL-ATQQESEFTNSESNFTNSESESTSSSSWTTQQTSESETT-----YDASSPSHQDGSPATGSEASSRYRSSS
        + +M PTLLD HPS+E R E+  +KTHL ATQQESEFT+SE      +S S+SSSSWTTQ+TS SET       + SSPSHQD   +T S++SS      
Subjt:  RTIMRPTLLD-HPSREVRKEQTPNKTHL-ATQQESEFTNSESNFTNSESESTSSSSWTTQQTSESETT-----YDASSPSHQDGSPATGSEASSRYRSSS

Query:  SSISTKAFKFNHGKKESERAIGRFKRLKNKLGLIFHHHHHHHHHHHHNSNNFMWK-QLRKIFHRTDNRK-LTSKVERYGMLKKTAIRSVSRKNQVGKFQA
            TK F    GK ES++ +GRFKRLKNKLG++F HHHHHHHHHHHNSNNFMWK QLRKIFH  DN++ L SK +    +KK AIR+V  KNQVGKFQA
Subjt:  SSISTKAFKFNHGKKESERAIGRFKRLKNKLGLIFHHHHHHHHHHHHNSNNFMWK-QLRKIFHRTDNRK-LTSKVERYGMLKKTAIRSVSRKNQVGKFQA

Query:  LAEGLRSHVWRSKAMKKKELRGLNCGKKKGVKKLHWWKMFRRHRGVKLPNKERVKIGYVNRKTQ
        LAEGLRSHVWRSKAMK+K ++G+ CG KKGVKKLHWWKMFR  RGV+LPNK  +KIGYVN+K +
Subjt:  LAEGLRSHVWRSKAMKKKELRGLNCGKKKGVKKLHWWKMFRRHRGVKLPNKERVKIGYVNRKTQ

XP_038877123.1 protein KOKOPELLI-like isoform X3 [Benincasa hispida]  e value: 6.5e-171  % identity: 66.13
Query:  MDVDELYLDLLALRELYILLLKSCSRVANSELLDERAQTLLKHLLDDATAGILEFHSKNLATNSGIFYNFLHKDDKQTKPLDEKVAEWMEHNQTARKMVN
        MDVD+LYLDLLALRELYILLLKSC   ANSELLDERAQ LLKHLLDDATAG+LEF S +LATNS IF NFLHKDDKQ KPL +KV EWM+HNQT RKM N
Subjt:  MDVDELYLDLLALRELYILLLKSCSRVANSELLDERAQTLLKHLLDDATAGILEFHSKNLATNSGIFYNFLHKDDKQTKPLDEKVAEWMEHNQTARKMVN

Query:  PEIEYNAKRARPSASNVATNDLSNDISSALRRIELHILSLQRCTSQSRNTRNHIRETNLAYCGLSVLQGNETLKQQKVQSRTDHSTLRTSFAESIKGHNL
        PEI     R R SASNVA N+LS+ ISSALRRIELHILSLQ CTSQ R TR H +         SVLQ NE+L QQ V  RT  STLR+ F + IKG   
Subjt:  PEIEYNAKRARPSASNVATNDLSNDISSALRRIELHILSLQRCTSQSRNTRNHIRETNLAYCGLSVLQGNETLKQQKVQSRTDHSTLRTSFAESIKGHNL

Query:  SSQLRSHLVGGQ-KVKLIVTNHCSEFVHGFRIPLRQDNDEAMKPPTVETRIFKQHKLVNPMTLIDKSG-CSVGSKATVRFGLKLNQT-RIQERRIQHSYG
            R H VG Q KVK    NHCSE+VHGFRIPL Q NDEAMKP T+ET I KQHK+VNPMTLIDKSG  SVGSKAT R  +KLNQT + Q +R Q+SYG
Subjt:  SSQLRSHLVGGQ-KVKLIVTNHCSEFVHGFRIPLRQDNDEAMKPPTVETRIFKQHKLVNPMTLIDKSG-CSVGSKATVRFGLKLNQT-RIQERRIQHSYG

Query:  RTIMRPTLLD-HPSREVRKEQTPNKTHL-ATQQESEFTNSESNFTNSESESTSSSSWTTQQTSESETT-----YDASSPSHQDGSPATGSEASSRYRSSS
        + +M PTLLD HPS+E R E+  +KTHL ATQQESEFT+SE      +S S+SSSSWTTQ+TS SET       + SSPSHQD   +T S++SS      
Subjt:  RTIMRPTLLD-HPSREVRKEQTPNKTHL-ATQQESEFTNSESNFTNSESESTSSSSWTTQQTSESETT-----YDASSPSHQDGSPATGSEASSRYRSSS

Query:  SSISTKAFKFNHGKKESERAIGRFKRLKNKLGLIFHHHHHHHHHHHHNSNNFMWK-QLRKIFHRTDNRK-LTSKVERYGMLKKTAIRSVSRKNQVGKFQA
            TK F    GK ES++ +GRFKRLKNKLG++F HHHHHHHHHHHNSNNFMWK QLRKIFH  DN++ L SK +    +KK AIR+V  KNQVGKFQA
Subjt:  SSISTKAFKFNHGKKESERAIGRFKRLKNKLGLIFHHHHHHHHHHHHNSNNFMWK-QLRKIFHRTDNRK-LTSKVERYGMLKKTAIRSVSRKNQVGKFQA

Query:  LAEGLRSHVWRSKAMKKKELRGLNCGKKKGVKKLHWWKMFRRHRGVKLPNKERVKIGYVNRKTQ
        LAEGLRSHVWRSKAMK+K ++G+ CG KKGVKKLHWWKMFR  RGV+LPNK  +KIGYVN+K +
Subjt:  LAEGLRSHVWRSKAMKKKELRGLNCGKKKGVKKLHWWKMFRRHRGVKLPNKERVKIGYVNRKTQ

TrEMBL top hits (e value, % identity, alignment)
A0A6J1ETH9 protein KOKOPELLI-like isoform X1  e value: 1.4e-158  % identity: 61.84
Query:  MDVDELYLDLLALRELYILLLKSCSRVANSELLDERAQTLLKHLLDDATAGILEFHSKNLATNSGIFYNFLHKDDKQTKPLDEKVAEWMEHNQTARKMVN
        MDVDE YLDLLALRELYILLLKSC R A SELLDERAQ LLK+LLDDATA +LEF  KN+AT+SGIFY FLHKDDKQ+KPLDEKV EWM+          
Subjt:  MDVDELYLDLLALRELYILLLKSCSRVANSELLDERAQTLLKHLLDDATAGILEFHSKNLATNSGIFYNFLHKDDKQTKPLDEKVAEWMEHNQTARKMVN

Query:  PEIEYNAKRARPSASNVATNDLSNDISSALRRIELHILSLQRCTSQSRNTRNHIRETNLAYCGLSVLQGNETLKQQKVQSRTDHSTLRTSFAESIKGHNL
               KRAR SASN  T+ +   ISSA+RRIE HILSLQR TSQS+  R+HI     +YCG SVL+GNET  +QKVQSRTDHST+             
Subjt:  PEIEYNAKRARPSASNVATNDLSNDISSALRRIELHILSLQRCTSQSRNTRNHIRETNLAYCGLSVLQGNETLKQQKVQSRTDHSTLRTSFAESIKGHNL

Query:  SSQLRSHLVGGQKVKLIVTNHCSEFVHGFRIPLRQDNDEAMKPPTVETRIFKQHKLVNPMTLIDKSGCSVGSKATVRFGLKLNQTRIQERRIQHSYGRTI
        + Q++  LVGGQ  K +VT HCSEFVHGFR+PL Q + E  KP  VET + KQHKLVNPMTLIDK G SVGSKAT+R   K +Q+R+  ++ Q+SYG  +
Subjt:  SSQLRSHLVGGQKVKLIVTNHCSEFVHGFRIPLRQDNDEAMKPPTVETRIFKQHKLVNPMTLIDKSGCSVGSKATVRFGLKLNQTRIQERRIQHSYGRTI

Query:  MRPTLLDHPSREVRKEQTPNKTHLATQQESEFTNSESNFTNSESESTSSSSWTTQQTSESETTYDASSPSHQDGSPATGSEASSRYRSSSSSISTKAFKF
        M+PTLLDHPSREVRKE+T  KTHLATQ ESEFT           +S  SSSWTTQQTSES T  D SSPSHQD  PA  SE SS              ++
Subjt:  MRPTLLDHPSREVRKEQTPNKTHLATQQESEFTNSESNFTNSESESTSSSSWTTQQTSESETTYDASSPSHQDGSPATGSEASSRYRSSSSSISTKAFKF

Query:  NHGKKESERAIGRFKRLKNKLGLIFHHHHHHHHHHHHNSNNFMWKQLRKIFHRTDNRKLTSKVERYGMLKKTAIRSVSRKNQVGKFQALAEGLRSHVWRS
        + GKKES+RAIGRFKRLKNKLG+IF   HHHHHHHHHNS++FMW ++RKIFH T+N+KLTS  +RY   K TAIRS  R NQVGKFQA+A+ LRSHV RS
Subjt:  NHGKKESERAIGRFKRLKNKLGLIFHHHHHHHHHHHHNSNNFMWKQLRKIFHRTDNRKLTSKVERYGMLKKTAIRSVSRKNQVGKFQALAEGLRSHVWRS

Query:  KAMKKKELRGLNCGKKKGVKKLHWWKMFRRHRGVKLPNKERV-KIGYVNRKTQ
        KA+ KK+   + CG KKGVKKLHWWK+FR   GV+L NK R+ +I YVN+K Q
Subjt:  KAMKKKELRGLNCGKKKGVKKLHWWKMFRRHRGVKLPNKERV-KIGYVNRKTQ

A0A6J1H1S0 uncharacterized protein LOC111459571 isoform X1  e value: 7.0e-163  % identity: 62.32
Query:  NKMDVDELYLDLLALRELYILLLKSCSRVANSEL-LDERAQTLLKHLLDDATAGILEFHSKNLATNSGIFYNFLHKDDKQTKPLDEKVAEWMEHNQTARK
        +KM+ DELYLDLLALR+LY+ LLK C R ANSEL +  RA+ L KHLLDDAT G+LEFHSK L      FYNFL KDDKQTKPLDEKVAEWMEHNQTAR 
Subjt:  NKMDVDELYLDLLALRELYILLLKSCSRVANSEL-LDERAQTLLKHLLDDATAGILEFHSKNLATNSGIFYNFLHKDDKQTKPLDEKVAEWMEHNQTARK

Query:  MVNPE-IEYNAKRARPSASNVATNDLSNDISSALRRIELHILSLQRCTSQSRNTRNHIRETNLAYCGLSVLQGNETLKQQKVQSRTDHSTLRTSFAESIK
        M NPE IE+   R R SASNVA NDLS+ ISSALRRIELHILSLQ      R TR+HI ET LAY G SV QGNE+L                       
Subjt:  MVNPE-IEYNAKRARPSASNVATNDLSNDISSALRRIELHILSLQRCTSQSRNTRNHIRETNLAYCGLSVLQGNETLKQQKVQSRTDHSTLRTSFAESIK

Query:  GHNLSSQLRSHLVGGQKVKLIVTNHCSEFVHGFRIPLRQDNDEAMKPPTVETRIFKQHKLVNPMTLIDKSGCSVGSKATVRFGLKLNQTRIQERRIQHSY
                       QKVK +V NHCS+FVHGFRIPL QD +EAM          KQH+L  P TL+DKSGC  GSKAT R  +KLN+T IQE+R ++S 
Subjt:  GHNLSSQLRSHLVGGQKVKLIVTNHCSEFVHGFRIPLRQDNDEAMKPPTVETRIFKQHKLVNPMTLIDKSGCSVGSKATVRFGLKLNQTRIQERRIQHSY

Query:  GRTIMRPTLLDHPSREVRKEQTPNKTHLATQQESEFTNSESNFTNSESESTSSSSWTTQQTSESETTYDASSPSHQDGSPATGSEASSRYRSSSSSISTK
        GR +MRPTL              NKTHLA QQESE       +TNSESES  SSS  T+QTSESETT D+SSP  Q   PATGSEASS+  +SSS+IS +
Subjt:  GRTIMRPTLLDHPSREVRKEQTPNKTHLATQQESEFTNSESNFTNSESESTSSSSWTTQQTSESETTYDASSPSHQDGSPATGSEASSRYRSSSSSISTK

Query:  AFKFNHGKKESERAIGRFKRLKNKLGLIFHHHHHHHHHHHHNSNNFMWKQLRKIFHRTDNRKLTSKVERYGMLKKTAIRSVSRKNQVGKFQALAEGLRSH
        AFKF+HGKKES++A+GRFK L+NKLGLIF    HHHHHH+HN +N MWKQ+R++FHRT  ++LTSK E+ GML+KT IRSVSR NQVGKFQALAEGLRSH
Subjt:  AFKFNHGKKESERAIGRFKRLKNKLGLIFHHHHHHHHHHHHNSNNFMWKQLRKIFHRTDNRKLTSKVERYGMLKKTAIRSVSRKNQVGKFQALAEGLRSH

Query:  VWRSKAMKKKELRGLNCGKKKGVKKLHWWKMFRRHRGVKLPNKERVKIGYVNRKTQPQVV
        VW+SKAMKKKE RGLNCGK  G KKLHWWKM RR RGVKLPNK RVKIGYVN+K   +++
Subjt:  VWRSKAMKKKELRGLNCGKKKGVKKLHWWKMFRRHRGVKLPNKERVKIGYVNRKTQPQVV

A0A6J1H2T7 uncharacterized protein LOC111459571 isoform X2  e value: 4.5e-162  % identity: 62.37
Query:  MDVDELYLDLLALRELYILLLKSCSRVANSEL-LDERAQTLLKHLLDDATAGILEFHSKNLATNSGIFYNFLHKDDKQTKPLDEKVAEWMEHNQTARKMV
        M+ DELYLDLLALR+LY+ LLK C R ANSEL +  RA+ L KHLLDDAT G+LEFHSK L      FYNFL KDDKQTKPLDEKVAEWMEHNQTAR M 
Subjt:  MDVDELYLDLLALRELYILLLKSCSRVANSEL-LDERAQTLLKHLLDDATAGILEFHSKNLATNSGIFYNFLHKDDKQTKPLDEKVAEWMEHNQTARKMV

Query:  NPE-IEYNAKRARPSASNVATNDLSNDISSALRRIELHILSLQRCTSQSRNTRNHIRETNLAYCGLSVLQGNETLKQQKVQSRTDHSTLRTSFAESIKGH
        NPE IE+   R R SASNVA NDLS+ ISSALRRIELHILSLQ      R TR+HI ET LAY G SV QGNE+L                         
Subjt:  NPE-IEYNAKRARPSASNVATNDLSNDISSALRRIELHILSLQRCTSQSRNTRNHIRETNLAYCGLSVLQGNETLKQQKVQSRTDHSTLRTSFAESIKGH

Query:  NLSSQLRSHLVGGQKVKLIVTNHCSEFVHGFRIPLRQDNDEAMKPPTVETRIFKQHKLVNPMTLIDKSGCSVGSKATVRFGLKLNQTRIQERRIQHSYGR
                     QKVK +V NHCS+FVHGFRIPL QD +EAM          KQH+L  P TL+DKSGC  GSKAT R  +KLN+T IQE+R ++S GR
Subjt:  NLSSQLRSHLVGGQKVKLIVTNHCSEFVHGFRIPLRQDNDEAMKPPTVETRIFKQHKLVNPMTLIDKSGCSVGSKATVRFGLKLNQTRIQERRIQHSYGR

Query:  TIMRPTLLDHPSREVRKEQTPNKTHLATQQESEFTNSESNFTNSESESTSSSSWTTQQTSESETTYDASSPSHQDGSPATGSEASSRYRSSSSSISTKAF
         +MRPTL              NKTHLA QQESE       +TNSESES  SSS  T+QTSESETT D+SSP  Q   PATGSEASS+  +SSS+IS +AF
Subjt:  TIMRPTLLDHPSREVRKEQTPNKTHLATQQESEFTNSESNFTNSESESTSSSSWTTQQTSESETTYDASSPSHQDGSPATGSEASSRYRSSSSSISTKAF

Query:  KFNHGKKESERAIGRFKRLKNKLGLIFHHHHHHHHHHHHNSNNFMWKQLRKIFHRTDNRKLTSKVERYGMLKKTAIRSVSRKNQVGKFQALAEGLRSHVW
        KF+HGKKES++A+GRFK L+NKLGLIF    HHHHHH+HN +N MWKQ+R++FHRT  ++LTSK E+ GML+KT IRSVSR NQVGKFQALAEGLRSHVW
Subjt:  KFNHGKKESERAIGRFKRLKNKLGLIFHHHHHHHHHHHHNSNNFMWKQLRKIFHRTDNRKLTSKVERYGMLKKTAIRSVSRKNQVGKFQALAEGLRSHVW

Query:  RSKAMKKKELRGLNCGKKKGVKKLHWWKMFRRHRGVKLPNKERVKIGYVNRKTQPQVV
        +SKAMKKKE RGLNCGK  G KKLHWWKM RR RGVKLPNK RVKIGYVN+K   +++
Subjt:  RSKAMKKKELRGLNCGKKKGVKKLHWWKMFRRHRGVKLPNKERVKIGYVNRKTQPQVV

A0A6J1K0S1 uncharacterized protein LOC111491355 isoform X2  e value: 8.0e-167  % identity: 63.64
Query:  MDVDELYLDLLALRELYILLLKSCSRVANSEL-LDERAQTLLKHLLDDATAGILEFHSKNLATNSGIFYNFLHKDDKQTKPLDEKVAEWMEHNQTARKMV
        M+ DELYLDLLALR+LY  LLK C R ANSEL +  RA+ LLKHLLDDAT G+LEFHSK LA     FYNFL KDDKQTKPLDEKVAEWMEHNQTAR+M 
Subjt:  MDVDELYLDLLALRELYILLLKSCSRVANSEL-LDERAQTLLKHLLDDATAGILEFHSKNLATNSGIFYNFLHKDDKQTKPLDEKVAEWMEHNQTARKMV

Query:  NPE-IEYNAKRARPSASNVATNDLSNDISSALRRIELHILSLQRCTSQSRNTRNHIRETNLAYCGLSVLQGNETLKQQKVQSRTDHSTLRTSFAESIKGH
        NPE IE+  +R R SASNVA NDLS+ I+SALRRIELHILSLQ      R TR+HI ET LAY G SV QGNE+  Q                       
Subjt:  NPE-IEYNAKRARPSASNVATNDLSNDISSALRRIELHILSLQRCTSQSRNTRNHIRETNLAYCGLSVLQGNETLKQQKVQSRTDHSTLRTSFAESIKGH

Query:  NLSSQLRSHLVGGQKVKLIVTNHCSEFVHGFRIPLRQDNDEAMKPPTVETRIFKQHKLVNPMTLIDKSGCSVGSKATVRFGLKLNQTRIQERRIQHSYGR
                     QKVK +V NHCS+FV+GFRIPL QD DEAM          KQH+LV P TL+DKSGC  GSKAT R  +KLN+T IQE+R ++S GR
Subjt:  NLSSQLRSHLVGGQKVKLIVTNHCSEFVHGFRIPLRQDNDEAMKPPTVETRIFKQHKLVNPMTLIDKSGCSVGSKATVRFGLKLNQTRIQERRIQHSYGR

Query:  TIMRPTLLDHPSREVRKEQT-PNKTHLATQQESEFTNSESNFTNSESESTSSSSWTTQQTSESETTYDASSPSHQDGSPATGSEASSRYRSSSSSISTKA
         +M+PTL  HPSREVRKEQT  N+ HLA QQESEFTN         SES S SS  T QTSESETT D+SSP +Q    ATGSEASS+Y +SSS+I+ KA
Subjt:  TIMRPTLLDHPSREVRKEQT-PNKTHLATQQESEFTNSESNFTNSESESTSSSSWTTQQTSESETTYDASSPSHQDGSPATGSEASSRYRSSSSSISTKA

Query:  FKFNHGKKESERAIGRFKRLKNKLGLIFHH--HHHHHHHHHHNSNNFMWKQLRKIFHRTDNRKLTSKVERYGMLKKTAIRSVSRKNQVGKFQALAEGLRS
        FKF+HGKKES  A+GRFK L+NKLGLIFHH  HH HHHHHHH+ +N MWKQ+R +FHRTD ++LTSK E+ G L+KT IRSVSR NQVGKFQAL EGLRS
Subjt:  FKFNHGKKESERAIGRFKRLKNKLGLIFHH--HHHHHHHHHHNSNNFMWKQLRKIFHRTDNRKLTSKVERYGMLKKTAIRSVSRKNQVGKFQALAEGLRS

Query:  HVWRSKAMKKKELRGLNCGKKKGVKKLHWWKMFRRHRGVKLPNKERVKIGYVNRKTQPQVV
        HVW+SKAMKKKE RGLNCG     KKLHWWKM RR RGVK PNK RVKIGYVNRK   +++
Subjt:  HVWRSKAMKKKELRGLNCGKKKGVKKLHWWKMFRRHRGVKLPNKERVKIGYVNRKTQPQVV

A0A6J1K5J4 uncharacterized protein LOC111491355 isoform X1  e value: 1.2e-167  % identity: 63.59
Query:  NKMDVDELYLDLLALRELYILLLKSCSRVANSEL-LDERAQTLLKHLLDDATAGILEFHSKNLATNSGIFYNFLHKDDKQTKPLDEKVAEWMEHNQTARK
        +KM+ DELYLDLLALR+LY  LLK C R ANSEL +  RA+ LLKHLLDDAT G+LEFHSK LA     FYNFL KDDKQTKPLDEKVAEWMEHNQTAR+
Subjt:  NKMDVDELYLDLLALRELYILLLKSCSRVANSEL-LDERAQTLLKHLLDDATAGILEFHSKNLATNSGIFYNFLHKDDKQTKPLDEKVAEWMEHNQTARK

Query:  MVNPE-IEYNAKRARPSASNVATNDLSNDISSALRRIELHILSLQRCTSQSRNTRNHIRETNLAYCGLSVLQGNETLKQQKVQSRTDHSTLRTSFAESIK
        M NPE IE+  +R R SASNVA NDLS+ I+SALRRIELHILSLQ      R TR+HI ET LAY G SV QGNE+  Q                     
Subjt:  MVNPE-IEYNAKRARPSASNVATNDLSNDISSALRRIELHILSLQRCTSQSRNTRNHIRETNLAYCGLSVLQGNETLKQQKVQSRTDHSTLRTSFAESIK

Query:  GHNLSSQLRSHLVGGQKVKLIVTNHCSEFVHGFRIPLRQDNDEAMKPPTVETRIFKQHKLVNPMTLIDKSGCSVGSKATVRFGLKLNQTRIQERRIQHSY
                       QKVK +V NHCS+FV+GFRIPL QD DEAM          KQH+LV P TL+DKSGC  GSKAT R  +KLN+T IQE+R ++S 
Subjt:  GHNLSSQLRSHLVGGQKVKLIVTNHCSEFVHGFRIPLRQDNDEAMKPPTVETRIFKQHKLVNPMTLIDKSGCSVGSKATVRFGLKLNQTRIQERRIQHSY

Query:  GRTIMRPTLLDHPSREVRKEQT-PNKTHLATQQESEFTNSESNFTNSESESTSSSSWTTQQTSESETTYDASSPSHQDGSPATGSEASSRYRSSSSSIST
        GR +M+PTL  HPSREVRKEQT  N+ HLA QQESEFTN         SES S SS  T QTSESETT D+SSP +Q    ATGSEASS+Y +SSS+I+ 
Subjt:  GRTIMRPTLLDHPSREVRKEQT-PNKTHLATQQESEFTNSESNFTNSESESTSSSSWTTQQTSESETTYDASSPSHQDGSPATGSEASSRYRSSSSSIST

Query:  KAFKFNHGKKESERAIGRFKRLKNKLGLIFHH--HHHHHHHHHHNSNNFMWKQLRKIFHRTDNRKLTSKVERYGMLKKTAIRSVSRKNQVGKFQALAEGL
        KAFKF+HGKKES  A+GRFK L+NKLGLIFHH  HH HHHHHHH+ +N MWKQ+R +FHRTD ++LTSK E+ G L+KT IRSVSR NQVGKFQAL EGL
Subjt:  KAFKFNHGKKESERAIGRFKRLKNKLGLIFHH--HHHHHHHHHHNSNNFMWKQLRKIFHRTDNRKLTSKVERYGMLKKTAIRSVSRKNQVGKFQALAEGL

Query:  RSHVWRSKAMKKKELRGLNCGKKKGVKKLHWWKMFRRHRGVKLPNKERVKIGYVNRKTQPQVV
        RSHVW+SKAMKKKE RGLNCG     KKLHWWKM RR RGVK PNK RVKIGYVNRK   +++
Subjt:  RSHVWRSKAMKKKELRGLNCGKKKGVKKLHWWKMFRRHRGVKLPNKERVKIGYVNRKTQPQVV

SwissProt top hits (e value, % identity, alignment)
Q9FFP2 Protein KOKOPELLI  e value: 3.1e-19  % identity: 34.24
Query:  IMRPTLLDH-------PSREVRKEQTPNKTHLATQQESEFTNSESNFTNSESESTSSSSWTTQQTSESETTYDASSPSHQDGSPATGSEASSRYRSSSSS
        IM+PTL+D         S E   +QTP+ T   ++ E   T+ E    + E+ S+S S W TQ  +++E+  ++S P   D S +           S+S 
Subjt:  IMRPTLLDH-------PSREVRKEQTPNKTHLATQQESEFTNSESNFTNSESESTSSSSWTTQQTSESETTYDASSPSHQDGSPATGSEASSRYRSSSSS

Query:  ISTKAFKFNHGKKESERAIGRFKRLKNKLGLIFHHHHHHHHHHHHNSNN--FMWKQLRKIFHRTDNRKLTSKVERYGMLKKTAIRSVSRKNQVGKFQALA
          T         K+    +GRFKR+KNK+G IFHHHHHHHHHHHH+       W +L+  FH     K  SK  +  M +   + +  +++Q G F AL 
Subjt:  ISTKAFKFNHGKKESERAIGRFKRLKNKLGLIFHHHHHHHHHHHHNSNN--FMWKQLRKIFHRTDNRKLTSKVERYGMLKKTAIRSVSRKNQVGKFQALA

Query:  EGLRSHVWRSKAMKKKELRGLNCGKKKGVKKLHWWKMFRRHR--GVKLPNKERVKIG
        EGL  H   SK  K +         K   KK  WWK+ ++ +  GVK+P + RVK+G
Subjt:  EGLRSHVWRSKAMKKKELRGLNCGKKKGVKKLHWWKMFRRHR--GVKLPNKERVKIG

Arabidopsis top hits (e value, % identity, alignment)
AT5G63720.1 kokopelli  e value: 2.2e-20  % identity: 34.24
Query:  IMRPTLLDH-------PSREVRKEQTPNKTHLATQQESEFTNSESNFTNSESESTSSSSWTTQQTSESETTYDASSPSHQDGSPATGSEASSRYRSSSSS
        IM+PTL+D         S E   +QTP+ T   ++ E   T+ E    + E+ S+S S W TQ  +++E+  ++S P   D S +           S+S 
Subjt:  IMRPTLLDH-------PSREVRKEQTPNKTHLATQQESEFTNSESNFTNSESESTSSSSWTTQQTSESETTYDASSPSHQDGSPATGSEASSRYRSSSSS

Query:  ISTKAFKFNHGKKESERAIGRFKRLKNKLGLIFHHHHHHHHHHHHNSNN--FMWKQLRKIFHRTDNRKLTSKVERYGMLKKTAIRSVSRKNQVGKFQALA
          T         K+    +GRFKR+KNK+G IFHHHHHHHHHHHH+       W +L+  FH     K  SK  +  M +   + +  +++Q G F AL 
Subjt:  ISTKAFKFNHGKKESERAIGRFKRLKNKLGLIFHHHHHHHHHHHHNSNN--FMWKQLRKIFHRTDNRKLTSKVERYGMLKKTAIRSVSRKNQVGKFQALA

Query:  EGLRSHVWRSKAMKKKELRGLNCGKKKGVKKLHWWKMFRRHR--GVKLPNKERVKIG
        EGL  H   SK  K +         K   KK  WWK+ ++ +  GVK+P + RVK+G
Subjt:  EGLRSHVWRSKAMKKKELRGLNCGKKKGVKKLHWWKMFRRHR--GVKLPNKERVKIG


Sequences
CDS sequence
ATGCCAGAAACAAGGAAATTTATAAACAAGATGGATGTTGACGAGTTATATCTTGATCTCCTAGCACTCAGGGAACTATACATCCTCCTCCTAAAGAGCTGTTCGCGAGT
TGCAAATTCAGAACTTCTGGATGAAAGGGCACAGACTTTATTGAAGCATTTGCTTGATGATGCTACTGCAGGAATTCTTGAGTTTCACTCAAAGAACTTGGCAACAAACT
CTGGCATTTTTTACAACTTTCTGCACAAAGATGATAAACAGACAAAGCCACTGGACGAGAAAGTTGCTGAATGGATGGAACATAATCAAACTGCAAGAAAGATGGTAAAT
CCAGAGATTGAATACAATGCCAAGAGGGCCAGACCTTCAGCTTCAAATGTTGCCACAAATGACTTATCAAATGACATCAGTTCCGCACTTAGAAGAATTGAACTCCACAT
TTTATCTCTGCAACGTTGCACAAGTCAAAGTAGAAACACAAGAAACCATATCAGAGAAACTAATTTAGCTTACTGTGGGCTGTCTGTCCTTCAAGGGAATGAGACATTGA
AACAGCAGAAAGTTCAGTCAAGGACAGATCACTCAACTTTAAGGACCAGCTTTGCTGAGTCGATTAAAGGCCATAACTTGAGCAGTCAGTTAAGAAGTCATCTTGTTGGT
GGACAGAAAGTTAAGCTAATAGTGACAAACCATTGTTCTGAGTTCGTTCATGGATTTAGAATACCTCTGCGTCAAGACAATGATGAGGCCATGAAACCTCCAACAGTTGA
AACTCGCATATTTAAACAACACAAACTTGTAAACCCAATGACTCTGATAGATAAATCTGGATGTTCAGTAGGATCCAAGGCAACTGTCAGGTTCGGTTTGAAACTGAATC
AAACTCGGATACAAGAAAGGAGGATTCAGCATTCATATGGTCGTACGATAATGAGGCCAACTTTGCTTGATCATCCCTCCAGAGAAGTAAGAAAGGAACAAACTCCTAAC
AAGACCCATTTGGCCACTCAGCAAGAATCAGAATTCACAAACTCAGAATCTAACTTCACAAACTCAGAATCAGAATCAACTTCTTCTTCAAGTTGGACAACTCAACAAAC
CAGTGAAAGTGAAACCACTTATGACGCTTCTTCCCCAAGTCACCAAGATGGTTCACCAGCAACCGGTTCAGAGGCAAGTAGCCGGTACAGAAGCAGCAGTAGCAGCATTT
CAACAAAAGCATTTAAATTCAACCATGGGAAAAAAGAGTCTGAGCGAGCAATAGGACGGTTCAAGAGACTCAAAAACAAACTAGGCCTTATCTTCCACCACCACCATCAT
CATCACCACCATCACCACCATAACAGCAATAACTTCATGTGGAAGCAGCTAAGAAAGATCTTCCATCGCACAGATAACAGAAAACTAACAAGTAAAGTAGAAAGATATGG
GATGCTAAAGAAAACAGCAATCAGAAGTGTGTCTCGCAAGAACCAAGTTGGGAAGTTTCAGGCACTTGCTGAAGGGCTTCGAAGCCATGTATGGAGATCGAAAGCCATGA
AGAAGAAAGAGCTTAGGGGGCTGAATTGTGGGAAGAAGAAAGGTGTGAAGAAGTTGCATTGGTGGAAAATGTTTCGTCGTCACCGTGGAGTGAAGTTGCCCAATAAAGAG
CGTGTGAAAATAGGGTATGTAAATAGAAAAACACAGCCTCAGGTGGTTTAG
mRNA sequence
ATGCCAGAAACAAGGAAATTTATAAACAAGATGGATGTTGACGAGTTATATCTTGATCTCCTAGCACTCAGGGAACTATACATCCTCCTCCTAAAGAGCTGTTCGCGAGT
TGCAAATTCAGAACTTCTGGATGAAAGGGCACAGACTTTATTGAAGCATTTGCTTGATGATGCTACTGCAGGAATTCTTGAGTTTCACTCAAAGAACTTGGCAACAAACT
CTGGCATTTTTTACAACTTTCTGCACAAAGATGATAAACAGACAAAGCCACTGGACGAGAAAGTTGCTGAATGGATGGAACATAATCAAACTGCAAGAAAGATGGTAAAT
CCAGAGATTGAATACAATGCCAAGAGGGCCAGACCTTCAGCTTCAAATGTTGCCACAAATGACTTATCAAATGACATCAGTTCCGCACTTAGAAGAATTGAACTCCACAT
TTTATCTCTGCAACGTTGCACAAGTCAAAGTAGAAACACAAGAAACCATATCAGAGAAACTAATTTAGCTTACTGTGGGCTGTCTGTCCTTCAAGGGAATGAGACATTGA
AACAGCAGAAAGTTCAGTCAAGGACAGATCACTCAACTTTAAGGACCAGCTTTGCTGAGTCGATTAAAGGCCATAACTTGAGCAGTCAGTTAAGAAGTCATCTTGTTGGT
GGACAGAAAGTTAAGCTAATAGTGACAAACCATTGTTCTGAGTTCGTTCATGGATTTAGAATACCTCTGCGTCAAGACAATGATGAGGCCATGAAACCTCCAACAGTTGA
AACTCGCATATTTAAACAACACAAACTTGTAAACCCAATGACTCTGATAGATAAATCTGGATGTTCAGTAGGATCCAAGGCAACTGTCAGGTTCGGTTTGAAACTGAATC
AAACTCGGATACAAGAAAGGAGGATTCAGCATTCATATGGTCGTACGATAATGAGGCCAACTTTGCTTGATCATCCCTCCAGAGAAGTAAGAAAGGAACAAACTCCTAAC
AAGACCCATTTGGCCACTCAGCAAGAATCAGAATTCACAAACTCAGAATCTAACTTCACAAACTCAGAATCAGAATCAACTTCTTCTTCAAGTTGGACAACTCAACAAAC
CAGTGAAAGTGAAACCACTTATGACGCTTCTTCCCCAAGTCACCAAGATGGTTCACCAGCAACCGGTTCAGAGGCAAGTAGCCGGTACAGAAGCAGCAGTAGCAGCATTT
CAACAAAAGCATTTAAATTCAACCATGGGAAAAAAGAGTCTGAGCGAGCAATAGGACGGTTCAAGAGACTCAAAAACAAACTAGGCCTTATCTTCCACCACCACCATCAT
CATCACCACCATCACCACCATAACAGCAATAACTTCATGTGGAAGCAGCTAAGAAAGATCTTCCATCGCACAGATAACAGAAAACTAACAAGTAAAGTAGAAAGATATGG
GATGCTAAAGAAAACAGCAATCAGAAGTGTGTCTCGCAAGAACCAAGTTGGGAAGTTTCAGGCACTTGCTGAAGGGCTTCGAAGCCATGTATGGAGATCGAAAGCCATGA
AGAAGAAAGAGCTTAGGGGGCTGAATTGTGGGAAGAAGAAAGGTGTGAAGAAGTTGCATTGGTGGAAAATGTTTCGTCGTCACCGTGGAGTGAAGTTGCCCAATAAAGAG
CGTGTGAAAATAGGGTATGTAAATAGAAAAACACAGCCTCAGGTGGTTTAG
Protein sequence
MPETRKFINKMDVDELYLDLLALRELYILLLKSCSRVANSELLDERAQTLLKHLLDDATAGILEFHSKNLATNSGIFYNFLHKDDKQTKPLDEKVAEWMEHNQTARKMVN
PEIEYNAKRARPSASNVATNDLSNDISSALRRIELHILSLQRCTSQSRNTRNHIRETNLAYCGLSVLQGNETLKQQKVQSRTDHSTLRTSFAESIKGHNLSSQLRSHLVG
GQKVKLIVTNHCSEFVHGFRIPLRQDNDEAMKPPTVETRIFKQHKLVNPMTLIDKSGCSVGSKATVRFGLKLNQTRIQERRIQHSYGRTIMRPTLLDHPSREVRKEQTPN
KTHLATQQESEFTNSESNFTNSESESTSSSSWTTQQTSESETTYDASSPSHQDGSPATGSEASSRYRSSSSSISTKAFKFNHGKKESERAIGRFKRLKNKLGLIFHHHHH
HHHHHHHNSNNFMWKQLRKIFHRTDNRKLTSKVERYGMLKKTAIRSVSRKNQVGKFQALAEGLRSHVWRSKAMKKKELRGLNCGKKKGVKKLHWWKMFRRHRGVKLPNKE
RVKIGYVNRKTQPQVV
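
The protein sequence above should follow from the CDS sequence by codon-wise translation. The following is a minimal Python sketch of that check, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. `translate` and `CODON_TABLE` are illustrative names introduced here, not part of any CuGenDB tool.

```python
# Standard genetic code (NCBI table 1), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}

def translate(cds, to_stop=True):
    """Translate a CDS string codon by codon.

    Whitespace and line breaks are removed first, so the wrapped
    sequence lines can be pasted in as-is. Codons with ambiguous
    bases are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*" and to_stop:
            break  # stop codon: end of the protein
        protein.append(residue)
    return "".join(protein)

# First six codons and the last line of the CDS listed above.
print(translate("ATGCCAGAAACAAGGAAA"))  # MPETRK
print(translate("CGTGTGAAAATAGGGTATGTAAATAGAAAAACACAGCCTCAGGTGGTTTAG"))  # RVKIGYVNRKTQPQVV
```

Both fragments match the start and end of the listed protein sequence. Passing the full 1701-nt CDS should give the 566-residue protein, with the final TAG read as the stop codon.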