; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg002279 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg002279
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptionprotein KOKOPELLI-like isoform X1
Genome locationscaffold1:30925617..30929958
RNA-Seq ExpressionSpg002279
SyntenySpg002279
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579634.1 Protein KOKOPELLI, partial [Cucurbita argyrosperma subsp. sororia]3.6e-15360.51Show/hide
Query:  MDVDKLYLDLLALRELYILLLKSCLRDANSKLLDERAQILLKHLLDDATAEVVEFYSKILATDSSISYNFQHKDVKQTKPLDEKVAEWMEHNNQSVRKMG
        MDVD+ YLDLLALRELYILLLKSCLRDA S+LLD RAQILLK+LLDDATAEV+EF  K +ATDS I Y F HKD KQ+KPLDEKV EWM           
Subjt:  MDVDKLYLDLLALRELYILLLKSCLRDANSKLLDERAQILLKHLLDDATAEVVEFYSKILATDSSISYNFQHKDVKQTKPLDEKVAEWMEHNNQSVRKMG

Query:  NLEIEHNPRRARASASNVATNDFSNGISTALRRIEVHILSLQRCTSQSRNNTSNHIGETKLAHSGQSVLQRNETMNQQKVQTRTNHSTLRTGFTEPIKGH
            +H P+RAR SASN  T+    GIS+ALRRIE HILSLQR TSQS+         + +++ G+SVL+ NET+N+QKVQ+RT+HST+           
Subjt:  NLEIEHNPRRARASASNVATNDFSNGISTALRRIEVHILSLQRCTSQSRNNTSNHIGETKLAHSGQSVLQRNETMNQQKVQTRTNHSTLRTGFTEPIKGH

Query:  NLSSQLRSHLVGGQKIKPTVTNHSSEFVHGFRIPLSQDNEEAMKPPTVETYISKQQKLINPLTQIGQSGYSVGSKVTIRAGTKLNQTRIQERRSQNSSGG
          + Q++ HLVGGQ +K  VT H SEFVHGFR+PLSQ +EE  KP TVET++SKQ KL+NP+T I +SG SVGSK TIR   K +Q+R+  ++SQNS G 
Subjt:  NLSSQLRSHLVGGQKIKPTVTNHSSEFVHGFRIPLSQDNEEAMKPPTVETYISKQQKLINPLTQIGQSGYSVGSKVTIRAGTKLNQTRIQERRSQNSSGG

Query:  MIMRPTLLDHPSREVRKEQTYNKIHLATQPESEFTNSESESASSSSWASQKTLESETTDDPSSPSYQDSPPTTGSEASTRYRSSSSSSSSISTKAFRFSH
        M+M+PTLLDHPSREVRKE+T  K HLATQ ESEFT    +SA SSSW +Q+T ESET DD SSPS+QD  P   SEAS+                 R+SH
Subjt:  MIMRPTLLDHPSREVRKEQTYNKIHLATQPESEFTNSESESASSSSWASQKTLESETTDDPSSPSYQDSPPTTGSEASTRYRSSSSSSSSISTKAFRFSH

Query:  RKKGSKRAIGRFKRLKNKLGLIFHHHHHHHHHHNSHNFMWKQLRKIFHHTDNRKLTSNEEKSGKLKKTAIRSRGVSHKNQVGKFQALAEGLRSHVWRSKA
         KK SKRAIGRFKRLKNKLG+IF HHHHHHHHHNSH+FMW ++RKIFH T+N+KLTS E++  K K TAIRS      NQVGKFQA+A+ L+SHV RSK 
Subjt:  RKKGSKRAIGRFKRLKNKLGLIFHHHHHHHHHHNSHNFMWKQLRKIFHHTDNRKLTSNEEKSGKLKKTAIRSRGVSHKNQVGKFQALAEGLRSHVWRSKA

Query:  MKKKELRGLTFGKKKGVKKLHWWKMFRRRHGMKLSNKGRV-RIRYVNRKSQL
        +KKK+   +  G  KGVKKLHWWK+FR RHG++  NKGR+ RIRYVN+K QL
Subjt:  MKKKELRGLTFGKKKGVKKLHWWKMFRRRHGMKLSNKGRV-RIRYVNRKSQL

KAG7017089.1 Protein KOKOPELLI, partial [Cucurbita argyrosperma subsp. argyrosperma]3.6e-15360.51Show/hide
Query:  MDVDKLYLDLLALRELYILLLKSCLRDANSKLLDERAQILLKHLLDDATAEVVEFYSKILATDSSISYNFQHKDVKQTKPLDEKVAEWMEHNNQSVRKMG
        MDVD+ YLDLLALRELYILLLKSCLRDA S+LLD RAQILLK+LLDDATAEV+EF  K +ATDS I Y F HKD KQ+KPLDEKV EWM           
Subjt:  MDVDKLYLDLLALRELYILLLKSCLRDANSKLLDERAQILLKHLLDDATAEVVEFYSKILATDSSISYNFQHKDVKQTKPLDEKVAEWMEHNNQSVRKMG

Query:  NLEIEHNPRRARASASNVATNDFSNGISTALRRIEVHILSLQRCTSQSRNNTSNHIGETKLAHSGQSVLQRNETMNQQKVQTRTNHSTLRTGFTEPIKGH
            +H P+RAR SASN  T+    GIS+ALRRIE HILSLQR TSQS+         + +++ G+SVL+ NET+N+QKVQ+RT+HST+           
Subjt:  NLEIEHNPRRARASASNVATNDFSNGISTALRRIEVHILSLQRCTSQSRNNTSNHIGETKLAHSGQSVLQRNETMNQQKVQTRTNHSTLRTGFTEPIKGH

Query:  NLSSQLRSHLVGGQKIKPTVTNHSSEFVHGFRIPLSQDNEEAMKPPTVETYISKQQKLINPLTQIGQSGYSVGSKVTIRAGTKLNQTRIQERRSQNSSGG
          + Q++ HLVGGQ +K  VT H SEFVHGFR+PLSQ +EE  KP TVET++SKQ KL+NP+T I +SG SVGSK TIR   K +Q+R+  ++SQNS G 
Subjt:  NLSSQLRSHLVGGQKIKPTVTNHSSEFVHGFRIPLSQDNEEAMKPPTVETYISKQQKLINPLTQIGQSGYSVGSKVTIRAGTKLNQTRIQERRSQNSSGG

Query:  MIMRPTLLDHPSREVRKEQTYNKIHLATQPESEFTNSESESASSSSWASQKTLESETTDDPSSPSYQDSPPTTGSEASTRYRSSSSSSSSISTKAFRFSH
        M+M+PTLLDHPSREVRKE+T  K HLATQ ESEFT    +SA SSSW +Q+T ESET DD SSPS+QD  P   SEAS+                 R+SH
Subjt:  MIMRPTLLDHPSREVRKEQTYNKIHLATQPESEFTNSESESASSSSWASQKTLESETTDDPSSPSYQDSPPTTGSEASTRYRSSSSSSSSISTKAFRFSH

Query:  RKKGSKRAIGRFKRLKNKLGLIFHHHHHHHHHHNSHNFMWKQLRKIFHHTDNRKLTSNEEKSGKLKKTAIRSRGVSHKNQVGKFQALAEGLRSHVWRSKA
         KK SKRAIGRFKRLKNKLG+IF HHHHHHHHHNSH+FMW ++RKIFH T+N+KLTS E++  K K TAIRS      NQVGKFQA+A+ L+SHV RSK 
Subjt:  RKKGSKRAIGRFKRLKNKLGLIFHHHHHHHHHHNSHNFMWKQLRKIFHHTDNRKLTSNEEKSGKLKKTAIRSRGVSHKNQVGKFQALAEGLRSHVWRSKA

Query:  MKKKELRGLTFGKKKGVKKLHWWKMFRRRHGMKLSNKGRV-RIRYVNRKSQL
        +KKK+   +  G  KGVKKLHWWK+FR RHG++  NKGR+ RIRYVN+K QL
Subjt:  MKKKELRGLTFGKKKGVKKLHWWKMFRRRHGMKLSNKGRV-RIRYVNRKSQL

XP_022996025.1 uncharacterized protein LOC111491355 isoform X1 [Cucurbita maxima]7.4e-15160.35Show/hide
Query:  LSDKMDVDKLYLDLLALRELYILLLKSCLRDANSKL-LDERAQILLKHLLDDATAEVVEFYSKILATDSSISYNFQHKDVKQTKPLDEKVAEWMEHNNQS
        LSDKM+ D+LYLDLLALR+LY  LLK CLRDANS+L +  RA+ILLKHLLDDAT  ++EF+SK LA      YNF  KD KQTKPLDEKVAEWMEH NQ+
Subjt:  LSDKMDVDKLYLDLLALRELYILLLKSCLRDANSKL-LDERAQILLKHLLDDATAEVVEFYSKILATDSSISYNFQHKDVKQTKPLDEKVAEWMEHNNQS

Query:  VRKMGNLE-IEHNPRRARASASNVATNDFSNGISTALRRIEVHILSLQRCTSQSRNNTSNHIGETKLAHSGQSVLQRNETMNQQKVQTRTNHSTLRTGFT
         R+M N E IEH PRR RASASNVA ND S+GI++ALRRIE+HILSLQR        T +HI ETKLA+ GQSV Q NE+ NQQKV              
Subjt:  VRKMGNLE-IEHNPRRARASASNVATNDFSNGISTALRRIEVHILSLQRCTSQSRNNTSNHIGETKLAHSGQSVLQRNETMNQQKVQTRTNHSTLRTGFT

Query:  EPIKGHNLSSQLRSHLVGGQKIKPTVTNHSSEFVHGFRIPLSQDNEEAMKPPTVETYISKQQKLINPLTQIGQSGYSVGSKVTIRAGTKLNQTRIQERRS
                              KP V NH S+FV+GFRIPL+QD +EAM          KQ +L+ P T + +SG   GSK T R   KLN+T IQE+RS
Subjt:  EPIKGHNLSSQLRSHLVGGQKIKPTVTNHSSEFVHGFRIPLSQDNEEAMKPPTVETYISKQQKLINPLTQIGQSGYSVGSKVTIRAGTKLNQTRIQERRS

Query:  QNSSGGMIMRPTLLDHPSREVRKEQT-YNKIHLATQPESEFTNSESESASSSSWASQKTLESETTDDPSSPSYQDSPPTTGSEASTRYRSSSSSSSSIST
        +NS G ++M+PTL  HPSREVRKEQT +N+ HLA Q ESEFTN  SESAS SS A+ +T ESETTDD SSP  Q SP  TGSEAS++Y    +SSS+I+ 
Subjt:  QNSSGGMIMRPTLLDHPSREVRKEQT-YNKIHLATQPESEFTNSESESASSSSWASQKTLESETTDDPSSPSYQDSPPTTGSEASTRYRSSSSSSSSIST

Query:  KAFRFSHRKKGSKRAIGRFKRLKNKLGLIFHHH----HHHHHHHNSHNFMWKQLRKIFHHTDNRKLTSNEEKSGKLKKTAIRSRGVSHKNQVGKFQALAE
        KAF+FSH KK S  A+GRFK L+NKLGLIFHHH    HHHHHHH+ HN MWKQ+R +FH TD ++LTS EEK+GKL+KT IRS  VS  NQVGKFQAL E
Subjt:  KAFRFSHRKKGSKRAIGRFKRLKNKLGLIFHHH----HHHHHHHNSHNFMWKQLRKIFHHTDNRKLTSNEEKSGKLKKTAIRSRGVSHKNQVGKFQALAE

Query:  GLRSHVWRSKAMKKKELRGLTFGKKKGVKKLHWWKMFRRRHGMKLSNKGRVRIRYVNRKSQLKVV
        GLRSHVW+SKAMKKKE RGL  G     KKLHWWKM RRR G+K  NKGRV+I YVNRK  +K++
Subjt:  GLRSHVWRSKAMKKKELRGLTFGKKKGVKKLHWWKMFRRRHGMKLSNKGRVRIRYVNRKSQLKVV

XP_038877121.1 protein KOKOPELLI-like isoform X1 [Benincasa hispida]4.1e-15761.87Show/hide
Query:  IFAAWAMLSDKMDVDKLYLDLLALRELYILLLKSCLRDANSKLLDERAQILLKHLLDDATAEVVEFYSKILATDSSISYNFQHKDVKQTKPLDEKVAEWM
        +F   A+ S  MDVDKLYLDLLALRELYILLLKSCL DANS+LLDERAQILLKHLLDDATA V+EF S  LAT+S+I  NF HKD KQ KPL +KV EWM
Subjt:  IFAAWAMLSDKMDVDKLYLDLLALRELYILLLKSCLRDANSKLLDERAQILLKHLLDDATAEVVEFYSKILATDSSISYNFQHKDVKQTKPLDEKVAEWM

Query:  EHNNQSVRKMGNLEIEHNPRRARASASNVATNDFSNGISTALRRIEVHILSLQRCTSQSRNNTSNHIGETKLAHSGQSVLQRNETMNQQKVQTRTNHSTL
        +H NQ+ RKMGN EI     R RASASNVA N+ S+ IS+ALRRIE+HILSLQ CTSQ R          K     QSVLQ NE++NQQ V  RT  STL
Subjt:  EHNNQSVRKMGNLEIEHNPRRARASASNVATNDFSNGISTALRRIEVHILSLQRCTSQSRNNTSNHIGETKLAHSGQSVLQRNETMNQQKVQTRTNHSTL

Query:  RTGFTEPIKGHNLSSQLRSHLVGGQ-KIKPTVTNHSSEFVHGFRIPLSQDNEEAMKPPTVETYISKQQKLINPLTQIGQSGY-SVGSKVTIRAGTKLNQT
        R+ FT+PIKG       R H VG Q K+KP   NH SE+VHGFRIPLSQ N+EAMKP T+ET+I+KQ K++NP+T I +SGY SVGSK T R   KLNQT
Subjt:  RTGFTEPIKGHNLSSQLRSHLVGGQ-KIKPTVTNHSSEFVHGFRIPLSQDNEEAMKPPTVETYISKQQKLINPLTQIGQSGY-SVGSKVTIRAGTKLNQT

Query:  -RIQERRSQNSSGGMIMRPTLLD-HPSREVRKEQTYNKIHL-ATQPESEFTNSE--SESASSSSWASQKTLESETT-----DDPSSPSYQDSPPTTGSEA
         + Q +R+QNS G M+M PTLLD HPS+E R E+  +K HL ATQ ESEFT+SE  S S+SSSSW +Q+T  SET       +PSSPS+QD P       
Subjt:  -RIQERRSQNSSGGMIMRPTLLD-HPSREVRKEQTYNKIHL-ATQPESEFTNSE--SESASSSSWASQKTLESETT-----DDPSSPSYQDSPPTTGSEA

Query:  STRYRSSSSSSSSISTKAFRFSHRKKGSKRAIGRFKRLKNKLGLIF-HHHHHHHHHHNSHNFMWK-QLRKIFHHTDNRK-LTSNEEKSGKLKKTAIRSRG
              S+ S SS  TK F     K  SK+ +GRFKRLKNKLG++F HHHHHHHHHHNS+NFMWK QLRKIFH  DN++ L S E+ + K+KK AIR+  
Subjt:  STRYRSSSSSSSSISTKAFRFSHRKKGSKRAIGRFKRLKNKLGLIF-HHHHHHHHHHNSHNFMWK-QLRKIFHHTDNRK-LTSNEEKSGKLKKTAIRSRG

Query:  VSHKNQVGKFQALAEGLRSHVWRSKAMKKKELRGLTFGKKKGVKKLHWWKMFRRRHGMKLSNKGRVRIRYVNRKSQL
        V +KNQVGKFQALAEGLRSHVWRSKAMK+K ++G+  G KKGVKKLHWWKMFR R G++L NKG ++I YVN+K++L
Subjt:  VSHKNQVGKFQALAEGLRSHVWRSKAMKKKELRGLTFGKKKGVKKLHWWKMFRRRHGMKLSNKGRVRIRYVNRKSQL

XP_038877123.1 protein KOKOPELLI-like isoform X3 [Benincasa hispida]3.4e-15662.54Show/hide
Query:  MDVDKLYLDLLALRELYILLLKSCLRDANSKLLDERAQILLKHLLDDATAEVVEFYSKILATDSSISYNFQHKDVKQTKPLDEKVAEWMEHNNQSVRKMG
        MDVDKLYLDLLALRELYILLLKSCL DANS+LLDERAQILLKHLLDDATA V+EF S  LAT+S+I  NF HKD KQ KPL +KV EWM+H NQ+ RKMG
Subjt:  MDVDKLYLDLLALRELYILLLKSCLRDANSKLLDERAQILLKHLLDDATAEVVEFYSKILATDSSISYNFQHKDVKQTKPLDEKVAEWMEHNNQSVRKMG

Query:  NLEIEHNPRRARASASNVATNDFSNGISTALRRIEVHILSLQRCTSQSRNNTSNHIGETKLAHSGQSVLQRNETMNQQKVQTRTNHSTLRTGFTEPIKGH
        N EI     R RASASNVA N+ S+ IS+ALRRIE+HILSLQ CTSQ R          K     QSVLQ NE++NQQ V  RT  STLR+ FT+PIKG 
Subjt:  NLEIEHNPRRARASASNVATNDFSNGISTALRRIEVHILSLQRCTSQSRNNTSNHIGETKLAHSGQSVLQRNETMNQQKVQTRTNHSTLRTGFTEPIKGH

Query:  NLSSQLRSHLVGGQ-KIKPTVTNHSSEFVHGFRIPLSQDNEEAMKPPTVETYISKQQKLINPLTQIGQSGY-SVGSKVTIRAGTKLNQT-RIQERRSQNS
              R H VG Q K+KP   NH SE+VHGFRIPLSQ N+EAMKP T+ET+I+KQ K++NP+T I +SGY SVGSK T R   KLNQT + Q +R+QNS
Subjt:  NLSSQLRSHLVGGQ-KIKPTVTNHSSEFVHGFRIPLSQDNEEAMKPPTVETYISKQQKLINPLTQIGQSGY-SVGSKVTIRAGTKLNQT-RIQERRSQNS

Query:  SGGMIMRPTLLD-HPSREVRKEQTYNKIHL-ATQPESEFTNSE--SESASSSSWASQKTLESETT-----DDPSSPSYQDSPPTTGSEASTRYRSSSSSS
         G M+M PTLLD HPS+E R E+  +K HL ATQ ESEFT+SE  S S+SSSSW +Q+T  SET       +PSSPS+QD P             S+ S 
Subjt:  SGGMIMRPTLLD-HPSREVRKEQTYNKIHL-ATQPESEFTNSE--SESASSSSWASQKTLESETT-----DDPSSPSYQDSPPTTGSEASTRYRSSSSSS

Query:  SSISTKAFRFSHRKKGSKRAIGRFKRLKNKLGLIF-HHHHHHHHHHNSHNFMWK-QLRKIFHHTDNRK-LTSNEEKSGKLKKTAIRSRGVSHKNQVGKFQ
        SS  TK F     K  SK+ +GRFKRLKNKLG++F HHHHHHHHHHNS+NFMWK QLRKIFH  DN++ L S E+ + K+KK AIR+  V +KNQVGKFQ
Subjt:  SSISTKAFRFSHRKKGSKRAIGRFKRLKNKLGLIF-HHHHHHHHHHNSHNFMWK-QLRKIFHHTDNRK-LTSNEEKSGKLKKTAIRSRGVSHKNQVGKFQ

Query:  ALAEGLRSHVWRSKAMKKKELRGLTFGKKKGVKKLHWWKMFRRRHGMKLSNKGRVRIRYVNRKSQL
        ALAEGLRSHVWRSKAMK+K ++G+  G KKGVKKLHWWKMFR R G++L NKG ++I YVN+K++L
Subjt:  ALAEGLRSHVWRSKAMKKKELRGLTFGKKKGVKKLHWWKMFRRRHGMKLSNKGRVRIRYVNRKSQL

TrEMBL top hitse value%identityAlignment
A0A6J1DNR3 protein KOKOPELLI isoform X26.3e-14860.75Show/hide
Query:  MDVDKLYLDLLALRELYILLLKSCLRDANSKLLDERAQILLKHLLDDATAEVVEFYSKILATDSSISYNFQHKDVKQTKPLDEKVAEWMEHNNQSVRKMG
        M+V++LYLDLLALRELYILLLKSCLRDANS+LLDERAQILLKHLLDDATAE+V+F+SK                   TKP++EKVAEWME+ NQS RK G
Subjt:  MDVDKLYLDLLALRELYILLLKSCLRDANSKLLDERAQILLKHLLDDATAEVVEFYSKILATDSSISYNFQHKDVKQTKPLDEKVAEWMEHNNQSVRKMG

Query:  NLEIEHNPRRARASASNVATNDFSNGISTALRRIEVHILSLQRCTSQSRNNTSNHIGETKLAHSGQSVLQRNETMNQQKVQTRTNHSTLRTGFTEPIKGH
                        NVA ND SNGI  ALRRIE HILSLQ  TSQSR NT +HI   KL+         N  ++QQKVQ+R +HS L+    EPI G 
Subjt:  NLEIEHNPRRARASASNVATNDFSNGISTALRRIEVHILSLQRCTSQSRNNTSNHIGETKLAHSGQSVLQRNETMNQQKVQTRTNHSTLRTGFTEPIKGH

Query:  NLSSQLRSHLVGGQKIKPTVTNHSSEFVHGFRIPLSQDNEEAMKPPTVETYISKQQKLINPLTQIGQSGYSVGSKVTIRAGTKLNQTRIQERRSQNSSGG
                              H SEFVHGFR+PLSQDN EAMKPP V T +SKQ K+INP+  I +S  SVGSK T+R+   +N+T+I ERR QN  G 
Subjt:  NLSSQLRSHLVGGQKIKPTVTNHSSEFVHGFRIPLSQDNEEAMKPPTVETYISKQQKLINPLTQIGQSGYSVGSKVTIRAGTKLNQTRIQERRSQNSSGG

Query:  MIMRPTLLDHPSREVRKEQTYNKIHLATQPESEFTNSESESASSSSWASQKTLESETTDDPSSPSYQDSPPTTGSEASTRYRSSSSSSSSISTKAFRFSH
        MIMRPTLL+H            K  + TQ ESEFTNSESES SSSSWA+Q+T E+ETTD PSS S+Q+  P TGSE S+RYR     SS IS+KAFR SH
Subjt:  MIMRPTLLDHPSREVRKEQTYNKIHLATQPESEFTNSESESASSSSWASQKTLESETTDDPSSPSYQDSPPTTGSEASTRYRSSSSSSSSISTKAFRFSH

Query:  RKKGSKRAIGRFKRLKNKLGLIF--HHHHHHHHHHNSHN--FMWKQLRKIFHHTDNRKLTSNEEKSGKLKKTAIRSRGVSHKNQVGKFQALAEGLRSHVW
         KKGSK+AIGRFKRL+NKLGLIF  HHHHHHHHHHNSHN  FMWKQLRKIFH TD +++TS + +   LKKTAIRS  VS KNQVG+FQALAEGLRSHVW
Subjt:  RKKGSKRAIGRFKRLKNKLGLIF--HHHHHHHHHHNSHN--FMWKQLRKIFHHTDNRKLTSNEEKSGKLKKTAIRSRGVSHKNQVGKFQALAEGLRSHVW

Query:  RSKAMKKKELRGLTFGKKKGVKKLHWWKMFRRRHGMKLSNKGRVRIRYVNRKSQLKVV
        +  AMKKKELR    G KKGVKKLHWW+MF RR G+KL NKGRV+I YVNRK Q K+V
Subjt:  RSKAMKKKELRGLTFGKKKGVKKLHWWKMFRRRHGMKLSNKGRVRIRYVNRKSQLKVV

A0A6J1ETH9 protein KOKOPELLI-like isoform X12.2e-14859.6Show/hide
Query:  MDVDKLYLDLLALRELYILLLKSCLRDANSKLLDERAQILLKHLLDDATAEVVEFYSKILATDSSISYNFQHKDVKQTKPLDEKVAEWMEHNNQSVRKMG
        MDVD+ YLDLLALRELYILLLKSCLRDA S+LLDERAQILLK+LLDDATAEV+EF  K +ATDS I Y F HKD KQ+KPLDEKV EWM           
Subjt:  MDVDKLYLDLLALRELYILLLKSCLRDANSKLLDERAQILLKHLLDDATAEVVEFYSKILATDSSISYNFQHKDVKQTKPLDEKVAEWMEHNNQSVRKMG

Query:  NLEIEHNPRRARASASNVATNDFSNGISTALRRIEVHILSLQRCTSQSRNNTSNHIGETKLAHSGQSVLQRNETMNQQKVQTRTNHSTLRTGFTEPIKGH
            +  P+RAR SASN  T+    GIS+A+RRIE HILSLQR TSQS+         + +++ G+SVL+ NET N+QKVQ+RT+HST+           
Subjt:  NLEIEHNPRRARASASNVATNDFSNGISTALRRIEVHILSLQRCTSQSRNNTSNHIGETKLAHSGQSVLQRNETMNQQKVQTRTNHSTLRTGFTEPIKGH

Query:  NLSSQLRSHLVGGQKIKPTVTNHSSEFVHGFRIPLSQDNEEAMKPPTVETYISKQQKLINPLTQIGQSGYSVGSKVTIRAGTKLNQTRIQERRSQNSSGG
          + Q++  LVGGQ  K  VT H SEFVHGFR+PLSQ ++E  KP  VET++SKQ KL+NP+T I + G SVGSK TIR   K +Q+R+  ++SQNS G 
Subjt:  NLSSQLRSHLVGGQKIKPTVTNHSSEFVHGFRIPLSQDNEEAMKPPTVETYISKQQKLINPLTQIGQSGYSVGSKVTIRAGTKLNQTRIQERRSQNSSGG

Query:  MIMRPTLLDHPSREVRKEQTYNKIHLATQPESEFTNSESESASSSSWASQKTLESETTDDPSSPSYQDSPPTTGSEASTRYRSSSSSSSSISTKAFRFSH
        M+M+PTLLDHPSREVRKE+T  K HLATQ ESEFT    +SA SSSW +Q+T ES T DD SSPS+QD  P   SE                T + R+S 
Subjt:  MIMRPTLLDHPSREVRKEQTYNKIHLATQPESEFTNSESESASSSSWASQKTLESETTDDPSSPSYQDSPPTTGSEASTRYRSSSSSSSSISTKAFRFSH

Query:  RKKGSKRAIGRFKRLKNKLGLIFHHHHHHHHHHNSHNFMWKQLRKIFHHTDNRKLTSNEEKSGKLKKTAIRSRGVSHKNQVGKFQALAEGLRSHVWRSKA
         KK SKRAIGRFKRLKNKLG+IF HHHHHHHHHNSH+FMW ++RKIFH T+N+KLTS E++  K K TAIRS      NQVGKFQA+A+ LRSHV RSKA
Subjt:  RKKGSKRAIGRFKRLKNKLGLIFHHHHHHHHHHNSHNFMWKQLRKIFHHTDNRKLTSNEEKSGKLKKTAIRSRGVSHKNQVGKFQALAEGLRSHVWRSKA

Query:  MKKKELRGLTFGKKKGVKKLHWWKMFRRRHGMKLSNKGRV-RIRYVNRKSQL
        + KK+   +  G KKGVKKLHWWK+FR RHG++L NKGR+ RIRYVN+K QL
Subjt:  MKKKELRGLTFGKKKGVKKLHWWKMFRRRHGMKLSNKGRV-RIRYVNRKSQL

A0A6J1I0S9 protein KOKOPELLI-like isoform X11.7e-14858.95Show/hide
Query:  MDVDKLYLDLLALRELYILLLKSCLRDANSKLLDERAQILLKHLLDDATAEVVEFYSKILATDSSISYNFQHKDVKQTKPLDEKVAEWMEHNNQSVRKMG
        MDVD+ YLDLLALRELYILLLKSCLRDA S+LLDERAQILLK+ LDDATAEV+EF SK LATDS I Y F HKD KQTKPLDEKV E M           
Subjt:  MDVDKLYLDLLALRELYILLLKSCLRDANSKLLDERAQILLKHLLDDATAEVVEFYSKILATDSSISYNFQHKDVKQTKPLDEKVAEWMEHNNQSVRKMG

Query:  NLEIEHNPRRARASASNVATNDFSNGISTALRRIEVHILSLQRCTSQSRNNTSNHIGETKLAHSGQSVLQRNETMNQQKVQTRTNHSTLRTGFTEPIKGH
            +H P+RAR SAS   T+    GI +ALRRIE HILS QR  SQS+         + +++ G+SVL+ NET+N+QKVQ+RT+HST+           
Subjt:  NLEIEHNPRRARASASNVATNDFSNGISTALRRIEVHILSLQRCTSQSRNNTSNHIGETKLAHSGQSVLQRNETMNQQKVQTRTNHSTLRTGFTEPIKGH

Query:  NLSSQLRSHLVGGQKIKPTVTNHSSEFVHGFRIPLSQDNEEAMKPPTVETYISKQQKLINPLTQIGQSGYSVGSKVTIRAGTKLNQTRIQERRSQNSSG-
          + Q++ HLVGGQ +KP ++ H SEFVHGFR+PLSQ N E  KP  VET++SKQ K +NP+T+I +SG SVGSK TI    K +Q+R+  +RS+NS G 
Subjt:  NLSSQLRSHLVGGQKIKPTVTNHSSEFVHGFRIPLSQDNEEAMKPPTVETYISKQQKLINPLTQIGQSGYSVGSKVTIRAGTKLNQTRIQERRSQNSSG-

Query:  GMIMRPTLLDHPSREVRKEQTYNKIHLATQPESEFTNSESESASSSSWASQKTLESETTDDPSSPSYQDSPPTTGSEASTRYRSSSSSSSSISTKAFRFS
         M+M+PTLL+HPSREVRKE+T NK HLA+Q E+EFT    +SASSSSW +Q+T ESET D+ SSPS+QD PP   S+AS+R                R+S
Subjt:  GMIMRPTLLDHPSREVRKEQTYNKIHLATQPESEFTNSESESASSSSWASQKTLESETTDDPSSPSYQDSPPTTGSEASTRYRSSSSSSSSISTKAFRFS

Query:  HRKKGSKRAIGRFKRLKNKLGLIFHHHHHHHHHHNSHNFMWKQLRKIFHHTDNRKLTSNEEKSGKLKKTAIRSRGVSHKNQVGKFQALAEGLRSHVWRSK
        H KK SKRAIGRFKRLKNKLG+IF HHHHHHHHHN H+FMW ++RKIFH T+N+KLTS E++  K+K TA+RS G +  NQV KFQA+A+ L+SHV RSK
Subjt:  HRKKGSKRAIGRFKRLKNKLGLIFHHHHHHHHHHNSHNFMWKQLRKIFHHTDNRKLTSNEEKSGKLKKTAIRSRGVSHKNQVGKFQALAEGLRSHVWRSK

Query:  AMKKKELRGLTFGKKKGVKKLHWWKMFRRRHGMKLSNKGRV-RIRYVNRKSQL
        AMKKK+   +  G  KGVKKLHWWK+F  RHG++  NKG + RIRYVNRKS+L
Subjt:  AMKKKELRGLTFGKKKGVKKLHWWKMFRRRHGMKLSNKGRV-RIRYVNRKSQL

A0A6J1K0S1 uncharacterized protein LOC111491355 isoform X27.5e-14960.07Show/hide
Query:  MDVDKLYLDLLALRELYILLLKSCLRDANSKL-LDERAQILLKHLLDDATAEVVEFYSKILATDSSISYNFQHKDVKQTKPLDEKVAEWMEHNNQSVRKM
        M+ D+LYLDLLALR+LY  LLK CLRDANS+L +  RA+ILLKHLLDDAT  ++EF+SK LA      YNF  KD KQTKPLDEKVAEWMEH NQ+ R+M
Subjt:  MDVDKLYLDLLALRELYILLLKSCLRDANSKL-LDERAQILLKHLLDDATAEVVEFYSKILATDSSISYNFQHKDVKQTKPLDEKVAEWMEHNNQSVRKM

Query:  GNLE-IEHNPRRARASASNVATNDFSNGISTALRRIEVHILSLQRCTSQSRNNTSNHIGETKLAHSGQSVLQRNETMNQQKVQTRTNHSTLRTGFTEPIK
         N E IEH PRR RASASNVA ND S+GI++ALRRIE+HILSLQR        T +HI ETKLA+ GQSV Q NE+ NQQKV                  
Subjt:  GNLE-IEHNPRRARASASNVATNDFSNGISTALRRIEVHILSLQRCTSQSRNNTSNHIGETKLAHSGQSVLQRNETMNQQKVQTRTNHSTLRTGFTEPIK

Query:  GHNLSSQLRSHLVGGQKIKPTVTNHSSEFVHGFRIPLSQDNEEAMKPPTVETYISKQQKLINPLTQIGQSGYSVGSKVTIRAGTKLNQTRIQERRSQNSS
                          KP V NH S+FV+GFRIPL+QD +EAM          KQ +L+ P T + +SG   GSK T R   KLN+T IQE+RS+NS 
Subjt:  GHNLSSQLRSHLVGGQKIKPTVTNHSSEFVHGFRIPLSQDNEEAMKPPTVETYISKQQKLINPLTQIGQSGYSVGSKVTIRAGTKLNQTRIQERRSQNSS

Query:  GGMIMRPTLLDHPSREVRKEQT-YNKIHLATQPESEFTNSESESASSSSWASQKTLESETTDDPSSPSYQDSPPTTGSEASTRYRSSSSSSSSISTKAFR
        G ++M+PTL  HPSREVRKEQT +N+ HLA Q ESEFTN  SESAS SS A+ +T ESETTDD SSP  Q SP  TGSEAS++Y    +SSS+I+ KAF+
Subjt:  GGMIMRPTLLDHPSREVRKEQT-YNKIHLATQPESEFTNSESESASSSSWASQKTLESETTDDPSSPSYQDSPPTTGSEASTRYRSSSSSSSSISTKAFR

Query:  FSHRKKGSKRAIGRFKRLKNKLGLIFHHH----HHHHHHHNSHNFMWKQLRKIFHHTDNRKLTSNEEKSGKLKKTAIRSRGVSHKNQVGKFQALAEGLRS
        FSH KK S  A+GRFK L+NKLGLIFHHH    HHHHHHH+ HN MWKQ+R +FH TD ++LTS EEK+GKL+KT IRS  VS  NQVGKFQAL EGLRS
Subjt:  FSHRKKGSKRAIGRFKRLKNKLGLIFHHH----HHHHHHHNSHNFMWKQLRKIFHHTDNRKLTSNEEKSGKLKKTAIRSRGVSHKNQVGKFQALAEGLRS

Query:  HVWRSKAMKKKELRGLTFGKKKGVKKLHWWKMFRRRHGMKLSNKGRVRIRYVNRKSQLKVV
        HVW+SKAMKKKE RGL  G     KKLHWWKM RRR G+K  NKGRV+I YVNRK  +K++
Subjt:  HVWRSKAMKKKELRGLTFGKKKGVKKLHWWKMFRRRHGMKLSNKGRVRIRYVNRKSQLKVV

A0A6J1K5J4 uncharacterized protein LOC111491355 isoform X13.6e-15160.35Show/hide
Query:  LSDKMDVDKLYLDLLALRELYILLLKSCLRDANSKL-LDERAQILLKHLLDDATAEVVEFYSKILATDSSISYNFQHKDVKQTKPLDEKVAEWMEHNNQS
        LSDKM+ D+LYLDLLALR+LY  LLK CLRDANS+L +  RA+ILLKHLLDDAT  ++EF+SK LA      YNF  KD KQTKPLDEKVAEWMEH NQ+
Subjt:  LSDKMDVDKLYLDLLALRELYILLLKSCLRDANSKL-LDERAQILLKHLLDDATAEVVEFYSKILATDSSISYNFQHKDVKQTKPLDEKVAEWMEHNNQS

Query:  VRKMGNLE-IEHNPRRARASASNVATNDFSNGISTALRRIEVHILSLQRCTSQSRNNTSNHIGETKLAHSGQSVLQRNETMNQQKVQTRTNHSTLRTGFT
         R+M N E IEH PRR RASASNVA ND S+GI++ALRRIE+HILSLQR        T +HI ETKLA+ GQSV Q NE+ NQQKV              
Subjt:  VRKMGNLE-IEHNPRRARASASNVATNDFSNGISTALRRIEVHILSLQRCTSQSRNNTSNHIGETKLAHSGQSVLQRNETMNQQKVQTRTNHSTLRTGFT

Query:  EPIKGHNLSSQLRSHLVGGQKIKPTVTNHSSEFVHGFRIPLSQDNEEAMKPPTVETYISKQQKLINPLTQIGQSGYSVGSKVTIRAGTKLNQTRIQERRS
                              KP V NH S+FV+GFRIPL+QD +EAM          KQ +L+ P T + +SG   GSK T R   KLN+T IQE+RS
Subjt:  EPIKGHNLSSQLRSHLVGGQKIKPTVTNHSSEFVHGFRIPLSQDNEEAMKPPTVETYISKQQKLINPLTQIGQSGYSVGSKVTIRAGTKLNQTRIQERRS

Query:  QNSSGGMIMRPTLLDHPSREVRKEQT-YNKIHLATQPESEFTNSESESASSSSWASQKTLESETTDDPSSPSYQDSPPTTGSEASTRYRSSSSSSSSIST
        +NS G ++M+PTL  HPSREVRKEQT +N+ HLA Q ESEFTN  SESAS SS A+ +T ESETTDD SSP  Q SP  TGSEAS++Y    +SSS+I+ 
Subjt:  QNSSGGMIMRPTLLDHPSREVRKEQT-YNKIHLATQPESEFTNSESESASSSSWASQKTLESETTDDPSSPSYQDSPPTTGSEASTRYRSSSSSSSSIST

Query:  KAFRFSHRKKGSKRAIGRFKRLKNKLGLIFHHH----HHHHHHHNSHNFMWKQLRKIFHHTDNRKLTSNEEKSGKLKKTAIRSRGVSHKNQVGKFQALAE
        KAF+FSH KK S  A+GRFK L+NKLGLIFHHH    HHHHHHH+ HN MWKQ+R +FH TD ++LTS EEK+GKL+KT IRS  VS  NQVGKFQAL E
Subjt:  KAFRFSHRKKGSKRAIGRFKRLKNKLGLIFHHH----HHHHHHHNSHNFMWKQLRKIFHHTDNRKLTSNEEKSGKLKKTAIRSRGVSHKNQVGKFQALAE

Query:  GLRSHVWRSKAMKKKELRGLTFGKKKGVKKLHWWKMFRRRHGMKLSNKGRVRIRYVNRKSQLKVV
        GLRSHVW+SKAMKKKE RGL  G     KKLHWWKM RRR G+K  NKGRV+I YVNRK  +K++
Subjt:  GLRSHVWRSKAMKKKELRGLTFGKKKGVKKLHWWKMFRRRHGMKLSNKGRVRIRYVNRKSQLKVV

SwissProt top hitse value%identityAlignment
Q9FFP2 Protein KOKOPELLI4.2e-1633.46Show/hide
Query:  IMRPTLLDH-------PSREVRKEQTYNKIHLATQPE----SEFTNSESESASSSSWASQKTLESETTDDPSSPSYQDSPPTTGSEASTRYRSSSSSSSS
        IM+PTL+D         S E   +QT +     ++ E    S+  + E+ S+S S W +Q   ++E+  + S P   D               S S  S+
Subjt:  IMRPTLLDH-------PSREVRKEQTYNKIHLATQPE----SEFTNSESESASSSSWASQKTLESETTDDPSSPSYQDSPPTTGSEASTRYRSSSSSSSS

Query:  ISTKAFRFSHRKKGSKR--AIGRFKRLKNKLGLIFHHHHHHHHHHNSHN----FMWKQLRKIFHHTDNRKLTSNEEKSGKLKKTAIRSRGV-SHK--NQV
              R + R+ G +R   +GRFKR+KNK+G IFHHHHHHHHHH+ H+      W +L+  FHH        ++EKS + K+    S+G+ +HK  +Q 
Subjt:  ISTKAFRFSHRKKGSKR--AIGRFKRLKNKLGLIFHHHHHHHHHHNSHN----FMWKQLRKIFHHTDNRKLTSNEEKSGKLKKTAIRSRGV-SHK--NQV

Query:  GKFQALAEGLRSHVWRSKAMKKKELRGLTFGKKKGVKKLHWWKMFRRRH--GMKLSNKGRVRI
        G F AL EGL  H   SK  K +         K   KK  WWK+ ++R   G+K+  +GRV++
Subjt:  GKFQALAEGLRSHVWRSKAMKKKELRGLTFGKKKGVKKLHWWKMFRRRH--GMKLSNKGRVRI

Arabidopsis top hitse value%identityAlignment
AT5G63720.1 kokopelli3.0e-1733.46Show/hide
Query:  IMRPTLLDH-------PSREVRKEQTYNKIHLATQPE----SEFTNSESESASSSSWASQKTLESETTDDPSSPSYQDSPPTTGSEASTRYRSSSSSSSS
        IM+PTL+D         S E   +QT +     ++ E    S+  + E+ S+S S W +Q   ++E+  + S P   D               S S  S+
Subjt:  IMRPTLLDH-------PSREVRKEQTYNKIHLATQPE----SEFTNSESESASSSSWASQKTLESETTDDPSSPSYQDSPPTTGSEASTRYRSSSSSSSS

Query:  ISTKAFRFSHRKKGSKR--AIGRFKRLKNKLGLIFHHHHHHHHHHNSHN----FMWKQLRKIFHHTDNRKLTSNEEKSGKLKKTAIRSRGV-SHK--NQV
              R + R+ G +R   +GRFKR+KNK+G IFHHHHHHHHHH+ H+      W +L+  FHH        ++EKS + K+    S+G+ +HK  +Q 
Subjt:  ISTKAFRFSHRKKGSKR--AIGRFKRLKNKLGLIFHHHHHHHHHHNSHN----FMWKQLRKIFHHTDNRKLTSNEEKSGKLKKTAIRSRGV-SHK--NQV

Query:  GKFQALAEGLRSHVWRSKAMKKKELRGLTFGKKKGVKKLHWWKMFRRRH--GMKLSNKGRVRI
        G F AL EGL  H   SK  K +         K   KK  WWK+ ++R   G+K+  +GRV++
Subjt:  GKFQALAEGLRSHVWRSKAMKKKELRGLTFGKKKGVKKLHWWKMFRRRH--GMKLSNKGRVRI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCTTTGCTGCATGGGCTATGTTATCAGACAAGATGGATGTTGACAAATTATATCTTGATCTCCTTGCACTGAGGGAATTATACATCCTCCTCTTAAAGAGCTGTTT
GCGAGATGCAAATTCAAAACTTCTGGATGAAAGGGCACAAATTTTGTTGAAGCATTTGCTCGATGATGCTACTGCAGAAGTTGTTGAGTTTTACTCAAAGATCTTGGCAA
CAGACTCTAGCATTTCTTACAACTTTCAACATAAAGACGTAAAACAGACGAAGCCACTGGACGAGAAAGTTGCTGAATGGATGGAACATAATAATCAAAGTGTAAGAAAG
ATGGGAAATCTAGAGATTGAACACAATCCAAGGAGGGCCAGAGCTTCAGCTTCAAATGTTGCCACTAATGACTTCTCAAATGGTATCAGTACAGCACTCAGAAGAATTGA
AGTCCACATCTTATCTCTGCAACGTTGCACAAGTCAAAGTAGAAACAACACAAGCAACCATATCGGTGAAACTAAATTAGCTCACTCTGGGCAGTCTGTTCTTCAAAGGA
ATGAGACAATGAACCAGCAGAAAGTTCAGACAAGGACAAATCACTCAACTTTAAGGACCGGATTTACTGAGCCGATCAAAGGCCATAACTTGAGCAGTCAGTTAAGAAGT
CATCTTGTTGGTGGACAGAAAATTAAGCCGACAGTAACAAACCATTCCTCTGAGTTCGTTCATGGGTTCAGAATACCTCTGAGTCAAGACAATGAAGAGGCCATGAAACC
TCCAACTGTTGAAACTTACATATCTAAACAACAAAAACTTATAAATCCATTGACTCAGATAGGTCAATCTGGATATTCAGTGGGATCCAAGGTGACCATCAGAGCCGGTA
CAAAACTGAATCAAACTCGAATACAAGAAAGGAGGAGCCAGAATTCGTCTGGTGGTATGATAATGAGGCCAACTTTGTTGGATCATCCCTCTAGAGAAGTAAGAAAGGAA
CAAACTTATAATAAGATCCATTTGGCCACTCAGCCGGAATCAGAATTCACAAACTCAGAATCAGAATCAGCTTCTTCTTCAAGTTGGGCAAGTCAAAAGACACTTGAGAG
TGAAACCACTGATGACCCTTCTTCCCCGAGTTACCAAGACAGTCCACCGACAACCGGTTCAGAGGCTAGTACCCGGTACCGAAGCAGCAGTAGCAGCAGTAGCAGCATTT
CAACAAAAGCATTCAGATTCAGCCACAGGAAAAAAGGGTCCAAGAGAGCAATAGGACGGTTCAAGAGACTCAAAAACAAACTAGGCCTTATCTTTCACCACCACCATCAC
CATCACCACCACCATAACAGCCATAACTTCATGTGGAAGCAGCTAAGAAAGATTTTCCATCACACAGATAACCGAAAACTAACAAGTAACGAAGAAAAATCTGGGAAGCT
AAAGAAGACAGCAATCAGATCCAGAGGTGTGTCCCATAAGAACCAAGTTGGGAAATTTCAGGCACTTGCTGAAGGGCTTCGAAGCCATGTATGGAGATCAAAAGCCATGA
AGAAGAAAGAGCTTAGGGGGCTGACTTTTGGGAAGAAGAAGGGTGTGAAGAAGTTGCATTGGTGGAAAATGTTTCGTCGTCGCCATGGAATGAAGTTGTCCAATAAAGGG
CGTGTGAGAATCAGGTATGTAAATAGAAAATCACAGCTTAAGGTAGTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGATCTTTGCTGCATGGGCTATGTTATCAGACAAGATGGATGTTGACAAATTATATCTTGATCTCCTTGCACTGAGGGAATTATACATCCTCCTCTTAAAGAGCTGTTT
GCGAGATGCAAATTCAAAACTTCTGGATGAAAGGGCACAAATTTTGTTGAAGCATTTGCTCGATGATGCTACTGCAGAAGTTGTTGAGTTTTACTCAAAGATCTTGGCAA
CAGACTCTAGCATTTCTTACAACTTTCAACATAAAGACGTAAAACAGACGAAGCCACTGGACGAGAAAGTTGCTGAATGGATGGAACATAATAATCAAAGTGTAAGAAAG
ATGGGAAATCTAGAGATTGAACACAATCCAAGGAGGGCCAGAGCTTCAGCTTCAAATGTTGCCACTAATGACTTCTCAAATGGTATCAGTACAGCACTCAGAAGAATTGA
AGTCCACATCTTATCTCTGCAACGTTGCACAAGTCAAAGTAGAAACAACACAAGCAACCATATCGGTGAAACTAAATTAGCTCACTCTGGGCAGTCTGTTCTTCAAAGGA
ATGAGACAATGAACCAGCAGAAAGTTCAGACAAGGACAAATCACTCAACTTTAAGGACCGGATTTACTGAGCCGATCAAAGGCCATAACTTGAGCAGTCAGTTAAGAAGT
CATCTTGTTGGTGGACAGAAAATTAAGCCGACAGTAACAAACCATTCCTCTGAGTTCGTTCATGGGTTCAGAATACCTCTGAGTCAAGACAATGAAGAGGCCATGAAACC
TCCAACTGTTGAAACTTACATATCTAAACAACAAAAACTTATAAATCCATTGACTCAGATAGGTCAATCTGGATATTCAGTGGGATCCAAGGTGACCATCAGAGCCGGTA
CAAAACTGAATCAAACTCGAATACAAGAAAGGAGGAGCCAGAATTCGTCTGGTGGTATGATAATGAGGCCAACTTTGTTGGATCATCCCTCTAGAGAAGTAAGAAAGGAA
CAAACTTATAATAAGATCCATTTGGCCACTCAGCCGGAATCAGAATTCACAAACTCAGAATCAGAATCAGCTTCTTCTTCAAGTTGGGCAAGTCAAAAGACACTTGAGAG
TGAAACCACTGATGACCCTTCTTCCCCGAGTTACCAAGACAGTCCACCGACAACCGGTTCAGAGGCTAGTACCCGGTACCGAAGCAGCAGTAGCAGCAGTAGCAGCATTT
CAACAAAAGCATTCAGATTCAGCCACAGGAAAAAAGGGTCCAAGAGAGCAATAGGACGGTTCAAGAGACTCAAAAACAAACTAGGCCTTATCTTTCACCACCACCATCAC
CATCACCACCACCATAACAGCCATAACTTCATGTGGAAGCAGCTAAGAAAGATTTTCCATCACACAGATAACCGAAAACTAACAAGTAACGAAGAAAAATCTGGGAAGCT
AAAGAAGACAGCAATCAGATCCAGAGGTGTGTCCCATAAGAACCAAGTTGGGAAATTTCAGGCACTTGCTGAAGGGCTTCGAAGCCATGTATGGAGATCAAAAGCCATGA
AGAAGAAAGAGCTTAGGGGGCTGACTTTTGGGAAGAAGAAGGGTGTGAAGAAGTTGCATTGGTGGAAAATGTTTCGTCGTCGCCATGGAATGAAGTTGTCCAATAAAGGG
CGTGTGAGAATCAGGTATGTAAATAGAAAATCACAGCTTAAGGTAGTTTAG
Protein sequenceShow/hide protein sequence
MIFAAWAMLSDKMDVDKLYLDLLALRELYILLLKSCLRDANSKLLDERAQILLKHLLDDATAEVVEFYSKILATDSSISYNFQHKDVKQTKPLDEKVAEWMEHNNQSVRK
MGNLEIEHNPRRARASASNVATNDFSNGISTALRRIEVHILSLQRCTSQSRNNTSNHIGETKLAHSGQSVLQRNETMNQQKVQTRTNHSTLRTGFTEPIKGHNLSSQLRS
HLVGGQKIKPTVTNHSSEFVHGFRIPLSQDNEEAMKPPTVETYISKQQKLINPLTQIGQSGYSVGSKVTIRAGTKLNQTRIQERRSQNSSGGMIMRPTLLDHPSREVRKE
QTYNKIHLATQPESEFTNSESESASSSSWASQKTLESETTDDPSSPSYQDSPPTTGSEASTRYRSSSSSSSSISTKAFRFSHRKKGSKRAIGRFKRLKNKLGLIFHHHHH
HHHHHNSHNFMWKQLRKIFHHTDNRKLTSNEEKSGKLKKTAIRSRGVSHKNQVGKFQALAEGLRSHVWRSKAMKKKELRGLTFGKKKGVKKLHWWKMFRRRHGMKLSNKG
RVRIRYVNRKSQLKVV