; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg037853 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg037853
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionChlorophyllase
Genome locationscaffold12:43048040..43051147
RNA-Seq ExpressionSpg037853
SyntenySpg037853
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002110 - Ankyrin repeat
IPR020683 - Ankyrin repeat-containing domain
IPR026961 - PGG domain
IPR036770 - Ankyrin repeat-containing domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606412.1 NF-kappa-B inhibitor alpha, partial [Cucurbita argyrosperma subsp. sororia]4.6e-20151.65Show/hide
Query:  NNNNNELREFLYAHTKRANWGAVIQKYAQHPEA-----------------------------HGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGH
        N+ N      LY H KR  W  VI KY ++ E                              +  KL   GDT LHLAV+ NQE  VE LV         
Subjt:  NNNNNELREFLYAHTKRANWGAVIQKYAQHPEA-----------------------------HGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGH

Query:  NLERLEIKNDRQNNPLHLAAAMGSVRMCHAIASAHGGLLDDRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRLKNNGDTVLHCAIGNEYFD
        ++E LEI N+R+NNPLHLAA MGSVRMC AIAS H GL+D RN DD+TPLFLAA YG +DAF+CLY FC   P RI+ NCR+K +GDTVLH A+ NE+FD
Subjt:  NLERLEIKNDRQNNPLHLAAAMGSVRMCHAIASAHGGLLDDRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRLKNNGDTVLHCAIGNEYFD

Query:  LAFQLMHLNVDSMNWANEHGRTPLHILASKPASFKSGSNIRGWRNLVYHCIFVDKLKPQSIETLH----------RDQEYWKRSWSERKTAIFFPAIYET
        LAFQL++++ ++++W NE G TP+H+LASKP SFKSGS+I+GW+++VYHC FV  L+P+SI+ L           RD++  K+  +  K    FP  Y+T
Subjt:  LAFQLMHLNVDSMNWANEHGRTPLHILASKPASFKSGSNIRGWRNLVYHCIFVDKLKPQSIETLH----------RDQEYWKRSWSERKTAIFFPAIYET

Query:  CIYFFTRLWDGLLKLGWYQININLSLPFSTLVNK---KKSSDEANKDKDLEKGEKAERISNPPTITNSPENYSTCVDLFQIILSAIITVLGHGSTTIRKI
        CI F   + D +L +  ++ + N          K   K   D+ N D + +K    +  + PPT T+ P NY+TC+D F I  SAI+ +LG GST I+KI
Subjt:  CIYFFTRLWDGLLKLGWYQININLSLPFSTLVNK---KKSSDEANKDKDLEKGEKAERISNPPTITNSPENYSTCVDLFQIILSAIITVLGHGSTTIRKI

Query:  REMKQKHSWSVQVMEKLIDPQFVTPDSH---GGGRPDYEDSGMESA--EAPYTYTLEEDKFSTSIGVKSE-------DVTESAMLLAAKNGVVEIVKELY
        R+ K+KH+WSVQVMEKL+  +   PD +   G    DY+    E++    PY +   E +FS  I    E       + +ESAMLLAA+NGV+EIVK  +
Subjt:  REMKQKHSWSVQVMEKLIDPQFVTPDSH---GGGRPDYEDSGMESA--EAPYTYTLEEDKFSTSIGVKSE-------DVTESAMLLAAKNGVVEIVKELY

Query:  DCDPRTIYETRKSDEKNVVHLAAEYRQWNLYKFLLDRKYSLQPLFRAVDLNGNSVLHLAASRSAPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYN
        +  P  I +TRK D+KNVV LAAE+RQ ++Y+FLL +   +Q LFRAVD  GNS LHLAA+   PKLW+ITGAALQ+Q EV+WY +VK S+P+HFFP +N
Subjt:  DCDPRTIYETRKSDEKNVVHLAAEYRQWNLYKFLLDRKYSLQPLFRAVDLNGNSVLHLAASRSAPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYN

Query:  KIGETASRIFEDTHMDLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGGNNDHGLATLKQEQAFSIFVISSLIALCLSTTSVIMFLAILTARFEIR
          G+TAS IF +TH +L  KG  WL  TS+SCS+VATLIATVAFATAATIPGGN++ G A L  EQ F IF  SSLIALCLS+TSVIMFLAI+T+RF+I+
Subjt:  KIGETASRIFEDTHMDLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGGNNDHGLATLKQEQAFSIFVISSLIALCLSTTSVIMFLAILTARFEIR

Query:  EYFTSMLPWKLLVALSSLYFSIIAMLISFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMP-TSVEVDLCD
        + F   LPWKLL+ L  LYFSIIAML+SFCSGHYFL+ +RL N+A++LYTLTF PVA +FGIVQLPLYFDLL+ L+K +P  S EV LCD
Subjt:  EYFTSMLPWKLLVALSSLYFSIIAMLISFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMP-TSVEVDLCD

KAG6606413.1 Chlorophyllase type 0, partial [Cucurbita argyrosperma subsp. sororia]2.2e-19551.18Show/hide
Query:  LREFLYAHTKRANWGAVIQKYAQHPEAHGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGHNLERLEIKNDRQNNPLHLAAAMGSVRMCHAIASAH
        LR+FLY + KR NW  VI KY +HPEA GLKLT  GDTALHLAV+ N+EE+V+ LV  I ++    L  LE  NDR+ NPLHLAA MGS  MC+AIASAH
Subjt:  LREFLYAHTKRANWGAVIQKYAQHPEAHGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGHNLERLEIKNDRQNNPLHLAAAMGSVRMCHAIASAH

Query:  GGLLDDRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRLKNNGDTVLHCAIGNEYFDLAFQLMHLNVDSMNWANEHGRTPLHILASKPASFK
          L+++RN  D+TPL+LAA  G RDAF+CLY FC D    IT NCRL +NGDTVLH A+ N++FDLAF ++HLN ++M+W  + G TPLH+LASKP +FK
Subjt:  GGLLDDRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRLKNNGDTVLHCAIGNEYFDLAFQLMHLNVDSMNWANEHGRTPLHILASKPASFK

Query:  SGSNIRGWRNLVYHCIFVDKLKPQSIETLHRDQEYWKRSWSERKTAI-FFPAIYETCIYFFTRLWDGLLK-LGWYQININLSLPFSTLVNKKKSSDEANK
        SGS IRGWRN+ Y+C  V++LKPQ I++L RD   W    S   T+   FPA Y  CI FFT +WDG LK  G  +I  +     S     KK +D+A +
Subjt:  SGSNIRGWRNLVYHCIFVDKLKPQSIETLHRDQEYWKRSWSERKTAI-FFPAIYETCIYFFTRLWDGLLK-LGWYQININLSLPFSTLVNKKKSSDEANK

Query:  DKDLEKGE----------------KAERISNPPTITNSPENYSTCVDLFQIILSAIITVLGHGSTTIRKIREMKQKHSWSVQVMEKLIDPQFVTPDSH--
        +  +E GE                KA  ++   +ITN P NY+TC+  FQI+ SAI+  LG GS   +KIR  K+KH+WSVQVMEKL+  ++  PD +  
Subjt:  DKDLEKGE----------------KAERISNPPTITNSPENYSTCVDLFQIILSAIITVLGHGSTTIRKIREMKQKHSWSVQVMEKLIDPQFVTPDSH--

Query:  GGGRP-DYEDSGMESAEAPYTYTLEEDKFSTSIGVKS--------EDVTESAMLLAAKNGVVEIVKELYDCDPRTIYETRKSDEKNVVHLAAEYRQWNLY
         GG P D      + A+    Y+ E+D    S+ ++S        +D  E+ MLLAAKNGV+EIVK ++   P +IY+  K D+KNVV LAAEY Q ++Y
Subjt:  GGGRP-DYEDSGMESAEAPYTYTLEEDKFSTSIGVKS--------EDVTESAMLLAAKNGVVEIVKELYDCDPRTIYETRKSDEKNVVHLAAEYRQWNLY

Query:  KFLLDRKYSLQPLFRAVDLNGNSVLHLAASRSAPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNKIGETASRIFEDTHMDLEQKGKDWLCVTSQS
        +FLL      + LFRAVD NGNS LHLAA+ S   +W+ITGAALQ+Q E++WYKFV+ES+P++FF  YNK G+ A+ IF +THMDL QK  DWL  TS+S
Subjt:  KFLLDRKYSLQPLFRAVDLNGNSVLHLAASRSAPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNKIGETASRIFEDTHMDLEQKGKDWLCVTSQS

Query:  CSVVATLIATVAFATAATIPGGNNDH-GLATLKQEQAFSIFVISSLIALCLSTTSVIMFLAILTARFEIREYFTSMLPWKLLVALSSLYFSIIAMLISFC
        CSVV  LI TVAF + A+IPGG N   G   L+  +AF  F + SLIALCLS+TSV +FLAILT RF+  ++ T+ LPWKL +  SSL+ SII+MLISFC
Subjt:  CSVVATLIATVAFATAATIPGGNNDH-GLATLKQEQAFSIFVISSLIALCLSTTSVIMFLAILTARFEIREYFTSMLPWKLLVALSSLYFSIIAMLISFC

Query:  SGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMP-TSVEVDLCD
        +GHYFL+ R + + A +LYT+  +PVA +F I +LPLY D+++ + KI+P  S  V L D
Subjt:  SGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMP-TSVEVDLCD

XP_022931018.1 uncharacterized protein LOC111437341 isoform X1 [Cucurbita moschata]2.7e-20151.41Show/hide
Query:  NNNNNELREFLYAHTKRANWGAVIQKYAQHPEA-----------------------------HGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGH
        N+ N      LY H KR  W  VI KY ++ E                              +  KL   GDT LHLAV+ NQE IVE LV+++      
Subjt:  NNNNNELREFLYAHTKRANWGAVIQKYAQHPEA-----------------------------HGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGH

Query:  NLERLEIKNDRQNNPLHLAAAMGSVRMCHAIASAHGGLLDDRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRLKNNGDTVLHCAIGNEYFD
          E LEI N+  NN LHLAA MGSVRMC AIAS H GL+D RN DD+TPLFLAA YG +DAF+CLY+FC DD  RI+ NCR+K +GDTVLH A+ +E+FD
Subjt:  NLERLEIKNDRQNNPLHLAAAMGSVRMCHAIASAHGGLLDDRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRLKNNGDTVLHCAIGNEYFD

Query:  LAFQLMHLNVDSMNWANEHGRTPLHILASKPASFKSGSNIRGWRNLVYHCIFVDKLKPQSIETLHRDQEYWKRSWSERKTAIFFPAIYETCIYFFTRLWD
        LAFQL++++ ++++W NE G TP+H+LA KP SFKSGS+I+GW+++VYHC FV  L+P+S E L ++ E       E      FP  Y+TCI+FF  + D
Subjt:  LAFQLMHLNVDSMNWANEHGRTPLHILASKPASFKSGSNIRGWRNLVYHCIFVDKLKPQSIETLHRDQEYWKRSWSERKTAIFFPAIYETCIYFFTRLWD

Query:  GLLKLGWYQININLSLPFSTLVNK---KKSSDEANKDKDLEKGEKAERISNPPTITNSPENYSTCVDLFQIILSAIITVLGHGSTTIRKIREMKQKHSWS
         +L    ++ + N          K   K   D+ N D + +K    +  + PPT T+ P NY+TC+D F I  SAI+ +LG GST I+KIR+ K+KH+WS
Subjt:  GLLKLGWYQININLSLPFSTLVNK---KKSSDEANKDKDLEKGEKAERISNPPTITNSPENYSTCVDLFQIILSAIITVLGHGSTTIRKIREMKQKHSWS

Query:  VQVMEKLI---DPQFVTPDSHGGGRPDYEDSGMESAEAPYTYTLEEDKFSTSIGVKSE-------DVTESAMLLAAKNGVVEIVKELYDCDPRTIYETRK
        VQVM KL+   +P   T D +      + +        PY +   E +FS       E       +  ESAMLLAA+NGV+EIVK   +  P  I +TRK
Subjt:  VQVMEKLI---DPQFVTPDSHGGGRPDYEDSGMESAEAPYTYTLEEDKFSTSIGVKSE-------DVTESAMLLAAKNGVVEIVKELYDCDPRTIYETRK

Query:  SDEKNVVHLAAEYRQWNLYKFLLDRKYSLQPLFRAVDLNGNSVLHLAASRSAPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNKIGETASRIFED
         D+KNVV LAAE+RQ ++Y+ LL +   +Q LFRAVD  GNS LHLAA+   PKLW+ITGAALQ+Q EV+WYK+VK S+P+HFFP +N  G+TAS IF +
Subjt:  SDEKNVVHLAAEYRQWNLYKFLLDRKYSLQPLFRAVDLNGNSVLHLAASRSAPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNKIGETASRIFED

Query:  THMDLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGGNNDHGLATLKQEQAFSIFVISSLIALCLSTTSVIMFLAILTARFEIREYFTSMLPWKLL
        TH +L  KG  WL  TS+SCS+VATLIATVAFATAATIPGGN++ G A L  EQ F IF  SSLIALCLS+TSVIMFLAI+T+RF+I++ F   LPWKLL
Subjt:  THMDLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGGNNDHGLATLKQEQAFSIFVISSLIALCLSTTSVIMFLAILTARFEIREYFTSMLPWKLL

Query:  VALSSLYFSIIAMLISFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMP-TSVEVDLCD
        + L  LYFSIIAML+SFCSGHYFL+ +RL N+A++LYTLTF PVA +FGIVQLPLYFDLL+ L+K +P  S EV LCD
Subjt:  VALSSLYFSIIAMLISFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMP-TSVEVDLCD

XP_022931019.1 uncharacterized protein LOC111437341 isoform X2 [Cucurbita moschata]1.5e-19951.16Show/hide
Query:  NNNNNELREFLYAHTKRANWGAVIQKYAQHPEA-----------------------------HGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGH
        N+ N      LY H KR  W  VI KY ++ E                              +  KL   GDT LHLAV+ NQE IVE LV+++      
Subjt:  NNNNNELREFLYAHTKRANWGAVIQKYAQHPEA-----------------------------HGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGH

Query:  NLERLEIKNDRQNNPLHLAAAMGSVRMCHAIASAHGGLLDDRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRLKNNGDTVLHCAIGNEYFD
          E LEI N+  NN LHLAA MGSVRMC AIAS H GL+D RN DD+TPLFLAA YG +DAF+CLY+FC DD  RI+ NCR+K +GDTVLH A+ +E+FD
Subjt:  NLERLEIKNDRQNNPLHLAAAMGSVRMCHAIASAHGGLLDDRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRLKNNGDTVLHCAIGNEYFD

Query:  LAFQLMHLNVDSMNWANEHGRTPLHILASKPASFKSGSNIRGWRNLVYHCIFVDKLKPQSIETLHRDQEYWKRSWSERKTAIFFPAIYETCIYFFTRLWD
        LAFQL++++ ++++W NE G TP+H+LA KP SFKSGS+I+GW+++VYHC FV  L+P+S E L ++ E       E      FP  Y+TCI+FF  + D
Subjt:  LAFQLMHLNVDSMNWANEHGRTPLHILASKPASFKSGSNIRGWRNLVYHCIFVDKLKPQSIETLHRDQEYWKRSWSERKTAIFFPAIYETCIYFFTRLWD

Query:  GLLKLGWYQININLSLPFSTLVNKKKSSDEANKDKDLEKGE-KAERISNPPTITNSPENYSTCVDLFQIILSAIITVLGHGSTTIRKIREMKQKHSWSVQ
         +L    ++ + N      T  +     + A  D D   G+ + ++  N  + T+ P NY+TC+D F I  SAI+ +LG GST I+KIR+ K+KH+WSVQ
Subjt:  GLLKLGWYQININLSLPFSTLVNKKKSSDEANKDKDLEKGE-KAERISNPPTITNSPENYSTCVDLFQIILSAIITVLGHGSTTIRKIREMKQKHSWSVQ

Query:  VMEKLI---DPQFVTPDSHGGGRPDYEDSGMESAEAPYTYTLEEDKFSTSIGVKSE-------DVTESAMLLAAKNGVVEIVKELYDCDPRTIYETRKSD
        VM KL+   +P   T D +      + +        PY +   E +FS       E       +  ESAMLLAA+NGV+EIVK   +  P  I +TRK D
Subjt:  VMEKLI---DPQFVTPDSHGGGRPDYEDSGMESAEAPYTYTLEEDKFSTSIGVKSE-------DVTESAMLLAAKNGVVEIVKELYDCDPRTIYETRKSD

Query:  EKNVVHLAAEYRQWNLYKFLLDRKYSLQPLFRAVDLNGNSVLHLAASRSAPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNKIGETASRIFEDTH
        +KNVV LAAE+RQ ++Y+ LL +   +Q LFRAVD  GNS LHLAA+   PKLW+ITGAALQ+Q EV+WYK+VK S+P+HFFP +N  G+TAS IF +TH
Subjt:  EKNVVHLAAEYRQWNLYKFLLDRKYSLQPLFRAVDLNGNSVLHLAASRSAPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNKIGETASRIFEDTH

Query:  MDLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGGNNDHGLATLKQEQAFSIFVISSLIALCLSTTSVIMFLAILTARFEIREYFTSMLPWKLLVA
         +L  KG  WL  TS+SCS+VATLIATVAFATAATIPGGN++ G A L  EQ F IF  SSLIALCLS+TSVIMFLAI+T+RF+I++ F   LPWKLL+ 
Subjt:  MDLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGGNNDHGLATLKQEQAFSIFVISSLIALCLSTTSVIMFLAILTARFEIREYFTSMLPWKLLVA

Query:  LSSLYFSIIAMLISFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMP-TSVEVDLCD
        L  LYFSIIAML+SFCSGHYFL+ +RL N+A++LYTLTF PVA +FGIVQLPLYFDLL+ L+K +P  S EV LCD
Subjt:  LSSLYFSIIAMLISFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMP-TSVEVDLCD

XP_023534309.1 uncharacterized protein LOC111795905 [Cucurbita pepo subsp. pepo]2.4e-19751.59Show/hide
Query:  NNNNNELREFLYAHTKRANWGAVIQKYAQ-------------------HPEAHGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGHNL--------
        N+ N      LYA+ KR  W  VI+KY +                   +P A   KL   GDT LHLAV+ NQE  VE LVR I E P + L        
Subjt:  NNNNNELREFLYAHTKRANWGAVIQKYAQ-------------------HPEAHGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGHNL--------

Query:  -----------ERLEIKNDRQNNPLHLAAAMGSVRMCHAIASAHGGLLDDRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRLKNNGDTVLH
                   + LE  N+R NNPLHLAA MGSVRMC AI S H GL+D RN DD+TPLFLAA YG +DAF+CLY+FC DD  RI+ NCR+K +GDTVLH
Subjt:  -----------ERLEIKNDRQNNPLHLAAAMGSVRMCHAIASAHGGLLDDRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRLKNNGDTVLH

Query:  CAIGNEYFDLAFQLMHLNVDSMNWANEHGRTPLHILASKPASFKSGSNIRGWRNLVYHCIFVDKLKPQSIETLHRDQEYWKRSWSERKTAIFFPAIYETC
         A+ NE+ DLAFQL++++ ++++W NE G TP+HILASKP SFKSGS+I+GW+++VYHC FV  LK    + LH   +  ++  S       FP  Y TC
Subjt:  CAIGNEYFDLAFQLMHLNVDSMNWANEHGRTPLHILASKPASFKSGSNIRGWRNLVYHCIFVDKLKPQSIETLHRDQEYWKRSWSERKTAIFFPAIYETC

Query:  IYFFTRLWDGLLKLGWYQININLSLPFSTLVNKKKSSDEANKDKDLEKGE-KAERISNPPTITNSPENYSTCVDLFQIILSAIITVLGHGSTTIRKIREM
        I F+  + D +L + W+    N +    T  +  K  ++   D D   G+ + ++  N  + TN P NY+TC+D F I  SAI+ +LG GST I+KIR+ 
Subjt:  IYFFTRLWDGLLKLGWYQININLSLPFSTLVNKKKSSDEANKDKDLEKGE-KAERISNPPTITNSPENYSTCVDLFQIILSAIITVLGHGSTTIRKIREM

Query:  KQKHSWSVQVMEKLIDPQFVTPDSHGGGRPDYEDSGMESAEA-----PYTYTLEEDKFSTSIGVKSE-------DVTESAMLLAAKNGVVEIVKELYDCD
        K+KH+WSVQVMEKL+  +   PD +        DS  ++ EA     PY +   E +FS  I    E       +  ESAMLLAAKNGV+EIVK   +  
Subjt:  KQKHSWSVQVMEKLIDPQFVTPDSHGGGRPDYEDSGMESAEA-----PYTYTLEEDKFSTSIGVKSE-------DVTESAMLLAAKNGVVEIVKELYDCD

Query:  PRTIYETRKSDEKNVVHLAAEYRQWNLYKFLLDRKYSLQPLFRAVDLNGNSVLHLAASRSAPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNKIG
        P  I +T K D+KNVV LAAE+RQ ++Y+F L R+  +  LFRAVD  GNS LHLAA+   PKLW+ITGAALQ+Q EV+WY +VK S+P+HFFP +N  G
Subjt:  PRTIYETRKSDEKNVVHLAAEYRQWNLYKFLLDRKYSLQPLFRAVDLNGNSVLHLAASRSAPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNKIG

Query:  ETASRIFEDTHMDLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGGNNDHGLATLKQEQAFSIFVISSLIALCLSTTSVIMFLAILTARFEIREYF
        +TAS IF +TH +L  KG  WL  TS+SCS+VATLIATVAFATAATIPGGN++ G A L  EQ F IF  SSLIALCLS+TSVIMFLAI+T+RF+I++ F
Subjt:  ETASRIFEDTHMDLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGGNNDHGLATLKQEQAFSIFVISSLIALCLSTTSVIMFLAILTARFEIREYF

Query:  TSMLPWKLLVALSSLYFSIIAMLISFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMP-TSVEVDLCD
           LPWKLL+ L  LYFSIIAML+SFCSGHYFL+ +RL N+A++LYTLTF PVA +FGIVQLPLYFDLL+ L+K +P  S EV LCD
Subjt:  TSMLPWKLLVALSSLYFSIIAMLISFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMP-TSVEVDLCD

TrEMBL top hitse value%identityAlignment
A0A6J1ET50 uncharacterized protein LOC111437341 isoform X27.2e-20051.16Show/hide
Query:  NNNNNELREFLYAHTKRANWGAVIQKYAQHPEA-----------------------------HGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGH
        N+ N      LY H KR  W  VI KY ++ E                              +  KL   GDT LHLAV+ NQE IVE LV+++      
Subjt:  NNNNNELREFLYAHTKRANWGAVIQKYAQHPEA-----------------------------HGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGH

Query:  NLERLEIKNDRQNNPLHLAAAMGSVRMCHAIASAHGGLLDDRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRLKNNGDTVLHCAIGNEYFD
          E LEI N+  NN LHLAA MGSVRMC AIAS H GL+D RN DD+TPLFLAA YG +DAF+CLY+FC DD  RI+ NCR+K +GDTVLH A+ +E+FD
Subjt:  NLERLEIKNDRQNNPLHLAAAMGSVRMCHAIASAHGGLLDDRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRLKNNGDTVLHCAIGNEYFD

Query:  LAFQLMHLNVDSMNWANEHGRTPLHILASKPASFKSGSNIRGWRNLVYHCIFVDKLKPQSIETLHRDQEYWKRSWSERKTAIFFPAIYETCIYFFTRLWD
        LAFQL++++ ++++W NE G TP+H+LA KP SFKSGS+I+GW+++VYHC FV  L+P+S E L ++ E       E      FP  Y+TCI+FF  + D
Subjt:  LAFQLMHLNVDSMNWANEHGRTPLHILASKPASFKSGSNIRGWRNLVYHCIFVDKLKPQSIETLHRDQEYWKRSWSERKTAIFFPAIYETCIYFFTRLWD

Query:  GLLKLGWYQININLSLPFSTLVNKKKSSDEANKDKDLEKGE-KAERISNPPTITNSPENYSTCVDLFQIILSAIITVLGHGSTTIRKIREMKQKHSWSVQ
         +L    ++ + N      T  +     + A  D D   G+ + ++  N  + T+ P NY+TC+D F I  SAI+ +LG GST I+KIR+ K+KH+WSVQ
Subjt:  GLLKLGWYQININLSLPFSTLVNKKKSSDEANKDKDLEKGE-KAERISNPPTITNSPENYSTCVDLFQIILSAIITVLGHGSTTIRKIREMKQKHSWSVQ

Query:  VMEKLI---DPQFVTPDSHGGGRPDYEDSGMESAEAPYTYTLEEDKFSTSIGVKSE-------DVTESAMLLAAKNGVVEIVKELYDCDPRTIYETRKSD
        VM KL+   +P   T D +      + +        PY +   E +FS       E       +  ESAMLLAA+NGV+EIVK   +  P  I +TRK D
Subjt:  VMEKLI---DPQFVTPDSHGGGRPDYEDSGMESAEAPYTYTLEEDKFSTSIGVKSE-------DVTESAMLLAAKNGVVEIVKELYDCDPRTIYETRKSD

Query:  EKNVVHLAAEYRQWNLYKFLLDRKYSLQPLFRAVDLNGNSVLHLAASRSAPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNKIGETASRIFEDTH
        +KNVV LAAE+RQ ++Y+ LL +   +Q LFRAVD  GNS LHLAA+   PKLW+ITGAALQ+Q EV+WYK+VK S+P+HFFP +N  G+TAS IF +TH
Subjt:  EKNVVHLAAEYRQWNLYKFLLDRKYSLQPLFRAVDLNGNSVLHLAASRSAPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNKIGETASRIFEDTH

Query:  MDLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGGNNDHGLATLKQEQAFSIFVISSLIALCLSTTSVIMFLAILTARFEIREYFTSMLPWKLLVA
         +L  KG  WL  TS+SCS+VATLIATVAFATAATIPGGN++ G A L  EQ F IF  SSLIALCLS+TSVIMFLAI+T+RF+I++ F   LPWKLL+ 
Subjt:  MDLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGGNNDHGLATLKQEQAFSIFVISSLIALCLSTTSVIMFLAILTARFEIREYFTSMLPWKLLVA

Query:  LSSLYFSIIAMLISFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMP-TSVEVDLCD
        L  LYFSIIAML+SFCSGHYFL+ +RL N+A++LYTLTF PVA +FGIVQLPLYFDLL+ L+K +P  S EV LCD
Subjt:  LSSLYFSIIAMLISFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMP-TSVEVDLCD

A0A6J1EX64 uncharacterized protein LOC111437341 isoform X11.3e-20151.41Show/hide
Query:  NNNNNELREFLYAHTKRANWGAVIQKYAQHPEA-----------------------------HGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGH
        N+ N      LY H KR  W  VI KY ++ E                              +  KL   GDT LHLAV+ NQE IVE LV+++      
Subjt:  NNNNNELREFLYAHTKRANWGAVIQKYAQHPEA-----------------------------HGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGH

Query:  NLERLEIKNDRQNNPLHLAAAMGSVRMCHAIASAHGGLLDDRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRLKNNGDTVLHCAIGNEYFD
          E LEI N+  NN LHLAA MGSVRMC AIAS H GL+D RN DD+TPLFLAA YG +DAF+CLY+FC DD  RI+ NCR+K +GDTVLH A+ +E+FD
Subjt:  NLERLEIKNDRQNNPLHLAAAMGSVRMCHAIASAHGGLLDDRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRLKNNGDTVLHCAIGNEYFD

Query:  LAFQLMHLNVDSMNWANEHGRTPLHILASKPASFKSGSNIRGWRNLVYHCIFVDKLKPQSIETLHRDQEYWKRSWSERKTAIFFPAIYETCIYFFTRLWD
        LAFQL++++ ++++W NE G TP+H+LA KP SFKSGS+I+GW+++VYHC FV  L+P+S E L ++ E       E      FP  Y+TCI+FF  + D
Subjt:  LAFQLMHLNVDSMNWANEHGRTPLHILASKPASFKSGSNIRGWRNLVYHCIFVDKLKPQSIETLHRDQEYWKRSWSERKTAIFFPAIYETCIYFFTRLWD

Query:  GLLKLGWYQININLSLPFSTLVNK---KKSSDEANKDKDLEKGEKAERISNPPTITNSPENYSTCVDLFQIILSAIITVLGHGSTTIRKIREMKQKHSWS
         +L    ++ + N          K   K   D+ N D + +K    +  + PPT T+ P NY+TC+D F I  SAI+ +LG GST I+KIR+ K+KH+WS
Subjt:  GLLKLGWYQININLSLPFSTLVNK---KKSSDEANKDKDLEKGEKAERISNPPTITNSPENYSTCVDLFQIILSAIITVLGHGSTTIRKIREMKQKHSWS

Query:  VQVMEKLI---DPQFVTPDSHGGGRPDYEDSGMESAEAPYTYTLEEDKFSTSIGVKSE-------DVTESAMLLAAKNGVVEIVKELYDCDPRTIYETRK
        VQVM KL+   +P   T D +      + +        PY +   E +FS       E       +  ESAMLLAA+NGV+EIVK   +  P  I +TRK
Subjt:  VQVMEKLI---DPQFVTPDSHGGGRPDYEDSGMESAEAPYTYTLEEDKFSTSIGVKSE-------DVTESAMLLAAKNGVVEIVKELYDCDPRTIYETRK

Query:  SDEKNVVHLAAEYRQWNLYKFLLDRKYSLQPLFRAVDLNGNSVLHLAASRSAPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNKIGETASRIFED
         D+KNVV LAAE+RQ ++Y+ LL +   +Q LFRAVD  GNS LHLAA+   PKLW+ITGAALQ+Q EV+WYK+VK S+P+HFFP +N  G+TAS IF +
Subjt:  SDEKNVVHLAAEYRQWNLYKFLLDRKYSLQPLFRAVDLNGNSVLHLAASRSAPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNKIGETASRIFED

Query:  THMDLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGGNNDHGLATLKQEQAFSIFVISSLIALCLSTTSVIMFLAILTARFEIREYFTSMLPWKLL
        TH +L  KG  WL  TS+SCS+VATLIATVAFATAATIPGGN++ G A L  EQ F IF  SSLIALCLS+TSVIMFLAI+T+RF+I++ F   LPWKLL
Subjt:  THMDLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGGNNDHGLATLKQEQAFSIFVISSLIALCLSTTSVIMFLAILTARFEIREYFTSMLPWKLL

Query:  VALSSLYFSIIAMLISFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMP-TSVEVDLCD
        + L  LYFSIIAML+SFCSGHYFL+ +RL N+A++LYTLTF PVA +FGIVQLPLYFDLL+ L+K +P  S EV LCD
Subjt:  VALSSLYFSIIAMLISFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMP-TSVEVDLCD

A0A6J1JZF8 uncharacterized protein LOC111491104 isoform X31.2e-19452.05Show/hide
Query:  LREFLYAHTKRANWGAVIQKYAQHPEAHGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGHNLERLEIKNDRQNNPLHLAAAMGSVRMCHAIASAH
        LR+FLY +TKR  W  VI+KY ++PEA  LKLT  GDTALHLAV+ N+EE+V+ LV  I     ++ E L+  NDR+  PLHLAA MGS  MC+AIASAH
Subjt:  LREFLYAHTKRANWGAVIQKYAQHPEAHGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGHNLERLEIKNDRQNNPLHLAAAMGSVRMCHAIASAH

Query:  GGLLDDRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRLKNNGDTVLHCAIGNEYFDLAFQLMHLNVDSMNWANEHGRTPLHILASKPASFK
          L+D RN  D+TPL+LAA  G RDAF+CLY FC ++  RIT NCRL +NGDTVLH A+ N++FDLAFQ++HLN ++M+W  E G TPLH+LASKP +FK
Subjt:  GGLLDDRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRLKNNGDTVLHCAIGNEYFDLAFQLMHLNVDSMNWANEHGRTPLHILASKPASFK

Query:  SGSNIRGWRNLVYHCIFVDKLKPQSIETLHRDQEYWKRSWSERKTAI-FFPAIYETCIYFFTRLWDGLLK-LGWYQI---NINLSLPFSTLV--------
        SGS IRGWRN+ Y+C  VD+LKPQ I++L RD   W    S   T+   FPA Y TCI FFT +WDG LK  G  +I   N N      T V        
Subjt:  SGSNIRGWRNLVYHCIFVDKLKPQSIETLHRDQEYWKRSWSERKTAI-FFPAIYETCIYFFTRLWDGLLK-LGWYQI---NINLSLPFSTLV--------

Query:  NKKKSSDEANKDKDLEKGEKAERISNPPTITNSPENYSTCVDLFQIILSAIITVLGHGSTTIRKIREMKQKHSWSVQVMEKLIDPQFVTPDSH--GGGRP
         +   +DE+++  D E   KA  ++   +ITN P NY+TC+  FQI+ SAI+  LG GS   +KIR  K+KH+WSVQVMEKL+  ++  PD +   GG P
Subjt:  NKKKSSDEANKDKDLEKGEKAERISNPPTITNSPENYSTCVDLFQIILSAIITVLGHGSTTIRKIREMKQKHSWSVQVMEKLIDPQFVTPDSH--GGGRP

Query:  -DYEDSGMESAEA--PYTYTLEEDKFSTSIGVKSED--------VTESAMLLAAKNGVVEIVKELYDCDPRTIYETRKSDEKNVVHLAAEYRQWNLYKFL
         D      + AE   PY+    E + S SI  K ++          E+AMLLAAKNGV+EIVK ++   P +I + RK D+KNVV LAAEY Q ++Y+FL
Subjt:  -DYEDSGMESAEA--PYTYTLEEDKFSTSIGVKSED--------VTESAMLLAAKNGVVEIVKELYDCDPRTIYETRKSDEKNVVHLAAEYRQWNLYKFL

Query:  LDRKYSLQPLFRAVDLNGNSVLHLAASRSAPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNKIGETASRIFEDTHMDLEQKGKDWLCVTSQSCSV
        L  K   + LFRAVD NGNS LHLAA+ S   +W+ITGAALQ+Q E++WYKFV+ES+P++FF  YNK G+ A+ IF +THMDL +K  +WL  TS+SCSV
Subjt:  LDRKYSLQPLFRAVDLNGNSVLHLAASRSAPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNKIGETASRIFEDTHMDLEQKGKDWLCVTSQSCSV

Query:  VATLIATVAFATAATIPGGNNDH-GLATLKQEQAFSIFVISSLIALCLSTTSVIMFLAILTARFEIREYFTSMLPWKLLVALSSLYFSIIAMLISFCSGH
        V TLI TVAF + A+IPGG N H G   LK  +AF  F + SLIALCLS+TSV MFLAILT RF+  ++ T+ LPWKL +  SSL+ SII+ML+SFC+GH
Subjt:  VATLIATVAFATAATIPGGNNDH-GLATLKQEQAFSIFVISSLIALCLSTTSVIMFLAILTARFEIREYFTSMLPWKLLVALSSLYFSIIAMLISFCSGH

Query:  YFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMPT-SVEVDLCD
        YFL+ + + + A +LYT+  +PVA +F I +LPLY D+++ + KI+P  S  V L D
Subjt:  YFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMPT-SVEVDLCD

A0A6J1K2F1 uncharacterized protein LOC111491104 isoform X21.8e-19552.32Show/hide
Query:  LREFLYAHTKRANWGAVIQKYAQHPEAHGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGHNLERLEIKNDRQNNPLHLAAAMGSVRMCHAIASAH
        LR+FLY +TKR  W  VI+KY ++PEA  LKLT  GDTALHLAV+ N+EE+V+ LV  I     ++ E L+  NDR+  PLHLAA MGS  MC+AIASAH
Subjt:  LREFLYAHTKRANWGAVIQKYAQHPEAHGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGHNLERLEIKNDRQNNPLHLAAAMGSVRMCHAIASAH

Query:  GGLLDDRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRLKNNGDTVLHCAIGNEYFDLAFQLMHLNVDSMNWANEHGRTPLHILASKPASFK
          L+D RN  D+TPL+LAA  G RDAF+CLY FC ++  RIT NCRL +NGDTVLH A+ N++FDLAFQ++HLN ++M+W  E G TPLH+LASKP +FK
Subjt:  GGLLDDRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRLKNNGDTVLHCAIGNEYFDLAFQLMHLNVDSMNWANEHGRTPLHILASKPASFK

Query:  SGSNIRGWRNLVYHCIFVDKLKPQSIETLHRDQEYWKRSWSERKTAI-FFPAIYETCIYFFTRLWDGLLK-LGWYQI---NINLSLPFSTLV--------
        SGS IRGWRN+ Y+C  VD+LKPQ I++L RD   W    S   T+   FPA Y TCI FFT +WDG LK  G  +I   N N      T V        
Subjt:  SGSNIRGWRNLVYHCIFVDKLKPQSIETLHRDQEYWKRSWSERKTAI-FFPAIYETCIYFFTRLWDGLLK-LGWYQI---NINLSLPFSTLV--------

Query:  NKKKSSDEANKDKDLEKGEKAERISNPPTITNSPENYSTCVDLFQIILSAIITVLGHGSTTIRKIREMKQKHSWSVQVMEKLIDPQFVTPDSH--GGGRP
         +   +DE+++  D E   KA  ++   +ITN P NY+TC+  FQI+ SAI+  LG GS   +KIR  K+KH+WSVQVMEKL+  ++  PD +   GG P
Subjt:  NKKKSSDEANKDKDLEKGEKAERISNPPTITNSPENYSTCVDLFQIILSAIITVLGHGSTTIRKIREMKQKHSWSVQVMEKLIDPQFVTPDSH--GGGRP

Query:  -DYEDSGMESAEA--PYTYTLEEDKFSTSI------GVKSEDVTESAMLLAAKNGVVEIVKELYDCDPRTIYETRKSDEKNVVHLAAEYRQWNLYKFLLD
         D      + AE   PY+    E + S SI        K ++V E+AMLLAAKNGV+EIVK ++   P +I + RK D+KNVV LAAEY Q ++Y+FLL 
Subjt:  -DYEDSGMESAEA--PYTYTLEEDKFSTSI------GVKSEDVTESAMLLAAKNGVVEIVKELYDCDPRTIYETRKSDEKNVVHLAAEYRQWNLYKFLLD

Query:  RKYSLQPLFRAVDLNGNSVLHLAASRSAPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNKIGETASRIFEDTHMDLEQKGKDWLCVTSQSCSVVA
         K   + LFRAVD NGNS LHLAA+ S   +W+ITGAALQ+Q E++WYKFV+ES+P++FF  YNK G+ A+ IF +THMDL +K  +WL  TS+SCSVV 
Subjt:  RKYSLQPLFRAVDLNGNSVLHLAASRSAPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNKIGETASRIFEDTHMDLEQKGKDWLCVTSQSCSVVA

Query:  TLIATVAFATAATIPGGNNDH-GLATLKQEQAFSIFVISSLIALCLSTTSVIMFLAILTARFEIREYFTSMLPWKLLVALSSLYFSIIAMLISFCSGHYF
        TLI TVAF + A+IPGG N H G   LK  +AF  F + SLIALCLS+TSV MFLAILT RF+  ++ T+ LPWKL +  SSL+ SII+ML+SFC+GHYF
Subjt:  TLIATVAFATAATIPGGNNDH-GLATLKQEQAFSIFVISSLIALCLSTTSVIMFLAILTARFEIREYFTSMLPWKLLVALSSLYFSIIAMLISFCSGHYF

Query:  LLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMPT-SVEVDLCD
        L+ + + + A +LYT+  +PVA +F I +LPLY D+++ + KI+P  S  V L D
Subjt:  LLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMPT-SVEVDLCD

A0A6J1K4G3 uncharacterized protein LOC111491104 isoform X11.2e-19452.05Show/hide
Query:  LREFLYAHTKRANWGAVIQKYAQHPEAHGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGHNLERLEIKNDRQNNPLHLAAAMGSVRMCHAIASAH
        LR+FLY +TKR  W  VI+KY ++PEA  LKLT  GDTALHLAV+ N+EE+V+ LV  I     ++ E L+  NDR+  PLHLAA MGS  MC+AIASAH
Subjt:  LREFLYAHTKRANWGAVIQKYAQHPEAHGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGHNLERLEIKNDRQNNPLHLAAAMGSVRMCHAIASAH

Query:  GGLLDDRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRLKNNGDTVLHCAIGNEYFDLAFQLMHLNVDSMNWANEHGRTPLHILASKPASFK
          L+D RN  D+TPL+LAA  G RDAF+CLY FC ++  RIT NCRL +NGDTVLH A+ N++FDLAFQ++HLN ++M+W  E G TPLH+LASKP +FK
Subjt:  GGLLDDRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRLKNNGDTVLHCAIGNEYFDLAFQLMHLNVDSMNWANEHGRTPLHILASKPASFK

Query:  SGSNIRGWRNLVYHCIFVDKLKPQSIETLHRDQEYWKRSWSERKTAI-FFPAIYETCIYFFTRLWDGLLK-LGWYQI---NINLSLPFSTLV--------
        SGS IRGWRN+ Y+C  VD+LKPQ I++L RD   W    S   T+   FPA Y TCI FFT +WDG LK  G  +I   N N      T V        
Subjt:  SGSNIRGWRNLVYHCIFVDKLKPQSIETLHRDQEYWKRSWSERKTAI-FFPAIYETCIYFFTRLWDGLLK-LGWYQI---NINLSLPFSTLV--------

Query:  NKKKSSDEANKDKDLEKGEKAERISNPPTITNSPENYSTCVDLFQIILSAIITVLGHGSTTIRKIREMKQKHSWSVQVMEKLIDPQFVTPDSH--GGGRP
         +   +DE+++  D E   KA  ++   +ITN P NY+TC+  FQI+ SAI+  LG GS   +KIR  K+KH+WSVQVMEKL+  ++  PD +   GG P
Subjt:  NKKKSSDEANKDKDLEKGEKAERISNPPTITNSPENYSTCVDLFQIILSAIITVLGHGSTTIRKIREMKQKHSWSVQVMEKLIDPQFVTPDSH--GGGRP

Query:  -DYEDSGMESAEA--PYTYTLEEDKFSTSIGVKSED--------VTESAMLLAAKNGVVEIVKELYDCDPRTIYETRKSDEKNVVHLAAEYRQWNLYKFL
         D      + AE   PY+    E + S SI  K ++          E+AMLLAAKNGV+EIVK ++   P +I + RK D+KNVV LAAEY Q ++Y+FL
Subjt:  -DYEDSGMESAEA--PYTYTLEEDKFSTSIGVKSED--------VTESAMLLAAKNGVVEIVKELYDCDPRTIYETRKSDEKNVVHLAAEYRQWNLYKFL

Query:  LDRKYSLQPLFRAVDLNGNSVLHLAASRSAPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNKIGETASRIFEDTHMDLEQKGKDWLCVTSQSCSV
        L  K   + LFRAVD NGNS LHLAA+ S   +W+ITGAALQ+Q E++WYKFV+ES+P++FF  YNK G+ A+ IF +THMDL +K  +WL  TS+SCSV
Subjt:  LDRKYSLQPLFRAVDLNGNSVLHLAASRSAPKLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNKIGETASRIFEDTHMDLEQKGKDWLCVTSQSCSV

Query:  VATLIATVAFATAATIPGGNNDH-GLATLKQEQAFSIFVISSLIALCLSTTSVIMFLAILTARFEIREYFTSMLPWKLLVALSSLYFSIIAMLISFCSGH
        V TLI TVAF + A+IPGG N H G   LK  +AF  F + SLIALCLS+TSV MFLAILT RF+  ++ T+ LPWKL +  SSL+ SII+ML+SFC+GH
Subjt:  VATLIATVAFATAATIPGGNNDH-GLATLKQEQAFSIFVISSLIALCLSTTSVIMFLAILTARFEIREYFTSMLPWKLLVALSSLYFSIIAMLISFCSGH

Query:  YFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMPT-SVEVDLCD
        YFL+ + + + A +LYT+  +PVA +F I +LPLY D+++ + KI+P  S  V L D
Subjt:  YFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMPT-SVEVDLCD

SwissProt top hitse value%identityAlignment
A2CIR5 Ankyrin repeat-containing protein NPR42.7e-1024.55Show/hide
Query:  SEDVTESAMLLAAKNGVVEIVKELYDCDPRTIYETRKSDEK--NVVHLAAEYRQWNLYKFLLDRKYSLQPLFRAVDLNGNSVLHLAASRSAPKLWQITGA
        ++D  ++++  AA+ G VEIVK L + DP+     R++D+K    +H+A +    ++ + L+D   ++  L    D NGN+ LH+A  +   K  +I   
Subjt:  SEDVTESAMLLAAKNGVVEIVKELYDCDPRTIYETRKSDEK--NVVHLAAEYRQWNLYKFLLDRKYSLQPLFRAVDLNGNSVLHLAASRSAPKLWQITGA

Query:  ALQ--------IQLEVEWYKFVKESLPI--------------------HFFPLYNKIGETASRIFEDTHMDLEQKGK-----------------DWLCVT
         L+        +  + +    + E+LP+                          +++ +T + I +D H  LEQ  K                 + +   
Subjt:  ALQ--------IQLEVEWYKFVKESLPI--------------------HFFPLYNKIGETASRIFEDTHMDLEQKGK-----------------DWLCVT

Query:  SQSCSVVATLIATVAFATAATIPGGNNDHGLATLKQEQAFSIFVISSLIALCLSTTSVIMFLAILTARFEIREYFTSMLPWKLLVALSSL-YFSIIAMLI
        + S +VVA L ATVAFA   T+PGGN ++G+A + Q  +F IF I + IAL  S   V++ + ++    +           K++  ++ L + + +   I
Subjt:  SQSCSVVATLIATVAFATAATIPGGNNDHGLATLKQEQAFSIFVISSLIALCLSTTSVIMFLAILTARFEIREYFTSMLPWKLLVALSSL-YFSIIAMLI

Query:  SFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFG
        SF +  Y +L R  Q  AL++  +  + +A + G
Subjt:  SFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFG

Q08353 NF-kappa-B inhibitor alpha3.0e-0928.92Show/hide
Query:  TALHLAVVANQEEIVETLVRLICENPGHNLERLEIKNDRQNNPLHLAAAMGSVRMCHAIASAHG-----GLLDDRNIDDQTPLFLAALYGKRDAFYCLYS
        T LHLAV+ NQ EI E L+   C+         E+++ R N PLHLA   G +     +    G      +L   N +  T L LA+++G       L S
Subjt:  TALHLAVVANQEEIVETLVRLICENPGHNLERLEIKNDRQNNPLHLAAAMGSVRMCHAIASAHG-----GLLDDRNIDDQTPLFLAALYGKRDAFYCLYS

Query:  FCGDDPVRITINCRLKNNGDTVLHCAIGNEYFDLAFQLMHLNVDSMNWANEHGRTPLHILASKPAS
           D      +N +   NG T LH A+  +  DL   L+    D +N     G +P  +   +P++
Subjt:  FCGDDPVRITINCRLKNNGDTVLHCAIGNEYFDLAFQLMHLNVDSMNWANEHGRTPLHILASKPAS

Q25338 Delta-latroinsectotoxin-Lt1a4.3e-0825.6Show/hide
Query:  NPTSNNNNNELREFLYAHTKRANWGAVIQKYAQHPEAHGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGHNLERLEIKNDRQNNPLHLAAAMGSV
        +P    N  E   F   HT   +   V+     HP+      +  G T  HLA++   +E+ E+LV    +        L I++     P+H AA+MGS+
Subjt:  NPTSNNNNNELREFLYAHTKRANWGAVIQKYAQHPEAHGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGHNLERLEIKNDRQNNPLHLAAAMGSV

Query:  RMCHAIASAHGGLLDDRNIDDQ--TPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRLKNNGDTVLHCAIGNEYFDLAFQLMHLNVDSMNWANEHGRTP
        +M   + S    +  +   ++   TPL  A  + K DA   L      D + +TI   + +   TVLH A+     ++  +L+    +      E G T 
Subjt:  RMCHAIASAHGGLLDDRNIDDQ--TPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRLKNNGDTVLHCAIGNEYFDLAFQLMHLNVDSMNWANEHGRTP

Query:  LHILASK
        LHI A +
Subjt:  LHILASK

Q91974 NF-kappa-B inhibitor alpha1.9e-0828.57Show/hide
Query:  PEAHGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGHNLERLEIKNDRQNNPLHLAAAMGSVRMCHAIASAHGGLLDDRNIDDQTPLFLAALYGKR
        P A   +LT +GDT LHLA++  ++ +   ++R    +       L  +N+    PLHLA       +   +  A G  LD R+    TPL +A   G  
Subjt:  PEAHGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGHNLERLEIKNDRQNNPLHLAAAMGSVRMCHAIASAHGGLLDDRNIDDQTPLFLAALYGKR

Query:  DAFYCLYSFCGDDPVRITINCRLKN-NGDTVLHCAIGNEYFDLAFQLMHLNVDSMNWANEHGRTPLHI
         +   L   C   P  +    +  N NG T LH A    Y  +   L+ L  D       +GRT LH+
Subjt:  DAFYCLYSFCGDDPVRITINCRLKN-NGDTVLHCAIGNEYFDLAFQLMHLNVDSMNWANEHGRTPLHI

Q9C7A2 Ankyrin repeat-containing protein ITN14.3e-0825.08Show/hide
Query:  SEDVTESAMLLAAKNGVVEIVKELYDCDPRTIYETRKSDEK--NVVHLAAEYRQWNLYKFLLDRKYS--LQPLFRAVDLNGNSVLHLAASRSAPKLWQI-
        S    ++A+ LAA+ G VE++K L   DP+     R+ D+K    +H+A + +   + K LLD   +  +QP     D + N+ LH+A  +   ++ ++ 
Subjt:  SEDVTESAMLLAAKNGVVEIVKELYDCDPRTIYETRKSDEK--NVVHLAAEYRQWNLYKFLLDRKYS--LQPLFRAVDLNGNSVLHLAASRSAPKLWQI-

Query:  -----TGA---------ALQIQLEV---EWYKFVKESLPIHFFPLYNKIGE-------TASRIFEDTHMDLEQK-----------------GKDWLCVTS
             T A         AL I   +   E   ++KE L        N++ +       T ++I  D H+ LEQ                   ++ +   +
Subjt:  -----TGA---------ALQIQLEV---EWYKFVKESLPIHFFPLYNKIGE-------TASRIFEDTHMDLEQK-----------------GKDWLCVTS

Query:  QSCSVVATLIATVAFATAATIPGGNNDHGLATLKQEQAFSIFVISSLIALCLSTTSVIMFLAILTARFEIREYFTSMLPWKLLVALSSLYFSIIAMLISF
         S +VVA L ATVAFA   T+PGG+N+ G A +    +F IF I + +AL  S   V++ + ++    +  +    ++    L+ L+S+  S     ++F
Subjt:  QSCSVVATLIATVAFATAATIPGGNNDHGLATLKQEQAFSIFVISSLIALCLSTTSVIMFLAILTARFEIREYFTSMLPWKLLVALSSLYFSIIAMLISF

Query:  CSGHYFLLTRR
         +  Y ++ R+
Subjt:  CSGHYFLLTRR

Arabidopsis top hitse value%identityAlignment
AT3G18670.1 Ankyrin repeat family protein5.7e-3232.76Show/hide
Query:  AMLLAAKNGVVEIVKELYDCDPRTIYETRKSDEKNVVHLAAEYRQWNLYKFLLDRKYSLQPLFRAVDLNGNSVLHLAASRS-APKLWQITGAALQIQLEV
        A+  A +NG+VE ++E+    P  ++ ++ S   N+   A   RQ  ++  + +       L    D+  N++LH AA R+ A +L  I GAALQ+Q E+
Subjt:  AMLLAAKNGVVEIVKELYDCDPRTIYETRKSDEKNVVHLAAEYRQWNLYKFLLDRKYSLQPLFRAVDLNGNSVLHLAASRS-APKLWQITGAALQIQLEV

Query:  EWYKFVKESL-PIHFFPLYNKIGETASRIFEDTHMDLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGGNNDHGLATLKQEQAFSIFVISSLIALC
        +W+K V++ + P H   +  K  +T   +F D H DL ++G+ W+  T+ SC+VVA LI T+ F++A T+PGG    G+     +  F IF+IS  I+L 
Subjt:  EWYKFVKESL-PIHFFPLYNKIGETASRIFEDTHMDLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGGNNDHGLATLKQEQAFSIFVISSLIALC

Query:  LSTTSVIMFLAILTARFEIREYFTSMLPWKLLVALSSLYFSIIAMLISFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLYFDLLR
         S  S++MFL IL +R+   E F   LP KL+V L +L+ S+  M+++F      L+  ++   +     L  +P+  MF ++Q P+  ++ R
Subjt:  LSTTSVIMFLAILTARFEIREYFTSMLPWKLLVALSSLYFSIIAMLISFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLYFDLLR

AT3G18670.1 Ankyrin repeat family protein1.3e-1230.6Show/hide
Query:  QHPEAHGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGHNLER-LEIKNDRQNNPLHLAAAMGSVRMCHAIASAHGGLLDDRNIDDQTPLFLAALY
        ++PEA    LT  GDT +H AV++   +IVE ++R I     H+ E+ L+IKND     L  AA  G VR+   + +   GL+  RN  +  P+ +A+LY
Subjt:  QHPEAHGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGHNLER-LEIKNDRQNNPLHLAAAMGSVRMCHAIASAHGGLLDDRNIDDQTPLFLAALY

Query:  GKRDAFYCLYS---FCGDDPVRITINCRLKNNGDTVLHCAIGNEYFDLAFQLMHLNVDSMNWANEHGRTPLHILASKPASFKS
        G +     LYS       DP   +   + KN    V +C +   Y  +A  L+          +    T +  LA  P +F S
Subjt:  GKRDAFYCLYS---FCGDDPVRITINCRLKNNGDTVLHCAIGNEYFDLAFQLMHLNVDSMNWANEHGRTPLHILASKPASFKS

AT5G04680.1 Ankyrin repeat family protein9.2e-3034.56Show/hide
Query:  LEEDKFSTSIGVKSEDVTESAMLLAAKNGVVEIVKELYDCDPRTIYETRKSDEKNVVHLAAEYRQ---WNLYKFLLDRKYSLQPLFRAVDLNGNSVLHLA
        + E+  +  +  +SE V E A+L A + G V+ + E+   +   ++ TR S    +  LA E+RQ   +NL   L DRKY    L    D +GN VLHLA
Subjt:  LEEDKFSTSIGVKSEDVTESAMLLAAKNGVVEIVKELYDCDPRTIYETRKSDEKNVVHLAAEYRQ---WNLYKFLLDRKYSLQPLFRAVDLNGNSVLHLA

Query:  ASRSAP-KLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNKIGETASRIFEDTHMDLEQKGKDWLCVTSQSCSVVATLIA---------TVAFATAAT
           S P KL  +  A L++Q E++W+K V+   P       N   +T   IF   H  L Q+ + W+  T+ SCS+VA LI          TV FA   T
Subjt:  ASRSAP-KLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNKIGETASRIFEDTHMDLEQKGKDWLCVTSQSCSVVATLIA---------TVAFATAAT

Query:  IPGGNNDH--GLATLKQEQAFSIFVISSLIALCLSTTSVIMFLAILTARFEIREYFTSMLPWKLLVALSSLYFSIIAMLISFCSGHYFLLTRRLQNQALV
        + GG++D+  G      EQ F IF++S LI+   + T+V +FL ILTAR+   ++  + LP K++  LS L+ SI AMLI+F      L+T   + + +V
Subjt:  IPGGNNDH--GLATLKQEQAFSIFVISSLIALCLSTTSVIMFLAILTARFEIREYFTSMLPWKLLVALSSLYFSIIAMLISFCSGHYFLLTRRLQNQALV

Query:  LYTL--TFLPVAFMFGIVQLPLYFDLL
          T+    LP A +F ++Q PL  +++
Subjt:  LYTL--TFLPVAFMFGIVQLPLYFDLL

AT5G04700.1 Ankyrin repeat family protein7.8e-3736.16Show/hide
Query:  LEEDKFSTSIGVKSEDVTESAMLLAAKNGVVEIVKELYDCDPRTIYETRKSDEKNVVHLAAEYRQWNLYKFLL---DRKYSLQPLFRAVDLNGNSVLHLA
        + E+  +  +  +SE V E A+L A + G V+ + E+   +   ++ TR S    +  LA E+RQ  ++  L    DRKY    L    D +GN VLHLA
Subjt:  LEEDKFSTSIGVKSEDVTESAMLLAAKNGVVEIVKELYDCDPRTIYETRKSDEKNVVHLAAEYRQWNLYKFLL---DRKYSLQPLFRAVDLNGNSVLHLA

Query:  ASRSAP-KLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNKIGETASRIFEDTHMDLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGGNNDH-
           S P KL  + GA LQ+Q E++W+K V+   P       N   +T   IF   H  L Q+ + W+  T+ SCS+VA LI TV FA   T+PGG +D+ 
Subjt:  ASRSAP-KLWQITGAALQIQLEVEWYKFVKESLPIHFFPLYNKIGETASRIFEDTHMDLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGGNNDH-

Query:  -GLATLKQEQAFSIFVISSLIALCLSTTSVIMFLAILTARFEIREYFTSMLPWKLLVALSSLYFSIIAMLISFCSGHYFLLTRRLQNQALVLYTLTF--L
         G     +++ F IF++S LI+   S TSV++FL ILTAR+   + F   LP K++  LS L+ SI AMLI+F S  + ++ +  + + +V  T+ F  L
Subjt:  -GLATLKQEQAFSIFVISSLIALCLSTTSVIMFLAILTARFEIREYFTSMLPWKLLVALSSLYFSIIAMLISFCSGHYFLLTRRLQNQALVLYTLTF--L

Query:  PVAFMFGIVQLPLYFDLL
        P A +F ++Q PL  +++
Subjt:  PVAFMFGIVQLPLYFDLL

AT5G04730.1 Ankyrin-repeat containing protein1.7e-3131.17Show/hide
Query:  VKSEDVTESAMLLAAKNGVVEIVKELYDCDPRTIYETRKSDEKNVVHLAAEYRQ---WNLYKFLLDRKYSLQPLFRAVDLNGNSVLHLAASRSAP-KLWQ
        ++ ++    A+L AAK+G  +   E+  C+ + ++    +  +N+  LA E+++   +NL   L DRK +   L R+ D   N++LH+A   S P +L +
Subjt:  VKSEDVTESAMLLAAKNGVVEIVKELYDCDPRTIYETRKSDEKNVVHLAAEYRQ---WNLYKFLLDRKYSLQPLFRAVDLNGNSVLHLAASRSAP-KLWQ

Query:  ITGAALQIQLEVEWYKFVKESLPIHFFPLYNKIGETASRIFEDTHMDLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGG-NNDHGLATLKQEQAF
        I+GAAL++Q E +W+K V+  +        NK  +T  +IFE  H  L ++G++W+  T+ +CS VA LIATV F    T+PGG +   G   +  +  F
Subjt:  ITGAALQIQLEVEWYKFVKESLPIHFFPLYNKIGETASRIFEDTHMDLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGG-NNDHGLATLKQEQAF

Query:  SIFVISSLIALCLSTTSVIMFLAILTARFEIREYFTSMLPWKLLVALSSLYFSIIAMLISFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLY
          F+ +  +A   S  SV++FL+ILT+R+   ++  S LP K+++  S L+ SI +ML++F +     L+  ++++  ++Y L   P+A    ++ L L 
Subjt:  SIFVISSLIALCLSTTSVIMFLAILTARFEIREYFTSMLPWKLLVALSSLYFSIIAMLISFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLY

Query:  FDLLRVLL
        + LL+ ++
Subjt:  FDLLRVLL

AT5G35810.1 Ankyrin repeat family protein4.0e-3334.95Show/hide
Query:  SEDVTESAMLL--AAKNGVVEIVKELYDCDPRTIYETRKSDEKNVVHLAAEYRQWNLYK--FLLDRKYSLQPLFRAVDLNGNSVLHLAASRSAP-KLWQI
        S+ V  S MLL  AA++G +E++  L    P  I+ T     +++ H+AA  R   ++   + L     L  +++  + N N +LHL A    P +L  +
Subjt:  SEDVTESAMLL--AAKNGVVEIVKELYDCDPRTIYETRKSDEKNVVHLAAEYRQWNLYK--FLLDRKYSLQPLFRAVDLNGNSVLHLAASRSAP-KLWQI

Query:  TGAALQIQLEVEWYKFVKESLPIHFFPLYNKIGETASRIFEDTHMDLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGGNNDH------GLATLKQ
        +GAALQ+Q E+ WYK VKE +P  +    NK  E A  +F   H +L ++G+ W+  T+ +C +V+TLIATV FA A T+PGGN+        G  T ++
Subjt:  TGAALQIQLEVEWYKFVKESLPIHFFPLYNKIGETASRIFEDTHMDLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGGNNDH------GLATLKQ

Query:  EQAFSIFVISSLIALCLSTTSVIMFLAILTARFEIREYFTSMLPWKLLVALSSLYFSIIAMLISFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQ
        E  F +F+IS  +AL  S TS+++FL+ILT+R+     F + LP KL++ L +L+ SII+M+++F +    L+  +    +L+L        A  F ++ 
Subjt:  EQAFSIFVISSLIALCLSTTSVIMFLAILTARFEIREYFTSMLPWKLLVALSSLYFSIIAMLISFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQ

Query:  LPLYFDLLR
          L+FD LR
Subjt:  LPLYFDLLR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAATTAAAGACGAGGATAAGAGAGAAAATGAATCCAACGTCCAACAATAACAACAACGAATTACGAGAGTTTCTTTATGCACACACCAAGAGAGCAAACTGGGGTGC
AGTGATTCAAAAGTACGCCCAACACCCAGAAGCTCATGGGTTGAAGCTGACTCCAGAGGGCGACACGGCGTTGCATTTGGCTGTGGTTGCCAACCAAGAAGAAATAGTGG
AAACGCTTGTGAGACTCATTTGCGAAAACCCAGGACACAACCTGGAACGTCTTGAGATTAAGAACGACAGACAAAACAACCCTCTCCACCTTGCCGCAGCGATGGGAAGC
GTGAGAATGTGCCACGCCATTGCTTCAGCCCATGGGGGCTTACTGGACGACAGAAACATAGACGACCAAACCCCACTCTTCCTGGCGGCGTTGTACGGCAAAAGGGACGC
CTTTTATTGCCTTTACTCCTTCTGCGGAGACGACCCAGTTCGAATCACCATCAACTGCAGACTCAAGAACAATGGCGACACCGTGCTACATTGTGCCATCGGAAACGAGT
ACTTTGATTTGGCGTTTCAATTAATGCACTTGAACGTTGACAGTATGAATTGGGCGAATGAGCACGGCAGAACCCCTCTCCATATTCTAGCTAGCAAGCCAGCTTCCTTC
AAAAGTGGATCCAACATCAGAGGATGGCGGAACCTCGTCTATCACTGCATATTTGTGGATAAATTGAAGCCTCAATCCATTGAAACCCTTCACCGTGACCAAGAGTACTG
GAAGAGAAGTTGGTCCGAAAGAAAGACAGCTATATTTTTTCCAGCCATCTACGAGACATGCATTTACTTCTTTACGAGGTTGTGGGATGGATTGTTAAAATTAGGTTGGT
ACCAAATAAATATAAATTTATCCTTACCTTTCAGCACTTTGGTGAACAAGAAGAAAAGTAGTGATGAAGCCAACAAGGATAAAGATTTGGAGAAAGGGGAGAAAGCTGAG
AGAATATCAAATCCACCGACGATTACAAATTCCCCGGAAAACTATTCTACCTGCGTCGACCTTTTTCAAATAATTTTATCGGCCATCATAACCGTTCTTGGGCACGGATC
TACGACAATCAGAAAGATTCGAGAGATGAAACAGAAACACAGTTGGTCAGTTCAAGTGATGGAGAAACTTATTGATCCACAATTTGTTACACCGGACAGTCATGGCGGCG
GTCGGCCCGATTATGAAGATTCTGGGATGGAATCAGCAGAAGCTCCATACACGTACACGTTGGAAGAAGACAAGTTCAGTACAAGTATTGGTGTGAAGTCGGAGGATGTC
ACAGAGTCGGCGATGTTATTGGCAGCGAAGAACGGCGTGGTTGAGATAGTGAAGGAACTTTACGACTGTGATCCAAGGACAATCTACGAGACTAGGAAATCAGATGAGAA
GAATGTAGTGCATTTGGCTGCAGAGTACAGGCAGTGGAACCTCTACAAGTTTTTACTTGACAGAAAATATTCCTTACAGCCCCTGTTTCGAGCCGTGGATCTTAACGGCA
ACAGCGTCTTACATCTGGCGGCTTCCAGATCAGCTCCTAAGCTTTGGCAAATCACTGGAGCTGCACTCCAGATCCAATTGGAAGTTGAGTGGTATAAGTTCGTGAAGGAA
TCTTTGCCAATCCATTTTTTCCCTCTTTATAATAAGATAGGAGAAACTGCAAGTAGAATCTTCGAAGATACCCACATGGATCTCGAGCAAAAGGGCAAGGATTGGCTTTG
CGTCACCTCACAGTCATGCTCTGTGGTGGCCACTCTGATTGCAACCGTGGCTTTTGCAACTGCAGCCACCATCCCAGGCGGCAACAACGACCACGGTCTCGCAACACTTA
AACAAGAGCAGGCCTTTTCTATCTTCGTCATCTCTTCCCTCATTGCTCTATGCCTCTCCACAACCTCAGTCATCATGTTTCTCGCCATCTTGACCGCCAGATTCGAAATC
AGAGAGTACTTCACATCAATGTTGCCCTGGAAACTGCTCGTGGCCTTATCCTCTCTTTACTTTTCCATCATCGCCATGTTGATTTCCTTTTGCAGCGGCCACTACTTTCT
CCTCACTCGCCGCCTCCAAAATCAAGCCCTTGTGCTCTACACACTTACTTTTCTCCCTGTCGCATTCATGTTCGGTATAGTGCAGCTTCCTCTCTACTTCGATCTCCTGC
GGGTTCTTCTCAAAATCATGCCTACCAGCGTCGAAGTCGACCTTTGCGATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAATTAAAGACGAGGATAAGAGAGAAAATGAATCCAACGTCCAACAATAACAACAACGAATTACGAGAGTTTCTTTATGCACACACCAAGAGAGCAAACTGGGGTGC
AGTGATTCAAAAGTACGCCCAACACCCAGAAGCTCATGGGTTGAAGCTGACTCCAGAGGGCGACACGGCGTTGCATTTGGCTGTGGTTGCCAACCAAGAAGAAATAGTGG
AAACGCTTGTGAGACTCATTTGCGAAAACCCAGGACACAACCTGGAACGTCTTGAGATTAAGAACGACAGACAAAACAACCCTCTCCACCTTGCCGCAGCGATGGGAAGC
GTGAGAATGTGCCACGCCATTGCTTCAGCCCATGGGGGCTTACTGGACGACAGAAACATAGACGACCAAACCCCACTCTTCCTGGCGGCGTTGTACGGCAAAAGGGACGC
CTTTTATTGCCTTTACTCCTTCTGCGGAGACGACCCAGTTCGAATCACCATCAACTGCAGACTCAAGAACAATGGCGACACCGTGCTACATTGTGCCATCGGAAACGAGT
ACTTTGATTTGGCGTTTCAATTAATGCACTTGAACGTTGACAGTATGAATTGGGCGAATGAGCACGGCAGAACCCCTCTCCATATTCTAGCTAGCAAGCCAGCTTCCTTC
AAAAGTGGATCCAACATCAGAGGATGGCGGAACCTCGTCTATCACTGCATATTTGTGGATAAATTGAAGCCTCAATCCATTGAAACCCTTCACCGTGACCAAGAGTACTG
GAAGAGAAGTTGGTCCGAAAGAAAGACAGCTATATTTTTTCCAGCCATCTACGAGACATGCATTTACTTCTTTACGAGGTTGTGGGATGGATTGTTAAAATTAGGTTGGT
ACCAAATAAATATAAATTTATCCTTACCTTTCAGCACTTTGGTGAACAAGAAGAAAAGTAGTGATGAAGCCAACAAGGATAAAGATTTGGAGAAAGGGGAGAAAGCTGAG
AGAATATCAAATCCACCGACGATTACAAATTCCCCGGAAAACTATTCTACCTGCGTCGACCTTTTTCAAATAATTTTATCGGCCATCATAACCGTTCTTGGGCACGGATC
TACGACAATCAGAAAGATTCGAGAGATGAAACAGAAACACAGTTGGTCAGTTCAAGTGATGGAGAAACTTATTGATCCACAATTTGTTACACCGGACAGTCATGGCGGCG
GTCGGCCCGATTATGAAGATTCTGGGATGGAATCAGCAGAAGCTCCATACACGTACACGTTGGAAGAAGACAAGTTCAGTACAAGTATTGGTGTGAAGTCGGAGGATGTC
ACAGAGTCGGCGATGTTATTGGCAGCGAAGAACGGCGTGGTTGAGATAGTGAAGGAACTTTACGACTGTGATCCAAGGACAATCTACGAGACTAGGAAATCAGATGAGAA
GAATGTAGTGCATTTGGCTGCAGAGTACAGGCAGTGGAACCTCTACAAGTTTTTACTTGACAGAAAATATTCCTTACAGCCCCTGTTTCGAGCCGTGGATCTTAACGGCA
ACAGCGTCTTACATCTGGCGGCTTCCAGATCAGCTCCTAAGCTTTGGCAAATCACTGGAGCTGCACTCCAGATCCAATTGGAAGTTGAGTGGTATAAGTTCGTGAAGGAA
TCTTTGCCAATCCATTTTTTCCCTCTTTATAATAAGATAGGAGAAACTGCAAGTAGAATCTTCGAAGATACCCACATGGATCTCGAGCAAAAGGGCAAGGATTGGCTTTG
CGTCACCTCACAGTCATGCTCTGTGGTGGCCACTCTGATTGCAACCGTGGCTTTTGCAACTGCAGCCACCATCCCAGGCGGCAACAACGACCACGGTCTCGCAACACTTA
AACAAGAGCAGGCCTTTTCTATCTTCGTCATCTCTTCCCTCATTGCTCTATGCCTCTCCACAACCTCAGTCATCATGTTTCTCGCCATCTTGACCGCCAGATTCGAAATC
AGAGAGTACTTCACATCAATGTTGCCCTGGAAACTGCTCGTGGCCTTATCCTCTCTTTACTTTTCCATCATCGCCATGTTGATTTCCTTTTGCAGCGGCCACTACTTTCT
CCTCACTCGCCGCCTCCAAAATCAAGCCCTTGTGCTCTACACACTTACTTTTCTCCCTGTCGCATTCATGTTCGGTATAGTGCAGCTTCCTCTCTACTTCGATCTCCTGC
GGGTTCTTCTCAAAATCATGCCTACCAGCGTCGAAGTCGACCTTTGCGATTAG
Protein sequenceShow/hide protein sequence
MELKTRIREKMNPTSNNNNNELREFLYAHTKRANWGAVIQKYAQHPEAHGLKLTPEGDTALHLAVVANQEEIVETLVRLICENPGHNLERLEIKNDRQNNPLHLAAAMGS
VRMCHAIASAHGGLLDDRNIDDQTPLFLAALYGKRDAFYCLYSFCGDDPVRITINCRLKNNGDTVLHCAIGNEYFDLAFQLMHLNVDSMNWANEHGRTPLHILASKPASF
KSGSNIRGWRNLVYHCIFVDKLKPQSIETLHRDQEYWKRSWSERKTAIFFPAIYETCIYFFTRLWDGLLKLGWYQININLSLPFSTLVNKKKSSDEANKDKDLEKGEKAE
RISNPPTITNSPENYSTCVDLFQIILSAIITVLGHGSTTIRKIREMKQKHSWSVQVMEKLIDPQFVTPDSHGGGRPDYEDSGMESAEAPYTYTLEEDKFSTSIGVKSEDV
TESAMLLAAKNGVVEIVKELYDCDPRTIYETRKSDEKNVVHLAAEYRQWNLYKFLLDRKYSLQPLFRAVDLNGNSVLHLAASRSAPKLWQITGAALQIQLEVEWYKFVKE
SLPIHFFPLYNKIGETASRIFEDTHMDLEQKGKDWLCVTSQSCSVVATLIATVAFATAATIPGGNNDHGLATLKQEQAFSIFVISSLIALCLSTTSVIMFLAILTARFEI
REYFTSMLPWKLLVALSSLYFSIIAMLISFCSGHYFLLTRRLQNQALVLYTLTFLPVAFMFGIVQLPLYFDLLRVLLKIMPTSVEVDLCD