; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0002654 (gene) of Chayote v1 genome

Gene IDSed0002654
OrganismSechium edule (Chayote v1)
DescriptionC2H2-type domain-containing protein
Genome locationLG05:41491287..41494665
RNA-Seq ExpressionSed0002654
SyntenySed0002654
Gene Ontology termsNA
InterPro domainsIPR013087 - Zinc finger C2H2-type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7010370.1 hypothetical protein SDJN02_27163 [Cucurbita argyrosperma subsp. argyrosperma]5.2e-25583.02Show/hide
Query:  MEKMMELGFRKSASYNLREQAARTILRNVRSQGHAYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSSAKLTLIGPNPWPFDDGVLFFHK
        M + MELGF KSASY+LREQAARTILRNVRSQGH+YVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLS+AKLTL+GPNPWPFDDGVLFFHK
Subjt:  MEKMMELGFRKSASYNLREQAARTILRNVRSQGHAYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSSAKLTLIGPNPWPFDDGVLFFHK

Query:  PDGGANQIGVSSDNQERLLEYHNNDNNLAIVSYVGNSKGNGKGHGEFNGKRKKTGVCSFEKLNGGVGSCPKMNDGGDSFPLVIPGVLIKDEISDIRVREL
        P  G NQ+   +DNQERLLEYHNNDNNLAIVSYV +SKGNG GHGEFNG  +    CSFE L          +DGGD+ PLVIPGVLIKDEISDIRVREL
Subjt:  PDGGANQIGVSSDNQERLLEYHNNDNNLAIVSYVGNSKGNGKGHGEFNGKRKKTGVCSFEKLNGGVGSCPKMNDGGDSFPLVIPGVLIKDEISDIRVREL

Query:  GYGQIAARFTEKDGILSGVSRIWCEWLGKRNSGPENKLKVPRLDYAIVTFTYNVDLGRKGLFDDVKLLLSSNPGAEIENEEHSRVKRKKSFSDPGGVSES
        GYGQIAARFTEKDGI+ G+SRIWCEWLGK N+G ENK+KVP  DYAIVTFTYNVDLGRKGL DDVKLLLSS+ GAE E +E+SRVKRKK FSD   VS+S
Subjt:  GYGQIAARFTEKDGILSGVSRIWCEWLGKRNSGPENKLKVPRLDYAIVTFTYNVDLGRKGLFDDVKLLLSSNPGAEIENEEHSRVKRKKSFSDPGGVSES

Query:  SSHQYDSSGEDSPASNCVTSSLLLDRYDDRILNSAVMLNKAVRRELRRQQRLVAERMCDICQQRILTHKDVATLVNMKTGRLVCSSRNVNGVFHVFHTSC
         SHQYDSSGEDS ASNCV SSLLLDRYDDRILN+ VMLNK+V+REL+RQQRL +ERMCDICQQ+ILTHKDVATL+NMKTGRL CSSRNVNGVFHVFHTSC
Subjt:  SSHQYDSSGEDSPASNCVTSSLLLDRYDDRILNSAVMLNKAVRRELRRQQRLVAERMCDICQQRILTHKDVATLVNMKTGRLVCSSRNVNGVFHVFHTSC

Query:  LIHWILLCEYEIISAKSLGGTKVKRRYRRKKKTKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVEGDDLEKPSIPLSEIFKYKIKVSDARRAWMKSPE
        LIHWILLCEYE +S K+LGG+KV+RRYRRK KTKGNK SK+ ETRQIKTQIDSVFCPACQGTGIIV+GDDLEKP+IPLSEIFKYKIKVSDARRAWMKSPE
Subjt:  LIHWILLCEYEIISAKSLGGTKVKRRYRRKKKTKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVEGDDLEKPSIPLSEIFKYKIKVSDARRAWMKSPE

Query:  VLQNCSTGFHFPYQSEETLQESVKHLKLLRFYGAFV
        VLQNCSTGFHFP QSEETLQE++KHLKL+ FYGAFV
Subjt:  VLQNCSTGFHFPYQSEETLQESVKHLKLLRFYGAFV

XP_022140423.1 uncharacterized protein LOC111011105 isoform X1 [Momordica charantia]4.9e-25383.21Show/hide
Query:  MEKMMELGFRKSASYNLREQAARTILRNVRSQGHAYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSSAKLTLIGPNPWPFDDGVLFFHK
        M + MELGF KSASY+LREQAARTILRNVRSQGH YVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLS+AKLTL+GPNPWPFDDGVLFFHK
Subjt:  MEKMMELGFRKSASYNLREQAARTILRNVRSQGHAYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSSAKLTLIGPNPWPFDDGVLFFHK

Query:  PDGGANQIGVSSDNQERLLEYHNNDNNLAIVSYVGNSKGNGKGHGEFNGKRKKTGVCSFEKLNGGVGSCPKMNDGGDSFPLVIPGVLIKDEISDIRVREL
        P    NQIG+S+DN ERLLEYHNNDNNLAIV Y  NSKGNG  H + +G  +     SFE L          NDGGDS PLVIPGVLI+DEISDI+V EL
Subjt:  PDGGANQIGVSSDNQERLLEYHNNDNNLAIVSYVGNSKGNGKGHGEFNGKRKKTGVCSFEKLNGGVGSCPKMNDGGDSFPLVIPGVLIKDEISDIRVREL

Query:  GYGQIAARFTEKDGILSGVSRIWCEWLGKRNSGPENKLKVPRLDYAIVTFTYNVDLGRKGLFDDVKLLLSSNPGAEIENEEHSRVKRKKSFSDPGGVSES
        GYGQIAARFTEKDGIL+GV RIWCEWLGK N   E K+KVP  DYAIVTFTYNVDLGRKGL DDVKLLLSS+PGAE+EN+E++RVKRKKSFSDPG VSES
Subjt:  GYGQIAARFTEKDGILSGVSRIWCEWLGKRNSGPENKLKVPRLDYAIVTFTYNVDLGRKGLFDDVKLLLSSNPGAEIENEEHSRVKRKKSFSDPGGVSES

Query:  SSHQYDSSGEDSPASNCVTSSLLLDRYDDRILNSAVMLNKAVRRELRRQQRLVAERMCDICQQRILTHKDVATLVNMKTGRLVCSSRNVNGVFHVFHTSC
         SHQYDSSGEDS ASNCVTSSLLLDRYDD+IL++ + LNKAVRRELRRQQRLVAERMCDICQQ+ILTHKDVATLVNMKTGRL CSSRNVNGVFHVFHTSC
Subjt:  SSHQYDSSGEDSPASNCVTSSLLLDRYDDRILNSAVMLNKAVRRELRRQQRLVAERMCDICQQRILTHKDVATLVNMKTGRLVCSSRNVNGVFHVFHTSC

Query:  LIHWILLCEYEIISAKSLGGTKVKRRYRRKKKTKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVEGDDLEKPSIPLSEIFKYKIKVSDARRAWMKSPE
        LIHWILLCEYE IS K LGG K +RRYRRK +TKGNKCSKDSETRQIKTQIDS+FCPACQGTGI V+GDDLEKP+IPLSEIFKYKIKVSDARRAWMKSPE
Subjt:  LIHWILLCEYEIISAKSLGGTKVKRRYRRKKKTKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVEGDDLEKPSIPLSEIFKYKIKVSDARRAWMKSPE

Query:  VLQNCSTGFHFPYQSEETLQESVKHLKLLRFYGAFV
        VLQNCSTGFHFPYQSEET+QE+VK LKLL FYGAFV
Subjt:  VLQNCSTGFHFPYQSEETLQESVKHLKLLRFYGAFV

XP_022943372.1 uncharacterized protein LOC111448154 isoform X1 [Cucurbita moschata]3.1e-25583.02Show/hide
Query:  MEKMMELGFRKSASYNLREQAARTILRNVRSQGHAYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSSAKLTLIGPNPWPFDDGVLFFHK
        M + MELGF KSASY+LREQAARTILRNVRSQGH+YVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLS+AKLTL+GPNPWPFDDGVLFFHK
Subjt:  MEKMMELGFRKSASYNLREQAARTILRNVRSQGHAYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSSAKLTLIGPNPWPFDDGVLFFHK

Query:  PDGGANQIGVSSDNQERLLEYHNNDNNLAIVSYVGNSKGNGKGHGEFNGKRKKTGVCSFEKLNGGVGSCPKMNDGGDSFPLVIPGVLIKDEISDIRVREL
        P  G NQ+   +DNQERLLEYHNNDNNLAIVSYV +SKGNG GHGEFNG  +    CSFE L          +D GD+ PLVIPGVLIKDEISDIRVREL
Subjt:  PDGGANQIGVSSDNQERLLEYHNNDNNLAIVSYVGNSKGNGKGHGEFNGKRKKTGVCSFEKLNGGVGSCPKMNDGGDSFPLVIPGVLIKDEISDIRVREL

Query:  GYGQIAARFTEKDGILSGVSRIWCEWLGKRNSGPENKLKVPRLDYAIVTFTYNVDLGRKGLFDDVKLLLSSNPGAEIENEEHSRVKRKKSFSDPGGVSES
        GYGQIAARFTEKDGIL G+SRIWCEWLGK N+G ENK+KVP  D+AIVTFTYNVDLGRKGL DDVKLLLSS+ GAE E +E+SRVKRKK FSD   VS+S
Subjt:  GYGQIAARFTEKDGILSGVSRIWCEWLGKRNSGPENKLKVPRLDYAIVTFTYNVDLGRKGLFDDVKLLLSSNPGAEIENEEHSRVKRKKSFSDPGGVSES

Query:  SSHQYDSSGEDSPASNCVTSSLLLDRYDDRILNSAVMLNKAVRRELRRQQRLVAERMCDICQQRILTHKDVATLVNMKTGRLVCSSRNVNGVFHVFHTSC
         SHQYDSSGEDS ASNCV SSLLLDRYDDRILN+ VMLNK+V+REL+RQQRL +ERMCDICQQ+ILTHKDVATL+NMKTGRL CSSRNVNGVFHVFHTSC
Subjt:  SSHQYDSSGEDSPASNCVTSSLLLDRYDDRILNSAVMLNKAVRRELRRQQRLVAERMCDICQQRILTHKDVATLVNMKTGRLVCSSRNVNGVFHVFHTSC

Query:  LIHWILLCEYEIISAKSLGGTKVKRRYRRKKKTKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVEGDDLEKPSIPLSEIFKYKIKVSDARRAWMKSPE
        LIHWILLCEYE +S K+LGG+KV+RRYRRK KTKGNK SK+ ETRQIKTQIDSVFCPACQGTGIIV+GDDLEKP+IPLSEIFKYKIKVSDARRAWMKSPE
Subjt:  LIHWILLCEYEIISAKSLGGTKVKRRYRRKKKTKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVEGDDLEKPSIPLSEIFKYKIKVSDARRAWMKSPE

Query:  VLQNCSTGFHFPYQSEETLQESVKHLKLLRFYGAFV
        VLQNCSTGFHFPYQSEETLQE++KHLKL+ FYGAFV
Subjt:  VLQNCSTGFHFPYQSEETLQESVKHLKLLRFYGAFV

XP_023512516.1 uncharacterized protein LOC111777240 [Cucurbita pepo subsp. pepo]7.5e-25482.65Show/hide
Query:  MEKMMELGFRKSASYNLREQAARTILRNVRSQGHAYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSSAKLTLIGPNPWPFDDGVLFFHK
        M + MELGF KSASY+LREQAARTILRNVRSQGH+YVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLS+AKLTL+GPNPWPFDDGVLFFHK
Subjt:  MEKMMELGFRKSASYNLREQAARTILRNVRSQGHAYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSSAKLTLIGPNPWPFDDGVLFFHK

Query:  PDGGANQIGVSSDNQERLLEYHNNDNNLAIVSYVGNSKGNGKGHGEFNGKRKKTGVCSFEKLNGGVGSCPKMNDGGDSFPLVIPGVLIKDEISDIRVREL
        P  G NQ+   +DNQERLLEYHNNDNNLAIVSYV NSKGNG GHGEFNG  +    CSFE L          +DGGD+ PLVIPGVLIKDEISDI+VREL
Subjt:  PDGGANQIGVSSDNQERLLEYHNNDNNLAIVSYVGNSKGNGKGHGEFNGKRKKTGVCSFEKLNGGVGSCPKMNDGGDSFPLVIPGVLIKDEISDIRVREL

Query:  GYGQIAARFTEKDGILSGVSRIWCEWLGKRNSGPENKLKVPRLDYAIVTFTYNVDLGRKGLFDDVKLLLSSNPGAEIENEEHSRVKRKKSFSDPGGVSES
        GYG+IAARFTEKDGI+ GVSRIWCEWLGK N G ENK+KVP  D AIVTFTYNVDLGRKGL DDVKLLLSS+ GAE EN+++SRVKRKK FSD   VS+S
Subjt:  GYGQIAARFTEKDGILSGVSRIWCEWLGKRNSGPENKLKVPRLDYAIVTFTYNVDLGRKGLFDDVKLLLSSNPGAEIENEEHSRVKRKKSFSDPGGVSES

Query:  SSHQYDSSGEDSPASNCVTSSLLLDRYDDRILNSAVMLNKAVRRELRRQQRLVAERMCDICQQRILTHKDVATLVNMKTGRLVCSSRNVNGVFHVFHTSC
         SHQYDSSGEDS ASNCV SSLLLDRYDDRILN+ VMLNK+V+REL+RQQRL +ERMCDICQQ+ILTHKDVATL+N+KTGRL CSSRNVNGVFHVFHTSC
Subjt:  SSHQYDSSGEDSPASNCVTSSLLLDRYDDRILNSAVMLNKAVRRELRRQQRLVAERMCDICQQRILTHKDVATLVNMKTGRLVCSSRNVNGVFHVFHTSC

Query:  LIHWILLCEYEIISAKSLGGTKVKRRYRRKKKTKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVEGDDLEKPSIPLSEIFKYKIKVSDARRAWMKSPE
        LIHWILLCEYE +S K+LGG+KV+RRYRRK KTKGNK SK+ ETRQIKTQIDSVFCPACQGTGIIV+GD+LEKP+IPLSEIFKYKIKVSDARRAWMKSPE
Subjt:  LIHWILLCEYEIISAKSLGGTKVKRRYRRKKKTKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVEGDDLEKPSIPLSEIFKYKIKVSDARRAWMKSPE

Query:  VLQNCSTGFHFPYQSEETLQESVKHLKLLRFYGAFV
        VLQNCSTGFHFPYQSEETLQE++KHLKL+ FYGAFV
Subjt:  VLQNCSTGFHFPYQSEETLQESVKHLKLLRFYGAFV

XP_038901466.1 uncharacterized protein LOC120088321 [Benincasa hispida]1.4e-25282.46Show/hide
Query:  MEKMMELGFRKSASYNLREQAARTILRNVRSQGHAYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSSAKLTLIGPNPWPFDDGVLFFHK
        M + MELGF KSASY+LREQAARTILRNVRSQGH YVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLS+AKLTL+GPNPWPFDDGVLFFHK
Subjt:  MEKMMELGFRKSASYNLREQAARTILRNVRSQGHAYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSSAKLTLIGPNPWPFDDGVLFFHK

Query:  PDGGANQIGVSSDNQERLLEYHNNDNNLAIVSYVGNSKGNGKGHGEFNGKRKKTGVCSFEKLNGGVGSCPKMNDGGDSFPLVIPGVLIKDEISDIRVREL
        P  G NQ+G+S+DN ERLLEYHNNDNNLAIV YVGNSKGNG GH EFNG  +    CSFE           +NDGGD   LVIPGVLIK+EISDI+VREL
Subjt:  PDGGANQIGVSSDNQERLLEYHNNDNNLAIVSYVGNSKGNGKGHGEFNGKRKKTGVCSFEKLNGGVGSCPKMNDGGDSFPLVIPGVLIKDEISDIRVREL

Query:  GYGQIAARFTEKDGILSGVSRIWCEWLGKRNSGPENKLKVPRLDYAIVTFTYNVDLGRKGLFDDVKLLLSSNPGAEIENEEHSRVKRKKSFSDPGGVSES
        GYGQIAAR TEK+GI SGVSRIWCEWLGK N G +NK+KVP  +YAIVTFTYNVDLGRKGL DDVKLLLSS+PGAE +N E+ RVKRK SFSDP   S+S
Subjt:  GYGQIAARFTEKDGILSGVSRIWCEWLGKRNSGPENKLKVPRLDYAIVTFTYNVDLGRKGLFDDVKLLLSSNPGAEIENEEHSRVKRKKSFSDPGGVSES

Query:  SSHQYDSSGEDSPASNCVTSSLLLDRYDDRILNSAVMLNKAVRRELRRQQRLVAERMCDICQQRILTHKDVATLVNMKTGRLVCSSRNVNGVFHVFHTSC
         SHQYDSSGEDS ASN VTSSLLLD YDD+IL+  +MLNKAVRRELRRQQRL AERMCDICQQ+ILTHKDVATL+NMKTGRL CSSRNVNGVFHVFHTSC
Subjt:  SSHQYDSSGEDSPASNCVTSSLLLDRYDDRILNSAVMLNKAVRRELRRQQRLVAERMCDICQQRILTHKDVATLVNMKTGRLVCSSRNVNGVFHVFHTSC

Query:  LIHWILLCEYEIISAKSLGGTKVKRRYRRKKKTKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVEGDDLEKPSIPLSEIFKYKIKVSDARRAWMKSPE
        LIHWILLCEYE IS K LGG KV+RRYRRKKK KG+K SKD ETRQIKTQIDSVFCPACQGTGIIV+GDDLEKP+IPLSEIFKYKIKVSDARRAWMKSPE
Subjt:  LIHWILLCEYEIISAKSLGGTKVKRRYRRKKKTKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVEGDDLEKPSIPLSEIFKYKIKVSDARRAWMKSPE

Query:  VLQNCSTGFHFPYQSEETLQESVKHLKLLRFYGAFV
        VLQNCSTGFHFPYQ +ET+QE+VKHLKLL FYGAFV
Subjt:  VLQNCSTGFHFPYQSEETLQESVKHLKLLRFYGAFV

TrEMBL top hitse value%identityAlignment
A0A0A0KE98 C2H2-type domain-containing protein1.8e-24880.97Show/hide
Query:  MEKMMELGFRKSASYNLREQAARTILRNVRSQGHAYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSSAKLTLIGPNPWPFDDGVLFFHK
        M + MELGF KSASY+LREQAARTILRNVRSQGH YVELRE+GK+FIFFCTLCLAPCYSDSVLF+HLKGTLHTERLS+AKLTL+GPNPWPFDDGVLFFHK
Subjt:  MEKMMELGFRKSASYNLREQAARTILRNVRSQGHAYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSSAKLTLIGPNPWPFDDGVLFFHK

Query:  PDGGANQIGVSSDNQERLLEYHNNDNNLAIVSYVGNSKGNGKGHGEFNGKRKKTGVCSFEKLNGGVGSCPKMNDGGDSFPLVIPGVLIKDEISDIRVREL
        P  G NQ+G+S+DN ERLLEY+NNDNNLAIV YVGNSKGNG    EFNG  +    CSFE L          NDGG+S PLVIPGVLIK+EISDI+VREL
Subjt:  PDGGANQIGVSSDNQERLLEYHNNDNNLAIVSYVGNSKGNGKGHGEFNGKRKKTGVCSFEKLNGGVGSCPKMNDGGDSFPLVIPGVLIKDEISDIRVREL

Query:  GYGQIAARFTEKDGILSGVSRIWCEWLGKRNSGPENKLKVPRLDYAIVTFTYNVDLGRKGLFDDVKLLLSSNPGAEIENEEHSRVKRKKSFSDPGGVSES
        GYGQIAARFTEKDGI SGVSRIWCEWLGK N G EN +KVP  +YAI+TFTYNVDLGRKGL DDVKLLLSS+PGAE +N+E+ +VKRKKSFSDP   S S
Subjt:  GYGQIAARFTEKDGILSGVSRIWCEWLGKRNSGPENKLKVPRLDYAIVTFTYNVDLGRKGLFDDVKLLLSSNPGAEIENEEHSRVKRKKSFSDPGGVSES

Query:  SSHQYDSSGEDSPASNCVTSSLLLDRYDDRILNSAVMLNKAVRRELRRQQRLVAERMCDICQQRILTHKDVATLVNMKTGRLVCSSRNVNGVFHVFHTSC
         S QYDSSGEDS ASNCV SSL LD YDD+IL++ VMLNKAVRRELRRQQRL AERMCDICQQ+ILTHKDVATL+NMKTGRL CSSRNVNGVFHVFHTSC
Subjt:  SSHQYDSSGEDSPASNCVTSSLLLDRYDDRILNSAVMLNKAVRRELRRQQRLVAERMCDICQQRILTHKDVATLVNMKTGRLVCSSRNVNGVFHVFHTSC

Query:  LIHWILLCEYEIISAKSLGGTKVKRRYRRKKKTKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVEGDDLEKPSIPLSEIFKYKIKVSDARRAWMKSPE
        LIHWILLCEYE IS K LGG+KV+RRYRRKKKTKGNK  KD ETRQIKTQIDSVFCPACQGTGI ++GDDLEKP++PLSEIFKYKIKVSDARRAWMKSPE
Subjt:  LIHWILLCEYEIISAKSLGGTKVKRRYRRKKKTKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVEGDDLEKPSIPLSEIFKYKIKVSDARRAWMKSPE

Query:  VLQNCSTGFHFPYQSEETLQESVKHLKLLRFYGAFV
        VLQNCSTGF FPYQ +ET+QE+VK LKLL FYGAFV
Subjt:  VLQNCSTGFHFPYQSEETLQESVKHLKLLRFYGAFV

A0A1S3BJC6 uncharacterized protein LOC1034905233.5e-24980.97Show/hide
Query:  MEKMMELGFRKSASYNLREQAARTILRNVRSQGHAYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSSAKLTLIGPNPWPFDDGVLFFHK
        M + MELGF KSASY+LREQAARTILRNVRSQGH YVELRE+GK+FIFFCTLCLAPCYSDSVLFNHLKGTLHTERLS+AKLTL+GPNPWPFDDGVLFFHK
Subjt:  MEKMMELGFRKSASYNLREQAARTILRNVRSQGHAYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSSAKLTLIGPNPWPFDDGVLFFHK

Query:  PDGGANQIGVSSDNQERLLEYHNNDNNLAIVSYVGNSKGNGKGHGEFNGKRKKTGVCSFEKLNGGVGSCPKMNDGGDSFPLVIPGVLIKDEISDIRVREL
        P  G +Q+G+S+DN ERLLEY+NNDNNLAIV YVGNSKGNG G  EFNG  +    CSFE L          NDGG+S PLVIPGVLIK+EISDI+VR L
Subjt:  PDGGANQIGVSSDNQERLLEYHNNDNNLAIVSYVGNSKGNGKGHGEFNGKRKKTGVCSFEKLNGGVGSCPKMNDGGDSFPLVIPGVLIKDEISDIRVREL

Query:  GYGQIAARFTEKDGILSGVSRIWCEWLGKRNSGPENKLKVPRLDYAIVTFTYNVDLGRKGLFDDVKLLLSSNPGAEIENEEHSRVKRKKSFSDPGGVSES
        GYGQIAARFTEKDGI SGVSRIWCEWLGK N G ENK+KVP  +YAIVTFTYNVDLGRKGL DDVKLLLSS+PGAE +N+E+ +VKRKKSFSDP   S S
Subjt:  GYGQIAARFTEKDGILSGVSRIWCEWLGKRNSGPENKLKVPRLDYAIVTFTYNVDLGRKGLFDDVKLLLSSNPGAEIENEEHSRVKRKKSFSDPGGVSES

Query:  SSHQYDSSGEDSPASNCVTSSLLLDRYDDRILNSAVMLNKAVRRELRRQQRLVAERMCDICQQRILTHKDVATLVNMKTGRLVCSSRNVNGVFHVFHTSC
         S QYDSSGEDS ASNCV SSL LD YDD+IL++ VMLNKAVRRELRRQ RL AERMCDICQQ+ILTHKDVATL+NMKTGRL CSSRNVNGVFHVFHTSC
Subjt:  SSHQYDSSGEDSPASNCVTSSLLLDRYDDRILNSAVMLNKAVRRELRRQQRLVAERMCDICQQRILTHKDVATLVNMKTGRLVCSSRNVNGVFHVFHTSC

Query:  LIHWILLCEYEIISAKSLGGTKVKRRYRRKKKTKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVEGDDLEKPSIPLSEIFKYKIKVSDARRAWMKSPE
        LIHWILLCEYE IS K LGG+KV+RRYRRKKKTKGNK SKD ETRQ+K+QID VFCPACQGTG+I++GDDLEKP++PLSEIFKYKIKVSDARRAWMKSPE
Subjt:  LIHWILLCEYEIISAKSLGGTKVKRRYRRKKKTKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVEGDDLEKPSIPLSEIFKYKIKVSDARRAWMKSPE

Query:  VLQNCSTGFHFPYQSEETLQESVKHLKLLRFYGAFV
        VLQNCSTGFHFPYQ +ET+QE+VK LKLL FYGAFV
Subjt:  VLQNCSTGFHFPYQSEETLQESVKHLKLLRFYGAFV

A0A6J1CFP1 uncharacterized protein LOC111011105 isoform X12.4e-25383.21Show/hide
Query:  MEKMMELGFRKSASYNLREQAARTILRNVRSQGHAYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSSAKLTLIGPNPWPFDDGVLFFHK
        M + MELGF KSASY+LREQAARTILRNVRSQGH YVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLS+AKLTL+GPNPWPFDDGVLFFHK
Subjt:  MEKMMELGFRKSASYNLREQAARTILRNVRSQGHAYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSSAKLTLIGPNPWPFDDGVLFFHK

Query:  PDGGANQIGVSSDNQERLLEYHNNDNNLAIVSYVGNSKGNGKGHGEFNGKRKKTGVCSFEKLNGGVGSCPKMNDGGDSFPLVIPGVLIKDEISDIRVREL
        P    NQIG+S+DN ERLLEYHNNDNNLAIV Y  NSKGNG  H + +G  +     SFE L          NDGGDS PLVIPGVLI+DEISDI+V EL
Subjt:  PDGGANQIGVSSDNQERLLEYHNNDNNLAIVSYVGNSKGNGKGHGEFNGKRKKTGVCSFEKLNGGVGSCPKMNDGGDSFPLVIPGVLIKDEISDIRVREL

Query:  GYGQIAARFTEKDGILSGVSRIWCEWLGKRNSGPENKLKVPRLDYAIVTFTYNVDLGRKGLFDDVKLLLSSNPGAEIENEEHSRVKRKKSFSDPGGVSES
        GYGQIAARFTEKDGIL+GV RIWCEWLGK N   E K+KVP  DYAIVTFTYNVDLGRKGL DDVKLLLSS+PGAE+EN+E++RVKRKKSFSDPG VSES
Subjt:  GYGQIAARFTEKDGILSGVSRIWCEWLGKRNSGPENKLKVPRLDYAIVTFTYNVDLGRKGLFDDVKLLLSSNPGAEIENEEHSRVKRKKSFSDPGGVSES

Query:  SSHQYDSSGEDSPASNCVTSSLLLDRYDDRILNSAVMLNKAVRRELRRQQRLVAERMCDICQQRILTHKDVATLVNMKTGRLVCSSRNVNGVFHVFHTSC
         SHQYDSSGEDS ASNCVTSSLLLDRYDD+IL++ + LNKAVRRELRRQQRLVAERMCDICQQ+ILTHKDVATLVNMKTGRL CSSRNVNGVFHVFHTSC
Subjt:  SSHQYDSSGEDSPASNCVTSSLLLDRYDDRILNSAVMLNKAVRRELRRQQRLVAERMCDICQQRILTHKDVATLVNMKTGRLVCSSRNVNGVFHVFHTSC

Query:  LIHWILLCEYEIISAKSLGGTKVKRRYRRKKKTKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVEGDDLEKPSIPLSEIFKYKIKVSDARRAWMKSPE
        LIHWILLCEYE IS K LGG K +RRYRRK +TKGNKCSKDSETRQIKTQIDS+FCPACQGTGI V+GDDLEKP+IPLSEIFKYKIKVSDARRAWMKSPE
Subjt:  LIHWILLCEYEIISAKSLGGTKVKRRYRRKKKTKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVEGDDLEKPSIPLSEIFKYKIKVSDARRAWMKSPE

Query:  VLQNCSTGFHFPYQSEETLQESVKHLKLLRFYGAFV
        VLQNCSTGFHFPYQSEET+QE+VK LKLL FYGAFV
Subjt:  VLQNCSTGFHFPYQSEETLQESVKHLKLLRFYGAFV

A0A6J1FSV1 uncharacterized protein LOC111448154 isoform X11.5e-25583.02Show/hide
Query:  MEKMMELGFRKSASYNLREQAARTILRNVRSQGHAYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSSAKLTLIGPNPWPFDDGVLFFHK
        M + MELGF KSASY+LREQAARTILRNVRSQGH+YVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLS+AKLTL+GPNPWPFDDGVLFFHK
Subjt:  MEKMMELGFRKSASYNLREQAARTILRNVRSQGHAYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSSAKLTLIGPNPWPFDDGVLFFHK

Query:  PDGGANQIGVSSDNQERLLEYHNNDNNLAIVSYVGNSKGNGKGHGEFNGKRKKTGVCSFEKLNGGVGSCPKMNDGGDSFPLVIPGVLIKDEISDIRVREL
        P  G NQ+   +DNQERLLEYHNNDNNLAIVSYV +SKGNG GHGEFNG  +    CSFE L          +D GD+ PLVIPGVLIKDEISDIRVREL
Subjt:  PDGGANQIGVSSDNQERLLEYHNNDNNLAIVSYVGNSKGNGKGHGEFNGKRKKTGVCSFEKLNGGVGSCPKMNDGGDSFPLVIPGVLIKDEISDIRVREL

Query:  GYGQIAARFTEKDGILSGVSRIWCEWLGKRNSGPENKLKVPRLDYAIVTFTYNVDLGRKGLFDDVKLLLSSNPGAEIENEEHSRVKRKKSFSDPGGVSES
        GYGQIAARFTEKDGIL G+SRIWCEWLGK N+G ENK+KVP  D+AIVTFTYNVDLGRKGL DDVKLLLSS+ GAE E +E+SRVKRKK FSD   VS+S
Subjt:  GYGQIAARFTEKDGILSGVSRIWCEWLGKRNSGPENKLKVPRLDYAIVTFTYNVDLGRKGLFDDVKLLLSSNPGAEIENEEHSRVKRKKSFSDPGGVSES

Query:  SSHQYDSSGEDSPASNCVTSSLLLDRYDDRILNSAVMLNKAVRRELRRQQRLVAERMCDICQQRILTHKDVATLVNMKTGRLVCSSRNVNGVFHVFHTSC
         SHQYDSSGEDS ASNCV SSLLLDRYDDRILN+ VMLNK+V+REL+RQQRL +ERMCDICQQ+ILTHKDVATL+NMKTGRL CSSRNVNGVFHVFHTSC
Subjt:  SSHQYDSSGEDSPASNCVTSSLLLDRYDDRILNSAVMLNKAVRRELRRQQRLVAERMCDICQQRILTHKDVATLVNMKTGRLVCSSRNVNGVFHVFHTSC

Query:  LIHWILLCEYEIISAKSLGGTKVKRRYRRKKKTKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVEGDDLEKPSIPLSEIFKYKIKVSDARRAWMKSPE
        LIHWILLCEYE +S K+LGG+KV+RRYRRK KTKGNK SK+ ETRQIKTQIDSVFCPACQGTGIIV+GDDLEKP+IPLSEIFKYKIKVSDARRAWMKSPE
Subjt:  LIHWILLCEYEIISAKSLGGTKVKRRYRRKKKTKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVEGDDLEKPSIPLSEIFKYKIKVSDARRAWMKSPE

Query:  VLQNCSTGFHFPYQSEETLQESVKHLKLLRFYGAFV
        VLQNCSTGFHFPYQSEETLQE++KHLKL+ FYGAFV
Subjt:  VLQNCSTGFHFPYQSEETLQESVKHLKLLRFYGAFV

A0A6J1JEQ8 uncharacterized protein LOC111483838 isoform X17.6e-25281.9Show/hide
Query:  MEKMMELGFRKSASYNLREQAARTILRNVRSQGHAYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSSAKLTLIGPNPWPFDDGVLFFHK
        M + MELGF KSASY+LREQAARTILRNVRSQGH+YVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLS+AKLTL+GPNPWPFDDGVLFFHK
Subjt:  MEKMMELGFRKSASYNLREQAARTILRNVRSQGHAYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSSAKLTLIGPNPWPFDDGVLFFHK

Query:  PDGGANQIGVSSDNQERLLEYHNNDNNLAIVSYVGNSKGNGKGHGEFNGKRKKTGVCSFEKLNGGVGSCPKMNDGGDSFPLVIPGVLIKDEISDIRVREL
        P  G NQ+   +DN ERLLEYHNNDNNLAIVSYV NSKGNG GH EFNG  +    CSFE L          +DGGD+ PLVIPGVLIKDEISDI+V EL
Subjt:  PDGGANQIGVSSDNQERLLEYHNNDNNLAIVSYVGNSKGNGKGHGEFNGKRKKTGVCSFEKLNGGVGSCPKMNDGGDSFPLVIPGVLIKDEISDIRVREL

Query:  GYGQIAARFTEKDGILSGVSRIWCEWLGKRNSGPENKLKVPRLDYAIVTFTYNVDLGRKGLFDDVKLLLSSNPGAEIENEEHSRVKRKKSFSDPGGVSES
        GYGQIAARFTEKDGI+ GVSRIWCEWLGK N G ENK+KVP  D AIVTFTYNVDLGRKGL DDVKLLLSS+ GAE E +++SRVKRKK FSD   VS+S
Subjt:  GYGQIAARFTEKDGILSGVSRIWCEWLGKRNSGPENKLKVPRLDYAIVTFTYNVDLGRKGLFDDVKLLLSSNPGAEIENEEHSRVKRKKSFSDPGGVSES

Query:  SSHQYDSSGEDSPASNCVTSSLLLDRYDDRILNSAVMLNKAVRRELRRQQRLVAERMCDICQQRILTHKDVATLVNMKTGRLVCSSRNVNGVFHVFHTSC
         SHQYDSSGEDS ASNCV SSLLLDRYDDRILN+ VMLNK+V+REL++QQRL +ERMCDICQQ+ILTHKDVATL+NMKTGRL CSSRN NGVFHVFHTSC
Subjt:  SSHQYDSSGEDSPASNCVTSSLLLDRYDDRILNSAVMLNKAVRRELRRQQRLVAERMCDICQQRILTHKDVATLVNMKTGRLVCSSRNVNGVFHVFHTSC

Query:  LIHWILLCEYEIISAKSLGGTKVKRRYRRKKKTKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVEGDDLEKPSIPLSEIFKYKIKVSDARRAWMKSPE
        LIHWILLCEYE +S K+LGG+KV+RRYRRK KTKGNK SK+ ETRQIKTQIDSVFCPACQGTG+IV+GDDLEKP+IPLSEIFKYKIKVSDARRAWMKSPE
Subjt:  LIHWILLCEYEIISAKSLGGTKVKRRYRRKKKTKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVEGDDLEKPSIPLSEIFKYKIKVSDARRAWMKSPE

Query:  VLQNCSTGFHFPYQSEETLQESVKHLKLLRFYGAFV
        VLQNCSTGFHFPYQSEETLQE++KHLKL+ FYGAFV
Subjt:  VLQNCSTGFHFPYQSEETLQESVKHLKLLRFYGAFV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G28260.1 unknown protein1.7e-13147.13Show/hide
Query:  MEKMMELGFRKSASYNLREQAARTILRNVRSQGHAYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSSAKLTLIGPNPWPFDDGVLFFHK
        M +  ELG  K  S NL+EQ ART L+N+R QGH Y+ELREDGKRF+FFCTLCLAPCYSD++L  HL G LH ERL+ A++TL+G NPWPF DGVLFF  
Subjt:  MEKMMELGFRKSASYNLREQAARTILRNVRSQGHAYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSSAKLTLIGPNPWPFDDGVLFFHK

Query:  PDGGANQIGVSSDNQ--ERLLEYHNNDNNLAIVSYVGNSKGNGKGHGEFNGKRKKTGVCSFEKLNGGVGSCPKMNDGGDSFPLVIPGVLIKDEISDIRVR
          G   +    S  +     LE+ ++D   AIV Y  N+K NG                  + +   V      +   D   L+I GVLIK+   D+  +
Subjt:  PDGGANQIGVSSDNQ--ERLLEYHNNDNNLAIVSYVGNSKGNGKGHGEFNGKRKKTGVCSFEKLNGGVGSCPKMNDGGDSFPLVIPGVLIKDEISDIRVR

Query:  ELGYGQIAARFTEKDGILSGVSRIWCEWLGKRNSGPENKLKVPRLDYAIVTFTYNVDLGRKGLFDDVKLLLSSNPGAEIENEEHSRVKRKKSFSDPGGVS
         +G+G+IAAR  E  G  + + ++WCEWLG      E K  +P  D+AIVTF+Y  +LGR GL DD   LL+S+  +E  N E S  KRKKSFSDP   S
Subjt:  ELGYGQIAARFTEKDGILSGVSRIWCEWLGKRNSGPENKLKVPRLDYAIVTFTYNVDLGRKGLFDDVKLLLSSNPGAEIENEEHSRVKRKKSFSDPGGVS

Query:  ESSSHQYDSSGEDSPASNCVTSSLLLDRYDDRILNSAVMLNKAVRRELRRQQRLVAERMCDICQQRILTHKDVATLVNMKTGRLVCSSRNVNGVFHVFHT
        ES  +QYDSS E S   N  +S  L+  YDD +++  V+ N+ VRRELRRQQR+ +ER+C++C+Q++L  KD A ++NMKTG L C SRN+ G FH+FH 
Subjt:  ESSSHQYDSSGEDSPASNCVTSSLLLDRYDDRILNSAVMLNKAVRRELRRQQRLVAERMCDICQQRILTHKDVATLVNMKTGRLVCSSRNVNGVFHVFHT

Query:  SCLIHWILLCEYEIISAKSLGGTKVKRRYRRKKKT--KGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVEGDDLEKPSIPLSEIFKYKIKVSDARRAWM
        SC++HW L CE EI+  K + G   KR  +   +T  K N+ + D     +  QI SVFCP CQGTGI +EG  +E+ + PLS+ +++++KVS+ R+AW+
Subjt:  SCLIHWILLCEYEIISAKSLGGTKVKRRYRRKKKT--KGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVEGDDLEKPSIPLSEIFKYKIKVSDARRAWM

Query:  KSPEVLQNCSTGFHFPYQSEETLQ-----ESVKHLKLLRFY
        K+PE L+NCSTGFHFP Q+EET Q     E V+ +KL+RFY
Subjt:  KSPEVLQNCSTGFHFPYQSEETLQ-----ESVKHLKLLRFY

AT4G28260.2 unknown protein1.7e-13147.13Show/hide
Query:  MEKMMELGFRKSASYNLREQAARTILRNVRSQGHAYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSSAKLTLIGPNPWPFDDGVLFFHK
        M +  ELG  K  S NL+EQ ART L+N+R QGH Y+ELREDGKRF+FFCTLCLAPCYSD++L  HL G LH ERL+ A++TL+G NPWPF DGVLFF  
Subjt:  MEKMMELGFRKSASYNLREQAARTILRNVRSQGHAYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSSAKLTLIGPNPWPFDDGVLFFHK

Query:  PDGGANQIGVSSDNQ--ERLLEYHNNDNNLAIVSYVGNSKGNGKGHGEFNGKRKKTGVCSFEKLNGGVGSCPKMNDGGDSFPLVIPGVLIKDEISDIRVR
          G   +    S  +     LE+ ++D   AIV Y  N+K NG                  + +   V      +   D   L+I GVLIK+   D+  +
Subjt:  PDGGANQIGVSSDNQ--ERLLEYHNNDNNLAIVSYVGNSKGNGKGHGEFNGKRKKTGVCSFEKLNGGVGSCPKMNDGGDSFPLVIPGVLIKDEISDIRVR

Query:  ELGYGQIAARFTEKDGILSGVSRIWCEWLGKRNSGPENKLKVPRLDYAIVTFTYNVDLGRKGLFDDVKLLLSSNPGAEIENEEHSRVKRKKSFSDPGGVS
         +G+G+IAAR  E  G  + + ++WCEWLG      E K  +P  D+AIVTF+Y  +LGR GL DD   LL+S+  +E  N E S  KRKKSFSDP   S
Subjt:  ELGYGQIAARFTEKDGILSGVSRIWCEWLGKRNSGPENKLKVPRLDYAIVTFTYNVDLGRKGLFDDVKLLLSSNPGAEIENEEHSRVKRKKSFSDPGGVS

Query:  ESSSHQYDSSGEDSPASNCVTSSLLLDRYDDRILNSAVMLNKAVRRELRRQQRLVAERMCDICQQRILTHKDVATLVNMKTGRLVCSSRNVNGVFHVFHT
        ES  +QYDSS E S   N  +S  L+  YDD +++  V+ N+ VRRELRRQQR+ +ER+C++C+Q++L  KD A ++NMKTG L C SRN+ G FH+FH 
Subjt:  ESSSHQYDSSGEDSPASNCVTSSLLLDRYDDRILNSAVMLNKAVRRELRRQQRLVAERMCDICQQRILTHKDVATLVNMKTGRLVCSSRNVNGVFHVFHT

Query:  SCLIHWILLCEYEIISAKSLGGTKVKRRYRRKKKT--KGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVEGDDLEKPSIPLSEIFKYKIKVSDARRAWM
        SC++HW L CE EI+  K + G   KR  +   +T  K N+ + D     +  QI SVFCP CQGTGI +EG  +E+ + PLS+ +++++KVS+ R+AW+
Subjt:  SCLIHWILLCEYEIISAKSLGGTKVKRRYRRKKKT--KGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVEGDDLEKPSIPLSEIFKYKIKVSDARRAWM

Query:  KSPEVLQNCSTGFHFPYQSEETLQ-----ESVKHLKLLRFY
        K+PE L+NCSTGFHFP Q+EET Q     E V+ +KL+RFY
Subjt:  KSPEVLQNCSTGFHFPYQSEETLQ-----ESVKHLKLLRFY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAAAATGATGGAATTGGGGTTCCGGAAGTCTGCTTCGTATAACCTTCGAGAACAAGCTGCTAGAACTATTCTACGCAATGTAAGGTCTCAAGGGCATGCATATGT
TGAGTTGCGAGAAGATGGGAAAAGGTTTATTTTCTTCTGCACTCTGTGTCTTGCACCATGTTACAGTGATTCTGTGCTGTTTAACCATCTGAAGGGTACTCTTCACACGG
AAAGGTTATCTTCTGCGAAGCTGACTCTCATAGGACCGAATCCGTGGCCTTTTGATGATGGTGTTCTTTTCTTCCACAAGCCGGATGGAGGAGCTAACCAGATTGGGGTT
TCAAGTGACAATCAAGAAAGGTTGTTGGAGTATCACAACAATGATAACAATCTTGCCATTGTCAGCTATGTTGGAAATTCGAAAGGCAATGGCAAAGGACATGGTGAGTT
TAATGGAAAGAGAAAGAAAACGGGTGTTTGTTCGTTTGAGAAGTTGAATGGCGGTGTAGGCAGTTGTCCTAAGATGAATGACGGTGGAGACAGTTTCCCTTTGGTGATTC
CTGGTGTATTGATTAAGGATGAAATTTCTGATATAAGGGTGAGGGAGTTGGGTTATGGACAAATTGCAGCTAGGTTTACTGAGAAGGATGGAATCTTATCTGGTGTTAGC
AGAATATGGTGTGAGTGGTTGGGTAAAAGAAATAGTGGGCCTGAGAATAAGCTCAAAGTTCCTCGACTTGATTACGCTATTGTTACTTTCACTTATAATGTTGATTTAGG
TAGAAAGGGCCTTTTTGATGATGTCAAATTATTGCTCTCATCTAACCCCGGAGCAGAAATAGAGAACGAGGAGCACTCTAGAGTGAAAAGAAAGAAATCTTTCTCTGACC
CTGGGGGTGTTAGTGAGTCTTCGAGTCATCAATATGATTCATCAGGTGAAGATTCTCCAGCTTCAAATTGTGTCACTTCATCACTATTGTTGGATAGATATGATGATCGA
ATTTTGAATTCAGCAGTCATGTTGAATAAAGCAGTAAGGCGCGAGCTGAGAAGGCAACAGCGTTTAGTTGCCGAGCGGATGTGTGATATCTGTCAACAGAGGATACTGAC
TCATAAAGATGTAGCAACACTCGTAAACATGAAAACTGGAAGACTTGTCTGCAGTAGTCGCAATGTCAATGGGGTGTTTCATGTATTTCATACCTCCTGCCTTATACACT
GGATACTTCTTTGTGAGTATGAGATAATAAGTGCGAAAAGTCTAGGTGGTACAAAAGTTAAACGAAGGTACAGGAGAAAGAAGAAGACTAAGGGCAACAAATGCAGCAAG
GACAGTGAAACGAGACAAATAAAAACTCAAATTGATTCTGTTTTCTGCCCTGCATGTCAGGGAACTGGTATAATTGTTGAGGGGGATGACCTAGAGAAACCGAGTATTCC
TCTTTCTGAGATCTTCAAGTACAAAATAAAGGTGAGCGACGCCCGAAGAGCATGGATGAAAAGTCCCGAGGTTCTGCAGAATTGTTCGACAGGTTTCCATTTCCCTTACC
AATCTGAAGAAACTCTACAGGAAAGTGTAAAGCATCTTAAATTGCTGCGTTTTTATGGAGCTTTTGTATAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAAAAATGATGGAATTGGGGTTCCGGAAGTCTGCTTCGTATAACCTTCGAGAACAAGCTGCTAGAACTATTCTACGCAATGTAAGGTCTCAAGGGCATGCATATGT
TGAGTTGCGAGAAGATGGGAAAAGGTTTATTTTCTTCTGCACTCTGTGTCTTGCACCATGTTACAGTGATTCTGTGCTGTTTAACCATCTGAAGGGTACTCTTCACACGG
AAAGGTTATCTTCTGCGAAGCTGACTCTCATAGGACCGAATCCGTGGCCTTTTGATGATGGTGTTCTTTTCTTCCACAAGCCGGATGGAGGAGCTAACCAGATTGGGGTT
TCAAGTGACAATCAAGAAAGGTTGTTGGAGTATCACAACAATGATAACAATCTTGCCATTGTCAGCTATGTTGGAAATTCGAAAGGCAATGGCAAAGGACATGGTGAGTT
TAATGGAAAGAGAAAGAAAACGGGTGTTTGTTCGTTTGAGAAGTTGAATGGCGGTGTAGGCAGTTGTCCTAAGATGAATGACGGTGGAGACAGTTTCCCTTTGGTGATTC
CTGGTGTATTGATTAAGGATGAAATTTCTGATATAAGGGTGAGGGAGTTGGGTTATGGACAAATTGCAGCTAGGTTTACTGAGAAGGATGGAATCTTATCTGGTGTTAGC
AGAATATGGTGTGAGTGGTTGGGTAAAAGAAATAGTGGGCCTGAGAATAAGCTCAAAGTTCCTCGACTTGATTACGCTATTGTTACTTTCACTTATAATGTTGATTTAGG
TAGAAAGGGCCTTTTTGATGATGTCAAATTATTGCTCTCATCTAACCCCGGAGCAGAAATAGAGAACGAGGAGCACTCTAGAGTGAAAAGAAAGAAATCTTTCTCTGACC
CTGGGGGTGTTAGTGAGTCTTCGAGTCATCAATATGATTCATCAGGTGAAGATTCTCCAGCTTCAAATTGTGTCACTTCATCACTATTGTTGGATAGATATGATGATCGA
ATTTTGAATTCAGCAGTCATGTTGAATAAAGCAGTAAGGCGCGAGCTGAGAAGGCAACAGCGTTTAGTTGCCGAGCGGATGTGTGATATCTGTCAACAGAGGATACTGAC
TCATAAAGATGTAGCAACACTCGTAAACATGAAAACTGGAAGACTTGTCTGCAGTAGTCGCAATGTCAATGGGGTGTTTCATGTATTTCATACCTCCTGCCTTATACACT
GGATACTTCTTTGTGAGTATGAGATAATAAGTGCGAAAAGTCTAGGTGGTACAAAAGTTAAACGAAGGTACAGGAGAAAGAAGAAGACTAAGGGCAACAAATGCAGCAAG
GACAGTGAAACGAGACAAATAAAAACTCAAATTGATTCTGTTTTCTGCCCTGCATGTCAGGGAACTGGTATAATTGTTGAGGGGGATGACCTAGAGAAACCGAGTATTCC
TCTTTCTGAGATCTTCAAGTACAAAATAAAGGTGAGCGACGCCCGAAGAGCATGGATGAAAAGTCCCGAGGTTCTGCAGAATTGTTCGACAGGTTTCCATTTCCCTTACC
AATCTGAAGAAACTCTACAGGAAAGTGTAAAGCATCTTAAATTGCTGCGTTTTTATGGAGCTTTTGTATAG
Protein sequenceShow/hide protein sequence
MEKMMELGFRKSASYNLREQAARTILRNVRSQGHAYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSSAKLTLIGPNPWPFDDGVLFFHKPDGGANQIGV
SSDNQERLLEYHNNDNNLAIVSYVGNSKGNGKGHGEFNGKRKKTGVCSFEKLNGGVGSCPKMNDGGDSFPLVIPGVLIKDEISDIRVRELGYGQIAARFTEKDGILSGVS
RIWCEWLGKRNSGPENKLKVPRLDYAIVTFTYNVDLGRKGLFDDVKLLLSSNPGAEIENEEHSRVKRKKSFSDPGGVSESSSHQYDSSGEDSPASNCVTSSLLLDRYDDR
ILNSAVMLNKAVRRELRRQQRLVAERMCDICQQRILTHKDVATLVNMKTGRLVCSSRNVNGVFHVFHTSCLIHWILLCEYEIISAKSLGGTKVKRRYRRKKKTKGNKCSK
DSETRQIKTQIDSVFCPACQGTGIIVEGDDLEKPSIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGFHFPYQSEETLQESVKHLKLLRFYGAFV