CuGenDBv2

Lag0009646 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0009646
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: C2H2-type domain-containing protein
Genome location: chr9:41061229..41064359
RNA-Seq Expression: Lag0009646
Synteny: Lag0009646
Gene Ontology terms: NA
InterPro domains: IPR013087 - Zinc finger C2H2-type
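The genome location string can be split into a sequence name and coordinates. The following is a minimal sketch, not part of the database record: the names `GenomeLocation` and `parse_location` are hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'seqid:start..end' location string."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location such as 'chr9:41061229..41064359'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end = match.groups()
    return GenomeLocation(seqid, int(start), int(end))


loc = parse_location("chr9:41061229..41064359")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the gene spans 3131 bp on chr9.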


Homology
GenBank top hits (e value, % identity, alignment)
KAG7010370.1 hypothetical protein SDJN02_27163 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 2.4e-276, % identity: 89.56)
Query:  MARRLELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
        MARR+ELGFPKSASYSLREQAARTILRNVRSQGH+YVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
Subjt:  MARRLELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK

Query:  SVEGDNQIGISNDNHERLLEYHNNDNSLAIVRYVENSQGNGNSNGHAEFNGNIRDLEDCSFENLNDGADNCPLVIPGVLIKDEISDIKVRELGYGQIAAR
         VEGDNQ+   NDN ERLLEYHNNDN+LAIV YV++S+GNG  NGH EFNGN+R++EDCSFENL+DG DN PLVIPGVLIKDEISDI+VRELGYGQIAAR
Subjt:  SVEGDNQIGISNDNHERLLEYHNNDNSLAIVRYVENSQGNGNSNGHAEFNGNIRDLEDCSFENLNDGADNCPLVIPGVLIKDEISDIKVRELGYGQIAAR

Query:  FTEKDGILSGVSRIWCEWLGKVNIGLENKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDENSRVKRKKSFSDPEDVSESLGHHFDSS
        FTEKDGI+ G+SRIWCEWLGKVN GLENKVKVP HDYAIVTFTYNVDLGRKGLLDDVKLLLSSS GAE E DENSRVKRKK FSD EDVS+S+ H +DSS
Subjt:  FTEKDGILSGVSRIWCEWLGKVNIGLENKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDENSRVKRKKSFSDPEDVSESLGHHFDSS

Query:  GEDSSASNCVTSSLLLDRYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLMNMKTGRLACSSRNVNGVFHVFHTSCLVHWILLC
        GEDSSASNCV SSLLLDRYDDRIL+TTVMLNK+V+REL+RQQRL +ERMCDICQQKILTHKDVATL+NMKTGRLACSSRNVNGVFHVFHTSCL+HWILLC
Subjt:  GEDSSASNCVTSSLLLDRYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLMNMKTGRLACSSRNVNGVFHVFHTSCLVHWILLC

Query:  EYEMSVKDLGGSKVRRRYRRKNKTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
        EYEMSVK+LGGSKVRRRYRRKNKTKGNK+SK+ ETRQIKTQIDSVFCPACQGTGIIVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
Subjt:  EYEMSVKDLGGSKVRRRYRRKNKTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF

Query:  HFPYQSEEALQENVKHLKLLHFYGAFV
        HFP QSEE LQEN+KHLKL+HFYGAFV
Subjt:  HFPYQSEEALQENVKHLKLLHFYGAFV

XP_022943372.1 uncharacterized protein LOC111448154 isoform X1 [Cucurbita moschata] (e value: 1.4e-276, % identity: 89.56)
Query:  MARRLELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
        MARR+ELGFPKSASYSLREQAARTILRNVRSQGH+YVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
Subjt:  MARRLELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK

Query:  SVEGDNQIGISNDNHERLLEYHNNDNSLAIVRYVENSQGNGNSNGHAEFNGNIRDLEDCSFENLNDGADNCPLVIPGVLIKDEISDIKVRELGYGQIAAR
         VEGDNQ+   NDN ERLLEYHNNDN+LAIV YV++S+GNG  NGH EFNGN+R++EDCSFENL+D  DN PLVIPGVLIKDEISDI+VRELGYGQIAAR
Subjt:  SVEGDNQIGISNDNHERLLEYHNNDNSLAIVRYVENSQGNGNSNGHAEFNGNIRDLEDCSFENLNDGADNCPLVIPGVLIKDEISDIKVRELGYGQIAAR

Query:  FTEKDGILSGVSRIWCEWLGKVNIGLENKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDENSRVKRKKSFSDPEDVSESLGHHFDSS
        FTEKDGIL G+SRIWCEWLGKVN GLENKVKVP HD+AIVTFTYNVDLGRKGLLDDVKLLLSSS GAE E DENSRVKRKK FSD EDVS+S+ H +DSS
Subjt:  FTEKDGILSGVSRIWCEWLGKVNIGLENKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDENSRVKRKKSFSDPEDVSESLGHHFDSS

Query:  GEDSSASNCVTSSLLLDRYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLMNMKTGRLACSSRNVNGVFHVFHTSCLVHWILLC
        GEDSSASNCV SSLLLDRYDDRIL+TTVMLNK+V+REL+RQQRL +ERMCDICQQKILTHKDVATL+NMKTGRLACSSRNVNGVFHVFHTSCL+HWILLC
Subjt:  GEDSSASNCVTSSLLLDRYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLMNMKTGRLACSSRNVNGVFHVFHTSCLVHWILLC

Query:  EYEMSVKDLGGSKVRRRYRRKNKTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
        EYEMSVK+LGGSKVRRRYRRKNKTKGNK+SK+ ETRQIKTQIDSVFCPACQGTGIIVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
Subjt:  EYEMSVKDLGGSKVRRRYRRKNKTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF

Query:  HFPYQSEEALQENVKHLKLLHFYGAFV
        HFPYQSEE LQEN+KHLKL+HFYGAFV
Subjt:  HFPYQSEEALQENVKHLKLLHFYGAFV

XP_022985949.1 uncharacterized protein LOC111483838 isoform X1 [Cucurbita maxima] (e value: 1.8e-276, % identity: 89.37)
Query:  MARRLELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
        MARR+ELGFPKSASYSLREQAARTILRNVRSQGH+YVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
Subjt:  MARRLELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK

Query:  SVEGDNQIGISNDNHERLLEYHNNDNSLAIVRYVENSQGNGNSNGHAEFNGNIRDLEDCSFENLNDGADNCPLVIPGVLIKDEISDIKVRELGYGQIAAR
         VEGDNQ+   NDNHERLLEYHNNDN+LAIV YV+NS+GNG  NGH EFNGN+R++EDCSFENL+DG DN PLVIPGVLIKDEISDIKV ELGYGQIAAR
Subjt:  SVEGDNQIGISNDNHERLLEYHNNDNSLAIVRYVENSQGNGNSNGHAEFNGNIRDLEDCSFENLNDGADNCPLVIPGVLIKDEISDIKVRELGYGQIAAR

Query:  FTEKDGILSGVSRIWCEWLGKVNIGLENKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDENSRVKRKKSFSDPEDVSESLGHHFDSS
        FTEKDGI+ GVSRIWCEWLGKVN+GLENKVKVP HD AIVTFTYNVDLGRKGLLDDVKLLLSSS GAE E D+NSRVKRKK FSD EDVS+S+ H +DSS
Subjt:  FTEKDGILSGVSRIWCEWLGKVNIGLENKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDENSRVKRKKSFSDPEDVSESLGHHFDSS

Query:  GEDSSASNCVTSSLLLDRYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLMNMKTGRLACSSRNVNGVFHVFHTSCLVHWILLC
        GEDSSASNCV SSLLLDRYDDRIL+TTVMLNK+V+REL++QQRL +ERMCDICQQKILTHKDVATL+NMKTGRLACSSRN NGVFHVFHTSCL+HWILLC
Subjt:  GEDSSASNCVTSSLLLDRYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLMNMKTGRLACSSRNVNGVFHVFHTSCLVHWILLC

Query:  EYEMSVKDLGGSKVRRRYRRKNKTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
        EYEMSVK+LGGSKVRRRYRRKNKTKGNK+SK+ ETRQIKTQIDSVFCPACQGTG+IVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
Subjt:  EYEMSVKDLGGSKVRRRYRRKNKTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF

Query:  HFPYQSEEALQENVKHLKLLHFYGAFV
        HFPYQSEE LQEN+KHLKL+HFYGAFV
Subjt:  HFPYQSEEALQENVKHLKLLHFYGAFV

XP_023512516.1 uncharacterized protein LOC111777240 [Cucurbita pepo subsp. pepo] (e value: 1.1e-276, % identity: 89.56)
Query:  MARRLELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
        MARR+ELGFPKSASYSLREQAARTILRNVRSQGH+YVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
Subjt:  MARRLELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK

Query:  SVEGDNQIGISNDNHERLLEYHNNDNSLAIVRYVENSQGNGNSNGHAEFNGNIRDLEDCSFENLNDGADNCPLVIPGVLIKDEISDIKVRELGYGQIAAR
         VEGDNQ+   NDN ERLLEYHNNDN+LAIV YV+NS+GNG  NGH EFNGN+R++EDCSFENL+DG DN PLVIPGVLIKDEISDIKVRELGYG+IAAR
Subjt:  SVEGDNQIGISNDNHERLLEYHNNDNSLAIVRYVENSQGNGNSNGHAEFNGNIRDLEDCSFENLNDGADNCPLVIPGVLIKDEISDIKVRELGYGQIAAR

Query:  FTEKDGILSGVSRIWCEWLGKVNIGLENKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDENSRVKRKKSFSDPEDVSESLGHHFDSS
        FTEKDGI+ GVSRIWCEWLGKVN+GLENKVKVP HD AIVTFTYNVDLGRKGLLDDVKLLLSSS GAE END+NSRVKRKK FSD EDVS+S+ H +DSS
Subjt:  FTEKDGILSGVSRIWCEWLGKVNIGLENKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDENSRVKRKKSFSDPEDVSESLGHHFDSS

Query:  GEDSSASNCVTSSLLLDRYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLMNMKTGRLACSSRNVNGVFHVFHTSCLVHWILLC
        GEDSSASNCV SSLLLDRYDDRIL+TTVMLNK+V+REL+RQQRL +ERMCDICQQKILTHKDVATL+N+KTGRLACSSRNVNGVFHVFHTSCL+HWILLC
Subjt:  GEDSSASNCVTSSLLLDRYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLMNMKTGRLACSSRNVNGVFHVFHTSCLVHWILLC

Query:  EYEMSVKDLGGSKVRRRYRRKNKTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
        EYEMSVK+LGGSKVRRRYRRKNKTKGNK+SK+ ETRQIKTQIDSVFCPACQGTGIIVDGD+LEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
Subjt:  EYEMSVKDLGGSKVRRRYRRKNKTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF

Query:  HFPYQSEEALQENVKHLKLLHFYGAFV
        HFPYQSEE LQEN+KHLKL+HFYGAFV
Subjt:  HFPYQSEEALQENVKHLKLLHFYGAFV

XP_038901466.1 uncharacterized protein LOC120088321 [Benincasa hispida] (e value: 1.4e-276, % identity: 89.37)
Query:  MARRLELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
        MARR+ELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
Subjt:  MARRLELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK

Query:  SVEGDNQIGISNDNHERLLEYHNNDNSLAIVRYVENSQGNGNSNGHAEFNGNIRDLEDCSFENLNDGADNCPLVIPGVLIKDEISDIKVRELGYGQIAAR
         VEGDNQ+G+SNDNHERLLEYHNNDN+LAIVRYV NS+GNG  NGH EFNGN+R++EDCSFEN+NDG D   LVIPGVLIK+EISDIKVRELGYGQIAAR
Subjt:  SVEGDNQIGISNDNHERLLEYHNNDNSLAIVRYVENSQGNGNSNGHAEFNGNIRDLEDCSFENLNDGADNCPLVIPGVLIKDEISDIKVRELGYGQIAAR

Query:  FTEKDGILSGVSRIWCEWLGKVNIGLENKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDENSRVKRKKSFSDPEDVSESLGHHFDSS
         TEK+GI SGVSRIWCEWLGKVN+G++NKVKVP H+YAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAE +N+EN RVKRK SFSDPED S+S+ H +DSS
Subjt:  FTEKDGILSGVSRIWCEWLGKVNIGLENKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDENSRVKRKKSFSDPEDVSESLGHHFDSS

Query:  GEDSSASNCVTSSLLLDRYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLMNMKTGRLACSSRNVNGVFHVFHTSCLVHWILLC
        GEDSSASN VTSSLLLD YDD+ILS T+MLNKAVRRELRRQQRL AERMCDICQQKILTHKDVATL+NMKTGRLACSSRNVNGVFHVFHTSCL+HWILLC
Subjt:  GEDSSASNCVTSSLLLDRYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLMNMKTGRLACSSRNVNGVFHVFHTSCLVHWILLC

Query:  EYEMSVKDLGGSKVRRRYRRKNKTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
        EYE+SVKDLGG KVRRRYRRK K KG+KHSKD ETRQIKTQIDSVFCPACQGTGIIVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
Subjt:  EYEMSVKDLGGSKVRRRYRRKNKTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF

Query:  HFPYQSEEALQENVKHLKLLHFYGAFV
        HFPYQ +E +QENVKHLKLLHFYGAFV
Subjt:  HFPYQSEEALQENVKHLKLLHFYGAFV
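
In BLAST-style pairwise output, the middle line of each block shows a letter where query and subject are identical, a `+` where the substitution scores positively, and a space otherwise. A minimal sketch of counting these for one block follows; the function name `midline_stats` is hypothetical, and the example string is only the final 27-column block of the alignment above, so the result is not the whole-alignment % identity reported for the hit.

```python
def midline_stats(midline: str) -> dict:
    """Count identities and positives in one BLAST-style midline.

    Letters mark identical residues, '+' marks conservative
    substitutions, and spaces mark mismatches or gaps.
    """
    identical = sum(ch.isalpha() for ch in midline)
    positive = midline.count("+")
    return {
        "identical": identical,
        "positive": positive,
        "columns": len(midline),
    }


# Midline of the last alignment block above (leading padding removed).
stats = midline_stats("HFPYQ +E +QENVKHLKLLHFYGAFV")
percent_identity = 100.0 * stats["identical"] / stats["columns"]
print(stats, round(percent_identity, 2))
```

Summing `identical` and `columns` over all blocks of a hit, rather than one block, is what would be compared against the reported % identity.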

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KE98 C2H2-type domain-containing protein (e value: 1.0e-272, % identity: 88.24)
Query:  MARRLELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
        MARR+ELGFPKSASYSLREQAARTILRNVRSQGHTYVELRE+GK+FIFFCTLCLAPCYSDSVLF+HLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
Subjt:  MARRLELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK

Query:  SVEGDNQIGISNDNHERLLEYHNNDNSLAIVRYVENSQGNGNSNGHAEFNGNIRDLEDCSFENLNDGADNCPLVIPGVLIKDEISDIKVRELGYGQIAAR
         +EGDNQ+GISNDNHERLLEY+NNDN+LAIV+YV NS+GNGN     EFNGN+R++EDCSFENLNDG ++CPLVIPGVLIK+EISDIKVRELGYGQIAAR
Subjt:  SVEGDNQIGISNDNHERLLEYHNNDNSLAIVRYVENSQGNGNSNGHAEFNGNIRDLEDCSFENLNDGADNCPLVIPGVLIKDEISDIKVRELGYGQIAAR

Query:  FTEKDGILSGVSRIWCEWLGKVNIGLENKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDENSRVKRKKSFSDPEDVSESLGHHFDSS
        FTEKDGI SGVSRIWCEWLGKVN G+EN VKVP H+YAI+TFTYNVDLGRKGLLDDVKLLLSSSPGAE +NDEN +VKRKKSFSDPED S S+   +DSS
Subjt:  FTEKDGILSGVSRIWCEWLGKVNIGLENKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDENSRVKRKKSFSDPEDVSESLGHHFDSS

Query:  GEDSSASNCVTSSLLLDRYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLMNMKTGRLACSSRNVNGVFHVFHTSCLVHWILLC
        GEDSSASNCV SSL LD YDD+ILSTTVMLNKAVRRELRRQQRL AERMCDICQQKILTHKDVATL+NMKTGRLACSSRNVNGVFHVFHTSCL+HWILLC
Subjt:  GEDSSASNCVTSSLLLDRYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLMNMKTGRLACSSRNVNGVFHVFHTSCLVHWILLC

Query:  EYEMSVKDLGGSKVRRRYRRKNKTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
        EYE+SVKDLGGSKVRRRYRRK KTKGNKH KD ETRQIKTQIDSVFCPACQGTGI +DGDDLEKPT+PLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
Subjt:  EYEMSVKDLGGSKVRRRYRRKNKTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF

Query:  HFPYQSEEALQENVKHLKLLHFYGAFV
         FPYQ +E +QENVK LKLLHFYGAFV
Subjt:  HFPYQSEEALQENVKHLKLLHFYGAFV

A0A1S3BJC6 uncharacterized protein LOC103490523 (e value: 5.0e-272, % identity: 88.05)
Query:  MARRLELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
        MARR+ELGFPKSASYSLREQAARTILRNVRSQGHTYVELRE+GK+FIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
Subjt:  MARRLELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK

Query:  SVEGDNQIGISNDNHERLLEYHNNDNSLAIVRYVENSQGNGNSNGHAEFNGNIRDLEDCSFENLNDGADNCPLVIPGVLIKDEISDIKVRELGYGQIAAR
         VEGD+Q+G+SNDNHERLLEY+NNDN+LAIV+YV NS+GNG  NG  EFNGN+R++EDCSFENLNDG ++ PLVIPGVLIK+EISDIKVR LGYGQIAAR
Subjt:  SVEGDNQIGISNDNHERLLEYHNNDNSLAIVRYVENSQGNGNSNGHAEFNGNIRDLEDCSFENLNDGADNCPLVIPGVLIKDEISDIKVRELGYGQIAAR

Query:  FTEKDGILSGVSRIWCEWLGKVNIGLENKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDENSRVKRKKSFSDPEDVSESLGHHFDSS
        FTEKDGI SGVSRIWCEWLGKVN G+ENKVKVP H+YAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAE +NDEN +VKRKKSFSDPED S S+   +DSS
Subjt:  FTEKDGILSGVSRIWCEWLGKVNIGLENKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDENSRVKRKKSFSDPEDVSESLGHHFDSS

Query:  GEDSSASNCVTSSLLLDRYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLMNMKTGRLACSSRNVNGVFHVFHTSCLVHWILLC
        GEDSSASNCV SSL LD YDD+ILSTTVMLNKAVRRELRRQ RL AERMCDICQQKILTHKDVATL+NMKTGRLACSSRNVNGVFHVFHTSCL+HWILLC
Subjt:  GEDSSASNCVTSSLLLDRYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLMNMKTGRLACSSRNVNGVFHVFHTSCLVHWILLC

Query:  EYEMSVKDLGGSKVRRRYRRKNKTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
        EYE+SVKDLGGSKVRRRYRRK KTKGNKHSKD ETRQ+K+QID VFCPACQGTG+I+DGDDLEKPT+PLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
Subjt:  EYEMSVKDLGGSKVRRRYRRKNKTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF

Query:  HFPYQSEEALQENVKHLKLLHFYGAFV
        HFPYQ +E +QENVK LKLLHFYGAFV
Subjt:  HFPYQSEEALQENVKHLKLLHFYGAFV

A0A6J1CFP1 uncharacterized protein LOC111011105 isoform X1 (e value: 2.0e-276, % identity: 90.51)
Query:  MARRLELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
        MAR++ELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
Subjt:  MARRLELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK

Query:  SVEGDNQIGISNDNHERLLEYHNNDNSLAIVRYVENSQGNGNSNGHAEFNGNIRDLEDCSFENLNDGADNCPLVIPGVLIKDEISDIKVRELGYGQIAAR
         VE DNQIGISNDNHERLLEYHNNDN+LAIVRY ENS+GNG+   H + +GNIRD+   SFENLNDG D+CPLVIPGVLI+DEISDIKV ELGYGQIAAR
Subjt:  SVEGDNQIGISNDNHERLLEYHNNDNSLAIVRYVENSQGNGNSNGHAEFNGNIRDLEDCSFENLNDGADNCPLVIPGVLIKDEISDIKVRELGYGQIAAR

Query:  FTEKDGILSGVSRIWCEWLGKVNIGLENKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDENSRVKRKKSFSDPEDVSESLGHHFDSS
        FTEKDGIL+GV RIWCEWLGKVN+ LE KVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAE ENDEN+RVKRKKSFSDP  VSESL H +DSS
Subjt:  FTEKDGILSGVSRIWCEWLGKVNIGLENKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDENSRVKRKKSFSDPEDVSESLGHHFDSS

Query:  GEDSSASNCVTSSLLLDRYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLMNMKTGRLACSSRNVNGVFHVFHTSCLVHWILLC
        GEDSSASNCVTSSLLLDRYDD+ILSTT+ LNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATL+NMKTGRLACSSRNVNGVFHVFHTSCL+HWILLC
Subjt:  GEDSSASNCVTSSLLLDRYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLMNMKTGRLACSSRNVNGVFHVFHTSCLVHWILLC

Query:  EYEMSVKDLGGSKVRRRYRRKNKTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
        EYE+SVKDLGG K RRRYRRKN+TKGNK SKDSETRQIKTQIDS+FCPACQGTGI VDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
Subjt:  EYEMSVKDLGGSKVRRRYRRKNKTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF

Query:  HFPYQSEEALQENVKHLKLLHFYGAFV
        HFPYQSEE +QENVK LKLLHFYGAFV
Subjt:  HFPYQSEEALQENVKHLKLLHFYGAFV

A0A6J1FSV1 uncharacterized protein LOC111448154 isoform X1 (e value: 6.7e-277, % identity: 89.56)
Query:  MARRLELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
        MARR+ELGFPKSASYSLREQAARTILRNVRSQGH+YVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
Subjt:  MARRLELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK

Query:  SVEGDNQIGISNDNHERLLEYHNNDNSLAIVRYVENSQGNGNSNGHAEFNGNIRDLEDCSFENLNDGADNCPLVIPGVLIKDEISDIKVRELGYGQIAAR
         VEGDNQ+   NDN ERLLEYHNNDN+LAIV YV++S+GNG  NGH EFNGN+R++EDCSFENL+D  DN PLVIPGVLIKDEISDI+VRELGYGQIAAR
Subjt:  SVEGDNQIGISNDNHERLLEYHNNDNSLAIVRYVENSQGNGNSNGHAEFNGNIRDLEDCSFENLNDGADNCPLVIPGVLIKDEISDIKVRELGYGQIAAR

Query:  FTEKDGILSGVSRIWCEWLGKVNIGLENKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDENSRVKRKKSFSDPEDVSESLGHHFDSS
        FTEKDGIL G+SRIWCEWLGKVN GLENKVKVP HD+AIVTFTYNVDLGRKGLLDDVKLLLSSS GAE E DENSRVKRKK FSD EDVS+S+ H +DSS
Subjt:  FTEKDGILSGVSRIWCEWLGKVNIGLENKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDENSRVKRKKSFSDPEDVSESLGHHFDSS

Query:  GEDSSASNCVTSSLLLDRYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLMNMKTGRLACSSRNVNGVFHVFHTSCLVHWILLC
        GEDSSASNCV SSLLLDRYDDRIL+TTVMLNK+V+REL+RQQRL +ERMCDICQQKILTHKDVATL+NMKTGRLACSSRNVNGVFHVFHTSCL+HWILLC
Subjt:  GEDSSASNCVTSSLLLDRYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLMNMKTGRLACSSRNVNGVFHVFHTSCLVHWILLC

Query:  EYEMSVKDLGGSKVRRRYRRKNKTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
        EYEMSVK+LGGSKVRRRYRRKNKTKGNK+SK+ ETRQIKTQIDSVFCPACQGTGIIVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
Subjt:  EYEMSVKDLGGSKVRRRYRRKNKTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF

Query:  HFPYQSEEALQENVKHLKLLHFYGAFV
        HFPYQSEE LQEN+KHLKL+HFYGAFV
Subjt:  HFPYQSEEALQENVKHLKLLHFYGAFV

A0A6J1JEQ8 uncharacterized protein LOC111483838 isoform X1 (e value: 8.8e-277, % identity: 89.37)
Query:  MARRLELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
        MARR+ELGFPKSASYSLREQAARTILRNVRSQGH+YVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
Subjt:  MARRLELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK

Query:  SVEGDNQIGISNDNHERLLEYHNNDNSLAIVRYVENSQGNGNSNGHAEFNGNIRDLEDCSFENLNDGADNCPLVIPGVLIKDEISDIKVRELGYGQIAAR
         VEGDNQ+   NDNHERLLEYHNNDN+LAIV YV+NS+GNG  NGH EFNGN+R++EDCSFENL+DG DN PLVIPGVLIKDEISDIKV ELGYGQIAAR
Subjt:  SVEGDNQIGISNDNHERLLEYHNNDNSLAIVRYVENSQGNGNSNGHAEFNGNIRDLEDCSFENLNDGADNCPLVIPGVLIKDEISDIKVRELGYGQIAAR

Query:  FTEKDGILSGVSRIWCEWLGKVNIGLENKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDENSRVKRKKSFSDPEDVSESLGHHFDSS
        FTEKDGI+ GVSRIWCEWLGKVN+GLENKVKVP HD AIVTFTYNVDLGRKGLLDDVKLLLSSS GAE E D+NSRVKRKK FSD EDVS+S+ H +DSS
Subjt:  FTEKDGILSGVSRIWCEWLGKVNIGLENKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDENSRVKRKKSFSDPEDVSESLGHHFDSS

Query:  GEDSSASNCVTSSLLLDRYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLMNMKTGRLACSSRNVNGVFHVFHTSCLVHWILLC
        GEDSSASNCV SSLLLDRYDDRIL+TTVMLNK+V+REL++QQRL +ERMCDICQQKILTHKDVATL+NMKTGRLACSSRN NGVFHVFHTSCL+HWILLC
Subjt:  GEDSSASNCVTSSLLLDRYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLMNMKTGRLACSSRNVNGVFHVFHTSCLVHWILLC

Query:  EYEMSVKDLGGSKVRRRYRRKNKTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
        EYEMSVK+LGGSKVRRRYRRKNKTKGNK+SK+ ETRQIKTQIDSVFCPACQGTG+IVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
Subjt:  EYEMSVKDLGGSKVRRRYRRKNKTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF

Query:  HFPYQSEEALQENVKHLKLLHFYGAFV
        HFPYQSEE LQEN+KHLKL+HFYGAFV
Subjt:  HFPYQSEEALQENVKHLKLLHFYGAFV

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT4G28260.1 unknown protein (e value: 2.3e-136, % identity: 49.62)
Query:  MARRLELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
        MA + ELG PK  S +L+EQ ART L+N+R QGHTY+ELREDGKRF+FFCTLCLAPCYSD++L  HL G LH ERL+ A++TLLG NPWPF DGVLFF  
Subjt:  MARRLELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK

Query:  SV-EGDNQIGIS-NDNHERLLEYHNNDNSLAIVRYVENSQGNGNSNGHAEFNGNIRDLEDCSFENLNDGADNCPLVIPGVLIKDEISDIKVRELGYGQIA
        S  E + +  +S  +     LE+ ++D   AIV+Y +N++ NG++   A     + D E       +  AD+  L+I GVLIK+   D++ + +G+G+IA
Subjt:  SV-EGDNQIGIS-NDNHERLLEYHNNDNSLAIVRYVENSQGNGNSNGHAEFNGNIRDLEDCSFENLNDGADNCPLVIPGVLIKDEISDIKVRELGYGQIA

Query:  ARFTEKDGILSGVSRIWCEWLGKVNIGLENKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDENSRVKRKKSFSDPEDVSESLGHHFD
        AR  E  G  + + ++WCEWLG      E K  +P HD+AIVTF+Y  +LGR GLLDD   LL+SS  +E  N E+S  KRKKSFSDPED SESL + +D
Subjt:  ARFTEKDGILSGVSRIWCEWLGKVNIGLENKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDENSRVKRKKSFSDPEDVSESLGHHFD

Query:  SSGEDSSASNCVTSSLLLDRYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLMNMKTGRLACSSRNVNGVFHVFHTSCLVHWIL
        SS E SS  N  +S  L+  YDD ++S  V+ N+ VRRELRRQQR+ +ER+C++C+QK+L  KD A ++NMKTG LAC SRN+ G FH+FH SC+VHW L
Subjt:  SSGEDSSASNCVTSSLLLDRYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLMNMKTGRLACSSRNVNGVFHVFHTSCLVHWIL

Query:  LCEYEMSVKDLGGSKVRRRYRRKNKTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCST
         CE E+    +   K ++R  + +   G K ++      +  QI SVFCP CQGTGI ++G  +E+ T PLS+ +++++KVS+ R+AW+K+PE L+NCST
Subjt:  LCEYEMSVKDLGGSKVRRRYRRKNKTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCST

Query:  GFHFPYQSEEALQ-----ENVKHLKLLHFY
        GFHFP Q+EE  Q     E V+ +KL+ FY
Subjt:  GFHFPYQSEEALQ-----ENVKHLKLLHFY

AT4G28260.2 unknown protein (e value: 2.3e-136, % identity: 49.62)
Query:  MARRLELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
        MA + ELG PK  S +L+EQ ART L+N+R QGHTY+ELREDGKRF+FFCTLCLAPCYSD++L  HL G LH ERL+ A++TLLG NPWPF DGVLFF  
Subjt:  MARRLELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK

Query:  SV-EGDNQIGIS-NDNHERLLEYHNNDNSLAIVRYVENSQGNGNSNGHAEFNGNIRDLEDCSFENLNDGADNCPLVIPGVLIKDEISDIKVRELGYGQIA
        S  E + +  +S  +     LE+ ++D   AIV+Y +N++ NG++   A     + D E       +  AD+  L+I GVLIK+   D++ + +G+G+IA
Subjt:  SV-EGDNQIGIS-NDNHERLLEYHNNDNSLAIVRYVENSQGNGNSNGHAEFNGNIRDLEDCSFENLNDGADNCPLVIPGVLIKDEISDIKVRELGYGQIA

Query:  ARFTEKDGILSGVSRIWCEWLGKVNIGLENKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDENSRVKRKKSFSDPEDVSESLGHHFD
        AR  E  G  + + ++WCEWLG      E K  +P HD+AIVTF+Y  +LGR GLLDD   LL+SS  +E  N E+S  KRKKSFSDPED SESL + +D
Subjt:  ARFTEKDGILSGVSRIWCEWLGKVNIGLENKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDENSRVKRKKSFSDPEDVSESLGHHFD

Query:  SSGEDSSASNCVTSSLLLDRYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLMNMKTGRLACSSRNVNGVFHVFHTSCLVHWIL
        SS E SS  N  +S  L+  YDD ++S  V+ N+ VRRELRRQQR+ +ER+C++C+QK+L  KD A ++NMKTG LAC SRN+ G FH+FH SC+VHW L
Subjt:  SSGEDSSASNCVTSSLLLDRYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLMNMKTGRLACSSRNVNGVFHVFHTSCLVHWIL

Query:  LCEYEMSVKDLGGSKVRRRYRRKNKTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCST
         CE E+    +   K ++R  + +   G K ++      +  QI SVFCP CQGTGI ++G  +E+ T PLS+ +++++KVS+ R+AW+K+PE L+NCST
Subjt:  LCEYEMSVKDLGGSKVRRRYRRKNKTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCST

Query:  GFHFPYQSEEALQ-----ENVKHLKLLHFY
        GFHFP Q+EE  Q     E V+ +KL+ FY
Subjt:  GFHFPYQSEEALQ-----ENVKHLKLLHFY


Sequences
CDS sequence
ATGGCAAGAAGGTTGGAATTGGGGTTTCCGAAGTCTGCTTCATATAGTCTTCGAGAACAAGCTGCTAGAACGATTCTACGCAATGTAAGGTCACAAGGGCACACATATGT
TGAGCTACGAGAAGATGGGAAAAGGTTTATTTTCTTCTGCACTTTGTGTCTGGCTCCATGTTATAGTGACTCGGTGCTTTTTAACCATCTGAAGGGTACTCTTCACACTG
AAAGATTATCTGCTGCCAAGCTGACTCTCTTAGGACCAAATCCGTGGCCTTTTGATGATGGTGTTCTTTTCTTCCACAAGTCAGTTGAAGGAGACAACCAGATTGGGATT
TCAAATGACAATCATGAAAGGTTGTTGGAGTATCACAACAATGATAACAGTCTTGCTATTGTCAGGTATGTTGAAAATTCACAAGGCAACGGCAACAGTAATGGACATGC
TGAGTTTAATGGAAATATTAGGGACTTGGAAGATTGTTCCTTCGAGAATTTGAATGACGGTGCAGACAATTGCCCTTTGGTGATTCCTGGTGTTCTGATTAAGGATGAAA
TTTCTGATATAAAAGTGAGGGAGTTGGGTTATGGACAAATTGCAGCTAGGTTTACTGAAAAGGATGGGATCTTATCTGGAGTTAGCAGAATATGGTGTGAGTGGCTAGGT
AAAGTAAATATTGGGCTTGAGAATAAGGTCAAAGTTCCTGGACATGATTATGCTATTGTTACTTTTACTTATAATGTTGATTTAGGTAGAAAGGGCTTGCTTGATGATGT
CAAGTTATTGCTCTCATCTAGTCCTGGAGCTGAACAAGAGAATGATGAGAACTCTAGAGTGAAAAGAAAGAAATCTTTCTCTGATCCTGAGGATGTTAGTGAGTCTTTGG
GCCATCATTTTGATTCGTCAGGTGAAGATTCTTCGGCTTCAAATTGTGTCACTTCATCACTGTTGTTGGATAGATATGATGATCGAATTTTGAGTACAACAGTCATGTTG
AATAAGGCAGTAAGGCGTGAGCTGAGAAGGCAACAGCGTTTAGTTGCAGAGCGGATGTGTGATATCTGTCAACAGAAGATACTCACTCATAAAGATGTAGCAACACTCAT
GAACATGAAAACTGGAAGACTTGCCTGCAGTAGTCGAAATGTTAATGGGGTGTTTCATGTATTTCATACCTCTTGCCTTGTACATTGGATACTTCTCTGTGAGTATGAGA
TGAGTGTGAAGGATCTAGGTGGTTCAAAAGTTAGACGACGGTACAGGAGAAAGAACAAGACTAAGGGCAACAAACACAGTAAGGACAGTGAAACAAGACAAATAAAAACT
CAAATTGATTCTGTATTCTGCCCAGCATGTCAGGGTACCGGTATAATTGTTGATGGGGATGACCTAGAGAAACCAACTATTCCTCTTTCTGAGATCTTCAAATACAAAAT
AAAGGTGAGTGATGCCCGAAGAGCGTGGATGAAAAGTCCTGAGGTTCTGCAGAACTGTTCGACAGGTTTCCATTTCCCTTACCAATCTGAAGAAGCTCTCCAGGAAAATG
TGAAGCATCTCAAATTGCTGCACTTTTATGGAGCTTTTGTATAG
mRNA sequence
ATGGCAAGAAGGTTGGAATTGGGGTTTCCGAAGTCTGCTTCATATAGTCTTCGAGAACAAGCTGCTAGAACGATTCTACGCAATGTAAGGTCACAAGGGCACACATATGT
TGAGCTACGAGAAGATGGGAAAAGGTTTATTTTCTTCTGCACTTTGTGTCTGGCTCCATGTTATAGTGACTCGGTGCTTTTTAACCATCTGAAGGGTACTCTTCACACTG
AAAGATTATCTGCTGCCAAGCTGACTCTCTTAGGACCAAATCCGTGGCCTTTTGATGATGGTGTTCTTTTCTTCCACAAGTCAGTTGAAGGAGACAACCAGATTGGGATT
TCAAATGACAATCATGAAAGGTTGTTGGAGTATCACAACAATGATAACAGTCTTGCTATTGTCAGGTATGTTGAAAATTCACAAGGCAACGGCAACAGTAATGGACATGC
TGAGTTTAATGGAAATATTAGGGACTTGGAAGATTGTTCCTTCGAGAATTTGAATGACGGTGCAGACAATTGCCCTTTGGTGATTCCTGGTGTTCTGATTAAGGATGAAA
TTTCTGATATAAAAGTGAGGGAGTTGGGTTATGGACAAATTGCAGCTAGGTTTACTGAAAAGGATGGGATCTTATCTGGAGTTAGCAGAATATGGTGTGAGTGGCTAGGT
AAAGTAAATATTGGGCTTGAGAATAAGGTCAAAGTTCCTGGACATGATTATGCTATTGTTACTTTTACTTATAATGTTGATTTAGGTAGAAAGGGCTTGCTTGATGATGT
CAAGTTATTGCTCTCATCTAGTCCTGGAGCTGAACAAGAGAATGATGAGAACTCTAGAGTGAAAAGAAAGAAATCTTTCTCTGATCCTGAGGATGTTAGTGAGTCTTTGG
GCCATCATTTTGATTCGTCAGGTGAAGATTCTTCGGCTTCAAATTGTGTCACTTCATCACTGTTGTTGGATAGATATGATGATCGAATTTTGAGTACAACAGTCATGTTG
AATAAGGCAGTAAGGCGTGAGCTGAGAAGGCAACAGCGTTTAGTTGCAGAGCGGATGTGTGATATCTGTCAACAGAAGATACTCACTCATAAAGATGTAGCAACACTCAT
GAACATGAAAACTGGAAGACTTGCCTGCAGTAGTCGAAATGTTAATGGGGTGTTTCATGTATTTCATACCTCTTGCCTTGTACATTGGATACTTCTCTGTGAGTATGAGA
TGAGTGTGAAGGATCTAGGTGGTTCAAAAGTTAGACGACGGTACAGGAGAAAGAACAAGACTAAGGGCAACAAACACAGTAAGGACAGTGAAACAAGACAAATAAAAACT
CAAATTGATTCTGTATTCTGCCCAGCATGTCAGGGTACCGGTATAATTGTTGATGGGGATGACCTAGAGAAACCAACTATTCCTCTTTCTGAGATCTTCAAATACAAAAT
AAAGGTGAGTGATGCCCGAAGAGCGTGGATGAAAAGTCCTGAGGTTCTGCAGAACTGTTCGACAGGTTTCCATTTCCCTTACCAATCTGAAGAAGCTCTCCAGGAAAATG
TGAAGCATCTCAAATTGCTGCACTTTTATGGAGCTTTTGTATAG
Protein sequence
MARRLELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKSVEGDNQIGI
SNDNHERLLEYHNNDNSLAIVRYVENSQGNGNSNGHAEFNGNIRDLEDCSFENLNDGADNCPLVIPGVLIKDEISDIKVRELGYGQIAARFTEKDGILSGVSRIWCEWLG
KVNIGLENKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDENSRVKRKKSFSDPEDVSESLGHHFDSSGEDSSASNCVTSSLLLDRYDDRILSTTVML
NKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLMNMKTGRLACSSRNVNGVFHVFHTSCLVHWILLCEYEMSVKDLGGSKVRRRYRRKNKTKGNKHSKDSETRQIKT
QIDSVFCPACQGTGIIVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGFHFPYQSEEALQENVKHLKLLHFYGAFV
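
The protein sequence corresponds to the CDS read three bases at a time. The sketch below illustrates that relationship under the assumption that the standard nuclear genetic code applies; the helper name `translate` is hypothetical, and only the first 33 and last 15 bases of the CDS above are used as input rather than the full sequence.

```python
# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 11 codons and last 5 codons of the CDS listed above.
print(translate("ATGGCAAGAAGGTTGGAATTGGGGTTTCCGAAG"))  # MARRLELGFPK
print(translate("GGAGCTTTTGTATAG"))                    # GAFV
```

Both fragments agree with the start (`MARRLELGFPK`) and end (`GAFV`) of the protein sequence above; the 1584-base CDS encodes 527 residues plus the terminal stop codon.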