; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS004810 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS004810
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionC2H2-type domain-containing protein
Genome locationscaffold176:768909..771936
RNA-Seq ExpressionMS004810
SyntenyMS004810
Gene Ontology termsNA
InterPro domainsIPR013087 - Zinc finger C2H2-type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004139943.1 uncharacterized protein LOC101204451 [Cucumis sativus]3.4e-26787.43Show/hide
Query:  MARKMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
        MAR+MELGFPKSASYSLREQAARTILRNVRSQGHTYVELRE+GK+FIFFCTLCLAPCYSDSVLF+HLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
Subjt:  MARKMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK

Query:  PVERDNQIGISNDNHERLLEYHNNDNNLAIVRYDENSKGNGDRHVDCDGNIR---DMSFENLNDGGDSCPLVIPGVLIRDEISDIKVMELGYGQIAARFT
        P+E DNQ+GISNDNHERLLEY+NNDNNLAIV+Y  NSKGNG+R  + +GN+R   D SFENLNDGG+SCPLVIPGVLI++EISDIKV ELGYGQIAARFT
Subjt:  PVERDNQIGISNDNHERLLEYHNNDNNLAIVRYDENSKGNGDRHVDCDGNIR---DMSFENLNDGGDSCPLVIPGVLIRDEISDIKVMELGYGQIAARFT

Query:  EKDGILTGVSRIWCEWLGKVNVELEGKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAELENDENTRVKRKKSFSDPGVVSESLSHQYDSSGE
        EKDGI +GVSRIWCEWLGKVN  +E  VKVP H+YAI+TFTYNVDLGRKGLLDDVKLLLSSSPGAE +NDEN +VKRKKSFSDP   S S+S QYDSSGE
Subjt:  EKDGILTGVSRIWCEWLGKVNVELEGKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAELENDENTRVKRKKSFSDPGVVSESLSHQYDSSGE

Query:  DSSASNCVTSSLLLDRYDDQILSTTITLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY
        DSSASNCV SSL LD YDDQILSTT+ LNKAVRRELRRQQRL AERMCDICQQKILTHKDVATL+NMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY
Subjt:  DSSASNCVTSSLLLDRYDDQILSTTITLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY

Query:  EISVKDLGGKKARRRYRRKNRTKGNKCSKDSETRQIKTQIDSIFCPACQGTGITVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGFHF
        EISVKDLGG K RRRYRRK +TKGNK  KD ETRQIKTQIDS+FCPACQGTGIT+DGDDLEKPT+PLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF F
Subjt:  EISVKDLGGKKARRRYRRKNRTKGNKCSKDSETRQIKTQIDSIFCPACQGTGITVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGFHF

Query:  PYQSEETIQENVKPLKLLHFYGAFV
        PYQ +ETIQENVKPLKLLHFYGAFV
Subjt:  PYQSEETIQENVKPLKLLHFYGAFV

XP_022140423.1 uncharacterized protein LOC111011105 isoform X1 [Momordica charantia]2.4e-30599.81Show/hide
Query:  MARKMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
        MARKMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
Subjt:  MARKMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK

Query:  PVERDNQIGISNDNHERLLEYHNNDNNLAIVRYDENSKGNGDRHVDCDGNIRDMSFENLNDGGDSCPLVIPGVLIRDEISDIKVMELGYGQIAARFTEKD
        PVERDNQIGISNDNHERLLEYHNNDNNLAIVRYDENSKGNGDRHVDCDGNIRDMSFENLNDGGDSCPLVIPGVLIRDEISDIKVMELGYGQIAARFTEKD
Subjt:  PVERDNQIGISNDNHERLLEYHNNDNNLAIVRYDENSKGNGDRHVDCDGNIRDMSFENLNDGGDSCPLVIPGVLIRDEISDIKVMELGYGQIAARFTEKD

Query:  GILTGVSRIWCEWLGKVNVELEGKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAELENDENTRVKRKKSFSDPGVVSESLSHQYDSSGEDSS
        GILTGV RIWCEWLGKVNVELEGKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAELENDENTRVKRKKSFSDPGVVSESLSHQYDSSGEDSS
Subjt:  GILTGVSRIWCEWLGKVNVELEGKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAELENDENTRVKRKKSFSDPGVVSESLSHQYDSSGEDSS

Query:  ASNCVTSSLLLDRYDDQILSTTITLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEIS
        ASNCVTSSLLLDRYDDQILSTTITLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEIS
Subjt:  ASNCVTSSLLLDRYDDQILSTTITLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEIS

Query:  VKDLGGKKARRRYRRKNRTKGNKCSKDSETRQIKTQIDSIFCPACQGTGITVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGFHFPYQ
        VKDLGGKKARRRYRRKNRTKGNKCSKDSETRQIKTQIDSIFCPACQGTGITVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGFHFPYQ
Subjt:  VKDLGGKKARRRYRRKNRTKGNKCSKDSETRQIKTQIDSIFCPACQGTGITVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGFHFPYQ

Query:  SEETIQENVKPLKLLHFYGAFV
        SEETIQENVKPLKLLHFYGAFV
Subjt:  SEETIQENVKPLKLLHFYGAFV

XP_022985949.1 uncharacterized protein LOC111483838 isoform X1 [Cucurbita maxima]2.4e-26586.67Show/hide
Query:  MARKMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
        MAR+MELGFPKSASYSLREQAARTILRNVRSQGH+YVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
Subjt:  MARKMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK

Query:  PVERDNQIGISNDNHERLLEYHNNDNNLAIVRYDENSKGNGDRHVDCDGNIR---DMSFENLNDGGDSCPLVIPGVLIRDEISDIKVMELGYGQIAARFT
        PVE DNQ+   NDNHERLLEYHNNDNNLAIV Y +NSKGNG+ H + +GN+R   D SFENL+DGGD+ PLVIPGVLI+DEISDIKVMELGYGQIAARFT
Subjt:  PVERDNQIGISNDNHERLLEYHNNDNNLAIVRYDENSKGNGDRHVDCDGNIR---DMSFENLNDGGDSCPLVIPGVLIRDEISDIKVMELGYGQIAARFT

Query:  EKDGILTGVSRIWCEWLGKVNVELEGKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAELENDENTRVKRKKSFSDPGVVSESLSHQYDSSGE
        EKDGI+ GVSRIWCEWLGKVNV LE KVKVP HD AIVTFTYNVDLGRKGLLDDVKLLLSSS GAE E D+N+RVKRKK FSD   VS+S+SHQYDSSGE
Subjt:  EKDGILTGVSRIWCEWLGKVNVELEGKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAELENDENTRVKRKKSFSDPGVVSESLSHQYDSSGE

Query:  DSSASNCVTSSLLLDRYDDQILSTTITLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY
        DSSASNCV SSLLLDRYDD+IL+TT+ LNK+V+REL++QQRL +ERMCDICQQKILTHKDVATL+NMKTGRLACSSRN NGVFHVFHTSCLIHWILLCEY
Subjt:  DSSASNCVTSSLLLDRYDDQILSTTITLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY

Query:  EISVKDLGGKKARRRYRRKNRTKGNKCSKDSETRQIKTQIDSIFCPACQGTGITVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGFHF
        E+SVK+LGG K RRRYRRKN+TKGNK SK+ ETRQIKTQIDS+FCPACQGTG+ VDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGFHF
Subjt:  EISVKDLGGKKARRRYRRKNRTKGNKCSKDSETRQIKTQIDSIFCPACQGTGITVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGFHF

Query:  PYQSEETIQENVKPLKLLHFYGAFV
        PYQSEET+QEN+K LKL+HFYGAFV
Subjt:  PYQSEETIQENVKPLKLLHFYGAFV

XP_023512516.1 uncharacterized protein LOC111777240 [Cucurbita pepo subsp. pepo]6.0e-26486.48Show/hide
Query:  MARKMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
        MAR+MELGFPKSASYSLREQAARTILRNVRSQGH+YVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
Subjt:  MARKMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK

Query:  PVERDNQIGISNDNHERLLEYHNNDNNLAIVRYDENSKGNGDRHVDCDGNIR---DMSFENLNDGGDSCPLVIPGVLIRDEISDIKVMELGYGQIAARFT
        PVE DNQ+   NDN ERLLEYHNNDNNLAIV Y +NSKGNG+ H + +GN+R   D SFENL+DGGD+ PLVIPGVLI+DEISDIKV ELGYG+IAARFT
Subjt:  PVERDNQIGISNDNHERLLEYHNNDNNLAIVRYDENSKGNGDRHVDCDGNIR---DMSFENLNDGGDSCPLVIPGVLIRDEISDIKVMELGYGQIAARFT

Query:  EKDGILTGVSRIWCEWLGKVNVELEGKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAELENDENTRVKRKKSFSDPGVVSESLSHQYDSSGE
        EKDGI+ GVSRIWCEWLGKVNV LE KVKVP HD AIVTFTYNVDLGRKGLLDDVKLLLSSS GAE END+N+RVKRKK FSD   VS+S+SHQYDSSGE
Subjt:  EKDGILTGVSRIWCEWLGKVNVELEGKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAELENDENTRVKRKKSFSDPGVVSESLSHQYDSSGE

Query:  DSSASNCVTSSLLLDRYDDQILSTTITLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY
        DSSASNCV SSLLLDRYDD+IL+TT+ LNK+V+REL+RQQRL +ERMCDICQQKILTHKDVATL+N+KTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY
Subjt:  DSSASNCVTSSLLLDRYDDQILSTTITLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY

Query:  EISVKDLGGKKARRRYRRKNRTKGNKCSKDSETRQIKTQIDSIFCPACQGTGITVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGFHF
        E+SVK+LGG K RRRYRRKN+TKGNK SK+ ETRQIKTQIDS+FCPACQGTGI VDGD+LEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGFHF
Subjt:  EISVKDLGGKKARRRYRRKNRTKGNKCSKDSETRQIKTQIDSIFCPACQGTGITVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGFHF

Query:  PYQSEETIQENVKPLKLLHFYGAFV
        PYQSEET+QEN+K LKL+HFYGAFV
Subjt:  PYQSEETIQENVKPLKLLHFYGAFV

XP_038901466.1 uncharacterized protein LOC120088321 [Benincasa hispida]9.9e-26788Show/hide
Query:  MARKMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
        MAR+MELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
Subjt:  MARKMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK

Query:  PVERDNQIGISNDNHERLLEYHNNDNNLAIVRYDENSKGNGDRHVDCDGNIRDM---SFENLNDGGDSCPLVIPGVLIRDEISDIKVMELGYGQIAARFT
        PVE DNQ+G+SNDNHERLLEYHNNDNNLAIVRY  NSKGNG+ H + +GN+R+M   SFEN+NDGGD   LVIPGVLI++EISDIKV ELGYGQIAAR T
Subjt:  PVERDNQIGISNDNHERLLEYHNNDNNLAIVRYDENSKGNGDRHVDCDGNIRDM---SFENLNDGGDSCPLVIPGVLIRDEISDIKVMELGYGQIAARFT

Query:  EKDGILTGVSRIWCEWLGKVNVELEGKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAELENDENTRVKRKKSFSDPGVVSESLSHQYDSSGE
        EK+GI +GVSRIWCEWLGKVNV ++ KVKVP H+YAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAE +N+EN RVKRK SFSDP   S+S+SHQYDSSGE
Subjt:  EKDGILTGVSRIWCEWLGKVNVELEGKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAELENDENTRVKRKKSFSDPGVVSESLSHQYDSSGE

Query:  DSSASNCVTSSLLLDRYDDQILSTTITLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY
        DSSASN VTSSLLLD YDDQILS TI LNKAVRRELRRQQRL AERMCDICQQKILTHKDVATL+NMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY
Subjt:  DSSASNCVTSSLLLDRYDDQILSTTITLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY

Query:  EISVKDLGGKKARRRYRRKNRTKGNKCSKDSETRQIKTQIDSIFCPACQGTGITVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGFHF
        EISVKDLGG K RRRYRRK + KG+K SKD ETRQIKTQIDS+FCPACQGTGI VDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGFHF
Subjt:  EISVKDLGGKKARRRYRRKNRTKGNKCSKDSETRQIKTQIDSIFCPACQGTGITVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGFHF

Query:  PYQSEETIQENVKPLKLLHFYGAFV
        PYQ +ETIQENVK LKLLHFYGAFV
Subjt:  PYQSEETIQENVKPLKLLHFYGAFV

TrEMBL top hitse value%identityAlignment
A0A0A0KE98 C2H2-type domain-containing protein1.6e-26787.43Show/hide
Query:  MARKMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
        MAR+MELGFPKSASYSLREQAARTILRNVRSQGHTYVELRE+GK+FIFFCTLCLAPCYSDSVLF+HLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
Subjt:  MARKMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK

Query:  PVERDNQIGISNDNHERLLEYHNNDNNLAIVRYDENSKGNGDRHVDCDGNIR---DMSFENLNDGGDSCPLVIPGVLIRDEISDIKVMELGYGQIAARFT
        P+E DNQ+GISNDNHERLLEY+NNDNNLAIV+Y  NSKGNG+R  + +GN+R   D SFENLNDGG+SCPLVIPGVLI++EISDIKV ELGYGQIAARFT
Subjt:  PVERDNQIGISNDNHERLLEYHNNDNNLAIVRYDENSKGNGDRHVDCDGNIR---DMSFENLNDGGDSCPLVIPGVLIRDEISDIKVMELGYGQIAARFT

Query:  EKDGILTGVSRIWCEWLGKVNVELEGKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAELENDENTRVKRKKSFSDPGVVSESLSHQYDSSGE
        EKDGI +GVSRIWCEWLGKVN  +E  VKVP H+YAI+TFTYNVDLGRKGLLDDVKLLLSSSPGAE +NDEN +VKRKKSFSDP   S S+S QYDSSGE
Subjt:  EKDGILTGVSRIWCEWLGKVNVELEGKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAELENDENTRVKRKKSFSDPGVVSESLSHQYDSSGE

Query:  DSSASNCVTSSLLLDRYDDQILSTTITLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY
        DSSASNCV SSL LD YDDQILSTT+ LNKAVRRELRRQQRL AERMCDICQQKILTHKDVATL+NMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY
Subjt:  DSSASNCVTSSLLLDRYDDQILSTTITLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY

Query:  EISVKDLGGKKARRRYRRKNRTKGNKCSKDSETRQIKTQIDSIFCPACQGTGITVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGFHF
        EISVKDLGG K RRRYRRK +TKGNK  KD ETRQIKTQIDS+FCPACQGTGIT+DGDDLEKPT+PLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF F
Subjt:  EISVKDLGGKKARRRYRRKNRTKGNKCSKDSETRQIKTQIDSIFCPACQGTGITVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGFHF

Query:  PYQSEETIQENVKPLKLLHFYGAFV
        PYQ +ETIQENVKPLKLLHFYGAFV
Subjt:  PYQSEETIQENVKPLKLLHFYGAFV

A0A1S3BJC6 uncharacterized protein LOC1034905236.5e-26486.48Show/hide
Query:  MARKMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
        MAR+MELGFPKSASYSLREQAARTILRNVRSQGHTYVELRE+GK+FIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
Subjt:  MARKMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK

Query:  PVERDNQIGISNDNHERLLEYHNNDNNLAIVRYDENSKGNGDRHVDCDGNIR---DMSFENLNDGGDSCPLVIPGVLIRDEISDIKVMELGYGQIAARFT
        PVE D+Q+G+SNDNHERLLEY+NNDNNLAIV+Y  NSKGNG+   + +GN+R   D SFENLNDGG+S PLVIPGVLI++EISDIKV  LGYGQIAARFT
Subjt:  PVERDNQIGISNDNHERLLEYHNNDNNLAIVRYDENSKGNGDRHVDCDGNIR---DMSFENLNDGGDSCPLVIPGVLIRDEISDIKVMELGYGQIAARFT

Query:  EKDGILTGVSRIWCEWLGKVNVELEGKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAELENDENTRVKRKKSFSDPGVVSESLSHQYDSSGE
        EKDGI +GVSRIWCEWLGKVN  +E KVKVP H+YAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAE +NDEN +VKRKKSFSDP   S S+S QYDSSGE
Subjt:  EKDGILTGVSRIWCEWLGKVNVELEGKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAELENDENTRVKRKKSFSDPGVVSESLSHQYDSSGE

Query:  DSSASNCVTSSLLLDRYDDQILSTTITLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY
        DSSASNCV SSL LD YDDQILSTT+ LNKAVRRELRRQ RL AERMCDICQQKILTHKDVATL+NMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY
Subjt:  DSSASNCVTSSLLLDRYDDQILSTTITLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY

Query:  EISVKDLGGKKARRRYRRKNRTKGNKCSKDSETRQIKTQIDSIFCPACQGTGITVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGFHF
        EISVKDLGG K RRRYRRK +TKGNK SKD ETRQ+K+QID +FCPACQGTG+ +DGDDLEKPT+PLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGFHF
Subjt:  EISVKDLGGKKARRRYRRKNRTKGNKCSKDSETRQIKTQIDSIFCPACQGTGITVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGFHF

Query:  PYQSEETIQENVKPLKLLHFYGAFV
        PYQ +ETIQENVKPLKLLHFYGAFV
Subjt:  PYQSEETIQENVKPLKLLHFYGAFV

A0A6J1CFP1 uncharacterized protein LOC111011105 isoform X11.2e-30599.81Show/hide
Query:  MARKMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
        MARKMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
Subjt:  MARKMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK

Query:  PVERDNQIGISNDNHERLLEYHNNDNNLAIVRYDENSKGNGDRHVDCDGNIRDMSFENLNDGGDSCPLVIPGVLIRDEISDIKVMELGYGQIAARFTEKD
        PVERDNQIGISNDNHERLLEYHNNDNNLAIVRYDENSKGNGDRHVDCDGNIRDMSFENLNDGGDSCPLVIPGVLIRDEISDIKVMELGYGQIAARFTEKD
Subjt:  PVERDNQIGISNDNHERLLEYHNNDNNLAIVRYDENSKGNGDRHVDCDGNIRDMSFENLNDGGDSCPLVIPGVLIRDEISDIKVMELGYGQIAARFTEKD

Query:  GILTGVSRIWCEWLGKVNVELEGKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAELENDENTRVKRKKSFSDPGVVSESLSHQYDSSGEDSS
        GILTGV RIWCEWLGKVNVELEGKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAELENDENTRVKRKKSFSDPGVVSESLSHQYDSSGEDSS
Subjt:  GILTGVSRIWCEWLGKVNVELEGKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAELENDENTRVKRKKSFSDPGVVSESLSHQYDSSGEDSS

Query:  ASNCVTSSLLLDRYDDQILSTTITLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEIS
        ASNCVTSSLLLDRYDDQILSTTITLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEIS
Subjt:  ASNCVTSSLLLDRYDDQILSTTITLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEIS

Query:  VKDLGGKKARRRYRRKNRTKGNKCSKDSETRQIKTQIDSIFCPACQGTGITVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGFHFPYQ
        VKDLGGKKARRRYRRKNRTKGNKCSKDSETRQIKTQIDSIFCPACQGTGITVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGFHFPYQ
Subjt:  VKDLGGKKARRRYRRKNRTKGNKCSKDSETRQIKTQIDSIFCPACQGTGITVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGFHFPYQ

Query:  SEETIQENVKPLKLLHFYGAFV
        SEETIQENVKPLKLLHFYGAFV
Subjt:  SEETIQENVKPLKLLHFYGAFV

A0A6J1FSV1 uncharacterized protein LOC111448154 isoform X13.8e-26486.29Show/hide
Query:  MARKMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
        MAR+MELGFPKSASYSLREQAARTILRNVRSQGH+YVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
Subjt:  MARKMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK

Query:  PVERDNQIGISNDNHERLLEYHNNDNNLAIVRYDENSKGNGDRHVDCDGNIR---DMSFENLNDGGDSCPLVIPGVLIRDEISDIKVMELGYGQIAARFT
        PVE DNQ+   NDN ERLLEYHNNDNNLAIV Y ++SKGNG+ H + +GN+R   D SFENL+D GD+ PLVIPGVLI+DEISDI+V ELGYGQIAARFT
Subjt:  PVERDNQIGISNDNHERLLEYHNNDNNLAIVRYDENSKGNGDRHVDCDGNIR---DMSFENLNDGGDSCPLVIPGVLIRDEISDIKVMELGYGQIAARFT

Query:  EKDGILTGVSRIWCEWLGKVNVELEGKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAELENDENTRVKRKKSFSDPGVVSESLSHQYDSSGE
        EKDGIL G+SRIWCEWLGKVN  LE KVKVP HD+AIVTFTYNVDLGRKGLLDDVKLLLSSS GAE E DEN+RVKRKK FSD   VS+S+SHQYDSSGE
Subjt:  EKDGILTGVSRIWCEWLGKVNVELEGKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAELENDENTRVKRKKSFSDPGVVSESLSHQYDSSGE

Query:  DSSASNCVTSSLLLDRYDDQILSTTITLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY
        DSSASNCV SSLLLDRYDD+IL+TT+ LNK+V+REL+RQQRL +ERMCDICQQKILTHKDVATL+NMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY
Subjt:  DSSASNCVTSSLLLDRYDDQILSTTITLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY

Query:  EISVKDLGGKKARRRYRRKNRTKGNKCSKDSETRQIKTQIDSIFCPACQGTGITVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGFHF
        E+SVK+LGG K RRRYRRKN+TKGNK SK+ ETRQIKTQIDS+FCPACQGTGI VDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGFHF
Subjt:  EISVKDLGGKKARRRYRRKNRTKGNKCSKDSETRQIKTQIDSIFCPACQGTGITVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGFHF

Query:  PYQSEETIQENVKPLKLLHFYGAFV
        PYQSEET+QEN+K LKL+HFYGAFV
Subjt:  PYQSEETIQENVKPLKLLHFYGAFV

A0A6J1JEQ8 uncharacterized protein LOC111483838 isoform X11.2e-26586.67Show/hide
Query:  MARKMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
        MAR+MELGFPKSASYSLREQAARTILRNVRSQGH+YVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
Subjt:  MARKMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK

Query:  PVERDNQIGISNDNHERLLEYHNNDNNLAIVRYDENSKGNGDRHVDCDGNIR---DMSFENLNDGGDSCPLVIPGVLIRDEISDIKVMELGYGQIAARFT
        PVE DNQ+   NDNHERLLEYHNNDNNLAIV Y +NSKGNG+ H + +GN+R   D SFENL+DGGD+ PLVIPGVLI+DEISDIKVMELGYGQIAARFT
Subjt:  PVERDNQIGISNDNHERLLEYHNNDNNLAIVRYDENSKGNGDRHVDCDGNIR---DMSFENLNDGGDSCPLVIPGVLIRDEISDIKVMELGYGQIAARFT

Query:  EKDGILTGVSRIWCEWLGKVNVELEGKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAELENDENTRVKRKKSFSDPGVVSESLSHQYDSSGE
        EKDGI+ GVSRIWCEWLGKVNV LE KVKVP HD AIVTFTYNVDLGRKGLLDDVKLLLSSS GAE E D+N+RVKRKK FSD   VS+S+SHQYDSSGE
Subjt:  EKDGILTGVSRIWCEWLGKVNVELEGKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAELENDENTRVKRKKSFSDPGVVSESLSHQYDSSGE

Query:  DSSASNCVTSSLLLDRYDDQILSTTITLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY
        DSSASNCV SSLLLDRYDD+IL+TT+ LNK+V+REL++QQRL +ERMCDICQQKILTHKDVATL+NMKTGRLACSSRN NGVFHVFHTSCLIHWILLCEY
Subjt:  DSSASNCVTSSLLLDRYDDQILSTTITLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY

Query:  EISVKDLGGKKARRRYRRKNRTKGNKCSKDSETRQIKTQIDSIFCPACQGTGITVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGFHF
        E+SVK+LGG K RRRYRRKN+TKGNK SK+ ETRQIKTQIDS+FCPACQGTG+ VDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGFHF
Subjt:  EISVKDLGGKKARRRYRRKNRTKGNKCSKDSETRQIKTQIDSIFCPACQGTGITVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGFHF

Query:  PYQSEETIQENVKPLKLLHFYGAFV
        PYQSEET+QEN+K LKL+HFYGAFV
Subjt:  PYQSEETIQENVKPLKLLHFYGAFV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G28260.1 unknown protein6.4e-13950.19Show/hide
Query:  MARKMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
        MA K ELG PK  S +L+EQ ART L+N+R QGHTY+ELREDGKRF+FFCTLCLAPCYSD++L  HL G LH ERL+ A++TLLG NPWPF DGVLFF  
Subjt:  MARKMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK

Query:  PV-ERDNQIGIS-NDNHERLLEYHNNDNNLAIVRYDENSKGNGDRHVDCDGNIRDMSFENLNDGGDSCPLVIPGVLIRDEISDIKVMELGYGQIAARFTE
           E + +  +S  +     LE+ ++D   AIV+YD N+K NGD   +    + D    +  D      L+I GVLI++   D++   +G+G+IAAR  E
Subjt:  PV-ERDNQIGIS-NDNHERLLEYHNNDNNLAIVRYDENSKGNGDRHVDCDGNIRDMSFENLNDGGDSCPLVIPGVLIRDEISDIKVMELGYGQIAARFTE

Query:  KDGILTGVSRIWCEWLGKVNVELEGKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAELENDENTRVKRKKSFSDPGVVSESLSHQYDSSGED
          G  T + ++WCEWLG      E K  +P HD+AIVTF+Y  +LGR GLLDD   LL+SS  +E  N E++  KRKKSFSDP   SESL +QYDSS E 
Subjt:  KDGILTGVSRIWCEWLGKVNVELEGKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAELENDENTRVKRKKSFSDPGVVSESLSHQYDSSGED

Query:  SSASNCVTSSLLLDRYDDQILSTTITLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYE
        SS  N  +S  L+  YDD ++S  +  N+ VRRELRRQQR+ +ER+C++C+QK+L  KD A ++NMKTG LAC SRN+ G FH+FH SC++HW L CE E
Subjt:  SSASNCVTSSLLLDRYDDQILSTTITLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYE

Query:  I-SVKDLGGKKARRRYRRKNRT--KGNKCSKDSETRQIKTQIDSIFCPACQGTGITVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
        I   K + GK  +R  +   +T  K N+ + D     +  QI S+FCP CQGTGI ++G  +E+ T PLS+ +++++KVS+ R+AW+K+PE L+NCSTGF
Subjt:  I-SVKDLGGKKARRRYRRKNRT--KGNKCSKDSETRQIKTQIDSIFCPACQGTGITVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF

Query:  HFPYQSEETIQ-----ENVKPLKLLHFY
        HFP Q+EET Q     E V+ +KL+ FY
Subjt:  HFPYQSEETIQ-----ENVKPLKLLHFY

AT4G28260.2 unknown protein6.4e-13950.19Show/hide
Query:  MARKMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK
        MA K ELG PK  S +L+EQ ART L+N+R QGHTY+ELREDGKRF+FFCTLCLAPCYSD++L  HL G LH ERL+ A++TLLG NPWPF DGVLFF  
Subjt:  MARKMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHK

Query:  PV-ERDNQIGIS-NDNHERLLEYHNNDNNLAIVRYDENSKGNGDRHVDCDGNIRDMSFENLNDGGDSCPLVIPGVLIRDEISDIKVMELGYGQIAARFTE
           E + +  +S  +     LE+ ++D   AIV+YD N+K NGD   +    + D    +  D      L+I GVLI++   D++   +G+G+IAAR  E
Subjt:  PV-ERDNQIGIS-NDNHERLLEYHNNDNNLAIVRYDENSKGNGDRHVDCDGNIRDMSFENLNDGGDSCPLVIPGVLIRDEISDIKVMELGYGQIAARFTE

Query:  KDGILTGVSRIWCEWLGKVNVELEGKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAELENDENTRVKRKKSFSDPGVVSESLSHQYDSSGED
          G  T + ++WCEWLG      E K  +P HD+AIVTF+Y  +LGR GLLDD   LL+SS  +E  N E++  KRKKSFSDP   SESL +QYDSS E 
Subjt:  KDGILTGVSRIWCEWLGKVNVELEGKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAELENDENTRVKRKKSFSDPGVVSESLSHQYDSSGED

Query:  SSASNCVTSSLLLDRYDDQILSTTITLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYE
        SS  N  +S  L+  YDD ++S  +  N+ VRRELRRQQR+ +ER+C++C+QK+L  KD A ++NMKTG LAC SRN+ G FH+FH SC++HW L CE E
Subjt:  SSASNCVTSSLLLDRYDDQILSTTITLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYE

Query:  I-SVKDLGGKKARRRYRRKNRT--KGNKCSKDSETRQIKTQIDSIFCPACQGTGITVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
        I   K + GK  +R  +   +T  K N+ + D     +  QI S+FCP CQGTGI ++G  +E+ T PLS+ +++++KVS+ R+AW+K+PE L+NCSTGF
Subjt:  I-SVKDLGGKKARRRYRRKNRT--KGNKCSKDSETRQIKTQIDSIFCPACQGTGITVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF

Query:  HFPYQSEETIQ-----ENVKPLKLLHFY
        HFP Q+EET Q     E V+ +KL+ FY
Subjt:  HFPYQSEETIQ-----ENVKPLKLLHFY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAGAAAGATGGAATTGGGATTCCCAAAATCTGCTTCGTACAGTCTTCGGGAACAAGCTGCTAGAACAATTCTACGCAATGTGAGGTCACAAGGGCATACATACGT
CGAGCTACGAGAAGATGGAAAAAGGTTCATTTTCTTTTGCACTTTGTGTCTTGCACCATGTTATAGTGATTCAGTGCTCTTTAACCACCTGAAGGGTACTCTTCATACTG
AAAGATTATCTGCTGCTAAGCTGACTCTTTTAGGACCAAATCCTTGGCCTTTTGATGATGGTGTTCTTTTCTTCCATAAGCCAGTTGAACGAGATAATCAGATTGGGATT
TCAAATGACAATCATGAAAGGTTGTTGGAGTATCACAACAATGACAACAATCTTGCTATTGTCAGGTATGATGAAAATTCTAAAGGCAATGGCGACAGACATGTTGACTG
TGATGGAAATATAAGGGATATGTCCTTCGAGAATTTGAATGATGGTGGAGACAGTTGTCCTTTGGTGATTCCTGGTGTACTGATTAGGGATGAAATTTCTGATATTAAAG
TGATGGAGTTGGGCTATGGACAAATTGCAGCTAGGTTTACTGAGAAGGACGGGATCTTAACTGGAGTTAGCAGAATATGGTGCGAGTGGCTGGGTAAAGTAAATGTTGAG
CTTGAGGGTAAGGTCAAAGTTCCTGGACATGATTATGCTATTGTTACTTTCACTTATAATGTAGATTTAGGTAGAAAGGGCTTGCTTGATGATGTGAAGTTATTGCTCTC
ATCTAGTCCTGGAGCTGAACTAGAAAATGATGAGAACACTAGAGTGAAAAGAAAGAAATCGTTCTCAGATCCTGGGGTTGTTAGTGAGTCTTTGAGCCATCAATATGATT
CGTCGGGTGAAGATTCTTCAGCTTCAAATTGTGTCACTTCATCACTGTTGTTGGATAGATATGATGATCAAATTCTGAGTACAACAATCACGTTGAATAAGGCAGTAAGG
CGTGAACTGAGAAGGCAACAGCGTTTAGTTGCAGAGCGGATGTGTGATATCTGTCAACAAAAGATACTCACCCATAAAGATGTAGCAACACTCGTGAACATGAAAACTGG
AAGACTTGCCTGCAGTAGTCGAAATGTTAATGGGGTGTTTCATGTATTTCATACATCGTGCCTCATACATTGGATACTTCTATGTGAGTATGAGATAAGTGTGAAGGATC
TAGGTGGTAAAAAAGCTAGACGAAGGTACAGGAGAAAGAACAGGACTAAGGGCAATAAATGCAGCAAGGACAGTGAAACGAGACAAATAAAAACTCAAATTGATTCTATA
TTCTGCCCAGCATGTCAGGGTACCGGTATAACTGTTGATGGAGATGACTTAGAGAAACCAACAATTCCTCTTTCGGAGATATTCAAATATAAAATAAAGGTGAGTGATGC
CCGAAGAGCGTGGATGAAAAGTCCTGAGGTTCTGCAGAATTGTTCGACAGGTTTCCATTTCCCTTATCAATCTGAAGAAACTATACAGGAAAATGTGAAGCCTCTCAAGC
TGCTGCATTTTTATGGAGCTTTTGTA
mRNA sequenceShow/hide mRNA sequence
ATGGCAAGAAAGATGGAATTGGGATTCCCAAAATCTGCTTCGTACAGTCTTCGGGAACAAGCTGCTAGAACAATTCTACGCAATGTGAGGTCACAAGGGCATACATACGT
CGAGCTACGAGAAGATGGAAAAAGGTTCATTTTCTTTTGCACTTTGTGTCTTGCACCATGTTATAGTGATTCAGTGCTCTTTAACCACCTGAAGGGTACTCTTCATACTG
AAAGATTATCTGCTGCTAAGCTGACTCTTTTAGGACCAAATCCTTGGCCTTTTGATGATGGTGTTCTTTTCTTCCATAAGCCAGTTGAACGAGATAATCAGATTGGGATT
TCAAATGACAATCATGAAAGGTTGTTGGAGTATCACAACAATGACAACAATCTTGCTATTGTCAGGTATGATGAAAATTCTAAAGGCAATGGCGACAGACATGTTGACTG
TGATGGAAATATAAGGGATATGTCCTTCGAGAATTTGAATGATGGTGGAGACAGTTGTCCTTTGGTGATTCCTGGTGTACTGATTAGGGATGAAATTTCTGATATTAAAG
TGATGGAGTTGGGCTATGGACAAATTGCAGCTAGGTTTACTGAGAAGGACGGGATCTTAACTGGAGTTAGCAGAATATGGTGCGAGTGGCTGGGTAAAGTAAATGTTGAG
CTTGAGGGTAAGGTCAAAGTTCCTGGACATGATTATGCTATTGTTACTTTCACTTATAATGTAGATTTAGGTAGAAAGGGCTTGCTTGATGATGTGAAGTTATTGCTCTC
ATCTAGTCCTGGAGCTGAACTAGAAAATGATGAGAACACTAGAGTGAAAAGAAAGAAATCGTTCTCAGATCCTGGGGTTGTTAGTGAGTCTTTGAGCCATCAATATGATT
CGTCGGGTGAAGATTCTTCAGCTTCAAATTGTGTCACTTCATCACTGTTGTTGGATAGATATGATGATCAAATTCTGAGTACAACAATCACGTTGAATAAGGCAGTAAGG
CGTGAACTGAGAAGGCAACAGCGTTTAGTTGCAGAGCGGATGTGTGATATCTGTCAACAAAAGATACTCACCCATAAAGATGTAGCAACACTCGTGAACATGAAAACTGG
AAGACTTGCCTGCAGTAGTCGAAATGTTAATGGGGTGTTTCATGTATTTCATACATCGTGCCTCATACATTGGATACTTCTATGTGAGTATGAGATAAGTGTGAAGGATC
TAGGTGGTAAAAAAGCTAGACGAAGGTACAGGAGAAAGAACAGGACTAAGGGCAATAAATGCAGCAAGGACAGTGAAACGAGACAAATAAAAACTCAAATTGATTCTATA
TTCTGCCCAGCATGTCAGGGTACCGGTATAACTGTTGATGGAGATGACTTAGAGAAACCAACAATTCCTCTTTCGGAGATATTCAAATATAAAATAAAGGTGAGTGATGC
CCGAAGAGCGTGGATGAAAAGTCCTGAGGTTCTGCAGAATTGTTCGACAGGTTTCCATTTCCCTTATCAATCTGAAGAAACTATACAGGAAAATGTGAAGCCTCTCAAGC
TGCTGCATTTTTATGGAGCTTTTGTA
Protein sequenceShow/hide protein sequence
MARKMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPVERDNQIGI
SNDNHERLLEYHNNDNNLAIVRYDENSKGNGDRHVDCDGNIRDMSFENLNDGGDSCPLVIPGVLIRDEISDIKVMELGYGQIAARFTEKDGILTGVSRIWCEWLGKVNVE
LEGKVKVPGHDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAELENDENTRVKRKKSFSDPGVVSESLSHQYDSSGEDSSASNCVTSSLLLDRYDDQILSTTITLNKAVR
RELRRQQRLVAERMCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEISVKDLGGKKARRRYRRKNRTKGNKCSKDSETRQIKTQIDSI
FCPACQGTGITVDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGFHFPYQSEETIQENVKPLKLLHFYGAFV