; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0017239 (gene) of Snake gourd v1 genome

Gene IDTan0017239
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionC2H2-type domain-containing protein
Genome locationLG05:5671723..5675801
RNA-Seq ExpressionTan0017239
SyntenyTan0017239
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsIPR013087 - Zinc finger C2H2-type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7010370.1 hypothetical protein SDJN02_27163 [Cucurbita argyrosperma subsp. argyrosperma]3.3e-27088.8Show/hide
Query:  MTRRMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFNK
        M RRMELGFPKSASYSLREQAARTILRNVRSQGH+YVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFF+K
Subjt:  MTRRMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFNK

Query:  PVEGDNSIGISNDNHERLLEYHNNDNNLAIVSYVDNSKANVNGHGEFNGNIRHVDVCSFENLNDGGDSCPLVIPGVLIKDEISDIKVRELGYGQIAARFT
        PVEGDN +   NDN ERLLEYHNNDNNLAIVSYVD+SK N NGHGEFNGN+R+V+ CSFENL+DGGD+ PLVIPGVLIKDEISDI+VRELGYGQIAARFT
Subjt:  PVEGDNSIGISNDNHERLLEYHNNDNNLAIVSYVDNSKANVNGHGEFNGNIRHVDVCSFENLNDGGDSCPLVIPGVLIKDEISDIKVRELGYGQIAARFT

Query:  EKDGILSGVTRIWCEWLGKVNAGHENKVKVPGQDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDETSRVKRKKCFSDPENVSESLSHQYDSSGE
        EKDGI+ G++RIWCEWLGKVNAG ENKVKVP  DYAIVTFTYNVDLGRKGLLDDVKLLLSSS GAE E DE SRVKRKKCFSD E+VS+S+SHQYDSSGE
Subjt:  EKDGILSGVTRIWCEWLGKVNAGHENKVKVPGQDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDETSRVKRKKCFSDPENVSESLSHQYDSSGE

Query:  DSSASNFATSSLLLDKYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLLNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY
        DSSASN   SSLLLD+YDDRIL+TTVMLNK+V+REL+RQQRL +ERMCDICQQKILTHKDVATLLNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY
Subjt:  DSSASNFATSSLLLDKYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLLNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY

Query:  EMSVKDLVGSKVRRRYRRKNKAKNKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVERDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
        EMSVK+L GSKVRRRYRRKN  K KGNK SK+ ETRQIKTQIDSVFCPACQGTGIIV+ DDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
Subjt:  EMSVKDLVGSKVRRRYRRKNKAKNKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVERDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF

Query:  HFPYQSEETLQENVKLLKLLHFYGAFV
        HFP QSEETLQEN+K LKL+HFYGAFV
Subjt:  HFPYQSEETLQENVKLLKLLHFYGAFV

XP_022140423.1 uncharacterized protein LOC111011105 isoform X1 [Momordica charantia]1.8e-26889.18Show/hide
Query:  MTRRMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFNK
        M R+MELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFF+K
Subjt:  MTRRMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFNK

Query:  PVEGDNSIGISNDNHERLLEYHNNDNNLAIVSYVDNSKANVNGHGEFNGNIRHVDVCSFENLNDGGDSCPLVIPGVLIKDEISDIKVRELGYGQIAARFT
        PVE DN IGISNDNHERLLEYHNNDNNLAIV Y +NSK N + H + +GNIR +   SFENLNDGGDSCPLVIPGVLI+DEISDIKV ELGYGQIAARFT
Subjt:  PVEGDNSIGISNDNHERLLEYHNNDNNLAIVSYVDNSKANVNGHGEFNGNIRHVDVCSFENLNDGGDSCPLVIPGVLIKDEISDIKVRELGYGQIAARFT

Query:  EKDGILSGVTRIWCEWLGKVNAGHENKVKVPGQDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDETSRVKRKKCFSDPENVSESLSHQYDSSGE
        EKDGIL+GV RIWCEWLGKVN   E KVKVPG DYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAE ENDE +RVKRKK FSDP  VSESLSHQYDSSGE
Subjt:  EKDGILSGVTRIWCEWLGKVNAGHENKVKVPGQDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDETSRVKRKKCFSDPENVSESLSHQYDSSGE

Query:  DSSASNFATSSLLLDKYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLLNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY
        DSSASN  TSSLLLD+YDD+ILSTT+ LNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATL+NMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY
Subjt:  DSSASNFATSSLLLDKYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLLNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY

Query:  EMSVKDLVGSKVRRRYRRKNKAKNKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVERDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
        E+SVKDL G K RRRYRRKN  + KGNKCSKDSETRQIKTQIDS+FCPACQGTGI V+ DDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
Subjt:  EMSVKDLVGSKVRRRYRRKNKAKNKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVERDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF

Query:  HFPYQSEETLQENVKLLKLLHFYGAFV
        HFPYQSEET+QENVK LKLLHFYGAFV
Subjt:  HFPYQSEETLQENVKLLKLLHFYGAFV

XP_022943372.1 uncharacterized protein LOC111448154 isoform X1 [Cucurbita moschata]1.9e-27088.8Show/hide
Query:  MTRRMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFNK
        M RRMELGFPKSASYSLREQAARTILRNVRSQGH+YVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFF+K
Subjt:  MTRRMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFNK

Query:  PVEGDNSIGISNDNHERLLEYHNNDNNLAIVSYVDNSKANVNGHGEFNGNIRHVDVCSFENLNDGGDSCPLVIPGVLIKDEISDIKVRELGYGQIAARFT
        PVEGDN +   NDN ERLLEYHNNDNNLAIVSYVD+SK N NGHGEFNGN+R+V+ CSFENL+D GD+ PLVIPGVLIKDEISDI+VRELGYGQIAARFT
Subjt:  PVEGDNSIGISNDNHERLLEYHNNDNNLAIVSYVDNSKANVNGHGEFNGNIRHVDVCSFENLNDGGDSCPLVIPGVLIKDEISDIKVRELGYGQIAARFT

Query:  EKDGILSGVTRIWCEWLGKVNAGHENKVKVPGQDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDETSRVKRKKCFSDPENVSESLSHQYDSSGE
        EKDGIL G++RIWCEWLGKVNAG ENKVKVP  D+AIVTFTYNVDLGRKGLLDDVKLLLSSS GAE E DE SRVKRKKCFSD E+VS+S+SHQYDSSGE
Subjt:  EKDGILSGVTRIWCEWLGKVNAGHENKVKVPGQDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDETSRVKRKKCFSDPENVSESLSHQYDSSGE

Query:  DSSASNFATSSLLLDKYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLLNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY
        DSSASN   SSLLLD+YDDRIL+TTVMLNK+V+REL+RQQRL +ERMCDICQQKILTHKDVATLLNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY
Subjt:  DSSASNFATSSLLLDKYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLLNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY

Query:  EMSVKDLVGSKVRRRYRRKNKAKNKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVERDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
        EMSVK+L GSKVRRRYRRKN  K KGNK SK+ ETRQIKTQIDSVFCPACQGTGIIV+ DDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
Subjt:  EMSVKDLVGSKVRRRYRRKNKAKNKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVERDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF

Query:  HFPYQSEETLQENVKLLKLLHFYGAFV
        HFPYQSEETLQEN+K LKL+HFYGAFV
Subjt:  HFPYQSEETLQENVKLLKLLHFYGAFV

XP_022985949.1 uncharacterized protein LOC111483838 isoform X1 [Cucurbita maxima]8.2e-26988.24Show/hide
Query:  MTRRMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFNK
        M RRMELGFPKSASYSLREQAARTILRNVRSQGH+YVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFF+K
Subjt:  MTRRMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFNK

Query:  PVEGDNSIGISNDNHERLLEYHNNDNNLAIVSYVDNSKANVNGHGEFNGNIRHVDVCSFENLNDGGDSCPLVIPGVLIKDEISDIKVRELGYGQIAARFT
        PVEGDN +   NDNHERLLEYHNNDNNLAIVSYVDNSK N NGH EFNGN+R+V+ CSFENL+DGGD+ PLVIPGVLIKDEISDIKV ELGYGQIAARFT
Subjt:  PVEGDNSIGISNDNHERLLEYHNNDNNLAIVSYVDNSKANVNGHGEFNGNIRHVDVCSFENLNDGGDSCPLVIPGVLIKDEISDIKVRELGYGQIAARFT

Query:  EKDGILSGVTRIWCEWLGKVNAGHENKVKVPGQDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDETSRVKRKKCFSDPENVSESLSHQYDSSGE
        EKDGI+ GV+RIWCEWLGKVN G ENKVKVP  D AIVTFTYNVDLGRKGLLDDVKLLLSSS GAE E D+ SRVKRKKCFSD E+VS+S+SHQYDSSGE
Subjt:  EKDGILSGVTRIWCEWLGKVNAGHENKVKVPGQDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDETSRVKRKKCFSDPENVSESLSHQYDSSGE

Query:  DSSASNFATSSLLLDKYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLLNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY
        DSSASN   SSLLLD+YDDRIL+TTVMLNK+V+REL++QQRL +ERMCDICQQKILTHKDVATLLNMKTGRLACSSRN NGVFHVFHTSCLIHWILLCEY
Subjt:  DSSASNFATSSLLLDKYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLLNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY

Query:  EMSVKDLVGSKVRRRYRRKNKAKNKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVERDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
        EMSVK+L GSKVRRRYRRKN  K KGNK SK+ ETRQIKTQIDSVFCPACQGTG+IV+ DDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
Subjt:  EMSVKDLVGSKVRRRYRRKNKAKNKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVERDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF

Query:  HFPYQSEETLQENVKLLKLLHFYGAFV
        HFPYQSEETLQEN+K LKL+HFYGAFV
Subjt:  HFPYQSEETLQENVKLLKLLHFYGAFV

XP_023512516.1 uncharacterized protein LOC111777240 [Cucurbita pepo subsp. pepo]1.3e-26988.61Show/hide
Query:  MTRRMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFNK
        M RRMELGFPKSASYSLREQAARTILRNVRSQGH+YVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFF+K
Subjt:  MTRRMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFNK

Query:  PVEGDNSIGISNDNHERLLEYHNNDNNLAIVSYVDNSKANVNGHGEFNGNIRHVDVCSFENLNDGGDSCPLVIPGVLIKDEISDIKVRELGYGQIAARFT
        PVEGDN +   NDN ERLLEYHNNDNNLAIVSYVDNSK N NGHGEFNGN+R+V+ CSFENL+DGGD+ PLVIPGVLIKDEISDIKVRELGYG+IAARFT
Subjt:  PVEGDNSIGISNDNHERLLEYHNNDNNLAIVSYVDNSKANVNGHGEFNGNIRHVDVCSFENLNDGGDSCPLVIPGVLIKDEISDIKVRELGYGQIAARFT

Query:  EKDGILSGVTRIWCEWLGKVNAGHENKVKVPGQDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDETSRVKRKKCFSDPENVSESLSHQYDSSGE
        EKDGI+ GV+RIWCEWLGKVN G ENKVKVP  D AIVTFTYNVDLGRKGLLDDVKLLLSSS GAE END+ SRVKRKKCFSD E+VS+S+SHQYDSSGE
Subjt:  EKDGILSGVTRIWCEWLGKVNAGHENKVKVPGQDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDETSRVKRKKCFSDPENVSESLSHQYDSSGE

Query:  DSSASNFATSSLLLDKYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLLNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY
        DSSASN   SSLLLD+YDDRIL+TTVMLNK+V+REL+RQQRL +ERMCDICQQKILTHKDVATLLN+KTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY
Subjt:  DSSASNFATSSLLLDKYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLLNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY

Query:  EMSVKDLVGSKVRRRYRRKNKAKNKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVERDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
        EMSVK+L GSKVRRRYRRKN  K KGNK SK+ ETRQIKTQIDSVFCPACQGTGIIV+ D+LEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
Subjt:  EMSVKDLVGSKVRRRYRRKNKAKNKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVERDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF

Query:  HFPYQSEETLQENVKLLKLLHFYGAFV
        HFPYQSEETLQEN+K LKL+HFYGAFV
Subjt:  HFPYQSEETLQENVKLLKLLHFYGAFV

TrEMBL top hitse value%identityAlignment
A0A0A0KE98 C2H2-type domain-containing protein3.8e-26487.1Show/hide
Query:  MTRRMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFNK
        M RRMELGFPKSASYSLREQAARTILRNVRSQGHTYVELRE+GK+FIFFCTLCLAPCYSDSVLF+HLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFF+K
Subjt:  MTRRMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFNK

Query:  PVEGDNSIGISNDNHERLLEYHNNDNNLAIVSYVDNSKANVNGHGEFNGNIRHVDVCSFENLNDGGDSCPLVIPGVLIKDEISDIKVRELGYGQIAARFT
        P+EGDN +GISNDNHERLLEY+NNDNNLAIV YV NSK N N   EFNGN+R+V+ CSFENLNDGG+SCPLVIPGVLIK+EISDIKVRELGYGQIAARFT
Subjt:  PVEGDNSIGISNDNHERLLEYHNNDNNLAIVSYVDNSKANVNGHGEFNGNIRHVDVCSFENLNDGGDSCPLVIPGVLIKDEISDIKVRELGYGQIAARFT

Query:  EKDGILSGVTRIWCEWLGKVNAGHENKVKVPGQDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDETSRVKRKKCFSDPENVSESLSHQYDSSGE
        EKDGI SGV+RIWCEWLGKVN G EN VKVP  +YAI+TFTYNVDLGRKGLLDDVKLLLSSSPGAE +NDE  +VKRKK FSDPE+ S S+S QYDSSGE
Subjt:  EKDGILSGVTRIWCEWLGKVNAGHENKVKVPGQDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDETSRVKRKKCFSDPENVSESLSHQYDSSGE

Query:  DSSASNFATSSLLLDKYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLLNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY
        DSSASN   SSL LD YDD+ILSTTVMLNKAVRRELRRQQRL AERMCDICQQKILTHKDVATLLNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY
Subjt:  DSSASNFATSSLLLDKYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLLNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY

Query:  EMSVKDLVGSKVRRRYRRKNKAKNKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVERDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
        E+SVKDL GSKVRRRYRRK   K KGNK  KD ETRQIKTQIDSVFCPACQGTGI ++ DDLEKPT+PLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
Subjt:  EMSVKDLVGSKVRRRYRRKNKAKNKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVERDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF

Query:  HFPYQSEETLQENVKLLKLLHFYGAFV
         FPYQ +ET+QENVK LKLLHFYGAFV
Subjt:  HFPYQSEETLQENVKLLKLLHFYGAFV

A0A1S3BJC6 uncharacterized protein LOC1034905231.5e-26386.91Show/hide
Query:  MTRRMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFNK
        M RRMELGFPKSASYSLREQAARTILRNVRSQGHTYVELRE+GK+FIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFF+K
Subjt:  MTRRMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFNK

Query:  PVEGDNSIGISNDNHERLLEYHNNDNNLAIVSYVDNSKANVNGHGEFNGNIRHVDVCSFENLNDGGDSCPLVIPGVLIKDEISDIKVRELGYGQIAARFT
        PVEGD+ +G+SNDNHERLLEY+NNDNNLAIV YV NSK N NG  EFNGN+R+V+ CSFENLNDGG+S PLVIPGVLIK+EISDIKVR LGYGQIAARFT
Subjt:  PVEGDNSIGISNDNHERLLEYHNNDNNLAIVSYVDNSKANVNGHGEFNGNIRHVDVCSFENLNDGGDSCPLVIPGVLIKDEISDIKVRELGYGQIAARFT

Query:  EKDGILSGVTRIWCEWLGKVNAGHENKVKVPGQDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDETSRVKRKKCFSDPENVSESLSHQYDSSGE
        EKDGI SGV+RIWCEWLGKVN G ENKVKVP  +YAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAE +NDE  +VKRKK FSDPE+ S S+S QYDSSGE
Subjt:  EKDGILSGVTRIWCEWLGKVNAGHENKVKVPGQDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDETSRVKRKKCFSDPENVSESLSHQYDSSGE

Query:  DSSASNFATSSLLLDKYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLLNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY
        DSSASN   SSL LD YDD+ILSTTVMLNKAVRRELRRQ RL AERMCDICQQKILTHKDVATLLNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY
Subjt:  DSSASNFATSSLLLDKYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLLNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY

Query:  EMSVKDLVGSKVRRRYRRKNKAKNKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVERDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
        E+SVKDL GSKVRRRYRRK   K KGNK SKD ETRQ+K+QID VFCPACQGTG+I++ DDLEKPT+PLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
Subjt:  EMSVKDLVGSKVRRRYRRKNKAKNKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVERDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF

Query:  HFPYQSEETLQENVKLLKLLHFYGAFV
        HFPYQ +ET+QENVK LKLLHFYGAFV
Subjt:  HFPYQSEETLQENVKLLKLLHFYGAFV

A0A6J1CFP1 uncharacterized protein LOC111011105 isoform X18.8e-26989.18Show/hide
Query:  MTRRMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFNK
        M R+MELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFF+K
Subjt:  MTRRMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFNK

Query:  PVEGDNSIGISNDNHERLLEYHNNDNNLAIVSYVDNSKANVNGHGEFNGNIRHVDVCSFENLNDGGDSCPLVIPGVLIKDEISDIKVRELGYGQIAARFT
        PVE DN IGISNDNHERLLEYHNNDNNLAIV Y +NSK N + H + +GNIR +   SFENLNDGGDSCPLVIPGVLI+DEISDIKV ELGYGQIAARFT
Subjt:  PVEGDNSIGISNDNHERLLEYHNNDNNLAIVSYVDNSKANVNGHGEFNGNIRHVDVCSFENLNDGGDSCPLVIPGVLIKDEISDIKVRELGYGQIAARFT

Query:  EKDGILSGVTRIWCEWLGKVNAGHENKVKVPGQDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDETSRVKRKKCFSDPENVSESLSHQYDSSGE
        EKDGIL+GV RIWCEWLGKVN   E KVKVPG DYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAE ENDE +RVKRKK FSDP  VSESLSHQYDSSGE
Subjt:  EKDGILSGVTRIWCEWLGKVNAGHENKVKVPGQDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDETSRVKRKKCFSDPENVSESLSHQYDSSGE

Query:  DSSASNFATSSLLLDKYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLLNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY
        DSSASN  TSSLLLD+YDD+ILSTT+ LNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATL+NMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY
Subjt:  DSSASNFATSSLLLDKYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLLNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY

Query:  EMSVKDLVGSKVRRRYRRKNKAKNKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVERDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
        E+SVKDL G K RRRYRRKN  + KGNKCSKDSETRQIKTQIDS+FCPACQGTGI V+ DDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
Subjt:  EMSVKDLVGSKVRRRYRRKNKAKNKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVERDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF

Query:  HFPYQSEETLQENVKLLKLLHFYGAFV
        HFPYQSEET+QENVK LKLLHFYGAFV
Subjt:  HFPYQSEETLQENVKLLKLLHFYGAFV

A0A6J1FSV1 uncharacterized protein LOC111448154 isoform X19.4e-27188.8Show/hide
Query:  MTRRMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFNK
        M RRMELGFPKSASYSLREQAARTILRNVRSQGH+YVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFF+K
Subjt:  MTRRMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFNK

Query:  PVEGDNSIGISNDNHERLLEYHNNDNNLAIVSYVDNSKANVNGHGEFNGNIRHVDVCSFENLNDGGDSCPLVIPGVLIKDEISDIKVRELGYGQIAARFT
        PVEGDN +   NDN ERLLEYHNNDNNLAIVSYVD+SK N NGHGEFNGN+R+V+ CSFENL+D GD+ PLVIPGVLIKDEISDI+VRELGYGQIAARFT
Subjt:  PVEGDNSIGISNDNHERLLEYHNNDNNLAIVSYVDNSKANVNGHGEFNGNIRHVDVCSFENLNDGGDSCPLVIPGVLIKDEISDIKVRELGYGQIAARFT

Query:  EKDGILSGVTRIWCEWLGKVNAGHENKVKVPGQDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDETSRVKRKKCFSDPENVSESLSHQYDSSGE
        EKDGIL G++RIWCEWLGKVNAG ENKVKVP  D+AIVTFTYNVDLGRKGLLDDVKLLLSSS GAE E DE SRVKRKKCFSD E+VS+S+SHQYDSSGE
Subjt:  EKDGILSGVTRIWCEWLGKVNAGHENKVKVPGQDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDETSRVKRKKCFSDPENVSESLSHQYDSSGE

Query:  DSSASNFATSSLLLDKYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLLNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY
        DSSASN   SSLLLD+YDDRIL+TTVMLNK+V+REL+RQQRL +ERMCDICQQKILTHKDVATLLNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY
Subjt:  DSSASNFATSSLLLDKYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLLNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY

Query:  EMSVKDLVGSKVRRRYRRKNKAKNKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVERDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
        EMSVK+L GSKVRRRYRRKN  K KGNK SK+ ETRQIKTQIDSVFCPACQGTGIIV+ DDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
Subjt:  EMSVKDLVGSKVRRRYRRKNKAKNKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVERDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF

Query:  HFPYQSEETLQENVKLLKLLHFYGAFV
        HFPYQSEETLQEN+K LKL+HFYGAFV
Subjt:  HFPYQSEETLQENVKLLKLLHFYGAFV

A0A6J1JEQ8 uncharacterized protein LOC111483838 isoform X13.9e-26988.24Show/hide
Query:  MTRRMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFNK
        M RRMELGFPKSASYSLREQAARTILRNVRSQGH+YVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFF+K
Subjt:  MTRRMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFNK

Query:  PVEGDNSIGISNDNHERLLEYHNNDNNLAIVSYVDNSKANVNGHGEFNGNIRHVDVCSFENLNDGGDSCPLVIPGVLIKDEISDIKVRELGYGQIAARFT
        PVEGDN +   NDNHERLLEYHNNDNNLAIVSYVDNSK N NGH EFNGN+R+V+ CSFENL+DGGD+ PLVIPGVLIKDEISDIKV ELGYGQIAARFT
Subjt:  PVEGDNSIGISNDNHERLLEYHNNDNNLAIVSYVDNSKANVNGHGEFNGNIRHVDVCSFENLNDGGDSCPLVIPGVLIKDEISDIKVRELGYGQIAARFT

Query:  EKDGILSGVTRIWCEWLGKVNAGHENKVKVPGQDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDETSRVKRKKCFSDPENVSESLSHQYDSSGE
        EKDGI+ GV+RIWCEWLGKVN G ENKVKVP  D AIVTFTYNVDLGRKGLLDDVKLLLSSS GAE E D+ SRVKRKKCFSD E+VS+S+SHQYDSSGE
Subjt:  EKDGILSGVTRIWCEWLGKVNAGHENKVKVPGQDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDETSRVKRKKCFSDPENVSESLSHQYDSSGE

Query:  DSSASNFATSSLLLDKYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLLNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY
        DSSASN   SSLLLD+YDDRIL+TTVMLNK+V+REL++QQRL +ERMCDICQQKILTHKDVATLLNMKTGRLACSSRN NGVFHVFHTSCLIHWILLCEY
Subjt:  DSSASNFATSSLLLDKYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLLNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEY

Query:  EMSVKDLVGSKVRRRYRRKNKAKNKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVERDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
        EMSVK+L GSKVRRRYRRKN  K KGNK SK+ ETRQIKTQIDSVFCPACQGTG+IV+ DDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF
Subjt:  EMSVKDLVGSKVRRRYRRKNKAKNKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVERDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGF

Query:  HFPYQSEETLQENVKLLKLLHFYGAFV
        HFPYQSEETLQEN+K LKL+HFYGAFV
Subjt:  HFPYQSEETLQENVKLLKLLHFYGAFV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G28260.1 unknown protein3.5e-13749.53Show/hide
Query:  MTRRMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFNK
        M  + ELG PK  S +L+EQ ART L+N+R QGHTY+ELREDGKRF+FFCTLCLAPCYSD++L  HL G LH ERL+ A++TLLG NPWPF DGVLFF+ 
Subjt:  MTRRMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFNK

Query:  PV--EGDNSIGISNDNHERLLEYHNNDNNLAIVSYVDNSKANVNGHGEFNGNIRHVDVCSFENLNDGGDSCPLVIPGVLIKDEISDIKVRELGYGQIAAR
            E + S     +     LE+ ++D   AIV Y DN+K         NG+     V   E  +   D   L+I GVLIK+   D++ + +G+G+IAAR
Subjt:  PV--EGDNSIGISNDNHERLLEYHNNDNNLAIVSYVDNSKANVNGHGEFNGNIRHVDVCSFENLNDGGDSCPLVIPGVLIKDEISDIKVRELGYGQIAAR

Query:  FTEKDGILSGVTRIWCEWLGKVNAGHENKVKVPGQDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDETSRVKRKKCFSDPENVSESLSHQYDSS
          E  G  + + ++WCEWLG      E K  +P  D+AIVTF+Y  +LGR GLLDD   LL+SS  +E  N E S  KRKK FSDPE+ SESL +QYDSS
Subjt:  FTEKDGILSGVTRIWCEWLGKVNAGHENKVKVPGQDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDETSRVKRKKCFSDPENVSESLSHQYDSS

Query:  GEDSSASNFATSSLLLDKYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLLNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLC
         E SS  N  +S  L+  YDD ++S  V+ N+ VRRELRRQQR+ +ER+C++C+QK+L  KD A +LNMKTG LAC SRN+ G FH+FH SC++HW L C
Subjt:  GEDSSASNFATSSLLLDKYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLLNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLC

Query:  EYEMSVKDLVGSKVRRR-YRRKNKAKNKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVERDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCS
        E E+    +V  K ++R  +   +   K N+ + D     +  QI SVFCP CQGTGI +E   +E+ T PLS+ +++++KVS+ R+AW+K+PE L+NCS
Subjt:  EYEMSVKDLVGSKVRRR-YRRKNKAKNKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVERDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCS

Query:  TGFHFPYQSEETLQ-----ENVKLLKLLHFY
        TGFHFP Q+EET Q     E V+++KL+ FY
Subjt:  TGFHFPYQSEETLQ-----ENVKLLKLLHFY

AT4G28260.2 unknown protein3.5e-13749.53Show/hide
Query:  MTRRMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFNK
        M  + ELG PK  S +L+EQ ART L+N+R QGHTY+ELREDGKRF+FFCTLCLAPCYSD++L  HL G LH ERL+ A++TLLG NPWPF DGVLFF+ 
Subjt:  MTRRMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFNK

Query:  PV--EGDNSIGISNDNHERLLEYHNNDNNLAIVSYVDNSKANVNGHGEFNGNIRHVDVCSFENLNDGGDSCPLVIPGVLIKDEISDIKVRELGYGQIAAR
            E + S     +     LE+ ++D   AIV Y DN+K         NG+     V   E  +   D   L+I GVLIK+   D++ + +G+G+IAAR
Subjt:  PV--EGDNSIGISNDNHERLLEYHNNDNNLAIVSYVDNSKANVNGHGEFNGNIRHVDVCSFENLNDGGDSCPLVIPGVLIKDEISDIKVRELGYGQIAAR

Query:  FTEKDGILSGVTRIWCEWLGKVNAGHENKVKVPGQDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDETSRVKRKKCFSDPENVSESLSHQYDSS
          E  G  + + ++WCEWLG      E K  +P  D+AIVTF+Y  +LGR GLLDD   LL+SS  +E  N E S  KRKK FSDPE+ SESL +QYDSS
Subjt:  FTEKDGILSGVTRIWCEWLGKVNAGHENKVKVPGQDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDETSRVKRKKCFSDPENVSESLSHQYDSS

Query:  GEDSSASNFATSSLLLDKYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLLNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLC
         E SS  N  +S  L+  YDD ++S  V+ N+ VRRELRRQQR+ +ER+C++C+QK+L  KD A +LNMKTG LAC SRN+ G FH+FH SC++HW L C
Subjt:  GEDSSASNFATSSLLLDKYDDRILSTTVMLNKAVRRELRRQQRLVAERMCDICQQKILTHKDVATLLNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLC

Query:  EYEMSVKDLVGSKVRRR-YRRKNKAKNKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVERDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCS
        E E+    +V  K ++R  +   +   K N+ + D     +  QI SVFCP CQGTGI +E   +E+ T PLS+ +++++KVS+ R+AW+K+PE L+NCS
Subjt:  EYEMSVKDLVGSKVRRR-YRRKNKAKNKGNKCSKDSETRQIKTQIDSVFCPACQGTGIIVERDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCS

Query:  TGFHFPYQSEETLQ-----ENVKLLKLLHFY
        TGFHFP Q+EET Q     E V+++KL+ FY
Subjt:  TGFHFPYQSEETLQ-----ENVKLLKLLHFY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAAGAAGGATGGAATTGGGCTTCCCGAAGTCTGCTTCATATAGTCTTCGAGAACAAGCTGCTAGGACGATTCTACGCAATGTAAGGTCACAAGGGCACACATACGT
TGAGCTACGAGAAGATGGGAAAAGGTTTATTTTCTTCTGCACTTTGTGCCTTGCACCATGTTATAGTGATTCAGTGCTCTTTAACCACCTGAAGGGTACTCTTCACACTG
AAAGATTATCTGCTGCTAAGCTGACTCTCTTAGGACCAAATCCATGGCCTTTTGATGATGGTGTTCTTTTCTTCAACAAGCCAGTTGAGGGAGATAACTCGATAGGGATC
TCAAATGACAATCATGAAAGGTTGTTGGAGTATCACAACAATGATAACAATCTTGCTATTGTCAGCTATGTTGATAATTCGAAAGCCAATGTCAATGGACATGGTGAGTT
TAATGGAAATATCAGGCACGTAGATGTTTGTTCCTTCGAGAATTTGAATGATGGTGGAGACAGTTGTCCTTTGGTGATACCTGGTGTACTGATTAAGGATGAAATTTCTG
ATATAAAAGTGAGGGAGTTGGGTTATGGACAAATTGCAGCTAGGTTTACTGAGAAGGATGGAATCTTATCTGGAGTTACCAGAATATGGTGTGAGTGGCTGGGTAAAGTA
AATGCCGGGCATGAGAATAAGGTCAAAGTTCCTGGACAGGATTATGCTATTGTTACTTTCACTTATAATGTTGATTTAGGTAGAAAGGGCCTGCTTGATGATGTTAAGTT
ATTACTCTCATCTAGTCCTGGAGCTGAACAAGAGAATGATGAGACCTCTAGAGTGAAAAGAAAGAAATGTTTCTCTGATCCTGAGAATGTGAGTGAATCTTTGAGCCATC
AATATGATTCGTCAGGTGAAGATTCTTCAGCTTCAAATTTTGCCACTTCATCGCTGTTGTTGGATAAATATGATGATCGGATTTTGAGTACAACAGTCATGTTGAATAAG
GCAGTAAGGCGTGAGTTGAGAAGGCAACAGCGCTTAGTCGCAGAGCGGATGTGTGATATCTGTCAACAGAAGATACTTACTCATAAAGATGTAGCAACACTCCTGAACAT
GAAAACTGGAAGACTTGCCTGCAGTAGTCGAAATGTTAACGGGGTGTTTCATGTATTTCATACCTCTTGCCTTATACACTGGATTCTTCTCTGTGAGTATGAGATGAGTG
TGAAGGATCTAGTTGGTTCAAAAGTTAGACGAAGGTACAGGAGAAAGAACAAGGCTAAGAATAAGGGCAACAAATGCAGCAAGGACAGTGAAACGAGACAAATAAAAACT
CAAATTGATTCTGTTTTCTGCCCTGCATGTCAGGGTACCGGTATAATTGTTGAGAGGGATGACCTAGAGAAACCAACTATCCCTCTTTCTGAGATCTTCAAATACAAAAT
AAAGGTGAGCGATGCCCGAAGAGCGTGGATGAAAAGTCCTGAGGTTCTGCAGAATTGTTCAACAGGTTTCCATTTCCCTTACCAATCTGAAGAAACTCTACAGGAAAATG
TGAAGCTTCTCAAATTGCTGCATTTTTATGGAGCTTTTGTATAG
mRNA sequenceShow/hide mRNA sequence
TCTATCTTAATCCAGTCAACAATCTAATCTACCACCGTCCATCACACATCTTAAAAACAACAAACCTTCAATCCAACGGCTCAGAGCGGCCGACTTGCATCGCTCCGATG
CTCGATCACTCATTGCAAATAATTAATTATTTATTTGCACATGTTCTTCGTTGCTCACCTTCTTCCTTTCGATTCAACTAATTTTCCGTTGTTTGTATTTTGCTTTTCGG
CCAAATCTTCGAACAAATCGGTTCAGTTCTCATTTCTCACCTGCTGAATTCTGGAATTTGTAAGCTTTCCCTCCATTTATTTTCAATTTTCCCTCTCTTTTTCAGAACCA
GCGAGTTCGATTTTCTCTTTCGCCCTCCGATTTGTAACCAATGGAGATGGCAATGGAGTTGGACGATTCATATTCGCACGTTTTTCTTTCCATTGCCCGCGTCTCGCTCA
CCGAAGATGGAAGGACTCCGCGGTTTCCGATTAATCCATTACGAAGTATACTGCGAATATGTATACCTGCGCCCTTTTAAGTTTTCTCTAGGGTTTCTTCAATTTCTTAG
GTTTTGGACGTGTTCTGCTCTGCAATTTCGAGGGTTCTATTTGGATTCTGGGCGTATTTGTTGAATTTCGTTCGATTTGAACTCTGATTGTTCGTGTTATTGGACTCCTT
GAAGTTTTCAAGGCTGGTGGCCATTATTTCCAAACCTTACCTTGGCTCCAGCTTTCGTTATCGAAGAAGTTTGAAGAATTATCGTGTTTGTTGAAATATCTCTTTTCTAT
TATAAGTGCTTTAGGGTTTCATATGGTTCTTATAAACTGCTGTTTCTATTCTCTCTGTCGTTGAAGAATATTACGGATTCTACTTAAAACTCGGCTCTTTTGTGAGGAAC
TTTTGAATTTAAAATTTTAATACAAGCTCTTCAGCGAGAGTTATTGAATGACAAGAAGGATGGAATTGGGCTTCCCGAAGTCTGCTTCATATAGTCTTCGAGAACAAGCT
GCTAGGACGATTCTACGCAATGTAAGGTCACAAGGGCACACATACGTTGAGCTACGAGAAGATGGGAAAAGGTTTATTTTCTTCTGCACTTTGTGCCTTGCACCATGTTA
TAGTGATTCAGTGCTCTTTAACCACCTGAAGGGTACTCTTCACACTGAAAGATTATCTGCTGCTAAGCTGACTCTCTTAGGACCAAATCCATGGCCTTTTGATGATGGTG
TTCTTTTCTTCAACAAGCCAGTTGAGGGAGATAACTCGATAGGGATCTCAAATGACAATCATGAAAGGTTGTTGGAGTATCACAACAATGATAACAATCTTGCTATTGTC
AGCTATGTTGATAATTCGAAAGCCAATGTCAATGGACATGGTGAGTTTAATGGAAATATCAGGCACGTAGATGTTTGTTCCTTCGAGAATTTGAATGATGGTGGAGACAG
TTGTCCTTTGGTGATACCTGGTGTACTGATTAAGGATGAAATTTCTGATATAAAAGTGAGGGAGTTGGGTTATGGACAAATTGCAGCTAGGTTTACTGAGAAGGATGGAA
TCTTATCTGGAGTTACCAGAATATGGTGTGAGTGGCTGGGTAAAGTAAATGCCGGGCATGAGAATAAGGTCAAAGTTCCTGGACAGGATTATGCTATTGTTACTTTCACT
TATAATGTTGATTTAGGTAGAAAGGGCCTGCTTGATGATGTTAAGTTATTACTCTCATCTAGTCCTGGAGCTGAACAAGAGAATGATGAGACCTCTAGAGTGAAAAGAAA
GAAATGTTTCTCTGATCCTGAGAATGTGAGTGAATCTTTGAGCCATCAATATGATTCGTCAGGTGAAGATTCTTCAGCTTCAAATTTTGCCACTTCATCGCTGTTGTTGG
ATAAATATGATGATCGGATTTTGAGTACAACAGTCATGTTGAATAAGGCAGTAAGGCGTGAGTTGAGAAGGCAACAGCGCTTAGTCGCAGAGCGGATGTGTGATATCTGT
CAACAGAAGATACTTACTCATAAAGATGTAGCAACACTCCTGAACATGAAAACTGGAAGACTTGCCTGCAGTAGTCGAAATGTTAACGGGGTGTTTCATGTATTTCATAC
CTCTTGCCTTATACACTGGATTCTTCTCTGTGAGTATGAGATGAGTGTGAAGGATCTAGTTGGTTCAAAAGTTAGACGAAGGTACAGGAGAAAGAACAAGGCTAAGAATA
AGGGCAACAAATGCAGCAAGGACAGTGAAACGAGACAAATAAAAACTCAAATTGATTCTGTTTTCTGCCCTGCATGTCAGGGTACCGGTATAATTGTTGAGAGGGATGAC
CTAGAGAAACCAACTATCCCTCTTTCTGAGATCTTCAAATACAAAATAAAGGTGAGCGATGCCCGAAGAGCGTGGATGAAAAGTCCTGAGGTTCTGCAGAATTGTTCAAC
AGGTTTCCATTTCCCTTACCAATCTGAAGAAACTCTACAGGAAAATGTGAAGCTTCTCAAATTGCTGCATTTTTATGGAGCTTTTGTATAGAGGTACCCTTCAAATTCGT
AGGTTGTAACAGGCTTCAAGCATGCGCTGAAGCGTACAGTAGATCTTACTTTCTTATTAGCCACTTGACAAACTTTTGAACATTCATGATCTTTGTAAATGAAATGTCCA
CAAGTCTCGTAAATAATCAGAAGGGCATGGAGTATCGGATTCGGAGTTGC
Protein sequenceShow/hide protein sequence
MTRRMELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFNKPVEGDNSIGI
SNDNHERLLEYHNNDNNLAIVSYVDNSKANVNGHGEFNGNIRHVDVCSFENLNDGGDSCPLVIPGVLIKDEISDIKVRELGYGQIAARFTEKDGILSGVTRIWCEWLGKV
NAGHENKVKVPGQDYAIVTFTYNVDLGRKGLLDDVKLLLSSSPGAEQENDETSRVKRKKCFSDPENVSESLSHQYDSSGEDSSASNFATSSLLLDKYDDRILSTTVMLNK
AVRRELRRQQRLVAERMCDICQQKILTHKDVATLLNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEMSVKDLVGSKVRRRYRRKNKAKNKGNKCSKDSETRQIKT
QIDSVFCPACQGTGIIVERDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGFHFPYQSEETLQENVKLLKLLHFYGAFV