CuGenDBv2

Sgr023608 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr023608
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: C2H2-type domain-containing protein
Genome location: tig00000892:4973840..4981540
RNA-Seq Expression: Sgr023608
Synteny: Sgr023608
Gene Ontology terms: NA
InterPro domains: IPR013087 - Zinc finger C2H2-type


Homology
GenBank top hits (e value, % identity, alignment)
XP_022140423.1 uncharacterized protein LOC111011105 isoform X1 [Momordica charantia]; e value 9.1e-246; identity 86.73%
Query:  MELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPVER
        MELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPVER
Subjt:  MELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPVER

Query:  DNQIGMSNDNHKRLLEYHNNDNNLAIVNYGENLKGSVNGHAEFNGNIRNMEVCSFENLNDGEHSCTLVIPGVLIRDEISDIKVRELGYGQIAARFTEKDE
        DNQIG+SNDNH+RLLEYHNNDNNLAIV Y EN KG+ + H + +GNIR+M   SFENLNDG  SC LVIPGVLIRDEISDIKV ELGYGQIAARFTEKD 
Subjt:  DNQIGMSNDNHKRLLEYHNNDNNLAIVNYGENLKGSVNGHAEFNGNIRNMEVCSFENLNDGEHSCTLVIPGVLIRDEISDIKVRELGYGQIAARFTEKDE

Query:  ILTGVSRIWCEW-----------------------------LGRKGLLDDVKLLLSSSPGAELENDENARVKRKKSFSDPEDVSESLSHQYDSSGEDSSA
        ILTGV RIWCEW                             LGRKGLLDDVKLLLSSSPGAELENDEN RVKRKKSFSDP  VSESLSHQYDSSGEDSSA
Subjt:  ILTGVSRIWCEW-----------------------------LGRKGLLDDVKLLLSSSPGAELENDENARVKRKKSFSDPEDVSESLSHQYDSSGEDSSA

Query:  SNCVSSSLLLDRYDDQILSTTIMLNKAVRRELRRQQRLAAERTCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEMSV
        SNCV+SSLLLDRYDDQILSTTI LNKAVRRELRRQQRL AER CDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYE+SV
Subjt:  SNCVSSSLLLDRYDDQILSTTIMLNKAVRRELRRQQRLAAERTCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEMSV

Query:  KDLGGPKARRRYRRKNRTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIIDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGCHFPYQS
        KDLGG KARRRYRRKNRTKGNK SKDSETRQIKTQIDS+FCPACQGTGI +DGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTG HFPYQS
Subjt:  KDLGGPKARRRYRRKNRTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIIDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGCHFPYQS

Query:  EETLQ
        EET+Q
Subjt:  EETLQ

XP_022943372.1 uncharacterized protein LOC111448154 isoform X1 [Cucurbita moschata]; e value 1.1e-238; identity 82.77%
Query:  MELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPVER
        MELGFPKSASYSLREQAARTILRNVRSQGH+YVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPVE 
Subjt:  MELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPVER

Query:  DNQIGMSNDNHKRLLEYHNNDNNLAIVNYGENLKGSVNGHAEFNGNIRNMEVCSFENLNDGEHSCTLVIPGVLIRDEISDIKVRELGYGQIAARFTEKDE
        DNQ+   NDN +RLLEYHNNDNNLAIV+Y ++ KG+ NGH EFNGN+RN+E CSFENL+D   +  LVIPGVLI+DEISDI+VRELGYGQIAARFTEKD 
Subjt:  DNQIGMSNDNHKRLLEYHNNDNNLAIVNYGENLKGSVNGHAEFNGNIRNMEVCSFENLNDGEHSCTLVIPGVLIRDEISDIKVRELGYGQIAARFTEKDE

Query:  ILTGVSRIWCEW-----------------------------LGRKGLLDDVKLLLSSSPGAELENDENARVKRKKSFSDPEDVSESLSHQYDSSGEDSSA
        IL G+SRIWCEW                             LGRKGLLDDVKLLLSSS GAE E DEN+RVKRKK FSD EDVS+S+SHQYDSSGEDSSA
Subjt:  ILTGVSRIWCEW-----------------------------LGRKGLLDDVKLLLSSSPGAELENDENARVKRKKSFSDPEDVSESLSHQYDSSGEDSSA

Query:  SNCVSSSLLLDRYDDQILSTTIMLNKAVRRELRRQQRLAAERTCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEMSV
        SNCV SSLLLDRYDD+IL+TT+MLNK+V+REL+RQQRLA+ER CDICQQKILTHKDVATL+NMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEMSV
Subjt:  SNCVSSSLLLDRYDDQILSTTIMLNKAVRRELRRQQRLAAERTCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEMSV

Query:  KDLGGPKARRRYRRKNRTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIIDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGCHFPYQS
        K+LGG K RRRYRRKN+TKGNK+SK+ ETRQIKTQIDSVFCPACQGTGII+DGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTG HFPYQS
Subjt:  KDLGGPKARRRYRRKNRTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIIDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGCHFPYQS

Query:  EETLQ
        EETLQ
Subjt:  EETLQ

XP_022985949.1 uncharacterized protein LOC111483838 isoform X1 [Cucurbita maxima]; e value 1.1e-238; identity 82.57%
Query:  MELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPVER
        MELGFPKSASYSLREQAARTILRNVRSQGH+YVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPVE 
Subjt:  MELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPVER

Query:  DNQIGMSNDNHKRLLEYHNNDNNLAIVNYGENLKGSVNGHAEFNGNIRNMEVCSFENLNDGEHSCTLVIPGVLIRDEISDIKVRELGYGQIAARFTEKDE
        DNQ+   NDNH+RLLEYHNNDNNLAIV+Y +N KG+ NGH EFNGN+RN+E CSFENL+DG  +  LVIPGVLI+DEISDIKV ELGYGQIAARFTEKD 
Subjt:  DNQIGMSNDNHKRLLEYHNNDNNLAIVNYGENLKGSVNGHAEFNGNIRNMEVCSFENLNDGEHSCTLVIPGVLIRDEISDIKVRELGYGQIAARFTEKDE

Query:  ILTGVSRIWCEW-----------------------------LGRKGLLDDVKLLLSSSPGAELENDENARVKRKKSFSDPEDVSESLSHQYDSSGEDSSA
        I+ GVSRIWCEW                             LGRKGLLDDVKLLLSSS GAE E D+N+RVKRKK FSD EDVS+S+SHQYDSSGEDSSA
Subjt:  ILTGVSRIWCEW-----------------------------LGRKGLLDDVKLLLSSSPGAELENDENARVKRKKSFSDPEDVSESLSHQYDSSGEDSSA

Query:  SNCVSSSLLLDRYDDQILSTTIMLNKAVRRELRRQQRLAAERTCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEMSV
        SNCV SSLLLDRYDD+IL+TT+MLNK+V+REL++QQRLA+ER CDICQQKILTHKDVATL+NMKTGRLACSSRN NGVFHVFHTSCLIHWILLCEYEMSV
Subjt:  SNCVSSSLLLDRYDDQILSTTIMLNKAVRRELRRQQRLAAERTCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEMSV

Query:  KDLGGPKARRRYRRKNRTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIIDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGCHFPYQS
        K+LGG K RRRYRRKN+TKGNK+SK+ ETRQIKTQIDSVFCPACQGTG+I+DGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTG HFPYQS
Subjt:  KDLGGPKARRRYRRKNRTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIIDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGCHFPYQS

Query:  EETLQ
        EETLQ
Subjt:  EETLQ

XP_023512516.1 uncharacterized protein LOC111777240 [Cucurbita pepo subsp. pepo]; e value 6.3e-239; identity 82.77%
Query:  MELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPVER
        MELGFPKSASYSLREQAARTILRNVRSQGH+YVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPVE 
Subjt:  MELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPVER

Query:  DNQIGMSNDNHKRLLEYHNNDNNLAIVNYGENLKGSVNGHAEFNGNIRNMEVCSFENLNDGEHSCTLVIPGVLIRDEISDIKVRELGYGQIAARFTEKDE
        DNQ+   NDN +RLLEYHNNDNNLAIV+Y +N KG+ NGH EFNGN+RN+E CSFENL+DG  +  LVIPGVLI+DEISDIKVRELGYG+IAARFTEKD 
Subjt:  DNQIGMSNDNHKRLLEYHNNDNNLAIVNYGENLKGSVNGHAEFNGNIRNMEVCSFENLNDGEHSCTLVIPGVLIRDEISDIKVRELGYGQIAARFTEKDE

Query:  ILTGVSRIWCEW-----------------------------LGRKGLLDDVKLLLSSSPGAELENDENARVKRKKSFSDPEDVSESLSHQYDSSGEDSSA
        I+ GVSRIWCEW                             LGRKGLLDDVKLLLSSS GAE END+N+RVKRKK FSD EDVS+S+SHQYDSSGEDSSA
Subjt:  ILTGVSRIWCEW-----------------------------LGRKGLLDDVKLLLSSSPGAELENDENARVKRKKSFSDPEDVSESLSHQYDSSGEDSSA

Query:  SNCVSSSLLLDRYDDQILSTTIMLNKAVRRELRRQQRLAAERTCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEMSV
        SNCV SSLLLDRYDD+IL+TT+MLNK+V+REL+RQQRLA+ER CDICQQKILTHKDVATL+N+KTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEMSV
Subjt:  SNCVSSSLLLDRYDDQILSTTIMLNKAVRRELRRQQRLAAERTCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEMSV

Query:  KDLGGPKARRRYRRKNRTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIIDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGCHFPYQS
        K+LGG K RRRYRRKN+TKGNK+SK+ ETRQIKTQIDSVFCPACQGTGII+DGD+LEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTG HFPYQS
Subjt:  KDLGGPKARRRYRRKNRTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIIDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGCHFPYQS

Query:  EETLQ
        EETLQ
Subjt:  EETLQ

XP_038901466.1 uncharacterized protein LOC120088321 [Benincasa hispida]; e value 8.0e-242; identity 84.16%
Query:  MELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPVER
        MELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPVE 
Subjt:  MELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPVER

Query:  DNQIGMSNDNHKRLLEYHNNDNNLAIVNYGENLKGSVNGHAEFNGNIRNMEVCSFENLNDGEHSCTLVIPGVLIRDEISDIKVRELGYGQIAARFTEKDE
        DNQ+GMSNDNH+RLLEYHNNDNNLAIV Y  N KG+ NGH EFNGN+RNME CSFEN+NDG     LVIPGVLI++EISDIKVRELGYGQIAAR TEK+ 
Subjt:  DNQIGMSNDNHKRLLEYHNNDNNLAIVNYGENLKGSVNGHAEFNGNIRNMEVCSFENLNDGEHSCTLVIPGVLIRDEISDIKVRELGYGQIAARFTEKDE

Query:  ILTGVSRIWCEW-----------------------------LGRKGLLDDVKLLLSSSPGAELENDENARVKRKKSFSDPEDVSESLSHQYDSSGEDSSA
        I +GVSRIWCEW                             LGRKGLLDDVKLLLSSSPGAE +N+EN RVKRK SFSDPED S+S+SHQYDSSGEDSSA
Subjt:  ILTGVSRIWCEW-----------------------------LGRKGLLDDVKLLLSSSPGAELENDENARVKRKKSFSDPEDVSESLSHQYDSSGEDSSA

Query:  SNCVSSSLLLDRYDDQILSTTIMLNKAVRRELRRQQRLAAERTCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEMSV
        SN V+SSLLLD YDDQILS TIMLNKAVRRELRRQQRLAAER CDICQQKILTHKDVATL+NMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYE+SV
Subjt:  SNCVSSSLLLDRYDDQILSTTIMLNKAVRRELRRQQRLAAERTCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEMSV

Query:  KDLGGPKARRRYRRKNRTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIIDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGCHFPYQS
        KDLGGPK RRRYRRK + KG+KHSKD ETRQIKTQIDSVFCPACQGTGII+DGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTG HFPYQ 
Subjt:  KDLGGPKARRRYRRKNRTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIIDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGCHFPYQS

Query:  EETLQ
        +ET+Q
Subjt:  EETLQ

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KE98 C2H2-type domain-containing protein; e value 3.4e-238; identity 82.97%
Query:  MELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPVER
        MELGFPKSASYSLREQAARTILRNVRSQGHTYVELRE+GK+FIFFCTLCLAPCYSDSVLF+HLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKP+E 
Subjt:  MELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPVER

Query:  DNQIGMSNDNHKRLLEYHNNDNNLAIVNYGENLKGSVNGHAEFNGNIRNMEVCSFENLNDGEHSCTLVIPGVLIRDEISDIKVRELGYGQIAARFTEKDE
        DNQ+G+SNDNH+RLLEY+NNDNNLAIV Y  N KG+ N   EFNGN+RN+E CSFENLNDG  SC LVIPGVLI++EISDIKVRELGYGQIAARFTEKD 
Subjt:  DNQIGMSNDNHKRLLEYHNNDNNLAIVNYGENLKGSVNGHAEFNGNIRNMEVCSFENLNDGEHSCTLVIPGVLIRDEISDIKVRELGYGQIAARFTEKDE

Query:  ILTGVSRIWCEW-----------------------------LGRKGLLDDVKLLLSSSPGAELENDENARVKRKKSFSDPEDVSESLSHQYDSSGEDSSA
        I +GVSRIWCEW                             LGRKGLLDDVKLLLSSSPGAE +NDEN +VKRKKSFSDPED S S+S QYDSSGEDSSA
Subjt:  ILTGVSRIWCEW-----------------------------LGRKGLLDDVKLLLSSSPGAELENDENARVKRKKSFSDPEDVSESLSHQYDSSGEDSSA

Query:  SNCVSSSLLLDRYDDQILSTTIMLNKAVRRELRRQQRLAAERTCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEMSV
        SNCV SSL LD YDDQILSTT+MLNKAVRRELRRQQRLAAER CDICQQKILTHKDVATL+NMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYE+SV
Subjt:  SNCVSSSLLLDRYDDQILSTTIMLNKAVRRELRRQQRLAAERTCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEMSV

Query:  KDLGGPKARRRYRRKNRTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIIDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGCHFPYQS
        KDLGG K RRRYRRK +TKGNKH KD ETRQIKTQIDSVFCPACQGTGI IDGDDLEKPT+PLSEIFKYKIKVSDARRAWMKSPEVLQNCSTG  FPYQ 
Subjt:  KDLGGPKARRRYRRKNRTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIIDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGCHFPYQS

Query:  EETLQ
        +ET+Q
Subjt:  EETLQ

A0A1S3BJC6 uncharacterized protein LOC103490523; e value 1.3e-237; identity 82.77%
Query:  MELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPVER
        MELGFPKSASYSLREQAARTILRNVRSQGHTYVELRE+GK+FIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPVE 
Subjt:  MELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPVER

Query:  DNQIGMSNDNHKRLLEYHNNDNNLAIVNYGENLKGSVNGHAEFNGNIRNMEVCSFENLNDGEHSCTLVIPGVLIRDEISDIKVRELGYGQIAARFTEKDE
        D+Q+GMSNDNH+RLLEY+NNDNNLAIV Y  N KG+ NG  EFNGN+RN+E CSFENLNDG  S  LVIPGVLI++EISDIKVR LGYGQIAARFTEKD 
Subjt:  DNQIGMSNDNHKRLLEYHNNDNNLAIVNYGENLKGSVNGHAEFNGNIRNMEVCSFENLNDGEHSCTLVIPGVLIRDEISDIKVRELGYGQIAARFTEKDE

Query:  ILTGVSRIWCEW-----------------------------LGRKGLLDDVKLLLSSSPGAELENDENARVKRKKSFSDPEDVSESLSHQYDSSGEDSSA
        I +GVSRIWCEW                             LGRKGLLDDVKLLLSSSPGAE +NDEN +VKRKKSFSDPED S S+S QYDSSGEDSSA
Subjt:  ILTGVSRIWCEW-----------------------------LGRKGLLDDVKLLLSSSPGAELENDENARVKRKKSFSDPEDVSESLSHQYDSSGEDSSA

Query:  SNCVSSSLLLDRYDDQILSTTIMLNKAVRRELRRQQRLAAERTCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEMSV
        SNCV SSL LD YDDQILSTT+MLNKAVRRELRRQ RLAAER CDICQQKILTHKDVATL+NMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYE+SV
Subjt:  SNCVSSSLLLDRYDDQILSTTIMLNKAVRRELRRQQRLAAERTCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEMSV

Query:  KDLGGPKARRRYRRKNRTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIIDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGCHFPYQS
        KDLGG K RRRYRRK +TKGNKHSKD ETRQ+K+QID VFCPACQGTG+IIDGDDLEKPT+PLSEIFKYKIKVSDARRAWMKSPEVLQNCSTG HFPYQ 
Subjt:  KDLGGPKARRRYRRKNRTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIIDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGCHFPYQS

Query:  EETLQ
        +ET+Q
Subjt:  EETLQ

A0A6J1CFP1 uncharacterized protein LOC111011105 isoform X1; e value 4.4e-246; identity 86.73%
Query:  MELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPVER
        MELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPVER
Subjt:  MELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPVER

Query:  DNQIGMSNDNHKRLLEYHNNDNNLAIVNYGENLKGSVNGHAEFNGNIRNMEVCSFENLNDGEHSCTLVIPGVLIRDEISDIKVRELGYGQIAARFTEKDE
        DNQIG+SNDNH+RLLEYHNNDNNLAIV Y EN KG+ + H + +GNIR+M   SFENLNDG  SC LVIPGVLIRDEISDIKV ELGYGQIAARFTEKD 
Subjt:  DNQIGMSNDNHKRLLEYHNNDNNLAIVNYGENLKGSVNGHAEFNGNIRNMEVCSFENLNDGEHSCTLVIPGVLIRDEISDIKVRELGYGQIAARFTEKDE

Query:  ILTGVSRIWCEW-----------------------------LGRKGLLDDVKLLLSSSPGAELENDENARVKRKKSFSDPEDVSESLSHQYDSSGEDSSA
        ILTGV RIWCEW                             LGRKGLLDDVKLLLSSSPGAELENDEN RVKRKKSFSDP  VSESLSHQYDSSGEDSSA
Subjt:  ILTGVSRIWCEW-----------------------------LGRKGLLDDVKLLLSSSPGAELENDENARVKRKKSFSDPEDVSESLSHQYDSSGEDSSA

Query:  SNCVSSSLLLDRYDDQILSTTIMLNKAVRRELRRQQRLAAERTCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEMSV
        SNCV+SSLLLDRYDDQILSTTI LNKAVRRELRRQQRL AER CDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYE+SV
Subjt:  SNCVSSSLLLDRYDDQILSTTIMLNKAVRRELRRQQRLAAERTCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEMSV

Query:  KDLGGPKARRRYRRKNRTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIIDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGCHFPYQS
        KDLGG KARRRYRRKNRTKGNK SKDSETRQIKTQIDS+FCPACQGTGI +DGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTG HFPYQS
Subjt:  KDLGGPKARRRYRRKNRTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIIDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGCHFPYQS

Query:  EETLQ
        EET+Q
Subjt:  EETLQ

A0A6J1FSV1 uncharacterized protein LOC111448154 isoform X1; e value 5.2e-239; identity 82.77%
Query:  MELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPVER
        MELGFPKSASYSLREQAARTILRNVRSQGH+YVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPVE 
Subjt:  MELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPVER

Query:  DNQIGMSNDNHKRLLEYHNNDNNLAIVNYGENLKGSVNGHAEFNGNIRNMEVCSFENLNDGEHSCTLVIPGVLIRDEISDIKVRELGYGQIAARFTEKDE
        DNQ+   NDN +RLLEYHNNDNNLAIV+Y ++ KG+ NGH EFNGN+RN+E CSFENL+D   +  LVIPGVLI+DEISDI+VRELGYGQIAARFTEKD 
Subjt:  DNQIGMSNDNHKRLLEYHNNDNNLAIVNYGENLKGSVNGHAEFNGNIRNMEVCSFENLNDGEHSCTLVIPGVLIRDEISDIKVRELGYGQIAARFTEKDE

Query:  ILTGVSRIWCEW-----------------------------LGRKGLLDDVKLLLSSSPGAELENDENARVKRKKSFSDPEDVSESLSHQYDSSGEDSSA
        IL G+SRIWCEW                             LGRKGLLDDVKLLLSSS GAE E DEN+RVKRKK FSD EDVS+S+SHQYDSSGEDSSA
Subjt:  ILTGVSRIWCEW-----------------------------LGRKGLLDDVKLLLSSSPGAELENDENARVKRKKSFSDPEDVSESLSHQYDSSGEDSSA

Query:  SNCVSSSLLLDRYDDQILSTTIMLNKAVRRELRRQQRLAAERTCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEMSV
        SNCV SSLLLDRYDD+IL+TT+MLNK+V+REL+RQQRLA+ER CDICQQKILTHKDVATL+NMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEMSV
Subjt:  SNCVSSSLLLDRYDDQILSTTIMLNKAVRRELRRQQRLAAERTCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEMSV

Query:  KDLGGPKARRRYRRKNRTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIIDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGCHFPYQS
        K+LGG K RRRYRRKN+TKGNK+SK+ ETRQIKTQIDSVFCPACQGTGII+DGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTG HFPYQS
Subjt:  KDLGGPKARRRYRRKNRTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIIDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGCHFPYQS

Query:  EETLQ
        EETLQ
Subjt:  EETLQ

A0A6J1JEQ8 uncharacterized protein LOC111483838 isoform X1; e value 5.2e-239; identity 82.57%
Query:  MELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPVER
        MELGFPKSASYSLREQAARTILRNVRSQGH+YVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPVE 
Subjt:  MELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPVER

Query:  DNQIGMSNDNHKRLLEYHNNDNNLAIVNYGENLKGSVNGHAEFNGNIRNMEVCSFENLNDGEHSCTLVIPGVLIRDEISDIKVRELGYGQIAARFTEKDE
        DNQ+   NDNH+RLLEYHNNDNNLAIV+Y +N KG+ NGH EFNGN+RN+E CSFENL+DG  +  LVIPGVLI+DEISDIKV ELGYGQIAARFTEKD 
Subjt:  DNQIGMSNDNHKRLLEYHNNDNNLAIVNYGENLKGSVNGHAEFNGNIRNMEVCSFENLNDGEHSCTLVIPGVLIRDEISDIKVRELGYGQIAARFTEKDE

Query:  ILTGVSRIWCEW-----------------------------LGRKGLLDDVKLLLSSSPGAELENDENARVKRKKSFSDPEDVSESLSHQYDSSGEDSSA
        I+ GVSRIWCEW                             LGRKGLLDDVKLLLSSS GAE E D+N+RVKRKK FSD EDVS+S+SHQYDSSGEDSSA
Subjt:  ILTGVSRIWCEW-----------------------------LGRKGLLDDVKLLLSSSPGAELENDENARVKRKKSFSDPEDVSESLSHQYDSSGEDSSA

Query:  SNCVSSSLLLDRYDDQILSTTIMLNKAVRRELRRQQRLAAERTCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEMSV
        SNCV SSLLLDRYDD+IL+TT+MLNK+V+REL++QQRLA+ER CDICQQKILTHKDVATL+NMKTGRLACSSRN NGVFHVFHTSCLIHWILLCEYEMSV
Subjt:  SNCVSSSLLLDRYDDQILSTTIMLNKAVRRELRRQQRLAAERTCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEMSV

Query:  KDLGGPKARRRYRRKNRTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIIDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGCHFPYQS
        K+LGG K RRRYRRKN+TKGNK+SK+ ETRQIKTQIDSVFCPACQGTG+I+DGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTG HFPYQS
Subjt:  KDLGGPKARRRYRRKNRTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIIDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGCHFPYQS

Query:  EETLQ
        EETLQ
Subjt:  EETLQ

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT4G28260.1 unknown protein; e value 9.7e-121; identity 46.46%
Query:  ELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPV--E
        ELG PK  S +L+EQ ART L+N+R QGHTY+ELREDGKRF+FFCTLCLAPCYSD++L  HL G LH ERL+ A++TLLG NPWPF DGVLFF      E
Subjt:  ELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPV--E

Query:  RDNQIGMSNDNHKRLLEYHNNDNNLAIVNYGENLKGSVNGHAEFNGNIRNMEVCSFENLNDGEHSC-TLVIPGVLIRDEISDIKVRELGYGQIAARFTEK
         +       +     LE+ ++D   AIV Y  N     N  A                 ++  H+   L+I GVLI++   D++ + +G+G+IAAR  E 
Subjt:  RDNQIGMSNDNHKRLLEYHNNDNNLAIVNYGENLKGSVNGHAEFNGNIRNMEVCSFENLNDGEHSC-TLVIPGVLIRDEISDIKVRELGYGQIAARFTEK

Query:  DEILTGVSRIWCEW-----------------------------LGRKGLLDDVKLLLSSSPGAELENDENARVKRKKSFSDPEDVSESLSHQYDSSGEDS
            T + ++WCEW                             LGR GLLDD   LL+SS  +E  N E++  KRKKSFSDPED SESL +QYDSS E S
Subjt:  DEILTGVSRIWCEW-----------------------------LGRKGLLDDVKLLLSSSPGAELENDENARVKRKKSFSDPEDVSESLSHQYDSSGEDS

Query:  SASNCVSSSLLLDRYDDQILSTTIMLNKAVRRELRRQQRLAAERTCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEM
        S  N  SS  L+  YDD ++S  ++ N+ VRRELRRQQR+ +ER C++C+QK+L  KD A ++NMKTG LAC SRN+ G FH+FH SC++HW L CE E+
Subjt:  SASNCVSSSLLLDRYDDQILSTTIMLNKAVRRELRRQQRLAAERTCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEM

Query:  SVKDLGGPKARRRYRRKNRTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIIDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGCHFPY
            +   K ++R  + +   G K ++      +  QI SVFCP CQGTGI I+G  +E+ T PLS+ +++++KVS+ R+AW+K+PE L+NCSTG HFP 
Subjt:  SVKDLGGPKARRRYRRKNRTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIIDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGCHFPY

Query:  QSEETLQV
        Q+EET Q+
Subjt:  QSEETLQV

AT4G28260.2 unknown protein; e value 9.7e-121; identity 46.46%
Query:  ELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPV--E
        ELG PK  S +L+EQ ART L+N+R QGHTY+ELREDGKRF+FFCTLCLAPCYSD++L  HL G LH ERL+ A++TLLG NPWPF DGVLFF      E
Subjt:  ELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPV--E

Query:  RDNQIGMSNDNHKRLLEYHNNDNNLAIVNYGENLKGSVNGHAEFNGNIRNMEVCSFENLNDGEHSC-TLVIPGVLIRDEISDIKVRELGYGQIAARFTEK
         +       +     LE+ ++D   AIV Y  N     N  A                 ++  H+   L+I GVLI++   D++ + +G+G+IAAR  E 
Subjt:  RDNQIGMSNDNHKRLLEYHNNDNNLAIVNYGENLKGSVNGHAEFNGNIRNMEVCSFENLNDGEHSC-TLVIPGVLIRDEISDIKVRELGYGQIAARFTEK

Query:  DEILTGVSRIWCEW-----------------------------LGRKGLLDDVKLLLSSSPGAELENDENARVKRKKSFSDPEDVSESLSHQYDSSGEDS
            T + ++WCEW                             LGR GLLDD   LL+SS  +E  N E++  KRKKSFSDPED SESL +QYDSS E S
Subjt:  DEILTGVSRIWCEW-----------------------------LGRKGLLDDVKLLLSSSPGAELENDENARVKRKKSFSDPEDVSESLSHQYDSSGEDS

Query:  SASNCVSSSLLLDRYDDQILSTTIMLNKAVRRELRRQQRLAAERTCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEM
        S  N  SS  L+  YDD ++S  ++ N+ VRRELRRQQR+ +ER C++C+QK+L  KD A ++NMKTG LAC SRN+ G FH+FH SC++HW L CE E+
Subjt:  SASNCVSSSLLLDRYDDQILSTTIMLNKAVRRELRRQQRLAAERTCDICQQKILTHKDVATLVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEM

Query:  SVKDLGGPKARRRYRRKNRTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIIDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGCHFPY
            +   K ++R  + +   G K ++      +  QI SVFCP CQGTGI I+G  +E+ T PLS+ +++++KVS+ R+AW+K+PE L+NCSTG HFP 
Subjt:  SVKDLGGPKARRRYRRKNRTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIIDGDDLEKPTIPLSEIFKYKIKVSDARRAWMKSPEVLQNCSTGCHFPY

Query:  QSEETLQV
        Q+EET Q+
Subjt:  QSEETLQV


Sequences
CDS sequence
ATGGAATTGGGGTTCCCAAAGTCTGCTTCGTATAGTCTTCGAGAACAAGCTGCTAGAACAATTCTACGCAATGTAAGGTCACAAGGGCATACATACGTTGAGCTAAGAGA
AGATGGGAAAAGGTTTATTTTCTTCTGCACTTTGTGTCTTGCACCATGTTATAGTGATTCAGTGCTCTTTAACCACCTGAAGGGTACTCTTCACACTGAAAGATTATCTG
CTGCTAAGCTGACTCTCTTAGGACCAAATCCATGGCCTTTTGATGATGGTGTTCTTTTCTTCCATAAGCCAGTTGAGCGAGATAACCAGATTGGGATGTCAAATGACAAT
CATAAAAGGTTGTTGGAGTATCACAACAATGATAACAATCTTGCTATTGTCAACTATGGTGAAAATTTGAAAGGCAGTGTCAACGGACATGCTGAGTTTAATGGAAATAT
AAGGAATATGGAAGTTTGTTCCTTCGAGAATTTGAATGATGGTGAACACAGTTGTACTTTGGTGATTCCTGGTGTACTGATTAGGGATGAAATTTCTGATATAAAAGTGA
GGGAGTTGGGTTATGGACAAATTGCAGCTAGGTTTACTGAGAAGGATGAGATCTTAACTGGAGTTAGCAGAATATGGTGCGAGTGGCTGGGTAGAAAGGGCTTGCTTGAT
GATGTCAAGTTATTGCTCTCATCTAGTCCCGGAGCTGAACTAGAGAATGATGAGAATGCTAGAGTGAAAAGAAAGAAATCGTTCTCTGATCCTGAGGATGTTAGTGAGTC
TTTGAGCCATCAATATGATTCTTCAGGTGAAGATTCTTCAGCTTCAAATTGTGTCTCTTCATCGCTATTGTTGGATAGATATGATGATCAGATTTTGAGTACAACAATCA
TGTTGAATAAGGCAGTAAGGCGTGAGCTGAGAAGGCAACAGCGTTTAGCCGCAGAGCGGACGTGTGATATCTGTCAACAGAAGATACTTACCCATAAAGATGTAGCAACA
CTTGTGAACATGAAAACTGGAAGACTTGCCTGCAGTAGTCGAAATGTTAATGGGGTGTTTCATGTATTCCATACATCGTGCCTTATACATTGGATACTTCTTTGTGAGTA
TGAGATGAGTGTGAAGGATCTAGGCGGTCCAAAAGCTAGACGAAGGTACAGGAGAAAGAACAGGACTAAGGGCAATAAACACAGCAAGGACAGTGAAACGAGACAAATAA
AAACTCAAATCGATTCTGTATTCTGCCCAGCATGTCAGGGTACCGGTATAATTATTGATGGAGATGACCTAGAGAAACCAACTATCCCTCTTTCTGAGATCTTTAAATAT
AAAATAAAGGTGAGTGATGCCCGAAGAGCGTGGATGAAAAGTCCTGAGGTTCTGCAGAATTGTTCGACAGGTTGCCATTTCCCTTACCAATCTGAAGAAACTCTACAGGT
TTGTCGTCGTGTTTCCGAATGTTCTTGCATAAATCGCTTCCCCAAACATTCCGATCACTGGATGGATAATGTTCGTTGTAATACTGCTCGAGCTGAGCATCGACGAAGCG
ATTCTGAAATCTGCCGTAAAGAGGTACGAGAGTTGCGTGCGAGACTTCAGCCTTGCAAACGGGACAAAGACGTTCCCGTGGTTGGCATTGTTGCTGTGCAGATACGTTCT
GGTAATGAAGCCATTTATAAATGCAGGGCCAACAGAAGAGGTGACCACACAGTGTTACAACTGGATCTTCAACTGGGTCTAGGCATATGTTACAGTCAAAGCCATGGGAA
GCTCCACCATCAGCATCTGAAACACATGTTAGAAGAGTGGGTTGAATGGAAAGGGGTAGAGAAGAGAAGAGAGGAATGGGGTGTGGATTTGAAGAGGGGTGATGATGAGG
CTTCAGGCGTTTATATAAGAGCGAAAGAGACCCAAAAGGAAAAGAAAGCGAGGCATCTCCCTCTCGTTTAG
mRNA sequence
ATGGAATTGGGGTTCCCAAAGTCTGCTTCGTATAGTCTTCGAGAACAAGCTGCTAGAACAATTCTACGCAATGTAAGGTCACAAGGGCATACATACGTTGAGCTAAGAGA
AGATGGGAAAAGGTTTATTTTCTTCTGCACTTTGTGTCTTGCACCATGTTATAGTGATTCAGTGCTCTTTAACCACCTGAAGGGTACTCTTCACACTGAAAGATTATCTG
CTGCTAAGCTGACTCTCTTAGGACCAAATCCATGGCCTTTTGATGATGGTGTTCTTTTCTTCCATAAGCCAGTTGAGCGAGATAACCAGATTGGGATGTCAAATGACAAT
CATAAAAGGTTGTTGGAGTATCACAACAATGATAACAATCTTGCTATTGTCAACTATGGTGAAAATTTGAAAGGCAGTGTCAACGGACATGCTGAGTTTAATGGAAATAT
AAGGAATATGGAAGTTTGTTCCTTCGAGAATTTGAATGATGGTGAACACAGTTGTACTTTGGTGATTCCTGGTGTACTGATTAGGGATGAAATTTCTGATATAAAAGTGA
GGGAGTTGGGTTATGGACAAATTGCAGCTAGGTTTACTGAGAAGGATGAGATCTTAACTGGAGTTAGCAGAATATGGTGCGAGTGGCTGGGTAGAAAGGGCTTGCTTGAT
GATGTCAAGTTATTGCTCTCATCTAGTCCCGGAGCTGAACTAGAGAATGATGAGAATGCTAGAGTGAAAAGAAAGAAATCGTTCTCTGATCCTGAGGATGTTAGTGAGTC
TTTGAGCCATCAATATGATTCTTCAGGTGAAGATTCTTCAGCTTCAAATTGTGTCTCTTCATCGCTATTGTTGGATAGATATGATGATCAGATTTTGAGTACAACAATCA
TGTTGAATAAGGCAGTAAGGCGTGAGCTGAGAAGGCAACAGCGTTTAGCCGCAGAGCGGACGTGTGATATCTGTCAACAGAAGATACTTACCCATAAAGATGTAGCAACA
CTTGTGAACATGAAAACTGGAAGACTTGCCTGCAGTAGTCGAAATGTTAATGGGGTGTTTCATGTATTCCATACATCGTGCCTTATACATTGGATACTTCTTTGTGAGTA
TGAGATGAGTGTGAAGGATCTAGGCGGTCCAAAAGCTAGACGAAGGTACAGGAGAAAGAACAGGACTAAGGGCAATAAACACAGCAAGGACAGTGAAACGAGACAAATAA
AAACTCAAATCGATTCTGTATTCTGCCCAGCATGTCAGGGTACCGGTATAATTATTGATGGAGATGACCTAGAGAAACCAACTATCCCTCTTTCTGAGATCTTTAAATAT
AAAATAAAGGTGAGTGATGCCCGAAGAGCGTGGATGAAAAGTCCTGAGGTTCTGCAGAATTGTTCGACAGGTTGCCATTTCCCTTACCAATCTGAAGAAACTCTACAGGT
TTGTCGTCGTGTTTCCGAATGTTCTTGCATAAATCGCTTCCCCAAACATTCCGATCACTGGATGGATAATGTTCGTTGTAATACTGCTCGAGCTGAGCATCGACGAAGCG
ATTCTGAAATCTGCCGTAAAGAGGTACGAGAGTTGCGTGCGAGACTTCAGCCTTGCAAACGGGACAAAGACGTTCCCGTGGTTGGCATTGTTGCTGTGCAGATACGTTCT
GGTAATGAAGCCATTTATAAATGCAGGGCCAACAGAAGAGGTGACCACACAGTGTTACAACTGGATCTTCAACTGGGTCTAGGCATATGTTACAGTCAAAGCCATGGGAA
GCTCCACCATCAGCATCTGAAACACATGTTAGAAGAGTGGGTTGAATGGAAAGGGGTAGAGAAGAGAAGAGAGGAATGGGGTGTGGATTTGAAGAGGGGTGATGATGAGG
CTTCAGGCGTTTATATAAGAGCGAAAGAGACCCAAAAGGAAAAGAAAGCGAGGCATCTCCCTCTCGTTTAG
Protein sequence
MELGFPKSASYSLREQAARTILRNVRSQGHTYVELREDGKRFIFFCTLCLAPCYSDSVLFNHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPVERDNQIGMSNDN
HKRLLEYHNNDNNLAIVNYGENLKGSVNGHAEFNGNIRNMEVCSFENLNDGEHSCTLVIPGVLIRDEISDIKVRELGYGQIAARFTEKDEILTGVSRIWCEWLGRKGLLD
DVKLLLSSSPGAELENDENARVKRKKSFSDPEDVSESLSHQYDSSGEDSSASNCVSSSLLLDRYDDQILSTTIMLNKAVRRELRRQQRLAAERTCDICQQKILTHKDVAT
LVNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEMSVKDLGGPKARRRYRRKNRTKGNKHSKDSETRQIKTQIDSVFCPACQGTGIIIDGDDLEKPTIPLSEIFKY
KIKVSDARRAWMKSPEVLQNCSTGCHFPYQSEETLQVCRRVSECSCINRFPKHSDHWMDNVRCNTARAEHRRSDSEICRKEVRELRARLQPCKRDKDVPVVGIVAVQIRS
GNEAIYKCRANRRGDHTVLQLDLQLGLGICYSQSHGKLHHQHLKHMLEEWVEWKGVEKRREEWGVDLKRGDDEASGVYIRAKETQKEKKARHLPLV