; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0017272 (gene) of Chayote v1 genome

Gene IDSed0017272
OrganismSechium edule (Chayote v1)
Descriptionzinc finger CCCH domain-containing protein 13-like isoform X4
Genome locationLG01:64108612..64119581
RNA-Seq ExpressionSed0017272
SyntenySed0017272
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605779.1 hypothetical protein SDJN03_03096, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0074.34Show/hide
Query:  MDLHHKRNKPQDNDPDREVTKGKREGDLDAMHKQDHDCHQAHERDHDQESRHRRDRDRDHNHDQNGKQNRSRSRGRDRFSDYECDVDRDGSHLEDQYMKY
        MD+HHKRNKPQD+D DREVTK KREGDLDAM  QDHD H  +ERDHDQESR RRDRDRD   D++G+Q+RSRSR RDR+SDYECDVDRDGSHLEDQY KY
Subjt:  MDLHHKRNKPQDNDPDREVTKGKREGDLDAMHKQDHDCHQAHERDHDQESRHRRDRDRDHNHDQNGKQNRSRSRGRDRFSDYECDVDRDGSHLEDQYMKY

Query:  VDSRGRKRSPTDHDDSVEARSKSLKNS-HLANEEKKSLSNDKVDSDVERRTSHSRSRHADVSLSNLRRKSSPGSLSRGGTDEYRHQEQEHLRDRYSKKEE
        VDSRG+KRSP DHDDSV+ARSKSLKNS H ANEEKKSLS+DKVDSDVER  S SRSRHADVSLS+ RRKSSP SLSRGGTDEYRHQ+QE LRDRY KKEE
Subjt:  VDSRGRKRSPTDHDDSVEARSKSLKNS-HLANEEKKSLSNDKVDSDVERRTSHSRSRHADVSLSNLRRKSSPGSLSRGGTDEYRHQEQEHLRDRYSKKEE

Query:  RSKPNSTREKSVLSGVQEKGSKYTNAEKPSETDGGNAIDVSRDRSLNSKNGDIDENGRRHSTSVDAKDLSSNKDRHSWELQGEK---QMDDSSQAESYFT
        RSK  STR+K VLSGVQ+K SKYT ++K  ETDGGNAI++SRDRSLN KN DI+E+GRRHSTS+DAKDLSS+KDRHSWELQGEK    MDDSS AE YF+
Subjt:  RSKPNSTREKSVLSGVQEKGSKYTNAEKPSETDGGNAIDVSRDRSLNSKNGDIDENGRRHSTSVDAKDLSSNKDRHSWELQGEK---QMDDSSQAESYFT

Query:  KGSQSNPSPFHPHPAFRGN-DFPFDGSLEDDGRLNSNSRFRRGNDPNLGRIHGNTWRGIPNWTTPLPNSFIPFPHGPPHGSFQSIMPQFPAPPLFGIRPP
        K SQSNPSPFHP P FRG  D PFDGSLEDDGRLNSNSRFRRGNDP  GRIHGNTWRGIPNWT PLPN FIPF HGPPHGSFQSIMPQFPAPPLFGIRPP
Subjt:  KGSQSNPSPFHPHPAFRGN-DFPFDGSLEDDGRLNSNSRFRRGNDPNLGRIHGNTWRGIPNWTTPLPNSFIPFPHGPPHGSFQSIMPQFPAPPLFGIRPP

Query:  PEINHSGIPYRMPDAERFSSHMHPLGWQNILDGTSPSHLHGWDGNNGVFRDETHIYSGAEWDENRQMVNGRGWESKVEMWKRQNGSQKSELTSQFQKDDR
         EINHSGIPYR+PDAERF SHMHPLGWQN+LDG+SPSHLH WDGNNG+FRDE+HIYSGAEWDENRQM+NGRGWESK EMWKRQ+GS K EL S FQKD+R
Subjt:  PEINHSGIPYRMPDAERFSSHMHPLGWQNILDGTSPSHLHGWDGNNGVFRDETHIYSGAEWDENRQMVNGRGWESKVEMWKRQNGSQKSELTSQFQKDDR

Query:  SAQDLVDDVSSREVCDESADTILTKTAELRPNIPSAKESPNTAE-LSEMPAPLRQSMDDKSELRCSYLSKLKISTELARPDLYHQCQRLMYIEHCATADE
        S QD V+DVS+REVCDESADTILTKTAE+RP IPS KESPNT E L E P PL QSMDD S+L CSYL+KLKISTELA PDLYHQCQRLM IEHCAT DE
Subjt:  SAQDLVDDVSSREVCDESADTILTKTAELRPNIPSAKESPNTAE-LSEMPAPLRQSMDDKSELRCSYLSKLKISTELARPDLYHQCQRLMYIEHCATADE

Query:  ETSAYIILEGDMRAVSISLNSVHQSLIHPNKNSVFQDAMNLYRKQRMEMKEMQVVCGGMAALSERKLEGKGMQVLSVGMASFEGGLEEKSFDFNNEEVKA
        ET +YI+LEG M AVSIS NS HQS +H NK+SVFQ AM+LY+KQRMEMK+M+V+ GG A+ SER LE KGMQV S G +S E  LEE   +FNNEEVKA
Subjt:  ETSAYIILEGDMRAVSISLNSVHQSLIHPNKNSVFQDAMNLYRKQRMEMKEMQVVCGGMAALSERKLEGKGMQVLSVGMASFEGGLEEKSFDFNNEEVKA

Query:  PVSTIDVEAAQAPIDTTGDKETEATAASGKSDDLAS-TATEEIKCLDNWEEALPVTNSTEVDMLASE--QQTNLEAEKDTIVVSNDNVAVNDTEKLSNMD
        PVST+D E AQAPI T  DKE EAT A G+ +DLAS TA++ +KC +N EE+LPVTNSTEV  +A E  QQ NL+AEKDTI V  DN+ VNDT+KLSN++
Subjt:  PVSTIDVEAAQAPIDTTGDKETEATAASGKSDDLAS-TATEEIKCLDNWEEALPVTNSTEVDMLASE--QQTNLEAEKDTIVVSNDNVAVNDTEKLSNMD

Query:  IKGIANGKDSTRCGVGNSCFDNAV---SGPLSETC--------------------------EGLILSQIHHSPESTH
        +KGI  GKDS RC VG SC +NA       + E C                          E LILSQIHHSPESTH
Subjt:  IKGIANGKDSTRCGVGNSCFDNAV---SGPLSETC--------------------------EGLILSQIHHSPESTH

XP_008437591.1 PREDICTED: uncharacterized protein DDB_G0283697 [Cucumis melo]0.0e+0076.5Show/hide
Query:  MDLHHKRNKPQDNDPDREVTKGKREGDLDAMHKQDHDCHQAHERDHDQESRHRRD--RDRDHNHDQNGKQNRSRSRGRDRFSDYECDVDRDGSHLEDQYM
        MD+HHKRNKPQD+D DRE+TK KR+GDLD M  QDHD H  +ERDHDQESR RRD  RDRD  HD++G++NRSRSR RDR+SDYECDVDRDGSHLEDQY 
Subjt:  MDLHHKRNKPQDNDPDREVTKGKREGDLDAMHKQDHDCHQAHERDHDQESRHRRD--RDRDHNHDQNGKQNRSRSRGRDRFSDYECDVDRDGSHLEDQYM

Query:  KYVDSRGRKRSPTDHDDSVEARSKSLKNSHLANEEKKSLSNDKVDSDVERRTSHSRSRHADVSLSNLRRKSSPGSLSRGGTDEYRHQEQEHLRDRYSKKE
        KYVDSRGRKRSP DHDDSV+ARSKSLKNSH AN+EKKSLSNDKVDSD ER  S SRSRH DV+LS+ RRKSSP SLSR GTDEYRHQ+QE LRDRY KKE
Subjt:  KYVDSRGRKRSPTDHDDSVEARSKSLKNSHLANEEKKSLSNDKVDSDVERRTSHSRSRHADVSLSNLRRKSSPGSLSRGGTDEYRHQEQEHLRDRYSKKE

Query:  ERSKPNSTREKSVLSGVQEKGSKYTNAEKPSETDGGNAIDVSRDRSLNSKNGDIDENGRRHSTSVDAKDLSSNKDRHSWELQGEKQ-MDDSSQAESYFTK
        ERSK  STR+K VLSGVQEKGSKY+ +EKPSET+GGNA ++ RDRSLNSKN DI+E+GRRH+TS+DAKDLSSNKDRHSW++QGEK  MDDSSQAESY++K
Subjt:  ERSKPNSTREKSVLSGVQEKGSKYTNAEKPSETDGGNAIDVSRDRSLNSKNGDIDENGRRHSTSVDAKDLSSNKDRHSWELQGEKQ-MDDSSQAESYFTK

Query:  GSQSNPSPFHPHPAFRGN-DFPFDGSLEDDGRLNSNSRFRRGNDPNLGRIHGNTWRGIPNWTTPLPNSFIPFPHG-PPHGSFQSIMPQFPAPPLFGIRPP
        GSQSNPSPFH  PAFRG  D PFDGSL+DDGRLNSNSRFRRGNDPNLGR+HGN+WRG+PNW+ PLPN FIPF HG PPHGSFQSIMPQFPAPPLFGIRPP
Subjt:  GSQSNPSPFHPHPAFRGN-DFPFDGSLEDDGRLNSNSRFRRGNDPNLGRIHGNTWRGIPNWTTPLPNSFIPFPHG-PPHGSFQSIMPQFPAPPLFGIRPP

Query:  PEINHSGIPYRMPDAERFSSHMHPLGWQNILDGTSPSHLHGWDGNNGVFRDETHIYSGAEWDENRQMVNGRGWESKVEMWKRQNGSQKSELTSQFQKDDR
         EINHSGI YRMPDAERFSSHMH LGWQN+LDG+SPSHLHGWDGNNG+FRDE+HIYSGAEWDENRQMVNGRGWESK EMWKRQ+GS K EL SQFQKD+R
Subjt:  PEINHSGIPYRMPDAERFSSHMHPLGWQNILDGTSPSHLHGWDGNNGVFRDETHIYSGAEWDENRQMVNGRGWESKVEMWKRQNGSQKSELTSQFQKDDR

Query:  SAQDLVDDVSSREVCDESADTILTKTAELRPNIPSAKESPNTAEL-SEMPAPLRQSMDDKSELRCSYLSKLKISTELARPDLYHQCQRLMYIEHCATADE
        S QDLVDDVSSRE CDES +T+LTKTAE+RPNIPSAKESPNT EL SE PAPLR+SMDD S+L CSYLSKLKISTELA PDLYHQC RLM IEHCATADE
Subjt:  SAQDLVDDVSSREVCDESADTILTKTAELRPNIPSAKESPNTAEL-SEMPAPLRQSMDDKSELRCSYLSKLKISTELARPDLYHQCQRLMYIEHCATADE

Query:  ETSAYIILEGDMRAVSISLNSVHQSLIHPNKNSVFQDAMNLYRKQRMEMKEMQVVCGGMAALSERKLEGKGMQVLSVGMASFEGGLEEKSFDFNNEEVKA
        ET+ YI+LEG MRAVSIS +S  QSL HP+KNSVFQ AM+LY+KQRMEMKEMQVV  G+ + SER+LE KGMQV+S  MA+ E  LE  +FDFNN EVK 
Subjt:  ETSAYIILEGDMRAVSISLNSVHQSLIHPNKNSVFQDAMNLYRKQRMEMKEMQVVCGGMAALSERKLEGKGMQVLSVGMASFEGGLEEKSFDFNNEEVKA

Query:  PVSTIDVEAAQAPIDTTG-DKETEATAASGKSDDLASTAT-EEIKCLDNWEEALPVTNSTEVDMLASEQQ-TNLEAEKDTIVVSNDNVAVNDTEKLSNMD
        P ST DVE  Q PI T G D+E E T A GK + +AST + EE+KCL+N EE+LP +N  EVDM+ SEQQ  NL+AEKDT+ ++ DN AVND++K SN D
Subjt:  PVSTIDVEAAQAPIDTTG-DKETEATAASGKSDDLASTAT-EEIKCLDNWEEALPVTNSTEVDMLASEQQ-TNLEAEKDTIVVSNDNVAVNDTEKLSNMD

Query:  IKGIANGKDSTRCGVGNSCFDNAVSGPLS------ETCEG----------LILSQIHHSPESTH
        IKGIA G DS+RCGVGNSCFDNAVSGPLS      ETCEG          LILSQIHHSPESTH
Subjt:  IKGIANGKDSTRCGVGNSCFDNAVSGPLS------ETCEG----------LILSQIHHSPESTH

XP_023532838.1 uncharacterized protein LOC111794890 [Cucurbita pepo subsp. pepo]0.0e+0075.46Show/hide
Query:  MDLHHKRNKPQDNDPDREVTKGKREGDLDAMHKQDHDCHQAHERDHDQESRHRRDRDRDHNHDQNGKQNRSRSRGRDRFSDYECDVDRDGSHLEDQYMKY
        MD+HHKRNKPQD+D DREVTK KREGDLDAM  QDHD H  +ERDHDQESR RRDRDRD   D++G+Q+RSRSR RDR+SDYECDVDRDGSHLEDQY KY
Subjt:  MDLHHKRNKPQDNDPDREVTKGKREGDLDAMHKQDHDCHQAHERDHDQESRHRRDRDRDHNHDQNGKQNRSRSRGRDRFSDYECDVDRDGSHLEDQYMKY

Query:  VDSRGRKRSPTDHDDSVEARSKSLKNS-HLANEEKKSLSNDKVDSDVERRTSHSRSRHADVSLSNLRRKSSPGSLSRGGTDEYRHQEQEHLRDRYSKKEE
        VDSRG+KRSP DHDDSV+ARSKSLKNS H ANEEKKSLS+DKVDSDVER  S SRSRHADVSLS+ RRKSSP SLSRGGTDEYRHQ+QE LRDRY KKEE
Subjt:  VDSRGRKRSPTDHDDSVEARSKSLKNS-HLANEEKKSLSNDKVDSDVERRTSHSRSRHADVSLSNLRRKSSPGSLSRGGTDEYRHQEQEHLRDRYSKKEE

Query:  RSKPNSTREKSVLSGVQEKGSKYTNAEKPSETDGGNAIDVSRDRSLNSKNGDIDENGRRHSTSVDAKDLSSNKDRHSWELQGEK---QMDDSSQAESYFT
        RSK  STR+K VLSGVQ+K SKYT ++K  ETDGGNAI++SRDRSLN KN DI+E+GRRHSTS+DAKDLSSNKDRHSWELQGEK    MDDSS AE YF+
Subjt:  RSKPNSTREKSVLSGVQEKGSKYTNAEKPSETDGGNAIDVSRDRSLNSKNGDIDENGRRHSTSVDAKDLSSNKDRHSWELQGEK---QMDDSSQAESYFT

Query:  KGSQSNPSPFHPHPAFRGN-DFPFDGSLEDDGRLNSNSRFRRGNDPNLGRIHGNTWRGIPNWTTPLPNSFIPFPHGPPHGSFQSIMPQFPAPPLFGIRPP
        KGSQSNPSPFHP P FRG  D PFDGSLEDDGRLNSNSRFRRGNDP  GRIHGNTWRGIPNWT PLPN FIPF HGPPHGSFQSIMPQFPAPPLFGIRPP
Subjt:  KGSQSNPSPFHPHPAFRGN-DFPFDGSLEDDGRLNSNSRFRRGNDPNLGRIHGNTWRGIPNWTTPLPNSFIPFPHGPPHGSFQSIMPQFPAPPLFGIRPP

Query:  PEINHSGIPYRMPDAERFSSHMHPLGWQNILDGTSPSHLHGWDGNNGVFRDETHIYSGAEWDENRQMVNGRGWESKVEMWKRQNGSQKSELTSQFQKDDR
         EINHSGIPYR+PDAERF SHMHPLGWQN+LDG+SPSHLH WDGNNG+FRDE+HIYSGAEWDENRQM+NGRGWESK EMWKRQ+GS K EL S FQKD+R
Subjt:  PEINHSGIPYRMPDAERFSSHMHPLGWQNILDGTSPSHLHGWDGNNGVFRDETHIYSGAEWDENRQMVNGRGWESKVEMWKRQNGSQKSELTSQFQKDDR

Query:  SAQDLVDDVSSREVCDESADTILTKTAELRPNIPSAKESPNTAE-LSEMPAPLRQSMDDKSELRCSYLSKLKISTELARPDLYHQCQRLMYIEHCATADE
        S QD V+DVS+REVCDESADTILTKTAE+RP IPS KESPNT E L E P PL QSMDD S+L CSYL+KLKISTELA PDLYHQCQRLM IEHCATADE
Subjt:  SAQDLVDDVSSREVCDESADTILTKTAELRPNIPSAKESPNTAE-LSEMPAPLRQSMDDKSELRCSYLSKLKISTELARPDLYHQCQRLMYIEHCATADE

Query:  ETSAYIILEGDMRAVSISLNSVHQSLIHPNKNSVFQDAMNLYRKQRMEMKEMQVVCGGMAALSERKLEGKGMQVLSVGMASFEGGLEEKSFDFNNEEVKA
        ET +YI+LEG M AVSIS NS HQS +H NK+SVFQ AM+LY+KQRMEMK+M+V+ GG A+ SER LE KGMQV S G +S E  LEE  F+FNNEEVKA
Subjt:  ETSAYIILEGDMRAVSISLNSVHQSLIHPNKNSVFQDAMNLYRKQRMEMKEMQVVCGGMAALSERKLEGKGMQVLSVGMASFEGGLEEKSFDFNNEEVKA

Query:  PVSTIDVEAAQAPIDTTGDKETEATAASGKSDDLASTATEEIKCLDNWEEALPVTNSTEVDMLASE--QQTNLEAEKDTIVVSNDNVAVNDTEKLSNMDI
        PVST+D E AQ PI T  DKE EAT A G+  DLASTA++ +KC +N EE+LPVTNSTEV  +A E  QQ NL+AEKDTI V  DN+ VNDT+KLS++++
Subjt:  PVSTIDVEAAQAPIDTTGDKETEATAASGKSDDLASTATEEIKCLDNWEEALPVTNSTEVDMLASE--QQTNLEAEKDTIVVSNDNVAVNDTEKLSNMDI

Query:  KGIANGKDSTRCGVGNSCFDNAV---SGPLSETC------------------EGLILSQIHHSPESTH
        KGI   KDSTRCGVG SC +NA       + E C                  E LILSQIHHSPESTH
Subjt:  KGIANGKDSTRCGVGNSCFDNAV---SGPLSETC------------------EGLILSQIHHSPESTH

XP_031740997.1 uncharacterized protein DDB_G0283697 [Cucumis sativus]0.0e+0076.42Show/hide
Query:  MDLHHKRNKPQDNDPDREVTKGKREGDLDAMHKQDHDCHQAHERDHDQESRHRRD--RDRDHNHDQNGKQNRSRSRGRDRFSDYECDVDRDGSHLEDQYM
        MD+HHKRNKPQD+D DRE+TK KR+GDLDAM  QDHD H  +ERDHDQESR RRD  RDRD  HD++G++NRSRSR RDR+SDYECD+DRDGSHLEDQY 
Subjt:  MDLHHKRNKPQDNDPDREVTKGKREGDLDAMHKQDHDCHQAHERDHDQESRHRRD--RDRDHNHDQNGKQNRSRSRGRDRFSDYECDVDRDGSHLEDQYM

Query:  KYVDSRGRKRSPTDHDDSVEARSKSLKNSHLANEEKKSLSNDKVDSDVERRTSHSRSRHADVSLSNLRRKSSPGSLSRGGTDEYRHQEQEHLRDRYSKKE
        KYVDSRGRKRSP DHDDSV+ARSKSLKNSH AN+EKKSLSNDKVDSD ER  S SRSRH DV+LS+ RRKSSP SLSR GTDEYRHQ+QE LRDRY KKE
Subjt:  KYVDSRGRKRSPTDHDDSVEARSKSLKNSHLANEEKKSLSNDKVDSDVERRTSHSRSRHADVSLSNLRRKSSPGSLSRGGTDEYRHQEQEHLRDRYSKKE

Query:  ERSKPNSTREKSVLSGVQEKGSKYTNAEKPSETDGGNAIDVSRDRSLNSKNGDIDENGRRHSTSVDAKDLSSNKDRHSWELQGEKQ-MDDSSQAESYF-T
        ERSK  STR+K +LSGVQEKGSKY+ +EKPSET+G NA ++ RDRSLNSKN DI+E+GRRH+TS+DAKDLSSNKDRHSW++QGEK  MDD SQAESY+ +
Subjt:  ERSKPNSTREKSVLSGVQEKGSKYTNAEKPSETDGGNAIDVSRDRSLNSKNGDIDENGRRHSTSVDAKDLSSNKDRHSWELQGEKQ-MDDSSQAESYF-T

Query:  KGSQSNPSPFHPHPAFRGN-DFPFDGSLEDDGRLNSNSRFRRGNDPNLGRIHGNTWRGIPNWTTPLPNSFIPFPHG-PPHGSFQSIMPQFPAPPLFGIRP
        KGSQSNPSPFH  PAFRG  D PFDGSL+DDGRLNSNSRFRRGNDPNLGR+HGN+WRG+PNW+ PLPN FIPF HG PPHGSFQSIMPQFPAPPLFGIRP
Subjt:  KGSQSNPSPFHPHPAFRGN-DFPFDGSLEDDGRLNSNSRFRRGNDPNLGRIHGNTWRGIPNWTTPLPNSFIPFPHG-PPHGSFQSIMPQFPAPPLFGIRP

Query:  PPEINHSGIPYRMPDAERFSSHMHPLGWQNILDGTSPSHLHGWDGNNGVFRDETHIYSGAEWDENRQMVNGRGWESKVEMWKRQNGSQKSELTSQFQKDD
        P EINHSGI YRMPDAERFSSHMH LGWQN+LDG+SPSHLHGWDGNNG+FRDE+HIY+GAEWDENRQMVNGRGWESK EMWKRQ+GS K EL SQFQKD+
Subjt:  PPEINHSGIPYRMPDAERFSSHMHPLGWQNILDGTSPSHLHGWDGNNGVFRDETHIYSGAEWDENRQMVNGRGWESKVEMWKRQNGSQKSELTSQFQKDD

Query:  RSAQDLVDDVSSREVCDESADTILTKTAELRPNIPSAKESPNTAEL-SEMPAPLRQSMDDKSELRCSYLSKLKISTELARPDLYHQCQRLMYIEHCATAD
        RS  DLVDDVSSRE CDES DT+LTKTAE+RPNIPSAKESPNT EL SE PAPLRQSMDD S+L CSYLSKLKISTELA PDLYHQC RLM IEHCATAD
Subjt:  RSAQDLVDDVSSREVCDESADTILTKTAELRPNIPSAKESPNTAEL-SEMPAPLRQSMDDKSELRCSYLSKLKISTELARPDLYHQCQRLMYIEHCATAD

Query:  EETSAYIILEGDMRAVSISLNSVHQSLIHPNKNSVFQDAMNLYRKQRMEMKEMQVVCGGMAALSERKLEGKGMQVLSVGMASFEGGLEEKSFDFNNEEVK
        EET+AYI+LEG MRAVSIS +S HQSL HP+KNS+FQ AM+LY+KQRMEMKEMQVV  G+ + SER+LE K M+V+   MA+ E  LEEK+FDFNN EVK
Subjt:  EETSAYIILEGDMRAVSISLNSVHQSLIHPNKNSVFQDAMNLYRKQRMEMKEMQVVCGGMAALSERKLEGKGMQVLSVGMASFEGGLEEKSFDFNNEEVK

Query:  APVSTIDVEAAQAPIDTTG-DKETEATAASGKSDDLASTAT-EEIKCLDNWEEALPVTNSTEVDMLASEQ-QTNLEAEKDTIVVSNDNVAVNDTEKLSNM
         P ST+DVE  QAPI T G D+E E T A GK +D+AST + EE+KCL+N EE+LP +NS EVDM+ SEQ   NLEAEKDTI ++ DN  VND++K +N+
Subjt:  APVSTIDVEAAQAPIDTTG-DKETEATAASGKSDDLASTAT-EEIKCLDNWEEALPVTNSTEVDMLASEQ-QTNLEAEKDTIVVSNDNVAVNDTEKLSNM

Query:  DIKGIANGKDSTRCGVGNSCFDNAVSGPLS------ETCEG----------LILSQIHHSPESTH
        DIKGIA G DSTRCGVGNSCFDNAVSGPLS      ETCEG          LILSQIHHSPESTH
Subjt:  DIKGIANGKDSTRCGVGNSCFDNAVSGPLS------ETCEG----------LILSQIHHSPESTH

XP_038876328.1 LOW QUALITY PROTEIN: filaggrin [Benincasa hispida]0.0e+0077.06Show/hide
Query:  MDLHHKRNKPQDNDPDREVTKGKREGDLDAMHKQDHDCHQAHERDHDQESRHRRD--RDRDHNHDQNGKQNRSRSRGRDRFSDYECDVDRDGSHLEDQYM
        MD+HHKRNKPQD+D DREVTK KREGDLDAM            RDHDQESR RRD  RDRD +HD++G++NRSRSR RDR+SDYECDVDRDGSHLEDQY 
Subjt:  MDLHHKRNKPQDNDPDREVTKGKREGDLDAMHKQDHDCHQAHERDHDQESRHRRD--RDRDHNHDQNGKQNRSRSRGRDRFSDYECDVDRDGSHLEDQYM

Query:  KYVDSRGRKRSPTDHDDSVEARSKSLKNSHLANEEKKSLSNDKVDSDVERRTSHSRSRHADVSLSNLRRKSSPGSLSRGGTDEYRHQEQEHLRDRYSKKE
        KYVDSRGRKRSP DHDDSV+ARSKSLKNSH AN+EKKSLSNDKVDSD ER  S SRSRH DV+LS+ RRKSSP SLSR GTDEYRHQ+QE L+DRY KKE
Subjt:  KYVDSRGRKRSPTDHDDSVEARSKSLKNSHLANEEKKSLSNDKVDSDVERRTSHSRSRHADVSLSNLRRKSSPGSLSRGGTDEYRHQEQEHLRDRYSKKE

Query:  ERSKPNSTREKSVLSGVQEKGSKYTNAEKPSETDGGNAIDVSRDRSLNSKNGDIDENGRRHSTSVDAKDLSSNKDRHSWELQGEKQ-MDDSSQAESYFTK
        +RSK  STR+K VLSGVQEKGSKY+ +EKPSET+GGNA ++ RDRSLNSKN DI+E+GRRH+TS+DAKDLSSNKDRHSW++QGEK  MDDSSQAESY++K
Subjt:  ERSKPNSTREKSVLSGVQEKGSKYTNAEKPSETDGGNAIDVSRDRSLNSKNGDIDENGRRHSTSVDAKDLSSNKDRHSWELQGEKQ-MDDSSQAESYFTK

Query:  GSQSNPSPFHPHPAFRGN-DFPFDGSLEDDGRLNSNSRFRRGNDPNLGRIHGNTWRGIPNWTTPLPNSFIPFPHG-PPHGSFQSIMPQFPAPPLFGIRPP
        GSQ+NPSPFHP PAFRG  D PFDGSL+DDGRLNSN+RFRRG+DPNLGR+HGNTWRG+PNW+ PLPN FIPF HG PPHGSFQ  MPQFPAPPLFGIRPP
Subjt:  GSQSNPSPFHPHPAFRGN-DFPFDGSLEDDGRLNSNSRFRRGNDPNLGRIHGNTWRGIPNWTTPLPNSFIPFPHG-PPHGSFQSIMPQFPAPPLFGIRPP

Query:  PEINHSGIPYRMPDAERFSSHMHPLGWQNILDGTSPSHLHGWDGNNGVFRDETHIYSGAEWDENRQMVNGRGWESKVEMWKRQNGSQKSELTSQFQKDDR
         EINHSGI YRMPDAERFSSHMH LGWQN+LDG+SPSHLHGWDGNNG+FRDE+HIYSGAEWDENRQMVNGRGW+SK EMWKRQ+GS K EL SQFQKD+R
Subjt:  PEINHSGIPYRMPDAERFSSHMHPLGWQNILDGTSPSHLHGWDGNNGVFRDETHIYSGAEWDENRQMVNGRGWESKVEMWKRQNGSQKSELTSQFQKDDR

Query:  SAQDLVDDVSSREVCDESADTILTKTAELRPNIPSAKESPNTAEL-SEMPAPLRQSMDDKSELRCSYLSKLKISTELARPDLYHQCQRLMYIEHCATADE
        S QD VDDVSSREVCDESADTILTKTAE+RPNIPSAKESPNT EL SE P PLR+SMDD S+L CSYLSKLKISTELA PDLYHQCQRLM IEH  TADE
Subjt:  SAQDLVDDVSSREVCDESADTILTKTAELRPNIPSAKESPNTAEL-SEMPAPLRQSMDDKSELRCSYLSKLKISTELARPDLYHQCQRLMYIEHCATADE

Query:  ETSAYIILEGDMRAVSISLNSVHQSLIHPNKNSVFQDAMNLYRKQRMEMKEMQVVCGGMAALSERKLEGKGMQVLSVGMASFEGGLEEKSFDFNNEEVKA
        ET+AYI+LEG +RAVSIS NSVHQSL HP+KNSVFQ AM+LY+KQRMEMKEMQVV GGM + SER+LE KGMQV+S G+AS E  LEEK+FDFN+EEVKA
Subjt:  ETSAYIILEGDMRAVSISLNSVHQSLIHPNKNSVFQDAMNLYRKQRMEMKEMQVVCGGMAALSERKLEGKGMQVLSVGMASFEGGLEEKSFDFNNEEVKA

Query:  PVSTIDVEAAQAPIDTTG-DKETEATAASGKSDDLASTAT-EEIKCLDNWEEALPVTNSTEVDMLASEQQTNLEAEKDTIVVSNDNVAVNDTEKLSNMDI
        P+ST+D E  Q PI TTG DKE E   A GK +D+ASTA+ EE+KCL+N EE+LP+TN TEV M+ASE Q NL+AEKDT+VV+NDN+ V+DT+K SN D+
Subjt:  PVSTIDVEAAQAPIDTTG-DKETEATAASGKSDDLASTAT-EEIKCLDNWEEALPVTNSTEVDMLASEQQTNLEAEKDTIVVSNDNVAVNDTEKLSNMDI

Query:  KGIANGKDSTRCGVGNSCFDNAVSGPLS------ETCEG----------LILSQIHHSPESTH
        KGIAN KDSTR GVGNSCF+N VSGPLS      ETCEG          LILSQIHHSPESTH
Subjt:  KGIANGKDSTRCGVGNSCFDNAVSGPLS------ETCEG----------LILSQIHHSPESTH

TrEMBL top hitse value%identityAlignment
A0A0A0KJV1 Uncharacterized protein0.0e+0076.42Show/hide
Query:  MDLHHKRNKPQDNDPDREVTKGKREGDLDAMHKQDHDCHQAHERDHDQESRHRRD--RDRDHNHDQNGKQNRSRSRGRDRFSDYECDVDRDGSHLEDQYM
        MD+HHKRNKPQD+D DRE+TK KR+GDLDAM  QDHD H  +ERDHDQESR RRD  RDRD  HD++G++NRSRSR RDR+SDYECD+DRDGSHLEDQY 
Subjt:  MDLHHKRNKPQDNDPDREVTKGKREGDLDAMHKQDHDCHQAHERDHDQESRHRRD--RDRDHNHDQNGKQNRSRSRGRDRFSDYECDVDRDGSHLEDQYM

Query:  KYVDSRGRKRSPTDHDDSVEARSKSLKNSHLANEEKKSLSNDKVDSDVERRTSHSRSRHADVSLSNLRRKSSPGSLSRGGTDEYRHQEQEHLRDRYSKKE
        KYVDSRGRKRSP DHDDSV+ARSKSLKNSH AN+EKKSLSNDKVDSD ER  S SRSRH DV+LS+ RRKSSP SLSR GTDEYRHQ+QE LRDRY KKE
Subjt:  KYVDSRGRKRSPTDHDDSVEARSKSLKNSHLANEEKKSLSNDKVDSDVERRTSHSRSRHADVSLSNLRRKSSPGSLSRGGTDEYRHQEQEHLRDRYSKKE

Query:  ERSKPNSTREKSVLSGVQEKGSKYTNAEKPSETDGGNAIDVSRDRSLNSKNGDIDENGRRHSTSVDAKDLSSNKDRHSWELQGEKQ-MDDSSQAESYF-T
        ERSK  STR+K +LSGVQEKGSKY+ +EKPSET+G NA ++ RDRSLNSKN DI+E+GRRH+TS+DAKDLSSNKDRHSW++QGEK  MDD SQAESY+ +
Subjt:  ERSKPNSTREKSVLSGVQEKGSKYTNAEKPSETDGGNAIDVSRDRSLNSKNGDIDENGRRHSTSVDAKDLSSNKDRHSWELQGEKQ-MDDSSQAESYF-T

Query:  KGSQSNPSPFHPHPAFRGN-DFPFDGSLEDDGRLNSNSRFRRGNDPNLGRIHGNTWRGIPNWTTPLPNSFIPFPHG-PPHGSFQSIMPQFPAPPLFGIRP
        KGSQSNPSPFH  PAFRG  D PFDGSL+DDGRLNSNSRFRRGNDPNLGR+HGN+WRG+PNW+ PLPN FIPF HG PPHGSFQSIMPQFPAPPLFGIRP
Subjt:  KGSQSNPSPFHPHPAFRGN-DFPFDGSLEDDGRLNSNSRFRRGNDPNLGRIHGNTWRGIPNWTTPLPNSFIPFPHG-PPHGSFQSIMPQFPAPPLFGIRP

Query:  PPEINHSGIPYRMPDAERFSSHMHPLGWQNILDGTSPSHLHGWDGNNGVFRDETHIYSGAEWDENRQMVNGRGWESKVEMWKRQNGSQKSELTSQFQKDD
        P EINHSGI YRMPDAERFSSHMH LGWQN+LDG+SPSHLHGWDGNNG+FRDE+HIY+GAEWDENRQMVNGRGWESK EMWKRQ+GS K EL SQFQKD+
Subjt:  PPEINHSGIPYRMPDAERFSSHMHPLGWQNILDGTSPSHLHGWDGNNGVFRDETHIYSGAEWDENRQMVNGRGWESKVEMWKRQNGSQKSELTSQFQKDD

Query:  RSAQDLVDDVSSREVCDESADTILTKTAELRPNIPSAKESPNTAEL-SEMPAPLRQSMDDKSELRCSYLSKLKISTELARPDLYHQCQRLMYIEHCATAD
        RS  DLVDDVSSRE CDES DT+LTKTAE+RPNIPSAKESPNT EL SE PAPLRQSMDD S+L CSYLSKLKISTELA PDLYHQC RLM IEHCATAD
Subjt:  RSAQDLVDDVSSREVCDESADTILTKTAELRPNIPSAKESPNTAEL-SEMPAPLRQSMDDKSELRCSYLSKLKISTELARPDLYHQCQRLMYIEHCATAD

Query:  EETSAYIILEGDMRAVSISLNSVHQSLIHPNKNSVFQDAMNLYRKQRMEMKEMQVVCGGMAALSERKLEGKGMQVLSVGMASFEGGLEEKSFDFNNEEVK
        EET+AYI+LEG MRAVSIS +S HQSL HP+KNS+FQ AM+LY+KQRMEMKEMQVV  G+ + SER+LE K M+V+   MA+ E  LEEK+FDFNN EVK
Subjt:  EETSAYIILEGDMRAVSISLNSVHQSLIHPNKNSVFQDAMNLYRKQRMEMKEMQVVCGGMAALSERKLEGKGMQVLSVGMASFEGGLEEKSFDFNNEEVK

Query:  APVSTIDVEAAQAPIDTTG-DKETEATAASGKSDDLASTAT-EEIKCLDNWEEALPVTNSTEVDMLASEQ-QTNLEAEKDTIVVSNDNVAVNDTEKLSNM
         P ST+DVE  QAPI T G D+E E T A GK +D+AST + EE+KCL+N EE+LP +NS EVDM+ SEQ   NLEAEKDTI ++ DN  VND++K +N+
Subjt:  APVSTIDVEAAQAPIDTTG-DKETEATAASGKSDDLASTAT-EEIKCLDNWEEALPVTNSTEVDMLASEQ-QTNLEAEKDTIVVSNDNVAVNDTEKLSNM

Query:  DIKGIANGKDSTRCGVGNSCFDNAVSGPLS------ETCEG----------LILSQIHHSPESTH
        DIKGIA G DSTRCGVGNSCFDNAVSGPLS      ETCEG          LILSQIHHSPESTH
Subjt:  DIKGIANGKDSTRCGVGNSCFDNAVSGPLS------ETCEG----------LILSQIHHSPESTH

A0A1S3AUZ1 uncharacterized protein DDB_G02836970.0e+0076.5Show/hide
Query:  MDLHHKRNKPQDNDPDREVTKGKREGDLDAMHKQDHDCHQAHERDHDQESRHRRD--RDRDHNHDQNGKQNRSRSRGRDRFSDYECDVDRDGSHLEDQYM
        MD+HHKRNKPQD+D DRE+TK KR+GDLD M  QDHD H  +ERDHDQESR RRD  RDRD  HD++G++NRSRSR RDR+SDYECDVDRDGSHLEDQY 
Subjt:  MDLHHKRNKPQDNDPDREVTKGKREGDLDAMHKQDHDCHQAHERDHDQESRHRRD--RDRDHNHDQNGKQNRSRSRGRDRFSDYECDVDRDGSHLEDQYM

Query:  KYVDSRGRKRSPTDHDDSVEARSKSLKNSHLANEEKKSLSNDKVDSDVERRTSHSRSRHADVSLSNLRRKSSPGSLSRGGTDEYRHQEQEHLRDRYSKKE
        KYVDSRGRKRSP DHDDSV+ARSKSLKNSH AN+EKKSLSNDKVDSD ER  S SRSRH DV+LS+ RRKSSP SLSR GTDEYRHQ+QE LRDRY KKE
Subjt:  KYVDSRGRKRSPTDHDDSVEARSKSLKNSHLANEEKKSLSNDKVDSDVERRTSHSRSRHADVSLSNLRRKSSPGSLSRGGTDEYRHQEQEHLRDRYSKKE

Query:  ERSKPNSTREKSVLSGVQEKGSKYTNAEKPSETDGGNAIDVSRDRSLNSKNGDIDENGRRHSTSVDAKDLSSNKDRHSWELQGEKQ-MDDSSQAESYFTK
        ERSK  STR+K VLSGVQEKGSKY+ +EKPSET+GGNA ++ RDRSLNSKN DI+E+GRRH+TS+DAKDLSSNKDRHSW++QGEK  MDDSSQAESY++K
Subjt:  ERSKPNSTREKSVLSGVQEKGSKYTNAEKPSETDGGNAIDVSRDRSLNSKNGDIDENGRRHSTSVDAKDLSSNKDRHSWELQGEKQ-MDDSSQAESYFTK

Query:  GSQSNPSPFHPHPAFRGN-DFPFDGSLEDDGRLNSNSRFRRGNDPNLGRIHGNTWRGIPNWTTPLPNSFIPFPHG-PPHGSFQSIMPQFPAPPLFGIRPP
        GSQSNPSPFH  PAFRG  D PFDGSL+DDGRLNSNSRFRRGNDPNLGR+HGN+WRG+PNW+ PLPN FIPF HG PPHGSFQSIMPQFPAPPLFGIRPP
Subjt:  GSQSNPSPFHPHPAFRGN-DFPFDGSLEDDGRLNSNSRFRRGNDPNLGRIHGNTWRGIPNWTTPLPNSFIPFPHG-PPHGSFQSIMPQFPAPPLFGIRPP

Query:  PEINHSGIPYRMPDAERFSSHMHPLGWQNILDGTSPSHLHGWDGNNGVFRDETHIYSGAEWDENRQMVNGRGWESKVEMWKRQNGSQKSELTSQFQKDDR
         EINHSGI YRMPDAERFSSHMH LGWQN+LDG+SPSHLHGWDGNNG+FRDE+HIYSGAEWDENRQMVNGRGWESK EMWKRQ+GS K EL SQFQKD+R
Subjt:  PEINHSGIPYRMPDAERFSSHMHPLGWQNILDGTSPSHLHGWDGNNGVFRDETHIYSGAEWDENRQMVNGRGWESKVEMWKRQNGSQKSELTSQFQKDDR

Query:  SAQDLVDDVSSREVCDESADTILTKTAELRPNIPSAKESPNTAEL-SEMPAPLRQSMDDKSELRCSYLSKLKISTELARPDLYHQCQRLMYIEHCATADE
        S QDLVDDVSSRE CDES +T+LTKTAE+RPNIPSAKESPNT EL SE PAPLR+SMDD S+L CSYLSKLKISTELA PDLYHQC RLM IEHCATADE
Subjt:  SAQDLVDDVSSREVCDESADTILTKTAELRPNIPSAKESPNTAEL-SEMPAPLRQSMDDKSELRCSYLSKLKISTELARPDLYHQCQRLMYIEHCATADE

Query:  ETSAYIILEGDMRAVSISLNSVHQSLIHPNKNSVFQDAMNLYRKQRMEMKEMQVVCGGMAALSERKLEGKGMQVLSVGMASFEGGLEEKSFDFNNEEVKA
        ET+ YI+LEG MRAVSIS +S  QSL HP+KNSVFQ AM+LY+KQRMEMKEMQVV  G+ + SER+LE KGMQV+S  MA+ E  LE  +FDFNN EVK 
Subjt:  ETSAYIILEGDMRAVSISLNSVHQSLIHPNKNSVFQDAMNLYRKQRMEMKEMQVVCGGMAALSERKLEGKGMQVLSVGMASFEGGLEEKSFDFNNEEVKA

Query:  PVSTIDVEAAQAPIDTTG-DKETEATAASGKSDDLASTAT-EEIKCLDNWEEALPVTNSTEVDMLASEQQ-TNLEAEKDTIVVSNDNVAVNDTEKLSNMD
        P ST DVE  Q PI T G D+E E T A GK + +AST + EE+KCL+N EE+LP +N  EVDM+ SEQQ  NL+AEKDT+ ++ DN AVND++K SN D
Subjt:  PVSTIDVEAAQAPIDTTG-DKETEATAASGKSDDLASTAT-EEIKCLDNWEEALPVTNSTEVDMLASEQQ-TNLEAEKDTIVVSNDNVAVNDTEKLSNMD

Query:  IKGIANGKDSTRCGVGNSCFDNAVSGPLS------ETCEG----------LILSQIHHSPESTH
        IKGIA G DS+RCGVGNSCFDNAVSGPLS      ETCEG          LILSQIHHSPESTH
Subjt:  IKGIANGKDSTRCGVGNSCFDNAVSGPLS------ETCEG----------LILSQIHHSPESTH

A0A6J1E442 uncharacterized protein LOC111430427 isoform X20.0e+0071.48Show/hide
Query:  MDLHHKRNKPQDNDPDREVTKGKREGDLDAMHKQDHDCHQAHERDHDQESRHRRD------RDRDHNHDQNGKQNRSRSRGRDRFSDYECDVDRDGSHLE
        MD+HHKRNKPQD+DPDREVTK KREGD+DAM  QDHD H A+ERDH+QESR RRD      RDRD +HD++ +++RSRSR RDR+SDYECDVDRDGSH +
Subjt:  MDLHHKRNKPQDNDPDREVTKGKREGDLDAMHKQDHDCHQAHERDHDQESRHRRD------RDRDHNHDQNGKQNRSRSRGRDRFSDYECDVDRDGSHLE

Query:  DQYMKYVDSRGRKRSPTDHDDSVEARSKSLKNSHLANEEKKSLSNDKVDSDVERRTSHSRSRHADVSLSNLRRKSSPGSLSRGGTDEYRHQEQEHLRDRY
        DQY KYVDSRGRKRSP DHDDSV+ARSKSLKNSH AN+EKKSLSNDKVDSD ER  S SRSRH DVSLS+ RRKSSP S SR  TDEYRHQ+QE LRDRY
Subjt:  DQYMKYVDSRGRKRSPTDHDDSVEARSKSLKNSHLANEEKKSLSNDKVDSDVERRTSHSRSRHADVSLSNLRRKSSPGSLSRGGTDEYRHQEQEHLRDRY

Query:  SKKEERSKPNSTREKSVLSGVQEKGSKYTNAEKPSETDGGNAIDVSRDRSLNSKNGDIDENGRRHSTSVDAKDLSSNKDRHSWELQGEKQ-MDDSSQAES
         KKEERSK  STR+K VLS VQEKGSKYT +EKPSE +GGNA ++ RDR+LNSKN DI+E+GRRH+ S+DAKDLSSNKDRHSW++QGEK  MDDSSQ ES
Subjt:  SKKEERSKPNSTREKSVLSGVQEKGSKYTNAEKPSETDGGNAIDVSRDRSLNSKNGDIDENGRRHSTSVDAKDLSSNKDRHSWELQGEKQ-MDDSSQAES

Query:  YFTKGSQSNPSPFHPHPAFRGN-DFPFDGSLEDDGRLNSNSRFRRGNDPNLGRIHGNTWRGIPNWTTPLPNSFIPFPHG-PPHGSFQSIMPQFPAPPLFG
        Y++KGSQSNPSPFHP PAFRG  D PFDGSL+DDGRLNSNSRFRRGNDPN+GR+HGNTWRG+PNWT PLPN FIPF HG PPHGSFQS+MPQFPAPP+FG
Subjt:  YFTKGSQSNPSPFHPHPAFRGN-DFPFDGSLEDDGRLNSNSRFRRGNDPNLGRIHGNTWRGIPNWTTPLPNSFIPFPHG-PPHGSFQSIMPQFPAPPLFG

Query:  IRPPPEINHSGIPYRMPDAERFSSHMHPLGWQNILDGTSPSHLHGWDGNNGVFRDETHIYSGAEWDENRQMVNGRGWESKVEMWKRQNGSQKSELTSQFQ
        IRPP +INHSGI YRMPDA+RFSSHMHPLGWQN+LDG+SPSHLHGWD NNG+FRDE+HIY+GAEWDENRQMVNGRGW+SK EMWKRQ+GS K E+ SQFQ
Subjt:  IRPPPEINHSGIPYRMPDAERFSSHMHPLGWQNILDGTSPSHLHGWDGNNGVFRDETHIYSGAEWDENRQMVNGRGWESKVEMWKRQNGSQKSELTSQFQ

Query:  KDDRSAQDLVDDVSSREVCDESADTILTKTAELRPNIPSAKESPNTAE-LSEMPAPLRQSMDDKSELRCSYLSKLKISTELARPDLYHQCQRLMYIEHCA
        KD+RS QD VDDVSS+E+ DE+ADT+LTKT+E+RPNIPSAKESPNT E LSE PAPL +SMDD S+L CSYLSKL ISTELA PDLY QCQRLM IEHCA
Subjt:  KDDRSAQDLVDDVSSREVCDESADTILTKTAELRPNIPSAKESPNTAE-LSEMPAPLRQSMDDKSELRCSYLSKLKISTELARPDLYHQCQRLMYIEHCA

Query:  TADEETSAYIILEGDMRAVSISLNSVHQSLIHPNKNSVFQDAMNLYRKQRMEMKEMQVVCGGMAALSERKL--EGKGMQVLSVGMASFEGGLEEKSFDFN
        TADEET+AYI+LEG MRAVS+S NS   SL  PNKNSVFQ AM+LY+KQR EMKEMQ +   M + SER L  E +GMQV+S GMA  E   EE   +F 
Subjt:  TADEETSAYIILEGDMRAVSISLNSVHQSLIHPNKNSVFQDAMNLYRKQRMEMKEMQVVCGGMAALSERKL--EGKGMQVLSVGMASFEGGLEEKSFDFN

Query:  NEEVKAPVSTIDVEAAQAPIDTTG---------------DKETEATAASGKSDDLASTATEEIKCLDNWEEALPVTNSTEVDMLASEQQTNLEAEKDTIV
        NEEVKAPVST+D E  QAPI TTG               D   EA AA G+ +DLAS AT E+KCL+N EE++P+TNSTEVDM+ SEQ  NL+AEKDTIV
Subjt:  NEEVKAPVSTIDVEAAQAPIDTTG---------------DKETEATAASGKSDDLASTATEEIKCLDNWEEALPVTNSTEVDMLASEQQTNLEAEKDTIV

Query:  VSNDNVAVNDTEKLSNMDIKGIANGKDSTRCGVGNSCFDNAVSGPLS---------ETC------------------EGLILS-QIHHSPESTH
        +++DN  VN+  + SN D+KGI NGK+S  CGVGNSCFD AVSGPLS         E+C                  E LILS QIHHSPESTH
Subjt:  VSNDNVAVNDTEKLSNMDIKGIANGKDSTRCGVGNSCFDNAVSGPLS---------ETC------------------EGLILS-QIHHSPESTH

A0A6J1H3M6 filaggrin-like0.0e+0075.09Show/hide
Query:  MDLHHKRNKPQDNDPDREVTKGKREGDLDAMHKQDHDCHQAHERDHDQESRHRRDRDRDHNHDQNGKQNRSRSRGRDRFSDYECDVDRDGSHLEDQYMKY
        MD+HHKRNKPQD+D DREVTK KREGDLDAM  QDHD H  +ERDHDQESR RRDRDRD   D++G+Q+R RSR RDR+SDYECDVDRDGSHLEDQY KY
Subjt:  MDLHHKRNKPQDNDPDREVTKGKREGDLDAMHKQDHDCHQAHERDHDQESRHRRDRDRDHNHDQNGKQNRSRSRGRDRFSDYECDVDRDGSHLEDQYMKY

Query:  VDSRGRKRSPTDHDDSVEARSKSLKNS-HLANEEKKSLSNDKVDSDVERRTSHSRSRHADVSLSNLRRKSSPGSLSRGGTDEYRHQEQEHLRDRYSKKEE
        VDSRG+KRSP DHDDSV+ARSKSLKNS H ANEEKKSLS+DKVDSDVER  S SRSRHADVSLS+ RRKSSP SLSRGGTDEYRHQ+QE LRDRY KKEE
Subjt:  VDSRGRKRSPTDHDDSVEARSKSLKNS-HLANEEKKSLSNDKVDSDVERRTSHSRSRHADVSLSNLRRKSSPGSLSRGGTDEYRHQEQEHLRDRYSKKEE

Query:  RSKPNSTREKSVLSGVQEKGSKYTNAEKPSETDGGNAIDVSRDRSLNSKNGDIDENGRRHSTSVDAKDLSSNKDRHSWELQGEK---QMDDSSQAESYFT
        RSK  STR+K VLSGVQ+K SKYT ++K  ETDGGNAI++SRDRSLN KN DI+E+GRRHSTS+DAKDLSS+KDRHSWELQGEK    MDDSS AE YF+
Subjt:  RSKPNSTREKSVLSGVQEKGSKYTNAEKPSETDGGNAIDVSRDRSLNSKNGDIDENGRRHSTSVDAKDLSSNKDRHSWELQGEK---QMDDSSQAESYFT

Query:  KGSQSNPSPFHPHPAFRGN-DFPFDGSLEDDGRLNSNSRFRRGNDPNLGRIHGNTWRGIPNWTTPLPNSFIPFPHGPPHGSFQSIMPQFPAPPLFGIRPP
        KGSQSNPSPFHP P FRG  D PFDGSLEDDGRLNSNSRFR GNDP  GRIHGNTWRGIPNWT PLPN FIPF HGPPHGSFQSIMPQFPAPPLFGIRPP
Subjt:  KGSQSNPSPFHPHPAFRGN-DFPFDGSLEDDGRLNSNSRFRRGNDPNLGRIHGNTWRGIPNWTTPLPNSFIPFPHGPPHGSFQSIMPQFPAPPLFGIRPP

Query:  PEINHSGIPYRMPDAERFSSHMHPLGWQNILDGTSPSHLHGWDGNNGVFRDETHIYSGAEWDENRQMVNGRGWESKVEMWKRQNGSQKSELTSQFQKDDR
         EINHSGIPYR+PDAERF SHMHPLGWQN+LDG+SPSHLH WDGNNG+FRDE+HIYSGAEWDENRQM+NGRGWESK EMWKRQ+GS K EL S FQKD+R
Subjt:  PEINHSGIPYRMPDAERFSSHMHPLGWQNILDGTSPSHLHGWDGNNGVFRDETHIYSGAEWDENRQMVNGRGWESKVEMWKRQNGSQKSELTSQFQKDDR

Query:  SAQDLVDDVSSREVCDESADTILTKTAELRPNIPSAKESPNTAE-LSEMPAPLRQSMDDKSELRCSYLSKLKISTELARPDLYHQCQRLMYIEHCATADE
        S QD V+DVS+REVCDESADTILTKTAE+RP IPS KESPNT E L E P PL QSMDD S+L CSYL+KLKISTELA PDLYHQCQRLM IEHCATADE
Subjt:  SAQDLVDDVSSREVCDESADTILTKTAELRPNIPSAKESPNTAE-LSEMPAPLRQSMDDKSELRCSYLSKLKISTELARPDLYHQCQRLMYIEHCATADE

Query:  ETSAYIILEGDMRAVSISLNSVHQSLIHPNKNSVFQDAMNLYRKQRMEMKEMQVVCGGMAALSERKLEGKGMQVLSVGMASFEGGLEEKSFDFNNEEVKA
        ET +YI+LEG M AVSIS NS HQS +H NK+SVFQ AM+LY+KQRMEMK+M+V+  G A+ SER LE KGMQV S G +S E  LEE   +FNNEEVKA
Subjt:  ETSAYIILEGDMRAVSISLNSVHQSLIHPNKNSVFQDAMNLYRKQRMEMKEMQVVCGGMAALSERKLEGKGMQVLSVGMASFEGGLEEKSFDFNNEEVKA

Query:  PVSTIDVEAAQAPIDTTGDKETEATAASGKSDDLAS-TATEEIKCLDNWEEALPVTNSTEVDMLASE--QQTNLEAEKDTIVVSNDNVAVNDTEKLSNMD
        PVST+D E AQ  I T  DKE EAT ASG+ +DLAS TA++ +KC +N EE+LPVTNST+V  +A E  QQ NL+AEKDTI V  DN+ VNDT+KLSN++
Subjt:  PVSTIDVEAAQAPIDTTGDKETEATAASGKSDDLAS-TATEEIKCLDNWEEALPVTNSTEVDMLASE--QQTNLEAEKDTIVVSNDNVAVNDTEKLSNMD

Query:  IKGIANGKDSTRCGVGNSCFDNAV---SGPLSETC----------------EGLILSQIHHSPESTH
        +KGI  GKDSTRCGVG SC +NA       + E C                E LILSQIHHSPESTH
Subjt:  IKGIANGKDSTRCGVGNSCFDNAV---SGPLSETC----------------EGLILSQIHHSPESTH

A0A6J1I6E2 uncharacterized protein LOC111471538 isoform X20.0e+0070Show/hide
Query:  MDLHHKRNKPQDNDPDREVTKGKREGDLDAMHKQDHDCHQAHERDHDQESRHRRD--RDRDHNHDQNGKQNRSRSRGRDRFSDYECDVDRDGSHLEDQYM
        MD+HHKRNKPQD+DPDREVTK KREGD+DAM  QDHD H A+ERDH+QESR RRD  RDRD + D++ +++RSRSR RDR+SDYECDVDRDG H +DQY 
Subjt:  MDLHHKRNKPQDNDPDREVTKGKREGDLDAMHKQDHDCHQAHERDHDQESRHRRD--RDRDHNHDQNGKQNRSRSRGRDRFSDYECDVDRDGSHLEDQYM

Query:  KYVDSRGRKRSPTDHDDSVEARSKSLKNSHLANEEKKSLSNDKVDSDVERRTSHSRSRHADVSLSNLRRKSSPGSLSRGGTDEYRHQEQEHLRDRYSKKE
        KYVDSRGRKRSP DHDDSV+ARSKSLKNSH AN+EKKSLSNDKVDSD ER  S SRSRH DVSLS+ RRKSSP S SR  TDEYRHQ+QE LRDRY KKE
Subjt:  KYVDSRGRKRSPTDHDDSVEARSKSLKNSHLANEEKKSLSNDKVDSDVERRTSHSRSRHADVSLSNLRRKSSPGSLSRGGTDEYRHQEQEHLRDRYSKKE

Query:  ERSKPNSTREKSVLSGVQEKGSKYTNAEKPSETDGGNAIDVSRDRSLNSKNGDIDENGRRHSTSVDAKDLSSNKDRHSWELQGEKQ-MDDSSQAESYFTK
        +RSK  STR+K VLS VQEKGSKYT +EKPSE +GGNA ++ RDR+LNSKN DI+E+GRRH+ S+DAKDLSSNKDRHSW++QGEK  MDDSSQ ESY++K
Subjt:  ERSKPNSTREKSVLSGVQEKGSKYTNAEKPSETDGGNAIDVSRDRSLNSKNGDIDENGRRHSTSVDAKDLSSNKDRHSWELQGEKQ-MDDSSQAESYFTK

Query:  GSQSNPSPFHPHPAFRGN-DFPFDGSLEDDGRLNSNSRFRRGNDPNLGRIHGNTWRGIPNWTTPLPNSFIPFPHG-PPHGSFQSIMPQFPAPPLFGIRPP
        GSQSNPSPFHP PAFRG  D PFDGSL+DDGRLNSNS FRRGNDPN+GR+HGNTWRG+PNWT PLPN FIPF HG PPHGSFQS+MPQFPAPP+FGIRPP
Subjt:  GSQSNPSPFHPHPAFRGN-DFPFDGSLEDDGRLNSNSRFRRGNDPNLGRIHGNTWRGIPNWTTPLPNSFIPFPHG-PPHGSFQSIMPQFPAPPLFGIRPP

Query:  PEINHSGIPYRMPDAERFSSHMHPLGWQNILDGTSPSHLHGWDGNNGVFRDETHIYSGAEWDENRQMVNGRGWESKVEMWKRQNGSQKSELTSQFQKDDR
         +INHSGI YRMPDA+RFSSHMHPLGWQN+LDG+SPSHLHGWD NNG+FRDE+HIY+GAEWDENRQMVNGRGW+SK EMWKRQ+GS K E+ SQFQKD+R
Subjt:  PEINHSGIPYRMPDAERFSSHMHPLGWQNILDGTSPSHLHGWDGNNGVFRDETHIYSGAEWDENRQMVNGRGWESKVEMWKRQNGSQKSELTSQFQKDDR

Query:  SAQDLVDDVSSREVCDESADTILTKTAELRPNIPSAKESPNTAE-LSEMPAPLRQSMDDKSELRCSYLSKLKISTELARPDLYHQCQRLMYIEHCATADE
          QD VDDVSS+E+CDE+ADT+LTKTAE+RPNIPSAKESPNT E LSE PAPL +SMDD S+L CSYLSKLKISTELA PDLY QCQRLM IEHCATADE
Subjt:  SAQDLVDDVSSREVCDESADTILTKTAELRPNIPSAKESPNTAE-LSEMPAPLRQSMDDKSELRCSYLSKLKISTELARPDLYHQCQRLMYIEHCATADE

Query:  ETSAYIILEGDMRAVSISLNSVHQSLIHPNKNSVFQDAMNLYRKQRMEMKEMQVVCGGMAALSERKLEGKGMQVLSVGMASFEGGLEEKSFDFNNEEVKA
        ET+AYI+LEG MRAVS+S NS   SL  PNKNSVFQ AM+LY+KQR EMKEMQ +   M       +E +GMQV+S GMA  E   EEK F+FNNEEVKA
Subjt:  ETSAYIILEGDMRAVSISLNSVHQSLIHPNKNSVFQDAMNLYRKQRMEMKEMQVVCGGMAALSERKLEGKGMQVLSVGMASFEGGLEEKSFDFNNEEVKA

Query:  PVSTIDVEAAQAPIDTTG---------------DKETEATAASGKSDDLASTATEEIKCLDNWEEALPVTNSTEVDMLASEQQTNLEAEKDTIVVSNDNV
        PVST+D E  QAPI TTG               D   EA AA G+ +DLAS AT E+KCL+N EE++P TNSTEV M+ SEQQ NL+AEKDTIV++NDN 
Subjt:  PVSTIDVEAAQAPIDTTG---------------DKETEATAASGKSDDLASTATEEIKCLDNWEEALPVTNSTEVDMLASEQQTNLEAEKDTIVVSNDNV

Query:  AVNDTEKLS----------------------NMDIKGIANGKDSTRCGVGNSCFDNAVSGPLS---------ETC-------------------EGLILS
         VN+  + S                      N DIKGI NGK+S  CGVGNSCFD AVSGPLS         E+C                   E LILS
Subjt:  AVNDTEKLS----------------------NMDIKGIANGKDSTRCGVGNSCFDNAVSGPLS---------ETC-------------------EGLILS

Query:  QIHHSPESTH
        QI HSPESTH
Subjt:  QIHHSPESTH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G53440.1 unknown protein2.9e-7231.01Show/hide
Query:  DLHHKRNKPQDNDPDREVTKGKREGDLDAMHKQDHDCHQAHERDHDQESRHRRDRDRDHNHDQNGKQNRSRSRGRDRFSDYECDVDRDGSHLEDQYMKYV
        D  ++R++ +D D DRE  + +R+ + D  H +D D  ++ +RD D E     DR++D + D      RS+       SD E D DRD S L+DQ  +Y 
Subjt:  DLHHKRNKPQDNDPDREVTKGKREGDLDAMHKQDHDCHQAHERDHDQESRHRRDRDRDHNHDQNGKQNRSRSRGRDRFSDYECDVDRDGSHLEDQYMKYV

Query:  DSRGRKRSPTDHDDSVEARSKSLKNSHLANEEKKSLSNDKVDSDVERRTSHSRSRHADVSLSNLRRKSSPGSLSRGGTDEYRHQEQEHLRDRYSKKEERS
        D R  +RSP D+ D  +  + S ++S +  +   +    ++ S V +  + + S       S+       G   RG             R + S+K    
Subjt:  DSRGRKRSPTDHDDSVEARSKSLKNSHLANEEKKSLSNDKVDSDVERRTSHSRSRHADVSLSNLRRKSSPGSLSRGGTDEYRHQEQEHLRDRYSKKEERS

Query:  KPNSTREKSVLSGVQEKGSKYTNAEK-PSETDGGNAIDVSRDRSLNSKNG-----DIDENGRRHSTSVDAKDLS-SNKDRHSWELQGEKQMDDSSQAESY
        +     E         K S     E+ PS T        S +R  N++ G     +++E G R+    +A+D S + ++RH         +D++SQAE  
Subjt:  KPNSTREKSVLSGVQEKGSKYTNAEK-PSETDGGNAIDVSRDRSLNSKNG-----DIDENGRRHSTSVDAKDLS-SNKDRHSWELQGEKQMDDSSQAESY

Query:  FTKGSQSNPSPFHPHPAFR-GNDFPFDGSLEDDGRLNSNSRFRRGN-DPNLGRIHGNTWRGIPNWTTPLPNSFIPFPHGPPHGSFQSIMPQFPAPPLFGI
        F   +  N S F P P  R G   P  G  E+D R+N+  R++RG  D  +GR   N WRG+P+W +PL N + PF H PPHG+FQ++MPQFP+P LFG+
Subjt:  FTKGSQSNPSPFHPHPAFR-GNDFPFDGSLEDDGRLNSNSRFRRGN-DPNLGRIHGNTWRGIPNWTTPLPNSFIPFPHGPPHGSFQSIMPQFPAPPLFGI

Query:  RPPPEINHSGIPYRMPDAERFSSHMHPLGWQNILDGTSPSHLHGWDGN-NGVFRDETHIYSGAEWDENRQMVNGRGWESKVEMWKRQNGSQKSELTSQFQ
        RP  E+NH GI Y +PDAERFS HM PLGWQN++D +  SH+HG+ G+ +   RDE+++Y G+EWD+NR+M NGRGWES  + WK +NG    E++S   
Subjt:  RPPPEINHSGIPYRMPDAERFSSHMHPLGWQNILDGTSPSHLHGWDGN-NGVFRDETHIYSGAEWDENRQMVNGRGWESKVEMWKRQNGSQKSELTSQFQ

Query:  KDDRSAQDLVDDVSSREVCDESADTILTKTAELRPNIPS-AKE----SPNTAELSEMPAPLRQSMDDKSELRCSYLSKLKISTELARPDLYHQCQRLMYI
        KDD SAQ + DD S       S D    K+ E   N+ S AKE    SP T E      P+ +++D+       YLSKL +S  LA  +L  +C  L+  
Subjt:  KDDRSAQDLVDDVSSREVCDESADTILTKTAELRPNIPS-AKE----SPNTAELSEMPAPLRQSMDDKSELRCSYLSKLKISTELARPDLYHQCQRLMYI

Query:  EHCATADEETSAYIIL-EGDMRAVSISLNSVHQSLIHPNKN-SVFQDAMNLYRKQRMEMKEM---------QVVCGGMAALSER-----------KLEGK
        E     D+ T+ ++ L EG  R    + NS+    + P++N SVFQ AM+ Y++QR E+K +         QV    +  +               +E  
Subjt:  EHCATADEETSAYIIL-EGDMRAVSISLNSVHQSLIHPNKN-SVFQDAMNLYRKQRMEMKEM---------QVVCGGMAALSER-----------KLEGK

Query:  GMQVLSVGMASFEGGLEEKSFDFNNEEVKAPVSTIDVEAAQAPIDTTGDKETEATAA---SGKSDDLAS---TATEEIKCLDNW---EEALPVTNSTEVD
         M++  V  +  +   +E     +N   K    T D E + +P      +   A ++    G  + +AS     +EE   LD+    E+   + +   VD
Subjt:  GMQVLSVGMASFEGGLEEKSFDFNNEEVKAPVSTIDVEAAQAPIDTTGDKETEATAA---SGKSDDLAS---TATEEIKCLDNW---EEALPVTNSTEVD

Query:  M---LASEQQTNLEAEKDTIVVSNDNVAVNDTEKLSNM----------DIKGIANGKDSTR---CGVGNSCFDNA---VSGPLSETCEGLILSQIHHSPE
             A E     E +  T+ V+   +   D ++  ++          + KG  +G+++      G G+    +    V+ PL    +  ++S+IHHSP+
Subjt:  M---LASEQQTNLEAEKDTIVVSNDNVAVNDTEKLSNM----------DIKGIANGKDSTR---CGVGNSCFDNA---VSGPLSETCEGLILSQIHHSPE

Query:  STH
        STH
Subjt:  STH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTTGCATCATAAGAGAAACAAGCCTCAAGATAATGATCCTGATCGAGAGGTAACCAAAGGCAAACGTGAGGGTGATCTAGATGCCATGCACAAGCAGGATCATGA
TTGCCATCAAGCGCATGAACGTGATCATGATCAAGAGAGTAGGCATAGACGTGACCGTGACCGTGACCACAACCATGATCAGAATGGGAAACAGAATCGTAGTCGAAGTC
GTGGTCGTGACCGTTTCTCTGATTATGAATGTGATGTTGATCGTGATGGATCACATCTTGAGGATCAATACATGAAATATGTGGATAGTAGGGGAAGGAAAAGATCACCA
ACTGATCACGATGATTCCGTTGAAGCTAGATCTAAAAGTTTGAAAAATAGTCACCTTGCAAATGAAGAAAAGAAGTCTTTGAGCAATGATAAAGTGGACTCAGATGTTGA
AAGAAGAACATCTCATTCACGATCTCGGCATGCAGATGTTAGTTTAAGCAACCTTAGACGGAAGAGTTCACCCGGTTCTTTGTCACGTGGTGGCACAGATGAATACAGGC
ATCAAGAACAGGAACATTTGAGAGACCGATACTCAAAAAAGGAAGAAAGGTCCAAACCCAATTCTACAAGAGAAAAAAGTGTTCTGTCAGGAGTACAAGAAAAGGGTTCC
AAGTACACTAATGCGGAGAAACCTAGTGAAACAGACGGTGGCAATGCTATTGATGTGTCACGCGACAGGTCTTTAAATTCTAAGAATGGTGATATCGATGAAAATGGACG
AAGGCACAGTACTTCTGTTGATGCCAAAGACCTCTCTTCTAATAAGGATAGGCATAGTTGGGAATTACAAGGAGAGAAGCAGATGGATGATTCATCTCAGGCAGAGTCCT
ACTTTACCAAAGGTAGTCAGAGCAATCCATCACCGTTCCATCCGCACCCTGCTTTTAGGGGGAATGACTTTCCTTTTGATGGTTCACTAGAAGATGATGGCAGACTCAAT
TCTAATAGCCGTTTCCGAAGGGGGAATGATCCTAATTTGGGTAGAATACATGGAAACACTTGGAGAGGCATTCCGAACTGGACAACTCCACTACCTAATAGCTTTATCCC
TTTCCCGCATGGGCCTCCCCATGGAAGTTTCCAATCAATTATGCCACAGTTTCCTGCACCACCTTTGTTTGGTATCAGACCTCCACCTGAAATCAATCACTCTGGAATTC
CTTATCGGATGCCTGATGCCGAACGATTTTCCAGTCACATGCATCCTCTAGGGTGGCAGAATATTTTGGATGGTACGAGCCCTTCTCACTTACACGGATGGGATGGAAAT
AACGGAGTCTTTAGGGATGAAACTCACATTTATAGCGGAGCTGAATGGGATGAAAACAGGCAGATGGTGAATGGTCGAGGATGGGAGTCCAAAGTTGAAATGTGGAAGAG
ACAGAATGGTTCCCAGAAAAGTGAGTTAACTTCCCAATTTCAGAAGGATGACCGTTCCGCGCAAGATCTTGTTGATGATGTATCAAGTAGAGAGGTGTGTGATGAGAGTG
CTGATACTATTTTGACAAAAACTGCTGAATTAAGGCCTAATATCCCTTCTGCAAAAGAAAGCCCCAACACTGCTGAACTCTCCGAAATGCCAGCTCCCCTTAGACAGTCG
ATGGACGATAAATCTGAACTTCGTTGTTCTTACCTTTCCAAGCTTAAGATTTCCACAGAACTTGCACGTCCGGATTTGTACCACCAGTGCCAGAGATTAATGTATATTGA
GCACTGTGCTACTGCAGATGAGGAAACTTCTGCTTACATAATTCTCGAGGGTGACATGAGAGCAGTGTCCATCTCTTTAAATAGTGTGCATCAATCTCTTATCCATCCAA
ACAAGAACTCGGTATTTCAGGATGCAATGAACTTGTACAGAAAGCAGAGAATGGAAATGAAGGAGATGCAAGTTGTTTGTGGGGGAATGGCGGCACTCTCTGAGAGGAAA
CTTGAAGGGAAGGGAATGCAAGTTCTTTCTGTGGGGATGGCTTCCTTTGAGGGTGGACTTGAAGAGAAGAGCTTCGATTTCAATAATGAAGAAGTTAAGGCTCCTGTTTC
GACTATTGATGTTGAAGCGGCGCAGGCACCCATCGATACCACTGGCGATAAGGAAACGGAGGCGACTGCTGCGTCGGGGAAATCGGATGATTTGGCTTCAACTGCCACTG
AGGAGATCAAGTGTCTTGATAACTGGGAGGAGGCATTGCCAGTAACCAATTCTACAGAAGTGGATATGTTGGCTTCGGAGCAGCAGACAAACTTAGAGGCTGAAAAGGAT
ACCATTGTCGTATCGAATGACAATGTAGCAGTCAACGACACCGAGAAATTGAGTAATATGGACATCAAGGGGATTGCCAATGGGAAAGATTCAACCAGATGTGGAGTTGG
TAATTCTTGCTTTGACAATGCAGTGAGTGGTCCTTTATCTGAGACTTGTGAGGGATTAATTTTGAGTCAGATACATCATTCTCCTGAAAGTACACATTGA
mRNA sequenceShow/hide mRNA sequence
GGGACAGGGAAAGGGTTAGGGATAGGGACAGAGAGAAGGAAAGAAAAGGCAGAGAAGGGAGAAGTGAGAGGGTGGTTGCAAGCGAAGAACACCGCGTTGAAAAGCAAGTT
GAAAGGAATACAGAAAATGTGTTGCATAGCCCTGGATTAGAGAATCACTTGGATGTACGAGTAAGGAAAAGAGCTGGTTCTTTTGACGAGGATAAACATAAAGGTGATAT
TGGAGATGTGGAAAATAGACAACCATCTTCAAAGAATGATGTGAGAGATGAAAGACGAAAGAGTGAGAAGCACAAGGATGAGAGAAATAGAGGGAAGTATCGGGGAGATG
CTGATAGAGATGGCAAGGAAAGAGATGAGCAACTTGTAAAAGATCACATTAGCAGATCGAATGACAAAGATTTGAGAGATGAGAAGGATGCTATGGATTTGCATCATAAG
AGAAACAAGCCTCAAGATAATGATCCTGATCGAGAGGTAACCAAAGGCAAACGTGAGGGTGATCTAGATGCCATGCACAAGCAGGATCATGATTGCCATCAAGCGCATGA
ACGTGATCATGATCAAGAGAGTAGGCATAGACGTGACCGTGACCGTGACCACAACCATGATCAGAATGGGAAACAGAATCGTAGTCGAAGTCGTGGTCGTGACCGTTTCT
CTGATTATGAATGTGATGTTGATCGTGATGGATCACATCTTGAGGATCAATACATGAAATATGTGGATAGTAGGGGAAGGAAAAGATCACCAACTGATCACGATGATTCC
GTTGAAGCTAGATCTAAAAGTTTGAAAAATAGTCACCTTGCAAATGAAGAAAAGAAGTCTTTGAGCAATGATAAAGTGGACTCAGATGTTGAAAGAAGAACATCTCATTC
ACGATCTCGGCATGCAGATGTTAGTTTAAGCAACCTTAGACGGAAGAGTTCACCCGGTTCTTTGTCACGTGGTGGCACAGATGAATACAGGCATCAAGAACAGGAACATT
TGAGAGACCGATACTCAAAAAAGGAAGAAAGGTCCAAACCCAATTCTACAAGAGAAAAAAGTGTTCTGTCAGGAGTACAAGAAAAGGGTTCCAAGTACACTAATGCGGAG
AAACCTAGTGAAACAGACGGTGGCAATGCTATTGATGTGTCACGCGACAGGTCTTTAAATTCTAAGAATGGTGATATCGATGAAAATGGACGAAGGCACAGTACTTCTGT
TGATGCCAAAGACCTCTCTTCTAATAAGGATAGGCATAGTTGGGAATTACAAGGAGAGAAGCAGATGGATGATTCATCTCAGGCAGAGTCCTACTTTACCAAAGGTAGTC
AGAGCAATCCATCACCGTTCCATCCGCACCCTGCTTTTAGGGGGAATGACTTTCCTTTTGATGGTTCACTAGAAGATGATGGCAGACTCAATTCTAATAGCCGTTTCCGA
AGGGGGAATGATCCTAATTTGGGTAGAATACATGGAAACACTTGGAGAGGCATTCCGAACTGGACAACTCCACTACCTAATAGCTTTATCCCTTTCCCGCATGGGCCTCC
CCATGGAAGTTTCCAATCAATTATGCCACAGTTTCCTGCACCACCTTTGTTTGGTATCAGACCTCCACCTGAAATCAATCACTCTGGAATTCCTTATCGGATGCCTGATG
CCGAACGATTTTCCAGTCACATGCATCCTCTAGGGTGGCAGAATATTTTGGATGGTACGAGCCCTTCTCACTTACACGGATGGGATGGAAATAACGGAGTCTTTAGGGAT
GAAACTCACATTTATAGCGGAGCTGAATGGGATGAAAACAGGCAGATGGTGAATGGTCGAGGATGGGAGTCCAAAGTTGAAATGTGGAAGAGACAGAATGGTTCCCAGAA
AAGTGAGTTAACTTCCCAATTTCAGAAGGATGACCGTTCCGCGCAAGATCTTGTTGATGATGTATCAAGTAGAGAGGTGTGTGATGAGAGTGCTGATACTATTTTGACAA
AAACTGCTGAATTAAGGCCTAATATCCCTTCTGCAAAAGAAAGCCCCAACACTGCTGAACTCTCCGAAATGCCAGCTCCCCTTAGACAGTCGATGGACGATAAATCTGAA
CTTCGTTGTTCTTACCTTTCCAAGCTTAAGATTTCCACAGAACTTGCACGTCCGGATTTGTACCACCAGTGCCAGAGATTAATGTATATTGAGCACTGTGCTACTGCAGA
TGAGGAAACTTCTGCTTACATAATTCTCGAGGGTGACATGAGAGCAGTGTCCATCTCTTTAAATAGTGTGCATCAATCTCTTATCCATCCAAACAAGAACTCGGTATTTC
AGGATGCAATGAACTTGTACAGAAAGCAGAGAATGGAAATGAAGGAGATGCAAGTTGTTTGTGGGGGAATGGCGGCACTCTCTGAGAGGAAACTTGAAGGGAAGGGAATG
CAAGTTCTTTCTGTGGGGATGGCTTCCTTTGAGGGTGGACTTGAAGAGAAGAGCTTCGATTTCAATAATGAAGAAGTTAAGGCTCCTGTTTCGACTATTGATGTTGAAGC
GGCGCAGGCACCCATCGATACCACTGGCGATAAGGAAACGGAGGCGACTGCTGCGTCGGGGAAATCGGATGATTTGGCTTCAACTGCCACTGAGGAGATCAAGTGTCTTG
ATAACTGGGAGGAGGCATTGCCAGTAACCAATTCTACAGAAGTGGATATGTTGGCTTCGGAGCAGCAGACAAACTTAGAGGCTGAAAAGGATACCATTGTCGTATCGAAT
GACAATGTAGCAGTCAACGACACCGAGAAATTGAGTAATATGGACATCAAGGGGATTGCCAATGGGAAAGATTCAACCAGATGTGGAGTTGGTAATTCTTGCTTTGACAA
TGCAGTGAGTGGTCCTTTATCTGAGACTTGTGAGGGATTAATTTTGAGTCAGATACATCATTCTCCTGAAAGTACACATTGAAACAATTTTAATATCGTTGCTTCTTTAT
TTCATAGTCAAGTTATTAATTTTTCTATTCATGTTGCTTCATCTTCATGCAAGGAATAAAATTTCCTATTCTTCTGCACCTTGGTGCATGGTCTTGAGGCTGCTCTTATT
TTTGGTCTTTAAAGGGGTGGTGCAGTAGTTGAAGACTTGGGTTTTGAGGGTATATTTCTCTCAAGGTTTCAGGTGGTTGTCCACTTCGAGGAGCCGAACAAATTTGCGGA
GTTGGGGAAACTTCTACCTGCAGCAAGGTATTATCTATTTGCATTTATATCATTCTATATTATTTTATCTTTTATATTAATTTGGTGTCCACAGTTTGTTGTCATTTATT
ACCATTAAAAAG
Protein sequenceShow/hide protein sequence
MDLHHKRNKPQDNDPDREVTKGKREGDLDAMHKQDHDCHQAHERDHDQESRHRRDRDRDHNHDQNGKQNRSRSRGRDRFSDYECDVDRDGSHLEDQYMKYVDSRGRKRSP
TDHDDSVEARSKSLKNSHLANEEKKSLSNDKVDSDVERRTSHSRSRHADVSLSNLRRKSSPGSLSRGGTDEYRHQEQEHLRDRYSKKEERSKPNSTREKSVLSGVQEKGS
KYTNAEKPSETDGGNAIDVSRDRSLNSKNGDIDENGRRHSTSVDAKDLSSNKDRHSWELQGEKQMDDSSQAESYFTKGSQSNPSPFHPHPAFRGNDFPFDGSLEDDGRLN
SNSRFRRGNDPNLGRIHGNTWRGIPNWTTPLPNSFIPFPHGPPHGSFQSIMPQFPAPPLFGIRPPPEINHSGIPYRMPDAERFSSHMHPLGWQNILDGTSPSHLHGWDGN
NGVFRDETHIYSGAEWDENRQMVNGRGWESKVEMWKRQNGSQKSELTSQFQKDDRSAQDLVDDVSSREVCDESADTILTKTAELRPNIPSAKESPNTAELSEMPAPLRQS
MDDKSELRCSYLSKLKISTELARPDLYHQCQRLMYIEHCATADEETSAYIILEGDMRAVSISLNSVHQSLIHPNKNSVFQDAMNLYRKQRMEMKEMQVVCGGMAALSERK
LEGKGMQVLSVGMASFEGGLEEKSFDFNNEEVKAPVSTIDVEAAQAPIDTTGDKETEATAASGKSDDLASTATEEIKCLDNWEEALPVTNSTEVDMLASEQQTNLEAEKD
TIVVSNDNVAVNDTEKLSNMDIKGIANGKDSTRCGVGNSCFDNAVSGPLSETCEGLILSQIHHSPESTH