; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh20G011750 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh20G011750
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Descriptionprotein HGV2
Genome locationCmo_Chr20:11557983..11563011
RNA-Seq ExpressionCmoCh20G011750
SyntenyCmoCh20G011750
Gene Ontology termsGO:0006335 - DNA replication-dependent nucleosome assembly (biological process)
GO:0034080 - CENP-A containing nucleosome assembly (biological process)
GO:0005654 - nucleoplasm (cellular component)
GO:0042393 - histone binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6571417.1 hypothetical protein SDJN03_30332, partial [Cucurbita argyrosperma subsp. sororia]3.3e-24188.93Show/hide
Query:  MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSDGSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIR
        MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKP AESTAQTSDGSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIR
Subjt:  MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSDGSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIR

Query:  AARYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDVSSKKDQDEEEGDDSGTEDLA
        AARYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGEP+QQSYK ESVKNVENGESSKASVSSNAELVDGVADDVSSKKDQDEEEGDDSGTEDLA
Subjt:  AARYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDVSSKKDQDEEEGDDSGTEDLA

Query:  DADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSILERL
        DADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALER                                 EDIETSLSDYQKALSILERL
Subjt:  DADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSILERL

Query:  VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTASSTSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLLVE
        VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTASSTSGSEPE+PLSSNDSQTDNANAATEKQSEIEILSGLLVE
Subjt:  VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTASSTSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLLVE

Query:  LEKKASLDYNFCKETEKSNQCFVFVQLEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFDSPTVSTAHTNGAAGVTHLGV
        LEKK                      LEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFDSPTVSTAHTNGAAGVTHLGV
Subjt:  LEKKASLDYNFCKETEKSNQCFVFVQLEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFDSPTVSTAHTNGAAGVTHLGV

Query:  VGRGVKRVSTNSESADSSHPTKKRATDSSTQDK
        VGRGVKRVSTNSESADSSHPTKKRATDSSTQDK
Subjt:  VGRGVKRVSTNSESADSSHPTKKRATDSSTQDK

KAG7011181.1 Nuclear autoantigenic sperm protein, partial [Cucurbita argyrosperma subsp. argyrosperma]6.4e-24589.24Show/hide
Query:  MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSDGSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIR
        MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKP AESTAQTSDGSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIR
Subjt:  MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSDGSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIR

Query:  AARYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDVSSKKDQDEEEGDDSGTEDLA
        AARYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGEP+QQSYKDESVKNVENGESSKASVSSNAELVDGVADDVSSKKDQDEEEGDDSGTEDLA
Subjt:  AARYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDVSSKKDQDEEEGDDSGTEDLA

Query:  DADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSILERL
        DADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALER                                 EDIETSLSDYQKALSILERL
Subjt:  DADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSILERL

Query:  VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTASSTSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLLVE
        VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTASSTSGSEPE+PLSSNDSQTDNANAATEKQSEIEILSGLLVE
Subjt:  VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTASSTSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLLVE

Query:  LEKKASLDYNFCKETEKSNQCFVFVQLEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFDSPTVSTAHTNGAAGVTHLGV
        LEKK                      LEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFDSPTVSTAHTNGAAGVTHLGV
Subjt:  LEKKASLDYNFCKETEKSNQCFVFVQLEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFDSPTVSTAHTNGAAGVTHLGV

Query:  VGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA
        VGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA
Subjt:  VGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA

XP_022928044.1 protein HGV2 [Cucurbita moschata]4.5e-24689.8Show/hide
Query:  MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSDGSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIR
        MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSDGSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIR
Subjt:  MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSDGSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIR

Query:  AARYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDVSSKKDQDEEEGDDSGTEDLA
        AARYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDVSSKKDQDEEEGDDSGTEDLA
Subjt:  AARYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDVSSKKDQDEEEGDDSGTEDLA

Query:  DADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSILERL
        DADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALER                                 EDIETSLSDYQKALSILERL
Subjt:  DADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSILERL

Query:  VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTASSTSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLLVE
        VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTASSTSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLLVE
Subjt:  VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTASSTSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLLVE

Query:  LEKKASLDYNFCKETEKSNQCFVFVQLEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFDSPTVSTAHTNGAAGVTHLGV
        LEKK                      LEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFDSPTVSTAHTNGAAGVTHLGV
Subjt:  LEKKASLDYNFCKETEKSNQCFVFVQLEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFDSPTVSTAHTNGAAGVTHLGV

Query:  VGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA
        VGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA
Subjt:  VGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA

XP_022971716.1 protein HGV2 [Cucurbita maxima]2.3e-24288.31Show/hide
Query:  MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSDGSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIR
        MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGL+SSCNCSNETKP AESTAQTSDGSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIR
Subjt:  MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSDGSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIR

Query:  AARYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDVSSKKDQDEEEGDDSGTEDLA
        AARYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGEP+QQSYKDESVK+ ENGESSKASVSSNAELVDGVADDVSSKKDQDEEEGDDSGTEDLA
Subjt:  AARYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDVSSKKDQDEEEGDDSGTEDLA

Query:  DADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSILERL
        DADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALER                                 EDIETSLSDYQKALSILERL
Subjt:  DADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSILERL

Query:  VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTASSTSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLLVE
        VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTASSTSGSEPE+PLSSNDSQ+D+ANAATEKQSEIEILSGLLVE
Subjt:  VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTASSTSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLLVE

Query:  LEKKASLDYNFCKETEKSNQCFVFVQLEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFDSPTVSTAHTNGAAGVTHLGV
        LEKK                      LEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFDSPTVSTAHTNGAAGVTHLGV
Subjt:  LEKKASLDYNFCKETEKSNQCFVFVQLEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFDSPTVSTAHTNGAAGVTHLGV

Query:  VGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA
        VGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA
Subjt:  VGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA

XP_023513004.1 LOW QUALITY PROTEIN: protein HGV2 [Cucurbita pepo subsp. pepo]2.4e-24489.05Show/hide
Query:  MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSDGSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIR
        MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSDGSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIR
Subjt:  MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSDGSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIR

Query:  AARYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDVSSKKDQDEEEGDDSGTEDLA
        AARYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGEP+QQSYKDESVKN ENGESSKASVSSNAELVDGVADDVSSKKDQDEEEGDDSGTEDLA
Subjt:  AARYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDVSSKKDQDEEEGDDSGTEDLA

Query:  DADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSILERL
        DADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDIL ALAEVALER                                 EDIETSLSDYQKALSILERL
Subjt:  DADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSILERL

Query:  VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTASSTSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLLVE
        VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTASSTSGSEPE+PLSSNDSQTDNANAATEKQSEIEILSGLLVE
Subjt:  VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTASSTSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLLVE

Query:  LEKKASLDYNFCKETEKSNQCFVFVQLEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFDSPTVSTAHTNGAAGVTHLGV
        LEKK                      LEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFDSPTVSTAHTNGAAGVTHLGV
Subjt:  LEKKASLDYNFCKETEKSNQCFVFVQLEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFDSPTVSTAHTNGAAGVTHLGV

Query:  VGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA
        VGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA
Subjt:  VGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA

TrEMBL top hitse value%identityAlignment
A0A0A0LMR2 TPR_REGION domain-containing protein1.1e-19776.06Show/hide
Query:  MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSDGSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIR
        MADEDPPSE+SVT+ KPKLDE LNVS+ TTES V GGL SSCN  NE KP  + TAQTSD SG+KSL+LAEELLEKGSKA+KDNDF EAVDCFSRALEIR
Subjt:  MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSDGSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIR

Query:  AARYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDVS---SKKDQDEEEGDDSGTE
        AA YGELA ECVKLYYKYGCALLYKAQEEADPLGAVPKKEG    QS KD+SVK+  NGESSKASVSSNAE VDGV DDVS   SKKD+DEEE D S  E
Subjt:  AARYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDVS---SKKDQDEEEGDDSGTE

Query:  DLADADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSIL
        DLADADEDESDLDLAWKMLDVARAIVEKDS DTMEKVDILSALAEVALER                                 EDI TSLSDYQKALSIL
Subjt:  DLADADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSIL

Query:  ERLVEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTASSTSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGL
        ERLVEPDNRQLAELNFRVCLCLEFGSQPQEAIS+CQKAISICKSRV+RLTDEVK +IVPTTASSTSGSEPE+PLSSN SQTDN NA TEKQSEI+ LSGL
Subjt:  ERLVEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTASSTSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGL

Query:  LVELEKKASLDYNFCKETEKSNQCFVFVQLEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFDSPTVSTAHTNGAAGVTH
        LVELEKK                      LEDLQQ ASNPKSILSEILGIG+AK  +EKI PP+P+V NSSQMGSA+SNGGFDSPTVSTAHTN   GVTH
Subjt:  LVELEKKASLDYNFCKETEKSNQCFVFVQLEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFDSPTVSTAHTNGAAGVTH

Query:  LGVVGRGVKRVSTNSESADSSHPTKKRATD-SSTQDKGDGSSA
        LGVVGRGVKRVSTNSES D S+PTKK A D SS+QDKGD SSA
Subjt:  LGVVGRGVKRVSTNSESADSSHPTKKRATD-SSTQDKGDGSSA

A0A1S3C6L8 NASP-related protein sim31.5e-20277.68Show/hide
Query:  MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSDGSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIR
        MADEDPPSE+SVT+ KPKLDE LNVS+ TTES   GGLDSSCN  NE KP  E TAQTSDGSGEKSLELAEELLEKGSKA+KDNDF EAVDCFSRALEIR
Subjt:  MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSDGSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIR

Query:  AARYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDVS---SKKDQDEEEGDDSGTE
        AA YGELA ECVKLYYKYGCALLYKAQEEADPLGAVPKKEG    QS K ES K+  NGESSKASVSSNAE+VDGV DDVS   SKKDQDEEE DDS  E
Subjt:  AARYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDVS---SKKDQDEEEGDDSGTE

Query:  DLADADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSIL
        DLADADEDESDLDLAWKMLDVARAIVEKDS DTMEKVDILSALAEVALER                                 EDI TSLSDYQKALSIL
Subjt:  DLADADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSIL

Query:  ERLVEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTASSTSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGL
        ERLVEPDNRQLAELNFRVCLCLEFGSQPQEAIS+CQKAISICKSRV+RLTDEVK +IVPTTASSTSGSEPEIPLSSN SQTDN NAATEKQSEIE LSGL
Subjt:  ERLVEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTASSTSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGL

Query:  LVELEKKASLDYNFCKETEKSNQCFVFVQLEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFDSPTVSTAHTNGAAGVTH
        LVELEKK                      LEDLQQLASNP SILSEILGIG+AK  +EKI PP+P+V NSSQMGSANSNGGFDSPTVSTAHTN   GVTH
Subjt:  LVELEKKASLDYNFCKETEKSNQCFVFVQLEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFDSPTVSTAHTNGAAGVTH

Query:  LGVVGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA
        LGVVGRGVKRVSTNSES D S+PTKK A D S+QDKGD SSA
Subjt:  LGVVGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA

A0A6J1E053 NASP-related protein sim38.9e-20076.11Show/hide
Query:  MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSDGSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIR
        MADE PPSE+SVT+ KPK+DE LN S+ T ES+  GG++SS NCSN+   A E+TAQTSDGSGEKSLE+AEELLEKGSKA+KDNDF EAVDCFSRALEIR
Subjt:  MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSDGSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIR

Query:  AARYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDVSSKKDQDEEEGDDSGTEDLA
        AA YGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGE +Q+S KD SVK+  NGESSKASVSSNAELVDGV DDVSSKKDQDEE+ D+S  EDLA
Subjt:  AARYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDVSSKKDQDEEEGDDSGTEDLA

Query:  DADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSILERL
        +ADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALER                                 EDIETSLSDYQKALSILERL
Subjt:  DADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSILERL

Query:  VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTASSTSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLLVE
        VEPDNRQLAELNFR+CLCLEFGS+PQEAI FCQKAISICKSRV+RLTDEVK I+VPTTASSTSGSEP   LSSN SQ D  NAA+EKQSEIE LSGLLVE
Subjt:  VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTASSTSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLLVE

Query:  LEKKASLDYNFCKETEKSNQCFVFVQLEDLQQLASNPKSILSEILGIGAAKAKV-EKIAPPLPAVLNSSQMGSANSNGGFDSPTVSTAHTNGAAGVTHLG
        LEKK                      LEDLQQLASNPKSILSEILGIG+A++KV EK AP  PA LNSSQ+ SANSNGGFDSPTVSTAHTNGA GVTHLG
Subjt:  LEKKASLDYNFCKETEKSNQCFVFVQLEDLQQLASNPKSILSEILGIGAAKAKV-EKIAPPLPAVLNSSQMGSANSNGGFDSPTVSTAHTNGAAGVTHLG

Query:  VVGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA
        VVGRGVKRVSTNSESA+ S+P KK A DSS+QDKGDGSSA
Subjt:  VVGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA

A0A6J1EJQ1 protein HGV22.2e-24689.8Show/hide
Query:  MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSDGSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIR
        MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSDGSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIR
Subjt:  MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSDGSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIR

Query:  AARYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDVSSKKDQDEEEGDDSGTEDLA
        AARYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDVSSKKDQDEEEGDDSGTEDLA
Subjt:  AARYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDVSSKKDQDEEEGDDSGTEDLA

Query:  DADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSILERL
        DADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALER                                 EDIETSLSDYQKALSILERL
Subjt:  DADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSILERL

Query:  VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTASSTSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLLVE
        VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTASSTSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLLVE
Subjt:  VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTASSTSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLLVE

Query:  LEKKASLDYNFCKETEKSNQCFVFVQLEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFDSPTVSTAHTNGAAGVTHLGV
        LEKK                      LEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFDSPTVSTAHTNGAAGVTHLGV
Subjt:  LEKKASLDYNFCKETEKSNQCFVFVQLEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFDSPTVSTAHTNGAAGVTHLGV

Query:  VGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA
        VGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA
Subjt:  VGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA

A0A6J1I412 protein HGV21.1e-24288.31Show/hide
Query:  MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSDGSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIR
        MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGL+SSCNCSNETKP AESTAQTSDGSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIR
Subjt:  MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSDGSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIR

Query:  AARYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDVSSKKDQDEEEGDDSGTEDLA
        AARYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGEP+QQSYKDESVK+ ENGESSKASVSSNAELVDGVADDVSSKKDQDEEEGDDSGTEDLA
Subjt:  AARYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDVSSKKDQDEEEGDDSGTEDLA

Query:  DADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSILERL
        DADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALER                                 EDIETSLSDYQKALSILERL
Subjt:  DADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSILERL

Query:  VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTASSTSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLLVE
        VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTASSTSGSEPE+PLSSNDSQ+D+ANAATEKQSEIEILSGLLVE
Subjt:  VEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTASSTSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLLVE

Query:  LEKKASLDYNFCKETEKSNQCFVFVQLEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFDSPTVSTAHTNGAAGVTHLGV
        LEKK                      LEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFDSPTVSTAHTNGAAGVTHLGV
Subjt:  LEKKASLDYNFCKETEKSNQCFVFVQLEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFDSPTVSTAHTNGAAGVTHLGV

Query:  VGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA
        VGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA
Subjt:  VGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA

SwissProt top hitse value%identityAlignment
Q17886 Protein NASP homolog 17.6e-0722.98Show/hide
Query:  NCSNETKPAAESTAQTSDGSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIRAARYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGE
        + S ++      T    +   +K   LA ELL  G +A+K ND  +A D  S A E+ +  YGE         Y YG A L  A+EE+  L    +KE  
Subjt:  NCSNETKPAAESTAQTSDGSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIRAARYGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGE

Query:  PYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDVSSKKDQDEEEGDDSGTEDLADADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAE
          +Q+   +   + ENGE+ K                         E+G++SG E+    D+D+  + L+W++L+ AR I          +   +SA+ E
Subjt:  PYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDVSSKKDQDEEEGDDSGTEDLADADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAE

Query:  VALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSR
          L+    A  L+    H I                 +     +  D  +AL+I   ++ P +R++A+    +       +   E + +  K   +  +R
Subjt:  VALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSR

Query:  VMRLTDEVK
           L  E++
Subjt:  VMRLTDEVK

Arabidopsis top hitse value%identityAlignment
AT4G37210.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.8e-10548.54Show/hide
Query:  DPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTA-QTSDGSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIRAAR
        +P +E++ TL+ P L  I    +AT ES V GG +S+CN       AA+S A +  D   EK+LE AEEL EKGS  +K+NDF EAVDCFSRALEIR A 
Subjt:  DPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTA-QTSDGSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIRAAR

Query:  YGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGEPYQQSYKDESV-KNVENGESSKASVSSNAELVDGVADDVSSKKDQDEEEGDDSGTEDLA--
        YGEL  EC+  YY+YG ALL KAQ EADPLG +PKKEGE  Q+S   ES+  +V +G+  +   SS  E         S  KDQ  E+G+D   +DL+  
Subjt:  YGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGEPYQQSYKDESV-KNVENGESSKASVSSNAELVDGVADDVSSKKDQDEEEGDDSGTEDLA--

Query:  --DADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSILE
          DADEDESDLD+AWKMLD+AR I +K S +TMEKVDIL +LAEV+LER                                 EDIE+SLSDY+ ALSILE
Subjt:  --DADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSILE

Query:  RLVEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTASSTSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLL
        RLVEPD+R+ AELNFR+C+CLE G QP+EAI +CQKA+ ICK+R+ RL++E+KG     T+S+ S  +  I  SSN    D   +A++K+ EI  L+GL 
Subjt:  RLVEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTASSTSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLL

Query:  VELEKKASLDYNFCKETEKSNQCFVFVQLEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFD--SPTVSTAHT-----NG
         +LEKK                      LEDL+Q A NPK +L+E++G+ +AK        P  A ++SS+MG+ N+N G D  SPTVSTAHT       
Subjt:  VELEKKASLDYNFCKETEKSNQCFVFVQLEDLQQLASNPKSILSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFD--SPTVSTAHT-----NG

Query:  AAGVTHLGVVGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA
        A+GVTHLGVVGRGVKRV  N+ S +SS  +KK A + S  DK DG+S+
Subjt:  AAGVTHLGVVGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA

AT4G37210.2 Tetratricopeptide repeat (TPR)-like superfamily protein3.6e-8450.25Show/hide
Query:  DPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTA-QTSDGSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIRAAR
        +P +E++ TL+ P L  I    +AT ES V GG +S+CN       AA+S A +  D   EK+LE AEEL EKGS  +K+NDF EAVDCFSRALEIR A 
Subjt:  DPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTA-QTSDGSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIRAAR

Query:  YGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGEPYQQSYKDESV-KNVENGESSKASVSSNAELVDGVADDVSSKKDQDEEEGDDSGTEDLA--
        YGEL  EC+  YY+YG ALL KAQ EADPLG +PKKEGE  Q+S   ES+  +V +G+  +   SS  E         S  KDQ  E+G+D   +DL+  
Subjt:  YGELAPECVKLYYKYGCALLYKAQEEADPLGAVPKKEGEPYQQSYKDESV-KNVENGESSKASVSSNAELVDGVADDVSSKKDQDEEEGDDSGTEDLA--

Query:  --DADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSILE
          DADEDESDLD+AWKMLD+AR I +K S +TMEKVDIL +LAEV+LER                                 EDIE+SLSDY+ ALSILE
Subjt:  --DADEDESDLDLAWKMLDVARAIVEKDSGDTMEKVDILSALAEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSILE

Query:  RLVEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTASSTSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLL
        RLVEPD+R+ AELNFR+C+CLE G QP+EAI +CQKA+ ICK+R+ RL++E+KG     T+S+ S  +  I  SSN    D   +A++K+ EI  L+GL 
Subjt:  RLVEPDNRQLAELNFRVCLCLEFGSQPQEAISFCQKAISICKSRVMRLTDEVKGIIVPTTASSTSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLL

Query:  VELEKKAS
         +LEKKA+
Subjt:  VELEKKAS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGACGAAGATCCACCATCGGAGCTCTCAGTGACATTGCAGAAACCGAAACTCGACGAAATCCTGAACGTCAGTAAGGCCACCACTGAGTCCGATGTTCTGGGAGG
CCTCGACTCGTCTTGCAATTGTTCCAACGAAACGAAGCCCGCTGCTGAATCCACTGCTCAAACCTCTGATGGAAGTGGTGAAAAATCGTTGGAGCTGGCCGAAGAGTTGC
TGGAGAAGGGTTCCAAGGCTATTAAGGACAATGATTTCGTTGAGGCTGTCGATTGCTTCAGCCGTGCCCTTGAGATTAGAGCTGCGCGATATGGTGAACTGGCTCCGGAA
TGTGTTAAATTGTACTACAAATATGGATGTGCTTTATTGTACAAAGCCCAGGAGGAGGCAGATCCACTGGGAGCTGTCCCAAAGAAAGAAGGTGAACCCTATCAACAATC
TTACAAAGATGAATCTGTTAAGAATGTTGAAAATGGCGAGTCATCAAAAGCTTCTGTTTCCAGCAATGCTGAACTGGTGGATGGAGTTGCAGATGATGTTTCCAGTAAGA
AAGATCAGGATGAAGAAGAGGGTGATGATAGTGGTACTGAGGACTTAGCAGATGCAGATGAAGACGAATCTGACCTTGATTTAGCTTGGAAAATGCTAGACGTTGCCAGA
GCAATAGTTGAAAAAGACTCGGGTGATACGATGGAGAAAGTGGACATATTGTCAGCCTTGGCAGAAGTTGCTCTGGAAAGAGGTACACATGCATTTTATTTGCTCTGTTG
TGATACACATTCAATTGGTTTGGTTTGTGTTTCTTTGATATTATTCCCCTTTCTTTCTCCCCAGTTAGAGGAGGACATTGAAACTTCCCTGAGCGACTACCAGAAAGCAT
TATCAATTTTAGAAAGACTTGTCGAACCTGATAATCGACAGCTTGCTGAATTAAACTTCCGTGTATGCTTGTGCCTGGAGTTTGGTTCCCAGCCGCAGGAAGCCATTTCA
TTTTGCCAGAAGGCAATATCAATTTGCAAGTCACGTGTGATGCGGCTTACTGATGAAGTAAAGGGTATCATTGTACCGACCACAGCTTCTTCTACTTCTGGGTCAGAACC
AGAGATCCCACTATCGTCCAATGACTCCCAGACTGACAATGCCAATGCTGCAACAGAGAAACAATCTGAGATTGAAATTCTATCTGGGCTTTTGGTCGAGCTAGAAAAGA
AGGCGAGTTTAGATTATAACTTCTGCAAAGAAACGGAAAAATCTAACCAATGTTTTGTGTTTGTGCAGCTTGAAGATCTGCAACAGCTGGCCTCAAACCCAAAGTCAATC
CTCTCGGAGATCCTCGGGATAGGAGCGGCGAAGGCGAAAGTCGAAAAGATCGCTCCTCCACTTCCAGCGGTGTTGAACTCCTCACAGATGGGTTCAGCTAACAGCAATGG
AGGATTTGACTCTCCAACAGTCTCAACTGCGCACACGAACGGCGCGGCTGGAGTGACGCACCTTGGCGTCGTTGGAAGAGGAGTCAAACGAGTATCAACAAATTCAGAGT
CTGCTGACTCCTCCCACCCAACTAAGAAACGGGCAACAGATTCATCAACACAAGATAAAGGTGATGGCAGTTCCGCCTGA
mRNA sequenceShow/hide mRNA sequence
TGTCGCGTTTCCGCCCAAAAAGGATTTTAAAGCTTGAAGTGCGGACTTCTCTGCTTAAACCTCCATTTTCTCCAAGCTTCGAAACTATAGCCATGGCCGACGAAGATCCA
CCATCGGAGCTCTCAGTGACATTGCAGAAACCGAAACTCGACGAAATCCTGAACGTCAGTAAGGCCACCACTGAGTCCGATGTTCTGGGAGGCCTCGACTCGTCTTGCAA
TTGTTCCAACGAAACGAAGCCCGCTGCTGAATCCACTGCTCAAACCTCTGATGGAAGTGGTGAAAAATCGTTGGAGCTGGCCGAAGAGTTGCTGGAGAAGGGTTCCAAGG
CTATTAAGGACAATGATTTCGTTGAGGCTGTCGATTGCTTCAGCCGTGCCCTTGAGATTAGAGCTGCGCGATATGGTGAACTGGCTCCGGAATGTGTTAAATTGTACTAC
AAATATGGATGTGCTTTATTGTACAAAGCCCAGGAGGAGGCAGATCCACTGGGAGCTGTCCCAAAGAAAGAAGGTGAACCCTATCAACAATCTTACAAAGATGAATCTGT
TAAGAATGTTGAAAATGGCGAGTCATCAAAAGCTTCTGTTTCCAGCAATGCTGAACTGGTGGATGGAGTTGCAGATGATGTTTCCAGTAAGAAAGATCAGGATGAAGAAG
AGGGTGATGATAGTGGTACTGAGGACTTAGCAGATGCAGATGAAGACGAATCTGACCTTGATTTAGCTTGGAAAATGCTAGACGTTGCCAGAGCAATAGTTGAAAAAGAC
TCGGGTGATACGATGGAGAAAGTGGACATATTGTCAGCCTTGGCAGAAGTTGCTCTGGAAAGAGGTACACATGCATTTTATTTGCTCTGTTGTGATACACATTCAATTGG
TTTGGTTTGTGTTTCTTTGATATTATTCCCCTTTCTTTCTCCCCAGTTAGAGGAGGACATTGAAACTTCCCTGAGCGACTACCAGAAAGCATTATCAATTTTAGAAAGAC
TTGTCGAACCTGATAATCGACAGCTTGCTGAATTAAACTTCCGTGTATGCTTGTGCCTGGAGTTTGGTTCCCAGCCGCAGGAAGCCATTTCATTTTGCCAGAAGGCAATA
TCAATTTGCAAGTCACGTGTGATGCGGCTTACTGATGAAGTAAAGGGTATCATTGTACCGACCACAGCTTCTTCTACTTCTGGGTCAGAACCAGAGATCCCACTATCGTC
CAATGACTCCCAGACTGACAATGCCAATGCTGCAACAGAGAAACAATCTGAGATTGAAATTCTATCTGGGCTTTTGGTCGAGCTAGAAAAGAAGGCGAGTTTAGATTATA
ACTTCTGCAAAGAAACGGAAAAATCTAACCAATGTTTTGTGTTTGTGCAGCTTGAAGATCTGCAACAGCTGGCCTCAAACCCAAAGTCAATCCTCTCGGAGATCCTCGGG
ATAGGAGCGGCGAAGGCGAAAGTCGAAAAGATCGCTCCTCCACTTCCAGCGGTGTTGAACTCCTCACAGATGGGTTCAGCTAACAGCAATGGAGGATTTGACTCTCCAAC
AGTCTCAACTGCGCACACGAACGGCGCGGCTGGAGTGACGCACCTTGGCGTCGTTGGAAGAGGAGTCAAACGAGTATCAACAAATTCAGAGTCTGCTGACTCCTCCCACC
CAACTAAGAAACGGGCAACAGATTCATCAACACAAGATAAAGGTGATGGCAGTTCCGCCTGA
Protein sequenceShow/hide protein sequence
MADEDPPSELSVTLQKPKLDEILNVSKATTESDVLGGLDSSCNCSNETKPAAESTAQTSDGSGEKSLELAEELLEKGSKAIKDNDFVEAVDCFSRALEIRAARYGELAPE
CVKLYYKYGCALLYKAQEEADPLGAVPKKEGEPYQQSYKDESVKNVENGESSKASVSSNAELVDGVADDVSSKKDQDEEEGDDSGTEDLADADEDESDLDLAWKMLDVAR
AIVEKDSGDTMEKVDILSALAEVALERGTHAFYLLCCDTHSIGLVCVSLILFPFLSPQLEEDIETSLSDYQKALSILERLVEPDNRQLAELNFRVCLCLEFGSQPQEAIS
FCQKAISICKSRVMRLTDEVKGIIVPTTASSTSGSEPEIPLSSNDSQTDNANAATEKQSEIEILSGLLVELEKKASLDYNFCKETEKSNQCFVFVQLEDLQQLASNPKSI
LSEILGIGAAKAKVEKIAPPLPAVLNSSQMGSANSNGGFDSPTVSTAHTNGAAGVTHLGVVGRGVKRVSTNSESADSSHPTKKRATDSSTQDKGDGSSA