CuGenDBv2

HG10002801 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10002801
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Zf-4CXXC_R1 domain-containing protein
Genome location: Chr11:12881265..12886361
RNA-Seq Expression: HG10002801
Synteny: HG10002801
Gene Ontology terms:
  GO:0006355 - regulation of transcription, DNA-templated (biological process)
  GO:0005634 - nucleus (cellular component)
  GO:0005737 - cytoplasm (cellular component)
InterPro domains:
  IPR018501 - DDT domain
  IPR040221 - CDCA7/CDA7L


Homology
GenBank top hits | e value | % identity (alignment follows each hit)
KAA0053380.1 zf-4CXXC_R1 domain-containing protein [Cucumis melo var. makuwa] | 2.2e-245 | 86.3
Query:  MITPRKQGKENSLNGNNESNLNLQNQTPTSDKKKLKEMKREELKEICNGNKVD-VKVSKKSSPTKSSL-EISKEQTEANGRNDSLPSKKKGSKKGTSKDA
        M+TPRKQGKENSLNGNNESNLNLQ QTP  DKKKLKEMKREELKEICN NKVD  K SKKSSPTKSSL EI KEQTEANGRNDSLPSKKKGS+KGTSKDA
Subjt:  MITPRKQGKENSLNGNNESNLNLQNQTPTSDKKKLKEMKREELKEICNGNKVD-VKVSKKSSPTKSSL-EISKEQTEANGRNDSLPSKKKGSKKGTSKDA

Query:  ASDVSKPKDAREKNSLCHEDAKAPDAVEEEDKRSSKDVSYHRIKAEDTKEVKNLDIHKYAKTSKDAKNNKKNVHDKPLAKSQENKECSVTIQNQEFRACV
        ASDV  PKDAREKNSLCHE+AK  D +EEEDKRSSKDV YH IKAED KE K L+ HKYAKTSKD KNNK  VHDKP AKS+ENK+CSV IQN+EF A V
Subjt:  ASDVSKPKDAREKNSLCHEDAKAPDAVEEEDKRSSKDVSYHRIKAEDTKEVKNLDIHKYAKTSKDAKNNKKNVHDKPLAKSQENKECSVTIQNQEFRACV

Query:  SFPPGSRLTTVADIELTTDDVGHALQFLEFCAGFGKALNLKKGHAESVLKDLMRE---RRRCRVYDSLTVRFHIQLLSLILKDMDEESAISSPTNDRNSW
         FPPGSRLTTVADIELTTDDVGHALQFLEFCA FGKALNLKKGHAESVLKDLMRE   RRRCRV+DSLTVRFHIQLLSLILKDMDEESAI SPTNDR+SW
Subjt:  SFPPGSRLTTVADIELTTDDVGHALQFLEFCAGFGKALNLKKGHAESVLKDLMRE---RRRCRVYDSLTVRFHIQLLSLILKDMDEESAISSPTNDRNSW

Query:  LLALKKCISVSPFKLNDLKPDYFDGGDSCYDDLDFSKKLRLLTYLCDEALNTTKLRNWIEEQNTNFLEEQKEVKEKLAALKDKEKQAKNKMRDELAKALI
        LLALKKCIS SPFKLNDLKPDYFDGGD+CYDDL FSKKLRLLTYLCDEALNTTKLR+WIE+QN+NF+EEQKEVKEKLAALKDKEKQAK K+RDELAKALI
Subjt:  LLALKKCISVSPFKLNDLKPDYFDGGDSCYDDLDFSKKLRLLTYLCDEALNTTKLRNWIEEQNTNFLEEQKEVKEKLAALKDKEKQAKNKMRDELAKALI

Query:  AKNGVPLSIAEHDAIVSEIKSDVAEAQAERLVALELASKRRQSSDATRTVPIMLDVNGRVFWKLRGFAGEGNILLQDMESWESVNPSEKWFMYKGEQKQE
         KNGVPLSIAEHDAI+S+IK+DVAEAQAERLVALELASKRR+ S ATRT P+MLDVNGRVFWKLRGFA +GNILLQDMESW SVNPSEKW MYK EQKQE
Subjt:  AKNGVPLSIAEHDAIVSEIKSDVAEAQAERLVALELASKRRQSSDATRTVPIMLDVNGRVFWKLRGFAGEGNILLQDMESWESVNPSEKWFMYKGEQKQE

Query:  IEKYITSLRLKRPKLVEIAQTLPGGDSETASVC
        IEKYI+SLR KR KL EI QTLPGG SETAS C
Subjt:  IEKYITSLRLKRPKLVEIAQTLPGGDSETASVC

XP_011653347.1 uncharacterized protein LOC101206502 isoform X1 [Cucumis sativus] | 9.4e-241 | 85.69
Query:  MITPRKQGKENSLNGNNESNLNLQNQTPTSDKKKLKEMKREELKEICNGNKVDVKVSKKSSPTKSSL-EISKEQTEANGRNDSLPSKKKGSKKGTSKDAA
        MITPRKQGKENSLNGNNESNLNLQ QTP  D+KKLKEMKREELKEICN NKVD K SKKSS TKSSL EISKEQTEANGRNDSLPSKKKG +KGTSKDAA
Subjt:  MITPRKQGKENSLNGNNESNLNLQNQTPTSDKKKLKEMKREELKEICNGNKVDVKVSKKSSPTKSSL-EISKEQTEANGRNDSLPSKKKGSKKGTSKDAA

Query:  SDVSKPKDAREKNSLCHEDAKAPDAVEEEDKRSSKDVSYHRIKAEDTKEVKNLDIHKYAKTSKDAKNNKKNVHDKPLAKSQENKECSVTIQNQEFRACVS
        SDVS PKDAREKNS  HE+AKA D  EEEDKRSSKDV YH IKA D KE K L IHKYA TSKD KNNK  VHDKP AKSQENK+CSV IQN+EF A V 
Subjt:  SDVSKPKDAREKNSLCHEDAKAPDAVEEEDKRSSKDVSYHRIKAEDTKEVKNLDIHKYAKTSKDAKNNKKNVHDKPLAKSQENKECSVTIQNQEFRACVS

Query:  FPPGSRLTTVADIELTTDDVGHALQFLEFCAGFGKALNLKKGHAESVLKDLMRER--RRCRVYDSLTVRFHIQLLSLILKDMDEESAISSPTNDRNSWLL
        F PG RLTTVADIELTTDDVGHALQFLEFCA FGKALN+KKG+AESVLKDLMRER  RRCRV+DSLTVRFHIQLLSLILKDMDEESAI SPTNDR+SWLL
Subjt:  FPPGSRLTTVADIELTTDDVGHALQFLEFCAGFGKALNLKKGHAESVLKDLMRER--RRCRVYDSLTVRFHIQLLSLILKDMDEESAISSPTNDRNSWLL

Query:  ALKKCISVSPFKLNDLKPDYFDGGDSCYDDLDFSKKLRLLTYLCDEALNTTKLRNWIEEQNTNFLEEQKEVKEKLAALKDKEKQAKNKMRDELAKALIAK
        ALKKCIS SPFK NDLKPDYFDGGD+CYDDLDFSKKLRLLTYLCDEALNTTKLR+WIE+QN+NFLEEQKEVKEKLAALKDKEKQAK K++DELAKALIAK
Subjt:  ALKKCISVSPFKLNDLKPDYFDGGDSCYDDLDFSKKLRLLTYLCDEALNTTKLRNWIEEQNTNFLEEQKEVKEKLAALKDKEKQAKNKMRDELAKALIAK

Query:  NGVPLSIAEHDAIVSEIKSDVAEAQAERLVALELASKRRQSSDATRTVPIMLDVNGRVFWKLRGFAGEGNILLQDMESWESVNPSEKWFMYKGEQKQEIE
        NGVPLSIAE+DAI+S+IK+DVAEAQAERL ALELASKRRQ S ATRTVP+MLDVNGRVFWKLRGFA EGNILLQDMESW S NPSEKW +YK EQKQEIE
Subjt:  NGVPLSIAEHDAIVSEIKSDVAEAQAERLVALELASKRRQSSDATRTVPIMLDVNGRVFWKLRGFAGEGNILLQDMESWESVNPSEKWFMYKGEQKQEIE

Query:  KYITSLRLKRPKLVEIAQTLPGGDSETASVC
        KYI+SL  KRPKLVE  QTLPGG SETAS C
Subjt:  KYITSLRLKRPKLVEIAQTLPGGDSETASVC

XP_038874924.1 uncharacterized protein LOC120067431 isoform X1 [Benincasa hispida] | 3.9e-263 | 92.47
Query:  MITPRKQGKENSLNGNNESNLNLQNQTPTSDKKKLKEMKREELKEICNGNKVDVKVSKKSSPTKSSL-EISKEQTEANGRNDSLPSKKKGSKKGTSKDAA
        M TPRKQGKENSLNGNNESNLNLQ+QTP SDKKKLK+MK EELKEIC GNKVDVKVSKKSSPTKSSL EISKEQTEANGRNDSLPSKKKGS KG SKDAA
Subjt:  MITPRKQGKENSLNGNNESNLNLQNQTPTSDKKKLKEMKREELKEICNGNKVDVKVSKKSSPTKSSL-EISKEQTEANGRNDSLPSKKKGSKKGTSKDAA

Query:  SDVSKPKDAREKNSLCHEDAKAPDAVEEEDKRSSKDVSYHRIKAEDTKEVKNLDIHKYAKTSKDAKNNKKNVHDKPLAKSQENKECSVTIQNQEFRACVS
        SDV KPKDAREKNS CHEDAKAPDAVEEEDKRSSKDVSYHRIKAEDTKEVKNLDIHKYAKTSKDAKNNK  VHDKPL KS+ENK+CSV IQN E  ACVS
Subjt:  SDVSKPKDAREKNSLCHEDAKAPDAVEEEDKRSSKDVSYHRIKAEDTKEVKNLDIHKYAKTSKDAKNNKKNVHDKPLAKSQENKECSVTIQNQEFRACVS

Query:  FPPGSRLTTVADIELTTDDVGHALQFLEFCAGFGKALNLKKGHAESVLKDLMRER--RRCRVYDSLTVRFHIQLLSLILKDMDEESAISSPTNDRNSWLL
        FPPGSRLTTVADIEL TDDVGHALQFLEFCA FGKALNLKKGHAESVLKDLMRER  RRCRV+DSLTVRFHIQLLSLILKDMDEES ISSPTND+NSWLL
Subjt:  FPPGSRLTTVADIELTTDDVGHALQFLEFCAGFGKALNLKKGHAESVLKDLMRER--RRCRVYDSLTVRFHIQLLSLILKDMDEESAISSPTNDRNSWLL

Query:  ALKKCISVSPFKLNDLKPDYFDGGDSCYDDLDFSKKLRLLTYLCDEALNTTKLRNWIEEQNTNFLEEQKEVKEKLAALKDKEKQAKNKMRDELAKALIAK
        ALKKCIS SPFKLNDLK D+FDGGD CYDDLD SKKLRLLTYLCDEALNTTKLR WIEEQNTNFLEEQKEVKEKLAALKDKEKQAKNKMRDELAKALIAK
Subjt:  ALKKCISVSPFKLNDLKPDYFDGGDSCYDDLDFSKKLRLLTYLCDEALNTTKLRNWIEEQNTNFLEEQKEVKEKLAALKDKEKQAKNKMRDELAKALIAK

Query:  NGVPLSIAEHDAIVSEIKSDVAEAQAERLVALELASKRRQSSDATRTVPIMLDVNGRVFWKLRGFAGEGNILLQDMESWESVNPSEKWFMYKGEQKQEIE
        NGVPLSIAEHDAIVSEIKSDV+EAQAERLVALELASKRR+SSDATRTVPIMLDVNGRVFWKLRGFAGEGNILLQD+ESWESVNPSEKWFMYK EQKQEIE
Subjt:  NGVPLSIAEHDAIVSEIKSDVAEAQAERLVALELASKRRQSSDATRTVPIMLDVNGRVFWKLRGFAGEGNILLQDMESWESVNPSEKWFMYKGEQKQEIE

Query:  KYITSLRLKRPKLVEIAQTLPGGDSETASVC
        KYITS RLKRPKLVEIAQTLPGG +ETASVC
Subjt:  KYITSLRLKRPKLVEIAQTLPGGDSETASVC

XP_038874928.1 uncharacterized protein LOC120067431 isoform X2 [Benincasa hispida] | 1.0e-250 | 89.41
Query:  TPRKQGKENSLNGNNESNLNLQNQTPTSDKKKLKEMKREELKEICNGNKVDVKVSKKSSPTKSSL-EISKEQTEANGRNDSLPSKKKGSKKGTSKDAASD
        +P KQ  EN            ++QTP SDKKKLK+MK EELKEIC GNKVDVKVSKKSSPTKSSL EISKEQTEANGRNDSLPSKKKGS KG SKDAASD
Subjt:  TPRKQGKENSLNGNNESNLNLQNQTPTSDKKKLKEMKREELKEICNGNKVDVKVSKKSSPTKSSL-EISKEQTEANGRNDSLPSKKKGSKKGTSKDAASD

Query:  VSKPKDAREKNSLCHEDAKAPDAVEEEDKRSSKDVSYHRIKAEDTKEVKNLDIHKYAKTSKDAKNNKKNVHDKPLAKSQENKECSVTIQNQEFRACVSFP
        V KPKDAREKNS CHEDAKAPDAVEEEDKRSSKDVSYHRIKAEDTKEVKNLDIHKYAKTSKDAKNNK  VHDKPL KS+ENK+CSV IQN E  ACVSFP
Subjt:  VSKPKDAREKNSLCHEDAKAPDAVEEEDKRSSKDVSYHRIKAEDTKEVKNLDIHKYAKTSKDAKNNKKNVHDKPLAKSQENKECSVTIQNQEFRACVSFP

Query:  PGSRLTTVADIELTTDDVGHALQFLEFCAGFGKALNLKKGHAESVLKDLMRER--RRCRVYDSLTVRFHIQLLSLILKDMDEESAISSPTNDRNSWLLAL
        PGSRLTTVADIEL TDDVGHALQFLEFCA FGKALNLKKGHAESVLKDLMRER  RRCRV+DSLTVRFHIQLLSLILKDMDEES ISSPTND+NSWLLAL
Subjt:  PGSRLTTVADIELTTDDVGHALQFLEFCAGFGKALNLKKGHAESVLKDLMRER--RRCRVYDSLTVRFHIQLLSLILKDMDEESAISSPTNDRNSWLLAL

Query:  KKCISVSPFKLNDLKPDYFDGGDSCYDDLDFSKKLRLLTYLCDEALNTTKLRNWIEEQNTNFLEEQKEVKEKLAALKDKEKQAKNKMRDELAKALIAKNG
        KKCIS SPFKLNDLK D+FDGGD CYDDLD SKKLRLLTYLCDEALNTTKLR WIEEQNTNFLEEQKEVKEKLAALKDKEKQAKNKMRDELAKALIAKNG
Subjt:  KKCISVSPFKLNDLKPDYFDGGDSCYDDLDFSKKLRLLTYLCDEALNTTKLRNWIEEQNTNFLEEQKEVKEKLAALKDKEKQAKNKMRDELAKALIAKNG

Query:  VPLSIAEHDAIVSEIKSDVAEAQAERLVALELASKRRQSSDATRTVPIMLDVNGRVFWKLRGFAGEGNILLQDMESWESVNPSEKWFMYKGEQKQEIEKY
        VPLSIAEHDAIVSEIKSDV+EAQAERLVALELASKRR+SSDATRTVPIMLDVNGRVFWKLRGFAGEGNILLQD+ESWESVNPSEKWFMYK EQKQEIEKY
Subjt:  VPLSIAEHDAIVSEIKSDVAEAQAERLVALELASKRRQSSDATRTVPIMLDVNGRVFWKLRGFAGEGNILLQDMESWESVNPSEKWFMYKGEQKQEIEKY

Query:  ITSLRLKRPKLVEIAQTLPGGDSETASVC
        ITS RLKRPKLVEIAQTLPGG +ETASVC
Subjt:  ITSLRLKRPKLVEIAQTLPGGDSETASVC

XP_038874931.1 uncharacterized protein LOC120067431 isoform X3 [Benincasa hispida] | 3.9e-263 | 92.47
Query:  MITPRKQGKENSLNGNNESNLNLQNQTPTSDKKKLKEMKREELKEICNGNKVDVKVSKKSSPTKSSL-EISKEQTEANGRNDSLPSKKKGSKKGTSKDAA
        M TPRKQGKENSLNGNNESNLNLQ+QTP SDKKKLK+MK EELKEIC GNKVDVKVSKKSSPTKSSL EISKEQTEANGRNDSLPSKKKGS KG SKDAA
Subjt:  MITPRKQGKENSLNGNNESNLNLQNQTPTSDKKKLKEMKREELKEICNGNKVDVKVSKKSSPTKSSL-EISKEQTEANGRNDSLPSKKKGSKKGTSKDAA

Query:  SDVSKPKDAREKNSLCHEDAKAPDAVEEEDKRSSKDVSYHRIKAEDTKEVKNLDIHKYAKTSKDAKNNKKNVHDKPLAKSQENKECSVTIQNQEFRACVS
        SDV KPKDAREKNS CHEDAKAPDAVEEEDKRSSKDVSYHRIKAEDTKEVKNLDIHKYAKTSKDAKNNK  VHDKPL KS+ENK+CSV IQN E  ACVS
Subjt:  SDVSKPKDAREKNSLCHEDAKAPDAVEEEDKRSSKDVSYHRIKAEDTKEVKNLDIHKYAKTSKDAKNNKKNVHDKPLAKSQENKECSVTIQNQEFRACVS

Query:  FPPGSRLTTVADIELTTDDVGHALQFLEFCAGFGKALNLKKGHAESVLKDLMRER--RRCRVYDSLTVRFHIQLLSLILKDMDEESAISSPTNDRNSWLL
        FPPGSRLTTVADIEL TDDVGHALQFLEFCA FGKALNLKKGHAESVLKDLMRER  RRCRV+DSLTVRFHIQLLSLILKDMDEES ISSPTND+NSWLL
Subjt:  FPPGSRLTTVADIELTTDDVGHALQFLEFCAGFGKALNLKKGHAESVLKDLMRER--RRCRVYDSLTVRFHIQLLSLILKDMDEESAISSPTNDRNSWLL

Query:  ALKKCISVSPFKLNDLKPDYFDGGDSCYDDLDFSKKLRLLTYLCDEALNTTKLRNWIEEQNTNFLEEQKEVKEKLAALKDKEKQAKNKMRDELAKALIAK
        ALKKCIS SPFKLNDLK D+FDGGD CYDDLD SKKLRLLTYLCDEALNTTKLR WIEEQNTNFLEEQKEVKEKLAALKDKEKQAKNKMRDELAKALIAK
Subjt:  ALKKCISVSPFKLNDLKPDYFDGGDSCYDDLDFSKKLRLLTYLCDEALNTTKLRNWIEEQNTNFLEEQKEVKEKLAALKDKEKQAKNKMRDELAKALIAK

Query:  NGVPLSIAEHDAIVSEIKSDVAEAQAERLVALELASKRRQSSDATRTVPIMLDVNGRVFWKLRGFAGEGNILLQDMESWESVNPSEKWFMYKGEQKQEIE
        NGVPLSIAEHDAIVSEIKSDV+EAQAERLVALELASKRR+SSDATRTVPIMLDVNGRVFWKLRGFAGEGNILLQD+ESWESVNPSEKWFMYK EQKQEIE
Subjt:  NGVPLSIAEHDAIVSEIKSDVAEAQAERLVALELASKRRQSSDATRTVPIMLDVNGRVFWKLRGFAGEGNILLQDMESWESVNPSEKWFMYKGEQKQEIE

Query:  KYITSLRLKRPKLVEIAQTLPGGDSETASVC
        KYITS RLKRPKLVEIAQTLPGG +ETASVC
Subjt:  KYITSLRLKRPKLVEIAQTLPGGDSETASVC

TrEMBL top hits | e value | % identity (alignment follows each hit)
A0A0A0KZ36 DDT domain-containing protein | 4.6e-201 | 85.59
Query:  QNQTPTSDKKKLKEMKREELKEICNGNKVDVKVSKKSSPTKSSL-EISKEQTEANGRNDSLPSKKKGSKKGTSKDAASDVSKPKDAREKNSLCHEDAKAP
        + QTP  D+KKLKEMKREELKEICN NKVD K SKKSS TKSSL EISKEQTEANGRNDSLPSKKKG +KGTSKDAASDVS PKDAREKNS  HE+AKA 
Subjt:  QNQTPTSDKKKLKEMKREELKEICNGNKVDVKVSKKSSPTKSSL-EISKEQTEANGRNDSLPSKKKGSKKGTSKDAASDVSKPKDAREKNSLCHEDAKAP

Query:  DAVEEEDKRSSKDVSYHRIKAEDTKEVKNLDIHKYAKTSKDAKNNKKNVHDKPLAKSQENKECSVTIQNQEFRACVSFPPGSRLTTVADIELTTDDVGHA
        D  EEEDKRSSKDV YH IKA D KE K L IHKYA TSKD KNNK  VHDKP AKSQENK+CSV IQN+EF A V F PG RLTTVADIELTTDDVGHA
Subjt:  DAVEEEDKRSSKDVSYHRIKAEDTKEVKNLDIHKYAKTSKDAKNNKKNVHDKPLAKSQENKECSVTIQNQEFRACVSFPPGSRLTTVADIELTTDDVGHA

Query:  LQFLEFCAGFGKALNLKKGHAESVLKDLMRER--RRCRVYDSLTVRFHIQLLSLILKDMDEESAISSPTNDRNSWLLALKKCISVSPFKLNDLKPDYFDG
        LQFLEFCA FGKALN+KKG+AESVLKDLMRER  RRCRV+DSLTVRFHIQLLSLILKDMDEESAI SPTNDR+SWLLALKKCIS SPFK NDLKPDYFDG
Subjt:  LQFLEFCAGFGKALNLKKGHAESVLKDLMRER--RRCRVYDSLTVRFHIQLLSLILKDMDEESAISSPTNDRNSWLLALKKCISVSPFKLNDLKPDYFDG

Query:  GDSCYDDLDFSKKLRLLTYLCDEALNTTKLRNWIEEQNTNFLEEQKEVKEKLAALKDKEKQAKNKMRDELAKALIAKNGVPLSIAEHDAIVSEIKSDVAE
        GD+CYDDLDFSKKLRLLTYLCDEALNTTKLR+WIE+QN+NFLEEQKEVKEKLAALKDKEKQAK K++DELAKALIAKNGVPLSIAE+DAI+S+IK+DVAE
Subjt:  GDSCYDDLDFSKKLRLLTYLCDEALNTTKLRNWIEEQNTNFLEEQKEVKEKLAALKDKEKQAKNKMRDELAKALIAKNGVPLSIAEHDAIVSEIKSDVAE

Query:  AQAERLVALELASKRRQSSDATRTVPIMLDVNGRVFWKLRGFAGEGNILLQ
        AQAERL ALELASKRRQ S ATRTVP+MLDVNGRVFWKLRGFA EGNILLQ
Subjt:  AQAERLVALELASKRRQSSDATRTVPIMLDVNGRVFWKLRGFAGEGNILLQ

A0A5A7UDV8 Zf-4CXXC_R1 domain-containing protein | 1.0e-245 | 86.3
Query:  MITPRKQGKENSLNGNNESNLNLQNQTPTSDKKKLKEMKREELKEICNGNKVD-VKVSKKSSPTKSSL-EISKEQTEANGRNDSLPSKKKGSKKGTSKDA
        M+TPRKQGKENSLNGNNESNLNLQ QTP  DKKKLKEMKREELKEICN NKVD  K SKKSSPTKSSL EI KEQTEANGRNDSLPSKKKGS+KGTSKDA
Subjt:  MITPRKQGKENSLNGNNESNLNLQNQTPTSDKKKLKEMKREELKEICNGNKVD-VKVSKKSSPTKSSL-EISKEQTEANGRNDSLPSKKKGSKKGTSKDA

Query:  ASDVSKPKDAREKNSLCHEDAKAPDAVEEEDKRSSKDVSYHRIKAEDTKEVKNLDIHKYAKTSKDAKNNKKNVHDKPLAKSQENKECSVTIQNQEFRACV
        ASDV  PKDAREKNSLCHE+AK  D +EEEDKRSSKDV YH IKAED KE K L+ HKYAKTSKD KNNK  VHDKP AKS+ENK+CSV IQN+EF A V
Subjt:  ASDVSKPKDAREKNSLCHEDAKAPDAVEEEDKRSSKDVSYHRIKAEDTKEVKNLDIHKYAKTSKDAKNNKKNVHDKPLAKSQENKECSVTIQNQEFRACV

Query:  SFPPGSRLTTVADIELTTDDVGHALQFLEFCAGFGKALNLKKGHAESVLKDLMRE---RRRCRVYDSLTVRFHIQLLSLILKDMDEESAISSPTNDRNSW
         FPPGSRLTTVADIELTTDDVGHALQFLEFCA FGKALNLKKGHAESVLKDLMRE   RRRCRV+DSLTVRFHIQLLSLILKDMDEESAI SPTNDR+SW
Subjt:  SFPPGSRLTTVADIELTTDDVGHALQFLEFCAGFGKALNLKKGHAESVLKDLMRE---RRRCRVYDSLTVRFHIQLLSLILKDMDEESAISSPTNDRNSW

Query:  LLALKKCISVSPFKLNDLKPDYFDGGDSCYDDLDFSKKLRLLTYLCDEALNTTKLRNWIEEQNTNFLEEQKEVKEKLAALKDKEKQAKNKMRDELAKALI
        LLALKKCIS SPFKLNDLKPDYFDGGD+CYDDL FSKKLRLLTYLCDEALNTTKLR+WIE+QN+NF+EEQKEVKEKLAALKDKEKQAK K+RDELAKALI
Subjt:  LLALKKCISVSPFKLNDLKPDYFDGGDSCYDDLDFSKKLRLLTYLCDEALNTTKLRNWIEEQNTNFLEEQKEVKEKLAALKDKEKQAKNKMRDELAKALI

Query:  AKNGVPLSIAEHDAIVSEIKSDVAEAQAERLVALELASKRRQSSDATRTVPIMLDVNGRVFWKLRGFAGEGNILLQDMESWESVNPSEKWFMYKGEQKQE
         KNGVPLSIAEHDAI+S+IK+DVAEAQAERLVALELASKRR+ S ATRT P+MLDVNGRVFWKLRGFA +GNILLQDMESW SVNPSEKW MYK EQKQE
Subjt:  AKNGVPLSIAEHDAIVSEIKSDVAEAQAERLVALELASKRRQSSDATRTVPIMLDVNGRVFWKLRGFAGEGNILLQDMESWESVNPSEKWFMYKGEQKQE

Query:  IEKYITSLRLKRPKLVEIAQTLPGGDSETASVC
        IEKYI+SLR KR KL EI QTLPGG SETAS C
Subjt:  IEKYITSLRLKRPKLVEIAQTLPGGDSETASVC

A0A5D3CWK9 Zinc-finger domain of monoamine-oxidase A repressor R1 protein, putative isoform 5 | 3.3e-199 | 87.24
Query:  MITPRKQGKENSLNGNNESNLNLQNQTPTSDKKKLKEMKREELKEICNGNKVD-VKVSKKSSPTKSSL-EISKEQTEANGRNDSLPSKKKGSKKGTSKDA
        M+TPRKQGKENSLNGNNESNLNLQ QTP  DKKKLKEMKREELKEICN NKVD  K SKKSSPTKSSL EI KEQTEANGRNDSLPSKKKGS+KGTSKDA
Subjt:  MITPRKQGKENSLNGNNESNLNLQNQTPTSDKKKLKEMKREELKEICNGNKVD-VKVSKKSSPTKSSL-EISKEQTEANGRNDSLPSKKKGSKKGTSKDA

Query:  ASDVSKPKDAREKNSLCHEDAKAPDAVEEEDKRSSKDVSYHRIKAEDTKEVKNLDIHKYAKTSKDAKNNKKNVHDKPLAKSQENKECSVTIQNQEFRACV
        ASDV  PKDAREKNSLCHE+AK  D +EEEDKRSSKDV YH IKAED KE K L+ HKYAKTSKD KNNK  VHDKP AKS+ENK+CSV IQN+EF A V
Subjt:  ASDVSKPKDAREKNSLCHEDAKAPDAVEEEDKRSSKDVSYHRIKAEDTKEVKNLDIHKYAKTSKDAKNNKKNVHDKPLAKSQENKECSVTIQNQEFRACV

Query:  SFPPGSRLTTVADIELTTDDVGHALQFLEFCAGFGKALNLKKGHAESVLKDLMRE---RRRCRVYDSLTVRFHIQLLSLILKDMDEESAISSPTNDRNSW
         FPPGSRLTTVADIELTTDDVGHALQFLEFCA FGKALNLKKGHAESVLKDLMRE   RRRCRV+DSLTVRFHIQLLSLILKDMDEESAI SPTNDR+SW
Subjt:  SFPPGSRLTTVADIELTTDDVGHALQFLEFCAGFGKALNLKKGHAESVLKDLMRE---RRRCRVYDSLTVRFHIQLLSLILKDMDEESAISSPTNDRNSW

Query:  LLALKKCISVSPFKLNDLKPDYFDGGDSCYDDLDFSKKLRLLTYLCDEALNTTKLRNWIEEQNTNFLEEQKEVKEKLAALKDKEKQAKNKMRDELAKALI
        LLALKKCIS SPFKLNDLKPDYFDGGD+CYDDL FSKKLRLLTYLCDEALNTTKLR+WIE+QN NF+EEQKEVKEKLAALKDKEKQAK K+RDELAKALI
Subjt:  LLALKKCISVSPFKLNDLKPDYFDGGDSCYDDLDFSKKLRLLTYLCDEALNTTKLRNWIEEQNTNFLEEQKEVKEKLAALKDKEKQAKNKMRDELAKALI

Query:  AKNGVPLSIAEHDAIVSEIKSDVAEAQAERLVALELASK
         KNGVPLSIAEHDAI+S+IK+DVAEAQAERLVALELASK
Subjt:  AKNGVPLSIAEHDAIVSEIKSDVAEAQAERLVALELASK

A0A6J1DLC3 uncharacterized protein LOC111022193 isoform X2 | 1.6e-217 | 78.87
Query:  TPRKQGKENSLNGNNESNLNLQNQTPTSDKKKLKEMKREELKEICNGNKVDVKVSKKSSPTKSSL-EISKEQTEANGRNDSLPSKKKGSKKGTSKDAASD
        +P+KQ  EN            +NQTP SDKKKLKEMK EEL+EICNGNKVDVK S KSSPTKSSL E SKEQTEANGRN SLPSKKKGSKK TSKDAASD
Subjt:  TPRKQGKENSLNGNNESNLNLQNQTPTSDKKKLKEMKREELKEICNGNKVDVKVSKKSSPTKSSL-EISKEQTEANGRNDSLPSKKKGSKKGTSKDAASD

Query:  VSKPKDAREKNSLCHEDAKAPDAVEEEDKRSSKDVSYHRIKAEDTKEVKNLDIHKYAKTSKDAKNNKKNVHDKPLAKSQENKECSVTIQNQEFRACVSFP
        VS+PKDAREKNS CHEDAKAPDA++EED+R SKDVS H I AE TKE   LDIH      KDA NNK  V DKPL  SQEN +C+V   N+EF ACVS P
Subjt:  VSKPKDAREKNSLCHEDAKAPDAVEEEDKRSSKDVSYHRIKAEDTKEVKNLDIHKYAKTSKDAKNNKKNVHDKPLAKSQENKECSVTIQNQEFRACVSFP

Query:  PGSRLTTVADIELTTDDVGHALQFLEFCAGFGKALNLKKGHAESVLKDLMR---ERRRCRVYDSLTVRFHIQLLSLILKDMDEESAISSPTNDRNSWLLA
         GSRLTTVA++ELTT+DVGHALQFLEFCA FGKALNL+KGHAESVLKDLMR   +RRR RVYDSLTVRFHIQLLSLILKDMDEESAISSPTND +SWLLA
Subjt:  PGSRLTTVADIELTTDDVGHALQFLEFCAGFGKALNLKKGHAESVLKDLMR---ERRRCRVYDSLTVRFHIQLLSLILKDMDEESAISSPTNDRNSWLLA

Query:  LKKCISVSPFKLNDLKPDYFDGGDSCYDDLDFSKKLRLLTYLCDEALNTTKLRNWIEEQNTNFLEEQKEVKEKLAALKDKEKQAKNKMRDELAKALIAKN
        LKKCIS S FKL+DLKPDYFD GDSCYDDLDFS+KLRLLTYLCDEALNTTKLRNWI+EQN NF+EEQKE KEKLAALKDKEKQAK+K+RDELAKALIAKN
Subjt:  LKKCISVSPFKLNDLKPDYFDGGDSCYDDLDFSKKLRLLTYLCDEALNTTKLRNWIEEQNTNFLEEQKEVKEKLAALKDKEKQAKNKMRDELAKALIAKN

Query:  GVPLSIAEHDAIVSEIKSDVAEAQAERLVALELASKRRQSSDATRTVPIMLDVNGRVFWKLRGFAGEGNILLQDMESWESVNPSEKWFMYKGEQKQEIEK
        G+PL+IAEHDAIVS++K DVA AQ+ERLV LE+ASKR+Q SDATRTVPI+LD NGRVFW LRGFAGEGNILLQDMESWE+VNPSEKW MYKGEQKQEIE+
Subjt:  GVPLSIAEHDAIVSEIKSDVAEAQAERLVALELASKRRQSSDATRTVPIMLDVNGRVFWKLRGFAGEGNILLQDMESWESVNPSEKWFMYKGEQKQEIEK

Query:  YITSLRLKRPKLVEIAQTLPGGDSETASVC
        YI+SLR+KR +LV+ AQTLP G S  AS+C
Subjt:  YITSLRLKRPKLVEIAQTLPGGDSETASVC

A0A6J1DP35 uncharacterized protein LOC111022193 isoform X1 | 4.0e-229 | 81.77
Query:  MITPRKQGKENSLNGNNESNLNLQNQTPTSDKKKLKEMKREELKEICNGNKVDVKVSKKSSPTKSSL-EISKEQTEANGRNDSLPSKKKGSKKGTSKDAA
        MI+PRKQGKEN LNGNNESNLNLQNQTP SDKKKLKEMK EEL+EICNGNKVDVK S KSSPTKSSL E SKEQTEANGRN SLPSKKKGSKK TSKDAA
Subjt:  MITPRKQGKENSLNGNNESNLNLQNQTPTSDKKKLKEMKREELKEICNGNKVDVKVSKKSSPTKSSL-EISKEQTEANGRNDSLPSKKKGSKKGTSKDAA

Query:  SDVSKPKDAREKNSLCHEDAKAPDAVEEEDKRSSKDVSYHRIKAEDTKEVKNLDIHKYAKTSKDAKNNKKNVHDKPLAKSQENKECSVTIQNQEFRACVS
        SDVS+PKDAREKNS CHEDAKAPDA++EED+R SKDVS H I AE TKE   LDIH      KDA NNK  V DKPL  SQEN +C+V   N+EF ACVS
Subjt:  SDVSKPKDAREKNSLCHEDAKAPDAVEEEDKRSSKDVSYHRIKAEDTKEVKNLDIHKYAKTSKDAKNNKKNVHDKPLAKSQENKECSVTIQNQEFRACVS

Query:  FPPGSRLTTVADIELTTDDVGHALQFLEFCAGFGKALNLKKGHAESVLKDLMR---ERRRCRVYDSLTVRFHIQLLSLILKDMDEESAISSPTNDRNSWL
         P GSRLTTVA++ELTT+DVGHALQFLEFCA FGKALNL+KGHAESVLKDLMR   +RRR RVYDSLTVRFHIQLLSLILKDMDEESAISSPTND +SWL
Subjt:  FPPGSRLTTVADIELTTDDVGHALQFLEFCAGFGKALNLKKGHAESVLKDLMR---ERRRCRVYDSLTVRFHIQLLSLILKDMDEESAISSPTNDRNSWL

Query:  LALKKCISVSPFKLNDLKPDYFDGGDSCYDDLDFSKKLRLLTYLCDEALNTTKLRNWIEEQNTNFLEEQKEVKEKLAALKDKEKQAKNKMRDELAKALIA
        LALKKCIS S FKL+DLKPDYFD GDSCYDDLDFS+KLRLLTYLCDEALNTTKLRNWI+EQN NF+EEQKE KEKLAALKDKEKQAK+K+RDELAKALIA
Subjt:  LALKKCISVSPFKLNDLKPDYFDGGDSCYDDLDFSKKLRLLTYLCDEALNTTKLRNWIEEQNTNFLEEQKEVKEKLAALKDKEKQAKNKMRDELAKALIA

Query:  KNGVPLSIAEHDAIVSEIKSDVAEAQAERLVALELASKRRQSSDATRTVPIMLDVNGRVFWKLRGFAGEGNILLQDMESWESVNPSEKWFMYKGEQKQEI
        KNG+PL+IAEHDAIVS++K DVA AQ+ERLV LE+ASKR+Q SDATRTVPI+LD NGRVFW LRGFAGEGNILLQDMESWE+VNPSEKW MYKGEQKQEI
Subjt:  KNGVPLSIAEHDAIVSEIKSDVAEAQAERLVALELASKRRQSSDATRTVPIMLDVNGRVFWKLRGFAGEGNILLQDMESWESVNPSEKWFMYKGEQKQEI

Query:  EKYITSLRLKRPKLVEIAQTLPGGDSETASVC
        E+YI+SLR+KR +LV+ AQTLP G S  AS+C
Subjt:  EKYITSLRLKRPKLVEIAQTLPGGDSETASVC

SwissProt top hits | e value | % identity (alignment follows each hit)
No hits found

Arabidopsis top hits | e value | % identity (alignment follows each hit)
AT1G67270.1 Zinc-finger domain of monoamine-oxidase A repressor R1 protein | 1.0e-59 | 41.81
Query:  DAKNNKKNVHDKPLAKSQENKECSVTIQNQEFRACVSFPPGSRLTTVADIELTTDDVGHALQFLEFCAGFGKALNLKKGHAESVLKDLM---RERRRCRV
        D    KK V DK       +K        +E +     P G  LT V+ I++ T++ G+  Q  EFC+ FGKAL LK+GHAE+++++L    R  RR + 
Subjt:  DAKNNKKNVHDKPLAKSQENKECSVTIQNQEFRACVSFPPGSRLTTVADIELTTDDVGHALQFLEFCAGFGKALNLKKGHAESVLKDLM---RERRRCRV

Query:  YDSLTVRFHIQLLSLILKDMDEESAISSPTNDRNSWLLALKKCISVSPFKLNDLKPDYFDGGDSCYDDLDFSKKLRLLTYLCDEALNTTKLRNWIEEQNT
        Y S+T +  IQLL LI K  D E ++S    D +SW  A+ + +S S    ++L  + F GG + Y+ ++ S+KL+LL +LCDE+L+T  +RN+I  Q  
Subjt:  YDSLTVRFHIQLLSLILKDMDEESAISSPTNDRNSWLLALKKCISVSPFKLNDLKPDYFDGGDSCYDDLDFSKKLRLLTYLCDEALNTTKLRNWIEEQNT

Query:  NFLEEQKEVKEKLAALKDKEKQAKNKMRDELAKALIAKNGVPLSIAEHDAIVSEIKSDVAEAQAERLVALELASKRRQSSDATRTVPIMLDVNGRVFWKL
           E +KE K+K AA K KEKQ K KM+ E+AK+++ KNG PLSI EH++IVS+I+ +  EA  E + A  + SK     DA RT PIMLD NG V WKL
Subjt:  NFLEEQKEVKEKLAALKDKEKQAKNKMRDELAKALIAKNGVPLSIAEHDAIVSEIKSDVAEAQAERLVALELASKRRQSSDATRTVPIMLDVNGRVFWKL

Query:  RGFAGEGNILLQDMESWESVNPSEKWFMYKGEQKQEIEKYITSLRLKRPKLVEI
        + F  E   LLQD+ +++ + P E+W  +K EQK EIE  I+ +R K  KL++I
Subjt:  RGFAGEGNILLQDMESWESVNPSEKWFMYKGEQKQEIEKYITSLRLKRPKLVEI

AT1G67780.1 Zinc-finger domain of monoamine-oxidase A repressor R1 protein | 1.7e-54 | 37.69
Query:  DKPLAKSQENKECSVTIQNQEFRACVSFPPGSRLTTVADIELTTDDVGHALQFLEFCAGFGKALNLKKGHAESVLKDLM---RERRRCRVYDSLTVRFHI
        DK     +  K  +     +E +     P G  L +V+ + + T++ G+  Q  EFC+ FGKAL LK+G AE+V+++L    R  RR + Y S+ ++  I
Subjt:  DKPLAKSQENKECSVTIQNQEFRACVSFPPGSRLTTVADIELTTDDVGHALQFLEFCAGFGKALNLKKGHAESVLKDLM---RERRRCRVYDSLTVRFHI

Query:  QLLSLILKDMDEESAISSPTNDRNSWLLALKKCISVSPFKLNDLKPDYFDGGDSCYDDLDFSKKLRLLTYLCDEALNTTKLRNWIEEQNTNFLEEQKEVK
        QLL LI KD +   ++S   N    W  AL + +  S    ++  P+ F+ G + Y+ +D S++L+LL ++CDE+L+T  +RN I+ Q+T       E K
Subjt:  QLLSLILKDMDEESAISSPTNDRNSWLLALKKCISVSPFKLNDLKPDYFDGGDSCYDDLDFSKKLRLLTYLCDEALNTTKLRNWIEEQNTNFLEEQKEVK

Query:  EKLAALKDKEKQAKNKMRDELAKALIAKNGVPLSIAEHDAIVSEIKSDVAEAQAERLVALELASKRRQSSDATRTVPIMLDVNGRVFWKLRGFAGEGNIL
         K AA K+KEKQ K K++ +LAKA++ KNG PLSI EH+ I+S+I+++  EA    + A  + S   +  DA RT PIM++ NG V WKL  +  E   L
Subjt:  EKLAALKDKEKQAKNKMRDELAKALIAKNGVPLSIAEHDAIVSEIKSDVAEAQAERLVALELASKRRQSSDATRTVPIMLDVNGRVFWKLRGFAGEGNIL

Query:  LQDMESWESVNPSEKWFMYKGEQKQEIEKYITSLRLK
        LQD+ +++ +   EKW  +K EQK +IE YI+  R K
Subjt:  LQDMESWESVNPSEKWFMYKGEQKQEIEKYITSLRLK

AT5G38690.1 Zinc-finger domain of monoamine-oxidase A repressor R1 protein | 2.7e-76 | 42.74
Query:  RIKAEDTKEVKNLDIHKYAKTSKDAKNNKKNVHDKPLAKSQENKECSVTIQNQEFRACVSFPPGSRLTTVADIELTTDDVGHALQFLEFCAGFGKALNLK
        +IK  D+       I K     K  K   K + +  +A+  +  + ++  + +E +  +  P G+   TV+ I+L  +D G+  QFLEFC+ FGKAL+L+
Subjt:  RIKAEDTKEVKNLDIHKYAKTSKDAKNNKKNVHDKPLAKSQENKECSVTIQNQEFRACVSFPPGSRLTTVADIELTTDDVGHALQFLEFCAGFGKALNLK

Query:  KGHAESVLKDLMRERRRCRVYDSLTVRFHIQLLSLILKDMDEESAISSPTNDRNSWLLALKKCISVSPFKLNDLKPDYFDGGDSCYDDLDFSKKLRLLTY
        KG AE V+++++  R + R   S   +  IQLL++IL+D  E S   S T+   SW   + +C+S S  KL+D  P+ F+ G S Y+ L+ SK+L+LL +
Subjt:  KGHAESVLKDLMRERRRCRVYDSLTVRFHIQLLSLILKDMDEESAISSPTNDRNSWLLALKKCISVSPFKLNDLKPDYFDGGDSCYDDLDFSKKLRLLTY

Query:  LCDEALNTTKLRNWIEEQNTNFLEEQKEVKEKLAALKDKEKQAKNKMRDELAKALIAKNGVPLSIAEHDAIVSEIKSDVAEAQAERLVALELASKRRQ-S
        LCDE L T  +RN I+ QN   +E +KE KEK+ A KDKEKQ K K++DELA+A+ AKNG+PL I EHDAIVS I ++  E  +E   A+++ SK+ Q S
Subjt:  LCDEALNTTKLRNWIEEQNTNFLEEQKEVKEKLAALKDKEKQAKNKMRDELAKALIAKNGVPLSIAEHDAIVSEIKSDVAEAQAERLVALELASKRRQ-S

Query:  SDATRTVPIMLDVNGRVFWKLRGFAGEGNILLQDMESWESVNPSEKWFMYKGEQKQEIEKYITSLRLKRPKLVEIAQTL
         DA RT P+ LD NG +FW+L+ +  E NILLQD+ SW  V P EKWF +  EQK EIEKYI+ +R+KR +  + A T+
Subjt:  SDATRTVPIMLDVNGRVFWKLRGFAGEGNILLQDMESWESVNPSEKWFMYKGEQKQEIEKYITSLRLKRPKLVEIAQTL
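
The % identity column in the hit tables above can be related to the alignment blocks with a rough sketch like the one below. It assumes the usual BLAST-style layout, in which the unlabeled row under each Query row repeats a residue where the two sequences are identical, shows `+` for a conservative substitution, and is blank otherwise. The function name `percent_identity` is hypothetical, and the value is computed per block over its alignment columns, which may not be exactly how the database derives its reported figure.

```python
def percent_identity(query: str, midline: str) -> float:
    """Share of alignment columns whose match row repeats the query residue.

    query   -- residues from a 'Query:' row (label and padding removed)
    midline -- the match row printed beneath it, same column offsets
    """
    if not query:
        raise ValueError("empty query row")
    if len(midline) > len(query):
        raise ValueError("match row is longer than the query row")
    # Trailing blanks are often stripped from scraped text; restore them.
    midline = midline.ljust(len(query))
    identical = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    return 100.0 * identical / len(query)


if __name__ == "__main__":
    # Toy columns only: two identities, one '+', one mismatch.
    print(percent_identity("ACDE", "AC+ "))
```

Only the Query row and the match row are needed, so the sketch does not depend on the Subjt rows.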


Sequences
CDS sequence
ATGATTACACCAAGGAAACAAGGGAAGGAAAATTCTTTAAATGGAAATAATGAATCGAACTTGAATTTGCAGAATCAAACTCCAACTTCTGATAAAAAGAAGTTGAAGGA
AATGAAGCGGGAAGAATTGAAAGAAATATGCAATGGAAATAAGGTTGATGTTAAAGTCTCAAAGAAGAGCAGTCCAACAAAATCCAGCTTAGAAATTTCAAAGGAACAAA
CTGAAGCAAATGGGAGGAATGACTCTCTTCCCTCAAAGAAAAAGGGATCCAAAAAGGGGACCTCTAAGGATGCTGCCTCCGATGTTAGCAAACCCAAGGATGCGAGAGAA
AAAAACAGTTTGTGTCATGAGGATGCCAAAGCACCAGATGCTGTGGAAGAGGAAGACAAGAGGTCCTCAAAGGATGTTTCTTACCATCGAATTAAAGCTGAAGACACAAA
AGAAGTAAAAAATTTGGATATTCACAAATATGCCAAGACATCAAAAGATGCGAAAAATAATAAAAAAAATGTGCATGACAAGCCACTGGCAAAGTCCCAGGAAAATAAAG
AATGCTCTGTGACTATTCAGAACCAGGAATTTCGTGCCTGTGTTTCTTTCCCTCCAGGTTCAAGGTTAACAACTGTAGCAGATATTGAACTGACCACAGATGATGTTGGT
CATGCACTACAGTTTCTAGAATTCTGTGCAGGTTTTGGAAAGGCTCTAAATTTAAAGAAAGGGCATGCCGAGTCTGTACTCAAAGACCTAATGCGGGAGAGAAGAAGGTG
TCGAGTGTATGATTCACTCACTGTTCGGTTTCATATTCAACTCCTGTCTCTGATATTGAAGGATATGGATGAAGAGTCTGCAATCTCGAGTCCCACAAATGACAGAAACT
CATGGTTGCTGGCTCTGAAGAAATGCATTTCTGTATCCCCATTTAAGTTGAATGATCTGAAACCAGATTACTTTGATGGAGGTGACAGTTGTTATGATGACTTAGACTTC
TCAAAAAAGCTCAGACTATTGACTTACCTATGCGATGAGGCTCTCAATACAACAAAATTGAGAAACTGGATTGAAGAACAAAATACTAACTTTTTGGAGGAACAAAAGGA
AGTTAAAGAAAAACTTGCTGCCCTGAAGGATAAGGAAAAACAAGCTAAGAACAAAATGCGAGATGAATTGGCTAAAGCTCTTATTGCAAAGAATGGTGTTCCCCTTTCAA
TTGCAGAGCACGATGCTATTGTTTCAGAAATAAAAAGTGACGTAGCTGAAGCTCAGGCTGAGAGGCTTGTTGCATTGGAATTGGCATCTAAGAGGAGACAAAGTTCAGAT
GCTACTAGGACAGTTCCCATAATGTTAGACGTTAATGGTCGTGTATTTTGGAAATTAAGAGGCTTTGCTGGTGAAGGGAATATTCTGCTGCAAGATATGGAAAGCTGGGA
ATCAGTCAATCCAAGTGAAAAGTGGTTTATGTATAAAGGCGAGCAGAAACAAGAGATAGAAAAATACATTACTTCTTTAAGGTTGAAAAGGCCTAAGTTAGTGGAAATAG
CTCAAACTCTTCCAGGTGGAGATAGTGAGACAGCTTCAGTATGTTGA
mRNA sequence
ATGATTACACCAAGGAAACAAGGGAAGGAAAATTCTTTAAATGGAAATAATGAATCGAACTTGAATTTGCAGAATCAAACTCCAACTTCTGATAAAAAGAAGTTGAAGGA
AATGAAGCGGGAAGAATTGAAAGAAATATGCAATGGAAATAAGGTTGATGTTAAAGTCTCAAAGAAGAGCAGTCCAACAAAATCCAGCTTAGAAATTTCAAAGGAACAAA
CTGAAGCAAATGGGAGGAATGACTCTCTTCCCTCAAAGAAAAAGGGATCCAAAAAGGGGACCTCTAAGGATGCTGCCTCCGATGTTAGCAAACCCAAGGATGCGAGAGAA
AAAAACAGTTTGTGTCATGAGGATGCCAAAGCACCAGATGCTGTGGAAGAGGAAGACAAGAGGTCCTCAAAGGATGTTTCTTACCATCGAATTAAAGCTGAAGACACAAA
AGAAGTAAAAAATTTGGATATTCACAAATATGCCAAGACATCAAAAGATGCGAAAAATAATAAAAAAAATGTGCATGACAAGCCACTGGCAAAGTCCCAGGAAAATAAAG
AATGCTCTGTGACTATTCAGAACCAGGAATTTCGTGCCTGTGTTTCTTTCCCTCCAGGTTCAAGGTTAACAACTGTAGCAGATATTGAACTGACCACAGATGATGTTGGT
CATGCACTACAGTTTCTAGAATTCTGTGCAGGTTTTGGAAAGGCTCTAAATTTAAAGAAAGGGCATGCCGAGTCTGTACTCAAAGACCTAATGCGGGAGAGAAGAAGGTG
TCGAGTGTATGATTCACTCACTGTTCGGTTTCATATTCAACTCCTGTCTCTGATATTGAAGGATATGGATGAAGAGTCTGCAATCTCGAGTCCCACAAATGACAGAAACT
CATGGTTGCTGGCTCTGAAGAAATGCATTTCTGTATCCCCATTTAAGTTGAATGATCTGAAACCAGATTACTTTGATGGAGGTGACAGTTGTTATGATGACTTAGACTTC
TCAAAAAAGCTCAGACTATTGACTTACCTATGCGATGAGGCTCTCAATACAACAAAATTGAGAAACTGGATTGAAGAACAAAATACTAACTTTTTGGAGGAACAAAAGGA
AGTTAAAGAAAAACTTGCTGCCCTGAAGGATAAGGAAAAACAAGCTAAGAACAAAATGCGAGATGAATTGGCTAAAGCTCTTATTGCAAAGAATGGTGTTCCCCTTTCAA
TTGCAGAGCACGATGCTATTGTTTCAGAAATAAAAAGTGACGTAGCTGAAGCTCAGGCTGAGAGGCTTGTTGCATTGGAATTGGCATCTAAGAGGAGACAAAGTTCAGAT
GCTACTAGGACAGTTCCCATAATGTTAGACGTTAATGGTCGTGTATTTTGGAAATTAAGAGGCTTTGCTGGTGAAGGGAATATTCTGCTGCAAGATATGGAAAGCTGGGA
ATCAGTCAATCCAAGTGAAAAGTGGTTTATGTATAAAGGCGAGCAGAAACAAGAGATAGAAAAATACATTACTTCTTTAAGGTTGAAAAGGCCTAAGTTAGTGGAAATAG
CTCAAACTCTTCCAGGTGGAGATAGTGAGACAGCTTCAGTATGTTGA
Protein sequence
MITPRKQGKENSLNGNNESNLNLQNQTPTSDKKKLKEMKREELKEICNGNKVDVKVSKKSSPTKSSLEISKEQTEANGRNDSLPSKKKGSKKGTSKDAASDVSKPKDARE
KNSLCHEDAKAPDAVEEEDKRSSKDVSYHRIKAEDTKEVKNLDIHKYAKTSKDAKNNKKNVHDKPLAKSQENKECSVTIQNQEFRACVSFPPGSRLTTVADIELTTDDVG
HALQFLEFCAGFGKALNLKKGHAESVLKDLMRERRRCRVYDSLTVRFHIQLLSLILKDMDEESAISSPTNDRNSWLLALKKCISVSPFKLNDLKPDYFDGGDSCYDDLDF
SKKLRLLTYLCDEALNTTKLRNWIEEQNTNFLEEQKEVKEKLAALKDKEKQAKNKMRDELAKALIAKNGVPLSIAEHDAIVSEIKSDVAEAQAERLVALELASKRRQSSD
ATRTVPIMLDVNGRVFWKLRGFAGEGNILLQDMESWESVNPSEKWFMYKGEQKQEIEKYITSLRLKRPKLVEIAQTLPGGDSETASVC
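
The CDS and protein sequences listed above can be cross-checked against each other with a short translation sketch. The code below assumes the standard nuclear genetic code and a CDS that starts in frame; the names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    # Join wrapped lines and normalise case before reading codons.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 24 bases of the CDS listed above.
    print(translate("ATGATTACACCAAGGAAACAAGGG"))
```

Passing the full CDS, wrapped lines included, should reproduce the protein sequence if the record is internally consistent.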