CuGenDBv2

Tan0020065 (gene) of Snake gourd v1 genome

Gene ID: Tan0020065
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Zf-4CXXC_R1 domain-containing protein
Genome location: LG04:57383385..57407719
RNA-Seq Expression: Tan0020065
Synteny: Tan0020065
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
InterPro domains:
IPR018501 - DDT domain
IPR018866 - Zinc-finger domain of monoamine-oxidase A repressor R1
IPR028942 - WHIM1 domain
IPR040221 - CDCA7/CDA7L


Homology
GenBank top hits | e value | % identity
KAA0053380.1 zf-4CXXC_R1 domain-containing protein [Cucumis melo var. makuwa] | 0.0e+00 | 84.05
Query:  MGSKKRAAADDNEHRAKRTKSPGVRVIGGRIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNC
        MGSKKRAAADDNEHRAKRTKSPGVRV+GGRIYDSENGKTCHQCRQKTMDFAASCMN KG+KLCTIKFCHKCLLNRYGEKAEEVML+EDW+CPKCR LCNC
Subjt:  MGSKKRAAADDNEHRAKRTKSPGVRVIGGRIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNC

Query:  SVCMKKKGLKPTGLLVRTAKATGFSSVSEMLLVNGSECLDHEKNAICKVASPNKQASENKESVMISPRKQGKENSLNGNNESNLNLQNQTPNSDKKKLKE
        SVCMKKKGLKPTGLLV  AKATGFSSVSEMLLV+GS+CLD  KN I KVASP KQAS++KESVM++PRKQGKENSLNGNNESNLNLQ QTPN DKKKLKE
Subjt:  SVCMKKKGLKPTGLLVRTAKATGFSSVSEMLLVNGSECLDHEKNAICKVASPNKQASENKESVMISPRKQGKENSLNGNNESNLNLQNQTPNSDKKKLKE

Query:  MKREELKEICNGNKVD-VKCSKKSS--KSSFEETSKKQTEVNGTNEYLPSKKKGSKKGTSKDAASDGSRPKDARKKNSLCHEDAKAPDAVKEEDKRSSKD
        MKREELKEICN NKVD  K SKKSS  KSS  E  K+QTE NG N+ LPSKKKGS+KGTSKDAASD   PKDAR+KNSLCHE+AK  D ++EEDKRSSKD
Subjt:  MKREELKEICNGNKVD-VKCSKKSS--KSSFEETSKKQTEVNGTNEYLPSKKKGSKKGTSKDAASDGSRPKDARKKNSLCHEDAKAPDAVKEEDKRSSKD

Query:  TSYHLISAESTKEEKDLDIHKYAKTSDDAKNNKTKVKDKPLAKSQENRKGTTNVQNKEFGACVSLPPGSRLTTVADIELTTEDVGHALQFLEFCAAFGKA
          YHLI AE  KEEK+L+ HKYAKTS D KNNKTKV DKP AKS+EN+K + N+QNKEFGA V  PPGSRLTTVADIELTT+DVGHALQFLEFCAAFGKA
Subjt:  TSYHLISAESTKEEKDLDIHKYAKTSDDAKNNKTKVKDKPLAKSQENRKGTTNVQNKEFGACVSLPPGSRLTTVADIELTTEDVGHALQFLEFCAAFGKA

Query:  LNLKKGHAESVLKDLMRERTLRRSGRVHDSLSVRFHIQLLSLILKDMDEESEISSPTNDRSSWLMALKKCISASPFKLNDLKPDYFDGGDNCYVDLDFSK
        LNLKKGHAESVLKDLMRER  RR  RVHDSL+VRFHIQLLSLILKDMDEES I SPTNDRSSWL+ALKKCISASPFKLNDLKPDYFDGGDNCY DL FSK
Subjt:  LNLKKGHAESVLKDLMRERTLRRSGRVHDSLSVRFHIQLLSLILKDMDEESEISSPTNDRSSWLMALKKCISASPFKLNDLKPDYFDGGDNCYVDLDFSK

Query:  KLRLLTYLCDEALNTAKLRNWIEEQNTNFVEEQKEVKEKLAALKDKEKQAKNKLQDELAKALIAKNGLPLSIAEHDAIVSQIKRDVAEAQAERLVALELA
        KLRLLTYLCDEALNT KLR+WIE+QN+NFVEEQKEVKEKLAALKDKEKQAK KL+DELAKALI KNG+PLSIAEHDAI+SQIK DVAEAQAERLVALELA
Subjt:  KLRLLTYLCDEALNTAKLRNWIEEQNTNFVEEQKEVKEKLAALKDKEKQAKNKLQDELAKALIAKNGLPLSIAEHDAIVSQIKRDVAEAQAERLVALELA

Query:  SKRRQRSDATRTVPIILDVNGRVFWKLRGFAGEGNILLQDMESWESVIPREKWSMYKGEQKQEIEKYISSLRLKRPRLVEVAQTLPGGVIEAASVC
        SKRR+RS ATRT P++LDVNGRVFWKLRGFA +GNILLQDMESW SV P EKW MYK EQKQEIEKYISSLR KR +L E+ QTLPGG  E AS C
Subjt:  SKRRQRSDATRTVPIILDVNGRVFWKLRGFAGEGNILLQDMESWESVIPREKWSMYKGEQKQEIEKYISSLRLKRPRLVEVAQTLPGGVIEAASVC

XP_011653347.1 uncharacterized protein LOC101206502 isoform X1 [Cucumis sativus] | 0.0e+00 | 83.31
Query:  MGSKKRAAADDNEHRAKRTKSPGVRVIGGRIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNC
        MGSKKRAAADDNEHRAKRTKSPGVRV+GGRIYDSENGKTCHQCRQKTMDFAASCMN K +KLCTIKFCHKCLLNRYGEKAEE ML +DW+CPKCR LCNC
Subjt:  MGSKKRAAADDNEHRAKRTKSPGVRVIGGRIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNC

Query:  SVCMKKKGLKPTGLLVRTAKATGFSSVSEMLLVNGSECLDHEKNAICKVASPNKQASENKESVMISPRKQGKENSLNGNNESNLNLQNQTPNSDKKKLKE
        SVCMKKKGLKPTGLLVR AKATGFSSVSEMLLV+GS+CLD  KN I K ASP KQAS++KESVMI+PRKQGKENSLNGNNESNLNLQ QTPN D+KKLKE
Subjt:  SVCMKKKGLKPTGLLVRTAKATGFSSVSEMLLVNGSECLDHEKNAICKVASPNKQASENKESVMISPRKQGKENSLNGNNESNLNLQNQTPNSDKKKLKE

Query:  MKREELKEICNGNKVDVKCSKKSS--KSSFEETSKKQTEVNGTNEYLPSKKKGSKKGTSKDAASDGSRPKDARKKNSLCHEDAKAPDAVKEEDKRSSKDT
        MKREELKEICN NKVD K SKKSS  KSS  E SK+QTE NG N+ LPSKKKG +KGTSKDAASD S PKDAR+KNS  HE+AKA D  +EEDKRSSKD 
Subjt:  MKREELKEICNGNKVDVKCSKKSS--KSSFEETSKKQTEVNGTNEYLPSKKKGSKKGTSKDAASDGSRPKDARKKNSLCHEDAKAPDAVKEEDKRSSKDT

Query:  SYHLISAESTKEEKDLDIHKYAKTSDDAKNNKTKVKDKPLAKSQENRKGTTNVQNKEFGACVSLPPGSRLTTVADIELTTEDVGHALQFLEFCAAFGKAL
         YHLI A   KEEK+L IHKYA TS D KNNKTKV DKP AKSQEN+K + N+QNKEFGA V   PG RLTTVADIELTT+DVGHALQFLEFCAAFGKAL
Subjt:  SYHLISAESTKEEKDLDIHKYAKTSDDAKNNKTKVKDKPLAKSQENRKGTTNVQNKEFGACVSLPPGSRLTTVADIELTTEDVGHALQFLEFCAAFGKAL

Query:  NLKKGHAESVLKDLMRERTLRRSGRVHDSLSVRFHIQLLSLILKDMDEESEISSPTNDRSSWLMALKKCISASPFKLNDLKPDYFDGGDNCYVDLDFSKK
        N+KKG+AESVLKDLMRER  RR  RVHDSL+VRFHIQLLSLILKDMDEES I SPTNDRSSWL+ALKKCISASPFK NDLKPDYFDGGDNCY DLDFSKK
Subjt:  NLKKGHAESVLKDLMRERTLRRSGRVHDSLSVRFHIQLLSLILKDMDEESEISSPTNDRSSWLMALKKCISASPFKLNDLKPDYFDGGDNCYVDLDFSKK

Query:  LRLLTYLCDEALNTAKLRNWIEEQNTNFVEEQKEVKEKLAALKDKEKQAKNKLQDELAKALIAKNGLPLSIAEHDAIVSQIKRDVAEAQAERLVALELAS
        LRLLTYLCDEALNT KLR+WIE+QN+NF+EEQKEVKEKLAALKDKEKQAK KLQDELAKALIAKNG+PLSIAE+DAI+SQIK DVAEAQAERL ALELAS
Subjt:  LRLLTYLCDEALNTAKLRNWIEEQNTNFVEEQKEVKEKLAALKDKEKQAKNKLQDELAKALIAKNGLPLSIAEHDAIVSQIKRDVAEAQAERLVALELAS

Query:  KRRQRSDATRTVPIILDVNGRVFWKLRGFAGEGNILLQDMESWESVIPREKWSMYKGEQKQEIEKYISSLRLKRPRLVEVAQTLPGGVIEAASVC
        KRRQRS ATRTVP++LDVNGRVFWKLRGFA EGNILLQDMESW S  P EKW +YK EQKQEIEKYISSL  KRP+LVE  QTLPGG  E AS C
Subjt:  KRRQRSDATRTVPIILDVNGRVFWKLRGFAGEGNILLQDMESWESVIPREKWSMYKGEQKQEIEKYISSLRLKRPRLVEVAQTLPGGVIEAASVC

XP_022155054.1 uncharacterized protein LOC111022193 isoform X1 [Momordica charantia] | 0.0e+00 | 85.47
Query:  MGSKKRAAADDNEHRAKRTKSPGVRVIGGRIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNC
        MGSKKRAAADDNEHRAKRTKSPGVRVIGGRIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVML EDW+CPKCRDLCNC
Subjt:  MGSKKRAAADDNEHRAKRTKSPGVRVIGGRIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNC

Query:  SVCMKKKGLKPTGLLVRTAKATGFSSVSEMLLVNGSECLDHEKNAICKVASPNKQASENKESVMISPRKQGKENSLNGNNESNLNLQNQTPNSDKKKLKE
        SVCMKKKGLKPTGLLV TAKATGFSSVSEML VNGS+CLD +KN I KVASP KQASENKE VMISPRKQGKEN LNGNNESNLNLQNQTP SDKKKLKE
Subjt:  SVCMKKKGLKPTGLLVRTAKATGFSSVSEMLLVNGSECLDHEKNAICKVASPNKQASENKESVMISPRKQGKENSLNGNNESNLNLQNQTPNSDKKKLKE

Query:  MKREELKEICNGNKVDVKCSKKSS--KSSFEETSKKQTEVNGTNEYLPSKKKGSKKGTSKDAASDGSRPKDARKKNSLCHEDAKAPDAVKEEDKRSSKDT
        MK EEL+EICNGNKVDVKCS KSS  KSS EE+SK+QTE NG N  LPSKKKGSKK TSKDAASD SRPKDAR+KNS CHEDAKAPDA+KEED+R SKD 
Subjt:  MKREELKEICNGNKVDVKCSKKSS--KSSFEETSKKQTEVNGTNEYLPSKKKGSKKGTSKDAASDGSRPKDARKKNSLCHEDAKAPDAVKEEDKRSSKDT

Query:  SYHLISAESTKEEKDLDIHKYAKTSDDAKNNKTKVKDKPLAKSQENRKGTTNVQNKEFGACVSLPPGSRLTTVADIELTTEDVGHALQFLEFCAAFGKAL
        S HLI+AE TKEE  LDIHK      DA NNKT+VKDKPL  SQEN K T N  NKEF ACVSLP GSRLTTVA++ELTTEDVGHALQFLEFCAAFGKAL
Subjt:  SYHLISAESTKEEKDLDIHKYAKTSDDAKNNKTKVKDKPLAKSQENRKGTTNVQNKEFGACVSLPPGSRLTTVADIELTTEDVGHALQFLEFCAAFGKAL

Query:  NLKKGHAESVLKDLMRERTLRRSGRVHDSLSVRFHIQLLSLILKDMDEESEISSPTNDRSSWLMALKKCISASPFKLNDLKPDYFDGGDNCYVDLDFSKK
        NL+KGHAESVLKDLMR+RT RR GRV+DSL+VRFHIQLLSLILKDMDEES ISSPTND SSWL+ALKKCISAS FKL+DLKPDYFD GD+CY DLDFS+K
Subjt:  NLKKGHAESVLKDLMRERTLRRSGRVHDSLSVRFHIQLLSLILKDMDEESEISSPTNDRSSWLMALKKCISASPFKLNDLKPDYFDGGDNCYVDLDFSKK

Query:  LRLLTYLCDEALNTAKLRNWIEEQNTNFVEEQKEVKEKLAALKDKEKQAKNKLQDELAKALIAKNGLPLSIAEHDAIVSQIKRDVAEAQAERLVALELAS
        LRLLTYLCDEALNT KLRNWI+EQN NFVEEQKE KEKLAALKDKEKQAK+KL+DELAKALIAKNG+PL+IAEHDAIVSQ+KRDVA AQ+ERLV LE+AS
Subjt:  LRLLTYLCDEALNTAKLRNWIEEQNTNFVEEQKEVKEKLAALKDKEKQAKNKLQDELAKALIAKNGLPLSIAEHDAIVSQIKRDVAEAQAERLVALELAS

Query:  KRRQRSDATRTVPIILDVNGRVFWKLRGFAGEGNILLQDMESWESVIPREKWSMYKGEQKQEIEKYISSLRLKRPRLVEVAQTLPGGVIEAASVC
        KR+QRSDATRTVPIILD NGRVFW LRGFAGEGNILLQDMESWE+V P EKWSMYKGEQKQEIE+YISSLR+KR RLV+ AQTLP G   AAS+C
Subjt:  KRRQRSDATRTVPIILDVNGRVFWKLRGFAGEGNILLQDMESWESVIPREKWSMYKGEQKQEIEKYISSLRLKRPRLVEVAQTLPGGVIEAASVC

XP_038874924.1 uncharacterized protein LOC120067431 isoform X1 [Benincasa hispida] | 0.0e+00 | 86.19
Query:  MGSKKRAAADDNEHRAKRTKSPGVRVIGGRIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNC
        MGSKKRAAADDNEHRAKRTKSPGVRV+GGRIYDSE+GKTCHQCRQKTMDFAASCMN KG+K CTIKFCHKCLLNRYGEKAEEVMLREDW+CPKCR LCNC
Subjt:  MGSKKRAAADDNEHRAKRTKSPGVRVIGGRIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNC

Query:  SVCMKKKGLKPTGLLVRTAKATGFSSVSEMLLVNGSECLDHEKNAICKVASPNKQASENKESVMISPRKQGKENSLNGNNESNLNLQNQTPNSDKKKLKE
        SVCMKKKGLKPTGLLV TAKATGFSSVSEMLLVNGS+CLD  K  ICKVASPNKQ SENKES+M +PRKQGKENSLNGNNESNLNLQ+QTPNSDKKKLK+
Subjt:  SVCMKKKGLKPTGLLVRTAKATGFSSVSEMLLVNGSECLDHEKNAICKVASPNKQASENKESVMISPRKQGKENSLNGNNESNLNLQNQTPNSDKKKLKE

Query:  MKREELKEICNGNKVDVKCSKKSS--KSSFEETSKKQTEVNGTNEYLPSKKKGSKKGTSKDAASDGSRPKDARKKNSLCHEDAKAPDAVKEEDKRSSKDT
        MK EELKEIC GNKVDVK SKKSS  KSS EE SK+QTE NG N+ LPSKKKGS KG SKDAASD  +PKDAR+KNS CHEDAKAPDAV+EEDKRSSKD 
Subjt:  MKREELKEICNGNKVDVKCSKKSS--KSSFEETSKKQTEVNGTNEYLPSKKKGSKKGTSKDAASDGSRPKDARKKNSLCHEDAKAPDAVKEEDKRSSKDT

Query:  SYHLISAESTKEEKDLDIHKYAKTSDDAKNNKTKVKDKPLAKSQENRKGTTNVQNKEFGACVSLPPGSRLTTVADIELTTEDVGHALQFLEFCAAFGKAL
        SYH I AE TKE K+LDIHKYAKTS DAKNNKTKV DKPL KS+EN+K + N+QN E GACVS PPGSRLTTVADIEL T+DVGHALQFLEFCAAFGKAL
Subjt:  SYHLISAESTKEEKDLDIHKYAKTSDDAKNNKTKVKDKPLAKSQENRKGTTNVQNKEFGACVSLPPGSRLTTVADIELTTEDVGHALQFLEFCAAFGKAL

Query:  NLKKGHAESVLKDLMRERTLRRSGRVHDSLSVRFHIQLLSLILKDMDEESEISSPTNDRSSWLMALKKCISASPFKLNDLKPDYFDGGDNCYVDLDFSKK
        NLKKGHAESVLKDLMRER  RR  RVHDSL+VRFHIQLLSLILKDMDEESEISSPTND++SWL+ALKKCISASPFKLNDLK D+FDGGD CY DLD SKK
Subjt:  NLKKGHAESVLKDLMRERTLRRSGRVHDSLSVRFHIQLLSLILKDMDEESEISSPTNDRSSWLMALKKCISASPFKLNDLKPDYFDGGDNCYVDLDFSKK

Query:  LRLLTYLCDEALNTAKLRNWIEEQNTNFVEEQKEVKEKLAALKDKEKQAKNKLQDELAKALIAKNGLPLSIAEHDAIVSQIKRDVAEAQAERLVALELAS
        LRLLTYLCDEALNT KLR WIEEQNTNF+EEQKEVKEKLAALKDKEKQAKNK++DELAKALIAKNG+PLSIAEHDAIVS+IK DV+EAQAERLVALELAS
Subjt:  LRLLTYLCDEALNTAKLRNWIEEQNTNFVEEQKEVKEKLAALKDKEKQAKNKLQDELAKALIAKNGLPLSIAEHDAIVSQIKRDVAEAQAERLVALELAS

Query:  KRRQRSDATRTVPIILDVNGRVFWKLRGFAGEGNILLQDMESWESVIPREKWSMYKGEQKQEIEKYISSLRLKRPRLVEVAQTLPGGVIEAASVC
        KRR+ SDATRTVPI+LDVNGRVFWKLRGFAGEGNILLQD+ESWESV P EKW MYK EQKQEIEKYI+S RLKRP+LVE+AQTLPGG  E ASVC
Subjt:  KRRQRSDATRTVPIILDVNGRVFWKLRGFAGEGNILLQDMESWESVIPREKWSMYKGEQKQEIEKYISSLRLKRPRLVEVAQTLPGGVIEAASVC

XP_038874928.1 uncharacterized protein LOC120067431 isoform X2 [Benincasa hispida] | 0.0e+00 | 82.73
Query:  MGSKKRAAADDNEHRAKRTKSPGVRVIGGRIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNC
        MGSKKRAAADDNEHRAKRTKSPGVRV+GGRIYDSE+GKTCHQCRQKTMDFAASCMN KG+K CTIKFCHKCLLNRYGEKAEEVMLREDW+CPKCR LCNC
Subjt:  MGSKKRAAADDNEHRAKRTKSPGVRVIGGRIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNC

Query:  SVCMKKKGLKPTGLLVRTAKATGFSSVSEMLLVNGSECLDHEKNAICKVASPNKQASENKESVMISPRKQGKENSLNGNNESNLNLQNQTPNSDKKKLKE
        SVCMKKKGLKPTGLLV TAKATGFSSVSEMLLVNGS+CLD  K  ICKVASPNKQ SENK                           +QTPNSDKKKLK+
Subjt:  SVCMKKKGLKPTGLLVRTAKATGFSSVSEMLLVNGSECLDHEKNAICKVASPNKQASENKESVMISPRKQGKENSLNGNNESNLNLQNQTPNSDKKKLKE

Query:  MKREELKEICNGNKVDVKCSKKSS--KSSFEETSKKQTEVNGTNEYLPSKKKGSKKGTSKDAASDGSRPKDARKKNSLCHEDAKAPDAVKEEDKRSSKDT
        MK EELKEIC GNKVDVK SKKSS  KSS EE SK+QTE NG N+ LPSKKKGS KG SKDAASD  +PKDAR+KNS CHEDAKAPDAV+EEDKRSSKD 
Subjt:  MKREELKEICNGNKVDVKCSKKSS--KSSFEETSKKQTEVNGTNEYLPSKKKGSKKGTSKDAASDGSRPKDARKKNSLCHEDAKAPDAVKEEDKRSSKDT

Query:  SYHLISAESTKEEKDLDIHKYAKTSDDAKNNKTKVKDKPLAKSQENRKGTTNVQNKEFGACVSLPPGSRLTTVADIELTTEDVGHALQFLEFCAAFGKAL
        SYH I AE TKE K+LDIHKYAKTS DAKNNKTKV DKPL KS+EN+K + N+QN E GACVS PPGSRLTTVADIEL T+DVGHALQFLEFCAAFGKAL
Subjt:  SYHLISAESTKEEKDLDIHKYAKTSDDAKNNKTKVKDKPLAKSQENRKGTTNVQNKEFGACVSLPPGSRLTTVADIELTTEDVGHALQFLEFCAAFGKAL

Query:  NLKKGHAESVLKDLMRERTLRRSGRVHDSLSVRFHIQLLSLILKDMDEESEISSPTNDRSSWLMALKKCISASPFKLNDLKPDYFDGGDNCYVDLDFSKK
        NLKKGHAESVLKDLMRER  RR  RVHDSL+VRFHIQLLSLILKDMDEESEISSPTND++SWL+ALKKCISASPFKLNDLK D+FDGGD CY DLD SKK
Subjt:  NLKKGHAESVLKDLMRERTLRRSGRVHDSLSVRFHIQLLSLILKDMDEESEISSPTNDRSSWLMALKKCISASPFKLNDLKPDYFDGGDNCYVDLDFSKK

Query:  LRLLTYLCDEALNTAKLRNWIEEQNTNFVEEQKEVKEKLAALKDKEKQAKNKLQDELAKALIAKNGLPLSIAEHDAIVSQIKRDVAEAQAERLVALELAS
        LRLLTYLCDEALNT KLR WIEEQNTNF+EEQKEVKEKLAALKDKEKQAKNK++DELAKALIAKNG+PLSIAEHDAIVS+IK DV+EAQAERLVALELAS
Subjt:  LRLLTYLCDEALNTAKLRNWIEEQNTNFVEEQKEVKEKLAALKDKEKQAKNKLQDELAKALIAKNGLPLSIAEHDAIVSQIKRDVAEAQAERLVALELAS

Query:  KRRQRSDATRTVPIILDVNGRVFWKLRGFAGEGNILLQDMESWESVIPREKWSMYKGEQKQEIEKYISSLRLKRPRLVEVAQTLPGGVIEAASVC
        KRR+ SDATRTVPI+LDVNGRVFWKLRGFAGEGNILLQD+ESWESV P EKW MYK EQKQEIEKYI+S RLKRP+LVE+AQTLPGG  E ASVC
Subjt:  KRRQRSDATRTVPIILDVNGRVFWKLRGFAGEGNILLQDMESWESVIPREKWSMYKGEQKQEIEKYISSLRLKRPRLVEVAQTLPGGVIEAASVC

TrEMBL top hits | e value | % identity
A0A5A7UDV8 Zf-4CXXC_R1 domain-containing protein | 0.0e+00 | 84.05
Query:  MGSKKRAAADDNEHRAKRTKSPGVRVIGGRIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNC
        MGSKKRAAADDNEHRAKRTKSPGVRV+GGRIYDSENGKTCHQCRQKTMDFAASCMN KG+KLCTIKFCHKCLLNRYGEKAEEVML+EDW+CPKCR LCNC
Subjt:  MGSKKRAAADDNEHRAKRTKSPGVRVIGGRIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNC

Query:  SVCMKKKGLKPTGLLVRTAKATGFSSVSEMLLVNGSECLDHEKNAICKVASPNKQASENKESVMISPRKQGKENSLNGNNESNLNLQNQTPNSDKKKLKE
        SVCMKKKGLKPTGLLV  AKATGFSSVSEMLLV+GS+CLD  KN I KVASP KQAS++KESVM++PRKQGKENSLNGNNESNLNLQ QTPN DKKKLKE
Subjt:  SVCMKKKGLKPTGLLVRTAKATGFSSVSEMLLVNGSECLDHEKNAICKVASPNKQASENKESVMISPRKQGKENSLNGNNESNLNLQNQTPNSDKKKLKE

Query:  MKREELKEICNGNKVD-VKCSKKSS--KSSFEETSKKQTEVNGTNEYLPSKKKGSKKGTSKDAASDGSRPKDARKKNSLCHEDAKAPDAVKEEDKRSSKD
        MKREELKEICN NKVD  K SKKSS  KSS  E  K+QTE NG N+ LPSKKKGS+KGTSKDAASD   PKDAR+KNSLCHE+AK  D ++EEDKRSSKD
Subjt:  MKREELKEICNGNKVD-VKCSKKSS--KSSFEETSKKQTEVNGTNEYLPSKKKGSKKGTSKDAASDGSRPKDARKKNSLCHEDAKAPDAVKEEDKRSSKD

Query:  TSYHLISAESTKEEKDLDIHKYAKTSDDAKNNKTKVKDKPLAKSQENRKGTTNVQNKEFGACVSLPPGSRLTTVADIELTTEDVGHALQFLEFCAAFGKA
          YHLI AE  KEEK+L+ HKYAKTS D KNNKTKV DKP AKS+EN+K + N+QNKEFGA V  PPGSRLTTVADIELTT+DVGHALQFLEFCAAFGKA
Subjt:  TSYHLISAESTKEEKDLDIHKYAKTSDDAKNNKTKVKDKPLAKSQENRKGTTNVQNKEFGACVSLPPGSRLTTVADIELTTEDVGHALQFLEFCAAFGKA

Query:  LNLKKGHAESVLKDLMRERTLRRSGRVHDSLSVRFHIQLLSLILKDMDEESEISSPTNDRSSWLMALKKCISASPFKLNDLKPDYFDGGDNCYVDLDFSK
        LNLKKGHAESVLKDLMRER  RR  RVHDSL+VRFHIQLLSLILKDMDEES I SPTNDRSSWL+ALKKCISASPFKLNDLKPDYFDGGDNCY DL FSK
Subjt:  LNLKKGHAESVLKDLMRERTLRRSGRVHDSLSVRFHIQLLSLILKDMDEESEISSPTNDRSSWLMALKKCISASPFKLNDLKPDYFDGGDNCYVDLDFSK

Query:  KLRLLTYLCDEALNTAKLRNWIEEQNTNFVEEQKEVKEKLAALKDKEKQAKNKLQDELAKALIAKNGLPLSIAEHDAIVSQIKRDVAEAQAERLVALELA
        KLRLLTYLCDEALNT KLR+WIE+QN+NFVEEQKEVKEKLAALKDKEKQAK KL+DELAKALI KNG+PLSIAEHDAI+SQIK DVAEAQAERLVALELA
Subjt:  KLRLLTYLCDEALNTAKLRNWIEEQNTNFVEEQKEVKEKLAALKDKEKQAKNKLQDELAKALIAKNGLPLSIAEHDAIVSQIKRDVAEAQAERLVALELA

Query:  SKRRQRSDATRTVPIILDVNGRVFWKLRGFAGEGNILLQDMESWESVIPREKWSMYKGEQKQEIEKYISSLRLKRPRLVEVAQTLPGGVIEAASVC
        SKRR+RS ATRT P++LDVNGRVFWKLRGFA +GNILLQDMESW SV P EKW MYK EQKQEIEKYISSLR KR +L E+ QTLPGG  E AS C
Subjt:  SKRRQRSDATRTVPIILDVNGRVFWKLRGFAGEGNILLQDMESWESVIPREKWSMYKGEQKQEIEKYISSLRLKRPRLVEVAQTLPGGVIEAASVC

A0A6J1DLC3 uncharacterized protein LOC111022193 isoform X2 | 1.3e-307 | 81.87
Query:  MGSKKRAAADDNEHRAKRTKSPGVRVIGGRIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNC
        MGSKKRAAADDNEHRAKRTKSPGVRVIGGRIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVML EDW+CPKCRDLCNC
Subjt:  MGSKKRAAADDNEHRAKRTKSPGVRVIGGRIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNC

Query:  SVCMKKKGLKPTGLLVRTAKATGFSSVSEMLLVNGSECLDHEKNAICKVASPNKQASENKESVMISPRKQGKENSLNGNNESNLNLQNQTPNSDKKKLKE
        SVCMKKKGLKPTGLLV TAKATGFSSVSEML VNGS+CLD +KN I KVASP KQASENK                           NQTP SDKKKLKE
Subjt:  SVCMKKKGLKPTGLLVRTAKATGFSSVSEMLLVNGSECLDHEKNAICKVASPNKQASENKESVMISPRKQGKENSLNGNNESNLNLQNQTPNSDKKKLKE

Query:  MKREELKEICNGNKVDVKCSKKSS--KSSFEETSKKQTEVNGTNEYLPSKKKGSKKGTSKDAASDGSRPKDARKKNSLCHEDAKAPDAVKEEDKRSSKDT
        MK EEL+EICNGNKVDVKCS KSS  KSS EE+SK+QTE NG N  LPSKKKGSKK TSKDAASD SRPKDAR+KNS CHEDAKAPDA+KEED+R SKD 
Subjt:  MKREELKEICNGNKVDVKCSKKSS--KSSFEETSKKQTEVNGTNEYLPSKKKGSKKGTSKDAASDGSRPKDARKKNSLCHEDAKAPDAVKEEDKRSSKDT

Query:  SYHLISAESTKEEKDLDIHKYAKTSDDAKNNKTKVKDKPLAKSQENRKGTTNVQNKEFGACVSLPPGSRLTTVADIELTTEDVGHALQFLEFCAAFGKAL
        S HLI+AE TKEE  LDIHK      DA NNKT+VKDKPL  SQEN K T N  NKEF ACVSLP GSRLTTVA++ELTTEDVGHALQFLEFCAAFGKAL
Subjt:  SYHLISAESTKEEKDLDIHKYAKTSDDAKNNKTKVKDKPLAKSQENRKGTTNVQNKEFGACVSLPPGSRLTTVADIELTTEDVGHALQFLEFCAAFGKAL

Query:  NLKKGHAESVLKDLMRERTLRRSGRVHDSLSVRFHIQLLSLILKDMDEESEISSPTNDRSSWLMALKKCISASPFKLNDLKPDYFDGGDNCYVDLDFSKK
        NL+KGHAESVLKDLMR+RT RR GRV+DSL+VRFHIQLLSLILKDMDEES ISSPTND SSWL+ALKKCISAS FKL+DLKPDYFD GD+CY DLDFS+K
Subjt:  NLKKGHAESVLKDLMRERTLRRSGRVHDSLSVRFHIQLLSLILKDMDEESEISSPTNDRSSWLMALKKCISASPFKLNDLKPDYFDGGDNCYVDLDFSKK

Query:  LRLLTYLCDEALNTAKLRNWIEEQNTNFVEEQKEVKEKLAALKDKEKQAKNKLQDELAKALIAKNGLPLSIAEHDAIVSQIKRDVAEAQAERLVALELAS
        LRLLTYLCDEALNT KLRNWI+EQN NFVEEQKE KEKLAALKDKEKQAK+KL+DELAKALIAKNG+PL+IAEHDAIVSQ+KRDVA AQ+ERLV LE+AS
Subjt:  LRLLTYLCDEALNTAKLRNWIEEQNTNFVEEQKEVKEKLAALKDKEKQAKNKLQDELAKALIAKNGLPLSIAEHDAIVSQIKRDVAEAQAERLVALELAS

Query:  KRRQRSDATRTVPIILDVNGRVFWKLRGFAGEGNILLQDMESWESVIPREKWSMYKGEQKQEIEKYISSLRLKRPRLVEVAQTLPGGVIEAASVC
        KR+QRSDATRTVPIILD NGRVFW LRGFAGEGNILLQDMESWE+V P EKWSMYKGEQKQEIE+YISSLR+KR RLV+ AQTLP G   AAS+C
Subjt:  KRRQRSDATRTVPIILDVNGRVFWKLRGFAGEGNILLQDMESWESVIPREKWSMYKGEQKQEIEKYISSLRLKRPRLVEVAQTLPGGVIEAASVC

A0A6J1DP35 uncharacterized protein LOC111022193 isoform X1 | 0.0e+00 | 85.47
Query:  MGSKKRAAADDNEHRAKRTKSPGVRVIGGRIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNC
        MGSKKRAAADDNEHRAKRTKSPGVRVIGGRIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVML EDW+CPKCRDLCNC
Subjt:  MGSKKRAAADDNEHRAKRTKSPGVRVIGGRIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNC

Query:  SVCMKKKGLKPTGLLVRTAKATGFSSVSEMLLVNGSECLDHEKNAICKVASPNKQASENKESVMISPRKQGKENSLNGNNESNLNLQNQTPNSDKKKLKE
        SVCMKKKGLKPTGLLV TAKATGFSSVSEML VNGS+CLD +KN I KVASP KQASENKE VMISPRKQGKEN LNGNNESNLNLQNQTP SDKKKLKE
Subjt:  SVCMKKKGLKPTGLLVRTAKATGFSSVSEMLLVNGSECLDHEKNAICKVASPNKQASENKESVMISPRKQGKENSLNGNNESNLNLQNQTPNSDKKKLKE

Query:  MKREELKEICNGNKVDVKCSKKSS--KSSFEETSKKQTEVNGTNEYLPSKKKGSKKGTSKDAASDGSRPKDARKKNSLCHEDAKAPDAVKEEDKRSSKDT
        MK EEL+EICNGNKVDVKCS KSS  KSS EE+SK+QTE NG N  LPSKKKGSKK TSKDAASD SRPKDAR+KNS CHEDAKAPDA+KEED+R SKD 
Subjt:  MKREELKEICNGNKVDVKCSKKSS--KSSFEETSKKQTEVNGTNEYLPSKKKGSKKGTSKDAASDGSRPKDARKKNSLCHEDAKAPDAVKEEDKRSSKDT

Query:  SYHLISAESTKEEKDLDIHKYAKTSDDAKNNKTKVKDKPLAKSQENRKGTTNVQNKEFGACVSLPPGSRLTTVADIELTTEDVGHALQFLEFCAAFGKAL
        S HLI+AE TKEE  LDIHK      DA NNKT+VKDKPL  SQEN K T N  NKEF ACVSLP GSRLTTVA++ELTTEDVGHALQFLEFCAAFGKAL
Subjt:  SYHLISAESTKEEKDLDIHKYAKTSDDAKNNKTKVKDKPLAKSQENRKGTTNVQNKEFGACVSLPPGSRLTTVADIELTTEDVGHALQFLEFCAAFGKAL

Query:  NLKKGHAESVLKDLMRERTLRRSGRVHDSLSVRFHIQLLSLILKDMDEESEISSPTNDRSSWLMALKKCISASPFKLNDLKPDYFDGGDNCYVDLDFSKK
        NL+KGHAESVLKDLMR+RT RR GRV+DSL+VRFHIQLLSLILKDMDEES ISSPTND SSWL+ALKKCISAS FKL+DLKPDYFD GD+CY DLDFS+K
Subjt:  NLKKGHAESVLKDLMRERTLRRSGRVHDSLSVRFHIQLLSLILKDMDEESEISSPTNDRSSWLMALKKCISASPFKLNDLKPDYFDGGDNCYVDLDFSKK

Query:  LRLLTYLCDEALNTAKLRNWIEEQNTNFVEEQKEVKEKLAALKDKEKQAKNKLQDELAKALIAKNGLPLSIAEHDAIVSQIKRDVAEAQAERLVALELAS
        LRLLTYLCDEALNT KLRNWI+EQN NFVEEQKE KEKLAALKDKEKQAK+KL+DELAKALIAKNG+PL+IAEHDAIVSQ+KRDVA AQ+ERLV LE+AS
Subjt:  LRLLTYLCDEALNTAKLRNWIEEQNTNFVEEQKEVKEKLAALKDKEKQAKNKLQDELAKALIAKNGLPLSIAEHDAIVSQIKRDVAEAQAERLVALELAS

Query:  KRRQRSDATRTVPIILDVNGRVFWKLRGFAGEGNILLQDMESWESVIPREKWSMYKGEQKQEIEKYISSLRLKRPRLVEVAQTLPGGVIEAASVC
        KR+QRSDATRTVPIILD NGRVFW LRGFAGEGNILLQDMESWE+V P EKWSMYKGEQKQEIE+YISSLR+KR RLV+ AQTLP G   AAS+C
Subjt:  KRRQRSDATRTVPIILDVNGRVFWKLRGFAGEGNILLQDMESWESVIPREKWSMYKGEQKQEIEKYISSLRLKRPRLVEVAQTLPGGVIEAASVC

A0A6J1G959 uncharacterized protein LOC111452074 isoform X1 | 1.4e-298 | 79.25
Query:  MGSKKRAAADDNEHRAKRTKSPGVRVIGGRIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNC
        MGSKKR AAD+NEHRAKRTKSPGVRVIGGRIYDSENGKTCHQCRQKTMD AASCMN KG+KLCTIKFCHKCL+NRYGEKAEEVML+ DW CP+CR LCNC
Subjt:  MGSKKRAAADDNEHRAKRTKSPGVRVIGGRIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNC

Query:  SVCMKKKGLKPTGLLVRTAKATGFSSVSEMLLVNGSECLDHEKNAICKVASPNKQASENKESVMISPRKQGKENSLNGNNESNLNLQNQTPNSDKKKLKE
        SVCMKKKGLKPTGLLV TAKATG+SSVSEMLLVNGSECLDH K+   +VASPNK ASE++ SVMISPR +GKENSLNGNNESNLNLQNQTPNS+KK+LK+
Subjt:  SVCMKKKGLKPTGLLVRTAKATGFSSVSEMLLVNGSECLDHEKNAICKVASPNKQASENKESVMISPRKQGKENSLNGNNESNLNLQNQTPNSDKKKLKE

Query:  MKREELKEICNGNKVDVKCSKKSSKSSFEETSKKQTEVNGTNEYLPSKKKGSKKGTSKDAASDGSRPKDARKKNSLCHEDAKAPDAVKEEDKRSSKDTSY
        MKR+ELKEI NGNKVDVKC         EET KKQTE NGTNE L +KKK SKK  S+DA           ++NSLCHEDA  P                
Subjt:  MKREELKEICNGNKVDVKCSKKSSKSSFEETSKKQTEVNGTNEYLPSKKKGSKKGTSKDAASDGSRPKDARKKNSLCHEDAKAPDAVKEEDKRSSKDTSY

Query:  HLISAESTKEEKDLDIHKYAKTSDDAKNNKTKVKDKPLAKSQENRKGTTNVQNKEFGACVSLPPGSRLTTVADIELTTEDVGHALQFLEFCAAFGKALNL
         +I+AE + E KDLDIHKYAKTS+DAKN +TK+K+KP            N+QNKEFGACV LPP S+LTTVADIELTTEDVGHALQFLEFCAAFGKALNL
Subjt:  HLISAESTKEEKDLDIHKYAKTSDDAKNNKTKVKDKPLAKSQENRKGTTNVQNKEFGACVSLPPGSRLTTVADIELTTEDVGHALQFLEFCAAFGKALNL

Query:  KKGHAESVLKDLMRERTLRRSGRVHDSLSVRFHIQLLSLILKDMDEESEISSPTNDRSSWLMALKKCISASPFKLNDLKPDYFDGGDNCYVDLDFSKKLR
        KKGH   VLKDL RERT RR GRVHDSL+VRFHIQLLSLIL+DMDEES ISSPTND SSWL+ LKKCISAS FK+NDLKPDYFDGGDNCY DLDFSKKLR
Subjt:  KKGHAESVLKDLMRERTLRRSGRVHDSLSVRFHIQLLSLILKDMDEESEISSPTNDRSSWLMALKKCISASPFKLNDLKPDYFDGGDNCYVDLDFSKKLR

Query:  LLTYLCDEALNTAKLRNWIEEQNTNFVEEQKEVKEKLAALKDKEKQAKNKLQDELAKALIAKNGLPLSIAEHDAIVSQIKRDVAEAQAERLVALELASKR
        LLTYLCDEALNT KLRNWIEEQN NFVE+QKEV+EKL+ALKDKEKQAKNKL+DE AKALIAKNGLPLSIAEHDAIV+QIKRDVAE QAE+LVALELAS +
Subjt:  LLTYLCDEALNTAKLRNWIEEQNTNFVEEQKEVKEKLAALKDKEKQAKNKLQDELAKALIAKNGLPLSIAEHDAIVSQIKRDVAEAQAERLVALELASKR

Query:  RQRSDATRTVPIILDVNGRVFWKLRGFAGEGNILLQDMESWESVIPREKWSMYKGEQKQEIEKYISSLRLKR-PRLVEVAQTLPGGVIEAASVC
        +QRS+ATRTVPIILDVNGRVFWKLRGFAGEGNILLQDMESWESV P EKWSM+K EQKQEIEKYISSLRLKR PRLVEV QTLP G IEAASVC
Subjt:  RQRSDATRTVPIILDVNGRVFWKLRGFAGEGNILLQDMESWESVIPREKWSMYKGEQKQEIEKYISSLRLKR-PRLVEVAQTLPGGVIEAASVC

A0A6J1KD89 uncharacterized protein LOC111492807 isoform X1 | 9.4e-295 | 78.53
Query:  MGSKKRAAADDNEHRAKRTKSPGVRVIGGRIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNC
        MGSKKR  AD+NEHRAKRTKSPGVRVIGGRIYDSENGKTCHQCRQKTMD AASC N KG+KLCTIKFCHKCL+NRYGEKAEEVML+  W CP+CR LCNC
Subjt:  MGSKKRAAADDNEHRAKRTKSPGVRVIGGRIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNC

Query:  SVCMKKKGLKPTGLLVRTAKATGFSSVSEMLLVNGSECLDHEKNAICKVASPNKQASENKESVMISPRKQGKENSLNGNNESNLNLQNQTPNSDKKKLKE
        SVCMKKKGLKPTGLLV TAKATG+SSVSEMLLVNGSECLDH K+   +VASPNK ASE++ SVMIS R +GKENSLNGNN+SNLNLQNQTPNS+KK+LK+
Subjt:  SVCMKKKGLKPTGLLVRTAKATGFSSVSEMLLVNGSECLDHEKNAICKVASPNKQASENKESVMISPRKQGKENSLNGNNESNLNLQNQTPNSDKKKLKE

Query:  MKREELKEICNGNKVDVKCSKKSSKSSFEETSKKQTEVNGTNEYLPSKKKGSKKGTSKDAASDGSRPKDARKKNSLCHEDAKAPDAVKEEDKRSSKDTSY
        MKR+ LKEI NGNKVDVKC         EET KKQTE NGTNE L +KKK SKK  S+DA           ++NSLCHEDA                 S 
Subjt:  MKREELKEICNGNKVDVKCSKKSSKSSFEETSKKQTEVNGTNEYLPSKKKGSKKGTSKDAASDGSRPKDARKKNSLCHEDAKAPDAVKEEDKRSSKDTSY

Query:  HLISAESTKEEKDLDIHKYAKTSDDAKNNKTKVKDKPLAKSQENRKGTTNVQNKEFGACVSLPPGSRLTTVADIELTTEDVGHALQFLEFCAAFGKALNL
        HLI+AE +KE  DLDIHKYAKTS+DAKN +TK+K+KP            N+QNKEFGACVSLPP S+LTTVADIELTTEDVGHALQFLEFCAAFGKALNL
Subjt:  HLISAESTKEEKDLDIHKYAKTSDDAKNNKTKVKDKPLAKSQENRKGTTNVQNKEFGACVSLPPGSRLTTVADIELTTEDVGHALQFLEFCAAFGKALNL

Query:  KKGHAESVLKDLMRERTLRRSGRVHDSLSVRFHIQLLSLILKDMDEESEISSPTNDRSSWLMALKKCISASPFKLNDLKPDYFDGGDNCYVDLDFSKKLR
        KKGH   VLKDL RERT RR GRVHDSL+VRFHIQLLSLIL+DMDEES ISSPTND SSWL+ LKKCISAS FK+NDLKPDYF GGDNCY DLDFSKKLR
Subjt:  KKGHAESVLKDLMRERTLRRSGRVHDSLSVRFHIQLLSLILKDMDEESEISSPTNDRSSWLMALKKCISASPFKLNDLKPDYFDGGDNCYVDLDFSKKLR

Query:  LLTYLCDEALNTAKLRNWIEEQNTNFVEEQKEVKEKLAALKDKEKQAKNKLQDELAKALIAKNGLPLSIAEHDAIVSQIKRDVAEAQAERLVALELASKR
        LLTYLCDEALNT KLRNWIEEQN NFVE+QKEV+EKL+ALKDKEK+AKNKL+DE AKALIAKNGLPLSIAEHDAIV+QIKRDVAE QAE+LVALELAS +
Subjt:  LLTYLCDEALNTAKLRNWIEEQNTNFVEEQKEVKEKLAALKDKEKQAKNKLQDELAKALIAKNGLPLSIAEHDAIVSQIKRDVAEAQAERLVALELASKR

Query:  RQRSDATRTVPIILDVNGRVFWKLRGFAGEGNILLQDMESWESVIPREKWSMYKGEQKQEIEKYISSLRLKR-PRLVEVAQTLPGGVIEAASVC
        +QRS+ATRTVPIILDVNGRVFWKLRGFAGEGNILLQDMESWESV P EKWSM+K EQKQEIEKYISSLRLKR PRLVEV QTLP G IEAASVC
Subjt:  RQRSDATRTVPIILDVNGRVFWKLRGFAGEGNILLQDMESWESVIPREKWSMYKGEQKQEIEKYISSLRLKR-PRLVEVAQTLPGGVIEAASVC

SwissProt top hits | e value | % identity
Q32PH1 Cell division cycle-associated protein 7 | 1.3e-19 | 45.1
Query:  RIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNCSVCMKKKGLKPTGLLVRTAKATGFSSVSE
        +IY+   G TCHQCRQKT+D   +C N +   +   +FC  CL NRYGE+ ++ +L  +W+CP CR +CNCS C ++ G   TG+LV  AK  GF +V  
Subjt:  RIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNCSVCMKKKGLKPTGLLVRTAKATGFSSVSE

Query:  ML
         L
Subjt:  ML

Q4G059 Cell division cycle-associated 7-like protein | 1.0e-19 | 45.1
Query:  RIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNCSVCMKKKGLKPTGLLVRTAKATGFSSVSE
        ++YD   G TCHQCRQKT+D    C N +G      +FC  CL NRYGE     +L   W CP CR +CNCS C ++ G   TG+L+  AK  G+ +V E
Subjt:  RIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNCSVCMKKKGLKPTGLLVRTAKATGFSSVSE

Query:  ML
         L
Subjt:  ML

Q922M5 Cell division cycle-associated 7-like protein | 1.3e-19 | 44.12
Query:  RIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNCSVCMKKKGLKPTGLLVRTAKATGFSSVSE
        ++YD   G TCHQCRQKT+D    C N+    +   +FC  CL NRYGE     +L   W CP CR +CNCS C ++ G   TG+L+  AK  G+ +V E
Subjt:  RIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNCSVCMKKKGLKPTGLLVRTAKATGFSSVSE

Query:  ML
         L
Subjt:  ML

Q96GN5 Cell division cycle-associated 7-like protein | 3.1e-21 | 47.12
Query:  RIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIK--FCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNCSVCMKKKGLKPTGLLVRTAKATGFSSV
        +IYD   G TCHQCRQKT+D    C N   +  C ++  FC  CL NRYGE     +L  DW CP CR +CNCS C K+ G   TG+L+  AK  G+ +V
Subjt:  RIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIK--FCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNCSVCMKKKGLKPTGLLVRTAKATGFSSV

Query:  SEML
         E L
Subjt:  SEML

Q9BWT1 Cell division cycle-associated protein 7 | 2.3e-19 | 43.93
Query:  RIYDSENGKTCHQCRQKTMDFAASCMNK-----KGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNCSVCMKKKGLKPTGLLVRTAKATGF
        +IY+   G TCHQCRQKT+D   +C N      +G+      FC  CL NRYGE+  + +L  +W+CP CR +CNCS C ++ G   TG+LV  AK  GF
Subjt:  RIYDSENGKTCHQCRQKTMDFAASCMNK-----KGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNCSVCMKKKGLKPTGLLVRTAKATGF

Query:  SSVSEML
         +V   L
Subjt:  SSVSEML

Arabidopsis top hits | e value | % identity
AT1G67270.1 Zinc-finger domain of monoamine-oxidase A repressor R1 protein | 1.1e-88 | 36.83
Query:  KRTKSPGVRVIGGRIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNCSVCMKKKGLKPTGLLV
        KR  +PGVRV+G RIYDS+NGK+CHQCRQKT+DFAA C   + +KLC IKFC+KCL  RYGE AEEV   +DW CP CR +C CSVC K +GL+PTG+L 
Subjt:  KRTKSPGVRVIGGRIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNCSVCMKKKGLKPTGLLV

Query:  RTAKATGFSSVSEMLLVNGSECLDHEKNAICKVASPNKQASENKESVMISPRKQGKENSLNGNNESNLNLQNQTPNSDKKKLKEMKREELKEICNGNKVD
          AKA G+SSV  +L   G     H+K A  K                                          P    KK+ ++    L   CN     
Subjt:  RTAKATGFSSVSEMLLVNGSECLDHEKNAICKVASPNKQASENKESVMISPRKQGKENSLNGNNESNLNLQNQTPNSDKKKLKEMKREELKEICNGNKVD

Query:  VKCSKKSSKSSFEETSKKQTEVNGTNEYLPSKKKGSKKGTSKDAASDGSRPKDARKKNSLCHEDAKAPDAVKEEDKRSSKDTSYHLISAESTKEEKDLDI
                    E +S     V GT   +    K  KK              D RK+       AK  D +KEE            I  E+         
Subjt:  VKCSKKSSKSSFEETSKKQTEVNGTNEYLPSKKKGSKKGTSKDAASDGSRPKDARKKNSLCHEDAKAPDAVKEEDKRSSKDTSYHLISAESTKEEKDLDI

Query:  HKYAKTSDDAKNNKTKVKDKPLAKSQENRKGTTNVQNKEFGACVSLPPGSRLTTVADIELTTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRER
                                                     LP G  LT V+ I++ TE+ G+  Q  EFC+AFGKAL LK+GHAE+++++L    
Subjt:  HKYAKTSDDAKNNKTKVKDKPLAKSQENRKGTTNVQNKEFGACVSLPPGSRLTTVADIELTTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRER

Query:  TLRRSGRVHDSLSV-RFHIQLLSLILKDMDEESEISSPTNDRSSWLMALKKCISASPFKLNDLKPDYFDGGDNCYVDLDFSKKLRLLTYLCDEALNTAKL
           R+ R     S+ +  IQLL LI K  D E  +S    D SSW  A+ + +S S    ++L  + F GG   Y  ++ S+KL+LL +LCDE+L+T  +
Subjt:  TLRRSGRVHDSLSV-RFHIQLLSLILKDMDEESEISSPTNDRSSWLMALKKCISASPFKLNDLKPDYFDGGDNCYVDLDFSKKLRLLTYLCDEALNTAKL

Query:  RNWIEEQNTNFVEEQKEVKEKLAALKDKEKQAKNKLQDELAKALIAKNGLPLSIAEHDAIVSQIKRDVAEAQAERLVALELASKRRQRSDATRTVPIILD
        RN+I  Q     E +KE K+K AA K KEKQ K K+Q E+AK+++ KNG PLSI EH++IVSQI+ +  EA  E + A  + SK     DA RT PI+LD
Subjt:  RNWIEEQNTNFVEEQKEVKEKLAALKDKEKQAKNKLQDELAKALIAKNGLPLSIAEHDAIVSQIKRDVAEAQAERLVALELASKRRQRSDATRTVPIILD

Query:  VNGRVFWKLRGFAGEGNILLQDMESWESVIPREKWSMYKGEQKQEIEKYISSLRLKR
         NG V WKL+ F  E   LLQD+ +++ + P E+W  +K EQK EIE  IS +R K+
Subjt:  VNGRVFWKLRGFAGEGNILLQDMESWESVIPREKWSMYKGEQKQEIEKYISSLRLKR

AT1G67780.1 Zinc-finger domain of monoamine-oxidase A repressor R1 protein | 1.9e-90 | 34.59
Query:  AAADDNEHRAKRTKSPGVRVIGGRIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNCSVCMKK
        A   + E  +KRT +PGVRV+GGRIYDS NGKTCHQCRQKTMDF ASC   K +K CTI FCHKCL+NRYGE AEEV   +DW CP+CR +CNCS C KK
Subjt:  AAADDNEHRAKRTKSPGVRVIGGRIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNCSVCMKK

Query:  KGLKPTGLLVRTAKATGFSSVSEMLLVNGSECLDHEKNAICKVASPNKQASENKESVMISPRKQGKENSLNGNNESNLNLQNQTPNSDKKKLKEMKREEL
        +GL PTG+L   AKA+G +SVS +L V G +   ++K                                                   K KL +      
Subjt:  KGLKPTGLLVRTAKATGFSSVSEMLLVNGSECLDHEKNAICKVASPNKQASENKESVMISPRKQGKENSLNGNNESNLNLQNQTPNSDKKKLKEMKREEL

Query:  KEICNGNKVDVKCSKKSSKSSFEETSKKQTEVNGTNEYLPSKKKGSKKGTSKDAASDGSRPKDARKKNSLCHEDAKAPDAVKEEDKRSSKDTSYHLISAE
                            S E +S     V+GT+  +    K  KK   K             KK +  H+                           
Subjt:  KEICNGNKVDVKCSKKSSKSSFEETSKKQTEVNGTNEYLPSKKKGSKKGTSKDAASDGSRPKDARKKNSLCHEDAKAPDAVKEEDKRSSKDTSYHLISAE

Query:  STKEEKDLDIHKYAKTSDDAKNNKTKVKDKPLAKSQENRKGTTNVQNKEFGACVSLPPGSRLTTVADIELTTEDVGHALQFLEFCAAFGKALNLKKGHAE
          KEE   +                                              LP G  L +V+ + + TE+ G+  Q  EFC+AFGKAL LK+G AE
Subjt:  STKEEKDLDIHKYAKTSDDAKNNKTKVKDKPLAKSQENRKGTTNVQNKEFGACVSLPPGSRLTTVADIELTTEDVGHALQFLEFCAAFGKALNLKKGHAE

Query:  SVLKDLMRERTLRRSGRVHDSLSVRFHIQLLSLILKDMDEESEISSPTNDRSSWLMALKKCISASPFKLNDLKPDYFDGGDNCYVDLDFSKKLRLLTYLC
        +V+++L      R + R      ++  IQLL LI KD +    +S   N    W  AL + +  S    ++  P+ F+ G   Y  +D S++L+LL ++C
Subjt:  SVLKDLMRERTLRRSGRVHDSLSVRFHIQLLSLILKDMDEESEISSPTNDRSSWLMALKKCISASPFKLNDLKPDYFDGGDNCYVDLDFSKKLRLLTYLC

Query:  DEALNTAKLRNWIEEQNTNFVEEQKEVKEKLAALKDKEKQAKNKLQDELAKALIAKNGLPLSIAEHDAIVSQIKRDVAEAQAERLVALELASKRRQRSDA
        DE+L+T  +RN I+ Q+T       E K K AA K+KEKQ K KLQ +LAKA++ KNG PLSI EH+ I+SQI+ +  EA    + A  + S   +  DA
Subjt:  DEALNTAKLRNWIEEQNTNFVEEQKEVKEKLAALKDKEKQAKNKLQDELAKALIAKNGLPLSIAEHDAIVSQIKRDVAEAQAERLVALELASKRRQRSDA

Query:  TRTVPIILDVNGRVFWKLRGFAGEGNILLQDMESWESVIPREKWSMYKGEQKQEIEKYISSLRLK
         RT PI+++ NG V WKL  +  E   LLQD+ +++ +   EKW  +K EQK +IE YIS  R K
Subjt:  TRTVPIILDVNGRVFWKLRGFAGEGNILLQDMESWESVIPREKWSMYKGEQKQEIEKYISSLRLK

AT2G23530.1 Zinc-finger domain of monoamine-oxidase A repressor R1 | 1.0e-19 | 35.14
Query:  GGRIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNCSVCMKKKGLKPTGLLVRTAKATGFSSV
        G RIYD  NGKTCHQCRQKTM     C       L   +FC  CL  RYGE   E +   DW CP CR +CNCS+C   KG  PTG + R   A G+ SV
Subjt:  GGRIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNCSVCMKKKGLKPTGLLVRTAKATGFSSV

Query:  SEMLLVNGSECLD----HEKNAICKVASPNKQASENKESVMISPRKQGKENSLNGNNESNLNLQNQ---TPNSDKKKLKEMKREE
        +  L+       D     + +A   ++   K A +    ++ +     KE   N N + N +L  +     N + +    +K+EE
Subjt:  SEMLLVNGSECLD----HEKNAICKVASPNKQASENKESVMISPRKQGKENSLNGNNESNLNLQNQ---TPNSDKKKLKEMKREE

AT4G37110.1 Zinc-finger domain of monoamine-oxidase A repressor R1 | 1.7e-17 | 31.27
Query:  GGRIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNCSVCMKKKGLKPTGLLVRTAKATGFSSV
        G RIYD   GK CHQCRQKT+ +   C   +       +FC  CL  RYGE   E +   DW CP CRD+CNCS C  KKG  PTG   R     G+ SV
Subjt:  GGRIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNCSVCMKKKGLKPTGLLVRTAKATGFSSV

Query:  SEMLLVNGSECLDHEKNAICKVASPNKQASENKESVMISPRKQGKENSLNGNNESNLNLQNQTPNSDKKKL--------------KEMKREELKEICNGN
        +  L+    +    E +     A  + QASE    ++I+   Q  +N ++ N + + +  N+ P+S +K L               ++K  ++KE+    
Subjt:  SEMLLVNGSECLDHEKNAICKVASPNKQASENKESVMISPRKQGKENSLNGNNESNLNLQNQTPNSDKKKL--------------KEMKREELKEICNGN

Query:  KVDVKCSKKSSKSSFEETSKKQTEVNGTNEYLPSKKKGSKKGTSKDAASDGSRPKDARK
         V +    + S     ET +     N  N    SK+K S      +  S G R +  RK
Subjt:  KVDVKCSKKSSKSSFEETSKKQTEVNGTNEYLPSKKKGSKKGTSKDAASDGSRPKDARK

AT5G38690.1 Zinc-finger domain of monoamine-oxidase A repressor R1 protein; e value 7.7e-116; %identity 41.08
Query:  KRTKSPGVRVIGGRIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNCSVCMKKKGLKPTGLLV
        ++T +PG  V   RI DS NGKTCHQCRQK  D   SC+ KK +K C IK C KC+LNRYGE A+EV L++DW CPKCR  CNCS CMKK+G KPTG+LV
Subjt:  KRTKSPGVRVIGGRIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNCSVCMKKKGLKPTGLLV

Query:  RTAKATGFSSVSEMLLVNGSECLDHEKNAICKVASPNKQASENKESVMISPRKQGKENSLNGNNESNLNLQNQTPNSDKKKLKEMKREELKEICNGNKVD
         TAK TGFSSVSE+L  +GS+   + K              + +  V++SP K  +ENS+   + S             KK ++ KREELK+I NG   +
Subjt:  RTAKATGFSSVSEMLLVNGSECLDHEKNAICKVASPNKQASENKESVMISPRKQGKENSLNGNNESNLNLQNQTPNSDKKKLKEMKREELKEICNGNKVD

Query:  VKCSKKSSKSSFEETSKKQTEVNGTNEYLPSKKKGSKKGTSKDAASDGSRPKDARKKNSLCHEDAKAPDAVKEEDKRSSKDTSYHLISAESTKEEKDLDI
            KKS+        K    V+ T E +   KK   +G  K   + G       K+N++  +  +   A+K+                   KEE  ++I
Subjt:  VKCSKKSSKSSFEETSKKQTEVNGTNEYLPSKKKGSKKGTSKDAASDGSRPKDARKKNSLCHEDAKAPDAVKEEDKRSSKDTSYHLISAESTKEEKDLDI

Query:  HKYAKTSDDAKNNKTKVKDKPLAKSQENRKGTTNVQNKEFGACVSLPPGSRLTTVADIELTTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRER
                                                     +P G+   TV+ I+L  ED G+  QFLEFC+AFGKAL+L+KG AE V+++++  R
Subjt:  HKYAKTSDDAKNNKTKVKDKPLAKSQENRKGTTNVQNKEFGACVSLPPGSRLTTVADIELTTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRER

Query:  TLRRSGRVHDSLSVRFHIQLLSLILKDMDEESEISSPTNDRSSWLMALKKCISASPFKLNDLKPDYFDGGDNCYVDLDFSKKLRLLTYLCDEALNTAKLR
        + RR      S   +  IQLL++IL+D  E S   S T+   SW   + +C+S S  KL+D  P+ F+ G + Y  L+ SK+L+LL +LCDE L T  +R
Subjt:  TLRRSGRVHDSLSVRFHIQLLSLILKDMDEESEISSPTNDRSSWLMALKKCISASPFKLNDLKPDYFDGGDNCYVDLDFSKKLRLLTYLCDEALNTAKLR

Query:  NWIEEQNTNFVEEQKEVKEKLAALKDKEKQAKNKLQDELAKALIAKNGLPLSIAEHDAIVSQIKRDVAEAQAERLVALELASKRRQRS-DATRTVPIILD
        N I+ QN   VE +KE KEK+ A KDKEKQ K KLQDELA+A+ AKNG+PL I EHDAIVS+I  +  E  +E   A+++ SK+ Q S DA RT P+ LD
Subjt:  NWIEEQNTNFVEEQKEVKEKLAALKDKEKQAKNKLQDELAKALIAKNGLPLSIAEHDAIVSQIKRDVAEAQAERLVALELASKRRQRS-DATRTVPIILD

Query:  VNGRVFWKLRGFAGEGNILLQDMESWESVIPREKWSMYKGEQKQEIEKYISSLRLKRPRLVEVAQTL
         NG +FW+L+ +  E NILLQD+ SW  V P EKW  +  EQK EIEKYIS +R+KR +  + A T+
Subjt:  VNGRVFWKLRGFAGEGNILLQDMESWESVIPREKWSMYKGEQKQEIEKYISSLRLKRPRLVEVAQTL


Sequences
CDS sequence
ATGGGGAGCAAGAAGAGGGCTGCTGCCGATGATAACGAGCACAGAGCGAAACGGACAAAGTCTCCTGGAGTTCGAGTCATTGGGGGCCGGATCTATGATTCTGAGAACGG
CAAAACTTGCCATCAATGCCGACAAAAAACCATGGATTTTGCCGCATCCTGCATGAACAAGAAAGGGGAGAAGCTGTGTACCATAAAATTCTGTCACAAGTGCCTTCTAA
ACAGATATGGAGAGAAAGCAGAAGAGGTGATGCTCAGGGAAGACTGGAATTGTCCCAAGTGCAGAGACCTTTGCAATTGCAGCGTTTGCATGAAGAAAAAGGGTCTTAAA
CCCACTGGTTTGCTAGTACGCACAGCCAAAGCAACTGGGTTTTCTTCTGTCTCAGAAATGCTTCTAGTGAATGGTTCTGAGTGTCTAGATCATGAGAAGAATGCGATATG
CAAAGTTGCTTCACCAAACAAGCAGGCTTCTGAAAACAAGGAGTCTGTCATGATTTCACCAAGGAAACAAGGGAAGGAAAATTCTTTAAATGGAAATAATGAATCGAACT
TGAATTTGCAGAATCAAACTCCAAATTCTGATAAAAAGAAGTTGAAGGAAATGAAGCGTGAAGAATTGAAAGAAATATGCAATGGAAATAAGGTTGATGTTAAATGCTCA
AAGAAGAGCAGCAAATCCAGCTTTGAGGAAACTTCTAAGAAGCAAACTGAAGTAAATGGGACGAATGAATATCTTCCCTCAAAGAAAAAGGGCTCCAAAAAAGGGACCTC
TAAGGATGCTGCCTCTGATGGTAGCAGACCTAAGGATGCTAGAAAAAAAAACAGTTTGTGTCATGAGGATGCAAAAGCACCAGATGCTGTGAAAGAGGAAGACAAGAGAT
CCTCAAAGGACACTTCTTACCATCTAATTTCAGCTGAAAGCACAAAAGAAGAAAAAGATTTGGATATTCACAAATATGCCAAGACATCAGACGATGCAAAAAATAATAAA
ACAAAGGTGAAAGACAAGCCACTAGCGAAGTCCCAGGAAAATAGGAAAGGCACTACGAATGTTCAGAACAAAGAATTTGGTGCCTGTGTTTCTTTGCCTCCGGGTTCAAG
GTTAACAACTGTAGCAGATATTGAACTGACCACAGAAGATGTTGGTCATGCATTGCAGTTTTTAGAATTCTGTGCAGCTTTCGGAAAGGCTCTTAATTTAAAGAAAGGGC
ATGCTGAATCCGTACTCAAAGACCTAATGCGTGAGAGAACTCTGAGAAGAAGTGGTCGAGTGCACGATTCACTGAGTGTTCGATTTCATATTCAACTCCTGTCTCTGATA
TTGAAGGATATGGATGAAGAGTCTGAAATCTCTAGTCCTACAAATGACAGAAGCTCATGGTTGATGGCTTTGAAGAAATGTATTTCTGCATCCCCATTTAAGTTGAATGA
TCTGAAACCAGATTACTTTGATGGAGGAGACAATTGTTATGTTGACTTAGACTTCTCAAAAAAGCTCAGACTATTGACTTACCTATGTGATGAGGCTCTCAATACTGCAA
AATTGAGAAACTGGATTGAAGAACAAAATACCAACTTTGTGGAGGAACAAAAGGAAGTAAAAGAAAAACTTGCTGCGCTAAAGGATAAGGAGAAACAAGCTAAGAACAAA
TTGCAAGATGAGTTGGCCAAAGCTCTTATTGCCAAGAATGGTCTTCCACTTTCAATTGCCGAGCATGATGCTATTGTTTCGCAAATAAAGAGAGACGTAGCTGAAGCTCA
AGCTGAGAGGCTTGTTGCATTGGAATTGGCATCTAAGAGGAGACAAAGATCAGATGCTACTAGGACAGTTCCCATCATTTTGGATGTTAATGGTCGTGTATTTTGGAAAT
TAAGAGGCTTTGCTGGTGAAGGGAATATTCTGCTACAAGATATGGAAAGCTGGGAATCAGTCATTCCTAGGGAAAAGTGGTCTATGTATAAAGGCGAGCAGAAACAAGAG
ATAGAAAAATACATTTCTTCTCTAAGGTTGAAGAGGCCTAGGTTAGTGGAAGTAGCTCAAACTCTTCCAGGTGGAGTTATTGAGGCAGCCTCAGTATGTTGA
mRNA sequence
GTCAAGTTTAAGATAAAACTGGACTCTCCCAATCAGTTGAAAAGTTGGGCGGGGAGAAATTGAATTCTCCCCAAGCAAAATAGTATTCAGGAAAACCCAAAAAAAATTCG
CGGTTCTCTGAAAGTTTTGGGTTGCAGTACGGAAATGCACTCCAGTTCCAGATCTTCAAAAGTGGCGGGAAATCCTTTATCGCCCCAAGATTTCTTTTCAAAATTAGTTT
TCACTATCGATTTTTTTCCCGCCGGAACAGAAGAATCTCGAGAAGATATAACACCCGCCAATCCGGAAATGTAAGCTTTTCTTCCTCTCTAGCGTCTTGGAACTCCCCAA
AGGCCAACCCCATTTGTTCAGCAGATTATTTCTACGAAATCCCACCATTTGTTGCCTGTTCTTGAAGCTGGGTTTCGAAGATTTTTTTCTGAACTGAACTACAACGAGAT
AATGGGGAGCAAGAAGAGGGCTGCTGCCGATGATAACGAGCACAGAGCGAAACGGACAAAGTCTCCTGGAGTTCGAGTCATTGGGGGCCGGATCTATGATTCTGAGAACG
GCAAAACTTGCCATCAATGCCGACAAAAAACCATGGATTTTGCCGCATCCTGCATGAACAAGAAAGGGGAGAAGCTGTGTACCATAAAATTCTGTCACAAGTGCCTTCTA
AACAGATATGGAGAGAAAGCAGAAGAGGTGATGCTCAGGGAAGACTGGAATTGTCCCAAGTGCAGAGACCTTTGCAATTGCAGCGTTTGCATGAAGAAAAAGGGTCTTAA
ACCCACTGGTTTGCTAGTACGCACAGCCAAAGCAACTGGGTTTTCTTCTGTCTCAGAAATGCTTCTAGTGAATGGTTCTGAGTGTCTAGATCATGAGAAGAATGCGATAT
GCAAAGTTGCTTCACCAAACAAGCAGGCTTCTGAAAACAAGGAGTCTGTCATGATTTCACCAAGGAAACAAGGGAAGGAAAATTCTTTAAATGGAAATAATGAATCGAAC
TTGAATTTGCAGAATCAAACTCCAAATTCTGATAAAAAGAAGTTGAAGGAAATGAAGCGTGAAGAATTGAAAGAAATATGCAATGGAAATAAGGTTGATGTTAAATGCTC
AAAGAAGAGCAGCAAATCCAGCTTTGAGGAAACTTCTAAGAAGCAAACTGAAGTAAATGGGACGAATGAATATCTTCCCTCAAAGAAAAAGGGCTCCAAAAAAGGGACCT
CTAAGGATGCTGCCTCTGATGGTAGCAGACCTAAGGATGCTAGAAAAAAAAACAGTTTGTGTCATGAGGATGCAAAAGCACCAGATGCTGTGAAAGAGGAAGACAAGAGA
TCCTCAAAGGACACTTCTTACCATCTAATTTCAGCTGAAAGCACAAAAGAAGAAAAAGATTTGGATATTCACAAATATGCCAAGACATCAGACGATGCAAAAAATAATAA
AACAAAGGTGAAAGACAAGCCACTAGCGAAGTCCCAGGAAAATAGGAAAGGCACTACGAATGTTCAGAACAAAGAATTTGGTGCCTGTGTTTCTTTGCCTCCGGGTTCAA
GGTTAACAACTGTAGCAGATATTGAACTGACCACAGAAGATGTTGGTCATGCATTGCAGTTTTTAGAATTCTGTGCAGCTTTCGGAAAGGCTCTTAATTTAAAGAAAGGG
CATGCTGAATCCGTACTCAAAGACCTAATGCGTGAGAGAACTCTGAGAAGAAGTGGTCGAGTGCACGATTCACTGAGTGTTCGATTTCATATTCAACTCCTGTCTCTGAT
ATTGAAGGATATGGATGAAGAGTCTGAAATCTCTAGTCCTACAAATGACAGAAGCTCATGGTTGATGGCTTTGAAGAAATGTATTTCTGCATCCCCATTTAAGTTGAATG
ATCTGAAACCAGATTACTTTGATGGAGGAGACAATTGTTATGTTGACTTAGACTTCTCAAAAAAGCTCAGACTATTGACTTACCTATGTGATGAGGCTCTCAATACTGCA
AAATTGAGAAACTGGATTGAAGAACAAAATACCAACTTTGTGGAGGAACAAAAGGAAGTAAAAGAAAAACTTGCTGCGCTAAAGGATAAGGAGAAACAAGCTAAGAACAA
ATTGCAAGATGAGTTGGCCAAAGCTCTTATTGCCAAGAATGGTCTTCCACTTTCAATTGCCGAGCATGATGCTATTGTTTCGCAAATAAAGAGAGACGTAGCTGAAGCTC
AAGCTGAGAGGCTTGTTGCATTGGAATTGGCATCTAAGAGGAGACAAAGATCAGATGCTACTAGGACAGTTCCCATCATTTTGGATGTTAATGGTCGTGTATTTTGGAAA
TTAAGAGGCTTTGCTGGTGAAGGGAATATTCTGCTACAAGATATGGAAAGCTGGGAATCAGTCATTCCTAGGGAAAAGTGGTCTATGTATAAAGGCGAGCAGAAACAAGA
GATAGAAAAATACATTTCTTCTCTAAGGTTGAAGAGGCCTAGGTTAGTGGAAGTAGCTCAAACTCTTCCAGGTGGAGTTATTGAGGCAGCCTCAGTATGTTGACATTTGC
CATCGCTAAGCCAAAAGAGTAGTATGGTCATATCATCAGCCTTTGTTTTTTTGAAGCAGATTGCCAAAACAAATGACTTGAAGGAGATTTAGTTTCCATTGCCCTTCTCT
TCTAGATGGGACATCCGTATCCCTCTATTTTGTAACTTCTTTAAGAGAAGCTTACTTCACATCATACTTTAGCAGAAGGAAAAGAAAAAGGAAAAAACAGGGATAACCTC
AGCTAATTTTAGCTGTGTTTTTGTACACTTAAAGTTGCATCATGGCAATGTACTTGTTTCCTCCCAAACCATGATTTGGTATTGGTTTTGGGGATATTAGAAGAGGGAAT
CCAATATAGGATATATAACTGATACCTAATGTATTGTATGTTGTGATGTGTAACTTCACAAATGTTTTGGGAATTTATTAGAATTGGTTTCTCATCACTTCTGCTTTTCC
CTCAAATTTGAG
Protein sequence
MGSKKRAAADDNEHRAKRTKSPGVRVIGGRIYDSENGKTCHQCRQKTMDFAASCMNKKGEKLCTIKFCHKCLLNRYGEKAEEVMLREDWNCPKCRDLCNCSVCMKKKGLK
PTGLLVRTAKATGFSSVSEMLLVNGSECLDHEKNAICKVASPNKQASENKESVMISPRKQGKENSLNGNNESNLNLQNQTPNSDKKKLKEMKREELKEICNGNKVDVKCS
KKSSKSSFEETSKKQTEVNGTNEYLPSKKKGSKKGTSKDAASDGSRPKDARKKNSLCHEDAKAPDAVKEEDKRSSKDTSYHLISAESTKEEKDLDIHKYAKTSDDAKNNK
TKVKDKPLAKSQENRKGTTNVQNKEFGACVSLPPGSRLTTVADIELTTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTLRRSGRVHDSLSVRFHIQLLSLI
LKDMDEESEISSPTNDRSSWLMALKKCISASPFKLNDLKPDYFDGGDNCYVDLDFSKKLRLLTYLCDEALNTAKLRNWIEEQNTNFVEEQKEVKEKLAALKDKEKQAKNK
LQDELAKALIAKNGLPLSIAEHDAIVSQIKRDVAEAQAERLVALELASKRRQRSDATRTVPIILDVNGRVFWKLRGFAGEGNILLQDMESWESVIPREKWSMYKGEQKQE
IEKYISSLRLKRPRLVEVAQTLPGGVIEAASVC