; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg018727 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg018727
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionZf-4CXXC_R1 domain-containing protein
Genome locationscaffold3:13702535..13711407
RNA-Seq ExpressionSpg018727
SyntenySpg018727
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
InterPro domainsIPR018501 - DDT domain
IPR040221 - CDCA7/CDA7L


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0053380.1 zf-4CXXC_R1 domain-containing protein [Cucumis melo var. makuwa]1.5e-23280.37Show/hide
Query:  QESVMISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVD-SKCSKKSSPTKSSSEEISKKQTEANGTNESLPSKKKGSKKRT
        +ESVM++PRKQGKENSLNGNNESNLNLQ QTP  DKKKLKEMKREELKEICN NKVD +K SKKSSPTKSS  EI K+QTEANG N+SLPSKKKGS+K T
Subjt:  QESVMISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVD-SKCSKKSSPTKSSSEEISKKQTEANGTNESLPSKKKGSKKRT

Query:  SKDAASDVSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYHM--ADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLVKSQENMKCTVNIQNKEY
        SKDAASDV   KDARE+NS CHE+AK  D ++EE+++ SK V YH+  A+D  E K+L+ HKYAKTS+D K NK KV DKP  KS+EN KC+VNIQNKE+
Subjt:  SKDAASDVSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYHM--ADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLVKSQENMKCTVNIQNKEY

Query:  GACVPLPPGSRLTTIADIEITTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEEYGYKLRLLS
        GA VP PPGSRLTT+ADIE+TT+DVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRER QRRRCRVHDS+TVRFHIQLLSLILKDMDEE        S
Subjt:  GACVPLPPGSRLTTIADIEITTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEEYGYKLRLLS

Query:  SISSPTNDRSSWLLALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLAALKDKEKQAK
        +I SPTNDRSSWLLALKKCISASPFKLNDLKPDYFDGGDNCYDDL FSKKLRLLTY+CDEALNTTKLR+WIE+QN+NFVEEQKE+KEKLAALKDKEKQAK
Subjt:  SISSPTNDRSSWLLALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLAALKDKEKQAK

Query:  HKLQDELAKALIAKNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSEATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSE
         KL+DELAKALI KNG+PLSIAEHDAI+SQIK DVAEAQAERLVALELASKRR  S ATRT P++LDVNGRVFWKLRGF+ +GNILLQDMESW +VNPSE
Subjt:  HKLQDELAKALIAKNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSEATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSE

Query:  KWSMYKGEQKQEIEKYISSLRLKRPRLVEVAQTLPGGGIEAASVC
        KW MYK EQKQEIEKYISSLR KR +L E+ QTLPGGG E AS C
Subjt:  KWSMYKGEQKQEIEKYISSLRLKRPRLVEVAQTLPGGGIEAASVC

XP_011653347.1 uncharacterized protein LOC101206502 isoform X1 [Cucumis sativus]3.3e-22779.78Show/hide
Query:  QESVMISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVDSKCSKKSSPTKSSSEEISKKQTEANGTNESLPSKKKGSKKRTS
        +ESVMI+PRKQGKENSLNGNNESNLNLQ QTP  D+KKLKEMKREELKEICN NKVD+K SKKSS TKSS  EISK+QTEANG N+SLPSKKKG +K TS
Subjt:  QESVMISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVDSKCSKKSSPTKSSSEEISKKQTEANGTNESLPSKKKGSKKRTS

Query:  KDAASDVSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYHM--ADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLVKSQENMKCTVNIQNKEYG
        KDAASDVS  KDARE+NS  HE+AKA D  +EE+++ SK V YH+  A D  E K+L IHKYA TS+D K NK KV DKP  KSQEN KC+VNIQNKE+G
Subjt:  KDAASDVSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYHM--ADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLVKSQENMKCTVNIQNKEYG

Query:  ACVPLPPGSRLTTIADIEITTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEEYGYKLRLLSS
        A VP  PG RLTT+ADIE+TT+DVGHALQFLEFCAAFGKALN+KKG+AESVLKDLMRER Q RRCRVHDS+TVRFHIQLLSLILKDMDEE        S+
Subjt:  ACVPLPPGSRLTTIADIEITTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEEYGYKLRLLSS

Query:  ISSPTNDRSSWLLALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLAALKDKEKQAKH
        I SPTNDRSSWLLALKKCISASPFK NDLKPDYFDGGDNCYDDLDFSKKLRLLTY+CDEALNTTKLR+WIE+QN+NF+EEQKE+KEKLAALKDKEKQAK 
Subjt:  ISSPTNDRSSWLLALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLAALKDKEKQAKH

Query:  KLQDELAKALIAKNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSEATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEK
        KLQDELAKALIAKNG+PLSIAE+DAI+SQIK DVAEAQAERL ALELASKRR+ S ATRTVP++LDVNGRVFWKLRGF+ EGNILLQDMESW + NPSEK
Subjt:  KLQDELAKALIAKNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSEATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEK

Query:  WSMYKGEQKQEIEKYISSLRLKRPRLVEVAQTLPGGGIEAASVC
        W +YK EQKQEIEKYISSL  KRP+LVE  QTLPGGG E AS C
Subjt:  WSMYKGEQKQEIEKYISSLRLKRPRLVEVAQTLPGGGIEAASVC

XP_022155054.1 uncharacterized protein LOC111022193 isoform X1 [Momordica charantia]1.0e-22880.15Show/hide
Query:  QESVMISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVDSKCSKKSSPTKSSSEEISKKQTEANGTNESLPSKKKGSKKRTS
        +E VMISPRKQGKEN LNGNNESNLNLQ QTP SDKKKLKEMK EEL+EICNGNKVD KCS KSSPTKSS EE SK+QTEANG N SLPSKKKGSKK+TS
Subjt:  QESVMISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVDSKCSKKSSPTKSSSEEISKKQTEANGTNESLPSKKKGSKKRTS

Query:  KDAASDVSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYHM--ADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLVKSQENMKCTVNIQNKEYG
        KDAASDVSR KDARE+NS CHEDAKAPDA+KEE++++SK VS H+  A+ T E   LDIHK      DA  NK +VKDKPL  SQENMKCTVN  NKE+ 
Subjt:  KDAASDVSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYHM--ADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLVKSQENMKCTVNIQNKEYG

Query:  ACVPLPPGSRLTTIADIEITTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEEYGYKLRLLSS
        ACV LP GSRLTT+A++E+TTEDVGHALQFLEFCAAFGKALNL+KGHAESVLKDLMR+RTQRRR RV+DS+TVRFHIQLLSLILKDMDEE        S+
Subjt:  ACVPLPPGSRLTTIADIEITTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEEYGYKLRLLSS

Query:  ISSPTNDRSSWLLALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLAALKDKEKQAKH
        ISSPTND SSWLLALKKCISAS FKL+DLKPDYFD GD+CYDDLDFS+KLRLLTY+CDEALNTTKLRNWI+EQN NFVEEQKE KEKLAALKDKEKQAKH
Subjt:  ISSPTNDRSSWLLALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLAALKDKEKQAKH

Query:  KLQDELAKALIAKNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSEATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEK
        KL+DELAKALIAKNG+PL+IAEHDAIVSQ+K+DVA AQ+ERLV LE+ASKR++ S+ATRTVPI+LD NGRVFW LRGF+GEGNILLQDMESWE VNPSEK
Subjt:  KLQDELAKALIAKNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSEATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEK

Query:  WSMYKGEQKQEIEKYISSLRLKRPRLVEVAQTLPGGGIEAASVC
        WSMYKGEQKQEIE+YISSLR+KR RLV+ AQTLP G   AAS+C
Subjt:  WSMYKGEQKQEIEKYISSLRLKRPRLVEVAQTLPGGGIEAASVC

XP_038874924.1 uncharacterized protein LOC120067431 isoform X1 [Benincasa hispida]8.3e-23980.97Show/hide
Query:  CMLFHLQRSHAFQQESVMISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVDSKCSKKSSPTKSSSEEISKKQTEANGTNES
        C +    +  +  +ES+M +PRKQGKENSLNGNNESNLNLQ+QTP SDKKKLK+MK EELKEIC GNKVD K SKKSSPTKSS EEISK+QTEANG N+S
Subjt:  CMLFHLQRSHAFQQESVMISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVDSKCSKKSSPTKSSSEEISKKQTEANGTNES

Query:  LPSKKKGSKKRTSKDAASDVSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYH--MADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLVKSQEN
        LPSKKKGS K  SKDAASDV + KDARE+NS CHEDAKAPDAV+EE+++ SK VSYH   A+DT E K+LDIHKYAKTS+DAK NK KV DKPL KS+EN
Subjt:  LPSKKKGSKKRTSKDAASDVSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYH--MADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLVKSQEN

Query:  MKCTVNIQNKEYGACVPLPPGSRLTTIADIEITTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDM
         KC+VNIQN E GACV  PPGSRLTT+ADIE+ T+DVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRER Q RRCRVHDS+TVRFHIQLLSLILKDM
Subjt:  MKCTVNIQNKEYGACVPLPPGSRLTTIADIEITTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDM

Query:  DEEYGYKLRLLSSISSPTNDRSSWLLALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEK
        DEE        S ISSPTND++SWLLALKKCISASPFKLNDLK D+FDGGD CYDDLD SKKLRLLTY+CDEALNTTKLR WIEEQNTNF+EEQKE+KEK
Subjt:  DEEYGYKLRLLSSISSPTNDRSSWLLALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEK

Query:  LAALKDKEKQAKHKLQDELAKALIAKNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSEATRTVPIVLDVNGRVFWKLRGFSGEGNILLQ
        LAALKDKEKQAK+K++DELAKALIAKNG+PLSIAEHDAIVS+IK DV+EAQAERLVALELASKRRRSS+ATRTVPI+LDVNGRVFWKLRGF+GEGNILLQ
Subjt:  LAALKDKEKQAKHKLQDELAKALIAKNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSEATRTVPIVLDVNGRVFWKLRGFSGEGNILLQ

Query:  DMESWEAVNPSEKWSMYKGEQKQEIEKYISSLRLKRPRLVEVAQTLPGGGIEAASVC
        D+ESWE+VNPSEKW MYK EQKQEIEKYI+S RLKRP+LVE+AQTLPGGG E ASVC
Subjt:  DMESWEAVNPSEKWSMYKGEQKQEIEKYISSLRLKRPRLVEVAQTLPGGGIEAASVC

XP_038874931.1 uncharacterized protein LOC120067431 isoform X3 [Benincasa hispida]8.3e-23980.97Show/hide
Query:  CMLFHLQRSHAFQQESVMISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVDSKCSKKSSPTKSSSEEISKKQTEANGTNES
        C +    +  +  +ES+M +PRKQGKENSLNGNNESNLNLQ+QTP SDKKKLK+MK EELKEIC GNKVD K SKKSSPTKSS EEISK+QTEANG N+S
Subjt:  CMLFHLQRSHAFQQESVMISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVDSKCSKKSSPTKSSSEEISKKQTEANGTNES

Query:  LPSKKKGSKKRTSKDAASDVSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYH--MADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLVKSQEN
        LPSKKKGS K  SKDAASDV + KDARE+NS CHEDAKAPDAV+EE+++ SK VSYH   A+DT E K+LDIHKYAKTS+DAK NK KV DKPL KS+EN
Subjt:  LPSKKKGSKKRTSKDAASDVSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYH--MADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLVKSQEN

Query:  MKCTVNIQNKEYGACVPLPPGSRLTTIADIEITTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDM
         KC+VNIQN E GACV  PPGSRLTT+ADIE+ T+DVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRER Q RRCRVHDS+TVRFHIQLLSLILKDM
Subjt:  MKCTVNIQNKEYGACVPLPPGSRLTTIADIEITTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDM

Query:  DEEYGYKLRLLSSISSPTNDRSSWLLALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEK
        DEE        S ISSPTND++SWLLALKKCISASPFKLNDLK D+FDGGD CYDDLD SKKLRLLTY+CDEALNTTKLR WIEEQNTNF+EEQKE+KEK
Subjt:  DEEYGYKLRLLSSISSPTNDRSSWLLALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEK

Query:  LAALKDKEKQAKHKLQDELAKALIAKNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSEATRTVPIVLDVNGRVFWKLRGFSGEGNILLQ
        LAALKDKEKQAK+K++DELAKALIAKNG+PLSIAEHDAIVS+IK DV+EAQAERLVALELASKRRRSS+ATRTVPI+LDVNGRVFWKLRGF+GEGNILLQ
Subjt:  LAALKDKEKQAKHKLQDELAKALIAKNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSEATRTVPIVLDVNGRVFWKLRGFSGEGNILLQ

Query:  DMESWEAVNPSEKWSMYKGEQKQEIEKYISSLRLKRPRLVEVAQTLPGGGIEAASVC
        D+ESWE+VNPSEKW MYK EQKQEIEKYI+S RLKRP+LVE+AQTLPGGG E ASVC
Subjt:  DMESWEAVNPSEKWSMYKGEQKQEIEKYISSLRLKRPRLVEVAQTLPGGGIEAASVC

TrEMBL top hitse value%identityAlignment
A0A5A7UDV8 Zf-4CXXC_R1 domain-containing protein7.4e-23380.37Show/hide
Query:  QESVMISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVD-SKCSKKSSPTKSSSEEISKKQTEANGTNESLPSKKKGSKKRT
        +ESVM++PRKQGKENSLNGNNESNLNLQ QTP  DKKKLKEMKREELKEICN NKVD +K SKKSSPTKSS  EI K+QTEANG N+SLPSKKKGS+K T
Subjt:  QESVMISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVD-SKCSKKSSPTKSSSEEISKKQTEANGTNESLPSKKKGSKKRT

Query:  SKDAASDVSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYHM--ADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLVKSQENMKCTVNIQNKEY
        SKDAASDV   KDARE+NS CHE+AK  D ++EE+++ SK V YH+  A+D  E K+L+ HKYAKTS+D K NK KV DKP  KS+EN KC+VNIQNKE+
Subjt:  SKDAASDVSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYHM--ADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLVKSQENMKCTVNIQNKEY

Query:  GACVPLPPGSRLTTIADIEITTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEEYGYKLRLLS
        GA VP PPGSRLTT+ADIE+TT+DVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRER QRRRCRVHDS+TVRFHIQLLSLILKDMDEE        S
Subjt:  GACVPLPPGSRLTTIADIEITTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEEYGYKLRLLS

Query:  SISSPTNDRSSWLLALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLAALKDKEKQAK
        +I SPTNDRSSWLLALKKCISASPFKLNDLKPDYFDGGDNCYDDL FSKKLRLLTY+CDEALNTTKLR+WIE+QN+NFVEEQKE+KEKLAALKDKEKQAK
Subjt:  SISSPTNDRSSWLLALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLAALKDKEKQAK

Query:  HKLQDELAKALIAKNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSEATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSE
         KL+DELAKALI KNG+PLSIAEHDAI+SQIK DVAEAQAERLVALELASKRR  S ATRT P++LDVNGRVFWKLRGF+ +GNILLQDMESW +VNPSE
Subjt:  HKLQDELAKALIAKNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSEATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSE

Query:  KWSMYKGEQKQEIEKYISSLRLKRPRLVEVAQTLPGGGIEAASVC
        KW MYK EQKQEIEKYISSLR KR +L E+ QTLPGGG E AS C
Subjt:  KWSMYKGEQKQEIEKYISSLRLKRPRLVEVAQTLPGGGIEAASVC

A0A6J1DLC3 uncharacterized protein LOC111022193 isoform X29.6e-21777.12Show/hide
Query:  SVMISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVDSKCSKKSSPTKSSSEEISKKQTEANGTNESLPSKKKGSKKRTSKD
        S + SP+KQ  EN            + QTP SDKKKLKEMK EEL+EICNGNKVD KCS KSSPTKSS EE SK+QTEANG N SLPSKKKGSKK+TSKD
Subjt:  SVMISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVDSKCSKKSSPTKSSSEEISKKQTEANGTNESLPSKKKGSKKRTSKD

Query:  AASDVSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYHM--ADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLVKSQENMKCTVNIQNKEYGAC
        AASDVSR KDARE+NS CHEDAKAPDA+KEE++++SK VS H+  A+ T E   LDIHK      DA  NK +VKDKPL  SQENMKCTVN  NKE+ AC
Subjt:  AASDVSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYHM--ADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLVKSQENMKCTVNIQNKEYGAC

Query:  VPLPPGSRLTTIADIEITTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEEYGYKLRLLSSIS
        V LP GSRLTT+A++E+TTEDVGHALQFLEFCAAFGKALNL+KGHAESVLKDLMR+RTQRRR RV+DS+TVRFHIQLLSLILKDMDEE        S+IS
Subjt:  VPLPPGSRLTTIADIEITTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEEYGYKLRLLSSIS

Query:  SPTNDRSSWLLALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLAALKDKEKQAKHKL
        SPTND SSWLLALKKCISAS FKL+DLKPDYFD GD+CYDDLDFS+KLRLLTY+CDEALNTTKLRNWI+EQN NFVEEQKE KEKLAALKDKEKQAKHKL
Subjt:  SPTNDRSSWLLALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLAALKDKEKQAKHKL

Query:  QDELAKALIAKNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSEATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEKWS
        +DELAKALIAKNG+PL+IAEHDAIVSQ+K+DVA AQ+ERLV LE+ASKR++ S+ATRTVPI+LD NGRVFW LRGF+GEGNILLQDMESWE VNPSEKWS
Subjt:  QDELAKALIAKNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSEATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEKWS

Query:  MYKGEQKQEIEKYISSLRLKRPRLVEVAQTLPGGGIEAASVC
        MYKGEQKQEIE+YISSLR+KR RLV+ AQTLP G   AAS+C
Subjt:  MYKGEQKQEIEKYISSLRLKRPRLVEVAQTLPGGGIEAASVC

A0A6J1DP35 uncharacterized protein LOC111022193 isoform X14.9e-22980.15Show/hide
Query:  QESVMISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVDSKCSKKSSPTKSSSEEISKKQTEANGTNESLPSKKKGSKKRTS
        +E VMISPRKQGKEN LNGNNESNLNLQ QTP SDKKKLKEMK EEL+EICNGNKVD KCS KSSPTKSS EE SK+QTEANG N SLPSKKKGSKK+TS
Subjt:  QESVMISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVDSKCSKKSSPTKSSSEEISKKQTEANGTNESLPSKKKGSKKRTS

Query:  KDAASDVSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYHM--ADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLVKSQENMKCTVNIQNKEYG
        KDAASDVSR KDARE+NS CHEDAKAPDA+KEE++++SK VS H+  A+ T E   LDIHK      DA  NK +VKDKPL  SQENMKCTVN  NKE+ 
Subjt:  KDAASDVSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYHM--ADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLVKSQENMKCTVNIQNKEYG

Query:  ACVPLPPGSRLTTIADIEITTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEEYGYKLRLLSS
        ACV LP GSRLTT+A++E+TTEDVGHALQFLEFCAAFGKALNL+KGHAESVLKDLMR+RTQRRR RV+DS+TVRFHIQLLSLILKDMDEE        S+
Subjt:  ACVPLPPGSRLTTIADIEITTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEEYGYKLRLLSS

Query:  ISSPTNDRSSWLLALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLAALKDKEKQAKH
        ISSPTND SSWLLALKKCISAS FKL+DLKPDYFD GD+CYDDLDFS+KLRLLTY+CDEALNTTKLRNWI+EQN NFVEEQKE KEKLAALKDKEKQAKH
Subjt:  ISSPTNDRSSWLLALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLAALKDKEKQAKH

Query:  KLQDELAKALIAKNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSEATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEK
        KL+DELAKALIAKNG+PL+IAEHDAIVSQ+K+DVA AQ+ERLV LE+ASKR++ S+ATRTVPI+LD NGRVFW LRGF+GEGNILLQDMESWE VNPSEK
Subjt:  KLQDELAKALIAKNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSEATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEK

Query:  WSMYKGEQKQEIEKYISSLRLKRPRLVEVAQTLPGGGIEAASVC
        WSMYKGEQKQEIE+YISSLR+KR RLV+ AQTLP G   AAS+C
Subjt:  WSMYKGEQKQEIEKYISSLRLKRPRLVEVAQTLPGGGIEAASVC

A0A6J1G959 uncharacterized protein LOC111452074 isoform X14.8e-20874.64Show/hide
Query:  HAFQQE-SVMISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVDSKCSKKSSPTKSSSEEISKKQTEANGTNESLPSKKKGS
        HA + E SVMISPR +GKENSLNGNNESNLNLQ QTP S+KK+LK+MKR+ELKEI NGNKVD KC           EE  KKQTEANGTNE L +KKK S
Subjt:  HAFQQE-SVMISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVDSKCSKKSSPTKSSSEEISKKQTEANGTNESLPSKKKGS

Query:  KKRTSKDAASDVSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYHMADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLVKSQENMKCTVNIQNK
        KKR S+DA           ERNS CHEDA  P            V++   A+++TE KDLDIHKYAKTSEDAK  + K+K+KP           +NIQNK
Subjt:  KKRTSKDAASDVSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYHMADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLVKSQENMKCTVNIQNK

Query:  EYGACVPLPPGSRLTTIADIEITTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEEYGYKLRL
        E+GACVPLPP S+LTT+ADIE+TTEDVGHALQFLEFCAAFGKALNLKKGH   VLKDL RERT RR  RVHDS+TVRFHIQLLSLIL+DMDEE       
Subjt:  EYGACVPLPPGSRLTTIADIEITTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEEYGYKLRL

Query:  LSSISSPTNDRSSWLLALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLAALKDKEKQ
         S+ISSPTND SSWLL LKKCISAS FK+NDLKPDYFDGGDNCYDDLDFSKKLRLLTY+CDEALNTTKLRNWIEEQN NFVE+QKE++EKL+ALKDKEKQ
Subjt:  LSSISSPTNDRSSWLLALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLAALKDKEKQ

Query:  AKHKLQDELAKALIAKNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSEATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNP
        AK+KL+DE AKALIAKNGLPLSIAEHDAIV+QIK+DVAE QAE+LVALELAS +++ SEATRTVPI+LDVNGRVFWKLRGF+GEGNILLQDMESWE+VNP
Subjt:  AKHKLQDELAKALIAKNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSEATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNP

Query:  SEKWSMYKGEQKQEIEKYISSLRLKR-PRLVEVAQTLPGGGIEAASVC
        SEKWSM+K EQKQEIEKYISSLRLKR PRLVEV QTLP GGIEAASVC
Subjt:  SEKWSMYKGEQKQEIEKYISSLRLKR-PRLVEVAQTLPGGGIEAASVC

A0A6J1KD89 uncharacterized protein LOC111492807 isoform X18.8e-20272.91Show/hide
Query:  HAFQQE-SVMISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVDSKCSKKSSPTKSSSEEISKKQTEANGTNESLPSKKKGS
        HA + E SVMIS R +GKENSLNGNN+SNLNLQ QTP S+KK+LK+MKR+ LKEI NGNKVD KC           EE  KKQTEANGTNE L +KKK S
Subjt:  HAFQQE-SVMISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVDSKCSKKSSPTKSSSEEISKKQTEANGTNESLPSKKKGS

Query:  KKRTSKDAASDVSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYHM--ADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLVKSQENMKCTVNIQ
        KKR S+DA           ERNS CHEDA                 S H+  A+++ E  DLDIHKYAKTSEDAK  + K+K+KP           +NIQ
Subjt:  KKRTSKDAASDVSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYHM--ADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLVKSQENMKCTVNIQ

Query:  NKEYGACVPLPPGSRLTTIADIEITTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEEYGYKL
        NKE+GACV LPP S+LTT+ADIE+TTEDVGHALQFLEFCAAFGKALNLKKGH   VLKDL RERT RR  RVHDS+TVRFHIQLLSLIL+DMDEE     
Subjt:  NKEYGACVPLPPGSRLTTIADIEITTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEEYGYKL

Query:  RLLSSISSPTNDRSSWLLALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLAALKDKE
           S+ISSPTND SSWLL LKKCISAS FK+NDLKPDYF GGDNCYDDLDFSKKLRLLTY+CDEALNTTKLRNWIEEQN NFVE+QKE++EKL+ALKDKE
Subjt:  RLLSSISSPTNDRSSWLLALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLAALKDKE

Query:  KQAKHKLQDELAKALIAKNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSEATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAV
        K+AK+KL+DE AKALIAKNGLPLSIAEHDAIV+QIK+DVAE QAE+LVALELAS +++ SEATRTVPI+LDVNGRVFWKLRGF+GEGNILLQDMESWE+V
Subjt:  KQAKHKLQDELAKALIAKNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSEATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAV

Query:  NPSEKWSMYKGEQKQEIEKYISSLRLKR-PRLVEVAQTLPGGGIEAASVC
        NPSEKWSM+K EQKQEIEKYISSLRLKR PRLVEV QTLP GGIEAASVC
Subjt:  NPSEKWSMYKGEQKQEIEKYISSLRLKR-PRLVEVAQTLPGGGIEAASVC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G67270.1 Zinc-finger domain of monoamine-oxidase A repressor R1 protein2.0e-5740.34Show/hide
Query:  KCNKKVKDK-PLVKSQENMKCTVNIQNKEYGACVPLPPGSRLTTIADIEITTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLM---RERTQRRRCR
        K  KKV DK    K  + +K  + ++         LP G  LT ++ I+I TE+ G+  Q  EFC+AFGKAL LK+GHAE+++++L    R   +++ C 
Subjt:  KCNKKVKDK-PLVKSQENMKCTVNIQNKEYGACVPLPPGSRLTTIADIEITTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLM---RERTQRRRCR

Query:  VHDSVTVRFHIQLLSLILKDMDEEYGYKLRLLSSISSPTNDRSSWLLALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKL
        +      +  IQLL LI KD +           S+S    D SSW  A+ + +S S    ++L  + F GG   Y+ ++ S+KL+LL ++CDE+L+T  +
Subjt:  VHDSVTVRFHIQLLSLILKDMDEEYGYKLRLLSSISSPTNDRSSWLLALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKL

Query:  RNWIEEQNTNFVEEQKELKEKLAALKDKEKQAKHKLQDELAKALIAKNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSEATRTVPIVLD
        RN+I  Q     E +KE K+K AA K KEKQ K K+Q E+AK+++ KNG PLSI EH++IVSQI+ +  EA  E + A  + SK     +A RT PI+LD
Subjt:  RNWIEEQNTNFVEEQKELKEKLAALKDKEKQAKHKLQDELAKALIAKNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSEATRTVPIVLD

Query:  VNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEKWSMYKGEQKQEIEKYISSLRLKR
         NG V WKL+ F  E   LLQD+ +++ + P E+W  +K EQK EIE  IS +R K+
Subjt:  VNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEKWSMYKGEQKQEIEKYISSLRLKR

AT1G67780.1 Zinc-finger domain of monoamine-oxidase A repressor R1 protein1.2e-5436.29Show/hide
Query:  DDTTEGKDLDIHKYAKTSEDAKCNKKVKDKPLVKSQENMKCTVNIQNKEYGACVPLPPGSRLTTIADIEITTEDVGHALQFLEFCAAFGKALNLKKGHAE
        DD+ EG   +    + T  D +     KDK  V  +       +   +E      LP G  L +++ + I TE+ G+  Q  EFC+AFGKAL LK+G AE
Subjt:  DDTTEGKDLDIHKYAKTSEDAKCNKKVKDKPLVKSQENMKCTVNIQNKEYGACVPLPPGSRLTTIADIEITTEDVGHALQFLEFCAAFGKALNLKKGHAE

Query:  SVLKDLM---RERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEEYGYKLRLLSSISSPTNDRSSWLLALKKCISASPFKLNDLKPDYFDGGDNCYDDLDF
        +V+++L    R   +++ C +     ++  IQLL LI KD +           S+S   +D  +W  AL + +  S    ++  P+ F+ G   Y+ +D 
Subjt:  SVLKDLM---RERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEEYGYKLRLLSSISSPTNDRSSWLLALKKCISASPFKLNDLKPDYFDGGDNCYDDLDF

Query:  SKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLAALKDKEKQAKHKLQDELAKALIAKNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALE
        S++L+LL ++CDE+L+T  +RN I+ Q+T       E K K AA K+KEKQ K KLQ +LAKA++ KNG PLSI EH+ I+SQI+ +  EA    + A  
Subjt:  SKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLAALKDKEKQAKHKLQDELAKALIAKNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALE

Query:  LASKRRRSSEATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEKWSMYKGEQKQEIEKYISSLRLKRPRLVEVAQTLPGGGIEA
        + S   R  +A RT PI+++ NG V WKL  +  E   LLQD+ +++ +   EKW  +K EQK +IE YIS  R K      V QTL    + A
Subjt:  LASKRRRSSEATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEKWSMYKGEQKQEIEKYISSLRLKRPRLVEVAQTLPGGGIEA

AT5G38690.1 Zinc-finger domain of monoamine-oxidase A repressor R1 protein1.3e-6736.22Show/hide
Query:  MKREELKEICNGNKVDSKCSKKSSPTKSSSEEISKKQTEANGTNESLP---SKKKGSKKRTSKDAASDVSRLK-DARERNSFCHEDAKAPDAVKEEE---
        +K++ +   C GN   S C KK     +     + K+T  +  +E L    S K    K+   +    VS LK D        H   K     K EE   
Subjt:  MKREELKEICNGNKVDSKCSKKSSPTKSSSEEISKKQTEANGTNESLP---SKKKGSKKRTSKDAASDVSRLK-DARERNSFCHEDAKAPDAVKEEE---

Query:  -----------QKFSKVVSYHMADDTTEGKDLDIHKYAKTSEDAKCNKKVKDKPLVKSQENMKCTVNIQNKEYGACVPLPPGSRLTTIADIEITTEDVGH
                    K S      ++D      ++     A+  +     K +K+  + +  + +K  +  + +E    + +P G+   T++ I++  ED G+
Subjt:  -----------QKFSKVVSYHMADDTTEGKDLDIHKYAKTSEDAKCNKKVKDKPLVKSQENMKCTVNIQNKEYGACVPLPPGSRLTTIADIEITTEDVGH

Query:  ALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEEYGYKLRLLSSISSPTNDRSSWLLALKKCISASPFKL
          QFLEFC+AFGKAL+L+KG AE V+++++  R++RR+     S   +  IQLL++IL+D  E         +S+     D  SW   + +C+S S  KL
Subjt:  ALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEEYGYKLRLLSSISSPTNDRSSWLLALKKCISASPFKL

Query:  NDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLAALKDKEKQAKHKLQDELAKALIAKNGLPLSIAEHDAI
        +D  P+ F+ G + Y+ L+ SK+L+LL ++CDE L T  +RN I+ QN   VE +KE KEK+ A KDKEKQ K KLQDELA+A+ AKNG+PL I EHDAI
Subjt:  NDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLAALKDKEKQAKHKLQDELAKALIAKNGLPLSIAEHDAI

Query:  VSQIKKDVAEAQAERLVALELASKRRR-SSEATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEKWSMYKGEQKQEIEKYISSLRLKRPR
        VS+I  +  E  +E   A+++ SK+ + S +A RT P+ LD NG +FW+L+ ++ E NILLQD+ SW  V P EKW  +  EQK EIEKYIS +R+KR +
Subjt:  VSQIKKDVAEAQAERLVALELASKRRR-SSEATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEKWSMYKGEQKQEIEKYISSLRLKRPR

Query:  LVEVAQTL
          + A T+
Subjt:  LVEVAQTL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGCAATTCGGGCGACAAATGATCCAGGTAGAACAAAACCCTCCAAGAAGTGGCAAACTGACAGAGTGAGCTCATCGGATGGGGTGAATCAACAAAAGACACCCAA
CAGAGATGGTGCAAATGATGGGATCGCGTCGAAGGGTCATGAATGGTTGAAAGTACAACCTCATGGTTGTAGGTTGTGTGGTTGGAATCTAGGGCATCTAGGTCTGAGTC
AATCACCATTGGAAGCGGCGGTGTACAATGACAGCGGAGGAAACAATGGTGGCTGGTTTGGTTGCCATGCCTGTATGCTCTTTCATTTACAAAGATCACATGCTTTTCAA
CAGGAGTCTGTCATGATTTCACCAAGGAAACAAGGGAAGGAAAATTCTTTAAATGGAAATAATGAATCGAACTTGAATTTGCAGACTCAAACTCCAACTTCTGATAAAAA
GAAGTTGAAGGAAATGAAGCGTGAAGAATTGAAAGAAATATGCAACGGAAATAAGGTTGATAGTAAATGCTCAAAGAAGAGCAGTCCAACGAAATCCAGCTCAGAGGAAA
TTTCTAAGAAACAAACTGAAGCAAATGGGACGAACGAATCTCTTCCCTCAAAGAAAAAGGGCTCCAAAAAAAGGACTTCTAAGGATGCTGCCTCTGATGTTAGCAGACTC
AAGGATGCTAGAGAAAGAAACAGTTTCTGTCATGAGGATGCCAAAGCACCAGATGCTGTGAAAGAGGAAGAACAGAAGTTCTCAAAGGTTGTTTCTTACCATATGGCTGA
TGATACAACAGAAGGAAAAGATTTGGATATTCACAAATATGCCAAGACATCAGAAGATGCAAAATGTAACAAAAAGGTGAAAGACAAGCCTCTAGTGAAGTCCCAGGAAA
ATATGAAATGCACTGTGAATATTCAGAACAAGGAATATGGTGCCTGTGTTCCTTTGCCTCCAGGCTCAAGGCTAACAACTATAGCAGATATTGAAATCACCACAGAAGAT
GTTGGTCATGCATTGCAGTTTTTAGAATTCTGTGCAGCTTTTGGAAAGGCTCTTAATTTAAAGAAAGGGCATGCTGAGTCTGTACTCAAAGATCTAATGCGGGAGAGAAC
TCAAAGAAGAAGGTGTCGAGTTCACGATTCAGTGACTGTTCGATTTCATATTCAACTTCTGTCTCTGATATTGAAGGATATGGATGAAGAGTATGGCTATAAGCTGAGGC
TTTTGTCTTCCATCTCTAGTCCCACAAATGACAGAAGTTCATGGTTGCTGGCTTTGAAGAAATGCATTTCTGCATCCCCATTTAAGTTGAATGATCTGAAACCAGATTAC
TTTGATGGAGGTGACAATTGTTATGATGACTTAGACTTCTCGAAAAAGCTCAGACTATTGACTTACATATGTGATGAGGCTCTCAATACTACAAAATTGAGAAACTGGAT
TGAAGAACAAAATACTAACTTTGTGGAGGAACAAAAGGAACTTAAAGAAAAACTTGCTGCACTGAAGGATAAGGAGAAACAAGCTAAACACAAATTGCAAGATGAGTTGG
CCAAGGCTCTTATTGCAAAGAATGGTCTTCCCCTTTCAATTGCAGAGCATGATGCTATTGTCTCGCAAATAAAAAAAGACGTAGCTGAAGCTCAAGCTGAGAGGCTTGTT
GCATTGGAATTGGCATCGAAGAGAAGACGAAGTTCAGAAGCTACTAGGACAGTTCCCATCGTTTTGGATGTTAATGGTCGTGTATTTTGGAAATTAAGAGGCTTTTCTGG
TGAAGGGAATATCCTGCTACAAGATATGGAAAGCTGGGAAGCAGTCAATCCAAGCGAAAAGTGGTCTATGTATAAAGGCGAGCAGAAACAAGAGATAGAAAAATACATTT
CTTCTTTAAGGTTGAAGAGGCCTAGGTTAGTGGAAGTAGCTCAAACTCTTCCAGGTGGAGGTATTGAGGCAGCTTCAGTATGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTGGCAATTCGGGCGACAAATGATCCAGGTAGAACAAAACCCTCCAAGAAGTGGCAAACTGACAGAGTGAGCTCATCGGATGGGGTGAATCAACAAAAGACACCCAA
CAGAGATGGTGCAAATGATGGGATCGCGTCGAAGGGTCATGAATGGTTGAAAGTACAACCTCATGGTTGTAGGTTGTGTGGTTGGAATCTAGGGCATCTAGGTCTGAGTC
AATCACCATTGGAAGCGGCGGTGTACAATGACAGCGGAGGAAACAATGGTGGCTGGTTTGGTTGCCATGCCTGTATGCTCTTTCATTTACAAAGATCACATGCTTTTCAA
CAGGAGTCTGTCATGATTTCACCAAGGAAACAAGGGAAGGAAAATTCTTTAAATGGAAATAATGAATCGAACTTGAATTTGCAGACTCAAACTCCAACTTCTGATAAAAA
GAAGTTGAAGGAAATGAAGCGTGAAGAATTGAAAGAAATATGCAACGGAAATAAGGTTGATAGTAAATGCTCAAAGAAGAGCAGTCCAACGAAATCCAGCTCAGAGGAAA
TTTCTAAGAAACAAACTGAAGCAAATGGGACGAACGAATCTCTTCCCTCAAAGAAAAAGGGCTCCAAAAAAAGGACTTCTAAGGATGCTGCCTCTGATGTTAGCAGACTC
AAGGATGCTAGAGAAAGAAACAGTTTCTGTCATGAGGATGCCAAAGCACCAGATGCTGTGAAAGAGGAAGAACAGAAGTTCTCAAAGGTTGTTTCTTACCATATGGCTGA
TGATACAACAGAAGGAAAAGATTTGGATATTCACAAATATGCCAAGACATCAGAAGATGCAAAATGTAACAAAAAGGTGAAAGACAAGCCTCTAGTGAAGTCCCAGGAAA
ATATGAAATGCACTGTGAATATTCAGAACAAGGAATATGGTGCCTGTGTTCCTTTGCCTCCAGGCTCAAGGCTAACAACTATAGCAGATATTGAAATCACCACAGAAGAT
GTTGGTCATGCATTGCAGTTTTTAGAATTCTGTGCAGCTTTTGGAAAGGCTCTTAATTTAAAGAAAGGGCATGCTGAGTCTGTACTCAAAGATCTAATGCGGGAGAGAAC
TCAAAGAAGAAGGTGTCGAGTTCACGATTCAGTGACTGTTCGATTTCATATTCAACTTCTGTCTCTGATATTGAAGGATATGGATGAAGAGTATGGCTATAAGCTGAGGC
TTTTGTCTTCCATCTCTAGTCCCACAAATGACAGAAGTTCATGGTTGCTGGCTTTGAAGAAATGCATTTCTGCATCCCCATTTAAGTTGAATGATCTGAAACCAGATTAC
TTTGATGGAGGTGACAATTGTTATGATGACTTAGACTTCTCGAAAAAGCTCAGACTATTGACTTACATATGTGATGAGGCTCTCAATACTACAAAATTGAGAAACTGGAT
TGAAGAACAAAATACTAACTTTGTGGAGGAACAAAAGGAACTTAAAGAAAAACTTGCTGCACTGAAGGATAAGGAGAAACAAGCTAAACACAAATTGCAAGATGAGTTGG
CCAAGGCTCTTATTGCAAAGAATGGTCTTCCCCTTTCAATTGCAGAGCATGATGCTATTGTCTCGCAAATAAAAAAAGACGTAGCTGAAGCTCAAGCTGAGAGGCTTGTT
GCATTGGAATTGGCATCGAAGAGAAGACGAAGTTCAGAAGCTACTAGGACAGTTCCCATCGTTTTGGATGTTAATGGTCGTGTATTTTGGAAATTAAGAGGCTTTTCTGG
TGAAGGGAATATCCTGCTACAAGATATGGAAAGCTGGGAAGCAGTCAATCCAAGCGAAAAGTGGTCTATGTATAAAGGCGAGCAGAAACAAGAGATAGAAAAATACATTT
CTTCTTTAAGGTTGAAGAGGCCTAGGTTAGTGGAAGTAGCTCAAACTCTTCCAGGTGGAGGTATTGAGGCAGCTTCAGTATGTTGA
Protein sequenceShow/hide protein sequence
MVAIRATNDPGRTKPSKKWQTDRVSSSDGVNQQKTPNRDGANDGIASKGHEWLKVQPHGCRLCGWNLGHLGLSQSPLEAAVYNDSGGNNGGWFGCHACMLFHLQRSHAFQ
QESVMISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVDSKCSKKSSPTKSSSEEISKKQTEANGTNESLPSKKKGSKKRTSKDAASDVSRL
KDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYHMADDTTEGKDLDIHKYAKTSEDAKCNKKVKDKPLVKSQENMKCTVNIQNKEYGACVPLPPGSRLTTIADIEITTED
VGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEEYGYKLRLLSSISSPTNDRSSWLLALKKCISASPFKLNDLKPDY
FDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLAALKDKEKQAKHKLQDELAKALIAKNGLPLSIAEHDAIVSQIKKDVAEAQAERLV
ALELASKRRRSSEATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEKWSMYKGEQKQEIEKYISSLRLKRPRLVEVAQTLPGGGIEAASVC