CuGenDBv2

Lag0000622 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0000622
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Zf-4CXXC_R1 domain-containing protein
Genome location: chr4:11602426..11607304
RNA-Seq Expression: Lag0000622
Synteny: Lag0000622
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0005737 - cytoplasm (cellular component)
InterPro domains:
IPR018501 - DDT domain
IPR040221 - CDCA7/CDA7L
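
The genome location above is given as a `chromosome:start..end` string. As a rough sketch of how such a string could be split into its parts and turned into a locus length, the following assumes the coordinates are 1-based and inclusive; the page does not state this, and the names `GenomicInterval` and `parse_location` are illustrative, not part of any database API.

```python
import re
from typing import NamedTuple


class GenomicInterval(NamedTuple):
    """A locus on one chromosome (coordinates assumed 1-based, inclusive)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count toward the length.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomicInterval:
    """Parse a 'chrom:start..end' string into a GenomicInterval."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return GenomicInterval(chrom, start, end)


locus = parse_location("chr4:11602426..11607304")
print(locus.chrom, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption, the locus spans 4,879 bp on chr4.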


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAA0053380.1 zf-4CXXC_R1 domain-containing protein [Cucumis melo var. makuwa] | e value: 7.1e-231 | % identity: 81.02
Query:  MISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVD-SKCSKKSSPTKSSSEEISKKQTEANGTNESLPLKKKGSKKRTSKDA
        M++PRKQGKENSLNGNNESNLNLQ QTP  DKKKLKEMKREELKEICN NKVD +K SKKSSPTKSS  EI K+QTEANG N+SLP KKKGS+K TSKDA
Subjt:  MISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVD-SKCSKKSSPTKSSSEEISKKQTEANGTNESLPLKKKGSKKRTSKDA

Query:  ASDVSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYHL--ADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLAKSQENMKCTVNIQNKEYGACV
        ASDV   KDARE+NS CHE+AK  D ++EE+++ SK V YHL  A+D  E K+L+ HKYAKTS+D K NK KV DKP AKS+EN KC+VNIQNKE+GA V
Subjt:  ASDVSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYHL--ADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLAKSQENMKCTVNIQNKEYGACV

Query:  PLPPGSRLTTIADIELTTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEESSISSPTDDRSAW
        P PPGSRLTT+ADIELTT+DVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRER QRRRCRVHDS+TVRFHIQLLSLILKDMDEES+I SPT+DRS+W
Subjt:  PLPPGSRLTTIADIELTTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEESSISSPTDDRSAW

Query:  LLALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLSALKDKEKQAKHKLQDELAKALI
        LLALKKCISASPFKLNDLKPDYFDGGDNCYDDL FSKKLRLLTY+CDEALNTTKLR+WIE+QN+NFVEEQKE+KEKL+ALKDKEKQAK KL+DELAKALI
Subjt:  LLALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLSALKDKEKQAKHKLQDELAKALI

Query:  AKNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSDATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEKWSMYKGEQKQE
         KNG+PLSIAEHDAI+SQIK DVAEAQAERLVALELASKRR  S ATRT P++LDVNGRVFWKLRGF+ +GNILLQDMESW +VNPSEKW MYK EQKQE
Subjt:  AKNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSDATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEKWSMYKGEQKQE

Query:  IEKYISSLRLKRPRLVEVAQTLPGGGQAASIA
        IEKYISSLR KR +L E+ QTLPGGG   + A
Subjt:  IEKYISSLRLKRPRLVEVAQTLPGGGQAASIA

XP_011653347.1 uncharacterized protein LOC101206502 isoform X1 [Cucumis sativus] | e value: 1.5e-225 | % identity: 80.41
Query:  MISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVDSKCSKKSSPTKSSSEEISKKQTEANGTNESLPLKKKGSKKRTSKDAA
        MI+PRKQGKENSLNGNNESNLNLQ QTP  D+KKLKEMKREELKEICN NKVD+K SKKSS TKSS  EISK+QTEANG N+SLP KKKG +K TSKDAA
Subjt:  MISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVDSKCSKKSSPTKSSSEEISKKQTEANGTNESLPLKKKGSKKRTSKDAA

Query:  SDVSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYHL--ADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLAKSQENMKCTVNIQNKEYGACVP
        SDVS  KDARE+NS  HE+AKA D  +EE+++ SK V YHL  A D  E K+L IHKYA TS+D K NK KV DKP AKSQEN KC+VNIQNKE+GA VP
Subjt:  SDVSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYHL--ADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLAKSQENMKCTVNIQNKEYGACVP

Query:  LPPGSRLTTIADIELTTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEESSISSPTDDRSAWL
          PG RLTT+ADIELTT+DVGHALQFLEFCAAFGKALN+KKG+AESVLKDLMRER Q RRCRVHDS+TVRFHIQLLSLILKDMDEES+I SPT+DRS+WL
Subjt:  LPPGSRLTTIADIELTTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEESSISSPTDDRSAWL

Query:  LALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLSALKDKEKQAKHKLQDELAKALIA
        LALKKCISASPFK NDLKPDYFDGGDNCYDDLDFSKKLRLLTY+CDEALNTTKLR+WIE+QN+NF+EEQKE+KEKL+ALKDKEKQAK KLQDELAKALIA
Subjt:  LALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLSALKDKEKQAKHKLQDELAKALIA

Query:  KNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSDATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEKWSMYKGEQKQEI
        KNG+PLSIAE+DAI+SQIK DVAEAQAERL ALELASKRR+ S ATRTVP++LDVNGRVFWKLRGF+ EGNILLQDMESW + NPSEKW +YK EQKQEI
Subjt:  KNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSDATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEKWSMYKGEQKQEI

Query:  EKYISSLRLKRPRLVEVAQTLPGGGQAASIA
        EKYISSL  KRP+LVE  QTLPGGG   + A
Subjt:  EKYISSLRLKRPRLVEVAQTLPGGGQAASIA

XP_022155054.1 uncharacterized protein LOC111022193 isoform X1 [Momordica charantia] | e value: 5.6e-228 | % identity: 81.29
Query:  MISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVDSKCSKKSSPTKSSSEEISKKQTEANGTNESLPLKKKGSKKRTSKDAA
        MISPRKQGKEN LNGNNESNLNLQ QTP SDKKKLKEMK EEL+EICNGNKVD KCS KSSPTKSS EE SK+QTEANG N SLP KKKGSKK+TSKDAA
Subjt:  MISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVDSKCSKKSSPTKSSSEEISKKQTEANGTNESLPLKKKGSKKRTSKDAA

Query:  SDVSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYHL--ADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLAKSQENMKCTVNIQNKEYGACVP
        SDVSR KDARE+NS CHEDAKAPDA+KEE++++SK VS HL  A+ T E   LDIHK      DA  NK +VKDKPL  SQENMKCTVN  NKE+ ACV 
Subjt:  SDVSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYHL--ADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLAKSQENMKCTVNIQNKEYGACVP

Query:  LPPGSRLTTIADIELTTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEESSISSPTDDRSAWL
        LP GSRLTT+A++ELTTEDVGHALQFLEFCAAFGKALNL+KGHAESVLKDLMR+RTQRRR RV+DS+TVRFHIQLLSLILKDMDEES+ISSPT+D S+WL
Subjt:  LPPGSRLTTIADIELTTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEESSISSPTDDRSAWL

Query:  LALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLSALKDKEKQAKHKLQDELAKALIA
        LALKKCISAS FKL+DLKPDYFD GD+CYDDLDFS+KLRLLTY+CDEALNTTKLRNWI+EQN NFVEEQKE KEKL+ALKDKEKQAKHKL+DELAKALIA
Subjt:  LALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLSALKDKEKQAKHKLQDELAKALIA

Query:  KNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSDATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEKWSMYKGEQKQEI
        KNG+PL+IAEHDAIVSQ+K+DVA AQ+ERLV LE+ASKR++ SDATRTVPI+LD NGRVFW LRGF+GEGNILLQDMESWE VNPSEKWSMYKGEQKQEI
Subjt:  KNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSDATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEKWSMYKGEQKQEI

Query:  EKYISSLRLKRPRLVEVAQTLPGGGQAAS
        E+YISSLR+KR RLV+ AQTLP G   A+
Subjt:  EKYISSLRLKRPRLVEVAQTLPGGGQAAS

XP_038874924.1 uncharacterized protein LOC120067431 isoform X1 [Benincasa hispida] | e value: 1.1e-236 | % identity: 83.43
Query:  MISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVDSKCSKKSSPTKSSSEEISKKQTEANGTNESLPLKKKGSKKRTSKDAA
        M +PRKQGKENSLNGNNESNLNLQ+QTP SDKKKLK+MK EELKEIC GNKVD K SKKSSPTKSS EEISK+QTEANG N+SLP KKKGS K  SKDAA
Subjt:  MISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVDSKCSKKSSPTKSSSEEISKKQTEANGTNESLPLKKKGSKKRTSKDAA

Query:  SDVSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYH--LADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLAKSQENMKCTVNIQNKEYGACVP
        SDV + KDARE+NS CHEDAKAPDAV+EE+++ SK VSYH   A+DT E K+LDIHKYAKTS+DAK NK KV DKPL KS+EN KC+VNIQN E GACV 
Subjt:  SDVSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYH--LADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLAKSQENMKCTVNIQNKEYGACVP

Query:  LPPGSRLTTIADIELTTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEESSISSPTDDRSAWL
         PPGSRLTT+ADIEL T+DVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRER Q RRCRVHDS+TVRFHIQLLSLILKDMDEES ISSPT+D+++WL
Subjt:  LPPGSRLTTIADIELTTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEESSISSPTDDRSAWL

Query:  LALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLSALKDKEKQAKHKLQDELAKALIA
        LALKKCISASPFKLNDLK D+FDGGD CYDDLD SKKLRLLTY+CDEALNTTKLR WIEEQNTNF+EEQKE+KEKL+ALKDKEKQAK+K++DELAKALIA
Subjt:  LALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLSALKDKEKQAKHKLQDELAKALIA

Query:  KNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSDATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEKWSMYKGEQKQEI
        KNG+PLSIAEHDAIVS+IK DV+EAQAERLVALELASKRRRSSDATRTVPI+LDVNGRVFWKLRGF+GEGNILLQD+ESWE+VNPSEKW MYK EQKQEI
Subjt:  KNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSDATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEKWSMYKGEQKQEI

Query:  EKYISSLRLKRPRLVEVAQTLPGGG-QAASI
        EKYI+S RLKRP+LVE+AQTLPGGG + AS+
Subjt:  EKYISSLRLKRPRLVEVAQTLPGGG-QAASI

XP_038874931.1 uncharacterized protein LOC120067431 isoform X3 [Benincasa hispida] | e value: 1.1e-236 | % identity: 83.43
Query:  MISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVDSKCSKKSSPTKSSSEEISKKQTEANGTNESLPLKKKGSKKRTSKDAA
        M +PRKQGKENSLNGNNESNLNLQ+QTP SDKKKLK+MK EELKEIC GNKVD K SKKSSPTKSS EEISK+QTEANG N+SLP KKKGS K  SKDAA
Subjt:  MISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVDSKCSKKSSPTKSSSEEISKKQTEANGTNESLPLKKKGSKKRTSKDAA

Query:  SDVSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYH--LADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLAKSQENMKCTVNIQNKEYGACVP
        SDV + KDARE+NS CHEDAKAPDAV+EE+++ SK VSYH   A+DT E K+LDIHKYAKTS+DAK NK KV DKPL KS+EN KC+VNIQN E GACV 
Subjt:  SDVSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYH--LADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLAKSQENMKCTVNIQNKEYGACVP

Query:  LPPGSRLTTIADIELTTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEESSISSPTDDRSAWL
         PPGSRLTT+ADIEL T+DVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRER Q RRCRVHDS+TVRFHIQLLSLILKDMDEES ISSPT+D+++WL
Subjt:  LPPGSRLTTIADIELTTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEESSISSPTDDRSAWL

Query:  LALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLSALKDKEKQAKHKLQDELAKALIA
        LALKKCISASPFKLNDLK D+FDGGD CYDDLD SKKLRLLTY+CDEALNTTKLR WIEEQNTNF+EEQKE+KEKL+ALKDKEKQAK+K++DELAKALIA
Subjt:  LALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLSALKDKEKQAKHKLQDELAKALIA

Query:  KNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSDATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEKWSMYKGEQKQEI
        KNG+PLSIAEHDAIVS+IK DV+EAQAERLVALELASKRRRSSDATRTVPI+LDVNGRVFWKLRGF+GEGNILLQD+ESWE+VNPSEKW MYK EQKQEI
Subjt:  KNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSDATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEKWSMYKGEQKQEI

Query:  EKYISSLRLKRPRLVEVAQTLPGGG-QAASI
        EKYI+S RLKRP+LVE+AQTLPGGG + AS+
Subjt:  EKYISSLRLKRPRLVEVAQTLPGGG-QAASI

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A5A7UDV8 Zf-4CXXC_R1 domain-containing protein | e value: 3.4e-231 | % identity: 81.02
Query:  MISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVD-SKCSKKSSPTKSSSEEISKKQTEANGTNESLPLKKKGSKKRTSKDA
        M++PRKQGKENSLNGNNESNLNLQ QTP  DKKKLKEMKREELKEICN NKVD +K SKKSSPTKSS  EI K+QTEANG N+SLP KKKGS+K TSKDA
Subjt:  MISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVD-SKCSKKSSPTKSSSEEISKKQTEANGTNESLPLKKKGSKKRTSKDA

Query:  ASDVSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYHL--ADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLAKSQENMKCTVNIQNKEYGACV
        ASDV   KDARE+NS CHE+AK  D ++EE+++ SK V YHL  A+D  E K+L+ HKYAKTS+D K NK KV DKP AKS+EN KC+VNIQNKE+GA V
Subjt:  ASDVSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYHL--ADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLAKSQENMKCTVNIQNKEYGACV

Query:  PLPPGSRLTTIADIELTTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEESSISSPTDDRSAW
        P PPGSRLTT+ADIELTT+DVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRER QRRRCRVHDS+TVRFHIQLLSLILKDMDEES+I SPT+DRS+W
Subjt:  PLPPGSRLTTIADIELTTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEESSISSPTDDRSAW

Query:  LLALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLSALKDKEKQAKHKLQDELAKALI
        LLALKKCISASPFKLNDLKPDYFDGGDNCYDDL FSKKLRLLTY+CDEALNTTKLR+WIE+QN+NFVEEQKE+KEKL+ALKDKEKQAK KL+DELAKALI
Subjt:  LLALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLSALKDKEKQAKHKLQDELAKALI

Query:  AKNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSDATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEKWSMYKGEQKQE
         KNG+PLSIAEHDAI+SQIK DVAEAQAERLVALELASKRR  S ATRT P++LDVNGRVFWKLRGF+ +GNILLQDMESW +VNPSEKW MYK EQKQE
Subjt:  AKNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSDATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEKWSMYKGEQKQE

Query:  IEKYISSLRLKRPRLVEVAQTLPGGGQAASIA
        IEKYISSLR KR +L E+ QTLPGGG   + A
Subjt:  IEKYISSLRLKRPRLVEVAQTLPGGGQAASIA

A0A6J1DLC3 uncharacterized protein LOC111022193 isoform X2 | e value: 1.1e-216 | % identity: 78.37
Query:  SPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVDSKCSKKSSPTKSSSEEISKKQTEANGTNESLPLKKKGSKKRTSKDAASD
        SP+KQ  EN            + QTP SDKKKLKEMK EEL+EICNGNKVD KCS KSSPTKSS EE SK+QTEANG N SLP KKKGSKK+TSKDAASD
Subjt:  SPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVDSKCSKKSSPTKSSSEEISKKQTEANGTNESLPLKKKGSKKRTSKDAASD

Query:  VSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYHL--ADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLAKSQENMKCTVNIQNKEYGACVPLP
        VSR KDARE+NS CHEDAKAPDA+KEE++++SK VS HL  A+ T E   LDIHK      DA  NK +VKDKPL  SQENMKCTVN  NKE+ ACV LP
Subjt:  VSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYHL--ADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLAKSQENMKCTVNIQNKEYGACVPLP

Query:  PGSRLTTIADIELTTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEESSISSPTDDRSAWLLA
         GSRLTT+A++ELTTEDVGHALQFLEFCAAFGKALNL+KGHAESVLKDLMR+RTQRRR RV+DS+TVRFHIQLLSLILKDMDEES+ISSPT+D S+WLLA
Subjt:  PGSRLTTIADIELTTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEESSISSPTDDRSAWLLA

Query:  LKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLSALKDKEKQAKHKLQDELAKALIAKN
        LKKCISAS FKL+DLKPDYFD GD+CYDDLDFS+KLRLLTY+CDEALNTTKLRNWI+EQN NFVEEQKE KEKL+ALKDKEKQAKHKL+DELAKALIAKN
Subjt:  LKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLSALKDKEKQAKHKLQDELAKALIAKN

Query:  GLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSDATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEKWSMYKGEQKQEIEK
        G+PL+IAEHDAIVSQ+K+DVA AQ+ERLV LE+ASKR++ SDATRTVPI+LD NGRVFW LRGF+GEGNILLQDMESWE VNPSEKWSMYKGEQKQEIE+
Subjt:  GLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSDATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEKWSMYKGEQKQEIEK

Query:  YISSLRLKRPRLVEVAQTLPGGGQAAS
        YISSLR+KR RLV+ AQTLP G   A+
Subjt:  YISSLRLKRPRLVEVAQTLPGGGQAAS

A0A6J1DP35 uncharacterized protein LOC111022193 isoform X1 | e value: 2.7e-228 | % identity: 81.29
Query:  MISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVDSKCSKKSSPTKSSSEEISKKQTEANGTNESLPLKKKGSKKRTSKDAA
        MISPRKQGKEN LNGNNESNLNLQ QTP SDKKKLKEMK EEL+EICNGNKVD KCS KSSPTKSS EE SK+QTEANG N SLP KKKGSKK+TSKDAA
Subjt:  MISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVDSKCSKKSSPTKSSSEEISKKQTEANGTNESLPLKKKGSKKRTSKDAA

Query:  SDVSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYHL--ADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLAKSQENMKCTVNIQNKEYGACVP
        SDVSR KDARE+NS CHEDAKAPDA+KEE++++SK VS HL  A+ T E   LDIHK      DA  NK +VKDKPL  SQENMKCTVN  NKE+ ACV 
Subjt:  SDVSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYHL--ADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLAKSQENMKCTVNIQNKEYGACVP

Query:  LPPGSRLTTIADIELTTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEESSISSPTDDRSAWL
        LP GSRLTT+A++ELTTEDVGHALQFLEFCAAFGKALNL+KGHAESVLKDLMR+RTQRRR RV+DS+TVRFHIQLLSLILKDMDEES+ISSPT+D S+WL
Subjt:  LPPGSRLTTIADIELTTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEESSISSPTDDRSAWL

Query:  LALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLSALKDKEKQAKHKLQDELAKALIA
        LALKKCISAS FKL+DLKPDYFD GD+CYDDLDFS+KLRLLTY+CDEALNTTKLRNWI+EQN NFVEEQKE KEKL+ALKDKEKQAKHKL+DELAKALIA
Subjt:  LALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLSALKDKEKQAKHKLQDELAKALIA

Query:  KNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSDATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEKWSMYKGEQKQEI
        KNG+PL+IAEHDAIVSQ+K+DVA AQ+ERLV LE+ASKR++ SDATRTVPI+LD NGRVFW LRGF+GEGNILLQDMESWE VNPSEKWSMYKGEQKQEI
Subjt:  KNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSDATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEKWSMYKGEQKQEI

Query:  EKYISSLRLKRPRLVEVAQTLPGGGQAAS
        E+YISSLR+KR RLV+ AQTLP G   A+
Subjt:  EKYISSLRLKRPRLVEVAQTLPGGGQAAS

A0A6J1G959 uncharacterized protein LOC111452074 isoform X1 | e value: 2.5e-205 | % identity: 75.28
Query:  MISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVDSKCSKKSSPTKSSSEEISKKQTEANGTNESLPLKKKGSKKRTSKDAA
        MISPR +GKENSLNGNNESNLNLQ QTP S+KK+LK+MKR+ELKEI NGNKVD KC           EE  KKQTEANGTNE L  KKK SKKR S+DA 
Subjt:  MISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVDSKCSKKSSPTKSSSEEISKKQTEANGTNESLPLKKKGSKKRTSKDAA

Query:  SDVSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYHLADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLAKSQENMKCTVNIQNKEYGACVPLP
                  ERNS CHEDA  P            V++   A+++TE KDLDIHKYAKTSEDAK  + K+K+KP           +NIQNKE+GACVPLP
Subjt:  SDVSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYHLADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLAKSQENMKCTVNIQNKEYGACVPLP

Query:  PGSRLTTIADIELTTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEESSISSPTDDRSAWLLA
        P S+LTT+ADIELTTEDVGHALQFLEFCAAFGKALNLKKGH   VLKDL RERT RR  RVHDS+TVRFHIQLLSLIL+DMDEES+ISSPT+D S+WLL 
Subjt:  PGSRLTTIADIELTTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEESSISSPTDDRSAWLLA

Query:  LKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLSALKDKEKQAKHKLQDELAKALIAKN
        LKKCISAS FK+NDLKPDYFDGGDNCYDDLDFSKKLRLLTY+CDEALNTTKLRNWIEEQN NFVE+QKE++EKLSALKDKEKQAK+KL+DE AKALIAKN
Subjt:  LKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLSALKDKEKQAKHKLQDELAKALIAKN

Query:  GLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSDATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEKWSMYKGEQKQEIEK
        GLPLSIAEHDAIV+QIK+DVAE QAE+LVALELAS +++ S+ATRTVPI+LDVNGRVFWKLRGF+GEGNILLQDMESWE+VNPSEKWSM+K EQKQEIEK
Subjt:  GLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSDATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEKWSMYKGEQKQEIEK

Query:  YISSLRLKR-PRLVEVAQTLPGGG-QAASI
        YISSLRLKR PRLVEV QTLP GG +AAS+
Subjt:  YISSLRLKR-PRLVEVAQTLPGGG-QAASI

A0A6J1KD89 uncharacterized protein LOC111492807 isoform X1 | e value: 2.7e-199 | % identity: 73.68
Query:  MISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVDSKCSKKSSPTKSSSEEISKKQTEANGTNESLPLKKKGSKKRTSKDAA
        MIS R +GKENSLNGNN+SNLNLQ QTP S+KK+LK+MKR+ LKEI NGNKVD KC           EE  KKQTEANGTNE L  KKK SKKR S+DA 
Subjt:  MISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVDSKCSKKSSPTKSSSEEISKKQTEANGTNESLPLKKKGSKKRTSKDAA

Query:  SDVSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYHL--ADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLAKSQENMKCTVNIQNKEYGACVP
                  ERNS CHEDA                 S HL  A+++ E  DLDIHKYAKTSEDAK  + K+K+KP           +NIQNKE+GACV 
Subjt:  SDVSRLKDARERNSFCHEDAKAPDAVKEEEQKFSKVVSYHL--ADDTTEGKDLDIHKYAKTSEDAKCNK-KVKDKPLAKSQENMKCTVNIQNKEYGACVP

Query:  LPPGSRLTTIADIELTTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEESSISSPTDDRSAWL
        LPP S+LTT+ADIELTTEDVGHALQFLEFCAAFGKALNLKKGH   VLKDL RERT RR  RVHDS+TVRFHIQLLSLIL+DMDEES+ISSPT+D S+WL
Subjt:  LPPGSRLTTIADIELTTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEESSISSPTDDRSAWL

Query:  LALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLSALKDKEKQAKHKLQDELAKALIA
        L LKKCISAS FK+NDLKPDYF GGDNCYDDLDFSKKLRLLTY+CDEALNTTKLRNWIEEQN NFVE+QKE++EKLSALKDKEK+AK+KL+DE AKALIA
Subjt:  LALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLSALKDKEKQAKHKLQDELAKALIA

Query:  KNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSDATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEKWSMYKGEQKQEI
        KNGLPLSIAEHDAIV+QIK+DVAE QAE+LVALELAS +++ S+ATRTVPI+LDVNGRVFWKLRGF+GEGNILLQDMESWE+VNPSEKWSM+K EQKQEI
Subjt:  KNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSDATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEKWSMYKGEQKQEI

Query:  EKYISSLRLKR-PRLVEVAQTLPGGG-QAASI
        EKYISSLRLKR PRLVEV QTLP GG +AAS+
Subjt:  EKYISSLRLKR-PRLVEVAQTLPGGG-QAASI

SwissProt top hits (columns: hit, e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT1G67270.1 Zinc-finger domain of monoamine-oxidase A repressor R1 protein | e value: 1.8e-59 | % identity: 41.26
Query:  KCNKKVKDK-PLAKSQENMKCTVNIQNKEYGACVPLPPGSRLTTIADIELTTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLM---RERTQRRRCR
        K  KKV DK   AK  + +K  + ++         LP G  LT ++ I++ TE+ G+  Q  EFC+AFGKAL LK+GHAE+++++L    R   +++ C 
Subjt:  KCNKKVKDK-PLAKSQENMKCTVNIQNKEYGACVPLPPGSRLTTIADIELTTEDVGHALQFLEFCAAFGKALNLKKGHAESVLKDLM---RERTQRRRCR

Query:  VHDSVTVRFHIQLLSLILKDMDEESSISSPTDDRSAWLLALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQN
        +      +  IQLL LI KD +   S+S+ TD  S+W  A+ + +S S    ++L  + F GG   Y+ ++ S+KL+LL ++CDE+L+T  +RN+I  Q 
Subjt:  VHDSVTVRFHIQLLSLILKDMDEESSISSPTDDRSAWLLALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQN

Query:  TNFVEEQKELKEKLSALKDKEKQAKHKLQDELAKALIAKNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSDATRTVPIVLDVNGRVFWK
            E +KE K+K +A K KEKQ K K+Q E+AK+++ KNG PLSI EH++IVSQI+ +  EA  E + A  + SK     DA RT PI+LD NG V WK
Subjt:  TNFVEEQKELKEKLSALKDKEKQAKHKLQDELAKALIAKNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSSDATRTVPIVLDVNGRVFWK

Query:  LRGFSGEGNILLQDMESWEAVNPSEKWSMYKGEQKQEIEKYISSLRLKR
        L+ F  E   LLQD+ +++ + P E+W  +K EQK EIE  IS +R K+
Subjt:  LRGFSGEGNILLQDMESWEAVNPSEKWSMYKGEQKQEIEKYISSLRLKR

AT1G67780.1 Zinc-finger domain of monoamine-oxidase A repressor R1 protein | e value: 7.2e-56 | % identity: 37.84
Query:  LADDTTEGKDLDIHKYAKTSEDAKCNKKVKDKPLAKSQENMKCTVNIQNKEYGACVPLPPGSRLTTIADIELTTEDVGHALQFLEFCAAFGKALNLKKGH
        L DD+ EG   +    + T  +     K K K + KS++    T  ++ +E      LP G  L +++ + + TE+ G+  Q  EFC+AFGKAL LK+G 
Subjt:  LADDTTEGKDLDIHKYAKTSEDAKCNKKVKDKPLAKSQENMKCTVNIQNKEYGACVPLPPGSRLTTIADIELTTEDVGHALQFLEFCAAFGKALNLKKGH

Query:  AESVLKDLM---RERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEESSISSPTDDRSAWLLALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRL
        AE+V+++L    R   +++ C +     ++  IQLL LI K  D E S+S    D   W  AL + +  S    ++  P+ F+ G   Y+ +D S++L+L
Subjt:  AESVLKDLM---RERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEESSISSPTDDRSAWLLALKKCISASPFKLNDLKPDYFDGGDNCYDDLDFSKKLRL

Query:  LTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLSALKDKEKQAKHKLQDELAKALIAKNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRR
        L ++CDE+L+T  +RN I+ Q+T       E K K +A K+KEKQ K KLQ +LAKA++ KNG PLSI EH+ I+SQI+ +  EA    + A  + S   
Subjt:  LTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLSALKDKEKQAKHKLQDELAKALIAKNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRR

Query:  RSSDATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEKWSMYKGEQKQEIEKYISSLRLK
        R  DA RT PI+++ NG V WKL  +  E   LLQD+ +++ +   EKW  +K EQK +IE YIS  R K
Subjt:  RSSDATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEKWSMYKGEQKQEIEKYISSLRLK

AT5G38690.1 Zinc-finger domain of monoamine-oxidase A repressor R1 protein | e value: 3.2e-72 | % identity: 37.85
Query:  MKREELKEICNGNKVDSKCSKKSSPTKSSSEEISKKQTEANGTNESLPLKKKGS-----KKRTSKDAASDVSRLK-DARERNSFCHEDAKAPDAVKEEE-
        +K++ +   C GN   S C KK     +     + K+T  +  +E   LK  GS      K+   +    VS LK D        H   K     K EE 
Subjt:  MKREELKEICNGNKVDSKCSKKSSPTKSSSEEISKKQTEANGTNESLPLKKKGS-----KKRTSKDAASDVSRLK-DARERNSFCHEDAKAPDAVKEEE-

Query:  -------------QKFSKVVSYHLADDTTEGKDLDIHKYAKTSEDAKCNKKVKDKPLAKSQENMKCTVNIQNKEYGACVPLPPGSRLTTIADIELTTEDV
                      K S      L+D      ++     A+  +     K +K+  +A+  + +K  +  + +E    + +P G+   T++ I+L  ED 
Subjt:  -------------QKFSKVVSYHLADDTTEGKDLDIHKYAKTSEDAKCNKKVKDKPLAKSQENMKCTVNIQNKEYGACVPLPPGSRLTTIADIELTTEDV

Query:  GHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEESSISSPTDDRSAWLLALKKCISASPFKLNDLKPD
        G+  QFLEFC+AFGKAL+L+KG AE V+++++  R++RR+     S   +  IQLL++IL+D  E S   S TD   +W   + +C+S S  KL+D  P+
Subjt:  GHALQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEESSISSPTDDRSAWLLALKKCISASPFKLNDLKPD

Query:  YFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLSALKDKEKQAKHKLQDELAKALIAKNGLPLSIAEHDAIVSQIKK
         F+ G + Y+ L+ SK+L+LL ++CDE L T  +RN I+ QN   VE +KE KEK++A KDKEKQ K KLQDELA+A+ AKNG+PL I EHDAIVS+I  
Subjt:  YFDGGDNCYDDLDFSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLSALKDKEKQAKHKLQDELAKALIAKNGLPLSIAEHDAIVSQIKK

Query:  DVAEAQAERLVALELASKRRR-SSDATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEKWSMYKGEQKQEIEKYISSLRLKRPRLVEVAQ
        +  E  +E   A+++ SK+ + S DA RT P+ LD NG +FW+L+ ++ E NILLQD+ SW  V P EKW  +  EQK EIEKYIS +R+KR +  + A 
Subjt:  DVAEAQAERLVALELASKRRR-SSDATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEKWSMYKGEQKQEIEKYISSLRLKRPRLVEVAQ

Query:  TL
        T+
Subjt:  TL
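
In the alignments above, the unlabeled line under each Query line is the match line: a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. As a hedged sketch of how the % identity column relates to those lines, the function below counts identities and positives from one Query/match-line pair. The name `alignment_stats` is illustrative. It assumes both strings have already been stripped of the `Query:` label and padded to the same length. The tool that produced these hits may use a different denominator, so the result need not reproduce the reported values exactly.

```python
def alignment_stats(query: str, midline: str) -> dict:
    """Count identities and positives from a BLAST-style query/match-line pair.

    Both strings must cover the same alignment columns (same length).
    """
    if len(query) != len(midline):
        raise ValueError("query and match line must have the same length")
    columns = len(query)
    # An identity is a column where the match line repeats the query residue.
    identities = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    # Positives are identities plus conservative substitutions ('+').
    positives = identities + midline.count("+")
    return {
        "columns": columns,
        "identities": identities,
        "positives": positives,
        "percent_identity": 100.0 * identities / columns if columns else 0.0,
    }


# First ten columns of the first GenBank alignment on this page.
stats = alignment_stats("MISPRKQGKE", "M++PRKQGKE")
print(stats)
```

For a whole hit, the Query and match-line segments of every alignment block would be concatenated before calling the function.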


Sequences
CDS sequence
ATGATTTCACCAAGGAAACAAGGGAAGGAAAATTCTTTAAATGGAAATAATGAATCGAACTTGAATTTGCAGACTCAAACTCCAACTTCTGATAAAAAGAAGTTGAAGGA
AATGAAGCGTGAAGAACTGAAAGAAATATGCAACGGAAATAAGGTTGATAGTAAATGCTCAAAGAAGAGCAGTCCAACGAAATCCAGCTCAGAGGAAATTTCTAAGAAAC
AAACTGAAGCAAATGGGACGAACGAATCTCTTCCCTTAAAGAAAAAGGGCTCCAAAAAAAGGACTTCTAAGGATGCTGCCTCCGATGTTAGCAGACTCAAGGATGCTAGA
GAAAGAAACAGTTTCTGTCATGAGGATGCCAAAGCACCAGATGCTGTGAAAGAGGAAGAACAGAAGTTCTCAAAGGTTGTTTCTTACCATCTGGCTGATGATACAACAGA
AGGAAAAGATTTGGATATTCACAAATATGCCAAGACATCAGAAGATGCAAAATGTAACAAAAAGGTGAAAGACAAGCCTCTAGCGAAGTCCCAGGAAAATATGAAATGCA
CTGTGAATATTCAGAACAAGGAATATGGTGCCTGTGTTCCTTTGCCTCCAGGCTCAAGGCTAACAACTATAGCAGATATTGAACTCACCACAGAAGATGTTGGTCATGCA
TTGCAGTTTTTAGAATTCTGTGCAGCTTTTGGAAAGGCTCTTAATTTAAAGAAAGGGCATGCTGAGTCTGTACTCAAAGATCTAATGCGGGAGAGAACTCAAAGAAGAAG
GTGTCGAGTTCATGATTCAGTGACTGTTCGATTTCATATTCAACTTCTGTCTCTGATATTGAAGGATATGGATGAAGAGTCTTCCATCTCTAGTCCCACAGATGACAGAA
GTGCATGGTTGCTGGCTTTGAAGAAATGCATTTCTGCATCCCCATTTAAGTTGAATGATCTGAAACCAGATTACTTTGATGGAGGTGACAATTGTTATGATGACTTAGAC
TTCTCAAAAAAGCTCAGACTATTGACTTACATATGTGATGAGGCTCTCAATACTACAAAATTGAGAAACTGGATTGAAGAACAAAATACTAACTTTGTGGAGGAACAAAA
GGAACTTAAAGAAAAACTTTCTGCACTGAAGGATAAGGAGAAACAAGCTAAACACAAATTGCAAGATGAGTTGGCCAAGGCTCTTATTGCAAAGAATGGTCTTCCCCTTT
CAATTGCAGAGCATGATGCTATTGTCTCGCAAATAAAAAAAGACGTAGCTGAAGCTCAAGCTGAGAGGCTTGTTGCATTGGAATTGGCATCGAAGAGAAGACGAAGTTCA
GATGCTACTAGGACAGTTCCCATTGTTTTGGATGTTAATGGTCGTGTATTTTGGAAATTAAGAGGCTTTTCTGGTGAAGGGAATATTCTGCTACAAGATATGGAAAGCTG
GGAAGCAGTCAATCCAAGCGAAAAGTGGTCTATGTATAAAGGCGAGCAGAAACAAGAGATAGAAAAATACATTTCTTCTCTAAGGTTGAAGAGGCCTAGGTTAGTGGAAG
TAGCTCAAACTCTTCCAGGTGGAGGTCAGGCAGCTTCAATTGCCAAAAATGACTTGAAAAGAGATCTATCTAGTTTCCATTGTCCTCGTCTTCTAGATGGGATATCCATA
TCCCTCTATTTTGTAACTTAG
mRNA sequence
ATGATTTCACCAAGGAAACAAGGGAAGGAAAATTCTTTAAATGGAAATAATGAATCGAACTTGAATTTGCAGACTCAAACTCCAACTTCTGATAAAAAGAAGTTGAAGGA
AATGAAGCGTGAAGAACTGAAAGAAATATGCAACGGAAATAAGGTTGATAGTAAATGCTCAAAGAAGAGCAGTCCAACGAAATCCAGCTCAGAGGAAATTTCTAAGAAAC
AAACTGAAGCAAATGGGACGAACGAATCTCTTCCCTTAAAGAAAAAGGGCTCCAAAAAAAGGACTTCTAAGGATGCTGCCTCCGATGTTAGCAGACTCAAGGATGCTAGA
GAAAGAAACAGTTTCTGTCATGAGGATGCCAAAGCACCAGATGCTGTGAAAGAGGAAGAACAGAAGTTCTCAAAGGTTGTTTCTTACCATCTGGCTGATGATACAACAGA
AGGAAAAGATTTGGATATTCACAAATATGCCAAGACATCAGAAGATGCAAAATGTAACAAAAAGGTGAAAGACAAGCCTCTAGCGAAGTCCCAGGAAAATATGAAATGCA
CTGTGAATATTCAGAACAAGGAATATGGTGCCTGTGTTCCTTTGCCTCCAGGCTCAAGGCTAACAACTATAGCAGATATTGAACTCACCACAGAAGATGTTGGTCATGCA
TTGCAGTTTTTAGAATTCTGTGCAGCTTTTGGAAAGGCTCTTAATTTAAAGAAAGGGCATGCTGAGTCTGTACTCAAAGATCTAATGCGGGAGAGAACTCAAAGAAGAAG
GTGTCGAGTTCATGATTCAGTGACTGTTCGATTTCATATTCAACTTCTGTCTCTGATATTGAAGGATATGGATGAAGAGTCTTCCATCTCTAGTCCCACAGATGACAGAA
GTGCATGGTTGCTGGCTTTGAAGAAATGCATTTCTGCATCCCCATTTAAGTTGAATGATCTGAAACCAGATTACTTTGATGGAGGTGACAATTGTTATGATGACTTAGAC
TTCTCAAAAAAGCTCAGACTATTGACTTACATATGTGATGAGGCTCTCAATACTACAAAATTGAGAAACTGGATTGAAGAACAAAATACTAACTTTGTGGAGGAACAAAA
GGAACTTAAAGAAAAACTTTCTGCACTGAAGGATAAGGAGAAACAAGCTAAACACAAATTGCAAGATGAGTTGGCCAAGGCTCTTATTGCAAAGAATGGTCTTCCCCTTT
CAATTGCAGAGCATGATGCTATTGTCTCGCAAATAAAAAAAGACGTAGCTGAAGCTCAAGCTGAGAGGCTTGTTGCATTGGAATTGGCATCGAAGAGAAGACGAAGTTCA
GATGCTACTAGGACAGTTCCCATTGTTTTGGATGTTAATGGTCGTGTATTTTGGAAATTAAGAGGCTTTTCTGGTGAAGGGAATATTCTGCTACAAGATATGGAAAGCTG
GGAAGCAGTCAATCCAAGCGAAAAGTGGTCTATGTATAAAGGCGAGCAGAAACAAGAGATAGAAAAATACATTTCTTCTCTAAGGTTGAAGAGGCCTAGGTTAGTGGAAG
TAGCTCAAACTCTTCCAGGTGGAGGTCAGGCAGCTTCAATTGCCAAAAATGACTTGAAAAGAGATCTATCTAGTTTCCATTGTCCTCGTCTTCTAGATGGGATATCCATA
TCCCTCTATTTTGTAACTTAG
Protein sequence
MISPRKQGKENSLNGNNESNLNLQTQTPTSDKKKLKEMKREELKEICNGNKVDSKCSKKSSPTKSSSEEISKKQTEANGTNESLPLKKKGSKKRTSKDAASDVSRLKDAR
ERNSFCHEDAKAPDAVKEEEQKFSKVVSYHLADDTTEGKDLDIHKYAKTSEDAKCNKKVKDKPLAKSQENMKCTVNIQNKEYGACVPLPPGSRLTTIADIELTTEDVGHA
LQFLEFCAAFGKALNLKKGHAESVLKDLMRERTQRRRCRVHDSVTVRFHIQLLSLILKDMDEESSISSPTDDRSAWLLALKKCISASPFKLNDLKPDYFDGGDNCYDDLD
FSKKLRLLTYICDEALNTTKLRNWIEEQNTNFVEEQKELKEKLSALKDKEKQAKHKLQDELAKALIAKNGLPLSIAEHDAIVSQIKKDVAEAQAERLVALELASKRRRSS
DATRTVPIVLDVNGRVFWKLRGFSGEGNILLQDMESWEAVNPSEKWSMYKGEQKQEIEKYISSLRLKRPRLVEVAQTLPGGGQAASIAKNDLKRDLSSFHCPRLLDGISI
SLYFVT
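
The protein sequence above corresponds to the CDS read codon by codon. A minimal sketch of that translation follows, assuming the standard nuclear genetic code (NCBI translation table 1) and a CDS that starts in frame; `translate` and `CODON_TABLE` are illustrative names, not functions from a particular library.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        # Codons with ambiguous bases (e.g. N) are reported as 'X'.
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons and last four codons of the CDS shown on this page.
print(translate("ATGATTTCACCAAGGAAACAAGGG"))
print(translate("TTTGTAACTTAG"))
```

Applied to the full CDS with its line breaks removed, the same function can be used to check the result against the protein sequence listed here.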