CuGenDBv2

Sgr024822 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr024822
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: protein ALWAYS EARLY 3
Genome location: tig00002486:3157309..3159171
RNA-Seq Expression: Sgr024822
Synteny: Sgr024822
Gene Ontology terms:
GO:0006351 - transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0017053 - transcriptional repressor complex (cellular component)
InterPro domains: IPR010561 - Protein LIN-9/Protein ALWAYS EARLY
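
The genome location above is given as a `contig:start..end` string. A minimal Python sketch of how such a string could be split into its parts is shown here; the function name `parse_location` is hypothetical, and the span length assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re


def parse_location(location):
    """Split a 'contig:start..end' string into (contig, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span
    covers end - start + 1 bases.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    contig = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return contig, start, end, end - start + 1


contig, start, end, span = parse_location("tig00002486:3157309..3159171")
print(contig, start, end, span)
```

Under that coordinate assumption the gene spans 1863 bases on contig tig00002486.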


Homology
GenBank top hits | e value | % identity
KAG6582226.1 Protein ALWAYS EARLY 3, partial [Cucurbita argyrosperma subsp. sororia] | 2.3e-138 | 89.76
Query:  VDPGCPNLQAKFGLSETVSIQQEASSQPSALAQIQAKEADVHALSELSRALDKKEVVVSELKRLNDEVLENQTNGDNLLKDSENFKKQYAAVLLQFNEVN
        VD GC NLQAKFGLSETV IQQEASSQ SALAQIQAKEADVHALSELSRAL KKEVVVSELKRLNDEVLENQ NGDNLLKDSENFKKQYAAVLLQ NEVN
Subjt:  VDPGCPNLQAKFGLSETVSIQQEASSQPSALAQIQAKEADVHALSELSRALDKKEVVVSELKRLNDEVLENQTNGDNLLKDSENFKKQYAAVLLQFNEVN

Query:  EQVSSALFCLRQRNTYQGTSPLMFLKPVHDLGDSCCHAQEPGSHVAEIVGSSRAKAQTMIDEAMQAILTLRKGESNLESIEEAIDFVSNRLSVDDLALPI
        EQVSSAL+CLRQRNTYQGT PLMFLKPVHD+GDSC HAQEP SHVAEIVGSSRAKAQTMIDEAMQAIL L+KGESNLE+IEEAIDFVSNRLSVDDLALP 
Subjt:  EQVSSALFCLRQRNTYQGTSPLMFLKPVHDLGDSCCHAQEPGSHVAEIVGSSRAKAQTMIDEAMQAILTLRKGESNLESIEEAIDFVSNRLSVDDLALPI

Query:  VRLTAADTSNATPISQNHFNACTSNPSTANHVVGSKSNGPSDKNEEDIPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAVNSLQPCCPQ
        VR TAADTSN+ P+SQNHFN CTSNPS A+HVVG KSNG SDK E +IPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAV+ LQPCCPQ
Subjt:  VRLTAADTSNATPISQNHFNACTSNPSTANHVVGSKSNGPSDKNEEDIPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAVNSLQPCCPQ

XP_004134200.2 protein ALWAYS EARLY 3 isoform X2 [Cucumis sativus] | 6.0e-139 | 90.1
Query:  VDPGCPNLQAKFGLSETVSIQQEASSQPSALAQIQAKEADVHALSELSRALDKKEVVVSELKRLNDEVLENQTNGDNLLKDSENFKKQYAAVLLQFNEVN
        VD GC NLQAKFGLSETV IQQE SSQPSALAQIQAKEADVHALSELSRALDKKEVVVSELKRLNDEVLENQ NGDNLLKDSENFKKQYAAVLLQ NEVN
Subjt:  VDPGCPNLQAKFGLSETVSIQQEASSQPSALAQIQAKEADVHALSELSRALDKKEVVVSELKRLNDEVLENQTNGDNLLKDSENFKKQYAAVLLQFNEVN

Query:  EQVSSALFCLRQRNTYQGTSPLMFLKPVHDLGDSCCHAQEPGSHVAEIVGSSRAKAQTMIDEAMQAILTLRKGESNLESIEEAIDFVSNRLSVDDLALPI
        EQVSSAL+CLRQRNTYQGTSPLMFLKPVHD GD C H+QEPGSHVAEIVGSSRAKAQTMIDEAMQAIL L+KGESNLE+IEEAIDFVSNRL+VDDLALP 
Subjt:  EQVSSALFCLRQRNTYQGTSPLMFLKPVHDLGDSCCHAQEPGSHVAEIVGSSRAKAQTMIDEAMQAILTLRKGESNLESIEEAIDFVSNRLSVDDLALPI

Query:  VRLTAADTSNATPISQNHFNACTSNPSTANHVVGSKSNGPSDKNEEDIPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAVNSLQPCCPQ
        VR  AADTSNA P+SQNHFNACTSN STA+ VVG KSNG SDK E +IPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAV+SLQPCCPQ
Subjt:  VRLTAADTSNATPISQNHFNACTSNPSTANHVVGSKSNGPSDKNEEDIPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAVNSLQPCCPQ

XP_008438872.2 PREDICTED: protein ALWAYS EARLY 3 isoform X1 [Cucumis melo] | 1.0e-138 | 89.76
Query:  VDPGCPNLQAKFGLSETVSIQQEASSQPSALAQIQAKEADVHALSELSRALDKKEVVVSELKRLNDEVLENQTNGDNLLKDSENFKKQYAAVLLQFNEVN
        VD GC NLQAKFGLSETV IQQE SSQPSALAQIQAKEADVHALSELSRALDKKEVVVSELKRLNDEVLENQ NGDNLLKDSENFKKQYAAVLLQ NEVN
Subjt:  VDPGCPNLQAKFGLSETVSIQQEASSQPSALAQIQAKEADVHALSELSRALDKKEVVVSELKRLNDEVLENQTNGDNLLKDSENFKKQYAAVLLQFNEVN

Query:  EQVSSALFCLRQRNTYQGTSPLMFLKPVHDLGDSCCHAQEPGSHVAEIVGSSRAKAQTMIDEAMQAILTLRKGESNLESIEEAIDFVSNRLSVDDLALPI
        EQVSSAL+CLRQRNTYQGTSPLMFLKPVHD GD C H+QEPGSHVAEIVGSSRAKAQTMIDEAMQAIL L+KGESNLE+IEEAIDFVSNRL+VDDLALP 
Subjt:  EQVSSALFCLRQRNTYQGTSPLMFLKPVHDLGDSCCHAQEPGSHVAEIVGSSRAKAQTMIDEAMQAILTLRKGESNLESIEEAIDFVSNRLSVDDLALPI

Query:  VRLTAADTSNATPISQNHFNACTSNPSTANHVVGSKSNGPSDKNEEDIPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAVNSLQPCCPQ
        VR  AADTSNA P+SQNHFN CTSN STA+ VVG+KSNG SDK E +IPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAV+SLQPCCPQ
Subjt:  VRLTAADTSNATPISQNHFNACTSNPSTANHVVGSKSNGPSDKNEEDIPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAVNSLQPCCPQ

XP_022138292.1 protein ALWAYS EARLY 3 [Momordica charantia] | 3.9e-146 | 93.17
Query:  VDPGCPNLQAKFGLSETVSIQQEASSQPSALAQIQAKEADVHALSELSRALDKKEVVVSELKRLNDEVLENQTNGDNLLKDSENFKKQYAAVLLQFNEVN
        VD GC +LQ KFGL+ETVSIQQEASSQPSALAQIQAKEADVHALSELSRALDKKEVVVSELKRLNDEVLENQ NGDNLLKDSENFKKQYAAVLLQ +EVN
Subjt:  VDPGCPNLQAKFGLSETVSIQQEASSQPSALAQIQAKEADVHALSELSRALDKKEVVVSELKRLNDEVLENQTNGDNLLKDSENFKKQYAAVLLQFNEVN

Query:  EQVSSALFCLRQRNTYQGTSPLMFLKPVHDLGDSCCHAQEPGSHVAEIVGSSRAKAQTMIDEAMQAILTLRKGESNLESIEEAIDFVSNRLSVDDLALPI
        EQVSSALFCLRQRNTYQGTSPLMFLKPVHDLGDSCCHAQEPGSHVAEIVGSSRAKAQTMIDEAMQ ILTL+KGESNLESIEEAIDFVSNRLSVDDLALP 
Subjt:  EQVSSALFCLRQRNTYQGTSPLMFLKPVHDLGDSCCHAQEPGSHVAEIVGSSRAKAQTMIDEAMQAILTLRKGESNLESIEEAIDFVSNRLSVDDLALPI

Query:  VRLTAADTSNATPISQNHFNACTSNPSTANHVVGSKSNGPSDKNEEDIPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAVNSLQPCCPQ
         RLTAADTSNATP+SQNHF+AC SNPSTANHVVG KSNGPSDKNE ++PSELIAHCVATLLMIQKCTERQFPP DVAQVLDSAVNSLQPCCPQ
Subjt:  VRLTAADTSNATPISQNHFNACTSNPSTANHVVGSKSNGPSDKNEEDIPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAVNSLQPCCPQ

XP_031739375.1 protein ALWAYS EARLY 3 isoform X1 [Cucumis sativus] | 6.0e-139 | 90.1
Query:  VDPGCPNLQAKFGLSETVSIQQEASSQPSALAQIQAKEADVHALSELSRALDKKEVVVSELKRLNDEVLENQTNGDNLLKDSENFKKQYAAVLLQFNEVN
        VD GC NLQAKFGLSETV IQQE SSQPSALAQIQAKEADVHALSELSRALDKKEVVVSELKRLNDEVLENQ NGDNLLKDSENFKKQYAAVLLQ NEVN
Subjt:  VDPGCPNLQAKFGLSETVSIQQEASSQPSALAQIQAKEADVHALSELSRALDKKEVVVSELKRLNDEVLENQTNGDNLLKDSENFKKQYAAVLLQFNEVN

Query:  EQVSSALFCLRQRNTYQGTSPLMFLKPVHDLGDSCCHAQEPGSHVAEIVGSSRAKAQTMIDEAMQAILTLRKGESNLESIEEAIDFVSNRLSVDDLALPI
        EQVSSAL+CLRQRNTYQGTSPLMFLKPVHD GD C H+QEPGSHVAEIVGSSRAKAQTMIDEAMQAIL L+KGESNLE+IEEAIDFVSNRL+VDDLALP 
Subjt:  EQVSSALFCLRQRNTYQGTSPLMFLKPVHDLGDSCCHAQEPGSHVAEIVGSSRAKAQTMIDEAMQAILTLRKGESNLESIEEAIDFVSNRLSVDDLALPI

Query:  VRLTAADTSNATPISQNHFNACTSNPSTANHVVGSKSNGPSDKNEEDIPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAVNSLQPCCPQ
        VR  AADTSNA P+SQNHFNACTSN STA+ VVG KSNG SDK E +IPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAV+SLQPCCPQ
Subjt:  VRLTAADTSNATPISQNHFNACTSNPSTANHVVGSKSNGPSDKNEEDIPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAVNSLQPCCPQ

TrEMBL top hits | e value | % identity
A0A0A0L571 SANT domain-containing protein | 2.9e-139 | 90.1
Query:  VDPGCPNLQAKFGLSETVSIQQEASSQPSALAQIQAKEADVHALSELSRALDKKEVVVSELKRLNDEVLENQTNGDNLLKDSENFKKQYAAVLLQFNEVN
        VD GC NLQAKFGLSETV IQQE SSQPSALAQIQAKEADVHALSELSRALDKKEVVVSELKRLNDEVLENQ NGDNLLKDSENFKKQYAAVLLQ NEVN
Subjt:  VDPGCPNLQAKFGLSETVSIQQEASSQPSALAQIQAKEADVHALSELSRALDKKEVVVSELKRLNDEVLENQTNGDNLLKDSENFKKQYAAVLLQFNEVN

Query:  EQVSSALFCLRQRNTYQGTSPLMFLKPVHDLGDSCCHAQEPGSHVAEIVGSSRAKAQTMIDEAMQAILTLRKGESNLESIEEAIDFVSNRLSVDDLALPI
        EQVSSAL+CLRQRNTYQGTSPLMFLKPVHD GD C H+QEPGSHVAEIVGSSRAKAQTMIDEAMQAIL L+KGESNLE+IEEAIDFVSNRL+VDDLALP 
Subjt:  EQVSSALFCLRQRNTYQGTSPLMFLKPVHDLGDSCCHAQEPGSHVAEIVGSSRAKAQTMIDEAMQAILTLRKGESNLESIEEAIDFVSNRLSVDDLALPI

Query:  VRLTAADTSNATPISQNHFNACTSNPSTANHVVGSKSNGPSDKNEEDIPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAVNSLQPCCPQ
        VR  AADTSNA P+SQNHFNACTSN STA+ VVG KSNG SDK E +IPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAV+SLQPCCPQ
Subjt:  VRLTAADTSNATPISQNHFNACTSNPSTANHVVGSKSNGPSDKNEEDIPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAVNSLQPCCPQ

A0A1S3AXH9 protein ALWAYS EARLY 3 isoform X1 | 5.0e-139 | 89.76
Query:  VDPGCPNLQAKFGLSETVSIQQEASSQPSALAQIQAKEADVHALSELSRALDKKEVVVSELKRLNDEVLENQTNGDNLLKDSENFKKQYAAVLLQFNEVN
        VD GC NLQAKFGLSETV IQQE SSQPSALAQIQAKEADVHALSELSRALDKKEVVVSELKRLNDEVLENQ NGDNLLKDSENFKKQYAAVLLQ NEVN
Subjt:  VDPGCPNLQAKFGLSETVSIQQEASSQPSALAQIQAKEADVHALSELSRALDKKEVVVSELKRLNDEVLENQTNGDNLLKDSENFKKQYAAVLLQFNEVN

Query:  EQVSSALFCLRQRNTYQGTSPLMFLKPVHDLGDSCCHAQEPGSHVAEIVGSSRAKAQTMIDEAMQAILTLRKGESNLESIEEAIDFVSNRLSVDDLALPI
        EQVSSAL+CLRQRNTYQGTSPLMFLKPVHD GD C H+QEPGSHVAEIVGSSRAKAQTMIDEAMQAIL L+KGESNLE+IEEAIDFVSNRL+VDDLALP 
Subjt:  EQVSSALFCLRQRNTYQGTSPLMFLKPVHDLGDSCCHAQEPGSHVAEIVGSSRAKAQTMIDEAMQAILTLRKGESNLESIEEAIDFVSNRLSVDDLALPI

Query:  VRLTAADTSNATPISQNHFNACTSNPSTANHVVGSKSNGPSDKNEEDIPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAVNSLQPCCPQ
        VR  AADTSNA P+SQNHFN CTSN STA+ VVG+KSNG SDK E +IPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAV+SLQPCCPQ
Subjt:  VRLTAADTSNATPISQNHFNACTSNPSTANHVVGSKSNGPSDKNEEDIPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAVNSLQPCCPQ

A0A6J1CAP3 protein ALWAYS EARLY 3 | 1.9e-146 | 93.17
Query:  VDPGCPNLQAKFGLSETVSIQQEASSQPSALAQIQAKEADVHALSELSRALDKKEVVVSELKRLNDEVLENQTNGDNLLKDSENFKKQYAAVLLQFNEVN
        VD GC +LQ KFGL+ETVSIQQEASSQPSALAQIQAKEADVHALSELSRALDKKEVVVSELKRLNDEVLENQ NGDNLLKDSENFKKQYAAVLLQ +EVN
Subjt:  VDPGCPNLQAKFGLSETVSIQQEASSQPSALAQIQAKEADVHALSELSRALDKKEVVVSELKRLNDEVLENQTNGDNLLKDSENFKKQYAAVLLQFNEVN

Query:  EQVSSALFCLRQRNTYQGTSPLMFLKPVHDLGDSCCHAQEPGSHVAEIVGSSRAKAQTMIDEAMQAILTLRKGESNLESIEEAIDFVSNRLSVDDLALPI
        EQVSSALFCLRQRNTYQGTSPLMFLKPVHDLGDSCCHAQEPGSHVAEIVGSSRAKAQTMIDEAMQ ILTL+KGESNLESIEEAIDFVSNRLSVDDLALP 
Subjt:  EQVSSALFCLRQRNTYQGTSPLMFLKPVHDLGDSCCHAQEPGSHVAEIVGSSRAKAQTMIDEAMQAILTLRKGESNLESIEEAIDFVSNRLSVDDLALPI

Query:  VRLTAADTSNATPISQNHFNACTSNPSTANHVVGSKSNGPSDKNEEDIPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAVNSLQPCCPQ
         RLTAADTSNATP+SQNHF+AC SNPSTANHVVG KSNGPSDKNE ++PSELIAHCVATLLMIQKCTERQFPP DVAQVLDSAVNSLQPCCPQ
Subjt:  VRLTAADTSNATPISQNHFNACTSNPSTANHVVGSKSNGPSDKNEEDIPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAVNSLQPCCPQ

A0A6J1GVN8 protein ALWAYS EARLY 3-like isoform X3 | 4.2e-138 | 89.42
Query:  VDPGCPNLQAKFGLSETVSIQQEASSQPSALAQIQAKEADVHALSELSRALDKKEVVVSELKRLNDEVLENQTNGDNLLKDSENFKKQYAAVLLQFNEVN
        VD GC NLQAKFGLSETV IQQEASSQ SALAQIQAKEADVHALSELSRAL KKEVVVSELKRLNDEVLENQ +GDNLLKDSENFKKQYAAVLLQ NEVN
Subjt:  VDPGCPNLQAKFGLSETVSIQQEASSQPSALAQIQAKEADVHALSELSRALDKKEVVVSELKRLNDEVLENQTNGDNLLKDSENFKKQYAAVLLQFNEVN

Query:  EQVSSALFCLRQRNTYQGTSPLMFLKPVHDLGDSCCHAQEPGSHVAEIVGSSRAKAQTMIDEAMQAILTLRKGESNLESIEEAIDFVSNRLSVDDLALPI
        EQVSSAL+CLRQRNTYQGT PLMFLKPVHD+GDSC HAQEP SHVAEIVGSSRAKAQTMIDEAMQAIL L+KGESNLE+IEEAIDFVSNRLSVDDLALP 
Subjt:  EQVSSALFCLRQRNTYQGTSPLMFLKPVHDLGDSCCHAQEPGSHVAEIVGSSRAKAQTMIDEAMQAILTLRKGESNLESIEEAIDFVSNRLSVDDLALPI

Query:  VRLTAADTSNATPISQNHFNACTSNPSTANHVVGSKSNGPSDKNEEDIPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAVNSLQPCCPQ
        VR TAADTSN+ P+SQNHFN CTSNPS A+HVVG KSNG SDK E +IPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAV+ LQPCCPQ
Subjt:  VRLTAADTSNATPISQNHFNACTSNPSTANHVVGSKSNGPSDKNEEDIPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAVNSLQPCCPQ

A0A6J1GVV5 protein ALWAYS EARLY 3-like isoform X1 | 4.2e-138 | 89.42
Query:  VDPGCPNLQAKFGLSETVSIQQEASSQPSALAQIQAKEADVHALSELSRALDKKEVVVSELKRLNDEVLENQTNGDNLLKDSENFKKQYAAVLLQFNEVN
        VD GC NLQAKFGLSETV IQQEASSQ SALAQIQAKEADVHALSELSRAL KKEVVVSELKRLNDEVLENQ +GDNLLKDSENFKKQYAAVLLQ NEVN
Subjt:  VDPGCPNLQAKFGLSETVSIQQEASSQPSALAQIQAKEADVHALSELSRALDKKEVVVSELKRLNDEVLENQTNGDNLLKDSENFKKQYAAVLLQFNEVN

Query:  EQVSSALFCLRQRNTYQGTSPLMFLKPVHDLGDSCCHAQEPGSHVAEIVGSSRAKAQTMIDEAMQAILTLRKGESNLESIEEAIDFVSNRLSVDDLALPI
        EQVSSAL+CLRQRNTYQGT PLMFLKPVHD+GDSC HAQEP SHVAEIVGSSRAKAQTMIDEAMQAIL L+KGESNLE+IEEAIDFVSNRLSVDDLALP 
Subjt:  EQVSSALFCLRQRNTYQGTSPLMFLKPVHDLGDSCCHAQEPGSHVAEIVGSSRAKAQTMIDEAMQAILTLRKGESNLESIEEAIDFVSNRLSVDDLALPI

Query:  VRLTAADTSNATPISQNHFNACTSNPSTANHVVGSKSNGPSDKNEEDIPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAVNSLQPCCPQ
        VR TAADTSN+ P+SQNHFN CTSNPS A+HVVG KSNG SDK E +IPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAV+ LQPCCPQ
Subjt:  VRLTAADTSNATPISQNHFNACTSNPSTANHVVGSKSNGPSDKNEEDIPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAVNSLQPCCPQ

SwissProt top hits | e value | % identity
Q6A331 Protein ALWAYS EARLY 1 | 8.8e-16 | 33.97
Query:  AQEPGSHVAEIVGSSRAKAQTMIDEAMQAILTLRKGESNLESIEEAIDFVSNRLSVDDLALPIVRLTAADTSNATPISQNHFNACTSNPSTANHVVGSKS
        AQE    + EIV  S++ AQ M+D A++A  + +  E +   + +A+  +     +D+  +P ++         T  S +H +  T+ P +   +    S
Subjt:  AQEPGSHVAEIVGSSRAKAQTMIDEAMQAILTLRKGESNLESIEEAIDFVSNRLSVDDLALPIVRLTAADTSNATPISQNHFNACTSNPSTANHVVGSKS

Query:  NGPSDKNEEDIPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAVNSLQPCCPQ
           S KN+  +PSELI  CVA+ LM+Q  +++Q+PP+DVAQ++D+ VN LQP CPQ
Subjt:  NGPSDKNEEDIPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAVNSLQPCCPQ

Q6A332 Protein ALWAYS EARLY 3 | 1.9e-66 | 51.66
Query:  VDPGCPNLQAKFGLSETVSIQQEASSQPSALAQIQAKEADVHALSELSRALDKKEVVVSELKRLNDEVLENQTNG-DNLLKDSENFKKQYAAVLLQFNEV
        VD    N QA+ G+ E +++Q   +SQPS++ QIQA+EADV ALSEL+RALDKKE+V+ ELK +NDEV+E+Q +G +N LKDSE+FKKQYAAVL Q +E+
Subjt:  VDPGCPNLQAKFGLSETVSIQQEASSQPSALAQIQAKEADVHALSELSRALDKKEVVVSELKRLNDEVLENQTNG-DNLLKDSENFKKQYAAVLLQFNEV

Query:  NEQVSSALFCLRQRNTYQGTSPLMFLKPVHDLGDSCCH--------AQEPGSHVAEIVGSSRAKAQTMIDEAMQAILTLRKGESNLESIEEAIDFVSNRL
        NEQVS AL  LRQRNTYQ   P   ++ +   G+            +   G HV+EIV SSR KA+ M+  A+QA+  LRK E+N  ++EEAIDFV+N+L
Subjt:  NEQVSSALFCLRQRNTYQGTSPLMFLKPVHDLGDSCCH--------AQEPGSHVAEIVGSSRAKAQTMIDEAMQAILTLRKGESNLESIEEAIDFVSNRL

Query:  SVDDLALPIVRLTAADTSNATPISQNHFNACTSNPSTANHVVGSKSNGPSDKNEEDIPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAVNSLQPCC
        S+D         T   +   T   Q+     T NP ++     S  N P D+N+  +PS+L++ C+ATLLMIQKCTERQFPPS+VAQVLDSAV SLQPCC
Subjt:  SVDDLALPIVRLTAADTSNATPISQNHFNACTSNPSTANHVVGSKSNGPSDKNEEDIPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAVNSLQPCC

Query:  PQ
         Q
Subjt:  PQ

Q6A333 Protein ALWAYS EARLY 2 | 1.1e-18 | 40.51
Query:  QEPGSHVAEIVGSSRAKAQTMIDEAMQAILTLRKGESNLESIEEAIDFV-SNRLSVDDLALPIVRLTAADTSNATPISQNHFNACTSNPS--TANHVVGS
        +E    + EIV  S+ +AQ M+D A++A  ++++GE     I+EA++ V  N+L    +             +     ++H N   SN S   AN+ + S
Subjt:  QEPGSHVAEIVGSSRAKAQTMIDEAMQAILTLRKGESNLESIEEAIDFV-SNRLSVDDLALPIVRLTAADTSNATPISQNHFNACTSNPS--TANHVVGS

Query:  KSNGPSDKNEEDIPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAVNSLQPCCPQ
        +    S+KN + +PSELI  CVAT LMIQ CTERQ+PP+DVAQ++D+AV SLQP CPQ
Subjt:  KSNGPSDKNEEDIPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAVNSLQPCCPQ

Arabidopsis top hits | e value | % identity
AT3G05380.1 DIRP; Myb-like DNA-binding domain | 7.9e-20 | 40.51
Query:  QEPGSHVAEIVGSSRAKAQTMIDEAMQAILTLRKGESNLESIEEAIDFV-SNRLSVDDLALPIVRLTAADTSNATPISQNHFNACTSNPS--TANHVVGS
        +E    + EIV  S+ +AQ M+D A++A  ++++GE     I+EA++ V  N+L    +             +     ++H N   SN S   AN+ + S
Subjt:  QEPGSHVAEIVGSSRAKAQTMIDEAMQAILTLRKGESNLESIEEAIDFV-SNRLSVDDLALPIVRLTAADTSNATPISQNHFNACTSNPS--TANHVVGS

Query:  KSNGPSDKNEEDIPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAVNSLQPCCPQ
        +    S+KN + +PSELI  CVAT LMIQ CTERQ+PP+DVAQ++D+AV SLQP CPQ
Subjt:  KSNGPSDKNEEDIPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAVNSLQPCCPQ

AT3G05380.2 DIRP; Myb-like DNA-binding domain | 7.9e-20 | 40.51
Query:  QEPGSHVAEIVGSSRAKAQTMIDEAMQAILTLRKGESNLESIEEAIDFV-SNRLSVDDLALPIVRLTAADTSNATPISQNHFNACTSNPS--TANHVVGS
        +E    + EIV  S+ +AQ M+D A++A  ++++GE     I+EA++ V  N+L    +             +     ++H N   SN S   AN+ + S
Subjt:  QEPGSHVAEIVGSSRAKAQTMIDEAMQAILTLRKGESNLESIEEAIDFV-SNRLSVDDLALPIVRLTAADTSNATPISQNHFNACTSNPS--TANHVVGS

Query:  KSNGPSDKNEEDIPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAVNSLQPCCPQ
        +    S+KN + +PSELI  CVAT LMIQ CTERQ+PP+DVAQ++D+AV SLQP CPQ
Subjt:  KSNGPSDKNEEDIPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAVNSLQPCCPQ

AT3G05380.3 DIRP; Myb-like DNA-binding domain | 7.9e-20 | 40.51
Query:  QEPGSHVAEIVGSSRAKAQTMIDEAMQAILTLRKGESNLESIEEAIDFV-SNRLSVDDLALPIVRLTAADTSNATPISQNHFNACTSNPS--TANHVVGS
        +E    + EIV  S+ +AQ M+D A++A  ++++GE     I+EA++ V  N+L    +             +     ++H N   SN S   AN+ + S
Subjt:  QEPGSHVAEIVGSSRAKAQTMIDEAMQAILTLRKGESNLESIEEAIDFV-SNRLSVDDLALPIVRLTAADTSNATPISQNHFNACTSNPS--TANHVVGS

Query:  KSNGPSDKNEEDIPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAVNSLQPCCPQ
        +    S+KN + +PSELI  CVAT LMIQ CTERQ+PP+DVAQ++D+AV SLQP CPQ
Subjt:  KSNGPSDKNEEDIPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAVNSLQPCCPQ

AT3G05380.5 DIRP; Myb-like DNA-binding domain | 7.9e-20 | 40.51
Query:  QEPGSHVAEIVGSSRAKAQTMIDEAMQAILTLRKGESNLESIEEAIDFV-SNRLSVDDLALPIVRLTAADTSNATPISQNHFNACTSNPS--TANHVVGS
        +E    + EIV  S+ +AQ M+D A++A  ++++GE     I+EA++ V  N+L    +             +     ++H N   SN S   AN+ + S
Subjt:  QEPGSHVAEIVGSSRAKAQTMIDEAMQAILTLRKGESNLESIEEAIDFV-SNRLSVDDLALPIVRLTAADTSNATPISQNHFNACTSNPS--TANHVVGS

Query:  KSNGPSDKNEEDIPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAVNSLQPCCPQ
        +    S+KN + +PSELI  CVAT LMIQ CTERQ+PP+DVAQ++D+AV SLQP CPQ
Subjt:  KSNGPSDKNEEDIPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAVNSLQPCCPQ

AT3G21430.2 DNA binding | 1.3e-67 | 51.66
Query:  VDPGCPNLQAKFGLSETVSIQQEASSQPSALAQIQAKEADVHALSELSRALDKKEVVVSELKRLNDEVLENQTNG-DNLLKDSENFKKQYAAVLLQFNEV
        VD    N QA+ G+ E +++Q   +SQPS++ QIQA+EADV ALSEL+RALDKKE+V+ ELK +NDEV+E+Q +G +N LKDSE+FKKQYAAVL Q +E+
Subjt:  VDPGCPNLQAKFGLSETVSIQQEASSQPSALAQIQAKEADVHALSELSRALDKKEVVVSELKRLNDEVLENQTNG-DNLLKDSENFKKQYAAVLLQFNEV

Query:  NEQVSSALFCLRQRNTYQGTSPLMFLKPVHDLGDSCCH--------AQEPGSHVAEIVGSSRAKAQTMIDEAMQAILTLRKGESNLESIEEAIDFVSNRL
        NEQVS AL  LRQRNTYQ   P   ++ +   G+            +   G HV+EIV SSR KA+ M+  A+QA+  LRK E+N  ++EEAIDFV+N+L
Subjt:  NEQVSSALFCLRQRNTYQGTSPLMFLKPVHDLGDSCCH--------AQEPGSHVAEIVGSSRAKAQTMIDEAMQAILTLRKGESNLESIEEAIDFVSNRL

Query:  SVDDLALPIVRLTAADTSNATPISQNHFNACTSNPSTANHVVGSKSNGPSDKNEEDIPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAVNSLQPCC
        S+D         T   +   T   Q+     T NP ++     S  N P D+N+  +PS+L++ C+ATLLMIQKCTERQFPPS+VAQVLDSAV SLQPCC
Subjt:  SVDDLALPIVRLTAADTSNATPISQNHFNACTSNPSTANHVVGSKSNGPSDKNEEDIPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAVNSLQPCC

Query:  PQ
         Q
Subjt:  PQ
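
Each alignment above has a middle line that repeats the residue where query and subject are identical, shows `+` for a conservative substitution, and leaves a space for a mismatch or gap. A rough Python sketch of how a percent identity could be estimated from such a block follows. The function name `midline_identity` is hypothetical, and the calculation (identical columns divided by alignment length) is the usual BLAST-style definition, assumed here rather than confirmed for this database.

```python
def midline_identity(query, midline):
    """Estimate percent identity from one alignment segment.

    Letters on the midline mark identical columns; '+' and spaces do not.
    The alignment length is taken as the length of the query row, so
    gap columns ('-') count towards the denominator.
    """
    if not query:
        raise ValueError("empty alignment segment")
    # Pad the midline in case trailing spaces were stripped from the text.
    midline = midline.ljust(len(query))
    identical = sum(1 for ch in midline[: len(query)] if ch.isalpha())
    return 100.0 * identical / len(query)


# First ten columns of the first GenBank alignment on this page.
print(midline_identity("VDPGCPNLQA", "VD GC NLQA"))  # 80.0
```

For a full hit, the segments of all three alignment rows would be concatenated before calling the function, since the reported value covers the whole alignment.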


Sequences
CDS sequence
GTTGACCCGGGGTGCCCTAATCTACAAGCTAAATTTGGTCTTAGTGAGACTGTCAGTATCCAACAGGAGGCAAGTTCCCAACCTTCTGCTCTTGCTCAAATTCAAGCAAA
AGAAGCTGATGTTCATGCTCTTTCTGAATTGTCACGTGCACTTGACAAGAAGGAGGTGGTGGTGTCTGAATTGAAGCGCTTGAATGATGAGGTGTTAGAAAACCAAACAA
ATGGAGACAACTTGCTCAAGGATTCAGAGAACTTTAAGAAGCAATATGCTGCTGTGCTATTACAGTTTAATGAAGTGAATGAACAGGTTTCATCTGCTCTGTTTTGCTTG
AGGCAGCGTAATACATATCAAGGGACTTCACCGTTAATGTTCCTCAAGCCAGTGCATGATTTAGGTGACTCTTGCTGTCATGCTCAAGAACCTGGTTCCCATGTGGCTGA
AATTGTGGGAAGTTCCAGAGCAAAGGCTCAGACAATGATCGATGAAGCTATGCAGGCAATTCTCACACTGAGAAAAGGAGAAAGTAATTTAGAGAGTATTGAAGAAGCCA
TTGATTTTGTTAGTAATAGACTTTCAGTGGATGATTTGGCCTTGCCAATTGTGAGATTAACAGCTGCAGATACAAGTAATGCCACTCCAATATCTCAGAATCATTTCAAT
GCGTGCACATCAAACCCATCGACTGCTAATCATGTAGTTGGTTCCAAGTCCAATGGTCCATCCGACAAGAATGAAGAGGATATCCCTTCGGAACTTATTGCACACTGTGT
AGCTACTTTACTCATGATTCAGAAATGCACAGAACGACAGTTTCCGCCATCTGATGTTGCTCAGGTACTGGATTCTGCTGTCAATAGTTTGCAGCCTTGTTGTCCTCAAA
CCTTCCACTATATGCAGAGATACAGAAATGCATGGGAATCATAA
mRNA sequence
GTTGACCCGGGGTGCCCTAATCTACAAGCTAAATTTGGTCTTAGTGAGACTGTCAGTATCCAACAGGAGGCAAGTTCCCAACCTTCTGCTCTTGCTCAAATTCAAGCAAA
AGAAGCTGATGTTCATGCTCTTTCTGAATTGTCACGTGCACTTGACAAGAAGGAGGTGGTGGTGTCTGAATTGAAGCGCTTGAATGATGAGGTGTTAGAAAACCAAACAA
ATGGAGACAACTTGCTCAAGGATTCAGAGAACTTTAAGAAGCAATATGCTGCTGTGCTATTACAGTTTAATGAAGTGAATGAACAGGTTTCATCTGCTCTGTTTTGCTTG
AGGCAGCGTAATACATATCAAGGGACTTCACCGTTAATGTTCCTCAAGCCAGTGCATGATTTAGGTGACTCTTGCTGTCATGCTCAAGAACCTGGTTCCCATGTGGCTGA
AATTGTGGGAAGTTCCAGAGCAAAGGCTCAGACAATGATCGATGAAGCTATGCAGGCAATTCTCACACTGAGAAAAGGAGAAAGTAATTTAGAGAGTATTGAAGAAGCCA
TTGATTTTGTTAGTAATAGACTTTCAGTGGATGATTTGGCCTTGCCAATTGTGAGATTAACAGCTGCAGATACAAGTAATGCCACTCCAATATCTCAGAATCATTTCAAT
GCGTGCACATCAAACCCATCGACTGCTAATCATGTAGTTGGTTCCAAGTCCAATGGTCCATCCGACAAGAATGAAGAGGATATCCCTTCGGAACTTATTGCACACTGTGT
AGCTACTTTACTCATGATTCAGAAATGCACAGAACGACAGTTTCCGCCATCTGATGTTGCTCAGGTACTGGATTCTGCTGTCAATAGTTTGCAGCCTTGTTGTCCTCAAA
CCTTCCACTATATGCAGAGATACAGAAATGCATGGGAATCATAA
Protein sequence
VDPGCPNLQAKFGLSETVSIQQEASSQPSALAQIQAKEADVHALSELSRALDKKEVVVSELKRLNDEVLENQTNGDNLLKDSENFKKQYAAVLLQFNEVNEQVSSALFCL
RQRNTYQGTSPLMFLKPVHDLGDSCCHAQEPGSHVAEIVGSSRAKAQTMIDEAMQAILTLRKGESNLESIEEAIDFVSNRLSVDDLALPIVRLTAADTSNATPISQNHFN
ACTSNPSTANHVVGSKSNGPSDKNEEDIPSELIAHCVATLLMIQKCTERQFPPSDVAQVLDSAVNSLQPCCPQTFHYMQRYRNAWES
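
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal Python sketch that assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the names `CODON_TABLE` and `translate` are illustrative and not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (assumed), laid out in NCBI order with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are removed first; codons containing
    characters other than T, C, A, G are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the CDS listed above.
print(translate("GTTGACCCGGGGTGCCCTAATCTACAAGCT"))  # VDPGCPNLQA
```

The translated prefix matches the first ten residues of the protein sequence; passing the full CDS (with its line breaks) would allow the same comparison over the whole record.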