; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh01G010540 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh01G010540
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionSequence-specific DNA binding transcription factors
Genome locationCmo_Chr01:8375871..8376908
RNA-Seq ExpressionCmoCh01G010540
SyntenyCmoCh01G010540
Gene Ontology termsNA
InterPro domainsIPR044822 - Myb/SANT-like DNA-binding domain 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607679.1 hypothetical protein SDJN03_01021, partial [Cucurbita argyrosperma subsp. sororia]1.6e-18896.52Show/hide
Query:  MIPGGVSYGGGDWQGSMGVPYQHPQYHDSSRYQSPVRGSFPLTMGSLQQRDHWIGLGDYDNRERGNDMTSDEDEEGIYVHNDARNEKNGSQWQRVKWTNK
        MIPGGVSYGGGDWQGSMGVPYQ PQYHDSSRY+S VRGSFPLTMGSLQQRDHWIGLGDYDNRERGNDMTSDEDEEGIYVHNDARNEKNGSQWQRVKWTNK
Subjt:  MIPGGVSYGGGDWQGSMGVPYQHPQYHDSSRYQSPVRGSFPLTMGSLQQRDHWIGLGDYDNRERGNDMTSDEDEEGIYVHNDARNEKNGSQWQRVKWTNK

Query:  MVKLLITVLSYMGEDSGSGCGSLGTRRLVVLQKKGKWKSVSKVMGERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSCEAVENPAVLDEIDYLKAKEK
        MVKLLITVLSYMG+DSGSGCGSLG RRLVVLQKKGKWKSVSKVMGERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSCE VENPA+LDEIDYLKAKEK
Subjt:  MVKLLITVLSYMGEDSGSGCGSLGTRRLVVLQKKGKWKSVSKVMGERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSCEAVENPAVLDEIDYLKAKEK

Query:  DDVRKILNSKHLFYEEMCSYHNGNRLHLPHDPELLHSLQLVLGNRIDSSRVARFVTNQVVAETDRICQLQQLRIESHCIELEEKRVRIEIGRLELENERM
        DDVRKILNSKHLFYEEMCSYHNGNRLHLPHDPELLHSLQLV GNRIDSSRVARFVTN+V AETDRICQLQQL+IESHCIELEEKRVRIEIGRLELE ERM
Subjt:  DDVRKILNSKHLFYEEMCSYHNGNRLHLPHDPELLHSLQLVLGNRIDSSRVARFVTNQVVAETDRICQLQQLRIESHCIELEEKRVRIEIGRLELENERM

Query:  KWERFKKKRERELEKLKLENERMRVENELIASQLKKRRCSHLHRI
        KWERFKKKRERELEKLKLENERMRVENELIASQLKKRRCSHLHRI
Subjt:  KWERFKKKRERELEKLKLENERMRVENELIASQLKKRRCSHLHRI

XP_022153723.1 uncharacterized protein LOC111021176 [Momordica charantia]4.3e-10955.5Show/hide
Query:  MIPGGVSYGGGDWQGSMGVPYQ-----------HPQYHDSSRYQSPVRGSFPLTMGSLQQRDHWIGLGDYDNRERGNDMTSDEDEEGIYVHNDARNEKNG
        MIPGGVSYGG D QGSMGV +Q           H + H  S  +S VRGS PLTMGS Q  +H   L D    +  N + SDEDE            KN 
Subjt:  MIPGGVSYGGGDWQGSMGVPYQ-----------HPQYHDSSRYQSPVRGSFPLTMGSLQQRDHWIGLGDYDNRERGNDMTSDEDEEGIYVHNDARNEKNG

Query:  SQWQRVKWTNKMVKLLITVLSYMGEDSGSGCGSLGTRRLVVLQKKGKWKSVSKVMGERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSCEAVENPAVL
          WQRVKWT+KMVKLLITV+SYMG+DS SGCG LG RRL VLQKKGKWKSVSK+M ERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSC+ V+NPA+L
Subjt:  SQWQRVKWTNKMVKLLITVLSYMGEDSGSGCGSLGTRRLVVLQKKGKWKSVSKVMGERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSCEAVENPAVL

Query:  DEIDYLKAKEKDDVRKILNSKHLFYEEMCSYHNGNRLHLPHDPELLHSLQLVLGNRID------------------------------------------
        D IDYL AKEKD+VRKIL+SKHLFY+EMCSYHNGNRLHLPHDPELLHSLQLVL NRID                                          
Subjt:  DEIDYLKAKEKDDVRKILNSKHLFYEEMCSYHNGNRLHLPHDPELLHSLQLVLGNRID------------------------------------------

Query:  -------------------------------------SSRVARFVTNQVVAETDRICQLQQLRIESHCIELEEKRVRIEIGRLELENERMKWERFKKKRE
                                              +   R  TN V+ ET RI QLQ+L IES  ++LEEKR++I+  RLELE +R KWERF KKR+
Subjt:  -------------------------------------SSRVARFVTNQVVAETDRICQLQQLRIESHCIELEEKRVRIEIGRLELENERMKWERFKKKRE

Query:  RELEKLKLENERMRVENELIASQLKKR
        RELEKLKLENERM++ENE +ASQLK++
Subjt:  RELEKLKLENERMRVENELIASQLKKR

XP_022926195.1 uncharacterized protein LOC111433381 [Cucurbita moschata]7.9e-196100Show/hide
Query:  MIPGGVSYGGGDWQGSMGVPYQHPQYHDSSRYQSPVRGSFPLTMGSLQQRDHWIGLGDYDNRERGNDMTSDEDEEGIYVHNDARNEKNGSQWQRVKWTNK
        MIPGGVSYGGGDWQGSMGVPYQHPQYHDSSRYQSPVRGSFPLTMGSLQQRDHWIGLGDYDNRERGNDMTSDEDEEGIYVHNDARNEKNGSQWQRVKWTNK
Subjt:  MIPGGVSYGGGDWQGSMGVPYQHPQYHDSSRYQSPVRGSFPLTMGSLQQRDHWIGLGDYDNRERGNDMTSDEDEEGIYVHNDARNEKNGSQWQRVKWTNK

Query:  MVKLLITVLSYMGEDSGSGCGSLGTRRLVVLQKKGKWKSVSKVMGERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSCEAVENPAVLDEIDYLKAKEK
        MVKLLITVLSYMGEDSGSGCGSLGTRRLVVLQKKGKWKSVSKVMGERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSCEAVENPAVLDEIDYLKAKEK
Subjt:  MVKLLITVLSYMGEDSGSGCGSLGTRRLVVLQKKGKWKSVSKVMGERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSCEAVENPAVLDEIDYLKAKEK

Query:  DDVRKILNSKHLFYEEMCSYHNGNRLHLPHDPELLHSLQLVLGNRIDSSRVARFVTNQVVAETDRICQLQQLRIESHCIELEEKRVRIEIGRLELENERM
        DDVRKILNSKHLFYEEMCSYHNGNRLHLPHDPELLHSLQLVLGNRIDSSRVARFVTNQVVAETDRICQLQQLRIESHCIELEEKRVRIEIGRLELENERM
Subjt:  DDVRKILNSKHLFYEEMCSYHNGNRLHLPHDPELLHSLQLVLGNRIDSSRVARFVTNQVVAETDRICQLQQLRIESHCIELEEKRVRIEIGRLELENERM

Query:  KWERFKKKRERELEKLKLENERMRVENELIASQLKKRRCSHLHRI
        KWERFKKKRERELEKLKLENERMRVENELIASQLKKRRCSHLHRI
Subjt:  KWERFKKKRERELEKLKLENERMRVENELIASQLKKRRCSHLHRI

XP_022981510.1 uncharacterized protein LOC111480603 [Cucurbita maxima]2.6e-17586.4Show/hide
Query:  MIPGGVSYGGGDWQGSMGVPYQHP--------QYHDSSRYQ----SPVRGSFPLTMGSLQQRDHWIGLGDYDNRERGNDMTSDEDEEGIYVHNDARNEKN
        MIP GVSYGGGDWQGSMGVP Q P        QYHDSSRYQ    S VRGSFPLTMGSLQQRDHWIGL D+DNRE GNDMTSDEDEEGIYVHNDARNEKN
Subjt:  MIPGGVSYGGGDWQGSMGVPYQHP--------QYHDSSRYQ----SPVRGSFPLTMGSLQQRDHWIGLGDYDNRERGNDMTSDEDEEGIYVHNDARNEKN

Query:  GSQWQRVKWTNKMVKLLITVLSYMGEDSGSGCGSLGTRRLVVLQKKGKWKSVSKVMGERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSCEAVENPAV
        GSQWQRVKWTNKMVKLLITVLSYMGEDSGSGCGSLGTRRLVVLQKKGKWKSVSKVMGERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSCE VENP +
Subjt:  GSQWQRVKWTNKMVKLLITVLSYMGEDSGSGCGSLGTRRLVVLQKKGKWKSVSKVMGERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSCEAVENPAV

Query:  LDEIDYLKAKEKDDVRKILNSKHLFYEEMCSYHNGNRLHLPHDPELLHSLQLVLGNRID------------------SSRVARFVTNQVVAETDRICQLQ
        LD IDYLKAKEKDDVRKILNSKHLFYEEMCSYHNGNRLHLPHDPELLHSLQLVLGNRID                  SSRVARFVTNQVVAETDRIC++Q
Subjt:  LDEIDYLKAKEKDDVRKILNSKHLFYEEMCSYHNGNRLHLPHDPELLHSLQLVLGNRID------------------SSRVARFVTNQVVAETDRICQLQ

Query:  QLRIESHCIELEEKRVRIEIGRLELENERMKWERFKKKRERELEKLKLENERMRVENELIASQLKKRRCSHLHRI
        Q+RIESHCIELEEKRVRIEI RLELE++RMKWERFKKKR+ ELEKLKLENERMR+EN LIASQLKKRRCSHLHRI
Subjt:  QLRIESHCIELEEKRVRIEIGRLELENERMKWERFKKKRERELEKLKLENERMRVENELIASQLKKRRCSHLHRI

XP_031263593.1 uncharacterized protein LOC116121795 [Pistacia vera]3.1e-9949.89Show/hide
Query:  MIPGGVSYGGGDWQGSMGVPYQ------------HPQYHDSSRYQSPVRGSFPLTMGSLQQRDHWIGLGDYDNRERG-NDMTSDED----EEGIYVHNDA
        MIPGG S+GG D QGSM V +Q            HP  H  S  +  ++  FPLT+G++Q  D  I + DY+  ERG N ++ DED    E+G   HNDA
Subjt:  MIPGGVSYGGGDWQGSMGVPYQ------------HPQYHDSSRYQSPVRGSFPLTMGSLQQRDHWIGLGDYDNRERG-NDMTSDED----EEGIYVHNDA

Query:  RNEKNGSQWQRVKWTNKMVKLLITVLSYMGEDSGSGCGSLGTRRLVVLQKKGKWKSVSKVMGERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSCEAV
           K GS WQRVKWT+KMV+LLIT +SY+GED+G  CG    R+  VLQKKGKWKSVSKVM ERG++VSPQQCEDKFNDLNKRYKRLNDMLGRGTSC+ V
Subjt:  RNEKNGSQWQRVKWTNKMVKLLITVLSYMGEDSGSGCGSLGTRRLVVLQKKGKWKSVSKVMGERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSCEAV

Query:  ENPAVLDEIDYLKAKEKDDVRKILNSKHLFYEEMCSYHNGNRLHLPHDPELLHSLQLVLGNRIDSS----------------------------------
        ENP +LD IDYL  KEKDDVRKILNSKHLFYEEMCSYHNGNRLHLPHDP L  SLQL L NR D                                    
Subjt:  ENPAVLDEIDYLKAKEKDDVRKILNSKHLFYEEMCSYHNGNRLHLPHDPELLHSLQLVLGNRIDSS----------------------------------

Query:  --------------------------------------------RVARFVTNQVVAETDRICQLQQLRIESHCIELEEKRVRIEIGRLELENERMKWERF
                                                    +VA+   NQV+ E  R   LQ+  IES  ++LEE++++I++  LELE +R KW+RF
Subjt:  --------------------------------------------RVARFVTNQVVAETDRICQLQQLRIESHCIELEEKRVRIEIGRLELENERMKWERF

Query:  KKKRERELEKLKLENERMRVENELIASQLKKRRCS
         KKR+RELEKLK+ENERM++ENE +A +LK++  S
Subjt:  KKKRERELEKLKLENERMRVENELIASQLKKRRCS

TrEMBL top hitse value%identityAlignment
A0A1R3FYZ5 Putative transcription factor8.2e-9848.15Show/hide
Query:  MIPGGVSYGGGDWQGSMGV-----------PYQHPQYHDSSRYQSPVRGSFPLTMGSLQQRDHWIGLGDYDNRERGNDMTSDED-----EEGIYVHNDAR
        MIPGG SYGG D QGSM V            +QHP +   +     +   FPLTMG++Q  D  I + DY+  ERG    SDED     E+G+  HND  
Subjt:  MIPGGVSYGGGDWQGSMGV-----------PYQHPQYHDSSRYQSPVRGSFPLTMGSLQQRDHWIGLGDYDNRERGNDMTSDED-----EEGIYVHNDAR

Query:  NEKNGSQWQRVKWTNKMVKLLITVLSYMGEDSGSGCGSLGTRRLVVLQKKGKWKSVSKVMGERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSCEAVE
          K GS WQRVKWT+KMV+LLIT +SY+GED    CG    R+  VLQKKGKWKSVSKVM ERGYHVSPQQCEDKFNDLNKRYK+LNDMLGRGTSC+ VE
Subjt:  NEKNGSQWQRVKWTNKMVKLLITVLSYMGEDSGSGCGSLGTRRLVVLQKKGKWKSVSKVMGERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSCEAVE

Query:  NPAVLDEIDYLKAKEKDDVRKILNSKHLFYEEMCSYHNGNRLHLPHDPELLHSLQLVLGNRIDSSR----------------------------------
        NP +LD IDYL  KEKDDVRKIL+SKHLFYEEMCSYHNGNRLHLPHDP+L  SLQL L +R D                                     
Subjt:  NPAVLDEIDYLKAKEKDDVRKILNSKHLFYEEMCSYHNGNRLHLPHDPELLHSLQLVLGNRIDSSR----------------------------------

Query:  ---------------------------------------------VARFVTNQVVAETDRICQLQQLRIESHCIELEEKRVRIEIGRLELENERMKWERF
                                                     + +   NQV  E+ R   LQ+  +ES  ++LEE++++I++  LELE +R KW+RF
Subjt:  ---------------------------------------------VARFVTNQVVAETDRICQLQQLRIESHCIELEEKRVRIEIGRLELENERMKWERF

Query:  KKKRERELEKLKLENERMRVENELIASQLKKR
         KKR+RELEK+++ENERM++ENE +A +LK++
Subjt:  KKKRERELEKLKLENERMRVENELIASQLKKR

A0A5N5NLL2 Uncharacterized protein1.8e-9749.53Show/hide
Query:  MIPGGVSYGGGDWQGSMGVPYQHPQYHDSSRYQSPVR----------GSFPLTMGSLQQRDHWIGLGDYDNRERGNDMTSDED-----EEGIYVHNDARN
        MIP   S+GG D QGSM VP+Q P  H    +Q P+R            FPLTMG +   D  I + DY+ R+RG +  SDED     EEG   HNDA  
Subjt:  MIPGGVSYGGGDWQGSMGVPYQHPQYHDSSRYQSPVR----------GSFPLTMGSLQQRDHWIGLGDYDNRERGNDMTSDED-----EEGIYVHNDARN

Query:  EKNGSQWQRVKWTNKMVKLLITVLSYMGEDSGSGCGSLGTRRLVVLQKKGKWKSVSKVMGERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSCEAVEN
         K G+ WQRVKWT+KMV+LLIT +SY+GED  S CG    R+  VLQKKGKWKSVSKVM ERG+HVSPQQCEDKFNDLNKRYKRLNDMLGRGTSC+ VEN
Subjt:  EKNGSQWQRVKWTNKMVKLLITVLSYMGEDSGSGCGSLGTRRLVVLQKKGKWKSVSKVMGERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSCEAVEN

Query:  PAVLDEIDYLKAKEKDDVRKILNSKHLFYEEMCSYHNGNRLHLPHDPELLHSLQLVLGNRIDSS------------------------------------
        PA+LD IDYL  KEKDDVRKILNSKHLFYEEMCSYHNGNRLHLPHDP L  SLQL L +R D                                      
Subjt:  PAVLDEIDYLKAKEKDDVRKILNSKHLFYEEMCSYHNGNRLHLPHDPELLHSLQLVLGNRIDSS------------------------------------

Query:  ------------------------------------------RVARFVTNQVVAETDRICQLQQLRIESHCIELEEKRVRIEIGRLELENERMKWERFKK
                                                  + A+   NQV +E+ R  +LQ+  +ES  ++LEE++++I+   LELE +R KW+RF K
Subjt:  ------------------------------------------RVARFVTNQVVAETDRICQLQQLRIESHCIELEEKRVRIEIGRLELENERMKWERFKK

Query:  KRERELEKLKLENERMRVENELIASQLKKR
        KR+RELEK ++ENERM++ENE +A +LK++
Subjt:  KRERELEKLKLENERMRVENELIASQLKKR

A0A6J1DI82 uncharacterized protein LOC1110211762.1e-10955.5Show/hide
Query:  MIPGGVSYGGGDWQGSMGVPYQ-----------HPQYHDSSRYQSPVRGSFPLTMGSLQQRDHWIGLGDYDNRERGNDMTSDEDEEGIYVHNDARNEKNG
        MIPGGVSYGG D QGSMGV +Q           H + H  S  +S VRGS PLTMGS Q  +H   L D    +  N + SDEDE            KN 
Subjt:  MIPGGVSYGGGDWQGSMGVPYQ-----------HPQYHDSSRYQSPVRGSFPLTMGSLQQRDHWIGLGDYDNRERGNDMTSDEDEEGIYVHNDARNEKNG

Query:  SQWQRVKWTNKMVKLLITVLSYMGEDSGSGCGSLGTRRLVVLQKKGKWKSVSKVMGERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSCEAVENPAVL
          WQRVKWT+KMVKLLITV+SYMG+DS SGCG LG RRL VLQKKGKWKSVSK+M ERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSC+ V+NPA+L
Subjt:  SQWQRVKWTNKMVKLLITVLSYMGEDSGSGCGSLGTRRLVVLQKKGKWKSVSKVMGERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSCEAVENPAVL

Query:  DEIDYLKAKEKDDVRKILNSKHLFYEEMCSYHNGNRLHLPHDPELLHSLQLVLGNRID------------------------------------------
        D IDYL AKEKD+VRKIL+SKHLFY+EMCSYHNGNRLHLPHDPELLHSLQLVL NRID                                          
Subjt:  DEIDYLKAKEKDDVRKILNSKHLFYEEMCSYHNGNRLHLPHDPELLHSLQLVLGNRID------------------------------------------

Query:  -------------------------------------SSRVARFVTNQVVAETDRICQLQQLRIESHCIELEEKRVRIEIGRLELENERMKWERFKKKRE
                                              +   R  TN V+ ET RI QLQ+L IES  ++LEEKR++I+  RLELE +R KWERF KKR+
Subjt:  -------------------------------------SSRVARFVTNQVVAETDRICQLQQLRIESHCIELEEKRVRIEIGRLELENERMKWERFKKKRE

Query:  RELEKLKLENERMRVENELIASQLKKR
        RELEKLKLENERM++ENE +ASQLK++
Subjt:  RELEKLKLENERMRVENELIASQLKKR

A0A6J1EE70 uncharacterized protein LOC1114333813.8e-196100Show/hide
Query:  MIPGGVSYGGGDWQGSMGVPYQHPQYHDSSRYQSPVRGSFPLTMGSLQQRDHWIGLGDYDNRERGNDMTSDEDEEGIYVHNDARNEKNGSQWQRVKWTNK
        MIPGGVSYGGGDWQGSMGVPYQHPQYHDSSRYQSPVRGSFPLTMGSLQQRDHWIGLGDYDNRERGNDMTSDEDEEGIYVHNDARNEKNGSQWQRVKWTNK
Subjt:  MIPGGVSYGGGDWQGSMGVPYQHPQYHDSSRYQSPVRGSFPLTMGSLQQRDHWIGLGDYDNRERGNDMTSDEDEEGIYVHNDARNEKNGSQWQRVKWTNK

Query:  MVKLLITVLSYMGEDSGSGCGSLGTRRLVVLQKKGKWKSVSKVMGERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSCEAVENPAVLDEIDYLKAKEK
        MVKLLITVLSYMGEDSGSGCGSLGTRRLVVLQKKGKWKSVSKVMGERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSCEAVENPAVLDEIDYLKAKEK
Subjt:  MVKLLITVLSYMGEDSGSGCGSLGTRRLVVLQKKGKWKSVSKVMGERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSCEAVENPAVLDEIDYLKAKEK

Query:  DDVRKILNSKHLFYEEMCSYHNGNRLHLPHDPELLHSLQLVLGNRIDSSRVARFVTNQVVAETDRICQLQQLRIESHCIELEEKRVRIEIGRLELENERM
        DDVRKILNSKHLFYEEMCSYHNGNRLHLPHDPELLHSLQLVLGNRIDSSRVARFVTNQVVAETDRICQLQQLRIESHCIELEEKRVRIEIGRLELENERM
Subjt:  DDVRKILNSKHLFYEEMCSYHNGNRLHLPHDPELLHSLQLVLGNRIDSSRVARFVTNQVVAETDRICQLQQLRIESHCIELEEKRVRIEIGRLELENERM

Query:  KWERFKKKRERELEKLKLENERMRVENELIASQLKKRRCSHLHRI
        KWERFKKKRERELEKLKLENERMRVENELIASQLKKRRCSHLHRI
Subjt:  KWERFKKKRERELEKLKLENERMRVENELIASQLKKRRCSHLHRI

A0A6J1IU60 uncharacterized protein LOC1114806031.3e-17586.4Show/hide
Query:  MIPGGVSYGGGDWQGSMGVPYQHP--------QYHDSSRYQ----SPVRGSFPLTMGSLQQRDHWIGLGDYDNRERGNDMTSDEDEEGIYVHNDARNEKN
        MIP GVSYGGGDWQGSMGVP Q P        QYHDSSRYQ    S VRGSFPLTMGSLQQRDHWIGL D+DNRE GNDMTSDEDEEGIYVHNDARNEKN
Subjt:  MIPGGVSYGGGDWQGSMGVPYQHP--------QYHDSSRYQ----SPVRGSFPLTMGSLQQRDHWIGLGDYDNRERGNDMTSDEDEEGIYVHNDARNEKN

Query:  GSQWQRVKWTNKMVKLLITVLSYMGEDSGSGCGSLGTRRLVVLQKKGKWKSVSKVMGERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSCEAVENPAV
        GSQWQRVKWTNKMVKLLITVLSYMGEDSGSGCGSLGTRRLVVLQKKGKWKSVSKVMGERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSCE VENP +
Subjt:  GSQWQRVKWTNKMVKLLITVLSYMGEDSGSGCGSLGTRRLVVLQKKGKWKSVSKVMGERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSCEAVENPAV

Query:  LDEIDYLKAKEKDDVRKILNSKHLFYEEMCSYHNGNRLHLPHDPELLHSLQLVLGNRID------------------SSRVARFVTNQVVAETDRICQLQ
        LD IDYLKAKEKDDVRKILNSKHLFYEEMCSYHNGNRLHLPHDPELLHSLQLVLGNRID                  SSRVARFVTNQVVAETDRIC++Q
Subjt:  LDEIDYLKAKEKDDVRKILNSKHLFYEEMCSYHNGNRLHLPHDPELLHSLQLVLGNRID------------------SSRVARFVTNQVVAETDRICQLQ

Query:  QLRIESHCIELEEKRVRIEIGRLELENERMKWERFKKKRERELEKLKLENERMRVENELIASQLKKRRCSHLHRI
        Q+RIESHCIELEEKRVRIEI RLELE++RMKWERFKKKR+ ELEKLKLENERMR+EN LIASQLKKRRCSHLHRI
Subjt:  QLRIESHCIELEEKRVRIEIGRLELENERMKWERFKKKRERELEKLKLENERMRVENELIASQLKKRRCSHLHRI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21200.1 sequence-specific DNA binding transcription factors2.1e-8544.84Show/hide
Query:  GGVSYGGGDWQGSMGVPYQHP--QYHDSSRYQSPVRGSFPLTMGSLQQRDHW----IGLGDYDNRERGNDMTSDEDEE------GIYVHNDARNEKNGSQ
        G  SYGG D QGSM V +Q    Q H  +    P+    P TM + Q  DH     + + +    ER  +  SD+DE       G  VHN+A     GS 
Subjt:  GGVSYGGGDWQGSMGVPYQHP--QYHDSSRYQSPVRGSFPLTMGSLQQRDHW----IGLGDYDNRERGNDMTSDEDEE------GIYVHNDARNEKNGSQ

Query:  WQRVKWTNKMVKLLITVLSYMGEDSGSGCGSLGTRRLVVLQKKGKWKSVSKVMGERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSCEAVENPAVLDE
        WQRVKWT+KMVKLLIT +SY+G+D  S   S   R+  VLQKKGKWKSVSKVM ERGYHVSPQQCEDKFNDLNKRYK+LNDMLGRGTSC+ VENPA+LD 
Subjt:  WQRVKWTNKMVKLLITVLSYMGEDSGSGCGSLGTRRLVVLQKKGKWKSVSKVMGERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSCEAVENPAVLDE

Query:  IDYLKAKEKDDVRKILNSKHLFYEEMCSYHNGNRLHLPHDPELLHSLQLVLGNRIDSSR-----------------------------------------
        I YL  KEKDDVRKI++SKHLFYEEMCSYHNGNRLHLPHD  L  SLQL L +R D                                            
Subjt:  IDYLKAKEKDDVRKILNSKHLFYEEMCSYHNGNRLHLPHDPELLHSLQLVLGNRIDSSR-----------------------------------------

Query:  ---------------------------------------VARFVTNQVVAETDRICQLQQLRIESHCIELEEKRVRIEIGRLELENERMKWERFKKKRER
                                                ++   NQ  AE+ R   +Q+  +ES  ++LEE++++I++  LELE +R +W+RF KKR++
Subjt:  ---------------------------------------VARFVTNQVVAETDRICQLQQLRIESHCIELEEKRVRIEIGRLELENERMKWERFKKKRER

Query:  ELEKLKLENERMRVENELIASQLKKR
        ELE++++ENERM++EN+ +  +LK+R
Subjt:  ELEKLKLENERMRVENELIASQLKKR

AT1G76870.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors (TAIR:AT1G21200.1)2.1e-7747.46Show/hide
Query:  YDNRERGNDMTSDEDEEGIYVHNDARNEKNGSQWQRVKWTNKMVKLLITVLSYMGEDSGSGCGSLGTRRLVVLQKKGKWKSVSKVMGERGYHVSPQQCED
        ++N +RG    S++DE  +   +     K  S WQRVKW +KMVKL+IT LSY+GEDSGS       ++  VLQKKGKW+SVSKVM ERGYHVSPQQCED
Subjt:  YDNRERGNDMTSDEDEEGIYVHNDARNEKNGSQWQRVKWTNKMVKLLITVLSYMGEDSGSGCGSLGTRRLVVLQKKGKWKSVSKVMGERGYHVSPQQCED

Query:  KFNDLNKRYKRLNDMLGRGTSCEAVENPAVLDEIDYLKAKEKDDVRKILNSKHLFYEEMCSYHNGNRLHLPHDPELLHSLQLV-LGNRIDSS--------
        KFNDLNKRYK+LN+MLGRGTSCE VENP++LD+IDYL  KEKD+VR+I++SKHLFYEEMCSYHNGNRLHLPHDP +  SL L+ LG+R D          
Subjt:  KFNDLNKRYKRLNDMLGRGTSCEAVENPAVLDEIDYLKAKEKDDVRKILNSKHLFYEEMCSYHNGNRLHLPHDPELLHSLQLV-LGNRIDSS--------

Query:  ------------------------------------------------RVARFVTNQVVAETDRICQLQQLRIESHCIELEEKRVRIEIGRLELENERMK
                                                        R    V   +  ++ +   LQ+ +IES  +ELE ++++I+   +ELE ++ K
Subjt:  ------------------------------------------------RVARFVTNQVVAETDRICQLQQLRIESHCIELEEKRVRIEIGRLELENERMK

Query:  WERFKKKRERELEKLKLENERMRVENELIASQLKK
        WE F K+RE++L K+++ENERM++ENE ++ +LK+
Subjt:  WERFKKKRERELEKLKLENERMRVENELIASQLKK

AT3G10040.1 sequence-specific DNA binding transcription factors4.8e-5034.62Show/hide
Query:  VPYQHPQYHDSS---RYQSPVRGSFPLTMGSLQQRDHWIGLGDYDNRERGNDMTS---DEDEEGIYVHNDARNEKNGSQWQRVKWTNKMVKLLITVLSYM
        + +QHP  + +S   + Q P++  +P      Q     I  G  D+ +RG+   S    ED  G         ++  SQW R+KWT+ MV+LLI  + Y+
Subjt:  VPYQHPQYHDSS---RYQSPVRGSFPLTMGSLQQRDHWIGLGDYDNRERGNDMTS---DEDEEGIYVHNDARNEKNGSQWQRVKWTNKMVKLLITVLSYM

Query:  GEDSG----------SGCGSLGTRRLVVLQKKGKWKSVSKVMGERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSCEAVENPAVLDEIDYLKAKEKDD
        G+++G          +G G  G     +LQKKGKWKSVS+ M E+G+ VSPQQCEDKFNDLNKRYKR+ND+LG+G +C  VEN  +L+ +D+L  K KD+
Subjt:  GEDSG----------SGCGSLGTRRLVVLQKKGKWKSVSKVMGERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSCEAVENPAVLDEIDYLKAKEKDD

Query:  VRKILNSKHLFYEEMCSYHNG--------------NRLHLPHDPELLHSLQLVLGNRIDSSRVARFVTNQVVAETD---------------------RI-
        V+K+LNSKHLF+ EMC+YHN               N + +P   +  +        ++  +R+A  V  +   E+D                     RI 
Subjt:  VRKILNSKHLFYEEMCSYHNG--------------NRLHLPHDPELLHSLQLVLGNRIDSSRVARFVTNQVVAETD---------------------RI-

Query:  CQLQQLR--------------------IESHCIELEEKRVRIEIGRLELENERMKWERFKKKRERELEKLKLENERMRVENELIASQLKK
          +++LR                    I    +E+EEK++  E   +E+E +R+KW R++ K+ERE+EK KL+N+R R+E E +   L++
Subjt:  CQLQQLR--------------------IESHCIELEEKRVRIEIGRLELENERMKWERFKKKRERELEKLKLENERMRVENELIASQLKK

AT5G47660.1 Homeodomain-like superfamily protein5.2e-0428.24Show/hide
Query:  NGSQWQRVKWTNKMVKLLITVLSYMGEDSGSGCGSLGTRRLVVLQKKGKWKSVSKVMGERGYHVSPQQCEDKFNDLNKRYKRLND
        +GS     +W  + V+ LI+  S + E +G   G++             W  +S  M ERGY  S ++C++K+ ++NK Y+R+ +
Subjt:  NGSQWQRVKWTNKMVKLLITVLSYMGEDSGSGCGSLGTRRLVVLQKKGKWKSVSKVMGERGYHVSPQQCEDKFNDLNKRYKRLND


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTCCTGGGGGGGTTTCTTATGGAGGTGGTGACTGGCAAGGATCTATGGGGGTTCCTTATCAACACCCACAATATCATGATAGTAGTCGATATCAATCTCCGGTTCG
TGGCAGCTTTCCATTGACAATGGGAAGCTTGCAACAGAGGGATCATTGGATTGGACTTGGCGATTATGATAACAGGGAGAGGGGCAATGATATGACAAGCGATGAAGATG
AGGAGGGGATCTATGTTCATAATGATGCTCGTAACGAGAAGAATGGATCGCAATGGCAGCGTGTGAAGTGGACGAACAAGATGGTGAAGCTCTTGATTACTGTTTTATCC
TATATGGGAGAGGATTCTGGTTCTGGATGTGGAAGCTTAGGGACAAGGAGACTTGTAGTGTTACAAAAGAAGGGGAAGTGGAAATCGGTTTCGAAGGTAATGGGCGAGCG
AGGTTATCATGTTTCGCCCCAGCAATGTGAGGATAAATTTAATGATCTCAACAAAAGGTATAAGAGACTGAATGATATGCTCGGTAGGGGGACGTCCTGCGAAGCTGTCG
AGAACCCTGCGGTTTTGGACGAGATAGATTATTTGAAAGCAAAGGAAAAGGATGATGTTAGGAAGATTCTAAACTCCAAGCATTTGTTCTATGAGGAGATGTGTTCTTAT
CATAATGGCAATAGACTGCATTTGCCTCATGATCCAGAATTGCTTCATTCTTTACAGTTGGTTCTCGGAAATAGAATTGATAGTTCTCGAGTTGCTCGATTTGTAACGAA
TCAAGTTGTAGCTGAAACTGATAGAATTTGTCAGTTACAACAGCTACGGATCGAGTCGCATTGTATTGAGTTAGAAGAGAAACGTGTTCGAATCGAAATAGGGAGATTGG
AATTGGAGAACGAACGTATGAAGTGGGAGCGATTTAAGAAGAAACGAGAGCGGGAATTAGAAAAATTGAAGTTGGAAAATGAGAGGATGAGGGTCGAGAATGAACTCATA
GCATCACAACTGAAGAAAAGGAGATGCTCTCATCTTCACAGGATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGATTCCTGGGGGGGTTTCTTATGGAGGTGGTGACTGGCAAGGATCTATGGGGGTTCCTTATCAACACCCACAATATCATGATAGTAGTCGATATCAATCTCCGGTTCG
TGGCAGCTTTCCATTGACAATGGGAAGCTTGCAACAGAGGGATCATTGGATTGGACTTGGCGATTATGATAACAGGGAGAGGGGCAATGATATGACAAGCGATGAAGATG
AGGAGGGGATCTATGTTCATAATGATGCTCGTAACGAGAAGAATGGATCGCAATGGCAGCGTGTGAAGTGGACGAACAAGATGGTGAAGCTCTTGATTACTGTTTTATCC
TATATGGGAGAGGATTCTGGTTCTGGATGTGGAAGCTTAGGGACAAGGAGACTTGTAGTGTTACAAAAGAAGGGGAAGTGGAAATCGGTTTCGAAGGTAATGGGCGAGCG
AGGTTATCATGTTTCGCCCCAGCAATGTGAGGATAAATTTAATGATCTCAACAAAAGGTATAAGAGACTGAATGATATGCTCGGTAGGGGGACGTCCTGCGAAGCTGTCG
AGAACCCTGCGGTTTTGGACGAGATAGATTATTTGAAAGCAAAGGAAAAGGATGATGTTAGGAAGATTCTAAACTCCAAGCATTTGTTCTATGAGGAGATGTGTTCTTAT
CATAATGGCAATAGACTGCATTTGCCTCATGATCCAGAATTGCTTCATTCTTTACAGTTGGTTCTCGGAAATAGAATTGATAGTTCTCGAGTTGCTCGATTTGTAACGAA
TCAAGTTGTAGCTGAAACTGATAGAATTTGTCAGTTACAACAGCTACGGATCGAGTCGCATTGTATTGAGTTAGAAGAGAAACGTGTTCGAATCGAAATAGGGAGATTGG
AATTGGAGAACGAACGTATGAAGTGGGAGCGATTTAAGAAGAAACGAGAGCGGGAATTAGAAAAATTGAAGTTGGAAAATGAGAGGATGAGGGTCGAGAATGAACTCATA
GCATCACAACTGAAGAAAAGGAGATGCTCTCATCTTCACAGGATTTGA
Protein sequenceShow/hide protein sequence
MIPGGVSYGGGDWQGSMGVPYQHPQYHDSSRYQSPVRGSFPLTMGSLQQRDHWIGLGDYDNRERGNDMTSDEDEEGIYVHNDARNEKNGSQWQRVKWTNKMVKLLITVLS
YMGEDSGSGCGSLGTRRLVVLQKKGKWKSVSKVMGERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSCEAVENPAVLDEIDYLKAKEKDDVRKILNSKHLFYEEMCSY
HNGNRLHLPHDPELLHSLQLVLGNRIDSSRVARFVTNQVVAETDRICQLQQLRIESHCIELEEKRVRIEIGRLELENERMKWERFKKKRERELEKLKLENERMRVENELI
ASQLKKRRCSHLHRI