CuGenDBv2

MS020863 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS020863
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Sequence-specific DNA binding transcription factors
Genome location: scaffold382:61970..63295
RNA-Seq Expression: MS020863
Synteny: MS020863
Gene Ontology terms: NA
InterPro domains: IPR044822 - Myb/SANT-like DNA-binding domain 4
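
The genome location above uses a `seqid:start..end` notation. As a minimal sketch of how such a string could be split into its parts, the Python below parses it and reports the span length. The helper name `parse_location` is hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int


def parse_location(text: str) -> Location:
    """Parse a string such as 'scaffold382:61970..63295'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


def span_length(loc: Location) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("scaffold382:61970..63295")
print(loc.seqid, loc.start, loc.end, span_length(loc))
```

Under the inclusive-coordinate assumption the span is 1326 bp long.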


Homology
GenBank top hits | e value | % identity | Alignment
EOY16705.1 Sequence-specific DNA binding transcription factors [Theobroma cacao] | 2.6e-151 | 63.72
Query:  MESNLTKRGMIPGGVSYGGVDLQGSMGVHHQGQYSLNLCQENHNRSHQRSSVRSLVRGSSPLTMGSSQHCNHRARLVDCGKGD-IKNSVSDEDEPSFSEE
        ME NL++ GMI GG S+GG+D+QGSM VHH  Q+  N+ Q +H+   Q +S+   +    PLTMG+ Q+C+    + D  KG+  K+SVSDEDEPSF+EE
Subjt:  MESNLTKRGMIPGGVSYGGVDLQGSMGVHHQGQYSLNLCQENHNRSHQRSSVRSLVRGSSPLTMGSSQHCNHRARLVDCGKGD-IKNSVSDEDEPSFSEE

Query:  GIDG-----AGKNALPWQRVKWTDKMVKLLITVISYMGDDSASGCGGLGRRRLAVLQKKGKWKSVSKIMAERGYHVSPQQCEDKFNDLNKRYKRLNDMLG
        G+DG      GK   PWQRVKWTDKMV+LLIT +SY+G+D+A  CGG  RR+ AVLQKKGKWKSVSK+MAERGYHVSPQQCEDKFNDLNKRYK+LNDMLG
Subjt:  GIDG-----AGKNALPWQRVKWTDKMVKLLITVISYMGDDSASGCGGLGRRRLAVLQKKGKWKSVSKIMAERGYHVSPQQCEDKFNDLNKRYKRLNDMLG

Query:  RGTSCQVVDNPALLDGIDYLTAKEKDEVRKILSSKHLFYQEMCSYHNGNRLHLPHDPELLHSLQLVLRNRIDCENDDLRKLGHEDFDEDECDMETDARGD
        RGTSCQVV+NPALLD IDYLT KEKD+VRKILSSKHLFY+EMCSYHNGNRLHLPHDP+L  SLQL LR+R D ENDD R+  H+D D+D+ DMETD   +
Subjt:  RGTSCQVVDNPALLDGIDYLTAKEKDEVRKILSSKHLFYQEMCSYHNGNRLHLPHDPELLHSLQLVLRNRIDCENDDLRKLGHEDFDEDECDMETDARGD

Query:  FEENRASHENNRGLSGTTGGCMKRVRQGLGYEEGNLMHSLTSNDYNNRSHCQAEAVRVGTNHVLPETRRIFQLQKLWIESRSLQLEEKRLQIQTDRLELE
        FEEN A H ++RG+ G  GG  KR RQG  +E+    +SL S D N  S   +   +   N VLP+  R   LQK WIESRSLQLEE++LQIQ + LELE
Subjt:  FEENRASHENNRGLSGTTGGCMKRVRQGLGYEEGNLMHSLTSNDYNNRSHCQAEAVRVGTNHVLPETRRIFQLQKLWIESRSLQLEEKRLQIQTDRLELE

Query:  KQRFKWERFNKKRDRELEKLKLENERMELENEQMASQLKQK
        KQRFKW+RF+KKRDRELEK+++ENERM+LENE+MA +LK+K
Subjt:  KQRFKWERFNKKRDRELEKLKLENERMELENEQMASQLKQK

OMO51055.1 putative transcription factor [Corchorus olitorius] | 4.4e-151 | 63.72
Query:  MESNLTKRGMIPGGVSYGGVDLQGSMGVHHQGQYSLNLCQENHNRSHQRSSVRSLVRGSSPLTMGSSQHCNHRARLVDCGKGDI-KNSVSDEDEPSFSEE
        ME NL++ GMIPGG SYGG+DLQGSM VHH  Q+  N+ Q  H    Q +S+   +    PLTMG+ Q+C+    + D  KG+  K+SVSDEDEPSF+E+
Subjt:  MESNLTKRGMIPGGVSYGGVDLQGSMGVHHQGQYSLNLCQENHNRSHQRSSVRSLVRGSSPLTMGSSQHCNHRARLVDCGKGDI-KNSVSDEDEPSFSEE

Query:  GIDG-----AGKNALPWQRVKWTDKMVKLLITVISYMGDDSASGCGGLGRRRLAVLQKKGKWKSVSKIMAERGYHVSPQQCEDKFNDLNKRYKRLNDMLG
        G+DG      GK   PWQRVKWTDKMV+LLIT +SY+G+D    CGG  RR+ AVLQKKGKWKSVSK+MAERGYHVSPQQCEDKFNDLNKRYK+LNDMLG
Subjt:  GIDG-----AGKNALPWQRVKWTDKMVKLLITVISYMGDDSASGCGGLGRRRLAVLQKKGKWKSVSKIMAERGYHVSPQQCEDKFNDLNKRYKRLNDMLG

Query:  RGTSCQVVDNPALLDGIDYLTAKEKDEVRKILSSKHLFYQEMCSYHNGNRLHLPHDPELLHSLQLVLRNRIDCENDDLRKLGHEDFDEDECDMETDARGD
        RGTSCQVV+NP LLD IDYLT KEKD+VRKILSSKHLFY+EMCSYHNGNRLHLPHDP+L  SLQL LR+R D ENDD R+  H+D D+D+ DMETD   +
Subjt:  RGTSCQVVDNPALLDGIDYLTAKEKDEVRKILSSKHLFYQEMCSYHNGNRLHLPHDPELLHSLQLVLRNRIDCENDDLRKLGHEDFDEDECDMETDARGD

Query:  FEENRASHENNRGLSGTTGGCMKRVRQGLGYEEGNLMHSLTSNDYNNRSHCQAEAVRVGTNHVLPETRRIFQLQKLWIESRSLQLEEKRLQIQTDRLELE
        FEEN ASH +NRG  G  GG  KR RQG  +E+    HSL S D N          +   N V PE+ R   LQK W+ESRS+QLEE++LQIQ + LELE
Subjt:  FEENRASHENNRGLSGTTGGCMKRVRQGLGYEEGNLMHSLTSNDYNNRSHCQAEAVRVGTNHVLPETRRIFQLQKLWIESRSLQLEEKRLQIQTDRLELE

Query:  KQRFKWERFNKKRDRELEKLKLENERMELENEQMASQLKQK
        KQRFKW+RF+KKRDRELEK+++ENERM+LENE+MA +LK+K
Subjt:  KQRFKWERFNKKRDRELEKLKLENERMELENEQMASQLKQK

XP_007019480.2 PREDICTED: uncharacterized protein LOC18592603 [Theobroma cacao] | 5.8e-151 | 63.49
Query:  MESNLTKRGMIPGGVSYGGVDLQGSMGVHHQGQYSLNLCQENHNRSHQRSSVRSLVRGSSPLTMGSSQHCNHRARLVDCGKGD-IKNSVSDEDEPSFSEE
        ME NL++ GMI GG S+GG+D+QGSM VHH  Q+  N+ Q +H+   Q +S+   +    PLTMG+ Q+C+    + D  KG+  K+SVSDEDEPSF+EE
Subjt:  MESNLTKRGMIPGGVSYGGVDLQGSMGVHHQGQYSLNLCQENHNRSHQRSSVRSLVRGSSPLTMGSSQHCNHRARLVDCGKGD-IKNSVSDEDEPSFSEE

Query:  GIDG-----AGKNALPWQRVKWTDKMVKLLITVISYMGDDSASGCGGLGRRRLAVLQKKGKWKSVSKIMAERGYHVSPQQCEDKFNDLNKRYKRLNDMLG
        G+DG      GK   PWQRVKWTDKMV+LLIT +SY+G+D+A  CGG  RR+ AVLQKKGKWKSVSK+MAERGYHVSPQQCEDKFNDLNKRYK+LNDMLG
Subjt:  GIDG-----AGKNALPWQRVKWTDKMVKLLITVISYMGDDSASGCGGLGRRRLAVLQKKGKWKSVSKIMAERGYHVSPQQCEDKFNDLNKRYKRLNDMLG

Query:  RGTSCQVVDNPALLDGIDYLTAKEKDEVRKILSSKHLFYQEMCSYHNGNRLHLPHDPELLHSLQLVLRNRIDCENDDLRKLGHEDFDEDECDMETDARGD
        RGTSCQVV+ PALLD IDYLT KEKD+VRKILSSKHLFY+EMCSYHNGNRLHLPHDP+L  SLQL LR+R D ENDD R+  H+D D+D+ DMETD   +
Subjt:  RGTSCQVVDNPALLDGIDYLTAKEKDEVRKILSSKHLFYQEMCSYHNGNRLHLPHDPELLHSLQLVLRNRIDCENDDLRKLGHEDFDEDECDMETDARGD

Query:  FEENRASHENNRGLSGTTGGCMKRVRQGLGYEEGNLMHSLTSNDYNNRSHCQAEAVRVGTNHVLPETRRIFQLQKLWIESRSLQLEEKRLQIQTDRLELE
        FEEN A H ++RG+ G  GG  KR RQG  +E+    +SL S D N  S   +   +   N VLP+  R   LQK WIESRSLQLEE++LQIQ + LELE
Subjt:  FEENRASHENNRGLSGTTGGCMKRVRQGLGYEEGNLMHSLTSNDYNNRSHCQAEAVRVGTNHVLPETRRIFQLQKLWIESRSLQLEEKRLQIQTDRLELE

Query:  KQRFKWERFNKKRDRELEKLKLENERMELENEQMASQLKQK
        KQRFKW+RF+KKRDRELEK+++ENERM+LENE+MA +LK+K
Subjt:  KQRFKWERFNKKRDRELEKLKLENERMELENEQMASQLKQK

XP_022153723.1 uncharacterized protein LOC111021176 [Momordica charantia] | 2.0e-252 | 99.77
Query:  MESNLTKRGMIPGGVSYGGVDLQGSMGVHHQGQYSLNLCQENHNRSHQRSSVRSLVRGSSPLTMGSSQHCNHRARLVDCGKGDIKNSVSDEDEPSFSEEG
        MESNLTKRGMIPGGVSYGGVDLQGSMGVHHQGQYSLNLCQENHNRSHQRSSVRSLVRGSSPLTMGSSQHCNHRARLVDCGKGDIKNSVSDEDEPSFSEEG
Subjt:  MESNLTKRGMIPGGVSYGGVDLQGSMGVHHQGQYSLNLCQENHNRSHQRSSVRSLVRGSSPLTMGSSQHCNHRARLVDCGKGDIKNSVSDEDEPSFSEEG

Query:  IDGAGKNALPWQRVKWTDKMVKLLITVISYMGDDSASGCGGLGRRRLAVLQKKGKWKSVSKIMAERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSCQ
        IDGAGKNALPWQRVKWTDKMVKLLITVISYMGDDSASGCGGLGRRRLAVLQKKGKWKSVSKIMAERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSCQ
Subjt:  IDGAGKNALPWQRVKWTDKMVKLLITVISYMGDDSASGCGGLGRRRLAVLQKKGKWKSVSKIMAERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSCQ

Query:  VVDNPALLDGIDYLTAKEKDEVRKILSSKHLFYQEMCSYHNGNRLHLPHDPELLHSLQLVLRNRIDCENDDLRKLGHEDFDEDECDMETDARGDFEENRA
        VVDNPALLDGIDYLTAKEKDEVRKILSSKHLFYQEMCSYHNGNRLHLPHDPELLHSLQLVLRNRIDCENDDLRKLGHEDFDEDECDMETDARGDFEENRA
Subjt:  VVDNPALLDGIDYLTAKEKDEVRKILSSKHLFYQEMCSYHNGNRLHLPHDPELLHSLQLVLRNRIDCENDDLRKLGHEDFDEDECDMETDARGDFEENRA

Query:  SHENNRGLSGTTGGCMKRVRQGLGYEEGNLMHSLTSNDYNNRSHCQAEAVRVGTNHVLPETRRIFQLQKLWIESRSLQLEEKRLQIQTDRLELEKQRFKW
        SHENNRGLSGTTGGCMKRVRQGLGYEEGNLMHSLTSNDYNNRSHCQAEAVRVGTNHVLPETRRIFQLQKLWIESRSLQLEEKRLQIQTDRLELEKQRFKW
Subjt:  SHENNRGLSGTTGGCMKRVRQGLGYEEGNLMHSLTSNDYNNRSHCQAEAVRVGTNHVLPETRRIFQLQKLWIESRSLQLEEKRLQIQTDRLELEKQRFKW

Query:  ERFNKKRDRELEKLKLENERMELENEQMASQLKQKGKGGASL
        ERFNKKRDRELEKLKLENERM+LENEQMASQLKQKGKGGASL
Subjt:  ERFNKKRDRELEKLKLENERMELENEQMASQLKQKGKGGASL

XP_024164957.1 uncharacterized protein LOC112172033 [Rosa chinensis] | 2.4e-149 | 63.06
Query:  MESNLTKRGMIPGGVSYGGVDLQGSMGVHHQGQYSLNLCQENHNRSHQRSSVRSLVRGSSPLTMGSSQHCNHRARLVDCGKGD-IKNSVSDEDEPSFSEE
        ME +L++ G IPGG SY G+DLQGS+  HHQ Q+   L Q++H  S Q S V   +    P+ MG+  +C+    +VD  KG+  KNS SDEDEPS++EE
Subjt:  MESNLTKRGMIPGGVSYGGVDLQGSMGVHHQGQYSLNLCQENHNRSHQRSSVRSLVRGSSPLTMGSSQHCNHRARLVDCGKGD-IKNSVSDEDEPSFSEE

Query:  GIDG-----AGKNALPWQRVKWTDKMVKLLITVISYMGDDSASGCGGLGRRRLAVLQKKGKWKSVSKIMAERGYHVSPQQCEDKFNDLNKRYKRLNDMLG
        G+D       GK   PWQRVKWTDKMV+LLIT +SY+G+D  S CGG GRR+ + LQKKGKWKSVSK+MAERGYHVSPQQCEDKFNDLNKRYK+LNDMLG
Subjt:  GIDG-----AGKNALPWQRVKWTDKMVKLLITVISYMGDDSASGCGGLGRRRLAVLQKKGKWKSVSKIMAERGYHVSPQQCEDKFNDLNKRYKRLNDMLG

Query:  RGTSCQVVDNPALLDGIDYLTAKEKDEVRKILSSKHLFYQEMCSYHNGNRLHLPHDPELLHSLQLVLRNRIDCENDDLRKLGHEDFDEDECDMETDARGD
        RGTSCQVV+NP LLD IDYLT KEKD+VRKILSSKHLFY+EMCSYHNGNRLHLPHDP L HSLQ  LRNR D + DDLR+  H+D DED+ DMETD R D
Subjt:  RGTSCQVVDNPALLDGIDYLTAKEKDEVRKILSSKHLFYQEMCSYHNGNRLHLPHDPELLHSLQLVLRNRIDCENDDLRKLGHEDFDEDECDMETDARGD

Query:  FEENRASHENNRGLSGTTGGCMKRVRQGLGYEEGNLMHSLTSNDYNNRSHCQAEAVRVGTNHVLPETRRIFQLQKLWIESRSLQLEEKRLQIQTDRLELE
        FEEN ASH +NRG+ G  G  +KR+RQG G E+ N   SL + D N  S+   +  +   N VLP++ +   LQK WIESRS+QLEE++LQIQ + LELE
Subjt:  FEENRASHENNRGLSGTTGGCMKRVRQGLGYEEGNLMHSLTSNDYNNRSHCQAEAVRVGTNHVLPETRRIFQLQKLWIESRSLQLEEKRLQIQTDRLELE

Query:  KQRFKWERFNKKRDRELEKLKLENERMELENEQMASQLKQKGKG
        KQR KW++F+KKRDRELEKLK+ENERM+LENE+MA +LK+K  G
Subjt:  KQRFKWERFNKKRDRELEKLKLENERMELENEQMASQLKQKGKG

TrEMBL top hits | e value | % identity | Alignment
A0A061FHC2 Sequence-specific DNA binding transcription factors | 1.3e-151 | 63.72
Query:  MESNLTKRGMIPGGVSYGGVDLQGSMGVHHQGQYSLNLCQENHNRSHQRSSVRSLVRGSSPLTMGSSQHCNHRARLVDCGKGD-IKNSVSDEDEPSFSEE
        ME NL++ GMI GG S+GG+D+QGSM VHH  Q+  N+ Q +H+   Q +S+   +    PLTMG+ Q+C+    + D  KG+  K+SVSDEDEPSF+EE
Subjt:  MESNLTKRGMIPGGVSYGGVDLQGSMGVHHQGQYSLNLCQENHNRSHQRSSVRSLVRGSSPLTMGSSQHCNHRARLVDCGKGD-IKNSVSDEDEPSFSEE

Query:  GIDG-----AGKNALPWQRVKWTDKMVKLLITVISYMGDDSASGCGGLGRRRLAVLQKKGKWKSVSKIMAERGYHVSPQQCEDKFNDLNKRYKRLNDMLG
        G+DG      GK   PWQRVKWTDKMV+LLIT +SY+G+D+A  CGG  RR+ AVLQKKGKWKSVSK+MAERGYHVSPQQCEDKFNDLNKRYK+LNDMLG
Subjt:  GIDG-----AGKNALPWQRVKWTDKMVKLLITVISYMGDDSASGCGGLGRRRLAVLQKKGKWKSVSKIMAERGYHVSPQQCEDKFNDLNKRYKRLNDMLG

Query:  RGTSCQVVDNPALLDGIDYLTAKEKDEVRKILSSKHLFYQEMCSYHNGNRLHLPHDPELLHSLQLVLRNRIDCENDDLRKLGHEDFDEDECDMETDARGD
        RGTSCQVV+NPALLD IDYLT KEKD+VRKILSSKHLFY+EMCSYHNGNRLHLPHDP+L  SLQL LR+R D ENDD R+  H+D D+D+ DMETD   +
Subjt:  RGTSCQVVDNPALLDGIDYLTAKEKDEVRKILSSKHLFYQEMCSYHNGNRLHLPHDPELLHSLQLVLRNRIDCENDDLRKLGHEDFDEDECDMETDARGD

Query:  FEENRASHENNRGLSGTTGGCMKRVRQGLGYEEGNLMHSLTSNDYNNRSHCQAEAVRVGTNHVLPETRRIFQLQKLWIESRSLQLEEKRLQIQTDRLELE
        FEEN A H ++RG+ G  GG  KR RQG  +E+    +SL S D N  S   +   +   N VLP+  R   LQK WIESRSLQLEE++LQIQ + LELE
Subjt:  FEENRASHENNRGLSGTTGGCMKRVRQGLGYEEGNLMHSLTSNDYNNRSHCQAEAVRVGTNHVLPETRRIFQLQKLWIESRSLQLEEKRLQIQTDRLELE

Query:  KQRFKWERFNKKRDRELEKLKLENERMELENEQMASQLKQK
        KQRFKW+RF+KKRDRELEK+++ENERM+LENE+MA +LK+K
Subjt:  KQRFKWERFNKKRDRELEKLKLENERMELENEQMASQLKQK

A0A1R3FYZ5 Putative transcription factor | 2.1e-151 | 63.72
Query:  MESNLTKRGMIPGGVSYGGVDLQGSMGVHHQGQYSLNLCQENHNRSHQRSSVRSLVRGSSPLTMGSSQHCNHRARLVDCGKGDI-KNSVSDEDEPSFSEE
        ME NL++ GMIPGG SYGG+DLQGSM VHH  Q+  N+ Q  H    Q +S+   +    PLTMG+ Q+C+    + D  KG+  K+SVSDEDEPSF+E+
Subjt:  MESNLTKRGMIPGGVSYGGVDLQGSMGVHHQGQYSLNLCQENHNRSHQRSSVRSLVRGSSPLTMGSSQHCNHRARLVDCGKGDI-KNSVSDEDEPSFSEE

Query:  GIDG-----AGKNALPWQRVKWTDKMVKLLITVISYMGDDSASGCGGLGRRRLAVLQKKGKWKSVSKIMAERGYHVSPQQCEDKFNDLNKRYKRLNDMLG
        G+DG      GK   PWQRVKWTDKMV+LLIT +SY+G+D    CGG  RR+ AVLQKKGKWKSVSK+MAERGYHVSPQQCEDKFNDLNKRYK+LNDMLG
Subjt:  GIDG-----AGKNALPWQRVKWTDKMVKLLITVISYMGDDSASGCGGLGRRRLAVLQKKGKWKSVSKIMAERGYHVSPQQCEDKFNDLNKRYKRLNDMLG

Query:  RGTSCQVVDNPALLDGIDYLTAKEKDEVRKILSSKHLFYQEMCSYHNGNRLHLPHDPELLHSLQLVLRNRIDCENDDLRKLGHEDFDEDECDMETDARGD
        RGTSCQVV+NP LLD IDYLT KEKD+VRKILSSKHLFY+EMCSYHNGNRLHLPHDP+L  SLQL LR+R D ENDD R+  H+D D+D+ DMETD   +
Subjt:  RGTSCQVVDNPALLDGIDYLTAKEKDEVRKILSSKHLFYQEMCSYHNGNRLHLPHDPELLHSLQLVLRNRIDCENDDLRKLGHEDFDEDECDMETDARGD

Query:  FEENRASHENNRGLSGTTGGCMKRVRQGLGYEEGNLMHSLTSNDYNNRSHCQAEAVRVGTNHVLPETRRIFQLQKLWIESRSLQLEEKRLQIQTDRLELE
        FEEN ASH +NRG  G  GG  KR RQG  +E+    HSL S D N          +   N V PE+ R   LQK W+ESRS+QLEE++LQIQ + LELE
Subjt:  FEENRASHENNRGLSGTTGGCMKRVRQGLGYEEGNLMHSLTSNDYNNRSHCQAEAVRVGTNHVLPETRRIFQLQKLWIESRSLQLEEKRLQIQTDRLELE

Query:  KQRFKWERFNKKRDRELEKLKLENERMELENEQMASQLKQK
        KQRFKW+RF+KKRDRELEK+++ENERM+LENE+MA +LK+K
Subjt:  KQRFKWERFNKKRDRELEKLKLENERMELENEQMASQLKQK

A0A2N9EKB6 Uncharacterized protein | 3.8e-148 | 63.57
Query:  MESNLTKRGMIP-GGVSYGGVDLQGSMGVHHQGQYSLNLCQENHNRSHQRSSVRSLVRGSSPLTMGSSQHCNHRARLVDCGKGDI-KNSVSDEDEPSFSE
        ME NL + GMIP GG SY G DL GSM VHHQ Q+  +   ++ +   Q S+V   +    PLTMG+ Q+C+    +VD  KG++ KNSVSDEDE S++E
Subjt:  MESNLTKRGMIP-GGVSYGGVDLQGSMGVHHQGQYSLNLCQENHNRSHQRSSVRSLVRGSSPLTMGSSQHCNHRARLVDCGKGDI-KNSVSDEDEPSFSE

Query:  EGIDG-----AGKNALPWQRVKWTDKMVKLLITVISYMGDDSASGCGGLGRRRLAVLQKKGKWKSVSKIMAERGYHVSPQQCEDKFNDLNKRYKRLNDML
        EG+DG      GK   PWQRVKWTDKMV+LLIT +SY+G+D+ S C G GRR+ AVLQKKGKWKS+SK+MAERGYHVSPQQCEDKFNDLNKRYKRLNDML
Subjt:  EGIDG-----AGKNALPWQRVKWTDKMVKLLITVISYMGDDSASGCGGLGRRRLAVLQKKGKWKSVSKIMAERGYHVSPQQCEDKFNDLNKRYKRLNDML

Query:  GRGTSCQVVDNPALLDGIDYLTAKEKDEVRKILSSKHLFYQEMCSYHNGNRLHLPHDPELLHSLQLVLRNRIDCENDDLRKLGHEDFDEDECDMETDARG
        GRGTSCQVV+NPALLD IDYLT KEKD+VRKILSSK LFY+EMCSYHNGNRLHLPHD  L  SLQL LR+R D +NDD+R+  H+D DED+ DMETD   
Subjt:  GRGTSCQVVDNPALLDGIDYLTAKEKDEVRKILSSKHLFYQEMCSYHNGNRLHLPHDPELLHSLQLVLRNRIDCENDDLRKLGHEDFDEDECDMETDARG

Query:  DFEENRASHENNRGLSGTTGGCMKRVRQGLGYEEGNLMHSLTSNDYNNRSHCQAEAVRVGTNHVLPETRRIFQLQKLWIESRSLQLEEKRLQIQTDRLEL
        DFEEN A H +NRG+ G  G   K++RQG  +E+ N+ + L S DYN   + QA+  +   N  LPE+ R   LQK WIESRSL LEE++LQIQ + LEL
Subjt:  DFEENRASHENNRGLSGTTGGCMKRVRQGLGYEEGNLMHSLTSNDYNNRSHCQAEAVRVGTNHVLPETRRIFQLQKLWIESRSLQLEEKRLQIQTDRLEL

Query:  EKQRFKWERFNKKRDRELEKLKLENERMELENEQMASQLKQK
        EKQRFKW+RF++KRDRELEKL++ENERM+LENE+MA +LKQK
Subjt:  EKQRFKWERFNKKRDRELEKLKLENERMELENEQMASQLKQK

A0A2P6PS33 Putative transcription factor Trihelix family | 1.2e-149 | 63.06
Query:  MESNLTKRGMIPGGVSYGGVDLQGSMGVHHQGQYSLNLCQENHNRSHQRSSVRSLVRGSSPLTMGSSQHCNHRARLVDCGKGD-IKNSVSDEDEPSFSEE
        ME +L++ G IPGG SY G+DLQGS+  HHQ Q+   L Q++H  S Q S V   +    P+ MG+  +C+    +VD  KG+  KNS SDEDEPS++EE
Subjt:  MESNLTKRGMIPGGVSYGGVDLQGSMGVHHQGQYSLNLCQENHNRSHQRSSVRSLVRGSSPLTMGSSQHCNHRARLVDCGKGD-IKNSVSDEDEPSFSEE

Query:  GIDG-----AGKNALPWQRVKWTDKMVKLLITVISYMGDDSASGCGGLGRRRLAVLQKKGKWKSVSKIMAERGYHVSPQQCEDKFNDLNKRYKRLNDMLG
        G+D       GK   PWQRVKWTDKMV+LLIT +SY+G+D  S CGG GRR+ + LQKKGKWKSVSK+MAERGYHVSPQQCEDKFNDLNKRYK+LNDMLG
Subjt:  GIDG-----AGKNALPWQRVKWTDKMVKLLITVISYMGDDSASGCGGLGRRRLAVLQKKGKWKSVSKIMAERGYHVSPQQCEDKFNDLNKRYKRLNDMLG

Query:  RGTSCQVVDNPALLDGIDYLTAKEKDEVRKILSSKHLFYQEMCSYHNGNRLHLPHDPELLHSLQLVLRNRIDCENDDLRKLGHEDFDEDECDMETDARGD
        RGTSCQVV+NP LLD IDYLT KEKD+VRKILSSKHLFY+EMCSYHNGNRLHLPHDP L HSLQ  LRNR D + DDLR+  H+D DED+ DMETD R D
Subjt:  RGTSCQVVDNPALLDGIDYLTAKEKDEVRKILSSKHLFYQEMCSYHNGNRLHLPHDPELLHSLQLVLRNRIDCENDDLRKLGHEDFDEDECDMETDARGD

Query:  FEENRASHENNRGLSGTTGGCMKRVRQGLGYEEGNLMHSLTSNDYNNRSHCQAEAVRVGTNHVLPETRRIFQLQKLWIESRSLQLEEKRLQIQTDRLELE
        FEEN ASH +NRG+ G  G  +KR+RQG G E+ N   SL + D N  S+   +  +   N VLP++ +   LQK WIESRS+QLEE++LQIQ + LELE
Subjt:  FEENRASHENNRGLSGTTGGCMKRVRQGLGYEEGNLMHSLTSNDYNNRSHCQAEAVRVGTNHVLPETRRIFQLQKLWIESRSLQLEEKRLQIQTDRLELE

Query:  KQRFKWERFNKKRDRELEKLKLENERMELENEQMASQLKQKGKG
        KQR KW++F+KKRDRELEKLK+ENERM+LENE+MA +LK+K  G
Subjt:  KQRFKWERFNKKRDRELEKLKLENERMELENEQMASQLKQKGKG

A0A6J1DI82 uncharacterized protein LOC111021176 | 9.7e-253 | 99.77
Query:  MESNLTKRGMIPGGVSYGGVDLQGSMGVHHQGQYSLNLCQENHNRSHQRSSVRSLVRGSSPLTMGSSQHCNHRARLVDCGKGDIKNSVSDEDEPSFSEEG
        MESNLTKRGMIPGGVSYGGVDLQGSMGVHHQGQYSLNLCQENHNRSHQRSSVRSLVRGSSPLTMGSSQHCNHRARLVDCGKGDIKNSVSDEDEPSFSEEG
Subjt:  MESNLTKRGMIPGGVSYGGVDLQGSMGVHHQGQYSLNLCQENHNRSHQRSSVRSLVRGSSPLTMGSSQHCNHRARLVDCGKGDIKNSVSDEDEPSFSEEG

Query:  IDGAGKNALPWQRVKWTDKMVKLLITVISYMGDDSASGCGGLGRRRLAVLQKKGKWKSVSKIMAERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSCQ
        IDGAGKNALPWQRVKWTDKMVKLLITVISYMGDDSASGCGGLGRRRLAVLQKKGKWKSVSKIMAERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSCQ
Subjt:  IDGAGKNALPWQRVKWTDKMVKLLITVISYMGDDSASGCGGLGRRRLAVLQKKGKWKSVSKIMAERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSCQ

Query:  VVDNPALLDGIDYLTAKEKDEVRKILSSKHLFYQEMCSYHNGNRLHLPHDPELLHSLQLVLRNRIDCENDDLRKLGHEDFDEDECDMETDARGDFEENRA
        VVDNPALLDGIDYLTAKEKDEVRKILSSKHLFYQEMCSYHNGNRLHLPHDPELLHSLQLVLRNRIDCENDDLRKLGHEDFDEDECDMETDARGDFEENRA
Subjt:  VVDNPALLDGIDYLTAKEKDEVRKILSSKHLFYQEMCSYHNGNRLHLPHDPELLHSLQLVLRNRIDCENDDLRKLGHEDFDEDECDMETDARGDFEENRA

Query:  SHENNRGLSGTTGGCMKRVRQGLGYEEGNLMHSLTSNDYNNRSHCQAEAVRVGTNHVLPETRRIFQLQKLWIESRSLQLEEKRLQIQTDRLELEKQRFKW
        SHENNRGLSGTTGGCMKRVRQGLGYEEGNLMHSLTSNDYNNRSHCQAEAVRVGTNHVLPETRRIFQLQKLWIESRSLQLEEKRLQIQTDRLELEKQRFKW
Subjt:  SHENNRGLSGTTGGCMKRVRQGLGYEEGNLMHSLTSNDYNNRSHCQAEAVRVGTNHVLPETRRIFQLQKLWIESRSLQLEEKRLQIQTDRLELEKQRFKW

Query:  ERFNKKRDRELEKLKLENERMELENEQMASQLKQKGKGGASL
        ERFNKKRDRELEKLKLENERM+LENEQMASQLKQKGKGGASL
Subjt:  ERFNKKRDRELEKLKLENERMELENEQMASQLKQKGKGGASL

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT1G21200.1 sequence-specific DNA binding transcription factors | 2.3e-121 | 54.32
Query:  MESNLTKRGMIPGGV-SYGGVDLQGSMGVHHQGQYSLNLCQENHNRSHQRSSVRSLVRGSSPLTMGSSQHCNHRAR----LVDCGKGD-IKNSVSDEDEP
        M+ N  + G++  G  SYGG DLQGSM VHH         Q++ N+ H+ +     +    P TM + Q C+H       + +  K +  KNSVSD+DEP
Subjt:  MESNLTKRGMIPGGV-SYGGVDLQGSMGVHHQGQYSLNLCQENHNRSHQRSSVRSLVRGSSPLTMGSSQHCNHRAR----LVDCGKGD-IKNSVSDEDEP

Query:  SFSEEGIDGAGKNA------LPWQRVKWTDKMVKLLITVISYMGDDSASGCGGLGRRRLAVLQKKGKWKSVSKIMAERGYHVSPQQCEDKFNDLNKRYKR
        SF+EEG DG    A       PWQRVKWTDKMVKLLIT +SY+GDDS+       RR+ AVLQKKGKWKSVSK+MAERGYHVSPQQCEDKFNDLNKRYK+
Subjt:  SFSEEGIDGAGKNA------LPWQRVKWTDKMVKLLITVISYMGDDSASGCGGLGRRRLAVLQKKGKWKSVSKIMAERGYHVSPQQCEDKFNDLNKRYKR

Query:  LNDMLGRGTSCQVVDNPALLDGIDYLTAKEKDEVRKILSSKHLFYQEMCSYHNGNRLHLPHDPELLHSLQLVLRNRIDCENDDLRKLGHEDFDEDECDME
        LNDMLGRGTSCQVV+NPALLD I YL  KEKD+VRKI+SSKHLFY+EMCSYHNGNRLHLPHD  L  SLQL LR+R D +NDD RK   ED D+++ D +
Subjt:  LNDMLGRGTSCQVVDNPALLDGIDYLTAKEKDEVRKILSSKHLFYQEMCSYHNGNRLHLPHDPELLHSLQLVLRNRIDCENDDLRKLGHEDFDEDECDME

Query:  TDARGDFEENRASHENNR-GLSGTTGGCMKRVRQGLGYEEGNLMHSLTSNDYNNRSHCQAEAVRVGTNHVLPETRRIFQLQKLWIESRSLQLEEKRLQIQ
         D   ++EE   ++ + R    G  GG +K++R  L +E+G+    + S + N  S  Q    +   N    E+ R   +QK W+ESR+LQLEE++LQIQ
Subjt:  TDARGDFEENRASHENNR-GLSGTTGGCMKRVRQGLGYEEGNLMHSLTSNDYNNRSHCQAEAVRVGTNHVLPETRRIFQLQKLWIESRSLQLEEKRLQIQ

Query:  TDRLELEKQRFKWERFNKKRDRELEKLKLENERMELENEQMASQLKQKGKG
         + LELEKQRF+W+RF+KKRD+ELE++++ENERM+LEN++M  +LKQ+  G
Subjt:  TDRLELEKQRFKWERFNKKRDRELEKLKLENERMELENEQMASQLKQKGKG

AT1G76870.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors (TAIR:AT1G21200.1) | 7.9e-90 | 52.42
Query:  KNSVSDEDEPSFSEEGIDGAGKNALPWQRVKWTDKMVKLLITVISYMGDDSASGCGGLGRRRLAVLQKKGKWKSVSKIMAERGYHVSPQQCEDKFNDLNK
        K S+S++DE            K   PWQRVKW DKMVKL+IT +SY+G+DS S       ++ AVLQKKGKW+SVSK+M ERGYHVSPQQCEDKFNDLNK
Subjt:  KNSVSDEDEPSFSEEGIDGAGKNALPWQRVKWTDKMVKLLITVISYMGDDSASGCGGLGRRRLAVLQKKGKWKSVSKIMAERGYHVSPQQCEDKFNDLNK

Query:  RYKRLNDMLGRGTSCQVVDNPALLDGIDYLTAKEKDEVRKILSSKHLFYQEMCSYHNGNRLHLPHDPELLHSLQLV-LRNRIDCENDDLRKLGHEDFDED
        RYK+LN+MLGRGTSC+VV+NP+LLD IDYL  KEKDEVR+I+SSKHLFY+EMCSYHNGNRLHLPHDP +  SL L+ L +R D +ND+  K  +ED D+D
Subjt:  RYKRLNDMLGRGTSCQVVDNPALLDGIDYLTAKEKDEVRKILSSKHLFYQEMCSYHNGNRLHLPHDPELLHSLQLV-LRNRIDCENDDLRKLGHEDFDED

Query:  ECDMETDARGDFEENRASHENNRGLSGTTGGCMKRVRQGLGYEEGNLMHSLTSNDYNNRSHCQAEAVRVGTNHVLPETRRIFQLQKLWIESRSLQLEEKR
        +         D+EE+     ++R L        KR+RQ   +E+  + H     D       QA+  R     +  ++R+   LQ+  IES+SL+LE ++
Subjt:  ECDMETDARGDFEENRASHENNRGLSGTTGGCMKRVRQGLGYEEGNLMHSLTSNDYNNRSHCQAEAVRVGTNHVLPETRRIFQLQKLWIESRSLQLEEKR

Query:  LQIQTDRLELEKQRFKWERFNKKRDRELEKLKLENERMELENEQMASQLKQ
        LQIQ + +ELE+Q+FKWE F+K+R+++L K+++ENERM+LENE+M+ +LK+
Subjt:  LQIQTDRLELEKQRFKWERFNKKRDRELEKLKLENERMELENEQMASQLKQ

AT3G10040.1 sequence-specific DNA binding transcription factors | 7.0e-54 | 34.35
Query:  MESNLTKRGMIPGGVS----YGGVDLQGSMGVHHQGQYSLNLCQENHNRSHQRSSVRSLVRGSSPLTMGSSQHCNHRARLVDCGKGDIKNSVSDEDEPSF
        MESN+   G  P  +S        + Q S+   H   Y+ +  Q+            S  +  SP++ G    C+   R    G G      + ED    
Subjt:  MESNLTKRGMIPGGVS----YGGVDLQGSMGVHHQGQYSLNLCQENHNRSHQRSSVRSLVRGSSPLTMGSSQHCNHRARLVDCGKGDIKNSVSDEDEPSF

Query:  SEEGIDGAGKNALPWQRVKWTDKMVKLLITVISYMGDDSA----------SGCGGLGRRRLAVLQKKGKWKSVSKIMAERGYHVSPQQCEDKFNDLNKRY
           G DG  K +  W R+KWTD MV+LLI  + Y+GD++           +G GG G     +LQKKGKWKSVS+ M E+G+ VSPQQCEDKFNDLNKRY
Subjt:  SEEGIDGAGKNALPWQRVKWTDKMVKLLITVISYMGDDSA----------SGCGGLGRRRLAVLQKKGKWKSVSKIMAERGYHVSPQQCEDKFNDLNKRY

Query:  KRLNDMLGRGTSCQVVDNPALLDGIDYLTAKEKDEVRKILSSKHLFYQEMCSYHNGNRLHLPHD--PELLHSLQLVLRNRID-----CENDDLRKLGH--
        KR+ND+LG+G +C+VV+N  LL+ +D+LT K KDEV+K+L+SKHLF++EMC+YHN       HD  P   + + + + ++        E   + ++    
Subjt:  KRLNDMLGRGTSCQVVDNPALLDGIDYLTAKEKDEVRKILSSKHLFYQEMCSYHNGNRLHLPHD--PELLHSLQLVLRNRID-----CENDDLRKLGH--

Query:  EDFDEDECDMETDARGDFEENRASHENNRGLSGTTGGCMKRVRQGLGYEEGNLMHSLTSNDYNNRSHCQAEAVRVGTNHVLPETRRIFQLQKLWIESRSL
        E  +E E DM  D+  + EE+       +    T    +KR+R+                          EA       V+ +  +    +K WI  + L
Subjt:  EDFDEDECDMETDARGDFEENRASHENNRGLSGTTGGCMKRVRQGLGYEEGNLMHSLTSNDYNNRSHCQAEAVRVGTNHVLPETRRIFQLQKLWIESRSL

Query:  QLEEKRLQIQTDRLELEKQRFKWERFNKKRDRELEKLKLENERMELENEQMASQLKQ
        ++EEK++  + + +E+EKQR KW R+  K++RE+EK KL+N+R  LE E+M   L++
Subjt:  QLEEKRLQIQTDRLELEKQRFKWERFNKKRDRELEKLKLENERMELENEQMASQLKQ


Sequences
CDS sequence
ATGGAATCCAATTTAACGAAAAGAGGGATGATTCCAGGTGGGGTTTCGTATGGAGGTGTCGATTTGCAAGGATCTATGGGGGTTCATCACCAAGGACAATACTCCCTCAA
CTTATGCCAGGAAAATCATAATCGTAGTCACCAGCGATCTTCAGTTCGGTCCTTGGTTCGTGGTAGCTCTCCATTGACAATGGGAAGCTCGCAACATTGCAATCACCGGG
CTAGACTTGTGGATTGTGGTAAAGGGGACATCAAGAATTCGGTGAGTGATGAAGACGAGCCAAGTTTTAGCGAAGAGGGGATTGATGGTGCAGGGAAGAATGCATTGCCA
TGGCAGCGGGTGAAGTGGACGGATAAGATGGTGAAGCTCTTGATTACTGTTATTTCTTATATGGGAGATGATTCTGCCTCAGGGTGTGGAGGCTTAGGTAGAAGGAGACT
TGCAGTTCTACAGAAAAAGGGCAAGTGGAAATCAGTTTCGAAGATAATGGCCGAGCGGGGCTATCATGTTTCACCCCAGCAATGCGAGGATAAGTTTAATGACCTCAATA
AAAGGTATAAAAGACTGAATGATATGCTTGGTAGGGGCACCTCTTGCCAAGTTGTGGACAACCCTGCGCTTTTGGACGGGATAGATTATCTAACCGCAAAGGAAAAGGAC
GAAGTTAGGAAGATTCTAAGCTCGAAGCATCTGTTCTATCAGGAGATGTGTTCTTATCACAATGGCAATAGACTACATTTGCCTCATGATCCGGAATTGCTACATTCTTT
ACAGTTGGTTCTTAGAAATAGAATTGATTGTGAGAATGATGATCTGAGGAAGCTCGGACATGAAGATTTTGATGAGGATGAATGTGACATGGAAACTGATGCTCGTGGTG
ATTTTGAGGAGAATCGTGCTTCGCACGAGAATAATAGGGGTCTGTCTGGGACTACTGGGGGCTGCATGAAGCGGGTGAGACAAGGCCTTGGGTATGAAGAAGGAAACTTA
ATGCATTCCTTGACTTCTAATGACTACAACAACCGTTCTCATTGTCAAGCAGAAGCCGTTCGAGTTGGAACGAACCATGTTCTACCTGAAACCCGTAGAATTTTTCAGTT
ACAAAAGCTGTGGATCGAGTCCCGTTCGCTTCAGTTAGAGGAGAAAAGGCTTCAAATCCAAACGGATAGGTTGGAATTGGAGAAACAACGTTTCAAATGGGAAAGATTCA
ACAAGAAAAGGGACCGAGAGTTAGAAAAGTTGAAGTTAGAAAACGAGAGGATGGAGCTCGAGAACGAGCAGATGGCATCACAACTGAAGCAAAAAGGAAAAGGGGGCGCT
TCCCTC
mRNA sequence
ATGGAATCCAATTTAACGAAAAGAGGGATGATTCCAGGTGGGGTTTCGTATGGAGGTGTCGATTTGCAAGGATCTATGGGGGTTCATCACCAAGGACAATACTCCCTCAA
CTTATGCCAGGAAAATCATAATCGTAGTCACCAGCGATCTTCAGTTCGGTCCTTGGTTCGTGGTAGCTCTCCATTGACAATGGGAAGCTCGCAACATTGCAATCACCGGG
CTAGACTTGTGGATTGTGGTAAAGGGGACATCAAGAATTCGGTGAGTGATGAAGACGAGCCAAGTTTTAGCGAAGAGGGGATTGATGGTGCAGGGAAGAATGCATTGCCA
TGGCAGCGGGTGAAGTGGACGGATAAGATGGTGAAGCTCTTGATTACTGTTATTTCTTATATGGGAGATGATTCTGCCTCAGGGTGTGGAGGCTTAGGTAGAAGGAGACT
TGCAGTTCTACAGAAAAAGGGCAAGTGGAAATCAGTTTCGAAGATAATGGCCGAGCGGGGCTATCATGTTTCACCCCAGCAATGCGAGGATAAGTTTAATGACCTCAATA
AAAGGTATAAAAGACTGAATGATATGCTTGGTAGGGGCACCTCTTGCCAAGTTGTGGACAACCCTGCGCTTTTGGACGGGATAGATTATCTAACCGCAAAGGAAAAGGAC
GAAGTTAGGAAGATTCTAAGCTCGAAGCATCTGTTCTATCAGGAGATGTGTTCTTATCACAATGGCAATAGACTACATTTGCCTCATGATCCGGAATTGCTACATTCTTT
ACAGTTGGTTCTTAGAAATAGAATTGATTGTGAGAATGATGATCTGAGGAAGCTCGGACATGAAGATTTTGATGAGGATGAATGTGACATGGAAACTGATGCTCGTGGTG
ATTTTGAGGAGAATCGTGCTTCGCACGAGAATAATAGGGGTCTGTCTGGGACTACTGGGGGCTGCATGAAGCGGGTGAGACAAGGCCTTGGGTATGAAGAAGGAAACTTA
ATGCATTCCTTGACTTCTAATGACTACAACAACCGTTCTCATTGTCAAGCAGAAGCCGTTCGAGTTGGAACGAACCATGTTCTACCTGAAACCCGTAGAATTTTTCAGTT
ACAAAAGCTGTGGATCGAGTCCCGTTCGCTTCAGTTAGAGGAGAAAAGGCTTCAAATCCAAACGGATAGGTTGGAATTGGAGAAACAACGTTTCAAATGGGAAAGATTCA
ACAAGAAAAGGGACCGAGAGTTAGAAAAGTTGAAGTTAGAAAACGAGAGGATGGAGCTCGAGAACGAGCAGATGGCATCACAACTGAAGCAAAAAGGAAAAGGGGGCGCT
TCCCTC
Protein sequence
MESNLTKRGMIPGGVSYGGVDLQGSMGVHHQGQYSLNLCQENHNRSHQRSSVRSLVRGSSPLTMGSSQHCNHRARLVDCGKGDIKNSVSDEDEPSFSEEGIDGAGKNALP
WQRVKWTDKMVKLLITVISYMGDDSASGCGGLGRRRLAVLQKKGKWKSVSKIMAERGYHVSPQQCEDKFNDLNKRYKRLNDMLGRGTSCQVVDNPALLDGIDYLTAKEKD
EVRKILSSKHLFYQEMCSYHNGNRLHLPHDPELLHSLQLVLRNRIDCENDDLRKLGHEDFDEDECDMETDARGDFEENRASHENNRGLSGTTGGCMKRVRQGLGYEEGNL
MHSLTSNDYNNRSHCQAEAVRVGTNHVLPETRRIFQLQKLWIESRSLQLEEKRLQIQTDRLELEKQRFKWERFNKKRDRELEKLKLENERMELENEQMASQLKQKGKGGA
SL
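
The CDS and protein sequences listed above can be checked against each other by translating the nucleotides codon by codon. The Python below is a minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1) applies; the function name `translate` is hypothetical, and only the first and last few codons of the CDS are used as input rather than the full sequence.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# for each of the three positions (an assumption: this page does not
# say which translation table was used).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0; trailing partial codons are ignored."""
    cds = cds.upper().replace("\n", "")
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First six and last four codons of the CDS shown above.
print(translate("ATGGAATCCAATTTAACG"))  # start of the protein
print(translate("GGCGCTTCCCTC"))        # end of the protein
```

The two fragments translate to `MESNLT` and `GASL`, which agree with the beginning and end of the protein sequence listed above. The listed CDS ends on a sense codon, so no stop symbol appears in the translation.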