; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy4G019310 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy4G019310
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionDOG1 domain-containing protein
Genome locationGy14Chr4:25571322..25572482
RNA-Seq ExpressionCsGy4G019310
SyntenyCsGy4G019310
Gene Ontology termsGO:0006351 - transcription, DNA-templated (biological process)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR025422 - Transcription factor TGA like domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0041261.1 transcription factor TGA2-like isoform X2 [Cucumis melo var. makuwa]7.02e-16090.42Show/hide
Query:  MDQEISFGEFFQKWMKEQNQYLTELISTAKGGNN------MVAEALMKRVMEHYEHYYKVKSRWVEKDTLGILSPSWISSFEDAFLWLGGWRPTMAFHLL
        MDQE SFGEFF+KWMKEQNQYLTELIST KGG N      +VAEALMKRVMEHYEHYY+VKS WVEKD LGILSPSWISSFEDAFLWLGGWRPTMAFHLL
Subjt:  MDQEISFGEFFQKWMKEQNQYLTELISTAKGGNN------MVAEALMKRVMEHYEHYYKVKSRWVEKDTLGILSPSWISSFEDAFLWLGGWRPTMAFHLL

Query:  YSKSGLQLEGRLLDLIHGLSTGDLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEELKLA
        YSKSGLQLEGRLLDLIHGLSTGDLADLSSHQV+KIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKM TSGGG QN+ +L+MVEEELKLA
Subjt:  YSKSGLQLEGRLLDLIHGLSTGDLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEELKLA

Query:  LATKECGLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKRDSDQR
        LA KE GLKEVVKMADELRL TLKQIIGILT TQRVHFLIAAAELHLRIHEWGLKRDSDQR
Subjt:  LATKECGLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKRDSDQR

XP_008442174.1 PREDICTED: transcription factor HBP-1b(c38)-like isoform X1 [Cucumis melo]1.01e-15790.08Show/hide
Query:  MDQEISFGEFFQKWMKEQNQYLTELISTAKGGNN------MVAEALMKRVMEHYEHYYKVKSRWVEKDTL-GILSPSWISSFEDAFLWLGGWRPTMAFHL
        MDQE SFGEFF+KWMKEQNQYLTELIST KGG N      +VAEALMKRVMEHYEHYY+VKS WVEKD L GILSPSWISSFEDAFLWLGGWRPTMAFHL
Subjt:  MDQEISFGEFFQKWMKEQNQYLTELISTAKGGNN------MVAEALMKRVMEHYEHYYKVKSRWVEKDTL-GILSPSWISSFEDAFLWLGGWRPTMAFHL

Query:  LYSKSGLQLEGRLLDLIHGLSTGDLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEELKL
        LYSKSGLQLEGRLLDLIHGLSTGDLADLSSHQV+KIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKM TSGGG QN+ +L+MVEEELKL
Subjt:  LYSKSGLQLEGRLLDLIHGLSTGDLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEELKL

Query:  ALATKECGLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKRDSDQR
        ALA KE GLKEVVKMADELRL TLKQIIGILT TQRVHFLIAAAELHLRIHEWGLKRDSDQR
Subjt:  ALATKECGLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKRDSDQR

XP_008442175.1 PREDICTED: transcription factor TGA2-like isoform X2 [Cucumis melo]1.45e-15990.42Show/hide
Query:  MDQEISFGEFFQKWMKEQNQYLTELISTAKGGNN------MVAEALMKRVMEHYEHYYKVKSRWVEKDTLGILSPSWISSFEDAFLWLGGWRPTMAFHLL
        MDQE SFGEFF+KWMKEQNQYLTELIST KGG N      +VAEALMKRVMEHYEHYY+VKS WVEKD LGILSPSWISSFEDAFLWLGGWRPTMAFHLL
Subjt:  MDQEISFGEFFQKWMKEQNQYLTELISTAKGGNN------MVAEALMKRVMEHYEHYYKVKSRWVEKDTLGILSPSWISSFEDAFLWLGGWRPTMAFHLL

Query:  YSKSGLQLEGRLLDLIHGLSTGDLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEELKLA
        YSKSGLQLEGRLLDLIHGLSTGDLADLSSHQV+KIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKM TSGGG QN+ +L+MVEEELKLA
Subjt:  YSKSGLQLEGRLLDLIHGLSTGDLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEELKLA

Query:  LATKECGLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKRDSDQR
        LA KE GLKEVVKMADELRL TLKQIIGILT TQRVHFLIAAAELHLRIHEWGLKRDSDQR
Subjt:  LATKECGLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKRDSDQR

XP_011653880.2 protein DELAY OF GERMINATION 1 isoform X2 [Cucumis sativus]1.27e-17999.61Show/hide
Query:  MDQEISFGEFFQKWMKEQNQYLTELISTAKGGNNMVAEALMKRVMEHYEHYYKVKSRWVEKDTLGILSPSWISSFEDAFLWLGGWRPTMAFHLLYSKSGL
        MDQEISFGEFFQKWMKEQNQYLTELISTAKGGNNMVAEALMKRVMEHYEHYYKVKSRWVEKDTLGILSPSWISSFEDAFLWLGGWRPTMAFHLLYSKSGL
Subjt:  MDQEISFGEFFQKWMKEQNQYLTELISTAKGGNNMVAEALMKRVMEHYEHYYKVKSRWVEKDTLGILSPSWISSFEDAFLWLGGWRPTMAFHLLYSKSGL

Query:  QLEGRLLDLIHGLSTGDLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEELKLALATKEC
        QLEGRLLDLIHGLSTGDLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEELKLALA KEC
Subjt:  QLEGRLLDLIHGLSTGDLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEELKLALATKEC

Query:  GLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKRDSDQR
        GLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKRDSDQR
Subjt:  GLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKRDSDQR

XP_031740486.1 protein DOG1-like 3 isoform X1 [Cucumis sativus]8.91e-17899.22Show/hide
Query:  MDQEISFGEFFQKWMKEQNQYLTELISTAKGGNNMVAEALMKRVMEHYEHYYKVKSRWVEKDTL-GILSPSWISSFEDAFLWLGGWRPTMAFHLLYSKSG
        MDQEISFGEFFQKWMKEQNQYLTELISTAKGGNNMVAEALMKRVMEHYEHYYKVKSRWVEKDTL GILSPSWISSFEDAFLWLGGWRPTMAFHLLYSKSG
Subjt:  MDQEISFGEFFQKWMKEQNQYLTELISTAKGGNNMVAEALMKRVMEHYEHYYKVKSRWVEKDTL-GILSPSWISSFEDAFLWLGGWRPTMAFHLLYSKSG

Query:  LQLEGRLLDLIHGLSTGDLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEELKLALATKE
        LQLEGRLLDLIHGLSTGDLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEELKLALA KE
Subjt:  LQLEGRLLDLIHGLSTGDLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEELKLALATKE

Query:  CGLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKRDSDQR
        CGLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKRDSDQR
Subjt:  CGLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKRDSDQR

TrEMBL top hitse value%identityAlignment
A0A0A0L3Q0 DOG1 domain-containing protein7.49e-181100Show/hide
Query:  MDQEISFGEFFQKWMKEQNQYLTELISTAKGGNNMVAEALMKRVMEHYEHYYKVKSRWVEKDTLGILSPSWISSFEDAFLWLGGWRPTMAFHLLYSKSGL
        MDQEISFGEFFQKWMKEQNQYLTELISTAKGGNNMVAEALMKRVMEHYEHYYKVKSRWVEKDTLGILSPSWISSFEDAFLWLGGWRPTMAFHLLYSKSGL
Subjt:  MDQEISFGEFFQKWMKEQNQYLTELISTAKGGNNMVAEALMKRVMEHYEHYYKVKSRWVEKDTLGILSPSWISSFEDAFLWLGGWRPTMAFHLLYSKSGL

Query:  QLEGRLLDLIHGLSTGDLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEELKLALATKEC
        QLEGRLLDLIHGLSTGDLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEELKLALATKEC
Subjt:  QLEGRLLDLIHGLSTGDLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEELKLALATKEC

Query:  GLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKRDSDQR
        GLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKRDSDQR
Subjt:  GLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKRDSDQR

A0A1S3B4N0 transcription factor HBP-1b(C38)-like isoform X14.89e-15890.08Show/hide
Query:  MDQEISFGEFFQKWMKEQNQYLTELISTAKGGNN------MVAEALMKRVMEHYEHYYKVKSRWVEKDTL-GILSPSWISSFEDAFLWLGGWRPTMAFHL
        MDQE SFGEFF+KWMKEQNQYLTELIST KGG N      +VAEALMKRVMEHYEHYY+VKS WVEKD L GILSPSWISSFEDAFLWLGGWRPTMAFHL
Subjt:  MDQEISFGEFFQKWMKEQNQYLTELISTAKGGNN------MVAEALMKRVMEHYEHYYKVKSRWVEKDTL-GILSPSWISSFEDAFLWLGGWRPTMAFHL

Query:  LYSKSGLQLEGRLLDLIHGLSTGDLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEELKL
        LYSKSGLQLEGRLLDLIHGLSTGDLADLSSHQV+KIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKM TSGGG QN+ +L+MVEEELKL
Subjt:  LYSKSGLQLEGRLLDLIHGLSTGDLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEELKL

Query:  ALATKECGLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKRDSDQR
        ALA KE GLKEVVKMADELRL TLKQIIGILT TQRVHFLIAAAELHLRIHEWGLKRDSDQR
Subjt:  ALATKECGLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKRDSDQR

A0A1S3B532 transcription factor TGA2-like isoform X27.01e-16090.42Show/hide
Query:  MDQEISFGEFFQKWMKEQNQYLTELISTAKGGNN------MVAEALMKRVMEHYEHYYKVKSRWVEKDTLGILSPSWISSFEDAFLWLGGWRPTMAFHLL
        MDQE SFGEFF+KWMKEQNQYLTELIST KGG N      +VAEALMKRVMEHYEHYY+VKS WVEKD LGILSPSWISSFEDAFLWLGGWRPTMAFHLL
Subjt:  MDQEISFGEFFQKWMKEQNQYLTELISTAKGGNN------MVAEALMKRVMEHYEHYYKVKSRWVEKDTLGILSPSWISSFEDAFLWLGGWRPTMAFHLL

Query:  YSKSGLQLEGRLLDLIHGLSTGDLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEELKLA
        YSKSGLQLEGRLLDLIHGLSTGDLADLSSHQV+KIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKM TSGGG QN+ +L+MVEEELKLA
Subjt:  YSKSGLQLEGRLLDLIHGLSTGDLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEELKLA

Query:  LATKECGLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKRDSDQR
        LA KE GLKEVVKMADELRL TLKQIIGILT TQRVHFLIAAAELHLRIHEWGLKRDSDQR
Subjt:  LATKECGLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKRDSDQR

A0A5A7THX1 Transcription factor TGA2-like isoform X23.40e-16090.42Show/hide
Query:  MDQEISFGEFFQKWMKEQNQYLTELISTAKGGNN------MVAEALMKRVMEHYEHYYKVKSRWVEKDTLGILSPSWISSFEDAFLWLGGWRPTMAFHLL
        MDQE SFGEFF+KWMKEQNQYLTELIST KGG N      +VAEALMKRVMEHYEHYY+VKS WVEKD LGILSPSWISSFEDAFLWLGGWRPTMAFHLL
Subjt:  MDQEISFGEFFQKWMKEQNQYLTELISTAKGGNN------MVAEALMKRVMEHYEHYYKVKSRWVEKDTLGILSPSWISSFEDAFLWLGGWRPTMAFHLL

Query:  YSKSGLQLEGRLLDLIHGLSTGDLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEELKLA
        YSKSGLQLEGRLLDLIHGLSTGDLADLSSHQV+KIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKM TSGGG QN+ +L+MVEEELKLA
Subjt:  YSKSGLQLEGRLLDLIHGLSTGDLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEELKLA

Query:  LATKECGLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKRDSDQR
        LA KE GLKEVVKMADELRL TLKQIIGILT TQRVHFLIAAAELHLRIHEWGLKRDSDQR
Subjt:  LATKECGLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKRDSDQR

A0A6J1EB45 protein DOG1-like 47.00e-10964.59Show/hide
Query:  SFGEFFQKWMKEQNQYLTELISTAKGGN------NMVAEALMKRVMEHYEHYYKVKSRWVEKDTLGILSPSWISSFEDAFLWLGGWRPTMAFHLLYSKSG
        +F EFF+ W+ EQNQYL+ELIS AK  +      +   + L+KRVME YEHYYKVKS W+++D L +L P+WISS ED FLWLGGWRP++AFHLLYSKSG
Subjt:  SFGEFFQKWMKEQNQYLTELISTAKGGN------NMVAEALMKRVMEHYEHYYKVKSRWVEKDTLGILSPSWISSFEDAFLWLGGWRPTMAFHLLYSKSG

Query:  LQLEGRLLDLIHGLSTGDLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGE-LNMVEEELKLALATK
        LQLEGRL +LIHGLSTGDLADLS  Q++K DTLQR  +K+E+EI+EKMAKYQETIADPSMVEL+H+A++ K       G++  E +  +EE+L +AL  +
Subjt:  LQLEGRLLDLIHGLSTGDLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGE-LNMVEEELKLALATK

Query:  ECGLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKRDSDQR
        E GLKE+VKMADELRL TLK+I+GILT +Q VHF IAAAELHLRIHEWGL+RD  +R
Subjt:  ECGLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKRDSDQR

SwissProt top hitse value%identityAlignment
A0SVK0 Protein DELAY OF GERMINATION 11.5e-2429.12Show/hide
Query:  FQKWMKEQNQYLTEL-------ISTAKGGNNMVAEALMKRVMEHYEHYYKVKSRWVEKDTLGILSPSWISSFEDAFLWLGGWRPTMAFHLLYSKSGLQLE
        + +WM  Q+Q + EL        S     N+     L  +++  +++Y   ++    + +    +P+W S  E+A +W+GG RP+  F L+Y+  G Q E
Subjt:  FQKWMKEQNQYLTEL-------ISTAKGGNNMVAEALMKRVMEHYEHYYKVKSRWVEKDTLGILSPSWISSFEDAFLWLGGWRPTMAFHLLYSKSGLQLE

Query:  GRLLDLIHGL----STG-----DLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEELKLA
         R+   +  +    S+G      L+DLS+ Q+ KI+ L   ++ +E+++T+K++  QE  AD  +  +++              +N GE N+V ++   A
Subjt:  GRLLDLIHGL----STG-----DLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEELKLA

Query:  LATKECGLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKRDSDQR
        L  +E  +  ++  AD LR++TL +I+GIL+  Q   FL+A  +LHL +HEWG  RD  +R
Subjt:  LATKECGLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKRDSDQR

Q58FV0 Protein DOG1-like 33.3e-2428.4Show/hide
Query:  FQKWMKEQNQYLTEL---ISTAKGGNNMVAEALMKRVMEHYEHYYKVKSRWVEKDTLGILSPSWISSFEDAFLWLGGWRPTMAFHLLYSKSGLQLEGRL-
        + +WM  Q +++ +L   + + +   +   E L+ +++  ++ Y + +S    +      +PSW S  E+  LW+GG RP+    ++YS  G Q E +L 
Subjt:  FQKWMKEQNQYLTEL---ISTAKGGNNMVAEALMKRVMEHYEHYYKVKSRWVEKDTLGILSPSWISSFEDAFLWLGGWRPTMAFHLLYSKSGLQLEGRL-

Query:  ---------LDLIHGLSTGDLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEELKLALAT
                 +++ HG   G ++DL++ Q+ KI+ L   V+++E +IT+K A  QE +AD   + ++  AT    G            ++V E+   AL  
Subjt:  ---------LDLIHGLSTGDLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEELKLALAT

Query:  KECGLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKRDSDQ
         E G+  ++  AD+LR ETL++I+ ++T  Q   FL+A   LH+ +HEWG  R+  +
Subjt:  KECGLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKRDSDQ

Q84JC2 Protein DOG1-like 47.5e-2131.08Show/hide
Query:  EISFGEFFQKWMKEQNQYLTELISTAKGGNNMVAEA----LMKRVMEHYEHYYKVKSRWVEKDTLGILSPSWISSFEDAFLWLGGWRPTMAFHLLYSKSG
        E  F EF++ W+ +   YL +L+      NN ++E     L+ ++  H++ YY  K   + +D L      W++  E+A  WL GW+P+M F        
Subjt:  EISFGEFFQKWMKEQNQYLTELISTAKGGNNMVAEA----LMKRVMEHYEHYYKVKSRWVEKDTLGILSPSWISSFEDAFLWLGGWRPTMAFHLLYSKSG

Query:  LQLEGRLLDLIHGLSTGDLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEELKLALATKE
             R++D +          L   QV K++ L+      E++I  +M +YQ  +AD  MVEL+ +           G    GE  MV E    A+    
Subjt:  LQLEGRLLDLIHGLSTGDLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEELKLALATKE

Query:  CGLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKR
         GL+++VK AD +RL+TLK I+ ILT  Q V FL AAA   +++  WG +R
Subjt:  CGLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKR

Q9SN45 Protein DOG1-like 21.1e-1926.98Show/hide
Query:  FQKWMKEQNQYLTELIST--AKGGNNMVAEALMKRVMEHYEHYYKVKSRWVEKDTLGILSPSWISSFEDAFLWLGGWRPTMAFHLLYSKSGLQLEGRL--
        + +WM  Q +++ +L      +  N+   E L+ +++  Y  Y   +S    +      +PSW +  E++ LW+GG RP+    L+Y+  G Q E +L  
Subjt:  FQKWMKEQNQYLTELIST--AKGGNNMVAEALMKRVMEHYEHYYKVKSRWVEKDTLGILSPSWISSFEDAFLWLGGWRPTMAFHLLYSKSGLQLEGRL--

Query:  --------LDLIHGLSTGDLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEELKLALATK
                 D+ HG   G ++DL++ Q+ K++ L   V+K+E +IT+  A +Q+ +AD  + ++ H                       +  ++ AL   
Subjt:  --------LDLIHGLSTGDLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEELKLALATK

Query:  ECGLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKR
        E G+  ++  AD+LR ETL++I+ ++T  Q V FL+A   L L +H+ G  R
Subjt:  ECGLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKR

Q9SN47 Protein DOG1-like 15.8e-2128.79Show/hide
Query:  FQKWMKEQNQYLTEL---ISTAKGGNNMVAEALMKRVMEHYEHYYKVKSRWVEKDTLGILSPSWISSFEDAFLWLGGWRPTMAFHLLYSKSGLQLEGRLL
        + +WM  Q + +TEL   IST +  +N + + L++  +  +  Y + +S    + +    +P+W +  E+A LW+GG RP+    L+Y+  G Q E RL 
Subjt:  FQKWMKEQNQYLTEL---ISTAKGGNNMVAEALMKRVMEHYEHYYKVKSRWVEKDTLGILSPSWISSFEDAFLWLGGWRPTMAFHLLYSKSGLQLEGRLL

Query:  DLIH-------------------GLSTGD-LADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMV
        +  +                   G+  G+ ++DL++ Q+ KI+ L    V+ E ++T+  A  QE  AD  +     +A  +K        +  G+ ++V
Subjt:  DLIH-------------------GLSTGD-LADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMV

Query:  EEELKLALATKECGLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKRD
         E    AL   E  +  ++  AD+LR+ TL +I+ ILT  Q   FL+A  +LHL +HEWG  R+
Subjt:  EEELKLALATKECGLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKRD

Arabidopsis top hitse value%identityAlignment
AT3G14880.1 BEST Arabidopsis thaliana protein match is: transcription factor-related (TAIR:AT4G18650.1)5.7e-3232.31Show/hide
Query:  SFGEFFQKWMKEQNQYLTELIS-------TAKGGNNMVAEALMKRVMEHYEHYYKVKSRWVEKDTLGILSPSWISSFEDAFLWLGGWRPTMAFHLLYSKS
        SF +F Q W+++   +L  L S       +A G    + EA + RVMEH+  Y++ K    +KD + +++  W S+ E +  W+GGWRPT  FHL+Y++S
Subjt:  SFGEFFQKWMKEQNQYLTELIS-------TAKGGNNMVAEALMKRVMEHYEHYYKVKSRWVEKDTLGILSPSWISSFEDAFLWLGGWRPTMAFHLLYSKS

Query:  GLQLEGRLLDLIHGLSTGDLADLSSHQ----VIK---IDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEEL
         +  E R++D++ G  TGDL+DLS  Q     +K   +  LQ   VK+E  ITE+++++Q+  +D              MGTS    Q            
Subjt:  GLQLEGRLLDLIHGLSTGDLADLSSHQ----VIK---IDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEEL

Query:  KLALATKECGLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKRD
                  L E+V   D+LRL T+ +++ +L+  Q+  FL+AAAEL   +  WG   D
Subjt:  KLALATKECGLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKRD

AT3G14880.2 FUNCTIONS IN: molecular_function unknown6.1e-3432.81Show/hide
Query:  SFGEFFQKWMKEQNQYLTELIS-------TAKGGNNMVAEALMKRVMEHYEHYYKVKSRWVEKDTLGILSPSWISSFEDAFLWLGGWRPTMAFHLLYSKS
        SF +F Q W+++   +L  L S       +A G    + EA + RVMEH+  Y++ K    +KD + +++  W S+ E +  W+GGWRPT  FHL+Y++S
Subjt:  SFGEFFQKWMKEQNQYLTELIS-------TAKGGNNMVAEALMKRVMEHYEHYYKVKSRWVEKDTLGILSPSWISSFEDAFLWLGGWRPTMAFHLLYSKS

Query:  GLQLEGRLLDLIHGLSTGDLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEELKLALATK
         +  E R++D++ G  TGDL+DLS  Q   +  LQ   VK+E  ITE+++++Q+  +D              MGTS    Q                   
Subjt:  GLQLEGRLLDLIHGLSTGDLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEELKLALATK

Query:  ECGLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKRD
           L E+V   D+LRL T+ +++ +L+  Q+  FL+AAAEL   +  WG   D
Subjt:  ECGLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKRD

AT4G18650.1 transcription factor-related5.4e-2231.08Show/hide
Query:  EISFGEFFQKWMKEQNQYLTELISTAKGGNNMVAEA----LMKRVMEHYEHYYKVKSRWVEKDTLGILSPSWISSFEDAFLWLGGWRPTMAFHLLYSKSG
        E  F EF++ W+ +   YL +L+      NN ++E     L+ ++  H++ YY  K   + +D L      W++  E+A  WL GW+P+M F        
Subjt:  EISFGEFFQKWMKEQNQYLTELISTAKGGNNMVAEA----LMKRVMEHYEHYYKVKSRWVEKDTLGILSPSWISSFEDAFLWLGGWRPTMAFHLLYSKSG

Query:  LQLEGRLLDLIHGLSTGDLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEELKLALATKE
             R++D +          L   QV K++ L+      E++I  +M +YQ  +AD  MVEL+ +           G    GE  MV E    A+    
Subjt:  LQLEGRLLDLIHGLSTGDLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEELKLALATKE

Query:  CGLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKR
         GL+++VK AD +RL+TLK I+ ILT  Q V FL AAA   +++  WG +R
Subjt:  CGLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKR

AT4G18690.1 unknown protein2.3e-2528.4Show/hide
Query:  FQKWMKEQNQYLTEL---ISTAKGGNNMVAEALMKRVMEHYEHYYKVKSRWVEKDTLGILSPSWISSFEDAFLWLGGWRPTMAFHLLYSKSGLQLEGRL-
        + +WM  Q +++ +L   + + +   +   E L+ +++  ++ Y + +S    +      +PSW S  E+  LW+GG RP+    ++YS  G Q E +L 
Subjt:  FQKWMKEQNQYLTEL---ISTAKGGNNMVAEALMKRVMEHYEHYYKVKSRWVEKDTLGILSPSWISSFEDAFLWLGGWRPTMAFHLLYSKSGLQLEGRL-

Query:  ---------LDLIHGLSTGDLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEELKLALAT
                 +++ HG   G ++DL++ Q+ KI+ L   V+++E +IT+K A  QE +AD   + ++  AT    G            ++V E+   AL  
Subjt:  ---------LDLIHGLSTGDLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEELKLALAT

Query:  KECGLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKRDSDQ
         E G+  ++  AD+LR ETL++I+ ++T  Q   FL+A   LH+ +HEWG  R+  +
Subjt:  KECGLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKRDSDQ

AT5G45830.1 delay of germination 11.0e-2529.12Show/hide
Query:  FQKWMKEQNQYLTEL-------ISTAKGGNNMVAEALMKRVMEHYEHYYKVKSRWVEKDTLGILSPSWISSFEDAFLWLGGWRPTMAFHLLYSKSGLQLE
        + +WM  Q+Q + EL        S     N+     L  +++  +++Y   ++    + +    +P+W S  E+A +W+GG RP+  F L+Y+  G Q E
Subjt:  FQKWMKEQNQYLTEL-------ISTAKGGNNMVAEALMKRVMEHYEHYYKVKSRWVEKDTLGILSPSWISSFEDAFLWLGGWRPTMAFHLLYSKSGLQLE

Query:  GRLLDLIHGL----STG-----DLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEELKLA
         R+   +  +    S+G      L+DLS+ Q+ KI+ L   ++ +E+++T+K++  QE  AD  +  +++              +N GE N+V ++   A
Subjt:  GRLLDLIHGL----STG-----DLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEELKLA

Query:  LATKECGLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKRDSDQR
        L  +E  +  ++  AD LR++TL +I+GIL+  Q   FL+A  +LHL +HEWG  RD  +R
Subjt:  LATKECGLKEVVKMADELRLETLKQIIGILTLTQRVHFLIAAAELHLRIHEWGLKRDSDQR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCAAGAAATTAGCTTTGGAGAATTCTTCCAAAAATGGATGAAGGAGCAAAACCAATATCTAACCGAGCTCATTTCCACTGCGAAAGGTGGCAACAACATGGTGGC
TGAGGCATTGATGAAGCGAGTAATGGAACACTATGAGCATTACTACAAGGTGAAATCACGTTGGGTGGAAAAAGATACATTGGGTATACTAAGCCCGTCATGGATCTCAT
CATTTGAAGACGCATTCCTATGGCTAGGAGGATGGAGACCAACTATGGCATTTCACTTACTCTACTCTAAGTCCGGCCTCCAGCTTGAGGGTCGTCTTCTAGACTTAATC
CATGGGCTCTCCACGGGGGACCTCGCGGACCTTTCCTCACACCAAGTTATCAAAATCGACACCTTACAAAGGGGTGTTGTAAAGCAAGAGAAGGAAATAACAGAGAAAAT
GGCTAAGTATCAAGAAACAATTGCAGATCCATCAATGGTGGAGCTTTCTCATATGGCAACAAAGTTCAAAATGGGAACATCAGGAGGAGGAGGACAGAATGATGGTGAAT
TAAATATGGTAGAGGAAGAGTTAAAATTGGCTCTGGCAACAAAAGAGTGTGGTTTGAAAGAAGTTGTGAAGATGGCTGACGAATTACGTCTTGAAACTTTGAAACAAATT
ATTGGGATCTTGACATTAACGCAGAGAGTTCATTTCTTGATAGCTGCTGCTGAATTGCATTTGAGGATTCATGAGTGGGGATTGAAAAGAGATTCGGATCAACGTTAG
mRNA sequenceShow/hide mRNA sequence
CTCCTTCAATCTCACATATATCCATATATAAATCCTACCCTTAAATCAGATGAAATTAAACTATCTAGAGAGCAAAAACTAACTTATTCTCATTTATTCAACAAGCAAAT
ATGGATCAAGAAATTAGCTTTGGAGAATTCTTCCAAAAATGGATGAAGGAGCAAAACCAATATCTAACCGAGCTCATTTCCACTGCGAAAGGTGGCAACAACATGGTGGC
TGAGGCATTGATGAAGCGAGTAATGGAACACTATGAGCATTACTACAAGGTGAAATCACGTTGGGTGGAAAAAGATACATTGGGTATACTAAGCCCGTCATGGATCTCAT
CATTTGAAGACGCATTCCTATGGCTAGGAGGATGGAGACCAACTATGGCATTTCACTTACTCTACTCTAAGTCCGGCCTCCAGCTTGAGGGTCGTCTTCTAGACTTAATC
CATGGGCTCTCCACGGGGGACCTCGCGGACCTTTCCTCACACCAAGTTATCAAAATCGACACCTTACAAAGGGGTGTTGTAAAGCAAGAGAAGGAAATAACAGAGAAAAT
GGCTAAGTATCAAGAAACAATTGCAGATCCATCAATGGTGGAGCTTTCTCATATGGCAACAAAGTTCAAAATGGGAACATCAGGAGGAGGAGGACAGAATGATGGTGAAT
TAAATATGGTAGAGGAAGAGTTAAAATTGGCTCTGGCAACAAAAGAGTGTGGTTTGAAAGAAGTTGTGAAGATGGCTGACGAATTACGTCTTGAAACTTTGAAACAAATT
ATTGGGATCTTGACATTAACGCAGAGAGTTCATTTCTTGATAGCTGCTGCTGAATTGCATTTGAGGATTCATGAGTGGGGATTGAAAAGAGATTCGGATCAACGTTAGGA
AGAAAAAATTAACAACTACTTTTCATTTATTAAATATGTCTTTTGTTGGGTCTTTCTTTTGGTGTTCTCTCAATGCTTAGTTTACGAAGAAAAGCTAAAGCTCTTATGTC
G
Protein sequenceShow/hide protein sequence
MDQEISFGEFFQKWMKEQNQYLTELISTAKGGNNMVAEALMKRVMEHYEHYYKVKSRWVEKDTLGILSPSWISSFEDAFLWLGGWRPTMAFHLLYSKSGLQLEGRLLDLI
HGLSTGDLADLSSHQVIKIDTLQRGVVKQEKEITEKMAKYQETIADPSMVELSHMATKFKMGTSGGGGQNDGELNMVEEELKLALATKECGLKEVVKMADELRLETLKQI
IGILTLTQRVHFLIAAAELHLRIHEWGLKRDSDQR