; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MELO3C024442 (gene) of Melon (DHL92) v4 genome

Gene IDMELO3C024442
OrganismCucumis melo DHL92 (Melon (DHL92) v4)
Descriptionprotein SAWADEE HOMEODOMAIN HOMOLOG 1-like
Genome locationchr01:35702126..35706665
RNA-Seq ExpressionMELO3C024442
SyntenyMELO3C024442
Gene Ontology termsGO:0003677 - DNA binding (molecular function)
GO:0003682 - chromatin binding (molecular function)
InterPro domainsIPR039276 - Protein SAWADEE HOMEODOMAIN HOMOLOG 1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0051041.1 protein SAWADEE HOMEODOMAIN-like protein 1-like isoform X2 [Cucumis melo var. makuwa]7.8e-7797.47Show/hide
Query:  MEHPKSSKLLDDSSFEFTLAE----IVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFENRRKELRSSSKKARPPPPPPPPPEL
        MEHPKSSKLLDDSSFEFTLAE    IVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFENRRKELRSSSKKARPPPPPPPPPEL
Subjt:  MEHPKSSKLLDDSSFEFTLAE----IVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFENRRKELRSSSKKARPPPPPPPPPEL

Query:  PPSPSSPPPTPPPKLLLYHSESDFLTHAPSSEPPEFIGKATDLSELAFEAFSSRDHAW
        PPSPSSPPPTPPPKLLLYHSESDFLTHAPSSEPPEFIGKATDLSELAFEAFSSRDHAW
Subjt:  PPSPSSPPPTPPPKLLLYHSESDFLTHAPSSEPPEFIGKATDLSELAFEAFSSRDHAW

TYK03836.1 protein SAWADEE HOMEODOMAIN-like protein 1-like isoform X1 [Cucumis melo var. makuwa]4.3e-8375.97Show/hide
Query:  MEHPKSSKLLDDSSFEFTLAE----IVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFENRRKELRSSSKKARPPPPPPPPPEL
        MEHPKSSKLLDDSSFEFTLAE    IVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFENRRKELRSSSKKARPPPPPPPPPEL
Subjt:  MEHPKSSKLLDDSSFEFTLAE----IVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFENRRKELRSSSKKARPPPPPPPPPEL

Query:  PPSPSSPPPTPPPKLLLYHSESDFLTHAPSSEPPEFIGKATDLSELAFEAFSSRDHA-------------------------------------------
        PPSPSSPPPTPPPKLLLYHSESDFLTHAPSSEPPEFIGKATDLSELAFEAFSSRDHA                                           
Subjt:  PPSPSSPPPTPPPKLLLYHSESDFLTHAPSSEPPEFIGKATDLSELAFEAFSSRDHA-------------------------------------------

Query:  -W--------LQMHKLRHCFSTVVYHYETTGKK
         W        LQMHKLRHCFSTVVYHYETTGKK
Subjt:  -W--------LQMHKLRHCFSTVVYHYETTGKK

XP_004141662.1 protein SAWADEE HOMEODOMAIN HOMOLOG 1 [Cucumis sativus]6.0e-6989.51Show/hide
Query:  MEHPKSSKLLDDSSFEFTLAEIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFENRRKELRSSSKKARPP------PPPPPPP
        MEH KSSKLLDDSSFEFTLAEIVEMDNILKDS DQTLGQEFFQDVALHFSCSPWRAAKSPVT EHVHAWFENRRKELR+SSKKARPP      PPPPPP 
Subjt:  MEHPKSSKLLDDSSFEFTLAEIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFENRRKELRSSSKKARPP------PPPPPPP

Query:  ELP--PSPSSPPPTPPPKLLLYHSESDFLTHAPSSEPPEFIGKATDLSELAFEAFSSRDHAW
        ELP  P+PSSPPP+PPPKLLLYHSESDFLTHAPSS PPEF GKATDLSELAFEAFSSRDHAW
Subjt:  ELP--PSPSSPPPTPPPKLLLYHSESDFLTHAPSSEPPEFIGKATDLSELAFEAFSSRDHAW

XP_008462434.1 PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 1-like isoform X1 [Cucumis melo]2.1e-93100Show/hide
Query:  MEHPKSSKLLDDSSFEFTLAEIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFENRRKELRSSSKKARPPPPPPPPPELPPSP
        MEHPKSSKLLDDSSFEFTLAEIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFENRRKELRSSSKKARPPPPPPPPPELPPSP
Subjt:  MEHPKSSKLLDDSSFEFTLAEIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFENRRKELRSSSKKARPPPPPPPPPELPPSP

Query:  SSPPPTPPPKLLLYHSESDFLTHAPSSEPPEFIGKATDLSELAFEAFSSRDHAWLQMHKLRHCFSTVVYHYETTGKK
        SSPPPTPPPKLLLYHSESDFLTHAPSSEPPEFIGKATDLSELAFEAFSSRDHAWLQMHKLRHCFSTVVYHYETTGKK
Subjt:  SSPPPTPPPKLLLYHSESDFLTHAPSSEPPEFIGKATDLSELAFEAFSSRDHAWLQMHKLRHCFSTVVYHYETTGKK

XP_008462435.1 PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 1-like isoform X2 [Cucumis melo]1.4e-78100Show/hide
Query:  MEHPKSSKLLDDSSFEFTLAEIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFENRRKELRSSSKKARPPPPPPPPPELPPSP
        MEHPKSSKLLDDSSFEFTLAEIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFENRRKELRSSSKKARPPPPPPPPPELPPSP
Subjt:  MEHPKSSKLLDDSSFEFTLAEIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFENRRKELRSSSKKARPPPPPPPPPELPPSP

Query:  SSPPPTPPPKLLLYHSESDFLTHAPSSEPPEFIGKATDLSELAFEAFSSRDHAW
        SSPPPTPPPKLLLYHSESDFLTHAPSSEPPEFIGKATDLSELAFEAFSSRDHAW
Subjt:  SSPPPTPPPKLLLYHSESDFLTHAPSSEPPEFIGKATDLSELAFEAFSSRDHAW

TrEMBL top hitse value%identityAlignment
A0A0A0KCT9 SAWADEE domain-containing protein2.9e-6989.51Show/hide
Query:  MEHPKSSKLLDDSSFEFTLAEIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFENRRKELRSSSKKARPP------PPPPPPP
        MEH KSSKLLDDSSFEFTLAEIVEMDNILKDS DQTLGQEFFQDVALHFSCSPWRAAKSPVT EHVHAWFENRRKELR+SSKKARPP      PPPPPP 
Subjt:  MEHPKSSKLLDDSSFEFTLAEIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFENRRKELRSSSKKARPP------PPPPPPP

Query:  ELP--PSPSSPPPTPPPKLLLYHSESDFLTHAPSSEPPEFIGKATDLSELAFEAFSSRDHAW
        ELP  P+PSSPPP+PPPKLLLYHSESDFLTHAPSS PPEF GKATDLSELAFEAFSSRDHAW
Subjt:  ELP--PSPSSPPPTPPPKLLLYHSESDFLTHAPSSEPPEFIGKATDLSELAFEAFSSRDHAW

A0A1S3CHG6 protein SAWADEE HOMEODOMAIN HOMOLOG 1-like isoform X19.9e-94100Show/hide
Query:  MEHPKSSKLLDDSSFEFTLAEIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFENRRKELRSSSKKARPPPPPPPPPELPPSP
        MEHPKSSKLLDDSSFEFTLAEIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFENRRKELRSSSKKARPPPPPPPPPELPPSP
Subjt:  MEHPKSSKLLDDSSFEFTLAEIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFENRRKELRSSSKKARPPPPPPPPPELPPSP

Query:  SSPPPTPPPKLLLYHSESDFLTHAPSSEPPEFIGKATDLSELAFEAFSSRDHAWLQMHKLRHCFSTVVYHYETTGKK
        SSPPPTPPPKLLLYHSESDFLTHAPSSEPPEFIGKATDLSELAFEAFSSRDHAWLQMHKLRHCFSTVVYHYETTGKK
Subjt:  SSPPPTPPPKLLLYHSESDFLTHAPSSEPPEFIGKATDLSELAFEAFSSRDHAWLQMHKLRHCFSTVVYHYETTGKK

A0A1S3CIG7 protein SAWADEE HOMEODOMAIN HOMOLOG 1-like isoform X26.9e-79100Show/hide
Query:  MEHPKSSKLLDDSSFEFTLAEIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFENRRKELRSSSKKARPPPPPPPPPELPPSP
        MEHPKSSKLLDDSSFEFTLAEIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFENRRKELRSSSKKARPPPPPPPPPELPPSP
Subjt:  MEHPKSSKLLDDSSFEFTLAEIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFENRRKELRSSSKKARPPPPPPPPPELPPSP

Query:  SSPPPTPPPKLLLYHSESDFLTHAPSSEPPEFIGKATDLSELAFEAFSSRDHAW
        SSPPPTPPPKLLLYHSESDFLTHAPSSEPPEFIGKATDLSELAFEAFSSRDHAW
Subjt:  SSPPPTPPPKLLLYHSESDFLTHAPSSEPPEFIGKATDLSELAFEAFSSRDHAW

A0A5A7UBT5 Protein SAWADEE HOMEODOMAIN-like protein 1-like isoform X23.8e-7797.47Show/hide
Query:  MEHPKSSKLLDDSSFEFTLAE----IVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFENRRKELRSSSKKARPPPPPPPPPEL
        MEHPKSSKLLDDSSFEFTLAE    IVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFENRRKELRSSSKKARPPPPPPPPPEL
Subjt:  MEHPKSSKLLDDSSFEFTLAE----IVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFENRRKELRSSSKKARPPPPPPPPPEL

Query:  PPSPSSPPPTPPPKLLLYHSESDFLTHAPSSEPPEFIGKATDLSELAFEAFSSRDHAW
        PPSPSSPPPTPPPKLLLYHSESDFLTHAPSSEPPEFIGKATDLSELAFEAFSSRDHAW
Subjt:  PPSPSSPPPTPPPKLLLYHSESDFLTHAPSSEPPEFIGKATDLSELAFEAFSSRDHAW

A0A5D3C067 Protein SAWADEE HOMEODOMAIN-like protein 1-like isoform X12.1e-8375.97Show/hide
Query:  MEHPKSSKLLDDSSFEFTLAE----IVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFENRRKELRSSSKKARPPPPPPPPPEL
        MEHPKSSKLLDDSSFEFTLAE    IVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFENRRKELRSSSKKARPPPPPPPPPEL
Subjt:  MEHPKSSKLLDDSSFEFTLAE----IVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFENRRKELRSSSKKARPPPPPPPPPEL

Query:  PPSPSSPPPTPPPKLLLYHSESDFLTHAPSSEPPEFIGKATDLSELAFEAFSSRDHA-------------------------------------------
        PPSPSSPPPTPPPKLLLYHSESDFLTHAPSSEPPEFIGKATDLSELAFEAFSSRDHA                                           
Subjt:  PPSPSSPPPTPPPKLLLYHSESDFLTHAPSSEPPEFIGKATDLSELAFEAFSSRDHA-------------------------------------------

Query:  -W--------LQMHKLRHCFSTVVYHYETTGKK
         W        LQMHKLRHCFSTVVYHYETTGKK
Subjt:  -W--------LQMHKLRHCFSTVVYHYETTGKK

SwissProt top hitse value%identityAlignment
Q9XI47 Protein SAWADEE HOMEODOMAIN HOMOLOG 11.1e-1241.06Show/hide
Query:  DDSSF---EFTLAEIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFENRRKELRSSSKKARPPPPPPPPPELPPSPSSPPPTP
        DDSS    EFTL+EIV+M+N+ K+ GDQ+L ++F Q VA  FSCS  R  KS +T + V  WF+ + K    S  K++  P PP       +PSS     
Subjt:  DDSSF---EFTLAEIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFENRRKELRSSSKKARPPPPPPPPPELPPSPSSPPPTP

Query:  PPKLLLYHSESDFLTHAPSSEPPEFI----GKATDLSELAFEAFSSRDHAW
              Y S +   T   +S    F+    GKA+DL++LAFEA S+RD+AW
Subjt:  PPKLLLYHSESDFLTHAPSSEPPEFI----GKATDLSELAFEAFSSRDHAW

Arabidopsis top hitse value%identityAlignment
AT1G15215.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors3.3e-0938.06Show/hide
Query:  MDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFENRRKELRSSSKKARPPPPPPPPPELPPSPSSPPPTPPPKLLLYHSESDFLTHA
        M+N+ K+ GDQ+L ++F Q VA  FSCS  R  KS +T + V  WF+ + K    S  K++  P PP       +PSS           Y S +   T  
Subjt:  MDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFENRRKELRSSSKKARPPPPPPPPPELPPSPSSPPPTPPPKLLLYHSESDFLTHA

Query:  PSSEPPEFI----GKATDLSELAFEAFSSRDHAW
         +S    F+    GKA+DL++LAFEA S+RD+AW
Subjt:  PSSEPPEFI----GKATDLSELAFEAFSSRDHAW

AT1G15215.2 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors7.7e-1441.06Show/hide
Query:  DDSSF---EFTLAEIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFENRRKELRSSSKKARPPPPPPPPPELPPSPSSPPPTP
        DDSS    EFTL+EIV+M+N+ K+ GDQ+L ++F Q VA  FSCS  R  KS +T + V  WF+ + K    S  K++  P PP       +PSS     
Subjt:  DDSSF---EFTLAEIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFENRRKELRSSSKKARPPPPPPPPPELPPSPSSPPPTP

Query:  PPKLLLYHSESDFLTHAPSSEPPEFI----GKATDLSELAFEAFSSRDHAW
              Y S +   T   +S    F+    GKA+DL++LAFEA S+RD+AW
Subjt:  PPKLLLYHSESDFLTHAPSSEPPEFI----GKATDLSELAFEAFSSRDHAW

AT1G15215.3 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors7.7e-1441.06Show/hide
Query:  DDSSF---EFTLAEIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFENRRKELRSSSKKARPPPPPPPPPELPPSPSSPPPTP
        DDSS    EFTL+EIV+M+N+ K+ GDQ+L ++F Q VA  FSCS  R  KS +T + V  WF+ + K    S  K++  P PP       +PSS     
Subjt:  DDSSF---EFTLAEIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFENRRKELRSSSKKARPPPPPPPPPELPPSPSSPPPTP

Query:  PPKLLLYHSESDFLTHAPSSEPPEFI----GKATDLSELAFEAFSSRDHAW
              Y S +   T   +S    F+    GKA+DL++LAFEA S+RD+AW
Subjt:  PPKLLLYHSESDFLTHAPSSEPPEFI----GKATDLSELAFEAFSSRDHAW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCATCCAAAATCGTCAAAACTTTTAGATGATTCTTCATTCGAATTCACACTCGCTGAGATTGTGGAGATGGATAATATCTTGAAGGACTCTGGAGATCAAACACT
TGGTCAAGAGTTCTTCCAAGATGTCGCGCTTCATTTCAGTTGCTCTCCGTGGCGCGCTGCAAAATCTCCCGTCACTGCAGAACATGTGCATGCCTGGTTTGAGAATCGAA
GAAAGGAATTGCGAAGTAGTTCTAAGAAAGCTCGGCCTCCACCTCCACCTCCGCCTCCGCCTGAACTTCCACCTTCGCCATCGTCTCCCCCTCCGACTCCGCCACCGAAA
CTCTTGCTTTATCATTCGGAGAGTGATTTTTTAACTCACGCGCCTTCATCTGAACCACCTGAATTCATAGGCAAGGCAACTGATCTTTCGGAGTTAGCATTTGAGGCCTT
TTCGTCAAGAGACCATGCCTGGTTGCAAATGCACAAATTACGCCATTGTTTTTCTACTGTGGTTTATCATTATGAAACCACTGGAAAAAAAGTAAGCAAATATTTAGACG
TTGAAGAGAAAAGCCTCATGATCTGTTGTGCTTTTTATGCTATTGCAACAAACTTCACAGGATGCTCGAGTTCGATATGCTGGTTTCGGAAAGGATGA
mRNA sequenceShow/hide mRNA sequence
GAGACATAAGGAGGAGTTCTCAGTTCTCAACCCCGACAAGCGGACTTCTTACTTCCAATCACTTCTTCTTTCCGCGCTTTGCATAGTTTCTTCTGTTTCTGCTTATGGAG
CATCCAAAATCGTCAAAACTTTTAGATGATTCTTCATTCGAATTCACACTCGCTGAGATTGTGGAGATGGATAATATCTTGAAGGACTCTGGAGATCAAACACTTGGTCA
AGAGTTCTTCCAAGATGTCGCGCTTCATTTCAGTTGCTCTCCGTGGCGCGCTGCAAAATCTCCCGTCACTGCAGAACATGTGCATGCCTGGTTTGAGAATCGAAGAAAGG
AATTGCGAAGTAGTTCTAAGAAAGCTCGGCCTCCACCTCCACCTCCGCCTCCGCCTGAACTTCCACCTTCGCCATCGTCTCCCCCTCCGACTCCGCCACCGAAACTCTTG
CTTTATCATTCGGAGAGTGATTTTTTAACTCACGCGCCTTCATCTGAACCACCTGAATTCATAGGCAAGGCAACTGATCTTTCGGAGTTAGCATTTGAGGCCTTTTCGTC
AAGAGACCATGCCTGGTTGCAAATGCACAAATTACGCCATTGTTTTTCTACTGTGGTTTATCATTATGAAACCACTGGAAAAAAAGTAAGCAAATATTTAGACGTTGAAG
AGAAAAGCCTCATGATCTGTTGTGCTTTTTATGCTATTGCAACAAACTTCACAGGATGCTCGAGTTCGATATGCTGGTTTCGGAAAGGATGAGGATGAGTGGGTTAATGT
TGCAAGAGGAGTGCGTGATCGGTCTATACCTTTGGAATCTTCTGAGTGTTACAGAGTGAAAGTTGGAGATCTTGTTTTATGTTTCCGGGAAAGACAAGATCATGCACTCT
ACTTCGATGCATATGTTGTGGAAATTCAGAGGAGGCTACATGATATTGGTGGTTGTCGATGCATATTTGTGGTACGCTATGAGCATGATCACTATGAGGAAAAAGTGCAT
ATAGGGAGATTGTGCTGCAGGCCTTCCGCGTTCAACTCTGACCGAATTTAATGCATCCTAGAAACAGAAACCCAGATCTTTATCCTCAAAAAGTACATTTGGAAGGTTGA
TGTTCTCATTTAGGATAAGAACTTGTTGAGGTCTGGGGCCACTTATTAAAACTGAAGAACAAGAGAGGCCCCACTTTACCGAAGAATCAATGGAAGTGGCTTCGGTTGCT
TTGTATTATGTTTGTTAGGGAGGAGAGAGATATGAAAACCAAAGTTGTAAGGCTATATAAATTATCATTGATCGATAATAATGCATATTTTTCTATAAATGGCAATAATT
CACAAAGCATCCTAACAGATAATAACTAAACTAACTTAATTACATTTAACTCAGACTAAGGTCTTCAAAAGCTGTTGTTCTGAGCTTGGGTCTCATCATGTGCCTTTCTC
TCTTTTTTTGTTAGGACAGACGGATAATAGAAGCTTCTCACTCATCACGAAACTGTTCTTCCATAACATACGGATGC
Protein sequenceShow/hide protein sequence
MEHPKSSKLLDDSSFEFTLAEIVEMDNILKDSGDQTLGQEFFQDVALHFSCSPWRAAKSPVTAEHVHAWFENRRKELRSSSKKARPPPPPPPPPELPPSPSSPPPTPPPK
LLLYHSESDFLTHAPSSEPPEFIGKATDLSELAFEAFSSRDHAWLQMHKLRHCFSTVVYHYETTGKKVSKYLDVEEKSLMICCAFYAIATNFTGCSSSICWFRKG