; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh17G007380 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh17G007380
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Descriptionprotein SAWADEE HOMEODOMAIN HOMOLOG 1-like
Genome locationCmo_Chr17:7345237..7350438
RNA-Seq ExpressionCmoCh17G007380
SyntenyCmoCh17G007380
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsIPR039276 - Protein SAWADEE HOMEODOMAIN HOMOLOG 1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0051041.1 protein SAWADEE HOMEODOMAIN-like protein 1-like isoform X2 [Cucumis melo var. makuwa]3.6e-2658.78Show/hide
Query:  LRYSCKKARPPLPPLH----LPSPS--PPTPPPKLVLNYSDTTFLTDAPSSEPPEFKGNATDLSRLAFKAFSSIDHAWCDVASLLSYRVNCHGELKKVRR
        LR S KKARPP PP       PSPS  PPTPPPKL+L +S++ FLT APSSEPPEF G ATDLS LAF+AFSS DHAW DVAS L+YR+NCHGEL    R
Subjt:  LRYSCKKARPPLPPLH----LPSPS--PPTPPPKLVLNYSDTTFLTDAPSSEPPEFKGNATDLSRLAFKAFSSIDHAWCDVASLLSYRVNCHGELKKVRR

Query:  YL----DEEKSLIICRAL--YSIALEQTSCH
        Y     DE++ + + R +   SI LE + C+
Subjt:  YL----DEEKSLIICRAL--YSIALEQTSCH

XP_004141662.1 protein SAWADEE HOMEODOMAIN HOMOLOG 1 [Cucumis sativus]2.7e-2655.4Show/hide
Query:  LRYSCKKARPP--------------LPPLHLPSPSPPTPPPKLVLNYSDTTFLTDAPSSEPPEFKGNATDLSRLAFKAFSSIDHAWCDVASLLSYRVNCH
        LR S KKARPP              LPPL  PS  PP+PPPKL+L +S++ FLT APSS PPEFKG ATDLS LAF+AFSS DHAW DVAS L+YRVNCH
Subjt:  LRYSCKKARPP--------------LPPLHLPSPSPPTPPPKLVLNYSDTTFLTDAPSSEPPEFKGNATDLSRLAFKAFSSIDHAWCDVASLLSYRVNCH

Query:  GELKKVRRYL----DEEKSLIICRAL--YSIALEQTSCH
        GEL    RY     DE++ + + R +   SI LE + C+
Subjt:  GELKKVRRYL----DEEKSLIICRAL--YSIALEQTSCH

XP_008462435.1 PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 1-like isoform X2 [Cucumis melo]3.6e-2658.78Show/hide
Query:  LRYSCKKARPPLPPLH----LPSPS--PPTPPPKLVLNYSDTTFLTDAPSSEPPEFKGNATDLSRLAFKAFSSIDHAWCDVASLLSYRVNCHGELKKVRR
        LR S KKARPP PP       PSPS  PPTPPPKL+L +S++ FLT APSSEPPEF G ATDLS LAF+AFSS DHAW DVAS L+YR+NCHGEL    R
Subjt:  LRYSCKKARPPLPPLH----LPSPS--PPTPPPKLVLNYSDTTFLTDAPSSEPPEFKGNATDLSRLAFKAFSSIDHAWCDVASLLSYRVNCHGELKKVRR

Query:  YL----DEEKSLIICRAL--YSIALEQTSCH
        Y     DE++ + + R +   SI LE + C+
Subjt:  YL----DEEKSLIICRAL--YSIALEQTSCH

XP_022953928.1 uncharacterized protein LOC111456335, partial [Cucurbita moschata]5.0e-12199.55Show/hide
Query:  LRYSCKKARPPLPPLHLPSPSPPTPPPKLVLNYSDTTFLTDAPSSEPPEFKGNATDLSRLAFKAFSSIDHAWCDVASLLSYRVNCHGELKKVRRYLDEEK
        LR SCKKARPPLPPLHLPSPSPPTPPPKLVLNYSDTTFLTDAPSSEPPEFKGNATDLSRLAFKAFSSIDHAWCDVASLLSYRVNCHGELKKVRRYLDEEK
Subjt:  LRYSCKKARPPLPPLHLPSPSPPTPPPKLVLNYSDTTFLTDAPSSEPPEFKGNATDLSRLAFKAFSSIDHAWCDVASLLSYRVNCHGELKKVRRYLDEEK

Query:  SLIICRALYSIALEQTSCHHNLPKREEKKKKKETNENWREEPNRKEKKKCKKCQKAKPKSRFYNPTNHGKEYKYYNCLVTRKASCRTERHDHALYFLICL
        SLIICRALYSIALEQTSCHHNLPKREEKKKKKETNENWREEPNRKEKKKCKKCQKAKPKSRFYNPTNHGKEYKYYNCLVTRKASCRTERHDHALYFLICL
Subjt:  SLIICRALYSIALEQTSCHHNLPKREEKKKKKETNENWREEPNRKEKKKCKKCQKAKPKSRFYNPTNHGKEYKYYNCLVTRKASCRTERHDHALYFLICL

Query:  GKSKFVEIVLQAFRVQFRPTSM
        GKSKFVEIVLQAFRVQFRPTSM
Subjt:  GKSKFVEIVLQAFRVQFRPTSM

XP_038897963.1 protein SAWADEE HOMEODOMAIN HOMOLOG 1-like [Benincasa hispida]7.0e-3063.71Show/hide
Query:  SCKKARPPLPP--LHLPSPSPPTPPPKLVLNYSDTTFLTDAPSSEPPEFKGNATDLSRLAFKAFSSIDHAWCDVASLLSYRVNCHGELKKVRRYL----D
        SCKKARPP PP     PSP PPTPPPKL+L +S++ FLTDAPSSEPPEFKG ATDLS LAF+AFSS DHAW DVAS LSYRVNCHGEL    RY     D
Subjt:  SCKKARPPLPP--LHLPSPSPPTPPPKLVLNYSDTTFLTDAPSSEPPEFKGNATDLSRLAFKAFSSIDHAWCDVASLLSYRVNCHGELKKVRRYL----D

Query:  EEKSLIICRAL--YSIALEQTSCH
        E++ + + R +   SI LE + C+
Subjt:  EEKSLIICRAL--YSIALEQTSCH

TrEMBL top hitse value%identityAlignment
A0A0A0KCT9 SAWADEE domain-containing protein1.3e-2655.4Show/hide
Query:  LRYSCKKARPP--------------LPPLHLPSPSPPTPPPKLVLNYSDTTFLTDAPSSEPPEFKGNATDLSRLAFKAFSSIDHAWCDVASLLSYRVNCH
        LR S KKARPP              LPPL  PS  PP+PPPKL+L +S++ FLT APSS PPEFKG ATDLS LAF+AFSS DHAW DVAS L+YRVNCH
Subjt:  LRYSCKKARPP--------------LPPLHLPSPSPPTPPPKLVLNYSDTTFLTDAPSSEPPEFKGNATDLSRLAFKAFSSIDHAWCDVASLLSYRVNCH

Query:  GELKKVRRYL----DEEKSLIICRAL--YSIALEQTSCH
        GEL    RY     DE++ + + R +   SI LE + C+
Subjt:  GELKKVRRYL----DEEKSLIICRAL--YSIALEQTSCH

A0A1S3CIG7 protein SAWADEE HOMEODOMAIN HOMOLOG 1-like isoform X21.7e-2658.78Show/hide
Query:  LRYSCKKARPPLPPLH----LPSPS--PPTPPPKLVLNYSDTTFLTDAPSSEPPEFKGNATDLSRLAFKAFSSIDHAWCDVASLLSYRVNCHGELKKVRR
        LR S KKARPP PP       PSPS  PPTPPPKL+L +S++ FLT APSSEPPEF G ATDLS LAF+AFSS DHAW DVAS L+YR+NCHGEL    R
Subjt:  LRYSCKKARPPLPPLH----LPSPS--PPTPPPKLVLNYSDTTFLTDAPSSEPPEFKGNATDLSRLAFKAFSSIDHAWCDVASLLSYRVNCHGELKKVRR

Query:  YL----DEEKSLIICRAL--YSIALEQTSCH
        Y     DE++ + + R +   SI LE + C+
Subjt:  YL----DEEKSLIICRAL--YSIALEQTSCH

A0A5A7UBT5 Protein SAWADEE HOMEODOMAIN-like protein 1-like isoform X21.7e-2658.78Show/hide
Query:  LRYSCKKARPPLPPLH----LPSPS--PPTPPPKLVLNYSDTTFLTDAPSSEPPEFKGNATDLSRLAFKAFSSIDHAWCDVASLLSYRVNCHGELKKVRR
        LR S KKARPP PP       PSPS  PPTPPPKL+L +S++ FLT APSSEPPEF G ATDLS LAF+AFSS DHAW DVAS L+YR+NCHGEL    R
Subjt:  LRYSCKKARPPLPPLH----LPSPS--PPTPPPKLVLNYSDTTFLTDAPSSEPPEFKGNATDLSRLAFKAFSSIDHAWCDVASLLSYRVNCHGELKKVRR

Query:  YL----DEEKSLIICRAL--YSIALEQTSCH
        Y     DE++ + + R +   SI LE + C+
Subjt:  YL----DEEKSLIICRAL--YSIALEQTSCH

A0A6J1DBC4 protein SAWADEE HOMEODOMAIN HOMOLOG 1-like7.3e-2558.91Show/hide
Query:  LRYSCKKARPPLPPLHLPSPSPPTPPPKLVLNYSDTTFLTDAPSSEP----PEFKGNATDLSRLAFKAFSSIDHAWCDVASLLSYRVNCHGELKKVRRYL
        LR S KKA PP PP   PSP PP PPPKL+L +SD++FLTDAPSSEP    PE KG A+DLS LAF+AFSS D+AW DVAS LSYRVNCHGEL    RY 
Subjt:  LRYSCKKARPPLPPLHLPSPSPPTPPPKLVLNYSDTTFLTDAPSSEP----PEFKGNATDLSRLAFKAFSSIDHAWCDVASLLSYRVNCHGELKKVRRYL

Query:  ----DEEKSLIICRAL--YSIALEQTSCH
            DE++ + + R +   SI LE + C+
Subjt:  ----DEEKSLIICRAL--YSIALEQTSCH

A0A6J1GPP3 uncharacterized protein LOC1114563352.4e-12199.55Show/hide
Query:  LRYSCKKARPPLPPLHLPSPSPPTPPPKLVLNYSDTTFLTDAPSSEPPEFKGNATDLSRLAFKAFSSIDHAWCDVASLLSYRVNCHGELKKVRRYLDEEK
        LR SCKKARPPLPPLHLPSPSPPTPPPKLVLNYSDTTFLTDAPSSEPPEFKGNATDLSRLAFKAFSSIDHAWCDVASLLSYRVNCHGELKKVRRYLDEEK
Subjt:  LRYSCKKARPPLPPLHLPSPSPPTPPPKLVLNYSDTTFLTDAPSSEPPEFKGNATDLSRLAFKAFSSIDHAWCDVASLLSYRVNCHGELKKVRRYLDEEK

Query:  SLIICRALYSIALEQTSCHHNLPKREEKKKKKETNENWREEPNRKEKKKCKKCQKAKPKSRFYNPTNHGKEYKYYNCLVTRKASCRTERHDHALYFLICL
        SLIICRALYSIALEQTSCHHNLPKREEKKKKKETNENWREEPNRKEKKKCKKCQKAKPKSRFYNPTNHGKEYKYYNCLVTRKASCRTERHDHALYFLICL
Subjt:  SLIICRALYSIALEQTSCHHNLPKREEKKKKKETNENWREEPNRKEKKKCKKCQKAKPKSRFYNPTNHGKEYKYYNCLVTRKASCRTERHDHALYFLICL

Query:  GKSKFVEIVLQAFRVQFRPTSM
        GKSKFVEIVLQAFRVQFRPTSM
Subjt:  GKSKFVEIVLQAFRVQFRPTSM

SwissProt top hitse value%identityAlignment
Q9XI47 Protein SAWADEE HOMEODOMAIN HOMOLOG 18.0e-0542.5Show/hide
Query:  PLPPLHLPSPSPPTPPPKLVLNYSDTTFLTDAPSSEPPEFKGNATDLSRLAFKAFSSIDHAWCDVASLLSYRVNCHGELK
        P PPL +   S P+       N S+ TF+ +  S+     KG A+DL+ LAF+A S+ D+AW DV+S L+YRV   GEL+
Subjt:  PLPPLHLPSPSPPTPPPKLVLNYSDTTFLTDAPSSEPPEFKGNATDLSRLAFKAFSSIDHAWCDVASLLSYRVNCHGELK

Arabidopsis top hitse value%identityAlignment
AT1G15215.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors5.7e-0642.5Show/hide
Query:  PLPPLHLPSPSPPTPPPKLVLNYSDTTFLTDAPSSEPPEFKGNATDLSRLAFKAFSSIDHAWCDVASLLSYRVNCHGELK
        P PPL +   S P+       N S+ TF+ +  S+     KG A+DL+ LAF+A S+ D+AW DV+S L+YRV   GEL+
Subjt:  PLPPLHLPSPSPPTPPPKLVLNYSDTTFLTDAPSSEPPEFKGNATDLSRLAFKAFSSIDHAWCDVASLLSYRVNCHGELK

AT1G15215.2 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors5.7e-0642.5Show/hide
Query:  PLPPLHLPSPSPPTPPPKLVLNYSDTTFLTDAPSSEPPEFKGNATDLSRLAFKAFSSIDHAWCDVASLLSYRVNCHGELK
        P PPL +   S P+       N S+ TF+ +  S+     KG A+DL+ LAF+A S+ D+AW DV+S L+YRV   GEL+
Subjt:  PLPPLHLPSPSPPTPPPKLVLNYSDTTFLTDAPSSEPPEFKGNATDLSRLAFKAFSSIDHAWCDVASLLSYRVNCHGELK

AT1G15215.3 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors5.7e-0642.5Show/hide
Query:  PLPPLHLPSPSPPTPPPKLVLNYSDTTFLTDAPSSEPPEFKGNATDLSRLAFKAFSSIDHAWCDVASLLSYRVNCHGELK
        P PPL +   S P+       N S+ TF+ +  S+     KG A+DL+ LAF+A S+ D+AW DV+S L+YRV   GEL+
Subjt:  PLPPLHLPSPSPPTPPPKLVLNYSDTTFLTDAPSSEPPEFKGNATDLSRLAFKAFSSIDHAWCDVASLLSYRVNCHGELK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGAGTCTTATGGAAGCGAGACGAGGGAGGGAGGGAGGGAGGGCGGGAAGCGTTTCTTATGGATCGAACACTTGGTCAAGTGTTTTTCCAAGATTTCGTACTTCAT
TTCAGTTGCTCCCCGTGGCGCGCTACAAAATCTCCCGTCACTGCGGTACAGTTGTAAAAAAGCGCGGCCTCCACTTCCACCTCTACATTTGCCATCACCTTCGCCTCCGA
CTCCGCCTCCGAAACTTGTACTTAATTATTCGGATACTACTTTTTTAACTGACGCACCGTCATCTGAACCACCTGAATTCAAAGGCAATGCAACTGATCTTTCAAGATTA
GCATTCAAAGCCTTTTCGTCAATAGACCATGCATGGTGTGACGTTGCTTCGCTCCTCAGTTACCGAGTTAATTGCCATGGAGAACTAAAAAAAGTTCGCAGATATTTAGA
TGAAGAGAAAAGCCTCATAATCTGCCGTGCTCTGTATTCTATTGCATTGGAACAAACTTCATGTCATCACAATCTCCCTAAGAGGGAGGAAAAGAAAAAGAAAAAAGAAA
CCAACGAGAACTGGAGAGAGGAACCAAACAGAAAAGAAAAAAAAAAGTGCAAAAAATGCCAGAAGGCGAAACCTAAATCCAGATTCTATAACCCAACCAACCACGGGAAA
GAATATAAGTACTATAATTGTCTGGTTACCAGGAAAGCTTCATGCCGAACGGAAAGACATGATCATGCGCTCTACTTCCTTATTTGCTTAGGAAAAAGTAAATTTGTGGA
GATTGTGCTGCAGGCCTTCAGAGTACAATTCAGACCAACTTCAATGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGAGAGTCTTATGGAAGCGAGACGAGGGAGGGAGGGAGGGAGGGCGGGAAGCGTTTCTTATGGATCGAACACTTGGTCAAGTGTTTTTCCAAGATTTCGTACTTCAT
TTCAGTTGCTCCCCGTGGCGCGCTACAAAATCTCCCGTCACTGCGGTACAGTTGTAAAAAAGCGCGGCCTCCACTTCCACCTCTACATTTGCCATCACCTTCGCCTCCGA
CTCCGCCTCCGAAACTTGTACTTAATTATTCGGATACTACTTTTTTAACTGACGCACCGTCATCTGAACCACCTGAATTCAAAGGCAATGCAACTGATCTTTCAAGATTA
GCATTCAAAGCCTTTTCGTCAATAGACCATGCATGGTGTGACGTTGCTTCGCTCCTCAGTTACCGAGTTAATTGCCATGGAGAACTAAAAAAAGTTCGCAGATATTTAGA
TGAAGAGAAAAGCCTCATAATCTGCCGTGCTCTGTATTCTATTGCATTGGAACAAACTTCATGTCATCACAATCTCCCTAAGAGGGAGGAAAAGAAAAAGAAAAAAGAAA
CCAACGAGAACTGGAGAGAGGAACCAAACAGAAAAGAAAAAAAAAAGTGCAAAAAATGCCAGAAGGCGAAACCTAAATCCAGATTCTATAACCCAACCAACCACGGGAAA
GAATATAAGTACTATAATTGTCTGGTTACCAGGAAAGCTTCATGCCGAACGGAAAGACATGATCATGCGCTCTACTTCCTTATTTGCTTAGGAAAAAGTAAATTTGTGGA
GATTGTGCTGCAGGCCTTCAGAGTACAATTCAGACCAACTTCAATGTAAATGGAATCCCGAATCTTCATTCCTGAGAAGTAAATATTATGCAACCTGTCTGCGCACAGAA
GGATACATGTTCTCGTTTAGGAAAGGGGGCACTCGAAAAGTGGTTCAATTTTTCAACATAGACCCTCAAATTGAAATGCTGGGGATCAAACTTCATAGAGGCTTTCGCCT
TGGTACGCTGAGAACGAGGGCTGCTTGGTCACTGGTTGCTGCAGTTGCAGTGATTCGATACCCACAGTCCCATCGGCGATGAATACCTTCACTAGGGTATAAGAAATCCA
ACTCGACCACCTGAAGATTAAGAACATCGGTCAGAAAATCCTGCACCACGGGACATAACAACGCCCCACCGACTTCCCGAGGTGGCCATTGCCGTTACATAGAATCCCTC
TCTCCACTTTTTGTTAATCCATTTGAAAGGAAAAGAATCACTGACTTTGTACGACTGTTGCAAGTACTGTGTGCCTGGAATCATTAAATAGAGATTCAATTGATAAAAGG
AAAATGGCAT
Protein sequenceShow/hide protein sequence
MRESYGSETREGGREGGKRFLWIEHLVKCFSKISYFISVAPRGALQNLPSLRYSCKKARPPLPPLHLPSPSPPTPPPKLVLNYSDTTFLTDAPSSEPPEFKGNATDLSRL
AFKAFSSIDHAWCDVASLLSYRVNCHGELKKVRRYLDEEKSLIICRALYSIALEQTSCHHNLPKREEKKKKKETNENWREEPNRKEKKKCKKCQKAKPKSRFYNPTNHGK
EYKYYNCLVTRKASCRTERHDHALYFLICLGKSKFVEIVLQAFRVQFRPTSM