; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh14G001940 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh14G001940
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Descriptiontranscription initiation factor TFIID subunit 7-like
Genome locationCmo_Chr14:878274..879053
RNA-Seq ExpressionCmoCh14G001940
SyntenyCmoCh14G001940
Gene Ontology termsNA
InterPro domainsIPR012881 - Protein of unknown function DUF1685


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6580597.1 hypothetical protein SDJN03_20599, partial [Cucurbita argyrosperma subsp. sororia]5.3e-11096.26Show/hide
Query:  MAINNTTTSCLVSAMDRLWHHQIILSSSHPFTSHLHPTFPFSNFPSSLSSDRISLDDASLLSHEDDGINGDKYKQDGKEESMQESVSDPEFTMRNKLNKS
        MAINN TTSCLVSAMDRLWHHQIILS SHPFTSHLHPTFPFSNFPSSL SDRISLDDASLLS EDD INGDKYKQDGKEESM+ESVSDP+FTMRNKLNKS
Subjt:  MAINNTTTSCLVSAMDRLWHHQIILSSSHPFTSHLHPTFPFSNFPSSLSSDRISLDDASLLSHEDDGINGDKYKQDGKEESMQESVSDPEFTMRNKLNKS

Query:  MSCKSLGELELEEVKGFMDLGFQFRTENLSPQMLKLVPGLQRFKARMDKQNLDDVDENDDDKKRDIARPYLSEAWTINRPNSPLLNLRMPKLSSTTDMKK
        MSCKSLGELELEEVKGFMDLGFQFRTENLSPQM+KLVPGLQRFKARMDKQNLDDVDENDDDKKRDIARPYLSEAWTINRPNSPLLNLRMPKLSSTTDMKK
Subjt:  MSCKSLGELELEEVKGFMDLGFQFRTENLSPQMLKLVPGLQRFKARMDKQNLDDVDENDDDKKRDIARPYLSEAWTINRPNSPLLNLRMPKLSSTTDMKK

Query:  HLRSWATTVAIEIQ
        HLRSWATTVAIEIQ
Subjt:  HLRSWATTVAIEIQ

XP_022935252.1 uncharacterized protein LOC111442189 [Cucurbita moschata]1.6e-114100Show/hide
Query:  MAINNTTTSCLVSAMDRLWHHQIILSSSHPFTSHLHPTFPFSNFPSSLSSDRISLDDASLLSHEDDGINGDKYKQDGKEESMQESVSDPEFTMRNKLNKS
        MAINNTTTSCLVSAMDRLWHHQIILSSSHPFTSHLHPTFPFSNFPSSLSSDRISLDDASLLSHEDDGINGDKYKQDGKEESMQESVSDPEFTMRNKLNKS
Subjt:  MAINNTTTSCLVSAMDRLWHHQIILSSSHPFTSHLHPTFPFSNFPSSLSSDRISLDDASLLSHEDDGINGDKYKQDGKEESMQESVSDPEFTMRNKLNKS

Query:  MSCKSLGELELEEVKGFMDLGFQFRTENLSPQMLKLVPGLQRFKARMDKQNLDDVDENDDDKKRDIARPYLSEAWTINRPNSPLLNLRMPKLSSTTDMKK
        MSCKSLGELELEEVKGFMDLGFQFRTENLSPQMLKLVPGLQRFKARMDKQNLDDVDENDDDKKRDIARPYLSEAWTINRPNSPLLNLRMPKLSSTTDMKK
Subjt:  MSCKSLGELELEEVKGFMDLGFQFRTENLSPQMLKLVPGLQRFKARMDKQNLDDVDENDDDKKRDIARPYLSEAWTINRPNSPLLNLRMPKLSSTTDMKK

Query:  HLRSWATTVAIEIQ
        HLRSWATTVAIEIQ
Subjt:  HLRSWATTVAIEIQ

XP_022983600.1 uncharacterized protein LOC111482158 isoform X1 [Cucurbita maxima]2.2e-9582.88Show/hide
Query:  MAINNTTTSCLVSAMDRLWHHQIILSSSHPFTSHLHPTFPFSNFPSSLSSDRISLDDASLLSHEDDGINGDKYKQDGKEESMQESVSDPEFTMRNKLNKS
        MAI NT T CLVSAMDRLWHHQIILSS HP  SHLHPTFPFSNFPSSLSSDRI LDD SL+S EDD  +GDKYKQDGKEE+M+ES++DPEFTMR KLNK+
Subjt:  MAINNTTTSCLVSAMDRLWHHQIILSSSHPFTSHLHPTFPFSNFPSSLSSDRISLDDASLLSHEDDGINGDKYKQDGKEESMQESVSDPEFTMRNKLNKS

Query:  MSCKSLGELELEEVKGFMDLGFQFRTENLSPQMLKLVPGLQRFKARMDKQNL--------DDVDENDDDKKRDIARPYLSEAWTINRPNSPLLNLRMPKL
        MSCKSLGELE+EEVKGFMDLGFQFRTENLSPQM+KLVPGLQRFK RMDKQNL        DD DENDD KKRDIARPYLSEAWTINRPNSPLL LRMPK+
Subjt:  MSCKSLGELELEEVKGFMDLGFQFRTENLSPQMLKLVPGLQRFKARMDKQNL--------DDVDENDDDKKRDIARPYLSEAWTINRPNSPLLNLRMPKL

Query:  SSTTDMKKHLRSWATTVAIEIQ
        SST+DMKK L+SWA TVAIEIQ
Subjt:  SSTTDMKKHLRSWATTVAIEIQ

XP_022983601.1 uncharacterized protein LOC111482158 isoform X2 [Cucurbita maxima]6.3e-9584.58Show/hide
Query:  MAINNTTTSCLVSAMDRLWHHQIILSSSHPFTSHLHPTFPFSNFPSSLSSDRISLDDASLLSHEDDGINGDKYKQDGKEESMQESVSDPEFTMRNKLNKS
        MAI NT T CLVSAMDRLWHHQIILSS HP  SHLHPTFPFSNFPSSLSSDRI LDD SL+S EDD  +GDKYKQDGKEE+M+ES++DPEFTMR KLNK+
Subjt:  MAINNTTTSCLVSAMDRLWHHQIILSSSHPFTSHLHPTFPFSNFPSSLSSDRISLDDASLLSHEDDGINGDKYKQDGKEESMQESVSDPEFTMRNKLNKS

Query:  MSCKSLGELELEEVKGFMDLGFQFRTENLSPQMLKLVPGLQRFKARMDKQNLDDVDENDDDKKRDIARPYLSEAWTINRPNSPLLNLRMPKLSSTTDMKK
        MSCKSLGELE+EEVKGFMDLGFQFRTENLSPQM+KLVPGLQRFK RMDKQNL+  D++DDDKKRDIARPYLSEAWTINRPNSPLL LRMPK+SST+DMKK
Subjt:  MSCKSLGELELEEVKGFMDLGFQFRTENLSPQMLKLVPGLQRFKARMDKQNLDDVDENDDDKKRDIARPYLSEAWTINRPNSPLLNLRMPKLSSTTDMKK

Query:  HLRSWATTVAIEIQ
         L+SWA TVAIEIQ
Subjt:  HLRSWATTVAIEIQ

XP_023522774.1 uncharacterized protein LOC111786779 [Cucurbita pepo subsp. pepo]1.2e-9887.44Show/hide
Query:  MAINNTTTSCLVSAMDRLWHHQIILSSSHPFTSHLHPTFPFSNFPSSLSSDRISL-DDASLLSHEDDGINGDKYKQDGKEESMQESVSDPEFTMRNKLNK
        MAINNTTTSCLVSAMDRLWHH IILSSSHPF SHLHPTFPFSNFPSSLSSDRISL DD SLLS EDD INGD+YKQDGK ESM+ESVSDPEFTMR KLNK
Subjt:  MAINNTTTSCLVSAMDRLWHHQIILSSSHPFTSHLHPTFPFSNFPSSLSSDRISL-DDASLLSHEDDGINGDKYKQDGKEESMQESVSDPEFTMRNKLNK

Query:  SMSCKSLGELELEEVKGFMDLGFQFRTENLSPQMLKLVPGLQRFKARMDKQNLDDVDENDDDKKRDIARPYLSEAWTINRPNSPLLNLRMPKLSSTTDMK
        SMSCKSLGELELEEVKGFMDLGF+F  E+LSPQM+KLVPGLQRFK  MD+QNL+D D+++DDKKRDIARPYLSEAWTINRPNSPLLNL MP++SSTTDMK
Subjt:  SMSCKSLGELELEEVKGFMDLGFQFRTENLSPQMLKLVPGLQRFKARMDKQNLDDVDENDDDKKRDIARPYLSEAWTINRPNSPLLNLRMPKLSSTTDMK

Query:  KHLRSWATTVAIEIQ
        KHLRSWA TVAIEIQ
Subjt:  KHLRSWATTVAIEIQ

TrEMBL top hitse value%identityAlignment
A0A1S3B6W6 Uncharacterized protein5.3e-6359.84Show/hide
Query:  MAINNTTTSCLVSAMDRLWHHQIILSSSHPFTSHL------HPTFPFSNF-PSS-------------LSSDRISLDDASLLSHEDDGINGDKYKQDGKEE
        MA+N   T CLVSAMDRLW+HQIIL S  PFTSH         +FPF+NF PSS              SS   S D+ SL+S ED     DK KQD K E
Subjt:  MAINNTTTSCLVSAMDRLWHHQIILSSSHPFTSHL------HPTFPFSNF-PSS-------------LSSDRISLDDASLLSHEDDGINGDKYKQDGKEE

Query:  SMQ-ESVSDPEFTMRNKLNKSMSCKSLGELELEEVKGFMDLGFQFRTENLSPQMLKLVPGLQRFKARMDKQNL--------------DDVDENDDDKKRD
        S + +S+++ +F++  KLNKS SCKSLGELELEEVKGFMDLGF+F+ E+LSPQM+KLVPGLQR + + +KQNL              DD D++DDDKKR+
Subjt:  SMQ-ESVSDPEFTMRNKLNKSMSCKSLGELELEEVKGFMDLGFQFRTENLSPQMLKLVPGLQRFKARMDKQNL--------------DDVDENDDDKKRD

Query:  IARPYLSEAWTINRPNSPLLNLRMPKLSSTTDMKKHLRSWATTVAIEIQ
        IARPYLSEAW I RPNSPLLNLRMPK+SST+DMKKHLRSWA TVA EIQ
Subjt:  IARPYLSEAWTINRPNSPLLNLRMPKLSSTTDMKKHLRSWATTVAIEIQ

A0A5D3DPB9 Uncharacterized protein6.1e-5164.32Show/hide
Query:  PSSLSSDRISLDDASLLSHEDDGINGDKYKQDGKEESMQ-ESVSDPEFTMRNKLNKSMSCKSLGELELEEVKGFMDLGFQFRTENLSPQMLKLVPGLQRF
        PSS SS   S D+ SL+S ED     DK KQD K ES + +S+++ +F++  KLNKS SCKSLGELELEEVKGFMDLGF+F+ E+LSPQM+KLVPGLQR 
Subjt:  PSSLSSDRISLDDASLLSHEDDGINGDKYKQDGKEESMQ-ESVSDPEFTMRNKLNKSMSCKSLGELELEEVKGFMDLGFQFRTENLSPQMLKLVPGLQRF

Query:  KARMDKQNL--------------DDVDENDDDKKRDIARPYLSEAWTINRPNSPLLNLRMPKLSSTTDMKKHLRSWATTVAIEIQ
        + + +KQNL              DD D++DDDKKR+IARPYLSEAW I RPNSPLLNLRMPK+SST+DMKKHLRSWA TVA EIQ
Subjt:  KARMDKQNL--------------DDVDENDDDKKRDIARPYLSEAWTINRPNSPLLNLRMPKLSSTTDMKKHLRSWATTVAIEIQ

A0A6J1F521 uncharacterized protein LOC1114421897.8e-115100Show/hide
Query:  MAINNTTTSCLVSAMDRLWHHQIILSSSHPFTSHLHPTFPFSNFPSSLSSDRISLDDASLLSHEDDGINGDKYKQDGKEESMQESVSDPEFTMRNKLNKS
        MAINNTTTSCLVSAMDRLWHHQIILSSSHPFTSHLHPTFPFSNFPSSLSSDRISLDDASLLSHEDDGINGDKYKQDGKEESMQESVSDPEFTMRNKLNKS
Subjt:  MAINNTTTSCLVSAMDRLWHHQIILSSSHPFTSHLHPTFPFSNFPSSLSSDRISLDDASLLSHEDDGINGDKYKQDGKEESMQESVSDPEFTMRNKLNKS

Query:  MSCKSLGELELEEVKGFMDLGFQFRTENLSPQMLKLVPGLQRFKARMDKQNLDDVDENDDDKKRDIARPYLSEAWTINRPNSPLLNLRMPKLSSTTDMKK
        MSCKSLGELELEEVKGFMDLGFQFRTENLSPQMLKLVPGLQRFKARMDKQNLDDVDENDDDKKRDIARPYLSEAWTINRPNSPLLNLRMPKLSSTTDMKK
Subjt:  MSCKSLGELELEEVKGFMDLGFQFRTENLSPQMLKLVPGLQRFKARMDKQNLDDVDENDDDKKRDIARPYLSEAWTINRPNSPLLNLRMPKLSSTTDMKK

Query:  HLRSWATTVAIEIQ
        HLRSWATTVAIEIQ
Subjt:  HLRSWATTVAIEIQ

A0A6J1J2S9 uncharacterized protein LOC111482158 isoform X23.1e-9584.58Show/hide
Query:  MAINNTTTSCLVSAMDRLWHHQIILSSSHPFTSHLHPTFPFSNFPSSLSSDRISLDDASLLSHEDDGINGDKYKQDGKEESMQESVSDPEFTMRNKLNKS
        MAI NT T CLVSAMDRLWHHQIILSS HP  SHLHPTFPFSNFPSSLSSDRI LDD SL+S EDD  +GDKYKQDGKEE+M+ES++DPEFTMR KLNK+
Subjt:  MAINNTTTSCLVSAMDRLWHHQIILSSSHPFTSHLHPTFPFSNFPSSLSSDRISLDDASLLSHEDDGINGDKYKQDGKEESMQESVSDPEFTMRNKLNKS

Query:  MSCKSLGELELEEVKGFMDLGFQFRTENLSPQMLKLVPGLQRFKARMDKQNLDDVDENDDDKKRDIARPYLSEAWTINRPNSPLLNLRMPKLSSTTDMKK
        MSCKSLGELE+EEVKGFMDLGFQFRTENLSPQM+KLVPGLQRFK RMDKQNL+  D++DDDKKRDIARPYLSEAWTINRPNSPLL LRMPK+SST+DMKK
Subjt:  MSCKSLGELELEEVKGFMDLGFQFRTENLSPQMLKLVPGLQRFKARMDKQNLDDVDENDDDKKRDIARPYLSEAWTINRPNSPLLNLRMPKLSSTTDMKK

Query:  HLRSWATTVAIEIQ
         L+SWA TVAIEIQ
Subjt:  HLRSWATTVAIEIQ

A0A6J1J6B4 uncharacterized protein LOC111482158 isoform X11.1e-9582.88Show/hide
Query:  MAINNTTTSCLVSAMDRLWHHQIILSSSHPFTSHLHPTFPFSNFPSSLSSDRISLDDASLLSHEDDGINGDKYKQDGKEESMQESVSDPEFTMRNKLNKS
        MAI NT T CLVSAMDRLWHHQIILSS HP  SHLHPTFPFSNFPSSLSSDRI LDD SL+S EDD  +GDKYKQDGKEE+M+ES++DPEFTMR KLNK+
Subjt:  MAINNTTTSCLVSAMDRLWHHQIILSSSHPFTSHLHPTFPFSNFPSSLSSDRISLDDASLLSHEDDGINGDKYKQDGKEESMQESVSDPEFTMRNKLNKS

Query:  MSCKSLGELELEEVKGFMDLGFQFRTENLSPQMLKLVPGLQRFKARMDKQNL--------DDVDENDDDKKRDIARPYLSEAWTINRPNSPLLNLRMPKL
        MSCKSLGELE+EEVKGFMDLGFQFRTENLSPQM+KLVPGLQRFK RMDKQNL        DD DENDD KKRDIARPYLSEAWTINRPNSPLL LRMPK+
Subjt:  MSCKSLGELELEEVKGFMDLGFQFRTENLSPQMLKLVPGLQRFKARMDKQNL--------DDVDENDDDKKRDIARPYLSEAWTINRPNSPLLNLRMPKL

Query:  SSTTDMKKHLRSWATTVAIEIQ
        SST+DMKK L+SWA TVAIEIQ
Subjt:  SSTTDMKKHLRSWATTVAIEIQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G31560.1 Protein of unknown function (DUF1685)4.2e-0429.91Show/hide
Query:  KSLGELELEEVKGFMDLGFQFRTENLSPQMLKLVPGLQRFKARMDKQNLDDVDEN--DDDKKRDIARPYLSEA----WTINRPNSPLLNLRMPKLSSTTD
        KSL + +LEE+KG +DLGF F  + + P++   +P L+   + M ++ LDD  +N     ++ D + P  + A    W I+ P                D
Subjt:  KSLGELELEEVKGFMDLGFQFRTENLSPQMLKLVPGLQRFKARMDKQNLDDVDEN--DDDKKRDIARPYLSEA----WTINRPNSPLLNLRMPKLSSTTD

Query:  MKKHLRSWATTVAIEIQ
        +K  L+ WA TVA  ++
Subjt:  MKKHLRSWATTVAIEIQ

AT2G31560.2 Protein of unknown function (DUF1685)4.2e-0429.91Show/hide
Query:  KSLGELELEEVKGFMDLGFQFRTENLSPQMLKLVPGLQRFKARMDKQNLDDVDEN--DDDKKRDIARPYLSEA----WTINRPNSPLLNLRMPKLSSTTD
        KSL + +LEE+KG +DLGF F  + + P++   +P L+   + M ++ LDD  +N     ++ D + P  + A    W I+ P                D
Subjt:  KSLGELELEEVKGFMDLGFQFRTENLSPQMLKLVPGLQRFKARMDKQNLDDVDEN--DDDKKRDIARPYLSEA----WTINRPNSPLLNLRMPKLSSTTD

Query:  MKKHLRSWATTVAIEIQ
        +K  L+ WA TVA  ++
Subjt:  MKKHLRSWATTVAIEIQ

AT2G42760.1 unknown protein1.9e-1235.42Show/hide
Query:  VSDPEFTMRNKLNKS----MSCKSLGELELEEVKGFMDLGFQF-RTENLSPQMLKLVPGLQRFKARMDKQNLDDVDENDDDK--KRDIARPYLSEAWTI-
        +S+ E   + K  KS       KS+ +LE EE+KGFMDLGF F   ++    ++ ++PGLQR   + D    ++ +E ++DK      ARPYLSEAW   
Subjt:  VSDPEFTMRNKLNKS----MSCKSLGELELEEVKGFMDLGFQF-RTENLSPQMLKLVPGLQRFKARMDKQNLDDVDENDDDK--KRDIARPYLSEAWTI-

Query:  -----NRPNSPLLNLRM--PKLSSTTDMKKHLRSWATTVAIEIQ
              +  +P +  R+  P  +S  D+K +LR WA  VA  I+
Subjt:  -----NRPNSPLLNLRM--PKLSSTTDMKKHLRSWATTVAIEIQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTATCAACAACACAACCACTTCATGTCTTGTTTCAGCCATGGATCGCCTTTGGCACCACCAAATCATTCTTTCCTCTTCGCATCCTTTCACTTCCCATCTCCACCC
AACTTTTCCTTTCTCAAACTTCCCTTCTTCCCTTTCCTCCGACCGCATCTCCCTCGACGACGCCTCCCTCCTCTCCCATGAAGATGATGGTATCAATGGAGACAAATATA
AACAAGATGGAAAGGAAGAGTCAATGCAAGAAAGTGTCAGTGATCCTGAATTTACAATGAGGAACAAATTGAACAAATCTATGAGTTGTAAAAGCTTGGGGGAGTTGGAG
CTGGAGGAAGTTAAAGGGTTTATGGATTTAGGGTTTCAATTTAGGACAGAAAATTTGAGCCCTCAAATGCTGAAGTTGGTACCTGGTTTGCAGAGGTTTAAAGCTCGAAT
GGACAAACAAAATCTCGACGACGTAGATGAAAACGATGACGATAAGAAAAGAGATATAGCAAGACCGTATCTTTCAGAGGCATGGACAATAAACCGACCAAACTCCCCTC
TTTTAAACCTCAGGATGCCAAAACTTTCTTCAACCACTGACATGAAGAAACACCTAAGATCTTGGGCTACAACTGTTGCAATTGAAATTCAATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTATCAACAACACAACCACTTCATGTCTTGTTTCAGCCATGGATCGCCTTTGGCACCACCAAATCATTCTTTCCTCTTCGCATCCTTTCACTTCCCATCTCCACCC
AACTTTTCCTTTCTCAAACTTCCCTTCTTCCCTTTCCTCCGACCGCATCTCCCTCGACGACGCCTCCCTCCTCTCCCATGAAGATGATGGTATCAATGGAGACAAATATA
AACAAGATGGAAAGGAAGAGTCAATGCAAGAAAGTGTCAGTGATCCTGAATTTACAATGAGGAACAAATTGAACAAATCTATGAGTTGTAAAAGCTTGGGGGAGTTGGAG
CTGGAGGAAGTTAAAGGGTTTATGGATTTAGGGTTTCAATTTAGGACAGAAAATTTGAGCCCTCAAATGCTGAAGTTGGTACCTGGTTTGCAGAGGTTTAAAGCTCGAAT
GGACAAACAAAATCTCGACGACGTAGATGAAAACGATGACGATAAGAAAAGAGATATAGCAAGACCGTATCTTTCAGAGGCATGGACAATAAACCGACCAAACTCCCCTC
TTTTAAACCTCAGGATGCCAAAACTTTCTTCAACCACTGACATGAAGAAACACCTAAGATCTTGGGCTACAACTGTTGCAATTGAAATTCAATAA
Protein sequenceShow/hide protein sequence
MAINNTTTSCLVSAMDRLWHHQIILSSSHPFTSHLHPTFPFSNFPSSLSSDRISLDDASLLSHEDDGINGDKYKQDGKEESMQESVSDPEFTMRNKLNKSMSCKSLGELE
LEEVKGFMDLGFQFRTENLSPQMLKLVPGLQRFKARMDKQNLDDVDENDDDKKRDIARPYLSEAWTINRPNSPLLNLRMPKLSSTTDMKKHLRSWATTVAIEIQ