CuGenDBv2

CmoCh04G028490 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh04G028490
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: NHL domain protein
Genome location: Cmo_Chr04:20339956..20340414
RNA-Seq Expression: CmoCh04G028490
Synteny: CmoCh04G028490
Gene Ontology terms: NA
InterPro domains: NA
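The genome location can be split into a sequence ID and a coordinate pair to get the gene span. The sketch below assumes the coordinates are 1-based and inclusive, which the page does not state; `parse_location` is a hypothetical helper, not part of any CuGenDB tooling.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("Cmo_Chr04:20339956..20340414")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the span is 459 bp, the same length as the CDS sequence listed under Sequences.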


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6602448.1 hypothetical protein SDJN03_07681, partial [Cucurbita argyrosperma subsp. sororia] | e-value: 1.3e-86 | identity: 99.34%
Query:  MSVPQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD
        MSV QSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD
Subjt:  MSVPQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD

Query:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI
        PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI
Subjt:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI

XP_022961939.1 uncharacterized protein LOC111462557 [Cucurbita moschata] | e-value: 1.6e-87 | identity: 100%
Query:  MSVPQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD
        MSVPQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD
Subjt:  MSVPQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD

Query:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI
        PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI
Subjt:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI

XP_022990588.1 uncharacterized protein LOC111487422 [Cucurbita maxima] | e-value: 2.5e-85 | identity: 97.37%
Query:  MSVPQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD
        MSV QSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSK+WWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD
Subjt:  MSVPQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD

Query:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI
        PLSYALNFD+GPALDDPF EDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI
Subjt:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI

XP_023533538.1 uncharacterized protein LOC111795380 [Cucurbita pepo subsp. pepo] | e-value: 5.0e-86 | identity: 98.68%
Query:  MSVPQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD
        MSV QSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRS+PSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD
Subjt:  MSVPQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD

Query:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI
        PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI
Subjt:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI

XP_038889212.1 uncharacterized protein LOC120079097 [Benincasa hispida] | e-value: 2.8e-73 | identity: 84.87%
Query:  MSVPQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD
        MS+ QSPDLSSD N FH DD LH+AAF SRGCC WLPCLRSNPS+SWWERIRAADNDDEWWLRGWKRFR+WSEI+AGPKWKTFIRQF+KNR+RQST+RYD
Subjt:  MSVPQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD

Query:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI
        PLSY+LNFDEGPA DDPFS+D +RRDFS RFA+IPASAKSSMDLGKDGP FI
Subjt:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI
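The %identity values can be cross-checked against the alignment midline, the unlabeled row between each Query and Subjt pair. The sketch below assumes the usual BLAST-style convention: a letter in the midline marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. It also assumes identity is reported as identical positions divided by alignment length. `percent_identity` is a hypothetical helper, and the midline is copied from the Cucurbita maxima hit (XP_022990588.1).

```python
def percent_identity(midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter."""
    if not midline:
        raise ValueError("empty midline")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)


# Midline of the XP_022990588.1 alignment, both blocks joined,
# without the leading indentation used for display.
midline = (
    "MSV QSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSK+WWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD"
    "PLSYALNFD+GPALDDPF EDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI"
)

print(f"{percent_identity(midline):.2f}")
```

For this hit the midline has 4 non-identical columns out of 152, which is consistent with the listed 97.37%.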

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KRY7 Uncharacterized protein | e-value: 4.4e-72 | identity: 82.24%
Query:  MSVPQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD
        MS+  SPDLSSD NGF  +D LH+A FA+RGCC W+PCLRSN S+SWWERIRAADNDDEWWL+GWKRFR+WSEIVAGPKWKTFIRQFHKNR+RQST+RYD
Subjt:  MSVPQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD

Query:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI
        PLSY+LNFDEGPA DDPF++DF+RRDFS RFA+IPASAKSSMDLGKD PSFI
Subjt:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI

A0A1S4E267 uncharacterized protein LOC107991628 | e-value: 3.2e-70 | identity: 80.92%
Query:  MSVPQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD
        MS+ QSPD SSD NGFH++D LH+A FA+ GCCFW+PCLRSN S+SWWERIRAADNDDEWWLRGWKRFR+WSEIVAGPKWKTFIRQFHKNR+RQST+RYD
Subjt:  MSVPQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD

Query:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI
        PLSY+LNFDEGPA +D F++DF+RRDFS RFA+IPASAKSSMDL KD PS I
Subjt:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI

A0A5A7T228 Putative NHL domain-containing protein | e-value: 1.7e-68 | identity: 81.63%
Query:  MSVPQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD
        MS+ QSPD SSD NGFH++D LH+A FA+ GCCFW+PCLRSN S+SWWERIRAADNDDEWWLRGWKRFR+WSEIVAGPKWKTFIRQFHKNR+RQST+RYD
Subjt:  MSVPQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD

Query:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKD
        PLSY+LNFDEGPA +D F++DF+RRDFS RFA+IPASAKSSMDL KD
Subjt:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKD

A0A6J1HFG4 uncharacterized protein LOC111462557 | e-value: 7.5e-88 | identity: 100%
Query:  MSVPQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD
        MSVPQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD
Subjt:  MSVPQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD

Query:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI
        PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI
Subjt:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI

A0A6J1JTQ3 uncharacterized protein LOC111487422 | e-value: 1.2e-85 | identity: 97.37%
Query:  MSVPQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD
        MSV QSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSK+WWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD
Subjt:  MSVPQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD

Query:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI
        PLSYALNFD+GPALDDPF EDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI
Subjt:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT3G01430.1 BEST Arabidopsis thaliana protein match is: NHL domain-containing protein (TAIR:AT5G14890.1) | e-value: 2.6e-32 | identity: 43.03%
Query:  TDDLLHEAAFASRGCCFWLPCLRSNPSKS-----WWERIRAADN---DDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNR------------------
        TDD +HEA FA RGCCF +PCL S+   +     WW+RI   D    D+ WW+RGW+R R+WSE+VAGP+WKT+IR+F ++                   
Subjt:  TDDLLHEAAFASRGCCFWLPCLRSNPSKS-----WWERIRAADN---DDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNR------------------

Query:  -------HRQSTYRYDPLSYALNFDEGPALDDPFSEDFMRRDFSMRFA--SIPASAKSSMDLGKD
                 Q  +RYD LSY+LNFD+G      F ++F  RD+SMRFA  S+P S K S+D   D
Subjt:  -------HRQSTYRYDPLSYALNFDEGPALDDPFSEDFMRRDFSMRFA--SIPASAKSSMDLGKD

AT3G48020.1 unknown protein | e-value: 1.0e-20 | identity: 42.37%
Query:  CLRSNPSKSWWERI-RAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHR------QSTYRYDPLSYALNFDEGPALDDPFSEDFMRRDFSMR
        C  +    SWW+RI R    +  WW+R + + R+WSEIVAGP+WKTFIR+F+++  R         +RYDP+SY L+F++    DD  +     R FSMR
Subjt:  CLRSNPSKSWWERI-RAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHR------QSTYRYDPLSYALNFDEGPALDDPFSEDFMRRDFSMR

Query:  FASIP-ASAKSSMDLGKD
        +AS+P AS KS   +  D
Subjt:  FASIP-ASAKSSMDLGKD

AT5G14890.1 NHL domain-containing protein | e-value: 8.1e-34 | identity: 46.88%
Query:  SSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSN----PSKS-WWERIRAADN---DDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHK------------
        S  ++     D +HEA FA RGCCF LPCL S+    P+ S WW+RIR  D    D+ WW+ GW + R+WSEIVAGPKWKTFIR+F +            
Subjt:  SSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSN----PSKS-WWERIRAADN---DDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHK------------

Query:  NRHRQSTYRYDPLSYALNFDEGPALDDPFSEDFMRRDFSMRFA--SIPASAKSSMDLGKD
        NR    ++RYD  SY+LNFD+G      F ++F  RD+SMRFA  S+P S K S+D   D
Subjt:  NRHRQSTYRYDPLSYALNFDEGPALDDPFSEDFMRRDFSMRFA--SIPASAKSSMDLGKD

AT5G25240.1 unknown protein | e-value: 4.5e-08 | identity: 30.16%
Query:  LSSDVNGFHTDDLLHEAAFASRGCCF--------WLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQF---HKNRHRQSTY
        +++D     +DD    AAF   GC +        W      + S+  W      +    W     K  ++ SE +AGPKWK FIR F    K   R   +
Subjt:  LSSDVNGFHTDDLLHEAAFASRGCCF--------WLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQF---HKNRHRQSTY

Query:  RYDPLSYALNFDEGPALDDPFSEDFM
         YD  +Y+LNFD+G    D   E F+
Subjt:  RYDPLSYALNFDEGPALDDPFSEDFM

AT5G62865.1 unknown protein | e-value: 9.6e-19 | identity: 38.97%
Query:  EAAFASRGCCFWLPCLRSNPSK-----SWWERIRAAD---------NDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHR------QSTYRYDPLS
        +  +  R CCF  P  R + S      S W RIR  D         ++  WW+R   + R+WSEIVAGP+WKTFIR+F+++  R         ++YDPLS
Subjt:  EAAFASRGCCFWLPCLRSNPSK-----SWWERIRAAD---------NDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHR------QSTYRYDPLS

Query:  YALNFDEGPALDDPFSEDFM----RRDFSMRFASIP
        Y+LNFD     DD   ++++     R FS RFAS+P
Subjt:  YALNFDEGPALDDPFSEDFM----RRDFSMRFASIP


Sequences
CDS sequence
ATGTCTGTTCCTCAATCTCCTGACCTTTCTTCCGACGTTAATGGCTTCCACACCGATGACCTTCTTCATGAGGCTGCCTTCGCCAGCCGTGGCTGCTGCTTCTGGCTTCC
TTGCCTGAGGTCTAATCCATCGAAGTCCTGGTGGGAGAGGATTAGGGCTGCCGATAATGACGACGAGTGGTGGCTTCGAGGCTGGAAGAGGTTCCGCGATTGGTCCGAAA
TCGTTGCCGGCCCTAAATGGAAGACGTTCATTCGTCAATTTCACAAGAATCGCCATCGCCAATCTACTTACCGTTACGATCCGCTCAGTTACGCTCTCAACTTCGACGAA
GGTCCAGCTCTCGACGATCCTTTCAGTGAGGACTTCATGCGCCGCGACTTCTCCATGCGATTCGCCTCCATTCCGGCCTCCGCCAAGTCGTCCATGGACCTCGGCAAGGA
TGGACCGTCCTTCATTTGA
mRNA sequence
ATGTCTGTTCCTCAATCTCCTGACCTTTCTTCCGACGTTAATGGCTTCCACACCGATGACCTTCTTCATGAGGCTGCCTTCGCCAGCCGTGGCTGCTGCTTCTGGCTTCC
TTGCCTGAGGTCTAATCCATCGAAGTCCTGGTGGGAGAGGATTAGGGCTGCCGATAATGACGACGAGTGGTGGCTTCGAGGCTGGAAGAGGTTCCGCGATTGGTCCGAAA
TCGTTGCCGGCCCTAAATGGAAGACGTTCATTCGTCAATTTCACAAGAATCGCCATCGCCAATCTACTTACCGTTACGATCCGCTCAGTTACGCTCTCAACTTCGACGAA
GGTCCAGCTCTCGACGATCCTTTCAGTGAGGACTTCATGCGCCGCGACTTCTCCATGCGATTCGCCTCCATTCCGGCCTCCGCCAAGTCGTCCATGGACCTCGGCAAGGA
TGGACCGTCCTTCATTTGA
Protein sequence
MSVPQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYDPLSYALNFDE
GPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI
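
The three sequences above can be checked against one another by translating the CDS and comparing the result with the listed protein. The sketch below is a minimal pure-Python translation that assumes the standard genetic code (NCBI translation table 1); the page does not say which table was used. `translate` is a hypothetical helper rather than a CuGenDB function.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS; the stop codon is rendered as '*'."""
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# CDS and protein copied from the Sequences section of this record.
CDS = (
    "ATGTCTGTTCCTCAATCTCCTGACCTTTCTTCCGACGTTAATGGCTTCCACACCGATGACCTTCTTCATGAGGCTGCCTTCGCCAGCCGTGGCTGCTGCTTCTGGCTTCC"
    "TTGCCTGAGGTCTAATCCATCGAAGTCCTGGTGGGAGAGGATTAGGGCTGCCGATAATGACGACGAGTGGTGGCTTCGAGGCTGGAAGAGGTTCCGCGATTGGTCCGAAA"
    "TCGTTGCCGGCCCTAAATGGAAGACGTTCATTCGTCAATTTCACAAGAATCGCCATCGCCAATCTACTTACCGTTACGATCCGCTCAGTTACGCTCTCAACTTCGACGAA"
    "GGTCCAGCTCTCGACGATCCTTTCAGTGAGGACTTCATGCGCCGCGACTTCTCCATGCGATTCGCCTCCATTCCGGCCTCCGCCAAGTCGTCCATGGACCTCGGCAAGGA"
    "TGGACCGTCCTTCATTTGA"
)
PROTEIN = (
    "MSVPQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYDPLSYALNFDE"
    "GPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI"
)

translated = translate(CDS)
print(len(CDS), len(PROTEIN))
print(translated.rstrip("*") == PROTEIN)
```

The CDS is 459 nt (152 codons plus a TGA stop) and the listed mRNA sequence is identical to it, so this record carries no annotated UTRs.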