CuGenDBv2

Carg04422 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg04422
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: NHL domain protein
Genome location: Carg_Chr04:20474315..20474773
RNA-Seq Expression: Carg04422
Synteny: Carg04422
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAG6602448.1 hypothetical protein SDJN03_07681, partial [Cucurbita argyrosperma subsp. sororia]; e value 4.5e-87; identity 100%
Query:  MSVSQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD
        MSVSQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD
Subjt:  MSVSQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD

Query:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI
        PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI
Subjt:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI

XP_022961939.1 uncharacterized protein LOC111462557 [Cucurbita moschata]; e value 1.7e-86; identity 99.34%
Query:  MSVSQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD
        MSV QSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD
Subjt:  MSVSQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD

Query:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI
        PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI
Subjt:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI

XP_022990588.1 uncharacterized protein LOC111487422 [Cucurbita maxima]; e value 8.5e-86; identity 98.03%
Query:  MSVSQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD
        MSVSQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSK+WWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD
Subjt:  MSVSQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD

Query:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI
        PLSYALNFD+GPALDDPF EDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI
Subjt:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI

XP_023533538.1 uncharacterized protein LOC111795380 [Cucurbita pepo subsp. pepo]; e value 1.7e-86; identity 99.34%
Query:  MSVSQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD
        MSVSQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRS+PSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD
Subjt:  MSVSQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD

Query:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI
        PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI
Subjt:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI

XP_038889212.1 uncharacterized protein LOC120079097 [Benincasa hispida]; e value 2.2e-73; identity 84.87%
Query:  MSVSQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD
        MS++QSPDLSSD N FH DD LH+AAF SRGCC WLPCLRSNPS+SWWERIRAADNDDEWWLRGWKRFR+WSEI+AGPKWKTFIRQF+KNR+RQST+RYD
Subjt:  MSVSQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD

Query:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI
        PLSY+LNFDEGPA DDPFS+D +RRDFS RFA+IPASAKSSMDLGKDGP FI
Subjt:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI
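
The % identity values listed for each hit can be cross-checked against the alignment midline, the unlabeled row between Query and Subjt. In that row a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The following is a minimal Python sketch, assuming that identity is the number of identical positions divided by the alignment length and that midline rows keep their full width (no stripped trailing spaces). `percent_identity` is a hypothetical helper name for illustration, not part of CuGenDB.

```python
def percent_identity(midline_rows):
    """Return % identity from BLAST-style midline rows.

    Letters count as identities; '+' and spaces do not.
    Assumes each row keeps its full alignment width.
    """
    midline = "".join(midline_rows)
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(midline), 2)


# Midline rows copied from the Cucurbita maxima hit (XP_022990588.1).
maxima_midline = [
    "MSVSQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSK+WWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD",
    "PLSYALNFD+GPALDDPF EDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI",
]

print(percent_identity(maxima_midline))
```

For this hit the midline has 149 letters over 152 aligned positions, which agrees with the listed 98.03.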

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A0A0KRY7 Uncharacterized protein; e value 3.4e-72; identity 82.24%
Query:  MSVSQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD
        MS++ SPDLSSD NGF  +D LH+A FA+RGCC W+PCLRSN S+SWWERIRAADNDDEWWL+GWKRFR+WSEIVAGPKWKTFIRQFHKNR+RQST+RYD
Subjt:  MSVSQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD

Query:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI
        PLSY+LNFDEGPA DDPF++DF+RRDFS RFA+IPASAKSSMDLGKD PSFI
Subjt:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI

A0A1S4E267 uncharacterized protein LOC107991628; e value 2.4e-70; identity 80.92%
Query:  MSVSQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD
        MS++QSPD SSD NGFH++D LH+A FA+ GCCFW+PCLRSN S+SWWERIRAADNDDEWWLRGWKRFR+WSEIVAGPKWKTFIRQFHKNR+RQST+RYD
Subjt:  MSVSQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD

Query:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI
        PLSY+LNFDEGPA +D F++DF+RRDFS RFA+IPASAKSSMDL KD PS I
Subjt:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI

A0A5A7T228 Putative NHL domain-containing protein; e value 1.3e-68; identity 81.63%
Query:  MSVSQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD
        MS++QSPD SSD NGFH++D LH+A FA+ GCCFW+PCLRSN S+SWWERIRAADNDDEWWLRGWKRFR+WSEIVAGPKWKTFIRQFHKNR+RQST+RYD
Subjt:  MSVSQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD

Query:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKD
        PLSY+LNFDEGPA +D F++DF+RRDFS RFA+IPASAKSSMDL KD
Subjt:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKD

A0A6J1HFG4 uncharacterized protein LOC111462557; e value 8.3e-87; identity 99.34%
Query:  MSVSQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD
        MSV QSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD
Subjt:  MSVSQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD

Query:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI
        PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI
Subjt:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI

A0A6J1JTQ3 uncharacterized protein LOC111487422; e value 4.1e-86; identity 98.03%
Query:  MSVSQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD
        MSVSQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSK+WWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD
Subjt:  MSVSQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYD

Query:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI
        PLSYALNFD+GPALDDPF EDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI
Subjt:  PLSYALNFDEGPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI

SwissProt top hits (columns: hit, e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT3G01430.1 BEST Arabidopsis thaliana protein match is: NHL domain-containing protein (TAIR:AT5G14890.1); e value 3.4e-32; identity 43.03%
Query:  TDDLLHEAAFASRGCCFWLPCLRSNPSKS-----WWERIRAADN---DDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNR------------------
        TDD +HEA FA RGCCF +PCL S+   +     WW+RI   D    D+ WW+RGW+R R+WSE+VAGP+WKT+IR+F ++                   
Subjt:  TDDLLHEAAFASRGCCFWLPCLRSNPSKS-----WWERIRAADN---DDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNR------------------

Query:  -------HRQSTYRYDPLSYALNFDEGPALDDPFSEDFMRRDFSMRFA--SIPASAKSSMDLGKD
                 Q  +RYD LSY+LNFD+G      F ++F  RD+SMRFA  S+P S K S+D   D
Subjt:  -------HRQSTYRYDPLSYALNFDEGPALDDPFSEDFMRRDFSMRFA--SIPASAKSSMDLGKD

AT3G48020.1 unknown protein; e value 1.0e-20; identity 42.37%
Query:  CLRSNPSKSWWERI-RAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHR------QSTYRYDPLSYALNFDEGPALDDPFSEDFMRRDFSMR
        C  +    SWW+RI R    +  WW+R + + R+WSEIVAGP+WKTFIR+F+++  R         +RYDP+SY L+F++    DD  +     R FSMR
Subjt:  CLRSNPSKSWWERI-RAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHR------QSTYRYDPLSYALNFDEGPALDDPFSEDFMRRDFSMR

Query:  FASIP-ASAKSSMDLGKD
        +AS+P AS KS   +  D
Subjt:  FASIP-ASAKSSMDLGKD

AT5G14890.1 NHL domain-containing protein; e value 8.1e-34; identity 46.88%
Query:  SSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSN----PSKS-WWERIRAADN---DDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHK------------
        S  ++     D +HEA FA RGCCF LPCL S+    P+ S WW+RIR  D    D+ WW+ GW + R+WSEIVAGPKWKTFIR+F +            
Subjt:  SSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSN----PSKS-WWERIRAADN---DDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHK------------

Query:  NRHRQSTYRYDPLSYALNFDEGPALDDPFSEDFMRRDFSMRFA--SIPASAKSSMDLGKD
        NR    ++RYD  SY+LNFD+G      F ++F  RD+SMRFA  S+P S K S+D   D
Subjt:  NRHRQSTYRYDPLSYALNFDEGPALDDPFSEDFMRRDFSMRFA--SIPASAKSSMDLGKD

AT5G25240.1 unknown protein; e value 4.5e-08; identity 30.16%
Query:  LSSDVNGFHTDDLLHEAAFASRGCCF--------WLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQF---HKNRHRQSTY
        +++D     +DD    AAF   GC +        W      + S+  W      +    W     K  ++ SE +AGPKWK FIR F    K   R   +
Subjt:  LSSDVNGFHTDDLLHEAAFASRGCCF--------WLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQF---HKNRHRQSTY

Query:  RYDPLSYALNFDEGPALDDPFSEDFM
         YD  +Y+LNFD+G    D   E F+
Subjt:  RYDPLSYALNFDEGPALDDPFSEDFM

AT5G62865.1 unknown protein; e value 1.3e-18; identity 38.97%
Query:  EAAFASRGCCFWLPCLRSNPSK-----SWWERIRAAD---------NDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHR------QSTYRYDPLS
        +  +  R CCF  P  R + S      S W RIR  D         ++  WW+R   + R+WSEIVAGP+WKTFIR+F+++  R         ++YDPLS
Subjt:  EAAFASRGCCFWLPCLRSNPSK-----SWWERIRAAD---------NDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHR------QSTYRYDPLS

Query:  YALNFDEGPALDDPFSEDFM----RRDFSMRFASIP
        Y+LNFD     DD   ++++     R FS RFAS+P
Subjt:  YALNFDEGPALDDPFSEDFM----RRDFSMRFASIP


Sequences
CDS sequence
ATGTCTGTTTCTCAATCTCCTGACCTTTCTTCCGACGTTAATGGCTTCCACACCGATGACCTTCTTCATGAGGCTGCCTTCGCCAGCCGTGGCTGCTGCTTCTGGCTTCC
TTGCCTGAGGTCTAATCCATCGAAGTCCTGGTGGGAGAGGATTAGGGCTGCGGATAATGACGACGAGTGGTGGCTTCGAGGCTGGAAGAGGTTCCGCGATTGGTCCGAAA
TCGTTGCCGGCCCTAAATGGAAGACGTTCATTCGTCAATTTCACAAGAATCGCCATCGCCAATCTACTTACCGTTACGATCCGCTCAGTTACGCTCTCAACTTCGACGAA
GGTCCAGCTCTCGACGATCCTTTCAGCGAGGACTTCATGCGCCGCGACTTCTCCATGCGATTCGCCTCCATTCCGGCCTCCGCCAAGTCGTCCATGGACCTCGGCAAGGA
TGGACCGTCCTTCATTTGA
mRNA sequence
ATGTCTGTTTCTCAATCTCCTGACCTTTCTTCCGACGTTAATGGCTTCCACACCGATGACCTTCTTCATGAGGCTGCCTTCGCCAGCCGTGGCTGCTGCTTCTGGCTTCC
TTGCCTGAGGTCTAATCCATCGAAGTCCTGGTGGGAGAGGATTAGGGCTGCGGATAATGACGACGAGTGGTGGCTTCGAGGCTGGAAGAGGTTCCGCGATTGGTCCGAAA
TCGTTGCCGGCCCTAAATGGAAGACGTTCATTCGTCAATTTCACAAGAATCGCCATCGCCAATCTACTTACCGTTACGATCCGCTCAGTTACGCTCTCAACTTCGACGAA
GGTCCAGCTCTCGACGATCCTTTCAGCGAGGACTTCATGCGCCGCGACTTCTCCATGCGATTCGCCTCCATTCCGGCCTCCGCCAAGTCGTCCATGGACCTCGGCAAGGA
TGGACCGTCCTTCATTTGA
Protein sequence
MSVSQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYDPLSYALNFDE
GPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI
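
The three sequences above and the genome location can be checked against one another. The sketch below is a minimal Python example, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene. It translates the CDS in frame 0 and compares the result with the listed protein, and it compares the CDS length with the span of Carg_Chr04:20474315..20474773. The names `translate`, `CODON_TABLE`, `CDS`, and `PROTEIN` are illustrative and not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (assumed: NCBI table 1), codons enumerated over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# CDS and protein copied from the record above, one string per listed line.
CDS = (
    "ATGTCTGTTTCTCAATCTCCTGACCTTTCTTCCGACGTTAATGGCTTCCACACCGATGACCTTCTTCATGAGGCTGCCTTCGCCAGCCGTGGCTGCTGCTTCTGGCTTCC"
    "TTGCCTGAGGTCTAATCCATCGAAGTCCTGGTGGGAGAGGATTAGGGCTGCGGATAATGACGACGAGTGGTGGCTTCGAGGCTGGAAGAGGTTCCGCGATTGGTCCGAAA"
    "TCGTTGCCGGCCCTAAATGGAAGACGTTCATTCGTCAATTTCACAAGAATCGCCATCGCCAATCTACTTACCGTTACGATCCGCTCAGTTACGCTCTCAACTTCGACGAA"
    "GGTCCAGCTCTCGACGATCCTTTCAGCGAGGACTTCATGCGCCGCGACTTCTCCATGCGATTCGCCTCCATTCCGGCCTCCGCCAAGTCGTCCATGGACCTCGGCAAGGA"
    "TGGACCGTCCTTCATTTGA"
)
PROTEIN = (
    "MSVSQSPDLSSDVNGFHTDDLLHEAAFASRGCCFWLPCLRSNPSKSWWERIRAADNDDEWWLRGWKRFRDWSEIVAGPKWKTFIRQFHKNRHRQSTYRYDPLSYALNFDE"
    "GPALDDPFSEDFMRRDFSMRFASIPASAKSSMDLGKDGPSFI"
)
START, END = 20474315, 20474773  # 1-based inclusive coordinates on Carg_Chr04

print("CDS length matches genome span:", len(CDS) == END - START + 1)
print("Translation matches protein:", translate(CDS) == PROTEIN)
```

The mRNA and CDS sequences listed in this record are identical, so the same check applies to the mRNA sequence.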