CuGenDBv2

HG10022137 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10022137
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: NHL domain-containing protein, putative
Genome location: Chr05:21213370..21213825
RNA-Seq Expression: HG10022137
Synteny: HG10022137
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits | e value | % identity | Alignment
KAG6602448.1 hypothetical protein SDJN03_07681, partial [Cucurbita argyrosperma subsp. sororia] | 1.3e-73 | 83.55
Query:  MSLAPSPDLSSDENGFHGDD-LHDAAFVSRGCCFWIPCLRSSPSQTWWERIKAADNDDEWWLRGWKRFREWSEIIAGPKWKTFIRQFNKNRNRQSTFRYD
        MS++ SPDLSSD NGFH DD LH+AAF SRGCCFW+PCLRS+PS++WWERI+AADNDDEWWLRGWKRFR+WSEI+AGPKWKTFIRQF+KNR+RQST+RYD
Subjt:  MSLAPSPDLSSDENGFHGDD-LHDAAFVSRGCCFWIPCLRSSPSQTWWERIKAADNDDEWWLRGWKRFREWSEIIAGPKWKTFIRQFNKNRNRQSTFRYD

Query:  PLSYSLNFDEGPAHDDPFSDDFVRRDFSTRFAALPASAKSSMDLGKDGPSFI
        PLSY+LNFDEGPA DDPFS+DF+RRDFS RFA++PASAKSSMDLGKDGPSFI
Subjt:  PLSYSLNFDEGPAHDDPFSDDFVRRDFSTRFAALPASAKSSMDLGKDGPSFI

XP_016902318.1 PREDICTED: uncharacterized protein LOC107991628 [Cucumis melo] | 3.9e-75 | 86.09
Query:  MSLAPSPDLSSDENGFHGDDLHDAAFVSRGCCFWIPCLRSSPSQTWWERIKAADNDDEWWLRGWKRFREWSEIIAGPKWKTFIRQFNKNRNRQSTFRYDP
        MSLA SPD SSDENGFH +DLHDA F + GCCFWIPCLRS+ SQ+WWERI+AADNDDEWWLRGWKRFREWSEI+AGPKWKTFIRQF+KNRNRQSTFRYDP
Subjt:  MSLAPSPDLSSDENGFHGDDLHDAAFVSRGCCFWIPCLRSSPSQTWWERIKAADNDDEWWLRGWKRFREWSEIIAGPKWKTFIRQFNKNRNRQSTFRYDP

Query:  LSYSLNFDEGPAHDDPFSDDFVRRDFSTRFAALPASAKSSMDLGKDGPSFI
        LSYSLNFDEGPAH+D F+DDFVRRDFSTRFAA+PASAKSSMDL KD PS I
Subjt:  LSYSLNFDEGPAHDDPFSDDFVRRDFSTRFAALPASAKSSMDLGKDGPSFI

XP_023533538.1 uncharacterized protein LOC111795380 [Cucurbita pepo subsp. pepo] | 5.7e-74 | 84.21
Query:  MSLAPSPDLSSDENGFHGDD-LHDAAFVSRGCCFWIPCLRSSPSQTWWERIKAADNDDEWWLRGWKRFREWSEIIAGPKWKTFIRQFNKNRNRQSTFRYD
        MS++ SPDLSSD NGFH DD LH+AAF SRGCCFW+PCLRSSPS++WWERI+AADNDDEWWLRGWKRFR+WSEI+AGPKWKTFIRQF+KNR+RQST+RYD
Subjt:  MSLAPSPDLSSDENGFHGDD-LHDAAFVSRGCCFWIPCLRSSPSQTWWERIKAADNDDEWWLRGWKRFREWSEIIAGPKWKTFIRQFNKNRNRQSTFRYD

Query:  PLSYSLNFDEGPAHDDPFSDDFVRRDFSTRFAALPASAKSSMDLGKDGPSFI
        PLSY+LNFDEGPA DDPFS+DF+RRDFS RFA++PASAKSSMDLGKDGPSFI
Subjt:  PLSYSLNFDEGPAHDDPFSDDFVRRDFSTRFAALPASAKSSMDLGKDGPSFI

XP_031741765.1 uncharacterized protein LOC116403956 [Cucumis sativus] | 2.5e-77 | 88.82
Query:  MSLAPSPDLSSDENGF-HGDDLHDAAFVSRGCCFWIPCLRSSPSQTWWERIKAADNDDEWWLRGWKRFREWSEIIAGPKWKTFIRQFNKNRNRQSTFRYD
        MSLA SPDLSSDENGF H DDLHDA F +RGCC WIPCLRS+ SQ+WWERI+AADNDDEWWL+GWKRFREWSEI+AGPKWKTFIRQF+KNRNRQSTFRYD
Subjt:  MSLAPSPDLSSDENGF-HGDDLHDAAFVSRGCCFWIPCLRSSPSQTWWERIKAADNDDEWWLRGWKRFREWSEIIAGPKWKTFIRQFNKNRNRQSTFRYD

Query:  PLSYSLNFDEGPAHDDPFSDDFVRRDFSTRFAALPASAKSSMDLGKDGPSFI
        PLSYSLNFDEGPAHDDPF+DDFVRRDFSTRFAA+PASAKSSMDLGKD PSFI
Subjt:  PLSYSLNFDEGPAHDDPFSDDFVRRDFSTRFAALPASAKSSMDLGKDGPSFI

XP_038889212.1 uncharacterized protein LOC120079097 [Benincasa hispida] | 1.3e-81 | 93.38
Query:  MSLAPSPDLSSDENGFHGDDLHDAAFVSRGCCFWIPCLRSSPSQTWWERIKAADNDDEWWLRGWKRFREWSEIIAGPKWKTFIRQFNKNRNRQSTFRYDP
        MSLA SPDLSSDEN FHGDDLHDAAFVSRGCC W+PCLRS+PSQ+WWERI+AADNDDEWWLRGWKRFREWSEIIAGPKWKTFIRQFNKNRNRQSTFRYDP
Subjt:  MSLAPSPDLSSDENGFHGDDLHDAAFVSRGCCFWIPCLRSSPSQTWWERIKAADNDDEWWLRGWKRFREWSEIIAGPKWKTFIRQFNKNRNRQSTFRYDP

Query:  LSYSLNFDEGPAHDDPFSDDFVRRDFSTRFAALPASAKSSMDLGKDGPSFI
        LSYSLNFDEGPAHDDPFSDD VRRDFSTRFAA+PASAKSSMDLGKDGP FI
Subjt:  LSYSLNFDEGPAHDDPFSDDFVRRDFSTRFAALPASAKSSMDLGKDGPSFI

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KRY7 Uncharacterized protein | 1.2e-77 | 88.82
Query:  MSLAPSPDLSSDENGF-HGDDLHDAAFVSRGCCFWIPCLRSSPSQTWWERIKAADNDDEWWLRGWKRFREWSEIIAGPKWKTFIRQFNKNRNRQSTFRYD
        MSLA SPDLSSDENGF H DDLHDA F +RGCC WIPCLRS+ SQ+WWERI+AADNDDEWWL+GWKRFREWSEI+AGPKWKTFIRQF+KNRNRQSTFRYD
Subjt:  MSLAPSPDLSSDENGF-HGDDLHDAAFVSRGCCFWIPCLRSSPSQTWWERIKAADNDDEWWLRGWKRFREWSEIIAGPKWKTFIRQFNKNRNRQSTFRYD

Query:  PLSYSLNFDEGPAHDDPFSDDFVRRDFSTRFAALPASAKSSMDLGKDGPSFI
        PLSYSLNFDEGPAHDDPF+DDFVRRDFSTRFAA+PASAKSSMDLGKD PSFI
Subjt:  PLSYSLNFDEGPAHDDPFSDDFVRRDFSTRFAALPASAKSSMDLGKDGPSFI

A0A1S4E267 uncharacterized protein LOC107991628 | 1.9e-75 | 86.09
Query:  MSLAPSPDLSSDENGFHGDDLHDAAFVSRGCCFWIPCLRSSPSQTWWERIKAADNDDEWWLRGWKRFREWSEIIAGPKWKTFIRQFNKNRNRQSTFRYDP
        MSLA SPD SSDENGFH +DLHDA F + GCCFWIPCLRS+ SQ+WWERI+AADNDDEWWLRGWKRFREWSEI+AGPKWKTFIRQF+KNRNRQSTFRYDP
Subjt:  MSLAPSPDLSSDENGFHGDDLHDAAFVSRGCCFWIPCLRSSPSQTWWERIKAADNDDEWWLRGWKRFREWSEIIAGPKWKTFIRQFNKNRNRQSTFRYDP

Query:  LSYSLNFDEGPAHDDPFSDDFVRRDFSTRFAALPASAKSSMDLGKDGPSFI
        LSYSLNFDEGPAH+D F+DDFVRRDFSTRFAA+PASAKSSMDL KD PS I
Subjt:  LSYSLNFDEGPAHDDPFSDDFVRRDFSTRFAALPASAKSSMDLGKDGPSFI

A0A5A7T228 Putative NHL domain-containing protein | 1.0e-73 | 86.99
Query:  MSLAPSPDLSSDENGFHGDDLHDAAFVSRGCCFWIPCLRSSPSQTWWERIKAADNDDEWWLRGWKRFREWSEIIAGPKWKTFIRQFNKNRNRQSTFRYDP
        MSLA SPD SSDENGFH +DLHDA F + GCCFWIPCLRS+ SQ+WWERI+AADNDDEWWLRGWKRFREWSEI+AGPKWKTFIRQF+KNRNRQSTFRYDP
Subjt:  MSLAPSPDLSSDENGFHGDDLHDAAFVSRGCCFWIPCLRSSPSQTWWERIKAADNDDEWWLRGWKRFREWSEIIAGPKWKTFIRQFNKNRNRQSTFRYDP

Query:  LSYSLNFDEGPAHDDPFSDDFVRRDFSTRFAALPASAKSSMDLGKD
        LSYSLNFDEGPAH+D F+DDFVRRDFSTRFAA+PASAKSSMDL KD
Subjt:  LSYSLNFDEGPAHDDPFSDDFVRRDFSTRFAALPASAKSSMDLGKD

A0A6J1HFG4 uncharacterized protein LOC111462557 | 1.0e-73 | 83.55
Query:  MSLAPSPDLSSDENGFHGDD-LHDAAFVSRGCCFWIPCLRSSPSQTWWERIKAADNDDEWWLRGWKRFREWSEIIAGPKWKTFIRQFNKNRNRQSTFRYD
        MS+  SPDLSSD NGFH DD LH+AAF SRGCCFW+PCLRS+PS++WWERI+AADNDDEWWLRGWKRFR+WSEI+AGPKWKTFIRQF+KNR+RQST+RYD
Subjt:  MSLAPSPDLSSDENGFHGDD-LHDAAFVSRGCCFWIPCLRSSPSQTWWERIKAADNDDEWWLRGWKRFREWSEIIAGPKWKTFIRQFNKNRNRQSTFRYD

Query:  PLSYSLNFDEGPAHDDPFSDDFVRRDFSTRFAALPASAKSSMDLGKDGPSFI
        PLSY+LNFDEGPA DDPFS+DF+RRDFS RFA++PASAKSSMDLGKDGPSFI
Subjt:  PLSYSLNFDEGPAHDDPFSDDFVRRDFSTRFAALPASAKSSMDLGKDGPSFI

A0A6J1JTQ3 uncharacterized protein LOC111487422 | 1.8e-73 | 82.89
Query:  MSLAPSPDLSSDENGFHGDD-LHDAAFVSRGCCFWIPCLRSSPSQTWWERIKAADNDDEWWLRGWKRFREWSEIIAGPKWKTFIRQFNKNRNRQSTFRYD
        MS++ SPDLSSD NGFH DD LH+AAF SRGCCFW+PCLRS+PS+TWWERI+AADNDDEWWLRGWKRFR+WSEI+AGPKWKTFIRQF+KNR+RQST+RYD
Subjt:  MSLAPSPDLSSDENGFHGDD-LHDAAFVSRGCCFWIPCLRSSPSQTWWERIKAADNDDEWWLRGWKRFREWSEIIAGPKWKTFIRQFNKNRNRQSTFRYD

Query:  PLSYSLNFDEGPAHDDPFSDDFVRRDFSTRFAALPASAKSSMDLGKDGPSFI
        PLSY+LNFD+GPA DDPF +DF+RRDFS RFA++PASAKSSMDLGKDGPSFI
Subjt:  PLSYSLNFDEGPAHDDPFSDDFVRRDFSTRFAALPASAKSSMDLGKDGPSFI

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT3G01430.1 BEST Arabidopsis thaliana protein match is: NHL domain-containing protein (TAIR:AT5G14890.1) | 2.1e-34 | 46.01
Query:  DDLHDAAFVSRGCCFWIPCLRSSPSQT-----WWERIKAADN---DDEWWLRGWKRFREWSEIIAGPKWKTFIRQFNKNR--------------------
        DD+H+A F  RGCCF +PCL SS   T     WW+RI   D    D+ WW+RGW+R REWSE++AGP+WKT+IR+F ++                     
Subjt:  DDLHDAAFVSRGCCFWIPCLRSSPSQT-----WWERIKAADN---DDEWWLRGWKRFREWSEIIAGPKWKTFIRQFNKNR--------------------

Query:  --NR---QSTFRYDPLSYSLNFDEGPAHDDPFSDDFVRRDFSTRFAA--LPASAKSSMDLGKD
          NR   Q  FRYD LSYSLNFD+G      F D+F  RD+S RFAA  LP S K S+D   D
Subjt:  --NR---QSTFRYDPLSYSLNFDEGPAHDDPFSDDFVRRDFSTRFAA--LPASAKSSMDLGKD

AT3G48020.1 unknown protein | 1.0e-20 | 40.68
Query:  CLRSSPSQTWWERIKAADNDD-EWWLRGWKRFREWSEIIAGPKWKTFIRQFNKNRNR------QSTFRYDPLSYSLNFDEGPAHDDPFSDDFVRRDFSTR
        C  ++   +WW+RI   ++ +  WW+R + + REWSEI+AGP+WKTFIR+FN++  R         FRYDP+SY+L+F++    DD  +     R FS R
Subjt:  CLRSSPSQTWWERIKAADNDD-EWWLRGWKRFREWSEIIAGPKWKTFIRQFNKNRNR------QSTFRYDPLSYSLNFDEGPAHDDPFSDDFVRRDFSTR

Query:  FAALP-ASAKSSMDLGKD
        +A++P AS KS   +  D
Subjt:  FAALP-ASAKSSMDLGKD

AT5G14890.1 NHL domain-containing protein | 1.6e-34 | 49.33
Query:  DDLHDAAFVSRGCCFWIPCLRSS-PS----QTWWERIKAADN---DDEWWLRGWKRFREWSEIIAGPKWKTFIRQFNK------------NRNRQSTFRY
        D++H+A F  RGCCF +PCL SS PS      WW+RI+  D    D+ WW+ GW + REWSEI+AGPKWKTFIR+F +            NR    +FRY
Subjt:  DDLHDAAFVSRGCCFWIPCLRSS-PS----QTWWERIKAADN---DDEWWLRGWKRFREWSEIIAGPKWKTFIRQFNK------------NRNRQSTFRY

Query:  DPLSYSLNFDEGPAHDDPFSDDFVRRDFSTRFAA--LPASAKSSMDLGKD
        D  SYSLNFD+G      F D+F  RD+S RFAA  LP S K S+D   D
Subjt:  DPLSYSLNFDEGPAHDDPFSDDFVRRDFSTRFAA--LPASAKSSMDLGKD

AT5G25240.1 unknown protein | 6.8e-09 | 33.06
Query:  LSSDENGFHGDDLHDAA-------FVSRGCCFWIPCLRSSPSQTWWERIKAADNDDEWWLRGWKRFREWSEIIAGPKWKTFIRQFNKNR---NRQSTFRY
        +++D      DD  + A       F S     W      S S+  W      +    W     K  +E SE IAGPKWK FIR F+  R    R   F Y
Subjt:  LSSDENGFHGDDLHDAA-------FVSRGCCFWIPCLRSSPSQTWWERIKAADNDDEWWLRGWKRFREWSEIIAGPKWKTFIRQFNKNR---NRQSTFRY

Query:  DPLSYSLNFDEGPAHDDPFSDDFV
        D  +YSLNFD+G    D   + FV
Subjt:  DPLSYSLNFDEGPAHDDPFSDDFV

AT5G62865.1 unknown protein | 1.6e-21 | 43.38
Query:  DAAFVSRGCCFWIPCLRSSPSQT-----WWERIKAAD---------NDDEWWLRGWKRFREWSEIIAGPKWKTFIRQFNKNRNR------QSTFRYDPLS
        D  +  R CCF  P  R S S T      W RI+  D         ++  WW+R   + REWSEI+AGP+WKTFIR+FN++  R         F+YDPLS
Subjt:  DAAFVSRGCCFWIPCLRSSPSQT-----WWERIKAAD---------NDDEWWLRGWKRFREWSEIIAGPKWKTFIRQFNKNRNR------QSTFRYDPLS

Query:  YSLNFDEGPAHDDPFSDDFV----RRDFSTRFAALP
        YSLNFD     DD   D++V     R FSTRFA++P
Subjt:  YSLNFDEGPAHDDPFSDDFV----RRDFSTRFAALP


Sequences
CDS sequence
ATGTCTCTGGCGCCATCTCCCGACCTCTCTTCCGACGAAAATGGCTTCCATGGCGATGACCTTCACGACGCCGCCTTCGTTAGCCGTGGCTGTTGCTTCTGGATC
CCTTGCCTGAGATCCAGTCCCTCGCAGACTTGGTGGGAGCGGATTAAGGCAGCCGATAACGACGATGAATGGTGGCTTAGAGGCTGGAAGAGGTTTCGCGAGTGG
TCGGAAATTATCGCTGGCCCTAAATGGAAGACTTTCATTCGTCAATTCAACAAGAATCGCAATCGTCAATCCACTTTCCGTTACGATCCTCTCAGTTACTCACTT
AATTTCGACGAAGGTCCAGCCCACGACGATCCTTTCAGCGACGATTTTGTACGCCGTGATTTCTCCACTCGATTCGCCGCCCTTCCGGCCTCCGCCAAGTCGTCT
ATGGACCTCGGGAAGGACGGACCATCCTTCATTTGA
mRNA sequence
ATGTCTCTGGCGCCATCTCCCGACCTCTCTTCCGACGAAAATGGCTTCCATGGCGATGACCTTCACGACGCCGCCTTCGTTAGCCGTGGCTGTTGCTTCTGGATC
CCTTGCCTGAGATCCAGTCCCTCGCAGACTTGGTGGGAGCGGATTAAGGCAGCCGATAACGACGATGAATGGTGGCTTAGAGGCTGGAAGAGGTTTCGCGAGTGG
TCGGAAATTATCGCTGGCCCTAAATGGAAGACTTTCATTCGTCAATTCAACAAGAATCGCAATCGTCAATCCACTTTCCGTTACGATCCTCTCAGTTACTCACTT
AATTTCGACGAAGGTCCAGCCCACGACGATCCTTTCAGCGACGATTTTGTACGCCGTGATTTCTCCACTCGATTCGCCGCCCTTCCGGCCTCCGCCAAGTCGTCT
ATGGACCTCGGGAAGGACGGACCATCCTTCATTTGA
Protein sequence
MSLAPSPDLSSDENGFHGDDLHDAAFVSRGCCFWIPCLRSSPSQTWWERIKAADNDDEWWLRGWKRFREWSEIIAGPKWKTFIRQFNKNRNRQSTFRYDPLSYSL
NFDEGPAHDDPFSDDFVRRDFSTRFAALPASAKSSMDLGKDGPSFI
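
The record lists a genome location, a CDS, and a protein sequence, and these can be cross-checked against one another. The following is a minimal Python sketch, not part of CuGenDB, assuming the standard genetic code and an inclusive `Chr:start..end` coordinate convention. The helper names `translate` and `span_length` are hypothetical and chosen only for illustration.

```python
# Standard genetic code, with bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X marks unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def span_length(location: str) -> int:
    """Length of an inclusive 'Chr:start..end' interval (assumed convention)."""
    _, coords = location.split(":")
    start, end = (int(x) for x in coords.split(".."))
    return end - start + 1


# Genome location from the record above.
print(span_length("Chr05:21213370..21213825"))  # 456
# First eight codons of the CDS above.
print(translate("ATGTCTCTGGCGCCATCTCCCGAC"))  # MSLAPSPD
```

A span of 456 nt is divisible by three and corresponds to 152 codons including the stop codon. Passing the full CDS to `translate` allows a direct comparison with the protein sequence listed above.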