CuGenDBv2

CSPI05G28230 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI05G28230
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: NHL domain protein
Genome location: Chr5:26730369..26731191
RNA-Seq Expression: CSPI05G28230
Synteny: CSPI05G28230
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAA0037502.1 putative NHL domain-containing protein [Cucumis melo var. makuwa] (e value: 1.0e-76; identity: 92.57%)
Query:  MSLAHSPDLSSDENGFLHNDDLHDAVFATRGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLKGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYD
        MSLA SPD SSDENGF H++DLHDAVFAT GCCFWIPCLRSNSSQSWWERIRAADNDDEWWL+GWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYD
Subjt:  MSLAHSPDLSSDENGFLHNDDLHDAVFATRGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLKGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYD

Query:  PLSYSLNFDEGPAHDDPFTDDFVRRDFSTRFAAIPASAKSSMDLGKDS
        PLSYSLNFDEGPAH+D FTDDFVRRDFSTRFAAIPASAKSSMDL KD+
Subjt:  PLSYSLNFDEGPAHDDPFTDDFVRRDFSTRFAAIPASAKSSMDLGKDS

KAG6602448.1 hypothetical protein SDJN03_07681, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.0e-72; identity: 82.89%)
Query:  MSLAHSPDLSSDENGFLHNDDLHDAVFATRGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLKGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYD
        MS++ SPDLSSD NGF  +D LH+A FA+RGCCFW+PCLRSN S+SWWERIRAADNDDEWWL+GWKRFR+WSEIVAGPKWKTFIRQFHKNR+RQST+RYD
Subjt:  MSLAHSPDLSSDENGFLHNDDLHDAVFATRGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLKGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYD

Query:  PLSYSLNFDEGPAHDDPFTDDFVRRDFSTRFAAIPASAKSSMDLGKDSPSFI
        PLSY+LNFDEGPA DDPF++DF+RRDFS RFA+IPASAKSSMDLGKD PSFI
Subjt:  PLSYSLNFDEGPAHDDPFTDDFVRRDFSTRFAAIPASAKSSMDLGKDSPSFI

XP_016902318.1 PREDICTED: uncharacterized protein LOC107991628 [Cucumis melo] (e value: 8.2e-79; identity: 92.76%)
Query:  MSLAHSPDLSSDENGFLHNDDLHDAVFATRGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLKGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYD
        MSLA SPD SSDENGF H++DLHDAVFAT GCCFWIPCLRSNSSQSWWERIRAADNDDEWWL+GWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYD
Subjt:  MSLAHSPDLSSDENGFLHNDDLHDAVFATRGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLKGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYD

Query:  PLSYSLNFDEGPAHDDPFTDDFVRRDFSTRFAAIPASAKSSMDLGKDSPSFI
        PLSYSLNFDEGPAH+D FTDDFVRRDFSTRFAAIPASAKSSMDL KDSPS I
Subjt:  PLSYSLNFDEGPAHDDPFTDDFVRRDFSTRFAAIPASAKSSMDLGKDSPSFI

XP_031741765.1 uncharacterized protein LOC116403956 [Cucumis sativus] (e value: 1.4e-86; identity: 99.34%)
Query:  MSLAHSPDLSSDENGFLHNDDLHDAVFATRGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLKGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYD
        MSLAHSPDLSSDENGFLHNDDLHDAVFATRGCC WIPCLRSNSSQSWWERIRAADNDDEWWLKGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYD
Subjt:  MSLAHSPDLSSDENGFLHNDDLHDAVFATRGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLKGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYD

Query:  PLSYSLNFDEGPAHDDPFTDDFVRRDFSTRFAAIPASAKSSMDLGKDSPSFI
        PLSYSLNFDEGPAHDDPFTDDFVRRDFSTRFAAIPASAKSSMDLGKDSPSFI
Subjt:  PLSYSLNFDEGPAHDDPFTDDFVRRDFSTRFAAIPASAKSSMDLGKDSPSFI

XP_038889212.1 uncharacterized protein LOC120079097 [Benincasa hispida] (e value: 1.0e-76; identity: 88.82%)
Query:  MSLAHSPDLSSDENGFLHNDDLHDAVFATRGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLKGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYD
        MSLA SPDLSSDEN F H DDLHDA F +RGCC W+PCLRSN SQSWWERIRAADNDDEWWL+GWKRFREWSEI+AGPKWKTFIRQF+KNRNRQSTFRYD
Subjt:  MSLAHSPDLSSDENGFLHNDDLHDAVFATRGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLKGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYD

Query:  PLSYSLNFDEGPAHDDPFTDDFVRRDFSTRFAAIPASAKSSMDLGKDSPSFI
        PLSYSLNFDEGPAHDDPF+DD VRRDFSTRFAAIPASAKSSMDLGKD P FI
Subjt:  PLSYSLNFDEGPAHDDPFTDDFVRRDFSTRFAAIPASAKSSMDLGKDSPSFI

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KRY7 Uncharacterized protein (e value: 6.8e-87; identity: 99.34%)
Query:  MSLAHSPDLSSDENGFLHNDDLHDAVFATRGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLKGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYD
        MSLAHSPDLSSDENGFLHNDDLHDAVFATRGCC WIPCLRSNSSQSWWERIRAADNDDEWWLKGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYD
Subjt:  MSLAHSPDLSSDENGFLHNDDLHDAVFATRGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLKGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYD

Query:  PLSYSLNFDEGPAHDDPFTDDFVRRDFSTRFAAIPASAKSSMDLGKDSPSFI
        PLSYSLNFDEGPAHDDPFTDDFVRRDFSTRFAAIPASAKSSMDLGKDSPSFI
Subjt:  PLSYSLNFDEGPAHDDPFTDDFVRRDFSTRFAAIPASAKSSMDLGKDSPSFI

A0A1S4E267 uncharacterized protein LOC107991628 (e value: 4.0e-79; identity: 92.76%)
Query:  MSLAHSPDLSSDENGFLHNDDLHDAVFATRGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLKGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYD
        MSLA SPD SSDENGF H++DLHDAVFAT GCCFWIPCLRSNSSQSWWERIRAADNDDEWWL+GWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYD
Subjt:  MSLAHSPDLSSDENGFLHNDDLHDAVFATRGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLKGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYD

Query:  PLSYSLNFDEGPAHDDPFTDDFVRRDFSTRFAAIPASAKSSMDLGKDSPSFI
        PLSYSLNFDEGPAH+D FTDDFVRRDFSTRFAAIPASAKSSMDL KDSPS I
Subjt:  PLSYSLNFDEGPAHDDPFTDDFVRRDFSTRFAAIPASAKSSMDLGKDSPSFI

A0A5A7T228 Putative NHL domain-containing protein (e value: 4.9e-77; identity: 92.57%)
Query:  MSLAHSPDLSSDENGFLHNDDLHDAVFATRGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLKGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYD
        MSLA SPD SSDENGF H++DLHDAVFAT GCCFWIPCLRSNSSQSWWERIRAADNDDEWWL+GWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYD
Subjt:  MSLAHSPDLSSDENGFLHNDDLHDAVFATRGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLKGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYD

Query:  PLSYSLNFDEGPAHDDPFTDDFVRRDFSTRFAAIPASAKSSMDLGKDS
        PLSYSLNFDEGPAH+D FTDDFVRRDFSTRFAAIPASAKSSMDL KD+
Subjt:  PLSYSLNFDEGPAHDDPFTDDFVRRDFSTRFAAIPASAKSSMDLGKDS

A0A6J1HFG4 uncharacterized protein LOC111462557 (e value: 1.6e-72; identity: 82.89%)
Query:  MSLAHSPDLSSDENGFLHNDDLHDAVFATRGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLKGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYD
        MS+  SPDLSSD NGF  +D LH+A FA+RGCCFW+PCLRSN S+SWWERIRAADNDDEWWL+GWKRFR+WSEIVAGPKWKTFIRQFHKNR+RQST+RYD
Subjt:  MSLAHSPDLSSDENGFLHNDDLHDAVFATRGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLKGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYD

Query:  PLSYSLNFDEGPAHDDPFTDDFVRRDFSTRFAAIPASAKSSMDLGKDSPSFI
        PLSY+LNFDEGPA DDPF++DF+RRDFS RFA+IPASAKSSMDLGKD PSFI
Subjt:  PLSYSLNFDEGPAHDDPFTDDFVRRDFSTRFAAIPASAKSSMDLGKDSPSFI

A0A6J1JTQ3 uncharacterized protein LOC111487422 (e value: 8.0e-72; identity: 81.58%)
Query:  MSLAHSPDLSSDENGFLHNDDLHDAVFATRGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLKGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYD
        MS++ SPDLSSD NGF  +D LH+A FA+RGCCFW+PCLRSN S++WWERIRAADNDDEWWL+GWKRFR+WSEIVAGPKWKTFIRQFHKNR+RQST+RYD
Subjt:  MSLAHSPDLSSDENGFLHNDDLHDAVFATRGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLKGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYD

Query:  PLSYSLNFDEGPAHDDPFTDDFVRRDFSTRFAAIPASAKSSMDLGKDSPSFI
        PLSY+LNFD+GPA DDPF +DF+RRDFS RFA+IPASAKSSMDLGKD PSFI
Subjt:  PLSYSLNFDEGPAHDDPFTDDFVRRDFSTRFAAIPASAKSSMDLGKDSPSFI

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT3G01430.1 BEST Arabidopsis thaliana protein match is: NHL domain-containing protein (TAIR:AT5G14890.1) (e value: 2.5e-33; identity: 44.79%)
Query:  DDLHDAVFATRGCCFWIPCLRSNSSQS-----WWERIRAADN---DDEWWLKGWKRFREWSEIVAGPKWKTFIRQFHKNR--------------------
        DD+H+A+FA RGCCF +PCL S+   +     WW+RI   D    D+ WW++GW+R REWSE+VAGP+WKT+IR+F ++                     
Subjt:  DDLHDAVFATRGCCFWIPCLRSNSSQS-----WWERIRAADN---DDEWWLKGWKRFREWSEIVAGPKWKTFIRQFHKNR--------------------

Query:  --NR---QSTFRYDPLSYSLNFDEGPAHDDPFTDDFVRRDFSTRFAA--IPASAKSSMDLGKD
          NR   Q  FRYD LSYSLNFD+G      F D+F  RD+S RFAA  +P S K S+D   D
Subjt:  --NR---QSTFRYDPLSYSLNFDEGPAHDDPFTDDFVRRDFSTRFAA--IPASAKSSMDLGKD

AT3G48020.1 unknown protein (e value: 3.2e-20; identity: 41.18%)
Query:  CLRSNSSQSWWERI-RAADNDDEWWLKGWKRFREWSEIVAGPKWKTFIRQFHKNRNR------QSTFRYDPLSYSLNFDEGPAHDDPFTDDFVRRDFSTR
        C  +    SWW+RI R    +  WW++ + + REWSEIVAGP+WKTFIR+F+++  R         FRYDP+SY+L+F++    DD        R FS R
Subjt:  CLRSNSSQSWWERI-RAADNDDEWWLKGWKRFREWSEIVAGPKWKTFIRQFHKNRNR------QSTFRYDPLSYSLNFDEGPAHDDPFTDDFVRRDFSTR

Query:  FAAIP-ASAKSSMDLGKDS
        +A++P AS KS   +  D+
Subjt:  FAAIP-ASAKSSMDLGKDS

AT5G14890.1 NHL domain-containing protein (e value: 1.3e-34; identity: 48.34%)
Query:  DDLHDAVFATRGCCFWIPCLRSN-----SSQSWWERIRAADN---DDEWWLKGWKRFREWSEIVAGPKWKTFIRQFHK------------NRNRQSTFRY
        D++H+A+FA RGCCF +PCL S+     +   WW+RIR  D    D+ WW+ GW + REWSEIVAGPKWKTFIR+F +            NR    +FRY
Subjt:  DDLHDAVFATRGCCFWIPCLRSN-----SSQSWWERIRAADN---DDEWWLKGWKRFREWSEIVAGPKWKTFIRQFHK------------NRNRQSTFRY

Query:  DPLSYSLNFDEGPAHDDPFTDDFVRRDFSTRFAA--IPASAKSSMDLGKDS
        D  SYSLNFD+G      F D+F  RD+S RFAA  +P S K S+D   D+
Subjt:  DPLSYSLNFDEGPAHDDPFTDDFVRRDFSTRFAA--IPASAKSSMDLGKDS

AT5G25240.1 unknown protein (e value: 1.2e-08; identity: 31.75%)
Query:  LSSDENGFLHNDDLHDAVFATRGCCF--------WIPCLRSNSSQSWWERIRAADNDDEWWLKGWKRFREWSEIVAGPKWKTFIRQF---HKNRNRQSTF
        +++D    L +D    A F   GC +        W      + S+  W      +    W  +  K  +E SE +AGPKWK FIR F    K   R   F
Subjt:  LSSDENGFLHNDDLHDAVFATRGCCF--------WIPCLRSNSSQSWWERIRAADNDDEWWLKGWKRFREWSEIVAGPKWKTFIRQF---HKNRNRQSTF

Query:  RYDPLSYSLNFDEGPAHDDPFTDDFV
         YD  +YSLNFD+G    D   + FV
Subjt:  RYDPLSYSLNFDEGPAHDDPFTDDFV

AT5G62865.1 unknown protein (e value: 1.9e-20; identity: 42.65%)
Query:  DAVFATRGCCFWIPCLRSNSSQ-----SWWERIRAAD---------NDDEWWLKGWKRFREWSEIVAGPKWKTFIRQFHKNRNR------QSTFRYDPLS
        D  +  R CCF  P  R + S      S W RIR  D         ++  WW++   + REWSEIVAGP+WKTFIR+F+++  R         F+YDPLS
Subjt:  DAVFATRGCCFWIPCLRSNSSQ-----SWWERIRAAD---------NDDEWWLKGWKRFREWSEIVAGPKWKTFIRQFHKNRNR------QSTFRYDPLS

Query:  YSLNFDEGPAHDDPFTDDFV----RRDFSTRFAAIP
        YSLNFD     DD   D++V     R FSTRFA++P
Subjt:  YSLNFDEGPAHDDPFTDDFV----RRDFSTRFAAIP
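
The % identity values listed for the hits above can be cross-checked against the alignment midlines (the unlabeled line between Query and Subjt), assuming the usual BLAST convention that a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. For example, the XP_031741765.1 midline has 151 letters over 152 aligned columns, which gives 99.34. The sketch below is a minimal illustration under that assumption; the function name `percent_identity` is hypothetical and not part of CuGenDB.

```python
from typing import Optional


def percent_identity(midline: str, alignment_length: Optional[int] = None) -> float:
    """Percent identity from a BLAST-style midline.

    Letters in the midline count as identities. '+' and spaces do not.
    Pass alignment_length (for example, the length of the Query line)
    when trailing spaces of the midline may have been stripped.
    """
    length = len(midline) if alignment_length is None else alignment_length
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / length, 2)


# Toy midline: 7 identical columns out of 9.
print(percent_identity("MSLA SPD+"))  # 77.78

# Counts taken from the XP_031741765.1 alignment: 151 identities, 152 columns.
print(round(100.0 * 151 / 152, 2))  # 99.34
```

Concatenate the midlines of all blocks of one hit before calling the function, and pass the summed Query-line lengths as `alignment_length`, since a midline that ends in a mismatch loses its trailing space in plain text.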


Sequences
CDS sequence
GATTATATAATTAATTACTTTCTTTCCCAAAAAACAAAAAAAAACAAAAAAAACAAAAAACCTCTCCGTTTTAGATTCATAGAGAATAGAGAAGAATCTCTGTGCAAATT
GATTTGCAGAAATTGGATTCAATTTCTGCAAATCAATTCCAACAGATCGAATTACCTTCCTTTGATGTCTCTTGCCCATTCTCCCGACCTCTCTTCCGACGAAAATGGCT
TCCTGCATAACGATGACCTTCACGACGCTGTTTTCGCTACCCGTGGTTGTTGTTTTTGGATCCCTTGCTTGAGGTCCAATTCCTCACAGTCATGGTGGGAGCGAATTAGG
GCTGCCGATAACGACGATGAATGGTGGCTTAAAGGCTGGAAGAGATTTCGCGAATGGTCTGAAATCGTCGCTGGCCCCAAATGGAAAACCTTCATTCGTCAATTTCATAA
GAATCGTAACCGTCAGTCTACTTTCCGTTACGATCCCCTCAGTTATTCTCTTAATTTCGATGAAGGTCCGGCTCATGACGATCCTTTCACCGATGATTTTGTACGACGTG
ATTTCTCTACTCGATTCGCCGCCATTCCCGCTTCCGCTAAATCGTCTATGGACCTTGGTAAGGACAGTCCGTCCTTCATTTGA
mRNA sequence
TGATTATATAATTAATTACTTTCTTTCCCAAAAAACAAAAAAAAACAAAAAAAACAAAAAACCTCTCCGTTTTAGATTCATAGAGAATAGAGAAGAATCTCTGTGCAAAT
TGATTTGCAGAAATTGGATTCAATTTCTGCAAATCAATTCCAACAGATCGAATTACCTTCCTTTGATGTCTCTTGCCCATTCTCCCGACCTCTCTTCCGACGAAAATGGC
TTCCTGCATAACGATGACCTTCACGACGCTGTTTTCGCTACCCGTGGTTGTTGTTTTTGGATCCCTTGCTTGAGGTCCAATTCCTCACAGTCATGGTGGGAGCGAATTAG
GGCTGCCGATAACGACGATGAATGGTGGCTTAAAGGCTGGAAGAGATTTCGCGAATGGTCTGAAATCGTCGCTGGCCCCAAATGGAAAACCTTCATTCGTCAATTTCATA
AGAATCGTAACCGTCAGTCTACTTTCCGTTACGATCCCCTCAGTTATTCTCTTAATTTCGATGAAGGTCCGGCTCATGACGATCCTTTCACCGATGATTTTGTACGACGT
GATTTCTCTACTCGATTCGCCGCCATTCCCGCTTCCGCTAAATCGTCTATGGACCTTGGTAAGGACAGTCCGTCCTTCATTTGAGTCCGGAGTTGCGGCGGAATAGACGG
CATTGCCTTTTGGAATCTCGGTCTTCTCGGTTGGGTCTTGCCGGAGCCCCCGTACGCGGTGGCCTTCGGAGCAGCGGTGGAGAGATTAGCGTTTTTGGGGACGATGGAAA
CTATGAATCTGAATTCGGTTAATTTCTCGCCATCGTAAAAATTTAAAAAGTCG
Protein sequence
DYIINYFLSQKTKKNKKNKKPLRFRFIENREESLCKLICRNWIQFLQINSNRSNYLPLMSLAHSPDLSSDENGFLHNDDLHDAVFATRGCCFWIPCLRSNSSQSWWERIR
AADNDDEWWLKGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYDPLSYSLNFDEGPAHDDPFTDDFVRRDFSTRFAAIPASAKSSMDLGKDSPSFI
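
The listed protein sequence can be reproduced from the CDS sequence by translating it in reading frame 0 with the standard genetic code: the first ten codons of the CDS (GATTATATAATTAATTACTTTCTTTCCCAA) give DYIINYFLSQ, the first ten residues shown above. The following is a minimal sketch of that check, not the pipeline CuGenDB uses; `translate` and `CODON_TABLE` are hypothetical names, and the standard code (NCBI translation table 1) is an assumption.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# at the first, second, and third positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a DNA string in frame 0.

    Unknown codons (for example, ones containing N) become 'X'.
    With to_stop=True, translation ends at the first stop codon.
    """
    dna = dna.upper().replace("U", "T")
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS sequence listed above.
print(translate("GATTATATAATTAATTACTTTCTTTCCCAA"))  # DYIINYFLSQ

# Last four codons of the CDS sequence, ending in the TGA stop codon.
print(translate("TCCTTCATTTGA"))  # SFI
```

To run the check on the whole record, join the CDS lines into one string with line breaks removed and compare the result with the joined protein lines.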