CuGenDBv2

Pay0020300 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0020300
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: NHL domain protein
Genome location: chr09:3335625..3336675
RNA-Seq Expression: Pay0020300
Synteny: Pay0020300
Gene Ontology terms: NA
InterPro domains: NA
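
The genome location above uses a `chrom:start..end` layout. A minimal sketch of splitting it into its parts follows; the function names are hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re


def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1


chrom, start, end = parse_location("chr09:3335625..3336675")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the locus spans 1051 bp.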


Homology

GenBank top hits (e value, %identity, alignment)

KAA0037502.1 putative NHL domain-containing protein [Cucumis melo var. makuwa] (e value: 1.3e-81; %identity: 97.96)
Query:  MSLAQSPDFSSDENGFHSNDLHDAVFATHGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLRGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYDP
        MSLAQSPDFSSDENGFHSNDLHDAVFATHGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLRGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYDP
Subjt:  MSLAQSPDFSSDENGFHSNDLHDAVFATHGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLRGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYDP

Query:  LSYSLNFDEGPAREDPFTDDFVRRDFSTRFAAIPASAKSSMDLVKDS
        LSYSLNFDEGPA ED FTDDFVRRDFSTRFAAIPASAKSSMDLVKD+
Subjt:  LSYSLNFDEGPAREDPFTDDFVRRDFSTRFAAIPASAKSSMDLVKDS

KAG6602448.1 hypothetical protein SDJN03_07681, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.3e-70; %identity: 81.58)
Query:  MSLAQSPDFSSDENGFHSND-LHDAVFATHGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLRGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYD
        MS++QSPD SSD NGFH++D LH+A FA+ GCCFW+PCLRSN S+SWWERIRAADNDDEWWLRGWKRFR+WSEIVAGPKWKTFIRQFHKNR+RQST+RYD
Subjt:  MSLAQSPDFSSDENGFHSND-LHDAVFATHGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLRGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYD

Query:  PLSYSLNFDEGPAREDPFTDDFVRRDFSTRFAAIPASAKSSMDLVKDSPSLI
        PLSY+LNFDEGPA +DPF++DF+RRDFS RFA+IPASAKSSMDL KD PS I
Subjt:  PLSYSLNFDEGPAREDPFTDDFVRRDFSTRFAAIPASAKSSMDLVKDSPSLI

XP_016902318.1 PREDICTED: uncharacterized protein LOC107991628 [Cucumis melo] (e value: 4.6e-84; %identity: 98.68)
Query:  MSLAQSPDFSSDENGFHSNDLHDAVFATHGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLRGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYDP
        MSLAQSPDFSSDENGFHSNDLHDAVFATHGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLRGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYDP
Subjt:  MSLAQSPDFSSDENGFHSNDLHDAVFATHGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLRGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYDP

Query:  LSYSLNFDEGPAREDPFTDDFVRRDFSTRFAAIPASAKSSMDLVKDSPSLI
        LSYSLNFDEGPA ED FTDDFVRRDFSTRFAAIPASAKSSMDLVKDSPSLI
Subjt:  LSYSLNFDEGPAREDPFTDDFVRRDFSTRFAAIPASAKSSMDLVKDSPSLI

XP_031741765.1 uncharacterized protein LOC116403956 [Cucumis sativus] (e value: 8.5e-78; %identity: 92.11)
Query:  MSLAQSPDFSSDENGF-HSNDLHDAVFATHGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLRGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYD
        MSLA SPD SSDENGF H++DLHDAVFAT GCC WIPCLRSNSSQSWWERIRAADNDDEWWL+GWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYD
Subjt:  MSLAQSPDFSSDENGF-HSNDLHDAVFATHGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLRGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYD

Query:  PLSYSLNFDEGPAREDPFTDDFVRRDFSTRFAAIPASAKSSMDLVKDSPSLI
        PLSYSLNFDEGPA +DPFTDDFVRRDFSTRFAAIPASAKSSMDL KDSPS I
Subjt:  PLSYSLNFDEGPAREDPFTDDFVRRDFSTRFAAIPASAKSSMDLVKDSPSLI

XP_038889212.1 uncharacterized protein LOC120079097 [Benincasa hispida] (e value: 5.7e-74; %identity: 86.09)
Query:  MSLAQSPDFSSDENGFHSNDLHDAVFATHGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLRGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYDP
        MSLAQSPD SSDEN FH +DLHDA F + GCC W+PCLRSN SQSWWERIRAADNDDEWWLRGWKRFREWSEI+AGPKWKTFIRQF+KNRNRQSTFRYDP
Subjt:  MSLAQSPDFSSDENGFHSNDLHDAVFATHGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLRGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYDP

Query:  LSYSLNFDEGPAREDPFTDDFVRRDFSTRFAAIPASAKSSMDLVKDSPSLI
        LSYSLNFDEGPA +DPF+DD VRRDFSTRFAAIPASAKSSMDL KD P  I
Subjt:  LSYSLNFDEGPAREDPFTDDFVRRDFSTRFAAIPASAKSSMDLVKDSPSLI

TrEMBL top hits (e value, %identity, alignment)

A0A0A0KRY7 Uncharacterized protein (e value: 4.1e-78; %identity: 92.11)
Query:  MSLAQSPDFSSDENGF-HSNDLHDAVFATHGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLRGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYD
        MSLA SPD SSDENGF H++DLHDAVFAT GCC WIPCLRSNSSQSWWERIRAADNDDEWWL+GWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYD
Subjt:  MSLAQSPDFSSDENGF-HSNDLHDAVFATHGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLRGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYD

Query:  PLSYSLNFDEGPAREDPFTDDFVRRDFSTRFAAIPASAKSSMDLVKDSPSLI
        PLSYSLNFDEGPA +DPFTDDFVRRDFSTRFAAIPASAKSSMDL KDSPS I
Subjt:  PLSYSLNFDEGPAREDPFTDDFVRRDFSTRFAAIPASAKSSMDLVKDSPSLI

A0A1S4E267 uncharacterized protein LOC107991628 (e value: 2.2e-84; %identity: 98.68)
Query:  MSLAQSPDFSSDENGFHSNDLHDAVFATHGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLRGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYDP
        MSLAQSPDFSSDENGFHSNDLHDAVFATHGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLRGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYDP
Subjt:  MSLAQSPDFSSDENGFHSNDLHDAVFATHGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLRGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYDP

Query:  LSYSLNFDEGPAREDPFTDDFVRRDFSTRFAAIPASAKSSMDLVKDSPSLI
        LSYSLNFDEGPA ED FTDDFVRRDFSTRFAAIPASAKSSMDLVKDSPSLI
Subjt:  LSYSLNFDEGPAREDPFTDDFVRRDFSTRFAAIPASAKSSMDLVKDSPSLI

A0A5A7T228 Putative NHL domain-containing protein (e value: 6.1e-82; %identity: 97.96)
Query:  MSLAQSPDFSSDENGFHSNDLHDAVFATHGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLRGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYDP
        MSLAQSPDFSSDENGFHSNDLHDAVFATHGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLRGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYDP
Subjt:  MSLAQSPDFSSDENGFHSNDLHDAVFATHGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLRGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYDP

Query:  LSYSLNFDEGPAREDPFTDDFVRRDFSTRFAAIPASAKSSMDLVKDS
        LSYSLNFDEGPA ED FTDDFVRRDFSTRFAAIPASAKSSMDLVKD+
Subjt:  LSYSLNFDEGPAREDPFTDDFVRRDFSTRFAAIPASAKSSMDLVKDS

A0A6J1HFG4 uncharacterized protein LOC111462557 (e value: 1.1e-70; %identity: 81.58)
Query:  MSLAQSPDFSSDENGFHSND-LHDAVFATHGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLRGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYD
        MS+ QSPD SSD NGFH++D LH+A FA+ GCCFW+PCLRSN S+SWWERIRAADNDDEWWLRGWKRFR+WSEIVAGPKWKTFIRQFHKNR+RQST+RYD
Subjt:  MSLAQSPDFSSDENGFHSND-LHDAVFATHGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLRGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYD

Query:  PLSYSLNFDEGPAREDPFTDDFVRRDFSTRFAAIPASAKSSMDLVKDSPSLI
        PLSY+LNFDEGPA +DPF++DF+RRDFS RFA+IPASAKSSMDL KD PS I
Subjt:  PLSYSLNFDEGPAREDPFTDDFVRRDFSTRFAAIPASAKSSMDLVKDSPSLI

A0A6J1JTQ3 uncharacterized protein LOC111487422 (e value: 5.4e-70; %identity: 80.26)
Query:  MSLAQSPDFSSDENGFHSND-LHDAVFATHGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLRGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYD
        MS++QSPD SSD NGFH++D LH+A FA+ GCCFW+PCLRSN S++WWERIRAADNDDEWWLRGWKRFR+WSEIVAGPKWKTFIRQFHKNR+RQST+RYD
Subjt:  MSLAQSPDFSSDENGFHSND-LHDAVFATHGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLRGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYD

Query:  PLSYSLNFDEGPAREDPFTDDFVRRDFSTRFAAIPASAKSSMDLVKDSPSLI
        PLSY+LNFD+GPA +DPF +DF+RRDFS RFA+IPASAKSSMDL KD PS I
Subjt:  PLSYSLNFDEGPAREDPFTDDFVRRDFSTRFAAIPASAKSSMDLVKDSPSLI

SwissProt top hits (e value, %identity, alignment)

No hits found

Arabidopsis top hits (e value, %identity, alignment)

AT3G01430.1 BEST Arabidopsis thaliana protein match is: NHL domain-containing protein (TAIR:AT5G14890.1) (e value: 1.5e-32; %identity: 43.9)
Query:  SNDLHDAVFATHGCCFWIPCLRSNSSQS-----WWERIRAADN---DDEWWLRGWKRFREWSEIVAGPKWKTFIRQFHKNR-------------------
        ++D+H+A+FA  GCCF +PCL S+   +     WW+RI   D    D+ WW+RGW+R REWSE+VAGP+WKT+IR+F ++                    
Subjt:  SNDLHDAVFATHGCCFWIPCLRSNSSQS-----WWERIRAADN---DDEWWLRGWKRFREWSEIVAGPKWKTFIRQFHKNR-------------------

Query:  ---NR---QSTFRYDPLSYSLNFDEGPAREDPFTDDFVRRDFSTRFAA--IPASAKSSMDLVKD
           NR   Q  FRYD LSYSLNFD+G  +   F D+F  RD+S RFAA  +P S K S+D   D
Subjt:  ---NR---QSTFRYDPLSYSLNFDEGPAREDPFTDDFVRRDFSTRFAA--IPASAKSSMDLVKD

AT3G48020.1 unknown protein (e value: 2.3e-20; %identity: 42.98)
Query:  CLRSNSSQSWWERI-RAADNDDEWWLRGWKRFREWSEIVAGPKWKTFIRQFHKNRNR------QSTFRYDPLSYSLNFDEGPAREDPFTDDFVRRDFSTR
        C  +    SWW+RI R    +  WW+R + + REWSEIVAGP+WKTFIR+F+++  R         FRYDP+SY+L+F++    +D        R FS R
Subjt:  CLRSNSSQSWWERI-RAADNDDEWWLRGWKRFREWSEIVAGPKWKTFIRQFHKNRNR------QSTFRYDPLSYSLNFDEGPAREDPFTDDFVRRDFSTR

Query:  FAAIP-ASAKS----SMDLVK
        +A++P AS KS    S+D VK
Subjt:  FAAIP-ASAKS----SMDLVK

AT5G14890.1 NHL domain-containing protein (e value: 1.8e-33; %identity: 47.13)
Query:  SNDLHDAVFATHGCCFWIPCLRSN-----SSQSWWERIRAADN---DDEWWLRGWKRFREWSEIVAGPKWKTFIRQFHK------------NRNRQSTFR
        ++++H+A+FA  GCCF +PCL S+     +   WW+RIR  D    D+ WW+ GW + REWSEIVAGPKWKTFIR+F +            NR    +FR
Subjt:  SNDLHDAVFATHGCCFWIPCLRSN-----SSQSWWERIRAADN---DDEWWLRGWKRFREWSEIVAGPKWKTFIRQFHK------------NRNRQSTFR

Query:  YDPLSYSLNFDEGPAREDPFTDDFVRRDFSTRFAA--IPASAKSSMDLVKD--SPSL
        YD  SYSLNFD+G  +   F D+F  RD+S RFAA  +P S K S+D   D  +PSL
Subjt:  YDPLSYSLNFDEGPAREDPFTDDFVRRDFSTRFAA--IPASAKSSMDLVKD--SPSL

AT5G25240.1 unknown protein (e value: 2.0e-08; %identity: 38.55)
Query:  SQSWWERIRAADNDDEWWLRGWKRFREWSEIVAGPKWKTFIRQF---HKNRNRQSTFRYDPLSYSLNFDEGPAREDPFTDDFV
        S+  W      +    W     K  +E SE +AGPKWK FIR F    K   R   F YD  +YSLNFD+G   +D   + FV
Subjt:  SQSWWERIRAADNDDEWWLRGWKRFREWSEIVAGPKWKTFIRQF---HKNRNRQSTFRYDPLSYSLNFDEGPAREDPFTDDFV

AT5G62865.1 unknown protein (e value: 3.0e-20; %identity: 44.72)
Query:  CCFWIPCLRSNSS----QSWWERIRAAD---------NDDEWWLRGWKRFREWSEIVAGPKWKTFIRQFHKNRNR------QSTFRYDPLSYSLNFDEGP
        CC +    RS SS     S W RIR  D         ++  WW+R   + REWSEIVAGP+WKTFIR+F+++  R         F+YDPLSYSLNFD+  
Subjt:  CCFWIPCLRSNSS----QSWWERIRAAD---------NDDEWWLRGWKRFREWSEIVAGPKWKTFIRQFHKNRNR------QSTFRYDPLSYSLNFDEGP

Query:  AREDPFTDDFVRRDFSTRFAAIP
          ED +      R FSTRFA++P
Subjt:  AREDPFTDDFVRRDFSTRFAAIP


Sequences

CDS sequence
ATGTCTCTAGCCCAATCTCCCGACTTCTCTTCCGACGAAAATGGCTTCCATAGCAATGACCTTCACGACGCCGTTTTCGCTACCCATGGTTGTTGTTTCTGGATC
CCTTGCCTGAGATCCAATTCCTCACAGTCATGGTGGGAGCGAATTAGGGCTGCCGATAACGACGATGAATGGTGGCTTAGAGGCTGGAAGAGATTTCGCGAATGG
TCTGAAATCGTCGCTGGCCCTAAATGGAAAACCTTCATTCGTCAATTTCATAAGAATCGAAACCGTCAATCTACTTTCCGTTACGATCCCCTCAGTTATTCTCTT
AATTTCGATGAAGGTCCGGCTCGTGAAGATCCTTTCACCGATGATTTTGTACGACGTGACTTCTCTACTCGATTCGCAGCCATTCCGGCTTCCGCTAAATCGTCT
ATGGACCTTGTTAAGGACAGTCCGTCCTTAATTTGA

mRNA sequence
TCCAAAATAGAGAACAGAAAAAAAGTGACTAAAACATCTGGCTAGTGGCATATATGGTAATAAAGATTCAATTTTTCCAGCCTATTTTCCGTTTCTATTTTCCCC
TAATTCTTCCATTTCTAATCACATAATTATATAATTAATTACGACAACAACACCTCCTCTTCCCCCTCCAAAAACAAAAAAAAAAAAACAAAAAGAAAAACCCTC
TCCTTTTCAGATTCATAGAGAATAGAGAAGAATCTCTGTTCGAATTGATTTGCAGGAATTGGATTCAATTTCTGCAAATCGATTCCAACACATCCAATTACCTTT
TTTGATGTCTCTAGCCCAATCTCCCGACTTCTCTTCCGACGAAAATGGCTTCCATAGCAATGACCTTCACGACGCCGTTTTCGCTACCCATGGTTGTTGTTTCTG
GATCCCTTGCCTGAGATCCAATTCCTCACAGTCATGGTGGGAGCGAATTAGGGCTGCCGATAACGACGATGAATGGTGGCTTAGAGGCTGGAAGAGATTTCGCGA
ATGGTCTGAAATCGTCGCTGGCCCTAAATGGAAAACCTTCATTCGTCAATTTCATAAGAATCGAAACCGTCAATCTACTTTCCGTTACGATCCCCTCAGTTATTC
TCTTAATTTCGATGAAGGTCCGGCTCGTGAAGATCCTTTCACCGATGATTTTGTACGACGTGACTTCTCTACTCGATTCGCAGCCATTCCGGCTTCCGCTAAATC
GTCTATGGACCTTGTTAAGGACAGTCCGTCCTTAATTTGAGTCCGGAGTTGCGGCGGAATAGACGGCATTGCCTTTGGAATCTCGGTTGGGTCTAGCCGGAGCCC
CCGTACGCGGTGGCCTTCGGAAAGATTCCGGCGGAGGAGCAGCGGTGGGGAGATTAGCCGTTTTGGGGACGATGGAAACTATGAATCTGAATTTGGGTTAATTTC
TGGCCATCGTAAAAAATTAAAAAAGTCGGTTTTTCTTTTCTTTTTTCTTTTTTCTTTTTTAAGGTTATTTTATTTTATTTTATTTATATATATAATATATATACT
T

Protein sequence
MSLAQSPDFSSDENGFHSNDLHDAVFATHGCCFWIPCLRSNSSQSWWERIRAADNDDEWWLRGWKRFREWSEIVAGPKWKTFIRQFHKNRNRQSTFRYDPLSYSL
NFDEGPAREDPFTDDFVRRDFSTRFAAIPASAKSSMDLVKDSPSLI
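
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code; the `translate` helper is hypothetical and is not part of CuGenDB. It is applied here only to the final line of the CDS as listed above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Final line of the CDS listed above, ending in the TGA stop codon.
cds_tail = "ATGGACCTTGTTAAGGACAGTCCGTCCTTAATTTGA"
print(translate(cds_tail))
```

The printed fragment should agree with the last residues of the protein sequence (`...MDLVKDSPSLI`); passing the full CDS, with its lines joined, allows the same comparison over the whole protein.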