; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G12860 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G12860
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionNHL domain-containing protein
Genome locationClcChr01:24602589..24603747
RNA-Seq ExpressionClc01G12860
SyntenyClc01G12860
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011660353.1 uncharacterized protein LOC105436380 [Cucumis sativus]1.6e-7788.17Show/hide
Query:  MGDRDGAFKELESIGEIDGADALLASKCYSCFCFPCFGPRRSASEELSWWERVKTKAKSTKFDSDD-RWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFN
        MGDRDGAFKELES   IDGADALL SK YSCFCFPCFGP RS S+ELSWWERVKTKAKSTKFDS+D  WWTGGIRSL+KLREWSEIVAGPRWKTFIRRFN
Subjt:  MGDRDGAFKELESIGEIDGADALLASKCYSCFCFPCFGPRRSASEELSWWERVKTKAKSTKFDSDD-RWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFN

Query:  RNRPATVKLGKFQYDTISYALNFDEGHNGDVDFEGDDYN-GGGFQNFSDRFAAV-PAPVKSLSAAAVNG
        RNRPATVKLGKFQYD ISYALNFDEGHNGDVDF+GD+YN GGGFQNFSDRFAA+ PAPVKS S+AAVNG
Subjt:  RNRPATVKLGKFQYDTISYALNFDEGHNGDVDFEGDDYN-GGGFQNFSDRFAAV-PAPVKSLSAAAVNG

XP_022923865.1 uncharacterized protein LOC111431456 [Cucurbita moschata]1.3e-7483.03Show/hide
Query:  MGDRDGAFKELESIGEIDGADALLASKCYSCFCFPCFGPRRSASEELSWWERVKTKAKSTKFDSDDRWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG FKE+ESIGEIDGADALLAS+ Y CFCFPCFG  RSAS+ELSWWER K KAKSTKFD +D WW+GGIRSL+KLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKELESIGEIDGADALLASKCYSCFCFPCFGPRRSASEELSWWERVKTKAKSTKFDSDDRWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDTISYALNFDEGHNGDVDFEGDDYNGGGFQNFSDRFAAVPAPVKSLSAAAVN
        NRPA VKLGKFQYD ISYALNFDEGHNGDVDFEGD+Y+ GGFQNFSDRF+AVPA  KS  AA  +
Subjt:  NRPATVKLGKFQYDTISYALNFDEGHNGDVDFEGDDYNGGGFQNFSDRFAAVPAPVKSLSAAAVN

XP_023000665.1 uncharacterized protein LOC111495035 [Cucurbita maxima]7.4e-7583.03Show/hide
Query:  MGDRDGAFKELESIGEIDGADALLASKCYSCFCFPCFGPRRSASEELSWWERVKTKAKSTKFDSDDRWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG FKE+ESIGEIDGADALLAS+ YSCFCFPCFG  RSAS+ELSWWER K KAKSTKFD +D WW+GGIRSL+KLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKELESIGEIDGADALLASKCYSCFCFPCFGPRRSASEELSWWERVKTKAKSTKFDSDDRWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDTISYALNFDEGHNGDVDFEGDDYNGGGFQNFSDRFAAVPAPVKSLSAAAVN
        NRPA VKLGKFQYD ISYALNFDEGHNGDV+FEGD+Y+ GGFQNFSDRF+AVPA  KS  AA  +
Subjt:  NRPATVKLGKFQYDTISYALNFDEGHNGDVDFEGDDYNGGGFQNFSDRFAAVPAPVKSLSAAAVN

XP_023519821.1 uncharacterized protein LOC111783153 [Cucurbita pepo subsp. pepo]4.3e-7583.64Show/hide
Query:  MGDRDGAFKELESIGEIDGADALLASKCYSCFCFPCFGPRRSASEELSWWERVKTKAKSTKFDSDDRWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG FKE+ESIGEIDGADALLAS+ YSCFCFPCFG  RSAS+ELSWWER K KAKSTKFD +D WW+GGIRSL+KLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKELESIGEIDGADALLASKCYSCFCFPCFGPRRSASEELSWWERVKTKAKSTKFDSDDRWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDTISYALNFDEGHNGDVDFEGDDYNGGGFQNFSDRFAAVPAPVKSLSAAAVN
        NRPA VKLGKFQYD ISYALNFDEGHNGDVDFEGD+Y+ GGFQNFSDRF+AVPA  KS  AA  +
Subjt:  NRPATVKLGKFQYDTISYALNFDEGHNGDVDFEGDDYNGGGFQNFSDRFAAVPAPVKSLSAAAVN

XP_038894780.1 uncharacterized protein LOC120083201 [Benincasa hispida]5.1e-8490.36Show/hide
Query:  MGDRDGAFKELESIGEIDGADALLASKCYSCFCFPCFGPRRSASEELSWWERVKTKAKSTKFDSDDRWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFNR
        MGDRDGAFKELESIGEIDGADALLASK YSCFCFPCFGPRRSAS+E+SWWERVKTKAKSTKFD +D WWTGGIRSL+KLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKELESIGEIDGADALLASKCYSCFCFPCFGPRRSASEELSWWERVKTKAKSTKFDSDDRWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDTISYALNFDEGHNGDVDFEGDDYNGGGFQNFSDRFAAVPAPVKSLSAAAVNG
        NRPATVKLGKFQYD ISYALNFD+GHNGDVDF+GD+Y+GGGFQNFSDRFAAVP PVKS  A AVNG
Subjt:  NRPATVKLGKFQYDTISYALNFDEGHNGDVDFEGDDYNGGGFQNFSDRFAAVPAPVKSLSAAAVNG

TrEMBL top hitse value%identityAlignment
A0A0A0LU30 Uncharacterized protein7.7e-7888.17Show/hide
Query:  MGDRDGAFKELESIGEIDGADALLASKCYSCFCFPCFGPRRSASEELSWWERVKTKAKSTKFDSDD-RWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFN
        MGDRDGAFKELES   IDGADALL SK YSCFCFPCFGP RS S+ELSWWERVKTKAKSTKFDS+D  WWTGGIRSL+KLREWSEIVAGPRWKTFIRRFN
Subjt:  MGDRDGAFKELESIGEIDGADALLASKCYSCFCFPCFGPRRSASEELSWWERVKTKAKSTKFDSDD-RWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFN

Query:  RNRPATVKLGKFQYDTISYALNFDEGHNGDVDFEGDDYN-GGGFQNFSDRFAAV-PAPVKSLSAAAVNG
        RNRPATVKLGKFQYD ISYALNFDEGHNGDVDF+GD+YN GGGFQNFSDRFAA+ PAPVKS S+AAVNG
Subjt:  RNRPATVKLGKFQYDTISYALNFDEGHNGDVDFEGDDYN-GGGFQNFSDRFAAV-PAPVKSLSAAAVNG

A0A5A7ULQ3 Uncharacterized protein1.4e-7485.88Show/hide
Query:  MGDRDGAFKELESIGEIDGADALLASKCYSCFCFPCFGPRRSASEELSWWERVKTKAKSTKFDSDDRWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFNR
        MGDRDGAFKELES   IDGADALLASK YSCFCFPCFGP RS S+ELSWWERVKTKAKSTKFD +D WWTGGIRSL+KLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKELESIGEIDGADALLASKCYSCFCFPCFGPRRSASEELSWWERVKTKAKSTKFDSDDRWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDTISYALNFDEGH-NGDVDFEGD-DY-NGGGFQNFSDRFAAV-PAPVKSLSAAAVNG
        NRPATVKLGKFQYD ISYALNFDEGH NGDVDFEG+ +Y  GGGFQNFSDRFAAV PAP+KS S+ A+NG
Subjt:  NRPATVKLGKFQYDTISYALNFDEGH-NGDVDFEGD-DY-NGGGFQNFSDRFAAV-PAPVKSLSAAAVNG

A0A5D3BLD0 Uncharacterized protein1.2e-7385.29Show/hide
Query:  MGDRDGAFKELESIGEIDGADALLASKCYSCFCFPCFGPRRSASEELSWWERVKTKAKSTKFDSDDRWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFNR
        MGDRDGAFKELES   IDGADALLASK YSCFCFPCFGP RS S+ELSWWERVKTKAKSTKFD +D WWTG IRSL+KLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKELESIGEIDGADALLASKCYSCFCFPCFGPRRSASEELSWWERVKTKAKSTKFDSDDRWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDTISYALNFDEGH-NGDVDFEGD-DY-NGGGFQNFSDRFAAV-PAPVKSLSAAAVNG
        NRPATVKLGKFQYD ISYALNFDEGH NGDVDFEG+ +Y  GGGFQNFSDRFAAV PAP+KS S+ A+NG
Subjt:  NRPATVKLGKFQYDTISYALNFDEGH-NGDVDFEGD-DY-NGGGFQNFSDRFAAV-PAPVKSLSAAAVNG

A0A6J1ED51 uncharacterized protein LOC1114314566.1e-7583.03Show/hide
Query:  MGDRDGAFKELESIGEIDGADALLASKCYSCFCFPCFGPRRSASEELSWWERVKTKAKSTKFDSDDRWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG FKE+ESIGEIDGADALLAS+ Y CFCFPCFG  RSAS+ELSWWER K KAKSTKFD +D WW+GGIRSL+KLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKELESIGEIDGADALLASKCYSCFCFPCFGPRRSASEELSWWERVKTKAKSTKFDSDDRWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDTISYALNFDEGHNGDVDFEGDDYNGGGFQNFSDRFAAVPAPVKSLSAAAVN
        NRPA VKLGKFQYD ISYALNFDEGHNGDVDFEGD+Y+ GGFQNFSDRF+AVPA  KS  AA  +
Subjt:  NRPATVKLGKFQYDTISYALNFDEGHNGDVDFEGDDYNGGGFQNFSDRFAAVPAPVKSLSAAAVN

A0A6J1KJ01 uncharacterized protein LOC1114950353.6e-7583.03Show/hide
Query:  MGDRDGAFKELESIGEIDGADALLASKCYSCFCFPCFGPRRSASEELSWWERVKTKAKSTKFDSDDRWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFNR
        MGDRDG FKE+ESIGEIDGADALLAS+ YSCFCFPCFG  RSAS+ELSWWER K KAKSTKFD +D WW+GGIRSL+KLREWSEIVAGPRWKTFIRRFNR
Subjt:  MGDRDGAFKELESIGEIDGADALLASKCYSCFCFPCFGPRRSASEELSWWERVKTKAKSTKFDSDDRWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFNR

Query:  NRPATVKLGKFQYDTISYALNFDEGHNGDVDFEGDDYNGGGFQNFSDRFAAVPAPVKSLSAAAVN
        NRPA VKLGKFQYD ISYALNFDEGHNGDV+FEGD+Y+ GGFQNFSDRF+AVPA  KS  AA  +
Subjt:  NRPATVKLGKFQYDTISYALNFDEGHNGDVDFEGDDYNGGGFQNFSDRFAAVPAPVKSLSAAAVN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G01430.1 BEST Arabidopsis thaliana protein match is: NHL domain-containing protein (TAIR:AT5G14890.1)1.5e-2036.05Show/hide
Query:  IGEIDGADAL---LASKCYSCFCFPCFGPRRSASEELS-WWERVKTKAKSTKFDSDDRWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFNRNR-------
        I E+D  D +   L +K   CF  PC    + ++   S WW+R+ T     K + D+RWW   IR  +++REWSE+VAGPRWKT+IRRF R+        
Subjt:  IGEIDGADAL---LASKCYSCFCFPCFGPRRSASEELS-WWERVKTKAKSTKFDSDDRWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFNRNR-------

Query:  ---------------PATVKLGKFQYDTISYALNFDEGH-NGDVDFEGDDYNGGGFQNFSDRFAAVPAPVKS
                         +   GKF+YD +SY+LNFD+G+  G  D   D++    ++++S RFAA   PV +
Subjt:  ---------------PATVKLGKFQYDTISYALNFDEGH-NGDVDFEGDDYNGGGFQNFSDRFAAVPAPVKS

AT3G48020.1 unknown protein1.4e-1844.83Show/hide
Query:  SASEELSWWERVKTKAKSTKFDSDDRWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFNRNRPATV---KLGKFQYDTISYALNFDEGHNGDVDFEGDDYN
        S + + SWW+R+           + RWW   +R+  K+REWSEIVAGPRWKTFIRRFNR+           KF+YD +SY L+F+     D D + DD  
Subjt:  SASEELSWWERVKTKAKSTKFDSDDRWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFNRNRPATV---KLGKFQYDTISYALNFDEGHNGDVDFEGDDYN

Query:  G-GGFQNFSDRFAAVP
        G GG ++FS R+A+VP
Subjt:  G-GGFQNFSDRFAAVP

AT5G14890.1 NHL domain-containing protein4.5e-2240.83Show/hide
Query:  FKELES--IGEIDGADAL---LASKCYSCFCFPCFGPRRSASEELS-WWERVKTKAKSTKFDSDDRWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFNR-
        F  LES  I E+D  D +   + +K   CF  PC G  + +    S WW+R++T     K + D+RWW  G     K+REWSEIVAGP+WKTFIRRF R 
Subjt:  FKELES--IGEIDGADAL---LASKCYSCFCFPCFGPRRSASEELS-WWERVKTKAKSTKFDSDDRWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFNR-

Query:  -----------NRPATVKLGKFQYDTISYALNFDEGHNGDVDFEGDDYNGGGFQNFSDRFAAVPAPVKS
                   NRP  V    F+YD+ SY+LNFD+G      FE D++    ++++S RFAA   PV +
Subjt:  -----------NRPATVKLGKFQYDTISYALNFDEGHNGDVDFEGDDYNGGGFQNFSDRFAAVPAPVKS

AT5G25240.1 unknown protein4.9e-0842.19Show/hide
Query:  DDRWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFNRNRPATVKLGKFQYDTISYALNFDEGHNG
        ++R    G   L+ L+E SE +AGP+WK FIR F+  R    +   F YD  +Y+LNFD+G +G
Subjt:  DDRWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFNRNRPATVKLGKFQYDTISYALNFDEGHNG

AT5G62865.1 unknown protein2.0e-2249.24Show/hide
Query:  CFCFPCFGPRRSASE-ELSWWERVKT---KAKSTKFDSDDRWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFNRNRPATVK----LGKFQYDTISYALNF
        C CFP F   RS++    S W R++T      S     + RWW   IR+  K+REWSEIVAGPRWKTFIRRFNR+ P   +      KFQYD +SY+LNF
Subjt:  CFCFPCFGPRRSASE-ELSWWERVKT---KAKSTKFDSDDRWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFNRNRPATVK----LGKFQYDTISYALNF

Query:  DEGHNGDVDFEGDDYNG-GGFQNFSDRFAAVP
        D+      D E D+Y G GG ++FS RFA+VP
Subjt:  DEGHNGDVDFEGDDYNG-GGFQNFSDRFAAVP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGACCGTGACGGAGCTTTCAAAGAATTAGAATCAATTGGAGAAATTGACGGCGCCGATGCTCTTTTAGCCTCTAAATGTTATAGTTGTTTTTGTTTCCCTTGCTT
TGGACCTCGCCGGTCGGCTTCCGAGGAGCTCTCATGGTGGGAACGGGTGAAGACGAAGGCAAAATCCACGAAGTTCGACAGCGATGATCGCTGGTGGACTGGCGGAATCA
GATCCCTCCAAAAGCTTCGTGAATGGTCGGAGATCGTCGCCGGTCCGAGATGGAAGACTTTCATTCGCCGGTTCAATAGGAACCGTCCGGCCACTGTGAAGCTCGGGAAA
TTCCAATATGATACTATAAGTTACGCTTTGAATTTCGACGAGGGGCACAATGGTGATGTGGATTTTGAAGGGGATGATTACAATGGTGGTGGGTTTCAGAATTTCTCCGA
CCGGTTTGCTGCCGTGCCGGCTCCTGTGAAGTCATTGTCGGCAGCGGCGGTGAATGGTTAG
mRNA sequenceShow/hide mRNA sequence
AAATCCCTTTCACGCCTCTGTGGCTTTTGGATAATACAACTACCGGCTAACAACTACTGGACCCCCCCACTTTACCGGCAGTCTCTGGTCGCCCACCGTTTCGTTCTTTC
CAATTCTACCCGTTGGCTCTATATACGGGAGCGGTATCCTTCTCCGAAAACCCTAATCTCTCTCTCTCTCTCTCTCTACGGCCATGGGAGACCGTGACGGAGCTTTCAAA
GAATTAGAATCAATTGGAGAAATTGACGGCGCCGATGCTCTTTTAGCCTCTAAATGTTATAGTTGTTTTTGTTTCCCTTGCTTTGGACCTCGCCGGTCGGCTTCCGAGGA
GCTCTCATGGTGGGAACGGGTGAAGACGAAGGCAAAATCCACGAAGTTCGACAGCGATGATCGCTGGTGGACTGGCGGAATCAGATCCCTCCAAAAGCTTCGTGAATGGT
CGGAGATCGTCGCCGGTCCGAGATGGAAGACTTTCATTCGCCGGTTCAATAGGAACCGTCCGGCCACTGTGAAGCTCGGGAAATTCCAATATGATACTATAAGTTACGCT
TTGAATTTCGACGAGGGGCACAATGGTGATGTGGATTTTGAAGGGGATGATTACAATGGTGGTGGGTTTCAGAATTTCTCCGACCGGTTTGCTGCCGTGCCGGCTCCTGT
GAAGTCATTGTCGGCAGCGGCGGTGAATGGTTAGGTTGCGGCGATGGCGGTGGCGGTCTTGTTTTCTCCGGTGAAGTTGAACGGACAGTTTCCGTCCGGTACTGGGCTTT
TGACGTATATTTTTCAACTAATGAGCTGGAAGTTGACCATTGGCAGTGAAGTAGGAAGCGCCAATGGCGGCGGTACGCGTTTTAAATAAGAGAGTTTACCATTTTTGTGG
GGTGACGTGTAGCAAAATGATTGGTGTTCATATTCTTATAGTTGATATAATTTTGTTTTATTTTTTCAATTAAATTGTATAATTCCAAATTCCAAATAGAGTTTTACTAT
AAATTCTTATGGTCGAAATTATAAATATATATTTTAAAATATCCAAATGGAGTATTGAATGTATTTCGGAGCCATTGTACATATTTATATCAGTTGAATTATATTATAAT
TAAACTATTATTATCATGAAATGTATATGGATGGAATTAAAAGTTATTCTTCTTCTTTA
Protein sequenceShow/hide protein sequence
MGDRDGAFKELESIGEIDGADALLASKCYSCFCFPCFGPRRSASEELSWWERVKTKAKSTKFDSDDRWWTGGIRSLQKLREWSEIVAGPRWKTFIRRFNRNRPATVKLGK
FQYDTISYALNFDEGHNGDVDFEGDDYNGGGFQNFSDRFAAVPAPVKSLSAAAVNG