; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG06G016870 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG06G016870
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionBEST Arabidopsis thaliana protein match is: NHL domain-containing protein .
Genome locationCG_Chr06:30005829..30006329
RNA-Seq ExpressionClCG06G016870
SyntenyClCG06G016870
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0057881.1 putative Serine/arginine repetitive matrix protein 2 [Cucumis melo var. makuwa]3.5e-8590.86Show/hide
Query:  MAFTDV--EESPPCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRS--
        MAFTDV  EESP CEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGV+ +LKLREWSEIVAGPRWKTFIRRFNRNRS  
Subjt:  MAFTDV--EESPPCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRS--

Query:  -----GGGGGGGVRAGKFQYDPLSYAMNFDEGSRQIGELEDDIDDFNGYRNFSARYASIPAPLKTGGTKDVSAVA
             GGGGG GVRAGKFQYDPLSYAMNFDEGSRQIGEL+DD+DDFNGYRNFSARYASIPAPLKTGGTKDVSAVA
Subjt:  -----GGGGGGGVRAGKFQYDPLSYAMNFDEGSRQIGELEDDIDDFNGYRNFSARYASIPAPLKTGGTKDVSAVA

XP_008453108.1 PREDICTED: uncharacterized protein LOC103493921 [Cucumis melo]2.1e-8591.91Show/hide
Query:  MAFTDV--EESPPCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRS--
        MAFTDV  EESP CEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGV+ +LKLREWSEIVAGPRWKTFIRRFNRNRS  
Subjt:  MAFTDV--EESPPCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRS--

Query:  ---GGGGGGGVRAGKFQYDPLSYAMNFDEGSRQIGELEDDIDDFNGYRNFSARYASIPAPLKTGGTKDVSAVA
           GGGGG GVRAGKFQYDPLSYAMNFDEGSRQIGEL+DD+DDFNGYRNFSARYASIPAPLKTGGTKDVSAVA
Subjt:  ---GGGGGGGVRAGKFQYDPLSYAMNFDEGSRQIGELEDDIDDFNGYRNFSARYASIPAPLKTGGTKDVSAVA

XP_011657987.1 uncharacterized protein LOC105435925 [Cucumis sativus]2.3e-8490.29Show/hide
Query:  MAFTDV-EESPPCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNR----
        MAFTDV EESP C+ADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVH+EGRWW RGVR LLKLREWSEIVAGPRWKTFIRRFNRNR    
Subjt:  MAFTDV-EESPPCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNR----

Query:  ----SGGGGGGGVRAGKFQYDPLSYAMNFDEGSRQIGELEDDIDDFNGYRNFSARYASIPAPLKTGGTKDVSAVA
            SGGGGG GVRAGKFQYDPLSYAMNFDEGSRQIGEL+DDIDDFNGYRNFSARYASIPAPLKTGGTK+VSAVA
Subjt:  ----SGGGGGGGVRAGKFQYDPLSYAMNFDEGSRQIGELEDDIDDFNGYRNFSARYASIPAPLKTGGTKDVSAVA

XP_022988416.1 uncharacterized protein LOC111485666 [Cucurbita maxima]3.5e-7783.63Show/hide
Query:  MAFTDVEESPPCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG
        MA+ D EE PP EAD SNSLLPT RCGCFRLSC  SRR   VGPSWWERIR SQVHSEGRWWARGVR +LK+REWSEIVAGPRWKTFIRRFNRNR+GGGG
Subjt:  MAFTDVEESPPCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG

Query:  GGGV---RAGKFQYDPLSYAMNFDEGSRQIGELEDDIDDFNGYRNFSARYASIPAPLKTGGT--KDVSAVA
        GGG    R GKFQYDPLSYAMNFDEGSRQIGEL+DD DDFNGYRNFSARYASIPAPLKTGGT  KD+S VA
Subjt:  GGGV---RAGKFQYDPLSYAMNFDEGSRQIGELEDDIDDFNGYRNFSARYASIPAPLKTGGT--KDVSAVA

XP_038878924.1 uncharacterized protein LOC120071012 [Benincasa hispida]4.2e-8694.64Show/hide
Query:  MAFTDVEESPPCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG
        MAFTDVEE  PCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWER+RASQVHSEGRWWARGVR LLKLREWSEIVAGPRWKTFIRRFNRNRS GGG
Subjt:  MAFTDVEESPPCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG

Query:  GGGVRAGKFQYDPLSYAMNFDEGSRQIGELEDDIDDFNGYRNFSARYASIPAPLKTGGT--KDVSAVA
        GGGVRAGKFQYDPLSYAMNFDEGSRQIGEL+DDIDDFNGYRNFSARYASIPAPLKTGGT  KDVS+VA
Subjt:  GGGVRAGKFQYDPLSYAMNFDEGSRQIGELEDDIDDFNGYRNFSARYASIPAPLKTGGT--KDVSAVA

TrEMBL top hitse value%identityAlignment
A0A0A0LUB2 Uncharacterized protein1.1e-8490.29Show/hide
Query:  MAFTDV-EESPPCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNR----
        MAFTDV EESP C+ADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVH+EGRWW RGVR LLKLREWSEIVAGPRWKTFIRRFNRNR    
Subjt:  MAFTDV-EESPPCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNR----

Query:  ----SGGGGGGGVRAGKFQYDPLSYAMNFDEGSRQIGELEDDIDDFNGYRNFSARYASIPAPLKTGGTKDVSAVA
            SGGGGG GVRAGKFQYDPLSYAMNFDEGSRQIGEL+DDIDDFNGYRNFSARYASIPAPLKTGGTK+VSAVA
Subjt:  ----SGGGGGGGVRAGKFQYDPLSYAMNFDEGSRQIGELEDDIDDFNGYRNFSARYASIPAPLKTGGTKDVSAVA

A0A1S3BVG0 uncharacterized protein LOC1034939211.0e-8591.91Show/hide
Query:  MAFTDV--EESPPCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRS--
        MAFTDV  EESP CEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGV+ +LKLREWSEIVAGPRWKTFIRRFNRNRS  
Subjt:  MAFTDV--EESPPCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRS--

Query:  ---GGGGGGGVRAGKFQYDPLSYAMNFDEGSRQIGELEDDIDDFNGYRNFSARYASIPAPLKTGGTKDVSAVA
           GGGGG GVRAGKFQYDPLSYAMNFDEGSRQIGEL+DD+DDFNGYRNFSARYASIPAPLKTGGTKDVSAVA
Subjt:  ---GGGGGGGVRAGKFQYDPLSYAMNFDEGSRQIGELEDDIDDFNGYRNFSARYASIPAPLKTGGTKDVSAVA

A0A5A7UWM1 Putative Serine/arginine repetitive matrix protein 21.7e-8590.86Show/hide
Query:  MAFTDV--EESPPCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRS--
        MAFTDV  EESP CEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGV+ +LKLREWSEIVAGPRWKTFIRRFNRNRS  
Subjt:  MAFTDV--EESPPCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRS--

Query:  -----GGGGGGGVRAGKFQYDPLSYAMNFDEGSRQIGELEDDIDDFNGYRNFSARYASIPAPLKTGGTKDVSAVA
             GGGGG GVRAGKFQYDPLSYAMNFDEGSRQIGEL+DD+DDFNGYRNFSARYASIPAPLKTGGTKDVSAVA
Subjt:  -----GGGGGGGVRAGKFQYDPLSYAMNFDEGSRQIGELEDDIDDFNGYRNFSARYASIPAPLKTGGTKDVSAVA

A0A5D3BK38 Putative Serine/arginine repetitive matrix protein 21.0e-8591.91Show/hide
Query:  MAFTDV--EESPPCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRS--
        MAFTDV  EESP CEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGV+ +LKLREWSEIVAGPRWKTFIRRFNRNRS  
Subjt:  MAFTDV--EESPPCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRS--

Query:  ---GGGGGGGVRAGKFQYDPLSYAMNFDEGSRQIGELEDDIDDFNGYRNFSARYASIPAPLKTGGTKDVSAVA
           GGGGG GVRAGKFQYDPLSYAMNFDEGSRQIGEL+DD+DDFNGYRNFSARYASIPAPLKTGGTKDVSAVA
Subjt:  ---GGGGGGGVRAGKFQYDPLSYAMNFDEGSRQIGELEDDIDDFNGYRNFSARYASIPAPLKTGGTKDVSAVA

A0A6J1JH52 uncharacterized protein LOC1114856661.7e-7783.63Show/hide
Query:  MAFTDVEESPPCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG
        MA+ D EE PP EAD SNSLLPT RCGCFRLSC  SRR   VGPSWWERIR SQVHSEGRWWARGVR +LK+REWSEIVAGPRWKTFIRRFNRNR+GGGG
Subjt:  MAFTDVEESPPCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRSGGGG

Query:  GGGV---RAGKFQYDPLSYAMNFDEGSRQIGELEDDIDDFNGYRNFSARYASIPAPLKTGGT--KDVSAVA
        GGG    R GKFQYDPLSYAMNFDEGSRQIGEL+DD DDFNGYRNFSARYASIPAPLKTGGT  KD+S VA
Subjt:  GGGV---RAGKFQYDPLSYAMNFDEGSRQIGELEDDIDDFNGYRNFSARYASIPAPLKTGGT--KDVSAVA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G01430.1 BEST Arabidopsis thaliana protein match is: NHL domain-containing protein (TAIR:AT5G14890.1)2.3e-2640.7Show/hide
Query:  ESPP-CEADKSNSL---LPTARCGCFRLSCFGSRRVATVGPS-WWERI-RASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGG
        +SPP  E D ++ +   L   R  CF + C  S + +T G S WW+RI    ++  + RWW RG R   ++REWSE+VAGPRWKT+IRRF R+   GGGG
Subjt:  ESPP-CEADKSNSL---LPTARCGCFRLSCFGSRRVATVGPS-WWERI-RASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGG

Query:  GGV-----------------RAGKFQYDPLSYAMNFDEGSRQIGELEDDIDDFNGYRNFSARYASIPAPLKT
        G V                   GKF+YD LSY++NFD+G+ Q G  +D+      YR++S R+A+   P+ T
Subjt:  GGV-----------------RAGKFQYDPLSYAMNFDEGSRQIGELEDDIDDFNGYRNFSARYASIPAPLKT

AT3G48020.1 unknown protein2.8e-2452.25Show/hide
Query:  TVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGGGGVRAGKFQYDPLSYAMNFDEGSRQIGELEDDIDDFNGY
        TV  SWW+RI  +  H E RWW   VRA LK+REWSEIVAGPRWKTFIRRFNR+   G       + KF+YDP+SY ++F++  +     +DD     G 
Subjt:  TVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGGGGVRAGKFQYDPLSYAMNFDEGSRQIGELEDDIDDFNGY

Query:  RNFSARYASIP
        R+FS RYAS+P
Subjt:  RNFSARYASIP

AT5G14890.1 NHL domain-containing protein9.8e-2541.72Show/hide
Query:  FTDVEESPPCEADKSNSL---LPTARCGCFRLSCFGSRRVA-TVGPSWWERIR-ASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNR--
        F  +E  P  E D ++ +   +   R  CF L C GS + +   G  WW+RIR   ++  + RWW  G    +K+REWSEIVAGP+WKTFIRRF RN   
Subjt:  FTDVEESPPCEADKSNSL---LPTARCGCFRLSCFGSRRVA-TVGPSWWERIR-ASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNR--

Query:  SGGGGGGGVRAG--KFQYDPLSYAMNFDEGSRQIGELEDDIDDFNGYRNFSARYASIPAPLKT
        +GG  GG  R     F+YD  SY++NFD+G +Q G  ED+      YR++S R+A+   P+ T
Subjt:  SGGGGGGGVRAG--KFQYDPLSYAMNFDEGSRQIGELEDDIDDFNGYRNFSARYASIPAPLKT

AT5G25240.1 unknown protein1.3e-0540Show/hide
Query:  QVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGGGGVRAGKFQYDPLSYAMNFDEG
        Q    G W   G   L  L+E SE +AGP+WK FIR F+  R         R   F YD  +Y++NFD+G
Subjt:  QVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGGGGVRAGKFQYDPLSYAMNFDEG

AT5G62865.1 unknown protein6.3e-2450.75Show/hide
Query:  RCGCFRLSCFGSRRVATVGPSWWERIRA--SQVHS-----EGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGGGGVRAGKFQYDPLSYA
        RC CF  S   SR    VG S W RIR      HS     E RWW   +RA LK+REWSEIVAGPRWKTFIRRFNR+   G       + KFQYDPLSY+
Subjt:  RCGCFRLSCFGSRRVATVGPSWWERIRA--SQVHS-----EGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGGGGVRAGKFQYDPLSYA

Query:  MNFDEGSRQIGELEDDIDDFNGYRNFSARYASIP
        +NFD+        ED+     G R+FS R+AS+P
Subjt:  MNFDEGSRQIGELEDDIDDFNGYRNFSARYASIP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTTTACTGATGTAGAAGAGTCTCCTCCTTGCGAAGCCGATAAGTCCAATTCTTTGTTGCCGACCGCCAGGTGCGGTTGTTTCCGGTTATCGTGTTTTGGATCTCG
CCGTGTGGCCACCGTCGGACCGTCGTGGTGGGAGCGGATTAGGGCATCGCAGGTTCATAGCGAGGGGCGCTGGTGGGCGCGAGGCGTTAGGGCTCTTTTGAAGCTCCGAG
AATGGTCGGAGATCGTCGCTGGACCGAGATGGAAGACTTTCATTCGCCGATTTAATCGGAATCGGAGTGGTGGTGGTGGTGGAGGTGGTGTTAGGGCTGGGAAATTTCAA
TACGATCCTTTGAGTTACGCTATGAATTTCGATGAAGGTTCGAGGCAGATTGGAGAATTGGAGGACGATATCGACGACTTCAATGGTTATCGGAACTTCTCTGCTCGTTA
CGCTTCAATTCCGGCGCCGTTGAAGACCGGAGGAACCAAGGATGTTTCTGCCGTCGCGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGTTTACTGATGTAGAAGAGTCTCCTCCTTGCGAAGCCGATAAGTCCAATTCTTTGTTGCCGACCGCCAGGTGCGGTTGTTTCCGGTTATCGTGTTTTGGATCTCG
CCGTGTGGCCACCGTCGGACCGTCGTGGTGGGAGCGGATTAGGGCATCGCAGGTTCATAGCGAGGGGCGCTGGTGGGCGCGAGGCGTTAGGGCTCTTTTGAAGCTCCGAG
AATGGTCGGAGATCGTCGCTGGACCGAGATGGAAGACTTTCATTCGCCGATTTAATCGGAATCGGAGTGGTGGTGGTGGTGGAGGTGGTGTTAGGGCTGGGAAATTTCAA
TACGATCCTTTGAGTTACGCTATGAATTTCGATGAAGGTTCGAGGCAGATTGGAGAATTGGAGGACGATATCGACGACTTCAATGGTTATCGGAACTTCTCTGCTCGTTA
CGCTTCAATTCCGGCGCCGTTGAAGACCGGAGGAACCAAGGATGTTTCTGCCGTCGCGTAA
Protein sequenceShow/hide protein sequence
MAFTDVEESPPCEADKSNSLLPTARCGCFRLSCFGSRRVATVGPSWWERIRASQVHSEGRWWARGVRALLKLREWSEIVAGPRWKTFIRRFNRNRSGGGGGGGVRAGKFQ
YDPLSYAMNFDEGSRQIGELEDDIDDFNGYRNFSARYASIPAPLKTGGTKDVSAVA