; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc06G12950 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc06G12950
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionBEST Arabidopsis thaliana protein match is: NHL domain-containing protein .
Genome locationClcChr06:23874426..23874872
RNA-Seq ExpressionClc06G12950
SyntenyClc06G12950
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587852.1 hypothetical protein SDJN03_16417, partial [Cucurbita argyrosperma subsp. sororia]2.9e-6281.08Show/hide
Query:  MATHQTRPSSSPYSPLNEQQDEIQDIDDTIPSNGCGCFRLFGFGFNGNGNYEGRNLLQQNQRREEGSWMVKKLKKLKEVSEMVAGPKWKNFIRKMGGYLK
        MATHQTRP  +PYSPLNE+Q+ + D D+ +PSNGC CF+LFGFGFN NGNYE  NLLQQ + REE  WMVKKLKKLKEVSEMVAGPKWKNFIRKMGGYLK
Subjt:  MATHQTRPSSSPYSPLNEQQDEIQDIDDTIPSNGCGCFRLFGFGFNGNGNYEGRNLLQQNQRREEGSWMVKKLKKLKEVSEMVAGPKWKNFIRKMGGYLK

Query:  GKKQRNRFQYDPESYALNFDGGFDGEEDDHHPPIGFSTRFAVPLASRE
        GKKQRNRFQYDPESYALNFDGG DGE+D   PPIGFS+RFAVPLASRE
Subjt:  GKKQRNRFQYDPESYALNFDGGFDGEEDDHHPPIGFSTRFAVPLASRE

XP_008443900.1 PREDICTED: uncharacterized protein LOC103487383 [Cucumis melo]2.1e-6887.25Show/hide
Query:  MATHQTRPSSSPYSPLN-EQQDEIQDIDDTIPSNGCGCFRLFGFGFNGNGNYEGRNLLQQNQRREEGSWMVKKLKKLKEVSEMVAGPKWKNFIRKMGGYL
        MA+HQTRP S+PYSPLN +QQD++QDIDD+I SNGCGCF+LFGFG N N NYEG NLLQQ Q REE SWMVKKLKK+KEVSEMVAGPKWKNFIRKMGGYL
Subjt:  MATHQTRPSSSPYSPLN-EQQDEIQDIDDTIPSNGCGCFRLFGFGFNGNGNYEGRNLLQQNQRREEGSWMVKKLKKLKEVSEMVAGPKWKNFIRKMGGYL

Query:  KGKKQRNRFQYDPESYALNFDGGFDGEEDDHHPPIGFSTRFAVPLASRE
        KGKKQRNRFQYDPESYALNFDGGFDGEEDDHHPPIGFS+RFAVPLASRE
Subjt:  KGKKQRNRFQYDPESYALNFDGGFDGEEDDHHPPIGFSTRFAVPLASRE

XP_011648475.1 uncharacterized protein LOC105434480 [Cucumis sativus]2.1e-6885.81Show/hide
Query:  MATHQTRPSSSPYSPLNEQQDEIQDIDDTIPSNGCGCFRLFGFGFNGNGNYEGRNLLQQNQRREEGSWMVKKLKKLKEVSEMVAGPKWKNFIRKMGGYLK
        MATHQTRP S+PYSPL +QQD++QDIDD+I SNGCGCF+LFGFG N N NYEG NLLQQ Q REE SWMVK+LKK++EVSEMVAGPKWKNFIRKMGGYLK
Subjt:  MATHQTRPSSSPYSPLNEQQDEIQDIDDTIPSNGCGCFRLFGFGFNGNGNYEGRNLLQQNQRREEGSWMVKKLKKLKEVSEMVAGPKWKNFIRKMGGYLK

Query:  GKKQRNRFQYDPESYALNFDGGFDGEEDDHHPPIGFSTRFAVPLASRE
        GKK+RNRFQYDPESYALNFDGGFDGEEDDHHPPIGFS+RFAVPLASRE
Subjt:  GKKQRNRFQYDPESYALNFDGGFDGEEDDHHPPIGFSTRFAVPLASRE

XP_022927019.1 uncharacterized protein LOC111433973 [Cucurbita moschata]2.2e-6281.76Show/hide
Query:  MATHQTRPSSSPYSPLNEQQDEIQDIDDTIPSNGCGCFRLFGFGFNGNGNYEGRNLLQQNQRREEGSWMVKKLKKLKEVSEMVAGPKWKNFIRKMGGYLK
        MATHQTRP  +PYSPLNE+Q+ + D D+ IPSNGC CF+LFGFGFN NGNYE  NLLQQ + REE  WMVKKLKKLKEVSEMVAGPKWKNFIRKMGGYLK
Subjt:  MATHQTRPSSSPYSPLNEQQDEIQDIDDTIPSNGCGCFRLFGFGFNGNGNYEGRNLLQQNQRREEGSWMVKKLKKLKEVSEMVAGPKWKNFIRKMGGYLK

Query:  GKKQRNRFQYDPESYALNFDGGFDGEEDDHHPPIGFSTRFAVPLASRE
        GKKQRNRFQYDPESYALNFDGG DGE+D   PPIGFS+RFAVPLASRE
Subjt:  GKKQRNRFQYDPESYALNFDGGFDGEEDDHHPPIGFSTRFAVPLASRE

XP_038876901.1 uncharacterized protein LOC120069255 [Benincasa hispida]5.2e-7287.84Show/hide
Query:  MATHQTRPSSSPYSPLNEQQDEIQDIDDTIPSNGCGCFRLFGFGFNGNGNYEGRNLLQQNQRREEGSWMVKKLKKLKEVSEMVAGPKWKNFIRKMGGYLK
        MATHQTRP S+PYSPLNEQQD++QD+D+ +PSNGCGCFRLFGFGFN NGNYE RNLLQQ Q REE SWMV+KLKK+KEVSEMVAGPKWKNF+RKMGGYLK
Subjt:  MATHQTRPSSSPYSPLNEQQDEIQDIDDTIPSNGCGCFRLFGFGFNGNGNYEGRNLLQQNQRREEGSWMVKKLKKLKEVSEMVAGPKWKNFIRKMGGYLK

Query:  GKKQRNRFQYDPESYALNFDGGFDGEEDDHHPPIGFSTRFAVPLASRE
        GKKQRNRFQYDPESYALNFDGGFDGEED+HHPPIGFS+RFAVPLASRE
Subjt:  GKKQRNRFQYDPESYALNFDGGFDGEEDDHHPPIGFSTRFAVPLASRE

TrEMBL top hitse value%identityAlignment
A0A0A0LTG0 Uncharacterized protein9.9e-6985.81Show/hide
Query:  MATHQTRPSSSPYSPLNEQQDEIQDIDDTIPSNGCGCFRLFGFGFNGNGNYEGRNLLQQNQRREEGSWMVKKLKKLKEVSEMVAGPKWKNFIRKMGGYLK
        MATHQTRP S+PYSPL +QQD++QDIDD+I SNGCGCF+LFGFG N N NYEG NLLQQ Q REE SWMVK+LKK++EVSEMVAGPKWKNFIRKMGGYLK
Subjt:  MATHQTRPSSSPYSPLNEQQDEIQDIDDTIPSNGCGCFRLFGFGFNGNGNYEGRNLLQQNQRREEGSWMVKKLKKLKEVSEMVAGPKWKNFIRKMGGYLK

Query:  GKKQRNRFQYDPESYALNFDGGFDGEEDDHHPPIGFSTRFAVPLASRE
        GKK+RNRFQYDPESYALNFDGGFDGEEDDHHPPIGFS+RFAVPLASRE
Subjt:  GKKQRNRFQYDPESYALNFDGGFDGEEDDHHPPIGFSTRFAVPLASRE

A0A1S3B8N4 uncharacterized protein LOC1034873839.9e-6987.25Show/hide
Query:  MATHQTRPSSSPYSPLN-EQQDEIQDIDDTIPSNGCGCFRLFGFGFNGNGNYEGRNLLQQNQRREEGSWMVKKLKKLKEVSEMVAGPKWKNFIRKMGGYL
        MA+HQTRP S+PYSPLN +QQD++QDIDD+I SNGCGCF+LFGFG N N NYEG NLLQQ Q REE SWMVKKLKK+KEVSEMVAGPKWKNFIRKMGGYL
Subjt:  MATHQTRPSSSPYSPLN-EQQDEIQDIDDTIPSNGCGCFRLFGFGFNGNGNYEGRNLLQQNQRREEGSWMVKKLKKLKEVSEMVAGPKWKNFIRKMGGYL

Query:  KGKKQRNRFQYDPESYALNFDGGFDGEEDDHHPPIGFSTRFAVPLASRE
        KGKKQRNRFQYDPESYALNFDGGFDGEEDDHHPPIGFS+RFAVPLASRE
Subjt:  KGKKQRNRFQYDPESYALNFDGGFDGEEDDHHPPIGFSTRFAVPLASRE

A0A5A7U6Q0 Uncharacterized protein9.9e-6987.25Show/hide
Query:  MATHQTRPSSSPYSPLN-EQQDEIQDIDDTIPSNGCGCFRLFGFGFNGNGNYEGRNLLQQNQRREEGSWMVKKLKKLKEVSEMVAGPKWKNFIRKMGGYL
        MA+HQTRP S+PYSPLN +QQD++QDIDD+I SNGCGCF+LFGFG N N NYEG NLLQQ Q REE SWMVKKLKK+KEVSEMVAGPKWKNFIRKMGGYL
Subjt:  MATHQTRPSSSPYSPLN-EQQDEIQDIDDTIPSNGCGCFRLFGFGFNGNGNYEGRNLLQQNQRREEGSWMVKKLKKLKEVSEMVAGPKWKNFIRKMGGYL

Query:  KGKKQRNRFQYDPESYALNFDGGFDGEEDDHHPPIGFSTRFAVPLASRE
        KGKKQRNRFQYDPESYALNFDGGFDGEEDDHHPPIGFS+RFAVPLASRE
Subjt:  KGKKQRNRFQYDPESYALNFDGGFDGEEDDHHPPIGFSTRFAVPLASRE

A0A6J1EGU0 uncharacterized protein LOC1114339731.1e-6281.76Show/hide
Query:  MATHQTRPSSSPYSPLNEQQDEIQDIDDTIPSNGCGCFRLFGFGFNGNGNYEGRNLLQQNQRREEGSWMVKKLKKLKEVSEMVAGPKWKNFIRKMGGYLK
        MATHQTRP  +PYSPLNE+Q+ + D D+ IPSNGC CF+LFGFGFN NGNYE  NLLQQ + REE  WMVKKLKKLKEVSEMVAGPKWKNFIRKMGGYLK
Subjt:  MATHQTRPSSSPYSPLNEQQDEIQDIDDTIPSNGCGCFRLFGFGFNGNGNYEGRNLLQQNQRREEGSWMVKKLKKLKEVSEMVAGPKWKNFIRKMGGYLK

Query:  GKKQRNRFQYDPESYALNFDGGFDGEEDDHHPPIGFSTRFAVPLASRE
        GKKQRNRFQYDPESYALNFDGG DGE+D   PPIGFS+RFAVPLASRE
Subjt:  GKKQRNRFQYDPESYALNFDGGFDGEEDDHHPPIGFSTRFAVPLASRE

A0A6J1KR34 uncharacterized protein LOC1114968802.4e-6280.41Show/hide
Query:  MATHQTRPSSSPYSPLNEQQDEIQDIDDTIPSNGCGCFRLFGFGFNGNGNYEGRNLLQQNQRREEGSWMVKKLKKLKEVSEMVAGPKWKNFIRKMGGYLK
        MATHQTRP  +PYSPLNE+Q+ + D D+ +PSNGC CF+LFGFGFN NGNYE  NLLQQ + REE  WMVKKLKKLKEVSEMVAGPKWKNFIRKMGGYLK
Subjt:  MATHQTRPSSSPYSPLNEQQDEIQDIDDTIPSNGCGCFRLFGFGFNGNGNYEGRNLLQQNQRREEGSWMVKKLKKLKEVSEMVAGPKWKNFIRKMGGYLK

Query:  GKKQRNRFQYDPESYALNFDGGFDGEEDDHHPPIGFSTRFAVPLASRE
        G+KQRNRFQYDPESYALNFDGG DGE+D   PPIGFS+RFAVPLASRE
Subjt:  GKKQRNRFQYDPESYALNFDGGFDGEEDDHHPPIGFSTRFAVPLASRE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G01430.1 BEST Arabidopsis thaliana protein match is: NHL domain-containing protein (TAIR:AT5G14890.1)2.5e-0836.36Show/hide
Query:  WMVKKLKKLKEVSEMVAGPKWKNFIRKMG----------------GYLKGKKQRNR------FQYDPESYALNF-DGGFDGEEDDHHPPIGFSTRFAVP
        W ++  ++++E SE+VAGP+WK +IR+ G                G   G    NR      F+YD  SY+LNF DG   G  DD  P   +S RFA P
Subjt:  WMVKKLKKLKEVSEMVAGPKWKNFIRKMG----------------GYLKGKKQRNR------FQYDPESYALNF-DGGFDGEEDDHHPPIGFSTRFAVP

AT3G48020.1 unknown protein1.5e-0839.56Show/hide
Query:  EEGSWMVKKLKKLKEVSEMVAGPKWKNFIRKMG-GYLKGK--KQRNRFQYDPESYALNFDGGFDGEEDDHHPPIG----FSTRFA-VPLAS
        +E  W V+   K++E SE+VAGP+WK FIR+      +G+     ++F+YDP SY L+F+   + ++DD    +G    FS R+A VP+AS
Subjt:  EEGSWMVKKLKKLKEVSEMVAGPKWKNFIRKMG-GYLKGK--KQRNRFQYDPESYALNFDGGFDGEEDDHHPPIG----FSTRFA-VPLAS

AT5G14890.1 NHL domain-containing protein3.0e-0932.87Show/hide
Query:  PLNEQQDEIQDIDDTIPSNGCGCFRLFGFGFN----GNGNYEGRNLLQQNQRREEGSWMVKKLKKLKEVSEMVAGPKWKNFIRKMG------GYLKG---
        P++E     +  +      GC CF L   G +     NG+   + +   ++   +  W V    K++E SE+VAGPKWK FIR+ G      G + G   
Subjt:  PLNEQQDEIQDIDDTIPSNGCGCFRLFGFGFN----GNGNYEGRNLLQQNQRREEGSWMVKKLKKLKEVSEMVAGPKWKNFIRKMG------GYLKG---

Query:  KKQRNRFQYDPESYALNFDGGFD-GEEDDHHPPIGFSTRFAVP
        + +   F+YD  SY+LNFD G   G  +D  P   +S RFA P
Subjt:  KKQRNRFQYDPESYALNFDGGFD-GEEDDHHPPIGFSTRFAVP

AT5G25240.1 unknown protein6.3e-1543.12Show/hide
Query:  DIDDTIPSNGCGCFRLFGFGFNGNGNYEGRN------LLQQNQRREEGSWMVKKLKKLKEVSEMVAGPKWKNFIRKMGGYLKGKKQRNRFQYDPESYALN
        D ++T    GCG FR F F     G+ E R+       LQ+ +R   G+W  +KLK LKE+SE +AGPKWKNFIR      K  ++   F YD ++Y+LN
Subjt:  DIDDTIPSNGCGCFRLFGFGFNGNGNYEGRN------LLQQNQRREEGSWMVKKLKKLKEVSEMVAGPKWKNFIRKMGGYLKGKKQRNRFQYDPESYALN

Query:  FDGGFDGEE
        FD G DG++
Subjt:  FDGGFDGEE

AT5G62865.1 unknown protein2.7e-1035.62Show/hide
Query:  QQDEIQDIDDTIPSNGCGCFRLF-------GFGFNGNGNYEGRNLLQQNQRREEGS---WMVKKLKKLKEVSEMVAGPKWKNFIRKMGGYLKGKKQ---R
        Q D  +D  DT     C CF  F         G++  G    R +   N   + G    W ++   K++E SE+VAGP+WK FIR+     +  +     
Subjt:  QQDEIQDIDDTIPSNGCGCFRLF-------GFGFNGNGNYEGRNLLQQNQRREEGS---WMVKKLKKLKEVSEMVAGPKWKNFIRKMGGYLKGKKQ---R

Query:  NRFQYDPESYALNFDGGFDGEEDDHHPPIG----FSTRFA-VPLAS
         +FQYDP SY+LNFD   D +E+D +  +G    FSTRFA VP+ S
Subjt:  NRFQYDPESYALNFDGGFDGEEDDHHPPIG----FSTRFA-VPLAS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAACCCATCAAACTAGACCATCTTCTTCCCCTTATTCACCTCTAAACGAGCAACAGGACGAAATTCAAGACATCGACGATACCATTCCCTCAAATGGGTGT
GGCTGTTTTCGGCTATTCGGGTTCGGATTCAATGGGAATGGCAATTACGAAGGAAGAAATCTTCTGCAGCAAAACCAGCGGCGGGAAGAGGGGAGTTGGATGGTG
AAGAAATTGAAGAAGTTGAAGGAGGTTTCAGAAATGGTGGCTGGGCCTAAATGGAAGAATTTCATTAGAAAAATGGGTGGATATTTGAAAGGGAAGAAACAGAGG
AATAGATTTCAGTATGACCCAGAAAGCTATGCTCTGAATTTCGATGGCGGTTTTGATGGAGAAGAAGATGATCATCATCCACCAATTGGGTTTTCTACGAGGTTT
GCTGTGCCTTTGGCTTCTAGGGAATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCAACCCATCAAACTAGACCATCTTCTTCCCCTTATTCACCTCTAAACGAGCAACAGGACGAAATTCAAGACATCGACGATACCATTCCCTCAAATGGGTGT
GGCTGTTTTCGGCTATTCGGGTTCGGATTCAATGGGAATGGCAATTACGAAGGAAGAAATCTTCTGCAGCAAAACCAGCGGCGGGAAGAGGGGAGTTGGATGGTG
AAGAAATTGAAGAAGTTGAAGGAGGTTTCAGAAATGGTGGCTGGGCCTAAATGGAAGAATTTCATTAGAAAAATGGGTGGATATTTGAAAGGGAAGAAACAGAGG
AATAGATTTCAGTATGACCCAGAAAGCTATGCTCTGAATTTCGATGGCGGTTTTGATGGAGAAGAAGATGATCATCATCCACCAATTGGGTTTTCTACGAGGTTT
GCTGTGCCTTTGGCTTCTAGGGAATAA
Protein sequenceShow/hide protein sequence
MATHQTRPSSSPYSPLNEQQDEIQDIDDTIPSNGCGCFRLFGFGFNGNGNYEGRNLLQQNQRREEGSWMVKKLKKLKEVSEMVAGPKWKNFIRKMGGYLKGKKQR
NRFQYDPESYALNFDGGFDGEEDDHHPPIGFSTRFAVPLASRE