; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc08G02260 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc08G02260
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptionprotein SET DOMAIN GROUP 41 isoform X1
Genome locationClcChr08:4321674..4322354
RNA-Seq ExpressionClc08G02260
SyntenyClc08G02260
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035009.1 protein SET DOMAIN GROUP 41 isoform X1 [Cucumis melo var. makuwa]2.5e-9778.95Show/hide
Query:  MDNNDENRRDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLTLARHISLWA-TTNFSKWGFPVGRRMCSNCSWVDKFNASRILGRP
        MD ++ENR +A TMS+TSAAY+LFLAGATHHLFL EPSLIASAANCWVVAGESLL LARH SLWA TTN S WGFP+G+RMCSNCSWVD+FN SRI GR 
Subjt:  MDNNDENRRDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLTLARHISLWA-TTNFSKWGFPVGRRMCSNCSWVDKFNASRILGRP

Query:  IEADFREFSC-ISNCIANMSQKTWSFLTHGCPYLKAFTDPFDFSWPKTIMTYSSDRDIGAHSIDRSCVRSKTKDVCFQSEPQHSNQERESIIGLGIHCLV
        I+ADFREFS  ISNCIA++S+K WSFLTHGCPYLKAFTDPFDFSWPKT     +D DIG H IDRSC  SKTKD+CF+ EPQ SNQERESI GLGIHCL 
Subjt:  IEADFREFSC-ISNCIANMSQKTWSFLTHGCPYLKAFTDPFDFSWPKTIMTYSSDRDIGAHSIDRSCVRSKTKDVCFQSEPQHSNQERESIIGLGIHCLV

Query:  YGGYLASIFYGHHSHLASQIQNILHDLD
        YGGYLASI YG+HSHLASQIQNIL+DL+
Subjt:  YGGYLASIFYGHHSHLASQIQNILHDLD

XP_008463080.1 PREDICTED: protein SET DOMAIN GROUP 41 isoform X1 [Cucumis melo]2.5e-9778.95Show/hide
Query:  MDNNDENRRDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLTLARHISLWA-TTNFSKWGFPVGRRMCSNCSWVDKFNASRILGRP
        MD ++ENR +A TMS+TSAAY+LFLAGATHHLFL EPSLIASAANCWVVAGESLL LARH SLWA TTN S WGFP+G+RMCSNCSWVD+FN SRI GR 
Subjt:  MDNNDENRRDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLTLARHISLWA-TTNFSKWGFPVGRRMCSNCSWVDKFNASRILGRP

Query:  IEADFREFSC-ISNCIANMSQKTWSFLTHGCPYLKAFTDPFDFSWPKTIMTYSSDRDIGAHSIDRSCVRSKTKDVCFQSEPQHSNQERESIIGLGIHCLV
        I+ADFREFS  ISNCIA++S+K WSFLTHGCPYLKAFTDPFDFSWPKT     +D DIG H IDRSC  SKTKD+CF+ EPQ SNQERESI GLGIHCL 
Subjt:  IEADFREFSC-ISNCIANMSQKTWSFLTHGCPYLKAFTDPFDFSWPKTIMTYSSDRDIGAHSIDRSCVRSKTKDVCFQSEPQHSNQERESIIGLGIHCLV

Query:  YGGYLASIFYGHHSHLASQIQNILHDLD
        YGGYLASI YG+HSHLASQIQNIL+DL+
Subjt:  YGGYLASIFYGHHSHLASQIQNILHDLD

XP_011656459.1 protein SET DOMAIN GROUP 41 [Cucumis sativus]1.4e-8973.68Show/hide
Query:  MDNNDENRRDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLTLARHISLWA-TTNFSKWGFPVGRRMCSNCSWVDKFNASRILGRP
        MD ++ NR +A TM +TSAAY+LFLAGATH LFL EPSL+ASAANCWVVAGESLL LARH SLWA TTN S W FP+G+RMC NCSWVD+FNASRI G+P
Subjt:  MDNNDENRRDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLTLARHISLWA-TTNFSKWGFPVGRRMCSNCSWVDKFNASRILGRP

Query:  IEADFREFSC-ISNCIANMSQKTWSFLTHGCPYLKAFTDPFDFSWPKTIMTYSSDRDIGAHSIDRSCVRSKTKDVCFQSEPQHSNQERESIIGLGIHCLV
        ++ADFREFS  ISNCIA++SQK WS LTHGCPYLKAFT PFDFSWPKT     +++DI    ID SC  SKT+DVC + +PQ SNQERESI GLGIHCL 
Subjt:  IEADFREFSC-ISNCIANMSQKTWSFLTHGCPYLKAFTDPFDFSWPKTIMTYSSDRDIGAHSIDRSCVRSKTKDVCFQSEPQHSNQERESIIGLGIHCLV

Query:  YGGYLASIFYGHHSHLASQIQNILHDLD
        YGGYLASI YGHHSHLASQIQNIL+DL+
Subjt:  YGGYLASIFYGHHSHLASQIQNILHDLD

XP_023520942.1 protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita pepo subsp. pepo]1.1e-7671.11Show/hide
Query:  NNDENRRDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLTLARHISLWATTNFSKWGFPVGRRMCSNCSWVDKFNASRILGRPIEA
        N DEN+ +A TMS+TSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLL L +H SLW  +N SK   P+G   C NCSWVDKFN SRI GR IEA
Subjt:  NNDENRRDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLTLARHISLWATTNFSKWGFPVGRRMCSNCSWVDKFNASRILGRPIEA

Query:  DFREFSC-ISNCIANMSQKTWSFLTHGCPYLKAFTDPFDFSWPKTIMTYSSDRDIGAHSIDRSCVRSKTKDVCFQSEPQHSNQERESIIGLGIHCLVYGG
        DFREFS  ISNCIAN+SQK WSFL H C YLKAFTDPFDFSWPKTI T S+ R       DRSC  SK +DV        S+Q+R+SI  LGIHCL YGG
Subjt:  DFREFSC-ISNCIANMSQKTWSFLTHGCPYLKAFTDPFDFSWPKTIMTYSSDRDIGAHSIDRSCVRSKTKDVCFQSEPQHSNQERESIIGLGIHCLVYGG

Query:  YLASIFYGHHSHLASQIQNILHDLD
        YLASI YGHHSHLASQIQ ILHD++
Subjt:  YLASIFYGHHSHLASQIQNILHDLD

XP_038886411.1 protein SET DOMAIN GROUP 41 [Benincasa hispida]2.3e-10381.94Show/hide
Query:  MDNNDENRRDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLTLARHISLWATTNFSKWGFPVGRRMCSNCSWVDKFNASRILGRPI
        MD ++E++ +ASTM + SAAYSLFLAGATHHLFLSEPSLI SA+ CWV+AGESLLTLARH  LWATTN SKWGFPVG+RMCS CSWVDKFNASRI G+PI
Subjt:  MDNNDENRRDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLTLARHISLWATTNFSKWGFPVGRRMCSNCSWVDKFNASRILGRPI

Query:  EADFREFSC-ISNCIANMSQKTWSFLTHGCPYLKAFTDPFDFSWPKTIMTYSSDRDIGAHSIDRSCVRSKTKDVCFQSEPQHSNQERESIIGLGIHCLVY
        EADFREFS  ISNCIANMS+K+WSFLTHGCPYLKAFTDPF+FSWPK I  YSSDRDI AHSIDR C  S +KDVCFQ EPQHSNQERESI+GLGIHCL Y
Subjt:  EADFREFSC-ISNCIANMSQKTWSFLTHGCPYLKAFTDPFDFSWPKTIMTYSSDRDIGAHSIDRSCVRSKTKDVCFQSEPQHSNQERESIIGLGIHCLVY

Query:  GGYLASIFYGHHSHLASQIQNILHDLD
        GGYLASI YGHHSHLASQIQNIL+DL+
Subjt:  GGYLASIFYGHHSHLASQIQNILHDLD

TrEMBL top hitse value%identityAlignment
A0A0A0KAK3 SET domain-containing protein7.0e-9073.68Show/hide
Query:  MDNNDENRRDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLTLARHISLWA-TTNFSKWGFPVGRRMCSNCSWVDKFNASRILGRP
        MD ++ NR +A TM +TSAAY+LFLAGATH LFL EPSL+ASAANCWVVAGESLL LARH SLWA TTN S W FP+G+RMC NCSWVD+FNASRI G+P
Subjt:  MDNNDENRRDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLTLARHISLWA-TTNFSKWGFPVGRRMCSNCSWVDKFNASRILGRP

Query:  IEADFREFSC-ISNCIANMSQKTWSFLTHGCPYLKAFTDPFDFSWPKTIMTYSSDRDIGAHSIDRSCVRSKTKDVCFQSEPQHSNQERESIIGLGIHCLV
        ++ADFREFS  ISNCIA++SQK WS LTHGCPYLKAFT PFDFSWPKT     +++DI    ID SC  SKT+DVC + +PQ SNQERESI GLGIHCL 
Subjt:  IEADFREFSC-ISNCIANMSQKTWSFLTHGCPYLKAFTDPFDFSWPKTIMTYSSDRDIGAHSIDRSCVRSKTKDVCFQSEPQHSNQERESIIGLGIHCLV

Query:  YGGYLASIFYGHHSHLASQIQNILHDLD
        YGGYLASI YGHHSHLASQIQNIL+DL+
Subjt:  YGGYLASIFYGHHSHLASQIQNILHDLD

A0A1S3CIT0 protein SET DOMAIN GROUP 41 isoform X11.2e-9778.95Show/hide
Query:  MDNNDENRRDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLTLARHISLWA-TTNFSKWGFPVGRRMCSNCSWVDKFNASRILGRP
        MD ++ENR +A TMS+TSAAY+LFLAGATHHLFL EPSLIASAANCWVVAGESLL LARH SLWA TTN S WGFP+G+RMCSNCSWVD+FN SRI GR 
Subjt:  MDNNDENRRDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLTLARHISLWA-TTNFSKWGFPVGRRMCSNCSWVDKFNASRILGRP

Query:  IEADFREFSC-ISNCIANMSQKTWSFLTHGCPYLKAFTDPFDFSWPKTIMTYSSDRDIGAHSIDRSCVRSKTKDVCFQSEPQHSNQERESIIGLGIHCLV
        I+ADFREFS  ISNCIA++S+K WSFLTHGCPYLKAFTDPFDFSWPKT     +D DIG H IDRSC  SKTKD+CF+ EPQ SNQERESI GLGIHCL 
Subjt:  IEADFREFSC-ISNCIANMSQKTWSFLTHGCPYLKAFTDPFDFSWPKTIMTYSSDRDIGAHSIDRSCVRSKTKDVCFQSEPQHSNQERESIIGLGIHCLV

Query:  YGGYLASIFYGHHSHLASQIQNILHDLD
        YGGYLASI YG+HSHLASQIQNIL+DL+
Subjt:  YGGYLASIFYGHHSHLASQIQNILHDLD

A0A5A7T0X4 Protein SET DOMAIN GROUP 41 isoform X11.2e-9778.95Show/hide
Query:  MDNNDENRRDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLTLARHISLWA-TTNFSKWGFPVGRRMCSNCSWVDKFNASRILGRP
        MD ++ENR +A TMS+TSAAY+LFLAGATHHLFL EPSLIASAANCWVVAGESLL LARH SLWA TTN S WGFP+G+RMCSNCSWVD+FN SRI GR 
Subjt:  MDNNDENRRDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLTLARHISLWA-TTNFSKWGFPVGRRMCSNCSWVDKFNASRILGRP

Query:  IEADFREFSC-ISNCIANMSQKTWSFLTHGCPYLKAFTDPFDFSWPKTIMTYSSDRDIGAHSIDRSCVRSKTKDVCFQSEPQHSNQERESIIGLGIHCLV
        I+ADFREFS  ISNCIA++S+K WSFLTHGCPYLKAFTDPFDFSWPKT     +D DIG H IDRSC  SKTKD+CF+ EPQ SNQERESI GLGIHCL 
Subjt:  IEADFREFSC-ISNCIANMSQKTWSFLTHGCPYLKAFTDPFDFSWPKTIMTYSSDRDIGAHSIDRSCVRSKTKDVCFQSEPQHSNQERESIIGLGIHCLV

Query:  YGGYLASIFYGHHSHLASQIQNILHDLD
        YGGYLASI YG+HSHLASQIQNIL+DL+
Subjt:  YGGYLASIFYGHHSHLASQIQNILHDLD

A0A6J1DDA7 protein SET DOMAIN GROUP 41 isoform X27.5e-7666.52Show/hide
Query:  MDNNDENRRDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLTLARHISLWATTNFSKWGFPVGRRMCSNCSWVDKFNASRILGRPI
        +D++DEN R+ASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGES+L L R  S WA  + SKW FP+ +RMCS C+WV+ FN+SRI GR  
Subjt:  MDNNDENRRDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLTLARHISLWATTNFSKWGFPVGRRMCSNCSWVDKFNASRILGRPI

Query:  EADFREFSC-ISNCIANMSQKTWSFLTHGCPYLKAFTDPFDFSWPKTIMTYSSDRDIGAHSIDRSCVRSKTKDVCFQSEPQ-HSNQERESIIGLGIHCLV
        + DF   S    +CIAN+SQ+ WSFLTHGCPYLKAFTDPFDFSWPKT  ++S        SI+RS    KTKD+  Q E Q HSN+ER+ I  LG+HCL 
Subjt:  EADFREFSC-ISNCIANMSQKTWSFLTHGCPYLKAFTDPFDFSWPKTIMTYSSDRDIGAHSIDRSCVRSKTKDVCFQSEPQ-HSNQERESIIGLGIHCLV

Query:  YGGYLASIFYGHHSHLASQIQNILHDL
        YG YLAS+ YGHHSHLASQIQNIL ++
Subjt:  YGGYLASIFYGHHSHLASQIQNILHDL

A0A6J1DFD6 protein SET DOMAIN GROUP 41 isoform X17.5e-7666.52Show/hide
Query:  MDNNDENRRDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLTLARHISLWATTNFSKWGFPVGRRMCSNCSWVDKFNASRILGRPI
        +D++DEN R+ASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGES+L L R  S WA  + SKW FP+ +RMCS C+WV+ FN+SRI GR  
Subjt:  MDNNDENRRDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLTLARHISLWATTNFSKWGFPVGRRMCSNCSWVDKFNASRILGRPI

Query:  EADFREFSC-ISNCIANMSQKTWSFLTHGCPYLKAFTDPFDFSWPKTIMTYSSDRDIGAHSIDRSCVRSKTKDVCFQSEPQ-HSNQERESIIGLGIHCLV
        + DF   S    +CIAN+SQ+ WSFLTHGCPYLKAFTDPFDFSWPKT  ++S        SI+RS    KTKD+  Q E Q HSN+ER+ I  LG+HCL 
Subjt:  EADFREFSC-ISNCIANMSQKTWSFLTHGCPYLKAFTDPFDFSWPKTIMTYSSDRDIGAHSIDRSCVRSKTKDVCFQSEPQ-HSNQERESIIGLGIHCLV

Query:  YGGYLASIFYGHHSHLASQIQNILHDL
        YG YLAS+ YGHHSHLASQIQNIL ++
Subjt:  YGGYLASIFYGHHSHLASQIQNILHDL

SwissProt top hitse value%identityAlignment
Q3ECY6 Protein SET DOMAIN GROUP 413.5e-2234.3Show/hide
Query:  MSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLTLARHISLWATTNFSKWGFPVGRRMCSNCSWVDKFNASRILGRPIEADFREFS-CISN
        MSR SAAYSLFLAG +HHLF +E S   SAA  W  AGE L  LA  + +  +              C+ C  ++  N+ R        D +E S  I +
Subjt:  MSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLTLARHISLWATTNFSKWGFPVGRRMCSNCSWVDKFNASRILGRPIEADFREFS-CISN

Query:  CIANMSQKTWSFLTHGCPYLKAFTDPFDFSWPKTIMTYSSDRDIGAHSIDRSCVRSKTKDVCFQSEPQHSNQERESIIGLGIHCLVYGGYLASIFYGHHS
        C+ ++SQ TWSFLT GCPYL+ F  P DFS  +T    + +R+                        + S  +  +++ L  HCL+Y   L  + YG  S
Subjt:  CIANMSQKTWSFLTHGCPYLKAFTDPFDFSWPKTIMTYSSDRDIGAHSIDRSCVRSKTKDVCFQSEPQHSNQERESIIGLGIHCLVYGGYLASIFYGHHS

Query:  HLASQIQ
        HL S+ +
Subjt:  HLASQIQ

Arabidopsis top hitse value%identityAlignment
AT1G43245.1 SET domain-containing protein2.5e-2334.3Show/hide
Query:  MSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLTLARHISLWATTNFSKWGFPVGRRMCSNCSWVDKFNASRILGRPIEADFREFS-CISN
        MSR SAAYSLFLAG +HHLF +E S   SAA  W  AGE L  LA  + +  +              C+ C  ++  N+ R        D +E S  I +
Subjt:  MSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLTLARHISLWATTNFSKWGFPVGRRMCSNCSWVDKFNASRILGRPIEADFREFS-CISN

Query:  CIANMSQKTWSFLTHGCPYLKAFTDPFDFSWPKTIMTYSSDRDIGAHSIDRSCVRSKTKDVCFQSEPQHSNQERESIIGLGIHCLVYGGYLASIFYGHHS
        C+ ++SQ TWSFLT GCPYL+ F  P DFS  +T    + +R+                        + S  +  +++ L  HCL+Y   L  + YG  S
Subjt:  CIANMSQKTWSFLTHGCPYLKAFTDPFDFSWPKTIMTYSSDRDIGAHSIDRSCVRSKTKDVCFQSEPQHSNQERESIIGLGIHCLVYGGYLASIFYGHHS

Query:  HLASQIQ
        HL S+ +
Subjt:  HLASQIQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACAATAACGATGAAAATCGACGTGATGCATCTACCATGAGCAGAACAAGTGCAGCATATTCCTTGTTCCTTGCAGGTGCTACTCATCATCTTTTTCTTTCTGAACC
ATCTTTGATTGCTTCTGCTGCAAATTGTTGGGTTGTTGCTGGAGAGTCTTTGCTTACTCTTGCTAGACACATCTCATTATGGGCTACTACTAACTTCTCAAAATGGGGTT
TCCCTGTTGGAAGAAGAATGTGCTCCAACTGCTCATGGGTCGATAAGTTCAATGCAAGTAGGATCCTTGGTCGACCTATCGAAGCTGATTTTCGTGAGTTTTCATGTATT
TCAAATTGTATTGCTAATATGTCACAAAAAACTTGGAGCTTTTTGACTCATGGATGCCCATATTTGAAGGCTTTCACTGACCCCTTCGATTTCAGCTGGCCAAAGACGAT
CATGACATATTCGAGTGACCGGGATATAGGGGCTCATAGCATTGATCGTTCATGTGTTCGTAGTAAAACTAAGGATGTCTGTTTTCAGAGTGAACCTCAGCATTCTAACC
AAGAGAGAGAGTCTATCATTGGGCTTGGCATCCATTGCTTAGTCTATGGAGGCTATTTAGCAAGTATTTTTTATGGTCACCATTCACATTTGGCATCTCAGATTCAAAAT
ATTTTACATGACTTGGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGACAATAACGATGAAAATCGACGTGATGCATCTACCATGAGCAGAACAAGTGCAGCATATTCCTTGTTCCTTGCAGGTGCTACTCATCATCTTTTTCTTTCTGAACC
ATCTTTGATTGCTTCTGCTGCAAATTGTTGGGTTGTTGCTGGAGAGTCTTTGCTTACTCTTGCTAGACACATCTCATTATGGGCTACTACTAACTTCTCAAAATGGGGTT
TCCCTGTTGGAAGAAGAATGTGCTCCAACTGCTCATGGGTCGATAAGTTCAATGCAAGTAGGATCCTTGGTCGACCTATCGAAGCTGATTTTCGTGAGTTTTCATGTATT
TCAAATTGTATTGCTAATATGTCACAAAAAACTTGGAGCTTTTTGACTCATGGATGCCCATATTTGAAGGCTTTCACTGACCCCTTCGATTTCAGCTGGCCAAAGACGAT
CATGACATATTCGAGTGACCGGGATATAGGGGCTCATAGCATTGATCGTTCATGTGTTCGTAGTAAAACTAAGGATGTCTGTTTTCAGAGTGAACCTCAGCATTCTAACC
AAGAGAGAGAGTCTATCATTGGGCTTGGCATCCATTGCTTAGTCTATGGAGGCTATTTAGCAAGTATTTTTTATGGTCACCATTCACATTTGGCATCTCAGATTCAAAAT
ATTTTACATGACTTGGATTGA
Protein sequenceShow/hide protein sequence
MDNNDENRRDASTMSRTSAAYSLFLAGATHHLFLSEPSLIASAANCWVVAGESLLTLARHISLWATTNFSKWGFPVGRRMCSNCSWVDKFNASRILGRPIEADFREFSCI
SNCIANMSQKTWSFLTHGCPYLKAFTDPFDFSWPKTIMTYSSDRDIGAHSIDRSCVRSKTKDVCFQSEPQHSNQERESIIGLGIHCLVYGGYLASIFYGHHSHLASQIQN
ILHDLD