; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc04G10966 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc04G10966
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionDDE Tnp4 domain-containing protein
Genome locationClcChr04:24568133..24574911
RNA-Seq ExpressionClc04G10966
SyntenyClc04G10966
Gene Ontology termsGO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR040344 - Uncharacterized protein At3g17950-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYJ99038.1 putative nuclease HARBI1 [Cucumis melo var. makuwa]4.9e-9592.42Show/hide
Query:  SNTSLSFSIQMLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAATAATGGGSRKSKKTKRKTTTVPA
        S+ SL    +MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQH AAT   GGGSRKSKKTKRKTTT PA
Subjt:  SNTSLSFSIQMLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAATAATGGGSRKSKKTKRKTTTVPA

Query:  LVADRKRRWWRLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVATDQQRNGRSLFADGRVLPPAQTEEDTSAAGALCRFSVSLTGICSGGAG
        LVADRKRRWWRLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVV  DQQRNGRSLFADGRVLPPAQTEEDTSA GALCRFSVSLTGICSGGAG
Subjt:  LVADRKRRWWRLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVATDQQRNGRSLFADGRVLPPAQTEEDTSAAGALCRFSVSLTGICSGGAG

XP_008465627.2 PREDICTED: uncharacterized protein At3g17950 isoform X1 [Cucumis melo]1.1e-9493.4Show/hide
Query:  FSIQ-----MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAATAATGGGSRKSKKTKRKTTTVPAL
        FSIQ     MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQH AAT   GGGSRKSKKTKRKTTT PAL
Subjt:  FSIQ-----MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAATAATGGGSRKSKKTKRKTTTVPAL

Query:  VADRKRRWWRLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVATDQQRNGRSLFADGRVLPPAQTEEDTSAAGALCRFSVSLTGICSGGAG
        VADRKRRWWRLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVV  DQQRNGRSLFADGRVLPPAQTEEDTSA GALCRFSVSLTGICSGGAG
Subjt:  VADRKRRWWRLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVATDQQRNGRSLFADGRVLPPAQTEEDTSAAGALCRFSVSLTGICSGGAG

XP_022924210.1 uncharacterized protein At3g17950 [Cucurbita moschata]1.9e-9496.28Show/hide
Query:  MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAATAATGGGSRKSKKTKRKTTTVPALVADRKRRWW
        MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAATAA GGGSRKSKK KRKTT  PALVADRKRRWW
Subjt:  MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAATAATGGGSRKSKKTKRKTTTVPALVADRKRRWW

Query:  RLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVATDQQRNGRSLFADGRVLPPAQTEEDTSAAGALCRFSVSLTGICSGGAG
        RLCRDDGVKPASLGEFLEVERRFGDGAF+GNAVDLEGVV+ DQQRNGRSLFADGRVLPPAQTEEDTSAAGALCRFSVSLTGICSGGAG
Subjt:  RLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVATDQQRNGRSLFADGRVLPPAQTEEDTSAAGALCRFSVSLTGICSGGAG

XP_031739825.1 uncharacterized protein At3g17950 [Cucumis sativus]3.8e-9591.63Show/hide
Query:  SNTSLSFSIQ-----MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAATAATGGGSRKSKKTKRKT
        SN +LSFSIQ     MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQH AAT   GGGSRKSKKTKRKT
Subjt:  SNTSLSFSIQ-----MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAATAATGGGSRKSKKTKRKT

Query:  TTVPALVADRKRRWWRLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVATDQQRNGRSLFADGRVLPPAQTEEDTSAAGALCRFSVSLTGICSG
        T  PALVADRKRRWWRLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVA DQQRNGRSLFADGRVLPPAQTEEDTS A  LCRFSVSLTGICSG
Subjt:  TTVPALVADRKRRWWRLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVATDQQRNGRSLFADGRVLPPAQTEEDTSAAGALCRFSVSLTGICSG

Query:  GAG
        GAG
Subjt:  GAG

XP_038895169.1 uncharacterized protein At3g17950 [Benincasa hispida]1.7e-9596.81Show/hide
Query:  MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAATAATGGGSRKSKKTKRKTTTVPALVADRKRRWW
        MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQH AATAA GGGSRKSKKTKRKT+T PALVADRKRRWW
Subjt:  MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAATAATGGGSRKSKKTKRKTTTVPALVADRKRRWW

Query:  RLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVATDQQRNGRSLFADGRVLPPAQTEEDTSAAGALCRFSVSLTGICSGGAG
        RLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVATDQQRNGRSLFADGRVLPPA+TE+DTSAAGALCRFSVSLTGICSGGAG
Subjt:  RLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVATDQQRNGRSLFADGRVLPPAQTEEDTSAAGALCRFSVSLTGICSGGAG

TrEMBL top hitse value%identityAlignment
A0A0A0LQI1 DDE Tnp4 domain-containing protein5.9e-9491.41Show/hide
Query:  SNTSLSFSIQMLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAATAATGGGSRKSKKTKRKTTTVPA
        S+ SL    +MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQH AAT   GGGSRKSKKTKRKTT  PA
Subjt:  SNTSLSFSIQMLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAATAATGGGSRKSKKTKRKTTTVPA

Query:  LVADRKRRWWRLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVATDQQRNGRSLFADGRVLPPAQTEEDTSAAGALCRFSVSLTGICSGGAG
        LVADRKRRWWRLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVA DQQRNGRSLFADGRVLPPAQTEEDTS A  LCRFSVSLTGICSGGAG
Subjt:  LVADRKRRWWRLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVATDQQRNGRSLFADGRVLPPAQTEEDTSAAGALCRFSVSLTGICSGGAG

A0A1S3CPP8 uncharacterized protein At3g17950 isoform X15.3e-9593.4Show/hide
Query:  FSIQ-----MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAATAATGGGSRKSKKTKRKTTTVPAL
        FSIQ     MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQH AAT   GGGSRKSKKTKRKTTT PAL
Subjt:  FSIQ-----MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAATAATGGGSRKSKKTKRKTTTVPAL

Query:  VADRKRRWWRLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVATDQQRNGRSLFADGRVLPPAQTEEDTSAAGALCRFSVSLTGICSGGAG
        VADRKRRWWRLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVV  DQQRNGRSLFADGRVLPPAQTEEDTSA GALCRFSVSLTGICSGGAG
Subjt:  VADRKRRWWRLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVATDQQRNGRSLFADGRVLPPAQTEEDTSAAGALCRFSVSLTGICSGGAG

A0A1S4DUB1 uncharacterized protein At3g17950 isoform X21.5e-9495.74Show/hide
Query:  MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAATAATGGGSRKSKKTKRKTTTVPALVADRKRRWW
        MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQH AAT   GGGSRKSKKTKRKTTT PALVADRKRRWW
Subjt:  MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAATAATGGGSRKSKKTKRKTTTVPALVADRKRRWW

Query:  RLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVATDQQRNGRSLFADGRVLPPAQTEEDTSAAGALCRFSVSLTGICSGGAG
        RLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVV  DQQRNGRSLFADGRVLPPAQTEEDTSA GALCRFSVSLTGICSGGAG
Subjt:  RLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVATDQQRNGRSLFADGRVLPPAQTEEDTSAAGALCRFSVSLTGICSGGAG

A0A5D3BLI7 Putative nuclease HARBI12.4e-9592.42Show/hide
Query:  SNTSLSFSIQMLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAATAATGGGSRKSKKTKRKTTTVPA
        S+ SL    +MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQH AAT   GGGSRKSKKTKRKTTT PA
Subjt:  SNTSLSFSIQMLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAATAATGGGSRKSKKTKRKTTTVPA

Query:  LVADRKRRWWRLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVATDQQRNGRSLFADGRVLPPAQTEEDTSAAGALCRFSVSLTGICSGGAG
        LVADRKRRWWRLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVV  DQQRNGRSLFADGRVLPPAQTEEDTSA GALCRFSVSLTGICSGGAG
Subjt:  LVADRKRRWWRLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVATDQQRNGRSLFADGRVLPPAQTEEDTSAAGALCRFSVSLTGICSGGAG

A0A6J1E8Y2 uncharacterized protein At3g179509.1e-9596.28Show/hide
Query:  MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAATAATGGGSRKSKKTKRKTTTVPALVADRKRRWW
        MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAATAA GGGSRKSKK KRKTT  PALVADRKRRWW
Subjt:  MLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAATAATGGGSRKSKKTKRKTTTVPALVADRKRRWW

Query:  RLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVATDQQRNGRSLFADGRVLPPAQTEEDTSAAGALCRFSVSLTGICSGGAG
        RLCRDDGVKPASLGEFLEVERRFGDGAF+GNAVDLEGVV+ DQQRNGRSLFADGRVLPPAQTEEDTSAAGALCRFSVSLTGICSGGAG
Subjt:  RLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVATDQQRNGRSLFADGRVLPPAQTEEDTSAAGALCRFSVSLTGICSGGAG

SwissProt top hitse value%identityAlignment
Q6DR24 Uncharacterized protein At3g179503.4e-3851.71Show/hide
Query:  PSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPA---ITFRVPSQNR-DQHAAATAATGGGSRKSKKTKRKTTTVPALVADRKRRWWRLCRDD-
        PSSPT SS+SSSDLDTESTGSFFHDRS +LGTLMG SF A   + FR  S+       A + A+   +R++ + KR  +        R+R+WWR CRDD 
Subjt:  PSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPA---ITFRVPSQNR-DQHAAATAATGGGSRKSKKTKRKTTTVPALVADRKRRWWRLCRDD-

Query:  ------GV-------KPASLGEFLEVERRFGDGAFYGNA-VDLEGVVAT---DQQ--RNGRSLFADGRVLPPAQTE----EDTSAAGALCRFSVSLTGIC
              G+       K +SLGE+LEVERRFGD A Y +A  +LE  V     DQQ     R+LFADGRVLPPA  E    E T  A +LCRF VSLTGIC
Subjt:  ------GV-------KPASLGEFLEVERRFGDGAFYGNA-VDLEGVVAT---DQQ--RNGRSLFADGRVLPPAQTE----EDTSAAGALCRFSVSLTGIC

Query:  SGGAG
        SGG G
Subjt:  SGGAG

Arabidopsis top hitse value%identityAlignment
AT3G17950.1 unknown protein2.4e-3951.71Show/hide
Query:  PSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPA---ITFRVPSQNR-DQHAAATAATGGGSRKSKKTKRKTTTVPALVADRKRRWWRLCRDD-
        PSSPT SS+SSSDLDTESTGSFFHDRS +LGTLMG SF A   + FR  S+       A + A+   +R++ + KR  +        R+R+WWR CRDD 
Subjt:  PSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPA---ITFRVPSQNR-DQHAAATAATGGGSRKSKKTKRKTTTVPALVADRKRRWWRLCRDD-

Query:  ------GV-------KPASLGEFLEVERRFGDGAFYGNA-VDLEGVVAT---DQQ--RNGRSLFADGRVLPPAQTE----EDTSAAGALCRFSVSLTGIC
              G+       K +SLGE+LEVERRFGD A Y +A  +LE  V     DQQ     R+LFADGRVLPPA  E    E T  A +LCRF VSLTGIC
Subjt:  ------GV-------KPASLGEFLEVERRFGDGAFYGNA-VDLEGVVAT---DQQ--RNGRSLFADGRVLPPAQTE----EDTSAAGALCRFSVSLTGIC

Query:  SGGAG
        SGG G
Subjt:  SGGAG

AT3G17950.2 unknown protein8.3e-2444.77Show/hide
Query:  MGVSFPA---ITFRVPSQNR-DQHAAATAATGGGSRKSKKTKRKTTTVPALVADRKRRWWRLCRDD-------GV-------KPASLGEFLEVERRFGDG
        MG SF A   + FR  S+       A + A+   +R++ + KR  +        R+R+WWR CRDD       G+       K +SLGE+LEVERRFGD 
Subjt:  MGVSFPA---ITFRVPSQNR-DQHAAATAATGGGSRKSKKTKRKTTTVPALVADRKRRWWRLCRDD-------GV-------KPASLGEFLEVERRFGDG

Query:  AFYGNA-VDLEGVVAT---DQQ--RNGRSLFADGRVLPPAQTE----EDTSAAGALCRFSVSLTGICSGGAG
        A Y +A  +LE  V     DQQ     R+LFADGRVLPPA  E    E T  A +LCRF VSLTGICSGG G
Subjt:  AFYGNA-VDLEGVVAT---DQQ--RNGRSLFADGRVLPPAQTE----EDTSAAGALCRFSVSLTGICSGGAG

AT5G02440.1 unknown protein3.9e-0552.73Show/hide
Query:  SPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGV-SFPAITFRVPSQNRDQHAAA
        SP+ SS SSSDLD++S GSFF DRS +LG L+G+ SF  ++ R      DQ  AA
Subjt:  SPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGV-SFPAITFRVPSQNRDQHAAA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGCATGCATTCTTTTGGAGGGAAGCCGAATTCTTGAGTACTGCTGCCAAGGAAGCAACACCAGTCTCAGCTTCTCTATACAGATGTTGAATCCGGCCAACGATCT
GTTACCGCCGCCGTCTTCTCCGACCAATTCATCCATTTCCTCCTCCGATCTCGACACCGAGTCGACGGGTTCGTTCTTCCATGACAGGAGCACGAGCTTGGGAACTCTAA
TGGGGGTCAGCTTTCCGGCGATTACTTTCCGAGTGCCCTCCCAGAACAGAGATCAACACGCCGCCGCGACTGCCGCCACAGGCGGCGGTTCTCGGAAAAGCAAGAAGACG
AAGAGAAAAACGACAACGGTGCCGGCACTGGTTGCAGATCGGAAACGGCGATGGTGGAGGCTTTGCAGGGATGACGGCGTTAAGCCGGCGTCTCTGGGTGAGTTTCTCGA
AGTAGAACGGAGATTTGGGGATGGTGCTTTCTACGGCAACGCGGTGGATCTGGAAGGCGTGGTTGCGACGGATCAACAGAGGAATGGCCGGTCTTTGTTCGCAGATGGAA
GAGTTCTTCCGCCGGCACAAACCGAGGAAGATACATCGGCGGCCGGCGCTCTATGCCGATTTTCTGTATCGCTTACCGGAATATGCAGCGGCGGTGCCGGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGTGCATGCATTCTTTTGGAGGGAAGCCGAATTCTTGAGTACTGCTGCCAAGGAAGCAACACCAGTCTCAGCTTCTCTATACAGATGTTGAATCCGGCCAACGATCT
GTTACCGCCGCCGTCTTCTCCGACCAATTCATCCATTTCCTCCTCCGATCTCGACACCGAGTCGACGGGTTCGTTCTTCCATGACAGGAGCACGAGCTTGGGAACTCTAA
TGGGGGTCAGCTTTCCGGCGATTACTTTCCGAGTGCCCTCCCAGAACAGAGATCAACACGCCGCCGCGACTGCCGCCACAGGCGGCGGTTCTCGGAAAAGCAAGAAGACG
AAGAGAAAAACGACAACGGTGCCGGCACTGGTTGCAGATCGGAAACGGCGATGGTGGAGGCTTTGCAGGGATGACGGCGTTAAGCCGGCGTCTCTGGGTGAGTTTCTCGA
AGTAGAACGGAGATTTGGGGATGGTGCTTTCTACGGCAACGCGGTGGATCTGGAAGGCGTGGTTGCGACGGATCAACAGAGGAATGGCCGGTCTTTGTTCGCAGATGGAA
GAGTTCTTCCGCCGGCACAAACCGAGGAAGATACATCGGCGGCCGGCGCTCTATGCCGATTTTCTGTATCGCTTACCGGAATATGCAGCGGCGGTGCCGGTTAA
Protein sequenceShow/hide protein sequence
MGACILLEGSRILEYCCQGSNTSLSFSIQMLNPANDLLPPPSSPTNSSISSSDLDTESTGSFFHDRSTSLGTLMGVSFPAITFRVPSQNRDQHAAATAATGGGSRKSKKT
KRKTTTVPALVADRKRRWWRLCRDDGVKPASLGEFLEVERRFGDGAFYGNAVDLEGVVATDQQRNGRSLFADGRVLPPAQTEEDTSAAGALCRFSVSLTGICSGGAG