; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G022830 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G022830
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionHeavy metal-associated domain containing protein
Genome locationCG_Chr05:34698224..34699491
RNA-Seq ExpressionClCG05G022830
SyntenyClCG05G022830
Gene Ontology termsGO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR036163 - Heavy metal-associated domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7013033.1 hypothetical protein SDJN02_25789 [Cucurbita argyrosperma subsp. argyrosperma]4.4e-3287.1Show/hide
Query:  AVAGIGVRKQKIFPHASSLASMESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQVP-SRVSTIKRLFGYS
        A AGIGVRK KIFPHASSLAS+ESLSLPLVQEIV  ADI CAECQKKLANIL+KMNDTESVVVNLLDKKVILTR+LQ+P SRVSTIKRL G S
Subjt:  AVAGIGVRKQKIFPHASSLASMESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQVP-SRVSTIKRLFGYS

KGN52069.1 hypothetical protein Csa_009028 [Cucumis sativus]4.7e-3486.02Show/hide
Query:  VAGIGVRKQKIFPHASSLASMESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQVPSRVSTIKRLFGYSCK
        V GIGVRK++IFPHASSLAS+ESL+LPLVQEIVLTADI CAECQKKLANILSKMNDTESVVVNLLDKKVILTRR Q+PSRVSTI+RLF   C+
Subjt:  VAGIGVRKQKIFPHASSLASMESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQVPSRVSTIKRLFGYSCK

XP_022968296.1 uncharacterized protein LOC111467567 isoform X3 [Cucurbita maxima]6.8e-3388.17Show/hide
Query:  AVAGIGVRKQKIFPHASSLASMESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQVP-SRVSTIKRLFGYS
        A AGIGVRK KIFPHASSLAS+ESLSLPLVQEIV TADI CAECQKKLANILSKMNDTESVVVNLLDKKVILTR+LQ+P SRVST+KRL G S
Subjt:  AVAGIGVRKQKIFPHASSLASMESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQVP-SRVSTIKRLFGYS

XP_023542320.1 uncharacterized protein LOC111802252 [Cucurbita pepo subsp. pepo]4.4e-3287.1Show/hide
Query:  AVAGIGVRKQKIFPHASSLASMESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQVP-SRVSTIKRLFGYS
        A AGIGVRK KIFPHASSLAS+ESLSLPLVQEIV  ADI CAECQKKLANIL+KMNDTESVVVNLLDKKVILTR+LQ+P SRVSTIKRL G S
Subjt:  AVAGIGVRKQKIFPHASSLASMESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQVP-SRVSTIKRLFGYS

XP_038892707.1 uncharacterized protein LOC120081691 [Benincasa hispida]3.9e-3689.36Show/hide
Query:  AVAGIGVRKQKIFPHASSLASMESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQVPSRVSTIKRLFGYSCK
        A AGIGVRK+KIFPHASSLAS+ESLSLPLVQEIVLTADI CAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQ+PSR+STIKRL G  C+
Subjt:  AVAGIGVRKQKIFPHASSLASMESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQVPSRVSTIKRLFGYSCK

TrEMBL top hitse value%identityAlignment
A0A0A0KT62 Uncharacterized protein2.3e-3486.02Show/hide
Query:  VAGIGVRKQKIFPHASSLASMESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQVPSRVSTIKRLFGYSCK
        V GIGVRK++IFPHASSLAS+ESL+LPLVQEIVLTADI CAECQKKLANILSKMNDTESVVVNLLDKKVILTRR Q+PSRVSTI+RLF   C+
Subjt:  VAGIGVRKQKIFPHASSLASMESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQVPSRVSTIKRLFGYSCK

A0A6J1DB71 uncharacterized protein LOC1110187077.6e-3080.41Show/hide
Query:  AGIGVRKQKIFPHASSLASMESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQVP-------SRVSTIKRLFGYS
        A IGVRK +IFPHASSLAS+ESLSLPLVQEIVLTADIRC ECQ KLANILSK+NDTESVVVNLL+KKVILTRRLQVP       S+V+TIKRL G S
Subjt:  AGIGVRKQKIFPHASSLASMESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQVP-------SRVSTIKRLFGYS

A0A6J1G176 uncharacterized protein LOC111449743 isoform X14.6e-2766.67Show/hide
Query:  AVAGIGVRKQKIFPHASSLASMESLSLPL---------------------------VQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILT
        A AGIGVRK KIFPHASSLAS+ESLSLPL                           VQEIV  ADI CAECQKKLANIL+KMNDTESVVVNLLDKKVILT
Subjt:  AVAGIGVRKQKIFPHASSLASMESLSLPL---------------------------VQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILT

Query:  RRLQV-PSRVSTIKRLFGYS
        R+LQ+  SRVSTIKRL G S
Subjt:  RRLQV-PSRVSTIKRLFGYS

A0A6J1G188 uncharacterized protein LOC111449743 isoform X31.8e-3186.02Show/hide
Query:  AVAGIGVRKQKIFPHASSLASMESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQV-PSRVSTIKRLFGYS
        A AGIGVRK KIFPHASSLAS+ESLSLPLVQEIV  ADI CAECQKKLANIL+KMNDTESVVVNLLDKKVILTR+LQ+  SRVSTIKRL G S
Subjt:  AVAGIGVRKQKIFPHASSLASMESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQV-PSRVSTIKRLFGYS

A0A6J1HUG9 uncharacterized protein LOC111467567 isoform X33.3e-3388.17Show/hide
Query:  AVAGIGVRKQKIFPHASSLASMESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQVP-SRVSTIKRLFGYS
        A AGIGVRK KIFPHASSLAS+ESLSLPLVQEIV TADI CAECQKKLANILSKMNDTESVVVNLLDKKVILTR+LQ+P SRVST+KRL G S
Subjt:  AVAGIGVRKQKIFPHASSLASMESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQVP-SRVSTIKRLFGYS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G68585.1 unknown protein7.6e-1450Show/hide
Query:  SSLASMESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILT------RRLQ------VPSRVSTIKRL
        +SLAS+ SLS+PL+QEIVL+ADIRC++CQ+K+A+I+++M +T S++V++L+KKV LT      RR+       +  ++STIKRL
Subjt:  SSLASMESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILT------RRLQ------VPSRVSTIKRL

AT2G35730.1 Heavy metal transport/detoxification superfamily protein1.1e-0440.3Show/hide
Query:  SLASMESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQVPSRVST
        +L   E LSLP  Q I + AD+ C  CQ +++ I+SKM   E  VV+ L KK+++ R    P  VS+
Subjt:  SLASMESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQVPSRVST


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGTGGCCGGAATCGGAGTTCGGAAGCAAAAAATTTTTCCTCATGCTTCTAGCCTTGCTTCCATGGAGTCCTTATCTCTGCCTCTTGTTCAGGAAATTGTATTAAC
AGCTGATATTCGATGTGCTGAATGTCAGAAGAAACTGGCAAATATACTCTCCAAAATGAATGATACAGAGTCTGTGGTGGTGAATTTGTTGGACAAGAAAGTGATATTGA
CTCGAAGATTACAGGTTCCATCCAGAGTTTCAACAATCAAGAGATTGTTTGGTTATTCTTGCAAATAA
mRNA sequenceShow/hide mRNA sequence
CCCTTCTCCCTTTTCCCAACCACTTATAATAATCCATCCCTCTTTCTTCACAAACCCATACATAAAATCTTCCTTTCCCTTCTTCACGATCCCCTGTTTCATTGGCGGAG
CATGGCGGTGGCCGGAATCGGAGTTCGGAAGCAAAAAATTTTTCCTCATGCTTCTAGCCTTGCTTCCATGGAGTCCTTATCTCTGCCTCTTGTTCAGGAAATTGTATTAA
CAGCTGATATTCGATGTGCTGAATGTCAGAAGAAACTGGCAAATATACTCTCCAAAATGAATGATACAGAGTCTGTGGTGGTGAATTTGTTGGACAAGAAAGTGATATTG
ACTCGAAGATTACAGGTTCCATCCAGAGTTTCAACAATCAAGAGATTGTTTGGTTATTCTTGCAAATAATTATAAAATATATAATGAAATATTTTTTTAGTCATGTTGTA
GATAGATAAATGATCCCACTACTTGATCTTTAGCTT
Protein sequenceShow/hide protein sequence
MAVAGIGVRKQKIFPHASSLASMESLSLPLVQEIVLTADIRCAECQKKLANILSKMNDTESVVVNLLDKKVILTRRLQVPSRVSTIKRLFGYSCK