CuGenDBv2

ClCG08G009070 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG08G009070
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: centromere protein V-like
Genome location: CG_Chr08:21625729..21626136
RNA-Seq Expression: ClCG08G009070
Synteny: ClCG08G009070
Gene Ontology terms:
GO:0016846 - carbon-sulfur lyase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domains:
IPR006913 - CENP-V/GFA domain
IPR011057 - Mss4-like superfamily
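
As a quick sanity check on the genome location above, the span of the gene can be computed from the `chromosome:start..end` string. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (the record does not state this); `span_length` is a hypothetical helper name, not part of any CuGenDB tooling.

```python
def span_length(location: str) -> int:
    """Return the number of bases covered by a 'chrom:start..end' location.

    Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
    """
    _chrom, coords = location.split(":")
    start, end = (int(value) for value in coords.split(".."))
    return end - start + 1


if __name__ == "__main__":
    print(span_length("CG_Chr08:21625729..21626136"))
```

Under that assumption the span is 408 bases, which is the length expected for a 135-residue protein plus a stop codon (136 codons).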


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6599075.1 Centromere protein V, partial [Cucurbita argyrosperma subsp. sororia] | e-value: 2.8e-72 | identity: 92.59%
Query:  MASALVVHNGGCHCKKVRWRVEAPASVVAWDCNCSDCFMRANTHFIVPSERFKLLGDSNNFISTYTFGTHTAKHTFCKICGITSFYHPRSNPDGVAITFK
        M S LVVHNGGCHCK VRWRVEAP SVVAWDCNCSDCFMRANTHFIVP ERFKLLGDS NFISTYTFGTHTAKHTFCK+CGITSFYHPRSNPDGVAITFK
Subjt:  MASALVVHNGGCHCKKVRWRVEAPASVVAWDCNCSDCFMRANTHFIVPSERFKLLGDSNNFISTYTFGTHTAKHTFCKICGITSFYHPRSNPDGVAITFK

Query:  CVDPGTLTHVEIKQFDGSNWEASYDHTGIASFSKV
        CVDPGTLTHVEI+QFDGSNWEASY HTGIASFSK+
Subjt:  CVDPGTLTHVEIKQFDGSNWEASYDHTGIASFSKV

KAG7030011.1 Centromere protein V, partial [Cucurbita argyrosperma subsp. argyrosperma] | e-value: 2.8e-72 | identity: 92.59%
Query:  MASALVVHNGGCHCKKVRWRVEAPASVVAWDCNCSDCFMRANTHFIVPSERFKLLGDSNNFISTYTFGTHTAKHTFCKICGITSFYHPRSNPDGVAITFK
        M S LVVHNGGCHCK VRWRVEAP SVVAWDCNCSDCFMRANTHFIVP ERFKLLGDS NFISTYTFGTHTAKHTFCK+CGITSFYHPRSNPDGVAITFK
Subjt:  MASALVVHNGGCHCKKVRWRVEAPASVVAWDCNCSDCFMRANTHFIVPSERFKLLGDSNNFISTYTFGTHTAKHTFCKICGITSFYHPRSNPDGVAITFK

Query:  CVDPGTLTHVEIKQFDGSNWEASYDHTGIASFSKV
        CVDPGTLTHVEI+QFDGSNWEASY HTGIASFSK+
Subjt:  CVDPGTLTHVEIKQFDGSNWEASYDHTGIASFSKV

XP_022946627.1 centromere protein V-like [Cucurbita moschata] | e-value: 2.8e-72 | identity: 92.59%
Query:  MASALVVHNGGCHCKKVRWRVEAPASVVAWDCNCSDCFMRANTHFIVPSERFKLLGDSNNFISTYTFGTHTAKHTFCKICGITSFYHPRSNPDGVAITFK
        M S LVVHNGGCHCK VRWRVEAP SVVAWDCNCSDCFMRANTHFIVP ERFKLLGDS NFISTYTFGTHTAKHTFCK+CGITSFYHPRSNPDGVAITFK
Subjt:  MASALVVHNGGCHCKKVRWRVEAPASVVAWDCNCSDCFMRANTHFIVPSERFKLLGDSNNFISTYTFGTHTAKHTFCKICGITSFYHPRSNPDGVAITFK

Query:  CVDPGTLTHVEIKQFDGSNWEASYDHTGIASFSKV
        CVDPGTLTHVEI+QFDGSNWEASY HTGIASFSK+
Subjt:  CVDPGTLTHVEIKQFDGSNWEASYDHTGIASFSKV

XP_023546052.1 centromere protein V [Cucurbita pepo subsp. pepo] | e-value: 7.3e-73 | identity: 93.33%
Query:  MASALVVHNGGCHCKKVRWRVEAPASVVAWDCNCSDCFMRANTHFIVPSERFKLLGDSNNFISTYTFGTHTAKHTFCKICGITSFYHPRSNPDGVAITFK
        M S LV HNGGCHCKKVRWRVEAPASVVAWDCNCSDCFMRANTHFIVP ERFKLLGDS NFISTYTFGTHTAKHTFCK+CGITSFYHPRSNPDGVAITFK
Subjt:  MASALVVHNGGCHCKKVRWRVEAPASVVAWDCNCSDCFMRANTHFIVPSERFKLLGDSNNFISTYTFGTHTAKHTFCKICGITSFYHPRSNPDGVAITFK

Query:  CVDPGTLTHVEIKQFDGSNWEASYDHTGIASFSKV
        CVDPGTLTHVEI+QFDGSNWEASY HTGIASFSK+
Subjt:  CVDPGTLTHVEIKQFDGSNWEASYDHTGIASFSKV

XP_038884393.1 centromere protein V [Benincasa hispida] | e-value: 9.6e-73 | identity: 93.28%
Query:  MASALVVHNGGCHCKKVRWRVEAPASVVAWDCNCSDCFMRANTHFIVPSERFKLLGDSNNFISTYTFGTHTAKHTFCKICGITSFYHPRSNPDGVAITFK
        MAS LVVHNGGCHCKKVRW+VEAPASVVAWDCNCS+CFMRANTHFIVP ERFKLLGDS+NFISTYTFG+HTAKHTFCK+CGITSFYHPRSNPDGVAITFK
Subjt:  MASALVVHNGGCHCKKVRWRVEAPASVVAWDCNCSDCFMRANTHFIVPSERFKLLGDSNNFISTYTFGTHTAKHTFCKICGITSFYHPRSNPDGVAITFK

Query:  CVDPGTLTHVEIKQFDGSNWEASYDHTGIASFSK
        CVDPGTLTHVEI++FDGSNWEASYDHTGIASFSK
Subjt:  CVDPGTLTHVEIKQFDGSNWEASYDHTGIASFSK

TrEMBL top hits (e-value, % identity, alignment)
A0A1S3CMD8 centromere protein V isoform X2 | e-value: 2.3e-72 | identity: 91.85%
Query:  MASALVVHNGGCHCKKVRWRVEAPASVVAWDCNCSDCFMRANTHFIVPSERFKLLGDSNNFISTYTFGTHTAKHTFCKICGITSFYHPRSNPDGVAITFK
        MAS LVVHNGGCHCKKVRWRVEAPASVVAWDCNCS+CFMRANTHFIVP ERFKLLGDS+NF+STYTFG+HTAKHTFCK CGITSFY PRSNPDGVAITFK
Subjt:  MASALVVHNGGCHCKKVRWRVEAPASVVAWDCNCSDCFMRANTHFIVPSERFKLLGDSNNFISTYTFGTHTAKHTFCKICGITSFYHPRSNPDGVAITFK

Query:  CVDPGTLTHVEIKQFDGSNWEASYDHTGIASFSKV
        CVDPGTLTHVE++QFDGSNWEASYDHTGIASFSK+
Subjt:  CVDPGTLTHVEIKQFDGSNWEASYDHTGIASFSKV

A0A1S3CMF5 centromere protein V isoform X5 | e-value: 2.3e-72 | identity: 91.85%
Query:  MASALVVHNGGCHCKKVRWRVEAPASVVAWDCNCSDCFMRANTHFIVPSERFKLLGDSNNFISTYTFGTHTAKHTFCKICGITSFYHPRSNPDGVAITFK
        MAS LVVHNGGCHCKKVRWRVEAPASVVAWDCNCS+CFMRANTHFIVP ERFKLLGDS+NF+STYTFG+HTAKHTFCK CGITSFY PRSNPDGVAITFK
Subjt:  MASALVVHNGGCHCKKVRWRVEAPASVVAWDCNCSDCFMRANTHFIVPSERFKLLGDSNNFISTYTFGTHTAKHTFCKICGITSFYHPRSNPDGVAITFK

Query:  CVDPGTLTHVEIKQFDGSNWEASYDHTGIASFSKV
        CVDPGTLTHVE++QFDGSNWEASYDHTGIASFSK+
Subjt:  CVDPGTLTHVEIKQFDGSNWEASYDHTGIASFSKV

A0A5D3CJK9 Centromere protein V isoform X1 | e-value: 2.3e-72 | identity: 91.85%
Query:  MASALVVHNGGCHCKKVRWRVEAPASVVAWDCNCSDCFMRANTHFIVPSERFKLLGDSNNFISTYTFGTHTAKHTFCKICGITSFYHPRSNPDGVAITFK
        MAS LVVHNGGCHCKKVRWRVEAPASVVAWDCNCS+CFMRANTHFIVP ERFKLLGDS+NF+STYTFG+HTAKHTFCK CGITSFY PRSNPDGVAITFK
Subjt:  MASALVVHNGGCHCKKVRWRVEAPASVVAWDCNCSDCFMRANTHFIVPSERFKLLGDSNNFISTYTFGTHTAKHTFCKICGITSFYHPRSNPDGVAITFK

Query:  CVDPGTLTHVEIKQFDGSNWEASYDHTGIASFSKV
        CVDPGTLTHVE++QFDGSNWEASYDHTGIASFSK+
Subjt:  CVDPGTLTHVEIKQFDGSNWEASYDHTGIASFSKV

A0A6J1G4E1 centromere protein V-like | e-value: 1.4e-72 | identity: 92.59%
Query:  MASALVVHNGGCHCKKVRWRVEAPASVVAWDCNCSDCFMRANTHFIVPSERFKLLGDSNNFISTYTFGTHTAKHTFCKICGITSFYHPRSNPDGVAITFK
        M S LVVHNGGCHCK VRWRVEAP SVVAWDCNCSDCFMRANTHFIVP ERFKLLGDS NFISTYTFGTHTAKHTFCK+CGITSFYHPRSNPDGVAITFK
Subjt:  MASALVVHNGGCHCKKVRWRVEAPASVVAWDCNCSDCFMRANTHFIVPSERFKLLGDSNNFISTYTFGTHTAKHTFCKICGITSFYHPRSNPDGVAITFK

Query:  CVDPGTLTHVEIKQFDGSNWEASYDHTGIASFSKV
        CVDPGTLTHVEI+QFDGSNWEASY HTGIASFSK+
Subjt:  CVDPGTLTHVEIKQFDGSNWEASYDHTGIASFSKV

A0A6J1KE30 centromere protein V-like | e-value: 2.3e-72 | identity: 92.59%
Query:  MASALVVHNGGCHCKKVRWRVEAPASVVAWDCNCSDCFMRANTHFIVPSERFKLLGDSNNFISTYTFGTHTAKHTFCKICGITSFYHPRSNPDGVAITFK
        M S LVVHNGGCHCKKVRWRVEAPASVVAWDCNCSDCFMRANTHFIVP ERFKLLGDS NFISTYTFGTHTAKHTFCK+CGITSFYHPRSNPDGVAITFK
Subjt:  MASALVVHNGGCHCKKVRWRVEAPASVVAWDCNCSDCFMRANTHFIVPSERFKLLGDSNNFISTYTFGTHTAKHTFCKICGITSFYHPRSNPDGVAITFK

Query:  CVDPGTLTHVEIKQFDGSNWEASYDHTGIASFSKV
        CVDPGTL +VEI+QFDGSNWEASY HTGIASFSK+
Subjt:  CVDPGTLTHVEIKQFDGSNWEASYDHTGIASFSKV

SwissProt top hits (e-value, % identity, alignment)
A0A0U1RR11 Centromere protein V-like protein 1 | e-value: 2.3e-21 | identity: 40.16%
Query:  LVVHNGGCHCKKVRWRVEAPASVVAWDCNCSDCFMRANTHFIVPSERFKLLGDSNNFISTYTFGTHTAKHTFCKICGITSFYHPRSNPDGVAITFKCVDP
        LV H GGCHC  VR+ V APA +   DC+C  C  + + HF+VP+ RF LL  + + + TY   TH A H+FC  CG+ SF+   S+P    +   C+D 
Subjt:  LVVHNGGCHCKKVRWRVEAPASVVAWDCNCSDCFMRANTHFIVPSERFKLLGDSNNFISTYTFGTHTAKHTFCKICGITSFYHPRSNPDGVAITFKCVDP

Query:  GTLTHVEIKQFDGSN--WEASYDHTGI
        GT+  V I++  G +   EA+ +H  I
Subjt:  GTLTHVEIKQFDGSN--WEASYDHTGI

A0A0U1RRI6 Centromere protein V-like protein 3 | e-value: 2.3e-21 | identity: 40.16%
Query:  LVVHNGGCHCKKVRWRVEAPASVVAWDCNCSDCFMRANTHFIVPSERFKLLGDSNNFISTYTFGTHTAKHTFCKICGITSFYHPRSNPDGVAITFKCVDP
        LV H GGCHC  VR+ V APA +   DC+C  C  + + HF+VP+ RF LL  + + + TY   TH A H+FC  CG+ SF+   S+P    +   C+D 
Subjt:  LVVHNGGCHCKKVRWRVEAPASVVAWDCNCSDCFMRANTHFIVPSERFKLLGDSNNFISTYTFGTHTAKHTFCKICGITSFYHPRSNPDGVAITFKCVDP

Query:  GTLTHVEIKQFDGSN--WEASYDHTGI
        GT+  V I++  G +   EA+ +H  I
Subjt:  GTLTHVEIKQFDGSN--WEASYDHTGI

P0DPI3 Centromere protein V-like protein 2 | e-value: 2.3e-21 | identity: 40.16%
Query:  LVVHNGGCHCKKVRWRVEAPASVVAWDCNCSDCFMRANTHFIVPSERFKLLGDSNNFISTYTFGTHTAKHTFCKICGITSFYHPRSNPDGVAITFKCVDP
        LV H GGCHC  VR+ V APA +   DC+C  C  + + HF+VP+ RF LL  + + + TY   TH A H+FC  CG+ SF+   S+P    +   C+D 
Subjt:  LVVHNGGCHCKKVRWRVEAPASVVAWDCNCSDCFMRANTHFIVPSERFKLLGDSNNFISTYTFGTHTAKHTFCKICGITSFYHPRSNPDGVAITFKCVDP

Query:  GTLTHVEIKQFDGSN--WEASYDHTGI
        GT+  V I++  G +   EA+ +H  I
Subjt:  GTLTHVEIKQFDGSN--WEASYDHTGI

Q7Z7K6 Centromere protein V | e-value: 3.1e-34 | identity: 52.67%
Query:  LVVHNGGCHCKKVRWRVEAPASVVAWDCNCSDCFMRANTHFIVPSERFKLLGDSNNFISTYTFGTHTAKHTFCKICGITSFYHPRSNPDGVAITFKCVDP
        LV H GGCHC  VR+ V A A +  +DCNCS C  + N HFIVP+ RFKLL  + + I+TYTF TH A+HTFCK CG+ SFY PRSNP G  I   C+D 
Subjt:  LVVHNGGCHCKKVRWRVEAPASVVAWDCNCSDCFMRANTHFIVPSERFKLLGDSNNFISTYTFGTHTAKHTFCKICGITSFYHPRSNPDGVAITFKCVDP

Query:  GTLTHVEIKQFDGSNWE-ASYDHTGIASFSK
        GT+  +  ++F+GS+WE A  +H  I + SK
Subjt:  GTLTHVEIKQFDGSNWE-ASYDHTGIASFSK

Q9CXS4 Centromere protein V | e-value: 1.4e-34 | identity: 53.44%
Query:  LVVHNGGCHCKKVRWRVEAPASVVAWDCNCSDCFMRANTHFIVPSERFKLLGDSNNFISTYTFGTHTAKHTFCKICGITSFYHPRSNPDGVAITFKCVDP
        LV H GGCHC  VR+ V A A +  +DCNCS C  + N HFIVP+ RFKLL  + + I+TYTF TH A+HTFCK CG+ SFY PRSNP G  I   C+D 
Subjt:  LVVHNGGCHCKKVRWRVEAPASVVAWDCNCSDCFMRANTHFIVPSERFKLLGDSNNFISTYTFGTHTAKHTFCKICGITSFYHPRSNPDGVAITFKCVDP

Query:  GTLTHVEIKQFDGSNWE-ASYDHTGIASFSK
        GT+  V  ++F+GS+WE A  +H  I + SK
Subjt:  GTLTHVEIKQFDGSNWE-ASYDHTGIASFSK

Arabidopsis top hits (e-value, % identity, alignment)
AT5G16940.1 carbon-sulfur lyases | e-value: 3.8e-59 | identity: 70.9%
Query:  MASALVVHNGGCHCKKVRWRVEAPASVVAWDCNCSDCFMRANTHFIVPSERFKLLGDSNNFISTYTFGTHTAKHTFCKICGITSFYHPRSNPDGVAITFK
        M S L+ H GGCHC K++WRV+A  SV+AW CNCSDC MR N HFIVPS  F+LL DS +FI+TYTFGTHTAKHTFCK+CGITSFY PRSNPDGVA+T K
Subjt:  MASALVVHNGGCHCKKVRWRVEAPASVVAWDCNCSDCFMRANTHFIVPSERFKLLGDSNNFISTYTFGTHTAKHTFCKICGITSFYHPRSNPDGVAITFK

Query:  CVDPGTLTHVEIKQFDGSNWEASYDHTGIASFSK
        CV  GTL H+E+K +DG NWE S+  TGIASFSK
Subjt:  CVDPGTLTHVEIKQFDGSNWEASYDHTGIASFSK

AT5G16940.2 carbon-sulfur lyases | e-value: 3.8e-59 | identity: 70.9%
Query:  MASALVVHNGGCHCKKVRWRVEAPASVVAWDCNCSDCFMRANTHFIVPSERFKLLGDSNNFISTYTFGTHTAKHTFCKICGITSFYHPRSNPDGVAITFK
        M S L+ H GGCHC K++WRV+A  SV+AW CNCSDC MR N HFIVPS  F+LL DS +FI+TYTFGTHTAKHTFCK+CGITSFY PRSNPDGVA+T K
Subjt:  MASALVVHNGGCHCKKVRWRVEAPASVVAWDCNCSDCFMRANTHFIVPSERFKLLGDSNNFISTYTFGTHTAKHTFCKICGITSFYHPRSNPDGVAITFK

Query:  CVDPGTLTHVEIKQFDGSNWEASYDHTGIASFSK
        CV  GTL H+E+K +DG NWE S+  TGIASFSK
Subjt:  CVDPGTLTHVEIKQFDGSNWEASYDHTGIASFSK


Sequences
CDS sequence
ATGGCGTCTGCGTTGGTTGTGCACAATGGTGGATGCCACTGCAAGAAAGTAAGATGGCGAGTTGAAGCACCTGCCAGTGTTGTAGCTTGGGATTGCAACTGTTCTGATTG
CTTCATGAGGGCCAATACACATTTTATCGTACCGTCGGAACGGTTCAAGCTTTTAGGAGATTCTAACAACTTCATTTCTACCTATACCTTTGGTACTCACACTGCAAAAC
ATACCTTTTGCAAAATTTGTGGCATTACCTCATTTTACCATCCACGCTCAAATCCAGATGGAGTTGCAATTACTTTCAAATGTGTTGATCCTGGAACATTGACCCATGTT
GAGATTAAGCAGTTTGATGGGAGCAACTGGGAGGCCTCTTATGATCACACAGGCATTGCTTCATTCTCAAAAGTTTGA
mRNA sequence
ATGGCGTCTGCGTTGGTTGTGCACAATGGTGGATGCCACTGCAAGAAAGTAAGATGGCGAGTTGAAGCACCTGCCAGTGTTGTAGCTTGGGATTGCAACTGTTCTGATTG
CTTCATGAGGGCCAATACACATTTTATCGTACCGTCGGAACGGTTCAAGCTTTTAGGAGATTCTAACAACTTCATTTCTACCTATACCTTTGGTACTCACACTGCAAAAC
ATACCTTTTGCAAAATTTGTGGCATTACCTCATTTTACCATCCACGCTCAAATCCAGATGGAGTTGCAATTACTTTCAAATGTGTTGATCCTGGAACATTGACCCATGTT
GAGATTAAGCAGTTTGATGGGAGCAACTGGGAGGCCTCTTATGATCACACAGGCATTGCTTCATTCTCAAAAGTTTGA
Protein sequence
MASALVVHNGGCHCKKVRWRVEAPASVVAWDCNCSDCFMRANTHFIVPSERFKLLGDSNNFISTYTFGTHTAKHTFCKICGITSFYHPRSNPDGVAITFKCVDPGTLTHV
EIKQFDGSNWEASYDHTGIASFSKV
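
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly; the names `CODON_TABLE`, `CDS`, `PROTEIN`, and `translate` are illustrative only.

```python
# Standard genetic code (NCBI translation table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# CDS of ClCG08G009070, copied from the record (line breaks removed).
CDS = (
    "ATGGCGTCTGCGTTGGTTGTGCACAATGGTGGATGCCACTGCAAGAAAGTAAGATGGCGAGTTGAAGCACCTGCCAGTGTTGTAGCTTGGGATTGCAACTGTTCTGATTG"
    "CTTCATGAGGGCCAATACACATTTTATCGTACCGTCGGAACGGTTCAAGCTTTTAGGAGATTCTAACAACTTCATTTCTACCTATACCTTTGGTACTCACACTGCAAAAC"
    "ATACCTTTTGCAAAATTTGTGGCATTACCTCATTTTACCATCCACGCTCAAATCCAGATGGAGTTGCAATTACTTTCAAATGTGTTGATCCTGGAACATTGACCCATGTT"
    "GAGATTAAGCAGTTTGATGGGAGCAACTGGGAGGCCTCTTATGATCACACAGGCATTGCTTCATTCTCAAAAGTTTGA"
)

# Protein sequence of ClCG08G009070, copied from the record (line breaks removed).
PROTEIN = (
    "MASALVVHNGGCHCKKVRWRVEAPASVVAWDCNCSDCFMRANTHFIVPSERFKLLGDSNNFISTYTFGTHTAKHTFCKICGITSFYHPRSNPDGVAITFKCVDPGTLTHV"
    "EIKQFDGSNWEASYDHTGIASFSKV"
)


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3].upper()]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # Reports whether the translated CDS agrees with the listed protein.
    print(translate(CDS) == PROTEIN)
```

The mRNA sequence in this record is identical to the CDS, so the same check applies to it unchanged.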