; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc06G04000 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc06G04000
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionProtein TonB like
Genome locationClcChr06:4223318..4226106
RNA-Seq ExpressionClc06G04000
SyntenyClc06G04000
Gene Ontology termsGO:0070072 - vacuolar proton-transporting V-type ATPase complex assembly (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR021013 - ATPase, vacuolar ER assembly factor, Vma12


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004138504.1 uncharacterized protein LOC101222938 [Cucumis sativus]5.6e-9390.78Show/hide
Query:  MINDRTHPSGLVLSNTEAIRSFLTSASIDSRLSEELRPIASDLASKNNIPYEPLRAIWFATESCTRPDLLRLLAGSEFVFTSPKPREKSEELKARLKKLA
        MIND   PSGLVLS+TEAIRSFLTSASIDS+LS+ELR IASDL S+ NIPY+ LRAIWFATES TRPDLLRLLAGSEFVFTSPKPREKSEELKARLKKLA
Subjt:  MINDRTHPSGLVLSNTEAIRSFLTSASIDSRLSEELRPIASDLASKNNIPYEPLRAIWFATESCTRPDLLRLLAGSEFVFTSPKPREKSEELKARLKKLA

Query:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQIGFGLHVALIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYDNRSSP---S
        DVAERKAYQELVKDIAPKKPIDEPFSSYKDQ+GFGLHV LIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYDNRSS    S
Subjt:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQIGFGLHVALIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYDNRSSP---S

Query:  KLKKSQ
        KLKKSQ
Subjt:  KLKKSQ

XP_008458236.1 PREDICTED: uncharacterized protein LOC103497719 [Cucumis melo]7.4e-9390.78Show/hide
Query:  MINDRTHPSGLVLSNTEAIRSFLTSASIDSRLSEELRPIASDLASKNNIPYEPLRAIWFATESCTRPDLLRLLAGSEFVFTSPKPREKSEELKARLKKLA
        MIND   PSGLVLS+TEAIRSFLTSASIDS+LS+ELR IASDLAS+ NIPY+ LRAIWFATES TRPDLLRLLAGSEFVFTSPKPREKSEELKARLKKLA
Subjt:  MINDRTHPSGLVLSNTEAIRSFLTSASIDSRLSEELRPIASDLASKNNIPYEPLRAIWFATESCTRPDLLRLLAGSEFVFTSPKPREKSEELKARLKKLA

Query:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQIGFGLHVALIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYDNRSSP---S
        DVAERKAYQELVKDIAPKKPIDEPFSSYKDQ+GFGLHV LIMFTGYL+GYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYDNRSS    S
Subjt:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQIGFGLHVALIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYDNRSSP---S

Query:  KLKKSQ
        KLKKSQ
Subjt:  KLKKSQ

XP_022959366.1 uncharacterized protein LOC111460360 [Cucurbita moschata]3.3e-9388.35Show/hide
Query:  MINDRTHPSGLVLSNTEAIRSFLTSASIDSRLSEELRPIASDLASKNNIPYEPLRAIWFATESCTRPDLLRLLAGSEFVFTSPKPREKSEELKARLKKLA
        MIND+ HPSGLV+++TEAIRSFLTSASIDSR+SEELR +AS+LAS++N+PY+PLRAIWFATES TRPDLLRLL GSEFVFTSPKPREKSEELKARLKKLA
Subjt:  MINDRTHPSGLVLSNTEAIRSFLTSASIDSRLSEELRPIASDLASKNNIPYEPLRAIWFATESCTRPDLLRLLAGSEFVFTSPKPREKSEELKARLKKLA

Query:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQIGFGLHVALIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYDNRSSP---S
        DVAERKAYQELVKDIAPKKPIDEPFSSYKDQ+GFGLHV LIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIR+S+YDNRSS    S
Subjt:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQIGFGLHVALIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYDNRSSP---S

Query:  KLKKSQ
        KLKK+Q
Subjt:  KLKKSQ

XP_023548168.1 uncharacterized protein LOC111806888 [Cucurbita pepo subsp. pepo]4.3e-9387.86Show/hide
Query:  MINDRTHPSGLVLSNTEAIRSFLTSASIDSRLSEELRPIASDLASKNNIPYEPLRAIWFATESCTRPDLLRLLAGSEFVFTSPKPREKSEELKARLKKLA
        MIND+ HPSGLV+++TEAIRSFLTSASIDSR+SEELR +AS+LAS++N+PY+PLRAIWFATES TRPDLLRLL GSEFVFTSPKPREKSEELKARLKKLA
Subjt:  MINDRTHPSGLVLSNTEAIRSFLTSASIDSRLSEELRPIASDLASKNNIPYEPLRAIWFATESCTRPDLLRLLAGSEFVFTSPKPREKSEELKARLKKLA

Query:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQIGFGLHVALIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYDNRSSP---S
        DVAERKAYQELVKDIAPKKP+DEPFSSYKDQ+GFGLHV LIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIR+S+YDNRSS    S
Subjt:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQIGFGLHVALIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYDNRSSP---S

Query:  KLKKSQ
        KLKK+Q
Subjt:  KLKKSQ

XP_038876067.1 uncharacterized protein LOC120068388 [Benincasa hispida]1.3e-9491.26Show/hide
Query:  MINDRTHPSGLVLSNTEAIRSFLTSASIDSRLSEELRPIASDLASKNNIPYEPLRAIWFATESCTRPDLLRLLAGSEFVFTSPKPREKSEELKARLKKLA
        MIND THPSGLVLSNTEAIRSFLTSASIDS+LS+ELR IASDLAS+ NIPY+PLRAIWFA ES  RPDLLRLLAGSEFV TSPKPREKSEELKARLKKLA
Subjt:  MINDRTHPSGLVLSNTEAIRSFLTSASIDSRLSEELRPIASDLASKNNIPYEPLRAIWFATESCTRPDLLRLLAGSEFVFTSPKPREKSEELKARLKKLA

Query:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQIGFGLHVALIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYDNRSSP---S
        DVAERKAYQELVKDIAPKKPI+EPFSSYKDQ+GFGLHV LIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYDNRSS    S
Subjt:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQIGFGLHVALIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYDNRSSP---S

Query:  KLKKSQ
        KLKKSQ
Subjt:  KLKKSQ

TrEMBL top hitse value%identityAlignment
A0A0A0K7F5 Uncharacterized protein2.7e-9390.78Show/hide
Query:  MINDRTHPSGLVLSNTEAIRSFLTSASIDSRLSEELRPIASDLASKNNIPYEPLRAIWFATESCTRPDLLRLLAGSEFVFTSPKPREKSEELKARLKKLA
        MIND   PSGLVLS+TEAIRSFLTSASIDS+LS+ELR IASDL S+ NIPY+ LRAIWFATES TRPDLLRLLAGSEFVFTSPKPREKSEELKARLKKLA
Subjt:  MINDRTHPSGLVLSNTEAIRSFLTSASIDSRLSEELRPIASDLASKNNIPYEPLRAIWFATESCTRPDLLRLLAGSEFVFTSPKPREKSEELKARLKKLA

Query:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQIGFGLHVALIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYDNRSSP---S
        DVAERKAYQELVKDIAPKKPIDEPFSSYKDQ+GFGLHV LIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYDNRSS    S
Subjt:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQIGFGLHVALIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYDNRSSP---S

Query:  KLKKSQ
        KLKKSQ
Subjt:  KLKKSQ

A0A1S3C6X8 uncharacterized protein LOC1034977193.6e-9390.78Show/hide
Query:  MINDRTHPSGLVLSNTEAIRSFLTSASIDSRLSEELRPIASDLASKNNIPYEPLRAIWFATESCTRPDLLRLLAGSEFVFTSPKPREKSEELKARLKKLA
        MIND   PSGLVLS+TEAIRSFLTSASIDS+LS+ELR IASDLAS+ NIPY+ LRAIWFATES TRPDLLRLLAGSEFVFTSPKPREKSEELKARLKKLA
Subjt:  MINDRTHPSGLVLSNTEAIRSFLTSASIDSRLSEELRPIASDLASKNNIPYEPLRAIWFATESCTRPDLLRLLAGSEFVFTSPKPREKSEELKARLKKLA

Query:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQIGFGLHVALIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYDNRSSP---S
        DVAERKAYQELVKDIAPKKPIDEPFSSYKDQ+GFGLHV LIMFTGYL+GYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYDNRSS    S
Subjt:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQIGFGLHVALIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYDNRSSP---S

Query:  KLKKSQ
        KLKKSQ
Subjt:  KLKKSQ

A0A5D3BV73 Uncharacterized protein3.6e-9390.78Show/hide
Query:  MINDRTHPSGLVLSNTEAIRSFLTSASIDSRLSEELRPIASDLASKNNIPYEPLRAIWFATESCTRPDLLRLLAGSEFVFTSPKPREKSEELKARLKKLA
        MIND   PSGLVLS+TEAIRSFLTSASIDS+LS+ELR IASDLAS+ NIPY+ LRAIWFATES TRPDLLRLLAGSEFVFTSPKPREKSEELKARLKKLA
Subjt:  MINDRTHPSGLVLSNTEAIRSFLTSASIDSRLSEELRPIASDLASKNNIPYEPLRAIWFATESCTRPDLLRLLAGSEFVFTSPKPREKSEELKARLKKLA

Query:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQIGFGLHVALIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYDNRSSP---S
        DVAERKAYQELVKDIAPKKPIDEPFSSYKDQ+GFGLHV LIMFTGYL+GYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYDNRSS    S
Subjt:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQIGFGLHVALIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYDNRSSP---S

Query:  KLKKSQ
        KLKKSQ
Subjt:  KLKKSQ

A0A6J1H4N6 uncharacterized protein LOC1114603601.6e-9388.35Show/hide
Query:  MINDRTHPSGLVLSNTEAIRSFLTSASIDSRLSEELRPIASDLASKNNIPYEPLRAIWFATESCTRPDLLRLLAGSEFVFTSPKPREKSEELKARLKKLA
        MIND+ HPSGLV+++TEAIRSFLTSASIDSR+SEELR +AS+LAS++N+PY+PLRAIWFATES TRPDLLRLL GSEFVFTSPKPREKSEELKARLKKLA
Subjt:  MINDRTHPSGLVLSNTEAIRSFLTSASIDSRLSEELRPIASDLASKNNIPYEPLRAIWFATESCTRPDLLRLLAGSEFVFTSPKPREKSEELKARLKKLA

Query:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQIGFGLHVALIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYDNRSSP---S
        DVAERKAYQELVKDIAPKKPIDEPFSSYKDQ+GFGLHV LIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIR+S+YDNRSS    S
Subjt:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQIGFGLHVALIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYDNRSSP---S

Query:  KLKKSQ
        KLKK+Q
Subjt:  KLKKSQ

A0A6J1L0S3 uncharacterized protein LOC1114993223.9e-9286.89Show/hide
Query:  MINDRTHPSGLVLSNTEAIRSFLTSASIDSRLSEELRPIASDLASKNNIPYEPLRAIWFATESCTRPDLLRLLAGSEFVFTSPKPREKSEELKARLKKLA
        MIND+ HPSGLV+++TEAIR+FLTSASIDSR+SEELR +AS+LAS++N+PY+PLRAIWFATES TRPDLLRLL GSEFVFTSPKPREKSEELKARLK LA
Subjt:  MINDRTHPSGLVLSNTEAIRSFLTSASIDSRLSEELRPIASDLASKNNIPYEPLRAIWFATESCTRPDLLRLLAGSEFVFTSPKPREKSEELKARLKKLA

Query:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQIGFGLHVALIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYDNRSSP---S
        DVAERKAYQELVKDIAPKKPIDEPFSSYKDQ+GFGLHV LIMFTGYLVGY LFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIR+S+YDNRSS    S
Subjt:  DVAERKAYQELVKDIAPKKPIDEPFSSYKDQIGFGLHVALIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYDNRSSP---S

Query:  KLKKSQ
        KLKK+Q
Subjt:  KLKKSQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G52980.1 CONTAINS InterPro DOMAIN/s: ATPase, vacuolar ER assembly factor, Vma12 (InterPro:IPR021013); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink).3.0e-6868.34Show/hide
Query:  DRTHPSGLVLSNTEAIRSFLTSASIDSRLSEELRPIASDLASKNNIPYEPLRAIWFATESCTRPDLLRLLAGSEFVFTSPKPREKSEELKARLKKLADVA
        D  + SGL+LS TE +RSFL  AS D RLS+ELR IASDL SKN IPY+ LRAIW  ++  TRPDLL L +GS FVFTSPKPREKSEELK RL KL ++A
Subjt:  DRTHPSGLVLSNTEAIRSFLTSASIDSRLSEELRPIASDLASKNNIPYEPLRAIWFATESCTRPDLLRLLAGSEFVFTSPKPREKSEELKARLKKLADVA

Query:  ERKAYQELVKDIAPKKPIDEPFSSYKDQIGFGLHVALIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYDNRSSPSKLKKS
        ERK Y ELVKDI PKK ++EPFSSYKDQ+GFGLHV L MFTGYLVGYA FRALF  +P +SAAGGILGLV  MLVETLLFII++S  D   S     +S
Subjt:  ERKAYQELVKDIAPKKPIDEPFSSYKDQIGFGLHVALIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYDNRSSPSKLKKS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCAACGACCGCACACATCCTTCTGGTCTTGTTCTATCTAACACTGAAGCAATTCGCTCATTTCTCACCTCGGCGTCCATAGACTCACGACTTTCTGAGGAACTCCG
GCCGATTGCTTCAGATCTCGCTTCAAAAAACAACATTCCGTATGAGCCTCTGAGAGCTATCTGGTTTGCTACGGAATCGTGCACACGGCCGGATTTGCTCCGTCTTTTGG
CTGGATCGGAGTTCGTCTTTACAAGCCCTAAACCCAGGGAGAAGAGTGAGGAGTTAAAGGCTAGACTGAAGAAGCTTGCAGATGTAGCAGAGAGGAAGGCCTATCAGGAA
CTGGTGAAGGATATTGCACCTAAGAAACCAATTGATGAGCCTTTCTCTTCCTACAAGGATCAGATCGGATTCGGTTTACATGTCGCGTTGATAATGTTTACTGGCTATCT
TGTTGGATATGCATTATTCCGAGCATTGTTTAGGCATGATCCAATCATGAGTGCTGCTGGAGGTATCCTCGGATTAGTTTTTGGCATGCTCGTAGAAACACTTCTTTTCA
TTATTCGATCGTCCAATTATGATAATCGGTCTTCCCCTTCTAAGCTAAAGAAGAGTCAGTAG
mRNA sequenceShow/hide mRNA sequence
AGACTTTTTATTTATTTTTTTGATAATATTTAGAGTGAGAATTTATTGATTTTCTTTCCCCTTGAAAATTTAGCCCGATTCTATTAAAACGTTGGACTCCATTTTAATTG
CTTTGAGCTCTCGTCTTCCACCGGTCTCGTCGGCGGCAAGGATATCGCTGTTCTCGAACCGGAAAACACATATTCTACGATGATCAACGACCGCACACATCCTTCTGGTC
TTGTTCTATCTAACACTGAAGCAATTCGCTCATTTCTCACCTCGGCGTCCATAGACTCACGACTTTCTGAGGAACTCCGGCCGATTGCTTCAGATCTCGCTTCAAAAAAC
AACATTCCGTATGAGCCTCTGAGAGCTATCTGGTTTGCTACGGAATCGTGCACACGGCCGGATTTGCTCCGTCTTTTGGCTGGATCGGAGTTCGTCTTTACAAGCCCTAA
ACCCAGGGAGAAGAGTGAGGAGTTAAAGGCTAGACTGAAGAAGCTTGCAGATGTAGCAGAGAGGAAGGCCTATCAGGAACTGGTGAAGGATATTGCACCTAAGAAACCAA
TTGATGAGCCTTTCTCTTCCTACAAGGATCAGATCGGATTCGGTTTACATGTCGCGTTGATAATGTTTACTGGCTATCTTGTTGGATATGCATTATTCCGAGCATTGTTT
AGGCATGATCCAATCATGAGTGCTGCTGGAGGTATCCTCGGATTAGTTTTTGGCATGCTCGTAGAAACACTTCTTTTCATTATTCGATCGTCCAATTATGATAATCGGTC
TTCCCCTTCTAAGCTAAAGAAGAGTCAGTAGAGTTACTTCTCTTGAGTTTAGGGTTTAGGGTTTATGATAATCGGTCTTCCCGTTCTAAGCTGAAGGTAGCATTTTGGTG
TCCATCTGGTCGGACGTTACAACACCCAACAAGTTCTCCCTTGTTTAAAGAGTGTCTACTATATATATGACTTCGAGCTACAGTATCAAGACACGTTTTTCGGTTCGGTT
CGGTTGCTTTCATCTTAAACTTCAACACCGGATTTTGATATCCTCCTTCCTTCCAGATACGCACAAGTCATGTGGAGGTATTATGCTGTGGTTGGATTGGGATCCAACTT
GGAATTTTGATGTATACTGTTTTGTCAAGTTAGCTATTACAGTTTTATACAATTTGGATATAGAAGACAATAGGGCAATCATAATATGACAATCCCATTGAATTATTTTT
CAATGACGTGACATTCATTTTCTCTAAAATGTTATAGAAATATTTATTATGAGTTATTAGAGTTTCATTTAGTTGAACTATAAATTTGGTCACTTTTTTTTTTTTT
Protein sequenceShow/hide protein sequence
MINDRTHPSGLVLSNTEAIRSFLTSASIDSRLSEELRPIASDLASKNNIPYEPLRAIWFATESCTRPDLLRLLAGSEFVFTSPKPREKSEELKARLKKLADVAERKAYQE
LVKDIAPKKPIDEPFSSYKDQIGFGLHVALIMFTGYLVGYALFRALFRHDPIMSAAGGILGLVFGMLVETLLFIIRSSNYDNRSSPSKLKKSQ