CuGenDBv2

CmoCh19G004680 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh19G004680
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: DUF4219 domain-containing protein
Genome location: Cmo_Chr19:5621121..5621432
RNA-Seq Expression: CmoCh19G004680
Synteny: CmoCh19G004680
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR025314 - Domain of unknown function DUF4219


Homology
GenBank top hits (e-value, % identity, alignment)
XP_022927156.1 uncharacterized protein LOC111434088 [Cucurbita moschata]; e-value: 1.3e-45; identity: 89.32%
Query:  MGGESKFSAVAPPVFDGDNYQMWAVRMKTYLEVLDLWEAIEEDYKVPPLPANPTVAQIKVQKKKKTRKSKAKACLFAVVSQMIFMRIMSLKTAKEIWDYL
        MGGES FSAVAPP+FDGDNYQMWAVRM+TYLE LDLWE IEEDY+VPPLPANPTVAQIK+QK+KKTRKSKAKACLFA VSQMIFMRIMSLKTAK+IWDYL
Subjt:  MGGESKFSAVAPPVFDGDNYQMWAVRMKTYLEVLDLWEAIEEDYKVPPLPANPTVAQIKVQKKKKTRKSKAKACLFAVVSQMIFMRIMSLKTAKEIWDYL

Query:  KAD
        KA+
Subjt:  KAD

XP_022931554.1 uncharacterized protein LOC111437700 [Cucurbita moschata]; e-value: 4.2e-44; identity: 89.11%
Query:  GESKFSAVAPPVFDGDNYQMWAVRMKTYLEVLDLWEAIEEDYKVPPLPANPTVAQIKVQKKKKTRKSKAKACLFAVVSQMIFMRIMSLKTAKEIWDYLKA
        GES FSAVAPPVFDGDNYQMWAVRM+TYLE LDLWEAIEEDY+VPPLP NPTVAQI++QK+KKTRKSKAKACLFA VS+MIFMRIMSLKTAKEIWDYLKA
Subjt:  GESKFSAVAPPVFDGDNYQMWAVRMKTYLEVLDLWEAIEEDYKVPPLPANPTVAQIKVQKKKKTRKSKAKACLFAVVSQMIFMRIMSLKTAKEIWDYLKA

Query:  D
        +
Subjt:  D

XP_022931985.1 uncharacterized protein LOC111438318 [Cucurbita moschata]; e-value: 4.6e-43; identity: 85.44%
Query:  MGGESKFSAVAPPVFDGDNYQMWAVRMKTYLEVLDLWEAIEEDYKVPPLPANPTVAQIKVQKKKKTRKSKAKACLFAVVSQMIFMRIMSLKTAKEIWDYL
        M GES FSAVAPP+FDGDNYQMW VR +TYLE LDLWEAIEEDY+VPPLPANPTVAQIK++K+KKTRKSKAKACLFA VSQMIF+RIMSLKTAK+IWDYL
Subjt:  MGGESKFSAVAPPVFDGDNYQMWAVRMKTYLEVLDLWEAIEEDYKVPPLPANPTVAQIKVQKKKKTRKSKAKACLFAVVSQMIFMRIMSLKTAKEIWDYL

Query:  KAD
        KA+
Subjt:  KAD

XP_022945777.1 uncharacterized protein LOC111449922 [Cucurbita moschata]; e-value: 1.1e-44; identity: 90.1%
Query:  GESKFSAVAPPVFDGDNYQMWAVRMKTYLEVLDLWEAIEEDYKVPPLPANPTVAQIKVQKKKKTRKSKAKACLFAVVSQMIFMRIMSLKTAKEIWDYLKA
        GES FSA+APPVFDGDNYQMWAVRM+TY E LDLWEAIEEDY+VPPLPANPTVAQIK+Q +KKTRKSKAKACLFAVVSQMIFMRIMSLKTAKEIWDYLKA
Subjt:  GESKFSAVAPPVFDGDNYQMWAVRMKTYLEVLDLWEAIEEDYKVPPLPANPTVAQIKVQKKKKTRKSKAKACLFAVVSQMIFMRIMSLKTAKEIWDYLKA

Query:  D
        +
Subjt:  D

XP_022959005.1 uncharacterized protein LOC111460124 [Cucurbita moschata]; e-value: 1.4e-44; identity: 90.1%
Query:  GESKFSAVAPPVFDGDNYQMWAVRMKTYLEVLDLWEAIEEDYKVPPLPANPTVAQIKVQKKKKTRKSKAKACLFAVVSQMIFMRIMSLKTAKEIWDYLKA
        GES FSAVAPPVFDGDNYQMWAVRM+TYLE LDLWEAIEEDY+VPPLPANPTVAQIK+QK+KKTRKSKAKACLFA VS+MIFMRIMSLKT KEIWDYLKA
Subjt:  GESKFSAVAPPVFDGDNYQMWAVRMKTYLEVLDLWEAIEEDYKVPPLPANPTVAQIKVQKKKKTRKSKAKACLFAVVSQMIFMRIMSLKTAKEIWDYLKA

Query:  D
        +
Subjt:  D

TrEMBL top hits (e-value, % identity, alignment)
A0A6J1EGX0 uncharacterized protein LOC111434088; e-value: 6.3e-46; identity: 89.32%
Query:  MGGESKFSAVAPPVFDGDNYQMWAVRMKTYLEVLDLWEAIEEDYKVPPLPANPTVAQIKVQKKKKTRKSKAKACLFAVVSQMIFMRIMSLKTAKEIWDYL
        MGGES FSAVAPP+FDGDNYQMWAVRM+TYLE LDLWE IEEDY+VPPLPANPTVAQIK+QK+KKTRKSKAKACLFA VSQMIFMRIMSLKTAK+IWDYL
Subjt:  MGGESKFSAVAPPVFDGDNYQMWAVRMKTYLEVLDLWEAIEEDYKVPPLPANPTVAQIKVQKKKKTRKSKAKACLFAVVSQMIFMRIMSLKTAKEIWDYL

Query:  KAD
        KA+
Subjt:  KAD

A0A6J1EUJ8 uncharacterized protein LOC111437700; e-value: 2.0e-44; identity: 89.11%
Query:  GESKFSAVAPPVFDGDNYQMWAVRMKTYLEVLDLWEAIEEDYKVPPLPANPTVAQIKVQKKKKTRKSKAKACLFAVVSQMIFMRIMSLKTAKEIWDYLKA
        GES FSAVAPPVFDGDNYQMWAVRM+TYLE LDLWEAIEEDY+VPPLP NPTVAQI++QK+KKTRKSKAKACLFA VS+MIFMRIMSLKTAKEIWDYLKA
Subjt:  GESKFSAVAPPVFDGDNYQMWAVRMKTYLEVLDLWEAIEEDYKVPPLPANPTVAQIKVQKKKKTRKSKAKACLFAVVSQMIFMRIMSLKTAKEIWDYLKA

Query:  D
        +
Subjt:  D

A0A6J1F0Y4 uncharacterized protein LOC111438318; e-value: 2.2e-43; identity: 85.44%
Query:  MGGESKFSAVAPPVFDGDNYQMWAVRMKTYLEVLDLWEAIEEDYKVPPLPANPTVAQIKVQKKKKTRKSKAKACLFAVVSQMIFMRIMSLKTAKEIWDYL
        M GES FSAVAPP+FDGDNYQMW VR +TYLE LDLWEAIEEDY+VPPLPANPTVAQIK++K+KKTRKSKAKACLFA VSQMIF+RIMSLKTAK+IWDYL
Subjt:  MGGESKFSAVAPPVFDGDNYQMWAVRMKTYLEVLDLWEAIEEDYKVPPLPANPTVAQIKVQKKKKTRKSKAKACLFAVVSQMIFMRIMSLKTAKEIWDYL

Query:  KAD
        KA+
Subjt:  KAD

A0A6J1G1V7 uncharacterized protein LOC111449922; e-value: 5.3e-45; identity: 90.1%
Query:  GESKFSAVAPPVFDGDNYQMWAVRMKTYLEVLDLWEAIEEDYKVPPLPANPTVAQIKVQKKKKTRKSKAKACLFAVVSQMIFMRIMSLKTAKEIWDYLKA
        GES FSA+APPVFDGDNYQMWAVRM+TY E LDLWEAIEEDY+VPPLPANPTVAQIK+Q +KKTRKSKAKACLFAVVSQMIFMRIMSLKTAKEIWDYLKA
Subjt:  GESKFSAVAPPVFDGDNYQMWAVRMKTYLEVLDLWEAIEEDYKVPPLPANPTVAQIKVQKKKKTRKSKAKACLFAVVSQMIFMRIMSLKTAKEIWDYLKA

Query:  D
        +
Subjt:  D

A0A6J1H529 uncharacterized protein LOC111460124; e-value: 7.0e-45; identity: 90.1%
Query:  GESKFSAVAPPVFDGDNYQMWAVRMKTYLEVLDLWEAIEEDYKVPPLPANPTVAQIKVQKKKKTRKSKAKACLFAVVSQMIFMRIMSLKTAKEIWDYLKA
        GES FSAVAPPVFDGDNYQMWAVRM+TYLE LDLWEAIEEDY+VPPLPANPTVAQIK+QK+KKTRKSKAKACLFA VS+MIFMRIMSLKT KEIWDYLKA
Subjt:  GESKFSAVAPPVFDGDNYQMWAVRMKTYLEVLDLWEAIEEDYKVPPLPANPTVAQIKVQKKKKTRKSKAKACLFAVVSQMIFMRIMSLKTAKEIWDYLKA

Query:  D
        +
Subjt:  D

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT3G20980.1 Gag-Pol-related retrotransposon family protein; e-value: 9.1e-05; identity: 26.32%
Query:  VFDGDNYQMWAVRMKTYLEVLDLWEAIE-----EDYKVPPLPANPTVAQIKVQKKKKTRKSKAKACLFAVVSQMIFMRIMSLKTAKEIWDYLKAD
        V D  NY++WA  MKT L    LW+ ++     +  K+P L        + + +    + +KA   L + +   +F + +   +AK +WD +K D
Subjt:  VFDGDNYQMWAVRMKTYLEVLDLWEAIE-----EDYKVPPLPANPTVAQIKVQKKKKTRKSKAKACLFAVVSQMIFMRIMSLKTAKEIWDYLKAD


Sequences
CDS sequence
ATGGGAGGCGAATCAAAATTTTCAGCTGTTGCACCACCCGTCTTCGATGGAGACAATTATCAAATGTGGGCAGTTCGTATGAAGACTTATCTAGAGGTTTTGGATCTTTG
GGAAGCAATAGAAGAGGATTACAAGGTCCCTCCACTTCCAGCAAATCCTACTGTAGCACAAATCAAAGTACAGAAGAAAAAGAAGACAAGGAAATCAAAGGCAAAAGCTT
GCCTATTTGCCGTTGTATCTCAAATGATCTTCATGCGAATAATGTCCCTCAAAACAGCAAAGGAAATCTGGGATTATCTCAAGGCCGATTAA
mRNA sequence
ATGGGAGGCGAATCAAAATTTTCAGCTGTTGCACCACCCGTCTTCGATGGAGACAATTATCAAATGTGGGCAGTTCGTATGAAGACTTATCTAGAGGTTTTGGATCTTTG
GGAAGCAATAGAAGAGGATTACAAGGTCCCTCCACTTCCAGCAAATCCTACTGTAGCACAAATCAAAGTACAGAAGAAAAAGAAGACAAGGAAATCAAAGGCAAAAGCTT
GCCTATTTGCCGTTGTATCTCAAATGATCTTCATGCGAATAATGTCCCTCAAAACAGCAAAGGAAATCTGGGATTATCTCAAGGCCGATTAA
Protein sequence
MGGESKFSAVAPPVFDGDNYQMWAVRMKTYLEVLDLWEAIEEDYKVPPLPANPTVAQIKVQKKKKTRKSKAKACLFAVVSQMIFMRIMSLKTAKEIWDYLKAD
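
Consistency check (illustrative)
The genome location and protein sequence in this record can be cross-checked with a little arithmetic. The sketch below is not part of the database record. It assumes the coordinates are 1-based and inclusive, and that the gene is a single exon with no UTRs, which the identical CDS and mRNA sequences suggest but do not prove. The helper names `span_length` and `expected_cds_length` are hypothetical.

```python
import re


def span_length(location: str) -> int:
    """Inclusive length of a 'chrom:start..end' location string.

    Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
    """
    match = re.fullmatch(r"[^:]+:(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    start, end = (int(group) for group in match.groups())
    return abs(end - start) + 1


def expected_cds_length(protein: str) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return 3 * (len(protein) + 1)


# Values copied from the record above.
location = "Cmo_Chr19:5621121..5621432"
protein = "MGGESKFSAVAPPVFDGDNYQMWAVRMKTYLEVLDLWEAIEEDYKVPPLPANPTVAQIKVQKKKKTRKSKAKACLFAVVSQMIFMRIMSLKTAKEIWDYLKAD"

# Print both lengths so they can be compared side by side.
print(span_length(location), expected_cds_length(protein))
```

If the two printed numbers agree, the genomic span is exactly long enough to hold the coding sequence, which is consistent with an intronless gene model.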