; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh02G009870 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh02G009870
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionGag protease polyprotein
Genome locationCmo_Chr02:6077416..6078030
RNA-Seq ExpressionCmoCh02G009870
SyntenyCmoCh02G009870
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016740 - transferase activity (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022927095.1 uncharacterized protein LOC111434030 [Cucurbita moschata]7.2e-7274.36Show/hide
Query:  MSEKFVDLAHLEKEPLETILSVSTPAHELLMATHRVKGGNVIVSGRVIKAVLIVLSMQDFNVILGMDWLGENHALIDCETQIVTLRLSSGDSFTYEGATS
        +SE+FV+LA LEKE LE+ LSVSTPAH+LL+ATHRVKGG V ++GRVI+A LIVLSMQDF+VILGMDWLGEN  LIDCET+IVTLRL SGDSFTY+GATS
Subjt:  MSEKFVDLAHLEKEPLETILSVSTPAHELLMATHRVKGGNVIVSGRVIKAVLIVLSMQDFNVILGMDWLGENHALIDCETQIVTLRLSSGDSFTYEGATS

Query:  KRTLSVVTALKARKMICGGASAFLVSVTLDCSNEQTVSSVHIVREFTDVFPKDLPSLPPVREVDFGINLELGTAPISKAPYRMARAELRELEEQL
        K   SV+T+L+A+K+I  GA AFL  VTLD SN+Q  SSVHIVREF DVFP+DL  LPP +EV+FGI+LE GT PISKAPYRMA AELREL+EQL
Subjt:  KRTLSVVTALKARKMICGGASAFLVSVTLDCSNEQTVSSVHIVREFTDVFPKDLPSLPPVREVDFGINLELGTAPISKAPYRMARAELRELEEQL

XP_022931758.1 uncharacterized protein LOC111438026 [Cucurbita moschata]3.7e-6872.82Show/hide
Query:  MSEKFVDLAHLEKEPLETILSVSTPAHELLMATHRVKGGNVIVSGRVIKAVLIVLSMQDFNVILGMDWLGENHALIDCETQIVTLRLSSGDSFTYEGATS
        +SE+FV+LA LEKE LE  LSVSTPAHELL+ATHRVKGG V ++GRVI+A LIVL MQDF+VILGMDWLG N  LIDCET+IVTLRL SGD FTY+GATS
Subjt:  MSEKFVDLAHLEKEPLETILSVSTPAHELLMATHRVKGGNVIVSGRVIKAVLIVLSMQDFNVILGMDWLGENHALIDCETQIVTLRLSSGDSFTYEGATS

Query:  KRTLSVVTALKARKMICGGASAFLVSVTLDCSNEQTVSSVHIVREFTDVFPKDLPSLPPVREVDFGINLELGTAPISKAPYRMARAELRELEEQL
        K   SV+T+L+A+K+I  GA AFL SVTLD SN+Q  SSVHIVREF DVFP+DLP LPP REVDFGI+LE GT PISKAP      ELREL+EQL
Subjt:  KRTLSVVTALKARKMICGGASAFLVSVTLDCSNEQTVSSVHIVREFTDVFPKDLPSLPPVREVDFGINLELGTAPISKAPYRMARAELRELEEQL

XP_022933065.1 uncharacterized protein LOC111439772 [Cucurbita moschata]6.5e-7374.62Show/hide
Query:  DLAHLEKEPLETILSVSTPAHELLMATHRVKGGNVIVSGRVIKAVLIVLSMQDFNVILGMDWLGENHALIDCETQIVTLRLSSGDSFTYEGATSKRTLSV
        D    EKE LE  LSVSTP HELL+ATH++KGG V +SGRVI+A LIVLSMQDF+VILGMDWLGEN  LIDCET+IVTLRL S DSFTY+G TSK   SV
Subjt:  DLAHLEKEPLETILSVSTPAHELLMATHRVKGGNVIVSGRVIKAVLIVLSMQDFNVILGMDWLGENHALIDCETQIVTLRLSSGDSFTYEGATSKRTLSV

Query:  VTALKARKMICGGASAFLVSVTLDCSNEQTVSSVHIVREFTDVFPKDLPSLPPVREVDFGINLELGTAPISKAPYRMARAELRELEEQLQELLDKGF
        +T L+A+K+I  GASAFL SVTLD SN+Q  SSVHI+REF DVFP+DLP LP +REVDFGI+LE GT PISKAPYRMA AELREL+EQLQELLDKGF
Subjt:  VTALKARKMICGGASAFLVSVTLDCSNEQTVSSVHIVREFTDVFPKDLPSLPPVREVDFGINLELGTAPISKAPYRMARAELRELEEQLQELLDKGF

XP_022957288.1 uncharacterized protein LOC111458730 [Cucurbita moschata]1.0e-8183.84Show/hide
Query:  DLAHLEKEPLETILSVSTPAHELLMATHRVKGGNVIVSGRVIKAVLIVLSMQDFNVILGMDWLGENHALIDCETQIVTLRLSSGDSFTYEGATSKRTLSV
        D    EKEPLETILSVSTPAHELLMATHRVKGG+V VSGRVI+A LIVLSM DF+VILGMDWLGEN ALIDCET+IVTLRL SGDSFTY+G   KRT SV
Subjt:  DLAHLEKEPLETILSVSTPAHELLMATHRVKGGNVIVSGRVIKAVLIVLSMQDFNVILGMDWLGENHALIDCETQIVTLRLSSGDSFTYEGATSKRTLSV

Query:  VTALKARKMICGGASAFLVSVTLDCSNEQTVSSVHIVREFTDVFPKDLPSLPPVREVDFGINLELGTAPISKAPYRMARAELRELEEQLQELLDKGFI
        VTALKA+KMI  GASAFL SVTLD  N Q VSSVHIVREF DVFP+DLPSLPPVREVDFGI+LE GTAPISKAPYRMA AELREL+EQLQELLDKGFI
Subjt:  VTALKARKMICGGASAFLVSVTLDCSNEQTVSSVHIVREFTDVFPKDLPSLPPVREVDFGINLELGTAPISKAPYRMARAELRELEEQLQELLDKGFI

XP_022958237.1 uncharacterized protein LOC111459523 isoform X1 [Cucurbita moschata]2.6e-106100Show/hide
Query:  MSEKFVDLAHLEKEPLETILSVSTPAHELLMATHRVKGGNVIVSGRVIKAVLIVLSMQDFNVILGMDWLGENHALIDCETQIVTLRLSSGDSFTYEGATS
        MSEKFVDLAHLEKEPLETILSVSTPAHELLMATHRVKGGNVIVSGRVIKAVLIVLSMQDFNVILGMDWLGENHALIDCETQIVTLRLSSGDSFTYEGATS
Subjt:  MSEKFVDLAHLEKEPLETILSVSTPAHELLMATHRVKGGNVIVSGRVIKAVLIVLSMQDFNVILGMDWLGENHALIDCETQIVTLRLSSGDSFTYEGATS

Query:  KRTLSVVTALKARKMICGGASAFLVSVTLDCSNEQTVSSVHIVREFTDVFPKDLPSLPPVREVDFGINLELGTAPISKAPYRMARAELRELEEQLQELLD
        KRTLSVVTALKARKMICGGASAFLVSVTLDCSNEQTVSSVHIVREFTDVFPKDLPSLPPVREVDFGINLELGTAPISKAPYRMARAELRELEEQLQELLD
Subjt:  KRTLSVVTALKARKMICGGASAFLVSVTLDCSNEQTVSSVHIVREFTDVFPKDLPSLPPVREVDFGINLELGTAPISKAPYRMARAELRELEEQLQELLD

Query:  KGFI
        KGFI
Subjt:  KGFI

TrEMBL top hitse value%identityAlignment
A0A6J1EMX1 uncharacterized protein LOC1114340303.5e-7274.36Show/hide
Query:  MSEKFVDLAHLEKEPLETILSVSTPAHELLMATHRVKGGNVIVSGRVIKAVLIVLSMQDFNVILGMDWLGENHALIDCETQIVTLRLSSGDSFTYEGATS
        +SE+FV+LA LEKE LE+ LSVSTPAH+LL+ATHRVKGG V ++GRVI+A LIVLSMQDF+VILGMDWLGEN  LIDCET+IVTLRL SGDSFTY+GATS
Subjt:  MSEKFVDLAHLEKEPLETILSVSTPAHELLMATHRVKGGNVIVSGRVIKAVLIVLSMQDFNVILGMDWLGENHALIDCETQIVTLRLSSGDSFTYEGATS

Query:  KRTLSVVTALKARKMICGGASAFLVSVTLDCSNEQTVSSVHIVREFTDVFPKDLPSLPPVREVDFGINLELGTAPISKAPYRMARAELRELEEQL
        K   SV+T+L+A+K+I  GA AFL  VTLD SN+Q  SSVHIVREF DVFP+DL  LPP +EV+FGI+LE GT PISKAPYRMA AELREL+EQL
Subjt:  KRTLSVVTALKARKMICGGASAFLVSVTLDCSNEQTVSSVHIVREFTDVFPKDLPSLPPVREVDFGINLELGTAPISKAPYRMARAELRELEEQL

A0A6J1EZM7 uncharacterized protein LOC1114380261.8e-6872.82Show/hide
Query:  MSEKFVDLAHLEKEPLETILSVSTPAHELLMATHRVKGGNVIVSGRVIKAVLIVLSMQDFNVILGMDWLGENHALIDCETQIVTLRLSSGDSFTYEGATS
        +SE+FV+LA LEKE LE  LSVSTPAHELL+ATHRVKGG V ++GRVI+A LIVL MQDF+VILGMDWLG N  LIDCET+IVTLRL SGD FTY+GATS
Subjt:  MSEKFVDLAHLEKEPLETILSVSTPAHELLMATHRVKGGNVIVSGRVIKAVLIVLSMQDFNVILGMDWLGENHALIDCETQIVTLRLSSGDSFTYEGATS

Query:  KRTLSVVTALKARKMICGGASAFLVSVTLDCSNEQTVSSVHIVREFTDVFPKDLPSLPPVREVDFGINLELGTAPISKAPYRMARAELRELEEQL
        K   SV+T+L+A+K+I  GA AFL SVTLD SN+Q  SSVHIVREF DVFP+DLP LPP REVDFGI+LE GT PISKAP      ELREL+EQL
Subjt:  KRTLSVVTALKARKMICGGASAFLVSVTLDCSNEQTVSSVHIVREFTDVFPKDLPSLPPVREVDFGINLELGTAPISKAPYRMARAELRELEEQL

A0A6J1F3W3 uncharacterized protein LOC1114397723.2e-7374.62Show/hide
Query:  DLAHLEKEPLETILSVSTPAHELLMATHRVKGGNVIVSGRVIKAVLIVLSMQDFNVILGMDWLGENHALIDCETQIVTLRLSSGDSFTYEGATSKRTLSV
        D    EKE LE  LSVSTP HELL+ATH++KGG V +SGRVI+A LIVLSMQDF+VILGMDWLGEN  LIDCET+IVTLRL S DSFTY+G TSK   SV
Subjt:  DLAHLEKEPLETILSVSTPAHELLMATHRVKGGNVIVSGRVIKAVLIVLSMQDFNVILGMDWLGENHALIDCETQIVTLRLSSGDSFTYEGATSKRTLSV

Query:  VTALKARKMICGGASAFLVSVTLDCSNEQTVSSVHIVREFTDVFPKDLPSLPPVREVDFGINLELGTAPISKAPYRMARAELRELEEQLQELLDKGF
        +T L+A+K+I  GASAFL SVTLD SN+Q  SSVHI+REF DVFP+DLP LP +REVDFGI+LE GT PISKAPYRMA AELREL+EQLQELLDKGF
Subjt:  VTALKARKMICGGASAFLVSVTLDCSNEQTVSSVHIVREFTDVFPKDLPSLPPVREVDFGINLELGTAPISKAPYRMARAELRELEEQLQELLDKGF

A0A6J1GZS7 uncharacterized protein LOC1114587304.9e-8283.84Show/hide
Query:  DLAHLEKEPLETILSVSTPAHELLMATHRVKGGNVIVSGRVIKAVLIVLSMQDFNVILGMDWLGENHALIDCETQIVTLRLSSGDSFTYEGATSKRTLSV
        D    EKEPLETILSVSTPAHELLMATHRVKGG+V VSGRVI+A LIVLSM DF+VILGMDWLGEN ALIDCET+IVTLRL SGDSFTY+G   KRT SV
Subjt:  DLAHLEKEPLETILSVSTPAHELLMATHRVKGGNVIVSGRVIKAVLIVLSMQDFNVILGMDWLGENHALIDCETQIVTLRLSSGDSFTYEGATSKRTLSV

Query:  VTALKARKMICGGASAFLVSVTLDCSNEQTVSSVHIVREFTDVFPKDLPSLPPVREVDFGINLELGTAPISKAPYRMARAELRELEEQLQELLDKGFI
        VTALKA+KMI  GASAFL SVTLD  N Q VSSVHIVREF DVFP+DLPSLPPVREVDFGI+LE GTAPISKAPYRMA AELREL+EQLQELLDKGFI
Subjt:  VTALKARKMICGGASAFLVSVTLDCSNEQTVSSVHIVREFTDVFPKDLPSLPPVREVDFGINLELGTAPISKAPYRMARAELRELEEQLQELLDKGFI

A0A6J1H2K9 uncharacterized protein LOC111459523 isoform X11.3e-106100Show/hide
Query:  MSEKFVDLAHLEKEPLETILSVSTPAHELLMATHRVKGGNVIVSGRVIKAVLIVLSMQDFNVILGMDWLGENHALIDCETQIVTLRLSSGDSFTYEGATS
        MSEKFVDLAHLEKEPLETILSVSTPAHELLMATHRVKGGNVIVSGRVIKAVLIVLSMQDFNVILGMDWLGENHALIDCETQIVTLRLSSGDSFTYEGATS
Subjt:  MSEKFVDLAHLEKEPLETILSVSTPAHELLMATHRVKGGNVIVSGRVIKAVLIVLSMQDFNVILGMDWLGENHALIDCETQIVTLRLSSGDSFTYEGATS

Query:  KRTLSVVTALKARKMICGGASAFLVSVTLDCSNEQTVSSVHIVREFTDVFPKDLPSLPPVREVDFGINLELGTAPISKAPYRMARAELRELEEQLQELLD
        KRTLSVVTALKARKMICGGASAFLVSVTLDCSNEQTVSSVHIVREFTDVFPKDLPSLPPVREVDFGINLELGTAPISKAPYRMARAELRELEEQLQELLD
Subjt:  KRTLSVVTALKARKMICGGASAFLVSVTLDCSNEQTVSSVHIVREFTDVFPKDLPSLPPVREVDFGINLELGTAPISKAPYRMARAELRELEEQLQELLD

Query:  KGFI
        KGFI
Subjt:  KGFI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGAGAAATTCGTTGACTTAGCGCACCTAGAAAAAGAACCTTTAGAGACCATTCTATCAGTATCCACCCCTGCTCACGAGTTATTAATGGCTACTCATAGAGTTAA
GGGGGGTAATGTGATAGTGTCGGGACGTGTCATAAAAGCTGTGTTAATAGTACTAAGCATGCAAGACTTCAATGTCATTTTGGGTATGGATTGGCTAGGCGAGAATCATG
CTCTAATAGATTGTGAAACTCAAATAGTTACTCTCAGACTCTCGTCAGGGGATAGTTTTACATATGAGGGAGCCACTTCCAAAAGAACTCTAAGCGTCGTAACTGCACTA
AAGGCTAGAAAGATGATCTGCGGTGGTGCAAGCGCATTTTTAGTCAGTGTGACCCTAGACTGTAGTAATGAACAGACAGTCTCATCAGTACACATTGTCAGGGAATTCAC
TGATGTTTTTCCTAAAGATTTGCCAAGTTTGCCCCCTGTTAGGGAAGTTGACTTCGGGATCAACCTAGAATTAGGAACTGCGCCGATCTCTAAGGCACCCTACAGAATGG
CACGTGCAGAACTCAGGGAATTGGAGGAACAGTTACAGGAGCTATTGGATAAGGGTTTCATATGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTGAGAAATTCGTTGACTTAGCGCACCTAGAAAAAGAACCTTTAGAGACCATTCTATCAGTATCCACCCCTGCTCACGAGTTATTAATGGCTACTCATAGAGTTAA
GGGGGGTAATGTGATAGTGTCGGGACGTGTCATAAAAGCTGTGTTAATAGTACTAAGCATGCAAGACTTCAATGTCATTTTGGGTATGGATTGGCTAGGCGAGAATCATG
CTCTAATAGATTGTGAAACTCAAATAGTTACTCTCAGACTCTCGTCAGGGGATAGTTTTACATATGAGGGAGCCACTTCCAAAAGAACTCTAAGCGTCGTAACTGCACTA
AAGGCTAGAAAGATGATCTGCGGTGGTGCAAGCGCATTTTTAGTCAGTGTGACCCTAGACTGTAGTAATGAACAGACAGTCTCATCAGTACACATTGTCAGGGAATTCAC
TGATGTTTTTCCTAAAGATTTGCCAAGTTTGCCCCCTGTTAGGGAAGTTGACTTCGGGATCAACCTAGAATTAGGAACTGCGCCGATCTCTAAGGCACCCTACAGAATGG
CACGTGCAGAACTCAGGGAATTGGAGGAACAGTTACAGGAGCTATTGGATAAGGGTTTCATATGA
Protein sequenceShow/hide protein sequence
MSEKFVDLAHLEKEPLETILSVSTPAHELLMATHRVKGGNVIVSGRVIKAVLIVLSMQDFNVILGMDWLGENHALIDCETQIVTLRLSSGDSFTYEGATSKRTLSVVTAL
KARKMICGGASAFLVSVTLDCSNEQTVSSVHIVREFTDVFPKDLPSLPPVREVDFGINLELGTAPISKAPYRMARAELRELEEQLQELLDKGFI