; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh01G008980 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh01G008980
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionGag-Pol polyprotein
Genome locationCmo_Chr01:4992707..4993339
RNA-Seq ExpressionCmoCh01G008980
SyntenyCmoCh01G008980
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8702390.1 hypothetical protein F3Y22_tig00110483pilonHSYRG00411 [Hibiscus syriacus]2.6e-6167.33Show/hide
Query:  LPETQVVEKILRSLTDNFENVVCAIEESKDLAKFTVDELAGSLEAHEQR-KKKEEEPLDQALQKKASIKDGKILYSQNFR--GRDSGSRGNGRATQGNSH
        L E +VVEKILRSLTDNFENVVCAIEESKDLA  TV+EL GSLEAHEQR KKK+EE L+QALQ KA IKD K+ YSQNFR  GR  G RG GR  QG+SH
Subjt:  LPETQVVEKILRSLTDNFENVVCAIEESKDLAKFTVDELAGSLEAHEQR-KKKEEEPLDQALQKKASIKDGKILYSQNFR--GRDSGSRGNGRATQGNSH

Query:  EENYHEKRLSSQANWRGRGRNQECGQGYNF-NVQCYKCQKYDHYANNCNSDRCYNCGRMGHYARDCRTKEKVEESINLALDDTTNGGILLMAQNKETNTK
        E  Y E   SSQ NWRGRGR +  G   N+ N++CYKC KY HYA +CNSD+CYNCG++GH+A+DCR  +KVEE+INLAL+D  N G LLMAQN E N  
Subjt:  EENYHEKRLSSQANWRGRGRNQECGQGYNF-NVQCYKCQKYDHYANNCNSDRCYNCGRMGHYARDCRTKEKVEESINLALDDTTNGGILLMAQNKETNTK

Query:  GD
         D
Subjt:  GD

KAE8721174.1 hypothetical protein F3Y22_tig00016637pilonHSYRG00095 [Hibiscus syriacus]2.6e-6167.33Show/hide
Query:  LPETQVVEKILRSLTDNFENVVCAIEESKDLAKFTVDELAGSLEAHEQR-KKKEEEPLDQALQKKASIKDGKILYSQNFR--GRDSGSRGNGRATQGNSH
        L E +VVEKILRSLTDNFENVVCAIEESKDLA  TV+EL GSLEAHEQR KKK+EE L+QALQ KA IKD K+ YSQNFR  GR  G RG GR  QG+SH
Subjt:  LPETQVVEKILRSLTDNFENVVCAIEESKDLAKFTVDELAGSLEAHEQR-KKKEEEPLDQALQKKASIKDGKILYSQNFR--GRDSGSRGNGRATQGNSH

Query:  EENYHEKRLSSQANWRGRGRNQECGQGYNF-NVQCYKCQKYDHYANNCNSDRCYNCGRMGHYARDCRTKEKVEESINLALDDTTNGGILLMAQNKETNTK
        E  Y E   SSQ NWRGRGR +  G   N+ N++CYKC KY HYA +CNSD+CYNCG++GH+A+DCR  +KVEE+INLAL+D  N G LLMAQN E N  
Subjt:  EENYHEKRLSSQANWRGRGRNQECGQGYNF-NVQCYKCQKYDHYANNCNSDRCYNCGRMGHYARDCRTKEKVEESINLALDDTTNGGILLMAQNKETNTK

Query:  GD
         D
Subjt:  GD

XP_022942136.1 uncharacterized protein LOC111447269 [Cucurbita moschata]1.6e-8782.76Show/hide
Query:  MLPETQVVEKILRSLTDNFENVVCAIEESKDLAKFTVDELAGSLEAHEQRKKKEEEPLDQALQKKASIKDGKILYSQNFRGRDSGSRGNGRATQGNSHEE
        ML ET+VVEKILRSLTDNF+NVVCAIEESKDLAKFTVDEL GSLEAHEQRKKK+EEPLDQ LQ KA+IKDGK+LYSQNFRGRD+GSR NGR  QGN+HE+
Subjt:  MLPETQVVEKILRSLTDNFENVVCAIEESKDLAKFTVDELAGSLEAHEQRKKKEEEPLDQALQKKASIKDGKILYSQNFRGRDSGSRGNGRATQGNSHEE

Query:  NYHEKRLSSQANWRGRGRNQECGQGYNFNVQCYKCQKYDHYANNCNSDRCYNCGRMGHYARDCRTKEKVEESINLALDDTTNGGILLMAQNKETNTKGDG
        NY EKRLSSQANWRGRGR Q  GQGY  N+QC+KCQKY HYANNCNSDRCYNCGR+GHYARDCR KEKVEE+INLALDD T+GGILLMAQ++E NTK   
Subjt:  NYHEKRLSSQANWRGRGRNQECGQGYNFNVQCYKCQKYDHYANNCNSDRCYNCGRMGHYARDCRTKEKVEESINLALDDTTNGGILLMAQNKETNTKGDG

Query:  GAK
        GAK
Subjt:  GAK

XP_022975262.1 uncharacterized protein LOC111474389 [Cucurbita maxima]5.1e-7375Show/hide
Query:  MLPETQVVEKILRSLTDNFEN-VVCAIEESKDLAKFTVDELAGSLEAHEQRKKKEEEPLDQALQKKASIKDGKILYSQNFRGRDSGSRGNGRATQGNSHE
        +L ET++VEKILRSLT N EN VVC IEESKDLA FTVDE+  SLEAHEQ KKK +E LD+ALQ KASI D K+LY QN  GR  GSR NGR +QGN++E
Subjt:  MLPETQVVEKILRSLTDNFEN-VVCAIEESKDLAKFTVDELAGSLEAHEQRKKKEEEPLDQALQKKASIKDGKILYSQNFRGRDSGSRGNGRATQGNSHE

Query:  ENYHEKRLSSQANWRGRGRNQECGQGYNFNVQCYKCQKYDHYANNCNSDRCYNCGRMGHYARDCRTKEKVEESINLALDDTTNGGILLMAQNKETNTKGD
        ENY EKRLSSQANWRGRG NQ  GQGYN NVQCYKCQKY H  NNCNSD+CYNCGRMGHYARDCR +EKVEE+INLALDD TN GILLMAQN+E  TK  
Subjt:  ENYHEKRLSSQANWRGRGRNQECGQGYNFNVQCYKCQKYDHYANNCNSDRCYNCGRMGHYARDCRTKEKVEESINLALDDTTNGGILLMAQNKETNTKGD

Query:  GGAK
        G AK
Subjt:  GGAK

XP_023524532.1 ATP-dependent RNA helicase glh-2-like [Cucurbita pepo subsp. pepo]4.5e-9391.24Show/hide
Query:  DNFENVVCAIEESKDLAKFTVDELAGSLEAHEQRKKKEEEPLDQALQKKASIKDGKILYSQNFRGRDSGSRGNGRATQGNSHEENYHEKRLSSQANWRGR
        D+++N     EESKDLAKFTVDELAGSLEAHEQRKKKEEEP DQALQ KASIKDGKILYSQNFRGRDSGSRGNGRA +GNSHEE+YHEKRLSSQANWRGR
Subjt:  DNFENVVCAIEESKDLAKFTVDELAGSLEAHEQRKKKEEEPLDQALQKKASIKDGKILYSQNFRGRDSGSRGNGRATQGNSHEENYHEKRLSSQANWRGR

Query:  GRNQECGQGYNFNVQCYKCQKYDHYANNCNSDRCYNCGRMGHYARDCRTKEKVEESINLALDDTTNGGILLMAQNKETNTKGDGGAKTMAIVVR
        GRNQECGQGYNFNVQCYKCQKYDHYANNCNSDRCYNCGRMGHYARDCRTKEKVEESINLALDD T+GGILLMAQN+ETNTKGDG AKTMAIVVR
Subjt:  GRNQECGQGYNFNVQCYKCQKYDHYANNCNSDRCYNCGRMGHYARDCRTKEKVEESINLALDDTTNGGILLMAQNKETNTKGDGGAKTMAIVVR

TrEMBL top hitse value%identityAlignment
A0A6A3AD07 Uncharacterized protein1.3e-6167.33Show/hide
Query:  LPETQVVEKILRSLTDNFENVVCAIEESKDLAKFTVDELAGSLEAHEQR-KKKEEEPLDQALQKKASIKDGKILYSQNFR--GRDSGSRGNGRATQGNSH
        L E +VVEKILRSLTDNFENVVCAIEESKDLA  TV+EL GSLEAHEQR KKK+EE L+QALQ KA IKD K+ YSQNFR  GR  G RG GR  QG+SH
Subjt:  LPETQVVEKILRSLTDNFENVVCAIEESKDLAKFTVDELAGSLEAHEQR-KKKEEEPLDQALQKKASIKDGKILYSQNFR--GRDSGSRGNGRATQGNSH

Query:  EENYHEKRLSSQANWRGRGRNQECGQGYNF-NVQCYKCQKYDHYANNCNSDRCYNCGRMGHYARDCRTKEKVEESINLALDDTTNGGILLMAQNKETNTK
        E  Y E   SSQ NWRGRGR +  G   N+ N++CYKC KY HYA +CNSD+CYNCG++GH+A+DCR  +KVEE+INLAL+D  N G LLMAQN E N  
Subjt:  EENYHEKRLSSQANWRGRGRNQECGQGYNF-NVQCYKCQKYDHYANNCNSDRCYNCGRMGHYARDCRTKEKVEESINLALDDTTNGGILLMAQNKETNTK

Query:  GD
         D
Subjt:  GD

A0A6A3BX58 Uncharacterized protein1.3e-6167.33Show/hide
Query:  LPETQVVEKILRSLTDNFENVVCAIEESKDLAKFTVDELAGSLEAHEQR-KKKEEEPLDQALQKKASIKDGKILYSQNFR--GRDSGSRGNGRATQGNSH
        L E +VVEKILRSLTDNFENVVCAIEESKDLA  TV+EL GSLEAHEQR KKK+EE L+QALQ KA IKD K+ YSQNFR  GR  G RG GR  QG+SH
Subjt:  LPETQVVEKILRSLTDNFENVVCAIEESKDLAKFTVDELAGSLEAHEQR-KKKEEEPLDQALQKKASIKDGKILYSQNFR--GRDSGSRGNGRATQGNSH

Query:  EENYHEKRLSSQANWRGRGRNQECGQGYNF-NVQCYKCQKYDHYANNCNSDRCYNCGRMGHYARDCRTKEKVEESINLALDDTTNGGILLMAQNKETNTK
        E  Y E   SSQ NWRGRGR +  G   N+ N++CYKC KY HYA +CNSD+CYNCG++GH+A+DCR  +KVEE+INLAL+D  N G LLMAQN E N  
Subjt:  EENYHEKRLSSQANWRGRGRNQECGQGYNF-NVQCYKCQKYDHYANNCNSDRCYNCGRMGHYARDCRTKEKVEESINLALDDTTNGGILLMAQNKETNTK

Query:  GD
         D
Subjt:  GD

A0A6A3CZT6 Uncharacterized protein1.7e-6166.83Show/hide
Query:  LPETQVVEKILRSLTDNFENVVCAIEESKDLAKFTVDELAGSLEAHEQRKKK-EEEPLDQALQKKASIKDGKILYSQNFR--GRDSGSRGNGRATQGNSH
        L E +VVEKILRSLTDNFENVVCAIEESKDLA  T++EL GSLEAHEQRKKK +EE L+QALQ KA IKD K+ YSQNFR  GR  G RG GR  QG+SH
Subjt:  LPETQVVEKILRSLTDNFENVVCAIEESKDLAKFTVDELAGSLEAHEQRKKK-EEEPLDQALQKKASIKDGKILYSQNFR--GRDSGSRGNGRATQGNSH

Query:  EENYHEKRLSSQANWRGRGRNQECGQGYNF-NVQCYKCQKYDHYANNCNSDRCYNCGRMGHYARDCRTKEKVEESINLALDDTTNGGILLMAQNKETNTK
        E  Y E   SSQ NWRGRGR +  G   N+ N++CYKC KY HYA +CNSD+CYNCG++GH+A+DCR  +KVEE+INLAL+D  N G LLMAQN E N  
Subjt:  EENYHEKRLSSQANWRGRGRNQECGQGYNF-NVQCYKCQKYDHYANNCNSDRCYNCGRMGHYARDCRTKEKVEESINLALDDTTNGGILLMAQNKETNTK

Query:  GD
         D
Subjt:  GD

A0A6J1FN02 uncharacterized protein LOC1114472698.0e-8882.76Show/hide
Query:  MLPETQVVEKILRSLTDNFENVVCAIEESKDLAKFTVDELAGSLEAHEQRKKKEEEPLDQALQKKASIKDGKILYSQNFRGRDSGSRGNGRATQGNSHEE
        ML ET+VVEKILRSLTDNF+NVVCAIEESKDLAKFTVDEL GSLEAHEQRKKK+EEPLDQ LQ KA+IKDGK+LYSQNFRGRD+GSR NGR  QGN+HE+
Subjt:  MLPETQVVEKILRSLTDNFENVVCAIEESKDLAKFTVDELAGSLEAHEQRKKKEEEPLDQALQKKASIKDGKILYSQNFRGRDSGSRGNGRATQGNSHEE

Query:  NYHEKRLSSQANWRGRGRNQECGQGYNFNVQCYKCQKYDHYANNCNSDRCYNCGRMGHYARDCRTKEKVEESINLALDDTTNGGILLMAQNKETNTKGDG
        NY EKRLSSQANWRGRGR Q  GQGY  N+QC+KCQKY HYANNCNSDRCYNCGR+GHYARDCR KEKVEE+INLALDD T+GGILLMAQ++E NTK   
Subjt:  NYHEKRLSSQANWRGRGRNQECGQGYNFNVQCYKCQKYDHYANNCNSDRCYNCGRMGHYARDCRTKEKVEESINLALDDTTNGGILLMAQNKETNTKGDG

Query:  GAK
        GAK
Subjt:  GAK

A0A6J1ICK8 uncharacterized protein LOC1114743892.5e-7375Show/hide
Query:  MLPETQVVEKILRSLTDNFEN-VVCAIEESKDLAKFTVDELAGSLEAHEQRKKKEEEPLDQALQKKASIKDGKILYSQNFRGRDSGSRGNGRATQGNSHE
        +L ET++VEKILRSLT N EN VVC IEESKDLA FTVDE+  SLEAHEQ KKK +E LD+ALQ KASI D K+LY QN  GR  GSR NGR +QGN++E
Subjt:  MLPETQVVEKILRSLTDNFEN-VVCAIEESKDLAKFTVDELAGSLEAHEQRKKKEEEPLDQALQKKASIKDGKILYSQNFRGRDSGSRGNGRATQGNSHE

Query:  ENYHEKRLSSQANWRGRGRNQECGQGYNFNVQCYKCQKYDHYANNCNSDRCYNCGRMGHYARDCRTKEKVEESINLALDDTTNGGILLMAQNKETNTKGD
        ENY EKRLSSQANWRGRG NQ  GQGYN NVQCYKCQKY H  NNCNSD+CYNCGRMGHYARDCR +EKVEE+INLALDD TN GILLMAQN+E  TK  
Subjt:  ENYHEKRLSSQANWRGRGRNQECGQGYNFNVQCYKCQKYDHYANNCNSDRCYNCGRMGHYARDCRTKEKVEESINLALDDTTNGGILLMAQNKETNTKGD

Query:  GGAK
        G AK
Subjt:  GGAK

SwissProt top hitse value%identityAlignment
P19558 Gag polyprotein1.4e-0450Show/hide
Query:  QCYKCQKYDHYANNCNSDRCYNCGRMGHYARDCRTK
        +CY C K  H   NC   +CY+CG+ GH AR+CR+K
Subjt:  QCYKCQKYDHYANNCNSDRCYNCGRMGHYARDCRTK

P19560 Gag-Pol polyprotein6.2e-0547.37Show/hide
Query:  QCYKCQKYDHYANNCNSDRCYNCGRMGHYARDCRTKEK
        +CY C K  H   NC   +CY+CG+ GH AR+CR+K +
Subjt:  QCYKCQKYDHYANNCNSDRCYNCGRMGHYARDCRTKEK

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTACCCGAGACGCAGGTTGTGGAGAAAATCTTGAGGTCGTTAACAGACAACTTCGAGAATGTTGTGTGTGCCATAGAAGAGTCGAAGGACCTAGCGAAGTTCACAGT
CGATGAGCTTGCCGGTTCTCTCGAGGCACACGAGCAACGCAAGAAAAAGGAGGAGGAGCCGCTCGATCAAGCGCTTCAAAAGAAGGCATCAATAAAGGATGGAAAGATAC
TCTACTCACAGAATTTTCGAGGTAGAGATAGTGGAAGCCGCGGGAATGGTCGAGCTACTCAAGGCAATAGTCACGAAGAAAACTACCATGAGAAGAGACTGTCGAGCCAA
GCAAATTGGCGTGGAAGAGGACGCAATCAAGAGTGCGGTCAAGGATACAATTTCAACGTCCAGTGCTATAAATGTCAGAAATATGACCACTATGCAAATAATTGTAACTC
CGACAGATGTTACAATTGTGGCAGAATGGGTCACTATGCAAGAGATTGTCGAACCAAAGAGAAGGTGGAAGAATCCATCAACCTAGCCTTGGATGACACAACAAATGGAG
GCATCCTCTTGATGGCCCAAAACAAAGAGACAAATACAAAAGGAGACGGCGGTGCGAAGACAATGGCGATAGTCGTGAGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTACCCGAGACGCAGGTTGTGGAGAAAATCTTGAGGTCGTTAACAGACAACTTCGAGAATGTTGTGTGTGCCATAGAAGAGTCGAAGGACCTAGCGAAGTTCACAGT
CGATGAGCTTGCCGGTTCTCTCGAGGCACACGAGCAACGCAAGAAAAAGGAGGAGGAGCCGCTCGATCAAGCGCTTCAAAAGAAGGCATCAATAAAGGATGGAAAGATAC
TCTACTCACAGAATTTTCGAGGTAGAGATAGTGGAAGCCGCGGGAATGGTCGAGCTACTCAAGGCAATAGTCACGAAGAAAACTACCATGAGAAGAGACTGTCGAGCCAA
GCAAATTGGCGTGGAAGAGGACGCAATCAAGAGTGCGGTCAAGGATACAATTTCAACGTCCAGTGCTATAAATGTCAGAAATATGACCACTATGCAAATAATTGTAACTC
CGACAGATGTTACAATTGTGGCAGAATGGGTCACTATGCAAGAGATTGTCGAACCAAAGAGAAGGTGGAAGAATCCATCAACCTAGCCTTGGATGACACAACAAATGGAG
GCATCCTCTTGATGGCCCAAAACAAAGAGACAAATACAAAAGGAGACGGCGGTGCGAAGACAATGGCGATAGTCGTGAGGTAG
Protein sequenceShow/hide protein sequence
MLPETQVVEKILRSLTDNFENVVCAIEESKDLAKFTVDELAGSLEAHEQRKKKEEEPLDQALQKKASIKDGKILYSQNFRGRDSGSRGNGRATQGNSHEENYHEKRLSSQ
ANWRGRGRNQECGQGYNFNVQCYKCQKYDHYANNCNSDRCYNCGRMGHYARDCRTKEKVEESINLALDDTTNGGILLMAQNKETNTKGDGGAKTMAIVVR