; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh16G012730 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh16G012730
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCmo_Chr16:9079522..9080319
RNA-Seq ExpressionCmoCh16G012730
SyntenyCmoCh16G012730
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022930156.1 uncharacterized protein LOC111436667 [Cucurbita moschata]2.4e-9979.6Show/hide
Query:  KMKESESVKEYSDRLLSIANKVRLLGSVLNDSKIVEKLLVTLPEKFEAIITTLENTKDLSKISLTELLNALQAQEQRRSMRQEGVIEGALLVKHQDSRRY
        KMKES+SVK+YSDRLL+IANKVRLLGS+LNDS+IVEKLL ++PEKFEA ITTLENTKDLSKISL ELLNALQAQ+QRR MRQEGV EG + VK+QD+ RY
Subjt:  KMKESESVKEYSDRLLSIANKVRLLGSVLNDSKIVEKLLVTLPEKFEAIITTLENTKDLSKISLTELLNALQAQEQRRSMRQEGVIEGALLVKHQDSRRY

Query:  KNNKNFKNQLTYGDSSANYQKTKGGGFKKSYPPCCHCEKKGHPPYKCWRRPDAFCSKCNQLGHEAVICKAKDLVKEVDAQVVDQEEEEEEEDQLFMVTSS
        KN K FKNQ T GDSS NYQKTKGGG KK YPPC HCEKKGHPPYKCWRR DA CSKCNQLGHEAVICK K  VKEVDA V+DQ    EEEDQLF+VT  
Subjt:  KNNKNFKNQLTYGDSSANYQKTKGGGFKKSYPPCCHCEKKGHPPYKCWRRPDAFCSKCNQLGHEAVICKAKDLVKEVDAQVVDQEEEEEEEDQLFMVTSS

Query:  SSKESSESWLIDSGCTNHMTYDKESFEELRDTEDKRVRIGNGEHLEVREK
          KESSE WLIDSGCTNHMTYD E FEELRDTE KRVRI NGEHL V  K
Subjt:  SSKESSESWLIDSGCTNHMTYDKESFEELRDTEDKRVRIGNGEHLEVREK

XP_022932242.1 uncharacterized protein LOC111438605 [Cucurbita moschata]1.1e-9689.35Show/hide
Query:  MKVLNLIRDFELQKMKESESVKEYSDRLLSIANKVRLLGSVLNDSKIVEKLLVTLPEKFEAIITTLENTKDLSKISLTELLNALQAQEQRRSMRQEGVIE
        MKVLNLIRDFELQKMKES+SVKEY DRLLSIANKVRLLGS+LNDSKIVEKLLVT+PEKFEA ITTLENTKDLSKISLTELLNALQAQEQRRSMRQEGVIE
Subjt:  MKVLNLIRDFELQKMKESESVKEYSDRLLSIANKVRLLGSVLNDSKIVEKLLVTLPEKFEAIITTLENTKDLSKISLTELLNALQAQEQRRSMRQEGVIE

Query:  GALLVKHQDSRRYKNNKNFKNQLTYGDSSANYQKTKGGGFKKSYPPCCHCEKKGHPPYKCWRRPDAFCSKCNQLGHEAVICKAKDLVKEVDAQVVDQEEE
         ALLVKHQDS RYKNNKNFKNQLTYGDS ANYQKTKGGGFKKSYPPC HCEKKGHPPYKCWRRPD  CSKCNQLGHE +I K K  VKEVDAQ+VDQ   
Subjt:  GALLVKHQDSRRYKNNKNFKNQLTYGDSSANYQKTKGGGFKKSYPPCCHCEKKGHPPYKCWRRPDAFCSKCNQLGHEAVICKAKDLVKEVDAQVVDQEEE

Query:  EEEEDQLFMVTSSSSK
         EEEDQLF+VTSSSSK
Subjt:  EEEEDQLFMVTSSSSK

XP_022959005.1 uncharacterized protein LOC111460124 [Cucurbita moschata]3.6e-12490.87Show/hide
Query:  MKVLNLIRDFELQKMKESESVKEYSDRLLSIANKVRLLGSVLNDSKIVEKLLVTLPEKFEAIITTLENTKDLSKISLTELLNALQAQEQRRSMRQEGVIE
        MK LNLIRDFELQKMK+SESVKEYS+RLL+IANKVRLLGS+LNDS+IVEKLLVT+PEKFEA ITTLENTKDLSKISLTELLNALQAQEQ+RSMRQEGVIE
Subjt:  MKVLNLIRDFELQKMKESESVKEYSDRLLSIANKVRLLGSVLNDSKIVEKLLVTLPEKFEAIITTLENTKDLSKISLTELLNALQAQEQRRSMRQEGVIE

Query:  GALLVKHQDSRRYKNNKNFKNQLTYGDSSANYQKTKGGGFKKSYPPCCHCEKKGHPPYKCWRRPDAFCSKCNQLGHEAVICKAKDLVKEVDAQVVDQEEE
        GAL VKHQD+ RYKNNKNFKNQLTYGDSS NYQKTKGGGFKKSYP C HCEKK HPPYKCWRRPDAFCSKCNQLGHEAVICK KDLVKEVDAQVVDQ   
Subjt:  GALLVKHQDSRRYKNNKNFKNQLTYGDSSANYQKTKGGGFKKSYPPCCHCEKKGHPPYKCWRRPDAFCSKCNQLGHEAVICKAKDLVKEVDAQVVDQEEE

Query:  EEEEDQLFMVTSSSSKESSESWLIDSGCTNHMTYDKESFEELRDTEDKRVRIGNGEHLEVREK
        EEEEDQL MVTSSSSKESSESWLIDSGCTNHMTYDKESFEELRDTE KRVRIGNGEHLEV+ K
Subjt:  EEEEDQLFMVTSSSSKESSESWLIDSGCTNHMTYDKESFEELRDTEDKRVRIGNGEHLEVREK

XP_022963821.1 uncharacterized protein LOC111464007 [Cucurbita moschata]2.3e-11892.8Show/hide
Query:  MKESESVKEYSDRLLSIANKVRLLGSVLNDSKIVEKLLVTLPEKFEAIITTLENTKDLSKISLTELLNALQAQEQRRSMRQEGVIEGALLVKHQDSRRYK
        MKESESVKEYSDRL SIANKVRLLGS+LNDS+IVEKLLVT+PEKFEA ITTLENTKDLSKISLTELLNALQAQEQRRSMRQEGVIEGAL VKHQDS RYK
Subjt:  MKESESVKEYSDRLLSIANKVRLLGSVLNDSKIVEKLLVTLPEKFEAIITTLENTKDLSKISLTELLNALQAQEQRRSMRQEGVIEGALLVKHQDSRRYK

Query:  NNKNFKNQLTYGDSSANYQKTKGGGFKKSYPPCCHCEKKGHPPYKCWRRPDAFCSKCNQLGHEAVICKAKDLVKEVDAQVVDQEEEEEEEDQLFMVTSSS
        NNKNFKNQLTYGDSSANYQKTK GGFKKSYPPC H E KGHPPYKCWRRPDAFCSKCNQLGHEAVICKAKDLVKEVDAQVVDQ EEEEEEDQL MVTSSS
Subjt:  NNKNFKNQLTYGDSSANYQKTKGGGFKKSYPPCCHCEKKGHPPYKCWRRPDAFCSKCNQLGHEAVICKAKDLVKEVDAQVVDQEEEEEEEDQLFMVTSSS

Query:  SKESSESWLIDSGCTNHMTYDKESFEELRDT-EDKRVRIGNGEHLEVREK
        SKES ESWLIDSGCTNHMTYDKESFEELRDT EDKRVRIGNGEH+EV+ K
Subjt:  SKESSESWLIDSGCTNHMTYDKESFEELRDT-EDKRVRIGNGEHLEVREK

XP_022974382.1 uncharacterized protein LOC111473053 [Cucurbita maxima]2.6e-11485.93Show/hide
Query:  MKVLNLIRDFELQKMKESESVKEYSDRLLSIANKVRLLGSVLNDSKIVEKLLVTLPEKFEAIITTLENTKDLSKISLTELLNALQAQEQRRSMRQEGVIE
        MK LNLIRDFELQKM ES+SVKEYS++LLSIANKVRLLGSVLNDS IVEKLLV +PE FE  ITTLENTKDLSKISLTELLNALQAQEQRRSMRQEGVIE
Subjt:  MKVLNLIRDFELQKMKESESVKEYSDRLLSIANKVRLLGSVLNDSKIVEKLLVTLPEKFEAIITTLENTKDLSKISLTELLNALQAQEQRRSMRQEGVIE

Query:  GALLVKHQDSRRYKNNKNFKNQLTYGDSSANYQKTKGGGFKKSYPPCCHCEKKGHPPYKCWRRPDAFCSKCNQLGHEAVICKAKDLVKEVDAQVVDQEEE
        GAL VKHQD+RRYKN K FKNQ T GD S N+QKTKGG FKKSYPPC HCEKKGHPPYKCWRRPDAFCSKCNQLGHEAVICK++D VKEVDAQVVDQ   
Subjt:  GALLVKHQDSRRYKNNKNFKNQLTYGDSSANYQKTKGGGFKKSYPPCCHCEKKGHPPYKCWRRPDAFCSKCNQLGHEAVICKAKDLVKEVDAQVVDQEEE

Query:  EEEEDQLFMVTSSSSKESSESWLIDSGCTNHMTYDKESFEELRDTEDKRVRIGNGEHLEVREK
        EEEEDQLFMV SSSSKESSESWLI+SGCTNHMTY+KE FE+LRD EDKRVRIGNGE LEV+ K
Subjt:  EEEEDQLFMVTSSSSKESSESWLIDSGCTNHMTYDKESFEELRDTEDKRVRIGNGEHLEVREK

TrEMBL top hitse value%identityAlignment
A0A1U8KFZ5 uncharacterized protein LOC1079152233.2e-10278.41Show/hide
Query:  MKVLNLIRDFELQKMKESESVKEYSDRLLSIANKVRLLGSVLNDSKIVEKLLVTLPEKFEAIITTLENTKDLSKISLTELLNALQAQEQRRSMRQEGVIE
        MKVLNLIRDFELQKMKESESVKEYSDRLLSIANKVRLLGS LNDS+IVEK+LVT+PEK EA ITTLENTKDLSKISL ELLNALQAQ QRRSMRQEGVIE
Subjt:  MKVLNLIRDFELQKMKESESVKEYSDRLLSIANKVRLLGSVLNDSKIVEKLLVTLPEKFEAIITTLENTKDLSKISLTELLNALQAQEQRRSMRQEGVIE

Query:  GALLVKHQDSRRYKNNKNFKNQLTYGD-SSANYQKTKGGGFKKSYPPCCHCEKKGHPPYKCWRRPDAFCSKCNQLGHEAVICKAKDLVKEVDAQVVDQEE
        GALLVKHQD+ RYK  KNF+NQ T  + SS NYQK+K GG KKSYPP  HCEKKGH P+KCW+RPD  CSKCNQLGH+ V+CK K  V+EVDAQV DQ  
Subjt:  GALLVKHQDSRRYKNNKNFKNQLTYGD-SSANYQKTKGGGFKKSYPPCCHCEKKGHPPYKCWRRPDAFCSKCNQLGHEAVICKAKDLVKEVDAQVVDQEE

Query:  EEEEEDQLFMVTSSSSKESSESWLIDSGCTNHMTYDKESFEELRDTEDKRVRIGNGEHLEVREK
          EEED+LF+VT  S +ESSE WLIDSGCTNHMTYDKE FEELR+TE K VRI N E+LEV+ K
Subjt:  EEEEEDQLFMVTSSSSKESSESWLIDSGCTNHMTYDKESFEELRDTEDKRVRIGNGEHLEVREK

A0A6J1EQ51 uncharacterized protein LOC1114366671.1e-9979.6Show/hide
Query:  KMKESESVKEYSDRLLSIANKVRLLGSVLNDSKIVEKLLVTLPEKFEAIITTLENTKDLSKISLTELLNALQAQEQRRSMRQEGVIEGALLVKHQDSRRY
        KMKES+SVK+YSDRLL+IANKVRLLGS+LNDS+IVEKLL ++PEKFEA ITTLENTKDLSKISL ELLNALQAQ+QRR MRQEGV EG + VK+QD+ RY
Subjt:  KMKESESVKEYSDRLLSIANKVRLLGSVLNDSKIVEKLLVTLPEKFEAIITTLENTKDLSKISLTELLNALQAQEQRRSMRQEGVIEGALLVKHQDSRRY

Query:  KNNKNFKNQLTYGDSSANYQKTKGGGFKKSYPPCCHCEKKGHPPYKCWRRPDAFCSKCNQLGHEAVICKAKDLVKEVDAQVVDQEEEEEEEDQLFMVTSS
        KN K FKNQ T GDSS NYQKTKGGG KK YPPC HCEKKGHPPYKCWRR DA CSKCNQLGHEAVICK K  VKEVDA V+DQ    EEEDQLF+VT  
Subjt:  KNNKNFKNQLTYGDSSANYQKTKGGGFKKSYPPCCHCEKKGHPPYKCWRRPDAFCSKCNQLGHEAVICKAKDLVKEVDAQVVDQEEEEEEEDQLFMVTSS

Query:  SSKESSESWLIDSGCTNHMTYDKESFEELRDTEDKRVRIGNGEHLEVREK
          KESSE WLIDSGCTNHMTYD E FEELRDTE KRVRI NGEHL V  K
Subjt:  SSKESSESWLIDSGCTNHMTYDKESFEELRDTEDKRVRIGNGEHLEVREK

A0A6J1H529 uncharacterized protein LOC1114601241.8e-12490.87Show/hide
Query:  MKVLNLIRDFELQKMKESESVKEYSDRLLSIANKVRLLGSVLNDSKIVEKLLVTLPEKFEAIITTLENTKDLSKISLTELLNALQAQEQRRSMRQEGVIE
        MK LNLIRDFELQKMK+SESVKEYS+RLL+IANKVRLLGS+LNDS+IVEKLLVT+PEKFEA ITTLENTKDLSKISLTELLNALQAQEQ+RSMRQEGVIE
Subjt:  MKVLNLIRDFELQKMKESESVKEYSDRLLSIANKVRLLGSVLNDSKIVEKLLVTLPEKFEAIITTLENTKDLSKISLTELLNALQAQEQRRSMRQEGVIE

Query:  GALLVKHQDSRRYKNNKNFKNQLTYGDSSANYQKTKGGGFKKSYPPCCHCEKKGHPPYKCWRRPDAFCSKCNQLGHEAVICKAKDLVKEVDAQVVDQEEE
        GAL VKHQD+ RYKNNKNFKNQLTYGDSS NYQKTKGGGFKKSYP C HCEKK HPPYKCWRRPDAFCSKCNQLGHEAVICK KDLVKEVDAQVVDQ   
Subjt:  GALLVKHQDSRRYKNNKNFKNQLTYGDSSANYQKTKGGGFKKSYPPCCHCEKKGHPPYKCWRRPDAFCSKCNQLGHEAVICKAKDLVKEVDAQVVDQEEE

Query:  EEEEDQLFMVTSSSSKESSESWLIDSGCTNHMTYDKESFEELRDTEDKRVRIGNGEHLEVREK
        EEEEDQL MVTSSSSKESSESWLIDSGCTNHMTYDKESFEELRDTE KRVRIGNGEHLEV+ K
Subjt:  EEEEDQLFMVTSSSSKESSESWLIDSGCTNHMTYDKESFEELRDTEDKRVRIGNGEHLEVREK

A0A6J1HH50 uncharacterized protein LOC1114640071.1e-11892.8Show/hide
Query:  MKESESVKEYSDRLLSIANKVRLLGSVLNDSKIVEKLLVTLPEKFEAIITTLENTKDLSKISLTELLNALQAQEQRRSMRQEGVIEGALLVKHQDSRRYK
        MKESESVKEYSDRL SIANKVRLLGS+LNDS+IVEKLLVT+PEKFEA ITTLENTKDLSKISLTELLNALQAQEQRRSMRQEGVIEGAL VKHQDS RYK
Subjt:  MKESESVKEYSDRLLSIANKVRLLGSVLNDSKIVEKLLVTLPEKFEAIITTLENTKDLSKISLTELLNALQAQEQRRSMRQEGVIEGALLVKHQDSRRYK

Query:  NNKNFKNQLTYGDSSANYQKTKGGGFKKSYPPCCHCEKKGHPPYKCWRRPDAFCSKCNQLGHEAVICKAKDLVKEVDAQVVDQEEEEEEEDQLFMVTSSS
        NNKNFKNQLTYGDSSANYQKTK GGFKKSYPPC H E KGHPPYKCWRRPDAFCSKCNQLGHEAVICKAKDLVKEVDAQVVDQ EEEEEEDQL MVTSSS
Subjt:  NNKNFKNQLTYGDSSANYQKTKGGGFKKSYPPCCHCEKKGHPPYKCWRRPDAFCSKCNQLGHEAVICKAKDLVKEVDAQVVDQEEEEEEEDQLFMVTSSS

Query:  SKESSESWLIDSGCTNHMTYDKESFEELRDT-EDKRVRIGNGEHLEVREK
        SKES ESWLIDSGCTNHMTYDKESFEELRDT EDKRVRIGNGEH+EV+ K
Subjt:  SKESSESWLIDSGCTNHMTYDKESFEELRDT-EDKRVRIGNGEHLEVREK

A0A6J1IA47 uncharacterized protein LOC1114730531.3e-11485.93Show/hide
Query:  MKVLNLIRDFELQKMKESESVKEYSDRLLSIANKVRLLGSVLNDSKIVEKLLVTLPEKFEAIITTLENTKDLSKISLTELLNALQAQEQRRSMRQEGVIE
        MK LNLIRDFELQKM ES+SVKEYS++LLSIANKVRLLGSVLNDS IVEKLLV +PE FE  ITTLENTKDLSKISLTELLNALQAQEQRRSMRQEGVIE
Subjt:  MKVLNLIRDFELQKMKESESVKEYSDRLLSIANKVRLLGSVLNDSKIVEKLLVTLPEKFEAIITTLENTKDLSKISLTELLNALQAQEQRRSMRQEGVIE

Query:  GALLVKHQDSRRYKNNKNFKNQLTYGDSSANYQKTKGGGFKKSYPPCCHCEKKGHPPYKCWRRPDAFCSKCNQLGHEAVICKAKDLVKEVDAQVVDQEEE
        GAL VKHQD+RRYKN K FKNQ T GD S N+QKTKGG FKKSYPPC HCEKKGHPPYKCWRRPDAFCSKCNQLGHEAVICK++D VKEVDAQVVDQ   
Subjt:  GALLVKHQDSRRYKNNKNFKNQLTYGDSSANYQKTKGGGFKKSYPPCCHCEKKGHPPYKCWRRPDAFCSKCNQLGHEAVICKAKDLVKEVDAQVVDQEEE

Query:  EEEEDQLFMVTSSSSKESSESWLIDSGCTNHMTYDKESFEELRDTEDKRVRIGNGEHLEVREK
        EEEEDQLFMV SSSSKESSESWLI+SGCTNHMTY+KE FE+LRD EDKRVRIGNGE LEV+ K
Subjt:  EEEEDQLFMVTSSSSKESSESWLIDSGCTNHMTYDKESFEELRDTEDKRVRIGNGEHLEVREK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGTCCTGAATTTGATCAGGGATTTCGAGTTGCAGAAGATGAAGGAGTCAGAGTCCGTAAAAGAGTACTCTGACAGACTTCTCAGCATCGCCAACAAGGTGAGATT
GCTTGGTTCTGTATTAAATGATTCCAAGATCGTTGAAAAGCTGCTAGTCACTCTTCCAGAGAAGTTTGAAGCCATCATTACTACTCTGGAGAACACCAAAGACTTGTCAA
AGATTTCTCTTACAGAGCTCTTGAATGCTTTACAAGCGCAAGAGCAAAGGAGGTCTATGAGACAAGAAGGGGTGATTGAAGGTGCCTTACTTGTTAAGCATCAAGACAGC
CGCAGGTATAAAAACAACAAAAATTTCAAAAATCAATTGACGTATGGAGATTCATCTGCCAATTATCAAAAGACAAAAGGAGGAGGTTTCAAAAAATCCTATCCACCTTG
CTGCCATTGTGAGAAGAAAGGCCATCCACCATACAAGTGTTGGAGAAGACCTGACGCCTTCTGCTCCAAATGCAATCAACTTGGACATGAAGCTGTGATCTGCAAAGCCA
AAGATCTGGTGAAAGAAGTAGATGCACAGGTCGTTGATCAAGAAGAAGAAGAAGAAGAAGAAGATCAATTGTTTATGGTCACTTCTTCCTCAAGCAAAGAATCAAGCGAG
AGCTGGTTGATTGACAGTGGGTGCACAAATCACATGACATATGACAAGGAGTCTTTTGAGGAATTAAGAGACACCGAGGATAAGAGAGTGAGGATTGGCAACGGTGAACA
CTTGGAAGTCAGGGAAAAGGCACAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAAGTCCTGAATTTGATCAGGGATTTCGAGTTGCAGAAGATGAAGGAGTCAGAGTCCGTAAAAGAGTACTCTGACAGACTTCTCAGCATCGCCAACAAGGTGAGATT
GCTTGGTTCTGTATTAAATGATTCCAAGATCGTTGAAAAGCTGCTAGTCACTCTTCCAGAGAAGTTTGAAGCCATCATTACTACTCTGGAGAACACCAAAGACTTGTCAA
AGATTTCTCTTACAGAGCTCTTGAATGCTTTACAAGCGCAAGAGCAAAGGAGGTCTATGAGACAAGAAGGGGTGATTGAAGGTGCCTTACTTGTTAAGCATCAAGACAGC
CGCAGGTATAAAAACAACAAAAATTTCAAAAATCAATTGACGTATGGAGATTCATCTGCCAATTATCAAAAGACAAAAGGAGGAGGTTTCAAAAAATCCTATCCACCTTG
CTGCCATTGTGAGAAGAAAGGCCATCCACCATACAAGTGTTGGAGAAGACCTGACGCCTTCTGCTCCAAATGCAATCAACTTGGACATGAAGCTGTGATCTGCAAAGCCA
AAGATCTGGTGAAAGAAGTAGATGCACAGGTCGTTGATCAAGAAGAAGAAGAAGAAGAAGAAGATCAATTGTTTATGGTCACTTCTTCCTCAAGCAAAGAATCAAGCGAG
AGCTGGTTGATTGACAGTGGGTGCACAAATCACATGACATATGACAAGGAGTCTTTTGAGGAATTAAGAGACACCGAGGATAAGAGAGTGAGGATTGGCAACGGTGAACA
CTTGGAAGTCAGGGAAAAGGCACAGTAG
Protein sequenceShow/hide protein sequence
MKVLNLIRDFELQKMKESESVKEYSDRLLSIANKVRLLGSVLNDSKIVEKLLVTLPEKFEAIITTLENTKDLSKISLTELLNALQAQEQRRSMRQEGVIEGALLVKHQDS
RRYKNNKNFKNQLTYGDSSANYQKTKGGGFKKSYPPCCHCEKKGHPPYKCWRRPDAFCSKCNQLGHEAVICKAKDLVKEVDAQVVDQEEEEEEEDQLFMVTSSSSKESSE
SWLIDSGCTNHMTYDKESFEELRDTEDKRVRIGNGEHLEVREKAQ