CuGenDBv2

CmoCh02G005920 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh02G005920
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: Cmo_Chr02:3589973..3591599
RNA-Seq Expression: CmoCh02G005920
Synteny: CmoCh02G005920
Gene Ontology terms:
  GO:0006259 - DNA metabolic process (biological process)
  GO:0016310 - phosphorylation (biological process)
  GO:0034645 - cellular macromolecule biosynthetic process (biological process)
  GO:0016020 - membrane (cellular component)
  GO:0000166 - nucleotide binding (molecular function)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004672 - protein kinase activity (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR025724 - GAG-pre-integrase domain


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAF3643966.1 Pleiotropic drug resistance protein 1 [Capsicum annuum] | e value: 8.5e-43 | % identity: 57.59
Query:  SEEDIGDALVLSVDSSIESWILDSGASFHSSSNKELFRNFKSENF-EVYLANNKDLEIKGKGNVCIKTPAGNQWTLKNVRYIPCLKKNLISIGQLDSTG-
        S EDIG+AL+LSVDS +ES ILD GASFH   +KELF+NFK  NF +VYL +NK L I+ KG+VCIKTPAGNQWTL++VRYIP LKKNLISIGQLDSTG 
Subjt:  SEEDIGDALVLSVDSSIESWILDSGASFHSSSNKELFRNFKSENF-EVYLANNKDLEIKGKGNVCIKTPAGNQWTLKNVRYIPCLKKNLISIGQLDSTG-

Query:  --------------------------------CMNIAAVAESASSSSLLCQRLEHMSAKGMKRLAAKGVLEGLKSVDVGRCENYVMSKQKQ
                                        C+N+A VAE AS   L   RL HMSAK MK LAAKG LEG+K VD+G CE+ VM KQK+
Subjt:  --------------------------------CMNIAAVAESASSSSLLCQRLEHMSAKGMKRLAAKGVLEGLKSVDVGRCENYVMSKQKQ

KAF3643966.1 Pleiotropic drug resistance protein 1 [Capsicum annuum] | e value: 4.5e-04 | % identity: 84.85
Query:  SNLMKALSNMYEKPSTMNKVYLMRRLFNLQMSE
        S+L+KALSNMYEKPS  NKVYLMRRLFNLQM E
Subjt:  SNLMKALSNMYEKPSTMNKVYLMRRLFNLQMSE

KAF3665090.1 putative 50S ribosomal protein L18-like [Capsicum annuum] | e value: 1.5e-44 | % identity: 60.11
Query:  SEEDIGDALVLSVDSSIESWILDSGASFHSSSNKELFRNFKSENF-EVYLANNKDLEIKGKGNVCIKTPAGNQWTLKNVRYIPCLKKNLISIGQLDST--
        S EDIGDAL+LSVDS IES ILDSGASFHSS +KE F+NFKS NF +VYLA+NK L I+ KG+VCIKT AGNQWTL++VRYIP  KKNLISIGQLDST  
Subjt:  SEEDIGDALVLSVDSSIESWILDSGASFHSSSNKELFRNFKSENF-EVYLANNKDLEIKGKGNVCIKTPAGNQWTLKNVRYIPCLKKNLISIGQLDST--

Query:  -------------------------------GCMNIAAVAESASSSSLLCQRLEHMSAKGMKRLAAKGVLEGLKSVDVGRCENYVMSK
                                       GC+N+  VAESASS  L   RL HMSAKG+K LAAKG L+GLKS D+G CE  VM K
Subjt:  -------------------------------GCMNIAAVAESASSSSLLCQRLEHMSAKGMKRLAAKGVLEGLKSVDVGRCENYVMSK

KAF3680274.1 putative 50S ribosomal protein L18-like [Capsicum annuum] | e value: 5.5e-58 | % identity: 57.09
Query:  NLMKALSNMYEKPSTMNKVYLMRRLFNLQMSE-------------------------ED--------------------------IGDALVLSVDSSIES
        +L+KALSNMYE PS +NKVYLMRRLFNLQM E                         ED                          IGD+L+LSVDS +ES
Subjt:  NLMKALSNMYEKPSTMNKVYLMRRLFNLQMSE-------------------------ED--------------------------IGDALVLSVDSSIES

Query:  WILDSGASFHSSSNKELFRNFKSENF-EVYLANNKDLEIKGKGNVCIKTPAGNQWTLKNVRYIPCLKKNLISIGQLDST--------GCMNIAAVAESAS
        WILDSGASFHSS +KELF+NFKS NF +VYLA+NK L IKGK +VCIKTPAGNQWTL++VRYIP LKKNLI +GQLDST        GC+N+ +V ESAS
Subjt:  WILDSGASFHSSSNKELFRNFKSENF-EVYLANNKDLEIKGKGNVCIKTPAGNQWTLKNVRYIPCLKKNLISIGQLDST--------GCMNIAAVAESAS

Query:  SSSLLCQRLEHMSAKGMKRLAAKGVLEGLKSVDVGRCENYVMSKQKQ
         S L   RL HMSAKGMK LAAKG L+GLKSVD+G C++ VM KQK+
Subjt:  SSSLLCQRLEHMSAKGMKRLAAKGVLEGLKSVDVGRCENYVMSKQKQ

VFQ92713.1 unnamed protein product [Cuscuta campestris] | e value: 3.0e-48 | % identity: 52.67
Query:  SNLMKALSNMYEKPSTMNKVYLM------------------RRLFNLQMSE------------------------------------EDIGDALVLSVDS
        + LMKALSNMYEKP  MNK  ++                  R    L+  E                                    EDIGDAL+LSVDS
Subjt:  SNLMKALSNMYEKPSTMNKVYLM------------------RRLFNLQMSE------------------------------------EDIGDALVLSVDS

Query:  SIESWILDSGASFHSSSNKELFRNFKSENF-EVYLANNKDLEIKGKGNVCIKTPAGNQWTLKNVRYIPCLKKNLISIGQLDSTGCMNIAAVAESASSSSL
         +ESWILDSGASFHSS +KELF+NFKS NF +VYLA+NK L I+GKG+V IKTP GNQWTLK+ RYIP LKKNLISI      GC+N+AA A+  SSSSL
Subjt:  SIESWILDSGASFHSSSNKELFRNFKSENF-EVYLANNKDLEIKGKGNVCIKTPAGNQWTLKNVRYIPCLKKNLISIGQLDSTGCMNIAAVAESASSSSL

Query:  LCQRLEHMSAKGMKRLAAKGVLEGLKSVDVGRCENYVMSKQKQ
           RL HMS KGM+ LAAKG LEGL SVD+G CE+ VM KQK+
Subjt:  LCQRLEHMSAKGMKRLAAKGVLEGLKSVDVGRCENYVMSKQKQ

VFR02734.1 unnamed protein product [Cuscuta campestris] | e value: 2.2e-43 | % identity: 47.12
Query:  SEEDIGDALVLSVDSSIESWILDSGASFHSSSNKELFRNFKSENF-EVYLANNKDLEIKGKGNVCIKTPAGNQWTLKNVRYIPCLKKNLISIGQLD----
        S EDIGDAL+LSVDS +ESWILDSGASFHSS +KELF+NFKS NF +VYLA+NK L I+GKG+V IKTP GNQWTLK+VRYIP LKKNLISIG+LD    
Subjt:  SEEDIGDALVLSVDSSIESWILDSGASFHSSSNKELFRNFKSENF-EVYLANNKDLEIKGKGNVCIKTPAGNQWTLKNVRYIPCLKKNLISIGQLD----

Query:  -----------------------------STGCMNIAAVAESASSSSLLCQRLEHMSAKGMK--RLAAKGVLEGLKSVDVGRCENYVMSKQKQENSPSDV
                                     +TGC+N+AA A+  SSS    ++   + AK +K   +     L G +  D          K ++     DV
Subjt:  -----------------------------STGCMNIAAVAESASSSSLLCQRLEHMSAKGMK--RLAAKGVLEGLKSVDVGRCENYVMSKQKQENSPSDV

Query:  VADTHETPETTAEEPDVEHGSKTMKQVGVELELQENSPSDVVADTHETSKTTVEESAVEQVTPELVLRRSSSTIRVPD
          D     E+   +   +   +T KQVGVE+EL++++P +V A+T  T  T  EE  VEQVTPE VLRRSS   RVPD
Subjt:  VADTHETPETTAEEPDVEHGSKTMKQVGVELELQENSPSDVVADTHETSKTTVEESAVEQVTPELVLRRSSSTIRVPD

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A2G3CAD5 Uncharacterized protein | e value: 8.0e-39 | % identity: 56.18
Query:  DSSIESWILDSGASFHSSSNKELFRNFKSENF-EVYLANNKDLEIKGKGNVCIKTPAGNQWTLKNVRYIPCLKKNLISIGQLDST---------------
        DS +ES ILDSGASFHSS +KELF+NFK  NF +VYL +NK L I+ KG+VCIKTPAGNQWTL++VRYIP  KKNLIS+GQLDST               
Subjt:  DSSIESWILDSGASFHSSSNKELFRNFKSENF-EVYLANNKDLEIKGKGNVCIKTPAGNQWTLKNVRYIPCLKKNLISIGQLDST---------------

Query:  ------------------GCMNIAAVAESASSSSLLCQRLEHMSAKGMKRLAAKGVLEGLKSVDVGRCENYVMSKQKQ
                          GC+N+  VAE AS S L   RL HMSAK MK L AK  LEG+K VD+G CE+YVM KQK+
Subjt:  ------------------GCMNIAAVAESASSSSLLCQRLEHMSAKGMKRLAAKGVLEGLKSVDVGRCENYVMSKQKQ

A0A484KC47 CCHC-type domain-containing protein | e value: 1.6e-42 | % identity: 51.85
Query:  SEEDIGDALVLSVDSSIESWILDSGASFHSSSNKELFRNFKSENF-EVYLANNKDLEIKGKGNVCIKTPAGNQWTLKNVRYIPCLKKNLISIGQLDSTGC
        S EDIGDAL+LSVDS +ESWILDSGASFHSS +KE F+NFKS NF +VYLA+NK L I+GKG+V IKTPAGNQWTLK+VRYIP LKKNLISIGQLD+ G 
Subjt:  SEEDIGDALVLSVDSSIESWILDSGASFHSSSNKELFRNFKSENF-EVYLANNKDLEIKGKGNVCIKTPAGNQWTLKNVRYIPCLKKNLISIGQLDSTGC

Query:  MNIAAVAESASSSSLLCQRLEHMSAKGMKRLAAKGVLEGLKSVDVGRCENYVMSKQKQENSPSDVVADTHETPETTAEEPDVEHGSKTMKQVGVELELQE
              AE    S  + +              A  V  G K   +     +   K ++     DV  D     +    E  V    +T KQVGVE+EL++
Subjt:  MNIAAVAESASSSSLLCQRLEHMSAKGMKRLAAKGVLEGLKSVDVGRCENYVMSKQKQENSPSDVVADTHETPETTAEEPDVEHGSKTMKQVGVELELQE

Query:  NSPSDVVADTHETSKTTVEESAVEQVTPELVLRRSSSTIRVPD
        ++P +V A+T  T  T VEE  VEQVTPE VLRRSS   RVPD
Subjt:  NSPSDVVADTHETSKTTVEESAVEQVTPELVLRRSSSTIRVPD

A0A484MUU4 gag_pre-integrs domain-containing protein | e value: 1.5e-48 | % identity: 52.67
Query:  SNLMKALSNMYEKPSTMNKVYLM------------------RRLFNLQMSE------------------------------------EDIGDALVLSVDS
        + LMKALSNMYEKP  MNK  ++                  R    L+  E                                    EDIGDAL+LSVDS
Subjt:  SNLMKALSNMYEKPSTMNKVYLM------------------RRLFNLQMSE------------------------------------EDIGDALVLSVDS

Query:  SIESWILDSGASFHSSSNKELFRNFKSENF-EVYLANNKDLEIKGKGNVCIKTPAGNQWTLKNVRYIPCLKKNLISIGQLDSTGCMNIAAVAESASSSSL
         +ESWILDSGASFHSS +KELF+NFKS NF +VYLA+NK L I+GKG+V IKTP GNQWTLK+ RYIP LKKNLISI      GC+N+AA A+  SSSSL
Subjt:  SIESWILDSGASFHSSSNKELFRNFKSENF-EVYLANNKDLEIKGKGNVCIKTPAGNQWTLKNVRYIPCLKKNLISIGQLDSTGCMNIAAVAESASSSSL

Query:  LCQRLEHMSAKGMKRLAAKGVLEGLKSVDVGRCENYVMSKQKQ
           RL HMS KGM+ LAAKG LEGL SVD+G CE+ VM KQK+
Subjt:  LCQRLEHMSAKGMKRLAAKGVLEGLKSVDVGRCENYVMSKQKQ

A0A484NNM3 CCHC-type domain-containing protein | e value: 1.1e-43 | % identity: 47.12
Query:  SEEDIGDALVLSVDSSIESWILDSGASFHSSSNKELFRNFKSENF-EVYLANNKDLEIKGKGNVCIKTPAGNQWTLKNVRYIPCLKKNLISIGQLD----
        S EDIGDAL+LSVDS +ESWILDSGASFHSS +KELF+NFKS NF +VYLA+NK L I+GKG+V IKTP GNQWTLK+VRYIP LKKNLISIG+LD    
Subjt:  SEEDIGDALVLSVDSSIESWILDSGASFHSSSNKELFRNFKSENF-EVYLANNKDLEIKGKGNVCIKTPAGNQWTLKNVRYIPCLKKNLISIGQLD----

Query:  -----------------------------STGCMNIAAVAESASSSSLLCQRLEHMSAKGMK--RLAAKGVLEGLKSVDVGRCENYVMSKQKQENSPSDV
                                     +TGC+N+AA A+  SSS    ++   + AK +K   +     L G +  D          K ++     DV
Subjt:  -----------------------------STGCMNIAAVAESASSSSLLCQRLEHMSAKGMK--RLAAKGVLEGLKSVDVGRCENYVMSKQKQENSPSDV

Query:  VADTHETPETTAEEPDVEHGSKTMKQVGVELELQENSPSDVVADTHETSKTTVEESAVEQVTPELVLRRSSSTIRVPD
          D     E+   +   +   +T KQVGVE+EL++++P +V A+T  T  T  EE  VEQVTPE VLRRSS   RVPD
Subjt:  VADTHETPETTAEEPDVEHGSKTMKQVGVELELQENSPSDVVADTHETSKTTVEESAVEQVTPELVLRRSSSTIRVPD

A0A6A3B3K8 Detected protein of confused Function | e value: 1.2e-34 | % identity: 40.86
Query:  MKALSNMYEKPSTMNKVYLMRRLFNLQMSE----------------------------------------------------------------------
        M ALS+MYEKPS  NKV+LMRRLFNL+M+E                                                                      
Subjt:  MKALSNMYEKPSTMNKVYLMRRLFNLQMSE----------------------------------------------------------------------

Query:  -EDIGDALVLSVDSSIESWILDSGASFHSSSNKELFRNFKSENF-EVYLANNKDLEIKGKGNVCIKTPAGNQWTLKNVRYIPCLKKNLISIGQLDSTGCM
         E+ GDA++LSV+S IESWILDSGASFHS+  +E+  N+ S  F +V+LA+++ L+I GKG++ +K P    W LK VR+IP LK+NLIS+GQLD     
Subjt:  -EDIGDALVLSVDSSIESWILDSGASFHSSSNKELFRNFKSENF-EVYLANNKDLEIKGKGNVCIKTPAGNQWTLKNVRYIPCLKKNLISIGQLDSTGCM

Query:  NIAAVAESASSSSLLCQRLEHMSAKGMKRLAAKGVLEGLKSVDVGRCENYVMSKQKQ
            VA+    S+L  QRL HMS KGMK L +KG L  LK+VDVG CE+ +  KQK+
Subjt:  NIAAVAESASSSSLLCQRLEHMSAKGMKRLAAKGVLEGLKSVDVGRCENYVMSKQKQ

SwissProt top hits (columns: hit, e value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 2.9e-09 | % identity: 28.49
Query:  WILDSGASFHSSSNKELFRNFKSENF-EVYLANNKDLEIKGKGNVCIKTPAGNQWTLKNVRYIPCLKKNLISIGQLDSTG--------------------
        W++D+ AS H++  ++LF  + + +F  V + N    +I G G++CIKT  G    LK+VR++P L+ NLIS   LD  G                    
Subjt:  WILDSGASFHSSSNKELFRNFKSENF-EVYLANNKDLEIKGKGNVCIKTPAGNQWTLKNVRYIPCLKKNLISIGQLDSTG--------------------

Query:  ---------------CMNIAAVAESASSSSLLCQRLEHMSAKGMKRLAAKGVLEGLKSVDVGRCENYVMSKQ
                       C      A+   S  L  +R+ HMS KG++ LA K ++   K   V  C+  +  KQ
Subjt:  ---------------CMNIAAVAESASSSSLLCQRLEHMSAKGMKRLAAKGVLEGLKSVDVGRCENYVMSKQ

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGTCAAATCTGATGAAGGCGCTGTCGAATATGTACGAAAAACCGTCGACTATGAACAAGGTGTATTTGATGCGGAGATTGTTCAATCTACAGATGTCTGAAGAA
GACATTGGGGATGCTCTAGTCCTCAGCGTGGACAGTTCGATTGAATCCTGGATTTTGGATTCAGGTGCATCTTTTCATTCTTCTTCAAATAAAGAGTTGTTTCGG
AATTTCAAGTCTGAAAATTTCGAGGTGTATCTTGCCAACAACAAAGATTTGGAGATTAAAGGAAAAGGGAATGTTTGCATAAAAACTCCGGCAGGAAATCAGTGG
ACATTAAAGAATGTCAGATATATTCCTTGTCTCAAGAAAAACCTGATCTCTATTGGTCAGTTGGACAGCACAGGGTGTATGAACATAGCTGCTGTTGCTGAGAGT
GCTTCAAGTTCAAGTCTATTGTGCCAAAGACTTGAACATATGAGCGCGAAAGGAATGAAGAGGCTGGCTGCGAAAGGAGTTTTAGAAGGTCTGAAATCTGTTGAT
GTGGGTCGTTGTGAGAACTACGTTATGAGCAAGCAGAAACAAGAAAACTCACCTAGTGATGTTGTAGCAGATACTCATGAAACTCCTGAGACTACTGCTGAGGAA
CCAGATGTGGAGCATGGTTCCAAGACCATGAAGCAAGTGGGAGTTGAGCTTGAGTTGCAAGAAAACTCACCTAGTGATGTTGTAGCAGATACTCATGAAACTTCT
AAGACTACTGTAGAGGAATCAGCAGTGGAGCAAGTGACACCTGAACTGGTGTTGAGAAGATCATCCAGCACTATCAGAGTACCAGATATGTATGTACCTTCATTA
CACTATCTGTTGCTGATGAAAGGGAACCACAACCCTTTGAGGAGGCCCTACAGTTGGAGGATACAACCAAGTGAGAGCAAGCCATGGATGATGGGACGTCTAGGC
TTCAAAAATACGTTGTACTGA
mRNA sequence
ATGTCAAATCTGATGAAGGCGCTGTCGAATATGTACGAAAAACCGTCGACTATGAACAAGGTGTATTTGATGCGGAGATTGTTCAATCTACAGATGTCTGAAGAA
GACATTGGGGATGCTCTAGTCCTCAGCGTGGACAGTTCGATTGAATCCTGGATTTTGGATTCAGGTGCATCTTTTCATTCTTCTTCAAATAAAGAGTTGTTTCGG
AATTTCAAGTCTGAAAATTTCGAGGTGTATCTTGCCAACAACAAAGATTTGGAGATTAAAGGAAAAGGGAATGTTTGCATAAAAACTCCGGCAGGAAATCAGTGG
ACATTAAAGAATGTCAGATATATTCCTTGTCTCAAGAAAAACCTGATCTCTATTGGTCAGTTGGACAGCACAGGGTGTATGAACATAGCTGCTGTTGCTGAGAGT
GCTTCAAGTTCAAGTCTATTGTGCCAAAGACTTGAACATATGAGCGCGAAAGGAATGAAGAGGCTGGCTGCGAAAGGAGTTTTAGAAGGTCTGAAATCTGTTGAT
GTGGGTCGTTGTGAGAACTACGTTATGAGCAAGCAGAAACAAGAAAACTCACCTAGTGATGTTGTAGCAGATACTCATGAAACTCCTGAGACTACTGCTGAGGAA
CCAGATGTGGAGCATGGTTCCAAGACCATGAAGCAAGTGGGAGTTGAGCTTGAGTTGCAAGAAAACTCACCTAGTGATGTTGTAGCAGATACTCATGAAACTTCT
AAGACTACTGTAGAGGAATCAGCAGTGGAGCAAGTGACACCTGAACTGGTGTTGAGAAGATCATCCAGCACTATCAGAGTACCAGATATGTATGTACCTTCATTA
CACTATCTGTTGCTGATGAAAGGGAACCACAACCCTTTGAGGAGGCCCTACAGTTGGAGGATACAACCAAGTGAGAGCAAGCCATGGATGATGGGACGTCTAGGC
TTCAAAAATACGTTGTACTGA
Protein sequence
MSNLMKALSNMYEKPSTMNKVYLMRRLFNLQMSEEDIGDALVLSVDSSIESWILDSGASFHSSSNKELFRNFKSENFEVYLANNKDLEIKGKGNVCIKTPAGNQW
TLKNVRYIPCLKKNLISIGQLDSTGCMNIAAVAESASSSSLLCQRLEHMSAKGMKRLAAKGVLEGLKSVDVGRCENYVMSKQKQENSPSDVVADTHETPETTAEE
PDVEHGSKTMKQVGVELELQENSPSDVVADTHETSKTTVEESAVEQVTPELVLRRSSSTIRVPDMYVPSLHYLLLMKGNHNPLRRPYSWRIQPSESKPWMMGRLG
FKNTLY
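
The CDS and protein sequences above can be cross-checked by translating the CDS in frame. The sketch below is a minimal illustration, not part of the database record: it assumes the standard genetic code (NCBI translation table 1), and the names `CODON_TABLE` and `translate` are hypothetical helpers introduced here. Only the first and last seven codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), with codons
# enumerated in T, C, A, G order so they line up with the amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last seven codons of the CDS listed in this record.
first_codons = "ATGTCAAATCTGATGAAGGCG"
last_codons = "TTCAAAAATACGTTGTACTGA"

print(translate(first_codons))  # MSNLMKA
print(translate(last_codons))   # FKNTLY
```

Both outputs match the start (MSNLMKA) and end (FKNTLY) of the protein sequence listed above. Passing the full CDS, with its line breaks, to `translate` would be the complete version of the same check.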