CuGenDBv2

CmoCh09G012510 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh09G012510
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Transposable element protein
Genome location: Cmo_Chr09:10785634..10786476
RNA-Seq Expression: CmoCh09G012510
Synteny: CmoCh09G012510
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
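
The genome location listed above uses a `chromosome:start..end` notation. As a minimal sketch, the span can be parsed and its length computed as below. The function name `parse_location` is hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("Cmo_Chr09:10785634..10786476")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_length = end - start + 1
print(chrom, start, end, span_length)
```

Under that assumption the locus spans 843 bp, which is longer than the 585-nt CDS given under Sequences, so the gene model presumably includes some non-coding sequence.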


Homology
GenBank top hits | e value | % identity
BBH09161.1 hypothetical protein Prudu_021584 [Prunus dulcis] | 2.1e-41 | 42.63
Query:  MVENQTDRKIKKLRSDNGGEYTYDPFLKVCRDEGIVRHFAVPSKPQQNGVAERMNQTLIEKIRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGKTP
        M+E Q+ RKIK LRSDNGGEY  DPFLKVC+DEGIVRHF V   PQQNGVAERMN+TL+EK+RC+LS AGL KAFWAEA++YA HL+NRLP + N GKTP
Subjt:  MVENQTDRKIKKLRSDNGGEYTYDPFLKVCRDEGIVRHFAVPSKPQQNGVAERMNQTLIEKIRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGKTP

Query:  LEV------------------------------------------------------EKVVFSPDMVAPTGEPIDQ--VDNNSDVLEQEEQNLDEQSLEE
        +EV                                                      +K+V S D+       ++Q   ++ S  ++ +E++ D+   EE
Subjt:  LEV------------------------------------------------------EKVVFSPDMVAPTGEPIDQ--VDNNSDVLEQEEQNLDEQSLEE

Query:  QSLVNERVEEPESITNNRPRRVIRKPIRFDDTIAYAFSMI-DRVPNIRIEA
        ++   E  ++ +SI   R RR IRKP+RF D +AYA  +I D +P+   EA
Subjt:  QSLVNERVEEPESITNNRPRRVIRKPIRFDDTIAYAFSMI-DRVPNIRIEA

GFZ19218.1 hypothetical protein Acr_27g0009570 [Actinidia rufa] | 8.2e-41 | 79.61
Query:  MVENQTDRKIKKLRSDNGGEYTYDPFLKVCRDEGIVRHFAVPSKPQQNGVAERMNQTLIEKIRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGKTP
        MVENQT RKIKKLRSDNGGEY YDPFLK+C++EGIVRHF V   PQQNGVAERMN+TL+ K+RC+LS AGLSKAFWAEA+SYA HLVNRLP +G GGKTP
Subjt:  MVENQTDRKIKKLRSDNGGEYTYDPFLKVCRDEGIVRHFAVPSKPQQNGVAERMNQTLIEKIRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGKTP

Query:  LEV
        +EV
Subjt:  LEV

KAG8492333.1 hypothetical protein CXB51_009816 [Gossypium anomalum] | 1.2e-39 | 43.1
Query:  MVENQTDRKIKKLRSDNGGEYTYDPFLKVCRDEGIVRHFAVPSKPQQNGVAERMNQTLIEKIRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGKTP
        MVE QT +K+K+LRSDNG EY  DPFL+VC+DEGIVRHF V   PQQNGVAERMN+T++EK+RC+LS AGL K FWAE ++YA HL+NRLP +    KTP
Subjt:  MVENQTDRKIKKLRSDNGGEYTYDPFLKVCRDEGIVRHFAVPSKPQQNGVAERMNQTLIEKIRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGKTP

Query:  LEV--------------------------------EKVVFSPDMVAPTGEPIDQVDNNSD--------VLEQEEQNLDEQSLE----EQSLVNERVEEPE
        +E+                                 K+VFS D+       +   D+  D         +E E+ N D  ++E    E+    E +++ +
Subjt:  LEV--------------------------------EKVVFSPDMVAPTGEPIDQVDNNSD--------VLEQEEQNLDEQSLE----EQSLVNERVEEPE

Query:  SITNNRPRRVIRKPIRFDDTIAYAFSMI-DRVPNIRIEA
        SI   RPRR IRKP RFDD +AYA  +  D +P+   EA
Subjt:  SITNNRPRRVIRKPIRFDDTIAYAFSMI-DRVPNIRIEA

XP_022974553.1 uncharacterized protein LOC111473221 [Cucurbita maxima] | 2.1e-52 | 71.76
Query:  MNQTLIEKIRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGKTPLE-----------------------------VEKVVFSPDMVAPTGEPIDQVD
        MNQTLIEK+RCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGG+TPLE                             VEKVVFSPD+VAPT EPIDQV+
Subjt:  MNQTLIEKIRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGKTPLE-----------------------------VEKVVFSPDMVAPTGEPIDQVD

Query:  NNSDVLEQEEQNLDEQSLEEQSLVNERVEEPESITNNRPRRVIRKPIRFDDTIAYAFSMIDRVPNIRIEA
        NN DVLEQ+     EQSLEEQSLVNERVEEPESI  NRPRRVIRKP RFDDTIAYA S+ID VPN+RIEA
Subjt:  NNSDVLEQEEQNLDEQSLEEQSLVNERVEEPESITNNRPRRVIRKPIRFDDTIAYAFSMIDRVPNIRIEA

XP_023001310.1 uncharacterized protein LOC111495478 [Cucurbita maxima] | 2.4e-53 | 72.35
Query:  MNQTLIEKIRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGKTPLE-----------------------------VEKVVFSPDMVAPTGEPIDQVD
        MNQTLIEK+RCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGG+TPLE                             VEKVVFSPD+VAPT EPIDQV+
Subjt:  MNQTLIEKIRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGKTPLE-----------------------------VEKVVFSPDMVAPTGEPIDQVD

Query:  NNSDVLEQEEQNLDEQSLEEQSLVNERVEEPESITNNRPRRVIRKPIRFDDTIAYAFSMIDRVPNIRIEA
        NN DVLEQ+     EQSLEEQSLVNERVEEPESI  NRPRRVIRKP RFDDTIAYAFS+ID VPN+RIEA
Subjt:  NNSDVLEQEEQNLDEQSLEEQSLVNERVEEPESITNNRPRRVIRKPIRFDDTIAYAFSMIDRVPNIRIEA

TrEMBL top hits | e value | % identity
A0A2R6QHR4 Endonuclease | 3.7e-39 | 42.86
Query:  MVENQTDRKIKKLRSDNGGEYTYDPFLKVCRDEGIVRHFAVPSKPQQNGVAERMNQTLIEKIRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGKTP
        M+E QT RKIKKLRSDNGGEY  DPF  VC  EGIVRHF +   PQQNGVAERMN+TL++K+RC++  AGLSKAFWAEA++YA HL NRLP +   GKTP
Subjt:  MVENQTDRKIKKLRSDNGGEYTYDPFLKVCRDEGIVRHFAVPSKPQQNGVAERMNQTLIEKIRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGKTP

Query:  LE-------VEKVVFSPDMVAPTGEPI-----DQVDNNSDVL--------EQEEQNLDEQSLEEQSLVNERV----------------------------
        +E        +K +F        G  +      ++ N+ DV           E++NL   S ++  LV+  V                            
Subjt:  LE-------VEKVVFSPDMVAPTGEPI-----DQVDNNSDVL--------EQEEQNLDEQSLEEQSLVNERV----------------------------

Query:  --EEPESITNNRPRRVIRKPIRFDDTIAYAFSMIDRVP
          +  ESI  +RP+RVIR+P R+ DT+AYA  +I+ VP
Subjt:  --EEPESITNNRPRRVIRKPIRFDDTIAYAFSMIDRVP

A0A4Y1RXT4 Uncharacterized protein | 1.0e-41 | 42.63
Query:  MVENQTDRKIKKLRSDNGGEYTYDPFLKVCRDEGIVRHFAVPSKPQQNGVAERMNQTLIEKIRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGKTP
        M+E Q+ RKIK LRSDNGGEY  DPFLKVC+DEGIVRHF V   PQQNGVAERMN+TL+EK+RC+LS AGL KAFWAEA++YA HL+NRLP + N GKTP
Subjt:  MVENQTDRKIKKLRSDNGGEYTYDPFLKVCRDEGIVRHFAVPSKPQQNGVAERMNQTLIEKIRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGKTP

Query:  LEV------------------------------------------------------EKVVFSPDMVAPTGEPIDQ--VDNNSDVLEQEEQNLDEQSLEE
        +EV                                                      +K+V S D+       ++Q   ++ S  ++ +E++ D+   EE
Subjt:  LEV------------------------------------------------------EKVVFSPDMVAPTGEPIDQ--VDNNSDVLEQEEQNLDEQSLEE

Query:  QSLVNERVEEPESITNNRPRRVIRKPIRFDDTIAYAFSMI-DRVPNIRIEA
        ++   E  ++ +SI   R RR IRKP+RF D +AYA  +I D +P+   EA
Subjt:  QSLVNERVEEPESITNNRPRRVIRKPIRFDDTIAYAFSMI-DRVPNIRIEA

A0A6J1IGL4 uncharacterized protein LOC111473221 | 1.0e-52 | 71.76
Query:  MNQTLIEKIRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGKTPLE-----------------------------VEKVVFSPDMVAPTGEPIDQVD
        MNQTLIEK+RCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGG+TPLE                             VEKVVFSPD+VAPT EPIDQV+
Subjt:  MNQTLIEKIRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGKTPLE-----------------------------VEKVVFSPDMVAPTGEPIDQVD

Query:  NNSDVLEQEEQNLDEQSLEEQSLVNERVEEPESITNNRPRRVIRKPIRFDDTIAYAFSMIDRVPNIRIEA
        NN DVLEQ+     EQSLEEQSLVNERVEEPESI  NRPRRVIRKP RFDDTIAYA S+ID VPN+RIEA
Subjt:  NNSDVLEQEEQNLDEQSLEEQSLVNERVEEPESITNNRPRRVIRKPIRFDDTIAYAFSMIDRVPNIRIEA

A0A6J1KKU6 uncharacterized protein LOC111495478 | 1.2e-53 | 72.35
Query:  MNQTLIEKIRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGKTPLE-----------------------------VEKVVFSPDMVAPTGEPIDQVD
        MNQTLIEK+RCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGG+TPLE                             VEKVVFSPD+VAPT EPIDQV+
Subjt:  MNQTLIEKIRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGKTPLE-----------------------------VEKVVFSPDMVAPTGEPIDQVD

Query:  NNSDVLEQEEQNLDEQSLEEQSLVNERVEEPESITNNRPRRVIRKPIRFDDTIAYAFSMIDRVPNIRIEA
        NN DVLEQ+     EQSLEEQSLVNERVEEPESI  NRPRRVIRKP RFDDTIAYAFS+ID VPN+RIEA
Subjt:  NNSDVLEQEEQNLDEQSLEEQSLVNERVEEPESITNNRPRRVIRKPIRFDDTIAYAFSMIDRVPNIRIEA

A0A7J0H864 Integrase catalytic domain-containing protein | 4.0e-41 | 79.61
Query:  MVENQTDRKIKKLRSDNGGEYTYDPFLKVCRDEGIVRHFAVPSKPQQNGVAERMNQTLIEKIRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGKTP
        MVENQT RKIKKLRSDNGGEY YDPFLK+C++EGIVRHF V   PQQNGVAERMN+TL+ K+RC+LS AGLSKAFWAEA+SYA HLVNRLP +G GGKTP
Subjt:  MVENQTDRKIKKLRSDNGGEYTYDPFLKVCRDEGIVRHFAVPSKPQQNGVAERMNQTLIEKIRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGKTP

Query:  LEV
        +EV
Subjt:  LEV

SwissProt top hits | e value | % identity
P04146 Copia protein | 8.0e-15 | 43.3
Query:  KIKKLRSDNGGEYTYDPFLKVCRDEGIVRHFAVPSKPQQNGVAERMNQTLIEKIRCILSQAGLSKAFWAEALSYAVHLVNRLPVSG--NGGKTPLEV
        K+  L  DNG EY  +   + C  +GI  H  VP  PQ NGV+ERM +T+ EK R ++S A L K+FW EA+  A +L+NR+P     +  KTP E+
Subjt:  KIKKLRSDNGGEYTYDPFLKVCRDEGIVRHFAVPSKPQQNGVAERMNQTLIEKIRCILSQAGLSKAFWAEALSYAVHLVNRLPVSG--NGGKTPLEV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 3.0e-22 | 53.85
Query:  MVENQTDRKIKKLRSDNGGEYTYDPFLKVCRDEGIVRHFAVPSKPQQNGVAERMNQTLIEKIRCILSQAGLSKAFWAEALSYAVHLVNRLP
        +VE +T RK+K+LRSDNGGEYT   F + C   GI     VP  PQ NGVAERMN+T++EK+R +L  A L K+FW EA+  A +L+NR P
Subjt:  MVENQTDRKIKKLRSDNGGEYTYDPFLKVCRDEGIVRHFAVPSKPQQNGVAERMNQTLIEKIRCILSQAGLSKAFWAEALSYAVHLVNRLP

Q12491 Transposon Ty2-B Gag-Pol polyprotein | 1.6e-07 | 25.2
Query:  VENQTDRKIKKLRSDNGGEYTYDPFLKVCRDEGIVRHFAVPSKPQQNGVAERMNQTLIEKIRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGKTPL
        ++NQ + ++  ++ D G EYT     K   + GI   +   +  + +GVAER+N+TL+   R +L  +GL    W  A+ ++  + N L    N      
Subjt:  VENQTDRKIKKLRSDNGGEYTYDPFLKVCRDEGIVRHFAVPSKPQQNGVAERMNQTLIEKIRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGKTPL

Query:  EVEKVVFSPDMVAPTGEPIDQVDNNSD
                   + P G+P+   ++N D
Subjt:  EVEKVVFSPDMVAPTGEPIDQVDNNSD

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 5.0e-09 | 34.07
Query:  MVENQTDRKIKKLRSDNGGEYTYDPFLKVCRDEGIVRHFAVPSKPQQNGVAERMNQTLIEKIRCILSQAGLSKAFWAEALSYAVHLVNRLP
        ++EN+   +I    SDNGGE+      +     GI    + P  P+ NG++ER ++ ++E    +LS A + K +W  A + AV+L+NRLP
Subjt:  MVENQTDRKIKKLRSDNGGEYTYDPFLKVCRDEGIVRHFAVPSKPQQNGVAERMNQTLIEKIRCILSQAGLSKAFWAEALSYAVHLVNRLP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 2.7e-10 | 38.95
Query:  MVENQTDRKIKKLRSDNGGEYTYDPFLKVCRD----EGIVRHFAVPSKPQQNGVAERMNQTLIEKIRCILSQAGLSKAFWAEALSYAVHLVNRLP
        +VEN+   +I  L SDNGGE+       V RD     GI    + P  P+ NG++ER ++ ++E    +LS A + K +W  A S AV+L+NRLP
Subjt:  MVENQTDRKIKKLRSDNGGEYTYDPFLKVCRD----EGIVRHFAVPSKPQQNGVAERMNQTLIEKIRCILSQAGLSKAFWAEALSYAVHLVNRLP

Arabidopsis top hits | e value | % identity
ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | 9.1e-06 | 44
Query:  MNQTLIEKIRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGKTPLEV
        MN+T+IEK+R +L + GL K F A+A + AVH++N+ P +      P EV
Subjt:  MNQTLIEKIRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGKTPLEV
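
The % identity values listed for each hit can be related to the alignment midlines. As a rough sketch, assuming the usual BLAST-style convention that a letter on the midline marks an identical residue, `+` marks a conservative substitution, and identity is the number of identical columns divided by the alignment length, the value for the ATMG00710.1 alignment above can be recomputed as follows. The helper name `percent_identity` is hypothetical.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percentage of alignment columns marked as identical on the midline.

    Assumption: a letter on the midline means an identical residue; '+' and
    spaces are non-identical columns. The query line (gaps included) defines
    the alignment length, since trailing midline spaces may have been trimmed.
    """
    padded = midline.ljust(len(query))
    identical = sum(1 for ch in padded if ch.isalpha())
    return 100.0 * identical / len(query)

# Query and midline of the ATMG00710.1 alignment, copied from the page.
query = "MNQTLIEKIRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGKTPLEV"
midline = "MN+T+IEK+R +L + GL K F A+A + AVH++N+ P +      P EV"

identity = percent_identity(query, midline)
print(round(identity, 2))
```

For this alignment, 22 of 50 columns carry a letter on the midline, which agrees with the listed value of 44.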


Sequences
CDS sequence
ATGGTAGAGAATCAGACGGACAGGAAAATCAAAAAGTTGAGATCAGACAACGGTGGAGAATATACTTATGATCCATTCCTTAAAGTATGTCGAGATGAAGGGATCGTTCG
ACACTTCGCTGTTCCTAGTAAGCCACAACAAAATGGAGTTGCTGAGAGAATGAACCAGACATTAATAGAGAAGATTCGATGCATTTTGTCTCAAGCAGGATTGAGTAAGG
CATTTTGGGCTGAGGCTCTCAGCTATGCAGTTCACCTGGTGAATCGTTTACCTGTTTCTGGAAATGGTGGGAAAACTCCGCTTGAGGTGGAGAAGGTGGTGTTCTCTCCT
GATATGGTTGCTCCTACTGGAGAACCTATTGATCAGGTAGATAATAACTCTGATGTCTTAGAACAGGAAGAGCAAAACCTTGATGAGCAAAGCCTTGAGGAGCAAAGCCT
TGTAAATGAAAGAGTGGAGGAACCTGAGTCTATCACCAATAATAGACCACGAAGGGTAATTCGAAAACCTATAAGGTTTGATGATACGATAGCATATGCTTTCTCTATGA
TTGATAGAGTTCCCAACATACGTATTGAGGCTTGA
mRNA sequence
ATGGTAGAGAATCAGACGGACAGGAAAATCAAAAAGTTGAGATCAGACAACGGTGGAGAATATACTTATGATCCATTCCTTAAAGTATGTCGAGATGAAGGGATCGTTCG
ACACTTCGCTGTTCCTAGTAAGCCACAACAAAATGGAGTTGCTGAGAGAATGAACCAGACATTAATAGAGAAGATTCGATGCATTTTGTCTCAAGCAGGATTGAGTAAGG
CATTTTGGGCTGAGGCTCTCAGCTATGCAGTTCACCTGGTGAATCGTTTACCTGTTTCTGGAAATGGTGGGAAAACTCCGCTTGAGGTGGAGAAGGTGGTGTTCTCTCCT
GATATGGTTGCTCCTACTGGAGAACCTATTGATCAGGTAGATAATAACTCTGATGTCTTAGAACAGGAAGAGCAAAACCTTGATGAGCAAAGCCTTGAGGAGCAAAGCCT
TGTAAATGAAAGAGTGGAGGAACCTGAGTCTATCACCAATAATAGACCACGAAGGGTAATTCGAAAACCTATAAGGTTTGATGATACGATAGCATATGCTTTCTCTATGA
TTGATAGAGTTCCCAACATACGTATTGAGGCTTGA
Protein sequence
MVENQTDRKIKKLRSDNGGEYTYDPFLKVCRDEGIVRHFAVPSKPQQNGVAERMNQTLIEKIRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGKTPLEVEKVVFSP
DMVAPTGEPIDQVDNNSDVLEQEEQNLDEQSLEEQSLVNERVEEPESITNNRPRRVIRKPIRFDDTIAYAFSMIDRVPNIRIEA
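
The relationship between the CDS and the protein sequence above can be checked by translating the CDS. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); the page does not say which table was used, and the names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# CDS and protein sequence copied from the record above.
CDS = (
    "ATGGTAGAGAATCAGACGGACAGGAAAATCAAAAAGTTGAGATCAGACAACGGTGGAGAATATACTTATGATCCATTCCTTAAAGTATGTCGAGATGAAGGGATCGTTCG"
    "ACACTTCGCTGTTCCTAGTAAGCCACAACAAAATGGAGTTGCTGAGAGAATGAACCAGACATTAATAGAGAAGATTCGATGCATTTTGTCTCAAGCAGGATTGAGTAAGG"
    "CATTTTGGGCTGAGGCTCTCAGCTATGCAGTTCACCTGGTGAATCGTTTACCTGTTTCTGGAAATGGTGGGAAAACTCCGCTTGAGGTGGAGAAGGTGGTGTTCTCTCCT"
    "GATATGGTTGCTCCTACTGGAGAACCTATTGATCAGGTAGATAATAACTCTGATGTCTTAGAACAGGAAGAGCAAAACCTTGATGAGCAAAGCCTTGAGGAGCAAAGCCT"
    "TGTAAATGAAAGAGTGGAGGAACCTGAGTCTATCACCAATAATAGACCACGAAGGGTAATTCGAAAACCTATAAGGTTTGATGATACGATAGCATATGCTTTCTCTATGA"
    "TTGATAGAGTTCCCAACATACGTATTGAGGCTTGA"
)
PROTEIN = (
    "MVENQTDRKIKKLRSDNGGEYTYDPFLKVCRDEGIVRHFAVPSKPQQNGVAERMNQTLIEKIRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGKTPLEVEKVVFSP"
    "DMVAPTGEPIDQVDNNSDVLEQEEQNLDEQSLEEQSLVNERVEEPESITNNRPRRVIRKPIRFDDTIAYAFSMIDRVPNIRIEA"
)

translated = translate(CDS)
print(len(CDS), len(translated), translated == PROTEIN)
```

The CDS is 585 nt long (194 codons plus a terminal TGA stop), matching the 194-residue protein sequence.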