CuGenDBv2

CmoCh17G004800 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh17G004800
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Retrotransposon protein, putative, Ty1-copia subclass
Genome location: Cmo_Chr17:3546927..3547547
RNA-Seq Expression: CmoCh17G004800
Synteny: CmoCh17G004800
Gene Ontology terms: GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: IPR012337 - Ribonuclease H-like superfamily
                  IPR036397 - Ribonuclease H superfamily
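
The genome location above can be split into a sequence ID and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this); `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split 'SeqID:start..end' into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("Cmo_Chr17:3546927..3547547")
print(seqid, span_length(start, end))  # Cmo_Chr17 621
```

Under that assumption the span is 621 bp, which is 207 codons: the 206-residue protein listed in this record plus a stop codon.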


Homology
GenBank top hits (e value, % identity, alignment)
AAK92587.1 Putative retroelement [Oryza sativa Japonica Group] (e value: 2.8e-23, identity: 47.01%)
Query:  IGKQRRTPFSSQASYHVGEPLELVHGDVCVPIKLATLGGKTLFLLLVDDRSRFMWLILLQAKSEVAEAIKRSQARAEAECGKKMRLLHMDRCSAKSTTSW
        + KQRR PF  Q S+   E LELVHGD+C P+  AT GG+  FLLLVDD SR+MW+++L +K E A+AI+ +QA AEAECG+K+R+L  D     +   +
Subjt:  IGKQRRTPFSSQASYHVGEPLELVHGDVCVPIKLATLGGKTLFLLLVDDRSRFMWLILLQAKSEVAEAIKRSQARAEAECGKKMRLLHMDRCSAKSTTSW

Query:  HAAAPNDAL--------LPQQNGVVECQNQTSSG
         +   ++ +         PQQNGVVE +NQT  G
Subjt:  HAAAPNDAL--------LPQQNGVVECQNQTSSG

BAF23632.1 Os08g0389500 [Oryza sativa Japonica Group] (e value: 7.4e-24, identity: 51.82%)
Query:  IGKQRRTPFSSQASYHVGEPLELVHGDVCVPIKLATLGGKTLFLLLVDDRSRFMWLILLQAKSEVAEAIKRSQARAEAECGKKMRLLHMDR---------
        +GKQRR PF S+A Y   E LELVHGD+C P+  AT  G  LFLLLVDD SR+MWLILL +K + + AIKR  A AEAE G+K+R L  DR         
Subjt:  IGKQRRTPFSSQASYHVGEPLELVHGDVCVPIKLATLGGKTLFLLLVDDRSRFMWLILLQAKSEVAEAIKRSQARAEAECGKKMRLLHMDR---------

Query:  --CSAKSTTSWHAAAPNDALLPQQNGVVECQNQTSSG
            A+     H  AP     PQQNGVVE +NQT  G
Subjt:  --CSAKSTTSWHAAAPNDALLPQQNGVVECQNQTSSG

CCI55340.1 PH01B019A14.9 [Phyllostachys edulis] (e value: 2.8e-23, identity: 48.2%)
Query:  EQAVRRVSIGKQRRTPFSSQASYHVGEPLELVHGDVCVPIKLATLGGKTLFLLLVDDRSRFMWLILLQAKSEVAEAIKRSQARAEAECGKKMRLLHMDRC
        EQ      + K RR PF  QAS+   E LELVHGD+C P+  AT GG+  FLLLVDD SR+MW +LL  K+  A+AIK  QA AEAECG+K+R+L  D  
Subjt:  EQAVRRVSIGKQRRTPFSSQASYHVGEPLELVHGDVCVPIKLATLGGKTLFLLLVDDRSRFMWLILLQAKSEVAEAIKRSQARAEAECGKKMRLLHMDRC

Query:  SAKSTTSWHAAAPNDAL--------LPQQNGVVECQNQT
           +   + A   N+ +         PQQNGVVE +NQT
Subjt:  SAKSTTSWHAAAPNDAL--------LPQQNGVVECQNQT

EEE52905.1 hypothetical protein OsJ_35506 [Oryza sativa Japonica Group] (e value: 2.8e-23, identity: 47.01%)
Query:  IGKQRRTPFSSQASYHVGEPLELVHGDVCVPIKLATLGGKTLFLLLVDDRSRFMWLILLQAKSEVAEAIKRSQARAEAECGKKMRLLHMDRCSAKSTTSW
        + KQRR PF  Q S+   E LELVHGD+C P+  AT GG+  FLLLVDD SR+MW+++L +K E A+AI+ +QA AEAECG+K+R+L  D     +   +
Subjt:  IGKQRRTPFSSQASYHVGEPLELVHGDVCVPIKLATLGGKTLFLLLVDDRSRFMWLILLQAKSEVAEAIKRSQARAEAECGKKMRLLHMDRCSAKSTTSW

Query:  HAAAPNDAL--------LPQQNGVVECQNQTSSG
         +   ++ +         PQQNGVVE +NQT  G
Subjt:  HAAAPNDAL--------LPQQNGVVECQNQTSSG

XP_040384195.1 uncharacterized protein LOC121055551 [Oryza brachyantha] (e value: 1.3e-23, identity: 48.48%)
Query:  KQRRTPFSSQASYHVGEPLELVHGDVCVPIKLATLGGKTLFLLLVDDRSRFMWLILLQAKSEVAEAIKRSQARAEAECGKKMRLLHMDRCSAKSTTSWHA
        KQRR PF  Q S+   E LELVHGD+C P+  AT GG+  FLLLVDD SR+MW+ +L +K E A+AI+R+QA AEAECG+K+R+L  D     +   + +
Subjt:  KQRRTPFSSQASYHVGEPLELVHGDVCVPIKLATLGGKTLFLLLVDDRSRFMWLILLQAKSEVAEAIKRSQARAEAECGKKMRLLHMDRCSAKSTTSWHA

Query:  AAPNDAL--------LPQQNGVVECQNQTSSG
           ++ +         PQQNGVVE +NQT  G
Subjt:  AAPNDAL--------LPQQNGVVECQNQTSSG

TrEMBL top hits (e value, % identity, alignment)
A0A5S6RBT5 Putative retroelement (e value: 1.4e-23, identity: 47.01%)
Query:  IGKQRRTPFSSQASYHVGEPLELVHGDVCVPIKLATLGGKTLFLLLVDDRSRFMWLILLQAKSEVAEAIKRSQARAEAECGKKMRLLHMDRCSAKSTTSW
        + KQRR PF  Q S+   E LELVHGD+C P+  AT GG+  FLLLVDD SR+MW+++L +K E A+AI+ +QA AEAECG+K+R+L  D     +   +
Subjt:  IGKQRRTPFSSQASYHVGEPLELVHGDVCVPIKLATLGGKTLFLLLVDDRSRFMWLILLQAKSEVAEAIKRSQARAEAECGKKMRLLHMDRCSAKSTTSW

Query:  HAAAPNDAL--------LPQQNGVVECQNQTSSG
         +   ++ +         PQQNGVVE +NQT  G
Subjt:  HAAAPNDAL--------LPQQNGVVECQNQTSSG

B8AE64 Integrase catalytic domain-containing protein (e value: 1.4e-23, identity: 48.91%)
Query:  IGKQRRTPFSSQASYHVGEPLELVHGDVCVPIKLATLGGKTLFLLLVDDRSRFMWLILLQAKSEVAEAIKRSQARAEAECGKKMRLLHMDRCSAKSTTSW
        + KQRR PF  Q S+   E LELVH D+C P+  AT GG+  FLLLVDD SR+MW+++L +K E A+AI+R+QA AEAECG+K+R+L  D     + T +
Subjt:  IGKQRRTPFSSQASYHVGEPLELVHGDVCVPIKLATLGGKTLFLLLVDDRSRFMWLILLQAKSEVAEAIKRSQARAEAECGKKMRLLHMDRCSAKSTTSW

Query:  -----------HAAAPNDALLPQQNGVVECQNQTSSG
                   H  AP     PQQNGVVE +NQT  G
Subjt:  -----------HAAAPNDALLPQQNGVVECQNQTSSG

L0P215 PH01B019A14.9 protein (e value: 1.4e-23, identity: 48.2%)
Query:  EQAVRRVSIGKQRRTPFSSQASYHVGEPLELVHGDVCVPIKLATLGGKTLFLLLVDDRSRFMWLILLQAKSEVAEAIKRSQARAEAECGKKMRLLHMDRC
        EQ      + K RR PF  QAS+   E LELVHGD+C P+  AT GG+  FLLLVDD SR+MW +LL  K+  A+AIK  QA AEAECG+K+R+L  D  
Subjt:  EQAVRRVSIGKQRRTPFSSQASYHVGEPLELVHGDVCVPIKLATLGGKTLFLLLVDDRSRFMWLILLQAKSEVAEAIKRSQARAEAECGKKMRLLHMDRC

Query:  SAKSTTSWHAAAPNDAL--------LPQQNGVVECQNQT
           +   + A   N+ +         PQQNGVVE +NQT
Subjt:  SAKSTTSWHAAAPNDAL--------LPQQNGVVECQNQT

Q0J5Y3 Os08g0389500 protein (e value: 3.6e-24, identity: 51.82%)
Query:  IGKQRRTPFSSQASYHVGEPLELVHGDVCVPIKLATLGGKTLFLLLVDDRSRFMWLILLQAKSEVAEAIKRSQARAEAECGKKMRLLHMDR---------
        +GKQRR PF S+A Y   E LELVHGD+C P+  AT  G  LFLLLVDD SR+MWLILL +K + + AIKR  A AEAE G+K+R L  DR         
Subjt:  IGKQRRTPFSSQASYHVGEPLELVHGDVCVPIKLATLGGKTLFLLLVDDRSRFMWLILLQAKSEVAEAIKRSQARAEAECGKKMRLLHMDR---------

Query:  --CSAKSTTSWHAAAPNDALLPQQNGVVECQNQTSSG
            A+     H  AP     PQQNGVVE +NQT  G
Subjt:  --CSAKSTTSWHAAAPNDALLPQQNGVVECQNQTSSG

Q7XPB1 OSJNBb0026E15.10 protein (e value: 1.4e-23, identity: 50.38%)
Query:  IGKQRRTPFSSQASYHVGEPLELVHGDVCVPIKLATLGGKTLFLLLVDDRSRFMWLILLQAKSEVAEAIKRSQARAEAECGKKMRLLHMDRCSAKSTTSW
        +GKQRR  F +Q+ Y   E LELVHGD+C PI+ AT  G   FLLLVDD SR+MWL ++++K E A AIK  QARAE E G+K+R L MDR S  ++  +
Subjt:  IGKQRRTPFSSQASYHVGEPLELVHGDVCVPIKLATLGGKTLFLLLVDDRSRFMWLILLQAKSEVAEAIKRSQARAEAECGKKMRLLHMDRCSAKSTTSW

Query:  HAAAPNDAL--------LPQQNGVVECQNQT
             N  +         PQQNGVVE +NQT
Subjt:  HAAAPNDAL--------LPQQNGVVECQNQT

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 4.5e-08, identity: 32.56%)
Query:  GKQRRTPFSS-QASYHVGEPLELVHGDVCVPIKLATLGGKTLFLLLVDDRSRFMWLILLQAKSEVAEAIKRSQARAEAECGKKMRLLHMD----------
        GKQ R PF   +   H+  PL +VH DVC PI   TL  K  F++ VD  + +    L++ KS+V    +   A++EA    K+  L++D          
Subjt:  GKQRRTPFSS-QASYHVGEPLELVHGDVCVPIKLATLGGKTLFLLLVDDRSRFMWLILLQAKSEVAEAIKRSQARAEAECGKKMRLLHMD----------

Query:  -RCSAKSTTSWHAAAPNDALLPQQNGVVE
         +   K   S+H   P+    PQ NGV E
Subjt:  -RCSAKSTTSWHAAAPNDALLPQQNGVVE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 4.4e-11, identity: 29.23%)
Query:  GKQRRTPFSSQASYHVGEPLELVHGDVCVPIKLATLGGKTLFLLLVDDRSRFMWLILLQAKSEVAEAIKRSQARAEAECGKKMRLLHMDRCSAKSTTSWH
        GKQ R  F + +   +   L+LV+ DVC P+++ ++GG   F+  +DD SR +W+ +L+ K +V +  ++  A  E E G+K++ L  D     ++  + 
Subjt:  GKQRRTPFSSQASYHVGEPLELVHGDVCVPIKLATLGGKTLFLLLVDDRSRFMWLILLQAKSEVAEAIKRSQARAEAECGKKMRLLHMDRCSAKSTTSWH

Query:  AAAPNDAL--------LPQQNGVVECQNQT
            +  +         PQ NGV E  N+T
Subjt:  AAAPNDAL--------LPQQNGVVECQNQT

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGGTGCTCACACAAGTTCAACGCACATTAAACCGCCTTTACATTCTAGAGTTGGAGATAGAGCAACCGGTCAGCCTCTCGGCAAGGACCGAAGAAGTAGCTTGAAGGT
GGAACGCAAGGTACGGGCACCTGAACTTTTCTGCTCTACAAAAACTAAACAAGGAGGAGATGGTACACGGTTTGCTGACAATCAAAGGCATGAACAGGCTGTGCGACGGG
TGTCTATCGGCAAGCAAAGACGCACCCCCTTCTCGTCTCAAGCATCTTACCACGTCGGTGAGCCATTGGAGCTCGTGCATGGCGATGTCTGCGTGCCCATCAAGTTGGCA
ACCCTAGGCGGTAAAACCCTCTTCCTTCTACTGGTCGATGATAGAAGCCGCTTCATGTGGCTGATCCTGCTGCAAGCGAAAAGTGAGGTGGCAGAGGCGATCAAGCGCAG
TCAAGCGCGAGCGGAGGCCGAATGCGGGAAGAAGATGCGATTGCTGCACATGGACCGATGTAGTGCCAAATCTACGACGAGCTGGCATGCAGCGGCACCTAATGACGCCC
TACTCCCCCAGCAGAACGGGGTGGTGGAGTGCCAAAATCAAACCTCGTCGGGACAACAAGGTCATTACTGA
mRNA sequence
ATGGGTGCTCACACAAGTTCAACGCACATTAAACCGCCTTTACATTCTAGAGTTGGAGATAGAGCAACCGGTCAGCCTCTCGGCAAGGACCGAAGAAGTAGCTTGAAGGT
GGAACGCAAGGTACGGGCACCTGAACTTTTCTGCTCTACAAAAACTAAACAAGGAGGAGATGGTACACGGTTTGCTGACAATCAAAGGCATGAACAGGCTGTGCGACGGG
TGTCTATCGGCAAGCAAAGACGCACCCCCTTCTCGTCTCAAGCATCTTACCACGTCGGTGAGCCATTGGAGCTCGTGCATGGCGATGTCTGCGTGCCCATCAAGTTGGCA
ACCCTAGGCGGTAAAACCCTCTTCCTTCTACTGGTCGATGATAGAAGCCGCTTCATGTGGCTGATCCTGCTGCAAGCGAAAAGTGAGGTGGCAGAGGCGATCAAGCGCAG
TCAAGCGCGAGCGGAGGCCGAATGCGGGAAGAAGATGCGATTGCTGCACATGGACCGATGTAGTGCCAAATCTACGACGAGCTGGCATGCAGCGGCACCTAATGACGCCC
TACTCCCCCAGCAGAACGGGGTGGTGGAGTGCCAAAATCAAACCTCGTCGGGACAACAAGGTCATTACTGA
Protein sequence
MGAHTSSTHIKPPLHSRVGDRATGQPLGKDRRSSLKVERKVRAPELFCSTKTKQGGDGTRFADNQRHEQAVRRVSIGKQRRTPFSSQASYHVGEPLELVHGDVCVPIKLA
TLGGKTLFLLLVDDRSRFMWLILLQAKSEVAEAIKRSQARAEAECGKKMRLLHMDRCSAKSTTSWHAAAPNDALLPQQNGVVECQNQTSSGQQGHY
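
The relationship between the CDS and protein sequences above can be checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (the record does not name a translation table); `translate` is a hypothetical helper, and only the first 21 and last 18 nucleotides of the CDS are used here as short illustrations.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption: table 1).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGGTGCTCACACAAGTTCA"))  # MGAHTSS (start of the protein)
print(translate("CAACAAGGTCATTACTGA"))     # QQGHY (end of the protein, then stop)
```

Passing the full CDS, with its line breaks, to `translate` should reproduce the protein sequence listed above if the standard code applies.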