CuGenDBv2

CmaCh01G009080 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene ID: CmaCh01G009080
Organism: Cucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
Description: Retrotransposon protein, putative, Ty1-copia subclass
Genome location: Cma_Chr01:5436346..5440831
RNA-Seq Expression: CmaCh01G009080
Synteny: CmaCh01G009080
Gene Ontology terms: GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
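
The genome location above is given as a `sequence:start..end` string. A minimal sketch of how such a string could be split into its parts is shown here; the helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re

# Pattern for strings such as "Cma_Chr01:5436346..5440831".
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return match["seqid"], start, end


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("Cma_Chr01:5436346..5440831")
print(seqid, start, end, span_length(start, end))
```

The span computed this way covers the whole gene model on the chromosome, so it can be longer than the mRNA and CDS sequences listed further down.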


Homology
GenBank top hits (columns: e-value, % identity, alignment)
ADB85424.1 putative retrotransposon protein [Phyllostachys edulis] (e-value: 3.1e-30; identity: 46.34%)
Query:  ASYIVDEPLELVHDDICWPIKP-----------------------------AAEAIKCIQARAEAKCGKKMRVLHTNRGREFSSTSFSKYYNKLGMKRHL
        AS+   E LELVH D+C P+ P                             AA+AIKC+QA AEA+CG+K+RVL T+ G EF++  F+ Y    G+ RH 
Subjt:  ASYIVDEPLELVHDDICWPIKP-----------------------------AAEAIKCIQARAEAKCGKKMRVLHTNRGREFSSTSFSKYYNKLGMKRHL

Query:  TVRYSLQQNGTVEHQNQTIVRTTRSLLMTTKMPGRFWGEAVMTAIYLLNWSPTRSLDEKTPHEA
        +  Y+ QQNG VE +NQT+V T R+LL    MP  +WGEAVMTA++LLN SPT++LD KTP+EA
Subjt:  TVRYSLQQNGTVEHQNQTIVRTTRSLLMTTKMPGRFWGEAVMTAIYLLNWSPTRSLDEKTPHEA

CAE03692.2 OSJNBb0026E15.10 [Oryza sativa Japonica Group] (e-value: 9.1e-30; identity: 47.27%)
Query:  ASYIVDEPLELVHDDICWPIKP------------------------------AAEAIKCIQARAEAKCGKKMRVLHTNRGREFSSTSFSKYYNKLGMKRH
        + Y  DE LELVH D+C PI+P                              AA AIK  QARAE + G+K+R L  +RG EF+S  F +Y   LG+ R 
Subjt:  ASYIVDEPLELVHDDICWPIKP------------------------------AAEAIKCIQARAEAKCGKKMRVLHTNRGREFSSTSFSKYYNKLGMKRH

Query:  LTVRYSLQQNGTVEHQNQTIVRTTRSLLMTTKMPGRFWGEAVMTAIYLLNWSPTRSLDEKTPHEA
        LT  YS QQNG VE +NQTIV T RS++    +PGRFWGEA+ TA++LLN SPT+SLD +TP+EA
Subjt:  LTVRYSLQQNGTVEHQNQTIVRTTRSLLMTTKMPGRFWGEAVMTAIYLLNWSPTRSLDEKTPHEA

CCI55340.1 PH01B019A14.9 [Phyllostachys edulis] (e-value: 4.1e-30; identity: 46.06%)
Query:  ASYIVDEPLELVHDDICWPIKP------------------------------AAEAIKCIQARAEAKCGKKMRVLHTNRGREFSSTSFSKYYNKLGMKRH
        AS+   E LELVH D+C P+ P                              AA+AIKC+QA AEA+CG+K+RVL T+ G EF++  F+ Y    G+ RH
Subjt:  ASYIVDEPLELVHDDICWPIKP------------------------------AAEAIKCIQARAEAKCGKKMRVLHTNRGREFSSTSFSKYYNKLGMKRH

Query:  LTVRYSLQQNGTVEHQNQTIVRTTRSLLMTTKMPGRFWGEAVMTAIYLLNWSPTRSLDEKTPHEA
         +  Y+ QQNG VE +NQT+V T R+LL    MP  +WGEAVMTA++LLN SPT++LD KTP+EA
Subjt:  LTVRYSLQQNGTVEHQNQTIVRTTRSLLMTTKMPGRFWGEAVMTAIYLLNWSPTRSLDEKTPHEA

EEC72737.1 hypothetical protein OsI_06355 [Oryza sativa Indica Group] (e-value: 4.5e-29; identity: 45.12%)
Query:  SYIVDEPLELVHDDICWPIKP------------------------------AAEAIKCIQARAEAKCGKKMRVLHTNRGREFSSTSFSKYYNKLGMKRHL
        S+   E LELVH+D+C P+ P                              AA+AI+  QA AEA+CG+K+RVL T+ G EF++T F+ YY   G++RH 
Subjt:  SYIVDEPLELVHDDICWPIKP------------------------------AAEAIKCIQARAEAKCGKKMRVLHTNRGREFSSTSFSKYYNKLGMKRHL

Query:  TVRYSLQQNGTVEHQNQTIVRTTRSLLMTTKMPGRFWGEAVMTAIYLLNWSPTRSLDEKTPHEA
        T  YS QQNG VE +NQT+V   R+L+    MP  FWGEAV+TA+Y+LN SPT++LD +TP+EA
Subjt:  TVRYSLQQNGTVEHQNQTIVRTTRSLLMTTKMPGRFWGEAVMTAIYLLNWSPTRSLDEKTPHEA

XP_015582191.1 uncharacterized protein LOC107262222 [Ricinus communis] (e-value: 2.6e-29; identity: 31.06%)
Query:  RCHDLGHFQYGCPKMNNELNYAELEEEDEMLLMTHMKRHEAKMSDTWFLDSSCSNHKC---------------------------------------ASY
        +CH LGH+++ CP  + E NY E  EE+EMLLM +++ ++AK  D WFLDS CSNH C                                        ++
Subjt:  RCHDLGHFQYGCPKMNNELNYAELEEEDEMLLMTHMKRHEAKMSDTWFLDSSCSNHKC---------------------------------------ASY

Query:  IVDE-------------------------------PLELVHDDICWPIKPAAE------------------------------AIKCIQARAEAKCGKKM
        IV E                                L+LVH DIC PI PA+                                 +C +   E + G  +
Subjt:  IVDE-------------------------------PLELVHDDICWPIKPAAE------------------------------AIKCIQARAEAKCGKKM

Query:  RVLHTNRGREFSSTSFSKYYNKLGMKRHLTVRYSLQQNGTVEHQNQTIVRTTRSLLMTTKMPGRFWGEAVMTAIYLLNWSPTRSLDEKTPHEA
        + L T+RG E  ST FS++     +KR LT  Y+ QQN  VE +N+T++   RS L   K+P  FW EAV   IY+LN SPT +L + TP EA
Subjt:  RVLHTNRGREFSSTSFSKYYNKLGMKRHLTVRYSLQQNGTVEHQNQTIVRTTRSLLMTTKMPGRFWGEAVMTAIYLLNWSPTRSLDEKTPHEA

TrEMBL top hits (columns: e-value, % identity, alignment)
A0A2N9HRC6 Uncharacterized protein (e-value: 4.4e-30; identity: 31.14%)
Query:  RCHDLGHFQYGCPKMNNELNYAELEEEDEMLLMTHMKRHEAKMSDTWFLDSSCSNHKCAS----------------------------------------
        +CH LGHFQY CPK   E NYAELEE++EMLLM++++ ++++  D WFLDS CSNH CA+                                        
Subjt:  RCHDLGHFQYGCPKMNNELNYAELEEEDEMLLMTHMKRHEAKMSDTWFLDSSCSNHKCAS----------------------------------------

Query:  --------------------------YIVDEPLELVHDDICWPIKPAAEA------------------------------IKCIQARAEAKCGKKMRVLH
                                  +   + L+LVH DIC PIKP + +                               K  +   E + G  +R L 
Subjt:  --------------------------YIVDEPLELVHDDICWPIKPAAEA------------------------------IKCIQARAEAKCGKKMRVLH

Query:  TNRGREFSSTSFSKYYNKLGMKRHLTVRYSLQQNGTVEHQNQTIVRTTRSLLMTTKMPGRFWGEAVMTAIYLLNWSPTRSLDEKTPHEA
        T+RG EF+S  F  +    G+ R LT  Y+ QQNG  E +N+TI+   RS+L   ++P  FW EAV    ++LN SPT  + + TP EA
Subjt:  TNRGREFSSTSFSKYYNKLGMKRHLTVRYSLQQNGTVEHQNQTIVRTTRSLLMTTKMPGRFWGEAVMTAIYLLNWSPTRSLDEKTPHEA

B8AE64 Integrase catalytic domain-containing protein (e-value: 2.2e-29; identity: 45.12%)
Query:  SYIVDEPLELVHDDICWPIKP------------------------------AAEAIKCIQARAEAKCGKKMRVLHTNRGREFSSTSFSKYYNKLGMKRHL
        S+   E LELVH+D+C P+ P                              AA+AI+  QA AEA+CG+K+RVL T+ G EF++T F+ YY   G++RH 
Subjt:  SYIVDEPLELVHDDICWPIKP------------------------------AAEAIKCIQARAEAKCGKKMRVLHTNRGREFSSTSFSKYYNKLGMKRHL

Query:  TVRYSLQQNGTVEHQNQTIVRTTRSLLMTTKMPGRFWGEAVMTAIYLLNWSPTRSLDEKTPHEA
        T  YS QQNG VE +NQT+V   R+L+    MP  FWGEAV+TA+Y+LN SPT++LD +TP+EA
Subjt:  TVRYSLQQNGTVEHQNQTIVRTTRSLLMTTKMPGRFWGEAVMTAIYLLNWSPTRSLDEKTPHEA

D3IVT4 Putative retrotransposon protein (e-value: 1.5e-30; identity: 46.34%)
Query:  ASYIVDEPLELVHDDICWPIKP-----------------------------AAEAIKCIQARAEAKCGKKMRVLHTNRGREFSSTSFSKYYNKLGMKRHL
        AS+   E LELVH D+C P+ P                             AA+AIKC+QA AEA+CG+K+RVL T+ G EF++  F+ Y    G+ RH 
Subjt:  ASYIVDEPLELVHDDICWPIKP-----------------------------AAEAIKCIQARAEAKCGKKMRVLHTNRGREFSSTSFSKYYNKLGMKRHL

Query:  TVRYSLQQNGTVEHQNQTIVRTTRSLLMTTKMPGRFWGEAVMTAIYLLNWSPTRSLDEKTPHEA
        +  Y+ QQNG VE +NQT+V T R+LL    MP  +WGEAVMTA++LLN SPT++LD KTP+EA
Subjt:  TVRYSLQQNGTVEHQNQTIVRTTRSLLMTTKMPGRFWGEAVMTAIYLLNWSPTRSLDEKTPHEA

L0P215 PH01B019A14.9 protein (e-value: 2.0e-30; identity: 46.06%)
Query:  ASYIVDEPLELVHDDICWPIKP------------------------------AAEAIKCIQARAEAKCGKKMRVLHTNRGREFSSTSFSKYYNKLGMKRH
        AS+   E LELVH D+C P+ P                              AA+AIKC+QA AEA+CG+K+RVL T+ G EF++  F+ Y    G+ RH
Subjt:  ASYIVDEPLELVHDDICWPIKP------------------------------AAEAIKCIQARAEAKCGKKMRVLHTNRGREFSSTSFSKYYNKLGMKRH

Query:  LTVRYSLQQNGTVEHQNQTIVRTTRSLLMTTKMPGRFWGEAVMTAIYLLNWSPTRSLDEKTPHEA
         +  Y+ QQNG VE +NQT+V T R+LL    MP  +WGEAVMTA++LLN SPT++LD KTP+EA
Subjt:  LTVRYSLQQNGTVEHQNQTIVRTTRSLLMTTKMPGRFWGEAVMTAIYLLNWSPTRSLDEKTPHEA

Q7XPB1 OSJNBb0026E15.10 protein (e-value: 4.4e-30; identity: 47.27%)
Query:  ASYIVDEPLELVHDDICWPIKP------------------------------AAEAIKCIQARAEAKCGKKMRVLHTNRGREFSSTSFSKYYNKLGMKRH
        + Y  DE LELVH D+C PI+P                              AA AIK  QARAE + G+K+R L  +RG EF+S  F +Y   LG+ R 
Subjt:  ASYIVDEPLELVHDDICWPIKP------------------------------AAEAIKCIQARAEAKCGKKMRVLHTNRGREFSSTSFSKYYNKLGMKRH

Query:  LTVRYSLQQNGTVEHQNQTIVRTTRSLLMTTKMPGRFWGEAVMTAIYLLNWSPTRSLDEKTPHEA
        LT  YS QQNG VE +NQTIV T RS++    +PGRFWGEA+ TA++LLN SPT+SLD +TP+EA
Subjt:  LTVRYSLQQNGTVEHQNQTIVRTTRSLLMTTKMPGRFWGEAVMTAIYLLNWSPTRSLDEKTPHEA

SwissProt top hits (columns: e-value, % identity, alignment)
P04146 Copia protein (e-value: 1.2e-13; identity: 32.1%)
Query:  VDEPLELVHDDICWPIKPAAEAIK-------------CIQ-----------------ARAEAKCGKKMRVLHTNRGREFSSTSFSKYYNKLGMKRHLTVR
        +  PL +VH D+C PI P     K             C+                  A++EA    K+  L+ + GRE+ S    ++  K G+  HLTV 
Subjt:  VDEPLELVHDDICWPIKPAAEAIK-------------CIQ-----------------ARAEAKCGKKMRVLHTNRGREFSSTSFSKYYNKLGMKRHLTVR

Query:  YSLQQNGTVEHQNQTIVRTTRSLLMTTKMPGRFWGEAVMTAIYLLNWSPTRSL--DEKTPHE
        ++ Q NG  E   +TI    R+++   K+   FWGEAV+TA YL+N  P+R+L    KTP+E
Subjt:  YSLQQNGTVEHQNQTIVRTTRSLLMTTKMPGRFWGEAVMTAIYLLNWSPTRSL--DEKTPHE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value: 1.2e-13; identity: 40.59%)
Query:  ARAEAKCGKKMRVLHTNRGREFSSTSFSKYYNKLGMKRHLTVRYSLQQNGTVEHQNQTIVRTTRSLLMTTKMPGRFWGEAVMTAIYLLNWSPTRSLDEKT
        A  E + G+K++ L ++ G E++S  F +Y +  G++   TV  + Q NG  E  N+TIV   RS+L   K+P  FWGEAV TA YL+N SP+  L  + 
Subjt:  ARAEAKCGKKMRVLHTNRGREFSSTSFSKYYNKLGMKRHLTVRYSLQQNGTVEHQNQTIVRTTRSLLMTTKMPGRFWGEAVMTAIYLLNWSPTRSLDEKT

Query:  P
        P
Subjt:  P

Arabidopsis top hits (columns: e-value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGCTCGGTGAGTTCTCGAAATCGATTGTGTTGTTGTTTTGGAGGAAGAGTCGAGTGGAAGTTAACATCAACGTCGGAATTGTTGCGTCAAAGAAGGAAAACCGT
CAAAACGAAGGGATTGCGTCATCGGCGTCGGCAGCGGCAGCGGCACATCAGATTAGGGTTTTCGTGAGGGTTCAAGCGAAGAAGACGCAAAGTCGGGTCGGGTTT
CAGCCTCTGAACCGGCCCACGAAACGAACGCGGGCCCTGTGCGTTTCGGCCCAACATGGACGGGCGCTGCAGCCTTGGTCTCGACTCGGCTTGACGTGTACAGCA
GCTCAGCTCCATCAATACCCGCGGCTCGGCTCCGTCAAATCCGGCAGCTCAGCTTGGCGGCTTAGTTGGTCGGCTCGGAAGCTCGGCTTGGTGACTCGGTATCAC
TATAATATATCGATTTCTAAGTTTTCGACCCTTCCAAAGCCTGCCCTTAGTTTTACCATCCATCCGAGGTGTCATGACTTAGGTCATTTTCAATATGGATGTCCA
AAGATGAATAACGAATTGAATTATGCAGAGCTGGAAGAAGAGGATGAAATGTTGTTAATGACTCATATGAAAAGACATGAAGCAAAAATGAGTGATACTTGGTTC
TTAGATTCTAGTTGCTCAAACCATAAGTGTGCATCCTATATCGTCGATGAGCCATTAGAGCTCGTGCACGACGATATCTGTTGGCCCATCAAACCAGCGGCTGAG
GCGATTAAGTGCATTCAAGCACGAGCGGAGGCCAAATGCGGGAAGAAGATGCGAGTGCTACACACGAATCGAGGCAGAGAATTCTCCTCGACGAGTTTCAGTAAG
TACTACAACAAGCTTGGCATGAAGCGGCACCTAACGGTGCGCTACTCCCTCCAACAAAACGGGACGGTGGAGCACCAAAATCAGACTATCGTCAGGACAACAAGG
TCATTGCTGATGACGACCAAGATGCCTGGGAGGTTCTGGGGAGAGGCGGTAATGACGGCCATCTACCTCCTCAATTGGTCACCAACGCGAAGCCTCGACGAGAAG
ACGCCACATGAGGCCTGA
mRNA sequence
AAAATCGAGTGTTAGCCAGGAAAACGAGTGGGTGGTTCGCTGGGTAGTTTTCATGCTCGGTGAGTTCTCGAAATCGATTGTGTTGTTGTTTTGGAGGAAGAGTCG
AGTGGAAGTTAACATCAACGTCGGAATTGTTGCGTCAAAGAAGGAAAACCGTCAAAACGAAGGGATTGCGTCATCGGCGTCGGCAGCGGCAGCGGCACATCAGAT
TAGGGTTTTCGTGAGGGTTCAAGCGAAGAAGACGCAAAGTCGGGTCGGGTTTCAGCCTCTGAACCGGCCCACGAAACGAACGCGGGCCCTGTGCGTTTCGGCCCA
ACATGGACGGGCGCTGCAGCCTTGGTCTCGACTCGGCTTGACGTGTACAGCAGCTCAGCTCCATCAATACCCGCGGCTCGGCTCCGTCAAATCCGGCAGCTCAGC
TTGGCGGCTTAGTTGGTCGGCTCGGAAGCTCGGCTTGGTGACTCGGTATCACTATAATATATCGATTTCTAAGTTTTCGACCCTTCCAAAGCCTGCCCTTAGTTT
TACCATCCATCCGAGGTGTCATGACTTAGGTCATTTTCAATATGGATGTCCAAAGATGAATAACGAATTGAATTATGCAGAGCTGGAAGAAGAGGATGAAATGTT
GTTAATGACTCATATGAAAAGACATGAAGCAAAAATGAGTGATACTTGGTTCTTAGATTCTAGTTGCTCAAACCATAAGTGTGCATCCTATATCGTCGATGAGCC
ATTAGAGCTCGTGCACGACGATATCTGTTGGCCCATCAAACCAGCGGCTGAGGCGATTAAGTGCATTCAAGCACGAGCGGAGGCCAAATGCGGGAAGAAGATGCG
AGTGCTACACACGAATCGAGGCAGAGAATTCTCCTCGACGAGTTTCAGTAAGTACTACAACAAGCTTGGCATGAAGCGGCACCTAACGGTGCGCTACTCCCTCCA
ACAAAACGGGACGGTGGAGCACCAAAATCAGACTATCGTCAGGACAACAAGGTCATTGCTGATGACGACCAAGATGCCTGGGAGGTTCTGGGGAGAGGCGGTAAT
GACGGCCATCTACCTCCTCAATTGGTCACCAACGCGAAGCCTCGACGAGAAGACGCCACATGAGGCCTGA
Protein sequence
MLGEFSKSIVLLFWRKSRVEVNINVGIVASKKENRQNEGIASSASAAAAAHQIRVFVRVQAKKTQSRVGFQPLNRPTKRTRALCVSAQHGRALQPWSRLGLTCTA
AQLHQYPRLGSVKSGSSAWRLSWSARKLGLVTRYHYNISISKFSTLPKPALSFTIHPRCHDLGHFQYGCPKMNNELNYAELEEEDEMLLMTHMKRHEAKMSDTWF
LDSSCSNHKCASYIVDEPLELVHDDICWPIKPAAEAIKCIQARAEAKCGKKMRVLHTNRGREFSSTSFSKYYNKLGMKRHLTVRYSLQQNGTVEHQNQTIVRTTR
SLLMTTKMPGRFWGEAVMTAIYLLNWSPTRSLDEKTPHEA
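
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this gene model; the `translate` helper and the two short test fragments (the first 30 and last 18 nucleotides of the CDS listed above) are illustrative choices, not part of the database record.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, to_stop=True):
    """Translate a DNA coding sequence in frame 1.

    Unknown codons become 'X'. With to_stop=True, translation ends at the
    first stop codon and the stop is not included in the result.
    """
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 18 nucleotides of the CDS sequence listed above.
cds_start = "ATGCTCGGTGAGTTCTCGAAATCGATTGTG"
cds_end = "ACGCCACATGAGGCCTGA"
print(translate(cds_start))
print(translate(cds_end))
```

Passing the full CDS (with line breaks removed) to `translate` and comparing the result with the protein sequence is a quick consistency check on the record.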