; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G17630 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G17630
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTransposon Tf2-12 polyprotein
Genome locationChr4:15113262..15113642
RNA-Seq ExpressionCSPI04G17630
SyntenyCSPI04G17630
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
GO:0005488 - binding (molecular function)
GO:0016740 - transferase activity (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0068128.1 hypothetical protein E6C27_scaffold238G00900 [Cucumis melo var. makuwa]7.7e-3271.57Show/hide
Query:  MSDVLRPYLRKFVLVFFDDILIYSKSLDEHLHQLAMVLETLVAHQLVANFKKCQFAVDKIEYLGHVISSEGVAADPTKIEAMVKWPAPKNVKELGGFFGA
        M+D+LRP+L KFVLV F +IL YS+SL EHL +LA+VLE LVA QLVANFKK QF VD+IEYLG VI SEGVAADP KI+AM+KWP P NVKEL GF G 
Subjt:  MSDVLRPYLRKFVLVFFDDILIYSKSLDEHLHQLAMVLETLVAHQLVANFKKCQFAVDKIEYLGHVISSEGVAADPTKIEAMVKWPAPKNVKELGGFFGA

Query:  NG
         G
Subjt:  NG

KFK24528.1 hypothetical protein AALP_AAs46225U000100, partial [Arabis alpina]8.5e-3159.2Show/hide
Query:  MSDVLRPYLRKFVLVFFDDILIYSKSLDEHLHQLAMVLETLVAHQLVANFKKCQFAVDKIEYLGHVISSEGVAADPTKIEAMVKWPAPKNVKELGGFFGA
        M+DV R YLRKFVLVFFDDIL+YSKSL EH   L +VLE L  HQL AN KKC+F   ++EYLGHV+S +GVAADP KI+AMV WP P+NVK L GF G 
Subjt:  MSDVLRPYLRKFVLVFFDDILIYSKSLDEHLHQLAMVLETLVAHQLVANFKKCQFAVDKIEYLGHVISSEGVAADPTKIEAMVKWPAPKNVKELGGFFGA

Query:  NGVLSKIRCKLPFYCAAPLDTTVKK
         G   K   +     A PL   +KK
Subjt:  NGVLSKIRCKLPFYCAAPLDTTVKK

XP_028552250.1 uncharacterized protein LOC114580023 [Dendrobium catenatum]5.4e-3359.2Show/hide
Query:  MSDVLRPYLRKFVLVFFDDILIYSKSLDEHLHQLAMVLETLVAHQLVANFKKCQFAVDKIEYLGHVISSEGVAADPTKIEAMVKWPAPKNVKELGGFFGA
        M+ V +PYLR+FVLVFFDDIL+Y+KSL +HLH L +VL TL+ HQL AN KKC FA  ++EYLGH+IS++GVAADPTKIEAMV WP PK++K L GF G 
Subjt:  MSDVLRPYLRKFVLVFFDDILIYSKSLDEHLHQLAMVLETLVAHQLVANFKKCQFAVDKIEYLGHVISSEGVAADPTKIEAMVKWPAPKNVKELGGFFGA

Query:  NGVLSKIRCKLPFYCAAPLDTTVKK
         G   +   K     A+PL   +KK
Subjt:  NGVLSKIRCKLPFYCAAPLDTTVKK

XP_028554071.1 uncharacterized protein LOC114580489 [Dendrobium catenatum]8.5e-3158.4Show/hide
Query:  MSDVLRPYLRKFVLVFFDDILIYSKSLDEHLHQLAMVLETLVAHQLVANFKKCQFAVDKIEYLGHVISSEGVAADPTKIEAMVKWPAPKNVKELGGFFGA
        M+ V +PYLR+FVLVFFDDIL+YSKSL +H+  L +VL TL+ HQL AN KKC FA  ++EYLGH+IS  GVAAD TKIEAMVKWP PK++K L GF G 
Subjt:  MSDVLRPYLRKFVLVFFDDILIYSKSLDEHLHQLAMVLETLVAHQLVANFKKCQFAVDKIEYLGHVISSEGVAADPTKIEAMVKWPAPKNVKELGGFFGA

Query:  NGVLSKIRCKLPFYCAAPLDTTVKK
         G   +   K     A PL   +KK
Subjt:  NGVLSKIRCKLPFYCAAPLDTTVKK

XP_031254189.1 pentatricopeptide repeat-containing protein At5g61990, mitochondrial-like [Pistacia vera]1.5e-3053.97Show/hide
Query:  SDVLRPYLRKFVLVFFDDILIYSKSLDEHLHQLAMVLETLVAHQLVANFKKCQFAVDKIEYLGHVISSEGVAADPTKIEAMVKWPAPKNVKELGGFFGAN
        +++ +P+LR+F+LVFFDDIL+YSK ++EHL  LA VL+ L+ HQL+ N KKC F   +IEYLGHVIS+EG+A DP+KIE+++KWP PKNV+ L GF G  
Subjt:  SDVLRPYLRKFVLVFFDDILIYSKSLDEHLHQLAMVLETLVAHQLVANFKKCQFAVDKIEYLGHVISSEGVAADPTKIEAMVKWPAPKNVKELGGFFGAN

Query:  GVLSKIRCKLPFY--CAAPLDTTVKK
        G   +    +  Y   AAPL T +KK
Subjt:  GVLSKIRCKLPFY--CAAPLDTTVKK

TrEMBL top hitse value%identityAlignment
A0A087FZ16 Uncharacterized protein (Fragment)9.2e-3159.2Show/hide
Query:  MSDVLRPYLRKFVLVFFDDILIYSKSLDEHLHQLAMVLETLVAHQLVANFKKCQFAVDKIEYLGHVISSEGVAADPTKIEAMVKWPAPKNVKELGGFFGA
        M+DV R YLRKFVLVFFDDIL+YSKSL EH   L +VL  L  HQL AN +KC+F   K+EYLGHV+S +GVAADP KI+AMV WP P+NVK L GF G 
Subjt:  MSDVLRPYLRKFVLVFFDDILIYSKSLDEHLHQLAMVLETLVAHQLVANFKKCQFAVDKIEYLGHVISSEGVAADPTKIEAMVKWPAPKNVKELGGFFGA

Query:  NGVLSKIRCKLPFYCAAPLDTTVKK
         G   K   K     A PL   +KK
Subjt:  NGVLSKIRCKLPFYCAAPLDTTVKK

A0A087G3S6 Uncharacterized protein (Fragment)4.1e-3159.2Show/hide
Query:  MSDVLRPYLRKFVLVFFDDILIYSKSLDEHLHQLAMVLETLVAHQLVANFKKCQFAVDKIEYLGHVISSEGVAADPTKIEAMVKWPAPKNVKELGGFFGA
        M+DV R YLRKFVLVFFDDIL+YSKSL EH   L +VLE L  HQL AN KKC+F   ++EYLGHV+S +GVAADP KI+AMV WP P+NVK L GF G 
Subjt:  MSDVLRPYLRKFVLVFFDDILIYSKSLDEHLHQLAMVLETLVAHQLVANFKKCQFAVDKIEYLGHVISSEGVAADPTKIEAMVKWPAPKNVKELGGFFGA

Query:  NGVLSKIRCKLPFYCAAPLDTTVKK
         G   K   +     A PL   +KK
Subjt:  NGVLSKIRCKLPFYCAAPLDTTVKK

A0A2I0VA20 Putative mitochondrial protein1.6e-3058.4Show/hide
Query:  MSDVLRPYLRKFVLVFFDDILIYSKSLDEHLHQLAMVLETLVAHQLVANFKKCQFAVDKIEYLGHVISSEGVAADPTKIEAMVKWPAPKNVKELGGFFGA
        M+ V +P LR+FVLVFFDDILIYS+SL EHL  L  VL TL  HQL  N KKC FA   +EYLGH+IS+EGVAADP+K+EAM  WP PKN++ L GF G 
Subjt:  MSDVLRPYLRKFVLVFFDDILIYSKSLDEHLHQLAMVLETLVAHQLVANFKKCQFAVDKIEYLGHVISSEGVAADPTKIEAMVKWPAPKNVKELGGFFGA

Query:  NGVLSKIRCKLPFYCAAPLDTTVKK
         G   K   K     AAPL   +KK
Subjt:  NGVLSKIRCKLPFYCAAPLDTTVKK

A0A2I0WGN9 Putative mitochondrial protein1.6e-3058.4Show/hide
Query:  MSDVLRPYLRKFVLVFFDDILIYSKSLDEHLHQLAMVLETLVAHQLVANFKKCQFAVDKIEYLGHVISSEGVAADPTKIEAMVKWPAPKNVKELGGFFGA
        M+ V +P LR+FVLVFFDDILIYS+SL EHL  L  VL TL  HQL  N KKC FA   +EYLGH+IS+EGVAADP+K+EAM  WP PKN++ L GF G 
Subjt:  MSDVLRPYLRKFVLVFFDDILIYSKSLDEHLHQLAMVLETLVAHQLVANFKKCQFAVDKIEYLGHVISSEGVAADPTKIEAMVKWPAPKNVKELGGFFGA

Query:  NGVLSKIRCKLPFYCAAPLDTTVKK
         G   K   K     AAPL   +KK
Subjt:  NGVLSKIRCKLPFYCAAPLDTTVKK

A0A5A7VJW6 Uncharacterized protein3.7e-3271.57Show/hide
Query:  MSDVLRPYLRKFVLVFFDDILIYSKSLDEHLHQLAMVLETLVAHQLVANFKKCQFAVDKIEYLGHVISSEGVAADPTKIEAMVKWPAPKNVKELGGFFGA
        M+D+LRP+L KFVLV F +IL YS+SL EHL +LA+VLE LVA QLVANFKK QF VD+IEYLG VI SEGVAADP KI+AM+KWP P NVKEL GF G 
Subjt:  MSDVLRPYLRKFVLVFFDDILIYSKSLDEHLHQLAMVLETLVAHQLVANFKKCQFAVDKIEYLGHVISSEGVAADPTKIEAMVKWPAPKNVKELGGFFGA

Query:  NG
         G
Subjt:  NG

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.61.1e-2040.16Show/hide
Query:  MSDVLRPYLRKFVLVFFDDILIYSKSLDEHLHQLAMVLETLVAHQLVANFKKCQFAVDKIEYLGHVISSEGVAADPTKIEAMVKWPAPKNVKELGGFFGA
        M+D+LRP L K  LV+ DDI+++S SLDEHL  L +V E L    L     KC+F   +  +LGHV++ +G+  +P KIEA+ K+P P   KE+  F G 
Subjt:  MSDVLRPYLRKFVLVFFDDILIYSKSLDEHLHQLAMVLETLVAHQLVANFKKCQFAVDKIEYLGHVISSEGVAADPTKIEAMVKWPAPKNVKELGGFFGA

Query:  NGVLSKIRCKLPFYC--AAPLDTTVKK
         G   K    +P +   A P+   +KK
Subjt:  NGVLSKIRCKLPFYC--AAPLDTTVKK

P10401 Retrovirus-related Pol polyprotein from transposon gypsy5.4e-1231.31Show/hide
Query:  MSDVLRPYLRKFVLVFFDDILIYSKSLDEHLHQLAMVLETLVAHQLVANFKKCQFAVDKIEYLGHVISSEGVAADPTKIEAMVKWPAPKNVKELGGFFG
        + DVLR  + K   V+ DD++I+S++  +H+  +  VL+ L+   +  + +K +F  + +EYLG ++S +G  +DP K++A+ ++P P  V ++  F G
Subjt:  MSDVLRPYLRKFVLVFFDDILIYSKSLDEHLHQLAMVLETLVAHQLVANFKKCQFAVDKIEYLGHVISSEGVAADPTKIEAMVKWPAPKNVKELGGFFG

P20825 Retrovirus-related Pol polyprotein from transposon 2974.6e-1936.72Show/hide
Query:  MSDVLRPYLRKFVLVFFDDILIYSKSLDEHLHQLAMVLETLVAHQLVANFKKCQFAVDKIEYLGHVISSEGVAADPTKIEAMVKWPAPKNVKELGGFFGA
        M+++LRP L K  LV+ DDI+I+S SL EHL+ + +V   L    L     KC+F   +  +LGH+++ +G+  +P K++A+V +P P   KE+  F G 
Subjt:  MSDVLRPYLRKFVLVFFDDILIYSKSLDEHLHQLAMVLETLVAHQLVANFKKCQFAVDKIEYLGHVISSEGVAADPTKIEAMVKWPAPKNVKELGGFFGA

Query:  NGVLSKIRCKLPFYC--AAPLDTTVKKR
         G   K    +P Y   A P+ + +KKR
Subjt:  NGVLSKIRCKLPFYC--AAPLDTTVKKR

P92523 Uncharacterized mitochondrial protein AtMg008602.3e-1555.41Show/hide
Query:  LHQLAMVLETLVAHQLVANFKKCQFAVDKIEYLG--HVISSEGVAADPTKIEAMVKWPAPKNVKELGGFFGANG
        ++ L MVL+    HQ  AN KKC F   +I YLG  H+IS EGV+ADP K+EAMV WP PKN  EL GF G  G
Subjt:  LHQLAMVLETLVAHQLVANFKKCQFAVDKIEYLG--HVISSEGVAADPTKIEAMVKWPAPKNVKELGGFFGANG

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus4.0e-1534.91Show/hide
Query:  MSDVLRPYLRKFVLVFFDDILIYSKSLDEHLHQLAMVLETLVAHQLVANFKKCQFAVDKIEYLGHVISSEGVAADPTKIEAMVKWPAPKNVKELGGFFGA
        + D+LR ++ K   V+ DDI+++S+  D H   L +VL +L    L  N +K  F   ++E+LG++++++G+ ADP K+ A+ + P P +VKEL  F G 
Subjt:  MSDVLRPYLRKFVLVFFDDILIYSKSLDEHLHQLAMVLETLVAHQLVANFKKCQFAVDKIEYLGHVISSEGVAADPTKIEAMVKWPAPKNVKELGGFFGA

Query:  NGVLSK
             K
Subjt:  NGVLSK

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein1.7e-1655.41Show/hide
Query:  LHQLAMVLETLVAHQLVANFKKCQFAVDKIEYLG--HVISSEGVAADPTKIEAMVKWPAPKNVKELGGFFGANG
        ++ L MVL+    HQ  AN KKC F   +I YLG  H+IS EGV+ADP K+EAMV WP PKN  EL GF G  G
Subjt:  LHQLAMVLETLVAHQLVANFKKCQFAVDKIEYLG--HVISSEGVAADPTKIEAMVKWPAPKNVKELGGFFGANG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGATGTTCTTCGCCCATATTTACGCAAGTTTGTTTTGGTTTTCTTTGATGACATCTTAATTTACAGCAAGTCCTTGGACGAGCATTTGCACCAATTGGCAATGGT
TTTGGAAACTCTAGTAGCTCATCAACTGGTAGCCAACTTTAAAAAATGCCAATTTGCGGTTGACAAAATTGAATACTTGGGCCATGTTATTTCTTCAGAGGGGGTGGCAG
CGGATCCAACAAAAATAGAAGCGATGGTTAAATGGCCAGCACCCAAGAACGTGAAGGAGCTAGGTGGGTTTTTTGGGGCTAACGGGGTATTATCGAAAATTCGTTGCAAA
CTACCGTTCTATTGTGCTGCCCCCCTTGACACAACTGTTAAAAAAAGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGTGATGTTCTTCGCCCATATTTACGCAAGTTTGTTTTGGTTTTCTTTGATGACATCTTAATTTACAGCAAGTCCTTGGACGAGCATTTGCACCAATTGGCAATGGT
TTTGGAAACTCTAGTAGCTCATCAACTGGTAGCCAACTTTAAAAAATGCCAATTTGCGGTTGACAAAATTGAATACTTGGGCCATGTTATTTCTTCAGAGGGGGTGGCAG
CGGATCCAACAAAAATAGAAGCGATGGTTAAATGGCCAGCACCCAAGAACGTGAAGGAGCTAGGTGGGTTTTTTGGGGCTAACGGGGTATTATCGAAAATTCGTTGCAAA
CTACCGTTCTATTGTGCTGCCCCCCTTGACACAACTGTTAAAAAAAGGTAA
Protein sequenceShow/hide protein sequence
MSDVLRPYLRKFVLVFFDDILIYSKSLDEHLHQLAMVLETLVAHQLVANFKKCQFAVDKIEYLGHVISSEGVAADPTKIEAMVKWPAPKNVKELGGFFGANGVLSKIRCK
LPFYCAAPLDTTVKKR