; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G09040 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G09040
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionReverse transcriptase
Genome locationChr7:6870094..6870837
RNA-Seq ExpressionCSPI07G09040
SyntenyCSPI07G09040
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022150099.1 uncharacterized protein LOC111018360 [Momordica charantia]9.5e-5868.02Show/hide
Query:  IRIDEDKLQAIKEWRDPTSVIELRSFLGLANYCCRFIEGFSRRVALLTKLLKKGRTWIWLVECQTAFDKLKVTIMRGLVFRLVDVSKLFVVETDVSDFSL
        I +D DK++AI+EWR PTSVIELRSFLGLANY  RFIEGFSRR   +T+LLKKG TW+W  E Q AF+ LK  +M+G V  L DV+K F VETD SD++L
Subjt:  IRIDEDKLQAIKEWRDPTSVIELRSFLGLANYCCRFIEGFSRRVALLTKLLKKGRTWIWLVECQTAFDKLKVTIMRGLVFRLVDVSKLFVVETDVSDFSL

Query:  EGVLTQEGHQIAYASRKLNSTKRRYTVFEKEMLSVVHCLRTWRQYLLGSQFMVKSDNS-ICHFFSQPKLTFK
         GVL Q+ H I Y SRKLN+ +RRYTV EKEML+VVHCLR+WRQYLLGS F+VK+DNS ICHFF+QPKLT K
Subjt:  EGVLTQEGHQIAYASRKLNSTKRRYTVFEKEMLSVVHCLRTWRQYLLGSQFMVKSDNS-ICHFFSQPKLTFK

XP_022975176.1 uncharacterized protein LOC111474215 [Cucurbita maxima]4.7e-5766.86Show/hide
Query:  IRIDEDKLQAIKEWRDPTSVIELRSFLGLANYCCRFIEGFSRRVALLTKLLKKGRTWIWLVECQTAFDKLKVTIMRGLVFRLVDVSKLFVVETDVSDFSL
        I +D DK++AI+EW+ PTSV +LRSFLGLANY  RF+EGFSRR A L +LLKK   W+W  +CQ AF+ LK T+M G V  LVDV+K F +ETD SDF+L
Subjt:  IRIDEDKLQAIKEWRDPTSVIELRSFLGLANYCCRFIEGFSRRVALLTKLLKKGRTWIWLVECQTAFDKLKVTIMRGLVFRLVDVSKLFVVETDVSDFSL

Query:  EGVLTQEGHQIAYASRKLNSTKRRYTVFEKEMLSVVHCLRTWRQYLLGSQFMVKSDNS-ICHFFSQPKLTFK
         GVL QEGH IA+ SRKLN  +RRY V EK+ML+VVHCLR WRQYLLGSQF+VK+DNS ICHFF QPKLT K
Subjt:  EGVLTQEGHQIAYASRKLNSTKRRYTVFEKEMLSVVHCLRTWRQYLLGSQFMVKSDNS-ICHFFSQPKLTFK

XP_022975516.1 uncharacterized protein LOC111474945, partial [Cucurbita maxima]2.3e-5969.19Show/hide
Query:  IRIDEDKLQAIKEWRDPTSVIELRSFLGLANYCCRFIEGFSRRVALLTKLLKKGRTWIWLVECQTAFDKLKVTIMRGLVFRLVDVSKLFVVETDVSDFSL
        I +D DK++AI+EW+ PTSV +LRSFLGLANY  RF+EGFSRR A LT+LLKK  TW W  +CQ AF+ LK T+ RG V  LVDV+K F +ETD SDF+L
Subjt:  IRIDEDKLQAIKEWRDPTSVIELRSFLGLANYCCRFIEGFSRRVALLTKLLKKGRTWIWLVECQTAFDKLKVTIMRGLVFRLVDVSKLFVVETDVSDFSL

Query:  EGVLTQEGHQIAYASRKLNSTKRRYTVFEKEMLSVVHCLRTWRQYLLGSQFMVKSDNS-ICHFFSQPKLTFK
         GVL QEGH IA+ SRKLN  +RRYTV EKEML+VVHCLR WRQYLLGSQF+VK+DNS ICHFF QPKLT K
Subjt:  EGVLTQEGHQIAYASRKLNSTKRRYTVFEKEMLSVVHCLRTWRQYLLGSQFMVKSDNS-ICHFFSQPKLTFK

XP_023524533.1 uncharacterized protein LOC111788429 [Cucurbita pepo subsp. pepo]3.9e-5969.77Show/hide
Query:  IRIDEDKLQAIKEWRDPTSVIELRSFLGLANYCCRFIEGFSRRVALLTKLLKKGRTWIWLVECQTAFDKLKVTIMRGLVFRLVDVSKLFVVETDVSDFSL
        I +D DK++AI+EW+ PTSV ELRSFLGLANY  RF+EGFSRR A LT+LLKK   W W  +CQ AF+ LK T+ RG V  LVDV+K F VETD SDF+L
Subjt:  IRIDEDKLQAIKEWRDPTSVIELRSFLGLANYCCRFIEGFSRRVALLTKLLKKGRTWIWLVECQTAFDKLKVTIMRGLVFRLVDVSKLFVVETDVSDFSL

Query:  EGVLTQEGHQIAYASRKLNSTKRRYTVFEKEMLSVVHCLRTWRQYLLGSQFMVKSDNS-ICHFFSQPKLTFK
         GVL QEGH IAY SRKLN  +RRYTV EKEML+VVHCLR WRQYLLGSQF+VK+DNS  CHFF QPKLT K
Subjt:  EGVLTQEGHQIAYASRKLNSTKRRYTVFEKEMLSVVHCLRTWRQYLLGSQFMVKSDNS-ICHFFSQPKLTFK

XP_023537907.1 uncharacterized protein LOC111798805 [Cucurbita pepo subsp. pepo]3.9e-5969.77Show/hide
Query:  IRIDEDKLQAIKEWRDPTSVIELRSFLGLANYCCRFIEGFSRRVALLTKLLKKGRTWIWLVECQTAFDKLKVTIMRGLVFRLVDVSKLFVVETDVSDFSL
        I +D DK++AI+EW+ PTSV ELRSFLGLANY  RF+EGFSRR A LT+LLKK   W W  +CQ AF+ LK T+ RG V  LVDV+K F VETD SDF+L
Subjt:  IRIDEDKLQAIKEWRDPTSVIELRSFLGLANYCCRFIEGFSRRVALLTKLLKKGRTWIWLVECQTAFDKLKVTIMRGLVFRLVDVSKLFVVETDVSDFSL

Query:  EGVLTQEGHQIAYASRKLNSTKRRYTVFEKEMLSVVHCLRTWRQYLLGSQFMVKSDNS-ICHFFSQPKLTFK
         GVL QEGH IAY SRKLN  +RRYTV EKEML+VVHCLR WRQYLLGSQF+VK+DNS  CHFF QPKLT K
Subjt:  EGVLTQEGHQIAYASRKLNSTKRRYTVFEKEMLSVVHCLRTWRQYLLGSQFMVKSDNS-ICHFFSQPKLTFK

TrEMBL top hitse value%identityAlignment
A0A6J1D906 Reverse transcriptase4.6e-5868.02Show/hide
Query:  IRIDEDKLQAIKEWRDPTSVIELRSFLGLANYCCRFIEGFSRRVALLTKLLKKGRTWIWLVECQTAFDKLKVTIMRGLVFRLVDVSKLFVVETDVSDFSL
        I +D DK++AI+EWR PTSVIELRSFLGLANY  RFIEGFSRR   +T+LLKKG TW+W  E Q AF+ LK  +M+G V  L DV+K F VETD SD++L
Subjt:  IRIDEDKLQAIKEWRDPTSVIELRSFLGLANYCCRFIEGFSRRVALLTKLLKKGRTWIWLVECQTAFDKLKVTIMRGLVFRLVDVSKLFVVETDVSDFSL

Query:  EGVLTQEGHQIAYASRKLNSTKRRYTVFEKEMLSVVHCLRTWRQYLLGSQFMVKSDNS-ICHFFSQPKLTFK
         GVL Q+ H I Y SRKLN+ +RRYTV EKEML+VVHCLR+WRQYLLGS F+VK+DNS ICHFF+QPKLT K
Subjt:  EGVLTQEGHQIAYASRKLNSTKRRYTVFEKEMLSVVHCLRTWRQYLLGSQFMVKSDNS-ICHFFSQPKLTFK

A0A6J1IDF7 uncharacterized protein LOC1114742152.3e-5766.86Show/hide
Query:  IRIDEDKLQAIKEWRDPTSVIELRSFLGLANYCCRFIEGFSRRVALLTKLLKKGRTWIWLVECQTAFDKLKVTIMRGLVFRLVDVSKLFVVETDVSDFSL
        I +D DK++AI+EW+ PTSV +LRSFLGLANY  RF+EGFSRR A L +LLKK   W+W  +CQ AF+ LK T+M G V  LVDV+K F +ETD SDF+L
Subjt:  IRIDEDKLQAIKEWRDPTSVIELRSFLGLANYCCRFIEGFSRRVALLTKLLKKGRTWIWLVECQTAFDKLKVTIMRGLVFRLVDVSKLFVVETDVSDFSL

Query:  EGVLTQEGHQIAYASRKLNSTKRRYTVFEKEMLSVVHCLRTWRQYLLGSQFMVKSDNS-ICHFFSQPKLTFK
         GVL QEGH IA+ SRKLN  +RRY V EK+ML+VVHCLR WRQYLLGSQF+VK+DNS ICHFF QPKLT K
Subjt:  EGVLTQEGHQIAYASRKLNSTKRRYTVFEKEMLSVVHCLRTWRQYLLGSQFMVKSDNS-ICHFFSQPKLTFK

A0A6J1IEF9 uncharacterized protein LOC1114749451.1e-5969.19Show/hide
Query:  IRIDEDKLQAIKEWRDPTSVIELRSFLGLANYCCRFIEGFSRRVALLTKLLKKGRTWIWLVECQTAFDKLKVTIMRGLVFRLVDVSKLFVVETDVSDFSL
        I +D DK++AI+EW+ PTSV +LRSFLGLANY  RF+EGFSRR A LT+LLKK  TW W  +CQ AF+ LK T+ RG V  LVDV+K F +ETD SDF+L
Subjt:  IRIDEDKLQAIKEWRDPTSVIELRSFLGLANYCCRFIEGFSRRVALLTKLLKKGRTWIWLVECQTAFDKLKVTIMRGLVFRLVDVSKLFVVETDVSDFSL

Query:  EGVLTQEGHQIAYASRKLNSTKRRYTVFEKEMLSVVHCLRTWRQYLLGSQFMVKSDNS-ICHFFSQPKLTFK
         GVL QEGH IA+ SRKLN  +RRYTV EKEML+VVHCLR WRQYLLGSQF+VK+DNS ICHFF QPKLT K
Subjt:  EGVLTQEGHQIAYASRKLNSTKRRYTVFEKEMLSVVHCLRTWRQYLLGSQFMVKSDNS-ICHFFSQPKLTFK

A0A6J1IGF5 uncharacterized protein LOC1114745131.1e-5664.13Show/hide
Query:  FTSLNIRI-------DEDKLQAIKEWRDPTSVIELRSFLGLANYCCRFIEGFSRRVALLTKLLKKGRTWIWLVECQTAFDKLKVTIMRGLVFRLVDVSKL
        FT L++R        D DK++AI+EW+ PTSV +++SFLGLANY  RF+EGFSRR A LT+LLKK   W W  +CQ  F+ LK T+ R  V RLVDV+K 
Subjt:  FTSLNIRI-------DEDKLQAIKEWRDPTSVIELRSFLGLANYCCRFIEGFSRRVALLTKLLKKGRTWIWLVECQTAFDKLKVTIMRGLVFRLVDVSKL

Query:  FVVETDVSDFSLEGVLTQEGHQIAYASRKLNSTKRRYTVFEKEMLSVVHCLRTWRQYLLGSQFMVKSDNS-ICHFFSQPKLTFK
        F +ETD SDF+L GVL QEGH IAY SRKLN  +RRYTV EKEML+VVHCLR WRQYLLGSQF+VK  NS ICHFF QPKLT K
Subjt:  FVVETDVSDFSLEGVLTQEGHQIAYASRKLNSTKRRYTVFEKEMLSVVHCLRTWRQYLLGSQFMVKSDNS-ICHFFSQPKLTFK

A0A6J1IKW3 uncharacterized protein LOC1114750393.0e-5764.13Show/hide
Query:  FTSLNIRI-------DEDKLQAIKEWRDPTSVIELRSFLGLANYCCRFIEGFSRRVALLTKLLKKGRTWIWLVECQTAFDKLKVTIMRGLVFRLVDVSKL
        FT L++R        D DK++AI+EW+ PTSV +++SF+GLANY  RF+EGFSRR A LT+LLKK   W W  +CQ  F+ LK T+ R  V RLVDV+K 
Subjt:  FTSLNIRI-------DEDKLQAIKEWRDPTSVIELRSFLGLANYCCRFIEGFSRRVALLTKLLKKGRTWIWLVECQTAFDKLKVTIMRGLVFRLVDVSKL

Query:  FVVETDVSDFSLEGVLTQEGHQIAYASRKLNSTKRRYTVFEKEMLSVVHCLRTWRQYLLGSQFMVKSDNS-ICHFFSQPKLTFK
        F +ETD SDF+L GVL QEGH IAY SRKLN  +RRYTV EKEML+VVHCLR WRQYLLGSQF+VK DNS ICHFF QPKLT K
Subjt:  FVVETDVSDFSLEGVLTQEGHQIAYASRKLNSTKRRYTVFEKEMLSVVHCLRTWRQYLLGSQFMVKSDNS-ICHFFSQPKLTFK

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.69.3e-2437.28Show/hide
Query:  TTLMYEFTSLNIRIDEDKLQAIKEWRDPTSVIELRSFLGLANYCCRFIEGFSRRVALLTKLLKKG-RTWIWLVECQTAFDKLKVTIMRGLVFRLVDVSKL
        T L +  T   I+ + +K++AI+++  PT   E+++FLGL  Y  +FI  F+     +TK LKK  +      E  +AF KLK  I    + ++ D +K 
Subjt:  TTLMYEFTSLNIRIDEDKLQAIKEWRDPTSVIELRSFLGLANYCCRFIEGFSRRVALLTKLLKKG-RTWIWLVECQTAFDKLKVTIMRGLVFRLVDVSKL

Query:  FVVETDVSDFSLEGVLTQEGHQIAYASRKLNSTKRRYTVFEKEMLSVVHCLRTWRQYLLGSQFMVKSDN
        F + TD SD +L  VL+Q+GH ++Y SR LN  +  Y+  EKE+L++V   +T+R YLLG  F + SD+
Subjt:  FVVETDVSDFSLEGVLTQEGHQIAYASRKLNSTKRRYTVFEKEMLSVVHCLRTWRQYLLGSQFMVKSDN

P0CT34 Transposon Tf2-1 polyprotein1.3e-1733.33Show/hide
Query:  EDKLQAIKEWRDPTSVIELRSFLGLANYCCRFIEGFSRRVALLTKLLKKGRTWIWLVECQTAFDKLKVTIMRGLVFRLVDVSKLFVVETDVSDFSLEGVL
        ++ +  + +W+ P +  ELR FLG  NY  +FI   S+    L  LLKK   W W      A + +K  ++   V R  D SK  ++ETD SD ++  VL
Subjt:  EDKLQAIKEWRDPTSVIELRSFLGLANYCCRFIEGFSRRVALLTKLLKKGRTWIWLVECQTAFDKLKVTIMRGLVFRLVDVSKLFVVETDVSDFSLEGVL

Query:  TQEG-----HQIAYASRKLNSTKRRYTVFEKEMLSVVHCLRTWRQYL
        +Q+      + + Y S K++  +  Y+V +KEML+++  L+ WR YL
Subjt:  TQEG-----HQIAYASRKLNSTKRRYTVFEKEMLSVVHCLRTWRQYL

P0CT41 Transposon Tf2-12 polyprotein1.3e-1733.33Show/hide
Query:  EDKLQAIKEWRDPTSVIELRSFLGLANYCCRFIEGFSRRVALLTKLLKKGRTWIWLVECQTAFDKLKVTIMRGLVFRLVDVSKLFVVETDVSDFSLEGVL
        ++ +  + +W+ P +  ELR FLG  NY  +FI   S+    L  LLKK   W W      A + +K  ++   V R  D SK  ++ETD SD ++  VL
Subjt:  EDKLQAIKEWRDPTSVIELRSFLGLANYCCRFIEGFSRRVALLTKLLKKGRTWIWLVECQTAFDKLKVTIMRGLVFRLVDVSKLFVVETDVSDFSLEGVL

Query:  TQEG-----HQIAYASRKLNSTKRRYTVFEKEMLSVVHCLRTWRQYL
        +Q+      + + Y S K++  +  Y+V +KEML+++  L+ WR YL
Subjt:  TQEG-----HQIAYASRKLNSTKRRYTVFEKEMLSVVHCLRTWRQYL

P20825 Retrovirus-related Pol polyprotein from transposon 2971.6e-2341.83Show/hide
Query:  KLQAIKEWRDPTSVIELRSFLGLANYCCRFIEGFSRRVALLTKLLKKGRTWI--WLVECQTAFDKLKVTIMRGLVFRLVDVSKLFVVETDVSDFSLEGVL
        K++AI  +  PT   E+R+FLGL  Y  +FI  ++     +T  LKK RT I    +E   AF+KLK  I+R  + +L D  K FV+ TD S+ +L  VL
Subjt:  KLQAIKEWRDPTSVIELRSFLGLANYCCRFIEGFSRRVALLTKLLKKGRTWI--WLVECQTAFDKLKVTIMRGLVFRLVDVSKLFVVETDVSDFSLEGVL

Query:  TQEGHQIAYASRKLNSTKRRYTVFEKEMLSVVHCLRTWRQYLLGSQFMVKSDN
        +Q GH I++ SR LN  +  Y+  EKE+L++V   +T+R YLLG QF++ SD+
Subjt:  TQEGHQIAYASRKLNSTKRRYTVFEKEMLSVVHCLRTWRQYLLGSQFMVKSDN

Q9UR07 Transposon Tf2-11 polyprotein1.3e-1733.33Show/hide
Query:  EDKLQAIKEWRDPTSVIELRSFLGLANYCCRFIEGFSRRVALLTKLLKKGRTWIWLVECQTAFDKLKVTIMRGLVFRLVDVSKLFVVETDVSDFSLEGVL
        ++ +  + +W+ P +  ELR FLG  NY  +FI   S+    L  LLKK   W W      A + +K  ++   V R  D SK  ++ETD SD ++  VL
Subjt:  EDKLQAIKEWRDPTSVIELRSFLGLANYCCRFIEGFSRRVALLTKLLKKGRTWIWLVECQTAFDKLKVTIMRGLVFRLVDVSKLFVVETDVSDFSLEGVL

Query:  TQEG-----HQIAYASRKLNSTKRRYTVFEKEMLSVVHCLRTWRQYL
        +Q+      + + Y S K++  +  Y+V +KEML+++  L+ WR YL
Subjt:  TQEG-----HQIAYASRKLNSTKRRYTVFEKEMLSVVHCLRTWRQYL

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein1.3e-0734.58Show/hide
Query:  IRIDEDKLQAIKEWRDPTSVIELRSFLGLANYCCRFIEGFSRRVALLTKLLKKGRTWIWLVECQTAFDKLKVTIMRGLVFRLVDVSKLFVVETDVSDFSL
        +  D  KL+A+  W +P +  ELR FLGL  Y  RF++ + + V  LT+LLKK  +  W      AF  LK  +    V  L D+   FV  T V  ++ 
Subjt:  IRIDEDKLQAIKEWRDPTSVIELRSFLGLANYCCRFIEGFSRRVALLTKLLKKGRTWIWLVECQTAFDKLKVTIMRGLVFRLVDVSKLFVVETDVSDFSL

Query:  EGVLTQE
           +T+E
Subjt:  EGVLTQE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCTTTAGCTTGACAAATGCTTTCACAACCTTAATGTATGAGTTCACGTCATTGAATATTCGAATAGACGAGGATAAGTTGCAAGCTATTAAAGAGTGGAGAGACCC
CACTTCCGTGATAGAATTACGCTCCTTCCTTGGATTGGCTAATTACTGTTGTCGGTTCATTGAAGGATTCTCAAGAAGGGTTGCATTATTGACTAAGTTATTGAAGAAAG
GTAGGACTTGGATATGGCTAGTCGAATGTCAAACTGCTTTTGACAAACTAAAGGTGACAATAATGAGGGGTCTTGTCTTCAGATTGGTGGATGTCTCTAAGCTGTTTGTA
GTTGAGACTGACGTGTCAGATTTTTCTCTTGAGGGCGTCCTTACCCAAGAGGGTCACCAAATAGCTTATGCGAGCCGTAAGCTTAATAGTACTAAGAGGAGGTATACTGT
CTTCGAGAAAGAAATGCTTTCAGTGGTCCATTGTCTAAGGACCTGGAGGCAATATTTACTAGGATCACAATTCATGGTGAAATCTGACAACTCTATCTGTCACTTCTTTA
GCCAACCTAAGTTGACCTTTAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCCTTTAGCTTGACAAATGCTTTCACAACCTTAATGTATGAGTTCACGTCATTGAATATTCGAATAGACGAGGATAAGTTGCAAGCTATTAAAGAGTGGAGAGACCC
CACTTCCGTGATAGAATTACGCTCCTTCCTTGGATTGGCTAATTACTGTTGTCGGTTCATTGAAGGATTCTCAAGAAGGGTTGCATTATTGACTAAGTTATTGAAGAAAG
GTAGGACTTGGATATGGCTAGTCGAATGTCAAACTGCTTTTGACAAACTAAAGGTGACAATAATGAGGGGTCTTGTCTTCAGATTGGTGGATGTCTCTAAGCTGTTTGTA
GTTGAGACTGACGTGTCAGATTTTTCTCTTGAGGGCGTCCTTACCCAAGAGGGTCACCAAATAGCTTATGCGAGCCGTAAGCTTAATAGTACTAAGAGGAGGTATACTGT
CTTCGAGAAAGAAATGCTTTCAGTGGTCCATTGTCTAAGGACCTGGAGGCAATATTTACTAGGATCACAATTCATGGTGAAATCTGACAACTCTATCTGTCACTTCTTTA
GCCAACCTAAGTTGACCTTTAAGTAA
Protein sequenceShow/hide protein sequence
MSFSLTNAFTTLMYEFTSLNIRIDEDKLQAIKEWRDPTSVIELRSFLGLANYCCRFIEGFSRRVALLTKLLKKGRTWIWLVECQTAFDKLKVTIMRGLVFRLVDVSKLFV
VETDVSDFSLEGVLTQEGHQIAYASRKLNSTKRRYTVFEKEMLSVVHCLRTWRQYLLGSQFMVKSDNSICHFFSQPKLTFK