; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G10350 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G10350
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationChr7:8446946..8447497
RNA-Seq ExpressionCSPI07G10350
SyntenyCSPI07G10350
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0044260 - cellular macromolecule metabolic process (biological process)
GO:0016020 - membrane (cellular component)
InterPro domainsIPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049630.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.6e-5458.38Show/hide
Query:  MDWKNLTMTFYHENNKVVIRRHPSLTKTRVSLKSMIKAWTKSDQGFLIECRALEGGISLADLYGLDD---VPNIQES----IPAVLAKLEDAFERSEEIP
        +DWKNLT+TF+   NKV I+  PSLTKT++SLK MIK+W   DQG+L+ECR LE         G++D      I E+      A+L K  D FER   +P
Subjt:  MDWKNLTMTFYHENNKVVIRRHPSLTKTRVSLKSMIKAWTKSDQGFLIECRALEGGISLADLYGLDD---VPNIQES----IPAVLAKLEDAFERSEEIP

Query:  STREIEHHIHLKKGTDPMNVKSYRFPYQQKPKMEKLVDEMLASGVIRPSTNPHSSPVLLVKKEDGSWRFCVDYRALNNVRIPDKF
          R IEHHI+LK+G DP+NV+ YR+ +QQK +ME+LVDEML+SG+IRPST+P+SSPVLLV+K+DGSWRFCVDYRALNNV +PDKF
Subjt:  STREIEHHIHLKKGTDPMNVKSYRFPYQQKPKMEKLVDEMLASGVIRPSTNPHSSPVLLVKKEDGSWRFCVDYRALNNVRIPDKF

KAA0062868.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.6e-5458.1Show/hide
Query:  DWKNLTMTFYHENNKVVIRRHPSLTKTRVSLKSMIKAWTKSDQGFLIECRALEGGISLADLYGL--DDVPNIQESIPAVLAKLEDAFERSEEIPSTREIE
        DWKNLT+TFY +  K+ I+  PSLTK RVSLK+++K W + D G+LIECR++  GI +A+   L  ++   I+E +  +L + +D FE  E++P  R IE
Subjt:  DWKNLTMTFYHENNKVVIRRHPSLTKTRVSLKSMIKAWTKSDQGFLIECRALEGGISLADLYGL--DDVPNIQESIPAVLAKLEDAFERSEEIPSTREIE

Query:  HHIHLKKGTDPMNVKSYRFPYQQKPKMEKLVDEMLASGVIRPSTNPHSSPVLLVKKEDGSWRFCVDYRALNNVRIPDKF
        H IHLK+GT+P+NV+ YR+ Y QK +ME+LV+EMLASG+IRPS +P+SSPVLLVKK+DGSWRFCVDYRALNNV +PDKF
Subjt:  HHIHLKKGTDPMNVKSYRFPYQQKPKMEKLVDEMLASGVIRPSTNPHSSPVLLVKKEDGSWRFCVDYRALNNVRIPDKF

KAA0068193.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.6e-5458.1Show/hide
Query:  DWKNLTMTFYHENNKVVIRRHPSLTKTRVSLKSMIKAWTKSDQGFLIECRALEGGISLADLYGL--DDVPNIQESIPAVLAKLEDAFERSEEIPSTREIE
        DWKNLT+TFY +  K+ I+  PSLTK RVSLK+++K W + D G+LIECR++  GI +A+   L  ++   I+E +  +L + +D FE  E++P  R IE
Subjt:  DWKNLTMTFYHENNKVVIRRHPSLTKTRVSLKSMIKAWTKSDQGFLIECRALEGGISLADLYGL--DDVPNIQESIPAVLAKLEDAFERSEEIPSTREIE

Query:  HHIHLKKGTDPMNVKSYRFPYQQKPKMEKLVDEMLASGVIRPSTNPHSSPVLLVKKEDGSWRFCVDYRALNNVRIPDKF
        H IHLK+GT+P+NV+ YR+ Y QK +ME+LV+EMLASG+IRPS +P+SSPVLLVKK+DGSWRFCVDYRALNNV +PDKF
Subjt:  HHIHLKKGTDPMNVKSYRFPYQQKPKMEKLVDEMLASGVIRPSTNPHSSPVLLVKKEDGSWRFCVDYRALNNVRIPDKF

TYJ96875.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.6e-5458.1Show/hide
Query:  DWKNLTMTFYHENNKVVIRRHPSLTKTRVSLKSMIKAWTKSDQGFLIECRALEGGISLADLYGL--DDVPNIQESIPAVLAKLEDAFERSEEIPSTREIE
        DWKNLT+TFY +  K+ I+  PSLTK RVSLK+++K W + D G+LIECR++  GI +A+   L  ++   I+E +  +L + +D FE  E++P  R IE
Subjt:  DWKNLTMTFYHENNKVVIRRHPSLTKTRVSLKSMIKAWTKSDQGFLIECRALEGGISLADLYGL--DDVPNIQESIPAVLAKLEDAFERSEEIPSTREIE

Query:  HHIHLKKGTDPMNVKSYRFPYQQKPKMEKLVDEMLASGVIRPSTNPHSSPVLLVKKEDGSWRFCVDYRALNNVRIPDKF
        H IHLK+GT+P+NV+ YR+ Y QK +ME+LV+EMLASG+IRPS +P+SSPVLLVKK+DGSWRFCVDYRALNNV +PDKF
Subjt:  HHIHLKKGTDPMNVKSYRFPYQQKPKMEKLVDEMLASGVIRPSTNPHSSPVLLVKKEDGSWRFCVDYRALNNVRIPDKF

XP_031745972.1 uncharacterized protein LOC116406393 [Cucumis sativus]1.6e-5761.8Show/hide
Query:  MDWKNLTMTFYHENNKVVIRRHPSLTKTRVSLKSMIKAWTKSDQGFLIECRALEGGISLADLYGLDDVPNIQESIPAVLAKLEDAFERSEEIPSTREIEH
        +DWKNLTMTF H   KV I+  PSLTK  V LK+MIK+W  SDQGFLIECRA+E      +  G+++V  + E++  VL K ED F   E +P  R IEH
Subjt:  MDWKNLTMTFYHENNKVVIRRHPSLTKTRVSLKSMIKAWTKSDQGFLIECRALEGGISLADLYGLDDVPNIQESIPAVLAKLEDAFERSEEIPSTREIEH

Query:  HIHLKKGTDPMNVKSYRFPYQQKPKMEKLVDEMLASGVIRPSTNPHSSPVLLVKKEDGSWRFCVDYRALNNVRIPDKF
        HI+LK+GTDP+NV+ YR+ YQQK +ME+LV+EML+SGVIRPS +P+SSPVLLV+K+DGSWRFCVDYR LN+V IPDKF
Subjt:  HIHLKKGTDPMNVKSYRFPYQQKPKMEKLVDEMLASGVIRPSTNPHSSPVLLVKKEDGSWRFCVDYRALNNVRIPDKF

TrEMBL top hitse value%identityAlignment
A0A5A7U2S1 Ty3/gypsy retrotransposon protein7.8e-5558.38Show/hide
Query:  MDWKNLTMTFYHENNKVVIRRHPSLTKTRVSLKSMIKAWTKSDQGFLIECRALEGGISLADLYGLDD---VPNIQES----IPAVLAKLEDAFERSEEIP
        +DWKNLT+TF+   NKV I+  PSLTKT++SLK MIK+W   DQG+L+ECR LE         G++D      I E+      A+L K  D FER   +P
Subjt:  MDWKNLTMTFYHENNKVVIRRHPSLTKTRVSLKSMIKAWTKSDQGFLIECRALEGGISLADLYGLDD---VPNIQES----IPAVLAKLEDAFERSEEIP

Query:  STREIEHHIHLKKGTDPMNVKSYRFPYQQKPKMEKLVDEMLASGVIRPSTNPHSSPVLLVKKEDGSWRFCVDYRALNNVRIPDKF
          R IEHHI+LK+G DP+NV+ YR+ +QQK +ME+LVDEML+SG+IRPST+P+SSPVLLV+K+DGSWRFCVDYRALNNV +PDKF
Subjt:  STREIEHHIHLKKGTDPMNVKSYRFPYQQKPKMEKLVDEMLASGVIRPSTNPHSSPVLLVKKEDGSWRFCVDYRALNNVRIPDKF

A0A5A7V5H5 Ty3/gypsy retrotransposon protein7.8e-5558.1Show/hide
Query:  DWKNLTMTFYHENNKVVIRRHPSLTKTRVSLKSMIKAWTKSDQGFLIECRALEGGISLADLYGL--DDVPNIQESIPAVLAKLEDAFERSEEIPSTREIE
        DWKNLT+TFY +  K+ I+  PSLTK RVSLK+++K W + D G+LIECR++  GI +A+   L  ++   I+E +  +L + +D FE  E++P  R IE
Subjt:  DWKNLTMTFYHENNKVVIRRHPSLTKTRVSLKSMIKAWTKSDQGFLIECRALEGGISLADLYGL--DDVPNIQESIPAVLAKLEDAFERSEEIPSTREIE

Query:  HHIHLKKGTDPMNVKSYRFPYQQKPKMEKLVDEMLASGVIRPSTNPHSSPVLLVKKEDGSWRFCVDYRALNNVRIPDKF
        H IHLK+GT+P+NV+ YR+ Y QK +ME+LV+EMLASG+IRPS +P+SSPVLLVKK+DGSWRFCVDYRALNNV +PDKF
Subjt:  HHIHLKKGTDPMNVKSYRFPYQQKPKMEKLVDEMLASGVIRPSTNPHSSPVLLVKKEDGSWRFCVDYRALNNVRIPDKF

A0A5A7VJA0 Ty3/gypsy retrotransposon protein7.8e-5558.1Show/hide
Query:  DWKNLTMTFYHENNKVVIRRHPSLTKTRVSLKSMIKAWTKSDQGFLIECRALEGGISLADLYGL--DDVPNIQESIPAVLAKLEDAFERSEEIPSTREIE
        DWKNLT+TFY +  K+ I+  PSLTK RVSLK+++K W + D G+LIECR++  GI +A+   L  ++   I+E +  +L + +D FE  E++P  R IE
Subjt:  DWKNLTMTFYHENNKVVIRRHPSLTKTRVSLKSMIKAWTKSDQGFLIECRALEGGISLADLYGL--DDVPNIQESIPAVLAKLEDAFERSEEIPSTREIE

Query:  HHIHLKKGTDPMNVKSYRFPYQQKPKMEKLVDEMLASGVIRPSTNPHSSPVLLVKKEDGSWRFCVDYRALNNVRIPDKF
        H IHLK+GT+P+NV+ YR+ Y QK +ME+LV+EMLASG+IRPS +P+SSPVLLVKK+DGSWRFCVDYRALNNV +PDKF
Subjt:  HHIHLKKGTDPMNVKSYRFPYQQKPKMEKLVDEMLASGVIRPSTNPHSSPVLLVKKEDGSWRFCVDYRALNNVRIPDKF

A0A5D3BEL2 Ty3/gypsy retrotransposon protein7.8e-5558.1Show/hide
Query:  DWKNLTMTFYHENNKVVIRRHPSLTKTRVSLKSMIKAWTKSDQGFLIECRALEGGISLADLYGL--DDVPNIQESIPAVLAKLEDAFERSEEIPSTREIE
        DWKNLT+TFY +  K+ I+  PSLTK RVSLK+++K W + D G+LIECR++  GI +A+   L  ++   I+E +  +L + +D FE  E++P  R IE
Subjt:  DWKNLTMTFYHENNKVVIRRHPSLTKTRVSLKSMIKAWTKSDQGFLIECRALEGGISLADLYGL--DDVPNIQESIPAVLAKLEDAFERSEEIPSTREIE

Query:  HHIHLKKGTDPMNVKSYRFPYQQKPKMEKLVDEMLASGVIRPSTNPHSSPVLLVKKEDGSWRFCVDYRALNNVRIPDKF
        H IHLK+GT+P+NV+ YR+ Y QK +ME+LV+EMLASG+IRPS +P+SSPVLLVKK+DGSWRFCVDYRALNNV +PDKF
Subjt:  HHIHLKKGTDPMNVKSYRFPYQQKPKMEKLVDEMLASGVIRPSTNPHSSPVLLVKKEDGSWRFCVDYRALNNVRIPDKF

A0A5D3D5G0 Ty3/gypsy retrotransposon protein1.0e-5458.1Show/hide
Query:  DWKNLTMTFYHENNKVVIRRHPSLTKTRVSLKSMIKAWTKSDQGFLIECRALEGGISLADLYGL--DDVPNIQESIPAVLAKLEDAFERSEEIPSTREIE
        DWKNLT+TFY +  K+ I+  PSLTK RVSLK+++K W + D G+LIECR++  GI +A+   L  ++   I+E +  +L +  D FE  E++P  R IE
Subjt:  DWKNLTMTFYHENNKVVIRRHPSLTKTRVSLKSMIKAWTKSDQGFLIECRALEGGISLADLYGL--DDVPNIQESIPAVLAKLEDAFERSEEIPSTREIE

Query:  HHIHLKKGTDPMNVKSYRFPYQQKPKMEKLVDEMLASGVIRPSTNPHSSPVLLVKKEDGSWRFCVDYRALNNVRIPDKF
        H IHLK+GT+P+NV+ YR+ Y QK +ME+LV+EMLASG+IRPS +P+SSPVLLVKK+DGSWRFCVDYRALNNV +PDKF
Subjt:  HHIHLKKGTDPMNVKSYRFPYQQKPKMEKLVDEMLASGVIRPSTNPHSSPVLLVKKEDGSWRFCVDYRALNNVRIPDKF

SwissProt top hitse value%identityAlignment
P03364 Gag-Pro-Pol polyprotein2.1e-0940Show/hide
Query:  KGTDPMNVKSYRFPYQQKPKMEKLVDEMLASGVIRPSTNPHSSPVLLVKKEDGSWRFCVDYRALNNVRIP
        K TDP+ V  +   Y++      LV E LA+G I P+ +P ++P+ ++KK+ GSWR   D RA+N V +P
Subjt:  KGTDPMNVKSYRFPYQQKPKMEKLVDEMLASGVIRPSTNPHSSPVLLVKKEDGSWRFCVDYRALNNVRIP

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein3.0e-1139.51Show/hide
Query:  IEHHIHLKKGTDPMNVKSYRFPYQQKPKMEKLVDEMLASGVIRPSTNPHSSPVLLVKKEDGSWRFCVDYRALNNVRIPDKF
        ++H I +K G     ++ Y    + + ++ K+V ++L +  I PS +P SSPV+LV K+DG++R CVDYR LN   I D F
Subjt:  IEHHIHLKKGTDPMNVKSYRFPYQQKPKMEKLVDEMLASGVIRPSTNPHSSPVLLVKKEDGSWRFCVDYRALNNVRIPDKF

Q87040 Pro-Pol polyprotein2.0e-0728.28Show/hide
Query:  QESIPAVLAKLEDAFERSEEIPSTREIEHHIHLKKGTDPMNVKSYRFPYQQKPKMEKLVDEMLASGVIRPSTNPHSSPVLLVKKEDGSWRFCVDYRALN
        ++ + A+  K ++ ++  E     R+I  H        P   K Y    + KP ++ ++D++L  GV+ P  +  ++PV  V K DG WR  +DYR +N
Subjt:  QESIPAVLAKLEDAFERSEEIPSTREIEHHIHLKKGTDPMNVKSYRFPYQQKPKMEKLVDEMLASGVIRPSTNPHSSPVLLVKKEDGSWRFCVDYRALN

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus3.9e-1138.67Show/hide
Query:  DPMNVKSYRFPYQQKPKMEKLVDEMLASGVIRPSTNPHSSPVLLVKKE-----DGSWRFCVDYRALNNVRIPDKF
        DP+  KSY +P   + ++E+ +DE+L  G+IRPS +P++SP+ +V K+     +  +R  VD++ LN V IPD +
Subjt:  DPMNVKSYRFPYQQKPKMEKLVDEMLASGVIRPSTNPHSSPVLLVKKE-----DGSWRFCVDYRALNNVRIPDKF

Q99315 Transposon Ty3-G Gag-Pol polyprotein3.0e-1139.51Show/hide
Query:  IEHHIHLKKGTDPMNVKSYRFPYQQKPKMEKLVDEMLASGVIRPSTNPHSSPVLLVKKEDGSWRFCVDYRALNNVRIPDKF
        ++H I +K G     ++ Y    + + ++ K+V ++L +  I PS +P SSPV+LV K+DG++R CVDYR LN   I D F
Subjt:  IEHHIHLKKGTDPMNVKSYRFPYQQKPKMEKLVDEMLASGVIRPSTNPHSSPVLLVKKEDGSWRFCVDYRALNNVRIPDKF

Arabidopsis top hitse value%identityAlignment
ATMG00850.1 DNA/RNA polymerases superfamily protein2.5e-0546.15Show/hide
Query:  QKPKMEKLVDEMLASGVIRPSTNPHSSPVLLVKKEDGSW
        ++ +++  + EML + +I+PS +P+SSPVLLV+K+DG W
Subjt:  QKPKMEKLVDEMLASGVIRPSTNPHSSPVLLVKKEDGSW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACTGGAAAAATCTCACCATGACTTTCTATCATGAAAATAATAAGGTGGTTATAAGGAGACACCCAAGCTTAACTAAAACCCGAGTAAGCTTGAAGAGTATGATAAA
GGCATGGACAAAATCTGATCAAGGATTCTTGATTGAATGCAGAGCCTTGGAGGGAGGAATATCATTAGCCGATTTGTATGGATTAGATGATGTGCCTAATATACAAGAGT
CTATCCCGGCCGTATTAGCGAAACTCGAAGATGCGTTTGAACGGTCTGAAGAAATACCCTCGACGAGGGAAATCGAGCATCACATACATCTAAAGAAAGGTACAGATCCG
ATGAATGTGAAGTCCTATAGATTTCCATACCAACAAAAGCCTAAGATGGAGAAGTTAGTGGACGAAATGCTAGCTTCTGGAGTCATTCGCCCAAGCACCAATCCCCACTC
AAGCCCAGTATTATTGGTGAAAAAGGAAGATGGGAGTTGGAGATTCTGTGTAGACTACAGAGCCCTAAATAACGTGAGAATCCCAGATAAATTTTTCCATCCCTGTGATT
GA
mRNA sequenceShow/hide mRNA sequence
ATGGACTGGAAAAATCTCACCATGACTTTCTATCATGAAAATAATAAGGTGGTTATAAGGAGACACCCAAGCTTAACTAAAACCCGAGTAAGCTTGAAGAGTATGATAAA
GGCATGGACAAAATCTGATCAAGGATTCTTGATTGAATGCAGAGCCTTGGAGGGAGGAATATCATTAGCCGATTTGTATGGATTAGATGATGTGCCTAATATACAAGAGT
CTATCCCGGCCGTATTAGCGAAACTCGAAGATGCGTTTGAACGGTCTGAAGAAATACCCTCGACGAGGGAAATCGAGCATCACATACATCTAAAGAAAGGTACAGATCCG
ATGAATGTGAAGTCCTATAGATTTCCATACCAACAAAAGCCTAAGATGGAGAAGTTAGTGGACGAAATGCTAGCTTCTGGAGTCATTCGCCCAAGCACCAATCCCCACTC
AAGCCCAGTATTATTGGTGAAAAAGGAAGATGGGAGTTGGAGATTCTGTGTAGACTACAGAGCCCTAAATAACGTGAGAATCCCAGATAAATTTTTCCATCCCTGTGATT
GA
Protein sequenceShow/hide protein sequence
MDWKNLTMTFYHENNKVVIRRHPSLTKTRVSLKSMIKAWTKSDQGFLIECRALEGGISLADLYGLDDVPNIQESIPAVLAKLEDAFERSEEIPSTREIEHHIHLKKGTDP
MNVKSYRFPYQQKPKMEKLVDEMLASGVIRPSTNPHSSPVLLVKKEDGSWRFCVDYRALNNVRIPDKFFHPCD