; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G17386 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G17386
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionTy3/gypsy retrotransposon protein
Genome locationctg27:1281439..1290750
RNA-Seq ExpressionCucsat.G17386
SyntenyCucsat.G17386
Gene Ontology termsGO:0000413 - protein peptidyl-prolyl isomerization (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0010158 - abaxial cell fate specification (biological process)
GO:0015074 - DNA integration (biological process)
GO:0032259 - methylation (biological process)
GO:0098869 - cellular oxidant detoxification (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003755 - peptidyl-prolyl cis-trans isomerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004601 - peroxidase activity (molecular function)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domainsIPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040484.1 Retrovirus-related Pol polyprotein from transposon 297 family [Cucumis melo var. makuwa]5.83e-8567.71Show/hide
Query:  MQWLGSIETMKINWPFLIMKFWVETRQVTLKGDPSLVRSECSLRTIEKTWEKEDQCFLSELQSYDVEMNEDMGGEREIKGDEEDTPMIRFLLQQYADISE
        MQWL +I TM+I+WP L M FW + RQ+  KGDPSL+R++ SL+TIEKTWE +DQ FL E  + ++   +D   ++E++GDE D PMIRFLLQQYADI E
Subjt:  MQWLGSIETMKINWPFLIMKFWVETRQVTLKGDPSLVRSECSLRTIEKTWEKEDQCFLSELQSYDVEMNEDMGGEREIKGDEEDTPMIRFLLQQYADISE

Query:  YPKELPPKREVDHRILMMPKQKPINVRPYKYGHVQKEEIEKLVEEMLQARMIRPSHSPYSSPVLLVKKKDGEWRFCLDYRKLNQATAADKFP
         PK LPPKRE+DHRIL +P+Q+PINVRPY+YGHVQ+EEIEKLV EMLQ  +IRPS SPYSSPVLLVKKKDG WRF +DYRKLNQAT +DKFP
Subjt:  YPKELPPKREVDHRILMMPKQKPINVRPYKYGHVQKEEIEKLVEEMLQARMIRPSHSPYSSPVLLVKKKDGEWRFCLDYRKLNQATAADKFP

KAA0063771.1 peroxidase 64 [Cucumis melo var. makuwa]1.13e-9068.56Show/hide
Query:  MQWLGSIETMKINWPFLIMKFWVETRQVTLKGDPSLVRSECSLRTIEKTWEKEDQCFLSELQSYDVEMNEDMGGER--EIKGDEEDTPMIRFLLQQYADI
        MQWL +  TM+I+WP L M FW + +Q+ LKGDPSL++ EC+L+TIEKTWE +DQ FL E  +++       G E+  +++GDE D PMI+FLLQQYADI
Subjt:  MQWLGSIETMKINWPFLIMKFWVETRQVTLKGDPSLVRSECSLRTIEKTWEKEDQCFLSELQSYDVEMNEDMGGER--EIKGDEEDTPMIRFLLQQYADI

Query:  SEYPKELPPKREVDHRILMMPKQKPINVRPYKYGHVQKEEIEKLVEEMLQARMIRPSHSPYSSPVLLVKKKDGEWRFCLDYRKLNQATAADKFP
         E PK LPPKRE+DHRIL MP Q+PINVRPYKYGHVQKEEIEKLV EMLQ  +IRPSHSPYSSPVLLVKKKDG WRFC+DYRKLNQAT +DKFP
Subjt:  SEYPKELPPKREVDHRILMMPKQKPINVRPYKYGHVQKEEIEKLVEEMLQARMIRPSHSPYSSPVLLVKKKDGEWRFCLDYRKLNQATAADKFP

TYK14439.1 uncharacterized protein E5676_scaffold186G00980 [Cucumis melo var. makuwa]1.21e-8469.79Show/hide
Query:  MQWLGSIETMKINWPFLIMKFWVETRQVTLKGDPSLVRSECSLRTIEKTWEKEDQCFLSELQSYDVEMNEDMGGEREIKGDEEDTPMIRFLLQQYADISE
        MQWL +  TM+I+WP L M FW E RQ+ LKGDPSL+++ECSL+T+EKTW+ +DQ FL E  + ++   ED   ++E+KGDE D PMIRFLLQQYADI  
Subjt:  MQWLGSIETMKINWPFLIMKFWVETRQVTLKGDPSLVRSECSLRTIEKTWEKEDQCFLSELQSYDVEMNEDMGGEREIKGDEEDTPMIRFLLQQYADISE

Query:  YPKELPPKREVDHRILMMPKQKPINVRPYKYGHVQKEEIEKLVEEMLQARMIRPSHSPYSSPVLLVKKKDGEWRFCLDYRKLNQATAADKFP
         PK LPPKRE+DHRIL MP Q+PINVRPYKYGHVQKEEIE LV EMLQ  +IRPSHSPYSSPVLLVK+KDG WRFC+DYRKLNQAT +DKFP
Subjt:  YPKELPPKREVDHRILMMPKQKPINVRPYKYGHVQKEEIEKLVEEMLQARMIRPSHSPYSSPVLLVKKKDGEWRFCLDYRKLNQATAADKFP

XP_016901651.1 PREDICTED: uncharacterized protein LOC103495179 [Cucumis melo]7.98e-8667.71Show/hide
Query:  MQWLGSIETMKINWPFLIMKFWVETRQVTLKGDPSLVRSECSLRTIEKTWEKEDQCFLSELQSYDVEMNEDMGGEREIKGDEEDTPMIRFLLQQYADISE
        MQWL    TM+I+WP L M FW E RQ+ LKGDPSL+++ECSL+TIEKTWE +D+ FL E  ++++  ++     +E++GDE D PMIRFLL QYADI E
Subjt:  MQWLGSIETMKINWPFLIMKFWVETRQVTLKGDPSLVRSECSLRTIEKTWEKEDQCFLSELQSYDVEMNEDMGGEREIKGDEEDTPMIRFLLQQYADISE

Query:  YPKELPPKREVDHRILMMPKQKPINVRPYKYGHVQKEEIEKLVEEMLQARMIRPSHSPYSSPVLLVKKKDGEWRFCLDYRKLNQATAADKFP
         PK LPPKRE+D+RIL +P+Q+PIN RPYKYGHVQKEEIEKLV EMLQ  +IRPS SPYSSPVLLVKKKDG WRFC+DYRKLNQAT +DKFP
Subjt:  YPKELPPKREVDHRILMMPKQKPINVRPYKYGHVQKEEIEKLVEEMLQARMIRPSHSPYSSPVLLVKKKDGEWRFCLDYRKLNQATAADKFP

XP_031744430.1 uncharacterized protein LOC116405043 [Cucumis sativus]7.56e-8770.31Show/hide
Query:  MQWLGSIETMKINWPFLIMKFWVETRQVTLKGDPSLVRSECSLRTIEKTWEKEDQCFLSELQSYDVEMNEDMGGEREIKGDEEDTPMIRFLLQQYADISE
        MQWL +  TMKI+WP L M FW+  +++ LKGDPSL+++ECSL+TIEKTWE EDQ FL E Q Y+V+   ++  E+E+K DEE+ PMI+ LLQQYADI E
Subjt:  MQWLGSIETMKINWPFLIMKFWVETRQVTLKGDPSLVRSECSLRTIEKTWEKEDQCFLSELQSYDVEMNEDMGGEREIKGDEEDTPMIRFLLQQYADISE

Query:  YPKELPPKREVDHRILMMPKQKPINVRPYKYGHVQKEEIEKLVEEMLQARMIRPSHSPYSSPVLLVKKKDGEWRFCLDYRKLNQATAADKFP
         PK+LPPKRE+DHRIL++P Q+PINVRPYKYG+VQKEEIEKLV EMLQA +IRPSHSPYSSPVLLVKKKDG WRFC+DYRKLNQ T +DKFP
Subjt:  YPKELPPKREVDHRILMMPKQKPINVRPYKYGHVQKEEIEKLVEEMLQARMIRPSHSPYSSPVLLVKKKDGEWRFCLDYRKLNQATAADKFP

TrEMBL top hitse value%identityAlignment
A0A1S4E096 uncharacterized protein LOC1034951793.86e-8667.71Show/hide
Query:  MQWLGSIETMKINWPFLIMKFWVETRQVTLKGDPSLVRSECSLRTIEKTWEKEDQCFLSELQSYDVEMNEDMGGEREIKGDEEDTPMIRFLLQQYADISE
        MQWL    TM+I+WP L M FW E RQ+ LKGDPSL+++ECSL+TIEKTWE +D+ FL E  ++++  ++     +E++GDE D PMIRFLL QYADI E
Subjt:  MQWLGSIETMKINWPFLIMKFWVETRQVTLKGDPSLVRSECSLRTIEKTWEKEDQCFLSELQSYDVEMNEDMGGEREIKGDEEDTPMIRFLLQQYADISE

Query:  YPKELPPKREVDHRILMMPKQKPINVRPYKYGHVQKEEIEKLVEEMLQARMIRPSHSPYSSPVLLVKKKDGEWRFCLDYRKLNQATAADKFP
         PK LPPKRE+D+RIL +P+Q+PIN RPYKYGHVQKEEIEKLV EMLQ  +IRPS SPYSSPVLLVKKKDG WRFC+DYRKLNQAT +DKFP
Subjt:  YPKELPPKREVDHRILMMPKQKPINVRPYKYGHVQKEEIEKLVEEMLQARMIRPSHSPYSSPVLLVKKKDGEWRFCLDYRKLNQATAADKFP

A0A5A7TAI9 Retrovirus-related Pol polyprotein from transposon 297 family2.82e-8567.71Show/hide
Query:  MQWLGSIETMKINWPFLIMKFWVETRQVTLKGDPSLVRSECSLRTIEKTWEKEDQCFLSELQSYDVEMNEDMGGEREIKGDEEDTPMIRFLLQQYADISE
        MQWL +I TM+I+WP L M FW + RQ+  KGDPSL+R++ SL+TIEKTWE +DQ FL E  + ++   +D   ++E++GDE D PMIRFLLQQYADI E
Subjt:  MQWLGSIETMKINWPFLIMKFWVETRQVTLKGDPSLVRSECSLRTIEKTWEKEDQCFLSELQSYDVEMNEDMGGEREIKGDEEDTPMIRFLLQQYADISE

Query:  YPKELPPKREVDHRILMMPKQKPINVRPYKYGHVQKEEIEKLVEEMLQARMIRPSHSPYSSPVLLVKKKDGEWRFCLDYRKLNQATAADKFP
         PK LPPKRE+DHRIL +P+Q+PINVRPY+YGHVQ+EEIEKLV EMLQ  +IRPS SPYSSPVLLVKKKDG WRF +DYRKLNQAT +DKFP
Subjt:  YPKELPPKREVDHRILMMPKQKPINVRPYKYGHVQKEEIEKLVEEMLQARMIRPSHSPYSSPVLLVKKKDGEWRFCLDYRKLNQATAADKFP

A0A5D3C0M4 Peroxidase 645.46e-9168.56Show/hide
Query:  MQWLGSIETMKINWPFLIMKFWVETRQVTLKGDPSLVRSECSLRTIEKTWEKEDQCFLSELQSYDVEMNEDMGGER--EIKGDEEDTPMIRFLLQQYADI
        MQWL +  TM+I+WP L M FW + +Q+ LKGDPSL++ EC+L+TIEKTWE +DQ FL E  +++       G E+  +++GDE D PMI+FLLQQYADI
Subjt:  MQWLGSIETMKINWPFLIMKFWVETRQVTLKGDPSLVRSECSLRTIEKTWEKEDQCFLSELQSYDVEMNEDMGGER--EIKGDEEDTPMIRFLLQQYADI

Query:  SEYPKELPPKREVDHRILMMPKQKPINVRPYKYGHVQKEEIEKLVEEMLQARMIRPSHSPYSSPVLLVKKKDGEWRFCLDYRKLNQATAADKFP
         E PK LPPKRE+DHRIL MP Q+PINVRPYKYGHVQKEEIEKLV EMLQ  +IRPSHSPYSSPVLLVKKKDG WRFC+DYRKLNQAT +DKFP
Subjt:  SEYPKELPPKREVDHRILMMPKQKPINVRPYKYGHVQKEEIEKLVEEMLQARMIRPSHSPYSSPVLLVKKKDGEWRFCLDYRKLNQATAADKFP

A0A5D3CW02 Uncharacterized protein5.86e-8569.79Show/hide
Query:  MQWLGSIETMKINWPFLIMKFWVETRQVTLKGDPSLVRSECSLRTIEKTWEKEDQCFLSELQSYDVEMNEDMGGEREIKGDEEDTPMIRFLLQQYADISE
        MQWL +  TM+I+WP L M FW E RQ+ LKGDPSL+++ECSL+T+EKTW+ +DQ FL E  + ++   ED   ++E+KGDE D PMIRFLLQQYADI  
Subjt:  MQWLGSIETMKINWPFLIMKFWVETRQVTLKGDPSLVRSECSLRTIEKTWEKEDQCFLSELQSYDVEMNEDMGGEREIKGDEEDTPMIRFLLQQYADISE

Query:  YPKELPPKREVDHRILMMPKQKPINVRPYKYGHVQKEEIEKLVEEMLQARMIRPSHSPYSSPVLLVKKKDGEWRFCLDYRKLNQATAADKFP
         PK LPPKRE+DHRIL MP Q+PINVRPYKYGHVQKEEIE LV EMLQ  +IRPSHSPYSSPVLLVK+KDG WRFC+DYRKLNQAT +DKFP
Subjt:  YPKELPPKREVDHRILMMPKQKPINVRPYKYGHVQKEEIEKLVEEMLQARMIRPSHSPYSSPVLLVKKKDGEWRFCLDYRKLNQATAADKFP

A0A5D3E328 Reverse transcriptase9.57e-8567.71Show/hide
Query:  MQWLGSIETMKINWPFLIMKFWVETRQVTLKGDPSLVRSECSLRTIEKTWEKEDQCFLSELQSYDVEMNEDMGGEREIKGDEEDTPMIRFLLQQYADISE
        MQWL    TM+I+WP L M FW E RQ+ LKGDPSL+++ECSL+TIEKTWE +D+ FL E  ++++  ++     +E++GDE D PMIRFLL QYADI E
Subjt:  MQWLGSIETMKINWPFLIMKFWVETRQVTLKGDPSLVRSECSLRTIEKTWEKEDQCFLSELQSYDVEMNEDMGGEREIKGDEEDTPMIRFLLQQYADISE

Query:  YPKELPPKREVDHRILMMPKQKPINVRPYKYGHVQKEEIEKLVEEMLQARMIRPSHSPYSSPVLLVKKKDGEWRFCLDYRKLNQATAADKFP
         PK LPPKRE+D+RIL +P+Q+PIN RPYKYGHVQKEEIEKLV EMLQ  +IRPS SPYSSPVLLVKKKDG WRFC+DYRKLNQAT +DKFP
Subjt:  YPKELPPKREVDHRILMMPKQKPINVRPYKYGHVQKEEIEKLVEEMLQARMIRPSHSPYSSPVLLVKKKDGEWRFCLDYRKLNQATAADKFP

SwissProt top hitse value%identityAlignment
P10394 Retrovirus-related Pol polyprotein from transposon 4121.2e-1032.41Show/hide
Query:  LLQQYADISEYPKELPPKREVDHRILMMPKQKPINVRPYKYGHVQKEEIEKLVEEMLQARMIRPSHSPYSSPVLLVKKKDG------EWRFCLDYRKLNQ
        +  +Y DI     E      +  + L +   +P+  + Y+  H Q EEI+  V+++++ +++ PS S Y+SP+LLV KK        +WR  +DYR++N+
Subjt:  LLQQYADISEYPKELPPKREVDHRILMMPKQKPINVRPYKYGHVQKEEIEKLVEEMLQARMIRPSHSPYSSPVLLVKKKDG------EWRFCLDYRKLNQ

Query:  ATAADKFP
           ADKFP
Subjt:  ATAADKFP

P20825 Retrovirus-related Pol polyprotein from transposon 2976.6e-0930.77Show/hide
Query:  DEEDTPMIRFLLQQYADISEYPKELPPKREVDHRILMMPKQKPINVRPYKYGHVQKEEIEKLVEEMLQARMIRPSHSPYSSPVLLVKKKD-----GEWRF
        ++E+T  ++ LL ++ ++     E          +L      PI  + Y      + E+E  V+EML   +IR S+SPY+SP  +V KK       ++R 
Subjt:  DEEDTPMIRFLLQQYADISEYPKELPPKREVDHRILMMPKQKPINVRPYKYGHVQKEEIEKLVEEMLQARMIRPSHSPYSSPVLLVKKKD-----GEWRF

Query:  CLDYRKLNQATAADKFP
         +DYRKLN+ T  D++P
Subjt:  CLDYRKLNQATAADKFP

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein1.4e-1441.28Show/hide
Query:  FLLQQYADISEYPKELPPKR------EVDHRILMMPKQKPINVRPYKYGHVQKEEIEKLVEEMLQARMIRPSHSPYSSPVLLVKKKDGEWRFCLDYRKLN
        +L Q+Y +I     +LPP+        V H I + P  +   ++PY      ++EI K+V+++L  + I PS SP SSPV+LV KKDG +R C+DYR LN
Subjt:  FLLQQYADISEYPKELPPKR------EVDHRILMMPKQKPINVRPYKYGHVQKEEIEKLVEEMLQARMIRPSHSPYSSPVLLVKKKDGEWRFCLDYRKLN

Query:  QATAADKFP
        +AT +D FP
Subjt:  QATAADKFP

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus6.6e-0937.66Show/hide
Query:  QKPINVRPYKYGHVQKEEIEKLVEEMLQARMIRPSHSPYSSPVLLVKKK-----DGEWRFCLDYRKLNQATAADKFP
        Q PI  + Y Y    + E+E+ ++E+LQ  +IRPS+SPY+SP+ +V KK     + ++R  +D+++LN  T  D +P
Subjt:  QKPINVRPYKYGHVQKEEIEKLVEEMLQARMIRPSHSPYSSPVLLVKKK-----DGEWRFCLDYRKLNQATAADKFP

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.4e-1441.28Show/hide
Query:  FLLQQYADISEYPKELPPKR------EVDHRILMMPKQKPINVRPYKYGHVQKEEIEKLVEEMLQARMIRPSHSPYSSPVLLVKKKDGEWRFCLDYRKLN
        +L Q+Y +I     +LPP+        V H I + P  +   ++PY      ++EI K+V+++L  + I PS SP SSPV+LV KKDG +R C+DYR LN
Subjt:  FLLQQYADISEYPKELPPKR------EVDHRILMMPKQKPINVRPYKYGHVQKEEIEKLVEEMLQARMIRPSHSPYSSPVLLVKKKDGEWRFCLDYRKLN

Query:  QATAADKFP
        +AT +D FP
Subjt:  QATAADKFP

Arabidopsis top hitse value%identityAlignment
ATMG00850.1 DNA/RNA polymerases superfamily protein1.7e-0745.76Show/hide
Query:  VQKEEIEKLVEEMLQARMIRPSHSPYSSPVLLVKKKDGEWRFCLDYRKLNQATAADKFP
        +++  ++  + EML+AR+I+PS SPYSSPVLLV+KKDG W        L QA    + P
Subjt:  VQKEEIEKLVEEMLQARMIRPSHSPYSSPVLLVKKKDGEWRFCLDYRKLNQATAADKFP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGTGGCTAGGTAGCATTGAAACTATGAAAATCAATTGGCCTTTCCTGATTATGAAATTTTGGGTTGAAACAAGGCAGGTTACTCTTAAGGGGGATCCCTCCTTGGT
GAGATCAGAATGTTCTTTGAGAACGATTGAGAAGACATGGGAAAAGGAAGACCAATGTTTCCTTTCAGAACTACAAAGCTATGATGTAGAAATGAATGAAGACATGGGAG
GAGAACGAGAAATTAAGGGAGACGAAGAAGACACTCCTATGATAAGGTTTCTGTTACAACAATATGCGGATATTTCTGAATATCCAAAAGAGTTACCTCCTAAAAGAGAA
GTTGATCACCGAATCTTGATGATGCCCAAGCAAAAACCAATTAACGTGAGACCTTACAAGTATGGACATGTACAAAAGGAAGAGATTGAGAAATTAGTAGAGGAGATGCT
CCAAGCTAGGATGATAAGACCGAGTCACAGCCCCTATTCTAGCCCTGTTTTATTGGTGAAAAAGAAAGATGGAGAGTGGAGATTTTGTCTAGACTACCGAAAGCTCAACC
AAGCCACTGCAGCTGACAAATTCCCAACCCTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCAGTGGCTAGGTAGCATTGAAACTATGAAAATCAATTGGCCTTTCCTGATTATGAAATTTTGGGTTGAAACAAGGCAGGTTACTCTTAAGGGGGATCCCTCCTTGGT
GAGATCAGAATGTTCTTTGAGAACGATTGAGAAGACATGGGAAAAGGAAGACCAATGTTTCCTTTCAGAACTACAAAGCTATGATGTAGAAATGAATGAAGACATGGGAG
GAGAACGAGAAATTAAGGGAGACGAAGAAGACACTCCTATGATAAGGTTTCTGTTACAACAATATGCGGATATTTCTGAATATCCAAAAGAGTTACCTCCTAAAAGAGAA
GTTGATCACCGAATCTTGATGATGCCCAAGCAAAAACCAATTAACGTGAGACCTTACAAGTATGGACATGTACAAAAGGAAGAGATTGAGAAATTAGTAGAGGAGATGCT
CCAAGCTAGGATGATAAGACCGAGTCACAGCCCCTATTCTAGCCCTGTTTTATTGGTGAAAAAGAAAGATGGAGAGTGGAGATTTTGTCTAGACTACCGAAAGCTCAACC
AAGCCACTGCAGCTGACAAATTCCCAACCCTGTAA
Protein sequenceShow/hide protein sequence
MQWLGSIETMKINWPFLIMKFWVETRQVTLKGDPSLVRSECSLRTIEKTWEKEDQCFLSELQSYDVEMNEDMGGEREIKGDEEDTPMIRFLLQQYADISEYPKELPPKRE
VDHRILMMPKQKPINVRPYKYGHVQKEEIEKLVEEMLQARMIRPSHSPYSSPVLLVKKKDGEWRFCLDYRKLNQATAADKFPTL