; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G4207 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G4207
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationctg1057:95814..97614
RNA-Seq ExpressionCucsat.G4207
SyntenyCucsat.G4207
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0000325 - plant-type vacuole (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016301 - kinase activity (molecular function)
InterPro domainsIPR025724 - GAG-pre-integrase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031744711.1 uncharacterized protein LOC101207712 isoform X1 [Cucumis sativus]1.03e-79100Show/hide
Query:  DRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWG
        DRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWG
Subjt:  DRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWG

Query:  PCPVVSQTGFRYFVT
        PCPVVSQTGFRYFVT
Subjt:  PCPVVSQTGFRYFVT

XP_031744712.1 uncharacterized protein LOC101207712 isoform X2 [Cucumis sativus]3.56e-102100Show/hide
Query:  MSLLSFRITKSHYKRHLPLLRLHPLLPQDRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFA
        MSLLSFRITKSHYKRHLPLLRLHPLLPQDRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFA
Subjt:  MSLLSFRITKSHYKRHLPLLRLHPLLPQDRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFA

Query:  KFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVT
        KFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVT
Subjt:  KFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVT

XP_031744753.1 uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus]5.58e-7399.14Show/hide
Query:  QDRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIW
        QDRVTKKIIGRGYESGGLY FDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIW
Subjt:  QDRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIW

Query:  GPCPVVSQTGFRYFVT
        GPCPVVSQTGFRYFVT
Subjt:  GPCPVVSQTGFRYFVT

XP_031744756.1 uncharacterized protein LOC101212255 isoform X4 [Cucumis sativus]1.23e-7298.28Show/hide
Query:  QDRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIW
        +DRVTKKIIGRGYESGGLY FDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIW
Subjt:  QDRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIW

Query:  GPCPVVSQTGFRYFVT
        GPCPVVSQTGFRYFVT
Subjt:  GPCPVVSQTGFRYFVT

XP_031744758.1 uncharacterized protein LOC101212255 isoform X5 [Cucumis sativus]8.27e-9499.3Show/hide
Query:  MSLLSFRITKSHYKRHLPLLRLHPLLPQDRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFA
        MSLLSFRITKSHYKRHLPLLRLHPLLPQDRVTKKIIGRGYESGGLY FDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFA
Subjt:  MSLLSFRITKSHYKRHLPLLRLHPLLPQDRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFA

Query:  KFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVT
        KFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVT
Subjt:  KFHRLSSSPRVDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVT

TrEMBL top hitse value%identityAlignment
A0A061EWC9 Integrase catalytic domain-containing protein3.23e-4561.86Show/hide
Query:  LPQDRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSD
        L +D  TK+IIG+G ESGGLY+ D QV +++AC  V +PFEVHCRLGHPSL  LKKL P+F+++ SL+C+SC FAK HRL S  RV+KR ++ FELVHSD
Subjt:  LPQDRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSD

Query:  IWGPCPVVSQTGFRYFVT
        +WGPCP+VS++ F+YF T
Subjt:  IWGPCPVVSQTGFRYFVT

A0A0V0I0V2 Putative retrovirus-related Pol polyprotein from transposon TNT 1-94-like1.41e-4361.06Show/hide
Query:  VTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPC
        +T   IG+G+ S  LY  D + S+++AC  V SPFE HCRLGHPSL +LKKL P+F+++ SL C+SC+FAK HR S  PRV+KRA + FELVHSD+WGPC
Subjt:  VTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGPC

Query:  PVVSQTGFRYFVT
        PVVS+TG +YFVT
Subjt:  PVVSQTGFRYFVT

A0A438D334 Retrovirus-related Pol polyprotein from transposon RE23.10e-4362.71Show/hide
Query:  LPQDRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSD
        L  D +TK+  G+G+ S GLY  D  V + VAC    SP E HCRLGHPSL VLKKL P+F +L SL+C+SC FAK HR S  PR++KRA + FELVHSD
Subjt:  LPQDRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSD

Query:  IWGPCPVVSQTGFRYFVT
        +WGPCPV SQTGFRYFVT
Subjt:  IWGPCPVVSQTGFRYFVT

A0A5D3BF36 Cysteine-rich RLK (RECEPTOR-like protein kinase) 82.35e-4488.51Show/hide
Query:  DRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRA
        D VTKKIIGRGYES  LY FDHQV Q VACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAK HRLSSS RV+KR+
Subjt:  DRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRA

Q6L3Q0 Polyprotein, putative3.14e-4362.93Show/hide
Query:  QDRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIW
        QD +TK+IIG+ + S GLY  D     +VAC  + SPFE HCRLGHPSL VLKKL P+F ++ S++C+SC FAK HR+S SPR +KRA   FELVHSD+W
Subjt:  QDRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIW

Query:  GPCPVVSQTGFRYFVT
        GPCPVVS+ GFRYFVT
Subjt:  GPCPVVSQTGFRYFVT

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-946.5e-0631.71Show/hide
Query:  RVTK--KIIGRGYESGGLYFFDHQVSQAV--ACPVVPSPFEVHCRLGHPS-----LFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFE
        R+TK   +I +G   G LY  + ++ Q    A     S    H R+GH S     +   K L    +  +   CD C F K HR+S      +R +   +
Subjt:  RVTK--KIIGRGYESGGLYFFDHQVSQAV--ACPVVPSPFEVHCRLGHPS-----LFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFE

Query:  LVHSDIWGPCPVVSQTGFRYFVT
        LV+SD+ GP  + S  G +YFVT
Subjt:  LVHSDIWGPCPVVSQTGFRYFVT

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTTGCTAAGTTTCAGAATTACCAAGAGTCATTACAAGCGTCATCTTCCTCTACTCCGATTGCATCCACTGTTGCCCCAGGATCGTGTGACGAAGAAGATTATTGG
TAGAGGATATGAGTCAGGAGGCCTTTATTTCTTTGATCATCAAGTATCGCAAGCTGTGGCGTGTCCTGTCGTTCCCTCTCCTTTTGAAGTCCATTGTCGTTTAGGTCATC
CATCTTTGTTTGTGTTGAAGAAACTTTATCCAGAATTTAGGTCTTTGTCCTCTTTAAATTGTGATTCGTGTCAATTTGCGAAATTTCATCGTCTTAGTTCGAGTCCTCGA
GTCGATAAACGAGCAATTGCTCCATTTGAGTTAGTTCATTCTGATATTTGGGGTCCGTGTCCAGTTGTATCTCAAACAGGCTTTCGTTATTTTGTTACT
mRNA sequenceShow/hide mRNA sequence
ATGAGTTTGCTAAGTTTCAGAATTACCAAGAGTCATTACAAGCGTCATCTTCCTCTACTCCGATTGCATCCACTGTTGCCCCAGGATCGTGTGACGAAGAAGATTATTGG
TAGAGGATATGAGTCAGGAGGCCTTTATTTCTTTGATCATCAAGTATCGCAAGCTGTGGCGTGTCCTGTCGTTCCCTCTCCTTTTGAAGTCCATTGTCGTTTAGGTCATC
CATCTTTGTTTGTGTTGAAGAAACTTTATCCAGAATTTAGGTCTTTGTCCTCTTTAAATTGTGATTCGTGTCAATTTGCGAAATTTCATCGTCTTAGTTCGAGTCCTCGA
GTCGATAAACGAGCAATTGCTCCATTTGAGTTAGTTCATTCTGATATTTGGGGTCCGTGTCCAGTTGTATCTCAAACAGGCTTTCGTTATTTTGTTACT
Protein sequenceShow/hide protein sequence
MSLLSFRITKSHYKRHLPLLRLHPLLPQDRVTKKIIGRGYESGGLYFFDHQVSQAVACPVVPSPFEVHCRLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPR
VDKRAIAPFELVHSDIWGPCPVVSQTGFRYFVT