; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G12290 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G12290
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationChr4:10638133..10639140
RNA-Seq ExpressionCSPI04G12290
SyntenyCSPI04G12290
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031744753.1 uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus]1.2e-18898.51Show/hide
Query:  MLPSSCDPAPSDDLPIALRKGKRKCTYPISSFIFYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKW
        MLPSSCDPAPSDDLPIALRKGKRKCTYP+SSFI YHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKW
Subjt:  MLPSSCDPAPSDDLPIALRKGKRKCTYPISSFIFYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKW

Query:  VFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVSKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKS
        VFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPV+KLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKS
Subjt:  VFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVSKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKS

Query:  LYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGILSLKTLLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQ
        LYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGI SLKT LQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQ
Subjt:  LYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGILSLKTLLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQ

Query:  RKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGEL
        RKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGEL
Subjt:  RKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGEL

XP_031744754.1 uncharacterized protein LOC101212255 isoform X2 [Cucumis sativus]1.2e-18898.51Show/hide
Query:  MLPSSCDPAPSDDLPIALRKGKRKCTYPISSFIFYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKW
        MLPSSCDPAPSDDLPIALRKGKRKCTYP+SSFI YHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKW
Subjt:  MLPSSCDPAPSDDLPIALRKGKRKCTYPISSFIFYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKW

Query:  VFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVSKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKS
        VFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPV+KLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKS
Subjt:  VFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVSKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKS

Query:  LYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGILSLKTLLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQ
        LYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGI SLKT LQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQ
Subjt:  LYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGILSLKTLLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQ

Query:  RKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGEL
        RKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGEL
Subjt:  RKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGEL

XP_031744755.1 uncharacterized protein LOC101212255 isoform X3 [Cucumis sativus]1.2e-18898.51Show/hide
Query:  MLPSSCDPAPSDDLPIALRKGKRKCTYPISSFIFYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKW
        MLPSSCDPAPSDDLPIALRKGKRKCTYP+SSFI YHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKW
Subjt:  MLPSSCDPAPSDDLPIALRKGKRKCTYPISSFIFYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKW

Query:  VFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVSKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKS
        VFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPV+KLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKS
Subjt:  VFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVSKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKS

Query:  LYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGILSLKTLLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQ
        LYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGI SLKT LQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQ
Subjt:  LYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGILSLKTLLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQ

Query:  RKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGEL
        RKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGEL
Subjt:  RKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGEL

XP_031744756.1 uncharacterized protein LOC101212255 isoform X4 [Cucumis sativus]1.2e-18898.51Show/hide
Query:  MLPSSCDPAPSDDLPIALRKGKRKCTYPISSFIFYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKW
        MLPSSCDPAPSDDLPIALRKGKRKCTYP+SSFI YHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKW
Subjt:  MLPSSCDPAPSDDLPIALRKGKRKCTYPISSFIFYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKW

Query:  VFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVSKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKS
        VFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPV+KLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKS
Subjt:  VFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVSKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKS

Query:  LYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGILSLKTLLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQ
        LYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGI SLKT LQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQ
Subjt:  LYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGILSLKTLLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQ

Query:  RKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGEL
        RKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGEL
Subjt:  RKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGEL

XP_031744758.1 uncharacterized protein LOC101212255 isoform X5 [Cucumis sativus]1.2e-18898.51Show/hide
Query:  MLPSSCDPAPSDDLPIALRKGKRKCTYPISSFIFYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKW
        MLPSSCDPAPSDDLPIALRKGKRKCTYP+SSFI YHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKW
Subjt:  MLPSSCDPAPSDDLPIALRKGKRKCTYPISSFIFYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKW

Query:  VFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVSKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKS
        VFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPV+KLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKS
Subjt:  VFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVSKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKS

Query:  LYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGILSLKTLLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQ
        LYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGI SLKT LQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQ
Subjt:  LYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGILSLKTLLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQ

Query:  RKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGEL
        RKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGEL
Subjt:  RKYVLDLLSETGKLGAKPSGTPMMPNQQLVKEGEL

TrEMBL top hitse value%identityAlignment
A0A438D2Z0 Retrovirus-related Pol polyprotein from transposon RE12.2e-13268.98Show/hide
Query:  PSSCDPAPSDDLPIALRKGKRKC--TYPISSFIFYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKW
        PSS DP+   DLPI+LRKGKR C   Y I++F+ Y  LS S+   + S++S S+P +V EAL+HPGW+NAM+EE+ AL+DN TW LV  P GKK +GCKW
Subjt:  PSSCDPAPSDDLPIALRKGKRKC--TYPISSFIFYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKW

Query:  VFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVSKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKS
        VFA+K+NPDG+VARLKARLVA+GYAQ YG DYSDTFSPV+KL S+RLF+S+AA+ +W +HQLDIKNAFLHGDL+EEVY+EQPPGFVAQGE  KVCRL+K+
Subjt:  VFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVSKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKS

Query:  LYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGILSLKTLLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQ
        LYGLKQSPRAWFGKFS+ +  FGM KS  DHSVFY++S  GI+LLVVYVDDIVITGND  GI  LKT +  +F+TKDLG+LKYFLGIEV RSKKG++LSQ
Subjt:  LYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGILSLKTLLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQ

Query:  RKYVLDLLSETGKLGAKPSGTPMMPNQQLVKE
        RKYVLDLL ETGK+ AKP  TPM+PN QL+ +
Subjt:  RKYVLDLLSETGKLGAKPSGTPMMPNQQLVKE

A0A438GAA6 Retrovirus-related Pol polyprotein from transposon TNT 1-941.3e-13269.28Show/hide
Query:  PSSCDPAPSDDLPIALRKGKRKC--TYPISSFIFYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKW
        PSS DP+   DLPI+LRKGKR C   Y I++F+ Y  LS S+   + S++S S+P +V EAL+HPGW+NAM+EE+ AL+DN TW LV  P GKK +GCKW
Subjt:  PSSCDPAPSDDLPIALRKGKRKC--TYPISSFIFYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKW

Query:  VFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVSKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKS
        VFAVK+NPDG+VARLKARLVA+GYAQ YG DYSDTFSPV+KL S+RLF+S+AA+ +W +HQLDIKNAFLHGDL+EEVY+EQPPGFVAQGE  KVCRL+K+
Subjt:  VFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVSKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKS

Query:  LYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGILSLKTLLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQ
        LYGLKQSPRAWFGKFS+ +  FGM KS  DHSVFY++S  GI+LLVVYVDDIVITGND  GI  LKT +  +F+TKDLG+LKYFLGIEV RSKKG++LSQ
Subjt:  LYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGILSLKTLLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQ

Query:  RKYVLDLLSETGKLGAKPSGTPMMPNQQLVKE
        RKYVLDLL ETGK+ AKP  TPM+PN QL+ +
Subjt:  RKYVLDLLSETGKLGAKPSGTPMMPNQQLVKE

A0A438HEX0 Retrovirus-related Pol polyprotein from transposon TNT 1-943.8e-13269.28Show/hide
Query:  PSSCDPAPSDDLPIALRKGKRKC--TYPISSFIFYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKW
        PSS DP+   DLPI+LRKGKR C   Y I++F+ Y  LS S+   + S++S S+P +V EAL+HPGW+NAM+EE+ AL DN TW LV  P GKK +GCKW
Subjt:  PSSCDPAPSDDLPIALRKGKRKC--TYPISSFIFYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKW

Query:  VFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVSKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKS
        VFAVK+NPDG+VARLKARLVA+GYAQ YG DYSDTFSPV+KL S+RLF+S+AA+ +W +HQLDIKNAFLHGDL+EEVY+EQPPGFVAQGE  KVCRL+K+
Subjt:  VFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVSKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKS

Query:  LYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGILSLKTLLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQ
        LYGLKQSPRAWFGKFS+ +  FGM KS  DHSVFY++S  GI+LLVVYVDDIVITGND  GI  LKT +  +F+TKDLG+LKYFLGIEV RSKKG++LSQ
Subjt:  LYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGILSLKTLLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQ

Query:  RKYVLDLLSETGKLGAKPSGTPMMPNQQLVKE
        RKYVLDLL ETGK+ AKP  TPM+PN QL+ +
Subjt:  RKYVLDLLSETGKLGAKPSGTPMMPNQQLVKE

A0A6V7NF38 Reverse transcriptase Ty1/copia-type domain-containing protein4.6e-13877.16Show/hide
Query:  DPAPSDDLPIALRKGKRKCTYPISSFIFYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKM
        D A + DLPIALRKGKR CTYPISSF+ Y  LS S   FITSLES S+P SV EALSHPGW+ AM EEM ALD NGTW+LV  PA K+AIGCKWVF VKM
Subjt:  DPAPSDDLPIALRKGKRKCTYPISSFIFYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKM

Query:  NPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVSKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQ
        NPDG+VARLKARLVAKGYAQ YG DYSDTFSPV+KL SIRLF+S+ AT+ W LHQLDIKNAFL GDLQEEVYMEQPPGFVAQGE  +VCRLRKSLYGLKQ
Subjt:  NPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVSKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQ

Query:  SPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGILSLKTLLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLD
        SPRAWFG+FS+ +  FGMKKS  DHSVFYR SE  I+LLVVYVDDIVITGND  GI SLK  LQ QF TKDLG LKYFLGIEV R K+GI+LSQRKYVLD
Subjt:  SPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGILSLKTLLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLD

Query:  LLSETGKLGAKPSGTPMMPNQQLV
        LL+ETGKLGAKP   PM PN QL+
Subjt:  LLSETGKLGAKPSGTPMMPNQQLV

B0FBS2 Uncharacterized protein1.3e-13269.28Show/hide
Query:  PSSCDPAPSDDLPIALRKGKRKC--TYPISSFIFYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKW
        PSS DP+   DLPI+LRKGKR C   Y I++F+ Y  LS S+   + S++S S+P +V EAL+HPGW+NAM+EE+ AL+DN TW LV  P GKK +GCKW
Subjt:  PSSCDPAPSDDLPIALRKGKRKC--TYPISSFIFYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKW

Query:  VFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVSKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKS
        VFAVK+NPDG+VARLKARLVA+GYAQ YG DYSDTFSPV+KL S+RLF+S+AA+ +W +HQLDIKNAFLHGDL+EEVY+EQPPGFVAQGE  KVCRL+K+
Subjt:  VFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVSKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKS

Query:  LYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGILSLKTLLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQ
        LYGLKQSPRAWFGKFS+ +  FGM KS  DHSVFY++S  GI+LLVVYVDDIVITGND  GI  LKT +  +F+TKDLG+LKYFLGIEV RSKKG++LSQ
Subjt:  LYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGILSLKTLLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQ

Query:  RKYVLDLLSETGKLGAKPSGTPMMPNQQLVKE
        RKYVLDLL ETGK+ AKP  TPM+PN QL+ +
Subjt:  RKYVLDLLSETGKLGAKPSGTPMMPNQQLVKE

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.7e-4934.88Show/hide
Query:  DPAPSDDLPIALRKGKRKCTYPISSFIFYHQLSPSTYAFITSLES--TSIPNSVHEAL---SHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWV
        +P  +D + I  R+ +R  T P    I Y++   S    + +  +    +PNS  E         W+ A+  E+ A   N TW +  RP  K  +  +WV
Subjt:  DPAPSDDLPIALRKGKRKCTYPISSFIFYHQLSPSTYAFITSLES--TSIPNSVHEAL---SHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWV

Query:  FAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVSKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSL
        F+VK N  G   R KARLVA+G+ Q Y  DY +TF+PV++++S R  LS+       +HQ+D+K AFL+G L+EE+YM  P G      SD VC+L K++
Subjt:  FAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVSKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSL

Query:  YGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFY--RRSEKGIVLLVVYVDDIVITGNDALGILSLKTLLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLS
        YGLKQ+ R WF  F QAL       S+ D  ++   + +    + +++YVDD+VI   D   + + K  L  +F   DL ++K+F+GI +   +  IYLS
Subjt:  YGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFY--RRSEKGIVLLVVYVDDIVITGNDALGILSLKTLLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLS

Query:  QRKYVLDLLSETGKLGAKPSGTPM
        Q  YV  +LS+          TP+
Subjt:  QRKYVLDLLSETGKLGAKPSGTPM

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.8e-6245.42Show/hide
Query:  PNSVHEALSHP---GWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVSKLTSIRLFLSM
        P S+ E LSHP       AM EEM +L  NGT+ LV  P GK+ + CKWVF +K + D  + R KARLV KG+ Q  G D+ + FSPV K+TSIR  LS+
Subjt:  PNSVHEALSHP---GWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVSKLTSIRLFLSM

Query:  AATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRR-SEKGIVLLVVYVD
        AA+    + QLD+K AFLHGDL+EE+YMEQP GF   G+   VC+L KSLYGLKQ+PR W+ KF   +      K+ SD  V+++R SE   ++L++YVD
Subjt:  AATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRR-SEKGIVLLVVYVD

Query:  DIVITGNDALGILSLKTLLQGQFYTKDLGQLKYFLGIEVMRSK--KGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKE
        D++I G D   I  LK  L   F  KDLG  +  LG++++R +  + ++LSQ KY+  +L       AKP  TP+  + +L K+
Subjt:  DIVITGNDALGILSLKTLLQGQFYTKDLGQLKYFLGIEVMRSK--KGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLVKE

P92520 Uncharacterized mitochondrial protein AtMg008202.2e-2045.3Show/hide
Query:  HQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDT
        ++L+P  Y+   +      P SV  AL  PGW  AM EE+ AL  N TW LV  P  +  +GCKWVF  K++ DGT+ RLKARLVAKG+ Q  G  + +T
Subjt:  HQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDT

Query:  FSPVSKLTSIRLFLSMA
        +SPV +  +IR  L++A
Subjt:  FSPVSKLTSIRLFLSMA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.5e-7248.61Show/hide
Query:  YAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAI-GCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVSK
        Y+   SL + S P +  +AL    W+NAM  E+ A   N TWDLV  P     I GC+W+F  K N DG++ R KARLVAKGY Q  G DY++TFSPV K
Subjt:  YAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAI-GCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVSK

Query:  LTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKG
         TSIR+ L +A    W + QLD+ NAFL G L ++VYM QPPGF+ +   + VC+LRK+LYGLKQ+PRAW+ +    L+  G   S SD S+F  +  K 
Subjt:  LTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKG

Query:  IVLLVVYVDDIVITGNDALGILSLKTLLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQL
        IV ++VYVDDI+ITGND   + +    L  +F  KD  +L YFLGIE  R   G++LSQR+Y+LDLL+ T  + AKP  TPM P+ +L
Subjt:  IVLLVVYVDDIVITGNDALGILSLKTLLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.2e-7148.26Show/hide
Query:  YAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAI-GCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVSK
        Y++ TSL + S P +  +A+    W+ AM  E+ A   N TWDLV  P     I GC+W+F  K N DG++ R KARLVAKGY Q  G DY++TFSPV K
Subjt:  YAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAI-GCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDTFSPVSK

Query:  LTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKG
         TSIR+ L +A    W + QLD+ NAFL G L +EVYM QPPGFV +   D VCRLRK++YGLKQ+PRAW+ +    L+  G   S SD S+F  +  + 
Subjt:  LTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALVCFGMKKSTSDHSVFYRRSEKG

Query:  IVLLVVYVDDIVITGNDALGILSLKTLLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQL
        I+ ++VYVDDI+ITGND + +      L  +F  K+   L YFLGIE  R  +G++LSQR+Y LDLL+ T  L AKP  TPM  + +L
Subjt:  IVLLVVYVDDIVITGNDALGILSLKTLLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.1e-8049.01Show/hide
Query:  YPISSFIFYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQ
        + IS F+ Y ++SP  ++F+  +     P++ +EA     W  AM +E+ A++   TW++ + P  KK IGCKWV+ +K N DGT+ R KARLVAKGY Q
Subjt:  YPISSFIFYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQ

Query:  IYGTDYSDTFSPVSKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVA-QGES---DKVCRLRKSLYGLKQSPRAWFGKFSQALVCF
          G D+ +TFSPV KLTS++L L+++A   ++LHQLDI NAFL+GDL EE+YM+ PPG+ A QG+S   + VC L+KS+YGLKQ+ R WF KFS  L+ F
Subjt:  IYGTDYSDTFSPVSKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVA-QGES---DKVCRLRKSLYGLKQSPRAWFGKFSQALVCF

Query:  GMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGILSLKTLLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTP
        G  +S SDH+ F + +    + ++VYVDDI+I  N+   +  LK+ L+  F  +DLG LKYFLG+E+ RS  GI + QRKY LDLL ETG LG KPS  P
Subjt:  GMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGILSLKTLLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTP

Query:  MMPN
        M P+
Subjt:  MMPN

ATMG00810.1 DNA/RNA polymerases superfamily protein5.6e-1141.77Show/hide
Query:  LVVYVDDIVITGNDALGILSLKTLLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPM
        L++YVDDI++TG+    +  L   L   F  KDLG + YFLGI++     G++LSQ KY   +L+  G L  KP  TP+
Subjt:  LVVYVDDIVITGNDALGILSLKTLLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPM

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.6e-2145.3Show/hide
Query:  HQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDT
        ++L+P  Y+   +      P SV  AL  PGW  AM EE+ AL  N TW LV  P  +  +GCKWVF  K++ DGT+ RLKARLVAKG+ Q  G  + +T
Subjt:  HQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDGTVARLKARLVAKGYAQIYGTDYSDT

Query:  FSPVSKLTSIRLFLSMA
        +SPV +  +IR  L++A
Subjt:  FSPVSKLTSIRLFLSMA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTCCTTCATCATGTGATCCAGCGCCAAGTGATGATCTTCCCATTGCTCTTCGCAAAGGTAAACGCAAGTGTACTTACCCCATTTCTTCCTTTATTTTCTATCACCA
GTTATCTCCCTCCACATATGCGTTTATTACGTCTCTTGAGTCCACATCTATTCCTAACTCTGTTCATGAAGCTTTGTCTCATCCTGGCTGGCAAAATGCAATGATTGAGG
AGATGACTGCTTTAGATGATAATGGTACTTGGGATTTGGTATCTCGTCCTGCAGGAAAGAAGGCCATTGGTTGTAAATGGGTGTTTGCTGTCAAGATGAATCCTGATGGA
ACAGTGGCTCGTTTAAAGGCTCGCCTTGTTGCCAAAGGTTATGCTCAAATCTATGGCACTGATTATTCAGATACATTCTCTCCGGTTTCCAAGTTAACTTCCATTCGCCT
ATTTCTTTCAATGGCTGCTACCAATAAATGGTCGTTGCATCAACTTGACATTAAGAATGCTTTTCTTCACGGTGATCTTCAAGAGGAAGTTTATATGGAACAACCACCAG
GGTTTGTTGCTCAGGGGGAGAGTGATAAAGTATGTCGCCTTCGAAAATCTCTGTATGGTTTGAAACAGAGTCCTCGTGCGTGGTTTGGTAAGTTTAGTCAAGCCCTTGTA
TGCTTTGGTATGAAGAAGAGTACATCTGATCATTCAGTTTTCTATCGCCGATCTGAGAAGGGTATAGTTCTACTAGTTGTATATGTTGATGATATTGTTATTACTGGAAA
TGATGCATTGGGTATTTTGTCTCTCAAAACTTTACTTCAGGGTCAGTTTTATACAAAAGATTTGGGCCAATTGAAATATTTTTTGGGCATTGAAGTGATGAGAAGCAAGA
AAGGTATTTATTTGTCTCAACGAAAATATGTACTTGATTTGTTGTCTGAAACAGGAAAATTAGGCGCCAAACCAAGTGGCACTCCAATGATGCCAAATCAGCAACTTGTT
AAAGAAGGAGAATTATGA
mRNA sequenceShow/hide mRNA sequence
ATGCTTCCTTCATCATGTGATCCAGCGCCAAGTGATGATCTTCCCATTGCTCTTCGCAAAGGTAAACGCAAGTGTACTTACCCCATTTCTTCCTTTATTTTCTATCACCA
GTTATCTCCCTCCACATATGCGTTTATTACGTCTCTTGAGTCCACATCTATTCCTAACTCTGTTCATGAAGCTTTGTCTCATCCTGGCTGGCAAAATGCAATGATTGAGG
AGATGACTGCTTTAGATGATAATGGTACTTGGGATTTGGTATCTCGTCCTGCAGGAAAGAAGGCCATTGGTTGTAAATGGGTGTTTGCTGTCAAGATGAATCCTGATGGA
ACAGTGGCTCGTTTAAAGGCTCGCCTTGTTGCCAAAGGTTATGCTCAAATCTATGGCACTGATTATTCAGATACATTCTCTCCGGTTTCCAAGTTAACTTCCATTCGCCT
ATTTCTTTCAATGGCTGCTACCAATAAATGGTCGTTGCATCAACTTGACATTAAGAATGCTTTTCTTCACGGTGATCTTCAAGAGGAAGTTTATATGGAACAACCACCAG
GGTTTGTTGCTCAGGGGGAGAGTGATAAAGTATGTCGCCTTCGAAAATCTCTGTATGGTTTGAAACAGAGTCCTCGTGCGTGGTTTGGTAAGTTTAGTCAAGCCCTTGTA
TGCTTTGGTATGAAGAAGAGTACATCTGATCATTCAGTTTTCTATCGCCGATCTGAGAAGGGTATAGTTCTACTAGTTGTATATGTTGATGATATTGTTATTACTGGAAA
TGATGCATTGGGTATTTTGTCTCTCAAAACTTTACTTCAGGGTCAGTTTTATACAAAAGATTTGGGCCAATTGAAATATTTTTTGGGCATTGAAGTGATGAGAAGCAAGA
AAGGTATTTATTTGTCTCAACGAAAATATGTACTTGATTTGTTGTCTGAAACAGGAAAATTAGGCGCCAAACCAAGTGGCACTCCAATGATGCCAAATCAGCAACTTGTT
AAAGAAGGAGAATTATGA
Protein sequenceShow/hide protein sequence
MLPSSCDPAPSDDLPIALRKGKRKCTYPISSFIFYHQLSPSTYAFITSLESTSIPNSVHEALSHPGWQNAMIEEMTALDDNGTWDLVSRPAGKKAIGCKWVFAVKMNPDG
TVARLKARLVAKGYAQIYGTDYSDTFSPVSKLTSIRLFLSMAATNKWSLHQLDIKNAFLHGDLQEEVYMEQPPGFVAQGESDKVCRLRKSLYGLKQSPRAWFGKFSQALV
CFGMKKSTSDHSVFYRRSEKGIVLLVVYVDDIVITGNDALGILSLKTLLQGQFYTKDLGQLKYFLGIEVMRSKKGIYLSQRKYVLDLLSETGKLGAKPSGTPMMPNQQLV
KEGEL