CuGenDBv2

CSPI06G15200 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI06G15200
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Ty3/gypsy retrotransposon protein
Genome location: Chr6:13364860..13366213
RNA-Seq Expression: CSPI06G15200
Synteny: CSPI06G15200
Gene Ontology terms:
GO:0006796 - phosphate-containing compound metabolic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016853 - isomerase activity (molecular function)
InterPro domains:
IPR021109 - Aspartic peptidase domain superfamily
IPR043502 - DNA/RNA polymerase superfamily
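
The genome location in this record is written as chromosome:start..end. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention), the string can be split into its parts and the genomic span computed. The function name parse_location is hypothetical and chosen only for illustration.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Chr6:13364860..13366213' style string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("Chr6:13364860..13366213")

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_length = end - start + 1
print(chrom, start, end, span_length)
```

Under that assumption the span covers 1354 positions on Chr6.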


Homology
GenBank top hits (e value, % identity, alignment)
KAA0044242.1 Transposon Tf2-9 polyprotein [Cucumis melo var. makuwa] (e value: 4.7e-150; % identity: 63.27)
Query:  MNALFPWIKAEVVFCKPVGLAEMMHAAQLVENREIIRKDEPKQLCQRTVTLRTTVGEVKKEGPTKR-HNAKFQARKEKGLCFRCNEKYFHGHRCKGKEQR
        MN L PW+++EVVFC+P GLAEMM AAQ+VENREI+R +       +         E ++EG  KR  +A+FQARKEKGLCFRCNEKY   H+C+ KEQR
Subjt:  MNALFPWIKAEVVFCKPVGLAEMMHAAQLVENREIIRKDEPKQLCQRTVTLRTTVGEVKKEGPTKR-HNAKFQARKEKGLCFRCNEKYFHGHRCKGKEQR

Query:  ELRMYVVKE-DEEYEIVEEAEWDETELNCVEINPKDQAIVELSINLVVGLTNPGTMKVRGKIKDREVIILIDCGATHNFISDKVVQELSLPTKTTSHYGV
        ELRM+V+ E  +EYEIVEE E +E +L  +E+N     +VELSIN VVGL +PGTMKVRGK+   EVI+LIDCGATHNF+S+K+V++L LP + TSHYGV
Subjt:  ELRMYVVKE-DEEYEIVEEAEWDETELNCVEINPKDQAIVELSINLVVGLTNPGTMKVRGKIKDREVIILIDCGATHNFISDKVVQELSLPTKTTSHYGV

Query:  ILGSGAAVKGKGICEGIELELEGWKVEANFLPLELGGVDGVLGMQWLYSLGVTEVDWKNLTMTFLHNGKKVKIKGDPSLTKAMVGLKNMIKSWKDSDQGF
        ILGSGAAV+GKGICE +E++L GW++  +FLPLELGGVD +LGMQWLYSLGVT VDWKNL+++F+  GK+VKIKGDPSLTKA + LKNM+K+W++ D GF
Subjt:  ILGSGAAVKGKGICEGIELELEGWKVEANFLPLELGGVDGVLGMQWLYSLGVTEVDWKNLTMTFLHNGKKVKIKGDPSLTKAMVGLKNMIKSWKDSDQGF

Query:  LIECRAMETMYEPPEDNGIEEVLAVDEA-VSVVLKKFEDVFTWPETLPPRRSIEHHIYLKQGTDPVNVRPYRYGYQQKAEMERLVEEMLSSGVIRPSNSP
        LIECR+++      E+  +    AV +  +S V+K+++DVF WPE LPPRR IEHHI++K GT+P+NVRPYRYG+ QK EME+LV+EML SGVIRPS SP
Subjt:  LIECRAMETMYEPPEDNGIEEVLAVDEA-VSVVLKKFEDVFTWPETLPPRRSIEHHIYLKQGTDPVNVRPYRYGYQQKAEMERLVEEMLSSGVIRPSNSP

Query:  YSSPVLLVQKKDGSWRFCVDYR
        YSSPVLLV+KKDGSWRFCVDYR
Subjt:  YSSPVLLVQKKDGSWRFCVDYR

KAA0044875.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] (e value: 4.2e-151; % identity: 60.7)
Query:  MNALFPWIKAEVVFCKPVGLAEMMHAAQLVENREIIRKDEPKQ-------------------------------LCQRTVTLRTTV-GEVKKEGPTKR-H
        MN L PW+++EV FC+P  LAEMM AAQLVENREI R +                                      RT+TLR++V  E ++EG  KR  
Subjt:  MNALFPWIKAEVVFCKPVGLAEMMHAAQLVENREIIRKDEPKQ-------------------------------LCQRTVTLRTTV-GEVKKEGPTKR-H

Query:  NAKFQARKEKGLCFRCNEKYFHGHRCKGKEQRELRMYVV-KEDEEYEIVEEAEWDETELNCVEINPKDQAIVELSINLVVGLTNPGTMKVRGKIKDREVI
        +A+FQARKEKGLCFRCNEKY   H+CK +EQRELRM+VV  E +EYEIVEE E + TELNC+E+      +VELSIN VVGL +PGTMKVRGK+   EV+
Subjt:  NAKFQARKEKGLCFRCNEKYFHGHRCKGKEQRELRMYVV-KEDEEYEIVEEAEWDETELNCVEINPKDQAIVELSINLVVGLTNPGTMKVRGKIKDREVI

Query:  ILIDCGATHNFISDKVVQELSLPTKTTSHYGVILGSGAAVKGKGICEGIELELEGWKVEANFLPLELGGVDGVLGMQWLYSLGVTEVDWKNLTMTFLHNG
        ILIDCGATHNF+S+K+V++LSL  K TSHYGVILGSGAAV+GKG+CE +E+++ GWK+  +FLPLELGGVD +LGMQWLYSLGVT VDWKNL+MTF+ +G
Subjt:  ILIDCGATHNFISDKVVQELSLPTKTTSHYGVILGSGAAVKGKGICEGIELELEGWKVEANFLPLELGGVDGVLGMQWLYSLGVTEVDWKNLTMTFLHNG

Query:  KKVKIKGDPSLTKAMVGLKNMIKSWKDSDQGFLIECRAMETMYEPPEDNGIEEVLAV-----DEAVSVVLKKFEDVFTWPETLPPRRSIEHHIYLKQGTD
        K+VKIKGDPSLTKA + LK +IK+W+D D G+LIECR+++       +  +    AV     D ++S V+K+F+DVF WPE LPPRR IEHHI+LK+GT+
Subjt:  KKVKIKGDPSLTKAMVGLKNMIKSWKDSDQGFLIECRAMETMYEPPEDNGIEEVLAV-----DEAVSVVLKKFEDVFTWPETLPPRRSIEHHIYLKQGTD

Query:  PVNVRPYRYGYQQKAEMERLVEEMLSSGVIRPSNSPYSSPVLLVQKKDGSWRFCVDYR
        P+NVRPYRYG+ QK EME+LV+EML+SGVIRPS SPYSSPVLLV+KKDGSWRFCVDYR
Subjt:  PVNVRPYRYGYQQKAEMERLVEEMLSSGVIRPSNSPYSSPVLLVQKKDGSWRFCVDYR

TYK02195.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] (e value: 3.6e-150; % identity: 61.28)
Query:  MNALFPWIKAEVVFCKPVGLAEMMHAAQLVENREIIRKDE-------PKQLCQRTVTLRTTV------------------------GEVKKEGPTKR-HN
        MN LFPWI+AEV  C+P GLA+ M  AQLVENREI R +         K   Q TV  RT V                         E++K+G ++R  +
Subjt:  MNALFPWIKAEVVFCKPVGLAEMMHAAQLVENREIIRKDE-------PKQLCQRTVTLRTTV------------------------GEVKKEGPTKR-HN

Query:  AKFQARKEKGLCFRCNEKYFHGHRCKGKEQRELRMYVV-KEDEEYEIVEEAEWDETELNCVEINPKDQAIVELSINLVVGLTNPGTMKVRGKIKDREVII
        A+FQARKEKGLCFRCNEKY   HRCK KE REL+M+VV KE EEYEI+EE   +E  L   +   K +   ELS+N VVGL +PGTMKV+GKI++REVII
Subjt:  AKFQARKEKGLCFRCNEKYFHGHRCKGKEQRELRMYVV-KEDEEYEIVEEAEWDETELNCVEINPKDQAIVELSINLVVGLTNPGTMKVRGKIKDREVII

Query:  LIDCGATHNFISDKVVQELSLPTKTTSHYGVILGSGAAVKGKGICEGIELELEGWKVEANFLPLELGGVDGVLGMQWLYSLGVTEVDWKNLTMTFLHNGK
        LIDCGATHNFIS+K+V+ L LP K T HYGVILGSG AV+GKGICE +E++L  WKV+  FLPLELGGVD VLGMQWL+SLG+T VDWKNLT+TF   GK
Subjt:  LIDCGATHNFISDKVVQELSLPTKTTSHYGVILGSGAAVKGKGICEGIELELEGWKVEANFLPLELGGVDGVLGMQWLYSLGVTEVDWKNLTMTFLHNGK

Query:  KVKIKGDPSLTKAMVGLKNMIKSWKDSDQGFLIECRAMETMYEPPEDNGIEEVLAVDEAVSVVLKKFEDVFTWPETLPPRRSIEHHIYLKQGTDPVNVRP
        ++ IKGDPSLTK+ + LK+MIK+W + D+GFLIECRA++   E  + +     +  DE +  VLK+FEDVF WPE LPPRR IEH I+LK+GT+P+NVRP
Subjt:  KVKIKGDPSLTKAMVGLKNMIKSWKDSDQGFLIECRAMETMYEPPEDNGIEEVLAVDEAVSVVLKKFEDVFTWPETLPPRRSIEHHIYLKQGTDPVNVRP

Query:  YRYGYQQKAEMERLVEEMLSSGVIRPSNSPYSSPVLLVQKKDGSWRFCVDYR
        YRYG+QQKAEMERLVEEML+SG+IRPSNSP+SSPVLLV+KKDGSWRFCVDYR
Subjt:  YRYGYQQKAEMERLVEEMLSSGVIRPSNSPYSSPVLLVQKKDGSWRFCVDYR

TYK03866.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] (e value: 2.5e-151; % identity: 60.35)
Query:  MNALFPWIKAEVVFCKPVGLAEMMHAAQLVENREIIRKDEP-------------------------------KQLCQRTVTLRTTV-GEVKKEGPTKR-H
        MN L PW+++EVVFC+P GLAEMM AAQ+VENRE++R +                                      RT+TLR+    E ++EG  KR  
Subjt:  MNALFPWIKAEVVFCKPVGLAEMMHAAQLVENREIIRKDEP-------------------------------KQLCQRTVTLRTTV-GEVKKEGPTKR-H

Query:  NAKFQARKEKGLCFRCNEKYFHGHRCKGKEQRELRMYVVKE-DEEYEIVEEAEWDETELNCVEINPKDQAIVELSINLVVGLTNPGTMKVRGKIKDREVI
        +A+FQARKEKGLCFRCNEKY   H+C+ KEQRELRM+V+ E  +EYEIVEE E +E +L  +E+N +   +VELSIN VVGL +PGTMKVRGK+   EV+
Subjt:  NAKFQARKEKGLCFRCNEKYFHGHRCKGKEQRELRMYVVKE-DEEYEIVEEAEWDETELNCVEINPKDQAIVELSINLVVGLTNPGTMKVRGKIKDREVI

Query:  ILIDCGATHNFISDKVVQELSLPTKTTSHYGVILGSGAAVKGKGICEGIELELEGWKVEANFLPLELGGVDGVLGMQWLYSLGVTEVDWKNLTMTFLHNG
        +LIDCGATHNF+S+K+V++L LP K TSHYGVILGSGAAV+GKGICE +E++L GW++  +FLPLELGGVD +LGMQWLYSLGVT VDWKNL+++F+  G
Subjt:  ILIDCGATHNFISDKVVQELSLPTKTTSHYGVILGSGAAVKGKGICEGIELELEGWKVEANFLPLELGGVDGVLGMQWLYSLGVTEVDWKNLTMTFLHNG

Query:  KKVKIKGDPSLTKAMVGLKNMIKSWKDSDQGFLIECRAMETMYEPPEDNGIEEVLAVDEA-VSVVLKKFEDVFTWPETLPPRRSIEHHIYLKQGTDPVNV
        K+VKIKGDPSLTKA + LKNM+K W++ D GFLIECR+++      E++ +    AV +  +S V+K+++DVF WPE LPPRR IEHHI+LK GTDP+NV
Subjt:  KKVKIKGDPSLTKAMVGLKNMIKSWKDSDQGFLIECRAMETMYEPPEDNGIEEVLAVDEA-VSVVLKKFEDVFTWPETLPPRRSIEHHIYLKQGTDPVNV

Query:  RPYRYGYQQKAEMERLVEEMLSSGVIRPSNSPYSSPVLLVQKKDGSWRFCVDYR
        RPYRYG+QQK EME+LV+EML+SGVIRPS SPYSSPVLLV+KKDGSWRFCVDYR
Subjt:  RPYRYGYQQKAEMERLVEEMLSSGVIRPSNSPYSSPVLLVQKKDGSWRFCVDYR

XP_031745972.1 uncharacterized protein LOC116406393 [Cucumis sativus] (e value: 8.3e-224; % identity: 88.5)
Query:  MNALFPWIKAEVVFCKPVGLAEMMHAAQLVENREIIRKD----------------------------EPKQ---LCQRTVTLRTTVGEVKKEGPTKR-HN
        MN LFPWIKAEVVFCKPVGLAEMMHAAQLVENREIIRK+                            E K       RTVTLRT  GEVKKEGPTKR  +
Subjt:  MNALFPWIKAEVVFCKPVGLAEMMHAAQLVENREIIRKD----------------------------EPKQ---LCQRTVTLRTTVGEVKKEGPTKR-HN

Query:  AKFQARKEKGLCFRCNEKYFHGHRCKGKEQRELRMYVVKEDEEYEIVEEAEWDETELNCVEINPKDQAIVELSINLVVGLTNPGTMKVRGKIKDREVIIL
        A+FQARKEKGLCFRCNEKYFHGHRCKG+EQRELRMYVVKEDEEYEIVEEAEWDETELNCVEINP+DQAIVELSIN VVGLTNPGTMKVRGKIKDREVIIL
Subjt:  AKFQARKEKGLCFRCNEKYFHGHRCKGKEQRELRMYVVKEDEEYEIVEEAEWDETELNCVEINPKDQAIVELSINLVVGLTNPGTMKVRGKIKDREVIIL

Query:  IDCGATHNFISDKVVQELSLPTKTTSHYGVILGSGAAVKGKGICEGIELELEGWKVEANFLPLELGGVDGVLGMQWLYSLGVTEVDWKNLTMTFLHNGKK
        IDCGATHNFISDKVVQELSLPTKTTSHYGVILGS AAVKGKGICEGIELELEGWKVEANFLPLELGGVDGVL MQWLYSLGVTEVDWKNLTMTFLHNGKK
Subjt:  IDCGATHNFISDKVVQELSLPTKTTSHYGVILGSGAAVKGKGICEGIELELEGWKVEANFLPLELGGVDGVLGMQWLYSLGVTEVDWKNLTMTFLHNGKK

Query:  VKIKGDPSLTKAMVGLKNMIKSWKDSDQGFLIECRAMETMYEPPEDNGIEEVLAVDEAVSVVLKKFEDVFTWPETLPPRRSIEHHIYLKQGTDPVNVRPY
        VKIKGDPSLTKAMVGLKNMIKSW+DSDQGFLIECRAMETMYEPPEDNGIEEVLAVDEAVS VLKKFEDVFTWPETLPPRRSIEHHIYLKQGTDPVNVRPY
Subjt:  VKIKGDPSLTKAMVGLKNMIKSWKDSDQGFLIECRAMETMYEPPEDNGIEEVLAVDEAVSVVLKKFEDVFTWPETLPPRRSIEHHIYLKQGTDPVNVRPY

Query:  RYGYQQKAEMERLVEEMLSSGVIRPSNSPYSSPVLLVQKKDGSWRFCVDYRV
        RYGYQQKAEMERLVEEMLSSGVIRPSNSPYSSPVLLV+KKDGSWRFCVDYRV
Subjt:  RYGYQQKAEMERLVEEMLSSGVIRPSNSPYSSPVLLVQKKDGSWRFCVDYRV

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TAX4 Ty3/gypsy retrotransposon protein (e value: 3.0e-150; % identity: 59.83)
Query:  MNALFPWIKAEVVFCKPVGLAEMMHAAQLVENREIIRKDEPKQ-------------------------------LCQRTVTLRTTV-GEVKKEGPTKR-H
        MN L PW+++EV FC+P  LAEMM AAQ+VENREI R +                                      RT+TLR++V  E ++EG  KR  
Subjt:  MNALFPWIKAEVVFCKPVGLAEMMHAAQLVENREIIRKDEPKQ-------------------------------LCQRTVTLRTTV-GEVKKEGPTKR-H

Query:  NAKFQARKEKGLCFRCNEKYFHGHRCKGKEQRELRMYVVKED-EEYEIVEEAEWDETELNCVEINPKDQAIVELSINLVVGLTNPGTMKVRGKIKDREVI
        +A+FQARKEKGLCFRCNEKY   H+C+ +EQRELRM+VV +D +EYEIVEE E +  EL+C+E+      +VELSIN VVGL +PGTMKVRGK+   EV+
Subjt:  NAKFQARKEKGLCFRCNEKYFHGHRCKGKEQRELRMYVVKED-EEYEIVEEAEWDETELNCVEINPKDQAIVELSINLVVGLTNPGTMKVRGKIKDREVI

Query:  ILIDCGATHNFISDKVVQELSLPTKTTSHYGVILGSGAAVKGKGICEGIELELEGWKVEANFLPLELGGVDGVLGMQWLYSLGVTEVDWKNLTMTFLHNG
        +LIDCGATHNF+S+K+V++LSLP K TSHYGVILGSGAAV+GKG+CE +E+++ GWK+  +FLPLELGGVD +LGMQWLYSLGVT VDWKNL+MTF+ +G
Subjt:  ILIDCGATHNFISDKVVQELSLPTKTTSHYGVILGSGAAVKGKGICEGIELELEGWKVEANFLPLELGGVDGVLGMQWLYSLGVTEVDWKNLTMTFLHNG

Query:  KKVKIKGDPSLTKAMVGLKNMIKSWKDSDQGFLIECRAMETMYEPPEDNGIEEVLAVDEAV-----SVVLKKFEDVFTWPETLPPRRSIEHHIYLKQGTD
        K+VKIKGDPSLTKA + LK +IK+W+D D G+LIECR+++       ++ +    AV +AV     S V+++F DVF WPE LPPRR IEHHI+LK+GT+
Subjt:  KKVKIKGDPSLTKAMVGLKNMIKSWKDSDQGFLIECRAMETMYEPPEDNGIEEVLAVDEAV-----SVVLKKFEDVFTWPETLPPRRSIEHHIYLKQGTD

Query:  PVNVRPYRYGYQQKAEMERLVEEMLSSGVIRPSNSPYSSPVLLVQKKDGSWRFCVDYR
        P+NVRPYRYG+ QK EME+LV EML+SGVIRPS SPYSSPVLLV+KKDGSWRFCVDYR
Subjt:  PVNVRPYRYGYQQKAEMERLVEEMLSSGVIRPSNSPYSSPVLLVQKKDGSWRFCVDYR

A0A5A7TLN3 Transposon Tf2-9 polyprotein (e value: 2.3e-150; % identity: 63.27)
Query:  MNALFPWIKAEVVFCKPVGLAEMMHAAQLVENREIIRKDEPKQLCQRTVTLRTTVGEVKKEGPTKR-HNAKFQARKEKGLCFRCNEKYFHGHRCKGKEQR
        MN L PW+++EVVFC+P GLAEMM AAQ+VENREI+R +       +         E ++EG  KR  +A+FQARKEKGLCFRCNEKY   H+C+ KEQR
Subjt:  MNALFPWIKAEVVFCKPVGLAEMMHAAQLVENREIIRKDEPKQLCQRTVTLRTTVGEVKKEGPTKR-HNAKFQARKEKGLCFRCNEKYFHGHRCKGKEQR

Query:  ELRMYVVKE-DEEYEIVEEAEWDETELNCVEINPKDQAIVELSINLVVGLTNPGTMKVRGKIKDREVIILIDCGATHNFISDKVVQELSLPTKTTSHYGV
        ELRM+V+ E  +EYEIVEE E +E +L  +E+N     +VELSIN VVGL +PGTMKVRGK+   EVI+LIDCGATHNF+S+K+V++L LP + TSHYGV
Subjt:  ELRMYVVKE-DEEYEIVEEAEWDETELNCVEINPKDQAIVELSINLVVGLTNPGTMKVRGKIKDREVIILIDCGATHNFISDKVVQELSLPTKTTSHYGV

Query:  ILGSGAAVKGKGICEGIELELEGWKVEANFLPLELGGVDGVLGMQWLYSLGVTEVDWKNLTMTFLHNGKKVKIKGDPSLTKAMVGLKNMIKSWKDSDQGF
        ILGSGAAV+GKGICE +E++L GW++  +FLPLELGGVD +LGMQWLYSLGVT VDWKNL+++F+  GK+VKIKGDPSLTKA + LKNM+K+W++ D GF
Subjt:  ILGSGAAVKGKGICEGIELELEGWKVEANFLPLELGGVDGVLGMQWLYSLGVTEVDWKNLTMTFLHNGKKVKIKGDPSLTKAMVGLKNMIKSWKDSDQGF

Query:  LIECRAMETMYEPPEDNGIEEVLAVDEA-VSVVLKKFEDVFTWPETLPPRRSIEHHIYLKQGTDPVNVRPYRYGYQQKAEMERLVEEMLSSGVIRPSNSP
        LIECR+++      E+  +    AV +  +S V+K+++DVF WPE LPPRR IEHHI++K GT+P+NVRPYRYG+ QK EME+LV+EML SGVIRPS SP
Subjt:  LIECRAMETMYEPPEDNGIEEVLAVDEA-VSVVLKKFEDVFTWPETLPPRRSIEHHIYLKQGTDPVNVRPYRYGYQQKAEMERLVEEMLSSGVIRPSNSP

Query:  YSSPVLLVQKKDGSWRFCVDYR
        YSSPVLLV+KKDGSWRFCVDYR
Subjt:  YSSPVLLVQKKDGSWRFCVDYR

A0A5A7TU09 Glucose-6-phosphate 1-epimerase (e value: 2.0e-151; % identity: 60.7)
Query:  MNALFPWIKAEVVFCKPVGLAEMMHAAQLVENREIIRKDEPKQ-------------------------------LCQRTVTLRTTV-GEVKKEGPTKR-H
        MN L PW+++EV FC+P  LAEMM AAQLVENREI R +                                      RT+TLR++V  E ++EG  KR  
Subjt:  MNALFPWIKAEVVFCKPVGLAEMMHAAQLVENREIIRKDEPKQ-------------------------------LCQRTVTLRTTV-GEVKKEGPTKR-H

Query:  NAKFQARKEKGLCFRCNEKYFHGHRCKGKEQRELRMYVV-KEDEEYEIVEEAEWDETELNCVEINPKDQAIVELSINLVVGLTNPGTMKVRGKIKDREVI
        +A+FQARKEKGLCFRCNEKY   H+CK +EQRELRM+VV  E +EYEIVEE E + TELNC+E+      +VELSIN VVGL +PGTMKVRGK+   EV+
Subjt:  NAKFQARKEKGLCFRCNEKYFHGHRCKGKEQRELRMYVV-KEDEEYEIVEEAEWDETELNCVEINPKDQAIVELSINLVVGLTNPGTMKVRGKIKDREVI

Query:  ILIDCGATHNFISDKVVQELSLPTKTTSHYGVILGSGAAVKGKGICEGIELELEGWKVEANFLPLELGGVDGVLGMQWLYSLGVTEVDWKNLTMTFLHNG
        ILIDCGATHNF+S+K+V++LSL  K TSHYGVILGSGAAV+GKG+CE +E+++ GWK+  +FLPLELGGVD +LGMQWLYSLGVT VDWKNL+MTF+ +G
Subjt:  ILIDCGATHNFISDKVVQELSLPTKTTSHYGVILGSGAAVKGKGICEGIELELEGWKVEANFLPLELGGVDGVLGMQWLYSLGVTEVDWKNLTMTFLHNG

Query:  KKVKIKGDPSLTKAMVGLKNMIKSWKDSDQGFLIECRAMETMYEPPEDNGIEEVLAV-----DEAVSVVLKKFEDVFTWPETLPPRRSIEHHIYLKQGTD
        K+VKIKGDPSLTKA + LK +IK+W+D D G+LIECR+++       +  +    AV     D ++S V+K+F+DVF WPE LPPRR IEHHI+LK+GT+
Subjt:  KKVKIKGDPSLTKAMVGLKNMIKSWKDSDQGFLIECRAMETMYEPPEDNGIEEVLAV-----DEAVSVVLKKFEDVFTWPETLPPRRSIEHHIYLKQGTD

Query:  PVNVRPYRYGYQQKAEMERLVEEMLSSGVIRPSNSPYSSPVLLVQKKDGSWRFCVDYR
        P+NVRPYRYG+ QK EME+LV+EML+SGVIRPS SPYSSPVLLV+KKDGSWRFCVDYR
Subjt:  PVNVRPYRYGYQQKAEMERLVEEMLSSGVIRPSNSPYSSPVLLVQKKDGSWRFCVDYR

A0A5D3BSP2 Ty3/gypsy retrotransposon protein (e value: 1.7e-150; % identity: 61.28)
Query:  MNALFPWIKAEVVFCKPVGLAEMMHAAQLVENREIIRKDE-------PKQLCQRTVTLRTTV------------------------GEVKKEGPTKR-HN
        MN LFPWI+AEV  C+P GLA+ M  AQLVENREI R +         K   Q TV  RT V                         E++K+G ++R  +
Subjt:  MNALFPWIKAEVVFCKPVGLAEMMHAAQLVENREIIRKDE-------PKQLCQRTVTLRTTV------------------------GEVKKEGPTKR-HN

Query:  AKFQARKEKGLCFRCNEKYFHGHRCKGKEQRELRMYVV-KEDEEYEIVEEAEWDETELNCVEINPKDQAIVELSINLVVGLTNPGTMKVRGKIKDREVII
        A+FQARKEKGLCFRCNEKY   HRCK KE REL+M+VV KE EEYEI+EE   +E  L   +   K +   ELS+N VVGL +PGTMKV+GKI++REVII
Subjt:  AKFQARKEKGLCFRCNEKYFHGHRCKGKEQRELRMYVV-KEDEEYEIVEEAEWDETELNCVEINPKDQAIVELSINLVVGLTNPGTMKVRGKIKDREVII

Query:  LIDCGATHNFISDKVVQELSLPTKTTSHYGVILGSGAAVKGKGICEGIELELEGWKVEANFLPLELGGVDGVLGMQWLYSLGVTEVDWKNLTMTFLHNGK
        LIDCGATHNFIS+K+V+ L LP K T HYGVILGSG AV+GKGICE +E++L  WKV+  FLPLELGGVD VLGMQWL+SLG+T VDWKNLT+TF   GK
Subjt:  LIDCGATHNFISDKVVQELSLPTKTTSHYGVILGSGAAVKGKGICEGIELELEGWKVEANFLPLELGGVDGVLGMQWLYSLGVTEVDWKNLTMTFLHNGK

Query:  KVKIKGDPSLTKAMVGLKNMIKSWKDSDQGFLIECRAMETMYEPPEDNGIEEVLAVDEAVSVVLKKFEDVFTWPETLPPRRSIEHHIYLKQGTDPVNVRP
        ++ IKGDPSLTK+ + LK+MIK+W + D+GFLIECRA++   E  + +     +  DE +  VLK+FEDVF WPE LPPRR IEH I+LK+GT+P+NVRP
Subjt:  KVKIKGDPSLTKAMVGLKNMIKSWKDSDQGFLIECRAMETMYEPPEDNGIEEVLAVDEAVSVVLKKFEDVFTWPETLPPRRSIEHHIYLKQGTDPVNVRP

Query:  YRYGYQQKAEMERLVEEMLSSGVIRPSNSPYSSPVLLVQKKDGSWRFCVDYR
        YRYG+QQKAEMERLVEEML+SG+IRPSNSP+SSPVLLV+KKDGSWRFCVDYR
Subjt:  YRYGYQQKAEMERLVEEMLSSGVIRPSNSPYSSPVLLVQKKDGSWRFCVDYR

A0A5D3C091 Ty3/gypsy retrotransposon protein (e value: 1.2e-151; % identity: 60.35)
Query:  MNALFPWIKAEVVFCKPVGLAEMMHAAQLVENREIIRKDEP-------------------------------KQLCQRTVTLRTTV-GEVKKEGPTKR-H
        MN L PW+++EVVFC+P GLAEMM AAQ+VENRE++R +                                      RT+TLR+    E ++EG  KR  
Subjt:  MNALFPWIKAEVVFCKPVGLAEMMHAAQLVENREIIRKDEP-------------------------------KQLCQRTVTLRTTV-GEVKKEGPTKR-H

Query:  NAKFQARKEKGLCFRCNEKYFHGHRCKGKEQRELRMYVVKE-DEEYEIVEEAEWDETELNCVEINPKDQAIVELSINLVVGLTNPGTMKVRGKIKDREVI
        +A+FQARKEKGLCFRCNEKY   H+C+ KEQRELRM+V+ E  +EYEIVEE E +E +L  +E+N +   +VELSIN VVGL +PGTMKVRGK+   EV+
Subjt:  NAKFQARKEKGLCFRCNEKYFHGHRCKGKEQRELRMYVVKE-DEEYEIVEEAEWDETELNCVEINPKDQAIVELSINLVVGLTNPGTMKVRGKIKDREVI

Query:  ILIDCGATHNFISDKVVQELSLPTKTTSHYGVILGSGAAVKGKGICEGIELELEGWKVEANFLPLELGGVDGVLGMQWLYSLGVTEVDWKNLTMTFLHNG
        +LIDCGATHNF+S+K+V++L LP K TSHYGVILGSGAAV+GKGICE +E++L GW++  +FLPLELGGVD +LGMQWLYSLGVT VDWKNL+++F+  G
Subjt:  ILIDCGATHNFISDKVVQELSLPTKTTSHYGVILGSGAAVKGKGICEGIELELEGWKVEANFLPLELGGVDGVLGMQWLYSLGVTEVDWKNLTMTFLHNG

Query:  KKVKIKGDPSLTKAMVGLKNMIKSWKDSDQGFLIECRAMETMYEPPEDNGIEEVLAVDEA-VSVVLKKFEDVFTWPETLPPRRSIEHHIYLKQGTDPVNV
        K+VKIKGDPSLTKA + LKNM+K W++ D GFLIECR+++      E++ +    AV +  +S V+K+++DVF WPE LPPRR IEHHI+LK GTDP+NV
Subjt:  KKVKIKGDPSLTKAMVGLKNMIKSWKDSDQGFLIECRAMETMYEPPEDNGIEEVLAVDEA-VSVVLKKFEDVFTWPETLPPRRSIEHHIYLKQGTDPVNV

Query:  RPYRYGYQQKAEMERLVEEMLSSGVIRPSNSPYSSPVLLVQKKDGSWRFCVDYR
        RPYRYG+QQK EME+LV+EML+SGVIRPS SPYSSPVLLV+KKDGSWRFCVDYR
Subjt:  RPYRYGYQQKAEMERLVEEMLSSGVIRPSNSPYSSPVLLVQKKDGSWRFCVDYR

SwissProt top hits (e value, % identity, alignment)
P03364 Gag-Pro-Pol polyprotein (e value: 7.1e-08; % identity: 30.39)
Query:  VDEAVSVVLKKFEDVFTWPETLPPRRSIEHHI-----YLKQGTDPVNVRPYRYGYQQKAEMERLVEEMLSSGVIRPSNSPYSSPVLLVQKKDGSWRFCVD
        + + +++  KK +    + + LP  R+I+  +        + TDPV V  +   Y++      LV+E L++G I P+NSP+++P+ +++KK GSWR   D
Subjt:  VDEAVSVVLKKFEDVFTWPETLPPRRSIEHHI-----YLKQGTDPVNVRPYRYGYQQKAEMERLVEEMLSSGVIRPSNSPYSSPVLLVQKKDGSWRFCVD

Query:  YR
         R
Subjt:  YR

P03555 Enzymatic polyprotein (e value: 1.9e-08; % identity: 21.38)
Query:  ELSINLVVGLTNPGTMKVRGKI-----KDREVIILIDCGATHNFISDKVVQELSLPTKTTSHYGVILGSGAAVKGKGICEGIELELEGWKVEANFLPLEL
        +  I  V+ +TNP ++ ++G++     K  E+   +D GA+    S  V+ E            V +  G+++    +C+ I+L + G   +   +  + 
Subjt:  ELSINLVVGLTNPGTMKVRGKI-----KDREVIILIDCGATHNFISDKVVQELSLPTKTTSHYGVILGSGAAVKGKGICEGIELELEGWKVEANFLPLEL

Query:  GGVDGVLGMQWLYSLGVTEVDWKNLTMTFLHNGKKVKIKGDPSLTKAM-VGLKNMIKSWKDSDQGFLIECRAMET--MYEPPEDNGI--------EEVLA
         G+D ++G  +   L    + + +  +   +    V I     LT+A+ VG++  ++S K   +    E   + T  +  P E+  I        EE L 
Subjt:  GGVDGVLGMQWLYSLGVTEVDWKNLTMTFLHNGKKVKIKGDPSLTKAM-VGLKNMIKSWKDSDQGFLIECRAMET--MYEPPEDNGI--------EEVLA

Query:  VDEAVSVVLKKFEDVFTWPETLPPRRS---IEHHIYLKQGTDPVNVRPYRYGYQQKAEMERLVEEMLSSGVIRPSNSPYSSPVLLV----QKKDGSWRFC
        + +     +++  +       L P ++   ++  I L   +  + V+P +Y    + E ++ ++E+L   VI+PS SP+ +P  LV    +K+ G  R  
Subjt:  VDEAVSVVLKKFEDVFTWPETLPPRRS---IEHHIYLKQGTDPVNVRPYRYGYQQKAEMERLVEEMLSSGVIRPSNSPYSSPVLLV----QKKDGSWRFC

Query:  VDYR
        V+Y+
Subjt:  VDYR

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein (e value: 5.2e-11; % identity: 41.46)
Query:  LPPRRS------IEHHIYLKQGTDPVNVRPYRYGYQQKAEMERLVEEMLSSGVIRPSNSPYSSPVLLVQKKDGSWRFCVDYR
        LPPR +      ++H I +K G     ++PY    + + E+ ++V+++L +  I PS SP SSPV+LV KKDG++R CVDYR
Subjt:  LPPRRS------IEHHIYLKQGTDPVNVRPYRYGYQQKAEMERLVEEMLSSGVIRPSNSPYSSPVLLVQKKDGSWRFCVDYR

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus (e value: 3.2e-08; % identity: 40.62)
Query:  DPVNVRPYRYGYQQKAEMERLVEEMLSSGVIRPSNSPYSSPVLLVQKK-----DGSWRFCVDYR
        DP+  + Y Y    + E+ER ++E+L  G+IRPSNSPY+SP+ +V KK     +  +R  VD++
Subjt:  DPVNVRPYRYGYQQKAEMERLVEEMLSSGVIRPSNSPYSSPVLLVQKK-----DGSWRFCVDYR

Q99315 Transposon Ty3-G Gag-Pol polyprotein (e value: 5.2e-11; % identity: 41.46)
Query:  LPPRRS------IEHHIYLKQGTDPVNVRPYRYGYQQKAEMERLVEEMLSSGVIRPSNSPYSSPVLLVQKKDGSWRFCVDYR
        LPPR +      ++H I +K G     ++PY    + + E+ ++V+++L +  I PS SP SSPV+LV KKDG++R CVDYR
Subjt:  LPPRRS------IEHHIYLKQGTDPVNVRPYRYGYQQKAEMERLVEEMLSSGVIRPSNSPYSSPVLLVQKKDGSWRFCVDYR

Arabidopsis top hits (e value, % identity, alignment)
AT3G29750.1 Eukaryotic aspartyl protease family protein (e value: 5.3e-19; % identity: 40.32)
Query:  LVVGLTNPGTMKVRGKIKDREVIILIDCGATHNFISDKVVQELSLPTKTTSHYGVILGSGAAVKGKGICEGIELELEGWKVEANFLPLELG--GVDGVLG
        LV+ LT    M+  G I D +V++ ID GAT NFI  ++   L LPT  T+   V+LG    ++  G C GI L ++  ++  NFL L+L    VD +LG
Subjt:  LVVGLTNPGTMKVRGKIKDREVIILIDCGATHNFISDKVVQELSLPTKTTSHYGVILGSGAAVKGKGICEGIELELEGWKVEANFLPLELG--GVDGVLG

Query:  MQWLYSLGVTEVDWKNLTMTFLHN
         +WL  LG T V+W+N   +F HN
Subjt:  MQWLYSLGVTEVDWKNLTMTFLHN

AT3G30770.1 Eukaryotic aspartyl protease family protein (e value: 1.2e-13; % identity: 36.67)
Query:  MKVRGKIKDREVIILIDCGATHNFISDKVVQELSLPTKTTSHYGVILGSGAAVKGKGICEGIELELEGWKVEANFLPLEL--GGVDGVLGMQWLYSLGVT
        M+  G I   +V+++ID GAT+NFISD++   L LPT TT+   V+LG    ++  G C GI L ++  ++  NFL L+L    VD +LG     +L   
Subjt:  MKVRGKIKDREVIILIDCGATHNFISDKVVQELSLPTKTTSHYGVILGSGAAVKGKGICEGIELELEGWKVEANFLPLEL--GGVDGVLGMQWLYSLGVT

Query:  EVDWKNLTMTFLHNGKKVKI
         + W N   +F HN + V +
Subjt:  EVDWKNLTMTFLHNGKKVKI

AT3G42723.1 aminoacyl-tRNA ligases;ATP binding;nucleotide binding (e value: 2.8e-04; % identity: 35.59)
Query:  KGICEGIELELEGWKVEANFL--PLELGGVDGVLGMQWLYSLGVTEVDWKNLTMTFLHN
        K  C+ I L +    +  ++    L+   VD +LG +WL  LG TEV+W+N + +F+HN
Subjt:  KGICEGIELELEGWKVEANFL--PLELGGVDGVLGMQWLYSLGVTEVDWKNLTMTFLHN

ATMG00850.1 DNA/RNA polymerases superfamily protein (e value: 1.4e-06; % identity: 56.41)
Query:  QKAEMERLVEEMLSSGVIRPSNSPYSSPVLLVQKKDGSW
        ++  ++  + EML + +I+PS SPYSSPVLLVQKKDG W
Subjt:  QKAEMERLVEEMLSSGVIRPSNSPYSSPVLLVQKKDGSW


Sequences
CDS sequence
ATGAATGCACTGTTTCCGTGGATCAAAGCTGAGGTAGTTTTTTGTAAACCGGTGGGTTTGGCCGAGATGATGCACGCAGCTCAACTCGTGGAGAACCGGGAGATCATTCG
CAAGGATGAACCTAAACAGTTATGCCAAAGGACGGTGACATTAAGAACAACGGTTGGAGAAGTTAAGAAAGAAGGCCCGACGAAGAGACACAATGCAAAATTTCAAGCCC
GAAAAGAGAAGGGACTTTGCTTTCGCTGTAATGAAAAGTACTTTCATGGACATCGATGTAAAGGAAAGGAGCAAAGGGAACTAAGAATGTATGTGGTAAAGGAGGATGAG
GAATATGAAATTGTGGAGGAAGCAGAATGGGATGAGACAGAATTGAACTGTGTCGAGATAAATCCAAAAGATCAGGCTATCGTTGAACTTTCCATAAATTTAGTGGTCGG
GCTGACGAATCCCGGAACGATGAAGGTGAGAGGAAAGATTAAGGATAGAGAAGTAATTATCCTAATAGATTGTGGAGCAACCCATAACTTCATCTCGGACAAGGTGGTGC
AGGAACTGAGTTTACCGACGAAGACTACCTCACACTATGGAGTGATTTTGGGCTCGGGTGCAGCTGTGAAGGGGAAAGGAATTTGTGAAGGCATCGAGCTAGAACTGGAG
GGATGGAAAGTGGAAGCAAACTTCTTACCACTTGAGTTGGGAGGAGTAGATGGAGTACTAGGAATGCAGTGGTTGTATTCATTGGGCGTGACGGAAGTGGATTGGAAAAA
CTTGACCATGACTTTTCTACACAATGGGAAGAAGGTGAAAATCAAAGGAGACCCCAGCTTAACCAAAGCCATGGTTGGCCTCAAGAATATGATTAAGTCATGGAAGGATT
CTGACCAAGGATTCTTAATTGAGTGCCGAGCAATGGAGACAATGTATGAGCCCCCAGAAGATAATGGAATTGAGGAAGTACTAGCGGTGGACGAGGCAGTTTCAGTTGTC
CTGAAGAAATTCGAAGATGTTTTTACATGGCCGGAGACTCTACCTCCACGAAGAAGTATTGAGCATCATATCTATTTGAAACAAGGAACTGACCCGGTAAATGTGAGGCC
GTATCGCTATGGATACCAACAAAAGGCAGAGATGGAGAGATTGGTGGAAGAGATGCTGAGTTCAGGGGTTATTCGCCCAAGTAATAGCCCATATTCCAGCCCGGTTCTGT
TAGTACAGAAAAAGGATGGAAGCTGGAGATTCTGCGTAGATTATAGGGTCTAG
mRNA sequence
ATGAATGCACTGTTTCCGTGGATCAAAGCTGAGGTAGTTTTTTGTAAACCGGTGGGTTTGGCCGAGATGATGCACGCAGCTCAACTCGTGGAGAACCGGGAGATCATTCG
CAAGGATGAACCTAAACAGTTATGCCAAAGGACGGTGACATTAAGAACAACGGTTGGAGAAGTTAAGAAAGAAGGCCCGACGAAGAGACACAATGCAAAATTTCAAGCCC
GAAAAGAGAAGGGACTTTGCTTTCGCTGTAATGAAAAGTACTTTCATGGACATCGATGTAAAGGAAAGGAGCAAAGGGAACTAAGAATGTATGTGGTAAAGGAGGATGAG
GAATATGAAATTGTGGAGGAAGCAGAATGGGATGAGACAGAATTGAACTGTGTCGAGATAAATCCAAAAGATCAGGCTATCGTTGAACTTTCCATAAATTTAGTGGTCGG
GCTGACGAATCCCGGAACGATGAAGGTGAGAGGAAAGATTAAGGATAGAGAAGTAATTATCCTAATAGATTGTGGAGCAACCCATAACTTCATCTCGGACAAGGTGGTGC
AGGAACTGAGTTTACCGACGAAGACTACCTCACACTATGGAGTGATTTTGGGCTCGGGTGCAGCTGTGAAGGGGAAAGGAATTTGTGAAGGCATCGAGCTAGAACTGGAG
GGATGGAAAGTGGAAGCAAACTTCTTACCACTTGAGTTGGGAGGAGTAGATGGAGTACTAGGAATGCAGTGGTTGTATTCATTGGGCGTGACGGAAGTGGATTGGAAAAA
CTTGACCATGACTTTTCTACACAATGGGAAGAAGGTGAAAATCAAAGGAGACCCCAGCTTAACCAAAGCCATGGTTGGCCTCAAGAATATGATTAAGTCATGGAAGGATT
CTGACCAAGGATTCTTAATTGAGTGCCGAGCAATGGAGACAATGTATGAGCCCCCAGAAGATAATGGAATTGAGGAAGTACTAGCGGTGGACGAGGCAGTTTCAGTTGTC
CTGAAGAAATTCGAAGATGTTTTTACATGGCCGGAGACTCTACCTCCACGAAGAAGTATTGAGCATCATATCTATTTGAAACAAGGAACTGACCCGGTAAATGTGAGGCC
GTATCGCTATGGATACCAACAAAAGGCAGAGATGGAGAGATTGGTGGAAGAGATGCTGAGTTCAGGGGTTATTCGCCCAAGTAATAGCCCATATTCCAGCCCGGTTCTGT
TAGTACAGAAAAAGGATGGAAGCTGGAGATTCTGCGTAGATTATAGGGTCTAG
Protein sequence
MNALFPWIKAEVVFCKPVGLAEMMHAAQLVENREIIRKDEPKQLCQRTVTLRTTVGEVKKEGPTKRHNAKFQARKEKGLCFRCNEKYFHGHRCKGKEQRELRMYVVKEDE
EYEIVEEAEWDETELNCVEINPKDQAIVELSINLVVGLTNPGTMKVRGKIKDREVIILIDCGATHNFISDKVVQELSLPTKTTSHYGVILGSGAAVKGKGICEGIELELE
GWKVEANFLPLELGGVDGVLGMQWLYSLGVTEVDWKNLTMTFLHNGKKVKIKGDPSLTKAMVGLKNMIKSWKDSDQGFLIECRAMETMYEPPEDNGIEEVLAVDEAVSVV
LKKFEDVFTWPETLPPRRSIEHHIYLKQGTDPVNVRPYRYGYQQKAEMERLVEEMLSSGVIRPSNSPYSSPVLLVQKKDGSWRFCVDYRV
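
The protein sequence above can be checked against the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard genetic code (the record does not name the translation table it used). The helper name translate is hypothetical, and the two short inputs are the first and last codons copied from the CDS listed in this record.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering (assumed, not stated by the record).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


cds_start = "ATGAATGCACTGTTTCCGTGG"  # first seven codons of the CDS above
cds_end = "GATTATAGGGTCTAG"          # last five codons, including the stop codon
print(translate(cds_start), translate(cds_end))
```

Applied to the full CDS, the same function should reproduce the protein sequence listed above, provided the standard code is indeed the table that was used.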